Data Mining and Database Environment Discussion
Part 1:
Discuss the possible data request scenarios in a distributed database environment.
Part 2: Respond to a post below
“Let’s discuss the data request scenarios one by one:
- Single request accessing a single remote database: This is a basic scenario in which a single SQL statement is directed to a single remote DBMS, also known as a remote request.
- Multiple requests accessing a single remote database: In this scenario, a single unit of work consists of multiple SQL statements directed to a single remote DBMS. The user can define blocks of the work using commit, rollback and save, but the remote DBMS managed the processing of this unit of work.
- Multiple requests accessing multiple remote databases: In this case, a unit of work may be composed of multiple SQL statements directed to multiple remote DBMSes. However, any one SQL statement may access only one of the remote DBMSes. Similar to the second scenario, the local user defines the units of work using commit, rollback, etc., but the remote DBMS to which the SQL statement was directed manages the unit of work’s processing. In this scenario, a two phase COMMIT must be used to coordinate COMMIT processing for the multiple locations.
- Multiple requests accessing any combination of multiple remote DBMSes: In this scenario, a unit of work now may consist of multiple SQL statements addressed to multiple remote DBMSes, and each SQL statement may address any combination of databases. Similar to the previous scenario, the local user defines the units of work using commit, rollback, etc., but the remote DBMS to which the SQL statement was directed manages the unit of work’s processing. A two phase COMMIT must be used to coordinate COMMIT processing for the multiple locations.”
Part 3:
- How does data mining work? Discuss the different phases in the data mining process.
- Describe the characteristics of predictive analytics. What is the impact of Big Data (social media) in predictive analytics?