Chapter 12 Flashcards
What are the challenges of physical distribution?
Addressing data across the globe, accessing data in different schemas, optimizing distributed queries
What are the challenges of logical distribution?
Maintaining consistency among redundant data, providing metadata to enable data localization, detecting and resolving duplicates, detecting and resolving data inconsistencies and conflicts
What is the major problem of integration?
Autonomy is the major problem of integration.
Technical heterogeneity:
Communication protocols (HTTP, SOAP, …); exchange formats (binary, text, XML, …); APIs (JDBC, ODBC, …); query mechanisms; query languages
Mention six types of heterogeneity:
Data model heterogeneity, technical heterogeneity, syntactic heterogeneity, structural heterogeneity, schematic heterogeneity, semantic heterogeneity
What are the two architectural approaches to achieving data integration?
- materialized integration (replication, data warehousing)
- virtual integration (federated DBMS, multi-database systems)
Distribution controlled by DBMS (distributed transaction processing)
- a single SELECT statement can access tables in different data sources
- virtual integration
- degrees of transparency
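To make the "single SELECT statement" point concrete, here is a minimal sketch. It assumes two SQLite in-memory databases stand in for two separate data sources; the schema name `sales` and the tables `customers` and `orders` are hypothetical and do not come from the chapter. A real federated DBMS would reach remote, heterogeneous sources, so this only illustrates the idea of one query spanning several sources.

```python
import sqlite3

# Assumption: two in-memory SQLite databases play the role of two data sources.
conn = sqlite3.connect(":memory:")                   # first source, schema "main"
conn.execute("ATTACH DATABASE ':memory:' AS sales")  # second source, schema "sales"

# Hypothetical tables, one per source.
conn.execute("CREATE TABLE main.customers (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("CREATE TABLE sales.orders (customer_id INTEGER, amount REAL)")
conn.executemany("INSERT INTO main.customers VALUES (?, ?)",
                 [(1, "Ada"), (2, "Bob")])
conn.executemany("INSERT INTO sales.orders VALUES (?, ?)",
                 [(1, 10.0), (1, 5.0), (2, 7.5)])

# One SELECT statement joins tables that live in different sources.
rows = conn.execute(
    """
    SELECT c.name, SUM(o.amount)
    FROM main.customers AS c
    JOIN sales.orders AS o ON o.customer_id = c.id
    GROUP BY c.name
    ORDER BY c.name
    """
).fetchall()
print(rows)
```

The query still names the schema each table belongs to (`main.`, `sales.`), so the user can see where the data lives. Hiding that detail would correspond to a higher degree of transparency.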
Materialized and virtual integration
Materialized: ETL (extract, transform, load)
Virtual: wrappers
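The contrast between ETL and wrappers can be sketched as follows. This is an illustration under assumptions, not a reference design: the `person` table, the `Wrapper` class, and the functions `extract`, `transform`, and `mediated_query` are all hypothetical names, and SQLite in-memory databases stand in for autonomous sources.

```python
import sqlite3

def make_source(rows):
    """Create a hypothetical autonomous source holding a 'person' table."""
    src = sqlite3.connect(":memory:")
    src.execute("CREATE TABLE person (full_name TEXT, city TEXT)")
    src.executemany("INSERT INTO person VALUES (?, ?)", rows)
    src.commit()
    return src

source_a = make_source([("Ada Lovelace", "london")])
source_b = make_source([("Alan Turing", "manchester")])

def extract(src):
    return src.execute("SELECT full_name, city FROM person").fetchall()

def transform(rows):
    # Normalize city spelling so data from all sources looks uniform.
    return [(name, city.title()) for name, city in rows]

# Materialized integration: ETL copies the data into a warehouse once.
warehouse = sqlite3.connect(":memory:")
warehouse.execute("CREATE TABLE person (full_name TEXT, city TEXT)")
for src in (source_a, source_b):
    warehouse.executemany("INSERT INTO person VALUES (?, ?)",
                          transform(extract(src)))  # the load step
warehouse.commit()

# Virtual integration: a wrapper translates each source at query time.
class Wrapper:
    def __init__(self, src):
        self.src = src

    def people(self):
        return transform(extract(self.src))

def mediated_query(wrappers):
    result = []
    for w in wrappers:
        result.extend(w.people())
    return sorted(result)

wrappers = [Wrapper(source_a), Wrapper(source_b)]

# A source changes after the warehouse was loaded.
source_a.execute("INSERT INTO person VALUES ('Grace Hopper', 'new york')")

materialized = sorted(
    warehouse.execute("SELECT full_name, city FROM person").fetchall())
virtual = mediated_query(wrappers)
print(len(materialized), len(virtual))
```

The materialized copy stays as it was at load time until the next ETL run, while the wrapper-based query sees the new row because it reads the sources on every request.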