L5 - Information-oriented System Integration Flashcards

1
Q

What is the goal of the lecture?

A

To explain an overview and requirements of information-oriented system integration and the main existing solutions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the three existing IoSI solutions for enterprise system integration?

A

Data Warehousing

Data Federation

Data Replication

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is metadata?

A

Metadata is data about data (information about the actual data).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a schema in databases?

A

A schema is a blueprint that defines the structure of database components such as tables, attributes, and data types.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a schema in XML?

A

A schema is a set of rules defining allowed elements, attributes, structures, and data types in an XML document.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are data types in databases?

A

Data types define the type of data that can be stored in a column, such as char, varchar, int, tinyint, float, decimal, and money.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is latency?

A

Latency is the time delay when data is sent from one source to a destination.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Information-oriented System Integration (IoSI)?

A

IoSI is an integration approach based on the exchange of simple data between systems, usually between databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why is IoSI needed?

A

Enterprises process large amounts of data from various sources, which may use different tools, technologies, and formats, making integration necessary.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the advantages of IoSI?

A

Easy to understand and develop

Standardized API support, such as ODBC

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the disadvantages of IoSI?

A

Deals only with data sharing

Does not handle business logic, states, or behaviors

Not sufficient for functional or behavior integration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the first requirement for IoSI?

A

Represent data in a canonical format, such as XML, to ensure interoperability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the second requirement for IoSI?

A

Maintain exchanged data using metadata to enable efficient interoperability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the third requirement for IoSI?

A

Facilitate data integration in real-time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the three data latency levels in IoSI?

A

Real-time integration

Near real-time integration

Batch processing (non-real-time)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a schema conflict in data integration?

A

A schema conflict occurs when tables containing the same concept are structured differently, e.g., one table includes an “email” attribute while another does not.

17
Q

What is a naming conflict in data integration?

A

A naming conflict occurs when different terms are used for the same metadata, such as “telephone” vs. “phone_number” vs. “TeleF”.

18
Q

What is a data representation conflict in data integration?

A

A data representation conflict occurs when different formats are used for the same data, such as “Donald Trump” vs. “D. Trump”.

19
Q

What is a precision conflict in data integration?

A

A precision conflict occurs when different data types are used for the same data, such as Double vs. Money vs. Float vs. Int.

20
Q

What is data warehousing?

A

A batch information integration technique that extracts, transforms, and loads data from multiple databases into a single repository.

21
Q

What is data federation?

A

A real-time information integration technique that allows querying disparate databases through a unified schema without copying data.

22
Q

What is data replication?

A

A near-real-time integration technique where data is copied between databases at specific intervals.

23
Q

What are the key features of data warehousing?

A

Extracts, transforms, and loads data (ETL).

Stores data in a single database.

Uses dimensional data models with fact and dimension tables.

24
Q

What are the advantages of data warehousing?

A

Standardized ETL services make it easy to implement and maintain.

Low cost and reliable as it integrates data into one place.

25
Q

What are the disadvantages of data warehousing?

A

Data freshness issues due to batch updates.

Data conflicts must be resolved.

Primarily designed for batch analysis.

26
Q

What are the key features of database federation?

A

Provides seamless access to multiple databases.

Uses a meta-database (reference database).

Queries are transformed and executed across multiple sources.

27
Q

What are the advantages of database federation?

A

Provides real-time access to original data sources.

Enables users to create integrated views of data.

28
Q

What are the disadvantages of database federation?

A

Requires resolving data conflicts.

Needs continuous synchronization and maintenance.

Requires separate tools for implementation.

29
Q

What are the key features of data replication?

A

Copies data between databases to ensure consistency.

Uses a master-slave database system.

Can be implemented with different replication modes.

30
Q

What are the advantages of data replication?

A

Low-cost integration method.

Simple to develop and implement.

Improves data availability and performance.

31
Q

What are the disadvantages of data replication?

A

Increases network overhead and storage requirements.

Requires high maintenance and synchronization.

Does not resolve schema conflicts.

32
Q

What is full replication?

A

Copies all data from the master to the slave databases, including new, updated, and existing data.

33
Q

What is key-based incremental replication?

A

Copies only data that has changed since the last replication by scanning keys or indexes.

34
Q

What is log-based incremental replication?

A

Copies only changed data by scanning the master database’s log files.

35
Q

What is snapshot replication?

A

Copies a snapshot of the master database at a specific point in time, without tracking changes.

36
Q

What is merge replication?

A

Merges multiple master databases into a single database.