All Content Flashcards

1
Q

Enterprise Information Systems focus on integration. Which are the 3 types of integration?

A
  • Data/Information Integration
  • Enterprise Application Integration
  • Business-to-business Integration
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which is the goal of integration?

A

Provide a homogeneous, integrated view on multiple, distributed, autonomous and heterogeneous systems, components, or data sources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which are the 3 fundamental challenges of integration?

A
  • Distribution
  • Autonomy
  • Heterogeneity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which are the 2 types of distribution?

A
  • Logical

- Physical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which are the 4 types of autonomy?

A
  • Design
  • Interface
  • Access
  • Judicial
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Which are the 2 main approaches to resolve heterogeneity?

A
  • When needed

- Enforce by establishing standards

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Which are the 6 types of heterogeneity?

A
  • Technical
  • Data Model
  • Syntactic
  • Structural
  • Schematic
  • Semantic
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Which are the 2 architectural approaches to achieve data integration?

A
  • Materialized integration: replication, data warehousing

- Virtual integration: federated DBMS, multi-DB systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which are the 4 characteristics of DDBMS?

A
  • Distribution by design
  • 4-layer schema architecture
  • Tightly couple
  • Heterogeneity not an issue
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Which are the 4 characteristics of FDBMS?

A
  • Provides homogeneous and integrated view of data from multiple sources (federated conceptual schema)
  • “On-Demand” data integration
  • Requires: wrapper/mediator technology, data and schema integration mechanisms
  • Distribution is given
  • Preserves high degree of autonomy
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Which are the 2 alternatives for federated schema creation?

A
  • bottom-up: schema integration

- top-down: schema design, schema mapping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Which are the 5 layers of the Schema Reference Architecture?

A
  • external schema
  • federated schema
  • export schema
  • component schema
  • local schema
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Mediator-based IS are a generalization of DDBMS and FDBMS. Which are the 2 main components?

A
  • wrappers

- mediator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Which are the 4 tasks of wrappers?

A
  • encapsulate data source for uniform access
  • help overcome heterogeneity of data sources
  • support global query evaluation and optimization
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Which are the 3 main characteristics of Garlic?

A
  • Garlic and wrappers cooperator for query processing
  • Extensibility
  • Wrapper evolution (until provide full support of data source functionality)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Which are the 4 wrapper services of Garlic?

A
  • modelling data as object collection
  • method calls
  • query planning
  • query execution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Garlic optimizer builds query plan using bottom-up approach. Which are the 4 steps?

A
  1. create plan fragments for single collections
  2. plan the join processing for local joins
  3. plan the join processing across multiple sources
  4. finalize the plan (compensations)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Which are the 3 methods that wrappers provide to Garlic work requests?

A
  • plan_access( )
  • plan_join( )
  • plan_bind( )
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Which are the 4 characteristics of MDBMS?

A
  • loosely coupled
  • no global schema
  • multi-database language
  • only location and physical distribution transparency
  • data model heterogeneity handled by local data source or multi-db language
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the limitation of SQL to solve most forms of schematic heterogeneity?

A

Views can be used under certain specific circumstances. Otherwise the maintenance is expensive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Which are the 3 characteristics of Schematic Query Languages to deal with schematic heterogeneity?

A
  • Transform data to metadata (and v.v.)
  • Extend relational model to allow dynamic schemas
  • mechanism to access different DBs in a single query
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Which is the main idea of SchemaSQL?

A

Extended FROM clause: ->, db->, db::rel->, db::rel, db::rel.attr

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Which are the 2 points of criticism over SchemaSQL?

A
  • merge not well-defined for all source relations

- semantics of SELECT differs depending on context

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Which is the main idead of FIRA/FISQL?

A

Provide federated relational model with additional algebra operators.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Which are the 6 new operator of FIRA/FISQL?

A
  • Drop-projection
  • Down
  • Attribute Dereference
  • Generalized Union
  • Transpose
  • Partition operator
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Which are the 2 motivations for replication?

A
  • increase availability

- increase performance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Which are the 3 challenges for replication?

A
  • transaction processing
  • enforcing consistency
  • detecting and resolving conflicts
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Which are the 2 types of replication?

A
  • eager replication

- lazy replication

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Which are the advantages and disadvantages of eager replication?

A
\+ guarantees globally serializable execution
\+ avoid inconsistencies
- deadlocks
- lack of scalability
- cannot be used if nodes offline
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Which are the advantages and disadvantages of lazy replication?

A
\+ minimal update overhead
\+ works if nodes offline
- stale data
- conflicting updates may cause inconsistencies
- potential for system delusion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Which are the 2 ways of bidirectional replication?

A
  • master/slave

- peer-to-peer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Which are the 2 replication methods?

A
  • Target table refresh

- Change-capture

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Which are the 4 types changes can be capture for replication?

A
  • continuous
  • interval-based
  • DB events
  • one-time snaptshot
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Which are the 2 realization approaches for capturing data?

A
  • analyze DB log files

- use DB triggers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Which are the 3 main data that should be captured?

A
  • type of change (insert, update, delete)
  • new values and item ID
  • (optional) before-image information
36
Q

Which are the 2 ways that captured change-data can be provided?

A
  • staging table at source location

- message-oriented middleware (message queue)

37
Q

How are replication conflicts detected?

A

Based on timestamps. Object carry most recent update timestamp. Update carries new value and old object timestamp. Provoke wait situations in eager replication. Reconciliation in lazy replication.

38
Q

Which are the 2 possible ownership of replicas?

A
  • Group

- Master (results in waits in lazy replication)

39
Q

Which are the 3 approach for replication conflicts reconciliation?

A
  • automatically based on rules
  • manually (conflicts table or queue)
  • non-transactional schemes (abandon serializability for convergence, tolerate lost updates)
40
Q

What is the main goal of data warehousing?

A
  • materialized integration of data from numerous heterogeneous data sources to enable data analysis
41
Q

Which are the 2 main components of OLAP data?

A
  • Facts

- Dimensions

42
Q

Which are the 2 ways of implementing OLAP schema in a RDBMS?

A
  • Star structure

- Snowflake structure

43
Q

Which are the __ components of Data Warehouse Architecture?

A
  • Data warehouse manager
  • Monitor
  • Extractor
  • Transformer
  • Loader
  • Staging area (DB)
  • Main warehouse (DB)
44
Q

Which are the 5 approaches for monitoring and capturing change data in data warehousing?

A
  • log-based
  • trigger-based
  • using replication middleware
  • audit columns
  • snapshot differentials
45
Q

Which are 5 examples of transformation performer by the Transformer in data warehousing?

A
  • (de-) normalized schema
  • generation of global IDs
  • data type conversion
  • unit standardization
  • derived values
46
Q

What are short-term queues used for?

A

Load control: store requests in temporary queue to avoid creating new processes.

47
Q

What are persistent queues in TP-Monitors used for?

A
  • End-user control (redelivery of output until user explicitly ack)
  • Recoverable data entry
  • Multi-transactional requests (single request processed in multiple transactions)
48
Q

Which are the 2 advantages of Asynchronous Transaction Processing?

A
  • optimize for throughput

- avoid resource contention of single-transaction

49
Q

Which are the 3 advantages of Stratified Transactions over single, global TA?

A
  • early commit of individual strata (less contention, higher throughput)
  • reduced observed end user response time (commit of root stratum)
  • if all TAs in a stratum execute on same node, there is no network traffic for 2PC and TA-Manager does not need to support external coordinator.
50
Q

Which are the 2 requirements for Stratified Transactions?

A
  • all resources of TA must be recoverable

- resource managers able to participate in 2PC

51
Q

Which are 3 client variations of Stratified Transactions?

A
  • non-transactional client
  • one-way messaging
  • multiple clients submitting requests (one reply queue per client)
52
Q

In the case of a non-transactional client in Stratified Transactions, how can the fault-tolerance be implemented?

A
  • guarantee that message is persistently stored when enqueue message operation return to client
  • provide message_id about the last request submitted, last reply received when client reconnects after failure
53
Q

Which are the 5 characteristics of Message Queuing Systems?

A
  • evolved from queue systems in TP-monitors
  • message-oriented
  • loosely-coupled
  • provide persistent message queues
  • transactional
54
Q

Why are Message Queue Systems loosely coupled?

A
  • client is not blocked during request processing
  • server can chose processing time and release resources/locks early
  • components do not need to be active at the same time
55
Q

Which are 3 variations of interactions with MQS?

A
  • shared queues (multiple consumers per queue, but particular message has single consumer)
  • additional message properties (priority, time-out)
  • enhanced flexibility for consumers (beyond FIFO)
56
Q

Which are 2 messaging delivery modes in MQS?

A
  • persistent

- non-persistent

57
Q

How is the message order in the messaging model of MQS?

A

Message sent by single session are received in order. Sending order affected by priority. Sending order guaranteed only within same delivery mode. Sending order affected by transaction order of message.

58
Q

How are transactions provied by MQS?

A

Transactional sessions. Messages are acked at TA commit. Queues recovered automatically at rollback.

59
Q

How does message acknowledgement works in MQS?

A

Messages have to be acked after received and then they are removed from queue.

60
Q

Which are the 3 ways non-transactional sessions can acknowledge messages in MQS?

A
  • lazy ack
  • auto ack (after successful receive)
  • client ack (explicit)
61
Q

How do message selectors work in MQS?

A

Queue based on logical conditions that refers header and properties fields of messages.

62
Q

What is the main idea of Message Routing?

A

Message router use scripts to separate routing+transformation logic from the applications.

63
Q

Which are the 3 categories in which a subscription can be classified in a Publish/Subscribe system?

A
  • static (deployment) vs. dynamic (run-time)
  • type-based vs. parameter-based (boolean condition)
  • non-durable vs. durable
64
Q

Which are the 5 main aspects of Message Brokering?

A
  • Message transformations
  • Message routing and transport
  • Rules-based processing and distribution of message based on message fields
  • Message repository
  • Message warehouse
65
Q

What is a message repository in Message Brokering used for?

A
  • definition of message structures
  • mapping rules
  • transformation functions
  • routing scripts
  • subscription requests
66
Q

Which are the 3 topologies of a Message Broker?

A
  • hub-and-spoke
  • multi-hub
  • federation
67
Q

Which are the 2 roles of DBMS in a messaging world?

A
  • persistence manager

- provided advanced DBMS capabilities to achieve synergy (queue messaging data)

68
Q

Which are the 3 dimensions of Workflow?

A
  • what (Business Process)
  • who (Organization)
  • with (Business Function Implementation)
69
Q

Which are the 3 steps of Business Process Re-Engineering?

A
  1. Discovery
  2. Optimization (static)
  3. Analysis (instrumentation, analytic simulation, quantitative aspects)
70
Q

What is defined by atomic spheres?

A

A set of activities where either all activities commit or none.

71
Q

What is defined by compensation spheres?

A

Set of activities that must complete as a whole or semantically undone. Each activity in the sphere is associated with a compensation action.

72
Q

What is the Phoenix Behavior?

A

Forward recovery. Workflow is made persistent and recoverable.

73
Q

What is the motivation for Web Services composition?

A
  • Interact with business partners
  • New web services out of existing ones
  • Composition can be iterated
74
Q

Which are the 3 main elements of a Web Service Composition Middleware?

A
  • composition model and language
  • development environment
  • run-time environment
75
Q

What is the difference between middleware for composition and coordination of Web Services?

A
  • Composition: focus on implementation of operation in WS. Internal and private. Used to automate execution of composed WS.
  • Coordination: focus on conversation protocols. Public and standardized protocols.
76
Q

Which are the 2 types of process that can be defined using BPEL?

A
  • executable process (composition)

- abstract process (coordination)

77
Q

What is defined by the business process in BPEL?

A
  • execution order of web services
  • data shared between web services
  • correlation information
  • partners involved in business process and their interfaces
  • exception handling
78
Q

What is the BPEL Component Model?

A

Components in BPEL are Web Services described using WSDL.

79
Q

Which are the basic activities in BPEL?

A
  • Invoke (async or sync)
  • Receive (wait for message)
  • Reply (sync response to receive)
80
Q

Which are the structured activities defined by BPEL?

A
  • Sequence
  • Switch
  • While / For each
  • Pick
  • Flow (parallel execution)
81
Q

Which BPEL activities can start a process?

A
  • receive

- pick

82
Q

Which are the 5 phases of Information Integration?

A
  • Analysis
  • Discovery
  • Planning
  • Deployment
  • Run-time
83
Q

Which are the 2 approaches for Information Integration?

A
  • bottom-up (well-known set of schemas)

- top-down (schemas not know, changes, create based on application requirements)

84
Q

Which are the 3 phases of CUPID?

A
  1. Linguistic matching
  2. Structure matching
  3. Creation of mappings/matches
85
Q

Which are the 3 phases of CUPID Linguistic Matching?

A
  1. Normalization
  2. Categorization
  3. Comparation
86
Q

What is the trade-off of CUPID in terms of similarity matches?

A
  • confidence of false positives reduces

- confidence of false negative increased

87
Q

Which are the 4 properties of an integrated schema?

A
  • complete
  • minimal
  • correct
  • intelligible