Week 4 - Distributed DB System Flashcards

1
Q

What does a query processor do?

A

The query processor takes a query written in a high-level language (SQL)
and generates low-level data manipulation commands for the distributed database.

2
Q

What is query execution methodology?

A

The steps involved in executing high-level (declarative) user queries.

3
Q

What is query optimization?

A

Determining the “best” execution plan for a query, so that the answer is returned as quickly as possible.

4
Q

What is the main objective of distributed query processing?

1) Form a high-level query on a distributed database for optimal execution.
2) Form multiple low-level queries that roll up into the needed dataset.
3) Define a process for query evaluation.

A

1) Form a high-level query on a distributed database for optimal execution.

5
Q

What is decomposition?

3 items

A

1) Check the sanity of the query
2) Eliminate redundancy
3) Normalization and algebraic conversion (rewriting the query in relational algebra)

6
Q

What is localization?

3 items

A

1) Applies only to distributed databases
2) Replace each global relation (table) with its corresponding fragments (distributed instances)
3) Use UNION to recombine the fragments
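
As a rough illustration of step 3, the sketch below rebuilds a relation from horizontal fragments by taking their union. The `EMP` data, the site names, and the `localize` helper are all hypothetical and exist only for this example; a real system does this inside the query processor.

```python
# Hypothetical horizontal fragments of a global EMP relation,
# each stored at a different site (names and data are illustrative only).
emp_site1 = [{"eno": 1, "name": "Ann", "dept": "Sales"}]
emp_site2 = [
    {"eno": 2, "name": "Bob", "dept": "IT"},
    {"eno": 3, "name": "Cy", "dept": "IT"},
]


def localize(fragments):
    """Rebuild the global relation as the UNION of its horizontal fragments."""
    seen = set()
    result = []
    for fragment in fragments:
        for row in fragment:
            key = tuple(sorted(row.items()))
            if key not in seen:  # UNION drops duplicate rows
                seen.add(key)
                result.append(row)
    return result


# A query on EMP is answered against the union of the fragments.
emp = localize([emp_site1, emp_site2])
it_staff = [row["name"] for row in emp if row["dept"] == "IT"]
print(it_staff)
```

UNION fits horizontal fragments because each fragment holds whole rows of the original table.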

7
Q

What makes up the total cost of a query execution plan?

3 items

A

1) CPU cost
2) I/O cost
3) Communication cost

Total cost = CPU cost + I/O cost + communication cost

8
Q

What makes up the total cost of a query execution plan?

3 items

A

1) CPU cost
2) I/O cost (getting data from disk)
3) Communication cost (across the network)

Total cost = CPU cost + I/O cost + communication cost
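
A minimal sketch of how the formula could be used to compare two candidate plans. The cost figures and the plan names are made up for illustration, and all three components are assumed to be expressed in the same unit.

```python
def total_cost(cpu_cost, io_cost, comm_cost):
    """Total cost = CPU cost + I/O cost + communication cost."""
    return cpu_cost + io_cost + comm_cost


# Hypothetical estimates for two plans, all in the same (assumed) cost unit.
plan_a = total_cost(cpu_cost=2.0, io_cost=5.0, comm_cost=40.0)
plan_b = total_cost(cpu_cost=3.0, io_cost=8.0, comm_cost=10.0)

# Pick the plan with the lowest total cost.
cheapest = min([("A", plan_a), ("B", plan_b)], key=lambda plan: plan[1])
print(cheapest)
```

With these numbers plan B wins even though it uses more CPU and I/O, because it sends far less data across the network.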

9
Q

What is the response time?

A
Response time = CPU time + I/O time + communication time
10
Q

What is the primary cost factor?

A

The size of intermediate relations, i.e. tables that are split across sites and have to be joined over the network.
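
To see why the size of an intermediate relation matters, the sketch below compares how much data two plans would ship across the network. The `orders` table, the 100-byte row size, and the year filter are assumptions made up for this example.

```python
ROW_BYTES = 100  # assumed average row size, for illustration only

# Hypothetical ORDERS fragment stored at a remote site.
orders = [
    {"order_id": i, "cust_id": i % 50, "year": 2020 + i % 5}
    for i in range(1000)
]

# Plan 1: ship the whole fragment, then filter at the query site.
shipped_plan1 = len(orders)

# Plan 2: filter at the remote site first, then ship only the matching rows.
filtered = [row for row in orders if row["year"] == 2024]
shipped_plan2 = len(filtered)

print(shipped_plan1 * ROW_BYTES, "bytes vs", shipped_plan2 * ROW_BYTES, "bytes")
```

Both plans return the same answer; the second one simply keeps the intermediate relation small before it crosses the network.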

11
Q

What does a query execution plan impact?

1) Data distribution strategy
2) Data conflict outcome
3) Fragmentation strategy
4) Query performance

A

4) Query performance

12
Q

Is the number of I/Os a good metric for distributed cost analysis?

A

No

13
Q

Is query separation a good plan for distributed systems?

A

Yes

14
Q

How does one create a horizontal fragmentation of a database table?

1) By placing the same table into multiple databases that are connected horizontally.
2) By placing table records into multiple smaller tables.
3) By placing table fields into multiple smaller tables.
4) By placing table attributes into multiple smaller tables.

A

2) By placing table records into multiple smaller tables.
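
A small sketch of the idea, assuming a made-up `customers` table split by a `region` column. The helper name `horizontal_fragment` and the data are hypothetical.

```python
# Hypothetical CUSTOMERS table; the data is illustrative only.
customers = [
    {"cust_id": 1, "name": "Ann", "region": "East"},
    {"cust_id": 2, "name": "Bob", "region": "West"},
    {"cust_id": 3, "name": "Cy", "region": "East"},
]


def horizontal_fragment(rows, predicate):
    """Keep whole rows that satisfy the predicate; every column is preserved."""
    return [row for row in rows if predicate(row)]


# Each fragment is a smaller table with the same columns but fewer records.
east = horizontal_fragment(customers, lambda r: r["region"] == "East")
west = horizontal_fragment(customers, lambda r: r["region"] == "West")
print(len(east), len(west))
```

Every record lands in exactly one fragment here, so the original table can be recovered by combining the fragments.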

15
Q

Which table key should be copied to each of the subtables created in a vertical fragmentation?

1) Primary key
2) Foreign key
3) Vertical key
4) Alternating key

A

1) Primary key
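
The sketch below shows why the primary key is copied into each subtable: it is what lets the pieces be joined back together. The `employees` table and the helper names `vertical_fragment` and `rejoin` are assumptions for illustration.

```python
# Hypothetical EMPLOYEES table with primary key emp_id (illustrative data).
employees = [
    {"emp_id": 1, "name": "Ann", "dept": "Sales", "salary": 5000},
    {"emp_id": 2, "name": "Bob", "dept": "IT", "salary": 6000},
]


def vertical_fragment(rows, key, columns):
    """Keep the primary key plus a subset of columns from every row."""
    return [{key: row[key], **{c: row[c] for c in columns}} for row in rows]


def rejoin(left, right, key):
    """Join two vertical fragments on the shared primary key."""
    right_by_key = {row[key]: row for row in right}
    return [{**row, **right_by_key[row[key]]} for row in left]


public = vertical_fragment(employees, "emp_id", ["name", "dept"])
private = vertical_fragment(employees, "emp_id", ["salary"])
rebuilt = rejoin(public, private, "emp_id")
print(rebuilt == employees)
```

Without `emp_id` in both fragments there would be no reliable way to match a salary to the right employee.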

16
Q

Which of the following areas is negatively affected when data is replicated across multiple sites?

1) Data redundancy
2) Data availability
3) Query evaluation
4) Data updates

A

4) Data updates
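
A toy sketch of why updates get more expensive under replication: a read can be served by one copy, while a write has to reach every copy to keep them consistent. The three sites and the `read`/`write` helpers are hypothetical, and failures and concurrency are ignored.

```python
# Hypothetical replicas of the same data item at three sites.
replicas = {
    "site1": {"balance": 100},
    "site2": {"balance": 100},
    "site3": {"balance": 100},
}


def read(key, site="site1"):
    """A read can be answered by any single replica."""
    return replicas[site][key]


def write(key, value):
    """A write must be applied at every replica; returns the number of writes."""
    writes = 0
    for store in replicas.values():
        store[key] = value
        writes += 1
    return writes


writes_needed = write("balance", 80)
print(writes_needed, read("balance", site="site3"))
```

The number of writes grows with the number of replicas, which is the cost that offsets the better availability.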

17
Q

What is the high-level query processing language used by database management systems?

1) SQL
2) HTML
3) XML
4) PL

A

1) SQL

18
Q

Which of the following cannot be a goal of query processing?

1) Maximizing solution space
2) Minimizing processing time
3) Maximizing throughput
4) Minimizing transfers among distributed sites

A

1) Maximizing solution space