chapter 3 [ B ] Flashcards

1
Q

horizontal fragmentation is defined by a selection operation on the _______ of a database schema.

A

owner relations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

A horizontal fragment Ri of relation R consists of all the tuples of R that satisfy a minterm predicate mi.

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

given a set of minterm predicates M, how many horizontal fragments of relation R are there?

A

same number as the minterm predicates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

set of horizontal fragments is aka _______ ?

A

minterm fragments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

An important aspect of simple predicates is their _____ and ______ ?

A

completeness and their minimality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Explain Completeness of predicates?

A

a set of predicates is considered complete if it can address all access patterns / queries under which tuples from the fragmented relations may need to be accessed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is The reason completeness is a desired?

A
  • Fragments are logically uniform
  • Fragments are statistically homogeneous in the way applications access them.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

______ results in a balanced workload across all the fragments.

A

Completness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Explain minimality.

A

means that we only include the essential predicates that impact how we break down a dataset into fragments.

  • if a predicate causes a fragment f to be further fragmented into, say, fi and fj, then there should be at least one application that accesses fi and fj differently.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

If all the predicates of a set Pr are relevant, Pr is minimal

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

_______ is An iterative algorithm that would generate a complete and minimal set of predicates Pr’ given a set of simple predicates Pr.

A

COM-MIN

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Write down COM-MIN algorithm.

A
  • Input: Set of simple predicates Pr
  • Output: Complete & minimal set of predicates Pr’
  1. Initialize Pr’ as an empty set.
  2. For each simple predicate p in Pr:
    a. Add p to Pr’.
  3. For each pair of distinct minterm predicates mi, mj in Pr’a. Check if mi and mj are identical in their
    definitions except for one simple predicate pib. If pi is relevant, Keep both mi and mj in Pr’
    c. If pi is not relevant, Remove the less relevant
    predicate
  4. Repeat steps 3 until no further changes can be made to Pr’.
  5. Output Pr’ as the complete and minimal set of predicates.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

reduction of minterms can be achieved by _____ ?

A

eliminating minterms that might be contradictory to a set of implications I.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Give an example of contradicting minterm elimination.

A
  • We have an attribute ‘att’ that can take on values from a set {value_1, value_2}.

Implication
- i1: If ‘att’ is ‘value_1’, then it can’t be ‘value_2’.
- i2: If ‘att’ is not ‘value_1’, then it must be ‘value_2’.

minterms
- m1: (att = value_1) AND (att = value_2)
- m2: ¬(att = value_1) AND (att = value_2)
- m3: (att = value_1) AND ¬(att = value_2)
- m4: ¬(att = value_1) AND ¬(att = value_2)

—> minterm predicates m1 and m4 are contradictory to the implications I

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Explain the PHORIZONTAL Algorithm?

A

Input
- Relation R: Table to be horizontally fragmented.
- Pr: The set of simple predicates

Steps

  1. Initialize empty set Fragments
  2. For each simple predicate p in Pra. Create a fragment Fi
    - consists of the rows that satisfy the simple
    predicate p.
    - p is “age > 30,” Fi will contain all rows where
    the age is greater than 30.
  3. Add Fi to the Fragments set
  4. Repeat steps 2-3 for all predicates in Pr
  5. Output Fragments
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Given these Scenarios:

  • Employee records are managed in two places.
  • One handles records of those with salaries less than or equal to $30,000.
  • The other handles records of those who earn more than $30,000.

Workout the :
- Simple Predicates
- Application of COM_MIN Algorithm &
- Minterm Predicates

A
  1. Simple Predicates
    - p1: Salary is less than or equal to $30,000.
    - p2: Salary is more than $30,000.
  2. Application of COM_MIN Algorithm
    - Initial Set: Pr = {p1, p2}.
    - Iteration 1 (i=1): Applying COM_MIN with i=1 results
    in Pr’ = {p1}.
    This is complete and minimal since p2 would not
    partition f1 ( fragment due to p1 )
  3. Minterm Predicates:
    - m1: Salary < $30,000.
    - m2: Salary > $30,000.
17
Q

Explain Derived Horizontal Fragmentation.

A
  • Derived Horizontal Fragmentation is a way of dividing data based on a selection operation specified on the owner of a link.
  • There’s a link (L), has an owner relation and a member relation
18
Q

Give Example of vertical fragmentation using Engineers’ Salaries.

A
  • Consider a link L1 where owner(L1) = PAY and member(L1) = EMP.
  • Group engineers into two categories based on salary: those making <= $30,000 and those making > $30,000.
  • RESULT : 2 tables with column ENO, ENAME, TITLE
19
Q

a vertical fragmentation of a relation R, produces fragments that contain a subset of R’s attributes as well as the primary key of R.

A

True

20
Q

What is meant by an “optimal” fragmentation

A

Reducing the overall computational load since it :

  • Helps in handling user queries with smaller relations, leading to fewer page accesses.
  • Identifying and placing the most “active” sub- relations in a faster memory subsystem can enhance performance.
21
Q

What are the heuristic approaches [ rule of thumb ] for Vertical fragmentation?

A
  1. Grouping
  2. Splitting:
21
Q

Explain the Grouping heuristic approach for the vertical fragmentation

A

Initially assigns each attribute to one fragment then Joins fragments until certain criteria are met.

21
Q

_______ approach fits more naturally within the top-down design methodology

A

Grouping

22
Q

Explain the Splitting heuristic approach for the vertical fragmentation.

A

Starts with a relation and decides partitioning based on application access behavior

22
Q

When Splitting, the solution is probably closer to the full relation than to a set of fragments

A

False
- in grouping new

23
Q

_______ generates non-overlapping fragments whereas _______ results in overlapping fragments

A

Splitting , grouping

23
Q

______ is a characteristic of vertical fragmentation that allows the reconstruction of the global relation.

A

replication of the global relation’s key in the fragments.

24
Q

vertical fragmentation is about dividing a big table into smaller pieces by grouping attributes together or by splitting them.

A

True

24
Q

vertical fragmentation is about dividing a big table into smaller pieces by grouping attributes together or by splitting them.

A

True

25
Q

The major information required for vertical fragmentation is related to ________ ?

A

applications

25
Q

What is the notion of togetherness?

A

We want to know how closely related or connected certain attributes of data are.

26
Q

What is Access Frequeny?

A
  • how often different parts of the data are queried
  • For each application and each piece of data, we create a measure called “attribute usage value.”
27
Q

What is the The Allocation Problem?

A
  • Assume we have fragments (f1…,fn) and Sites (s1…,sn) in our DDB network.
  • The problem is deciding the best way to spread these fragments across the network, especially when different applications are happening.
28
Q

The optimality can be defined with respect to ________ and _______ ?

A

Minimal cost and Performance

29
Q

What does The cost function consists of?

A

– the cost of storing each Fi at a site Sj,
– the cost of querying Fi at site Sj ,
– the cost of updating Fi at all sites where it is stored,

30
Q

What are the Two well-known performance metrics?

A
  • to minimize the response time
  • to maximize the system throughput