Query Processing Flashcards by Megan Briers

What are the three basic steps in query processing?

<ol>
<li>Parser and translator</li>
<li>Optimiser</li>
<li>Evaluation</li>
</ol>

How well did you know this?

Not at all

Perfectly

What happens at the parser and translator stage in query processing?

<ul>
<li>Syntax is checked and referenced relations are verified</li>
<li>Query parsed and translated into relational algebra</li>
</ul>

How well did you know this?

Not at all

Perfectly

What happens at the optimiser stage in query processing?

<ul>
<li>Query converted to more efficient form</li>
<li>Query evaluation plan is created</li>
</ul>

How well did you know this?

Not at all

Perfectly

What happens in the evaluation stage in query processing?

Query evaluation plan is executed and the answer is returned

How well did you know this?

Not at all

Perfectly

What is contained within a query evaluation plan?

Evaluation primitives which are relational algebra operations annotated with execution guidance

How well did you know this?

Not at all

Perfectly

What are the cost of execution plans estimated using?

Statistical information/estimation for

<ul>
<li>the data dictionary (number of tuples etc)</li>
<li>intermediate results for complex expressions</li>
<li>evaluation estimation</li>
</ul>

How well did you know this?

Not at all

Perfectly

What is a theta join?

Cartestian product combined with a select statement

How well did you know this?

Not at all

Perfectly

What are the heuristics for getting a sensible equivalent SQL statement?

<ol>
<li>Perform selections as early as possible</li>
<li>Change conjunctive selection to nested selections</li>
<li>Perform projections as early as possible</li>
<li>Re-order joins to minimise intermediate data</li>
</ol>

How well did you know this?

Not at all

Perfectly

What is conjunctive selection?

Where two predicates are applied as an and statement, as opposed to applying one predicate then applying the next

How well did you know this?

Not at all

Perfectly

What are the two factors in estimating query sizes?

<ul>
<li>Size of relations</li>
<li>Distribution of values in tuples</li>
</ul>

How well did you know this?

Not at all

Perfectly

What statistics stored in the systems catalogue can be useful for estimating the query size?

<ul>
<li>Number of tuples in a relation</li>
<li>Size of a tuple in a relation</li>
<li>Number of distinct values appearing for an attribute</li>
</ul>

How well did you know this?

Not at all

Perfectly

How do we find the size of a cartesian product?

(number of tuples in q and r) times (the bytes per tuple for each table)

How well did you know this?

Not at all

Perfectly

How do we estimate the size of a selection?

(number of tuples)/(number of values they can taken)

How well did you know this?

Not at all

Perfectly

What does the estimation of size of selection assume?

Uniform distribution of values that the attributes do take

How well did you know this?

Not at all

Perfectly

What is the estimation of size of a natural join of two relations if they have no overlapping elements?

The size of their relations multipled together

How well did you know this?

Not at all

Perfectly

What is the estimation of the size of a natural join of two relations if the common elements is a key for the first relation?

Study These Flashcards

No more tuples than in the other relation

What is the estimation of the size of a natural join of two relations if none of the clever conditions apply (no overlap or overlap is key of a relation)?

Study These Flashcards

Number of tuples in one relation times number of tuples in the othr all divided by the number of values the attribute that is the common attribute can take

What is cost generally measured as when looking at queries?

Study These Flashcards

Total elapsed time for answering the query

What is taken into account when looking at access costs?

Study These Flashcards

<ul>
<li>Number of seeks * average seek cost</li>
<li>Number of blocks read * average block read cost</li>
<li>Number of blocks written * average block write cost</li>
</ul>

What is tT?

Study These Flashcards

Time to transfer one block

What is tS?

Study These Flashcards

Time for one seek

What is br?

Study These Flashcards

The number of blocks containing tuples of relation r

What is fr?

Study These Flashcards

The number of tuples of relation r that fit into one block (blocking factor)

What is hi?

Study These Flashcards

The height of the index

What is the cost estimate for direct selections where the attribute does not have an index on it?

The time to seek a block + time to transfer a block * number of blocks needed

What is the cost estimate for direct selections where the attribute does have a primary index on it and the attribute is a key?

(The height of the index plus one) times (the time to transfer + the time to seek a block)

What is the cost estimate for direct selections where the attribute does have a primary index on it but the attribute is not a key?

(the height of the index)*(time to transfer + time to seek) + time to seek + (number of blocks with matching records * time to transfer)

What is the cost estimate for direct selections where there is a secondary index on the attribute?

(height of the index + number of matching records) * (time to transfer + time to seek)

How are comparison selections performed when there is no index on A?

Each block is scanned and all records are tested by the condition

How are comparison selections performed when there is a primary index on A?

Scan sequentially from either the start ()

How are comparison selections performed when there is a secondary index on A?

Linear file scan might be cheaper but find the index pointing to a record where the first of the condition holds then scan the INDEX sequentially from there

What are evaluation primitives?

Relational algebra operations annotated with execution guidance

Query Processing Flashcards

(32 cards)