508 - 533: Inverted Index, Top-k and Skyline Queries Flashcards

1
Q

In an inverted index, what does the posting list store?

a) Hash values of terms
b) Document IDs containing the term
c) Scores of terms in documents
d) Aggregated top-k scores

A

b) Document IDs containing the term

2
Q

What is the goal of a Top-k Query?

a) Retrieve all documents with the highest scores
b) Compute the k objects with the highest aggregated scores
c) Find the k smallest indexed objects
d) List all objects in sorted order

A

b) Compute the k objects with the highest aggregated scores

3
Q

What does Fagin’s Algorithm do first during query processing?

a) Perform random access to all lists
b) Aggregate scores directly
c) Read sequentially (sorted access) from every list until k distinct objects have been seen in all lists
d) Skip lists with low scores

A

c) Read sequentially (sorted access) from every list until k distinct objects have been seen in all lists
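
To make the two phases concrete, here is a minimal Python sketch of Fagin's Algorithm. It is an illustration under stated assumptions, not a reference implementation: the function name `fagin_top_k`, the in-memory list layout, the integer scores, and the rule that an object missing from a list scores 0 are all hypothetical choices made for this example.

```python
import heapq

def fagin_top_k(lists, k, agg=sum):
    """Sketch of Fagin's Algorithm.

    lists: one list per attribute of (object, score) pairs, sorted by descending score.
    agg:   monotone aggregation function over an iterable of scores.
    """
    lookup = [dict(lst) for lst in lists]   # simulates random access by object
    seen_in = {}                            # object -> indexes of lists where it was seen
    max_depth = max(len(lst) for lst in lists)

    # Phase 1: sorted access in parallel until k objects were seen in every list.
    for depth in range(max_depth):
        for i, lst in enumerate(lists):
            if depth < len(lst):
                obj, _ = lst[depth]
                seen_in.setdefault(obj, set()).add(i)
        complete = sum(1 for where in seen_in.values() if len(where) == len(lists))
        if complete >= k:
            break

    # Phase 2: random access to fill in the full score of every seen object.
    # Assumption: an object missing from a list scores 0 there.
    totals = {obj: agg(lk.get(obj, 0) for lk in lookup) for obj in seen_in}
    return heapq.nlargest(k, totals.items(), key=lambda kv: kv[1])

list_1 = [("a", 90), ("b", 80), ("c", 70), ("d", 20)]
list_2 = [("b", 90), ("c", 80), ("a", 30), ("d", 10)]
fa_result = fagin_top_k([list_1, list_2], k=2)
print(fa_result)  # [('b', 170), ('c', 150)]
```

In this toy data the sorted-access phase stops at depth 3, when a, b and c have each been seen in both lists; only then are the aggregated scores computed and ranked.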

4
Q

What defines a Skyline in a dataset?

a) The most frequent data points
b) Points dominated by all others
c) Points not dominated by any other point
d) The first k highest-scoring points

A

c) Points not dominated by any other point

5
Q

What is the key difference between Fagin’s Algorithm and the Threshold Algorithm?

A

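Fagin’s Algorithm performs sorted access until k objects have been seen in every list, and only then uses random access to fill in the missing scores. The Threshold Algorithm looks up the full score of each object as soon as it is first seen and maintains a threshold (τ), the aggregate of the scores last read under sorted access in each list. It terminates as soon as k objects score at least τ, because no unseen object can score higher, so it usually stops earlier.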
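
The early stop can be sketched in Python as follows. This is a hedged illustration only: `threshold_top_k`, the list layout, the integer scores, and the convention that a missing or exhausted list contributes 0 are assumptions introduced for this example, and the aggregation function is assumed to be monotone with non-negative scores.

```python
import heapq

def threshold_top_k(lists, k, agg=sum):
    """Sketch of the Threshold Algorithm.

    lists: one list per attribute of (object, score) pairs, sorted by descending score.
    Returns (top-k list, number of sorted-access rounds performed).
    """
    lookup = [dict(lst) for lst in lists]   # simulates random access by object
    totals = {}                             # object -> full aggregated score
    max_depth = max(len(lst) for lst in lists)

    for depth in range(max_depth):
        last_seen = []
        for lst in lists:
            if depth < len(lst):
                obj, score = lst[depth]
                last_seen.append(score)
                if obj not in totals:
                    # Random access: resolve the full score immediately.
                    totals[obj] = agg(lk.get(obj, 0) for lk in lookup)
            else:
                last_seen.append(0)         # assumption: exhausted list contributes 0

        tau = agg(last_seen)                # best score any unseen object could reach
        top = heapq.nlargest(k, totals.items(), key=lambda kv: kv[1])
        if len(top) == k and top[-1][1] >= tau:
            return top, depth + 1

    return heapq.nlargest(k, totals.items(), key=lambda kv: kv[1]), max_depth

list_1 = [("a", 100), ("b", 50), ("c", 40), ("d", 10)]
list_2 = [("d", 60), ("c", 55), ("b", 50), ("a", 45)]
ta_result, ta_rounds = threshold_top_k([list_1, list_2], k=1)
print(ta_result, ta_rounds)  # [('a', 145)] 2
```

On this data τ drops from 160 to 105 after the second round, which is already below the score of a (145), so the loop stops. Waiting until one object has been seen in both lists, as Fagin's stopping rule requires, would take three rounds here.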

6
Q

Explain Dominance in Skyline Queries.

A

A point p dominates another point q if p is at least as good as q in all dimensions and strictly better in at least one dimension.
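
The definition translates almost directly into code. The sketch below assumes that smaller values are better in every dimension (for example price and distance); the helper names `dominates` and `skyline` and the sample data are made up for illustration, and the quadratic scan is a naive baseline rather than an optimized skyline algorithm.

```python
def dominates(p, q):
    """True if p is at least as good as q in all dimensions and strictly
    better in at least one (assumption: smaller is better)."""
    return all(a <= b for a, b in zip(p, q)) and any(a < b for a, b in zip(p, q))

def skyline(points):
    """Naive O(n^2) skyline: keep the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

# Hypothetical hotels as (price, distance to beach).
hotels = [(50, 8), (80, 2), (60, 5), (90, 6), (70, 9)]
print(skyline(hotels))  # [(50, 8), (80, 2), (60, 5)]
```

Here (90, 6) is dominated by (60, 5) and (70, 9) is dominated by (50, 8), so neither belongs to the skyline.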

7
Q

How does an inverted index improve search efficiency?

A

By mapping each term to the list of documents that contain it, so a query only has to read the posting lists of its own terms instead of scanning every document.
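
A minimal Python sketch of the idea, assuming whitespace tokenization and an in-memory dictionary; `build_inverted_index`, `search_all` and the sample documents are hypothetical names and data chosen for this example.

```python
from collections import defaultdict

def build_inverted_index(docs):
    """Map each term to a sorted posting list of document IDs."""
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}

def search_all(index, query):
    """Return the IDs of documents that contain every query term (AND query)."""
    terms = query.lower().split()
    if not terms:
        return []
    result = set(index.get(terms[0], []))
    for term in terms[1:]:
        result &= set(index.get(term, []))   # only posting lists are touched
    return sorted(result)

docs = {
    1: "top-k queries rank objects",
    2: "skyline queries find non-dominated objects",
    3: "inverted index maps terms to documents",
}
index = build_inverted_index(docs)
print(index["queries"])                      # [1, 2]
print(search_all(index, "queries objects"))  # [1, 2]
```

The query never looks at document 3, because neither of its terms has that document in its posting list.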

8
Q

What is the purpose of the k-skyband in Skyline Queries?

A

It consists of the points that are dominated by at most k−1 other points. It generalizes the skyline, which is the special case k = 1.
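
As a rough sketch, the k-skyband can be computed by counting dominators. The code assumes smaller values are better in every dimension; `dominates`, `k_skyband` and the sample points are illustrative names and data, and the quadratic scan is only a naive baseline.

```python
def dominates(p, q):
    """p dominates q (assumption: smaller is better in every dimension)."""
    return all(a <= b for a, b in zip(p, q)) and any(a < b for a, b in zip(p, q))

def k_skyband(points, k):
    """Keep every point that is dominated by fewer than k other points."""
    band = []
    for p in points:
        dominators = sum(1 for q in points if dominates(q, p))
        if dominators < k:
            band.append(p)
    return band

# Hypothetical hotels as (price, distance to beach).
hotels = [(50, 8), (80, 2), (60, 5), (90, 6), (55, 9)]
print(k_skyband(hotels, 1))  # [(50, 8), (80, 2), (60, 5)]  -> the skyline
print(k_skyband(hotels, 2))  # [(50, 8), (80, 2), (60, 5), (55, 9)]
```

With k = 2 the point (55, 9) joins the result because only (50, 8) dominates it, while (90, 6) stays out because two points dominate it.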

9
Q

Why is monotonicity important in Top-k Query processing?

A

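It ensures that if an object's individual scores are lower than or equal to another object's in all lists, its aggregated score cannot exceed the other’s. This is what allows a top-k algorithm to bound the score of unseen objects and stop early.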
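
The property can be spot-checked by brute force on a small grid of score vectors. This is only a sanity check on a finite sample, not a proof; the helper name `is_monotone_on` and the grid values are assumptions made for this sketch.

```python
from itertools import product

def is_monotone_on(agg, grid, dims):
    """Check on a finite grid: x <= y in every dimension implies agg(x) <= agg(y)."""
    vectors = list(product(grid, repeat=dims))
    for x in vectors:
        for y in vectors:
            if all(a <= b for a, b in zip(x, y)) and agg(x) > agg(y):
                return False
    return True

grid = [0, 1, 2, 3]
print(is_monotone_on(sum, grid, 2))                    # True
print(is_monotone_on(min, grid, 2))                    # True
print(is_monotone_on(lambda v: v[0] - v[1], grid, 2))  # False
```

The difference `v[0] - v[1]` fails because raising the second score lowers the result, so a bound derived from the last scores read would no longer be safe.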

10
Q

Describe the role of aggregation functions in Top-k Queries.

A

They combine individual scores from index lists to compute an overall score for each object, used to rank results.
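
A small Python sketch of how the choice of aggregation function changes the ranking. The objects, scores, weights, and the helper names `rank` and `weighted` are invented for illustration; this naive version scores every object and does not use any early termination.

```python
def rank(scores, agg):
    """Order objects by their aggregated score, best first."""
    return sorted(scores, key=lambda obj: agg(scores[obj]), reverse=True)

def weighted(weights):
    """Build a weighted-sum aggregation function from a list of weights."""
    return lambda s: sum(w * x for w, x in zip(weights, s))

# Hypothetical per-list scores for three objects.
scores = {"a": [95, 30], "b": [80, 90], "c": [70, 80]}

print(rank(scores, sum))               # ['b', 'c', 'a']
print(rank(scores, max))               # ['a', 'b', 'c']
print(rank(scores, weighted([3, 1])))  # ['b', 'a', 'c']
```

Summing rewards objects that score well in both lists, taking the maximum rewards a single strong score, and the weighted sum sits in between by emphasizing the first list.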
