334 - 442: Multidimensional Data Flashcards

1
Q

What is a Quadtree?

A

A Quadtree is a tree-based data structure used for indexing two-dimensional spatial data by recursively dividing the space into quadrants.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which structure is used to index multi-dimensional spatial data in Quadtrees?
a) Grids
b) B+ Trees
c) K-D Trees
d) Quadrants

A

d) Quadrants

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does a Quadtree handle non-uniform spatial data distribution?

A

By recursively subdividing space into quadrants, adapting to the density of data in each region.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the tradeoff associated with the Curse of Dimensionality?

A

It becomes increasingly difficult to differentiate points in high-dimensional space due to diminishing contrast in distances.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a Quadtree used for in multi-dimensional data?

A

A Quadtree is used to partition a 2D space into quadrants recursively, helping to organize data efficiently for spatial queries like range searches.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the primary difference between a Quadtree and a PR (Point-Region) Quadtree?

A

In a standard Quadtree, partitioning depends on the inserted data points, whereas in a PR Quadtree, the space is partitioned into equal-sized regions, independent of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In a PR Quadtree, where are the data points stored?

Inner nodes
Leaf nodes
Root node
In all nodes

A

Leaf nodes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the Minimum Bounding Rectangle (MBR) in an R-tree?

A

An MBR is the smallest rectangle that encloses a group of spatial objects, used for indexing and searching in an R-tree.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the primary drawback of overlapping MBRs in R-trees?

A

Overlapping MBRs can increase the number of nodes that must be visited during a query, reducing efficiency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define MINDIST in the context of R-trees.

A

MINDIST is the minimum possible distance between a point and any point within an MBR, used for approximating the nearest neighbor search.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Which algorithm is primarily used for choosing the correct leaf node to insert data in an R-tree?

Breadth-First Search
ChooseLeaf
SplitNode
AdjustTree

A

ChooseLeaf

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How does the MINMAXDIST help during a nearest neighbor query in an R-tree?

A

MINMAXDIST provides an upper bound for the distance to the nearest neighbor, allowing the algorithm to prune subtrees that cannot contain closer points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a major advantage of the R* Tree over the standard R Tree?

A

The R* Tree improves node splitting by considering both area increase and the degree of overlap, leading to better query performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the condition for splitting a node in an R-tree?

When it has only one entry.
When it exceeds its capacity (M).
When its MBR does not overlap.
When its parent node is full.

A

When it exceeds its capacity (M).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the difference between a kd-tree and a Quadtree?

A

A kd-tree partitions space alternately along different dimensions, while a Quadtree partitions space uniformly into quadrants.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why is the original Guttman R-tree split algorithm considered inefficient for large M?

A

The algorithm has 2^𝑀−1 possible splits to evaluate, making it computationally expensive for large node capacities.

17
Q

What are the three pruning strategies used in nearest neighbor (NN) search in R-trees?

A

Downward pruning: Ignore MBRs with MINDIST larger than the MINMAXDIST of another MBR.

Ignore objects with distance to P larger than MINMAXDIST of an MBR.

Upward pruning: Ignore MBRs with MINDIST larger than the distance to a known object.

18
Q

In a VP Tree, how are the partitions created during the indexing process?

A

Select a pivot object.
Compute the median of all distances to the pivot.
Partition the data into two sets: points within the median distance and points outside.

19
Q

How does a Generalized Hyperplane Tree (GH Tree) partition objects?

A

Pick two pivot objects.
Partition data into two sets: closer to the first pivot or the second pivot.

20
Q

What are the components of an M-Tree node?

A

Pivot object
Radius of the enclosing ball
Distance to the parent node
Pointer to a subtree

21
Q

What are the three properties a distance function must satisfy in metric space indexing?

A

Symmetry
Non-negativity
Triangle inequality

22
Q

How does ball partitioning differ from other metric space indexing methods?

A

It partitions data based on whether points lie inside or outside a ball defined by a pivot.

23
Q

What does the pivot in a VP Tree represent?

A) The smallest object.
B) A randomly selected object.
C) The object at the median distance.
D) The furthest object.

A

C) The object at the median distance.

24
Q

What are the two methods to prioritize MBRs in NN search?

A) According to height and depth
B) According to MINMAXDIST and MINDIST
C) By alphabetical order
D) Using the number of contained objects

A

B) According to MINMAXDIST and MINDIST

25
Q

In downward pruning, when can an MBR be ignored?

A) When MINDIST is less than MINMAXDIST of another MBR.
B) When MINDIST is greater than MINMAXDIST of another MBR.
C) When the query point lies within the MBR.
D) When the MBR is at the root level.

A

B) When MINDIST is greater than MINMAXDIST of another MBR.