Database Flashcards

Question 1

Q

What is the main purpose of the vector database in the movie application?

a) To store movie reviews
b) To support searching by data embedded as a vector
c) To store movie scripts
d) To manage user accounts

Answer

A

b) To support searching by data embedded as a vector

Question 2

Q

Which of the following is NOT a functional requirement of the movie application?

a) Look up a movie
b) Add/update an image for a movie
c) Stream movies in real time
d) Suggest similar movies based on the current movie

Answer

A

c) Stream movies in real time

Question 3

Q

Why was Apache Cassandra chosen as the database for this application?

a) It supports distributed storage and scalability
b) It is the only database available on Astra
c) It has built-in streaming capabilities
d) It does not require indexing

Answer

A

a) It supports distributed storage and scalability

Question 4

Q

Which similarity-search algorithm is commonly used in vector searches?

a) Binary Search Algorithm
b) K-Nearest Neighbor (KNN)
c) Merge Sort
d) Dijkstra’s Algorithm

Answer

A

b) K-Nearest Neighbor (KNN)
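
As a rough illustration of the idea behind this answer, the sketch below runs a brute-force k-nearest-neighbor search using cosine similarity. The movie names and 3-dimensional vectors are made up for the example, and the helper names `cosine_similarity` and `k_nearest` are hypothetical; a real vector database would typically use an index rather than comparing against every row.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def k_nearest(query, vectors, k):
    """Return the k names whose vectors are most similar to the query."""
    ranked = sorted(
        vectors,
        key=lambda name: cosine_similarity(query, vectors[name]),
        reverse=True,
    )
    return ranked[:k]

# Hypothetical embeddings; real ones usually have hundreds of dimensions.
movies = {
    "Space Saga": [0.9, 0.1, 0.0],
    "Star Quest": [0.8, 0.2, 0.1],
    "Garden Tales": [0.0, 0.1, 0.9],
}

print(k_nearest([1.0, 0.0, 0.0], movies, k=2))
```

The two space-themed entries rank first because their vectors point in nearly the same direction as the query vector.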

Question 5

Q

What is the purpose of the ‘movies’ table in the database schema?

a) To store detailed information about each movie, including vector embeddings
b) To store user reviews and ratings
c) To manage movie streaming data
d) To store only movie titles

Answer

A

a) To store detailed information about each movie, including vector embeddings

Question 6

Q

Why do we create a ‘movies_by_title’ table instead of using a secondary index?

a) It improves query performance by avoiding high resource consumption
b) Secondary indexes do not work in Apache Cassandra
c) The database does not support queries by title
d) It allows us to store duplicate movie titles

Answer

A

a) It improves query performance by avoiding high resource consumption
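
The trade-off behind this answer can be pictured with a toy in-memory model: the same rows are written twice, once keyed by ID and once keyed by title, so a title lookup becomes a single keyed read instead of a scan. This is only an analogy, not Cassandra driver code; the rows, the `year` column, and the function names are hypothetical, and titles are assumed unique to keep the sketch short.

```python
# Toy in-memory model of the two tables; this is not Cassandra driver code.
# Hypothetical rows: the real schema's columns are not shown here.
movies = {
    1: {"title": "Space Saga", "year": 1999},
    2: {"title": "Garden Tales", "year": 2005},
    3: {"title": "Star Quest", "year": 2012},
}

# Query table: the same rows, written a second time and keyed by title.
movies_by_title = {
    row["title"]: {"movie_id": movie_id, **row}
    for movie_id, row in movies.items()
}

def find_by_scan(title):
    """Check every row, loosely like a query that must consult many nodes."""
    for movie_id, row in movies.items():
        if row["title"] == title:
            return {"movie_id": movie_id, **row}
    return None

def find_by_key(title):
    """Single keyed lookup, loosely like reading one partition."""
    return movies_by_title.get(title)

print(find_by_key("Star Quest"))
```

Both functions return the same row; the difference is how much data has to be examined to find it, which is the cost the dedicated table is meant to avoid.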

Question 7

Q

What is the primary key of the ‘movies_by_title’ table?

a) movie_id
b) title
c) imdb_id
d) movie_vector

Question 8

Q

What is the significance of using a token-aware load-balancing policy in Apache Cassandra?

a) It ensures that queries are sent directly to the nodes responsible for the requested data
b) It improves the accuracy of vector searches
c) It allows for better video streaming
d) It removes the need for partitioning

Answer

A

a) It ensures that queries are sent directly to the nodes responsible for the requested data
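
To make the routing idea concrete, here is a toy sketch of a token ring. It assumes three hypothetical nodes and uses an MD5-based stand-in hash purely for illustration (Cassandra's default partitioner is Murmur3, and real token ranges are far larger). The function names are invented for this example.

```python
import bisect
import hashlib

# Toy ring: each node owns every token up to and including its own.
# Node names and token values are hypothetical.
RING = [(100, "node-a"), (200, "node-b"), (300, "node-c")]
TOKENS = [token for token, _ in RING]

def token_for(partition_key):
    """Map a partition key to a token in 1..300 (stand-in hash, not Murmur3)."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % 300 + 1

def node_for_token(token):
    """Find the first node whose token is >= the given token."""
    index = bisect.bisect_left(TOKENS, token)
    return RING[index % len(RING)][1]

def coordinator_for(partition_key):
    """A token-aware client sends the request straight to the owning node."""
    return node_for_token(token_for(partition_key))

print(coordinator_for("Space Saga"))
```

Because the client can compute the owning node itself, the request does not need to land on an arbitrary node and be forwarded, which saves a network hop.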

Question 9

Q

Which cloud provider is selected for the Astra DB database in this setup?

a) Microsoft Azure
b) Amazon Web Services
c) Google Cloud
d) IBM Cloud

Answer

A

c) Google Cloud

Question 10

Q

What is the purpose of the ‘StorageAttachedIndex’ in the database schema?

a) To enable efficient vector search queries
b) To store user authentication tokens
c) To manage movie streaming data
d) To enforce uniqueness constraints on movie titles

Answer

A

a) To enable efficient vector search queries
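
For orientation, the CQL involved might look roughly like the statements below, held here as Python strings so they can be passed to a driver session. The index name `movies_vector_idx`, the selected `title` column, and the 3-element query vector are assumptions for illustration; the application's actual schema may differ.

```python
# CQL statements kept as plain strings; nothing here connects to a database.
# Index name and selected column are hypothetical.
CREATE_INDEX = (
    "CREATE CUSTOM INDEX IF NOT EXISTS movies_vector_idx "
    "ON movies (movie_vector) USING 'StorageAttachedIndex';"
)

def ann_query(vector, limit):
    """Build an approximate-nearest-neighbor query for the given vector."""
    literal = "[" + ", ".join(str(float(x)) for x in vector) + "]"
    return (
        "SELECT title FROM movies "
        f"ORDER BY movie_vector ANN OF {literal} LIMIT {int(limit)};"
    )

print(CREATE_INDEX)
print(ann_query([0.1, 0.2, 0.3], limit=5))
```

Without the index on the vector column, the `ORDER BY ... ANN OF` query has nothing to search against, which is why the index appears in the schema.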

(10 cards)