Recommender System Primer Flashcards

Question 1

Q

What data does collaborative filtering use?

Answer

A

Only uses Users’ history of ordering or rating items

Question 2

Q

What are the data sources that can be used for a recommender system? (3)

Answer

A

Question 3

Q

What do we know about data for collaborative filtering?

Answer

A

Very sparse

Question 4

Q

What are the representations of the collaborative filtering problem?

Answer

A

User-user approach: estimate a user’s rating of an item by finding “similar” users and then looking at their ratings for that item.
Item-item approach: estimate a user’s rating of an item by finding similar items and then looking at that user’s rating of these similar items.
Matrix factorization: construct two low-rank matrices that approximate the observed entries of X.

Question 5

Q

What’s The task of collaborative filtering?

Answer

A

“fill in” the missing values (i.e., the predicted user ratings) based on the existing ratings

Question 6

Q

How do you convert a dataframe to a Numpy matrix?

Answer

A

df.to_numpy()

Question 7

Q

In the user-user approach, how is the similarity weight calculated?

Answer

A

Two common options:

Question 8

Q

Answer

A

Can’t keep the user-rating matrix in sparse format, which does not scale well with a large number of users / items.
Use matrix factorization.

Question 9

Q

What’s an advantage of the matrix factorization approach?

Answer

A

We don’t need to break the sparsity of the user ratings matrix?

Question 10

Q

What is this? “singularity issues with matrix inverses”

Question 11

Q

Answer

A

- It’s feasible with sparse matrices, which can considerably reduce runtime and memory usage.

Question 12

Q

What’s matrix factorization?

Answer

A

This approach aims to approximate the observed entries in X as a product of two lower-rank matrices