Recommender System Primer Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What data does collaborative filtering use?

A

Only uses Users’ history of ordering or rating items

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the data sources that can be used for a recommender system? (3)

A
  • Users’ demographic information (e.g., age, gender, location)
  • Items’ attributes (e.g., price, category)
  • Users’ history of ordering or rating items
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do we know about data for collaborative filtering?

A

Very sparse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the representations of the collaborative filtering problem?

A
  • User-user approach: estimate a user’s rating of an item by finding “similar” users and then looking at their ratings for that item.
  • Item-item approach: estimate a user’s rating of an item by finding similar items and then looking at that user’s rating of these similar items.
  • Matrix factorization: construct two low-rank matrices that approximate the observed entries of X.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What’s The task of collaborative filtering?

A

“fill in” the missing values (i.e., the predicted user ratings) based on the existing ratings

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do you convert a dataframe to a Numpy matrix?

A

df.to_numpy()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In the user-user approach, how is the similarity weight calculated?

A

Two common options:

  • Pearson correlation
  • Cosine similarity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q
  • What’s a drawback of the user-user and item-item approaches?
  • How do you solve it?
A
  • Can’t keep the user-rating matrix in sparse format, which does not scale well with a large number of users / items.
  • Use matrix factorization.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What’s an advantage of the matrix factorization approach?

A

We don’t need to break the sparsity of the user ratings matrix?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is this? “singularity issues with matrix inverses”

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q
  • What is alternating least squares?
  • What’s an important point about it?
A
    • It’s feasible with sparse matrices, which can considerably reduce runtime and memory usage.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What’s matrix factorization?

A

This approach aims to approximate the observed entries in X as a product of two lower-rank matrices

How well did you know this?
1
Not at all
2
3
4
5
Perfectly