7: Recommender Systems Flashcards

1
Q

What is a recommender system ?

A

A system which attempts to recommend information items likely to be of interest to the users.

Eg movies, TV, music, books, news, jokes, by Netflix or Amazon.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two ways to collect data from users ?

A

Explicit:

  • Ask a user to rate an item
  • Ask a user to create a (wish) list

Implicit:

  • Keep a record of the items a user searches/views/purchases
  • Analyse the user’s social network to discover likes and dislikes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What other data can be used ?

A
  • Domain-based knowledge (category)
  • Patterns and relations
  • Higher level context
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Types recommendation algorithms ?

A
  • Popularity-based
  • Demographic based
  • Content based
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is item co-occurrence ?

A

‘People who like this also like …’

  • For each pair of items, count how often they are bought together (slow, but offline)
  • For each item that is being purchased, rank the other items by their co-occurrence counts.
  • Recommend the items from the top pairs that were bought together in the past.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Content-based recommendation ?

A

Offers similar products with similar features, eg another computer with higher RAM and it costs about the same.

  • Hard to guess which features are important for particular user
  • Hard to compare different product features in different product domains
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Collaborative Filtering ?

A
  • Used for spam control

Eg is an email address is blacklisted by one user, future messages can be blocked for all users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Give 2 types of CF ?

A
  • User-based CF: find the most similar users and calculate their average rating for the new item
  • Item-based CF: find the most similar items and calculate their current user’s average ratings.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the advantages and disadvantages with CF ?

A
  • Pros: simplicity, bc no content analysis is necessary
  • Cons:
    • Sparsity (not enough data)
    • Cold-start (no past similar purchases)
    • Users may get recommendations for low quality products bought by friends
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What factors should be taken into account for designing recommendation systems ?

A
  • Explanations
  • Minimum nonsense recommendations
  • Privacy
  • Spam recommender
  • No vandalism/offensive/explicit content
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What data to collect and how ?

A
  • Find ways to collect as much input as possible without being disruptive
  • A lot of data that can train a system: votes, clicks, page-view time, purchases, tagging, adding a title.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What’s more important between:

  • Data collection algorithm
  • Data collected
A

Data collected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly