Case Study — NextStar Recommender System Flashcards
What is Explicit Behavioural Data?
Personal Data/Profile, user is aware that it is being collected.
Example of Explicit Behavioural Data.
When a user rates a video clip.
What they rate that video clip as.
What is Implicit Data?
Data that the user is not aware that it is being collected.
Example of Implicit Behavioural Data.
Click Data, Purchase Data, Key logger.
What is bad about implicit Data?
The advertisements could become too specialised and becomes too compelling. Too much user spending.
What is right to anonymity?
they don’t know who you are but they know what you are doing
What is right to privacy?
they know who you are but don’t know what you are doing.
What is Machine Learning?
Algorithm makes intelligent decisions based on what it has previously learned. Programmer inputs data and answers and the algorithm will output rules to help produce future answers.
What are the three types of machine learning?
Supervised, Unsupervised, Reinforced.
What does machine learning entail?
Using the data and the answers to find the rules/patterns.
Why is supervised learning effective?
It’s precise.
What is supervised learning?
use of labeled datasets to train algorithms that to classify data or predict outcomes accurately.
What two techniques are used in supervised learning?
Regression and Classification.
What is regression?
Looking at relationship between results and features.
What is Classification?
Sorting items into categories by features.
What is unsupervised learning?
Giving data and letting the machine find patterns and labels itself.
Why is unsupervised learning effective?
Good for sorting data.
What three techniques are used in unsupervised learning?
Clustering and Dimensionality reduction and Association.
What is clustering?
Divide by similarity.
What is association?
Identifying sequences.
What id dimensionality reduction?
Taking high dimension data (complicated) and reducing it to low dimensional data (understandable and meaningful.)
What is reinforced learning?
Gives algorithm labels and rules and lets it achieve its goals. Then rewarding it or punishing it depending on the outcome.
What do most recommender systems use for learning?
Supervised Learning
What are the two types of filtering?
Content and Collaborative
What is content filtering?
Personalised results based on previous data interactions.
What is collaborative filtering?
What similar people interacted with.
What do the best recommender systems used for filtering?
A hybrid of both. In combination with different machine learning methods.
What is good about Collaborative filtering?
Other users are used.
Chance is involved.
What is bad about Collaborative filtering?
Needs more data
Problems for new users/products
If product has no ratings it can’t be used.
People aren’t all the same.
What is good about content filtering?
Works with lesser data
User specific
What is bad about content filtering?
Over-specification (won’t try new stuff)