Introduction & Collaborative Filtering Flashcards

1
Q

What is data mining?

A

Analyzing large datasets to discover:
- patterns and insights,
- enabling data driven decision making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the 3 components of mine data?

A
  1. Rapid growth,
  2. technology advancement and
  3. competitive advantage
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is rapid growth of data?

A

Volume of data generated increasing exp. Due to:
- transactions,
- media, -IoT and
- cloud

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the technology advancements? (3)

A
  1. Modern data storage,
  2. processing,
  3. large scale data analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is competitive advantage? (3)

A
  1. Uncover trends,
  2. optimize operations,
  3. strategic advantage
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the goal of supervised learning?

A

Predict a single variable where the target value is known

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are two methods in supervised learning?

A

Classification and regression(prediction)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the 2 goals of unsupervised learning?

A
  1. Segment data into groups;
  2. detect patterns where the target variable is unknown
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are four methods of unsupervised learning?

A
  1. Association rules & recommendation systems
  2. Cluster analysis
  3. Data & dimension reduction
  4. Data exploration/visualization
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the 7 steps in data mining?

A
  1. Define business purpose
  2. Obtain data (random sampling)
  3. Explore, clean, pre-process (reduce data)
  4. Specify task and choose technique
  5. Iterative implementation and tuning
  6. Assess and compare results
  7. Deploy solution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is collaborative filtering?

A

Technique to make predictions/recommendations by leveraging:
- preferences,
- behaviors, or
- interactions of groups/users

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How does collaborative filtering operate?

A

Individuals with similar preferences in the past are likely to share preference in the future

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are three examples of real world applications recommendation systems?

A
  1. e-commerce platforms,
  2. streaming services, and
  3. social networks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is association rules mining?

A

Focuses on discovering relationships or patterns between items in transactional data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does collaborative filtering aim to provide? and how?

A

Aims to provide personalized recommendations by
leveraging user interactions and similarities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the two ways to measure similarity?

A
  1. Pearson correlation and
  2. cosine similarity
17
Q

What are the ranges when using pearson correlation?

A

-1 (perfect negative) to 1 (perfect positive)

18
Q

What are the ranges when using cosine similarity?

A

0 (no similarity) to 1 (perfect similarity)

19
Q

Why isnt collaborative filtering not be used to create recommendations for new users or new items?

A

Suffers from cold start

20
Q

What are the advantages of the clustering alternative?

A

Move large computations and faster/cheaper

21
Q

What are the disadvantages of the clustering alternative?

A

Accuracy in recommendations

22
Q

What is item-based alternative?

A

Finding items that were co-rated by KNN user(s) with:
item of interest &
recommend the most popular items among the similar items

23
Q

What is user-based alternative?

A

Recommends items by identifying users with similar preferences