ML Basics Flashcards

1
Q

Which Python library is primarily used for numerical operations essential in collaborative filtering?

A

NumPy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which Python library should you use for manipulating data tables when working with collaborative filtering?

A

Pandas.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the standard Python library for interacting with SQLite databases?

A

SQLite3.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What machine learning library in Python provides tools to implement collaborative filtering?

A

Scikit-Learn.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the three basic types of tables you need in a database for collaborative filtering?

A

User table, item table, and ratings/interactions table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Which technique is often used in collaborative filtering to decompose the user-item interaction matrix?

A

Matrix factorization, specifically methods like Singular Value Decomposition (SVD) or Alternating Least Squares (ALS).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What metric can be used to evaluate the accuracy of a collaborative filtering model’s predictions?

A

RMSE (Root Mean Square Error).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How do you load data from an SQLite database into a Pandas DataFrame for processing in Python?

A

Use the read_sql_query function from Pandas, providing it with an SQL query and a connection object.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the purpose of decomposing the interaction matrix in collaborative filtering?

A

To reduce its dimensionality, enabling the approximation of missing entries and facilitating easier computation of recommendations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What evaluation metrics can be used to measure the performance of a recommender system in terms of relevance and user satisfaction?

A

Precision and recall.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What distinguishes content-based filtering from collaborative filtering?

A

Content-based filtering recommends items based on the features of the items and a profile of the user’s preferences, unlike collaborative filtering which uses user-item interactions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are hybrid recommender systems?

A

Hybrid recommender systems combine collaborative filtering, content-based filtering, and other methods to improve recommendation performance and overcome single approach limitations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the cold start problem in recommender systems?

A

The cold start problem occurs when there is insufficient data to make accurate recommendations for new users or items.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does sparsity mean in the context of collaborative filtering?

A

Sparsity refers to the condition where the user-item interaction matrix has a large percentage of missing values, posing challenges in learning effective recommendations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How are implicit feedback and explicit feedback different in recommender systems?

A

Implicit feedback is derived from user actions such as views and clicks, whereas explicit feedback involves direct ratings or preferences provided by the user.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is dimensionality reduction and why is it important in machine learning?

A

Dimensionality reduction techniques like PCA reduce the number of variables under consideration, simplifying models while retaining essential information, crucial for managing large datasets efficiently.

17
Q

What is regularization and why is it used in machine learning?

A

Regularization adds a penalty to the loss function to prevent overfitting, maintaining model simplicity and reliability even with complex data.

18
Q

What are neighborhood methods in collaborative filtering?

A

Neighborhood methods involve computing similarities between users or items to make predictions and recommendations based on nearest neighbors.

19
Q

How does deep learning enhance recommender systems?

A

Deep learning models complex non-linear relationships using neural networks, potentially improving accuracy in recommendations by capturing intricate patterns in data.

20
Q

What are ensemble methods in machine learning?

A

Ensemble methods improve prediction accuracy by combining multiple models, using techniques like random forests and gradient boosting machines.

21
Q

Besides RMSE and precision, what are other metrics for evaluating recommender systems?

A

Additional metrics include F1-score, NDCG (Normalized Discounted Cumulative Gain), and AUC (Area Under Curve), used to measure the effectiveness of recommender systems.

22
Q

What makes a recommender system context-aware?

A

Context-aware recommender systems consider additional contextual information such as location, time of day, or social context to provide more relevant recommendations.

23
Q

What is a recommender system?

A

A recommender system is a type of information filtering system that seeks to predict user preferences and recommend items that are likely of interest to the user.

24
Q

What is the primary purpose of using machine learning in recommender systems?

A

Machine learning in recommender systems is used to predict user preferences based on past data, improving the accuracy of recommendations.

25
Q

What is an interaction matrix in collaborative filtering?

A

An interaction matrix is a grid used in collaborative filtering that stores values representing interactions between users and items, like ratings or purchases.

26
Q

Why is data sparsity a challenge in collaborative filtering?

A

Data sparsity is challenging because most of the interactions are not recorded, making it difficult to accurately predict preferences from limited available data.

27
Q

What does PCA stand for and what is its role in machine learning?

A

PCA stands for Principal Component Analysis. It is a dimensionality reduction technique that simplifies data by reducing the number of variables, while retaining important information.

28
Q

How does regularization help in preventing overfitting?

A

Regularization helps prevent overfitting by introducing a penalty term to the loss function used to train the model, which discourages overly complex models.

29
Q

What is the difference between explicit and implicit feedback in recommender systems?

A

Explicit feedback is direct input from users, like ratings, while implicit feedback is derived from user behavior, such as browsing history or purchase records.

30
Q

What does SVD stand for and why is it used in collaborative filtering?

A

SVD stands for Singular Value Decomposition. It is used in collaborative filtering to decompose the interaction matrix into simpler matrices, making it easier to fill missing values and make predictions.

31
Q

How do neural networks contribute to the performance of recommender systems?

A

Neural networks can model complex and non-linear relationships, which allows them to capture deeper patterns in the data, potentially improving the accuracy of recommendations.

32
Q

What is a hybrid recommender system and why is it advantageous?

A

A hybrid recommender system combines multiple recommendation techniques (like collaborative filtering and content-based filtering) to leverage the strengths of each and mitigate their weaknesses, often resulting in better performance.

33
Q

Why is context important in recommender systems?

A

Contextual information (like time or location) can significantly influence user preferences, and incorporating it into a recommender system can help make more relevant and timely recommendations.

34
Q

What does NDCG stand for and what does it measure in recommender systems?

A

NDCG stands for Normalized Discounted Cumulative Gain. It measures the effectiveness of a recommender system by evaluating the ranking of recommended items, giving more importance to the relevance of items ranked higher on the list.