Types of ML Flashcards

1
Q

What is supervised learning?

A

Supervised learning is a type of machine learning where models are trained on labeled data, meaning each input has a corresponding correct output.

2
Q

What are the two main types of supervised learning?

A

The two main types are regression (predicting continuous values) and classification (predicting discrete categories).

3
Q

What is regression in supervised learning?

A

Regression is a type of supervised learning where the model predicts a continuous numerical value based on input data.

4
Q

What is classification in supervised learning?

A

Classification is a type of supervised learning where the model assigns input data to predefined categories or labels.

5
Q

Give an example of a regression problem.

A

Predicting house prices based on features like area, number of rooms, and location.

6
Q

Give an example of a classification problem.

A

Spam detection, where emails are classified as ‘spam’ or ‘not spam’.

7
Q

What is the key difference between regression and classification?

A

Regression deals with continuous outputs, while classification deals with discrete category outputs.

8
Q

What are some common regression algorithms?

A

Linear Regression, Polynomial Regression, Ridge Regression, Lasso Regression.

9
Q

What are some common classification algorithms?

A

Logistic Regression, Decision Trees, Random Forest, Support Vector Machines (SVM), Naive Bayes.

10
Q

What is the loss function used in linear regression?

A

Mean Squared Error (MSE) is commonly used in linear regression to measure prediction errors.
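
As a minimal sketch of the definition, MSE averages the squared differences between targets and predictions. The function name and the house-price numbers below are illustrative assumptions, not part of the original card.

```python
def mean_squared_error(y_true, y_pred):
    """Average of squared differences between targets and predictions."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical house prices (in thousands) and model predictions.
actual = [200.0, 250.0, 300.0]
predicted = [210.0, 240.0, 330.0]

# Errors are -10, 10 and -30, so the squared errors sum to 1100.
mse = mean_squared_error(actual, predicted)
print(mse)  # 1100 / 3, roughly 366.67
```

Squaring means the single 30-unit miss contributes nine times as much as each 10-unit miss, which is why MSE is sensitive to outliers.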

11
Q

Which loss function is used in classification problems?

A

Cross-Entropy Loss (Log Loss) is commonly used for classification problems.
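
A minimal sketch of binary cross-entropy, assuming labels are 0 or 1 and predictions are probabilities of class 1. The function name, the clipping constant `eps`, and the sample values are illustrative assumptions.

```python
import math


def binary_cross_entropy(y_true, p_pred, eps=1e-12):
    """Mean negative log-likelihood of the true labels under p_pred."""
    total = 0.0
    for y, p in zip(y_true, p_pred):
        # Clip so that log(0) is never evaluated.
        p = min(max(p, eps), 1.0 - eps)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(y_true)


labels = [1, 0, 1]
confident = binary_cross_entropy(labels, [0.9, 0.1, 0.8])
unsure = binary_cross_entropy(labels, [0.6, 0.4, 0.5])

# Probabilities closer to the true labels give a lower loss.
print(confident < unsure)  # True
```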

12
Q

What is overfitting in supervised learning?

A

Overfitting occurs when a model learns patterns too specific to the training data, reducing its performance on unseen data.

13
Q

How can overfitting be prevented?

A

Using techniques like regularization (Lasso, Ridge), pruning (for decision trees), and increasing training data. Cross-validation helps detect overfitting and tune model complexity.

14
Q

What is underfitting in supervised learning?

A

Underfitting occurs when a model is too simple to capture patterns in the data, leading to poor performance on both training and test data.

15
Q

What is logistic regression, and why is it used for classification?

A

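Logistic Regression is a statistical model most commonly used for binary classification. It passes a linear combination of the input features through the sigmoid function to predict the probability of the positive class, then applies a threshold (commonly 0.5) to assign a class.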
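
The probability-then-threshold step can be sketched as follows. The weights and bias are hypothetical values chosen for illustration (a real model would learn them from labeled data), and the 0.5 threshold is a common default rather than a requirement.

```python
import math


def sigmoid(z):
    """Map any real number to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(features, weights, bias):
    """Estimated probability of the positive class for one input."""
    z = sum(w * x for w, x in zip(weights, features)) + bias
    return sigmoid(z)


def predict_label(features, weights, bias, threshold=0.5):
    """Turn the probability into a 0/1 class using a threshold."""
    return 1 if predict_proba(features, weights, bias) >= threshold else 0


# Hypothetical, hand-picked parameters (not learned).
weights = [1.5, -2.0]
bias = 0.25

p = predict_proba([2.0, 1.0], weights, bias)      # z = 3.0 - 2.0 + 0.25 = 1.25
label = predict_label([2.0, 1.0], weights, bias)  # p is above 0.5, so label is 1
```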

16
Q

What is the role of a decision boundary in classification?

A

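A decision boundary is the surface in feature space that separates the regions a classifier assigns to different classes; the side of the boundary an input falls on determines its predicted class.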

17
Q

What is a confusion matrix?

A

A confusion matrix is a table used to evaluate the performance of a classification model by comparing actual and predicted values.
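
For the binary case, the four cells of the table can be counted directly. This is a minimal sketch; the function name, the choice of 1 as the positive class, and the sample labels are assumptions for illustration.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    tp = fp = fn = tn = 0
    for t, p in zip(y_true, y_pred):
        if p == positive and t == positive:
            tp += 1  # predicted positive, actually positive
        elif p == positive:
            fp += 1  # predicted positive, actually negative
        elif t == positive:
            fn += 1  # predicted negative, actually positive
        else:
            tn += 1  # predicted negative, actually negative
    return {"tp": tp, "fp": fp, "fn": fn, "tn": tn}


actual = [1, 0, 1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 1, 0, 1, 0]

counts = confusion_counts(actual, predicted)
print(counts)  # {'tp': 3, 'fp': 1, 'fn': 1, 'tn': 3}
```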

18
Q

What are precision, recall, and F1-score in classification?

A

Precision is the fraction of predicted positives that are actually positive, TP / (TP + FP). Recall is the fraction of actual positives that the model finds, TP / (TP + FN). F1-score is the harmonic mean of precision and recall.
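
A short worked example, computed from true positive (TP), false positive (FP), and false negative (FN) counts. The counts are made up for illustration, and returning 0.0 when a denominator is zero is a convention assumed here, not a universal rule.

```python
def precision_recall_f1(tp, fp, fn):
    """Compute precision, recall and F1 from confusion-matrix counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        f1 = 0.0
    else:
        # Harmonic mean of precision and recall.
        f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical counts: 8 true positives, 2 false positives, 8 false negatives.
precision, recall, f1 = precision_recall_f1(tp=8, fp=2, fn=8)
print(precision, recall)  # 0.8 0.5
```

Here F1 is about 0.615, lower than the arithmetic mean of 0.65, because the harmonic mean is pulled toward the smaller of the two values.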

19
Q

What is multi-class classification?

A

Multi-class classification refers to problems where there are more than two possible output categories.
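
One common way to choose among more than two classes (an assumption here, since the card does not name a method) is to turn per-class scores into probabilities with softmax and pick the largest. The class names and scores below are hypothetical.

```python
import math


def softmax(scores):
    """Convert raw scores into probabilities that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict_class(scores, classes):
    """Return the most probable class and the full probability list."""
    probs = softmax(scores)
    best = max(range(len(classes)), key=lambda i: probs[i])
    return classes[best], probs


classes = ["cat", "dog", "bird"]  # hypothetical categories
scores = [2.0, 1.0, 0.1]          # hypothetical model outputs

winner, probs = predict_class(scores, classes)
print(winner)  # cat
```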

20
Q

What is multi-label classification?

A

Multi-label classification is where each instance can belong to multiple classes simultaneously, unlike single-label classification.
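
Multi-label targets are often represented as one 0/1 indicator per class, so a single instance can switch on several classes at once. A minimal sketch with hypothetical movie genres:

```python
def to_indicator(label_sets, classes):
    """Encode each set of labels as a 0/1 vector, one slot per class."""
    return [[1 if c in labels else 0 for c in classes] for labels in label_sets]


classes = ["action", "comedy", "drama"]          # hypothetical label space
movies = [{"action", "comedy"}, {"drama"}, set()]  # labels per instance

indicator = to_indicator(movies, classes)
print(indicator)  # [[1, 1, 0], [0, 0, 1], [0, 0, 0]]
```

The first row has two ones, which a single-label encoding could not express.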

21
Q

What is class imbalance, and how do you handle it?

A

Class imbalance occurs when one class significantly outnumbers others. It can be handled using oversampling, undersampling, or synthetic techniques like SMOTE.
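
A minimal sketch of random oversampling, the simplest of the options above: minority-class samples are duplicated at random until every class matches the largest one. This is not SMOTE, which synthesizes new points instead of copying existing ones. The function name, seed, and data are illustrative assumptions.

```python
import random
from collections import Counter


def random_oversample(samples, labels, seed=0):
    """Duplicate minority-class samples until all classes are the same size."""
    rng = random.Random(seed)
    by_class = {}
    for sample, label in zip(samples, labels):
        by_class.setdefault(label, []).append(sample)

    target = max(len(items) for items in by_class.values())
    out_samples, out_labels = [], []
    for label, items in by_class.items():
        # Draw extra copies (with replacement) to reach the target size.
        extra = [rng.choice(items) for _ in range(target - len(items))]
        for sample in items + extra:
            out_samples.append(sample)
            out_labels.append(label)
    return out_samples, out_labels


samples = [[0.1], [0.2], [0.3], [0.4], [0.5], [0.9]]
labels = ["ham"] * 5 + ["spam"]  # 5 to 1 imbalance

new_samples, new_labels = random_oversample(samples, labels)
balanced = Counter(new_labels)
print(balanced["ham"], balanced["spam"])  # 5 5
```

Oversampling should be applied only to the training split, so that duplicated samples do not leak into evaluation data.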

22
Q

What is Ridge Regression, and how does it help?

A

Ridge Regression is linear regression with an L2 penalty (the sum of squared coefficients) added to the loss function. It reduces overfitting by shrinking coefficients toward zero.
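
To see the shrinkage, consider the simplest case: one feature and no intercept, an assumption made here so the solution fits in one line. Minimizing sum((y - w*x)^2) + lam * w^2 gives w = sum(x*y) / (sum(x^2) + lam). The data and penalty strength `lam` are illustrative.

```python
def ridge_slope(xs, ys, lam):
    """Closed-form ridge solution for one feature with no intercept."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return sxy / (sxx + lam)


xs = [1.0, 2.0, 3.0]
ys = [2.0, 4.0, 6.0]  # exactly y = 2x

w_ols = ridge_slope(xs, ys, lam=0.0)     # 28 / 14 = 2.0, ordinary least squares
w_ridge = ridge_slope(xs, ys, lam=14.0)  # 28 / 28 = 1.0, shrunk toward zero
print(w_ols, w_ridge)  # 2.0 1.0
```

The penalty only enlarges the denominator, so the coefficient gets smaller as `lam` grows but never becomes exactly zero.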

23
Q

What is Lasso Regression?

A

Lasso Regression is linear regression with an L1 penalty (the sum of absolute coefficient values) added to the loss function. It can shrink some coefficients exactly to zero, which performs feature selection.
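
The exact-zero behavior comes from soft-thresholding. As a sketch under simplifying assumptions (one feature, no intercept, objective 0.5 * sum((y - w*x)^2) + lam * |w|), the solution is the soft-thresholded value of sum(x*y) divided by sum(x^2). The data and `lam` values are illustrative.

```python
def soft_threshold(value, lam):
    """Shrink value toward zero by lam, clamping to exactly zero inside [-lam, lam]."""
    if value > lam:
        return value - lam
    if value < -lam:
        return value + lam
    return 0.0


def lasso_slope(xs, ys, lam):
    """Lasso solution for one feature with no intercept."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return soft_threshold(sxy, lam) / sxx


xs = [1.0, 2.0, 3.0]
ys = [2.0, 4.0, 6.0]  # exactly y = 2x, so sum(x*y) = 28 and sum(x^2) = 14

print(lasso_slope(xs, ys, lam=7.0))   # (28 - 7) / 14 = 1.5
print(lasso_slope(xs, ys, lam=30.0))  # 0.0, the coefficient is dropped
```

Once `lam` exceeds |sum(x*y)|, the coefficient is exactly zero, which is how Lasso removes features.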

24
Q

What are some real-world applications of supervised learning?

A

Spam detection, fraud detection, medical diagnosis, stock price prediction, speech recognition, and image classification.

25
Q

What evaluation metrics are used for regression models?

A

Common metrics include Mean Squared Error (MSE), Mean Absolute Error (MAE), and R-squared (R²).
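
A minimal sketch of MAE and R² on a small made-up dataset; the function names and values are illustrative assumptions. R² is computed as 1 minus the ratio of the residual sum of squares to the total sum of squares around the mean.

```python
def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between targets and predictions."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true, y_pred):
    """Share of target variance explained by the predictions."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


actual = [3.0, 5.0, 7.0, 9.0]
predicted = [2.0, 5.0, 8.0, 9.0]

mae = mean_absolute_error(actual, predicted)  # (1 + 0 + 1 + 0) / 4 = 0.5
r2 = r_squared(actual, predicted)             # 1 - 2 / 20, roughly 0.9
```

MAE is in the same units as the target, while R² is unitless and equals 1.0 for perfect predictions.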