ML Flashcards

1
Q

What training is used for datasets without labels?

A

Unsupervised training

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What training is used for datasets with labels?

A

Supervised

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What training is used for smaller datasets with labels and bigger ones wo labels?

A

Semi-supervised

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What training is used when positive and negative feedbacks are given?

A

Reinforcement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What training is used based on previous knowledge?

A

Trasnfer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What algorithm predicts the numerical value of a variable based on independent variables?

A

Linear regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What algorithm predicts a binary outcome of independent variables?

A

Logistic regression (binary)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What algorithm predicts multiple outcomes of independent variables?

A

Logistic regression (multinomial)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What algorithm predicts outcome with order of independent variables?

A

Logistic regression (ordinal)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What algorithm predicts results on data with curvilineal patterns?

A

Polinomial regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What algorithms try to decrease overfitting in linear regression?

A

Ridge and Lasso regressions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the name of how much changing training sets change the model?

A

Variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the name of the phenomenon that measures how much the training set is different from real world scenarios?

A

Bias, vies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Simplistic models tend to have what kind of bias variance trade-off?

A

High Bias and low variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Complex models tend to have what kind of bias variance trade-off?

A

Low bias and high variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Which bias-variance trade off results in Underfitting?

A

High Bias and low variance

17
Q

Which bias-variance trade off results in Overfitting?

A

Low Bias and high variance

18
Q

What is the name of the specific table layout that allows visualization of the performance of an algorithm in ML?

A

Confusion matrix

19
Q

In an error matrix, what is the name of a prediction that is 1 when the actual classification is 0?

A

False positive

20
Q

In an error matrix, what is the name of a prediction that is 0 when the actual classification is 1?

A

False negative

21
Q

Which dimension reduction algorithm is used to simplify the data model while keeping the variance?

A

Principal component analysis PCA

22
Q

What kind of supervised algorithm is used for classification and regression of multi-dimensional data models that cannot be separated linearly?

A

Support Vector Machines SVM

23
Q

Which algorithm uses multiple decision trees trained on different subsets of data and then use a combination of their results?

A

Random forests

24
Q

Which clustering algorithm groups elements in levels so that it iterates grouping or dividing clusters in a tree structure?

A

Hierarchical clustering

25
Q

Which algorithm uses a majority of the closest nodes to define a classification?

A

K nearest neighbors

26
Q

How knn makes regression?

A

It uses commonly an average of the distance of the nearest neighbors

27
Q

What heapens if we chose a low K value in knn?

A

Low K results in noise sensitivity, which is high variance and low bias, thus overfitting

28
Q

What heapens if we chose a high K value in knn?

A

High K values result in oversimplification, so high bias and low variance, underfitting

29
Q

Which algorithm uses iterative centroids that get closer to clusters in a unsuperviserd learning?

A

K-means