Lesson 1 Flashcards

1
Q

What are the 4 steps of machine learning?

A
  1. Collect data
  2. Learning algorithm
  3. Evaluation
  4. Deployment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the 2 parts of supervised learning?

A

Regression and classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What 3 kinds of machine learning do you have?

A

Supervised, unsupervised, and reinforcement learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Can a regression target be a positive number?

A

Yes

The target is a real number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Give 3 classification goals and an example.

A
  1. Binary classification (give loan, or not (0,1))
  2. Multiple classification (identifying birds (more than one label))
  3. Sequence labeling (partioning a sound sample into words)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are all the data split sets and what are they used for?

A
  1. Train, learn about data and making the model.
  2. Validation, monitor performance and choose the best hyperparameters.
  3. Test, Evaluate generalization to the real-world.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is K-fold cross validation?

A

It splits the training set into K amount of folds and evaluates its performance on 1 of those folds while using the other sets as training data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is stratification in data splitting?

A

It ensures that each subset (like train/test) has the same proportion of classes or categories as the full dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is time-series cross-validation?

A

It evaluates models on time-ordered data by respecting the chronological order. It splits data into train/test sets multiple times, always using past data to predict future data, avoiding data leakage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What 3 evaluation metrics for reggression are there?

A
  1. Mean Absolute Error
  2. Mean Squared Error
  3. Coefficient of Determination (R2, low is low variance)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What 2 basic evaluation metrics for classification are there?

A
  1. Error rate (proportion of wrongly classified data)
  2. Accuracy (1-error rate)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does the confusion matrix say?

A

amount of TP, TN, FP, FN in a classification problem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does the F-score say?

A

It is a evaluation metric that will give more weight to either precision or recall

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What happens when β is low en when is β high?

A

β > 1 -> less weight to precision and more to recall.
β < 1 -> more weight to precision, less to recall.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly