Introduction to machine learning Flashcards

1
Q

What is an AI?

A

Algorithm capable of learning and making decisions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the machine learning pipeline?

A
- Training:
     Load data
     Extract features
     Train model
     Evaluate
- Testing:
    Load data
    Extract features
    Predict using model
    Evaluate

(Only difference is that you initally train, and then use the trained model when testing)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Briefly describe the 4 types of machine learning

A
  • Supervised: Humans “teach” the algorithm
  • Unsupervised: Algorithm does everything by itself
  • Semi-supervised: In between the two previous
  • Reinforcement: Algorithm left to itself, but gets
    rewarded if it does something correctly
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is classification? How do you calculate loss in classification?

A

Classification is a classifier used in machine learning. The output is discrete. Loss i calculated by counting the number of misclassified samples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is regression? How do you calculate loss in regression?

A

Regression is another classifier. The output is continuous. Loss is calculated by squaring (to avoid outliers) and adding up and taking the average.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are some other classifiers (besides classification and regression)?

A

Mainly tree based models, like Random Forest. Works by using a combination of N decision trees.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is bagging?

A

Combining several classifiers. Can reduce variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the difference between classification and regression? Explain with an example if possible.

A

Classification has a discrete (categorical). If you are trying to predict house prices, it can tell you which houses might sell for below or above the predicted price (categories).

Regression has a continuous output (numerical). So if predicting housing prices, it actually gives you the predicted price.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly