MODULE 2 S1 Flashcards

M2S1

1
Q

It predicts consecutive numbers (real numbers).

A

Regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

T/F
Simply duplicating the same data points or collecting very similar data will not help.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

If your model performs well on the training set but poorly on the validation set.

A

Overfitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

It occurs when you fit a model too closely to the particularities of the training set and obtain a model that works well on the training set but is not able to generalize to new data.

A

Overfitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

This occurs when a model learns the training data too well, including its noise and outliers.

A

Overfitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

If your model is too simple then you might not be able to capture all the aspects of and variability in the data, and your model will do badly even on the training set. Choosing too simple a model is called ____________.

A

Underfitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The two phases of supervised ML process: Training, ________.

A

Predicting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Input object : __________
Output value : __________

A

Feature
Label

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

A model that performs poorly on both training and new data because it hasn’t learned enough from the training data.

A

Underfitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

It refers to algorithms that address classification problems where the output variable is categorical.

A

Classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Where labeled training data refers to a dataset that includes both the input data and the corresponding correct output.

A

Supervised Learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

It refers to the error from having wrong / too simple assumptions in the learning algorithm.

A

Bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Classification : _____________ variable
Regression : ______________ variable

A

Categorical
Continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

It predicts one of the possible class labels.

A

Classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Two Types of Classification

A

Binary Classification
Multiple Classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

T/F
Classification algorithms address classification problems where the output variable is categorical.

A

TRUE

17
Q

It refers to the error resulting from sensitivity to the noise / fluctuations in the training data

A

Variance

18
Q

T/F
The more complex we allow our model to be, the better we will be able to predict on the training data.

A

TRUE

19
Q

Categories of Supervised Learning

A

Classification
Regression

20
Q

T/F
The larger variety of data points your data set contains, the more complex a model you can use without overfitting.

A

TRUE

21
Q

T/F
Model complexity does not depend on the variation of inputs contained in the training dataset.

A

FALSE
(it is intimately tied)

22
Q

These concepts helps to understand how well a model performs: Overfitting, Underfitting, _________.

A

Generalization

23
Q

If a model is able to make accurate predictions on unseen data, we say it is able to _____________ from the training set to the test set.

A

Generalize

24
Q

T/F
The primary objective of the supervised learning technique is to map the input variable with the output variable.

A

TRUE

25
Q

These are algorithms that handle regression problems where input and output variables have a linear relationship

A

Regression

26
Q

It refers to when a model is built on the training data and then is able to make accurate predictions on new, unseen data.

A

Generalization

27
Q

Classification Algorithms

A

Random Forest Algorithm
Decision Tree Algorithm
Logistic Regression Algorithm
Support Vector Machine Algorithm

28
Q

Regression Algorithms

A

Simple Linear Regression Algorithm
Multivariate Regression Algorithm
Decision Tree Algorithm
Ridge Algorithm
Lasso Algorithm

29
Q

In supervised learning, market trend is an example of _______________

A

Regression