Lecture 1 Flashcards

1
Q

Machine Learning

A

the field of study that gives computers the ability to learn without being explicitly programmed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Supervised Learning

A

An algorithm maps a new input to an output based on example input-output pairs of the training data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Unsupervised Learning

A

Only the input data is known, and no known output data is given to the algorithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Accuracy

A

the fraction of inputs for which the right output was predicted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Training Data

A

Data used to build a machine learning model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Test Data

A

Data used to assess how well the model works

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is a z-score computed?

A

Subtracting the mean and dividing by the standard-deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Reinforcement Learning

A

Involves reasoning under uncertainty and how agents take actions to maximize their reward

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Semi-supervised Learning

A

Involves a small portion of labeled examples and a large number of unlabeled examples from which a model must learn and make predictions on new examples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Active Learning

A

A learning algorithm can interactively query a user to label new data points with the desired outputs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Model

A

An equation that links the values of some features to the predicted value of the target variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Score functions/Fit statistics/Score metrics

A

measures of how well

the model fits the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Feature selection

A

reducing the number of predictors by selecting the important ones (dimensionality reduction)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Feature extraction

A
reducing the number of predictors by means of a 
mathematical operation (e.g., PCA)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Model Building

A

finding the equation of the model and the coefficients in it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are two typical tasks for Machine Learning?

A
  1. Prediction (supervised learning)

2. To learn something previously unknown (unsupervised learning)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are the two main types of Supervised Learning?

A

Classification and Regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Classification

A

A discrete output such as color, gender, yes/no, class membership

question example: “Will you pass this course?”

19
Q

Regression

A

A continuous output like temperature, age, distance, salary

question example: “How many points will you get in the exam?”

20
Q

Preprocessing

A

Cleaning and/or transforming the data

21
Q

When do machine learning algorithms not preform well?

A

When the input numerical attributes have a very different scale

22
Q

Standard Scaler

A

z-scores or standard scores where the mean is 0 and the standard deviation is 1

23
Q

What type of data does the Standard Scaler work well with?

A

it’s a common method in data normalization so it’s good for non-skewed data

24
Q

Robust Scaler

A

The median is 0 and the interquartile range is 1

25
Q

What type of data does the Robust Scaler work well with?

A

It’s better for skewed data because it deals better with outliers

26
Q

MinMax Scaler

A

Shifts data to an interval set by Xmin and Xmax.

Formula:

Xnew = (x - Xmin) / (Xmax - Xmin)

27
Q

What happens when you log scale your data?

A

You get a better prediction accuracy

28
Q

Normalizer

A

Each row of the data is rescaled so that its norm becomes 1. Doesn’t work by feature (column) and is only used when the direction of the data matters

29
Q

What type of graph is a normalizer helpful for

A

histograms

30
Q

Binning

A

Separating the feature values into n categories. You can replace all the values within each category with a single value like their mean

31
Q

What is Binning effective for?

A

Models with few parameters like regression models

32
Q

What is Binning not effective for?

A

Models with many parameters like decision trees

33
Q

Cross-validation

A

Evaluates the model’s ability to predict new data; detects overfitting or selection bias

34
Q

Feature

A

properties that describe data points

35
Q

Sample/Instance

A

a data point; each entity or row in the data

36
Q

Pipeline

A

The end-to-end construct that orchestrates the flow of data into, and output from, a machine learning model

37
Q

Clustering

A

A type of unsupervised learning where the algorithm finds natural groups or clusters in data

38
Q

Feature Vector

A

A vector listing all the feature values

39
Q

Feature Value

A

The value of a property or feature of the data point/instance, e.g. white, 66, yes

40
Q

Features

A

An individual measurable property or characteristic of a data point, e.g. color, age, is rich

41
Q

True or false: Classification problems can be used to predict only two discrete valued output such as 0 and 1.

A

False

Classification can be used for an arbitrary number of classes, not just 2.

42
Q

Which scaling method results in a range between 0 and 1?

A

Min-Max scaler

43
Q

The standard scaler uses z-scores. How do you compute z-scores?

A

Subtracting the mean and dividing by the standard-deviation