Lecture 1 Flashcards

1
Q

Machine Learning

A

the field of study that gives computers the ability to learn without being explicitly programmed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Supervised Learning

A

An algorithm maps a new input to an output based on example input-output pairs of the training data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Unsupervised Learning

A

Only the input data is known, and no known output data is given to the algorithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Accuracy

A

the fraction of inputs for which the right output was predicted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Training Data

A

Data used to build a machine learning model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Test Data

A

Data used to assess how well the model works

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is a z-score computed?

A

Subtracting the mean and dividing by the standard-deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Reinforcement Learning

A

Involves reasoning under uncertainty and how agents take actions to maximize their reward

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Semi-supervised Learning

A

Involves a small portion of labeled examples and a large number of unlabeled examples from which a model must learn and make predictions on new examples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Active Learning

A

A learning algorithm can interactively query a user to label new data points with the desired outputs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Model

A

An equation that links the values of some features to the predicted value of the target variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Score functions/Fit statistics/Score metrics

A

measures of how well

the model fits the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Feature selection

A

reducing the number of predictors by selecting the important ones (dimensionality reduction)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Feature extraction

A
reducing the number of predictors by means of a 
mathematical operation (e.g., PCA)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Model Building

A

finding the equation of the model and the coefficients in it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are two typical tasks for Machine Learning?

A
  1. Prediction (supervised learning)

2. To learn something previously unknown (unsupervised learning)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are the two main types of Supervised Learning?

A

Classification and Regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Classification

A

A discrete output such as color, gender, yes/no, class membership

question example: “Will you pass this course?”

19
Q

Regression

A

A continuous output like temperature, age, distance, salary

question example: “How many points will you get in the exam?”

20
Q

Preprocessing

A

Cleaning and/or transforming the data

21
Q

When do machine learning algorithms not preform well?

A

When the input numerical attributes have a very different scale

22
Q

Standard Scaler

A

z-scores or standard scores where the mean is 0 and the standard deviation is 1

23
Q

What type of data does the Standard Scaler work well with?

A

it’s a common method in data normalization so it’s good for non-skewed data

24
Q

Robust Scaler

A

The median is 0 and the interquartile range is 1

25
What type of data does the Robust Scaler work well with?
It's better for skewed data because it deals better with outliers
26
MinMax Scaler
Shifts data to an interval set by Xmin and Xmax. Formula: Xnew = (x - Xmin) / (Xmax - Xmin)
27
What happens when you log scale your data?
You get a better prediction accuracy
28
Normalizer
Each row of the data is rescaled so that its norm becomes 1. Doesn't work by feature (column) and is only used when the direction of the data matters
29
What type of graph is a normalizer helpful for
histograms
30
Binning
Separating the feature values into n categories. You can replace all the values within each category with a single value like their mean
31
What is Binning effective for?
Models with few parameters like regression models
32
What is Binning not effective for?
Models with many parameters like decision trees
33
Cross-validation
Evaluates the model's ability to predict new data; detects overfitting or selection bias
34
Feature
properties that describe data points
35
Sample/Instance
a data point; each entity or row in the data
36
Pipeline
The end-to-end construct that orchestrates the flow of data into, and output from, a machine learning model
37
Clustering
A type of unsupervised learning where the algorithm finds natural groups or clusters in data
38
Feature Vector
A vector listing all the feature values
39
Feature Value
The value of a property or feature of the data point/instance, e.g. white, 66, yes
40
Features
An individual measurable property or characteristic of a data point, e.g. color, age, is rich
41
True or false: Classification problems can be used to predict only two discrete valued output such as 0 and 1.
False | Classification can be used for an arbitrary number of classes, not just 2.
42
Which scaling method results in a range between 0 and 1?
Min-Max scaler
43
The standard scaler uses z-scores. How do you compute z-scores?
Subtracting the mean and dividing by the standard-deviation