Lecture 5 - Machine Learning Flashcards

1
Q

What is machine learning?

A

Making decisions or predictions from patterns observed in data gives “computers the ability to
learn without being explicitly programmed.”
[Arthur Samuels, 1959]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

People will often talk about the generated code as…

A

being a “model” a “classifier” or a “regressor”.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Machine learning is…

A

Machine learning is applying an algorithm to data to produce a model that makes predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is supervised learning?

A

where we know the answer we want for a large number of samples and we want to learn to predict the answer for new ones

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is unsupervised learning?

A

Where we don’t have any particular answers in mind (much more useful for exploratory analysis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the attribute we want to learn or predict?

A

This is called the target variable or often just y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the other variables called in a machine learning algorithm.

A

All the other columns are often called X.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Data with an included target are called ?

A

“labelled data” (sometimes also “ground truth”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Data without an included target are called?

A

“unlabelled” and are mostly useless for supervised learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

In supervised learning we’ll use the labelled training data to?

A

learn from

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

We define a “cost function” which

helps us define how

A

“wrong” an answer might be in order to decide how much to “correct” later predictions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Examples of supervised machine learning models

A

Given an email as input, classify it as spam or not-spam
Given data about a house, predict its sale price
Given a date, predict that day’s rainfall
Given a website visitor, predict if they’re likely to sign up
Given a network traffic stream, predict if it’s normal or malicious

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Two major types of problems we solve with ML?

A
Classification - Given two (or more) classes, which class does each sample belong to
Regression - Given an input, predict a continuous output variable (e.g. temperature, house price)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

A “decision boundary”…

A

Threshold value or tipping point above which we will classify values into class 1 and below which we classify values into class 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

A line as a decision boundary in

A

2-dimensions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

A surface as the decision boundary in

A

3-dimensions

17
Q

Dimension n-1 as the decision boundary in n-dimenions

A

A hyperplane