ML Overview Flashcards

1
Q

What are the 6 stages of the ML Pipeline

A
  1. Define the problem
  2. Data collection
  3. Data preprocessing
  4. Data modelling / machine learning
  5. Model evaluation
  6. Model application (on new/unseen data)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define ML

A

Input data and answers and output rules that can be applied by the computer to new situations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 4 types of data sets?

A
  1. Record
  2. Graph and network
  3. Ordered
  4. Spatial, image, multimedia
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are data objects?

A
  1. Make up datasets
  2. Represent the entity being measured
  3. Are a row in a database
  4. Aka entities, instances, points, samples, tuples, patterns, vectors, examples
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are data attributes?

A
  1. Describe the data objects
  2. Are the columns in the database
  3. Aka features, variables, dimensions, predictors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the five types of data attributes

A
  1. Nominal - categories
  2. Binary - 0,1
  3. Ordinal - meaningful order but magnitude between successive values is not necessarily meaningful
  4. Interval scaled - equal sized units, ordered, no tire zero
  5. Ratio scaled - as per interval but with a true zero
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

3 major categories of machine learning

A

Supervised
Unsupervised
Reinforcement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define supervised ML

A

Learning with a labelled training dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define unsupervised ML

A

Learning patterns in unlabelled data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define reinforcement ML

A

Learning based on feedback and reward

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

2 types of supervised ML

A

Classification
Regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

2 types of unsupervised ML

A

Clustering
Anomaly detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

2 types of reinforcement learning

A

Game play
Control

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

3 regression methods

A

Linear regression, polynomial regression
Ridge regression, LASSO, elastic net
Artificial neural networks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

4 classification methods

A

Logistic regression
K nearest neighbours
Support vector machines
Decision trees

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

3 things that determine which ML approach to take

A

Data
Computing requirements
Interpretatability

17
Q

What 3 aspects of the data determine which ML approach to take

A
  1. Size of data (more model parameters need bigger datasets)
  2. Feature types
  3. Linear or non-linear relationships
18
Q

What 2 computing requirements determine which ML approach to take?

A
  1. Training period available
  2. Production performance requirements