L1 - Intro to Machine Learning Flashcards

1
Q

What are the basic steps involved with machine learning?

A

Raw data converted to matrix data.
Matrix data used to produce a test and training set.
Test and training set fed into model.
Model is evaluated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is raw data?

A

Data that is unprocessed
Missing or incomplete observations
Improperly formatted data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Explain data types in the context of data preparation?

A

Transforms data so it can be used for ML
ML algorithm inputs must be certain data types (e.g. numeric)
Raw data often has many variables with different data types that will not work in a ML model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does machine learning “learn”?

A

You find historic data that has inputs and outputs (e.g. football teams as an input and total points as an output)
The model is trained by learning from the relationships between the inputs and outputs
When new data comes (without a prediction/output yet) this can be inputted into a trained model and predict values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is CRISP-DM?

A

The leading ML methodology used by industry.
Cross Industry Standard Processing for Data Mining.
Includes:
Business understanding; data understanding; data preparation; modelling; evaluation; deployment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Business understanding

A

Explore what your client wants to get from data mining

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Data understanding

A

Assess what data is available and verify their quality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Data preparation

A

Transform initial raw data into one suitable for modelling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Modelling

A

Select the appropriate modelling technique and tune the parameter settings to optimise the results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Evaluation

A

Evaluate the model in the context of the business goals and success criteria

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Deployment

A

Communicate results and suggestion implementation or actions based on findings

How well did you know this?
1
Not at all
2
3
4
5
Perfectly