Machine Learning Flashcards

1
Q

Decision Tree

A

A type of machine learning model that makes predictions by applying a sequence of splits to the data. Decision trees are easy to understand, and they are the basic building block for some of the best models in data science. A tree with more “splits” can capture more factors; such trees are called “deeper” trees.
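
As a rough illustration, a decision tree can be written by hand as nested conditions. The sketch below is not a trained model: the feature names, thresholds, and prices are all hypothetical. It only shows how a deeper tree takes more factors into account than a shallow one.

```python
def shallow_tree(bedrooms):
    """Depth-1 tree: one split, two possible predictions."""
    if bedrooms <= 2:
        return 178_000
    return 188_000


def deeper_tree(bedrooms, lot_size_sqm):
    """Depth-2 tree: a second split also accounts for lot size."""
    if bedrooms <= 2:
        if lot_size_sqm <= 800:
            return 146_000
        return 188_000
    if lot_size_sqm <= 1100:
        return 170_000
    return 233_000


# Both trees take the same house; only the deeper one uses the lot size.
shallow_price = shallow_tree(bedrooms=3)
deeper_price = deeper_tree(bedrooms=3, lot_size_sqm=1200)
```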

2
Q

Fitting / Training

A

Capturing patterns from data. After the model has been fit, you can apply it to new data to predict or identify patterns.
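
To make "fitting" concrete, here is a minimal sketch in plain Python that fits the simplest possible tree (a single split) to a handful of made-up points, then applies it to a new value. The helper names `fit_stump` and `predict` are hypothetical, and real libraries use more elaborate procedures.

```python
def fit_stump(xs, ys):
    """Find the single split on x that minimizes squared error.

    Returns (threshold, left_mean, right_mean).
    """
    best = None
    # Try every observed value except the largest as a candidate threshold.
    for threshold in sorted(set(xs))[:-1]:
        left = [y for x, y in zip(xs, ys) if x <= threshold]
        right = [y for x, y in zip(xs, ys) if x > threshold]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        error = sum((y - left_mean) ** 2 for y in left)
        error += sum((y - right_mean) ** 2 for y in right)
        if best is None or error < best[0]:
            best = (error, threshold, left_mean, right_mean)
    return best[1:]


def predict(stump, x):
    """Apply a fitted stump to a new value."""
    threshold, left_mean, right_mean = stump
    return left_mean if x <= threshold else right_mean


# Made-up training data: x could be number of rooms, y a price.
xs = [1, 2, 3, 4]
ys = [100.0, 110.0, 200.0, 210.0]

stump = fit_stump(xs, ys)          # fitting: capture the pattern
new_prediction = predict(stump, 5) # applying the fitted model to new data
```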

3
Q

Training Data

A

The data used to fit the model.

4
Q

Leaf

A

The point at the bottom of a decision tree where we make a prediction.

5
Q

Pandas

A

The primary library data scientists use for exploring and manipulating data. Most people abbreviate pandas in their code as pd.
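
A short sketch of the `pd` convention in use, with one exploring step and one manipulating step. The housing numbers are made up for illustration.

```python
import pandas as pd

# Hypothetical housing data, made up for illustration.
homes = pd.DataFrame({
    "rooms": [2, 3, 4, 3],
    "price": [150_000, 200_000, 260_000, 210_000],
})

# Exploring: a summary statistic for one column.
average_price = homes["price"].mean()

# Manipulating: keep only the rows that satisfy a condition.
large_homes = homes[homes["rooms"] >= 3]
```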

6
Q

DataFrame

A

The pandas structure that holds the type of data you might think of as a table. It is similar to a sheet in Excel or a table in a SQL database.
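
One way to picture the table analogy is to build a small DataFrame from a dictionary, as sketched below. The column names and values are hypothetical.

```python
import pandas as pd

# Each dictionary key becomes a column; each list position becomes a row.
# The values are hypothetical.
table = pd.DataFrame({
    "city": ["Oslo", "Lima", "Pune"],
    "rooms": [2, 3, 4],
})

n_rows, n_columns = table.shape      # like the size of a spreadsheet
column_names = list(table.columns)   # like the header row
first_row = table.iloc[0]            # like one record in a SQL table
```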

7
Q

Prediction Target

A

The column we want to predict.
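
In pandas, the target is typically pulled out as a single column. The sketch below assumes a hypothetical `price` column; by convention the result is often named `y`.

```python
import pandas as pd

# Hypothetical data: we want to predict "price".
homes = pd.DataFrame({
    "rooms": [2, 3, 4],
    "lot_size": [300, 450, 600],
    "price": [150_000, 200_000, 260_000],
})

# Selecting one column returns a pandas Series.
y = homes["price"]
```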

8
Q

Features

A

The columns that are fed into our model (and later used to make predictions). Sometimes you will use all columns except the target as features. Other times you’ll be better off with fewer features.
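
Both choices can be sketched in pandas: selecting a hand-picked list of columns, or dropping only the target. The column names here are hypothetical; by convention the feature table is often named `X`.

```python
import pandas as pd

# Hypothetical data with "price" as the prediction target.
homes = pd.DataFrame({
    "rooms": [2, 3, 4],
    "lot_size": [300, 450, 600],
    "year_built": [1990, 2005, 2015],
    "price": [150_000, 200_000, 260_000],
})

# Option 1: a chosen subset of columns as features.
feature_names = ["rooms", "lot_size"]
X = homes[feature_names]

# Option 2: every column except the target.
X_all = homes.drop(columns=["price"])
```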

9
Q

Scikit-learn

A

The most popular Python library for modeling the types of data typically stored in DataFrames.
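
A minimal sketch of the usual scikit-learn workflow, assuming a decision tree regressor and made-up housing data: define the model, fit it on features and target, then predict. Predicting on the training rows, as done here, only demonstrates the calls and says nothing about accuracy on new data.

```python
import pandas as pd
from sklearn.tree import DecisionTreeRegressor

# Hypothetical training data.
homes = pd.DataFrame({
    "rooms": [2, 3, 4, 5],
    "lot_size": [300, 450, 600, 800],
    "price": [150_000, 200_000, 260_000, 330_000],
})
X = homes[["rooms", "lot_size"]]  # features
y = homes["price"]                # prediction target

# random_state fixes any randomness so repeated runs give the same tree.
model = DecisionTreeRegressor(random_state=0)
model.fit(X, y)

predictions = model.predict(X)
```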
