Intro To Classification Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What is classification?

A

The prediction of objects by class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is binary classification?

A

The model is making a prediction between two classes, example dog or cat and yes or no

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is multiclass classification?

A

A model prediction between multiple classes, example: meat, dairy, fruit or vegetables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is balanced or unbalanced classification?

A

Evaluating how balanced the classes are such as both sides have equal amount if dogs and cats. The balance of dataset will help us interpret of classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is decision trees for classification?

A

Works similar to Regression except final prediction is not an average but the class itself, also called classification trees

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is false positives? (Type 1 errors)

A

Also known as false positives is a common error type, for example phrase like want to find employees but avoid bad hires(false positives) even turning away qualified people(false negatives)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is False negatives? (Type 2 errors)

A

Is another common error Type where we predict that something is negative when it is in fact, positive. Example a false negative for a covid test but in reality the person does have covid

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a confusion matrix?

A

A table that is often used to describe the performance of a classification model (or “classifier”) on a set of test data for which the true values are known.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is accuracy?

A

Accuracy is the most intuitive metric, in other worlds accuracy is correct predictions our model made out of the total number of precitions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Recall?

A

When we want to reduce the number of false negatives, we want to improve recall

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Precision?

A

When we want to reduce the numbers of the false positives, we want to improve precision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is class probabilities?

A

Calculates the probabilities of each class and returns the most likely

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Decision Thresholds?

A

When working with a binary classification problem, the model prediction will, by default, be the class whoe probability is greater than 0.5 or 50%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is K Nearest Neighbors (KNN)?

A

K-nearest neighbors is a model that uses the “k” most similar observations in order to make a prediction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the Advantages of KNN?

A

Easier to understand and explain than other machine learning algorithms. Can be used for classification or Regression or can be used for multi-class classification problems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the Disadvantages of KNN?

A

It must be store all of the data it was fit on. It’s prediction phase can be slow when the data it was “fit” on is large. Typically worse performance than other supervised learning methods

17
Q

What is Logistic Regression?

A

Logistic Regression uses something called a sigmoid function which bounds the output between 0 and 1

18
Q

What is Cross-Validation?

A

Cross-validation creates and trains multiple identical models using different subsets of the data

19
Q

What is Tuning Hyperparameters?

A

Choosing a set of optimal hyperparameters for learning algorithm. A hyperparameter is a model argument whose value is set before the learning process begins

20
Q

What is definition of ROC curve?

A

A plot of true positive rate (y axis) vs the false positive rate (x axis) for every possible classification threshold

21
Q

What is the point of regularization?

A

To reduce variance

22
Q

What is another name for ‘L1’ Regularization?

A

Lasso

23
Q

What do we want to do with the cost function?

A

Minimize the cost function