Fundamentals of Machine Learning I (Not Part of Certification) Flashcards

1
Q

Characteristics of Regression?

A
  • Predicts amount of something based on numerical values
  • Uses data splitting and penalty to get an effective mathematical function
  • Supervised
  • Metrics
    • MAE, MSE, RMSE, R^2
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is binary classification + characteristics

A
  • Prediction of one of two classes ex(diabetic to not diabetic)
  • Supervised
  • Uses probability
  • Classifies probability with sigmoid curve
  • Logistic Regression is a binary classifier
  • made with random subset of data
  • Confusion Matrixes
  • F1, Recall, Precision, AUC, TPR, FPR, ROC
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is RMSE

A

RMSE (Root Mean Squared Error) Used to measure number of
incorrect predictions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is MAE

A

MAE (Mean Absolute Error) Calculated with mean error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is F1

A

F1 Score

  • Harmonic mean of Precision and Recall.
  • Formula: 2 * (Precision * Recall) / (Precision + Recall).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Recall

A

Recall (Sensitivity, TPR)

  • Proportion of actual positives correctly identified by the model.
  • Formula: TP / (TP + FN).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is AUC

A

AUC (Area Under the Curve)

  • Measures overall classification performance as the area under the ROC curve.
  • Values range from 0 to 1, with 1 being perfect and 0.5 representing random guessing.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is TPR

A

TPR (True Positive Rate, Recall)

  • Proportion of actual positives classified as positive.
  • Formula: TP / (TP + FN).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Precision

A

Precision

  • Proportion of predicted positives that are actual positives.
  • Formula: TP / (TP + FP).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is FPR

A

FPR (False Positive Rate)

  • Proportion of actual negatives incorrectly classified as positive.
  • Formula: FP / (FP + TN).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is ROC

A

ROC (Receiver Operating Characteristic) Curve

  • Plots TPR vs. FPR across different thresholds.
  • Used to assess model performance over varying decision boundaries.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is R^2

A

R^2 (Coefficient of determination) Used to measure variance in
data to calculate the fit of the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is MSE

A

MSE (Mean squared error) Mean of error amount squared. Used
to amplify the error amount

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is multiclass classification?

A

Multiclass classification is used to predict which of multiple possible classes an observation belongs to. It calculates probability values for each class label and predicts the most probable class.

(Supervised)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the two types of algorithms used in multiclass classification?

A

The two types of algorithms are:

One-vs-Rest (OvR): Trains a binary classification function for each class.
Multinomial: Creates a single function that returns a probability distribution for all possible classes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How does the One-vs-Rest (OvR) algorithm work?

A

The OvR algorithm trains separate binary classification functions for each class. Each function calculates the probability that an observation belongs to a specific class compared to all others, and the class with the highest probability is predicted.

17
Q

What is a multinomial algorithm, and how does it work?

A

A multinomial algorithm creates a single function that returns a vector with probability values for each class. These values add up to 1, and the class with the highest probability is predicted. An example is the softmax function.

18
Q

How can you evaluate a multiclass classification model?

A

You can evaluate by calculating binary classification metrics for each class or aggregate metrics across all classes. Metrics like accuracy, recall, precision, and F1-score can be derived from a multiclass confusion matrix.

19
Q

How are binary classification metrics used in evaluating multiclass classification models?

A

In multiclass classification, binary metrics such as accuracy, recall, precision, and F1-score can be calculated for each individual class. These are derived from the confusion matrix, where True Positives (TP), False Positives (FP), True Negatives (TN), and False Negatives (FN) are recorded for each class. Aggregate metrics can then be calculated for overall model evaluation.

20
Q

What metrics can be derived from a multiclass confusion matrix?

A

Metrics derived from a multiclass confusion matrix include:

Accuracy, Recall, Precision, F1-Score

21
Q

Define Unsupervised Learning

A

Unsupervised learning uses data without labels to find patterns or groupings. Examples: clustering, dimensionality reduction.

22
Q

Define Supervised Learning

A

Supervised learning uses labeled data (features and known outcomes) to train a model to make predictions. Examples: classification, regression.

23
Q

What is clustering in machine learning?

A

Clustering is an unsupervised learning method where observations are grouped into clusters based on similarities in their features, without using labels.

24
Q

Why is clustering considered unsupervised learning?

A

Clustering is unsupervised because it doesn’t rely on known label values. Instead, it groups data points based solely on feature similarities.

25
Q

What is an example of clustering?

A

In a flower dataset with features like the number of leaves and petals, clustering groups similar flowers based on these features without knowing their species.

26
Q

What is the centroid process in K-Means clustering?

A
  1. Initialization: Choose 𝑘 clusters and randomly select 𝑘 initial centroids.
  2. Assignment Step: Calculate distances from each point to centroids, assigning points to the nearest centroid.
  3. Update Step: Recalculate centroids by finding the mean of assigned points.
  4. Repeat: Iterate the assignment and update steps until centroids stabilize or a maximum number of iterations is reached.
27
Q

What is K-Means clustering?

A

K-Means is a clustering algorithm that assigns data points to clusters by minimizing the distance to the cluster’s centroid, repeatedly adjusting centroids until stable clusters are formed.

28
Q

How do you evaluate a clustering model?

A

Clustering models are evaluated by how well the clusters are separated, using metrics like average distance to centroid, maximum distance, and silhouette score

29
Q

What is a silhouette score in clustering?

A

A silhouette score measures cluster separation, ranging from -1 to 1, where values closer to 1 indicate better-defined clusters.

30
Q

What metrics are used to evaluate clusters?

A

Metrics include average distance to cluster center, average distance to other centers, maximum distance to center, and silhouette score.

31
Q

What is a Centroid

A

A centroid is the central point of a cluster, representing the mean position of all data points assigned to that cluster in the feature space. It serves as the reference for assigning new points to clusters in algorithms like K-Means.