8.1 Intro to Machine learning Flashcards

1
Q

Descriptive Analysis Recap of Univariate Analysis and Bivariate Analysis

A
  • Univariate Analysis examines one variable at a time (e.g., distribution, skewness, subgroups).
  • Bivariate Analysis looks at relationships between two variables (e.g., correlation strength and direction).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is using predictive anaylsis important ?

A
  • Patterns in data might indicate future trends.
  • Predictive analytics takes this a step further by using past data to train models that predict unknown values for new observations.
  • This process is a fundamental part of Machine Learning—where algorithms learn from data to make informed predictions.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does predictive analytics help us understand simply ?

A
  • predictive analytics helps us move beyond understanding “what has happened” to anticipating “what is likely to happen next” using machine learning models.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are columns in a dataset called in machine learning?

A
  • Variables (Independent Variables) – Inputs that might influence the target.
  • Factors – Often used in categorical data (e.g., gender, yes/no responses).
  • Features – A common term in machine learning for inputs used to make predictions.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the column that we want to predict called?

A
  • Target Variable – What the model aims to predict.
  • Predicted Variable – The value we want the model to generate.
  • Dependent Variable – The output that depends on the independent variables.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are rows in a dataset called in machine learning ?

A
  • Samples – Often used in statistics and machine learning.
  • Observations – Common term in research and analysis.
  • Records – Used in databases and spreadsheets.
  • Instances – Common in machine learning, referring to each real-world example.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Supervised learning ?

A
  • We already have examples where we know the correct answer.

The model learns from past data where inputs (features) are linked to known outputs (target).

E.g what height will my child be by age 10 ?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Regression in Machine Learning ?

A
  • A type of Supervised Learning where we predict a number (numeric value) based on labeled data.

Examples:

  • Predicting how far a car can drive on a full tank.
  • Estimating a child’s height at age 10.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is unsupervised learning ?

A
  • We do not have the correct answers (no labels).
  • The model finds patterns, clusters, or groupings in the data by itself.

e.g how many objects are in an image ?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Classification in Machine Learning ?

A
  • A type of Supervised Learning where we predict a category (label) based on labeled data.

Examples:

  • Deciding what jacket to wear based on the weather.
  • Determining if a person is infected (Yes/No).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Clustering in Machine Learning ?

A
  • Clustering (Unsupervised Learning) means the machine groups things that are similar without being told what those groups are.

Examples:

  • Finding groups of customers who respond well to marketing.
  • Identifying different species of bacteria that are resistant to antibiotics.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do I remember the difference between Regression & Classification ?

A

✅ Regression = Predicts numbers (How much? How far? How tall?)
✅ Classification = Predicts categories (Yes/No, Red/Blue, Disease/No Disease)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do I remember the difference between Supervised & Unsupervised Learning ?

A

✅ Supervised = We already have labeled data (we know the answers).
✅ Unsupervised = No labels, we find patterns & groupings.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly