Week 1 Flashcards

1
Q

What is ‘Machine Learning’

A

a subset of artificial intelligence that enables systems to learn from data, identify patterns, and make decisions with minimal human intervention. This process is similar to how humans learn from their experiences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the 3 foundations of Machine Learning?

A

data, algorithms, and model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Data?

A

data fuels machine learning. It encompasses different forms like numbers, text, images, or audio.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Algorithms?

A

an algorithm is the logical procedure used to process data and learn patterns to make predictions or decisions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Models?

A

models are the end products of the machine learning process. It represents what the algorithm has learned from the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the 3 types of Machine Learning?

A

Supervised Learning, Unsupervised Learning, and Reinforcement Learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is ‘Supervised Learning’?

A

in supervised learning, the algorithm is trained using a dataset containing input features and their corresponding output labels. This is similar to a teacher providing examples with answers to a student, who learns to predict answers for new examples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the 2 types of ‘Supervised Learning’

A

Regression and Classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Regression?

A

predicts a continuous numerical value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Classification?

A

predicts a categorical value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is ‘Unsupervised Learning’

A

are trained on datasets with only input features provided. The algorithm must discover patterns or structures within the data. It resembles a learner organizing classmates without instructions. The learner observes similarities, such as clothing or backpacks, and forms groups based on patterns they discover, just as an algorithm finds structure in unlabeled data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the 2 types of ‘Unsupervised Learning’

A

Clustering and Dimensionality Reduction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Clustering?

A

groups similar data points together.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is ‘Dimensionality Reduction’

A

is the process of simplifying a dataset by reducing the number of features (or dimensions) while still keeping the most important information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is ‘Reinforcement Learning’

A

reinforcement learning algorithms learn through interaction with an environment, receiving rewards or penalties based on their actions.
The algorithm learns to improve decisions based on feedback.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the 5 steps in making ‘Machine Learning Models’

A
  1. Data Collection and Preparation
  2. Algorithm Selection
  3. Model Training
  4. Model Evaluation
  5. Model Deployment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Give examples of ‘Sypervised Learning Models’

A

Linear Regression, Logistic Regression, Decision Trees, and Random Forest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is ‘Linear Regression’

A

predicts a continuous output based on input features.

ex. predicting the cost of a new house based on house’s size.

19
Q

What is ‘Logistic Regression’

A

classifies data into discrete categories based on input features.

ex. classifying if an email is span or not spam, based on the presence of specific keywords.

20
Q

What is ‘Decision Trees’

A

makes decisions by splitting the data based on feature values.

ex. predicting if a new customer will purchase a product based on his age and income.

21
Q

What is ‘Random Forest’

A

a group of decision trees that improves accuracy and reduces overfitting.

22
Q

Give examples of ‘Unsupervised Learning Model’

A

K-Means Clustering, Hierarchial Clustering, and Principal Component Analysis(PCA)

23
Q

What is ‘K-Means Clustering’

A

divides data into non-overlapping clusters based on similarity.

ex. groups customer into distinct clusters based on their spending patterns.

24
Q

What is ‘Hierarcial Clustering’

A

builds a tree-like hierarchy of clusters.

25
Q

What is the difference between ‘K-Means Clustering’ and ‘Hierarchy Clustering’

A

K-Means Clustering: Groups data into a fixed number of clusters (
𝐾
K) by assigning points to the nearest cluster center. It’s fast and works well for large datasets but requires you to choose the number of clusters in advance.

Hierarchical Clustering: Creates a tree-like structure of clusters by merging or splitting data points step-by-step. It’s better for smaller datasets and doesn’t require choosing the number of clusters upfront, but it’s slower and harder to use for large data.

26
Q

What is ‘Principal Component Analysis’

A

reduces the dimensionality of data while retaining as much variance as possible.

27
Q

Give examples of ‘Reinforcement Learning Models’

A

Q-Learning and Deep Q-Networks (DQN).

27
Q

What is ‘Q-Learning’

A

bases its actions in a specific situations to maximize its rewards over time.

28
Q

What is ‘Deep Q-Networks’

A

an advanced version of Q-Learning that can handle more complex and large-scale environments, such as video games.

29
Q

What is ‘Data Analytics’

A

the process of examining data to extract insights and make informed decisions.

29
Q

What are the 4 main types of Data Analytics

A

Descriptive Analytics. Diagnostic Analysis, Predictive Analytics, and Prescriptive Analytics

30
Q

What is ‘Descriptive Analytics’

A

focuses on summarizing and describing past data. It helps to understand what has happened.

31
Q

What is ‘Diagnostic Analytics’

A

explores the underlying causes of events or trends. It helps to understand why something happened.

32
Q

What is ‘Predictive Analytics’

A

uses historical data to predict future outcomes. It helps to anticipate what will happen.

33
Q

What is ‘Prescriptive Analytics’

A

suggests optimal actions based on data analysis and predictions. It helps determine what should be done.

34
Q

What is ‘Exploratory Data Analysis’?

A

Exploratory Data Analysis (EDA) is the process of analyzing and visualizing data to understand its patterns, trends, and relationships before applying formal modeling techniques. It’s like taking a first look at your data to see what’s interesting or unusual.

It involves ‘Summarizing Data’, ‘Visualizing Data’, ‘Checking Relationsips’ and ‘Spotting Issues’

EDA is like getting to know your data before diving into complex analysis. It’s the “exploration” phase where you clean up, visualize, and make sense of the data to guide your next steps.

34
Q

What are the 6 steps in Data Analytics?

A
  1. Define the Problem Statement.
  2. Collect the data.
  3. Clean the Collected Data.
  4. Analyze and Interpret the Cleaned Data.
  5. Visualize the Interpreted Data.
  6. Present the Analysis Results.
35
Q

What are the key characteristics of ‘Descriptive Analytics’

A

Data Summarization, Trend Identification, and Performance Metrics.

36
Q

What are the key characteristics of ‘Diagnostic Analytics’

A

Root Cause Analysis, Data Drill-Down, Correlation and Causality, and Hypothesis Testing.

37
Q

What are the key features of ‘Prescriptive Analytics’

A

Actionable Recommendations, Optimization, Predictive Models, Simulation and Scenario Analysis, and Complex Algorithms and Machine Learning.

38
Q

What are the key features of ‘Predictive Analytics’

A

Forecasting, Pattern Recognition, Probability Scores, Use of Advanced Algorithms, Scenario Testing.

39
Q

What are the techniques used in ‘Predictive Analytics’

A

Regression Analysis, Time Series Analysis, Machine Learning Models, Classification Models, and Clustering.

40
Q

What is ‘Regression Analysis’

A

a statistical method usd to determine the relationship between a dependent variable and one or more independent variable.

41
Q
A