Week 1 Flashcards

Question 1

Q

What is ‘Machine Learning’?

Answer

A

a subset of artificial intelligence that enables systems to learn from data, identify patterns, and make decisions with minimal human intervention. This process is similar to how humans learn from their experiences.

Question 2

Q

What are the 3 foundations of Machine Learning?

Answer

A

Data, algorithms, and models.

Question 3

Q

What is Data?

Answer

A

data fuels machine learning. It encompasses different forms like numbers, text, images, or audio.

Question 4

Q

What is an Algorithm?

Answer

A

an algorithm is the logical procedure used to process data and learn patterns to make predictions or decisions.

Question 5

Q

What is a Model?

Answer

A

a model is the end product of the machine learning process. It represents what the algorithm has learned from the data.

Question 6

Q

What are the 3 types of Machine Learning?

Answer

A

Supervised Learning, Unsupervised Learning, and Reinforcement Learning.

Question 7

Q

What is ‘Supervised Learning’?

Answer

A

in supervised learning, the algorithm is trained using a dataset containing input features and their corresponding output labels. This is similar to a teacher providing examples with answers to a student, who learns to predict answers for new examples.

Question 8

Q

What are the 2 types of ‘Supervised Learning’?

Answer

A

Regression and Classification

Question 9

Q

What is Regression?

Answer

A

predicts a continuous numerical value.

Question 10

Q

What is Classification?

Answer

A

predicts a categorical value.

Question 11

Q

What is ‘Unsupervised Learning’?

Answer

A

in unsupervised learning, algorithms are trained on datasets with only input features provided and no output labels. The algorithm must discover patterns or structures within the data on its own. It resembles a learner organizing classmates without instructions: the learner observes similarities, such as clothing or backpacks, and forms groups based on the patterns they discover, just as an algorithm finds structure in unlabeled data.

Question 12

Q

What are the 2 types of ‘Unsupervised Learning’?

Answer

A

Clustering and Dimensionality Reduction.

Question 13

Q

What is Clustering?

Answer

A

groups similar data points together.

Question 14

Q

What is ‘Dimensionality Reduction’?

Answer

A

the process of simplifying a dataset by reducing the number of features (or dimensions) while still keeping the most important information.

Question 15

Q

What is ‘Reinforcement Learning’?

Answer

A

reinforcement learning algorithms learn through interaction with an environment, receiving rewards or penalties based on their actions.
The algorithm learns to improve decisions based on feedback.

Question 16

Q

What are the 5 steps in making ‘Machine Learning Models’?

Answer

A

Data Collection and Preparation
Algorithm Selection
Model Training
Model Evaluation
Model Deployment

Question 17

Q

Give examples of ‘Supervised Learning Models’

Answer

A

Linear Regression, Logistic Regression, Decision Trees, and Random Forest

Question 18

Q

What is ‘Linear Regression’?

Answer

A

predicts a continuous output as a linear function of the input features.

ex. predicting the cost of a new house based on the house’s size.
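
A minimal sketch of the house-price example, assuming a single feature and ordinary least squares. The function name `fit_line` and the sizes and prices are made up for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one feature: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance(x, y) / variance(x)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Invented training data: house size (square metres) and price (thousands).
sizes = [50, 80, 100, 120]
prices = [150, 240, 300, 360]

slope, intercept = fit_line(sizes, prices)

# Predict the price of a new 90 square metre house from the fitted line.
predicted_price = slope * 90 + intercept
print(predicted_price)
```

The output is a number on a continuous scale, which is what makes this a regression task rather than a classification task.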

Question 19

Q

What is ‘Logistic Regression’?

Answer

A

classifies data into discrete categories based on input features.

ex. classifying whether an email is spam or not spam, based on the presence of specific keywords.
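
A rough sketch of the spam example, assuming one input feature (the number of spam keywords in an email) and plain gradient descent on the logistic loss. The data, the learning rate, and the helper names are illustrative assumptions, not a reference implementation.

```python
import math


def sigmoid(z):
    """Squash any real number into a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def train(counts, labels, learning_rate=0.1, epochs=1000):
    """Fit a weight and bias with batch gradient descent on the logistic loss."""
    weight, bias = 0.0, 0.0
    n = len(counts)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in zip(counts, labels):
            error = sigmoid(weight * x + bias) - y
            grad_w += error * x
            grad_b += error
        weight -= learning_rate * grad_w / n
        bias -= learning_rate * grad_b / n
    return weight, bias


# Invented data: keyword count per email, and 1 = spam, 0 = not spam.
keyword_counts = [0, 1, 0, 4, 5, 3]
is_spam_labels = [0, 0, 0, 1, 1, 1]
weight, bias = train(keyword_counts, is_spam_labels)


def spam_probability(count):
    return sigmoid(weight * count + bias)


def is_spam(count):
    # Turn the probability into a discrete category with a 0.5 threshold.
    return spam_probability(count) >= 0.5


print(is_spam(5), is_spam(0))
```

The model outputs a probability, and the threshold turns it into one of two categories. That is why logistic regression counts as classification despite its name.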

Question 20

Q

What is a ‘Decision Tree’?

Answer

A

makes decisions by splitting the data based on feature values.

ex. predicting whether a new customer will purchase a product based on their age and income.
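
A trained decision tree can be read as nested if/else splits on feature values. The sketch below hand-writes such a tree for the customer example. The thresholds are invented for illustration, whereas a real tree would learn them from data.

```python
def will_purchase(age, income):
    """Hand-written decision tree; every threshold is a made-up assumption."""
    # First split: income.
    if income < 30000:
        return False
    # Second split (only reached for higher incomes): age.
    if age < 25:
        # Third split: younger customers buy only at a higher income.
        return income >= 60000
    return True


print(will_purchase(age=40, income=50000))
print(will_purchase(age=20, income=40000))
```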

Question 21

Q

What is ‘Random Forest’?

Answer

A

an ensemble of decision trees whose combined predictions improve accuracy and reduce overfitting compared with a single tree.

Question 22

Q

Give examples of ‘Unsupervised Learning Models’

Answer

A

K-Means Clustering, Hierarchical Clustering, and Principal Component Analysis (PCA)

Question 23

Q

What is ‘K-Means Clustering’?

Answer

A

divides data into non-overlapping clusters based on similarity.

ex. grouping customers into distinct clusters based on their spending patterns.
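
A minimal sketch of the customer-spending example, assuming one-dimensional data and K = 2. The spending values and the starting centers are invented, and `k_means_1d` is a hypothetical helper name.

```python
def k_means_1d(values, centers, iterations=10):
    """Plain k-means on 1-D data: assign to nearest center, then re-average."""
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        # Assignment step: each value joins the cluster with the nearest center.
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Update step: each center moves to the mean of its cluster
        # (an empty cluster keeps its previous center).
        centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters


# Invented monthly spending per customer.
spending = [10, 12, 15, 90, 95, 100]
centers, clusters = k_means_1d(spending, centers=[10, 100])
print(clusters)
```

Each customer lands in exactly one cluster, which is what "non-overlapping" means here. The number of clusters has to be chosen before the algorithm runs.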

Question 24

Q

What is ‘Hierarchical Clustering’?

Answer

A

builds a tree-like hierarchy of clusters.

Question 25

Q

What is the difference between ‘K-Means Clustering’ and ‘Hierarchical Clustering’?

Answer

A

K-Means Clustering: Groups data into a fixed number of clusters (K) by assigning points to the nearest cluster center. It’s fast and works well for large datasets but requires you to choose the number of clusters in advance.

Hierarchical Clustering: Creates a tree-like structure of clusters by merging or splitting data points step-by-step. It’s better for smaller datasets and doesn’t require choosing the number of clusters upfront, but it’s slower and harder to use for large data.

Question 26

Q

What is ‘Principal Component Analysis’?

Answer

A

reduces the dimensionality of data while retaining as much variance as possible.

Question 27

Q

Give examples of ‘Reinforcement Learning Models’

Answer

A

Q-Learning and Deep Q-Networks (DQN).

Question 28

Q

What is ‘Q-Learning’?

Answer

A

learns which action to take in each specific situation (state) in order to maximize its rewards over time.
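
A small sketch of the standard Q-Learning update, assuming a table of action values stored in nested dictionaries. The state names, action names, learning rate `alpha`, and discount factor `gamma` are illustrative choices.

```python
def q_update(q, state, action, reward, next_state, alpha=0.5, gamma=0.9):
    """Move Q(state, action) toward reward + gamma * best value of next_state."""
    best_next = max(q[next_state].values())
    target = reward + gamma * best_next
    q[state][action] += alpha * (target - q[state][action])


# Hypothetical two-state world; values start at 0 except one known good move.
q = {
    "A": {"left": 0.0, "right": 0.0},
    "B": {"left": 0.0, "right": 1.0},
}

# The agent moved right from A, got no immediate reward, and arrived in B.
q_update(q, "A", "right", reward=0.0, next_state="B")
print(q["A"]["right"])
```

Even with no immediate reward, the value of moving right from A rises because B is known to lead to a reward. This is how the table comes to favor actions that pay off over time.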

Question 29

Q

What are ‘Deep Q-Networks’?

Answer

A

an advanced version of Q-Learning that uses a deep neural network to estimate action values, so it can handle more complex and large-scale environments, such as video games.

Question 30

Q

What is ‘Data Analytics’?

Answer

A

the process of examining data to extract insights and make informed decisions.

Question 31

Q

What are the 4 main types of Data Analytics?

Answer

A

Descriptive Analytics, Diagnostic Analytics, Predictive Analytics, and Prescriptive Analytics.

Question 32

Q

What is ‘Descriptive Analytics’?

Answer

A

focuses on summarizing and describing past data. It helps to understand what has happened.
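
As a quick illustration of summarizing past data, the sketch below computes a few summary statistics with Python's standard `statistics` module. The monthly sales figures are invented.

```python
import statistics

# Invented past data: units sold in each of the last five months.
monthly_sales = [120, 135, 150, 110, 160]

summary = {
    "total": sum(monthly_sales),
    "mean": statistics.mean(monthly_sales),
    "median": statistics.median(monthly_sales),
    "lowest": min(monthly_sales),
    "highest": max(monthly_sales),
}
print(summary)
```

Every number here describes what already happened. Nothing is predicted or explained.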

Question 33

Q

What is ‘Diagnostic Analytics’?

Answer

A

explores the underlying causes of events or trends. It helps to understand why something happened.

Question 34

Q

What is ‘Predictive Analytics’?

Answer

A

uses historical data to predict future outcomes. It helps to anticipate what will happen.

Question 35

Q

What is ‘Prescriptive Analytics’?

Answer

A

suggests optimal actions based on data analysis and predictions. It helps determine what should be done.

Question 36

Q

What is ‘Exploratory Data Analysis’?

Answer

A

Exploratory Data Analysis (EDA) is the process of analyzing and visualizing data to understand its patterns, trends, and relationships before applying formal modeling techniques. It’s like taking a first look at your data to see what’s interesting or unusual.

It involves ‘Summarizing Data’, ‘Visualizing Data’, ‘Checking Relationships’, and ‘Spotting Issues’.

EDA is like getting to know your data before diving into complex analysis. It’s the “exploration” phase where you clean up, visualize, and make sense of the data to guide your next steps.

Question 37

Q

What are the 6 steps in Data Analytics?

Answer

A

Define the Problem Statement.
Collect the data.
Clean the Collected Data.
Analyze and Interpret the Cleaned Data.
Visualize the Interpreted Data.
Present the Analysis Results.

Question 38

Q

What are the key characteristics of ‘Descriptive Analytics’?

Answer

A

Data Summarization, Trend Identification, and Performance Metrics.

Question 39

Q

What are the key characteristics of ‘Diagnostic Analytics’?

Answer

A

Root Cause Analysis, Data Drill-Down, Correlation and Causality, and Hypothesis Testing.

Question 40

Q

What are the key features of ‘Prescriptive Analytics’?

Answer

A

Actionable Recommendations, Optimization, Predictive Models, Simulation and Scenario Analysis, and Complex Algorithms and Machine Learning.

Question 41

Q

What are the key features of ‘Predictive Analytics’?

Answer

A

Forecasting, Pattern Recognition, Probability Scores, Use of Advanced Algorithms, and Scenario Testing.

Question 42

Q

What are the techniques used in ‘Predictive Analytics’?

Answer

A

Regression Analysis, Time Series Analysis, Machine Learning Models, Classification Models, and Clustering.

Question 43

Q

What is ‘Regression Analysis’?

Answer

A

a statistical method used to determine the relationship between a dependent variable and one or more independent variables.
