Chapter 3 Flashcards

Question 1

Q

What is the role of a data engineer in AI/ML products?

Answer

A

To power the data flow needed for product success and maintain the ETL pipeline

ETL stands for Extract, Transform, Load, a process for data integration.

Question 2

Q

What does ETL stand for?

Answer

A

Extract, Transform, Load
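
The three ETL steps can be sketched in a few lines. This is a minimal illustration, not a production pipeline: the CSV text, the column names, and the `orders` table are made up for the example, and an in-memory SQLite database stands in for a real data warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical raw export standing in for a source system.
RAW_CSV = """order_id,amount,currency
1,10.50,usd
2,,usd
3,7.25,usd
"""

def extract(text):
    """Extract: read rows from the raw CSV text."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: drop rows with a missing amount and normalize types."""
    cleaned = []
    for row in rows:
        if not row["amount"]:
            continue  # skip incomplete records
        cleaned.append(
            (int(row["order_id"]), float(row["amount"]), row["currency"].upper())
        )
    return cleaned

def load(rows, conn):
    """Load: write the cleaned rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER, amount REAL, currency TEXT)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()

# Run one batch through the pipeline.
conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), conn)
total = conn.execute("SELECT COUNT(*), SUM(amount) FROM orders").fetchone()
```

Here `total` holds the row count and the summed amount of the loaded batch; the row with the missing amount is filtered out during the transform step.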

Question 3

Q

How often are ETL pipelines generally updated?

Answer

A

In batches and not in real time

Question 4

Q

What is a data pipeline that is updated continuously used for?

Answer

A

To provide real-time insights for dashboards used by internal business users

Question 5

Q

What is MLOps?

Answer

A

A practice that combines machine learning and operations to maintain AI systems

Question 6

Q

What does IaaS stand for?

Answer

A

Infrastructure as a Service

Question 7

Q

Why is strategizing and planning for AI adoption crucial?

Answer

A

To avoid technical debt and ensure sustainable implementation

Question 8

Q

What is model decay?

Answer

A

The decline in model performance over time due to changes in underlying data
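
One way to watch for model decay is to compare the model's error on recent batches of data against a baseline error measured at launch. The sketch below assumes mean absolute error as the metric and a relative tolerance of 50%; the function names, the tolerance, and the numbers are all illustrative.

```python
def mean_absolute_error(actual, predicted):
    """Average absolute difference between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def decayed_windows(windows, baseline_error, tolerance=0.5):
    """Return indices of time windows whose error exceeds the baseline
    by more than the given relative tolerance."""
    threshold = baseline_error * (1 + tolerance)
    flagged = []
    for index, (actual, predicted) in enumerate(windows):
        if mean_absolute_error(actual, predicted) > threshold:
            flagged.append(index)
    return flagged

# Made-up (actual, predicted) pairs for three consecutive time windows.
windows = [
    ([10, 12, 11], [10, 11, 11]),
    ([11, 13, 12], [11, 12, 11]),
    ([15, 18, 17], [11, 12, 11]),  # the underlying data has shifted here
]
flagged = decayed_windows(windows, baseline_error=0.5)
```

With these numbers only the last window is flagged, which would be the cue to investigate and possibly retrain.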

Question 9

Q

What is one deployment strategy involving a new model alongside an existing one?

Answer

A

Shadow deployment
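
A shadow deployment can be sketched as follows: every request is answered by the existing model, while the new model receives the same input and its output is only logged for later comparison. Both model functions and the log structure are hypothetical stand-ins.

```python
def current_model(x):
    """Hypothetical production model."""
    return 2.0 * x

def candidate_model(x):
    """Hypothetical new model running in shadow mode."""
    return 2.0 * x + 0.5

shadow_log = []

def serve(x):
    """Return the production prediction and silently record the candidate's."""
    live = current_model(x)
    try:
        shadow = candidate_model(x)
        shadow_log.append({"input": x, "live": live, "shadow": shadow})
    except Exception:
        # A failure in the shadow model must never affect the user.
        pass
    return live

responses = [serve(x) for x in [1.0, 2.0, 3.0]]
```

Users only ever see the values in `responses`; `shadow_log` is what the team would review before promoting the candidate.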

Question 10

Q

In A/B testing, what is the primary goal?

Answer

A

To compare the performance of two slightly different models

Question 11

Q

What is a gradual deployment strategy that tests new models on subsets of users called?

Answer

A

Canary deployment
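
A canary rollout needs a stable way to decide which users see the new model. One common approach, sketched here under the assumption of a 10% rollout, hashes the user ID into one of 100 buckets so the same user always gets the same model. The names `CANARY_PERCENT`, `bucket`, and `choose_model` are illustrative.

```python
import hashlib

CANARY_PERCENT = 10  # assumed share of users routed to the new model

def bucket(user_id):
    """Map a user ID to a stable bucket in the range 0..99."""
    digest = hashlib.sha256(user_id.encode("utf-8")).hexdigest()
    return int(digest, 16) % 100

def choose_model(user_id, canary_percent=CANARY_PERCENT):
    """Route users in the lowest buckets to the new model."""
    return "new" if bucket(user_id) < canary_percent else "current"

users = [f"user-{i}" for i in range(1000)]
assignments = {user: choose_model(user) for user in users}
share = sum(1 for model in assignments.values() if model == "new") / len(users)
```

Raising `canary_percent` step by step widens the rollout, and users already in the canary group stay in it because their bucket does not change.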

Question 12

Q

What platform does Databricks offer for managing the ML life cycle?

Answer

A

MLflow

Question 13

Q

What is the purpose of Google’s AI Platform?

Answer

A

To deploy production-level ML pipelines

Question 14

Q

What is Uber’s ML management tool called?

Answer

A

Michelangelo

Question 15

Q

What does Meta’s ML platform aim to achieve?

Answer

A

Reusability of ML algorithms and easy access to past projects

Question 16

Q

What service does Amazon provide for building and deploying ML models?

Answer

A

Amazon SageMaker

Question 17

Q

What tools did Airbnb use to orchestrate their ML platform?

Answer

A

Zipline, Redspot, DeepThought

Question 18

Q

What is the promise of AI rooted in?

Answer

A

Quantifying prediction and optimization

Question 19

Q

What percentage of Amazon’s sales come from their recommendation engine?

Question 20

Q

What is a smart strategy for implementing AI/ML projects?

Answer

A

Start small, apply to a clear business goal, and track effectiveness

Question 21

Q

What is essential for justifying investment in AI projects?

Answer

A

Communicating the strength and capabilities of AI

Question 22

Q

Through what process do we learn in AI/ML projects?

Answer

A

Iteration

Question 23

Q

What is the importance of iteration in learning?

Answer

A

Iteration builds confidence through successful task completion.

Question 24

Q

How does GE utilize AI for customer benefit?

Answer

A

GE offers cost savings to its customers.

Question 25

Q

What role does Highmark play in preventing future bottlenecks?

Answer

A

Highmark predicts fraud.

Question 26

Q

How did Amazon benefit from machine learning?

Answer

A

Amazon grew its revenues through ML.

Question 27

Q

What is the significance of AI in the context of the industrial revolution?

Answer

A

AI promises benefits to both companies and consumers.

Question 28

Q

What are the stages of the NPD cycle for AI/ML products?

Answer

A

Stages include discovery, define, design, implementation, marketing, training, and launch.

Question 29

Q

What is the focus during the discovery stage of NPD?

Answer

A

Identifying the market need and why AI should address it.

Question 30

Q

What is defined in the define stage of NPD?

Answer

A

Product requirements and screening ideas from the discovery stage.

Question 31

Q

What does the design stage of NPD involve?

Answer

A

Creating mockups and defining UI/UX elements.

Question 32

Q

What is the purpose of the implementation phase in NPD?

Answer

A

Materializing the planned product and achieving performance expectations.

Question 33

Q

What is a key consideration in marketing AI products?

Answer

A

Balancing communication about AI capabilities without overselling.

Question 34

Q

What is the focus of the training phase in NPD?

Answer

A

Training users and managing expectations regarding product performance.

Question 35

Q

What happens during the launch phase of NPD?

Answer

A

Officially releasing the product and assessing its performance against original metrics.

Question 36

Q

What is the Naive Bayes algorithm used for?

Answer

A

It is used for classification problems and treats each feature as independent of the others.

Question 37

Q

What does the Support Vector Machine (SVM) algorithm do?

Answer

A

It finds a boundary that separates data into two classes, then uses that boundary to classify future data points.

Question 38

Q

What is linear regression used for?

Answer

A

Predicting a continuous numerical value from one or more input variables.

Question 39

Q

What does logistic regression predict?

Answer

A

A future binary categorical state.
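
To show how a logistic regression turns inputs into a binary prediction, the sketch below applies the sigmoid function to a weighted sum of features and thresholds the resulting probability. The weights, bias, and 0.5 threshold are assumed for illustration; in practice the weights are learned from training data.

```python
import math

def sigmoid(z):
    """Squash any real number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def predict_proba(features, weights, bias):
    """Probability of the positive class for one observation."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(z)

def predict(features, weights, bias, threshold=0.5):
    """Binary state: 1 if the probability reaches the threshold, else 0."""
    return 1 if predict_proba(features, weights, bias) >= threshold else 0

# Hypothetical, already-fitted parameters for a two-feature model.
weights = [1.2, -0.8]
bias = -0.5
label = predict([2.0, 1.0], weights, bias)
```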

Question 40

Q

What is the function of decision trees in ML?

Answer

A

They predict both categorical and numerical values using a flowchart-like structure.

Question 41

Q

How does the random forest algorithm work?

Answer

A

It creates multiple decision trees from random samples of the data and combines their predictions, averaging them for numerical values or taking a majority vote for categories.

Question 42

Q

What is K-Nearest Neighbors (KNN) used for?

Answer

A

Predicting future values based on the characteristics of neighboring data points.
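
The idea can be sketched without any ML library: find the k training points closest to the query and average their values. This is a minimal regression variant using Euclidean distance; the data and the choice of k = 3 are made up for the example.

```python
import math

def knn_predict(train_X, train_y, query, k=3):
    """Average the target values of the k training points nearest to query."""
    distances = sorted(
        (math.dist(x, query), y) for x, y in zip(train_X, train_y)
    )
    nearest = distances[:k]
    return sum(y for _, y in nearest) / len(nearest)

# Made-up one-feature dataset.
train_X = [(1.0,), (2.0,), (3.0,), (10.0,)]
train_y = [1.0, 2.0, 3.0, 10.0]

prediction = knn_predict(train_X, train_y, query=(2.1,), k=3)
```

The distant point at 10.0 has no influence on this prediction because it is not among the three nearest neighbors.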

Question 43

Q

What does clustering aim to achieve in ML?

Answer

A

Finding patterns or clusters in data without supervision.

Question 44

Q

What is the purpose of Principal Component Analysis (PCA)?

Answer

A

Reducing the number of dimensions in large datasets while preserving as much of the information as possible.

Question 45

Q

What do deep learning models mimic?

Answer

A

The way the human brain processes information through layers.

Question 46

Q

What is the goal of the implementation phase in the NPD process?

Answer

A

Achieving optimal performance based on the defined metrics.

Question 47

Q

What are neural networks primarily used for?

Answer

A

Neural networks serve as the models that power AI/ML products.

Question 48

Q

What is the most important factor for AI/ML products?

Answer

A

Data accessibility.

Question 49

Q

What types of data might you initially start with for model training?

Answer

A

Third-party data or public data.

Question 50

Q

Why is partnering with customers important in AI/ML product development?

Answer

A

It helps build a product that can be successful with real-world data.

Question 51

Q

What is a potential risk of using pristine datasets for model training?

Answer

A

The model may perform poorly with real-world data it hasn’t seen before.

Question 52

Q

Why is having a variety of data crucial for model training?

Answer

A

To ensure good model performance, usability, and ethical behavior.

Question 53

Q

What is iterative hyperparameter tuning?

Answer

A

It involves repeatedly retraining a model with adjusted hyperparameter settings to improve its performance.
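
As a rough sketch of the loop, the code below tries several values of one hyperparameter (the number of neighbors in a simple nearest-neighbor model), scores each on held-out validation data, and keeps the best. The tiny dataset, the candidate values, and the use of mean absolute error are assumptions for illustration.

```python
def knn_predict(train, query, k):
    """Average the targets of the k training points closest to query."""
    nearest = sorted(train, key=lambda pair: abs(pair[0] - query))[:k]
    return sum(y for _, y in nearest) / k

def mean_abs_error(train, validation, k):
    """Validation error for a given value of the hyperparameter k."""
    errors = [abs(knn_predict(train, x, k) - y) for x, y in validation]
    return sum(errors) / len(errors)

# Made-up (feature, target) pairs.
train = [(1.0, 1.1), (2.0, 1.9), (3.0, 3.2), (4.0, 3.9), (5.0, 5.1), (6.0, 6.0)]
validation = [(2.5, 2.5), (4.5, 4.5)]

# One tuning pass: evaluate each candidate and keep the lowest error.
results = {k: mean_abs_error(train, validation, k) for k in (1, 2, 4, 6)}
best_k = min(results, key=results.get)
```

In a real project this pass would be repeated as new candidates, data, or metrics come in, which is what makes the tuning iterative.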

Question 54

Q

What informs ML engineers on how to tune hyperparameters?

Answer

A

Performance metrics and benchmarks from the define phase of the NPD process.

Question 55

Q

What are hyperparameters?

Answer

A

Settings chosen before training that define how a model functions and that are tuned to optimize its performance.

Question 56

Q

What is an example of a hyperparameter in a decision tree model?

Answer

A

Maximum depth allowed for the decision tree.

Question 57

Q

What is the coefficient of determination also known as?

Answer

A

R-squared.
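
R-squared can be computed directly from its standard definition: one minus the ratio of the residual sum of squares to the total sum of squares. The values below are made up and are not the results referred to elsewhere in this deck.

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot

# Illustrative values only.
actual = [3.0, 5.0, 7.0, 9.0]
predicted = [2.5, 5.5, 7.0, 9.5]
score = r_squared(actual, predicted)
```

A score of 1.0 means the predictions match the actual values exactly, while a model that always predicts the mean of the actual values scores 0.0.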

Question 58

Q

What was the R-squared value for the OLS regression model tested?

Question 59

Q

What R-squared value did the random forest model achieve?

Question 60

Q

What hyperparameter was used in the KNN model?

Answer

A

6 neighbors.

Question 61

Q

What score did the KNN model achieve?

Question 62

Q

What phenomenon occurs when a model performs exceptionally well on training data but poorly on new data?

Answer

A

Overfitting.
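
An extreme case makes the pattern easy to see: a "model" that simply memorizes its training examples scores perfectly on them but has nothing useful to say about inputs it has not seen. The memorizer, its fallback to the training mean, and the data are all contrived for illustration.

```python
def fit_memorizer(X, y):
    """Return a model that recalls training targets exactly and falls back
    to the training mean for any unseen input."""
    table = dict(zip(X, y))
    fallback = sum(y) / len(y)
    return lambda x: table.get(x, fallback)

def mean_abs_error(model, X, y):
    """Average absolute prediction error of model on (X, y)."""
    return sum(abs(model(x) - target) for x, target in zip(X, y)) / len(X)

# Made-up data that roughly follows y = 2x.
train_X, train_y = [1, 2, 3, 4], [2.1, 3.9, 6.2, 7.8]
test_X, test_y = [5, 6], [10.1, 11.9]

model = fit_memorizer(train_X, train_y)
train_error = mean_abs_error(model, train_X, train_y)
test_error = mean_abs_error(model, test_X, test_y)
```

The training error is zero while the test error is large, which is the gap to look for when a score seems too good to be true.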

Question 63

Q

What should you be suspicious of when a model gets very close to a perfect score?

Answer

A

That the model may not generalize well to new datasets.

Question 64

Q

What should AI/ML enthusiasts look for in model performance over time?

Answer

A

Incremental improvement in performance.

Answer 60

A

Moving forward to deployment.

Answer 61

A

[collaborative].