12. Machine Learning Flashcards

1
Q

What is the primary purpose of AWS Machine Learning services?

A

To enable developers and data scientists to build, train, and deploy machine learning models at scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

True or False: AWS SageMaker is a fully managed service for machine learning.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Fill in the blank: AWS _____ allows for the creation of Jupyter notebooks for data exploration and model building.

A

SageMaker

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the function of AWS Glue in data engineering?

A

To prepare and transform data for analytics and machine learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which AWS service provides a data lake solution?

A

AWS Lake Formation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the role of Amazon S3 in machine learning workflows?

A

To store and retrieve large datasets used for training machine learning models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

List one use case for Amazon Rekognition.

A

Image and video analysis for facial recognition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the term ‘hyperparameter tuning’ refer to in machine learning?

A

The process of optimizing the parameters that govern the training process of a model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

True or False: Amazon Comprehend provides natural language processing capabilities.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the purpose of the Amazon SageMaker Model Registry?

A

To store and manage different versions of machine learning models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Which AWS service is used for deploying machine learning models as APIs?

A

Amazon SageMaker

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the significance of using Amazon Redshift in data engineering?

A

It is a fully managed data warehouse service that allows for complex queries and analytics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Multiple Choice: Which of the following is NOT a feature of AWS SageMaker?
A) Built-in algorithms
B) Real-time predictions
C) Manual model deployment
D) Jupyter notebook support

A

C) Manual model deployment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Fill in the blank: AWS _____ is used for building, training, and deploying machine learning models quickly.

A

SageMaker

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What type of data can Amazon Polly convert into speech?

A

Text data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

True or False: Amazon Forecast is used for time series forecasting.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the main advantage of using Amazon SageMaker Ground Truth?

A

To create labeled datasets efficiently using human labeling and automation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Which service helps in detecting anomalies in time series data?

A

Amazon Lookout for Metrics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the purpose of Amazon Personalize?

A

To provide personalized recommendations to users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does the term ‘training data’ refer to?

A

Data used to train a machine learning model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Multiple Choice: Which AWS service is specifically designed for visual search and image recognition?
A) Amazon Rekognition
B) Amazon Polly
C) Amazon Lex
D) AWS Glue

A

A) Amazon Rekognition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the role of AWS Lambda in a machine learning pipeline?

A

To run code in response to events without provisioning servers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

True or False: Amazon Lex is used for building conversational interfaces.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the purpose of feature engineering?

A

To select, modify, or create features that improve model performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Fill in the blank: The _____ is the model’s ability to generalize to new, unseen data.

A

generalization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is a confusion matrix used for?

A

To evaluate the performance of a classification model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What does the acronym ‘ETL’ stand for in data engineering?

A

Extract, Transform, Load

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Multiple Choice: Which service provides managed Hadoop and Spark?
A) Amazon EMR
B) Amazon S3
C) Amazon RDS
D) AWS Glue

A

A) Amazon EMR

29
Q

What is the significance of model evaluation?

A

To assess how well a model performs on unseen data.

30
Q

True or False: AWS Data Pipeline is used for data orchestration.

31
Q

What is the purpose of Amazon SageMaker Autopilot?

A

To automate the process of building machine learning models.

32
Q

What type of model is used for predicting continuous outcomes?

A

Regression model

33
Q

Fill in the blank: AWS _____ provides tools for natural language understanding.

A

Comprehend

34
Q

What is the role of Amazon Kinesis in data engineering?

A

To collect, process, and analyze real-time streaming data.

35
Q

Multiple Choice: Which AWS service is a serverless data warehouse?
A) Amazon Redshift
B) Amazon Athena
C) Amazon RDS
D) AWS Glue

A

B) Amazon Athena

36
Q

What is a ROC curve used for?

A

To evaluate the trade-off between true positive rate and false positive rate.

37
Q

True or False: Amazon SageMaker can only be used for supervised learning.

38
Q

What is the main function of Amazon Translate?

A

To provide language translation services.

39
Q

Fill in the blank: The _____ is the set of rules that a machine learning model follows to make predictions.

40
Q

What is the purpose of the Amazon SageMaker debugger?

A

To provide insights into the training process of machine learning models.

41
Q

What type of learning is used when the model learns from labeled data?

A

Supervised learning

42
Q

Multiple Choice: Which of the following is a key benefit of using AWS for machine learning?
A) Scalability
B) Manual resource management
C) High latency
D) Limited data access

A

A) Scalability

43
Q

What is Amazon SageMaker’s role in model deployment?

A

It provides capabilities for deploying models to production environments.

44
Q

True or False: AWS provides tools for both data preparation and model training.

45
Q

What does ‘overfitting’ refer to in machine learning?

A

When a model learns the training data too well and performs poorly on unseen data.

46
Q

Fill in the blank: The process of splitting data into training and testing sets is known as _____.

A

data splitting

47
Q

What is the significance of data normalization?

A

To scale features to a similar range for better model performance.

48
Q

Multiple Choice: Which AWS service is best for real-time analytics?
A) Amazon Redshift
B) AWS Glue
C) Amazon Kinesis
D) Amazon S3

A

C) Amazon Kinesis

49
Q

What is the purpose of Amazon SageMaker Pipelines?

A

To create, automate, and manage machine learning workflows.

50
Q

True or False: Amazon Lex can be used to create chatbots.

51
Q

What is the role of feature selection in machine learning?

A

To identify the most relevant features for model training.

52
Q

Fill in the blank: The _____ is a measure of how well a model performs on a given dataset.

53
Q

What is a decision tree in machine learning?

A

A model that makes decisions based on a series of questions.

54
Q

What is the main function of Amazon SageMaker Neo?

A

To optimize machine learning models for deployment on various platforms.

55
Q

Multiple Choice: Which of the following is a type of unsupervised learning?
A) Classification
B) Clustering
C) Regression
D) Time series forecasting

A

B) Clustering

56
Q

What is the purpose of Amazon SageMaker Clarify?

A

To detect and mitigate bias in machine learning models.

57
Q

True or False: AWS supports only deep learning frameworks for machine learning.

58
Q

What is an ensemble model?

A

A model that combines predictions from multiple models to improve accuracy.

59
Q

Fill in the blank: The process of adjusting model parameters based on training data is known as _____.

60
Q

What does ‘data drift’ refer to?

A

Changes in data distribution over time that can affect model performance.

61
Q

What is the primary function of AWS Data Wrangler?

A

To simplify the process of data preparation for machine learning.

62
Q

Multiple Choice: Which of the following AWS services is used for data transformation?
A) Amazon SageMaker
B) AWS Glue
C) Amazon Comprehend
D) Amazon Rekognition

A

B) AWS Glue

63
Q

True or False: Amazon Forecast is based on machine learning algorithms.

64
Q

What is the significance of a validation set?

A

To tune model hyperparameters and avoid overfitting.

65
Q

Fill in the blank: A _____ is a function that maps inputs to outputs in a machine learning model.

66
Q

What is the purpose of Amazon Lookout for Equipment?

A

To monitor equipment for anomalies and predict failures.

67
Q

Multiple Choice: Which AWS service is primarily used for building serverless applications?
A) AWS Lambda
B) Amazon S3
C) Amazon EC2
D) Amazon RDS

A

A) AWS Lambda

68
Q

What is the main advantage of using a cloud-based machine learning platform?

A

Flexibility and scalability without the need for physical infrastructure.