AI Practitioner Flashcards

1
Q

Model Artifacts

A

Artifacts produced during model training, consisting of trained parameters, model definition, and metadata, often stored in Amazon S3.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Inference Code

A

Software that implements the model by reading the model artifacts and making it deployable for inference tasks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Real-Time Inference

A

An inference type where an endpoint is always available to accept requests, suitable for low-latency, high-throughput tasks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Batch Inference

A

An inference type suitable for offline processing, where large amounts of data are processed upfront and a persistent endpoint is not needed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Supervised Learning

A

A machine learning style where models are trained on pre-labeled data, with both input and desired output specified.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Unsupervised Learning

A

A machine learning style that works with unlabeled data, focusing on recognizing patterns and grouping data into clusters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Reinforcement Learning

A

A machine learning method focused on autonomous decision-making by an agent, which learns through trial and error by receiving rewards for goal-oriented actions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Amazon SageMaker Ground Truth

A

A service from Amazon that helps label training data for supervised learning, often leveraging Amazon Mechanical Turk for crowdsourcing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Amazon Mechanical Turk

A

A crowdsourcing platform that provides access to a global pool of affordable labor, often used in labeling data for machine learning models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Clustering

A

A process in unsupervised learning where data is grouped based on patterns, useful in anomaly detection and pattern recognition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Anomaly Detection

A

A use case for unsupervised learning where irregularities in data, such as outliers, are identified for further analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Reinforcement Learning Agent

A

The entity in reinforcement learning that takes actions within an environment to achieve specific goals, learning through trial and error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

AWS DeepRacer

A

A reinforcement learning platform where users teach a model race car (the agent) to navigate a track (the environment) by taking actions to stay on course.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Exploratory Approach (Reinforcement Learning)

A

A learning approach where the agent explores actions without knowing the outcome, with successful actions being reinforced for goal achievement.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

End Goal (Reinforcement Learning)

A

A predetermined objective in reinforcement learning that the agent works towards by refining its actions through trial and error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Artificial Intelligence (AI)

A

The field of computer science dedicated to solving cognitive problems like learning, creation, and image recognition, aiming to create self-learning systems that derive meaning from data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Machine Learning (ML)

A

A branch of AI that focuses on using data and algorithms to imitate the way humans learn, gradually improving accuracy to make predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Deep Learning

A

A subset of machine learning inspired by the human brain, using layers of neural networks to recognize speech, images, and more.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Inference

A

A prediction made by an AI model, essentially an educated guess with a probabilistic result.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Regression Analysis

A

A technique used in AI to process historical time series data and predict future values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Natural Language Processing (NLP)

A

A branch of AI that allows machines to understand, interpret, and generate human language in a natural way.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Generative AI

A

AI technology capable of generating original content such as text, images, videos, and music, based on a given prompt.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Computer Vision

A

AI technology used to process images and video for tasks like object identification, classification, and facial recognition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Anomaly Detection

A

The process of recognizing deviations from expected patterns in data, often used in fraud detection or identifying system failures.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Amazon Bedrock

A

A platform that supports generative AI tasks such as generating content based on prompts, like creating song lyrics or stories.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Chatbot

A

An AI application that uses NLP to engage in conversations with users, often used in customer service or for booking systems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Time Series Data

A

Historical data points collected over time, which AI models use to identify patterns and make future predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Alexa

A

An AI-powered voice assistant that uses NLP to respond to user questions and perform tasks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Amazon SageMaker

A

A platform that allows developers to build, train, and deploy machine learning models at scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Fraud Detection

A

An AI application used by financial institutions to detect anomalous activity and prevent fraudulent transactions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Computer Vision Example

A

A model that can detect scratches on surfaces or missing components on circuit boards using image processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

HR AI Applications

A

AI used to process resumes and match candidates to job roles, improving hiring efficiency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Product Recommendation

A

An AI-driven system that uses shopping history to suggest products to customers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Discovery (AI Example)

A

A media platform using AI to recommend personalized content based on viewing history.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Customer Support Translation

A

An AI system that translates between languages in real-time during customer support interactions, such as from English to Spanish.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Taxi Demand Forecasting

A

An AI application that helps taxi companies position cars based on forecasted customer demand.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Pandemic Prediction

A

AI systems used by agencies like the CDC to predict outbreaks and pandemics, aiding in the distribution of resources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

AI in Manufacturing

A

The use of AI with computer vision to monitor assembly lines, maintain product quality, and predict equipment maintenance needs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

AI in Medical Diagnosis

A

AI used to read X-rays and scans to help doctors make faster and more accurate diagnoses.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Call Center Monitoring

A

An AI system that detects deviations in call volumes to identify issues, such as system outages.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Machine Learning (ML)

A

The science of developing algorithms and statistical models that enable computers to perform tasks without explicit instructions by identifying patterns in large datasets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

Inference

A

The process where a trained machine learning model uses new data to make predictions or generate output it hasn’t seen during training.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Structured Data

A

Data stored in a table format, such as text files (CSV) or relational databases (RDS, Redshift), with clearly defined rows and columns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

Semi-Structured Data

A

Data that doesn’t fully follow the tabular structure, like JSON files, where features are stored as key-value pairs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

Unstructured Data

A

Data that does not follow any structured model, such as images, videos, or social media posts, typically stored as objects in systems like Amazon S3.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

Time Series Data

A

Data that is labeled with timestamps and stored sequentially, often used for models that predict trends over time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Amazon S3 (Simple Storage Service)

A

Amazon’s storage service, which can store any type of data, offers lower cost, virtually unlimited storage capacity, and is a primary source for training data.

48
Q

Features

A

Input variables (columns in a table, pixels in an image) used to train machine learning models and correlate with expected outputs.

49
Q

Model Parameters

A

Internal variables in a machine learning model, adjusted during training to improve the model’s ability to produce accurate predictions.

50
Q

Linear Regression

A

A simple machine learning algorithm that models the relationship between inputs and outputs using a linear equation.

51
Q

Tokenization

A

A text processing technique that breaks down text into individual units such as words or phrases, commonly used for unstructured data.

52
Q

Training Data

A

Known data used to train a machine learning model, consisting of features (inputs) and possibly expected outputs.

53
Q

Amazon RDS (Relational Database Service)

A

Amazon’s service for structured data storage in relational databases, often used as a source for machine learning training data.

54
Q

Amazon Redshift

A

A fully managed data warehouse service from Amazon, used for analyzing large datasets stored in structured formats.

55
Q

Amazon DynamoDB

A

Amazon’s NoSQL database designed to handle semi-structured data, such as JSON files.

56
Q

Amazon DocumentDB

A

Amazon’s document database with MongoDB compatibility, used for semi-structured data in key-value pairs.

57
Q

Sampling Rate (Time Series Data)

A

The frequency at which data points are captured in time series data, which can impact data size and model training.

58
Q

Linear Equation

A

An equation in the form y=mx+b, or for machine learning, an equation used to model the relationship between independent and dependent variables (e.g., h=mw+b).

59
Q

Model Training

A

The process of adjusting model parameters to reduce errors and improve the accuracy of predictions using known data.

60
Q

Errors (in ML)

A

The differences between predicted outputs and actual data points, used to refine and improve machine learning models.

61
Q

Object Storage System

A

A storage system like Amazon S3 that holds unstructured data in the form of objects rather than tables.

62
Q

Machine Learning (ML)

A

The science of developing algorithms that allow computers to learn from data and make predictions without being explicitly programmed.

63
Q

Supervised Learning

A

A type of ML where the model is trained on labeled data to make predictions or classifications.

64
Q

Unsupervised Learning

A

A type of ML where the model is trained on unlabeled data to find hidden patterns or groupings.

65
Q

Reinforcement Learning

A

An ML method where an agent learns by interacting with its environment and receiving feedback in the form of rewards or penalties.

66
Q

Structured Data

A

Data that is organized in a tabular format with rows and columns, such as databases or CSV files.

67
Q

Semi-Structured Data

A

Data that does not have a strict tabular format but includes tags or markers to separate elements, such as JSON or XML.

68
Q

Unstructured Data

A

Data that does not have a predefined structure, such as images, videos, or social media posts.

69
Q

Amazon S3

A

Amazon’s cloud storage service that stores any type of data with virtually unlimited storage capacity.

70
Q

Inference

A

The process of making predictions or decisions using a trained ML model on new data.

71
Q

Regression

A

An ML task that predicts a numerical value based on input features.

72
Q

Classification

A

An ML task that assigns labels or categories to data points based on input features.

73
Q

Natural Language Processing (NLP)

A

A field of AI focused on the interaction between computers and human language, enabling tasks like translation and sentiment analysis.

74
Q

Bias

A

A systematic error in an ML model that occurs when certain data is over or underrepresented.

75
Q

Explainability

A

The ability to understand and interpret how an AI model makes its decisions.

76
Q

Computer Vision

A

A field of AI that enables computers to interpret and process visual data like images and videos.

77
Q

Generative AI

A

A type of AI that can create new content such as text, images, or music, based on learned data.

78
Q

Amazon SageMaker

A

A fully managed service that provides tools to build, train, and deploy machine learning models at scale.

79
Q

Time Series Data

A

Data that is indexed over time, often used for trend prediction in ML models.

80
Q

Deep Learning

A

A subset of ML that uses neural networks with multiple layers (deep neural networks) to process data and find patterns.

81
Q

Model Deployment

A

The process of putting an ML model into production so it can make real-world predictions or inferences.

82
Q

Amazon Lex

A

AWS service for building conversational interfaces like chatbots.

83
Q

Amazon Rekognition

A

AWS service for analyzing images and videos for tasks like object detection and facial recognition.

84
Q

Data Augmentation

A

A technique used to increase the amount of training data by creating modified versions of the original data.

85
Q

Hyperparameter Tuning

A

The process of optimizing the parameters that control how an ML algorithm learns.

86
Q

Amazon Lex

A

Chatbots, like Alexa. Conversational interfaces.

87
Q

Transcribe

A

Convert speech into text. Transcribe audio recording. Use a custom vocabulary for domain-specific terms.

88
Q

Comprehend

A

Natural Language Processing. Can identify the language used in text, extract key phrases, and analyze positive/negative sentiment.

89
Q

Textract

A

Extract data from scanned documents.

90
Q

Rekognition

A

Computer vision. Analyze images and videos.

91
Q

Kendra

A

Intelligent search service provider.

92
Q

Personalize

A

Realtime customer recommendations.

93
Q

DeepRacer

A

Race car hands on experience with reinforcement learning.

94
Q

CloudWatch

A

Gather and view metrics that relate to account resources. View the number of API calls to Amazon Bedrock.

95
Q

CloudTrail

A

Monitor and log API calls in AWS accounts.

96
Q

Amazon Augmented AI (A2I)

A

Monitoring and human reviews of model quality: Allow for human review of MLs. Automatically builds workflows to allow for human review without burden of building a whole system.

97
Q

AWS AI Service Cards

A

Part of responsible AI documentation. Provide transparency. Provide a single place to find information on intended use cases, design choices, deployment, optimization best practices. Basic concepts to help customers understand services and features, intended use cases and limitations, responsible AI design considerations, guidance on deployment/optimization.

98
Q

Amazon Q

A

AI assistant designed for business data. Generative AI virtual assistant that can answer questions, summarize content, generate content, and complete tasks based on the provided data. Does not provide access to FMs and is not open source.

99
Q

Amazon Q Developer

A

Formerly CodeWhisperer.

100
Q

SageMaker

A

Amazon’s machine learning creation service. Build, train, and deploy custom models. Bring your own machine learning algorithms. Managed API - model hosting services.

101
Q

SageMaker JumpStart

A

Pre-built foundation models for the most common use cases. Pre-trained models, open source models, FMs for summarization and audit use cases.

102
Q

SageMaker Clarify

A

FM evaluation: Use metrics such as accuracy, robustness, and toxicity to support your responsible AI initiative. Explainability: Provide insight into your overall model prediction process. Bias detection analysis: by selecting criteria like gender or age. Model Prediction Explanation: Integrated with SageMaker Experiments to provide insight into your overall model prediction process.

103
Q

SageMaker Model Monitor

A

Monitoring and human reviews: Monitor quality of models in production. Continuous/batch monitoring, on-schedule monitoring. Set alerts: Notify you when there are deviations in model quality.

104
Q

SageMaker Experiments

A

Provide insight into your overall model prediction process. Help you determine what the impact of incremental changes on model accuracy is.

105
Q

SageMaker Automatic Model Tuning

A

Hyperparameter tuning: Run many jobs with different hyperparameters and measure each of them.

106
Q

SageMaker Autopilot

A

Explainability: Provide insights into how ML models make predictions. Determines what contribution certain features or inputs have on the model’s output to understand why a model made a prediction after training or use it to provide per-instance explanation during inference.

107
Q

SageMaker Data Wrangler

A

Rebalance imbalanced data to avoid bias using: Random Undersampling, Random Oversampling, Synthetic Minority Oversampling Technique (SMOTE).

108
Q

SageMaker Feature Store

A

Create, share, and manage ML development.

109
Q

SageMaker Canvas

A

Low-code/No-code option to generate predictions without needing to write any code.

110
Q

SageMaker Model Cards

A

Governance: create records and document details about ML models in a single place. Intended use, risk ratings, training details. Transparency and Explainability: Provides comprehensive, immutable documentation of essential model information.

111
Q

SageMaker Role Manager

A

Governance: Admins can define user permissions. Control access and security levels for ML activities.

112
Q

SageMaker Model Dashboard

A

Governance: Keep your team informed on model behavior.

113
Q

SageMaker Ground Truth

A

Human-in-the-loop feedback. Incorporates human feedback across the ML lifecycle by ranking or classifying responses based on Reinforcement learning from human feedback (RLHF) for fine-tuning.

114
Q

Bedrock

A

Makes FMs available through an API. Privately customize FMs and deploy.

115
Q

Guardrails for Amazon Bedrom

A

Generate safeguards based on your use cases and responsible AI policies. Create multiple guardrails for different use cases. Control interactions between users and FMs. Filter undesirable/harmful content. Redact PII. Monitor and analyze user inputs and FM responses.