AI Practitioner Flashcards
Model Artifacts
Artifacts produced during model training, consisting of trained parameters, model definition, and metadata, often stored in Amazon S3.
Inference Code
Software that implements the model by reading the model artifacts and making it deployable for inference tasks.
Real-Time Inference
An inference type where an endpoint is always available to accept requests, suitable for low-latency, high-throughput tasks.
Batch Inference
An inference type suitable for offline processing, where large amounts of data are processed upfront and a persistent endpoint is not needed.
Supervised Learning
A machine learning style where models are trained on pre-labeled data, with both input and desired output specified.
Unsupervised Learning
A machine learning style that works with unlabeled data, focusing on recognizing patterns and grouping data into clusters.
Reinforcement Learning
A machine learning method focused on autonomous decision-making by an agent, which learns through trial and error by receiving rewards for goal-oriented actions.
Amazon SageMaker Ground Truth
A service from Amazon that helps label training data for supervised learning, often leveraging Amazon Mechanical Turk for crowdsourcing.
Amazon Mechanical Turk
A crowdsourcing platform that provides access to a global pool of affordable labor, often used in labeling data for machine learning models.
Clustering
A process in unsupervised learning where data is grouped based on patterns, useful in anomaly detection and pattern recognition.
Anomaly Detection
A use case for unsupervised learning where irregularities in data, such as outliers, are identified for further analysis.
Reinforcement Learning Agent
The entity in reinforcement learning that takes actions within an environment to achieve specific goals, learning through trial and error.
AWS DeepRacer
A reinforcement learning platform where users teach a model race car (the agent) to navigate a track (the environment) by taking actions to stay on course.
Exploratory Approach (Reinforcement Learning)
A learning approach where the agent explores actions without knowing the outcome, with successful actions being reinforced for goal achievement.
End Goal (Reinforcement Learning)
A predetermined objective in reinforcement learning that the agent works towards by refining its actions through trial and error.
Artificial Intelligence (AI)
The field of computer science dedicated to solving cognitive problems like learning, creation, and image recognition, aiming to create self-learning systems that derive meaning from data.
Machine Learning (ML)
A branch of AI that focuses on using data and algorithms to imitate the way humans learn, gradually improving accuracy to make predictions.
Deep Learning
A subset of machine learning inspired by the human brain, using layers of neural networks to recognize speech, images, and more.
Inference
A prediction made by an AI model, essentially an educated guess with a probabilistic result.
Regression Analysis
A technique used in AI to process historical time series data and predict future values.
Natural Language Processing (NLP)
A branch of AI that allows machines to understand, interpret, and generate human language in a natural way.
Generative AI
AI technology capable of generating original content such as text, images, videos, and music, based on a given prompt.
Computer Vision
AI technology used to process images and video for tasks like object identification, classification, and facial recognition.
Anomaly Detection
The process of recognizing deviations from expected patterns in data, often used in fraud detection or identifying system failures.
Amazon Bedrock
A platform that supports generative AI tasks such as generating content based on prompts, like creating song lyrics or stories.
Chatbot
An AI application that uses NLP to engage in conversations with users, often used in customer service or for booking systems.
Time Series Data
Historical data points collected over time, which AI models use to identify patterns and make future predictions.
Alexa
An AI-powered voice assistant that uses NLP to respond to user questions and perform tasks.
Amazon SageMaker
A platform that allows developers to build, train, and deploy machine learning models at scale.
Fraud Detection
An AI application used by financial institutions to detect anomalous activity and prevent fraudulent transactions.
Computer Vision Example
A model that can detect scratches on surfaces or missing components on circuit boards using image processing.
HR AI Applications
AI used to process resumes and match candidates to job roles, improving hiring efficiency.
Product Recommendation
An AI-driven system that uses shopping history to suggest products to customers.
Discovery (AI Example)
A media platform using AI to recommend personalized content based on viewing history.
Customer Support Translation
An AI system that translates between languages in real-time during customer support interactions, such as from English to Spanish.
Taxi Demand Forecasting
An AI application that helps taxi companies position cars based on forecasted customer demand.
Pandemic Prediction
AI systems used by agencies like the CDC to predict outbreaks and pandemics, aiding in the distribution of resources.
AI in Manufacturing
The use of AI with computer vision to monitor assembly lines, maintain product quality, and predict equipment maintenance needs.
AI in Medical Diagnosis
AI used to read X-rays and scans to help doctors make faster and more accurate diagnoses.
Call Center Monitoring
An AI system that detects deviations in call volumes to identify issues, such as system outages.
Machine Learning (ML)
The science of developing algorithms and statistical models that enable computers to perform tasks without explicit instructions by identifying patterns in large datasets.
Inference
The process where a trained machine learning model uses new data to make predictions or generate output it hasn’t seen during training.
Structured Data
Data stored in a table format, such as text files (CSV) or relational databases (RDS, Redshift), with clearly defined rows and columns.
Semi-Structured Data
Data that doesn’t fully follow the tabular structure, like JSON files, where features are stored as key-value pairs.
Unstructured Data
Data that does not follow any structured model, such as images, videos, or social media posts, typically stored as objects in systems like Amazon S3.
Time Series Data
Data that is labeled with timestamps and stored sequentially, often used for models that predict trends over time.