Udemy Test 1 Flashcards
What is a token
A generative AI concept used to represent words, sub words or characters that a model processes as discrete units of text
What is Sampling Bias?
It occurs when the data used to train a model does not reflect the diversity of the real world population
What is confirmation bias?
Selectively searching for or interpreting information to confirm existing beliefs
What is observer bias?
Human errors or subjectivity during data analysis or observation
What is measurement bias?
Inaccuracies in data collection
What is hallucination?
The model generates seemingly accurate information, that is incorrect
What is negative prompting?
Guiding a Gen AI model to avoid certain outputs, or behaviors when generating text
What is Sagemaker Clarify?
A tool to help explain how machine learning models make predictions
What is Amazon Poly?
Used to deploy high quality natural sounding human voices in dozens of languages
What is Amazon Comprehend?
Uses machine learning to find insights and relationships in text
What is knowledge bases for Amazon Bedrock?
You can give foundational models and Agents contextual information from your companies private data sources for RAG to deliver more relevant, accurate and customized responses
What is BLEU
A metric that is designed to evaluate the quality of text that has been machine translated by comparing with reference translations
What can Amazon bedrock Guardrails do?
It can detect confidential information in prompts or model responses
What is Exploritory Data Analysis (EDA)?
A phase of the data science process that involves examining the data through Statistical Summeries and visualizations to identify patterns, detect anomalies and form a hypothesis
What is Amazon Personalized?
A ML service that uses your data to generate product and content recommendations for your users
Transformer models
Use a self attention mechanism and implement contextual ebbeddings
What is Amazon Rekognition
A cloud based image and video service that makes it easy to add computer vision capabilities to your application
Key features of Amazon Sagemaker Jumpstart
- Pre-trained models that are fully customizable for your use case with your data
- You can evaluate, compare, and select foundational models quickly based on Pre-Trained quality and responsibility metrics
Narrowly defined use cases
Provide clear and specific requirements for the application, helping the research team understand exactly what the model need to accomplish
Amazon OpenSearch Service
Is designed to provide fast search capabilities and supports full text search, indexing, and scoring
BERT
A model that is used to Differentiate the contextual meaning of words when applied to different phrases
AWS Global Infrastructure
- Each availability zone has one or more data centers
- Each AWS region has a minimum of 3 AZ’s
Few Shot Prompting
Involves providing the model with a few examples that include user input and the correct user intent
Plagiarism
Presenting someone else’s work, ideas or creations as your own. A concern that exists around AI
Foundational models
Use self-supervised learning to create labels from input data
Fine Tuning a foundational model
Is a supervised learning process
Continued Pre-training
Uses Unlabeled data to pre-train a model
Fine Tuning
Uses labeled data to train a model
Testing and Deploying customized models for Bedrock
It is mandatory to use provisional throughput
Hyperparameter Tuning
Allows you to adjust settings such as reguklaization, learning rates, and drop out rates to enhance the models ability to generalize Local to new data
Asynchronous Inference
Allows you to process smaller payloads without requiring real-time responses by queuing requests and handling them in the background. It’s cost effective and efficient when a delay is acceptable
Valid use case for Gen AI
Create Photorealistic images from text descriptions is a valid use case