Udemy Test 1 Flashcards
What is a token?
A generative AI concept used to represent words, subwords, or characters that a model processes as discrete units of text
What is Sampling Bias?
It occurs when the data used to train a model does not reflect the diversity of the real-world population
What is confirmation bias?
Selectively searching for or interpreting information to confirm existing beliefs
What is observer bias?
Human errors or subjectivity during data analysis or observation
What is measurement bias?
Inaccuracies in data collection
What is hallucination?
The model generates plausible-sounding information that is incorrect
What is negative prompting?
Guiding a Gen AI model to avoid certain outputs or behaviors when generating text
What is SageMaker Clarify?
A tool that helps detect bias in data and models and explain how machine learning models make predictions
What is Amazon Polly?
A text-to-speech service used to deploy high-quality, natural-sounding human voices in dozens of languages
What is Amazon Comprehend?
Uses machine learning to find insights and relationships in text
What is Knowledge Bases for Amazon Bedrock?
Lets you give foundation models and agents contextual information from your company's private data sources for RAG, to deliver more relevant, accurate, and customized responses
What is BLEU?
A metric that is designed to evaluate the quality of text that has been machine translated by comparing with reference translations
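The comparison against a reference translation can be sketched in a few lines. This is a simplified, single-reference version limited to unigrams and bigrams, not the full metric (standard BLEU uses up to 4-grams, multiple references, and corpus-level statistics). The function names `ngrams` and `simple_bleu` are made up for illustration.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def simple_bleu(candidate, reference, max_n=2):
    """Simplified sentence-level BLEU against a single reference."""
    cand, ref = candidate.split(), reference.split()
    if not cand:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts, ref_counts = ngrams(cand, n), ngrams(ref, n)
        # Clip each candidate n-gram count by its count in the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        total = sum(cand_counts.values())
        if overlap == 0 or total == 0:
            return 0.0
        log_precisions.append(math.log(overlap / total))
    # Brevity penalty: penalize candidates shorter than the reference.
    bp = 1.0 if len(cand) >= len(ref) else math.exp(1 - len(ref) / len(cand))
    # Geometric mean of the n-gram precisions, scaled by the penalty.
    return bp * math.exp(sum(log_precisions) / max_n)


score = simple_bleu("the cat sat on a mat", "the cat sat on the mat")
print(round(score, 3))
```

An identical candidate and reference score 1.0, and a candidate that shares no words with the reference scores 0.0.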
What can Amazon Bedrock Guardrails do?
It can detect confidential information in prompts or model responses
What is Exploratory Data Analysis (EDA)?
A phase of the data science process that involves examining the data through statistical summaries and visualizations to identify patterns, detect anomalies, and form hypotheses
What is Amazon Personalize?
An ML service that uses your data to generate product and content recommendations for your users
Transformer models
Use a self-attention mechanism and implement contextual embeddings
What is Amazon Rekognition?
A cloud-based image and video analysis service that makes it easy to add computer vision capabilities to your application
Key features of Amazon SageMaker JumpStart
- Pre-trained models that are fully customizable for your use case with your data
- You can evaluate, compare, and select foundation models quickly based on predefined quality and responsibility metrics
Narrowly defined use cases
Provide clear and specific requirements for the application, helping the research team understand exactly what the model needs to accomplish
Amazon OpenSearch Service
Is designed to provide fast search capabilities and supports full-text search, indexing, and scoring
BERT
A model that is used to differentiate the contextual meaning of words when applied to different phrases
AWS Global Infrastructure
- Each availability zone has one or more data centers
- Each AWS Region has a minimum of 3 AZs
Few-Shot Prompting
Involves providing the model with a few examples that include user input and the correct user intent
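A minimal sketch of what such a prompt might look like, assuming an intent-classification task. The helper `build_few_shot_prompt`, the example messages, and the intent labels are all hypothetical, and the resulting string would still need to be sent to a model.

```python
def build_few_shot_prompt(examples, user_input):
    """Format labeled examples followed by the new input to classify."""
    lines = ["Classify the intent of each user message.", ""]
    for text, intent in examples:
        lines.append(f"User: {text}")
        lines.append(f"Intent: {intent}")
        lines.append("")
    # The final entry is left unlabeled so the model completes the intent.
    lines.append(f"User: {user_input}")
    lines.append("Intent:")
    return "\n".join(lines)


# Hypothetical examples pairing user input with the correct intent.
examples = [
    ("Where is my order?", "track_order"),
    ("I want my money back.", "request_refund"),
]
prompt = build_few_shot_prompt(examples, "Can I change my delivery address?")
print(prompt)
```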
Plagiarism
Presenting someone else’s work, ideas or creations as your own. A concern that exists around AI
Foundation models
Use self-supervised learning to create labels from input data
Fine-tuning a foundation model
Is a supervised learning process
Continued Pre-training
Uses unlabeled data to continue pre-training a model
Fine-tuning
Uses labeled data to train a model
Testing and deploying customized models in Amazon Bedrock
It is mandatory to use Provisioned Throughput
Hyperparameter Tuning
Allows you to adjust settings such as regularization, learning rates, and dropout rates to enhance the model's ability to generalize well to new data
Asynchronous Inference
Allows you to process requests, including large payloads and long-running jobs, without requiring real-time responses by queuing requests and handling them in the background. It is cost-effective and efficient when a delay is acceptable
Valid use case for Gen AI
Creating photorealistic images from text descriptions is a valid use case
Use MLflow with Amazon SageMaker:
To track, organize, view, analyze and compare iterative ML experimentation to gain insights and register and deploy models
RAG approach
The least costly and most efficient solution for providing up-to-date and relevant responses, because it retrieves current data at query time instead of retraining the model
What does the Amazon Q Business web application workflow use?
- Retrieval-Augmented Generation (RAG)
- Large Language Model (LLM)
Approaches that are best for changing a Foundation Model (FM) into a domain-specific expert
- Continued Pre-training
- Domain-Adaptive Fine-Tuning
What can Amazon Rekognition not be used for?
To create multilingual user experiences
What do you use to get more creative responses from the same prompt in Bedrock?
Use a higher temperature
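The effect of temperature can be illustrated with a temperature-scaled softmax. This is a generic sketch of the idea, not a description of how Bedrock implements it. The logits are made-up scores for three candidate tokens, and `softmax_with_temperature` is a name chosen for illustration.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Convert raw scores to probabilities, scaled by temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.1]  # hypothetical scores for three candidate tokens
low = softmax_with_temperature(logits, 0.5)   # sharper distribution
high = softmax_with_temperature(logits, 2.0)  # flatter distribution
print([round(p, 3) for p in low])
print([round(p, 3) for p in high])
```

At the higher temperature the least likely token receives a larger share of the probability, so sampling picks it more often and responses vary more.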
What is the Shared Responsibility Model for AWS?
AWS is responsible for security of the cloud, while the user is responsible for security in the cloud
What is the best cloud pricing option for a company that is uncertain about the time commitment or extent of use?
On-Demand pricing
What are decision trees?
Highly interpretable models that provide a clear and straightforward visualization of the decision-making process
What are Agents for Amazon Bedrock?
Fully managed capabilities that make it easier for developers to create generative AI-based applications that can complete complex tasks for a wide range of use cases
What are Partial Dependence Plots (PDP)?
They provide a global view of a model's behavior by showing how the predicted outcome changes as a single feature is varied across its range
What is Amazon Transcribe?
It converts audio input into text
What is Amazon Comprehend?
A natural language processing (NLP) service that uses machine learning to find insights and relationships in text
What is Generative AI
It encompasses models and algorithms capable of creating new content such as text, images and audio based on patterns learned from existing data
What is reinforcement learning?
It involves an agent interacting with an environment by taking actions and receiving rewards or penalties, learning to maximize cumulative rewards over time
What is invocation logging?
Allows for detailed logging of all requests and responses during model invocations
What is the default vector database for Amazon Bedrock?
Amazon OpenSearch Serverless vector store
Confusion matrix
A tool designed to evaluate the performance of classification models by displaying the number of true positives, true negatives, false positives, and false negatives
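As a minimal sketch, the four cells can be counted by hand for a binary classifier. The labels below are made up, and `confusion_counts` is a name chosen for illustration.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary classifier."""
    counts = {"TP": 0, "TN": 0, "FP": 0, "FN": 0}
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            counts["TP" if actual == positive else "FP"] += 1
        else:
            counts["FN" if actual == positive else "TN"] += 1
    return counts


# Made-up ground-truth labels and model predictions.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0, 1, 0]

counts = confusion_counts(y_true, y_pred)
precision = counts["TP"] / (counts["TP"] + counts["FP"])
recall = counts["TP"] / (counts["TP"] + counts["FN"])
print(counts, precision, recall)
```

Metrics such as precision and recall are derived directly from these four counts.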
What is Top P?
The percentage of most likely candidates that a model considers for the next token
When using Top P, how do you increase the size of the pool?
Choose a higher value
When using Top P, how do you decrease the size of the pool?
Choose a lower value to decrease the size of the pool and limit options to likely outputs
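One common way to read Top P is as a cumulative-probability cutoff (nucleus sampling): keep the most likely tokens until their combined probability reaches the chosen value. The sketch below assumes that reading. The token probabilities are made up, and `top_p_filter` is a name chosen for illustration.

```python
def top_p_filter(probs, p):
    """Keep the most likely tokens until their total probability reaches p."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    kept, cumulative = {}, 0.0
    for token, prob in ranked:
        kept[token] = prob
        cumulative += prob
        if cumulative >= p:
            break
    # Renormalize so the surviving candidates sum to 1.
    total = sum(kept.values())
    return {token: prob / total for token, prob in kept.items()}


# Made-up next-token probabilities.
probs = {"horses": 0.5, "zebras": 0.25, "unicorns": 0.125, "dragons": 0.125}
print(top_p_filter(probs, 0.5))   # small pool
print(top_p_filter(probs, 0.75))  # larger pool
```

Raising the value lets less likely tokens into the pool, and lowering it restricts the pool to the most likely ones.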
What is Amazon Kendra?
A highly accurate and easy to use search service
What powers Amazon Kendra?
Machine learning
What is a context window?
It defines how much text the AI model can process at one time to generate good output
What is SageMaker Model Dashboard?
A centralized portal where you can view, search, and explore all models in your account
What is Amazon Forecast?
A fully managed service that uses statistical and machine learning algorithms to deliver highly accurate time-series forecasts
What is Model evaluation?
The process of evaluating and comparing model outputs to determine the model that is best suited for a use case
What is model inference?
The process of a model generating output from a given input
What is Top K?
The number of most likely candidates that the model considers for the next token
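A minimal sketch of the idea: rank the candidates, keep the first k, and renormalize. The token probabilities are made up, and `top_k_filter` is a name chosen for illustration.

```python
def top_k_filter(probs, k):
    """Keep only the k most likely tokens and renormalize."""
    if k < 1:
        raise ValueError("k must be at least 1")
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:k]
    total = sum(prob for _, prob in ranked)
    return {token: prob / total for token, prob in ranked}


# Made-up next-token probabilities.
probs = {"horses": 0.5, "zebras": 0.25, "unicorns": 0.125, "dragons": 0.125}
result = top_k_filter(probs, 2)
print(result)
```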
What is Supervised learning?
Algorithms that train on sample data that specifies the algorithm's input and output
What is Linear regression?
Refers to supervised learning models that predict a value from a continuous scale
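As a minimal sketch, a one-feature linear regression can be fit with ordinary least squares in plain Python. The data points are made up so that they lie exactly on y = 2x + 1, and `fit_line` is a name chosen for illustration.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Labeled training data (input x, continuous output y), made up for the example.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(xs, ys)
prediction = slope * 5.0 + intercept  # predict a continuous value for x = 5
print(slope, intercept, prediction)
```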
What is a Neural network?
A more complex learning technique, often used for supervised learning, that passes inputs through layers of interconnected nodes
What is Amazon Q Developer powered by?
Amazon Bedrock
How do you expose a model to a more diverse set of inputs?
By augmenting the data
What is Amazon Comprehend?
A natural language processing (NLP) service that uses machine learning to uncover insights and relationships in text
What is an AI service that provides access to foundation models via an API?
Amazon Bedrock
What is a service that provides guidance to help you provision your resources following AWS best practices?
AWS Trusted Advisor
What is AWS Trusted Advisor?
A service that provides guidance to help you provision your resources using AWS best practices
What is the most effective way to minimize costs associated with the use of a Gen AI model in Bedrock?
Reduce the number of tokens in the input
Foundation models
Serve as a broad base for various AI applications by providing generalized capabilities
What is Amazon SageMaker Ground Truth?
It helps you build high-quality training datasets for your machine learning models
What are the key components of a good prompting technique?
- Instructions
- Context
- Input Data
- Output indicator
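The four components above can be sketched as a simple prompt template. The helper `build_prompt`, the section labels, and the sample text are hypothetical. Real prompts do not need these exact labels.

```python
def build_prompt(instructions, context, input_data, output_indicator):
    """Assemble the four prompt components into one string."""
    parts = [
        f"Instructions: {instructions}",
        f"Context: {context}",
        f"Input: {input_data}",
        f"Output: {output_indicator}",
    ]
    return "\n\n".join(parts)


# Hypothetical values for each component.
prompt = build_prompt(
    instructions="Summarize the customer review.",
    context="The review is for a pair of wireless headphones.",
    input_data="Great sound, but the battery only lasts three hours.",
    output_indicator="One-sentence summary:",
)
print(prompt)
```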
What is Amazon Q?
A generative AI-powered assistant that allows you to create pre-packaged Gen AI applications
Amazon Bedrock provides what?
An environment to build and scale Gen AI applications using foundation models via an API
What is Transfer learning?
A method that allows a model to utilize the knowledge learned from one task or dataset to improve its performance on a new, but related, task
Amazon Bedrock is the easiest way to?
Build and scale generative AI Applications with foundation models
What is Amazon SageMaker JumpStart?
A machine learning hub with foundation models, built-in algorithms, and pre-built ML solutions that you can deploy with just a few clicks
What can Generative AI do?
It can automate the creation of new data based on existing patterns, enhancing productivity and innovation