AIF-C01 Flashcards

Question 1

Q

You ONLY want to manage Applications and Data. Which type of Cloud Computing model should you use?
* On-premises
* Infrastructure as a Service (IaaS)
* Software as a Service (SaaS)
* Platform as a Service (PaaS)

Answer

A

Platform as a Service model

Question 2

Q

Is Ec2 a PaaS or IaaS?

Question 3

Q

Give an example of a PaaS?

Answer

A

AWS Beanstalk

Question 4

Q

What is the pricing model of Cloud Computing?
* Discounts over time
* Pay-as-you-go pricing
* Pay once a year
* Flat-rate pricing

Answer

A

Pay as you go

Question 5

Q

Which Global Infrastructure identity is composed of one or more discrete data centers with redundant power, networking, and connectivity, and are used to deploy infrastructure?
* Edge Locations
* Availability Zones
* Regions

Answer

A

Availability Zones

Question 6

Q

Which of the following is NOT one of the Five Characteristics of Cloud Computing?
* Rapid elasticity and scalability
* Multi-tenancy and resource pooling
* Dedicated Support Agent to help you deploy applications
* On-demand self service

Answer

A

Dedicated Support Agent to help you deploy applications

Question 7

Q

Which are the 3 pricing fundamentals of the AWS Cloud?
* Compute, Storage, and Data transfer in the AWS Cloud
* Compute, Networking, and Data transfer out of the AWS Cloud
* Compute, Storage, and Data transfer out of the AWS Cloud
* Storage, Functions, and Data transfer in the AWS Cloud

Answer

A

Compute, Storage, and data transfer out of the AWS Cloud are the 3 pricing fundamentals of the AWS Cloud.

Question 8

Q

Which of the following options is NOT a point of consideration when choosing an AWS Region?
* Compliance with data governance
* Latency
* Capacity availability
* Pricing

Answer

A

Capacity is unlimited in the cloud, you do not need to worry about it. The 4 points of considerations when choosing an AWS Region are: compliance with data governance and legal requirements, proximity to customers, available services and features within a Region, and pricing.

Question 9

Q

Which of the following is NOT an advantage of Cloud Computing?
* Trade capital expense (CAPEX) for operational expense (OPEX)
* Train your employees less
* Go global in minutes
* Stop spending money running and maintaining data centers

Answer

A

You must train your employees more so they can use the cloud effectively.

Question 10

Q

AWS Regions are composed of?
* Two or more Edge Locations
* One or more discrete data centers
* Three or more Availability Zones

Answer

A

AWS Regions consist of multiple, isolated, and physically separate Availability Zones within a geographic area.

Question 11

Q

Which of the following services has a global scope?
* EC2
* IAM
* Lambda
* Rekognition

Answer

A

IAM is a global service (encompasses all regions).

Question 12

Q

Which of the following is the definition of Cloud Computing?
* Rapidly develop, test and launch software applications
* Automatic and quick ability to acquire resources as you need them and release resources when you no longer need them
* On-demand availability of computer system resources, especially data storage (cloud storage) and computing power, without direct active management by the user
* Change resource types when needed

Answer

A

On-demand availability of computer system resources, especially data storage (cloud storage) and computing power, without direct active management by the user.

Question 13

Q

What defines the distribution of responsibilities for security in the AWS Cloud?
* AWS Pricing Fundamentals
* The Shared Responsibility Model
* AWS Acceptable Use Policy
* The AWS Management Console

Answer

A

The Shared Responsibility Model defines who is responsible for what in the AWS Cloud.

Question 14

Q

A company would like to benefit from the advantages of the Public Cloud but would like to keep sensitive assets in its own infrastructure. Which deployment model should the company use?
* Private Cloud
* Public Cloud
* Hybrid Cloud

Answer

A

Using a Hybrid Cloud deployment model allows you to benefit from the flexibility, scalability and on-demand storage access while keeping security and performance of your own infrastructure.

Question 15

Q

What is NOT authorized to do on AWS according to the AWS Acceptable Use Policy?
* Building a gaming application
* Deploying a website
* Run analytics on stolen content
* Backup your data

Answer

A

Run analytics on stolen content
You can run analytics on AWS, but you cannot run analytics on fraudulent content. Refer to the AWS Acceptable Use Policy to see what is not authorized to do on AWS.

Question 16

Q

GenAI is a subset of X which is a subset of Y which is a subset of Z.

Answer

A

X - Deep Learning
Y - Machine Learning
Z - Artificial Intelligince

Question 17

Q

What does GenAI generate?

Answer

A

new data/content that is similar to the data that it was trained on like Text, images, Audio, Code, Video and etc.

Question 18

Q

What is the cost of generating a foundation model and why? who can do this?

Answer

A

Tens of Millions of dollars for training, so it only can be done by large companies who can afford it.

Question 19

Q

Which famous GenAI models are open source?

Answer

A

Meta and Google BERT

Question 20

Q

Which famous GenAI models are commercial and not open source?

Answer

A

OpenAI and Anthropic

Question 21

Q

What does LLM stand for?

Answer

A

Large Language Models

Question 22

Q

What is LLM?

Answer

A

Type of GenAI that generates coherent human-like text.

Question 23

Q

What is the most famous LLM?

Question 24

Q

What is a prompt?

Answer

A

GenAI Model’s input from a user. The question that the user asks from GenAI model.

Question 25

Q

What does it mean that a prompt is non-deterministic?

Answer

A

2 users with the same prompt from the same GenAI model, may get different answers.

Question 26

Q

Name a famous image generative AI method

Answer

A

Diffusion Models - e.g. Stable Diffusion

Question 27

Q

How is a diffusion model trained and generates?

Answer

A

trained by Forward diffusion process (by adding noise to a picture in multiple steps) and generates by reverse diffusion

Question 28

Q

What is AWS bedrock?

Answer

A

A fully managed AWS service to build Gen AI applications

Question 29

Q

Is my training data secure in Bedrock?

Answer

A

yes. it’s all within the same account and not leaving it. any Foundation Model used is a copy of the original model, trained by customer data and stored locally.

Question 30

Q

What are the elements of Bedrock service?

Answer

A

Foundation Models
Interactive Playground for users
Knowledge Bases (RAG): to fetch data from external data sources to generate more relevant and accurate responses.
Fine-Tuning: Update the model with your data
Unified APIs across all the models used by GenAI applications.

Question 31

Q

Does enabling a model cost?

Answer

A

No, we only pay for using a model.

Question 32

Q

What is Amazon Titan?

Answer

A

High-performing Foundation Models from AWS
Image, text, multimodal model choices via a fully-managed APIs
Can be customized with your own data

Question 33

Q

What is Amazon Titan Text Express model good for?

Answer

A

High-performance
text model, +100
languages
Content creation,
classification,
education…

Question 34

Q

What is Claude model from Anthropic (an AI leading company) good for?

Answer

A

High-capacity text
generation, multilanguage
Analysis, forecasting,
document
comparison…

Question 35

Q

What is (Llama-2 70b-chat) model good for?

Answer

A

Large-scale tasks,
dialogue, English
Text generation,
customer service…

Question 36

Q

What is Stable Diffusion model from Stability.ai good for?

Answer

A

Image creation for
advertising, media…

Question 37

Q

How in aws console we can play with models and choose the right one?

Answer

A

Playground in console has compare models feature, that you can run a command across multi models and compare their result.
Compare, by the result content, pricing, response time, etc.

Question 38

Q

What is a Custom Model in Bedrock?

Answer

A

By choosing a base model, and tuning based on our own data, we can create a custom model.

Question 39

Q

What are the options to customize a model in Bedrock?

Answer

A

Fine Tune (one off)
Continued Pre-training (ongoing)

Question 40

Q

Which models can be fine tuned?

Answer

A

not all, usually open-sources, e.g. Amazon, Cohere and Meta

Question 41

Q

For Fine tuning a model, how the data is provided?

Answer

A

from S3
must be dataset format
use Sagemaker Ground Truth to create and label training datasets

Question 42

Q

What are Hyperparameters?

Answer

A

variables for the machine learning algorithms

Question 43

Q

What is Epochs Hyperparameter?

Answer

A

The total number of iterations of all the training data in one cycle for training the model.

Question 44

Q

What is Batch Size Hyperparameter?

Answer

A

The number of samples proceeded before model parameters are updated.

Question 45

Q

What is learning rate hyperparameter?

Answer

A

The rate at which model parameters are updated after each batch of training data. basically, how fast the model learns.

Question 46

Q

What is Learning Rate warmup steps hyperparameter?

Answer

A

Number of iterations over which learning rate is gradually increased to the initial rate specified.

Question 47

Q

Where is model fine tuning result saved?

Question 48

Q

What is a model fine tuning result?

Answer

A

A fine tuned model trained with our data

Question 49

Q

What is the pricing limitation of working with fine tuned models?

Answer

A

must provision throughput

Question 50

Q

Instructions based fine tuning uses…?

Answer

A

Labeled examples that are [prompt-response] based.

Question 51

Q

How is continued pre-training?

Answer

A

Provide Unlabeled data. e.g. any unstructured knowledge! it has only “input” field.

Question 52

Q

Continued Pre-training is also called ….

Answer

A

Domain Adoption Fine tuning. make the model expert in a domain.

Question 53

Q

what are 2 types of instruction based fine tuning?

Answer

A

Single Turn Messaging
Multi Turn Messaging => e.g. chatbot with multi turn conversation

Question 54

Q

What is Multi Turn Messaging schema?

Answer

A

{
“system”: “context”,
[{
“role”: “user/assistant”,
“content”: “message”
}]
}

Question 55

Q

Re-training vs instruction based fine tuning? (5 items)

Answer

A

Re-training an FM requires a higher budget
Instruction-based fine-tuning is usually cheaper as computations are
less intense and the amount of data required usually less
It also requires experienced ML engineers to perform the task
You must prepare the data, do the fine-tuning, evaluate the model
Running a fine-tuned model is also more expensive (provisioned
throughput)

Question 56

Q

What is Transfer Learning

Answer

A

Transfer Learning – the broader concept of reusing a pre-trained model to adapt it to a new related task
* Widely used for image classification
* And for NLP (models like BERT and GPT)
* Can appear in the exam as a general ML concept
* Fine-tuning is a specific kind of transfer learning

Question 57

Q

Say 4 use cases for file-tuning

Answer

A

A chatbot designed with a particular persona or tone, or geared
towards a specific purpose (e.g., assisting customers, crafting
advertisements)
Training using more up-to-date information than what the language
model previously accessed
Training with exclusive data (e.g., your historical emails or messages,
records from customer service interactions)
Targeted use cases (categorization, assessing accuracy)

Question 58

Q

How is Amazon Bedrock Evaluating a Model

Answer

A

Using automatic evaluation.
- it evaluate a model for quality control
- Built-in task types:
* Text summarization
* question and answer
* text classification
* open-ended text generation…
- Bring your own prompt dataset or use built-in curated prompt datasets (Benchmark datasets - questions and answers)
- Scores are calculated automatically
Model scores are calculated using various statistical methods (e.g. BERTScore, F1…)

Question 59

Q

What is a Judge model

Answer

A

In bedrock, a model that evaluates the evaluating model’s answers vs benchmark answers, and gives a grading score.

Question 60

Q

What is a Bias score?

Answer

A

Some benchmark datasets in Bedrock can quickly identify bias like discrimination against a specific group. they generate a Bias score.

Question 61

Q

What is the diff between human and automatic model evaluation?

Answer

A

in Human approach, instead of the Judge model, groups of humans evaluate by giving thumbs up or down or score like 1 to 5, and generating a grade.
Also, in human evaluation, we can have custom tasks types that only humans can evaluate accurately.

Question 62

Q

What are automated metrics to evaluate a FM?

Answer

A

ROUGE
BLEU
BERTScore
Perplexity

Question 63

Q

What is ROUGE? and what are ROUGE-N and ROUGE-L?

Answer

A

Recall Oriented Understudying for Gisting Evaluation => it’s a FM auto evaluation metric

Evaluating automatic summarization and machine translation systems
* ROUGE-N – measure the number of matching n-grams between reference and generated text.
(1gram is a word - it means how many words the model’s answer is from the benchmark answer)
* ROUGE-L – longest common subsequence between reference and generated text.
i.e. the longest sequence of words (not necessarily consecutive, but still in order) that is shared between both.

Question 64

Q

what is Gisting?

Answer

A

the word’s meaning= engage in chat or gossip.
in ML, it means using machine translation (MT) to quickly understand the general meaning or essence of foreign text, without requiring a perfect translation.

Focus on meaning, not precision:
Gisting prioritizes understanding the core message over achieving a grammatically perfect translation.

Answer 62

A

Bilingual Evaluation Understudy => it’s a FM auto evaluation metric
* Evaluate the quality of generated text, especially for translations
* Considers both precision and penalizes too much brevity
* Looks at a combination of n-grams (1, 2, 3, 4)

Answer 63

A

Bidirectional Encoder Representations from Transformers => it’s a FM auto evaluation metric.
Semantic similarity between generated text
* Uses pre-trained BERT models to compare the contextualized embeddings of both texts and computes the cosine similarity between them.
* Capable of capturing more nuance between the texts

Answer 64

A

it’s a FM auto evaluation metric.
how well the model predicts the next token (lower is better)

Answer 65

A

User Satisfaction – gather users’ feedbacks and assess their satisfaction with the model responses (e.g., user satisfaction for an ecommerce platform)
Average Revenue Per User (ARPU) – average revenue per user attributed to
the Gen-AI app (e.g., monitor ecommerce user base revenue)
Cross-Domain Performance – measure the model’s ability to perform cross
different domains tasks (e.g., monitor multi-domain ecommerce platform)
Conversion Rate – generate recommended desired outcomes such as purchases
(e.g., optimizing ecommerce platform for higher conversion rate)
Efficiency – evaluate the model’s efficiency in computation, resource utilization…
(e.g., improve production line efficiency)

Answer 66

A

General Text Generation
Text summarization
Question and Answer
Text classification

Answer 67

A

Toxity: offensive and inappropriate content
Accuracy
Robustness
Relevance
Consistency
Completeness

Answer 68

A

Retrieval - Augmented Generation

Answer 69

A

Allows a FM to reference a data source outside of training data.
the bedrock using RAG builds a knowledge base, backed by a vector database.

Answer 70

A

Bedrocks takes care of building the knowledge base based on customer data source.

Answer 71

A

When user sends a “Query” to Bedrock Prompt, it sends a “Search” to the knowledge base and Retrieves the “Retrieval Text”. then it sends the “Query”+”Retrieval Text” called the “Augmented Prompt” to the Foundation Model to generate the final response.
The response had reference to the actual data source chunks.

Answer 72

A

Where realtime data is needed to be fed into the foundation model.

Answer 73

A

AWS Aurora, AWS OpenSearch, Redis, MangoDB, Pinecone

Answer 74

A

A vector database is a specialized database designed to store and manage data as high-dimensional vectors, enabling efficient similarity searches and retrieval of data based on semantic meaning, rather than structured data organization.

Here’s a more detailed explanation:

Data Representation:
Instead of storing data in rows and columns like traditional databases, vector databases store data as mathematical vectors, which are numerical representations of data features.

Similarity Search:
The primary purpose of vector databases is to perform similarity searches, finding data points that are “close” to a given query vector based on their vector representations.

Applications:
Vector databases are used in various applications, including:
* Recommender Systems: Suggesting similar items or content to users.
* Semantic Search: Finding documents or data that are semantically similar to a query.
* Image and Audio Recognition: Matching images or audio clips based on their vector representations.
* Anomaly Detection: Identifying unusual patterns or outliers in data.

Answer 75

A

Using the Embeddings Models like Amazon Titan and Cohere. The Embeddings models doesn’t need to be the same as the Foundation Model.

The documents, for example from S3, is sectioned into “Document Chunks”, then the embedding models convert them to vectors and place these vectors in the vector database.

Answer 76

A

Amazon OpenSearch Service – search & analytics database real time similarity queries, store millions of vector embeddings scalable index management, and fast nearest-neighbor (kNN) high performance search capability
Amazon DocumentDB [with MongoDB compatibility] – NoSQL database
real time similarity queries, store millions of vector embeddings
Amazon Aurora – relational database, proprietary on AWS
Amazon RDS for PostgreSQL – relational database, open-source
Amazon Neptune – graph database

Answer 77

A

Amazon S3
Confluence
Microsoft SharePoint
Salesforce
Web pages (your website, your social
media feed, etc…)
More added over time…

Answer 78

A

Converting raw text into a sequence of tokens
* Word-based tokenization: text is split into individual words
* Subword tokenization: some words can be split too (helpful for long words…)

Answer 79

A

https://platform.openai.com/tokenizer

Answer 80

A

The number of tokens an LLM can
process at once, and is primarily about the prompt and the information it contains. it’s a race now between models to have the greatest context window.

Answer 81

A

Pros: more information and
coherence
Cons: more memory and processing
power

Answer 82

A

its context window

Answer 83

A

array of numerical values out of text, images or audio. These are scores, a rating for each dimension such as semantic meaning, syntactic role and sentiment. can be positive or negative.

Answer 84

A

Creating vectors out of text, images and videos

Answer 85

A

text > tokens > each token gets a tokenID > the tokens are fed into an embedding model > each token is converted into a vector (an array of scores) > vectors are stored in a Vector db

Answer 86

A

it generates vectors with scores that are searchable using the nearest neighbor capability of search engines like open search.

Answer 87

A

dimentionality reduction: visualize in 2D
color visualization

Answer 88

A

Control the interaction between users and Foundation Models (FMs)
* Filter undesirable and harmful content => Blocked topics list
* Remove Personally Identifiable Information (PII)
* Enhanced privacy
* Reduce hallucinations
* Ability to create multiple Guardrails and monitor and analyze user inputs that can
violate the Guardrails

Answer 89

A

the error message (could be different ones for prompt and response)
harmful categories filter (boolean) - e.g. hate, sexual, insults, violence and misconduct
prompt attacks filter (boolean): user inputs trying to override the system instructions.
content filters
custom word filters
denied topics
PII filters
contextual grounding check (reduce hallucination): verify if the response is meaningful based on the knowledge provided
Relevance check

Answer 90

A

It is priced based on the number of text units processed, with content filters and denied topics costing $0.15 per 1,000 text units.

Answer 91

A

Manage and carry out various multi-step tasks related to infrastructure
provisioning, application deployment, and operational activities
Task coordination: perform tasks in the correct order and ensure
information is passed correctly between tasks
Agents are configured to perform specific pre-defined action groups
Integrate with other systems, services, databases and API to exchange data
or initiate actions
Leverage RAG to retrieve information when necessary

Answer 92

A

assigned a task
The agent sends the following to a Bedrock model:
- Prompt
- Instructions
- Action groups and knowledge bases
- Conversation history
- Task
The Bedrock model runs “Chain of thought” => a list of steps
Each step can be calling an API from the actions groups, executing a lambda or searching a knowledge base.
The result is sent back to the agent
The agent sends the Task and The Result to another Bedrock model to generate the final refined response.

Answer 93

A

gives us a list of steps generated by “Chain of thought”, so we can debug them.

Answer 94

A

All the calls to Bedrock models (including request and response, the model Id, number of token, the applied guadrails, the region, latency in ms, etc.) are logged and sent to Cloudwatch or S3 or both.
This can include the Text, the Images and the Embeddings.
Then we also can define Alerts in Cloudwatch based on logs analytics.
It should be enabled in Bedrock settings.

Answer 95

A

For the guardrails: “ContentFilteredCount”
Invocations (the count)
InvocationLatency
OutputTokenCount
InputTokenCount

Answer 96

A

On-Demand:

Pay-as-you-go (no commitment)
Text Models – charged for every input/output token processed
Embedding Models – charged for every input token processed
Image Models – charged for every image generated
IMPORTANT - * Works with Base Models only

Batch:

Multiple predictions at a time (output is a single file in Amazon S3)
Can provide discounts of up to 50%

Provisioned Throughput:

Purchase Model units for a certain time (1 month, 6 months…)
Throughput – max. number of input/output tokens processed per minute
IMPORTANT - * Works with Base, Fine-tuned, and Custom Models

Answer 97

A

Prompt Engineering
* No model training needed (no additional computation or fine-tuning)
Retrieval Augmented Generation (RAG)
* Uses external knowledge (FM doesn’t need to ”know everything”, less complex)
* No FM changes (no additional computation or fine-tuning)
Instruction-based Fine-tuning
* FM is fine-tuned with specific instructions (requires additional computation)
Domain Adaptation Fine-tuning
* Model is trained on a domain-specific dataset (requires intensive computation)

Answer 98

A

On-Demand – great for unpredictable workloads, no long-term commitment
Batch – provides up to 50% discounts
Provisioned Throughput – (usually) not a cost-saving measure, great to “reserve”
capacity
Temperature, Top K, Top P – They are Model configurations - no impact on pricing
Model size – usually a smaller model will be cheaper (varies based on providers)
Number of Input and Output Tokens – main driver of cost

Answer 99

A

Multimodel model

Answer 100

A

Human evaluation

Answer 101

A

developing, designing, and optimizing prompts to
enhance the output of FMs for your needs

Answer 102

A

Instructions – a task for the model to do (description, how the model should perform)
Context – external information to guide the model
Input data – the input for which you want a response
Output Indicator – the output type or format

NOTE - write the keyword in the prompt, for example: Context: xyz

Answer 103

A

A technique where you explicitly instruct the model on what not to include or do in its response:
* Negative Prompting helps to:
* Avoid Unwanted Content – explicitly states what not to include, reducing the chances of irrelevant or inappropriate content
* Maintain Focus – helps the model stay on topic and not stray into areas that are not useful or desired
* Enhance Clarity – prevents the use of complex terminology or detailed data, making the output clearer and more accessible

Answer 104

A

Enhanced Promping

Answer 105

A

Pros: Accuracy
Cons: Cost

Cost Implications:
Because the enhanced prompt is longer and contains more tokens, it will cost more to process, even if the response itself is not significantly longer.

Trade-offs:
While enhanced prompting can lead to better quality and more relevant outputs, the increased cost needs to be weighed against the value of the improved results.

Optimization:
Prompt engineers need to balance the need for detailed prompts with the cost of token usage, optimizing for both quality and efficiency.

Answer 106

A

The LLM parameters:
* System Prompts – how the model should behave and reply
* Temperature (0 to 1) = creativity
how likely it is to choose less probable words. 0 means the response is always the same which is the most probable answer.
* Low (ex: 0.2) – outputs are more conservative, repetitive, focused on most likely response
* High (ex: 1.0) – outputs are more diverse, creative, and unpredictable, maybe less coherent
* Top P (0 to 1) = tokens with the highest probability scores until the sum (NOT AVE) of the scores reaches the specified threshold value. (Top-p sampling is also called nucleus sampling.)
* Low P (ex: 0.25) – consider the 25% most likely words, will make a more coherent response
* High P (ex: 0.99) – consider a broad range of possible words, possibly more creative and diverse output
* Top K - tokens with the highest probabilities until the specified number of tokens is reached.
* Low K (ex: 10) – more coherent response, less probable words
* High K (ex: 500) – more probable words, more diverse and creative
* Length – maximum length of the answer
* Stop Sequences – tokens that signal the model to stop generating output

NOTE:
* the more number of token = more diversity
* the higher probability =

Answer 107

A

The model will only choose from the most likely words (low TopP), but won’t go for the most most likely (High Temp).
It’s perfect for “creative” models, e.g., for writing fiction.

Answer 108

A

nucleus sampling

Answer 109

A

With a high Top-P (nucleus sampling) and low Top-K, the model will focus on a larger set of probable tokens (due to high Top-P) but only consider the lowest number of tokens (due to low Top-K) in the selection process, potentially leading to more predictable and less diverse.

Answer 110

A

Higher Temperature values encourage the model to take more risks, producing more creative and diverse outputs.

Answer 111

A

Top-p sampling dynamically selects tokens based on cumulative probability, adapting the number of tokens considered. Top-k sampling fixes the number of tokens to the top k most probable, regardless of their cumulative probability.

Answer 112

A

Yes, combining these parameters allows for finer control over the model’s output, but it’s essential to adjust them carefully to avoid unintended consequences.

Answer 113

A

If the randomness is too low (low Temperature, low Top-k/Top-p), the model may loop over high-probability tokens. Increasing the randomness can help introduce more variety

Answer 114

A

“coherence” refers to the logical flow, consistency, and clarity of the generated text, ensuring it makes sense as a whole and is easy for users to understand.

Answer 115

A

The model size
The model type itself (Llama has a different performance than Claude)
The number of tokens in the input (the bigger the slower)
The number of tokens in the output (the bigger the slower)

Answer 116

A

Latency is not impacted by Top P, Top K, Temperature

Answer 117

A

Present a task to the model
without providing examples or
explicit training for that specific task
You fully rely on the model’s
general knowledge
The larger and more capable the
FM, the more likely you’ll get good
results

Answer 118

A

Provide examples of a task to
the model to guide its output
We provide a “few shots” to
the model to perform the task
If you provide one example
only, this is also called
“one-shot” or “single-shot”

Answer 119

A

Divide the task into a sequence of
reasoning steps, leading to more
structure and coherence
Using a sentence like “Think step
by step” helps
Helpful when solving a problem as
a human usually requires several
steps
Can be combined with Zero-Shot
or Few-Shots Prompting

Answer 120

A

Simplify and standardize the process of
generating Prompts. the prompt will have a defined format with variables that user provides, then response will be a defined format too.

It Helps with:
* Processes user input text and output prompts from foundation models (FMs)
* Orchestrates between the FM, action groups, and knowledge bases
* Formats and returns responses to the user

You can also provide examples with few-shots prompting to improve the model performance

Prompt templates can be used with Bedrock Agents

Answer 121

A

Similar to SQL injection. The hacker passes values to prompt template variables which override the purpose of the template to do the harm that hacker intended.

Answer 122

A

Add explicit instructions to ignore any unrelated or potential
malicious content.

Answer 123

A

Fully managed Gen-AI assistant for your employees
Based on your company’s knowledge and data
- Answer questions, provide summaries, generate content, automate tasks
- Perform routine actions (e.g., submit time-off requests, send meeting invites)

Answer 124

A

Built on Amazon Bedrock (but you can’t choose the underlying FM). it uses multiple FMs.
40+ data connectors (Fully managed RAGs) - including AWS services and external data sources.
plugins - to interact with 3rd parties, e.g. create a ticket in JIRA
- we can create custom plugins too using APIs

Answer 125

A

It has a web application interface

Answer 126

A

User is authenticated and authorised using IAM Identity centre which can also be integrated with 3rd party IDPs

Answer 127

A

Controls and customize responses to your organizational needs
Admin controls == Guardrails
Block specific words or topics
Respond only with internal information (vs using external knowledge)
Global controls & topic-level controls (more granular rules)

Answer 128

A

as part of the Amazon Q business:
* Create Gen AI-powered apps without coding by using natural language
* Leverages your company’s internal data
* Possibility to leverage plugins (Jira, etc…)

Answer 129

A

Answer questions about the AWS
documentation and AWS service selection
Answer questions about resources in your AWS
account
Suggest CLI (Command Line Interface) to run
to make changes to your account
Helps you do bill analysis, resolve errors,
troubleshooting…
AI code companion to help you
code new applications (similar to
GitHub Copilot)
- Supports many languages: Java,
  JavaScript, Python, TypeScript, C#…
- Real-time code suggestions and
  security scans
- Software agent to implement
  features, generate documentation,
  bootstrapping new projects
  *IDE extension

Answer 130

A

yes if enabled

Answer 131

A

No, it can generates CLI commands for us, then we should run the command ourselves in Cloud shell.

Answer 132

A

Quicksight, EC2, AWS Chatbot, Glue

Answer 133

A

A AWS GenAI app-building playground (powered by Amazon Bedrock)
* Allows you to experiment creating GenAI apps with various FMs (no coding
or AWS account required)
* UI is similar to Amazon Q Apps (with less setup and no AWS account