Amazon Bedrock and Generative AI Flashcards
What is Generative AI?
A subset of deep learning that generates new data similar to its training data.
What types of data can Generative AI be trained on?
Text, images, audio, code, video, and more.
What is a foundation model?
A large, general-purpose AI model trained on massive amounts of data for a variety of tasks.
Name a few companies that create foundation models.
OpenAI, Meta, Amazon, Google, Anthropic.
What is an example of an open-source foundation model?
Meta’s LLaMA, Google’s BERT.
What is an LLM?
A Large Language Model trained to understand and generate human-like text.
How are LLMs trained?
On massive text datasets like books, websites, articles.
What does non-deterministic output mean in LLMs?
Same prompt can produce different outputs due to probabilistic word generation.
Why is LLM output non-deterministic?
It selects next words based on probability distributions, not fixed rules.
What are some tasks LLMs can perform?
Translation, summarization, Q&A, content generation.
How do diffusion models generate images?
By reversing a process that gradually adds noise to images.
What is forward diffusion?
A process where noise is added to an image over time until it’s unrecognizable.
What is reverse diffusion?
The process of removing noise step-by-step to generate an image from random noise.
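The forward/reverse idea can be illustrated with a toy one-dimensional sketch — not a real diffusion model (which learns to predict noise with a neural network), just the iterative "start from noise, repeatedly remove predicted noise" loop:

```python
import random

def toy_reverse_diffusion(steps: int = 100, target: float = 1.0) -> float:
    """Toy 1-D reverse diffusion: start from pure noise and iteratively
    denoise toward a target value. The 'predicted noise' here is computed
    directly; a real model would *learn* this prediction from data."""
    x = random.gauss(0, 1)          # start from random noise
    for _ in range(steps):
        predicted_noise = x - target  # stand-in for a learned noise predictor
        x = x - 0.1 * predicted_noise # remove a small fraction of the noise
    return x

sample = toy_reverse_diffusion()  # converges close to the target
```

In a real diffusion model, `x` is an image tensor and the noise predictor is a trained network, but the step-by-step denoising loop has the same shape.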
What is Stable Diffusion?
A text-to-image model (developed by the company Stability AI) that uses diffusion methods to generate images from text prompts or other images.
Can Gen AI generate text from images?
Yes, it can analyze an image and generate descriptive text or answer questions.
What is Amazon Bedrock?
A fully managed AWS service to build and scale generative AI applications using various foundation models.
Does your data leave your AWS account when using Bedrock?
No, all operations occur within your AWS account; data stays private.
What is the pricing model of Amazon Bedrock?
Pay-per-use.
What is meant by ‘unified API’ in Bedrock?
A standardized interface to access all supported foundation models, simplifying integration.
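As a sketch of what "unified API" means in practice, the snippet below builds keyword arguments in the shape of Bedrock's Converse API, which uses the same request format across models — the model ID shown is an example, and the exact schema should be verified against the current boto3 reference:

```python
def build_converse_request(model_id: str, prompt: str, max_tokens: int = 200) -> dict:
    """Build kwargs for the Bedrock Converse API (shape per AWS docs;
    verify field names against the current boto3 reference)."""
    return {
        "modelId": model_id,
        "messages": [{"role": "user", "content": [{"text": prompt}]}],
        "inferenceConfig": {"maxTokens": max_tokens},
    }

# With AWS credentials configured, the request could be sent with boto3:
#   client = boto3.client("bedrock-runtime")
#   response = client.converse(**build_converse_request(model_id, prompt))
request = build_converse_request(
    "anthropic.claude-3-haiku-20240307-v1:0",  # example model ID; check availability
    "Summarize RAG in one sentence.",
)
```

Swapping models means changing only `model_id` — the request shape stays the same, which is the point of the unified API.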
What companies provide models on Amazon Bedrock?
AI21 Labs, Cohere, Stability AI, Amazon, Anthropic, Meta, Mistral AI, and more.
Can you fine-tune foundation models on Amazon Bedrock?
Yes, using your own data, within your own account.
Does fine-tuning share your data with the model provider?
No, your data is never sent back to the model provider.
What is the Amazon Bedrock Playground?
An interactive interface to experiment with foundation models by submitting prompts.
What advanced features does Amazon Bedrock offer?
RAG (Retrieval-Augmented Generation), LLM agents, knowledge bases, security, and responsible AI features.
What is RAG in Amazon Bedrock?
A method to enhance model answers by retrieving relevant information from external data sources.
What is a knowledge base in Bedrock?
An external data store connected to Bedrock to provide domain-specific context for more accurate responses.
How does Amazon Bedrock support application integration?
Through a single unified API, making it easy to interact with different models programmatically.
Can you use Bedrock to build a chatbot?
Yes, using LLMs and additional tools like knowledge bases and RAG to create intelligent conversational agents.
What factors should you consider when selecting a foundation model on Amazon Bedrock?
Model type, performance, customization options, inference capabilities, licensing, context window, latency, modality support, compliance, and cost.
What is a multimodal foundation model?
A model that can accept and produce multiple types of data, such as text, audio, image, and video.
What is Amazon Titan?
A family of high-performing foundation models developed by AWS, with support for text and image generation, available via Amazon Bedrock.
Can Amazon Titan be customized with your own data?
Yes, it supports fine-tuning using your own data within your AWS account.
What is the trade-off between smaller and larger models?
Smaller models are more cost-effective but have limited knowledge; larger models are more capable but expensive.
What is Llama-2 and who created it?
A foundation model created by Meta, focused on English text generation and large-scale tasks.
What is Claude and who developed it?
A foundation model developed by Anthropic, known for its large context window and strong document analysis capabilities.
What is Stability AI known for on Bedrock?
Image generation using the Stable Diffusion model, useful for advertising and media content.
Why might a larger context window be useful?
It allows you to input large documents, code bases, or books, enabling the model to reason over more content.
What are use cases for Amazon Titan?
Content creation, classification, and educational applications.
What are use cases for Claude?
Analysis, forecasting, and document comparison due to its large context window.
What are use cases for Stability AI?
Image generation for advertising, media, and creative projects.
How does pricing affect foundation model choice?
More capable models may be more expensive; choosing a model that balances cost and performance is crucial.
How is pricing typically measured on Amazon Bedrock?
By the number of tokens processed (e.g., cost per 1,000 tokens).
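A quick back-of-the-envelope cost estimate under token-based pricing — the prices below are illustrative placeholders, not real rates; always check the current Bedrock pricing page:

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  price_in_per_1k: float, price_out_per_1k: float) -> float:
    # Input and output tokens are often billed at different per-1,000-token rates.
    return (input_tokens / 1000) * price_in_per_1k + (output_tokens / 1000) * price_out_per_1k

# Illustrative prices only: $0.003 per 1K input tokens, $0.015 per 1K output tokens.
cost = estimate_cost(50_000, 10_000, price_in_per_1k=0.003, price_out_per_1k=0.015)
print(f"${cost:.2f}")  # -> $0.30
```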
What is a potential risk when using foundation models with pay-per-use pricing?
Costs can escalate quickly if usage isn’t carefully monitored.
What is fine-tuning in Amazon Bedrock?
Adapting a copy of a foundation model by training it with your own data to improve performance on domain-specific tasks.
Where must training data be stored for fine-tuning in Amazon Bedrock?
In Amazon S3.
Does fine-tuning change the foundation model itself?
No, the base model is unchanged; fine-tuning trains a private copy of the model and updates that copy’s weights with your data.
What pricing model must you use for a fine-tuned model on Amazon Bedrock?
Provisioned throughput.
Are all models on Amazon Bedrock fine-tunable?
No, only certain models support fine-tuning; check each model’s details in the Bedrock console.
What is instruction-based fine-tuning?
Fine-tuning using labeled data with prompt-response pairs to improve performance on specific tasks.
What kind of data is used for instruction-based fine-tuning?
Labeled data with prompt-response pairs.
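Training data for instruction-based fine-tuning is typically a JSONL file of labeled prompt–response pairs uploaded to S3. The records below are hypothetical examples in the common prompt/completion shape — verify the exact schema required by your chosen model:

```python
import json

# Hypothetical labeled examples (prompt/completion pairs) for a
# sentiment-classification fine-tuning task.
examples = [
    {"prompt": "Classify the sentiment: 'Great service!'", "completion": "positive"},
    {"prompt": "Classify the sentiment: 'Very slow delivery.'", "completion": "negative"},
]

# One JSON object per line (JSONL); this file would be uploaded to Amazon S3
# and referenced when creating the fine-tuning job.
jsonl = "\n".join(json.dumps(e) for e in examples)
```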
What is continued pre-training in Bedrock?
Fine-tuning using unlabeled data to adapt a foundation model to a specific domain.
What is another name for continued pre-training?
Domain-adaptation fine-tuning.
When should you use continued pre-training?
When you have large amounts of unlabeled domain-specific data.
What is an example use case of continued pre-training?
Feeding the entire AWS documentation to make the model an AWS expert.
What are single-turn and multi-turn messaging in fine-tuning?
Fine-tuning approaches that teach a model how to handle one-turn or conversational multi-turn chat interactions.
What roles are defined in multi-turn messaging format?
System (optional context), User, and Assistant.
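A hypothetical multi-turn training record illustrating the three roles — the field names follow the system/user/assistant convention, but the exact schema should be checked against the AWS documentation for your model:

```python
# Optional system context, then alternating user/assistant turns.
record = {
    "system": "You are a helpful support agent.",  # optional context
    "messages": [
        {"role": "user", "content": "My order hasn't arrived."},
        {"role": "assistant", "content": "Sorry to hear that. Can you share the order number?"},
        {"role": "user", "content": "It's 12345."},
        {"role": "assistant", "content": "Thanks -- checking order 12345 now."},
    ],
}

roles = [m["role"] for m in record["messages"]]
```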
Which fine-tuning method is cheaper: instruction-based or continued pre-training?
Instruction-based fine-tuning is generally cheaper and uses less data.
What does continued pre-training require?
A large amount of unlabeled data and more computation, thus higher cost.
What is transfer learning?
Using a pre-trained model and adapting it to a new but related task—fine-tuning is a form of transfer learning.
What is a practical use case for transfer learning in image classification?
Using a pre-trained model for edge detection and adapting it to classify a specific kind of image.
What’s the difference between transfer learning and fine-tuning?
Fine-tuning is a specific application of transfer learning tailored to refining model behavior with new data.
When is fine-tuning a good idea?
When you need a custom tone/persona, work with proprietary data, or aim to improve accuracy for specific tasks.
What kind of data would trigger instruction-based fine-tuning?
Labeled data with prompt-response examples.
What kind of data would trigger continued pre-training?
Unlabeled data, such as raw domain-specific documentation.
Why is provisioned throughput more expensive?
It provides dedicated infrastructure for consistent performance with fine-tuned models.
What type of expert might be needed for fine-tuning a model?
A machine learning engineer, though Bedrock simplifies the process.
What is Automatic Evaluation in Amazon Bedrock?
A feature for evaluating a model for quality control: you submit tasks and benchmark datasets, and its performance is automatically scored using judge models.
What are the built-in task types available for automatic evaluation in Bedrock?
Text summarization, question and answer, text classification, and open-ended text generation.
What are benchmark questions and answers used for?
They help test the model by comparing its generated answers to ideal (benchmark) answers to assess accuracy.
What is the purpose of a judge model in automatic evaluation?
The judge model compares the model-generated answer to the benchmark answer and assigns a score based on similarity.
Can you bring your own benchmark dataset in Amazon Bedrock?
Yes, you can use your own or a curated dataset from AWS.
What are the benefits of using benchmark datasets?
They help measure accuracy, speed, scalability, and detect bias in the model.
What is the difference between automatic and human evaluation?
Automatic uses judge models and metrics, while human evaluation involves people scoring the outputs based on criteria like relevance or correctness.
What kind of metrics are used in human evaluation?
Thumbs up/down, ranking, and other grading scales.
What does ROUGE stand for?
Recall-Oriented Understudy for Gisting Evaluation.
What is ROUGE used for?
Evaluating summarization and machine translation by comparing n-grams in reference and generated text.
What is ROUGE-N?
A ROUGE metric measuring how many n-grams (e.g., 1-gram, 2-gram) match between reference and generated texts.
What is ROUGE-L?
It computes the longest common subsequence between the reference and generated text.
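A minimal sketch of both metrics — ROUGE-N as clipped n-gram recall and ROUGE-L via the longest common subsequence (real implementations such as the `rouge-score` package add stemming and F-measures):

```python
from collections import Counter

def rouge_n_recall(reference: str, candidate: str, n: int = 1) -> float:
    """ROUGE-N recall: matching n-grams (clipped) over reference n-grams."""
    def ngrams(text: str, n: int) -> Counter:
        toks = text.lower().split()
        return Counter(tuple(toks[i:i + n]) for i in range(len(toks) - n + 1))
    ref, cand = ngrams(reference, n), ngrams(candidate, n)
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / max(sum(ref.values()), 1)

def lcs_length(reference: str, candidate: str) -> int:
    """Word-level longest common subsequence, the core of ROUGE-L."""
    a, b = reference.lower().split(), candidate.lower().split()
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            dp[i + 1][j + 1] = dp[i][j] + 1 if x == y else max(dp[i][j + 1], dp[i + 1][j])
    return dp[len(a)][len(b)]

r1 = rouge_n_recall("the cat sat on the mat", "the cat is on the mat")  # 5/6
lcs = lcs_length("the cat sat on the mat", "the cat is on the mat")     # 5
```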
What does BLEU stand for?
Bilingual Evaluation Understudy.
What is BLEU used for?
Evaluating the quality of translated text, focusing on precision and penalizing brevity.
What does BERTScore evaluate?
Semantic similarity between texts using embeddings and cosine similarity.
Why is BERTScore better than ROUGE or BLEU for nuanced text?
Because it compares meanings using embeddings rather than just word overlap.
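The comparison at the heart of BERTScore is cosine similarity between embedding vectors. A minimal sketch, with tiny toy vectors standing in for real embeddings (which have hundreds of dimensions):

```python
import math

def cosine_similarity(u: list[float], v: list[float]) -> float:
    # Cosine of the angle between two vectors: 1.0 = same direction.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Toy 3-dimensional "embeddings" of two semantically similar sentences.
sim = cosine_similarity([0.1, 0.9, 0.2], [0.1, 0.8, 0.3])  # close to 1.0
```

This is why BERTScore can rate a paraphrase highly even with little word overlap: similar meanings map to nearby embedding vectors.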
What is perplexity in the context of language models?
A measure of how well the model predicts the next token; lower is better.
What does low perplexity indicate?
That the model is confident and accurate in predicting the next token.
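Perplexity is the exponential of the average negative log-probability the model assigns to each next token — a small worked example using made-up token probabilities:

```python
import math

def perplexity(token_probs: list[float]) -> float:
    # exp of the average negative log-probability; lower = more confident.
    return math.exp(-sum(math.log(p) for p in token_probs) / len(token_probs))

confident = perplexity([0.9, 0.8, 0.95])  # high probabilities -> low perplexity
uncertain = perplexity([0.2, 0.1, 0.3])   # low probabilities  -> high perplexity
```

A model that assigns every token probability 0.5 has perplexity exactly 2 — intuitively, it is as uncertain as a coin flip at each step.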
What can be done with evaluation metrics in a feedback loop?
They can be used to retrain and improve model outputs over time.
Name some business metrics to evaluate a foundation model.
User satisfaction, average revenue per user, cross-domain performance, conversion rates, efficiency.
Why would you create a custom benchmark dataset?
To evaluate the model using criteria specific to your business needs.
What does RAG stand for in generative AI?
Retrieval Augmented Generation
What is the core idea behind RAG?
It allows a foundation model to reference external data sources without fine-tuning.
What AWS service is used to manage the knowledge base in a RAG system?
Amazon Bedrock
What storage service is commonly used as the data source for the knowledge base in AWS Bedrock?
Amazon S3
What type of database underlies a knowledge base in a RAG system?
Vector database
What does a vector database store in the context of RAG?
Vector embeddings of chunks of data for semantic search
What are embeddings in the context of RAG?
Numerical representations of text used to measure similarity
What happens to a user’s query in RAG before being sent to the foundation model?
It is augmented before being sent: the query is first used to search the vector database (the knowledge base) for related chunks, the retrieved text is combined with the original query (“original query + retrieved text”), and the foundation model then generates the final output from this augmented prompt.
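The retrieve-then-augment flow can be sketched in a few lines. The toy retriever below scores chunks by keyword overlap purely for illustration — a real RAG system retrieves by vector-embedding similarity from the knowledge base:

```python
def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    # Toy keyword-overlap retrieval; a real system uses embedding similarity.
    def score(chunk: str) -> int:
        return len(set(query.lower().split()) & set(chunk.lower().split()))
    return sorted(chunks, key=score, reverse=True)[:k]

def augment_prompt(query: str, retrieved: list[str]) -> str:
    # "Original query + retrieved text" sent to the foundation model.
    context = "\n".join(retrieved)
    return f"Use the following context to answer.\n\nContext:\n{context}\n\nQuestion: {query}"

chunks = [
    "amazon bedrock is a managed aws service",
    "bananas are rich in potassium",
    "rag retrieves external data at query time",
]
prompt = augment_prompt("what is amazon bedrock",
                        retrieve("what is amazon bedrock", chunks, k=1))
```

The augmented `prompt` is what actually reaches the foundation model, which is why it can answer from data it was never trained on.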
Name two AWS services that can be used as vector databases for RAG.
Amazon OpenSearch Service, Amazon Aurora
Name three third-party vector databases supported by AWS Bedrock.
MongoDB, Redis, Pinecone
What happens if no vector database is specified in AWS Bedrock?
Amazon Bedrock automatically creates an Amazon OpenSearch Serverless vector store by default
Which two models can be used for embeddings in AWS Bedrock?
Amazon Titan, Cohere
Can the embeddings model and foundation model be different in AWS Bedrock?
Yes
What is the purpose of chunking documents in RAG?
To split them into smaller parts for vector embedding and search
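A minimal sketch of one common chunking strategy — fixed-size word windows with overlap, so context is not lost at chunk boundaries (chunk sizes and overlap are tuning choices, not fixed rules):

```python
def chunk_text(text: str, chunk_size: int = 50, overlap: int = 10) -> list[str]:
    # Fixed-size word chunks; each chunk shares `overlap` words with the next.
    words = text.split()
    step = chunk_size - overlap
    return [" ".join(words[i:i + chunk_size])
            for i in range(0, max(len(words), 1), step)]

chunks = chunk_text("word " * 120, chunk_size=50, overlap=10)  # 3 chunks
```

Each resulting chunk would then be embedded and stored in the vector database for semantic search.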
What kind of performance does Amazon OpenSearch offer for RAG?
Real-time similarity search with scalable index management and KNN support
What is Amazon DocumentDB best known for in RAG use cases?
A MongoDB-compatible NoSQL database with support for real-time vector similarity search
Which two relational databases are supported for vector storage in AWS Bedrock?
Amazon Aurora, Amazon RDS for PostgreSQL
What AWS service should you choose for graph-based data in a RAG system?
Amazon Neptune
What are the common data sources for AWS Bedrock knowledge bases?
Amazon S3, Confluence, SharePoint, Salesforce, Webpages
Give one use case for RAG in customer support.
Building a chatbot that retrieves answers from product documentation and FAQs
Give one use case for RAG in legal research.
Chatbot answering legal queries based on case law, regulations, and legal opinions
Give one use case for RAG in healthcare.
AI assistant answering medical questions based on treatments and research papers