Amazon Bedrock and Generative AI Flashcards

1
Q

What is Generative AI?

A

A subset of deep learning that generates new data similar to its training data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What types of data can Generative AI be trained on?

A

Text, images, audio, code, video, and more.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a foundation model?

A

A large, general-purpose AI model trained on massive amounts of data for a variety of tasks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Name a few companies that create foundation models.

A

OpenAI, Meta, Amazon, Google, Anthropic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is an example of an open-source foundation model?

A

Meta’s LLaMA, Google’s BERT.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an LLM?

A

A Large Language Model trained to understand and generate human-like text.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How are LLMs trained?

A

On massive text datasets like books, websites, articles.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does non-deterministic output mean in LLMs?

A

Same prompt can produce different outputs due to probabilistic word generation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why is LLM output non-deterministic?

A

It selects next words based on probability distributions, not fixed rules.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are some tasks LLMs can perform?

A

Translation, summarization, Q&A, content generation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do diffusion models generate images?

A

By reversing a process that gradually adds noise to images.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is forward diffusion?

A

A process where noise is added to an image over time until it’s unrecognizable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is reverse diffusion?

A

The process of removing noise step-by-step to generate an image from random noise.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Stable Diffusion?

A

A model/company using diffusion methods to generate images from text or other images.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Can Gen AI generate text from images?

A

Yes, it can analyze an image and generate descriptive text or answer questions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Amazon Bedrock?

A

A fully managed AWS service to build and scale generative AI applications using various foundation models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Does your data leave your AWS account when using Bedrock?

A

No, all operations occur within your AWS account; data stays private.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the pricing model of Amazon Bedrock?

A

Pay-per-use.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is meant by ‘unified API’ in Bedrock?

A

A standardized interface to access all supported foundation models, simplifying integration.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What companies provide models on Amazon Bedrock?

A

AI21 Labs, Cohere, Stability.ai, Amazon, Anthropic, Meta, Mistral AI, and more.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Can you fine-tune foundation models on Amazon Bedrock?

A

Yes, using your own data, within your own account.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Does fine-tuning share your data with the model provider?

A

No, your data is never sent back to the model provider.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the Amazon Bedrock Playground?

A

An interactive interface to experiment with foundation models by submitting prompts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What advanced features does Amazon Bedrock offer?

A

RAG (Retrieval-Augmented Generation), LLM agents, knowledge bases, security, and responsible AI features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is RAG in Amazon Bedrock?

A

A method to enhance model answers by retrieving relevant information from external data sources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is a knowledge base in Bedrock?

A

An external data store connected to Bedrock to provide domain-specific context for more accurate responses.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

How does Amazon Bedrock support application integration?

A

Through a single unified API, making it easy to interact with different models programmatically.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Can you use Bedrock to build a chatbot?

A

Yes, using LLMs and additional tools like knowledge bases and RAG to create intelligent conversational agents.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What factors should you consider when selecting a foundation model on Amazon Bedrock?

A

Model type, performance, customization options, inference capabilities, licensing, context window, latency, modality support, compliance, and cost.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is a multimodal foundation model?

A

A model that can accept and produce multiple types of data, such as text, audio, image, and video.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is Amazon Titan?

A

A high-performing foundation model developed by AWS with support for text and image generation, available via Amazon Bedrock.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Can Amazon Titan be customized with your own data?

A

Yes, it supports fine-tuning using your own data within your AWS account.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is the trade-off between smaller and larger models?

A

Smaller models are more cost-effective but have limited knowledge; larger models are more capable but expensive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is Llama-2 and who created it?

A

A foundation model created by Meta, focused on English text generation and large-scale tasks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is Claude and who developed it?

A

A foundation model developed by Anthropic, known for its large context window and strong document analysis capabilities.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is Stability AI known for on Bedrock?

A

Image generation using the Stable Diffusion model, useful for advertising and media content.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Why might a larger context window be useful?

A

It allows you to input large documents, code bases, or books, enabling the model to reason over more content.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What are use cases for Amazon Titan?

A

Content creation, classification, and educational applications.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What are use cases for Claude?

A

Analysis, forecasting, and document comparison due to its large context window.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What are use cases for Stability AI?

A

Image generation for advertising, media, and creative projects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

How does pricing affect foundation model choice?

A

More capable models may be more expensive; choosing a model that balances cost and performance is crucial.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

How is pricing typically measured on Amazon Bedrock?

A

By the number of tokens processed (e.g., cost per 1,000 tokens).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What is a potential risk when using foundation models with pay-per-use pricing?

A

Costs can escalate quickly if usage isn’t carefully monitored.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What is fine-tuning in Amazon Bedrock?

A

Adapting a copy of a foundation model by training it with your own data to improve performance on domain-specific tasks.

45
Q

Where must training data be stored for fine-tuning in Amazon Bedrock?

A

In Amazon S3.

46
Q

Does fine-tuning change the foundation model itself?

A

Yes, it updates the model’s weights based on your data.

47
Q

What pricing model must you use for a fine-tuned model on Amazon Bedrock?

A

Provisioned throughput.

48
Q

Are all models on Amazon Bedrock fine-tunable?

A

No, only some models, typically open-source ones, support fine-tuning.

49
Q

What is instruction-based fine-tuning?

A

Fine-tuning using labeled data with prompt-response pairs to improve performance on specific tasks.

50
Q

What kind of data is used for instruction-based fine-tuning?

A

Labeled data with prompt-response pairs.

51
Q

What is continued pre-training in Bedrock?

A

Fine-tuning using unlabeled data to adapt a foundation model to a specific domain.

52
Q

What is another name for continued pre-training?

A

Domain-adaptation fine-tuning.

53
Q

When should you use continued pre-training?

A

When you have large amounts of unlabeled domain-specific data.

54
Q

What is an example use case of continued pre-training?

A

Feeding the entire AWS documentation to make the model an AWS expert.

55
Q

What are single-turn and multi-turn messaging in fine-tuning?

A

Fine-tuning approaches that teach a model how to handle one-turn or conversational multi-turn chat interactions.

56
Q

What roles are defined in multi-turn messaging format?

A

System (optional context), User, and Assistant.

57
Q

Which fine-tuning method is cheaper: instruction-based or continued pre-training?

A

Instruction-based fine-tuning is generally cheaper and uses less data.

58
Q

What does continued pre-training require?

A

A large amount of unlabeled data and more computation, thus higher cost.

59
Q

What is transfer learning?

A

Using a pre-trained model and adapting it to a new but related task—fine-tuning is a form of transfer learning.

60
Q

What is a practical use case for transfer learning in image classification?

A

Using a pre-trained model for edge detection and adapting it to classify a specific kind of image.

61
Q

What’s the difference between transfer learning and fine-tuning?

A

Fine-tuning is a specific application of transfer learning tailored to refining model behavior with new data.

62
Q

When is fine-tuning a good idea?

A

When you need a custom tone/persona, work with proprietary data, or aim to improve accuracy for specific tasks.

63
Q

What kind of data would trigger instruction-based fine-tuning?

A

Labeled data with prompt-response examples.

64
Q

What kind of data would trigger continued pre-training?

A

Unlabeled data, such as raw domain-specific documentation.

65
Q

Why is provisioned throughput more expensive?

A

It provides dedicated infrastructure for consistent performance with fine-tuned models.

66
Q

What type of expert might be needed for fine-tuning a model?

A

A machine learning engineer, though Bedrock simplifies the process.

67
Q

What is Automatic Evaluation in Amazon Bedrock?

A

It’s a feature to evaluate a model for quality control by submitting it tasks and using benchmark datasets, then automatically scoring its performance using judge models.

68
Q

What are the built-in task types available for automatic evaluation in Bedrock?

A

Text summarization, question and answer, text classification, and open-ended text generation.

69
Q

What are benchmark questions and answers used for?

A

They help test the model by comparing its generated answers to ideal (benchmark) answers to assess accuracy.

70
Q

What is the purpose of a judge model in automatic evaluation?

A

The judge model compares the model-generated answer to the benchmark answer and assigns a score based on similarity.

71
Q

Can you bring your own benchmark dataset in Amazon Bedrock?

A

Yes, you can use your own or a curated dataset from AWS.

72
Q

What are the benefits of using benchmark datasets?

A

They help measure accuracy, speed, scalability, and detect bias in the model.

73
Q

What is the difference between automatic and human evaluation?

A

Automatic uses judge models and metrics, while human evaluation involves people scoring the outputs based on criteria like relevance or correctness.

74
Q

What kind of metrics are used in human evaluation?

A

Thumbs up/down, ranking, and other grading scales.

75
Q

What does ROUGE stand for?

A

Recall-Oriented Understudy for Gisting Evaluation.

76
Q

What is ROUGE used for?

A

Evaluating summarization and machine translation by comparing n-grams in reference and generated text.

77
Q

What is ROUGE-N?

A

A ROUGE metric measuring how many n-grams (e.g., 1-gram, 2-gram) match between reference and generated texts.

78
Q

What is ROUGE-L?

A

It computes the longest common subsequence between the reference and generated text.

79
Q

What does BLEU stand for?

A

Bilingual Evaluation Understudy.

80
Q

What is BLEU used for?

A

Evaluating the quality of translated text, focusing on precision and penalizing brevity.

81
Q

What does BERTScore evaluate?

A

Semantic similarity between texts using embeddings and cosine similarity.

82
Q

Why is BERTScore better than ROUGE or BLEU for nuanced text?

A

Because it compares meanings using embeddings rather than just word overlap.

83
Q

What is perplexity in the context of language models?

A

A measure of how well the model predicts the next token; lower is better.

84
Q

What does low perplexity indicate?

A

That the model is confident and accurate in predicting the next token.

85
Q

What can be done with evaluation metrics in a feedback loop?

A

They can be used to retrain and improve model outputs over time.

86
Q

Name some business metrics to evaluate a foundation model.

A

User satisfaction, average revenue per user, cross-domain performance, conversion rates, efficiency.

87
Q

Why would you create a custom benchmark dataset?

A

To evaluate the model using criteria specific to your business needs.

88
Q

What does RAG stand for in generative AI?

A

Retrieval Augmented Generation

89
Q

What is the core idea behind RAG?

A

It allows a foundation model to reference external data sources without fine-tuning.

90
Q

What AWS service is used to manage the knowledge base in a RAG system?

A

Amazon Bedrock

91
Q

What storage service is commonly used as the data source for the knowledge base in AWS Bedrock?

92
Q

What type of database underlies a knowledge base in a RAG system?

A

Vector database

93
Q

What does a vector database store in the context of RAG?

A

Vector embeddings of chunks of data for semantic search

94
Q

What are embeddings in the context of RAG?

A

Numerical representations of text used to measure similarity

95
Q

What happens to a user’s query in RAG before being sent to the foundation model?

A

It’s augmented with retrieved information from the knowledge base, First model will search all related data to query from vector DB then pass it to main FM “Original Query + Retrieved Text “ . then main FM generates final output

96
Q

Name two AWS services that can be used as vector databases for RAG.

A

Amazon OpenSearch Service, Amazon Aurora

97
Q

Name three third-party vector databases supported by AWS Bedrock.

A

MongoDB, Redis, Pinecone

98
Q

What happens if no vector database is specified in AWS Bedrock?

A

AWS automatically creates a serverless OpenSearch vector database

99
Q

Which two models can be used for embeddings in AWS Bedrock?

A

Amazon Titan, Cohere

100
Q

Can the embeddings model and foundation model be different in AWS Bedrock?

101
Q

What is the purpose of chunking documents in RAG?

A

To split them into smaller parts for vector embedding and search

102
Q

What kind of performance does Amazon OpenSearch offer for RAG?

A

Real-time similarity search with scalable index management and KNN support

103
Q

What is Amazon DocumentDB best known for in RAG use cases?

A

NoSQL compatibility and support for real-time vector similarity search

104
Q

Which two relational databases are supported for vector storage in AWS Bedrock?

A

Amazon Aurora, Amazon RDS for PostgreSQL

105
Q

What AWS service should you choose for graph-based data in a RAG system?

A

Amazon Neptune

106
Q

What are the common data sources for AWS Bedrock knowledge bases?

A

Amazon S3, Confluence, SharePoint, Salesforce, Webpages

107
Q

Give one use case for RAG in customer support.

A

Building a chatbot that retrieves answers from product documentation and FAQs

108
Q

Give one use case for RAG in legal research.

A

Chatbot answering legal queries based on case law, regulations, and legal opinions

109
Q

Give one use case for RAG in healthcare.

A

AI assistant answering medical questions based on treatments and research papers