Section 3: Intro to AWS and Cloud Computing Flashcards

Question 1

Q

What are the 6 advantages of Cloud Computing?

Answer

A

Trade CAPEX for OPEX
Massive Economies of Scale
Stop Guessing Capacity
Increase Speed and Agility
Stop spending money running and maintaining DC
Go Global in minutes.

Question 2

Q

Types of Cloud Computing

Answer

A

IaaS
SaaS
PaaS

Question 3

Q

You Manage in IaaS

Answer

A

Applications
Data
Runtime
Middleware
OS

Question 4

Q

You Manage by in PaaS

Answer

A

Applications
Data

Question 5

Q

You manage in SaaS

Question 6

Q

Pricing Models

Answer

A

Compute - Pay for compute time.

Storage - Pay for data stored in the cloud

Data transfer OUT of the cloud

Question 7

Q

What is Gen AI?

Answer

A

Creating NEW data based on prompts.

Question 8

Q

Name of ChatGPT Foundation Model?

Question 9

Q

LLM

Answer

A

Large Language Model

Designed to create coherent, human-like, text.

ChatGPT is example.

Question 10

Q

What is needed to use an LLM?

Answer

A

Prompt

“what is AWS”

Question 11

Q

AWS’s Gen AI Tool

Question 12

Q

Amazon Titan

Answer

A

High Performing FM from AWS

Question 13

Q

What is Fine-Tuning?

Answer

A

When you adapt a copy of a foundation model with your own data.

Question 14

Q

Where do you add data for fine tuning?

Answer

A

S3 Bucket

Question 15

Q

Instruction based fine tuning uses what?

Answer

A

labeled examples that are prompt-response pairs.

Question 16

Q

Single Turn Messaging

Answer

A

Part of instruction based fine tuning, to determine how a chatbot should reply.

System
Messages
role
Content

Question 17

Q

Multi-Turn messaing

Answer

A

Chatbot conversation. How to handle them.

Question 18

Q

Transfer Learning

Answer

A

The broader concept of re-using a pre-trained model to adapt it to a new related task.

Widely used for image classification

Question 19

Q

Use Cases for Fine Tuning

Answer

A

A chatbot
Training using up to date information
training with exclusive data

Question 20

Q

ROUGE

Answer

A

Recall-Oriented Understudy for Gisting Evaluation

Evaluating automatic summarization and machine translation systems.

ROUGE-N = Measure the number of matching n-grams between reference and generated text.

Question 21

Q

ROUGE-N

Answer

A

Measure the number of matching n-grams between reference and generated text

Question 22

Q

ROUGE-L

Answer

A

Longest common subsequence between a reference and generated text.

Question 23

Q

BLEU

Answer

A

Bilingual Evaluation Understudy

-Evaluate translation text. Considers precision and brevity.

Question 24

Q

BERTScore

Answer

A

Semantic similarity between generated texts

Question 25

Q

Perplexity

Answer

A

How well the model predicts the next token (lower is better).

Question 26

Q

ARPU

Answer

A

Average Revenue per User

Business Metric to evaluate a model

Question 27

Q

RAG

Answer

A

Retrieval-Augmented Generation

Allows a FM to reference a data source outside of its training data.

Question 28

Q

RAG Vector Databases

Answer

A

Amazon OpenSearch Service - search and analytics database.
Amazon DocumentDB [MongoDB compatibility] - NoSQL database.
Aurora - ralational DB, proprietary on AWS
Amazon RDS for PostgreSQl - relational DB, open source
Amazon Neptune - graph database

Question 29

Q

RAG Data Sources

Answer

A

S3
Confluence
SharePoint
Salesforce
Web pages

Question 30

Q

tokenization

Answer

A

converting raw text into a sequence of tokens

Question 31

Q

types of tokenization

Answer

A

Word-based - text is split into individual words

Subword - some words can be split too (un-help-ful)

Question 32

Q

Context Window

Answer

A

the number of tokens an LLM can consider when generating text.

The larger the context window, the more information.

Question 33

Q

What is the first factor to consider when looking at a model?

Answer

A

Context Window

Question 34

Q

Embeddings

Answer

A

Create vectors out of text, images, or audio.

Question 35

Q

Vector

Answer

A

Array of numerical values. So each word as some/many numerical values.

Question 36

Q

What can really power search applications?

Answer

A

Embedding models

Question 37

Q

Guardrails

Answer

A

Control the interaction between users and FM in Bedrock.

Filter out harmful and undesirable content.

Can remove PII, enhance privacy

Question 38

Q

Agents

Answer

A

manages and carry out various multi-step tasks related to infrastructure provisioning, application deployment, and operational activities.

Think like a chatbot agent

Question 39

Q

Model Invocation Logging?

How?

Answer

A

Sending logs of all invocations to Amazon CloudWatch and S3

AWS Cloudwatch

Question 40

Q

Bedrock Studio

Answer

A

Gives access to Amazon Bedrock to your team so they can easily create AI-powered applications.

Question 41

Q

Watermark Detection

Answer

A

Check if an image was generated by Amazon Titan Generator

Question 42

Q

Bedrock Pricing Models

Answer

A

On-Demand, Text and Embedding are per token, image is per generated image.
Batch - multiple predictions at a time, discounts up to 50%.
Provisioned - purchase model unites for a certain time.

Question 43

Q

Model Improvement Cost order

Answer

A

$ - Prompt Engineering
$$ - Retrieval Augmented Generation (RAG)
$$$ - Instruction-based Fine-tuning
$$$$ - Domain Adaption fine tuning

Question 44

Q

What type of Gen AI can recognize and interpret various forms of input data, such as text, images, and audio?

Answer

A

Multimodal model

Question 45

Q

Which AWS service can help store embeddings within vector databases?

Answer

A

Amazon OpenSearch Serverless

Question 46

Q

Prompt Engineering

Answer

A

Developing, designing, and optimizing prompts to enhance the output of FMs for your needs.

Question 47

Q

Improved Prompting Consists of?

Answer

A

Instructions - a task for the model to do.
Context - external information to guide the model
Input Data - the input for which you want a response.
Output Indicator - the output type or format.

Question 48

Q

Negative Prompting

Answer

A

A technique where you explicitly instruct the model on what NOT to include or do in its response.

Question 49

Q

Prompt Performance - System Prompts

Answer

A

How the model should behave and reply.

Question 50

Q

Prompt Performance - Temperature

Answer

A

Value: 0-1

Creativity of the model’s output.

Low Value - more conservative

High Value - more diverse, less predictable, less coherent.

Question 51

Q

Prompt Performance - Top P

Answer

A

Value 0-1

Low P - Consider the 25% most likely words, more coherent.

High P - Consider a broad range of possible words.

Question 52

Q

Prompt Performance - Top K

Answer

A

Limits the number of probable words.

Low K - more coherent, less probable words.

High K - more probable words, more diverse

Question 53

Q

Prompt Performance - Length

Answer

A

Maximum Length of the answer

Question 54

Q

Prompt Performance - Stop Sequence

Answer

A

Tokens that signal the model to stop generating output.

Question 55

Q

Prompt Latency

Answer

A

How fast the model responds.

Impacted by model size, model type, number of token in input, number of tokens in output.

Not impacted by Top P, Top K, Temperature!!

Question 56

Q

Zero Shot Prompting

Answer

A

Present a task to the model without providing examples or explicit training for that specific task.

Question 57

Q

Few Shots Prompting

Answer

A

Provide examples of a task to the model to guides its output.

Question 58

Q

Chain of Thought Prompting

Answer

A

Divide the task into a sequence of reasoning steps, leading to more structure and coherence.

Think ‘Step by Step”

Question 59

Q

How to simplify and standardize the process of generating prompts?

Answer

A

Prompt Templates

Question 60

Q

AWS’s Solution for a fully managed Gen AI based on your company’s knowledge and data?

Answer

A

Amazon Q Business

Question 61

Q

What is Amazon Q Built on?

Which FM?

Answer

A

Built on Amazon Bedrock

Can’t choose the FM, it consists of a few.

Question 62

Q

What benefit is there by having Amazon Q + IAM Identity Center?

Answer

A

Users receive responses generated only from the documents they have access to.

Question 63

Q

Amazon Q Business - Admin Controls

Answer

A

Controls and customize responses to your organizational needs.

Admin Controls = Gaurdrails

Question 64

Q

Q Apps

Answer

A

Part of Amazon Q Business

Create Gen AI powered apps without coding by using natural language.

Answer 62

A

Answer questions about the AWS documentation and AWS Service selection.
Answer questions about resources in your AWS account.
Suggest CLI to run to make changes to your account.
Helps you do bill analysis, resolve errors, troubleshooting
AI Code companion

Answer 63

A

Used to visualize your data and create dashboards about them.

Answer 64

A

EC2 - instances are virtual servers.

Amazon Q for EC2 - provides guidance and suggestions for EC2 instance types that are best suited to your new workload.

Answer 65

A

Glue - is an ETL (Extract Transform and Load) service used to move data across places.

Answer 66

A

GenAI app-building playground (powered by Bedrock)

Answer 67

A

An AI Coding assistant

Answer 68

A

Data Layer - where you collect vast amount of data.
ML Framework & Algorithm Layer
Model Layer - implement a model and train it.

Answer 69

A

Machine Learning

Type of AI for building methods that allow machines to learn.

Data is what is leveraged.

Great for making predictions.

Answer 70

A

Subset of Machine Learning.

Uses neurons and synapses like our brain, to train models.

Process more complex patterns in the data than traditional ML.

Answer 71

A

Deep because there’s more than one layer of learning.

Answer 72

A

Part of Deep Learning

Image classification, object detection, and image segmentation.

Answer 73

A

Natural Language Processing

Part of Deep Learning

test classification, sentiment analysis, machine translation, language generation.

Answer 74

A

Able to process a sentence as a whole instead of word by word.

Answer 75

A

Adding or subtracting noise from an image.

Answer 76

A

Multiple types of inputs, and can create multiple types of outputs.

Answer 77

A

GENERATIVE PRE-TRAINED TRANSFORMER

Generate human text or computer code based on input prompts.

Answer 78

A

BIDIRECTIONAL ENCODER REPRESENTATIONS FROM TRANSFORMERS

Similar intent to GPT, but reads the text in two directions.

Answer 79

A

RECURRENT NEURAL NETWORK

Meant for sequential data such as time-series or text, useful in speech recognition, time-series prediction.

Answer 80

A

RESIDUAL NETWORK

Deep Convolutional Neural Network (CDN) used for image recognition tasks, objects detection, facial recognition.

Answer 81

A

SUPPORT VECTOR MACHINE

ML algorithm for classification and regression.

Answer 82

A

model to generate raw audio waveform, used in speech synthesis.

Answer 83

A

GENERATIVE ADVERSARIAL NETWORK

Models used to gnerate synthetic data such as images, videos, or sounds that resemble the training data.

Helpful for data augmentation.

Answer 84

A

EXTREME GRADIENT BOOSTING

An implementation of gradient boosting.

Answer 85

A

Labeled - includes both input features and corresponding output labels.

Unlabeled - Data that includes only input features without any output labels.

Answer 86

A

Structured - Put into rows and columns (like excel)

Unstrcuted - no rhyme or reason.

Answer 87

A

Data that is arranged in a table with rows. Structued Data.

Answer 88

A

Structured Data

Data points collected or recorded at successive points in time.

Answer 89

A

Unstructured Data

Answer 90

A

Used to predict a numeric value based on input data.

Output variable is CONTINUOUS

Answer 91

A

Used to predict the categorical label of input data.

Output variable is DISCRETE

Answer 92

A

Used to tune model parameters and validate performance.

Answer 93

A

Process of using domain knowledge to select and transform raw data into meaningful features.