Chapter 4 - AI Flashcards

1
Q

What is a narrow ML model?

A

Designed for specific tasks using custom-collected and
labeled datasets.

2
Q

What is a foundation model in ML?

A

Pretrained for general purposes, i.e., not for one specific purpose; adaptable to a variety of specific tasks.

3
Q

Advantages of Narrow ML Models

A

Tailored to specific tasks, potentially offering higher precision for particular applications.
Greater control over the model development process, allowing for customized adjustments and integration of safety measures (guardrails).

4
Q

Challenges with Narrow ML Models

A

High cost of data collection, labeling, and computation during training.
Potentially limited by the quality and quantity of available training data.

5
Q

Advantages of Foundation Models

A

Can reduce the amount of human labor and time required for model development.
Offer a flexible base that can be fine-tuned for accuracy improvements across diverse tasks.
Economical in terms of reuse and scalability, especially when shared within an organization or accessed via an API.

6
Q

Challenges of Foundation Models

A

Training effort is infeasible for many organizations.
Potentially high cost and compute requirements for inference (i.e., runtime use).

7
Q

What is a key factor in deciding between Narrow ML and FMs?

A

Cost is a key factor in deciding between Narrow ML models and FMs.

▪ Development cost – similar at the system level; lower when using existing FMs
▪ Maintenance cost – depends on customization / model size
▪ Operation costs – typically higher for FMs

8
Q

Techniques for Customizing FMs

A

▪ Prompt Engineering
▪ Retrieval-Augmented Generation (RAG)
▪ Fine-Tuning
▪ Distilling

9
Q

What is Distilling?

A

Simplifying and compressing the model to enhance efficiency while retaining performance, useful for deployment on resource-constrained environments.

▪ Distillation is used to create a smaller “student” model that mimics the behavior of a larger, more complex “teacher” foundation model.
▪ Goal: the resulting student model is (close to) as good as the teacher model in some areas of interest, but more efficient.
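As an illustrative sketch (not tied to any particular framework), a knowledge-distillation objective can be written as a weighted mix of a data loss against the true labels and a distillation loss against the teacher's output distribution; the function names and the `alpha` weight below are assumptions:

```python
# Toy knowledge-distillation loss, assuming teacher and student both emit
# probability distributions over the same classes. Purely illustrative.
import math

def cross_entropy(p_true, p_pred):
    """Data loss: distance of student predictions from the true labels."""
    return -sum(t * math.log(max(q, 1e-12)) for t, q in zip(p_true, p_pred))

def kl_divergence(p_teacher, p_student):
    """Distillation loss: distance of the student from the teacher's outputs."""
    return sum(t * math.log(max(t, 1e-12) / max(s, 1e-12))
               for t, s in zip(p_teacher, p_student))

def distillation_loss(labels, student_out, teacher_out, alpha=0.5):
    # Weighted combination: alpha balances learning from data vs. teacher.
    return (alpha * cross_entropy(labels, student_out)
            + (1 - alpha) * kl_divergence(teacher_out, student_out))
```

When the student matches the teacher exactly, the distillation term vanishes and only the data loss remains.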

10
Q

What are Distilling Techniques?

A

▪ Knowledge Distillation - Combines data loss (encouraging the student to learn directly from training data) with distillation loss (guiding the student to replicate the teacher’s outputs on a separate dataset).
+ Aims to transfer the comprehensive knowledge from the teacher model to the student model effectively.

▪ Attention Distillation - Focuses on transferring attention mechanisms from the teacher to the student model.
+ Helps the student model to identify and concentrate on the most relevant parts of the input data, enhancing task-specific performance.

▪ Parameter Sharing - Involves sharing layers or parameters from the teacher model with the student model.
+ Reduces the learning burden on the student model by utilizing pre-trained components of the teacher model

11
Q

What are the Advantages of Distillation?

A

▪ Produces models that are lighter and faster, suitable for deployment in environments with resource constraints.
▪ Retains a significant level of the teacher model’s performance, making it ideal for applications requiring both efficiency and effectiveness.
▪ Enables the distilled model to operate independently of the large FM, enhancing scalability and deployment flexibility.

12
Q

How does Fine-Tuning work?

A

Adjusting the model’s parameters on a specific dataset to improve performance for particular tasks.

▪ It allows organizations to leverage the powerful capabilities of FMs while addressing specific needs with minimal additional investment.
▪ It enhances the model’s utility in specialized fields, providing outputs that are both accurate and contextually appropriate.

13
Q

What are two Techniques for Fine-Tuning?

A

▪ Full Fine-Tuning - Involves retraining all parameters of the FM, allowing comprehensive learning from the domain-specific dataset.
+ Challenges include high computational costs and extensive training time, especially for large models like GPT-3 or GPT-4.

▪ Parameter-Efficient Fine-Tuning - Focuses on modifying only a subset of the model’s parameters, significantly reducing resource requirements.
+ Utilizes adapter modules or low-rank adaptation layers that target specific layers within the FM, preserving the original model’s structure.
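The parameter-efficient idea can be sketched with a toy low-rank adapter (LoRA-style) using plain Python lists: the frozen weight matrix W is augmented with a trainable low-rank update A·B, so only r·(m+n) parameters are trained instead of m·n. Matrix sizes, rank, and values below are hypothetical:

```python
# Low-rank adapter sketch: W stays frozen; only A (m x r) and B (r x n)
# would be trained. All numbers are illustrative.
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

m, n, r = 4, 4, 1  # full weight is m x n; adapter rank r << min(m, n)
W = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(m)]  # frozen
A = [[0.1] for _ in range(m)]       # m x r, trainable
B = [[0.2, 0.0, 0.0, 0.0]]          # r x n, trainable

W_eff = add(W, matmul(A, B))        # effective weight used at inference

full_params = m * n                 # what full fine-tuning would update
adapter_params = m * r + r * n      # what the adapter actually trains
```

Here the adapter trains 8 parameters instead of 16; for realistic layer sizes the saving is several orders of magnitude.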

14
Q

How does Retrieval-Augmented Generation (RAG) work?

A

Enhancing the FM’s responses by dynamically incorporating external knowledge or data during the generation process.

▪ Data Retrieval - Utilizes vector databases like Pinecone and Milvus to store and retrieve organizational or personal data as vector embeddings.
▪ Prompt Augmentation - Retrieves relevant external information based on the initial prompt and uses this data to enrich the prompt before it is input into the FM.
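A minimal end-to-end sketch of the two steps, using a toy character-frequency "embedding" in place of a real embedding model and a plain list in place of a vector database such as Pinecone or Milvus; the documents and function names are illustrative:

```python
# Toy RAG pipeline: embed, retrieve the nearest snippet, augment the prompt.
def embed(text):
    # Stand-in for a real embedding model: character-frequency vector.
    vocab = "abcdefghijklmnopqrstuvwxyz"
    return [text.lower().count(c) for c in vocab]

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = sum(a * a for a in u) ** 0.5
    nv = sum(b * b for b in v) ** 0.5
    return dot / (nu * nv) if nu and nv else 0.0

documents = ["Our refund policy allows returns within 30 days.",
             "The office is closed on public holidays."]
index = [(doc, embed(doc)) for doc in documents]  # stand-in vector DB

def augment_prompt(query):
    # Data retrieval: pick the stored snippet most similar to the query.
    qv = embed(query)
    best_doc = max(index, key=lambda pair: cosine(qv, pair[1]))[0]
    # Prompt augmentation: enrich the prompt with the retrieved context.
    return f"Context: {best_doc}\n\nQuestion: {query}"
```

The augmented string, not the raw question, is what gets sent to the FM.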

15
Q

What are the benefits of Retrieval-Augmented Generation (RAG)?

A

▪ Improved Accuracy and Relevance - By accessing vast and specific external information, RAG significantly enhances the model’s output quality.
▪ Dynamic Content Updates - Keeps the model up-to-date with the latest information, ensuring the responses are current and reflective of the most recent data.
▪ Enhanced Customization - Allows for tailored responses based on the specific information needs of the organization or the query context.

16
Q

How does Prompt Engineering work?

A

Crafting specific prompts or input sequences that guide the FM towards producing desired outputs.

It allows users and developers to shape model outputs creatively and strategically, e.g., by giving precise meaning to ambiguous terms such as microservice or agile, ensuring that the model aligns with specific task requirements.

17
Q

What does static Prompt Engineering mean?

A

Development teams create and maintain a consistent “system prompt” that sets the context and tone for all interactions.
▪ Example: A system prompt instructs the FM to adopt a professional and concise tone when assisting knowledge workers in the IT industry.
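A minimal sketch of such a static system prompt, assuming the role/content message shape used by many chat-style FM APIs; the prompt text itself is only an example:

```python
# Static system prompt: the same context-setting message anchors every
# conversation; only the user/assistant turns vary. Text is illustrative.
SYSTEM_PROMPT = ("You are an assistant for knowledge workers in the IT "
                 "industry. Answer in a professional and concise tone.")

def build_messages(user_input, history=()):
    return [{"role": "system", "content": SYSTEM_PROMPT},
            *history,
            {"role": "user", "content": user_input}]
```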

18
Q

What are the Dynamic Prompt Engineering Techniques?

A

▪ Incorporating Contextual Adjustment - Modifies the prompt to align with the current user interaction, ensuring that the FM has the most pertinent information for generating responses.
▪ Iterative Refinement - Involves querying the FM multiple times with variations of the initial prompt,
using the responses to refine the prompt until the desired accuracy and coherence are achieved.
▪ Progressive Prompting - Gradually introduces more information through a series of prompts that build
on each other, leading to more detailed and nuanced responses from the FM.
▪ Few-Shot Learning - Provides the FM with a few examples within the prompt to quickly adapt and generate appropriate responses for similar tasks, enhancing learning efficiency.
▪ Adaptive Learning - Similar to few-shot learning but includes dynamic adjustment of examples based on the FM’s performance, continuously refining the learning process to improve response quality over time.
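Few-shot learning, for instance, reduces to plain prompt construction: a handful of worked input/output examples are embedded before the actual query. The example pairs and the Input/Output format below are illustrative:

```python
# Few-shot prompt builder: each (input, output) pair becomes a worked
# example; the final line leaves the output for the FM to complete.
def few_shot_prompt(examples, query):
    lines = [f"Input: {inp}\nOutput: {out}" for inp, out in examples]
    lines.append(f"Input: {query}\nOutput:")
    return "\n\n".join(lines)
```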

19
Q

What are challenges in Dynamic Prompt Engineering?

A

▪ Dynamic prompt engineering requires careful implementation to avoid misrepresentations or inaccuracies, as demonstrated by instances like Google Gemini’s portrayal of historical events.
▪ It demands sufficient resources and rigorous feedback mechanisms to mitigate potential negative outcomes.

20
Q

What are sophisticated Prompt Patterns?

A

▪ Self-Consistency - Enhances reliability by querying the FM multiple times with similar prompts, selecting the most consistent response as the final answer.
▪ Chain of Thought - Facilitates complex reasoning by decomposing tasks into manageable steps, allowing the FM to process and address each component sequentially.
▪ Tree of Thought - Extends the chain of thought by employing a tree structure to explore multiple reasoning pathways simultaneously. Includes mechanisms to assess the effectiveness of each path, deciding whether to proceed or explore alternative branches.
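Self-consistency, for example, amounts to sampling the model several times and taking a majority vote; `query_model` below is a stand-in for a real FM call:

```python
# Self-consistency sketch: sample the model n times with the same prompt
# and return the most frequent answer as the final one.
from collections import Counter

def self_consistency(query_model, prompt, n_samples=5):
    answers = [query_model(prompt) for _ in range(n_samples)]
    return Counter(answers).most_common(1)[0][0]
```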

21
Q

What are Guardrails, and what is their goal?

A

▪ Guardrails are designed to monitor and control the inputs and outputs of foundation models, users, RAG, and external tools.
▪ The goal is to meet specific requirements, including function, accuracy, and aspects required by policy (e.g., the AI Act), standards, and laws.

22
Q

What are the 5 Types of Guardrails?

A

▪ Input guardrails are applied to the inputs received from users, and their possible effects include refusing or modifying user prompts.
▪ Output guardrails focus on the output generated by the foundation model, and may modify the output of the foundation model or prevent certain outputs from
being returned to the user.
▪ RAG guardrails are used to ensure the retrieved data is appropriate, either by validating or modifying the retrieved data where needed.
▪ Execution guardrails ensure that the called tools or models do not have any known vulnerabilities and the actions only run on the intended target environment and do not have negative side-effects.
▪ Intermediate guardrails can be used to assert that each intermediate step meets the necessary criteria.
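As a toy sketch of the first two types, an input guardrail might refuse prompts matching a blocklist and an output guardrail might redact sensitive patterns. Real guardrails would use classifiers and policy engines; the rules and the ID pattern below are illustrative:

```python
# Toy input/output guardrails. Blocklist entries and the EMP-#### pattern
# are invented for illustration only.
import re

BLOCKED_TOPICS = {"malware", "exploit"}

def input_guardrail(prompt):
    # Refuse prompts that touch blocked topics (returns None for refusal).
    if any(topic in prompt.lower() for topic in BLOCKED_TOPICS):
        return None
    return prompt

def output_guardrail(text):
    # Modify the output: mask anything that looks like an internal ID.
    return re.sub(r"\bEMP-\d+\b", "[REDACTED]", text)
```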

23
Q

What characterizes the maturity of FM adoption in organizations?

A

Easiest to Hardest:
1. Prompt Engineering
2. RAG
3. Fine-Tuning/distilling
4. Own FM

24
Q

What makes FhGenie so good?

A

It is an FM by Fraunhofer, based on OpenAI GPT models, that is capable of working with restricted (but not highly confidential) data.

25
Q

What is Limited Grounding?

A

FMs focus on identifying statistical patterns within data sequences; they are not grounded in facts or authoritative knowledge. They identify correlations but lack an underlying causal model or a world model. This can lead to significant inaccuracies in their outputs.

26
Q

What are Hallucinations?

A

Without grounding, FMs lack the ability to evaluate the confidence and truthfulness of their outputs, while having a tendency to provide an answer by making one up, often termed “hallucinations.” The term “hallucination” has been critiqued for anthropomorphizing AI, suggesting false perception.

27
Q

What are the steps in Model Development?

A
1. Data Management
2. Feature Engineering
3. Dividing the data
4. Generating the Model

28
Q

What does it mean to divide the data?

A

Split the data into 3 main sets:
▪ Training set: 60–80%
▪ Validation set: 10–20%
▪ Test set: 10–20%
The sets must not overlap.
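The split can be sketched as a shuffle followed by non-overlapping slices; the 70/15/15 ratios and the seed are illustrative:

```python
# Non-overlapping train/validation/test split via shuffle-then-slice.
import random

def split_data(data, train=0.7, val=0.15, seed=0):
    items = list(data)
    random.Random(seed).shuffle(items)  # fixed seed for reproducibility
    n_train = int(len(items) * train)
    n_val = int(len(items) * val)
    return (items[:n_train],
            items[n_train:n_train + n_val],
            items[n_train + n_val:])
```

Because the three slices are disjoint by construction, no example can appear in more than one set.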

29
Q

How to evaluate a Model?

A

▪ Models are initially tested in isolation and then re-tested when integrated into the broader system
▪ Quality Attributes: Describe non-functional requirements such as system reliability, maintainability, and efficiency.
▪ Risk Assessment: Identifies potential immediate harms and evaluates the likelihood and impact of adverse events.
▪ Impact Assessment: Analyzes the broader, long-term effects of AI systems on individuals, communities, and societal aspects. Includes evaluations across economic, social, and environmental dimensions.