RESPONSIBLE AI Flashcards

1
Q

Remember ML AI and GenAI

A

AI > Machine Learning > Deep Learning > GenAI

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is tokenization?

A

converting raw text into a sequence of tokens, can be word-based or sub-word based

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is a context window?

A

the number of tokens an LLM can consider when generating text, the larger the context window, the more information and coherence (at the cost or memory and power) - first factor when choosing a FM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is an embedding model?

A

a model that creates a vector for each token. converting tokens into models encodes many features for one input token (information about that word) and stores it into a high-dimensionality vector (used for vector databases and RAG)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

is there a semantic relationship between tokens with similar embeddings?

A

yes, that’s why we use them. embedding models can be easily searchable so it’s a good idea to use an embeddings model to power a search application

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what are the basic components of AI?

A

Data layer (in vast ammounts)
ML framework and algorithm layers
Model layer (implement and train it)
Application layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is responsible AI?

A

making sure that systems are transparent and trustworthy, mitigating potential risk and negative outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is secure AI?

A

ensuring that confidentiality, integrity and availability are maintained on organization data and information assets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is AI governance?

A

ensuring that we can add value and manage risk in the operation business; clear policies, guidelines and oversight mechanisms to ensure AI systems align with legal and regulatory requirements - IMPROVE TRUST

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is AI compliance?

A

ensuring the adherence to regulations and guidelinew, specially for sensitive domains such as healthcare, finance, and legal applications

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are the pillars of responsible AI?

A

fairness
explainability
privacy and security
transparency
veracity and robustness (reliability)
governance (responsible AI practices)
safety
controllability (align with human values)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what are AWS services for implementing responsible AI?

A

Bedrock (human or automatic model evaluation)
Guardrails for Bedrock
Sagemaker Clarify
Sagemaker Data Wrangler
Sagemaker Model Monitor
A2A (Amazon Augmented AI - human review of ML predictions)
and for governance: Sagemaker Role Manager, Model Cards and Model Dashboard

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what are AWS AI Service Cards?

A

responsible AI documentation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what are the capabilities of GenAI?

A

adaptability
responsiveness
simplicity
creativity and exploration
data efficiency
personalization
scalability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what are the challenges of GenAI?

A

regulatory violations
social risks
data security and privacy concerns
toxicity
hallucinations
interpretability
nondeterminism
plagiarism and cheating

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what are toxicity and hallucinations? and what are their mitigations?

A

toxicity: generating offensive, disturbing or innapropriate content
mitigation: curate training data - guardrail

hallucinations: assertions or claims that sound true, but are incorrect (because of next-probability sampling)
mitigation: educate users about fact checking, mark generated content as unverified

17
Q

what are the prompt misuse methods?

A

poisoning: intentional introduction of malicious or biased data intontue training dataset

hijacking and prompt injection: influencing outputs by embedding specific instructions within the prompts

exposure: of sensitive or confidential information

prompt leaking: unintenional disclosure or leakage of prompts or inputs used within a model - can expose protected data

jailbreaking: many-shot jailbreaking

18
Q

what are the AWS tools for governance?

A

aws config
aws inspector
aws audit manager
aws artifact
aws cloudtrail
aws trusted advisor