RESPONSIBLE AI Flashcards
Remember ML AI and GenAI
AI > Machine Learning > Deep Learning > GenAI
What is tokenization?
converting raw text into a sequence of tokens, can be word-based or sub-word based
what is a context window?
the number of tokens an LLM can consider when generating text, the larger the context window, the more information and coherence (at the cost or memory and power) - first factor when choosing a FM
what is an embedding model?
a model that creates a vector for each token. converting tokens into models encodes many features for one input token (information about that word) and stores it into a high-dimensionality vector (used for vector databases and RAG)
is there a semantic relationship between tokens with similar embeddings?
yes, that’s why we use them. embedding models can be easily searchable so it’s a good idea to use an embeddings model to power a search application
what are the basic components of AI?
Data layer (in vast ammounts)
ML framework and algorithm layers
Model layer (implement and train it)
Application layer
what is responsible AI?
making sure that systems are transparent and trustworthy, mitigating potential risk and negative outcomes
what is secure AI?
ensuring that confidentiality, integrity and availability are maintained on organization data and information assets
what is AI governance?
ensuring that we can add value and manage risk in the operation business; clear policies, guidelines and oversight mechanisms to ensure AI systems align with legal and regulatory requirements - IMPROVE TRUST
what is AI compliance?
ensuring the adherence to regulations and guidelinew, specially for sensitive domains such as healthcare, finance, and legal applications
what are the pillars of responsible AI?
fairness
explainability
privacy and security
transparency
veracity and robustness (reliability)
governance (responsible AI practices)
safety
controllability (align with human values)
what are AWS services for implementing responsible AI?
Bedrock (human or automatic model evaluation)
Guardrails for Bedrock
Sagemaker Clarify
Sagemaker Data Wrangler
Sagemaker Model Monitor
A2A (Amazon Augmented AI - human review of ML predictions)
and for governance: Sagemaker Role Manager, Model Cards and Model Dashboard
what are AWS AI Service Cards?
responsible AI documentation
what are the capabilities of GenAI?
adaptability
responsiveness
simplicity
creativity and exploration
data efficiency
personalization
scalability
what are the challenges of GenAI?
regulatory violations
social risks
data security and privacy concerns
toxicity
hallucinations
interpretability
nondeterminism
plagiarism and cheating