Adapting Foundation Models Flashcards
What is adaptation in the context of foundation models?
The process of customizing pre-trained models to perform better in specific tasks or domains by re-training them with new data.
Why is adaptation important?
It enhances performance in specialized tasks, aligns outputs with organizational needs, and respects privacy constraints.
Give an example of domain-specific adaptation.
A chatbot for banking fine-tuned to answer financial questions about loans and accounts.
How can private data be used in adaptation?
Fine-tuning a model with an organization’s internal data improves relevance while maintaining privacy.
What is instruction-based adaptation?
Fine-tuning models using instructional data (e.g., manuals) to improve their performance on unseen tasks.
What is RAG?
RAG stands for Retrieval-Augmented Generation, a framework combining retrieval of external data with natural language generation.
Why is RAG useful?
It overcomes knowledge cutoffs, improves accuracy, reduces hallucinations, and enables domain adaptability.
What are the main components of RAG?
Components of a RAG System
- Retriever:
- - Fetches relevant information from external sources.
- - Techniques:
- - - Sparse Retrieval: BM25, TF-IDF.
- - - Dense Retrieval: Embeddings via transformers like BERT.
- - Tools: Elasticsearch, FAISS, Pinecone.
- Generator:
- A language model that processes the retrieved information and generates the output.
- Common Models: GPT, T5, BART.
- Knowledge Base:
- The external data source where the retriever looks for information.
- Examples: Databases, Wikipedia, proprietary datasets.
Give an example of a RAG application.
Customer support systems retrieving answers from a company knowledge base.
What is a common challenge in RAG systems?
Retriever Quality:
Poor retrieval mechanisms can provide irrelevant or incomplete data.
Latency:
Real-time retrieval and generation can be slower compared to static responses from an LLM.
Integration Complexity:
Combining retrievers, generators, and external databases requires careful engineering.