Generative AI Flashcards
Generative AI
AI that can produce high-quality content, including text, images, video, sound, synthetic data, etc.
Subset of Deep Learning, along with LLMs.
How LLMs Work
Use Supervised Learning to repeatedly predict the next word in the sequence.
Large Language Model
Key Tasks an LLM Can Carry Out
- Reading
- Writing
- Chatting
Common Use Cases for an LLM
- Brainstorming
- Drafting
- Proofreading
- Summarizing
- Analysis
- Sentiment
- Specialized Chatbots
How to Deploy Chatbots
- Deploy internally first
- Deploy externally with human-in-the-loop
- After deemed safe, allow chatbot to communicate directly with customers
May not be practical to have humans check every message.
Hallucination
Hallucination
When the LLM generates new content that is not accurate, factual, or relevant.
Causes of Hallucination
Causes:
- Ambiguous Input: if the user input is vague or unclear, the LLM might struggle to generate a relevant response, causing it to create content.
- Training Data: insufficient, biased, or otherwise flawed training data.
- Limitations of the model.
Context Window
Amount of input data or tokens (words, characters, or other elements) that a model can process or “remember” at once. Determines how much of the preceding information the model can use to understand the current input. If the context window is too small, the model might miss long-range dependencies or context that could be crucial for accurate predictions, especially in tasks like language translation or summarization.
Structured vs. Unstructured Data
Generative AI works better with unstructured versus structured (e.g., tabular) data.
GPT Stands For
“Generative Pre-trained Transformer”
- Generative refers to generating new data in the form of text, images, etc.
- Pre-training is where the model learns from training data to generate probability distributions, usually with a combination of supervised and unsupervised ML techniques.
- Transformers are machine learning models built with neural networks that carry out the pre-training in a computationally effective manner.
Sentiment Analysis
Process that uses natural language processing and machine learning techniques to identify opinions expressed in text input. Determines whether the attitude towards a particular topic, product, or service is positive, negative, or neutral.