LLM Terms Flashcards
Large Language Model (LLM)
A type of neural network model designed to understand and generate human-like text by learning from vast amounts of language data.
Transformer Architecture
A deep learning model architecture that uses self-attention mechanisms to process input data in parallel, making it the foundation of modern LLMs like GPT and BERT.
Self-Attention
A mechanism that enables the model to focus on different parts of the input sequence, assigning importance scores to each token relative to others in the context.
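A minimal sketch of single-head scaled dot-product self-attention in NumPy; the random matrices stand in for a real model's learned query/key/value projections.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 8                                    # embedding dimension
x = rng.normal(size=(4, d))              # 4 tokens, d-dim embeddings

# Random stand-ins for the learned query/key/value projections.
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))
Q, K, V = x @ Wq, x @ Wk, x @ Wv

scores = Q @ K.T / np.sqrt(d)            # importance of each token to every other
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights /= weights.sum(axis=-1, keepdims=True)   # row-wise softmax
output = weights @ V                     # attention-weighted mix of value vectors
print(output.shape)                      # (4, 8)
```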
Context Length
The maximum number of tokens an LLM can process in a single input. Longer context lengths allow the model to capture more extensive dependencies in the input text.
Tokenization
The process of converting text into smaller units (tokens) that the LLM can process, such as words, subwords, or characters.
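For illustration, OpenAI's tiktoken library exposes the BPE tokenizer its models use; other LLMs ship their own tokenizers, so the exact splits and IDs vary by model.

```python
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")
ids = enc.encode("Tokenization splits text into model-readable units.")
print(ids)                              # integer token IDs
print([enc.decode([i]) for i in ids])   # the subword each ID maps to
```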
Subword Tokenization
A method of breaking words into smaller units (subwords), enabling the model to handle out-of-vocabulary words and morphological variations.
Byte-Pair Encoding (BPE)
A tokenization technique that iteratively merges the most frequent pairs of characters or character sequences to create subword tokens.
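A toy sketch of the BPE training loop: count adjacent symbol pairs across the corpus and merge the most frequent pair, repeating for a fixed number of merges. Real implementations work over bytes and far larger corpora, but the merge rule is the same.

```python
from collections import Counter

# Words represented as symbol tuples, with corpus frequencies.
vocab = {("l", "o", "w"): 5, ("l", "o", "w", "e", "r"): 2, ("n", "e", "w"): 6}

def most_frequent_pair(vocab):
    pairs = Counter()
    for word, freq in vocab.items():
        for a, b in zip(word, word[1:]):
            pairs[(a, b)] += freq
    return pairs.most_common(1)[0][0]

def merge(vocab, pair):
    merged = {}
    for word, freq in vocab.items():
        out, i = [], 0
        while i < len(word):
            if word[i:i + 2] == pair:          # merge the chosen pair
                out.append(word[i] + word[i + 1])
                i += 2
            else:
                out.append(word[i])
                i += 1
        merged[tuple(out)] = freq
    return merged

for _ in range(3):
    pair = most_frequent_pair(vocab)
    vocab = merge(vocab, pair)
    print("merged", pair, "->", list(vocab))
```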
Prompt Engineering
The practice of crafting input prompts to guide the LLM in generating specific, desired outputs, essential for steering model behavior in applications.
Few-Shot Learning
A model’s ability to learn and generalize from only a few examples provided in the prompt, enabling the generation of contextually relevant responses.
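A sketch of what a few-shot prompt might look like; the task and format are illustrative and not tied to any particular provider's API.

```python
# Two worked examples, then the new input the model should complete
# in the same pattern.
few_shot_prompt = """\
Classify the sentiment of each review as positive or negative.

Review: "The battery lasts all day and the screen is gorgeous."
Sentiment: positive

Review: "It stopped working after a week."
Sentiment: negative

Review: "Setup was painless and support was friendly."
Sentiment:"""

print(few_shot_prompt)  # send this string to any LLM completion endpoint
```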
Zero-Shot Learning
The capability of an LLM to perform a task without explicit examples in the prompt, relying instead on its pre-trained knowledge.
Fine-Tuning
The process of training a pre-trained LLM on a smaller, task-specific dataset to adapt the model for specific applications.
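One common route is the Hugging Face Trainer API; a minimal sketch, assuming a small sentiment-classification task. The model name, dataset, and hyperparameters here are examples, not recommendations.

```python
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2)

dataset = load_dataset("imdb")          # example task-specific dataset

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length")

tokenized = dataset.map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", num_train_epochs=1,
                           per_device_train_batch_size=8),
    train_dataset=tokenized["train"].shuffle(seed=42).select(range(1000)),
)
trainer.train()                         # updates the pre-trained weights
```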
Retrieval-Augmented Generation (RAG)
A technique combining information retrieval and LLMs, where relevant documents are retrieved and provided as context to the model during inference.
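A minimal RAG sketch: embed the query, retrieve the most similar documents, and prepend them to the prompt. `embed` and `generate` are hypothetical stand-ins for whatever embedding model and LLM you use.

```python
import numpy as np

def embed(text: str) -> np.ndarray:     # hypothetical: any embedding model
    raise NotImplementedError

def generate(prompt: str) -> str:       # hypothetical: any LLM call
    raise NotImplementedError

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def rag_answer(question, documents, k=3):
    q = embed(question)
    # Retrieve the k documents most similar to the question.
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    context = "\n\n".join(ranked[:k])
    prompt = f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    return generate(prompt)             # answer grounded in retrieved context
```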
Knowledge Base
A structured repository of information used to provide factual data for an LLM, enhancing its ability to generate accurate and contextually appropriate responses.
Semantic Search
A search method that leverages the meaning of words and their relationships to find relevant documents or information, often used in conjunction with LLMs.
Embeddings
Dense vector representations of words, sentences, or documents that capture their semantic meanings and are used for tasks like semantic search in LLM applications.
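A small sketch of how embeddings power semantic search: hand-made 3-dimensional vectors stand in for real model embeddings (which have hundreds or thousands of dimensions), and cosine similarity ranks the documents.

```python
import numpy as np

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy embeddings; in practice an embedding model produces these vectors.
docs = {
    "cat sat on the mat": np.array([0.9, 0.1, 0.0]),
    "kitten on a rug":    np.array([0.8, 0.2, 0.1]),
    "stock prices fell":  np.array([0.0, 0.1, 0.9]),
}
query = np.array([0.85, 0.15, 0.05])    # e.g. embedding of "cat on a rug"

for text, vec in sorted(docs.items(), key=lambda kv: cosine(query, kv[1]),
                        reverse=True):
    print(f"{cosine(query, vec):.3f}  {text}")  # most similar first
```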
Memory Management
Techniques for working within an LLM’s finite context length, such as summarizing, truncating, or selectively retrieving earlier parts of a conversation so that long interactions stay coherent.
Temperature
A parameter that controls the randomness of an LLM’s output. Higher values generate more diverse outputs, while lower values make responses more deterministic.
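Temperature rescales the logits before the softmax; a minimal NumPy sketch showing how the same logits yield sharper or flatter distributions.

```python
import numpy as np

def softmax_with_temperature(logits, temperature):
    # Dividing logits by T < 1 sharpens the distribution; T > 1 flattens it.
    z = np.asarray(logits) / temperature
    e = np.exp(z - z.max())
    return e / e.sum()

logits = [2.0, 1.0, 0.2]
print(softmax_with_temperature(logits, 0.5))   # peaked, near-deterministic
print(softmax_with_temperature(logits, 1.0))   # standard softmax
print(softmax_with_temperature(logits, 2.0))   # flatter, more random
```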
Top-k Sampling
A decoding method that restricts sampling to the k most likely tokens at each step, renormalizing their probabilities so that low-probability tokens are never drawn.
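A minimal NumPy sketch of top-k sampling over a toy distribution.

```python
import numpy as np

def top_k_sample(probs, k, rng=np.random.default_rng()):
    top = np.argsort(probs)[-k:]             # indices of the k best tokens
    p = probs[top] / probs[top].sum()        # renormalize within the top k
    return int(rng.choice(top, p=p))

probs = np.array([0.5, 0.2, 0.15, 0.1, 0.05])
print(top_k_sample(probs, k=3))              # only token 0, 1, or 2 is drawn
```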
Top-p (Nucleus) Sampling
A sampling method that draws from the smallest set of tokens whose cumulative probability exceeds a threshold p, letting the candidate pool adapt to how peaked or flat the distribution is.
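The same toy distribution sampled with top-p: the candidate pool (the nucleus) is the smallest set of tokens whose cumulative probability reaches p.

```python
import numpy as np

def top_p_sample(probs, p, rng=np.random.default_rng()):
    order = np.argsort(probs)[::-1]            # tokens, most probable first
    cumulative = np.cumsum(probs[order])
    cutoff = int(np.searchsorted(cumulative, p)) + 1
    nucleus = order[:cutoff]                   # smallest set covering p
    q = probs[nucleus] / probs[nucleus].sum()  # renormalize the nucleus
    return int(rng.choice(nucleus, p=q))

probs = np.array([0.5, 0.2, 0.15, 0.1, 0.05])
print(top_p_sample(probs, p=0.8))              # nucleus here is tokens 0, 1, 2
```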
Beam Search
A decoding strategy that explores multiple candidate sequences in parallel during text generation, keeping only the k most probable partial sequences (the beam) at each step.
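A compact beam-search sketch over a toy next-token function; `step_fn` is a stand-in for querying an LLM's next-token probabilities.

```python
import math

def beam_search(step_fn, beam_width=3, max_len=5):
    # Each beam is a (sequence, log-probability) pair.
    beams = [([], 0.0)]
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            for token, prob in step_fn(seq):
                candidates.append((seq + [token], score + math.log(prob)))
        # Keep only the beam_width most probable partial sequences.
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_width]
    return beams

# Toy model: always offers the same three tokens with fixed probabilities.
toy = lambda seq: [("a", 0.5), ("b", 0.3), ("c", 0.2)]
for seq, score in beam_search(toy, beam_width=2, max_len=3):
    print("".join(seq), f"log-prob={score:.2f}")
```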