Terminology Flashcards

Question

Define 'Entity recognition'.

Answer 1

Identifying and extracting key pieces of information (like names or dates) from user input.
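
As a rough illustration of the definition above, entity extraction can be sketched with regular expressions. The patterns and the function name extract_entities are hypothetical, chosen only for this example; production chatbots typically rely on trained models rather than hand-written patterns.

```python
import re

# Hypothetical patterns for illustration only (ISO-style dates, "#1234" order IDs).
DATE_PATTERN = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")
ORDER_PATTERN = re.compile(r"#(\d+)")


def extract_entities(text):
    """Return a dict mapping each entity type to the matches found in text."""
    return {
        "date": DATE_PATTERN.findall(text),
        "order_id": ORDER_PATTERN.findall(text),
    }


if __name__ == "__main__":
    print(extract_entities("Order #4821 ships on 2024-05-17"))
```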

Answer 2

Determining the user's purpose or goal behind a message, guiding the chatbot’s response.

Answer 3

The system that decides the next action or response based on user input and conversation context.

Answer 4

Strategies the chatbot uses to ask follow-up questions or rephrase ambiguous queries to understand user intent.

Answer 5

The process of converting structured data or intents into fluent, human-like language for responses.

Answer 6

The overall design and structure of the chatbot system, including its components and their interactions.

Answer 7

The methods used to train models from data, enabling the chatbot to learn language patterns and decision-making rules.

Answer 8

Neural networks designed for sequential data that process one input at a time while maintaining a memory of previous inputs.
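
A minimal sketch of the idea, assuming a single scalar hidden unit with fixed, made-up weights (w_x, w_h, b are illustrative values, not trained ones). It shows inputs being processed one at a time while the hidden state carries information forward.

```python
import math


def rnn_step(x, h_prev, w_x=0.5, w_h=0.8, b=0.0):
    """One recurrent step: combine the current input with the previous hidden state."""
    return math.tanh(w_x * x + w_h * h_prev + b)


def run_rnn(inputs, h0=0.0):
    """Process inputs in order, returning the hidden state after each step."""
    h = h0
    states = []
    for x in inputs:
        h = rnn_step(x, h)
        states.append(h)
    return states


if __name__ == "__main__":
    # The second input is 0.0, yet the second state is nonzero:
    # it still reflects the first input through the carried hidden state.
    print(run_rnn([1.0, 0.0]))
```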

Answer 9

Neural networks that use self-attention to process entire sequences in parallel, capturing long-range dependencies more effectively.

Answer 10

A computational model composed of interconnected layers of nodes that learns patterns from data through adjustable weights.

Answer 11

Data where the order of elements matters, such as words in a sentence or turns in a conversation.

Answer 12

The first layer of a neural network that receives and processes the initial input data.

Answer 13

Intermediate layers in a neural network that extract and transform features from the input data.

Answer 14

The final layer of a neural network that produces the model’s response or prediction.

Answer 15

A training algorithm for RNNs that unfolds the network over time and propagates error gradients back through each time step.

Answer 16

A challenge in training deep or sequential models where gradients shrink as they are propagated back through many layers or time steps, hindering learning of long-term dependencies.

Answer 17

A type of RNN that uses gating mechanisms to maintain and control long-term memory, mitigating the vanishing gradient problem.

Answer 18

In an LSTM cell, the component that controls how much new information is added to the cell state.

Answer 19

In an LSTM cell, the mechanism that decides which information to discard from the cell state.

Answer 20

In an LSTM cell, the mechanism that determines what part of the cell state is output at each time step.

Answer 21

The internal storage within an LSTM cell that retains information over long sequences.
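
The four LSTM cards above (input gate, forget gate, output gate, cell state) can be tied together in one scalar time step. This is a simplified sketch under stated assumptions: one unit, scalar weights, and a hypothetical parameter dict p whose keys are chosen for this example.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(x, h_prev, c_prev, p):
    """One scalar LSTM step. p maps a gate name to a (w_x, w_h, b) tuple."""

    def pre(name):
        w_x, w_h, b = p[name]
        return w_x * x + w_h * h_prev + b

    f = sigmoid(pre("forget"))        # forget gate: how much old cell state to keep
    i = sigmoid(pre("input"))         # input gate: how much new information to add
    g = math.tanh(pre("candidate"))   # candidate values to write into the cell
    o = sigmoid(pre("output"))        # output gate: how much of the cell to expose
    c = f * c_prev + i * g            # updated cell state
    h = o * math.tanh(c)              # new hidden state (the step's output)
    return h, c


if __name__ == "__main__":
    zeros = {name: (0.0, 0.0, 0.0) for name in ("forget", "input", "candidate", "output")}
    print(lstm_step(1.0, 0.0, 2.0, zeros))
```

With all weights at zero every gate evaluates to 0.5, so the cell state is simply halved at each step, which makes the forget gate's role easy to see.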

Answer 22

A method used in Transformers where each token in a sequence assesses its relevance to every other token, allowing the model to capture global dependencies.
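
A minimal sketch of scaled dot-product self-attention in plain Python. As a simplifying assumption, queries, keys, and values are the token vectors themselves; real Transformers apply learned projections first.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """tokens: list of equal-length vectors. Returns one output vector per token."""
    d = len(tokens[0])
    outputs = []
    for q in tokens:
        # Score this token against every token in the sequence, itself included.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        weights = softmax(scores)
        # Output is the relevance-weighted average of all token vectors.
        outputs.append([sum(w * v[j] for w, v in zip(weights, tokens)) for j in range(d)])
    return outputs


if __name__ == "__main__":
    print(self_attention([[1.0, 0.0], [0.0, 1.0]]))
```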

Answer 23

A large-scale language model based on the Transformer architecture, known for generating highly fluent and contextually relevant text.

Answer 24

A dataset curated to be as balanced and representative as possible, minimizing skew that could cause the chatbot to favor certain patterns or groups.

Answer 25

A large, well-organized, and accurate collection of data that is relevant to the chatbot's domain, ensuring reliable learning and performance.

Answer 26

Data collected from actual user interactions or real-world events, as opposed to artificially generated data.

Answer 27

Artificially created data used to augment real data, often generated to simulate specific scenarios or increase dataset diversity.

Answer 28

This form of bias occurs when the dataset is biased towards a particular viewpoint, such as training data that only include customer queries related to certain types of policies.

Answer 29

This form of bias occurs when the training data do not reflect changes over time. For example, if the natural language processing model is trained on data from several years ago, it may not be able to accurately predict recent customer queries.

Answer 30

This form of bias occurs when the labels applied to the data are subjective, inaccurate or incomplete. For example, if the labels assigned to customer queries are too generic, the model may not be able to accurately predict the customer’s intent.

Answer 31

This form of bias occurs when the dataset is biased towards certain linguistic features, such as dialect or vocabulary. For example, if a dataset is built on formal written language, the model may not be able to accurately interpret informal language.

Answer 32

This form of bias occurs when the training dataset is not representative of the entire population, such as training data that only include customer queries from one demographic.

Answer 33

This form of bias occurs when the training data are not randomly selected but are instead chosen based on some criteria. For example, a language model trained on data suggesting that certain demographics are more likely to file insurance claims may be biased towards people who fall under that category.

Answer 34

The process of cleaning and transforming raw text data into a usable format for model training, such as tokenization and normalization.

Answer 35

A simple method for text representation that counts word frequencies, disregarding grammar and order, to provide a basic input for models.
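
A short sketch of this representation using only the standard library. The tokenization rule (lowercase, then runs of letters, digits, or apostrophes) is an assumption made for this example.

```python
import re
from collections import Counter


def bag_of_words(text):
    """Count word frequencies, ignoring grammar and word order."""
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    return Counter(tokens)


if __name__ == "__main__":
    print(bag_of_words("The cat saw the other cat."))
    # Word order is discarded, so these two sentences get the same representation.
    print(bag_of_words("dog bites man") == bag_of_words("man bites dog"))
```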

Answer 36

Encoding text as numerical vectors (embeddings) that capture semantic meaning, allowing the chatbot to process and compare textual information.

Answer 37

The process of adjusting a machine learning model’s weights by exposing it to data so that it learns to perform its task accurately.

Answer 38

Integrating and running the trained model in a live environment where it can interact with users in real time.

Answer 39

The processing capability of hardware (like CPUs, GPUs, or TPUs) that is required to train and run models efficiently.

Answer 40

A general-purpose processor that executes instructions and performs basic computations; adequate for simple tasks but slower for large-scale neural network processing.

Answer 41

A specialized processor that excels at parallel computations, significantly speeding up the training and inference of neural networks.

Answer 42

A custom hardware accelerator designed specifically for machine learning tasks, offering high efficiency for large-scale model training and inference.

Answer 43

Fast-access storage used during computation; RAM is used by CPUs and VRAM by GPUs, both essential for holding model data and intermediate results during processing.

Answer 44

Long-term data retention hardware (such as SSDs or HDDs) used to store datasets and model files, which are loaded into memory for processing.

Answer 45

The data transfer rate available within a system or network, important for quickly moving data between storage, memory, and processors.

Answer 46

Using multiple computers or servers simultaneously to share the computational load, enabling faster training or processing of large models.

Answer 47

Executing many calculations simultaneously, which is critical for efficiently training or running neural networks in chatbots.

Answer 48

A method where multiple processors handle different portions of a dataset concurrently during model training, speeding up the process.

Answer 49

A technique that splits different parts of a large model across multiple processors, allowing training of models too big for a single device.

Answer 50

Grouping multiple inputs together to process them simultaneously, improving efficiency by leveraging parallel computations.
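
A minimal sketch of the grouping step, assuming the inputs are held in a list. The helper name make_batches is hypothetical; deep learning frameworks ship their own batching utilities.

```python
def make_batches(items, batch_size):
    """Split items into consecutive groups of at most batch_size elements."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    return [items[i:i + batch_size] for i in range(0, len(items), batch_size)]


if __name__ == "__main__":
    # The last batch may be smaller when the input does not divide evenly.
    print(make_batches([1, 2, 3, 4, 5], 2))
```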

Answer 51

Optimizing computations by processing data in batches as vectors or matrices, rather than one element at a time, to fully utilize hardware capabilities.

Answer 52

The practice of analyzing a program's performance to identify bottlenecks or inefficient code segments for optimization.

Answer 53

Optimizing and refining the model’s implementation or hyperparameters after initial training to improve performance without altering core functionality.

Answer 54

Distributing workload evenly across multiple servers or processors to ensure no single component is overwhelmed, which is essential for handling many chatbot requests concurrently.
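
One simple strategy, round-robin assignment, can be sketched as follows. The request and server names are placeholders, and real load balancers usually also account for server health and current load.

```python
from itertools import cycle


def assign_round_robin(requests, servers):
    """Assign each request to the next server in turn, wrapping around."""
    if not servers:
        raise ValueError("at least one server is required")
    assignment = {server: [] for server in servers}
    for request, server in zip(requests, cycle(servers)):
        assignment[server].append(request)
    return assignment


if __name__ == "__main__":
    print(assign_round_robin(["r1", "r2", "r3", "r4", "r5"], ["a", "b"]))
```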

Answer 55

Remote computing resources provided over the internet that enable scalable deployment and training of chatbots without maintaining physical hardware.

Answer 56

An architecture separating the system into two layers—typically a front-end interface and a back-end processing server—used to efficiently manage chatbot interactions.

Answer 57

The process of encoding data to prevent unauthorized access, both during transmission (in transit) and when stored (at rest).

Answer 58

Removing or masking personal identifiers in data so that individual users cannot be recognized, protecting privacy in training and logs.
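
A rough sketch of masking identifiers in chat logs. The two patterns are illustrative assumptions that catch only simple email addresses and phone-like digit runs; they are not a complete solution for personal data.

```python
import re

# Illustrative patterns only; real anonymization needs far broader coverage.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
PHONE = re.compile(r"\+?\d[\d\s-]{7,}\d")


def mask_identifiers(text):
    """Replace email addresses and phone-like numbers with placeholder tags."""
    text = EMAIL.sub("[EMAIL]", text)
    return PHONE.sub("[PHONE]", text)


if __name__ == "__main__":
    print(mask_identifiers("Contact ana@example.com or 555-123-4567 today"))
```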

Answer 59

Application interfaces that are protected with authentication and encryption, ensuring safe data exchange between the chatbot and other systems.

Answer 60

Systematic errors or unfair tendencies in a model’s outputs, often resulting from skewed training data.

Answer 61

The principle that a chatbot should treat all users equitably without discrimination or undue preference.

Answer 62

Clarity regarding how a chatbot functions, including its decision-making process and limitations, to build user trust.

Answer 63

False or inaccurate information generated by a chatbot, which can mislead users if not properly controlled.

Answer 64

Recording chatbot interactions for monitoring, troubleshooting, and future model improvements, while ensuring user privacy.

Answer 65

The process of detecting and blocking inappropriate or harmful content in a chatbot's responses or user inputs.

Answer 66

A training method where human evaluations of responses guide the model to align more closely with desired behaviors and ethical standards.

Answer 67

A mathematical function that measures the error between a model's predictions and the actual target values; minimizing this function improves model accuracy.
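
Mean squared error is one common example of such a function, sketched here for plain lists of numbers. Other tasks use other losses, such as cross-entropy for classification.

```python
def mean_squared_error(predictions, targets):
    """Average of the squared differences between predictions and targets."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not predictions:
        raise ValueError("at least one prediction is required")
    return sum((p - t) ** 2 for p, t in zip(predictions, targets)) / len(predictions)


if __name__ == "__main__":
    # Only the last prediction is off (by 2), so the error is 4 / 3.
    print(mean_squared_error([1.0, 2.0, 3.0], [1.0, 2.0, 5.0]))
```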

Answer 68

A subset of machine learning that uses deep (multi-layered) neural networks to learn complex representations from large amounts of data.

Answer 69

The process of adjusting a model's hyperparameters, the settings chosen before training rather than learned from data (such as learning rate or number of layers), to optimize performance.

Answer 70

A neural network with a vast number of parameters trained on extensive text data, capable of understanding and generating human-like language.

Answer 71

The adjustable parameters within a neural network that are optimized during training to capture patterns and make accurate predictions.

(95 cards)