Midterm Flashcards

1
Q

SOTA

A

State of the Art

2
Q

Clark-Fisher Hypothesis

A

As economies develop and jobs in one sector are displaced, workers move to other sectors: from agriculture (primary) to manufacturing (secondary) to services (tertiary)

3
Q

Turing Test

A

Test to see how well a machine can imitate a human

4
Q

AI winter

A

a period when people weren’t investing in or researching AI

5
Q

Expert Systems

A

Machine expert in a topic, usually consists of a knowledge base and an inference engine

6
Q

knowledge engineer

A

responsible for designing and maintaining expert systems; gathers knowledge from human experts and turns it into rules or facts for the system

7
Q

knowledge base

A

What the AI system knows. Ex: rules, facts, information about the role of the AI, etc.

8
Q

Polanyi principle

A

We often know more than we can articulate.

9
Q

backpropagation

A

minimizes error in predictions made by the network. The network can learn from its mistakes and adjust weights between neurons to create better inferences in the future

10
Q

Artificial neural network

A

computational model made up of layers of connected nodes, or neurons

11
Q

GPU

A

graphics processing unit

12
Q

FLOPS

A

Floating-point operations a computer can perform in one second

13
Q

TOPS

A

Trillions of operations per second

14
Q

RNN

A

Recurrent neural networks are great at handling sequential data and finding patterns in it

15
Q

sequence to sequence

A

A model that takes one sequence of data, like an English sentence, and translates it into a different sequence, like a Portuguese sentence

16
Q

long short-term memory

A

a type of RNN architecture designed to effectively learn from and remember long sequences of data

17
Q

transformer architecture

A

Through the use of a “self-attention” mechanism, it weighs the importance of words, better understanding relationships and context

18
Q

parallel processing

A

simultaneous execution of multiple tasks or processes. Helps with handling large amounts of data at once, instead of working through it sequentially

19
Q

emergent behavior

A

when an AI system does something unexpected that arises from all of its pieces working together rather than from explicit programming

20
Q

AGI

A

artificial general intelligence: an AI that can understand, learn, and apply knowledge across a wide range of tasks

21
Q

Singularity

A

a point when AIs become more intelligent than humans, leading to rapid tech advancements and societal changes

22
Q

GenAI

A

AI that can generate new content

23
Q

machine learning

A

focuses on development of algorithms and statistical models that help computers learn from and make predictions or decisions based on data

24
Q

deep learning

A

specialized area within machine learning that uses neural networks with many layers to analyze and interpret complex data patterns. Mimics the human brain in tasks like image and speech recognition and language

25
Q

data features

A

individual measurable properties or characteristics of the data used in ML models. They are the inputs that algorithms analyze

26
Q

perceptron

A

an artificial neuron and one of the simplest forms of a neural network
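
A minimal Python sketch of the idea; the weights, bias, and step activation below are illustrative, not from the card:

def perceptron(inputs, weights, bias):
    # weighted sum of inputs plus bias, passed through a step activation
    total = sum(x * w for x, w in zip(inputs, weights)) + bias
    return 1 if total > 0 else 0

# Example: hand-picked weights make this perceptron act as an AND gate
print(perceptron([1, 1], weights=[0.5, 0.5], bias=-0.7))  # 1
print(perceptron([1, 0], weights=[0.5, 0.5], bias=-0.7))  # 0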

27
Q

activation function

A

mathematical function used in neural networks to determine the output of a neuron based on the input

28
Q

Parameters

A

internal variables of a model that are learned from training

29
Q

multilayer perceptron

A

(MLP) a type of artificial neural network that consists of multiple layers of nodes, or neurons, including an input layer, one or more hidden layers, and an output layer

30
Q

feature detection

A

process of identifying and extracting important characteristics or patterns from raw data, which can be used for further analysis or modeling

31
Q

ANN model loss

A

a measure of how well an ANN is performing during training. It quantifies the difference between predicted outputs and actual target values from the training data. Lower loss is better

32
Q

Hallucinations

A

when AI models generate fabricated, made-up outputs that sound plausible

33
Q

Self supervised learning

A

machine learning where models predict parts of the data from other parts, without needing labeled data.

34
Q

Training rate

A

a hyperparameter in machine learning that determines how much a model’s weights are updated during training in response to the calculated error (commonly called the learning rate).
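
A one-line sketch of how this rate is used in a gradient-descent weight update; the numbers are made up for illustration:

weight = 2.0
gradient = 0.8            # d(loss)/d(weight), computed by backpropagation
learning_rate = 0.1       # the training/learning rate
weight = weight - learning_rate * gradient   # updated weight: 1.92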

35
Q

LLM completion

A

the process of generating text based on a prompt

36
Q

open source

A

publicly available for modification, use, and distribution

37
Q

Fine-tuning for instruction

A

Take a pre-trained model and give it additional training on a specific topic/dataset (e.g., instruction-following examples)

38
Q

Model alignment

A

ensuring AI outputs correspond with human values and expectations

39
Q

Supervised learning

A

model is trained on labeled data. Clear examples of desired outcomes help the model make accurate predictions in real-world scenarios

40
Q

Reinforcement Learning from Human Feedback

A

machine learning with human feedback instead of solely predefined rewards

41
Q

Foundation model

A

large-scale model trained on broad data, commonly used as a parent model for more specialized models

42
Q

Semantic embedding

A

a technique that transforms words, phrases, or documents into vectors in a high-dimensional space, capturing meanings and relationships
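
A small sketch of how embeddings are typically compared, using cosine similarity; the vectors below are made up for illustration:

import math

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

king  = [0.80, 0.65, 0.10]   # toy embedding vectors
queen = [0.78, 0.70, 0.12]
print(cosine_similarity(king, queen))  # ~0.99, i.e. semantically close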

43
Q

attention mechanism

A

a technique where models focus on specific parts of input data when making predictions

44
Q

Autoregression

A

the model predicts the next value in a sequence based on the previous values

45
Q

digital twin

A

a virtual representation of a physical object, system, or process that uses real-time data to simulate its behavior and performance

46
Q

Latency

A

time delay between the input being provided to an AI system and the output being generated (includes processing time, poor connection, etc)

47
Q

AI use case

A

specific application or scenario where ai tech is implemented to solve a particular problem

48
Q

GLUE benchmark

A

General Language Understanding Evaluation; assesses models on a variety of language-understanding tasks; measures language comprehension skills

49
Q

SQuAD Benchmark

A

Stanford Question Answering Dataset; reading comprehension and question answering; given passages, can the model extract specific answers; measures retrieval skills

50
Q

RACE benchmark

A

Reading Comprehension from Examinations; evaluates reading comprehension and whether AI can mimic human-like understanding and interpretation of complex texts

51
Q

5 AI accuracy metrics

A

quantitative metrics:
1. Accuracy
2. Precision
3. Recall (sensitivity)
4. F1 Score
5. AUC-ROC

52
Q

AI model error rate

A

frequency of incorrect predictions made by an AI model; a lower error rate means a more accurate model

53
Q

Model Perplexity

A

how well a probability model predicts a sample; quantifies the uncertainty or unpredictability of a model when generating text; a lower perplexity score means the model is better at predicting the next word in a sequence
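
A short sketch of the calculation, assuming made-up probabilities that the model assigned to each actual next word:

import math

probs = [0.25, 0.5, 0.1, 0.4]   # model's probability for each true token
avg_neg_log = -sum(math.log(p) for p in probs) / len(probs)
perplexity = math.exp(avg_neg_log)
print(perplexity)   # ~3.8; a lower value means better next-word prediction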

54
Q

BLEU score

A

Bilingual Evaluation Understudy; evaluates machine translation quality on a 0–1 scale, with scores closer to 1 being better

55
Q

AI Recall/sensitivity

A

how many of the actual positive cases the model successfully identified; critical where missing a positive (like a medical diagnosis) is costly

56
Q

F1 score

A

combines precision and recall; how well a model performs when both false positives and false negatives are important; a high F1 score is good; precision (accuracy of positive predictions); recall (ability to identify all positive cases)
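
A small sketch of the calculation from made-up counts of true positives (TP), false positives (FP), and false negatives (FN):

TP, FP, FN = 40, 10, 20
precision = TP / (TP + FP)                          # 0.8
recall    = TP / (TP + FN)                          # ~0.67
f1 = 2 * precision * recall / (precision + recall)  # harmonic mean of the two
print(f1)                                           # ~0.73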

57
Q

AI model Efficiency

A
  1. Inference time: time to make predictions
  2. Model size: amount of memory required to store the model
  3. Energy consumption: how much energy the model uses during training and inference
  4. Throughput: number of predictions the model can make in a given time period
58
Q

key aspects of load handling

A

ability of an AI system to manage and process varying amounts of data or requests efficiently;
1. Scalability
2. Throughput
3. Latency
4. Load balancing: distributing incoming requests across multiple instances
5. Fault tolerance: continuing to function correctly even when some components fail

59
Q

Kubernetes

A

automates deployment, scaling, and management of containerized applications; manages resources

60
Q

Deployment elasticity

A

ability to dynamically adjust its resources based on demand; efficiently handle varying workloads

61
Q

Needle in a haystack

A

the challenge of finding a specific piece of valuable information or insight within a vast amount of data

62
Q

Agentic workflow

A

AI agents autonomously perform tasks and make decisions within a defined workflow, often working alongside human users

63
Q

Low rank adapters

A

instead of retraining an entire model, low-rank adapters introduce small, trainable modules (adapters) that can be inserted into the existing architecture
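
A rough numpy sketch of the idea, with illustrative dimensions (not from any particular model): the large pretrained matrix W stays frozen and only the two small matrices A and B are trained.

import numpy as np

d, r = 512, 8                       # model dimension, low rank r << d
W = np.random.randn(d, d)           # frozen pretrained weight matrix
A = np.random.randn(d, r) * 0.01    # small trainable adapter matrices
B = np.zeros((r, d))                # starts at zero, so the adapter begins as a no-op
W_adapted = W + A @ B               # effective weights used at inference
print(W_adapted.shape)              # (512, 512); only A and B get updated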

64
Q

context size/window

A

amount of info/tokens a model can consider at one time when processing input data

65
Q

FLOPS and TOPS

A

FLOPS: floating-point operations per second; a measure of a computer’s performance on complex calculations: how well can we train and deploy these AI systems

TOPS: trillions of operations per second; measures the performance of computing systems; used for comparing AI hardware

66
Q

Model quantization

A

reduce the size and computational requirements of models; more efficient

66
Q

Edge computing

A

process data closer to the source of data generation rather than centralized data centers; allows for quicker data processing and less latency

67
Q

LLM distillation

A

create smaller more efficient versions of LLMs while retaining much of the performance

68
Q

Hugging face

A

Open-source tools and models; large transformers library

69
Q

Interpreted software code

A

programming code that is executed line-by-line by an interpreter rather than being compiled into machine code beforehand; easier to read, but slower than compiled code

70
Q

compiled software code

A

programming code that is transformed into machine code by a compiler before it’s executed; creates a file that a computer can run directly on its hardware

71
Q

Assembly language

A

low-level programming language that is close to machine code (similar to binary); harder to write in but more customizable

72
Q

binary executable

A

machine readable file that doesn’t need further translation or interpretation

73
Q

MIT License

A

permissive free software license that allows developers to use, modify, and distribute software with few restrictions; just need to include the original copyright notice

74
Q

Apache 2.0 license

A

open-source software license; users do not need to disclose their own source code; proper attribution is required; has more terms than MIT but doesn’t require derivative works to be open source

75
Q

GNU general public license (GPL)

A

copyleft license where any modified versions of the software must also be distributed under the GPL license

76
Q

GNU Lesser General Public License (LGPL)

A

allows LGPL software to be linked with proprietary software without forcing the proprietary software to also be licensed LGPL; can take advantage of LGPL software without giving up proprietary rights

77
Q

OpenRAIL

A

free use and redistribution of software, but restricted to ethical uses

78
Q

LMOps

A

analogous to DevOps in software development. processes involved in managing, deploying, and maintaining language models

79
Q

Prompt caching

A

store previously used prompts and their corresponding responses to retrieve those responses for future prompts

80
Q

attention mechanism

A

model focuses on specific parts of the input data when generating an output; the model learns to weigh the importance of different parts of the input based on the context of the task, producing weighted sums of the input
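
A compact numpy sketch of scaled dot-product attention, the usual form of this mechanism; shapes and random values are illustrative:

import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def attention(Q, K, V):
    scores = Q @ K.T / np.sqrt(K.shape[-1])  # how well each query matches each key
    weights = softmax(scores)                # importance of each input position
    return weights @ V                       # weighted sum of the values

Q = K = V = np.random.randn(4, 8)            # 4 tokens, dimension 8
print(attention(Q, K, V).shape)              # (4, 8)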

81
Q

positional encoding

A

indicates position/order of elements in a sequence

82
Q

JSON object

A

JavaScript Object Notation; easy for us to read and easy for machines to generate

83
Q

AI temperature

A

how random the responses are

84
Q

Run AI Model Locally

A

Executing an AI algorithm directly on a local machine (e.g., a laptop or on-prem server), without sending data to the cloud. This allows for faster inference times, improved data privacy, and offline capabilities but often requires more powerful hardware.

85
Q

Host computer

A

the primary machine that runs virtual machines, containers, or locally hosted AI models

86
Q

Edge computing

A

where data is processed at or near the source of generation, rather than in centralized cloud servers. essential for low latency applications

87
Q

User interface

A

the point of interaction between the user and a computer system; chatbots, dashboards, apps

88
Q

Client server configuration

A

a network where a client makes requests to a centralized server. common in web based ai tools where the model is on the server and accessed via the web

89
Q

Front end software

A

the visible and interactive part of a software application that users interact with. handles the presentation layer and sends inputs to the back-end

90
Q

Back end software

A

handles business logic, database interactions, and processing tasks; the back end hosts models, processes input data, and returns data to the front end

91
Q

Hosting provider

A

a company or platform that offers infrastructure for storing and running software or websites. ex: Google

92
Q

Software environment

A

the setup in which a software application runs, including operating system, libraries, dependencies, and configurations

93
Q

Software as a service (SaaS)

A

a cloud-based delivery model where users access applications via a web browser without installing anything locally

94
Q

Server farm

A

a group of networked servers housed in one location used to provide large-scale computing resources. ai models are trained and deployed using such server clusters

95
Q

Economies of scale

A

the cost advantages that arise with increased production; where cloud providers operate vast server farms to reduce the per unit cost of computation and storage

96
Q

Hypervisor

A

software that enables virtualization by allowing multiple virtual machines to run on a single host. it allocates hardware resources to each VM

97
Q

Virtual machines (VMs)

A

software emulations of physical computers. each VM has its own OS and can run applications independently. useful for isolating environments in ai experiments

98
Q

Virtual Private Servers

A

a virtual server sold as a service by hosting providers. offers more control and isolation than shared hosting and is often used for hosting ai models with moderate traffic

99
Q

Containerization

A

a lightweight form of virtualization that packages software and its dependencies into containers. unlike VMs, containers share the host OS, making them faster and more efficient

100
Q

Docker

A

a popular platform for building, packaging, and running containerized applications. used in ai development to ensure models are portable and consistent

101
Q

multi cloud software deployment

A

deploying applications across multiple cloud platforms. reduces reliance on a single provider and can optimize performance and cost for ai services

102
Q

Kubernetes

A

an open-source system for automating the deployment, scaling, and management of containerized applications. used in managing ai services at scale

103
Q

Kubernetes auto scaling

A

a feature that automatically adjusts the number of containers or resources based on demand.

104
Q

AI model latency

A

the time it takes for an ai model to return a result after receiving input. lower latency is crucial for real-time applications like voice assistants or fraud detection systems

105
Q

self hosting

A

deploying and managing software or ai models on your own infrastructure instead of using cloud providers. more control and privacy, but requires more technical know-how

106
Q

SERP (Search engine results page)

A

the page displayed by search engines in response to the user’s query; AI is used to rank the results

107
Q

middleware

A

software that acts as a bridge between different systems or applications. example: connecting the UI to the AI model

108
Q

Thin client

A

a minimal computing device that relies heavily on a central server for processing. often used in enterprise settings with centralized ai processing

109
Q

thick client

A

fully functional device that performs most computing tasks locally. clients don’t depend on constant server access

110
Q

microservice

A

a small, independent component of a larger application that performs one function. ai models are often deployed as microservices so they can be updated or scaled independently

111
Q

Poe AI platform

A

allows users to interact with AI models from various providers (like OpenAI, Anthropic, etc.) in a unified chat interface. It supports integration and comparison of model outputs.

112
Q

Zapier AI Platform

A

A no-code automation platform that integrates various software tools. With AI integrations, it can automate tasks like generating content or categorizing inputs using language models.

113
Q

AI system integrations

A

The process of embedding AI functionality into existing systems like CRM, ERP, or web applications. This requires APIs, middleware, and thoughtful workflow design.

114
Q

Ollama

A

A platform for running large language models locally. It simplifies downloading, running, and experimenting with AI models like LLaMA on your own machine.

115
Q

GPU (graphics processing unit)

A

Originally designed for rendering graphics, GPUs are now heavily used for AI model training and inference due to their parallel processing capabilities.

116
Q

GPU Cluster

A

A group of GPUs working together, often distributed across machines, to speed up AI computations, especially model training and large batch inference.

117
Q

Tensor Processing Units (TPUs)

A

Custom hardware by Google designed specifically for AI and machine learning workloads. They’re optimized for matrix operations typical in neural networks.

118
Q

Neural Processing Units (NPUs)

A

Chips designed to accelerate neural network processing, commonly found in mobile and edge devices to allow local AI inference with low power consumption.

119
Q

SRS Document (software requirements specification)

A

A detailed description of a software system’s functionality, constraints, and environment. It outlines what the AI system should do, including inputs, outputs, and performance expectations—serving as a contract between stakeholders and developers.

120
Q

DevOps

A

A combination of “Development” and “Operations,” DevOps is a set of practices that aims to automate and integrate software development and IT operations. It promotes continuous integration, continuous delivery (CI/CD), collaboration between teams, and faster release cycles—critical for AI tools needing constant updates and improvements.

121
Q

Software deployment elasticity

A

The system’s ability to dynamically allocate or release resources (e.g., computing power or memory) based on current demand. In AI deployments, elasticity ensures cost-effectiveness by scaling resources up during high traffic and down during low usage.

122
Q

Software deployment scaling

A

The ability of an application to handle increased workload or user traffic by upgrading its infrastructure. This can be vertical (more powerful machines) or horizontal (more instances). AI services benefit from scaling when many users query models simultaneously.

123
Q

Software Latency

A

The delay between a user’s action (like submitting a prompt) and the software’s response. In AI systems, latency is affected by model size, compute power, network conditions, and system design. Lower latency = better user experience.

124
Q

software usability

A

Refers to how easy and efficient it is for users to interact with a software application. In AI, good usability means non-technical users can still get accurate results or insights without needing deep knowledge of machine learning.

125
Q

Prompt engineering

A

The art and science of crafting input prompts to get desired outputs from AI models—especially large language models (LLMs). Effective prompt engineering considers wording, structure, examples, and constraints

126
Q

System prompt

A

A predefined instruction embedded into the AI model to set its tone, behavior, or scope (e.g., “You are a helpful assistant”). It shapes how the model interprets and responds to user queries.

127
Q

AI model temperature metaparameter

A

A setting that controls the randomness of an AI model’s output. A low temperature (e.g., 0.2) makes answers more focused and deterministic. A high temperature (e.g., 0.9) introduces more creativity and variation.
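
A sketch of what temperature does mathematically: it divides the model’s raw scores (logits) before they are turned into probabilities. The logits below are made up:

import numpy as np

def probs_with_temperature(logits, temperature):
    scaled = np.array(logits) / temperature
    e = np.exp(scaled - scaled.max())   # softmax over the rescaled scores
    return e / e.sum()

logits = [2.0, 1.0, 0.1]
print(probs_with_temperature(logits, 0.2))  # ~[0.99, 0.01, 0.00], near-deterministic
print(probs_with_temperature(logits, 1.5))  # probabilities spread much more evenly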

128
Q

AI Playground

A

An interactive web environment for testing and experimenting with AI models, often provided by companies like OpenAI or Hugging Face. These tools help users understand how AI models behave and respond to prompts.

129
Q

No-code frameworks

A

Platforms that let users build apps, workflows, or AI integrations without writing code. They’re useful for business teams and operations managers to implement automation or deploy AI tools quickly (e.g., Zapier, Bubble).

130
Q

Low code frameworks

A

Platforms that require minimal coding to build software. They offer more flexibility than no-code while still reducing the need for deep technical knowledge. Ideal for rapidly prototyping AI solutions.

131
Q

AI Model agnostic

A

Refers to software or platforms that work with multiple types of AI models, regardless of their architecture or provider. This enables flexibility to swap models like GPT-4, Claude, or LLaMA based on use case or performance.

132
Q

RAG (retrieval-augmented generation)

A

An AI technique where the model pulls information from external sources (like a database or document store) before generating a response. This improves factual accuracy and enables dynamic answers grounded in current or domain-specific data.
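
A toy, self-contained sketch of the pattern; a real system would use embeddings, a vector database, and an actual LLM, but here retrieval is faked with simple word overlap:

documents = [
    "The return policy allows refunds within 30 days.",
    "Shipping takes 5-7 business days.",
    "Support is available by email only.",
]

def retrieve(question, docs, k=1):
    # crude stand-in for vector search: rank documents by shared words
    overlap = lambda d: len(set(question.lower().split()) & set(d.lower().split()))
    return sorted(docs, key=overlap, reverse=True)[:k]

question = "what is the return policy for refunds"
context = "\n".join(retrieve(question, documents))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
print(prompt)   # this grounded prompt would then be sent to the language model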

133
Q

Process workflows

A

A defined sequence of tasks or steps within a system or business process. AI can automate, monitor, or optimize workflows to save time, reduce errors, and increase scalability.

134
Q

MindStudio AI

A

A platform (typically no-code or low-code) designed to create and deploy custom AI workflows or apps. It allows users to build AI tools, chatbots, or agents by configuring behavior without deep programming knowledge.

135
Q

Google Colab

A

A cloud-based Jupyter notebook environment that supports Python and GPU/TPU acceleration. Popular for prototyping machine learning models and testing AI code collaboratively.

136
Q

Software Package

A

A bundled set of code, libraries, and configuration files that can be installed to add functionality.

137
Q

JSON Format

A

Stands for JavaScript Object Notation. A lightweight data-interchange format that is easy to read and write. AI systems use JSON to structure inputs/outputs, store model responses, and send data via APIs.
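
A small Python sketch of writing and reading JSON; the field names are illustrative, not any particular API’s schema:

import json

request = {"model": "example-model", "prompt": "Hello", "temperature": 0.7}
text = json.dumps(request, indent=2)       # dict -> human-readable JSON string
parsed = json.loads(text)                  # JSON string -> dict
print(parsed["prompt"])                    # "Hello"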

138
Q

Software Endpoints

A

URLs or addresses where APIs receive requests and return responses. In AI, endpoints often refer to access points for querying a model

139
Q

API Key

A

A unique string used to authenticate a user or application when accessing a software API. In AI systems, an API key ensures secure access to model endpoints and usage tracking.
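
A sketch of the common pattern of sending a key in an Authorization header; the URL, key, and payload are placeholders, not a real provider’s endpoint:

import requests

# The key would normally come from an environment variable or secret store, never hard-coded.
API_KEY = "sk-example-not-a-real-key"
response = requests.post(
    "https://api.example.com/v1/generate",           # hypothetical endpoint
    headers={"Authorization": f"Bearer {API_KEY}"},  # the key authenticates the caller
    json={"prompt": "Hello"},
)
print(response.status_code)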

140
Q

LLM Toxicity

A

Refers to the presence of harmful, offensive, or biased language in responses generated by Large Language Models (LLMs). Managing toxicity is essential for maintaining ethical AI usage

141
Q

LLM Moderation

A

The techniques and systems used to filter or flag inappropriate, harmful, or unsafe content generated by LLMs. It may involve human review or automatic moderation tools.

142
Q

Agentic workflow

A

A workflow that involves one or more AI agents carrying out tasks or decision-making processes. This may include coordination between multiple agents, or chaining multiple model actions for complex tasks (e.g., retrieve → analyze → generate → act).

143
Q

Prompt based learning

A

A method of training or using AI models where behavior is guided solely by the prompt, rather than altering the model’s internal parameters. Useful for adapting pre-trained models to new tasks without re-training.

144
Q

In context learning

A

The ability of LLMs to learn patterns and make predictions based solely on examples given in the prompt, without updating model weights. For instance, providing examples of Q&A in the prompt allows the model to generalize to a new question.

145
Q

Interactive prompting

A

The use of back-and-forth conversation (multiple turns) with an LLM to iteratively refine or clarify outputs. It’s common in chatbot interfaces or collaborative tasks.

146
Q

Shot prompting (zero-shot, one-shot, few-shot, n-shot prompting)

A

A method that describes how many examples are included in the prompt to guide model behavior.
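
Illustrative prompts only (the examples are made up): zero-shot gives no examples, while few-shot includes a couple of worked examples before the real task:

zero_shot = "Classify the sentiment of: 'The battery dies too fast.'"

few_shot = """Classify the sentiment of each review.
Review: 'Love this phone!' -> positive
Review: 'Screen cracked in a week.' -> negative
Review: 'The battery dies too fast.' ->"""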

147
Q

Chain of thought prompting

A

A method where the prompt guides the model to generate intermediate reasoning steps, not just the final answer. Helps improve performance on complex reasoning tasks.

148
Q

Chain of reasoning prompt

A

Similar to chain-of-thought, but often used more formally to guide logical progression through substeps. Especially useful for solving multi-part or technical problems.

149
Q

Role prompting

A

Setting a context or role for the AI, like “You are a helpful tutor” or “Act like a data scientist.” This helps steer tone, perspective, and behavior in multi-turn interactions.

150
Q

Stochastic

A

Refers to randomness in a process. LLMs are stochastic models, meaning the same prompt may return different outputs due to probabilistic sampling (especially at higher temperature settings).

151
Q

Stochastic parrot

A

A term coined by researchers to critique LLMs as merely repeating patterns found in training data without true understanding—parroting information in a statistically likely way.

152
Q

Deterministic

A

Opposite of stochastic: a process that always gives the same output for the same input. In AI, setting temperature to 0 usually makes models behave deterministically.

153
Q

Emergent behavior

A

Unexpected capabilities that appear when LLMs reach a certain scale—e.g., performing math or translation even though they weren’t explicitly trained for it. These behaviors are not directly programmed but “emerge” from the model’s structure and training data.

154
Q

Architecture of an LLM

A

Refers to the underlying design and structure of a large language model. Most LLMs use a transformer architecture, with layers of attention mechanisms that process input sequences in parallel and understand context better than earlier models.

155
Q

Model fine tuning

A

The process of continuing to train a pre-trained AI model on a new, more specific dataset. This customizes the model for a particular use case (e.g., legal or medical language).

156
Q

Context window

A

The maximum number of tokens (words or symbols) a model can process at once. LLMs have limits (e.g., 4k, 16k, or 128k tokens), and exceeding the context window means earlier content may be truncated or lost.

157
Q

Stateless

A

Describes a system that doesn’t retain memory of previous interactions unless that context is explicitly provided again. Many LLMs operate statelessly unless memory is implemented at the application level.

158
Q

Reasoning models

A

AI systems (or LLMs configured in special ways) that are optimized to solve logical, mathematical, or multi-step problems by breaking down and reasoning through intermediate steps.

159
Q

Structured Query Language (SQL)

A

A standard language for accessing and manipulating relational databases. AI systems might generate or interpret SQL queries in data-centric applications.

160
Q

Python

A

A popular programming language widely used in AI and data science

161
Q

Multimodal models

A

AI models that can process and integrate multiple types of input data (modalities) such as text, images, audio, or video.

162
Q

AI winter

A

A historical period when interest and funding in AI drastically declined due to unmet expectations or lack of progress

163
Q

Fully connected ANN

A

A neural network where every neuron in one layer is connected to every neuron in the next layer. Common in early neural networks and some dense layers in deep learning models. Less efficient for image data due to high parameter counts.

164
Q

Loss function

A

A mathematical formula used during training to measure the difference between the AI model’s prediction and the actual (true) value. The model aims to minimize this loss to improve performance.
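
A sketch of one common loss function, mean squared error, on made-up predictions and targets:

predictions = [2.5, 0.0, 2.0]
targets     = [3.0, -0.5, 2.0]
mse = sum((p - t) ** 2 for p, t in zip(predictions, targets)) / len(targets)
print(mse)   # ~0.17; training adjusts the model to push this toward 0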

165
Q

Backpropagation

A

A key algorithm for training neural networks. It computes gradients of the loss function with respect to each weight in the network, allowing the model to adjust weights in the right direction using an optimizer like gradient descent.

166
Q

Edge detection (in AI vision models)

A

The process of identifying points in an image where brightness changes sharply—edges represent boundaries of objects. Edge detection is often an early layer function in vision models, enabling higher-level features like shape recognition.

167
Q

Convolution (processing of AI data)

A

A mathematical operation that slides a filter (kernel) over input data (e.g., an image) to extract localized features like edges or textures. It’s the core building block of Convolutional Neural Networks (CNNs).
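
A small numpy sketch: sliding a hand-written 3x3 vertical-edge kernel over a toy grayscale image; real CNNs learn their kernel values during training:

import numpy as np

image = np.array([[0, 0, 0, 255, 255, 255]] * 4, dtype=float)  # dark half | bright half
kernel = np.array([[-1, 0, 1],
                   [-1, 0, 1],
                   [-1, 0, 1]], dtype=float)                    # responds to vertical edges

h, w = image.shape
out = np.zeros((h - 2, w - 2))
for i in range(h - 2):
    for j in range(w - 2):
        out[i, j] = np.sum(image[i:i+3, j:j+3] * kernel)        # slide the filter
print(out)   # large responses exactly where the dark-to-bright edge sits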

168
Q

Kernel (in a convolutional neural network)

A

A small matrix (filter) used in the convolution operation to extract specific features from input data, such as edges, curves, or textures. Different kernels specialize in detecting different features.

169
Q

Feature (in a CNN)

A

A characteristic or pattern (e.g., edges, shapes, textures) detected by filters (kernels) at various layers. As you move deeper in the network, the features become more abstract (e.g., eyes, faces, objects).

170
Q

Feature detection

A

The process of identifying meaningful patterns in the input data. In CNNs, early layers detect low-level features (edges, gradients), while deeper layers detect complex structures (faces, animals, objects).

171
Q

Convolutional Neural Network

A

A deep learning architecture specialized for image and spatial data. It uses convolutional layers, pooling layers, and fully connected layers to automatically learn features and perform tasks like image classification, object detection, and segmentation.

172
Q

Image downsampling

A

The process of reducing an image’s resolution or size by removing pixels. This reduces computational load and highlights general patterns while removing fine details. Often used in CNNs via pooling layers.

173
Q

Image classification

A

A task in which an AI model assigns a label or category to an image (e.g., “cat”, “car”, “stop sign”). CNNs are commonly used for this task, trained on labeled datasets.

174
Q

CLIP (Contrastive Language-Image Pretraining)

A

An AI model developed by OpenAI that learns to match images with text descriptions. It’s trained to associate a caption with the correct image and distinguish it from mismatched ones, enabling tasks like zero-shot image classification or search.

175
Q

Generative adversarial network (GAN)

A

An architecture with two networks: a generator that creates synthetic data (e.g., images), and a discriminator that tries to detect whether data is real or fake. They compete in a “game” until the generator produces highly realistic data. GANs are behind many deep fakes and AI art.

176
Q

Diffusion models

A

A class of generative models that learn to generate data by reversing a process that gradually adds noise to input (diffusion). They start with noise and iteratively denoise to create realistic outputs—used in tools like DALL·E 2 and MidJourney.

177
Q

Image denoising

A

A process where AI removes visual noise from an image, often used in pre-processing or as a task for diffusion models. It enhances clarity and quality of visuals while preserving important features.

178
Q

Neural style transfer (NST) models

A

Models that recombine the style of one image (e.g., brush strokes of Van Gogh) with the content of another (e.g., a selfie). These models use CNNs to separate and recombine style and content layers of images.

179
Q

Text to speech (TTS)

A

Technology that converts written text into spoken audio using neural networks. Modern TTS systems, often powered by deep learning, can produce human-like voices and emotions.

180
Q

Deep fake

A

AI-generated or modified synthetic media—especially videos—where someone’s appearance or voice is altered to resemble someone else. Based on GANs and other generative models, deep fakes can be used for both creative and malicious purposes.

181
Q

NSFW (not safe for work)

A

A label for content that is inappropriate in professional or public settings, typically including violence, nudity, or explicit language. AI models must filter or block NSFW content when generating text or images to ensure safety and compliance.

182
Q

Hallucinations (in AI)

A

When an AI model generates factually incorrect, fabricated, or misleading information while sounding confident. This is a known issue in LLMs that stems from how they predict the next word statistically, not logically.

183
Q

Software access management

A

Processes and tools used to control who can access software systems, features, or data. Includes authentication (who you are) and authorization (what you can do), and is essential for securing AI tools and platforms.

184
Q

Software firewall

A

A security system that monitors and controls incoming and outgoing network traffic based on predetermined rules. It protects AI systems and servers from unauthorized access or malware.

185
Q

Data encryption

A

The process of encoding data so that only authorized parties can access it. AI applications use encryption to protect sensitive inputs/outputs and stored data from being intercepted or stolen.

186
Q

Secure socket layer (SSL)

A

An older protocol (now succeeded by TLS) used to encrypt communication between a user’s browser and a web server. It ensures that data sent to/from AI platforms is secure and private.

187
Q

Public key encryption

A

A type of encryption using a pair of keys: a public key to encrypt data and a private key to decrypt it. Used in secure communications for AI APIs, login credentials, and sensitive data transfers.

188
Q

Packet sniffing

A

A technique where network traffic is monitored to capture data packets. Malicious actors may use sniffing to steal unencrypted data; AI systems need protection via encryption and secure protocols.

189
Q

Software logging

A

The process of recording events, errors, and user actions in a system. In AI, logs are crucial for debugging, monitoring usage, detecting attacks, and ensuring accountability.

190
Q

Intellectual Property (IP)

A

Creations of the mind—such as inventions, designs, symbols, or software—that are legally protected. In AI, IP can include code, trained models, datasets, and even prompts or generated content.

191
Q

IP Leakage

A

When proprietary or confidential information is unintentionally exposed or inferred from AI model outputs. Can happen through LLMs trained on sensitive data or careless system configurations.

192
Q

Patents

A

Legal protections granted to inventors, allowing exclusive rights to an invention for a limited time. Patents can cover AI algorithms, systems, or specific technical solutions.

193
Q

Patent novelty

A

A requirement that the invention must be new, not publicly disclosed or used before the filing date. If a method has already been shared (even on GitHub), it may not be patentable.

194
Q

Patent non obviousness

A

The invention must not be an obvious solution to someone with standard skills in the field. In AI, combining existing models in a slightly different way might not qualify unless it solves a problem in a novel way.

195
Q

Patent utility

A

The invention must be useful and have a practical application. This is usually easy to demonstrate for software and AI inventions that perform real tasks or improve efficiency.

196
Q

Trade secrets

A

Proprietary processes, data, or methods kept confidential to maintain a competitive edge. Unlike patents, trade secrets aren’t registered but must be protected from disclosure or theft (e.g., Google’s search algorithm).

197
Q

Public domain

A

Refers to content or inventions that are not protected by IP laws and can be freely used by anyone. AI-generated work may or may not qualify for IP protection, depending on the jurisdiction.

198
Q

AI Model transparency

A

The degree to which the workings of an AI model can be explained, understood, or inspected. Transparency is key to ethical AI, especially in high-risk domains like healthcare or finance.

199
Q

LLM Filters

A

Algorithms or rule-based systems that restrict or modify the inputs/outputs of large language models to prevent harmful or unwanted content (e.g., profanity, misinformation, NSFW material).

200
Q

LLM input/output moderation

A

Mechanisms for reviewing and managing the prompts users send to a model and the responses it returns. This may include auto-moderation tools or human oversight.

201
Q

Data poisoning

A

A type of attack where malicious data is inserted into the model’s training set to corrupt or bias its behavior. For example, inserting harmful associations into a chatbot’s training data.

202
Q

Adversarial perturbations

A

Tiny, carefully designed changes to input data (like an image) that cause a model to misclassify or behave incorrectly—even if humans don’t notice the difference. A key threat in AI vision systems.

203
Q

Adversarial training

A

A defense technique where models are trained on adversarial examples to become more robust against these attacks. Often used in computer vision and cybersecurity-sensitive applications.

204
Q

Black hat hacker

A

A malicious actor who exploits vulnerabilities in software or AI systems for unethical or illegal purposes, such as stealing data, manipulating outcomes, or spreading misinformation.

205
Q

Data privacy

A

The right and practice of ensuring that individuals’ personal data is collected, processed, and shared in a secure and lawful manner. In AI, data privacy is vital when using user-generated content, medical records, or personal profiles for training or inference.

206
Q

PII (Personally identifiable information)

A

Any information that can identify an individual, such as names, addresses, email addresses, social security numbers, and biometric data. AI systems that process PII must comply with regulations like GDPR or CCPA to prevent misuse or exposure.

207
Q

Copyright law

A

Legal protection granted to creators of original works, such as text, images, code, music, and more. Copyright gives authors exclusive rights to reproduce, distribute, and adapt their work, and is highly relevant in AI-generated content debates.

208
Q

Fixation (in copyright law)

A

For a work to be protected by copyright, it must be “fixed in a tangible medium” (e.g., written, recorded, saved digitally). Transient ideas or spoken words not recorded typically don’t qualify.

209
Q

Originality (in copyright law)

A

A copyrightable work must show a minimal degree of creativity and be independently created. AI-generated content that mimics existing works too closely may fail this standard.

210
Q

Idea expression dichotomy

A

A legal principle stating that ideas are not protected by copyright—only the expression of those ideas is. This means anyone can use the idea behind a copyrighted work but not copy how it’s expressed.

211
Q

Derivative work

A

A new creation that is based on or adapted from an existing copyrighted work (e.g., a remix, translation, or fan fiction). These works may still be subject to the original copyright holder’s permission.

212
Q

Common Crawl

A

A non-profit organization that regularly crawls and stores petabytes of public web data. It’s used by many AI companies (e.g., OpenAI, Meta) to train language models, though its inclusion of copyrighted content has raised legal and ethical concerns.

213
Q

Fair use (in copyright law)

A

A doctrine allowing limited use of copyrighted works without permission for purposes like criticism, commentary, teaching, and research. In AI, fair use is a legal gray area, especially when models are trained on copyrighted data.

214
Q

Open source licensing

A

Legal frameworks that allow software to be used, modified, and distributed freely under specific conditions. Examples include MIT, Apache 2.0, and GNU GPL. Open source fosters collaboration, but AI developers must understand the terms.

215
Q

Open Responsible AI License (OpenRAIL)

A

A newer license type developed to promote ethical use of open-source AI models. It allows sharing and usage while placing restrictions on harmful use cases (e.g., surveillance, weaponization).

216
Q

Creative commons (CC)

A

A set of licenses that enable creators to specify how others can use their work (e.g., share, remix, or use commercially). Ranges from CC-BY (attribution only) to CC0 (public domain dedication).

217
Q

Copyleft

A

A licensing approach that allows derivative works but requires them to remain under the same license. Ensures that modified versions of software or models remain open and free. Example: GNU General Public License (GPL).

218
Q

Indemnification

A

A legal agreement where one party agrees to compensate another for harm or liability. In AI contracts, software providers may offer indemnification if their tool causes legal trouble (e.g., IP infringement or data breach).

219
Q

Liability

A

Legal responsibility for harm or damages caused by an AI system or its outputs. Companies deploying AI must assess whether liability falls on the developers, users, or integrators—especially when AI makes autonomous decisions.

220
Q

Governance policies

A

Internal or organizational rules that ensure the ethical, legal, and responsible use of AI systems. These cover data use, bias mitigation, transparency, auditing, access control, and adherence to AI principles.

221
Q

AI Fairness

A

Ensuring AI systems do not produce biased or discriminatory outcomes against any group based on race, gender, age, or other attributes. Fairness is a key consideration in training data, model evaluation, and deployment.

222
Q

AI Model Alignment

A

Ensuring that an AI model’s behavior aligns with human values, goals, and expectations. Misalignment occurs when models act in ways that are harmful, manipulative, or unexpected—even if technically correct.

223
Q

False positive

A

Occurs when an AI system incorrectly flags something as inappropriate, dangerous, or incorrect (e.g., marking harmless content as toxic). Important to minimize in moderation systems.

224
Q

Prompt hacking

A

Tricking an AI model into ignoring safety constraints or providing unintended responses by crafting misleading or manipulative prompts. Related to jailbreak attacks.

225
Q

AI Model jailbreaking

A

A more advanced version of prompt hacking where users bypass built-in restrictions of a model to make it behave in unsafe, unethical, or unauthorized ways.

226
Q

Constitutional AI

A

An approach where models are guided by a “constitution”—a set of ethical rules or principles that shape how they behave. Introduced by Anthropic, it’s an attempt to enforce value alignment through self-feedback rather than human oversight alone.

227
Q

Explainable AI

A

A set of techniques and tools that help humans understand how and why an AI system made a specific decision or prediction. Critical for trust, adoption, and compliance in regulated industries.

228
Q

Counterfactual explanations

A

A form of XAI where the system explains how an output would have changed if the input had been slightly different. Example: “If your income had been $5,000 higher, your loan would have been approved.”

229
Q

Local interpretable model agnostic explanations (LIME)

A

An XAI technique that explains a model’s prediction by approximating it with a simpler, interpretable model near the prediction. It works with any black-box model and is used for debugging and compliance.

230
Q

Attention mechanisms

A

A key innovation in transformer-based models like GPT. It allows the model to “attend” to relevant parts of the input when making predictions. Attention helps models handle long-range dependencies and understand context.

231
Q

LLM Ensemble approach

A

Combining multiple LLMs or variations of a model to improve reliability, reduce hallucinations, or increase robustness. The ensemble may vote, average responses, or select the best output from multiple generations.

232
Q

Model chaining

A

Connecting multiple AI models in sequence, where the output of one model becomes the input to another. Useful for multi-step reasoning, multi-modal generation, or agentic workflows.

233
Q

Long tail distribution

A

A statistical pattern where a small number of items (the “head”) are very popular, while a vast number of items (the “tail”) each have low frequency or demand. A few jobs make the majority of the money, while many jobs make little to no money

234
Q

Marginal cost

A

The cost of producing one additional unit of a product or service. In AI, the marginal cost of using a model (once trained) is often near zero—running an extra query costs very little, especially with scalable cloud infrastructure.

235
Q

Corpus

A

A large and structured collection of texts or data used to train language models. Common corpora include books, websites (like Common Crawl), news articles, and research papers. The quality, diversity, and bias of the corpus greatly affect model behavior.

236
Q

Internet of Things (IoT)

A

A network of physical devices (e.g., thermostats, fridges, vehicles, sensors) connected to the internet, able to collect and share data. When integrated with AI, IoT enables smart automation, predictive maintenance, and personalized services.

237
Q

Universal basic income

A

An economic proposal where all citizens receive a regular, unconditional payment from the government. UBI is often discussed in the context of AI-driven job automation and economic disruption as a potential solution for income stability.

238
Q

AI Echo Chamber

A

Occurs when AI systems, especially recommendation engines or chatbots, reinforce users’ existing beliefs or biases by repeatedly presenting similar views or responses. This can lead to information bubbles and polarization.

239
Q

Superintelligent AI

A

A hypothetical AI that surpasses human intelligence across all domains—logic, creativity, emotional intelligence, and more. It’s the subject of much debate in AI safety and philosophy due to its potential to reshape civilization or pose existential risks.

240
Q

AI singularity

A

A theorized future point when AI systems achieve recursive self-improvement, rapidly evolving beyond human understanding or control. It marks the potential moment of irreversible change in human society, sometimes seen as either utopian or catastrophic.

241
Q

Utopian view of AI

A

A positive outlook where AI improves lives, eliminates tedious labor, cures diseases, helps solve climate change, and creates a fairer, more abundant society. Often tied to visions of harmonious human-AI collaboration.

242
Q

Dystopian view of AI

A

A negative projection where AI leads to mass surveillance, authoritarian control, mass unemployment, misinformation, or loss of human autonomy. These fears are fueled by unchecked deployment, misalignment, or misuse of powerful models.