Terminology Flashcards
Watermarking
embedding unique, identifiable signals that are invisible to humans into AI-generated content
System card
explains how a group of models works together to form a system (similar to model cards)
Synthetic data
artificially created data that mimics the statistical properties of real-world data and minimizes privacy risks
Retrieval-augmented generation (RAG)
framework that enhances LLMs by supplementing the prompt with reference material that is generally not included in the training data, yielding more accurate outputs
(think uploading a doc for summary)
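A minimal sketch of the retrieve-then-generate flow; retrieve and llm are hypothetical placeholders for a document search step and a language model, not real library APIs:

```python
# Hypothetical RAG flow: look up reference material, add it to the prompt, then generate.
def answer_with_rag(question, retrieve, llm):
    context = retrieve(question)           # 1. fetch reference material (e.g. an uploaded doc)
    prompt = (f"Answer using only this context:\n{context}\n\n"
              f"Question: {question}")      # 2. supplement the prompt with that material
    return llm(prompt)                      # 3. generate a grounded answer
```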
Prompt engineering
intentional process of structuring detailed instructions, sequences, and keywords to obtain specific outputs
Prompt
user input or instruction to generate an output
Adaptive learning
ML model that learns a student's strengths and weaknesses to tailor personalized instruction and content
Variance
statistical measure of the spread of numbers from the average value
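As a quick reference (population variance; sample variance divides by n - 1 instead), for values x_1 … x_n with mean x̄:

```latex
\mathrm{Var}(x) = \frac{1}{n}\sum_{i=1}^{n}\left(x_i - \bar{x}\right)^2
```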
Random forest
Supervised ML algorithm that builds multiple decision trees from random subsets of the data and merges them to get more accurate and stable predictions
(useful for data sets with missing data)
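A minimal sketch assuming scikit-learn is available; the iris dataset and settings are just for illustration:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# 100 decision trees, each built on a random bootstrap sample of rows and a random
# subset of features; their votes are merged into a single, more stable prediction.
forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X_train, y_train)
print(forest.score(X_test, y_test))
```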
Greedy algorithm
Makes the locally optimal choice at each step for the immediate objective, ignoring whether that leads to the best long-term solution
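A toy illustration (hypothetical coin-change example): the greedy choice happens to be optimal for these denominations, but greedy strategies are not guaranteed to find the global optimum in general.

```python
# Greedy coin change: always take the largest coin that still fits (the locally optimal choice).
def greedy_change(amount, coins=(25, 10, 5, 1)):
    picked = []
    for coin in sorted(coins, reverse=True):
        while amount >= coin:
            amount -= coin
            picked.append(coin)
    return picked

print(greedy_change(68))  # [25, 25, 10, 5, 1, 1, 1]
```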
Entropy
Measure of unpredictability or randomness in an ML dataset
(Higher entropy == greater uncertainty in predictions)
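For reference, the Shannon entropy of a variable with class probabilities p_1 … p_k (0 when one class is certain, maximal when all classes are equally likely):

```latex
H = -\sum_{i=1}^{k} p_i \log_2 p_i
```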
Bootstrap aggregating
ML method that aggregates multiple versions of a model trained on random subsets of data to make it more stable and accurate
Also called “bagging”
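A small sketch of the bootstrap step (assuming NumPy); each resample draws rows with replacement, and one model would be trained per resample before aggregating their predictions:

```python
import numpy as np

rng = np.random.default_rng(0)
data = np.arange(10)  # stand-in for training rows

# Three bootstrap samples: same size as the original data, drawn with replacement.
bootstrap_samples = [rng.choice(data, size=len(data), replace=True) for _ in range(3)]
print(bootstrap_samples)
```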
Active learning
Subfield of ML where the algorithm chooses the data it learns from
Also called “query learning”
Algorithm
set of instructions and rules designed to perform a task
Corpus
large collection of texts and data AI uses to find patterns and make predictions
Inference
process by which a trained ML model produces outputs (predictions or decisions) from new data
Input data
data provided to the model, which is the basis of ML “learning”
Labeled data
data with labels, tags, or classes that provide context or meaning for the model
ML model
learned representation of patterns and relationships underlying the data
Training data
subset of data used to train an ML model, from which the model learns patterns and relationships it can use to make predictions
typically 60-80% of the data set
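A common way to carve out the split (a sketch assuming scikit-learn; the synthetic dataset is just for illustration):

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=100, random_state=0)

# Keep 80% of the rows for training; hold out the remaining 20% for evaluation.
X_train, X_test, y_train, y_test = train_test_split(X, y, train_size=0.8, random_state=0)
```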
Supervised learning
training a model on pre-labeled data
Example: spam or ham
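A toy spam/ham sketch (hypothetical example texts, assuming scikit-learn): the labels are known in advance, and the model learns to map text to them.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

texts = ["win a free prize now", "meeting at 3pm tomorrow",
         "claim your free reward", "lunch with the team"]
labels = ["spam", "ham", "spam", "ham"]  # pre-labeled training data

model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(texts, labels)
print(model.predict(["free prize waiting"]))  # likely "spam"
```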
Data labeling
Enriching data with labels for training, validating, and testing
Semi-supervised learning
using both labeled and unlabeled datasets to train a model to improve reliability while keeping costs down
Unsupervised learning
no pre-labeled data; the model extracts features and groups similar data points on its own
Example: group animals by type, color, or tails
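A minimal clustering sketch (assuming scikit-learn; the animal features are made up): no labels are given, so the algorithm groups points purely by similarity.

```python
import numpy as np
from sklearn.cluster import KMeans

# Hypothetical animals described by [weight_kg, tail_length_cm]
animals = np.array([[4, 25], [5, 30], [300, 90], [320, 100]])

# KMeans groups the rows into 2 clusters without being told what the groups mean.
clusters = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(animals)
print(clusters)  # e.g. small animals vs. large animals
```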