1 Deck Flashcards

1
Q

A/B testing

A

The process of testing two variations of the same web page to determine which page is more successful at attracting user traffic and generating revenue

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Access control

A

Features such as password protection, user permissions, and encryption that are used to protect a spreadsheet

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Accuracy

A

The degree to which the data conforms to the actual entity being measured or described

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Action-oriented question

A

A question whose answers lead to change

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Administrative metadata

A

Metadata that indicates the technical source of a digital asset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Agenda

A

A list of scheduled appointments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Algorithm

A

A process or set of rules followed for a specific task

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Analytical skills

A

Qualities and characteristics associated with using facts to solve problems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Analytical thinking

A

The process of identifying and defining a problem, then solving it by using data in an organized, step-by-step manner

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Attribute

A

A characteristic or quality of data used to label a column in a table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Audio file

A

Digitized audio storage usually in an MP3, AAC, or another compressed format

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

AVERAGE

A

A spreadsheet function that returns an average of the values from a selected range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Bad data source

A

A data source that is not reliable, original, comprehensive, current, and cited (ROCCC)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Bias

A

A conscious or subconscious preference in favor of or against a person, group of people, or thing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Big data

A

Large, complex datasets typically involving long periods of time, which enable data analysts to address far-reaching business problems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Boolean data

A

A data type with only two possible values, usually true or false

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Borders

A

Lines that can be added around two or more cells on a spreadsheet

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Business task

A

The question or problem data analysis resolves for a business

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

CAST

A

A SQL function that converts data from one datatype to another

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Cell reference

A

A cell or a range of cells in a worksheet typically used in formulas and functions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Clean data

A

Data that is complete, correct, and relevant to the problem being solved

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Cloud

A

A place to keep data online, rather than a computer hard drive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

COALESCE

A

A SQL function that returns non-null values in a list

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Compatibility

A

How well two or more datasets are able to work together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Completeness

A

The degree to which the data contains all desired components or measures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

CONCAT

A

A SQL function that adds strings together to create new text strings that can be used as unique keys

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

CONCATENATE

A

A spreadsheet function that joins together two or more text strings

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Conditional formatting

A

A spreadsheet tool that changes how cells appear when values meet specific conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Confidence interval

A

A range of values that conveys how likely a statistical estimate reflects the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Confidence level

A

The probability that a sample size accurately reflects the greater population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Confirmation bias

A

The tendency to search for or interpret information in a way that confirms pre-existing beliefs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Consent

A

The aspect of data ethics that presumes an individual’s right to know how and why their personal data will be used before agreeing to provide it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Consistency

A

The degree to which data is repeatable from different points of entry or collection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Context

A

The condition in which something exists or happens

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Continuous data

A

Data that is measured and can have almost any numeric value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Cookie

A

A small file stored on a computer that contains information about its users

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

COUNT

A

A spreadsheet function that counts the number of cells in a range that meet a specified criteria

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

COUNTIF

A

A spreadsheet function that returns the number of cells in a range that match a specified value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Cross-field validation

A

A process that ensures certain conditions for multiple data fields are satisfied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

CSV (comma-separated values) file

A

A delimited text file that uses a comma to separate values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Currency

A

The aspect of data ethics that presumes individuals should be aware of financial transactions resulting from the use of their personal data and the scale of those transactions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

Dashboard

A

A tool that monitors live, incoming data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Data

A

A collection of facts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

Data analysis

A

The collection, transformation, and organization of data in order to draw conclusions, make predictions, and drive informed decision-making

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

Data analysis process

A

The six phases of ask, prepare, process, analyze, share, and act whose purpose is to gain insights that drive informed decision-making

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

Data analyst

A

Someone who collects, transforms, and organizes data in order to draw conclusions, make predictions, and drive informed decision-making

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Data analytics

A

The science of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Data anonymization

A

The process of protecting people’s private or sensitive data by eliminating identifying information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

Data bias

A

When a preference in favor of or against a person, group of people, or thing systematically skews data analysis results in a certain direction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

Data constraints

A

The criteria that determine whether a piece of a data is clean and valid

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

Data design

A

How information is organized

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

Data-driven decision-making

A

Using facts to guide business strategy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

Data ecosystem

A

The various elements that interact with one another in order to produce, manage, store, organize, analyze, and share data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

Data element

A

A piece of information in a dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

Data engineer

A

A professional who transforms data into a useful format for analysis and gives it a reliable infrastructure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

Data ethics

A

Well-founded standards of right and wrong that dictate how data is collected, shared, and used

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

Data governance

A

A process for ensuring the formal management of a company’s data assets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

Data-inspired decision-making

A

Exploring different data sources to find out what they have in common

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

Data integrity

A

The accuracy, completeness, consistency, and trustworthiness of data throughout its life cycle

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

Data interoperability

A

The ability to integrate data from multiple sources and a key factor leading to the successful use of open data among companies and governments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

Data life cycle

A

The sequence of stages that data experiences, which include plan, capture, manage, analyze, archive, and destroy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

Data manipulation

A

The process of changing data to make it more organized and easier to read

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

Data mapping

A

The process of matching fields from one data source to another

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

Data merging

A

The process of combining two or more datasets into a single dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

Data model

A

A tool for organizing data elements and how they relate to one another

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

Data privacy

A

Preserving a data subject’s information any time a data transaction occurs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

Data range

A

Numerical values that fall between predefined maximum and minimum values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
68
Q

Data replication

A

The process of storing data in multiple locations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
69
Q

Data science

A

A field of study that uses raw data to create new ways of modeling and understanding the unknown

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
70
Q

Data security

A

Protecting data from unauthorized access or corruption by adopting safety measures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
71
Q

Data strategy

A

The management of the people, processes, and tools used in data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
72
Q

Data transfer

A

The process of copying data from a storage device to computer memory or from one computer to another

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
73
Q

Data type

A

An attribute that describes a piece of data based on its values, its programming language, or the operations it can perform

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
74
Q

Data validation

A

A tool for checking the accuracy and quality of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
75
Q

Data visualization

A

The graphical representation of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
76
Q

Data warehousing specialist

A

A professional who develops processes and procedures to effectively store and organize data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
77
Q

Database

A

A collection of data stored in a computer system

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
78
Q

Dataset

A

A collection of data that can be manipulated or analyzed as one unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
79
Q

DATEDIF

A

A spreadsheet function that calculates the number of days, months, or years between two dates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
80
Q

Delimiter

A

A character that indicates the beginning or end of a data item

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
81
Q

Descriptive metadata

A

Metadata that describes a piece of data and can be used to identify it at a later point in time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
82
Q

Digital photo

A

An electronic or computer-based image usually in BMP or JPG format

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
83
Q

Dirty data

A

Data that is incomplete, incorrect, or irrelevant to the problem to be solved

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
84
Q

Discrete data

A

Data that is counted and has a limited number of values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
85
Q

DISTINCT

A

A keyword that is added to a SQL SELECT statement to retrieve only non-duplicate entries

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
86
Q

Duplicate data

A

Any record that inadvertently shares data with another record

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
87
Q

Equation

A

A calculation that involves addition, subtraction, multiplication, or division

88
Q

Estimated response rate

A

The average number of people who typically complete a survey

89
Q

Ethics

A

Well-founded standards of right and wrong that prescribe what humans ought to do, usually in terms of rights, obligations, benefits to society, fairness, or specific virtues

90
Q

Experimenter bias

A

The tendency for different people to observe things differently

91
Q

External data

A

Data that lives, and is generated, outside of an organization

92
Q

Fairness

A

A quality of data analysis that does not create or reinforce bias

93
Q

Field

A

A single piece of information from a row or column of a spreadsheet; in a data table, typically a column in the table

94
Q

Field length

A

A tool for determining how many characters can be keyed into a spreadsheet field

95
Q

Fill handle

A

A box in the lower-right-hand corner of a selected spreadsheet cell that can be dragged through neighboring cells in order to continue an instruction

96
Q

Filtering

A

The process of showing only the data that meets a specified criteria while hiding the rest

97
Q

First-party data

A

Data collected by an individual or group using their own resources

98
Q

Float

A

A number that contains a decimal

99
Q

Foreign key

A

A field within a database table that is a primary key in another table

100
Q

Formula

A

A set of instructions used to perform a calculation using the data in a spreadsheet

101
Q

FROM

A

The section of a query that indicates from which table(s) to extract the data

102
Q

Function

A

A preset command that automatically performs a specified process or task using the data in a spreadsheet

103
Q

Gap analysis

A

A method for examining and evaluating the current state of a process in order to identify opportunities for improvement in the future

104
Q

General Data Protection Regulation of the European Union (GDPR)

A

Policy-making body in the European Union created to help protect people and their data

105
Q

Geolocation

A

The geographical location of a person or device by means of digital information

106
Q

Good data source

A

A data source that is reliable, original, comprehensive, current, and cited (ROCCC)

107
Q

Incomplete data

A

Data that is missing important fields

108
Q

Inconsistent data

A

Data that uses different formats to represent the same thing

109
Q

Incorrect/inaccurate data

A

Data that is complete but inaccurate

110
Q

Internal data

A

Data that lives within a company’s own systems

111
Q

Interpretation bias

A

The tendency to interpret ambiguous situations in a positive or negative way

112
Q

Leading question

A

A question that steers people toward a certain response

113
Q

LEFT

A

A function that returns a set number of characters from the left side of a text string

114
Q

LEN

A

A function that returns the length of a text string by counting the number of characters it contains

115
Q

Length

A

The number of characters in a text string

116
Q

Long data

A

A dataset in which each row is one time point per subject, so each subject has data in multiple rows

117
Q

Mandatory

A

A data value that cannot be left blank or empty

118
Q

Margin of error

A

The maximum amount that the sample results are expected to differ from those of the actual population

119
Q

Math expression

A

A calculation that involves addition, subtraction, multiplication, or division (also called an equation)

120
Q

Math function

A

A function that is used as part of a mathematical formula

121
Q

MAX

A

A function that returns the largest numeric value from a range of cells

122
Q

Measurable question

A

A question whose answers can be quantified and assessed

123
Q

Mentor

A

Someone who shares knowledge, skills, and experience to help another grow both professionally and personally

124
Q

Merger

A

An agreement that unites two organizations into a single new one

125
Q

Metadata

A

Data about data

126
Q

Metadata repository

A

A database created to store metadata

127
Q

Metric

A

A single, quantifiable type of data that is used for measurement

128
Q

Metric goal

A

A measurable goal set by a company and evaluated using metrics

129
Q

MID

A

A function that returns a segment from the middle of a text string

130
Q

MIN

A

A spreadsheet function that returns the smallest numeric value from a range of cells

131
Q

Naming conventions

A

Consistent guidelines that describe the content, creation date, and version of a file in its name

132
Q

Networking

A

Building relationships by meeting people both in person and online

133
Q

Nominal data

A

A type of qualitative data that is categorized without a set order

134
Q

Normalized database

A

A database in which only related data is stored in each table

135
Q

Notebook

A

An interactive, editable programming environment for creating data reports and showcasing data skills

136
Q

Null

A

An indication that a value does not exist in a dataset

137
Q

Observation

A

The attributes that describe a piece of data contained in a row of a table

138
Q

Observer bias

A

The tendency for different people to observe things differently (also called experimenter bias)

139
Q

Open data

A

Data that is available to the public

140
Q

Openness

A

The aspect of data ethics that promotes the free access, usage, and sharing of data

141
Q

Operator

A

A symbol that names the operation or calculation to be performed

142
Q

Order of operations

A

Using parentheses to group together spreadsheet values in order to clarify the order in which operations should be performed

143
Q

Ordinal data

A

Qualitative data with a set order or scale

144
Q

Outdated data

A

Any data that has been superseded by newer and more accurate information

145
Q

Ownership

A

The aspect of data ethics that presumes individuals own the raw data they provide and have primary control over its usage, processing, and sharing

146
Q

Pivot chart

A

A chart created from the fields in a pivot table

147
Q

Pivot table

A

A data summarization tool used to sort, reorganize, group, count, total, or average data

148
Q

Pixel

A

In digital imaging, a small area of illumination on a display screen that, when combined with other adjacent areas, forms a digital image

149
Q

Population

A

In data analytics, all possible data values in a dataset

150
Q

Primary key

A

An identifier in a database that references a column in which each value is unique

151
Q

Problem domain

A

The area of analysis that encompasses every activity affecting or affected by a problem

152
Q

Problem types

A

The various problems that data analysts encounter, including categorizing things, discovering connections, finding patterns, identifying themes, making predictions, and spotting something unusual

153
Q

Qualitative data

A

A subjective and explanatory measure of a quality or characteristic

154
Q

Quantitative data

A

A specific and objective measure, such as a number, quantity, or range

155
Q

Query

A

A request for data or information from a database

156
Q

Query language

A

A computer programming language used to communicate with a database

157
Q

Random sampling

A

A way of selecting a sample from a population so that every possible type of the sample has an equal chance of being chosen

158
Q

Range

A

A collection of two or more cells in a spreadsheet

159
Q

Record

A

A collection of related data in a data table, usually synonymous with row

160
Q

Redundancy

A

When the same piece of data is stored in two or more places

161
Q

Reframing

A

The process of restating a problem or challenge, then redirecting it toward a potential resolution

162
Q

Regular expression (RegEx)

A

A rule that says the values in a table must match a prescribed pattern

163
Q

Relational database

A

A database that contains a series of tables that can be connected to form relationships

164
Q

Relevant question

A

A question that has significance to the problem to be solved

165
Q

Remove duplicates

A

A spreadsheet tool that automatically searches for and eliminates duplicate entries from a spreadsheet

166
Q

Report

A

A static collection of data periodically given to stakeholders

167
Q

Return on investment (ROI)

A

A formula that uses the metrics of investment and profit to evaluate the success of an investment

168
Q

Revenue

A

The total amount of income generated by the sale of goods or services

169
Q

RIGHT

A

A function that returns a set number of characters from the right side of a text string

170
Q

Root cause

A

The reason why a problem occurs

171
Q

Sample

A

In data analytics, a segment of a population that is representative of the entire population

172
Q

Sampling bias

A

Overrepresenting or underrepresenting certain members of a population as a result of working with a sample that is not representative of the population as a whole

173
Q

Schema

A

A way of describing how something, such as data, is organized

174
Q

Scope of work (SOW)

A

An agreed-upon outline of the tasks to be performed during a project

175
Q

Second-party data

A

Data collected by a group directly from its audience and then sold

176
Q

SELECT

A

The section of a query that indicates from which column(s) to extract the data

177
Q

Small data

A

Small, specific data points typically involving a short period of time, which are useful for making day-to-day decisions

178
Q

SMART methodology

A

A tool for determining a question’s effectiveness based on whether it is specific, measurable, action-oriented, relevant, and time-bound

179
Q

Social media

A

Websites and applications through which users create and share content or participate in social networking

180
Q

Sorting

A

The process of arranging data into a meaningful order to make it easier to understand, analyze, and visualize

181
Q

Specific question

A

A question that is simple, significant, and focused on a single topic or a few closely related ideas

182
Q

Split

A

A function that divides text around a specified character and puts each fragment into a new, separate cell

183
Q

Sponsor

A

A professional advocate who is committed to moving forward the career of another

184
Q

Spreadsheet

A

A digital worksheet

185
Q

SQL

A

Structured Query Language

186
Q

Stakeholders

A

People who invest time and resources into a project and are interested in its outcome

187
Q

Statistical power

A

The probability that a test of significance will recognize an effect that is present

188
Q

Statistical significance

A

The probability that sample results are not due to random chance

189
Q

String data type

A

A sequence of characters and punctuation that contains textual information (also called text data type)

190
Q

Structural metadata

A

Metadata that indicates how a piece of data is organized and whether it is part of one or more than one data collection

191
Q

Structured data

A

Data organized in a certain format such as rows and columns

192
Q

Structured Query Language

A

A computer programming language used to communicate with a database

193
Q

Structured thinking

A

The process of recognizing the current problem or situation, organizing available information, revealing gaps and opportunities, and identifying options

194
Q

SUBSTR

A

A SQL function that extracts a substring from a string variable

195
Q

Substring

A

A subset of a text string

196
Q

SUM

A

A function that adds the values of a selected range of cells

197
Q

Syntax

A

The predetermined structure of a language that includes all required words, symbols, and punctuation, as well as their proper placement

198
Q

Technical mindset

A

The ability to break things down into smaller steps or pieces and work with them in an orderly and logical way

199
Q

Text data type

A

A sequence of characters and punctuation that contains textual information (also called string data type)

200
Q

Text string

A

A group of characters within a cell, most often composed of letters

201
Q

Third-party data

A

Data provided from outside sources who didn’t collect it directly

202
Q

Time-bound question

A

A question that specifies a timeframe to be studied

203
Q

Transaction transparency

A

The aspect of data ethics that presumes all data-processing activities and algorithms should be explainable and understood by the individual who provides the data

204
Q

TRIM

A

A function that removes leading, trailing, and repeated spaces in data

205
Q

Turnover rate

A

The rate at which employees voluntarily leave a company

206
Q

Typecasting

A

Converting data from one type to another

207
Q

Unbiased sampling

A

When the sample of the population being measured is representative of the population as a whole

208
Q

Unfair question

A

A question that makes assumptions or is difficult to answer honestly

209
Q

Unique

A

A value that can’t have a duplicate

210
Q

United States Census Bureau

A

An agency in the U.S. Department of Commerce that serves as the nation’s leading provider of quality data about its people and economy

211
Q

Unstructured data

A

Data that is not organized in any easily identifiable manner

212
Q

Validity

A

The degree to which the data conforms to constraints when it is input, collected, or created

213
Q

Video file

A

A collection of images, audio files, and other data usually encoded in a compressed format such as MP4, MV4, MOV, AVI, or FLV

214
Q

VLOOKUP

A

A spreadsheet function that vertically searches for a certain value in a column to return a corresponding piece of information

215
Q

WHERE

A

The section of a query that specifies criteria that the requested data must meet

216
Q

Wide data

A

A dataset in which every data subject has a single row with multiple columns to hold the values of various attributes of the subject

217
Q

World Health Organization

A

An organization whose primary role is to direct and coordinate international health within the United Nations system