Prepare Data for Exploration (Terms) Flashcards

1
Q

Features such as password protection, user permissions, and encryption that are used to protect a spreadsheet

A

Access control

2
Q

Metadata that indicates the technical source of a digital asset

A

Administrative metadata

3
Q

A list of scheduled appointments

A

Agenda

4
Q

Digitized audio storage usually in an MP3, AAC, or other compressed format

A

Audio file

5
Q

A data source that is not reliable, original, comprehensive, current, and cited (ROCCC)

A

Bad data source

6
Q

A conscious or subconscious preference in favor of or against a person, group of people, or thing

A

Bias

7
Q

A data type with only two possible values, usually true or false

A

Boolean data

8
Q

The tendency to search for or interpret information in a way that confirms pre-existing beliefs

A

Confirmation bias

9
Q

The aspect of data ethics that presumes an individual’s right to know how and why their personal data will be used before agreeing to provide it

A

Consent

10
Q

Data that is measured and can have almost any numeric value

A

Continuous data

11
Q

A small file stored on a computer that contains information about its users

A

Cookie

12
Q

A delimited text file that uses a comma to separate values

A

CSV (comma-separated values) file
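
As a minimal sketch of what "delimited by commas" means in practice, the example below reads CSV text with Python's standard `csv` module. The column names and values are invented for illustration; a real file would normally be opened from disk instead of built from a string.

```python
import csv
import io

# Hypothetical CSV content; in practice this would come from a file on disk.
raw_text = "name,city,age\nAda,London,36\nLin,Taipei,29\n"

# DictReader treats the first line as the header and splits each row on commas.
rows = list(csv.DictReader(io.StringIO(raw_text)))

for row in rows:
    # Every value is read as a string, including the numeric-looking age.
    print(row["name"], row["city"], row["age"])
```

Note that the reader returns every value as text, so numeric columns need an explicit conversion before any arithmetic.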

13
Q

The aspect of data ethics that presumes individuals should be aware of financial transactions resulting from the use of their personal data and the scale of those transactions

A

Currency

14
Q

The process of protecting people’s private or sensitive data by eliminating identifying information

A

Data anonymization
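
One simple way to picture this is dropping the fields that identify a person directly. The sketch below assumes hypothetical records and a hypothetical list of identifying fields; real anonymization usually needs more care, because combinations of the remaining fields can sometimes still point to an individual.

```python
# Hypothetical records; the field names are invented for illustration.
records = [
    {"name": "Ada Smith", "email": "ada@example.com", "age": 36, "city": "London"},
    {"name": "Lin Chen", "email": "lin@example.com", "age": 29, "city": "Taipei"},
]

# Fields assumed, for this example only, to identify a person directly.
IDENTIFYING_FIELDS = {"name", "email"}


def anonymize(record):
    """Return a copy of the record without the identifying fields."""
    return {key: value for key, value in record.items() if key not in IDENTIFYING_FIELDS}


anonymized = [anonymize(record) for record in records]
print(anonymized)
```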

15
Q

When a preference in favor of or against a person, group of people, or thing systematically skews data analysis results in a certain direction

A

Data bias

16
Q

A piece of information in a dataset

A

Data element

17
Q

Well-founded standards of right and wrong that dictate how data is collected, shared, and used

A

Data ethics

18
Q

A process for ensuring the formal management of a company’s data assets

A

Data governance

19
Q

The ability to integrate data from multiple sources and a key factor leading to the successful use of open data among companies and governments

A

Data interoperability

20
Q

A tool for organizing data elements and how they relate to one another

A

Data model

21
Q

Preserving a data subject’s information any time a data transaction occurs

A

Data privacy

22
Q

Protecting data from unauthorized access or corruption by adopting safety measures

A

Data security

23
Q

An attribute that describes a piece of data based on its values, its programming language, or the operations it can perform

A

Data type

24
Q

Metadata that describes a piece of data and can be used to identify it at a later point in time

A

Descriptive metadata

25
Q

An electronic or computer-based image usually in BMP or JPG format

A

Digital photo

26
Q

Data that is counted and has a limited number of values

A

Discrete data

27
Q

Well-founded standards of right and wrong that prescribe what humans ought to do, usually in terms of rights, obligations, benefits to society, fairness, or specific virtues

A

Ethics

28
Q

The tendency for different people to observe things differently (Refer to observer bias)

A

Experimenter bias

29
Q

Data that lives and is generated outside of an organization

A

External data

30
Q

A single piece of information from a row or column of a spreadsheet; in a data table, typically a column in the table

A

Field

31
Q

Data collected by an individual or group using their own resources

A

First-party data

32
Q

A field within a database table that is a primary key in another table (Refer to primary key)

A

Foreign key

33
Q

The section of a query that indicates where the selected data comes from

A

FROM

34
Q

A regulation of the European Union created to help protect people and their data

A

General Data Protection Regulation of the European Union (GDPR)

35
Q

The geographical location of a person or device by means of digital information

A

Geolocation

36
Q

A data source that is reliable, original, comprehensive, current, and cited (ROCCC)

A

Good data source

37
Q

Data that lives within a company’s own systems

A

Internal data

38
Q

The tendency to interpret ambiguous situations in a positive or negative way

A

Interpretation bias

39
Q

A dataset in which each row is one time point per subject, so each subject has data in multiple rows

A

Long data

40
Q

Someone who shares knowledge, skills, and experience to help another grow both professionally and personally

A

Mentor

41
Q

Data about data

A

Metadata

42
Q

A database created to store metadata

A

Metadata repository

43
Q

Consistent guidelines that describe the content, creation date, and version of a file in its name

A

Naming conventions

44
Q

Building relationships by meeting people both in person and online

A

Networking

45
Q

A type of qualitative data that is categorized without a set order

A

Nominal data

46
Q

A database in which only related data is stored in each table

A

Normalized database

47
Q

An interactive, editable programming environment for creating data reports and showcasing data skills

A

Notebook

48
Q

The tendency for different people to observe things differently (also called experimenter bias)

A

Observer bias

49
Q

The aspect of data ethics that promotes the free access, usage, and sharing of data

A

Openness

50
Q

Qualitative data with a set order or scale

A

Ordinal data

51
Q

The aspect of data ethics that presumes individuals own the raw data they provide and have primary control over its usage, processing, and sharing

A

Ownership

52
Q

In digital imaging, a small area of illumination on a display screen that, when combined with other adjacent areas, forms a digital image

A

Pixel

53
Q

In data analytics, all possible data values in a dataset

A

Population

54
Q

An identifier in a database that references a column in which each value is unique (Refer to foreign key)

A

Primary key
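
To make the link between a primary key and a foreign key concrete, here is a small sketch using Python's built-in `sqlite3` module. The `customers` and `orders` tables, their columns, and the rows are all hypothetical: `customer_id` is the primary key of `customers` and appears in `orders` as a foreign key.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# customer_id is the primary key: each value identifies exactly one customer.
conn.execute("CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT)")

# orders.customer_id is a foreign key pointing at the customers table.
conn.execute(
    "CREATE TABLE orders ("
    "order_id INTEGER PRIMARY KEY, "
    "customer_id INTEGER REFERENCES customers(customer_id), "
    "item TEXT)"
)

conn.executemany("INSERT INTO customers VALUES (?, ?)", [(1, "Ada"), (2, "Lin")])
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(10, 1, "keyboard"), (11, 1, "mouse"), (12, 2, "monitor")],
)

# The shared key lets the two tables be connected in a single query.
joined = conn.execute(
    "SELECT orders.order_id, customers.name, orders.item "
    "FROM orders JOIN customers ON orders.customer_id = customers.customer_id "
    "ORDER BY orders.order_id"
).fetchall()
print(joined)
conn.close()
```

Each customer is stored once, and any number of orders can refer back to that customer through the key.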

55
Q

A collection of related data in a data table, usually synonymous with row

A

Record

56
Q

When the same piece of data is stored in two or more places

A

Redundancy

57
Q

A database that contains a series of tables that can be connected to form relationships

A

Relational database

58
Q

In data analytics, a segment of a population that is representative of the entire population

A

Sample

59
Q

Overrepresenting or underrepresenting certain members of a population as a result of working with a sample that is not representative of the population as a whole

A

Sampling bias

60
Q

A way of describing how something, such as data, is organized

A

Schema

61
Q

Data collected by a group directly from its audience and then sold

A

Second-party data

62
Q

The section of a query that indicates the subset of a dataset to return, typically a list of columns

A

SELECT

63
Q

Websites and applications through which users create and share content or participate in social networking

A

Social media

64
Q

A professional advocate who is committed to moving forward the career of another

A

Sponsor

65
Q

A sequence of characters and punctuation that contains textual information (also called text data type)

A

String data type

66
Q

Metadata that indicates how a piece of data is organized and whether it is part of one or more than one data collection

A

Structural metadata

67
Q

Data organized in a certain format such as rows and columns

A

Structured data

68
Q

A sequence of characters and punctuation that contains textual information (also called string data type)

A

Text data type

69
Q

Data provided by outside sources that didn’t collect it directly

A

Third-party data

70
Q

The aspect of data ethics that presumes all data-processing activities and algorithms should be explainable and understood by the individual who provides the data

A

Transaction transparency

71
Q

When the sample of the population being measured is representative of the population as a whole

A

Unbiased sampling
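
Simple random sampling is one common way to aim for a representative sample, since every member of the population has the same chance of being picked. The sketch below assumes a hypothetical population of numbered member IDs; random selection lowers the risk of sampling bias but does not by itself guarantee a representative result.

```python
import random

# Hypothetical population of 1,000 member IDs.
population = list(range(1, 1001))

# A fixed seed makes the draw repeatable; it is only here for the example.
rng = random.Random(42)

# Simple random sampling without replacement: each member is equally likely.
sample = rng.sample(population, k=50)

print(len(sample), min(sample), max(sample))
```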

72
Q

An agency in the U.S. Department of Commerce that serves as the nation’s leading provider of quality data about its people and economy

A

United States Census Bureau

73
Q

Data that is not organized in any easily identifiable manner

A

Unstructured data

74
Q

A collection of images, audio files, and other data usually encoded in a compressed format such as MP4, M4V, MOV, AVI, or FLV

A

Video file

75
Q

The section of a query that specifies criteria that the requested data must meet

A

WHERE
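
The three query sections in this deck (SELECT, FROM, and WHERE) can be seen working together in the sketch below, which uses Python's built-in `sqlite3` module. The `customers` table and its rows are hypothetical, and the `ORDER BY` line is added only to make the output order predictable.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (name TEXT, city TEXT, age INTEGER)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?, ?)",
    [("Ada", "London", 36), ("Lin", "Taipei", 29), ("Sam", "London", 41)],
)

query = """
SELECT name, age          -- which columns to return
FROM customers            -- which table the data comes from
WHERE city = 'London'     -- criteria each returned row must meet
ORDER BY name             -- only here to fix the output order
"""
result = conn.execute(query).fetchall()
print(result)
conn.close()
```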

76
Q

A dataset in which every data subject has a single row with multiple columns to hold the values of various attributes of the subject

A

Wide data
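
The difference between long data and wide data is easiest to see by reshaping one into the other. The sketch below starts from hypothetical long data (one row per subject per year) and builds the wide form (one row per subject, one column per year) using plain Python; the subjects, years, and scores are invented.

```python
# Hypothetical long data: one row per subject per time point.
long_rows = [
    {"subject": "A", "year": 2020, "score": 71},
    {"subject": "A", "year": 2021, "score": 75},
    {"subject": "B", "year": 2020, "score": 64},
    {"subject": "B", "year": 2021, "score": 69},
]

# Build wide data: one row per subject, one score column per year.
wide = {}
for row in long_rows:
    subject_row = wide.setdefault(row["subject"], {"subject": row["subject"]})
    subject_row[f"score_{row['year']}"] = row["score"]

wide_rows = list(wide.values())
print(wide_rows)
```

Four long rows collapse into two wide rows, each carrying a `score_2020` and a `score_2021` column.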

77
Q

An organization whose primary role is to direct and coordinate international health within the United Nations system

A

World Health Organization