Prepare Data for Exploration (Terms) Flashcards
Features such as password protection, user permissions, and encryption that are used to protect a spreadsheet
Access control
Metadata that indicates the technical source of a digital asset
Administrative metadata
A list of scheduled appointments
Agenda
Digitized audio storage usually in an MP3, AAC, or other compressed format
Audio file
A data source that is not reliable, original, comprehensive, current, and cited (ROCCC)
Bad data source
A conscious or subconscious preference in favor of or against a person, group of people, or thing
Bias
A data type with only two possible values, usually true or false
Boolean data
The tendency to search for or interpret information in a way that confirms pre-existing beliefs
Confirmation bias
The aspect of data ethics that presumes an individual’s right to know how and why their personal data will be used before agreeing to provide it
Consent
Data that is measured and can have almost any numeric value
Continuous data
A small file stored on a computer that contains information about its users
Cookie
A delimited text file that uses a comma to separate values
CSV (comma-separated values) file
The aspect of data ethics that presumes individuals should be aware of financial transactions resulting from the use of their personal data and the scale of those transactions
Currency
The process of protecting people’s private or sensitive data by eliminating identifying information
Data anonymization
When a preference in favor of or against a person, group of people, or thing systematically skews data analysis results in a certain direction
Data bias
A piece of information in a dataset
Data element
Well-founded standards of right and wrong that dictate how data is collected, shared, and used
Data ethics
A process for ensuring the formal management of a company’s data assets
Data governance
The ability to integrate data from multiple sources and a key factor leading to the successful use of open data among companies and governments
Data interoperability
A tool for organizing data elements and how they relate to one another
Data model
Preserving a data subject’s information any time a data transaction occurs
Data privacy
Protecting data from unauthorized access or corruption by adopting safety measures
Data security
An attribute that describes a piece of data based on its values, its programming language, or the operations it can perform
Data type
Metadata that describes a piece of data and can be used to identify it at a later point in time
Descriptive metadata
An electronic or computer-based image usually in BMP or JPG format
Digital photo
Data that is counted and has a limited number of values
Discrete data
Well-founded standards of right and wrong that prescribe what humans ought to do, usually in terms of rights, obligations, benefits to society, fairness, or specific virtues
Ethics
The tendency for different people to observe things differently (Refer to Observer bias)
Experimenter bias
Data that lives and is generated outside of an organization
External data
A single piece of information from a row or column of a spreadsheet; in a data table, typically a column in the table
Field
Data collected by an individual or group using their own resources
First-party data
A field within a database table that is a primary key in another table (Refer to primary key)
Foreign key
The section of a query that indicates where the selected data comes from
FROM
Policy-making body in the European Union created to help protect people and their data
General Data Protection Regulation of the European Union (GDPR)
The geographical location of a person or device by means of digital information
Geolocation
A data source that is reliable, original, comprehensive, current, and cited (ROCCC)
Good data source
Data that lives within a company’s own systems
Internal data
The tendency to interpret ambiguous situations in a positive or negative way
Interpretation bias
A dataset in which each row is one time point per subject, so each subject has data in multiple rows
Long data
Someone who shares knowledge, skills, and experience to help another grow both professionally and personally
Mentor
Data about data
Metadata
A database created to store metadata
Metadata repository
Consistent guidelines that describe the content, creation date, and version of a file in its name
Naming conventions
Building relationships by meeting people both in person and online
Networking
A type of qualitative data that is categorized without a set order
Nominal data
A database in which only related data is stored in each table
Normalized database
An interactive, editable programming environment for creating data reports and showcasing data skills
Notebook
The tendency for different people to observe things differently (also called experimenter bias)
Observer bias
The aspect of data ethics that promotes the free access, usage, and sharing of data
Openness
Qualitative data with a set order or scale
Ordinal data
The aspect of data ethics that presumes individuals own the raw data they provide and have primary control over its usage, processing, and sharing
Ownership
In digital imaging, a small area of illumination on a display screen that, when combined with other adjacent areas, forms a digital image
Pixel
In data analytics, all possible data values in a dataset
Population
An identifier in a database that references a column in which each value is unique (Refer to foreign key)
Primary key
A collection of related data in a data table, usually synonymous with row
Record
When the same piece of data is stored in two or more places
Redundancy
A database that contains a series of tables that can be connected to form relationships
Relational database
In data analytics, a segment of a population that is representative of the entire population
Sample
Overrepresenting or underrepresenting certain members of a population as a result of working with a sample that is not representative of the population as a whole
Sampling bias
A way of describing how something, such as data, is organized
Schema
Data collected by a group directly from its audience and then sold
Second-party data
The section of a query that indicates the subset of a dataset
SELECT
Websites and applications through which users create and share content or participate in social networking
Social media
A professional advocate who is committed to moving forward the career of another
Sponsor
A sequence of characters and punctuation that contains textual information (also called text data type)
String data type
Metadata that indicates how a piece of data is organized and whether it is part of one or more than one data collection
Structural metadata
Data organized in a certain format such as rows and columns
Structured data
A sequence of characters and punctuation that contains textual information (also called string data type)
Text data type
Data provided from outside sources who didn’t collect it directly
Third-party data
The aspect of data ethics that presumes all data-processing activities and algorithms should be explainable and understood by the individual who provides the data
Transaction transparency
When the sample of the population being measured is representative of the population as a whole
Unbiased sampling
An agency in the U.S. Department of Commerce that serves as the nation’s leading provider of quality data about its people and economy
United States Census Bureau
Data that is not organized in any easily identifiable manner
Unstructured data
A collection of images, audio files, and other data usually encoded in a compressed format such as MP4, MV4, MOV, AVI, or FLV
Video file
The section of a query that specifies criteria that the requested data must meet
WHERE
A dataset in which every data subject has a single row with multiple columns to hold the values of various attributes of the subject
Wide data
An organization whose primary role is to direct and coordinate international health within the United Nations system
World Health Organization