Unit 9 ✔️ Flashcards
Citizen Science
scientific research conducted in whole or in part by distributed individuals, many of whom may not be scientists, who contribute relevant data to research using their own computing devices.
Cleaning Data
a process that makes the data uniform without changing its meaning (e.g., replacing all equivalent abbreviations, spellings, and capitalizations with the same word).
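Example (a minimal sketch, not a standard recipe): one way to clean a column of state names is to map every known variant to a single spelling. The `CANONICAL` table, the `clean_label` helper, and the sample values are all made up for illustration.

```python
# Hypothetical lookup table: every known variant maps to one canonical spelling.
CANONICAL = {
    "ca": "California",
    "calif.": "California",
    "california": "California",
    "ny": "New York",
    "new york": "New York",
}


def clean_label(raw: str) -> str:
    """Return the canonical spelling for a raw label, if one is known."""
    # Trim whitespace and ignore capitalization before looking up the variant.
    key = raw.strip().lower()
    # Unknown labels are kept (only trimmed) so their meaning is not changed.
    return CANONICAL.get(key, raw.strip())


# Made-up sample data with mixed abbreviations, spacing, and capitalization.
raw_states = ["CA", "calif.", " California ", "NY", "new york", "Texas"]
cleaned = [clean_label(s) for s in raw_states]
print(cleaned)
```

After cleaning, the three California variants and the two New York variants each collapse to one spelling, while "Texas" passes through unchanged because the table has no entry for it.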
Correlation
a relationship between two pieces of data, typically referring to the degree to which one changes along with the other.
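Example (a sketch under assumptions): the definition does not name a specific measure, but one common way to put a number on correlation is the Pearson coefficient, which ranges from -1 to 1. The `pearson` function name and the sample numbers below are illustrative only.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length number lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # How the two variables move together around their means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # How much each variable spreads around its own mean.
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Made-up data: scores rise steadily with hours studied.
hours_studied = [1, 2, 3, 4]
quiz_scores = [2, 4, 6, 8]
r = pearson(hours_studied, quiz_scores)
print(r)
```

A value near 1 means the two lists rise together, a value near -1 means one falls as the other rises, and a value near 0 means there is little linear relationship.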
Crowdsourcing
the practice of obtaining input or information from a large number of people via the Internet.
Information
the collection of facts and patterns extracted from data.
Data Bias
a condition in which data does not accurately reflect the full population or phenomenon being studied.
Data Filtering
choosing a smaller subset of a data set to use for analysis, for example by eliminating or keeping only certain rows in a table.
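Example (a minimal sketch): filtering can be as simple as keeping only the rows that pass a condition. The table below is invented sample data, and the column names `city`, `year`, and `rainfall_mm` are assumptions made for illustration.

```python
# Made-up table: each dictionary is one row.
rows = [
    {"city": "Austin", "year": 2021, "rainfall_mm": 870},
    {"city": "Austin", "year": 2022, "rainfall_mm": 640},
    {"city": "Denver", "year": 2021, "rainfall_mm": 380},
    {"city": "Denver", "year": 2022, "rainfall_mm": 410},
]

# Keep only the rows from 2022; every other row is left out of the subset.
rows_2022 = [row for row in rows if row["year"] == 2022]

# The original table is unchanged; the filtered subset is a new, smaller list.
print(len(rows), len(rows_2022))
for row in rows_2022:
    print(row["city"], row["rainfall_mm"])
```

The same pattern works for eliminating rows: flip the condition (for example `row["year"] != 2022`) to drop the matching rows instead of keeping them.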
Metadata
a set of data that describes and gives information about other data.