General Definitions Flashcards
Define data science
Gathering and manipulating data
“translating” data into presentable forms
Extracting insights from data
Define data
quantifiable
Raw information prior to analysis
large collections/sets of strings, ints, floats, etc.
What do data scientists do?
case studies
deduce meaning from data
predictions, recommendations
Define unstructured data
Requires preprocessing prior to interpretation.
Otherwise not directly conducive to analysis
Define structured data
tabular data (the focus of this class)
Define big data
high-dimensional data
(excessively) large amounts of data
What are some challenges with big data
Sheer scale - time
difficulty in visualization
can be noisy, untrustworthy, or otherwise problematic.
Define little data
low-dimensional data
small number of datapoints
What are some challenges with little data
May identify spurious patterns or trends
insufficient size for reliable machine learning/analysis
Define numerical data
Data represented by numbers
Define categorical data
Data represented by categories