Week 1 & 2 Flashcards
Why do we need to know data analysis?
There is a problem that needs to be solved and we need data and analytics to properly act on it
What is a population?
All entities of interest in a study
What is a sample?
A subset or portion of the populations that is randomly chosen
What is a dataset?
Table of data containing variables in the column section (horizontal), and observations in the row sections (vertical)
What are some examples of variable?
height, gender, income
What are some data types?
Numeric vs categorical; Ordinal vs nominal
What is numeric?
Meaningful arithmetic that can be performed on
What is categorical
otherwise, non numeric (not numbers (?))
What is ordinal?
There is a natural ordering of categories
What is nominal?
No natural ordering
What is a binary decision?
0/1 - a categorical variable with n different categories (n-1) (?)
What is binning or discretizing
Categorizing a numeric variable into discrete (not specific)
What are some more data types?
Discrete vs continuous; Cross sectional vs time series
What is discrete?
Count data (e.g. # of children)
What is continunous?
Continuous measurement like weight