intro to statistics and data Flashcards
population
defined collection of units interested in studying
branches of statistics and probability
- descriptive statistics
- probability statistics
- inferential statistics
descriptive statistics
- describes data
ex) sample of 2500
15% of dog owners in our sample walk their twice a day
probability statistics
methods to use known properties of a population to draw a conclusion about a sample
inferential statistics
- make predictions or generalizations
- > estimate parameters
- > hypothesis tests
ex) sample of 2500
300,000 people walk their dog twice a day
Statistical Process
- collection
- organization
- presentation
- analysis
census
when desired info is gathered for all units in the population
sample
subset of units from which we collect data
variable
any characteristic that can be different values for each unit in a population
categorical (qualitative) variable
places an individual into one of several groups or categories (ex, gender or race)
nominal
-> named variables
Nominal data is a categorical (qualitative) data type, so it describes qualitative characteristics or groups, with no order or rank between categories
(qualitative data)
examples)
Gender, ethnicity, eye colour, blood type
Brand of refrigerator/motor vehicle/television owned
Political candidate/party preference, shampoo preference, favourite meal
ordinal
-> named + ordered variables
Ordinal data is the same as nominal data in that it’s looking at categories, but unlike nominal data, there is also a meaningful order or rank between the options.
An ordinal scale is one where the order matters but not the difference between values
(qualitative data)
examples)
Socio economic level (e.g. low income, middle income, high income)
Income level (“less than 50K”, “50K-100K”, “over 100K”)
Political orientation (e.g. far left, left, centre, right, far right)
Level of agreement (e.g. strongly disagree, disagree, neutral, agree, strongly agree)
quantitative (numerical) variable
is a characteristic that can be counted (ex, number of people in house) or measured (ex, height or weight)
quantitative data can be an interval or ratio
interval
-> named + ordered + proportionate interval between variables (ex, temperature, celcius, ferinheit) (difference, subtracting) (no true 0 starting point)
(quantitative data)
ratio
-> named + ordered + proportionate interval between variables + has a true “0 starting point” (ex, height) (fractions, dividing) (temperature, kevlins)
(quantitative data)