RM and stats defs Flashcards
raw numbers
unrefined data (refined to make descriptive statistics)
central tendency
averages: mean (midpoint), median (midpoint), mode (typical)
dispersion
how spread out around average: range, standard deviation
when 2 number sets have the same average but different level of variation.
- shows us how focused/ representative the average is
CT of Mean
- easily distorted by outliers
- can give silly answers for binary stats, e.g, average person has 1.78… legs bc some only have 1
use: usually/default, unless many outliers
CT of Median
- not distorted like mean as always lies in middle
- non holistic view
use: when too many outliers for mean
CT of Mode
- not as often used - does not describe the middle
use: when collecting frequencies, non numerical data
CT of Range
- clear description of diversion
- v vulnerable to outliers g.g, extreme smallest/largest while majority are close
CT of Standard Deviation
(average deviation from average/ct)
SD=root(total deviation^2/N–1)
how to calculate standard deviation
1) calculate deviation of each stat from mean (stat–mean)
2) square each deviation and find total
3) divide total by number of scores–1 (deg.s of freedom)
4) square root answer
Variance
standard deviation^2
used to also measure distribution of results
1) calculate deviation of each stat from mean (stat–mean)
2) square each deviation and find total
3) divide total by number of scores–1 (deg.s of freedom)
standard error of the mean
used to test how representative the mean is (uncertainty)
- number of people tested
- how consistent they are (similar)
- low value is good
types of data
Categorical - “Nominal”- categories with no intrinsic ranking e.g, female/male, names
Discrete - “Ordinal” - with intrinsic ranking - e.g, rating scales
Continuous - “Scale” - (number scale - interval (integers) or ratio (decimals) - scalar variable where axis represents a metric, e.g, height, test scores
central limit theory
theory that the averages (means) of data collected from samples will form a normal distribution curve around the original mean of the results
p value
if >0.05 results oppose research hypothesis enough to consider null hypothesis is correct
one vs two tailed hypothesis
one tailed: predicts that the independent variable will have an effect on the dependent variable, but the direction of the effect is not specified.
2 tailed: predicts either +ve or -ve impact of iv on dv