Inferential Statistics Flashcards
Parametric assumptions
Normality
Homogeneity of Variance
Independence of observations (not relatied onless repeated measures)
Normality
sampling distribution of the mean
DV like working memory
take a group and get a sample mean
each time you get a sample you have a distrubuation with a mean.
if you did this thousands of times, all the means would appear normal
test of normality
Kolmogorov-Smirnov test and Shapiro-wilk test
statisticians say this is a waste of time. but you shouldl look at your data to make sure there are no major differences in normality.
(skewness or kurtosis)
or just plot your data. Histogram or QQ plot
Histogram
frequency of scores on Y, individual scores on X
you can see most frequent occuring score.
QQ plot
quantile is a type of percentile- how much data are included in this value.
how much data are in the different regions (you want it to be a straight line, circles are data points, 16% here and here and you will need less in the middle.
same info from histogram but its it on the line or not.
Homogeneity of Variance
homoscedasticity
do the different groups have the same variance
most tests are robust to violations as long as the grop sizes are equal. (levenes)
arcsine and rau
arcsine - uses radians
rau - more math - but becomes percent correct
could they have done worse than 0% if ranking extended that far? could they have performed the 100% better than someone else.
positive skew
tail is dragged out in positive direction
welch correction
for unequal variances in two-sample t-tests
default in r
non-parametric
doesn’t assume normality
logistic regression
powerful
using all data, not means
independent observations
two SNR, if its independent T test then only one person can contribute to each
if same person did do both you can run repeated measures analysis
when one person performs two different conditions then there is something related
can you predict snr .1 from snr 2? yes
group comparisons - t test
why use t test?
z is normal gaussian curve (mean 0 SD 1)
t is shifted slightly, not standard gaussian and we use because its better for small sample sizes when we dont actually know the true population mean and SD
(humans are not something we typically know the underlying distribution)
t test comparing signal to noise
what is the effect, divided by the random variation????
single sample
one group vs baseline