Lec 5 Flashcards
parametric statistics are used to analyze ___ data
quantitative
non-parametric statistics are used to analyze ___ data
qualitative
when is non-parametric stats used?
when assumptions are violated
for ordinal and nominal data
T/F: t-tests and ANOVAs are type of regression analyses.
T
a linear regression shows a significant relationship if the slope is
0
what are the 5 main parametric assumptions?
- interval/ratio data
- normality
- homogeneity of variance
- free of extreme outliers
- independence of observations
how is the normality assumption assessed?
check histograms
skewness/kurtosis must be <2 or >-2
Shapiro-Wilk test >0.05
what does the Shapiro-Wilk test measure?
normality
if there is a difference b/w sample and normal distribution
normality is a concern for sample sizes smaller than
30
*unless population normally distributed
what does homogeneity of variance (HOV) mean?
the variance of the outcomes variable should be about the same in each group
how is HOV assessed?
Levene’s Test
what is a good Levene’s test?
if the alpha is >0.05
this means that there is no significant difference in variance between groups
how are influential outliers assessed?
histograms
skewness/kurtosis
boxplots
Cook’s Distance (regression)
what is a good outcome for Cook’s Distance?
<1 = no influential outliers
how is independence of observation assessed?
score must not follow a pattern
1 participant’s score can’t influence other
if a study uses 1 participant’s involved leg and uninvolved leg as 2 separate groups, what assumption is violated?
independence of observation
what are the 3 main regression assumptions?
linearity
homoscedasticity
outlier testing in regression
what is homoscedasticity?
the variance of an outcome variable of a relationship/regression test is evenly distributed; it is the same at all levels of the predictor value
how is linearity assessed?
scatterplot
data points are in a linear pattern
what is the residual of a regression test?
the distance between the data point and the line of best fit
T/F: you want the data to be curvilinear in a regression test.
F
want it to be linear
how is homoscedasticity assessed?
scatterplot
data points should be evenly distrusted around the line of best fit
all residuals should be similar
what is the z score equivalent for regression?
standardized residuals
what are some solutions if an assumption is violated?
- trim the data (sketchy)
- Windsorizing (sketchy)
- transform the data (sketchy)
- analyze bootstrapping in SPSS
- use non-parametric stats