T-Tests and Nonparametrics Flashcards
what are the parametric statistics?
t-tests
one sample t-tests
paired sample t-tests
2 independent sample t-tests
what is a paired sample t-test?
comparing pre and post data
what is a 2 independent sample t-test?
2 unrelated samples compared
what are the nonparametric statistics?
wilcoxon signed tank test
mann whitney U test
what are the parametric statistics assumptions?
it is continuous data (DV)-interval and ratio variables
n is greater than or equal to 30/normally distributed data
homogeneity of variance (homoscedasticity)
what is homoscedasticity?
the variability of group is similar (not applicable when there is only one sample)
when is nonparametric testing used?
when the DV is measured on a continuous scale but n<30/data is not normally distributed
when DV’s level measurement is categorical (ordinal and nominal)
is the assumption of normality violated in parametric or nonparametric testing?
nonparametric testing
t/f: you can’t assume normal distribution with nonparametric testing
true
what is a t distribution?
a symmetrical curve about the mean
shape of the curve depends on the degrees freedom (df)
what is the degrees freedom?
n-1
why use a t distribution instead of a z distribution?
bc using z scores tends to be inaccurate w/small sample sizes
as the t distribution nears 30, does the curve become more or less like a z distribution?
more
t/f: if you can’t assume n=30, you can’t use z
true
what kind of variables are used in t tests?
continuous variables (interval and ratio)
what are the types of t tests?
one sample
two samples (independent and dependent)
what are the assumptions for t tests?
samples were drawn from a normally distributed population
samples are random
homogeneity of variance (only for independent samples t test)
continuous variables
what is homogeneity of variance?
similarity in variance
what are independent samples?
data points that have no relation
ie. one person’s knee angle doesn’t affect another person’s knee angle
do parametric tests use mean or median?
mean
do nonparametric tests use mean or median?
median
what does a one sample t test do?
compares one population mean to a hypothesized value (mu0)
what is the assumption of a one sample t test?
the population is at least approximately normally distributed
OR
the sample size is big enough (typically n is greater than or equal to 30)
what are the 3 possible hypotheses for a one sample t test?
1) H0: mu=mu0; Ha: mu is not equal to mu0 (2-sided)
2) H0: mu is less than or equal to mu0; Ha: mu>mu0 (1-sided)
3) H0: mu is greater than or equal to mu0; Ha: mu<mu0 (1-sided)
what are the different ways of testing the assumptions of normality?
graphical interpretation
numerical values (skewness and kurtosis, statistic tests)
what are the types of statistical tests for assessing the assumption of normality?
Shapiro Wik (most used)
Kolmogorov-Smirnov
Anderson-Darling
what is the graphical interpretation for normality testing?
comparing the histogram of the sample data to a normal probability curve
what is the advantage of the graphical interpretation?
normality testing can be overly sensitive, but this will still work
what is the disadvantage of graphical interpretation?
it can be subjective
what is the advantage of the numerical values to test normality?
it is objective
what is the disadvantage of the numerical values to test normality?
it is sometimes not going to be sensitive enough w/a smaller sample size
what is a widely used test for normality using numerical values?
the Shapiro-Wilk (SW) test
is the SW test a one or two sample testing procedure?
one sample testing procedure
what are the hypothesis of the SW test?
H0: the data are normally distributed (distribution=normal)
Ha: the data are not normally distributed (distribution is not equal to normal)
do you have to reject or fail to reject (accept) H0 to use parametrics?
you have to fail to reject H0 to use parametrics
why does H0 have to be accepted to use parametrics?
bc this means that it is a normal distribution
if p>alpha, do we reject or fail to reject H0? why?
fail to reject H0 bc this means the change is not significant
are the hypothesis for SW testing one or two sided?
two sided bc there is not directionality
with the SW test, if p>alpha, is the data normal or not normal? is parametric testing ok?
the data is normal and you can use parametric testing
with the SW test, if p is less than or equal to alpha, is the data normal or not normal? is parametric testing ok?
the data is not normal, so parametrics should not be used
what kind of variable is IQ?
ratio
we are trying to see if LA residents have a dif IQ than the national average of 100 (alpha=0.05), and 7 LA residents are randomly sampled with the following data: 107, 96, 97, 92, 95, 109, 101.
what is the n?
what is H0?
what is Ha?
are the hypothesis one or two sided?
n=7
H0: mu=100
Ha: mu is not equal to 100
these are 2-sided hypotheses
if we were to hypothesize that residents of LA were dumber than the national IQ of 100, what would the hypotheses be? would these be one or two sided?
H0: mu is greater than or equal to 100
Ha: mu < 100
these are one sided hypotheses
if a SW test has a p value of .433 do we reject or fail to reject H0?
fail to reject because p is greater than .05 (alpha)
if we have t=-.18, df=6, and p=.87, with the LA residents example, what would a conclusion be?
the mean IQ of LA residents isn’t significantly different from the national average (t=-.18, df=6, p=.87)
how do we report confidence intervals (CI)?
(% CI: lower values, upper value)
if we have a CI of 95%, how would we report this?
(95% CI: 93.68, 105.47)
if the LOC is 95%, what is the probability of error (alpha)?
5% (0.05)
if the LOC is 99%, what is the probability of error (alpha)?
1% (0.01)
if H0 is mu=100 and Ha is mu is not equal to 100, and we have a CI of (93.68, 105.47) do we reject or fail to reject H0?
we fail to reject H0 because 100 is contained in the range
if H0 is mu=100 and Ha is mu is not equal to 100, and we have a CI of (90,99) do we reject or fail to reject H0?
reject H0 bc 100 is not in this range
in the nonparametric approach, do we use the mean or median?
median
do we typically make assumptions about the distribution with the nonparametric approach?
no
if the assumptions of normality fail and we incorrectly use parametric testing instead of nonparametric testing, what happens?
inaccurate p value
greater type 1 and 2 errors than stated (alpha is no longer held at 0.05)
our decision (accept or reject H0) is more likely to be wrong
you have 12 pediatric ER patients with ulnar fractures that you ask to rate their pain on a VAS of 0-10. determine if the center of the pain score for this population is different from 5:
data: 6, 10, 9, 4, 10, 3, 5, 7, 9, 10, 3, 9
what is H0?
what is Ha?
are the hypothesis one or two sided?
H0: m=5
Ha: m is equal to 5
these are two sided hypothesis
what type of variable is a pain scale?
ordinal
if p<0.05, do we reject or fail to reject H0?
we reject H0 bc the distribution is not normal
what are the assumptions of the Wilcoxen Signed Rank test?
the samples were selected independently and at random from their respective populations
the data used is not a normal distribution
with the Wilcoxen Signed Rank test, do we use the mean or median?
median bc it is nonparametric
what are the hypothesis for Wilcoxen Signed Rank test?
H0: m=m0
Ha: m is not equal to m0
H0: m is less than or equal to m0
Ha: m>m0
H0: m is greater than or equal to m0
Ha: m<m0