Lecture 4: Analysis of Continuous and Categorical Variables Flashcards
Descriptive vs Inferential Statistics
descriptive: describing the central tendency and dispersion of data
inferential: using sample data to draw conclusions about the population that the sample is meant to represent (sampling will naturally involve error)
o Estimate parameters and test hypotheses to make inferences about the population
o Compare means and evaluate relationships
o Test statistics, p-values, confidence intervals
Which test do I apply if I have 2 related samples and parametric data?
paired t-test
Which test do I apply if I have 2 related samples and non-parametric data?
Wilcoxon signed-rank test
Which test do I apply if I have 2 independent samples and parametric data?
Independent t-test
Which test do I apply if I have 2 independent samples and non-parametric data?
Mann-Whitney U test
Which test do I apply if I have 3 or more groups and parametric data?
ANOVA
Which test do I apply if I have 3 or more groups and non-parametric data?
Kruskal-Wallis test
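The six test-selection cards above can be summarised as a small lookup. This is only a study-aid sketch: the names `TESTS` and `choose_test` and the design labels are made up for illustration and do not come from the lecture.

```python
# Lookup table mirroring the six flashcards above.
# Key: (study design, whether the data are parametric).
TESTS = {
    ("2 related samples", True): "paired t-test",
    ("2 related samples", False): "Wilcoxon test",
    ("2 independent samples", True): "independent t-test",
    ("2 independent samples", False): "Mann-Whitney U test",
    ("3 or more groups", True): "ANOVA",
    ("3 or more groups", False): "Kruskal-Wallis test",
}


def choose_test(design: str, parametric: bool) -> str:
    """Return the test named on the flashcard for this design and data type."""
    return TESTS[(design, parametric)]


print(choose_test("2 independent samples", parametric=False))
```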
What is Student's t-test?
Used to compare means between two groups
o Related groups: paired t-test (e.g. pre- and post- study measures on the same participants)
o Independent groups: unpaired/independent t-test
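As a rough illustration of the related-groups case, a paired t-test can be computed as a one-sample test on the within-participant differences. The function name `paired_t` and the pre/post numbers are invented for this sketch.

```python
import math
import statistics


def paired_t(before, after):
    """Return (t, df) for a paired t-test computed on the differences."""
    diffs = [a - b for a, b in zip(after, before)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    # Standard error of the mean difference: sample SD / sqrt(n)
    se = statistics.stdev(diffs) / math.sqrt(n)
    return mean_diff / se, n - 1


# Hypothetical pre- and post-study measures on the same four participants
pre = [10.0, 12.0, 9.0, 11.0]
post = [12.0, 14.0, 10.0, 13.0]
t, df = paired_t(pre, post)
print(t, df)
```

For independent groups the two samples are not matched, so the comparison uses the two group means and a standard error built from both group variances instead.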
What is the null hypothesis in Student's t-test?
The population means of the two groups are equal (any difference between the sample means is due to sampling error)
What are the degrees of freedom in Student's t-test?
The amount of information provided by the data that can be used to estimate population parameters and the variability of the estimates
o df = n - # of estimated parameters
o As df increase, t-distribution more closely resembles a normal distribution
• E.g. One-sample t-test to estimate the population mean
o Estimates the standard deviation about the mean
o Uses a t-distribution with df = n - 1
o df = n - 1 for the paired t-test as well (n = number of pairs)
• E.g. Two-sample independent t-test to compare two means
o Uses a t-distribution with df = n1 + n2 – 2
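The degrees-of-freedom rules above reduce to simple arithmetic. The helper names below are hypothetical and only restate the formulas from this card.

```python
def df_one_sample(n: int) -> int:
    """One-sample t-test: one parameter (the mean) is estimated."""
    return n - 1


def df_paired(n_pairs: int) -> int:
    """Paired t-test: a one-sample test on the n pairwise differences."""
    return n_pairs - 1


def df_two_sample(n1: int, n2: int) -> int:
    """Two-sample independent t-test: two group means are estimated."""
    return n1 + n2 - 2


print(df_one_sample(20), df_paired(20), df_two_sample(20, 25))
```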
What are the assumptions of the t-test?
• Samples are independent (for the paired t-test, the pairs are independent of one another)
• Variable is normally distributed
• Homogeneity of variance: the variance within each group is equal
o Levene’s test for equality of variances in SPSS (automatically conducted)
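o Informs you whether to use the results for pooled variance (Levene's test non-significant, equal variances assumed) or unpooled variance (Levene's test significant, equal variances not assumed)
• t-tests are fairly robust even if the assumptions are not perfectly met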
What is the t-statistic?
Difference between the means divided by the pooled or unpooled standard error of the difference between the means
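A minimal sketch of that calculation for two independent groups, using only the Python standard library. The function name `t_statistic` and the sample values are assumptions for illustration; the unpooled version corresponds to what is usually called Welch's t-test.

```python
import math
import statistics


def t_statistic(a, b, pooled=True):
    """Difference between group means divided by its standard error."""
    n1, n2 = len(a), len(b)
    v1, v2 = statistics.variance(a), statistics.variance(b)  # sample variances
    if pooled:
        # Pooled variance: weighted average of the two group variances
        sp2 = ((n1 - 1) * v1 + (n2 - 1) * v2) / (n1 + n2 - 2)
        se = math.sqrt(sp2 * (1 / n1 + 1 / n2))
    else:
        # Unpooled: each group contributes its own variance
        se = math.sqrt(v1 / n1 + v2 / n2)
    return (statistics.mean(a) - statistics.mean(b)) / se


# Hypothetical measurements for two independent groups
group_a = [5.1, 4.9, 5.6, 5.8, 6.0]
group_b = [4.2, 4.8, 4.5, 5.0, 4.4, 4.1]
print(t_statistic(group_a, group_b, pooled=True))
print(t_statistic(group_a, group_b, pooled=False))
```

When the two groups have the same size, the pooled and unpooled standard errors coincide, so the two t-statistics are identical; they differ only when both the sample sizes and the variances differ.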
What is a confidence interval?
A measure of uncertainty: the range around the sample statistic within which the corresponding population parameter is likely to lie
True/False
The larger the sample, the narrower the CI
True
Greater likelihood that the sample statistic approximates the population parameter
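The effect of sample size on CI width can be sketched numerically. This example assumes a fixed standard deviation and uses the normal (z) critical value as a large-sample stand-in for the t critical value; the name `ci_half_width` is hypothetical.

```python
import math
from statistics import NormalDist


def ci_half_width(sd, n, confidence=0.95):
    """Approximate half-width of a CI for a mean (normal approximation)."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # about 1.96 for 95%
    return z * sd / math.sqrt(n)


# Same standard deviation, increasing sample size
for n in (25, 100, 400):
    print(n, round(ci_half_width(10.0, n), 2))
```

Because the standard error scales with 1/sqrt(n), quadrupling the sample size halves the width of the interval.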
True/False
If the CI for the difference between the means contains 0 (the null value), then the means are not statistically different (non-significant finding)
True
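A rough check of this rule, assuming an unpooled standard error and a normal (z) critical value in place of the t critical value, so it is only an approximation for small samples. The names `diff_ci` and `contains_zero` and the data are illustrative.

```python
import math
from statistics import NormalDist, mean, variance


def diff_ci(a, b, confidence=0.95):
    """Approximate CI for the difference between two independent means."""
    se = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    d = mean(a) - mean(b)
    return d - z * se, d + z * se


def contains_zero(ci):
    """True if the interval includes the null value 0 (non-significant)."""
    lower, upper = ci
    return lower <= 0.0 <= upper


# Hypothetical groups: similar means vs. clearly different means
print(contains_zero(diff_ci([1.0, 2.0, 3.0], [2.0, 3.0, 4.0])))
print(contains_zero(diff_ci([1.0, 2.0, 3.0], [11.0, 12.0, 13.0])))
```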