Module 10, Hypothesis Testing Two Samples Flashcards
Between Group Comparisons:
enable us to determine if a difference between two groups is large enough to be statistically significant, in order to reject the null
Within Groups Comparisons:
deals with the effects of experimental interventions, comparing groups before and after the intervention, LIKE A WITHIN SUBJECTS MEASURE
Standard Error of the Mean
Measure of variability of sample means around the population mean in a sampling distribution of the mean
SD of the sampling distribution
Degrees of Freedom pooled
When two independent samples with approximately equal variances the df are determined by summing the sample sizes and then subtracting the number of samples
Df pooled = N1 + n2—2
Order of subtraction
Order of Subtraction
Alternative hypothesis determines the order of subtraction
Subtract the one that is not specified to be higher from the one that is
Independent Samples/ Between Subjects Design
Randomized, Independent groups, between subjects ALL MEAN THE SAME THING
Don’t always have to compare an experimental group to a control group: common misconception is that experiment has to have control condition THIS ISN’T TRUE; sometimes not possible to have a control group
Can compare two different amounts of an IV (compare one dose to another dose of drug when you already know it works)
Can be an experiment or ex post facto: two groups that are preexisting also works
Each participant participates in ONE AND ONLY ONE GROUP
Comparisons made between group
RANDOMIZED DESIGN: Random assignment used to assign participants to groups (IF IT IS A TRUE EXPERIMENT)
Levene’s Test for Equality of Variance
if significant, then variances are not equal
Conditions of Independent T Test
Each population should have a normal distribution AS LONG AS SAMPLE SIZE IS OVER 25
The samples must be independent: one participant’s score does not influence another score
Homogenity of variance: both groups must be sampled from populations with similar variance
Effect size interpretations, uses of effect see
D= 0.2 (SMALL)
d=0.5 (Medium)
d= (0.8) LARGE
Compare how meaningful an effect is between studies without knowing the exact scale they used: ratio of difference to variability
Allows us to determine our ability to detect a difference,s hould one exist for a particular sample size, if we have an estimate of the effect size
We use our best guess of the effect size with the sample size we have ^^
Delta:
value used in referring to power tables that combine effect size and sample size
Power is always between 0 and 1, can never be 1 because you can never be 100% sure that you’re going to find an effect
How does increasing affect size increase power?
Increase effect size: difference between null and alternative is effect size
Move distributions
Make this manipulation stronger: but might compromise external validity/may not be practical/ethical
Advantages and disadvantages of increasing sample size on power
WORKS BECAUSE OF THE CLT: standard error goes down as n increases
Greater sample size = skinnier distribution (less variability/spread)
Disadvantages: may be difficult to recruit a large sample, much more expensive with time, money, lab equipment
STILL THE BEST APPROACH TO INCREASING POWER; DOESN’T JEOPARDIZE EXTERNAL OR INTERNAL VALIDITY (doesn’t increase chance of making Type I error)
Diusadvantages and advantages of decreasing variance on power
Decrease population variance: decreasing “noise of participants” increases power
Decrease variance: more sure about our estimate of the true parameter if everything is less variable
Make distributions skinnier, decreases variance, and increases power
Getting distributions to be skinnier: use less diverse population=homogenity, helps to detect effect
DISADVANTAGES: results may not generalize properly, because you do not have a diverse sample
May be hard to find people with that particular characteristic you’re looking for from the population
Effects of using a one tailed vs two tailed test on power
Use a one-tailed test instead of two tailed: move the criterion line on the one side, same chance of making TYpe I error but now putting it all in one tail: DECREASES BETA AND INCREASES POWER (only use this if youre very sure of the direction of your effect), if you’re not sure you shouldn’t use it, and it goes against convention (leads people to believe you originally did a two tailed test and it didn’t work so then you changed it to do a one tailed test, could manipulate the data dishonestly**) can fix this with pre-registering your hypothesis before you collect data
Effects of changing alpha on power
Could change the alpha level, decreases the threshold needed for significance: BUT INCREASES CHANGE OF MAKING TYPE ONE ERROR (unfortunately) but also increases ability to find effect (usually don’t want to do this)