Midterm 1 Flashcards

Question

What do the effect sizes (pearson R, eta squared and omega squared) all look for?

Answer 1

Proportion of variance in the DV that is explained by the IVs

Answer 2

- η2 is positively biased (overestimates the amount of variance explained in the DV by the IVs) - ω2 is unbiased

Answer 3

- Small ≈ .01 - Medium ≈ .06 - Large ≈ .14 - Report ω2, even if it’s negative

Answer 4

That the factor levels are multiplied by each other (ex: factor 1 has 3 levels and factor 2 has 3 levels then it's a 3x3 factorial design with 9 treatment conditions)

Answer 5

1. 1-2 sentence overview of analyses that includes the independent and dependent variable, stated conceptually. 2. Description of overall results of F -test, in a particular format, including effect size measure 3. Description of the pattern of mean differences among groups, including whether significant differences were found (M for mean and SD for standard dev) -> when working with 3 groups ANOVA test, we’ll have to conduct post-hoc tests to evaluate which pairs of groups have significant mean differences 4. A conceptual conclusion

Answer 6

1. To investigate whether level of fitness (low versus high) had an effect on ego strength (with higher scores indicating more ego strength), we conducted a one-way between-subjects ANOVA 2. This analysis revealed a significant effect of fitness on ego strength, F (1, 8) = 5.32, p < .05, ω2 = .61 3. Participants in the low fitness group (M = 4.40, SD = 0.92) had significantly lower ego strength than those in the high fitness group (M = 6.36, SD = 0.55) 4. We conclude that having high as opposed to low fitness may increase ego strength

Answer 7

- 2 decimal places - 3 decimal places for p-values

Answer 8

FALSE: they could disagree they use a different value of α

Answer 9

- The variable, X, in the population is normally distributed - The sample must be a simple random sample of the population (independence of observations) - The population standard deviation, σ, must be known

Answer 10

0.10 -> small effect 0.30 -> medium effect 0.50 -> large effect

Answer 11

If we repeated our experiment many times, 95% of the time a 95% CI will contain the true effect

Answer 12

The p-value represents the proportion of data sets that would yield a result as extreme or more extreme than the observed result if H0 is true

Answer 13

0.01 -> small 0.09 -> medium 0.25 -> large

Answer 14

0.2 -> small 0.5 -> medium 0.8 -> large

Answer 15

1. Independence of observations 2. Identical distribution (within group) 3. Identical distribution (between groups) 4. Homogeneity of variance 5. Normal Distribution

Answer 16

- Formula describing the linear model underlying everything we do in ANOVA - Yij = person i’s score on the outcome Y and this person i belongs in group j -> Y is the dependant variable - Eij -> experimental error - something that allows individual scores of people in that population to vary from this group mean (assumed to be normal) - Eij is random, but mu + alpha-j is fixed for every member of that population - In this equation, mu + alpha-j is constant for every person in the population (one population = one mean)

Answer 17

- The population - Usually we can examine the sample for evidence about whether these assumptions hold

Answer 18

Descriptive and Inferential Statistics: - Looking at the mean, median, mode - Tests for skewness (testing whether skewness is significant -> normal distribution has skew of 0, any type of skewness means that the distribution isn’t perfectly normal) - Kolmogorov-Smirnov and Shapiro-Wilk tests Visual methods: - Histograms - Normal Quantile (Q-Q) Plot

Answer 19

- Skewness represents symmetry and whether the distribution has a long tail in one direction - Left (negative) skew = Mean < Median - Symmetric (normal) = Mean = Median - Right (positive) skew = Median < Mean - Skewness should be ~0 > 0 - positive/right skew (longer right-hand tail) < 0 - negative/left skew (longer left-hand tail) - Also look at standard errors (SE skewness) - Conducting a significance test for whether skewness is significantly different from 0 - To compute this, we will get an estimate of skewness of our variable, divided by the standard error, and then compare this against a value of 3.2 in absolute value - Reject the null hypothesis that skew is 0 in the population if the ratio tskewness is greater than 3.2 in absolute value - Here we don’t want to reject the null hypothesis because rejecting it would mean we have found evidence that our scores aren’t normally distributed

Answer 20

Median, rather than the mean

Answer 21

- The Kolmogorov-Smirnov (K-S) test - The Shapiro-Wilk (S-W) test - If a test is significant, reject the null hypothesis that the distribution of the variable is normal

Answer 22

- Very general, but usually less power than Shapiro-Wilk (S-W) test - Conceptually, compares sample scores to a set of scores generated from e.g., a normal distribution with the sample mean and standard deviation - Used to see if the scores on your variable follow any distribution you think they follow - Conceptually, this test takes your observed scores on the variable and it compares them to quantiles from this reference distribution you’re trying to assess whether it’s appropriate for your data - If there are large departures from the quantiles from the reference distribution and your observed scores -> this would be evidence against your scores following the distribution you think they follow

Answer 23

- Usually more powerful, but only for normal distributions - Follows a similar logic to the Kolmogorov-Smirnov (K-S) test

Answer 24

- It's easy to find significant results (reject null hypothesis that data is normal) when sample size is large - Same with skewness tests -> as the sample size gets larger, SE gets smaller and with smaller SE, you’re more likely to get a t ratio value larger than 3.2, even with small values of skewness - Solution: do the tests, but plot data as well and examine the histogram for evidence of multimodality, extreme scores (outliers), and asymmetry - More than one mode is evidence of deviation from normality

Answer 25

- Create separate histograms for each group to assess normality - Look for obvious signs of non-normality - Doesn't have to be perfect, just roughly symmetric - Multiple modes may suggest that there are different subpopulations in the sample - If that's the case, include a classification variable as an additional factor in the ANOVA

Answer 26

1. Compute percentile rank for each score - Sort observations from smallest to largest - What percentage of scores are below score X? 2. Calculate (theoretical or expected) z-scores from percentile rank - If the scores were normal, what would the z-score be? 3 Calculate actual z-scores 4 Plot the observed vs. theoretical z-scores - We get some percentiles from the z-distribution and we see how much our observed z-scores deviate from the percentiles from the normal distribution - If the data are close to normal, then the points will like close to a straight line

Answer 27

- Non-normality tends to produce Type I error rates that are lower than the nominal value - Depending on the context of the research study, this may be less concerning than an assumption violation that results in excessive Type I error rates (above the nominal value α) - When we select an alpha of say .05, we’re saying that if the null hypothesis is true, 5% of our findings in the long run will be false positives - If you don’t meet the assumption of normality and you pick an alpha level of .05 -> less than 5% of your results in the long run will be false positives if the null hypothesis is true - This means you have lower power to detect differences if there is an effect in the population - A consequence of the violation of the assumption of normality is that you might miss some effects (not inflating type 1 error rate but you are decreasing your power)

Answer 28

Type 1 error rate and power go hand in hand (as one increases so does the other)

Answer 29

Assuming that all of the group variances are equal

Answer 30

- Serious violation of this assumption tends to inflate the observed value of the F statistic - Too many rejections of H0 = high Type I error - This is a more problematic assumption because if you violate this assumption, you will inflate your type 1 error rates - If you select an alpha of .05, but your assumption of homogeneity of variance is not met, you may end up with more than 5% of false positives if the null hypothesis is true

Answer 31

- The Fmax test of Hartley - Levene’s test - Brown and Forsythe test

Answer 32

- Fmax = ratio of largest group variance to the smallest group variance - Calculate the sample variance for each group, and find the largest and smallest variances - Compute Fmax: Fmax = maxs2g mins2g′ - The observed Fmax value is compared against a critical value of this statistic - If the assumption of homogeneity of variance is satisfied, Fmax ratio would be close to 1 - If the observed value of Fmax exceeds the critical value, we conclude that we have to reject the null hypothesis and the assumption is not met - Easy to compute, but assumes that each group has an equal number of observations

Answer 33

- Measures how much each score deviates from its group mean Zij =|Yij −Ybarj| - Instead of using the original scores Yij to run the ANOVA, you use the absolute deviation scores Zij - If we retain the null hypothesis, we can conclude that the assumption of homogeneity of variance is met - The downside of this test is that it’s easier to obtain a significant F-ratio for this ANOVA when your sample size is large

Answer 34

- It measures how much each score deviates from its group median - The median is less weighed by outliers than the mean and isn’t pulled by a skewed variable - Zij =|Yij −Mdj| - Instead of using the original scores Yij to run the ANOVA, you use the absolute deviation scores Zij - For both the Levene and Brown-Forsythe tests a statistically significant finding (e.g., p ≤ .05) leads to the conclusion that the variances are significantly different across groups (i.e., the assumption of homogeneity of variance is not met) - The Brown-Forsythe test is slightly more robust than Levene’s test

Answer 35

That the variances are significantly different across groups (i.e., the assumption of homogeneity of variance is not met)

Answer 36

Brown-Forsythe test is recommended over the Levene’s test

Answer 37

- Independence of observations (random sampling) - Identical distribution (within groups) (random sampling) - Identical distribution (between groups) - Homogeneity of variance - Normal distribution

Midterm 1 Flashcards

(61 cards)