Midterms material Flashcards
What are the null hypothesis and alternative hypothesis for a one-way ANOVA?
H0 :μ1 =μ2 =μ3
H1 : Not all μ’s are the same
What’s a factor in a one-way ANOVA?
the independent variable
What are the levels in a one-way ANOVA?
The different groups/treatment and control conditions
What are the assumptions of a one-way ANOVA?
- The population distribution of the DV is normal within each group
- The variance of the population distributions are equal for each group (homogeneity of variance assumption)
- Independence of observations
What’s the familywise Type 1 error rate?
The probability of making at least one Type 1 error in the family of tests if the null hypotheses are true
What’s a family of tests?
a set of related hypotheses
What does the Overall F-test or first test of ANOVA tell us?
- The overall F-test evaluates whether H0 is false
- If the overall F-test is significant, then we use post-hoc tests to look at pairs of groups
What kind of ratio does ANOVA give us?
- F ratio
- ANOVA gives us a ratio of variance due to group membership over variance that is not explained by group membership (MSm divided by MSr)
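The MSm/MSr ratio above can be sketched in a few lines of Python. This is a minimal illustration with made-up data for three groups; scipy's f_oneway serves as a check on the hand computation.

```python
import numpy as np
from scipy import stats

# Made-up illustration data: three groups of four scores each
groups = [np.array([4.0, 5.0, 6.0, 5.0]),
          np.array([7.0, 8.0, 6.0, 7.0]),
          np.array([9.0, 10.0, 8.0, 9.0])]

all_scores = np.concatenate(groups)
grand_mean = all_scores.mean()
k = len(groups)                      # number of groups
N = all_scores.size                  # total observations

# SS_M: between-group variation (variance explained by group membership)
ss_m = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)
# SS_R: within-group (residual) variation
ss_r = sum(((g - g.mean()) ** 2).sum() for g in groups)

ms_m = ss_m / (k - 1)                # MS_M, with df_M = k - 1
ms_r = ss_r / (N - k)                # MS_R, with df_R = N - k
f_by_hand = ms_m / ms_r

f_scipy, p = stats.f_oneway(*groups)
print(round(f_by_hand, 4), round(f_scipy, 4))  # the two F values agree
```

The hand-computed ratio and scipy's result match, which is a useful sanity check when learning the formulas.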
What is variance explained by the model (MSm)?
Between-group variance that is due to the IV, or different treatments/levels of a factor -> variance accounted for by group membership
What is residual variance (MSr)?
- Within-group variance that can’t be accounted for by group membership
- Within each group, there is some random variation in the scores for the subjects
How are the F statistic and degrees of freedom presented?
F (dfM, dfR) = x
What kind of distribution is the F distribution?
A right-skewed distribution used most commonly in ANOVA
When can you reject the null hypothesis in an ANOVA test?
If your F value is greater than or equal to the critical value, you may reject the null hypothesis
How does the F ratio relate to the t statistic?
- With only two groups, either a t test or an F test can be used for testing for a significant difference between means
- Both procedures lead to the same conclusion
- When the number of groups is 2, then F = t^2
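The F = t^2 relationship with two groups can be verified directly. The two samples below are made up for illustration; the equal-variances t test (scipy's default) is the one that matches the ANOVA.

```python
import numpy as np
from scipy import stats

# Made-up scores for two groups
g1 = np.array([4.4, 5.1, 3.9, 4.8, 5.0])
g2 = np.array([6.2, 5.9, 6.5, 6.1, 5.8])

t, p_t = stats.ttest_ind(g1, g2)     # assumes equal variances by default
f, p_f = stats.f_oneway(g1, g2)

print(round(t ** 2, 6), round(f, 6))   # t squared equals F
print(round(p_t, 6), round(p_f, 6))    # identical p-values
```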
In ANOVA formula, what does X-bar stand for?
The grand mean (across all observations)
In ANOVA formula, what does i stand for?
An observation (coming from N total observations)
In ANOVA formula, what does g stand for?
A group
In ANOVA formula, what does k stand for?
Total number of groups
In ANOVA formula, what does Ng stand for?
Size of group g
In ANOVA formula, what does Xbar-g stand for?
Group mean
In ANOVA formula, what does Xig stand for?
Observation i in group g
What does SSt stand for?
The aggregate variation/dispersion of individual observations across groups
What are MST , MSM , and MSR often called?
the total, model (between-group), and residual (within-group) Mean Squares, respectively
Which effect size is more commonly reported in ANOVA?
η2 (eta squared)
What do the effect sizes (pearson R, eta squared and omega squared) all look for?
Proportion of variance in the DV that is explained by the IVs
What’s the difference between
eta squared and omega squared?
- η2 is positively biased (overestimates the amount of variance explained in the DV by the IVs)
- ω2 is unbiased
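The bias difference can be seen numerically. A sketch with made-up data, using the standard one-way formulas η2 = SS_M / SS_T and ω2 = (SS_M − df_M · MS_R) / (SS_T + MS_R):

```python
import numpy as np

# Made-up illustration data: three groups of four scores each
groups = [np.array([4.0, 5.0, 6.0, 5.0]),
          np.array([7.0, 8.0, 6.0, 7.0]),
          np.array([9.0, 10.0, 8.0, 9.0])]
all_scores = np.concatenate(groups)
grand_mean = all_scores.mean()
k, N = len(groups), all_scores.size

ss_m = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)
ss_r = sum(((g - g.mean()) ** 2).sum() for g in groups)
ss_t = ss_m + ss_r
ms_r = ss_r / (N - k)

eta_sq = ss_m / ss_t                                   # positively biased
omega_sq = (ss_m - (k - 1) * ms_r) / (ss_t + ms_r)     # bias-corrected

print(round(eta_sq, 3), round(omega_sq, 3))  # omega_sq < eta_sq
```

ω2 comes out smaller than η2 on the same data, which is the bias correction at work.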
What are the cut-offs for the effect size of
omega squared?
- Small ≈ .01
- Medium ≈ .06
- Large ≈ .14
- Report ω2, even if it’s negative
What does fully-crossed mean in a factorial design?
That the factor levels are multiplied by each other (ex: factor 1 has 3 levels and factor 2 has 3 levels then it’s a 3x3 factorial design with 9 treatment conditions)
What elements should be included in the APA style analysis conclusion (in order)?
- 1-2 sentence overview of analyses that includes the independent and dependent variable, stated conceptually.
- Description of overall results of the F-test, in a particular format, including effect size measure
- Description of the pattern of mean differences among groups, including whether significant differences were found (M for mean and SD for standard dev) -> when working with 3 groups ANOVA test, we’ll have to conduct post-hoc tests to evaluate which pairs of groups have significant mean differences
- A conceptual conclusion
Provide an example of what elements should be included in the APA style analysis conclusion (in order)?
- To investigate whether level of fitness (low versus high) had an effect on ego strength (with higher scores indicating more ego strength), we conducted a one-way between-subjects ANOVA
- This analysis revealed a significant effect of fitness on ego strength, F(1, 8) = 5.32, p < .05, ω2 = .61
- Participants in the low fitness group (M = 4.40, SD = 0.92) had significantly lower ego strength than those in the high fitness group (M = 6.36, SD = 0.55)
- We conclude that having high as opposed to low fitness may increase ego strength
How to report numbers in APA format?
- 2 decimal places
- 3 decimal places for p-values
True or False: with two groups the results of an independent samples t-test and a between-subjects ANOVA on the same data set will always agree
FALSE: they could disagree if they use different values of α
What are assumptions of a single mean z-test?
- The variable, X, in the population is normally distributed
- The sample must be a simple random sample of the population (independence of observations)
- The population standard deviation, σ, must be known
What are the effect size cut-offs for r?
0.10 -> small effect
0.30 -> medium effect
0.50 -> large effect
What does a 95% Confidence interval mean?
If we repeated our experiment many times, 95% of the resulting 95% CIs would contain the true effect
What does the p-value represent?
The p-value represents the proportion of data sets that would yield a result as extreme or more extreme than the observed result if H0 is true
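This "proportion of data sets" idea can be made concrete by simulation. A sketch for a one-sample z test (the null mean 0, σ = 1, n = 25, and the observed z of 2.0 are all made-up values): generate many data sets under H0 and count how often the simulated statistic is at least as extreme as the observed one.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
n, n_sims = 25, 20000
z_obs = 2.0                                   # a made-up observed z statistic

# Simulate many data sets under H0 (mu = 0, sigma = 1) and compute z for each
sims = rng.normal(0.0, 1.0, size=(n_sims, n))
z_sim = sims.mean(axis=1) / (1.0 / np.sqrt(n))

p_sim = np.mean(np.abs(z_sim) >= z_obs)       # two-tailed proportion
p_exact = 2 * (1 - stats.norm.cdf(z_obs))     # theoretical p-value
print(round(p_sim, 3), round(p_exact, 3))     # close to each other
```

The simulated proportion approximates the theoretical two-tailed p-value.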
What are the effect size cut-offs for r squared?
0.01 -> small
0.09 -> medium
0.25 -> large
What are the effect size cut-offs for cohen’s d?
0.2 -> small
0.5 -> medium
0.8 -> large
What are the assumptions in between subjects ANOVA?
- Independence of observations
- Identical distribution (within group)
- Identical distribution (between groups)
- Homogeneity of variance
- Normal Distribution
Describe the formula Yij =μ+αj +Eij
- Formula describing the linear model underlying everything we do in ANOVA
- Yij = person i’s score on the outcome Y, where person i belongs to group j -> Y is the dependent variable
- Eij -> experimental error - something that allows individual scores of people in that population to vary from this group mean (assumed to be normal)
- Eij is random, but mu + alpha-j is fixed for every member of that population
- In this equation, mu + alpha-j is constant for every person in the population (one population = one mean)
The assumptions about normality and equal variances are assumptions about what?
- The population
- Usually we can examine the sample for evidence about whether these assumptions hold
What are some methods for Assessing Normality?
Descriptive and Inferential Statistics:
- Looking at the mean, median, mode
- Tests for skewness (testing whether skewness is significant -> normal distribution has skew of 0, any type of skewness means that the distribution isn’t perfectly normal)
- Kolmogorov-Smirnov and Shapiro-Wilk tests
Visual methods:
- Histograms
- Normal Quantile (Q-Q) Plot
Describe tests for skewness when assessing normality
- Skewness represents symmetry and whether the distribution has a long tail in one direction
- Left (negative) skew = Mean < Median
- Symmetric (normal) = Mean = Median
- Right (positive) skew = Median < Mean
- Skewness should be ~0
- Skewness > 0: positive/right skew (longer right-hand tail)
- Skewness < 0: negative/left skew (longer left-hand tail)
- Also look at standard errors (SE skewness)
- Conducting a significance test for whether skewness is significantly different from 0
- To compute this, we divide the estimate of skewness of our variable by its standard error, and compare the resulting ratio against a value of 3.2 in absolute value
- Reject the null hypothesis that skew is 0 in the population if the ratio tskewness is greater than 3.2 in absolute value
- Here we don’t want to reject the null hypothesis because rejecting it would mean we have found evidence that our scores aren’t normally distributed
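The skewness ratio described above can be sketched with scipy. The data are made-up right-skewed scores, and the standard-error formula used here is the common large-sample one, SE = sqrt(6n(n − 1) / ((n − 2)(n + 1)(n + 3))):

```python
import numpy as np
from scipy import stats

# Made-up, visibly right-skewed scores
x = np.array([1, 1, 2, 2, 2, 3, 3, 4, 5, 9, 12, 15], dtype=float)
n = x.size

skew = stats.skew(x, bias=False)                      # sample skewness (G1)
se_skew = np.sqrt(6 * n * (n - 1) /
                  ((n - 2) * (n + 1) * (n + 3)))      # SE of skewness
t_skew = skew / se_skew                               # ratio to compare to 3.2

print(round(skew, 2), round(se_skew, 2), round(t_skew, 2))
print("evidence against normality" if abs(t_skew) > 3.2
      else "retain the null of zero skew")
```

Here the skewness is clearly positive, but with only n = 12 the standard error is large, which illustrates why small samples rarely produce a significant skewness test.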
What’s the more unbiased estimate of central tendency?
Median, rather than the mean
What are the statistical tests of normality?
- The Kolmogorov-Smirnov (K-S) test
- The Shapiro-Wilk (S-W) test
- If a test is significant, reject the null hypothesis that the distribution of the variable is normal
What’s the Kolmogorov-Smirnov (K-S) test?
- Very general, but usually less power than Shapiro-Wilk (S-W) test
- Conceptually, compares sample scores to a set of scores generated from e.g., a normal distribution with the sample mean and standard deviation
- Used to see if the scores on your variable follow any distribution you think they follow
- Conceptually, this test takes your observed scores on the variable and it compares them to quantiles from this reference distribution you’re trying to assess whether it’s appropriate for your data
- If there are large departures from the quantiles from the reference distribution and your observed scores -> this would be evidence against your scores following the distribution you think they follow
What’s the Shapiro-Wilk (S-W) test?
- Usually more powerful, but only for normal distributions
- Follows a similar logic to the Kolmogorov-Smirnov (K-S) test
What are limitations of the normality tests and solutions to overcome these?
- It’s easy to find significant results (reject null hypothesis that data is normal) when sample size is large
- Same with skewness tests -> as the sample size gets larger, SE gets smaller and with smaller SE, you’re more likely to get a t ratio value larger than 3.2, even with small values of skewness
- Solution: do the tests, but plot data as well and examine the histogram for evidence of multimodality, extreme scores (outliers), and asymmetry
- More than one mode is evidence of deviation from normality
Describe the use of histograms to assess normality
- Create separate histograms for each group to assess normality
- Look for obvious signs of non-normality
- Doesn’t have to be perfect, just roughly symmetric
- Multiple modes may suggest that there are different subpopulations in the sample
- If that’s the case, include a classification variable as an additional factor in the ANOVA
Describe the use of normal quantile plot (or normal probability plot or Normal Q-Q plot) to assess normality
1. Sort the observations from smallest to largest and compute the percentile rank for each score (what percentage of scores are below score X?)
2. Calculate the theoretical (expected) z-scores from the percentile ranks (if the scores were normal, what would the z-score be?)
3. Calculate the actual (observed) z-scores
4. Plot the observed vs. theoretical z-scores
- We get percentiles from the z-distribution and see how much our observed z-scores deviate from the percentiles of the normal distribution
- If the data are close to normal, the points will lie close to a straight line
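The Q-Q computation (minus the plot itself) can be sketched as follows. The data are made-up draws from a normal distribution, and (i − 0.5)/n is one common plotting-position convention for the percentile ranks:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
x = np.sort(rng.normal(loc=50, scale=10, size=40))   # made-up normal data
n = x.size

# Percentile rank of each sorted score, using the (i - 0.5)/n convention
perc = (np.arange(1, n + 1) - 0.5) / n
theoretical_z = stats.norm.ppf(perc)                 # expected z if normal
observed_z = (x - x.mean()) / x.std(ddof=1)          # actual z-scores

# For roughly normal data, observed and theoretical z-scores line up
r = np.corrcoef(observed_z, theoretical_z)[0, 1]
print(round(r, 3))
```

Plotting observed_z against theoretical_z would give the straight-line pattern described above; the correlation near 1 is the numerical version of that.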
What do violations of the assumption of normality lead to?
- Non-normality tends to produce Type I error rates that are lower than the nominal value
- Depending on the context of the research study, this may be less concerning than an assumption violation that results in excessive Type I error rates (above the nominal value α)
- When we select an alpha of say .05, we’re saying that if the null hypothesis is true, 5% of our findings in the long run will be false positives
- If you don’t meet the assumption of normality and you pick an alpha level of .05 -> less than 5% of your results in the long run will be false positives if the null hypothesis is true
- This means you have lower power to detect differences if there is an effect in the population
- A consequence of the violation of the assumption of normality is that you might miss some effects (not inflating type 1 error rate but you are decreasing your power)
Type 1 error rate and what go hand in hand?
Type 1 error rate and power go hand in hand (as one increases so does the other)
What’s the assumption of homogeneity of variance?
Assuming that all of the group variances are equal
What does violation of the assumption of homogeneity of variance lead to?
- Serious violation of this assumption tends to inflate the observed value of the F statistic
- Too many rejections of H0 = high Type I error
- This is a more problematic assumption because if you violate this assumption, you will inflate your type 1 error rates
- If you select an alpha of .05, but your assumption of homogeneity of variance is not met, you may end up with more than 5% of false positives if the null hypothesis is true
What are the different tests that assess homogeneity of variance?
- The Fmax test of Hartley
- Levene’s test
- Brown and Forsythe test
What’s the Fmax test of Hartley?
- Fmax = ratio of largest group variance to the smallest group variance
- Calculate the sample variance for each group, and find the largest and smallest variances
- Compute Fmax = max(s2g) / min(s2g′)
- The observed Fmax value is compared against a critical value of this statistic
- If the assumption of homogeneity of variance is satisfied, Fmax ratio would be close to 1
- If the observed value of Fmax exceeds the critical value, we conclude that we have to reject the null hypothesis and the assumption is not met
- Easy to compute, but assumes that each group has an equal number of observations
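The Fmax computation itself is a one-liner; the comparison against Hartley's tabled critical value is left as a lookup. Data below are made up, with equal group sizes as the test requires:

```python
import numpy as np

# Made-up data: three groups of equal size (required by Hartley's test)
groups = [np.array([4.0, 5.0, 6.0, 5.0]),
          np.array([7.0, 9.0, 5.0, 7.0]),
          np.array([9.0, 10.0, 8.0, 9.0])]

variances = [g.var(ddof=1) for g in groups]   # sample variance per group
f_max = max(variances) / min(variances)       # largest over smallest

print([round(v, 3) for v in variances], round(f_max, 3))
# Compare f_max against the critical value from Hartley's table
# (k = 3 groups, df = n - 1 = 3 per group); a ratio near 1 supports
# homogeneity of variance
```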
What’s Levene’s test?
- Measures how much each score deviates from its group mean
- Zij = |Yij − Ybarj|
- Instead of using the original scores Yij to run the ANOVA, you use the absolute deviation scores Zij
- If we retain the null hypothesis, we can conclude that the assumption of homogeneity of variance is met
- The downside of this test is that it’s easier to obtain a significant F-ratio for this ANOVA when your sample size is large
What’s the Brown-Forsythe test?
- It measures how much each score deviates from its group median
- The median is less influenced by outliers than the mean and isn’t pulled by a skewed variable
- Zij =|Yij −Mdj|
- Instead of using the original scores Yij to run the ANOVA, you use the absolute deviation scores Zij
- For both the Levene and Brown-Forsythe tests a statistically significant finding (e.g., p ≤ .05) leads to the conclusion that the variances are significantly different across groups (i.e., the assumption of homogeneity of variance is not met)
- The Brown-Forsythe test is slightly more robust than Levene’s test
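In scipy, both tests are available through the same function: stats.levene with center='mean' is Levene's original test, and center='median' is the Brown-Forsythe variant. A sketch with made-up data:

```python
import numpy as np
from scipy import stats

# Made-up data for three groups
g1 = np.array([4.0, 5.0, 6.0, 5.0, 4.5])
g2 = np.array([7.0, 9.0, 5.0, 7.0, 11.0])
g3 = np.array([9.0, 10.0, 8.0, 9.0, 9.5])

w_lev, p_lev = stats.levene(g1, g2, g3, center='mean')     # Levene
w_bf, p_bf = stats.levene(g1, g2, g3, center='median')     # Brown-Forsythe

print(round(p_lev, 3), round(p_bf, 3))
# p <= .05 would mean the homogeneity-of-variance assumption is not met
```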
For both the Levene and Brown-Forsythe tests a statistically significant finding (e.g., p ≤ .05) leads to what conclusion?
That the variances are significantly different across groups (i.e., the assumption of homogeneity of variance is not met)
Which test is recommended more than the other: Brown-Forsythe test or Levene’s test?
Brown-Forsythe test is recommended over the Levene’s test
What are the 5 assumptions in ANOVA?
- Independence of observations (random sampling)
- Identical distribution (within groups) (random sampling)
- Identical distribution (between groups)
- Homogeneity of variance
- Normal distribution
What kind of statistical test only has one mean?
- z-test
- One-sample t-test
What kind of statistical test has 2 means and one factor?
Independent samples t-test
What kind of statistical test has more than 2 means and one factor?
One-way ANOVA
What kind of statistical test has more than 2 means and 2 factors?
Two-way ANOVA
What are the null hypotheses we need to find with a 2-way ANOVA?
Main effect of Factor A
Main effect of Factor B
Interaction between Factor A and B
What would a 3x4 set-up mean for a 2-way ANOVA?
3 levels in Factor A
4 levels in Factor B
What are factorial designs?
- Factorial designs are those in which factors are completely crossed
- They contain all possible combinations of the levels of factors
Ex: when each factor has 3 levels, it is called a 3 × 3 factorial design, resulting in 9 treatment combinations
What does it mean for a design to be fully crossed?
Every level of factor A is combined with every level of factor B
What’s a balanced design?
When sample sizes are equal in each condition
What do Factors represent?
The independent variables
What’s represented in the cells of the 2 × 2 factorial design of a 2-way ANOVA?
Means of all subjects within each cell are displayed
How many effects comprise a two-way factorial experiment?
2 main effects
An interaction effect
What are main effects?
- The effect of one factor when the other factor is ignored (by averaging the means over all levels of the other factor)
- Consists of the differences among marginal means for a factor
What’s the interaction effect?
- The extent which the effect of one factor depends on the level of the other factor
- An interaction is present when the effects of one factor on the DV change at different levels of the other factor
- The presence of an interaction indicates that the main effects alone do not fully describe the outcome of a factorial experiment
- Sometimes called a crossover effect
- Considers pattern of results for all cell means
How could you visualize a main effect for Factor A on a plot?
There’s a main effect of Factor A if, averaging over the levels of Factor B, the mean of the points at one level of A differs from the mean at the other level (i.e., the marginal means for A differ)
How could you visualize a main effect for Factor B on a plot?
There’s a main effect of Factor B if the average heights of the lines differ (i.e., the marginal means for B differ)
How could you visualize an interaction effect
between Factor A and Factor B on a plot?
There’s no interaction if the lines are parallel
There’s an interaction if the lines aren’t parallel (they converge, diverge, or cross)
The 2-way ANOVA statistically examines the effects of what?
- The 2 factors of interest on the DV (main effects)
- The interaction between the different levels of these 2 factors (interaction effect)
What are assumptions of the 2-way ANOVA?
- The population distribution of the DV is normal within each group
- The variance of the population distributions are equal for each group (homogeneity of variance assumption)
- Independence of observations
State the hypotheses for main effects
- Main effect of Factor A
H0A : μA1 = μA2 = ··· = μAa (equal row marginal means)
H1A : Not all μAg are the same
- Main effect of Factor B
H0B : μB1 = μB2 = · · · = μBb (equal column marginal means)
H1B : Not all μBj are the same
State the hypotheses for interaction effect
- Hypotheses for interaction effect
H0: A×B : All μAgBj are the same OR The interaction between Factor A and Factor B is equal to zero
H1: A×B : Not all μAgBj are the same OR The interaction between Factor A and Factor B is NOT equal to zero
How is total sums of squares (or total variation) partitioned in 2-way ANOVA?
- It’s divided into 2 parts:
SST = SSM +SSR
SSM = Model (Between-group) variation
SSR = Residual (Within-group) variation
How is sums of squares of the model (SSM) partitioned in 2-way ANOVA?
SSA: Variation between means for Factor A
SSB: Variation between means for Factor B
SSA×B: Variation between cell means
What’s the formula for the F ratio for Factor A in 2-way ANOVA?
FA = MSA / MSR
What’s the formula for the F ratio for Factor B in 2-way ANOVA?
FB = MSB / MSR
What’s the formula for the F ratio for the interaction of Factor A and Factor B in 2-way ANOVA?
FA×B = MSA×B / MSR
When should you reject the null hypothesis in a 2-way ANOVA?
If each observed F value is greater than or equal to its critical value
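The three F ratios above can be sketched for a balanced design by partitioning the sums of squares directly. The 2×3 data below (4 scores per cell) are made up for illustration:

```python
import numpy as np

# y[i, j] holds the scores for level i of Factor A, level j of Factor B
y = np.array([[[3., 4., 4., 5.], [5., 6., 5., 6.], [7., 8., 7., 8.]],
              [[4., 5., 5., 6.], [7., 8., 7., 8.], [6., 7., 6., 7.]]])
a, b, n = y.shape
grand = y.mean()

ss_a = b * n * ((y.mean(axis=(1, 2)) - grand) ** 2).sum()   # Factor A
ss_b = a * n * ((y.mean(axis=(0, 2)) - grand) ** 2).sum()   # Factor B
cell = y.mean(axis=2)                                       # cell means
ss_cells = n * ((cell - grand) ** 2).sum()
ss_axb = ss_cells - ss_a - ss_b                             # interaction
ss_r = ((y - cell[..., None]) ** 2).sum()                   # residual

ms_r = ss_r / (a * b * (n - 1))                             # MSR
f_a = (ss_a / (a - 1)) / ms_r                               # FA
f_b = (ss_b / (b - 1)) / ms_r                               # FB
f_axb = (ss_axb / ((a - 1) * (b - 1))) / ms_r               # FA×B

print(round(f_a, 3), round(f_b, 3), round(f_axb, 3))
```

Each observed F is then compared to its own critical value, as the card above describes.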
What does “a” stand for in 2-way ANOVA?
Number of levels for Factor A
What does “b” stand for in 2-way ANOVA?
Number of levels for Factor B
What are treatment sums in 3-way ANOVA?
Sum of raw scores in each treatment group
What’s the grand sum (T) in 3-way ANOVA?
The sum of all the scores in the experiment
How many total effects do we have to compute/find with a 3-way ANOVA?
- 7 effects
- 3 main effects (A, B, C)
- 3 simple (two-way) interactions (AxB, AxC, BxC)
- 1 three-way interaction (AxBxC)
Describe the within-subjects/repeated- measures design
- An experimental design in which the DV is measured several times within the same subject
- Subjects are crossed with at least one experimental factor
- The simplest design of this kind may be a before and after-treatment design (2 conditions)
What is a one-way repeated-measures design comprised of?
- Levels of Factor A (only has 1 factor)
- Subjects (participants)
Describe the one-way repeated measures design
n subjects are measured on the DV under k conditions (or levels) of a single IV or factor
What are possible research questions with the one-way repeated measures design?
- Are there differences in the mean scores of the DV across groups/conditions?
- Within-subject effect of the independent variable (each subject is measured at each time point) -> Variation due to the model
- Are there differences across subjects?
- The variability of subjects (between-subject effect)
- Treat each participant as a different level in an experimental design
What’s the null hypothesis for the between-subject effect in the one-way repeated measures design?
H0: Vs = 0
- this effect represents the variance between subjects
What’s the null hypothesis for the within-subject effect of treatment (IV) in the one-way repeated measures design?
H0: μ1 = μ2 · · · = μk
What are we interested in in a One-way repeated measures ANOVA?
- Usually we are NOT interested in the effect of ‘subjects’ or subject-level variability
- If this effect is significant, it would simply tell us that subjects differ on the dependent variable, which has nothing to do with our treatment (IV), so it’s irrelevant
- What we are really interested in is whether the IV has an effect on the subjects, regardless of whether differences existed naturally among the subjects
What’s referred to as SS error in one-way repeated measures ANOVA?
SS(AxS), the treatment-by-subject interaction
SS within is composed of what 2 types of SS in one-way repeated measures ANOVA?
SS(S) and SS(AxS)
What are the assumptions in one-way repeated measures ANOVA?
- Normality
- Homogeneity of variance
- Homogeneity of covariance
Describe the normality assumption in one-way repeated measures ANOVA
The distribution of observations on the dependent variable is normal within each level of the factor
Describe the homogeneity of variance assumption in one-way repeated measures ANOVA
The population variance of the observations is equal at each level of the factor
Describe the homogeneity of covariance assumption in one-way repeated measures ANOVA
The population covariance between any pair of repeated measurements is equal (homogenous covariance)
What 2 assumptions in the one-way repeated measures ANOVA are considered compound symmetry?
- Homogeneity of variance
- Homogeneity of covariance
Describe the assumption of compound symmetry in one-way repeated measures ANOVA
We assume that the variation within experimental conditions is fairly similar and that no two conditions are any more dependent than any other two
Describe the assumption of sphericity in one-way repeated measures ANOVA
- Given that hypotheses about treatment effects are tested on differences between scores, the assumption of compound symmetry can be replaced by the assumption of sphericity (or circularity)
- Sphericity means that the variance of differences of a pair of observations is the same across all pairs
- In the assumption of sphericity, we assume that the relationship between pairs of experimental conditions is similar
- This assumption is tested in practice, and it is a necessary condition for validity of the F test in repeated measures ANOVA
What happens when there’s a violation of sphericity in one-way repeated-measures ANOVA and how do we deal with it?
- Use tests for violations of sphericity, such as Mauchly’s (1940) test of the W statistic
- When compound symmetry is violated, the omnibus F tests in one-way repeated measures ANOVA tend to be inflated, leading to more false rejections of H0
- Violations of CS require adjustments to the F test
- We can use a conservative critical value based on the possible violation of sphericity (conservative F test)
- The inflation of the F statistic that occurs when sphericity is violated can be adjusted by evaluating the observed F value against a greater critical value, obtained by reducing the degrees of freedom
- Some of the most popular approaches involve:
1. measuring the degree of violation of sphericity
2. using the critical value equal to the value of the F distribution that corresponds to εdf (the adjustment is made for both df numerator and df denominator)
What’s the formula for finding the conservative critical value?
DF(B) = epsilon x (k-1)
DF(BS) = epsilon x (k-1)(n-1)
What’s the function of epsilon?
It measures the extent to which sphericity was violated
What’s the criteria for determining a violation of sphericity in one-way repeated-measures ANOVA?
- When sphericity holds, epsilon = 1 (i.e., no correction is needed).
- When sphericity is violated, 0 < epsilon < 1
- This reduces both DF(B) and DF(BS), and gives a larger critical value for F
- The further the epsilon value is from 1, the worse the violation
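The df-shrinking effect of epsilon can be sketched with scipy. The values here (k = 4 levels, n = 10 subjects, epsilon = 0.70) are made up; epsilon = 1.0 represents sphericity holding:

```python
from scipy import stats

k, n = 4, 10          # made-up: 4 levels of the factor, 10 subjects
alpha = 0.05

crits = []
for eps in (1.0, 0.70):                    # 1.0 = sphericity holds
    df_b = eps * (k - 1)                   # DF(B)  = epsilon x (k - 1)
    df_bs = eps * (k - 1) * (n - 1)        # DF(BS) = epsilon x (k - 1)(n - 1)
    crits.append(stats.f.ppf(1 - alpha, df_b, df_bs))

print([round(c, 3) for c in crits])
# The smaller epsilon gives fewer df and a larger (more conservative)
# critical value for F
```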
How do we decide the value of epsilon?
- Epsilon ≥ 1/(k-1)
- JASP provides two estimates of epsilon: Greenhouse- Geisser & Huynh-Feldt estimates
What’s the difference between the Greenhouse-Geisser & Huynh-Feldt estimates?
Greenhouse-Geisser is smaller (more conservative)
What’s the effect size for one-way repeated-measures ANOVA?
Partial Omega squared (ω2) that excludes the variability due to differences between subjects (MSS)
What’s the formula for within-participant variation in one-way repeated-measures ANOVA?
SSW =SSA+SSAxS
What’s the F-ratio for the treatment effect in one-way repeated measures ANOVA?
F = MSA / MSAxS
When should you reject the null hypothesis from an F value?
If the observed F value is greater than or equal to its critical value, reject the corresponding null hypothesis
If there’s a significant result p is < or > than .05?
p < .05
If there isn’t a significant result p is < or > than .05?
p > .05
Describe sphericity in one-way repeated-measures ANOVA
Variance of difference scores is equal for all pair-wise comparisons
What do the null and alternative hypotheses indicate in Mauchly’s test
- The null hypothesis in Mauchly’s test is that the assumption of sphericity is met
- Rejecting the null hypothesis indicates that the assumption of sphericity is violated
When sphericity is violated, using the F ratio with unadjusted degrees of freedom leads to what?
- An increase in Type I error rates (false positives)
- When sphericity is violated, the type 1 error rate is no longer .05 but it is greater
What are the 3 possible ways to calculate epsilon?
- Greenhouse-Geisser approach
- Huynh-Feldt approach
- Minimum possible value epsilon can attain which is ε = 1/(a − 1)
What’s the preferred and most generally used adjustment for violation of sphericity
- Huynh-Feldt
- Because it tends to have the highest power
- The adjustment we usually use when reporting the results in APA format
What values change when we use the adjustments for violation of sphericity?
- df values
- MS values (since they’re calculated with df)
What values don’t change when we use the adjustments for violation of sphericity?
- SS values
- Observed F ratios -> because if you multiply both sets of degrees of freedom by epsilon, those 2 adjustments cancel each other out so you end up with the same observed F ratio
What test results should always be reported first in an APA summary for a one-way repeated-measures ANOVA?
- (when applicable) Mauchly’s test should always be reported first in APA summary
Why is epsilon denoted as 0 < epsilon < 1?
Epsilon can’t be zero because the formula always includes a minimum of a=2 (2 levels since repeated measures needs a minimum of 2 levels) so the epsilon formula can’t give 0
What measure of effect size do we use for 2-way ANOVA?
Omega-squared (ω2)
In One-way ANOVA, SSR stands for what?
It’s the sum of the squared difference between a group mean and group observations, across all k groups
In Two-way ANOVA, SSR stands for what?
It’s the sum of squared differences between
a cell mean and cell observations, across all (a × b) cells
Describe Mauchly’s Test of Sphericity
- H0 in this test is “variances of differences between conditions are equal”
- If p < .05, the assumption of sphericity (and CS) is violated
- Available in JASP
How are the Greenhouse-Geisser and Huynh-Feldt values obtained?
- Both are estimates of epsilon; the adjusted degrees of freedom are obtained by multiplying the unadjusted df by the corresponding estimate (e.g., epsilon x (a − 1) for the numerator df)
- The Huynh-Feldt estimate is larger than the Greenhouse-Geisser estimate, so its adjustment is less conservative