exam 2 Flashcards
Problems with z scores
Z test procedure– use one sample mean from a known population to test hypotheses about an unknown population
To use a z score we need to know the pop mean and std deviation from which we draw our sample
We often have a solid idea of what the population’s mean should be
We have to know population std deviation, but we usually don’t
The solution to this is the t statistic
T statistic
test hypothesis about an unknown population when the value of the population std deviation is unknown
We estimate it with our data
Formula is very similar to the z score formula
Main difference– uses an estimated standard error in the denominator
Estimated standard error
estimate of the real standard error when the population std deviation is unknown
We don’t have info about population, but we do have info about the sample so we use that in place of the population
estimated standard error goal
provide an estimate of the standard distance between a sample mean and the population mean
Estimated slightly differently across three types of t tests
All have slightly diff estimation, but they all follow the same formula
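A minimal sketch of the one-sample version of this estimate, with made-up scores– the sample standard deviation stands in for the unknown population value:

```python
import math

# Hypothetical sample of scores (invented numbers for illustration)
sample = [52, 47, 55, 60, 49, 51, 58, 50]
n = len(sample)
mean = sum(sample) / n

# Sample variance divides by n - 1 (degrees of freedom), because the
# sample mean was used in the calculation and restricts one value
var = sum((x - mean) ** 2 for x in sample) / (n - 1)
sd = math.sqrt(var)

# Estimated standard error: sample SD in place of the unknown population SD
sem = sd / math.sqrt(n)
print(round(sem, 3))
```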
One sample t-test
comparing the mean of one sample with a known population mean
Same logic as a z test
We still don’t have any control group– it’s one sample and one group, and we want to see whether, after applying the treatment, the group still belongs to the known population
Diff between z-test– estimating the population std deviation/variance
Null and alternative hypothesis remain the same
one sample t-test examples
Are the starting salaries of CSU grads different from the national average (63,795)
We know the mean– 63,795 and want to see if CSU grads match that
Did our participants score differently from the median scale point of 25
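A sketch of a one-sample t-test in Python with scipy– the salary numbers here are simulated, not real CSU data:

```python
import numpy as np
from scipy import stats

# Hypothetical starting salaries (in thousands) for 12 grads
rng = np.random.default_rng(0)
salaries = rng.normal(loc=60, scale=8, size=12)

# H0: mu = 63.795 (national average, in thousands)
t_stat, p_val = stats.ttest_1samp(salaries, popmean=63.795)
print(f"t = {t_stat:.3f}, p = {p_val:.3f}, df = {len(salaries) - 1}")
```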
Independent samples t-test
compare mean of one group with the mean of a different group
Most typical one used in psych
Allows researchers to evaluate the mean difference between two populations using data from two separate samples
Want to see if the two samples you are comparing belong to two different populations
Looking at the difference between means
independent samples t-test null, alt, and cohens d
Null hypothesis– two samples come from the same underlying population– there’s no real differences between them
Assuming that the difference is zero
Alternative hypothesis– two samples come from different populations
Cohen’s d
Sample mean difference divided by the pooled sample standard deviation
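A small illustration of this formula with hypothetical scores– the pooled variance weights each group’s variance by its degrees of freedom:

```python
import numpy as np

def cohens_d(a, b):
    """Cohen's d for two independent samples: mean difference
    over the pooled sample standard deviation."""
    a, b = np.asarray(a, float), np.asarray(b, float)
    n1, n2 = len(a), len(b)
    # Pooled variance: each group's variance weighted by its df
    sp2 = ((n1 - 1) * a.var(ddof=1) + (n2 - 1) * b.var(ddof=1)) / (n1 + n2 - 2)
    return (a.mean() - b.mean()) / np.sqrt(sp2)

# Invented ratings from two groups
group1 = [7, 6, 8, 5, 7, 6]
group2 = [4, 5, 3, 6, 4, 5]
print(round(cohens_d(group1, group2), 2))
```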
Assumptions of independent samples t-test
The two populations are normally distributed
Can help ensure this with a big sample size (at least 30)
Two samples are random samples from their populations
Homogeneity of variance
States the two populations you are estimating have the same variance (spread around mean)
Necessary to justify pooling
Solution– make sample sizes equal
There is never a case where you can safely violate the homogeneity of variance assumption
False
Can be safely violated when sample sizes are equal, or by running the Welch two samples test instead
independent samples t test example
do men and women have different emotional responses to romantic comedy movies?
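A sketch of this comparison with invented scores, showing both the pooled test and the Welch version that drops the homogeneity-of-variance assumption (with equal sample sizes the two t values coincide; only the df differ):

```python
import numpy as np
from scipy import stats

# Hypothetical emotional-response scores (higher = stronger response)
men   = np.array([12, 15, 11, 14, 13, 10, 12, 14])
women = np.array([16, 14, 17, 15, 18, 13, 16, 15])

# Standard independent samples t-test pools the variances
t_pooled, p_pooled = stats.ttest_ind(men, women)

# Welch's test does not assume equal population variances
t_welch, p_welch = stats.ttest_ind(men, women, equal_var=False)

print(f"pooled: t = {t_pooled:.3f}, p = {p_pooled:.3f}")
print(f"Welch:  t = {t_welch:.3f}, p = {p_welch:.3f}")
```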
Paired samples t test
compare mean of one group with the mean of a group that is matched or connected to the first in some way
Ex– couples because they are related to each other in some way
Could be over time, or just genuinely related to each other
paired samples t test repeated measures
participants measured in two conditions (or two time points)
Dataset consists of two scores on one variable per individual
All participants should be scored twice on the same variable to see if there’s some improvement
Like a Quasi experiment
paired samples t test matched pairs
different participants in each condition, but they are matched somehow
Dataset consists of two related groups who are scored on the same variable
Could be related or a case like people that have the same IQ
Ex– is there a difference in eating behavior between mothers and daughters
paired samples t test logic
Everyone in the population is tested at baseline
Everyone in the population is then treated
Everyone in the population is tested after treatment
We want to assess whether there is a systematic change in the same population between the first and second measures
paired samples null and alt
Null hypothesis– the population of difference score has a mean of zero
Alternative hypothesis– the population of difference scores does not have a mean of zero
Degrees of freedom
Problem with the t-test– we are estimating the real standard error using our sample’s standard deviation
We don’t know how close our estimate is to the true standard error
To compute the sample variance, we use the sample mean
This restricts the amount of variance we can have
Degrees of freedom is the number of scores in a sample that are statistically independent and free to vary
Statistically free to vary = values not restricted by sample mean
essentially just sample size -1
The higher your degrees of freedom, the more accurate your estimated standard error, and thus your t-statistic, will be
Higher sample sizes (degrees of freedom) will be more accurate in terms of how they represent the population
The first numbers can be literally anything, but once we know the mean, the last number is fixed
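A tiny illustration of that last point, with arbitrary numbers– once the mean is known, only n - 1 scores are free to vary:

```python
# If n = 5 and the mean is known to be 10, the first four scores can be
# anything, but the fifth is then fixed: the scores must sum to n * mean.
n, mean = 5, 10
free_scores = [8, 13, 7, 12]          # n - 1 = 4 freely chosen values
last_score = n * mean - sum(free_scores)
print(last_score)  # the only value consistent with the mean
```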
T distribution
sampling distribution in a t test is still a distribution of all possible sample means
t distribution comparison between z-test and t-test
A z-score distribution is the set of z-scores for all possible sample means of a particular sample size (n)
A t score distribution is the set of t-scores for all possible sample means of a particular degrees of freedom (n-1)
Family of distributions of all possible degrees of freedom
Pick the one you want to use based on samples degrees of freedom
Key– we can use the t-distribution in the same way we used the z distribution if we know the degrees of freedom
T-distribution shape
Shape differs across possible degrees of freedom
Different sample sizes = different degrees of freedom = different t-distributions
Generally looks pretty normal if sample size is 30
Greater degrees of freedom, the more normal it will be
Probability and the t distribution
We can determine probabilities of extreme scores in t distributions just as we did for z distributions
Use a t distribution table
Use R studio
T scores of greater absolute value are more extreme, and therefore more rare
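A sketch of both lookups with scipy in place of a table or R (the t value 2.1 and the df are arbitrary)– it also shows the critical t shrinking toward the z critical value as df grows:

```python
from scipy import stats

# Two-tailed probability of |t| >= 2.1 with df = 15
df = 15
p_two_tailed = 2 * stats.t.sf(2.1, df)
print(f"P(|t| >= 2.1) with df={df}: {p_two_tailed:.4f}")

# Critical t for alpha = .05 (two-tailed) approaches the
# z critical value (about 1.96) as df increases
for df in (5, 15, 30, 120):
    print(df, round(stats.t.ppf(0.975, df), 3))
print("z:", round(stats.norm.ppf(0.975), 3))
```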
One sample t-test assumptions
Observations in our sample are independent
Score of one observation has no bearing on another observation
If they are dependent on one another, can use paired samples t-test
Comparing to a known value
Ex– would be inappropriate if a sample was a mother and daughter
Population distribution must be normal
Often violated– rare to see a truly normal population distribution
T-tests are robust to this violation when sample sizes are at least n=30
One-sample t-test hypothesis testing framework
Step one– state null and alternative hypothesis
Step two– set your alpha level and find your critical region/value
Step three- calculate test statistic
Step four– compare t to the critical t and make a decision
practical significance
is this effect big enough to matter?
Even a small effect might be practically important in the right context
Statistical significance
(rejecting the null hypothesis) is not the same as practical significance
A huge sample means higher power to reject the null even when the effect is very small
Effect size
measurement of the absolute magnitude of a treatment effect, independent of the size of the samples being used
Sample size doesn’t matter, just focusing on how big the effect was
For t tests we can use cohen’s d
Effect size measure we use to compare two means (t-test)
Measures whether or not the mean difference matters
Interpreting effect sizes
Cohen suggested guidelines for interpreting Cohen’s d
Negligible– 0-.19
Small– .20-.49
Medium–.50-.79
Large– .80 or higher
Not the best because it all depends on context
DFs effect on power
When you select a score as a cutoff point between the body and tail, it will be more extreme in a t-distribution than in a normal distribution– the lower the degrees of freedom, the more extreme the critical value, and the lower the power
Interpretation of standard error of mean differences
independent samples t test
Measure of average distance between a sample statistic and the corresponding population parameter in the sampling distribution
Uneven samples
(interpretation of std error of mean differences)
when samples are different sizes, the larger sample provides a better estimate of the variance than the smaller sample
Solutions– pooled variance
Use pooled variance to calculate the standard error
If the samples are equal, both the pooled and unpooled formulas will derive the same result
If they are not equal, the pooled will be better
Independent sampling distributions
distribution of differences between the means
A difference between means is different from a difference score
Ex– mean difference between two different samples
Control sample mean - experiment sample mean
Determines probability of mean difference scores
paired sampling distributions
distribution of means of difference scores
Ex– time 1 score - time 2 score
Assumptions for a paired samples t test– Observations within each time point must be independent
All of the husbands can’t have a systematic relationship to each other, same for wives
Time point example– observations can not be related to each other
assumption for a paired samples t test– Population distribution of difference scores must be normal
Assuming its coming from a normally distributed population
Ways to get around it/violate
Sample is 30 or higher
Advantages of paired samples over independent samples
Paired samples t-test are more powerful
More likely to reject null if effect exists
Need less people
Need at least 60 (30 per group) for independent, 30 for paired (when looking at it longitudinally– time 1 and time 2)
More flexible
Can look at change over time and matched pairs
Can never assess change over time with independent sample
More powerful– individual differences are controlled for
Using the same people/matched pairs = less “noise” in data
Adjust the standard error calculation to account for this
Independent– two separate groups with two separate standard errors
Based on raw score
Paired samples– standard error is based on difference scores from the same people
If you can, always choose paired over independent samples
Measuring effect size in paired samples t test
Effect size for the paired-samples t is measured in the same way that we measured effect size for the one-sample t
Can use cohen’s d
Hypothesis test for the paired-samples t test
Step one– State null and alternative hypothesis
Step two– Identify cutoff regions
.05
Step three- Compute the t statistic
In this case, its paired
Step four– Make decisions on the null, compute the p value, and interpret the result
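A sketch of these steps with hypothetical before/after scores– the paired test is equivalent to a one-sample t-test on the difference scores against zero:

```python
import numpy as np
from scipy import stats

# Invented anxiety scores for the same 10 people before and after treatment
before = np.array([24, 30, 27, 22, 29, 31, 25, 28, 26, 30])
after  = np.array([20, 27, 25, 21, 24, 28, 24, 25, 23, 27])

# Paired samples t-test
t_stat, p_val = stats.ttest_rel(before, after)

# Same result by hand: t-test of the difference scores against 0
diffs = before - after
t_manual = diffs.mean() / (diffs.std(ddof=1) / np.sqrt(len(diffs)))
print(f"t = {t_stat:.3f}, p = {p_val:.4f}")
```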
One-way Anova
Comparing 3 or more means
We want to determine whether the sample mean differences are so large that there must be true population mean differences
One-way vs. t test
One-way anova is more flexible
Can be used in situations where there are two or more means being compared
T-test are limited to situations where only exactly two means are being compared
Main differentiator
One-way is accommodating more than two groups
However, when there’s only two, normally just use independent t test
Anova terms
In anova, each independent variable is called a factor
Factor is generally a nominal or ordinal variable
Categories, ranked or unranked
Not dealing with continuous IV
Each factor has at least two levels (categories)
We divide observations into groups based on their level of the factor
The two one-way anova hypotheses
Null is still that there is no real difference between groups in the population
For the alternative, not all population means are equal to one another
At least one different mean in the population
Experimentwise type one error
If we have three groups why not just run three t tests?
Would need to run 3 t tests to examine differences between all groups
Want to keep experimentwise alpha level around .05
One way anova allows us to evaluate all the mean differences in a single hypothesis test using a single alpha level
Keeps the risk of type one error under control no matter how many different means are being compared
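The experimentwise inflation is easy to see numerically– with independent tests at alpha = .05 each, the chance of at least one Type I error is 1 - (1 - alpha)^c:

```python
# With alpha = .05 per test, the chance of at least one Type I error
# across c independent comparisons is 1 - (1 - alpha) ** c
alpha = 0.05
for comparisons in (1, 3, 6, 10):
    experimentwise = 1 - (1 - alpha) ** comparisons
    print(comparisons, round(experimentwise, 3))
```

Three tests already push the experimentwise rate to about .14, nearly triple the nominal .05.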
Logic of one-way anova
Goal– estimate true mean differences in the population
Problem– we don’t have two groups anymore
Cannot just use the mean differences– there are now at least three to consider
Solution– use the variance between the groups instead
Compare this systematic between-group variance to variance that is randomly occurring within the groups
Process to calculate one-way anova
First calculate the total variability for the entire dataset
Separate the total variability for the entire set of data into two basic components
Within treatment variability– error
Between treatment variability– treatment effect
We want to compare the amount of between-treatment variability to the amount of within treatment variability
Within group variance– one way anova
Size of differences in the scores you are seeing inside of each of the three groups
Even if the within group variances differ, they are pooled across all groups in the anova
Random sampling error
Because individuals receive same treatment or are in same group, you did not cause individual differences, they just happened to occur
This is a measurement of the random sampling error in our data
Between group variance– one way anova
how individual differences are accounted for by the fact they receive three diff treatments or are coming from three diff groups
Measures the size of the differences between the three groups means
Want them to be large
Showed that different treatments made a difference
Variance across all group means is the between group variability
Considered a measure of the treatment effect and random sampling error
Between-treatments variance gives us information about the size of the group differences
Sources of between-treatments variability
Logically the mean differences can be caused by two sources–
Treatment effects
If treatments have different effects, could cause the mean outcomes score for one treatment to differ from the mean outcome score for another treatment
Sampling error
Even if there is no treatment effect, you would still expect some differences between samples
F statistic
test statistics for one-way anova
Technically a ratio
We divide the between treatment variance by the within treatment variance
Bigger f ratio, more likely to reject the null hypothesis
Large value indicates treatment effect is large
Treatment effect + sampling error divided by sampling error
When null is true and there’s no treatment effect, the f-ratio is balanced
Treatment effect is 0, so it’s basically just sampling error over sampling error
Comes out to an f-ratio of about 1
F-ratio is always positive
Large treatment effect– treatment effect in numerator will not be equal to 0 (could be 1, 2, 3, etc but the higher the better)
F-ratio will be larger than 1
Want to see ratio larger than 1 so we can assume treatment had an effect
Is it high enough to be rare?
F-ratio has the same structure as the independent samples t statistic
Sampling distribution for anova
distribution of f-ratios
F-ratio– between group variance / within group variance
Within– error
Between– error + treatment
sampling distribution for anova consequences
Sampling distribution will pile up around 1 if null is true
“Family” of distributions, not just one
Shape depends on both degrees of freedom
The higher the total df, the more closely the possible f-ratios under the null will pile around 1
Will look skinnier
Anova assumptions
Observations are independent of one another
Homogeneity of variance– populations from which samples came from all have the same variance
Usually violated, but if you have equal sample sizes it won’t matter much
Populations distributions are normal
If you have large sample sizes (more than 30) it won’t matter
Measuring effect size for anova
Can’t use cohen’s d anymore
Can’t take a single mean difference among three groups, because we’re dealing with variance
We use eta squared (η², looks like an n squared)
What % of total variability is accounted for by treatment variability
.01- .05– small
.06-.13– medium
.14 or higher is large
Anova hypothesis testing steps
- State null and alternative
- Set alpha and locate the critical f statistic value (.05)
- Compute the sample f statistic
- Make a decision/ analyze
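A sketch of steps three and four with invented data for three groups– scipy’s f_oneway gives the omnibus F, and eta squared comes from the sums of squares:

```python
import numpy as np
from scipy import stats

# Hypothetical outcome scores under three treatments
g1 = np.array([4, 5, 6, 5, 4])
g2 = np.array([7, 8, 6, 7, 8])
g3 = np.array([9, 8, 10, 9, 10])
groups = [g1, g2, g3]

# Omnibus one-way ANOVA
f_stat, p_val = stats.f_oneway(g1, g2, g3)

# Eta squared = SS_between / SS_total (proportion of total
# variability accounted for by the treatment)
all_scores = np.concatenate(groups)
grand_mean = all_scores.mean()
ss_total = ((all_scores - grand_mean) ** 2).sum()
ss_between = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)
eta_sq = ss_between / ss_total
print(f"F = {f_stat:.2f}, p = {p_val:.4f}, eta^2 = {eta_sq:.2f}")
```

With these made-up numbers eta squared comes out around .85, which would count as large by the guidelines above.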
One way anova and post-hoc tests
Technically called an omnibus test
The null hypothesis tests the equality of several means at the same time
Keeps type 1 error under control
Pairwise comparisons
To fully understand our anova results, we need to follow up a significant anova by comparing all possible pairs of means against each other
Running all possible pairs and comparing them
Controlling for type one error by only doing it once
Post hoc test
Running them in response to finding statistically significant results
When we test all possible pairs of means for differences, we call them post hoc
Post hoc– “after the fact”
These are differences we didn’t predict/hypothesize
Tukey’s honestly significant difference
Uses the information from the omnibus test directly to set the standard
Asking how big the difference in means needs to be in order to be honestly significant
Drawbacks
Only works when you have even sample sizes
Fairly liberal– does not control type 1 error rate as well as other options
Still better than running 3 tests, but not usually the post-hoc test you would choose
Scheffé test
Calculates a separate MSbetween (between treatment variance) for every pair of groups
The f between two groups must exceed the critical f value from the omnibus test to be significant
The mean difference has to be larger than what the omnibus test required
Makes it more conservative (a strict cut-off value)– what you want for a post-hoc test
Called anova because we use variance instead of mean difference and because we break variance down into different parts
Scheffé test benefits
Works in situations with uneven groups sizes
More conservative– requires larger group differences for significance
Does the best in terms of controlling type 1 error
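A rough sketch of the Scheffé logic described above, with made-up data– an illustration of the criterion, not a validated implementation:

```python
import numpy as np
from scipy import stats

def scheffe_pairwise(groups, alpha=0.05):
    """Sketch of the Scheffe post-hoc logic: for each pair of groups,
    compute an F using the full ANOVA's MSwithin and k - 1 numerator df,
    then compare it to the critical F from the omnibus test."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    # MSwithin pooled across all k groups (error term for every comparison)
    ss_within = sum(((g - g.mean()) ** 2).sum() for g in groups)
    ms_within = ss_within / (n_total - k)
    # Criterion: the omnibus critical F with (k - 1, N - k) df
    f_crit = stats.f.ppf(1 - alpha, k - 1, n_total - k)
    results = {}
    for i in range(k):
        for j in range(i + 1, k):
            a, b = groups[i], groups[j]
            # Between-groups SS for just this pair, divided by k - 1
            pair_grand = np.concatenate([a, b]).mean()
            ss_b = (len(a) * (a.mean() - pair_grand) ** 2
                    + len(b) * (b.mean() - pair_grand) ** 2)
            f_pair = (ss_b / (k - 1)) / ms_within
            results[(i, j)] = (f_pair, f_pair > f_crit)
    return results

# Invented scores for three groups
g1 = np.array([4, 5, 6, 5, 4])
g2 = np.array([7, 8, 6, 7, 8])
g3 = np.array([9, 8, 10, 9, 10])
for pair, (f_pair, sig) in scheffe_pairwise([g1, g2, g3]).items():
    print(pair, round(f_pair, 2), "significant" if sig else "ns")
```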
Factorial anova
Goal– further breaking down the variance
Two main types
Main effects– always at least 2
Always have at least two factors (independent variables)
Interaction effects– always one that you can probe with post-hoc comparisons
factorial anova key cont
All types of mean differences are statistically independent of one another
If you have a significant interaction effect, there’s no guarantee there will be a significant main effect
Vice versa
How two independent variables (factors) depend on each other to influence the outcome/dependent variable
Are they constant or change depending on one another?
Theoretical framing
expect effects of one factor to depend on the effects of the other factor
Key aspects of a question that is suitable for a factorial anova
Two or more factors that are categorical
One dependent variable that is numeric
how does the presence of snow (no snow vs. snow) and age group (children vs adults) affect levels of Christmas spirit during the holidays?
The structure of a two factor experiment is usually presented as a matrix
Each factor needs to be categorical
Nominal, sometimes ordinal variable
Makes four diff groups (cells)
Creates 3 effects to test (2 main effects and 1 interaction)
2 x 2 factorial design
Main effects
Mean differences between levels of each factor
Always have one main effect per factor
2 x 2 design– 2 main effects are of interest
Our example– skill level main effect and audience main effect
Interaction effects
Typically more interested in this
Does the effect of one variable depend on the effect of the other variable
You always only have one interaction effect
Our example– does the effect of skill level depend on whether an audience is present
Depend should always be in the sentence
Rules for interpreting two factor anovas
Use either
Matrix of cell means
Plot of interaction
Should be able to get main or interaction effects from either
When there is no interaction, interpret main effects
When there is an interaction, typically avoid interpreting the main effect
Misleading because their effects depend on the interaction
Testing main effect of factor a
Looking for differences between means in the margins
Average difference of single factor
Going across the columns
Testing factor b
Same process but want to see the avg effects of level one of factor b differing from the avg effect of level two of factor b
Going down the columns
Still marginal means
Testing interaction effects
Test differences among the cell means
Key is relationships– do relationships differ from one another
Can look at relationship going down or across columns– both equally as valid
Inside the matrix– no longer looking at marginal means
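A small numeric illustration with a hypothetical 2x2 matrix of cell means– marginal means show the main effects, and unequal simple effects signal an interaction:

```python
import numpy as np

# Hypothetical 2x2 matrix of cell means: rows = factor A (skill level),
# columns = factor B (audience absent vs. present)
cell_means = np.array([[10.0, 8.0],    # low skill
                       [12.0, 16.0]])  # high skill

# Main effects: compare the marginal (row/column) means
a_margins = cell_means.mean(axis=1)   # one mean per level of factor A
b_margins = cell_means.mean(axis=0)   # one mean per level of factor B
print("A margins:", a_margins)
print("B margins:", b_margins)

# Interaction: does the effect of B differ across levels of A?
b_effect_at_a1 = cell_means[0, 1] - cell_means[0, 0]
b_effect_at_a2 = cell_means[1, 1] - cell_means[1, 0]
print("interaction present:", b_effect_at_a1 != b_effect_at_a2)
```

Here the audience effect is -2 at low skill but +4 at high skill– the effect of B depends on the level of A.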
How to interpret interactions
Do the lines cross
If they do, it’s a disordinal interaction
If they don’t, it’s an ordinal interaction
Are the lines separated
how to interpret interactions
If they are, it’s consistent with a main effect of IV #2 (the variable that defines the lines)
does the line slope
how to interpret interactions
If the lines generally slope up or down, it’s consistent with a main effect of IV #1 (the variable on the horizontal axis)
3 null hypotheses
2x2 factorial anova
Main effect of factor a– mu a1 = mu a2
Main effect of factor b– mu b1 = mu b2
Interaction effect– there is no interaction effect between factors a and b
3 alternative hypotheses
2x2 factorial anova
Main effect of factor a– mu a1 does not = mu a2
Main effect of factor b– mu b1 does not = mu b2
Interaction effect– there is an interaction effect between factors a and b
F ratio in the two factor anova
Each of the three hypothesis tests in a two factor anova will have its own f ratio
Main difference– you have three f ratios now, using the between-treatments variance for factor a, factor b, and the interaction
Effect size in factorial anova
Partial eta squared– tells us the amount of variance in the dependent variable that can be accounted for by the effect of interest
Compute three separate partial eta squared
Factor a main effect
Factor b main effect
Interaction effect
Uses between treatment sum of squares for numerator as well as within treatments sum of squares for denominator
Sampling distribution for a one sample t test
distribution of means
Sampling distribution for Independent samples
distribution of differences between the means
Sampling distribution for Paired samples
distribution of means of difference scores between the pairs
Sampling distribution of One way and factorial anova
distribution of f ratios
What does an f ratio of 1.00 indicate
no systematic difference
Fail to reject the null hypothesis