Week 2: Hypothesis Testing and Its Implications Flashcards

Question

The standard deviation of sample means is known as the

Answer 1

SEM (standard error of the mean)

Answer 2

calculate boundaries and range of values within which we believe the true value of the population mean value will fall. Such boundaries are called confidence intervals.

Answer 3

these intervals (created by samples) will contain the population mean

Answer 4

95 of these samples, the confidence intervals we constructed would contain the true value of the mean in the population.

Answer 5

0 (it does not contain it) or 1 (it does contain it). You have no way of knowing which it is.

Answer 6

* Dots show the means for each sample * Lines sticking out representing Ci for the sample means * If there was a vertical line down it represents population mean * If confidence intervals don't overlap then it shows significant difference between the sample means

Answer 7

would be –1.96 and +1.96 -

Answer 8

-1.96 and 1.96

Answer 9

very different from the true mean, indicating that it is a bad representation of the population

Answer 10

smaller - make sense as more we measure more certain sample mean close to population mean

Answer 11

Know most scores remain at z = 1.96 (upper bound) and z = -1.96 (lower bound) LB = (-1.96* SD of sample) + mean sample UB = (+1.96* SD of sample) + mean sample

Answer 12

LB = Mean - (1.96 * SEM) UB = Mean + (1.96 * SEM)

Answer 13

*** M - 530 * N = 10 * SEM = 100/ square root of 10 = 31.62** * Value of z for 95% CI is number of SD one must go from mean (in both directions) to contain 0.95 of the scores * Value of 1.96 was found in z-table * Since each tail is to contain 0.025 of the scores, you find the values of z for which is 1-0.025 = 0.975 of the socres below * 95% of z scores lie between -1.96 and +1.96 *** Lower limit = 530 - (1.96) (31.62) = 468.02 * Upper limit = 530 + (1.96)(31.62) = 591.98**

Answer 14

no effect of the predictor variable on the outcome variable

Answer 15

the predictor variable on the outcome variable

Answer 16

getting that model (Data) if the Null hypothesis H0 were true (Statistical significance)

Answer 17

we conclude the model fits the data well (explains a lot of the variance) and we gain confidence in the alternative hypothesis H1

Answer 18

1. specify the null hypothesis H0 and the alternative hypothesis H1 2. select a significance level. Typically the 0.05 or the 0.01 level. 3. calculate a statistic analogous to the parameter specified by the null hypothesis. (e.g. if null defined by parameter μ1- μ2 (diff between two means) then the statistic is M1-M2 (difference between sample means)) 4. calculate the probability value of obtaining a statistic (statistic computed from the data) as different or more different from the parameter specified in the null hypothesis (often 0 or based on past evid and mean stay same) 5. probability value computed in Step 4 is compared with the significance level chosen in Step 2. 6. If the outcome is statistically significant, then the null hypothesis is rejected in favor of the alternative hypothesis.

Answer 19

signal/noise

Answer 20

probability of obtaining a certain value or p value.

Answer 21

systematic variation against unsystematic

Answer 22

the null hypothesis is rejected; When the null hypothesis is rejected, the outcome is said to be “statistically significant”

Answer 23

null hypothesis is not rejected.

Answer 24

%, 1% (p<0.05 OR p<0.01) or less probability of a test statistics happening by chance.

Answer 25

extreme results given that H0 is true

Answer 26

think the variance accounted for by the model is larger than the one unaccounted for by the model (i.e. there is a statistically significant effect but in reality there isn’t)

Answer 27

think there was too much variance unaccounted for by the model (i.e. there is no statistically significant effect but in reality there is)

Answer 28

fit of the model

Answer 29

population, when in fact there isn’t.

Answer 30

a-level of usually 0.05

Answer 31

population when, in reality, there is.

Answer 32

β-level (often 0.2)

Answer 33

the size of the an effect

Answer 34

Standardized = comparable across studies Not (as) reliant on the sample size Allows people to objectively evaluate the size of observed effect.

Answer 35

the effect explains 1% of the total variance.

Answer 36

the effect accounts for 9% of the total variance.

Answer 37

effect accounts for 25% of the variance

Answer 38

effect should be placed within the research context.

Answer 39

.8, or an 80% chance of detecting an effect if one genuinely exists.

Answer 40

it may be because we do not have enough statistical power

Answer 41

correctly rejecting a false H0 OR the ability of the test to find an effect assuming there is one in the population,

Answer 42

1 - β OR probability of making Type II error

Answer 43

your sample sizee

Answer 44

1. Probability of a type 1 error or a-level [level at which we decide effect is sig - p-value) --> bigger [more lenient] alpha then more power) 2. True alternate hypothesis H1 [effect size] (degree of overlap, less means more power) - if you find large effect in lit then better chance of detecting something 3. The sampel size [N]) --> bigger the sample, less the noise and more power 4. The particular tests to be employed - parametric tests greater power to detect sig effect since more sensitive

Answer 45

Sample size calculation at a desired level of power (usually power set to 0.8 in formula)

Answer 46

* Kolmogorov-Smirnov test * Shapiro-Wilks test

Answer 47

Just do the parametric tests

Answer 48

with respect to normality --> normality tests have limitations

Answer 49

* Calculate power of test * Calculate sample size necessary to detect an decent effect size and achieve a certain level of power based on past research

Answer 50

Type 1 error p = alpha Type II error p = beta Accepting null hypothesis which is correct - p = 1- alpha Accepting alternate hypo which is correct - p = 1 - beta

Answer 51

bigger difference means higher power and and correctly reject the null hypothesis than distributions that overlap more

Answer 52

This means that the overlap in distributions is smaller and the power is therefore greater, but this time because of a smaller standard error of our estimate of the means.

Answer 53

us how. We usually set the power to 0.8.

Answer 54

A measure of variability: The number of standard deviations from the population mean or a particular data point is Z-scores are a standardised measure, hence they ignore measurement units

Answer 55

Z-scores allow researchers to calculate the probability of a score occurring within a standard normal distribution Enables us to compare two scores that are from different samples (which may have different means and standard deviations)

Answer 56

Let’s say Trish takes a test and scores 25 and the mean is 20 You may calculate the z-score to be 1.25 you would use a z-score table to see what percentile they would be in (marked in red) so to read the table you would go down to the value 1.2 and you would go across to 0.05 which totals to 1.25 and you can see about 89.4% of other students performed worse.

Answer 57

We would use our table and look down the column to a z-score of 1 and across to the 0.00 column (in purple) and we can see 84.1% of students performed worse than Josh so Trish performed better than Josh.

Answer 58

68% of scores are within 1 SD of the mean, 95% are within 2 SDs and 99.7% are within 3 SDs.

Answer 59

: by taking into account the variability and size of our sample we can estimate how far away from the real population mean our mean is!

Answer 60

the 95% confidence interval range

Answer 61

95% confidence interval.

Answer 62

high statistical power

Answer 63

low statistical power

Answer 64

missing a real effect – Type II error)

Answer 65

No actual difference exists in the real world, all data comes from the same population

Answer 66

There is an actual difference and we found it!

Answer 67

reject null hypothesis

Answer 68

FALSE (or TRUE).

Answer 69

us finding an effect when the null hypothesis (H0) is true.

Answer 70

H0 is true

Answer 71

larger than the one we have found if there were no effect in the population (e.g. the null hypothesis were true)

Answer 72

gets further away from the range of test statistics predicted by the null hypothesis.

Answer 73

p = .049, p = .050 are essentially the same thing- the former is ‘statistically significant’. Importance is dependent upon the experimental design/aims: e.g., A statistically significant weight increase of 0.1Kg between two adults experimental groups may be less important than the same increase between two groups of babies.

Answer 74

A as one-tailed is directional and two tailed is non-direcitonal

Answer 75

A as If we’d collected 100 samples, calculated the mean and then calculated a confidence interval for that mean, then for 95 of these samples the confidence intervals we constructed would contain the true value of the mean in the population

Answer 76

A and just because test statistic is sig does not mean its important effect

Answer 77

A as If we use the conventional criterion then the probability of this error is .05 (or 5%) when there is no effect in the population

Answer 78

A The standard error (which is the standard deviation of the distribution of sample means), defined as σ_Χ ̅ =σ/√N, decreases as the sample size (N) increases and vice versa

Answer 79

A The null hypothesis is the opposite of the alternative hypothesis and so usually states that an effect is absent

Answer 80

A A Type II error would occur when we obtain a small test statistic (perhaps because there is a lot of natural variation between our samples)

Answer 81

A - To make sure our estimates of the parameters that define our model and significance tests are accurate we have to assume homoscedasticity (also known as homogeneity of variance)

Week 2: Hypothesis Testing and Its Implications Flashcards

(123 cards)