Midterm 2 Flashcards
Signal Detection theory:
- hit
- miss
- false alarm
- correct reject
hit: correct answer >> when signal is present and decision is yes
miss: wrong answer >> signal is present and decision is no
false alarm: wrong answer >> signal is absent and decision is yes
correct reject: correct answer >> signal is absent and decision is no
internal response
variable/value that forms basis of observer’s decision (x axis)
criterion on a signal present/absent graph
- false alarm? correct reject? hit? miss?
- left of the criterion line is no, right of the criterion line is yes
- any area under the signal absent curve (left) that is after the line is a false alarm, and any area before it is a correct reject
- any area under the signal present curve (right) that is after the line is a hit, and any area before it is a miss.
accuracy equation
accuracy = (#present * %hits + #absent * %CR) / total
- need present/absent percentages/numbers in order to calculate accuracy.
- total = #present + #absent
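A minimal sketch of this accuracy calculation in Python (the counts and rates below are made up, not course data):

    # hypothetical example: 20 signal-present and 80 signal-absent trials
    n_present, n_absent = 20, 80
    hit_rate, cr_rate = 0.90, 0.85          # assumed %hits and %CR

    total = n_present + n_absent
    accuracy = (n_present * hit_rate + n_absent * cr_rate) / total
    print(accuracy)                          # 0.86 >> proportion correct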
how to increase accuracy (2)
- how good is your accuracy? comparison
- information acquisition: increases correct responses (hits and CRs)
- criterion change: leads to trade off btwn hits and CRs
- if your accuracy is worse than what would occur by chance, it is very poor accuracy
Why could peak accuracy be greater in a 20% present, 80% absent case compared to a 50/50 case?
- since it’s 80% absent, it is good to maximize correct rejects, so you would move the criterion more to the right (more conservative >> say no more often >> maximizing tumor-absent correctness). Since it’s only 20% present, chances of misses are low.
- for 50/50, moving it in either direction would have trade-offs
why would you change the criterion? (3)
- when maximizing accuracy >> difference in signal present/absent
- special case: when 50/50 present/absent >> optimal criterion = where graphs intersect
- when optimizing a parameter other than accuracy (eg. cost) >> balance (where they intersect) between cost of FAs vs cost of misses to minimize total cost
calculating total cost (money wasted by incorrect responses)
total cost = #present * %miss * miss cost + #absent * %FA * FA cost
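A sketch of the total-cost formula with assumed (hypothetical) costs and error rates:

    # a miss costs $1000, a false alarm costs $100 (made-up numbers)
    n_present, n_absent = 20, 80
    miss_rate, fa_rate = 0.10, 0.15
    miss_cost, fa_cost = 1000, 100

    total_cost = n_present * miss_rate * miss_cost + n_absent * fa_rate * fa_cost
    print(total_cost)                        # 2000 + 1200 = 3200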
Discriminability + reducing errors (2)
- being able to distinguish btwn stimuli >> errors due to overlap
2 ways to decrease the overlap
- increase separation
- reduce spread
Cohen’s d (d’)
- what does it represent?
- how to increase?
- what if you don’t have sigma?
- worst case?
- represents magnitude of effect of IV on DV (interval/ratio); expressed in units of SD
d’ = separation/spread = (u2-u1)/sigma
- if you don’t have sigma you can use the pooled SD: sqrt((SD1^2 + SD2^2)/2)
- inc d’ by increasing separation or decreasing spread
worst case scenario: d’ = 0 >> no separation = no information
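A sketch of d’ with the pooled-SD fallback (the group means and SDs are made up):

    import math

    mean1, sd1 = 100.0, 15.0                       # hypothetical group summaries
    mean2, sd2 = 112.0, 18.0

    pooled_sd = math.sqrt((sd1**2 + sd2**2) / 2)   # used when sigma is unknown
    d_prime = (mean2 - mean1) / pooled_sd          # separation / spread
    print(round(d_prime, 2))                       # ~0.72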
parameter vs statistic
parameter: true value of quantity in popn
statistic: value of the same quantity based on a sample (statistic used to estimate parameter)
u vs M
- accuracy or precision?
u = population mean
M = sample mean >> unbiased estimator of u
- unbiased = accuracy, not precision
sigma^2 vs s^2
- why squared?
- SD relation?
sigma^2 = popn variance
s^2 = sample variance >> unbiased estimator of sigma^2
- s means standard deviation (SD = sqrt of variance)
- SD (s) >> unbiased estimator of sigma
Gaussian Distribution
- characteristics
- probability density + total area under curve
Characteristics:
- normal distribution/bell curve
- typically used for weight/height/IQ scores/exam scores
- unimodal
- symmetric
- goes from -inf to inf (No max/min)
- probability of any single value is zero if it is a probability density graph (probability = area under the curve)
- total area under the curve = 1
Gaussian Distribution:
- 1 SD, 2 SD, 3 SD >> chance of value occurring?
- SDT warning!
within:
1 SD: ~68%
2 SD: ~95%
3 SD: ~99.7%
- when calculating SDT, the percentages exclude the tails >> beware!
- sampling distribution is based on the assumption that H0 is TRUE!
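The within-1/2/3-SD percentages can be checked with the normal CDF (assuming scipy is available):

    from scipy.stats import norm

    for k in (1, 2, 3):
        # area between -k and +k standard deviations
        print(k, round(norm.cdf(k) - norm.cdf(-k), 4))
    # 1 0.6827, 2 0.9545, 3 0.9973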
Uniform Distribution
- each event is equally likely (eg. throwing a fair die = 1/6 probability) >> discrete
Poisson Distribution
- few events vs many events
- usually positively skewed
- used when random events occur at a certain rate over a fixed time period (eg. hourly # of customers at a bank)
- if expecting few events, it will be positively skewed
- if there are more events, distribution will become more symmetric
z-score
- difference of a score from the mean as a proportion of the SD (with respect to the population!) >> units of SD (basically, how many SD units away from the mean you are)
- if you are ranking one score within the popn: z = (x - u)/sigma >> similar to d’
- if you are finding the sampling distribution of the sample mean: z = (Xavg - u)/(sigma/sqrt(n))
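A sketch of both z-score formulas, using assumed population values:

    import math

    mu, sigma = 100.0, 15.0                 # assumed population mean and SD

    # ranking one score within the popn
    x = 130.0
    z_single = (x - mu) / sigma              # 2.0 >> 2 SDs above the mean

    # sampling distribution of the sample mean
    x_bar, n = 104.0, 36
    z_mean = (x_bar - mu) / (sigma / math.sqrt(n))   # 4 / 2.5 = 1.6
    print(z_single, z_mean)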
“the standard normal”
distribution of z scores
- M = 0 and SD = 1 (the t distribution is also centered at 0, but its SD is slightly above 1 for small samples)
percentile rank
- how is it similar to z-score?
- what if it’s gaussian?
the percentage of measurements in the distribution that fall below that score value (eg. a score in the 99th percentile = 99% of all scores are below)
- both z-score and percentile rank look at relative standing
- if it’s gaussian we can calculate based on z-score (eg. z = 1 >> one SD above the mean >> start at the 50th percentile and add 34 >> 84th percentile)
how to calculate percentile from standard normal distribution table
- what if z score is negative?
- what does percentile represent?
- find first 2 digits in the first column, find third digit in the first row
- if z score is negative, do 1 - (positive percentile)
- percentile represents area before (left of) the z-score
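Instead of the printed table, the same lookup can be sketched with norm.cdf, which already returns the area to the left (so negative z needs no extra step):

    from scipy.stats import norm

    print(round(norm.cdf(1.0) * 100, 1))    # 84.1 >> z = +1 is ~84th percentile
    print(round(norm.cdf(-1.0) * 100, 1))   # 15.9 >> same as 1 - 0.841 from the table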
Sir Francis Galton: CLT
- rule of thumb
central limit theorem: if x is the sum of independent, identically distributed variables (eg. uniform), with a non-zero SD, then the distribution of x will approach gaussian
- rule of thumb: mean distribution will be gaussian if n>30
- even if the original data is skewed, the avg will be gaussian
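A small simulation sketch of the CLT (the settings are my own): sample means of a skewed exponential variable look roughly gaussian once n > 30.

    import numpy as np

    rng = np.random.default_rng(0)
    n, reps = 40, 10_000                     # n > 30 rule of thumb

    # skewed raw data: exponential with mean 1
    means = rng.exponential(scale=1.0, size=(reps, n)).mean(axis=1)

    print(round(means.mean(), 3))            # ~1.0 (population mean)
    print(round(means.std(ddof=1), 3))       # ~1/sqrt(40) ~ 0.158 (SE)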
effect size
- small, med, large
describes relationship among variables in terms of size/amt/strength >> descriptive >> shows extent to which results are meaningful
effect size based on Cohen’s d:
small: 0.2
med: 0.5
large: 0.8
purposes of inferential stats (2)
parameter estimation: estimate value of population parameter based on random sample
hypothesis testing: whether effect occurred by chance or not (probability)
sampling distribution of Xi vs Xavg
- which has smaller SD?
Xi ~ N(u, sigma); Xavg ~ N(u, sigma/sqrt(n))
- N = normal (gaussian) distribution
- u is always the same (population)
- Xavg has smaller SD because it is divided by sqrt(n) >> gets smaller as sample size increases
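A quick numeric check of the smaller SD of Xavg, with an assumed sigma and n:

    import math

    sigma, n = 10.0, 25
    sd_single = sigma                        # SD of one observation Xi
    sd_mean = sigma / math.sqrt(n)           # SD (standard error) of Xavg
    print(sd_single, sd_mean)                # 10.0 vs 2.0 >> shrinks as n grows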
confidence intervals
- inc n? sample mean?
- 95%?
- formula?
as n increases, our confidence in the sample mean increases (the CI gets narrower)
- sample mean gets closer to popn mean w increased sample size
- typically the CI is 95% >> a randomly drawn sample mean has a 95% chance of landing within ~2 SE of u, so centering that same interval on the value you got means there is a 95% chance that u is within ~2 SE of that value = CI
- CI = Xavg +/- z*sigma/sqrt(n)
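A sketch of the z-based CI, assuming sigma is known (the numbers are made up):

    import math
    from scipy.stats import norm

    x_bar, sigma, n = 52.0, 8.0, 64           # assumed sample mean, known sigma, n
    z = norm.ppf(0.975)                       # ~1.96 for a 95% CI

    half_width = z * sigma / math.sqrt(n)     # ~1.96 * 8/8
    print(x_bar - half_width, x_bar + half_width)   # ~ (50.04, 53.96)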
finding z-score from CI
- 95% vs 99% CI z score difference?
- pick CI (eg. 95%)
- add in one of the tails (95% covers the middle, so you need the cumulative area up to the start of the upper tail): 1 - 0.95 = 0.05 (both tails) >> one tail = 0.025
- find 0.975 on the z-score table >> find the z score
- the more confidence you want (eg. 95 vs 99%) the bigger your z score will be
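The table lookup above is just an inverse-CDF step; a sketch (assuming scipy):

    from scipy.stats import norm

    for conf in (0.95, 0.99):
        tail = (1 - conf) / 2                # split the leftover area across both tails
        print(conf, round(norm.ppf(1 - tail), 3))   # 0.95 -> 1.96, 0.99 -> 2.576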
student’s t distribution
- when do you use t?
- is z or t wider?
- effect of sample size?
- df?
- CI?
- looks similar to the gaussian but has heavier tails (not gaussian!)
- used when you don’t have the population SD (sigma), and you only have a sample SD
- in general, t will have a wider distribution because it is less exact >> more uncertainty
- unlike z, t distribution depends on sample size: as you increase n, t gets closer to gaussian (approximates normal distribution)
- defined by degrees of freedom (n-1)
CI = Xavg +/- t*s/sqrt(n)
s/sqrt(n) is standard error of the sample mean!
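A sketch of the t-based CI when only the sample SD is available (the data values are made up):

    import numpy as np
    from scipy import stats

    data = np.array([4.1, 5.0, 5.6, 4.8, 5.3, 4.4, 5.9, 4.7])   # hypothetical sample
    x_bar = data.mean()
    se = data.std(ddof=1) / np.sqrt(len(data))      # s / sqrt(n) = standard error
    t_crit = stats.t.ppf(0.975, df=len(data) - 1)   # two-tailed 95%, df = n - 1

    print(x_bar - t_crit * se, x_bar + t_crit * se)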
finding t score from CI
- what does increasing sample size do?
- the t table is indexed by a single tail’s probability, while the z table gives everything other than one tail (the cumulative area) >> find the value of one tail based on the CI (eg. 95% >> one tail = 0.025)
- find degrees of freedom (n-1)
- find corresponding t value
- increasing df (inc sample size) will decrease the t value >> approximates the gaussian (eg. as df approaches inf, t approaches 1.96, which is the z score for a gaussian 95% CI)
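A quick check that tcrit shrinks toward the gaussian 1.96 as df grows:

    from scipy import stats

    for df in (5, 30, 1000):
        print(df, round(stats.t.ppf(0.975, df), 3))   # 2.571, 2.042, 1.962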
error bars
- narrow vs wide?
- common error bars?
narrow error bar = increased confidence
wide error bar = lots of noise, less confidence
Common error bars:
- range
- SD >> will not change, no matter the size of the sample, if it represents the population SD
- SE >> most common in BNS (SD/sqrt(n)) >> decreases as n increases
- CI >> recommended >> like a stat test >> Xavg +/- t*SE >> based on t and SE, so increasing sample size, which affects both of these, will decrease the CI by a lot.
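A sketch computing these common error bars for one made-up sample:

    import numpy as np
    from scipy import stats

    data = np.array([12.0, 15.5, 9.8, 14.2, 13.1, 11.7, 16.0, 10.9])  # hypothetical
    sd = data.std(ddof=1)
    se = sd / np.sqrt(len(data))
    ci_half = stats.t.ppf(0.975, df=len(data) - 1) * se   # 95% CI half-width

    print(data.max() - data.min(), sd, se, ci_half)       # range, SD, SE, CI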
H0 vs H1
H0: null hypothesis
H1: research hypothesis
p > a vs. p < a
p > a: retain H0 >> fail to reject >> results are not significant
p < a: reject H0 >> results are significant
probability density graph for p value vs alpha level
- if the value (observation) is greater than the criterion, reject H0 >> greater obs value = less area under the curve beyond it = smaller p value
- area under the curve beyond the observation line = p value
- area under the curve beyond the criterion = alpha level (typically 0.05 >> cannot be greater/more liberal)
- criterion will change based on alpha
Standard error
- t value?
- if you used the sample SD and not the population SD, you will have a t distribution >> SE will be the standard dev of the sampling distribution of the sample mean Xavg
- since the Xavg dist is gaussian, SE can be used to approx a CI (eg. +/- 2 SE >> ~95% CI)
- t value = (Xavg - u)/SE
- find the t value of the data (obs) and compare to tcrit from the table
- if t is bigger than tcrit then you reject H0
- the t distribution is for the null hypothesis >> a big t is unlikely, therefore we would reject H0
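A sketch of the one-sample t computation, by hand and with scipy's ttest_1samp (the data and H0 mean are made up):

    import numpy as np
    from scipy import stats

    data = np.array([101, 106, 99, 110, 103, 108, 97, 105], dtype=float)  # hypothetical
    mu0 = 100.0                                    # H0 value

    se = data.std(ddof=1) / np.sqrt(len(data))
    t_obs = (data.mean() - mu0) / se               # (Xavg - u)/SE

    t_scipy, p = stats.ttest_1samp(data, mu0)      # two-tailed by default
    print(round(t_obs, 3), round(t_scipy, 3), round(p, 3))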
one sample t test:
- 1 tail
- 2 tail
- is it harder to reject 1 tail or 2 tail?
1 tail: directionality specified, all of probability a is in one tail only
two tails: directionality not specified (can be bigger or smaller than value you are comparing to); must divide a by 2 because it can be either of the tails
- harder to reject 2 tails because there is possibility for error on either side, so tcrit (dependent on alpha) will be further from the mean; less likely to get data that is further from the mean
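A sketch of why the two-tailed test is harder to reject: same alpha, larger tcrit (the alpha and df below are assumed):

    from scipy import stats

    alpha, df = 0.05, 15
    t_one = stats.t.ppf(1 - alpha, df)             # one-tailed cutoff, ~1.753
    t_two = stats.t.ppf(1 - alpha / 2, df)         # two-tailed cutoff, ~2.131
    print(round(t_one, 3), round(t_two, 3))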
95% CI from t value (2 tails)
- if you’re finding a 95% CI based on t, you must use the NON-DIRECTIONAL t value because it takes both tails into account
- a 2 tailed one sample t test is equivalent to asking if a value is within the 95% CI
- CI = Xavg +/- t*SE >> calculate the interval
does failing to reject the H0 prove it’s correct?
NO. It only shows that the data is consistent w the H0. Study could just be underpowered and not show the effect because it couldn’t detect the difference
power
- correct decisions (2)
- type 1 error
- type 2 error
probability of making a correct decision of rejecting an incorrect H0 (assuming it is actually false)
- correct decision:
- H0 is incorrect and you reject H0 (hit)
- H0 is correct and you retain H0 (correct reject)
- type 1 error (a): H0 is correct and you reject H0 (false alarm); occurs when you get a large t value just by chance
- type 2 error (B): H0 is incorrect and you retain H0 (miss)
- B depends on alpha level (if a is too low it increases chance of type 2 error >> miss), sample size (inc sample size dec chance of type 2), and effect size (a small effect would go undetected >> inc chance of type 2 error >> depends on cohen’s d)
(reject H0 = yes, retain H0 = no; H0 incorrect = signal present, H0 correct = signal absent)
- Power = 1-B
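A simulation sketch of power = 1 - B (all settings are my own, not course numbers): draw many samples where H0 is actually false and count how often the two-tailed t test rejects.

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(1)
    mu0, mu_true, sigma, n, reps = 0.0, 0.5, 1.0, 30, 5000   # true effect d = 0.5

    rejections = 0
    for _ in range(reps):
        sample = rng.normal(mu_true, sigma, size=n)
        _, p = stats.ttest_1samp(sample, mu0)
        rejections += p < 0.05

    print(rejections / reps)     # estimated power, roughly 0.75 for these settings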
How can we increase power? (3)
- increase P(type 1 error) >> would increase the false alarm rate, and misses would go down
- not helpful: type 1 error (a) cannot exceed 5%
- increase the separation between the sample mean and the population mean (H0 value)
- not helpful: this is not possible; you can’t change the data
- increase the sample size
- this is helpful! increasing n will decrease SE (spread) >> tcrit is moved closer to the population mean >> easier to reject the H0 >> increases power
*even if you reject H0, you are not proving H1 correct >> other hypotheses that align w the data could also be correct
Choice of statistical test
- scale of measurement: ratio/interval or categorical
- # of groups/levels of IV: 1, 2, 3+
- experimental design: within or between subjects
- scale of measurement
1. ratio/interval: t test/ANOVA
2. categorical: chi-square
- # of groups/levels of IV
- 1 group = one sample t test
- 2 groups = 2 sample t test
- 3+ groups = ANOVA
- experimental design
1. within sub: paired t test, repeated measures ANOVA
2. between sub: 2 sample t test, one way ANOVA
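A hedged lookup sketch of this decision chart (the function name and string labels are my own, not from the course):

    def choose_test(scale, n_groups, design):
        # scale: 'interval/ratio' or 'categorical'; design: 'within' or 'between'
        if scale == "categorical":
            return "chi-square"
        if n_groups == 1:
            return "one sample t test"
        if n_groups == 2:
            return "paired t test" if design == "within" else "2 sample t test"
        return "repeated measures ANOVA" if design == "within" else "one way ANOVA"

    print(choose_test("interval/ratio", 2, "between"))   # 2 sample t test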