Module 6- Inferential Statistics Flashcards

1
Q

Inferential Statistics

A
  • what we do to make inferences about a population based on our sample
  • how we test hypotheses
  • make inferences from sample to population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Population

A
  • the larger group of all participants of interest to the researcher
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Sample

A
  • subset of the population
  • never represent the population perfectly due to sampling error
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sampling Error

A
  • natural variability you expect from one sample to another
  • not really an error
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Population Parameter

A
  • A descriptive Statistic (MCT , Variability measure)
  • computed from everyone in the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Sample Statistic

A
  • Descriptive Stat (ex, mean) computed from everyone in the sample
  • not a true representation of the population parameter bc of sampling error
  • an approximation of the population parameter
  • deviates away from the parameter bc of the sampling error
  • close to parameter but not perfect
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

if we had an infinite amount of samples

A
  • distributions of means of an infinite amount of samples would form a normal curve
  • even if each sample itself is skewed, the plot of means would be normally distributed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Central Limit theorem

A
  • if draw a large number of samples from a population at random, the means of those samples will make a normal distribution
  • can never draw an infinite amount of samples
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Sampling Distribution of the Mean

A
  • plot or distribution of means from different samples of the same population
  • makes a normal curve
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Law of Large Numbers

A
  • Larger the sample, the more the mean of each sample will approximate the mean of the population
  • larger the sample, less the mean is impacted by outliers
  • larger the sample, the smaller the SD of each sample and therefore each sample will have a mean similar in value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Characteristics of Sampling Distributions

A
  • approximate the population mean
  • approximately normal in shape
  • can answer probability qs about the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Standard deviation of the Sampling Distribution of the Mean

A
  • Standard Error of the Mean
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Standard Error of the Mean

A
  • defines the variation around the population mean (u “mu”)
  • percentage of data will fall within 1,2,3 standard error units from the mean
    68% of the sample means fall within -/+ 1 standard error units from the mean of the sampling distribution
    95% within -/+ 2 standard error units
    99% within -/+ 3 standard error units
  • like the standard deviation
  • difference due to chance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Can never obtain a Sampling Distribution of the Mean bc

A
  • can never collect an infinite amount number of samples
  • therefore, can’t collect the standard error of the mean
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Confidence Intervals

A
  • can estimate the standard error of the mean to calculate these
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

smaller the standard error

A

the smaller our confidence intervals will be
- want our standard error to be as small as possible
- want dis to be tall and skinny

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Influences to the size of the standard error of the mean

A
  • if variability of the variable is large within the population, then the standard error will be large
  • if the variability of the variable is small within the population, then the standard error will be less and ^ have a tall and skinny distribution
  • Law of large numbers; larger the sample size, the smaller the standard error due to less influence of outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Null Hypothesis

A
  • for hyp testing
  • no difference bw our sample and population mean (come from the same distribution)
  • no difference bw 2 group means bc they come from the same pop mean
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Reject Null Hypothesis

A
  • what we want
  • says the 2 groups are from 2 different population distributions
  • there is a difference bw groups
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Fail to reject null hypothesis

A
  • comes from not enough evidence
  • says 2 groups are from the same population distribution
  • no difference exists
  • say “fail to reject” bc can never prove anything true
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

test the null hypothesis by

A
  • Test statistic
  • test stat= observed difference/ difference due to change (standard error of the mean)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

big or small test stat?

A
  • want a big test statistic, so it can fall in the critical region of rejection (^ can reject the null hyp)
  • observed difference would be a large number (numerator)
  • want difference due to chance (denominator) to be very small
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Z- distribution

A
  • use to determine if our sample mean differs from the population mean
  • if our observed Z values fell into the extreme regions, which is defined by alpha values then we reject the null hyp
  • want a large z value to fall in the rejection region
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

what does a= 0.05 mean

A
  • means 5 out of 100 times we are making an error
  • 5% chance of incorrectly rejecting the null hypothesis when it is true; Type 1 Error
  • as we lower the alpha value we are likely to be more confident
  • results occur by chance less than 5/100 times
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

T-Test

A
  • examines whether 2 group means come from the same population or different populations
  • get more variable scores, but harder to see group differences
  • want a large t value; bigger numerator ( between group difference) and a small denominator (w/in group diff/ standard error)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

the further along the t and z values are in the distribution…

A
  • closer to the extreme ends
  • making it more unlikely to occur by chance
27
Q

Between group Variance

A
  • Treatment Effect
  • Numerator of the test stat
28
Q

Within group Variance

A
  • Denominator of the test stat
  • variability of scores within each group
29
Q

_______ is key to Inferential statistics

30
Q

Variation in the DV could be due to

A
  • chance/ error variance
  • variance due to the IV
  • confound variance
31
Q

Chance/ Error Variance

A
  • random variation in the DV that comes from individual differences
  • inherent and cannot be eliminated
  • bc random it does not impact the overall mean of the group
  • contributes to within group variance
    ex. ideally everyone in the noisy group would get the exact same score on the math test. However that is not realistic due to chance variance
32
Q

Systematic variation

A
  • influences the entire group mean
  • creates between group differences
    2 types; variance due to IV and confound variance
33
Q

Variance due to the IV

A
  • type of systematic variation
  • good kind of variance
  • what we want; this is the treatment effect
  • comes from the manipulation of the IV
    ex. manipulate the IV; noisy room vs quiet room. expect participants in the noisy room to make more math errors
34
Q

Confound Variance

A
  • type of systematic variance
  • creates between group differences
  • confounds; unintentional IVs
  • not what we want, very bad; lowers internal validity
  • acts like variance due to IV
    ex. if the IV was the type of room (noisy vs quiet), but one room was hotter than the other, this is a confound. It contributes variance in the math errors between groups, but not the source of variance we want
35
Q

F Ratio

A
  • used to statistically test if the IV causes changes in the DV
  • F= Between group variance/ Within group variance
    = (IV variance/ treatment effect+ Confound
    variance)/ Error(chance) variance
36
Q

We want F ratio to be

A
  • as big as possible
  • want numerator to be as big as possible to maximize the b/w group variance
  • denominator as small as possible to minimize w/in group variance
  • makes us more confident there is a group difference in the DV
    BUT numerator has to be big bc of the IV variance and not the confound variance **
37
Q

small F ratio

A
  • high within group variance
  • fail to reject the null hyp bc no b/w group variance
38
Q

F ratio= 1

A
  • IV did not impact the DV; no treatment effect
  • NULL HYP IS TRUE
  • no b/w group difference
  • all variation was due to chance variance
  • numerator and denominator are the same value
39
Q

F>1

A
  • reject the null hypothesis
  • see a difference in DV b/w groups
40
Q

how to increase b/w group variance in F ratio

A
  • increase the IV variance to make sure IV manipulations are causing a large effect size
41
Q

how to decrease w/in group variance in F ratio

A
  • cannot eliminate error variance
  • but since denominator is like the standard error of the mean, we can reduce it by increasing our sample size
42
Q

In each instance we are testing 3 hypotheses

A
  1. test the null hypothesis to see if our empirical observations are due to chance. If the F ratio is large, we have a large b/w variance and reject the null hyp
  2. bc we have b/w group variance, need to know if it is bc of IV variance or confounds
  3. after ruling out confounds and rejecting the null hyp, we can make inferences of the causal relationship of the IV on the DV
43
Q

alpha value/ level of significance

A
  • scientists feel comfortable at setting it 5% or lower
  • we are saying that 5/100 times we are making an error, but this is not one of those times
  • defines values that are unlikely due to chance
44
Q

Probability Value/ P- Value

A
  • for each test stat ( Z, T or F) we can calc a p value
45
Q

if p value< set alpha value

A
  • reject the null hypothesis and results are statistically significant
46
Q

if p value > set alpha value

A
  • fail to reject the null hypothesis and results are statistically non-significant
  • results likely to have occurred by chance
47
Q

Test the null hypothesis with four possible decisional outcomes

A
  • 2; represent possible correct decisions
  • 2; represent main errors- Type 1 or Type 2
48
Q

Type 1 Error

A
  • Incorrectly rejecting the null hypothesis, when the null hyp is actually true
  • observed value due to chance
  • even says it in the alpha value; “5/100 times we incorrectly reject the null hyp”
  • never know when we make this error but are confident that setting a low alpha value will make this more unlikely of occuring
49
Q

Type 2 Error

A
  • IV did cause a difference, but we did’t detect it
  • failing to reject the null hypothesis, when the null hyp is actually false
  • occur when F is too small
50
Q

Power

A

1-B
- probability we will reject the null hypothesis when it is false
- probability we will detect an effect

51
Q

Probability of type 1 error

52
Q

probability of type 2 error

53
Q

ideally we want type 1 and type 2 to be

A
  • both very low
54
Q

lower alpha in relation to type 1 and 2

A
  • makes it harder to reject the null hypothesis
  • makes it more likely you will miss a small difference in the effect
  • makes type 1 error less likely
  • but makes type 2 more likely; more likely you will fail to reject the null hyp, when the null is false and miss a small treatment effect
55
Q

is type 1 or 2 more serious?

A
  • Type 1 errors are considered more serious
  • more serious to say IV had an effect when it didn’t
56
Q

Effect Size

A
  • how much the groups differ on the DV
  • the effect of the IV on the DV
  • the numerator of the F ratio
  • NOT AFFECTED BY THE SIZE OF THE SAMPLE
57
Q

relationship bw effect size and sample size

A
  • if we have a small effect size, that means we have a small numerator of the F ratio. Because we want the F ratio to be large, we need to decrease the denominator by increasing the sample size
  • if we have a large effect size, that means we have a large numerator of the F ratio. we can still have a large F ratio even if the denominator is isn’t very small ^ would not need a big sample size
  • larger effect is easier to detect
58
Q

statistical significance is a function of

A
  • effect size and sample size
59
Q

when power is low, what error is most common?

A
  • Type 2 errors
60
Q

increase the power by

A
  • increasing the sample size
  • and therefore can reduce type 2 errors
61
Q

statistical significance

A
  • observed group differences are unlikely to be due to chance or error
  • done by increasing the sample size
62
Q

practical significance

A
  • ensure that the differences we observe have practical value in the real world
  • is the treatment effect large enough to have value in a practical sense
  • ex. new drug for depression lowers symptoms by one point; this is statistically significant but not pratical in the real world
63
Q

reduce type 1 and type 2 error by

A

type 2 reduce; larger sample size

type 1 reduce; lowering alpha (a) the significance level