Introduction to Inferential Statistics Flashcards

1
Q

How can we infer something about the population based on what we find in the sample?

A

Normal distribution
Empirical rule
Central limit theorem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Are t-values and standard errors used to estimate population parameters?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are inferential statistics used for?

A

To draw conclusions and make inferences about population parameters by analyzing data collected in a sample
To infer something about the population parameter using sample statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Will parameter estimates based on a sample exactly equal the population parameter?

A

No, because they vary from sample-to-sample
So, when reporting a statistic, we also typically report an interval (e.g., 95% confidence interval) which we believe includes the population parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a confidence interval?

A

A range of values within which a population parameter (e.g., mean or proportion) is likely to fall, based on a sample from that population
If you repeated a study of the same size many times, 95% of the resulting confidence intervals would cover (include) the true population parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Do statistics estimate the sample?

A

Yes
And parameters estimate the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Since samples describe the population, do statistics describe the parameters?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What notations are used for samples?

A

Number of people: n
Mean: X (with a bar on top)
Variance: s^2
Standard deviation: S or SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What notations are used for the population?

A

Number of people: N
Mean: mu
Variance: sigma^2
Standard deviation: sigma

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the primary characteristics of a normal distribution?

A

Symmetric and unimodal (one peak)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why is a normal distribution the most important distribution in inferential statistics?

A

It’s characteristics form the foundational assumptions underlying many interferential statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Can you apply the empirical rule to any normal distribution?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the empirical rule?

A

68% of the observations fall within 1 standard deviation of the mean
95.4% of the observations fall within 2 standard deviations of the mean
99.7% of the observations fall within 3 standard deviations of the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

If something has a normal distribution, can you use the central limit theorem?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the three distributions?

A

Sample distributions (distribution of the sample)
Population distribution (distribution of the population)
Sampling distribution (distribution of a statistic over a set of of theoretical samples; distribution of sample means; plotted means of various samples)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Is the sampling distribution of the sample means approximately normal?

A

Yes
The sampling distribution of the sample means becomes “more normal” as n (or number of samples) increases
The mean of the sampling distribution of sample means will be the same as the mean of the population

17
Q

Since the distribution of the sample means is approximately a normal distribution, can the empirical rule be applied?

18
Q

Do we know that 95% of all sample means are within 2 SDs of the population mean?

A

Yes, based on the empirical rule
This theoretical assumption is the basis for inferential statistics

19
Q

What is the standard error of the mean?

A

Analogous to the SD of the population data
SEM = SD of the sample/square root of the sample size

20
Q

How is the standard deviation related to the sample error of the mean?

A

Just as the standard deviation increments the distance of a raw score from the population mean, the SEM increments the distance of the sample mean from the population mean

21
Q

What will make the SEM change?

A

The SD of the sample
The size of the sample

22
Q

Why are the standard error (SD) increments of the distribution of the sample means so much narrower?

A

Because we are just plotting means, not raw scores
Outliers are not applied

23
Q

What are z-scores?

A

Indicate how far a score is from the mean in a population context
Raw scores incremented by standard deviation units
A z-score indicates how far a raw score is from the population mean

24
Q

What are t-scores?

A

Indicate how far a score is from the mean in a sample context
Sample means incremented by standard error units
A t-score indicates how far a sample mean is from another mean
Sample means are distributed following the t-distribution

25
Q

Do t-scores differ depending on sample size?

26
Q

Is there a different t-distribution for each discrete number of degrees of freedom?

A

Yes aka sample sizes

27
Q

As the sample size increases, does the t-value get closer to the z-score?

28
Q

What is a confidence interval?

A

The interval (range of scores) which we believe includes the population parameter
We can be 95% confident that ______ on average between…

29
Q

Since we rarely know the population mean, what do we use instead?

A

the mean score of our sample
the standard error of the mean
the assumption that the sampling means are normally distributed (so we can apply the Central Limit Theorem)

30
Q

What is a critical t value?

A

The “critical value” is the cutoff point for 5% of the distribution
A test statistic value (t) exceeding the cut-off point is statistically significant, p<0.05

31
Q

What do narrower confidence intervals indicate?

A

The more precisely we have estimated the population parameter
Generally, the width of the confidence interval is inversely related to the size of the sample

32
Q

What are the four sources of variability?

A

Differences attributable to group membership (between group)
Sampling error (within group)
Measurement error (within group)
Individual differences (within group)

33
Q

What is important to determine when comparing groups?

A

Does one group constitute their own population
If 95% CI do not overlap between groups, the difference is significant (p<0.05)

34
Q

If the 95% CI do overlap, is the difference significant?

A

No, p>0.05

35
Q

How do you calculate the critical t score when you are comparing groups?

A

Based on the total number of subjects minus 2
*used when calculating the 95% CI around the mean difference for two groups

36
Q

What are the two approaches for comparing the 95% CI of two groups?

A

Do CI for each group separately and then see if they overlap
95% CI around the mean difference (CI for the difference in mean between groups; if the interval crosses zero, they are not significantly different)