Tests and Distributions (Basic Stats) Flashcards

1
Q

Null Hypothesis

A

“no effect”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Alternative Hypothesis

A

Some real effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Test Statistic

A

Standardized difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Obtain p-value from the…

A
sampling distribution
(If we had lots of samples, and the H_0 were true, what is the [theoretical] distribution of the test statistic?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

General framework of a test:

A
  1. State null and alternative hypotheses
  2. Calculate test statistic
  3. Obtain p-value from sampling distribution
  4. Make conclusion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

“Under H_0” means…

A

“if H_0 is true”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

p-value is:

A

probability of observing the data we did (or more extreme), just by chance give that H_0 is true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When is the t-distribution appropriate?

A

When the model assumptions are met:
- need to check the distribution for normality
(If the underlying distribution isn’t normal, than the distribution from which the p-value comes from will be incorrect, and will give an erroneous value!)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What graphical checks can we do to check normality?

A
  • Boxplot
  • Histogram
  • Normal Probability Plot (Q-Q plot) (compare observed values to expected
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Boxplots don’t really test for…

A

outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a Kernel?

A

Basically a smoothed histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What shape is a Q-Q plot if the distribution is short-tailed?

A

S- shaped!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What shape is a Q-Q plot if the distribution is long-tailed?

A

Inverted S (like x^3 graph)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What shape is a Q-Q plot if the distribution is right-skewed?

A

Like e^x function (curves upwards to inf)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What shape is a Q-Q plot if the distribution is left skewed?

A

like sqrt(x) function. (curves to a steady value)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What do we do if normality assumptions are violated?

A
  • Transform the response variable
  • or try a non-parametric test (aka no distribution is assumed) (these are usually less statistically powerful)
  • Generate the sampling distribution by the Permutation Approach
17
Q

What transformations can we try for right-skewed data?

A
  • log

- sqrt

18
Q

What is the permutation approach? (for t-tests)

A

Since H_0 says the labels don’t matter, if the underlying distribution isn’t normal, then we can rearrange the labels randomly a LOT (or possibly all) of times and find the distribution of all possible t-values.

Then finding the area under the tails from our t-value onwards will give us the p-value!

19
Q

What is resampling?

A

When we generate the sampling distribution with repeat samples of the data
(ie: bootstrap sampling (with replacement), or permutation (w/o replacement)

20
Q

What are non-parametric tests?

A

Wilcoxon Rank sum

Mann-Whitney test

21
Q

Hypotheses for numerical normality tests?

A

H_0: the data are normal
H_A: the data are not normal

22
Q

When to use the Shapiro-Wilk test for normality?

A

When 10 < n < 2000

23
Q

When to use the Kolmogorov-Smirnov Test for normality?

A

When n > 2000