13. Dealing with assumptions Flashcards

1
Q

Assumptions of t-tests

A
  1. random samples
  2. populations are normally distributed
  3. TWO SAMPLE T-TEST: populations have equal variances
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Can we statistically fix random sampling?

A

NO!!!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Use of histograms for normality

A

Not normal if skewed, asymmetrical
**especially clear with a large data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How can you check in something is normally distributed?

A

Check previous data/theory

Plot on a histogram - bell curve shape!

Quantile plot (QQ plot) expect dots to fall along a straight line if normal!

Shapiro-Wilk test (formal test of normality) - not particularly useful in deciding what test to use

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what are quantiles

A

divides range of probability distribution into continuous intervals w/ equal probabilities

10% below this value
20% below this value
30% below this value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Shapiro-Wilk test!

A

Used to test statistically whether a set of data comes from a normal distribution

NOT a good thing to decide if t-test BECAUSE more likely to reject a false null hypothesis when lots of data, BUT mostly we care about distribution of sample means being normally distrib, therefore would be fine to do other tests BUT shapiro-wilk would say no!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Strategies if NOT normal

A

if sample size is large, sometimes parametric tests work OK anyway!

transformations (ex log) - new set of values that may fit assumptions

non-parametric tests - makes fewer assumptions about distributions data came from, often based on ranks

permutation tests - asks if theres association btwn two variables, mix up variables and find association you would get by chance, compare to actual association!

bootstrapping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

rule of thumb for sample size

A

if sample size > ~50, the normal approximations may work

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Really great test:

A

Welch’s t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

If sample sizes are equal and large, what sort of difference in variance is APPROXIMATELY OK

A

ten-fold difference!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Requirements of data transformations

A

Same transformation applied to each individual (for a specific variable)

One to one correspondence to original values/transformed values - NOT absolute values!!! have to be able to go backwards!

monotonic relationship w/original values (ex large values stay larger)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Non-Parametric Tests

A

Assume less than parametric about underlying distributions

Most often RANK each data point in all samples from lowest to highest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Log transformation

A

Y’ = ln[Y]

Good when variable is likely to be the result of multiplication or division of various components - what was multiplicative, becomes additive

EX. growth - grow 10% a year not +10 mm, so log normal not normal normal!

Good for RIGHT skewed data, not left!!!

Good when variance becomes larger in groups where mean is larger

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Test to compare central tendencies of two groups using ranks

A

Mann-Whitney U test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly