T-test Flashcards

1
Q

What is a z-score?

A

A z-score quantifies how many standard deviations an observation or data point is away from the mean

z = (x - μ) / σ

x = data point, u = mean, o = sd

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the mean and the sd of a z-score?

A

The mean will always be 0, and the sd 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is one benifit of calculating a z-score?

A

It allows us to compare values on different scales.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the equation for a one-sample t-test?

A

t = sampMean – popMean / estimatedpop(sd) / √n]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What happens to our t-statistic if our mean and pop mean are close together? (one-sample)

A

One has a lower t-score as these numbers represent the numerator in the equation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What will happen to our t-statistic if n increases?

A

Our t-statistic is likely to increase as well.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is a t-distribution calculated, and how does it differ from a normal distribution?

A

This is obtained by averaging lots of possible choices for pop(sd). It is different from a normal distribution in that it has heavier tails.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why does the size of the rejection region change as n increases?

A

Becasue the t-distribution depends on the sample size. If sample size goes down, the distriubtion looks more wide. Up, and it becomes more tall and narrow.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why aren’t one-sided tests recomended?

A

Because they could be doubling your chance of a type I error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why aren’t one-sided tests recomended?

A

Because they could be doubling your chance of a type I error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the same and what is different about a Welsh and Student t-test?

A

They both assume the population is normal, and that observations are independently sampled. Student population has equal variance, Welsh does not

Welsh is a safer test to run. Student estimates a pooled ‘sd’.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why is there a debate around checking for normality?

A

Tests are fairly robust to small violations of normality. Because it is always likely to show something is not normal with large enough samples sizes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When using a QQ-plot to asses normality, what should it look like and what is it using?

A

Should look like a scatter-plot where the value points represent percentiles. Normality is represented with a straight line.

limitation: have to eyeball a graph.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is one problem with a Shapio-Wilk test?

A

Often significant if sample size is too large (>50).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does a Shapiro-Wilk test print?

A

A W-score and a p-value.

a significant p = means not normal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

When testing normality assumption, what do you have to rememeber?

A

That this assumption is that each group is normally distributed. Not just one, or both groups combined.

Test has to be run on each group individually.

17
Q

What are non-paramatic tests? Give example + limitation.

A

These are tests that don’t make assumptions about the shape (normality) of a distribution. Not as powerful, can increase rate of Type II error. eg, Wilcoxon test

18
Q

What does a Wilcoxon test do?

A

Rather than testing the residuels it looks at each data point of two groups and counts how many times the value of one group is higher than the other. W-score output.

Null: you should expect W = approx. half the possible combinations.

19
Q

How should a t-statistic stat block look?

A

t(df) = t_sat, p-value

20
Q

How do you caluclate a cohen’s d?

A

d = mean1 - mean2 / sd

21
Q

Approx. reading of cohen’s d?

A

0.2 (small)
0.5 (med)
0.8 (large)

22
Q

What code do you need to include to run a student test?

A

var.equal=TRUE

23
Q

What code to run a one-sided test?

A

add alternative=“less” or
alternative=“greater” for t.test()