Topic 6 Hypothesis Testing Flashcards

1
Q

One sample testing vs 2 sample testing

A

Single sample against population e.g grades in Birmingham vs population in England

Two samples against EACH OTHER. E.g grades in Birmingham vs London

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

3 assumptions to conduct one sample hypothesis testing

A

Random sampling
Level of measurement is interval or ratio (to calc mean)
Sample distribution is normal (we can be sure of this if large size, using CLT)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Step 1 of one sample hypothesis testing

A

State the null and alternate hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Null hypothesis

A

Statement about the value of a population parameter

H0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Alternate hypothesis h1

A

Statement accepted if null is false.

Inequalities are always part of the alternate i.e <,>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Possible null and alternative hypothesis symbols

A

If h0 is = h1 is ≉
If h0 is greater or equal to, h1 is <
If h0 is less or equal to, h1 is >

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Steps to one sample hypothesis testing

A

1.State null and alternative

2.Choose level of significance e.g 95% confidence so alpha=0.05

  1. Choose test statistic
  2. Formulate decision rule (critical value is where the null hypothesis is rejected)
  3. Make decisions
    - calculate test statistic

Find z value (sample mean (x bar) - population mean/(pop SD/root N (sample size)

E.g if critical region is 1.96 and z= 8, it falls in critical region so we reject null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What happens for one tailed tests

A

E.g left tail test, full 5% will lie in lower tale, changes the decision rule (critical value changes) now -1.65, 8>-1.65 so we now fail to reject null

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

T statistic and z score formulas ( the value that will be compared to the critical region)

A

Z score

Sample mean (X bar) - population mean (μ)
/
σ/√N.

It is similar to Z value, but Z value doesn’t have root N, as no sample size.

T statistic

Same but s instead of σ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When to use one vs two tailed

A

Want to know whether a different number of holidays happen in Edgbaston to Birmingham overall, use 2 tailed.

If we want to know whether edgbaston take MORE holidays then one tailed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When to use sigma or s

A

If sigma unknown use S!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When to use z or t distribution, when unknown sigma

A

If large sample, Z
Small sample, T

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

P-value

A

Highest level of confidence we can reject the null

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How to find p value example

A

Lower p value is, the more confident

If p value is 0.01, reject null up to 99% confidence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

3 sample hypothesis test assumptions e.g grades London vs Birmingham

A

Two independent random samples used (not related)

Level of measurement is interval or ratio (to calc mean)

Normal sampling distribution (confirm if large, by CLT)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Two sample hypothesis test steps (5)

A

State null and alternate: we state no difference between 2 samples for null.

Choose level of significance e.g alpha 0.05

Choose test statistic (z or t)

Formulate decision rule (find critical region)

Make decision

17
Q

Z score formula for two sample testing, if we know SD

A

(sample mean₁ - sample mean₂)-(μ₁ - μ₂)
/
√(σ²₁/n₁)+(σ²₂/n₂)

18
Q

Z score formula for two sample testing, if σ unknown.

A

Replace sigma1 and sigma2 (population deviation) with s1 and s2 (sample deviation)

19
Q

What do we use if sample is small in 2 sample test

A

T distribution

20
Q

If two populations have same SD, t statistic is….

A

Sample mean1(X bar1) - sample mean 2 (X bar 2)
/
√S²p (1/n₁ + 1/n₂)

S squared P is the pooled estimate of population variance.

21
Q

S²p formula

A

(n₁-1)s²₁ + (n₂-1)s²₂
/
(n₁+n₂-2)

n₁+n₂-2 is degrees of freedom e.g if n1=10 n2=9 DF=17

22
Q

If we assume 2 sample populations have different SDs unlike earlier , t statistic is…

  1. Degrees of freedom parameter for this (different S.D)
A

Sample mean₁(X bar 1) - Sample mean₂ (X bar 2)
/
√s²₁/n₁ + s²₂/n₂

(Same denominator as large but sigma unknown!)

  1. DF= (s²₁/n₁) + (s²₂/n2) all squared (AGAIN)
    /
    (S²₁/n₁)² / n₁-1 + (s²₂/n₂)²/n₂-1
23
Q

T statistic for dependent (paired) samples

A

d bar
/
S of d/√N

D bar is mean change in pair of observations.
S of d is SD of differences.
N is number of observations.

24
Q

S of D formula

A

√Σ(d-d bar)²
/
n-1

25
Q

One sampling and two Sampling test, when to use Z or T. Same rule for both

A

Z-when σ is known. Always use Z
If σ is unknown but sample is large, still use Z

T-when σ is unknown and sample is small.