unit 2 - chapter 9 - hypothesis testing with 1 sample Flashcards

1
Q

6 steps of a hypothesis test

A
  1. State the hypotheses
  2. Set alpha
  3. Calculate the test statistic
  4. Determine the critical value
  5. Make a test decision
  6. Apply test result to context
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

4 facets of the null (Ho)

A

1 - the status quo
2 - everything is unrelated
3 - no difference between groups
4 - everything arises from chance

part of step 1 - state the hypotheses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

the alternative

A

(H1/HA)
The opposite of the null (Ho)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Ho and H1 are

A

Like the sun and the moon… Ho and H1 are mutually exclusive and collectively exhaustive (cover all possible outcomes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

status quo meaning

A

The current baseline (or status quo) is taken as a reference point, and any change from that baseline is perceived as a loss or gain. Corresponding to different alternatives, this current baseline or default option is perceived and evaluated by individuals as a positive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

step 2 - select alpha

A

Alpha →
Level of significance
Statistically significant
Our risk tolerance

Common alphas
0.10, 0.05, 0.01, 0.01

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

step 3 - calculate the test stat

A

1 - take a sample
2 - compute the summary statistics
3 - run the appropriate test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

step 4 - determine the critical value

A

1 - alpha
2 - 1 or 2 tail test
3 - type of test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

important step 5 - make your test decision

A

TS > CV –> reject the null
reject when test stat is large
TS < CV –> fail to reject the null
fail when test stat is small
small test stat is a fail

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Failing to reject Ho does not mean Ho is true

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

type 1 vs type 2 error

A

Type 1 error
Rejecting a true null
Probability is alpha (risk of making a type 1 error)

Type 2 error
Failing to reject a false null
Probability is beta

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Whether the null is true or false lies outside any test we do

A

true

We may not know where our population is
Alpha is static (will stay the same)
Not 1 - alpha

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

step 6 - apple rest result to context (conceptual)

A

Context: if i switch to ameriprise auto insurance I can save $532 or more
Ho mew <= $532
H1 mew > $532

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

test decision - 2x2 hypothesis test decision matrix

A

Ho is true + ftr Ho
correct decision

Ho is true + reject Ho
incorrect (type 1 - a)

Ho is false + ftr Ho
incorrect (type 2 - b)

Ho is false + reject Ho
correct decision

when you reject = new action taken
when you ftr = no new action

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Ho is true + ftr Ho

A

correct decision
free innocent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Ho is true + reject Ho

A

incorrect (type 1 - a)
convict innocent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Ho is false + ftr Ho

A

incorrect (type 2 - b)
free guilty

18
Q

Ho is false + reject Ho

A

correct decision
convict guilty

19
Q

alpha in tests

A

Certain beyond a reasonable doubt

20
Q

z tests - assumptions for 1 sample test

A

1 - the null is true –> always testing the null
2 - random sampling
3 - central limit theorem is satisfied
4 - interval or ratio data

21
Q

the test stat equation

A

TS = (xbar - mew) / sxbar

xbar = random variable
mew = population mean
sxbar = standard error of the mean

TS = observed value - expected value/chance

22
Q

what is SEM and why do we need it

A

Chance is randomness/error → SEM

we need to have a measure of relative distance

SEM = s/square root of n
standard deviatio adjusted by sample size

23
Q

normal distribution: converting z to Z score

A

Z = (x-x bar)/s

z = random variable
x bar = sample mean
s = standard deviation

24
Q

1 sample test for the mean: converting x bar to test stat

A

TS = xbar - mew/ sxbar

xbar = random variable
mew = population mean
sxbar = standard error of the mean

25
Q

important:

if we reject then we apply…
if we fail to reject then we apply…

A

reject = apply alternative back to context of problem. there is statistical significance

fail to reject = apply hypothesis back to context of the problem. statistically similar.

26
Q

important: why is the alpha critical line so far out on the tail

A

There’s a reason for why it’s so far out (loose numbers is in the center)
Possible type 1 error: rejecting a true null (alpha)
The result is not due to a real reason but due to chance

low score = chance so 1 data point =/= trend

27
Q

t vs z distibutions

A
  • Z and t distributions are both symmetric; centered at mew = 0
  • t distributions shape approaches Z at n = 30
  • t distributions wider tails at n < 30
  • t distributions account for the greater uncertainty (increase in variation) associated with smaller samples
28
Q

lenon pregnancy and hypothesis

A

Ho = lennon is without child
H1 = lennon is pregnant

29
Q

confidence intervals

A

One job of a statistician is to make statistical inferences about populations based on samples taken from the population. Confidence intervals are one way to estimate a population parameter. Another way to make a statistical inference is to make a decision about the value of a specific parameter.

30
Q

hypothesis testing

A

A statistician will make a decision about these claims. This process is called “hypothesis testing.” A hypothesis test involves collecting data from a sample and evaluating the data. Then, the statistician makes a decision as to whether or not there is sufficient evidence, based upon analyses of the data, to reject the null hypothesis.

31
Q

null hypothesis

A

H0: The null hypothesis: It is a statement of no difference between the variables–they are not related. This can often be considered the status quo and as a result if you cannot accept the null it requires some action.

32
Q

alternative hypothesis

A

Ha: The alternative hypothesis: It is a claim about the population that is contradictory to H0 and what we conclude when we cannot accept H0. This is usually what the researcher is trying to prove. The alternative hypothesis is the contender and must win with significant evidence to overthrow the status quo.

This concept is sometimes referred to the tyranny of the status quo because as we will see later, to overthrow the null hypothesis takes usually 90 or greater confidence that this is the proper decision.

33
Q

There are two options for a decision. They are “cannot accept H0” if the sample information favors the alternative hypothesis or “do not reject H0” or “decline to reject H0” if the sample information is insufficient to reject the null hypothesis.

A

Ho

> =
<=

H1
=/=
<
>

34
Q

type 1 and type 2 error from book

A

α = probability of a Type I error = P(Type I error) = probability of rejecting the null hypothesis when the null hypothesis is true: rejecting a good null.

β = probability of a Type II error = P(Type II error) = probability of not rejecting the null hypothesis when the null hypothesis is false. (1 − β) is called the Power of the Test.

α and β should be as small as possible because they are probabilities of errors.

35
Q

critical values

A

z = (x-xbar)/s

critical values, are marked on the bottom panel as the Z values associated with the probability the analyst has set as the level of significance in the test, (α). The probabilities in the tails of both panels are, therefore, the same.
Notice that for each 𝑋⎯⎯⎯there is an associated Zc, called the calculated Z, that comes from solving the equation above.

This calculated Z is nothing more than the number of standard deviations that the hypothesized mean is from the sample mean. If the sample mean falls “too many” standard deviations from the hypothesized mean we conclude that the sample mean could not have come from the distribution with the hypothesized mean, given our pre-set required level of significance.

36
Q

p value approach

A

Simply stated, the p-value approach compares the desired significance level, α, to the p-value which is the probability of drawing a sample mean further from the hypothesized value than the actual sample mean. A large p-value calculated from the data indicates that we should not reject the null hypothesis.

The smaller the p-value, the more unlikely the outcome, and the stronger the evidence is against the null hypothesis. We would reject the null hypothesis if the evidence is strongly against it.

37
Q

alpha vs p value

A

If α > p-value, cannot accept H0. The results of the sample data are significant. There is sufficient evidence to conclude that H0 is an incorrect belief and that the alternative hypothesis, Ha, may be correct.

If α ≤ p-value, cannot reject H0. The results of the sample data are not significant. There is not sufficient evidence to conclude that the alternative hypothesis, Ha, may be correct. In this case the status quo stands.

When you “cannot reject H0”, it does not mean that you should believe that H0 is true. It simply means that the sample data have failed to provide sufficient evidence to cast serious doubt about the truthfulness of H0.

38
Q

one-tailed tests

A

It may be that the analyst has no concern about the value being “too” high or “too” low from the hypothesized value. If this is the case, it becomes a one-tailed test and all of the alpha probability is placed in just one tail and not split into α/2 as in the above case of a two-tailed test. Any test of a claim will be a one-tailed test.
For example, a car manufacturer claims that their Model 17B provides gas mileage of greater than 25 miles per gallon. The null and alternative hypothesis would be:
H0: µ ≤ 25
Ha: µ > 25

39
Q

effects of sample size on test statistics

A

In developing the confidence intervals for the mean from a sample, we found that most often we would not have the population standard deviation, σ. If the sample size were less than 30, we could simply substitute the point estimate for σ, the sample standard deviation, s, and use the student’s t-distribution to correct for this lack of information.

When testing hypotheses we are faced with this same problem and the solution is exactly the same. Namely: If the population standard deviation is unknown, and the sample size is less than 30, substitute s, the point estimate for the population standard deviation, σ, in the formula for the test statistic and use the student’s t-distribution.

If we do not know σ, but the sample size is 30 or more, we simply substitute s for σ and use the normal distribution.

40
Q

one tailed test vs two tailed test

A

hypothesized values of p (probability)…

two tailed=
Ho: p = po
H1: p =/= po

one tailed=

Ho: p<=po
H1: p>po
either direction