9. Testing Hypotheses and Assessing Goodness of Fit Flashcards

Question 1

Q

Null Hypothesis

Definition

Answer

A

-the so-called null-hypothesis refers to some basic premise to which we will adhere unless evidence from the data causes us to abandon it

Question 2

Q

Null Hypothesis

In General

Answer

A

in general:
-we specify the pmf p(x;θ) [pdf f(x;θ)] but there is doubt about the value of parameter θ
-θ is some element of a specified parameter space Θ
-Ho specifies that θϵΘo⊂Θ
-Ho is true if θϵΘo but false if θ∉Θo

Question 3

Q

Alternative Hypothesis

Definition

Answer

A

-as a convention,
-the compliment of Θo is denoted as:
Θ1 = Θ - Θo

Question 4

Q

Null Hypothesis and Alternative Hypothesis

Symbols

Answer

A

the original hypothesis that θϵΘo is called the null hypothesis and denoted Ho
the hypothesis that θϵΘ1 is referred to as the alternative hypothesis and denoted by H1

Question 5

Q

Hypothesis Test

Definition

Answer

A

a hypothesis test is conducted using a test statistic with distribution known under the null hypothesis, Ho
a hypothesis test is used to consider the likely truth of the null hypothesis as opposed to a stated alternative hypothesis

Question 6

Q

p-value

Definition

Answer

A

the p-value (or significance level) is the probability of the test statistic taking a value at least as extreme as its observed value
the p-value is calculated assuming that the test statistic is distributed as if the null hypothesis was true

Question 7

Q

t-test

Definition

Answer

A

-a t-test is used for observations independently drawn from a normal distribution N(μ,σ²) with unknown parameters
-given the sample mean X_ and the sample variance S², the test statistic is:
T = √n(X_-μo)/S² ~ t(n-1)
-under the null hypothesis Ho: μ=μo

Question 8

Q

z-test

Definition

Answer

A

-for observations drawn from a normal distribution N(μ,σ²) but with σ² known
-we use a Z-test of Ho: μ=μo with test statistic:
Z = √n(X_-μo)/σ ~ N(0,1)
-under Ho

Question 9

Q

Paired t-test

Definition

Answer

A

-suppose that we have pairs of random variables (Xi,Yi) and that Di=Xi-Yi, i=1,…,n is a random sample from a normal distribution N(μ,σ²) with unknown parameters
-we use the test statistic
√n(D_-μo)/Sd ~ t(n-1)
-under the null hypothesis Ho: μ=μo
-here Sd² is the sample variance of the difference Di

Question 10

Q

Two Sample t-test

Outline

Answer

A

-consider two random samples X1,…,X, and Y1,…,Yn which are independent, normally distributed with the same variance
-the null hypothesis is Ho: μx=μy
-under Ho we can construct a test statistic T such that:
T ~ t(m+n-2)

Question 11

Q

Two Sample t-test - Two Samples Same Variance

Definition

Answer

A

-use test statistic

T = (X_-Y_) / (S√[1/m + 1/n]) ~ t(m+n-2)

Question 12

Q

Critical Region

Definition

Answer

A

-suppose we have:
–data, x_ : x1,x2,…,xn
–we have set a decision rule in advance: rejecting the null hypothesis when the p-value is less than or equal to a fixed α
-C1 is called the critical region of the test and α is the significance level
-note that α is the probability of rejection of Ho given that it is true
-in other words:
P(x1,x2,…,xnϵC1 | Ho) = α

Question 13

Q

Type I Error

Definition

Answer

A

-the error of rejecting the null hypothesis when it is actually true

Question 14

Q

Type II Error

Definition

Answer

A

-the error of not rejecting the hypothesis when it is actually false

Question 15

Q

Probability of Type I Error

Answer

A

P(Type I error) = P(xϵC1 | Ho) = α

Question 16

Q

Probability of Type II Error

Answer

A

P(Type II error) = P(xϵCo | H1) = β

Question 17

Q

Power Definition

Answer

A

-the probability that Ho is correctly rejected:
P(xϵC1 | H1) = 1 - β
-is called the power of the test

Question 18

Q

Connection Between Hypothesis Testing and Confidence Intervals

Answer

A

when considering random variables with mean μ and null hypothesis μ=μo, a p-value of less than α is equivalent to the appropriate confidence interval at (1-α)100%
i.e. performing a hypothesis test at significance level α is equivalent to checking if a value falls within confidence interval (1-α)100%

Question 19

Q

Choosing a Value for the Significance Level

Answer

A

the significance level chosen must depend on the question being asked and the consequences of being wrong
although p~<0.05 is quite usual, in some cases this error rate would be far too high

Question 20

Q

Sample Size Calculation

Answer

A

the power of a test can be used to compute a sample size
first find the value corresponding to the significance level and null hypothesis
then use the equation for power to gain an expression for n using the alternative hypothesis