Quant: Hypothesis Testing Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What is a hypothesis?

A

A statement about the value of a population parameter developed for the purpose of testing a theory or belief.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Steps in Hypothesis Testing Procedure =

A

State the hypothesis

Choose appropriate test statistic

Specify the level of significance

State the decision rule regarding the hypothesis

Take a sample, calculate sample stats

Make a decision regarding the hypothesis

Make a decision based on the results of the test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Null hypothesis =

A

what we want to REJECT.

It is what is ACTUALLY tested, and is the BASIS for the test statistic chosen

always includes the “equal to” condition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Alternative hypothesis =

A

what we actually want to be TRUE (most of the time)

we conclude this if there is sufficient evidence to REJECT the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

One tailed vs Two tailed tests =

A

one sided will be used to show that a value is greater than a single value.

a two tailed test will have 2 critical values/rejection points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Test Statistic =

A

is a RANDOM VARIABLE that follows a T, Z, CHI-SQUARE or F distribution

calculated by comparing the point estimate of the population parameter and the hypothesized value of the parameter.

We then compare the TEST STATISTIC to the CRITICAL VALUE of the test statistic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

population

sample

sample statistic

WHAT IS THE CONNECTION?

A

we want to draw an inference regarding the parameter of a POPULATION. We take a SAMPLE of the population. We compute a SAMPLE STATISTIC. We draw our INFERENCE.

HOWEVER there is a chance that this SAMPLE is not representative of the popuilation–> ERROR!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

when drawing inferences from a population -TYPE I ERROR =

A

rejection of the null hypothesis when it is actually true.

THE SIGNIFICANCE LEVEL is the chance of making a type 1 error - ie 5% significance level, means there is a 5% chance of rejecting the null when it should not be.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

when drawing inferences from a population -TYPE II ERROR =

A

failure to reject the null hypothesis when it is actually false

chance of type II error INCREASES/power of test DECREASES with

  • decrease in level of significance (ie from 5% to 1%)
  • decrease in sample size
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

the POWER of the test =

A

1- P(making a TYPE II ERROR)

where a type II error is failing to reject the null when it is false.

We can use the power of the test to compare two different test statistics (and typically choose the one with THE GREATEST POWERS)

The power of the test increases whenyou reduce the chance of a type two error. This happens when you increase the significance level (ie. 1% to 5%), which increases the chance of making a type 1 error.

Seems we can take P.O.T as “the usefulness of the test in rejecting the null’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

CHEAT CARD, T1 & 2 ERRORS, SIGNIFICANCE LEVELS AND POWER OF THE TEST=

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Confidence interval =

A

A range of values within which the researcher believed the true population parameter may lie.

A 95% confidence interval uses a critical value associated with a particular distribution at a 5% level of significance.

A hypothesis test would compare a test statistic to a critical value at a 5% level of significance.

NB. standard error = s/√n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Statistical results vs economically meaningful results =

A

a statisitically significant result does not necessarrily mean an economically meaningful one: transactions costs, taxes and risk are not normally considered in the statistical test.

“Any of these factors could make committing funds to a strategy unattractive, even though the statistical evidence of positive returns is highly significant. By the nature of statistical tests, a very large sample size can result in highly (statistically) significant results that are quite small in absolute terms.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

P-value =

A

probability of obtaining a test statistic that would lead to the rejection of the null hypothesis when it is actually TRUE. Smallest level of significance for which the null hypothesis can be rejected.

take the test statistic, find the corresponding significance level from the distributiont (multiply by two if a two tailed test)

P- VALUE WILL SHOW THE LOWEST LEVEL OF SIGNIFICANCE FOR WHICH WE WOULD ACCEPT/REJECT THE NULL (THE THRESHOLD) - WHAT THE PROBABILITY IS THAT A VALUE WOULD BE GREATER THAN THE TEST STAT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

t-test =

A

Use t-test when you could use a t distribution (DO YOU KNOW WHAT THE NECESSARY CONDITIONS ARE?)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

z-test (variance known and unknown) =

A

Used when you could uses t stat (WHAT ARE THE CONDITIONS?)

The test compares a z-statistic with a critical z-value (which corresponds to the significance of the test)

If the variance is known, ‘σ’ becomes ‘s’, the sample standard deviation.

17
Q

CHEAT CARD: CRITICAL Z-VALUES =

A

EXAMPLE FOR A T/Z TEST IS ON P316 OF SCHWESER

18
Q

Hypothesis T-TEST: 2 populations, variance assumed to be EQUAL =

A

We are testing the hypothesis that the means of the two populations are equal.

Variances are unknown, but ASSUMED TO BE EQUAL.

Assumptions (‘we are reasonably certain that..”): populations are independent AND normally distributed.

SAMPLE OBSERVATIONS ARE POOLED.

KEY POINT: the numerator is the difference between sample means. The larger the difference, the bigger the t-stat, the more likely we are to reject the null and conclude the means are not equal (and vice versa if the numerator is small)

19
Q

Hypothesis T-TEST: 2 populations, variance NOT assumed to be EQUAL =

A

We are testing the hypothesis that the means of the two populations are equal.

Variances are unknown, but NOT ASSUMED TO BE EQUAL.

Assumptions (‘we are reasonably certain that..”): populations are independent AND normally distributed.

BOTH SAMPLE VARIANCES ARE USED and T-TEST USES AN APPROXIMATED VALUE FOR THE DEGREES OF FREEDOM.

KEY POINT: the numerator is the difference between sample means. The larger the difference, the bigger the t-stat, the more likely we are to reject the null and conclude the means are not equal (and vice versa if the numerator is small)

20
Q

Forming a hypothesis T-Test (2 populations, are the means equal) =

A

Hypothesis

Level of significance

T stat is calculated using craycray formulas

Deg of freedom is based on number of samples (n1+n2-2) if variance is assumed to be equal, calculated from craycray formula otherwise.

State the decision rule (what is the critical value and when we would reject the null)

21
Q

Paired comparison test =

A

DATA MUST BE NORMALLY DISTRIBUTED

Used when samples may be dependent (ie returns of two steel companies, both would be affected by market returns and the same industry specific risks). We can’t use standard comparison T-TEST.

Test of whether the average difference between the observations of the two samples is SIGNIFICANTLY DIFFERENT FROM ZERO.

KEY POINT: t-test is with n-1 degrees of freedom, where n=number of paired observations.

t = avg diff in paired observations/std error of the differences between observations

22
Q

Chi-square test =

A

for hypothesis tests for the variance of a normally distributed population.

ASSUMPTION: population is normally distributed.

Chi-square distribution is asymmetrical and approaches a normal distribution as DOF increase.

23
Q

F-test =

A

for testing the equality of the variances of two populations.

ASSUMPTIONS: normally distributed and independent populations.

Can be two sided (Var1 = Var2) or one sided (Var1 > Var2)

For F-test always put the larger variance on top (numerator) - this lets us consider only the critical value in the right tail.

24
Q

Parametric tests =

A

rely on assumptions about the distribution of the population and are specific to population parameters.

ie. Z-test: relies on mean and σ to define a normal distribution, requires population is normally distributed or sample is large.

25
Q

Non-Parametric tests =

A

either do not consider a population parameter or have few assumptions about the population that is sampled.

  1. assumptions reqd for a parametric test are not met
  2. data are ranks (ordinal) and not values
  3. hypothesis does not involve the parameters of the distribution ie runs test that provides a probability estimate of whether a series is random

Spearman rank correlation test: could be used on ranked data to show relation between values ie is a high rank in year 1 associated with a high rank in year 2