Section 4 (start of section 2 - p59) Flashcards

Question 1

Q

What is the approach used in statistical inference when the purpose is to obtain information about the population parameters, such as the mean and standard deviation?

Answer

A

Estimation

Question 2

Q

What is the approach used in statistical inference when the purpose is to make comparisons with some hypothesised value?

Answer

A

Hypothesis testing

Question 3

Q

What are the 2 types of estimate?

Answer

A

Point

Interval

Question 4

Q

How are point estimates of population parameters derived?

Answer

A

From the corresponding sample parameters

Question 5

Q

When is a sample parameter said to be an unbiased estimator of the population parameter?

Answer

A

If the average of all possible sample parameters is equal to the population value
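
The idea can be checked by brute force on a tiny example. The sketch below is illustrative only: the population values and the sample size of 2 are made up, and it takes the sample mean as the estimator. It lists every possible sample and averages the sample means.

```python
from itertools import combinations
from statistics import mean

# Hypothetical tiny population, chosen only for illustration.
population = [2, 4, 6, 8, 10]

# Every possible sample of size 2, drawn without replacement.
sample_means = [mean(sample) for sample in combinations(population, 2)]

# Average of all possible sample means versus the true population mean.
average_of_sample_means = mean(sample_means)
population_mean = mean(population)
print(average_of_sample_means, population_mean)
```

Both printed values are 6, which is what "unbiased" means for the sample mean in this small case.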

Question 6

Q

How is an estimator of a population parameter represented in symbol form?

Answer

A

By the symbol for the parameter with a hat above it

Question 7

Q

How is the uncertainty associated with a point estimation expressed?

Answer

A

By confidence intervals

Question 8

Q

What is a confidence interval?

Answer

A

A range which we would expect, with a given level of confidence, to include the population parameter

Question 9

Q

What is the usual level of confidence used?

Other possible values?

Answer

A

95%

Can also use 99% and 99.9%

Question 10

Q

What happens to the width of the confidence interval as the level of confidence increases?

Answer

A

Width also increases
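
A rough sketch of why the interval widens, assuming a normal-based interval in which the point estimate is adjusted by a multiplier times the standard error. The helper name `z_multiplier` is made up for this example.

```python
from statistics import NormalDist

def z_multiplier(confidence):
    """Two-sided standard normal multiplier for a confidence level in (0, 1)."""
    return NormalDist().inv_cdf(1 - (1 - confidence) / 2)

# The three confidence levels mentioned in the card above.
multipliers = {level: z_multiplier(level) for level in (0.95, 0.99, 0.999)}
for level, z in multipliers.items():
    print(f"{level:.1%} confidence: z = {z:.2f}")
```

The multiplier grows from about 1.96 at 95% to about 2.58 at 99% and about 3.29 at 99.9%, so the interval gets wider for the same data.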

Question 11

Q

What is the name for the upper and lower values for the confidence interval?

Answer

A

Confidence limits

Question 12

Q

How are the confidence limits obtained?

Answer

A

By adding a value to, and subtracting it from, the value of the point estimate
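
A minimal sketch of this calculation, assuming the value added and subtracted is a standard normal multiplier times the standard error (the known population standard deviation case). The function name and all figures are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def confidence_limits(point_estimate, standard_error, confidence=0.95):
    """Return (lower, upper) limits as point estimate -/+ z * standard error."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z * standard_error
    return point_estimate - margin, point_estimate + margin

# Hypothetical figures: sample mean 50, known population SD 10, n = 25.
lower, upper = confidence_limits(50.0, 10 / sqrt(25))
print(round(lower, 2), round(upper, 2))
```

With these made-up numbers the standard error is 2, so the 95% limits are roughly 46.08 and 53.92.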

Question 13

Q

How does the confidence interval change for more or less variable data?

Answer

A

The less variable the data the narrower the confidence interval

Question 14

Q

What does precision refer to?

Answer

A

The variability of an estimate, not its accuracy

Question 15

Q

What does hypothesis testing use instead of an interval to express information about the population parameter?

Answer

A

A numerical value called the test statistic

Question 16

Q

What is the name of the statement based on the test statistic that is used to determine whether a claim about a population parameter, made in the null hypothesis, is accepted or rejected?

Answer

A

Decision rule

Question 17

Q

What is the symbol for the null hypothesis?

Answer

A

H0 (H nought)

Question 18

Q

What idea does the null hypothesis usually express?

Answer

A

There is no effect

Question 19

Q

What is tested against the null hypothesis?

Answer

A

The alternative

Question 20

Q

Symbol for the alternative hypothesis?

Question 21

Q

What is the standard error?

Answer

A

The standard deviation of the sample mean (that is, of the sampling distribution of the mean)

Question 22

Q

Is the mean of a sample from a non-normal population normally distributed?

Answer

A

Approximately - the larger the sample, the better the approximation

Question 23

Q

What is the name for the fact that the mean of a sample of n independent items from a non-normal population is approximately normally distributed, with the approximation improving as the sample size increases?

Answer

A

Central limit theorem

Question 24

Q

What is the main concept of the central limit theorem?

Answer

A

Whatever the shape of the population distribution, if the observations are independent and random, the sample mean will be approximately normally distributed, provided the sample size is large enough

Question 25

Q

How large does the sample size need to be for the central limit theorem?

Answer

A

About 30 observations if the population distribution is roughly bell-shaped
At least 40 if the original population is distinctly not normal
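
The effect of sample size can be seen in a small simulation. This is only a sketch: the exponential population, the sample sizes 2 and 40, the number of repeats and the skewness helper are all assumptions made for illustration.

```python
import random
from statistics import mean, stdev

random.seed(0)  # fixed seed so the sketch is repeatable

def sample_means(n, repeats=2000):
    """Means of `repeats` samples of size n from a skewed (exponential) population."""
    return [mean(random.expovariate(1.0) for _ in range(n)) for _ in range(repeats)]

def skewness(values):
    """Simple moment-based skewness; values near 0 suggest a symmetric shape."""
    m, s = mean(values), stdev(values)
    return mean(((v - m) / s) ** 3 for v in values)

means_small = sample_means(2)
means_large = sample_means(40)
print(skewness(means_small), skewness(means_large))
```

The means of the larger samples should show much less skew than those of the smaller samples, which is the central limit theorem at work.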

Question 26

Q

What factors are required for the mean of a sample from a normal population to be normally distributed?

Answer

A

Known population standard deviation

Observations in the sample are independent

Question 27

Q

What factors are required for the mean of a sample from a not normally distributed population to be normally distributed?

Answer

A

Large enough sample

Independent items

Question 28

Q

What is another name for plausibly?

Answer

A

Approximately

Question 29

Q

What would we expect to happen to the standard error and the confidence interval for the sample mean as the number in the sample increases?

Answer

A

Standard error decreases

Confidence interval becomes narrower
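
A short sketch of the relationship, assuming the usual formula for the standard error of the mean (standard deviation divided by the square root of n). The standard deviation of 12 is a made-up figure.

```python
from math import sqrt

def standard_error(sd, n):
    """Standard error of the sample mean: sd / sqrt(n)."""
    return sd / sqrt(n)

# Hypothetical standard deviation of 12 with increasing sample sizes.
for n in (10, 40, 160):
    print(n, round(standard_error(12.0, n), 2))
```

Each fourfold increase in n halves the standard error, so a confidence interval built from it becomes half as wide.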

Question 30

Q

If the population standard deviation is unknown, the mean of a sample of n items from a normal population with mean mu has what distribution? What are its properties?

Answer

A

t distribution (n - 1 degrees of freedom) with mean mu and standard error s / (square root of n)

Question 31

Q

How do the normal and t distributions compare when n is large?

Answer

A

Almost identical

Question 32

Q

What happens to the distribution of the sample proportion as the number in the sample increases?

Answer

A

Tends towards normality

Question 33

Q

When is the distribution of the sample proportion plausibly normal?

Answer

A

If both np and n(1-p) are greater than 5

Question 34

Q

For estimating proportions from sample parameters, what does the mean equal?

Answer

A

Population proportion p

Question 35

Q

For estimating proportions from sample parameters, what does the standard error equal?

Answer

A

Square root of (p(1-p)/n)
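
The formula and the np, n(1-p) rule of thumb from the earlier card can be sketched together. The function names and the example values of p and n are hypothetical.

```python
from math import sqrt

def proportion_standard_error(p, n):
    """Standard error of a sample proportion: sqrt(p * (1 - p) / n)."""
    return sqrt(p * (1 - p) / n)

def plausibly_normal(p, n):
    """Rule of thumb from the cards: both np and n(1 - p) greater than 5."""
    return n * p > 5 and n * (1 - p) > 5

# Hypothetical example: proportion 0.2 in a sample of 100.
se = proportion_standard_error(0.2, 100)
print(round(se, 4), plausibly_normal(0.2, 100), plausibly_normal(0.02, 100))
```

With p = 0.2 and n = 100 the standard error is 0.04 and the rule of thumb is met. With p = 0.02 it is not, because np is only 2.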

Question 36

Q

What is the aim of a hypothesis test?

Answer

A

To assess the validity of a claim about a population parameter

Question 37

Q

When must the hypothesis to be tested be defined?

Answer

A

Before data is collected

Question 38

Q

Should you ever use one-sided hypothesis tests?

Answer

A

No, they are rarely justifiable

Question 39

Q

What is the name of the region in which the result from a random sample would lie only 5% of the time by chance?

Answer

A

Critical region

Question 40

Q

What happens if a value lies in the critical region when doing a hypothesis test?

Answer

A

Casts doubt on the validity of the null hypothesis, which would then be rejected in favour of the alternative

Question 41

Q

What is it called when you reject a true null hypothesis?

Answer

A

Type I error

Question 42

Q

What is it called when you accept a false null hypothesis?

Answer

A

Type II error

Question 43

Q

What happens to type 2 errors if you reduce type 1 errors?

Answer

A

Increase type 2 errors (unless you increase the sample size)

Question 44

Q

What is the level of significance of the test?

Answer

A

The probability of making a type I error

Question 45

Q

What level of risk is considered to be acceptable for failing to detect an effect?

Question 46

Q

What is the complement of the significance level?

Answer

A

The confidence level

Question 47

Q

Why don’t we always use a 1% significance level instead of a 5% significance level?

Answer

A

This would increase the chance of making a type II error. Reducing the chance of wrongly rejecting a true null hypothesis increases the chance of wrongly accepting a false one, so a balance needs to be struck.

Question 48

Q

What level do we normally set the probability of making a type I error?

Question 49

Q

What level do we normally set the probability of making a type II error?

Question 50

Q

What is a test statistic?

Answer

A

A measure of the difference between what is expected if the null hypothesis were true and what is observed

Question 51

Q

What is the general formula for a test statistic?

Answer

A

(Observed value of parameter - expected value of parameter) / standard error of parameter
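
As a small sketch, the general formula translates directly into a function. The name `compute_test_statistic` and the figures are made up for illustration.

```python
def compute_test_statistic(observed, expected, standard_error):
    """(observed - expected) / standard error, with the subtraction done first."""
    return (observed - expected) / standard_error

# Hypothetical figures: observed mean 52.5, hypothesised mean 50, standard error 2.
statistic = compute_test_statistic(observed=52.5, expected=50.0, standard_error=2.0)
print(statistic)
```

Here the observed value lies 1.25 standard errors above the value expected under the null hypothesis.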

Question 52

Q

What is the test statistic based on the normal distribution for means when the population standard deviation is known?

Question 53

Q

What is the test statistic used for means when the population is normally distributed but the population standard deviation is not known?

Answer

A

Student's t test for the mean

Question 54

Q

Test statistic used for variance?

Answer

A

Fisher’s F test for variance

Question 55

Q

Test statistic used for proportions?

Answer

A

Chi-squared test for proportion (χ²)

Question 56

Q

What is the decision rule?

Answer

A

A statement of the conditions (values of the test statistic) for which the null hypothesis will be rejected

Question 57

Q

Decision rule number for 5% level of significance?

Question 58

Q

Decision rule number for 1% level of significance?

Question 59

Q

What are the 8 steps for testing a hypothesis?

Answer

A

Identify the distribution of the data
Construct the null and alternative hypothesis
Establish the significance level
Identify the test statistic
Formulate the decision rule
Carry out the study
Conduct the test
Make the decision and interpret the result
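
The steps above can be sketched end to end for one simple case. Everything specific here is an assumption made for illustration: a normal population with known standard deviation, a two-sided test on the mean at the 5% level, and made-up data.

```python
from math import sqrt
from statistics import NormalDist, mean

# Steps 1-2: assume a normal population with known SD; H0: mu = 100, H1: mu != 100.
mu_0, sigma = 100.0, 15.0

# Step 3: establish the significance level.
alpha = 0.05

# Steps 4-5: use a z statistic; reject H0 if |z| exceeds the two-sided critical value.
critical = NormalDist().inv_cdf(1 - alpha / 2)

# Step 6: carry out the study (hypothetical observations).
sample = [112, 98, 121, 105, 130, 101, 117, 109, 125]

# Step 7: conduct the test.
z = (mean(sample) - mu_0) / (sigma / sqrt(len(sample)))

# Step 8: make the decision.
reject_null = abs(z) > critical
print(round(z, 2), round(critical, 2), reject_null)
```

With these invented figures z is about 2.62, which exceeds the critical value of about 1.96, so the null hypothesis would be rejected at the 5% level.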