Applied Economics & Statistics: Topic 5 - Estimation and Confidence Intervals* Flashcards

1
Q

Why do we estimate?

A
  • We usually don’t have access to data for the whole population.
  • Inferential Statistics estimates population parameters from sample statistics.

Example: Determine the mean income of UK residents.
We don’t have access to everyone’s income values, and it’s too costly
and time consuming to collect this information.
Instead, take a random representative sample of UK residents (using
an appropriate sampling technique).
Use an estimator, the sample mean, to estimate the population value
for mean income: a point estimate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Describe what makes a good estimator in statistics

A
  • We want our estimator to be accurate.
  • If we don’t know the population parameter, how do we know the
    estimator is close to it?
  • Estimator must have “good properties.”
    Specifically, it must be unbiased and efficient.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When is an estimator bias?

A

An estimator is biased if the mean of its sampling distribution is not
equal to the population value of interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Explain whether the ‘sample mean’ is bias or not

A

We know from the Central Limit Theorem that the sample mean is an
unbiased estimator of the population mean, because its sampling
distribution is centred on the population mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When choosing between multiple unbiased estimators, which one is preferred?

A

The one with
least variance (more efficient) is preferred.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Describe how we can tell when an estimator has a high efficiency

A

An estimator with high efficiency will have a low standard deviation in
its sampling distribution.
So a high degree of clustering of values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does it mean when an estimator has low standard deviation?

A

It has a high efficiency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describe how the efficiency of estimators can be improved

A
  • We can improve the efficiency of estimators by increasing the size of
    the sample used.
  • Recall the standard deviation of the sample mean is σ/√n.
  • So as n increases, the standard deviation gets smaller
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does increasing sample size do to an estimator?

A

It increases its efficiency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does an unbiased and efficient estimator do?

A

It gives us the best chance of
getting an estimate close to the true value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Which estimator gives us the best chance of
getting an estimate close to the true value?

A

Unbiased and efficient estimator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does a biased but efficient estimator do?

A

It would provide estimates that are
clustered but around the wrong value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Which estimator would provides estimates that are
clustered but around the wrong value?

A

A biased but efficient estimator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does a biased and inefficient estimator do?

A

We wouldn’t even get close on
average, let alone with a single sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What type of estimator wouldn’t even get close on
average, let alone with a single sample?

A

A biased and inefficient estimator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

In practice, describe which estimators we use

A

In practice, we might be willing to use an estimator that is biased but the most efficient, if its bias is small. But of course we always prefer
to use the most efficient (minimum variance) and unbiased (on average correct) estimator

17
Q

What are ‘estimates’?

A

Values derived from a sample and used to approximate
population values

18
Q

What’s a ‘point estimate’?

A

The statistic, computed from sample information, that estimates a
population parameter. Example: the maximum temperature tomorrow will be 15C. We can think
of this as our “best guess” as to what tomorrow’s temperature will be

19
Q

What’s the name for ‘the statistic, computed from sample information, that estimates a
population parameter’?

A

Point Estimate

20
Q

What are ‘confidence intervals’?

A

A range of values constructed from sample data so that the population parameter is likely to occur within that range at a specified probability. The specified probability is called the confidence level. Example: the maximum temperature tomorrow will be between 13C and
17C

21
Q

What’s the name for ‘a range of values constructed from sample data so that the population parameter is likely to occur within that range at a specified probability’?

A

Confidence Intervals

22
Q

What are confidence levels often stated as?

A
  • Confidence intervals are often stated as CI = point estimate ± margin of
    error:
    1. We find a point estimate by using the sample mean, ̄x , as we have
    already seen in previous topics.
    2. But we also state this as a CI for which we provide a range of
    values and a measure of how certain we are that this range contains
    the true population mean value.
    Example: For example: “At a confidence level of 95%, between 52% and
    58% of Americans favour the death penalty for people convicted of
    murder.” i.e. “The support for death penalty is currently 55%(±3%).”
23
Q

What are the 2 possible situations for calculating confidence intervals for a population mean?

A
  1. We use sample data to estimate μ with ̄x and the population standard deviation σ is known.
  2. We use sample data to estimate μ with ̄x and the population standard deviation σ is unknown. In this case, we use sample standard deviation s
24
Q

Describe & explain how we find CIs when σ is known

A
  • The distribution of the sample mean is:
    ̄x ∼ N (μ, σ^2/n),
    and we know that
    z = ( ̄x − μ) / (σ/√n) ∼ N(0, 1).
  • We use this information to determine probabilities associated with ̄x
    taking values in certain ranges, and from that build CIs
  • We often want to find a 95% confidence interval, i.e. one in which we are
    95% confident the true population mean lies.
    For what values of z would there be 95% of the area under the standard
    normal distribution curve? You can read this from a z table and you
    should find that the value is 1.96. So, 95% of the area lies in the interval
    (−1.96, +1.96), or:…
25
Q
A