Week 2 - Estimation Flashcards
Population
Is what we want to study, but don’t have immediate access to
Sample
A portion of the population
Sampling
Method of getting a sample from population
Inference
Conclusions from the sample that relate to the population
Estimation
Common type of Inference
Use a sample -> Do calculation -> Obtain approximation to do with population
Parameter
A property of the population
Statistic
A quantity calculated from the sample (data)
Estimator (point estimator)
A statistic used for estimating a parameter (Eg: sample mean)
But refers specifically to random variable version (uppercase X)
Estimate (point estimate)
The observed value of an estimator
Uses specific data (lowercase x)
Sampling variation
Refers to the spread of data points between samples within same population
Sampling distribution
A probability distribution of a statistic that comes from choosing random samples of a given population
X¯ ∼ N(µ, σ2/n) - sample mean
Random sample
Observations are ‘independent’ and ‘identically distributed’ (iid)
Standard error
An estimate of the standard deviation of the estimator
Measures the variability or uncertainty of a sample statistic
se(µ^) = S/√n
Confidence interval
Range of values, derived from sample data, that is likely to contain the true population parameter with a certain level of confidence
Pr(L < µ < U) = 0.95
It has a confidence level of 95%
We say that it is a “95% confidence interval for µ”
uˆ ± c * se(µˆ)
Central Limit Theorem
States that the sampling distribution of a statistic will be approximately normal, regardless of the shape of the population distribution, as long as the sample size is large enough