Sampling and Estimation Flashcards
What is simple random sampling?
A method of selecting a sample in such a way that each item or person in the population being studied has the same probability of being included in the sample.
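As a rough illustration of the definition above, the sketch below draws a simple random sample using Python's standard library. The population of ID numbers, the sample size of 10, and the seed are all hypothetical values chosen for the example.

```python
import random

# Hypothetical population: ID numbers 1..100 (illustrative only).
population = list(range(1, 101))

# A fixed seed makes the draw repeatable; the value 42 is arbitrary.
rng = random.Random(42)

# random.sample draws without replacement, and every member of the
# population has the same chance of ending up in the sample.
sample = rng.sample(population, k=10)
print(sample)
```

Because the draw is without replacement, no ID can appear twice in the sample.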
What is a sampling distribution?
The distribution of all values that a sample statistic can take on when computed from samples of identical size randomly drawn from the same population.
What is sampling error?
- The difference between a sample statistic and its corresponding population parameter.
- Using sample data presents the risk that results found in an analysis do not represent the results that would be obtained from using data involving the entire population from which the sample was derived.
What is stratified random sampling?
- The population is divided into subgroups (strata) based on one or more distinguishing characteristics, and a random sample is drawn from each subgroup in proportion to its size, so that the sample has the same distribution of these characteristics as the overall population.
- Ex: Stratifying a population by age group
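One way to sketch proportional stratified sampling in Python is shown below. The function name `stratified_sample`, the dictionary layout, and the age-group sizes are assumptions made up for this example, not part of the flashcards.

```python
import random

def stratified_sample(strata, total_n, seed=0):
    """Draw a proportional stratified sample.

    strata: dict mapping stratum name -> list of members (hypothetical layout).
    total_n: desired overall sample size.
    """
    rng = random.Random(seed)
    population_size = sum(len(members) for members in strata.values())
    sample = {}
    for name, members in strata.items():
        # Allocate draws in proportion to the stratum's share of the population.
        # Rounding can make the total differ slightly from total_n in general.
        k = round(total_n * len(members) / population_size)
        sample[name] = rng.sample(members, k)
    return sample

# Hypothetical age-group strata of 500, 300, and 200 people.
strata = {
    "18-34": [f"a{i}" for i in range(500)],
    "35-54": [f"b{i}" for i in range(300)],
    "55+": [f"c{i}" for i in range(200)],
}
picked = stratified_sample(strata, total_n=100)
sizes = {name: len(members) for name, members in picked.items()}
print(sizes)  # {'18-34': 50, '35-54': 30, '55+': 20}
```

The sample sizes of 50, 30, and 20 mirror the 50% / 30% / 20% split of the hypothetical population.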
What is time series data?
Time-series data consists of observations taken at specific and equally spaced points in time.
What is cross-sectional data?
Cross-sectional data consists of observations taken at a single point in time.
What is the central limit theorem?
States that for a population with mean μ and finite variance σ², the sampling distribution of the sample mean, computed from all possible samples of size n, will be approximately normally distributed with mean μ and variance σ² / n when n is large (n ≥ 30 is the usual rule of thumb).
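The theorem can be checked informally with a small simulation. The sketch below assumes a deliberately non-normal population (exponential with rate 1, so μ = 1 and σ² = 1), and the choices n = 30, 5,000 repeated samples, and the seed are arbitrary.

```python
import random
import statistics

rng = random.Random(0)  # arbitrary seed for repeatability
n = 30                  # size of each sample
num_samples = 5000      # number of repeated samples

# Hypothetical skewed population: exponential with rate 1 (mean 1, variance 1).
sample_means = [
    statistics.fmean(rng.expovariate(1.0) for _ in range(n))
    for _ in range(num_samples)
]

mean_of_means = statistics.fmean(sample_means)
var_of_means = statistics.variance(sample_means)

# Theory predicts a mean close to mu = 1 and a variance close to sigma^2 / n.
print(f"mean of sample means:     {mean_of_means:.3f} (theory: 1.000)")
print(f"variance of sample means: {var_of_means:.4f} (theory: {1 / n:.4f})")
```

A histogram of `sample_means` should look roughly bell-shaped even though the underlying population is strongly skewed.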
What is the standard error of the sample mean?
The standard deviation of the distribution of the sample means.
What is the calculation for the standard error of the sample mean when the population standard deviation is known?
standard error of the sample mean = σ / √n, where σ is the population standard deviation and n is the sample size
What is the calculation for the standard error of the sample mean when the population standard deviation is unknown?
standard error of the sample mean = s / √n, where s, the sample standard deviation, is used because the population standard deviation is unknown, and n is the sample size
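In both cases the standard error is a standard deviation divided by the square root of the sample size. A minimal sketch follows; the function names and the input numbers are hypothetical and chosen only for illustration.

```python
import math
import statistics

def standard_error_known(sigma, n):
    """Standard error of the sample mean when the population sigma is known."""
    return sigma / math.sqrt(n)

def standard_error_unknown(sample):
    """Standard error estimated from the sample standard deviation s."""
    s = statistics.stdev(sample)  # sample standard deviation (n - 1 divisor)
    return s / math.sqrt(len(sample))

# Known sigma: 15 / sqrt(36) = 15 / 6
print(standard_error_known(sigma=15, n=36))  # 2.5

# Unknown sigma: estimate it from made-up observations.
observations = [2, 4, 4, 4, 5, 5, 7, 9]
print(standard_error_unknown(observations))
```

Note that `statistics.stdev` divides by n - 1, which is the usual definition of the sample standard deviation s.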
What are the three desirable properties of an estimator?
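- Unbiasedness (the expected value of the estimator equals the parameter being estimated, so estimation errors are not systematically positive or negative),
- Efficiency (the variance of its sampling distribution is smaller than that of any other unbiased estimator), and
- Consistency (the accuracy of the estimate improves as the sample size increases, because the variance of the sampling error decreases).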
What is a point estimate?
- Point estimates are single value estimates of population parameters.
- Ex: the sample mean is a point estimate of the population mean.
What is an estimator?
- An estimator is a formula used to compute a point estimate.
What is a confidence interval?
A range of values within which the actual value of the parameter is expected to lie with a given probability (the degree of confidence, 1 − α).
What is the calculation for the confidence interval?
confidence interval = point estimate ± (reliability factor × standard error)
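The formula above can be sketched in Python as follows. This example assumes the reliability factor comes from the standard normal distribution (a z-value), which is appropriate when the population standard deviation is known or the sample is large. The function name and the inputs (sample mean 80, σ = 15, n = 36) are hypothetical.

```python
import math
from statistics import NormalDist

def confidence_interval(point_estimate, standard_error, confidence=0.95):
    """Two-sided confidence interval using a z reliability factor (assumption)."""
    alpha = 1 - confidence
    # Reliability factor: the z-value that leaves alpha / 2 in each tail.
    z = NormalDist().inv_cdf(1 - alpha / 2)
    margin = z * standard_error
    return point_estimate - margin, point_estimate + margin

# Hypothetical inputs: sample mean 80, known sigma 15, sample size 36.
se = 15 / math.sqrt(36)  # standard error = 2.5
low, high = confidence_interval(80, se, confidence=0.95)
print(f"95% confidence interval: {low:.2f} to {high:.2f}")  # 75.10 to 84.90
```

With a small sample and an unknown population standard deviation, a t-distribution reliability factor would normally replace the z-value; that case is not covered by this sketch.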