Module 1: Descriptive Statistics & Estimation Flashcards
Define sample.
A sample is a representative portion of a population that makes statistical analysis practical. However, sampling brings uncertainty since it is not a true representation of the entire population.
Define statistics.
Statistics is the study of methods for measuring aspects of populations from samples and for quantifying the uncertainty of these measurements.
Define estimation.
Estimation is the process of inferring an unknown quantity of a target population using sample data.
Define population parameter.
Parameters are quantities that describe some truth about a population, such as averages and proportions. Statistics involves the estimation of these parameters.
Define sample statistic.
A sample statistic is any value calculated from a sample.
Define statistical population.
A population is the entire collection of individual units of interest.
Define sampling unit.
A sampling unit is whatever basic unit the data points of a sample are defined by. This may be actual individuals of 1, or individual groups or selections.
Define sampling error.
Sampling error refers to the chance difference between an estimate describing a sample and the corresponding parameter of the whole population.
What is the goal of sampling?
Sampling aims to increase the precision and accuracy of estimates and to ensure that it is possible to quantify these outcomes.
Define precision.
Precision refers to how consistent estimates are when compared to each other. Larger samples are less affected by chance, therefore allowing for more precise estimates.
Define accuracy.
Accuracy refers to how close estimates are to the true population characteristic. If an estimate is unbiased, estimates will be more accurate.
Define bias.
Bias is a systematic discrepancy between estimates obtained repeatedly and the true population characteristic.
Define random sample.
Random samples contain individuals of a population that have had an equal and independent chance of being selected. The use of random samples minimizes bias and allows the sampling error to be measured.
Define independent observation.
Independent observation assumes that no value from a sample can be inferred from any other value.
Define sample of convenience.
A sample of convenience is a collection of individuals that are easily available for the researcher, but very likely to be biased and not selected independently. A sample of convenience can be a random sample, but is likely not.
Define volunteer bias and provide examples.
Volunteer bias is the systematic differences between volunteer samples and their populations, which occurs when the behaviour of subjects impacts their chance of being sampled.
ex. sicker volunteers for medical studies, liberal volunteers for sex studies, animals willing to be trapped
Define variables.
Variables are any characteristics or measurements that differ across a group of individuals. Estimates are technically variables, since they differ across samples.
Define descriptive statistics.
Descriptive statistics are quantities that capture important features of frequency distributions, which is done by measuring the location and spread of a given distribution.
Define cumulative frequency distribution.
Cumulative frequency distributions show all the quantiles within a sample on a graph.