Week 5 - Sampling and random error & Week 6 - Statistical significance Flashcards
What is a sample?
A sample is a selected subset of a source population
Ideally should be representative of source population
What is a source population?
The source population is the group of all
individuals in which we are interested to assess some parameter(s)
What is sampling?
The process of selecting a number of
individuals from all individuals found in a source population
Many different sampling methods.
What is a sampling frame?
a list (or database) containing all
individuals in a population and is used for sampling
What are sampling units?
The sampling units are the individuals to be potentially
selected.
Sampling units most of the time are individual
people, but we could also have larger sampling units (i.e.
families, streets, hospitals, schools, etc.)
Who can be part of the source population?
The source population can be the general population (i.e.
the total population of a country or city), but can also be a
specific sub-population (i.e. all smokers of a country, all
patients with heart disease, all children with cancer, etc.)
Describe what the sample should represent for each type of research.
- In descriptive research (i.e. when we want to investigate
prevalence/incidence of a condition in a population), it is
particularly important that the sample accurately
represents the specific source population - In analytic research (i.e. when we investigate association
between exposure and outcome), we can be more general
regarding the source population, depending on the
research question of interest - In situations where we investigate a biological effect on
some disease (i.e. effect of smoking on risk of cancer), we
can be more general in identifying the source population
(i.e. not necessarily restricted to specific country/region)
4.In situations where we investigate social/cultural effects
(i.e. effect of social class on risk of heart disease), we have
to more careful and restrict the source population to the
specific country/region from where the sample was derived
What is an estimate?
In order to determine the proportion of a characteristic in a
population, we usually measure
that in a sample
Therefore what we measure is an ESTIMATE. This estimate
carries an inherent error (sampling error)
The sample estimate attempts to quantify the
corresponding population parameter
What is statistical inference?
*When the sample estimate is used to draw conclusions
(inferences) about the population from which the sample
was taken, this is called STATISTICAL INFERENCE
*Statistical inference, as the name suggests, involves the use
of statistics to determine the degree of uncertainty in the
estimate of interest
What is a parameter?
- A parameter is a measurement of a quantity (or association)
in a population, which we are interested about, e.g: - mean age
- prevalence of obesity
- mean difference in blood pressure between men and women
- Odds Ratio for association between smoking and cancer
Population parameter and sample estimate for any given variable is?
Sample estimate mean = 3.75
Population parameter = 3.72
What is sampling variation?
The difference (variation) between different sample
estimates derived from the same source population
What is sampling error?
The difference in magnitude between the sample estimates
and the actual population parameter caused by measuring a
quantity (or association) in a sample rather than in the
source population
Also called “random error”, because it depends on chance
What happens when you decrease sample size?
- Sampling variation = increase
- sampling error = increase
NB! All principles covered thus far apply for all measures of association (incidence, risk ratio, rate ratio, mean diff, correlation coefficient, regression coefficient) and all are termed ‘estimates’ calculated from ‘sample’.
What is sampling distribution?
All the samples calculated plotted on a histogram