Lecture 2-Sampling Flashcards
Why a sample instead of a census?
Impossibility of conducting a census due to
time, cost, lack of access to the entire
population etc.
Sample Objectives
To be as representative as possible of the
underlying population
* Why is this important?
* Because we are going to generalise from the
sample to the population; we are going to use the information from the sample and assume
that it is true of the entire population
Sampling Approaches
Random sampling
– Simple random sampling
– Stratified random sampling
– Cluster sampling
* Non-Random sampling
– Quota sampling
Census
a survey that includes every element
of the target population
Sample survey
a survey that collects
information from a portion of the population
Sampling Frame
includes all the members of the population who are eligible to be part of the sample
Examples
– The population census data
– Telephone directory
– Students registration database at the UWI
Sampling with Replacement or without
A sample may be selected with replacement or without replacement.
* In sampling with replacement, each time we select an element
from the population, we put it back in the population before we select the next element of the sample
* Therefore, the population contains the same number of items each time a selection is made, and we can also select the same
item more than once
* Sampling without replacement occurs when the selected
element is not replaced in the population
* Therefore, each time we make a selection, the size of the population is reduced by one element. We cannot select the same item more than once.
* Most times, sample taken in statistics are without replacement.
Random vs Non-Random Samples
- A sample may be random or nonrandom.
- A random sample is a sample that has been drawn so that each element of the population has a measureable chance of being selected for the sample.
- If a sample does not assign a chance of being included, to a specific set of elements within the
population, it is nonrandom. - It is important to note that the randomness is in the procedure and any corruption of that
procedure is likely to “corrupt” the randomness, thereby introducing selection bias.
Representative Sample
A sample that represents the characteristics of the population as closely as possible
Sample Frames and Random Sampling
No matter what random sampling design is selected, the actual implementation of the exercise and the explicit choice of the individual members of the sample cannot be carried out without a sampling frame.
* Sampling frames may themselves be responsible for many of the inaccuracies of the resultant sample.
* For instance, if the frame is incomplete (which
immediately excludes some members of the population from the sample), if the frame is inaccurate, or if the frame is out of date.
Notwithstanding these limitations, a sampling frame is an immediate requirement for the selection of a sample, and no actual frame will possess all of the characteristics of an ideal sampling frame. What an investigator must consider at this point, therefore, is all the frames that are available and their limitations, and in this context select the frame that will enable the most complete, accurate and convenient sample of the population under study.
Explanation of Random Sampling and Sampling Frames
Suppose, for example, that I want to determine the average age of the statistics class of 300 students by using a random sample of 10 students drawn from the class.
The random nature of the selection procedure does not necessarily ensure a truly representative sample since, for instance, it is quite within the realm of possibility to draw the ten youngest members of the class. Using the average age of these 10 as the average
age of the entire class is therefore erroneous.
Simple Random Sample
is a sample that has
been drawn so that each element of the
population has an equal chance of being
selected for the sample.
Advantages of Simple Random Sample
Simple random samples enjoy the principal
advantage that they eliminate selection bias.
* Furthermore, the elimination of selection bias
in Simple Random Sample has to do with the mode of selection and has nothing to do with the
representativeness of the resultant
sample.
Disadvantages of Simple Random Sampling
Simple random samples can be very costly, for instance when the population being sampled is
distributed across a wide (geographical) area.
* Taking a simple random sample does not automatically ensure that the results obtained are reliable and there is the possibility that a sample selected by a simple random method would be a bad or imprecise indicator of the population from which it was taken, so leading to inaccurate estimates.
* In particular, when a population is stratified, simple random sampling can result in a sample
that is comprised of elements from just one or two strata; in which case the sample will not be
representative of the population.
Stratified Random Sampling- explanation
Suppose I was interested in the average age of a student in ECON 1005
– What is my population?
– Suppose I take this class as my sample, and find the average age
– Will be sample be representative of the population?
– What has gone wrong?
* The stratified random sampling design involves the division of the population into various
categories or strata (singular stratum) using what is known as a stratification factor.
* The stratification factor must be chosen so that the strata are mutually exclusive. i.e. Each
member of the population must be assigned to exactly one stratum.
* A simple random sample is then drawn from each stratum to ensure adequate representation of all strata in the sample.
* The collection of simple random samples constitutes the stratified random sample.
Stratified Random Sample-definition
Stratified Random Sampling is divided into st rara based on shared characteristics for example age and gender, a random sample is then drawn from each stratum
*
Advantages of Stratified Random Sampling
The stratified random sampling design makes use of knowledge of the population to increase the precision of the results obtained from the
sample.
* The chance of any individual being drawn is still measurable and all
possible samples of equal size still have the same chance of selection. So, although the choice of
the stratification factor is almost entirely dependent on human judgment, the procedure still maintains an element of randomness to it, especially since the mode of selection within each stratum is clearly random.
* This method can significantly reduce the cost of sampling, therefore,
because it can achieve accurate results from smaller samples.
* One advantage of stratification is that, besides facilitating the acquisition of
information about the entire population, we can also make inferences within each stratum or compare strata.
Limitations from Stratified Random Sampling
One important consequence of stratifying the
population is that, now, each element of the population no longer has an equal chance of
being drawn.
Best results from stratified random sampling
Stratified random sampling produces its best
results when the variation within each stratum/group is small compared to the variation
between strata/groups.
* When the within-group variation is small, it will
provide results nearly identical to those of Simple
random sampling.
* Go back to the examples of age of an ECON 1005
student, and height of an ECON 1005 student, and the choice of the stratification factors.
Describe the within-group and between-group
variations.
32