Chapter 11 statistical sampling Flashcards
Mean
Sum of the numbers in a data set divided by the total number of values in the data set. Average. Best used in data set with numbers that are close together.
Median
Midpoint value of a data set, where the values are arranged in ascending or descending order. Better with a data set with outliers.
Random Variable
A variable that describes all of the possible outcomes of a random process. For example if you have X for a coin flip, then X=1 when it is heads and X=0 when it is tails.
Discrete
The total number of possible outcomes is countable. An example is heads or tails.
Continous
The total number of possible outcomes is uncountable. An example is time measurements.
Probability Density Function
A continuous probability distribution function. This means that for any measurement x sub 1, there exists a corresponding value for f(x sub1).
Empirical Probabilities
Are probabilities generated from data.
Expected VAlue
Also known as the mean or average of the probability distribution. Can be thought of as the outcome we should expect on average.
E(x)=Sum(x*P(x))
Random Sampling
A method of choosing an equally distributed subset from a larger population. There is simple random samples, stratified random samples, cluster sampling, and systematic random samples
Sampling
A part of a population used to describe the whole group.
Population
All members of a specified group.
Simple Random Sampling
A type of random sampling where the variables have an equal, and unsystematic, chance of selection. Best used when a researcher does not know a lot about the demographics in the population.
Stratified Random Sampling
Divide members of a population into ‘strata’ or homogeneous subgroups. Different in that you seperate the population into groups first. Stratified random sampling cannot have crossover. Stratified random samples must include all members of a population.
Example is splitting up a high school with freshman, sophomore, junior, and senior students to then decide how many of each group is needed to take a sample. Best used when research is familiar with the demographics. No more then four to six strata is recommended but you can have as many as you want.
Cluster Random Samples
The sampling method where different groups within a population are used as a sample. Cluster cannot have crossover and must include all members of a population. Unlike stratified, the cluster sampling does not have to have an equal selection from each group but must be as close to the same size as possible. Use this when the entire population is unclear or unknown.
Systematic Random Sampling
Requires selecting samples based on a system of intervals in a population. For example selecting very 4th customer in a movie theater. Can only do this if the population is homogenous with a randomized list.