Exam 1 Flashcards
What is a population?
All possible units we would like to observe but cannot due to constraints such as time, money, and resources.
What is a sample?
A well-chosen subset of the population that we will study. Results obtained from a sample are only interesting because they can be used to understand the population.
What is an estimation?
The process of inferring an unknown quantity of a population using sample data.
A numerical summary calculated for the sample; always known.
What is a parameter?
Quantity describing a population, whereas an estimate is a related quantity calculated from a sample.
A numerical summary calculated for the population; always unkown
What makes a good sample?
Low sampling error (or variation)
-Sampling error is what causes this difference between the estimate and the parameter.
High precision
-Low sampling variation will translate to high precision
Low/No Bias
-Bias is how much the estimate varies from the parameter
Reduce Bias in sampling
What is random sampling?
When every unit in the population has an equal and independent chance of being in the sample. i.e. when every possible subset (sample) from the population is equally likely
What is a categorical variable?
A variable that takes values that fall into pre-specified categories or groups. Don’t have units. and no magnitude on numerical scale.
Ex. Sex chromosome genotype (XX, XY)
Name the two types of categorical variables and describe them.
Nominal: When the categories have no natural ordering.
Ex. Gender, eye color
Ordinal: When the categories have natural ordering.
Ex. Grade (A, B, C) or Size (S, M, L)
What is a numerical variable?
A variable that can be measured/counted. Always has units.
Ex. Core body temp.
What are the two types of numerical variables?
Continuous: Numerical data that take real number values
Ex. Height, Weight
Discrete: Numerical data that take integer values; that can be counted
Ex. Number of people in a household. number of chairs in a room.
What is a population?
Entire collection of individuals or units that a researcher is interested in.
Ex. all the genes in the human genome
What is a sample?
A much smaller set of individuals selected from the population.
Ex. a selection of 20 human genes.
What is sampling error?
The chance difference between an estimate and the population parameter being estimated
What is bias?
A systematic discrepancy between estimates and the true population characteristic
Volunteer Bias
bias resulting from a systematic difference between the pool of volunteers and the population to which they belong. Problem arises when the behavior of the subjects affects whether they are sampled.