Statistics Flashcards
- OpenStax Introductory Statistics - Introduction to Statistics 4E, Freedman
Average
A number that describes the central tendency of the data
average = sum of entries / number of entries
Blinding
Not telling participants which treatment a subject is receiving
Categorical Variable
Variables that take on values that are names or labels
Cluster Sampling
A method for selecting a random sample and dividing the population into groups (clusters); use simple random sampling to select a set of clusters. Every individual in the chosen clusters is included in the sample.
Continuous Random Variable
A random variable (RV) whose outcomes are measured; the height of trees in the forest is a continuous RV.
Control Group
A group in a randomized experiment that receives an inactive treatment but is otherwise managed exactly as the other groups
Convenience Sampling
A nonrandom method of selecting a sample; this method selects individuals that are easily accessible and may result in biased data.
Cumulative Relative Frequency
The term applies to an ordered set of observations from smallest to largest. The cumulative relative frequency is the sum of the relative frequencies for all values that are less than or equal to the given value.
Data
A set of observations (a set of possible outcomes); most data can be put into two groups: qualitative (an attribute whose value is indicated by a label) or quantitative (an attribute whose value is indicated by a number). Quantitative data can be separated into two subgroups: discrete and continuous. Data is discrete if it is the result of counting (such as the number of students of a given ethnic group in a class or the number of books on a shelf). Data is continuous if it is the result of measuring (such as distance traveled or weight of luggage)
Double-blind experiment
An experiment in which both the subjects of an experiment and the researchers who work with the subjects are blinded
Triple-blind experiment
An experiment in which both the subjects of an experiment, researchers who work with the subjects, and analysts who analyze the data are blinded
Experimental Unit
Any individual or object to be measured
Explanatory Variable
The independent variable in an experiment; the value controlled by researchers
Frequency
The number of times a value of the data occurs
Institutional Review Board
A committee tasked with oversight of research programs that involve human subjects
Informed Consent
Any human subject in a research study must be cognizant of any risks or costs associated with the study. The subject has the right to know the nature of the treatments included in the study, their potential risks, and their potential benefits. Consent must be given freely by an informed, fit participant.
Lurking Variable
A variable, not included in experiment, that has an effect on a study even though it is neither an explanatory variable nor a response variable
Confounding Variable
Difference between the treatment and control groups - other than the treatment - which affects the responses being studied. A third variable, associated with both the dependent and response variables.
“The idea is a bit subtle: a gene that causes cancer but is unrelated to smoking is not a confounder and is sideways to the argument”
Gene needs to A) cause cancer AND B) get people to smoke
Sometime controlled for by cross-tabulation
How is a Lurking Variable different from a Confounding Variable?
Lurking = Unknown or unconsidered
Confounding = Known but not controlled for
Nonsampling Error/Systematic Error/Bias
An issue that affects the reliability of sampling data other than natural variation; it includes a variety of human errors including poor study design, biased sampling methods, inaccurate information provided by study participants, data entry errors, and poor analysis.
Numerical Variable
Variables that take on values that are indicated by numbers
Population Parameter
A number that is used to represent a population characteristic and that generally cannot be determined easily
Placebo
An inactive treatment that has no real effect on the explanatory variable
Population
All individuals, objects, or measurements whose properties are being studied