Exam 1 Flashcards
the science of collecting, describing, and analyzing data
statistics
subjects/objects we obtain information about in a data set
cases/units
any characteristic recorded for each case (columns in the data table)
variable
divides the cases into groups, placing each case into exactly one of two or more categories
categorical variable
measures or records a numerical quantity for each case
quantitative variable
helps explain or predict values of other variables
explanatory variable
gives the reason for a specific variable
response variable
what is a lurking or confounding variable?
a third variable that is not considered
ex: age of children not considered in the reading level/cavity data
includes individuals or objects of interest
population
subset of the population
sample
n =
sample
process of using data from a sample to gain information about the population
statistical inference
method of selecting a sample causes sample to differ from the population in some relevant way
sampling bias
each unit of a population has an equal change of being selected, regardless of the other units chose for the sample
simple random sample
difference between sampling bias and bias?
sampling bias impacts the sample
bias impacts the actual method of data collection
values of one variable tend to be related to the values of another variable
association
how does association and cause relate?
association does NOT imply a cause and effect relationship
changing the value of one variable influences the value of the other variable
causation/casually associated
_____ implies a particular direction and relationship holds an overall trend
causation
a study in which the researcher actively controls one or more of the explanatory variables
experiment
a study in which the researcher does not actively control the value of any variable but simply observes the values as they naturally exist
observational study
what does the word “improve” imply in a study?
causality, cannot happen in observational studies
a casual relationship can only be determined in what study?
experiment
the value of the explanatory variable for each unit is determined randomly, before the response variable is measured
randomized experiment
randomly assign cases to different treatment groups and then compare results on the response variables
randomized comparative experiment
each case gets both treatments in random order and examine individual differences in the response variable between 2 treatments
matched pairs experiment
a summary statistic that helps describe a variable
proportion
how to determine a proportion in a category =
number in that category / total number
proportion for a sample is denoted:
p-hat
p-hat =
proportion for a sample
proportion for a population is denoted:
p
p =
proportion for a population
used to show relationship between 2 categorical values
2 way table
an observed value that is notable distinct from the other values in a data set
outlier
a numerical average of the data values
mean
mean of a sample is denoted:
x-bar
x-bar =
mean of a sample
mean of a population is denoted:
mu
mu =
mean of a population