Ch. 1: Explore/Understand Data and Graphs Flashcards
What is a population?
the collection of ALL data values of interest (usually large, unknown, and impossible to obtain in full)
What is a sample?
a subset of the population whose characteristics we can actually measure
What is a sample size?
number of observations in a sample (n)
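
A minimal sketch of the three cards above (population, sample, sample size), using only Python's standard library. The heights are made-up values, and the population size of 10,000 and sample size of 50 are assumptions chosen for illustration.

```python
import random

# Hypothetical population: heights in cm for 10,000 people (made-up values).
random.seed(0)  # fixed seed so the sketch is reproducible
population = [random.gauss(170, 10) for _ in range(10_000)]

# A sample is a subset of the population: here, 50 values drawn without replacement.
sample = random.sample(population, k=50)

# The sample size n is the number of observations in the sample.
n = len(sample)
print(n)  # 50
```

In practice the full population list is usually not available; only the sample is, which is why characteristics are measured on the sample.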
Observational units
are ROWS: individual cases about whom (or which) we record some characteristics
Variables
are COLUMNS: characteristics of a person or thing that can be assigned a number or a category
Types of Variables
Categorical and Numerical variables
Categorical Variable
aka Qualitative: names categories and answers questions about how cases fall into those categories. Data values are usually words
Numerical Variable
aka Quantitative: a measured variable with units that answers questions about the quantity of what is measured. Data values are numbers
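
The cards on observational units, variables, and variable types can be sketched together. The data set below is hypothetical, and `variable_type` is an illustrative helper (not a standard function) that applies a rough rule: a column is numerical if every value is a number.

```python
# Hypothetical data set: each dict is one observational unit (a row),
# and each key is a variable (a column).
rows = [
    {"name": "Ana", "major": "Biology", "height_cm": 162.0},
    {"name": "Ben", "major": "History", "height_cm": 180.5},
    {"name": "Cai", "major": "Biology", "height_cm": 171.2},
]

def variable_type(values):
    """Label a column numerical if every value is a number, else categorical."""
    is_number = all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in values
    )
    return "numerical" if is_number else "categorical"

variables = list(rows[0].keys())
types = {var: variable_type([row[var] for row in rows]) for var in variables}
print(types)
# {'name': 'categorical', 'major': 'categorical', 'height_cm': 'numerical'}
```

The rule is only a heuristic: numbers used as labels (such as ZIP codes or jersey numbers) are still categorical, because they name categories rather than measure a quantity.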
What are frequencies?
(or counts) the number of observations that fall into each category; a natural way to summarize categorical variables
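
A short sketch of computing frequencies for one categorical variable, assuming a made-up list of majors. `collections.Counter` from the standard library does the counting; the relative frequencies are an extra step added here for illustration.

```python
from collections import Counter

# Hypothetical values of one categorical variable.
majors = ["Biology", "History", "Biology", "Math", "Biology", "History"]

# Frequency (count) of each category.
frequencies = Counter(majors)
print(frequencies["Biology"])  # 3

# Relative frequency: each count divided by the number of observations.
relative = {category: count / len(majors) for category, count in frequencies.items()}
print(relative["Biology"])  # 0.5
```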
What are Two-Way Tables?
tables that display counts for two categorical variables at once; each cell counts the cases that fall into one category of the first variable and one category of the second
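
One way to build a two-way table by hand, sketched with the standard library. The eight (group, outcome) cases are invented for illustration, and the column widths are arbitrary.

```python
from collections import Counter

# Hypothetical cases: each tuple holds the values of two categorical variables.
cases = [
    ("treatment", "improved"), ("treatment", "improved"),
    ("treatment", "improved"), ("treatment", "not improved"),
    ("control", "improved"), ("control", "not improved"),
    ("control", "not improved"), ("control", "not improved"),
]

# Each cell of the table counts the cases with one (group, outcome) combination.
cell_counts = Counter(cases)

groups = ["treatment", "control"]
outcomes = ["improved", "not improved"]

# Print a header row, then one row of counts per group.
print(f"{'':<12}" + "".join(f"{o:>14}" for o in outcomes))
for g in groups:
    print(f"{g:<12}" + "".join(f"{cell_counts[(g, o)]:>14}" for o in outcomes))
```

Rows here are the groups and columns are the outcomes; swapping them gives the same counts in a transposed layout.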
What does it mean to establish Causality?
means to show that an outcome is caused by (not merely related to) some treatment
Treatment Group?
those who receive the treatment in an experiment
Control Group
those who do NOT receive the treatment in an experiment
What does it mean if two things are Associated?
when an outcome occurs more often in one group than in the other (treatment and outcome are then said to be associated)
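
A rough sketch of checking for association by comparing how often the outcome occurs in each group. The counts are hypothetical, and `improved_rate` is an illustrative helper name.

```python
# Hypothetical two-way table of counts: group -> outcome -> count.
table = {
    "treatment": {"improved": 30, "not improved": 20},
    "control": {"improved": 20, "not improved": 30},
}

def improved_rate(group):
    """Proportion of cases in the group whose outcome is 'improved'."""
    counts = table[group]
    return counts["improved"] / sum(counts.values())

treatment_rate = improved_rate("treatment")  # 30 / 50
control_rate = improved_rate("control")      # 20 / 50

# The outcome is associated with the treatment if the rates differ.
associated = treatment_rate != control_rate
print(treatment_rate, control_rate, associated)  # 0.6 0.4 True
```

Comparing proportions rather than raw counts matters when the groups differ in size. An association found this way does not by itself establish causality.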
What is a confounding variable?
a characteristic other than the treatment that differs between the treatment and control groups and could itself explain the difference in outcomes
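
A made-up illustration of confounding, assuming age group is the extra characteristic. All counts are hypothetical, chosen so that the treatment group is mostly young; `rate` is an illustrative helper.

```python
# Hypothetical counts: (age_group, group) -> (number improved, number of people).
counts = {
    ("young", "treatment"): (32, 40),
    ("young", "control"): (8, 10),
    ("old", "treatment"): (4, 10),
    ("old", "control"): (16, 40),
}

def rate(group, age_group=None):
    """Proportion improved in a group, optionally restricted to one age group."""
    selected = [
        value for (age, g), value in counts.items()
        if g == group and (age_group is None or age == age_group)
    ]
    improved = sum(i for i, _ in selected)
    total = sum(t for _, t in selected)
    return improved / total

# Overall, the treatment group improves more often: 36/50 vs 24/50.
print(rate("treatment"), rate("control"))  # 0.72 0.48

# Within each age group the rates are identical, so age (not the treatment)
# accounts for the overall difference in this made-up data.
for age in ("young", "old"):
    print(age, rate("treatment", age), rate("control", age))
```

Here age differs between the two groups and is related to the outcome, so the overall association between treatment and outcome cannot be read as causal.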