Test Flashcards
Categorical
Most common: graphical display
Pie chart
Bar chart
Pictograms
Frequency tables
Numerical summaries: category counts and percentages
Quantitative
Histogram: [ = endpoint included, ( = endpoint not included
Stem plot
Box plot (five-number summary)
- A longer or shorter quartile section shows how spread out the data are, not that there is more or less data
Mean
Average
Median
Middle
Mode
Most often occurring
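The three measures of center above can be checked with a short Python sketch. The data set is hypothetical and chosen only for illustration; the functions come from Python's standard `statistics` module.

```python
import statistics

# Hypothetical data set, chosen only for illustration.
scores = [2, 3, 3, 5, 7, 10]

mean = statistics.mean(scores)      # sum of the values divided by how many there are
median = statistics.median(scores)  # middle value (average of the two middle values here)
mode = statistics.mode(scores)      # most frequently occurring value

print("mean:", mean, "median:", median, "mode:", mode)
```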
Standard Deviation
Use w/ symmetry and mean
68% fall w/in 1 SD of the mean
95% fall w/in 2 SD of the mean
99.7% fall w/in 3 SD of the mean
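These percentages come from the normal (bell-shaped) curve. A minimal sketch, assuming the data follow a normal distribution, recovers them with `statistics.NormalDist` (Python 3.8 or later); the variable names are only illustrative.

```python
from statistics import NormalDist

# Assumption: a standard normal distribution (mean 0, SD 1).
standard_normal = NormalDist(mu=0.0, sigma=1.0)

# Share of values within k standard deviations of the mean, for k = 1, 2, 3.
within = {
    k: standard_normal.cdf(k) - standard_normal.cdf(-k)
    for k in (1, 2, 3)
}

for k, share in within.items():
    print(f"within {k} SD: {share:.1%}")
```

The third value is closer to 99.7% than to 99%, which is why the rule is often called the 68-95-99.7 rule.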
IQR
Interquartile range (Q3 - Q1)
Gives us the middle 50%
Used w/skewed data and median
1.5 IQR
Used to detect outliers
Lower cutoff: Q1 - 1.5(IQR)
Upper cutoff: Q3 + 1.5(IQR)
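The 1.5 IQR rule can be sketched in Python as follows. Note that the upper cutoff adds 1.5 IQR to Q3. The data are hypothetical, the function names are made up for this example, and the quartiles are taken as the medians of the lower and upper halves, which is one common textbook convention (software may use slightly different quartile rules).

```python
import statistics

def quartiles(values):
    """Return (Q1, Q3) as the medians of the lower and upper halves."""
    data = sorted(values)
    half = len(data) // 2  # the middle value is left out when the count is odd
    q1 = statistics.median(data[:half])
    q3 = statistics.median(data[len(data) - half:])
    return q1, q3

def outlier_fences(values):
    """Return the lower and upper cutoffs of the 1.5 IQR rule."""
    q1, q3 = quartiles(values)
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

# Hypothetical data with one suspiciously large value.
data = [1, 3, 4, 5, 6, 7, 8, 9, 30]
low, high = outlier_fences(data)
outliers = [x for x in data if x < low or x > high]
print("cutoffs:", low, high, "outliers:", outliers)
```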
Explanatory Variable
X
Variable that claims to explain, predict or affect the response
Response variable (Y)
Outcome of the study
C — Q
Box plots
C — C
Two way tables / contingency table
Q — C
Conditional percentage tables
Q — Q
Scatter plot
Positive: increase in X = increase in Y
Negative: increase in X = decrease in Y
U shape = neither positive nor negative
r= linear correlation coefficient
( -1 to 1 )
0 to -1 = neg relationship
0 to +1 = pos relationship
Measures strength and direction of linear relationship
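A minimal sketch of how r can be computed by hand in Python. The helper name `pearson_r` and the three small data sets are assumptions made for illustration. The last data set is U-shaped, and its r comes out as 0 even though X and Y are clearly related, because r only measures linear relationships.

```python
import math

def pearson_r(xs, ys):
    """Linear correlation coefficient r between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data sets.
x = [1, 2, 3, 4, 5]
y_positive = [2, 4, 6, 8, 10]   # Y rises as X rises
y_negative = [10, 8, 6, 4, 2]   # Y falls as X rises
y_u_shape = [4, 1, 0, 1, 4]     # Y = (X - 3)^2, a U shape

r_positive = pearson_r(x, y_positive)
r_negative = pearson_r(x, y_negative)
r_u_shape = pearson_r(x, y_u_shape)
print(r_positive, r_negative, r_u_shape)
```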
Simpson's Paradox
When a lurking variable causes us to rethink the direction of an association
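A small numeric sketch may make Simpson's paradox easier to see. All counts below are hypothetical: treatment A has the higher success rate within each severity group, yet treatment B looks better once the groups are combined, because severity (the lurking variable) is unevenly split between the treatments.

```python
# Hypothetical counts: (successes, patients) for each treatment and severity group.
counts = {
    "A": {"mild": (90, 100), "severe": (300, 500)},
    "B": {"mild": (400, 500), "severe": (50, 100)},
}

def group_rate(treatment, group):
    """Success rate within one severity group."""
    successes, patients = counts[treatment][group]
    return successes / patients

def overall_rate(treatment):
    """Success rate with the severity groups combined."""
    successes = sum(s for s, _ in counts[treatment].values())
    patients = sum(n for _, n in counts[treatment].values())
    return successes / patients

for group in ("mild", "severe"):
    print(group, group_rate("A", group), group_rate("B", group))
print("overall", overall_rate("A"), overall_rate("B"))
```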
Population
Entire group of interest, from which the sample is drawn
Sampling Frame
List of individuals from which the sample is actually drawn
Sample
Actual individuals chosen for sample
Simple Random Sample
Individuals sampled at random without replacement.
Selecting names out of a hat
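Drawing names out of a hat can be sketched with `random.sample`, which picks without replacement so nobody is chosen twice. The names and the seed are hypothetical; the seed is fixed only so the sketch is repeatable.

```python
import random

# Hypothetical sampling frame: the names that go into the hat.
names = ["Ana", "Ben", "Chen", "Dara", "Eli", "Fay", "Gus", "Hana"]

rng = random.Random(42)          # fixed seed so the draw is repeatable
sample = rng.sample(names, k=3)  # 3 names drawn at random, without replacement

print(sample)
```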
Cluster sample
Used when population is naturally divided into groups; whole groups are chosen at random
Students in university divided into majors
Stratified sample
Used when population is naturally divided into subpopulations; a random sample is taken from each one
Students in certain college divided by gender or year in college
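The difference between the two designs is easier to see side by side. In this sketch the roster, majors, and years are all hypothetical. The cluster sample picks whole majors at random and keeps everyone in them; the stratified sample takes a small random sample from every year.

```python
import random
from collections import defaultdict

rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Hypothetical roster of 64 students: (student id, major, year).
majors = ["Biology", "History", "Math", "Physics"]
years = ["Freshman", "Sophomore", "Junior", "Senior"]
students = [(i, majors[i % 4], years[(i // 4) % 4]) for i in range(64)]

def group_by(rows, index):
    """Group rows by the value found at position `index`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[index]].append(row)
    return groups

# Cluster sample: randomly choose 2 whole majors and keep every student in them.
by_major = group_by(students, 1)
chosen_majors = rng.sample(sorted(by_major), 2)
cluster_sample = [s for major in chosen_majors for s in by_major[major]]

# Stratified sample: draw 3 students at random from every year.
by_year = group_by(students, 2)
stratified_sample = [s for year in sorted(by_year) for s in rng.sample(by_year[year], 3)]

print(len(cluster_sample), len(stratified_sample))
```

Every year is guaranteed to appear in the stratified sample, while the cluster sample leaves out the majors that were not chosen.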