Data Analysis Flashcards
what is effect size
the size of the outcomes from a determinant
what are the measures of average
mean - average measurement
median - midpoint of measurements
mode - the most common value
what are the measures of spread
range - extremes of all data
standard deviation - average spread of values around the mean
IQR - spread of values around the median
what are the directions of skew
positive skew is to the right
negative skew is to the left
what does normally distributed data measure
report mean and standard deviation
what does skewed data measure (discrete)
report median and inter quartile range
what is the relationship between mean and median in normally distributed data
roughly the same
what values are dependant on distribution
mode and range
what are the two ways of displaying data
skewed data - box plot - median, IQR and range
two continuous variables - scatterplot
how do you measure associations between categorical variables
use risk, risk ratio, odds ratio
what test would you use for a continuous outcome, or categorical exposure
t test or non-parametric equivalent
what measure of association would you use between continuous variables
correlation or lier regression
what is the difference between correlation and linear regression
c - association between two variables
lr - effect of one on the other
what is the definition of correlation and the two types
measure of linear association between two continuous variables (r)
persons - both variables normally distributed
spearman’s - either or both variables skewed
-1 = perfect negative linear relationship 0 = no linear relationship \+1 = perfect positive linear relationship
what are the pros and cons of correlation
simple method of association
order doesn’t matter
calculated between two variables only
assessment of straight line association only
cannot describe an exposure/outcome relationship or make predications