Statistics Flashcards
Nominal vs Ordinal data
Nominal - classified into groups in an unordered manner and with no indication of relative severity (M/F, morality, disease state). Binary data can fall in here (only 2 options)
Ordinal - ranked in a specific order but with no consistent level of magnitude of difference between ranks (NYHA functional class: 1,2,3,4)
Continuous variables? Also what’s the difference between interval and ratio scaled?
counting variables.
Interval scaled: data ranked in a specific order with a consistent change in magnitude between units; the zero point is arbitrary (degrees farenheit)
Ratio scaled” like “interval” but with an absolute zero (degrees Kelvin, pulse, BP, time, distance).
Descriptive vs inferential statistics?
Descriptive - Used to summarize and describe data that are collected or generated in research studies, this is done both visually and numerically.
Inferential - Conclusions or generalizations made about a population (large group) from the study of a sample of that population
What are 4 commonly used types of descriptive statistics?
Frequency distribution, histogram, scatter plot, box plot
How do you measure central tendency?
Mean - used only for continuous and normally distributed data, very sensitive to outliers (tends toward the tail), most commonly used/well-understood. Should be given with SD.
Median - Midpoint of the values when placed in order from highest to lowest, half above and below. Used for ordinal or continuous data (especially for skewed populations). Insensitive to outliers. IQR. NOT SD.
Mode - most common value in a distribution, used for nominal, ordinal, or continuous data. Data may have > one mode (bimodal, trimodal), describes meaningful distributions with a large range of values.
What is standard deviation?
Measure of the variability about the mean, applied to continuous data that are normally distributed or transformed to be.
What is the empirical rule?
68% within +/- 1 SD, 95% within +/- 2 SD, 99% within +/- 3 SD
What is the coefficient of variation (CV) and variance?
CV relates mean to the SD (SD/mean X 100%), Variance = SD^2
What is the range?
Difference between the smallest and largest.
What are percentiles?
Point in a distribution which a value is larger than some % of the other values. IQR- percentile that describes the middle 50%, encompasses the 25th-75th percentile.
What measures of central tendency should be presented with continuous, interval scaled data?
The mean
What measures of central tendency should be presented with ordinal data?
Median
What is gaussian distribution?
Normal distribution, most common model for population distributions, bell shaped or symmetric.
How do we assess gaussian distribution?
Median - mean - most practical and easiest to use
What is the standard error of the mean (SEM)?
Estimate of the certainty that the calculated sample mean represents the true mean. Not variability in the sample. SD/sqrt(n).
Application- 95% confidence interval is around mean +/- 2 SEM