Biostatistics Flashcards
What is continuous data?
numerical data in which the magnitude from one value to the next is equal (HR, age, height, degrees C and F)
What is discrete/categorical data?
- Nominal- arbitrary order (gender, mortality, ethnicity, marital status)
- Ordinal- logical order but magnitude from one value to the next is not equal (1-10 pain scale, NYHA functional class)
What are measures of central tendency?
a single typical (central) value used to summarize a set of data:
1. mean
2. median
3. mode
What is mean and when is it preferred?
- average value
- continuous data that is normally distributed
What is median and when is it preferred?
- middle value
- ordinal or continuous data that is skewed
What is mode and when is it preferred?
- most frequent value
- nominal data
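
A minimal sketch of the three measures, using Python's standard statistics module. The heart-rate and marital-status values are invented for illustration and are not from any study.

```python
import statistics

# Hypothetical heart-rate readings (beats per minute), invented for illustration.
heart_rates = [60, 62, 62, 70, 71, 75, 90]

mean_hr = statistics.mean(heart_rates)      # average value
median_hr = statistics.median(heart_rates)  # middle value of the sorted data
mode_hr = statistics.mode(heart_rates)      # most frequent value

# Hypothetical nominal data: only the mode is meaningful here.
marital_status = ["married", "single", "married", "divorced"]
mode_status = statistics.mode(marital_status)

print(mean_hr, median_hr, mode_hr, mode_status)
```

Nominal categories have no numeric magnitude or order, so a mean or median cannot be computed for them, which is why the mode is the preferred measure for nominal data.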
How is the variability of data (spread) described?
- range
- standard deviation
What is the range?
difference between the highest and lowest values
What is standard deviation?
indicates how spread out the data is and to what degree the data is dispersed away from the mean (highly dispersed data have a larger SD)
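
A short sketch of both measures of spread, assuming a small made-up data set. The card does not say whether the sample or the population SD is meant, so both are shown; the variable names are illustrative only.

```python
import statistics

# Hypothetical values, invented for illustration.
values = [2, 4, 4, 4, 5, 5, 7, 9]

# Range: difference between the highest and lowest values.
data_range = max(values) - min(values)

# Sample SD divides by n - 1; population SD divides by n.
sample_sd = statistics.stdev(values)
population_sd = statistics.pstdev(values)

print(data_range, round(sample_sd, 3), round(population_sd, 3))
```

The sample SD is slightly larger than the population SD because it divides by n - 1 rather than n.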
What are the characteristics of Gaussian (normal) distribution?
- large sets of continuous data
- normal/symmetrical bell-shaped curve
- mean, median, and mode are the same value at the center point of the curve
- 68% of data fall within 1 SD of the mean; 95% of data fall within 2 SD of the mean
- half of the values are on the right and the other half on the left with a small number of data in the tails
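
The 68% and 95% figures can be checked against the theoretical normal curve. This is a sketch using statistics.NormalDist (Python 3.8+) with a standard normal distribution (mean 0, SD 1) chosen as an assumption; any mean and SD give the same proportions.

```python
from statistics import NormalDist

# Standard normal distribution: mean 0, SD 1 (assumed for illustration).
nd = NormalDist(mu=0, sigma=1)

# Proportion of data within 1 SD and within 2 SD of the mean.
within_1_sd = nd.cdf(1) - nd.cdf(-1)
within_2_sd = nd.cdf(2) - nd.cdf(-2)

# Symmetry: half of the values lie on each side of the mean.
left_half = nd.cdf(0)

print(round(within_1_sd, 4), round(within_2_sd, 4), left_half)
```

The computed proportions are approximately 0.68 and 0.95, which the card rounds to 68% and 95%.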
What are the characteristics of a skewed distribution?
- not symmetrical
- data is skewed toward the outliers (extreme high values skew the distribution to the right; extreme low values skew it to the left)
- the 68% rule does not hold: 68% of values do not necessarily fall within 1 SD of the mean
- mean, median and mode are not the same value
- usually occurs when the number of values (sample size) is small and/or there are outliers
What are the characteristics of outliers?
- in a small sample, outliers have a large effect on the mean
- median is a better measure of central tendency when outliers are present
- distortion from an outlier is decreased by increasing the sample size
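
The effect of a single outlier can be sketched as follows. The numbers are made up for illustration: the same extreme value (50) is added to a small sample and to a larger one.

```python
import statistics

# Hypothetical small sample with one extreme high value (50).
small_sample = [4, 5, 5, 6, 50]

# Hypothetical larger sample: the same typical values repeated, plus the same outlier.
large_sample = [4, 5, 5, 6] * 25 + [50]

small_mean = statistics.mean(small_sample)
small_median = statistics.median(small_sample)
large_mean = statistics.mean(large_sample)
large_median = statistics.median(large_sample)

print(small_mean, small_median)
print(round(large_mean, 2), large_median)
```

In the small sample the outlier pulls the mean well above the typical values while the median stays put; in the larger sample the mean moves much closer to the median.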
What is a variable?
any data point that can be measured or counted
What is an independent variable?
variable that is changed (manipulated) by the researcher to determine its effect on the dependent variable (the outcome)
What are examples of dependent variables?
- A1c
- cholesterol values
- mortality
- a dependent variable is the outcome affected by the independent variable