Biostats All Flashcards
observation
aka record; a row in a table of data. It represents one person
variable
is a column in a table of data. It contains information about one characteristic of the person (race/gender/DOB)
quantitative/continuous variables examples and definitions
- ratio-scale: an interval variable with a true zero point (height, BP, duration of illness, number of children)
- interval: value on a scale of equally spaced units with no true zero point (DOB, temperature in °C or °F)
qualitative/categorical variables examples and definitions
- nominal: values with no numerical ranking (residence). These can be dichotomous variables (alive/dead, smoker/non-smoker)
- ordinal: has values that can be ranked but are not evenly spaced (stage of cancer, education level, BMI category)
properties of frequency distributions are
- central location (where the distribution has its peak)
- spread (how widely it is dispersed on both sides of the peak)
- shape (is it symmetrical on both sides of the peak)
how do you describe the central location
mean
median
mode
how do you describe spread
range
interquartile range
standard deviation
when is a graph positively skewed
when its central location is to the left and its tail is to the right (aka graph is skewed to the right)
what is the IQR
it represents the central portion of the distribution, from the 25th to the 75th percentile
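A minimal Python sketch of the IQR, assuming the standard library's statistics.quantiles with its default method. Quartile conventions differ between textbooks and software, so other tools may give slightly different cut points. The function name iqr and the sample data are illustrative only.

```python
import statistics

def iqr(values):
    """Return the interquartile range: 75th percentile minus 25th percentile."""
    # quantiles(..., n=4) returns the three quartile cut points: Q1, Q2 (median), Q3
    q1, _median, q3 = statistics.quantiles(values, n=4)
    return q3 - q1

# Hypothetical data, for illustration only
sample = [1, 2, 3, 4, 5, 6, 7, 8]
print(iqr(sample))
```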
how to calculate standard deviation
Calculate the arithmetic mean.
Subtract the mean from each observation.
Square the difference. Sum the squared differences.
Divide the sum of the squared differences by n–1.
Take the square root of the value obtained.
The result is the standard deviation.
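The steps above can be sketched in Python as follows. This is an illustration of the n-1 (sample) version described in the card; the function name sample_sd and the data are made up for the example.

```python
import math
import statistics

def sample_sd(values):
    """Sample standard deviation, following the card's steps one by one."""
    n = len(values)
    mean = sum(values) / n                    # 1. arithmetic mean
    differences = [x - mean for x in values]  # 2. subtract the mean from each observation
    squared = [d ** 2 for d in differences]   # 3. square each difference
    sum_of_squares = sum(squared)             #    and sum the squared differences
    variance = sum_of_squares / (n - 1)       # 4. divide by n-1 (this is the variance)
    return math.sqrt(variance)                # 5. take the square root

# Hypothetical observations, for illustration only
data = [2, 4, 4, 4, 5, 5, 7, 9]
print(sample_sd(data))
print(statistics.stdev(data))  # the standard library's sample SD, for comparison
```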
define range
The range of a set of data is the difference between its largest (maximum) value and its smallest (minimum) value.
define probability
measure of how likely it is that an event occurs (a value from 0 to 1)
define odds
ratio of the probability of having an event to the probability of not having an event: P/(1-P)
relationship between probability and odds
the lower the probability P (risk), the closer the odds are to the probability; for rare events the two are nearly equal
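A short Python sketch of this relationship, using odds = P/(1-P). The function name and the chosen probabilities are illustrative assumptions; the printout shows the gap between odds and probability shrinking as P gets smaller.

```python
def odds_from_probability(p):
    """Convert a probability (0 <= p < 1) to odds: p / (1 - p)."""
    return p / (1 - p)

# Illustrative probabilities: the gap between odds and probability shrinks as p falls
for p in (0.50, 0.10, 0.01):
    odds = odds_from_probability(p)
    print(f"P = {p:.2f}  odds = {odds:.4f}  difference = {odds - p:.4f}")
```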
how to calculate risk and odds from a table
risk: events / total (events + non-events)
odds: events / non-events
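As a sketch, the two calculations from one row of a table might look like this in Python. The function names and the counts (20 events, 80 non-events) are hypothetical.

```python
def risk_from_counts(events, non_events):
    """Risk: events divided by everyone in the group (events + non-events)."""
    return events / (events + non_events)

def odds_from_counts(events, non_events):
    """Odds: events divided by non-events."""
    return events / non_events

# Hypothetical row of a table: 20 people with the event, 80 without
events, non_events = 20, 80
print(risk_from_counts(events, non_events))
print(odds_from_counts(events, non_events))
```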
proportion
a ratio in which the denominator includes the numerator
ratio
a number that expresses the relative size of two other numbers. The numerator is not necessarily included in the denominator
rate
occurrence of events over a specific time interval; a measure of the frequency of some phenomenon of interest
prevalence
existing cases of a disease in a given population at a specific time / total population at that time
incidence
number of new cases of a disease during a period / healthy population (at risk) at the beginning of the period; the proportion of a population that acquires the disease in that period
incidence rate
new cases / total person time of observation
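A minimal Python sketch contrasting incidence (a proportion) with the incidence rate (cases per unit of person-time). All names and numbers here are hypothetical and only illustrate the two denominators.

```python
def incidence_proportion(new_cases, at_risk_at_start):
    """New cases during the period / population at risk at the start of the period."""
    return new_cases / at_risk_at_start

def incidence_rate(new_cases, person_time):
    """New cases / total person-time of observation (e.g. person-years)."""
    return new_cases / person_time

# Hypothetical cohort: 500 people at risk, 25 new cases, 1250 person-years observed
print(incidence_proportion(25, 500))  # proportion of the at-risk group
print(incidence_rate(25, 1250))       # cases per person-year
```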
prevalence tells you
probability of having the disease -> burden
incidence tells you
probability of developing the disease -> risk
risk ratio
risk in group 1 (group of interest) / risk in group 2 (comparison group)
rate ratio
compares the incidence rates or mortality rates of 2 groups: rate in group 1 / rate in group 2
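Both ratios can be sketched in Python as below. The function names, argument order, and counts are assumptions for illustration; group 1 is the group of interest and group 2 the comparison group.

```python
def risk_ratio(events_1, total_1, events_2, total_2):
    """Risk in group 1 divided by risk in group 2."""
    return (events_1 / total_1) / (events_2 / total_2)

def rate_ratio(events_1, person_time_1, events_2, person_time_2):
    """Incidence (or mortality) rate in group 1 divided by the rate in group 2."""
    return (events_1 / person_time_1) / (events_2 / person_time_2)

# Hypothetical counts: 30/100 events in group 1 versus 10/100 in group 2
print(risk_ratio(30, 100, 10, 100))
# Hypothetical person-time: 12 events in 400 person-years versus 6 in 600
print(rate_ratio(12, 400, 6, 600))
```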
in a case-control study what can you measure
the odds ratio
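A hedged Python sketch of an odds ratio from a case-control 2x2 table, computed as the odds of exposure among cases divided by the odds of exposure among controls. The function name, argument layout, and counts are hypothetical.

```python
def odds_ratio(exposed_cases, unexposed_cases, exposed_controls, unexposed_controls):
    """Odds of exposure among cases / odds of exposure among controls."""
    odds_in_cases = exposed_cases / unexposed_cases
    odds_in_controls = exposed_controls / unexposed_controls
    return odds_in_cases / odds_in_controls

# Hypothetical table: 30 exposed and 70 unexposed cases; 10 exposed and 90 unexposed controls
print(odds_ratio(30, 70, 10, 90))
```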
in a prospective study (like cohort or randomized) what can you calculate
risk ratio, rate ratio, odds ratio
with IQR use
median
with standard deviation use
mean
standard deviation and variance
SD is the square root of variance