BIOSTAT & EPIDEMIOLOGY Flashcards
What information do you get from an ecologic (correlational) study?
Ecologic studies give population-level information, not individual-level information. Applying population-level information to an individual level = ecologic fallacy bias.
Kappa statistic
quantitative measure of inter-rater reliability (aka inter-rater concordance). reflect extent to which inter-rater agreement represents an improvement on chance agreement alone. Kappa values range from -1 (perfect disagreement) to +1 ( perfect agreement), with kappa = 0 suggesting agreement due to chance. kappa<0 suggests less than chance agreement, kappa > 0 greater than chance agreement.
Kaplan-Meier Curve
depicts probability of survival at various time points during the study. calculated based on proportion of subjects who are alive (“at risk”) at a given time. the event-free survival rates of 2+ groups can be compared. they are statistically different only if p-value is <0.05
what do overlapping stand error of measurement (SEM) bars suggest in general
overlapping SEM error bars suggest a non-statistically significant difference .
Case fatality rate
proportion of people with a particular condition who end up dying from the condition
Attack rate
proportion of people in whom an illness develops out of the total population at risk from the disease
median survival time
measure of prognosis when studying big groups. defined as the length of time that it takes for half of the study population to die
Quality adjusted life years & Disability adjusted life years
both are burden of disease measures evaluating the impact of specific diagnoses/treatments on individual patients of the economic impact of health interventions on populations . time-trade-off is commonly used to QALY calculations, while years of life lost and years lived with disability are used for DALY calculations . DALYs represent a loss to be minimized and reflect the difference between the current situation and an ideal situation
Meta-analysis
conducted by pooling data from several studies to increase statistical power (ability to detect difference in outcome of interest between groups, if such a difference exists). meta-analysis increases sample size and therefore increases power. major disadvantage: concomitant “pooling” of the biases and limitations of individual studies into one analysis
Standardized Mortality Ratio
SMR=observed # of deaths divided by expected # of deaths
NNT
NNT the number of patients who need to be treated in order to prevent one add’l bad outcome. NNT is the inverse of absolute reduction (ARR). ARR=control group event rate-experimental group event rate. NNT = 1/ARR
Likelihood Ratio
-assesses value of diagnostic test.
Positive LR
LR+ =sensitivity/ (1-specificity)
probability of patient with dx testing +/probability of pt without dx testing +
Negative LR
LR- = (1-sensitivity) /specificity
probability of patient with the dx testing neg / probability of patient without the disease testing neg
- LR range 0 to infinity. LR> 1 suggest disease present. higher the LR more likely dx presence. LR<1 argues against disease. in general smaller LR less likely dx
- LRs are independent of prevalence
Standardized Incidence Ratio
used to determine if the occurrence of cancer in a small population is high or low relative to the an expected value derived from a larger comparison population. SIR=observed cases/expected cases
Standard deviation
1 SD: 68%
2SD: 95%
3 SD: 99.7%
Standard Error of the Mean
- SEM is a measure of how tightly grouped a data set is
- smaller the SEM the more precise the data
z-score
shows how far above or below a score is compared to the mean
confidence intervals
- indication of precision of data
- if CI of outcome crosses 1, the results aren’t significant
- narrow 95% CI is more precise
- wide CI may be due to insufficient power
- if 95% CI reflects lack of statistical significance then corresponding p-value >.05; if 95% CI is significant then p < .05
T-score (t-test) & Analysis of Variance (ANOVA)
both are used to assess different groups of data between different sets of data that are in more than one group.
- T-test used for 2 groups of data. t-test answers questions “are means between two groups different?”
- ANOVA used when there are 2+ groups