Weeks 1 And 2 Flashcards
Pearson correlation
Linear regression for Gaussian data
-null is that the population correlation is 0 (no linear relationship)
Spearman correlation
Linear ranking method used with extreme values and applied to regression for a more Gaussian look
Regression equation
Y hat= bx+a
When there are Equal SD for regression, blank = blank
B=r
Multivariate
Techniques using multiple factors to remove confounding (except analysis of variance)
Multiple linear regression
For predicting a numeric dependent variable with multiple variables
Logistic regression
Used for 2 levels of dependent variables (yes/no, success/failure, etc)
Proportional hazards
Dependent variable is the time until a certain event
Relative risk
How many times more likely it is for 1 outcome vs another
Odds ratio
An approx of relative risk
Used in logistic regression
Hazard ration
Approx of relative risk, used in proportional hazards
Mean duration of survival
Best if all subjects die; mean amount of time they live
Median duration of survival
How long pts live, works better with censored data
Case fatality rate
% of deaths from condition
5 yr mortality rate
Proportion of deaths in a 5 yr period
Mortality rate per person yrs of observation
of deaths/total pt years (alive and dead)
Survival curves
Kaplan Meier, life table
Censored events
If event of interest doesn’t occur by the end of the study or there is a competing cause of death, etc
True experiment
Randomized design
Crossover trial
Each subjects gets 2 or more tx (each subject is his own control)
Equivalence trial
Shows that 2 tx or equivalent or close enough
Non inferiority trial
1 tx is not worse than an existing tx
Quasi experimental design
Strong element of control BUT no random assignment of individuals to groups
Single subject multiple baseline
Quasi exp
-many observations, intervention, many observations
Group multiple baseline
Quasi exp
-multiple baseline, intervention to group, multiple observations
Community trial
- quasi
- unit is the community but unit of analysis is the individual
- need verifiably comparable community
- randomly assigned intervention
Observational design
No intervention, only observation
-researcher does NOT randomly assign or control for various conditions
Case report
Details of 1 interesting case
Case series
Collection of case reports to show a pattern
Ecological/correlational
Look at a group and try to draw causal relationship
Ex: try to draw assoc between rate of homicide/suicide in London and the mean mental illness score in the boroughs
Cross sectional design/prevalence study
Exposure and disease status measure at a GIVEN time
Ex: ask if pt uses condoms CURRENTLY and see if they have an STD
Con: don’t know if exposure (use of condom) was before contraction of disease or started using after contracting disease
Case control
Group of cases with disease compared to group without disease
Cohort study
Group of initially healthy pts are evaluated for exposure status and then followed to see what happens
Attrition bias
Loss of subjects –> distortion of experiment’s effects
-at least 80% of subjects should complete trial
Allocation concealment
People assessing elegibility should not be able to know what group the next entering patient will be assigned to
Rating bias
Inaccurate responses because of beliefs, expectations
File drawer bias
Non significan small studies end up being filed instead of published; can also be done on purpose by drug companies
Patient oriented outcomes
Disability, pain, additional surgeries all are things patients would care about
Surrogate marker
Easily measured indicator of disease status that might not be clinically important (disease oriented outcomes)
Noncompliance bias
When subjects don’t take assigned meds
Small effects
Hard to be confident about these
Want larger effects like higher relative risk with CI not including 1 (or not 0 for mean)
Non-randomized studies
Main kind of study confounding is a problem in
Necessary cause
Must be present for the disease to occur
Sufficient cause
Will produce effect when present
Biological plausibility
Belief in cause and effect relationship is higher if there is a known biological mechanism by which exposure can reasonably alter risk of disease development
Biological gradient
Greater degrees of exposure result in greater risk of death
Temporal sequence
Exposure was present consistently before the disease
Reversibility
Removal of exposure reduces risk of disease in exposed population
Nuremberg Code
Ethical code developed after Nazi experiments
Tuskegee study
Study of syphilis in AA which continued even after a tx was found
Common Rule
Second code of ethics that formally regulates hunam subjects research
Vulnerable populations defined by NIH
Children, preg women, neonates, prisoners, pts without decision making capacity, suicidal persons
Vulnerable population not listed by NIH
Impoverished, unemployed, AA/minorities, immigrants, elderly, LGBTQIA individ, etc
Therapeutic misconception
Congnitive error that participation in research will provide direct benefits to the subject (ex: when physician recomments pt to clinical study, suggesting pt will benefit)
Meta analysis
Math. Method of combining study results to get a combined conclusion
- provides a p value and/or CI
- can increase sample size/serve as guide for N
- deal with conflicted findings
- limitation such as difficult with assignment of different weightage
Qualitative research
- in depth interviews, focus groups, participant observation
- semi structured and structures
- open ended questions
Type 1 error (alpha)
Rejecting the null hypotheses when there is, in fact, no population difference (rejecting null when null is actually true)
Type II error (beta; inverse to power)
You fail to reject the null when there is, in fact, a difference in the population
Prevalence rate
Proportion of the population @ risk with disease at a particular point in time
Period prevalence
Proportion of population with disease @some point during a time interval
Incidence rate
Proportion of initially healthy population at risk that develops disease during pd of interest (number who develop it/#(who develop it+healthy). EXCLUDE IN DENOM (and num) PEOPLE WHO ARE SICK ALREADY
Attack rate
Prop of specified population that develops a disease from a specific cause like an endemic
Odds
Number of times event occurs/number of times it does not
Probability to odds
P/(1-P)
Odds to prob
Odds/(1+odds)
Standard error (SE(M))
Indicates the accuracy with which a sample mean estimates the population mean (NOT individuals)
Z score
Number of standard deviations from the mean
When are t tests employed?
Simple comparison of means
Unpaired t tests
2 independent samples
SEDoM
SE difference of means
Sqrt[(S1^2+S22^2)/n]
T calculation
(Mean1-mean2)/SEDoM
Degrees of freedom in unpaired t
2n-2
Relative risk
Incidence of a disease in exposed population/incidence in an unexposed population
Odds ratio
Odds of disease in exposed group/odds of disease in controls
Attributable risk percent
Percent of people with a disease and risk factor, who got it BECAUSE of their exposure to a risk factor
Population attributable risk percent
Percentage of all cases of a disease due to a particular risk factor
Absolute risk reduction
Absolute difference between rate of bad outcome in treated and untreated groups
Relative risk reduction
Absolute risk reduction as a percentage of risk in untreated group (ARR/Rate in untreated group)
Number to treat
How many you need to treat to see a benefit in 1 person
1/ARR
Lead time bias
Apparent but false increase in survival (b/c of early detection, not actual changes in survival)
Sensitivity
If a person has the disease, the probability that the test will give a positive result
Specificity
If a person doesn’t have the disease, the probability of a negative result
Positive predictive value
If a person gets a positive result, the probability he has the disease
Negative predictive value
If the pt gets a negative result, the probability he doesn’t have the disease
Gold standard
Test/info you regard as true and correct
Likelihood ratio
Sensitivity/(100-specificity)
Power
Probability that the researcher will reject H0 when it should be rejected
Child mortality rate
Under 5 mortality per 1000
Infant mortality rate
Number of deaths of infants under the age of 1 per 1000 LIVE births
Neonatal mortality rate
Number of deaths of infants under 28 days of age per 1000 births
Maternal mortality rate
Number of women who die while preg. Or within 42 days of pregnancy termination per 100,000 live births
YLL
Years lives lost (when person died subtracted from ave life expectancy)
YLD
Years lived with disability (years of healthy life lost because of disability)
DALY- disability adjusted life year
YLL+ YLD
QUALY= Quality adjusted life year
1 QUALY= 1 yr of life in perfect health
Demographic transition
High fertility/high mortality as country develops
-population shift with economic development –> high fertility but low mortality
Developed nations have low fertility and aging populations