Biostatistics and Research Design Flashcards

Question

if mean > median, what direction is the data skewed?

Answer 1

RIGHT (positive)

Answer 2

LEFT (negative - tail towards left)

Answer 3

68 - 95 - 99.7 rule: within 1 SD of the mean lies 68% of the data; within 2 SD, 95%; within 3 SD, 99.7%
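
A quick numerical check of the rule, as a minimal sketch that assumes the data are exactly normally distributed. The helper name `fraction_within` is made up for illustration; it relies on the identity P(|Z| < k) = erf(k / sqrt(2)).

```python
import math

def fraction_within(k_sd: float) -> float:
    """Fraction of a normal distribution lying within k_sd SDs of the mean."""
    return math.erf(k_sd / math.sqrt(2))

# Print the share of data within 1, 2 and 3 SDs of the mean.
for k in (1, 2, 3):
    print(f"within {k} SD: {fraction_within(k):.1%}")
```

The printed values should come out close to 68%, 95% and 99.7%; the exact 95% cutoff sits at about 1.96 SD rather than 2.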

Answer 4

bell curve of standard deviation values - each point on the x-axis is the number of SDs away from the mean; z = 1.96 is the cutoff for a 95% CI

Answer 5

when n (sample size) increases, CI gets narrower (more precise estimate); when variance (SD) increases, CI gets wider (less precise estimate)
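
To see both effects at once, here is a minimal sketch assuming a CI for a mean built with the normal approximation (half-width = z * SD / sqrt(n)). The function name and the numbers are hypothetical, chosen only for illustration.

```python
import math

def ci_half_width(sd: float, n: int, z: float = 1.96) -> float:
    """Half-width of a CI for a mean under the normal approximation."""
    return z * sd / math.sqrt(n)

print(ci_half_width(sd=10, n=100))  # baseline
print(ci_half_width(sd=10, n=400))  # 4x the sample size: half the width
print(ci_half_width(sd=20, n=100))  # 2x the SD: double the width
```

Note that quadrupling n only halves the interval, because n enters through a square root.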

Answer 6

chi-squared statistic - assumes n is large; fisher’s exact test - any cell <10 (a commonly taught threshold is an expected cell count <5)

Answer 7

computed line (slope) with the least amount of error when comparing continuous data IV with continuous data DV

Answer 8

absolute risk difference: probability 1 - probability 2
risk ratio: probability 1 / probability 2
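
The two formulas can be sketched as follows. The function names and the example risks (20% vs 10%) are hypothetical and only illustrate the arithmetic.

```python
def absolute_risk_difference(p1: float, p2: float) -> float:
    """Subtraction: risk in group 1 minus risk in group 2."""
    return p1 - p2

def risk_ratio(p1: float, p2: float) -> float:
    """Division: risk in group 1 relative to risk in group 2."""
    return p1 / p2

p_exposed, p_unexposed = 0.20, 0.10  # made-up risks
print(absolute_risk_difference(p_exposed, p_unexposed))
print(risk_ratio(p_exposed, p_unexposed))
```

With these inputs the difference is 10 percentage points while the ratio is 2, which is why the same result can be reported as either a small or a large effect.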

Answer 9

larger χ² signifies a larger difference between expected and measured outcomes/values - signifies more error with a no-difference assumption; each χ² value is associated with a p value
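
A minimal sketch of how the statistic is built from observed and expected counts, assuming a contingency table of raw counts. The function names are hypothetical, and the p value shortcut via `math.erfc` is only valid for 1 degree of freedom (a 2x2 table).

```python
import math

def chi_squared(observed: list[list[float]]) -> float:
    """Sum of (observed - expected)^2 / expected over every cell."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(observed):
        for j, obs in enumerate(row):
            # Expected count under the no-difference assumption.
            expected = row_totals[i] * col_totals[j] / total
            stat += (obs - expected) ** 2 / expected
    return stat

def p_value_1df(stat: float) -> float:
    """Upper-tail p value for a chi-squared statistic with 1 df only."""
    return math.erfc(math.sqrt(stat / 2))

table = [[20, 10], [10, 20]]  # made-up 2x2 counts
stat = chi_squared(table)
print(stat, p_value_1df(stat))
```

A table where observed equals expected in every cell gives a statistic of 0; the further the cells drift from expectation, the larger the statistic and the smaller the p value.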

Answer 10

people change their behavior in a study; a form of subject bias
minimize via subject blinding (placebo)

Answer 11

1. treatment allocation (randomization)
2. patient blinding (placebo)
3. clinician blinding (blind to what treatment is being provided)
4. outcome assessor blinding (investigator assessing outcome is blinded to which group)

Answer 12

intention to treat: analysis of outcome according to treatment assigned, regardless of drop out or lack of compliance —> preserves randomization “as treated”: analysis of outcome according to the treatment actually received, irrespective of what group subjects were originally assigned to

Answer 13

“as treated” protocol would not remove non-compliant subjects “per-protocol” would remove non-compliant subjects from the results in both cases, analysis of outcome is according to what treatment was actually received, regardless of original group assignment

Answer 14

parallel: classic, intervention group v control
stratified: randomization into stratified groups if there is a variable that will likely have a large influence on outcome (such as stage of cancer, wealth, etc)
crossover: subjects undergo intervention, then control after wash-out period (can serve as their own controls - reduces variability, increases power) - doesn’t work when the order matters or intervention has lasting effect
cluster: randomize care systems rather than individual patients (ex - testing 2 different disinfectants in a number of different emergency rooms) - does not work if individual consent is required
non-inferiority: testing whether a new treatment is not unacceptably less effective than the current treatment, when the new treatment is more favorable in other ways (cost, convenience, availability, etc)

Answer 15

refers to genuine uncertainty about which treatment is better in an RCT; an ethical concern in RCTs

Answer 16

secondary: outcomes of interest other than primary, ideally also designated a priori, require more stringent p value to be considered significant (otherwise you can find a difference anywhere if you look hard enough)
composite: combining multiple outcomes into one (ex: a OR b OR c - achieving any of these would be considered a primary outcome)
surrogate: some number outcome, often a lab measurement, that doesn’t necessarily speak to patient’s experience or quality of life

Answer 17

internal validity: how much results reflect reality for patients in study external validity = generalizability

Answer 18

observational, from exposure to disease
exposures can be beneficial or harmful
can be prospective (“will x exposure cause disease?”) or retrospective (“did x exposure cause disease?”)
outcomes: absolute risk (aka incidence), absolute risk difference (subtraction), and relative risk (risk ratio)
best way to look at prognosis and incidence

Answer 19

subtype of retrospective cohort study strength: can use very large populations, most practical method for looking for rare side effects/outcomes weakness: data often gathered for intent other than medicine (billing, etc), important data may not have been collected on outcome date (weakness of all retrospective cohort studies)

Answer 20

working backwards to explore possible causes of developed disease (“what exposure may have caused x disease?”) - can be hypothesis generating
cases: newly-incident cases of outcome
controls: persons without outcome who “had the opportunity” to be exposed
outcomes: odds ratio

Answer 21

retrospective cohort - “did x exposure cause disease?” outcome: absolute risk (incidence), absolute risk difference, relative risk (risk ratio)
case control - “what exposure may have caused x disease?” outcome: odds ratio
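
The two outcome measures can be compared on one 2x2 table, as a minimal sketch. The cell labels a, b, c, d and the counts are hypothetical; a = exposed with disease, b = exposed without, c = unexposed with disease, d = unexposed without.

```python
def relative_risk(a: int, b: int, c: int, d: int) -> float:
    """Risk in exposed divided by risk in unexposed (cohort studies)."""
    return (a / (a + b)) / (c / (c + d))

def odds_ratio(a: int, b: int, c: int, d: int) -> float:
    """Odds of disease in exposed divided by odds in unexposed."""
    return (a * d) / (b * c)

a, b, c, d = 20, 80, 10, 90  # made-up counts
print(relative_risk(a, b, c, d))
print(odds_ratio(a, b, c, d))
```

A case-control study fixes the number of cases and controls by design, so the row totals do not reflect true risks and only the odds ratio is meaningful there. The two measures get closer as the outcome gets rarer.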

Answer 22

case-control without the control - may be no way to know who had the opportunity to be exposed best for describing emergent diseases, rare outcomes, odd exposures

Answer 23

snapshot of individuals in particular population at particular time best way to determine prevalence

Answer 24

aka correlational study; snapshot of population and/or environment; mostly hypothesis generating

Answer 25

cohort study - follow group from exposure to disease

Answer 26

“Big Data” studies

Answer 27

case series - case-control without the control (unable to determine)

Answer 28

cross-sectional

Answer 29

testing increases perceived survival time without affecting the course of the disease

Answer 30

aka measurement bias; the way data is collected is more likely to include some members of a population than others, such as more intense surveillance or screening among exposed individuals than unexposed individuals

Answer 31

ascertainment/ measurement bias - people with high PSA are more likely to get the biopsy

Answer 32

recall bias - patients with liver cancer are probably thinking about what may have led to developing cancer and might remember their past supplement use more than average control

Answer 33

incorporates all pertinent info of a medical decision into a computer model that simulates multiple patients
outcomes: overall mortality, cause-specific morbidity or mortality, utility (ex - QALYs or DALYs), cost
helpful for decision analysis of complicated decisions or cost-effectiveness/cost-benefit analysis (cost-effectiveness - compares cost to outcomes, cost-benefit - represents all outcomes in money terms)

Answer 34

incidence: probability that unaffected people will develop disease during specific time period —> new cases per time / unaffected at-risk people at beginning of time period prevalence: proportion of people in population that have a disease at given time —> affected persons / all persons
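
Both definitions are simple proportions; a minimal sketch follows. The function names and the population numbers are hypothetical.

```python
def incidence(new_cases: int, at_risk_at_start: int) -> float:
    """New cases during the period / unaffected at-risk people at its start."""
    return new_cases / at_risk_at_start

def prevalence(affected: int, population: int) -> float:
    """People with the disease at a given time / all people."""
    return affected / population

# Made-up town: 5,000 people, 1,000 already affected, 40 new cases this year.
print(incidence(new_cases=40, at_risk_at_start=4000))
print(prevalence(affected=1000, population=5000))
```

The incidence denominator excludes the 1,000 people who already have the disease, since they are no longer at risk of becoming a new case.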

Answer 35

chronic illness - increases prevalence, even if incidence is small
incidence increase/decrease causes the same in prevalence (in acute illnesses, incidence and prevalence tend to track each other - not the case for chronic illness)
in-migration can increase prevalence, out-migration can decrease it

Answer 36

relative risk difference = | RR - 1 |
RR 1.62 = 62% RR increase
RR 5 = 400% RR increase
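
The conversion can be sketched in one line; the function name is made up for illustration.

```python
def relative_risk_difference(rr: float) -> float:
    """Distance of the risk ratio from 1 (no effect), as a proportion."""
    return abs(rr - 1)

for rr in (1.62, 5, 0.75):
    print(f"RR {rr} -> {relative_risk_difference(rr):.0%} relative change")
```

An RR below 1, such as 0.75, gives a 25% relative decrease by the same formula.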

Answer 37

absolute risks make risks and benefits sound smaller; relative risks make risks or benefits sound bigger

Answer 38

1% —> 1:99 25% = 1/4 —> 1:3 80% = 4/5 —> 4:1
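
The conversions follow odds = p / (1 - p). A minimal sketch using exact fractions; the helper names are hypothetical.

```python
from fractions import Fraction

def probability_to_odds(p: Fraction) -> Fraction:
    """Odds in favor: chance of the event over chance of no event."""
    return p / (1 - p)

def odds_to_probability(odds: Fraction) -> Fraction:
    """Inverse conversion: odds / (1 + odds)."""
    return odds / (1 + odds)

for p in (Fraction(1, 100), Fraction(1, 4), Fraction(4, 5)):
    odds = probability_to_odds(p)
    print(f"{p} -> {odds.numerator}:{odds.denominator}")
```

`Fraction` keeps the results exact, so 1/4 maps to 1:3 without floating-point rounding.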

Answer 39

NNT = # people treated / one additional good outcome absolute risk decrease = additional good outcomes / # of people treated NNT is the INVERSE of absolute risk decrease (same goes for NNH and absolute risk increase)
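
A minimal sketch of the inverse relationship, assuming event rates are given as proportions of bad outcomes in each group. The function name and the example rates are hypothetical.

```python
def number_needed_to_treat(control_event_rate: float,
                           treated_event_rate: float) -> float:
    """NNT = 1 / absolute risk decrease."""
    absolute_risk_decrease = control_event_rate - treated_event_rate
    if absolute_risk_decrease <= 0:
        raise ValueError("treatment shows no benefit; NNT is undefined")
    return 1 / absolute_risk_decrease

# Made-up trial: 50% bad outcomes on control, 25% on treatment.
print(number_needed_to_treat(0.50, 0.25))
```

In practice the result is usually rounded up to a whole person. The same function gives NNH when the arguments are swapped so that the difference is the absolute risk increase.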

Answer 40

1. strength 2. dose-response 3. specificity 4. alternative explanations (have been considered) 5. temporality (cause —> effect) *** 6. reversibility 7. consistency 8. plausibility/ coherence 9. analogy

Answer 41

descriptive/ hypothesis generating: 1. case reports 2. case series 3. cross-sectional 4. ecological
analytic/ hypothesis testing: 1. case control 2. retrospective and prospective cohort studies 3. RCT 4. meta-analysis

Answer 42

ascribing relationships observed for groups (in ecological studies) to individual members

Answer 43

pre-test probability: estimated probability of disease before you do the test; nearly synonymous with prevalence
specificity and sensitivity should not vary with population

Answer 44

inverse of ARR (absolute risk reduction)

Answer 45

NNT is inverse of ARR | 1/0.25 = 4

Answer 46

predictive value: how likely is it that this test result is correct?
PPV: among everyone with positive test, what percent have disease?
NPV: among everyone with negative test, what percent do not have disease?
sensitivity: # true positives / all with disease
PPV: # true positives / all positives

Answer 47

TRUE: prevalence of illness differs among populations, for example if one population is screened more often (ex: blood donors and HIV)

Answer 48

LR+: likelihood of + test in diseased / likelihood of + test in non-diseased LR-: likelihood of - test in diseased / likelihood of - test in non-diseased

Answer 49

LR+ = likelihood + test in disease/ likelihood + test in non-diseased sensitivity = # true positives / everyone with disease PPV = # true positives / # all positives

Answer 50

LR+ = likelihood of + test in diseased / likelihood of + test in non-diseased
this is the same as saying LR+ = sensitivity / (1 - specificity)
aka LR+ = [true positives / everyone with disease] / (1 - [true negatives / everyone without disease])

Answer 51

LR- = likelihood of - test in diseased / likelihood of - test in non-diseased
aka LR- = (1 - sensitivity) / specificity
aka LR- = (1 - [true positives / everyone with disease]) / [true negatives / everyone without disease]
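
Both ratios can be computed from sensitivity and specificity alone, as in this minimal sketch. The function names and the example values (sensitivity 0.85, specificity 0.90) are chosen for illustration.

```python
def positive_likelihood_ratio(sensitivity: float, specificity: float) -> float:
    """LR+ = sensitivity / (1 - specificity)."""
    return sensitivity / (1 - specificity)

def negative_likelihood_ratio(sensitivity: float, specificity: float) -> float:
    """LR- = (1 - sensitivity) / specificity."""
    return (1 - sensitivity) / specificity

sens, spec = 0.85, 0.90
print(round(positive_likelihood_ratio(sens, spec), 2))
print(round(negative_likelihood_ratio(sens, spec), 2))
```

The parentheses matter: dividing only the sensitivity by the specificity and then subtracting from 1 gives a different (and wrong) number for LR-.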

Answer 52

a likelihood ratio of 1 means that a test result is equally likely whether or not a patient has the disease - so the test is useless
large LR+ and low LR- values are associated with big changes between pre-test and post-test probabilities
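
How a likelihood ratio moves the pre-test probability can be sketched by converting to odds, multiplying, and converting back. The function name and the example numbers are hypothetical.

```python
def post_test_probability(pre_test_probability: float,
                          likelihood_ratio: float) -> float:
    """Post-test odds = pre-test odds x LR, then convert odds back."""
    pre_test_odds = pre_test_probability / (1 - pre_test_probability)
    post_test_odds = pre_test_odds * likelihood_ratio
    return post_test_odds / (1 + post_test_odds)

pre = 0.25  # made-up pre-test probability
for lr in (8.5, 1.0, 0.1):
    print(f"LR {lr}: {pre:.0%} -> {post_test_probability(pre, lr):.0%}")
```

With an LR of 1 the probability comes back unchanged, while a large LR+ or a small LR- moves it substantially.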

Answer 53

LR+ = sensitivity / (1 - specificity)
```
LR+ = 0.85 / (1 - 0.9)
LR+ = 0.85 / 0.1
LR+ = 8.5
```

Answer 54

portray the trade-off between sensitivity and specificity at various test cut-offs
diagonal line through the middle represents an area under the curve of 0.5, where the test becomes useless

Answer 55

another test, taken near-simultaneously
clinical judgement, simultaneously
clinical outcomes over time, needs follow up

Answer 56

case-fatality rate = #deaths/#cases (usually used for diseases that kill quickly)
x-year survival: shown as a curve of survival probability spanning years, usually for more long-lasting diseases
median survival: time at which half of the cohort has died

Answer 57

to compensate for competing mortality (unrelated to illness being studied), censored patients are removed from the denominator of patients at risk for an event - lowers the number of patients but does not affect survival rate
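
The card does not name an estimator; the sketch below assumes the standard product-limit (Kaplan-Meier) approach, in which censored patients leave the at-risk denominator without counting as deaths. The function name and the follow-up data are hypothetical.

```python
def kaplan_meier(times: list[float], died: list[bool]) -> list[tuple[float, float]]:
    """Return (time, survival probability) at each time a death occurs."""
    at_risk = len(times)
    survival = 1.0
    curve = []
    for t in sorted(set(times)):
        deaths = sum(1 for ti, d in zip(times, died) if ti == t and d)
        censored = sum(1 for ti, d in zip(times, died) if ti == t and not d)
        if deaths:
            # Only deaths change the survival estimate.
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        # Deaths and censored patients both leave the at-risk group.
        at_risk -= deaths + censored
    return curve

# Made-up cohort: patients at t=2 and t=4 are censored, the rest die.
print(kaplan_meier([1, 2, 3, 4, 5], [True, False, True, False, True]))
```

The censored patient at t=2 never lowers the curve directly, but the death at t=3 is divided by 3 patients at risk instead of 4.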

Answer 58

hazard ratios: similar to relative risk, evaluate one prognostic factor at a time (what’s the slope of survival over time comparing one group to another, such as patients with varying levels of immunoglobulins) - relative risk cares about outcome at the end, hazard ratio cares about events along the way to the end outcome
stratified survival curves: survival curves comparing subjects grouped by criteria
clinical prediction rules for prognosis: patients are “scored” by the number of criteria they meet to put them in a high/medium/low risk group (combining prognostic variables is more predictive than one alone)

Answer 59

```
accuracy = validity
precision = reliability
```

Answer 60

aka normal distribution curve

Answer 61

the chance of one test being in the 95% CI range is 0.95, so the probability of more than one (independent) test being in that range is each of their probabilities multiplied together
each test has a 0.95 probability, so the total probability they are all in that range is 0.95^n, where n is the number of tests - hence a smaller probability
in other words, with too many tests there is a higher chance you’ll “find” something wrong
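
The shrinking probability can be tabulated directly, as a minimal sketch that assumes the tests are independent. The function name is hypothetical.

```python
def probability_all_in_range(n_tests: int, per_test: float = 0.95) -> float:
    """Chance that every one of n independent tests falls in its normal range."""
    return per_test ** n_tests

for n in (1, 5, 10, 20):
    all_normal = probability_all_in_range(n)
    print(f"{n:>2} tests: all in range {all_normal:.2f}, "
          f"at least one flagged {1 - all_normal:.2f}")
```

By 20 tests, a healthy patient is more likely than not to have at least one result outside the range.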

Answer 62

x-ray —> compare to CT/MRI all imaging, blood tests —> autopsy/ surgical study (biopsy) everything —> autopsy (“ultimate” gold standard)

Answer 63

cancer in its “usual” place, not breaking through membrane barriers surrounding primary tumor

Answer 64

```
grade = severity under a microscope
stage = severity of physical extent (metastasis)
```

Answer 65

cytology: cells not in environment of origin - exfoliative (cells fall off), brush (cells brushed off), needle aspiration (cells pulled out with syringe) pathology: cells with some surrounding environment - excisional biopsy (small volume), surgical resection (large volume), autopsy

Answer 66

medical: hospital or outpatient death, requiring next of kin consent medical examiner: unexpected/ unusual death, possible legal issues involved (ex- homicide) or public health concern (outbreak), no consent needed

Answer 67

intervention/ exposure —> RCT prognosis —> cohort diagnostic test characteristics —> cross-sectional if using simultaneous reference, prospective cohort if using clinical follow-up reference

Answer 68

RCT! because you can control for unknowns, confounding factors, bias, etc

Answer 69

when you don’t have/ know the absolute or relative risk difference

Answer 70

non-adherence to treatment and cross-over

Answer 71

case series (observational) hypothesis generating, like case-control but without controls because it is difficult to determine who the controls would be

Answer 72

cross-sectional: best for establishing prevalence (how many people out of everyone currently have disease)

Answer 73

actually, yes, but not with the same group of people: comparing snapshots of different groups of people at different points in time

Answer 74

cross-sectional, comparing snapshots of populations over time to establish trends in prevalence

Answer 75

case-control: from disease to exposure (“what exposure might have caused x disease?”)

Answer 76

prospective cohort study

Answer 77

retrospective cohort study: from exposure to disease (“what disease did x exposure cause?”) best for diagnosis or prognosis questions based on information usually captured in routine care, risky or rare exposures, long time-frame issues (such as radiation exposure)

Answer 78

case control studies (from disease to exposure - searching for an exposure) are for examining outcomes or exposures not likely captured in routine care (you’re looking for an unstudied exposure) retrospective cohort studies (from exposure to disease - searching for a disease) are for examining diagnosis or prognosis questions based on information usually captured in routine care (you’ll need this information to study the development of the disease)

Answer 79

cross-sectional

Answer 80

cross-sectional best [observational study] for evaluating a test with a simultaneous reference

(105 cards)