Biostats HY Flashcards

Question 1

Q

Give an intervention to one group and give placebo to other group then compare /record outcomes

Answer

A

Random Controlled Clinical Trial
(RCT)

Question 2

Q

Compare group of ppl with an uncommon dz (or characteristic) and a group of ppl w/o the dz and look back in time for exposures

Answer

A

Case-Control
(calculate odds Ratio)

Cohort is opposite it looks at exposure first then future dz.
Case control looks dz first then back in time for exposure!

Question 3

Q

prominent issue with Case Control Studies?

Answer

A

Recall Bias

Question 4

Q

To do a study on a rare phenomena (or disease)
_____ studies are typically the best option on NBME exams.

Answer

A

case-control study

Question 5

Q

Which study looks at 2 groups; one with a risk factor/exposure and one w/o risk factor and then follow into future to see if they develop a particular outcome (disease/adverse effect)

Answer

A

Cohort studies

(calculate relative risk)

Question 6

Q

Lower P value (<0.05) = higher (2)

Answer

A

confidence & power
(that results are not by chance)

Question 7

Q

which P-value is better?

P<0.05 or <0.01?

Answer

A

P<0.01

means 1% chance that results were due to chance

Question 8

Q

__% of population with normal distribution should fall within 2 Standard deviations below & above of the mean (average)

Answer

A

95%
Example: SD is 100 and mean is 1000.
2SD below mean = 800
2SD above mean = 1200
95% of population falls within 800-1200
5% must fall outside this range
2.5% less than 800
2.5% higher than 1200

Question 9

Q

95% of population with normal distribution should fall within __ Standard deviations below & above of the mean (average)

Answer

A

2
Example: SD is 100 and mean is 1000.
2SD below mean = 800
2SD above mean = 1200
95% of population falls within 800-1200
5% must fall outside this range
2.5% less than 800
2.5% higher than 1200

Question 10

Q

__% of population falls outside 2 Standard deviations below & above of the mean (average)
___% falls above 2 SD of average
___% falls below 2 SD of average

Answer

A

5%
2.5% above 2SD of mean
2,5% below 2SD of mean

Question 11

Q

Whenever you have 2 confidence intervals overlap in value (or cross each other) that means results are

Answer

A

not significant

(no difference in effectivity between those two things)

Question 12

Q

In Ratio derived confidence intervals (Relative risk, Odds ratio) if the confidence interval includes (crosses) the number ___ = not significant

Answer

A

1

(can get ONE, if you divide two of the SAME number)

Question 13

Q

In Difference derived confidence intervals (Average/ percents/proportions, RRR, Attributable Risk, ARR) if the confidence interval includes (crosses) the number ___ = not significant

Answer

A

0

(can get zero, if you subtract two numbers that are the SAME)

Question 14

Q

3 Rules for figuring out what a value’s Confidence interval is.
Example: Confidence interval of a Relative Risk of 3.5

Answer

A

Is it a ratio or a difference?
Relative risk is a ratio so CI can’t include #1 (eliminate those ans)
ARR is a difference so CI can’t include #0
Value cannot start or end the CI
(ex: confidence interval can’t start or end with 3.5)
Value must fall within the CI range of numbers & be nearest the center of the range.
(eliminate all ans that do not include 3.5 within the range)

CI must include the value (ex: 3.5) at the center within the range of numbers, but the value must not start or end the interval and the interval can’t include the number 1 or 0

Question 15

Q

Calculate Number Needed to Treat & ARR

Answer

A

ARR = (% of pts who died getting DRUG) minus (% of pts who died getting PLACEBO)
──

NNT= 1 ÷ ARR

Question 16

Q

Calculate Number Needed to Harm

Answer

A

1 ÷ (% of pts harmed by Placebo) minus (% of pts harmed by Drug)

NNH= 1 ÷ AR

Question 17

Q

Calculate Relative Risk

& Relative Risk Reduction (Decreased Relative Risk)

Answer

A

Relative Risk
(% exposed/intervention + dz) ÷ (% unexposed/control + dz)

(ex: 20% of smokers got Lung cancer/ 10% nonsmoker got lung cancer = 2 → aka smoking increases risk of lung cancer 2-fold)
─
RR = rate of outcome in exposed/ rate of outcome of control
RRR= (1– RR)

Question 18

Q

What is the Positive & Negative Likelihood ratio formula?

Answer

A

Positive= (Sensitivity/1– Specificity)
Negative= (1– Sensitivity/Specificity)

Question 19

Q

When to use the positive or negative likelihood ratio on exam to calculate correct answer?

Answer

A

+ve LRs tell you how much more likely a phenomenon is when you have a +ve test result.

-ve LRs tell you how much less likely a phenomenon is when you have a -ve test result.

Question 20

Q

Quick way to calculate Odds ratio

Answer

A

Odds ratio
(Expected Outcomes )÷ (Odd Outcomes)
─

Expected: (exposed got disease) x (unexposed no disease)
÷
Odd: (exposed no disease) x (unexposed got disease)

Question 21

Q

How to Calculate Confidence Interval

Answer

A

90% Z-Score = 1.5
95% Z-Score = 2
99% Z-score = 2.5

Question 22

Q

ROC curves (how well a test can distinguish b/w 2 groups)
The best test (highest sensitivity & specificity) lies at the _____ of the graph.

Answer

A

top left corner

Question 23

Q

Cohort study, 2 groups of individuals are initially identified as “exposed” or “nonexposed” according to their exposure status to a specific risk factor and then followed into future to assess development of the outcome (incidence of disease).

Answer

A

Case-Control = 1 Uncommon diseases are followed back in time to assess exposure(s)

Cohort = Exposures are followed into future for development of common diseases

Question 24

Q

68%, 95%, and 99.7% of a normal population lie b/w __, __, & __ SDs of the mean respectively.

Answer

A

1 (68% → 16%)
2 (95% → 2.5%)
3 (99.7%)

Question 25

Q

Both test require measuring a quantitative (numerical) outcome
Between 2+ qualitative (Intervention/Risk Factor) groups

compares means of 2 groups, ___ test.
compares means of 3+ groups, ___ test.

Answer

A

T test
ANOVA (or F) test

Chi test has qualitative terms for both intervention and outcome

Question 26

Q

When you incorrectly reject the null
(state there is an effect when there is not an effect)
= a Type __ error.

Answer

A

Type 1 error (alpha error)

(aka false positive error)

Question 27

Q

When you incorrectly accept the null
(state there is no effect when there is an effect)
= a Type ___ error.

Answer

A

Type 2 error (beta error)

(aka false negative error)

Question 28

Q

Power = ___

Answer

A

1– beta

Statistical power is the probability of stating that there is an association & it’s actually true.

(aka rejecting a false null hypothesis)

Question 29

Q

Narrower CIs tell you study is more ___.

Answer

A

precise

However, you should feel a lot less confident in the results of the study bc the CIs are too narrow (less room for error).

Question 30

Q

Ways to Increase the power of a study (HY!)

Answer

A

Studies with larger sample sizes have greater statistical power, consequently a lower probability of a type II error

Recruit more people for a study (larger sample size).
Have a large difference b/w 2 quantities you’re trying to measure (larger effect size).
Increase measurement precision (how consistent values are)
lower P values = more power (P<0.01)
Increase data for a measured qty cluster around 1 value.

Question 31

Q

FYI

Answer

A

The fact that something is statistically significant does not mean that it is clinically significant

Question 32

Q

study compares 2+ treatment on one pt and allows them to serve as own controls

Answer

A

Crossover study

Question 33

Q

This test has
2 groups divided by
≥2 categorical/qualitative factors
(exposure or intervention)
and measure the categorical/qualitative outcome
observed in each group

Answer

A

Chi squared test

Qualitative = characteristic
Quantitative = numerical values (Temp, BGL, Percentages)

Question 34

Q

Mean is the average.

Median represents:
1. the ___ #
2. the ___#s

Mode represents the # that ____ in the data set

TIP Arrange data in ascending/descending order before making these determinations.

Answer

A

Middle # (odd # data set)
Avg of the 2 middle #s (even # data set)

Mode = # that is repeated most

Question 35

Q

For a normal distribution, _____

Answer

A

mean = median = mode

Question 36

Q

HY
bimodal distributions found in (3 illnesses)

Answer

A

Hodgkin’s lymphoma
Suicide
slow/fast acetylators in metabolism

Question 37

Q

erroneously thinking that survival has been improved when in fact the “apparent survival improval” arose bc you found disease earlier.

Answer

A

Lead time bias

Question 38

Q

80% sensitivity = 20% ____ test result

Answer

A

False negative test result

(tested negative but have dz)

Question 39

Q

Screening test: High _____ test if Negative rules out dz
Confirmatory test: High ____ test if Positive rules in dz

Answer

A

High Sensitivity test if Negative rules out dz (SN-N-OUT)
High Specificity test if Positive rules in dz (SP-P-IN)

Question 40

Q

Of all the population with the disease what % will have a (+) test result = ___

Answer

A

Sensitivity

High seNsitivity = Low False Negative rate

Missed ones are False Negative
(pt has disease, but result was negative)

Question 41

Q

Of all the population without the disease, what % have (–) test results = ___

Answer

A

Specificity

A highly sPecific test has a low false Positive rate.

Question 42

Q

90% specificity = 10% ____ test result

Answer

A

False positive test result

(tested + but have no disease)

Question 43

Q

Which of the following points best represents the spot with the highest positive predictive value (PPV)

(aka % of people with +ve tests who have disease)

Answer

A

C
The highest PPV region on a graph, corresponds to the region with the highest sPecificity (C)

Question 44

Q

Sensitivity of a test represents the (% of pts with disease) & have _____

PPV of a test represents the (% of pts with +ve test) & have _____

Answer

A

(+ve test result)
→ SN = % of ppl w/dz the test marks positive

(disease)
→ PPV= % of True Positives the test reports

Question 45

Q

Equation for calculating Sensitivity

Answer

A

Sensitivity = (True Positive Test)/ (TP test + FN test)

aka (# of diseased & [+] test ) over (# of diseased regardless of test result)

Question 46

Q

Equation for calculating Specificity

Answer

A

Specificity = (True Negative test)/ (TN test + FP test)

aka (# of healthy & Neg test) over (# of healthy regardless of test result)

Question 47

Q

Equation for calculating PPV%

Answer

A

PPV= (TP test)/ (TP test + FP test)

aka [# of pts w/ (dz & + test)] over [total # of positive test regardless if true or not]

NPV= TN/(TN+FN)

Question 48

Q

Which of the following points best represents the region of the graph with the highest negative predictive value (NPV)

(aka % of healthy pts w/ NEG test results)

Answer

A

B
The highest NPV region on a graph, corresponds to the region with the highest seNsitivity (B), which corresponds to the region that DOES NOT miss anyone with disease.

Question 49

Q

Specificity of a test represents the % of people ____

NPV of a test represents the % of people ____

Answer

A

SPECIFICITY: % people (w/o disease) who have (–ve test results)

NPV: % people with (-ve test results) who are (w/o disease).

Question 50

Q

Lowering cutoff value dose what (6)

Answer

A

Lowering cutoff (B → A)
↑ SN & NPV & FP
↓ SP & PPV & FN

Question 51

Q

Increasing the cutoff value dose what (6)

Answer

A

Increasing cutoff (B → C)
↑ SP & PPV & FN
↓ SN & NPV & FP

Question 52

Q

As Prevalence goes up, ____ should increase too

Question 53

Q

As Prevalence goes up, ____ should decrease

Answer

A

NPV

(Inversely related PPV/NPV)

Question 54

Q

Can Prevalance change Sensitivity or Specificity of a test?

Answer

A

No

(but changing cut off values can)

Question 55

Q

Prevalence vs Incidence

Answer

A

Prevalence counts at all current cases of dz in total population (live longer stay in population longer incr prevalence)

Incidence counts all new cases in the total population

Question 56

Q

Prevalence decreases with (4)

Answer

A

increased mortality
faster recovery
more vaccine/prevention
Lowering risk factors

Question 57

Q

incidence decreases with (2)

Answer

A

more vaccine/prevention
lowering risk factors

Question 58

Q

TIP place Mean, Median, Mode in alphabetical order.
This should help you remember that:

in a Negatively skewed curve (flat portion on the ___): ____
in a Positively skewed curve (flat portion on the ___): _____

Answer

A

Flat left → mean < median < mode.

Flat Right → mean > median > mode.
───
notice how arrow head’s (<) flat part points in direction of the skew’s flat part

Question 59

Q

Case-control studies can consider only ____ per study but can evaluate exposure to several risk factors.

Answer

A

1 outcome (ie, disease)

Question 60

Q

3 actions used to control for confounding variables during the design stage of a study.

Answer

A

Randomization
Matching (same # of pts w & wo risk factor)
Restriction (participation criteria)

Question 61

Q

Nonrandom treatment assignment may lead to ___ bias

Answer

A

selection bias

Question 62

Q

Stratified analysis of the extraneous variable can help distinguish whether that variable is a confounding bias or an effect modifier.

It is a confounding bias if ______.

Answer

A

Stratified analysis of both groups yields similar RR (relative risk) no significant difference

If RR between 2 groups are significantly different → Effect Modifier

Question 63

Q

Because cohort studies measure incidence of disease, they provide a measure of

Answer

A

relative risk of disease

Question 64

Q

A case-control study compares the exposure status of people with & w/o a disease (ie, cases), they provide a measure of

Answer

A

odds ratio

Answer 64

A

A) Case control
B) Cohort
C) Cross Sectional

Answer 65

A

Harder: positive test result ( ↓ False Positives)

Easier: negative test result ( ↑ False Negatives)

Answer 66

A

(True positives + True negatives) / Total number of individuals tested

Answer 67

A

100 – NPV

100- PPV

Answer 68

A

Makes the range larger/wider

Mean 7 → 95% CI= 4–10
Mean 7 → 99% CI= 2– 12

Answer 69

A

If there is no real difference between 2 groups, there is a 5% chance of finding a difference

Answer 70

A

Negative predictive value (NPV)

Answer 71

A

Not significant (no difference/are similar)

if p-value less than α → significant (difference exists)

Answer 72

A

(NEG correlation #) as one variable increases, the other variable decreases

(POS correlation #) both variables increase (or decrease) together.

Answer 73

A

68%
95%
99.5%

Answer 74

A

precise

(FYI: Increasing the sample size increases the precision of the study, but does not affect accuracy.)

Answer 75

A

best compromise between highest sensitivity & specificity

(aka top left corner of ROC curve)

Answer 76

A

risk (increase nor decrease)

Answer 77

A

ARR = (Intervention % outcome) – (Control % outcome)

AR = (Control % outcome) – (Intervention % outcome)

Answer 78

A

cohort

Prevalence = Cross-Sectional

Answer 79

A

Case-control