Biostats 2 Flashcards

1
Q

Internal validity

A

extent to which a piece of evidence supports a claim about cause and effect, within the context of a particular study

That is, cause precedes the effect and they happen together, with little possibility for confounders – within the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

External validity

A

applies well to a population outside of the study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Least important of mean, median, mode?

A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

quantifies the amount of variability, or spread, around the mean of the measurements.

A

Variance (σ2 )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

a measure of variation of scores about the mean

more commonly used

A

standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

The statement that establishes a relationship between variables being assessed

A

Alternative hypothesis (Ha or H1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The statement of no difference or no relationship between the variables

A

Null hypothesis (Ho)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

A ___ error is made if we reject the null hypothesis when null hypothesis is true.

A

type I

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

A ___ error is made if we fail to reject null hypothesis

A

Type 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

More important than p value – a better determination of significance
Any statistic is simply an estimate of the true value of that statistic

A

Confidence interval (CI)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

95% CI states that we can be 95% certain that the “true” value is within the CI range

A

Narrower CI is better

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Odds ratio of 1

A

no assocation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

A screening test is used to separate from a large group of apparently well persons those who have a high probability of having the disease, so that they may be given a diagnostic work up, and if diseased can be treated.

What are the conditions for a screening?

A

The target disease is an important cause of mortality and morbidity.

A proven and acceptable test exists to detect individuals at an early stage of disease.

There is a treatment available to prevent mortality and morbidity once positives have been identified.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Sensitivity

A

ability of test to correctly identify those who have the disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Specificity

A

ability of the test correctly identify those who DO NOT have the disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Tends to rule OUT the disease

High Sensitivity means low probability of false positive

A

Sensitivity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Screening test’s ability to identify presence of disease

A test with high sensitivity will not miss many patients
who have the disease

A highly useful test when NEGATIVE

A

Sensitivity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Tends to rule IN the disease

High Specificity means low probability of false negative

A

Specificity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Screening test’s ability to truly identify absence of disease

That is, how likely is a negative test actually reporting the right answer?

A highly useful test when it is POSITIVE

A

Specificity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

A highly ___ test is most useful to the clinician when it is NEGATIVE

A

sensitive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

A highly ___ test is most useful to the clinician when it is POSITIVE

A

specific

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Allows up to calculate the net sensitivity and net specificity of using both tests in sequence. After completing both tests there is a loss in net sensitivity and net gain in specificity.

A

Sequential (Two-Stage) Testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

proportion of patients who HAVE the disease and a positive test

POPULATION related

(e.g., HIV prevalence in suburban city in US vs. HIV prevalence in sub-Saharan Africa)

A

Positive Predictive Value (PPV)

With low prevalence (% of population) of disease:
Lower PPV
False positives increase
Less reliable positive test result

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

proportion of patients who DO NOT HAVE the disease, and have a negative test

A

Negative Predictive Value (NPV)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
the occurrence, rate, or frequency of a disease
Incidence Obtained from cohort studies Must follow a cohort through time
26
the number of occurrences at one particular time
Prevalence- Obtained from cross-sectional studies | No time line, only a snap shot
27
Relationship between incidence and prevalence
slide 33
28
Allows the researcher to explore the relationship between two continuous variables
Regression analysis
29
A method of predicting change in the dependent variable by changing one or more independent variables What % of variation in the dependent variable can be explained by a change in the independent variable
Regression analysis
30
Two broad data types
Categorical Continuous
31
Categorical
Nominal Ordinal
32
Continuous
Interval Ration
33
named categories with no implied order
Nominal
34
sequenced or ranked data
Ordinal E.g., smallest to largest, lightest to heaviest, easiest to most difficult
35
intervals along the scale are equal to one another (i.e. integers)
Interval Continuous Data
36
characterized by the presence of absolute zero on the scale Most precise
Ratio Continuous Data
37
In ____ screening, a less ex­pensive, less invasive, or less uncomfortable test is generally performed first, and those who screen positive are recalled for further testing with a more expensive, more invasive, or more uncomfortable test, which may have greater sensitivity and specificity. It is hoped that bringing back for further testing only those who screen positive will reduce the problem of false positives.
sequential or two-stage
38
Summarizes the same kind of information sensitivity and specificity and can be used to calculate the probability of disease in a low prevalence setting
Likelihood ratio (LR)
39
provides indication of the test’s discriminatory power Predictive values are lower with a low prevalence can be defined for the entire range of test result values
Likelihood ratio (LR)
40
is the proportion of diseased people with a negative test result (1-sensitivity) divided by the proportion of non-diseased people with negative test results (specificity)
negative LR (LR-) How good the test is at “Ruling out” disease The smaller the better (Desirable: 0.2 or less)
41
Loss in net specificity, and gain in net specificity
Sequential (Two-stage) Testing
42
Net gain in sensitivity, and net loss in specificity. Patient is considered positive if they test positive on any test or both. Patient is considered negative if they test negative on all the tests performed
Simultaneous Testing
43
one of the most common ways to examine relationships between two or more categorical variables
Chi-square
44
tests the null hypothesis that the variable are independent of each other, that there is no relationship between the two variable
chi-square of independence
45
does not give any information about the strength of the relationship.
Chi-square statistic
46
Computed the same way as the chi-square test for independence, but instead tests the hypothesis that the distribution of some variable is the same in all populations
Chi-square test for equality of proportion
47
Is used to test they hypothesis that the distribution of a categorical variable within a population followed a specific pattern of proportion
Chi-square test of goodness of fit
48
Fisher’s exact test
A non-parametric test similar, similar to the chi-square tests, but can be used with small or sparsely distributed data sets
49
a type of chi-square test used when the data comes from paired samples.
McNemar’s Test for Matched Pairs
50
Measures the strength of association between an exposure and disease the effect of one intervention v another = (AD)/(BC)
Odds ratio (OR)
51
If exposure does not affect (either cause or protect from) disease, the OR is ____ If the exposure is ____ to the disease, the OR > 1 If the exposure is ___ against the disease, the OR < 1
~ 1 related protective
52
to be able to estimate the probability of an outcome associated with a dichotomous response for a single or multiple variables
Logistic Regression
53
a single outcome (or set of outcomes) from an experiment
Event
54
The proportion of subjects in a study group in whom the event is observed. Usually seen as a %.
Rate
55
A measure of how often a particular event (such as response to a drug, adverse event or death) occurs within the scientific control group of an experiment
Control Event Rate (CER) %
56
A measure of how often a particular event (such as response to a drug, adverse event or death) occurs within the experimental group of an experiment
Experimental Event Rate (EER) %
57
Basic risk statements express the likelihood that a particular event will occur within a particular population Identifies what in our environment can lead to beneficial or adverse medical outcomes
Relative risk SAME AS RISK RATIO
58
measures the magnitude of an association between an exposed and non-exposed (control) group. calculated using cumulative incidence data to measure the probability of developing disease
Relative risk (same as Risk Ratio) Must have incidence information to calculate Cohort or clinical trials are conducted over time
59
The percentage difference in outcome between control (C) and experimental (E) groups
Relative risk reduction RRR= (CER-EER)/CER
60
Not a good way to compare outcomes Does not report the baseline risk of outcome Measures such as percent reduction in mortality, is selected because it gives a more optimistic view of the effectiveness of a preventive measure. Makes insignificant findings appear significant
Relative risk reduction
61
The actual reduction in events in the treated group (EER) The arithmetic difference in outcomes between treatment and control groups The “true difference” between the experimental and control intervention
Absolute risk reduction ARR = CER - EER
62
Odds ratio are an ___ estimate of risk,
indirect not a direct measure of risk
63
In a case-control study, only the odds ratio can be calculated as a measure of association ,whereas in a ___ study, either the relative risk or odds ratio can be calculated.
cohort
64
Odds ratios calculated in a case-control study are a good approximation of relative risk in the population when the following conditions are met:
When cases studied are representative, with regard to history of exposure, of all people with the disease in the population from which the cases were drawn. When the controls studied are representative, with regard to history of exposure, of all people without the disease in the population from which the controls were drawn. When the disease being studied does not occur frequently.
65
The number of patients who need to receive the new intervention instead of the standard alternative in order for one additional patient to benefit
Number needed to treat
66
Expresses the likelihood of the treatment to benefit an individual patient There is NO absolute value for NNT that defines whether something is effective or not. NNTs for treatments are usually low because we expect large effects in small numbers of people
Number needed to treat
67
NNTs for very effective treatments are usually in the range of 2 to 4 Rule of thumb: NNT 10 or less for therapy NNT 20 or less for prevention
Larger NNTs can be found useful where few patients are affected in large populations Use for prophylactic measures Example: Aspirin prevents one death at five weeks after a myocardial infarction, NNT of 40
68
When an experimental treatment is detrimental, the term number needed to harm (NNH) is often used. The equations and approach are similar to those described above, except that NNH will have a negative absolute risk reduction
Number needed to harm
69
Generally is used to analyze continuous data Compares the means and standard deviations of two populations Data be must be normally distributed Computes a p-value to test the null hypothesis
T-test
70
Assesses whether a difference between two groups’ averages is unlikely to have occurred because of random chance in sample selection. A difference is more likely to be meaningful and “real” if- The difference between the averages is large. The sample size is large. Responses are consistently close to the average values and not widely spread out (the standard deviation is low).
T-test
71
Independent variable
the variable you're interested in
72
Prevalence
cross-sectional prevalance = disease burden