14. Stats Flashcards

1
Q

What type of data is categorical?

A

Qualitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What type of data is numerical?

A

Quantitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 2 types of categorical data?

A
  1. Nominal

2. Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Data that can be in categories, but have no particular order or magnitude differences?

A

Nominal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Data that can be allocated to an ordered set of categories?

A

Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Discrete data that can only be certain whole numbers and continuous data that can be any numerical value?

A

Numerical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What type of data is blood groups?

A

Nominal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What type of data is AHA class?

A

Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What type of data is # of surgical procedures?

A

Discrete

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What type of data is cardiac index?

A

Continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Case-control advantages?

A
  1. Can study rare disease
  2. Can study disease with long latency between exposure/manifestation
  3. Can be launched/conducted over short time periods
  4. Inexpensive (compared to cohort)
  5. Can study multiple causes of disease
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Case-control disadvantages?

A
  1. Recall bias (information on exposure/past history based on interview)
  2. Validation of info on exposure is difficult
  3. Concerned with one disease only
  4. Can’t provide information on incidence rates of disease
  5. Incomplete control of extraneous variables
  6. Choice of appropriate control group can be challenging
  7. Methodology can be hard to comprehend for non-epidemiologist
  8. Correct interpretation of results can be hard
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Cohort advantages?

A
  1. Complete information on subjects exposure (quality control of data)
  2. Clear temporal sequence of exposure/disease
  3. Study multiple outcomes related to a specific exposure
  4. Calculation of incidence rates (absolute risk and relative risk)
  5. Methodology/results easily understood by non-epidemiologists
  6. Study relatively rare exposures
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Cohort disadvantages?

A
  1. Not suited for rare disease (need large # subjects)
  2. Not suited when time between exposure and disease manifestation is very long (can be overcome in historical cohort studies)
  3. Exposure patterns may change during course of study and make results irrelevant
  4. Maintaining high rates of follow-up can be difficult
  5. Expensive to carry out (need large # subjects)
  6. Baseline data sparse… large # of subjects doesn’t allow for long interviews
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Research involving administration of a test regimen to humans to evaluate both efficacy and safety

A

Clinical trial

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Phases of a clinical trial?

A

1- Safety and pharmacologic profiles
2- Pilot efficacy studies
3- Extensive clinical trial
4- Studies after FDA approval for distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Administration of a single subtherapeutic dose of the drug to a small group (0-15) to gather preliminary data on pharmacokinetics and pharmacodynamics

A

Phase 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

A small group (20-80) of volunteers to assess the safety and pharmacokinetic profile of medication

A

Phase 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

A large group (20-300) to assess safety in a larger group of patients as well as effectiveness of the drug

A

Phase 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Randomized controlled multicenter trial on a relatively large group (300-3000+) depending on the medical condition and is to assess the effectiveness of the drug in comparison with an accepted therapy

A

Phase 3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Safety surveillance and ongoing technical support of a drug after permission for it to be distributed

A

Phase 4

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What type of test is used when numbers in contingency table of categorical variables are relatively small?

A

Fisher exact

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What test is used for two groups with paired data?

A

McNemar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What test is used to measure the difference between actual/expected frequencies of categorical variables?

A

Chi2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What test is an extension of the Chi2 test used when comparing several 2-way tables (meta-analysis)

A

Mantel-Haenszel

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What tests are used to compare samples of normally distributed data?

A

Parametric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What type of tests are used when data are not normally distributed?

A

Non-parametric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What types of tests are student T-test, ANOVA, ANCOVA, Kolmogorov-Smirnov

A

Parametric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What types of tests are Wilcoxon signed-rank, Mann-Whitney U-test, Wilcoxon rank sun, Kruskal-Wallis?

A

Non-parametric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What test is used to compare 2 samples to test probability that samples come from population with same mean value?

A

Student T-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What test is used to compare the means of 2+ samples to see whether they are derived from the same population

A

ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What test is used to compare the means of 2+ samples to see whether they are derived from the same population and accommodates continuous variables?

A

ANCOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What test is used to test hypothesis that the collected data are from a normal distribution so that the parametric stats can be used?

A

Kolmogorov-Smirnov

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What test compares the difference between paired groups?

A

Wilcoxon signed rank

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What non-parametric test is like the t-test for parametric data?

A

Wilcoxon signed-rank

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What tests compare 2 sets of data that are derived from 2 different sets of subjects?

A

Mann-Whitney U-test or Wilcoxon rank sum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What test compares 2+ independent groups (like the ANOVA for parametric)?

A

Kruskal-Wallis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What describes the frequency of occurrence of new cases during a time period?

A

Incidence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What measure is useful to explore causal theories or evaluate effects of preventive measures?

A

Incidence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is the equation for incidence?

A

new cases in a population in a period of time/Sum for each individual in population of length of time at risk for getting disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What is the equation for cumulative incidence?

A

of individuals who get disease in certain period/Number of individuals in population at beginning of period

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What describes what proportion of the population has a disease at a specific point in time?

A

Prevalence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What does prevalence depend on?

A

Incidence and duration

P = I * D

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What measure is relevant to planning of health services or assessing need for medical care in a population

A

Prevalence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What is the equation for prevalence?

A

Existing # of individuals having disease at a specific time/Number of individuals in the population at that point in time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

Does chronic disease have lower prevalence or incidence?

A

Lower incidence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Do acute illnesses have lower prevalence or incidence?

A

Lower prevalence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

What is used to delineate how one set of data relates to another though a best fit line?

A

Regression analysis

*Regression coefficient is the slope of a line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

Name examples of different types of regression analysis?

A
  1. Simple linear
  2. Logistic
  3. Poisson
  4. Cox proportional hazards
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What is the most common survival curve method?

A

Kaplan-Meier curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

What does a Kaplan-Meier curve do?

A

Displays survival of a cohort with calculation of survival estimates upon each death or event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

What is a nonparametric test to compare the survival between 2 potential Kaplan-Meier curves?

A

Log rank test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What is a well-recognized curve that reflects a continuous probability distribution that is bell-shaped (unimodal) and symmetrical about the mean with 2 parameters, mean and variance?

A

Gaussian distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

Which continuous probability distribution most closely resembles the normal of Gaussian distribution?

A

T-distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

What is the measure of dispersion or variability in a sample?

A

Standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

What % of cases fall within 1 SD in normal distribution?

A

68.2%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

What % of cases fall within 2 SD in normal distribution?

A

95.4%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

What % of cases fall within 3 SD in normal distribution?

A

99.7%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

True or False: Mean and median of a normal distribution are equal

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

What is the difference between T-distribution and normal distribution?

A

T-distribution is more spread out with longer tails

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

What distribution is right skewed and characterized by degrees of freedom?

A

Chi2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

What distribution is right skewed used for comparing 2 variances?

A

F distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

What distribution is highly skewed to the right (it is the probability distribution of a random variable whose log follows the normal distribution)?

A

Log normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

Name 2 discrete probability distributions

A
  1. Binomial

2. Poisson

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

What is a confidence interval?

A

Range that is likely to contain the true population mean valve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

What does a 95% confidence interval mean?

A

There is a 95% chance that the population value lies within stated limits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

What indicates variability in a sample?

A

Standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
68
Q

In a normal distribution, 95% of the distribution of the sample means is within what SD of the population mean?

A

1.96

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
69
Q

The size of the CI is related to what?

A

Sample size of study

*Larger the population, narrower the CI

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
70
Q

How is the 95% confidence interval for the mean calculated?

A

Sample mean – 1.96 x SEM to sample mean + 1.96 x SEM

*SD is the SEM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
71
Q

Name the 2 types of applied statistics

A
  1. Descriptive

2. Inferential

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
72
Q

What do descriptive statistics do?

A

Describe data in a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
73
Q

What do inferential statistics do?

A

Estimate whether results suggest a real difference between populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
74
Q

Examples of descriptive statistics?

A
  1. Mean
  2. Median
  3. Mode
  4. SD
  5. Quartiles
  6. Histograms
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
75
Q

Examples of inferential statistics?

A
  1. Student T-test
  2. ANOVA
  3. Chi2
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
76
Q

What is a type I or alpha error?

A

When null hypothesis that is correct is rejected (stating a difference when there isn’t one)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
77
Q

What is the chance of making a type I error?

A

P-value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
78
Q

What is a type II or beta error?

A

When null hypothesis that is incorrect is accepted (stating no difference when there is one)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
79
Q

What is a type III error?

A

Study design that produces the right answer to the wrong question

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
80
Q

What is the p-value?

A

Probability that defines how likely it is that a hypothesis is true (usually null hypothesis- no difference between groups)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
81
Q

What is the probability of an observed difference occurring solely by chance?

A

P-value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
82
Q

What is the usual p-value level of significance?

A

0.01 to 0.05

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
83
Q

What is the method used to adjust P-value for multiple testing?

A

Bonferranoi adjustment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
84
Q

What is the power of a study?

A

Probability that it would detect a statistically significant difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
85
Q

What is B in statistics?

A

Probability of accepting a hypothesis that is false

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
86
Q

What is the equation for the power of a study?

A

1-B

*Probability of rejecting the null hypothesis when it is false

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
87
Q

What is the minimum Power a study should have?

A

80%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
88
Q

What things can increase the power of a study?

A
  1. Larger significance level
  2. Larger effects
  3. Decreased variability of the observations
  4. Larger sample size
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
89
Q

What is this assessment tool…Economic assessment method utilized in which costs and consequences of alternative cardiac interventions are expressed in costs per unit of health outcome. This is applicable to health programs as well as health services to determine preferred action that requires the least cost to produce a given level of effectiveness.

A

CEA: Cost effective analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
90
Q

What is CUA?

A
  • Uses quality-of life measurements expressed as utilities (QALY) in the value equation.
  • Disability-adjusted life year (DALY) is also a measure but is for the overall “burden of disease”
  • Quantifies the impact of premature death (like QALY), but also disability on a population by combining them into a single, comparable metric
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
91
Q

What is CBA?

A

Seeks to translate all relevant healthcare considerations into monetary terms by analyzing economic and social costs of medical care and benefits of reduced loss of net earnings due to preventing premature death or disability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
92
Q

What is a technique where results from a number of studies that are similar in nature are gathered to give one overall estimate of the effect?

A

Meta-analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
93
Q

List the formal steps for a meta-analysis

A
  1. Decide on effect of interest
  2. Check for statistical homogeneity
  3. Estimate average effect of interest with Cis
  4. Interpret the results and present the findings (forest plot)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
94
Q

List advantages of meta-analysis

A
  1. Refinement and reduction
  2. Efficiency
  3. Generalizability and consistency
  4. Reliability
  5. Power/precision
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
95
Q

List disadvantages of meta-analysis

A
  1. Publication bias
  2. Clinical heterogeneity
  3. Quality differences
  4. Lack of independence of study subjects
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
96
Q

What is a systematic review?

A
  • Uses meta-analysis to render well-informed clinical decisions… essential part of evidence based medicine
  • Major disease categories often have a sufficient number of randomized clinical trials for the at minimum a meta-analysis to determine the value of such an intervention
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
97
Q

When is the risk ratio or relative risk used?

A

Prospective cohort studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
98
Q

How is RR calculated?

A

Divide risk in treated/exposed group by risk in control/unexposed group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
99
Q

How is RR reported?

A

Given with a 95% CI

  • Can be <1, 1 or >1
  • If the CI includes 1, not statistically significant
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
100
Q

What is RR similar to?

A

Odds ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
101
Q

What is the relative risk reduction?

A

Proportion by which the intervention reduces the event rate

*Control group risk-Intervention group risk/Control group risk

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
102
Q

What is the absolute risk reduction?

A

Difference between the event rates in the intervention versus control groups

*Control group risk-Intervention group risk

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
103
Q

What is the number needed to treat?

A

Number of patients who need to be treated for one to get benefit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
104
Q

Relationship of NNT and ARR?

A

NNT is reciprocal of ARR

ARR = 100/NNT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
105
Q

When is odds ratio used?

A

Retrospective case-control studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
106
Q

How is the odds ratio calculated?

A

By comparing odds of the exposed versus control groups

*Calculated by dividing the event occurrence by the number of times that the event doesn’t happen

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
107
Q

How is the odds ratio reported?

A

Given with a 95% CI

  • Odds ratio can be <1, 2 >1
  • If it includes 1, it isn’t statically significant
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
108
Q

In a typical receiver operating characteristic (ROC) curve, what is the significance of the upper left corner or coordinate (0,1)?

A

100% sensitivity and specificity

  • Percent classification
  • No false negatives and no false positives
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
109
Q

What is a ROC curve

A

A 2-way plot of the sensitivity (true +) against 1 minus the specificity (false + rate) for different cutoff valves for a continuous variable in a diagnostic test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
110
Q

What shape do you want an ROC curve to have?

A

Sharp upslope then taper off (versus just a straight diagonal line)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
111
Q

What is the measure of precision of the sample mean or how close the sample mean is likely to be to the population mean?

A

Standard error of the mean (SEM)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
112
Q

What is variance?

A

Square of the standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
113
Q

What is the coefficient of variation?

A

Ratio of the SD to the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
114
Q

What is a measure of spread away from the mean?

A

Standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
115
Q

What is the square root of variance?

A

SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
116
Q

What is a measure of precision of the sample mean or how close the sample mean is likely to be to the population mean?

A

SEM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
117
Q

What is the degree of closeness of measurements to quantity’s true value?

A

Accuracy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
118
Q

What is the reproducibility of a study result with the study to be repeated under the same circumstances?

A

Precision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
119
Q

How is precision measured?

A

Standard error of measurement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
120
Q

What is a Chi squared test?

A

Measure of the difference actual and expected frequencies with categorical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
121
Q

What needs to be set up to calculate a Chi2 value?

A

Contingency Table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
122
Q

If there is no difference between the actual and expected values, what is the Chi2 value?

A

0

*Larger the difference, bigger the X2 value (and p-value accompanies X2 value)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
123
Q

What is the number of independent comparisons that can be made between members of a sample and is used with X2 to calculate the p-value?

A

Degree of freedom

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
124
Q

In the example of some kids with SVT being treated with digoxin v. propranolol, what degrees of freedom is needed to calculate a p-value

A

1

*Number of independent comparisons that can be made between members of sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
125
Q

What is sometime used with a Chi2 test to improve the accuracy of the p-value?

A

Yates continuity correction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
126
Q

When is a Fisher exact test used?

A

When numbers in a contingency table of categorical variables are small

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
127
Q

When is a McNemar test used?

A

For 2 groups with paired data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
128
Q

What is an extension of the Chi2 test that is used when comparing several 2-way tables (like a meta-analysis)?

A

Mantel Haenszel test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
129
Q

What is a correlation coefficient?

A

The strength of the linear relationship between 2 variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
130
Q

What is the range of a correlation coefficient?

A

Denoted by r and ranges from -1 to +1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
131
Q

What is sometimes used with correlation coefficient to correct for negatively corrected relationships?

A

R2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
132
Q

When can a correlation coefficient not be calculated?

A

Non-linear relationship

Outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
133
Q

What is a multiple correlation coefficient?

A

When degree of linear relationship is extended to several variables

134
Q

When is Pearson correlation coefficient used instead of Spearman correlation coefficient?

A
  • Pearson (“r”): Values are sampled from normally distributed populations
  • Spearman (“rs”): Values are sampled from non-normally distributed populations
135
Q

Equation for PPV?

A

True positives/Total positives (A/A+B)

136
Q

Probability that a diseased individual is correctly classified as sick?

A

Sensitivity

137
Q

Equation for sensitivity?

A
#Sick classified as sick/Total # sick
A/(A+C)
138
Q

What does sensitivity assess?

A

How often the test is + in patients who have the disease

139
Q

What is the probability that healthy individuals are classified as healthy?

A

Specificity

140
Q

Equation for specificity?

A
# Healthy classified as healthy/Total # healthy
D/(D+B)
141
Q

What does specificity assess?

A

How often a patient tests negative if they are healthy

142
Q

True or false: There is interdependence between sensitivity and specficity?

A

True

143
Q

What is the likelihood that a patient has the disease if they test positive?

A

PPV

144
Q

Equation for PPV?

A

A/(A+B)

145
Q

What is the likelihood that a patient is healthy if they test negative?

A

NPV

146
Q

Equation for NPV?

A

D/(D+C)

147
Q

True or False: The higher the sensitivity/specificity/PPV/NPV, the more valuable the test

A

True

*Perfect test would be calculated at a 1

148
Q

What is a likelihood ratio?

A

Likelihood that a test result would be expected in a patient with the condition compared to the likelihood that the same result would be in a patient without the condition

149
Q

Equation for likelihood ratio?

A

Sensitivity/(1-Specificity)

A/A+C)/(1-(D/D+B)

150
Q

What does a likelihood ratio imply?

A

That if the test is + in a patient, the patient is many more times likely to have the disease than not

151
Q

What is a randomized controlled trial?

A

Patients randomized to receive either new or control treamtent

152
Q

What is involved in ideal randomization for a RCT?

A
  • Equal group sizes
  • Low selection bias
  • Low probability of confounding (accidental bias)
153
Q

What are refinements of simple randomization (as related to RCT)?

A
  • Stratified randomization
  • Blocked randomization
  • Cluster randomization
154
Q

What does stratified randomization control for?

A

Effects of factors

155
Q

What does blocked randomization assure?

A

Treatment groups to be equal sized

156
Q

What does cluster randomization allocate for?

A

Groups of patients

157
Q

What is response-adaptive randomization (or outcome-adaptive randomization)?

A

Probability of being assigned to a group increases if responses of prior patients is deemed favorable

158
Q

What is a placebo-controlled study?

A

Involves a control group that doesn’t receive the treatment

159
Q

What occurs when there is a systematic difference between the results of a study and the true result?

A

Bias

160
Q

What is bias that occurs when a spurious association is noted due to a failure to adjust fully for factors leading to an erroneous conclusion?

A

Confounding bias

161
Q

What is observer bias?

A

Observer inaccurately assess variable

162
Q

What is confounding bias?

A

Spurious association

163
Q

What is selection bias?

A

Selected study subjects are not representative

164
Q

What is information bias?

A

Measurements are incorrectly recorded

165
Q

What is publication bias?

A

Only positive results are published

166
Q

List types of bias

A

Observer, confounding, selection, information, publication, recall, assessment, allocation

167
Q

What is an association?

A

Any relationship between 2 measured quantities that relates them to be statistically dependent

168
Q

What defines a linear relationship between 2 quantities?

A

Correlation

169
Q

What factors allow you to include causation in addition to association?

A
Temporality
Strength of causality
Dose-response
Repetition in a different population
Consistency with other studies
Biologic plausibility
170
Q

What does the Belmont report include?

A

3 principles of research ethics

171
Q

What are the 3 principles of research ethics included in the Belmont report?

A
  • Respect for persons
  • Beneficence
  • Justice
172
Q

What is described by protecting the autonomy of all people and treating them with courtesy and respect and allowing for informed consent?

A

Respect for person

173
Q

A researcher being truthful about the possibility for negative side effects associated with a study drug is abiding by what?

A

Respect for persons

174
Q

What is described by “do no harm” while maximizing benefits for research project and minimizing risks for research subjects?

A

Beneficence

175
Q

What is described by ensuring reasonable, non-exploitative and well-considered procedures are administered fairly and equally?

A

Justice

176
Q

Who is the independent group of experts that continuously monitor data from various aspects of a clinical trial to ensure patient safety as well as validity and scientific merit?

A

Data Safety Monitoring Board

177
Q

What is a committee designed to approve and review research involving human subjects to protect the rights and welfare of human research subjects?

A

IRB (ethical review board)

178
Q

What is the difference between the IRB and DSMB?

A
  • IRB primarily responsible for review of clinical protocols and related documents
  • DSMB main responsibility is to review the trial safety and efficacy data
179
Q

What is the consistency of a set of measurements or a measurement tool or its repeatability and reproducibility?

A

Reliability

180
Q

What is the extent to which a study measures what it is intended to measure?

A

Validity

181
Q

Validity is a measure of what?

A

Systematic error or bias (confounding and selection bias)

182
Q

What is inversely related to random error?

A

Reliability

183
Q

What is the degree of closeness of measurements to the quantity’s true value?

A

Accuracy

184
Q

What is the reproducibility of a study result with the study to be repeated under the same circumstances?

A

Precision

185
Q

What is precision measured by?

A

Standard error of measurement

186
Q

Sample size calculation involves what?

A
  1. Power (0.8)
  2. Significance level (0.01 or 0.05)
  3. Variability of the observations (SD)
  4. Smallest effect of interest (the standardized difference)
187
Q

List advantages of a case-control study

A
  1. Permit study of rare diseases
  2. Permit study of diseases with long latency between exposure and manifestation
  3. Can be launched and conducted over a relatively short time period
  4. Relatively inexpensive (compared to cohort)
  5. Can study multiple potential causes of disease
188
Q

List disadvantages of a case-control study

A
  1. Information on exposure/past history primarily based on interview and may be subject to recall bias
  2. Validation of information on exposure is difficult, incomplete or impossible
  3. Concerned with one disease only
  4. Cannot usually provide information on incidence rates of disease
  5. Generally incomplete control of extraneous variables
  6. Choice of appropriate control group may be difficulty
  7. Methodology may be hard to comprehend for non-epidemiologists
  8. Correct interpretation of results may be difficult
189
Q

List advantages of a cohort study

A
  1. Allow complete information on subject’s exposure, including quality control of data and experience thereafter
  2. Provides a clear temporal sequence of exposure and disease
  3. Gives an opportunity to study multiple outcomes related to a specific exposure
  4. Permits calculation of incidence rates (absolute risk) as well as relative risk
  5. Methodology and results are easily understood by non-epidemiologists
  6. Enable the study of relatively rare exposures
190
Q

List disadvantages of a cohort study

A
  1. Not suited for the study of rare diseases (because large number of subjects are required)
  2. Not suited when the time between exposure and disease manifestation is very long (can be overcome in historical cohort studies)
  3. Exposure patterns may change during course of study and make results irrelevant
  4. Maintaining high rates of follow-up can be difficult
  5. Expensive to carry out (because a large number of subjects are usually required)
  6. Baseline data may be sparse (because large number of subjects doesn’t allow for long interviews)
191
Q

What statistical method allows for paired comparisons of two non-normal patient populations?

A

Wilcoxon signed-rank test

192
Q

What types of tests are used when data isn’t normally distributed?

A

Non-parametric

193
Q

What test is used for nonparametric data when comparing difference between paired groups?

A

Wilcoxon signed-rank test

194
Q

What test is used for parametric data when comparing difference between paired groups?

A

T-test

195
Q

What 2 tests compare 2 set of data that are derived from 2 different sets of subjects?

A

Mann-Whitney U-test

Wilcoxon ranks sum test

196
Q

What test compares 2+ independent groups with nonparametic data?

A

Kruskal-Wallis

197
Q

What test compares 2+ independent groups with parametric data?

A

ANOVA

198
Q

Numerical data, single group tests?

A

1-sample T-test

Sign test

199
Q

Numerical data, 2 paired groups test?

A

Paired t-test

Wilcoxon signed-rank test

200
Q

Numerical data, 2 unpaired groups test?

A

Unpaired t-test

Wilcoxon rank sum test (Mann-Whitney U-test)

201
Q

Numerical data, multiple (>2) groups test?

A

ANOVA (1-way)

Kruskal-Wallis test

202
Q

Categorical data, single group test?

A

Test of single proportion

Sign test

203
Q

Categorical data, 2 paired groups test?

A

McNemar test

204
Q

Categorical data, 2 unpaired groups test?

A

Chi2 test

Fisher exact test (<5)

205
Q

Categorical data, multiple (>2 groups) test?

A

Chi2 test

206
Q

What are 2 types of categorical data?

A

Nominal

Ordinal

207
Q

What is nominal data?

A

Data that describe data that can be in categories, but have no order or magnitude different

Ex. SV or BV surgical strategies or antiarrhythmic agent for SVT

*Type of categorical data

208
Q

What is ordinal data?

A

Data that can be allocated to an ordered set of categories

Ex. Severity of AVVR from mild to severe

*Type of categorical data

209
Q

What are the 2 types of numerical data?

A

Discrete

Continuous

210
Q

What is discrete data?

A

Can only be certain whole numbers

Ex. Number of reinterventions after Norwood

*Type of numerical data

211
Q

What is continuous data?

A

Can be any numerical value

Ex. BP before and after ACEi

*Type of numerical data

212
Q

What is a retrospective study that studies the relationship between risk factor and outcome and uses relevant exposure or condition information from a sample of individuals with the disease or condition (cases) rather than examining the entire population?

A

Case-control

213
Q

What is a qualitative study of a single patient or small group of patients with a similar disease?

A

Case-series

214
Q

What is a prospective observational study with study subjects (cohort) assigned to an exposure or condition category and then all followed for a defined observation period to see whether they develop disease?

A

Cohort

*Follow-up, longitudinal, prospective, historical

215
Q

What is a historical cohort study?

A

A cohort study using a group of patients from the past- wouldn’t involve active enrollment of new study subjects

216
Q

A Chi2 test is most closely related to what statistical test?

A

Fisher exact

217
Q

When is a Fisher’s exact test used?

A

Used when numbers in the contingency table to categorical variables are relatively small

*Chi2 used with large (>5) populations

218
Q

What is a chi2 test a measure of?

A

The difference between actual and expected frequencies with categorical variables with large (>5) populations

219
Q

What type of data is a chi2 and Fisher exact test used for?

A

Categorical

220
Q

ANOVA, student t-test, Kolmogorov-Smirnov and Wilcoxon signed-rank tests are used for what type of data?

A

Numerical

221
Q

Parametric tests are used to compare what type of data?

A

Normally (Gaussian) distributed data

222
Q

Student T-test, ANOVA and Kolmogorov-Smirnov tests are what type of tests?

A

Parametric

223
Q

What test is used to compare 2 samples to test the probability that the samples come from a population with the same mean value?

A

Student t-test

224
Q

What test is used to compare the means of 2+ samples to see whether they are derived from the same population?

A

ANOVA

225
Q

What test is used to test the hypothesis that the collected data are from a normal distribution, so that the parametric statistics can be used?

A

Kolmogorov-Smirnov

226
Q

What type of test is used when data isn’t normally distributed?

A

Non-parametric

227
Q

What type of test is a Wilcoxon signed rank?

A

Non-parametric

228
Q

What test is used for comparing the difference between paired groups (similar to a T-test for paired data) in non-parametric data?

A

Wilcoxon signed rank

229
Q

What is the definition of prevalence?

A

Existing # of individuals having the disease at a specific time/number of individuals in the population at that point in time

230
Q

What describes frequency of occurrence of new cases during a time period?

A

Incidence

231
Q

What is the proportion of the population that has the disease at a specific point in time?

A

Prevalence

232
Q

What 2 things does prevalence depend on?

A

Incidence and duration of the disease

P = I * D

233
Q

What is an economic assessment methodology that seeks to translate all relevant healthcare considerations into monetary terms by analyzing economic and social costs of medical care and benefits of reduced loss of net earnings due to preventing premature death or disability?

A

CBA

234
Q

What is an economic assessment method in which the costs and consequences of alternative interventions are expressed in costs per unit of health outcome?

A

CEA

235
Q

What uses quality of life measurements expressed as utilities (QALY) in the value equation?

A

CUA

236
Q

What are the designation levels described by the US Preventive Services Task Force used in a review article for medical therapies?

A

A: Good scientific evidence, benefits substantially outweigh risk
B: Fair scientific evidence, benefits outweigh risk
C: At least fair scientific evidence, benefits and risk too close
D: At least fair scientific evidence that risks outweigh the benefit
I: Scientific evidence is lacking, poor quality or conflicting

237
Q

Equation for sensitivity?

A

A/A+C

238
Q

Equation for specificity?

A

D/B+D

239
Q

Equation for PPV?

A

A/A+B

240
Q

Equation for NPV?

A

D/C+D

241
Q

What is a likelihood ratio?

A

Likelihood that a test result would be expected in a patient with the condition compared to the likelihood that the same result would be in a patient without the condition

242
Q

Equation for likelihood ratio?

A

Sensitivity/(1-Specificity)

243
Q

How often the test is positive if the patient has the disease?

A

Sensitivity

244
Q

How often the test is negative if the patient is healthy?

A

Specificity

245
Q

Likelihood that the patient has the disease if the test is positive

A

PPV

246
Q

Likelihood that the patient is healthy if the test is negative?

A

NPV

247
Q

What is a perfect likelihood ratio?

A

1

*Higher the calculated value, the more valuable the test

248
Q

When is the risk ratio (or relative risk) used?

A

Prospective cohort studies

249
Q

How is risk ratio or relative risk calculated?

A

Dividing the risk in the treated or exposure group by the risk in the control or unexposed group

250
Q

How is risk ratio reported?

A

<1, 1, or >1

Given with a 95% CI… if the CI includes 1, it isn’t statistically significant

251
Q

What is the proportion by which the intervention reduces the event rate

A

Relative risk reduction (RRR)

252
Q

What is the difference between the event rates in the intervention v. control groups?

A

Absolute risk reduction (ARR)

253
Q

What is the number of patients who need to be treated for one to get benefit?

A

Number needed to treat (NNT)

254
Q

What is the equation for NNT?

A

100/ARR

*Reciprocal of ARR

255
Q

The most reliable results occur with that type of study?

A

Double-blind, placebo-controlled trial

256
Q

What is the degree a study produces consistent results?

A

Reliability

257
Q

True or False: You improve reliability to minimize bias in a study?

A

True

258
Q

What is the practice of randomly assigning enrolled patients in one of treatment or control groups?

A

Randomization

259
Q

What is study design in a way that providers administering intervention, measuring outcomes and patients receiving therapy are unaware who is in what group

A

Double Blinding

260
Q

What kind of bias can randomization and double blinding minimize?

A

Susceptibility

261
Q

What kind of bias occurs when differences in subjects at baseline between the compared groups cause differences in outcomes beyond what difference in interventions would otherwise cause?

A

Susceptibility

262
Q

What is important in study design to assure changes wouldn’t be seen in study groups regardless of intervention?

A

Placebo control group

263
Q

What type of study involves reviewing RFs for patients who have the disease of interest and comparable control patients who don’t?

A

Case-control studies

264
Q

What are case control studies used for?

A

Determine likelihood that various RFs are more or less associated with the cases versus controls

265
Q

What type of study entails prospectively following patients with a given exposure and those without?

A

Cohort

266
Q

What are 2 advantages of case-control studies?

A

Study multiple RFs and rare conditions

267
Q

What study design involves collection and analysis of data collected from a population at one specific point in time?

A

Cross-sectional

268
Q

A representative cohort of families surveyed to help determine prevalence of chest pain in a pediatric population is an example of what?

A

Cross-sectional study

269
Q

What study design is a review of records from a cohort of patients?

A

Retrospective cohort

270
Q

What study design follows a group of patients forward in time to determine which develop disease?

A

Prospective cohort

271
Q

What study design is useful to determine the prevalence of a disease?

A

Cross-sectional

272
Q

The more participants in a study, the higher the what?

A

Sensitivity for detecting adverse events

273
Q

What is a type II error?

A

Inability to reject null hypothesis when a difference between study groups truly exist

274
Q

What represents a false-negative finding?

A

Type II error

275
Q

How can you decrease the risk of a Type II error?

A

Increase the sample size studied to increase power and ability to find a difference if one truly exists

276
Q

What refers to rejection of the null hypothesis when a true difference doesn’t exist?

A

Type I error

277
Q

What represents a false-positive finding?

A

Type I error

278
Q

How can you impact a Type I error in a study?

A

By adjusting the significance rate

279
Q

What is a significance rate typically set at in a study?

A
  1. 05
    * Statistical analysis of results must show <5% chance that results are related to chance versus true difference to be termed “significant”
280
Q

What is a type of study design where patients with exposure to intervention of interest and those without are followed forward in time for development of measured outcome?

A

Prospective cohort

*Not retrospective because patients followed forward in time rather than record review

281
Q

What is healthy entrant effect?

A

Lower morbidity/mortality in patients entering a study than general population due to study design

282
Q

What is the ability of a test to correctly identify those without disease?

A

Specificity

283
Q

What is the ability of a test to predict patients without disease?

A

NPV

284
Q

What is a tests ability to demonstrate accurate value?

A

Validity

285
Q

What is a tests ability to get consistent results?

A

Reliability

286
Q

What does a 95% CI represent?

A

Range of values that are 95% certain to contain the true mean for the population based on data from respective cohort

  • DOESN’T represent values between which 95% of the sample or population values fall
  • Calculated around the mean value for each group
287
Q

What is recall bias?

A

Inaccurate recollection of events by study participants

*Systematic error

288
Q

What is lead-time bias?

A

Disease recognized earlier and survival hasn’t really changed versus test improving survival time

289
Q

What is selection bias?

A

Nonrandom collection of participants

290
Q

What is referral bias?

A

Only subset of population included

Ex: Study at tertiary care center with only sickest patients and not representative of population of interest

291
Q

What statistically describes an association between an exposure and risk of outcome of interest?

A

Odds ratio

292
Q

What does a negative odds ratio indicate?

A

Decreased risk

293
Q

What does a positive odds ratio indicate?

A

Increased risk

294
Q

If the 95% CI for an odds ratio includes 1, what does that mean?

A

Likely not statistically significant

295
Q

What type of statistical analysis is most appropriate for assessment of a continuous variable both pre/post intervention in the same patient?

A

Paired student t-test

  • Continuous variable with N>25 = Parametric test/Student t-test
  • Paired because same patient studied before/after intervention, so 2 matched cohorts
296
Q

What type of test is used to compare cohort means in samples that aren’t normally distributed or have a low number of participants?

A

Wilcoxon signed-rank

*Nonparametric

297
Q

What type of test is used to compare categorical outcomes versus continuous variables?

A

Chi2

298
Q

What demonstrates the odds of developing a given outcome in patients with a particular exposure and those without?

A

Odds ratio

299
Q

What is a method to display survival results grafically?

A

Kaplan-Meier curve

300
Q

What describes the odds of the outcome of interest in those patients with the exposure?

A

Odds ratio

301
Q

What is the equation for an odds ratio?

A

(AD)/(BC)

302
Q

True or False: Can calculate an odds ratio from multiple different study designs including case-control studies

A

True

303
Q

True or False: Chi2 analysis is used to statistically evaluate continuous data?

A

False- Evaluates nominal data, not appropriate for continuous

304
Q

What assumptions are needed to use a Chi2 analysis?

A
  • Random/completely independent study groups
  • All cells of table must have expected value >5
  • Data must be arranged in table form (nominal)
305
Q

What does a statistically significant P-value of <0.05 represent?

A

<5% chance that data distribution in the study could have occurred by random chance

306
Q

What is the most appropriate test for comparison of survival between 2 non-normally distributed groups, each with 10 participants?

A

Fischer exact test

307
Q

What does normal distribution of data in a study refer to?

A

Random data demonstrates a bell curve when graphed

*Data may be skewed or non-normally distributed especially if a small # of participants

308
Q

What is a non-parametric test used to statistically analyze association between 2 groups which aren’t normally distributed?

A

Fischer exact test

309
Q

What test must have normally distributed data with at least 5 participants in each cell of the table?

A

Chi2

*May provide falsely low p-value if used incorrectly

310
Q

A Wilcoxon signed-rank test and t-test are used for what type of data analysis?

A

Continuous

311
Q

What are censored patients in a survival curve (Kaplan-Meier)?

A

Drop out of the study for reasons other than event of interest (death)

312
Q

What are non-censored patients in a survival curve (Kaplan-Meier)?

A

Had event of interest (death)

313
Q

NNT cannot be calculated with that type of data?

A

Continuous

*Data has to be categorical and binary to calculate

314
Q

Equation for NNT?

A

1/ARR

315
Q

What type of test demonstrates agreement between 2 groups when there isn’t a gold standard?

A

Kappa statistic

316
Q

What are the values of a Kappa statistic?

A

-1 (negative association)
to
+1 (positive association)

0 demonstrates no association

317
Q

When can sensitivity calculations and ROC be used to evaluate a test?

A

When there is a clear gold standard

318
Q

What does a regression analysis demonstrate?

A

Relationship between a dependent variable and 1+ independent variables

319
Q

What is a nonparamateric test used to analyze categorical data with sample sizes too small to allow for use of Chi2?

A

Fischer exact test

320
Q

A Chi2 test requires how many patients in each cell?

A

> 5

321
Q

A student T-test uses what type of data?

A

Continuous

322
Q

What is used to compare mean values from 3+ groups?

A

ANOVA

323
Q

What test is the non-parametric equivalent of an ANOVA?

A

Kruskall-Wallis

324
Q

What is a parametric test used to compare group means with 3+ independent groups?

A

ANOVA

325
Q

What does it mean if the p-value for an ANOVA is <0.05?

A

There is a difference in the group means among multiple study groups, but not specifically where difference lies or how great a difference there is
-Need additional analysis comparing each group to other to find where exact difference is

326
Q

What test is used to compare mean values among multiple groups when ANOVA isn’t appropriate (data aren’t normally distributed)?

A

Kruskal-Wallis

327
Q

What type of data does Chi2 evaluate?

A

Categorical

328
Q

What does it mean is a P-value is <0.05?

A

Null hypothesis disproven and there is a difference between study groups

329
Q

What is the null hypothesis when comparing multiple independent variables to one dependent outcome?

A

None of the independent variables is associated with the outcome of interest

330
Q

What is a type I error?

A

Null hypothesis rejected, but really true (False+)

331
Q

What is a type II error?

A

Null hypothesis isn’t rejection, but really is difference (False-)