14. Stats Flashcards

Question

What test is an extension of the Chi2 test used when comparing several 2-way tables (meta-analysis)

Answer 1

Mantel-Haenszel

Answer 2

Parametric

Answer 3

Non-parametric

Answer 4

Parametric

Answer 5

Non-parametric

Answer 6

Student T-test

Answer 7

Kolmogorov-Smirnov

Answer 8

Wilcoxon signed rank

Answer 9

Wilcoxon signed-rank

Answer 10

Mann-Whitney U-test or Wilcoxon rank sum

Answer 11

Kruskal-Wallis

Answer 12

new cases in a population in a period of time/Sum for each individual in population of length of time at risk for getting disease

Answer 13

of individuals who get disease in certain period/Number of individuals in population at beginning of period

Answer 14

Prevalence

Answer 15

Incidence and duration P = I * D

Answer 16

Prevalence

Answer 17

Existing # of individuals having disease at a specific time/Number of individuals in the population at that point in time

Answer 18

Lower incidence

Answer 19

Lower prevalence

Answer 20

Regression analysis *Regression coefficient is the slope of a line

Answer 21

1. Simple linear 2. Logistic 3. Poisson 4. Cox proportional hazards

Answer 22

Kaplan-Meier curve

Answer 23

Displays survival of a cohort with calculation of survival estimates upon each death or event

Answer 24

Log rank test

Answer 25

Gaussian distribution

Answer 26

T-distribution

Answer 27

Standard deviation

Answer 28

T-distribution is more spread out with longer tails

Answer 29

F distribution

Answer 30

Log normal distribution

Answer 31

1. Binomial | 2. Poisson

Answer 32

Range that is likely to contain the true population mean valve

Answer 33

There is a 95% chance that the population value lies within stated limits

Answer 34

Standard deviation

Answer 35

Sample size of study *Larger the population, narrower the CI

Answer 36

Sample mean – 1.96 x SEM to sample mean + 1.96 x SEM *SD is the SEM

Answer 37

1. Descriptive | 2. Inferential

Answer 38

Describe data in a sample

Answer 39

Estimate whether results suggest a real difference between populations

Answer 40

1. Mean 2. Median 3. Mode 4. SD 5. Quartiles 6. Histograms

Answer 41

1. Student T-test 2. ANOVA 3. Chi2

Answer 42

When null hypothesis that is correct is rejected (stating a difference when there isn’t one)

Answer 43

When null hypothesis that is incorrect is accepted (stating no difference when there is one)

Answer 44

Study design that produces the right answer to the wrong question

Answer 45

Probability that defines how likely it is that a hypothesis is true (usually null hypothesis- no difference between groups)

Answer 46

0.01 to 0.05

Answer 47

Bonferranoi adjustment

Answer 48

Probability that it would detect a statistically significant difference

Answer 49

Probability of accepting a hypothesis that is false

Answer 50

1-B *Probability of rejecting the null hypothesis when it is false

Answer 51

1. Larger significance level 2. Larger effects 3. Decreased variability of the observations 4. Larger sample size

Answer 52

CEA: Cost effective analysis

Answer 53

- Uses quality-of life measurements expressed as utilities (QALY) in the value equation. - Disability-adjusted life year (DALY) is also a measure but is for the overall “burden of disease” - Quantifies the impact of premature death (like QALY), but also disability on a population by combining them into a single, comparable metric

Answer 54

Seeks to translate all relevant healthcare considerations into monetary terms by analyzing economic and social costs of medical care and benefits of reduced loss of net earnings due to preventing premature death or disability

Answer 55

Meta-analysis

Answer 56

1. Decide on effect of interest 2. Check for statistical homogeneity 3. Estimate average effect of interest with Cis 4. Interpret the results and present the findings (forest plot)

Answer 57

1. Refinement and reduction 2. Efficiency 3. Generalizability and consistency 4. Reliability 5. Power/precision

Answer 58

1. Publication bias 2. Clinical heterogeneity 3. Quality differences 4. Lack of independence of study subjects

Answer 59

- Uses meta-analysis to render well-informed clinical decisions... essential part of evidence based medicine - Major disease categories often have a sufficient number of randomized clinical trials for the at minimum a meta-analysis to determine the value of such an intervention

Answer 60

Prospective cohort studies

Answer 61

Divide risk in treated/exposed group by risk in control/unexposed group

Answer 62

Given with a 95% CI * Can be <1, 1 or >1 * If the CI includes 1, not statistically significant

Answer 63

Odds ratio

Answer 64

Proportion by which the intervention reduces the event rate *Control group risk-Intervention group risk/Control group risk

Answer 65

Difference between the event rates in the intervention versus control groups *Control group risk-Intervention group risk

Answer 66

Number of patients who need to be treated for one to get benefit

Answer 67

NNT is reciprocal of ARR ARR = 100/NNT

Answer 68

Retrospective case-control studies

Answer 69

By comparing odds of the exposed versus control groups *Calculated by dividing the event occurrence by the number of times that the event doesn’t happen

Answer 70

Given with a 95% CI * Odds ratio can be <1, 2 >1 * If it includes 1, it isn't statically significant

Answer 71

100% sensitivity and specificity * Percent classification * No false negatives and no false positives

Answer 72

A 2-way plot of the sensitivity (true +) against 1 minus the specificity (false + rate) for different cutoff valves for a continuous variable in a diagnostic test

Answer 73

Sharp upslope then taper off (versus just a straight diagonal line)

Answer 74

Standard error of the mean (SEM)

Answer 75

Square of the standard deviation

Answer 76

Ratio of the SD to the mean

Answer 77

Standard deviation

Answer 78

Standard error of measurement

Answer 79

Measure of the difference actual and expected frequencies with categorical variables

Answer 80

Contingency Table

Answer 81

0 *Larger the difference, bigger the X2 value (and p-value accompanies X2 value)

Answer 82

Degree of freedom

Answer 83

1 *Number of independent comparisons that can be made between members of sample

Answer 84

Yates continuity correction

Answer 85

When numbers in a contingency table of categorical variables are small

Answer 86

For 2 groups with paired data

Answer 87

Mantel Haenszel test

Answer 88

The strength of the linear relationship between 2 variables

Answer 89

Denoted by r and ranges from -1 to +1

Answer 90

Non-linear relationship | Outliers

Answer 91

When degree of linear relationship is extended to several variables

Answer 92

- Pearson ("r"): Values are sampled from normally distributed populations - Spearman ("rs"): Values are sampled from non-normally distributed populations

Answer 93

True positives/Total positives (A/A+B)

Answer 94

Sensitivity

Answer 95

``` #Sick classified as sick/Total # sick A/(A+C) ```

Answer 96

How often the test is + in patients who have the disease

Answer 97

Specificity

Answer 98

``` # Healthy classified as healthy/Total # healthy D/(D+B) ```

Answer 99

How often a patient tests negative if they are healthy

Answer 100

True *Perfect test would be calculated at a 1

Answer 101

Likelihood that a test result would be expected in a patient with the condition compared to the likelihood that the same result would be in a patient without the condition

Answer 102

Sensitivity/(1-Specificity) | A/A+C)/(1-(D/D+B)

Answer 103

That if the test is + in a patient, the patient is many more times likely to have the disease than not

Answer 104

Patients randomized to receive either new or control treamtent

Answer 105

- Equal group sizes - Low selection bias - Low probability of confounding (accidental bias)

Answer 106

- Stratified randomization - Blocked randomization - Cluster randomization

Answer 107

Effects of factors

Answer 108

Treatment groups to be equal sized

Answer 109

Groups of patients

Answer 110

Probability of being assigned to a group increases if responses of prior patients is deemed favorable

Answer 111

Involves a control group that doesn't receive the treatment

Answer 112

Confounding bias

Answer 113

Observer inaccurately assess variable

Answer 114

Spurious association

Answer 115

Selected study subjects are not representative

Answer 116

Measurements are incorrectly recorded

Answer 117

Only positive results are published

Answer 118

Observer, confounding, selection, information, publication, recall, assessment, allocation

Answer 119

Any relationship between 2 measured quantities that relates them to be statistically dependent

Answer 120

Correlation

Answer 121

``` Temporality Strength of causality Dose-response Repetition in a different population Consistency with other studies Biologic plausibility ```

Answer 122

3 principles of research ethics

Answer 123

- Respect for persons - Beneficence - Justice

Answer 124

Respect for person

Answer 125

Respect for persons

Answer 126

Beneficence

Answer 127

Data Safety Monitoring Board

Answer 128

IRB (ethical review board)

Answer 129

- IRB primarily responsible for review of clinical protocols and related documents - DSMB main responsibility is to review the trial safety and efficacy data

Answer 130

Reliability

Answer 131

Systematic error or bias (confounding and selection bias)

Answer 132

Reliability

Answer 133

Standard error of measurement

Answer 134

1. Power (0.8) 2. Significance level (0.01 or 0.05) 3. Variability of the observations (SD) 4. Smallest effect of interest (the standardized difference)

Answer 135

1. Permit study of rare diseases 2. Permit study of diseases with long latency between exposure and manifestation 3. Can be launched and conducted over a relatively short time period 4. Relatively inexpensive (compared to cohort) 5. Can study multiple potential causes of disease

Answer 136

1. Information on exposure/past history primarily based on interview and may be subject to recall bias 2. Validation of information on exposure is difficult, incomplete or impossible 3. Concerned with one disease only 4. Cannot usually provide information on incidence rates of disease 5. Generally incomplete control of extraneous variables 6. Choice of appropriate control group may be difficulty 7. Methodology may be hard to comprehend for non-epidemiologists 8. Correct interpretation of results may be difficult

Answer 137

1. Allow complete information on subject's exposure, including quality control of data and experience thereafter 2. Provides a clear temporal sequence of exposure and disease 3. Gives an opportunity to study multiple outcomes related to a specific exposure 4. Permits calculation of incidence rates (absolute risk) as well as relative risk 5. Methodology and results are easily understood by non-epidemiologists 6. Enable the study of relatively rare exposures

Answer 138

1. Not suited for the study of rare diseases (because large number of subjects are required) 2. Not suited when the time between exposure and disease manifestation is very long (can be overcome in historical cohort studies) 3. Exposure patterns may change during course of study and make results irrelevant 4. Maintaining high rates of follow-up can be difficult 5. Expensive to carry out (because a large number of subjects are usually required) 6. Baseline data may be sparse (because large number of subjects doesn't allow for long interviews)

Answer 139

Wilcoxon signed-rank test

Answer 140

Non-parametric

Answer 141

Wilcoxon signed-rank test

Answer 142

Mann-Whitney U-test | Wilcoxon ranks sum test

Answer 143

Kruskal-Wallis

Answer 144

1-sample T-test | Sign test

Answer 145

Paired t-test | Wilcoxon signed-rank test

Answer 146

Unpaired t-test | Wilcoxon rank sum test (Mann-Whitney U-test)

Answer 147

ANOVA (1-way) | Kruskal-Wallis test

Answer 148

Test of single proportion | Sign test

Answer 149

McNemar test

Answer 150

Chi2 test | Fisher exact test (<5)

Answer 151

Nominal | Ordinal

Answer 152

Data that describe data that can be in categories, but have no order or magnitude different Ex. SV or BV surgical strategies or antiarrhythmic agent for SVT *Type of categorical data

Answer 153

Data that can be allocated to an ordered set of categories Ex. Severity of AVVR from mild to severe *Type of categorical data

Answer 154

Discrete | Continuous

Answer 155

Can only be certain whole numbers Ex. Number of reinterventions after Norwood *Type of numerical data

Answer 156

Can be any numerical value Ex. BP before and after ACEi *Type of numerical data

Answer 157

Case-control

Answer 158

Case-series

Answer 159

Cohort *Follow-up, longitudinal, prospective, historical

Answer 160

A cohort study using a group of patients from the past- wouldn't involve active enrollment of new study subjects

Answer 161

Fisher exact

Answer 162

Used when numbers in the contingency table to categorical variables are relatively small *Chi2 used with large (>5) populations

Answer 163

The difference between actual and expected frequencies with categorical variables with large (>5) populations

Answer 164

Categorical

Answer 165

Normally (Gaussian) distributed data

Answer 166

Parametric

Answer 167

Student t-test

Answer 168

Kolmogorov-Smirnov

Answer 169

Non-parametric

Answer 170

Non-parametric

Answer 171

Wilcoxon signed rank

Answer 172

Existing # of individuals having the disease at a specific time/number of individuals in the population at that point in time

Answer 173

Prevalence

Answer 174

Incidence and duration of the disease | P = I * D

Answer 175

A: Good scientific evidence, benefits substantially outweigh risk B: Fair scientific evidence, benefits outweigh risk C: At least fair scientific evidence, benefits and risk too close D: At least fair scientific evidence that risks outweigh the benefit I: Scientific evidence is lacking, poor quality or conflicting

Answer 176

Likelihood that a test result would be expected in a patient with the condition compared to the likelihood that the same result would be in a patient without the condition

Answer 177

Sensitivity/(1-Specificity)

Answer 178

Sensitivity

Answer 179

Specificity

Answer 180

1 *Higher the calculated value, the more valuable the test

Answer 181

Prospective cohort studies

Answer 182

Dividing the risk in the treated or exposure group by the risk in the control or unexposed group

Answer 183

<1, 1, or >1 | Given with a 95% CI... if the CI includes 1, it isn't statistically significant

Answer 184

Relative risk reduction (RRR)

Answer 185

Absolute risk reduction (ARR)

Answer 186

Number needed to treat (NNT)

Answer 187

100/ARR *Reciprocal of ARR

Answer 188

Double-blind, placebo-controlled trial

Answer 189

Reliability

Answer 190

Randomization

Answer 191

Double Blinding

Answer 192

Susceptibility

Answer 193

Susceptibility

Answer 194

Placebo control group

Answer 195

Case-control studies

Answer 196

Determine likelihood that various RFs are more or less associated with the cases versus controls

Answer 197

Study multiple RFs and rare conditions

Answer 198

Cross-sectional

Answer 199

Cross-sectional study

Answer 200

Retrospective cohort

Answer 201

Prospective cohort

Answer 202

Cross-sectional

Answer 203

Sensitivity for detecting adverse events

Answer 204

Inability to reject null hypothesis when a difference between study groups truly exist

Answer 205

Type II error

Answer 206

Increase the sample size studied to increase power and ability to find a difference if one truly exists

Answer 207

Type I error

Answer 208

Type I error

Answer 209

By adjusting the significance rate

Answer 210

0. 05 * Statistical analysis of results must show <5% chance that results are related to chance versus true difference to be termed “significant”

Answer 211

Prospective cohort *Not retrospective because patients followed forward in time rather than record review

Answer 212

Lower morbidity/mortality in patients entering a study than general population due to study design

Answer 213

Specificity

Answer 214

Reliability

Answer 215

Range of values that are 95% certain to contain the true mean for the population based on data from respective cohort * DOESN'T represent values between which 95% of the sample or population values fall * Calculated around the mean value for each group

Answer 216

Inaccurate recollection of events by study participants *Systematic error

Answer 217

Disease recognized earlier and survival hasn't really changed versus test improving survival time

Answer 218

Nonrandom collection of participants

Answer 219

Only subset of population included Ex: Study at tertiary care center with only sickest patients and not representative of population of interest

Answer 220

Odds ratio

Answer 221

Decreased risk

Answer 222

Increased risk

Answer 223

Likely not statistically significant

Answer 224

Paired student t-test * Continuous variable with N>25 = Parametric test/Student t-test * Paired because same patient studied before/after intervention, so 2 matched cohorts

Answer 225

Wilcoxon signed-rank *Nonparametric

Answer 226

Odds ratio

Answer 227

Kaplan-Meier curve

Answer 228

Odds ratio

Answer 229

(A*D)/(B*C)

Answer 230

False- Evaluates nominal data, not appropriate for continuous

Answer 231

- Random/completely independent study groups - All cells of table must have expected value >5 - Data must be arranged in table form (nominal)

Answer 232

<5% chance that data distribution in the study could have occurred by random chance

Answer 233

Fischer exact test

Answer 234

Random data demonstrates a bell curve when graphed *Data may be skewed or non-normally distributed especially if a small # of participants

Answer 235

Fischer exact test

Answer 236

Chi2 *May provide falsely low p-value if used incorrectly

Answer 237

Continuous

Answer 238

Drop out of the study for reasons other than event of interest (death)

Answer 239

Had event of interest (death)

Answer 240

Continuous *Data has to be categorical and binary to calculate

Answer 241

Kappa statistic

Answer 242

-1 (negative association) to +1 (positive association) 0 demonstrates no association

Answer 243

When there is a clear gold standard

Answer 244

Relationship between a dependent variable and 1+ independent variables

Answer 245

Fischer exact test

Answer 246

Continuous

Answer 247

Kruskall-Wallis

Answer 248

There is a difference in the group means among multiple study groups, but not specifically where difference lies or how great a difference there is -Need additional analysis comparing each group to other to find where exact difference is

Answer 249

Kruskal-Wallis

Answer 250

Categorical

Answer 251

Null hypothesis disproven and there is a difference between study groups

Answer 252

None of the independent variables is associated with the outcome of interest

Answer 253

Null hypothesis rejected, but really true (False+)

Answer 254

Null hypothesis isn't rejection, but really is difference (False-)

14. Stats Flashcards

(331 cards)