Stats and CAT stuff Flashcards

1
Q

What is a 95% confidence interval?

A

The range of values that are 95% certain to contain the true value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is standard deviation?

A

A value that shows how much variation there is from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a P-value, & what is the cut off?

A
  • A value that shows how likely it is that results are due to chance
  • P>0.05 is not statistically significant
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is power?

A

The probability of correctly reject the null hypothesis when it is false

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the purpose of blinding?

A
  • To reduce conscious or subconscious bias
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is concealment of allocation, and what bias does it reduce?

A
  • When someone running a trial doesn’t know what group a participant will be put in
  • Reduce selection bias
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is intention to treat analysis, and give an example of when it may be used?

A
  • Analysis based the initial intended intervention rather than what was ultimately given
  • Can be used when someone drops out of a study
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is treatment fidelity?

A

How well an intervention is reproduced from a protocol or animal model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the purpose of randomisation?

A
  • To equally distribute possible confounding factors between different study groups
  • Reduces selection bias
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is internal validity?

A
  • Accuracy
  • How well a study is conducted, taking into account confounders and removing bias
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is external validity?

A
  • Generalisability
  • How well can the results of a study be applied to different patients, situations and environment in the real world
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a type 1 error?

A
  • Rejecting the null hypothesis when it is true
  • False positive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a type 2 error?

A
  • Accepting the null hypothesis when it is false
  • False negative
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is absolute risk?

A

The risk of disease/death in the population being studied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is relative risk?

A

The risk of disease/death in the exposed, compared to that in the non-exposed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the risk ratio?

A

Probability of disease/death in the at-risk group/that of the not-at-risk group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the odds ratio?

A

Ratio of odds of disease/death in exposed, compared to that of the non-exposed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is absolute risk reduction?

A

Rate of disease/death in the unexposed minus the rate in the exposed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the formula for absolute risk reduction?

A

CER-EER

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is attributable risk?

A
  • Rate of disease/death in exposed minus the rate in unexposed
  • (opposite of absolute ARR)
  • (think of it as the actual risk that can be ATTRIBUTED to a risk factor by removing the natural rate of developing a disease in the unexposed)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the formula for attributable risk?

A

EER-CER

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is a hazard ratio, and when might it be used?

A
  • Similar to relative risk, but when a risk isn’t constant to time
  • E.g. when looking at survival time
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is numbers needed to treat, and how should you deal with a decimal NNT figure?

A
  • The number of people required to receive an intervention to produce 1 positive outcome
  • Always round up
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is numbers needed to harm, and how should you deal with a decimal NNH figure?

A
  • The number of people required to receive an intervention to produce 1 adverse outcome
  • Always round down
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is the formula for NNT?

A

NNT = 1/ARR = 1/CER-EER

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is the formula for NNH?

A

NNH = 1/attributable risk = 1/EER-CER

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is the definition of a P-value?

A

The probability of receiving a result by chance that is at least as extreme as the true value, assuming the null hypothesis is true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What does it mean if the P-value of a study that found a new chemo drug to be 50% more effective than cisplatin was 0.1, and what would you do with this result?

A
  • That there is a 10% probability that the new drug will improve chemotherapy outcomes by 50% due to chance alone
  • Therefore we ignore the result and accept the null hypothesis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

How are type 1 errors affected by sample size?

A

They aren’t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

How are type 1 errors affected by the number of possible outcomes/end-points?

A
  • Type 1 errors increase with more possible outcomes/end-points
  • (More likely one will occur due to chance, therefore more likely to have false positive)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

How are type 2 errors affected by sample size?

A
  • Type 2 errors are decreased by a larger sample size
  • (occur when study doesn’t have much power)
  • (larger sample means more likely difference is seen, therefore reducing likelihood of false negative)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is the formula for power?

A

1 - probability of type 2 error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

How is power affected by sample size?

A

Power increases with increased sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is sensitivity?

A
  • The proportion of people with a disease who get a positive test result
  • “how good is a test at detecting disease?”
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is specificity?

A
  • The proportion of people without a disease who receive a negative result
  • “how good is a test at excluding disease?”
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is the formula for sensitivity?

A

TP/(TP+FN)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is the formula for specificity?

A

TN/(TN+FP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What is positive predictive value?

A

The chance that a patient has a disease if a test is positive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What is negative predictive value?

A

The chance that a patient doesn’t have a disease if a test is negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is the formula of PPV?

A

TP/(TP+FP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What is the formula for NPV?

A

TN/(TN+FN)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What is the likelihood ratio for a positive test?

A

How much the odds of a disease increase with a positive test result

43
Q

What is the likelihood ratio for a negative test?

A

How much the odds of a disease decrease with a negative test

44
Q

What is the formula for the positive likelihood ratio?

A

Sensitivity/(1-specificity)

45
Q

What is the formula for for the negative likelihood ratio?

A

(1-sensitivity)/specificty

46
Q

How are PPV and NPV affected by prevalence?

A
  • Increased prevalence increases PPV and decreases NPV
  • Vice versa for decreased prevalence
47
Q

How are likelihood ratios affected by prevalence?

A

They aren’t

48
Q

What does the correlation coefficient indicate?

A
  • How closely the plotted points lie to a line drawn thought the plotted data
  • Essentially the strength of correlation between 2 variables
49
Q

What does the correlation coefficient not tell you?

A

How much one variable will change in relation to another

50
Q

What its the correlation coefficient denoted by, and what is the range of possible values?

A
  • Denoted by the r-value
  • -1≤r≤1
51
Q

What can linear regression be used to predict?

A

How much one variable will change in relation to another being changed

52
Q

What is an equation for linear regression?

A
  • y = mx + c
  • Essentially the equation for a line
53
Q

What do the letters stand for in the regression equation?

A
  • y: the variable being studied
  • m: the gradient of the line
  • x: the variable being changed
  • c: the y-intercept (y value when x=0)
54
Q

What is odds?

A
  • The ratio of the number of people who develop an outcome compared to those who don’t
  • Can be >1
55
Q

What is risk?

A
  • The ratio of the number of people who develop an outcome compared to the total number people
  • Remember risk=probability
  • Must be <1
56
Q

When are odds often used?

A
  • Case-control studies
57
Q

What is variance?

A

The average of the squared differences from the mean

58
Q

What is standard deviation?

A

The square root of the variance

59
Q

What is the standard error of the mean, and how is it affected by sample size?

A
  • SD/square root of sample size (n)
  • SEM gets smaller as sample size increases
60
Q

How do you calculate the lower limit of a confidence interval?

A

Mean - (1.96 x SEM)

61
Q

How do you calculate upper limits of a confidence interval?

A

Mean + (1.96 x SEM)

62
Q

What is parametric data?

A

Data that is normally distributed in a bell shape curve

63
Q

What is paired data, and give an example?

A
  • Comes from a single group of participants
  • E.g. measurement before and after intervention
64
Q

What is non-paired data, and give an example?

A
  • Comes from 2 groups of participants
  • E.g. 2 groups receiving different interventions
65
Q

What statistical test can be used for paired and unpaired parametric data?

A

Student’s T-test

66
Q

What statistical test can be used to show correlation in parametric data?

A

Pearson’s product moment coefficient

67
Q

What statistical test can be used for paired non-parametric data?

A

Wilcoxon signed rank test

68
Q

What statistical test can be used for unpaired non-parametric data?

A

Mann-Whitney U test

69
Q

What statistical test can be used to compare proportions or percentages in non-parametric data?

A

Chi-squared test

70
Q

What statistical test can be used to show correlation in non-parametric data?

A

Spearman’s/Kendall’s rank test

71
Q

What is Phase 1 of a clinical trial?

A
  • Determines pharmacokinetics and pharmacodynamics, and side effects
  • Tested on healthy volunteers
72
Q

What is Phase 2 of a clinical trial?

A
  • Assesses dosage (2a) and efficacy (2b)
  • Tested on small number of affected individuals
73
Q

What’s Phase 3 of a clinical trial?

A
  • Assesses effectiveness
  • Tested on 100-1000s for affected individuals
74
Q

What is Phase 4 of a clinical trial?

A
  • Postmarket surveillance
  • Monitors for long term side effects
75
Q

Describe the characteristics of averages in positively skewed data?

A
  • Mean > median > mode
  • Same order for negatively skewed but with < sign
76
Q

What is a box and whisker plot?

A

A graphical representation of the sample minimum value, lower quartile, median, upper quartile, and sample maximum value for a set of data

77
Q

What is a funnel plot used for?

A

To demonstrate the existence of publication bias in meta-analyses

78
Q

What funnel plot shape suggests no publication bias?

A

An inverted symmetrical funnel

79
Q

An asymmetrical funnel plot suggests what?

A
  • A relationship between study size and treatment effect
  • Either publication bias, or a systematic difference between smaller and larger studies
80
Q

What is a histogram?

A

A graphical display of continuous data that has been put into categories

81
Q

Where are forest plots usually found?

A

In meta-analyses

82
Q

What do forest plots show?

A

The strength of evidence of the constituent trials in a meta-analysis

83
Q

What do the squares in a forest plot show?

A
  • Squares are centred on the result of the trial
  • The size of each square represents the study’s weight in the meta-analysis
84
Q

What does the line running through the middle of the squares on a forest plot show?

A
  • The confidence intervals
  • Usually 95%
85
Q

What does the diamond at the bottom of a forest plot show?

A
  • Diamond is centred on the mean of all the studies
  • The lateral edges show the confidence intervals
  • If the confidence intervals cross the line of no effect, the meta-analysis shows the interventions effect isn’t significant
86
Q

What is a scatter plot?

A
  • A graphical representation using cartesian coordinates to display values for 2 variables for a set of data
87
Q

What is a Kaplan-Meier survival plot?

A

Plot of the Kaplan-Meier estimate of survival function
- Shows decreased survival over time

88
Q

What 2 graphs are you likely to find in a meta-analysis?

A
  • Forest plot
  • Funnel plot
89
Q

Was is selection bias?

A
  • An error in assigning individuals to groups leading to differences between the groups that may affect the outcome of the study
  • Subjects are not representative of the population
90
Q

What are the 3 types of selection bias?

A
  • Sampling bias
  • Volunteer bias
  • Non-responder bias
91
Q

When is selection bias a particular problem?

A

In cohort studies

92
Q

What is recall bias?

A
  • Difference in the accuracy of recollection of study participants
  • May be due to whether they have an outcome or not
93
Q

What is an example of recall bias?

A

People with mesothelioma try harder to remember asbestos exposure than those without

94
Q

When is recall bias a particular problem?

A

In case-control studies

95
Q

What is publication bias?

A

Failure to publish/include results from valid studies due to them having negative or uninteresting results

96
Q

When is publication bias seen?

A

In systematic reviews and meta-analyses

97
Q

What is work-up bias?

A

Refers to a gold-standard test being performed more frequently in patients who already have a positive result from a new test

98
Q

When is work-up bias seen?

A

In studies trying to validate a new diagnostic test

99
Q

What is expectation bias?

A

When observers subconsciously measure or report data in a way that favours the expected outcome of a study

100
Q

When is expectation bias seen?

A

Non-blinded trials

101
Q

What is the Hawthorne effect?

A

When a study group changes their behaviour due to knowledge they are being studied

102
Q

What is length time bias?

A

When screening over represents less aggressive disease

103
Q

What is an example of length time bias?

A

Less aggressive tumours are often picked up more successfully by screening, but already have better outcomes, so it looks like screening has improved survival when in fact survival was better anyway

104
Q

What is lead-time bias?

A

When early diagnosis appears to prolong survival