Statistics Flashcards

1
Q

Sensitivity

A

the proportion of people who test positive among all those who actually have the disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Specificity

A

the proportion of people who test negative among all those who actually do not have that disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

PPV

A

the probability that, following a positive test result, that individual will truly have that specific disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

NPV

A

the probability that, following a negative test result, that individual will not have that specific disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

As prevalence decreases, what happens to a) PPV and b) NPV

A

a) PPV decreases - there will be more false positives for every true positive “needle in a haystack”

b) NPV increases - more true negatives for every false negative
- bc a false negative would mean the person actually has the disease, which is unlikely because the disease is rare (low prevalence)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Correlation

A

statistical measure which determines co-relationship or association of two quantitative variables

used to represent a linear relationship between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Regression

A

describes how an independent variable is numerically related to the dependent variable. used to estimate a line of best fit and estimate one variable on the basis of another

can make predictions about what we expect among individuals who have not had the dependent variable measured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Statistical inference

A

Statistical inference is the process of using data analysis to deduce properties of an underlying probability distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Adjusted R sq

A

estimates the total proportion of overall variation of ___ explained by the fitted regression model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

R sq

A

___ % of the variability in variable Y is explained by variation in X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Coefficient

A

For every one unit change in X, there is a coefficient (beta) unit change in Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

cox proportional hazard model

A

allows for survival times to be modelled in terms of continuous and categorical explanatory variables

any factor alters the hazard factor at the same magnitude at all time points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Hazard ratio

A

the probability of an individual dying at time (t) compared to background hazard rate in people without intervention

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Bland Altman Plot

A

graphical method to compare two measurement techniques

plots the difference between measurements for each data point against average of observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Number needed to treat NNT

A

the number of patients you would need to treat with new treatment to have ONE MORE SUCCESS than with the old treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Parametric

A

assumes variables have been sampled from a normal distribution (as defined by mean and SD)

17
Q

H0 of mann whitney and equivalent

A

H0: both groups observations are sampled from the same underlying distribution

2 sample t

18
Q

H0 of wilcoxon signed ranks and equivalent

A

H0: pops MEDIAN is of a particular value

one sample t

19
Q

H0 of wilcoxon matched pairs test and equivalent

A

HO: median difference in popn is 0 - values of 1st measurement are apron equal to those of the 2nd measurement

paired t test

20
Q

kruskal wallis test

A

ANOVA

h0 all groups come from popns with same distribution

21
Q

Assumption of MW CI

A

that the two groups have the same shape and spread (differ only in medians not variability)

22
Q

I squared

A

measures the percentage of variability in treatment effect estimates that is due to between study heterogeneity rather than chance

23
Q

P value

A

the probability of obtaining the observed data or data more extreme, assuming the null hypothesis is true eg. 5% chance if null is true

due to random sampling error

24
Q

censored observation

A

information about an individuals survival time is incomplete - shows time spent in study before an indivudal left due to unrelated illness/condition, drop out etc

25
Q

Log rank test

A

tests for differences in survival times. goodness of fit

NPM

26
Q

ROC curve

A

plots sensitivity v 1- specificity for different cut off points

helps define cut off point for a test with continuous measurements