stats Flashcards

Question

how do you calculate specificity

Answer 1

no. of people correctly test -ve / total no. of healthy people

Answer 2

no. of people who correctly test +ve / total no. of people who test +ve

Answer 3

determines choice of statistical methods

Answer 4

normal distribution

Answer 5

full set of units (people) to which the study results will be generalised usually infinite in size

Answer 6

variability between people | sample is only a subset of the population - not fully representative

Answer 7

summarising sample data | quantifying uncertainty in results

Answer 8

inferential | descriptive

Answer 9

describe basic features/characteristics in the sample

Answer 10

make inferences about relationships in the population using the sample however can never be 100% certain e.g. standard error, CI, p-values

Answer 11

all the different estimates from different samples and their frequencies

Answer 12

the larger the sample size the narrower the CI

Answer 13

the wider the CI, the greater the uncertainty

Answer 14

the extent to which the sample estimate contradicts the null hypothesis

Answer 15

population/patient intervention comparison outcome

Answer 16

type of study design that would work best

Answer 17

to frame or answer a health related question

Answer 18

if data are matched on criteria e.g. age/gender before comparing on either trial arm if measurements are taken before and after an interventoin

Answer 19

within-pair differences

Answer 20

e.g. t-test, analysis of variance (ANOVA) make distribtuional assumptions eg. Normal summarise data using means and sd

Answer 21

analysis of variance

Answer 22

paired test | repeat measures of ANOVA

Answer 23

if variables are skewed small sample size if sd is different across groups if the variables are more ordinal than quantitative

Answer 24

analyse the rank ordering in the data (not actual scores) only provide p-values (not CIs) compare entire distribution rather than just means

Answer 25

IQR | median

Answer 26

Mann Whitney

Answer 27

Wilcoxon signed-rank

Answer 28

Kruskal Wallis

Answer 29

they are always valid for quantitative data | parametric only valid if assumptions are satisfied

Answer 30

no CIs based only on analysis of ranks no direct inferences about a parameter

Answer 31

sample greater than 50

Answer 32

SD squared

Answer 33

variance in one group should be no more than 4x the variance of the other group

Answer 34

calculate a single CI for the difference between groups

Answer 35

the higher the proportion the higher the odds

Answer 36

no. of participants in category of interest / total no. of participants

Answer 37

the exposure variable is the potential cause of the outcome variable

Answer 38

chi-squared (large samples) | fisher's exact (small samples)

Answer 39

no risk difference | groups equally likely to have the disease

Answer 40

proportion in group A - proportion in group B

Answer 41

proportion in group A / proportion in group B

Answer 42

the strength of association between the intervention and binary variable

Answer 43

no difference in risk between two groups

Answer 44

number needed to treat

Answer 45

1 / risk difference

Answer 46

the number of people that need to receive intervention before 1 person benefits from it

Answer 47

quantifying the impact of an intervention in a given population

Answer 48

measures the effectiveness of an intervention | based on risk difference

Answer 49

the association between two variables

Answer 50

scatter plot outcome = y-axis predictor = x-axis

Answer 51

correlation coefficient pearson's = linear spearmans = non-linear

Answer 52

non-linear correlation e.g. curved line must be 'monotonic' - either never -ve or never +ve e.g. graph cannot be U-shaped

Answer 53

then all the variation in one variable is explained by the other variable

Answer 54

the independent variable | the explanatory variable - potential cause of the outcome variable

Answer 55

line that makes the vertical distance from the data points to the regression line as small as possible

Answer 56

the vertical distance between the observed data point and the regression line (predicted value)

Answer 57

outcome = a + b(predictor) + e

Answer 58

yes | e.g. blood pressure

Answer 59

most distributions of diagnostic test scores will overlap

Answer 60

specificity sensitivity PPV NPV

Answer 61

the severity of the disease

Answer 62

population shave similar disease severity

Answer 63

if symptoms show on non-disease patients specificity is reduced

Answer 64

the likelihood that somebody has the disease based on the test result

Answer 65

if a disease has a greater prevalence (is more common) then the PPV will increase

stats Flashcards

(91 cards)