Statistics Flashcards

Question

What will influence which significance test you use?

Answer 1

Whether the data is parametric (something which can be measured, usually normally distributed) or non-parametric

Answer 2

- Student's t test (paired or unpaired) - Pearson's product-moment correlation coefficient

Answer 3

Data obtained from a single group of patients e.g. measurement before and after an intervention - Parametric and must be normally distributed

Answer 4

Comes from two different groups of patients e.g. comparing response to different interventions in two groups

Answer 5

- Mann-Whitney U test - Wilcoxon signed-rank test - Chi-squared test - Spearman's and Kendall's rank correlation coefficients

Answer 6

- Non-parametric data which is unpaired

Answer 7

- Non-parametric - To compare two sets of observations on a single sample

Answer 8

- Non-parametric - Used to compare proportions or percentages

Answer 9

A measure that indicates how many patients would require an intervention to reduce the expected number of outcomes by one - Calculated by 1/absolute risk reduction

Answer 10

The difference between the control event rate (CER) and the experimental event rate (EER)
EER = (number who had a particular outcome with the intervention) / (total number who had the intervention)
CER = (number who had a particular outcome with the control) / (total number who had the control)
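The two event rates, their difference, and the 1/absolute risk reduction rule for the number needed to treat can be sketched in a few lines. The trial counts and function names below are invented for illustration only.

```python
def event_rate(events, total):
    """Proportion of a group that had the outcome."""
    return events / total


def absolute_risk_reduction(cer, eer):
    """Control event rate minus experimental event rate."""
    return cer - eer


def number_needed_to_treat(arr):
    """1 / ARR, with ARR expressed as a proportion (not a percentage)."""
    return 1 / arr


# Hypothetical trial: 20 of 100 controls and 10 of 100 treated patients had the outcome.
cer = event_rate(20, 100)
eer = event_rate(10, 100)
arr = absolute_risk_reduction(cer, eer)
nnt = number_needed_to_treat(arr)
print(f"CER={cer:.2f} EER={eer:.2f} ARR={arr:.2f} NNT={nnt:.0f}")
```

In practice the NNT is usually rounded up to the next whole patient.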

Answer 11

Proportion of patients with the condition who have a positive result, i.e. we need to know how often a test will be positive if a patient has the disease
True positives / (true positives + false negatives)

Answer 12

Proportion of patients without the condition who have a negative result, i.e. we need to know how often a test will be negative if the patient is healthy
True negatives / (true negatives + false positives)

Answer 13

The chance that the patient has the condition if the diagnostic test is positive
True positives / (true positives + false positives)

Answer 14

The chance that the patient does not have the condition if the diagnostic test is negative
True negatives / (true negatives + false negatives)
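The four ratios in the preceding cards (sensitivity, specificity, and the positive and negative predictive values) all come from the same 2x2 table of test result against true disease status. A minimal sketch, with made-up counts and a hypothetical function name:

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Return the four standard ratios from a 2x2 diagnostic table."""
    return {
        "sensitivity": tp / (tp + fn),  # positive results among the diseased
        "specificity": tn / (tn + fp),  # negative results among the healthy
        "ppv": tp / (tp + fp),          # diseased among positive results
        "npv": tn / (tn + fn),          # healthy among negative results
    }


# Hypothetical study: 100 patients with the condition, 200 without.
metrics = diagnostic_metrics(tp=90, fp=30, fn=10, tn=170)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Sensitivity and specificity are computed within the diseased and healthy columns respectively, so they do not depend on how common the condition is in the sample, whereas the predictive values do.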

Answer 15

How much the odds of the disease increase when a test is positive
Sensitivity / (1 - specificity)

Answer 16

How much the odds of a disease decrease when a test is negative
(1 - sensitivity) / specificity
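Both likelihood ratios follow directly from sensitivity and specificity. The sketch below uses illustrative values (sensitivity 0.90, specificity 0.85) and hypothetical function names.

```python
def positive_likelihood_ratio(sensitivity, specificity):
    """Sensitivity / (1 - specificity)."""
    return sensitivity / (1 - specificity)


def negative_likelihood_ratio(sensitivity, specificity):
    """(1 - sensitivity) / specificity."""
    return (1 - sensitivity) / specificity


# Illustrative test characteristics, not taken from any real test.
lr_pos = positive_likelihood_ratio(0.90, 0.85)
lr_neg = negative_likelihood_ratio(0.90, 0.85)
print(f"LR+ = {lr_pos:.2f}, LR- = {lr_neg:.2f}")
```

A positive likelihood ratio well above 1 means a positive result raises the odds of disease substantially. A negative likelihood ratio close to 0 means a negative result lowers them substantially.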

Answer 17

When the spread of the data is fairly similar on each side of the mid point e.g. when the data are “normally distributed”.

Answer 18

Skewed by extremes of data, so it does not give a typical picture in this instance. The median is often better in these circumstances.

Answer 19

When the data are not symmetrical, i.e. the distribution is skewed

Answer 20

The mode is the most common value or event, and so is used when a label is needed for the most frequently occurring event
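The contrast between the mean, median and mode on skewed data can be seen with Python's standard `statistics` module. The data set below is invented, with one extreme value added to create the skew.

```python
import statistics

# Hypothetical measurements; 40 is an extreme value that skews the data.
values = [2, 3, 3, 4, 5, 40]

mean = statistics.mean(values)      # pulled upwards by the extreme value
median = statistics.median(values)  # middle of the ordered data
mode = statistics.mode(values)      # most frequently occurring value

print(f"mean={mean} median={median} mode={mode}")
```

Here the mean lies above five of the six observations, while the median stays close to the bulk of the data.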

Answer 21

- Used for data which is normally distributed - Provides information on how much data varies around their mean - How much a set of values is spread around the average

Answer 22

When, instead of simply wanting the mean value of a sample, you want a range that is likely to contain the true population value. - A range (interval) in which we can be fairly sure (confident) that the “true value” lies.

Answer 23

The CI is narrower because of the larger sample size
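A quick way to see why a larger sample narrows the interval is the usual normal-approximation formula for a 95% confidence interval of a mean, mean ± 1.96 × SD / √n. That formula is an assumption added here rather than something stated on the cards, and the numbers are invented.

```python
import math


def ci_95_for_mean(mean, sd, n):
    """Approximate 95% CI for a mean using the normal approximation."""
    half_width = 1.96 * sd / math.sqrt(n)
    return mean - half_width, mean + half_width


# Same hypothetical mean and SD, two different sample sizes.
small = ci_95_for_mean(mean=120, sd=10, n=25)
large = ci_95_for_mean(mean=120, sd=10, n=100)

print(f"n=25:  {small[0]:.2f} to {small[1]:.2f}")
print(f"n=100: {large[0]:.2f} to {large[1]:.2f}")
```

Quadrupling the sample size halves the width of the interval, because the standard error shrinks with the square root of n.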

Answer 24

The probability of the difference having happened by chance is 0.5 in 1, or 50:50.

Answer 25

The probability of the difference having happened by chance is 0.05 in 1, i.e. 1 in 20.

Answer 26

To compare samples of normally distributed data. If the data is not normally distributed, do not use parametric tests.

Answer 27

- Based on RCTs - At least one meta-analysis, systematic review or RCT rated as 1++ and directly applicable to the target population

Answer 28

- Based on other robust experimental or observational studies - Body of evidence including studies rated as 2++ directly applicable to the target population

Answer 29

- Evidence is limited but relies on expert opinion and endorsement of respected authorities

Answer 30

Evidence level 3 or 4 (case, correlation etc)

Answer 31

- Describing a population / giving a picture of what is happening - e.g. case reports, case series, qualitative reports, surveys

Answer 32

- Quantifies the relationship between two factors - i.e. effect of an intervention (I) or exposure (E) on an outcome (O) in a population (P) - To quantify this effect, we need a comparison group (C)
Was the intervention randomly allocated? - yes = RCT - no = observational study (i.e. cohort, cross-sectional, case-control)

Answer 33

- Cohort (prospective: exposure -> outcome) - Case-control (retrospective: exposure <- outcome) - Cross-sectional (exposure and outcome simultaneously)

Answer 34

- Ethically safe - Subjects can be matched - Can establish timings and directionality of events - Administratively easier and cheaper than RCT

Answer 35

- Controls can be difficult to identify - Exposure may be linked to a hidden confounder - Blinding is difficult - Randomisation not present - For rare diseases, large sample sizes or long follow-up is necessary

Answer 36

- Quick and cheap - Only feasible method for very rare disorders or those with a long lag between exposure and outcome - Fewer subjects needed

Answer 37

- Reliance on recall or records to determine exposure status - Confounders - Selection of control groups is difficult - Potential bias: recall and selection

Answer 38

- Cheap and simple - Ethically safe

Answer 39

- Establishes association at most, not causality - Recall bias susceptibility - Confounders may be unequally distributed - Group sizes may be unequal

Answer 40

0 - human microdosing studies
1 - healthy people
2 - people with relevant illness in lab setting
3 - people with illness in clinical setting
Market authorisation
4 - post-marketing surveillance studies

Answer 41

- Mean, median, mode, standard deviation - Confidence intervals - P-values (parametric, non-parametric, chi-squared)

Answer 42

- RR - OR - RD - NNT

Answer 43

- Sensitivity, specificity, predictive values, likelihood ratios

Answer 44

- A measure of variability (SD is square root of variance) - Calculated by taking the average of squared deviations from the mean - Tells you the degree of spread in the data set - The more spread the data, the larger the variance is in relation to the mean
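The "average of squared deviations from the mean" can be computed by hand and checked against the standard library. The data set is invented. Note that `statistics.pvariance` divides by n, as described above, whereas `statistics.variance` divides by n - 1 to give the sample variance.

```python
import math
import statistics

values = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data

mean = sum(values) / len(values)
squared_deviations = [(x - mean) ** 2 for x in values]
variance = sum(squared_deviations) / len(values)  # average squared deviation
sd = math.sqrt(variance)                          # SD is the square root

print(f"mean={mean} variance={variance} sd={sd}")
print("matches statistics.pvariance:", variance == statistics.pvariance(values))
```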

Answer 45

RR = risk in treated or exposed group / risk in unexposed or control group

Answer 46

OR = odds of having been exposed to a risk factor in case group / odds of having been exposed to a risk factor in control group
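Relative risk and odds ratio can both be read off one 2x2 table of exposure against outcome. The sketch below uses invented counts and hypothetical function names. The cell labels a, b, c, d are a common convention rather than anything defined on the cards.

```python
def relative_risk(a, b, c, d):
    """Risk in the exposed group / risk in the unexposed group.

    a = exposed with outcome,   b = exposed without outcome
    c = unexposed with outcome, d = unexposed without outcome
    """
    return (a / (a + b)) / (c / (c + d))


def odds_ratio(a, b, c, d):
    """Odds of exposure among cases / odds of exposure among controls."""
    return (a / c) / (b / d)


# Hypothetical counts: 20/100 exposed and 10/100 unexposed had the outcome.
rr = relative_risk(a=20, b=80, c=10, d=90)
odds = odds_ratio(a=20, b=80, c=10, d=90)
print(f"RR = {rr:.2f}, OR = {odds:.2f}")
```

The two measures differ here because the outcome is fairly common. They move closer together as the outcome becomes rarer.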

Answer 47

- Difference between the event rate in the intervention group and control group = improvement rate in the intervention group - improvement rate in control group

Answer 48

- The proportion by which the intervention REDUCES the event rate
RRR = ARR / control (placebo) no improvement event rate

Answer 49

The number of patients who need to be treated for one to get benefit = 100/ARR when ARR is expressed as a percentage (equivalent to 1/ARR when ARR is a proportion)
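Using the improvement-rate framing of the last few cards, a short worked example shows how ARR, RRR and NNT fit together when everything is kept in percentages. The rates are invented for illustration.

```python
# Hypothetical improvement rates, in percent.
improved_intervention = 60
improved_control = 50

arr = improved_intervention - improved_control  # absolute risk reduction, in percentage points
control_no_improvement = 100 - improved_control  # control event rate, in percent
rrr = arr / control_no_improvement               # relative risk reduction
nnt = 100 / arr                                  # same as 1 / (arr / 100)

print(f"ARR = {arr}%  RRR = {rrr:.0%}  NNT = {nnt:.0f}")
```

The same ARR gives a very different RRR depending on the control event rate, which is why the two are usually reported together.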

Answer 50

- Tests a hypothesis on the basis of a difference between sample means - i.e. the t test determines a probability that two populations are the same, with respect to the variable tested - An example null hypothesis might be there is no difference in the mean BMI of patients undergoing vaginal and c-section deliveries
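For the BMI example, the unpaired Student's t statistic can be sketched with the standard library alone. The BMI values are made up, the function name is hypothetical, and the pooled-variance form assumes the two groups have similar variances. Turning the statistic into a p-value needs the t distribution with the stated degrees of freedom, which is not computed here.

```python
import math
import statistics


def unpaired_t_statistic(group_a, group_b):
    """Student's unpaired t statistic (pooled variance) and its degrees of freedom."""
    n_a, n_b = len(group_a), len(group_b)
    var_a = statistics.variance(group_a)  # sample variance (n - 1 denominator)
    var_b = statistics.variance(group_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / df
    standard_error = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    t = (statistics.mean(group_a) - statistics.mean(group_b)) / standard_error
    return t, df


# Hypothetical BMI values for two delivery groups.
vaginal = [22, 24, 26, 28, 30]
caesarean = [25, 27, 29, 31, 33]

t, df = unpaired_t_statistic(vaginal, caesarean)
print(f"t = {t:.2f} with {df} degrees of freedom")
```

The further t is from zero, the less compatible the data are with the null hypothesis of no difference in means.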

Answer 51

- Chi-squared test of proportions - Non-parametric test used to compare numerical or categorical data sets - E.g. investigating the proportion of women taking pre conception folic acid in different socioeconomic groups
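The chi-squared statistic compares observed counts with the counts expected if the proportions were the same in every group. A minimal sketch for the folic acid example follows, with invented counts, a hypothetical function name and no continuity correction.

```python
def chi_squared_statistic(table):
    """Pearson chi-squared statistic for a table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            statistic += (observed - expected) ** 2 / expected
    return statistic


# Hypothetical counts: rows = two socioeconomic groups,
# columns = took folic acid / did not take folic acid.
observed = [[30, 70],
            [50, 50]]

chi2 = chi_squared_statistic(observed)
degrees_of_freedom = (len(observed) - 1) * (len(observed[0]) - 1)
print(f"chi-squared = {chi2:.2f}, df = {degrees_of_freedom}")
```

With 1 degree of freedom, a statistic above about 3.84 corresponds to P < 0.05.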

Answer 52

- ANOVA tests the hypothesis that there is no difference between two or more population means (usually at least 3) - Can test for differences without increasing the Type I error rate (which can happen if comparing multiple means by conducting multiple t-tests)
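The Type I error inflation from running multiple t-tests can be illustrated with simple arithmetic. The sketch assumes every pair of groups is compared at a significance level of 0.05 and treats the comparisons as independent, which is a simplification. The function name is hypothetical.

```python
import math


def familywise_error_rate(groups, alpha=0.05):
    """Chance of at least one false positive across all pairwise comparisons."""
    comparisons = math.comb(groups, 2)  # number of distinct pairs of groups
    return 1 - (1 - alpha) ** comparisons


for groups in (2, 3, 5):
    rate = familywise_error_rate(groups)
    print(f"{groups} groups: {math.comb(groups, 2)} comparisons, error rate = {rate:.3f}")
```

With three groups the chance of at least one false positive is already about 14%, which is why a single ANOVA is preferred to repeated t-tests.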

(79 cards)