Study Design and Statistics for Stratified Medicine Flashcards

1
Q

What is statistics?

A

Methods to:

  • understand variability between different units under study(e.g. people, genes, drugs)
  • formally weigh up quantitative evidence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What do descriptive statistics do?

A

Summarise a set of observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do Inferential statistics methodsmake inferences about?

A

Some characteristic of apopulationbased on information contained in asample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q
  1. A simple linear regression..

2. What can a hypothesis test be used for?

A
  1. Assumes that a linear relationship between X and Y exists.

Y is the dependent variable (or response)

X is the independent variable (or predictor)

Y ~ β0 + β1X

For every one unit change in Xwe expect Y to change by β1 units.

  1. β1 only estimates the true relationship between X and Y.

A hypothesis test can be used to decide whether the true value of β1 is significantly different from zero. If so, X and Y are statistically associated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does a multiple regression model?

Why is this harder to represent visually?

A

The joint effect of multiple different independent variables (Xs) on a dependent variable (Y)

A one unit increase in X1 (calorie intake) leads to an expected increase of β1 units of Y (BMI)

A one unit increase in X2 (physical activity) leads to an expected increase of β2 units of Y – β2 could be negative…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What can hypothesis tests determine in a multiple regression?

A

Determine whether each β is significantly different from zero, i.e. whether each of the Xs is associated with Y

Accounting for the other variables in the model!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does a descriptive study describe?

A

A cohort without making comparisons of exposure and outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does an analytical study involve?

A

Comparisons made. Difference depends upon when exposure or outcome is made.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

If an investigator does not assign exposures in a study, what kind of study is this? What are different types of studies relating to this and how can they differ?

A

An observational study. This involves analytical study and descriptive study.

If this involves a comparison group it is an analytical study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the 3 different forms of analytical study?

A

Cohort study

Case control

Cross study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the two types of experimental study

How do they differ?

A

Randomised controlled trial

Non-randomised controlled trial

Randomised controlled trial involves random allocation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q
  1. What does a randomised controlled trial involve?
  2. What are advantages of this study design?
  3. What is a limitation
A

Experimental: investigator randomly assigns exposures(e.g. drug vs no drug)

  1. Randomisation helps to eliminate bias

Blinding also minimises information bias

Gold standard in clinical research; high internal validity

  1. Expensive, not always appropriate; external validity?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q
  1. What does a cross sectional study ascertain?
  2. What is an advantage?
  3. What is a limitation?
A

Exposures and outcomes at the same time

e. g. capture rheumatoid arthritis status and weight
2. Can associate presence or absence of disease (RA) with presence or absence of exposure (weight)
3. Can’t infer causal direction (RA → weight or weight → RA)

Important example: population-based biobank data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
  1. What does a Cohort study involve?
  2. What are two advantages of this
  3. What is a limitation
A
  1. Proceeds “forwards in time”
    Select groups based on exposure and monitor for outcomes
  2. Good for estimating risk of incident disease (new diagnosis)

Direction of causality can be inferred from design(e.g. weight → RA)

  1. Can be expensive and not effective for rare diseases
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
  1. What does a Case-control study involve?
  2. What is an advantage?
  3. What are the limitations?
A
  1. Works “backwards in time” – harder to correctly interpret
    Identify groups based on outcome and look back for exposures
  2. Good for rare diseases
  3. Big problem: control group must be similar to cases in all important respects except for not having the outcome in question

Other differences could be picked up as incorrect associations

Also: recall bias = better recollection of exposures among cases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What research method has the highest risk of bias and confounding variables?

What do these variables affect?

A

Observational research

Internal validity of study

17
Q

What are the three main classes of bias and confounding variables?

A

Selection bias
Information bias
Confounding

18
Q

What does selection bias refer to/ask?

What are three examples?

A

Are the groups similar in all important respects?

Examples:

Membership bias
Jogging → heart disease risk?Joggers also differ in diet, smoking, …

Non-respondent bias
Smokers less likely to return questionnaires

Neyman bias
Hospital-based sample of myocardial infarction cases won’t

19
Q

What does information bias refer to/ask?

What are three examples?

A

Was information gathered in the same way?

Examples:

Diagnostic suspicion bias
More intensive search for disease among people exposed to “suspected” cause

Family history bias
Controls less likely to know about family history of a disease

Recall bias
Abortion → cancer?Abortion underreported in contr

20
Q

Investigating confounding variables focuses upon what?

What is a example?

A

Is an extraneous factor blurring the effect?

The effect being observed is not due to the exposure, but a third factor with which it is associated

Example:

Salpingitis appears to be associated with using an intrauterine device (IUD)

But actually salpingitis is associated with having >1 sexual partner

Women with >1 sexual partner more likely to have an IUD

21
Q

How can confounding variables be dealt with prior to study conduction?

A

Matching:

For each case, find a control that matches on potential confounders
e.g. find control with same smoker status

22
Q

How can confounding variables be dealt with after study conduction?

A

Stratification-
Split data into strata and analyse separately
e.g. analyse smokers and non-smokers separately

Multivariate techniques -
Include potential confounders in a joint model with the exposure

e.g. use multiple regression and include a term for smoker status

23
Q

What must a biomarker study design capture?

A

sufficient information to examine the hypothesised effect

Sufficient numbers of:

biomarker + (positive) and biomarker negative participants

outcome + (positive) and outcome (negative) participants

24
Q

Why is there higher potential for biases and confounding in a biomarker study?

A

Focusing on treatment effect study: exposure/outcome and biomarker

E.g. Biomarker study: exposure/biomarker/outcome

25
Q

What is needed for a biomarker study?

A

All need replication via repeated demonstration in independent studies

26
Q

What does a prognostic biomarker need to demonstrate?

A

Need to demonstrate that biomarker + group has better (or worse) outcome than biomarker – group regardless of treatment

27
Q

What does a predictive biomarker need to demonstrate?

A

A significant treatment by biomarker interaction

28
Q

What terms are used for a multiple regression to the model a treatment biomarker interaction design for a predictive biomarker?

A

How outcome changes due to treatment in all patients (regardless of biomarker status)

How outcome changes due to presence of biomarker(regardless of treatment)

Additional impact of treatment in biomarker positive patients

29
Q

What is biomarker sensitivity/ true positive rate?

A

is the ability of a test to correctly identify those with the specified characteristic (e.g. disease, drug response)

“How well does biomarker do at picking out the positive cases?”

30
Q

What is biomarker specificity/ true negative rate?

A

is the ability of a test to correctly identify those without the specified characteristic
“How well does biomarker do at only picking positive cases?”

31
Q

Sensitivity and specificity are intrinsic characteristics of the test, but do not take into account what?

A

The proportion of actual positives and negatives in the population being tested

32
Q

What is the Positive Predictive Value (PPV)?

A

Positive Predictive Value (PPV)

The fraction of the group that test positive that are actually positive

33
Q

What is the Negative Predictive Value (NPV)?

A

The fraction of the group that test negative that are actually negative

34
Q

What do Receiver Operating Characteristic (ROC) curves provide?

What does the area under the curve (AUC) provide and is useful for?

A

a graphical representation of the trade-off by displaying the range of possible cut points of a diagnostic test with their associated sensitivity (or TPR) against (1 – specificity) (or FPR).

An indication of the utility of the predictor(0.5 = random chance, 1 = perfect separation)

Is useful in assigning the best cutoffs for clinical use

Provide a means of comparing two or more predictive tests