Statistics/Research Flashcards
Study designs
Qualitative and quantitative studies
Within Quantitative studies - have descriptive and analytic
Descriptive studies are all observational - (cross-sectional, case series)
Analytical studies -
Observational = cross-sectional (analytical), cohort study, case-control study, ecological study
Experimental = RCT, non-randomised (quasi-experiment), natural experiments
epidemiology
The study of the distribution and determinants of disease or health status in a population.
(Look at the health of populations rather than individuals)
Cohort study
Observational analytical study
Can be retrospective or prospective.
Prospective:
Follows exposed and unexposed patients forward over time to determine outcome. i.e. smokers and nonsmokers overtime and rates of lung cancer development.
Issues with prospective cohort studies
1. Loss to follow up
2. Takes a long time to do.
3. Expensive.
4. Confounders
Retrospective:
Looks back for an exposure on a cohort of patinet.s
Statistical analysis based on exposure categories not outcome/disease categories like it is in a case control trial.
Case control study
Observational analytical study
Looks at patient with the outcome of interest and looking back to see if they had the exposure in question.
Always retrospective.
Take one group with a disease and one without and look back to see if exposures of interest were different.
Very useful for rare diseases.
Get data fast.
Cheap.
Weak evidence.
case series
Observational descriptive study
A report on a series of patients with an outcome of interest, no control group involved.
Systematic reviews
A summary of the literature that uses explicit methods to appraise and combine studies.
randomised controlled trial
Analytical experimental study
Groups of patients are randomised into either a experiment or control group
Gold standard as can reliably test causality.
Hierarchy of evidence
Systematic review versus a narrative review
A narrative or traditional literature review is a comprehensive, critical and objective analysis of the current knowledge on a topic. They are an essential part of the research process and help to establish a theoretical framework and focus or context for your research.
Often helpful when the evidence is limited and the research question is broad.
At high risk of bias
What is a superiority trial
Trials designed with the intention of showing that one treatment is superior to another or a placebo. This is how most RCTs are designed.
“A is better than B”
What is a non-inferiority trial
Trials designed with the intention of showing that one treatment is not inferior to another treatment. (A is not worse than B)
What is an equivalence trial
Trials designed with the intention of showing that one treatment is no better or no worse than another. Rare in medical trials as usually you don’t want to prove something is no better, only that it is no worse. (A and B are the same)
Why design an non-inferiority or equivalence trial and what differences in trial design are required.
Just because a null hypothesis is rejected, it does not mean that the tested treatment is equal or worse. When a ‘traditional’ test does not demonstrate a difference between treatments, this is often presented (erroneously) as evidence of similarity. It may be that no difference exists, or it may be that the study was not of sufficient power to detect the difference between groups.
Both equivalence and non-inferiority trials assess whether the effects of the new treatment, compared with the standard treatment, stay within or go beyond a predefined clinically acceptable equivalence margin/or the acceptable amount of difference (called the delta value)
This margin is determined prior to the trial and is usually derived from expert opinion and consensus and literature review.
REasons to do a non-inferiority trial
Cost reasons: non-inferiority trials often require less patients, therefore costs less for trial to run
Convenience (as above)
When a placebo is considered unethical
When a new treatment
Fill in the blanks
What is ITT and it’s importance
Intention to treat involves including all participants randomised into the analysis.
Participants are counted in the group they were originally allocated to, even if they discontinued the treatment.
Important as it gives a ‘real world’ answer to an intervention’s effectiveness and limits bias.
The effect of ITT on superiority studies is to bring the two study arms closer together, hence we can be more confident in any difference found. “we found a difference in spite of the participants who switched groups”
Explain how intention-to-treat analysis may influence the results of your non-inferiority trial and why per-protocol analysis may have advantages for this trial design.
Per-protocol analysis – when participants are compared on the basis of the treatment they actually received.
In non-inferiority trials we wish to minimise factors that would make the two study arms seem artificially similar, hence per protocol is often the more correct way to proceed. The most convincing results are those in which a non-inferiority is found using both ITT and PP analyses.
“even when the participants received the correct treatment, the treatment was non-inferior”
Difference between parametric and nonparametric tests
Tests that are used to calculate a hypothesis.
Parametric tests are used when the data is normally distributed.
Parametric tests are more powerful so you need a smaller sample size to reject the null hypothesis and can have a smaller difference in outcome measure.
Fill in the table
Answers
Simple T Test
T-Test
* Analysis if there is a statically significant difference between the means of two groups
* Variable for which we want to test a difference must be metric (age, body weight, income)
* Variable must be normally distributed
Simple T-Test
* Used when we want to compare the mean of a sample to a known reference mean
* Null hypothesis: The sample mean is equal to the reference value.
Example: The average birth weight of all babies in NZ is 3200g. The average birth weight of babies born after IVF is 3000g. Is this a statistically significant difference Simple T-test would be used.
Independent sample T-Test
- Want to compare the means of two independent groups or samples
- Null hypothesis: The mean values in both groups are the same.
- Assumptions: Normal distribution, variance for the two samples is equal, samples are independent
- If the variance is not equal use Welch’s t-test not the unpaired T-test.
Example: Patients with mild male factor infertility. One group inseminated with standard IVF the other with ICSI. Is there a statistically significant difference in fertilisation rate?
Paired sample T-Test
- Used to compare the means of two dependent groups
- The measured values are available in pairs.
- Null hypothesis – the mean of the difference between the pairs is zero.
- Assumptions: Normal distribution and equal variance.
Example: Obese patients with PCOS and IGT are given GLP-1 receptor antagonist Semaglutide (Ozempic) and are weighed before and 6 months after starting on treatment. A paired samples t-test would be used to determine if there was a statistically significant difference in mean weight.
Anova (analysis of variance)
- Analysis of variance tests whether there are statistically significant differences between three or more samples.
- Is the extension of the t-test to more than two groups and can be used for independent or dependent samples.
- Assumes normal distribution, observations are independent, variance is equal across the groups.
- Independent samples – single factorial without measurement repetition
- Dependent samples – single factorial with measurement repetition
- Allows us to answer – is there a difference in the population between the different groups of the independent variable (predictors) with respect to the dependent variable (criterion).
- Null hypothesis in one-factor ANOVA
- There are no differences in the population between the means of the individual groups.
- Alternative hypothesis is one-factor ANOVA
- At least two group means differ from each other in the population
- Post-hoc tests to compare the individual groups can be done
Pearson correlation
- Analyses the relationship between two variables.
- Provides strength and direction of the linear relationship between two metric variables.
- Assumes normal distribution.
- Note that correlation does not equal causation.
- Allows us to measure the linear relationship between two variables.
- Allows us to determine how strong the correlation is and whether the correlation is positive or negative.
- Both of these are determined with Pearson correlation coefficient r which is between -1 and 1.
- Strength of correlation: (see image)
- Direction of correlation
- Positive correlation (large values of one variable go along with large values of the other or when small values of one variable go along with small values of the other variable)
- Negative correlation (large values of one variable go along with small values of the other and vice versa)
- Null hypothesis in Pearson correlation: The correlation coefficient does not differ significantly from zero (there is no linear relationship).
Parametric regression anaylsis.
What
Why (goals)
Assumptions
A regression analysis makes it possible to infer or predict a variable based on one or more other variables.
I.e. what variables impact of salary (i.e. age, education, years working, gender etc.)
Dependent variable = criterion (salary)
Independent variable = predictors (age/education etc.)
Used to achieve two goals:
1. Measurement of the influence of one or more variables on another variable.
2. Prediction of a variable by one or more other variables.
Assumptions:
1. Constant variance for different values of dependent variable
2. Dependent variable is a linear combination of the independent variables
3. In theory low/no measurement error
4. No high collinearity (where two or more independent/predictor variables are closely related to one another)
Simple linear regression
Multiple linear regression
Simple linear regression
- dependent variable is a continuous variable
-one independent/predictor variable
Multiple linear regression
- dependent variable is a continuous varibale
- multiple independent variables
Simple logistic regression
Multiple logistic regression
Simple logistic regression
* Dependent variable is a categorical variable
* One independent/predictor variable
Multiple logistic regression
* Dependent variable is a categorical variable
* Multiple independent variables
Internal validity
Internal validity - when an observed difference between groups are related to the intervention tested in the trial and not to flaws in study design.
The internal validity of a clinical trial is directly related to appropriate design, conduction, and reporting of the study.
- ensure confounders have been addressed
- Bias minimised
The two main threats to internal validity are bias and random error. Bias hereby refers to a systematic error that leads to a systematic deviation of the results from the truth due to flaws in the design, conduction, or reporting of the trial.
Typical sources of bias are flaws in collection, statistical analysis, or interpretation of study data.
Consequently, the true difference between study groups may be either under- or overestimated.
External validity
the extent to which the results of an RCT can be generalized into clinical practice and the general population.
internal validity of a study is the prerequisite of its external validity since incorrect data due to missing internal validity can, per se, not be applied to the general population. Even if internal validity is assumed, insufficient external validity may reduce the clinical relevance of an RCT.
Types of bias
Flaws in collection, statistical analysis, or interpretation of study data. Causes an under or overestimation of the true difference between study groups.
Selection bias
Recall bias
Funding bias
Reporting bias
Detection bias
Exclusion bias
Attrition bias
Observer bias
Analytical bias
Selection bias
When the study group is not fully representative of the population intending to be studied.
(Higher education, English speaking patient’s more likely to join a clinical trial).
Can also occur when recruiters selectively enrol paitents based on what the next treatment allocation is likely to be.
(overcome using allocation concealment and radnomly vary block size and use large block sizes)