Medical Statistics Flashcards

1
Q

P-value

A

probability of rejecting H0 with the value of the test statistic obtained from the data given H0 is true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Confidence interval

A

measures the degree of uncertainty or certainty in sampling method, and is the range of plausible values for an unknown parameter (finding test statistic)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Odds ratio

A

describes the strength of the association between two events, ad/bc

the ratio of the odds of A in the presence of B and the ratio of the odds of A in the absence of B. p1/1-p1 / p2/1-p2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Confounding

A

a variable that influences the independent and dependent variable (we want to eliminate this in a test) (true effect on X and Y is hidden by confounding variable c)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Interaction

A

relationship between three or more variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Z distribution and t distribution

A

z is when sample is large and variance is known, and t is when sample is small and variance is unknown

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Tables, figures and listings

A

how data is represented and analysed for clinical trials with SAS (type of software they use in pharmaceutical companies, especially when considering clinical trials)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Efficacy

A

the effect in ‘perfect conditions’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Effectiveness

A

the effect in real world conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Crossover trial

A

when subjects receive a sequence of different treatments, but the order they receive them might be randomised

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

ANCOVA

A

analysis of covariance, comparing one variable in two or more populations while considering other variables (ANOVA doesn’t consider other variables)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

End points

A

outcomes measures referring to occurrence of disease, symptom, sign, or laboratory abnormality constituting a target outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Equivalence trial

A

statistical test which aims at showing that two treatments are not different in characteristics (so not too different)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Non-inferiority trial

A

demonstrates that the test product in not worse than the competitor by more than a pre-specified amount (so is significantly better than the other)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Block randomisation

A

randomising patients in blocks such that an equal number are assigned to each treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Stratification

A

partitioning of subjects and results in a way other than the treatment given

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Drop-out

A

when patients leave the trial prematurely

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Power

A

probability of avoiding a type 2 error (when the type 2 error is to accept a false hypothesis), (1 - p(type 11 error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Classification

A

by purpose or by phase in drug development

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Phase I

A
focus upon the Pharmacokinetics/Pharmacodynamics
(absorption, distribution, metabolism
and excretion of a drug or vaccine)
and toxicity (drug safety); Maximum
Tolerated Dose (MTD)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Phase II

A

initial clinical investigation into doses and dose schedules, (dosefinding,) and early indications of efficacy
(e.g. dose-finding study)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Phase III

A

aimed at full scale evaluation, efficacy, of a new, experimental, treatment compared to a standard
therapy or placebo, acting as a control

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Phase IV

A

effectiveness, post marketing
surveillance (information re: uncommon
side effects, long-term effects)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Effect

A

difference between what happened
to the patient as a result of treatment and
what would have happened if treatment had
been denied’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Efficiency
the economics of treatment
26
International Guidelines
Guidelines for Good Clinical Practice, particularly for | assessing safety and efficacy of medicinal products
27
Trialists objectives
minimise bias and maximise efficiency
28
Key design issues include:
replication, control, randomisation, blocking, treatment blinding/ masking, ethics, choice of analysis set,
29
aim
to yield treatment groups which are indeed | comparable in terms of extraneous factors
30
Block Randomisation
balance numbers of participants in each | group
31
Stratified Randomisation
use block randomisation in each stratum
32
Adaptive randomisation: Minimisation
use simple randomisation when groups are balanced: when imbalanced allocate next patient to treatment so that imbalance is minimised (via minimisation score)
33
Treatment Blinding: single blind
patient does not know which | treatment is being received
34
Treatment Blinding: double-blind
neither patient nor physician knows the treatment allocation (GoldStandard)
35
Treatment Blinding: triple-blind
as double-blind also the monitoring group and data analyst do not know which group receives experimental and which control treatment
36
Treatment Blinding: open
all are knowledgeable about the treatment allocation
37
placebo
dummy copy of treatment
38
double-dummy
method for comparison of 2 active treatments with different appearance (ie two treatments which is in two different forms, drug A and drug B, with placebo A and placebo B)
39
Central Limit Theorem
when independent random variables are added, their properly normalized sum tends toward a normal distribution even if the original variables themselves are not normally distributed
40
risk difference
RD = a/(a + c) - b/(b + d) | I think it's the additional risk of taking the drug over not taking it
41
relative risk
RR = (a/(a + c))/(b/(b + d)) | the ratio of the probability of an outcome in an exposed group to the probability of an outcome in an unexposed group
42
parallel group design
different groups of patients are studied concurrently (in parallel). Patients receive a single therapy (or combination of therapies) −→ estimate of treatment effect is based upon so-called ’between-subject’ comparisons. We used the two independent samples t-test for inference
43
paired design
patient receives both treatment for example, matching parts of anatomy (e.g. limbs, eyes, kin etc) −→ estimate of treatment effect is based upon ’within-subject’ comparison. We used a paired t-test (one sample t-test on the within subject differences). We noted asymmetry can be problematic
44
wash-out period
A wash-out period is a period in a trial during which the effect of a treatment given previously is believed to disappear. If no treatment is given during the wash-out period then the wash-out is passive. If a treatment is given during the wash-out period than the wash-out is active
45
Epidemiology
study of the distribution | and determinants of disease in human populations.
46
target population
population about we wish to draw inferences for
47
study population
population from which data are collected
48
generalisability
can we use the study population results to draw accurate conclusions about the target?
49
incidence
number of new cases of the disease within a specified period of time
50
prevalence
number of existing cases of the disease at a particular point in time, P = (number of people with the disease at a specified time t)/(number in the population at risk at the specified time t)
51
incidence rate
I = (number of people who develop disease in a specified time period)/(sum of the length of time during which each person in the population is at risk)
52
cumulative incidence risk
CI = (Number of people who get a disease during a specified period)/(Number of people free of the disease at the beginning of the period)
53
crude mortality rate
number of deaths in a specified period of time, divided by the average population at risk during that period multiplied by the length (years) of the study period (person-years)
54
sensitivity
the proportion of truly diseased persons in the tested population who are identified as diseased by the screening test (probability of diagnosing a true case as diseased) P(Truly diseased given Diseased)
55
specificity
the proportion of truly non-diseased persons who are so identified by the screening test (probability of diagnosing a truly non-diseased person as non-diseased): P(Truly not diseased given non diseased
56
positive predictive value
is the proportion of persons who are in fact diseased among those who test positive
57
negative predictive value
the proportion of persons who are in fact non-diseased among those who test negative
58
receiver operating characteristic
sensitivity, specificity and their sum can be plotted to evaluate the various cut-points but typically a plot of sensitivity against (1 - specificity) is produced for the various cutoff values
59
likelihood ratio
sensitivity/(1 - specificty)
60
observational studies
in which the investigators role is passive in that exposures are not manipulated
61
intervention studies
in which the investigators role is active: groups are exposed to interventions of interest such as treatments. These are experimental studies: clinical trials
62
cross-sectional studies
surveys also feature but since they provide information at a snap-shot in time (both exposure and disease) they are less useful and can only be used to measure disease prevalence
63
Cohort Studies
A cohort study tracks two or more groups | forward from exposure to outcome
64
Case-Control Studies
case and control groups defined and selected according to their disease status: diseased / non-diseased (outcome to exposure)
65
matching
selecting controls to be similar to cases in terms of confounders (e.g. age, gender, smoking habits)
66
stratification
examine exposure-disease associations within strata (e.g age groups, smoking groups) and estimate pooled estimate of association measure adjusting for confounding effect
67
standardisation
controlling confounding using an external population to adjust for age, gender etc yielding standardised rates
68
multivariate analysis/regression models
include confounding variable in the model (model | adjustment
69
Standardisation
to compare the incidence of disease or mortality between two or more populations
70
direct standardisation
the disease rates in the population of interest are applied to the standard population counts
71
indirect standardisation
the disease rates in the standard population are applied to the population of interest
72
direct standardised event rate
the expected event rate in the ’standard’ population if the age-specific event rates in the study population prevailed
73
survival analysis
analysis of data in the form of times from some well-defined time origin to occurrence of some event or endpoint”
74
3 types of Survival Analysis
Positive (such as discharge from hospital or time to conception) Adverse (such as death or recurrence of disease) Neutral (such as cessation of breast feeding)
75
Right Censoring
the event time exceeds the last follow-up time
76
Left Censoring
the event time precedes the last follow-up time but is unknown
77
survivor function
the survival time of individual i
78
hazard function
a realisation of a non-negative random variable T
79
empirical survivor function
S = (number of individuals with survival times > t) / (total number of participants)
80
Type 1 error
h0 is rejected but h0 true
81
Type 2 error
h0 is accepted but h0 false
82
Mantel-Haenszel Method
technique that generates an estimate of an association between an exposure and an outcome after adjusting for or taking into account confounding
83
Hazard function
f(x)/ 1 – F(x)
84
Kaplan-Meier function
The Kaplan Meier Curve is an estimator used to estimate the survival function. The Kaplan Meier Curve is the visual representation of this function that shows the probability of an event at a respective time interval
85
Survival analysis
analyzing the expected duration of time until one or more events happen, such as death in biological organisms and failure in mechanical systems.
86
Non parametric models
Non-parametric models assume that the data distribution cannot be defined in terms of such a finite set of parameters. But they can often be defined by assuming an infinite dimensional θ.
87
Logisitic regression
like linear models but for discrete data
88
Case control study
type of observational study in which two existing groups differing in outcome are identified and compared on the basis of some supposed causal attribute. (outcome to exposure)
89
Cohort study
exposure to outcome: Conducting a study by selecting groups, then waiting a period of time before comparing the groups ie after initial exposure we then pick the groups and wait for outcome
90
type of test to use when input is nominal and output is normal
t test
91
type of test to use when input is continuous and output is nominal
logisitic regression
92
test which tests if there is an association between two variables
Chi squared test
93
Period Effect
Period effects may arise where patients may do better in a subsequent period because their state has changed, for example, their mental or health status has changed, independent of treatment.
94
Carry over
If the effect of a treatment carries on after the treatment is withdrawn then the response to a second treatment may well be due in part to the previous treatment
95
Gold standard
In medicine and statistics, a gold standard test is usually the diagnostic test or benchmark that is the best available under reasonable conditions. Other times, a gold standard is the most accurate test possible without restrictions.
96
Receiver operating characteristic
a graphical plot that illustrates the diagnostic ability of a binary classifier system as its discrimination threshold is varied