Research design Flashcards

Question

Equivalence trials

Answer 1

* Seek to test whether a new treatment is similar in effectiveness to an existing treatment
* Appropriate if the new treatment has certain benefits, such as fewer side effects, being easier to use, or being cheaper
* The trial is designed to be able to demonstrate that, within given acceptable limits, the two treatments are equally effective
* Equivalence is defined by a pre-set maximum difference between treatments such that, if the observed difference is less than this, the two treatments are regarded as equivalent
* The limits of equivalence need to be set to be appropriate clinically
* The tighter the limits of equivalence are set, the larger the sample size that will be required
* If the condition under investigation is serious, then tighter limits for equivalence are likely to be needed than if the condition is less serious
* The calculated sample size tends to be bigger for equivalence trials than for superiority trials

Answer 2

* Seek to establish that one treatment is better than another
* When the trial is designed, the sample size is set so that there is high statistical power to detect a clinically meaningful difference between the two treatments
* For such a trial, a statistically significant result is interpreted as showing that one treatment is more effective than the other

Answer 3

A trial designed to test superiority is unlikely to be able to draw the firm conclusion that two treatments which are not significantly different can be regarded as equivalent

Answer 4

* Analyse subjects in the groups they were originally allocated to, even if they do not comply or they change treatment
* This provides an unbiased comparison of the treatments, i.e. the balance in patient characteristics achieved by randomisation remains intact
* Per protocol analysis may be useful, but only in addition to ITT and not as the primary analysis
* Keep a record of all subjects to be able to account for their treatment and for any subjects who withdraw

Answer 5

It is common to choose the sample size so that there is the same number of cases as controls. For a given total sample size this gives the greatest statistical power, i.e. the greatest possibility of detecting a true effect. If the number of available cases is limited, then it is possible to increase the power by choosing more controls than cases. However, the gain in power diminishes quickly, so it is rarely worth choosing more than 3 controls per case.
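The diminishing return from extra controls can be illustrated with a rough sketch. It assumes a fixed number of cases, a simple comparison of group means, and equal variance in both groups, so the standard error is proportional to sqrt(1 + 1/k) for k controls per case. The function name `relative_se` is hypothetical and not from the source.

```python
import math

def relative_se(controls_per_case: float) -> float:
    """Standard error relative to a design with unlimited controls.

    Assumption (not from the source): fixed number of cases, equal
    variance in both groups, so SE is proportional to sqrt(1 + 1/k).
    """
    return math.sqrt(1 + 1 / controls_per_case)

if __name__ == "__main__":
    for k in (1, 2, 3, 4, 5, 10):
        # A value of 1.0 would mean no further gain is possible
        print(f"{k} controls per case: relative SE = {relative_se(k):.3f}")
```

Under these assumptions the relative standard error falls from about 1.414 at one control per case to about 1.155 at three, but only to about 1.118 at four, which is in line with the rule of thumb above.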

Answer 6

* The choice of control group affects the comparisons between cases and controls
* Data on exposure to risk factors are usually collected retrospectively and may be incomplete, inaccurate, or biased
* If the process that leads to the identification of cases is related to a possible risk factor, interpretation of results will be difficult (ascertainment bias). For example, suppose the cases are young women with high blood pressure recruited from a contraception clinic. In this situation a possible risk factor, the oral contraceptive (OC) pill, is linked to the recruitment of cases, and so OC use may be more common among cases than population controls for this reason alone

Answer 7

* Time-course relationships need careful interpretation, since changes in biological quantities may precede the disease or be a result of the disease itself. For example, a raised serum troponin level is associated with myocardial infarction, but is only raised after the event. A case–control study may therefore find that high troponin levels are associated with myocardial infarction, but raised troponin cannot in fact be a risk factor
* Risks for exposures cannot be estimated directly because the case and control groups are not representative samples of their respective target populations, and so estimates of risks are biased. This has implications for the statistical analysis and the interpretation of results. Risks are usually estimated using odds and ratios of odds, and these only approximate to risks and ratios of risks when the disease under investigation is rare
* This limitation can be overcome with certain designs, for example where a case–control study is nested in a cohort study in which all cases and controls are identified prospectively and a truly random sample of controls is available (see Cohort studies). In this situation, the relative risk can be calculated directly

Answer 8

Cohort studies can be retrospective, but this requires that full risk factor data are obtained on all individuals with and without the disease of interest, using data that were recorded prospectively

Answer 9

* A large number of subjects is needed to obtain enough individuals who get the disease or condition, particularly if it is uncommon
* The length of follow-up needed to get enough diseased individuals may be substantial, and so the cohort study is not feasible for rare diseases
* There is difficulty in maintaining contact with subjects, particularly if the follow-up is lengthy
* The resources required may be very high

Answer 10

Useful for establishing associations in:
* Rare diseases
* Acute outbreaks in which there is not time for a cohort study
* Diseases that take a very long time to develop

Answer 11

In a cohort study it may be worthwhile to identify all individuals with a disease and then retrospectively select a sample of the non-diseased individuals for comparison. This design may be desirable if:
* The resource implications of collecting data on all non-diseased individuals are too high
* All information was available but unprocessed
* Biological samples were collected but not analysed
This study is known as a nested case–control study and provides an efficient way of investigating particular factors once the outcomes from the cohort have been established.
Bias in risk factor data:
* In a nested case–control study such as this, the risk factor data should not be as biased as they may be in a conventional case–control study, since they were collected prospectively
* There is a potential problem if there is differential loss to follow-up, as this would reduce the availability of true controls and bias the comparisons

Answer 12

Assess outcome at one point in time.
Use for:
* Surveys of prevalence, such as a survey to ascertain the prevalence of asthma
* Surveys of attitudes or views, such as studies of patient satisfaction or patient/professional knowledge; studies of behaviour, such as alcohol use and sexual behaviour
* When inter-relationships between variables are of interest, for example a study to determine the characteristics of heavy drinkers; a cross-sectional study allows comparisons by sex, age, and so on
Cannot assess:
* Temporal trends or causality, e.g. did a disease cause hypertension (HTN) or did the HTN cause the disease
Cross-sectional studies that appear to be longitudinal:
* Cross-sectional studies can be misinterpreted as if they were longitudinal studies. For example, consider a cross-sectional study in a sample of fetuses where the gestational age of the fetuses spans a range, say 22–28 weeks. Some researchers have used data such as these to estimate growth trends. This is dubious because each fetus is measured just once, and so the trend is being estimated from different fetuses. Thus differences between fetuses are likely to contribute to some of the differences observed by gestational age

Answer 13

Allows causality to be more confidently inferred from observational studies:
* Strength of association
* Consistency in different studies, settings, etc.
* Specificity of association of risk factor with a particular disease
* Temporal relationship – exposure precedes disease
* Dose–response relationship
* Biological plausibility for causality
* Coherence – association is consistent with current knowledge
* Experimental evidence for causality
* Existence of analogous evidence between a similar exposure and disease

Answer 14

‘Clinical audit is a quality improvement process that seeks to improve patient care and outcomes through systematic review of care against explicit criteria and the implementation of change. Aspects of the structures, processes and outcomes of care are selected and systematically evaluated against explicit criteria. Where indicated, changes are implemented at an individual, team, or service level and further monitoring is used to confirm improvement in healthcare delivery.’

Answer 15

These are data collected and recorded for another research study, and which are available for use.
Advantages:
* Relatively quick to obtain
* Usually already processed, so that minimal checking and data cleaning is required
* Usually much lower cost than primary data collection
Disadvantages:
* No control over data available
* Limited control over missing data and ability to fill gaps and resolve queries
* Data may not be in the required or desirable format
* May be out of date

Answer 16

* They allow several outcomes to be combined in settings where different outcomes are of similar importance but reflect different clinical events. For example, in a trial of treatment for gestational diabetes, the primary outcome was a composite measure of serious perinatal complications, defined as one or more of: fetal death, shoulder dystocia, bone fracture, and nerve palsy
* The main advantage of using a composite outcome is the gain in statistical power: where individual events are uncommon, a large sample will be required to demonstrate conclusive differences. Using a composite increases the event rate and allows trials to recruit a smaller sample

Answer 17

* It may be hard to determine the minimum clinical difference for the composite. This requires an estimate of the incidence of the composite itself, not just the incidence of the individual components, as well as clinical judgement about what constitutes an important change in rate
* The interpretation of results may be difficult: it is important that the separate component effect sizes are each reported as well as the combined effect size, to allow clinical interpretation
* If the effect sizes (e.g. relative risks) vary among the components, then overall interpretation of the findings is difficult, for example if a new treatment reduces subsequent adverse events but increases death rates

Answer 18

* A surrogate outcome, such as a biomarker or process variable, should be closely related to the clinical outcome of interest

Answer 19

Example: cholesterol (in mmol/L) categorised as normal vs high.
Pros:
* Clinically applicable and aids clinical decision making
* Able to summarise data
Cons:
* Leads to a loss of information, which in turn has statistical consequences, i.e. loss of power

Answer 20

The following information is required:
* The standard deviation (SD) of the measure being estimated
* The desired width of the confidence interval (d)
* The confidence level
The standard deviation is needed because the sample size depends partly on the variability of the measure being estimated. The greater the variability of a measure, the greater the number of subjects needed in the sample to estimate it precisely. The standard deviation can be estimated from previously published studies on the same topic, from contact with another worker in the field, or from a small pilot study. The desired width of the confidence interval, d, indicates the precision of the mean and is decided by the researcher. The confidence level is usually set at 95%, giving a sample confidence interval that contains the true population mean with probability 95%. Other values such as 90% or 99% can be used, but are unusual in practice.
For a 95% confidence level: $n = 1.96^2 \times 4\,\mathrm{SD}^2 / d^2$
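The formula on this card can be sketched in a few lines of Python. The function name `n_for_mean` and the example values (SD of 10, interval width of 4) are illustrative assumptions, not values from the source.

```python
import math

def n_for_mean(sd: float, width: float, z: float = 1.96) -> int:
    """Sample size to estimate a mean with a confidence interval of total
    width `width`, using n = z^2 * 4 * SD^2 / d^2 (z = 1.96 for 95%)."""
    n = z**2 * 4 * sd**2 / width**2
    return math.ceil(n)  # round up to a whole number of subjects

if __name__ == "__main__":
    # Hypothetical values: SD = 10, desired CI width = 4 units
    print(n_for_mean(sd=10, width=4))  # 97
    # Halving the width roughly quadruples the required sample size
    print(n_for_mean(sd=10, width=2))  # 385
```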

Answer 21

The following information is required:
* The expected population proportion, p
* The desired width of the confidence interval (d)
* The confidence level
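As a sketch, the same approach works for a proportion if SD² is replaced by p(1 − p), which assumes the usual large-sample normal approximation. The card itself does not state this formula; the function name `n_for_proportion` and the example values are illustrative.

```python
import math

def n_for_proportion(p: float, width: float, z: float = 1.96) -> int:
    """Sample size to estimate a proportion with a confidence interval of
    total width `width`, assuming the normal approximation:
    n = z^2 * 4 * p * (1 - p) / d^2."""
    n = z**2 * 4 * p * (1 - p) / width**2
    return math.ceil(n)  # round up to a whole number of subjects

if __name__ == "__main__":
    # Hypothetical values: expected prevalence 20%, CI width 10 percentage points
    print(n_for_proportion(p=0.2, width=0.1))  # 246
    # p = 0.5 gives the largest variance and so the largest sample size
    print(n_for_proportion(p=0.5, width=0.1))  # 385
```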

Answer 22

We conclude that there is a difference between the groups in the target populations when in fact there is not. This is actually the significance level of the test and so when we use 0.05 or 5% as the cut-off for statistical significance, then the probability of a type 1 error is 5%. This is often denoted by ‘α’.

Answer 23

We conclude that there is no difference between the groups in the target population when in fact a real difference of a given size does exist. The type 2 error is often denoted by ‘β’ and 1–β is the power of the study.

Answer 24

The following information is required:
* The standard deviation (SD) of the measure being compared
* The minimum difference (d) that is clinically important
* The significance level (α)
* The power of the test (1–β)
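A minimal sketch of how these four inputs combine, assuming the standard normal-approximation formula for two equal-sized groups with a common SD, n per group = 2(z₁₋α/₂ + z₁₋β)² SD² / d². The card does not give this formula; the function name and example values are illustrative.

```python
import math
from statistics import NormalDist

def n_per_group_means(sd: float, d: float,
                      alpha: float = 0.05, power: float = 0.90) -> int:
    """Subjects per group to detect a difference in means of `d`,
    assuming equal group sizes, a common SD and a two-sided test."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # e.g. 1.96 for alpha = 0.05
    z_beta = NormalDist().inv_cdf(power)           # e.g. 1.28 for 90% power
    n = 2 * (z_alpha + z_beta) ** 2 * sd**2 / d**2
    return math.ceil(n)

if __name__ == "__main__":
    # Hypothetical values: SD = 10, minimum important difference = 5
    print(n_per_group_means(sd=10, d=5, power=0.90))  # 85 per group
    print(n_per_group_means(sd=10, d=5, power=0.80))  # 63 per group
```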

Answer 25

The following information is required:
* The expected population proportion in group 1, P1
* The expected population proportion in group 2, P2
* The significance level (α)
* The power of the test (1–β)
The expected population proportions in group 1 and group 2 are the best estimates of what these values will be. The difference between them therefore reflects the anticipated change in the proportion which would be regarded as clinically important. The significance level, α, is the probability of a type 1 error and is usually set at 5%. The power of the test, 1–β, is the probability of getting a significant result when the true difference between the proportions is the anticipated difference (P1 – P2), and is set at 80% or more, preferably 90%.
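The calculation can be sketched as follows, assuming the common normal-approximation formula for two equal-sized groups, n per group = (z₁₋α/₂ + z₁₋β)² [P1(1 − P1) + P2(1 − P2)] / (P1 − P2)². The card does not state a formula, so this is one standard choice rather than the source's own method; the names and example values are illustrative.

```python
import math
from statistics import NormalDist

def n_per_group_proportions(p1: float, p2: float,
                            alpha: float = 0.05, power: float = 0.90) -> int:
    """Subjects per group to detect a difference between proportions
    p1 and p2, assuming equal group sizes and a two-sided test."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    variance_sum = p1 * (1 - p1) + p2 * (1 - p2)
    n = (z_alpha + z_beta) ** 2 * variance_sum / (p1 - p2) ** 2
    return math.ceil(n)

if __name__ == "__main__":
    # Hypothetical values: 20% in group 1 vs 30% in group 2
    print(n_per_group_proportions(0.2, 0.3, power=0.90))  # 389 per group
    print(n_per_group_proportions(0.2, 0.3, power=0.80))  # 291 per group
```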

Answer 26

* There is no attrition, i.e. the total number of patients successfully recruited and who complete the study is equal to the number required
* For comparative studies, there are equal numbers of subjects in each group
* Samples are simple random samples; any randomization is at the individual level. Sample size calculations are different for cluster samples or cluster randomization, and the usual calculations will give too few subjects
* For comparative studies, a simple comparison of two groups only will be made. Multiple regression or logistic regression (Chapter 12) is not planned
* The samples are large enough to use large sample methods for the analysis

Answer 27

* Qualitative (descriptive) studies
* Pilot studies

Answer 28

Odds ratios and risk ratios are usually analysed on the log scale, which makes them symmetrical.
* Ratio statistics all have a lowest possible value of 0 and can extend up to infinity, with 1 meaning equal risk
* Without log transformation the number scale is not symmetric: an OR of 0.5 means half the odds and an OR of 2 means double the odds, but the average of 0.5 and 2 is 1.25, not 1
* Taking logs to the base e: OR 0.5 --> -0.69 and OR 2 --> 0.69. These average to 0, i.e. a value of 1 on the original scale
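The arithmetic on this card can be checked with a short sketch using the two example odds ratios from the card; the variable names are illustrative.

```python
import math

odds_ratios = [0.5, 2.0]  # halved odds and doubled odds

# Averaging on the original scale is pulled towards the larger value
arithmetic_mean = sum(odds_ratios) / len(odds_ratios)

# Averaging on the natural-log scale treats 0.5 and 2 as equal and opposite
log_values = [math.log(x) for x in odds_ratios]
mean_log = sum(log_values) / len(log_values)
back_transformed = math.exp(mean_log)

if __name__ == "__main__":
    print(arithmetic_mean)                    # 1.25
    print([round(v, 2) for v in log_values])  # [-0.69, 0.69]
    print(back_transformed)                   # 1.0
```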

Answer 29

Graphical displays for meta-analyses performed on ratio scales usually use a log scale. This has the effect of making the confidence intervals appear symmetric, for the same reasons as log-based odds ratios.

Answer 30

For interventions that increase the chances of events, the odds ratio will be larger than the risk ratio, so the misinterpretation will tend to overestimate the intervention effect, especially when events are common (with, say, risks of events more than 20%). For interventions that reduce the chances of events, the odds ratio will be smaller than the risk ratio, so that, again, misinterpretation overestimates the effect of the intervention. This error in interpretation is unfortunately quite common in published reports of individual studies and systematic reviews.
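A small sketch makes the size of this discrepancy concrete. The risks used are hypothetical, and the function names are illustrative.

```python
def risk_ratio(risk_treated: float, risk_control: float) -> float:
    """Ratio of the two risks."""
    return risk_treated / risk_control

def odds_ratio(risk_treated: float, risk_control: float) -> float:
    """Ratio of the two odds, where odds = risk / (1 - risk)."""
    odds_treated = risk_treated / (1 - risk_treated)
    odds_control = risk_control / (1 - risk_control)
    return odds_treated / odds_control

if __name__ == "__main__":
    scenarios = {
        "common event, harmful": (0.60, 0.40),
        "common event, protective": (0.20, 0.40),
        "rare event, harmful": (0.015, 0.010),
    }
    for label, (rt, rc) in scenarios.items():
        print(f"{label}: RR = {risk_ratio(rt, rc):.2f}, OR = {odds_ratio(rt, rc):.2f}")
```

With these hypothetical risks, the common harmful event gives RR 1.50 but OR 2.25, and the common protective event gives RR 0.50 but OR 0.38, so reading the odds ratio as a risk ratio exaggerates the effect in both directions. For the rare event the two are almost identical (RR 1.50, OR 1.51).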

Answer 31

The risk difference is naturally constrained (like the risk ratio), which may create difficulties when applying results to other patient groups and settings. For example, if a study or meta-analysis estimates a risk difference of –0.1 (or –10%), then for a group with an initial risk of, say, 7% the outcome will have an impossible estimated negative probability of –3%. Similar scenarios for increases in risk occur at the other end of the scale. Such problems can arise only when the results are applied to populations with different risks from those observed in the studies.

Answer 32

Used when the outcome was measured in different ways across studies. It is necessary to standardize the results of the studies to a uniform scale before they can be combined. The SMD expresses the size of the intervention effect in each study relative to the between-participant variability in outcome measurements observed in that study.
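As a rough sketch, the SMD can be computed as the difference in means divided by the pooled standard deviation. This is the simplest form; some software also applies a small-sample correction, which is omitted here. The function name and the two hypothetical studies are illustrative assumptions.

```python
import math

def smd(mean_t: float, sd_t: float, n_t: int,
        mean_c: float, sd_c: float, n_c: int) -> float:
    """Standardized mean difference: difference in means divided by the
    pooled SD (no small-sample correction applied)."""
    pooled_var = ((n_t - 1) * sd_t**2 + (n_c - 1) * sd_c**2) / (n_t + n_c - 2)
    return (mean_t - mean_c) / math.sqrt(pooled_var)

if __name__ == "__main__":
    # Hypothetical study A measures the outcome on a 0-100 scale
    print(smd(60, 20, 50, 50, 20, 50))  # 0.5
    # Hypothetical study B measures the same construct on a 0-10 scale
    print(smd(6, 2, 50, 5, 2, 50))      # 0.5
```

The two hypothetical studies use different scales but yield the same SMD, which is what allows them to be combined.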

Answer 33

Patients who contribute some period of follow-up time that does not end in an event are said to be censored

Answer 34

For each participant, two factors are important:
1. The length of time during which no event occurred
2. Whether the end of that time was due to an event occurring or just the end of observation
It is not appropriate to analyse time-to-event data using methods for continuous outcomes (e.g. using mean times-to-event), as the relevant times are only known for the subset of participants who have had the event.

Answer 35

Hazard ratio describes how many times more (or less) likely a participant is to suffer the event at a particular point in time if they receive the experimental rather than the comparator intervention. When comparing interventions in a study or meta-analysis, a simplifying assumption is often made that the hazard ratio is constant across the follow-up period, even though hazards themselves may vary continuously. This is known as the proportional hazards assumption.

Answer 36

In cluster-randomized trials, groups of individuals rather than individuals are randomized to different interventions. We say the ‘unit of allocation’ is the cluster, or the group. The groups may be, for example, schools, villages, medical practices or families. Cluster-randomized trials may be done for one of several reasons. It may be to evaluate the group effect of an intervention, for example herd-immunity of a vaccine. It may be to avoid ‘contamination’ across interventions when trial participants are managed within the same setting, for example in a trial evaluating training of clinicians in a clinic. A cluster-randomized design may be used simply for convenience. One of the main consequences of a cluster design is that participants within any one cluster often tend to respond in a similar manner, and thus their data can no longer be assumed to be independent. It is important that the analysis of a cluster-randomized trial takes this issue into account. Unfortunately, many studies have in the past been incorrectly analysed as though the unit of allocation had been the individual participants (Eldridge et al 2008). This is often referred to as a ‘unit-of-analysis error’ (Whiting-O’Keefe et al 1984) because the unit of analysis is different from the unit of allocation. If the clustering is ignored and cluster-randomized trials are analysed as if individuals had been randomized, resulting confidence intervals will be artificially narrow and P values will be artificially small. This can result in false-positive conclusions that the intervention had an effect. In the context of a meta-analysis, studies in which clustering has been ignored will receive more weight than is appropriate.
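One common way to quantify the loss of independence is the design effect, 1 + (m − 1) × ICC, where m is the average cluster size and ICC is the intracluster correlation coefficient. The card does not mention this formula, so the sketch below is an illustration under that assumption, with hypothetical values and function names.

```python
import math

def design_effect(cluster_size: float, icc: float) -> float:
    """Variance inflation from clustering: 1 + (m - 1) * ICC.
    Assumes roughly equal cluster sizes."""
    return 1 + (cluster_size - 1) * icc

def inflated_sample_size(n_individual: int, cluster_size: float, icc: float) -> int:
    """Total sample size for a cluster design, given the size an
    individually randomized trial would need."""
    n = n_individual * design_effect(cluster_size, icc)
    return math.ceil(round(n, 6))  # round first to guard against floating-point noise

if __name__ == "__main__":
    # Hypothetical values: 21 participants per cluster, ICC = 0.05
    print(design_effect(21, 0.05))              # about 2.0
    print(inflated_sample_size(200, 21, 0.05))  # 400
```

In this hypothetical case, ignoring the clustering would treat 400 clustered participants as if they carried the information of 400 independent ones rather than about 200, which is how the artificially narrow confidence intervals described above arise.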

(60 cards)