Final 1 Flashcards
What is the goal of a Phase I Trial?
- Determine which dose of a drug is safe and most likely to show benefit
- Estimate the largest dose that can be given before patients experience unacceptable toxicity (the maximum tolerated dose, MTD)
- Start with a low dose and escalate until a prespecified level of toxicity is reached
Give a brief overview of the 3+3 (Step Up/Step Down) Design
- Rule based
• Treat 3 participants at dose level k
– If 0 DLTs (dose-limiting toxicities), escalate to dose level k+1
– If 2+ DLTs, de-escalate to dose level k-1
– If 1 DLT, treat 3 additional participants at dose level k
— If 1 DLT in 6, escalate to dose level k+1
— If 2+ DLTs in 6, de-escalate to dose level k-1
– The MTD is the highest dose at which 0 or 1 DLT is observed (repeat as needed)
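The escalation rules above can be written as a short decision function. A minimal Python sketch, assuming hypothetical inputs (DLT counts for the first cohort of 3 and an optional expansion cohort of 3); this illustrates the rules on this card, not any trial's actual software:

```python
def three_plus_three_decision(dlts_first_3, dlts_second_3=None):
    """Return the 3+3 action at the current dose level k (illustrative sketch).

    dlts_first_3:  DLTs among the first cohort of 3 (0-3)
    dlts_second_3: DLTs among the expansion cohort of 3, or None if the dose
                   has not been expanded yet
    """
    if dlts_second_3 is None:             # only the first cohort of 3 treated
        if dlts_first_3 == 0:
            return "escalate to dose k+1"
        if dlts_first_3 == 1:
            return "treat 3 more at dose k"
        return "de-escalate to dose k-1"  # 2+ DLTs in the first 3
    total = dlts_first_3 + dlts_second_3  # out of 6 participants
    if total <= 1:
        return "escalate to dose k+1"
    return "de-escalate to dose k-1"      # 2+ DLTs in 6

print(three_plus_three_decision(1))      # 1/3 DLTs -> expand the cohort
print(three_plus_three_decision(1, 0))   # 1/6 DLTs -> escalate
```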
What are some issues with the Step Up-Step Down (3+3 Design)
- Ignores dose history other than the previous 3 patients
- Imprecise and inaccurate MTD estimation
- Low probability of selecting the true MTD
- High variability in MTD estimates
- Can produce dangerous outcomes (e.g., escalating patients to overly toxic doses or stopping at a subtherapeutic dose)
What is the goal of a Phase II Study?
Identify agents with POTENTIAL efficacy (does not test efficacy).
- Discard agents without promise
- OR support continuing the experiment
What is a pilot study? (And what are the two types)?
- Pilot Study: A non-randomized clinical trial to determine if a treatment should be tested in a large RCT
- Phase IIa: Pilot studies designed to demonstrate clinical efficacy or biological activity
- Phase IIb: Studies to determine the optimal dose for biological activity with minimal side effects (for some conditions this is phase I)
How do we determine the sample size of a Pilot/Phase II study?
We want a sample large enough to have a high probability of detecting any common complications of the treatment.
- p = 1 − (1 − r)^N
• p = probability of observing at least one complication
• r = complication rate
• N = sample size
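As a quick illustration with assumed numbers (not from the notes): if the complication rate is r = 0.05, the smallest N with at least a 95% chance of observing one or more complications comes from solving 1 − (1 − r)^N ≥ 0.95:

```python
import math

r = 0.05         # assumed complication rate (illustrative)
target_p = 0.95  # desired probability of seeing at least one complication

# p = 1 - (1 - r)^N  =>  N >= log(1 - target_p) / log(1 - r)
N = math.ceil(math.log(1 - target_p) / math.log(1 - r))
p = 1 - (1 - r) ** N

print(N, round(p, 3))  # N = 59 participants gives p ≈ 0.951
```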
*Which type of error is a more serious concern in pilot studies?
Type II error is a more serious concern
- Don’t want to reject treatments that offer large patient benefits on the basis of small pilot studies – i.e., don’t want to conclude that a new treatment is ineffective when it might actually be effective
- Type I errors will be caught in Phase III (theoretically)
What is the formulation of null hypotheses in Futility studies and how does it compare to conventional studies?
- The null and alternative hypotheses are reversed relative to a conventional study. Alpha and beta are flipped, so alpha is higher (about 0.2) and beta about 0.05
- Often there is no comparison arm (single-arm study), or sometimes a historical control is used
- Smaller sample size
- One-sided hypothesis – the result is compared to a pre-specified fixed value
What are the hypotheses in a continuous futility study and how do they compare to a conventional study?
- Target threshold = control mean (μ_Cx) + increase considered clinically meaningful (δ). If the new treatment's mean is less than this threshold, don't move on to Phase III.
- Hypotheses (futility vs. conventional):
• Futility: H0: μ_T ≥ μ_Cx + δ vs. HA: μ_T < μ_Cx + δ – rejecting H0 declares futility and stops development
• Conventional: H0: μ_T = μ_Cx vs. HA: μ_T > μ_Cx – rejecting H0 declares benefit
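A minimal sketch of how this one-sided comparison might be carried out in a single-arm study, with a made-up historical control mean (20), δ (5), and simulated treatment-arm data; this illustrates the hypotheses above, not a prescribed analysis:

```python
# Single-arm continuous futility test (illustrative values only).
# H0: mu_T >= mu_Cx + delta (treatment meets the threshold)
# HA: mu_T <  mu_Cx + delta (futile); rejecting H0 => stop development.
import numpy as np
from scipy import stats

mu_cx = 20.0   # assumed historical control mean (hypothetical)
delta = 5.0    # clinically meaningful increase (hypothetical)
threshold = mu_cx + delta

rng = np.random.default_rng(0)
y = rng.normal(loc=23.0, scale=8.0, size=40)  # simulated treatment-arm outcomes

# One-sample t-test of H0: mean >= threshold vs HA: mean < threshold
t_stat, p_less = stats.ttest_1samp(y, popmean=threshold, alternative="less")

alpha = 0.10   # futility designs tolerate a larger one-sided alpha
print(f"t = {t_stat:.2f}, one-sided p = {p_less:.3f}")
print("Declare futility (stop)" if p_less < alpha else "Continue toward Phase III")
```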
What is the interpretation of errors in a futility study and how does it compare to conventional studies?
- Flip alpha and beta relative to a conventional study:
• One-sided alpha – the chance we can tolerate of rejecting (abandoning) an acceptable treatment, about 0.10 or 0.15
• Beta – about 10-15% – accepting a greater chance of carrying an ineffective treatment forward (since it will be tested again in a larger study)
- In a conventional study, alpha (falsely declaring benefit) is kept small (about 0.05) while beta (missing a real benefit) is about 10-20%; the futility design reverses which error is treated as more costly
What are the hypotheses in a binary futility study and how does it compare to a conventional study?
- Target threshold = proportion expected to fail in the control (P_Cx) − reduction in failures considered clinically meaningful (δ)
• If the failure proportion on the new treatment is greater than P_Cx − δ, declare futility and stop
- Hypotheses (futility vs. conventional):
• Futility: H0: P_T ≤ P_Cx − δ vs. HA: P_T > P_Cx − δ – rejecting H0 declares futility and stops development
• Conventional: H0: P_T = P_Cx vs. HA: P_T < P_Cx – rejecting H0 declares benefit
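The binary case in the same spirit, again with made-up numbers (historical failure proportion 0.40, δ = 0.10, single-arm results invented for illustration):

```python
# Single-arm binary futility test (illustrative values only).
# H0: p_T <= P_Cx - delta (treatment meets the threshold)
# HA: p_T >  P_Cx - delta (futile); rejecting H0 => stop development.
from scipy import stats

p_cx = 0.40    # assumed historical control failure proportion (hypothetical)
delta = 0.10   # clinically meaningful reduction in failures (hypothetical)
threshold = p_cx - delta  # 0.30

n_treated = 50   # single-arm sample size (hypothetical)
n_failures = 21  # observed failures on the new treatment (hypothetical)

# Exact one-sided binomial test of H0: p <= threshold vs HA: p > threshold
result = stats.binomtest(n_failures, n_treated, p=threshold, alternative="greater")

alpha = 0.10
print(f"observed failure proportion = {n_failures / n_treated:.2f}, "
      f"one-sided p = {result.pvalue:.3f}")
print("Declare futility (stop)" if result.pvalue < alpha else "Continue toward Phase III")
```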
Describe the different types of control groups for futility studies.
- Historical controls – potential for bias due to changes in background treatment over time
- Calibration control group – a small concurrent group compared to the historical control, to test for bias in the historical controls
- Concurrent controls – if good information on controls is available, may not need any controls, or may only need a few (for randomization/masking)
Describe a Simple Selection Design
- Phase II
- Selecting the best among K treatments to take forward to Phase III, based on statistical ranking and selection theory
- Sample size is estimated to ensure that if the best treatment is superior by at least D, it will be selected with high probability (e.g., 90%)
- Somewhat arbitrary, and even the “best” treatment might not carry through
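A small simulation sketch of the selection idea (all numbers are assumptions for illustration): with k = 3 treatments where one is truly better by D, how often does "pick the arm with the best observed mean" choose the right arm at a given per-arm sample size?

```python
# Simulation sketch of a simple selection design (illustrative assumptions).
import numpy as np

rng = np.random.default_rng(1)

k = 3            # number of candidate treatments
D = 0.5          # true superiority of the best arm, in SD units (assumed)
n_per_arm = 30   # per-arm sample size to evaluate (assumed)
n_sims = 10_000

true_means = np.zeros(k)
true_means[-1] = D   # the last arm is truly best by D

correct = 0
for _ in range(n_sims):
    # observed arm means: normal around the true means with sd = 1/sqrt(n)
    obs = rng.normal(loc=true_means, scale=1.0 / np.sqrt(n_per_arm))
    if obs.argmax() == true_means.argmax():
        correct += 1

print(f"P(select the truly best arm) ≈ {correct / n_sims:.2f}")
# Increase n_per_arm until this probability reaches the target (e.g., 0.90).
```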
Describe the three types of Selection and Futility/Superiority Designs
- Phase II
- Selection + Superiority (A)
– Multiple steps (sketched in code after this list):
• Step 1: Randomize to k treatments + control; calculate a test statistic for each treatment vs. control; if the maximum difference is at least as large as the cutoff, proceed; if not, stop
• Step 2: Randomize more patients to the treatment chosen in Step 1 and to control; calculate a weighted test statistic combining Steps 1 and 2; if it is at least as large as the cutoff, reject the null; if not, stop
• If the null is rejected, proceed to Phase III
– Poor specificity (often terminates early) but rarely chooses a suboptimal treatment
– Lower sample sizes than some alternatives
- Selection + Superiority (B)
– Similar to A but with no control arm in Stage I
- Selection + Futility
– Includes a control arm and a concurrently controlled futility study; requires simulation to design
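A compressed simulation sketch of the two-stage Selection + Superiority (A) flow. All sample sizes, effect sizes, and cutoffs here are placeholders; in a real design the cutoffs would be calibrated (by simulation) so the two-stage procedure controls the overall Type I error:

```python
# Two-stage Selection + Superiority (A) sketch (illustrative only).
import numpy as np

rng = np.random.default_rng(2)

n1, n2 = 25, 50              # Stage 1 / Stage 2 per-arm sizes (assumed)
c1, c2 = 0.0, 1.96           # placeholder cutoffs (calibrate by simulation)
true_means = np.array([0.0, 0.2, 0.4])  # k = 3 treatment effects vs. control (assumed)
sigma = 1.0

def z_stat(treat, ctrl):
    n = len(treat)
    return (treat.mean() - ctrl.mean()) / (sigma * np.sqrt(2.0 / n))

# Stage 1: randomize to k treatments + control, pick the best-looking arm
ctrl1 = rng.normal(0.0, sigma, n1)
z1 = np.array([z_stat(rng.normal(m, sigma, n1), ctrl1) for m in true_means])
best = int(z1.argmax())

if z1[best] < c1:
    print("Stop after Stage 1 (no arm clears the cutoff)")
else:
    # Stage 2: selected treatment + control only
    ctrl2 = rng.normal(0.0, sigma, n2)
    trt2 = rng.normal(true_means[best], sigma, n2)
    z2 = z_stat(trt2, ctrl2)
    # weighted combination with pre-specified weights proportional to sqrt(n)
    z_comb = (np.sqrt(n1) * z1[best] + np.sqrt(n2) * z2) / np.sqrt(n1 + n2)
    if z_comb >= c2:
        print(f"Arm {best} selected: reject null, proceed to Phase III (Z = {z_comb:.2f})")
    else:
        print(f"Arm {best} selected: stop after Stage 2 (Z = {z_comb:.2f})")
```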
Why do we do Phase II/Pilot Studies?
- General notes on selection designs:
– An efficient way to identify the treatment with the most potential for effectiveness
– Designed to select one best treatment of many in the pilot phase
– If the goal is to determine which treatments (more than one) to move forward with, this is too limiting
- Why do we do this?
– Good for finding the dose and for determining whether a drug is worth studying further
– Too many drugs and combinations to test them all in Phase III – long and expensive
— A long-term outcome requires at least 5 years to the first interim analysis, so futility cannot be assessed in the short term
– The alternative of starting with patients and going seamlessly from Phase II to Phase III has drawbacks:
— Hard to recruit and retain patients because of the long commitment (especially for placebo)
— Does not allow for change over time in the standard of care, etc.
Define Intraclass correlation and cluster randomization
- People in clusters are more alike than people across clusters
- POSITIVE INTRACLASS CORRELATION REDUCES VARIATION AMONG MEMBERS OF THE SAME GROUP
- Total variance = variance within groups + variance between groups
- Within-group variance = σ²_y(1 − ICC)
– If you don't take the ICC into account, within-group variance = overall variance
- Between-group (cluster) variance = σ²_y(ICC)
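A tiny numeric illustration of this decomposition (the total variance and ICC are made-up values):

```python
# Variance decomposition for a given ICC (illustrative numbers).
var_y = 100.0  # total (overall) variance of the outcome (assumed)
icc = 0.05     # intraclass correlation (assumed)

var_within = var_y * (1 - icc)  # 95.0 -> variation within clusters
var_between = var_y * icc       #  5.0 -> variation attributable to clusters

print(var_within, var_between, var_within + var_between)  # 95.0 5.0 100.0
```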
Define the Variance Inflation Factor.
- VIF = 1 + (n − 1)·ICC, where n is the cluster size (the design inflation factor)
- Design effect (DEFF) = design inflation factor. Ignoring the DEFF inflates the Type I error of the test, because the variance estimate would be biased downwards
- The variance of a group mean in a group-randomized (cluster) trial is greater than under individual randomization by the factor DEFF = 1 + (n − 1)·ICC
- EXAMPLE: Z = (x̄ − μ₀)/SE(x̄). If the variance (and hence the SE) is smaller than it should be, Z is larger than it should be, the p-value is smaller than it should be, and the Type I error rate increases
What impact would this DEFF have on the computed alpha level of the test if we ignored the DEFF?
The variance estimate will be biased downwards. The variance (through the SE) is in the denominator of the Z value, therefore Z will be larger than it should be, the p-value will be smaller than it should be, and the Type I error increases.
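A small numeric sketch of that inflation, with an assumed cluster size, ICC, and observed difference (none of these numbers come from the notes):

```python
# Effect of ignoring the design effect (DEFF) on a Z statistic (illustrative).
import math
from scipy import stats

n_clusters, cluster_size, icc = 20, 30, 0.02  # assumed design per arm
sigma2 = 1.0                                  # outcome variance (assumed)
observed_diff = 0.15                          # observed mean difference (assumed)

deff = 1 + (cluster_size - 1) * icc           # 1 + 29 * 0.02 = 1.58
n_individuals = n_clusters * cluster_size     # individuals per arm

# Naive SE (ignores clustering) vs. correct SE (inflated by sqrt(DEFF))
se_naive = math.sqrt(2 * sigma2 / n_individuals)
se_correct = math.sqrt(deff) * se_naive

for label, se in [("ignoring DEFF", se_naive), ("accounting for DEFF", se_correct)]:
    z = observed_diff / se
    p = 2 * (1 - stats.norm.cdf(z))
    print(f"{label:>20}: Z = {z:.2f}, two-sided p = {p:.3f}")
```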
Do we perform matched or unmatched analysis for cluster randomized trials and why?
Even though the randomization units are matched, we perform an unmatched analysis to preserve the Type I error rate and improve power (a matched analysis can be a problem when the number of pairs is small and the matching correlation is low)
What is a non-inferiority/equivalence trial and when would we use one?
- A trial with the primary objective of showing that the response to the investigational product is NOT CLINICALLY INFERIOR to the control
- Non-inferiority studies are good for treatments that are significantly cheaper, easier, less risky, etc.
- Non-inferiority can be for efficacy or for safety
- The new intervention might have other benefits such as fewer side effects, simpler administration, less invasiveness, or lower cost – we are willing to accept these advantages, but at how great a cost to efficacy?
- Controls: the current gold standard – historical data, placebo, active controls
What is an active control trial?
- A trial in which the experimental intervention is compared with an accepted standard intervention (the active control – one proven better than placebo). The goal is to show efficacy of the new treatment by showing that it is:
• Superior to the active control, OR
• As good as the active control
• The choice is based on clinical and statistical judgment (e.g., cost/benefit) – the active control should be a widely used treatment whose efficacy was established in a well-documented superiority trial, and the new treatment should show similar efficacy
What is the margin of non-inferiority delta? What is a key concern of it?
- Margin of non-inferiority δ: specified in the protocol; the maximum difference in response between the two interventions that is still considered clinically acceptable (e.g., ½ or 1/3 of the established superiority effect) – the new treatment must retain a certain proportion of the active control's efficacy
• Example: placebo – 40% mortality; active control (AC) – 30% mortality
• A margin of ½ the AC effect means accepting up to a 5 percentage-point difference, i.e., up to 35% mortality
• Non-inferiority compares against an active control, but we must consider the effectiveness of the active control relative to placebo when choosing the margin
- A key concern is choosing δ itself: is 35% − 30% = 5% acceptable, or 37% − 30% = 7%? We definitely cannot use a 10% margin, because then the new treatment could be no better than placebo. What is acceptable must be established in advance.
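A minimal sketch of applying such a margin in the analysis, using the mortality example above and a confidence-interval approach; the trial counts are invented for illustration:

```python
# Non-inferiority check via confidence interval (illustrative numbers).
# Margin: retain at least half of the active control's effect vs. placebo.
import math
from scipy import stats

p_placebo, p_active = 0.40, 0.30         # historical mortality (from the example)
margin = 0.5 * (p_placebo - p_active)    # 0.05 -> accept up to 35% mortality

# Hypothetical trial results: new treatment vs. active control
deaths_new, n_new = 66, 200              # 33% mortality (made up)
deaths_ac, n_ac = 60, 200                # 30% mortality (made up)

p_new, p_ac = deaths_new / n_new, deaths_ac / n_ac
diff = p_new - p_ac                      # excess mortality on the new treatment
se = math.sqrt(p_new * (1 - p_new) / n_new + p_ac * (1 - p_ac) / n_ac)
upper = diff + stats.norm.ppf(0.975) * se  # upper bound of the 95% CI

print(f"difference = {diff:.3f}, 95% CI upper bound = {upper:.3f}, margin = {margin:.3f}")
print("Non-inferior" if upper < margin else "Non-inferiority NOT shown")
```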
What are some concerns with the margin of non-inferiority delta?
- The trial must be very valid: a clear protocol, carefully and rigorously conducted, with minimal dropout, non-compliance, and missing data. These problems bias results toward the null, because participants not taking their assigned treatment look more like "placebo." THIS IS ESPECIALLY IMPORTANT FOR NON-INFERIORITY, because bias toward the null favors a non-inferiority conclusion, whereas it would bias away from the null (against showing effectiveness) in a superiority trial. It makes you more likely to call an inferior treatment non-inferior – it INCREASES THE PROBABILITY OF A TYPE II ERROR.
- Watch for things that might bias toward the null (dropout, non-compliance, missing data) even more than usual, since in this setting finding "no difference" is the favorable result, so bias toward the null works in the new treatment's favor
- The new treatment might actually be doing worse – it might cause more deaths – but we might not be able to detect this because of the above (increased probability of a Type II error)
What is assay sensitivity and what assumptions are necessary?
The ability of a trial to distinguish an effective treatment from a less effective or ineffective treatment.
- C > placebo is based on previous studies
- A > B > C is the non-inferiority study
- Is the trial internally valid?
- Assumptions:
• The active control is truly effective – if not, a less effective intervention may be approved based on the active-control comparison
• Constancy of the control effect – the control effect does not change over time (if not, a placebo arm is needed, NOT a non-inferiority test)