continuous outcome Flashcards
continuous outcome, non adjusted tests and adusted tests?
non adjusted
> t test
> Wilcoxon Mann-Whitney test
adjusted
> linear regression
> ancova
what is used to determine treatment effect with a continous outcome
mean difference
prior assumptions of a two sample t test?
- Groups are independent
- Individual measurements are independent
- Outcome is normally distributed
what are covariate effects
the quantified baseline covariates
what do we call an analysis based on covariate effects
ANCOVA
analysis of covariance
prior assumptions of linear regression
- Errors (residuals) are independent
- Errors are normally distributed for each value of X
- Errors have the same variance for each value of X (homoscedasticity)
- residual plot (plotting the residuals against a continuous predictor)
How do we check the assumptions of normality in the data?
- Histogram of residuals (should see gaussian distribution)
- Qq plot of residual (residuals should fall on a straight line)
- plotting the residuals against the fitted value (should see random scatter around zero)
what is a Q-Q plot of residuals?
the standardized residuals are plotted against the expected normal quantiles (z-scores)
why would we plot the residuals against the fitted value?
To identify potential issues with the model, such as nonlinearity, heteroscedasticity, or outliers.
If the points on the plot are randomly scattered around zero, = suggests model is a good fit for the data.
If the points show a clear pattern e.g., U-shape, suggests that the relationship between the predictor and outcome variable is nonlinear, or that there is a problem with heteroscedasticity (i.e., the variability of the residuals is not constant across the range of the predictor variable).
what is a residual plot used for?
helps identify potential nonlinear relationships between the predictor and outcome variables.
plot residuals against continuous predictor
If relationship is linear, the residuals should be randomly scattered around zero, with no clear pattern. However, if the relationship is nonlinear, the residuals will show a pattern that suggests a deviation from linearity.
why is it a good idea to check the histogram of continuous variables before reporting them
to determine which summary statistic is best for reporting central tendancy and variability
- variable with normal distribution, report mean and SD
- skewed distribution, report median and interquartile range (IQR)