topics Flashcards

Question

block designs for random effects

Answer 1

1. crossover design --> 2 outcomes per experimental unit (paired samples) --> apply treatment in opposite orders between conditions --> treatment, learning, and sequence effects 2. split plot design --> 2 treatment factors (independent samples) --> subplot and whole plot - to get p-values, anova(reduced model, full model) - (1|f) for random effect block

Answer 2

- order of variables in the model matters - variable of interest goes last - otherwise, p-values are unreliable

Answer 3

RBD: - 1 level of blocks - fixed effects SPD: - 2 levels of (randomized) blocks (whole and subplots) - mixed effects

Answer 4

fixed 1. one way ANOVA 2. two way ANOVA 3. randomized block design 4. repeated measures block design mixed 1. crossover design (paired) 2. split-plot design (independent)

Answer 5

- count of units in cross categories - test statistic: difference between expected and observed counts - always right sided (1-chisq())

Answer 6

- for 2x2 tables - odds ratio is used

Answer 7

comparable to pearson's correlation test - will give exactly the same t-score and p-value

Answer 8

- multiple explanatory variables - to find the best parameters, we minimize the sum of squared differences (SSE)

Answer 9

- sigma hat squared: residual standard error - R^2: proportion of explained variance compared to base model Y = B0 + e - F-statistic and overall p-value - all of these are found at the bottom of the output

Answer 10

- not all variables have explanatory power - we need to find the relevant ones by testing for individual coefficients - these are found in the individual rows of the output

Answer 11

step down: remove highest nonsignificant variable step up: add significant variable that yields maximum increase in R^2

Answer 12

1. least variables 2. highest R^2 (or only slight decrease) 3. interpretability

Answer 13

for population mean Ynew value

Answer 14

- for individual observation of Ynew - larger interval than CI as the error is taken into account

Answer 15

1. linearity of the relationship 2. normality

Answer 16

extremely low or high observation on the response variable

Answer 17

extremely low or high observation on the explanatory variable

Answer 18

- can be studied by testing model fit with and without the leverage point - if parameters change drastically by deleting this point, it's called an influence point - cook's distance quantifies the influence of an observation on predictions (>1)

Answer 19

- dummy vector with all 0s but 1 at outlier index - include as variable in the model - if variable is significant, the outlier is significant

Answer 20

- linear relations between explanatory variables, meaning they explain the same - straight line in scatterplot - reflected in large variances and large CIs --> unreliable estimates

Answer 21

1. pairwise linear correlations 2. VIF factor. (>5 = concern)

Answer 22

- extends ANOVA by including one or more variables that are expected to influence the dependent variable, but are not of primary interest - adjusts the DV for the covariates by holding them constant - variable not of interest is continuous (unlike RBD) - the only relevant p-value is for the variable of interest

Answer 23

gives coefficient estimates as difference between ai and a1

Answer 24

gives us p-values, t-statistics, etc

Answer 25

- H0: B1 = ... = Bi - parallel lines = no interaction - modeled with B_i instead of gamma - look at interaction p-valye in the output, the other values should be calculated separately

Answer 26

1. does not matter in balanced ANOVA 2. matters in unbalanced ANOVA 3. matters in ANCOVA (always) 4. matters in logistic regression (always)

Answer 27

- probability of making a Type I error (false positive) when multiple comparisons are being testsed - to provide FWER < 0.05, we use the bonferroni correction (alpha_ind = 0.05/m)

Answer 28

1. there are many parameters of interest 2. investigating all differences between factprs pf a set of effects in ANOVA

Answer 29

- usually everything is compared to B1 or a1. this is not simultaneous testing - tukey etc. show adjusted p-values for simultaneous testing of all Bs

Answer 30

- binary outcome - linear model for the log odds - probability of success

Answer 31

- log odds = log (p(success)/p(failure) = model - odds = e^model

Answer 32

multiplies the odds by e^delta

Answer 33

coefficient or additive model

Answer 34

1/(1+e^delta)

Answer 35

- if Y ~ poisson(lambda), then E(Y) = var(Y) = lambda - the larger the lambda parameter, the larger the values of Y on average, and the larger the spread in the values of Y - for very large values of lambda, the poisson distribution is approximately normal

Answer 36

- log(lambda) = model - lambda = e^model - QQplot is not useful here

Answer 37

- analysis of lifetimes - survival function: probability of survival until time t

Answer 38

- rate of dying within a short interval - how likely the event is to happen at a particular moment in timee

Answer 39

- incomplete observation of the survival time of a variable - (di = Ti < Ci) = event has not happened yet

Answer 40

- only categorical IVs - survival probabilities for specific times

Answer 41

- step function increases only at times where events occur

Answer 42

- tests whether 2+ survival curves are identical - can only deal with grouped data

Answer 43

- unlike KM model, can take many - kinds of predictors - main feature: coefficients can be estimated by maximizing the partial likelihood

Answer 44

- 1 group is a reference group - ai are expressed as difference between a1 and ai - can be set with 'contrasts' command

Answer 45

- ai are expressed as deviations from the mean - combined ai average is 0

topics Flashcards

(70 cards)