4: Precision, Statistical Inference and Goodness of Fit Flashcards
assumption 1 CLRM
errors have zero mean
assumption 2 CLRM
the errors have constant finite variance, homoscedasticity
assumption 3 CLRM
the errors are linearly independent of each other
assumption 4 CLRM
there is no relationship between an error and its corresponding x variate
- stronger alternative assumption is that the xt’s are stochastic (fixed in repeated samples)
assumption 5 CLRM
error is normally distributed
- required if we want to make inferences about population parameters from sample parameters
comments on SE estimators for intercept and slope
- sample size (the larger it is, the smaller the coefficient variances, the greater the sample size, the more information available)
- error variances (the SEs depend on s^2, the greater this is the more dispersed)
- total sum of squares (the larger this is, the smaller the coefficient variances)
R squared
measures how well the regression model fits the data
how much of the changes in y are explained by changes in x
problems with R squared
- not sensible to compare R squared values for models with different dependent variables
- as R squared never decreases, it cannot tell if a variable should be present in the model