Chapter 1 Concepts plus regression review Flashcards
Three principles of design:
- replication
- randomization
- local control of error (blocking)
Categories of experimental problems
- Treatment comparisons
- Variable screening
- Response surface exploration
- System optimization
- System robustness
Steps to experiment planning
- State objective
- Choose response
- Choose factor and levels
- Choose experimental plan
- Perform experiment
- Analyze the data
- Draw conclusions and make recommendations
Experimental units vs observational units
- Experimental unit is item that is being experimented on and measured
- Observational unit can be thought of as a technical replicate
Replication does the following:
- Replicating experimental units:
- Allows estimation of the experimental error
- Improves the precision of the estimates
- Replicating observational units:
- mimizes the impact of measurement error
- does NOT estimate experiment error
Randomization does the following
- Distributes the impact of any systematic bias
- ensures fair comparisons
- if bias is present, inflatest the estimate of error
- Elimates presumption bias
Local control of error does the following
- Also called blocking
- reduces the random error among the experimental units
- controls for anything which might affect the response other than the factors
Two important forms of local control of error:
- blocking
- covariates
Block
Groups of homogeneous units
Blocking
- arranged so that within block variation is smaller than between block
- should be applied to remove the block-to-block variation
- randomization is applied to assignments of treatments within the blocks
Derive the LSE estimators without matrix notation
Derive the LSE estimators using matrix notation
(X’X)-1X’y
Derive the expectation, variance and covariance estimates of the LSE estimators without using matrix notation
Derive the expectation, variance, and covariance of the LSE estimators using matrix notation
How do you estimate σ2 in LSE
σ_hat2 =
MSE =
RSS/(N-k-1)=
SUM (yi - yi_hat)2 / (N-k-1)
yT(I-H)y / (N-k-1)
For SLR, N-k-1 = N-2
degrees of freedom in multiple linear regression
df overall = N-1
df model = k
df error = N-k-1
Note: this assumes k regressors + intercept
such that model matrix is n x k+1