Regression Part-4 Flashcards
What is the difference between the data collected in observational sampling and designed sampling experiments?
Analyst cannot control the variables under study in one while in another we can control the levels of variables to determine their effect on variables
What is a Dummy Variable Trap?
When we have the same number of dummies as there are categories it creates a perfect collinearity problem
What happens to the dummy we use as reference?
That dummy variable is omitted to avoid the DVT
When is ANOVA used?
When we need to compare the means of 2 or more populations
Which component describes the mean of the base category of the dummy variable?
the intercept
What are the coefficients attached to the dummy variables called?
differential intercept coefficients and not the slope
How can we circumvent DVT?
By removing intercept (check what other problems may arise as a result)
What are main differences between intercept less and with intercept models?
- DF = n-1 / DF = n-2; 2. sum of residuals may not be zero/always zero; 3. r2 can be negative so only use raw r2 value/ r2 cant be negative; 4. Use raw sum of squares/mean adjusted sum of squares
What is an ANCOVA model?
Which contains both quantitative and qualitative variables. We can control the effects of covariates or control variables
What is the principle of marginality?
If the interaction of X1 and X2 are statistically significant then we retain both X1 and X2 even if they are not statistically significant
What is the similarity between concurrent and coincident regressions?
Both types mean that intercepts are the same
Why do we add Dummy Variable D in the interaction term when comparing two regressions/two categories?
To differentiate between the slope coefficients
Why do we add Dummy Variable D in the additive form term when comparing two regressions/two categories?
To distinguish between intercepts
When comparing two regressions, when are they concurrent
When differential slope coefficient of dummy is statistically insignificant, so they have same intercept
when comparing two regressions, when are they coincident
when both the differential slope coefficient of dummy and interaction is zero they have both slope and intercept same