3 - Regressions Flashcards

1
Q

What is the general form of a simple linear regression model?

A

Yi = β0 + β1 * Xi + εi, where Yi is the dependent variable, β0 is the intercept, β1 is the slope, Xi is the independent variable, and εi is the random error term.

2
Q

What are the two components of the response variable Yi in a simple linear model?

A

Yi is composed of a deterministic component (β0 + β1 * Xi) and a random component (εi), where β0 is the intercept, β1 is the slope, Xi is the independent variable, and εi is the random error term.

3
Q

What does the term E(Yi) represent in the context of linear regression?

A

E(Yi) = β0 + β1 * Xi, where E(Yi) is the expected value of Yi given Xi, β0 is the intercept, and β1 is the slope of the regression.

4
Q

Define the variance Var(Yi) in a simple linear model.

A

Var(Yi) = σ², the constant variance of the random error term εi (the deterministic part contributes no variance). In practice σ² is estimated by σ̂² = RSS/(n - 2), where RSS is the Residual Sum of Squares Σε̂i².

5
Q

What does the intercept β0 signify when X = 0?

A

β0 represents the mean value of Yi when the independent variable Xi equals zero, provided X = 0 lies within the scope of the model (i.e., within or near the range of observed X values).

6
Q

What method is used to estimate the parameters β0 and β1?

A

The least squares method is used to estimate the parameters β0 and β1, minimizing the sum of squared residuals.

7
Q

What are the least square estimators of β0 and β1?

A

The least squares estimators are β̂1 = Σ(Xi - X̄) * (Yi - Ȳ) / Σ(Xi - X̄)² for the slope, and β̂0 = Ȳ - β̂1 * X̄ for the intercept, where X̄ is the mean of Xi and Ȳ is the mean of Yi.

8
Q

What is the Gauss-Markov theorem about?

A

The Gauss-Markov theorem states that under the assumptions of linearity, zero-mean errors, homoscedasticity, and uncorrelated errors, the least squares estimators of β0 and β1 are the best linear unbiased estimators (BLUE), i.e., they have minimum variance among all linear unbiased estimators.

9
Q

What is the formula for the coefficient of determination R²?

A

R² = 1 - (SSRes / SSTot), where SSRes is the residual sum of squares and SSTot is the total sum of squares.

10
Q

In a simple linear regression model, what are residuals?

A

Residuals are the differences between the observed values Yi and the predicted values Ŷi, calculated as ε̂i = Yi - Ŷi; they serve as observable estimates of the unobservable errors εi.

11
Q

What is the formula for the least squares estimator of the slope?

A

β̂1 = (Σ(Xi - X̄) * (Yi - Ȳ)) / Σ(Xi - X̄)², where β̂1 is the estimated slope, Xi is the independent variable, X̄ is the mean of Xi, Yi is the dependent variable, and Ȳ is the mean of Yi.

12
Q

What is the formula for the least squares estimator of the intercept?

A

β̂0 = Ȳ - β̂1 * X̄, where β̂0 is the estimated intercept, Ȳ is the mean of Yi, β̂1 is the estimated slope, and X̄ is the mean of Xi.

13
Q

What is the formula for the fitted regression line?

A

Ŷi = β̂0 + β̂1 * Xi, where Ŷi is the predicted value of Yi, β̂0 is the estimated intercept, β̂1 is the estimated slope, and Xi is the independent variable.

14
Q

What is the formula for the coefficient of determination R²?

A

R² = 1 - (SSRes / SSTot), where R² is the coefficient of determination, SSRes is the residual sum of squares, and SSTot is the total sum of squares.

15
Q

What is the formula for the residual sum of squares (SSRes)?

A

SSRes = Σ(Yi - Ŷi)², where SSRes is the residual sum of squares, Yi is the observed value, and Ŷi is the predicted value from the regression line.

16
Q

What is the formula for the total sum of squares (SSTot)?

A

SSTot = Σ(Yi - Ȳ)², where SSTot is the total sum of squares, Yi is the observed value, and Ȳ is the mean of Yi.

17
Q

What is the formula for the variance of the slope estimator?

A

Var(β̂1) = σ² / Σ(Xi - X̄)², where Var(β̂1) is the variance of the slope estimator and σ² is the variance of the errors. In practice σ² is estimated by σ̂² = RSS/(n - 2), where RSS is the Residual Sum of Squares Σε̂i².

18
Q

What is the formula for the mean squared error (MSE)?

A

MSE = SSRes / (n - 2), where MSE is the mean squared error, SSRes is the residual sum of squares, and n is the number of observations.

19
Q

What is the formula for the confidence interval of the slope coefficient?

A

CI for β1: β̂1 ± t(1-α/2) * SE(β̂1),
where β̂1 is the estimated slope, t(1-α/2) is the critical t-value with n - 2 degrees of freedom, and SE(β̂1) is the standard error of the slope estimate.

20
Q

Explain the difference between the constant term and the random term in the simple linear model.

A

The constant term is the deterministic component β0 + β1 * Xi, where β0 is the intercept and β1 is the slope; the random term is εi, the error term representing the unexplained variation in Yi.

21
Q

How would you interpret the slope β1 in a simple linear regression model?

A

The slope β1 represents the change in the expected value of Yi for a one-unit increase in Xi, assuming all other variables remain constant.

22
Q

Why is the assumption of independent and identically distributed (i.i.d.) errors important in regression analysis?

A

The assumption of i.i.d. errors ensures that each error term εi has the same variance σ² and that errors are independent, which is crucial for the validity of least squares estimators and hypothesis tests.

23
Q

What does it mean when we say that the least squares estimators β̂0 and β̂1 are “unbiased”?

A

The estimators β̂0 and β̂1 are unbiased if their expected values E(β̂0) = β0 and E(β̂1) = β1, meaning that, on average, they correctly estimate the true parameters of the population.

24
Q

How does the Gauss-Markov theorem support the use of least squares estimators in linear regression?

A

The Gauss-Markov theorem states that under the assumptions of linearity, homoscedasticity, and uncorrelated, zero-mean errors, the least squares estimators β̂0 and β̂1 are the best linear unbiased estimators (BLUE), having the minimum variance among all linear unbiased estimators.

25
Q

Describe the role of residuals in determining the fit of a regression model.

A

Residuals, calculated as ε̂i = Yi - Ŷi, represent the difference between observed values Yi and predicted values Ŷi. Smaller residuals indicate a better fit of the regression model to the data.

26
Q

How would you interpret a coefficient of determination R² value of 0.85 in a regression model?

A

An R² value of 0.85 means that 85% of the variance in the dependent variable Yi is explained by the independent variable Xi in the regression model, indicating a strong relationship.

27
Q

In the context of regression, what is the purpose of partitioning the total sum of squares (SSTot) into SSReg and SSRes?

A

Partitioning SSTot into SSReg (explained variance) and SSRes (unexplained variance) helps in understanding how much of the total variation in Yi is explained by the regression model (SSReg) and how much remains unexplained (SSRes).

28
Q

How does the concept of homoscedasticity affect the interpretation of regression results?

A

Homoscedasticity means that the variance of the error terms εi is constant across all levels of Xi. If this assumption is violated, the standard errors of the coefficients β̂0 and β̂1 may be biased, affecting hypothesis tests and confidence intervals.

29
Q

Explain the significance of confidence intervals for the regression coefficients β0 and β1.

A

Confidence intervals for β0 and β1 provide a range of plausible values for these parameters, reflecting the precision of their estimates. A wider interval indicates more uncertainty, while a narrower interval suggests more precise estimates.

30
Q

Given a dataset of paired values (Xi, Yi), how would you apply the method of least squares to estimate the parameters β0 and β1?

A

To apply the least squares method, estimate β1 using β1 = (Σ(Xi - X̄) * (Yi - Ȳ)) / Σ(Xi - X̄)², where Xi is the independent variable, Yi is the dependent variable, X̄ is the mean of Xi, and Ȳ is the mean of Yi. Then, estimate β0 using β0 = Ȳ - β1 * X̄.
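A minimal numerical sketch of this recipe (Python with NumPy; the x and y arrays are made-up illustrative values, not data from the course):

import numpy as np

# Hypothetical paired observations (illustrative only)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

x_bar, y_bar = x.mean(), y.mean()

# Slope: β̂1 = Σ(Xi - X̄)(Yi - Ȳ) / Σ(Xi - X̄)²
beta1_hat = np.sum((x - x_bar) * (y - y_bar)) / np.sum((x - x_bar) ** 2)

# Intercept: β̂0 = Ȳ - β̂1 * X̄
beta0_hat = y_bar - beta1_hat * x_bar

print(beta0_hat, beta1_hat)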

31
Q

If you observe a high variance in the residuals after fitting a regression line, how would you modify or interpret your model to account for this?

A

High variance in the residuals suggests heteroscedasticity. To address this, you could apply transformations to the dependent variable Yi, such as logarithmic or square root transformations, or use weighted least squares to account for varying variance.

32
Q

How would you apply the Gauss-Markov theorem to justify using least squares estimates in a scenario where the errors are uncorrelated and have constant variance?

A

The Gauss-Markov theorem justifies the use of least squares estimators because, under the assumptions of linearity, uncorrelated errors, and homoscedasticity (constant variance of εi), the least squares estimators β̂0 and β̂1 are the best linear unbiased estimators (BLUE).

33
Q

How would you compute the predicted value of Y for a given X using the regression equation Ŷ = β̂0 + β̂1 * X?

A

The predicted value of Y for a given X is computed using the regression equation Ŷ = β̂0 + β̂1 * X, where Ŷ is the predicted value of the dependent variable Y, β̂0 is the estimated intercept, and β̂1 is the estimated slope.

34
Q

How would you interpret and apply the coefficient of determination R² to assess the fit of your linear regression model?

A

The coefficient of determination R² = 1 - (SSRes / SSTot) measures the proportion of the variance in Yi explained by Xi. An R² close to 1 indicates a good fit, meaning the model explains most of the variation in Yi, while an R² close to 0 indicates a poor fit.
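A short sketch of computing R² from the fitted line (Python with NumPy; the x and y arrays are made-up illustrative values):

import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])      # illustrative data
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

b1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()
y_hat = b0 + b1 * x                            # fitted values

ss_res = np.sum((y - y_hat) ** 2)              # SSRes
ss_tot = np.sum((y - y.mean()) ** 2)           # SSTot
r2 = 1 - ss_res / ss_tot
print(r2)                                      # close to 1 indicates a good fit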

35
Q

If the errors in your model are not normally distributed, how would this affect the interpretation of hypothesis tests on β0 and β1?

A

If the errors εi are not normally distributed, the validity of hypothesis tests on β0 and β1 may be compromised, as the standard t-tests and confidence intervals rely on normality. Non-normal errors may lead to incorrect conclusions about the significance of the estimates.

36
Q

Apply the concept of confidence intervals to evaluate the precision of your estimates for β0 and β1. How would you interpret wide vs. narrow confidence intervals?

A

Confidence intervals for β0 and β1 are calculated as
β̂ ± t(1-α/2) * SE(β̂),
where t(1-α/2) is the critical t-value and SE(β̂) is the standard error. A narrow interval suggests high precision and confidence in the estimate, while a wide interval indicates more uncertainty in the estimate.
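A minimal sketch of computing such an interval for β1 (Python with NumPy and SciPy; the data are made-up illustrative values):

import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])        # illustrative data
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])
n = len(x)

sxx = np.sum((x - x.mean()) ** 2)
b1 = np.sum((x - x.mean()) * (y - y.mean())) / sxx
b0 = y.mean() - b1 * x.mean()
resid = y - (b0 + b1 * x)

mse = np.sum(resid ** 2) / (n - 2)             # estimate of σ²
se_b1 = np.sqrt(mse / sxx)                     # SE(β̂1)

alpha = 0.05
t_crit = stats.t.ppf(1 - alpha / 2, df=n - 2)  # critical t-value
ci = (b1 - t_crit * se_b1, b1 + t_crit * se_b1)
print(ci)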

37
Q

Given that the residuals are not independent, how would you adapt or reconsider your model to account for this issue?

A

If the residuals εi are not independent, this violates the assumption of independence in the Gauss-Markov theorem. You could use methods such as generalized least squares (GLS) or include autocorrelation models, such as AR(1), to adjust for correlated errors.

38
Q

If the model’s assumptions of homoscedasticity are violated, what steps would you take to apply a different modeling approach, such as weighted least squares?

A

If homoscedasticity is violated (heteroscedasticity), weighted least squares (WLS) can be applied by assigning weights inversely proportional to the variance of the residuals, reducing the impact of observations with higher variance.
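A minimal sketch of closed-form weighted least squares, β̂ = (XᵀWX)⁻¹XᵀWy (Python with NumPy; the data and the assumed per-observation error variances var_i are made-up illustrative values):

import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])        # illustrative data
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])
var_i = np.array([0.1, 0.2, 0.4, 0.8, 1.6])    # assumed error variance per observation

X = np.column_stack([np.ones_like(x), x])      # design matrix [1, x]
W = np.diag(1.0 / var_i)                       # weights inversely proportional to variance

# Closed-form WLS estimate: solve (X'WX) β = X'Wy
beta_wls = np.linalg.solve(X.T @ W @ X, X.T @ W @ y)
print(beta_wls)                                # [intercept, slope]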

39
Q

How would you use the regression sum of squares (SSReg) and the residual sum of squares (SSRes) to test whether the explanatory variable X significantly explains the variation in Y?

A

To test whether X significantly explains the variation in Y, use the F-test: F = (SSReg / 1) / (SSRes / (n - 2)), where SSReg is the regression sum of squares, SSRes is the residual sum of squares, and n is the number of observations. A high F-value compared to the critical value indicates that X significantly explains the variation in Y.
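A minimal sketch of this F-test (Python with NumPy and SciPy; the data are made-up illustrative values):

import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])        # illustrative data
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])
n = len(x)

b1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()
y_hat = b0 + b1 * x

ss_reg = np.sum((y_hat - y.mean()) ** 2)       # SSReg, df = 1
ss_res = np.sum((y - y_hat) ** 2)              # SSRes, df = n - 2

f_stat = (ss_reg / 1) / (ss_res / (n - 2))
p_value = stats.f.sf(f_stat, 1, n - 2)         # upper-tail probability
print(f_stat, p_value)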

40
Q

What is the formula for the simple linear regression model Yi?

A

Yi = β0 + β1 * Xi + εi, where Yi is the dependent variable, β0 is the intercept, β1 is the slope, Xi is the independent variable, and εi is the random error term.

41
Q

Write the formula for the least squares estimator of the slope β̂1.

A

β̂1 = (Σ(Xi - X̄) * (Yi - Ȳ)) / Σ(Xi - X̄)², where β̂1 is the slope, Xi is the independent variable, X̄ is the mean of Xi, Yi is the dependent variable, and Ȳ is the mean of Yi.

42
Q

What is the formula for the least squares estimator of the intercept β̂0?

A

β̂0 = Ȳ - β̂1 * X̄, where β̂0 is the intercept, Ȳ is the mean of Yi, β̂1 is the slope, and X̄ is the mean of Xi.

43
Q

Provide the formula for the fitted regression line Ŷ.

A

Ŷ = β̂0 + β̂1 * Xi, where Ŷ is the predicted value of Yi, β̂0 is the intercept, β̂1 is the slope, and Xi is the independent variable.

44
Q

What is the formula for the coefficient of determination R²?

A

R² = 1 - (SSRes / SSTot), where R² is the proportion of variance in Yi explained by Xi, SSRes is the residual sum of squares, and SSTot is the total sum of squares.

45
Q

Write the formula for the residual sum of squares (SSRes).

A

SSRes = Σ(Yi - Ŷi)², where SSRes is the sum of squared residuals, Yi is the observed value, and Ŷi is the predicted value from the regression line.

46
Q

Provide the formula for the total sum of squares (SSTot).

A

SSTot = Σ(Yi - Ȳ)², where SSTot is the total sum of squares, Yi is the observed value, and Ȳ is the mean of Yi.

47
Q

What is the formula for the variance of the slope estimator β̂1?

A

Var(β̂1) = σ² / Σ(Xi - X̄)², where Var(β̂1) is the variance of the slope estimator and σ² is the error variance.
In practice σ² is estimated by σ̂² = RSS/(n - 2), where RSS is the Residual Sum of Squares Σε̂i².

48
Q

Give the formula for the mean squared error (MSE) of the residuals.

A

MSE = SSRes / (n - 2), where MSE is the mean squared error, SSRes is the residual sum of squares, and n is the number of observations.

49
Q

Write the formula for a (1-α)% confidence interval for β1.

A

CI for β1 = β̂1 ± t(1-α/2) * SE(β̂1), where CI is the confidence interval, β̂1 is the slope estimate, t(1-α/2) is the critical t-value, and SE(β̂1) is the standard error of the slope.

50
Q

What is the relationship between the variance of the slope estimator Var(β̂1), the mean squared error (MSE), and the residual sum of squares (SSRes)?

A

Var(β̂1) = σ² / Σ(Xi - X̄)². Substituting the estimate σ̂² = SSRes / (n - 2) = MSE for σ² gives
Var(β̂1) = SSRes / ((n - 2) * Σ(Xi - X̄)²) = MSE / Σ(Xi - X̄)².
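A short numerical sketch of these equivalent expressions (Python with NumPy; made-up illustrative data):

import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])                # illustrative data
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])
n = len(x)

sxx = np.sum((x - x.mean()) ** 2)                      # Σ(Xi - X̄)²
b1 = np.sum((x - x.mean()) * (y - y.mean())) / sxx
b0 = y.mean() - b1 * x.mean()

ss_res = np.sum((y - (b0 + b1 * x)) ** 2)              # SSRes
mse = ss_res / (n - 2)                                 # MSE = SSRes / (n - 2)

var_b1 = mse / sxx                                     # = SSRes / ((n - 2) * Sxx)
print(var_b1, ss_res / ((n - 2) * sxx))                # identical values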

51
Q

How would you assess whether the residuals in your regression model are normally distributed, and what are the implications if they are not?

A

To assess normality of the residuals εi, you can use visual methods like Q-Q plots or statistical tests like the Shapiro-Wilk test. If the residuals are not normally distributed, hypothesis tests and confidence intervals based on t-distributions may not be valid, especially in small samples, which could lead to incorrect conclusions about significance.
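A minimal sketch of checking residual normality (Python with NumPy and SciPy; made-up illustrative data; scipy.stats.probplot returns the Q-Q plot coordinates rather than drawing them):

import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])        # illustrative data
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

b1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()
resid = y - (b0 + b1 * x)

# Shapiro-Wilk test: a small p-value suggests non-normal residuals
w_stat, p_value = stats.shapiro(resid)
print(w_stat, p_value)

# Q-Q plot coordinates (theoretical vs. ordered sample quantiles)
(osm, osr), (slope, intercept, r) = stats.probplot(resid, dist="norm")
print(r)                                        # r close to 1: points lie near a straight line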

52
Q

Compare the method of least squares with the method of maximum likelihood in estimating the parameters β0 and β1. What are the key differences in their assumptions and outcomes?

A

The method of least squares minimizes the sum of squared residuals, assuming homoscedastic and uncorrelated errors. The method of maximum likelihood estimates parameters by maximizing the likelihood of the observed data, assuming the errors εi are normally distributed; under that normality assumption the ML estimates of β0 and β1 coincide with the least squares estimates (only the estimate of σ² differs). Least squares is the simpler method with fewer assumptions, while maximum likelihood can provide more efficient estimates when its distributional assumption holds but is more sensitive to model misspecification.

53
Q

Given the variance formula for the slope β̂1, how does the spread of the Xi values influence the precision of the slope estimate?

A

The variance of the slope estimator β̂1 is Var(β̂1) = σ² / Σ(Xi - X̄)², where σ² is the error variance and Σ(Xi - X̄)² represents the spread of Xi values. A larger spread (greater variance in Xi) reduces the variance of β̂1, leading to more precise (less variable) slope estimates. A narrow spread increases the uncertainty of the slope estimate.

54
Q

Analyze the impact of multicollinearity between predictor variables on the estimates of β0 and β1 in multiple regression. How does it affect the reliability of these estimates?

A

Multicollinearity occurs when two or more predictors are highly correlated, making it difficult to separate their individual effects on Yi. This leads to inflated standard errors of the estimates β̂0 and β̂1, making them less reliable. Small changes in the data can cause large swings in the parameter estimates, reducing the precision and stability of the model.

55
Q

Examine the partitioning of the total sum of squares (SSTot) into SSReg and SSRes. How does the magnitude of SSReg compared to SSRes inform you about the fit of the model?

A

The total sum of squares (SSTot) is partitioned into SSReg, the regression sum of squares, which explains the variation due to the independent variable Xi, and SSRes, the residual sum of squares, which represents unexplained variation. A large SSReg relative to SSRes indicates that the model fits well and explains a large portion of the variance in Yi.

56
Q

How would you evaluate the trade-off between bias and variance in the context of choosing estimators for β0 and β1?

A

In choosing estimators, a low-bias estimator such as least squares provides accurate estimates on average. However, if the estimator has high variance, it may be sensitive to sample fluctuations. A biased estimator may be preferred if it has much lower variance (e.g., ridge regression), balancing the trade-off between bias and variance to minimize mean squared error.

57
Q

If the regression residuals show a pattern or trend, what might this indicate about your model, and how could you address it analytically?

A

A pattern or trend in the residuals indicates that the model is misspecified, possibly due to non-linearity or omitted variables. You can address this by adding polynomial terms, transforming the variables (e.g., log or square root), or including additional relevant predictors to capture the underlying relationship.
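A minimal sketch of addressing curvature in the residuals by adding a quadratic term (Python with NumPy; the data are made-up values constructed to be curved):

import numpy as np

# Hypothetical data with curvature (illustrative only)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([1.2, 3.8, 8.9, 16.1, 24.8, 36.2])

# Straight-line fit leaves a systematic pattern in the residuals
b1, b0 = np.polyfit(x, y, 1)
resid_linear = y - (b0 + b1 * x)

# Adding a quadratic term captures the curvature
c2, c1, c0 = np.polyfit(x, y, 2)
resid_quad = y - (c0 + c1 * x + c2 * x ** 2)

print(np.sum(resid_linear ** 2), np.sum(resid_quad ** 2))  # SSRes drops markedly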

58
Q

Analyze how outliers can influence the least squares estimates. How would you detect and address outliers in your regression analysis?

A

Outliers can exert disproportionate influence on least squares estimates, distorting the estimated coefficients β̂0 and β̂1. You can detect outliers using diagnostic tools like residual plots, leverage statistics, or Cook’s distance. To address outliers, you may remove or transform them, or apply robust regression techniques that reduce their impact.
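A minimal sketch of flagging influential points with leverage and Cook's distance in simple linear regression (Python with NumPy; made-up illustrative data whose last observation is a deliberate outlier):

import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])   # illustrative data
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8, 25.0])  # last point is a deliberate outlier
n, p = len(x), 2                                # p = number of estimated parameters

sxx = np.sum((x - x.mean()) ** 2)
b1 = np.sum((x - x.mean()) * (y - y.mean())) / sxx
b0 = y.mean() - b1 * x.mean()
resid = y - (b0 + b1 * x)
mse = np.sum(resid ** 2) / (n - p)

# Leverage and Cook's distance for simple linear regression
h = 1.0 / n + (x - x.mean()) ** 2 / sxx
cooks_d = (resid ** 2 / (p * mse)) * (h / (1 - h) ** 2)
print(cooks_d)                                  # large values flag influential points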

59
Q

Compare and contrast one-sided and two-sided hypothesis tests in linear regression. In which situations would a one-sided test provide more analytical insight than a two-sided test?

A

A one-sided test assesses whether a parameter (e.g., β1) is greater or less than a specified value, providing insight when directionality is of interest. A two-sided test evaluates whether the parameter is different from a value in either direction. A one-sided test is useful when there is a theoretical basis for expecting an effect in a particular direction, such as testing whether an intervention increases performance.

60
Q

What is the formula for the total sum of squares (SSTot) in regression analysis?

A

SSTot = Σ(Yi - Ȳ)²,
where Yi is the observed value and Ȳ is the mean of the observed values.

61
Q

How is the total sum of squares (SSTot) partitioned in regression analysis?

A

SSTot = SSReg + SSRes,
where SSReg is the regression sum of squares and SSRes is the residual sum of squares. This decomposition is called partitioning the total sum of squares.
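A short numerical check of this partition (Python with NumPy; made-up illustrative data):

import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])        # illustrative data
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

b1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()
y_hat = b0 + b1 * x

ss_tot = np.sum((y - y.mean()) ** 2)
ss_reg = np.sum((y_hat - y.mean()) ** 2)
ss_res = np.sum((y - y_hat) ** 2)
print(np.isclose(ss_tot, ss_reg + ss_res))      # True: SSTot = SSReg + SSRes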

62
Q

What is the formula for the regression sum of squares (SSReg)?

A

SSReg = Σ(Ŷi - Ȳ)² = (β̂1)² Σ(Xi - X̄)²,
where Ŷi is the predicted value and Ȳ is the mean of the observed values.

63
Q

How is the residual sum of squares (SSRes) calculated?

A

SSRes = Σ(Yi - Ŷi)²,
where Yi is the observed value and Ŷi is the predicted value from the regression model.

64
Q

How is the regression sum of squares (SSReg) expressed in terms of the regression coefficient (β̂1)?

A

SSReg = (β̂1)² Σ(Xi - X̄)²,
where β̂1 is the estimated slope and Xi is the independent variable with mean X̄.

65
Q

What are the degrees of freedom for the regression, residual, and total sum of squares in a regression model?

A

Regression SS: df = 1,
Residual SS: df = n - 2,
Total SS: df = n - 1,
where n is the number of observations in the dataset.

66
Q

How would you derive the formula for β̂1 in simple linear regression using covariance and variance?

A

β̂1 = Cov(X, Y) / Var(X), where Cov(X, Y) is the sample covariance of X and Y and Var(X) is the sample variance of X. Since both use the same divisor, it cancels, giving β̂1 = Σ(Xi - X̄)(Yi - Ȳ) / Σ(Xi - X̄)², the least squares slope.
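A short numerical check of this identity (Python with NumPy; made-up illustrative data; the sample covariance and variance both use ddof=1, so the common divisor cancels):

import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])        # illustrative data
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

# Least-squares slope vs. Cov(X, Y) / Var(X)
b1_ls = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b1_cov = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
print(np.isclose(b1_ls, b1_cov))                # True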

67
Q

t-stat formula for β̂1

A

t* = (β̂1 - β1⁰) / √(Var(β̂1)) ~ t_(n-2),
where β1⁰ is the hypothesized value of β1 under the null hypothesis and Var(β̂1) is computed using the estimate σ̂² = RSS/(n - 2).
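A minimal sketch of computing this t-statistic and its two-sided p-value for H0: β1 = 0 (Python with NumPy and SciPy; made-up illustrative data):

import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])        # illustrative data
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])
n = len(x)

sxx = np.sum((x - x.mean()) ** 2)
b1 = np.sum((x - x.mean()) * (y - y.mean())) / sxx
b0 = y.mean() - b1 * x.mean()
mse = np.sum((y - (b0 + b1 * x)) ** 2) / (n - 2)

beta1_null = 0.0                                # hypothesized value under H0
t_star = (b1 - beta1_null) / np.sqrt(mse / sxx)
p_value = 2 * stats.t.sf(abs(t_star), df=n - 2) # two-sided p-value
print(t_star, p_value)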

68
Q

t-stat formula for β̂0

A

t* = (β̂0 - β0⁰) / √(Var(β̂0)) ~ t_(n-2),
where β0⁰ is the hypothesized value of β0 under the null hypothesis.

69
Q

What is the general formula for a confidence interval for a parameter θ, based on its estimator θ̂?

A

θ̂ ± q_(1-α/2) * √(Var(θ̂)), where q_(1-α/2) is the (1 - α/2) quantile of the sampling distribution of the estimator (e.g., the t_(n-2) distribution for the regression coefficients).

70
Q

General interpretation of a confidence interval:

A

If we repeatedly draw random samples from the population and compute a 100(1 − α)% confidence interval for the parameter from each sample, then approximately 100(1 − α)% of those intervals will contain the true value of the parameter.

71
Q

What three assumptions are usually made about the unobservable error terms in the classical
linear regression model?

A

E(εi) = 0
Var(εi) = σ² < ∞
Cov(εi, εj) = 0 for all i ≠ j