Part 7. Intro to Linear Regression Flashcards
Regression analysis
A tool for examining whether a variable is useful for explaining another variable.
e.g. whether earnings growth or cash flow growth helps explain a company’s value in the marketplace.
Sum of squares total (SST)
- As a simple exploration of why each company’s ROA differs from the mean ROA of 12.5%, we look at the sum of squared deviations of the observations from the mean to capture the variation in return on assets (ROA): SST = Σ(Yi - Y-)^2.
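A minimal sketch of the calculation, using hypothetical ROA figures (in %) chosen so that the mean works out to 12.5%:

```python
# Hypothetical ROA observations (%) for six companies; their mean is 12.5%.
roa = [6.0, 4.0, 15.0, 20.0, 10.0, 20.0]
mean_roa = sum(roa) / len(roa)

# SST: sum of squared deviations of each observation from the mean.
sst = sum((y - mean_roa) ** 2 for y in roa)
print(mean_roa, sst)  # 12.5 239.5
```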
Simple linear regression (SLR)
Method for relating dependent and independent variables through estimation of a relationship, where we have one independent variable.
Multiple regression
Method for relating dependent and independent variables through estimation of a relationship, where we have more than one independent variable.
Ordinary least squares (OLS) regression
The goal is to fit a line to the observations on Y and X so as to minimise the sum of squared deviations from the line; this is the least squares criterion.
Line of best fit
In simple linear regression, the estimated intercept, b0^, and slope, b1^, are such that the sum of the squared vertical distances from the observations to the fitted line is minimised.
Residual for ith observation, ei
This is how much the observed value of Yi differs from the Yi^ estimated using the regression line: ei = Yi - Yi^.
(The error term, by contrast, refers to the true underlying population relationship, whereas the residual refers to the fitted linear relation based on the sample.)
Residuals
Represented by the vertical distances from the fitted line, and therefore measured in the units of the dependent variable.
e.g. if the dependent variable is in euros, the residuals are in euros.
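A short illustration with made-up data: fit the least squares line, then recover the residuals ei = Yi - Yi^. With an intercept in the model, OLS residuals sum to zero.

```python
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
n = len(x)
mx, my = sum(x) / n, sum(y) / n

# OLS estimates (least squares criterion).
b1 = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sum((xi - mx) ** 2 for xi in x)
b0 = my - b1 * mx

# Residuals: observed minus fitted, in the units of the dependent variable.
residuals = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]
print(round(b0, 4), round(b1, 4))        # 2.2 0.6
print(round(abs(sum(residuals)), 10))    # 0.0 (residuals sum to zero)
```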
Sample correlation, r
The ratio of the covariance to the product of the standard deviations.
Slope
The change in dependent variable for one unit change in independent variable.
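The two definitions above can be checked numerically on illustrative data (sample covariance and standard deviations computed with n - 1 in the denominator):

```python
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
n = len(x)
mx, my = sum(x) / n, sum(y) / n

# Sample covariance and standard deviations (n - 1 in the denominator).
cov_xy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / (n - 1)
sx = (sum((xi - mx) ** 2 for xi in x) / (n - 1)) ** 0.5
sy = (sum((yi - my) ** 2 for yi in y) / (n - 1)) ** 0.5

r = cov_xy / (sx * sy)    # sample correlation: cov over product of std devs
b1 = cov_xy / sx ** 2     # OLS slope: Y changes by b1 per one-unit change in X
print(round(r, 4), round(b1, 4))  # 0.7746 0.6
```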
Cross sectional regression
This involves many observations of X and Y for the same time period; depending on the regression model, these observations could come from different companies, asset classes, investment funds, etc.
Time series
Uses many observations from different time periods for the same company, asset class, investment fund, country, or other entity, depending on the regression model.
e.g. monthly data from many years to test whether a country’s inflation rate determines short-term interest rates.
Assumptions of simple linear regression model:
- Linearity - the relationship between the dependent variable Y, and independent variable X is linear.
- Homoskedasticity - the variance of the regression residuals is the same for all observations.
- Independence - the observations (pairs of Ys and Xs) are independent of one another; this implies the regression residuals are uncorrelated across observations.
- Normality - the regression residuals are normally distributed.
Homoskedasticity
The variance of the residuals is the same for all observations.
Heteroskedasticity
Residuals are heteroskedastic if the variance of the residuals differs across observations, i.e. if they are not homoskedastic.
Sum of squares regression (SSR):
The sum of the squared differences between the predicted value of the dependent variable, Yi^, based on the estimated regression line, and the mean of the dependent variable, Y-: SSR = Σ(Yi^ - Y-)^2.
Coefficient of determination (R^2):
The percentage of the variation of the dependent variable that is explained by the independent variable.
- a measure used to evaluate the goodness of fit: R^2 = SSR/SST.
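A quick numeric check of the decomposition on illustrative data: SST = SSR + SSE, and R^2 = SSR/SST (in simple linear regression this also equals the squared sample correlation).

```python
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
n = len(x)
mx, my = sum(x) / n, sum(y) / n
b1 = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sum((xi - mx) ** 2 for xi in x)
b0 = my - b1 * mx
fitted = [b0 + b1 * xi for xi in x]

sst = sum((yi - my) ** 2 for yi in y)                    # total variation
ssr = sum((fi - my) ** 2 for fi in fitted)               # explained variation
sse = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))   # unexplained variation

r2 = ssr / sst
print(round(sst, 4), round(ssr, 4), round(sse, 4), round(r2, 4))  # 6.0 3.6 2.4 0.6
```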
Standard error of estimate (se):
An absolute measure of the distance between the observed values of the dependent variable and those predicted from the estimated regression.
The smaller the se, the better the fit of the model.
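In simple linear regression, se = √(SSE/(n - 2)); a small sketch with made-up data:

```python
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
n = len(x)
mx, my = sum(x) / n, sum(y) / n
b1 = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sum((xi - mx) ** 2 for xi in x)
b0 = my - b1 * mx

# Sum of squared errors (residuals), then se with n - 2 degrees of freedom.
sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
se = (sse / (n - 2)) ** 0.5
print(round(se, 4))  # 0.8944
```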
Measures of goodness of fit of the estimated regression:
- Standard error of estimate (se)
- F-statistic
Standard error of slope coefficient (sb1)
In simple linear regression, this is the ratio of the model’s standard error of the estimate (se) to the square root of the variation of the independent variable: sb1 = se / √(Σ(Xi - X-)^2).
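A sketch of that formula on illustrative data; the slope’s t-statistic for testing H0: b1 = 0 is b1^/sb1:

```python
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
n = len(x)
mx, my = sum(x) / n, sum(y) / n
sxx = sum((xi - mx) ** 2 for xi in x)    # variation of the independent variable
b1 = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
b0 = my - b1 * mx
sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
se = (sse / (n - 2)) ** 0.5              # standard error of the estimate

sb1 = se / sxx ** 0.5                    # standard error of the slope coefficient
t_stat = b1 / sb1                        # t-statistic for H0: slope = 0
print(round(sb1, 4), round(t_stat, 4))   # 0.2828 2.1213
```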
Indicator/dummy variable
An independent variable that takes only the values 0 or 1.
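With a 0/1 dummy as the independent variable, the intercept estimates the mean of the 0 group and the slope estimates the difference between the two group means; a check on made-up data:

```python
# Dummy independent variable: 0 for one group, 1 for the other.
x = [0.0, 0.0, 1.0, 1.0, 1.0]
y = [2.0, 4.0, 5.0, 7.0, 9.0]
n = len(x)
mx, my = sum(x) / n, sum(y) / n
b1 = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sum((xi - mx) ** 2 for xi in x)
b0 = my - b1 * mx

mean0 = sum(yi for xi, yi in zip(x, y) if xi == 0.0) / x.count(0.0)
mean1 = sum(yi for xi, yi in zip(x, y) if xi == 1.0) / x.count(1.0)
print(round(b0, 4), round(b1, 4))  # 3.0 4.0 -- b0 = mean0, b1 = mean1 - mean0
```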
Level of significance
This is always a matter of judgement.
- At the 5% level of significance, there is a 5% chance of rejecting a true H0 (Type I error).
- Decreasing the level of significance from 0.05 to 0.01 decreases the probability of a Type I error, but also increases the probability of a Type II error - failing to reject H0 when it is false.
p-value
The smallest level of significance at which H0 can be rejected.
- The smaller the p-value, the smaller the chance of a Type I error (rejecting a true H0), and so the greater the likelihood that the regression model is valid.
The standard error of forecast depends on:
- the standard error of the estimate, se.
- the number of observations, n.
- the forecasted value of the independent variable, Xf, used to predict the dependent variable, and its deviation from the estimated mean, X-.
- the variation of independent variable.
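Those ingredients combine as sf = se·√(1 + 1/n + (Xf - X-)^2/Σ(Xi - X-)^2); a sketch with illustrative data and a hypothetical forecast point Xf = 6:

```python
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
n = len(x)
mx, my = sum(x) / n, sum(y) / n
sxx = sum((xi - mx) ** 2 for xi in x)    # variation of the independent variable
b1 = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
b0 = my - b1 * mx
sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
se = (sse / (n - 2)) ** 0.5              # standard error of the estimate

xf = 6.0                                 # forecasted value of X (outside the sample)
sf = se * (1 + 1 / n + (xf - mx) ** 2 / sxx) ** 0.5
y_hat = b0 + b1 * xf                     # point forecast of the dependent variable
print(round(y_hat, 4), round(sf, 4))     # 5.8 1.2961
```

Note how sf grows with the distance of Xf from the sample mean of X: forecasts far from the centre of the data are less precise.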