3. Testing and Evaluating Linear Models Flashcards by Emily Etty

What are the three parts of evaluation?

Evaluating individual coefficients, evaluating overall model quality, evaluating model assumptions

How well did you know this?

Not at all

Perfectly

What question would ask to explore significant of individual effects?

Is our model predictor informative of the relationship between x and y?

How well did you know this?

Not at all

Perfectly

How do we evaluate individual coefficients?

Hypothesis is needed to make the data testable

How well did you know this?

Not at all

Perfectly

What are the steps involved in hypothesis testing?

Research question

Statistical hypothesis

Calculate estimate of effect of interest

Calculate appropriate t-statistic

Evaluate t-statistic against the null

How well did you know this?

Not at all

Perfectly

What should a good research question include?

Constructs under study
the relationship being tested
A direction of relationship
Target populations etc.

How well did you know this?

Not at all

Perfectly

What are the different types of hypothesis?

Null = Precise and states specific value for the effect of interest

Alternative = Not specific, states something other than null is more likely to occur

H0 = B1 = 0
H1 = B1 not = 0

How well did you know this?

Not at all

Perfectly

What would a null hypothesis suggest about the relationship between x and y?

If x and y are unrelated, change in x will not result in any change in y do b1 will be equal to 0

How well did you know this?

Not at all

Perfectly

What is a p-value?

The P value means the probability, for a given statistical model that, when the null hypothesis is true.

E.g. P < 0.05 is the probability that the null hypothesis is true so in this case we would reject the null

How well did you know this?

Not at all

Perfectly

What is a t-statistic?

T is simply the calculated difference represented in units of standard error.

A test statistic describes how closely the distribution of your data matches the distribution predicted under the null hypothesis of the statistical test you are using so if it is a larger the number, it is further away from what the null hypothesis would predict it to be.

Predicted value of beta/SE of predicted beta

(the smaller the SE, the more precise)

The greater the magnitude of T, the greater the evidence against the null hypothesis.

How well did you know this?

Not at all

Perfectly

How do we actually test the statistical significance of individual coefficients?

We select a significance level, α (typically .05)

Then we calculate the p-value associated with our test statistic (here β)

If the associated p is smaller, then we reject the null.

If it is larger, then we fail to reject the null.

How well did you know this?

Not at all

Perfectly

What does it mean if the p-value is < t-stat?

Reject the null

How well did you know this?

Not at all

Perfectly

What does it mean if the p-value is > t-stat?

Fail to reject the null

How well did you know this?

Not at all

Perfectly

What sampling distribution is used for the null hypothesis?

T-distribution - n-k-1 degrees of freedom

Need significance level and critical value to compare observed t-value

How well did you know this?

Not at all

Perfectly

What is a critical value?

Establishes regions in sampling distribution of test statistic = Used to calculate upper and lower bounds of CI

How well did you know this?

Not at all

Perfectly

What are the different factors that impact SE value?

SE is smaller when residual variance (SS Residual) is smaller
SE is smaller when sample size ( N ) is larger
SE is larger when the number of predictors (k) is larger
SE is larger when a predictor is strongly correlated with other predictors ( R2xj)

How well did you know this?

Not at all

Perfectly

What is a t-distribution?

Standardized differences to sample means to population mean when population SD isn’t known

Normally distributed population

What is the confidence level for null?

1 - alpha

What does it mean if the confidence interval doesn’t include 0?

If it doesn’t include 0, then it is significant

How can we compare the critical value and t-statistic to tell us if we can reject the null or not?

If the value of the test statistic is less extreme than the critical value, then the null hypothesis cannot be rejected.

Absolute value of t-statistic > critical value = Reject the null

When are we more likely to find a significant effect?

When we have picked good variables (smaller residual SS) and we have a large sample

How do we evaluate overall model performance?

The aim of our linear model is to build a model which describes y as a function of x.

That is we are trying to explain variation in y using x so we evaluate model evaluation via assessing variation.

What does variation in y stand for?

Total variation of interest

What is variation made up of?

Model and Residual Variance

How do we measure total variation in the outcome?

Sum of Squares = SS model + SS residual

What does R2 mean (coefficient of determination)?

Quantifies the amount of variability in the outcome accounted for by the predictors. More variance accounted for, the better. Represents the extent to which the prediction of y is improved when predictions are based on the linear relation between x and y. R2 = SSmodel/SStotal or 1 - (SSresidual/SStotal)

What is adjusted R2?

It is the R2 value adjusted for when there are two or more predictors Random sampling can impact it Adjusted for sample size and number of predictors Increased IVs = ^ Value

Why is it important to compute adjusted R2 in a model with multiple predictors?

It accounts for random fluctuation that comes with increases in sample size & number of predictors

What does comparing R2 and adjusted R2 tell us?

The most vital difference between adjusted R-squared and R-squared is simply that adjusted R-squared considers and tests different independent variables against the model and R-squared does not. So if big difference between them - i.e adjusted R2 is a lot smaller then the additional input variables are not adding value to the model.

If there is a smaller sample, what does this mean for fluctuations in adjusted R2?

In smaller samples , the fluctuations from zero will be larger on average.

If we have a highly correlated predictor, how does that impact the SE of coefficients?

Increases SE as we're less certain that our variables are driving the effect