3. Testing and Evaluating Linear Models Flashcards
What are the three parts of evaluation?
Evaluating individual coefficients, evaluating overall model quality, evaluating model assumptions
What question would ask to explore significant of individual effects?
Is our model predictor informative of the relationship between x and y?
How do we evaluate individual coefficients?
Hypothesis is needed to make the data testable
What are the steps involved in hypothesis testing?
Research question
Statistical hypothesis
Calculate estimate of effect of interest
Calculate appropriate t-statistic
Evaluate t-statistic against the null
What should a good research question include?
Constructs under study
the relationship being tested
A direction of relationship
Target populations etc.
What are the different types of hypothesis?
Null = Precise and states specific value for the effect of interest
Alternative = Not specific, states something other than null is more likely to occur
H0 = B1 = 0
H1 = B1 not = 0
What would a null hypothesis suggest about the relationship between x and y?
If x and y are unrelated, change in x will not result in any change in y do b1 will be equal to 0
What is a p-value?
The P value means the probability, for a given statistical model that, when the null hypothesis is true.
E.g. P < 0.05 is the probability that the null hypothesis is true so in this case we would reject the null
What is a t-statistic?
T is simply the calculated difference represented in units of standard error.
A test statistic describes how closely the distribution of your data matches the distribution predicted under the null hypothesis of the statistical test you are using so if it is a larger the number, it is further away from what the null hypothesis would predict it to be.
Predicted value of beta/SE of predicted beta
(the smaller the SE, the more precise)
The greater the magnitude of T, the greater the evidence against the null hypothesis.
How do we actually test the statistical significance of individual coefficients?
We select a significance level, α (typically .05)
Then we calculate the p-value associated with our test statistic (here β)
If the associated p is smaller, then we reject the null.
If it is larger, then we fail to reject the null.
What does it mean if the p-value is < t-stat?
Reject the null
What does it mean if the p-value is > t-stat?
Fail to reject the null
What sampling distribution is used for the null hypothesis?
T-distribution - n-k-1 degrees of freedom
Need significance level and critical value to compare observed t-value
What is a critical value?
Establishes regions in sampling distribution of test statistic = Used to calculate upper and lower bounds of CI
What are the different factors that impact SE value?
SE is smaller when residual variance (SS Residual) is smaller
SE is smaller when sample size ( N ) is larger
SE is larger when the number of predictors (k) is larger
SE is larger when a predictor is strongly correlated with other predictors ( R2xj)