Chapter 14: The General Linear Model Flashcards
What do the terms gradient and intercept, respectively, describe?
B1 and B0
When plotting their values on a graph, what impact does each regression coefficient have on the overall line?
B1 changes the lines slope.
B0 changes the position of the line.

Briefly explain what the method of least squares does.
The method of least squares calculates the value of your parameters, b, when the squared error (residual) between your model and data are smallest.
How does cross-product deviation differ from the predictor sum of squares mathematically? What is each used to calculate?
Cross-Product Deviation: Between two variables
Used in the covariance formula (SCP/N-1)
Predictor Sum of Squares: Within a single variable.
Used in the variance formula (SSx/N-1)
Both of these formulas are used in the b1 formula.
How do you calculate the regression coefficient, b1?
1) b1 = SCP/SSx
2) Another way to think about this formula is covariance over variance. Since covariance = (SCP/N-1) and variance = (SSx/N-1), you can cancel each N-1 and be left with SCP/SSx–the original formula.
3) And finally, a less intuitive explanation, you can think of this formula as a version of the correlation coefficient (seen below).

What is the best way to conceptualize b1 .
As an unstandardized relationship between predictor and outcome. In other words, for every unit that the predictor increases the outcome will increase by b1 .
How can you find the constant, b0, mathematically?
You isolate the general linear model and find,
b0 = Y - b1 * X
How can you calculate standardized beta?
By multiplying the regression coefficient, b1 , by the standard deviation of the predictor variable over the standard deviation of the outcome variable.

What are the significance levels for R2 ?
Small = 0.02
Medium = 0.13
Large = 0.26
How does hierarchical regression differ from step-wise regression?
Hierarchical regression systematically places variables in the model based upon a mutually agreed upon confidence from the community.
Step-wise regression places variables in the model based upon the power in that given study. Because of this, step-wise regression often varies between studies.
What are some issues with step-wise regression?
1) Does not standardize order across experiments so this method makes comparison difficult.
2) Tends to over-fit (put too many variables) or under-fit (too few) the model.
In simple words, what does Cook’s distance tell you? What values indicate potential issues?
Cook’s distance is the standardized total difference in predicted value when including or excluding any given case.
Simply put, a measure of outliers.
Values greater than 1 are worth further inspection.
What assumptions are parameter values impacted by?
Parameter values function independently of assumptions.
How can you calculate R2 using only sum of squares values?
SSM / SST
With what formula can you estimate the standard square error in the model?
(Yi-Y)2 provides the squared error
Dividing by N-2 gets the mean error
Square rooting the results standardizes them

How can you transfer the standardized square error into the parameter square error?
SEb = SEmodel/sqrt(SSx)
How can you calculate the test statistic for b and then use it in your confidence interval calculation? Here you must know both the test statistic and confidence interval formula.
Test Statistic = bobserved/SEb
Confidence Interval for b = Expected value +- (Critical Value * SEb)
What does Yuen’s procedure do?
Yuen’s procedure trims the outliers from a data set.
What do credible and confidence intervals, respectively, say about the null hypothesis?
Credible intervals are a Bayesian construct and thus make no reference to the null hypothesis.
Confidence intervals were a red herring to bring you away from what was important here: the information about credible intervals.
How do t-tests differ between independent and paired tests?
Independent tests are interested in contrasting group means.
Paired tests are interested in contrasting the difference scores between groups.
What two tests can be used to correct for heterogeneity of variance? Which is stronger?
Welch’s F and Brown—Forsythe’s F.
Welch’s F possesses more statistical power
What does the SSM formula have outside of the brackets that makes it different from the SST formula? What does this variable represent?
An N value that is multiplied by the brackets.
N represents the number of people in the group.
What is a HUGE red flag in the following chart?

The original model is not significant and thus our robust tests cannot be interpreted.
What is the function for each of these post-hoc tests?
1) Gabriel’s procedure
2) Hochberg’s GT2
3) Games-Howell
1) Gabriel’s Procedure: With slightly different sample sizes use this test as it has function power.
2) Hochberg’s GT2: With very different sample sizes use this test.
3) Games-Howell: If the population variances are not equal use this. Note: we can never be certain that population variances are equal, so there is an argument to always this test.
How does sum of square placement change in a repeated-measure design?
Both SSM and SSR can be found in the within-participant section. This is because the systematic variance and unsystematic variance occurs within the same individuals.
What does sphericity refer to?
Sphericity: Refers to the equality of variances of the differences between treatment levels
What are the two questions to ask about a model?
1) Is the model influenced by a small number of cases?
Asks a question of sample size.
2) Is the model generalizable?
Asks a question of assumptions being met.
What standardized residual numbers are cause for concern?
If any standardized residuals are above 3.29
If 1% of standardized residuals are above 2.58
If 5% of standardized residuals are above 1.96
What does an adjusted R2 value refer to?
It is an estimate of what R2 would be in the population.
When applying a regression coefficient to dummy coding, what two formulas produce the same result?
SCP/SSX and X-Y (Difference between groups)
How do you go from SD to SEDhat ?
What are the two ways to calculate Cohen’s D for a paired sample t-test?
1) dhat/sqrt(1-r)
2) (Dhat/SD)*sqrt(2)
Mean difference score/Standard deviation difference score * SQRT(2)
Why is it not useful to check and see if credible intervals contain
In a factorial design, what makes up SSM . What is the df for its constituent parts?
Each factor and their interactions.
The df for each individual factor is 1.
How do you calculate r using only f-ratios and df?
sqrt(Ffactor1/Ffactor1*df)