Module 5 Flashcards
Most often a research question will lead to a model such that…
a dependent variable is a function of one or more independent variables or predictors
General linear model
represents a dependent variable as a function of population means
Intercept only model
A participants score equals the population mean plus an error term
What is the linear model when the independent variable is binary?
yi = B0 + B1xi + ei
What does this model say?
yi = B0 + B1xi + ei
The value of y for participant i…
B0 = intercept parameter
B1xi = slope parameter multiplied by the value of x for participant i
ei = error term for participant i
If xi = 0 (ie. independent label is 0) what does B0 = ?
yi = B0 + B1(0) + ei
B0 = intercept parameter equals the population mean for participants who do not have a diagnosis
If xi = 1 (ie. independent label is 1) what does B1 = ?
yi = B0 + B1(1) + ei
B1 = the difference between the population mean for participants who do not have a diagnosis and the population mean for participants who do have a diagnosis
Explain each unit in the model that includes the population means of the two groups:
yij = uj + eij
yij = the value of the dependent variable individual i in group j
uj = population mean of the dependent variable for group j
eij = same as above but now includes the j subscript to index group membership
b1 =
u2 - u1
difference between population mean for participants without diagnosis and with diagnosis
because
u1 = B0
u2 = B0 + B1
u2 - u1 = B1
How to calculate predicted mean of each group from parameter estimates??
B0hat= intercept value
B1hat= Intercept value plus slope coefficient
what does the hat indcate?
estimated or predicted
Because Bhat1 is the estimated difference between the population means of the two groups it represents what?
A point estimate of the effect of the independent variable on the dependent variable.
If we found that the 95% confidence interval is 72 to 1258 around this effect estimate that we got from subtracting the slope value from the intercept. what is the conclusion?
This interval captures the parmeter B1 with 95% confidence
Because B1 = u1-u2, the interval captures the population mean difference with 95% confidence
Formula for calculating this CI
Bhat1 +/- tcrit Sbhat1
tcrit = critical value from a t distribution that sets off alpha/2 in each tail
Sbhat1 = estimated standard error of the slope parameter
What are the degrees of freedom in the situation of a binary independent variable?
N-2 because there are two coefficients - intercept and slope in the estimated model
What is standard error?
Standard deviation of the sampling distribution
Pooled Variance/ homogeneity of variance
assumes that the population variance of the dependent variable is equal across the two groups
Null hypothesis writing and formulas
H0 : u1 = u2
The population mean of group 1 equals the population mean of group 2
B0 = B0+B1
Only holds if B1=0 therefore
H0: B1 = 0
OR H0: u2 - u1 = 0
What is the purpose of a statistical model?
describe or explain individual differences or variation in a dependent variable
If a model does a good job of accounting for individual differences, then the variance of the errors should be….
relatively small to the overall variance of the variable
variance of the model should be lower than the overall dependent variable
indicating that the full model has accounted for or explained a portion of the dependent variable variance