Wk 7 - Logistic Regression Flashcards
What is the formula for linear regression? (x1, plus define components)
y’ = bx + c
predicted y = slope times x + constant (y intercept)
What does b signify in regression formulas? (x3)
Slope
Coefficient
Amount of change in y for every unit change in x
How is the fit of a regression line maximised/evaluated? (x2)
Least squares criterion:
Want minimal residuals (diff between scores and line)
What 2 questions can we ask of any given regression model?
Q1: Does the predictor variable do anything useful?
Q2: Does the model provide a good fit to the data?
What can we conclude if b = 0 in a regression model? (x1)
Changes in x produce no change/effect in y
How do we assess model fit in linear regression? (x2)
Calculate r-square (proportion of variance accounted for)
And test for significance
How are b and r-square related in linear regression? (x3)
Generally linked to some degree,
But if you move all scores similar distances from line,
b stays the same while r-square goes to hell
What is the major limitation of linear regression method? (x2)
Can’t deal with categorical data
‘All or nothing’ scores, rather than continuous predictions/outcomes available
How does what we are trying to predict change when using categorical rather than continuous DV/y variable? (x2)
Want to assess the change in PROBABILITY of y given b change in x
Rather than change in y scores
What statistical method enables regression with 2 categorical outcomes? (x1)
Binary logistic regression
In linear regression: Predictors are continuous or categorical Outcome is continuous Predictors assumed normally distributed Deals with linear relationships among variables
Whereas in logistic regression? (x4?
Predictors are continuous or categorical
Outcome is categorical
Predictors not assumed normally distributed
Deals with non-linear relationships among variables
What are the 2 applications/questions of logistic regression? (x1, x2)
Predict category people belong to, given predictors
Identify predictors of particular (categorical) outcome variable
*Outcomes are exhaustive and mutually exclusive
How does the linear regression model change for logistic regression? (x3)
y’ becomes a logistic function (s-shaped curve) =
1 divided by
e raised to the power of the linear equation (v)
In logistic regression, if we substitute our largest x value for v… (x1)
And if v is very small… (x1)
y gets close to zero
y gets large
What is the statistical question asked by logistic regression? (x2)
How many units change in x does it take
To shift the odds from favouring particular category of y?
What are odds? (x1)
Expression of relative probability of an event happening vs. not happening
How are odds calculated? (x1)
What happens to probability if you double the odds of an event occurring? (x2)
odds = p(event)
Divided by 1 - p(event)
Increases, but with diminishing returns