lecture 5 - regression with categorical independent variables Flashcards

1
Q

what is a dummy variable?

A

a variable used to represent a categorical variable in a regression model, but represents the categories as binary scores ( 1 or 0)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is the reference or base category?

A

the category left at the end after all the binary variables have all been created - this is a group without a dummy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what does the regression equation look like with multiple Xs and bs?

A

Y = b1 X1 + b2X2 + u

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

phrases that refer to the usage of multiple bs and Xs to look at the change they have had to the previous mean

A

“controlling for”

“being held at their mean”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

why are categorical variables problematic to use in regression models? and what it the solution?

A

they have no important order, therefore dummy variables are created

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is an underlying assumption when using dummy variables?

A

the relationship between other Xs and the dependent variable stay the same throughout - the slopes are parallel to one another

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

dummy variables allow what to vary between groups?

A

the intercept

How well did you know this?
1
Not at all
2
3
4
5
Perfectly