6A Flashcards

1
Q

What are the three main topics of this lecture?

A
  1. Regression with binary categorical independent variables. 2. Regression with non-binary categorical independent variables. 3. How to handle ordinal variables in regression.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two key characteristics used to classify variables?

A
  1. Logical order – Do the values follow a natural ranking?
  2. Equal distances – Are the gaps between values consistent?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are examples of variables with a logical order?

A

Monthly income, education level, left-right identification, immigration attitudes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are examples of variables without a logical order?

A

Vote choice (e.g., VVD, CDA, PVV), religion (e.g., Catholic, Protestant, Muslim).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are examples of variables with equal distances?

A

Monthly income, left-right identification, immigration attitudes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are examples of variables without equal distances?

A

Educational level (e.g., primary, secondary, higher education).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the three types of variables?

A
  1. Categorical – No logical order (e.g., vote choice). 2. Ordinal – Logical order, but unequal distances (e.g., education level). 3. Continuous – Logical order with equal distances (e.g., income).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are binary categorical variables?

A

Variables with only two categories, coded as 0 and 1 (e.g., voted = 1, did not vote = 0).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are non-binary categorical variables?

A

Variables with more than two categories (e.g., religion: Catholic, Protestant, Muslim).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are dummy variables (dichotomous variables)?

A

Binary categorical variables coded as 0 and 1 for use in regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why should the dependent variable (Y) always be continuous in linear regression?

A

Linear regression assumes that Y follows a continuous distribution, allowing for meaningful interpretation of coefficients.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How can binary categorical variables be included in regression?

A

They are directly included as independent variables, coded as 0 and 1, and interpreted the same way as continuous variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does the intercept represent in a regression with a binary independent variable?

A

The predicted value of Y when the binary variable equals 0 (i.e., the reference group).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the coefficient (b₁) of a binary independent variable represent?

A

The difference in the dependent variable (Y) between the 1 and 0 groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How is regression with a binary independent variable related to a t-test?

A

A simple regression with a binary independent variable produces the same results as an independent-samples t-test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why can non-binary categorical variables not be used directly in regression?

A

Regression requires numerical input, so categorical variables must be converted into multiple binary variables (dummy variables).

17
Q

What is a reference category in dummy coding?

A

The category that is not assigned a dummy variable. All other categories are compared against it.

18
Q

What happens if a respondent has 0s on all dummy variables?

A

They belong to the reference category.

19
Q

How are dummy variables used in regression?

A

Each category (except the reference) gets a binary variable (1 = in category, 0 = not in category).

20
Q

What is an example of a dummy variable transformation?

A

If religion has four categories (Catholic, Protestant, Muslim, Non-religious), three dummy variables are created: Catholic (1/0), Protestant (1/0), Muslim (1/0), with Non-religious as the reference category.

21
Q

How is the regression coefficient of a dummy variable interpreted?

A

The coefficient represents the difference in the dependent variable (Y) between that category and the reference category.

22
Q

What is the advantage of using SPSS’s automatic dummy variable coding?

A
  1. It is faster and easier than manual coding. 2. It provides a p-value for the entire categorical variable. 3. It provides means for each category, aiding interpretation.
23
Q

What does the F-test in regression measure for categorical variables?

A

It tests whether the categorical variable as a whole (all dummy variables together) has a significant effect on Y.

24
Q

What does R² measure in regression?

A

The proportion of variance in the dependent variable (Y) explained by the independent variables.

25
Q

How can the effect size of a categorical variable be evaluated?

A
  1. Unstandardized effect size – Compare category means from SPSS output. 2. Standardized effect size – Calculate the Adjusted R² change when adding the categorical variable to the model.
26
Q

How is Adjusted R² Change calculated for categorical variables?

A

Model with categorical variable – Model without categorical variable = Adjusted R² Change (indicates how much variance is explained by the categorical variable).

27
Q

What are ordinal variables?

A

Variables with a logical order but unequal distances between values (e.g., education level: primary, secondary, higher education).

28
Q

How can ordinal independent variables be modeled in regression?

A

They can be treated as continuous or categorical.

29
Q

What is the advantage of modeling an ordinal variable as continuous?

A

It simplifies the analysis and avoids unnecessary complexity if the variable behaves approximately linearly.

30
Q

How can we decide whether to model an ordinal variable as categorical or continuous?

A

Compare the Adjusted R² values for both specifications. If the categorical model does not improve Adjusted R² substantially, use the continuous specification.

31
Q

What is an example of modeling an ordinal variable as categorical vs. continuous?

A

In a study on education and immigration attitudes, treating education as categorical (multiple dummy variables) only slightly improved Adjusted R² compared to a continuous model, so the simpler continuous model was preferable.