2A Flashcards

1
Q

What is regression analysis?

A
  1. Regression analysis is about predicting values on a y-variable that you already observed based on one or more x-variables.
  2. You try to determine what the association is between these variables.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Explanatory purpose of regression analysis

A

It can be informative about causal relationship (but correlation does not imply causation!)

‘‘could wealth be a cause of the quality of democracy?’’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Descriptive purpose of regression analysis

A

Even without a causal relation, it is interesting in its own right to know that two things often go together

for example poverty and authoritarianism often go together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The simple linear regression model

A

to make a prediction, we use a model:

Yi = b0 + b1Xi + εi

Y = dependent variable
i = value for each case i
b0 = intercept / constant (where the line intercepts the x-axis)
b1 =regression coefficient (how steep the line is, with an increase of each unit of X, how much does Y increase?)
X = independent variable
e = error term (residual, the difference between the predicted y and the observed y)

  1. the intercept is also the value for Y if the value of X is 0.
  2. without the residual, the equation wouldn’t really be true because not every case is on the line
  3. The predicted value of Ŷ increases or decreases by the value of the regression coefficient for every one unit increase of X.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

predicted values

A

values on Y for each case based on the estimated model

without residuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

observed values

A

values on Y for each case that we actually observe in the sample

with residuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

why is this called simple linear regression?

A

simple: we have just one independent variable X
linear: the effect of x on Y can be represented by a straight line

  1. steepness of the line (slope) given by value of regression coefficient
  2. we multiply the value of X of each case by b1
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

how do you test the significance of regression coefficients?

A
  1. to test the statistical significance of the regression coefficients, we use the t-test
  2. t = b1 estimated - b1 expected under H0 / SE b1
  3. so, t = b1 estimated/SEb1 because b1 expected under H0 is 0.
  4. this is because H0 means there is no effect, so the regression coefficient is 0.
  5. we can look up the value of t in t-distribution to determine p-value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

meaning of the p-value

A
  1. the probability that you would have found the estimated coefficient for b1 (or an even larger coefficient) in our sample if X and Y were completely unrelated in the population.
  2. if p-value < a = we can reject to null-hypothesis
    = the effect of X on Y is statistically significant and exists in the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

where in the output can you find the values you need

How to interpret SPSS-output for simple regression analysis

A
  1. You look at the ANOVA table
  2. b0 is under the constant B
  3. b1 is under the constant for the independent variable
  4. SEb1 is the standard error for the independent variable
  5. t is the t for the independent variable.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly