Session 6: Correlation and regression Flashcards

1
Q

Product moment correlation r

A

a measure of strength of a linear relationship between quantitative variables

  • also known as Bravais-Pearson correlation
  • calculated more frequently than other coefficients
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Positive correlation and negative correlation

A

positve: higher scores on one varibale tend to be associated with higher scores on the other
negative correlation: higehr scores one one variable tend to be associated with lower scores on the other one

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Interpretation of r scores

A
r= .10 (small relationship)
r= .25 (medium relationship)
r= .50 (large relatipnship)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Linear regression

A

Linear regression is a statisctical method that allows modeling linear relationships between a dependent variable and one or more independent variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Regression equation

A
  • linear regression is about estimating a linear regression equation
  • linear regression equations have the same linear structure: Y= b0 + b1X
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Errors (in a regression equation)

A

difference between observed values and the predicted values by our predicted equation (these differences should be as small as possible)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How are the regression coefficients b0 and b1 determined?

A

The method of least squares finds b0 and b1 by minimizing the totoal squared error between the actual Y values and the predicted Y values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Testing the regression equations (Why?)

A

after the regression equation has been estimated, it should be checked how well it fits as a model of reality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Testing can be split into two parts (of the regression equation)

A
  1. Testing the regression equation - wether and how well the DV is explained by the regression equation (goodness of fit - F-tets)
  2. Testing the regression coefficients - wether and how well each IV of the regression equation contributes to explaining the DV (t-test)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Decomposition of variance

A
  • we know that the optimal regression equatios is found by minimizing the sum of squared reduals (SSR)
  • we could use the sum of squared residuals as a measure of goodness of fit of the regression equation to the observed data (the smaller SSR, the better the fit)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Total sum of squares (STT)=

A

explained sum of squares (SSE) + residual sum of squares (SSR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Coefficient of determination

A

explained sum of squares/ total sum of squares

  • measure of goodness of fit of the regresssions equation to the observed data
  • the higher the coefficient, the better the it of the regression equation to the observed data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the problem with SSR?

A

SSR varies not only with the goodness of fit, but also with the number and size of the Y values
(smaller SSR - better fit!)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly