Session 6: Correlation and regression Flashcards
Product moment correlation r
a measure of strength of a linear relationship between quantitative variables
- also known as Bravais-Pearson correlation
- calculated more frequently than other coefficients
Positive correlation and negative correlation
positive: higher scores on one variable tend to be associated with higher scores on the other
negative: higher scores on one variable tend to be associated with lower scores on the other
Interpretation of r scores
r = .10 (small relationship), r = .25 (medium relationship), r = .50 (large relationship)
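As a quick illustration, r can be computed from deviations about the means. The paired scores below are hypothetical (e.g., hours studied vs. exam score), not from the course material:

```python
import math

# Hypothetical paired scores (made up for illustration)
x = [2, 4, 6, 8, 10]
y = [50, 55, 65, 70, 85]

n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n

# Pearson's r: sum of cross-products of deviations,
# divided by the product of the deviation norms
cov = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
sx = math.sqrt(sum((xi - mean_x) ** 2 for xi in x))
sy = math.sqrt(sum((yi - mean_y) ** 2 for yi in y))
r = cov / (sx * sy)
print(round(r, 3))  # close to +1: strong positive linear relationship
```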
Linear regression
Linear regression is a statistical method for modeling linear relationships between a dependent variable and one or more independent variables
Regression equation
- linear regression is about estimating a linear regression equation
- linear regression equations have the same linear structure: Y= b0 + b1X
Errors (in a regression equation)
the differences between the observed values and the values predicted by our regression equation (these differences should be as small as possible)
How are the regression coefficients b0 and b1 determined?
The method of least squares finds b0 and b1 by minimizing the total squared error between the actual Y values and the predicted Y values
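For simple linear regression, the least-squares solution has a closed form: b1 is the sum of cross-products of deviations divided by the sum of squared X deviations, and b0 follows from the means. A sketch with hypothetical data:

```python
# Hypothetical paired scores (made up for illustration)
x = [2, 4, 6, 8, 10]
y = [50, 55, 65, 70, 85]

n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n

# Closed-form least-squares estimates for Y = b0 + b1 * X
b1 = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) \
     / sum((xi - mean_x) ** 2 for xi in x)
b0 = mean_y - b1 * mean_x
print(b0, b1)  # prints 39.5 4.25
```

No other choice of b0 and b1 gives a smaller sum of squared residuals on this data.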
Testing the regression equations (Why?)
after the regression equation has been estimated, we should check how well it fits the observed data as a model of reality
Testing can be split into two parts (of the regression equation)
- Testing the regression equation - whether and how well the DV is explained by the regression equation as a whole (goodness of fit - F-test)
- Testing the regression coefficients - whether and how well each IV of the regression equation contributes to explaining the DV (t-test)
Decomposition of variance
- we know that the optimal regression equation is found by minimizing the sum of squared residuals (SSR)
- we could use the sum of squared residuals as a measure of goodness of fit of the regression equation to the observed data (the smaller SSR, the better the fit)
Total sum of squares (SST) =
explained sum of squares (SSE) + residual sum of squares (SSR)
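The decomposition can be verified numerically. Using hypothetical data and an assumed fitted line y_hat = 39.5 + 4.25·x (both made up for illustration), the three sums of squares add up exactly:

```python
# Hypothetical data and an assumed fitted regression line
x = [2, 4, 6, 8, 10]
y = [50, 55, 65, 70, 85]
mean_y = sum(y) / len(y)
pred = [39.5 + 4.25 * xi for xi in x]  # predicted values y_hat

sst = sum((yi - mean_y) ** 2 for yi in y)             # total sum of squares
sse = sum((pi - mean_y) ** 2 for pi in pred)          # explained sum of squares
ssr = sum((yi - pi) ** 2 for yi, pi in zip(y, pred))  # residual sum of squares
print(sst, sse, ssr)  # SST = SSE + SSR
```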
Coefficient of determination
explained sum of squares/ total sum of squares
- measure of goodness of fit of the regression equation to the observed data
- the higher the coefficient, the better the fit of the regression equation to the observed data
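Continuing the hypothetical example above (made-up data, assumed fitted line y_hat = 39.5 + 4.25·x), the coefficient of determination is explained over total sum of squares; in simple regression it equals r squared:

```python
# Hypothetical data and an assumed fitted regression line
x = [2, 4, 6, 8, 10]
y = [50, 55, 65, 70, 85]
mean_y = sum(y) / len(y)
pred = [39.5 + 4.25 * xi for xi in x]

sst = sum((yi - mean_y) ** 2 for yi in y)     # total sum of squares
sse = sum((pi - mean_y) ** 2 for pi in pred)  # explained sum of squares

# Coefficient of determination: share of Y's variation explained by the model
r_squared = sse / sst
print(round(r_squared, 3))  # prints 0.963
```

A value near 1 means the regression line accounts for almost all variation in Y; unlike the raw SSR, this ratio does not depend on the scale or number of the Y values.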
What is the problem with SSR?
SSR varies not only with the goodness of fit, but also with the number and size of the Y values
(smaller SSR - better fit!)