Revision (2) Flashcards
What are the two ways in which linear regression and correlation are conceptually alike?
- Both of these concepts aim to identify linear relationships between two variables.
- Both of these concepts identify a line of best fit by using the method of least squares.
To what does the method of least squares refer?
To the minimisation of the squared distances of datapoints to a line of best fit (used in tests of both linear regression and correlation).
How are linear regression and correlation conceptually unalike?
Correlation does not treat the the variables it is investigating as separate entities, whereas linear regression does (it asks about the effect of one variable on another).
Between what does linear regression distinguish (unlike correlation)?
An independent variable (the variable that may have an influence on another) and a dependent variable (the variable being influenced by another).
While both linear regression and correlation analyse the covariance between two variables, how do they differ?
In how they standardise this covariance.
How is covariance standardised for correlation?
By multiplying the standard deviation of the independent variable involved by the standard deviation of the dependent variable involved.
How is covariance standardised for linear regression?
By squaring the standard deviation of the independent variable involved.
What is the equation for the regression line?
Y = mX + c
What are the ingredients necessary for the regression line equation?
- The gradient of the regression line (m)
- The intercept (c)
To what does the intercept of the regression line refer?
To the value of Y when X = 0.
What does it mean if the gradient of a regression line is significant?
That as the score on the independent variable involved changes, the score on the dependent variable involved also changes in a systematic way (the line representing their relationship is not flat).
What does R2 measure?
The proportion of the variance observed in the dependent variable that can be explained by the independent variable.
What is the formula for R2?
R2 = SSM/ SST
Why is the R2 value multiplied by 100?
To generate a percentgae.