Chapter 4 - Correlation and Regression Flashcards
what is the purpose of correlation and regression
to understand the relationship between a dependent variable and one or more independent variables
what do correlation and regression help with [3 points]
- identifying trends and patterns in data
- making predictions about future values based on past data
- quantifying the strengths and directions of relationships between variables
what does the dependent variable [the response variable] represent
the output or outcome whose variation is being studied
what does the independent variable [the explanatory variable] represent
the inputs or causes, i.e. the potential reasons for the variation
what is the purpose of a scatter diagram
to visually represent the relationship between two variables
what do scatter diagrams do
they help identify patterns, trends, and the potential strength of correlation between the two variables
What is the X-axis also known as
the horizontal axis, the independent variable or explanatory variable
what is the Y-axis also known as
the vertical axis, the dependent variable or response variable
what does the dependent variable represent
the output or outcome whose variation is being studied
what does the independent variable represent
the inputs or causes
what is the regression line also known as
best fit line
what is the sum of squared errors
the total of all squared differences between actual and predicted results
what does the best fit line need to do
the best fit line is the one that makes the SSE as small as possible, ensuring a good representation of the data
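A minimal sketch of the two cards above, assuming the usual least-squares formulas for a line y = a + b*x. The data values and the function names `fit_line` and `sse` are made up for illustration. It fits the line, computes the SSE, and shows that moving the line away from the best fit makes the SSE larger.

```python
def fit_line(xs, ys):
    """Return (a, b) for the least-squares line y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx              # gradient
    a = mean_y - b * mean_x    # intercept
    return a, b


def sse(xs, ys, a, b):
    """Sum of squared differences between actual and predicted y values."""
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))


# Illustrative data only (not from the chapter).
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

a, b = fit_line(xs, ys)
best = sse(xs, ys, a, b)
shifted = sse(xs, ys, a + 0.5, b)   # same gradient, line moved up by 0.5

print(round(a, 3), round(b, 3))     # 2.2 0.6
print(round(best, 3))               # 2.4
print(shifted > best)               # True
```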
how does the sign of the gradient affect the direction of the line
- when the gradient is positive, the line is sloped upwards
- when the gradient is negative, the line is sloped downwards
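when asked to predict a future value, which variable is the one being predicted
the y variable (the dependent variable), found by substituting the known x value into the regression equation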
when asked to find the regression of A data on B data, which is the x and which is the y variable
the A data must be the y variable and the B data must be the x variable
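A short sketch of why the order matters, assuming the standard least-squares formulas. The lists `a_data` and `b_data` and the helper `fit_line` are hypothetical. The regression of A on B puts A on the y-axis. Swapping the roles gives a different line.

```python
def fit_line(xs, ys):
    """Return (intercept, gradient) of the least-squares line of ys on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    gradient = sxy / sxx
    return mean_y - gradient * mean_x, gradient


# Illustrative data only.
a_data = [2, 4, 5, 4, 5]
b_data = [1, 2, 3, 4, 5]

# Regression of A on B: A is y (dependent), B is x (independent).
a_on_b = fit_line(b_data, a_data)

# Regression of B on A: the roles are swapped, so the line changes.
b_on_a = fit_line(a_data, b_data)

print([round(v, 3) for v in a_on_b])   # [2.2, 0.6]
print([round(v, 3) for v in b_on_a])   # [-1.0, 1.0]
```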
how to know if a line is accurate [3 points]
- does the scatter diagram indicate that a straight line is appropriate
- are the plotted points close to the line
- is the forecast for a value of x within or outside the range of x values given in the data
what is interpolation
estimating y for an x value within the range of the given data; the accuracy of the estimate is likely to be better
what is extrapolation
estimating y for an x value outside the range of the given data; the accuracy of the estimate is likely to be low
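The distinction above can be sketched as a simple range check. The function name `classify_forecast` and the data are assumptions for illustration. An x value equal to the smallest or largest observed x is treated here as inside the range.

```python
def classify_forecast(x_new, xs):
    """Label a forecast by whether x_new lies inside the observed x range."""
    if min(xs) <= x_new <= max(xs):
        return "interpolation"   # inside the data range: estimate likely more reliable
    return "extrapolation"       # outside the data range: estimate likely less reliable


# Illustrative observed x values only.
xs = [1, 2, 3, 4, 5]

print(classify_forecast(3.5, xs))   # interpolation
print(classify_forecast(8, xs))     # extrapolation
```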
what does the product moment correlation coefficient measure
the strength and direction of a linear relationship between two variables and indicates how well the regression line fits the data
what does the correlation coefficient measure
how good a fit the regression line is to the data
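A minimal sketch of calculating the product moment correlation coefficient, assuming the standard formula r = Sxy / sqrt(Sxx * Syy). The function name `pearson_r` and the data are made up for illustration. Points lying exactly on a straight line give r = 1 or r = -1.

```python
import math


def pearson_r(xs, ys):
    """Product moment correlation coefficient r = Sxy / sqrt(Sxx * Syy)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Illustrative data only.
r = pearson_r([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
r_up = pearson_r([1, 2, 3], [2, 4, 6])     # points exactly on a rising line
r_down = pearson_r([1, 2, 3], [6, 4, 2])   # points exactly on a falling line

print(round(r, 3))    # 0.775
print(r_up, r_down)   # 1.0 -1.0
```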
for product moment correlation coefficient, what does it mean if r is numerically greater than 1
you must have made an arithmetic/numerical error, because r always lies between -1 and 1
what does it mean in product moment correlation coefficient when r > 0 and r is nearer to 1
there is a strong positive correlation between x and y
what does it mean in product moment correlation coefficient when r > 0 and is near to 0
there is a weak positive correlation between x and y