chapter 4 - Correlation and Regression Flashcards

1
Q

what is the purpose of correlation and regression

A

to understand the relationship between a dependent variable and one or more independent variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what does correlation and regression help in [ got 3]

A
  • identifying trends and patterns in data
  • making predictions about future values based on past data
  • quantifying the strengths and directions of relationships between variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what does the dependent variable [ the response variable] represent

A

the output or outcome whose variation is being studied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what does the independent variable [ the explanatory variable] represent

A

it represents the inputs or causes i.e. potential reasons for variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is the purpose of a scatter diagram

A

to visually represent the relationship between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what do scatter diagrams do

A

they help identify patterns, trends, and the potential strength of correlation between them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the X-axis also known as

A

the horizontal axis, the independent variable or explanatory variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is the Y-axis also known as

A

the vertical axis, the dependent variable or response variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what does the dependent variable represent

A

the output or outcome whose variation is being studied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what does the independent variable represent

A

the inputs or causes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is the regression line also known as

A

best fit line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is the sum of squared errors

A

the total of all squared differences between actual and predicted results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what does the best fit line need to do

A

the best fit line is the one that makes the SSE as small as possible, ensuring a good representation of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

how does the gradient affect the slope

A
  • when the gradient is positive, the line is sloped upwards
  • when the gradient is negative, the line is sloped downwards
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

when asked to predict a future value, which variable do you choose

A

the y data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

when asked to find the regression of A data on B data, which is the x and which is the y variable

A

the a data must be the y variable and the b data must be the x variable

17
Q

how to know if a line is accurate [3 points]

A
  • does the scatter diagram indicate that a straight line is appropriate
  • are the plotted points close to the line
  • are the forecast for a value of x within or outside the range of values X is given
18
Q

what is interpolation

A

if the x is within the range, the accuracy is likely to be better

19
Q

what is extrapolation

A

if the x is outside the range, the accuracy of the estimate is likely to be low

20
Q

what does the product moment correlation of coefficients measure

A

the strength and direction of a linear relationship between two variables and indicates how well the regression line fits the data

21
Q

what does the correlation coefficient measure

A

how good a fit the regression line to the data

22
Q

for product moment correlation coefficient, what does it mean if r is numerically greater than 1

A

you must have made a arithmetic/ numerical error

23
Q

what does it mean in product moment correlation coefficient when r > 0 and r is nearer to 1

A

there is a strong positive correlation between x and y

24
Q

what does it mean in product moment correlation coefficient when r > 0 and is near to 0

A

there is a weak positive correlation between x and y

25
what does it mean in product moment correlation coefficient when r < 0 and is near to -1
there is a strong negative correlation between x and y
26
what does it mean in product moment correlation coefficient when r < 0 and r is near to 0
there is a weak negative correlation between x and y
27
what does noise mean in data
noise refers to random variations or unexplained fluctuations in data that obscure true relationship between variables
28
what are some sources of noise [ 4]
- measurement errors - external factors [ economic conditions, weather] - sampling variability - human error in data collection
29
what does the amount of noise represent
the strength of a linear relationship
30
how can noise be determined
the calculation of mean square errors
31
what does correlation indicate
how strongly two variables are related to each other
32
what does correlation not imply
that one variable causes the other to change
32
33
what does the coefficient of deter,onation show
how much of the variation of y can be explained by the explanatory variable
34
how to find the coefficient of determination using product moment correlation coefficient
js take r square/ r2
35
what is the simplest procedure when data is given in order of merit
a rank correlation coefficient
36
what does the product moment correlation coefficient assume
that the distribution of x and y are known
37
when the distribution of x and y are not known, which method should be used
the rank correlation coefficient should be used
38
why does the value of spearman's rank correlation coefficient differ from the value of the product moment correlation coefficient?
the product moment correlation coefficient use the original data, whereas Spearman's rank correlation only used the ordered data, thus losing some accuracy in the accuracy of the information