Chapter 3 Regression Flashcards

1
Q

What is a regression line !

A

The best line of best fit you can possibly draw with data supplied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is line called least squares of residuals

A

Residual is the distance between the line and actual observed point, such that it’s point - line (below the line negative)

Sim of all residuals =0, hence they square them.
Line is positioned in such a way to minimise the total distance of the total squares residuals

Hence least squares

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When do you use y on x x on y

A

When you have a X value and want to predict y, use y on x

If y to predict x , x on y

Now remember these aren’t the SAME LINE, will be similar but not case of reflecting and it’s calm or anything

= they won’t give the same lien of best fit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Looking at the derivation what do all reversion lines go through

A

The mean of data , x bar y bar

Therefore both lines intersect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When can we use y on x and x on y in certain situations s

A

When random non random, means we tryna predict y values from x

This y on x ONLY

But if they both random, such as weight and height can do both

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Remember when calculating , it’s not sim x x y it’s also sum xy

A

Don’t lack

Similarly it’s not sum of y all squared, there is also sum of y ^2 DONT LACK

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Interpolation be extrapolation
Why is extrapolation unreliable

A

Not possible to confirm the trend beyond the data points, only can interpolate to get a reliable estimate, but extrapolation, could give a good prediction but it’s UNRELIABLE, because we can’t confirm the trend

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Sun of all residuals

A

0, hence that’s why we square then and minimise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Again definition of regression line

A

Aims to minimise the SUM of all speceicied ( y or x) residuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is coefficient of determination

A

R^2 explains the variation of y as a result of x

So if 0.5 , then 50% of variation in y so like house prices are explained by variation in x, and the rest is due to other reasons

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When should we use regression line

A

Only if it is linear else we can’t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly