Regression Flashcards

1
Q

What is a regression line
(Least sqaures regression line)e

A

Basically the line of best fit

It’s the line which minimised the VERTICAL distance from Esch data point, also known as LEAST SQAURES REGRESSION LINE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What will every regression line go through

A

Mean x mean y,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why are there TWO regression limes one for y and one for x

A

Regression line fir h takes into minimising VERTICAL distances and so for any value of x will return y

If you want to put y in and return x, then your line must minimise the HORIZONTAL DISTANCES instead , and need a new formula

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is grsdient both times, y on x and x on y

A

Y on x is sxy/ sxx

X on y is sxy/syy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Okay so what ar W both equations then

A

Make it easy y on x
Y-Ybar = m (x-xbar)

X in y

X - xbar= m (y-ybar)

And here x =my +c

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to know Y ON X or X ON Y

Random on non random ?

A

When it’s RANDOM ON NON RANDOM, you obv want the non random to give yiu a random result, so you want x to give you a y so you want y on x.
- it wouldn’t make sense to do x on y, giving you a non random result froma random variable?

2) HOWEVER RANDOM ON RSNDOM, you can use both! So x on y and y on x!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Interpolation vs extrapolation

A

Extrapolate is predicting values outside of range, might be unreliable might be useful

Interpolation is between data points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Residuals?

So what is the least regression sqaure line mean

A

Is the distance between the collected point snd the predicted value for it, using the collected datasx point snd putting it into the line.

Least sqaures regression line is MINIMISING THE sqaures of the residuals (because they can be negative)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What’s the sum of all residuals

A

Sum of all residuals after derivation WILL ALWAYS BE EQUAL TO 0!!!!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Formula for residual

A

Y value - prosecuted y value

Y - (bx+a)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does the value of equal in equations for regression
Why

A

We know regression always goes through mean

So in ewautiom, rearrange for a = y bar - bxbar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Again what does regression line aim to do with residuals

A

Miniminise the sum of sqaures of residuals as sum of normal is 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When can we and when should we not use a regression lime (shape of curve)

A

If it doesn’t seem to be a linear association rather curved, csn’t use

  • if some of data fits but the rest doesn’t kigjt not be aproprsote, might have to extrapolate, think about context
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is coeffeicmt of determination and how to use it

A

It’s r 2, PMCC2

It explains the proration of y explained by change in x
So if it’s 50%, it means the variation in y is 50% explained by x that’d it, the rest is OTHER FACTORS

Like location etc for house scenario

How well did you know this?
1
Not at all
2
3
4
5
Perfectly