Unit 3 Flashcards

Linear Regression

1
Q

What does r tell us?

A

How STRAIGHT a positive or negative relationship is between TWO QUANTITATIVE variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What values can r be?

A

from -1 to 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Give some strong r values and some weak r values.

A

Strong r values are close to 1 or -1, like -0.83 or 0.94. Weak r values are close to zero like 0.10 or -0.06

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

If something is correlated is it associated?

A

Yes (if it is straight)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

association or correlation?

A

association is talking about a relationship? correlation is an actual calculated number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to describe association on scatterplot?

A

DIRECTION… FORM.. STRENGTH (and outliers)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

direction?

A

positive or negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

form?

A

Linear or nonlinear (straight, curved)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

strength?

A

give the r value (if straight), or say? “tightly packed” or “ loosely packed”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

does correlation mean causation?

A

NO WAY!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Give example of incorrectly using the word “correlation”

A

“There is a correlation between gender and video game playing.” This person should say “association.” You can’t say correlation because gender is categorical.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Does high r value mean anything?

A

An r value alone tells little. CHECK THE SCATTERPLOT… IS IT LINEAR?? make sure it’s linear first.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How can you check for “straight enough?”

A

by looking at the residuals plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

how do you interpret slope?

A

“For an increase of 1 [unit of x] there is an (increase/decrease) of [SLOPE] [units of y]” Plug in your slope and your units!!!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

how do you interpret y intercept?

A

“The model predicts that if there were no [x stuff] this is how much [y stuff] you’d have” Plug in the appropriate contexts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

how to interpret slope EQUATION? rSy/Sx

A

For each increase of 1 standard deviation in x direction, you go r *(standard deviation) in y direction.

17
Q

if you multiply or divide the x’s or y’s (shift/scale) does r change?

A

No change. r does not have any units so the strength of the line will not change.

18
Q

If you switch x and y does r change?

A

No change, the strength will stay the same.

19
Q

if you switch x and y will slope change?

A

YES- slope is rsy/sx

20
Q

Can you predict an X by using a Y?

A

NOT WITH THE SAME EQUATION! BE CAREFUL!! You have to change the entire equation and start from scratch.

21
Q

What point does every regression line pass through?

A

the mean-mean point. (x bar, y bar)

22
Q

Why is it called “least squares regression line”?

A

After you find the mean-mean point, you fix the line so that it minimizes the squared vertical distance to that line (minimizes the squared residuals)

23
Q

interpret r squared

A

r squared is the % of variability in y that can be explained by the linear model.

24
Q

does high r squared mean a good model?

A

CHECK STRAIGHTNESS FIRST. you should check your plot and residuals to make sure model is appropriate and no outliers present? then it means something

25
Q

is r sensitive to outliers?

A

yes. A single outlier can make it seem like there is a relationship (out in x direction..), or that there is none.

26
Q

What is a residual?

A

ACTUAL minus PREDICTED. A-P

27
Q

What are b1 and bo ?

A

b1 is the SLOPE, and bo is the y- intercept.

28
Q

what is the LSRL

A

least squares regression line

29
Q

What should we look for in residual plot?

A

curve or pattern. Also, it should have equalish scatter from left to right

30
Q

Which is explanatory variable?

A

x. horizontal axis. it “explains” what happens to y

31
Q

which is response?

A

y.. Vertical axis.. It “responds” to the x

32
Q

will residual plots always show outliers? (will outliers always have large residuals?)

A

Not necessarily.. Some points have so much leverage, they pull the line up to it?

33
Q

How do you get equation from computer output?

A

y= b0 + b1 x or y=a+bx
y is the dependent variable
b0 is the coefficient of constant (or it says intercept)
b1 is the coefficient of the variable given (independent)
x is the independent variable

34
Q

What does “y-hat” mean?

A

The prediction equation for y.