UNIT 2 VOCAB Flashcards

1
Q

What does r tell us?

A

How STRAIGHT a positive or negative relationship is between two QUANTITATIVE variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What values can r be?

A

from -1 to +1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are some strong r values and some weak r values

A

Strong r values are close to 1 or -1, like -0.83 or 0.94. Weak r values are close to zero like 0.10 or -0.06

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What if a scatterplot goes straight across horizontally?

A

NO ASSOC. That would be like height and IQ. If each height has about the same IQ, then they are INDEPENDENT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

If something is correlated is it associated?

A

Yes (if it is straight)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

association or correlation?

A

association is talking about a relationship? correlation is an actual calculated number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How to describe association on scatterplot?

A

DIRECTION… FORM.. STRENGTH (and outliers)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

direction?

A

positive or negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

form?

A

straight, curved

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

strength?

A

give the r value (if straight), or say? “tightly packed” or “ loosely packed”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

does correlation mean causation?

A

NO WAY DUDE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Give example of incorrectly using the word “correlation”

A

“there is a correlation between gender and video game playing” This person should say “association.” You can’t say correlation because gender is categorical.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

does high r value mean anything?

A

An r value alone tells little. CHECK THE SCATTER.. IS IT LINEAR?? make sure it’s linear first.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How is r calculated?

A

r= sum(ZxZy) / (n-1)—- the sum of rectangle areas on standardized axes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

how can you check for “straight enough?”

A

residuals plot fool!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

how do you interpret slope?

A

for an increas of 1 [unit of x] there is an (increase/decrease) of [SLOPE] [units of y]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

how do you interpret y intercept?

A

The model predicts that if there were no [x stuff] this is how much [y stuff] you’d have

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

how to interpret slope EQUATION? rSy/Sx

A

for each increase of 1 st dev in x direction, you go r st dev in y direction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

If r= 0.8.. An x value that is 2 standard deviations above the mean will have a predicted y value that is _______

A

1.6 standard deviations above the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

if you mult or divide the x’s or y’s (shift/scale) does r change?

A

no. the strength remains the same. (If you log or square it, it will change, but just adding or multiplying won’t change it)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

if you switch x and y does r change?

A

NO. The strength stays the same.

22
Q

if you switch x and y will slope change?

A

YES- slope is rsy/sx? to get new slope you do: (r sqared)/old slope

23
Q

Can you predict an X by using a Y?

A

NOT WITH THE SAME EQUATION! BE CAREFUL!! You have to change the entire equation and start from scratch?

24
Q

What point is on every regression line?

A

the mean-mean point. (x bar, y bar)

25
Why is it calle d "least squares regression line?"
Because, after you find the mean-mean point, you fix the line so that it minimizes the squared vertical distance to that line (minimizes the squared residuals)
26
interpret r squared
r squared % of variability in y can be explained by the model. The rest is in residuals?
27
does high r squared mean a good model?
CHECK STRAIGHNESS FIRST. you should check your plot and residuals to make sure model is appropriate and no outliers present? then it means something
28
is r sensitive to outliers?
yes. A single outlier can make it seem like there is a relationship (out in x direction..), or that there is none.
29
Look for lurking variables?
think hot chocolate sales in caf at wachusett mountain and ski accidents at wachusett mountain. Did the chocolate cause the accident??????
30
outliers in regression?
doesn't follow the "flow" (pinky trick)
31
what about your calculator for using curves to fit curved data?
sure.. Quadreg, cubicreg, lnreg, etc? just be careful when substituting while writing the equation given.
32
what does "regression to the mean" mean?
preditions for y are closer to the mean y (y bar) than the actual x is to the mean x (in s.d). Sons were closer to average height than the dads. Super tall dads had tall sons, but not super tall sons, on average.
33
what does influential mean?
It means that the point, when added or removed to data, will influence the SLOPE.. Generally these are outliers in the x direction?. Far left or right.
34
what is a linear model?
it is an equation you can use? or a line of a graph, but it is just a model that says what kind of happens, and can be used to ESTIMATE WHAT MIGHT HAPPEN
35
what is a residual?
ACTUAL-PREDICTED. A-P. like this class.. AP (get it?)
36
what is b1 and bo ?
b1 is the SLOPE, and bo is the intercept. Remember that bo can be thought of as "b old" it is the old b? the intercept in y=mx+b? so it is still the intercept.
37
what is leverage?
leverage just means it is far away from x-bar? far right or left from the middle.. Some leverage points are not influential if they go along with the flow of the scatter.
38
what is the line that you plot?
IT IS A MODEL!
39
what is the LSRL
the "least squares regression line" that line.. That equation
40
what should we look for in resid plot?
curve or pattern.. Also, it should have equalish scatter from left to right
41
what's up with extrapolation?
not a good idea. sometimes it's all you can do, but still, NOT GOOD
42
which is explanatory variable?
x. horizontal axis. it "explains" what happens to y
43
which is response?
y.. Vertical axis.. It "responds" to the x
44
will residual plots always show outliers? (will outliers always have large residuals?)
Not necessarily.. Some points have so much leverage, they pull the line up to it?
45
What is homoscedasticity?
equal scatter along the regression line
46
What if the scatterplot is curved?
either straighten it and fit a line, or keep it and fit a curve (quadreg, cubicreg, lnreg, logreg)
47
How can you straighten data?
Do stuff to the y (square it, root it, log it, etc) and recheck the plot. Remember to put the transformation into your equation.. Example Sqrt y = 4.33 - 2.03 x
48
How do you undo a log when solving?
10^ stuff
49
How do you undo an ln when solving?
e^stuff
50
How do you undo sqrt when solving?
^2
51
How do you undo squares or cubes?
^ 1/2 or ^ 1/3
52
How do you get equation from computer output?
y= b0 + b1 xy is the dependent variableb0 is the coefficient of constant (or it says intercept)b1 is the coefficient of the variable given (indep)x is the indep variablegenerally arranged: Y= this down plus this left "