Scatterplots Flashcards
what 2 data types do they provide? (think back to week 2!)
relationship data
distribution data
two alternate names for the X variable
explanatory variable
covariate
alternate name for Y variable
response variable
in scatterplots, X variable may be what type of variable
categorical or continuous
in scatterplots, Y variable must be
continuous
2 directions of X-Y relationship
positive (X increases, Y increases)
negative (X increases, Y decreases)
3 shapes of X-Y relationship
linear
non linear
clusters
main statistic that scatterplots look at
correlation coefficient, r
values that r can take?
what two factors does it quantify?
-1 to +1
strength ( | number | ) and direction (+/-) of relationship
fairly weak correlation?
0 - 0.3 |
fairly strong correlation?
0.3 - 0.7 |
strong correlation?
0.7 - 0.9 |
very strong correlation?
0.9 - 1 |
R^2 value shows us
what proportion of variation in Y can be explained by variation in X?
correlation is not
causation