Lesson 3- Bivariate Data Analysis Flashcards
0
Q
Correlation Coefficient r
A
Scatterplot and r
- positive
- negative
- strong ( |r| -> 1 )
- weak (snowflakes, |r| -> 0 )
- r X slope
1
Q
Scatterplots
A
- 2 datas
- strength of a relationship
- predicting further results using functional expression
2
Q
Correlation Coefficient Properties
A
- positive (negative) r implies positive (negative) association
- correlation coefficient r has values between -1 and 1
- weak relationship r -> 0, strong relationship r -> 1, -1
- association X implies causation (causality: one causes another: fire and firemen)
3
Q
Correlation Coefficient properties 2
A
- r value has meaning only when linear relationships is assumed
- x and y must be quantitative (X categorical)
-x (explanatory variable) and y (response variable) are
indistinguishable; if x and y are switched, r value won’t change
- changing units in x or y has X effect on r: cause it’s fraction
- r is not resistant to outliers
4
Q
R^2
A
R^2 gives percentage of variation in y that is explained by the variation in x
5
Q
Least Squares Regression Line
A
LSRL
- y hat = a + bx
- best fitting straight line found by minimizing the sum of squares of residuals
- residual : y - y hat
- slope b=r(Sy/Sx), a from LSR passing through (mean x, mean y)
6
Q
Regression line
A
- distinguish b/w x and y ; regressions minimize residuals in y direction, not x
- if you graph z-scores of y against z-scores of x, slope of regression line = r, intercept = 0
7
Q
Regression chart
A
b = (seats), coefficient
a = constant, coefficient
r = square root ( R-sq )