BIO 300 Lab Quiz 3 Flashcards
correlation
strength of a linear association between 2 numerical variables
correlation uses
correlation coefficient
correlation coefficient
r
[-1,1]
unitless
-r
as one variable increases the other decreases
inferences from correlation
cannot infer causality
regression
implies causality between 2 variables
used to predict value of response variable from explanatory variable
can determine how much of variability is due to relationship w/ explanatory variable
regression statistic
R^2 = SS_regression / SS_total
[0,1]
linear regression assumptions
- relationship between response (Y) and explanatory (X) is linear
- Y values at each value of X are normally distributed
- variance of Y values is same at all values of X
- Y measurements are sampled randomly from the population at each value of X
are there any outliers
yes, can be seen by boxplot
how to do regression with multiple groups (N/S)
“regression with groups”
then add total regression line by right clicking then going to regression fit
do the data need to be transformed
are the data clumped in one corner of the scatterplot
is there greater spread in one section of the scatterplot
are there different orders of magnitude spanned in the variables
how to SLR
stat– regression– regression– fit regression model
options to check for regression
responses- Chl-a continuous predictor- Log P graphs- residuals vs. fits results- everything but Durbin-Watson storage- residuals
SSregression
proportion of variation in response variable accounted for by the regression
SSresidual
proportion of variation unexplained by regression