module 7 Flashcards
(39 cards)
3 types of associations
- positive: high score predicts high score/low>low
- negative: high score predicts low score/low>high
- no association: scores on first variable dont imply scores on second
what does pearsons correlation coefficient assess and what are the cut offs
- most common index of linear regression that assesses magnitude and direction
- range from -1 to 1: 1= perfect positive association, 0= no association, -1= perfect negative association
computation of SP
- sum of products of deviation, a index of covariability
- SP= ∑(X-x̄)(Y- ȳ)
for sum of products of deviation, what do the following suggest:
- lots of above mean/above mean or below/below values
- lots of below/above or above/below
- equal mix of above and below
- big positive SP value, big negative SP value, Sp value is close to zero
r coefficient is covariability of _____ relative to variability of _____
X and Y together, X and Y separately
r coefficient formula
r= SP/√SSxSSy
as you increase sample size, correlation coefficient will _____
not really change (as both SP and SS increase)
r coefficient formula; z score edition
r=∑ZxZy/n
z score
- best way of expressing location of a score w in a distribution
- reflects one score’s standing within a distribution of the score
coefficient of determination
- correlation squared (r^2)
- proportion of variance in one variable linearly accounted for by the other variable
r t test formula
t = r√df/√1-r^2
pearsons correlation special cases
- point biserial correlation: dichotomous/cont variable
- phi coefficient: dichotomous/dichotomous variable
- spearman rank order coefficient: ordinal variables
factors influencing the size of r
- distribution of variables: perfect is only possible if distribution is exactly pos or neg (same/opposite)
- reliability of measures: perf correlations need perf reliability in both measures
- restriction of range: restriction can attenuate correlations
what does (1-r^2) represent
the proportion of variance in the outcome variable that is not explained by the predictor variable
simple vs multiple regression
- single: uses single predictor variable
- multiple: uses 2+ predictor variables
regression equation
Y=bX+a
y= score of second variable (outcome)
b= slope of best fit line/regression coefficient
X= score of first variable (predictor)
a= y intercept aka regression constant, where x=0
total squared error (aka sum of squares error/ SSerror for regression
SSerror=∑(Y-Ŷ)^2
Y= actual score
Ŷ= predicted score
or = (1-r^2)SSy
what is used to determine extent to which regression equation fits actual data set
total squared error (SSerror) for regression
what equation can be used to find b in linear regression equation
b= SP/SSx
or
= total covariability of X+Y/total variability of X
what equation can be used to find a in linear regression equation
a=Ȳ-bX̄
when X and Y are z scores, what is the regression equation
Ẑy=rZx
- a=0 sinse X and Y are zero so its dropped
standard error of estimate
- measure of standard distance between regression line and actual data points
=√SSerror/df
as r approaches 0, SSerror is ____ but as r approaches 1, SSerror is ____
decreased, increased
in simple regression, what is the null hypothesis
- testing if b (slope) value, null=no linear association between X and Y