W4: RQ for Predictions 1 Flashcards
Predictions: What is it, and what is the keyword? What does it not do?
Using knowledge about one/more constructs to indicate standing on another construct.
- Indicate / Account for / Explain (if used descriptively)
- Not cause
- Barometer indicates / account for /explain (if used descriptively), but it does not cause the weather
What is in a good RQ involving prediciton
- ) Statement ending with ?
- ) Include all relevant constructs
- ) Indicate all relevant population
- ) Use predict as “driving word” (key)
DV/IV in a prediction RQ. X/Y.
Does the meaning and focus of RQ change if X and Y swapped?
- DV: Y-Axis
- Being Predicted (Predicted)
- IV: X-Axis
- Doing the predicting (Predictor)
- Meaning and focus of RQ change depending on which variable is defined on the DV/IV
What is “Variation”. How is it measured
Variation
- Total amount of variability in a distribution of scores from the mean
- Sum of squared deviation scores; or
- Sum of squares
- Hence, gets larger as n increases
What is “Variance”. How is it measured and expressed
Variance
- Average Sum of Squares in a distribution of scores
- Expressed in a squared metric, relative to the scores on which it is calculated
- Hence, it is independent of sample size
What is “Standard Deviation”. How is it expressed
Standard Deviation
- Square Root of Variance
- Expressed in same metric as scores on which it is calculated
What is the geometrical interpretation of deviations
- SD
- Length
- Variance
- Area (It is a square)
- Variation
- Sum of all the Areas/Squares
- (That’s why it’s called sum of squares)
What is the key distinguishing features between correlation and regression
- Correlation
- Symmetric Relationship
- Regression
- Asymmetric Relationship
What is a Symmetric Relationship. In terms of correlation; IV/DV; scatterplot
Variables have the same role and function in the characteristic of scores being summarised.
- Cor (A,B) = Cor (B,A)
- No IV/DV
- Scatterplot: Variable on X/Y axis does not matter
What is a Asymmetric Relationship. In terms of correlation; IV/DV; scatterplot
Variables have the different role and function in the characteristic of scores being summarised.
- Cor (A,B) /=/ Cor (B,A)
- IV/DV declared a priori
- Scatterplot: Variable on X/Y axis fundementally important
What is the formula and conceptual formulation of a correlation
Correlation
- Sum of cross products of deviation z-scores / (n-1)
- rxy = E (Zxi x Zyi) / n - 1
- Standardised (z) measure of strength and direction of association
- (X and Y deviations in z will not be the same!)
- We can look at the size of correlation to determine the strength of the association directly (-1 to 1)
What is the conceptual formulation of a covariance
Covariance
- Sum of cross product of deviation scores / (n-1)
- Sxy = E [(Xi - μx ) x (Yi - μy)] / (n-1)
- Unstandardised (using deviation) measure of strength and direction of association
- (X and Y deviations will not be the same!)
- We cannot look at the size of covariance to determine the strength of the association directly (-∞ to +∞)
Can we calculate correlation from covariance, vice versa? In what circumstances can we do that
Yes.
But we have to know the standard deviations of variable x and y
- rxy = Sxy / sdx x sdy
- Sxy = rxy x sdx x sdy
What alternative name of line of best fit. What does it do?
Linear Regression Line
- Summarise relationship between 2 variables
Formula to calculate the slope of the regression line
bx = rxy x (SDy / SDx)
- Sample correlation x Respective standard deviations.
- Know value of sample correlation
- Know respective standard deviations
- Therefore, slope will be the same if SD is the same, which is not very possible.
Alternatively,
b = (Y2 - Y1 ) / (X2 - X1)