Statistics Flashcards
Correlational research
non-experimental research to determine whether two variables are related to each other with little or no efforts to control extraneous variables
Third variable problem
An unmeasured (unintended) variable that could be influencing the relationship between the two variables being examined
characteristics of a relationship
the direction of the relationship and the strength of the relationship
positive correlation (the direction of the relationship)
two variables tend to move in the same direction
negative correlation (the direction of the relationship)
two variables tend to move in opposite directions
The strength of the relationship
the variance within each variable -> covariance
covariance
a measure of how he two variables vary/change together
for the formula:
1. for each participant subtract their value from the mean, for both variables, and multiply them together, giving one value for each participant
2. Sum all the values created in step 1 together
3. Divide by the number of pairs of observations minus 1
if the value is positive -> positive covariance/relationship
problems with co variance
it is very sensitive to the units of measurement of the variables and therefore its value does not imply the strength of the relationship
Correlation coefficient
Divide covariance by the product of the SD of both varibles
allows for standardised covariance
the numeral value ranges from -1.00 (perfect negative correlation) to +1.00 (perfect positive correlation) providing information about the relationship between two variables
Directional hypothesis
there is a relationship with the nature of the relationship specified
e.g. There is a (strong/moderate/weak) positive/negative relationship
non-directional hypothesis
there is a relationship without specification of the relationship, no prediction of the nature
Pearson correlation
Pearson’s product moment correlation is a correlation analysis parametric test
Assumptions of a Pearson correlation
- Levels of Measurement: Continuous data (interval or ratio)
- Related pairs: two data points from each observation/participant
- Linearity: the relationship is linear
- Normality: Residuals are normally distributed (Q-Q plot of residuals used to show)
- Homoscedasticity: variability (spread) of one variable remains constant across the range of another variable
- Absence of outliers: outliers are data points/values that differ from the majority of the data
Spearman correlation
Non-parametric alternative to the Pearson correlation
spearman coefficient:
step 1: rank the scores for each variable separately
step 2: calculate differences between ranked pairs
step 3: square the diffeences
step 4: sum the squared differences
step 5: compute the spearman rank order correlation coefficient
no tied observations
all observations are unique
tied observations
some values are the same
independent correlations
different group - do not affect the data from the other group
dependent correlations
shared variables between correlations
Regression
determine if one variable predicts/explains the other variable
Regression analysis
a statistical technique to investigate and model the relationship between variables
simple linear regession
one predictor variable predicting one outcome variable
multiple linear regression
two or more predictor variables predicting one outcome variable
Regression model (Line of best fit)
prediction of outcome changes based on value of predictor
Residual/Error
Actual value - Predicted value
helps measure how much error there is in predictions
represents difference between the actual values and the predicted values generated by our regression model