L3 - Associations Flashcards
What kind of analysis is used to assess association between continuous variables?
Correlation
What kind of analysis is used to assess association between categorical variables?
Odds Ratios (and contingency tables)
What is covariance?
This is a measure of the degree of concurrent variation in people’s scores on two variables. This is a measure of strength and direction of linear association between two variables.Can be pop parameter or sample statistic
What are similar features of variance and covariance?
- Similar formula!!- their size depends on the metric of the variables used to calculate it.. so they need to be standardised.. we need to know whether something is a big or small covariance—>PEARSON’S CORRELATION COEFFICIENT!
What is pearson’s correlation coefficient?
This is basically the standardised version of covariance!This is where the deviation scores are replaced with z scores in the process of calculating covariance. denoted with an “r”
What are some properties of the t distribution?
- bell shaped - Variability defined by its df- as df increases, the t dist converges to a standard normal dist
Which distribution does the sampling distribution of a population correlation coefficient of zero correspond to?
dont know right now!!
What are degrees of freedom?
It basically means you have certain constraints operating, and you only have a certain number (df) of options until the solution is known.
how do you calculate the degrees of freedom for a correlation?
df = n-2basically, because there are 2 variables
How does the the regions of rejection for a normal distribution differ from a t distribution?
The region of rejection is further out from zero in the t distribution
As df increases, it pushes the region close to 1.96 which is the normal distribution.
How is the observed test statistic found for sample correlation?
Tobs = (r - null hypoth value) / standard error.
whereby, standard error is:
square root of: (1- r^2)/(n-2)
When will a larger value of Tobs occur?
When:
- There is a bigger correlation (r), because then the numerator is larger and standard error is smaller
- There is a smaller standard error
Why is pearson correlation coefficient an attractive statistic for research?
Because it is a natural effect size measure. there is no need to transform the value to know the size of the effect.
When would we use fisher’s r to z transformation?
When there is a large sample, for confidence interval estimates on XECI
What are the underlying assumptions of Pearson Correlations?
That the scores on both variables are..
- linearly related to each other.
- continuous
- independently observed of each other
- normally distributed
- measured without error
- unrestricted in their range
formula to calculate pearson correlation is biased, but consistent.
Violation of the assumptions causes further bias.