Chapter 13: Relationships Flashcards

1
Q

How do you test a correlation coefficient using z-scores? Run yourself through the procedure.

A

You need two values: the signal and the error.

Signal: Use the Zr formula from below (practice reciting it).

Noise: Use the SE formula with an additional -3 [σ/sqrt(N-3)]

Test-Statistic: Divide your signal by your error.

Now look-up your value in a z-score chart.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What units are the standardized residual values in? Because of this, how can we evaluate their significance?

A

The standardized residual values are z-scores.

We can evaluate these residuals by comparing them to z-score significance values.

z > 1.95 = significant at 0.05

z > 2.58 = significant at 0.01

z > 3.29 = significant at 0.001

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Is covariance normalized or standardized?

A

Normalized.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a common complaint against the Yate’s correction?

A

It tends to over-correct the test-statistic, resulting in too low of a value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How can you transition a covariance value into a correlation coefficient?

A

You divide covariance by the product of both groups standard deviations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What do the words ‘Tertium Quid’ mean?

A

An unaccounted variable affecting the measurement (a confound).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When would you use the ‘likelihood ratio test’ (G-test)?

A

As an alternative to the chi-squared test for when sample sizes are small.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain the process of calculating confidence intervals for correlation coefficients.

A

You start by calculating the lower and upper boundary for your confidence interval.

Zr±(1.96*SEZr)

These are the same values that you used to calculate your z-score test statistic.

Take the upper and lower bound then plug them each into the attached formula.

Does your range contain zero?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does a Bayes Factor tell you?

A

“The Bayes factor tells you the probability of the data given the alternative hypothesis relative to the probability of the data given the null hypothesis.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Covariance might otherwise be called the average cross-product deviation. What is the formula for covariance, and why does the first sentence make sense?

A

The top part of this formula is called the cross-product deviation–the residual for each factor multiplied. We then sum the cross-product deviation for each datum to get the sum of cross-product deviations.

The sum of cross-product deviations becomes covariance when it is divided by the degrees of freedom–(N-1). This, mathematically, is akin to taking an average cross-product deviation across your data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

You receive a chi-square value of -4, what can you derive from this?

A

The calculations were done incorrectly. Given that you square the residual a negative outcome is not possible.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When and how would you conduct a Yate’s correction?

A

You would conduct a Yate’s correction on a 2x2 table that has low expected frequencies.

This uses roughly the same formula as a chi-squared test except that your residual is an absolute value that has 0.5 subtracted from it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a percentage bend correlation?

A

A non-linear correlation line that is robust to outliers and extreme cases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does a Pearon’s chi-square test evaluate? And what is the default assumption?

A

Checks whether there is a relationship between two categorical variables. Assumes that there is no relationship (the null hypothesis).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What rules must we follow with a 2x2 contingency table? What about a larger table?

A

2x2 tables must not have an expected value of lower than 5.

Contingency tables larger than 2x2 must have no more than 20% of its square with a value less than 5, and all with a value above 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Calculate the degrees of freedom for this chi-squared scenario. Then use the provided x2 value to determine if there is a relationship between your categorical variables (at the standard p-value).

Rows = 3

Column = 4

x2 = 11.54

A

Degrees of Freedom = (Rown-1)(Columnn-1)

DF = (3-1)(4-1) = 6

There is no relationship between our two categorical variables. We know this because our chi-squared value of 11.54 is less than our critical value of 12.592.

The chi-squared value must be bigger to reject the null hypothesis.

17
Q

What is the chi-square formula? What does each of these variables contribute to said formula?

A

Observed: The value that was observed for the condition.

Model: The value that represents the null hypothesis for the condition.

Squaring The Difference: Squaring the difference of the observed and model values removes negative inputs.

Divide by Model: Dividing by the expected value standardizes the results.

18
Q

How do you calculate the standardized residual? Why is this value useful?

A

You use the pure deviation version of the chi-squared formula, except that your standardization (model/expected value) is squared.

This is useful because it provides you with the individual residual contributions to the chi-squared value. Meaning, if you add each standardized residual then you get the chi-squared test-statistic.

19
Q

How do Asymp. Sig. and Exact Sig. in an SPSS report differ?

A

Asymptotic significance values are produced by chi-squared tests and are approximations of the true chi-squared distribution–higher sample sizes produce closer approximations.

Exact Significance values are produced by Fisher’s method and are a complicated representation of the true chi-squared distribution.

20
Q

How do you calculate an expected frequency for a contingency table? What hypothesis does this represent?

A

Using the formula below.

The expected frequency represents the frequency of an event that you would expect if there was no relationship between two variables–i.e. the null hypothesis.

21
Q

When might Fisher’s exact test be useful?

A

An alternative Chi-Squared test for 2x2 contingency tables when your sample size and expected frequencies are too low.

22
Q

What does a chi-squared result of zero tell you?

A

The expected and observed values are the same, indicating that the null hypothesis is true.

23
Q

What type of skew does the chi-square test contain? What does this mean?

A

This test is positively skewed, meaning that most values will be small and insignificant.

24
Q

What assumptions are relevant for the chi-squared distribution?

A

The assumption of independent errors.

Normality and homoscedasticity are not relevant.

25
Q

Both Spearman’s Rho and Kendall’s Tau are derivatives of Pearson’s correlation coefficient. How do they differ? What is an advantage of Rho?

A

Spearman’s Rho is used for ordinal/ranked data. This organization of data does not allow for outliers to impact results.

Kendall’s Tau is used for data with a smaller sample size.

26
Q

How do you test a correlation coefficient using a t-distribution? Run yourself through the procedure.

A

Simply use the formula below.

Take your test-statistic to a t-chart and check for significance.

If your value is bigger than the one listed then it is significant.

If your value is smaller than the one listed then it is not significant.