Correlation Flashcards
The correlation coefficient describes the relationship between what kind of variables?
Ratio/Interval
If X and Y have a correlation of 0, and an X score has a z score of 1.0,
then what is the best guess of the corresponding Z score of Y?
0, which is the mean of Y
If X and Y have a correlation of 1, and an X score has a z score of 0.6,
then what is the best guess of the corresponding Z score of Y?
0.6
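Both cards above follow from the standardized prediction rule: predicted z of Y = r × z of X. A minimal sketch of that rule (the function name `predict_z_y` is hypothetical, chosen only for illustration):

```python
def predict_z_y(r: float, z_x: float) -> float:
    """Best linear guess for Y's z score, given correlation r and X's z score."""
    return r * z_x


# r = 0: the best guess is z = 0, i.e. the mean of Y, whatever X is.
guess_uncorrelated = predict_z_y(0.0, 1.0)

# r = 1: Y's z score is predicted to equal X's z score.
guess_perfect = predict_z_y(1.0, 0.6)

print(guess_uncorrelated, guess_perfect)
```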
If X is the maximum possible R² and Y is the minimum possible R², then what is X + Y?
1
If Y can be predicted perfectly by X, what is the correlation of X and Y?
Can’t determine; it depends on the type of prediction. If the prediction is linear, the correlation is 1 or -1, but with a quadratic prediction the correlation could be 0.
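To see why perfect prediction does not pin down the correlation, here is a small sketch with made-up X values. The helper `pearson_r` is written out by hand for illustration; it is not taken from the deck.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation computed from sums of squares and cross-products."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


xs = [-2, -1, 0, 1, 2]            # hypothetical X scores, symmetric around 0
linear = [2 * x + 1 for x in xs]  # Y perfectly predicted by a line
quadratic = [x ** 2 for x in xs]  # Y perfectly predicted by a parabola

r_linear = pearson_r(xs, linear)        # perfect linear prediction
r_quadratic = pearson_r(xs, quadratic)  # perfect, but nonlinear, prediction

print(r_linear, r_quadratic)
```

In both cases Y is fully determined by X, yet only the linear case gives a correlation of 1; the symmetric parabola gives 0.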
In a set of 100 husbands and wives, the wife’s IQ is always exactly 3 points higher than the husband’s IQ. What is the correlation between the husband IQ and Wife IQ?
1, because the wife’s IQ is a perfect linear function of the husband’s IQ, with a positive slope.
If the Z score of a husband’s age and the Z score of the wife’s age always add up to 0, then what is the correlation between the ages?
-1, because we can always predict one age from the other, but they always move in opposite directions.
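The two husband-and-wife cards can be checked numerically. The sketch below uses invented scores (five couples rather than 100) and a hand-written `pearson_r` helper; both are assumptions for illustration only.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation computed from sums of squares and cross-products."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Wife's IQ is always husband's IQ + 3 (hypothetical scores).
husband_iq = [95, 100, 110, 120, 130]
wife_iq = [iq + 3 for iq in husband_iq]
r_iq = pearson_r(husband_iq, wife_iq)

# z scores that always sum to 0: the wife's z is the negative of the husband's.
husband_z = [-1.5, -0.5, 0.0, 0.5, 1.5]
wife_z = [-z for z in husband_z]
r_age = pearson_r(husband_z, wife_z)

print(r_iq, r_age)
```

Adding a constant shifts the scores without changing the correlation, and flipping the sign of every z score reverses its direction.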
Standard Error of the Estimate Formula
√( SSresidual / (N − 2) )
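A short sketch of the formula, taking the square root of the whole quantity SSresidual / (N − 2). The data and the function name are hypothetical; the predicted values come from the least squares line Y' = 2.2 + 0.6X fitted to X = 1..5.

```python
import math


def standard_error_of_estimate(actual, predicted):
    """sqrt(SS_residual / (N - 2)) for paired actual and predicted Y values."""
    n = len(actual)
    ss_residual = sum((y - y_hat) ** 2 for y, y_hat in zip(actual, predicted))
    return math.sqrt(ss_residual / (n - 2))


actual = [2, 4, 5, 4, 5]                 # hypothetical Y values
predicted = [2.8, 3.4, 4.0, 4.6, 5.2]    # Y' = 2.2 + 0.6 * X for X = 1..5

see = standard_error_of_estimate(actual, predicted)
print(round(see, 3))
```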
The BEST regression line equation minimizes the error between the actual values and those predicted by the regression line. Error is defined as…?
Sum of squared differences between actual and predicted values
How much error is there in the least squares regression line with 2 pairs of XY values?
None. We can always draw a line between two points that contains both points (hence no error between predicted and actual values).
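As a quick check of this card, the sketch below fits a least squares line to two made-up points and sums the squared errors. The helper name `least_squares_line` is hypothetical.

```python
def least_squares_line(xs, ys):
    """Return (slope, intercept) of the least squares line for paired data."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


xs = [1.0, 4.0]   # two hypothetical X values (they must differ)
ys = [3.0, 9.0]   # their Y values

slope, intercept = least_squares_line(xs, ys)

# Sum of squared differences between actual and predicted Y values.
sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
print(slope, intercept, sse)
```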
The Goodness of fit test.
This test compares an actual distribution of frequencies to a theoretical probability distribution. For example, we could compare an actual distribution of a coin flip to a theoretical distribution (i.e. 50/50) to determine if there was evidence for an unfair coin.
The Test for Independence
This test looks at two categorical variables and determines whether a category of one variable tends to be associated with a category of the other variable. For example, we could look at whether men and women answer a question such as ‘Have you been arrested?’ differently. The test would determine if men are more likely to give a certain response (e.g. yes).
The McNemar test for change
This test compares the distribution of a categorical variable before and after an event. For example, this test could be used to see if support for a government policy (yes/no) is affected by an external event (such as a war or an economic downturn).
Goodness of Fit test; Calculate the deviations from the expected frequency
Calculate Σ (Obs_i − Exp_i)² / Exp_i
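The coin-flip example from the goodness of fit card can be worked through with this formula. The counts below (60 heads, 40 tails in 100 flips) are invented for illustration, and the function name is hypothetical.

```python
def chi_squared_statistic(observed, expected):
    """Sum of (Obs_i - Exp_i)^2 / Exp_i over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


observed = [60, 40]   # hypothetical counts of heads and tails
expected = [50, 50]   # a fair coin predicts 50/50 over 100 flips

chi2 = chi_squared_statistic(observed, expected)
print(chi2)  # compare with the critical value for df = 1 (3.84 at alpha = .05)
```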
If I want to see if being married has an effect on voting for a married candidate in an election, what kind of statistical test would I use?
Chi-squared test for independence
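A sketch of how the independence test could be computed for a question like this one. The 2×2 counts are invented, and the function name is hypothetical; each expected frequency is (row total × column total) / grand total.

```python
def chi_squared_independence(table):
    """Chi-squared statistic for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            chi2 += (observed - expected) ** 2 / expected
    return chi2


# Rows: married voters, unmarried voters (hypothetical counts).
# Columns: voted for the married candidate, voted for someone else.
table = [[30, 20],
         [20, 30]]

chi2_independence = chi_squared_independence(table)
print(chi2_independence)  # df = (rows - 1) * (columns - 1) = 1
```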
I want to see if getting married affects support for sexual predator laws. I collect my data and find that getting married produces X switches from ‘Yes’ to ‘No’ and X switches from ‘No’ to ‘Yes’. What are the results of the hypothesis test?
Since the number of switches is the same in both directions, the expected frequency for each category will be the total (2X)/2 = X, which means...
The expected and the observed frequencies will be the same. This means the individual χ² values will all be 0, so
the sum of all χ² values will be 0. The null hypothesis will not be rejected because the observed χ² = 0.
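The reasoning above can be sketched in code: only the two "switch" cells enter the test, and each is expected to hold half of the switches. The function name and the example counts are hypothetical.

```python
def mcnemar_chi_squared(yes_to_no, no_to_yes):
    """Chi-squared for change, using only the two cells that switched."""
    expected = (yes_to_no + no_to_yes) / 2   # equal switching under the null
    return sum((observed - expected) ** 2 / expected
               for observed in (yes_to_no, no_to_yes))


# Equal switches in both directions, as in the card (X = 12 here).
chi2_equal = mcnemar_chi_squared(12, 12)

# Unequal switches, for contrast.
chi2_unequal = mcnemar_chi_squared(15, 5)

print(chi2_equal, chi2_unequal)
```

With equal switches the statistic is exactly 0, so no critical value can be exceeded.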
Total error for regression
Total Error = Σ (Ypredicted − Yactual)²
Consider the following information for a power calculation (normal score distribution with μ1 known, σ estimated from sample):
μ1 = 100, μ2 = 102, σX = 16, N = 16, Tails = 2, α = .05.
Calculate the power of this experiment
CAN’T SOLVE!
Because σ is estimated from the sample when it HAS to be KNOWN.
We can’t solve power problems with an estimated σ; we need the standard deviation of the population ONLY.
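For contrast, here is a sketch of the calculation that would apply if σ = 16 were the known population standard deviation. That is an assumption that differs from the card, where σ is only a sample estimate. The function name is hypothetical; `statistics.NormalDist` comes from the Python standard library.

```python
import math
from statistics import NormalDist


def z_test_power(mu1, mu2, sigma, n, alpha=0.05, tails=2):
    """Power of a z test, ASSUMING sigma is the known population SD."""
    std_normal = NormalDist()
    standard_error = sigma / math.sqrt(n)
    shift = abs(mu2 - mu1) / standard_error        # true difference in SE units
    z_crit = std_normal.inv_cdf(1 - alpha / tails)
    power = 1 - std_normal.cdf(z_crit - shift)     # rejections in the upper tail
    if tails == 2:
        power += std_normal.cdf(-z_crit - shift)   # rejections in the lower tail
    return power


power_if_sigma_known = z_test_power(100, 102, 16, 16, alpha=0.05, tails=2)
print(round(power_if_sigma_known, 3))
```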
When do we use a one-tailed confidence interval?
WE DON’T!!! ALWAYS TWO TAILED FOR CONFIDENCE INTERVALS :D
When comparing scores of two groups, dependent t tests are preferred over independent t tests because they usually have____?
A) a lower standard error
B) more variability
C) a lower value in the denominator of the equation which calculates tobs
D) three of the other answers
E) two of the other answers
E:
a lower standard error (A) and a lower value in the denominator of the equation (C)!
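A small sketch of why the dependent test usually has the smaller standard error: when paired scores are positively correlated, the difference scores vary much less than the raw scores. The before/after scores are invented, and the independent-groups formula shown assumes equal group sizes.

```python
import math
from statistics import stdev, variance

before = [10, 12, 14, 16, 18]   # hypothetical scores, same people measured twice
after = [11, 14, 15, 18, 19]
n = len(before)

# Independent t test: standard error of the difference between two means.
se_independent = math.sqrt(variance(before) / n + variance(after) / n)

# Dependent t test: standard error of the mean difference score.
differences = [a - b for a, b in zip(after, before)]
se_dependent = stdev(differences) / math.sqrt(n)

print(round(se_independent, 3), round(se_dependent, 3))
```

The standard error is the denominator of the t_obs equation, so answers A and C describe the same advantage.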