Lecture 6 Conceptual Short-Answer Flashcards
Exam 2
Why do researchers often prefer reporting correlations over covariances when describing association between 2 quantitative variables?
As its value ranges from -1 to 1, the size of the correlation coefficient is interpretable, unlike covariance
What does the correlation coefficient represent?
It is the magnitude and direction of the linear relationship between 2 variables
How can two variables be associated but have zero correlation?
Correlation only measures the linear relationship, if the variables are nonlinearly related, their correlation may be 0
Compare correlation with regression. What do they have in common, and how are they different?
Correlation describes the direction and strength of a linear relationship (general and without units) and produces a correlation coefficient.
Regression describes the relationship between the 2 variables by a line function with 2 model parameters (slope and intercept).
Both are used to examine the relationship between 2 variables, but provide different types of information.
What is the goal of regression analysis?
To assess the relationship between 2 variables by explaining or predicting one variable using another
In what sense can the mean be considered a good summary statisic?
The sample mean of a variable minimizes the sum of squared residuals for that variable when no information about other variables is available
What does Anscombe (1973)’s quartet imply?
It’s good to visually look at the data, not just the statistics
What graphs do you use for a relationship between 2 categorical variables?
Two-way contingency table, grouped bar graph
What graph do you use for a relationship between 1 categorical and quantitative variables?
Grouped box-and-whisker plot
What graph do you use for a relationship between 2 quantitative variables?
Scatterplot
How do you find the slope in a linear regression model?
S(dx)(dy) / S(dx)^2
How do you find the intercept in a linear regression model?
Y_bar - slope(x_bar)
How do you calculate the residual?
Residual = actual - expected