Week 10 - Correlation and Linear Regression Flashcards

1
Q

What is the point of a correlation test?

A

Allows you to examine an association between two scale variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the different symbols for the Pearson correlation coefficient and when are they used?

A

r - when measured from a study sample
rho (𝜌) - when discussing the statistical population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do the values of the Pearson correlation coefficient (r) represent?

A

Values close to zero means there is no linear association
Values near 1 (positive slope, direct) means x and y increase together and are very close to a straight line (as x increases by one, so does y)
Values between -1 an 0 (negative slope, inverse) mean as y decreases, x increases
Values of 1 or -1 are perfectly correlated positively or negatively

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What values determine the strength of correlation with the Pearson correlation coefficient?

A

< 0.3 = weak
0.3 -/< 0.5 = moderate
=/> 0.5 = large

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the notation for the null and alternative hypotheses for correlation tests?

A

𝐻0: 𝜌 = 0
𝐻𝐴: 𝜌 ≠ 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the df notation for correlation tests?

A

df = n - 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What sampling distribution do t-tests use?

A

t-distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the assumptions for correlation tests?

A

Variables X and Y should be scale
Variable X has a linear relationship with variable Y
Both variables should approximate a normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When do you reject the null hypothesis for correlation tests?

A

p < alpha
𝜌 ≠ 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the point of simple linear regression?

A

To examine whether changes in variable X can predict changes in variable Y when X and Y are numerical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Which variables in linear regression are on the x and y axes?

A

X axis: predictor/exposure/outcome variable
Y axis: outcome/dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the equation for simple linear regression?

A

Y = mx + b + error
Y: outcome
m: slope
x: predictor
b: y-intercept

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the parameters for simple linear regression and what are they used for?

A
  1. intercept
  2. slope

Hypothesis testing is done on the parameters of the systematic component

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the notation for the null and alternative hypothesis for linear regression?

A

𝐻0: 𝛽1 = 0
𝐻𝐴: 𝛽1 ≠ 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the assumptions for simple linear regression?

A

Variable type: outcome must be numeric and predictor can be numeric or categorical
Independence: Y values are independent of one another
Linearity: X and Y have a linear relationship
Normal distribution: Residuals of the relationship are normally distributed (Y has normal distribution around the mean of X)
Homoscedasticity: Y variance is equal for any X value
Consider potential outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When do you reject the null hypothesis for linear regression?

A

p < alpha

16
Q

What is the difference between correlation and linear regression?

A

Correlation: association and correlation
Regression: prediction and causation