Statistical Modelling: Correlation + Regression Flashcards

1
Q

When is correlation used?

A

When there is no distinction between the two variables i.e. no causation implied
Measures the association between two continuous variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When is regression used?

A

When one variable is a response to another variable. The value of the X variable can be used to predict the value of Y variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does a correlation coefficient (r) of 0 imply?

A

No linear relation between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the range for a correlation coefficient?

A

-1 to 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does a Pearson’s correlation coefficient of +1 imply?

A

Perfect positive linear association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Assumptions for hypothesis testing and confidence intervals for population correlation (p)

A

Both variables are plausibly normally distribute
There is a linear relationship between them
The null hypothesis is that there is no association
Scatter diagram should show a roughly elliptical pattern

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is r^2?

A

The percentage of variance of one variable explained by the other variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How can the best fitting line of regression be estimated?

A
y = a + bx
a = intercept
b = slope
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When can multiple linear regression be used?

A

To investigate the influence of several explanatory variables simultaneously on the outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why use multiple linear regression analysis?

A
  1. To identify any explanatory variables that may be associated with the y variable
  2. To investigate the extent to which one or more explanatory variables are linearly related to the y variable after adjusting for other related variables
  3. To predict the value of the y variable from the explanatory x variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is the estimated multiple regression variable calculated?

A
Y = b0 +b1x1 + b2x2 +...bpxp
b1 = amount by which y increases on average if we increased x1 by one unit but keep all other xp's constant (or adjust for them)
b0 = intercept when all variables are 0
How well did you know this?
1
Not at all
2
3
4
5
Perfectly