Practical 6: Correlation & Regression Flashcards

1
Q

When is correlation conducted?

A

When we have two continuous variables and we need to assess their relationship

  • graphically represented using a scatterplot
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How is a scatter plot demonstrated in SPSS

A

Graph –> legacy dialogs –> scatter/dot –> define groups and label cases by ID

Labelling cases by ID ensures we can identify any outlying variables

To put a line of best fit –> choose at line of best fit –> select linear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is correlation?

A

A measure of statistical association and assess the strength of a linear relationship - can demonstrate the direction and magnitude of the relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define the strength of correlations?

A
  1. 0 - 0.20 –> very weak
  2. 2 - 0.39 –> weak
  3. 4 - 0.59 –> moderate
  4. 5 - 0.70 –> strong
  5. 80 - 1.00 –> very strong
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the assumptions for Pearson’s?

A

Normally distributed
Each participant has a pair of values
There is a linear relationship between the values
No outliers
Observations are randomly and independently drawn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do we select pearson’s

A

Correlate –> bivariate –> pearson’s

Choose - Spearman’s correlation if not normally distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When is Spearman’s correlation used?

A

Non-normally distributed data (or ordinal data - assume it is skewed)

Monotonic relationship between two values - i.e when one increases or decreases so does the other but not the same/constant rate

Random independent observations

Both have a pair of values

(non-parametric version of pearson’s correlation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the components of a simple linear regression?

A

y = B0 + B1X + e

x - predictor, independent variable (continuous or categorical)

y = dependent variable, outcome –> always continuous

B0 –>intercept value that y takes when x is 0

B1 is the slope –> determines the change in y when x changes by one unit

e –> residual represents the distance between points on the y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the ordinal least sqaures method?

A

Method of making the residuals as small as possible

Line of best fit represents the one in which the sum of the residuals is the smallest. Worked out by squaring each residual and adding them together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the assumptions for a simple linear regression?

A

There is a linear relationship between the dependent and independent variables

Residuals are independent of another

Residuals follow a normal distribution with mean 0 and constant SD

Homogeneity of variance - i.e the size of the error doesn’t significantly alter for different values of the independent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is a simple linear regression ran in SPSS?

A

Analyse –> regression –> linear

On statistics choose estimates oriented and confidence intervals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does an R square refer to on a simple linear regression printed out?

A

How much variance in the independent variables accounts for the the variance in the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does the ANOVA table refer to on simple linear regression?

A

Whether the SLR models explains the data significantly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How is a SLR reported?

A

Significant relationship was found between X and Y with a 1cm increase in X associated with a B1 increase in Y ( B1 = , t =, p =, 95% CI =)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

In SPSS how is a prediction model made for a simple linear regression?

A

Type in new height (X value)

Click analyse –> regression –> simple linear –> define groups –> in save select under-standardised value, 95% confidence individual

Select mean if want to estimate y of the population

Select mean if want to estimate y for an individual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do we conduct SLR if there is a categorical x

A

Regression line would just connect mean responses in one group with a mean response in another

The slope coefficient measures the group differences in means –> i.e. how does y change when gender changes

17
Q

How do we do a linear regression with a categorical variable of more than 2 values

A

Need to recode x as a dummy variable

D1 (low)
D1 1, D2 0, D3, 0

D2 (medium)
D1 0, D2 1, D3 0

D3 (high)
D1 0, D2 0, D3 1

18
Q

How do we do a linear regression with a categorical variable of more than 2 values

A

Need to recode x as a dummy variable

D1 (low)
D1 1, D2 0, D3, 0

D2 (medium)
D1 0, D2 1, D3 0

D3 (high)
D1 0, D2 0, D3 1

19
Q

How are dummy variables created in SPSS?

A

Tranform –> recode into different variables

Choose variable –> and name three new variables –> for each one click change to name –> click old and new values to select value for it

20
Q

How can we check if we can use B1 for the whole population?

A

check the p value of B1