Linear Model Flashcards

1
Q

The scatter plot can be summarised by the following five numerical summaries…

A
  • sample mean and sample SD of X
  • sample mean and sample SD of Y
  • correlation coefficient (r)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

correlation coefficient (r)

A

a numerical summary which measures how points are spread around the line.

  • It indicates both the sign and strength of the linear association.
  • The r is between -1 and 1.
  • If r is +ve: slopes up.
  • If r is -ve: slopes down
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The Correlation coefficient (r) is the mean of the product of the variables in standard units

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Properties of the Correlation Coefficient

A

Symmetry - The correlation coefficient is not affected by interchanging the variables.

Scaling - The correlation coefficient is shift and scale invariant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Outliers have no influence on ‘r’

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Nonlinear association can be detected by the correlation coefficient

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

correlation coefficient in R

A

cor()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

linear regression

A

lm(y~x)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

how to put regression line on a plot

A

abline( lm(y~x), col=”…”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Prediction error (residual)

A

vertical distance (or ‘gap’) of a point above or below the regression line (difference)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Residual plot

A

graphs the residuals vs x.
* If the linear fit is appropriate for the data, it should show no pattern (random points around 0).
* check appropriatness of linear model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

is extrapolating reliable?

A

no, it is a prediction error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

before predicting using a linear model you should…

A

check the scatter and residual plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

If the vertical strips on the scatter plot show equal spread in the y direction…

A

then the data is homoscedastic, otherwise the data is heteroscedastic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

homoscedastic

A

an assumption of equal or similar variances in different groups being compared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

heteroscedastic

A

when the SDs of a predicted variable, monitored over different values of an independent variable or as related to prior time periods, are non-constant