Lecture 8 - Interpolation vs Extrapolation, Influential Observations, Confound, Lurking Variable Flashcards
Question
Which of the following CANNOT be done using a simple linear regression equation?
A) Determine whether the association is linear or non-linear
B) Predict the value of 𝑦 for a particular value of 𝑥
C) Estimate the slope between 𝑦 and 𝑥
D) Estimate whether the linear association is positive or negative.
A) Determine whether the association is linear or non-linear
A simple linear regression equation assumes a linear relationship between the independent variable x and the dependent variable y. It cannot be used to determine whether the association is linear or non-linear. If the relationship is non-linear, other models (such as polynomial regression or non-linear regression) should be used instead.
Question
Interpolation vs Extrapolation
Compute the Correlation
Question
What is the value of the correlation?
A) 0.8076
B) 0.7691
C) 0.8986
D) -0.8986
E) None of the above
D) It will decrease by 0.30730
A) 2.59 mm/y
C) It would not be appropriate to make this prediction
Extrapolation Required
Outliers and
Influential Observations
Adding an Outlier
Notes:
Influential observation
Has a large influence on the statistical calculations being done
Notes:
Identification of “Influential Observation”
Under certain scenarios, the investigator may choose to remove such points from the analysis
- If removing it from the data causes our line of best fit to change markedly (see previous example), then its an influential observation.
Investigators should try to determine if it’s due to an error or some other factor surrounding the unit/process from which this observation was collected.