M7: Cautions in Regression and Categorical Data Flashcards

1
Q

Extrapolation

A

Predicting y for values of x outside the observed range of the data, where the fitted relationship can no longer be trusted
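As an illustration of why extrapolation is risky, the sketch below fits a least-squares line to made-up data that actually follow a curve (y = x squared) and then predicts far outside the observed x range. The helper names `fit_line` and `is_extrapolation` and the data are assumptions chosen for the example, not part of the course material.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def is_extrapolation(x_new, xs):
    """True when x_new lies outside the observed range of x."""
    return x_new < min(xs) or x_new > max(xs)


# Made-up data: x observed only between 1 and 5, true relationship is curved.
xs = [1, 2, 3, 4, 5]
ys = [x ** 2 for x in xs]

slope, intercept = fit_line(xs, ys)

x_inside, x_outside = 3, 10
pred_inside = slope * x_inside + intercept    # 11.0 (true value is 9)
pred_outside = slope * x_outside + intercept  # 53.0 (true value is 100)

print(is_extrapolation(x_inside, xs), pred_inside)
print(is_extrapolation(x_outside, xs), pred_outside)
```

Inside the observed range the line is off by 2; at x = 10 it is off by 47, even though nothing in the fitted data warned of this.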

2
Q

Cautions in Analyzing Associations

A

a) Extrapolation
b) Influential outliers
c) Correlation does not imply causation

3
Q

Outliers

A

Points that lie far from the overall trend of the other observations

4
Q

Influential outliers

A

Outliers that tend to pull the regression line towards them, so that removing them noticeably changes the fitted line
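A minimal sketch of this pull, using made-up data: five points lie exactly on a line with slope 2, and a single extra point at (20, 0) is enough to flip the sign of the fitted slope. The function name `fit_slope` and all values are assumptions for illustration.

```python
def fit_slope(xs, ys):
    """Return the least-squares slope for the given points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Made-up data lying exactly on y = 2x.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 6, 8, 10]

# The same data plus one point far from the others in x and off the trend.
xs_with = xs + [20]
ys_with = ys + [0]

slope_without = fit_slope(xs, ys)          # 2.0
slope_with = fit_slope(xs_with, ys_with)   # negative, roughly -0.26

print(slope_without, slope_with)
```

Comparing the fit with and without a suspect point, as done here, is one simple way to judge whether it is influential.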

5
Q

What should you do if your data have outliers?

A

Check the data and correct any typos.
If there are unusual observations, try to find out more about them.
If they do not belong in the data set, delete them before proceeding with the regression analysis.

6
Q

Lurking variables

A

A variable, usually unobserved, that influences the association between x and y
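The sketch below illustrates the idea with made-up data: a hypothetical grouping variable z shifts both x and y, so x and y look strongly correlated overall even though they are unrelated within each level of z. The `pearson_r` helper and the data are assumptions for the example.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sqrt(sxx * syy)


# Made-up (x, y) pairs, split by the level of a lurking variable z.
group_z0 = [(1, 1), (1, 2), (2, 1), (2, 2)]
group_z1 = [(5, 5), (5, 6), (6, 5), (6, 6)]
everyone = group_z0 + group_z1


def r_for(points):
    """Correlation between x and y for a list of (x, y) pairs."""
    xs, ys = zip(*points)
    return pearson_r(xs, ys)


r_overall = r_for(everyone)   # strong positive correlation
r_within_z0 = r_for(group_z0) # 0.0: no association once z is fixed
r_within_z1 = r_for(group_z1) # 0.0

print(r_overall, r_within_z0, r_within_z1)
```

If z were not recorded, only the overall correlation would be visible, which is how a lurking variable can suggest a relationship between x and y that is not really there.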

7
Q

Types of graphs and tables used to find association

A

a) Box plot (Categorical data / Blood pressure / Caf)
b) Scatter plot (Head size / IQ)
c) Contingency table (Smoking / Lung cancer)
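For two categorical variables, a contingency table can be built by counting each combination of categories. The sketch below does this for the smoking / lung cancer pairing using made-up records (the counts are purely illustrative, not real data) and then compares the proportion with cancer in each smoking group.

```python
from collections import Counter

# Made-up records of (smoking status, outcome); the counts are illustrative only.
records = (
    [("smoker", "cancer")] * 3
    + [("smoker", "no cancer")] * 7
    + [("non-smoker", "cancer")] * 1
    + [("non-smoker", "no cancer")] * 9
)

# Each cell of the contingency table is the count of one combination.
table = Counter(records)


def cancer_rate(status):
    """Proportion of people with the given smoking status who have cancer."""
    with_cancer = table[(status, "cancer")]
    total = with_cancer + table[(status, "no cancer")]
    return with_cancer / total


rate_smoker = cancer_rate("smoker")          # 3 of 10
rate_non_smoker = cancer_rate("non-smoker")  # 1 of 10

# Print the table row by row.
for status in ("smoker", "non-smoker"):
    print(status, table[(status, "cancer")], table[(status, "no cancer")])
print(rate_smoker, rate_non_smoker)
```

Differing row proportions, as in this toy table, suggest an association between the two variables; they do not by themselves show that one causes the other.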
