Chapters 36 and 37-Beyond Simple Linear Models Flashcards

1
Q

What does nonlinear regression not mean?

A

Nonlinear regression does not simply mean that the graph of X vs Y will be curved.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does nonlinear regression mean?

A

Nonlinear regression means that Y is not linear with respect to the parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an example of nonlinear regression?

A

The Michaelis-Menten equation for enzyme activity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

For what should nonlinear models should usually be chosen for?

A

Nonlinear models should usually be chosen for theoretical reasons, not just by trying to fit a curved line equation to the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What kind of equation is rarely appropriate for nonlinear regression?

A

Polynomial equations are rarely appropriate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How would polynomial equations possibly be useful?

A

They might be useful for limited interpolation. These equations also make programs like MS Excel easier to fit data than truly nonlinear models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why are polynomial equations still considered linear?

A

Because Y is linear with respect to the parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Finding the best fit parameters for a nonlinear equation is usually done how?

A

By many computer programs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Like linear regression, what does nonlinear regression find?

A

Finds parameter values that minimize the sum of squares of the difference between actual and predicted Y values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does multiple regression do?

A

It extends linear regression to allow multiple independent (X) variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is an example of multiple regression?

A

Distinguish between effects of lead exposure vs age on kidney function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Usually, multiple regression is considered distinct from _______.

A

multivariate analyses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Multivariate usually means…

A

that there are multiple Y variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Example of multivariate?

A

How does soil texture affect quantities of all plant species in a community?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a Regression Coefficient?

A

A parameter explaining the relationship of an independent variable to the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

In simple linear regression, _______ is a regression coefficient.

17
Q

In multiple linear regression, how many regression coefficients are there?

A

Two or more

18
Q

What do regression coefficients quantify?

A

Quantifies the relative impact of each independent variable on the dependent variable: ie the amount of change in Y for each unit of change in X

19
Q

Why is the regression coefficients chosen?

A

To minimize the sum of squares

20
Q

Dummy variable

A

Allows a discrete variable to be used. Presence value=1

Absence value=0

21
Q

What does the P value mean for regression coefficient?

A

P values test the null hypothesis that the coefficient is 0.

22
Q

What is a problem of finding the P value for a regression coefficient?

A

R2 goes up automatically when the number of independent variables increase.

23
Q

Adjusted R2

A

The adjusted R2
is always smaller and depends
on the # of X variables (decrease with) and #
of samples (increases with).

24
Q

Assumptions of multiple linear regression

A

Independent observations from one population
• The random component is Gaussian
• Linear relationships only (doesn’t guarantee a linear
relationship, but doesn’t detect other relationships)
• No interaction beyond that specified in the model
(e.g. assuming that the effect of lead is not greater
when age is greater), but interaction terms can be
created

25
What if you don’t know which factors are important for | causing your dependent variable observations?
Could use an automatic selection process to choose independent variables (from many for which you have observations) that significantly improved your ability to predict the Y values.
26
e.g. forward-stepwise selection:
start with a very simple model and use a computer to choose the X variable that most improves the prediction. • If significant keep it and choose another X variable that best improves the prediction further – until you reach a point where the best additional variable cannot provide significant improvement. • Note: you just performed multiple comparisons
27
How is automatic selection useful?
It may detect relationships that were not already obvious
28
How should you approach using information from automatic selection?
because this involves multiple comparisons, the R 2 , CIs, and P values cannot be trusted (if reported).
29
Therefore, automatic selection must be...
considered an exploratory analysis to generate the model (inductive process). • If the model that you generate results in X variables that make sense scientifically, then you could do a completely independent analysis to test if those variables predict Y significantly.
30
Adding X variables will automatically
increase the R2 • Sample size needs to be much larger than the number of X variables (10-40 times greater).
31
Multicollinearity:
many X variables are strongly correlated; adding correlated X variables will not help predict Y better; avoid entering correlated Xs.
32
Interaction:
may need to account for instances when the effect of a particular X depends on the value of another X. Use an interaction term.