Chapters 36 and 37-Beyond Simple Linear Models Flashcards

1
Q

What does nonlinear regression not mean?

A

Nonlinear regression does not simply mean that the graph of X vs Y will be curved.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does nonlinear regression mean?

A

Nonlinear regression means that Y is not linear with respect to the parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an example of nonlinear regression?

A

The Michaelis-Menten equation for enzyme activity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

For what should nonlinear models should usually be chosen for?

A

Nonlinear models should usually be chosen for theoretical reasons, not just by trying to fit a curved line equation to the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What kind of equation is rarely appropriate for nonlinear regression?

A

Polynomial equations are rarely appropriate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How would polynomial equations possibly be useful?

A

They might be useful for limited interpolation. These equations also make programs like MS Excel easier to fit data than truly nonlinear models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why are polynomial equations still considered linear?

A

Because Y is linear with respect to the parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Finding the best fit parameters for a nonlinear equation is usually done how?

A

By many computer programs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Like linear regression, what does nonlinear regression find?

A

Finds parameter values that minimize the sum of squares of the difference between actual and predicted Y values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does multiple regression do?

A

It extends linear regression to allow multiple independent (X) variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is an example of multiple regression?

A

Distinguish between effects of lead exposure vs age on kidney function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Usually, multiple regression is considered distinct from _______.

A

multivariate analyses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Multivariate usually means…

A

that there are multiple Y variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Example of multivariate?

A

How does soil texture affect quantities of all plant species in a community?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a Regression Coefficient?

A

A parameter explaining the relationship of an independent variable to the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

In simple linear regression, _______ is a regression coefficient.

A

slope

17
Q

In multiple linear regression, how many regression coefficients are there?

A

Two or more

18
Q

What do regression coefficients quantify?

A

Quantifies the relative impact of each independent variable on the dependent variable: ie the amount of change in Y for each unit of change in X

19
Q

Why is the regression coefficients chosen?

A

To minimize the sum of squares

20
Q

Dummy variable

A

Allows a discrete variable to be used. Presence value=1

Absence value=0

21
Q

What does the P value mean for regression coefficient?

A

P values test the null hypothesis that the coefficient is 0.

22
Q

What is a problem of finding the P value for a regression coefficient?

A

R2 goes up automatically when the number of independent variables increase.

23
Q

Adjusted R2

A

The adjusted R2
is always smaller and depends
on the # of X variables (decrease with) and #
of samples (increases with).

24
Q

Assumptions of multiple linear regression

A

Independent observations from one population
• The random component is Gaussian
• Linear relationships only (doesn’t guarantee a linear
relationship, but doesn’t detect other relationships)
• No interaction beyond that specified in the model
(e.g. assuming that the effect of lead is not greater
when age is greater), but interaction terms can be
created

25
Q

What if you don’t know which factors are important for

causing your dependent variable observations?

A

Could use an automatic selection process to choose
independent variables (from many for which you have
observations) that significantly improved your ability to
predict the Y values.

26
Q

e.g. forward-stepwise selection:

A

start with a very
simple model and use a computer to choose the X
variable that most improves the prediction.
• If significant keep it and choose another X
variable that best improves the prediction further
– until you reach a point where the best
additional variable cannot provide significant
improvement.
• Note: you just performed multiple comparisons

27
Q

How is automatic selection useful?

A

It may detect relationships that were not already obvious

28
Q

How should you approach using information from automatic selection?

A

because this involves multiple comparisons, the
R
2
, CIs, and P values cannot be trusted (if reported).

29
Q

Therefore, automatic selection must be…

A

considered an exploratory
analysis to generate the model (inductive process).
• If the model that you generate results in X variables
that make sense scientifically, then you could do a
completely independent analysis to test if those
variables predict Y significantly.

30
Q

Adding X variables will automatically

A

increase the R2
• Sample size needs to be much larger than the
number of X variables (10-40 times greater).

31
Q

Multicollinearity:

A

many X variables are strongly
correlated; adding correlated X variables will not
help predict Y better; avoid entering correlated Xs.

32
Q

Interaction:

A

may need to account for instances
when the effect of a particular X depends on the
value of another X. Use an interaction term.