Biostats Linear Regression Flashcards

1
Q

Is the next step up after correlation. It is used when we want to predict the value of a variable based on the
value of another variable

A

Linear Regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The variable we want to predict is
called the dependent variable (or sometimes called as the ________________)

A

Outcome variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The variable we are using to predict the other variable’s value is called the independent variable, referred to as

A

Predictor variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

We could use _________________ to understand whether exam
performance can be predicted based on revision time; whether cigarette consumption can be predicted based on smoking duration; and so forth.

A

linear regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

If you have two or more independent
variables, rather than just one, you need to use

A

Multiple regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Is closely related to correlation

A

Linear Regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

focuses on using the relationship for prediction

A

Regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

If the relationship is perfect, the prediction is also

A

perfect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The vertical difference between the
data points and the predicted
regression line is known as the

A

Residuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

These values are squared to remove the negative numbers and then summed to give

A

SSR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The vertical difference between the
data points and the mean of the
outcome variable can be calculated.
These values are squared to remove
the negative numbers and then
summed to give the total sum of the
squares

A

SST (shows how good the mean value is as a model of the outcome scores.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The vertical difference between the
mean of the outcome variable and the
predicted regression line is now
determined. Again these values are
squared to remove the negative
numbers and then summed to give

A

model sum of squares (SSM) - how better the model is compared to just using the mean of the outcome variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

how better the model is compared to just using the mean of the outcome variable.

A

SSM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Total sum of the squares

A

SST

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

the larger the SSM the better the model is at predicting the _______ compared to the ________________. If this is accompanied by a _________ the model also has a small error

A

outcome, mean value alone, small SSR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

is similar to the coefficient of determination in correlation in that it shows how much of the variation in the outcome variable can be predicted by the predictor variable(s).

A

R2

17
Q

In regression, the model is assessed by the ___________ based on the improvement in the prediction of the _________ and the __________________. The larger the _______ the better the model.

A

F statistic, model SSM, residual error SSR, F value