Questions about concepts Exam 3 Flashcards

1
Q

What are residuals?

A

The difference between the actual (observed) value and the predicted value

2
Q

What is the line of best fit?

A

a straight line drawn on a scatter plot that best represents the trend of the data points

3
Q

What is the method of least squares?

A

The process of finding the line of best fit by minimizing the sum of squared residuals
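
To make the idea concrete, here is a minimal sketch of a least-squares fit for one predictor, using the textbook closed-form slope and intercept. The data values and the function name `fit_line` are made up for illustration. Note that the plain sum of residuals comes out near zero, which is why the quantity being minimized is the sum of squared residuals.

```python
def fit_line(x, y):
    """Return (intercept, slope) of the least-squares line for paired data."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Sums of cross-products and squares around the means.
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    return intercept, slope


# Made-up data, for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

intercept, slope = fit_line(x, y)

# Residual = actual value minus predicted value.
residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
sse = sum(e ** 2 for e in residuals)

print(intercept, slope)  # approximately 2.2 and 0.6
print(sum(residuals))    # approximately 0: positive and negative residuals cancel
print(sse)               # approximately 2.4
```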

4
Q

What is the range of values of the sample linear correlation coefficient?

A

From -1 to +1, inclusive

5
Q

What is a sample linear correlation?

A

A statistic that measures the strength and direction of the linear relationship between two variables in a sample dataset
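
As a rough illustration, the sample correlation coefficient r can be computed directly from its definition. This is a sketch with made-up data; `sample_correlation` is a hypothetical helper name, not a library function.

```python
import math


def sample_correlation(x, y):
    """Pearson sample correlation coefficient r for paired data."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    s_yy = sum((yi - y_bar) ** 2 for yi in y)
    return s_xy / math.sqrt(s_xx * s_yy)


# Made-up data, for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

r = sample_correlation(x, y)
print(r)  # approximately 0.775: a fairly strong positive linear relationship

# A perfectly linear decreasing relationship sits at the lower end of the range.
r_neg = sample_correlation(x, [10, 8, 6, 4, 2])
print(r_neg)  # -1.0
```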

6
Q

In a multiple regression model, what is the probability distribution of the error?

A

Normal, with a mean of 0

7
Q

What is the standard deviation of the error in a multiple regression model assumed to be?

A

constant for all values of the independent variables

8
Q

Are the errors assumed to be independent or dependent?

A

Independent

9
Q

How do you test the overall validity of the multiple regression model?

A

F test

10
Q

What is the null hypothesis of the F test for a model with two independent variables?

A

H0: β1 = β2 = 0
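
A minimal sketch of how the F statistic for this null hypothesis is built from the sums of squares, assuming n observations and k independent variables. The numbers below are hypothetical, and the sketch stops at the statistic itself; comparing it to an F critical value (or getting a p-value) would need an F table or a statistics library.

```python
def f_statistic(sst, sse, n, k):
    """F = MSR / MSE for testing that all k slope coefficients are zero."""
    ssr = sst - sse            # variation explained by the regression
    msr = ssr / k              # mean square for regression
    mse = sse / (n - k - 1)    # mean square for error
    return msr / mse


# Hypothetical sums of squares for a model with two independent variables.
f = f_statistic(sst=100.0, sse=40.0, n=23, k=2)
print(f)  # 15.0
```

A large F value is evidence against the null hypothesis, meaning at least one coefficient is likely nonzero.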

11
Q

What is it called when the independent variables are correlated with one another?

A

Multicollinearity

12
Q

What does extrapolation refer to?

A

Estimating the mean value of the dependent variable for values outside the experimental range

13
Q

What is part of the variable screening process?

A

1. Higher-order terms are not considered
2. Common sense should be used when eliminating variables
3. Model building begins after screening

14
Q

Which selection method can NOT be used in a high-dimensional setting where the number of samples n is less than the number of predictors?

A

Backward selection

15
Q

Which model assessment criterion should be minimized for the best model?

A

AIC

16
Q

Which model assessment criterion should be maximized?

A

Adjusted R^2
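
A hedged sketch of how the two criteria compare candidate models. The adjusted R^2 formula is the standard one; the AIC shown is one common form for least-squares fits and differs from other definitions by an additive constant, so only differences between models fitted to the same data are meaningful. All numbers and function names are hypothetical.

```python
import math


def adjusted_r2(sst, sse, n, k):
    """Adjusted R^2 for a model with k independent variables and n observations."""
    r2 = 1 - sse / sst
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)


def aic_least_squares(sse, n, k):
    """One common AIC form for least squares (assumption: constants dropped)."""
    return n * math.log(sse / n) + 2 * (k + 1)


# Two hypothetical models fitted to the same 23 observations (SST = 100).
# Model B adds three predictors but barely reduces the error sum of squares.
n, sst = 23, 100.0
adj_a = adjusted_r2(sst, sse=40.0, n=n, k=2)
adj_b = adjusted_r2(sst, sse=39.0, n=n, k=5)
aic_a = aic_least_squares(sse=40.0, n=n, k=2)
aic_b = aic_least_squares(sse=39.0, n=n, k=5)

print(adj_a > adj_b)  # True: model A has the higher adjusted R^2
print(aic_a < aic_b)  # True: model A has the lower AIC
```

Both criteria penalize the extra predictors, so here they agree that the smaller model is preferable.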

17
Q

What is the method of least squares?

A

Finds the line of best fit by minimizing the sum of squared residuals

18
Q

A high-leverage point is extreme in what? An outlier is extreme in what?

A

A high-leverage point is extreme in x (independent variable); an outlier is extreme in y (dependent variable)
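
To illustrate "extreme in x", here is a small sketch of leverage values for simple linear regression with an intercept, where h_i = 1/n + (x_i - x_bar)^2 / S_xx. The data and the name `leverages` are made up; with several predictors, leverage comes from the hat matrix instead.

```python
def leverages(x):
    """Leverage of each observation in simple linear regression with an intercept."""
    n = len(x)
    x_bar = sum(x) / n
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    return [1 / n + (xi - x_bar) ** 2 / s_xx for xi in x]


# Made-up x values: the last one lies far from the others.
x = [1, 2, 3, 4, 20]
h = leverages(x)

print(h[-1])   # approximately 0.984, far larger than the other leverages
print(sum(h))  # approximately 2, the number of estimated coefficients
```

Leverage depends only on the x values, which is why a point can have high leverage without being an outlier in y.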

19
Q

If the standardized residual is greater than 3 or less than -3, what is the observation considered to be?

A

Outlier
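
A minimal sketch of the rule, assuming the simple standardization "residual divided by the standard error of estimate" with k independent variables. Some texts divide by a leverage-adjusted standard error instead, so treat this as one possible version. The residuals and the helper name `flag_outliers` are hypothetical.

```python
import math


def flag_outliers(residuals, k, cutoff=3.0):
    """Standardize residuals and return the indices beyond +/- cutoff."""
    n = len(residuals)
    # Standard error of estimate: sqrt(SSE / (n - k - 1)).
    s_e = math.sqrt(sum(e ** 2 for e in residuals) / (n - k - 1))
    standardized = [e / s_e for e in residuals]
    flagged = [i for i, z in enumerate(standardized) if abs(z) > cutoff]
    return standardized, flagged


# Hypothetical residuals: 19 small ones and one large one (they sum to zero).
residuals = [-6 / 19] * 19 + [6.0]
standardized, flagged = flag_outliers(residuals, k=1)

print(flagged)  # [19]
```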

20
Q

If a correlation coefficient in the correlation matrix is bigger than 0.8 in absolute value, what issue does that indicate?

A

Multicollinearity
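
The 0.8 rule of thumb can be sketched as a scan over every pair of independent variables. The predictor values and the names `correlation` and `flag_collinear_pairs` are made up for illustration.

```python
import math
from itertools import combinations


def correlation(x, y):
    """Pearson sample correlation coefficient for paired data."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    s_yy = sum((yi - y_bar) ** 2 for yi in y)
    return s_xy / math.sqrt(s_xx * s_yy)


def flag_collinear_pairs(predictors, threshold=0.8):
    """Return (name_a, name_b, r) for every pair with |r| above the threshold."""
    flagged = []
    for (name_a, a), (name_b, b) in combinations(predictors.items(), 2):
        r = correlation(a, b)
        if abs(r) > threshold:
            flagged.append((name_a, name_b, r))
    return flagged


# Made-up predictors: x2 is roughly twice x1, x3 is unrelated to both.
predictors = {
    "x1": [1, 2, 3, 4, 5],
    "x2": [2.1, 3.9, 6.2, 8.1, 9.8],
    "x3": [5, 1, 4, 2, 3],
}

flagged = flag_collinear_pairs(predictors)
print([(a, b) for a, b, _ in flagged])  # [('x1', 'x2')]
```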