ANOVA Flashcards

1
Q

What is the regression error sum of squares (SSE)?

A

SSE (Error Sum of squares) is variation due to factors other than the relationship between X and Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the regression sum of squares (SSR)?

A

SSR (Sum of squares) is a variation that is explained by the relationship between X and Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How is the SST value calculated?

A

SST is the total sum of squares. It is deermined by adding SSR and SSE to get a complete measure of variation of the Yi values around their mean (Ȳ).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does the correlation coefficient (r) measure?

A

The correlation coefficient measures the strength of the relationship between X and Y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the coefficient of determination(r2)?

A

The coefficient of determination is the regression sum of squares, divided by the total sum of squares.

r2 = SSR/SST

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the difference between:

coefficient of determination and correlation coefficient?

A

The correlation coefficient (r) is multiplied by itself to get the coefficient of determination (r2).

The coefficient of determination (r2)demonstrates a percentage of variation in y, which is explained by all the x variables in the model. This value is always between 0 and 1.

The coefficient of correlation (r) is the degree of relationship between two variables, i.e. x and y. It can fall between -1 and 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is another term for the concept of “Residual”?

A

Estimated error value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the equation to calculate residuals (estimated error value) for any point in a regression model?

A

ei = Yi - Ŷi

Where Yi is the observed value and Ŷi is the predicted value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What table is displayed in the image? What is it for?

A

ANOVA table.

It is used to summarize regression terms applicable to models being evaluated. Especially to determine the F test statistic and p-value for hypothesis testing against a regression model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

For a straight line probabilistic model, there are 6 possible relationship types. WHat are they?

(Hint: PL, NL, PC, NC, UC, NA)

A

Positive linear

Negative linear

Positive Curvilinear

Negative curvilinear

U-shaped Curvilinear

No relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What part of the simple linear regression model is the deterministic component?

A

Y=β01X+ε is the simple linear regression model.

E(y) =β01X is the deterministic component of the regression model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the model equation for a simple linear regression model?

A

Y=β01X+ε

Where Y = dependent variable (aka response variable)

β0 = y-intercept for the population

β1 = slope for the population

X = independent variable (aka predictor of Y)

ε = random error component

Reminder: without the ε component, the equation

E(y) =β01X is the deterministic component of the regression model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are three measures of variation to observe when working with linear regression models?

A

SSR (regression sum of squares)

SSE (error sum of squares)

SST (total sum of squares)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do we calculate the SST (total sum of squares)?

This is the total sum of squares.

A

∑(Yi - Ȳ)2

This reads as the sum of the difference between the observed Y and the average of Y, squared.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do we calculate the SSE (error sum of squares)?

This is the unexplained variation or error sum of squares.

A

∑(Yi - Ŷi)2

This reads as the sum of the difference between the observed Y and the predictedY, squared.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do we calculate the SSR (regression sum of squares)?

This is the explained variation or regression sum of squares.

A

∑(Ŷi - Ȳ)2

This reads as the sum of the difference between the predicted Y and the average of Y, squared.

17
Q

If a coefficient of determination (r2) is found to be 0.817, what does this mean?

A

It means that 81.7% of changes in Y can be explained by X.

R2 is usually explained in terms of percentage.

18
Q

What is the standard error of the estimate (Sxy) used to determine?

A

The standard error of the estimate is used to measure the variability of the observed Y from the predicted Y. This also can be described as the measure of variation of an observation made around the computed regression line.

It is similar to a standard deviation (which measures the variation in the set of data from the mean).

19
Q

How is the standard error of the estimate calculated?

A

Sxy = √SSE/(n-2)

20
Q

Which of the outputs in the ANOVA table are used to calculate the standard error of the estimate (Sxy)?

A

The standard error of the estimate is calculated using the SSE.

Sxy = √SSE/(n-2)

21
Q

What coefficient measures the strength between Y and X?

A

Regression coefficient

22
Q

What measures variation due to factors other than the relationship between X and Y?

A

Error sum of squares SSE

23
Q

What measures how much change in Y can be explained by X?

A

Coefficient of determination