5. Bias Flashcards

Question 1

Q

What are two types of graphs used for detecting outliers?

Answer

A

Scatterplots (less helpful with several predictors)
Histograms

Question 2

Q

In an average sample, 95% of standardised residuals should lie between…

Question 3

Q

In an average sample, 95% of standardised residuals should lie between…

Question 4

Q

Any case for which the absolute value of the standardised residual is 3 or more, is likely to be…

Answer

A

An outlier

Question 5

Q

What does Cook’s distance measure?

Answer

A

Measures the influence of a single case on the model as a whole

Question 6

Q

In Cook’s distance, values greater than __ may be cause for concern

Question 7

Q

DF beta statistics tell us…

Answer

A

The change in b when a case is removed

Question 8

Q

With DF beta statistics, be wary of standardised values with absolute values…

Question 9

Q

The population model should have what two errors?

Answer

A

Homoscedastic errors
Independent errors

Question 10

Q

The relationship between predictor(s) and outcome is…

Question 11

Q

The combined effect of predictors is…

Question 12

Q

A model’s errors refer to what?

Answer

A

The differences between predicted values and observed values of the outcome variable in the population model
These values cannot be observed

Question 13

Q

A model’s residuals refer to what?

Answer

A

The differences between predicted values and observed values of the outcome variable in the sample model
These values can be observed and are representative of the population model errors

Question 14

Q

What does “Errors should be independent” mean?

Answer

A

The population error in prediction for one case should not be related to the error in prediction for another case

Question 15

Q

What does “errors should be homoscedastic” mean?

Answer

A

Variance of population errors (residuals) should be consistent at different values of the predictor variable

Question 16

Q

bs are unbiased but not optimal. Standard error is incorrect. Therefore…

Answer

Study These Flashcards

A

t-tests, p-values and confidence intervals will also be incorrect

Question 17

Q

p-values associated with the bs of the model assume that…

Answer

Study These Flashcards

A

the test statistic associated with them follows a normal distribution

Question 18

Q

What are the three concepts of “The bootstrap”
1. Standard errors are derived empirically using a ____ technique
2. Results in robust ____ and ____
3. Designed for ____ samples (when normality matters)

Answer

Study These Flashcards

A

resampling
confidence intervals, p-values
small

5. Bias Flashcards

(18 cards)