Lecture 5- Bias Flashcards

1
Q

Methods of detecting outliers and influential cases

A
  • Graphs
  • Standardised residual
  • Cook’s distance
  • DF beta statistics (unstandardised and standardised)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How to detect outliers and influential cases using standardised residual

A
  • 95% of standardised residuals SHOULD lie between +/- 2
  • 99% of standardised residuals SHOULD lie between +/- 2.5
  • If the absolute value of standardised residuals is +/- 3, is likely to be an outlier
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How to detect outliers and influential cases using Cook’s distance

A
  • Measure the influence of a single case on the model as a whole
  • Absolute values greater than 1 may be cause for concern
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How to detect outliers and influential cases using DF beta statistics

A
  • The change in b when a case is removed

- Be wary of standardised values with absolute values > 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

The population model should have

A
  • Homoscedastic errors

- Independent errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Key assumptions of the general linear model

A
  • Linearity and additivity
  • Spherical errors
  • Normality of something or other
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A models errors refer to the

A

Differences between predicted values and observed values of the outcome variable in the population model
These values cannot be observed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

A model’s residuals refer to the

A

Differences between predicted values and observed values of the outcome variable in the sample model
These values can be observed and are representative of the population model errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The population error in prediction for one case should

A

Not be related to the error in prediction for another case (autocorrelation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Because population errors can’t be observed

A

Sample residuals are inspected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Variance of population errors (residuals) should be

A

Consistent at different values of the predictor variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Violation of the assumption in spherical errors

A
  • bs are unbiased but not optimal

- Standard error is incorrect, therefore t-tests, p-values and confidence intervals will also be incorrect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Do normality of model errors matter

A

Not really

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When errors are normally distributed b will be

A

Unbiased and optimal but there may be classes of estimator that are more accurate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

p-values associated with the bs of the model assume that

A

The test statistic associated with them follows a normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Confidence intervals for bs are constructed using

A

The standard error, which is derived from a sampling distribution of b that is assumed to be normal

17
Q

What are robust procedures

A
  • The bootstrap
  • Heteroskedasticity
  • Consistent standard errors
18
Q

What is the bootstrap procedure

A
  • Standard errors are derived empirically using a resampling technique
  • Results in robust confidence intervals and p-values
  • Designed for small samples where normality matters
19
Q

What are DF beta statistics

A
  • How much beta changes when a case is removed

- Wary of standardised values with absolute values greater than 1

20
Q

Normality is relevant to

A
  • Population model errors

- Sampling distribution

21
Q

Independent observations tend to lead to

A

Independent errors