Lecture 1 Week 1 Histogram Flashcards

Question 1

Q

What are parametric models and what are two examples of parametric models?

Answer

A

see slide 4,5

Question 2

Q

What is the advantage and disadvantage of parametric models?

Answer

A

The advantage of parametric models compared to non-parametric models is that they are more efficient if the underlying assumptions are satisfied.

However, parametric models can lead to misleading results if the underlying assumptions are violated.

On the other hand, non-parametric models allow us to impose much less assumptions and obtain information that may be lost
using parametric models.

Question 3

Q

What is the conclusion based on the results of the parametric density and non-parametric density?

Answer

A

Summary of the results:
- Parametric density: The estimated log-normal distributions seems
not to change much over time.
- Non-parametric density: The mode of the density (maximum
value of the density) seems to change over time. The mode seems
to get lower.

Conclusion: The parametric density can be too restrictive since it
imposes a certain structure on the shape for the distribution. We see
almost no variation in the distribution over time. Instead, the
non-parametric density (kernel) is more flexible and captures some
variation in the distribution.

Question 4

Q

What is the conclusion based on the summary of the results of the parametric regression and the non-parametric regression?

Answer

A

Parametric regression: The relationship between log wage and
School is forced to be linear, instead, the relationship between
log wage and Exp is a quadratic function.

Non-parametric regression: The model allows for more general
relationship between the variables. For instance, it captures that the
relationship between log wage and School is flat for small values of
School.

In this course, we will explore several approaches for non-parametric
regression that range from kernel regression to Neural Networks.

Question 5

Q

What is the histogram, what does the histogram split and what is affected by the width of the bins?

Answer

A

The histogram is a non-parametric estimator of the density function
of the observations.

The histogram splits the data into intervals, called bins. Then the
frequency of the observations in each bin is represented by a vertical
bar.
The width of the bins affects the interpretation of the histogram.

Question 6

Q

How is the histogram constructed

Question 7

Q

What is the formal definition of an histogram?

Question 8

Q

What are the statistical properties of the histogram?

Question 9

Q

What is the expectation and bias of the histogram and what happens to the bias as h–> 0?

Question 10

Q

What is the variance of the histogram and what is the approximate expression for the variance and when does the variance go to 0?

Question 11

Q

What is the MSE of the histogram and what are some conclusions based on the MSE and when MSE goes to 0?

Question 12

Q

Explain the trade-off between bias and variance using h and the sample size n?

Question 13

Q

How to choose h?

Question 14

Q

What is the MISE of the histogram and what is the approximate expression for the MISE (AMISE)?
How is the ||f’||22 interpreted?

Question 15

Q

How can we select the optimal bandwidth?
What is the interpretation of the result?

Question 16

Q

What is then the optimal bandwidth and what is a practical solution that is used?

Answer

Study These Flashcards

A

slide 36

Lecture 1 Week 1 Histogram Flashcards

(16 cards)