Lecture 1 Week 1 Histogram Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What are parametric models and what are two examples of parametric models?

A

see slide 4,5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the advantage and disadvantage of parametric models?

A

The advantage of parametric models compared to non-parametric models is that they are more efficient if the underlying assumptions are satisfied.

However, parametric models can lead to misleading results if the underlying assumptions are violated.

On the other hand, non-parametric models allow us to impose much less assumptions and obtain information that may be lost
using parametric models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the conclusion based on the results of the parametric density and non-parametric density?

A

Summary of the results:
- Parametric density: The estimated log-normal distributions seems
not to change much over time.
- Non-parametric density: The mode of the density (maximum
value of the density) seems to change over time. The mode seems
to get lower.

Conclusion: The parametric density can be too restrictive since it
imposes a certain structure on the shape for the distribution. We see
almost no variation in the distribution over time. Instead, the
non-parametric density (kernel) is more flexible and captures some
variation in the distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the conclusion based on the summary of the results of the parametric regression and the non-parametric regression?

A

Parametric regression: The relationship between log wage and
School is forced to be linear, instead, the relationship between
log wage and Exp is a quadratic function.

Non-parametric regression: The model allows for more general
relationship between the variables. For instance, it captures that the
relationship between log wage and School is flat for small values of
School.

In this course, we will explore several approaches for non-parametric
regression that range from kernel regression to Neural Networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the histogram, what does the histogram split and what is affected by the width of the bins?

A

The histogram is a non-parametric estimator of the density function
of the observations.

The histogram splits the data into intervals, called bins. Then the
frequency of the observations in each bin is represented by a vertical
bar.
The width of the bins affects the interpretation of the histogram.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How is the histogram constructed

A

slide 21

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the formal definition of an histogram?

A

slide 22

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the statistical properties of the histogram?

A

slide 26

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the expectation and bias of the histogram and what happens to the bias as h–> 0?

A

slide 27

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the variance of the histogram and what is the approximate expression for the variance and when does the variance go to 0?

A

slide 28

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the MSE of the histogram and what are some conclusions based on the MSE and when MSE goes to 0?

A

slide 29

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Explain the trade-off between bias and variance using h and the sample size n?

A

slide 30

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How to choose h?

A

slide 32

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the MISE of the histogram and what is the approximate expression for the MISE (AMISE)?
How is the ||f’||22 interpreted?

A

slide 34

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How can we select the optimal bandwidth?
What is the interpretation of the result?

A

slide 35

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is then the optimal bandwidth and what is a practical solution that is used?

A

slide 36