Data Analysis week 3 Flashcards

Question 1

Q

What is the purpose of taking samples?

Answer

A

To estimate a population value by calculating the corresponding value in a sample.

Question 2

Q

When is a sample random?

Answer

A

If every element of the population has an equal chance of being included in the sample.

Question 3

Q

How do you assess whether an estimate is close to the population value?

Answer

A

By precision and bias.

Question 4

Q

What does high precision mean?

Answer

A

High precision means the estimates of a population value differ little from sample to sample.

Question 5

Q

What does high bias mean?

Answer

A

High bias means the estimates are, on average, far from the population value.

Question 6

Q

What does low precision mean?

Answer

A

Low precision means the estimates of a population value differ strongly from sample to sample.

Question 7

Q

What does low bias mean?

Answer

A

Low bias means the estimates are, on average, close to the population value.
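
Cards 3 to 7 can be tried out with a small simulation. This is only a sketch in base R: the population, the sample size of 30 and the "only the smallest values can be drawn" procedure are made-up assumptions, chosen so that one procedure is unbiased but imprecise and the other is precise but biased.

```r
set.seed(1)
population <- rexp(10000, rate = 1 / 50)   # hypothetical skewed population
true_mean  <- mean(population)             # the population value

# 1000 estimates of the mean from random samples of size 30
random_est <- replicate(1000, mean(sample(population, 30)))

# 1000 estimates from a non-random procedure:
# only the 5000 smallest values can end up in a sample
low_half      <- sort(population)[1:5000]
selective_est <- replicate(1000, mean(sample(low_half, 30)))

# Bias: how far the estimates are from the population value on average
bias_random    <- mean(random_est) - true_mean
bias_selective <- mean(selective_est) - true_mean

# Precision: how much the estimates vary between samples (smaller = more precise)
spread_random    <- sd(random_est)
spread_selective <- sd(selective_est)
```

Under these assumptions the selective estimates vary less between samples (higher precision) but sit far below the population value (high bias), so precision and bias have to be judged separately.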

Question 8

Q

What does n stand for in the following R call, which draws a sample from a dataset? s <- slice_sample(diamonds, n = 50)

Answer

A

The number of observations (rows) drawn from the dataset into the sample, i.e. the sample size.
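
The role of n can be checked with a base-R sketch of the same idea. The built-in mtcars dataset and the sample size of 10 are assumptions used here so that no packages are needed; this is an analogue of the call on the card, not the dplyr function itself.

```r
set.seed(123)
n <- 10                                  # sample size: number of rows to draw

# Pick n row numbers at random (without replacement) and keep those rows
rows <- sample(nrow(mtcars), n)
s    <- mtcars[rows, ]

nrow(s)                                  # number of observations in the sample
```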

Question 9

Q

What is a sampling distribution?

Answer

A

The distribution of the estimates obtained from a (large) number of repeated independent samples of the same size.
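
A sampling distribution can be approximated by simulation. The sketch below assumes a made-up population and uses the median as the estimate; the numbers of samples (2000) and observations per sample (50) are arbitrary choices.

```r
set.seed(42)
population <- rnorm(100000, mean = 170, sd = 10)   # hypothetical population

# Draw 2000 independent samples of size 50 and keep the median of each one
sampling_dist <- replicate(2000, median(sample(population, 50)))

length(sampling_dist)                       # one estimate per sample
mean(sampling_dist)                         # centre of the sampling distribution
quantile(sampling_dist, c(0.25, 0.75))      # where the middle half of the estimates lies
```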

Question 10

Q

What can you say about a sampling distribution that is biased?

Answer

A

The mean of the sampling distribution differs from the population value that is being estimated.

Question 11

Q

What is the standard error and what does it tell us?

Answer

A

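The standard error is the standard deviation of the sampling distribution. It summarizes the precision of an estimate in one number: it indicates how far an estimate from a single sample typically lies from the mean of the sampling distribution, and therefore, if the estimate is unbiased, from the population value.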
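
The definition can be checked for the sample mean. This sketch assumes a made-up population and compares the standard deviation of simulated estimates with the textbook formula for the standard error of a mean, sd / sqrt(n); the formula is specific to the mean and is not taken from the card.

```r
set.seed(7)
population <- rnorm(100000, mean = 170, sd = 10)   # hypothetical population
n <- 25                                            # sample size

# Sampling distribution of the mean: 5000 samples of size n
estimates <- replicate(5000, mean(sample(population, n)))

se_simulated <- sd(estimates)                 # standard deviation of the sampling distribution
se_formula   <- sd(population) / sqrt(n)      # textbook standard error of a mean
```

Both numbers should be close to 2 here, since the population standard deviation is about 10 and sqrt(25) is 5.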

Question 12

Q

What influence does the sample size (generally) have on bias?

Answer

A

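If the sample size increases, the bias (generally) decreases. This applies to bias that comes from the estimate itself; bias caused by a non-random sampling procedure does not shrink with larger samples.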

Question 13

Q

What influence does sample size have on precision (and on the variation between samples)?

Answer

A

If the sample size increases, the precision increases, because the variation between samples decreases.

Question 14

Q

What influence does the sample size have on the standard error (in factors)?

Answer

A

If the sample size increases by a factor k^2, the standard error decreases by a factor k. For example, a sample that is 4 times as large gives a standard error that is half as large.
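
The factor rule can be tried out for the sample mean with k = 2. The population, the helper name se_of_mean and the sample sizes 20 and 80 are assumptions made for this sketch.

```r
set.seed(3)
population <- runif(100000, min = 0, max = 100)    # hypothetical population

# Hypothetical helper: simulated standard error of the mean for sample size n
se_of_mean <- function(n, reps = 4000) {
  sd(replicate(reps, mean(sample(population, n))))
}

se_small <- se_of_mean(20)
se_large <- se_of_mean(80)        # sample size multiplied by 4 = 2^2

ratio <- se_small / se_large      # should be close to k = 2
```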

Question 15

Q

When is an estimate asymptotically unbiased?

Answer

A

If its bias gets smaller and smaller, approaching zero, as the sample size increases.
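
A standard example of a bias that shrinks with the sample size is the variance computed with divisor n instead of n - 1. The sketch below is an illustration chosen here, not taken from the card; the normal population and the helper names var_n and bias_at are assumptions.

```r
set.seed(11)
sigma2 <- 4                                   # true variance of the hypothetical N(0, 2^2) population

# Variance with divisor n (not n - 1): underestimates sigma2 on average
var_n <- function(x) mean((x - mean(x))^2)

# Simulated bias of var_n for a given sample size
bias_at <- function(n, reps = 20000) {
  mean(replicate(reps, var_n(rnorm(n, mean = 0, sd = 2)))) - sigma2
}

bias_5  <- bias_at(5)     # theory: -sigma2 / 5  = -0.8
bias_50 <- bias_at(50)    # theory: -sigma2 / 50 = -0.08
```

The bias is still negative at n = 50 but ten times smaller than at n = 5, and it keeps shrinking as n grows.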

Question 16

Q

What is bootstrapping a method for?

Answer

A

Bootstrapping is a method to estimate the sampling distribution.

Question 17

Q

What does bootstrapping result in, and what does this result stand for?

Answer

A

Bootstrapping results in a bootstrap distribution, which is an estimate of the sampling distribution.

Question 18

Q

What method of sampling do you use in bootstrapping and how does this work?

Answer

A

Sampling with replacement. You draw one sample from the population and then, from this sample, repeatedly draw new samples of the same size, with replacement.
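
The resampling step can be sketched in base R. The original sample is simulated here as a stand-in for the one sample you actually have, and the mean and the 5000 resamples are arbitrary choices for illustration.

```r
set.seed(2024)
original_sample <- rexp(100, rate = 1 / 50)   # stand-in for the single sample drawn from the population

# Each resample is drawn from the original sample, with replacement,
# and has the same size as the original sample (the default size of sample())
boot_means <- replicate(5000, mean(sample(original_sample, replace = TRUE)))

boot_se <- sd(boot_means)   # spread of the bootstrap distribution
```

Because values are put back after each draw, a resample can contain some observations several times and others not at all, which is what makes the resamples differ from each other.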

Question 19

Q

What is the 95% confidence interval, how can you interpret it, and what does it tell us about?

Answer

A

The 95% confidence interval is the interval between the 0.025 and the 0.975 quantiles of the bootstrap distribution. You can interpret this as: 95% of the bootstrap estimates fall within this interval, so it gives a range of plausible values for the population value. This tells us something about the precision of the estimate: the narrower the interval, the more precise the estimate.
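
The quantile interval can be computed with quantile(). The sketch assumes a simulated sample of 200 values and the mean as the estimate; both are illustrative choices.

```r
set.seed(99)
original_sample <- rnorm(200, mean = 170, sd = 10)   # stand-in for the observed sample

# Bootstrap distribution of the mean
boot_means <- replicate(5000, mean(sample(original_sample, replace = TRUE)))

# 95% interval: between the 0.025 and 0.975 quantiles of the bootstrap distribution
ci <- quantile(boot_means, probs = c(0.025, 0.975))

# Share of bootstrap estimates that fall inside the interval
inside <- mean(boot_means >= ci[1] & boot_means <= ci[2])
```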

(19 cards)