Stats. T2 Flashcards

Question 1

Q

What is the difference between a parametric and non-parametric statistical model?

Answer

A

a parametric model that assumes a parametric family, such as a binomial or a poisson. A non parametric doesn’t.

Question 2

Q

How can you estimate the method of moments

Answer

A

Identify the parameters to estimate
2.
Calculate the necessary sample moments, where the formula for the kith sample moment is
1/n(sum(xi^k)
3.
Set the sample moments equal to the theoretical moments and solve for the parameters:
m1 = μ
m2 =. μ^2. + σ^2

Question 3

Q

What is a prediction interval

Answer

A

A prediction interval is an interval of values with a specified probability of containing the future data value

Question 4

Q

What is the change of variable formula

Answer

A

f(h^-1(y)) = d/dy(h^-1)(y)

Question 5

Q

What is the moment generating function

Answer

A

a mathematical tool used to characterize its probability distribution by generating its moments. 𝑀𝑋(𝑡) = 𝐸(𝑒𝑡𝑋).
where X is a random variable

Question 6

Q

When do random variables X and Y have the same distribution?

Answer

A

When the MGF are the same

Question 7

Q

what is the theorem that links the mgt and derivatives

Answer

A

if an MGF(x) of t exists, then the kth moment of E(x^k) canbefound by taking the K’th derivative of MGFx(T) and evaluating at 0

Question 8

Q

What is the central limit theorem

Answer

A

If you take sufficiently large random samples from any population (with finite mean and variance) and compute their means, the distribution of those sample means will be approximately normal (Gaussian), regardless of the original population’s distribution.

Question 9

Q

What is the sample distribution

Answer

A

The sampling distribution is the probability distribution of a given statistic (e.g., mean, variance, proportion) computed from multiple random samples of the same size drawn from a population.

Question 10

Q

What is the standard error

Answer

A

The standard error is the standard deviation of the sampling distribution. The standard deviation of an estimate of the sampling distribution is called the estimated standard error.

Question 11

Q

What is the mean squared error of an estimator

Answer

A

Let 𝜃̂ be an estimator for 𝜃. The mean squared error of it is
=𝐸 {(𝜃^−𝜃)^2 }.
The mse is the expected squared distance between estimates, 𝜃, and the true
value, 𝜃. A large mse means that estimates tend to be a long way from the true value; a small mse means that estimates tend to be close to the true value. So we prefer estimators with small mse.

Question 12

Q

what is the bias

Answer

A

The bias of 𝜃^ is Bias(𝜃^) = E(𝜃^)-𝜃
Given two unbiased estimators, we would prefer the one with smaller variance (and hence smaller mean squared error). If one estimator has a smaller bias but a larger variance than another estimator then we have to decide whether bias or variance is more important, or just prefer the one with smaller mean
squared error.

Question 13

Q

What is a confidence interval

Answer

A

An interval estimator that contains 𝜃 with probability 𝑝 for all true values of 𝜃 is called a 𝑝-confidence interval, and 𝑝 is called the confidence level or coverage of the confidence interval

Question 14

Q

what is a simple and a composite hypothesis

Answer

A

A hypothesis that specifies the model distribution completely is known as a simple hypothesis. A hypothesis that does not specify the model distribution completely is called a composite hypothesis

Question 15

Q

What is the critical region in a hypothesis test

Answer

A

The set of outcomes for which you will reject 𝐻0 is called the critical region

Question 16

Q

what is a type 1 or 2 error

Answer

A

Rejecting the null hypothesis when it is true is known as a type I error; failing to reject the null hypothesis when it is false is known as a type II error.

Question 17

Q

What is the power of a hypothesis test

Answer

A

This latter probability is known as the power of the test and equals the probability that we shall (correctly) reject the null hypothesis when the al- ternative hypothesis is true.

Question 18

Q

how do you conduct a hypothesis test

Answer

A

Write down your null and alternative hypotheses, 𝐻0 and 𝐻1, about the population quantity.
Choose the significance level, 𝛼, and power, 𝛽, at which you wish to conduct the test.
Find a critical region, 𝐶, for which the significance level is 𝛼.
Find the sample size, 𝑛, for which the power is 𝛽.
Collect your data.
If the data fall inside the critical region then reject 𝐻0 in favour of 𝐻1.

Question 19

Q

what is a test statistic(T)

Answer

A

a value collected from sample data that determines whether to reject the null hypothesis

Question 20

Q

What is a z test

Answer

A

a hypothesis test used when the SD is known the sample is large enough.

z = (x-mu)/(variance-root(n)))

Question 21

Q

what is the test statistic when variance is unknown but population is large

Answer

A

Y-mu/sampleSD-root(n)

Question 22

Q

What is the P value

Answer

A

how improbable our observed test statistic would be if the null hypothesis were true

Question 23

Q