TENTAPLUGG Flashcards

1
Q

What are properties of good estimators?

A
  1. It is unbiased/validity. The expected value of the sample parameter is equal to the population parameter.
  2. It is reliable/efficient. The estimator has an as small variance as possible. Generally decreases when sample size increases
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which starting points for CI and hypothesis testing are approximate because of CLT (single population)?

A

Case 3 and Case 4

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the assumptions for Case 1 (single pop.)?

A
  • Xi follows a normal distribution

- Sigma2 (pop. variance) is known.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the assumptions for Case 2 (single pop.)?

A
  • Xi follows a normal distribution

- Sigma2 (pop. variance) is UNknown

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the assumptions for Case 3 (single pop.)?

A
  • Xi follows an unknown distribution
  • Sigma2 is unknown
  • n >= 30
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the assumptions for Case 4 (single pop.)?

A
  • n >= 40
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the assumptions for Case 5 (single pop.)?

A
  • Xi follows a normal distribution

- mew and Sigma2 are unknown

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Which starting points for CI and hypotheses testing are approximate (double populations)?

A
  • Case 2 (round down v)

- Case 3 and Case 5, from CLT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the assumptions for Case 1 (double pop.)?

A
  • Independent samples
  • Xi, Yj follows normal distributions
  • mewX, mewY, Sigma2X, Sigma2Y are unknown, but Sigma2X=Sigma2Y–> pooled variance S2p
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the assumptions for Case 2 (double pop.)?

A
  • Independent samples
  • Xi, Yj follow normal distributions
  • mewX, mewY, Sigma2X, Sigma2Y are unknown, Sigma2X & Sigma2Y are UNEQUAL
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the assumptions for Case 3 (double pop.)?

A
  • Independent samples
  • Xi, Yj follow unknown distributions
  • mews and sigma2s are unknown (may or may not be equal)
  • nx, ny >= 30
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the assumptions for Case 4 (double pop.)?

A
  • DEPENDENT samples (matched pair)
  • Xi, Yi follows normal distributions, –> Di follows normal distribution
  • mews and sigma2s are unknown
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the assumptions for Case 5 (double pop.)?

A
  • Independent samples

- nx, ny >=40

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the assumptions for Case 6 (double pop.)?

A
  • Independent samples
  • Xi, Yj follows normal distributions
  • mews and sigma2s are unknown
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is special about the CI for sigma2x/sigma2y (Case 6)?

A

The CI in the formula sheet is for sigma2Y/sigma2X, so switch order of x- and y-terms.

Fa;b;0,95 = 1 / Fb;a;0,05

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the four essential concepts of hypotheses testing?

A
  1. A null hypothesis H0, and an alternative hypothesis H1
  2. A significance level of the test, alpha
  3. A test statistic
  4. A decision rule
17
Q

What are the four possible scenarios of hypothesis testing?

A
  1. P(Accept H0 I H0 is true) = 1-alpha
  2. P(Accept H0 I H1 is true) = beta (type 2 error/false negative)
  3. P(Reject H0 I H0 is true) = alpha (type 1 error/false positive)
  4. P(Reject H0 I H1 is true) = 1-beta
18
Q

What is the power of a test (hypothesis testing)?

A

The ability to correctly reject H0 when it is false.

P(Reject H0 I H1 is true) = 1-beta

19
Q

What is the p-value (hypothesis testing)?

A

The smallest significance level alpha, at which H0 can be rejected.
General rule: Reject H0 if p-value < given alpha

20
Q

How can we test for correlations?

A

With the assumption that we have a random sample from a joint normal distribution, we can use Pearson’s test for correlations.

21
Q

What are the assumptions for the SLR model?

A
  1. Linearity
  2. The Xi values are fixed/known
  3. E(error term i) = 0. V(error term i) =Sigma2(error term)
  4. E(error term i ; error term j) = 0 (i not equal to j)
  5. Error term i follows a normal distribution
22
Q

What is the residual?

A

e(i) = y(i) - y-hat(i).

Deviation between estimated value and observed value.

23
Q

What is S2(error term)?

A

An estimator of the error term.

24
Q

What parts does ANOVA consist of?

A
  • SST: sum of squares total. variation in dependent variable.
  • SSR: sum of squares regression. model variation
  • SSE: sum of squares error. residual variation
  • R2-value: how many percent of the variation in the data that are explained by the model. R2>0,5

FOR MLR:
- Adjusted R2-value. Protects against overfitting.

25
Q

What is the optimal predictor for both individual and aggregate/average predictions (SLR), and how do we find CIs?

A

y-hat(n+1) = b0 + b1*x(n+1)

individual: y-hat(n+1) +- t(alpha/2)(n-2) * s(e) * s(y-hat, n+1)
Aggregate: y-hat(n+1) +- t(alpha/2)(n-2) * s(e) * s(E(……))

26
Q

What are the assumptions of the MLR model?

A
  1. Linearity
  2. The x1i, … , xki values are fixed/known
  3. E(error term i) = 0. V(error term i) =Sigma2(error term)
  4. E(error term i ; error term j) = 0 (i not equal to j)
  5. The independent variables (x) are not perfectly related (no multicollinearity problem)

FOR CI and HT
6. Error term i follows a normal distribution

27
Q

What is multicollinearity and how do we detect it?

A

Independent variables are perfectly related.

Signs of problem:

  • R2 values are close to 1 but non/few of the variables are significant
  • Spurious/wrong signs of coefficients

Detect:

  • Sample correlation higher than ex 0,8?
  • Variance Inflation Factor, VIF > 10?
  • Tolerance factor: 1/VIF < 0,1?