Basic Questions Flashcards

1
Q

Define an experiment

A

Any process of measurement or observation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define a random variable

A

A variable that takes on values according to the outcome of an experiment. Can either be continuous or discrete.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define covariance and correlation

A

The measure of dependence between two random variables. Correlation is a standardized measure of this dependence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define a random sample

A

A collection of observations from a population that are independent and identically distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define an unbiased estimator

A

An estimator whose theoretical expectation is the true value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define a consistent estimator and any implications

A

An estimator which converges in probability to the true value. As n increases the distribution of the estimator will collapse around the true value. If an estimator is unbiased, the distribution of the estimator is centered at the true value for any fixed n.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Types of least squares estimators and properties of LSE

A

1) Ordinary least squares: assumes errors are iid with same variance and expectation of zero
2) Weighted least squares: allows unequal variances but still assumes iid
3) Generalized least squares: allows unequal variances and correlated errors
Properties: unbiased, minimum variance among all linear functions, do not need to know distribution, do need to know the model (eg Y=XB+e)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define maximum likelihood in words and the overall properties

A

It is the value which maximizes the likelihood function of our parameter space, ie the MLE is the value that makes our sample most likely
Properties: Usually consistent, asymptotically most efficient, invariant to transformations. Often biased, do need to know distribution, may not exist

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define a pivot

A

A pivot is a function of the data and the unknown parameter whose distribution is known and does not depend on the parameter of interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Steps to find a confidence interval, correct interpretation

A

1) Find a pivot for the parameter
2) Write a probability statement involving the pivot
3) Rearrange to get CI for parameter
Interpretation: The probability that the interval includes the parameter is 1-alpha. In other words, if we were to draw many samples we would obtain many CI’s and 1-alpha% of those CI’s would include the true parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Explain the difference between statistical significance and practical importance.

A

Statistical significance deals with the question: how sure are we that our null hypothesis is wrong? This will deal with p-values.
Practical importance deals with the question: How wrong is our null hypothesis? This is measured by something independent of sample size (usually effect size).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When to use a nonparametric test?

A

It is similar to t-test, use when data is not normal. The sample is considered to be iid from a symmetric, continuous distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define a categorical variable

A

A variable whose support is a set of categories, eg hair color

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In probability, what is an event?

A

A subset of the sample space

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Difference between ANOVA and regression?

A

ANOVA can be applied to any regression model (no matter if the model contains only continuous, only categorical, or both kinds of predictors). ANOVA assesses the impact of a predictor or a whole set of predictors on the residuals: how much of the variation in the data can be explained by these predictors? The regression analysis, on the other hand, is a complementary tool to asses the quantitative relation between a predictor and the response.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly