Sampling And Estimation Flashcards

1
Q

A method of soliciting a sample in such a way that each item or person in the population being studied has the same likelihood of being in the sample

A

Simple Random Sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The difference between a sample statistic and its corresponding population parameter

A

Sample Error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

A technique that uses a classification system to separate the population into smaller groups based on one or more distinguishing characteristics

A

Stratified Random Sampling

i.e. bond indexing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Data which consists of observations taken over a period of time at a specific and equally spaced time interval.
i.e. Time - Series – set of monthly returns on Microsoft stock from Jan 1994 to Dec 2004.

A

Time Series Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Data that are a sample of observation taken at a single point in time.
i.e. Reported earnings per share of all NASDAQ companies as of Dec. 31, 2014.

A

Cross Sectional Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Observations over time of multiple characteristics of the same entity, such as unemployment, inflation anf GDP growth rates, for a country over 10 years.

A

Longitudinal Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Data that contains observations over time of the same characteristic for multiple entities, such as debt/equity ratios for 20 companies over 24 quarters.

A

Panel Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Theorem that states for simple random samples of size n, from a population with a mean u, and a finite variance, sigma^2, the sampling distribution of the sample mean, Xbar, approaches a normal probability distribution with mean u, and a variance equal to sigma^2 / N as the sample size becomes large.

A

Central Limit Theorem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Properties of CLT

A
  • If the sample size, n, is sufficiently large (n>= 30), the sampling distribution of the sample means will be approximately normal.
    • The mean of the population, u, and the mean of the distribution of all possible sample means are equal.
  • **The variance of the distribution of sample means is sigma^2 /N. the population variance divided by the sample size.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Sample Error of Standard Mean Calculation

A

sigmaXbar = sigma / n^(1/2)

* the standard deviation of the distribution of the sample means.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Desired Properties of an Estimator

A
  1. Unbiasedness - when the expected value of the estimator is equal to the parameter you are trying to estimate.
  2. Efficient – if the variance of its sampling distribution is smaller than all the other unbiased estimators of the parameter you are trying to estimate.
  3. Consistent - the accuracy of the parameter estimate increases as the sample size increases.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Single (sample) values used to estimate a population parameter.

A

Point estimates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

A bell-shaped probability distribution that is symmetrical about its mean.

A

t-distribution
*Use t-distribution when constructing confidence intervals based on small samples (n <30) from populations with unknown variance and a normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

t-distributions have the following properties

A
  1. Symmetrical
  2. Has 1 parameter : Degrees of Freedom
  3. Has more probability in the tails than the normal distribution
  4. As df increases, the shape of the t-distribution more closely approaches a standard normal distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

The number of sample observations minus 1, for sample means

A

Degrees of Freedom (df)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Characteristics of T-Distribution

A
  1. Centered at Zero
  2. Flatter than a normal distribution
  3. As df increases, shape becomes more spiked and tails become thinner.
  4. t-test levels of significance only correspond to one tail probabilities
17
Q

This estimates result in a range of values within which the actual value of a parameter will lie, given the probability of 1-alpha

A

Confidence Intervals

18
Q

How confident your estimate is, denoted by alpha

A

Level of Significance

19
Q

Confidence Interval Calculation

A

C.I. = Xbar + z * (sigma / n^(1/2))

20
Q

Distribution with known variance

A

Use z score

21
Q

Distribution with unknown variance

A

Use t score

22
Q

2 Limitations of “larger is better”

A
  1. may contain observations from a different population

2. Cost

23
Q

Bias that refers to results where the statistical significance of the pattern is overestimated because the results were found through data-mining.

A

Data mining Bias

24
Q

Bias which occurs when some data is systematically excluded from the analysis, usually because of the lack of availability

A

Sample Selection Bias

*Survivorship bias in mutual funds

25
Q

Occurs when a study tests a relationship using sample data that was not available on the test date.

A

Look-ahead bias

26
Q

Results if the time period over which the data is gathered is either too short or too long.

A

Time-period Bias