Econometrics Flashcards
YNAM
RIRO
YOSO!
YANR
You Need A Model
Random In Random Out
You Only Sample Once
You are Never Right
Cross Sectional, Time Series, Longitudal/Panel
.
Probability Distribution (function)
A list of all possible values of the random variable, and the probability that each value will occur.
-These probabilities sum to 1.
Random Variable (RV)
A number with an uncertain value, but it can be estimated with probability.
-its value depends in some clearly-defined way on a set of possible random events(frequency distribution).
discrete random variable vs continuous random variable
A Discrete can only take a countable number of values(finite). For example, number of people in a room.
A Continuous can take an infinite number of values. For example, amount of time taken to do something, it could be infinite.
Bernoulli distribution
a random variable(dummy variable) which takes the value 1 with success probability of p, and the value 0 with failure probability of q=1-p.
Binomial distribution
with parameters n and p is the discrete probability distribution of the number of successes(x) in a sequence of n independent yes/no experiments, the outcomes of the experiment are Bernoulli(0/1).
E(Y)
= expected value of Y = weighted average of all possible values/mean of Y = typical value of Y = center of Y's distribution = first moment of Y = MUy
V(Y)
= variance of Y = Spread/Dispersion of Y's distribution = expected of SQUARED distance between Y and Muy = [ (y-MUy)^2] =σ^2
SD(Y)
=Standard Deviation of Y
= “ typical” distance between Y + M measured in same units as Y -> (y-MUy)
=sqrt{V(Y)}
Sample Space
The set of all possible outcomes
Marginal Probability Distribution
probability distribution between 2 random variables(X and Y).
Normal Probability Distribution
A normal distribution with a mean of 0 and SD/V of 1. N(0,1).
To compute probabilities for a normal distribution it must first be standardized by subtracting the mean, then dividing the result by the SD.
Standardized
-Special Type of linear Function
To compute probabilities for a normal distribution it must first be standardized by subtracting the mean, then dividing the result by the SD.
Formula: Z= (y-My)/SD
aka Z= (y-My)/sqr(V/n)
“Z-score”
The number of standard deviations that a value x is away from the mean.
*Use tables at end of book
Covariance
A measure of the extent to which 2 random variables move together.
Formula:
Correlation
An alternative measure of the dependence between X and Y that solves the “units” problem of covariance.
-Specifically, the correlation between X and Y is the covariance(x,y), divided by their standard deviations.
Formula: Corr(X, Y) = cov(X,Y) / SD(x),SD(y)
-# is between -1 and 1. 0 means uncorrelated.
Probability Density Function
a probability dist. function for a Continuous RV falling in between the range/area of the distribution.
Cumulative Probability Distribution
the probability that a random variable id >= to a particular value,
Regression functions tell you about?
AVERAGES(statistical/probability) of a conditional probability of (x | y).
-We summarize these based on conditional distribution functions
Law of Large numbers.
states that based on probability and statistics under general conditions, Ybar will be near Mu(convergence) with very high probability when n is large. and vise versa if it’s small.
Central Limit Theorem
states that regardless of the distribution of the underlying(parent) population of data based on law of large numbers:
- the mean of the population of means is always = to the mean of underlying population
- The SD of the population is always = to the SD of the underlying population divided by the the square root of the sample size(n)
- The distribution of means will increasingly approximate a normal distribution as the size N of samples increases.
-The distribution of sample means is also called the “sampling distribution”, which is a normal distribution.
p-value
Also called the significance probability is the probability of drawing a statistic at least as adverse to the null hypothesis as the one you computed from the sample, assuming the null hypothesis is correct.
Formula:See page 72
Standard Error, sample variance, sample SD
See page 73-74 for formulas.
Estimator (& Estimate)
an estimator is a function of a sample data draw randomly from a population. An estimate is the numerical value of the estimator when it is actually computed using data from a sample.
- if mean of estimator equals mu it is Unbiased.
- Least squared estimator: minimizes the distance between Y and mu.
Coverage probability
the probability that a confidence interval of random samples contains the true population mean.