Econometrics Final Flashcards

1
Q

Measures of Central Tendency + Advantages+Limitations

A

Info on the center/average of the data values

Mean (most commonly used unless outliers exist): arithmetic average, sum of the values divided by their number; affected by extreme values (outliers)
* For a population of N values: μ = (x_1+x_2+…+x_N)/N = Sum of Population Values/Population Size
* For a sample of size n: x̄ = (x_1+x_2+…+x_n)/n = Sum of Observed Values/Sample Size
* population mean & sample mean generally aren't equal; the sample mean varies with the sample drawn

Median: midpoint of ranked values, 50% above & 50% below; not affected by extreme values (outliers)
Comparing the mean, median, and mode is useful to visualize the shape of the distribution (ex. skewed)

Mode: most frequently observed value; not affected by extreme values (outliers); used for discrete numerical or categorical data (may be none, one, or many)

2
Q

Sample Size vs Population Size

A

A sample is a subset of the population used to generalize to the entire population. A sample alone is not enough: the certainty of statistics drawn from it depends on random assignment and on the size of the sample.

Population size accounts for every single person in population.

3
Q

Skew of graph if Mean < Median & Mean > Median

A

Mean < Median → left skewed; Mean > Median → right skewed

4
Q

Geometric Mean Vs Geometric Mean Rate of Return

A

GM = (X_1 × X_2 × … × X_n)^(1/n)
GMRR = (X_1 × X_2 × … × X_n)^(1/n) − 1, where X_i = 1 + R_i is the growth factor in period i

5
Q

Suppose you invested $100 in stocks and, after 5 years, the value of stocks becomes $125 worth. What is the average annual compound rate of returns?

A

$100(1+r)^5 = $125
1+r = (125/100)^(1/5)
r ≈ 4.56% ≈ 4.6%
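The arithmetic can be checked with a short sketch (values taken from the card):

```python
# Solve 100*(1+r)^5 = 125 for the compound annual return r.
principal, final_value, years = 100, 125, 5
r = (final_value / principal) ** (1 / years) - 1
# r is about 0.0456, i.e. ~4.6% per year
```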

6
Q

Summation Operator + 5 Properties

A

Σ_{i=1}^n x_i = x_1+x_2+…+x_n, the sum of a sequence of numbers {x_1, x_2, …, x_n}

  1. A common factor/coefficient can be factored out: Σ_{i=1}^n c·x_i = cx_1+cx_2+…+cx_n = c(x_1+x_2+…+x_n) = c·Σ_{i=1}^n x_i
  2. If x_i = 1 for all i, then Σ_{i=1}^n c = c·Σ_{i=1}^n 1 = c(1+1+…+1) = cn
  3. Addition/subtraction inside the sum can be split into individual summations: Σ_{i=1}^n (x_i+y_i) = (x_1+y_1)+(x_2+y_2)+…+(x_n+y_n) = (x_1+x_2+…+x_n)+(y_1+y_2+…+y_n) = Σ_{i=1}^n x_i + Σ_{i=1}^n y_i
  4. Double summations: Σ_{i=1}^n Σ_{j=1}^m x_i·y_j = (Σ_{i=1}^n x_i)(Σ_{j=1}^m y_j)
    Example (n=m=2): Σ_{i=1}^2 Σ_{j=1}^2 x_i·y_j = Σ_{i=1}^2 (x_i·y_1 + x_i·y_2) = (x_1y_1+x_1y_2)+(x_2y_1+x_2y_2)
  5. The sum of deviations from the mean is zero: Σ_{i=1}^n (x_i − x̄) = 0
    (x_1−x̄)+(x_2−x̄)+(x_3−x̄)+…+(x_n−x̄)
    = (x_1+x_2+x_3+…+x_n) − n·x̄
    = Σ_{i=1}^n x_i − n·(Σ_{i=1}^n x_i)/n = Σ_{i=1}^n x_i − Σ_{i=1}^n x_i = 0
7
Q

What increases the Certainty/Confidence/Accuracy of Statistical Test

A

Size: larger sample size, more representative of population distribution
Random Assignment: no systematic confounding variable, more representative of population distribution

8
Q

2-Sample T-Test

A

Compares the means of two samples to test whether the underlying population means differ; rejecting the null hypothesis simultaneously supports the alternative.

9
Q

Variability + Ways to Measure

A

Info on the spread/variability/distribution of the data values
1. Range: difference between largest & smallest observations = largest x − smallest x
* D: ignores the distribution & sensitive to outliers
2. Interquartile Range: midspread, middle 50%, difference between the 75th and 25th percentiles = x_75% − x_25%
3. Variance: dispersion of data points from the mean on average, weighted average squared distance b/w data point & mean: σ² = Σ(x_i − μ)²/N for a population vs s² = Σ(x_i − x̄)²/(n−1) for a sample
* A: each value in the data set is accounted for with its weight; squaring avoids −ve deviations canceling out
* D: squared units are uninterpretable
4. Standard Deviation: variation about the mean with the same units as the original data, most common = square root of variance
* D: hard to compare 2+ datasets with different units
5. Coefficient of Variation: measures variation relative to the mean to compare 2+ data sets in different units, as the units cancel out and it becomes a unit-free measure = (standard deviation/mean) × 100%
6. Empirical Rule: without plotting, gives lots of info on where the majority of the data lies; if the data distribution is approximately normal, then the interval
* μ ± 1 standard deviation contains 68% of the values in the data set
* μ ± 2 standard deviations contains 95% of the values in the data set
* μ ± 3 standard deviations contains 99.7% of the values in the data set
7. Weighted Mean: x̄ = Σ_{i=1}^n w_i·x_i = w_1x_1+w_2x_2+…+w_nx_n, w_i = weight of the ith observation; for data paired into n classes, all weights sum to 100% = 1
8. Covariance: how dependent two variables are on each other, direction of the linear relationship b/w 2 variables; the sign matters: 0 means linearly unrelated, +ve move in the same direction, −ve move in opposite directions = weighted average of the products of x & y deviations from their respective means, range (−inf, +inf). Cov(x,y) = σ_xy = Σ_{i=1}^N (x_i − x̄)(y_i − ȳ)/N, or /(n−1) for a sample
* D: units are meaningless, uninterpretable
9. Coefficient of Correlation: relative strength and direction of the linear relationship b/w 2 variables with different units, unit free; stronger correlation means the data points are closer to the line
Sign depends on the covariance since standard deviations are always positive; range [−1, 1]
r = Cov(x,y)/(S_x·S_y)

10
Q

Compare coefficient of Variation:
Stock A: Avg Price=$50, SD=$5
Stock B: Avg Price=$100, SD=$5

A

A: CV = (5/50) × 100% = 10%

B: CV = (5/100) × 100% = 5%

Both have the same standard deviation; however, Stock B is less variable relative to its price.

11
Q

Avg stock price $800, standard deviation $100, what interval will 95% of stock price be in?

A

mean ± 2 standard deviations contains 95% of the values in the data set
(800 − 2(100), 800 + 2(100))
($600, $1000)

12
Q

Calculate final grade given Exam(45%)=70%, Participation(30%)=90%, Iclicker(5%)=0, Quiz(20%)=100%

A

Final Grade =
Σ_{i=1}^4 w_i·x_i = 0.45(70) + 0.30(90) + 0.05(0) + 0.20(100) = 31.5 + 27 + 0 + 20 = 78.5
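The same weighted mean as code (weights and scores from the card):

```python
# Final grade as a weighted mean: sum of weight_i * score_i
weights = [0.45, 0.30, 0.05, 0.20]   # Exam, Participation, Iclicker, Quiz
scores = [70, 90, 0, 100]
final_grade = sum(w * s for w, s in zip(weights, scores))
# 31.5 + 27 + 0 + 20 = 78.5
```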

13
Q

Illustrate a correlation of r= -1, -0.6, 0, 1, 0.3

A
14
Q

A1: Given (x_1,y_1)=(11,52), (x_2,y_2)=(13,72), (x_3,y_3)=(15,62), calculate a) Sample Variance b) Sample Covariance c) Sample Correlation Coefficient

A

a.4
b.10
c.1/2

15
Q

A1: What will be the price range of 95% of stock given avg price=$650 & standard deviation=$100

A

($450, $850)

16
Q

A1: n=5, x_i ∈ {1,2,3,4,5}: a) sum b) mean c) sample variance d) Σ(x_i − x̄)

A

a.15
b.3
c.2.5
d.0

17
Q

A1: Prove using the summation operator a) Σa·x_i = a·Σx_i b) Σ(x_i+y_i) = Σx_i + Σy_i
c) Σ(a·x_i + b·y_i) = a·Σx_i + b·Σy_i d) ΣΣ ab·x_i·y_j = ab·Σx_i·Σy_j

A

See doc

18
Q

Probability

A

The likelihood of a set of outcomes as defined by a function; the relative frequency of an outcome occurring when a random experiment is repeated infinitely many times (formal definition: a function from the space of sets of outcomes to the real values between 0 and 1)

19
Q

Random Experiment + Basic Outcome + Sample Space + Event

A

Random Experiment: a process leading to an uncertain outcome (ex.Dice, coin flip)
Basic Outcome: a possible outcome of a random experiment (ex. 1,2,3,4,5,6)
Sample Space: collection of all possible outcomes of a random experiment (ex. S={1,2,3,4,5,6})
Event: any subset of basic outcomes from the sample space (ex. Let A be the event “Number rolled even”, then A={2,4,6}), if outcome of experiment is in A, then event A has occurred

20
Q

Outcomes of rolling 2 dice. a)Identical Dice b)Diff dice

A

a) 21 (unordered pairs: C(6,2)=15 plus 6 doubles)
b) 36 (ordered pairs: 6 × 6)

21
Q

6 Types of Probability Set Relationships + Draw

A
  1. **Empty Set** (∅): contains no elements; defines **Mutually Exclusive** events: A∩B=∅
  2. Subset (A⊂B): any element of A is also in B, so A∪B=B and A∩B=A
  3. Intersection of Events (A∩B): set of all outcomes in S that belong to both A & B, if A & B are events in a sample space S
  4. **Union of Events** (A∪B): set of all outcomes in S that belong to A or B (or both), if A & B are events in a sample space S
  5. Complement (Ā): set of all basic outcomes in S that don't belong to A, so that S = A ∪ Ā
  6. Collectively Exhaustive: a collection of events that completely covers S, e.g. A∪B=S
22
Q

4 Properties of Set Operations + Draw

A
  • Commutative (order): A∪B = B∪A
  • Associative (grouping): (A∪B)∪C = A∪(B∪C)
  • Distributive Law: A∩(B∪C) = (A∩B)∪(A∩C)
  • De Morgan's Law: the complement of (A∪B) = Ā∩B̄ and the complement of (A∩B) = Ā∪B̄
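These laws can be checked directly with Python's built-in sets; as a sketch, using the example sets from the next card:

```python
S = {1, 2, 3, 4, 5, 6}
A, B, C = {2, 4, 6}, {4, 5, 6}, {4, 5}

assert A | B == B | A                         # commutative
assert (A | B) | C == A | (B | C)             # associative
assert A & (B | C) == (A & B) | (A & C)       # distributive law
assert S - (A | B) == (S - A) & (S - B)       # De Morgan's law
assert S - (A & B) == (S - A) | (S - B)       # De Morgan's law
```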
23
Q

Given S={1,2,3,4,5,6} A={2,4,6},B={4,5,6},C={4,5} find complements, intersections, unions, subset

A

See doc

24
Q

Probability as Relative Frequency

A

P(A) = lim_{n→∞} n_A/n = # of outcomes in the population that satisfy A / total # of outcomes in the population

Repeating the experiment as n approaches infinity and counting the number of times event A occurred as n_A gives the ratio/relative frequency of event A occurring

25
Q

Factorial & Combination & Permutation Formula

A

Factorial Formula: n!, number of ways to order n objects
- How many ways to order n=8 runners in a sequence: 8!

Combination Formula: C(n,k) = n!/(k!(n−k)!), number of unordered ways in which k objects can be selected from n objects
- How many ways to pick k=3 out of n=8 runners (who gets a medal, medals identical)
- A true combination (math sense) lock would accept 1-2-3, 2-1-3, 3-2-1
- Has fewer outcomes: # of groupings < # of orderings of each grouping

Permutation Formula: P(n,k) = n!/(n−k)!, n=total, k=limited spots = total # of groupings × # of orders for each grouping
- How many ways to pick k=3 for 1st, 2nd, 3rd places (who gets which medal)
- A true permutation lock only accepts 1-2-3
- Has more outcomes: each grouping is counted once per ordering
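Python's `math` module implements these directly (`comb`/`perm` need Python 3.8+); the runner example from the card as a sketch:

```python
from math import comb, factorial, perm

orderings = factorial(8)   # ways to order 8 runners in sequence
medals = comb(8, 3)        # pick 3 medalists, medals identical
podium = perm(8, 3)        # pick 1st/2nd/3rd, medals distinct
# perm = comb * k!: each grouping of 3 can be ordered 3! ways
```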

26
Q

Q. 5 candidates, 2 positions, 3 men, 2 women, every candidate equally likely to be chosen; probability that no women will be hired:

A
  1. Total # of combinations: C(5,2) = 5!/(2!(5−2)!) = 10
  2. Combinations where only men are hired: C(3,2) = 3!/(2!(3−2)!) = 3
  3. Probability = # of outcomes that satisfy A / total # of outcomes = 3/10 = 30%
27
Q

Probability as a Set Function + 3 Properties

A

Probability as a Set Function: real-valued set function P that assigns to each event A in the sample space S a number P(A) satisfying the following 3 properties:
1. Nonnegative: 0 ≤ P(A) ≤ 1
2. P(S) = 100% = 1 = probability of all outcomes
3. For mutually exclusive events, the probability of the union is the sum of the individual probabilities (addition rule): P(A_1∪A_2∪…∪A_k) = P(A_1)+P(A_2)+…+P(A_k), e.g. P(A∪B) = P(A)+P(B)

28
Q

Probability Rules + Draw

A
  1. Complement Rule: P(Ā) = 1 − P(A), since 1 = P(A) + P(Ā)
  2. Addition Rule: P(A∪B) = P(A) + P(B) − P(A∩B), draw diagram see notes
    Decomposition: P(A) = P(A∩B) + P(A∩B̄)
    Mutually Exclusive Addition Rule: P(A∪B) = P(A) + P(B)
  3. P(∅) = 0
  4. If A⊂B, then P(A) ≤ P(B)
  5. For any A & B: P(A∩B) ≥ P(A) + P(B) − 1
  6. P(A∪Ā) = 1
29
Q

Draw Probability Table + Table of Cards (Ace vs non-Ace) + Table of P(A)=P(A∩B)+P(A∩B̄)

A

See doc

30
Q

Conditional Probability & Multiplication Rule

A

Conditional Probability: probability of one event A, given another event B is true/has occurred; B becomes the new total sample space in which A must be contained

P(A|B) = P(A∩B)/P(B) = # of outcomes in A within the space B / total # of outcomes in B
P(B|A) = P(A∩B)/P(A) = # of outcomes in B within the space A / total # of outcomes in A

**Multiplication Rule**: rearranging conditional probability: P(A∩B) = P(A|B)P(B) or P(A∩B) = P(B|A)P(A)

31
Q

Outcome is an even number. What is the probability of having rolled a 6

A

S={1,2,3,4,5,6}, A={6}, B={2,4,6}; P(A|B) = P(A∩B)/P(B) = P({6})/P({2,4,6}) = (1/6)/(1/2) = 1/3

32
Q

Probability that at least one die is equal to 2 when the sum of two numbers is less than or equal to 3

A

S = all 36 ordered pairs of {1,…,6}×{1,…,6}; A = at least one die shows a 2; B = {(1,1),(1,2),(2,1)}
A∩B = {(1,2),(2,1)}, so P(A|B) = P(A∩B)/P(B) = (2/36)/(3/36) = 2/3

33
Q

Probability of getting a red ace using multiplication rule.

+

Does P(A)=P(AnB)+P(AnhatB)=P(A|B)P(B)+P(A|hatB)P(hatB)

A

P(Red∩Ace) = P(Red|Ace)P(Ace) = (2/4)(4/52) = 2/52

34
Q

Statistical Independence:

A

Two events are independent if any of the following (equivalent) conditions holds; each probability is unaffected by the other event (ex. shape of a coin vs flipping heads)

P(A∩B) = P(A)P(B)
P(A|B) = P(A), because conditioning on B has no effect on the probability of A
P(B|A) = P(B), because conditioning on A has no effect on the probability of B

35
Q

A{2,4,6}, B{1,2,3,4} Statistically independent?

A

Yes, because P(A∩B) = P(A)P(B):
P(A∩B) = P({2,4}) = 2/6 = 1/3
P(A)P(B) = (3/6)(4/6) = 1/3

36
Q

Bivariate Probabilities & Joint Distribution of X & Y & Marginal Probabilities

Draw Table + Diagram

A

Bivariate Probabilities: probabilities that a certain pair of events (A & B) will occur when there are two random variables in your scenario

Joint Distribution of X{x_i} & Y{y_i}: described by bivariate probabilities

Marginal Probabilities: the probability of a single event occurring, irrespective of other events

If the B_i are mutually exclusive & collectively exhaustive: P(A) = P(A∩B_1)+P(A∩B_2)+…+P(A∩B_k)

See doc for table & diagram

37
Q

Difference b/w Joint Probability, Marginal Probability & Conditional Probability

A

Joint probability is the probability of two events occurring simultaneously.

Marginal probability is the probability of an event irrespective of the outcome of another variable.

Conditional probability is the probability of one event occurring given that a second event has occurred.

38
Q

Total Law of Probability + Draw

A

Total Law of Probability: mutually exclusive & collectively exhaustive events B_i partition A into k mutually exclusive pieces such that

A = (A∩B_1)∪(A∩B_2)∪…∪(A∩B_k); therefore, using the addition rule: P(A) = P(A∩B_1)+P(A∩B_2)+…+P(A∩B_k) = Σ_{i=1}^k P(A∩B_i)
If the B_i are mutually exclusive & collectively exhaustive (B_i∩B_j=∅, S=B_1∪B_2∪…∪B_k), subbing in the multiplication rule → P(A) = Σ_{i=1}^k P(A|B_i)P(B_i) for any A

39
Q

Bayes’ Theorem + Proof

A

Bayes' Theorem: combines all previous concepts into one expression; how new info updates the probability implied by old info

P(B|A) = P(A∩B)/P(A) = P(A|B)P(B)/P(A) = P(A|B)P(B) / [P(A|B)P(B) + P(A|B̄)P(B̄)]

Proof:
1. Conditional Probability: P(B|A) = P(A∩B)/P(A)
2. **Sub in Multiplication Rule** P(A∩B) = P(A|B)P(B): P(B|A) = P(A|B)P(B)/P(A)
3. B & B̄ are mutually exclusive & collectively exhaustive, so A = (A∩B)∪(A∩B̄) and P(A) = P(A∩B) + P(A∩B̄):
P(B|A) = P(A|B)P(B) / [P(A∩B) + P(A∩B̄)]
4. Sub in the Multiplication Rule again: P(A∩B) = P(A|B)P(B) & P(A∩B̄) = P(A|B̄)P(B̄)

General Theorem (see doc): P(B_i|A) = P(A|B_i)P(B_i) / [P(A|B_1)P(B_1)+…+P(A|B_k)P(B_k)] = P(A|B_i)P(B_i) / Σ_{j=1}^k P(A|B_j)P(B_j)
1. Conditional Probability: P(B_i|A) = P(A∩B_i)/P(A)
2. Total Law of Probability: P(A) = Σ_{j=1}^k P(A|B_j)P(B_j), so P(B_i|A) = P(A∩B_i) / Σ_{j=1}^k P(A|B_j)P(B_j)
3. Multiplication Rule: sub in P(A∩B_i) = P(A|B_i)P(B_i):
P(B_i|A) = P(A|B_i)P(B_i) / Σ_{j=1}^k P(A|B_j)P(B_j)

40
Q

Your probability of having the covid antibody (B) if 10% of the population has the antibody (P(B)=10%) & your test is positive (A is true)

True Positive: P(A|B)=97.5% — if you have the antibody (B) → the probability of a positive test (A) is 97.5%
False Positive: P(A|B̄)=12.5% — if you don't have it (B̄) → the probability of a positive test (A) is 12.5%

A

P(B|A) = ?

P(A|B) = 97.5%
P(A|B̄) = 12.5%
P(B) = 10%
P(B̄) = 90%

P(B|A) = P(A|B)P(B) / [P(A|B)P(B) + P(A|B̄)P(B̄)] = (97.5%)(10%) / [(97.5%)(10%) + (12.5%)(90%)] ≈ 46.4%
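The same Bayes computation as a sketch:

```python
# P(antibody | positive test) via Bayes' theorem
p_b = 0.10               # prior: has antibody
p_a_given_b = 0.975      # true positive rate
p_a_given_not_b = 0.125  # false positive rate

p_a = p_a_given_b * p_b + p_a_given_not_b * (1 - p_b)  # total law of probability
p_b_given_a = p_a_given_b * p_b / p_a                  # about 0.464
```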

41
Q

A2:
a)Prove that AcB, then P(A)<=P(B)

b) For any A & B, P(AnB)>=P(A)+P(B)-1

A

a) Write B = A ∪ (Ā∩B) (mutually exclusive pieces, using A∩B=A), apply the addition rule, then use the first property of probability, P(Ā∩B) ≥ 0, so P(B) ≥ P(A)

b) Start from the addition rule P(A∪B) = P(A)+P(B)−P(A∩B) with P(A∪B) ≤ 1, and rearrange into P(A∩B) ≥ P(A)+P(B)−1

See doc

42
Q

A2: Given P(A)=0.3, P(B|A)=0.6, P(B|Ā)=0.6, find P(Ā|B̄)

A

Find the elements of P(Ā|B̄) = P(Ā∩B̄)/P(B̄)

or use Bayes' theorem, with P(B) = P(B|A)P(A) + P(B|Ā)P(Ā) = 0.6 by the total law of probability:

P(Ā|B̄) = P(B̄|Ā)P(Ā)/P(B̄) = (1−P(B|Ā))(1−P(A))/(1−P(B)) = (1−0.6)(1−0.3)/(1−0.6) = 0.7 (see doc)

0.7

43
Q

A2: 8 candidates, 2 jobs, 4 women, 4 men, 1 set of brothers
a) Total combinations where only men are hired
b) Total combinations of where brothers are hired
c) Total combinations of where only men and only brothers are hired

A

a) C(4,2)/C(8,2) = only-men combos / total combos = 6/28 = 3/14

b) 1/C(8,2) = 1/28

c) 1/28, since B ⊂ A means A∩B = B

44
Q

A2: See 5 6 7 in doc

A

See doc

45
Q

Random Variable

A

Random Variable (X): a function which maps the outcome s of an experiment to a number X(s); represents a possible numerical value from a random experiment

Discrete: limited, countable outcomes (dice, coin)
P(X∈A) = Σ_{x∈A} P(X=x) = P(X=0)+P(X=1)+…+P(X=n), where X = the RV and x = a constant value
Continuous: infinitely many outcomes (height)
Space of X: S_X = {x : X(s)=x, s∈S}

46
Q

Probability Mass Function vs Cumulative Distribution Function

a) Flip 2 coins, X = # of heads

b) Roll a die, X = number rolled

A

Probability Mass Function: f_X(x) = P(X=x)

Discrete Properties:
1. Always between 0 and 1
2. Σ_{x∈S} f_X(x) = 1 = 100%
3. For mutually exclusive outcomes, probabilities sum: P(X∈A) = Σ_{x∈A} f_X(x)

a) f_X(x) = 1/4 if x=0
1/2 if x=1
1/4 if x=2

Cumulative Distribution Function: F(x_0) = P(X ≤ x_0) = Σ_{x ≤ x_0} f_X(x)

b) F(x_0) = 1/6 if x_0=1
2/6 if x_0=2
…
6/6 if x_0=6

47
Q

Expected Value
Q. 2 Coins + Rolling dice

A

E(X) = Σ_x x·f_X(x)

Two Coins Expected Value: E(X) = 0(1/4) + 1(1/2) + 2(1/4) = 1
Rolling Die Expected Value: f_X(i) = P(X=i) = 1/6
E(X) = Σ_{i=1}^6 i(1/6) = 1(1/6)+2(1/6)+3(1/6)+4(1/6)+5(1/6)+6(1/6) = 3.5

48
Q

Variance + Standard Deviation
Q. 2 Coins + Rolling dice

A

Variance: σ² = E[(X−E[X])²] = Σ_x (x−E[X])²·f_X(x), measure of spread/squared distance from the mean; uninterpretable units, but squaring prevents cancelling

Standard Deviation: σ = √σ² = √(Σ_x (x−E[X])²·f_X(x)), measure of spread/distance from the mean in the original, interpretable units

2 Coins: σ² = (0−1)²(1/4) + (1−1)²(1/2) + (2−1)²(1/4) = 0.5, so σ = √0.5 ≈ 0.707
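A quick sketch for the two-coin example, computing mean, variance, and SD from the PMF:

```python
pmf = {0: 0.25, 1: 0.5, 2: 0.25}   # X = number of heads in 2 flips

mean = sum(x * p for x, p in pmf.items())               # E[X] = 1
var = sum((x - mean) ** 2 * p for x, p in pmf.items())  # Var(X) = 0.5
sd = var ** 0.5                                         # ~0.707
```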

49
Q

Functions of Discrete Random Variables
Q. 1 coin X=1 heads, X=0 tail, g(1)=100, g(0)=0 Find expected value

A

E[g(X)] = Σ_x g(x)·f_X(x)
E[g(X)] = g(0)(0.5) + g(1)(0.5) = 0 + 50 = 50

50
Q

Bernoulli Probability Distribution vs Binomial Probability Distribution

A

Bernoulli Probability Distribution: random variables with only 2 possibilities

Binomial Distribution: sequence of n independent Bernoulli Random Variables/multiple sets of 2 possibility random variables

Y = Σ_{i=1}^n X_i; P(Y=y) = probability of y = # of successes in n = sample size trials, with p = probability of success on each trial

51
Q

Bernoulli PMF, Cases, Mean, Variance, SD

A

Probability Mass Function: f(x) = p^x(1−p)^(1−x), where X has a Bernoulli distribution
X = 0, 1
P(X=1) + P(X=0) = p + (1−p) = 1
P(X=1) = p
P(X=0) = 1−p
Cases: options^sets, e.g. 2 options (0,1) over 3 sets/votes gives 2³ = 8 cases
The powers of p & (1−p) tell how many successes & failures occur, summing to the total # of trials

Mean: μ = p = E(X) = Σ_{x=0,1} x·p^x(1−p)^(1−x) = 0(1−p) + 1(p)
If X=0, weight = P(0) = 1−p
If X=1, weight = P(1) = p

Variance: σ² = p(1−p) = E[(X−μ)²] = Σ_{x=0,1} (x−μ)²·p^x(1−p)^(1−x) = (0−p)²(1−p) + (1−p)²p
If X=0, squared distance from the mean = (0−p)², weight = P(0) = 1−p
If X=1, squared distance from the mean = (1−p)², weight = P(1) = p

Standard Deviation: σ = (p(1−p))^(1/2) = Var[X]^(1/2)

52
Q

Y=X_1+X_2, Independent RVs

a) Bernoulli Probability Mass Function of Y?
b) Expectation & Variance of Y
c) Expectation of Y conditional on X1
d) Expectation of X1 conditional on Y

A

a. P(Y=y) = C(2,y)·p^y(1−p)^(2−y), e.g. P(Y=2) = p²
b. E(Y) = 2p, Var(Y) = 2p(1−p)
c. E(Y|X_1) = X_1 + p (= 1+p when X_1=1)
d. E(X_1|Y=1) = 1/2

53
Q

Binomial Distribution PMF+derivation, Mean, Variance, SD

Average Bernoulli PMF, Mean+Proof, Variance+Proof, SD

A
  • Probability Mass Function of Binomial Distribution: P(Y=y) = n!/(y!(n−y)!) · p^y(1−p)^(n−y)
  • Mean: μ = E(Y) = E(Σ_{i=1}^n X_i) = Σ_{i=1}^n E(X_i) = E(X_1)+E(X_2)+…+E(X_n) = p+p+…+p = np
  • Variance: σ² = Σ Var(X_i) = Var(X_1)+Var(X_2)+…+Var(X_n) = p(1−p)+…+p(1−p) = np(1−p), by independence
  • Standard Deviation: σ = √(np(1−p))
  • Average of n Independent Bernoulli Random Variables: X̄ = (1/n)Σ_{i=1}^n X_i = Y/n = (1/n)(X_1+X_2+…+X_n)
  • Expected Value of the Average = good estimator of the population fraction: E(X̄) = (1/n)Σ E(X_i) = np/n = p
  • Variance of the Average: Var(X̄) = (1/n²)·np(1−p) = p(1−p)/n
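A sketch verifying E(Y)=np and Var(Y)=np(1−p) numerically from the binomial PMF (n and p are arbitrary example values):

```python
from math import comb

n, p = 10, 0.3
pmf = {y: comb(n, y) * p**y * (1 - p)**(n - y) for y in range(n + 1)}

mean = sum(y * q for y, q in pmf.items())                # np = 3.0
var = sum((y - mean) ** 2 * q for y, q in pmf.items())   # np(1-p) = 2.1
```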
54
Q

Prove |ρ(X, a+bX)| = 1

A

See doc

55
Q

Prove |ρ(X, (X−E[X])/√Var(X))| = 1

A

See doc

56
Q

Prove E[(X−E[X])/√Var(X)] = 0 and Var[(X−E[X])/√Var(X)] = 1

A

See doc

57
Q

Prove X & Y are uncorrelated & mean independent if they are stochastically independent

A
  • f(x,y) = f_X(x)f_Y(y), i.e. p_ij^XY = p_i^X·p_j^Y
  • E_{Y|X}[Y|X] = Σ_y y·f(x,y)/f_X(x) = Σ_y y·f_X(x)f_Y(y)/f_X(x) = Σ_y y·f_Y(y) = E_Y[Y]
  • Cov(X,Y) = 0 → ρ = 0/(σ_X·σ_Y) = 0
58
Q

Prove using summation operator Cov(X,Y)=E[XY]-E[X]E[Y]

A

See doc

59
Q

Prove Law of Iterated Expectation

A

See doc

60
Q

Var[(X1+X2)/2] if X1 & X2 are Stochastically Independent

A

= [Var(X_1)+Var(X_2)]/4 = σ²/2; for Bernoulli(p) variables this is p(1−p)/2

61
Q

Prove Var(aX+bY) = a²Var(X) + b²Var(Y) + 2abCov(X,Y)

A

See doc

62
Q

Stochastic Independence vs Mean Independence vs Uncorrelatedness

A
  1. Stochastically Independent: the strongest form; one variable's realization has no impact on the other's entire distribution (mean, spread, and beyond)
    f(x,y) = f_X(x)f_Y(y)
  2. Mean Independent: captures conditional dependency through the mean only; does one variable's realization impact the other's mean
    E_{X|Y}[X|Y] = E_X[X] or E_{Y|X}[Y|X] = E_Y[Y]
  3. Uncorrelated: captures linear (direction + spread) relations only
    Cov(X,Y)/(σ_X·σ_Y) = 0
63
Q

Prove that when X & Y are independent, for any function g(x) and h(y) Cov(g(X),h(Y))=0 always holds

A

See doc

64
Q

Prove Var[a+bX]=b^2Var[X]

A

see doc

65
Q

Prove Cov(a_1+b_1X,a_2+b_2Y)=b_1b_2Cov(X,Y)

A

see doc

66
Q

Prove E[a+bg(X)]=a+bE[g(X)]

A

see doc

67
Q

Flip coin 4 times Y=# of heads. P(Y=2), n=4, p=0.5. Probability of getting 2 heads.

A

P(Y=2) = C(n,y)·p^y(1−p)^(n−y) = [4!/(2!(4−2)!)]·(0.5)²(1−0.5)^(4−2) = 6/16 = 3/8

68
Q

Winning one game p=0.5, Y=# of games won,
* a) Probability of winning all 5 games
* b) Probability of winning majority of games
* c) If won first game, probability they will win majority of the five games=win 2 out of 4 games left

A
  • a. P(Y=5) = C(5,5)(0.5)⁵(0.5)⁰ = 1/32
  • b. P(Y≥3) = P(Y=3)+P(Y=4)+P(Y=5) = C(5,3)(0.5)⁵ + C(5,4)(0.5)⁵ + 1/32 = (10+5+1)/32 = 1/2
  • c. P(W≥2) = P(W=2)+P(W=3)+P(W=4) = C(4,2)(0.5)⁴ + C(4,3)(0.5)⁴ + C(4,4)(0.5)⁴ = (6+4+1)/16 = 11/16
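The three answers follow from summing the binomial PMF; as a sketch:

```python
from math import comb

def prob_at_least(k, n, p=0.5):
    # P(Y >= k) for a binomial(n, p) number of wins
    return sum(comb(n, y) * p**y * (1 - p)**(n - y) for y in range(k, n + 1))

p_all = 0.5 ** 5                      # a) win all 5 games: 1/32
p_majority = prob_at_least(3, 5)      # b) win the majority: 1/2
p_after_first = prob_at_least(2, 4)   # c) win 2+ of remaining 4: 11/16
```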
69
Q

Joint Probability + Mass Function
Marginal Probabilities + Stochastic Independence + Law of Iterated Expectations

Draw Table

A
  • Joint Probability Mass Function: f(x,y) = P(X=x, Y=y), a function that expresses the probability that X=x & simultaneously Y=y
  • Marginal Probabilities: the probability of an event irrespective of the outcome of the other variable, summed over all possible values of the other variable(s): P(X=x) = f_X(x) = Σ_y f(x,y), P(Y=y) = f_Y(y) = Σ_x f(x,y)
  • Stochastic Independence: all pairs of x & y must satisfy f(x,y) = f_X(x)f_Y(y), or all random variables must satisfy f(x_1,x_2,…,x_k) = f_X1(x_1)f_X2(x_2)…f_Xk(x_k). Derived from statistical independence P(A∩B) = P(A)P(B)
  • Law of Iterated Expectations: E_X[E_{Y|X}[Y|X]] = E_Y[Y]; averaging the conditional mean over all cases of X recovers the unconditional mean of Y
70
Q

Page 24 Table Questions

A

See doc

71
Q

Conditional PMF, Conditional Mean, Conditional Variance

A
  • Conditional Probability Mass Functions: functions expressing the probability mass f_{Y|X}(y) or f_{X|Y}(x) given that a value X=x or Y=y has occurred
    f_{Y|X}(y) = f(x,y)/f_X(x) = P(X∩Y)/P(X) or f_{X|Y}(x) = f(x,y)/f_Y(y) = P(X∩Y)/P(Y)
    Derived from Conditional Probability P(A|B) = P(A∩B)/P(B) or P(B|A) = P(A∩B)/P(A)
  • Conditional Mean: a random variable, because it depends on the realization of the conditioning variable — essentially there is an input & an uncertain output: μ_{Y|X=x} = E_{Y|X}[Y|X=x] = Σ_y y·f_{Y|X}(y) or μ_{X|Y=y} = E_{X|Y}[X|Y=y] = Σ_x x·f_{X|Y}(x)
    Essentially: the conditional PMF f_{X|Y}(x) = f(x,y)/f_Y(y) supplies the weights in the conditional expectation function
  • Conditional Variance: σ²_{Y|X=x} = E_{Y|X}[(Y−μ_{Y|X=x})²|X=x] = Σ_y (y−μ_{Y|X=x})²·f_{Y|X}(y) or σ²_{X|Y=y} = E_{X|Y}[(X−μ_{X|Y=y})²|Y=y] = Σ_x (x−μ_{X|Y=y})²·f_{X|Y}(x)
72
Q

Covariance vs Correlation

A

Covariance: expected product of deviations = direction (+/−) of the relation b/w 2 random variables, expected value of the product of the deviations of X & Y from their means (the magnitude doesn't tell you anything, the unit is uninterpretable)
* Cov(X,Y) = E[(X−μ_X)(Y−μ_Y)] = Σ_x Σ_y (x−μ_X)(y−μ_Y)·f(x,y)
* Cov = 0: no linear relation
* Cov > 0: positive linear relation
* Cov < 0: negative linear relation

Correlation: unitless measurement of the strength (spread + direction) of the linear relation b/w X & Y, from −1 to 1; covariance divided by the product of the X & Y standard deviations (weaker relation/larger spread in the denominator → closer to 0 vs stronger relation/smaller spread → closer to −1/1)
* ρ = Corr(X,Y) = Cov(X,Y)/(σ_X·σ_Y)
* ρ = 0: no linear relation
* ρ > 0: positive linear relation (1 = perfect positive linear dependency)
* ρ < 0: negative linear relation (−1 = perfect negative linear dependency)

73
Q

Applying Covariance to Investment Table

A

See Doc

74
Q

A4: Prove Cov(g(X),Y)=0

A

See doc

75
Q

A4: Prove E[(x-b)^2]=E[X^2]-2bE[X]+b^2

A

See doc

76
Q

A4: Prove Corr(X,Z)=1 & Corr(X, a+bX)=±1

A

See doc

77
Q

Prove E[X+Y]=E[X]+E[Y]

A

See doc

78
Q

Prove Cov(X,c)=0

A

See doc

79
Q

Prove Cov(X,X)=Var(X)

A

See doc

80
Q

Prove Cov(X,Y)=E[XY]-E[X]E[Y]

A

See doc

81
Q

Prove Σ(x_i−x̄) = 0 and simplify Σ(x_i−x̄)(y_i−ȳ)

A

See doc

82
Q

Is E[g(X)]=g(E[X]) always true?

A

Only equal if g(x) is linear

83
Q

A3: 3 Tosses of Coin a)PMF, b)Cumulative Function c)E[Y] d)Var(Y)

A

See doc

84
Q

A3: Derive Bernoulli a)E(X) & Var(Y) b)PMF c)Stochastically Independent d)E(Y|X_1=1)

A
85
Q

A3: 100 Tosses of fair coin binomial PMF function

A

See doc

86
Q

A3: 3,5,6 see doc

A

See doc

87
Q

Discrete v. Continuous Random Variables

  1. PDF
  2. CDF
  3. Expectation
  4. Variance
A

Continuous Random Variable: assumes any value in an infinite interval of outcomes, depending on the ability to measure accurately (ex. thickness, time, height)
- Probability Density Function: density whose integral gives the probability that the outcome X lies between a & b; obtained by differentiating the CDF: f_X(x) = dF_X(x)/dx, with P(a ≤ X ≤ b) = ∫_a^b f_X(t)dt = F_X(b) − F_X(a)
- Cumulative Distribution Function: probability that the outcome X doesn't exceed the value x; obtained by integrating the PDF: F_X(x) = P(X ≤ x) = ∫_{−∞}^x f_X(t)dt
- Expectation: μ_X = E(X) = ∫ x·f_X(x)dx
- Variance: σ_X² = E[(X−μ_X)²] = ∫ (x−μ_X)²·f_X(x)dx

Discrete Random Variable: assumes values within limited, countable outcomes (dice, coin)
- Probability Mass Function: P(X=x)
- Cumulative Distribution Function: F(x_0) = P(X ≤ x_0) = Σ_{x ≤ x_0} f_X(x)
- Expectation: Σ_x x·P(X=x)
- Variance: Σ_x (x−E(X))²·P(X=x)

88
Q

Distributions

  1. When to use
  2. Expectation
  3. Variance
  4. Sample –> Unstandardized
  5. Confidence Interval
A

Uniform Distribution: probability distribution with equal probabilities for all possible outcomes of a random variable uniformly distributed on [a,b]: X ~ U[a,b]

Normal Distribution: approximates the probability distributions of a wide range of RVs in empirical applications: X ~ N(μ, σ²)
* Bernoulli sums (via CLT)
* CLT: n big/population Normal, and population variance known

T-Distribution:
* n too small, population variance unknown, can't be Bernoulli

Chi-Square:
* Estimating the population variance

89
Q

Transitioning from Z –><– X

A

X = μ + Zσ
Z = (X − μ)/σ
Z ~ N(0, 1)
X ~ N(μ, σ²)

90
Q

Random Variables + Linear Combinations

A
  • Covariance: Cov(X,Y) = E[(X−μ_X)(Y−μ_Y)] = E(XY) − μ_X·μ_Y = 0 if X & Y are independent
  • Correlation: Corr(X,Y) = Cov(X,Y)/(σ_X·σ_Y)
  • Expectation: E[X_1+X_2+X_3+…+X_n] = μ_1+μ_2+μ_3+…+μ_n
  • Variance: Var[X_1+…+X_n] = σ_1²+…+σ_n² + 2Cov(X_1,X_2)+…+2Cov(X_{n−1},X_n) = σ_1²+…+σ_n² if independent
  • Jointly Normally Distributed (independent with identical mean & variance): X̄ ~ N(μ, σ²/n)
  • W = aX + bY
  • Expectation: E[aX+bY] = aμ_X + bμ_Y
  • Variance: Var[aX+bY] = a²σ_X² + b²σ_Y² + 2abCov(X,Y), where Cov(X,Y) = 0 if independent
  • Jointly Normally Distributed: aX+bY ~ N(μ_W, σ_W²) = N(aμ_X+bμ_Y, a²σ_X²+b²σ_Y²+2abCov(X,Y))
91
Q

Population + Sample + Inferential Stats (Types) + Sampling Distribution

A

Population: set of all items/individuals/variables of interest
Sample: subset of population observed, less time consuming & costly than census of entire population
Random Sampling: every item in population has equal chance of being selected and are selected independently (thrown back into population and could be drawn again)
Inferential Statistics: making statements about population parameters (unknown) by examining sample results (known)
* Estimation: make a claim about population mean using sample mean/evidence
* Hypothesis Testing: test a claim about population using sample mean/evidence

Sampling Distribution: plots the frequency of all possible sample means, each sample has n=# of items/people, the larger the n the smaller the variance=more accurate to the population mean

92
Q

Sample Mean, Expectation, Variance, Standard Deviation

A

See ipad

93
Q

Central Limit Theorem

A

If the population is not Normal, apply the Central Limit Theorem: as n increases, the distribution of the sample mean converges to a normal distribution; sample means from the population will be approximately normal as long as the sample size is large enough: √n(X̄_n − μ) →d N(0, σ²)
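A seeded simulation sketch of the statement: sample means of a non-normal (uniform) population cluster around μ with spread close to σ/√n (sample size and repetition count are arbitrary choices):

```python
import random
import statistics as st

random.seed(0)
n, reps = 50, 2000
# population: U[0,1], so mu = 0.5 and sigma^2 = 1/12
sample_means = [st.mean(random.random() for _ in range(n)) for _ in range(reps)]

# the sample means center on mu = 0.5 with sd near sqrt((1/12)/50) ~ 0.041
```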

94
Q

Q. n=14, Var(p)=16, upperlimit? So that probability of exceeding limit is less than 0.05

A

27.52 See doc

95
Q

Point Estimator v. Estimate

A

A point estimator θ̂ of a population parameter θ is a random variable, a function of the random sample; the realized value of the point estimator (random variable) is the point estimate
θ̂ = f(X_1, …, X_n)

96
Q

Unbiased Estimator + Efficient + Consistent with Graph + Words + Example

A

Unbiased Estimator: E(θ̂) = θ, mean of the estimator = true parameter (bias E(θ̂) − θ = 0)

Efficiency: spread/variance of the estimator, preferably smaller = more efficient: θ̂_1 is more efficient than θ̂_2 if Var(θ̂_1) < Var(θ̂_2)

Consistency: the point estimator converges in probability to the true parameter as the sample size increases, by the law of large numbers

97
Q

Point & Interval Estimates + Draw

A

A point estimate is a single number; an interval estimate is a range of values around it providing info about variability, based on observations from 1 sample, with limits that are functions of the sample: P(L[X_1,…,X_n] ≤ θ ≤ U[X_1,…,X_n]) = 0.95

98
Q

Confidence Interval, Level, Significance Level
1. Formulas
2. Meaning

A

Confidence Level 1−α ∈ (0,1): percentage of the time the true population parameter falls within the interval over repeated samples. 95% of the time the true value will be in the interval; however, 5% of the time it won't be

P(Point Estimate − Reliability Factor × Standard Error < True Value < Point Estimate + Reliability Factor × Standard Error) = 1−α

Confidence Interval (θ̂−ME, θ̂+ME): range of values that holds the unknown population parameter (1−α)% of the time

Significance Level (α): probability of making an error (rejecting) when the null hypothesis is true

99
Q

Margin of Error + Formula + How to Reduce

A

The uncertainty/amount of random sampling error in the results: ME = Z_{α/2}·σ/√n

Reduce ME = Z_{α/2}·σ/√n by: reducing the population standard deviation σ, reducing the confidence level (1−α), or increasing the sample size n
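A sketch of the formula with example values (σ=6 and n=64 are assumed numbers):

```python
from math import sqrt

sigma, n = 6, 64
z = 1.96                   # two-sided 95% reliability factor
me = z * sigma / sqrt(n)   # 1.96 * 6/8 = 1.47
```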

100
Q

Population difference Confidence Interval + Derivation <>= 0 & #

A

See doc

101
Q

Difference in Differences Confidence Interval

A

See doc

102
Q

Hypothesis: types & Alt + Null

A

Hypothesis: a claim about a population parameter (μ, p, or σ²)
* Population Mean μ
* Population Proportion p
* Population Variance σ²

Null Hypothesis/Counterfactual (H0): assumption about the population parameter to be tested, the status quo

Alternative Hypothesis (H1): hypothesis the researcher is trying to support, challenges the status quo

103
Q

Hypothesis Testing + Methods/Process

A

Testing: assume the null is true (H0 uses =, ≤, or ≥; innocent until proven guilty), then ask where the realized sample falls within the null's probability distribution

1) Find the distribution
2) Choose a technique depending on the info given and the parameter of interest
* Z-Test: normal/CLT + known population variance
* T-test: n small or unknown population variance
* Chi-Square: estimating the population variance

3) Choose an upper/lower/two-tail rejection region, compare the realized sample with:
* Significance level or critical value
* P-value
* Confidence interval

104
Q

Rejection Region, Critical Value, Significance Level

A
  • Significance Level (α = %): probability mass placed in the rejection region
  • Critical Value (C = X̄_c): cutoff determined by the significance level
  • Rejection Region: the unstandardized range of values, e.g. [μ_0 + Z_α·σ/√n, ∞) for an upper-tail test
105
Q

P-Value + Info Required + Process

A

P-Value/Observed Level of Significance: probability of getting a test statistic more extreme than the realized sample under the null hypothesis H0's probability distribution; the smallest α at which H0 can be rejected

Required information to calculate: a realized sample and a distribution
P(Z ≥ z_X̄) = p-value, where z_X̄ = (X̄−μ_0)/(σ/√n)
e.g. P(|Z| ≥ 1.96) = 5%

Process:
1. Convert the sample into a test statistic
2. Use the Z-table to find P(Z ≥ z_X̄) = p-value
3. Compare the p-value & significance level α:
p-value ≥ α → do not reject, outside of the rejection region
p-value < α → reject, within the rejection region
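The standard normal CDF is available through `math.erf`, so a p-value can be computed without a Z-table; the sample value x̄ below is made up:

```python
from math import erf, sqrt

def phi(z):
    """Standard normal CDF."""
    return 0.5 * (1 + erf(z / sqrt(2)))

xbar, mu0, sigma, n = 53.1, 52, 6, 64     # hypothetical sample
z = (xbar - mu0) / (sigma / sqrt(n))      # test statistic
p_one_sided = 1 - phi(z)                  # upper-tail p-value
```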

106
Q

2 Types of Errors + Graph + Process

A

Type 1 Error (α) - False Positive: rejecting a true H0; we can never know the truth from just a realized sample, so this always has probability = level of significance α (ex. 1%, 5%, 10%)
Guilty before innocent = serious, convicting an innocent person
Calculating Type 1 Error: it equals the chosen significance level
P(reject H0 | H0 true) = α

Type 2 Error (β) - False Negative: failing to reject a false H0, with probability β
P(fail to reject H0 | H0 false) = β
Innocent before guilty = less serious, letting a guilty person go
Calculating Type 2 Error, e.g. n=64, σ=6, α=0.05, H0: μ ≥ 52, H1: μ < 52, true μ* = 50:
1. Calculate the critical value X̄_c
2. Standardize the critical value in terms of the true distribution (μ = μ*)
3. Use the Z-table to find β = P(X̄ > X̄_c | μ = μ*), the probability/integral/area under the curve outside the rejection region
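The three Type 2 Error steps for the example above (n=64, σ=6, α=0.05, H0: μ ≥ 52, true μ*=50), sketched in code:

```python
from math import erf, sqrt

def phi(z):
    """Standard normal CDF."""
    return 0.5 * (1 + erf(z / sqrt(2)))

n, sigma, mu0, mu_true = 64, 6, 52, 50
se = sigma / sqrt(n)              # 0.75
z_alpha = 1.645                   # lower-tail 5% critical z
x_crit = mu0 - z_alpha * se       # 1. critical value; reject H0 if xbar < x_crit
z_true = (x_crit - mu_true) / se  # 2. standardize under the true mean
beta = 1 - phi(z_true)            # 3. P(fail to reject | mu = 50), ~0.15
power = 1 - beta
```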

107
Q

Error/Power Tradeoff + How to Reduce

A

Type 1 & 2 Tradeoff: moving the rejection region alters the sizes of the errors, but you cannot decrease both at the same time; decreasing one increases the other, since Type 1 Error occurs when H0 is true and Type 2 Error occurs when H0 is false

Smaller rejection region → smaller type 1 error → larger type 2 error → smaller power
Larger rejection region → larger type 1 error → smaller type 2 error → larger power

Increasing sample size n (shrinking the sampling variance): smaller type 2 error, type 1 error unchanged (α fixed), larger power

108
Q

Power + Calculate

A

Power (1−β) - True Positive: probability of successfully rejecting a false H0
P(reject H0 | H0 false) = 1 − P(fail to reject H0 | H0 false) = 1 − β, where β = type 2 error
As the sample size increases, the power of the test increases

Calculate:
1. Find the critical value & standardize it into a Z score under the true distribution
2. Use the Z-table to find the probability of the rejection region under the true distribution

109
Q

A4-A7 + Worksheets

A

lol goodluck bro