Probability And Statistics Equations Flashcards
Axioms of Probability
- P(A) >= 0 for every event A in F
- P(Omega) = 1
- If A1, A2, … in F are mutually exclusive, then P(A1 U A2 U …) = P(A1) + P(A2) + …
Complement rule
P(A^c) = 1 - P(A)
Probability of the Union of Two Events Rule
P(A U B) = P(A) + P(B) - P(A ∩ B)
Bounds on Probabilities Rule
P(A U B) <= P(A) + P(B)
Logical Consequence Rule
If B logically entails A, then P(A)>= P(B)
Conditional Probability
P(A|B) = P(A ∩ B)/P(B)
Axioms revised based on conditional probability
- 0 <= P(A|B) <= 1
- P(B|B) = 1
- If A1, A2, … are mutually exclusive given B, then P(A1 U A2 U … |B) = P(A1|B) + P(A2|B) + …
Law of Total Probability
P(A) = P(A ∩ B1) + P(A ∩ B2) = P(A|B1)P(B1) + P(A|B2)P(B2), where B1 and B2 partition the sample space
Two events are independent if
P(A ∩ B) = P(A)P(B), or equivalently P(A|B) = P(A)
Bayes’ Rule
P(A|B) = P(B|A)P(A)/P(B)
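A minimal Python sketch of Bayes' rule, with the denominator P(B) expanded via the law of total probability; the disease-testing numbers are hypothetical:

```python
# Hypothetical numbers: P(A) = P(disease) = 0.01,
# P(B|A) = P(positive | disease) = 0.95, P(B|A^c) = 0.05.
p_a = 0.01
p_b_given_a = 0.95
p_b_given_not_a = 0.05

# Law of total probability: P(B) = P(B|A)P(A) + P(B|A^c)P(A^c)
p_b = p_b_given_a * p_a + p_b_given_not_a * (1 - p_a)

# Bayes' rule: P(A|B) = P(B|A)P(A)/P(B)
print(p_b_given_a * p_a / p_b)  # ≈ 0.161
```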
PMF definition
- f(x) = P(X=x)
- f(x) >= 0
- SUM_x f(x) = 1 (summing over all possible x)
PMF of the number of successes in N independent Bernoulli (coin toss) trials with parameter p
f(x) = K p^x (1-p)^(N-x)
K = N!/(x!(N-x)!) is the binomial coefficient; it depends on N and x but not on p
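A sketch of this PMF (the binomial), computing K with Python's built-in math.comb:

```python
from math import comb

# Binomial PMF: probability of x successes in N independent
# Bernoulli trials, each with success probability p.
def binom_pmf(x, N, p):
    K = comb(N, x)                      # binomial coefficient C(N, x)
    return K * p**x * (1 - p)**(N - x)

print(binom_pmf(3, 10, 0.5))  # ≈ 0.117
```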
PDF definition
f(x) = lim(h->0) P(x <= X <= x+h)/h
1. f(x) >= 0
2. integral f(x) dx = 1 (over the whole support)
Uniform distribution U(a,b) PDF
If a <= x <= b, f(x) = 1/(b - a)
Otherwise f(x) = 0
Normal Distribution PDF
f(x) = (1/(sigma sqrt(2 pi))) exp(-(x - mu)^2/(2 sigma^2))
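A sketch evaluating this density directly from the formula (defaults mu = 0, sigma = 1 chosen for illustration):

```python
import math

# Normal(mu, sigma^2) density, computed straight from the formula.
def normal_pdf(x, mu=0.0, sigma=1.0):
    coef = 1.0 / (sigma * math.sqrt(2 * math.pi))
    return coef * math.exp(-0.5 * ((x - mu) / sigma) ** 2)

print(normal_pdf(0.0))  # ≈ 0.3989, the standard normal peak
```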
CDF Definition
F(x) = P(X <= x) for all x in the support of X
1. 0 <= F(x) <= 1
2. F(x) is a non-decreasing function of x
3. P(X > x) = 1 - P(X <= x) = 1 - F(x)
4. P(a < X <= b) = F(b) - F(a)
Uniform Distribution CDF
If x < a, F(x) = 0
If x is in [a, b], F(x) = (x - a)/(b - a)
If b < x, F(x) = 1
Connection of CDF and PDF
f(x)= dF(x)/dx
Standardization N(0,1) Standard Normal Distribution
z = (X - mu)/sigma
P(Z <= z) from the standard normal (SN) table
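A quick check of the standardization identity, assuming scipy is available (norm.cdf plays the role of the SN table):

```python
from scipy.stats import norm

# P(X <= 75) for X ~ N(mu=60, sigma=10), via standardization.
mu, sigma, x = 60.0, 10.0, 75.0
z = (x - mu) / sigma          # standardize: Z ~ N(0, 1)
print(norm.cdf(z))            # ≈ 0.933, same as norm.cdf(x, mu, sigma)
```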
Joint CDF
F(x, y) = P(X <= x, Y <= y)
Joint PDF/PMF
Discrete: f(x, y) = P(X = x, Y = y)
Continuous: f(x, y) = ∂^2 F(x, y)/(∂x ∂y)
Marginal PDF (of x)
f(x) = integral f(x, y) dy (continuous) or f(x) = SUM_y f(x, y) (discrete) - the joint is integrated or summed over all values of y
If X and Y are independent (relationship between joint PDF and product of marginals)
f(x,y) = f(x)g(y) - joint PDF = product of marginals
F(x,y)= F(x)G(y) - joint CDF = product of marginals
Conditional PDF, CDF
Slices of the joint distribution: f(y|x) = f(x, y)/f(x) shows how the distribution of one variable changes with the value of the other
Expected value E[X]
Discrete E[X] = SUM_i x_i f(x_i)
Continuous E[X] = integral x f(x) dx
Probability-weighted sum of the possible values of X
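A minimal sketch of the discrete formula, using a fair die as the example:

```python
import numpy as np

# E[X] for a discrete variable: probability-weighted sum of values.
x = np.array([1, 2, 3, 4, 5, 6])   # faces of a fair die
f = np.full(6, 1 / 6)              # PMF: each face has probability 1/6
print(np.sum(x * f))               # 3.5
```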
Uniform Expected value - U(a,b)
E[X]=(a+b)/2
Normal - Expected Value N(mu, sigma^2)
E[X] = mu
Expected value of a function of a variable
Discrete E[g(X)] = SUMi g(xi)f(xi)
Continuous E[g(X)] = integral g(x)f(x)dx
Affine functions (E[a+bX])
E[a+bX]=a+bE[X]
Addition of expectations (g(X) and h(Y) are functions of the same or different variables)
E[g(X) + h(Y)] = E[g(X)] + E[h(Y)]
Multiplication of expectations IF INDEPENDENT
E[g(X)h(Y)] = E[g(X)]E[h(Y)]
Jensen’s inequality
If f is concave, E[f(X)] <= f(E[X])
If f is convex, E[f(X)] >= f(E[X])
Variance
Var(X) = E[(X - E[X])^2] = E[X^2] - (E[X])^2
Variance with mu (the expected value of the variable)
X discrete: Var(X) = SUM_i (x_i - mu)^2 f(x_i)
X continuous: Var(X) = integral (x - mu)^2 f(x) dx
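A quick numerical check that the two variance formulas above agree, again for a fair die:

```python
import numpy as np

# Check Var(X) = E[(X - mu)^2] = E[X^2] - (E[X])^2 for a fair die.
x = np.array([1, 2, 3, 4, 5, 6])
f = np.full(6, 1 / 6)
mu = np.sum(x * f)                      # E[X] = 3.5
var = np.sum((x - mu) ** 2 * f)         # definition form
print(var, np.sum(x**2 * f) - mu**2)    # both ≈ 2.9167
```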
Standard Deviation
SD(X) = sqrt(Var(X))
Variance of Uniform
Var(X) = (b-a)^2/12
Variance of Normal
Var(X) = sigma^2
Variance of Binary Variable
Var(X)=p(1-p)
Conditional Expectation
Y discrete: E(Y|X=x) = SUM_i y_i f(y_i|x)
Y continuous: E(Y|X=x) = integral y f(y|x) dy
E[h(X)Y|X] =
Conditioning on X -> h(X) can be treated as if it were a constant (pulled out like a linear transformation)
E[h(X)Y|X] = h(X)E[Y|X]
E[Y|X] =
If X and Y are independent
E[Y|X] = E[Y]
Law of Iterated Expectations
E[Y] = E(E[Y|X])
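A simulation sketch of the law, with an assumed toy model Y = 2X + noise (so E[Y|X] = 2X is known exactly):

```python
import numpy as np

rng = np.random.default_rng(0)

# Law of iterated expectations: E[Y] = E[E[Y|X]].
x = rng.integers(0, 2, size=100_000)      # X ~ Bernoulli(0.5)
y = 2.0 * x + rng.normal(size=x.size)     # Y = 2X + noise

e_y_given_x = 2.0 * x                     # the known conditional mean
print(y.mean(), e_y_given_x.mean())       # both ≈ 1.0
```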
Covariance
Cov(X,Y) = E[(X - E[X])(Y - E[Y])] = E[XY] - E[X]E[Y]
Cov(X,X)
Cov(X,X)=Var(X)
Cov(X,Y)
Cov(Y,X)
Cov(X,a)
0
Cov(aX,Y)
aCov(X, Y)
Cov(X,Y+Z)=
Cov(X, Y) + Cov(X, Z)
Var(aX ± bY)
a^2 Var(X) + b^2 Var(Y) ± 2ab Cov(X,Y)
If X and Y are independent Cov(X, Y)= E[XY] - E[X]E[Y] =
0
because independence implies E[XY] = E[X]E[Y]
Correlation
Corr(X, Y) =
Cov(X, Y)/(sqrt(Var(X)) sqrt(Var(Y))) =
Cov((X - E[X])/sqrt(Var(X)), (Y - E[Y])/sqrt(Var(Y))) - the covariance of the standardized variables
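A sketch verifying the correlation formula against numpy's built-in, on simulated data:

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.normal(size=10_000)
y = 0.5 * x + rng.normal(size=x.size)       # correlated with x by construction

# Corr(X, Y) = Cov(X, Y) / (sqrt(Var(X)) * sqrt(Var(Y)))
cov = np.cov(x, y)[0, 1]
corr = cov / (np.sqrt(np.var(x, ddof=1)) * np.sqrt(np.var(y, ddof=1)))
print(corr, np.corrcoef(x, y)[0, 1])        # the two agree (≈ 0.45 here)
```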
Population
Complete enumeration of the set of units of interest
Sample
Subset of a population
Sample frame
Source material or device from which a sample is drawn
Simple Random Sampling (SRS)
Selects a pre-determined number of respondents to be interviewed from a target population, with each potential respondent having an equal, non-zero chance of being selected
“Representative” sample
If the sampling procedure is repeated many times, the features of the sample would on average (across all the samples) match those of the population
Quota sampling
Fixed quotas of certain types of respondents to be interviewed such that the resulting sample characteristics resemble those of the population
Random variable
A deterministic function that assigns a number to each uncertain event generated by random sampling
Independently and Identically Distributed (iid)
Sample values are drawn independently of one another and all come from the same population, so they share the same distribution
Parameter
Numerical measure that describes a specific characteristic of a population
Statistic
Numerical measure that describes a specific characteristic of a sample. Formally, a statistic is a function of the sample data and is therefore itself a random variable (subject to sampling variation)
Estimand
Parameter in the population which is to be estimated in a statistical analysis
Estimator
A rule (function) for calculating an estimate of a given population parameter from randomly sampled data. Because different random samples yield different values, estimators are themselves random variables and therefore have distributions, expected values, etc.
Estimate
An estimate is the numerical value of the estimator once a specific sample is drawn; it is a nonrandom number (e.g. the sample mean computed from that sample)
Sampling distribution
Distribution of the estimator
Standard error
A measure of variation in the sampling distribution; it is equal to the square root of the variance of the statistic
Standard Deviation
A measure of variation in data it is equal to the square root of the variance of the data
Law of Large Numbers
As the sample size grows the sample mean converges, in a certain sense, to the population mean
Central Limit Theorem
As the sample size grows the sampling distribution of the standardised sample mean converges to a standard normal N(0,1)
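A simulation sketch of both ideas above (the distributions are assumed toy examples):

```python
import numpy as np

rng = np.random.default_rng(2)

# LLN: the sample mean of uniform draws approaches E[X] = 0.5.
for n in (10, 1_000, 100_000):
    print(n, rng.uniform(size=n).mean())

# CLT: standardized sample means of a skewed (exponential, mu = sigma = 1)
# population are approximately N(0, 1) for large n.
n, reps = 500, 10_000
means = rng.exponential(scale=1.0, size=(reps, n)).mean(axis=1)
z = (means - 1.0) / (1.0 / np.sqrt(n))   # (xbar - mu)/(sigma/sqrt(n))
print(z.mean(), z.std())                 # ≈ 0 and ≈ 1
```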
Nuisance parameter
Any parameter which is not of immediate interest but which must be accounted for in the analysis of those parameters which are of interest
General form of confidence interval
“Sample statistic +- a number of std errors * std error of the statistic”
Number of standard errors for probability 0.90
+-1.645
Number of standard errors for probability 0.95
+- 1.96
Number of standard errors for probability 0.99
+-2.58
Interpreting confidence intervals in terms of probabilistic behavior of the interval
“Before the sample is drawn, there is a 95% probability that the (random) interval [a, b] will contain the population parameter” - once a specific interval is computed, it either contains the parameter or it does not
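A minimal sketch of the general CI form above, using the 1.96 multiplier for a 95% interval (the data here are hypothetical normal draws):

```python
import numpy as np

rng = np.random.default_rng(3)
sample = rng.normal(loc=5.0, scale=2.0, size=100)   # hypothetical data

mean = sample.mean()
se = sample.std(ddof=1) / np.sqrt(sample.size)      # SE of the sample mean
lo, hi = mean - 1.96 * se, mean + 1.96 * se         # 95% CI
print(f"[{lo:.2f}, {hi:.2f}]")
```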
Hypothesis
Statement that some population parameter is equal to a particular value or lies in some set of values
Type 1 error
Rejecting the null hypothesis when in fact it is true
Type 2 error
Failing to reject the null when it is in fact false
5 steps of hypothesis test
- State the hypotheses - the Null and Alternative
- Construct a ‘test’ statistic:
Z = ((sample statistic) - (hypothesised population parameter))/(SE of the sample statistic)
- State the sampling distribution of the test statistic under the provisional assumption that the Null is true: Z ~ N(0,1)
- Use the SN distribution to control the probability of a Type 1 error
- Make a Decision - reject or fail to reject the Null
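A sketch of steps 2-5 as a two-sided z-test on hypothetical data (H0: mu = 5), assuming scipy is available:

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(4)
sample = rng.normal(loc=5.3, scale=2.0, size=100)   # hypothetical data

# Test statistic under H0: mu = 5
z = (sample.mean() - 5.0) / (sample.std(ddof=1) / np.sqrt(sample.size))
p_value = 2 * (1 - norm.cdf(abs(z)))                # two-sided, Z ~ N(0,1)
print(z, p_value, "reject H0" if p_value < 0.05 else "fail to reject H0")
```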
Time series data
Sequence of data points recorded in chronological order. Observations are often taken at equally-spaced points in time
Aims: (1) to provide a simple model of the evolution of a variable as an aid to understanding, and (2) to provide a basis for forecasting/prediction
Data are not independent
Example of how to linearize time series data
Exponential growth —> take their logarithm
Breaks
Variations that occur due to sudden causes and are usually ex ante unpredictable
Seasonality
Predictable periodic pattern that reoccurs or repeats over regular intervals
Cycles
A series follows an up and down pattern that is not seasonal
Deterministic time series
One which can be expressed explicitly by an analytic expression, it has no probabilistic or random aspects
Stochastic time series
A non-deterministic time series is one which cannot be described by an analytic expression
Reasons for randomness: (1) all the information necessary to describe it explicitly is not available, although it might be in principle, or (2) the nature of the generating process is inherently random
Stationarity
The idea that there is nothing statistically special about the segment of history that you observed in the sense that the statistical properties of the process generating the data are invariant to shifts in the window of observation
Strong stationarity
All statistical features of a distribution are invariant to time-shifts
Weak stationarity
E[Xt] and Var(Xt) do not vary with time
Cov(Xt, Xt-h) and Corr(Xt, Xt-h) depend only on the lag h, not on time t
Transforming a time series to stationarity
1) differencing the data
2) trend -> fitting a curve to the data
3) non-constant variance -> taking the logarithm or square root of the series
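A short sketch of transforms 1) and 3) on a hypothetical exponentially growing series:

```python
import numpy as np

rng = np.random.default_rng(5)
t = np.arange(100)
series = np.exp(0.05 * t) * (1 + 0.1 * rng.normal(size=t.size))  # exponential growth

log_series = np.log(series)     # log-transform linearizes the growth
growth = np.diff(log_series)    # first difference ≈ period growth rate
print(growth.mean())            # ≈ 0.05
```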
Causal effect
Difference between potential outcomes
Observed difference between groups=
ATT (average treatment effect on the treated) + selection bias
Selection bias
Average difference in the no-treatment potential outcome between the treated and un-treated groups
Reflects the idea that this bias arises if individuals are selected into treatment on the basis of their potential outcomes
Randomisation makes treatment independent of potential outcomes
The mean potential outcomes are identical for the treated and untreated groups
Selection bias is therefore zero
Internal validity
Findings for the sample are credible
External validity
Its findings can be credibly extrapolated to the population or Real World policy of interest
Threats to internal validity
1) Contamination: People in the control group access the treatment anyway
2) Non-compliance: individuals who are offered the treatment refuse to take it
3) Hawthorne Effect: a phenomenon in which participants alter their behaviour as a result of being part of an experiment or study
4) Placebo effect: outcomes change because participants believe they are receiving treatment, not because of the treatment itself
Threats to external validity
1) small/local nature of RCT (geographic area, institutional environment, demographic group)
2) spillover effects
3) short durations —> don’t know long term impact
Conditional Independence Assumption
Assignment to treatment is independent of potential outcomes conditional on covariates
Problems with conditional independence
1) credibility of the CIA itself (must condition on all relevant factors)
2) the common support problem/curse of dimensionality - few or no observations for certain groups
3) “bad controls” - controlling for variables which are themselves outcomes