Probability and Statistics Flashcards
Unbiased Estimator
If E(Tn) = θ
Tn is an unbiased estimator
Bias
b(Tn, θ) = E(Tn) - θ
The bias of Tn as an estimator of θ
Sampling Distribution
The distribution of the estimator
Sampling Error
The standard deviation of the sampling distribution
Also called the standard error; estimated in practice using the sample standard deviation S
Consistent Estimator
If Tn is unbiased for θ and Var(Tn) → 0 as n → ∞, then Tn is consistent
Equivalently (definition): ∀ε > 0, P(|Tn - θ| > ε) → 0 as n → ∞
Mean Square Error
MSE(T) = Var(T) + b²(T, θ)
MSE(T) = E[(T-θ)²]
Proof: MSE = Var(T) + b²(T, θ)
E[(T - θ)²] = E[(T - E(T) + E(T) - θ)²]
= E[(T - E(T))² + 2(T - E(T))(E(T) - θ) + (E(T) - θ)²]
= Var(T) + 2(E(T) - θ)E(T - E(T)) + (E(T) - θ)²
= Var(T) + b²(T, θ), since E(T - E(T)) = 0
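A quick numerical check of the decomposition (not from the cards): a minimal Python sketch, assuming we estimate the variance of N(0,1) data with the biased estimator that divides by n.
```python
import numpy as np

rng = np.random.default_rng(0)
n, reps, true_var = 10, 200_000, 1.0

# Biased variance estimator T = (1/n) * sum((x - xbar)^2) on N(0,1) samples
samples = rng.normal(0.0, 1.0, size=(reps, n))
T = samples.var(axis=1)          # ddof=0 divides by n, so T is biased

mse_direct = np.mean((T - true_var) ** 2)               # E[(T - theta)^2]
decomposed = np.var(T) + (np.mean(T) - true_var) ** 2   # Var(T) + bias^2

print(mse_direct, decomposed)    # the two values agree up to simulation error
```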
Coefficient of Variation
S.d. / E(x)
Null Hypothesis
An assumption about a parameter which we wish to test on the basis of available data - H₀
Alternative Hypothesis
If the data are not deemed to support H₀, then we conclude that an alternative hypothesis H₁ is supported
P-Value
Observed significance level
The observed significance level of a test (p-Value)
The probability of obtaining a value of the test statistic at least as extreme as that observed under H₀
Type 1 Error
If H₀ is true and we reject it
Type 2 Error
If H₀ is false and we accept it
Power
The quantity 1-β is called the power of a statistical test
It measures the test’s ability to detect a departure from H₀ when it exists
Critical value
The value of the test statistic separating the acceptance region from the rejection region (for an upper-tail test, the value below which we accept H₀ and above which we reject it)
Kolmogorov's Axioms of Probability 1-3
A probability function P is a mapping P: F → ℝ s.t.
- ∀ E∈ F, P(E) ≥ 0
- P(Ω) = 1
- If E ∩ F = ∅ then P(E ∪ F) = P(E) + P(F)
Deductions from the axioms 1-4
- P(Eᶜ) = 1 - P(E)
- P(E) ≤ 1
- If E ⊆ F, then P(F \ E) = P(F) - P(E)
- For any events E and F, (not necessarily disjoint); P(E∪F) = P(E) + P(F) - P(E∩F)
Independence
Events E and F are independent if P(E∩F) = P(E)P(F)
E and F are unrelated - E doesn’t affect F
Mutually Independent
Events E1…En are mutually independent if, for any collection of the events, the independence relation holds
Pairwise Independence
All pairs of events Ei and Ej are independent - weaker than mutual independence
Conditional Probability
The conditional probability of F given E is the probability of F occurring when E is known to have occurred
P(F|E) = P(E ∩ F) / P(E), provided P(E) > 0
Sample space changes from Ω to E
For independent events, P(F|E) is just P(F)
Law of Total probability
Let {Ei} be a partition of Ω, s.t. each outcome belongs to exactly one of the Ei
Then P(F) = ∑ P(F|Ei)P(Ei)
Proof:
F = F ∩ Ω = F ∩ (∪ᵢEi) = ∪ᵢ(F ∩ Ei), a disjoint union
So P(F) = ∑ P(F ∩ Ei) = ∑ P(F|Ei)P(Ei)
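A small numerical illustration (hypothetical numbers, not from the cards): items come from one of two machines (a partition of Ω) and F is the event "defective".
```python
# Law of total probability: P(F) = sum_i P(F | E_i) P(E_i)
# Hypothetical example: machine 1 or machine 2 produced the item.
p_machine = {1: 0.6, 2: 0.4}            # P(E_i), sums to 1
p_defect_given = {1: 0.02, 2: 0.05}     # P(F | E_i)

p_defect = sum(p_defect_given[i] * p_machine[i] for i in p_machine)
print(p_defect)  # 0.6*0.02 + 0.4*0.05 = 0.032
```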
Random variable
A function from a sample space Ω to ℝ
Discrete: a function from a countable sample space Ω to ℝ; X: Ω → ℝ is a d.r.v.
Continuous: X is a c.r.v. if Fx is continuous and differentiable
Probability Mass Function - drv
If X is a d.r.v. taking values in the set {xi}, then the function Px(x) = P(X = x) is the pmf of X
Properties:
- Px(xi) ≥ 0 for all i, since these are probabilities of events (axiom 1)
- ∑ Px(xi) = 1 (axiom 3)
Probability Density Function - CRV
The pdf is fx - the derivative of the distribution function Fx
Properties:
- fx(x) ≥ 0
- the integral of fx(x) over ℝ is 1
Cumulative distribution Function
For any random variable X, the function Fx(x) = P(X ≤ x); for a d.r.v., Fx(x) = ∑ Px(xi) over xi ≤ x
d.r.v.: Fx is a step function with discontinuities at the xi
c.r.v.: Fx is continuous and differentiable
Properties:
- Fx(x) → 1 as x → ∞
- Fx(x) → 0 as x → -∞
- Fx is monotonic increasing: x1 ≤ x2 ⇒ Fx(x1) ≤ Fx(x2)
Expectation Function
E(X) is the idealised long-run average of X
Discrete: E(X) = ∑ xi Px(xi)
Continuous: E(X) = ∫ x fx(x) dx over ℝ
(the sum / integral of x times the pmf / pdf)
E(X) exists if:
- the sample space is finite, or
- the sum / integral converges absolutely
Properties of E(x)
- If X is constant, i.e. P(X = c) = 1, then E(X) = c
- Y = aX +b
E(Y) = aE(X) + b
Proof - summation / integral and compute
Symmetry of E(x):
If X has a symmetric pmf/pdf and E(X) exists, then E(X) is the central point of the pmf/pdf
If symmetric about μ, let Y = X - μ; the pmf/pdf of Y is symmetric about 0, so E(Y) = 0, i.e. E(X) - μ = 0; rearranging gives the result
Properties of variance
- Var(x) ≥ 0
Sum of positive terms - all squared and real
- Var(X) = 0 if X is constant, i.e. P(X = c) = 1
Compute
- If Y = aX + b, Var(Y) = a²Var(X)
Compute
Coefficient of Variation
σ/μ
The ratio of s.d. to mean
Bernoulli
An experiment with 2 outcomes: success and failure
X - only takes values 0 or 1
Binomial
Sum of n independent Bernoulli trials
X - no. of successes in n independent Bernoulli trials with probability of success p
Geometric
X - no. of independent Bernoulli trials until a success
The waiting time between successes in binomial
Negative Binomial
X - no. of Bernoulli trials until rth success
The sum of r independent geometric random variables
Hypergeometric
X - no. of type 1 objects when n objects are drawn from a population of size N containing M type 1 objects
Sampling without replacement - with replacement is the binomial!!
Poisson Process
X - no. of events (e.g. accidents) in a fixed time period, occurring at a constant rate
The Poisson process assumes independence between non-overlapping time intervals - the Poisson model is only appropriate when this independence holds
Limit of binomial with n large and p small
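A sketch of the "n large, p small" limit, comparing the Bin(n, p) and Poi(np) pmfs; the values n = 1000, p = 0.003 are illustrative, not from the cards.
```python
from math import comb, exp, factorial

n, p = 1000, 0.003          # illustrative values: n large, p small, np = 3
lam = n * p

for k in range(8):
    binom = comb(n, k) * p**k * (1 - p)**(n - k)      # Bin(n, p) pmf
    poisson = exp(-lam) * lam**k / factorial(k)       # Poi(np) pmf
    print(k, round(binom, 5), round(poisson, 5))      # the two columns are very close
```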
Exponential
The time between events in a Poisson process
Gamma
X - time until the rth event in a Poisson process
The sum of r independent exponential variables
Beta
A continuous distribution on [0, 1], related to sequences of binomial distribution variables
Joint density for independent variables
If X1 and X2 are independent, the joint density function must factorise into a product of the form f1(x1)f2(x2)
Joint Distribution Function
Suppose X1…Xn are r.v.s defined on the same sample space.
The joint distribution function of X1…Xn is the function
F(x1, …, xn) = P(X1 ≤ x1, …, Xn ≤ xn)
Marginal Distribution Function
A distribution of a single random variable
F₁(x₁) = P(X₁ ≤ x₁) = P(X₁ ≤ x₁, X₂ < ∞, …, Xn < ∞)
Iterated Expectation Law
E(X1) = E_X2[ E_X1|X2(X1 | X2) ]
where E_X2[•] denotes the expectation over the marginal distribution of X2 and E_X1|X2[•] denotes the expectation over the conditional distribution of X1 given the value taken by X2
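A minimal discrete check of the law, using a small made-up joint pmf (the numbers are illustrative only).
```python
import numpy as np

# Made-up joint pmf p(x1, x2): rows index values of X1, columns values of X2
x1_vals = np.array([0.0, 1.0, 2.0])
p = np.array([[0.10, 0.20],
              [0.30, 0.10],
              [0.05, 0.25]])                          # entries sum to 1

p_x2 = p.sum(axis=0)                                  # marginal pmf of X2
e_x1_given_x2 = (x1_vals[:, None] * p).sum(axis=0) / p_x2   # E(X1 | X2 = x2)

lhs = (x1_vals[:, None] * p).sum()                    # E(X1) from the joint pmf
rhs = (e_x1_given_x2 * p_x2).sum()                    # E_X2[ E(X1 | X2) ]
print(lhs, rhs)                                       # identical
```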
Central Limit Theorem
If X1, X2, … are independent random variables having a common distribution with mean μ and variance σ², then the distribution of the standardised sample mean (X̄n - μ)/(σ/√n) tends to N(0, 1) as n → ∞
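A simulation sketch (an illustration, not from the cards): standardised means of i.i.d. Exponential(1) samples look approximately N(0, 1) for moderately large n.
```python
import numpy as np

rng = np.random.default_rng(1)
n, reps = 50, 100_000
mu, sigma = 1.0, 1.0                      # mean and s.d. of Exponential(1)

samples = rng.exponential(1.0, size=(reps, n))
z = (samples.mean(axis=1) - mu) / (sigma / np.sqrt(n))   # standardised sample means

# Compare a few empirical quantiles with the standard normal values
print(np.quantile(z, [0.025, 0.5, 0.975]))   # roughly [-1.96, 0, 1.96]
```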
Estimator
A statistic Tn = Tn(X1…Xn) is an estimator of a parameter θ if its value tn = Tn(x1…xn) is used as an estimate of θ
e.g. X1…Xn i.i.d. r.v.s with unknown mean E(X) = μ
An estimator of μ is the sample mean
rth moment of X about α
E((X - α)^r)
The variance is the 2nd moment of X about the mean
Covariance
Cov(x₁, x₂) = E((x₁-μ₁)(x₂-μ₂))
If continuous: integrate over one variable over ℝ and then the other variable over ℝ
If discrete: sum over the values of the different variables
Cov(x₁, x₂)
Independence → Cov(x₁, x₂) = 0
Cov(x₁, x₂) = 0 does not imply independence
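A standard counterexample for the last point, sketched in Python: X uniform on {-1, 0, 1} and Y = X² have zero covariance but are clearly dependent.
```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.choice([-1, 0, 1], size=1_000_000)     # X uniform on {-1, 0, 1}
y = x ** 2                                     # Y is a function of X, so dependent

cov = np.mean(x * y) - np.mean(x) * np.mean(y)
print(cov)                                     # approximately 0
print(np.mean(y[x == 0]), np.mean(y[x != 0]))  # 0 vs 1: knowing X changes Y completely
```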
Sum of the expectations is the expectation of the sum
∑ E(Xi) = E( ∑ Xi ) = E(X1 + X2 + … Xn)
= ∑_x1 ∑_x2 … ∑_xn (x1 + x2 + … + xn) P(x1, x2, …, xn)
= ∑_{x1…xn} x1 P(x1, …, xn) + … + ∑_{x1…xn} xn P(x1, …, xn) = E(X1) + … + E(Xn)
Variance of the sum = sum of the variances for mutually independent variables
Summation and compute
Continuous joint distribution
X1, X2 are independent if joint pdf factorises
ρ - in bivariate normal
Correlation coefficient: -1 ≤ ρ ≤ 1
Correlation parameter - measures the strength of linear association between the two variables x1, x2
ρ = Corr(x₁, x₂) = Cov(x₁, x₂) / (σ₁σ₂)
Features of conditional joint distributions
Conditional mean: if E(x₁|x₂) is a linear function of x₂, this suggests x₁ and x₂ are linearly associated
If x₂ > μ₂ (and ρ > 0): conditional mean E(x₁|x₂) > marginal mean E(x₁)
If x₂ is greater than average, we expect x₁ to be greater than average
Conditional variance of x₁ given x₂:
σ₁²(1 - ρ²)
Since -1 < ρ < 1, this is no larger than the marginal variance σ₁²
Observations of conditional joint distributions
Conditional mean > marginal mean:
If the observed x₁ exceeds its expectation and Corr(x₁, x₂) > 0,
then X₂ is likely to exceed its expectation
Expectation of a product of independent variables is the product of the expectations
Use PGF to show!!
Sums of independent variables - PGF
If x1… Xn are independent discrete random variables taking non-negative integer values, the pgf of their sum is the product of their PGFs
If all the Xs are IID : pgf to power of n
Pgf only defined for discrete!!!!
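A numeric check, assuming two independent fair dice (an illustration, not from the cards): the pgf of the sum evaluated at a point equals the product of the individual pgfs.
```python
import numpy as np

def pgf(pmf, s):
    """G_X(s) = E[s^X] = sum_k P(X = k) s^k for a pmf indexed from 0."""
    return sum(p * s**k for k, p in enumerate(pmf))

# Two independent fair dice, values 1..6 (index 0 has probability 0)
die = np.array([0] + [1/6] * 6)
total = np.convolve(die, die)      # pmf of X1 + X2 (convolution of the pmfs)

s = 0.7
print(pgf(total, s), pgf(die, s) * pgf(die, s))   # the two values agree
```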
Mgf - sums of independent variables
If x1… xn are independent random variables - the mgf of their sum is the product of their mgfs
Sample Mean
X bar is the sample mean
The average of all the observations in a sample
X bar n = Sn/n - its distribution is the sampling distribution of the mean
Strong law of large numbers
P( lim (x bar n) = μ ) = 1 as n tends to infinity
Almost every possible sequence of sample means tends to μ as n → ∞
Chi-Squared Distribution
If z1…zn are independent N(0,1) random variables, the distribution of the sum of squares: ∑ Zi ² is the chi-squared distribution with n degrees of freedom
- the same as the Γ(n/2, 1/2) distribution, with expectation n and variance 2n
T-distribution
Z ~ N(0,1) and U ~ χn²
Z and U are independent
The distribution of the ratio T = Z/√(U/n) is the t-distribution with n degrees of freedom
F-distribution
If U and V are independent r.v.s distributed as χm² and χn² respectively
The distribution of the ratio W = (U/m) / (V/n)
is the F-distribution with m, n degrees of freedom
W ~ Fm,n => 1/W ~ Fn,m
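A simulation sketch of the t-distribution construction (Z and U as above), compared against scipy's t quantiles; scipy.stats is an assumption here, not part of the cards.
```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)
n, reps = 5, 200_000

z = rng.standard_normal(reps)            # Z ~ N(0, 1)
u = rng.chisquare(df=n, size=reps)       # U ~ chi-squared with n degrees of freedom
t_sim = z / np.sqrt(u / n)               # T = Z / sqrt(U/n)

qs = [0.05, 0.5, 0.95]
print(np.quantile(t_sim, qs))            # simulated quantiles
print(stats.t.ppf(qs, df=n))             # theoretical t_n quantiles
```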
Regression line
Y = b0 + b1x
Data points satisfy the n equations: yi = b0 + b1xi + ei
- ei = prediction error
- choose b0, b1 to minimise ∑ ei²
Explanatory variable
Xi values
Independent values
Response variables
yi are the observed (dependent) values of a random variable Yi, whose distribution depends on xi
Regression Curve
The conditional mean E(Y|X = x), as a function of x, is the regression curve of y on x
Linear statistical model
One in which the regression curve is a linear function of the parameters of the model
{ei}
Independent random variables with mean 0 and common variance σ²
Residual sum of squares (RSS)
S(β0, β1) = ∑ (yi - β0 - β1xi)²
Least squares method
A way to estimate β0, β1 by minimising the sum of squared errors
Data must be:
Homoscedastic - variation in y is same for all x - variance is constant
Independent
Least squares error method
To minimise S(β0, β1)
Differentiate w.r.t. β0, β1 and set to zero
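A minimal sketch of the closed-form solution obtained by setting the derivatives to zero (β̂1 = Sxy/Sxx, β̂0 = ȳ - β̂1x̄), on made-up data and checked against numpy.polyfit.
```python
import numpy as np

# Made-up data for illustration
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

# Least squares estimates from the normal equations
sxx = np.sum((x - x.mean()) ** 2)
sxy = np.sum((x - x.mean()) * (y - y.mean()))
b1 = sxy / sxx                      # slope estimate
b0 = y.mean() - b1 * x.mean()       # intercept estimate

print(b0, b1)
print(np.polyfit(x, y, 1))          # same values, returned as [slope, intercept]
```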
Least squares estimate of μ
The least squares estimate of μ minimises the squared errors S(μ) = ∑(yi - μ)²; the minimiser is the sample mean ȳ
Paired test
For data that are not independent - the observations come in pairs!!
Hypothesis testing
1 sample:
- Known variance: Z-test (standard normal)
- Unknown variance: t-test, using the sample variance s²
2 sample:
- Known variance: Z-test (standard normal)
- Unknown variance: t-test with pooled sample variance
- Testing for variances: F-test
2 paired samples:
- Testing for the mean: t-test, reduced to a 1-sample problem by taking the differences of the results as the new tested variable data
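Hedged scipy sketches of the unknown-variance cases above, on made-up samples; scipy.stats is an assumption, not part of the cards.
```python
import numpy as np
from scipy import stats

# Made-up samples for illustration
a = np.array([5.1, 4.9, 5.4, 5.0, 5.3, 4.8])
b = np.array([5.6, 5.4, 5.8, 5.5, 5.9, 5.3])

# Two-sample t-test with pooled sample variance (unknown but equal variances)
t2, p2 = stats.ttest_ind(a, b, equal_var=True)

# Paired test: reduce to a one-sample t-test on the differences
t_paired, p_paired = stats.ttest_rel(a, b)   # equivalent to ttest_1samp(a - b, 0)

print(t2, p2)
print(t_paired, p_paired)
```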
Assumptions for paired sample t-test
Assume that the differences are independent, identically distributed and normally distributed
Transformation formula
Any monotonic increasing or decreasing function!!!
fy(y) = fx(x) |dx/dy| = fx(g⁻¹(y)) |dx/dy|, where y = g(x)
Note the derivative is dx/dy (the derivative of the inverse), not dy/dx
Insert the change of variables
Don’t forget the modulus
Monotonic!!
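A numeric sketch of the formula for a monotone transformation (illustrative choice, not from the cards): if X ~ Exp(1) and Y = X² (monotone on x > 0), then fy(y) = fx(√y)·|dx/dy| = e^(-√y)/(2√y); the simulation agrees with this density.
```python
import numpy as np

rng = np.random.default_rng(4)
x = rng.exponential(1.0, size=500_000)   # X ~ Exp(1), support x > 0
y = x ** 2                               # Y = g(X) = X^2, monotone on the support

# Transformation formula: f_Y(y) = f_X(g^{-1}(y)) * |dx/dy| = exp(-sqrt(y)) / (2*sqrt(y))
ys = np.array([0.25, 1.0, 4.0])
formula = np.exp(-np.sqrt(ys)) / (2 * np.sqrt(ys))

# Empirical density estimates from the simulation (small windows around each y)
h = 0.05
empirical = [np.mean((y > v - h) & (y < v + h)) / (2 * h) for v in ys]
print(formula)
print(empirical)       # close to the formula values
```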
Pgf of the sum is the product of the PGFs
If Sn is a function of Xi's that are mutually independent, any functions of the individual Xi's (such as s^Xi) are also mutually independent.
Thus the expectation of the product is the product of the expectations, so the pgf of the sum is the product of the PGFs
Y has a negative binomial distribution with parameters r and p
Explain how it arises in the context of sequences of Bernoulli trials and explain how y can be regarded as a sum of independent geometrically distributed random variables
The NB(r, p) distribution arises as the distribution of the number of trials required to obtain r successes in a sequence of independent Bernoulli trials, each with success probability p
The Geo(p) dist. is the distribution of the number of trials required to obtain one success. Because the trials are independent, the distributions of the numbers of trials between successes are independent geometric r.v.s, and the total number of trials until the rth success is therefore the sum of r independent geometric r.v.s
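A simulation sketch, assuming r = 3 and p = 0.25 for illustration: summing r independent Geometric(p) "trials until first success" variables reproduces the NB(r, p) "trials until rth success" behaviour (mean r/p).
```python
import numpy as np

rng = np.random.default_rng(5)
r, p, reps = 3, 0.25, 200_000

# numpy's geometric gives the number of trials up to and including the first success
geoms = rng.geometric(p, size=(reps, r))
y = geoms.sum(axis=1)                 # sum of r independent Geometric(p) variables

print(y.mean(), r / p)                # simulated mean vs theoretical NB(r, p) mean r/p
print(y.min())                        # at least r trials are always needed
```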
Suppose X1 and X2 are discrete random variables with means μ1 and μ2
What is meant by saying X1 and X2 are independent
If X1 and X2 are independent, then for all pairs of values (x1, x2): P(X1 = x1, X2 = x2) = P(X1 = x1)P(X2 = x2)
A sum of independent Poisson random variables is another Poisson
If X1…Xn are i.i.d. Poi(μ), then Sn ~ Poi(nμ)
Choosing between estimators
The main things to consider when choosing between estimators are bias and variance
If the bias and the standard error (s.d.) both tend to zero as n tends to infinity, the estimator is consistent
Can also compare estimators on the basis of MSE = variance + bias squared
The estimator with the smaller mean squared error would be preferred as this is likely to yield an estimate that is closer to the true value of the parameter
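A sketch comparing two variance estimators by simulated MSE (dividing by n versus n - 1), assuming N(0,1) data for illustration; the n - 1 version is unbiased, but the n version can have the smaller MSE.
```python
import numpy as np

rng = np.random.default_rng(6)
n, reps, true_var = 10, 300_000, 1.0

x = rng.normal(0.0, 1.0, size=(reps, n))
t_biased = x.var(axis=1, ddof=0)      # divides by n     (biased)
t_unbiased = x.var(axis=1, ddof=1)    # divides by n - 1 (unbiased)

for name, t in [("divide by n", t_biased), ("divide by n-1", t_unbiased)]:
    bias = t.mean() - true_var
    mse = np.mean((t - true_var) ** 2)
    print(name, round(bias, 4), round(mse, 4))   # MSE = Var + bias^2 in each case
```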
Assumptions
Observations are normally distributed
There is no particular a priori reason for this to be the case, in terms of situations in which the normal distribution is known to arise
The measurements have obviously been rounded to a discrete set of values, which suggests that the assumption of normality is probably not realistic here
Probability Generating Function
The pgf of X is Gx(s) = E(s^X); it is defined for discrete random variables taking non-negative integer values