Probability & Statistics Flashcards
Definition of a Sample Space
The sample space is the set of all possible outcomes for the experiment. It is denoted by S
Definition of an Event
An event is a subset of the sample space. The event occurs if the actual outcome is an element of this subset.
Definition of a Simple Event
An event is a simple event if it consists of a single element of the sample space S
Meaning of Disjoint events
We say two sets A and B are disjoint if they have no element in common, i.e., A ∩ B = ∅
De Morgan’s laws
(A ∪ B)c = Ac ∩ Bc
(A ∩ B)c = Ac ∪ Bc
Kolmogorov's axioms for probability
a) For every event A we have P(A) >= 0,
b) P(S) = 1,
c) If A1, A2, …, An are n pairwise disjoint events, then
P(A1 ∪ A2 ∪ … ∪ An) = P(A1) + P(A2) + … + P(An)
Complement Rule for Probability
If A is an event then
P(Ac) = 1 - P(A)
Probability of an Empty Set
P(∅) = 0
Probability of an Event Upper Bound
If A is an event then P(A) <= 1
Probability of a Subset
If A and B are events and A ⊆ B then
P(A) <= P(B)
Probability of a Finite Event
The probability of a finite event is the sum of the probabilities of the simple events it contains.
Inclusion-exclusion for two events
P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
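For example, for one roll of a fair die, with A = {1, 2, 3, 4} and B = {3, 4, 5}: P(A ∪ B) = 4/6 + 3/6 − 2/6 = 5/6.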
Inclusion-exclusion for three events
P(A∪B∪C) = P(A)+P(B)+P(C)−P(A∩B)−P(A∩C)−P(B∩C)+P(A∩B∩C)
Ordered with replacement (repetition allowed)
n^r
Ordered without replacement (no repetition)
n! / (n−r)!
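For example, the number of ways to award gold, silver and bronze among 8 runners is 8!/(8 − 3)! = 8 × 7 × 6 = 336.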
Conditional Probability
If E1 and E2 are events and P(E1) ≠ 0 then the conditional probability of E2 given E1, usually denoted by P(E2|E1), is
P(E2|E1) = P(E1 ∩ E2) / P(E1)
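For example, for one roll of a fair die, with E1 = "the roll is even" and E2 = "the roll is 6": P(E2|E1) = (1/6)/(1/2) = 1/3.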
Unordered without replacement (no repetition)
nCr = n! / (r!(n−r)!)
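For example, 5C2 = 5!/(2! 3!) = 10.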
Definition of Independence
We say that the events E1 and E2 are (pairwise) independent if
P(E1 ∩ E2) = P(E1)P(E2)
When are three events E1, E2, and E3 called pairwise independent
P(E1 ∩ E2) = P(E1)P(E2),
P(E1 ∩ E3) = P(E1)P(E3),
P(E2 ∩ E3) = P(E2)P(E3).
When are three events E1, E2, and E3 called mutually independent
They are mutually independent if they are pairwise independent and, in addition,
P(E1 ∩ E2 ∩ E3) = P(E1)P(E2)P(E3)
When are two events E1 and E2 said to be conditionally independent given an event E3
P(E1 ∩ E2|E3) = P(E1|E3)P(E2|E3)
Definition of Random Variable
A random variable is a function from S to R
Definition of Discrete Random Variables
A random variable X is discrete if the set of values that X takes
is either finite or countably infinite.
Definition of Probability Mass Functions
The probability mass function (p.m.f.) of a discrete random
variable X is the function which given input x has output P(X = x)
Sum of Probabilities for a Discrete Random Variable
The outputs of the p.m.f. must sum to 1: Σ_x P(X = x) = 1
Definition of Expectation
If X is a discrete random variable which takes values x1, x2, x3, . . ., then the expectation of X (or the expected value of X) is defined by
E(X) = x1P(X = x1) + x2P(X = x2) + x3P(X = x3) + · · · .
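For example, for one roll of a fair die, E(X) = (1 + 2 + 3 + 4 + 5 + 6)/6 = 3.5.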
Bound on Expectations of a Random Variable
If every value taken by X lies between m and M, then
m ≤ E(X) ≤ M
Expectation of a function
E( f(X) ) = f(x1)P(X = x1) + f(x2)P(X = x2) + f(x3)P(X = x3) + · · ·
Definition of Moments
The nth moment of the random variable X is the expectation E(X^n)
Definition of Variance
Var(X) = [x1 − E(X)]²P(X = x1) + [x2 − E(X)]²P(X = x2) + [x3 − E(X)]²P(X = x3) + …
Variance formula
Var(X) = E(X^2) − [E(X)]^2
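For example, for one roll of a fair die, E(X²) = (1 + 4 + 9 + 16 + 25 + 36)/6 = 91/6, so Var(X) = 91/6 − 3.5² = 35/12 ≈ 2.92.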
Linear function of expectation
E(aX + b) = aE(X) + b
Linear function of variance
Var(aX + b) = a^2Var(X)
What is a Bernoulli(p) distribution:
It is the distribution of a random variable X which takes only the values 0 and 1, with P(X = 1) = p and P(X = 0) = 1 − p
Bernoulli distribution Expectation and Variance
E(X) = p, Var(X) = p(1 − p)
What is Binomial distribution:
A discrete random variable X has the Binomial(n, p) distribution, denoted X ∼ Bin(n, p), if its p.m.f. is :
P(X = k) = nCk × p^k × (1 − p)^(n−k) for k = 0, 1, …, n
Binomial distribution Expectation and Variance
E(X) = np, Var(X) = np(1 − p)
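As a quick numerical check, a minimal sketch in Python (assuming scipy is installed; the values n = 10, p = 0.3 are illustrative):

    from scipy.stats import binom

    n, p = 10, 0.3
    # Mean and variance from the distribution object
    print(binom.mean(n, p))    # 3.0  (= n*p)
    print(binom.var(n, p))     # 2.1  (= n*p*(1-p))
    # A single p.m.f. value: P(X = 3) = 10C3 * 0.3^3 * 0.7^7
    print(binom.pmf(3, n, p))  # about 0.2668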
What is Geometric distribution:
A discrete random variable X has the Geometric(p) distribution, denoted X ∼ Geom(p), if its p.m.f. is :
P(X = k) = p(1 − p)^(k−1) for k = 1, 2, 3, …
Geometric distribution Expectation and Variance
E(X) = 1/p
Var(X) = (1 − p)/p²
What is Hypergeometric distribution
The hypergeometric distribution describes the number of successes in l draws, without replacement, from a population of size n containing m successes:
P(X = k) = mCk × (n−m)C(l−k) / nCl
Hypergeometric distribution Expectation and Variance
E(X) = l × (m/n)
Var(X) = l × (m/n) × ((n−m)/n) × ((n−l)/(n−1))
What is Negative binomial distribution
The negative binomial distribution models the number of failures in a sequence of independent and identically distributed Bernoulli trials before a specified number of successes occurs
P(X = k) = (k+r−1)C(r−1) × p^r × (1 − p)^k for k = 0, 1, 2, …
Negative Binomial distribution Expectation and Variance
E(X) = r(1 − p)/p
Var(X) = r(1 − p)/p²
What is Uniform distribution
The uniform distribution is a probability distribution in which all outcomes are equally likely:
P(X = k) = 1/(n + 1) if m ≤ k ≤ m + n
(discrete) Uniform distribution Expectation and Variance
where n = b − a and m = a when X is uniform on {a, a + 1, …, b}:
E(X) = m + n/2
Var(X) = n(n + 2)/12
What is Poisson distribution
Poisson distribution expresses the probability of a given number of events occurring in a fixed interval of time if these events occur with a known constant mean rate.
P(X = k) = (λ^k / k!) × e^(−λ) for k = 0, 1, 2, …
Poisson distribution Expectation and Variance
E(X) = λ, Var(X) = λ
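A minimal sketch in Python (assuming scipy is installed; λ = 2.5 is an arbitrary illustrative rate):

    from scipy.stats import poisson

    lam = 2.5
    print(poisson.pmf(0, lam))   # P(X = 0) = e^(-2.5), about 0.082
    print(poisson.pmf(3, lam))   # (2.5^3 / 3!) * e^(-2.5), about 0.214
    print(poisson.mean(lam), poisson.var(lam))  # both equal lambda = 2.5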
Cumulative distribution function
The cumulative distribution function (c.d.f.) of a random variable X is the function which given t has output
P(X ≤ t).
Moment Generating function
Let X be a discrete random variable which takes integer values. The moment generating function (m.g.f.) of X is the function which given t has output E(e^(tX))
Definition of a Continuous random variable
We say that a random variable X is a continuous random variable if there exists a
continuous function fX from R to [0, ∞) with the following property:
P(a ≤ X ≤ b) = ∫_a^b fX(t) dt
Expectation and Variance of crv
E(X) = ∫ t fX(t) dt (integrating over all of R)
Var(X) = E(X^2) − (E(X))^2
(continuous) Uniform Expectation and Variance
E(X) = (a+b)/2
Var(X) = (b-a)^2/12
Exponential distribution
It is often used to model the time elapsed between events
fX(t) = λe^(−λt) for t ≥ 0 (and fX(t) = 0 for t < 0)
Exponential distribution Expectation and Variance
E(X) = 1/λ
Var(X) = 1/λ^2
Joint Probability mass function
Let X and Y be two discrete random variables defined on the same sample space and taking values x1, x2, . . . and y1, y2, . . . respectively. The
function
(xk, yl) → P( (X = xk) ∩ (Y = yl) )
is called the joint probability mass function of X and Y
Marginal Probability
P(X = xk) = Σ_l P(X = xk, Y = yl),
and similarly with the roles of X and Y exchanged.
The idea is that if we only care about the probability of X taking a particular value, we need to sum over all possible values of Y
Expectations of 2 Variables
If g(X, Y ) is a real-valued function of the two discrete random variables X and Y then the expectation of g(X, Y ) is obtained as
E( g(X, Y) ) = Σ_k Σ_l g(xk, yl)P(X = xk, Y = yl)
Linearity of Expectation with multiple variables
If X and Y are discrete random variables then
E(X + Y ) = E(X) + E(Y )
Independence for Random Variables
Two discrete random variables X and Y are independent if
the events “X = xk” and “Y = yl” are independent for all possible values xk, yl
Covariance of X, Y
The covariance of X and Y is defined by:
Cov(X, Y) = E[ (X − E(X))(Y − E(Y)) ]
Correlation coefficient of X and Y
ρ(X, Y) = Cov(X, Y) / √( Var(X)Var(Y) )
Formula for Covariance (easy)
Cov(X, Y ) = E(XY ) − E(X)E(Y )
Normal distribution formula
A random variable X ∼ N(µ, σ²) has p.d.f.
fX(x) = (1/(σ√(2π))) e^(−(x − µ)²/(2σ²))
Normalisation
Using the substitution z = (x − µ)/σ one can confirm that the
p.d.f. is normalised
Standardisation
When you standardise a normal distribution, the mean becomes 0 and the standard deviation becomes 1
z-scores
The z-score of a value x is z = (x − µ)/σ, the number of standard deviations x lies from the mean. Let Z ∼ N(0, 1) denote a standard normal random variable.
Quartiles and the Median
For X ∼ N(µ, σ²) the median is Q2 = µ, and the quartiles are Q1 ≈ µ − 0.675σ and Q3 ≈ µ + 0.675σ
Z-Score and Standard Normal Distribution Lemma
If X ∼ N(µ, σ²) then Z = (X − µ)/σ ∼ N(0, 1), so probabilities for X can be computed from the standard normal c.d.f.
Moment Generating Function of a Normal Random Variable
If X ∼ N(µ, σ²) then E(e^(tX)) = e^(µt + σ²t²/2)
Properties of Expectation and Variance for Independent Discrete Random Variables
If X and Y are independent then E(XY) = E(X)E(Y) and Var(X + Y) = Var(X) + Var(Y)
Variance of a Linear Combination of Independent Discrete Random Variables
If X1, X2, …, Xn are independent then Var(a1X1 + a2X2 + … + anXn) = a1²Var(X1) + a2²Var(X2) + … + an²Var(Xn)
Properties of Correlation for Discrete Random Variables
−1 ≤ ρ(X, Y) ≤ 1; if X and Y are independent then ρ(X, Y) = 0, and ρ(X, Y) = ±1 exactly when Y is a linear function of X
Central Limit Theorem
If X1, X2, …, Xn are independent, identically distributed random variables with mean µ and variance σ², then for large n the sum X1 + X2 + … + Xn is approximately N(nµ, nσ²); equivalently, (X̄ − µ)/(σ/√n) is approximately N(0, 1)
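A minimal simulation sketch in Python (assuming numpy is installed), illustrating the theorem for Uniform(0, 1) samples, which have µ = 1/2 and σ² = 1/12:

    import numpy as np

    rng = np.random.default_rng(0)
    # 10,000 sample means, each of n = 30 independent Uniform(0, 1) draws
    means = rng.random((10_000, 30)).mean(axis=1)
    print(means.mean())  # close to 0.5, the population mean
    print(means.std())   # close to sqrt((1/12)/30), about 0.0527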
Sum of Independent Normal Random Variables
If X1, X2, …, Xn are independent normal random variables each with mean µ and variance σ², then the sum X = X1 + X2 + ⋯ + Xn is also a normal random variable, with X ∼ N(nµ, nσ²): the mean of X is nµ and the variance of X is nσ²
What is a survey?
A survey is the collection of data from a sample of the population.
What is an observational study?
In an observational study researchers observe the behaviour of individuals
without trying to influence the outcome of the study
What is an experiment?
In a designed experiment researchers apply some treatment to the units under
investigation and measure the response.
Quantitative data
Continuous variables/data are variables which are given in
terms of real numbers
Discrete variables/data are variables which are given by integers
Qualitative data
Categorical variables/data are
variables which are expressed in terms of categories
Ordinal variables/data are variables which are expressed in
terms of ordered categories
Sample Mean
x̄ = (x1 + x2 + … + xn)/n. The random variable X̄ is called the estimator of the mean, and the computed value x̄ is called the point estimate
Median
For the sorted values x(1) ≤ x(2) ≤ … ≤ x(n):
If n is odd, Q2 = x((n+1)/2)
If n is even, Q2 = average( x(n/2), x(n/2 + 1) )
Population Variance
σ̂² = (1/n) Σ (xi − x̄)². This is a biased estimator
Sample Variance
s² = (1/(n − 1)) Σ (xi − x̄)². This is an unbiased estimator of the population variance
Interquartile Range
IQR = Q3 − Q1
Five-number summary
the minimum mx, the quartiles Q1, Q2 (the median) and Q3, and the maximum Mx
Sample covariance
sxy = (1/(n − 1)) Σ (xi − x̄)(yi − ȳ)
Sample linear correlation coefficient
r = sxy / (sx sy), where sx and sy are the sample standard deviations
What is a statistic?
A statistic is any quantity computed from the values in a sample
Expectation and Variance of Sample Mean
E(X̄) = µ and Var(X̄) = σ²/n, where µ and σ² are the population mean and variance
Sample Proportion
p̂ = (number of individuals in the sample with the property of interest)/n
Unbiased Estimator
An estimator of a given parameter is said to be unbiased if its expected value is equal to the true value of the parameter
Bias in Estimation of Variance
E(σ̂²) = ((n − 1)/n)σ², so σ̂² is biased: it slightly underestimates the variance
Mean Square error
MSE(θ̂) = E[ (θ̂ − θ)² ]: it measures the average squared difference between the estimator and the true parameter value
Confidence interval for the sample mean
x̄ ± z(α/2) × σ/√n, where z(α/2) is the value with P(Z > z(α/2)) = α/2 (for large n the sample standard deviation s may replace σ)
Level of confidence
A fraction 1 − α of such intervals contains the population mean µ. We call 1 − α the level of confidence
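A minimal sketch in Python (assuming numpy and scipy are installed; the data values are made up for illustration, and for a sample this small a t-interval would normally be used, the z-interval is shown to match the formula above):

    import numpy as np
    from scipy.stats import norm

    data = np.array([4.8, 5.1, 5.0, 4.9, 5.3, 4.7, 5.2, 5.0])
    n, xbar = len(data), data.mean()
    s = data.std(ddof=1)             # sample standard deviation (divides by n - 1)
    z = norm.ppf(1 - 0.05 / 2)       # about 1.96 for a 95% interval (alpha = 0.05)
    half = z * s / np.sqrt(n)
    print(xbar - half, xbar + half)  # the 95% confidence interval for mu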
Confidence interval for the sample proportion
p̂ ± z(α/2) × √( p̂(1 − p̂)/n )
What is a hypothesis?
A hypothesis is a statement about a parameter θ of
the pmf (or pdf ) of a random variable X
Null Hypothesis
In hypothesis testing, the central claim, the null hypothesis H0, is a statement θ = θ0 which we intend to find evidence against
Significance level
Assume the null hypothesis H0 is valid. We say a type-I error has occurred if the test procedure for H0 rejects the null hypothesis. The probability of a type-I error occurring is called the significance level α of the
test procedure
Alternative hypothesis
The test procedure tests the null hypothesis H0 against the so called alternative hypothesis H1. The alternative hypothesis specifies under which conditions the null hypothesis should be rejected.
Two-tailed test
If we test H0 : θ = θ0 against H1 : θ ≠ θ0 we need a two-sided test
Right-tailed test
If we test H0 : θ = θ0 against H1 : θ > θ0 we need a right-tailed test
Left-tailed test
If we test H0 : θ = θ0 against H1 : θ < θ0 we need a left-tailed test
Type-I error
A type-I error occurs if the test rejects the null hypothesis H0 even though H0 is valid
Type-II error
A type-II error occurs if the test does not reject H0 even though H0 is false
Definition of the Power
The probability of a type-II error is denoted by β; 1 − β is called the power of the test
Definition of P-value
the P-value of the observation is the probability of observing a sample statistic at least as extreme as the observation, under the assumption that the null hypothesis is true
At significance level α the null hypothesis is rejected if the P-value obeys P < α.
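A minimal sketch in Python of a two-tailed z-test (assuming scipy is installed; the summary numbers are hypothetical):

    from scipy.stats import norm

    # H0: mu = 5.0, known sigma = 0.5, sample of n = 25 with mean 5.2
    xbar, mu0, sigma, n = 5.2, 5.0, 0.5, 25
    z = (xbar - mu0) / (sigma / n ** 0.5)  # observed test statistic: 2.0
    p_value = 2 * norm.sf(abs(z))          # two-tailed P-value, about 0.0455
    print(p_value < 0.05)                  # True: reject H0 at alpha = 0.05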
Interpretation of P-values
The smaller the P-value, the stronger the evidence against the null hypothesis
Implications of Conditional Probability
The multiplication rule
P(E1 ∩ E2) = P(E1)P(E2|E1), provided P(E1) ≠ 0
The partition of events
Events B1, B2, …, Bn form a partition of the sample space S if they are pairwise disjoint and B1 ∪ B2 ∪ … ∪ Bn = S
Law of total probability
If B1, B2, …, Bn is a partition of S then P(A) = Σ_k P(A|Bk)P(Bk)
Total Probability for Conditional Events
P(A|C) = Σ_k P(A|Bk ∩ C)P(Bk|C), for a partition B1, …, Bn of S
Bayes’ Theorem
P(Bk|A) = P(A|Bk)P(Bk) / P(A) = P(A|Bk)P(Bk) / Σ_j P(A|Bj)P(Bj)
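For example, suppose 1% of a population has a disease, the test detects it with probability 0.99, and it gives a false positive with probability 0.05. With B1 = "has the disease" and A = "tests positive": P(B1|A) = (0.99 × 0.01)/(0.99 × 0.01 + 0.05 × 0.99) = 0.0099/0.0594 ≈ 0.17.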
Conditional expectation
E(X|A) = Σ_k xk P(X = xk|A)
Law of total probability of expectations
If B1, B2, …, Bn is a partition of S then E(X) = Σ_k E(X|Bk)P(Bk)