Probability Flashcards
Sample space and Events (Def)
The set of all possible outcomes in an experiment is called a SAMPLE SPACE (denoted S or Ω).
An EVENT is any subset of S
Categories of Sample Spaces
There are 3 categories of Sample Spaces:
- FINITE number of elements
- INFINITE COUNTABLE
- INFINITE UNCOUNTABLE
Independent events (Def)
Two events E1 and E2 are independent if the occurrence of E1 does not affect the probability of E2, and vice versa, i.e. P(E1 ∩ E2) = P(E1) * P(E2)
Multiplication principle
Suppose we have n independent events E_1, E_2, … , E_n. If event E_k has m_k possible outcomes (for k = 1, 2, … , n), then there are
m_1 * m_2 * … * m_n
possible ways for these events to occur jointly
k-Permutations w/o repetition
A way of selecting k objects from a list of n.
- The order of selection matters
- Each object can be selected only once
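Number of such permutations: n! / (n - k)!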
k-Permutations w/ repetition
A way of selecting k objects from a list of n.
- The order of selection matters
- Each object can be selected more than once
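Number of such permutations: n^k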
Combinations w/o repetition
A way of selecting k objects from a list of n.
- The order of selection does NOT matter
- Each object can be selected only once
Aka n-choose-k
This is also the BINOMIAL COEFFICIENT
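Number of such combinations: (n choose k) = n! / ( k! * (n - k)! )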
Combinations w/ repetition
A way of selecting k objects from a list of n.
- The order of selection does NOT matter
- Each object can be selected more than once
Aka the MULTISET COEFFICIENT
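Number of such combinations: ( (n + k - 1) choose k ) = (n + k - 1)! / ( k! * (n - 1)! )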
Combinations w/ repetition (ice cream example)
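E.g. (illustrative numbers): choosing k = 3 scoops from n = 5 flavours, where the order of scoops does not matter and a flavour can repeat:
( (5 + 3 - 1) choose 3 ) = (7 choose 3) = 35 possible combinations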
Definition of Probability
For a given experiment with Sample Space S, PROBABILITY is a real-valued function defined on the events of S:
P: events of S -> [0, 1]
To each event E ⊆ S, the function P assigns a number P(E) ∈ [0, 1]
Axioms of Probability
There are 4 axioms:
(1) For any event E ⊆ S, 0 <= P(E) <= 1
(2) P(S) = 1
(3) For two disjoint events E and F (i.e. E ∩ F = ∅), P(E ∪ F) = P(E) + P(F)
(4) More generally, (3) extends to any sequence of mutually exclusive events E_1, E_2, …: P(E_1 ∪ E_2 ∪ …) = P(E_1) + P(E_2) + …
Conditional probability formula
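P(A|B) = P(A ∩ B) / P(B), provided P(B) > 0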
Mutual independence (Def)
Events A, B and C are MUTUALLY INDEPENDENT if:
- P(A ∩ B ∩ C) = P(A) * P(B) * P(C)
and
- they are pairwise independent: P(A ∩ B) = P(A) * P(B), P(A ∩ C) = P(A) * P(C), P(B ∩ C) = P(B) * P(C)
Law of Total Probability (Formula)
If {E_1, E_2, … , E_k} is a PARTITION of S, then for any event A:
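P(A) = Σ_(i=1)^k P(A|E_i) * P(E_i)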
Bayes’ rule (Formula)
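P(B|A) = P(A|B) * P(B) / P(A), provided P(A) > 0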
Odds (Formula)
Historically, the likelihood of an event B has been expressed as the ratio between the probability of B and the probability of its complement B^c:
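odds(B) = P(B) / P(B^c) = P(B) / (1 - P(B))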
Odds w/ Bayes’ Theorem
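Writing Bayes' rule for both B and B^c and taking the ratio (the P(A) terms cancel):
P(B|A) / P(B^c|A) = [ P(A|B) / P(A|B^c) ] * [ P(B) / P(B^c) ]
i.e. posterior odds = likelihood ratio * prior odds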
Random Variable (Def)
A RANDOM VARIABLE is a function
X: S -> ℝ
For each element s ∈ S, X(s) is a real number in ℝ
Range space / Support (Def)
The RANGE SPACE (or Support) R_X of a random variable X is the set of all possible realisations of X:
R_X = { X(s) : s ∈ S }
Probability Mass Function (Def)
The PMF of a discrete random variable X is a function
f: R_X -> (0, 1]
such that
f(x) = P(X = x) = p_X(x) for each x ∈ R_X
Key properties:
- f(x) > 0 for every x ∈ R_X
- Σ_(x ∈ R_X) f(x) = 1
- P(X ∈ A) = Σ_(x ∈ A) f(x) for any event A ⊆ R_X
Expected value (Formula)
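E[X] = Σ_(x ∈ R_X) x * f(x)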
Variance (Formula)
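Var[X] = E[ (X - E[X])^2 ] = E[X^2] - (E[X])^2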
Cumulative Distribution Function (Formula)
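F(x) = P(X <= x) = Σ_(t ∈ R_X, t <= x) f(t), for x ∈ ℝ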
Bernoulli distribution (Def)
An experiment that can take two values, 1 (success) and 0 (failure), with
P(X=1) = θ, P(X=0) = 1 - θ
Binomial distribution (Def)
The experiment is repeated n times, with each repetition an independent Bernoulli(θ) trial
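X, the number of successes, satisfies X ~ Binom(n, θ), with PMF
P(X = k) = (n choose k) * θ^k * (1 - θ)^(n - k), for k = 0, 1, … , n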
Poisson distribution (Def)
Describes the number of events occurring within a given interval, with rate λ.
It is commonly used to describe count data
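X ~ Poisson(λ), with PMF
P(X = k) = e^(-λ) * λ^k / k!, for k = 0, 1, 2, …
and E[X] = Var[X] = λ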
Joint PMF (Formula)
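f_XY(x, y) = P(X = x, Y = y)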
Joint PMF (Key properties)
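- f_XY(x, y) >= 0 for every (x, y)
- Σ_x Σ_y f_XY(x, y) = 1
- P((X, Y) ∈ A) = Σ_((x, y) ∈ A) f_XY(x, y) for any event A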
Marginal distribution (Def and Formula)
Let X_i be the i-th component of a k-dimensional random vector X. The distribution function F_(X_i)(x) of X_i is called the MARGINAL DISTRIBUTION of X_i.
For a bivariate discrete r.v., its PMF is:
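f_X(x) = Σ_(y ∈ R_Y) f_XY(x, y), and symmetrically for f_Y(y)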
Conditional PMF (Formula)
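f_X|Y(x|y) = P(X = x | Y = y) = f_XY(x, y) / f_Y(y), provided f_Y(y) > 0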
Conditional vs. Marginal dist (Vis)
Covariance (Formula)
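Cov(X, Y) = E[ (X - E[X]) * (Y - E[Y]) ] = E[XY] - E[X] * E[Y]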
Covariance (Key properties)
(1) A positive value indicates a positive LINEAR relationship, and vice versa
(2) Zero indicates there is no LINEAR relationship, i.e. the variables are UNCORRELATED
Note: independence implies Cov(X, Y) = 0, but Cov(X, Y) = 0 does NOT imply independence
Correlation (Def and Formula)
Correlation is a measure of how strong the linear relationship is between two random variables
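ρ(X, Y) = Cov(X, Y) / (σ_X * σ_Y), which always lies in [-1, 1]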
Independence of r.v. (Def and Key properties)
Two r.v.s X and Y are independent if all events relating to X are independent of all events relating to Y.
The following statements are equivalent:
(1) X and Y are independent
(2) The JOINT PMF of X and Y is the product of the MARGINAL PMFs
(3) The CONDITIONAL distribution of X given Y=y does not depend on y, and vice versa
Multinomial distribution (Def)
n independent trials with k possible outcomes for each trial. Each time, the probability of observing the j-th outcome is θ_j. Denote by X_j the number of times we observe the j-th outcome.
X = [ X_1, X_2, … , X_k]
X_1 + X_2 + … + X_k = n
θ_1 + θ_2 + … + θ_k = 1
Multinoulli (n=1) distribution (E and Var)
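For each component X_j:
E[X_j] = θ_j
Var[X_j] = θ_j * (1 - θ_j)
Cov(X_i, X_j) = -θ_i * θ_j for i ≠ j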
Multinomial distribution (Joint PMF)
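f(x_1, … , x_k) = [ n! / (x_1! * … * x_k!) ] * θ_1^(x_1) * … * θ_k^(x_k)
for non-negative integers with x_1 + … + x_k = n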
Multinomial distribution (E and Var)
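For each component X_j:
E[X_j] = n * θ_j
Var[X_j] = n * θ_j * (1 - θ_j)
Cov(X_i, X_j) = -n * θ_i * θ_j for i ≠ j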
Multinoulli (n=1) distribution (Joint PMF)
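f(x_1, … , x_k) = θ_1^(x_1) * … * θ_k^(x_k)
where exactly one x_j equals 1 and all the others equal 0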
Multinoulli (n=1) distribution (Def)
Multinoulli is a multinomial distribution when n=1, i.e. there is only 1 trial, but still k possible outcomes
Multinomial’s relationship to the Binomial dist
The Binomial is a special case of the Multinomial, where k=2 (i.e. only 2 possible outcomes).
If X ~ Binom(n, θ), then the vector
(X, n-X) ~ Mu(n, (θ, 1-θ))
The transformation theorem
Using the joint PMF/PDF it’s possible to find the expected value of any real function g(X, Y) of X and Y.
Let X, Y be a pair of discrete r.v. and g(X, Y) be any real-valued function of X and Y.
Then if it exists, the expected value of g(X, Y) is defined to be:
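E[ g(X, Y) ] = Σ_x Σ_y g(x, y) * f_XY(x, y)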
Probability Density Function (Def)
For any a <= b, the probability P(a <= X <= b) is the area under the PDF f between a and b:
P(a <= X <= b) = ∫_a^b f(x) dx
where f(x) >= 0 and the total area under f is 1
Continuous CDF (Formula)
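F(x) = P(X <= x) = ∫_(-∞)^x f(t) dt, so that f(x) = F'(x) wherever F is differentiable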
Normal distribution (Def)
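X ~ N(μ, σ^2), with mean μ, variance σ^2, and PDF
f(x) = [ 1 / (σ * sqrt(2π)) ] * exp( -(x - μ)^2 / (2σ^2) ), for x ∈ ℝ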
Calculating probabilities for the Normal distribution
You can calculate P(X<=x) in two stages, using the Standard Normal dist
Z ~ N(0, 1)
(1) Transform P(X<=x) into P(Z<=z), standardizing with z = (x - μ) / σ
(2) Use the CDF of Z to calculate probabilities
Uniform distribution (Def)
X has a uniform distribution over the interval [a, b], written
X ~ U(a, b)
If it has PDF and CDF
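PDF: f(x) = 1 / (b - a) for x ∈ [a, b], and 0 otherwise
CDF: F(x) = 0 for x < a; (x - a) / (b - a) for x ∈ [a, b]; 1 for x > b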
Uniform distribution (Var and E)
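E[X] = (a + b) / 2
Var[X] = (b - a)^2 / 12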
Exponential distribution (Def)
X has an exponential dist with parameter λ>0, if it has PDF and CDF
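PDF: f(x) = λ * e^(-λx) for x >= 0, and 0 otherwise
CDF: F(x) = 1 - e^(-λx) for x >= 0, and 0 otherwise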
Exponential dist (Var and E)
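E[X] = 1 / λ
Var[X] = 1 / λ^2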
Joint PDF for bivariate continuous r.v. (PDF and Key properties)
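The joint PDF f_XY of a bivariate continuous r.v. (X, Y) satisfies:
- f_XY(x, y) >= 0 for all (x, y)
- ∫∫ f_XY(x, y) dx dy = 1 over the whole range space R_XY
- P((X, Y) ∈ A) = ∫∫_A f_XY(x, y) dx dy for any event A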
Marginal PDF for bivariate continuous r.v. (Formula)
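f_X(x) = ∫_(R_Y) f_XY(x, y) dy, and symmetrically f_Y(y) = ∫_(R_X) f_XY(x, y) dx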
Marginal PDF for bivariate continuous r.v. (Examples)
E and Var of a sum of r.v.
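E[X + Y] = E[X] + E[Y] (always)
Var[X + Y] = Var[X] + Var[Y] + 2 * Cov(X, Y)
If X and Y are independent: Var[X + Y] = Var[X] + Var[Y]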
Conditional dist of bivariate continuous r.v. (PDF)
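f_X|Y(x|y) = f_XY(x, y) / f_Y(y), provided f_Y(y) > 0 (and vice versa)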
Conditional dist of bivariate continuous r.v. (Expected value)
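E[X | Y = y] = ∫_(R_X) x * f_X|Y(x|y) dx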
Law of Iterated Expectations
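E[X] = E[ E[X | Y] ] (the outer expectation is over Y, the inner over X given Y)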
Conditions for two r.v. to be independent
(1) f_XY(x, y) = f_X(x) * f_Y(y)
(2) The Joint PDF factorizes into:
f_XY(x, y) = C * g(x) * h(y)
With C some constant (the factorization is not unique)
(3) f_X|Y(x|y) = f_X(x) and viceversa
Conditions (1) and (2) require that the joint range space R_XY is the Cartesian product of R_X and R_Y.
If (2) holds, then the Marginal PDFs of X and Y are proportional to g(x) and h(y), respectively
Joint CDF for bivariate r.v. (Formula)
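F_XY(x, y) = P(X <= x, Y <= y)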
Standard MVN - Multivariate Normal distribution (E and Var)
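Z ~ N(0, I):
E[Z] = 0 (the zero vector)
Var[Z] = I (the k×k identity matrix), i.e. the components are independent standard normals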
MVN - Multivariate Normal distribution (E and Var)
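X ~ N(μ, Σ):
E[X] = μ (the k×1 mean vector)
Var[X] = Σ (the k×k covariance matrix)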
MVN (PDF)
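f(x) = (2π)^(-k/2) * det(Σ)^(-1/2) * exp( -(1/2) * (x - μ)' * Σ^(-1) * (x - μ) ), for x ∈ ℝ^k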
Marginal distributions of MVN
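Every marginal of an MVN is Normal: X_i ~ N(μ_i, Σ_ii). More generally, any subvector of X is MVN with the corresponding subvector of μ and submatrix of Σ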
Conditional distributions of MVN (Formula)
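Partition X = (X_1, X_2), with μ and Σ partitioned conformably. Then
X_1 | X_2 = x_2 ~ N( μ_1 + Σ_12 * Σ_22^(-1) * (x_2 - μ_2), Σ_11 - Σ_12 * Σ_22^(-1) * Σ_21 )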
Law of Large numbers (Def)
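A LAW OF LARGE NUMBERS is a proposition giving conditions under which the sample mean X_n-bar converges to the common expected value μ as the sample size n increases: convergence in probability for a WEAK law (WLLN), almost sure convergence for a STRONG law (SLLN)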
Chebyshev’s WLLN
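One standard statement (the exact hypotheses vary by textbook): if {X_n} are uncorrelated with common mean μ and uniformly bounded variance, then X_n-bar → μ in probability (proved via Chebyshev's inequality)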
Kolmogorov’s SLLN
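If {X_n} is an iid sequence with finite expected value E[X_i] = μ, then X_n-bar → μ almost surely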
CLT (Def)
Let {X_n} be a sequence of r.v.s. Let X_n-bar be the sample mean of the first n terms of the sequence.
A CLT is a proposition giving a set of conditions to guarantee the convergence of the sample mean to a NORMAL DIST, as the sample size increases, i.e. sufficient to guarantee that
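sqrt(n) * (X_n-bar - μ) / σ → N(0, 1) in distribution, as n → ∞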
CLT (Steps to use)
The CLT is used as follows:
(1) we observe a sample consisting of n observations X_1, X_2, … , X_n
(2) If n is large enough, then a standard normal distribution is a good approximation of the distribution of
sqrt(n) * (X_n-bar - μ) / σ
(3) Therefore, we pretend that
sqrt(n) * (X_n-bar - μ) / σ
~ N(0, 1)
(4) As a consequence, the distribution of the sample mean is
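X_n-bar ≈ N(μ, σ^2 / n)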
CLT (Equivalent form)
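Equivalently, stated for the sum instead of the mean:
Σ_(i=1)^n X_i ≈ N(n*μ, n*σ^2), i.e. (Σ X_i - n*μ) / (σ * sqrt(n)) → N(0, 1) in distribution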
CLT - Normal approximation of the binomial
Let X_1, X_2, … , X_n be a sequence of iid Bernoulli(θ) r.v.s. We know that:
- E[X_i] = θ and Var[X_i] = θ*(1-θ)
- X = Σ(X_i) ~ Binom(n, θ)
The CLT tells us that:
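X ≈ N( n*θ, n*θ*(1-θ) ) for large n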
CLT - Normal approximation of the Poisson
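Let X_1, X_2, … , X_n be a sequence of iid Poisson(γ) r.v.s. Then X = Σ(X_i) ~ Poisson(n*γ), with E[X] = Var[X] = n*γ. The CLT tells us that X ≈ N(n*γ, n*γ). Equivalently, a Poisson(λ) r.v. with large λ is approximately N(λ, λ).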