ch2 Flashcards

1
Q

state space

A

the set of values which a process can take

2
Q

probability of a union of two events

A

p(A) + p(B) - p(A and B)

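A quick way to check the identity is to enumerate a small sample space. The die and events below are made up purely for illustration:

```python
from fractions import Fraction

# Illustrative sample space: a fair six-sided die.
outcomes = set(range(1, 7))
A = {x for x in outcomes if x % 2 == 0}   # "roll is even"  -> {2, 4, 6}
B = {x for x in outcomes if x > 3}        # "roll is > 3"   -> {4, 5, 6}

def p(event):
    """Probability of an event under the uniform distribution."""
    return Fraction(len(event), len(outcomes))

# p(A or B) = p(A) + p(B) - p(A and B)
assert p(A | B) == p(A) + p(B) - p(A & B) == Fraction(2, 3)
```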
3
Q

product rule

A

probability of the joint event A and B is

p(A, B) = p(A and B) = p(A|B)p(B)

4
Q

sum rule, law of total probability

A

p(A) = sumOverB( p(A,B) ) = sumOverB( p(A|B = b)p(B = b) )

where we are summing over all possible states of B

5
Q

marginal distribution

A

the marginal distribution of a subset of a collection of random variables is the probability distribution of the variables contained in the subset

it gives the probabilities of the various values of the variables in the subset without reference to the values of the other variables

6
Q

chain rule of probability

A

permits the calculation of any member of the joint distribution of a set of random variables using only conditional probabilities with successive applications of the law of total probability and product rule

with four variables, the chain rule gives:

p(a, b, c, d) = p(a | b, c, d) * p(b | c, d) * p(c | d) * p(d)

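The factorization can be sanity-checked numerically. The joint distribution below is randomly generated for illustration (three binary variables rather than four, to keep it short):

```python
import itertools
import random

random.seed(0)
# Hypothetical joint distribution over three binary variables (a, b, c).
states = list(itertools.product([0, 1], repeat=3))
weights = [random.random() for _ in states]
total = sum(weights)
joint = {s: w / total for s, w in zip(states, weights)}

def marg(fixed):
    """Marginal probability of the assignment in `fixed` (dict: index -> value)."""
    return sum(p for s, p in joint.items()
               if all(s[i] == v for i, v in fixed.items()))

# p(a, b, c) = p(a | b, c) * p(b | c) * p(c)
for (a, b, c) in states:
    chain = (marg({0: a, 1: b, 2: c}) / marg({1: b, 2: c})   # p(a | b, c)
             * marg({1: b, 2: c}) / marg({2: c})             # p(b | c)
             * marg({2: c}))                                 # p(c)
    assert abs(chain - joint[(a, b, c)]) < 1e-12
```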
7
Q

conditional probability

A

p(A|B) = p(A,B)/p(B), defined if p(B) > 0

8
Q

Bayes rule

A

P(X = x | Y = y) = p(X = x, Y = y)/p(Y = y) = [ p(X = x)p(Y = y | X = x) ] / sumOverX'[ p(X = x')p(Y = y | X = x') ]

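A sketch with made-up diagnostic-test numbers (the prior, sensitivity, and false positive rate below are all hypothetical):

```python
# Hypothetical numbers for a diagnostic test.
prior = 0.01    # p(X = sick)
sens = 0.9      # p(Y = positive | X = sick), the sensitivity
fpr = 0.05      # p(Y = positive | X = healthy), the false positive rate

# Denominator: sum rule over the two states of X.
p_pos = sens * prior + fpr * (1 - prior)

# Bayes rule: p(X = sick | Y = positive)
posterior = sens * prior / p_pos

# Despite the 90% sensitivity, the posterior is only about 15%,
# because the prior (base rate) is so low.
assert 0.15 < posterior < 0.16
```

This is the same calculation that trips up intuition in the sensitivity and base rate fallacy cards.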
9
Q

Sensitivity

A

aka the true positive rate, the recall, the probability of detection

Measures the proportion of actual positives that are correctly identified as such

The probability that a test will be positive when it is supposed to be positive

10
Q

Base rate fallacy

A

If presented with related base rate information (i.e. generic, general information) and specific information (information pertaining only to a certain case), the mind tends to ignore the former and focus on the latter.

11
Q

Generative classifier

A

Classifier that specifies how to generate the data using the class-conditional density p(x | y = c) and the class prior p(y = c)

12
Q

Discriminative classifier

A

Classifier that directly fits the class posterior p(y = c | x). In contrast to generative models, which are models for generating all values of a phenomenon, both those that can be observed in the world and target variables that can only be computed from those observed, discriminative classifiers provide a model ONLY for the target variables.

In simple terms, discriminative models infer outputs from inputs, while generative models model both inputs and outputs.

13
Q

Unconditional or marginal independence

A

Two events X and Y are unconditionally independent iff p(X, Y) = p(X)p(Y)

14
Q

Conditional independence (CI)

A

X and Y are conditionally independent given Z iff the conditional joint can be written as a product of conditional marginals

p(X,Y | Z) = p(X|Z)p(Y|Z)

15
Q

Cumulative distribution function (cdf)

A

the probability that X will take a value less than or equal to x

16
Q

Probability density function (pdf)

A

a function, whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) can be interpreted as providing a relative likelihood that the value of the random variable would equal that sample

17
Q

Variance

A

Measure of the spread of a distribution

18
Q

Standard deviation

A

Square root of the variance, useful since it has the same units as X itself

19
Q

Binomial distribution

A

with parameters n and p is the discrete probability distribution of the number of successes in a sequence of n independent experiments, each asking a yes–no question, and each with its own boolean-valued outcome: a random variable containing a single bit of information: success/yes/true/one (with probability p) or failure/no/false/zero (with probability q = 1 − p)

20
Q

Tail area probabilities

A

The probability that a random variable deviates by a given amount from its expectation

21
Q

Variance

A

Measure of the spread of a distribution, denoted by sigma^2. Defined as

Var[X] = E[(X - mu)^2], where mu = E[X] is the population mean

22
Q

Standard deviation

A

Derived from the variance as std[X] = sqrt(var[X]); useful because it has the same units as X itself

23
Q

Binomial distribution

A

The discrete probability distribution of the number of successes in a sequence of n independent experiments, each asking a yes-no question and each with its own boolean-valued outcome

Pmf: (n choose k) p^k (1-p)^(n-k)
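A minimal pmf implementation (stdlib only), including the binomial coefficient:

```python
from math import comb

def binom_pmf(k, n, p):
    """P(k successes in n independent trials with success probability p)."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

# e.g. probability of exactly 2 heads in 4 fair coin flips = 6/16
assert abs(binom_pmf(2, 4, 0.5) - 6 / 16) < 1e-12
# the pmf sums to 1 over k = 0..n
assert abs(sum(binom_pmf(k, 10, 0.3) for k in range(11)) - 1) < 1e-12
```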

24
Q

Bernoulli distribution

A

Probability distribution of a random experiment w/ exactly two possible outcomes in which the probability of success is the same every time the experiment is conducted

25
Q

Multinomial distribution

A

Variation of binomial distribution involving more than two outcomes.

PMF is [(n!)/(x1! x2! … xk!)] p1^(x1)…pk^(xk) for k possible outcomes and n trials, where xi is the number of times outcome i occurs

26
Q

Poisson distribution

A

Poi(x | k) = e^(-k) [(k^x)/(x!)], where k > 0 is the rate

The first term, e^(-k), is the normalization constant ensuring the distribution sums to 1.

Expresses the probability of a given number of events occurring in a fixed interval of time/space if 1) these events occur with a known constant rate and 2) independently of the time since the last event.
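A sketch of the pmf; the check below uses the fact that the mean of a Poisson equals its rate:

```python
from math import exp, factorial

def poisson_pmf(x, lam):
    """P(x events in a fixed interval, given average rate lam > 0)."""
    return exp(-lam) * lam**x / factorial(x)

lam = 3.0
# mean of the distribution equals the rate parameter
# (truncating the sum at 100 loses negligible tail mass for lam = 3)
mean = sum(x * poisson_pmf(x, lam) for x in range(100))
assert abs(mean - lam) < 1e-9
```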

27
Q

Empirical distribution

A

Fn(t) = (number of elements in the sample <= t) / n
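As code, with a made-up sample:

```python
def ecdf(sample, t):
    """Empirical CDF: fraction of sample values <= t."""
    return sum(1 for v in sample if v <= t) / len(sample)

data = [1.2, 0.5, 3.0, 2.2]   # illustrative sample
assert ecdf(data, 0.0) == 0.0   # below every sample value
assert ecdf(data, 1.2) == 0.5   # two of four values are <= 1.2
assert ecdf(data, 3.0) == 1.0   # at or above every sample value
```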

28
Q

Dirac measure

A

Assigning a size to a set based solely on whether it contains a fixed element x or not.

29
Q

Gaussian distribution

A

Used because of the central limit theorem (which states that the averages of samples of observations of random variables independently drawn from independent distributions converge in distribution to the normal). Physical quantities that are expected to be the sum of many independent processes (eg measurement errors) thus often have distributions that are nearly normal.

Probability density is [1/sqrt(2 pi variance)] e^[-(x - mean)^2/(2 variance)]
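The density as code, cross-checked against the stdlib's NormalDist:

```python
from math import sqrt, pi, exp
from statistics import NormalDist

def gauss_pdf(x, mu, var):
    """Gaussian density with mean mu and variance var."""
    return (1 / sqrt(2 * pi * var)) * exp(-(x - mu)**2 / (2 * var))

# peak height of the standard normal is 1/sqrt(2*pi)
assert abs(gauss_pdf(0.0, 0.0, 1.0) - 1 / sqrt(2 * pi)) < 1e-12
# agrees with the stdlib implementation away from the mean too
assert abs(gauss_pdf(1.5, 0.0, 1.0) - NormalDist(0, 1).pdf(1.5)) < 1e-12
```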

30
Q

Precision of a gaussian

A

The inverse variance of a Gaussian, 1/variance. A high precision means a narrow distribution centered on the population mean.

31
Q

Error function

A

Special function of sigmoid shape that describes diffusion. erf(x) = (2/sqrt(pi)) * integral from 0 to x of e^(-t^2) dt

For nonnegative values of x, the error function has the following interpretation: for a random variable Y that is normally distributed with mean 0 and variance 1/2, erf(x) describes the probability of Y falling in the range [-x, x].
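That interpretation can be checked against the stdlib, comparing math.erf with the cdf of N(0, 1/2):

```python
from math import erf, sqrt
from statistics import NormalDist

# Y ~ N(0, 1/2), i.e. standard deviation sqrt(1/2)
Y = NormalDist(mu=0.0, sigma=sqrt(0.5))

# erf(x) = P(-x <= Y <= x) for nonnegative x
for x in (0.1, 0.5, 1.0, 2.0):
    prob = Y.cdf(x) - Y.cdf(-x)
    assert abs(erf(x) - prob) < 1e-9
```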

32
Q

dirac delta function

A

formed in the limit that variance -> 0, where the gaussian becomes an infinitely tall and infinitely thin “spike” centered at the mean

has the sifting property, which selects out a single term from a sum or integration, since the integrand is only non-zero where x - mean = 0

33
Q

student’s t distribution

A

used since gaussians are sensitive to outliers, as their log probability decays only quadratically with distance from the center

[1 + (1/v)((x-u)/(o))^2]^(-(v+1)/2)

u is the mean, o^2 is the scale parameter, v is the degrees of freedom. the variance is actually (vo^2)/(v-2), defined for v > 2

34
Q

cauchy/lorentz distribution

A

t-distribution with degree of freedom 1. has such a heavy tail that the integral defining the mean doesn't converge

35
Q

laplace distribution

A

another distribution with a heavy tail (low sensitivity to outliers), aka the double-sided exponential distribution

(1/(2b)) * exp(-|x - u|/b)

u is a location parameter and b > 0 is a scale parameter.

mean and mode are both u; variance is 2b^2

puts more density at zero than the gaussian, which is useful for encouraging sparsity in a model

36
Q

exponential distribution

A

special case of the gamma distribution, Ga(x | 1, b), where 1 is the shape and b is the rate parameter. Describes the times between events in a Poisson process (ie a process in which events occur continuously and independently at the constant average rate b)

37
Q

chi-squared distribution

A

special case of the gamma distribution, Ga(x | v/2, 1/2). Distribution of the sum of squared gaussian random variables.

38
Q

erlang distribution

A

special case of the gamma distribution where the shape a is an integer, usually fixed at a = 2, yielding Ga(x | 2, b), where b is the rate parameter.

Events that occur independently with some average rate are modeled with a Poisson process. The waiting times between k occurrences of the event are Erlang distributed. (The related question of the number of events in a given amount of time is described by the Poisson distribution.)

39
Q

beta distribution

A

a family of continuous probability distributions defined on the interval [0,1], parametrized by two positive shape parameters, denoted by alpha and beta, that appear as exponents of the random variable and control the shape of the distribution

has been applied to model the behavior of random variables limited to intervals of finite length in a wide variety of disciplines

Beta(x | alpha, beta) = [x^(alpha-1)(1-x)^(beta-1)] / B(alpha, beta)

where B(alpha, beta) = Gamma(alpha)Gamma(beta)/Gamma(alpha+beta) and Gamma is the gamma function

40
Q

gamma distribution

A

a flexible distribution for positive real valued rvs, x > 0

Defined in terms of two parameters, shape a > 0 and rate b > 0: Ga(T | shape = a, rate = b) = [(b^a)/Gamma(a)] * T^(a-1) * e^(-Tb)

where Gamma is the gamma function

41
Q

gamma function

A

integral from 0 to infinity over u^(x-1)e^(-u) with respect to u

an extension of the factorial function, with its argument shifted down by 1, to real and complex numbers
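The shift-by-1 relation to the factorial, checked with the stdlib:

```python
from math import gamma, factorial, pi, sqrt

# Gamma extends the factorial: Gamma(n) = (n - 1)! for positive integers n
for n in range(1, 10):
    assert abs(gamma(n) - factorial(n - 1)) < 1e-6

# and it is defined between the integers too, e.g. Gamma(1/2) = sqrt(pi)
assert abs(gamma(0.5) - sqrt(pi)) < 1e-12
```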

42
Q

pareto distribution

A

used to model the distribution of quantities that exhibit long tails/heavy tails. For example, word frequencies in English follow Zipf's law. Wealth is similarly skewed, esp in plutocracies like the US.

pdf is k * m^k * x^(-(k+1)) * I(x >= m)

43
Q

Zipf’s law

A

Zipf’s law states that given a large sample of words used, the frequency of any word is inversely proportional to its rank in the frequency table

44
Q

covariance

A

measurement of the degree to which X and Y are (linearly) related

cov[X,Y] = E[XY] - E[X]E[Y], or equivalently E[(X - E[X])(Y - E[Y])]

can take any value between -infinity and +infinity

45
Q

(pearson) correlation coefficient

A

cov[X,Y]/sqrt(var[X]var[Y]). A normalized measure with a finite range: -1 <= corr[X,Y] <= 1. corr[X,Y] = 1 iff Y = aX + b for some a > 0, ie there is a linear relationship between X and Y.

not related to the slope of the regression line, which is actually cov[X,Y]/var[X]

correlation implies dependence, but noncorrelation does not imply independence (another, nonlinear relationship might hold)
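A sketch with made-up data on an exact line y = 2x + 1: the correlation is 1 while the regression slope cov[X,Y]/var[X] is 2.

```python
def mean(v):
    return sum(v) / len(v)

def cov(xs, ys):
    """Empirical cov[X,Y] = E[(X - E[X])(Y - E[Y])]."""
    mx, my = mean(xs), mean(ys)
    return mean([(a - mx) * (b - my) for a, b in zip(xs, ys)])

def corr(xs, ys):
    """Pearson correlation: cov normalized by the standard deviations."""
    return cov(xs, ys) / (cov(xs, xs) * cov(ys, ys)) ** 0.5

x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2 * v + 1 for v in x]              # exact linear relationship

assert abs(corr(x, y) - 1.0) < 1e-12    # perfectly correlated
assert abs(cov(x, y) / cov(x, x) - 2.0) < 1e-12   # regression slope is 2
```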

46
Q

multivariate gaussian/normal (MVN)

A

most widely used joint probability density function for continuous variables; covered more in ch4

47
Q

linearity of expectation

A

the property that the expected value of the sum of random variables is equal to the sum of their individual expected values, regardless of whether they are independent

48
Q

linear transformation of random variable

A

y = f(x) = Ax + b

E[y] = E[Ax + b] = A(mu) + b, where mu = E[x]

cov[y] = cov[Ax + b] = A(cov[x])(transpose of A)

Mean and covariance only fully define the distribution of y if x is Gaussian.
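Both identities can be verified exactly on a small discrete distribution (the points, A, and b below are arbitrary illustrations; numpy assumed available):

```python
import numpy as np

# Hypothetical discrete distribution for x in R^2: three equally likely points.
points = np.array([[0.0, 0.0], [1.0, 2.0], [2.0, 0.0]])
probs = np.array([1 / 3, 1 / 3, 1 / 3])

mu = probs @ points                                    # E[x]
cov_x = (points - mu).T @ np.diag(probs) @ (points - mu)

# Arbitrary linear transformation y = Ax + b.
A = np.array([[1.0, 2.0], [0.0, 1.0]])
b = np.array([5.0, -1.0])

ys = points @ A.T + b                                  # transform each point
mu_y = probs @ ys
cov_y = (ys - mu_y).T @ np.diag(probs) @ (ys - mu_y)

assert np.allclose(mu_y, A @ mu + b)                   # E[y] = A mu + b
assert np.allclose(cov_y, A @ cov_x @ A.T)             # cov[y] = A cov[x] A^T
```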