Distributions Flashcards

1
Q

when we use the bernoulli distribution and what is that?

A

the Bernoulli Distribution it is used in the yes-no type of problems and is express by the formula q = 1 - p.
FORMULA :
**P(X = x) = p^x (1-p)^1-x

where 0 ≤ p ≤ 1, x ∈ {0, 1}

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

when we use the bernoulli distribution and what is that?

A

The binomial distribution is a discrete probability distribution that describes the number of successes (often denoted as “k”) in a fixed number of independent Bernoulli trials. Each Bernoulli trial has only two possible outcomes: success (usually denoted as “1”) or failure (usually denoted as “0”);
FORMULA
vedere la formula (..)

where
p is the probability
n= number of trials and
x is the value of the experimetnt

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

A fair die is thrown four times. Calculate the probabilities of getting:
0 Twos
1 Two
2 Twos
3 Twos
4 Twos
(Exercise using binomial distribution)

A

page 16-17 of the distributions slide

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is the discrete uniform distribution?

A

The discrete uniform distribution is a symmetric probability distribution wherein a
finite number of values are equally likely to be observed and every one of n values
has equal probability 1/n.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is the geometric distribution?

A

Geometric distribution is a type of discrete probability distribution that represents
the probability of the number of successive failures before a success is obtained in
a Bernoulli trial.
In other words, in a geometric distribution, a Bernoulli trial is repeated until a
success is obtained and then stopped.
FORMULA
P(X = x) = (1 − p)^x−1* p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

If your probability of success is 0.2, what is the probability you meet an
independent voter on your third try? (solve the exercise)

A

page 29 of distributions slide

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is the poisson distribution?

A

It expresses the probability of a given number of events occurring in a fixed
interval of time or space if these events occur with a known constant mean rate
and independently of the time since the last event
FORMULA
p(x; λ) = (e^−λ * λ^x) / x!

where
λ = average number of events in a given interval
x = value of a random experiment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is the continuos uniform distribution?

A

the general form is
f (x) = 1/ b-a

where
x ∈ [a, b], a < b.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is the normal distribution?

A

The normal distribution, also known as the Gaussian distribution, is symmetrical
around its mean usually it have the form of a bell curve
FORMULA:
f (x) = (1/σ * √2π) * e^-1/2(x - µ / σ)^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is the Gamma distribution?

A

see page 16 for the formula

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is the inverse gamma ?

A

see page 17 for the formula

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is the mean of a dataset?

A

The mean, often referred to as the average, is a measure of central tendency that represents the typical or central value of a dataset.

   n µ = ∑    xi / n
   i=1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is the weibull distribution?

A

f (x; α, λ) = αλ(λx) ^ α−1 exp(-(λx)^ α)

x ≥ 0, α, λ > 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is the median of a dataset?

A

the median is the middle value if we sort the value.
We have two cases :
1. if the array is odd we can take the central value
2. if the array is even we have to put the two central value divided by two

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is the mode?

A

the mode is the most frequently occured value, if there are more mode in a set, for instance two we call it bimodal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is the exponential distribution?

A

The exponential distribution is the probability distribution of the time between
events in a Poisson point process.
f (x; λ) = λe^−λx
where
x ≥ 0, λ > 0

13
Q

what is the expectation of a random variables

A

The expectation of a random variable, often denoted as E[x] It represents the average or mean value that you would expect from that random variable over a large number of repetitions or trials.
If we have a discrete case we use this formula

E[X]=∑xi ⋅ P(X=x)
i

14
Q

what is the variance of a random variables?

A

The variance of a random variable is a statistical measure that quantifies the spread or dispersion of the random variable’s values around its mean (expectation)

Var(X) = E[(X−μ)^2]

15
Q

what is the standard deviation of a random variables?

A

we could define it as the square root of the variance

16
Q

what is a sthocastic process?

A

A stochastic process, also known as a random process, is a mathematical concept that describes a collection of random variables evolving over time or another underlying parameter.

17
Q

what is covariance in random variables?

A

Covariance is a statistical measure that quantifies the degree to which two random variables change together, remember that is key concept also for the correlation coefficience

Cov(X,Y)=E[(X−μx)(Y−μy)]

where:
1. μx and μy are the means
2. X and Y are two random variables

18
Q

What is the correlation coefficient of a two random variables?

A

The correlation coefficient is a statistical measure that quantifies the strength and direction of the linear relationship between two random variables.

r = Cov (X,Y) / σX σY

The resulting Pearson correlation coefficient:

r is a value between -1 and 1. It provides the following information:

r=1: Perfect positive linear relationship. The variables move together in a perfect linear manner.

r=−1: Perfect negative linear relationship. The variables move together in a perfect linear manner, but in opposite directions.

r=0: No linear relationship. There is no linear association between the variables.

where
σx = √Var(X) = √E[(X−μx)^2]
​σy = √Var(Y) = √E[(Y−μy)^2]

19
Q

what is the covariance matrix?

A

The covariance matrix is a square matrix that provides a comprehensive summary of the variances and covariances among multiple random variables

Remark: it is important for the PCA that we would to

Remark: in machine learning, the covariance matrix can be a valuable tool for understanding and preprocessing data, as well as for improving the performance and interpretability of models.

20
Q
A