Probability Distributions Flashcards

1
Q

What is a probability distribution?

A

It describes how probabilities are distributed over the values of a random variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What properties define a normal distribution?

A

Symmetrical, bell-shaped curve, mean=median=mode, and fully described by mean and standard deviation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do you calculate probabilities in a binomial distribution?

A

Using the formula involving the number of trials, success probability per trial, and desired number of successes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a Poisson distribution and where is it commonly used?

A

It models the number of events in a fixed interval of time or space and is used in fields like telecommunications and insurance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Describe the characteristics of a uniform distribution.

A

All outcomes are equally likely over the interval.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does the exponential distribution differ from the normal distribution?

A

It is used for modeling time between events and has a rapid decrease in probability from a starting point.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the key features of a geometric distribution?

A

Models the number of trials until the first success and is memoryless like the exponential distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How is the standard deviation calculated in a normal distribution?

A

It is the square root of the variance, which is a parameter of the distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does the shape parameter signify in a Weibull distribution?

A

Indicates the concentration of failure times or the shape of the distribution’s tail.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why is the normal distribution important in statistics?

A

Because many statistical methods assume data follow a normal distribution due to the central limit theorem.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How can you use the central limit theorem with non-normal distributions?

A

By using it to approximate distributions of sample means from any distribution as normal when sample sizes are large.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the difference between discrete and continuous distributions?

A

Discrete distributions count occurrences, while continuous distributions measure outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Explain the concept of the cumulative distribution function (CDF).

A

It describes the probability that a random variable is less than or equal to a certain value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the significance of the mean in a probability distribution?

A

It’s the expected value or the balance point of the distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do you find the median in a probability distribution?

A

By finding the value at which half of the observations lie above and half below.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What role does variance play in probability distributions?

A

It measures how spread out the values are around the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is a hypergeometric distribution?

A

It models the number of successes in a sample without replacement from a finite population.

18
Q

How do z-scores relate to normal distributions?

A

They measure the number of standard deviations an element is from the mean in a normal distribution.

19
Q

What is a negative binomial distribution?

A

It counts the number of failures before a specified number of successes occurs.

20
Q

Why might one use a log-normal distribution?

A

To model variables that are positively skewed, such as income or city sizes.

21
Q

How do you determine if a distribution is skewed?

A

By looking at the skewness coefficient or comparing the mean and median.

22
Q

What is the difference between probability and density in distributions?

A

Probability applies to discrete cases; density applies to continuous cases.

23
Q

How are percentiles calculated in a distribution?

A

By finding the values below which a certain percentage of the data fall.

24
Q

What is a hazard function in survival analysis?

A

It describes the rate at which subjects fail or die over time.

25
Q

Why are probability distributions essential in machine learning?

A

They model the underlying distributions of data features necessary for algorithms like classification and regression.

26
Q

How do you estimate parameters of a distribution?

A

Through maximum likelihood estimation, method of moments, or Bayesian estimation.

27
Q

What is a beta distribution, and when is it used?

A

Used to model variables that are constrained to intervals like percentages or proportions.

28
Q

Describe the process of hypothesis testing using a normal distribution.

A

Set up a null hypothesis about the mean, calculate the z-score for the sample mean, and compare it to a critical value.

29
Q

How do parameter estimates affect the shape of a distribution?

A

Changes in estimates can shift or stretch/compress the distribution curve.

30
Q

What is the law of large numbers, and how does it relate to distributions?

A

It states that averages of samples converge to the population mean as sample sizes increase.

31
Q

What is the difference between parametric and non-parametric distributions?

A

Parametric involves specific distribution with set parameters; non-parametric does not assume a specific distribution.

32
Q

How can you model dependencies between variables using distributions?

A

Through copulas or joint distributions that express how variables influence each other.

33
Q

What is a mixture model in statistics?

A

It represents a combination of two or more probability distributions.

34
Q

How do outliers impact the assumptions of normal distribution?

A

They can cause violations of the assumption that data are normally distributed.

35
Q

How can transformations help in normalizing distributions?

A

By using log, square root, or other transformations to make data more symmetric.

36
Q

What is the importance of tail behavior in risk assessment?

A

Tail behavior helps assess the risk of extreme outcomes in distributions.

37
Q

How do you handle overdispersion in count data?

A

By using models like negative binomial when Poisson assumptions do not hold.

38
Q

Why might someone use Monte Carlo simulations in studying distributions?

A

To model and understand the behavior of random variables and uncertainties in complex systems.

39
Q

What are the implications of heavy-tailed distributions in finance?

A

They can indicate larger risks of extreme outcomes, important for risk management.

40
Q

How do you interpret a probability plot?

A

By comparing the data’s distribution with a theoretical distribution to assess normality or other characteristics.