gallery of continuous variables Flashcards

1
Q

Parametrized distributions

A

When we studied discrete random variables we learned, for example, about the Bernoulli(𝑝) distribution. The probability 𝑝 used to define the distribution is called a parameter and Bernoulli(𝑝) is called a parametrized distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what distribution do tosses of a fair coin follow?

A

tosses of fair coin follow a Bernoulli distribution where the parameter 𝑝 = 0.5.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is a key question of statistics?

A

estimate the parameters of a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

give an example of estimating the parameters of a distribution

A

if I have a coin that may or may not be fair then I know it follows a Bernoulli(𝑝) distribution, but I don’t know the value of the parameter 𝑝.

I might run experiments and use the data to
estimate the value of 𝑝.

As another example, the binomial distribution Binomial(𝑛, 𝑝) depends on two parameters 𝑛 and 𝑝.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Uniform Distribution
Parameters, Range, Notation, PDF, CDF, Models

A
  1. Parameters: π‘Ž, 𝑏.
  2. Range: [π‘Ž, 𝑏].
  3. Notation: uniform(π‘Ž, 𝑏) or U(π‘Ž, 𝑏).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

uniform distribution graphs

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Example 1. 1. Suppose we have a tape measure with markings at each millimeter. If we
measure (to the nearest marking) the length of items that are roughly a meter long, the
rounding error will be uniformly distributed between -0.5 and 0.5 millimeters.

A

Many board games use spinning arrows (spinners) to introduce randomness. When spun, the arrow stops at an angle that is uniformly distributed between 0 and 2πœ‹ radians.

  1. In most pseudo-random number generators, the basic generator simulates a uniform distribution and all other distributions are constructed by transforming the basic generator.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

exponential distribution

A
  1. Parameter: πœ†.
  2. Range: [0, ∞).
  3. Notation: exponential(πœ†) or exp(πœ†).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

taxi waiting time

A

If I step out to 77 Mass Ave after class and wait for the next taxi, my waiting time in minutes is exponentially distributed. We will see that in this case πœ† is given by 1/(average number of taxis that pass per minute).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

waiting time for unstable isotope to undergo nuclear decay

A

The exponential distribution models the waiting time until an unstable isotope
undergoes nuclear decay. In this case, the value of πœ† is related to the half-life of the isotope

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

true or false: exponential distribution is the only one that models waiting times

A

False.

there are other distributions that also model waiting times, but the exponential distribution has the property that it is memoryless.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Example of memoryless wait time

A

Suppose that the probability that a taxi arrives within
the first five minutes is 𝑝. If I wait five minutes and, in this case, no taxi arrives, then the
probability that a taxi arrives within the next five minutes is still 𝑝. That is, my previous
wait of 5 minutes has no impact on the length of my future wait!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Example of wait time with memory

A

Suppose I were to instead go to Kendall Square subway station and wait for the next inbound train. Since the trains are coordinated to follow a schedule (e.g., roughly 12 minutes between trains), if I wait five minutes without seeing a train then there is a far greater probability that a train will arrive in the next five minutes.

In particular, waiting
time for the subway is not memoryless, and a better model would be the uniform distribution on the range [0,12].

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is the memorylessness of the exponential distribution analagous to?

A

The memorylessness of the exponential distribution is analogous to the memorylessness
of the (discrete) geometric distribution, where having flipped 5 tails in a row gives no information about the next 5 flips. Indeed, the exponential distribution is precisely the continuous counterpart of the geometric distribution, which models the waiting time for a discrete process to change state. More formally, memoryless means that the probability of waiting 𝑑 more minutes is independent of the amount of time already waited.

In symbols,

𝑃 (𝑋 > 𝑠 + 𝑑 | 𝑋 > 𝑠) = 𝑃(𝑋 > 𝑑).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

proof of memorylessness

A

We know that

(𝑋 > 𝑠 + 𝑑) ∩ (𝑋 > 𝑠) = (𝑋 > 𝑠 + 𝑑),

since the event β€˜waited at least 𝑠 minutes’ contains the event ’waited at least 𝑠 + 𝑑 minutes’. Therefore the formula for conditional probability gives

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

exponential distribution graphs

A
17
Q

Normal Distribution or Gaussian Distribution?

A

In 1809, Carl Friedrich Gauss published a monograph introducing several notions that have become fundamental to statistics: the normal distribution, maximum likelihood estimation, and the method of least squares (we will cover all three in this course).

For this reason,
the normal distribution is also called the Gaussian distribution, and it is by far the most important continuous distribution.

18
Q

Normal Distribution

A
19
Q

standard normal distribution

A

𝑁 (0, 1) has mean 0 and variance 1.

20
Q

Z

A

standard normal random variable

21
Q

πœ™(𝑧)

A

standard normal density

22
Q

normal distribution graphs

A
23
Q

different normal distributions

A
24
Q

three approximate probabilities for the standard normal distribution

A

𝑃 (βˆ’1 ≀ 𝑍 ≀ 1) β‰ˆ 0.68

𝑃 (βˆ’2 ≀ 𝑍 ≀ 2) β‰ˆ 0.95

𝑃 (βˆ’3 ≀ 𝑍 ≀ 3) β‰ˆ 0.99

25
Q

describe a figure to name the three approximate probabilities as areas under the graph of the standard normal pdf πœ™(𝑧).

A
26
Q

symmetry calculations

A

We can use the symmetry of the standard normal distribution about 𝑧 = 0 to make some calculations.

27
Q

The rule of thumb says 𝑃 (βˆ’1 ≀ 𝑍 ≀ 1) β‰ˆ 0.68. Use this to estimate Ξ¦(1).

A

Ξ¦(1) = 𝑃 (𝑍 ≀ 1). In the figure, the two tails (in blue) have combined area 1 βˆ’ 0.68 = 0.32. By symmetry the left tail has area 0.16 (half of 0.32), so 𝑃 (𝑍 ≀ 1) β‰ˆ
0.68 + 0.16 = 0.84.

28
Q

Using R to compute the standard normal cdf

A

Use the R function pnorm(π‘₯, πœ‡, 𝜎) to compute 𝐹 (π‘₯) for N(πœ‡, 𝜎^2)

pnorm(1,0,1)
[1] 0.8413447

pnorm(0,0,1)
[1] 0.5

pnorm(1,0,2)
[1] 0.6914625

pnorm(1,0,1) - pnorm(-1,0,1)
[1] 0.6826895

pnorm(5,0,5) - pnorm(-5,0,5)
[1] 0.6826895

Of course z can be a vector of values
pnorm(c(-3,-2,-1,0,1,2,3),0,1)
[1] 0.001349898 0.022750132 0.158655254 0.500000000 0.841344746 0.977249868 0.998650102

29
Q

Use R to compute 𝑃 (βˆ’1.5 ≀ 𝑍 ≀ 2)

A

This is Ξ¦(2) βˆ’ Ξ¦(βˆ’1.5) = pnorm(2,0,1) - pnorm(-1.5,0,1) = 0.91044

30
Q

Pareto distribution

A