Continuous probability distributions Flashcards

1
Q

difference between discrete and continuous data

A

—Remember: discrete data has a countable number of possible values –>discrete probability distributions can be put in tables
—Continuous data have an infinite number of possible values –>we use a smooth function, f(x) to describe the probabilities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is a good way to represent continuous data?

A

via histograms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

how do observations influence the shape of the histogram?

A

more observations result to a smoother histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

how to represent a continuous random variable?

A

continuous random variable is a function such that the probability that the variable lies in an interval (a, b) is the area under the curve from a to b

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what are features of a probability density function?

A

—f(x) must satisfy the following:

  1. f(x)≥0 for all x, that is, it must be non-negative.
  2. The total area underneath the curve representing f(x) = 1.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what do areas on the histogram represent?

A

—In each case above, the % areas of the histogram boxes (that is, the area of a box as a % of the total area of all the boxes) are providing estimates of the probabilities of intervals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what are possible shapes of f(x)?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

how to evaluate area under curve?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

for continuous probability density function, when x takes on a specific value eg. x=2, what is the area?

A

area = 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

describe the mean and variance of a continuous random variable

A

The mean measures the location of the distribution, the variance measures the spread of the distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Find P(X>0.5)

A

¼

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Find P(X<0.75)

A

—1-P(X>0.75)=1-(½*¼*½)=15/16

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is uniform distribution?

A

—Special sort of distribution for continuous data.
—Described by the function
f(x) = 1/b-a a ≤ x ≤ b

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is expectation and variance of uniform distribution?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

—The length of time patients wait to see a doctor is uniformly distributed between 40 minutes and 3 hours.
—Let X be the waiting time in minutes.

A

as f(x) = 1/b-a

f(x) = 1/180-40

f(x)=1/140

40 less than or equal to X 180 equal to or greater than

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

—The length of time patients wait to see a doctor is uniformly distributed between 40 minutes and 3 hours.
—Let X be the waiting time in minutes.
Find the probability of waiting between 50 minutes and 2 hours.

A

—P(50≤x≤120) = (120-50) x 1/140 = 0.5

17
Q

—The length of time patients wait to see a doctor is uniformly distributed between 40 minutes and 3 hours.
—Let X be the waiting time in minutes.

Find the mean and variance of the distribution of waiting times

A
18
Q

—The length of time patients wait to see a doctor is uniformly distributed between 40 minutes and 3 hours.
—Let X be the waiting time in minutes.
Find the probability of having to wait exactly one hour.

A

—P(X=60) = 0

19
Q

what is another special sort of distribution for continuous data?

A

The Normal Distribution

—The general form of the pdf (probability density function) is given by:

20
Q

describe normal distribution

A

—Bell-shaped, symmetric about µ, reaches highest point at x=µ, tends to zero as x→±∞.

21
Q

<!--StartFragment-->

About the Normal Distribution<!--EndFragment-->

A
  1. E(X) = µ; V(X) = σ².
  2. Area under curve = 1
  3. Different means – shift curve up and down x-axis
  4. Different variances – curve becomes more peaked or more squashed
  5. Shorthand notation: X~N(µ, σ²).
22
Q

what do different means on a normal distribution look like?

A
23
Q

what do different variances on normal distribution look like?

A
24
Q

what is the standard normal distributoin?

A

when μ=0, σ² =1

25
Q

P(Z<-1.08)

A

3) P(Z<-1.08) = P(Z>1.08) by symmetry

= 0.1401 (from (2))

OR P(Z<-1.08) = 0.1401 from tables directly.

26
Q

P(-1.51

A

= P(Z

= 0.8599 – 0.0655

= 0.7944

27
Q

what do you call a random variable from a standard normal distribution?

A

—Call a r.v. from this a Standard Normal r.v., use notation Z~N(0,1)

28
Q

how do you convert any Normal random variable to a Standard Normal Random Variable?

A

—If X~N(μ,σ²), then use the linear transformation below:

the standardised value is known as the Z score

—So, for ANY random variable that comes from a normal distribution, if we subtract the mean and divide by the standard deviation, we get a r.v.~N(0,1).

29
Q

Find P(Z>1.08)

A

= 1 – P(Z<1.08)

= 1 – 0.8599 (from tables)

= 0.1401

30
Q

—Suppose X~N(50, 100²). Find P(X<200)

A
31
Q

—The length of metallic strips produced by a machine are normally distributed with a mean of 100cm and a variance of 2.25cm². Only strips that are between 98 and 103cm are acceptable. What proportion of strips are acceptable?

A
32
Q

—A battery lasts an average of 3 years with a standard deviation of 0.5 years. Assuming that battery lives are normally distributed, find the probability that a given battery will last less than 2.3 years.

A
33
Q

—Salaries of workers in a factory are normally distributed with mean $48,000 and standard deviation $3,500. What is the minimum salary of the top 20% of workers?

A

—From tables, P(Z
So,

P(Z> (a-48000/3500) ) = 0.2

P(Z<0.84) = 0.8 (need to find area below)

P(Z>0.84) = 0.2 (Area above)

Therefore a-48000/3500 = 0.84

a= 50940

34
Q

features of continuous probability distributions

A
  1. f(x)≥0 for all x
  2. Area under curve = 1
35
Q

describe symmetry found in Z tables

A

—Symmetry → P(Z<-a) = P(Z>a)

36
Q
  1. The time of failure for a continuous operation monitoring
    device of air quality has a uniform distribution over a 24
    hour day

If the device has a self-checking computer chip that
determines whether the device is operational every
hour on the hour, what is the probability that it will
take at least 40 minutes to detect that a failure has
occurred?

A
P(Failure takes longer than 40 minutes to detect) = P(failure 
occurs between 12-12.20am; or 1-1.20am; or 2-2.20am …) 
Hence P(failure takes longer than 40 minutes to detect = 24 \* 
1/24 \* 20/60 = 1/3
37
Q
  1. The time of failure for a continuous operation monitoring
    device of air quality has a uniform distribution over a 24
    hour day

If the device has a self-checking computer chip that
determines whether the device is operational every
hour on the hour, what is the probability that a failure
will be detected within 10 minutes of its occurrence?

A

P(failure detected within 10 minutes) = P(failure occurs between
12.50-1am; or 1.50-2am; or 2.50-3am ……)
There are 24 windows each 10 minutes wide during which the
failure could occur to be detected within 10 minutes. Hence P(failure detected within 10 minutes) = 24 * 1/24 * 10/60
= 1/6
d. If the devi

38
Q

The time of failure for a continuous operation monitoring
device of air quality has a uniform distribution over a 24
hour day.
a. If a failure occurs on a day when it is daylight between
5.55am and 7.38pm, what is the probability that the
failure will occur during daylight hours?

A

Let X be the time of failure, past midnight, in hours.
X has a uniform distribution, with positive probability from a
lower bound of 0 to an upper bound of 24.
f(x) = 1/24, 0<x><br></br>P(failure occurs in daylight) = P(failure is between 5.55am and<br></br>7.38am)<br></br>This probability is calculated by finding the area of the rectangle<br></br>with height 1/24 and base length (5.55am to 7.38pm) = 13 hours<br></br>and 43 minutes = 13 43/60 hours.<br></br>Hence P(failure in daylight) = 1/24 * (13 43/60) = 823/1440</x>