Continuous Probability Distributions Flashcards

1
Q

What is a probability distribution?

A
  • A graph, table or formula that specifies the probability associated with each possible value the random variable can assume.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does a probability distribution represent and what values can it hold?

A
  • This probability distribution represents the theoretical population distribution.
  • A probability distribution can have values that are a countable number of values (Discrete).
  • Alternatively, a probability distribution can have values corresponding to any of the points in a interval or intervals (Continuous).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What symbols make up a Normal Probability Distribution?

A
x = Continuous random variable
μ= Population Mean
σ= Population Standard Deviation
π = 3.14159, e = 2.71828
  ~N(μ,σ)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What symbols make up a Standard Normal Probability Distribution?

A

z = Standardized x
z = (x - μ)/σ
~N(0,1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Study the equations for a Normal Probability Distribution and a Standard Normal Probability Distribution

A

https://docs.google.com/document/d/1r_ttbYs-4jXdkBbVGPH9vk1swjRRmRJUWdllcJXdaAI/edit?usp=sharing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Provide a summary of a Standard Normal Distribution.

A
  • A symmetrical, continuous “bell” shaped curve
  • Common distribution for scale data
  • Parameters of a Standard Normal distribution :
  • μ = 0, μ = 1, or ~N(0,1)
  • z values range from -∞ to +∞
  • The z value is z standard deviations from a mean of zero (z=2)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What can be used to find the value of z in a Standard Normal Distribution?

A
  • Again we have a table that can be used to obtain the probability of getting a certain value (z) within a distribution of Z values.
  • The Standard Normal table gives the cumulated probability between the mean (0) and the z value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Study the use of the Standard Normal Table

A

https://docs.google.com/document/d/1r_ttbYs-4jXdkBbVGPH9vk1swjRRmRJUWdllcJXdaAI/edit?usp=sharing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Carry out the following example.

A
  • Using the Standard Normal Table, find the area of Z from the mean (area goes from 0 to 1.90)
    Answer:
  • First look up the area between the mean and 1.90 Standard Deviations (i.e. z=1.90)
  • This can be represented as P(0
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How would you find a negative z value using the Standard Normal Table?

A
  • So far we have dealt with positive z values.
  • However, when we have a negative z value, we can still look up the positive z value since the area is the same either side of the curve (symmetrical).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Give an example of how to find the area of a negative z value.

A
  • Negative Z scores P(-1.2z>1.2)

= 0.3849

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do we find a total probability when given two points on the graph from the mean?

A
  • We can add the two probabilities together to obtain a total probability
  • For example: P(-1.42
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How would you find the area if the probability didn’t start at the mean(0)?

A
  • We have to use subtraction to get the answer

- Remember: P(0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How would you find the area if the probability didn’t start at the mean(0) but it does stop at a certain z point?

A
  • Again, we are going to use subtraction to calculate our final probability
  • For example: if the area that we are trying to find starts at 0.42 and ends at 1.18 on the graph, then we first need to find the values of the areas between the mean (0) and both of these points.
  • P(0.42
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How would you find the value of T when just given the probability?

A

P(T

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a Log Normal Distribution?

A
  • A continuous probability distribution of a random variable whose logarithm is normally distributed.
17
Q

What is the difference between a Normal Distribution and a Log Normal Distribution?

A

Normal Distribution:
- Range - infinity to + infinity
- Symmetrical around mean
- No skewness
- Mean = Median = Mode
- Std. Dev. has no relationship with the Mean
Log Normal Distribution:
- Range 0 to + infinity
- Unsymmetrical around mean
- Right hand skewed
- Mode < Median < Mean
- Std. Dev. has a linear relationship with the Mean
- The larger the mean, the larger the Std. Dev.
- Graph of Log Normal distribution in google doc

18
Q

What is the Central Tendency?

A
  • In statistics, a central tendency is a central or typical value for a probability distribution.
19
Q

How is the central tendency measured?

A
  • Mode, median and mean are all meaningful measures of central tendency
20
Q

How do we measure a variables distribution if it is in a single population?

A
  • A single population has an unknown distribution of a random variable
  • If a single sample of this population is being studied then the use of descriptive statistics is needed.
    Useful descriptive statistics:
  • Compare mean, median, mode
  • Skewness, kurtosis
    Useful descriptive graphics (if n>30):
  • Box and Whisker plots
  • Histogram
  • Q-Q plot
21
Q

How is a box and whisker plot beneficial?

A
  • Box and Whisker plots can be useful in summarising the distribution of the data
22
Q

What does unsymmetrical distribution of data in a box plot tell you?

A
  • Unsymmetrical distribution of data around the median (i.e. skewed - most of the data is near zero) and a high number of extreme outliers are indicators of data that is not Normally distributed.
23
Q

What does symmetrical distribution of data in a box plot tell you?

A
  • Symmetrical distribution of data around the median (e.g. equal whiskers) and a low number of outliers are indicators of data that is Normally distributed.
24
Q

How are histograms useful in determining distribution?

A
If a histogram appears to be:
- Appears skewed  
- Appears unsymmetrical
- Non-Normal distribution
or if it:
- Appears not to be skewed 
- Appears symmetrical 
- Possibly a Normal distribution
25
Q

How are Q-Q plots useful in determining distribution?

A
  • The Q-Q plot compares the expected quantiles (from a Normal dist. with the same mean and std dev. as the data) against the observed quantiles
26
Q

What happens when the samples of a population(s) has n<30?

A
  • Population(s) with unknown distribution
  • Several samples (each sample - n<30 usually)
    Useful descriptive graphics (all samples):
  • Clustered Box and Whisker plots
  • Clustered Standard Error plots
  • Spread and Level plots (mean vs var. or Std. Dev.)
    DO NOT USE:
  • Box and Whisker plot – using all the data combined
  • Histogram – using all the data combined
  • Q-Q plot – using all the data combined
27
Q

What is the benefit of using a Clustered box plot for populations where each sample has n<30?

A
  • We can visually check the number of outliers and the symmetry of each sample using the Box plot.
  • Example is in google doc
28
Q

What is the benefit of using a Standard Error plot for populations where each sample has n<30?

A

They can identify whether:

  • There is no relationship between the mean and the variation around the mean, therefore, could be a Normal distribution underlying the data
  • If there is a relationship between the mean and the variation around the mean, therefore, non Normal distribution
  • Examples of S.E. plots on google doc
29
Q

What is the benefit of using a Spread and Level plot for populations where each sample has n<30?

A
  • They can identify the relationship between the mean and standard deviation which will be useful in determining if the data is normally distributed or not.
  • If there is a relationship between the mean and the standard deviation the underlying distribution is probably not normal. What type of distribution has this type of relationship?
  • Examples in google doc
30
Q

What is another version of the Spread and Level Plot?

A
  • Instead of plotting the mean against the Standard deviation, you can plot the mean against the Variance
  • If there is no relationship between the mean and the variance, then the data is normally distributed.