02 Random Variables Flashcards
What is a random variable?
A numerical measure of the outcomes of a random phenomenon.
There are two kinds of random variables. What are they?
Discrete and continuous.
- Discrete random variables can take a finite number of outcomes (eg shoe sizes)
- Continuous random variables can take an infinite number of values (eg heights)
What is a probability distribution?
A list or graph showing the probabilities of the different possible values the random variable can take
A bar chart showing the probabilities associated with a discrete random variable is called what?
A probability histogram.
How does a probability distribution for a continuous random variable represent the probabilities?
As areas under a probability density curve.
The mean of a probability distribution is also referred to as the…
Expected value, E(X)
How do you calculate the mean of a discrete random variable?
μX = Σxi⋅pi = x1⋅p1 + x2⋅p2 + x3⋅p3 + … + xn⋅pn
What does the variance of a probability distribution measure?
How far the data is from the mean.
- The larger the variance, the more data points that are far from the mean.
- The smaller the variance, the more data points that are close to the mean
How is the variance of a probability distribution calculated?
Var(X) = σ2X = Σ(Xi-μX)2⋅pi = (X1-μ)2⋅p1 + (X2-μ)2⋅p2 + …+ (Xn-μ)2⋅pn
What is the standard deviation?
Standard deviation, σ, is the square root of the variance:
- σ = √Var(X)
What is a population?
- Properties of the probability distribution are population properties.
- The population is the total number of people/trees/chocolates/objects…. described by the probability distribution
What is a sample?
- The data actually measured or observed
- The sample is the group of people/chocolates/trees whose height/size/age you measure/count
What is the Law of Large Numbers?
The mean of many trials approaches the true mean of the distribution as the number of trials increases.
μ and σ are the symbols for the population or sample mean and standard deviation?
Population
x̄ and s are the symbols for the population or sample mean and standard deviation?
Sample
Symbolically, the Law of Large Numbers says what?
x̄ → μ as n → ∞
where n = number of trials
If X is a random variable and a, b are constants, how are μa+bX and σ2a+bX related to μX and σ2X?
- μa+bX = a + b⋅μX
- σ2a+bX = a + b2⋅σ2X
If X, Y and Z are random variables such that Z = X+Y, then how are the means and variances of X, Y and Z related?
- μZ = μX + μY
- σ2Z = σ2X + σ2X if X, Y are independent
If X, Y and W are random variables such that W = X-Y, then how are the means and variances of X, Y and W related?
- μW = μX - μY
- σ2W = σ2X + σ2X if X, Y are independent
If an event has only two possible outcomes, what distribution can we use if we want to know the probability of getting k successes out of a total number of n trials?
The binomial distribution.
What are the 4 criteria for the use of a binomial model?
- There are only two possible outcomes of each observation (success or failure)
- There is a fixed number of observations, n
- The n observations are independent
- The probability of success, p, is the same for each observation
What are the parameters of a distribution?
Parameters are the numbers that completely characterise a distribution. This means that only they are needed to calculate probabilities; nothing else.
What are the parameters of the binomial distribution?
- n* and p; B(n, p)
- n* = the total number of trials
- p* = the probability of obtaining a success
What is the formula for calculating a probability with the binomial distribution?
The probability that there are k successes in n trials is given by P(X=k) = nCk⋅pk⋅(1-p)n-k