Chapter 6: Random Variables and Probability Distributions Flashcards

1
Q

Random Variable

A

takes numerical values that describe the outcomes of some chance process.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Probability Distribution of a Random Variable

A

gives its possible values and their probabilities

all probabilities
- 0<p<1
- add up to 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Discrete Random Variable

A

doesn’t take a fixed set of possible values with gaps between
- has to be numeric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Probability Distribution Histogram

A

y-axis = probability
x-axis = values of x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define Expected Value (mean) of a Discrete Random Variable

A

to find ______ of x, multiply each possible value by its probability, then add all the products.

mu of x = E(x) = XiPi + X2P2 + X3P3 … = Summation of XiPi

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Interpretation of Expected Value (mean, mu)

A

If the random process of __context__ is repeated for a very large number of times, the average number of __x-context__ we can expect is __expected value__. (decimals OK)

ex. If the random process of __asking a student how many movies they watched this week__ is repeated for a very large number of times, the average number of __movies__ we can expect is __3.23 movies__.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Standard Deviation of a Discrete Random Variable

A

sigma (x) = sqrt (summation (xi - mu(x))^2Pi))

on the formula sheet I think

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Interpretation of Standard Deviation of a Discrete Random Variable

A

The __context__ typically vary by __sigma__ from the mean of __mu__.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Variance of a Discrete Random Variable

A

sigma (x) ^2 = summation (xi - mu(x))^2Pi)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Continuous Random Variables

A

A random variable that can take any value in an interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

VARIABLE

A

What it describes: a characteristic of an individual
typical uses: analyzing the distribution of a specific sample
Types:
1. categorical
- possible values: any
- typical graphs used: pie chart, bar chart
2. quantitative
- possible values: numeric, where averages make sense
- typical graphs used: histogram, boxplot

examples:
variable: gender at birth
type: categorical
possible values: male, female
distribution: varies by sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

RANDOM VARIABLE

A

What it describes: a numerical outcome of a chance process
typical uses: answering questions of a chance process
Types:
1. discrete
- possible values: numeric with gaps between possible values
- typical graphs used: probability distribution table, histogram
2. continuous
- possible values: numeric on an interval of possible values
- typical graphs used: density curve

examples:
how many heads on 5 tosses of a coin?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Adding/Subtracting a Constant C, on a Probability Distribution

A

Shape: stays the same
Center: add/subtract C
Variability: stays the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Multiplying/Dividing a Constant C, on a Probability Distribution

A

Shape: stays the same
Center: multiply/divide by C
Variability: multiply/divide by C

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Adding/Subtracting Random Variables X and Y

A

Mean: mu (x±y) = mu (x) ± mu (y)
Standard deviation: sigma (x±y) = sqrt( sigma (x) ^2 + sigma (y) ^2) –> x and y must be independent random variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

If combine two normal (continuous) random variables

A
  • any sum/difference of independent normal random variables is also normally distributed
  • the mean and standard deviation of the resulting normal distribution can be found using the same rules for mean and variances that we have been using for discrete random variables.
17
Q

Binomial Random Variable

A

Conditions: BINS
binomial formula
describing a binomial distribution: [shape, mean, standard deviation]
10% condition
large count condition

18
Q

Binomial Conditions BINS

A

Binary: success and failures
Independent: each trial is independent of other trials
Number of trials: fixed
Same probability of success for each trial

19
Q

Binomial Formula

A

P(X=x) = nCx * p^x * (1-p)^(n-x)

n = number of trials
x = number of success
p = probability of success

20
Q

Describing a binomial distribution

A

shape: (when n is small)
- roughly symmetric when p is close to 0.5
- skewed right when p<0.5
- skwed left when p>0.5
mean: mu (x) = np
Standard deviation: sigma (x) = sqrt (np(1-p))

21
Q

Binomial 10% and large count condition

A

10% condition: When sampling without replacement, we can treat individual trials as independent as long as n<1/10N
large count condition: np≥10 and n(1-p)≥10 –> can use normal distribution to model a dinomial distribution with mu = np and sigma = sqrt (np(1-p))

22
Q

Geometric Random Variable

A

Conditions: BITS
geometric formula
describing a geometric distribution: [shape, mean, standard deviation]

23
Q

Geometric Conditions BITS

A

Binary: success and failures
Independent: each trial is independent of other trials
Trials until first success
same probability of success for each trial

24
Q

Geometric Formula

A

P(X=x) = (1-p)^(x-1)*p

p = probability of success
x = trial of first success

25
Describing a geometric distribution
shape: skewed right mean: mu (x) = 1/p Standard deviation: sigma (x) = sqrt ((1-p)/p)
26
Interpretation of binomial mean
after many trials, the average number of successes __context__ is __mu (x)__ out of __n__ trials.
27
Interpretation of binomial standard deviation
the number of successes typically varies by __sigma (x)__ from the mean of __mu (x)__ out of __n__ trials.
28
Interpretation of geometric mean
on average, __context__ is expected to be __mu (x)__ before success.
29
Interpretation of geometric standard deviation
__context__ will typically vary by __sigma (x)__ from the mean of __mu (x)__ before success.
30
CDF and PDF
Binomial CDF and Geometric CDF = P(x ≤/≥ #) Binomial PDF and Geometric PDF = P(x = #)