Exam 1 Flashcards
A Geometric model
good for when we’re interested in # of Bernoulli trials until next success
Bernoulli trials
2 possible outcomes (success and failure), probably of success p is constant, the trials are independent
examples of Bernoulli trials
tossing a coin, shooting hoops
a Binomial model
when we’re interested in the number of successes in a certain number of Bernoulli trials
a Normal model
to approximate a binomial when we expect at least 10 successes/failures
Poisson model
when n is large and p is small. good approximation if n>eq 20 peq 100 with p
lambda
only parameter within Poisson model
good model for number of occurrences over given period of time
Poisson (with the parameter the mean of the distribution lambda)
Exponential model
can model the time between two random events
mean time between two events
1 / lambda
when p is small for a large # of cases
Normal model
when checking the probability of this many successes in a row
Binomial for Bernoulli
when asked how many trials until this happens
Geometric for Bernoulli
3 tips for sketching good Normal curve
-(1) bell-shaped and symmetric around its mean, start at the middle and then sketch from left to right, (2) only draw for 3 standard deviations left to right, (3) changes from curving downward to back up is called inflection point and is one standard deviation away from mean
tells how many standard devs a value is from mean
z score
Let y represent value corresponding to
outlying value indicated by a certain z score (e.g. high IQ example)
Table of standard normal distribution
Use it when you’re given a z score and looking for cut off and stuff
Finding a percentile
Use z-table to find value of how many are below that given percentile, then do additional solving for y if needed
Formula for IQR
Q3 - Q1
example of random phenomenon
flipping a coin, two possible outcomes; one toss of coin will consist of a ‘trial’
term for result of one ‘trial’
‘outcome’
term for collection of all possible ‘outcomes’
‘sample space’
definition of empirical probability
[a specific number, what is that called] – says that the long-run relative frequency of repeated independent events (with identical probs.) gets closer and closer to a single value
formula for empirical probability
of times A occurs / # of trials = relative frequency of occurrence A in long run [ex. red light green light, after many days P(green?)?=.35))
When can you NOT do empirical probability assignment
when you cannot repeat events
definition of theoretical probability
for when you cannot repeat events. comes from mathetmical model, not from observations or repetitions. (Ex: American roulette, if you bet on red what is the prob of winning? [18/38])
formula for theoretical probability
P(A) = # of outcomes in A / # of possible outcomes
personal probability
subjective sense based on personal experience and guesswork
definition of formal probability
based on a set of axioms (=a statement we assume to be true) on how probability works
Rule 1
For any event A, 0 <= P(A) <= 1
Rule 2
P(S)=1, the probability of all possible outcomes of a trial must be 1
Rule 3: The complement rule
-the set of outcomes that are not in the event A is called the complement of A, denoted AC
P(A^C) = 1 - P(A)
Addition Rule
P(A or B) = P(A) + P(B) - P(A and B)
Disjoint events
events that have no outcomes in common (and thus cannot occur together). Also called ‘mutually exclusive’
If events are disjoint, then P(A and B) = ??
ONLY if two events are disjoint
0
Pause and do practice exercise on being in a relationship and in sports
Either on powerpoint or in textbook
Conditional probability [of B given A]
P (B I A ) = P (A and B) / P (A)
Contingency table
used for conditional probability, comes up often
Pause and do practice exercise on being a girl and popular
Either in book or slideshow
Shortcut for finding probability of two independent events A and B
P (A and B) = P (A) * P (B I A )
If not independent P(A I B) = ??
0, and definitely not equal to P(A)
marginal probabilities on contingency table
the totals on edges that aren’t the TOTAL
joint probability on contingency table
where probabilities of two things meet on the center part of the table
best diagram for working with conditional probability (probs. with ‘givens’ in them)
tree diagrams
conditioning event on a tree diagram
would be the sooner event
intersectional probability
** check out textbook! **
Reversing the conditioning
This means finding the probability of something working backwards through tree diagram; e.g. knowing something already happened, find the probability that they were this (binge drinker)
Bayes rule
for two branches, a bit complicated probably best to set up tree diagram
Reverse the condition [ctd]
Hard part is recognizing the regular probabilities and conditional probabilities from the problem. Basically you divide the probability of both (e.g. on tree diagram) by the probability of one (which might take some investigating)
Pause and do a practice problem on reversing the conditioning
Binge drinker or from textbook
Random variable (denoted ‘X’)
Two types, discrete and continuous.
specific value (denoted ‘x’)
e.g. P(X=x)
discrete random variables
can take one of a countable number of distinct outcomes (ex. number of credit hours)
continuous random variables
can take any numeric value within a range of values (ex. cost of books this term)
A probability model for a random variable consists of what two things?
(1) collection of all possible outcomes, (2) the probabilities that the values occur
Expected value
E(X) of a discrete random variable can be found by summing the products of each possible value by the prob. that it occurs – write this by hand, E x * P(x)
Variance, formula
E( x - u)^2 * P(x) , write this by hand
Standard deviation, formula
sq root Var (X)
Which formulas apply only to discrete random variables?
Expected value = E x * P(x) // variance = E (x-u)^2 * P(x) // standard deviation = sq rt Variance
Shifting and rescaling random variables by a constant
Adding or subtracting a constant from the data shifts the mean but doesn’t change the variance or standard deviation
Rescaling random variables by a constant
Multiplies the mean by that constant and the variance by the square of that constant
Combining random variables
- The Sum or the difference of two random variables is also a random variable
- the mean of the sum of two random variables is the sum of the means
- the mean of the difference of two random variables is the difference of the means
-If the random variables are independent,
, the variance of their sum or difference is always the sum of the variances *** this is important
Now, what if the two random variables are not independent?
-can you still compute the variance
yes, however you have one additional term here known as covariance
(in this course given definition but not asked to calculate)
if see new random variable
A random variable assumes any of several different numeric values as a result of some random event. Random variables are denoted by a capital letter such as X.
Random variable
assumes a value based on the outcome of a random event
Pause and do screenshot at 1.46 pm
** in class worksheet #1 **
Pause and do screenshot 4/30
** q2 mail-rder **
Bernoulli trials – what are 3 conditions?
- there are two possible outcomes (success and failure)
- the prob of success, p, is constant
- the trials are independent
Examples of Bernoulli trials?
- tossing a coin
- looking for defective products rolling off an assembly line
- shooting free throws in a basketball game
- and many more…
First thing you need to identify in a problem involving Bernoulli
‘what is a trial’
Geometric model – tells us what in terms of Bernoulli trials?
of Bernoulli trials until the first success
What single parameter is specified in the Geometric model
p (probability of success)
Pause and do example of Geometric model [biased coin]
*** screenshot p433
Helpful rules for solving problems on Geometric model
multiplication rule, complement rule
Independence – the 10% condition
Rule that allows us to proceed if we don’t have independent trials in Bernoulli. Still OK to proceed as long as the sample is small than 10% of the population [ex universal blood donors]
Pause do to blood donor practice problem
** screenshot at 4.36pm **
Binomial probability model
No longer counts the trials but counts the number of successes within the fixed number of Bernoulli trials.
Two parameters that define the Binomial model
n, number of trials // p, probability of success
Formulas for mean and s.d. in Binomial model
u = np // s.d. = sqrt (npq)