Discrete Distributions Flashcards
What is a distribution?
the possible values a variable can take and how frequently they occur
What is the probability function?
p(y)
What are probabilities or probability distributions?
measure the likelihood of an outcome
We define distributions with which two characteristics
- Mean & 2. Variance
What character do we use to denote mean?
population: μ - mu
sample: x-bar
What character do we use to denote variance?
population: σ2 - sigma squared
sample: s2 - s squared
What character do we use to denote std deviation?
population: σ - sigma
sample - s
What units are std deviation measured in?
The same units as the mean
What units is variance measured in?
mean units squared
What is a distribution?
A collection of all the possible values a variable can take and how frequently they occur in the sample space.
A function which assigns a probability to each value a variable can take.
The average value of the elements in the data set.
The likelihood of an event occurring.
A collection of all the possible values a variable can take and how frequently they occur in the sample space.
What is the difference between sample data and population data?
Sample data represents the entire data we have, while population data is only some part of it.
The terms sample data and population data are interchangeable and mean a part of the data.
The terms sample data and population data are interchangeable and mean the entire data set.
Sample data represents some part of the data, while population data is the same as all the data.
Sample data represents some part of the data, while population data is the same as all the data.
Which of the following is expressed with the letter s?
Population mean.
Population variance.
Population standard deviation.
Sample mean.
Sample variance.
Sample standard deviation.
Sample standard deviation.
What are the Types of Probability Distributions?
- Discrete Distributions - finite number of outcomes - examples: rolling a die or picking a playing card
- Continuous distributions - infinitely many outcomes - example: Time and distance
What is the notation for distributions?
X ~ N (μ, σ2)
Variable - Tilde - Type - Characteristics in parenthesis (usually mean, variance)
What are some characteristics of Discrete Distributions?
- all outcomes are equally likely - called Equiprobable
They follow a uniform distribution - finitely many distinct outcomes
- can express the entire probability distribution with either a table, graph or formula
- need to ensure every unique outcome has a probability assigned to it
In Probability we are often more interested in the likelihood of an interval than an individual value.
intervals - add up all the values that fall within that range
What is a Bernoulli Distribution?
- Events with only 1 trial and 2 possible outcomes
- Any event with two outcomes can be transformed into a Bernoulli event
– We use ‘Bern’ to describe it: Bern(p) - examples: coin flip, quiz with T or F question, Vote D or R
- usually denote the higher value with p - we assign a value of 1
- usually denote the lower value with 1- p
- The variance of Bernoulli events would always = p(1-p)
What is a Binomial Distribution?
- The outcomes per iteration are two
- Many iterations
- a sequence of identical Bernoulli events
- Notation: B(n,p)
What is the Poisson Distribution?
- Test how unusual an event frequency is for a given interval
- Denoted as Po(lamda)
- deals with the FREQUENCY with which an event occurs within a specific interval
- instead of the probability of an event, it requires knowing how often it occurs for a specific period of time or distance
- graph always starts from zero
example: a firefly lights up 3 times in 10 seconds. What is the likely hood of it lighting up 8 times in 20 sec
What is Euler’s number? Napier’s Constant?
e ~ 2.72
What are characteristics of Continuous Distributions?
The probability distribution graph would be a curve as opposed to unconnected individual bars
What is the Normal Distribution?
outcomes often observed in nature
What is the Chi-Squared? (Kai)
Asymmetric
only consists of non-negative values
always starts with zero on the left
What is the Exponential distribution?
used when dealing with events that are rapidly changing early on
online news example - more relevant when the news is fresh, as times goes on they die off
What is the Logistic distribution?
useful in forecast analysis
useful for determining a cut-off point for a successful outcome
example:
how much of an in-game advantage is necessary to predict victory for either team
predictions would never reach true certainty