Distributions Flashcards
How are Y and y defined?
Y –> Actual outcome of an event
y –> One of the possible outcomes
Ways of writing the likelihood of a particular outcome y:
P (Y = y)
p (y)
What is p(y) called?
Since p(y) expresses the probability of each distinct outcome, we call this:
The probability function
With what do we define distributions?
2 characteristics:
Mean –> Average value –> μ
Variance –> How spread out the data is –> σ^2
How are population and sample data defined?
Population data –> All the data
Sample data –> Just a part of it
How are sample mean and variance denoted?
Sample mean symbol: x̄
Sample variance: s^2 (square)
How is the standard deviation defined/denoted
Standard deviation –> Square root of variance:
√(σ^2)
Formule standaarddeviatie
Standaarddeviatie:
Sx = σ = de standaarddeviatie van getallenreeks x.
Xi = de waarde van getal i in de getallenreeks.
μ = het gemiddelde van de getallenreeks (som getallen / aantal)
Nx = het aantal getallen in de proef.
Formule Standaarddeviatie
σ = Sx = √( ∑ ( (xi - μ)2 / nx) )
Notation for distributions:
Variable name
Tilde sign
Type –> Capital letter
Characteristics (μ, σ^2)
X ~ N (μ, σ^2)
Discrete distribution when all outcomes are equally likely?
Equiprobable –> Uniform distribution
Discrete distribution with only two possible outcomes?
Follow a Bernoulli distribution
Single trial
Discrete distribution when carrying out a similar experiment several times in a row
Binomial Distribution
Two outcomes per iteration
Many iterations –> Multiple trials
Discrete distribution when calculating chance of succes after given an average probability?
Poisson distribution
How unusual is an event frequency for a given interval
Example: There’s 35 points per game. How big is the chance of 12 points in the first quarter of the next game?
Characteristics of normal distribution?
Often observed in nature
Margin values are called outliers
When is Student’s T distribition used?
A small sample approximation of a Normal distribution
It accommodates extreme values much better
Curve has fatter tails than normal distribution
When is Chi-Squared distribition used?
A-symmetric continuous distribution
Only consists of non-negative values
Starts from 0
Often used in hypothesis testing