CAP Probability and Statistics Flashcards
Negative binomial probably distribution
A distribution that describes the odds of having a number of successful Bernoulli trials before having a given number of failures
Hypergeometric probability distribution
A function describing the probability of drawing a sample of k successes in n draws without replacement from a population with K successes and N samples
Lognormal Distribution
Any distribution whose logarithm is normally distributed
Statistics
Study of collection, analysis, interpretation, and organization of data
Cumulative Distribution Function (CDF)
A function that describes the likelihood of a random variable taking on a value less than or equal to a given value
Binomial Distribution
A function describing the number of successes in independent experiments, defined by number of trials and probability of success in each trial
Arithmetic mean
Measure of central tendency indicated by summing the sample then dividing by the sample size
Weibull probability distribution
A distribution that describes how failure rates change over time
Probability Mass Function (PMF)
A function that describes the relative likelihood of a discrete random variable taking on a given value
Harmonic mean
Measure of central tendency indicated by the reciprocal of the arithmetic mean of the reciprocals; typically used when averaging rates
normal probability distribution
A distribution shaped like a symmetric bell curve, used when sample size is large or the population standard deviation is known
Geometric mean
Measure of central tendency indicated by taking the nth root of the product of the sample; typically used when comparing items that have different properties with different ranges
Bernoulli Probability Distribution
A function where the random variable equals 1 with a probability p and equals 0 with a probability 1-p
exponential probability distribution
A function describing the amount of time that passes between events in a Poisson process
Probability Density Function (PDF)
A function that describes the relative likelihood of a continuous random variable taking on a given value
Gamma probability distribution
A distribution used to model arrival times of multiple entities in a Poisson process, models any process where values are always positive and come from skewed distributions
Variance
The sum of square deviations of all values in the data set divided by the number of values in the data set
Median
Measure of central tendency indicating the value that separates the higher half of a sample from the lower half of the sample
beta probability distribution
A distribution that models the behavior of random variables limited to intervals of finite lengths
Skewness
Measure of asymmetry in a data distribution about its mean
Student’s t-distribution
A distribution used to estimate the mean of a normally distributed population where the sample size is small and the population standard deviation is unknown
Poisson Probability Distribution
A function describing the likelihood of a given number of events occurring in a fixed interval of time or space where events occur independent of each other at a known constant rate
Chi Square Probability Distribution
A distribution used in statistical tests for fitting an observed distribution to a theoretical one, assessing independent of two classification criteria, and confidence interval estimation for standard deviation
Heteroskedasticity
Describes a data set where subpopulations have different variances than the population as a whole
Standard deviation
The square root of variance
Mode
Measure of central tendency indicating the most common value in a sample
Absolute deviation
The difference from each value in the dataset from the arithmetic mean
Geometric probability distribution
A function describing the number of Bernoulli trials needed to reach the first successful trial
Probability
Measure of likelihood of an event occurring
Uniform probability distribution
A function where all intervals of the same length are equally probable
Kurtosis
Measure of sharpness of a peak in a data distribution