Statistics & Probability Flashcards
In mathematics, ___ consists of writing a number or another mathematical object as a product of several factors, usually smaller or simpler objects of the same kind.
Factorization or factoring
Is the property of certain operations in mathematics and computer science whereby they can be applied multiple times without changing the result beyond the initial application. The concept of ___ arises in a number of places in abstract algebra (in particular, in the theory of projectors and closure operators) and functional programming (in which it is connected to the property of referential transparency).
The term was introduced by Benjamin Peirce in the context of elements of algebras that remain invariant when raised to a positive integer power, and literally means “(the quality of having) the same power”, from ___ + ___ (same + power).
The natural number 1 is an ___ element with respect to multiplication (since 1×1 = 1), and so is 0 (since 0×0 = 0), but no other natural number is (e.g. 2×2 = 2 does not hold). For the latter reason, multiplication of natural numbers is not an ___ operation.
Idempotence
Is the inverse function to exponentiation. That means the ___ of a given number x is the exponent to which another fixed number, the base b, must be raised, to produce that number x. In the simplest case, the ___ counts the number of occurrences of the same factor in repeated multiplication
Logarithm
A ___ is a mathematical curve that describes a smooth periodic oscillation. A ___ is a continuous wave. It is named after the function ___, of which it is the graph. It occurs often in pure and applied mathematics, as well as physics, engineering, signal processing and many other fields.
Sine wave, sinusoid or sinusoidal
___ is the absence of, or a violation of, symmetry (the property of an object being invariant to a transformation, such as reflection). Symmetry is an important property of both physical and abstract systems and it may be displayed in precise terms or in more aesthetic terms.
The absence of or violation of symmetry that are either expected or desired can have important consequences for a system.
In mathematics, there are no a and b such that a < b and b < a. This form of ___ is an ___ relation.
Asymmetry
In mathematics, an ___ assigns numbers to functions in a way that can describe displacement, area, volume, and other concepts that arise by combining infinitesimal data. ___ is one of the two main operations of calculus, with its inverse operation, differentiation, being the other.
Integral (Integration)
A stochastic model describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event.
Roughly speaking, a process satisfies the ___ property if one can make predictions for the future of the process based solely on its present state just as well as one could knowing the process’s full history, hence independently from such history
Markov chain
In mathematics, the ___ of a positive integer n, denoted by n!, is the product of all positive integers less than or equal to n: n! = n x (n-1) x (n-2) x (n-3) x … x 3 x 2 x 1
For example: 5! = 5 x 4 x 3 x 2 x 1 = 120
Factorial
In mathematics, a ___ is a multiplicative factor in some term of a polynomial, a series, or any expression; it is usually a number, but may be any expression. In the latter case, the variables appearing in the ___ are often called parameters, and must be clearly distinguished from the other variables.
For example, in 7x^2 - 3xy + 1.5 + y the first two terms respectively have the ___ 7 and −3. The third term 1.5 is a constant ___. The final term does not have any explicitly written ___ factor that would not change the term; the ___ is taken to be 1.
Coefficient
In mathematics, a ___ is an expression consisting of variables (also called indeterminates) and coefficients, that involves only the operations of addition, subtraction, multiplication, and non-negative integer exponents of variables. An example of a ___ of a single indeterminate, x, is x2 − 4x + 7. An example in three variables is x3 + 2xyz^2 − yz + 1.
Polynomial
In mathematics, a ___ is a polynomial which is the sum of two monomials. A ___ in a single indeterminate (also known as a univariate ___) can be written in the form ax^m - bx^n where a and b are numbers, and m and n are distinct nonnegative integers and x is a symbol which is called an indeterminate or, for historical reasons, a variable.
Binomial
In mathematics, a ___ is, roughly speaking, a polynomial which has only one term.
Monomial
___ are useful ways to make sense of and tap into the logic and intuition of combinatoric identities
Story proofs
___ is the branch of mathematics concerning numerical descriptions of how likely an event is to occur, or how likely it is that a proposition is true. The ___ of an event is a number between 0 and 1, where, roughly speaking, 0 indicates impossibility of the event and 1 indicates certainty
Mathematics is the logic of certainty; ___ is the logic of uncertainty
Probability
In mathematics, a ___ is a well-defined collection of distinct objects, considered as an object in its own right. For example, the numbers 2, 4, and 6 are distinct objects when considered separately, but when they are considered collectively they form a single ___ of size three, written {2, 4, 6}. The concept of a ___ is one of the most fundamental in mathematics. Developed at the end of the 19th century, ___ theory is now a ubiquitous part of mathematics, and can be used as a foundation from which nearly all of mathematics can be derived.
Set
___ is a branch of mathematical logic that studies ___, which informally are collections of objects. Although any type of object can be collected into a ___, ___ is applied most often to objects that are relevant to mathematics. The language of ___ can be used to define nearly all mathematical objects.
Set theory
In probability theory, the ___ of an experiment or random trial is the set of all possible outcomes or results of that experiment. A ___ is usually denoted using set notation, and the possible ordered outcomes are listed as elements in the set. It is common to refer to a ___ by the labels S, Ω, or U (for “universal set”). The elements of a ___ may be numbers, words, letters, or symbols. They can also be finite, countably infinite, or uncountably infinite.
For example, if the experiment is tossing a coin, the ___ is typically the set {head, tail}, commonly written {H, T}. For tossing two coins, the corresponding ___ would be {(head,head), (head,tail), (tail,head), (tail,tail)}, commonly written {HH, HT, TH, TT}. If the ___ is unordered, it becomes {{head,head}, {head,tail}, {tail,tail}}.
Sample space, also called sample description space, possibility space or event space
In set theory, the ___ of a collection of sets is the set of all elements in the collection. It is one of the fundamental operations through which sets can be combined and related to each other.
Union (denoted by ∪)
In set theory, the ___ of two sets A and B, is the set containing all elements of A that also belong to B (or equivalently, all elements of B that also belong to A), and nothing else.
Intersection (denoted by A ∩ B)
In set theory, the ___ of a set A refers to elements not in A. When all sets under consideration are considered to be subsets of a given set U, the absolute ___ of A is the set of elements in U but not in A. The relative ___ of A with respect to a set B, also termed the difference of sets A and B, written B \ A, is the set of elements in B but not in A.
Complement
___ is any of several theories of sets used in the discussion of the foundations of mathematics. Unlike axiomatic set theories, which are defined using formal logic, ___ is defined informally, in natural language. It describes the aspects of mathematical sets familiar in discrete mathematics (for example Venn diagrams and symbolic reasoning about their Boolean algebra), and suffices for the everyday use of set theory concepts in contemporary mathematics.
Naïve set theory
In combinatorics, the ___ is a basic counting principle (a.k.a. the fundamental principle of counting). Stated simply, it is the idea that if there are A ways of doing something and B ways of doing another thing, then there are A x B ways of performing both actions.
Multiplication rule, rule of product or multiplication principle
___ is when a sampling unit is drawn from a finite population and is returned to that population, after its characteristic(s) have been recorded, before the next unit is drawn.
Sampling with replacement
In ___, each sample unit of the population has only one chance to be selected in the sample. For example, if one draws a simple random sample such that no unit occurs more than one time in the sample.
Sampling without replacement, also know as dependent events.
In probability theory, the ___ concerns the probability that, in a set of n randomly chosen people, some pair of them will have the same ___. By the pigeonhole principle, the probability reaches 100% when the number of people reaches 367 (since there are only 366 possible ___, including February 29). However, 99.9% probability is reached with just 70 people, and 50% probability with 23 people where 1-(365!/(365-23)!/365^23). These conclusions are based on the assumption that each day of the year (excluding February 29) is equally probable for a ___.
Birthday problem or birthday paradox
___ is an interpretation of probability; it defines an event’s probability as the limit of its relative ___ in many trials. Probabilities can be found (in principle) by a repeatable objective process (and are thus ideally devoid of opinion). This interpretation supports the statistical needs of many experimental scientists and pollsters. It does not support all needs, however; gamblers typically require estimates of the odds without experiments.
The development of the ___ account was motivated by the problems and paradoxes of the previously dominant viewpoint, the classical interpretation. In the classical interpretation, probability was defined in terms of the principle of indifference, based on the natural symmetry of a problem, so, e.g. the probabilities of dice games arise from the natural symmetric 6-sidedness of the cube. This classical interpretation stumbled at any statistical problem that has no natural symmetry for reasoning.
Frequentist probability or frequentism
___ is an interpretation of the concept of probability, in which, instead of frequency or propensity of some phenomenon, probability is interpreted as reasonable expectation representing a state of knowledge or as quantification of a personal belief.
The ___ interpretation of probability can be seen as an extension of propositional logic that enables reasoning with hypotheses, that is to say, with propositions whose truth or falsity is unknown. In the ___ view, a probability is assigned to a hypothesis, whereas under frequentist inference, a hypothesis is typically tested without being assigned a probability.
___ probability belongs to the category of evidential probabilities; to evaluate the probability of a hypothesis, the ___ probabilist specifies a prior probability. This, in turn, is then updated to a posterior probability in the light of new, relevant data (evidence). The ___ interpretation provides a standard set of procedures and formulae to perform this calculation.
Bayesian probability
The ___ of a function of a real variable measures the sensitivity to change of the function value (output value) with respect to a change in its argument (input value). ___ are a fundamental tool of calculus. For example, the ___ of the position of a moving object with respect to time is the object’s velocity: this measures how quickly the position of the object changes when time advances.
Derivative (differentiation)
___ is the property of a mathematical relationship or function which means that it can be graphically represented as a straight line. Examples are the relationship of voltage and current across a resistor (Ohm’s law), or the mass and weight of an object. Proportionality implies ___, but ___ does not imply proportionality.
Linearity
In mathematics, two varying quantities are said to be in a relation of ___, if they are multiplicatively connected to a constant, that is, when either their ratio or their product yields a constant. The value of this constant is called the coefficient of ___ or ___ constant.
If the ratio (y/x) of two variables (x and y) is equal to a constant (k = y/x), then the variable in the numerator of the ratio (y) is the product of the other variable and the constant (y = k⋅x). In this case y is said to be directly ___ to x with ___ constant k. Equivalently one may write x = 1/k⋅y, that is, x is directly ___ to y with ___ constant 1/k (= x/y). If the term ___ is connected to two variables without further qualification, generally direct ___ can be assumed.
If the product of two variables (x⋅y) is equal to a constant (k = x⋅y), then the two are said to be inversely ___ to each other with the ___ constant k. Equivalently, both variables are directly ___ to the reciprocal of the respective other with ___ constant k (x = k⋅1/y and y = k⋅1/x).
Proportionality
In mathematics, a (real) ___ is a set of real numbers lying between two numbers, the extremities of the ___. For example, the set of numbers x satisfying 0 ≤ x ≤ 1 is an ___ which contains 0, 1 and all numbers in between. Other examples of ___ are the set of real numbers R, the set of negative real numbers, and the empty set.
Interval
In statistics and quantitative research methodology, a ___ is a set of individuals or objects collected or selected from a statistical population by a defined procedure.
For a ___ to be “good,” it must possess two qualities: it must be large enough to be statistically significant and it must be random.
Sample (n)
In statistics, ___ is a bias in which a sample is collected in such a way that some members of the intended population have a lower sampling probability than others. It results in a ___ sample, a non-random sample of a population (or non-human factors) in which all individuals, or instances, were not equally likely to have been selected. If this is not accounted for, results can be erroneously attributed to the phenomenon under study rather than to the method of sampling.
Sampling bias
In statistics, ___ is a method of sampling from a population which can be partitioned into subpopulations.
In statistical surveys, when subpopulations within an overall population vary, it could be advantageous to sample each subpopulation independently. ___ is the process of dividing members of the population into homogeneous subgroups before sampling. The ___ should define a partition of the population. That is, it should be collectively exhaustive and mutually exclusive: every element in the population must be assigned to one and only one ___. Then simple random sampling or systematic sampling is applied within each ___. The objective is to improve the precision of the sample by reducing sampling error. It can produce a weighted mean that has less variability than the arithmetic mean of a simple random sample of the population.
Stratified (strata, stratum) sampling
A measure of central tendency.
For a data set, the arithmetic ___, also called the mathematical expectation or average, is the central value of a discrete set of numbers: specifically, the sum of the values divided by the number of values. If the data set were based on a series of observations obtained by sampling from a statistical population, the arithmetic ___ is the sample ___ to distinguish it from the ___ of the underlying distribution, the population ___ (denoted mu )
Mean
A measure of central tendency.
In statistics and probability theory, the ___ is the value separating the higher half from the lower half of a data sample, a population or a probability distribution. For a data set, it may be thought of as the “middle” value. For example, the basic advantage of the ___ in describing data compared to the mean (often simply described as the “average”) is that it is not skewed so much by a small proportion of extremely large or small values, and so it may give a better idea of a “typical” value. For example, in understanding statistics like household income or assets, which vary greatly, the mean may be skewed by a small number of extremely high or low values. ___ income, for example, may be a better way to suggest what a “typical” income is. Because of this, the ___ is of central importance in robust statistics, as it is the most resistant statistic, having a breakdown point of 50%: so long as no more than half the data are contaminated, the ___ will not give an arbitrarily large or small result.
Median
A measure of central tendency.
The ___ of a set of data values is the value that appears most often. If X is a discrete random variable, the ___ is the value x (i.e, X = x) at which the probability mass function takes its maximum value. In other words, it is the value that is most likely to be sampled.
Like the statistical mean and median, the ___ is a way of expressing, in a (usually) single number, important information about a random variable or a population. The numerical value of the ___ is the same as that of the mean and median in a normal distribution, and it may be very different in highly skewed distributions.
The ___ is not necessarily unique to a given discrete distribution, since the probability mass function may take the same maximum value at several points x1, x2, etc. The most extreme case occurs in uniform distributions, where all values occur equally frequently.
Mode
A measure of dispersion.
In statistics, the ___ is a measure of the amount of variation or dispersion of a set of values. A low ___ indicates that the values tend to be close to the mean (also called the expected value) of the set, while a high ___ indicates that the values are spread out over a wider range.
___ is most commonly represented in mathematical texts and equations by the lower case Greek letter sigma σ, for the population ___, or the Latin letter s, for the sample ___.
The ___ of a random variable, statistical population, data set, or probability distribution is the square root of its variance. It is algebraically simpler, though in practice less robust, than the average absolute deviation. A useful property of the ___ is that, unlike the variance, it is expressed in the same units as the data.
The ___ of a random variable, sample, statistical population, data set, or probability distribution is the square root of its variance.
Standard deviation, abbreviated to SD
The ___ of a statistic (usually an estimate of a parameter) is the standard deviation of its sampling distribution or an estimate of that standard deviation. If the parameter or the statistic is the mean, it is called the ___ of the mean (SEM).
The sampling distribution of a population mean is generated by repeated sampling and recording of the means obtained. This forms a distribution of different means, and this distribution has its own mean and variance. Mathematically, the variance of the sampling distribution obtained is equal to the variance of the population divided by the sample size. This is because as the sample size increases, sample means cluster more closely around the population mean.
Therefore, the relationship between the ___ and the standard deviation is such that, for a given sample size, the ___ equals the standard deviation divided by the square root of the sample size. In other words, the ___ of the mean is a measure of the dispersion of sample means around the population mean.
Standard error (SE)
For any nonnegative integers k and n, the ___ (n over k) , read as ‘’ n choose k ‘’, is the number of subsets of size k for a set of size n.
Binomial coefficient
___ is the study of mathematical structures that are fundamentally ___ rather than continuous. In contrast to real numbers that have the property of varying “smoothly”, the objects studied in ___ – such as integers, graphs, and statements in logic – do not vary smoothly in this way, but have distinct, separated values. ___ therefore excludes topics in “continuous mathematics” such as calculus or Euclidean geometry. ___ objects can often be enumerated by integers. More formally, ___ has been characterized as the branch of mathematics dealing with countable sets (finite sets or sets with the same cardinality as the natural numbers). However, there is no exact definition of the term “___.” Indeed, ___ is described less by what is included than by what is excluded: continuously varying quantities and related notions.
Discrete mathematics
In combinatorics (combinatorial mathematics), the ___ is a counting technique which generalizes the familiar method of obtaining the number of elements in the union of two finite sets; symbolically expressed as |A union B| = |A| + |B| - |A intersect B| where A and B are two finite sets and |S| indicates the cardinality of a set S (which may be considered as the number of elements of the set, if the set is finite). The formula expresses the fact that the sum of the sizes of the two sets may be too large since some elements may be counted twice. The double-counted elements are those in the intersection of the two sets and the count is corrected by subtracting the size of the intersection.
Inclusion–exclusion principle
In statistics, ___ is any statistical relationship, whether causal or not, between two random variables or bivariate data. In the broadest sense ___ is any statistical association, though it commonly refers to the degree to which a pair of variables are linearly related. Familiar examples of dependent phenomena include the ___ between the physical statures of parents and their offspring, and the ___ between the price of a good and the quantity the consumers are willing to purchase, as it is depicted in the so-called demand curve.
Correlation or dependence
Mathematical notation uses a symbol that compactly represents summation of many similar terms: the summation symbol, an enlarged form of the upright capital Greek letter ___.
Capital-sigma notation
The product of a sequence of factors can be written with the product symbol, which derives from the capital letter ___ in the Greek alphabet
Capital pi notation
In mathematics, the ___ of a real number x, denoted |x|, is the non-negative value of x without regard to its sign. Namely, |x| = x if x is positive, and |x| = −x if x is negative (in which case −x is positive), and |0| = 0.
Absolute value or modulus
In mathematics, the ___ of a set is a measure of the “number of elements of the set”. For example, the set A={2,4,6} contains 3 elements, and therefore A has a ___ of 3. Beginning in the late 19th century, this concept was generalized to infinite sets, allowing to distinguish several stages of infinity, and to perform arithmetic on them. There are two approaches to ___ – one which compares sets directly using bijections and injections, and another which uses ___ numbers. The ___ of a set is also called its size, when no confusion with other notions of size is possible.
Cardinality
In mathematics, a ___ is a relation between sets that associates to every element of a first set exactly one element of the second set. Typical examples are ___ from integers to integers or from the real numbers to real numbers.
___ were originally the idealization of how a varying quantity depends on another quantity. For example, the position of a planet is a ___ of time. Historically, the concept was elaborated with the infinitesimal calculus at the end of the 17th century, and, until the 19th century, the ___ that were considered were differentiable (that is, they had a high degree of regularity). The concept of ___ was formalized at the end of the 19th century in terms of set theory, and this greatly enlarged the domains of application of the concept.
A ___ is a process or a relation that associates each element x of a set X, the domain of the ___, to a single element y of another set Y (possibly the same set), the codomain of the ___. If the ___ is called f, this relation is denoted y = f (x) (which is spoken aloud as f of x), the element x is the argument or input of the ___, and y is the value of the ___, the output, or the image of x by f. The symbol that is used for representing the input is the variable of the ___ (one often says that f is a ___ of the variable x).
Function
In mathematics, a ___ is a value of a continuous quantity that can represent a distance along a line. The adjective ___ in this context was introduced in the 17th century by René Descartes, who distinguished between ___ and imaginary roots of polynomials. The ___ include all the rational numbers, such as the integer −5 and the fraction 4/3, and all the irrational numbers, such as √2 (1.41421356…, the square root of 2, an irrational algebraic number). Included within the irrationals are the transcendental numbers, such as π (3.14159265…). In addition to measuring distance, ___ can be used to measure quantities such as time, mass, energy, velocity, and many more.
Real number
In mathematics, the ___ are all the real numbers which are not rational numbers, the latter being the numbers constructed from ratios (or fractions) of integers. When the ratio of lengths of two line segments is an ___ number, the line segments are also described as being incommensurable, meaning that they share no “measure” in common, that is, there is no length (“the measure”), no matter how short, that could be used to express the lengths of both of the two given segments as integer multiples of itself.
Irrational numbers
In mathematics, a ___ is a complex number that is not an algebraic number—that is, not a root (i.e., solution) of a nonzero polynomial equation with integer coefficients. The best-known ___ are π and e.
Transcendental number
___ is a type of bias that occurs in ___ academic research. It occurs when the outcome of an experiment or research study influences the decision whether to ___ or otherwise distribute it. ___ only results that show a significant finding disturbs the balance of findings, and inserts bias in favor of positive results.
Publication or significance bias
___ are non-numerical variables. Their values aren’t represented with numbers because their values are represented with words.
Categorical variables
__ are numerical variables.
Quantitative variables
The lowercase letter ___ (μ) is used as a special symbol in many academic fields.
Mu
A measure of dispersion.
In probability theory and statistics, ___ is the expectation of the squared deviation of a random variable from its mean. Informally, it measures how far a set of numbers is spread out from their average value.
It can be measured as the population ___ (sigma squared) or the sample ___ (capital S squared), which can be a biased sample ___ or an unbiased sample ___.
The ___, is defined as the sum of the squared distances of each term in the distribution from the mean (μ), divided by the number of terms in the distribution (N).
Variance
A measure of dispersion.
In statistics, the ___ of a set of data is the difference between the largest and smallest values. It can give you a rough idea of how the outcome of the data set will be before you look at it actually. Difference here is specific, the ___ of a set of data is the result of subtracting the smallest value from largest value.
Range
In statistics, an ___ is a data point that differs significantly from other observations. An ___ may be due to variability in the measurement or it may indicate experimental error; the latter are sometimes excluded from the data set. An ___ can cause serious problems in statistical analyses.
The interquartile range is often used to find ___ in data. ___ here are defined as observations that fall below Q1 − 1.5 IQR or above Q3 + 1.5 IQR.
Outlier
In statistics, a ___ is a set of similar items or events which is of interest for some question or experiment. A statistical ___ can be a group of existing objects (e.g. the set of all stars within the Milky Way galaxy) or a hypothetical and potentially infinite group of objects conceived as a generalization from experience (e.g. the set of all possible hands in a game of poker). A common aim of statistical analysis is to produce information about some chosen ___.
Population (N)
In probability theory and statistics, ___ describes the probability of an event, based on prior knowledge of conditions that might be related to the event. For example, if the risk of developing health problems is known to increase with age, ___ allows the risk to an individual of a known age to be assessed more accurately than simply assuming that the individual is typical of the population as a whole.
___ is stated mathematically as the following equation: P(A|B) = P(B|A) * P(A) / P(B) or posterior probability = likelihood * marginal likelihood / prior probability.
Bayes’ theorem (alternatively Bayes’s law or Bayes’s rule)
___ is a method of statistical inference in which Bayes’ theorem is used to update the probability for a hypothesis as more evidence or information becomes available.
Bayesian inference
In probability theory and statistics, ___ is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The ___ value can be positive, zero, negative, or undefined.
For a unimodal distribution, negative ___ commonly indicates that the tail is on the left side of the distribution, and positive ___ indicates that the tail is on the right. In cases where one tail is long but the other tail is fat, ___ does not obey a simple rule. For example, a zero value means that the tails on both sides of the mean balance out overall; this is the case for a symmetric distribution, but can also be true for an asymmetric distribution where one tail is long and thin, and the other is short but fat.
Skewness
In descriptive statistics, the ___, is a measure of statistical dispersion, being equal to the difference between 75th and 25th percentiles, or between upper and lower quartiles, ___ = Q3 − Q1. In other words, the ___ is the first quartile subtracted from the third quartile; these quartiles can be clearly seen on a box plot on the data. It is a trimmed estimator, defined as the 25% trimmed range, and is a commonly used robust measure of scale.
Interquartile range (IQR), also called the midspread, middle 50%, or H‑spread
In statistics, the ___, is a shorthand used to remember the percentage of values that lie within a band around the mean in a normal distribution with a width of two, four and six standard deviations, respectively; ___ of the values lie within one, two and three standard deviations of the mean, respectively.
68–95–99.7 rule, also known as the empirical rule (more precisely, 68.27%, 95.45% and 99.73%)
___ is the discipline that concerns the collection, organization, analysis, interpretation and presentation of data. In applying ___ to a scientific, industrial, or social problem, it is conventional to begin with a ___ population or a ___ model to be studied. Populations can be diverse groups of people or objects such as “all people living in a country” or “every atom composing a crystal”. ___ deals with every aspect of data, including the planning of data collection in terms of the design of surveys and experiments.
Statistics
In probability theory, the ___ is a theorem that describes the result of performing the same experiment a large number of times. According to the law, the average of the results obtained from a large number of trials should be close to the expected value (theoretical/classical probability) and will tend to become closer to the expected value as more trials are performed.
Law of large numbers (LLN)
The ___ of an event is the ratio of the number of outcomes in which a specified event occurs to the total number of trials, not in a theoretical sample space but in an actual experiment. In a more general sense, ___ estimates probabilities from experience and observation.
Empirical probability, relative frequency, or experimental probability
The ___ is the ratio of the number of cases favorable to it, to the number of all cases possible when nothing leads us to expect that any one of these cases should occur more than any other, which renders them, for us, equally possible.
Classical definition or interpretation of probability
In statistics, ___ is the number of standard deviations by which the value of a raw score (i.e., an observed value or data point) is above or below the mean value of what is being observed or measured. Raw scores above the mean have positive ___, while those below the mean have negative ___.
It is calculated by subtracting the population mean from an individual raw score and then dividing the difference by the population standard deviation. This process of converting a raw score into a ___ is called standardizing or normalizing (however, “normalizing” can refer to many types of ratios).
The standard score or the z-score
In combinatorics, the ___ is a basic counting principle. Stated simply, it is the idea that if we have A ways of doing something and B ways of doing another thing and we can not do both at the same time, then there are A + B ways to choose one of the actions.
Rule of sum, addition rule or addition principle
In logic and probability theory, two events (or propositions) are ___ if they cannot both occur at the same time. A clear example is the set of outcomes of a single coin toss, which can result in either heads or tails, but not both.
Mutually exclusive or disjoint
In probability theory, the ___ of a random variable X, denoted E(X) or E[X], is a generalization of the weighted average, and is intuitively the arithmetic mean of a large number of independent realizations of X. ___ is also a key concept in economics, finance, and many other subjects.
Expected value also known as the expectation, mathematical expectation, mean, average, or first moment
___ is a fundamental notion in probability theory, as in statistics and the theory of stochastic processes.
Two events are ___ if the occurrence of one does not affect the probability of occurrence of the other (equivalently, does not affect the odds). Similarly, two random variables are ___ if the realization of one does not affect the probability distribution of the other.
Independence, independent, statistically independent, or stochastically independent
In probability theory and statistics, the ___ with parameters n and p is the discrete probability distribution of the number of successes in a sequence of n independent experiments, each asking a yes–no question, and each with its own Boolean-valued outcome: success/yes/true/one (with probability p) or failure/no/false/zero (with probability q = 1 − p). A single success/failure experiment is also called a Bernoulli trial or Bernoulli experiment, and a sequence of outcomes is called a Bernoulli process; for a single trial, i.e., n = 1, the ___ is a Bernoulli distribution. The ___ is the basis for the popular binomial test of statistical significance.
Binomial distribution
In mathematics, a ___ of a set is, loosely speaking, an arrangement of its members into a sequence or linear order, or if the set is already ordered, a rearrangement of its elements. The word “___” also refers to the act or process of changing the linear order of an ordered set.
___ differ from combinations, which are selections of some members of a set regardless of order.
The exact number of ___ is 3! = 3 x 2 x 1 = 6. The number gets extremely large as the number of items (n) goes up.
In a similar manner, the number of arrangements of r items from n objects is consider a partial ___. It is written as nPr (which reads “n ___ r”), and is equal to the number n(n-1) … (n-r+1), also written as n!/(n-r)!
Permutation, or permute
In mathematics, a ___ is a selection of items from a collection, such that the order of selection does not matter (unlike permutations). For example, given three fruits, say an apple, an orange and a pear, there are three ___ of two that can be drawn from this set: an apple and a pear; an apple and an orange; or a pear and an orange. More formally, a k-___ of a set S is a subset of k distinct elements of S. If the set has n elements, the number of k-___ is equal to the binomial coefficient which can be written using factorials as n!/k!(n-k)!
Combination
___, is a probability distribution that is symmetric about the mean, showing that data near the mean are more frequent in occurrence than data far from the mean. In graph form, ___ will appear as a bell curve.
For a ___, 68% of the observations are within +/- one standard deviation of the mean, 95% are within +/- two standard deviations, and 99.7% are within +- three standard deviations.
Normal distribution, also known as the Gaussian distribution
In probability theory, the ___ establishes that, in many situations, when independent random variables are added, their properly normalized sum tends toward a normal distribution (informally a bell curve) even if the original variables themselves are not normally distributed. The theorem is a key concept in probability theory because it implies that probabilistic and statistical methods that work for normal distributions can be applicable to many problems involving other types of distributions.
Central limit theorem (CLT)
The ___, is a classical problem in probability theory. One of the famous problems that motivated the beginnings of modern probability theory in the 17th century, it led Blaise Pascal to the first explicit reasoning about what today is known as an expected value.
The problem concerns a game of chance with two players who have equal chances of winning each round. The players contribute equally to a prize pot, and agree in advance that the first player to have won a certain number of rounds will collect the entire prize. Now suppose that the game is interrupted by external circumstances before either player has achieved victory. How does one then divide the pot fairly? It is tacitly understood that the division should depend somehow on the number of rounds won by each player, such that a player who is close to winning will get a larger part of the pot. But the problem is not merely one of calculation; it also involves deciding what a “fair” division actually is.
Problem of points, also called the problem of division of the stakes
The number __, is a mathematical constant approximately equal to 2.71828, and can be characterized in many ways. It is the base of the natural logarithm. It is the limit of (1 + 1/n)^n as n approaches infinity, an expression that arises in the study of compound interest. It can also be calculated as the sum of the infinite series.
e, known as Euler’s number