Probability & Statistics Flashcards

1
Q

In plain words, what are descriptive statistics and what do you lose by using them?

A

Descriptive statistics are the numbers and calculations we use to summarize raw data, which always implies some loss of nuance or detail.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What do inferential statistics allow us to do?

A

We can use data from the “known world” to make informed inferences about the “unknown world.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does the scientific method dictate about the variables when testing a hypothesis?

A

That the variable of interest should be the only thing that differs between the experimental group and the control group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the main problem with the mean (average)?

A

It is sensitive to outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When will the mean and median be similar?

A

When the distribution has no serious outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the benefits of the median and quartiles/percentiles/etc?

A

They describe where a particular observation lies compared with everyone else.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the difference between an absolute figure and a relative figure?

A

Absolute figures can usually be interpreted without any context or additional information. A “relative” value or figure has meaning only in comparison to something else.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

In informal terms, what is the standard deviation?

A

It is a measure of how dispersed the data are from their mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the key difference between the distribution of weights of airplane passengers and the same from professional marathon runners when both have the same mean?

A

The weights of the two groups have roughly the same “middle,” but the airline passengers have far more dispersion around that midpoint, meaning that their weights are spread farther from the midpoint.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What do we know about the proportion of observations in a normal distribution?

A

We know (by definition) exactly what proportion of the observations in a normal distribution lie within one standard deviation of the mean (68.2 percent), within two standard deviations of the mean (95.4 percent), within three standard deviations (99.7 percent), and so on.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a simple explanation for an index?

A

It is a descriptive statistic made up of other descriptive statistics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a histogram?

A

A histogram is an approximate representation of the distribution of numerical data that uses bins to divide the range of values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the advantages of using a curve to represent the distribution of data over a histogram?

A

1) The curve allows you to estimate the probability of a value that wasn’t observed.
2) The curve is not limited by the width of the bins.
3) If we have limited resources, the approximate curve (based on the mean & st.dev of the data we collected) is usually good enough.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why is the distribution of the height of male babies much narrower than the one of male adults?

A

Because there are more possible measurements for adult males’ heights. The more options there are for height, the less likely any specific measurement will be one of them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is needed to draw a normal distribution?

A

The mean (to center the distribution) and the standard deviation (to give it width).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a random variable?

A

Random variables are ways to map outcomes of random processes to numbers. Basically, you are quantifying the outcomes.

Example: Y = sum of upward face after rolling seven dice.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How are random variables different from traditional variables?

A

With traditional variables, you can solve for them or assign them values.
With random variables it makes more sense to talk about the probability of an outcome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the difference between discrete and continuous RVs?

A

Discrete: they take distinct, separate values. They are countable.
Continuous: they can take any value in an interval.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

¿Qué es un modelo probabilístico?

A

Es una descripción cuantitativa de una situación, un fenómeno o un experimento cuyo resultado es incierto.

20
Q

¿Qué dos pasos clave son necesarios para plantear un modelo probabilístico?

A

Describir los posibles resultados del experimento, mediante un espacio muestral.
Describir las creencias sobre la probabilidad de los posibles resultados (le asignamos probabilidades a los resultados).

21
Q

¿Qué son los axiomas? Dé un ejemplo de uno.

A

Son las propiedades básicas que las probabilidades deben satisfacer.

Por ejemplo, las probabilidades no pueden ser negativas.

22
Q

¿Cuáles son las tres propiedades principales que deben cumplir los elementos (eventos) de un espacio muestral de un experimento?

A

Deben ser mutuamente excluyentes. Solo puede haber un resultado al final de un experimento.
Deben ser colectivamente exhaustivas. Juntos, todos los elementos del conjunto agotan todos los resultados posibles del experimento.
Deben tener la granularidad correcta. Los resultados “físicamente” diferentes deben corresponder a diferentes puntos en el espacio muestral. Por “diferentes” se refiere a diferentes en los aspectos relevantes, pero no necesariamente diferentes en aspectos irrelevantes.

23
Q

¿Qué tipo de diagrama es útil para describir un experimento con múltiples etapas?

A

Una descripción secuencial, comunmente conocido como un diagrama de árbol.

24
Q

En un espacio muestral continuo ¿cómo deben plantearse los eventos probables?

A

Deben plantearse como conjuntos. La probabilidad de que un evento individual ocurra es cero, pero la de un conjunto es positiva.

25
Q

¿Cuáles son los axiomas que las probabilidades deben cumplir?

A

No pueden ser negativas. Deben ser mayores o iguales a cero.
Normalización. Si el subconjunto (evento) es todo el espacio muestral, su probabilidad debe ser igual a uno (1).
Aditividad. Si la intersección de A y B es cero, entonces la probabilidad de la unión entre A y B es la suma de sus probabilidades individuales.

26
Q

What is classical probability and how is it different from the other two types of probability?

A

It is defined as the number of ways (outcomes) the event can occur, divided by the total number of outcomes in the sample space.

It is different from empirical probability and subjective probability in that it uses sample spaces instead of frequency distributions or personal knowledge, respectively.

27
Q

What is a probability experiment?

A

A probability experiment is a chance process that leads to well-defined outcomes or results.

28
Q

What is a trial of a probability experiment?

A

A single trial of a probability experiment means to perform the experiment one time.

29
Q

What is an outcome of a probability experiment?

A

An outcome of a probability experiment is the result of a single trial of a probability experiment.

30
Q

What is a sample space?

A

The set of all outcomes of a probability experiment is called a sample space.

31
Q

What does random mean and how does it relate to outcomes of probability experiments?

A

A random process means you cannot predict with certainty which outcome will occur when the experiment is conducted.

Each outcome of a probability experiment occurs at random.

Also, each outcome of the experiment is equally likely unless otherwise stated.

32
Q

What is an event? What kinds of events are there? Are events trials? Explain.

A

An event usually consists of one or more outcomes of the sample space.

An event with one outcome is called a simple event.
When an event consists of two or more outcomes, it is called a compound event.

Simple and compound events should not be confused with the number of times the experiment is repeated.

33
Q

What is the formula for determining the classical probability of an event E?

A

The number of outcomes contained in event E divided by the total number of outcomes in the sample space.

34
Q

What is the complement of event E?

A

E ̅ (E bar) is called the complement of event E and consists of the outcomes in the sample space which are not outcomes of event E.

35
Q

What is empirical probability and when is it useful? What is another name for it?

A

Probabilities can be computed for situations that do not use sample spaces. In such cases, frequency distributions are used, and the probability is called empirical probability.

Empirical probability is sometimes called relative frequency probability.

36
Q

What is the formula for calculating empirical probabilities?

A

The frequency with which the event is observed divided by the sum of all the frequencies observed.

37
Q

What is the law of large numbers?

A

As the number of identically distributed, randomly generated variables increases, their sample mean (average) approaches their theoretical mean.

38
Q

What two devices can be used to represent sample spaces?

A

Tree diagrams and tables.

39
Q

What does it mean for two events to be mutually exclusive?

A

It means they cannot occur at the same time.

40
Q

What rule do you use to calculate the probability of two or more mutually exclusive events occurring? Explain it.

A

The addition rule 1: add the probabilities of each individual event.

41
Q

What rule do you use to calculate the probability of two or more NOT mutually exclusive events occurring? Explain it.

A

The addition rule 2: add the probabilities of each individual event and subtract the probabilities of their intersections.

42
Q

What are independent events?

A

Two events, A and B, are said to be independent if the fact that event A occurs does not affect the probability that event B occurs.

43
Q

What are dependent events?

A

When the occurrence of the first event in some way changes the probability of the occurrence of the second event, the two events are said to be dependent.

44
Q

For two independent events A and B, what is the probability of A and B occurring [P(A and B)]?

A

You multiply their individual probabilities.

The word *and is the key: it means that both events occur in sequence and you must multiply their probabilities.

45
Q

When two events are dependent, how do you calculate the probability of both events occurring?

A

P(A and B) = P(A) * P(B|A)