ST102.2 - Probability Theory Flashcards

1
Q

What notation does capital N and lowercase n equate to?

A
  • Capital N equates to population size.

- Lowercase n equates to sample size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are statistical inferences and the associated assumptions?

A

The statistical inference will allow us to say things like the following about the population.

  • ‘A 95% confidence interval for the population proportion, π, of ‘Yes’ voters is (0.5083, 0.5717).’
  • ‘The null hypothesis that π = 0.50, against the alternative hypothesis that π > 0.50, is rejected at the 5% significance level.’
  • In short, the opinion poll gives statistically significant evidence that ‘Yes’ voters are in the majority among likely voters.

The inferential statements about the opinion poll rely on the following assumptions and results.
- Each response Xi is a realisation of a random variable from a Bernoulli distribution with probability parameter π.
- The responses X1, X2, . . . , Xn are independent of each other.
The sampling distribution of the sample mean (proportion) X¯ has expected value π and variance π(1 − π)/n.
- By use of the central limit theorem, the sampling distribution is approximately a normal distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is each response Xi a realisation of?

A

Each response Xi is a realisation of a random variable from a Bernoulli distribution with probability parameter π

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are X1, X2, . . . , Xn all?

A

The responses X1, X2, . . . , Xn are independent of each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the expected value and variance of the sample mean?

A

The sampling distribution of the sample mean (proportion) X¯ has expected value π and variance π(1 − π)/n.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What will the sampling distribution be if we use the central limit theorem?

A

By use of the central limit theorem, the sampling distribution is approximately a normal distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is probability theory?

A

Probability theory is the branch of mathematics that deals with randomness.

  • Values in a sample are variable. If we collected a different sample we would not observe exactly the same values again.
  • Values in a sample are also random. We cannot predict the precise values which will be observed before we actually collect the sample.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a basic definition of an experiment?

A

An experiment is a process that produces outcomes and which can have several different outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a basic definition of an outcome of an experiment?

A

The outcome of the experiment: for example, rolling a 3. It may be notated by S or Omega or lowercase omega.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the basic definition of sample space?

A

Sample space S: the set of all possible outcomes/elements which are under consideration, here {1, 2, 3, 4, 5, 6}.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the basic definition of an event?

A

Event: any subset A of the sample space, for example, A = {4, 5, 6}.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the basic definition of the probability of an event?

A

The probability of an event, P(A), will be defined as a function that assigns probabilities (real numbers) to events (sets).
- The probability of an outcome A of an experiment is the proportion (relative frequency) of trials in which A would be the outcome if the experiment was repeated a very large number of times under similar conditions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the definition of a set?

A

A set is a collection of elements (also known as ‘members’ of the set).

  • EXAMPLE.
  • A = {Amy, Bob, Sam}.
  • B = {1, 2, 3, 4, 5}.
  • C = {x | x is a prime number} = {2, 3, 5, 7, 11, . . .}.
  • D = {x | x ≥ 0} (that is, the set of all non-negative real numbers).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What notation is used to denote membership or an element of a set?

A
  • x ∈ A means that object x is an element of set A.
  • x ∈/ A means that object x is not an element of set A (with a cross through ∈ to demonstrate that it is not a member of that set).
  • EXAMPLE. If A = {1, 2, 3, 4, 5}, then:
  • 1 ∈ A and 2 ∈ A.
  • 6 ∈/ A and 1.5 ∈/ A.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What notation is used to denote an empty set?

A

The empty set, denoted ∅, is the set with no elements, i.e. x ∈ ∅ / is true for every object x, and x ∈ ∅ is not true for any object x.
- This is an impossible event, therefore P(∅) = 0. The opposite of this would be a certain event where P = 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are venn diagrams?

A

The familiar Venn diagrams help to visualise statements about set.

  • The S (the box which encloses the circles is our sample space).
  • The circles represent the individual events or sets.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is important to remember about venn diagrams when it comes to proofs?

A

However, Venn diagrams are not formal proofs of results in set theory, but they act as visual aid instead.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is meant by a subset?

A

A ⊂ B means that set A is a subset of set B, defined as: A ⊂ B when x ∈ A ⇒ x ∈ B.

  • Hence A is a subset of B if every element of A is also an element of B.
  • Venn diagram depicting a subset, where A ⊂ B: (set A is enclosed within set B).
  • Two sets A and B are equal (A = B) if they have exactly the same elements. This implies that A ⊂ B and B ⊂ A.
  • EXAMPLE. {1, 2, 3} ⊂ {1, 2, 3, 4}, because all elements appear in the larger set
  • EXAMPLE. {1, 2, 5} /⊂ {1, 2, 3, 4}, because the element 5 does not appear in the larger set.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is meant by a union of sets?

A

Unions of sets (‘or’) {Inclusive}

  • The union, denoted ∪, of two sets is: A ∪ B = {x | x ∈ A or x ∈ B}.
  • That is, the set of those elements which belong to A or B (or both).
  • A ∩ B: both A and B happen.
  • EXAMPLE. If A = {1, 2, 3, 4}, B = {2, 3} and C = {4, 5, 6}, then:
  • A ∪ B = {1, 2, 3, 4}
  • A ∪ C = {1, 2, 3, 4, 5, 6}
  • B ∪ C = {2, 3, 4, 5, 6}.
20
Q

What is meant by an intersection of sets?

A

Intersections of sets (‘and’)

  • The intersection, denoted ∩, of two sets is: A ∩ B = {x | x ∈ A and x ∈ B}. (x is such that x is a member of A and x is a member of B).
  • That is, the set of those elements which belong to both A and B.
  • A ∪ B: either A or B happens (or both happen).
  • EXAMPLE. If A = {1, 2, 3, 4}, B = {2, 3} and C = {4, 5, 6}, then:
  • A ∩ B = {2, 3}
  • A ∩ C = {4}
  • B ∩ C = ∅.
21
Q

What can you use as a union and intersection operator as a union and intersection of many sets?

A

Both set operators can also be applied to more than two sets, such as A ∩ B ∩ C.
Concise notation for the unions and intersections of sets A1, A2, . . . , An is:

n
U Ai = A1 ∪ A2 ∪ · · · ∪ An
i=1

and

n
∩ Ai = A1 ∩ A2 ∩ · · · ∩ An.
i=1

These can also be used for an infinite number of sets, i.e. when n is replaced by ∞.

22
Q

What is referred to as the complement (‘not’) of something?

A

If S is referred to as the sample space.
- Then A ⊂ S for every set A we may consider. The complement of A with respect to S is:
A^c = {x | x ∈ S and x ∈/ A}.
- That is, the set of those elements of S that are not in A.
- Therefore it is all but A.
- P(A) = 1 - P(A^c). AND ALSO P(A^c) = 1 − P(A).
- A^c: A does not happen, i.e. something other than A happens.

23
Q

What are the 4 set operators and why are they significant?

A

In proofs and derivations about sets, you can use the following set operators without proof.

  1. Commutativity
  2. Associativity
  3. Distributive laws
  4. De Morgan’s laws
24
Q

What is commutativity?

A

The “Commutative Laws” say we can swap numbers over and still get the same answer
- EXAMPLE. A ∩ B = B ∩ A and A ∪ B = B ∪ A.

25
Q

What is associativity?

A

The “Associative Laws” say that it doesn’t matter how we group the numbers (i.e. which we calculate first) …
- EXAMPLE. A ∩ (B ∩ C) = (A ∩ B) ∩ C and A ∪ (B ∪ C) = (A ∪ B) ∪ C.

26
Q

What are distributive laws?

A

The “Distributive Law” says that multiplying a number by a group of numbers added together is the same as doing each multiplication separately.
- EXAMPLE. A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C)
and A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C).

27
Q

What are De Morgan’s laws?

A

For “De Morgan’s Laws” the complement of the union of two sets is equal to the intersection of their complements and the complement of the intersection of two sets is equal to the union of their complements.
EXAMPLE. (A ∩ B)^c = A^c ∪ B^c
and (A ∪ B)^c = A^c ∩ B^c

28
Q

What is the complement of an empty set?

A

∅^c = S

The complement of an empty set is the sample space

29
Q

What are some further properties of set operations?

A
  • ∅ ⊂ A, A ⊂ A and A ⊂ S.
  • A ∩ A = A and A ∪ A = A.
  • A ∩ A^c = ∅ and A ∪ A^c = S.
  • If B ⊂ A, A ∩ B = B and A ∪ B = A.
  • A ∩ ∅ = ∅ and A ∪ ∅ = A.
  • A ∩ S = A and A ∪ S = S.
  • ∅ ∩ ∅ = ∅ and ∅ ∪ ∅ = ∅.
30
Q

What does disjoint mean?

A

It means the same as mutually exclusive i.e there is not overlap and they do not occur at the same time.

31
Q

What determines if two sets A and B are disjoint or mutually exclusive?

A

Two sets A and B are disjoint or mutually exclusive if:

A ∩ B = ∅ (this does not mean they are mutually exclusive if A ∩ B = 0 as this is to do with probability).

32
Q

What are pairwise disjoint?

A

Sets A1, A2, . . . , An are pairwise disjoint if all pairs of sets from them are disjoint, i.e. Ai ∩ Aj = ∅ for all I /= j.

33
Q

What is a partition?

A

A partition is a set of pairwise Mutually Exclusive and Collectively Exhaustive sets (MECE).
- The sets A1, A2, . . . , An form a partition of the set A if they are pairwise disjoint and if:
n
U Ai = A, that is, A1, A2, . . . , An are collectively exhaustive of A.
i=1

34
Q

EXAMPLE. Suppose that A ⊂ B. Show that A and B ∩ Ac

form a partition of B.

A

We have: A ∩ (B ∩ A^c) = (A ∩ A^c) ∩ B = ∅ ∩ B = ∅ ME
and:
A ∪ (B ∩ A^c) = (A ∪ B) ∩ (A ∪ A^c) = B ∩ S = B. CE
- Hence A and B ∩ Ac are mutually exclusive and collectively exhaustive of B (MECE), and so they form a partition of B.

35
Q

What is probability in terms of axioms?

A

‘Probability’ is formally defined as a function P(·) from subsets (events) of the sample space S onto real numbers. Such a function is a probability function if it satisfies the following axioms (‘self-evident truths’).

36
Q

What is an axiom?

A

Axioms are self-evident truths upon which other probability results may be obtained, we are not proving these axioms but rather they provide us with a framework to derive probability results. They are a statement or proposition on which an abstractly defined structure is based. THEY ARE INVIOLABLE - THEY ARE LIKE LAW.
- EXAMPLE.
- Axiom 1: P(A) ≥ 0 for all events A. [0,1]
- Axiom 2: P(S) = 1.
- Axiom 3: If events A1, A2, . . . are pairwise disjoint (i.e. Ai ∩ Aj = ∅ for all i /= j ), then:
∞ ∞
P (U Ai) = Σ P(Ai)
i=1 i=1

37
Q

What are the 3 axioms?

A

The axioms require that a probability function must always satisfy these requirements.
1. Axiom 1 requires that probabilities are always non-negative. P(A) ≥ 0 for all events A. [0,1]
2. Axiom 2 requires that the outcome is some element from the sample space with certainty (that is, with probability 1). In other words, the experiment must have some outcome. P(S) = 1. Therefore the sample space is = 1.
3. Axiom 3 states that if events A1, A2, . . . are mutually exclusive, the probability of their union is simply the sum of their individual probabilities.
- If events A1, A2, . . . are pairwise disjoint (i.e. Ai ∩ Aj = ∅ for all i /= j ), then:
∞ ∞
P (U Ai) = Σ P(Ai)
i=1 i=1

All other properties of the probability function can be derived from the axioms. We begin by showing that a result like Axiom 3 also holds for finite collections of mutually exclusive sets.

38
Q

What is the proof for empty sets using axioms?

A

For the empty set, ∅, we have: P(∅) = 0.
Proof: Since ∅ ∩ ∅ = ∅ and ∅ ∪ ∅ = ∅, Axiom 3 gives:
P(∅) = P(∅ ∪ ∅ ∪ · · ·) = ∞Σ(i=1)P(∅).
However, the only real number for P(∅) which satisfies this is P(∅) = 0.

39
Q

How can we proove that the sum of a pairwise disjoint past n + 1 is an empty set?

A

If A1, A2, . . . , An are pairwise disjoint, then:
n n
P (U Ai) = Σ P(Ai)
i=1 i=1

Proof: In Axiom 3, set An+1 = An+2 = · · · = ∅, so that:
   ∞            ∞
P (U Ai)  =  Σ P(Ai)
   i=1           i=1
THEREFORE:
   n            ∞                n
Σ P(Ai)  +  Σ P(Ai)   =   Σ P(Ai)
   i=1           i=n + 1      i=1 

since P(Ai) = P(∅) = 0 for i = n + 1, n + 2, . . ..

40
Q

How can we sum probabilities of mutually exclusive sets?

A

P(A) = P(A1) + P(A2) + P(A3).

That is, we can simply sum probabilities of mutually exclusive sets. This is very useful for deriving further results.

41
Q

How can we use prove that the probability of the complement of A is equal to the probability of 1 - P(A)?

A

Proof: We have that A ∪ A^c = S and A ∩ A^c = ∅. MECE.
Therefore using axiom 2 as the probability of the LHS = RHS:
1 = P(S) = P(A ∪ A^c) = P(A) + P(A^c)
using the previous result with with n = 2, A1 = A and A2 = A^c

42
Q

How can you prove that probabilities are between 0 and 1?

A

For any event A, we have: P(A) ≤ 1.
Proof (by contradiction): If it was true that P(A) > 1 for some A, then we would have:
- P(A^c) = 1 − P(A) < 0.
- This violates Axiom 1, so cannot be true.
- Therefore, it must be that P(A) ≤ 1 for all A.
- Putting this and Axiom 1 together, we get:
0 ≤ P(A) ≤ 1 for all events A.

43
Q

How can you prove if A ⊂ B, then P(A) ≤ P(B)?

A

Proof: Partition B as:
B = A ∪ (B ∩ A^c) where the two sets in the union are disjoint.

Therefore:
- P(B) = P(A ∪ (B ∩ A^c)) = P(A) + P(B ∩ A^c) ≥ P(A) since P(B ∩ A^c) ≥ 0.

44
Q

How can you prove P(A ∪ B) = P(A) + P(B) − P(A ∩ B).

A

Proof: Using partitions (dividing into MECE):
- P(A ∪ B) = P(A ∩ B^c) + P(A ∩ B) + P(A^c ∩ B)
- P(A) = P(A ∩ B^c) + P(A ∩ B)
- P(B) = P(A^c ∩ B) + P(A ∩ B)
and hence:
- P(A ∪ B) = (P(A) − P(A ∩ B)) + P(A ∩ B) + (P(B) − P(A ∩ B)) = P(A) + P(B) − P(A ∩ B).

45
Q

What are the summaries of important probability functions?

A

In summary, the probability function has the following properties.

  • P(S) = 1 and P(∅) = 0.
  • 0 ≤ P(A) ≤ 1 for all events A.
  • If A ⊂ B, then P(A) ≤ P(B).

These show that the probability function has the kinds of values we expect of something called a ‘probability’.

  • P(A^c) = 1 − P(A).
  • P(A ∪ B) = P(A) + P(B) − P(A ∩ B).

These are useful for deriving probabilities of new events.

46
Q

What is the difference between empirical probability and theoretical probability?

A

Experimental probability is the probability that is determined on the basis of the results of an experiment repeated many times.
- Calculated as the (Number of times occurred)/(Total number of times experiment performed).

Theoretical probability is the probability that is determined on the basis of reasoning.

47
Q

What is classical probability?

A

Classical probability is a simple special case where values of probabilities can be found by just counting outcomes. This requires that:

  1. The sample space contains only a finite number of outcomes.
  2. All of the outcomes are equally likely.

EXAMPLE of classical probability.

  • Tossing a coin (heads or tails) one or more times.
  • Rolling one or more dice (each scored 1, 2, 3, 4, 5 or 6).
  • Drawing one or more playing cards from a deck of 52 cards.