CHAPTER THREE Bayesian Decision Theory Flashcards
What is deductive reasoning?
A logical process where if the premises are accepted, the conclusion must also be accepted.
Provide an example of a valid but factually incorrect syllogism.
Plants are good for you; tobacco is a plant; ergo tobacco is good for you.
Who is credited with the idea of deductive reasoning?
Aristotle.
What are the two strong syllogisms described by E. T. Jaynes?
- If A is true, then B is true; A is true; therefore B is true.
- If A is true, then B is true; B is false; therefore A is false.
What does A ∨ B represent in Boolean algebra?
At least one of A and B is true (disjunction).
What does A ∧ B mean in Boolean algebra?
Both A and B are true (conjunction).
What is the implication represented by A → B?
If A is true, then B is true.
What is the output of an AND gate?
The output is true if both inputs are true.
What does an OR gate do?
It outputs true if at least one of its inputs is true.
What is a NOT gate’s function?
It inverts its input: the output is true when the input is false, and false when the input is true.
What are NAND gates capable of?
They can be used to create all other types of gates.
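A minimal sketch (not from the chapter) of how AND, OR, and NOT can all be built out of NAND alone, which is what makes NAND universal:

```python
# Building AND, OR and NOT purely out of NAND.

def NAND(a: bool, b: bool) -> bool:
    return not (a and b)

def NOT(a: bool) -> bool:
    return NAND(a, a)            # NAND of a signal with itself inverts it

def AND(a: bool, b: bool) -> bool:
    return NOT(NAND(a, b))       # invert NAND to recover AND

def OR(a: bool, b: bool) -> bool:
    return NAND(NOT(a), NOT(b))  # De Morgan: a OR b == NOT(NOT a AND NOT b)

# Quick truth-table check
for a in (False, True):
    for b in (False, True):
        assert AND(a, b) == (a and b)
        assert OR(a, b) == (a or b)
```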
What is the limitation of classical logic?
It deals only with absolute truths (ones and zeros).
What does Bayesian reasoning allow us to do?
It helps us deal with probabilities and uncertainties.
What is Bayes’ theorem used for?
It provides a mathematical framework for updating beliefs based on new evidence.
What is the difference between frequentist and Bayesian approaches in decision theory?
Frequentism defines probability as long-run frequency, so it cannot assign a probability to a single, one-off event; decision theory therefore has to use Bayesian probabilities, understood as degrees of belief.
In the lottery example, what is the prior probability of winning with one ticket?
1 in 131,115,985.
What happens when the box beeps while testing a lottery number?
You must consider the likelihood ratio and the prior probability to update your belief.
What is the likelihood ratio in the beeping box example?
4:1.
What is the new posterior probability after the box beeps?
4:131,115,985.
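A small sketch of that update in odds form, posterior odds = prior odds × likelihood ratio; the only inputs are the figures quoted in these cards (a 1-in-131,115,985 prior and a 4:1 likelihood ratio):

```python
# Odds-form Bayes update for the beeping-box example:
# posterior odds = prior odds * likelihood ratio.

prior_win, prior_lose = 1, 131_115_985 - 1   # one winning combination
likelihood_ratio = 4                          # the box is 4x likelier to beep for a winner

posterior_win = prior_win * likelihood_ratio
posterior_lose = prior_lose                   # the losing side of the odds is unchanged

posterior_prob = posterior_win / (posterior_win + posterior_lose)
print(f"Posterior odds: {posterior_win}:{posterior_lose}")
print(f"Posterior probability: {posterior_prob:.10f}")  # roughly 1 in 32.8 million
```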
What does Bayesian reasoning rely on?
It combines prior information with new data to form a revised understanding.
Fill in the blank: Bayesian logic allows us to deal with ____ in reasoning.
shades of gray.
True or False: Bayesian reasoning is only applicable in scientific contexts.
False.
What is the probability that your ticket is the winner once the box has beeped?
1/32,778,996
This represents a very small chance in a probabilistic context.
How many wrong combinations does the box beep for on average?
8,194,749 wrong combinations
This indicates the challenge of identifying the correct combination amidst many false positives.
How many times must you run a ticket through the box for it to be likely the right one?
Fourteen times
This emphasizes the improbability of identifying the correct ticket easily.
What concept in thermodynamics is compared to Bayes’ theorem?
Carnot engine
The Carnot engine represents the most efficient theoretical model for heat engines.
What does Bayes’ theorem help approximate in decision-making?
It is the theoretical ideal that real-world decision-making under uncertainty can only approximate.
Decisions under uncertainty are better when they approximate Bayes’ theorem.
What does E. T. Jaynes argue about Bayes’ theorem?
It allows for plausible reasoning beyond Aristotelian logic
Jaynes emphasizes that using probabilities can extend logical reasoning.
What is an example of a logical conjunction represented in Bayesian terms?
p(A ∧ B) = p(A) × p(B | A), the product rule.
This shows how the probability of two events occurring together can be expressed in Bayesian logic.
What does the likelihood ratio indicate in the context of wet pavements?
It tells how much to update beliefs about rain based on evidence
The likelihood ratio quantifies how much more plausible a hypothesis becomes given new evidence.
What is the prior probability of rain at a certain time of year in the example?
33 percent
This serves as the baseline probability before considering additional evidence.
What is the posterior probability of it having rained given wet pavements?
66 percent
This result derives from applying Bayes’ theorem with prior probabilities and likelihoods.
What is Cromwell’s rule in Bayesian decision theory?
Never assign probabilities of one or zero to anything except logical truths
This rule encourages keeping an open mind to possibilities.
What happens to posterior probability if the prior is set to zero?
The posterior probability remains zero
This illustrates the issue of being too certain about initial beliefs.
How are odds calculated from probabilities?
By dividing the probability by 1 minus the probability
This transformation highlights the relationship between probabilities and odds.
What is the odds representation of a probability of 0.9?
9:1
This indicates a strong likelihood in favor of the event occurring.
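A short sketch of the probability-to-odds conversion described above, including why a probability of exactly 1 breaks it:

```python
# Converting between probabilities and odds.

def prob_to_odds(p: float) -> float:
    return p / (1 - p)          # grows without bound as p approaches 1

def odds_to_prob(odds: float) -> float:
    return odds / (1 + odds)

print(prob_to_odds(0.9))        # 9.0  -> "9 to 1"
print(prob_to_odds(0.999999))   # ~999,999 to 1
# prob_to_odds(1.0) would divide by zero: certainty corresponds to infinite odds.
```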
What is a critical distinction between probabilities of 1 and 0.999999?
A probability of 1 corresponds to odds of infinity to one, whereas 0.999999 corresponds to odds of 999,999 to one.
This distinction shows the mathematical implications of assigning absolute certainty.
Why should you avoid assigning a probability of one?
Because it implies absolute certainty, which is unrealistic
Real-world events often have uncertainties that should be accounted for.
What is the implication of assigning a probability of zero or one?
Once a hypothesis is given probability zero or one, no amount of subsequent evidence can ever move it, which is why you should never assign zero or one to anything but logical truths.
This means acknowledging that while some events are extremely unlikely, they are not impossible.
What does a very small probability indicate?
An event that is extremely unlikely but still possible; a tiny probability should not be confused with impossibility.
For example, a one-in-a-quadrillion chance exists but is extremely unlikely.
What is the conservation of expected evidence?
You cannot expect new evidence, on average, to push your belief in a chosen direction: if finding a piece of evidence would raise your credence in a hypothesis, then failing to find it must lower it, so the absence of expected evidence counts against the hypothesis.
This is a principle in Bayesian decision theory.
How does the absence of expected evidence affect belief?
If you expect to see evidence and do not find it, your belief should shift significantly in the opposite direction.
For example, if you expect to see evidence of wrongdoing and do not, your belief in the wrongdoing decreases.
What is the relationship between expected evidence and posterior probability?
The more strongly you expect something, the less your posterior probability changes when you find it.
Conversely, unexpected evidence causes a more significant shift in belief.
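A numerical check of the principle with made-up probabilities: whatever numbers you pick, the prior equals the probability-weighted average of the two possible posteriors, so no experiment can be expected in advance to move belief in only one direction:

```python
# Conservation of expected evidence, with illustrative (made-up) numbers.

p_h = 0.3                      # prior belief in hypothesis H
p_e_given_h = 0.8              # chance of seeing evidence E if H is true
p_e_given_not_h = 0.2          # chance of seeing E if H is false

p_e = p_h * p_e_given_h + (1 - p_h) * p_e_given_not_h
p_h_given_e = p_h * p_e_given_h / p_e                  # belief rises if E is found
p_h_given_not_e = p_h * (1 - p_e_given_h) / (1 - p_e)  # belief falls if E is absent

expected_posterior = p_e * p_h_given_e + (1 - p_e) * p_h_given_not_e
assert abs(expected_posterior - p_h) < 1e-12           # averages back to the prior
print(p_h_given_e, p_h_given_not_e)                    # ~0.63 up, ~0.10 down
```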
Fill in the blank: The absence of evidence is, in fact, _______.
evidence of absence.
Not finding evidence you expected to see should weaken your confidence in the hypothesis, strengthening the case that the thing in question does not exist.
What is utility in decision theory?
Utility describes how much you care about something in decision-making under uncertainty.
It is often treated as equivalent to money for simplicity in calculations.
What does expected value combine?
Expected value combines probability and utility.
This helps in making decisions based on the anticipated outcomes of those decisions.
How is expected value calculated using a lottery example?
Expected value is the jackpot multiplied by the probability of winning it (equivalently, the jackpot divided by the one-in-N odds); if that figure exceeds the price of a ticket, buying one has positive expected value.
For example, if a lottery ticket costs £1 and has a jackpot of £150 million with a 1 in 131,115,985 chance, the expected value is positive.
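A sketch of that calculation with the figures quoted above (ignoring smaller prizes, taxes, and shared jackpots):

```python
# Expected value of the lottery ticket described above.

jackpot = 150_000_000          # £150 million
p_win = 1 / 131_115_985        # one in 131,115,985
ticket_price = 1               # £1

expected_value = jackpot * p_win - ticket_price
print(f"Expected value per ticket: £{expected_value:.2f}")   # ~ +£0.14
```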
What is a Dutch book in betting theory?
A Dutch book occurs when a person’s beliefs about probabilities lead to guaranteed losses regardless of the outcome.
It demonstrates irrationality in betting based on inconsistent probability assessments.
What did John von Neumann contribute to decision theory?
John von Neumann developed game theory and sought a normative way to make decisions under uncertainty to maximize expected well-being.
His work laid the foundation for understanding complex decision-making in economics.
What challenge arises when trying to maximize group utility?
Conflicts of interest between individuals complicate the process of maximizing group utility.
Different individuals may prioritize different desires, making it difficult to achieve a consensus on utility maximization.
Fill in the blank: Classical economics assumes that while you can rank people’s preferences, you cannot _______.
compare them.
This is particularly true when preferences conflict between individuals.
What is a key axiom proposed by von Neumann regarding people’s desires?
People’s desires need to be transitive.
Define transitive preferences in the context of decision-making.
If a person prefers A to B and B to C, then they must prefer A to C.
What happens if preferences are intransitive?
The individual can become a money pump.
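A toy illustration (items, fee, and trade count are invented) of how an agent with cyclic preferences gets pumped for money:

```python
# Toy money pump: an agent prefers A to B, B to C, and C to A, and will pay a
# small fee for each "upgrade", so it trades round in circles and loses money.

preferences = {("A", "B"), ("B", "C"), ("C", "A")}   # (preferred, less preferred)
fee = 0.01                                           # price of each swap

holding, cash = "C", 0.0
for _ in range(6):                                   # six trades round the cycle
    for better, worse in preferences:
        if worse == holding:
            holding, cash = better, cash - fee       # pays to trade up every time
            break

print(f"After 6 trades the agent holds {holding} again and is down £{-cash:.2f}")
```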
What are the three necessary conditions for preferences according to von Neumann?
- Transitive
- Continuous
- Monotonic
What does it mean for preferences to be continuous?
There are no sudden jumps in people’s preferences as outcomes change.
What does monotonicity imply about decision-making?
You should be indifferent between a gamble with a 50% chance of £10 and a certain £5.
What is meant by substitutability in preferences?
If you are indifferent between cake and jelly, you shouldn't care whether a gamble offers a given probability of cake or the same probability of jelly; one can be substituted for the other.
What is the utility theorem proposed by von Neumann?
People have preferences that can be assigned a numerical value, called ‘utils’.
What is the concept of ‘utils’?
A unit of measure for preferences in von Neumann’s utility theorem.
How did von Neumann apply his theories to the scenario of Holmes and Moriarty?
He analyzed their decision-making strategies using expected utilities.
What is expected utility?
The average utility of possible outcomes weighted by their probabilities.
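A minimal sketch of the calculation, with invented utilities and probabilities:

```python
# Expected utility: the probability-weighted average of the utilities of
# each possible outcome.

def expected_utility(outcomes):
    """outcomes: list of (probability, utility-in-utils) pairs."""
    return sum(p * u for p, u in outcomes)

safe_bet  = [(1.0, 5)]               # certain 5 utils
risky_bet = [(0.5, 10), (0.5, 0)]    # 50% chance of 10 utils, otherwise nothing

print(expected_utility(safe_bet))    # 5.0
print(expected_utility(risky_bet))   # 5.0 -> same expected utility
```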
What should Moriarty do to maximize his expected utility?
Be unpredictable in his actions.
What is a key challenge in decision-making under uncertainty?
It is hard to know the expected utility of any decision, because you rarely have complete information about the probabilities and outcomes involved.
What is Occam’s razor?
A principle stating that simpler explanations are preferred over complex ones.
Who is Occam’s razor named after?
William of Ockham.
What is minimum message length in decision theory?
The shortest computer program that describes a given output.
What does Kolmogorov complexity refer to?
The complexity of an object in terms of the length of the shortest description.
What example illustrates the concept of minimum message length?
Three eleven-digit strings of numbers, where one is predictable and two are not.
What is the significance of randomness in the context of algorithm complexity?
Truly random sequences require longer descriptions than predictable ones.
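An illustrative sketch, not the chapter's exact strings: a predictable eleven-digit string can be reproduced by a short rule, while a random-looking one has no shorter description than itself:

```python
# A predictable sequence can be generated by a short rule; a truly random one
# can only be stored verbatim. Both example strings are invented.

import itertools

predictable = "12345678901"      # digits cycling 1..9,0: a short rule suffices
random_looking = "83920174655"   # no obvious rule: shortest "program" ~ the string itself

# Short program reproducing the predictable string:
rule = "".join(itertools.islice(itertools.cycle("1234567890"), len(predictable)))
assert rule == predictable

# For the random-looking string, the best we can do is print the literal digits,
# so its description is roughly as long as the data it describes.
```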
What trade-off is discussed regarding hypothesis selection?
Between the complexity of the algorithm and the confidence in predicting the output.
How is information defined in the context of decision-making?
A single ‘bit’ of information can halve the probability space.
What is the role of priors in Bayesianism?
They are initial probabilities that influence the outcome of Bayesian inference.
True or False: Complexity in hypotheses should always be minimized regardless of fit.
False.
Fill in the blank: __________ says to prefer the simplest explanation in a set of hypotheses.
Occam's razor.
What does assigning p = 0.02 to the remaining doors signify?
It indicates that the search space has been halved, increasing the probability mass on each remaining option.
What is the trade-off mentioned between complexity and good fit?
If an extra bit of information doesn’t allow you to halve the search space, it’s not worth it.
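A sketch of the doors example; the starting count of 100 doors is an assumption, inferred from the jump from p = 0.01 to p = 0.02 per door implied above:

```python
# Each useful bit of evidence halves the search space and doubles the
# probability mass on each remaining option. Door count assumed for illustration.

import math

doors = 100
p_per_door = 1 / doors                      # 0.01 before any evidence

doors_after_one_bit = doors // 2            # 50 doors left after one halving
p_after_one_bit = 1 / doors_after_one_bit   # 0.02, as in the card above

bits_to_single_door = math.log2(doors)      # ~6.6 bits to pin down one door exactly
print(p_after_one_bit, bits_to_single_door)
```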
How should one choose between two hypotheses according to the text?
Assign a higher prior probability to whichever hypothesis is simpler to write as a computer program.
What is the relationship between modern AI systems and Bayesian principles?
Modern AI systems operate in a manner consistent with Bayesian decision-making under uncertainty.
What is a hyperprior?
A prior over your priors: uncertainty about which prior (or which parameter) to use, reflecting a higher-level belief about the shape of the world.
How does the example of a Bayesian AI playing hide-and-seek illustrate hyperpriors?
It demonstrates adapting prior probabilities based on changing evidence, like the opponent’s hiding behavior.
What is the significance of having multiple hypotheses in evaluating claims?
It complicates the evaluation because different priors can lead to different conclusions despite the same evidence.
What example is given to illustrate the skepticism of psychic claims?
The Mysterious Barry guessing numbers correctly, where even many correct guesses may not convince skeptics.
What were the results of Samuel Soal’s card-guessing experiment?
Two subjects scored significantly better than chance, raising questions about psychic powers.
What is the effect of alternative hypotheses on belief in psychic powers?
Alternative explanations, like fraud or design flaws, can overshadow belief in psychic phenomena.
How does prior probability influence the interpretation of new evidence?
Individuals with different priors may interpret the same evidence in opposing ways, reinforcing their beliefs.
What is the core function of artificial intelligence as described?
To predict uncertain outcomes, fundamentally based on Bayesian reasoning.
What is supervised learning in AI?
A training approach in which a model learns from labeled training data and then predicts labels for new, unseen data.
How does an AI update its probability after seeing a picture?
It shifts from a prior probability to a posterior probability based on new information.
What statistical concept is used to find the line of best fit in AI?
Linear regression.
Fill in the blank: A simple AI might predict the likelihood of an image being a lion with a prior probability of _______.
p ≈ 0.33
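A toy version of that update; three equally likely classes (hence the prior of about 0.33) and the likelihood values are invented for illustration:

```python
# Toy lion-picture update: the prior over classes shifts to a posterior
# once the image's features are taken into account.

priors = {"lion": 1/3, "tiger": 1/3, "house cat": 1/3}
likelihoods = {"lion": 0.6, "tiger": 0.3, "house cat": 0.1}  # p(features | class), assumed

evidence = sum(priors[c] * likelihoods[c] for c in priors)
posteriors = {c: priors[c] * likelihoods[c] / evidence for c in priors}

print(posteriors)   # lion now at 0.6: the prior has become a posterior
```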
True or False: Bayesian machine learning explicitly mimics Bayes’ rule.
True
What happens when evidence suggests a hypothesis is false?
Individuals might conclude the source of the evidence is untrustworthy rather than changing their belief.
What is the relationship between shoe size and height as observed in a random sample?
On average, taller people have larger feet.
What is the line of least squares used for?
To draw a line through data points that minimizes the sum of squared errors.
How is the error calculated in the line of least squares?
By measuring the vertical distance from the line to each dot and squaring that distance.
What does the sum of squared error represent?
The total error across all data points.
What is the purpose of the AI in the context of predicting height from shoe size?
To estimate a person’s height based on their shoe size using the line of least squares.
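A sketch of the line of least squares fitted by hand; the shoe-size and height data points are invented:

```python
# Ordinary least squares for the shoe-size/height example: the fit minimizes
# the sum of squared vertical distances from the line to each point.

shoe_sizes = [6, 7, 8, 9, 10, 11]            # UK sizes (illustrative)
heights_cm = [165, 170, 172, 178, 181, 185]  # illustrative heights

n = len(shoe_sizes)
mean_x = sum(shoe_sizes) / n
mean_y = sum(heights_cm) / n

slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(shoe_sizes, heights_cm))
         / sum((x - mean_x) ** 2 for x in shoe_sizes))
intercept = mean_y - slope * mean_x

predict = lambda shoe: intercept + slope * shoe
sse = sum((y - predict(x)) ** 2 for x, y in zip(shoe_sizes, heights_cm))

print(f"height ≈ {intercept:.1f} + {slope:.2f} × shoe size, sum of squared error = {sse:.1f}")
print(f"Predicted height for size 9.5: {predict(9.5):.1f} cm")
```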
What factors influence the AI’s confidence in its predictions?
The amount of training data and the variance in that data.
What is the Bayesian process mentioned in the context of AI?
It involves updating prior beliefs with new data to form a posterior distribution.
What type of curves might be used instead of a straight line for data fitting?
Curved lines, such as exponential curves, S-shaped curves, or sine waves.
What does it mean for a model to be ‘underfitting’?
When the model is too simple to accurately capture the underlying data.
What is ‘overfitting’ in the context of AI models?
When a model fits the training data too closely and fails to generalize to new data.
What are hyperparameters in AI modeling?
Parameters that control the model’s capacity to fit the data, such as the degrees of freedom allowed in curve fitting.
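A sketch of how one such hyperparameter, polynomial degree, moves a fit from underfitting to overfitting on invented data:

```python
# Polynomial degree as a hyperparameter: too low underfits, too high overfits.
# Data are made up: a roughly linear trend plus noise.

import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0, 5, 8)
y = 2 * x + 1 + rng.normal(0, 1.0, size=x.size)

for degree in (0, 1, 7):                          # too simple, about right, too flexible
    coeffs = np.polyfit(x, y, degree)
    train_error = np.mean((np.polyval(coeffs, x) - y) ** 2)
    print(f"degree {degree}: training error {train_error:.3f}")

# Degree 7 drives the training error to ~0 by threading every point,
# but such a curve generalizes poorly to new data (overfitting).
```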
What is meant by hyperpriors?
The AI’s prior beliefs about hyperparameters.
What is the primary function of AIs like ChatGPT?
To predict what a human would say or draw in response to a prompt.
How does ChatGPT generate responses?
By predicting the next word in a sequence based on prior text.
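A toy sketch of next-word prediction: the vocabulary and probabilities here are invented, and a real language model computes such a distribution over tens of thousands of tokens:

```python
# Toy next-word prediction: assign a probability to each candidate continuation,
# then sample from that distribution.

import random

prompt = "The cat sat on the"
next_word_probs = {"mat": 0.55, "sofa": 0.25, "roof": 0.15, "banana": 0.05}

words = list(next_word_probs)
weights = list(next_word_probs.values())
next_word = random.choices(words, weights=weights, k=1)[0]

print(prompt, next_word)
```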
What does the term ‘stochastic parrots’ refer to?
A term used to describe language models that predict sequences without genuine understanding.
What is the significance of building a model of the world in AI?
It helps the AI make better predictions about future data.
What was the focus of the study involving Othello-GPT?
To determine whether the AI built an internal representation of the game rather than just memorizing statistics.
What does ‘probing’ refer to in AI research?
A technique to examine the internal states of the AI and its decision-making process.
What conclusion was drawn about LLMs and their ability to predict?
They likely build internal representations that assist in making predictions.
True or False: Large language models like ChatGPT understand the world in the same way humans do.
False.
Fill in the blank: The process of predicting the next token in a sequence is an inherently ______ process.
Bayesian.
What does it imply if an AI can make original, legal moves in a game it hasn’t seen before?
It suggests the AI has formed a model of the game instead of merely memorizing moves.