CHAPTER 15 Turn Statistics into Substance Flashcards

Question 1

Q

What is a common issue with how statistics are reported?

Answer

A

Statistics are often reported or presented in ways that are misleading or unhelpful for decision making.

Question 2

Q

What is necessary to translate a statistic into useful information?

Answer

A

Think clearly about the question at hand.

Question 3

Q

What does Bayes’ rule help us with?

Answer

A

Bayes’ rule helps us update beliefs in response to new information.

Question 4

Q

What must be combined with evidence-based beliefs for decision making?

Answer

A

Your values.

Question 5

Q

What is the key to turning statistics into substance?

Answer

A

Ask and answer the question you really care about.

Question 6

Q

How can the presentation of information affect its perceived meaning?

Answer

A

Changing scales can dramatically alter whether a relationship seems large or small, important or unimportant.

Question 7

Q

What is the difference between miles-per-gallon and gallons-per-mile?

Answer

A

Miles-per-gallon tells how far a car drives given how much gasoline is burned, while gallons-per-mile indicates how much gasoline is burned given how far it is driven.

Question 8

Q

In terms of reducing gas consumption, which vehicle type should be prioritized?

Answer

A

Gas-guzzling SUVs.

Question 9

Q

What is one consequence of using miles-per-gallon as a metric?

Answer

A

It may mislead consumers and regulators into making bad decisions.

Question 10

Q

What is a better measure of fuel efficiency than miles-per-gallon?

Answer

A

Gallons-per-hundred-miles.

Question 11

Q

What confusion can arise from percent changes versus percentage point changes?

Answer

A

A percent change is a ratio of the percentage point change to the initial value, while percentage point change is the numerical difference between two percentages.

Question 12

Q

What does a 44% reduction in heart-related risks imply without context?

Answer

A

It does not necessarily indicate a significant number of lives saved.

Question 13

Q

What is the baseline heart attack risk in the studied population?

Answer

A

About 2.8%.

Question 14

Q

What is the actual reduction in heart attacks with a 44% decrease?

Answer

A

About one percentage point reduction.

Question 15

Q

Why are visual presentations of data important?

Answer

A

They help in accurately and informatively displaying quantitative information.

Question 16

Q

What is essential when creating data visualizations?

Answer

A

Choosing the scale on which to present data.

Question 17

Q

What can a seemingly innocuous change of scale in a graph do?

Answer

A

Transform a graph to make a relationship look enormous or inconsequential.

Question 18

Q

What should be questioned when viewing a data visualization?

Answer

A

The underlying data and analyses, assumptions, and whether the findings answer the question being asked.

Question 19

Q

Fill in the blank: Quantitative evidence, on its own, can’t tell you what to ______.

Question 20

Q

Fill in the blank: The same 2-miles-per-gallon improvement has a much larger effect on gas consumption when applied to a ______ vehicle.

Answer

A

gas-guzzling.

Question 21

Q

What is the significance of choosing the scale for data visualization?

Answer

A

Choosing the scale can significantly alter the perception of the data, making relationships appear large or small.

Question 22

Q

What effect does changing the vertical axis scale have in a bar graph comparing numbers 89 and 90?

Answer

A

It can make the two numbers appear vastly different or nearly identical.

Question 23

Q

What should you consider when interpreting a graph?

Answer

A

Carefully read the axes and consider what the numbers mean substantively.

Question 24

Q

True or False: There is always a correct scale for data visualization.

Question 25

Q

In what context might the difference between 89 and 90 be substantively significant?

Answer

A

If you are chaperoning school children, the difference in students returning home safely is significant.

Question 26

Q

What happens if a graph is on a scale too large?

Answer

A

Important information may be hidden, making substantively meaningful differences difficult to see.

Question 27

Q

If a 1-point difference is substantively negligible, what scale would appropriately reflect that?

Answer

A

A scale of 0 to 100.

Question 28

Q

What can changing the scale of axes in a graph influence?

Answer

A

It can make correlations look strong or weak and affect the appearance of linear relationships.

Question 29

Q

What did Achen and Bartels argue regarding voters’ policy views and political behavior?

Answer

A

They argued that policy views have little relationship to political behavior, which is driven by non-policy concerns.

Question 30

Q

In their analysis, what did Achen and Bartels use to support their claim about party affiliation?

Answer

A

A visual representation of data showing trends in party identification among white Southerners.

Question 31

Q

What was the trend in party identification for white Southerners from 1960 to the end of the twentieth century?

Answer

A

They shifted from overwhelmingly Democratic to overwhelmingly Republican.

Question 32

Q

What does the vertical axis in the visualization of party identification measure?

Answer

A

The Democratic margin, calculated as the percent identifying as Democratic minus the percent identifying as Republican.

Question 33

Q

What did the figure’s large vertical axis scale potentially obscure?

Answer

A

Substantively meaningful differences in party identification trends.

Question 34

Q

How did the trends in partisanship differ between those who opposed and did not oppose integration?

Answer

A

Those who opposed integration switched partisan affiliation at a faster rate.

Question 35

Q

What was the margin change for white Southerners who opposed integration from 1962 to 2000?

Answer

A

From a 48-point Democratic margin to an 18-point Republican margin.

Question 36

Q

What was the change in the Democratic margin for those who did not oppose integration during the same period?

Answer

A

From a 32-point Democratic margin to a 1-point Republican margin.

Question 37

Q

Fill in the blank: To effectively convey information in data visualizations, one must keep it _______.

Question 38

Q

What should be the primary focus when creating data visualizations?

Answer

A

Conveying substantive information clearly.

Question 39

Q

When is it appropriate to use a figure instead of a table?

Answer

A

When the figure conveys more information than a table would.

Question 40

Q

What is one way to convey uncertainty in data visualizations?

Answer

A

By showing distributions, standard errors, or confidence intervals.

Question 41

Q

What is Bayes’ rule used for?

Answer

A

To integrate new quantitative information into existing knowledge.

Question 42

Q

In the example of Juanita Brooks, what evidence was used to charge the Collins couple?

Answer

A

Eyewitness testimony about their characteristics and the yellow car.

Question 43

Q

What probability did the mathematician conclude regarding the innocence of the Collins couple?

Answer

A

About a 1 in 12 million chance that they were innocent.

Question 44

Q

How do you calculate the probability of multiple independent events occurring together?

Answer

A

By multiplying the probabilities of each event occurring individually.

Question 45

Q

How many cards are in a standard deck?

Question 46

Q

What was the prosecutor’s argument regarding the Collins couple’s characteristics?

Answer

A

The chances that a randomly selected person would have specific characteristics is the product of the probabilities of those characteristics.

Question 47

Q

What probability did the prosecutor initially calculate for the Collins couple?

Answer

A

1 in 12 million

Question 48

Q

What did the prosecutor’s analysis underestimate regarding the Collins couple’s probability of innocence?

Answer

A

The probability was likely closer to 1 in 1 billion.

Question 49

Q

True or False: The characteristics used in the prosecutor’s argument are independent.

Question 50

Q

What is the correct probability to consider when evaluating the Collins couple’s guilt?

Answer

A

The probability that the Collins couple is innocent, given the evidence.

Question 51

Q

What is conditional probability?

Answer

A

The probability of one event occurring given that another event has occurred.

Question 52

Q

In the context of the Collins case, what does ‘P(innocent | evidence)’ represent?

Answer

A

The probability that the Collins couple is innocent given that they match the eyewitness description.

Question 53

Q

What is Bayes’ Rule?

Answer

A

A mathematical formula for calculating the probability of a claim being true, given available evidence.

Question 54

Q

What does the prior belief represent in Bayes’ Rule?

Answer

A

The probability of a claim being true before considering new evidence.

Question 55

Q

What does the posterior belief represent in Bayes’ Rule?

Answer

A

The probability of a claim being true after incorporating new evidence.

Question 56

Q

How did the prosecutor ignore an important factor in the Collins case?

Answer

A

He focused only on the new evidence, neglecting the prior probability of innocence.

Question 57

Q

What was the prior belief regarding the Collins couple’s innocence?

Answer

A

Very close to 1, since almost all couples in LA were innocent.

Question 58

Q

What is the significance of the probability 1 in 1,000,000 in the Collins case?

Answer

A

It represents the likelihood that an innocent couple matches the eyewitness description.

Question 59

Q

According to the analysis, how many innocent couples in Los Angeles matched the eyewitness description?

Answer

A

2 innocent couples

Question 60

Q

Fill in the blank: The probability the Collins couple is guilty given that they match the eyewitness description is ______.

Question 61

Q

Why does the Collins couple have a higher probability of being innocent than guilty?

Answer

A

Because out of the three couples matching the description, two are innocent.

Question 62

Q

What was the main error in the prosecutor’s argument?

Answer

A

He answered the wrong question regarding the guilt of the Collins couple.

Question 63

Q

What was the false negative rate for Test 1 regarding celiac disease?

Answer

A

5 percent

Question 64

Q

What was the false positive rate for Test 2 regarding celiac disease?

Answer

A

50 percent

Answer 59

A

50 percent

Answer 60

A

80 percent

Answer 61

A

Approximately 1.6 percent

Answer 62

A

It allows for the combination of probabilities across tests.

Answer 63

A

It shows how to update beliefs based on new evidence and prior probabilities.

Answer 64

A

5 percent

This means that Test 1 returns a negative result for a child with celiac disease 5% of the time.

Answer 65

A

80 percent

This indicates that Test 2 returns a positive result for a child with celiac disease 80% of the time.

Answer 66

A

1 percent

This is the initial assumption about the prevalence of celiac disease among the kids in question.

Answer 67

A

Kid with celiac disease
Kid without celiac disease

The first type has celiac and experiences a false negative on Test 1, while the second type does not have celiac and experiences a false positive on Test 2.

Answer 68

A

Approximately 1 in 1,000

This illustrates how Bayesian reasoning can lead to surprising conclusions based on test results.

Answer 69

A

Screening of Passengers by Observation Techniques (SPOT)

This program aimed to catch potential terrorists using behavioral cues.

Answer 70

A

Indicators of nervousness or suspicious behavior

Different suspicious behaviors were assigned different points to determine if a traveler should be further questioned.

Answer 71

A

5 percent

This amounted to hundreds of millions of dollars per year.

Answer 72

A

Likelihood of a random traveler being a terrorist
Likelihood of a terrorist appearing suspicious
Likelihood of a non-terrorist appearing suspicious

These factors are necessary to form accurate posterior beliefs about a traveler’s risk.

Answer 73

A

The TSA does not know the answers to key questions about terrorist behavior

This lack of knowledge undermines the efficacy of the SPOT program.

Answer 74

A

Undocumented immigration status

This indicates a failure of the SPOT program to catch actual terrorists.

Answer 75

A

Approximately 2 billion

This figure highlights the scale of air travel and the challenge of identifying potential terrorists.

Answer 76

A

100

This is a generous estimate used in the analysis to evaluate the SPOT program.

Answer 77

A

Approximately 1 in 200,000

This reflects the very low likelihood of suspicious behavior indicating terrorism even under favorable assumptions.

Answer 78

A

It helps assess confidence about the truth of a hypothesis based on new evidence

Bayes’ rule provides a structured way to update beliefs in light of new data.

Answer 79

A

0.05

This threshold indicates a 5% chance of incorrectly rejecting the null hypothesis.

Answer 80

A

The probability of finding a statistically significant result given that a relationship exists

High statistical power indicates a greater chance of detecting an effect when it truly exists.

Answer 81

A

They lead to low posterior beliefs about the effect being real

This illustrates the importance of prior beliefs in interpreting statistical results.

Answer 82

A

Prior beliefs are crucial for shaping posterior beliefs, especially in studies with low prior probabilities like ESP.

If prior beliefs are low, new evidence may not significantly affect beliefs.

Answer 83

A

Stronger prior beliefs (close to 0 or 1) make it harder to change beliefs in response to new evidence, while moderate priors (around 0.2) allow for larger shifts.

This is illustrated in Figure 15.7.

Answer 84

A

False

Different prior beliefs can lead to different interpretations of the same evidence.

Answer 85

A

Bayesian statistics involves specifying the whole prior distribution of beliefs about possible relationship sizes and updating these beliefs when new evidence is presented.

This contrasts with frequentist statistics.

Answer 86

A

Percentage point change is the numerical difference between two percentages, while percent change measures the degree of change relative to the original value.

Percent change is sensitive to the original value.

Answer 87

A

Factors include:
* False positive rates
* False negative rates
* Costs of the tests
* Speed of the tests

These factors are crucial for making informed decisions about testing.

Answer 88

A

Both rates are critical for accurate diagnosis, with low rates being essential for reliable testing outcomes.

Regulatory agencies often require low rates for test approval.

Answer 89

A

[magnitude]

This involves beliefs about how likely each possible relationship size is.

Answer 90

A

Focusing on one statistic can lead to poor decision-making, as it may overlook other important factors and trade-offs.

A comprehensive evaluation of costs and benefits is necessary.

Answer 91

A

They are cheaper, can be administered at home, and provide faster results, which are critical in controlling the spread of infectious diseases.

Speed and cost are significant in the context of highly infectious diseases like coronavirus.

Answer 92

A

The main benefit is to prevent infected individuals from spreading the disease to others quickly.

Rapid testing can significantly reduce transmission rates.

Answer 93

A

Using a combination of tests allows for quick initial screening with cheaper tests and follow-up with more accurate tests to confirm results.

This approach can minimize the impact of false positives.

Answer 94

A

Understanding costs and benefits is essential for making informed decisions based on quantitative evidence, rather than just focusing on statistical outcomes.

Personal values play a role in how different costs and benefits are weighed.

Answer 95

A

The simple numerical difference between two percentages.

Answer 96

A

The difference between the initial value and the new value divided by the original value (multiplied by 100).

Answer 97

A

Percent change is highly sensitive to the original value.

Answer 98

A

The probability of an event conditional on some other information, written as Pr(C | E).

Answer 99

A

Your belief about something before learning new evidence.

Answer 100

A

Your belief about something after incorporating new evidence.

Answer 101

A

A formula for calculating your posterior belief conditional on new evidence and your prior belief.

Answer 102

A

The probability of finding a statistically significant result in the data given that the relationship really exists in the world.

Answer 103

A

12 percent.

Answer 104

A

0.12 percent.

Answer 105

A

Provide absolute values or context rather than just percentage change.

Answer 106

A

900 percent.

Answer 107

A

0.9 percentage points.

Answer 108

A

1 percent.

Answer 109

A

10 percent.

Answer 110

A

90 percent.

Answer 111

A

Correct test result for someone with coronavirus
False positive for someone without coronavirus

Answer 112

A

P(Job | Group).

Answer 113

A

P(Group | Job) is equal for both groups.

Answer 114

A

The probability of being hired given group membership.

Answer 115

A

It may provide sufficient information to determine hiring likelihood.