CHAPTER 15 Turn Statistics into Substance Flashcards

1
Q

What is a common issue with how statistics are reported?

A

Statistics are often reported or presented in ways that are misleading or unhelpful for decision making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is necessary to translate a statistic into useful information?

A

Think clearly about the question at hand.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does Bayes’ rule help us with?

A

Bayes’ rule helps us update beliefs in response to new information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What must be combined with evidence-based beliefs for decision making?

A

Your values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the key to turning statistics into substance?

A

Ask and answer the question you really care about.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can the presentation of information affect its perceived meaning?

A

Changing scales can dramatically alter whether a relationship seems large or small, important or unimportant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the difference between miles-per-gallon and gallons-per-mile?

A

Miles-per-gallon tells how far a car drives given how much gasoline is burned, while gallons-per-mile indicates how much gasoline is burned given how far it is driven.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

In terms of reducing gas consumption, which vehicle type should be prioritized?

A

Gas-guzzling SUVs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is one consequence of using miles-per-gallon as a metric?

A

It may mislead consumers and regulators into making bad decisions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a better measure of fuel efficiency than miles-per-gallon?

A

Gallons-per-hundred-miles.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What confusion can arise from percent changes versus percentage point changes?

A

A percent change is a ratio of the percentage point change to the initial value, while percentage point change is the numerical difference between two percentages.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does a 44% reduction in heart-related risks imply without context?

A

It does not necessarily indicate a significant number of lives saved.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the baseline heart attack risk in the studied population?

A

About 2.8%.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the actual reduction in heart attacks with a 44% decrease?

A

About one percentage point reduction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why are visual presentations of data important?

A

They help in accurately and informatively displaying quantitative information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is essential when creating data visualizations?

A

Choosing the scale on which to present data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What can a seemingly innocuous change of scale in a graph do?

A

Transform a graph to make a relationship look enormous or inconsequential.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What should be questioned when viewing a data visualization?

A

The underlying data and analyses, assumptions, and whether the findings answer the question being asked.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Fill in the blank: Quantitative evidence, on its own, can’t tell you what to ______.

A

do.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Fill in the blank: The same 2-miles-per-gallon improvement has a much larger effect on gas consumption when applied to a ______ vehicle.

A

gas-guzzling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the significance of choosing the scale for data visualization?

A

Choosing the scale can significantly alter the perception of the data, making relationships appear large or small.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What effect does changing the vertical axis scale have in a bar graph comparing numbers 89 and 90?

A

It can make the two numbers appear vastly different or nearly identical.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What should you consider when interpreting a graph?

A

Carefully read the axes and consider what the numbers mean substantively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

True or False: There is always a correct scale for data visualization.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

In what context might the difference between 89 and 90 be substantively significant?

A

If you are chaperoning school children, the difference in students returning home safely is significant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What happens if a graph is on a scale too large?

A

Important information may be hidden, making substantively meaningful differences difficult to see.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

If a 1-point difference is substantively negligible, what scale would appropriately reflect that?

A

A scale of 0 to 100.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What can changing the scale of axes in a graph influence?

A

It can make correlations look strong or weak and affect the appearance of linear relationships.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What did Achen and Bartels argue regarding voters’ policy views and political behavior?

A

They argued that policy views have little relationship to political behavior, which is driven by non-policy concerns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

In their analysis, what did Achen and Bartels use to support their claim about party affiliation?

A

A visual representation of data showing trends in party identification among white Southerners.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What was the trend in party identification for white Southerners from 1960 to the end of the twentieth century?

A

They shifted from overwhelmingly Democratic to overwhelmingly Republican.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What does the vertical axis in the visualization of party identification measure?

A

The Democratic margin, calculated as the percent identifying as Democratic minus the percent identifying as Republican.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What did the figure’s large vertical axis scale potentially obscure?

A

Substantively meaningful differences in party identification trends.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

How did the trends in partisanship differ between those who opposed and did not oppose integration?

A

Those who opposed integration switched partisan affiliation at a faster rate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What was the margin change for white Southerners who opposed integration from 1962 to 2000?

A

From a 48-point Democratic margin to an 18-point Republican margin.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What was the change in the Democratic margin for those who did not oppose integration during the same period?

A

From a 32-point Democratic margin to a 1-point Republican margin.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Fill in the blank: To effectively convey information in data visualizations, one must keep it _______.

A

simple

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What should be the primary focus when creating data visualizations?

A

Conveying substantive information clearly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

When is it appropriate to use a figure instead of a table?

A

When the figure conveys more information than a table would.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is one way to convey uncertainty in data visualizations?

A

By showing distributions, standard errors, or confidence intervals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What is Bayes’ rule used for?

A

To integrate new quantitative information into existing knowledge.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

In the example of Juanita Brooks, what evidence was used to charge the Collins couple?

A

Eyewitness testimony about their characteristics and the yellow car.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What probability did the mathematician conclude regarding the innocence of the Collins couple?

A

About a 1 in 12 million chance that they were innocent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

How do you calculate the probability of multiple independent events occurring together?

A

By multiplying the probabilities of each event occurring individually.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

How many cards are in a standard deck?

A

52 cards

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

What was the prosecutor’s argument regarding the Collins couple’s characteristics?

A

The chances that a randomly selected person would have specific characteristics is the product of the probabilities of those characteristics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What probability did the prosecutor initially calculate for the Collins couple?

A

1 in 12 million

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

What did the prosecutor’s analysis underestimate regarding the Collins couple’s probability of innocence?

A

The probability was likely closer to 1 in 1 billion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

True or False: The characteristics used in the prosecutor’s argument are independent.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What is the correct probability to consider when evaluating the Collins couple’s guilt?

A

The probability that the Collins couple is innocent, given the evidence.

51
Q

What is conditional probability?

A

The probability of one event occurring given that another event has occurred.

52
Q

In the context of the Collins case, what does ‘P(innocent | evidence)’ represent?

A

The probability that the Collins couple is innocent given that they match the eyewitness description.

53
Q

What is Bayes’ Rule?

A

A mathematical formula for calculating the probability of a claim being true, given available evidence.

54
Q

What does the prior belief represent in Bayes’ Rule?

A

The probability of a claim being true before considering new evidence.

55
Q

What does the posterior belief represent in Bayes’ Rule?

A

The probability of a claim being true after incorporating new evidence.

56
Q

How did the prosecutor ignore an important factor in the Collins case?

A

He focused only on the new evidence, neglecting the prior probability of innocence.

57
Q

What was the prior belief regarding the Collins couple’s innocence?

A

Very close to 1, since almost all couples in LA were innocent.

58
Q

What is the significance of the probability 1 in 1,000,000 in the Collins case?

A

It represents the likelihood that an innocent couple matches the eyewitness description.

59
Q

According to the analysis, how many innocent couples in Los Angeles matched the eyewitness description?

A

2 innocent couples

60
Q

Fill in the blank: The probability the Collins couple is guilty given that they match the eyewitness description is ______.

61
Q

Why does the Collins couple have a higher probability of being innocent than guilty?

A

Because out of the three couples matching the description, two are innocent.

62
Q

What was the main error in the prosecutor’s argument?

A

He answered the wrong question regarding the guilt of the Collins couple.

63
Q

What was the false negative rate for Test 1 regarding celiac disease?

64
Q

What was the false positive rate for Test 2 regarding celiac disease?

A

50 percent

65
Q

What percentage of kids without celiac tested positive on Test 2?

A

50 percent

66
Q

If 10,000 kids are tested, how many would be expected to have celiac disease based on the prior belief?

67
Q

What is the probability that a kid tests positive on Test 2, given that they have celiac disease?

A

80 percent

68
Q

What is the probability Abe has celiac, given that he was small in stature and tested positive on Test 2?

A

Approximately 1.6 percent

69
Q

What is the significance of the false positive and false negative rates being independent?

A

It allows for the combination of probabilities across tests.

70
Q

How does the story of Abe’s diagnosis illustrate the use of Bayes’ Rule?

A

It shows how to update beliefs based on new evidence and prior probabilities.

71
Q

What is the probability of a false negative on Test 1 for a kid with celiac?

A

5 percent

This means that Test 1 returns a negative result for a child with celiac disease 5% of the time.

72
Q

What is the probability of a positive result on Test 2 for a kid with celiac?

A

80 percent

This indicates that Test 2 returns a positive result for a child with celiac disease 80% of the time.

73
Q

What is the prior belief regarding the probability that a kid has celiac disease?

A

1 percent

This is the initial assumption about the prevalence of celiac disease among the kids in question.

74
Q

What are the two types of kids who might get a negative on Test 1 and a positive on Test 2?

A
  • Kid with celiac disease
  • Kid without celiac disease

The first type has celiac and experiences a false negative on Test 1, while the second type does not have celiac and experiences a false positive on Test 2.

75
Q

What was the probability that Abe had celiac given the two test results?

A

Approximately 1 in 1,000

This illustrates how Bayesian reasoning can lead to surprising conclusions based on test results.

76
Q

What was one major new program implemented by the U.S. government in airport security after 9/11?

A

Screening of Passengers by Observation Techniques (SPOT)

This program aimed to catch potential terrorists using behavioral cues.

77
Q

What do Behavior Detection Officers look for during airport security screenings?

A

Indicators of nervousness or suspicious behavior

Different suspicious behaviors were assigned different points to determine if a traveler should be further questioned.

78
Q

What percentage of the TSA’s annual budget was allocated to the SPOT program by 2010?

A

5 percent

This amounted to hundreds of millions of dollars per year.

79
Q

What crucial information does the TSA need to determine the likelihood of a traveler being a terrorist?

A
  • Likelihood of a random traveler being a terrorist
  • Likelihood of a terrorist appearing suspicious
  • Likelihood of a non-terrorist appearing suspicious

These factors are necessary to form accurate posterior beliefs about a traveler’s risk.

80
Q

What did the General Accountability Office (GAO) find regarding the TSA’s knowledge about the SPOT program?

A

The TSA does not know the answers to key questions about terrorist behavior

This lack of knowledge undermines the efficacy of the SPOT program.

81
Q

What was the most common reason for detaining individuals identified by a Behavior Detection Officer?

A

Undocumented immigration status

This indicates a failure of the SPOT program to catch actual terrorists.

82
Q

How many passenger trips occur through U.S. airports each year according to the GAO?

A

Approximately 2 billion

This figure highlights the scale of air travel and the challenge of identifying potential terrorists.

83
Q

How many would-be terrorists were assumed to be in U.S. airports each year for the SPOT program analysis?

A

100

This is a generous estimate used in the analysis to evaluate the SPOT program.

84
Q

What is the probability that a person who behaves suspiciously is actually a terrorist, according to the SPOT program assumptions?

A

Approximately 1 in 200,000

This reflects the very low likelihood of suspicious behavior indicating terrorism even under favorable assumptions.

85
Q

What does Bayes’ rule help us understand regarding scientific hypotheses?

A

It helps assess confidence about the truth of a hypothesis based on new evidence

Bayes’ rule provides a structured way to update beliefs in light of new data.

86
Q

What is the significance level used in hypothesis testing, often set at?

A

0.05

This threshold indicates a 5% chance of incorrectly rejecting the null hypothesis.

87
Q

What is statistical power in hypothesis testing?

A

The probability of finding a statistically significant result given that a relationship exists

High statistical power indicates a greater chance of detecting an effect when it truly exists.

88
Q

What is the impact of low prior beliefs on posterior beliefs after obtaining statistically significant results?

A

They lead to low posterior beliefs about the effect being real

This illustrates the importance of prior beliefs in interpreting statistical results.

89
Q

What role do prior beliefs play in posterior beliefs?

A

Prior beliefs are crucial for shaping posterior beliefs, especially in studies with low prior probabilities like ESP.

If prior beliefs are low, new evidence may not significantly affect beliefs.

90
Q

How does the strength of prior beliefs affect belief change in response to new evidence?

A

Stronger prior beliefs (close to 0 or 1) make it harder to change beliefs in response to new evidence, while moderate priors (around 0.2) allow for larger shifts.

This is illustrated in Figure 15.7.

91
Q

True or False: Two individuals with different prior beliefs will react the same way to the same piece of evidence.

A

False

Different prior beliefs can lead to different interpretations of the same evidence.

92
Q

What is Bayesian statistics?

A

Bayesian statistics involves specifying the whole prior distribution of beliefs about possible relationship sizes and updating these beliefs when new evidence is presented.

This contrasts with frequentist statistics.

93
Q

What is the difference between percentage point change and percent change?

A

Percentage point change is the numerical difference between two percentages, while percent change measures the degree of change relative to the original value.

Percent change is sensitive to the original value.

94
Q

What factors should be considered when evaluating diagnostic tests?

A

Factors include:
* False positive rates
* False negative rates
* Costs of the tests
* Speed of the tests

These factors are crucial for making informed decisions about testing.

95
Q

What is the significance of false positive and false negative rates in testing?

A

Both rates are critical for accurate diagnosis, with low rates being essential for reliable testing outcomes.

Regulatory agencies often require low rates for test approval.

96
Q

Fill in the blank: Bayesian reasoning can be complicated when assessing the ______ of a relationship.

A

[magnitude]

This involves beliefs about how likely each possible relationship size is.

97
Q

What is a potential drawback of focusing solely on one quantitative statistic like false negative rates?

A

Focusing on one statistic can lead to poor decision-making, as it may overlook other important factors and trade-offs.

A comprehensive evaluation of costs and benefits is necessary.

98
Q

Why might paper-strip tests be considered despite having higher false negative rates?

A

They are cheaper, can be administered at home, and provide faster results, which are critical in controlling the spread of infectious diseases.

Speed and cost are significant in the context of highly infectious diseases like coronavirus.

99
Q

What is the main benefit of rapid testing in the context of infectious diseases?

A

The main benefit is to prevent infected individuals from spreading the disease to others quickly.

Rapid testing can significantly reduce transmission rates.

100
Q

How can combining different testing methods improve diagnostic accuracy?

A

Using a combination of tests allows for quick initial screening with cheaper tests and follow-up with more accurate tests to confirm results.

This approach can minimize the impact of false positives.

101
Q

What is the significance of understanding the costs and benefits in decision-making?

A

Understanding costs and benefits is essential for making informed decisions based on quantitative evidence, rather than just focusing on statistical outcomes.

Personal values play a role in how different costs and benefits are weighed.

102
Q

What is the definition of percentage point change?

A

The simple numerical difference between two percentages.

103
Q

Define percent change.

A

The difference between the initial value and the new value divided by the original value (multiplied by 100).

104
Q

How does percent change differ from percentage point change?

A

Percent change is highly sensitive to the original value.

105
Q

What is conditional probability?

A

The probability of an event conditional on some other information, written as Pr(C | E).

106
Q

What is prior belief?

A

Your belief about something before learning new evidence.

107
Q

Define posterior belief.

A

Your belief about something after incorporating new evidence.

108
Q

What is Bayes’ rule?

A

A formula for calculating your posterior belief conditional on new evidence and your prior belief.

109
Q

What is statistical power?

A

The probability of finding a statistically significant result in the data given that the relationship really exists in the world.

110
Q

If GDP growth in Country B was 10 percent, what was GDP growth in Country A if it was reported as 20 percent higher?

A

12 percent.

111
Q

If GDP growth in Country B was 0.1 percent, what was GDP growth in Country A if it was reported as 20 percent higher?

A

0.12 percent.

112
Q

What is one way to avoid misleadingly masking differences in economic growth scenarios?

A

Provide absolute values or context rather than just percentage change.

113
Q

What is the percent difference in growth if Country C is at 1 percent and Country D is at 0.1 percent?

A

900 percent.

114
Q

What is the percentage point difference in growth if Country C is at 1 percent and Country D is at 0.1 percent?

A

0.9 percentage points.

115
Q

True or False: A false positive occurs when a person without a condition tests positive.

116
Q

What is the false positive rate reported for a specific coronavirus test?

A

1 percent.

117
Q

What is the false negative rate reported for a specific coronavirus test?

A

10 percent.

118
Q

What is the probability a person gets a positive result given that they really do have coronavirus?

A

90 percent.

119
Q

What are the two ways to get a positive test result?

A
  • Correct test result for someone with coronavirus
  • False positive for someone without coronavirus
120
Q

Using Bayes’ rule, how do you express the probability of a job given group membership?

A

P(Job | Group).

121
Q

If people in a given job are equally likely to be privileged and unprivileged, what does this imply?

A

P(Group | Job) is equal for both groups.

122
Q

What additional information is needed to determine if a member of the privileged group is more likely to be hired?

A

The probability of being hired given group membership.

123
Q

If the same number of members from both groups applied for the job, what does this imply?

A

It may provide sufficient information to determine hiring likelihood.