0 - Introduction Flashcards

1
Q

What is the primary purpose of this book?

A

To prepare readers to work effectively with data science teams and maximize the value from their expertise.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is necessary to thrive in a modern corporate environment?

A

Some understanding of data science and its applications.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does ‘good questions’ refer to in the context of data science?

A

Questions that increase the chances that proposed solutions will solve problems and avoid unnecessary expenses.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the key elements for successful collaboration between customers and data science teams?

A

Understanding goals, communicating solutions, and collaborating to deliver value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What basic knowledge should readers have before using this book?

A

Basic understanding of descriptive statistics, ability to read simple graphs, and experience with spreadsheet programs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is this book NOT intended to be?

A

A textbook for becoming a data scientist or a programming book.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What will the book discuss in relation to tools of the trade?

A

Basic information needed to become a good data science customer, including software and data storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why is it important to use the right tools in data science projects?

A

To avoid wasting time and money on incorrect software and tools.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the composition of a data science team similar to?

A

A baseball team, with different players having different skills.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are some job titles found on a data science team?

A
  • Data scientist
  • Data engineer
  • Data analyst
  • Machine learning engineer
  • Statistician
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What common failure do companies face when starting data science projects?

A

Trying to hunt mosquitoes with a machine gun, leading to distraction by advanced methods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the focus of unsupervised machine learning?

A

Grouping people based on data rather than predicting an outcome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Give an example of customer clusters in the restaurant industry.

A
  • Weekly night-outers
  • Anniversary diners
  • Family mealers
  • One-and-doners
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does supervised machine learning aim to predict?

A

An outcome of interest, such as response to an ad or hospital stay duration.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

List some methods introduced in supervised machine learning.

A
  • Linear regression
  • Logistic regression
  • Classification and regression trees
  • Random forests
  • Gradient-boosted machine learning
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What specialized topics will the book touch on?

A
  • Network analysis
  • Spatial analysis
  • Deep learning
  • AI
17
Q

What key metrics are used in network analysis?

A
  • Density
  • Centrality
18
Q

What is one application of AI discussed in the book?

A

Computer vision, including image tagging and information extraction.

19
Q

What question will senior managers likely ask regarding data science investments?

A

Are the millions of dollars spent providing a good return on the investment?

20
Q

What methods will be discussed for measuring impact?

A
  • A/B testing
  • Difference in difference
  • Interrupted time series
  • Regression discontinuity
21
Q

What is the primary question regarding data science investments?

A

Are the millions of dollars we spend on our data science investments providing a good return on the investment?

22
Q

What are some methods to measure impact in data science?

A

A/B testing, causal inference, difference in difference, interrupted time series, regression discontinuity

23
Q

What is an important ethical issue in data science?

A

Reinforcing racial, sexual, or other biases through algorithms

24
Q

What should a data science team be aware of regarding modeling?

A

The ramifications of its modeling to ensure no explicit or implicit bias is included

25
What is an example of a biased model mentioned in the text?
Credit-scoring models that may include factors like race and sex
26
What might talent prediction models create that is unfair?
Closed loops that penalize students and job applicants for not fitting historical patterns
27
Who are the two main characters introduced in the book?
Steve and Kamala
28
In which industry does Steve work?
Consumer finance
29
What roles does Kamala have in her health insurance company?
Clinical strategy and marketing
30
What is a key balance Kamala needs to achieve in her role?
Delivering good patient care while keeping her company profitable
31
What is the focus of the lessons in the book?
Being a good customer regardless of professional focus
32
Fill in the blank: The book will introduce the basic concepts of measuring _______.
impact
33
True or False: The ethical concerns in data science are often ignored.
True
34
What is one of the advanced methods mentioned for demonstrating impact?
Regression discontinuity
35
What does the book aim to provide regarding ethical practices in data science?
Basic ethical concerns, ways to avoid issues, and best practices in ethics