Fairness in Data Analytics: Common Biases Flashcards

Understand the what, why and how of common biases when conducting fair data analytics.

1
Q

What is sampling bias?

A

When a sample isn’t representative of the population being studied.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is sampling bias problematic?

A

It leads to misleading conclusions because the sample doesn’t reflect the true population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How to avoid sampling bias?

A

Ensure your sample is random and includes diverse subgroups of the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is confirmation bias?

A

The tendency to search for or interpret data in a way that confirms pre-existing beliefs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why is confirmation bias harmful in analysis?

A

It skews the analysis by focusing only on data that supports assumptions, ignoring contradictory evidence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to avoid confirmation bias?

A

Look at all the data objectively and challenge your own assumptions throughout the analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is selection bias?

A

Bias that occurs when individuals are not randomly selected, affecting the validity of the results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why is selection bias dangerous?

A

It distorts findings, as the chosen sample may have characteristics that are not representative of the target group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How to correct selection bias?

A

Use randomized sampling methods and ensure all relevant groups are included.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is survivorship bias?

A

Focusing only on the successful cases, ignoring those that failed or were excluded.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why does survivorship bias mislead?

A

It creates a false perception by only analyzing surviving cases, leading to overly optimistic conclusions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How to avoid survivorship bias?

A

Include data from all cases, both successes and failures, in your analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is response bias?

A

When survey respondents answer untruthfully or in a way they think is expected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why does response bias affect data?

A

It leads to inaccurate or skewed data, as respondents may not provide honest answers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How to minimize response bias?

A

Design surveys to be neutral and ensure respondents feel safe providing honest answers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is overfitting in data models?

A

When a model is too closely fitted to a specific dataset, failing to generalize.

17
Q

Why is overfitting a problem?

A

It performs well on training data but poorly on new, unseen data.

18
Q

How to prevent overfitting?

A

Use cross-validation and simplify the model to focus on general trends.

19
Q

What is anchoring bias?

A

The tendency to rely too heavily on the first piece of information encountered.

20
Q

Why is anchoring bias problematic?

A

It can cloud judgment by overly influencing the interpretation of new data.

21
Q

How to avoid anchoring bias?

A

Consider all data points equally without giving undue weight to initial findings.