L10: Ethics Flashcards
WHY BIG DATA ANALYTICS CAN BE ETHICALLY PROBLEMATIC
BD analytics is powerful
* (Otherwise we would not care, would we?)
* Powerful tools always open up possibilities for misuse
* The role of ethics in data analytics is often underestimated (because it has been considered
Responsibility when using BDA
- Traditional decision maker (physician uses traditional means and mistreats a patient)
- Decision maker using ML (physician relies on the machine's output and mistreats a patient)
- ML algorithm (self-driving car crashes)
-> Who is to blame? Responsibility, legal
ETHICAL CONCERNS ARE NOT JUST AN END IN THEMSELVES
Legal consequences
* Discrimination based on gender, race, etc. (unintended or not) is illegal in most jurisdictions
* Conformity with legislation, e.g., Data Protection Regulation
Impact on employees
* Homogenization of workforce (when hiring based on ML decisions)
* Frustration
Impact on customers
* E.g., racist outcome at Google Photos
* Spillover to unaffected customers
Impact on business partners
* Loss of trust, dissolving of personal relationships
EXAMPLE OF THE POWER OF BIG DATA ANALYTICS
Facebook Likes can predict (2013):
* race
* gender
* Men’s sexual orientation
* Women’s sexual orientation
Accuracies shown: 95%, 95%, 88%
WHAT IS ETHICS?
- “[A] set of moral principles/a theory or system of moral values” (Merriam-Webster)
- “[M]oral principles that govern a person’s behavior or the conducting of an activity, the branch of knowledge that deals with moral principles” (Oxford Dictionaries)
- “The field of ethics (or moral philosophy) involves systematizing, defending, and recommending concepts of right and wrong behavior” (Internet Encyclopedia of Philosophy)
Ethical frameworks
Consequentialism (J. S. Mill) / utilitarianism - the only thing that matters is the consequences of actions, and we judge consequences to be good or bad by looking at whether we harm fewer people than we help.
Deontology/Kantian - what matters is the action itself, whether it is good or bad. One should act in accordance with a universal moral law. How do you know what is a universal law? Use the categorical imperative: “act only in accordance with that maxim through which you can at the same time will that it become a universal law”.
Virtue ethics (Aristotle) - act as a virtuous person would in this situation, and cultivate the right virtues.
Etc.
(The debate is usually between the first two.)
DATA-DRIVEN PRICING
Uber uses a dynamic pricing model that determines the price from data
Examples:
* “Uber charged a rider $18,000 to go 11 miles” (The Washington Post)
* Ride was refunded afterwards
* Uber charged several times the normal price during the Sydney siege (2014)
* Uber charged several times the normal price during Hurricane Sandy (2012)
What do you think from a utilitarian point of view? What about from a Kantian perspective?
What are the limits of dynamic pricing? Are there any?
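To make the question about limits concrete, here is a toy surge-pricing rule. It is only a sketch and not Uber's actual algorithm: the function name `surge_price`, the requests-to-drivers ratio, and the `cap` parameter are all assumptions chosen for illustration.

```python
from typing import Optional


def surge_price(base_fare: float, ride_requests: int, available_drivers: int,
                cap: Optional[float] = None) -> float:
    """Toy dynamic-pricing rule (hypothetical, not Uber's real model).

    The multiplier is the ratio of ride requests to available drivers,
    never below 1.0. An optional cap bounds the multiplier from above.
    """
    if available_drivers <= 0:
        raise ValueError("available_drivers must be positive")
    multiplier = max(1.0, ride_requests / available_drivers)
    if cap is not None:
        multiplier = min(multiplier, cap)
    return round(base_fare * multiplier, 2)


# Normal conditions: demand matches supply, so the base fare applies.
normal = surge_price(20.0, ride_requests=50, available_drivers=50)

# Emergency: demand spikes while few drivers are on the road.
uncapped = surge_price(20.0, ride_requests=500, available_drivers=50)
capped = surge_price(20.0, ride_requests=500, available_drivers=50, cap=2.0)

print(normal, uncapped, capped)
```

In this made-up scenario the uncapped rule charges ten times the base fare during the spike. Whether a cap should exist, how high it should be, and who sets it is the ethical question, not a technical one.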
WHAT TO PAY ATTENTION TO
1) misrepresenting data
2) misrepresenting insights
3) privacy
4) algorithmic bias
5) misrepresenting algorithms
6) digital divides
MAIN SOURCE OF BIAS
We are used to finding and using proxies for various quantities of interest (e.g.
measuring how productive one is by looking at how many parcels they pick in a
given day), but as humans, we are also used to taking such proxies with a grain
of salt, and using common sense to get a holistic view.
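The gap between a proxy and the quantity of interest can be shown with a small sketch. Everything here is hypothetical: the `Picker` class, the `wrong_items` field, and the two made-up workers are illustration only. Note that "correct parcels" is itself just another proxy, a slightly more holistic one.

```python
from dataclasses import dataclass


@dataclass
class Picker:
    name: str
    parcels_picked: int  # the proxy for productivity
    wrong_items: int     # context that the proxy ignores


def proxy_score(p: Picker) -> int:
    """Productivity as the raw proxy: parcels picked per day."""
    return p.parcels_picked


def correct_parcels(p: Picker) -> int:
    """A slightly more holistic view: only correctly picked parcels count."""
    return p.parcels_picked - p.wrong_items


# Made-up workers, for illustration only.
workers = [Picker("A", parcels_picked=120, wrong_items=30),
           Picker("B", parcels_picked=100, wrong_items=2)]

best_by_proxy = max(workers, key=proxy_score).name
best_by_correct = max(workers, key=correct_parcels).name

print(best_by_proxy, best_by_correct)
```

A rule that optimizes the proxy alone ranks worker A first, while a person who also looks at the mistakes would likely prefer worker B.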
OKCUPID DATA BREACH
* In 2016, researchers published data on 70,000 OkCupid users, including usernames, political leanings, drug usage, and intimate sexual details.
‘Some may object to the ethics of gathering and releasing this data. However, all the data found in the dataset are or were already publicly
available, so releasing this dataset merely presents it in a more useful form.’
- Researchers Emil Kirkegaard and Julius Daugbjerg Bjerrekær
* Although the researchers did not release the real names and pictures of the OkCupid users, critics noted that their identities could easily be uncovered from the details provided, such as the usernames.
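The critics' point, that removing names and pictures does not make a dataset anonymous, can be sketched by counting how many records are unique on the remaining fields. The records, field names, and the helper `unique_fraction` below are invented for illustration and have nothing to do with the actual OkCupid data.

```python
from collections import Counter

# Made-up records with real names and pictures already removed.
records = [
    {"username": "hiker_jane_84", "city": "Aarhus", "age": 32},
    {"username": "bookworm", "city": "Aarhus", "age": 32},
    {"username": "tom.k.berlin", "city": "Berlin", "age": 41},
    {"username": "nightowl", "city": "Berlin", "age": 41},
]


def unique_fraction(rows, keys):
    """Fraction of rows whose combination of `keys` values occurs exactly once."""
    combos = Counter(tuple(row[k] for k in keys) for row in rows)
    unique = sum(1 for row in rows if combos[tuple(row[k] for k in keys)] == 1)
    return unique / len(rows)


without_usernames = unique_fraction(records, ["city", "age"])
with_usernames = unique_fraction(records, ["username", "city", "age"])

print(without_usernames, with_usernames)
```

In this toy dataset no record is unique on city and age alone, but every record becomes unique once the username is included. A unique record can be linked to a person as soon as the same username appears elsewhere.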
KEY POINT 1: AWARENESS MATTERS
- ML engineers are often not aware of potential ethical issues