Session 7 Flashcards
Sensitive Characteristics (or protected attributes)
are those that cannot be used (legally) to differentiate individuals with respect to the target variable in predictive models
By defining some characteristics as “sensitive”, we are assuming that the algorithms can end up differentiating individuals based on these characteristics
- This may be because the data reveals existing injustices (e.g., some groups of individuals may already be discriminated against, and that shows up in the data)
- It may be because of differences in tastes and behaviors that do not represent discrimination (e.g., Sinterklaas is more popular in The Netherlands than in Portugal)
Formal non-discrimination criteria
Many fairness criteria have been proposed over the years, each aiming to formalize different requirements
Most proposed fairness criteria are properties of the joint distribution of:
A - the sensitive attribute
Y - the target variable
R - the classifier or score
Most criteria fall into one of three categories regarding how these variables are related to each other:
- Independence (R ⊥ A)
- Separation (R ⊥ A | Y)
- Sufficiency (Y ⊥ A | R)
Independence
has been explored under many equivalent names and variants, such as demographic parity, statistical parity, and group fairness
Main idea: “Everybody gets treated the same”
A classifier R is independent from an attribute A if
the probability of the classifier predicting an observation to be positive (R = 1) does not change with a change in the attribute A:
Pr(R = 1 | A = a) = Pr(R = 1 | A = b)
Example: The probability that a person is predicted to default on their loan does not depend on their race
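The independence check above can be sketched in a few lines. This is a minimal example with hypothetical group labels and predictions, comparing Pr(R = 1) across groups:

```python
# Demographic parity (independence) check: Pr(R=1 | A=a) should be
# (near-)equal across groups. Toy, hand-made data.
groups      = ["a", "a", "a", "b", "b", "b", "b"]  # sensitive attribute A
predictions = [1, 0, 1, 1, 0, 1, 0]                # classifier output R

def positive_rate(group):
    # Fraction of individuals in `group` predicted positive.
    preds = [r for g, r in zip(groups, predictions) if g == group]
    return sum(preds) / len(preds)

rate_a = positive_rate("a")   # 2/3
rate_b = positive_rate("b")   # 1/2
gap = abs(rate_a - rate_b)    # demographic-parity gap; 0 means independence
```

In practice the gap is rarely exactly zero, so one typically tests it against a tolerance.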
Separation
R ⊥ A | Y
requires the score (R) to be independent from the sensitive attribute (A) given the outcome (Y). In other words, it allows correlation between the score and the sensitive attribute to the extent that it is justified by the target variable
Separation (R ⊥ A | Y)
Main idea:
Given an outcome (e.g., defaulting on a loan), the percentage of individuals predicted positive (and negative) is similar across groups of a sensitive attribute (e.g., black, white)
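Separation amounts to comparing true-positive and false-positive rates across groups. A minimal sketch with hypothetical data:

```python
# Separation check: within each true outcome Y=y, Pr(R=1 | A=a, Y=y) should
# match across groups, i.e., equal true- and false-positive rates.
# Toy data: tuples are (group, y_true, r_pred).
data = [
    ("a", 1, 1), ("a", 1, 0), ("a", 0, 0), ("a", 0, 1),
    ("b", 1, 1), ("b", 1, 1), ("b", 0, 0), ("b", 0, 0),
]

def rate(group, y):
    # Pr(R=1 | A=group, Y=y): positive-prediction rate given the true outcome.
    preds = [r for g, t, r in data if g == group and t == y]
    return sum(preds) / len(preds)

tpr_a, tpr_b = rate("a", 1), rate("b", 1)  # true-positive rates per group
fpr_a, fpr_b = rate("a", 0), rate("b", 0)  # false-positive rates per group
```

Here the rates differ (e.g., tpr_a = 0.5 vs tpr_b = 1.0), so this toy classifier would violate separation.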
Sufficiency (Y ⊥ A | R)
requires the outcome (Y) to be independent from the sensitive attribute (A) given the score (R). In other words, it allows correlation between the outcome and the sensitive attribute to the extent that it is justified by the score.
Sufficiency (Y ⊥ A | R)
Main idea:
Given a prediction, the percentage of those that are positive is similar across groups of a sensitive attribute (e.g., black, white)
PPV / NPV
Positive Predictive Value: PPV = TP / (TP + FP), the fraction of predicted positives that are truly positive
Negative Predictive Value: NPV = TN / (TN + FN), the fraction of predicted negatives that are truly negative
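Sufficiency can be checked by comparing PPV and NPV across groups: among those with the same prediction, the fraction of true positives (or negatives) should be similar. A minimal sketch with hypothetical data:

```python
# Sufficiency check via PPV/NPV: Pr(Y=1 | A=a, R=1) and Pr(Y=0 | A=a, R=0)
# should match across groups. Toy data: tuples are (group, y_true, r_pred).
data = [
    ("a", 1, 1), ("a", 0, 1), ("a", 0, 0), ("a", 0, 0),
    ("b", 1, 1), ("b", 1, 1), ("b", 0, 0), ("b", 1, 0),
]

def ppv(group):
    # Among predicted positives in `group`, fraction that are truly positive.
    rows = [y for g, y, r in data if g == group and r == 1]
    return sum(rows) / len(rows)

def npv(group):
    # Among predicted negatives in `group`, fraction that are truly negative.
    rows = [1 - y for g, y, r in data if g == group and r == 0]
    return sum(rows) / len(rows)

ppv_a, ppv_b = ppv("a"), ppv("b")  # 0.5 vs 1.0 -> sufficiency violated here
npv_a, npv_b = npv("a"), npv("b")  # 1.0 vs 0.5
```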
Relationships between criteria
In general (outside degenerate cases, e.g., when the sensitive attribute carries no information about the target), any two of these fairness criteria are mutually exclusive: a classifier can typically satisfy only one of them at a time.
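A toy numeric demonstration of the conflict (hypothetical numbers): when base rates differ between groups, a classifier that satisfies independence (equal selection rates) generally cannot also satisfy sufficiency (equal PPV).

```python
# Tuples are (group, y_true, r_pred). Group "a" has base rate 6/8,
# group "b" has base rate 2/8; both are selected at rate 0.5.
data = (
    [("a", 1, 1)] * 4 + [("a", 1, 0)] * 2 + [("a", 0, 0)] * 2 +
    [("b", 1, 1)] * 2 + [("b", 0, 1)] * 2 + [("b", 0, 0)] * 4
)

def selection_rate(group):
    # Pr(R=1 | A=group): fraction of the group predicted positive.
    rows = [r for g, y, r in data if g == group]
    return sum(rows) / len(rows)

def ppv(group):
    # Pr(Y=1 | A=group, R=1): fraction of predicted positives truly positive.
    rows = [y for g, y, r in data if g == group and r == 1]
    return sum(rows) / len(rows)

# Independence holds: both groups are selected at the same rate...
assert selection_rate("a") == selection_rate("b") == 0.5
# ...but sufficiency fails: PPV is 1.0 for group "a" vs 0.5 for group "b".
ppv_a, ppv_b = ppv("a"), ppv("b")
```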
How is Google able to find cats in my photos?
Approach #1: Predictive modeling
- Define a target variable
  - > cat vs no cat
- Gather a large set of photos
  - > label the photos
- Create a set of features (or predictors)
  - > 2 eyes, pointy ears, spots
- Run a tree induction model
- Use the model to classify my photos
Deep learning
is an area of machine learning that uses multi-layer artificial neural networks to learn patterns and feature representations directly from raw data (e.g., pixels), in both supervised and unsupervised settings