Classification Models Flashcards
What is the motivation for learning interpretable classification models?
Understanding a model improves trust in its predictions and can provide insights into the data/application domain.
Important in fields like medicine and finance, where explanations are often legally required.
What are the two approaches to interpretability?
Intrinsic approach and Post-hoc approach.
Intrinsic approaches involve directly interpretable models like decision trees, while post-hoc methods are used for black-box models.
Define global interpretability.
Interpreting the entire model at once, understanding how features interact to predict class labels generally.
Examples include small decision trees.
Define local interpretability.
Explaining the prediction of each testing example separately.
This can involve interpreting specific paths in a decision tree.
What does each path in a decision tree represent?
A rule in the form of IF-THEN statements.
For example, IF (Salary = ‘low’) THEN (Buy = ‘no’).
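The path-to-rule correspondence can be sketched in code. The nested-dict tree below is a hand-built toy (not a trained model), and the attribute/value names are taken from the card's example:

```python
# Hypothetical sketch: each root-to-leaf path in a decision tree is one IF-THEN rule.
# The tree here is a hand-built nested dict, not a trained model.

def paths_to_rules(node, conditions=()):
    """Recursively collect one rule per root-to-leaf path."""
    if not isinstance(node, dict):          # leaf: predicted class label
        cond = " AND ".join(conditions)
        return [f"IF ({cond}) THEN (Buy = '{node}')"]
    attr, branches = node["attr"], node["branches"]
    rules = []
    for value, child in branches.items():
        rules += paths_to_rules(child, conditions + (f"{attr} = '{value}'",))
    return rules

tree = {"attr": "Salary",
        "branches": {"low": "no",
                     "high": {"attr": "Age",
                              "branches": {"young": "yes", "old": "no"}}}}

for rule in paths_to_rules(tree):
    print(rule)
# The 'low' branch yields: IF (Salary = 'low') THEN (Buy = 'no')
```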
What is a pro of interpreting decision trees?
They are visual models that are easy to interpret, especially if small.
Decision trees typically focus on the most relevant attributes.
What is a con of interpreting decision trees?
Once an attribute is selected at a node, every one of its values must be given an outgoing branch, which can fragment the data into small subsets.
These branches can include values that are irrelevant to the classification.
What are the two approaches to learning IF-THEN classification rules?
Approach 1: Extraction from a decision tree; Approach 2: Learning rules directly from data.
Ordered rules can provide a clear hierarchy for classification.
List some pros of IF-THEN rules.
- Can be analyzed modularly
- Can contain only relevant attribute values
- Can be learned directly from data
This contrasts with decision trees, whose paths may include irrelevant attribute values.
List some cons of IF-THEN rules.
- Not visual/hierarchical
- May contain irrelevant values if extracted from a decision tree
- More difficult interpretation for ordered rule lists
Rules are applied sequentially, complicating interpretation.
What is the basic principle regarding model size and interpretability?
The smaller the size of the model, the simpler it is.
For decision trees, this refers to the number of nodes; for rule sets, the number of rules.
What are Naïve Bayes models based on?
Assigning a new example to the class that maximizes the product of the class's prior probability and the conditional probabilities of the example's attribute values given that class.
The Naïve Bayes formula is used for classification.
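A minimal sketch of this classification rule, using made-up prior and conditional probabilities (all numbers are illustrative, not from a real dataset):

```python
# Minimal Naive Bayes sketch: pick the class maximizing
# P(class) * product of P(attribute value | class). Toy probabilities.
from math import prod

priors = {"yes": 0.6, "no": 0.4}
# cond[class][attribute value] -> P(attribute value | class), assumed numbers
cond = {"yes": {"Salary=low": 0.2, "Age=young": 0.7},
        "no":  {"Salary=low": 0.8, "Age=young": 0.3}}

def classify(attr_values):
    scores = {c: priors[c] * prod(cond[c][a] for a in attr_values)
              for c in priors}
    return max(scores, key=scores.get), scores

label, scores = classify(["Salary=low", "Age=young"])
print(label, scores)  # 'no' wins: 0.4*0.8*0.3 = 0.096 > 0.6*0.2*0.7 = 0.084
```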
How is local interpretation of a Naïve Bayes model achieved?
By computing the importance of each attribute value for classifying the test example and ranking them.
The formula used is Imp(Attr_j) = | P(Attr_j | Class = yes) - P(Attr_j | Class = no) |.
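The importance formula and the ranking step can be written out directly; the conditional probabilities below are toy values for illustration:

```python
# Sketch of the local-importance formula from the card:
# Imp(Attr_j) = |P(Attr_j | Class=yes) - P(Attr_j | Class=no)|
p_yes = {"Salary=low": 0.2, "Age=young": 0.7}   # assumed conditionals
p_no  = {"Salary=low": 0.8, "Age=young": 0.3}

importance = {a: abs(p_yes[a] - p_no[a]) for a in p_yes}
ranking = sorted(importance, key=importance.get, reverse=True)
print(ranking)  # attribute values ordered by their local importance
```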
What is LIME in the context of model interpretability?
Local Interpretable Model-agnostic Explanations, which provide local explanations for classifications of new instances.
It fits an interpretable (typically linear) surrogate model on perturbed samples in the neighborhood of the instance.
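A toy LIME-style sketch (not the real lime library): perturb the instance, query a black-box model, and fit a local linear surrogate by ordinary least squares. The black-box model, perturbation radius, and sample count are all illustrative assumptions:

```python
# Toy LIME-style local surrogate: the fitted linear coefficients act as
# the explanation of the black box's behavior near the instance.
import random

def black_box(x):
    # opaque classifier: decision depends mostly on the first feature
    return 1.0 if 2.0 * x[0] + 0.5 * x[1] > 1.0 else 0.0

def solve(A, b):
    """Gaussian elimination for the small normal-equation system."""
    n = len(A)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for i in range(n):
        p = max(range(i, n), key=lambda r: abs(M[r][i]))
        M[i], M[p] = M[p], M[i]
        for r in range(n):
            if r != i:
                f = M[r][i] / M[i][i]
                M[r] = [a - f * c for a, c in zip(M[r], M[i])]
    return [M[i][-1] / M[i][i] for i in range(n)]

def explain_locally(x0, n=400, radius=0.5, seed=1):
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        x = [v + rng.uniform(-radius, radius) for v in x0]
        X.append([1.0] + x)                  # bias term + features
        y.append(black_box(x))
    k = len(X[0])
    A = [[sum(r[i] * r[j] for r in X) for j in range(k)] for i in range(k)]
    b = [sum(r[i] * yi for r, yi in zip(X, y)) for i in range(k)]
    return solve(A, b)                       # [bias, w1, w2]

bias, w1, w2 = explain_locally([0.5, 0.2])
print(w1, w2)   # w1 should dominate: feature 1 drives the local decision
```

Note that, as the following card points out, the explanation depends on the chosen neighborhood size (`radius` here); shrinking or growing it can change the surrogate's coefficients.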
What limitation does LIME have?
The data space region where the explanation applies is unclear.
The local linear model’s effectiveness depends on the size of the neighborhood around the instance.
What is a key takeaway regarding the interpretability of models?
The relative importance of predictive performance and interpretability is application domain-dependent.
Different models (decision trees, rule sets, Naïve Bayes) have distinct pros and cons.
What is decision tree/rule set size an objective measure of?
Simplicity
It has limited effectiveness as it is a purely syntactic measure, ignoring attribute meanings.
Does a shorter model guarantee better interpretability for users?
No
A shorter model is not necessarily more interpretable by users than a larger one.
What can black box models be indirectly interpreted by?
Learning local models for explaining each example
These local models are just surrogate models, unlike white box models which are intrinsically interpretable.
What is the central question regarding algorithm predictions and biased data?
How fair are the algorithm’s predictions given the biased data?
What percentage of images in the ImageNet dataset come from the US?
45%
This is significant considering the US only represents 4% of the world’s population.
What is the prevalence of cardiovascular disease in UK Biobank participants aged 45-54 compared to the general population?
4.6% for UK Biobank participants vs. 10.9% in general population for men
For women: 2.4% UK Biobank participants vs. 10.3% in general population.
What is the main effect of Google Translate when translating articles referring to women?
Phrases often become ‘he said’ or ‘he wrote’
This amplifies the bias in the data due to the ratio of masculine to feminine pronouns.
What does the Discrimination Score (DS) measure?
The difference in prediction probabilities between favored and unfavored individuals
DS = P(Y = +1 | S = 0) – P(Y = +1 | S = 1).
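The DS formula can be computed directly from labelled predictions. The (S, Y) pairs below are a made-up toy sample, with S = 0 as the favored group per the card's formula:

```python
# Discrimination Score on a toy prediction set; S is a binary sensitive
# attribute, Y the predicted label. All pairs are illustrative.
data = [  # (S, Y) pairs
    (0, +1), (0, +1), (0, +1), (0, -1),
    (1, +1), (1, -1), (1, -1), (1, -1),
]

def discrimination_score(data):
    def p_pos(group):
        ys = [y for s, y in data if s == group]
        return sum(1 for y in ys if y == +1) / len(ys)
    return p_pos(0) - p_pos(1)   # DS = P(Y=+1|S=0) - P(Y=+1|S=1)

print(discrimination_score(data))  # 0.75 - 0.25 = 0.5
```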