Week 8 Flashcards
What ML interpretation method separates the explanations from the machine learning model?
Model-agnostic interpretation methods
What is the advantage of using model-agnostic interpretation methods over model-specific ones?
Their flexibility. The same method can be used for any type of model.
What is the disadvantage of using only interpretable models instead of using model-agnostic interpretation methods?
Predictive performance is lost compared to other ML models, and you limit yourself to one type of model.
What are two alternatives to using model-agnostic interpretation methods?
- Use only interpretable models.
- Use model-specific interpretation methods.
What is the disadvantage of using model-specific interpretation methods compared to model-agnostic ones?
It binds you to one model type and it’s difficult to switch to something else.
Name three flexibilities that are desirable aspects of a model-agnostic explanation system:
- Model flexibility
- Explanation flexibility
- Representation flexibility
Model flexibility (as an aspect of a model-agnostic explanation system)
It can work with any ML model, such as random forests and deep neural networks.
Explanation flexibility (as an aspect of a model-agnostic explanation system)
It’s not limited to a certain form of explanation. For example, a linear formula and a graphic with feature importances are both options.
Representation flexibility (as an aspect of a model-agnostic explanation system)
It’s able to use a different feature representation than the model being explained.
How can we further distinguish model-agnostic interpretation methods?
Into local and global methods.
What do global model-agnostic interpretation methods describe?
How features affect the prediction on average.
What do local model-agnostic interpretation methods describe?
An individual prediction.
How are global model-agnostic methods often expressed?
As expected values based on the distribution of the data.
What is the partial dependence plot?
A feature effect plot: the expected prediction when all other features are marginalized out.
When are global interpretation methods particularly useful?
When you want to understand the general mechanisms in the data or debug a model (since they describe average behavior).
PDP (abbreviation)
Partial dependence plot
PD plot (abbreviation)
Partial dependence plot
What does the PDP show?
The marginal effect one or two features have on the predicted outcome of an ML model. It can show whether the relationship between the target and a feature is linear, monotonic, or more complex.
What does $x_S$ denote in the PD function for regression?
The features for which the PD function should be plotted.
What does $X_C$ denote in the PD function for regression?
The other features (the non-$x_S$ features) used in the ML model $\hat{f}$.
How does PD work?
By marginalizing the ML model output over the distribution of the features in set C, so that the function shows the relationship between the features in set S that we are interested in and the predicted outcome.
Give the PD function for regression in the form of an expectation:
$\hat{f}_S(x_S) = E_{X_C}\left[\hat{f}(x_S, X_C)\right]$
Give the PD function for regression in the form of an integral:
$\hat{f}_S(x_S) = \int \hat{f}(x_S, X_C) \, dP(X_C)$
How is the partial function $\hat{f}_S$ estimated?
By calculating averages in the training data, using the Monte Carlo method.
Give the partial function $\hat{f}_S$ that is used in the PD function for regression:
$\hat{f}_S(x_S) = \frac{1}{n} \sum_{i=1}^{n} \hat{f}\left(x_S, x_C^{(i)}\right)$
What does the partial function $\hat{f}_S$ in the PD function for regression tell us?
For given values of the features in S, it tells us what the average marginal effect on the prediction is.
What does $x_C^{(i)}$ denote in the partial function $\hat{f}_S$ in the PD function for regression?
Actual feature values from the dataset for the features in which we are not interested.
What is n in the partial function $\hat{f}_S$ in the PD function for regression?
The number of instances in the dataset.
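A minimal sketch of this Monte Carlo estimate, assuming a fitted `model` with a scikit-learn-style `.predict()` and a pandas DataFrame `X` (both hypothetical names, not part of the flashcards):

```python
import numpy as np

def partial_dependence(model, X, feature, grid_values):
    """Monte Carlo estimate of the PD function for one feature S."""
    pd_values = []
    for value in grid_values:
        X_mod = X.copy()
        X_mod[feature] = value          # fix x_S to the same value for all instances
        preds = model.predict(X_mod)    # the x_C^(i) keep their actual dataset values
        pd_values.append(preds.mean())  # average over the n instances
    return np.array(pd_values)
```

Plotting `grid_values` against the returned averages gives the PD plot.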
What is the assumption of the PDP about the relationship between C and S?
The features in C are not correlated with the features in S.
What happens if the assumption that features in C are not correlated with features in S is violated in PDP?
The averages calculated for the PDP will include data points that are unlikely or even impossible (e.g., for correlated height and weight, a combination like a very tall person with a very low weight).
What does the PDP display for classification when the ML model outputs probabilities?
The probability for a certain class given different values for features in S.
What kind of model-agnostic method is the PDP?
A global method.
How do you calculate the partial dependence for categorical features?
Replace the feature value of all data instances with one value and average the predictions.
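Using the hypothetical `partial_dependence` sketch above, the categorical case just passes the unique categories as the grid:

```python
# Hypothetical feature name; the grid is simply the observed categories.
grid = X["season"].unique()
pd_season = partial_dependence(model, X, "season", grid)
```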
What does a flat PDP indicate?
The feature is not important.
How is importance of a feature defined in PDP for numerical features?
As the deviation of each unique feature value from the average curve.
What is the variable for the importance of numerical features?
$I(x_S)$
Range rule
A rough way of estimating the deviation when you only know the range: divide the range (maximum minus minimum) by four.
Why should the PDP-based feature importance be interpreted with care?
It captures only the main effect of the feature and ignores possible feature interactions.
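A sketch of both importance measures under these definitions, taking the `pd_values` array returned by the hypothetical `partial_dependence` sketch above:

```python
import numpy as np

def pdp_importance_numerical(pd_values):
    # Deviation of each grid point's PD value from the average curve value.
    K = len(pd_values)
    return np.sqrt(np.sum((pd_values - pd_values.mean()) ** 2) / (K - 1))

def pdp_importance_categorical(pd_values):
    # Range rule: rough deviation estimate as (max - min) / 4.
    return (pd_values.max() - pd_values.min()) / 4
```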
Name three disadvantages of the PDP:
- It doesn’t show the feature distribution, so you might overinterpret regions with almost no data.
- Assumption of independence.
- Heterogeneous effects might be hidden (averaged out by marginalizing).
What does permutation feature importance measure?
The increase in the prediction error of the model after we permute the feature’s values, which breaks the relation between the feature and the true outcome.
When is a feature important when using permutation feature importance?
If shuffling its values increases the model error, because then the model relied on the feature for the prediction.
Should you compute importance on training or test data?
Since permutation feature importance relies on measuring the model error, you should use unseen test data; on training data, an overfitted model can make features look important even though they do not generalize.
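A minimal sketch of permutation feature importance on test data, assuming a regression model with `.predict()` and mean squared error as the error measure (both assumptions, not fixed by the flashcards):

```python
import numpy as np
from sklearn.metrics import mean_squared_error

def permutation_importance(model, X_test, y_test, feature, n_repeats=5, seed=0):
    rng = np.random.default_rng(seed)
    base_error = mean_squared_error(y_test, model.predict(X_test))
    increases = []
    for _ in range(n_repeats):
        X_perm = X_test.copy()
        # Shuffling one column breaks its relation with the true outcome.
        X_perm[feature] = rng.permutation(X_perm[feature].values)
        increases.append(mean_squared_error(y_test, model.predict(X_perm)) - base_error)
    return float(np.mean(increases))
```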
Global surrogate model
An interpretable model that is trained to approximate the predictions of a black box model.
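A minimal sketch of fitting a global surrogate, assuming `black_box` is any fitted model and using a shallow decision tree as the interpretable approximation (hypothetical names and model choice):

```python
from sklearn.tree import DecisionTreeRegressor

def fit_surrogate(black_box, X, max_depth=3):
    # Train the interpretable model on the black box's predictions, not on the true labels.
    y_hat = black_box.predict(X)
    surrogate = DecisionTreeRegressor(max_depth=max_depth)
    return surrogate.fit(X, y_hat)
```

How closely the surrogate tracks the black box's predictions (e.g., via R²) indicates how far its explanations can be trusted.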