Week 7 Flashcards by Jedidja Marsman

Interpretability

The degree to which a human can understand the cause of a decision and consistently predict the result of a model.

Explainable

When the feature values of an instance can be related to the model's prediction in a way that humans can understand.

What levels of interpretability / explainability does Molnar state?

Interpretability is at a global level of the model, explainability is concerned with an individual prediction.

Intrinsically interpretable models

Provide everything needed to explain their decisions. They are interpretable because of their simple structure and contain all the required information themselves.

Model agnostic explainable AI method

Can explain any model, regardless of its type.

Model specific explainable AI method

Accesses and uses the model internals.

Counterfactual explanation

The instance closest to the one we are trying to explain, i.e. with minimal changes to its feature values, that receives a different prediction.
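
The idea can be sketched in a few lines. This is only an illustration: `predict` is a hypothetical toy classifier, the candidate list is made up, and distance is measured with a simple L1 norm. Real counterfactual methods search or optimize instead of enumerating candidates.

```python
def predict(x):
    # Hypothetical toy model: approve (1) if income minus debt exceeds 10.
    income, debt = x
    return 1 if income - debt > 10 else 0


def l1_distance(a, b):
    # Sum of absolute feature differences between two instances.
    return sum(abs(ai - bi) for ai, bi in zip(a, b))


def nearest_counterfactual(instance, candidates):
    # Keep only candidates whose prediction differs from the instance's,
    # then return the one that required the smallest change.
    original = predict(instance)
    flipped = [c for c in candidates if predict(c) != original]
    if not flipped:
        return None
    return min(flipped, key=lambda c: l1_distance(instance, c))


instance = (30, 25)  # predicted 0 (rejected)
candidates = [(30, 25), (36, 25), (30, 15), (50, 5), (32, 24)]
counterfactual = nearest_counterfactual(instance, candidates)
```

Here the counterfactual reads as "had the income been 36 instead of 30, the prediction would have changed".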

CNN (abbreviation)

Convolutional neural network

Expressive power

What is the structure of the explanations?
E.g. ‘if-then’ rules, a tree, natural language…

Translucency

Describes how much the explanation method relies on looking into the machine learning model, e.g. its parameters.

Portability (property of explanation method)

Measures the range of machine learning models with which the explanation can be used.

Algorithmic complexity (property of explanation method)

The computational complexity of the explanation method.

Accuracy (property of explanation)

How well does an explanation predict unseen data?

Fidelity (property of explanation)

How well does the explanation approximate the prediction of the black box model?
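
One simple way to put a number on this is the fraction of inputs on which the explanation model and the black box agree. The sketch below assumes two hypothetical toy functions, `black_box` and `surrogate`; agreement rate is one possible fidelity measure, not the only one.

```python
def black_box(x):
    # Hypothetical black-box classifier.
    return 1 if x * x > 4 else 0


def surrogate(x):
    # Hypothetical interpretable explanation model: a single threshold.
    return 1 if x > 2 else 0


def fidelity(explainer, model, inputs):
    # Fraction of inputs where the explanation reproduces the model's prediction.
    agree = sum(1 for x in inputs if explainer(x) == model(x))
    return agree / len(inputs)


inputs = [-3, -1, 0, 1, 3, 4]
score = fidelity(surrogate, black_box, inputs)
```

The surrogate disagrees with the black box only at x = -3, so it reproduces five of the six predictions.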

Certainty (property of explanation)

Does the explanation reflect the certainty of the machine learning model?

Comprehensibility (property of an explanation)

How well do humans understand the explanations? How convincing are they?

How is fidelity measured?

Objectively

How is plausibility measured?

By comprehensibility: it requires a user study.

How is simulatability measured?

By measuring the degree to which a human can calculate or predict the model’s outcome, given the explanation.

What should a good explanation be?

Contrastive
Selective
Social
Truthful
General and probable
Consistent with prior beliefs.

Black-box models

Require post-hoc explanations; these explanations cannot have perfect fidelity.

What are the two families of interpretable models that we’re focusing on?

Linear models
Decision trees and decision rules

Give the hierarchy of linear models from most general to most specific:

Generalized additive models
Generalized linear models
Linear models
Scoring systems

Scoring system

A specialised type of linear model that gives an integer in a range as output.
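
A minimal sketch of such a model, assuming a hypothetical point table: each feature that applies adds a small integer number of points, so the output is always an integer between the sum of the negative points and the sum of the positive points.

```python
# Hypothetical point table; the features and values are illustrative only.
POINTS = {"age_over_60": 2, "smoker": 3, "exercises": -1}


def score(patient):
    # Add up the integer points of every feature that is present.
    return sum(points for feature, points in POINTS.items() if patient.get(feature))


patient = {"age_over_60": True, "smoker": True, "exercises": True}
total = score(patient)

# The possible output range follows directly from the point table.
lowest = sum(p for p in POINTS.values() if p < 0)
highest = sum(p for p in POINTS.values() if p > 0)
```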

Generalized linear model

Multivariate linear model

Multivariate polynomial model

Linearity

f(x + y) = f(x) + f(y) and f(c * x) = c * f(x)
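
Both conditions can be spot-checked numerically. The helper `is_linear` below is a hypothetical name, and checking a handful of sample points can only disprove linearity, never prove it. The example shows that 3x passes, while 3x + 1 (affine because of the intercept) fails the additivity condition.

```python
def is_linear(f, samples, scalars, tol=1e-9):
    # Additivity: f(x + y) == f(x) + f(y) for all sampled pairs.
    additive = all(
        abs(f(x + y) - (f(x) + f(y))) <= tol for x in samples for y in samples
    )
    # Homogeneity: f(c * x) == c * f(x) for all sampled scalars and points.
    homogeneous = all(
        abs(f(c * x) - c * f(x)) <= tol for c in scalars for x in samples
    )
    return additive and homogeneous


samples = [-2.0, 0.5, 3.0]
scalars = [-1.0, 2.0]
linear_ok = is_linear(lambda x: 3 * x, samples, scalars)
affine_ok = is_linear(lambda x: 3 * x + 1, samples, scalars)
```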

Homoscedasticity

Constant variance of the error terms

Name five assumptions for linear modeling:

1. Normality of the target variable given the features
2. Homoscedasticity
3. Independence of instances
4. Absence of multicollinearity
5. Linearity

Multicollinearity

When there are pairs of strongly correlated features in the data, i.e. columns of the data matrix are correlated with each other.
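
A quick way to look for this is to compute the pairwise Pearson correlation between columns and flag pairs above a cutoff. The sketch below uses made-up data, and the 0.9 threshold is an assumption for illustration, not a standard value.

```python
import math


def pearson(xs, ys):
    # Pearson correlation coefficient of two equally long columns.
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def correlated_pairs(columns, threshold=0.9):
    # Return every pair of column names whose |correlation| reaches the threshold.
    names = list(columns)
    pairs = []
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if abs(pearson(columns[a], columns[b])) >= threshold:
                pairs.append((a, b))
    return pairs


# Illustrative data: room count grows in lockstep with size.
columns = {
    "size_m2": [50, 70, 90, 110],
    "rooms": [2, 3, 4, 5],
    "age_years": [30, 5, 40, 10],
}
pairs = correlated_pairs(columns)
```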

What does it mean when we have a modular view in the interpretation of linear models?

We assume all remaining feature values are fixed, so that a change in one particular feature is reflected directly in the outcome.

Numerical feature weight (in the interpretation of linear models)

When all other features are held constant, it is the change in the outcome when the feature value is increased by one unit.
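
This can be verified on a toy model. The weights and intercept below are hypothetical; the point is only that raising one feature value by one unit, with everything else fixed, shifts the prediction by exactly that feature's weight.

```python
# Hypothetical fitted linear model.
WEIGHTS = {"rooms": 20.0, "age_years": -0.5}
INTERCEPT = 50.0


def predict(x):
    # Linear model: intercept plus the weighted sum of feature values.
    return INTERCEPT + sum(WEIGHTS[name] * value for name, value in x.items())


house = {"rooms": 3, "age_years": 10}
# Same house with one more room; all other feature values stay fixed.
bigger = dict(house, rooms=house["rooms"] + 1)
delta = predict(bigger) - predict(house)
```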

Binary feature weight (in interpretation of linear models)

The contribution to the model outcome of the feature when it is set to one.

Categorical feature with L categories (interpretation of linear models)

One-hot encoding
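
A minimal sketch of the encoding, with a hypothetical helper `one_hot` and made-up categories. In linear models one category is often dropped and used as the reference, leaving L - 1 binary columns; the sketch keeps all L for simplicity.

```python
def one_hot(value, categories):
    # One binary column per category: 1 for the matching category, else 0.
    return [1 if value == category else 0 for category in categories]


categories = ["red", "green", "blue"]
encoded = one_hot("green", categories)
```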

Feature effect

The product of the estimated weight and the normalized feature value.
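
As a worked example with hypothetical weights, and assuming the feature values are already on the scale the model was fitted on:

```python
# Hypothetical estimated weights and one instance's feature values.
weights = {"rooms": 20.0, "age_years": -0.5}
instance = {"rooms": 3, "age_years": 10}

# Effect of each feature on this instance's prediction: weight * feature value.
effects = {name: weights[name] * instance[name] for name in weights}
```

For this instance the number of rooms contributes 60.0 to the prediction and the age contributes -5.0.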

How can we model nonlinear component functions f_j (x_j) in GAM models?

We learn them greedily and use splines.

Splines

Indicator functions

Are binary. They give one output if a condition is met and another output otherwise.
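
A minimal sketch, assuming the usual convention of 1 when the condition holds and 0 otherwise; `indicator` and `is_adult` are hypothetical names.

```python
def indicator(condition):
    # Wrap a condition into a function that returns 1 if it holds, else 0.
    return lambda x: 1 if condition(x) else 0


is_adult = indicator(lambda age: age >= 18)
outputs = [is_adult(age) for age in (12, 18, 40)]
```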

Week 7 Flashcards

(41 cards)