Retail Credit Risk Flashcards

Question 1

Q

define retail lending

Answer

A

exposure to an individual/small business, and guaranteed by such person

Question 2

Q

what are 4 examples of retail lending.

Answer

A

credit cards
residential mortgages
small business facilities
installment loans

Question 3

Q

what are two characteristics of retail lending

Answer

A

low individual exposure
managed collectively rather than individually

Question 4

Q

what is a credit risk score

Answer

A

a total number of points that predicts a borrower’s future repayment performance based on historical information

Question 5

Q

what is a scorecard

Answer

A

a mathematical algorithm used to generate a score for rank-order risk analysis

Question 6

Q

what are scorecards used for

Answer

A

lending decisions
mitigation of portfolio credit risk

Question 7

Q

what are two benefits of using a scorecard

Answer

A

easy to interpret
easy to monitor

Question 8

Q

what are the 6 stages in model development

Answer

A

business objectives
data preparation
model development
model approval
model deployment
monitoring

Question 9

Q

what are 3 aspects of business objectives

Answer

A

key issues
expectations for the model
structure

Question 10

Q

define key issues

Answer

A

trends, challenges and concerns outlined by the business

Question 11

Q

define structure

Answer

A

project team members, data and timeline

Question 12

Q

what are the 5 C’s of data preparation

Answer

A

Comprehensiveness
Clean
Consistent
Current
Caretaking

Question 13

Q

define comprehensiveness

Answer

A

ensuring the data captures the full scope and complexity of the underlying information

Question 14

Q

define clean

Answer

A

ensuring the accuracy of the data

Question 15

Q

define consistent

Answer

A

ensuring the uniformity of the data across different sources.

Question 16

Q

define current

Answer

A

ensuring the data is up to date

Question 17

Q

define caretaking

Answer

A

the ongoing management of the data to preserve its quality

Question 18

Q

what are 6 aspects of the data preparation in the model development lifecycle

Answer

A

the 5 C’s
exclusion criteria
timeframe
defining the target and explanatory variables
segmentation (# of models)
sampling

Question 19

Q

what are three sources of exclusion criteria

Answer

A

scope
data errors
operational

Question 20

Q

what two periods are involved in the timeframe of model creation

Answer

A

observation period
performance period

Question 21

Q

what are two aspects of the observation periods

Answer

A

for explanatory variables
should be representative of the current/future environment

Question 22

Q

what are two aspects of performance periods

Answer

A

for the target variable
should be long enough to have a sufficient number of defaults.

Question 23

Q

what are the two modeling techniques

Answer

A

industry standard
other methodologies

Question 24

Q

compare the advantages of the two modeling techniques

Answer

A

industry-standard:
1. few variables
2. expert judgment

other methods:
1. many variables
2. one step for variable reduction and model fitting
3. adaptive learning

Question 25

Q

compare the disadvantages of the two modeling techniques

Answer

A

industry-standard:
1. few variables
2. distributional assumptions
3. separate steps for variable reduction and model fitting

other methods:
1. many variables
2. risk of overfitting

Question 26

Q

what are the 5 steps of the industry standard model development technique?

Answer

A

variable transformation
variable reduction
model fitting
scorecard scaling
scorecard assessment

Question 27

Q

what technique can be used in variable transofrmation

Answer

A

weight of evidence

Question 28

Q

define variable reduction

Answer

A

removing any variable that cannot be used or doesnt make sense

Question 29

Q

what are two techniques for variable reduction

Answer

A

grouping
variable clustering

Question 30

Q

what is grouping

Answer

A

creating bins within a variable

Question 31

Q

what are three benefits of grouping?

Answer

A

i. Accounts for non-linear relationship between the target and explanatory variables.
ii. Accounts for outliers
iii. Allows for the treatment of missing values as a separate category.

Question 32

Q

how should grouping be performed?

Answer

A

Start by creating 20 equal bins.
Calculate the WOE of each bin.
Collapse bins with similar WOE.
Remove variables with weak IV.

Question 33

Q

what is variable clustering

Answer

A

grouping correlated variables together such that variables within a cluster are highly correlated and variables between of clusters are uncorrelated two reduce the multicollinearity of the model.

Question 34

Q

which two variables should represent the cluster then using variable clustering?

Answer

A

the variable with the highest IV
the variable with the lowest 1-R^2

Question 35

Q

what are two aspects of model fitting in the industry standard technique?

Answer

A

variable selection: forward, backwards, ridge lasso
assumptions that historical experiences predict future behaviour and that consumer behaviour will not change significantly

Question 36

Q

define scorecard scaling when using the industry standard technique

Answer

A

raw scores are scaled to a three digit number

Question 37

Q

what is the formula in score in scorecard scaling

Answer

A

score=offset+(factor⋅ln⁡(2⋅odds) )-PDO

Question 38

Q

what are the 3 types of scorecard assessment

Answer

A

rank ordering
population stability
benchmarking

Question 39

Q

what are the 5 evaluation metrics used in rank ordering scorecard assessment

Answer

A

KS statistic
misclassification
ROC curve
accuracy ratio
lift chart

Question 40

Q

what does population stability do

Answer

A

quantify population differences by measuring the shift between two sample distributions

Question 41

Q

what is the formula for the population shift index (PSI)

Answer

A

PSI=∑[(N_bin-B_bin )⋅ln⁡(N_bin/B_bin ) ]

Question 42

Q

what values of PSI indicate: no significant shift, a minor shift, a significant shift

Answer

A

<0.1: no significant shift
0.1-0.25: minor shift
>0.25: significant shift

Question 43

Q

what is benchmarking

Answer

A

comparing a scorecard to an existing scorecard

Question 44

Q

what is the KS statistic

Answer

A

the maximum difference between the CDFs of the distributions of defaults and non-defaults

Question 45

Q

what is misclassification

Answer

A

the confusion matrix

Question 46

Q

what is the ROC curve

Answer

A

the probability a randomly chosen non-default will be ranked righter than a randomly chosen default; plots the true positive rate against the false positive rate

Question 47

Q

what is the formula of the accuracy ratio

Answer

A

AR=GINI/(Perfect GINI)

Question 48

Q

what is the GINI index

Answer

A

the area between the Lorenz and random curve

Question 49

Q

what is a lift chart

Answer

A

the cumulative % of defaults per decile divided by the total population % of defaults.

Question 50

Q

what does weight of evidence do

Answer

A

transforms explanatory variables into a set of groups based on the similarity of the target variable distributions.

Question 51

Q

what does WOE measure

Answer

A

how strong a group is at separating defaults from non-defaults

Question 52

Q

what does a negative WOE signify?

Answer

A

more defaults than non-defaults

Question 53

Q

what is the formula for WOE

Answer

A

WOE=ln⁡[((# non-defaults)/(total non-defaults))/((# defaults)/(total defaults))]

Question 54

Q

what is a variable’s information value

Answer

A

the predictive power of a single variable (its ability to separate defaults from non-defaults)

Question 55

Q

what is the formula for information value

Answer

A

IV=∑[[(# non-defaults)/(total non-defaults)-(# defaults)/(total defaults)]⋅WOE_i

Question 56

Q

what IV value ranges indicate:
very weak
weak
moderate
strong

Answer

A

<0.02: very weak
0.02-0.1: weak
0.1-0.3: moderate
0.3+: strong