Section 3 Evaluation of a classifier Flashcards

1
Q

Name 3 measures to evaluate how a linear regression model performs

A

Root mean squared error
Mean absolute error
R^2
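
As a minimal sketch (numpy assumed available; y and y_hat are hypothetical arrays of observed and predicted values), these three measures can be computed as:

import numpy as np

y = np.array([3.0, 5.0, 7.5, 10.0])       # hypothetical observed values
y_hat = np.array([2.8, 5.4, 7.0, 10.3])   # hypothetical predictions

rmse = np.sqrt(np.mean((y - y_hat) ** 2))                        # root mean squared error
mae = np.mean(np.abs(y - y_hat))                                 # mean absolute error
r2 = 1 - np.sum((y - y_hat) ** 2) / np.sum((y - y.mean()) ** 2)  # R^2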

2
Q

What is the MAP rule

A

The threshold τ can take any value in the continuous set [0, 1]; a popular choice is τ = 0.5, the “maximum a posteriori (MAP) rule”, which assigns each observation to the most likely class, i.e. the class with the largest estimated probability.
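
As a minimal sketch (p is a hypothetical array of estimated probabilities P(y = 1 | x)), the MAP rule is just thresholding at τ = 0.5:

import numpy as np

p = np.array([0.2, 0.7, 0.55, 0.4])   # hypothetical estimated probabilities of class 1
tau = 0.5                             # MAP rule threshold
y_hat = (p >= tau).astype(int)        # assign the most likely class: array([0, 1, 1, 0])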

3
Q

When might the MAP rule not be a correct choice to make?

A

The MAP rule implicitly assumes the two classes are balanced, so it may not be a sensible choice when they are not.
It is also not a sensible choice if different misclassification costs are associated with false positives and false negatives, e.g. medical applications, where the category of interest is rare.

4
Q

How to check balance of data

A

The first thing to do with the data is to look at the class proportions in the sample.
If the data are imbalanced (skewed), then applying the MAP rule will produce predictions skewed towards the majority class.

5
Q

Explain threshold classifier evaluation

A

If τ = 0, then every observation is predicted as y_hat_i = 1.
If τ = 1, then every observation is predicted as y_hat_i = 0.

6
Q

Explain the meaning of TN, TP, FN, FP

A

TN – True negatives, i.e. the number of 0s correctly classified as 0.
TP – True positives, i.e. the number of 1s correctly classified as 1.
FN – False negatives, i.e. the number of 1s wrongly classified as 0.
FP – False positives, i.e. the number of 0s wrongly classified as 1.
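
A minimal sketch of these counts, assuming y and y_hat are hypothetical 0/1 arrays of true and predicted labels:

import numpy as np

y = np.array([0, 0, 1, 1, 1, 0])       # hypothetical true labels
y_hat = np.array([0, 1, 1, 0, 1, 0])   # hypothetical predicted labels

TN = np.sum((y == 0) & (y_hat == 0))   # 0s correctly classified as 0
TP = np.sum((y == 1) & (y_hat == 1))   # 1s correctly classified as 1
FN = np.sum((y == 1) & (y_hat == 0))   # 1s wrongly classified as 0
FP = np.sum((y == 0) & (y_hat == 1))   # 0s wrongly classified as 1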

7
Q

What is sensitivity

A

Sensitivity (recall) focuses on the positive cases, assessing P(y_hat = 1 | y = 1): of those that are truly positive, how many are classified as positive.
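In terms of the confusion matrix counts: Sensitivity = TP / (TP + FN).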

8
Q

What is recall?

A

Recall is the same quantity as sensitivity: it focuses on the positive cases, assessing P(y_hat = 1 | y = 1): of those that are truly positive, how many are classified as positive. The false negative rate is 1 - recall.

9
Q

What is accuracy

A

Accuracy assesses P(y_hat = y): the overall proportion of observations that are classified correctly.
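In terms of the confusion matrix counts: Accuracy = (TP + TN) / (TP + TN + FP + FN).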

10
Q

What is specificity

A

Specificity focuses on the negative cases, assessing P(y_hat = 0 | y = 0): of those that are truly negative, how many are classified as negative. The false positive rate is 1 - specificity.
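In terms of the confusion matrix counts: Specificity = TN / (TN + FP), so the false positive rate is FP / (FP + TN) = 1 - Specificity.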

11
Q

What is precision

A

Precision assesses P(y = 1 | y_hat = 1): of those classified as positive, how many are truly positive.
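In terms of the confusion matrix counts: Precision = TP / (TP + FP).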

12
Q

Describe the relationship of precision vs recall

A

Precision and recall are in tension: as the threshold τ increases, recall typically decreases while precision typically increases, and vice versa.

13
Q

If positive class is rare what metrics would we focus on?

A

An important case is when the positive class is rare; then we want to focus on the positive class being predicted well. In this scenario we want a good balance between precision and recall.

14
Q

Express recall in terms of precision

A

Recall = (Precision × prevalence from the model) / (prevalence from the data)
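This identity can be checked with the confusion matrix counts: Precision = TP / (TP + FP), the prevalence from the model is (TP + FP) / N and the prevalence from the data is (TP + FN) / N, so (Precision × model prevalence) / (data prevalence) = (TP / N) / ((TP + FN) / N) = TP / (TP + FN) = Recall.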

15
Q

What is the ROC curve

A

The receiver operating characteristic (ROC) curve plots sensitivity (the true positive rate) against 1 - specificity (the false positive rate). The curve illustrates the diagnostic ability of a binary classifier as the discrimination threshold τ is varied.

16
Q

How can we evaluate a classifier by the ROC curve

A

The area under the ROC curve (AU-ROC) answers: how often will a randomly chosen true 1 receive a higher predicted probability of being a 1 than a randomly chosen true 0?
A perfect classifier would have AU-ROC = 1, with the ROC curve pushed to the top left corner. This would imply both large sensitivity and large specificity.
A classifier no better than random guessing would have AU-ROC = 0.5.
A common way of choosing τ in relation to the ROC curve is to maximise the sum of sensitivity and specificity, balancing true positives and true negatives.
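
A minimal sketch of the ROC curve and AU-ROC, assuming scikit-learn is available and that y and p are hypothetical arrays of true 0/1 labels and predicted probabilities:

import numpy as np
from sklearn.metrics import roc_curve, roc_auc_score

y = np.array([0, 0, 1, 1, 1, 0, 1, 0])                    # hypothetical true labels
p = np.array([0.1, 0.4, 0.35, 0.8, 0.7, 0.2, 0.9, 0.6])   # hypothetical predicted probabilities

fpr, tpr, thresholds = roc_curve(y, p)   # false/true positive rate at each threshold
auc = roc_auc_score(y, p)                # area under the ROC curve

# One common choice of tau: maximise sensitivity + specificity, i.e. maximise tpr - fpr
best_tau = thresholds[np.argmax(tpr - fpr)]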

17
Q

When does ROC curve not work

A

Sensitivity and specificity, in conjunction with the ROC curve, work well in mildly imbalanced situations.
However, because sensitivity and specificity are equally weighted, these metrics and the ROC curve can be misleading in very imbalanced applications and provide an overly optimistic view.

18
Q

What is the PR curve

A

The precision/recall (PR) curve plots precision versus recall as a function of the classification threshold τ.
The area under the PR curve (AU-PR) is related to the average precision over varying thresholds τ.
The larger the area under the curve, the better the classifier's ability to correctly identify the positive (rare) class.

19
Q

How to evaluate a classifier based on the PR curve

A

The larger the area under the curve, the better the classifier's ability to correctly identify the positive (rare) class.
A good classifier will have both high precision and high recall, with the PR curve pushed to the top right corner.
Precision is lower bounded by the prevalence of the positive class, which corresponds to performance no better than random guessing.
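
A minimal sketch of the PR curve and its area, again assuming scikit-learn and hypothetical y (true 0/1 labels) and p (predicted probabilities):

import numpy as np
from sklearn.metrics import precision_recall_curve, average_precision_score

y = np.array([0, 0, 1, 1, 1, 0, 1, 0])                    # hypothetical true labels
p = np.array([0.1, 0.4, 0.35, 0.8, 0.7, 0.2, 0.9, 0.6])   # hypothetical predicted probabilities

precision, recall, thresholds = precision_recall_curve(y, p)   # precision/recall over thresholds
au_pr = average_precision_score(y, p)                          # average precision, related to AU-PR
baseline = y.mean()   # prevalence of the positive class: the lower bound for precision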

20
Q

What is the F1 score

A

A single quantity used to summarise and compare a classifier's ability to predict positive cases is the F1 score, the harmonic mean of precision and recall.
Interpretation: the model's balanced ability both to detect positive cases (recall) and to be accurate with the cases it does detect (precision).
The score satisfies 0 ≤ F1 ≤ 1, with F1 = 1 denoting perfection.
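Explicitly: F1 = 2 × (Precision × Recall) / (Precision + Recall).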

21
Q

Summarise why we use ROC/ sensitivity+specificity

A

Mild to no class imbalance
“Cost” of false positive vs false negative
Interest also in the negative (0) cases

22
Q

Summarise why we use Precision, recall and F1

A

Positive cases are rare
Want to maintain low false positive rate
Does not consider (True) negative cases

23
Q

Summarise the reasons for using accuracy measure

A

Easy to interpret and use
Popular
Not always appropriate

24
Q

What measure is used typically for multinomial logistic regression

A

Typically simple accuracy is used but maybe also:
Class-specific sensitivity, i.e. the proportion of correctly classified observations for class k.
Class-specific false positive rate, i.e. the proportion of instances incorrectly assigned to class k.

25
Q

Why do we not make the model very complex so that it fits the data perfectly?

A

In theory, a model can be made arbitrarily complex, so as to fit perfectly the data it was estimated on.
High complexity will fit the model perfectly to the data but will generalise poorly, while low complexity may not fit the data well.
Although many error metrics will decrease as complexity increases, the variance of the predictions will increase. The variance of predictions is linked to whether you can trust your model out of sample, in reality.

26
Q

What happens to variance when overfitting occurs

A

Although many error metrics will decrease as complexity increases, the variance of the predictions will increase. The variance of predictions is linked to whether you can trust your model out of sample, in reality.

27
Q

Explain why we use test data

A

The data used to learn the model provide an optimistic view of the predictive performance, since the model can be made arbitrarily complex and the same data are used both to learn the model and to assess its predictive performance.
The data used to fit the model must not be used to evaluate its predictive performance; they can only be used to assess goodness-of-fit and the quality of the model.

28
Q

What does test data look like - define it

A

Data points not used as target cases in the fitting procedure. The test data set is used to estimate the generalisation error of the fitted model. Hence, these data points are used to test the fitted model.

Test data should be either external data with similar characteristics to data used for fitting or should be out of sample and split off from the training data.

29
Q

Define training data

A

Training set: data points whose target variable values are used in the model fitting procedure, that is, to learn the parameters of the model. These data points are employed to train the model.

30
Q

What is a key assumption about test and training data

A

The crucial assumption is that both the training and test data are generated by the same data generating process.

31
Q

What is data leakage and how do we avoid it

A

Data leakage occurs when information leaks between the training and test data, contaminating the evaluation (for example, computing preprocessing statistics on the full dataset before splitting).
When we split the data into training and test sets, we standardize the sets separately, after the split, to avoid data leakage.
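
A minimal sketch of splitting before standardizing, assuming scikit-learn; this particular variant scales the test set with training-set statistics (one common way to keep test-set information out of the fit):

import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

X = np.random.randn(100, 3)             # hypothetical feature matrix
y = np.random.randint(0, 2, size=100)   # hypothetical 0/1 labels

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=1)

scaler = StandardScaler().fit(X_train)   # statistics computed on the training data only
X_train_s = scaler.transform(X_train)
X_test_s = scaler.transform(X_test)      # no full-dataset statistics, so nothing crosses the split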

32
Q

In reality how do the loss functions for test and training data compare? Similarly for predictive performance measures

A

In expectation, the estimated out-of-sample loss is not smaller than the estimated training loss.
Similarly, in expectation, the estimated out-of-sample predictive performance is not greater than the estimated training performance.

33
Q

Define the validation set

A

Validation set: data points used to evaluate and compare the models, to see which is best. Each model in turn is evaluated on these data. The validation set acts as a surrogate for the test data.

34
Q

What is the summary of steps for evaluating models

A

Split the available data into: training data – validation data – test data.
Model training: use the training data to estimate the model parameters.
Model validation: use the validation data to perform model selection, i.e. find the model that maximises validation performance.
Model testing: use the test data to assess the predictive performance of the selected model and its ability to generalise to unseen inputs in real-world problems, i.e. estimate the generalisation predictive performance.

35
Q

What are resampling methods

A

Resampling methods involve repeatedly drawing random samples from a dataset and refitting and testing a model of interest on each sample in order to obtain additional information about a model.
Resampling is replicated a number of times to account for the sampling variability of the process

36
Q

Explain cross validation

A

Cross validation is a class of resampling methods that estimate the performance by holding out a subset of the training observations from the fitting process.
The model is trained on the “kept” observations (in-sample).
The model is applied to those held out observations (out-of-sample) to evaluate the predictive performance.
The process is replicated a number of times to account for the sampling variability
The estimate of the predictive performance is the average predictive performance computed over the replications.

37
Q

Explain hold out sample cross validation

A

The data are randomly split into training and hold-out samples.
The model is fitted on the training set, and then the predictive performance of the model is evaluated on the hold-out sample.
In the case of a single model the hold-out sample corresponds to the test set.
In the case of multiple models, the hold-out sample is split into validation and test set.
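
A minimal sketch of a hold-out split with scikit-learn, assuming X and y are a hypothetical feature matrix and label vector; the hold-out sample is further split into validation and test sets (giving the common 50/25/25 split):

import numpy as np
from sklearn.model_selection import train_test_split

X = np.random.randn(200, 4)             # hypothetical features
y = np.random.randint(0, 2, size=200)   # hypothetical labels

# 50% training, 50% hold-out; the hold-out is then split evenly into validation and test
X_train, X_hold, y_train, y_hold = train_test_split(X, y, test_size=0.5, random_state=1)
X_val, X_test, y_val, y_test = train_test_split(X_hold, y_hold, test_size=0.5, random_state=1)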

38
Q

What are common splits of data for hold out sample cross validation

A

50% training, 25% validation and 25% test

39
Q

Why do we recombine the validation data into training data after model selection?

A

After model selection, we combine the validation data back with the training data and retrain the chosen model: more data is always better. The test data are kept completely separate.

40
Q

Explain leave-one-out cross validation

A

The idea is that only one observation at a time is used for validation, while the remaining observations are employed for training; the data are split in a deterministic way.
The cross-validation estimate of the performance is the average performance over the N iterations
This method requires training N models
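
A minimal sketch of leave-one-out with scikit-learn and a hypothetical logistic regression classifier; N models are fitted, each evaluated on the single held-out observation:

import numpy as np
from sklearn.model_selection import LeaveOneOut
from sklearn.linear_model import LogisticRegression

X = np.random.randn(30, 2)   # hypothetical features
y = np.tile([0, 1], 15)      # hypothetical balanced 0/1 labels

scores = []
for train_idx, test_idx in LeaveOneOut().split(X):
    model = LogisticRegression().fit(X[train_idx], y[train_idx])   # train on N - 1 observations
    scores.append(model.score(X[test_idx], y[test_idx]))           # evaluate on the one held out
loo_accuracy = np.mean(scores)   # cross-validation estimate: average over the N iterations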

41
Q

Explain k-fold cross validation

A

Leave-one-out can be computationally expensive and time consuming, so k-fold cross-validation may be a better approach: it can save time and computational resources.
In k-fold cross-validation, chunks (folds) of data are used to evaluate the performance of a model instead of single observations.

The cross-validation estimate of the performance is the average performance over the K folds
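
A minimal sketch of K-fold cross-validation with scikit-learn (hypothetical data and classifier, K = 5); the estimate is the average performance over the K folds:

import numpy as np
from sklearn.model_selection import KFold, cross_val_score
from sklearn.linear_model import LogisticRegression

X = np.random.randn(100, 3)   # hypothetical features
y = np.tile([0, 1], 50)       # hypothetical balanced 0/1 labels

cv = KFold(n_splits=5, shuffle=True, random_state=1)
scores = cross_val_score(LogisticRegression(), X, y, cv=cv)   # one score per fold
kfold_accuracy = scores.mean()   # cross-validation estimate: average over the K folds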

42
Q

How does the size of the data determine which cross-validation to use

A

If a large number of observations is available, a simple hold-out procedure usually works fine; with smaller samples, leave-one-out or k-fold cross-validation should be used.

43
Q

What are the downsides of each cross validation procedure

A

The simple hold-out procedure can overestimate the generalization error.
Leave-one-out overcomes this issue, as prediction is evaluated on a single observation (in turn), so it tends not to overestimate the error. The estimated error is also less variable, since it is not affected by the randomness of the subsets, but the method is computationally intensive.
K-fold cross-validation reduces the computational complexity of leave-one-out, but trades this for some bias in the estimated error.