Lecture 10: Performance Issues Flashcards

1
Q

We have learnt the building blocks of a learning algorithm. What is the next missing step?

A

Validating how accurately the algorithm predicts novel data, given the limited data we have.
To do this, we can split the data into training, validation and test sets, and then cross-validate.

2
Q

What are the common numbers of folds used when partitioning data for train/validation/test (TVT) splitting?

A

2-, 4-, 5- and 10-fold cross-validation (CV).
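
As a rough illustration of what a k-fold partition looks like, here is a minimal sketch in plain Python. The helper name `k_fold_indices` is hypothetical (it is not from the lecture), and the sketch assumes the data have already been shuffled, so folds are taken as contiguous blocks of indices.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_idx, val_idx) pairs for k-fold cross-validation.

    Hypothetical helper for illustration; assumes data are already shuffled.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so fold sizes differ by at most 1.
    base, extra = divmod(n_samples, k)
    start = 0
    for fold in range(k):
        size = base + (1 if fold < extra else 0)
        val_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, val_idx
        start += size


# Example: 10 samples, 5 folds. Each sample is used for validation exactly once.
folds = list(k_fold_indices(10, 5))
```

Each fold serves once as the validation set while the remaining folds are used for training; the k validation scores are then typically averaged.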

3
Q

What is the purpose of splitting the data three ways?

A

The test set contains data that the algorithm has never seen, during either training or model selection on the validation set. If the model predicts the test set well, we can say that it generalises well.
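
A minimal sketch of a three-way split, assuming a 60/20/20 partition and a fixed random seed. Both the proportions and the function name `train_val_test_split` are illustrative choices, not something prescribed by the lecture.

```python
import random


def train_val_test_split(data, val_frac=0.2, test_frac=0.2, seed=0):
    """Shuffle the data and split it into train, validation and test lists.

    Hypothetical helper; the default fractions are assumptions for illustration.
    """
    items = list(data)
    random.Random(seed).shuffle(items)  # seeded so the split is reproducible
    n_test = round(len(items) * test_frac)
    n_val = round(len(items) * val_frac)
    test = items[:n_test]                 # held out until the very end
    val = items[n_test:n_test + n_val]    # used for tuning and model selection
    train = items[n_test + n_val:]        # used to fit the model
    return train, val, test


train, val, test = train_val_test_split(range(100))
```

The test portion is only touched once, after all tuning on the validation portion is finished, so its score is an honest estimate of performance on unseen data.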

4
Q

What are the two most common evaluation metrics used to measure the performance of regression learning algorithms?

A

Mean squared error (MSE) and mean absolute error (MAE).
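
Both metrics average the prediction errors over the data set: MSE averages the squared errors, MAE averages their absolute values. A minimal sketch in plain Python, using made-up example values:

```python
def mean_squared_error(y_true, y_pred):
    """Average of squared differences; penalises large errors more heavily."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def mean_absolute_error(y_true, y_pred):
    """Average of absolute differences; less sensitive to outliers than MSE."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Made-up targets and predictions, for illustration only.
y_true = [3.0, 5.0, 2.0, 7.0]
y_pred = [2.0, 5.0, 4.0, 7.0]

mse = mean_squared_error(y_true, y_pred)   # (1 + 0 + 4 + 0) / 4 = 1.25
mae = mean_absolute_error(y_true, y_pred)  # (1 + 0 + 2 + 0) / 4 = 0.75
```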

5
Q

What are the evaluation metrics for classification learning algorithms?

A

Confusion matrix for binary classification, cost matrix for binary classification, detection error trade-off (DET) curve, area under the receiver operating characteristic curve (ROC AUC), Gini coefficient.
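
To make a few of these concrete, here is a small sketch that computes the four confusion-matrix counts, the ROC AUC and the Gini coefficient for a binary classifier. The labels, scores and 0.5 threshold are made up for illustration. The AUC is computed by pairwise comparison (the fraction of positive/negative pairs in which the positive example scores higher), and the Gini coefficient uses the common relation Gini = 2 * AUC - 1.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, fp, fn, tn) for binary labels where 1 is the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def roc_auc(y_true, scores):
    """AUC as the probability that a random positive outranks a random negative."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    # Ties count as half a win.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Made-up labels and classifier scores, for illustration only.
y_true = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
auc = roc_auc(y_true, scores)
gini = 2 * auc - 1
```

The pairwise AUC is quadratic in the number of examples, which is fine for a sketch but would normally be replaced by a sort-based computation on large data sets.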

6
Q

What are some of the trade-offs involved in achieving high software quality?

A

Computational efficiency, maintainability, flexibility, extensibility, usability, etc.
