Analysis Flashcards

Question 1

Q

In ROC (Receiver Operating Characteristic) analysis, what 2 measurements do we take for each threshold level?

Answer

A

Specificity and Sensitivity

OR

False Positive Rate and True Positive Rate

Question 2

Q

What does the ROC curve plot?

Answer

A

Specificity(x-axis) and Sensitivity (y-axis)

OR

False Positive Rate (x-axis) and True Positive Rate (y-axis)

Question 3

Q

What is Sensitivity?

Answer

A

TP/ (number of real positives)

classifies how good the model is at picking out positive values

Question 4

Q

What is Specificity?

Answer

A

TN/ (number of real negatives)

classifies how good the model is at picking out negative values

Question 5

Q

What does a good ROC curve look like?

Answer

A

Like a top-left corner

It should show a sharp rise in the True Positive Rate, without much increase in the False Positive Rate

This means it can classify a lot of positive samples correctly, without misclassifying negative samples

Question 6

Q

What metric do we use to show how good an ROC curve is?

Answer

A

We look at the area underneath the ROC curve,

the ideal case is an area of 1

Question 7

Q

Are KNNs good with large scale data?

Answer

A

No

There is a high computational complexity of neighbour search and distance calculation with lots of dimensions

Question 8

Q

Why do KNNs have a high memory cost?

Answer

A

They need to store all the training data.

Question 9

Q

Are KNNs good with dealing with imbalanced data?

Question 10

Q

Are KNNs sensitive to outliers?

Question 11

Q

What is the no free-lunch theorem?

Answer

A

This is more of a philosophy which states that:

Given no prior information to the learning task or data distribution

We can never say that any particular algorithm has a guaranteed advantage over any other.

Question 12

Q

What do we need to decide when using KNN?

Answer

A

The neighbour number K

The distance measure

Question 13

Q

Can a KNN handle both linear and non-linear data patterns?

Question 14

Q

Can we use Regularised Linear Least Squares with a small dataset?

Answer

A

Yes, good results can still be achieved

Question 15

Q

Does Linear Regression have a low computational cost?

Question 16

Q

In Regularised Linear Least Squares, what hyper parameters are there to set?

Answer

A

The regularisation parameter, lambda

and the form of Regularisation e.g. L2 or L1

Question 17

Q

Is Linear Regression sensitive to outliers?

Brainscape's Knowledge GenomeTM

Analysis Flashcards

Brainscape's Knowledge Genome^TM