MODULE 2 S2.1 Flashcards

k-NN

1
Q

Strengths / Advantages of k-NN

A
  • Easy to understand
  • Often works well without any special adjustments
  • Suitable as a first, baseline model
2
Q

When considering more than one neighbor, we use _________ to assign a label.

A

voting

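A minimal sketch of this voting step, using scikit-learn's KNeighborsClassifier (the training points below are invented for illustration):

  from sklearn.neighbors import KNeighborsClassifier

  # Toy 1-D training set (illustrative values only)
  X_train = [[0.0], [1.0], [2.0], [9.0], [10.0]]
  y_train = [0, 0, 0, 1, 1]

  # With k=3, the three nearest neighbors vote on the label
  clf = KNeighborsClassifier(n_neighbors=3)
  clf.fit(X_train, y_train)
  print(clf.predict([[1.5]]))  # neighbors 0.0, 1.0, 2.0 all vote class 0 -> [0]
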
3
Q

Building the model consists only of storing the training dataset.

A

k-Nearest Neighbors (k-NN)

4
Q

We import the _____________ class for the k-NN regression variant.

A

KNeighborsRegressor

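A minimal sketch of the regression variant (training data invented for illustration); the prediction is the mean of the targets of the k nearest neighbors:

  from sklearn.neighbors import KNeighborsRegressor

  # Toy 1-D regression data (illustrative values only)
  X_train = [[0.0], [1.0], [2.0], [3.0]]
  y_train = [0.0, 1.0, 2.0, 3.0]

  reg = KNeighborsRegressor(n_neighbors=2)
  reg.fit(X_train, y_train)
  print(reg.predict([[1.4]]))  # mean of the two nearest targets (1.0, 2.0) -> [1.5]
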
5
Q

Weaknesses / Disadvantages of k-NN

A
  • If the number of features or samples is large, prediction is slow
  • Careful data preprocessing is important
  • Does not work well with sparse datasets (where most feature values are 0)
6
Q

In k-NN, to make a prediction for a new data point, the algorithm finds the closest data points in the training dataset—its ________________

A

nearest neighbors

7
Q

T/F In its SIMPLEST version, the k-NN algorithm only considers exactly one nearest neighbor, which is the closest training data point to the point we want to make a prediction for.

A

TRUE

8
Q

The R^2 score, also known as the _______________

A

Coefficient of Determination

9
Q

It is the distance measure k-NN uses by default.

A

Euclidean distance

10
Q

It is arguably the simplest machine learning algorithm.

A

k-Nearest Neighbors (k-NN)

11
Q

T/F In its SIMPLEST version, the k-NN algorithm can consider more than one nearest neighbor.

A

FALSE (exactly 1)

12
Q

It is a measure of goodness of fit for a regression model, and usually yields a score between 0 and 1.

A

R^2 score (coefficient of determination)

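In scikit-learn, the score method of KNeighborsRegressor returns this R^2 value; a minimal sketch with made-up numbers:

  from sklearn.neighbors import KNeighborsRegressor

  # Toy data: train on three points, test on two in-between points
  X_train, y_train = [[0.0], [2.0], [4.0]], [0.0, 1.9, 3.9]
  X_test, y_test = [[1.0], [3.0]], [1.1, 3.2]

  reg = KNeighborsRegressor(n_neighbors=2).fit(X_train, y_train)
  print(reg.score(X_test, y_test))  # R^2 close to 1.0 here -> a good fit
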
13
Q

T/F Instead of considering only the closest neighbor, we can also consider an arbitrary number, k, of neighbors.

A

TRUE

14
Q

Parameters of the k-NN Classifier

A
  • The number of neighbors (k)
  • How you measure distance between data points
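Both parameters map directly onto scikit-learn's constructor arguments; a minimal sketch (the values are illustrative):

  from sklearn.neighbors import KNeighborsClassifier

  # n_neighbors sets k; metric sets the distance measure.
  # The default metric is 'minkowski' with p=2, i.e. Euclidean distance.
  clf = KNeighborsClassifier(n_neighbors=5, metric='euclidean')
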
15
Q

T/F Predicting worse than the average of the targets can result in a negative R^2 score.

A

TRUE

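A quick numeric check of this with sklearn.metrics.r2_score (numbers made up for illustration): predicting the mean everywhere gives R^2 = 0, and predicting worse than that drives R^2 below zero.

  from sklearn.metrics import r2_score

  y_true = [1.0, 2.0, 3.0]
  print(r2_score(y_true, [2.0, 2.0, 2.0]))  # predicts the mean -> 0.0
  print(r2_score(y_true, [5.0, 5.0, 5.0]))  # worse than the mean -> -13.5
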
16
Q

T/F In k-NN, High Model Complexity is underfitting.

A

FALSE

17
Q

T/F k-NN makes a prediction for a new data point by finding exact matches in the training dataset.

A

FALSE (it finds the closest points, not exact matches)

18
Q

In k-NN, Low Model Complexity is:

A

Underfitting

19
Q

T/F In k-NN, when you choose a small value of k (e.g., k=1), the model becomes more complex.

A

TRUE
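
A minimal sketch of that relationship, sweeping n_neighbors on a built-in scikit-learn dataset (the dataset and split are just for illustration):

  from sklearn.datasets import load_breast_cancer
  from sklearn.model_selection import train_test_split
  from sklearn.neighbors import KNeighborsClassifier

  X, y = load_breast_cancer(return_X_y=True)
  X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

  # k=1 is the most complex model: it memorizes the training set
  # (perfect training accuracy) but may generalize worse than larger k
  for k in (1, 3, 15):
      clf = KNeighborsClassifier(n_neighbors=k).fit(X_train, y_train)
      print(k, clf.score(X_train, y_train), clf.score(X_test, y_test))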

20
Q

T/F There is a regression variant of the k-nearest neighbors algorithm.

A

TRUE

21
Q

In k-NN, High Model Complexity is:

A

Overfitting

22
Q

T/F The ‘k’ in k-nearest neighbors refers to the new closest data point.

A

FALSE (k is the number of neighbors considered)

23
Q

T/F In k-NN, Euclidean distance is the distance measure used by default.

A

TRUE