B06 K-NN Flashcards
Some examples of Parametric models are:
- Linear Regression
- Logistic Regression
- Naive Bayes
- Simple Neural Networks
Some examples of Non-Parametric models are:
- k-Nearest Neighbor
- Support Vector Machines
- Decision Trees
A learning model that summarizes data with a set of
parameters of fixed size (independent of the number of
training examples) is called a __________. No
matter how much data you throw at a ___________, it won't change its mind about how many
parameters it needs.
parametric model
parametric model
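The fixed-parameter idea can be sketched with simple linear regression, a parametric model: however many training examples it sees, it always summarizes them with exactly two numbers. This is a minimal illustrative sketch in plain Python (the closed-form least-squares fit), not any particular library's implementation.

```python
def fit_line(xs, ys):
    # Simple linear regression y = a*x + b via closed-form least squares.
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    a = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) \
        / sum((x - mean_x) ** 2 for x in xs)
    b = mean_y - a * mean_x
    # Fixed-size parameter set: always exactly (a, b), regardless of n.
    return a, b

small = fit_line([1, 2, 3], [2, 4, 6])              # 3 examples
large = fit_line(list(range(1, 100)),
                 [2 * x for x in range(1, 100)])    # 99 examples
# Both fits are described by the same two parameters (slope, intercept).
```

Throwing 99 examples at the model instead of 3 changes the parameter *values* it estimates, never the *number* of parameters it keeps.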
Strengths of Parametric Models?
- Simpler: These methods are easier to understand, and the results are easy to interpret.
- Speed: Parametric models are usually very fast to train.
- Less Data: They do not require as much training data and can work well even if the fit to the data is not perfect.
Weaknesses of Parametric Models?
- Constrained: By choosing a functional form, these methods are highly constrained to the specified form.
- Limited Complexity: The methods are better suited to simpler problems.
- Poor Fit: In practice, the methods may not always match the underlying mapping function.
A learning model that does not make strong
assumptions about the form of the mapping function is
called a ___________. By not making
assumptions, ____________ are free to learn
any functional form from the training data.
non-parametric model
Weaknesses of Non-Parametric Models?
- More Data: Require a lot more training data to estimate the mapping function.
- Slower: A lot slower to train, as they often have far more parameters to train.
- Overfitting: Have a higher risk of overfitting against the training data.
Strengths of Non-Parametric Models?
- Flexibility: Capable of fitting a large number of functional forms.
- Power: No assumptions (or weak assumptions) about the underlying function.
- Performance: Can result in higher-performance models for prediction.
A class of non-parametric learning methods that do not generate a model but instead make use of verbatim training data for classification?
Lazy learners, instance-based learners, or
rote learners
The _________________algorithm gets its name
from the fact that it classifies an unlabeled observation
based on information about the _______ labeled
________ of the observation.
k-Nearest Neighbor (k-NN)
k-nearest
neighbors
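The classification rule described above (find the k nearest labeled neighbors, then take a majority vote) can be sketched in a few lines of plain Python. The training points, labels, the choice of k, and the Euclidean distance metric are all illustrative choices, not fixed by the algorithm.

```python
import math
from collections import Counter

def knn_classify(train, query, k=3):
    # train: list of (feature_tuple, label) pairs; query: feature tuple.
    # Sort the training data by Euclidean distance to the query point.
    ranked = sorted(train, key=lambda item: math.dist(item[0], query))
    # Majority vote among the labels of the k nearest neighbors.
    top_k_labels = [label for _, label in ranked[:k]]
    return Counter(top_k_labels).most_common(1)[0][0]

train = [((1.0, 1.0), "A"), ((1.2, 0.8), "A"),
         ((5.0, 5.0), "B"), ((5.2, 4.8), "B"), ((4.9, 5.1), "B")]
print(knn_classify(train, (1.1, 0.9), k=3))  # → A (2 of 3 neighbors are A)
```

Note that "training" here stores the data verbatim and does no work at all, which is exactly why k-NN is called a lazy or instance-based learner.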
Choosing the right K
A ____ reduces the impact of
noisy data but increases the risk of
ignoring important patterns
large K
Choosing the right K
A _______ makes the model
susceptible to noise and/or outliers.
small K
Note that the ______ the dataset, the ____
important the difference between two choices
for k becomes.
larger
less
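The trade-off between a small and a large K can be sketched with a tiny example: a cluster of "A" points containing one mislabeled "B" outlier. The data and labels below are made up purely for illustration.

```python
import math
from collections import Counter

def knn(train, q, k):
    # k-NN by Euclidean distance with majority vote.
    near = sorted(train, key=lambda t: math.dist(t[0], q))[:k]
    return Counter(label for _, label in near).most_common(1)[0][0]

# A cluster of "A" points with one noisy, mislabeled "B" point.
train = [((0.0, 0.0), "A"), ((0.2, 0.1), "A"), ((0.1, 0.3), "A"),
         ((0.3, 0.2), "A"), ((0.05, 0.05), "B")]

q = (0.06, 0.06)              # query lands right next to the noisy point
print(knn(train, q, k=1))     # small K: the single noisy neighbor decides → B
print(knn(train, q, k=5))     # large K: the vote is 4 A vs. 1 B → A
```

With k=1 the lone outlier flips the prediction; with k=5 the vote smooths it out, at the cost that a genuinely small local pattern would be outvoted the same way.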
Strengths of K-NN?
-Simple and effective.
-Makes no assumptions about the
underlying data distribution.
-Training phase is very fast
Weaknesses of K-NN?
-Does not produce a model.
-The selection of an appropriate
k is often arbitrary.
-Rather slow classification
phase.
-Does not handle missing, outlier
and nominal data well without
pre-processing.
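The pre-processing weakness can be sketched with feature scaling: without it, a large-range feature dominates the Euclidean distance and effectively silences the others. The feature names below are illustrative, and min-max rescaling is just one common choice.

```python
import math

def minmax_scale(rows):
    # Rescale each feature column to the [0, 1] range.
    cols = list(zip(*rows))
    lo = [min(c) for c in cols]
    hi = [max(c) for c in cols]
    return [tuple((v - l) / (h - l) for v, l, h in zip(row, lo, hi))
            for row in rows]

# Two points with features (rating on a 0-1 scale, income in dollars).
raw = [(0.2, 30000.0), (0.9, 31000.0)]
print(math.dist(*raw))                 # ≈ 1000: income swamps the rating
print(math.dist(*minmax_scale(raw)))   # ≈ 1.41: both features count equally
```

This is why k-NN pipelines typically rescale numeric features (and encode nominal ones) before computing any distances.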