Ch. 10 Flashcards

1
Q

K-nearest neighbors algorithm

A

You get a new data point
Look at its X nearest neighbors
Classify the new data point based on its nearest neighbors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Normalization exa

A

Look at a users average rating ie 3.3 and raise or lower their ratings a little until they reach an average value ie 3.5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Classification

A

Categorization into a group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Regression

A

Predicting a response (like a number)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Cosine similarity

A

An alternative to the distance formula that compares the angles of vectors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

OCR

A

Optical character recognition
Having a computer recognize characters off a page ie scanning a book

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What do ocr algorithms typically measure

A

Lines
Points
Curves

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Training

A

The first phase of ocr where you go through images of numbers and extract features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Naive Bayes Classifier

A

Spam filters
Break a sentence into words to estimate the probability of a word appearing in a spam email

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Feature

A

An aspect of data you will compare. Ie size, color, weight, a numeric rating etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly