Exam stuff Flashcards

Question 1

Q

What is the worst method for initializing the factors in matrix factorization?

Answer

A

Setting all the factors to the same amount. The gradient of the error function will then be the same for all factors, effectively meaning we train the model with a single factor.

Question 2

Q

What is regularization?

Answer

A

A term added to prevent overfitting. Preventing the network to memorize the training data. The more data we have the lower the risk of overfitting

Question 3

Q

How is the similarity vs prob of detection plot changed when the number of bands b is changed but b*r stays constant? When only r changes?

Answer

A

The central point s_0 moves to the left the more b increases and p = 1 is reached at lower similarity. The central point s_0 moves more to the right when only r is increased.

Question 4

Q

Let us assume that a Matrix Factorization model with K factors has been trained for N users and M items. How much memory is needed to store the model parameters assuming each parameter is stored with P bytes? How large is P?

Answer

A

P with normal precision is 4 bytes (8 with double precision). The parameters need P(NK + MK) in memory.

Question 5

Q

How much memory is taken by the signature matrix?

Answer

A

Each element of sig matrix is -1 or 1. So each entry is 1 bit. So M = random projections * no of documents.

Exam stuff Flashcards

(5 cards)