lecture 6: ridge regression and polynomial regression Flashcards

Question 1

Q

learning of a vectored function is the same as a scalar function apart from

Answer

A

the output of w, it is a matrix instead of a vector

Question 2

Q

using matrix notation, what is the sum of squared error cost function

Answer

A

E = WX - Y

capital letters represent matrices

Question 3

Q

the formulas for W are the same as those for w, TRUE or FALSE

Question 4

Q

recap: what type of problem does it correspond to when yᵢ is continuous valued vs discrete valued

Answer

A

regression for continuous and classification for discrete

Question 5

Q

what are the 2 linear methods for classification

Answer

A

binary classification and multi-category classification

Question 6

Q

what are the 2 classifications for binary classification?

Answer

A

yᵢ ∈ {-1,1}

for the value(s) of y derived, take the sign as the answer

Question 7

Q

what is the method of assignment used for multi-category classification

Answer

A

one-hot encoding

Question 8

Q

with the final y matrix obtained, how do we classify each item?

Answer

A

using argmax, for each row the column with the largest number determines the class label
if the largest number is in column 1, item is class 1

Question 9

Q

why do we use ridge regression?

Answer

A

we cannot guarantee that XᵀX is invertible, ridge regression ensures that whatever is in the bracket is invertible by adding an identity matrix with a minimised coefficient 𝜆

Question 10

Q

what is the term added for minimisation of the coefficient of identity matrix 𝜆

Answer

A

𝜆wᵀw

Question 11

Q

ridge regression in primal form

hint: similar to over determined system

Answer

A

w = (XᵀX + 𝜆I)⁻¹ Xᵀy

Question 12

Q

ridge regression in dual form

hint: similar to under determined system

Answer

A

w = Xᵀ(XXᵀ + 𝜆I)⁻¹ y

Question 13

Q

why do we use polynomial regression

Answer

A

to try and get a better fit for the data

Question 14

Q

generally, for high dimensional problems, polynomials of order larger than what is seldom used

lecture 6: ridge regression and polynomial regression Flashcards

(14 cards)