IDL Flashcards
How do neurons learn?
Learning happens by changing the topology and the strength (weights) of connections
In “Vanilla” Recurrent Neural Network, what is the activation fn in output layer?
The output layer is activated by a softmax function, so it can represent a probability distribution over words
In vanilla recurrent network, what is s(t) and y(t)?
s(t) = f(U w(t) + W s(t-1)), where f is the sigmoid activation function
y(t) = g(V s(t)), where g is the softmax activation function
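The two update equations above can be sketched as a single time step in numpy; the dimensions and random weights here are hypothetical toy values, not anything from the card:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def softmax(x):
    e = np.exp(x - np.max(x))  # subtract max for numerical stability
    return e / e.sum()

def rnn_step(w_t, s_prev, U, W, V):
    """One time step of a vanilla RNN:
    s(t) = f(U w(t) + W s(t-1)),  y(t) = g(V s(t))."""
    s_t = sigmoid(U @ w_t + W @ s_prev)
    y_t = softmax(V @ s_t)
    return s_t, y_t

# Toy sizes: input 4, hidden 3, output 5 (hypothetical).
rng = np.random.default_rng(0)
U = rng.normal(size=(3, 4))
W = rng.normal(size=(3, 3))
V = rng.normal(size=(5, 3))
s, y = rnn_step(rng.normal(size=4), np.zeros(3), U, W, V)
```

Because g is a softmax, y always sums to 1, which is what lets y(t) be read as a distribution over the vocabulary.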
Define the pocket convergence theorem
The pocket algorithm converges with probability 1 to the optimal weights (the weight vector with the fewest training errors) even if the classes are not linearly separable
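A minimal sketch of the pocket algorithm: run ordinary perceptron updates, but keep ("pocket") the best weight vector seen so far. The XOR-style data below is a hypothetical example chosen because it is not linearly separable:

```python
import numpy as np

def pocket_algorithm(X, y, epochs=200, seed=0):
    """Perceptron updates, but remember the weights with the fewest
    training errors seen so far. X: (N, d) with a bias column;
    y: labels in {-1, +1}."""
    rng = np.random.default_rng(seed)
    w = np.zeros(X.shape[1])
    pocket_w, pocket_errors = w.copy(), np.inf
    for _ in range(epochs):
        preds = np.sign(X @ w)
        preds[preds == 0] = -1
        wrong = np.flatnonzero(preds != y)
        if wrong.size < pocket_errors:        # better than pocketed weights?
            pocket_w, pocket_errors = w.copy(), wrong.size
        if wrong.size == 0:
            break
        i = rng.choice(wrong)                 # perceptron update on one mistake
        w = w + y[i] * X[i]
    return pocket_w, pocket_errors

# XOR labels are not linearly separable; pocket still returns the best w found.
X = np.array([[1, 0, 0], [1, 0, 1], [1, 1, 0], [1, 1, 1]], float)  # bias + 2 inputs
y = np.array([-1, 1, 1, -1])
w, errs = pocket_algorithm(X, y)
```

The plain perceptron would cycle forever on this data; the pocket step is what guarantees a best-so-far answer.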
Define Cover’s theorem
It gives the probability that a randomly labeled set of N points in general position in d dimensions is linearly separable: P = C(N, d) / 2^N, where C(N, d) counts the linearly separable labelings (dichotomies)
Apply cover’s theorem in higher dimensional space
If the number of points in d dimensions is less than 2*d, they are almost always linearly separable (roughly two points per dimension is the capacity of a linear classifier)
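The two Cover's-theorem cards above can be checked numerically. A sketch of the function-counting form, assuming points in general position and affine separating hyperplanes (the function names here are my own):

```python
from math import comb

def cover_count(N, d):
    """Cover's function-counting theorem: the number of linearly
    separable dichotomies of N points in general position in d
    dimensions is 2 * sum_{k=0}^{d} C(N-1, k)."""
    return 2 * sum(comb(N - 1, k) for k in range(d + 1))

def p_separable(N, d):
    """Probability that a random labeling of N points is separable."""
    return cover_count(N, d) / 2 ** N
```

Two sanity checks consistent with the cards: for N <= d + 1 every labeling is separable (P = 1), and exactly at the capacity N = 2(d + 1) the probability is 1/2, so below roughly 2d points separability is almost certain for large d.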
What is Adaline? What is the activation function in Adaline?
Adaptive linear element.
The difference between Adaline and the standard (McCulloch–Pitts) perceptron is that in the learning phase, the weights are adjusted according to the weighted sum of the inputs (the net). In the standard perceptron, the net is passed to the activation (transfer) function and the function’s output is used for adjusting the weights.
The activation function is the identity function (the net is used directly during learning; thresholding happens only at classification time)
Why is LSTM better than a vanilla recurrent network?
Its gates let it learn which remote and recent information is relevant to the given task, and to use that information when generating the output
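The gating idea behind that answer can be sketched as one LSTM time step; the sizes and random weights are hypothetical, and biases are zero for brevity:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h_prev, c_prev, params):
    """One LSTM time step (sketch). The forget gate f chooses which parts of
    the cell state (long-term memory) to keep, the input gate i chooses what
    new information to write, and the output gate o chooses what to expose."""
    Wf, Wi, Wc, Wo = params
    z = np.concatenate([h_prev, x])   # stacked [hidden, input] vector
    f = sigmoid(Wf @ z)               # forget gate: keep remote information?
    i = sigmoid(Wi @ z)               # input gate: admit recent information?
    c_tilde = np.tanh(Wc @ z)         # candidate cell update
    c = f * c_prev + i * c_tilde      # new cell state
    o = sigmoid(Wo @ z)               # output gate
    h = o * np.tanh(c)                # new hidden state
    return h, c

# Toy sizes: input 3, hidden 2 (hypothetical).
rng = np.random.default_rng(1)
H, D = 2, 3
params = tuple(rng.normal(size=(H, H + D)) for _ in range(4))
h, c = lstm_step(rng.normal(size=D), np.zeros(H), np.zeros(H), params)
```

The additive update `c = f * c_prev + i * c_tilde` is what lets gradients flow across many time steps, which a vanilla RNN's repeated squashing of s(t) does not allow.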