lecture 5 - ANNs Flashcards

1
Q

Santiago Ramon y Cajal

A

found that the brain is not a single continuous network, but is made up of discrete units called neurons

2
Q

structure of neurons

A

is related to their function in information processing: mapping inputs to outputs

3
Q

neural network in the brain

A
  • neurons don’t work in isolation, but are connected, forming a network
  • the input of one neuron is the output of another
4
Q

visual information in the brain

A
  • information flows through different levels of networks
  • in the visual system this creates a hierarchy
  • lower levels are closer to the retina, higher levels are closer to movement-output or memory
5
Q
  • biological intelligence → artificial intelligence
  • can we copy the architecture of biological intelligence to build information-processing systems?
A
  • real neurons are too varied and too complicated to copy fully
  • a good model is simple, but retains the basic characteristics of information processing
  • this abstraction ignores the biological complexities but keeps the essence of how neurons map inputs to outputs
6
Q

how signals arrive and are processed in neurons

A
  • signals arrive in dendrites
  • inputs can be excitatory (encourage firing) or inhibitory (suppress activity)
  • this leads to EPSPs and IPSPs
  • the neuron integrates all incoming EPSPs and IPSPs in the soma to decide whether to fire an action potential
7
Q

excitatory post-synaptic potentials (EPSPs)

A

increase the likelihood that the neuron will fire an action potential

8
Q

inhibitory post-synaptic potentials (IPSPs)

A

decrease the likelihood that the neuron will fire an action potential

9
Q

dendritic mechanisms

A
  1. spatial summation
  2. temporal summation
  3. excitation vs inhibition
  4. attenuation
  5. integration at the soma
10
Q

dendritic mechanism: spatial summation

A
  • EPSPs combine across space
  • meaning that multiple EPSPs from different synapses on the dendritic tree can combine as they travel toward the soma
11
Q

dendritic mechanism: temporal summation

A
  • EPSPs combine across time
  • EPSPs from the same synapse can combine if they arrive in quick succession
12
Q

dendritic mechanism: excitation vs inhibition

A
  • EPSPs and IPSPs interact in the dendritic tree
  • so, IPSPs can cancel EPSPs
13
Q

dendritic mechanism: attenuation

A
  • potential changes attenuate as they travel from dendrites to soma
  • potentials lose strength due to the physical properties of the dendrites (e.g., resistance).
  • the farther the synapse is from the soma, the weaker its signal when it arrives.
14
Q

dendritic mechanism: integration at the soma

A
  • action potentials are initiated at the soma
  • the soma integrates all incoming signals (spatially and temporally summed EPSPs and IPSPs).
  • if the combined signal reaches a threshold, the neuron generates an action potential, which travels down the axon to communicate with other neurons
15
Q

What is the core computational principle that the neuron implements?

A
  • input-output transform
  1. takes multiple incoming signals (inputs) from dendrites
  2. processes (sums) these signals in the soma
  3. produces an output (an action potential) that travels down the axon to other neurons
16
Q

what do we throw away to model neurons

A
  1. dynamics
  2. temporal integration
  3. spatial complexity
17
Q

similarity between a biological neuron and perceptron

A

both transform input to output

18
Q

what does a perceptron do

A
  1. takes real valued inputs
  2. scales each input by a (synaptic) weight
  3. integrates the inputs by calculating the sum of the weighted inputs (results in a dot product)
  4. passes this through an activation function that compares activation (dot product) to a threshold θ
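A minimal sketch of these four steps in Python (the inputs, weights, and threshold below are illustrative values, not from the lecture):

```python
import numpy as np

def perceptron(x, w, theta):
    """Weighted sum of the inputs compared against a threshold theta."""
    a = np.dot(w, x)              # steps 2-3: scale each input and sum (dot product)
    return 1 if a > theta else 0  # step 4: threshold activation

# illustrative example with two real-valued inputs
x = np.array([0.8, 0.3])
w = np.array([1.0, 1.0])
print(perceptron(x, w, theta=0.5))  # -> 1, because 1.1 > 0.5
```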
19
Q

perceptron in one sentence

A

computes a weighted sum of inputs and compares it to a threshold to produce a binary output

20
Q

perceptron threshold

A
  • if the weighted sum of the input exceeds the threshold, the output is 1
  • otherwise it is 0
21
Q

perceptron weights

A

control how important each input is

22
Q

perceptron bias

A

adjusts the flexibility or strictness of the decision boundary (threshold)

23
Q

similarity between a weight and a synapse

A

the value of a weight is analogous to the strength of a synapse in the dendritic tree of a biological neuron

24
Q

similarity between an activation function and a neuronal mechanism

A

the decision to fire or not

25
Q

what type of model is the perceptron

A
  • a classifier that decides whether some pattern of inputs is present or not
  • gives a binary output and can only solve linearly separable problems
26
Q

what determines the decision of a perceptron

A

the weights (parameters)

27
Q

similarity between a perceptron and a receptive field

A
  • a biological neuron fires if there is a specific orientation (pattern) in the input
  • a perceptron fires when there is a specific pattern present in the input data
28
Q

possible functions of a perceptron

A
  1. boolean OR function
  2. boolean AND function
  3. emphasize one input over another
29
Q

boolean OR function

A
  • either x_1 or x_2 or both are activated
  • we set w_0 = θ
  • a low θ (0.5) ensures any positive input triggers a response
30
Q

boolean AND function

A
  • both x_1 and x_2 are activated
  • we set w_0 = θ
  • a higher θ (1.5) ensures a trigger only when both inputs are positive (stricter)
31
Q

emphasize one over the other

A
  • when you want to know what happens in x_1 but not necessarily in x_2
  • we set w_0 = θ and adjust the other weights
  • increase the weight of the more critical input while keeping a suitable threshold
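Assuming the thresholded perceptron sketched earlier, the OR, AND, and "emphasize one input" cases from the last three cards can be written out with the θ values given; the emphasis weights below are an illustrative choice:

```python
import numpy as np

def perceptron(x, w, theta):
    return 1 if np.dot(w, x) > theta else 0

inputs = [np.array(p) for p in [(0, 0), (0, 1), (1, 0), (1, 1)]]

# OR: low threshold, so any positive input triggers a response
print([perceptron(x, np.array([1, 1]), theta=0.5) for x in inputs])      # [0, 1, 1, 1]

# AND: higher threshold, so only both inputs together exceed it
print([perceptron(x, np.array([1, 1]), theta=1.5) for x in inputs])      # [0, 0, 0, 1]

# emphasize x_1: a larger weight on x_1 lets it cross the threshold alone
print([perceptron(x, np.array([2.0, 0.5]), theta=1.5) for x in inputs])  # [0, 0, 1, 1]
```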
32
Q

multidimensional perceptron

A
  • a perceptron that handles inputs with more than two dimensions
  • e.g., a 10×10 image gives 100 inputs x_i
33
Q

How is a weight matrix structured in a multidimensional perceptron?

A
  • The weight matrix W_ji maps inputs to outputs.
  • For example, in a 10x10 image with 26 outputs, W_ji would be a 26×100 matrix where each weight determines the contribution of a pixel to a specific output.
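A minimal sketch of this example (100 pixel inputs, 26 outputs), assuming the output activations are simply a matrix-vector product:

```python
import numpy as np

n_inputs, n_outputs = 100, 26             # 10x10 image, 26 output neurons (e.g. letters)
W = np.random.randn(n_outputs, n_inputs)  # W_ji: weight from pixel i to output neuron j
x = np.random.rand(n_inputs)              # a flattened 10x10 image

a = W @ x            # activations of all 26 output neurons at once
print(a.shape)       # (26,)
print(W[3, 57])      # W_{3,57}: weight from input pixel 57 to output neuron 3
```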
34
Q

What does W_{3,57} represent in the weight matrix?

A

It is the weight connecting input pixel 57 to output neuron 3 (which could represent, for example, the letter “C”).

35
Q

training a (multidimensional) perceptron

A
  • done through gradient descent
  • adjusts weights by minimizing a cost function like MSE that quantifies how wrong the model is
36
Q

gradient descent: activation function

A
  • sigmoid function
  • because it is smooth and differentiable, mapping inputs to a range between 0 and 1, making it suitable for calculating gradients during optimization
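A sketch of the sigmoid and its derivative; the derivative g′(a) = g(a)(1 − g(a)) is what the update rule in the following cards uses:

```python
import numpy as np

def sigmoid(a):
    """Smooth, differentiable squashing of the activation into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-a))

def sigmoid_deriv(a):
    """g'(a) = g(a) * (1 - g(a)), used when computing the gradient."""
    g = sigmoid(a)
    return g * (1.0 - g)
```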
37
Q

What is the goal of gradient descent in training a perceptron?

A

To minimize the error E by adjusting weights w_ji in the direction that reduces the error.

38
Q

How is the change in weight Δw_ji calculated during gradient descent?

A
  • change in weight w_ji (connecting input i to output j) = −(learning rate) × (derivative of the activation function at a_j) × (error at the output neuron) × (input from the connected neuron)
  • Δw_ji = −η ⋅ g′(a_j) ⋅ (y_j − t_j) ⋅ x_i
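A small worked example of this formula; the learning rate, activation, target, and input are made-up numbers for illustration:

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

eta = 0.1           # learning rate (eta)
x_i = 1.0           # input from the connected neuron
a_j = 0.5           # activation (weighted sum) at output neuron j
y_j = sigmoid(a_j)  # output, about 0.62
t_j = 1.0           # target

g_prime = y_j * (1.0 - y_j)              # g'(a_j), about 0.24
dw = -eta * g_prime * (y_j - t_j) * x_i  # about +0.009
print(dw)  # positive: the output was too low, so the weight is increased
```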
39
Q

gradient descent: a weight from j to i is updated, proportional to

A
  1. η: step size
  2. (y-t): how wrong the output of the neuron was
  3. g’(a): how much a weight change will affect the output
  4. x: whether there was any input at all
40
Q

gradient descent: increasing or decreasing the weight

A
  • if y_j > t_j: decrease the weight
  • if y_j < t_j: increase the weight
41
Q

gradient descent: delta rule

A
  • the local error contribution
  • δ_j = g′(a_j) ⋅ (y_j − t_j)
  • similar to the error term used in both reinforcement learning and fitting algorithms
42
Q

gradient descent: update recipe

A
  1. present the input, compute the output y = g(w·x)
  2. compare the output to the target to compute the error
  3. the ‘local contribution’ to the error is the δ of the node: δ_j = g′(a_j) ⋅ (y_j − t_j)
  4. use δ_j and x_i to update the weight slightly: Δw_ji = −η ⋅ δ_j ⋅ x_i
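The recipe as a complete training loop, under the same assumptions (sigmoid activation, a single output neuron, and the OR function as made-up training data; the bias is handled as a constant input x_0 = 1):

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

# made-up training data: the OR function, with a constant bias input x_0 = 1
X = np.array([[1, 0, 0], [1, 0, 1], [1, 1, 0], [1, 1, 1]], dtype=float)
T = np.array([0, 1, 1, 1], dtype=float)

w = np.zeros(3)
eta = 0.5

for epoch in range(2000):
    for x, t in zip(X, T):
        a = np.dot(w, x)                 # 1. present input, compute activation
        y = sigmoid(a)                   #    output y = g(w·x)
        delta = y * (1.0 - y) * (y - t)  # 3. local error: delta = g'(a) * (y - t)
        w -= eta * delta * x             # 4. delta_w = -eta * delta * x

print(np.round([sigmoid(np.dot(w, x)) for x in X], 2))  # outputs approach [0, 1, 1, 1]
```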
43
Q

perceptron problem

A

A perceptron cannot solve problems that are not linearly separable, such as the Boolean XOR function

44
Q

what is XOR

A
  • the output is 1 only when one input is 1 and the other is 0.
  • when both inputs are 0 or both are 1, the output is 0.
45
Q

Why can’t perceptrons solve the XOR problem?

A
  • XOR requires separating data points in a way that a single line cannot achieve
  • e.g., (0,0) and (1,1) belong to one class (y=0), while (0,1) and (1,0) belong to the other class (y=1)
  • these points cannot be divided into two groups by a single straight line, because the two classes sit at diagonally opposite corners
46
Q

What is the solution to the perceptron’s inability to solve XOR?

A
  • adding a hidden layer that introduces additional intermediate nodes
  1. e.g., h1 = ‘either input active’ (OR) and h2 = ‘both inputs active’ (AND), with the output responding to h1 but not h2; this transforms the input into a space where XOR is linearly separable (a hand-wired sketch follows below)
  2. adding a hidden layer with alternative configurations also works (e.g., h1 responds to x1 but not x2, h2 responds to x2 but not x1, and the output responds to either h1 or h2)
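A hand-wired sketch of configuration 1 (h1 ≈ OR, h2 ≈ AND, output fires for h1 but not h2); the particular weights and thresholds are illustrative choices, not the only ones that work:

```python
def step(a, theta):
    return 1 if a > theta else 0

def xor_net(x1, x2):
    h1 = step(x1 + x2, 0.5)    # hidden node 1: "either" (OR)
    h2 = step(x1 + x2, 1.5)    # hidden node 2: "both" (AND)
    return step(h1 - h2, 0.5)  # output: responds to h1 but not h2

for x1, x2 in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x1, x2, xor_net(x1, x2))  # outputs 0, 1, 1, 0
```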
47
Q

What is the role of intermediate nodes in hidden layers?

A

They transform inputs into a higher-dimensional feature space where the problem becomes linearly separable.

48
Q

Multi-layer networks

A
  • are universal function approximators, not just classifiers
  • with the right configuration (architecture and weights) they can approximate any input-output mapping
  • the problem now is to find the right configuration