Machine Learning (part of exam 2/3) Flashcards

Question

What are Neural Networks?

Answer 1

They are modelled off the human brain and nervous system. They can extract and detect pattern and generalise them to make predictions.

Answer 2

They explored the visual cortex' of cats in the 1960s and discovered line and edge detectors in the visual system. They received the Nobel prize in 1981.

Answer 3

Each neuron/cell has a center (nucleus), a body (soma), dendrites and axons. The dendrites and axons are on the edges of the cell and are responsible for receiving input (dendrites) and sending output (axons). When a dendrite connects with an axon of another cell, this connection is called a synapse. If the input reaches a certain strength, it fires, meaning the cell gets activated and passes the activation on to all connected cells along it's axons.

Answer 4

A neuron is called a node or unit. The input (in) is weighed: Sum of (all inputs a * all weights w). The output (a) is determined by the activation function (g). So we have: a = g(in) a = g (sum of (all inputs a * all weights w))

Answer 5

It's a single node that connects n input signals with one output signal, typically resulting in either -1 or +1. The activation function is a simple threshold function.

Answer 6

The boolean functions "and", "or" and "not" can be combined by linearly separating the result from the rest. More comple functions like "xor" can't be modeled.

Answer 7

It's a function that includes a learning rate (alpha) and an error calculation, which calculates the weights needed for given inputs and outputs.

Answer 8

The error of one training example x can be measured by the squared difference of the output value h(x) and the desired target value f(x). For evaluating the performance of a network we can try the network on a set of datapoints and average the values (values are calculated like above).

Answer 9

It's almost always a search problem. The machine has to search for the right values and it can use different methods for that, e.g. heuristic search functions, local search functions,..

Answer 10

It's the space of all possible values for the weights we're looking for.

Answer 11

We use an evaluation function that measures the error of each weight. So we search for weights that have a low error on the training data.

Answer 12

It's where the error measure for this example is minimal.

Answer 13

via Gradient Descent: go downhill in the direction where it is steepest (hill-climbing search).

Answer 14

Because it's not differentiable. Suggested answer: Because there is no function that can exactly show this kind of pattern (going directly from 0 to 1 without any "curves". (the Way from 0-1 is not defined). (??? honestly i don't get this at all but this is what he said in the lecture and what it says on the slides so ¯\_(ツ)_/¯ there is a lot of math involved here. If you wanna check it out for yourselves, it's in the video of the lecture of 13.11.2024 at approx 1:18:00. have fun!)

Answer 15

The Sigmoid Activation Function. Because it's easy to differentiate and non-linear.

Answer 16

Usually we compute the error of a function but if there is a hidden layer, we need to calculate the error of the output layers, then backpropagate that to the hidden layer. Delta is the error term of the output node times the derivation of its inputs. This is done to update the weights and minimize errors, so in a way, this is how a machine learns.

Answer 17

It's a neural networks with many hidden layers.

Answer 18

Image classification. The many layers are fully connected (every node on one level is connected to every node on the next level). There are layers specifially trained on recognizing i.e. edges, corners, diagonal lines, faces, trees,...

Answer 19

* a lot of training data (big data) * fast processing * unsupervised pre-training of layers

Answer 20

It's a technique in image processing, where for each pixel of an image a new feature is computed using a weighted combination of its neighborhood (n*n pixels around it). Depending on the weights this could blur the image or have it show only the edges, etc.

Answer 21

Using Deep Learning an input image can be altered to look like a reference image (ie look like as if Van Gogh had painted it).

Answer 22

They're methods for increasing robustness of a learning machine. "Invisible" changes are made to an image to confuse the machine, to make mistakes and further down the road, to make the machine recognize such mistakes.

Answer 23

They allow to process sequential data by feeding back the output of the network into the next input.

Machine Learning (part of exam 2/3) Flashcards

(48 cards)