08_neural networks Flashcards
How do kNN, linear models and tree-based models really learn?
not iteratively
knn: computed distances and compares distribution of unseen data points with distribution of seen data points
linear: fitted to seen data based on the task
tree-based: identify and memorize patterns relevant to the task
With which three components does the human brain work?
Neurons (nerve cells)
Dendrites (connects neurons)
Axons (long-distance connections)
–> neurons are inter-connected forming a dense network
How is information passed through neurons in the human brain?
through electrical signals
connected neurons absorb the incoming signals and process them. some of them will fire, but not all.
–> cascade of signals
What do we need for neural networks to represent the deep cascade of the layers of neurons in a human brain?
input data, which is processed in its hidden layers, and generates output data
What is a fully connected network?
a neural network where each neuron is connected to all neurons in the previous layers and all the neurons in the following layer
How can a fully connected neural network be characterized?
- number of layers (depth)
- number of neurons in each layer
- number of input variables (= number of neurons in the first layer)
- number of output variables (= number of neurons in the final layer)
How does a fully connected neural network work?
- vectorial input data provided to the network, one value per neuron in the input layer
- all inputs are seen by each neuron in the underlying layer
- each neuron will process the incoming info, firing (1) under some conditions and (0) otherwise
- repeat
- output is generated in the final layer
How does a neural network act in general terms?
acts as a function approximator
- any mathematical function can be approcimated
Can we implement artificial neural networks to learn specific tasks?
through connectionism, everything is connected with everything
What are two problems we have to solve before we can implement artificial neural networks?
1) how to implement neurons?
2) how to train the network?
How does a general neuron work?
number of inputs might differ from number of outputs - what is the function?
takes in a vector of values, processes them and returns a binary signal based on its learned behavior, which is then passed on to all neurons in the following layer
What is part of the function of a perceptron?
input variable x,
weight w
bias value b
–> if the resulting value is greater zero, perceptron neuron fires, otherwise not
step function is called activation function: introduces non-linearity into the output of the perceptron
What can a single perceptron be considered as?
a linear classifier
How do we train a perceptron?
perceptron learning rule, weights are adjusted by a step size that is called the LEARNING RATE
by iteratively running this algorithm over training data multiple times, weights can be learned so that the model perform properly
What is a major limitation of individual perceptrons?
inability to reproduce a logical exclusive-or (XOR) function!
- bc are simply linear functions
multi-layer perceptrons concatenate layers of perceptrons, which makes them much more powerful
What does MLP stand for?
multi-layer perceptron
What are MLPs?
simple feed-forward neural networks (info traverses graph in only one direction
- fully-connected
- can learn more complex relations from data than single perceptrons, each layer adds NON-LINEARITIES that increase the model’s capacity
- modern MLPs utilize additional layers and other non-linear activation functions that support the learning process
What is the function behind a neuron?
x * w + b > 0