Principles of deep learning in artificial networks Flashcards
Deep learning approach (1)
Learn from experience (machine learning):
- No formal rules of transformations
- No ‘knowledge base’
- No logical inference
Deep learning approach (2)
Process inputs through a hierarchy of concepts:
- Each concept defined by its relationship to simpler concepts
- So, build complicated concepts out of simpler concepts
Course goal (1)
Explore the relationship between cognitive science and AI
Course goal (2)
Focus on deep learning in artificial machine learning networks and comparison to biological systems
- Which biological processes do deep networks imitate?
- What is missing in artificial networks?
- What might make AI/machine learning more like biological intelligence/learning?
Course goal (3)
Become familiar with the use of AI in cognitive science research
Course goal (4)
Build some deep learning networks to do human-like tasks
Why deep learning?
AI has made great advances in tasks that are:
- Described by formal mathematical rules
- Relatively simple for computers
- Difficult for humans
AI has been less effective in tasks that are:
- Hard/impossible to describe using formal mathematical rules
- BUT easy for humans to perform (intuitive or automatic)
Simulation of neural computation
Representation & features
Machine learning performance depends on the …
representation of the case to be classified
what information the computer is given about the situation
Representation & features
Each piece of input information is known as a …
feature
(The same feature can be represented in different formats, and it is often easy to convert between them. The chosen format strongly affects the difficulty of the task.)
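A minimal sketch of how the chosen format can change task difficulty. The toy points, the function names and the threshold of 1.0 are assumptions made for illustration, not part of the course material. In Cartesian format the two classes (inner cluster vs. outer ring) cannot be split by a single straight line, but after converting to polar format a single threshold on the radius is enough.

```python
import math

def to_polar(x, y):
    """Convert a Cartesian point (x, y) to polar form (radius, angle in radians)."""
    return math.hypot(x, y), math.atan2(y, x)

# Hypothetical toy data: class 0 lies near the origin, class 1 on an outer ring.
inner = [(0.5, 0.0), (0.0, -0.5), (-0.3, 0.4)]
outer = [(2.0, 0.0), (0.0, 2.0), (-1.6, -1.2)]

def classify_by_radius(point, threshold=1.0):
    """In polar format, one threshold on the radius separates the two classes."""
    radius, _angle = to_polar(*point)
    return 1 if radius > threshold else 0

inner_labels = [classify_by_radius(p) for p in inner]
outer_labels = [classify_by_radius(p) for p in outer]
print(inner_labels, outer_labels)
```

The feature (position) is the same in both formats; only the representation handed to the classifier differs.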
Representation in deep networks
- Useful features may need to be transformed or extracted first.
- So deep networks have multiple representations -> each is built from an earlier representation
- This can transform features to a different format before learning their links to the output, AND extract complex features from simpler features
- Essentially multiple steps in a program
- Each layer can be seen as the computer’s memory state after executing a set of instructions.
- Deeper networks execute more instructions in sequence
- Just like a computer program, the individual steps are generally very simple.
- Complex outcomes emerge from interactions between many simple steps
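The "layers as program steps" idea above can be sketched as follows. This is an illustrative toy, not a trained network: the weights are hand-picked assumptions, and the names `dense`, `relu` and `forward` are hypothetical. Each entry of `states` plays the role of the memory state after one step, and the two simple steps together compute the absolute difference of the two inputs.

```python
def relu(values):
    """Threshold step: negative values become zero."""
    return [max(0.0, v) for v in values]

def dense(inputs, weights, biases):
    """One simple step: a weighted sum of the inputs plus a bias for each unit."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]

# Hand-picked (assumed) weights, chosen so the result is easy to check by hand.
layers = [
    ([[1.0, -1.0], [-1.0, 1.0]], [0.0, 0.0]),  # layer 1: a - b and b - a
    ([[1.0, 1.0]], [0.0]),                     # layer 2: sum of what survives
]

def forward(inputs, layers):
    """Run the layers in sequence, keeping every intermediate representation."""
    states = [list(inputs)]
    for weights, biases in layers:
        states.append(relu(dense(states[-1], weights, biases)))
    return states

states = forward([3.0, 1.0], layers)
for depth, state in enumerate(states):
    print("state after step", depth, ":", state)
```

Neither step is complicated on its own; the absolute difference only appears once the two steps are run in sequence.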
What is a deep network?
A learning network that transforms or extracts features using:
- Multiple nonlinear processing units
- Arranged in multiple layers with:
- Hierarchical organisation
- Different levels of representation and abstraction
20th century view of object recognition
- Builds a representation of local image features
- Builds a representation of larger-scale shapes and surfaces
- Matches shapes and surfaces with stored object representations -> recognition
Why nonlinear functions?
Any operation that can be done with only linear functions of the input can be straightforwardly described by formal mathematical rules, so it is not a good use for deep networks.
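One way to see this, as a rough sketch: stacking linear steps never gives more than a single linear step, so depth adds nothing without a nonlinearity. The one-dimensional functions and the specific weights below are assumptions chosen for easy checking (with a bias term they are strictly affine, which collapses in the same way).

```python
def linear(weight, bias):
    """Return the one-dimensional function x -> weight * x + bias."""
    return lambda x: weight * x + bias

first = linear(2.0, 1.0)     # y = 2x + 1
second = linear(-3.0, 4.0)   # z = -3y + 4

def stacked(x):
    """Two linear layers applied in sequence."""
    return second(first(x))

# Algebraically: -3 * (2x + 1) + 4 = -6x + 1, i.e. a single linear layer.
collapsed = linear(-6.0, 1.0)

def stacked_with_relu(x):
    """The same two layers with a ReLU threshold in between."""
    return second(max(0.0, first(x)))

def slopes(f, points):
    """Slope of f between each pair of neighbouring points."""
    return [(f(b) - f(a)) / (b - a) for a, b in zip(points, points[1:])]

points = [-2.0, -1.0, 0.0, 1.0]
print(slopes(stacked, points))            # constant slope: still one straight line
print(slopes(stacked_with_relu, points))  # slope changes: no longer a single line
```

With the threshold in place the slope varies across the input range, which no single linear function can reproduce.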
Name the complex nonlinear function with four operations or processing steps
Filter, threshold, pool and normalize
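A minimal sketch of the four steps on a one-dimensional signal. The details are assumptions made for illustration, since the card only names the steps: filtering is done with a small edge-detecting kernel, thresholding with ReLU, pooling with a max over non-overlapping windows, and normalization by dividing by the summed response.

```python
def filter_step(signal, kernel):
    """Filter: slide the kernel along the signal and take weighted sums."""
    width = len(kernel)
    return [sum(k * s for k, s in zip(kernel, signal[i:i + width]))
            for i in range(len(signal) - width + 1)]

def threshold_step(values):
    """Threshold: keep positive responses, set the rest to zero (ReLU)."""
    return [max(0.0, v) for v in values]

def pool_step(values, size=2):
    """Pool: keep the maximum response in each non-overlapping window."""
    return [max(values[i:i + size]) for i in range(0, len(values), size)]

def normalize_step(values):
    """Normalize: divide by the summed response (one assumed form of normalization)."""
    total = sum(values)
    return [v / total for v in values] if total else list(values)

signal = [0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 1.0]
edge_kernel = [-1.0, 1.0]  # hypothetical kernel that responds to rising edges

filtered = filter_step(signal, edge_kernel)
thresholded = threshold_step(filtered)
pooled = pool_step(thresholded)
normalized = normalize_step(pooled)
print(filtered, thresholded, pooled, normalized, sep="\n")
```

Only the filter step is linear; the threshold, max pooling and division make the combined function nonlinear.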
Name 1 issue which arises with ReLU
It has no maximum output, whereas a biological neuron has a maximum firing rate.
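The difference can be sketched as follows. `capped_relu` and its `max_rate` of 100.0 are hypothetical, introduced only to mimic a saturating firing rate; they are not taken from the course material.

```python
def relu(x):
    """Standard ReLU: zero for negative input, otherwise the input itself."""
    return max(0.0, x)

def capped_relu(x, max_rate=100.0):
    """Hypothetical variant: ReLU clipped at an assumed maximum firing rate."""
    return min(max_rate, max(0.0, x))

drives = [-5.0, 50.0, 500.0, 5000.0]
relu_out = [relu(d) for d in drives]
capped_out = [capped_relu(d) for d in drives]
print(relu_out)    # keeps growing with the input
print(capped_out)  # saturates at max_rate
```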