Week 3: feedforward visual processing Flashcards

1
Q

Why do we study biological visual networks to understand DCNNs?

A
  • image processing is a common application of artificial deep networks
  • the early visual system is the best understood system in the human brain
  • DCNNs are commonly applied as simulations of neural processing of early visual responses
2
Q

Where do responses to specific edge orientations emerge?

A

V1, the primary visual cortex: the first cortical area to process visual input in the brain.

3
Q

How is the first step of visual processing different in biological networks and artificial networks?

A

In biological networks, contrast is initially computed by orientation-independent (centre-surround) filters in retinal ganglion cells. Artificial DCNNs often skip this step and go directly to oriented edge filters.
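
A minimal sketch of the contrast between the two filter types, using NumPy (filter sizes and values here are illustrative assumptions, not physiological measurements): a centre-surround, difference-of-Gaussians filter responds to local contrast regardless of edge orientation, whereas an oriented filter of the kind a DCNN might learn in its first layer responds strongly only near its preferred orientation.

```python
import numpy as np

def gaussian(size, sigma):
    """2D Gaussian kernel, normalised to sum to 1."""
    ax = np.arange(size) - size // 2
    xx, yy = np.meshgrid(ax, ax)
    g = np.exp(-(xx**2 + yy**2) / (2 * sigma**2))
    return g / g.sum()

# Centre-surround (difference-of-Gaussians) filter: orientation-independent,
# a stand-in for a retinal ganglion cell receptive field.
centre_surround = gaussian(9, 1.0) - gaussian(9, 3.0)

# Oriented filter: prefers vertical edges, a stand-in for a first-layer DCNN filter.
oriented = np.tile([[-1.0, 0.0, 1.0]], (3, 1))

# A vertical and a horizontal luminance edge.
vertical_edge = np.kron([[0.0, 1.0]], np.ones((9, 5)))  # dark left, bright right
horizontal_edge = vertical_edge.T                        # dark top, bright bottom

def response(filt, image):
    """Filter response at the centre of the image (single dot product)."""
    h, w = filt.shape
    r0, c0 = (image.shape[0] - h) // 2, (image.shape[1] - w) // 2
    return float(np.sum(filt * image[r0:r0 + h, c0:c0 + w]))

print("centre-surround:", response(centre_surround, vertical_edge),
      response(centre_surround, horizontal_edge))  # same magnitude for both edges
print("oriented:       ", response(oriented, vertical_edge),
      response(oriented, horizontal_edge))         # strong for the vertical edge only
```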

4
Q

What happens in V1?

A

Orientation-selective responses are computed in V1 by operations comparing the outputs of retinal ganglion cells
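
A toy sketch of this comparison step (in the spirit of the classic Hubel–Wiesel feedforward model; all sizes and stimuli here are illustrative assumptions): summing the outputs of several centre-surround units whose receptive fields lie along a line yields a unit that prefers bars of one orientation.

```python
import numpy as np

def dog_filter(size=9, sigma_c=1.0, sigma_s=3.0):
    """Centre-surround (difference-of-Gaussians) filter, a stand-in for an RGC."""
    ax = np.arange(size) - size // 2
    xx, yy = np.meshgrid(ax, ax)
    centre = np.exp(-(xx**2 + yy**2) / (2 * sigma_c**2))
    surround = np.exp(-(xx**2 + yy**2) / (2 * sigma_s**2))
    return centre / centre.sum() - surround / surround.sum()

def rgc_output(image, row, col, filt):
    """Output of one centre-surround unit centred at (row, col)."""
    h = filt.shape[0] // 2
    return float(np.sum(filt * image[row - h:row + h + 1, col - h:col + h + 1]))

def v1_like_unit(image, centres, filt):
    """Sum (then rectify) RGC outputs whose centres lie along a vertical line."""
    return max(sum(rgc_output(image, r, c, filt) for r, c in centres), 0.0)

filt = dog_filter()

# A vertical and a horizontal bright bar on a dark background.
vertical_bar = np.zeros((40, 40)); vertical_bar[:, 19:22] = 1.0
horizontal_bar = vertical_bar.T

# Centre-surround units stacked vertically, matching the vertical bar.
centres = [(r, 20) for r in (12, 16, 20, 24, 28)]

print("vertical bar:  ", v1_like_unit(vertical_bar, centres, filt))    # large
print("horizontal bar:", v1_like_unit(horizontal_bar, centres, filt))  # much weaker
```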

5
Q

How are neurons processing the image grouped?

A

Neurons processing the same part of the visual field are grouped together, as are neurons with similar orientation preferences; this grouping occurs at a very small scale.

6
Q

How do orientation preferences vary across neurons?

A

Neurons have orientation preferences which gradually change across the cortex

7
Q

How do orientation preferences form feature maps?

A

The orientation columns form further feature maps which are squeezed into the same 2D cortical surface

8
Q

What is represented in the feature maps in V1?

A

In V1 there is a large, complex set of feature maps, with each feature represented at all spatial positions. These include colour, eye of origin, spatial frequency, orientation, and motion direction.

9
Q

What happens after V1 in terms of processing an image?

A

Form (object recognition) and motion (motion and space) information are processed separately in different areas of the brain. There are multiple branching hierarchies performing different tasks

10
Q

How does the hierarchy of the brain differ from that of an artificial neural network?

A

Many brain areas sample from V1, creating a web of connections, whereas artificial networks use a linear hierarchy.
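
As a rough illustration of the artificial side only (a minimal PyTorch-style sketch with arbitrary layer sizes, not a model of any particular brain area): in a linear hierarchy, each stage receives input solely from the stage directly below it.

```python
import torch
import torch.nn as nn

# A strictly linear hierarchy: activations flow through one fixed chain of stages,
# unlike the brain's web of connections in which many areas sample directly from V1.
linear_hierarchy = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),   # first stage (edge-like features)
    nn.ReLU(),
    nn.MaxPool2d(2),
    nn.Conv2d(16, 32, kernel_size=3, padding=1),  # second stage sees only the first
    nn.ReLU(),
    nn.MaxPool2d(2),
    nn.Conv2d(32, 64, kernel_size=3, padding=1),  # third stage sees only the second
    nn.ReLU(),
)

x = torch.randn(1, 3, 64, 64)    # one dummy RGB image
features = linear_hierarchy(x)
print(features.shape)            # torch.Size([1, 64, 16, 16])
```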

11
Q

What happens as you go up the hierarchy V1-V4… in the brain?

A

As you go up, the areas have a larger representation of the central visual field, and respond to increasingly complex features

12
Q

What happens to the spatial integration of the image as you move up the hierarchy?

A

Spatial relationships between image locations are maintained

13
Q

How does V1 represent the different areas in the visual field?

A

V1 strongly over-represents the central visual field compared to the peripheral parts of the field
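
One way this over-representation is often quantified is the cortical magnification factor. The figures below use a commonly cited approximation for human V1 (roughly M(E) = 17.3 / (E + 0.75) mm of cortex per degree, after Horton & Hoyt, 1991); they are an illustrative assumption, not values from the course.

```python
# Approximate linear cortical magnification in human V1: M(E) = A / (E + e2),
# in millimetres of cortex per degree of visual angle (illustrative values).
A_MM = 17.3     # scaling constant (mm)
E2_DEG = 0.75   # eccentricity constant (degrees)

def magnification_mm_per_deg(eccentricity_deg):
    return A_MM / (eccentricity_deg + E2_DEG)

for ecc in (0.5, 2, 10, 40):
    print(f"{ecc:>4}° eccentricity -> {magnification_mm_per_deg(ecc):5.2f} mm of V1 per degree")
```

On this approximation, a degree of central visual field gets tens of times more cortex than a degree in the far periphery.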

14
Q

Which parts of the brain focus on object recognition and spatial perception?

A

Object recognition: ventral stream, temporal lobe

Spatial perception/action planning: dorsal stream, parietal lobe

15
Q

How do later visual field maps sample from earlier visual field maps?

A

Later visual field maps sample from approximately constant cortical areas of earlier visual field maps, regardless of the visual position represented

16
Q

How do artificial network filters represent sampling from a visual field map, and how do they differ from biological sampling?

A

Later visual field maps sample from approximately constant cortical areas of earlier visual field maps. This is captured by the fixed size of artificial network filters.
Differences:
  • the inputs in biological networks over-represent central vision
  • the inputs to biological networks are neurons rather than images
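
A small sketch of the fixed-filter-size point (PyTorch-style, with arbitrary sizes): a convolutional layer applies the same fixed-size kernel at every position of the previous layer, so every output unit samples a constant-sized region of its input, wherever that region sits.

```python
import torch
import torch.nn as nn

# Every output unit pools over the same fixed-size (3x3) window of the previous
# layer, regardless of where in the feature map that window is located.
layer = nn.Conv2d(in_channels=8, out_channels=16, kernel_size=3, padding=1)

previous_map = torch.randn(1, 8, 32, 32)   # stand-in for an earlier feature map
next_map = layer(previous_map)

print(next_map.shape)       # torch.Size([1, 16, 32, 32])
print(layer.weight.shape)   # torch.Size([16, 8, 3, 3]): one fixed-size filter
                            # set, shared across all spatial positions

# Unlike the biological case, the input grid here samples space uniformly
# (no over-representation of central vision), and the first layer of a DCNN
# takes image pixels rather than the outputs of other neurons.
```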

17
Q

How do features transform as they move up through the layers of the hierarchy?

A

Transformations find commonly seen patterns in the activity of earlier layers; these patterns have been difficult for humans to recognise.

18
Q

What types of computations are later stages in the hierarchy doing?

A

Later stages are likely doing the same computations as earlier stages, but from more abstracted inputs

19
Q

Why do we use DCNNs to simulate feature transformations?

A

It is hard for humans to think about the transformations and representations of later layers in the network. DCNNs are useful for experimenting and testing hypotheses on how these transformations work

20
Q

How do mid-level representations seem to be optimised?

A

Mid-level representations appear to be optimised to allow subsequent transformations to support object recognition

21
Q

What is distributed encoding?

A

Object identity is not reflected in the activity of a single neuron but in the pattern of activity across a larger population of neurons
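
A tiny numerical sketch of the idea, with made-up responses for four hypothetical neurons: no single neuron responds in an all-or-nothing way to one object, but the pattern across the population identifies it.

```python
import numpy as np

# Made-up mean firing rates of four hypothetical neurons to two objects.
# Every neuron fires to both objects (no all-or-nothing responses), but the
# pattern across the population differs between the two.
patterns = {
    "face":  np.array([8.0, 5.0, 3.0, 6.0]),
    "house": np.array([5.0, 8.0, 6.0, 3.0]),
}

rng = np.random.default_rng(0)
noisy_trial = patterns["face"] + rng.normal(0.0, 1.0, 4)   # one noisy "face" trial

# Decode identity by matching the whole population pattern to each template.
decoded = min(patterns, key=lambda obj: np.linalg.norm(patterns[obj] - noisy_trial))
print("decoded:", decoded)   # almost always "face"
```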

22
Q

What are the advantages of distributed encoding?

A
  • allows some cell death without representation failing (graceful degradation)
  • allows new patterns to be stored without adding new cells; a fixed group of cells can store a variable number of objects
  • it is consistent with measured cell properties, i.e. there are rarely all-or-nothing responses
23
Q

What is the disadvantage of distributed encoding?

A

It is harder for humans to understand

24
Q

What are face-selective neurons?

A

Cells found later in the ventral stream that respond more strongly to specific faces, regardless of the particular image used.

25
Q

How do artificial DCNNs relate to face-selective neurons?

A

Artificial DCNNs produce similar results to the brain: responses in later network layers closely resemble those of face-selective cells.
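
One common way such comparisons are made is representational similarity analysis (RSA). The sketch below uses random stand-in arrays purely to show the mechanics of correlating a network layer's dissimilarity structure with a neural one; it is not a claim about the actual analyses behind this result.

```python
import numpy as np

rng = np.random.default_rng(1)

# Stand-in data: activity patterns for 20 face images, from a DCNN layer
# (100 units) and from a set of recorded neurons (30 cells).
layer_activity = rng.normal(size=(20, 100))
neural_activity = rng.normal(size=(20, 30))

def rdm(patterns):
    """Representational dissimilarity matrix: 1 - correlation between image pairs."""
    return 1.0 - np.corrcoef(patterns)

def compare_rdms(a, b):
    """Correlate the upper triangles of two RDMs (a simple RSA score)."""
    idx = np.triu_indices_from(a, k=1)
    return np.corrcoef(a[idx], b[idx])[0, 1]

score = compare_rdms(rdm(layer_activity), rdm(neural_activity))
print(f"layer-vs-neurons RSA score: {score:.2f}")   # near zero here (random data)
```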

26
Q

What are Deepfakes?

A

Videos where a DCNN is used to map one person’s face onto another. The network is first shown training videos of the target face; it then maps the features of that face and their movements onto the source face. This shows that artificial DCNNs can convincingly manipulate facial identity.

27
Q

What are object-selective areas? What types of objects are commonly studied?

A

There are brain areas that respond selectively to many different classes of objects.
Commonly studied: faces, places, words and tools

28
Q

How does the brain process semantic content?

A

Responses in the brain show that processing semantic content produces similar results to processing visual objects, which suggests similar processes are involved

29
Q

How does the brain change its processing when given a task to identify humans?

A

Recording sites throughout the brain change their object selectivity and start responding more to humans. A face-selective area could become a car-selective area if given a car identification task

30
Q

How does the brain process visual content in dreams?

A

Similarly to content seen when awake: responses during dreaming are similar all the way back to the early image representation in V1.

31
Q

How do responses change when we focus on something? object vs spatial responses?

A

Responses are drawn towards attended content; more neurons will respond to the attended area of the image.

  • object responses are drawn towards attended object
  • spatial responses drawn towards attended locations