Bashivan Flashcards
What does the classical neuroscience approach allow us to do?
It allows us to CLASSIFY neurons according to:
1. Morphology → what do the dendritic trees look like?
2. Function → more distinct functions deeper in the visual pathway
3. Firing pattern → bursts, frequency
Etc.
*Every knob is a feature (many many knobs)
What is one of the important uses of functional classifications of neurons?
(Classic approach)
Entorhinal cortex → Grid cells vs Border cells
Hippocampus:
- Place cells (being in a single spot in the room)
- Object-vector cells (respond to a single object in that room)
- Splitter cells (respond to being in a specific location but only when about to turn left)
What are advantages and disadvantages of the classic approach?
Advantages:
- Describes what individual neurons contribute to the computation
Disadvantages:
- Typically function is considered in specific setting → would have to consider ALL possible settings
- Small populations of neurons are considered → limited amount of cells you can record from
- Circuits and mechanisms have to be deduced based on intuition from a very limited amount of cell information
What is the difference between deep learning and machine learning?
Machine learning:
Input → Feature extraction by a domain expert → Classification → Output
*Domain expert defines key features on top of which classification fits
Deep learning → Feature extraction + Classification → Output
*The network itself establishes the features and classifications directly from the data (in the learning process)
What are the “4 knobs” of deep learning?
(4 components on which the design is focused)
- Architecture
- Learning objective (cost function)
- Learning rule
- Dataset
What are the different types of architecture a neural network can take?
1-2 are best for image inputs
1. Multilayer Perceptrons:
Every circle/neuron of 1 layer is connected to every neuron of both adjacent layers (not connected to others in the same layer)
- min 3 layers: Input → Hidden → Output
- Large number of weight parameters need to be trained
2. Convolutional Neural Network
1 nose detector goes through the whole picture (convolution), no need for different nose detectors for different areas (check for specific patterns over all the image → extract features)
3. Recurrent Neural Network
- Accepts input from outside + generates its own input
- Sequential output fed back into the network
4. Transformers
- Used for ChatGPT, etc.
*Parameters = connections between neurons
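The MLP connectivity above can be sketched in a few lines; this is an illustrative toy (2 inputs, 3 hidden units, 1 output, made-up weights), not any particular network from the lecture. Every hidden unit takes a weighted sum of ALL inputs, which is why the parameter count blows up for image-sized inputs:

```python
import math

def mlp_forward(x, w_hidden, w_out):
    """Forward pass of a minimal 3-layer MLP: input -> hidden -> output.

    Each hidden unit receives a weighted sum of every input (full
    connectivity between adjacent layers), then a sigmoid nonlinearity.
    """
    h = [1 / (1 + math.exp(-sum(wi * xi for wi, xi in zip(w_row, x))))
         for w_row in w_hidden]
    # Output layer: weighted sum of the hidden activations
    return [sum(wi * hi for wi, hi in zip(w_row, h)) for w_row in w_out]

# Toy example: 2 inputs, 3 hidden units, 1 output -> 2*3 + 3*1 = 9 weights
x = [0.5, -1.0]
w_hidden = [[0.1, 0.4], [-0.3, 0.2], [0.7, -0.6]]   # 3 x 2 weight matrix
w_out = [[0.5, -0.2, 0.9]]                          # 1 x 3 weight matrix
print(mlp_forward(x, w_hidden, w_out))
```

Even this toy net needs 9 weights; a CNN avoids that growth by sliding one small filter (the "nose detector") over every image position instead of learning separate weights per location.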
What are the different cost functions of neural networks in deep learning?
*These are ways to learn/change parameters to improve the output
1. Unsupervised objective functions
- NO teacher
- For cross-modal consistency (read → write, Hear → talk) ~ generative consistency
- For future predictions (transformers are trained to do this) → predict an image of a car moving in 2 secs, predict the next word in a sentence
- May fail to discover properties of the world that are statistically weak but important for survival (need supervised learning for that)
2. Supervised objective functions
- Teacher that tells if right or wrong output
→ Object recognition, object detection, source localization (of sound)
- The network will change its parameters based on the feedback from the teacher to maximize the odds of outputting the right answer next time
3. Reward-based objective learning
- Agent → action → Environment → reward/state → Agent …
- 2 interactions (with environment and with reward)
- Agent tries to maximize the reward
*What cost functions does brain optimize?
*What do cost functions look like in the brain?
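A minimal sketch of the supervised case: the "teacher" supplies the right answer, and the cost is just how far the network's output is from it (mean squared error here; the numbers are invented). Lowering this cost is what "changing parameters based on feedback" means:

```python
def mse_cost(predictions, targets):
    """Supervised cost: average squared gap between the network's output
    and the teacher-provided correct answer. Lower cost = better parameters."""
    return sum((p - t) ** 2 for p, t in zip(predictions, targets)) / len(targets)

# Teacher says the right answer is 1.0 ("object present")
early = mse_cost([0.2], [1.0])   # large error early in training
late = mse_cost([0.9], [1.0])    # small error after learning
print(early, late)
```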
What are 3 ways of representing costs of neural networks in the brain?
- Genes → each neuron's genes encode what it needs to do
- Cost-encoding Neural net (smaller networks):
- Output layer explicitly computes the error (tries to satisfy this)
- Task-performing Neural net (larger networks):
- Implicit encoding of cost
- Cost embedded in task performance (ex: vision, decision-making)
What do we know about learning rules in the brain neural network?
*For the synapses that are potentiated
1. Changes in size of synaptic connections
2. Perforations caused by LTP
3. Multiple spine boutons (1 pre for multiple post)
To relate to neural networks:
Before training → all nodes have the same weight (all connected equally, parameters)
Training modulates the weight of different parameters/connections
W_trained = W_initial + ΔW
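The W_trained = W_initial + ΔW relation can be sketched as a single training step; plain gradient descent and the learning rate used here are illustrative stand-ins for whatever learning rule actually supplies ΔW:

```python
def update_weights(w_initial, gradients, learning_rate=0.1):
    """One training step: W_trained = W_initial + dW, where the learning
    rule (here, gradient descent) sets dW = -learning_rate * gradient of
    the cost with respect to each connection weight."""
    delta_w = [-learning_rate * g for g in gradients]
    return [w + dw for w, dw in zip(w_initial, delta_w)]

# Before training all connections have the same weight; the feedback
# (gradients) then modulates each one differently.
w_initial = [0.5, 0.5, 0.5]
gradients = [2.0, -1.0, 0.0]      # cost gradient per connection (invented)
w_trained = update_weights(w_initial, gradients)
print(w_trained)                   # weights are no longer uniform
```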
What are types of datasets?
- Images → Image Net (millions)
- Video games (tens)
- Hundreds of billions pages in text → Common Crawl (to train language models)
What are the main takeaways of the DL approach that differ from the classical approach?
- No unit typing → units have ubiquitous functionality
- Emergent properties → unit’s functional diversity emerges through learning (no need to specify every circuit by hand)
- Distributed processing → groups of units are orchestrated to facilitate internalized or externally-imposed objectives
- Behaviour is not focused on single unit, focused on global network function and performance
Which future questions could the DL framework allow us to resolve?
- Investigate relations between neurons across regions
- How representations come to be, how neural networks give rise to them
- How do we recognize faces? (can't test experimentally)
- Teleological explanation for the existence of representations
- Explain why things exist in the brain → behaviours, anatomical features, evolutionary pressures
Why is object recognition such a challenge for the brain?
*Difficult computational problem
1. Have to consider the infinite number of ways an object can be presented to us and be able to recognize it every time → different sizes, angles, colors, backgrounds, etc.
- Extrapolate that problem from 1 object → to the thousands of objects we can identify
*Ventral visual pathway solves this problem
How long does it take for the brain to discriminate different objects?
Information gets to IT cortex ~ 100ms → this area discriminates different objects
~ 40ms → LGN
~ 50ms → V1
+ ~10ms for each cortical area higher up:
V1 → V2 → V4 → PIT → CIT → AIT
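The ~100ms figure for IT follows from the per-area delays above; a quick check of the arithmetic (the flat ~10ms increment per area is the approximation stated in the notes):

```python
# Approximate arrival times along the ventral stream (ms), per the notes:
# LGN ~40 ms, V1 ~50 ms, then roughly +10 ms per cortical area.
latency = {"LGN": 40}
areas = ["V1", "V2", "V4", "PIT", "CIT", "AIT"]
for i, area in enumerate(areas):
    latency[area] = 50 + 10 * i
print(latency)  # AIT (part of IT cortex) comes out at ~100 ms
```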
What are the first cells in the visual system to communicate via Action Potentials?
Retinal ganglion cells
~1.5 million in monkeys
Photoreceptors and Bipolar cells do not send APs
What are pinwheels in V1?
They are points in the orientation map around which, if you move in a small circle, you encounter selectivity for all orientations
What different regions are found in IT?
- Big region specific for faces
- Other region selective for body parts (EBA = extrastriate body area)
- Region selective for scenes (external or internal landmarks) → responses are anti-correlated with face selective regions responses
What is population coding?
It is a way of looking at groups of neurons and their activity instead of single neurons
- Imagine an N-dimensional space → recording from N neurons simultaneously
- Each point/vector = 1 stimulus placed according to how much response it induces in each of the N neurons
- This N-dimensional space contains 1 line (manifold) for each object, with the infinite points on this line corresponding to all possible representations of that object
The ventral stream transforms these lines from being very curved → more linear as you go higher up in cortical areas
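A minimal sketch of the population-coding idea: each stimulus becomes one point in N-dimensional space, with one coordinate per simultaneously recorded neuron. The firing rates below are invented for illustration (N = 4):

```python
# Each stimulus -> a vector of responses from N simultaneously recorded
# neurons, i.e. one point in N-dimensional space.
responses = {
    "face_frontal": [12.0, 3.0, 8.0, 1.0],   # firing rates (Hz), invented
    "face_profile": [10.0, 4.0, 7.5, 1.5],   # same object, different view
    "car":          [2.0, 11.0, 1.0, 9.0],
}

def distance(a, b):
    """Euclidean distance between two population response vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

# Two views of the same face sit closer together in population space than
# either does to the car: the geometry of the population, not any single
# neuron, carries object identity.
same = distance(responses["face_frontal"], responses["face_profile"])
diff = distance(responses["face_frontal"], responses["car"])
print(same < diff)
```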
For what type of information do you have to look at groups of neurons to learn about? (not found in single neuron’s activity)
3D scale, Z-axis rotation, height, width, perimeter
*Responses to these properties increase as we go up the ventral stream
→ They are category-orthogonal object properties (don’t fit in neuron categories)