Object perception Flashcards

Question

Describe general recognition theory - categorizing defined by

Answer 1

Probabilistic distributions and categorization is based on decision boundaries that separate perceptual regions Ability to differentiate objects depends on how much their features overlap If do not overlap = easier to find decision boundary (hyperplanes)= decision easier and faster

Answer 2

Gcm = store many specific faces you’ve seen before, when seeing new face = compare it to stored examples and assign the category based on similarity to past faces Grt = rely on perceptual dimensions - face shape, Jaw width, eye size - make a decision based on statistical boundaries between 2 categories

Answer 3

Structuralist tehory Alphabet of shapes - geometric ions - goons Form objects = combined to create Limitation = doesn’t really handle variability we see in objects, just a crude characterization fo obejcts

Answer 4

Kinda impossible bc some variability - one single neuron for every single concept in physical world —single neuron responsible for recognizing grandma?

Answer 5

Concept contributes to ongoing debate between localized vs distributed representation Extreme ex of localized representation in brain Also if cell dies = does that mean wont recognize grandma anymore

Answer 6

Ppl with electrodes in brain - epilepsy treatment - did exp Jennifer Anniston = cell fires when see her, other systems also fire if hear her voice - also for Harrison ford for some ppl Kinda supports grandmother cell theory

Answer 7

Deep neural network = Multiple layer neural networks capable fo being trained to recognize obejcts Numerous instances of an object shown to network with feedback provided Overtime = network learns to recognize new instances of object that is has never been explicitly trained on - need to generalize so can see object

Answer 8

Alex net Stimulus —> layer 1 —> … layers 6-8 = huamn face So small area stimulus inputted and processes = spatial average passed on until can put label of huamn face on it

Answer 9

Deep neural networks rival representational performance of inferior temporal cortex - it - in monkeys in object recognition task Representations of Dnn based object recognition model successfully predict the representations measured in inferior temporal cortex using fair Using dnn to mdoel visual properties of stimuli= demonstrate that intermediate and high level image features can predict visual awareness and provide mechanistic explanation for phenomenon of attentional blink - like if show image v quick

Answer 10

Detecting spots and edges and bars = use retinal ganglion cells, lateral geniculate nucleus and primary visual cortex -v1

Answer 11

Retinal ganglion cells and Lgn - localized contrast

Answer 12

Primary visual cortex - orientation selectivity - combine spots

Answer 13

Brain performs sophisaticated processing b beyond v1 Integrating visual features into structured representations of obejcts - intermediate level vision and high level vision

Answer 14

V2,v33,v4 etc —> grouping features into contours, textures and surfaces

Answer 15

It cortex —> recognizing complex shapes, obejcts and categories - tolerance to variability

Answer 16

Not just about simple features but about hierarchical processing across multiple visual areas - feedback and feed forward features

Answer 17

Receptive fields of extrastriate cells respond to visual properties crucial for object perception Only respond if boundary belongs to object and not background

Answer 18

For given edge or contour = neurons determine which side belongs to object and which belongs to background - a fundamental processs in figure ground segregation

Answer 19

Loosely defined stage of visually processing that occurs after low level feature extraction - like edges, contrast and before high level object recognition and scene understanding

Answer 20

Perception of edges and surfaces Determines which regions of an image should be grouped into obejcts Bridges low level feature detection and high level object recognition

Answer 21

Primary visual cortex v1 neurons have smaller respective fields that detect local edges and contrast Neurons are orientation selective - responding to edges at specific angels

Answer 22

Complicated Computerized edge detectors are not as effective as humans in detecting meaningful edges As humans = can see it better

Answer 23

Locally contrast between background and foreground nto strong enough Computers miss edges that humans easily perceive bc they rely on local contrast and intensity differences

Answer 24

Contour that is perceived even though no physical edge exists between one side and the other Edge detectors fail Minds fail gaps Problem with some of more structuralist theories - mind can solve problem

Answer 25

Whole is greater than sum of parts Opposes structuralism - which emphasizes breaking perception into basic elements Suggests that perception is holistic = meaning we naturally organize elements into meaningful wholes rather than processing each part independently

Answer 26

Set of rules that describe when and how elements in an image appear grouped together

Answer 27

Similar objects - colour, shape, size or texture = appear grouped together - perceived as group Segment animal from background

Answer 28

Elements close to each other tend to be grouped together in perception

Answer 29

Lines and edges are perceived as following the smoothest past Doesn’t explain everything tho - group as x, if have context = beak = ex

Answer 30

Mind fills in missing info to perceive complete shapes Illusory controls - segment arrow from background

Answer 31

Elements moving together are grouped Flock of birds - moving together in shaped directions

Answer 32

Brain separates objects from background Vase segmented in foreground ex

Answer 33

Elements located within a shared boundary or enclosed area are perceived as a group Stronger than proximity

Answer 34

Elements visually connected by lines tend to be grouped Overrules proximity

Answer 35

Parallel contours are likely to belong to same group

Answer 36

Symmetrical regions are more likely to be perceived as a group

Answer 37

Animals take advantage of gestalt grouping principles to form groups in their environment Sometimes camouflage is used to confuse observed Like Tiger - most animals are dichromats so camo better

Answer 38

All together = can help figure out object Ambiguity and perceptual committes Metaphor for how perception operates Committees must integrate conflicting inputs and reach consensus Many diff and sometimes competing principles influence perception Perception emerges as result of dominant interpretation agreed upon by these processes Combined info from many regions = draw most likely conclusions

Answer 39

Similarity, closure, good continuation, proximity, figure ground organization

Answer 40

1 = group what should be grouped together 2 = separate what should be separated 3 = use prior knowledge - brain stores experiences to Avoid mistakes/suprises 4 = avoid accidents, like leaning tower Pisa illusion 5 = seek consensus and minimize ambiguity = on most likely hypothesis, what’s nature of object in front of me

Answer 41

After processing in extrastriate cortex, object info divided into 2 distinct pathways = where and what pathways

Answer 42

Dorsal stream Processes locations and shapes of obejcts Does not encode object names or functions Extends from occipital love to parietal lobe

Answer 43

Processes object identity - names and functions, independent of location Extends form occipital lobe to temporal lobe - infra temporal cortex Not unidirectional v1 = bigger V2 = complex, boundary ownership V4 = cells that respond to linear shapes

Answer 44

Supports spatial awareness = dorsal And object recognition = ventral In visual perception

Answer 45

Neural response to polar, hyperbolic and Cartesian gratings in area v4 of monkey V4 = bridges early edge detection - v1 and object recognition in inf temporal cortex More responses to these specific patterns Not much activity for sinuosoidal gratings or patches with oriented linear edges

Answer 46

V2 = bit more complex boundaries = fore and background V4 Posterior it = responds to object parts but nto whole objects - don’t need whole object there

Answer 47

Some areas show specificity = preferential responses to certain categories Results obtained by univariance analysis - functional mri = averaging - take average response and contrast it Shown pics and see if area responds more to one thing

Answer 48

Responds more to obejcts Loc = first stage in visual hierarchy = where full objects explicitly represented - complete objects - whole Responds strongly to shape defined obejcts - doesn’t matter orientation, viewpoints, sizes, positions Partial invariance - doesn’t respond to specific image but just shape of object - also involved in figure ground segmentation and distinguishing object from background

Answer 49

Bridges mid level feature processsing - v4, pit with high level object recognition = ita cortex, Ffa, Ppa Supports invariant object recognition - crucial for recognizing obejcts across diff contexts Provides whole object representations -making it a key step in ventral visual stream

Answer 50

Major hub for object recognition = makes it essential part of understating how brain transforms raw visual input into meaningful obejcts

Answer 51

In fusiform gyrus of ventral temporal lobe Usually only in right hemisphere Sometimes bilateral

Answer 52

Highly tuned to faces bit also respond to expert level recognition Preferential to faces

Answer 53

Helps recognize faces across diff angles, lighting, expression = suggest view invariant representation

Answer 54

Linked to prosopagnosia - cannot recognize faces anymore - do not know if ffa cares about identity - more research needed But this conditions doesn’t mean its linked to identity - bc could like identify based on info form ffa

Answer 55

Some research argues ffa is not strictly for faces but instead specializes in fine grained within category visual recognition

Answer 56

Region just posterior to hippocampus Responds preferentially to places

Answer 57

As dedicated scene processing region

Answer 58

Idea that object recognition alone explains scene perception - scene not just a collection fo objects Spatial layout is key

Answer 59

Ppa from hippocampal spatial navigation = refines understanding of scene perception vs memory asked navigation

Answer 60

Functional link between vision and spatial cognition - bridging perception and higher order place representation Also responds to other things too - not completely separated = all regions contribute some

Answer 61

Extracts curves, textures and complex contours

Answer 62

Sensitive to local features

Answer 63

Encodes whole object representations

Answer 64

Represent object parts

Answer 65

Intermediate processing

Answer 66

Sensitive to shape, invariant to texture and colour

Answer 67

Recognizes faces

Answer 68

Category selective

Answer 69

Recognizes scenes and places

Answer 70

Category selective

Answer 71

Relieve projections from lower level regions to help them process info about category they prefer to respond to

Answer 72

Small and big objects = projection onto brain medically and laterally to fusiform gyrus = contrast between sizes of objects Also on dorsal part = why does where care about obejcts - more than just spatial location in dorsal pathway

Answer 73

Contexts helps guide recognition of obejcts

Answer 74

Many it neurons demonstrate invariance - at cellular level in neurons in itc = meaning they continue to respond to an object regardless of its size position or viewpoint = suggests that it neurons encode more abstract representations of objects rather than raw sensory features Invariance essential for object recognition Can rotate = neurons still respond. It if rotate too much =responses dampens

Answer 75

Studying brain computer interface = brought machine learning Departed from invariant methods = ● Collect fMRI scans of a participant while they view images from multiple known categories.- show more images = better decoding area ● Train a computer model to recognize the brain activity patterns associated with each category. ● Test the model to see if it can correctly identify an unseen image based on learned brain activity patterns. - show image

Answer 76

One of first expos = even and odd runs = while doing exp = give break to ppl, between showing them houses and faces If do not cross categorical divisions = have strong correlation of distributed patterns on the 2 runs correlation drops = if switch categories = concept of distributed representations - remove group of voxels corresponding to ffa but still decode if looking at face = distributed system involved in object recognition

Answer 77

● Collect fMRI scans of a participant while they view images from multiple known categories. - build model ● Define a feature space, e.g. a gabor wavelet pyramid for visual stimuli. ● Fit weights that show how each feature contributes to the neural signal at each voxel. ● Once trained, encoding models can predict responses to new, unseen stimuli. Comparing the predicted responses with actual fMRI data allows researchers to assess the accuracy of the model and understand the representational structure of the brain region under study. - model has feature set that is rich enough to understand what brain processing

Answer 78

Fmri activity Feature space - multi dimensional encodes orientations, contrast Multiple learned weights by feature space for unseen images —> then gives you the predicted fmri activity = look at performance correlated with predicted and measure fmri activity in that voxel and make map of where in brain mdoel can explain activity

Answer 79

Stage 1 = model estimation = pyramidal hierarchy of Gabor patches = sinusoidal gratings with diff orientations spatial frequencies and positions in an image = gabors as feature space Stage 2 = image identification = measure brain activity for an image = can identify which image person looking at if mdoel successful at encoding right features Graph = correlation of measured voxel activity and predicted When pop = Megan taht mdoel has stronger correlation between predicted activity for one image and observed activity for same image Strong diagonal means mdoel very rarely better predicted response associated with other image

Answer 80

Representational similarity analysis Similar obejcts in world must have simialr representations in mind Study = judge states of USA by shape = if simialr Saw these 2 things high correlated = also by name but still reate according = has to form mental image of the state = multidimensional representational space where similarity encoded