Visual perception Flashcards
What are the stages of visual perception?
1) Environmental stimulus
2) Light transformation
- 1st transformation and representation
3) Receptor processing
- 2nd transformation and transduction
4) Neural processing
- Retino-geniculo-striate pathway
5) Perception
- Knowledge
6) Recognition
- Knowledge
7) Action
What happens during light transformation? (2nd stage of visual perception)
1st transformation and representation
- Light waves are converted into retinal object representations:
- Light is focused on lens and reflected onto the retina
- Inverted object representation on retina
- Accommodation
Why is accommodation needed?
- Need to accommodate eye to perceive near and far objects
- As there is a fixed distance between lens and retina, lens needs to change thickness to accommodate to near and far objects (using ciliary muscles)
- Lens changes shape so that focus point is always the fovea
How does anatomy change during accommodation to a far object?
- Ciliary muscles are relaxed
- Suspensory ligaments are pulled tight
- Lens is thin
- Little curvature, little focusing power - light is only bent a little
How does anatomy change during accommodation to a near object?
- Contracted ciliary muscles
- Slack suspensory ligaments
- Thick lens
- Strong curvature, strong focusing power - light is bent a lot
Define accommodation
The process by which the eye changes optical power to focus on an object as its distance varies
What are far and near points in accommodation?
Far point - the maximum distance of an object from the eye for which a clear image of the object can be seen
Near point - the minimum distance of an object from the eye for which a clear image of the object can be seen
(Accommodation limit increases with age)
What happens during receptor processing? (3rd stage of visual perception)
2nd transformation and transduction
- Transduction - retinal object representation is turned into an electrical signal
- Light travels through all layers of eye wall, is then reflected on pigment epithelium
- Light sensitive chemicals within photoreceptors’ outer segments change when hit by light
- The chemical reaction sparks an action potential which travels to bipolar then ganglion cells
What types of vision are rods and cones involved in?
Rods - scotopic vision
Cones - photopic vision
Both - mesopic vision
What are the absolute numbers of rods and cones?
Rods = 120 mil
Cones = 6 mil
What are the densities and distribution in retina of rods and cones?
Rods - none in fovea, mainly in peripheral retina
Cones - most in fovea
What are the abilities for dark adaptation of rods and cones?
Rods - slowly adapt, but adapt more thoroughly (completely adapt after 20 mins)
Cones - quickly adapt, but plateaus after 8 mins
What are the absolute sensitivities of rods and cones?
Cones - lower
Rods - higher
What is the acuity of rods and cones?
Rods - worse as information about different light sources combines - the same ganglion cell receives input from multiple rods
Cones - better - detailed info about different light sources - separate ganglion cells receive input from separate cones
What is the spectral sensitivity of rods and cones?
Rods - more sensitive to shorter wavelengths - max 500nm
Cones - more sensitive to longer wavelengths - max 560nm
What is the colour vision of rods and cones?
Rods - dark adapted vision, operates at low luminance, no colour sensation
Cones - light adapted vision, operates at high luminance
S cones - respond to short wavelengths, blue
M cones - respond to medium wavelengths, green
L cones - respond to long wavelengths, red
What is the rod-cone break?
In adapting to low light - cones stop adapting but rods continue
What is neural convergence?
Many neurons synapse onto fewer neurons
- Multiple rod cells per ganglion cell
- One cone cell per ganglion cell
- As there are 1 mil ganglion cells, but 6 mil cones and 120 mil rods
- Group of rod cells create a bigger signal than a single cone cell
What is the Purkinje shift in spectral sensitivity?
Increased sensitivity to short wavelengths in dark-adapted eye
What is the diameter of the eye?
2.5cm
What is the retina?
Light sensitive surface on 75% of the inner eye
What is the sclera?
Outer layer of the eye; tough fibrous coat
What are the two key retinal landmarks?
1) Fovea - (macula) central vision, highest acuity, most detailed vision (most cones)
2) Optic disc - blind spot, usually unaware because blind spot of one eye corresponds to ‘seeing’ retina in the other eye
What parts of the eye are involved in the optical system for focusing?
Iris, pupil, cornea and ciliary muscles
What is macular degeneration?
Deterioration of the retina in the macula (fovea)
Irreversible blindness, loss of central vision
What is the electromagnetic energy of different light wavelengths?
Short wavelength - high electromagnetic energy
Long wavelength - low electromagnetic energy
Spectrum of visible light = 400-700nm
What is Myopia? (accommodation problem)
Nearsightedness
- Far objects out of focus, because lens is too thick or eyeball is too long
- Requires concave correction lenses
What is Hyperopia? (accommodation problem)
- Near objects are out of focus, usually because the eyeball is too short
- Requires convex correction lenses
What is the ganglion cell input during transduction? (Neural convergence)
- Neural convergence - each ganglion cell receives input from 126 photoreceptors (fovea = 1:1 , periphery = 1:100)
What is the receptive field?
The part of the visual field in which a stimulus can modify the neuron’s firing rate
- Input in receptive field either excites or inhibits a ganglion cell - depending on where in a ganglion’s receptive field the stimulus is located
What is the structure and function of on-center ganglion cells?
On area in the center - if stimulus falls here = excitation
Off area in the surround - if stimulus falls here = inhibition
What is the structure and function of off-center ganglion cells?
Off area in center - if stimulus falls here = inhibition
On area in surround - if stimulus falls here = excitation
How do excitatory and inhibitory areas affect neuron firing rates?
Excitatory area (+) - on area - increases the firing rate of a neuron when stimulated
Inhibitory area (-) - off area - decreases the firing rate of a neuron when stimulated
How does uniform illumination affect an on/off cell?
Both excitation and inhibition in center and surround = activation cancels out
What is the function of on/off cells?
To specialise our vision to edges of objects, since a luminance discontinuity (edge of object) causes activation
What are the three main types of ganglion cells?
1) Magnocellular (M)
2) Parvocellular (P)
3) Koniocellular (K)
Magnocellular ganglion cells - input and colour specificity?
- Receive input from mostly rods
- Not colour specific
Parvocellular ganglion cells - input and colour specificity?
- Input from M or L cones
- Colour specific (green or red on/off)
Koniocellular ganglion cells - input and colour specificity?
- Excitatory input from S cones
- Inhibitory input from M and L cones
- Blue = on
Where do the axons of M,P and K cells go to?
The optic nerve
What areas are involved in the retino-geniculo-striate pathway?
Retina
Lateral geniculate nucleus
Striate visual cortex
How is lateralisation in the optic chiasm structured?
- Nasal axons cross over to the other side of the brain
- Temporal axons stay on the same side
- Visual fields now represented in contralateral hemispheres (LVF in right hem, RVF in left hem)
Temporal representation of RVF in left eye, stays in left hemisphere, LVF in right eye, stays in right hemisphere
Nasal representation of RVF in right eye, crosses to left hemisphere at optic chiasm, LVF in left eye, crosses to right hemisphere at optic chiasm
Axons of which neurons are in the six layers of the lateral geniculate nucleus?
Layers 1 and 2: Magnocellular neurons
Layers 3, 4, 5 and 6: Parvocellular neurons
Koniocellular neurons interlayered
Through which structures does visual information travel during neural processing?
Starts at retina
Optic nerve
Optic chiasm (nasal representations cross, temporal representations stay)
Optic tract
Lateral geniculate nucleus/ lateral geniculate body (thalamus) (at the midbrain)
Optic radiations
Visual cortex
What are the striate and extrastriate visual areas?
Striate visual cortex = V1 (visual area one) = Primary visual cortex (stripy appearance)
Extrastriate visual areas:
V2, V3, V4, V5 (MT = middle temporal area), IT = inferotemporal cortex
What happens to neural convergence and neuron receptive fields as visual information travels through the cortex?
- As visual information travels through the cortex, there is ongoing neural convergence:
- The neuron’s receptive fields increase
Visual info gets more complex with ongoing integration
Some V1 neurons are orientation selective - what does this mean?
They have elongated receptive fields to capture edges in a particular orientation
They respond when a stimulus in their receptive field matches their preferred orientation (Hubel and Wiesel, 1959)
The less the stimulus matches the preferred orientation, the lower the neuron’s firing rate
- Receptive field not stimulated, no response
- Receptive field stimulated with preferred orientation, response
- Receptive field stimulated with non-preferred orientation, no response
What did Hubel and Wiesel (1959) find in their experiments with a sleeping cat?
V1 single cell recording
Inserted electrode into single neuron in V1 - wrong orientation - no neuronal response
Right orientation - neuronal response
What else are V1 neurons selective for other than orientation?
Other V1 neurons are motion direction selective - they respond when a stimulus in their receptive field matches their preferred motion direction
Other V1 neurons are selective for colour and brightness (circular receptive fields)
How big are V2 receptive fields compared to V1?
Twice as big
What stimulus features do V2 neurons respond to?
Same as V1 = orientation, motion direction, brightness
More complex features = length, angles, arcs, shapes, texture
Which areas are involved in the temporal (ventral) stream?
V1, V2, V4, IT (inferotemporal lobe)
Which areas are involved in the parietal (dorsal) stream?
V1, V2, V3, MT (middle temporal area)
What do the parietal and temporal streams differentially process?
Parietal (dorsal) stream processes object locations = where
Temporal (ventral) stream processes object identities = what
How did Ungerleider and Mishkin (1982) find evidence for two separate streams? (Monkeys)
Monkeys with parietal (‘where’) lesions could distinguish a cube from a triangle (object discrimination), while monkeys with temporal (‘what’) lesions could not.
Monkeys with temporal (‘what’) lesions could learn the position of an object (location discrimination), while monkeys with parietal (‘where’) lesions could not.
Perfect double dissociation - damage of brain areas leads to opposite pattern of impairments - shows those brain areas have separable functions.
What are the receptive fields and functions of V3 and MT in the parietal/dorsal (where) stream?
Motion perception:
- Receptive fields are 5 times larger in V3 and 8 times larger in MT than in V1
- Integrated information over a large area of the retina
What are the receptive fields and functions of V4 in the temporal/ventral (what) stream?
- Receptive fields are 5 times larger in V4 than in V1
- V4 neurons respond to object-defining features such as colour, orientation, complex shapes, texture
What are criticisms of seperate parietal/dorsal and temporal/ventral streams?
- How useful is the separation between two visual processing streams?
- The two streams seem to interact heavily
- e.g. reaching and grasping is more precise towards familiar objects
- Directional movements controlled by dorsal stream are influenced by object recognition in the ventral stream
What is sensation?
The un-interpreted sensory impressions created by the detection of a stimulus
What is perception?
The psychological and cognitive processes of making sense of sensations
What is structuralism? (Late 19th century - father and method?)
- The study of the elements of consciousness: conscious experience (perception) can be broken down into basic elements (sensory elements), which can then be combined again to describe any human experience
- Father of Structuralism: Wilhelm Wundt, 1879, started the first psychological laboratory in Leipzig, Germany
- Method: introspection
- Goal: describe elements of perception (sum of sensory events)
What evaluation is there of structuralism?
:) - First ‘school of thought’ in psychology
:( - Validity - no measures, purely descriptive
:( - Reliability - observations are not consistent and constant
:( - Objectivity - observations depend on observer - biased
What is psychophysics? (Gustav Fechner, 1860)
Measuring rather than describing (structuralism) elements of perception
Goal - objective measure of perceptual elements
Basic idea - measuring the absolute perception threshold (just noticeable intensity to detect a stimulus)
What is the method of adjustment in psychophysics?
Participants adjust the intensity of a light, until they are just able to perceive the light.
e.g. Dark adaptation experiment - The threshold of detecting light stimulus differs based on how adapted participants have become to dark conditions
What is the method of limits in psychophysics?
Participants are presented with trials of increasing/decreasing light intensity and asked if they can perceive the light.
Trials with ascending light intensity - point at which they start to perceive light
Trials with descending light intensity - point at which they stop perceiving light
Both ascending and descending are done in many trials and mean perception threshold is calculated from each absolute perception threshold
Stimulus is either detected 100% or 0%
What is the method of constant stimuli in psychophysics?
Like the method of limits (same task), but:
- Many more trials per light intensity
- Randomised light intensity across trials - no stimulus order (eliminates confounding variable)
Percentage of yes answers to perceiving stimulus is plotted as a function of stimulus intensity
Results in psychometric function - a fine grained threshold: probability with which a stimulus of certain intensity is detected
What is psychophysics? (Ernst Weber, 1834)
Basic idea - measuring the difference threshold (minimum intensity difference to discriminate two stimuli)
What is Weber’s law? (psychophysics)
The change in a stimulus to be able to discriminate it from another stimulus, is a constant ratio of that original stimulus
i.e. the brighter light 1 is, the greater the required brightness change for light 2 to be perceived as different
Required brightness change has a constant ratio
What are examples of Weber’s law in the real world? (Coffee, pub)
Can you tell the difference between coffee with 1 or 9 cubes of sugar and coffee with 8 or 9 cubes of sugar?
When in a pub with many people, you have to shout to be heard but in a library you only need to whisper
What is psychophysics? (Stanley Stevens, 1957) (Response expansion and compression)
Basic idea - subjective magnitude estimation (measuring the relation between stimulus intensity (objective measure) and perceived intensity (subjective experience)
- An increase in the perceived stimulus intensity can be larger (response expansion) or smaller (response compression) than the increase in the measured stimulus intensity
What is Steven’s power function? (Psychophysics)
P = KS (to the power of) n
P = perceived intensity
K = constant (scales values to size)
S = stimulus intensity
n = exponent
What is the subjective magnitude estimation of brightness and pain?
Brightness - big difference but we don’t perceive it to be that different
Pain from electric shock - small change produces much bigger subjective pain response
What evaluation is there of psychophysics?
:) - Valid, reliable, and objective measures (hard data)
:( - No theoretical account of perception (unclear how we perceive)
What is James Gibson’s (1966) Ecological theory of perception?
Perception is direct (perception = sensation)
‘The real world is rich in sensory information and provides sufficient context for the visual systems to directly perceive what is there’
- Perception must be investigated in a natural environment as perception is directly offered by the world (sensation)
- Goal - explain how we attach meaning to sensory input
- Basic idea - perception takes place in the optic array (light in the environment) and is directly based on invariant information in the visual field, which is extracted by the observer’s movement
- i.e. as an observer moves, the optic array becomes ambient (the light surrounding the observer changes), but some information remains constant (invariant)
What is optic flow pattern? (Invariant visual information) (Driving example)
The focus point of a driver remains motionless while the rest of the visual field appears to move away from this point
What is texture gradient? (Invariant visual information)
The elements of a textured ground are denser in the distance
What is horizon ratio? (Invariant visual information)
The proportion of the object above and below the horizon is constant for objects of the same size standing on the same ground
What evaluation is there of Gibson’s ecological theory of perception?
:) - Theoretical account - tries to explain how perception and object recognition work
:( - Assumes that perception, offered by the real world, is fast, readily available and effortless, and always accurate
What type of accounts of perception are structuralism, psychophysics and ecological theory?
Bottom-up accounts
What is affordance of objects?
When we combine our physical psychological states with our constantly changing arrays, we are enabled to recognise not only what it is but what it does.
e.g. coffee mug = affords grabbing it (handle), drinking from it (cupped shape), enjoying it (smell)
Does retinal image fully determine perception?
No - What we perceive is not always what is represented on our retina
What is the Fraser spiral illusion?
James Fraser (1908)
- Looks like spirals, actually circles
What is the vertical-horizontal illusion?
Vertical looks longer - bisecting lines are perceived as being longer than bisected (interrupted) lines
What is the irradiation illusion?
(Hermann von Helmholtz, 1867)
White inner area of black shapes looks bigger than black inner area of white shapes
- Light areas appear to be larger than dark areas, because light from a white region irradiates adjacent dark regions
What is equivocal perception? (Ambiguous figures)
Perceiving different objects from the same retinal image
What is figure ambiguity?
- Retinal image does not change, but we perceive different objects
- Focus on foreground or background
e.g. vase or two faces?
What is feature ambiguity?
- Same feature can be different parts of different objects
e.g. duck or rabbit?
What is depth ambiguity?
- 3D perception changes in the same retinal image
e.g. Necker cube (Louis Albert Necker, 1832) - where is the front and where is the back?
How do visual illusions critique Gibson’s ecological theory of perception and other bottom up accounts?
Bottom up accounts cannot explain these illusions
Retinal images do not fully determine perceptions
However, in Gibson’s view - such illusions and figures are carefully constructed to mislead us and do not exist in the real world therefore they have no ecological validity
However, real world visual illusions exist too
What is the moon illusion?
The moon appears larger when it is close to the horizon as compared to when it is high up in the sky
What is the waterfall illusion?
Motor aftereffect: after observation of motion in one direction (waterfall for 30-60 seconds) stationary objects (e.g. trees) appear to move in the opposite direction
Neurons adapt to the motion we observe, so we don’t find it too disturbing, this effect lasts so neurons are confused with the lack of motion of stationary objects
What is the wagon wheel effect?
- Stroboscopic effect: a moving wheel appears to stand still or move in opposite direction to its true rotation
Motion adaptation of neurons
Are all retinal images ambiguous and what is the inverse projection problem?
All retinal images are ambiguous
The inverse projection problem - a 3D object is represented on a 2D retinal surface.
The same pattern of light on the retina can be caused by different objects.
The real-world 3D object cannot be derived from the retinal image
- Superimposed objects and untypical angles
What does the existence of perceptual illusions show about perception?
- Perception is more than just sensation - sensory information from the retina is insufficient, perception is a matter of interpretation