voice and speech perception Flashcards
voice reveals a lot about you
- how people recognise you
- how social you are
- trustworthiness
- emotions
- stress
- build (higher voice smaller vocal tract and build)
voice and trustworthiness
women = men are more trustworthy when deeper pitch at end men = women are more trustworthy when higher pitch at end
associate voices with social traits study
McAleer et al (2014)
- cronbach alpha found people constantly agree on whether a voice sounds attractive/confident etc
- principle dimensions = trustworthiness and dominance
- independent
- simplifies from 10 dimensions to 2
acoustic profiles
pitch = fundamental frequency intensity = amplitude tempo = speed
can manipulate these independently
emotions expressed in distinct acoustic profiles
Enjoy, elation, sad and greif
enjoy = high pitch low intensity low tempo elation = high pitch high intensity high tempo sad = low pitch low intensity low temp grief = high pitch high intensity high tempo (same as elation)
vocal emotions = shared by culture or language specific
Scherer et al (2001)
- Ps judged emotion sentence (by german actor) conveyed
- european more accurate than indonesia
- low consistency between indonesian judges
temporal voice area
- discovered by Belin et al (2000)
- voice and non-voice played to Ps and matched for levels of energy
- brain maps created - 2 large areas found
brain regions in identifying speaker identity
discovered using neural adaption - repeating speaker identity or syllables
- anterior part of temporal lobe
brain regions in vocal emotion identification
dichotic listening (angry and neutral voices)
- asked Ps to pay attention to one
- within TVA increased activation when pay attention to angry
- when fake angry noises using white noise - not found - unique to human voices
are voice areas necessary for voice processing?
TMS - less correct responses as to whether a voice or not when TVA stimulated
no effect of loudness discrimination
speech processing = top-down
influenced what you focus on based on prior knowledge
e. g., vocoded speech
- when you know what it says can hear it
broca’s area damage
can’t speak properly
can understand
so left inferior frontal region = important in speech production
wernicke’s area damage
can’t understand speech
can speak fluently
damage to left posterior region = important in speech comprehension
rotated speech
inverted spectrograph
full spectral info
bad intelligibility
vocoded speech
speech divided into frequency bands bad spectral info partial intelligibility (once learnt)
left temporal regions respond more to
intelligible speech
normal speech
full intelligibility
full spectral info
rotated vocoded speech
divided into frequency bands and inverted
bad spectral info
bad intelligibility
right temporal regions respond to
spectrally rich
spectral uncertainty principle
trade off between time and frequency
less precise in time - more precise in frequency = important in
music
more precise in time - less precise in frequency = important for
speech
McGurk effect
same sound + different mouth movement = hear different
efference copy = copy of motor command sent to auditory centre - what sound would have been heard by that mouth movement
combined
auditory and visual system engaged
neural oscillations for speech segmentation
only hear words when brain oscillations coordinate with them
Ps listen to syllables at different frequencies
4.5Hz = enhanced communication between auditory and motor systems
as most human languages have a rate of 5 syllables a second = 4.5Hz
how pathway
dorsal
connects speech with multisensory regions through arculate fasciculus
what pathway
ventral
semantic processing of speech