computational models of speech perception Flashcards
challenges for lexical access
- speech is a continuous stream
- homonyms
–> spelt and said the same but different meanings - homophones
–> spelt differently, said the same, different meaning - co-articulation
- different Accents
- invariance problem
–> problems of definition of acoustic properties (e.g. phonemes, syllables, words) - ambiguity in word boundaries
–> e.g. four candles vs fork handles
ways to disambiguate the speech stream
- categorical perception
- voice onset time
- perceptual learning
- top down processing
categorical perception
ability to distinguish between sounds on a continuum based on voice onset times
voice onset time
vocal cord vibration
–> VVVVa vs FFa
perceptual learning
- adjust categorical perception based on sounds we hear
- seems to be hard wired
–> babies and primates can do this
top down processing
- e.g. “The state governors met with their respective legislatures convening in the capital city.”
- cough disguising one sound in “legislatures” - - top down processing allows us to recognise the word, and not identify which sound was blocked by the cough
spreading activation
- facilitates predictions of what may be coming up next via activation of items that are related to the acoustic input
- when hearing ‘apple’
–> appeal
–> apron
–> appear - all may be activated in the lexicon
what lexical characteristics affect the speed of lexical access?
- word length
–> long words are slower to process - neighbourhood density
–> words with lots of neighbours are processed more slowly - frequency
–> more frequently a word is accessed in the lexicon, the quicker you can access it
lexical access is based on?
- bottom Up (acoustic input)
- top down processing (disambiguating the speech stream)
- lexical characteristics
- context
- spreading activation that facilitates predictions
mechanics of lexical access (3 options)
- gradually activate the word that matches the acoustic input
- activate all words that start with the same sound as the acoustic input and gradually de-activate words that no longer match the sounds
- gradually activate the word that matches the acoustic input more than other words
option 1 for lexical access
- e.g. word = apricot
–> we break up each sound
–> ay - pri - cot
option 2 for lexical access
- e.g. word = apricot
- as soon as we hear ‘ay’ we might think of:
–> april
–> ape
–> apricot - as we continue to hear the word we de-activate words that no longer match
–> after ay - pri is heard ‘ape’ is de-activated
–> after ay - pri - cot is heard ‘april’ is de-activated
option 3 for lexical access
- e.g. word = apricot
- when we hear ‘ay’
- we might activate:
–> ape
–> apricot
–> april
–> say
–> pay - as we hear more of the word, we activate the word that matches the word more than others
–> e.g. apricot more than say
2 models of speech comprehension
- The Cohort Model
–> Marslen-Wilson (1987)
–> predicts that we access words in the lexicon via activation of all words sharing initial features and gradually de-activate words that stop matching the features (option 2) - The Trace Model
–> Elman & McClelland (1999)
–> predicts that features activate phonemes that activate words with a gradual increase in activation of words that match all features so that the word with the most activation wins (option 3)
The Cohort Model
- we activate words in the mental lexicon (the cohort) that match the input
–> e.g. the ‘ay’ in apricot can activate apple, apart, apricot, apex, april, ape - we then gradually de-activate items that fail to match the input
- we reach a uniqueness point (we now know the word - only one word)
- items that do not match the onset of the word (‘ay’ in apricot) are not activated
neighbourhood effects in The Cohort Model
- words that match the acoustic input compete for activation
–> neighbours compete with each other for recognition - learning the word ‘aprikol’ will slow down the recognition of the word apricot
frequency effects in The Cohort Model
- words with high frequency have high resting states
–> less activation required to recognise high frequency words
–> e.g. apricot would be recognised more quickly (ms) than a low frequency word (aprikol)
evidence of The Cohort Model
- Gating experiments
–> Ps are presented with fragments of words that gradually reveal the whole word and asked to guess what the word is after each presentation
–> e.g. john went to the zoo and saw a ‘ca…’
–> cap
–> cat
–> kangaroo - gradually more of ‘ca’ is told to help guess the word
early findings of Gating Paradigm (Grosjean, 1980)
- presented the word stretcher using the paradigm
what do Gating experiments suggest?
- recognition of a word is a gradual process that starts from word onset and continues until the end of the word
- candidate words that no longer fit the acoustic input are eliminated