lecture 4 - speech perception Flashcards

1
Q

comprehension skills =

A

reading (visual word recognition)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

speech perception harder than reading because…

A

greater memory demands
ambiguous signal
harder to segment
transitory (not permanent)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

categorical perception

A

an abrupt boundary between categorisation of 2 phonemes and stimuli that intermediate between the 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

decoding =

A

extracting discrete elements (phonemes or sounds)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

5 stages of speech perception

A

1) decoding
2) phoneme/syllable identification
3) word identification
4) interpretation
5) meaning of current sentence is integrated with preceding speech to construct overall message

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Mattys et al: 2 types of adverse conditions =

A

1) energetic masking = distracting sounds cause intelligibility to be degraded (other voices/noise)
2) informational masking = cog load makes speech perception harder (affects top-down processing)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

co-articulation

A

pronunciation of phoneme depends on preceding and following phonemes e.g. ‘bill, ‘rub’ - increases variability in signal BUT allows you to predict next phoneme
more co-articulation within words than between them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

assimilation

A

phonemes take on acoustic properties of neighbouring phonemes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

stress

A

in English, initial syllable of most words is stressed - strings of words without initial stress are misperceived

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Mattys et al’s Hierarchical approach to segmentation: 3 categories of cues

A

1) lexical (syntax, word knowledge or semantics) - optimal interpretative conditions
2) segmental (e.g. co-articulation) - poor lexical info
3) metrical prosody (e.g. word stress) - poor segmental info
(LSM - i love matty so much)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

McGurk effect

A

when watching someone say /ga/ with sound /ba/ we hear /da/
multimodal perception - relying on speech - triggered by automatic bottom-up proces triggered by discrepant visual and auditory signals
effect is stronger when crucial word is presented in semantically congruent way

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

2 extreme positions of context effects

A

‘interactionist account’ = context affects processing at early stage & influences word perception
‘autonomous account’ = context affects later processing - can only contribute to evaluation and integration of lexical processing not its generation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

phonemic restoration effect (context effect)

A

evidence that sentence context can influence phoneme perception e.g. phoneme replaced with a cough - perception of that word was influenced by the sentence it was in

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Ganong effect (AKA lexical identification shift) (context effect)

A

tendency to perceive ambiguous sound as a phoneme that would complete a real word rather than completing nonsense words

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

TRACE model (McClelland & Elman)

A

bottom-up and top-down pricessing interact flexibly in word recognition - all sources of info are used at same time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

TRACE model assumptions:

1) there are prcessing units/nodes at 3 different levels:

A

1) features (voicing/manner of production); 2) phonemes; 3) words
feature words are connected to phoneme nodes which are connected to word nodes - connection between nodes operate in both directions and are always facilitatory
connections between nodes at same level = inhibitory & bidirectional (once a unit is activated it inhibits its competitors)
nodes influence each other in the strength of their interconnections

17
Q

TRACE model: process

A

as excitation and inhibition spread across nodes, a pattern of activation develops
the word identiied = determined by activation level of possible candidate words

18
Q

TRACE model: bottom-up and top-down activation

A

bottom-up activation = proceeds upwards from feature level to phoneme to word level
top-down = from words to features

19
Q

activation from word level to phoneme level would facilitate phoneme detection =

A

word superiority effect (evidence for top-down processing)

20
Q

TRACE model can explain Ganong effect

A

top-down activation from word level is responsible

21
Q

TRACE model can explain categorical perception

A

discrimination boundary becomes sharper because of mutual inhibition between phoneme units at phoneme level - one phoneme becomes increasingly activated whilst others become inhibited (evidence for top-down processing)

22
Q

TRACE model criticisms

A

1) attaches excessive importance to top-down processes (mispronunications have strong negative effect)
2) may be too flexible to be tested
3) model was tested only on small set of short words and is yet to be tested on more complex vocabularies

23
Q

TRACE model support

A

1) copes well with noisy input (emphasis on top-down)
2) accounts for:
Categorical speech perception
Lexical identification shift
Word superiority effect in phoneme naming
Word frequency effect

24
Q

Cohort model (Marslen-Wilson & Taylor) - 3 stages

A

focus more on bottom-up processes
3 stages:
1) access stage (all words conforming to sound sequence so far become active = the cohort)
2) selection stage (words are eliminated if cease to match further info from presented word or inconsistent with context)
3) integration stage (semnatic and syntactic properties of chosen word are used to integrate it into sentence

25
Q

uniqueness point (cohort model)

A

point when only one word is consistent with accoustic signal

26
Q

parallel interactive processing (cohort model) =

A

various sources of info are processed in parallel, allowing for their interaction

27
Q

isolation point (cohort model) =

A

point in a word where proportion of listeners identify word correctly but may not be confident about it (typically earlier than uniqueness point - biased by context)

28
Q

lexical access (cohort model) =

A

point at which all info about a word becomes available following recognition

29
Q

integration (cohort model) =

A

at start of comprehension process, proper, semantic and syntactic proerties are integtrated into higher level sentence representation

30
Q

revised cohort model:

A

words vary in their activation
original = context influences selction stage; revised = context only influences later integration stage (greater emphasis on bottom-up)
Word-initial cohort may include words with similar (rather than exact) matches to the initial phoneme

31
Q

shadowing task (cohort model) =

A

pps listen to speech containing distortions & repeat it back
50% of the time participants repeat back as it should be, without the distortion
Most frequent when distortion was slight, on final syllable and word predictable from context

32
Q

Listening for mispronunciations task (cohort model) =

A

pps listen to speech where a sound is distorted & detect changes
Participants are more sensitive to changes to the beginning of the words (e.g., poot – boot)

33
Q

common ground and egocentric heuristic

A

we make more use of common ground when talking to people we are familiar with than listening to strangers