Word Recognition Flashcards
COHORT’s assumptions (per TRACE)
- Uses first sound or first CV to determine words in initial candidate set (“cohort”).
- Eliminates words from cohort as successive phonemes arrive.
- Elimination works via phoneme-to-word inhibition (a mismatching phoneme inhibits the word).
- Words can also be eliminated based on semantic content, but initial cohort is determined by acoustics.
- Word recognition occurs when there is a single item left in the candidate set.
- Word recognition can influence phoneme identification after the word has been recognized.
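The elimination process above can be sketched with a toy, letter-based lexicon (real COHORT operates over phonemes; the word list and the `cohort_recognize` helper here are hypothetical illustrations, not the model's implementation):

```python
# Minimal sketch of COHORT-style candidate elimination.
# Words enter the cohort if they match the first segment; each later
# segment prunes mismatching words. Recognition = one candidate left.

LEXICON = ["trespass", "tress", "trestle", "trend", "tread"]

def cohort_recognize(phonemes):
    """Return (recognition_step, survivors) as segments arrive one by one."""
    cohort = [w for w in LEXICON if w.startswith(phonemes[0])]
    for i in range(1, len(phonemes) + 1):
        prefix = "".join(phonemes[:i])
        cohort = [w for w in cohort if w.startswith(prefix)]
        if len(cohort) == 1:
            return i, cohort  # unique candidate: word recognized
    return None, cohort  # still ambiguous at end of input

step, survivors = cohort_recognize(list("tresp"))
# "tres" still leaves {trespass, tress, trestle}; "tresp" isolates trespass
```

Note that once a word is pruned it never returns, which is exactly the recovery problem listed under COHORT's problems below.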
COHORT person
Marslen-Wilson, circa 1980
Good evidence for COHORT
In gating, the uniqueness point usually matches the participant’s acceptance/certainty point.
Confirms COHORT’s prediction that words are recognized when just one item remains in cohort.
COHORT’s problems (per TRACE)
- Cannot cope with distortion or underspecified onsets
- No way to recover items that were removed from cohort
- We want to reject “present” for pleasant but accept the distorted “blacelet” for bracelet; once bracelet is pruned at the mismatching onset, COHORT cannot recover it.
- Assumes listeners always know where a word’s onset is.
COHORT’s three stages of word recognition
- Access (when the bottom-up perceptual input first activates lexical representations)
- Selection (narrowing down the activated candidate set)
- Integration (retrieve and integrate semantic/syntactic details)
COHORT
- accesses all items consistent with input and
- evaluates the multiple words in parallel,
- using information as it becomes available.
COHORT’s segmentation strategy
Segmentation is implicit.
Utterance onset marks onset of first word.
Offset of each word marks onset of next word.
Controversial features of TRACE
Spatializing time, so lots of units are duplicated.
Assuming interactive effects between layers.
Activation in TRACE
It’s continuous, based on how acoustic/phonetic features map onto lexical representations.
Supports partial activation of rhymes because they are partial bottom-up matches.
Interactivity in TRACE
- Top-down connections from lexical items to phonemes.
- The top-down connections from phonemes to feature detectors are usually disabled.
Competition in TRACE
Temporally overlapping units in the phoneme and lexical layers inhibit one another.
How TRACE models time
Units in phoneme and lexical layers are repeated every few time slices.
It spatializes time.
Coarticulation in TRACE
Input phoneme’s features are spread over 11 steps, but the centers of adjacent input phonemes are 6 steps apart.
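Those two numbers can be sketched directly (only the 11-slice spread and 6-slice spacing come from the card; the `phoneme_slices` helper and the uniform span are illustrative simplifications of TRACE's ramped input):

```python
# Each input phoneme's features span 11 time slices; centers of adjacent
# phonemes sit 6 slices apart, so neighboring phonemes overlap in time,
# a crude stand-in for coarticulation.

SPREAD, SPACING = 11, 6

def phoneme_slices(index):
    """Set of time slices covered by the index-th input phoneme."""
    center = index * SPACING
    half = SPREAD // 2  # 5 slices on each side of the center
    return set(range(center - half, center + half + 1))

overlap = phoneme_slices(0) & phoneme_slices(1)  # slices shared by neighbors
```

Adjacent phonemes share 5 of their 11 slices, so the input at any slice mixes features from more than one phoneme.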
TRACE’S acoustic features
- Acute
- Burst
- Consonantal
- Diffuse
- Power
- Vocalic
- Voiced
Each feature has nine levels of activation, and each level has a feature detector at every time slice.
So there would be Voiced0, Voiced1, …, Voiced8 feature detector units at each step.
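A quick tally of the detector inventory the card implies (the `Voiced8`-style names follow the example above; the comprehension itself is just illustration):

```python
# 7 features x 9 activation levels = 63 feature-detector units per time slice.
FEATURES = ["Acute", "Burst", "Consonantal", "Diffuse", "Power", "Vocalic", "Voiced"]
LEVELS = 9

units_per_slice = len(FEATURES) * LEVELS
detector_names = [f"{name}{level}" for name in FEATURES for level in range(LEVELS)]
```

This whole bank is then duplicated at every time slice, which is the unit-duplication cost of spatializing time noted above.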
Shortlist’s main idea
- At each phoneme time-step, a shortlist of matching words is generated
- Words in the shortlists that overlap in time compete via lateral inhibition
- Separates lexical access (shortlist formation based on match scores) from competition (overlapping words across the lists inhibit each other)
Shortlist’s scoring system
- 1 point for each matching phoneme
- -3 points for each mismatching phoneme
- Strong mismatch penalty will keep mostly onset-matching items in the shortlist.
- Rhymes will only appear in the list
- when shortlist is sparse and
- when there have been multiple matching phonemes to overcome initial mismatch.
- There is a shortlist at each phoneme timestep, consisting of words with top match scores.
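The +1/−3 scheme can be sketched directly (letters stand in for phonemes; `match_score` and the example words are hypothetical, not Shortlist's actual lexicon):

```python
# Shortlist-style match scoring: +1 per matching phoneme, -3 per mismatch,
# evaluated against the input heard so far.

MATCH, MISMATCH = 1, -3

def match_score(word, heard):
    """Score a candidate word against the segments heard so far."""
    return sum(MATCH if w == h else MISMATCH for w, h in zip(word, heard))

# After hearing "catt": an onset match leads, while a rhyme must earn
# several matches just to climb out of the initial-mismatch penalty.
onset_score = match_score("cattle", "catt")  # 4 matches
rhyme_score = match_score("battle", "catt")  # 1 mismatch + 3 matches
```

The asymmetry (4 vs. 0 here) is why mostly onset-matching items survive into the shortlist.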
Simple recurrent networks
- Learning network
- Layers for input units, hidden units, context units, output units
- Context units are an exact copy of the last time step’s hidden units
- Hidden units combine information from input and previous state.
- Interactive in the sense that the context units interact with the input at the hidden layer.
- Recurrence is self-feedback: the hidden state feeds back to itself via the context units
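A minimal forward pass makes the context-copy explicit (pure Python; the two-unit layers and weight values are hypothetical, and real SRNs learn these weights by backprop):

```python
import math

def srn_step(x, context, W_in, W_ctx):
    """One SRN step: hidden combines current input with the context units."""
    hidden = []
    for row_in, row_ctx in zip(W_in, W_ctx):
        s = sum(w * v for w, v in zip(row_in, x))        # input contribution
        s += sum(w * v for w, v in zip(row_ctx, context))  # previous-state contribution
        hidden.append(math.tanh(s))
    return hidden

W_in = [[0.5, -0.2], [0.1, 0.3]]   # hypothetical input->hidden weights
W_ctx = [[0.2, 0.0], [0.0, 0.2]]   # hypothetical context->hidden weights
context = [0.0, 0.0]               # context starts empty

for x in ([1.0, 0.0], [0.0, 1.0]): # two one-hot "input" steps
    hidden = srn_step(x, context, W_in, W_ctx)
    context = list(hidden)         # exact copy of hidden units into context
```

The `context = list(hidden)` copy is the whole trick: the next input is processed together with a snapshot of the previous hidden state.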
Overtraining in SRNs
- If you train an SRN until error asymptotes, it will not show rhyme effects
- If you train until each target reaches a recognition threshold, rhyme effects remain intact.
- Adults learning novel neighborhoods look like these SRNs (Magnuson et al. 2003)
Localist representations
One unit for each word.
Competing units compete as their activation changes over time.
Distributed representations
- All items are represented by a shared set of units.
- Competition shows up in the blend of hidden representations.
- Predicts priming effects: an ambiguous input activates a blend of competitors.
Distributed Cohort Model
SRN but two output layers: phonological and lexical semantics
Distributed representation of phonological and semantic features in the hidden units
Cross-Modal Semantic Priming
- Castle-candy-sweet priming
- Castle activates its cohort (including candy), which in turn activates semantic associates (including sweet).
- Works for cohorts but not for rhymes
- Cohort members don’t get much inhibition at the start of the word
- By the time rhymes have perceptual support, the onset cohort is already strongly inhibiting them
Naming / Repetition / Shadowing
- Play a wordlike stimulus. Listener repeats word.
- Measure accuracy and response latency.
- Word frequency and neighborhood density influence outcome measures.
- Unlike real recognition
- because it’s post-perceptual
- because the listener may pay close attention to the speech sounds (without semantic processing).
Lexical Decision
- Play a wordlike stimulus. Listener decides whether stimulus is a word or not.
- Measure accuracy and response latency.
- Sensitive to
- word frequency (more false rejects of less common words)
- neighborhood density (probably: decisions are slower when the stimulus overlaps with many words).
- Unlike real recognition because
- it’s a post-perceptual judgment about the stimulus.
- a listener can make the decision without knowing the word’s meaning (I can recognize tamp but I don’t know what it means).
Gating
- Early gates elicit a wide variety of guesses (many words activated)
- Uniqueness point: when only one completion of the word remains
- Recognition point: when participants reliably guess the correct word
- Recognition can precede uniqueness, especially for higher-frequency words