MIDTERM Flashcards
study of speech using instruments to visualize and measure various aspects of speech
experimental phonetics
a sound wave is a_____because the movement of air particles is in the same direction as the direction of the sound
longitudinal wave
articles of the medium crowd closer together
compressions
particles of the medium spread farther apart
rarefactions
presents the sound as a unified entity
waveform
presents the individual components of the sound at a certain time point
spectrum
presents the individual components of the sound over time
spectrogram
calculate period
T = 1/f
calculate frequency
f = 1/T
a sound with a higher frequency has ____ cycles per second
more
shorter vot
voiced stop
longer vot
voiceless stop
/i/
periodic, complex
/z/
periodic, complex
/t/
aperiodic transient
/s/
aperiodic, continuous
horizontal axis in a waveform
time
vertical axis in a waveform
amplitude
horizontal axis in a spectrum
frequency
vertical axis in a spectrum
amplitude
horizontal axis in a spectrogram
time
vertical axis in a a spectrogram
frequency
/u/ f1 and f2
low f1, low f2 (closed, back)
/i/ f1 and f2
low f1 and high f2 (closed, front)
/æ/ f1
high f1 (open)
we can hear sound better in a vacuum chamber because there is no background noise
false
sound in water feels different from sound in air because sound travels slower underwater than in the air
false
when the slope of the source spectrum is steeper, the output sound is softer
true
f0 is the first harmonic
true
fundamental frequency means the first formant
false
the dark horizontal bars on the spectrogram indicate that the sound involves
resonance
the thin white vertical bars on the spectrogram indicates that the sound is
voiced
nasal stops are transient waves
false
vowels before voiceless consonants are generally shorter than those before the voiced counterparts
true
a negative VOT correlates with a voiceless stop
false
obstruents include both oral and nasal stops
false
human perception is better described by a ___scale
logarithmic scale
which of the following is not a logarithmic scale?
hertz or decibel
hertz
a ____ sound is such that its frequency is above the normal range of human hearing
supersonic
0 dB means silence
false
0 Hz means silence
true
the intensity of a sound is 70 dB is 10 times greater than that of a sound at 60 dB
true
a sound at 70 dB is perceived 10 times louder than a sound at 60 dB
false
what is the frequency of a sound that is one octave higher than a 100 Hz sound
200 Hz
production experiment target analysis:
speech
perception experiment target
responses to speech stimuli
pilot experiment
a small‐scale preliminary study to evaluate feasibility of the
research
prosody defined by form
comprises the suprasegmental aspects of the
speech stream
acoustic aspects of prosody
fundamental frequency, duration and intensity
prosody defined by function
phonetic and phonological properties of speech
that are NOT related to the choice of lexical items
functions marked in an utterance by prosody
- Syntactic structure
- Speech act
- Turn taking
- Emphasis
- Rhythm
- Emotion and attitude
rhythm
Repeated pattern of sound units
isochrony
The rhythmic organization of speech into equal intervals, languages can be categorized based on isochrony pattern
stress-timed language
The interval between two stressed syllables is equal, english and german
Syllable‐timed languages
The duration of each syllable is equal, french and spanish
prosodic boundary
immediate common answer is “pauses”, but it’s a broader
term
prosodic grouping
utterance of more than two words in it often has a perceptible
sub‐grouping.
_____is the main cue to boundaries, most reliable cue in disambiguating syntactic
structures
duration
main durational cues affecting boundary perception:
- Pauses
- Pre‐boundary lengthening
The presence of a ____ is a strong cue to
boundaries.
pause (silent interval)
Pre‐boundary lengthening
Segments are lengthened before boundaries.
Pitch excursion at the
boundary is commonly
analyzed as___tones
boundary tones
Some boundary tones are often linked to the_____(not only syntactic structure). They are treated as intonational
morphemes
semantic meaning
At prosodic boundaries, the intensity of speech becomes
stronger.
false
Stronger boundaries are associated with ____ intensity
lower
Intensity _____ at boundaries.
decreases
Prosody can signal boundaries in a _____ structure, thus
can help syntactic disambiguation.
syntactic
Prosody can signal the_____in the utterance
emphasis
what are the acoustic correlates of prominence?
- fundamental frequency
- duration
- intensity
______ is often said to be the best acoustic
correlate of prosodic prominence in English.
Intensity (loudness)
_____ duration signals word stress in English
Longer
____ duration signals emphasis
Longer
________(the most prominent element in the sentence) can be
signaled by an increase in pitch range.
Focus
an abstract concept to indicate the prominence of a
syllable in the word
stress
actual phonetic realization of the
prominence in terms of pitch movement
Pitch Accent
If a word is prominent in a sentence, this prominence is
realized as a ______ on the stressed syllable of the
word.
pitch accent
Focus can be signaled by an _____ in pitch range
increase
Post‐focal material is realized with a ______ pitch range
reduced
In the word “permit” as a noun, the lexical stress always falls
on the first syllable.
true
_____ is a lexical property defined for each word
Stress
_______ is a sentential phenomenon
Prosody
statement: pitch ___
fall
question: pitch____
rise
post-focus: pitch____
compression
In a post-focus position, a word loses its lexical stress.
false
In a post-focus position, a word can appear without pitch
accents.
true
data are we get from the experiment as the
results
Dependent Variables or Response Variables
things we’re going to manipulate or control the changes in them.
Independent Variables or Explanatory Variables
Everything that can affect the results but are not in the scope
of the research question
Constants of the experiment
Different values that an independent variable can take on are
called
levels
Syntagmatic relation
- Relation between the target of analysis and the surrounding
context - Positioning
Paradigmatic relation
*** Relation between the target of analysis and its alternatives
* Substitution
There is no reliable acoustic property that constantly
indicates lexical stress across conditions.
Syntagmatic observations - conclusion
Acoustic properties relevant to lexical meaning often require
paradigmatic analysis
Acoustic properties relevant to sentential intonation often require
syntagmatic analysis
The analysis presents all data points accurately
Observational adequacy
The analysis provides significant generalizations of the observed data
Descriptive adequacy
The analysis provides a principled basis for the choice between
competing descriptions.
* The analysis has predictive power.
Explanatory adequacy
Easy boundaries:
- Oral stop [p, b, t, d, k, g] + Vowel
- Sibilants (= strong fricatives) [s, ʃ, z, ʒ] + Vowel
Difficult boundaries:
- Approximant [w, j, r, l] + Vowel
- Weak voiced fricatives [v, ð] + Vowel
The best consonant for prosody research
Nasal stops are good for both duration and F0 investigation
Reliable pitch track
- Pitch track must be smooth
- For a continuous pitch track, use sonorant sounds
In a post‐focus position, a word loses its lexical stress.
false
In a post‐focus position, a word can appear without pitch
accents.
true
When designing the stimuli, it is good to place the target
word of analysis at the end of the sentence.
false
Data we obtain from the participants: speech sounds
Production experiments
Data we obtain from the participants: judgment on the sounds
Perception experiments
Perceptually judging an object is a way of measuring the
object
scale
- Assigning discrete categories to objects
- The categories are not ordered
Nominal Scale
- Assigning discrete categories to objects
- The categories are ordered
Ordinal Scale
*a special kind of ordinal scale
* It is about the level of agreement on a symmetric agree‐
disagree scale for a given statement
*most widely used approach to scaling responses in
survey research
*5 point or 7 point
Likert Scale
Assigning continuous values to objects
Interval Scale
Assigning continuous values to objects, relative to a
“reference”
Ratio Scale
In general, phonetic research works on____signals
because human vocal tract for a single speaker contains only
one sound source.
mono
____ uses one channel
mono
____ uses more than one
(typically two) channels
stereo
- Theoretical study
- Most studies in syntax, semantics, phonology
Qualitative research
- Experimental study
- Generating numerical data
- Most studies in phonetics
Quantitative research
the input that is intentionally varied in an experiment to observe its effect on the outcome is called the
independent variable
the outcome or response that is measured or observed in an experiment is called the
dependent variable
when the experiment has 2 independent variables, one with 2 levels and one with 3 levels how many conditions will the factorial design create
6
in the production experiment, which of the following were the dependent variables?
pitch, duration, intensity
in the production experiment, which of the following were the independent variables
position of sentence stress
in the perception experiment which of the following were the dependent variables?
judgement on the position of sentence stress
in the perception experiment which of the following were the independent variable?
pitch manipulation, duration manipulation, intensity manipulation
____analysis concerns the relation between the target of analysis and the surrounding context
syntagmatic
____analysis concerns the relation between the target of analysis and its alternatives
paradigmatic
____analysis is often necessary for examining the acoustic properties relevant to lexical meaning
paradigmatic
_____concerns whether the analysis provides significant generalizations of the observed data
descriptive adequacy
____concerns whether the analysis provides a principled basis for the choice between descriptions and its predictive power
explanatory adequacy
which is the best for investigating pitch and duration?
nasal stops
nasal stops are the most reliable consonants to segment
false
when it comes to segmentation, an affricate can be treated the same as a stop fricative sequence
true
the presence or absence of voicing is the most useful cue when segmenting oral stops
false