midterm Flashcards
AX design stimuli seperated by an ____
Interstimulus Interval (ISI).
in ABX design the listner determines if it is __ or ___ that is the same as __
A or B, X
- ppl often use ____ between sequences (A, B, X)
500ms
2AFC design stands for
Two Alternative Forced Choice (2AFC) Design
Which stimulus came first, A or B?”.
this represents which design
Two Alternative Forced Choice (2AFC) Design
what are some of the 2AFC design elements
Design Elements
▶ Minimizes bias; ideal for similar stimuli.
▶ Assumes either order is equally possible.
▶ Requires discernment of stimulus order.
whats 4IAX design
- using same 2 sounds (A or B) but you’re presenting like it could be ABAA, etc
First and last stimuli of 4IAX design are called
“flankers” or “flanking stimuli”
labelling design aims to assess ____ knowledge when presented a ___ stimuli
categorical, singular
Oddity Design is sort of a ___ task when presnted ___ stimuli
discrimination, multiple
d ′ (d-prime) is
▶ A measure of sensitivity in perceptual tasks.
▶ Indicates how well an individual can discern between signal (e.g., a stimulus) and
noise
basilar membarne can process up to
20k hz
smaller changes in basilar membrane can be detected. voiced can be distinguished from eachother (all in ____region)
lower
Which end of the basilar membrane will be active in processing the acoustic signals related to the sounds [s] and [z]?
A. Apex
B. Base
base
- ’s’ has way___ centre of gravity, ‘sh’ has ___ centre of gravity
higher, lower
Which is the unit of Center of Gravity? (See the Info window)
A. dB
B. pascal
C. Hz
D. calvin
C. Hz
VOT is _____ in voiced stops
negative
larger vocal tract, ____ formant frequencies
higher
Vowel-Intrinsic Methods: Use information from a ____ vowel token(s) (e.g., F1,
F2)
Use information from a single vowel token(s) (e.g., F1,
F2)
Vowel-Extrinsic Methods:
Compare formant values across different vowel(s) by
the same individual.
Speaker-Intrinsic Methods:
Normalize based on data from a single speaker’s
vowels.
Speaker-Extrinsic Methods
Utilize data from multiple speakers (e.g., Labov et
al. 2006).
Sound propagates in a ____ motion
longitudinal
in transverse waves the particles are _____ to the wave motion
perpendicular
peak of transverse wave, ____ ____ of longitudinal wave
compression zone
Recoding sounds is simply capturing the ___ ____ over time
pressure variation
what is the default sampling rate for humans
44100hz
for storage format ___ is the preferred way to go since it is compatible with many audio processing
application
WAV
for storage format ___ is not recommended if your purpose is to perform
acoustic analyses.
MP3
a good signal-to-noise ratio ranges from around
0.64, -0.57
_____ refers to the loss of signals from the high and/or low ends because the
audio gain was set at a too high level
Clipping
dynamic or condenser?
- doesn’t need power from external source
▶ less sensitive to details
▶ better for noisy environment
▶ are sturdier (harder to break when dropped)
dynamic
dynamic or condenser?
▶ requires power supply
▶ more sensitive to details
▶ good when there’s little background noise
▶ are easy to break when dropped
condenser
unidirectional or omnidirectional?
good for single interviewee setting, especially in places with higher background noise
unidirectional
unidirectional or omnidirectional?
good for focus group interviews
omnidirectional?
___ is specially useful when you want to transcribe video dat
ELAN
Unlike word processors, in ___ transcriptions are time-aligned with the audio
PRATT
____ allows the use of multiple tiers to include different levels of annotations
PRATT
SAMPA stands for
Speech Assessment Methods Phonetic Alphabet
____ provided a workaround using basic text, ensuring compatibility across
different systems for representing phonetic symbols.
SAMPA
what are the 3 symbol systems
▶ IPA ▶ SAMPA ▶ ARPAbet
ARPAbet is a phonetic transcription system developed by the __________(ARPA) for speech processing.
Advanced Research
Projects Agency
____ is specifically
designed for use in automatic speech recognition (ASR) and speech synthesis.
ARPAbet
Bella needs to transcribe some audio files where she wants to annotate the sounds with multiple layers of information/coding. She also needs to make sure in her annotation that the beginning and ending of all the phonetic segments are marked with clear boundaries showing timing information. Which of the following option will not work for her purpose? (One correct answer.)
MS Word
Praat TextGrid
ELAN
None of the above will work, in fact
MS Word
How is the SAMPA transcription system different from the ARPAbet convention? (One correct answer.)
Incorrect answer:
ARPAbet represents phonemes with two- or three-letter codes based on American English, while SAMPA closely follows IPA conventions with simpler ASCII characters.
SAMPA uses only ASCII characters, while ARPAbet uses both ASCII and special symbols.
ARPAbet provides a wider range of phonetic symbols than SAMPA.
SAMPA uses only ASCII characters, while ARPAbet uses both ASCII and special symbols.
SAMPA is primarily used for American English transcription, while ARPAbet is used for British English.
SAMPA uses only ASCII characters, while ARPAbet uses both ASCII and special symbols
John is planning to use DARLA (Dartmouth Linguistic Automation) to annotate 10 audio files. What things would John be able to perform with DARLA? (one correct answers.)
DARLA will return a TextGrid having exactly two different tiers.
DARLA will return two separate TextGrids with time-aligned annotations for individual words as well as phonemes.
John will be able to upload all his 10 files at a single step instead of one file at a time, but it will take more time for DARLA to return the aligned TextGrids.
John needs to upload a transcription TextGrid having only a single tier.
John will be able to upload all his 10 files at a single step instead of one file at a time, but it will take more time for DARLA to return the aligned TextGrids.
John is curious how he can generate a narrowband spectrum in Praat. (One correct answer.)
He needs to specify…
the Window length as 0.005 or less under Spectrogram settings
the Window length as 0.030 or more under Spectrogram settings
the Window length as 0.030 or more under Formant settings
the Window length as 0.005 or less under Formant settings
the Window length as 0.030 or more under Spectrogram settings
Stratified random sampling:
Population is divided into strata, and a random sample is taken from within each
category or stratum.
Common ____ in linguistics include age, sex, region, ethnicity, etc
strata
Systematic sample
Participants are picked according to a pre-determined rule; e.g., every 3rd person
cluster sampling:
The population is divided into multiple clusters which are similar in characteristics
▶ Then some whole clusters are randomly sampled from the whole list of clusters
▶ Data are collected from each member in the selected clusters
clusters would predict ____ characteristics, strata would predict ___ characteristics
similar, diff
A researcher is studying language use in a large multilingual city. To ensure representation from all neighborhoods, the city is first divided into distinct neighborhoods. The researcher then randomly selects 5 neighborhoods from the list and surveys all residents in those selected neighborhoods. What type of probability sampling is being used in this study?
A. Simple Random Sampling
B. Stratified Sampling
C. Cluster Sampling
D. Systematic Sampling
C. Cluster Sampling
Convenience sampling:
Researcher invites subjects based on accessibility and proximity
Purposive sampling:
a sample that the researcher things that is going to give the most useful data for the purpose of the research (subjects meet pre-determined characyeristcis)
snowball sampling (sometimes called ___ sampling):
Researcher recruits new participants based on the recommendations of other
participants in the same study. ▶
chain referral
can convenience sampling can still yield representative results?
yes
an ____ is very useful to discover any linguistics traits that other methods are unable to
identify.
ethnographic approach
____ often uses grid sampling
Dialectology
Experimental design allows us to establish a ____ relationship between two variables.
causal
non-experimental Cross-sectional:
you compare two (or more) “pre-existing” groups (you don’t create those groups)
True-experimental:
▶ random assignment is ____ ▶ Quasi-experimental:
▶ random assignment is ____
possible, impossible
Extraneous variable:
Variables that that are NOT independent variables but may have effect on the
dependent variable
Confounding variables:
An extraneous variable that potentially ended up affecting the dependent variable
Lurking variables:
(ex)
Lurking variable is one that affects both your IV and DV
Lurking variable: financial well-being of the family
Pretest-Posttest design:
DV measured before intervention –> Intervention is made –> DV measured again
Within-subject design:
each participant is exposed to ALL the conditions in the IV
Between-subject design
each participant is exposed to only ONE condition of IV
factorial design:
When there are more than one independent variables in a single experiment, it is a
factorial design
single-factor design:
when you have a single factor/independent variable in the
experiment
univariate design:
when you have a single dependent variable
Which of the following is an example of a 3x2 factorial design? (There is one correct answer.)
A study examining the effect of teaching method (traditional, online) and testing environment (quiet, noisy) on students’ vocabulary retention.
A study analyzing the relationship between L1 background (English, Spanish, French) and exposure time (1 year, 2 years) on L2 pronunciation.
A study exploring the impact of dialect (urban, rural, suburban) and gender (male, female) on speech rate.
A study investigating the influence of language proficiency (low, medium, high) and age group (children, adults) on phoneme recognition accuracy.
A study analyzing the relationship between L1 background (English, Spanish, French) and exposure time (1 year, 2 years) on L2 pronunciation.
Which of the following categories of sounds will likely have positive VOT? (Two correct answers.)
Voiced unaspirated stop
Voiceless unaspirated stop
Voiced aspirated stop
Voiceless aspirated stop
Voiceless unaspirated stop
Voiceless aspirated stop