test Flashcards

1
Q

“Which features that isn’t language can you hear in spoken text?”

A

<ul><li>Intonation</li><li>Loudness, energy</li><li>Tempo</li><li>Rhytm</li><li>Voice quality</li><li>Pauses</li></ul>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

“<strong>What is intonation in speech?</strong><span></span>”

A

<div><strong>Intonation</strong> refers to the rise and fall of pitch in speech which adds emotional nuance and can indicate whether a statement is a question, a statement, or an exclamation.</div>

<br></br>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

<strong>What does loudness in speech convey?</strong>

A

<div><strong>Loudness</strong> in speech conveys the volume and energy level, which can express emotions like enthusiasm or anger, and helps in adapting communication to the context.</div>

<br></br>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

<strong>How does tempo affect speech?</strong>

A

“<span> </span><strong>Tempo</strong><span> in speech, or the speed at which someone speaks, can convey feelings like excitement, urgency, or calmness. It is often adjusted based on the listener’s characteristics or the context.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

“<strong>What is rhythm in speech?</strong><span></span>”

A

“<strong>Rhythm</strong><span> involves the pattern of stressed and unstressed syllables in speech. It is typical for a language and variations can provide clues about a speaker’s background or emotional state.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

<strong>How does voice quality impact communication?</strong>

A

“<strong>Voice quality</strong><span> affects how a person’s voice sounds, including elements like pitch, tone, and modulation. It can communicate emotions or characteristics such as confidence or warmth.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

<strong>What role do pauses play in speech?</strong>

A

“<strong>Pauses</strong><span>, whether filled (like ‘uhm’) or unfilled (silent), are used strategically in speech to convey hesitation, emphasis, or to allow time for the listener to process information.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Welke dingen zijn er die je kunt zien in non-verbale communicatie?

A

<ul><li>Gaze patterns</li><li>Hand gestures</li><li>Pointing</li><li>Posture</li><li>Distance</li></ul>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

<strong>What do facial expressions communicate in body language?</strong>

A

“<strong>Facial expressions</strong><span> convey emotions and attitudes, playing a critical role in understanding the emotional context of communication.<br></br><br></br></span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

<strong>How do gaze patterns function in communication?</strong>

A

“<strong>Gaze patterns</strong><span> indicate where and how a person looks, signaling attention, interest, and sometimes conveying dominance, submission, or attraction.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

<strong>What role do hand gestures play in communication?</strong>

A

“<strong>Hand gestures</strong><span> complement or emphasize verbal communication, vary between cultures, and can significantly enhance the clarity and impact of a message.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

<strong>What is the significance of pointing in nonverbal communication?</strong>

A

“<strong>Pointing</strong><span> serves as a nonverbal method to direct attention or indicate objects, conveying information or expressing ideas without using words.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

<strong>How does posture influence communication?</strong>

A

<div><strong>Posture</strong>, the position and orientation of the body, can indicate various emotions and attitudes like confidence, openness, or aggression, similar to how facial expressions are processed.</div>

<br></br>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

<strong>What does the use of personal space, or proxemics, indicate in communication?</strong>

A

“<strong>Proxemics</strong><span>, the use of personal space, varies by cultural norms and can communicate levels of intimacy, formality, or discomfort between individuals during interactions.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

<strong>How do voice and body language interconnect in communication?</strong>

A

“<span>In communication, signals in one modality, such as </span><strong>voice</strong><span> or </span><strong>body language</strong><span>, are often mirrored in the other, indicating a strong connection between the two. This mirroring can vary in strength due to individual differences, cultural norms, personal communication styles, or the specific context of the interaction.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

“<strong>What is Quintilianus’s perspective on rhetoric in ““Institutio Oratoria””?</strong>”

A

“<span>Quintilianus, in ““Institutio Oratoria,”” defines </span><strong>rhetoric</strong><span> as the art of persuading an audience, utilizing stylistic tricks, strategic ordering of information, and other rhetorical techniques to effectively communicate and influence.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

<strong>How did rhetoric traditionally focus on oral language?</strong>

A

“<span>Historically, </span><strong>rhetoric</strong><span> focused primarily on oral language, as exemplified by figures like Cicero, emphasizing the use of speech for effective persuasion and public speaking.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

“<strong>What is ““pronunciatio”” in the context of rhetoric?</strong>”

A

“<span>In rhetoric, </span><strong>pronunciatio</strong><span> refers to the delivery aspect, which includes not only intonation but also nonverbal communication such as body language, facial expressions, and gestures, integral for effective speech delivery.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

<strong>Why should nonverbal features match the content of spoken utterances in rhetoric?</strong>

A

“<span>Nonverbal features should match the content of spoken utterances to enhance the authenticity and impact of the message. For example, a happy message should ideally be delivered with a happy voice and facial expression to reinforce the sentiment and persuade the audience effectively.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

<strong>Why is nonverbal communication significant in presidential debates?</strong>

A

“<span>In presidential debates, the </span><strong>importance of nonverbal communication</strong><span> is crucial as current-day politicians are highly aware of its potential impact.<br></br><br></br>use of voice and body language helps convey messages more powerfully and can significantly influence audiences, highlighting the vital role of nonverbal cues in political communication.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

“<strong>Why is nonverbal communication significant in presidential debates?</strong><span></span>”

A

<div><div><div><div><div><div><div><div><div><div><div><div><div><div><div>In presidential debates, the <strong>importance of nonverbal communication</strong> is crucial as current-day politicians are highly aware of its potential impact. Effective use of voice and body language helps convey messages more powerfully and can significantly influence audiences, highlighting the vital role of nonverbal cues in political communication.</div></div></div></div></div><div><div><div><div></div></div></div></div><div></div></div><div><div></div></div></div></div></div></div></div></div></div></div></div>

<div><div><div><div><div></div><div><div><div><div><div></div></div><br></br></div></div></div></div></div></div></div>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

<strong>What does the statement that nonverbal features account for more than 90% of communication imply, and what are its limitations?</strong>

A

“<div><div><div><div><div><div><div><div><div><div><div><div><div><div><div>The statement that <strong>nonverbal features account for more than 90% of communication</strong> is popularly derived from Mehrabian’s research on emotion recognition with conflicting cues. <br></br><br></br>However, this claim is often misunderstood as it specifically relates to the communication of <b>feelings and attitudes</b>, not to all types of communication. <br></br><br></br>Moreover, the application of this statistic is <b>limited</b> because people can exhibit <b>contradictory nonverbal cues</b>, such as smiling at a funeral, which do not necessarily reflect their true emotions or intentions.</div></div></div></div></div><div><div><div><div></div></div></div></div><div></div></div><div><div></div></div></div></div></div></div></div></div></div></div></div><div><div><div><div><div></div><div><div><div><div><div></div></div><br></br></div></div></div></div></div></div></div>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

<strong>Why is the study of nonverbal communication considered a relatively new and paradoxical field?</strong>

A

<div>The study of <strong>nonverbal communication</strong> is considered relatively new and paradoxical because while there is a<b> strong intuition </b>that nonverbal features significantly influence communication, the actual extent of their impact is only beginning to be understood. Historically, the field has been <b>hampered by a lack of tools to effectively record, measure, or analyze</b> these features.</div>

<br></br>

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

<strong>What does multimodality mean in the context of human perception?</strong>

A

“<strong>Multimodality</strong><span> refers to how our perceptual system integrates information from various sensory modalities such as vision, hearing, touch, and taste.</span>”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
 How do different modalities in communication affect our perception of experiences such as dining?
When we eat in a fancy restaurant a dish can taste differen than when you taste the same dish at home or at a fastfood chain.
26
What can you say about multimodal communication compared to only  spoken communication e.g.
"Multimodal communication is considered the most natural form of human interaction because it involves multiple sensory modalities. Traditionally, speakers observe their addressees and vice versa,

spoken communication without visual contact is still relatively rare, underscoring the innate multimodal nature of human communication.

"
27
How do faces and speech interact to affect our perception of spoken language?
This visual information from the face significantly influences how we perceive and interpret spoken language, as the coordination of visual and auditory components enhances our understanding and response to communication.

28
What is the ventriloquism effect and how does it influence our perception?
The ventriloquism effect is a perceptual phenomenon where auditory and visual signals, presented from different locations, are perceived as coming from the same source. The brain links the sound to the visual signal, creating a perception that they are spatially related.

 This strong effect, which humans can hardly suppress, suggests a form of recalibration by the brain to bridge the difference between visual and auditory locations, enhancing the integration of multimodal stimuli.

29
What is the McGurk effect and how was it discovered?
The McGurk effect is a perceptual phenomenon where conflicting visual and auditory signals lead to a third, different perception. For example, when a video of someone saying /ga/ is paired with the sound of /ba/, people often hear /da/.

This effect was discovered by accident by McGurk and his assistant John MacDonald while researching how children perceive speech and whether they are more responsive to the face or voice of their mother. The McGurk effect illustrates how our perception integrates and sometimes confuses combined sensory inputs.

30
What is the Cocktail Party Phenomenon in the context of speech perception?
The Cocktail Party Phenomenon refers to our ability to focus on one person’s speech in a noisy or crowded environment. This ability highlights how our auditory system can selectively attend to a single source of sound among many distractions, a crucial skill for effective communication in social settings.

31
"How does lipreading contribute to speech perception? "
Lipreading involves interpreting visual cues from the movements of a speaker’s lips, which significantly aid in speech perception, especially in challenging auditory conditions. This visual information can compensate for poor audio quality or background noise, allowing for better understanding of spoken words.

32
What are compensatory effects in communication?
"Compensatory effects occur when there is noise or interference in one sensory channel (auditory or visual), prompting the other channel to enhance its input to compensate for the missing or unclear information. "
33
How do nonverbal communication skills develop in children?
"As children grow, they not only enhance their verbal skills such as lexicon, grammar, and pronunciation, but they also become more proficient in using and interpreting nonverbal features like voice tone and body language. "
34
in which order is the acquisition of nonverbal featurs in childrn?
  1. In the womb: Intonation patterns, rhytm and features of the voicee.
  2. As young infant: infants learn to imitate facial gesturs lik tongu protrusion and mouth opening.
  3. Infants: quickly learn to integrate information coming from different modalitis. 
35
"How do nonverbal features change as a child grows older?"
"1. As a child grows older, nonverbal features become more functional.

2. Children learn to associate specific nonverbal forms, like
nodding or higher intonation at the end of a sentence, with particular communicative or social functions.

3.This change is due to increasing
social awareness and exposure to a varied environment (family, school, society)."
36
"How do biological and physiological factors influence infants' intonation and rhythm?"
" Preference for low-ending (lower pitch or frequency) contours is due to air pressure and lung energy decreasing naturally. These factors show that intonation and rhythm are influenced by the innate biological predispositions of infants."
37
"How do nonverbal features reveal differences in social awareness among different age groups?"
"Nonverbal features may reveal differences in social awareness between younger children, older children, and adults.

This is a
working hypothesis suggesting that as children grow, their use and understanding of nonverbal features evolve, reflecting their increasing social awareness."
38
"What evidence suggests a strong genetic, biological basis for nonverbal cues to basic emotions?"
Work into cues to basic emotions suggests a strong genetic, biological basis:
  • Baby: crying when sad
  • Dog: happy when tail moves

    etc.
39
What evidence supports the genetic basis for nonverbal expressions, even in blind individuals?

"Blind people produce facial expressions similar to those of their family members and to each other, despite lacking visual exposure. This suggests a genetic foundation for nonverbal features."
40
"What was the setup of the memory experiment mentioned in the surpris experiment?"
Participants are led to believe they are taking part in a memory experiment. The cover story is that the study investigates the effect of context and reading aloud.
Experiment stages:
  1. Participants imagine words fitting a specific context (e.g., organs of the body).
  2. They see 10 words on a screen, shown one by one.
  3. They read aloud the words as soon as they appear.
  4. They recall as many words as possible afterward.
41
"
What were the two experimental contexts for the word ""liver"" in the surprise experiment?
"
"
The word ""liver"" appeared in two contexts:
  • Normal Context: organs of the human body
  • Surprise Context: favorite food items for Dutch kids
This was combined with other questions about cities, pets, etc.

"
42
"What was the participant demographic for the surprise experiment?"
"About 25 subjects (Dutch) participated in the experiment."
43
Wat wordt er bedoeld met verbal en non-verbal/
Verbal = wordy
Non-verbal = de rest
44
"Why is body language a thing, but voice language isn't?"
if voice x language would be a thing, than body language would be sign language. 

Speech is most of the times intentional communicating while language all different ways of communicating are. 
45
Why is sign language verbal, and body language not?
  • There is a literal translation of expressions/signs, limited things they can refer to, wrong or right)  it has “words”
  • Each sign can be equated to a specific meaning (similar to spoken/written language)
  • It is an intentional and structured symbolic body movements that constitute a form of language expression (linguistic nature)
46
"What is the aspects of the conventional pair of form and meaning in linguistics are conventional?"
"
  • Form: The phonetic or gestural elements of words, such as phonemes, morphemes, or hand movements.

  • Meaning: The denotation of a word, which includes objects, actions, or concepts that the word represents, and its syntactic status (how the word functions within the structure of a sentence or phrase).
"
47
"What is meant by denotation?"
"Denotation is the specific, literal meaning of a word, independent of any emotional or cultural connotations. It refers to what the word directly represents or describes. For example, the denotation of ""to read"" is the action of interpreting written text."
48
"What is meant by syntactic status?"
" Syntactic status refers to the role that a word plays within the structure of a sentence or phrase. It defines how a word functions grammatically, such as being a subject, object, verb, or modifier."
49
"How do words, signs, and morphemes function differently in language?"
"
Words, signs, and morphemes function differently across languages to convey meaning:
  • Words: In languages like English, meanings are conveyed through distinct words (e.g., ""read,"" ""reads,"" ""reading"").
  • Morphemes: In languages like Turkish, meanings are conveyed by adding morphemes (smallest units of meaning) to a root word (e.g., root ""ok"" in ""okuma"" for ""read"" and ""okur"" for ""he/she/it reads"").
  • Signs: In sign languages, meanings are conveyed through signs, which have their own grammar and syntax (e.g., Dutch Sign Language is not a direct translation of spoken Dutch).
"
50
What is unique about morphemes in Turkish?

"
Turkish uses morphemes to convey(overdragen) meaning:
  • Words are formed by adding morphemes to a root.
  • Example: The root ""ok"" (read) can become ""okuma"" (to read) or ""okur"" (he/she/it reads).
  • This method, known as agglutination, allows a single root to take on various grammatical and semantic roles by adding different morphemes.
"
51
"English and Turkish differ in their use of words and morphemes:"
"
  • English: Uses distinct words for different meanings (e.g., ""read,"" ""reads,"" ""reading"").
  • Turkish: Uses morphemes added to a root to convey different meanings (e.g., root ""ok"" for read, ""okuma"" for to read, ""okur"" for he/she/it reads).
  • English words change forms less frequently compared to Turkish, which systematically uses morphemes.

"
52
"What is a morpheme?"
"A morpheme is the smallest unit of meaning in a language. It can be a word or a part of a word (like a prefix or suffix) that cannot be broken down further without losing or altering its meaning. For example, in the word ""unhappiness,"" there are three morphemes: ""un-"" (a prefix meaning ""not""), ""happy"" (the root), and ""-ness"" (a suffix meaning ""state of"")."
53
What are properties of sign language?
Modalities: different ways of expressing language
  • Sign languages are real languages (own grammar and syntax)! Sign language of the NL  (NGT) is not signed Dutch (not a direct translation of spoken language)

  • Signs are conventional (own vocabulary and rules for expression), not mimicry (nabootsen).
54
"What is Neil Cohn's view on visual language?"
"According to Neil Cohn from the Visual Language Lab, visual language (e.g., in comics) is also considered a language because language is not restricted to spoken or written forms but can manifest in various modalities ."
55
"what is the role of modality for 'verbality'?"
modality does not matter for verbality.
56
"What are the steps of vocal fold/cord vibration in the speech process?"
  1. Vocal folds closed at the beginning of the speech process.
  2. Air pressure from lungs is generated.
  3. Vocal folds open due to lung pressure, allowing air to pass through.
  4. Pressure released is influenced by muscle tension and emotional state.
  5. Vocal folds close again, and the cycle repeats about 100-300 times per second.
57
"What determines the pitch of the voice?"
"The frequency of the vocal fold vibrations determines the pitch of the voice; heavier vocal folds result in a lower frequency and lower voice."
58
"How does the tenseness of vocal folds affect sound pitch?"
"Tenser vocal folds produce higher-pitched sounds."
59
"How does the size of vocal folds affect their vibration and pitch?"
" Larger vocal folds vibrate more slowly, resulting in a lower frequency and lower voice."
60
"What causes variations in vocal fold size?"
"Variations in vocal fold size are due to genetics and hormonal changes during puberty."
61
"What non-verbal information affects vocal fold vibration? "
" Tenseness and size of the vocal folds influence their vibration and the pitch of the voice."
62
"How does the position of the tongue affect vowel sounds?"
"The position of the tongue changes the resonance of higher frequencies, resulting in different vowels."
63
"What is vowel height and how does it affect vowel sounds?"
"Vowel height refers to how high the tongue is in the mouth, with different heights producing different vowels."
64
"What is vowel backness and how does it affect vowel sounds?"
"Vowel backness refers to the position of the tongue in the mouth (front/back), influencing vowel sounds."
65
"How does lip rounding affect vowel sounds?"
"Lip rounding involves forming the lips in a circle (rounded vowel) or not (unrounded), affecting the sound of vowels."
66
"What is vowel tenseness and how does it affect vowel sounds?"
"Vowel tenseness refers to stressed/tense vowels, which can change the quality of the vowel sound."
67
"Wat zijn consonanten?"
"Consonanten beperken of stoppen de luchtstroom, wat leidt tot hoorbare fricatie of onderbreking."
68
"Wat is de plaats van articulatie bij consonanten?"
" De plaats van articulatie verwijst naar waar in het spraakkanaal de luchtstroom wordt beperkt of gestopt om een consonant te produceren."
69
"Wat zijn bilabiale consonanten en geef voorbeelden?"
"Bilabiale consonanten worden geproduceerd met twee lippen, zoals p, b, en m."
70
"Wat zijn labiodentale consonanten en geef een voorbeeld?"
"Labiodentale consonanten worden geproduceerd met lippen en tanden, zoals f."
71
"Wat zijn interdentale consonanten en geef een voorbeeld?"
"Interdentale consonanten worden geproduceerd tussen de tanden, zoals th."
72
"Wat zijn alveolaire consonanten en geef voorbeelden? "
" Alveolaire consonanten worden geproduceerd bij de richel achter de tanden, zoals t en d."
73
"Wat zijn alveo-palatale consonanten en geef voorbeelden?"
"Alveo-palatale consonanten worden geproduceerd bij het harde gehemelte, zoals j en y"
74
"Wat zijn velare consonanten en geef voorbeelden?"
"Velare consonanten worden geproduceerd bij het zachte gehemelte, zoals k en ng in ""going"" en ""uncle""."
75
"Wat zijn glottale consonanten en geef een voorbeeld? "
"Glottale consonanten worden geproduceerd in de keel, zoals h."
76
"Wat is de wijze van articulatie bij consonanten?"
"De wijze van articulatie beschrijft hoe de luchtstroom wordt gemanipuleerd om hoorbare fricatie of onderbreking te produceren."
77
"Wat is een stop of plosief?"
"Een stop of plosief is het blokkeren van het geluid en het vervolgens loslaten."
78
"Wat is een fricatief?"
"Een fricatief ontstaat door het vernauwen van de luchtstroom met de tong. Voorbeelden: f, v, s, z."
79
"Wat is een affricatief?"
"Een affricatief combineert een orale stop (plosief) en een fricatief. Voorbeelden: zoals in ""chop"", zoals in ""judge""."
80
"Wat is een liquid?"
"Een liquid laat de luchtstroom over de zijkant van de tong stromen. Voorbeelden: l, r."
81
"Wat is een glide?"
"Een glide heeft slechts een milde obstructie en in sommige talen worden deze als klinkers beschouwd. Voorbeelden: w, j zoals in ""yes""."
82
"Wat betekent voicedness bij consonanten?"
"Voicedness verwijst naar het trillen van de stembanden tijdens de productie van een consonant. Voorbeelden: voiced - b, d; voiceless - p, t."
83
"Wat is een voiced consonant?"
"Een voiced consonant heeft trillende stembanden. Voorbeelden: b, d, g."
84
"Wat is een voiceless consonant?"
"Een voiceless consonant heeft geen trillende stembanden. Voorbeelden: p, t, k."
85
"Zijn alle klinkers voiced of voiceless?"
"Alle klinkers zijn per definitie voiced. Voorbeelden: a, e, i, o, u."
86
Where can you find the most non-verbal information of speech?
the pitch (a.k.a. intonation)
87
"What is F0 in vocal fold vibration?"
F0 is the fundamental frequency, representing the rate at which the vocal folds vibrate. A lower F0 corresponds to a lower-pitched voice, while a higher F0 corresponds to a higher-pitched voice.

88
"How does pitch relate to vocal fold vibration?"
Pitch is the perception of the frequency of vocal fold vibrations.

  • The faster the vibration (higher F0), the higher the pitch; 
  • the slower the vibration (lower F0), the lower the pitch.

89
"What is the significance of F0 in voice analysis? "
"F0 is crucial in analyzing voice stress and emotional states.

Variations in
F0 can indicate different stress levels, emotional conditions, and even cognitive loads​"
90
"What factors affect the fundamental frequency (F0) of vocal fold vibration?"
"Factors affecting F0 include the

  • tension of the cricothyroid muscle,
  • subglottal pressure, 
  • vocal fold length, 
  • and thickness​ of vocal folds.
"
91
"What are the non-verbal aspects of pitch?"
Non-verbal aspects of pitch include:
  • Pitch accents: Indicate new, given, or contrastive information.
  • Question/assertion: Rising pitch at the end indicates a question, while a steady drop indicates an assertion.
  • Tone of voice: Conveys attitudes, emotions, or nuances in the speaker’s intention.
  • Emotion: Pitch variation can indicate the speaker’s emotional state.
92
"What are the verbal aspects of pitch?"
"
Verbal aspects of pitch include:
  • Lexical stress: Emphasis on a particular syllable within a word can change its meaning (e.g., ""to address"" vs. ""an address"").
  • Lexical tone: In some languages, pitch variations differentiate between words (e.g., in Mandarin, ""ma"" means mother only with the correct tone).
"
93
"How do you understand non-verbal aspects of pitch?"
Understanding non-verbal aspects involves comparing them with other linguistic elements, such as

  • the overall mood of the conversation,
  • syntactic structure, 
  • meaning, 
  • and grammatical elements, 

to gain a holistic picture of the intended meaning.

94
"What is ELAN used for in non-verbal communication analysis?"
ELAN is software used to annotate videos by adding explanatory notes or comments. It requires a human annotator and a coding manual, making the process time-consuming and subjective.

95
"What is OpenFace and its primary focus?"
"OpenFace is facial recognition software that helps automate the annotation of facial expressions. Its primary focus is on facial expressions, often leaving out other aspects of body language."
96
"How is VR used in gesture tracking?"
"VR and related technologies have gesture trackers that can detect and interpret gestures. However, the interpretation of gestures still often requires human understanding."
97
What are some key body language variables and their indications?

"
  • Muscle tone: Indicates emotional states or reactions.
  • Distance (proximity between individuals during interaction): Reflects comfort, intimacy, or conversational dynamics.
  • Facial expressions:
    • Eyebrow position: Can indicate surprise, skepticism, or interest.
    • Mouth shape: Reflects emotions and verbal articulation.
  • Gaze/attention: Indicates interest, focus, or distraction.
  • Fidgeting (small/repetitive movements, often unconsciousness): Reflects discomfort, anxiety, or impatience.
(Note: The list of body language variables is extensive, and researchers do not universally agree on all aspects.)
"
98
"In which 2 ways does information structure manifest itself? "
Information structure manifests itself in two ways:
  • Discourse units: Sentences that belong together are organized into chunks, phrases, and marked by boundaries.
  • Distinguishing importance: Important information is distinguished from unimportant information (accents,
    prominence, emphasis, etc)
99
"What is prominence marking in speech?"
"Prominence marking uses various cues to highlight or emphasize specific words or elements in a sentence."
100
"How do speakers of Germanic languages use prominence marking?"
"
Speakers may use pitch accents to signal the importance of words:
  • Dutch: “Ik voel me SERIEUS genomen” vs. “Ik voel me serieus GENOMEN” (people respect me vs. people don't respect me).
  • English: “The kids had lunch. The boys/BOYS were eating an apple.” (only boys vs. also girls).
  • Context: “No, not the RED button, the BLUE button” (emphasizing the contrast).
"
101
"How do speakers use visual cues to signal prominent information?"
"Speakers signal prominent information through visual cues, such as facial variations.

  • Rapid eyebrow movements (flashes) can play a similar role as pitch accents in emphasizing important information.
"
102
"What is the connection between pitch and eyebrow movements in communication?"
There is a close connection between pitch and eyebrow movements, with high/raising notes often synchronized with raised eyebrows.

103
" Is there a one-to-one mapping between auditory and visual cues?"
No, there is no one-to-one mapping, but speakers prefer to synchronize verbal cues with visual cues.

When they align, it enhances clarity and emphasis of the message. When verbal and visual cues don’t align it may create difficulty in communication.

104
"How do newsreaders align visual and auditory cues?"
"In newsreaders, there is often alignment between visual and auditory cues for prominent information, especially for strong accents, despite speaker variation."
105
" How are auditory and visual beats coordinated according to experimental data?"
Experimental data suggest that auditory and visual beats are tightly coordinated.

When a speaker produces a visual beat on a word (gesture), some acoustic properties of that word are affected, and the auditory prominence of that word increases.

106
"What were the key results of the reaction times experiment regarding auditory and visual accents?"
  • Auditory accent: The strongest cue for perceived accent, with high correct identification rates (94.2% for Maarten, 94.9% for Maandag, 85.8% for Mali).
  • Congruent situations: Received more responses than incongruent ones.
  • Visual accent: Used when the auditory signal was unclear, indicating reliance on visual information.
  • Reaction times: Incongruencies led to significantly longer reaction times, indicating confusion.
107
Wat waren de uitkomsten in het onderzoek over visuele en audititeve cues?
  • Congruente stimuli worden sneller verwerkt dan incongruente stimuli.
    • Met name voor het eerst en derde woord.
  •  Auditieve nadruk, is de sterkste manier van nadruk.
  • Maar in incongruente situaties werden visuele hints meer belangrijk.
  • In incongruente situaties, worden met een visuele nadruk leverde meer reacties op, maar er was ook een langere reactietijd.
108
"Where are the visual cues for prominence located on the face in the vertical dimension?"
  • Top: Rapid eyebrow movements (flashes) may play a similar role as pitch accents.
  • Mouth area: Articulators make more exaggerated movements when a prominent or important word is produced.

109
"Where are the visual cues for prominence located on the face in the horizontal dimension?"
  • Perceptual: Observers are more sensitive to dynamic variations in the left part of the face than the right.
  • Acoustic/physical: There is a significant correlation between F0 (pitch) and the left eyebrow. The left side of the face represents the head better than the right side.
110
"What did Thompson et al. (2004) find about observers' sensitivity to facial variations?"
Thompson et al. (2004) found that observers are more sensitive to dynamic variation in the left part of the face than the right part.

111
"What conclusions can be drawn from the results of the horizontal dimension study on facial feature recognition?"
  1. Closer distances improve recognition accuracy for facial features.
  2. Whole face visibility provides the highest accuracy, followed by eyes and brows, with the mouth area being the hardest to recognize.
  3. The eyes and brows are more easily recognized at a distance than the mouth area.
  4. As distance increases, the ability to correctly identify facial features significantly decreases.
112
"What is concluded about the importance of different facial areas for prominence signaling?"
Different facial areas are not equally important for prominence signaling:
  • Vertical: Top is more important than bottom.
  • Horizontal: Left part is more important than right part.

113
"How do languages differ in terms of their prosody?"
Languages differ in terms of their prosody in two main ways:
  • Prosodic form: Differences in the timing of pitch movements, pitch range, tempo, etc.
  • Prosodic functions: Differences in the use of pitch rise to mark question intonation, use of accent, etc.

114
"What are examples of differences in prosodic form?"
Differences in prosodic form include:
  • Timing of pitch movements
  • Pitch range differences
  • Tempo

115
"What are examples of differences in prosodic functions?"
Differences in prosodic functions include:
  • Use of pitch rise to mark question intonation
  • Use of accent to indicate emphasis
116
"What is chunking in prosody?"
"Chunking in prosody refers to the way speakers group words and phrases into discourse units, making it easier to understand and process spoken language. It involves using prosodic cues like pauses, intonation, and stress to signal the boundaries of these units."
117
" What is the difference between plastic and non-plastic languages in terms of accents?"
"
  • Plastic languages are more flexible in moving accents within an utterance, while
  • non-plastic languages are less flexible.
"
118
"How do Germanic and Romance languages differ in their use of accents? "
"
  • Germanic languages (e.g., Dutch, English, German) are generally more flexible (plastic) with accents,
  • while Romance languages (e.g., French, Italian, Spanish) are less flexible (non-plastic).
"
119
"How are football scores announced differently in English and Italian to reflect the flexibility of accents?"
"n English, football scores are announced with accents that can move within the sentence, e.g., ""Liverpool ONE CHELSEA one."" In Italian, the accent placement is less flexible, and scores are announced with a more fixed pattern, e.g., ""Rome UNO Juventus UNO."" This shows that English can emphasize different parts of the sentence more easily than Italian."
120
"What compensatory strategies do languages have for accents?"
"Languages may use compensatory strategies such as word order to manage accentuation differences."
121
" How is English used in South Africa according to Swerts and Zerbian (2010)?"
English is used as a first language (L1) by a large number of people and as a lingua franca (L2) by many who speak different languages as their first language, such as Zulu and various Bantu languages.

122
"How many speakers participated in the task and what languages did they speak of Zwerts and Serbian?"
20 speakers participated:
  • 10 speakers of L1 English (only English)
  • 10 speakers of L1 Zulu (both Zulu and English)
123
"What task did the study participants perform in the experiment involving English and Zulu speakers?"
"Participants described differently colored objects from left to right, focusing on a red cow that appeared in different contexts (contrasting with preceding color or form, and appearing at the end of the list or not)."
124
"Wat waren de onderzoeksresultaten van Swerts en Zerbian (2010)?"
"Moedertaalsprekers van Engels in Zuid-Afrika gebruiken intonatie en prosodie anders dan niet-moedertaalsprekers.

 Niet-moedertaalsprekers, zoals Zulu-sprekers, gebruiken intonatie voornamelijk om continuïteit of finaliteit aan te geven, terwijl moedertaalsprekers ook intonatie gebruiken om focus en positie binnen een lijst aan te geven.
"
125
"Wat wordt bedoeld met 'Position' in de context van de uitspraak?"
"Position verwijst naar de vraag of de uitspraak definitief klinkt of niet. Eindzinnen kunnen gemakkelijk worden onderscheiden van niet-eindzinnen in Engels en Zulu."
126
How is contrastive emphasis marked in English and Zulu?

"In English (native and fluent L2 speakers), contrastive words are marked by emphatic stress. In Zulu and in the English of less proficient L2 speakers, contrastive words are NOT marked by emphatic stress."
127
What is empathic stress?
"Empathic stress refers to the increased emphasis placed on a specific word within a sentence to highlight its importance or convey emotion. This emphasis is often achieved through changes in pitch, loudness, or duration of the stressed word."
128
What are contrastive words?
" Contrastive words are words that are emphasized to distinguish them from other words or ideas in the same context. This emphasis helps to clarify differences or contrasts between items, such as in the sentence, ""I said the RED car, not the blue one,"" where ""RED"" is the contrastive word."
129
What are the implications of mastering intonation and the use of accents according to the research?
  1. Language learning: Educational programs should focus not only on phonology, lexicon, and grammar, but also on intonation and the functional use of accents.

  2. Sociolinguistic implications: If someone does not master the intonational rules of a specific language or uses the rules differently, they will continue to sound different from native speakers.

  3. Goodness of a speaker: The difference between good and bad speakers can be related to the effective use of accents.
130
What are the key conclusions about prosody in language according Zwerts and Serbian (2010)
  1. Languages can differ in their functional use of prosody, but these differences are related to the kind of function, such as chunking vs. prominence.

  2. The prosodic phenomena of a first language (L1) may transfer to a second language (L2), especially when the L2 speakers are less fluent. This transfer is referred to as prosodic traces.

  3. Such prosodic traces may have sociolinguistic implications, affecting how speakers are perceived and how effectively they communicate in their second language.
131
"What are non-verbal cues that are naturally produced?"
" Non-verbal cues naturally produced include facial expressions, gaze patterns, hand gestures, pointing, posture, and distance. They convey emotions, attitudes, and social signals."
132
"What does research involve when studying non-verbal cues?"
"Research involves studying measurements and theory descriptions to understand non-verbal cues."
133
"What is artificial production in the context of non-verbal cues?"
"Artificial production refers to making symbolic/manual changes (simple) and deep connectionist changes (more complex) in natural speech to see the effects."
134
"What does form refer to in the context of prominence types?"
Form refers to the cue itself (e.g., eyebrow raise, pitch raise/contour) that makes the voice or body language stand out and more noticeable.
135
" What does meaning refer to in the context of prominence types?"
"Meaning refers to the function or purpose of the form, particularly how they contribute to prominence and mark specific information."
136
"What is Focus in the context of prominence (meaning) types?"
"
Focus refers to a part of the sentence that has new and prominent information. For example, in ""Mark is the expert on deception,"" the focus could be on ""Mark"" to highlight him as the expert among alternatives.

"
137
"Give an example of a sentence with Focus on different words to provide new information."
Who was the expert on deception? Mark is the expert on deception.
  • Mark is the expert on deception (as opposed to another field)
138
"What is Link in the context of prominence (meaning) types?"
" Link refers to information that is not new but is given prominence to highlight common ground.

 For example, 
What about Marc? Marc is the expert on deception (Mark is what we all know, the
rest of the information (“is the expert on deception”) is new and thus the focus)
"
139
"What is Tail in the context of prominence (meaning) types?"
"
Tail refers to information that is not new and not prominent, often used for grammatical completeness. For example, in responding with a complete sentence: ""Yes, I think Marc is the expert on deception.""

"
140
"What is the role of pitch contours in emphasizing Focus?"
Focus is emphasized by placing a pitch peak (H) on the stressed syllable, making it stand out or sound more important due to the higher pitch.

141
"How does a Link transition in pitch contours?"
A Link involves a pitch pattern where the pitch lowers (stretch) and then rises (Low*High). This pattern helps to transition smoothly between syllables, creating a distinctive rhythm and melody in speech.

142
"How does speech rate affect prominence in speech?"
"Speech rate slows down on prominent parts to emphasize or give importance to specific elements in speech."
143
"What are speech acts in communication?"
"Taalhandelingeen.

Speech acts
are actions performed through speech, such as making statements, asking questions, giving commands, and expressing feelings. They go beyond conveying information to include influencing others and expressing emotions."
144
"What is the significance of understanding speech acts?"
"Understanding speech acts helps us interpret the intentions behind what people say and how language shapes our interactions. It reveals the purpose of communication beyond just conveying facts."
145
"What is the truth value of speech acts?"
"
unlike straightforward statements of fact, speech acts are not easily categorized
as true or false. They are more about the performance of an action or the expression of an
intention

e.g. ""Kun jij niet de suiker aangeven?"", betekent meestal niet dat iemand niet de suiker aan kan geven.
"
146
"What are the main components of prosody?"
"The main components of prosody include pitch rise (component 1) and intensity and pitch (component 2).

These elements help convey emotional nuances and emphasis in speech.
"
147
"What does the ability to say things involve in communication?"
The ability to say things involves understanding the performative nature of speech acts, analyzing prosody components (pitch rise, intensity, and pitch), and recognizing the intent or emotional valence behind expressions such as mockery, disbelief, and various emotions.

148
"How does information structure manifest itself in communication?"
"Information structure manifests itself in two ways:

  1. by distinguishing important information from unimportant information (accents, prominence, emphasis) 
  2.  by grouping sentences that ""belong together"" into discourse units (chunking, phrasing, boundary marking).
"
149
"What aspects are covered under distinguishing important information in information structure? "
"Distinguishing important information involves using accents, prominence, and emphasis."
150
"What does grouping sentences into discourse units involve in information structure?"
"Grouping sentences into discourse units involves chunking, phrasing, and boundary marking."
151
"What is boundary marking in speech?"
"Boundary marking is the practice of speakers marking the end of information units, such as a sentence, phrase, or turn, to indicate a boundary in speech."
152
"How do visual cues in text help facilitate the reading process?"
"Visual cues such as punctuation (e.g., full stops, commas), indentation, line breaks, and capitalized words at the beginning of a sentence help visualize the structure of a text and facilitate the reading process."
153
What are local cues and global cues in speech boundary marking?

""
154
What is the difference between local cues and global cues in speech?

  • Local cues: Encoded at the very edge of a speech unit
  • Global cues: Stretched over a whole unit
  • Global cues allow prediction of upcoming boundaries
  • Compare with turn-taking: Turn-switches often proceed smoothly without much overlap or delay due to the predictive capacity of prosody
155
What are auditory cues for boundary marking?
  • Intonation (boundary tones, declination)
  • Pitch reset
  • Durational lengthening (final word)
  • Pauses (silent or filled pauses)
  • Voice quality (creaky voice)
156
How can prosodic chunking disambiguate?
"compare it with a mathematical formula:

2 + (3x5) means something different than (2+3) x 5.

“The man said: the girl is ill” vs “The man, said the girl, is ill”

2 different meanings.
The man said: the girl is ill vs “The man, said the girl, is ill
"
157
"What is the purpose of the Peps-C programme?"
"The Peps-C programme provides teaching materials to help children learn how to produce or interpret prosody, including skills like chunking."
158
"What does the sentence ""Chicken fingers and fries"" illustrate in terms of prosodic chunking?"
"
The sentence ""Chicken fingers and fries"" illustrates how prosodic chunking can affect interpretation. Chunking helps distinguish between the intended meanings, such as (a) food items like chicken fingers and fries or (b) a literal combination of chickens, fingers, and fries.



"
159
What visual cues can indicate boundaries between speech units?
  • Gaze behaviour (Argyle & Cook, 1976)
  • Body posture (Cassell et al. 2001)
  • Head nods (during feedback signalling) (Maynard 1987)
  • Eyebrow movements (Cave et al. 1999
160
What does the reaction time experiment suggest about combining different information sources?
The reaction time experiment suggests that combining different information sources (such as audio and visual signals) is beneficial when these sources complement each other. When the information from both sources works well together, people can respond faster because the information is easier to process.

161
What happens when information sources do not complement each other?
" When information sources do not complement each other, it can lead to cognitive overload. This means that the brain has to process too much information at once, making it harder to respond quickly and efficiently. As a result, the reaction time becomes longer, and it takes more mental effort to understand the information."
162
 What was the general task in the reaction time experiment?
"he general task in the reaction time experiment was to ""press a designated button as soon as the end of the stimulus is reached."" This was applied in both the actual experiment with real audiovisual recordings and a baseline condition with stimuli of variable lengths without finality cues."
163
What were the conditions compared in the reaction time experiment?
"The conditions compared in the reaction time experiment were audiovisual (AV), audio-only (AO), and vision-only (VO). The reaction times were measured in these different modalities to understand the impact of combining audio and visual information."
164
What were the main findings from the reaction time experiment regarding AV stimuli?
The main findings from the reaction time experiment indicated that AV stimuli were the quickest in the actual experiment (with real audiovisual recordings) , while in the baseline condition, AV stimuli were the slowest. This suggests that combining modalities helps when the information sources are complementary, but leads to cognitive overload when they are not .

165
What task were participants asked to perform in the second reaction tim experiment?
Participants had to judge for both short and long utterances whether a fragment was final or not.

166
What was a key finding regarding end-of-utterance classifications in the second experiment?
"Observers could make the best end-of-utterance classifications for bimodal stimuli; interestingly, the lowest scores were for audio-only (AO) stimuli, despite receiving a lot of attention in the literature."
167
What was found about the ease of judging non-final vs. final fragments?
"participants had too choose if a fragment was final or not.

Non-final fragments
were easier than final fragments. People may be looking for marked features; if these are absent, they choose a default, non-final classification."
168
How did fragment length affect classification in the second experiment?
Longer fragments were easier than shorter fragments, possibly due to longer exposure to cues.

169
What was the difference in performance between Audio-Only and Visual-Only stimuli in terms of fragment length?
"he difference in performance between short and long stimuli was bigger for audio-only (AO) stimuli than for vision-only (VO) stimuli. Results for short and long stimuli were very similar in vision-only conditions."
170
What is a possible explanation for the performance difference between Audio-Only and Visual-Only stimuli?
The existence of more global auditory cues (such as declination), whereas visual cues are more locally encoded.

171
How do people manage their turns in a conversation?
People take turns: while person A is producing speech, person B remains silent until it is his/her turn to start talking.

172
What regulates the switch between speakers in a conversation?
"The switch between speakers is regulated through a turn-taking mechanism.

"
173
"What does smooth interaction in conversation often involve?"
"Smooth interaction involves switching turns smoothly, with minimal overlap in speech and only a few milliseconds delay between turns."
174
"What kind of cues speakers and addressees rely on to predict appropriate turn-taking opportunities?"
"They rely on specific cues that can be lexical, syntactic, auditory, or visual."
175
" What is the difference between true turns and minor backchannels in conversation?"
"
  • True turns involve active contributions with substantial information, while
  • minor backchannels are minimal responses indicating engagement, like nodding or saying ""uhuh"".
"
176
"To what extent can backchannel opportunity points be predicted?"
"
Backchannel opportunity points can be predicted to some extent by identifying specific cues in the conversation that indicate a speaker's willingness to listen or their need for a response.
"
177
"What specific issues arise in predicting backchannel opportunity points?"
The specific issues include:
  • Variation between individuals: How much individuals differ in their use of backchannels.
  • Implementation in synthetic characters (avatars): Whether this behavior can be effectively programmed into avatars to mimic natural human interactions.
178
"How can the implementation of backchannel behavior in animated characters improve computer systems?"
"The implementation can lead to an improved naturalness of computer systems, making interactions feel more human-like and intuitive."
179
"What is the research of Blomsma and colleagues based on?"
Their research is based on the o-cam paradigm, which involves participants interacting with what they believe is a live person but is actually a pre-recorded session to study backchannel behavior.

180
What is the basic concept of the O-cam paradigm in experiments?

"he O-cam paradigm involves participants interacting via an online session (like Skype or Zoom) where they believe they are seeing a live person. However, they are actually viewing a recording of a confederate.

This illusion is created through a scripted introduction, and the participants' task is to guess which of four similar tangram figures the other person is describing.
"
181
"How was the O-cam paradigm experiment conducted?"
"The O-cam paradigm experiment involved 14 participants who believed they were in a live interaction. They played several rounds, resulting in 6 minutes and 15 seconds of interaction each. The study identified 53 Backchannel Opportunity Points (BOPs) via 10 observers. It was evident that participants varied in their feedback behaviors."
182
What measurements were taken in the O-cam paradigm experiment?

Participants were rated on perceived personality traits (Friendliness, Extraversion, Activeness, Dominance) using 6-point scales. Their behaviors were analyzed based on auditory and visual features, which significantly correlated with personality impressions.

183
"What was discovered about the relationship between behavioral measures and personality impressions in the O-cam paradigm experiment?"
"The study found that behavioral measures, which included auditory and visual features, correlated significantly with perceived personality traits. These measures appeared to be strongly related to impressions of personality, such as Friendliness, Extraversion, Activeness, and Dominance."
184
"How were the behaviors of human subjects used in animations in the O-cam paradigm?"
"The behaviors of human subjects were implemented into an animated character, including both visual and auditory features. A second experiment revealed that different feedback behaviors led to different impressions of the avatar's personality."
185
"What are the potential uses of implementing human behaviors into animated characters?"
" Implementing human behaviors into animated characters can generate different personalities for machines, aid in developing user-specific adaptive systems, help train communicatively deprived individuals

(e.g., people with autism or blind people), and improve ""rapport"" between conversation partners through effective feedback signaling.
"
186
"What is Parallel Wavenet by Deepmind (now Google)?"
"Parallel Wavenet directly models the raw audio signal by predicting one sample at a time, conditioned on the previous samples and relevant context."
187
"What are the capabilities of Parallel Wavenet in speech synthesis?"
"It can produce highly realistic and natural-sounding speech and is successful in capturing the nuances of the human voice and generating high-fidelity audio."
188
"What is speech synthesis?"
" Speech synthesis is the artificial production of human speech. It converts written text into spoken words using computer algorithms. This technology is used in various applications, such as virtual assistants, navigation systems, and accessibility tools for visually impaired individuals."
189
"What advancements in speech synthesis were made in 2024?"
"In 2024, Wavenet continued to be used, but Tacotron 2.0 also became prominent, showcasing the dynamic nature of advancements in the field."
190
"In 2024, Wavenet continued to be used, but Tacotron 2.0 also became prominent, showcasing the dynamic nature of advancements in the field."
"Tacotron 2.0 consists of an encoder and a decoder. The encoder processes the input text and converts it into a fixed-size context vector (which is a numerical representation of the text.), while the decoder generates mel-spectrograms representing the speech features."
191
"How does Tacotron 2.0 improve speech synthesis? "
"Tacotron 2.0 provides a holistic approach to speech synthesis, allowing for direct modeling of the text-to-speech conversion process. It enables flexibility in controlling various aspects of speech synthesis, such as prosody and speaking style."
192
What limitation does Parallel Wavenet have in terms of controlling speech synthesis aspects?

Parallel Wavenet excels in producing natural speech but may have limited control over specific aspects like prosody and speaking style.

193
"What are mel-spectrograms in the context of Tacotron 2.0?"
"Mel-spectrograms are representations of speech features used by the decoder in Tacotron 2.0 to generate the sound output."
194
What is the trend in modern speech synthesis models?

"The trend is towards end-to-end models. These models are trained to predict the next part of speech from the given speech. This makes the models good enough to allow for fine-tuning."
195
"What is fine-tuning in speech synthesis models?"
"Fine-tuning involves using the hidden knowledge from a pre-trained model to learn related tasks. This process takes advantage of the hidden representations (black box) within the model.

By adjusting the model’s
parameters for a specific task or dataset, fine-tuning allows the model to adapt efficiently and transfer its knowledge to new domains or applications like entertainment or education."
196
"What is Text-to-Speech (TTS) in speech synthesis?"
"Text-to-Speech (TTS) is the process of creating artificial speech from written text. It aims to produce the best match between the written words and the spoken output."
197
"How do we know if the speech matches the text well?"
"
  •  We use a loss function. Think of it like a scorekeeper. It measures how close the generated speech is to what we want.
  • Example: If you want the speech to say ""Hello"" cheerfully and it says ""Hello"" sadly, the score will be high (bad match). If it says ""Hello"" cheerfully, the score will be low (good match).

"
198
"Should we include things like tone and emotion in our scorekeeping when using TTS?"
  • Non-verbal cues are things like intonation (voice rise and fall) and emotion (happy, sad, etc.). Including these in our scoring helps make the speech sound more natural.

199
Why are the meanings of non-verbal cues in synthesized speech still random?

"The randomness in the meanings of non-verbal cues is due to the way the technology generates these cues based on the training material received from humans. The system tries to reproduce what it has learned, but it doesn't always know where to mark the cues accurately, which can affect the naturalness of the speech."
200
"What does the focus action mark in synthesized speech?"
"The focus action marks that something is new to the listener. If the focus is on the right mark, the speech sounds natural; otherwise, it sounds robotic."
201
"How does WaveNet aim to make computer-generated voices more human-like?"
"WaveNet includes intonation, accents, emotion, and other vital layers of communication to deliver a richness and depth to computer-generated voices that earlier systems overlooked."
202
"What doubts does Tom Lentz have about systems generating affective states from text alone?"
"Tom Lentz doubts that a system can generate a speaker's affective state or common ground/shared understanding between speaker and listener with only text. Non-verbal cues necessary for affective states are limited in text, and information about common ground may be less explicit."
203
"What are some potential solutions for improving the naturalness of synthesized speech?"
  • Choice of words: Systems can use specific words to express emotions.
  • Previous conversation: Utilizing information structure from previous interactions can help improve naturalness.

204
"What are the research questions related to conveying an affective response through a robot's speech?"
  1. Can people perceive empathic behavior from a robot when only the emotions in its speech are used to express empathy?
  2. Do people prefer an empathetic voice from robots or a non-empathetic robotic voice?
  3. What factors of speech can be related to an empathetic voice?

205
What is the uncanny valley effect?

"The uncanny valley effect occurs when humanoid objects appear almost, but not exactly, like real humans, eliciting negative reactions."
206
"What method was used to study the perception of empathy in robotic speech?"
"The method involved an actor varying only prosody (intonation, rhythm, and stress) while speaking through a healthbot and a human speaking, both not visible to the participant."
207
"Can people perceive empathic behavior from a robot based on its speech alone?"
"Yes, users preferred an empathetic voice from robots and were able to perceive empathic behavior when only the emotions in the robot's speech were used to express empathy."
208
"What factors contribute to the perception of an empathetic voice in robotic speech?"
"
Users recognized additional emotional nuances such as empathy, concern, and encouragement in the robot's voice.

These factors contributed to their preference for an empathetic voice. Conversely, individuals tended to avoid choosing a robotic voice that lacked emotions and exhibited monotony.

"
209
What do the study results suggest about emotional expressiveness in robotic voices?

The results suggest that emotional expressiveness and variation in the voice are crucial for user acceptance and preference.

210
What was the method used to study stress in speech?

"The method involved manipulating stress (as in ""I am stressed""), which could be present, absent, or copied from the participant. The manipulation was done manually by adding wavering in pitch."
211
"What were the conditions regarding stress in the voice in the study?"
"
  1. Stress present: The speech included stress cues.
  2. Stress absent: The speech did not include stress cues.
  3. Stress copied from participant: The speech mirrored the stress cues present in the participant's voice.
"
212
What were the results of the study on stress in speech?

"
  • No significant change in stress: The study found no significant change in stress levels between the conditions (presence, absence, or mirrored affect). This lack of significance might be due to limitations in the study, such as a potential power issue or other influencing factors.
  • Significant effect in task success: Participants' performance on a shared task was influenced by the presence of their own or mirrored affect. This implies that emotional expression, even if not consciously perceived, had an impact on task success. The emotional cues present in participants' speech affected their interaction and collaboration, leading to differences in performance outcomes.
"
213
"What is Vall-E, and what are its key elements in training?"
"
 Vall-E is a model developed by OpenAI designed to mimic any given voice, including its emotional nuances. It can speak in any voice (including its emotion) if given a 3-second example of the desired speaker's voice. The key elements of the training are:
  • Ground truth: The human speaking target (what the speech should sound like).
  • Baseline: A simple text-to-speech model that lacks the ability to capture the nuances and subtleties of human speech, especially in terms of emotion.
  • Prompt: A 3-second example of the desired speaker’s voice, serving as the training input for the model to learn and replicate the specific voice and emotion.
"
214
"Is a 3-second example of the desired speaker's voice enough for Vall-E to mimic any voice including its emotional nuances?"
"
While Vall-E is designed to mimic any voice, including its emotional nuances, using only a 3-second example, the effectiveness of this method depends on the complexity of the emotions and the range of nuances in the target voice. The system may capture the general characteristics and some emotional aspects, but it might not fully replicate more complex or subtle emotional nuances that require a deeper understanding of the context and prolonged exposure to the speaker's voice.

"
215
"Do actors simply copy a whole emotion?"
"No, actors typically break down emotions into various components. For example, portraying doubt and despair involves understanding and expressing specific elements such as tone, pitch, pacing, and emphasis. It's a nuanced process that goes beyond merely copying a single emotion in its entirety."
216
"Are focus, tail, and common ground predictable in emotional speech?"
" In emotional speech, there can be some predictability in terms of focus, tail, and common ground. Certain emotions may lead to recognizable patterns in speech, like changes in pacing, pitch, or emphasis. However, this predictability is not universal and can vary based on individual differences and contextual factors."
217
"Can Vall-E generate or set emotions?"
Vall-E, developed by OpenAI, can replicate and generate emotions in speech to some extent. The model is trained to mimic the emotional nuances present in a provided 3-second example of a speaker. However, the complexity of emotions and their contextual nature may pose challenges in generating highly nuanced or context-specific emotional expressions.

218
"Can Vall-E incorporate body language?"
"No, Vall-E, being a text-to-speech model, focuses on generating spoken content and does not have the capability to incorporate or mimic body language. Body language involves visual cues such as gestures, facial expressions, and postures, which fall outside the scope of Vall-E's capabilities."
219
What about other non-verbal speech cues?

"Vall-E primarily focuses on replicating vocal aspects, including intonations and emotional cues in speech. However, it does not encompass other non-verbal speech cues like pauses, hesitations, or changes in rhythm, which also contribute significantly to effective communication. The model is limited to the auditory domain and doesn't account for the full range of non-verbal cues present in human communication."
220
" What is metacognition?"
"
Metacognition is the ability of people to think about their own thinking. It refers to a person's beliefs and knowledge about their own cognitive processes.

"
221
"
What is the difference between understanding another person's mental state and your own mental state in metacognition?

"
"
  1. Understanding another person's mental state:
    • Studied in the context of theory of mind.
    • Concerned with developmental or pathological aspects of metacognition.
    • The ability to understand that other people have beliefs, desires, intentions, and perspectives that may differ from one’s own.
    • Example: Sally-Anne test (assessing the ability to look inside another person’s head).
  2. Understanding your own mental state:
    • Awareness of one’s own cognitive processes, such as memory, attention, problem-solving strategies, and emotional states.
"
222
What is the Sally-Anne test and what does it assess?

 The Sally-Anne test assesses theory of mind by testing if a child understands that others can have false beliefs. Children are asked where Sally will look for a ball after it has been moved while she is absent. Those who have developed a theory of mind understand that Sally will look for the ball where she last left it, not where it actually is.

223
"What is the Tip of the Tongue (TOT) phenomenon?"
"The Tip of the Tongue (TOT) phenomenon is the feeling of being unable to recall a specific word or piece of information, even though you know it is stored in your memory and feels like it’s just on the brink of being retrieved."
224
 How does the degree of (un)certainty affect self-presentation in communication?

Differences in confidence levels are reflected in the way speakers present themselves, which is useful for their addressees.
For the speaker:
  • It serves as a face-saving strategy (not appearing ridiculous if wrong).
For the addressee:
  • It manages expectations and can make them more prone to asking again or asking someone else.

225
What are some auditory cues that indicate confidence or uncertainty in speech?

"
  • Linguistic hedges: Phrases like ""I am not sure, but..."" or ""I think...""
  • Filled pauses: Words like ""uh"" and ""uhm""
  • Prosody: Using question intonation

  • "
    226
    "What are possible visual cues that indicate confidence, uncertainty, or hesitation?"
    Visual cues include:
    • Body language
    • Facial expressions
    • Gestures
    These cues are natural and important ingredients of daily conversations as well.
    227
    "



    What does the flowchart by Nelson and Narens (1990) illustrate about metacognitive processes?

    "
    "
     The flowchart illustrates the process of answering a question with varying degrees of certainty:
    1. A question is asked: ""What is the capital of Switzerland?""
    2. Feeling of Knowing?
      • If yes, proceed to search memory (LTM).
      • If no, the answer is ""I don't know.""
    3. Willing to Search Longer?
      • If yes, continue searching.
      • If no, the answer is ""I don't know.""
    4. Answer Found?
      • If yes, check confidence level.
      • If no, continue searching if willing.
    5. Sufficiently Confident?
      • If yes, provide the answer (""That's Zurich."").
      • If no, continue searching or decide ""I don't know.""
    The flowchart shows how individuals navigate between certainty, uncertainty, and metacognitive awareness when answering questions.
    "
    228
    "What is the first stage of Hart's 1965 experiment on the production of uncertainty?"
    "Answering factual questions using tests like WISC, WAISC, and Trivial Pursuit."
    229
    "What are FOK-scores in the context of Hart's 1965 experiment?"
    OK-scores are subjective ratings about how confident individuals are in their ability to recognize the correct answer to a question if presented later.

    230
    "What happens in the third stage of Hart's 1965 experiment on uncertainty?"
    Participants take a multiple-choice test to recognize the correct answers, particularly those they were uncertain about initially.

    231
    "What does the term ""Tip of the tongue"" mean in Hart's experiment?"
    "
    It refers to cases where individuals say ""I don't know"" in the first stage but have a high FOK in the second stage.

    "
    232
    " How do filled pauses, delays, and high intonation affect FOK scores in adults during Hart's experiment?"
    These cues are associated with significantly lower FOK scores indicating higher uncertainty.

    233
    "What is the correlation between word production and certainty in Hart's study?"
    "There is a negative correlation; the more words people produce, the less sure they are."
    234
    "How do adults and children differ in expressing uncertainty according to Hart's study?"
    " Adults are more expressive with facial expressions and justifications for their answers, while children are less expressive and more likely to remain silent."
    235
    What visual cues are used to express uncertainty?
    "Eyebrow movements, smiling, and gaze patterns are visual cues indicating uncertainty."
    236
    How does smiling affect FOK scores for non-answers in adults versus children?
    " For adults, smiling correlates with higher FOK scores (embarrassment), while for children, it indicates pride in knowing the answer."
    237
    "What is the conclusion about expressing uncertainty from Hart's experiment?"
    Speakers use various audiovisual cues to express uncertainty, with adults doing so more than children due to better self-presentation skills.

    238
    "What were the results for children regarding FOK scores and non-answers in Hart's experiment?"
    Children had fewer high FOK scores for non-answers and were generally less expressive than adults.

    239
    "Why don't children show facial expressions when uncertain like adults?"
    "
    Children don't have the social skill to do facial expressions when they don’t know the answer, unlike adults.

    "
    240
    What do adults do if they don’t know something?
    Adults justify their silence, while children just stay silent.

    241
    What are some verbal cues of uncertainty?
    "High intonation, filled pauses, delay, and using more words."
    242
    What are some visual cues of uncertainty?
    1. Eyebrow movement,
    2. smile,
    3. “funny face”,
    4.  and gaze (looking away from the questioner).

    243
    " How do adults' non-answers differ from children's in terms of FOK-score?"
    "Adults' non-answers with filled pauses, delay, high intonation, etc., correspond with a significantly higher FOK score, while children's do not have such significant patterns."
    244
    What conclusion is drawn about speakers expressing their level of uncertainty?
    Speakers express their level of uncertainty via various audiovisual cues, with adults doing this much more than children.

    245
    " What does FOAK stand for? "
    Feeling of Another’s Knowing.

    246
    How can observers estimate a speaker’s level of uncertainty?
    "Observers can estimate a speaker’s level of uncertainty based on audiovisual cues."
    247
    Are answers or non-answers easier for observers to estimate uncertainty?
    "Answers are ""easier"" to estimate uncertainty than non-answers."
    248
    How do unimodal and bimodal stimuli compare in estimating uncertainty?
    Scores for unimodal stimuli (sound only and vision only) are good, but scores for bimodal stimuli (both sound and vision) are the best.

    249
    What was the task for different speakers/judges in the perception of uncertainty study?
    The task for children vs. adults was to judge the level of (un)certainty.

    250
    How did children find judging certainty in other children compared to adults?
    Children found it very difficult to judge other children on certainty but found it easier to judge adults.

    251
    How did adults find judging certainty compared to children?
    "Adults found it way easier, but found it easier to interpret for other adults than for children."
    252
    Who are better judges of uncertainty, adults or children?
    "Adults are better judges than children."
    253
    Who are better judged for uncertainty, adults or children?
    " Adults are better judged than children because adults signal their certainty more clearly."
    254
    What type of data was manipulated in the study of manipulatd data?
    "Answers (1 certain, 1 uncertain) from 5 speakers were selected; words had to have a similar sound shape."
    255
    How were the sound and image manipulated to create mixed stimuli in the study of manipulated data?
    " Sound and image were separated to create combinations of certain and uncertain settings for three variables: filler (absent, present), high intonation (absent, present), and marked facial expression (absent, present)."
    256
    What were the combinations of manipulated stimuli in the experiment of manipulated data

    The combinations were:
    • Face sure, voice unsure
    • Face sure, voice sure
    • Face unsure, voice unsure
    • Face unsure, voice sure
    257
    How many stimuli were created in total in th xprimnt of manipulated data?
    "A total of 40 stimuli were created."
    258
    "What was the experimental procedure for rating the speaker's confidence level in the experiment of manipulated data?"
    Both original and mixed stimuli were presented to 120 subjects who rated the speaker’s confidence level on a 7-point scale.

    259
     How many different experiments were conducted, and why in th experiment of manipulated data?
    "Eight different experiments were conducted to ensure subjects would not see the same speaker within one test."
    260
    What effect did fillers, high intonation, and marked facial expressions have on certainty perception in the experiment of manipulated data?
    • Presence of a filler led to a systematic increase in perception of confidence level.
    • Stimuli with high intonation were perceived as more uncertain.
    • Stimuli with marked facial expressions were perceived as more uncertain.
    261
    What previous work does this study (of manipulatd data) align with regarding incongruent stimuli and visual information?
    This study aligns with previous work on emotion perception showing the predominance of visual information (e.g., Mehrabian and Ferris).

    262
    What is meant by a filler?
    "
     A filler is a sound or word used to fill pauses in speech, often indicating hesitation or uncertainty. Common examples of fillers include ""um,"" ""uh,"" ""like,"" and ""you know.""

    "
    263
    Do cultures differ in the way they produce and interpret cues to uncertainty?
    Yes, cultures can differ in the way they produce cues to uncertainty and in how such cues are interpreted by observers.

    264
    Which cultures were compared in the study by cutural differences?
    "The study compared Dutch speakers with Japanese speakers."
    265
    Why are Japanese speakers an interesting test case in comparing uncertainty?
    Japanese speakers are interesting because they are often considered to be rather unemotional and have a tendency to avoid uncertainty more than Western cultures.

    266
    What was the task for the subjects regarding the Feeling of Knowing (FOK) scores in the study comparing cultures?
    " Subjects rated their certainty on a scale from 0 (not sure) to 7 (very sure), with the task being the same for both Dutch and Japanese adults."
    267
    What was the method for the perception of uncertainty in the study of comparing cultures?
    "Randomly selected answers with low-FOK and high-FOK ratings from 8 Dutch and 8 Japanese speakers were presented to Dutch and Japanese observers. The task was to rate the speaker’s certainty on a 7-point scale."
    268
    How many raters participated and what was their composition?
    "88 raters participated, with 44 Dutch and 44 Japanese, equally balanced across gender."
    269
    What were the results regarding the ease of judging certainty/uncertainty?
    • It was easier to judge Dutch speakers’ certain/uncertain answers than Japanese speakers’.
    • It was easier to judge females than males regarding their confidence levels.
    • There was no significant in-group effect observed.
    270
    What are the two main effects observed in the study about comparing cultures?
    • Difference between certain and uncertain answers was easier to judge for Dutch than for Japanese speakers.
    • It was easier to judge female speakers than male speakers on their confidence levels.
    271
    What conclusion can be drawn regarding cultural differences and gender in uncertainty perception ?
    The study concluded that it is generally easier to judge uncertainty for Dutch speakers compared to Japanese speakers, and female speakers provide clearer cues of confidence than male speakers, regardless of culture. There was no in-group bias (scoring people higher from their own culture)) observed in the study
    272
     Are there reasons to assume that facial cues to prominence differ between the upper and lower part of the face? Between the left and the right side of the face? Motivate your answer based on what we discussed in the colleges.
    "
     Yes, there are reasons to assume differences in facial cues to prominence:
    • Upper vs. Lower Part of the Face: The upper part of the face, especially the eyebrows, is often used to signal prominence. Rapid eyebrow movements (flashes) can play a similar role to pitch accents in speech, signaling emphasis or importance. The lower part of the face, such as the mouth, can also indicate prominence but is more often associated with emotional expressions.

    • Left vs. Right Side of the Face: Observers are more sensitive to dynamic variations in the left part of the face compared to the right. This could be because the left side of the face (from the observer's perspective) is more expressive and connected to the right hemisphere of the brain, which is involved in processing emotions. Studies have shown a significant correlation between pitch and left eyebrow movements, indicating a stronger connection between auditory and visual cues on the left side of the face.

    "
    273
    "Imagine a speaker who instructs someone else to move an object on a chessboard using phrases like ""Move the object from A2 to A3"" and ""Move the object from A2 to B3."" What would be the prosodic difference between these two phrases if they would be uttered by a native speaker of English? What do you expect to happen prosodically when the two phrases would be uttered by a native speaker of French in his/her language?"
    "
    Prosodic Differences in English (Germanic Language):
    • Phrase 1: ""Move the object from A2 to A3"":
      • Intonation: Relatively flat with a slight rise on ""A3"" to indicate the end of the instruction.
      • Stress: Slight emphasis on ""A3"" to mark the final destination.
    • Phrase 2: ""Move the object from A2 to B3"":
      • Intonation: More noticeable rise on ""B3"" to emphasize the different destination and direction.
      • Stress: Contrastive stress on ""B"" in ""B3"" to differentiate it from ""A3.""
    Prosodic Differences in French (Romance Language):
    • Intonation: Generally smoother and less variable than in English. A slight rise on ""A3"" to indicate the end.
    • Stress: French does not use stress for contrast as strongly as English. The phrase would likely have a more even stress pattern.

    "
    274
    "Why is spoken communication considered risky business? "
    Communication via spoken language is not an exact data transfer process; many things can go wrong because:
    • Speakers may experience problems expressing themselves.
    • Addressees may not fully understand what a speaker is saying.
    • Spoken language is a very evanescent phenomenon (speech is immediately gone).

    275
    What are the phases involved in grounding information in spoken dialogue?
    The process of grounding information typically proceeds in two phases:
    1. Presentation Phase: The current speaker sends a message to their communication partner.
    2. Acceptance Phase: The receiver signals whether the message was understood correctly or not.
    276
    How do conversation partners handle the infinite loop of feedback in spoken communication?
    Conversants circumvent the infinite loop by signaling that they received the feedback correctly, and this signaling cycle continues in a manageable way (Clark, Traum).

    277
    How do communication partners negotiate information during a conversation?
    Communication partners negotiate information through continuous signals on the status of the information being exchanged, similar to teamwork in activities like dancing or playing chess.

    278
    What are the types of feedback cues used in dialogue?
    "
    There are two main types of feedback cues:
    • Positive Feedback Cues: Signals like ""go on"" indicating that there are no problems with the information being exchanged.
    • Negative Feedback Cues: Signals like ""go back"" indicating that there are problems with the information being exchanged.

    "
    279
    Why is it more essential to detect go-back signals than go-on signals in spoken communication?
    "
     It is more essential to detect go-back signals (indicating a ""conflict"") because the consequences of ignoring these signals can be significant, leading to larger-scale conversation problems.

    This is similar to the traffic light metaphor where not stopping at a red light (conflict) is more critical than not following a green light (confirmation).

    "
    280
    What expectation arises from the traffic light metaphor in the context of feedback cues?
    The expectation is that negative feedback cues are more marked and stand out more compared to positive cues, similar to how a red light stands out more due to the potential consequences of ignoring it.

    281
    "What question arises about prosodic/non-verbal cues in the context of feedback cues? "
    The question is how prosodic and non-verbal cues compare to lexico-syntactic cues in indicating positive and negative feedback.
    282
    "What is hyperarticulation?"
    "It is speaking in an exaggerated manner, typically with a slower tempo, louder voice, higher pitch, and more pauses, often used in problematic dialogues."
    283
    "which settings is hyperarticulation commonly observed?"
    "In child-directed speech, speech over long distances, and Lombard speech (e.g., speaking louder in a noisy environment)."
    284
    "What was the primary focus of Study 1: Negations in Dutch on auditory feedback cues?"
    "To examine how people use negations in Dutch to signal problematic dialogue contexts."
    285
    "What were the participants in Study 1: Negations in Dutch asked to interact with? "
    Two speaker-independent spoken dialogue systems that provided train timetable information.

    286
    "What types of responses were compared in Study 1: Negations in Dutch?"
    "
    Responses to ""Do you want me to repeat the information?"" (go on) and ""Do you want to travel to Amsterdam?"" (go back).

    "
    287
    "How many participants and dialogues were involved in Study 1: Negations in Dutch?"
    There were 20 participants interacting in 120 dialogues in total.

    288
    "What is meant by a problematic dialogue context in Study 1: Negations in Dutch?"
    "A situation where there is misunderstanding or communication issues, causing the speaker to elaborate more to clarify the problem."
    289
    "What is meant by a non-problematic dialogue context in Study 1: Negations in Dutch"
    "A situation where the conversation proceeds smoothly without misunderstandings, requiring less elaboration."
    290
    "How did problematic and non-problematic cases differ in Study 1: Negations in Dutch?"
     Problematic cases had more elaborate responses with additional information to clarify issues in the conversation.

    291
    What are the two types of verification questions mentioned in the study about dutch negotions?
    "he two types are: ""Do you want me to repeat the information?"" (go on) and ""Do you want to travel to Amsterdam?"" (go back)"
    292
    What is the distribution of “no” responses analyzed in the study about neegotions in dutch?

    "The distribution is analyzed into three types: single no, no with additional information (stuff), and more detailed no responses."
    293
    "Wat gebeurt er met spraak in probleemgesprekken?"
    "Mensen praten langzamer, luider, en met meer pauzes."
    294
    "Noem een voorbeeld van probleemspraak."
    • Praten met kinderen, schreeuwen naar iemand ver weg, of spreken in een lawaaierige omgeving.

    295
    "Wat is kenmerkend voor negatieve feedback in probleemgesprekken?"
    Het wordt langzamer en langer uitgesproken.
    296
    "
    • Wat zijn de verschillende manieren waarop mensen ""nee"" zeggen?

    "
    "
    • Soms alleen ""nee"", soms ""nee"" met extra woorden zoals ""Amsterdam"".

    "
    297
    Wat doen mensen in probleemgesprekken met hun antwoorden?
    Ze voegen meer details toe.
    298
    "Wat was de taak van de 25 deelnemers in het perceptie-experiment?"
    "Ze moesten beoordelen of ""nee""-uitingen uit probleemgesprekken kwamen.
    "
    299
    "Hoe goed konden luisteraars de ""nee""-uitingen beoordelen?"
    "Ze konden dit goed beoordelen, ver boven toevalsniveau."
    300
    "Hoe ziet een ""ga door"" antwoord eruit in de grafiek?"
    "
    • Mensen zeggen ""nee dankjewel"" in één vloeiende zin zonder pauze.

    "
    301
    " Hoe ziet een ""ga terug"" antwoord eruit in de grafiek?"
    "
    • Mensen zeggen ""nee"" en daarna extra woorden met pauzes ertussen.

    "
    302
    " Wat gebeurt er met de pauze na ""nee"" in probleemgesprekken?
    "
    "De pauze na ""nee"" is langer."
    303
    "Hoe verandert de toonhoogte (F0) van de extra woorden in probleemgesprekken?"
    De toonhoogte (F0) van de extra woorden is hoger in probleemgesprekken.
    304
    Wat is precies het verschil tussn go-back n go-forward antwoorden?
    • Ga terug (Go back) Antwoorden: Deze antwoorden geven aan dat er iets moet worden aangepast of heroverwogen. Ze bevatten vaak meer informatie om de situatie te verduidelijken en om duidelijk te maken dat er een probleem is of dat er iets verkeerd is begrepen.
    • Ga door (Go forward) Antwoorden: Deze antwoorden geven aan dat alles in orde is en dat het gesprek kan doorgaan zonder aanpassingen. Ze zijn vaak korter en bevestigen dat er geen probleem is
    305
    "What are echoic responses in conversations?"
    "Echoic responses are when people often repeat each other’s words or phrases during conversations."
    306
    "What are the two types of echoic responses?"
    "The two types of echoic responses are priming behavior and conventionalized behavior."
    307
    "What is priming behavior in the context of echoic responses?"
    "Priming behavior is when people unconsciously copy each other’s expressions. For example, if one person uses a specific word or phrase, the other person might repeat it without realizing it."
    308
    "What is conventionalized behavior in echoic responses?"
    "Conventionalized behavior refers to standard actions or expressions commonly used in social interactions, such as greetings like bowing, kissing, or hugging. Mimicking these behaviors follows social norms and expectations."
    309
    "How do echoic responses serve as feedback in conversations?"
    "Repeating words or phrases can indicate feedback, showing whether the speaker wants to continue (go-on) or clarify something (go-back)."
    310
    " Give an example of a go-on signal using repeated words."
    "
    A: ""and then you transfer to the Keage line...""
    B: ""Keage line""
    A: ""which will bring you to Kyoto station""

    "
    311
    "Provide an example of a go-back signal with repeated words."
    "A: ""and that is the Keage line...""
    B: ""Keage line?""
    A: ""that’s right, Keage line""
    "
    312
    "What was the task given to participants in the study about echoic rsponses in japan?"
    "One student instructed another on how to build a specific construction using building blocks, with the goal of making it as similar as possible to a picture only the instructor could see."
    313
    "What was found about the use of repeated utterances for negative feedback cues?"
    Negative feedback cues were more likely to be

    • higher in pitch, 
    • slower in tempo, 
    • and produced after a longer delay.

    314
    "How can prosodic features help manage interaction in conversations?"
    " Prosodic features can help manage interaction by signaling whether the speaker wants to continue (go-on) or go back and clarify something (go-back)."
    315
    "How do the Japanese results on repetitive utterances compare to Dutch data on negations?"
    "The Japanese results are in line with the Dutch data, showing that speakers tend to make prosodic differences between go-on and go-back signals."
    316
    "What does the consistency of prosodic cuees in go-on and go-back signals results across different languages suggest?"
    "The consistency suggests that these patterns of using prosodic features to manage interaction may be a general characteristic of human communication."
    317
    "What are some common issues with human-machine interactions?"
    " People sometimes experience problems with a system (car, telephone, computer, radio) because it was not designed in an appropriate way.

     A system operates badly if it does not take into account the failings of the human cognitive system and human skills.

    "
    318
    " What is a good design principle for systems?"
    " A good design principle is to “design for error”, considering limitations in attention, consciousness, real-life experiences, and ergonomics."
    319
    "What are spoken dialogue systems (SDS)?"
    "Spoken dialogue systems are systems with which humans are supposed to interact in natural (spoken) language."
    320
    " Why do spoken dialogue systems often face problems?"
    SDS often face problems because they are not yet designed to handle the full range of human linguistic skills.

    321
    " How are SDS typically trained?"
    SDS are typically trained with normal speech, not accounting for variations like fast, slow speech, or repetition.

    322
    "Why will errors remain a problem for future systems?"
    "Errors will remain a problem due to noisy conditions, interactions with non-native speakers, or an expanded domain of the system."
    323
    "What are the three main tasks for dialogue managers in SDS systems?"
    The three main tasks are to
    1. Prevent errors,
    2. Detect errors,
    3. Correct errors.

    324
    "How can systems prevent errors?"
    ystems can prevent errors by using optimal dialogue strategies.

    325
    "How can systems detect errors?"
    Systems can detect errors using acoustic and semantic confidence scores.

    326
    "How can systems correct errors?"
    Systems can correct errors by using feedback cues and system prompts.

    327
    "How many subjects were involved in the audiovisual corpus study : Dutch Interaction with Dialogue System?"
    "9 subjects were engaged in telephone conversations with a speaker-independent train timetable information system."
    328
    "What task did the subjects perform in the study: Dutch Interaction with Dialogue System?
    "
    They had to query the system on 7 train journeys, resulting in 63 interactions.

    329
    "How was the data collected during the interactions in the study: Dutch Interaction with Dialogue System"
    " Subjects were video-taped and led to believe the data collection was for developing a new video-phone"
    330
    " What percentage of the dialogues were successfully completed in Dutch Interaction with Dialogue System
    "
    76% of the dialogues were successfully completed.

    331
    " Wat was de onderzoeksvraag in het onderzoek uitgevoerd door Wang et al aan MIT?"
    Hoe goed kunnen proefpersonen problematische en niet-problematische fragmenten onderscheiden in human-machin-intractions op basis van video-opnames?



    332
    Waaruit bestonden de minimal pairs van Wang et al.
    " In dit specifieke onderzoek werden de videoclips zorgvuldig geselecteerd om ""minimal pairs"" te vormen, waarbij elk paar vergelijkbare uitingen bevatte die plaatsvonden in een problematische en een niet-problematische dialooguitwisseling. De proefpersonen kregen de taak om te raden of de gepresenteerde clip afkomstig was uit een problematische of een niet-problematische context​"
    333
    "Wat was de hypothese in het onderzoek van Wang et al."
    "Proefpersonen kunnen problematische van niet-problematische interacties onderscheiden boven kansniveau door gebruik te maken van audiovisuele cues zoals hyperarticulatie en visuele signalen."
    334
    "Wat waren de onafhankelijke variabelen in dit onderzoek van Wang et al."
    "
    1. Het type fragment (problematisch of niet-problematisch), 
    2. het niveau van hyperarticulatie, en 
    3. de aanwezigheid van visuele cues zoals glimlach, hoofdbeweging, afgewende blik, fronsen, en wenkbrauwheffen.
    "
    335
    "Wat was het algemene onderzoeksopzet van de drie perceptie-experimenten?"
    "
    Proefpersonen bekeken videoclips van menselijke-machine interacties en moesten bepalen of elk fragment problematisch of niet-problematisch was. De experimenten waren onderverdeeld in drie typen:
    • Verificatievragen: Proefpersonen zagen gebruikers luisteren naar verificatievragen van het systeem (gebruikers zijn stil), wat probleemloos (juist) of problematisch (fout) kon zijn. Ja (op/neer) en nee (links/rechts) zijn bijna aangeboren, bekend vanaf jonge leeftijd.
    • Bestemmingsuitingen: Proefpersonen zagen sprekers een bestemming uiten; dit kon de eerste poging van de spreker zijn (probleemloos) of een correctie als reactie op een verificatievraag over verkeerd herkende of begrepen informatie.
    • Negaties: Proefpersonen zagen sprekers een negatie (""nee"") uiten, wat een reactie kon zijn op een algemene ja-nee vraag of een reactie op een verificatievraag met onjuiste informatie.
    "
    336
    Welke aspecten droegen bij aan het onderscheiden van of eeen fragment uit en problmatisch gesprek kwam of niet?
    "de mate van hyperarticulatie en verschilllend visuele ""cues"" zijn positief gecorreleerd met hoe goed mensen onderscheid kunnne makeeen."
    337
    Waarom is het onderzoek van Wang et al. over de manier waarop mensen problematische interacties herkennn relevant?
    "In dit specifieke onderzoek betekent dit het gebruik van dynamische variaties in de stem en gezichtsuitdrukkingen van gebruikers om te detecteren wanneer een interactie mogelijk problematisch is. Deze informatie kan vervolgens worden gebruikt om het dialoogsysteem te verbeteren door vroegtijdig problemen te signaleren en erop te reageren, wat de algehele gebruikservaring verbetert."
    338
    Waarom is het gezicht belangrijk, voor het leren herkennen van problematische interacties?
    "Omdat een spoken dialogue system daarop in kan spelen voordat de dialoog uberhaupt begonnen is."
    339
    "What is audiovisual prosody believed to reveal?"
    "Audiovisual prosody is commonly believed to reveal a speaker's emotions (e.g., negative vs positive)."
    340
    "How do children express their emotions compared to adults?"
    "Children express their emotions more openly than adults. But it is dependent on temper and family background."
    341
    "What happens to a child's emotional expression as they grow older?"
    "As a child grows older, they become less expressive due to internalization and learn to manipulate their expressions due to emotion regulation."
    342
    "What must participants do for each undisclosed card in the card game?"
    "Participants must guess whether each next undisclosed card contains a higher or lower number."
    343
    "What does making ""rational choices"" in the card game imply?"
    "Making ""rational choices"" implies 3 winning and 3 losing games."
    344
    "How many pairs of children participated, and what were their age groups?"
    " The game was done with pairs of children: 24 younger children (8-year-old) and 24 older ones (12-year-old)."
    345
    "In the perception study of the card game, what did observers have to determine? "
    "Observers had to determine for each pair of children whether they had just won or lost a game."
    346
    "According to the results of the card game expriment, which group was more expressive, and in what context?"
    "8-year-old children were more expressive when losing a game, while 12-year-old children were less expressive about winning or losing."
    347
    "What were the cross-cultural differences found in the study?"
    "Pakistani children were overall more expressive than Dutch children, with winning being more visible than losing in Pakistani children. Different conventions were used to show happy or sad reactions."
    348
    "How did the presence of others affect children's expressiveness?"
    " Children were less expressive when alone."
    349
    What is the pressence effect?
    "children lss xpressive  when being alon"
    350
    What key question is explored about children interacting with robots?
    Can children interact and collaborate with a robot in a social and intuitive way, and how similar is this to their interactions with peers?

    351
    How many children participated in the experiment of intracting with a robot and what were the conditions?
    "256 children (Dutch and Pakistani) participated in one of three conditions: alone, with iCat, or with a friend.
    "
    352
    "What was the outcome regarding children's fun in the experiment where it had to interact with robots."
    Children had the most fun playing with their peers, the least fun playing alone, and playing with iCat was in between.
    353
    "What was the setup of the experiment involving video-mediated interaction with children?"
    The setup involved video-mediated interaction where children could either have mutual eye-contact or not. They were always in different rooms but could see each other through a screen.

    As a control condition there was also research for children in the same room (co-presence)
    354
    "What were the results of the study about eye gaze in terms of children's fun levels?"
    Children reported having the most fun in the mediated mutual gaze condition, followed by the co-presence condition, and the least fun in the no gaze condition.

    355
    "What important effects does mutual eye-gaze have according to the study?"
    "Mutual eye-gaze has important effects on perceived social presence, game experience, and player behaviors, even if the eye-gaze is not perfect."
    356
    "What is the debate regarding gender differences in emotions? "
    "Alleged gender differences include the experience, expression, and perception of emotions.

     It has often been claimed that women are more emotional than men.

    The debate is whether such differences are real and, if so, whether they are related to biological and/or socio-cultural factors."
    357
    "What limitations are mentioned about previous investigations into emotional differences between men and women?"
    " Previous investigations have generally been limited and based on stimuli with limited ecological validity, such as still images rather than moving images and acted emotions rather than naturally induced emotions."
    358
    "What have few studies tried to combine regarding emotional differences?"
    "Few studies have tried to combine multiple perspectives (experience, expression, and perception) into one integrated approach."
    359
    "What are some Mood Induction Procedures (MIPs)?"
      • Velten method(1968)
      • Film (e.g., Gross and Levenson 1995)
      • Music (e.g., Sutherland et al. 1982)
      • Feedback/Social Interaction (e.g., Staudel and Paetzold 1984, Yinon and Landau 1987)
      • Gift (e.g., Isen et al. 1987)
      • Facial expression (e.g., Leventhal 1980)
    360
    "What is the Velten method?"
    "The Velten method, developed by M. E. Velten in 1968, is a mood induction procedure used in psychological research. It involves reading a series of self-referent statements designed to elicit a specific mood. Participants read these statements aloud or silently to induce positive, negative, or neutral emotional states. For example, positive statements might include phrases like ""I feel good about myself,"" while negative statements might include ""I feel very down."""
    361
    "What was the focus of Westermann et al. (1996) meta-analysis?"
    "The meta-analysis was based on 250 studies from 22 international journals and evaluated the effectiveness of different Mood Induction Procedures (MIPs)."
    362
    "How were the MIPs ordered based on effect sizes (r) according to Westermann et al. (1996)?"
      • Film [r = 0.738]
      • Feedback [r = 0.494]
      • Velten [r = 0.467]
      • Gift [r = 0.378]
      • Music [r = 0.360]
      • Facial expressions [r = 0.122]

    363
    "What are the two parts of the current research on gender-related differences in emotions?"
    The two parts are:
    • Production study
    • Perception study
    364
    " How many participants were in the production test and what were the moods of the films?"
    "There were 33 participants (16 males, 17 females) with moods being depressed and elated (positive and negative valency)."
    365
    "What was the mood induction procedure in the production test?"
    "The mood induction procedure involved 7-minute film fragments."
    366
    "What was the purpose of the production test using the film method? "
    The purpose was to study the influence of mood on solving dilemmas.
    367
    "Provide an example dilemma from the production test."
    • It is your turn to order at the bakery, but someone else goes before you. What do you do?
      • A: You don’t say a thing, since you have all the time in the world, or
      • B: You get angry with this asocial behavior and point out that it is your turn to order.
    368
    "What do the results of the production test on induced emotions indicate about men and women?"
    The results show that:
    • Men reported higher levels of positive emotions compared to women.
    • Women reported higher levels of negative emotions compared to men.
    369
    "What stimuli were used in the perception test?"
    "The stimuli were 66 film fragments (10 seconds each, no sound) taken from viewing and interview steps of each speaker."
    370
    " What did the perception test results indicate about the perceived emotions based on the gender of the speaker?"
    1. Men were perceived to express more positive emotions compared to women.
    2. Women were perceived to express more negative emotions compared to men.

    371
    "What did the perception test results indicate about the perceived emotions based on the gender of the observer?"
  • Men perceived more positive emotions compared to women.
  • Women perceived more negative emotions compared to men.
  • 372
    "What were the key findings in sum from the perception and production test?"
    • The film MIP worked very well (reliable method).
    • Systematic (and significant) gender differences were found:

      1. Women feel induced emotion stronger.
      2. Women display induced emotion more clearly.
      3. Women perceive induced emotion more accurately.
    373
    "What are the key questions in the study on blind people and visual cues?"
    1.  Do blind people also exploit visual cues?
    2. Is the way they express such cues similar to that of sighted people?
    3. How do visual cues relate to their auditory ones?
    374
    "what were the taWhat was the main effect for sight in the classification results?sks with the experiment with blind peopl"
    "Observers tended to give more correct answers about sighted people (M = .61, SE = .01) than about blind people "
    375
    "Which emotion was most often guessed correctly in the study?"
    "Happiness "
    376
    "How did the emotions rank in terms of correct guesses?"
    "
    1. Happiness (M = .83, SE = .01), 
    2. sadness (M = .66, SE = .01), 
    3. anger (M = .44, SE = .01),
    4. scared (M = .36, SE = .01).
    "
    377
    "Which presentation condition led to the most correct answers?"
    "The audiovisual condition (M = .63, SE = .01)."
    378
    "How did the audio-only and video-only conditions rank in terms of correct answers?"
    Audio-only condition (M = .59, SE = .01)
    video-only condition (M = .49, SE = .01).

    379
    "How do blind and sighted people use cues to signal emotions?"
    "Both use auditory and visual cues to signal emotions, showing similar behavioral patterns."
    380
    "How do visual cues from blind people compare to those from sighted people?"
    Visual cues from blind people tend to be more difficult to judge.

    381
    "What compensatory effect was found in the study with blind people?"
    "Blind people use auditory cues more strongly."
    382
    "What future research question is suggested in the study?"
     How about other and more social emotions (e.g. cues to uncertainty)?

    383
    What is stance?
    Attitude you can have towards a message
    384
    What kind of stances can you havee/
    •  Epistemic (“knowledge-y”, e.g. certainty)
    •  Affective (you can feel something about the message)
    • Emotional (just emotional, not about the message)
    385
    Which different cues are there for stance?
    - Verbal cues (examples)
    * Modal verbs (could, would, should)
    * Particles (surely, probably, luckily)

     Not an endless list, just examples
    - Non-verbal cues
    * Facial gestures/prosody
    * Audiovisual rosody and feeling of knowing
    386
    "How does the number of cues affect the feeling of knowing?"
    The more cues, the less feeling of knowing.

    387
    "Wat waren de onderzoeksresultaten van de studie door Swerts & Krahmer (2005)?"
    "
    De resultaten toonden aan dat gezichtsuitdrukkingen en intonatie significante indicatoren waren van een persoon's ""feeling of knowing"", en dat deze signalen door anderen accuraat konden worden herkend en geïnterpreteerd.

    "
    388
    "Wat was de onderzoeksvraag van Swerts & Krahmer (2005) in hun studie over ""feeling of knowing""?"
    "Hoe mensen hun ""feeling of knowing"" kunnen uitdrukken en herkennen, en hoe deze samenhangt met non-verbale signalen zoals gezichtsuitdrukkingen en intonatie."
    389
    "Hoe beïnvloeden cues de ""feeling of knowing"" en de zekerheid bij zelfevaluatie volgens het onderzoek?"
    " Hoe meer cues er zijn, hoe meer onzekerheid er is bij zelfevaluatie van ""feeling of knowing"". Meer cues leiden tot een lagere feeling of knowing. Echter, bij het geven van non-antwoorden (als je zeker weet dat je het antwoord niet weet), leidt meer cues tot een hogere feeling of knowing, omdat je snel ""nee"" kunt zeggen."
    390
    "Wat toont de studie aan over het gebruik van prosodie en intonatie bij het bepalen of een antwoord volledig is?"
    Prosodie (intonatie) helpt bij het bepalen of een antwoord volledig is of niet. Verschillende tonen kunnen een logische stelling aangeven, en helpen luisteraars te beoordelen of het antwoord compleet is.

    391
    "Hoe helpt prosodie bij het opsommen van voordelen?"
    "Prosodie helpt door een lichte intonatieverhoging te gebruiken bij elke nieuwe punt (voordeel), wat de luisteraar helpt begrijpen dat elk punt apart en belangrijk is. De dalende toon aan het einde van de opsomming geeft aan dat de lijst compleet is."
    392
    "Wat is het effect van een intonatieverhoging tijdens het noemen van voordelen?"
    "Een intonatieverhoging bij elk voordeel signaleert dat elk punt apart en belangrijk is."
    393
    "Wat geeft een dalende toon aan bij een opsomming? "
    "Een dalende toon aan het einde van een opsomming geeft de luisteraar het signaal dat de lijst compleet is."
    394
    "Hoe helpt prosodie bij het geven van een volledig antwoord?"
    "Prosodie helpt door een dalende toon te gebruiken aan het einde van de opsomming, wat aangeeft dat het antwoord volledig is."
    395
    "Wat is de prosodie in een onvolledig antwoord bij een ja/nee vraag met bevestiging?"
    "
    In het onvolledige antwoord ""Hij heeft de achtergrondinformatie gegeven,"" wordt een lichte stijging of vlakke toon gebruikt, wat aangeeft dat het antwoord mogelijk nog niet volledig is en er meer informatie kan volgen.

    "
    396
    "Wat is een voorbeeld van prosodie in een ja/nee vraag met bevestiging?"
    "In het antwoord ""Ja, hij heeft de achtergrondinformatie gegeven, de huidige stand van zaken uitgelegd, en de toekomstige stappen besproken,"" geeft een dalende toon aan het einde aan dat alle punten besproken zijn en het antwoord volledig is."
    397
    "What is irony?"
    Irony is a way of using words so that their intended meaning is different from the literal meaning, often to create emphasis or humor.

    398
    "What is verbal irony?"
    "Verbal irony is when the literal meaning of what is said contrasts with the intended meaning, often for emphasis or humor."
    399
    "How is context relevant to irony?"
    "Context helps determine the intended valence (positive or negative tone) of an ironic statement."
    400
    "What is sarcasm and how does it relate to irony?"
    "Sarcasm is a subtype of irony that is negative and critical."
    401
    "What are some tropes related to irony?"
    "Tropes related to irony include hyperbole (exaggeration) and understatement (downplaying something)."
    402
    "What is jocularity?"
    "Jocularity is saying things in a fun and playful way."
    403
    "What is a rhetorical question?"
    "A rhetorical question is asked not to receive an answer but to make a point or create an effect."
    404
    "What is non-verbal irony?"
    "Non-verbal irony refers to a stance or property of a message where the context or co-text indicates an ironic meaning."
    405
    " How do we recognize verbal irony?"
    "Verbal irony is recognized through context and non-verbal cues."
    406
    "What is the difference between context and co-text?"
    "Context refers to the circumstances or environment, while co-text refers to the surrounding text."
    407
    "What is a stance?"
    "
    A stance is a speaker's attitude or position towards a topic, expressed through both verbal and non-verbal communication.

    "
    408
    "What is an example of co-text?"
    " In the sentence ""I wonder how comfortable the replacement bus service will be,"" the co-text could be ""I already expect it to be a disaster."" The co-text helps clarify that the statement is ironic."
    409
     Do markers (cues) appear during or after ironic statements?
    "Markers (cues) appear both during and after ironic statements, as shown by the increased presence of visual cues in both stages."
    410
    What happens to the percentage of utterances with visual cues during ironic statements?
    "The percentage of utterances with visual cues is higher during ironic statements compared to baseline statements."
    411
    What types of visual cues are more common during ironic statements?
    "During ironic statements, there are more visual cues such as movements in the general face, eyes, eyebrows, mouth, head, and gestures."
    412
    What was the methodology for gathering interactive data on irony?
    "Participants described videos using prompted sentences (e.g., ""These singers have a splendid future in the world of music"") and their responses were annotated for facial movements, gestures, lexical items, and prosody."
    413
    What did the results indicate about irony and visual cues?
    "The results indicated that irony involves more visual cues, leading to a higher perception of irony. When someone is ironic, there are more verbal and non-verbal markers."
    414
    What prosodic feature is observed when speakers are instructed to be ironic?
    "When instructed to be ironic, speakers show lower pitch."
    415
    Are there consistent pitch changes in naturally occurring irony?
    " In naturally occurring irony, there are often no significant differences in pitch."
    416
     How does speech rate change when speakers are instructed to be ironic?
    "There is no consistent change in speech rate; some studies show no lower speech rate while others show lower speech rate."
    417
    "What pitch change is observed in ""dripping sarcasm""?"
    " ""Dripping sarcasm"" is associated with a higher pitch."
    418
    "What is meant by ""contrast as a cue"" for irony?"
    "Contrast as a cue means that irony is signaled by a prosodic difference from the surrounding context, rather than by a specific level of prosody. For example, a statement might be ironic if its pitch or tone differs significantly from the usual pattern in that conversation."
    419
    What is satirical imitation in text-level irony?
    "
    Satirical imitation involves pretense and criticism. For example, Alec Baldwin speaks faster when imitating Donald Trump, who normally speaks slower than Baldwin's normal voice.

    "
    420
    How does speech rate differ when Baldwin imitates Trump?
    "Baldwin's speech rate is significantly faster when imitating Trump, even faster than his normal speech rate, despite Trump's normal speech rate being slower than Baldwin's."
    421
     Is there a significant difference in pitch spread between Baldwin and Baldwin imitating Trump?
    "No, there is no significant difference in pitch spread between Baldwin's normal speech and his imitation of Trump."
    422
    What type of voice is commonly used in satirical speech?
    "A dead-pan voice (lack of emotion/expression) is commonly used in satirical speech."
    423
    What non-verbal cues did Attardo et al. (2011) find in humorous utterances?
    " The study found differences mainly in smiling and laughter between participants."
    424
    What are non-verbal cues on stance?
    "Non-verbal cues on stance include epistemic cues like certainty and affective cues like liking."
    425
    Why are non-verbal cues necessary for recognizing irony without contextual cues?
    "Non-verbal cues are necessary to recognize irony because they help convey the speaker's true intent when the context is unclear. For example, ""I am very happy to take the train to Tilburg this week"" might rely on non-verbal cues to show irony."
    426
    What are some non-verbal cues that may indicate irony?
    "The presence of gestures and contrast in speech or behavior may signal irony."
    427
    What is linguistic copying behaviour?

    "It is when speakers copy linguistic forms (such as words) of their speaking partner."
    428
    "Name some other terms related to linguistic copying behaviour."
    "Mimicry, alignment, adaptation, and accommodation."
    429
    What features have shown copying behaviour?

    "Gestures, facial expressions, syntactic structures, and prosody."
    430
    "What are immediate forms of mimicry?"
    "Forms of mimicry that happen spontaneously and on the spot during interaction."
    431
    How are immediate forms of mimicry different from other types?

    "They are different from long-term mimicry (e.g., fashion) and stylised or conventionalised mimicry (e.g., greeting behaviour)."
    432
     Is the distinction between different forms of mimicry always clear?

    "No, the distinction between these different forms of mimicry may not always be straightforward."
    433
    "What do many models suggest about the link between perception and behavior in linguistic copying?"
    "They suggest a tight link between perception and behavior, where a speaker's words or syntactic structures are ""primed"" by those of their conversation partner."
    434
    "How is alignment viewed in many models?"
    " Alignment is viewed as a largely automatic (almost unconscious) process."
    435
    "What is a naive expectation of the alignment model?"
    "The naive expectation is that adaptation is symmetrical."
    436
    When might adaptation be asymmetrical?

    Adaptation might be asymmetrical in interactions between:
    • People with different hierarchical status
    • Parents and children
    • Native and non-native speakers
    437
    "What is a question regarding interactions between speakers of different language varieties?"
    "What about interactions between speakers of different language varieties?"
    438
    "What type of language is Dutch?"
    "Dutch is a West Germanic language."
    439
    "Where is Dutch the native language?"
    "It is the native language of most of the population of the Netherlands and about sixty percent of the population of Belgium (Flemish part) and former colonies."
    440
    How many people speak Dutch according to Wikipedia?

    "Dutch is spoken by about 22 million people."
    441
    "What are the regional variations of Dutch?"
    The regional variations considered here are Netherlandic Dutch (ND) and Belgian Dutch (BD) (also known as Flemish).

    442
    "What is the general expectation about adaptation between Flemish and Dutch speakers?"
    "Flemish speakers adapt more to Dutch, than the other way around."
    443
    "Why is it expected that Flemish adapt more to Dutch?"
    "
    • Dutch is a pluricentric language, but speakers consider the variant in Haarlem as the ""best"" one.
    • Diachronically, Flemish have adapted more to Dutch than the other way around.
    • Flemish have fewer problems understanding Dutch.
    • Regional dialects are stronger in Belgium, which may cause Flemish speakers to be more sensitive to language variation.
    "
    444
    What game variant is used to elicit spontaneous interactions?

    "A variant of the battle ship game."
    445
    "How is the game played in the interactive paradigm?"
    "The game is played via Skype connection and participants cannot see each other."
    446
    "Who participates in each game session of battlship?"
    "Each game is played between a Flemish and a Dutch participant."
    447
    What roles do participants take during the game battlship?
    "Participants take turns being the leader or follower."
    448
    What was the main effect of Nationality in the results of thee research with battleship between flemish and dutch persons.
    "lemish speakers adapted more to Dutch ones (33% vs 10%)."
    449
    What were the significant effects found in the experiment of battleships?

    "Significant effects of Whostarts (players who follow adapt more) and Round (more adaptation in round 2)."
    450
    "What was the significant 2-way interaction found in the expeerimeent with battleships?"
    "The interaction between Nationality and Whostarts showed that when a Dutch person starts the game, there is more adaptation by Flemish speakers than when a Flemish person starts."
    451
    "How was the adaptation process described between dutch and flemish players of battleships?"
    " It was described as a very spontaneous, unconscious process."
    452
    "Did participants recognize the nationality of the other participant when playing battleships?"
    "Yes, Flemish and Dutch speakers immediately recognized that the other participant was of a different nationality."
    453
    "What was unique about the icons chosen in Experiment 2 (adaption between flemish and dutch)"
    "Half of the icons were chosen because they could potentially lead to different pronunciations."
    454
    "What was the main effect of Nationality in the results in thee experiment of phonological adaption between dutch and flemish peersons?"
    "
    •  Flemish speakers adapted more to Dutch ones (10% vs 1%).
    • The degree of phonological adaptation was much smaller than lexical adaptation.
    • here was no boosting effect; the degree of lexical adaptation did not correlate with the degree of phonological adaptation.
    "
    455
    "What is a limitation of the study on linguistic copying behaviour?"
    "The study only looked at speakers of the Brabantian variant of dutch, so the situation could be different for Limburgian variants spoken on either side of the border."
    456
    "What question remains about adaptation in other pluricentric communities?"
    "What about adaptation in other pluricentric communities such as German, French, Italian, English, and Portuguese?"
    457
    What are conventionalised gestures?
    "
    "
    458
    "What is an important distinction in gesture types?"
    "Gestures without an intrinsic meaning (e.g., beat gestures) and gestures that visually depict something (iconic vs metaphoric use)."
    459
    "What are gestures without intrinsic meaning determined by?"
    "They are determined by the rhythm of speech."
    460
    What are the two types of gestures that visually depict something?

    " Iconic gestures (concrete depiction) and metaphoric gestures (abstract depiction)."
    461
    What is alignment in terms of behaviour?

    "Alignment is when people adapt their behaviour to that of the people with whom they are interacting."
    462
    "What nonverbal features do people mimic?"
    People mimic posture and bodily gestures (both conventionalized and spontaneous ones).

    463
    "Who provided evidence for gestural mimicry?"
    "Kimbara (2006, 2008) and Parrill and Kimbara (2006)."
    464
    "What is the director-matcher paradigm?"
    "
    "
    465
    "What did Mol et al. (2009, 2011, 2012) study?"
    "They provided insights into how speakers adapt their gestures to specific addressees."
    466
    what is meant with an adressee?

    "An addressee is the person or entity to whom speech or communication is directed. In other words, it is the listener or receiver of the message being conveyed by the speaker. "
    467
    "What factors must be considered to understand adaptive processes in gestural behaviour?"
    "
    • The kind of addressee
    • The addressee's perspective
    • The meaning of the gesture

    "
    468
    "What was the paradigm used to study the effect of the kind of addressee?"
    "

    "
    469
    "What was the result of the experiment on the effect of the kind of addressee?"
    ""
    470
    "What was the result of the experiment on the effect of the meaning of gestures?"
    "

    "
    471
    In an interactive setting, speakers tend to adapt their gestural behaviour,
    depending on
    ""
    472
    "What is a metaphor?"
    "It is a figure of speech, an implied comparison.

    Cambridge Dictionary: 
    An expression that describes a person or object by referring to something that is considered to have similar characteristics to that person or object.

    Oxford Dictionary: A word or phrase used to describe somebody/something else, in a way that is different from its normal use, to show that the two things have the same qualities and to make the description more powerful.
    "
    473
    What is a consequence of conceptual metaphors?
    "
    "
    474
    What is meant with target domain and source domain in metaphors?
    "



    summarized: target domain = abstract concept, easified with a more concrete concept  (source domain)"
    475
    "


    What are the the target domain and source domain her?"
    Target = climate change
    Source = icecream melts
    476
    What is something you can say about variability in metaphors?
    "

    "
    477
    Hoe gebruiken we ruimtelijke metaforen om tijd aan teegeven?
    "
    • In westerse culturen is de uitdrukking van de toekomst als volgt:
      • Op de verticale as: De toekomst wordt aan de rechterkant geplaatst.
      • Op de sagittale as: De toekomst wordt vooraan geplaatst.
    • Deze tijd-ruimte link is ook terug te vinden in de taal:
      • In het Engels: de toekomst ligt voor ons (""ahead""), en we kijken terug (""back"").
      • In het Nederlands: je kijkt vooruit (""vooruit""), en je blikt terug (""terug"").
    Dit betekent dat zowel in de Engelse als in de Nederlandse taal tijd vaak wordt beschreven in termen van beweging door de ruimte, waarbij de toekomst voor ons ligt en het verleden achter ons.
    "
    478
    What can be sais about enbodiment and the time-space connection?
    "

    "
    479
    How is the connection between time and space in mandarin and chinese languages?
    "

    which makes sense, because they read from above to belowd.
    "
    480
    How is the vertical gesturing by speakers of Mandarin affected by?
    ""
    481
    What was the research design when researching gesturing space in Mandarin?
    "


    a.k.a. people had to describe words... There were sentences with spatial connotations and without. Researched was if people used gestures."
    482
    What were the results of the research about gesturing space in Mandarin?
    "
    "
    483
    What did Stocker study in 2016, regarding eye gaze?
    "

    "
    484
    What were the results of the study of Stocker in 2016?
    "

    "
    485
    "What was the research design when researching Mandarin speakers from Rizhao Polytechnic's eye movements, and what were the conclusions?"
    "
    • Participants: 31 native Mandarin speakers from Rizhao Polytechnic.
    • Procedure:
      • Participants listened to 54 pairs of sentences, 18 of which contained temporal relations (past or future).
      • They sat in front of a computer screen, looking at an empty gray screen while listening to the sentences.
      • Occasionally, they answered true/false questions about the sentences.
      • Eye movements were recorded using a portable eye-tracker (eye-tribe).
    • Stimuli: Sentences included vertical spatial metaphors (e.g., ""last month,"" ""next month""), sagittal spatial metaphors (e.g., ""before,"" ""after""), and neutral temporal references (e.g., ""yesterday,"" ""tomorrow"").
    Conclusions:
    • Eye movements revealed differences in how participants conceptualized past and future.
    • Significant differences were found between Swiss German and Chinese participants.
    • Participants could not guess the true purpose of the study.
    • Linguistic material, especially vertical time words, had a noticeable early effect on eye movements.
    "
    486
    "How did the focus on past and future values differ among Moroccans, Chinese, and Spaniards?"
    Moroccans focused more on the past, Spaniards on the future, and Chinese participants showed a neutral or balanced focus.

    487
    "What did the object positioning task reveal about cultural differences?"
    Moroccans predominantly placed the past in front, while Spaniards and some Chinese groups placed the future in front.