Summary Flashcards

Question

SIGSALY (Project X or Green Hornet)

Answer 1

based on Vocoder - needed for encryption (white noise stored on 2 vinyl phonographic records) - special turntables to synchronize time

Answer 2

process of splicing together pieces together of pre-recorded speech

Answer 3

process of changing a pre-recorded signal to produce a desired sound

Answer 4

A : ability to produce natural-sounding speech : flexibility in creating new words : speed of production D : lack of control over the sound of the speech : its susceptibility to error : inability to produce continuous speech

Answer 5

A : producing greater degreee of control over the sound of input D : more computationally intensive : more difficult to create new words or phrases with its technique

Answer 6

1. Lack of invariance problem - phonetic environment - differing speech conditions (tempo) - speaker variation (dialects) 2. perceptual constancy and normalization - ability recognize and interpret speech sounds regardless the context - map signals to independent category 3. speech segmentation problem - difficult to identify and segment individual speech sounds

Answer 7

generated by explicit model - articulatory synthesis --> using physiological models that stimulate movement vocal tract and articulators. - source-filter models --> two components combined, a source (vocal folds) with a filter (vocal tract) - formant synthesizers --> digital synthesizers that use combination of source-filter and pre-recorded vocal sample to generate realistic sounding speech

Answer 8

--> neuroprosthetic device that bypasses the normal acoustic hearing process by electric stimulation of auditory nerve

Answer 9

first --> source waveform is generated by explicit model second --> source waveform is generated by data third --> source waveform is learned from the data

Answer 10

tradeoff between processing speed and memory - model based - sample based

Answer 11

input is Mel frequency cepstral coefficients - divide signal in frames of 20-40 ms - mel filter bank (determine filter bank energies) - log transform - compute discrete cosine transform (DCT)

Answer 12

- Generating speech using data base of pre-recorded speech samples and selecting most appropriate units of speech form the data base ++ more natural speech -- less generalizable and more recordings needed

Answer 13

- Generating speech using data base of pre-recorded speech samples and selecting most appropriate units of speech form the data base ++ more natural speech -- less generalizable and more recordings needed

Answer 14

the sound between two adjacent phones, combined to form words

Answer 15

A : automatically train so avoid hand written rules : high quality synthesis and compact D : speech has to be generated by parametric model, final quality is dependent on parameter-to speech technique used

Answer 16

1. people with visual impairments to listen to text 2. listening to text during driving 3. travel information in public transport

Answer 17

- text analysis * identify tokens * tokenizing (split in smaller chunks) * normalization (determine spoken variant of each token) - linguistic analysis * phonemes * prosodic information (intonation, duration, stress, rhythm) - waveform generation (1,2,3)

Answer 18

a collection of texts with some unifying characteristics

Answer 19

sequence of characters that define a search pattern in strings of text such as words, phrases and numbers

Answer 20

- applicative (develop nlp tools) - analytical (empirical basis on the distribution of constructions and language phenomena)

Answer 21

- normalizing text (standard form) - tokenization (splice words) - lemmatization (find similar roots) - stemming (make simpler to roots) - sentence segmentation (breaking a sentence) - compare words and strings

Answer 22

- multiple languages (code switching) - genre (source of the text) - demographic characteristics writer - language changes over time

Answer 23

motivation situation language variety collection process annotation process distribution

Answer 24

1. tokenizing - token learner - token segmenter 2. normalizing word formats - case folding (lower case) - lemmatization - morphological parsing - stemming 3. segmenting sentences

Answer 25

phones --> same sound, different spelling graphs --> same spelling, different sound

Answer 26

synonymy, antonymy, hypernymy/hyponymy, meronymy/holonymy, co-hyponyms

Answer 27

house - villa same sense, different word

Answer 28

good - bad tegenstelling

Answer 29

"dog" is a hyponym of the word "animal" because animal is less specific

Answer 30

fingers is meronym of hand because it is a part of the hand hand is the homonymy of fingers because it is the whole

Answer 31

fingers is meronym of hand because it is a part of the hand hand is the homonymy of fingers because it is the whole

Answer 32

cat and dog are co-hyponyms because both a type of word animal

Answer 33

cup and coffee because belong to same semantic field

Answer 34

positive (happy) negative (sad) connotation pos (great). neg (terrible) evaluation

Answer 35

1 valence (neg of pos ) 2 arousal (excited or not) 3 dominance (control or not)

Answer 36

positive or negative evaluation language

Answer 37

tf-idf and word2vec

Answer 38

measure the importance of a term in a document relative to other documents in a corpus

Answer 39

methods used to represent words in a vector space in order to capture semantic and syntactic relationships between words

Answer 40

measure of similarity between two vectors, which is calculated by taking the cosine of the angle between the vectors

Answer 41

see if a word appears more often with a word than expected

Answer 42

two methods used to represent words in a vector space - CBOW is method used to predict a set of context words given a target word - Skipgram is a method used to predict a target word given a set of context words

Answer 43

first-order co-occurrence (wrote and book) if they are nearby second-order co-occurrence (wrote and said) if they have similar neighbors

Answer 44

1. SO polarity 2. PN polarity 3. strength of PN polarity 4. extracting opinions

Answer 45

big in size mixed language full texts different domains and genres range of text categories well documented

Answer 46

1 mode (written, spoken, mixed...) 2 representativeness (balanced, specialized) 3 time (diachronic, synchronic) 4 language (mono, multi, parallel, comparable) 5 sampling (full documents, sample) 6 mark up (raw annotated)

Summary Flashcards

(70 cards)