lecture 10 Flashcards
predicate
usually the verb or verb phrase that expresses the action or state
(in dictionary form)
thematic role: agent
volitional causer of an event
thematic role: experiencer
experiencer of an event
thematic role: force
non-volitional causer of the event
thematic role: theme
participant most directly affected by the event
thematic role: result
the end product of an event
thematic role: content
the proposition or content of a propositional event
thematic role: instrument
an instrument used in an event
thematic role: beneficiary
the beneficiary of an event
thematic role: source
the origin of the object of a transfer event
thematic role: goal
the destination of the object of a transfer event
idiom
expressions whose meanings are not predictable from the meanings of their individual words
- noncompositional
- means they usually cannot be translated word-for-word into another language
- literal translations often fail to capture the intended meaning, so accurate translation of idioms requires understanding cultural and contextual nuances
IBM models 1-5
- series of word-based statistical models that are induced from parallel data (alignment probability distributions)
- data-driven
- laid groundwork for modern statistical machine translation
phrase-based statistical machine translation (SMT)
- unlike word based models that translate words in isolation, phrase-based SMT considers contiguous sequences of words/phrases
- improved translation significantly over earlier word-based models
- handle phrases and idioms better, capture linguistic context better
neural machine translation
- quickly became the state of the art
- relies on deep learning models, specifically neural networks, to perform translations
- encoder-decoder architecture
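a minimal sketch of the encoder-decoder idea, not the lecture's actual model: the PyTorch framework, GRU layers, and all vocabulary/dimension sizes below are illustrative assumptions. the encoder compresses the source sentence into a hidden state, and the decoder predicts target words conditioned on it.

```python
# minimal encoder-decoder sketch in PyTorch; sizes and GRU choice are illustrative
import torch
import torch.nn as nn

class Encoder(nn.Module):
    def __init__(self, src_vocab, emb_dim=64, hidden_dim=128):
        super().__init__()
        self.embed = nn.Embedding(src_vocab, emb_dim)
        self.rnn = nn.GRU(emb_dim, hidden_dim, batch_first=True)

    def forward(self, src_ids):
        # the final hidden state summarises the whole source sentence
        _, hidden = self.rnn(self.embed(src_ids))
        return hidden

class Decoder(nn.Module):
    def __init__(self, tgt_vocab, emb_dim=64, hidden_dim=128):
        super().__init__()
        self.embed = nn.Embedding(tgt_vocab, emb_dim)
        self.rnn = nn.GRU(emb_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, tgt_vocab)

    def forward(self, tgt_ids, hidden):
        # conditioned on the encoder state, score the next target word at each step
        output, hidden = self.rnn(self.embed(tgt_ids), hidden)
        return self.out(output), hidden

encoder, decoder = Encoder(src_vocab=1000), Decoder(tgt_vocab=1200)
src = torch.randint(0, 1000, (2, 7))          # a batch of 2 source sentences of length 7
logits, _ = decoder(torch.randint(0, 1200, (2, 5)), encoder(src))
print(logits.shape)                            # torch.Size([2, 5, 1200]): one distribution over target words per step
```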
central problem of machine translation
language divergence: structural differences in word order between languages
why is machine translation difficult
- ambiguity
–> same word can have multiple meanings
–> same meaning can be described by multiple words/word forms
- word order
–> underlying deeper syntactic structure
–> computationally intensive
- morphological richness
–> identifying the basic units of words (morphemes)
correspondences
- one-to-one: simple sentence translation maintaining word order and meaning
- one-to-many (and reordering): single words in one language may require multiple words in another, and may need reordering
- many-to-one (and elision): multiple words in one language combine to form a single word in another
- many-to-many: entire phrases or idiomatic expressions may need to be translated into completely different phrases in another language
lexical divergences: lexical specificity
a word in one language has multiple specific translations in another language
–> brother = gege (older) or didi (younger)
lexical divergences: homonyms and polysemous words
the different senses of homonymous words generally have different translations
–> (river) bank = ufer
–> (money) bank = bank
the different senses of polysemous word may also have different translations
–> i know that he bought the book, i know peter, i know math
–> sais que, connais, m’y connais en
lexical divergences: morphological differences
different languages exhibit varied inflections and morpheme structures
–> new = nouveau/nouvelle
lexical divergences
- homonymous words
- polysemous words
- lexical specificity
- morphological divergences
syntactic divergences
- word order
- head-marking vs dependent-marking
- pro-drop languages
- negation
syntactic divergences: word order
- word order can be fixed or free
- languages with a fixed word order have sentences that follow a specific structure (e.g., SVO)
syntactic divergences: head-marking vs dependent-marking
- head-marking languages: grammatical relationships are indicated on the head of a phrase
–> the man house-his
- dependent-marking languages: grammatical relationships are indicated on the dependents of a phrase
–> the man’s house
syntactic divergences: pro-drop languages
- these languages can omit pronouns
–> e.g., spanish: i eat = como
syntactic divergences: negation
negation operates differently across languages
semantic differences
- aspect
- motion events
semantic differences: aspect
conveying current actions
- progressive aspect: swimming
- expression with an adverb: schwimmt gerade
semantic differences: motion events
have two properties
1. manner of motion (swimming)
2. direction of motion (across the lake)
languages either express the manner with a verb and the direction with a ‘satellite’ or vice versa
why model translation with a probabilistic model
- we would like to have a measure of confidence for the translations we learn
- we would like to model uncertainty in translation
model
a simplified and idealized understanding of a physical process
translation explained with the Noisy Channel Model
- general framework for many NLP problems
- generate target sentence
- a channel corrupts the target
- source sentence is a corruption of the target sentence
–> translation is then the process of recovering the original signal (e) given the corrupted signal (f)
–> by Bayes' rule, P(e|f) ∝ P(e) * P(f|e), so the best translation is ê = argmax_e P(e) * P(f|e) (see the sketch below)
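a toy sketch of noisy-channel decoding; the candidate sentences and probabilities are invented for illustration. the language model P(e) scores fluency, the translation model P(f|e) scores fidelity, and decoding picks the candidate that maximizes their product.

```python
# toy noisy-channel decoding; candidates and probabilities are invented
candidates = {
    "the house is small": {"p_e": 0.20, "p_f_given_e": 0.30},   # fluent and faithful
    "small the house is": {"p_e": 0.01, "p_f_given_e": 0.35},   # faithful but not fluent
}

# pick e maximising P(e) * P(f|e): fluency and fidelity are modelled separately
best = max(candidates, key=lambda e: candidates[e]["p_e"] * candidates[e]["p_f_given_e"])
print(best)   # "the house is small": the language model penalises the unnatural word order
```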
why use the noisy channel model
- makes it easier to mathematically represent translation and learn probabilities
- fidelity (accuracy of content) and fluency (naturalness of language) can be modeled separately
word alignment
to learn sentence translation probabilities, we first need to learn word-level translation probabilities
- start with parallel sentence pair
–> a sentence in one language paired with its translation in another language
- since there are multiple possible alignments, we try to find multiple sentence pairs
–> multiple possible word alignments
- key idea: look at the co-occurrence of translated words. words that occur together in the parallel sentences are likely to be translations
- calculate P(f|e)
–> probability of a word in language 1 (f) given another word (e)
problem with word alignment
we can only find the best alignment if we know the word translation probabilities
–> this is a chicken and egg problem
solution to word alignment problem
iterative process: Expectation-Maximization (EM) algorithm
- estimate alignment probabilities using the current word translation probabilities
- re-estimate word translation probabilities from those alignments
- since we don't know the best alignment initially, we consider all possible alignments when estimating the word translation probabilities and weight each of them by its alignment probability
- the new P(f|e) is computed as the ratio of the expected number of times the pair (f, e) occurs to the expected number of times any word pairs with e (see the sketch below)
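a minimal sketch of this EM loop in the style of IBM Model 1; the tiny German-English corpus and the fixed 10 iterations are illustrative assumptions.

```python
# minimal IBM Model 1-style EM sketch; corpus and iteration count are illustrative
from collections import defaultdict

corpus = [
    (["das", "haus"], ["the", "house"]),
    (["das", "buch"], ["the", "book"]),
    (["ein", "buch"], ["a", "book"]),
]

# uniform initialisation of the word translation probabilities t(f|e)
f_vocab = {f for fs, _ in corpus for f in fs}
t = defaultdict(lambda: 1.0 / len(f_vocab))

for _ in range(10):
    count = defaultdict(float)   # expected counts of the pair (f, e)
    total = defaultdict(float)   # expected counts of e pairing with any word
    for fs, es in corpus:
        for f in fs:
            # E-step: weigh every possible alignment of f by its probability
            norm = sum(t[(f, e)] for e in es)
            for e in es:
                delta = t[(f, e)] / norm
                count[(f, e)] += delta
                total[e] += delta
    # M-step: t(f|e) = expected count(f, e) / expected count(e)
    for (f, e) in count:
        t[(f, e)] = count[(f, e)] / total[e]

print(round(t[("haus", "house")], 2))   # increases towards 1.0 as EM converges
```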
phrase-based SMT
use phrases (sequence of words) as the basic translation unit
–> Instead of aligning single words between the source and target languages, we align entire phrases.
benefits of phrase-based SMT
- local reordering: PB-SMT allows intra-phrase re-ordering, meaning that within a single phrase or sequence of words, the order can be adjusted and memorized to better match the target language’s structure
–> the ordering of words is adapted to fit the syntactic rules of the other language
- sense disambiguation: PB-SMT uses the context provided by neighboring words within a phrase to disambiguate meaning
- handling institutionalized expressions: idioms can be learned as a single unit
- improved fluency: incorporating entire phrases, which can be of any length, enhances the natural flow of translations.
learning the phrase translation model
- learn the phrase table (central data structure in PB-SMT)
- learn the phrase translation probabilities
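a sketch of estimating phrase translation probabilities by relative frequency, assuming phrase pairs have already been extracted from word-aligned data; the phrase pairs below are invented.

```python
# relative-frequency estimation of phrase translation probabilities; pairs are invented
from collections import Counter

extracted = [
    ("das haus", "the house"), ("das haus", "the house"),
    ("dieses haus", "the house"), ("das haus", "the home"),
    ("ein buch", "a book"),
]

pair_counts = Counter(extracted)                  # count(f, e)
e_counts = Counter(e for _, e in extracted)       # count(e)

# one phrase-table entry per extracted pair: P(f|e) = count(f, e) / count(e)
phrase_table = {(f, e): c / e_counts[e] for (f, e), c in pair_counts.items()}
print(phrase_table[("das haus", "the house")])    # 2/3, since "the house" was also aligned to "dieses haus"
```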
SMT pipeline
- word alignment
- phrase extraction + distortion modelling + feature extraction + language modelling
- tuning
- decoder
getting word order right in PB-SMT
preprocessing the input by changing the order of words in the input sentence to match the order of the words in the target language
- parse the sentence to understand its syntactic structure
- apply rules to transform the tree
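a toy pre-ordering sketch; the hard-coded parse stands in for the output of a real syntactic parser, and the single SVO-to-SOV rule is an illustrative assumption.

```python
# toy pre-ordering: the parse is hard-coded; a real system would run a syntactic parser
clause = {"subject": ["the", "man"], "verb": ["bought"], "object": ["a", "book"]}

# transformation rule: move the verb after the object (SVO -> SOV)
reordered = clause["subject"] + clause["object"] + clause["verb"]
print(" ".join(reordered))   # "the man a book bought", matching an SOV target language
```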
addressing rich morphology
- break words into their component morphemes
- learn translations for the morphemes
transliteration
handling names and OOVs (out of vocabulary words)
evaluation of MT output
with respect to
1. adequacy: how well the output preserves the content of the source text
2. fluency: how well-formed the output is in the target language
types:
1. human evaluation
2. automatic evaluation
3. BLEU: compares n-grams of the candidate translation with those of one or more reference translations
4. TER: measures the number of edits required to turn the candidate into a reference translation
5. METEOR: matches candidate and reference words using exact forms, stems, and synonyms
precision
#words in candidate that are in ref / #words in candidate
(with repetition)
modified precision
#words in candidate that are in ref / #words in candidate
clip the number of matching words to their maximum count in the reference sentence
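a quick sketch of plain vs modified (clipped) unigram precision on the classic over-generated-"the" example; candidate and reference are standard illustrative data, not from the lecture.

```python
# plain vs clipped unigram precision; candidate/reference are illustrative data
from collections import Counter

candidate = "the the the the the the the".split()
reference = "the cat is on the mat".split()

cand_counts = Counter(candidate)
ref_counts = Counter(reference)

# plain precision: every "the" counts as a match -> 7/7 = 1.0
precision = sum(c for w, c in cand_counts.items() if w in ref_counts) / len(candidate)

# modified precision: clip each word's count to its maximum count in the reference -> 2/7
clipped = sum(min(c, ref_counts[w]) for w, c in cand_counts.items() if w in ref_counts)
modified_precision = clipped / len(candidate)

print(precision, round(modified_precision, 2))   # 1.0 0.29
```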
recall
cannot be used for PB-SMT
greedy decoding
selects the word with the highest probability
–> risks running into local optima
sampling decoding
randomly selecting the next word based on the probability distribution
–> introduces randomness, potentially capturing more diverse translations but at the risk of inconsistencies
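a small sketch contrasting the two decoding strategies for a single step; the probability distribution over next words is made up.

```python
# greedy vs sampling for one decoding step; the distribution is made up
import random

next_word_probs = {"house": 0.6, "home": 0.3, "building": 0.1}

# greedy: always pick the most probable word (deterministic, can hit local optima)
greedy_choice = max(next_word_probs, key=next_word_probs.get)

# sampling: draw the next word according to the distribution (more diverse, less consistent)
words, probs = zip(*next_word_probs.items())
sampled_choice = random.choices(words, weights=probs, k=1)[0]

print(greedy_choice, sampled_choice)   # e.g. "house house" or "house home"
```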