Exam Preparation Deck Flashcards
Define the eight features used for pronoun resolution. State the extraction method if the feature is hard to get.
- Cataphoric: If the pronoun occurs before the candidate antecedent.
- Number agreement: If the pronoun and candidate antecedent agree in number. Number can be determined by a morphological processor.
- Gender agreement: If genders are compatible. This may require a named entity classifier.
- Same verb: If the pair shares the same verb. Can be determined by a syntactic parser.
- Sentence distance: the number of sentences between the pronoun and the candidate antecedent.
- Grammatical role of the antecedent. Subject/object/other. Can be found by a syntactic parser.
- Parallel: If the pair shares the same grammatical role.
- Form: Form of the antecedent (proper name, definite, indefinite, or pronoun).
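A minimal sketch of how these eight features might be assembled for one pronoun/candidate pair, assuming the mentions already carry parser, morphology, and named-entity output; the Mention fields and function below are illustrative, not from the notes:

```python
from dataclasses import dataclass

@dataclass
class Mention:
    # Hypothetical pre-computed attributes (from a parser, morphological
    # processor and NE classifier); the field names are illustrative only.
    index: int       # token position in the document
    sentence: int    # sentence number
    number: str      # "sg" or "pl"
    gender: str      # "m", "f", "n" or "unknown"
    verb: str        # lemma of the governing verb
    role: str        # "subject", "object" or "other"
    form: str        # "proper", "definite", "indefinite" or "pronoun"

def features(pronoun: Mention, antecedent: Mention) -> dict:
    """Assemble the eight features for one pronoun/candidate pair."""
    return {
        "cataphoric": pronoun.index < antecedent.index,
        "number_agreement": pronoun.number == antecedent.number,
        "gender_agreement": antecedent.gender in (pronoun.gender, "unknown"),
        "same_verb": pronoun.verb == antecedent.verb,
        "sentence_distance": pronoun.sentence - antecedent.sentence,
        "role": antecedent.role,
        "parallel": pronoun.role == antecedent.role,
        "form": antecedent.form,
    }
```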
What is a baseline?
What is a ceiling?
A baseline is a score given by a relatively simple approach which is used as a standard against which the approach under investigation is compared.
A ceiling is the maximum performance that could be expected, generally the agreement achieved between two or more humans performing the task.
Why might a discourse model be used over a Naive Bayes model in resolving pronouns?
A Naive Bayes classifier may not produce a globally consistent answer.
In the example given, it is quite likely that the classifier would propose that ‘he’ and ‘it’ both refer to Burns.
In a discourse model, fixing a binding provides information about the bound pronoun, which gives global consistency. The model also supports a ‘repeated mention’ heuristic, something that is impossible for a single-pass classifier.
Define morphological ambiguity. Give an example.
Words that can be decomposed into different sets of morphemes.
For example, unionised can be seen as un-ion-ise-ed, or union-ise-ed.
Define lexical ambiguity. Give an example.
Arises when a word has multiple senses.
For example, the word ‘duck’ could be a verb (an action) or a noun (an animal).
Define syntactic/structural ambiguity. Give an example.
Multiple ways of bracketing an expression.
He ate the pizza with a fork.
The prepositional phrase ‘with a fork’ can attach to the verb ‘ate’ (the fork is the instrument) or to ‘the pizza’ (the fork is part of the pizza).
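A small sketch of the two bracketings as nested structures; the labels and tuple encoding are illustrative, not course notation:

```python
# Two bracketings of "He ate the pizza with a fork" as nested tuples.
# VP attachment: the fork is the instrument of eating.
vp_attach = ("S", "He",
             ("VP", ("VP", "ate", ("NP", "the pizza")),
                    ("PP", "with a fork")))
# NP attachment: the pizza has a fork on it.
np_attach = ("S", "He",
             ("VP", "ate",
                    ("NP", ("NP", "the pizza"),
                           ("PP", "with a fork"))))
```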
Define discourse relation ambiguity. Give an example.
Implicit relationship between sentences.
Max fell. John pushed him.
- Narration: Max fell and then John pushed him.
- Explanation: Max fell because John pushed him.
Describe the packing algorithm. What is it good for?
Packing is an optimization of chart parsing: multiple derivations of the same possible phrase are recorded in the same edge.
This works because rule application is not sensitive to the internal structure of an edge.
It can be proven that the algorithm runs in cubic time, and it stops the number of entries in the chart from growing exponentially. However, unpacking all the derivations can still take exponential time.
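A minimal packed CKY-style sketch, using a toy binary grammar and lexicon invented for illustration: each chart cell keeps one edge per category, and every derivation of that category over the span is packed into it.

```python
from collections import defaultdict

# Toy binary grammar: (left daughter, right daughter) -> set of mother categories.
grammar = {("NP", "VP"): {"S"}, ("V", "NP"): {"VP"}, ("Det", "N"): {"NP"}}
lexicon = {"the": {"Det"}, "dog": {"N"}, "cat": {"N"}, "saw": {"V"}}

def parse(words):
    n = len(words)
    # chart[i][j] maps a category to ONE packed edge: a list of derivations,
    # each derivation recording how the span (i, j) was split into daughters.
    chart = [[defaultdict(list) for _ in range(n + 1)] for _ in range(n + 1)]
    for i, w in enumerate(words):
        for cat in lexicon[w]:
            chart[i][i + 1][cat].append(w)
    for width in range(2, n + 1):
        for i in range(n - width + 1):
            j = i + width
            for k in range(i + 1, j):
                for b in chart[i][k]:
                    for c in chart[k][j]:
                        for a in grammar.get((b, c), ()):
                            # Packing: every (b, c, k) analysis is appended to
                            # the SAME edge for category a over the span (i, j),
                            # so the number of edges stays polynomial.
                            chart[i][j][a].append(((b, i, k), (c, k, j)))
    return chart

chart = parse("the dog saw the cat".split())
print(chart[0][5]["S"])  # the packed derivations of S over the whole string
```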
Given a string of words, how do we compute the most likely tags?
Treat tagging as a hidden Markov model: choose the tag sequence t_1 ... t_n that maximises the product over i of P(t_i | t_{i-1}) P(w_i | t_i). The Viterbi (dynamic programming) algorithm finds this sequence efficiently rather than enumerating every possible tag sequence.
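A minimal Viterbi sketch for bigram HMM tagging; the tagset, transition and emission probabilities are toy values invented for illustration:

```python
import math

# Toy tagset, transition and emission probabilities (invented values).
tags = ["N", "V"]
trans = {("<s>", "N"): 0.7, ("<s>", "V"): 0.3,
         ("N", "N"): 0.3, ("N", "V"): 0.7,
         ("V", "N"): 0.6, ("V", "V"): 0.4}
emit = {("N", "they"): 0.4, ("V", "they"): 0.01,
        ("N", "fish"): 0.3, ("V", "fish"): 0.2}

def viterbi(words):
    # best[t] = (log probability, tag sequence) of the best path ending in t.
    best = {t: (math.log(trans[("<s>", t)] * emit.get((t, words[0]), 1e-6)), [t])
            for t in tags}
    for w in words[1:]:
        new_best = {}
        for t in tags:
            # Choose the best previous tag for extending the path with (t, w).
            score, seq = max(
                (best[p][0] + math.log(trans[(p, t)] * emit.get((t, w), 1e-6)),
                 best[p][1])
                for p in tags)
            new_best[t] = (score, seq + [t])
        best = new_best
    return max(best.values())[1]

print(viterbi("they fish".split()))  # -> ['N', 'V']
```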
Define:
- Hyponymy
- Meronymy
- Synonymy
- Antonymy
- Hyponymy: More specific meaning of a general term. Dog is a hyponym of animal.
- Meronymy: Part-of relation. Arm is a meronym of body.
- Synonymy: Same meaning. Policeman and cop are synonyms.
- Antonymy: Opposite meaning. Big and little are antonyms.
Describe Yarowsky’s minimally-supervised learning approach to word sense disambiguation.
1. Find all examples of the target word in the corpus.
2. Manually identify seeds that disambiguate some of the uses.
3. Train a decision list classifier on the Sense A/B examples, ranking features by log-likelihood ratio.
4. Apply the classifier to the remaining untagged examples and add the reliably classified ones to the training data.
5. Iterate steps 3 and 4 until convergence.
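A minimal decision-list sketch in the spirit of this approach, with context-word features and an invented toy data set; the smoothing constant and rule format are assumptions:

```python
import math
from collections import Counter

# Toy labelled seed examples: each is (set of context-word features, sense).
labelled = [({"river", "water"}, "A"), ({"bank", "money"}, "B"),
            ({"water", "fishing"}, "A"), ({"money", "loan"}, "B")]

def train_decision_list(examples, alpha=0.1):
    counts = {"A": Counter(), "B": Counter()}
    for feats, sense in examples:
        counts[sense].update(feats)
    rules = []
    for f in set(counts["A"]) | set(counts["B"]):
        # Rank each feature by the (smoothed) log-likelihood ratio of the senses.
        llr = math.log((counts["A"][f] + alpha) / (counts["B"][f] + alpha))
        rules.append((abs(llr), f, "A" if llr > 0 else "B"))
    return sorted(rules, reverse=True)    # strongest evidence first

def classify(rules, feats, default="A"):
    for _, f, sense in rules:
        if f in feats:
            return sense                  # the first matching rule decides
    return default

rules = train_decision_list(labelled)
print(classify(rules, {"loan", "fishing"}))
```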
What can we do to avoid P(w_n | w_{n-1}) bigrams being zero?
- Smoothing: distribute ‘extra’ probability between rare and unseen events.
- Backoff: approximate unseen probabilities by a more general probability, e.g. unigram probabilities.
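A minimal sketch of both ideas, using add-one (Laplace) smoothing and a simple back-off to unigrams; the particular schemes and the toy corpus are illustrative, not necessarily those in the notes:

```python
from collections import Counter

# Toy corpus; real estimates would come from a much larger corpus.
tokens = "the cat sat on the mat the cat ate".split()
unigrams = Counter(tokens)
bigrams = Counter(zip(tokens, tokens[1:]))
V = len(unigrams)        # vocabulary size
N = len(tokens)

def p_add_one(w, prev):
    # Add-one (Laplace) smoothing: every possible bigram gets one extra count.
    return (bigrams[(prev, w)] + 1) / (unigrams[prev] + V)

def p_backoff(w, prev, alpha=0.4):
    # Simple back-off: use the bigram estimate if the bigram was seen,
    # otherwise fall back to a discounted unigram estimate.
    if bigrams[(prev, w)] > 0:
        return bigrams[(prev, w)] / unigrams[prev]
    return alpha * unigrams[w] / N

print(p_add_one("mat", "cat"), p_backoff("mat", "cat"))
```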
Define four notions of context.
- Word windows (not filtered): n words on either side of the lexical item.
- Word windows (filtered): n words on either side, removing functional words and very frequent content words.
- Lexeme windows: Use stems instead of words.
- Dependencies: Directed links between heads and dependents. The context of an item is the dependency structure it belongs to.
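A minimal sketch of unfiltered versus filtered word windows; the stop-word list below is an illustrative stand-in for "functional words and very frequent content words":

```python
# Stop-word list standing in for functional words and very frequent content words.
STOP = {"the", "a", "of", "on", "and"}

def window(tokens, i, n=2, filtered=False):
    # n words on either side of the item at position i.
    ctx = tokens[max(0, i - n):i] + tokens[i + 1:i + 1 + n]
    return [w for w in ctx if w not in STOP] if filtered else ctx

tokens = "the dog chased the cat across the garden".split()
print(window(tokens, 2))                  # ['the', 'dog', 'the', 'cat']
print(window(tokens, 2, filtered=True))   # ['dog', 'cat']
```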
Define three ways to weigh context.
- Binary model: Set value of dimension c to 1 if context c co-occurs with word w.
- Basic frequency model: Count number of times c co-occurs with word w instead.
- Point-wise mutual information (PMI): weight context c by how much more often it co-occurs with word w than would be expected by chance, PMI(w, c) = log( P(w, c) / (P(w) P(c)) ).
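A minimal PMI sketch over invented co-occurrence counts, showing how an informative context scores higher than a very frequent one:

```python
import math
from collections import Counter

# Toy (word, context) co-occurrence counts, invented for illustration.
cooc = Counter({("dog", "bark"): 10, ("dog", "the"): 50,
                ("cat", "purr"): 8, ("cat", "the"): 40})
total = sum(cooc.values())
w_count, c_count = Counter(), Counter()
for (w, c), n in cooc.items():
    w_count[w] += n
    c_count[c] += n

def pmi(w, c):
    p_wc = cooc[(w, c)] / total
    return math.log2(p_wc / ((w_count[w] / total) * (c_count[c] / total)))

# Informative contexts get high PMI; very frequent ones like "the" get low PMI.
print(pmi("dog", "bark"), pmi("dog", "the"))
```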
How do we combine visual and text words?
- Feature level fusion: Concatenate text and visual vectors. Reduce dimension by SVD or NMF.
- Scoring level fusion: Estimate similarity separately for text and visual vectors, then take the mean of the two scores.
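A minimal sketch of both fusion strategies for a single word pair, using random vectors as stand-ins for real text and visual representations:

```python
import numpy as np

rng = np.random.default_rng(0)
text = {w: rng.random(50) for w in ("dog", "cat")}    # stand-in text vectors
visual = {w: rng.random(20) for w in ("dog", "cat")}  # stand-in visual vectors

def cos(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Feature-level fusion: concatenate the two vectors (optionally followed by
# SVD/NMF over the whole matrix to reduce dimensionality).
fused = {w: np.concatenate([text[w], visual[w]]) for w in text}
feature_level = cos(fused["dog"], fused["cat"])

# Scoring-level fusion: compute similarity in each space and take the mean.
scoring_level = (cos(text["dog"], text["cat"]) +
                 cos(visual["dog"], visual["cat"])) / 2

print(feature_level, scoring_level)
```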