Week 7: dependency syntax Flashcards

Question 1

Q

What type of constituents are difficult for constituency syntax?

Answer

A

Discontinuous constituents

Question 2

Q

What does dependency syntax limit analysis to?

Answer

A

Relationships between words

Question 3

Q

What is the standard assumption of dependency syntax?

Answer

A

Each word has a single parent word

Question 4

Q

Reasons dependencies are easier for computational linguistics (three)

Answer

A

Conceptually simple
Good cross-linguistically
Very little theoretical baggage

Question 5

Q

UD fundamentals 1:
UDs are …

Answer

A

…word-based, where words are whatever the tokeniser gives you

Question 6

Q

UD fundamentals 2: _______ are more important than ________

Answer

A

Content words are more important than function words

Question 7

Q

UD fundamentals 3:
In symmetric cases, …

Answer

A

…draw edges from left to right

Question 8

Q

UD fundamentals 4:
If a word is elided, …

Answer

A

…promote its child to the head position. If the result is “unnatural and misleading”, use the orphan relation.

Question 9

Q

What is the main predicate of the sentence labelled as?

Question 10

Q

What word class is normally the root?

Answer

A

Verb (not always though!)

Question 11

Q

What are non-projective trees?

Answer

A

Trees with crossing edges

Question 12

Q

Why are non-projective trees problematic? (Two reasons)

Answer

A

Not enough training data - many treebanks were converted from earlier constituency treebanks without non-projective trees
Some parsing algorithms cannot produce non-projective trees

Question 13

Q

How do you go from a constituent parse to dependencies?

Answer

A

Replace the label of each constituent with its head word
Attach all word in the constituent to the head word

Question 14

Q

How do you go from dependencies to constituents?

Answer

A

Identify left and right boundaries for the subtree, add parentheses and choose a label
Proceed recursively with each child

Question 15

Q

Limitations of dependencies (three)

Answer

A

Constituency testing is impossible and competing analysis are harder to justify
We cannot maintain binary branching - all tree nodes are now surface words so there can be no hidden nodes
Dependency trees are unordered (equivalent under a permutation of indices) whereas constituents are naturally ordered.

Question 16

Q

UD specific problems (three)

Answer

A

Tokenisation approaches are not uniform
Guidelines are not uniform
Annotation quality is not held to the same standard everywhere

Question 17

Q