Week 7: dependency syntax Flashcards
What type of constituents are difficult for constituency syntax?
Discontinuous constituents
What does dependency syntax limit analysis to?
Relationships between words
What is the standard assumption of dependency syntax?
Each word has a single parent word
Reasons dependencies are easier for computational linguistics (three)
- Conceptually simple
- Good cross-linguistically
- Very little theoretical baggage
UD fundamentals 1:
UDs are …
…word-based, where words are whatever the tokeniser gives you
UD fundamentals 2: _______ are more important than ________
Content words are more important than function words
UD fundamentals 3:
In symmetric cases, …
…draw edges from left to right
UD fundamentals 4:
If a word is elided, …
…promote its child to the head position. If the result is “unnatural and misleading”, use the orphan relation.
What is the main predicate of the sentence labelled as?
root
What word class is normally the root?
Verb (not always though!)
What are non-projective trees?
Trees with crossing edges
Why are non-projective trees problematic? (Two reasons)
- Not enough training data - many treebanks were converted from earlier constituency treebanks without non-projective trees
- Some parsing algorithms cannot produce non-projective trees
How do you go from a constituent parse to dependencies?
- Replace the label of each constituent with its head word
- Attach all word in the constituent to the head word
How do you go from dependencies to constituents?
- Identify left and right boundaries for the subtree, add parentheses and choose a label
- Proceed recursively with each child
Limitations of dependencies (three)
- Constituency testing is impossible and competing analysis are harder to justify
- We cannot maintain binary branching - all tree nodes are now surface words so there can be no hidden nodes
- Dependency trees are unordered (equivalent under a permutation of indices) whereas constituents are naturally ordered.