CHUNKING Flashcards

1
Q

What is the purpose of Chunking

A

Moving from individual words to meaningful groups “chunks”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a noun phrase NP

A

(or nominal phrase)
performs the same grammatical function as a noun
“my favourite book”
“a big blue whale”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a verb phrase VP

A

a syntactic unit composed of at
least one verb and its dependents
“read a good novel”
“we will visit the park tomorrow”
“she is writing a letter’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a Determiner

A

often a noun phrase can start with one. They can be:
- Simple lexical items: the, this, a, an…
- Simple possessives: John’s
- Complex recursive versions: John’s sister’s husbund’s

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a Nominal

A

Contains the head and any pre and post modifiers of the head. Follows the determiner

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

VP/NP in context-free grammars

A

terminals : words
non-terminals : constituents like VP NP or sentence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are pre-modifiers

A
  • Quanitifiers, cardinals, ordinals: eg three cars
  • Adjectives: eg large cars
  • There is an ordering: three large cars (not large three)
    ‘Cars’ here is the head
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the head in a NP

A

Main component that carries the primary meaning of the phrase
Usually a noun, pronoun, or word functioning as a noun

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are post-modifiers

A
  • Prepositional phrases: eg from Seattle
  • Non-finite clauses: eg arriving before noon
  • Relative clauses: eg that serve breakfast
  • Nominal → Nominal PP
    Nominal → Nominal GerundVP
    Nominal → Nominal RelClause
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Agreement

A

constraints that hold among various
constituents that take part in a rule or set of rules

eg determiners and the head nouns in NPs have to agree in their number (Not “This flights”)

A way to deal with this is to add further rules in CFGs for NPs to deal with this:
- SingularNP → SingularDet SingularNom
- PluralNP → PluralDet PluralNom

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the constituents of VPs

A

English VPs consist of a head verb along with 0 or more following constituents which we call arguments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is subcategorisation

A

We can subcategorise verbs according to the sets of VP rules they can take part in
We have to formally express these constraints

However for CFGs, this does not scale well, explodes the number of rules

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are Treebanks

A

Corpora in which each sentence has been paired with a parse tree

Instead of paying linguists to write a grammar, pay them to annotate real sentences with parse trees

Then use the annotated data to learn the rules

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are probabilistic CFGs

A

Where each production rule has a probability
Using MLE:
We use the treebank to count how many times a rule A -> a is enacted
divide by the number of times a rule with LHS A is enacted
to get P(a|A)

Each rule A → a is assigned a probability p(a|A)
the sum over all expansions of A must equal 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Probabilistic CFGs for Chunking

A

Chunking can now be done probabilistically using these rewriting rules

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

When calculating a sentence probaility

A
  • Assume productions for each node are chosen independently
  • Probability of derivation is the product of the probabilities of its productions

Essentially follow the path and multiply the probabilities of each branch taken