Natural Language Processing Flashcards

1
Q

tokenization

A

separating each instance (usually a relevant word) in a given character sequence usually the first part of nlp analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

part-of-speech (pos) tagging

A

assigning a word type (noun/verb/etc.) to a token

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

dependency parsing

A

process of describing the relationship between tokens (subject/object/etc.), the grammatical structure of a sentence

‘rainy weather’ : weather is head, rainy is child, dependent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

lemmatization

A

extracting the base forms of a word or token

ex. base of was is be
ex. base of cats is cat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

sentence boundary detection (SBD)

A

finding and segmenting individual sentences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

named-entity-recognition (NER)

named entity identification

entity chunking

entity extraction

A

location and classify named entities (richmond) into a category (city)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

information extraction

A

automatically extracting structured information from unstructured or semi-structured data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

similarity

A

comparing documents to see how similar they are to each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

text classification

A

assigning categorizes or labels to whole documents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

rule based matching

A

find words and phases in a document, as well as the tokens and their relationships

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly