Natural Language Processing Flashcards

Question 1

Q

tokenization

Answer

A

splitting a character sequence into individual tokens (usually words, sometimes punctuation); usually the first step of nlp analysis
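
a minimal sketch of the idea, assuming a simple regex rule is good enough (real tokenizers handle contractions, abbreviations, urls, etc.). the function name `tokenize` is made up for illustration.

```python
import re

def tokenize(text):
    # \w+ grabs a run of letters/digits/underscores as one token;
    # [^\w\s] grabs any single punctuation mark as its own token
    return re.findall(r"\w+|[^\w\s]", text)

print(tokenize("The cat sat."))  # ['The', 'cat', 'sat', '.']
```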

Question 2

Q

part-of-speech (pos) tagging

Answer

A

assigning a word type (noun/verb/etc.) to a token

Question 3

Q

dependency parsing

Answer

A

describing the grammatical structure of a sentence as relationships between tokens (subject/object/etc.)

ex. in ‘rainy weather’, weather is the head and rainy is its child (dependent)

Question 4

Q

lemmatization

Answer

A

reducing a word or token to its base form (lemma)

ex. base of was is be
ex. base of cats is cat
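
the two examples above as a toy lookup, only to show the input/output shape. the table `LEMMAS` and the function `lemmatize` are hypothetical; real lemmatizers rely on large dictionaries and part-of-speech information rather than a hand-written table.

```python
# toy lookup table (assumption: hand-picked entries for illustration only)
LEMMAS = {"was": "be", "were": "be", "is": "be", "cats": "cat"}

def lemmatize(token):
    word = token.lower()
    # fall back to the lowercased token when it is not in the table
    return LEMMAS.get(word, word)

print(lemmatize("was"))   # be
print(lemmatize("cats"))  # cat
```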

Question 5

Q

sentence boundary detection (SBD)

Answer

A

finding and segmenting individual sentences
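
a rough sketch, assuming sentences end in . ! or ? followed by whitespace and a capital letter. this rule breaks on abbreviations such as "Dr. Smith", which is why real systems use trained models or parse information. `split_sentences` is a made-up name.

```python
import re

def split_sentences(text):
    # split on whitespace that follows ., ! or ? and precedes an uppercase letter
    parts = re.split(r"(?<=[.!?])\s+(?=[A-Z])", text.strip())
    return [p for p in parts if p]

print(split_sentences("It rained. We stayed in! Why not?"))
# ['It rained.', 'We stayed in!', 'Why not?']
```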

Question 6

Q

named-entity recognition (NER)

named entity identification

entity chunking

entity extraction

Answer

A

locating named entities (e.g. Richmond) and classifying them into categories (e.g. city)
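
a toy version of the idea using a fixed lookup list (a gazetteer). the entries in `GAZETTEER` and the function `tag_entities` are hypothetical; real ner systems use statistical models and context, not just a word list.

```python
# hypothetical gazetteer: lowercase entity text -> category
GAZETTEER = {"richmond": "city", "virginia": "state"}

def tag_entities(tokens):
    # keep each token found in the gazetteer, paired with its category
    return [(t, GAZETTEER[t.lower()]) for t in tokens if t.lower() in GAZETTEER]

print(tag_entities(["I", "live", "in", "Richmond"]))  # [('Richmond', 'city')]
```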

Question 7

Q

information extraction

Answer

A

automatically extracting structured information from unstructured or semi-structured data

Question 8

Q

similarity

Answer

A

comparing documents to see how similar they are to each other
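
one common way to put a number on this is cosine similarity over word counts. a minimal sketch, assuming a bag-of-words representation (nlp libraries often use word vectors instead); `cosine_similarity` here is an illustrative helper, not a library call.

```python
import math
import re
from collections import Counter

def cosine_similarity(doc_a, doc_b):
    # bag-of-words counts for each document
    a = Counter(re.findall(r"\w+", doc_a.lower()))
    b = Counter(re.findall(r"\w+", doc_b.lower()))
    # dot product over the words the two documents share
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as similar to nothing
    return dot / (norm_a * norm_b)

score = cosine_similarity("the cat sat", "the cat ran")
```

the score is 1.0 for identical word counts and 0.0 when no words are shared.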

Question 9

Q

text classification

Answer

A

assigning categories or labels to whole documents

Question 10

Q

rule based matching

Answer

A

finding words and phrases in a document by matching rules against the tokens and their relationships
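
a small sketch of matching a rule against a token sequence, assuming the rule is a list of lowercase words where None stands for "any one token". `find_phrase` is a made-up helper; library matchers also support rules on part of speech, lemma, and other token attributes.

```python
def find_phrase(tokens, pattern):
    """Return the start indexes where the token sequence matches the pattern."""
    hits = []
    n = len(pattern)
    for i in range(len(tokens) - n + 1):
        window = tokens[i:i + n]
        # None is a wildcard; otherwise compare case-insensitively
        if all(p is None or p == t.lower() for p, t in zip(pattern, window)):
            hits.append(i)
    return hits

tokens = "I love rainy weather and sunny weather".split()
print(find_phrase(tokens, [None, "weather"]))  # [2, 5]
```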

Question 11

Q