Text Mining Flashcards
Was sind Bag-of-Tokens Approaches?
Zählen der Wörter in einem Text
Was ist das Problem von Bag-of-Tokens Approaches?
Looses all order-specific information!
Reduces context information.
Was ist Syntax?
ordering of words and its possible effect on meaning
Was ist Semantik?
concerns the (literal) meaning of words, phrases, and sentences
Was sind Pragmatics?
concerns the overall communicative and social
context and its effect on interpretation
Wie kommt man von Flat Text
zu Struktur und Bedeutung?
Beschreib Word-level ambiguity
Beschreib Semantics and Anaphora resolution
Beschreib Syntactic ambiguity
Beschreib Presupposition and pragmatic inferences
Was ist Syntactic Parsing?
Produces the correct syntactic parse tree for a sentence
How many syntactic interpretations does a sentence ending in n prepositional phrases have?
over 2^n
Was ist eine kontextfreie Grammatik?
Was ist Probabilistic Structure Parsing?
Was ist Shallow Natural Language Processing?
Was ist Morphology?
the field of linguistics that studies the
internal structure of words
Was ist ein Morpheme?
the smallest linguistic unit that has
semantic meaning
Was ist Morphological Analysis?
What is Part-of-Speech (POS) Tagging?
Was ist Phrase Chunking?
Was ist Semantic Role Labeling?
Was ist Semantic Information Extraction (IE)?
Wobei hilft Shallow NLP?
e. g.:
* Question Answering
* Text Summarization
Was ist der Unterschied zwischen Informations Retrieval und Information Extraction?
Information Retrieval Models
Beschreib das Boolean Model