Linguistics Flashcards
Morphemes vs. Syllables
- Syllables can be seen as the phonological building blocks of words
Speaker → “speak” + “er” (Morphemes) vs “speak” + “ker”(syllables
Free morphemes can be used as words, bound morphemes cannot
Words
- Every word is composed of one or more morphemes
- In some languages, characters represent complete words(or syllables)
Morphological normalization
- Identification of a single canonical representative for morphologically related wordforms
- Used in NLP to identify different forms of the same word
Normalization methods
- Stemming(Derivation): The text analysis that identifies the stem of a token(”derive” - > “deriv”, “am” →”am”)
- Lemmatization(Inflection): The text analysis that identifies the lemma of a token(”derive”→”derive”, “am”→”be”)
Syntax
- The syntax of a language is defined by a grammar
- the structural relationships between words, usually within a sentences
Two types of word classes
- Open(lexical words): Theoretically, infinitely many members per class(Nouns, Adjectives, Adverbs, Main Verbs(see, played) and digital numbers(3233)
- Closed(functional words): Number of members is fixed in principle(Modal verbs (can, had), determiners(the, some), conjunctions(add, or), pronouns(its, their), prepositions, particles(off, up) and interjections(Ow, Eh)
Structural Relationships
- Part of speech. The class of a word is decided by its syntactic context(is on the boundary between morphology and syntax)
Abstract classes: Noun, verb, adjective, adverb, preposition, pronoun, conjunction, interjection, determiner
POS tagging: the text analysis that assigns a part-of-speech tag to each token
Clauses
- Clauses: Grammatical units that express complete propositions
Clauses are the smallest grammatical unit that can express a complete preposition
Two basic types of clauses:
- Main Clauses: Independent, can stand alone as a sentence: I remained dry
- Subordiante clause: is reliant on a main clause and thus depends on it: (Usually stars with a subordinating conjunction) Although it rained, “because I was inside the building”
Grammars
a description of the valid structures of a language, is defined by a set of rules
Syntactic parsing
(aka full parsing): the text analysis that determines the grammatical structure of a sentence with respect to a given grammar(Types: constituency parsing and dependency parsing
Phrase vs Dependency grammar
Phrase grammar: Models the constituents of a sentence and how they are composed of other constituents and words(Constituency tree, inner nodes are non-terminal, leaf are terminal)
Dependency grammar: models the dependencies between the words in a sentence(Dependency tree, all nodes are terminal, the root is nearly always the main verb(of the first main clause)
Semantics
syntactic ambiguity: arises when a piece of text has more than one valid syntactic structure
### the meaning of single words and compositions of words
Meaning:
- Propositional content in terms of validity or truth conditions
- Often requires common-sense reasoning based on world knowledge(Max can open Tim’s safe, he knows the combination)
- Includes expressed emotional content(That poor cat, Fortunately, Max can ope
Linguistic Form vs context of use
- Meaning implied by linguistic form
- Context-independent meaning, such as “its raining”
- What a speaker publicly commits to, such as “It is wet outside”
- A speaker’s private state, such as “You should take an umbrella”
- Meaning dependent on context of use
- Scope of quantifiers, such as “Every student reads some book”
- Word sense ambiguities, such as “I’m making it”
- Semantic relations between noun in compounds, such as “play book”
- Meaning dependent on non-lunguistic perception
- Time, such as “now”, tomorrow,..
- Location, such as “here”, “there”, “That’s a beautiful city”
- Speaker and hearer such as “I”, “You”,..
Lexical Semantics
- Word senses: Distinctions in meaning between different uses of the same form
- Polysemy: related word senses with the same lexical entry(newspaper(physical object vs abstract content))
- Homonymy: Unrelated word senses that have the same lexical entry(Bank ( river bank vs money bank))
- Semantic roles: Number of arguments of a predicate, specific relationship the arguments bear to the predicate
- The roles the arguments of a predicate have in the state or activity captured by the predicate
- Different predicates have different semantic roles: She saw Max, She kissed Max, She ressembled Max
Compositional semantics
The meaning of word compositions in phrases, clauses and sentences
- Relations:
- Semantic: Relations between entities from the world
- Temporal: Relations between describing courses of events
- Linguistic Operators:
- Quantifiers: Indicating quiantities of objects, such as “Every man read”
- Hedges: Limiting the impact of propositions, such as “Probably every”
- Negation: inverting adjectives/predicates, such as “Probably not every”
- Semantic relations:
- Word compositions that capture relational predicates with arguments
- Typically: who did what to whom, where, when, how and why?
- Common relations types:
- Binary relations: Relations with two arguments
- Events: Relations with multiple arguments, possibly nested relations
- Connotation: What word choice conveys beyond truth-conditional semantics, such as “Insightful results”vs “interesting results”
Between Lexical and compositional Semantics
- Multi-word expression: A lexical larger than a word that can bear both compositional and idiomatic meanings(driving instructor, vice versa)
- On the boundary between lexical an compositional semantics(Long time no see, Kick the bucket)
- Word n-grams: An alternative to identifying multi-word expression to simply use word bigrams, trigrams, or similar instead
Entity: An entity represents an object from the real world
- Named entities: objects that can be denoted with a proper name: Names like In Hannover or at Leibniz University Hannover
- Numeric entities: Values, quantities, proportions, ranges or similar. “in this year”, “2023-03-04”, $10 000, etc.