Historical Corpus Linguistics Flashcards
Corpus
Collection of machine-readable, digitalized texts, spoken/ written data on defined principles, naturally occurring speech, balanced, representative
Types
Plain text corpora, part-of-speech tagged corpora, parsed corpora
Corpus linguistic studies
Intersubjectively verifiable, frequency based, context sensitive, computer assisted
Intensifiers in Late Modern English assumptions
Major changes have happened, very is the most frequent, women lead
Intensifiers in Late Modern English method
The Old Bailey Corpus (1720-1923), meta information (year, gender, class), adverb + general adjective
Intensifiers in Late Modern English
Very is most popular (66%), perfectly/ absolutely established, women and higher classes lead, but men and higher social classes introduce
Corpora (size)
Standard vs. mega
1st generation vs. 2nd generation
Corpora (context)
Monolingual vs. bilingual/ multilingual
Native vs. non-native
Synchronic vs. diachronic
General vs. specialised
Corpora (design)
Static vs. dynamic
Plain text vs. annotated
Sample text vs. full text
Corpora vs. collections/ archives