LING1109 - module key concepts wk 4-5 Flashcards
Corpus linguistics brings a set..
of methods to the empirical study of language
Corpus linguists work with…
datasets too large to be read and analysed by individual researchers
Corpus linguistics uses techniques like…
frequency lists and concordance lines to study study patterns in corpora
A corpus is best used
to answer a research question it was designed to answer
Corpora can be
written, spoken or signed
The “corpus driven” approach examines..
what is frequent or salient in the data
The “corpus based” approaches..
involves bring a hypothesis to the data and testing it out.
Monitor corpora develops
dataset that grows over time and that contains a variety of materials
Sample corpora seek to
represent a particular type of language over a specific period of time
Corpus linguists use..
transparent, replicable methods, to avoid selecting only examples from their data that support their hypothesis
Corpora can be
monolingual, bilingual or multilingual
A parallel corpus contains..
texts in a native language (L1), and their translations (L2)
A comparable corpus
is a corpus based on the same sampling frame as another corpus
Corpus linguistics uses large collections of..
machine readable texts to search for patterns in discourse
Collocates are words
that typically accompany a “node” word (that is the word that you are interested in studying)