Quantitative Linguistics Flashcards
1
Q
What is a language corpus?
A
- A systematic collection of naturally occurring texts (written or spoken), stored on a computer.
- Its structure and content follow extralinguistic principles, i.e. selection criteria external to the language itself, such as genre or time period.
2
Q
What is meant by co-occurrence?
A
The appearance of words together in a sequence more often than would be expected by chance.
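One way to make "more often than would be expected by chance" concrete is to compare the observed count of a word pair with the count expected if the two words were independent. The following is a minimal sketch under that independence assumption; the function name observed_vs_expected and the toy token list are made up for illustration and do not come from the flashcards.

```python
from collections import Counter


def observed_vs_expected(tokens, w1, w2):
    """Return (observed, expected) counts for the adjacent pair (w1, w2).

    The expected count assumes w1 and w2 occur independently of each other.
    """
    n_pairs = len(tokens) - 1  # number of adjacent positions in the text
    if n_pairs < 1:
        raise ValueError("need at least two tokens")
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    observed = bigrams[(w1, w2)]
    # Under independence, P(w1, w2) = P(w1) * P(w2).
    p_w1 = unigrams[w1] / len(tokens)
    p_w2 = unigrams[w2] / len(tokens)
    expected = p_w1 * p_w2 * n_pairs
    return observed, expected


# Toy, made-up token list used purely for illustration.
tokens = "she drinks strong tea and he drinks strong tea with milk".split()
obs, exp = observed_vs_expected(tokens, "strong", "tea")
print(f"observed={obs}, expected={exp:.2f}")
```

If the observed count is clearly larger than the expected one, the pair co-occurs in the sense of this card. Real corpora are far larger, and a significance test is normally used instead of a bare comparison.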
3
Q
What is collocation?
A
A specific combination of words with strong co-occurrence, e.g. strong tea, powerful wizard.
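The strength of a collocation can be quantified with an association measure. The sketch below uses pointwise mutual information (PMI), one common choice; the flashcards do not name a measure, so PMI, the function name pmi and the toy sentence are assumptions made for illustration.

```python
import math
from collections import Counter


def pmi(tokens, w1, w2):
    """Pointwise mutual information (base 2) of the adjacent pair (w1, w2).

    Returns -inf when the pair never occurs in the token list.
    """
    n = len(tokens)
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    pair_count = bigrams[(w1, w2)]
    if pair_count == 0:
        return float("-inf")
    p_pair = pair_count / (n - 1)  # relative frequency of the pair
    p_w1 = unigrams[w1] / n        # relative frequency of each word
    p_w2 = unigrams[w2] / n
    return math.log2(p_pair / (p_w1 * p_w2))


# Toy, made-up token list used purely for illustration.
toy = ("strong tea is nice . the powerful wizard likes strong tea . "
       "the wizard is powerful").split()
score_strong_tea = pmi(toy, "strong", "tea")
score_powerful_tea = pmi(toy, "powerful", "tea")
print(score_strong_tea, score_powerful_tea)
```

A positive score means the pair appears more often than independence would predict. PMI is known to overrate rare pairs, so on real data it is usually combined with a minimum frequency threshold.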