A corpus is a database of spoken or written language.
The words in a corpus can be collected from a variety of
sources.
For example, words in a written corpus may come from newspapers,
magazines, books, or the internet, while words in a spoken corpus
may come from everyday conversations.
By analyzing a corpus with computer software, we can find out which
words, expressions, and phrases are most commonly used in a
language.
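
As a rough illustration of this kind of analysis, the sketch below counts word frequencies in a tiny made-up corpus. The function name most_common_words and the three sample sentences are assumptions invented for this example; a real corpus would contain millions of words, and dedicated corpus software handles tokenization far more carefully than this simple pattern does.

```python
import re
from collections import Counter

def most_common_words(texts, n=3):
    """Return the n most frequent words across an iterable of text samples.

    Hypothetical helper for illustration only, not part of any corpus tool.
    """
    counts = Counter()
    for text in texts:
        # Lowercase the text and treat runs of letters/apostrophes as words.
        counts.update(re.findall(r"[a-z']+", text.lower()))
    return counts.most_common(n)

# A toy "written corpus" (invented sentences, not real corpus data).
sample_corpus = [
    "The cat sat on the mat.",
    "The dog chased the cat.",
    "A cat and a dog can be friends.",
]

top_words = most_common_words(sample_corpus, n=2)
print(top_words)  # [('the', 4), ('cat', 3)]
```

The same counting idea extends to expressions and phrases by counting sequences of adjacent words instead of single words.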