- English-Corpora: COCA
[Davies] 1 1 billion word corpus of American English, 1990-2010 Compare to the BNC and ANC Large, balanced, up-to-date, and freely-available online
- Compare: Corpus of Contemporary American English (COCA) and the British . . .
In summary, while 100 million words is often adequate for studying syntax, for some very low-frequency phenomena, there is a real difference between 100 million words (BNC) and 560 million words (COCA)
- English Corpora: most widely used online corpora. Billions of words of . . .
The COCA corpus has more than 250 million words of data from the Web (divided almost evenly between blogs and other texts from the Web), and the texts have been categorized very well into different web genres by Serge Sharoff
- English Corpora: most widely used online corpora. Billions of words of . . .
Compare genres, dialects, time periods; use AI; search by PoS, collocates, synonyms, and much more
- The COCA corpus (new version released March 2020)
The Corpus of Contemporary American English (COCA) is by far the most widely-used of these corpora In early 2020, we dramatically expanded the scope and size and features of COCA to make it even more useful for researchers, teachers, and learners
- English Corpora: most widely used online corpora. Billions of words of . . .
The academic genre should have texts from a wide range of domains – like science, law, philosophy, humanities, history, education, etc (COCA is balanced between these domains )
- English Corpora: most widely used online corpora. Billions of words of . . .
By far, the most widely used corpus for language learning is COCA (the Corpus of Contemporary American English) COCA is the only corpus that is large, recent, and genre - balanced
- English-Corpora. org
由于COCA 是唯一具有体裁多样、库容大、语料新等特点的英语语料库,数以百计类似的英语句法变异方面的深度研究都是基于它进行的。
|