- English-Corpora: COCA
[Davies] 1.1 billion word corpus of American English, 1990-2010. Compare to the BNC and ANC. Large, balanced, up-to-date, and freely available online.
- English Corpora: most widely used online corpora. Billions of words of . . .
The COCA corpus has more than 250 million words of data from the Web (divided almost evenly between blogs and other web texts), and the texts have been categorized very well into different web genres by Serge Sharoff.
Compare genres, dialects, time periods; use AI; search by PoS, collocates, synonyms, and much more.
By far the most widely used corpus for language learning is COCA (the Corpus of Contemporary American English). COCA is the only corpus that is large, recent, and genre-balanced.
The academic genre should have texts from a wide range of domains, such as science, law, philosophy, humanities, history, education, etc. (COCA is balanced between these domains.)
- Compare: Corpus of Contemporary American English (COCA) and the British . . .
In summary, while 100 million words is often adequate for studying syntax, for some very low-frequency phenomena, there is a real difference between 100 million words (BNC) and 560 million words (COCA).
- English Corpora: most widely used online corpora. Billions of words of . . .
The corpora from English-Corpora.org are the most widely used corpora in the world, and COCA is by far the most widely used of the 17 corpora at the site. Hundreds of thousands of researchers, teachers, and students have found the data from COCA to be more reliable and useful than that of any other corpus.
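
The corpus-size comparison above (100 million words for the BNC versus 560 million for COCA) can be made concrete with a little arithmetic. The sketch below is only an illustration: the function name `expected_hits` and the rate of 0.05 occurrences per million words are hypothetical, chosen to stand in for a "very low-frequency" phenomenon, and the calculation assumes the phenomenon occurs at the same rate in both corpora.

```python
def expected_hits(rate_per_million: float, corpus_size_words: int) -> float:
    """Expected number of occurrences, assuming a constant rate across the corpus."""
    return rate_per_million * corpus_size_words / 1_000_000


# Corpus sizes as quoted in the snippet above.
BNC_WORDS = 100_000_000
COCA_WORDS = 560_000_000

# Hypothetical rate for a rare construction (an assumption, not measured data).
RARE_RATE_PER_MILLION = 0.05

bnc_hits = expected_hits(RARE_RATE_PER_MILLION, BNC_WORDS)
coca_hits = expected_hits(RARE_RATE_PER_MILLION, COCA_WORDS)

print(f"Expected hits in a 100-million-word corpus: {bnc_hits:.1f}")
print(f"Expected hits in a 560-million-word corpus: {coca_hits:.1f}")
print(f"Ratio: {coca_hits / bnc_hits:.1f}x")
```

Under these assumptions the larger corpus yields 5.6 times as many examples, which can be the difference between a handful of tokens and enough data to see a pattern.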