English-Corpora: COHA COHA contains more than 475 million words of text from the 1820s-2010s (which makes it 50-100 times as large as other comparable historical corpora of English), and the corpus is balanced by genre decade by decade.
CCOHA: Clean Corpus of Historical American English This paper describes methods applied to the downloadable version of the COHA corpus in order to overcome its main limitations, such as inconsistent lemmas and malformed tokens, without compromising its qualitative and distributional properties.
English-Corpora.org User Guide
9. Corpus of Historical American English (COHA): 400 million words, American, 1810-2009, balanced
10. The TV Corpus: 325 million words, 6 countries, 1950-2018, TV shows
11. The Movie Corpus: 200 million words, 6 countries, 1930-2018, movies
12. Corpus of US Supreme Court Opinions: 130 million words, American, 1790s-present, legal opinions
13. Corpus of American Soap Operas
English Corpora: most widely used online corpora. Billions of words of . . . These are the most widely used online corpora, and they serve many different purposes for teachers and researchers at universities throughout the world. In addition, the corpus data (e.g. full-text, word frequency) has been employed by a wide range of companies in many different fields, especially technology and language learning. The links below are for the free online interface.
English Corpora: most widely used online corpora. Billions of words of . . . The corpus is balanced by genre across the decades. For example, fiction + TV/Movies (which are scripted, similar to fiction) accounts for 54-57% of the total in each decade (1820s-2000s), and the corpus is balanced across decades for sub-genres and domains as well (e.g. by Library of Congress classification for non-fiction, and by sub-genre for fiction: prose, poetry, drama, etc.).
The Corpus of Historical American English (COHA), With COHA (and also Google Books (E-C Advanced), to a lesser extent) we can easily find the collocates of gay decade by decade, and we can also directly compare the collocates in different sets of decades (e.g. gay in the 1830s-1910s vs the 1970s-2000s).
English-Corpora.org: a guided tour (see video) following charts from COHA (400 million words, 1810-2009) shows steamship by decade, and Reds by decade and even by year (note 1953, the year of the McCarthy hearings in the US Senate). As the search for a most ADJ NOUN
English-Corpora: COCA The Corpus of Contemporary American English (COCA) was created by Mark Davies, and it is the only large and "balanced" corpus of American English. COCA is probably the most widely-used corpus of English, and it is related to other corpora from English-Corpora.org, which offer unparalleled insight into variation in English.
English-Corpora: COHA 400-million-word corpus of historical American English, 1810-2000. The largest corpus of historical American English.