companydirectorylist.com  Global Business Directories and Company Directories
Search Business,Company,Industry :


Country Lists
USA Company Directories
Canada Business Lists
Australia Business Directories
France Company Lists
Italy Company Lists
Spain Company Directories
Switzerland Business Lists
Austria Company Directories
Belgium Business Directories
Hong Kong Company Lists
China Business Lists
Taiwan Company Lists
United Arab Emirates Company Directories


Industry Catalogs
USA Industry Directories














  • Word frequency list based on a 15 billion character corpus: BCC (BLCU . . .
    The Beijing Language and Culture University created a balanced corpus of 15 billion characters It’s based on news (人民日报 1946-2018,人民日报海外版 2000-2018), literature (books by 472 authors, including a significant portion of non-Chinese writers), non-fiction books, blog and weibo entries as well as
  • Common Idioms; A Collection by Grade [HSK old HSK 中考 高考 . . . ]
    The corpus is much larger than the CCL (470 million characters), the CNC (100 million characters), the SUBTLEX-CH (47 million characters) and the LCMC (less than 2 million characters) It seems as if the frequency lists derived from this corpus might be the most reliable frequency lists currently available
  • Media-related vocabulary gathering project - Pleco Software Forums
    With a small corpus of 650 articles from People's Daily, downloaded using a Python script, I hope to start providing a more modern frequency list of media-related vocabulary The frequency list has the following features: It uses all sections of the 人民日报 People's Daily newspaper, including the sports section
  • Flashcards for TOCFL (2023), CCCC, TBCL - Pleco Software Forums
    I've parsed out vocabulary from these taiwanese tests and converted to flashcards in pleco's format Useful e g for seeing term levels, intended part of speech and sometimes definitions examples TOCFL vocab was updated some couple years ago and I haven't yet seen a processed version of the
  • Bigrams sorted by frequency with pinyin English?
    The Beijing Language and Culture University created a balanced corpus of 15 billion characters It’s based on news (人民日报 1946-2018,人民日报海外版 2000-2018), literature (books by 472 authors, including a significant portion of non-Chinese writers), non-fiction books, blog and weibo entries as well as
  • Sentences flashcards generator (Python script) - Pleco Software Forums
    The Beijing Language and Culture University created a balanced corpus of 15 billion characters It’s based on news (人民日报 1946-2018,人民日报海外版 2000-2018), literature (books by 472 authors, including a significant portion of non-Chinese writers), non-fiction books, blog and weibo entries as well as
  • Integrating BCC Corpus Data into Dictionary - Pleco Software Forums
    The BCC corpus seems to have pretty loose licensing terms Pleco already seems to be using frequency data to sort the search results Adding them meaningfully to dictionary definitions would be even better, I believe That is something which printed dictionaries can’t do
  • www. plecoforums. com
    most_common_n_number_of_corpus_words = 40000 # Limit selection of corpus words to the # the first n most common words from the corpus (all from BCC corpus)




Business Directories,Company Directories
Business Directories,Company Directories copyright ©2005-2012 
disclaimer