- Corpora Incorporation? - Pleco Software Forums
Wondering if you guys ever considered incorporating any corpora info? For instance: LIVAC Synchronous Corpus - esp Chinese speech communities spread data, i e : which areas use what words Sketchengine - collocations, word combinations, synonyms, concordance features, etc
- www. plecoforums. com
most_common_n_number_of_corpus_words = 40000 # Limit selection of corpus words to the # the first n most common words from the corpus (all from BCC corpus)
- Bigrams sorted by frequency with pinyin English?
The Beijing Language and Culture University created a balanced corpus of 15 billion characters It’s based on news (人民日报 1946-2018,人民日报海外版 2000-2018), literature (books by 472 authors, including a significant portion of non-Chinese writers), non-fiction books, blog and weibo entries as well as
- Flashcards for TOCFL (2023), CCCC, TBCL - Pleco Software Forums
I've parsed out vocabulary from these taiwanese tests and converted to flashcards in pleco's format Useful e g for seeing term levels, intended part of speech and sometimes definitions examples TOCFL vocab was updated some couple years ago and I haven't yet seen a processed version of the
- Pleco for Palm OS Windows Mobile | Page 4 | Pleco Software Forums
PlecoDict 1 0 Discussion of the previous version of our Chinese dictionary flashcard software, now discontinued on Windows Mobile but still available on Palm OS
- bigrams | Pleco Software Forums
Bigrams sorted by frequency with pinyin English? I'm searching for a list of Mandarin Bigrams sorted by frequency This is just for general study, so I'm not too concerned about what corpus the frequency is derived from, as long as it is relatively modern News, popular culture, etc are all fine Ideally, I'd like to have pinyin and English DavidMars Thread Jun 21, 2023 bigrams frequency
- 79,000 Chinese-English, French, German, Italian, Japanese, and Spanish . . .
We could try just averaging the HSK values of the HSK words and then blending the result with a score from the BCC corpus for all words all non-HSK words Perhaps there is even some Digital Humanities paper about assessing the difficulty of Chinese sentences that we could learn a thing or two from
- frequency list | Pleco Software Forums
I'm searching for a list of Mandarin Bigrams sorted by frequency This is just for general study, so I'm not too concerned about what corpus the frequency is derived from, as long as it is relatively modern News, popular culture, etc are all fine Ideally, I'd like to have pinyin and English
|