This repo contains reduplications of those using Chinese characters (a.k.a. Hanzi) languages, such as Mandarin, Minnan (a.k.a. Southern Min), Hakka and Jyut (a.k.a. Yue / Cantonese). These data can be used for linguistic analysis and NLP research.
本字典收集使用漢字的漢語、閩南語、客語、粵語(廣東話)的疊字詞,以供語言學分析及自然語言處理(NLP)研究。
Reduplications are a special formed type in east asian cultural sphere (a.k.a. chinese character sphere). It can be use in chengyu or adjective modifier. For instance, 紅 (âng)
in Minnan means red, 紅紅 (âng-âng)
means so red, and 紅紅紅 (âng-âng-âng)
means very red.
AA
:相鄰的疊字,例如「天天」、「打打球」、「水汪汪」、「楚楚可憐」等,皆屬此類型。ABA
:間隔一個,例如「白雲白」、「不明不白」、「以牙還牙」等,皆屬此類型。ABCA
:間隔兩個,例如「為所欲為」、「亞美尼亞」等,皆屬此類型。- and so on...
Most of raw data refer to dictionaries and IME wordbooks.
多數的原始資料來源為字典以及輸入法詞庫。
- 《常用疊字分類詞典》,賴慶雄、李炳傑著,螢火蟲出版社(2019),ISBN: 9789864521753
- Rime IME Wordbook