Ambiguities in Chinese character simplification
A relatively small number of Chinese characters, known as 简繁一对多 in simplified Chinese (簡繁一對多 in traditional Chinese; literally "simplified-traditional one-to-many"), do not have a one-to-one mapping between their simplified and traditional forms. This is because the simplification process merged two or more distinct characters into one. In most cases the merged characters are homonyms: they share a pronunciation but have completely unrelated meanings.
As a result, converting text from simplified to traditional characters is difficult to automate, especially for common characters such as 后 (後 "behind" or 后 "empress"), 表 (表 "table" or 錶 "clock") and 奸 (奸 "traitor" or 姦 "rape").
Far less common are traditional characters that map to two simplified characters.
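The difficulty can be illustrated with a minimal Python sketch. It assumes a small hand-written table (`ONE_TO_MANY`, a hypothetical name holding only a few entries taken from this article) and lists the candidate traditional forms for each character instead of choosing one. It is not a real converter, only a demonstration of why a per-character lookup is insufficient.

```python
# Hypothetical sample table: simplified character -> possible traditional forms.
# Only a handful of entries from this article are included for illustration.
ONE_TO_MANY = {
    "后": ["後", "后"],
    "表": ["表", "錶"],
    "奸": ["奸", "姦"],
    "发": ["發", "髮"],
}


def traditional_candidates(text, one_to_one=None):
    """Return, for each character, the list of possible traditional forms.

    `one_to_one` is an optional dict of unambiguous mappings (an assumption
    of this sketch); characters found in neither table are passed through.
    """
    one_to_one = one_to_one or {}
    result = []
    for ch in text:
        if ch in ONE_TO_MANY:
            result.append(list(ONE_TO_MANY[ch]))
        else:
            result.append([one_to_one.get(ch, ch)])
    return result


def ambiguous_positions(text):
    """Return (index, character) pairs that need context to convert."""
    return [(i, ch) for i, ch in enumerate(text) if ch in ONE_TO_MANY]
```

For the input `"皇后"` the sketch reports two candidates for the second character. Picking 后 (as in 皇后, "empress") rather than 後 (as in 後面, "behind") requires word-level context that a character table alone does not carry.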
The following is an exhaustive list of characters whose simplified and traditional forms do not map one-to-one. In the "1 to N" groups, each entry gives the simplified character first, followed by its traditional forms; in the "2 to 1" group, the traditional character comes first, followed by its simplified forms.
1 to 2
板板闆 杯杯盃 辟辟闢 表表錶 别別彆 卜卜蔔 布布佈 才才纔 彩彩綵 参參蔘 冲沖衝 虫虫蟲
丑丑醜 仇仇讎 出出齣 村村邨 粗粗麤 酬酬醻 当當噹 党黨党 淀澱淀 吊弔吊 冬冬鼕 发發髮
范范範 丰豐丰 谷谷穀 雇雇僱 刮刮颳 广廣广 哄哄鬨 后後后 伙夥伙 获獲穫 几幾几 机機机
饥飢饑 迹跡蹟 奸奸姦 姜姜薑 借借藉 尽盡儘 据據据 卷捲卷 克克剋 困困睏 夸夸誇 罗羅囉
累累纍 厘厘釐 漓漓灕 梁梁樑 了了瞭 霉霉黴 弥彌瀰 蔑蔑衊 么么麼 麽麽麼 苹蘋苹 仆僕仆
铺鋪舖 朴朴樸 签簽籤 确確确 舍舍捨 沈沈瀋 胜勝胜 术術朮 松松鬆 他他祂 叹嘆歎 坛壇罈
你你妳 体體体 同同衕 涂涂塗 团團糰 喂喂餵 为為爲 纤纖縴 咸鹹咸 弦弦絃 绣綉繡 须須鬚
熏熏燻 腌醃腌 叶葉叶 佣傭佣 涌湧涌 游游遊 于於于 余余餘 吁籲吁 郁郁鬱 欲欲慾 御御禦
愿願愿 岳岳嶽 云雲云 赞贊讚 脏臟髒 扎扎紮 占占佔 折折摺 征征徵 证證証 志志誌 制制製
致致緻 钟鍾鐘 种種种 周周週 注註注 准準准 冢塚冢 庄庄莊 涩澀澁 蚕蠶蚕 忏懺忏 吨噸吨
赶趕赶 构構构 柜櫃柜 怀懷怀 坏壞坏 极極极 茧繭茧 家家傢 价價价 洁潔洁 惊驚惊 腊臘腊
蜡蠟蜡 帘簾帘 怜憐怜 岭嶺岭 扑撲扑 秋秋鞦 千千韆 扰擾扰 洒灑洒 晒曬晒 适適适
听聽听 洼窪洼 旋旋鏇 踊踴踊 优優优 症症癥 朱朱硃 荐薦荐 离離离 卤鹵滷 气氣气
圣聖圣 万萬万 与與与 摆擺襬 虮蟣虮 篱籬篱 泞濘泞 恶惡噁 托托託 咽嚥咽
1 to 3
曲曲麯麴 升升昇陞 苏蘇囌甦 系系係繫 尝嘗甞嚐 胡胡鬍衚 划畫劃划 回回迴囘 汇匯彙滙 里里裡裏
历歷曆厤 袅裊嫋嬝 线線綫 向向嚮曏 只只隻衹 它它牠祂
1 to 4
并并並併竝 采采埰寀採 厂厂庵厰廠 干乾干幹榦 蒙蒙懞濛矇 面面麵麪麫 复復複覆复 萱萲蕿蘐藼
1 to 5
斗鬥斗鬪鬦鬭 台臺台檯枱颱
2 to 1
著著着 兒兒(ní)儿(ér) 乾乾(qián)干(gān) 夥夥伙 藉藉(jí)借(jiè) 瞭瞭(liào)了(liǎo) 麼麽(mó)么(me) 餘馀余 摺摺折 徵徵(zhǐ)征(zhēng) 畫画划 鯰鲶鲇 瀋沈渖 鹼碱硷
Special Case
薴 (níng, limonene) has the simplified form 苧; however, 苧 (zhù, boehmeria) is itself a traditional character, which has the simplified form 苎. 薴苧 苧苎
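One practical consequence of this chained mapping can be sketched in Python. The table `T2S` and both function names are hypothetical and contain only the two entries above. The sketch contrasts a single pass over the text with sequential whole-string replacement, which wrongly converts the output of the first rule a second time.

```python
# Hypothetical two-entry table: traditional character -> simplified character.
T2S = {"薴": "苧", "苧": "苎"}


def to_simplified(text):
    """Convert in a single pass: each input character is looked up once."""
    return "".join(T2S.get(ch, ch) for ch in text)


def to_simplified_chained(text):
    """Naive approach: apply each rule to the whole string in turn.

    The output of the 薴 -> 苧 rule is picked up again by the 苧 -> 苎 rule,
    so 薴 ends up as 苎, which is not its simplified form.
    """
    for trad, simp in T2S.items():
        text = text.replace(trad, simp)
    return text
```

With these definitions, `to_simplified("薴")` yields 苧, while `to_simplified_chained("薴")` yields 苎. For the same reason, running a conversion twice over text that is already simplified is not guaranteed to be harmless.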