From 1f40c781cdc54f38a142b636c462c213328c2aaf Mon Sep 17 00:00:00 2001 From: Dendy Faist Date: Sat, 20 Sep 2025 07:02:39 +0200 Subject: [PATCH] feat: Update lists & add light novel corpus --- data/ignored.list | 130 ++++++++++++++++++++++++++++++++++++++--- data/kanken-links.json | 3 +- data/lists/ebook.list | 10 +++- 3 files changed, 130 insertions(+), 13 deletions(-) diff --git a/data/ignored.list b/data/ignored.list index b0f80ac..06575fd 100644 --- a/data/ignored.list +++ b/data/ignored.list @@ -1,8 +1,125 @@ -巹 +刄 +Just an 異体字 of 刃... Dunno, doesn't make much sense to me + +吨 +For phonetic purposes, used to say "ton" as in the unit of weight... that's it. + +刕 +Kanji used for the names of weapons and stuff like that I guess just cuz it +looks kinda cool. + +吣 +Puking of cats and dogs, also foul language. This is the simplification of 唚. +Doesn't seem to be all that used. Can't find examples of any words with this. + +吡 +Only used in transliteration from other languages for "bi", that's kinda it. +It kinda makes sense doe. + +叽 +Just a variaton on 叫 that isn't all that used it seems. Not worth it, it seems +from my little research. + +叚 +Variant form of 假 which kinda seems to mean borrowing or something. Crazy. + +叕 +No information on it. Probably old version of something that turned into +something with radicals or whatever. + +厾 +To touch lightly, to poke (with a stick, or with whatever). Maybe this pops +up in the future, revive in such a case but it's not really seeming like a very +popular word from my search + +厤 +Seems to be a historial version of 歴, not seen anywhere but dictionaries of +ancient stuff. Not really worth it + +厣 +A covering flap in animals, such as a gill cover. +Not very interesting for me to learn + +厝 +Not really used much. Whetstone, used in some expressions but failed to find +usages of them. Seems to be also a variant of 錯, which is interesting. + +厓 +Old version of 崖 and 涯. I guess it used to mean kinda both and the radicals +were added later + +剅 +Seems like it used only in the name of a specific place and in dialects. I can't +find youtube videos that use it even, really. + +刿 +Not really used for anything. Simplification of 劌 which isn't really all that +used anyway. + +刽 +Simplified variant of a kanji that isn't really used. Only one 熟語 in the +chinese dictionary, and the two in the Wiktionary are literary and archaic. + +刱 +Really I should've learnt this one but oh well. A variant of 剏, but this is +accepted for the kanken. + +刧 +Variant form of 劫, not really all that interesting. + +刬 +Same as 剗, it's just a simplified version: +> Another variant of 鏟... how? I must be missing something. + +厎 +Not used at all lol. It seems to be an 異体字, means whetstone. Seems like +to be treated kinda like 磨 in japanese or something + +厍 +Only used in names, it's not the simplified version of 庫 + +剟 +To delete or to cut into blocks... not much else. There's no reason in learning +this. In japanese yet another kezuru lol. + +剜 +There's no info online about it. It just means "to scoop" as in "えぐる" +(yet another one). Doesn't make much sense to learn. +(it's not cutting an arm as a punishment) + +剗 +Another variant of 鏟... how? I must be missing something. + +剐 +Unorthodox variant of 剮. Doesn't seem to be all that used, not worth it. +Seems to mean to cut flesh from bone + +剡 +Chinese character, used in names of stuff, not much else. Doesn't seem to be +worth it. + +剷 +Variant form of 鏟, apparently. It's only used in Taiwan and it kinda means +shoveling and leveling. Not worth the effort. + +劁 +To cut, used in the word for castrating livestock. That's it. Can't find +usages or even modern usages. + +劐 +Can't find really much info or relevancy. Can't find usage of it either. + +劦 +Unused really. Changed over to other kanji like 協 and 捏 +https://en.wiktionary.org/wiki/%E5%8A%A6 + +劢 +Simplified form of 勱, as far as I can tell it kinda means exherting oneself +and also as a simplification for 励む. There's no real value in it. + +卺 ladle for holding wine made from dried gourd (匏), used in ancient marriage -rituals. -Not very interesting but also it's the simplified version of 巹... Idk, a little -stretched. +rituals. Not used at all 卬 It's mostly only used as a component, the origins are unclear but it's known @@ -68,11 +185,6 @@ Old variant/origin? of 陶, which is 常用漢字 / 名前に使える漢字 and to kinda mean "pottery" , "clay" . It's used in 陶器[とうき]. Not interesting on its own -匜 -Old vase for holding water/wine. Not used aside from that. -https://en.wikipedia.org/wiki/Yi_(vessel) -REVISIT - 匦 Simplified form of 匭. Not of interest at all. Small box or something. diff --git a/data/kanken-links.json b/data/kanken-links.json index 8aeca36..22344ba 100644 --- a/data/kanken-links.json +++ b/data/kanken-links.json @@ -26878,5 +26878,6 @@ "䨻": "y/23050", "𬚩": "y/28415", "𠔻": "y/28412", - "𪚥": "y/28413" + "𪚥": "y/28413", + "办": "y/13156" } diff --git a/data/lists/ebook.list b/data/lists/ebook.list index 46e259a..ed58a9f 100644 --- a/data/lists/ebook.list +++ b/data/lists/ebook.list @@ -189,6 +189,10 @@ 鄆潞碣媓嗉瓱嚕畤縉怵锻輗劓輟玷煆帕蒺箛吭笒偆晹偰毉昺啤冓罏秖節禎閶搨辤凞邗彀嚈噠 瑭潢昪煜嬀㬎芾忢鍳溉袗鸊鷉鱖陏鏧鰧讌糝箟髁糙譛离㷔壩芮斝蘄虢穌鼂瑇瑒耊找凳柰縧麈 盔撾鮊鱔黿攢驊騮驌駃騠騊駼顖綉扆拄炁拽憨鰣殮鄷壳猙扒倮璇戩瘖瘂髒豭瑗癯踽偬鐇滎鼇 -盦惲翬琿龢芩暻犹洳溏沺呴棃筷豬駙珓崁⻆歃賾澂拕泆瀅氂轀鬭斁鐲懟阨頥忼擕隳熛鄄渮纊 -鶇濰帮紓幞蜇阴竽傈轌蕋駉盻禘鄹愀纆絏杇棖臧鍇騂魋忞葸悾黻勉忮踧踖絺綌紾飪阼喭僎訒 -慝蕢柙櫝肸訕耰蓧鼗璩靛坷煮糲揀 \ No newline at end of file +盦惲翬琿龢芩暻犹洳溏沺呴棃筷豬駙珓崁⻆歃賾澂絋鶇幞揀颫犖鞢韴韈嵓齩髃𥇥𥆩䑛虗殱𤄃 +姱裼蟫蚜炅佟綦虁剕甗絺塤綌璦菹貙詡偀蝪篶覬覦餤踹摔鷖躺腁霪睟摽焄鷆听哎珱昰鯳魵鮴 +鞁莇筴黻倢伃栝蕡䗪惷蔛蕢鱉尃豨蓚塋郅堋竓纊騭栫祅喼抳犾罧揵檑餺飥邙刕醮羡愡敕兹滚 +瘕軺蔲汛幗紱嘽膄崫砡槝糫薏鵟癭菔怔忡㾮鉀梍凞鈺蓪枘掫匾傕卬臧轀沚椴嵒轣噐搞您櫤誐 +儗撇儵呕槫鳫裵澑猬杔膛鶴騸菇琇鐲靖杦畆鰩划呿秂瞪弸蕫皤唏凬帮摒棙岼湶砿飃拕泆瀅氂 +鬭斁懟阨頥忼擕隳熛鄄渮濰紓蜇阴竽傈轌蕋駉盻禘鄹愀纆絏杇棖鍇騂魋忞葸悾勉忮踧踖紾飪 +阼喭僎訒慝柙櫝肸訕耰蓧鼗璩靛坷煮糲 \ No newline at end of file