GitHub - nramkissoon/Kanjidicparser: Script for parsing kanji data from the KANJIDIC2 project

Kanjidicparser

Kanjidicparser.py is a script for parsing data from the KANJIDIC project: http://www.edrdg.org/wiki/index.php/KANJIDIC_Project

The script exports a json file ("kanji_dict.json") containing a nested dictionary with data for all kanji entries in the KANJIDIC database.

Usage

Make sure kanjidic2.xml is downloaded from the KANJIDIC project site into the same directory as the script and run the script file.

How to use kanji_dict.json

kanji_dict.json is a nested dictionary where each key is an individual kanji. Each kanji key links to another dictionary where kanji-specific information.

Example entry: 増

meanings: 'increase', 'add', 'augment', 'gain', 'promote'
onyomi: 'ゾウ'
kunyomi: 'ま.す', 'ま.し', 'ふ.える', 'ふ.やす'
nanori: 'まし', 'ます'
freq: '231'
jlpt: '2'

Accessing information: kanji_dict.json loaded into variable dict

dict["増"]["meanings"] returns the list ['increase', 'add', 'augment', 'gain', 'promote']

Notes regarding data fields

meanings - definitions in English

onyomi - readings closer to original Chinese readings, usually used for noun and compounds

kunyomi - Japanese readings

freq - frequency of occurrence

jlpt - JLPT level

License informtion

kanjidicparser.py is free to use and modify. Data from the KANJIDIC projects is subject to conditions found at http://www.edrdg.org/edrdg/licence.html.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
kanji_dict.json		kanji_dict.json
kanjidic2.xml		kanjidic2.xml
kanjidicparser.py		kanjidicparser.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

nramkissoon/Kanjidicparser

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages