Tibetan phonetics engine in Python
-
Updated
Oct 27, 2024 - Python
Tibetan phonetics engine in Python
This app is a first step toward providing effective machine translation for the Classical Tibetan corpus of important religious, philosophical, and historical texts that were nearly lost during the invasion of Tibet.
This Tibetan tokenizer based on Bi-LSTM+CRF methods, it was created with the aim of aiding researchers in the field of Tibetan natural language processing.
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
🦜 NLP for Tibetan, in Python.
repo for Tibetan corpora
An application of PyBo to Tibetan Spell-Checking
syllable-based diffs that make use of google's diff-match-patch and pybo's preprocess
Basic Universal Dependencies Part-of-Speech Tagger for Tibetan
Add a description, image, and links to the tibetan-nlp topic page so that developers can more easily learn about it.
To associate your repository with the tibetan-nlp topic, visit your repo's landing page and select "manage topics."