🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
-
Updated
Sep 10, 2024 - Python
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
🦜 NLP for Tibetan, in Python.
repo for Tibetan corpora
Tibetan phonetics engine in Python
[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
Basic Universal Dependencies Part-of-Speech Tagger for Tibetan
This Tibetan tokenizer based on Bi-LSTM+CRF methods, it was created with the aim of aiding researchers in the field of Tibetan natural language processing.
An application of PyBo to Tibetan Spell-Checking
This app is a first step toward providing effective machine translation for the Classical Tibetan corpus of important religious, philosophical, and historical texts that were nearly lost during the invasion of Tibet.
syllable-based diffs that make use of google's diff-match-patch and pybo's preprocess
Add a description, image, and links to the tibetan-nlp topic page so that developers can more easily learn about it.
To associate your repository with the tibetan-nlp topic, visit your repo's landing page and select "manage topics."