bpe
Here are 77 public repositories matching this topic...
PyTorch original implementation of Cross-lingual Language Model Pretraining.
-
Updated
Jul 28, 2020 - Python
-
Updated
Mar 1, 2023 - Shell
Java library implementing Byte-Pair Encoding Tokenization
-
Updated
May 17, 2023 - Java
An extremily simple and restricted tool/lib converting binary data into text that can be processed with unsuperwised character-level natural language processing tools/libs
-
Updated
Oct 13, 2023 - Python
simple chatbot using NLP and BPE
-
Updated
Jul 28, 2023 - Jupyter Notebook
A modified, secure version of BPE algorithm
-
Updated
Mar 29, 2024 - Python
This repository provides a clear, educational implementation of Byte Pair Encoding (BPE) tokenization in plain Python. The focus is on algorithmic understanding, not raw performance.
-
Updated
Aug 28, 2024 - Python
Source crypt Gradle plugin
-
Updated
May 3, 2022 - Kotlin
-
Updated
Sep 4, 2022 - Jupyter Notebook
Byte-Pair Encoding (BPE) (subword-based tokenization) algorithm implementaions from scratch with python
-
Updated
Jan 30, 2023 - Python
Repository for the experiments in my paper: "A Systematic Analysis of Vocabulary and BPE Settings for Optimal Fine-tuning of NMT: A Case Study of In-domain Translation "
-
Updated
Apr 1, 2022
ASR pytorch project
-
Updated
Oct 16, 2022 - Python
Zero-dependency implementation of BitNet neural network training and BPE tokenization in C
-
Updated
Aug 3, 2024 - C
Low resource language machine translation(az,be,tr -> en).
-
Updated
Nov 10, 2018 - Python
Improve this page
Add a description, image, and links to the bpe topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the bpe topic, visit your repo's landing page and select "manage topics."