Contribute with trained packages or training checkpoints for a language #50
Replies: 5 comments 30 replies
-
Help with training is definitely appreciated! If anyone's interested in helping the first step would be to get the training script running with the auto downloaded data. |
Beta Was this translation helpful? Give feedback.
-
I used the Wiki OPUS data (basically just ran your training script), and tbh that data is kinda garbage. 1/20 of the sentences are some random stuff. E.g. in English it's gonna be about some geopolitics, while in Russian it's gonna be about guitar picks. I'd love to get some more quality data, but it's super hard to find, I couldn't even find where to buy this kinda stuff.
Yep, here it is, https://drive.google.com/file/d/16j1xiRyhVnepUrB1a0Txk8-um3-sw7Vv/view?usp=sharing |
Beta Was this translation helpful? Give feedback.
-
I'm very new to scripting, nmt and all this stuff. This might be my first github comment. But i do have a couple old computers (ubuntu, manjaro) which could be put to good use training language models. They do not have separate GPUs. I followed instructions for the training script (apart from installing CUDA), and encountered an error when trying to run
so i checked out the |
Beta Was this translation helpful? Give feedback.
-
@PJ-Finlay, hey. I finally managed to train a good en->ru model with the new guide you made. It's actually better than the legacy one you have, except for curse words. What data did you use so the legacy one nails curse words so well? |
Beta Was this translation helpful? Give feedback.
-
Hi @PJ-Finlay |
Beta Was this translation helpful? Give feedback.
-
Hi,
I was wondering if you would be interested with training-checkpoints or trained models for languages?
I see it as a way to crowdsource/share GPU power that we all might have 🙂
My understanding is that you have an index-file with links to you GC storage, but also some are kept on Google-Drive and GitHub-raw-content too.
I appreciate comments/clarifications, and I hope the idea helps 👍
Beta Was this translation helpful? Give feedback.
All reactions