[Feature] DoReMi #54
…ature_doremi_new_codebase
…per domain in the global batch size
Amazing job 🚀🚀🚀 Can we move the src/nanotron/doremi folder to examples/doremi, just like we did with moe? That way it's clear which features correspond to which example, and we keep the nanotron source code minimal.
Very nice integration with nanotron. Last batch of comments before merging this!
LGTM 🚀 🚀 🚀 🚀
Some minor comments left
README: https://github.com/huggingface/nanotron/blob/xrsrke/feature_doremi_new_codebase/examples/doremi/README.md