Training time #2

Open
vineet2104 opened this issue Nov 5, 2024 · 3 comments
Comments

@vineet2104

vineet2104 commented Nov 5, 2024

Thank you for the amazing work!
I am trying to train your model on COCO to replicate some of your results, using the same hardware setup (4 A100 GPUs, batch size 16). Could you tell me how many days training took you?

@vineet2104 changed the title from "Training iterations" to "Training time" on Nov 6, 2024
@chenxi52
Owner

chenxi52 commented Nov 11, 2024

[image attachment]
You can refer to this.

@vineet2104
Author

Do you use any CUDA optimizations to make the code run faster? Or do you train with FP16 or BF16?

@chenxi52
Owner

> Do you use any CUDA optimizations to run the code faster? Or maybe training with FP16 or BF16?

We haven't utilized them in this repo, but you could experiment with them to observe their impact on training time and the final outcomes.
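For anyone who wants to try this, here is a minimal sketch of a BF16 mixed-precision training step. It assumes the training code is PyTorch, which this thread does not confirm; `train_step`, the toy `nn.Linear` model, and the random tensors are hypothetical stand-ins, not code from this repo.

```python
import torch
from torch import nn


def train_step(model, optimizer, inputs, targets, device_type, dtype=torch.bfloat16):
    """One optimization step with the forward pass run under autocast.

    Hypothetical helper for illustration; not part of this repository.
    """
    optimizer.zero_grad(set_to_none=True)
    # Eligible ops inside this context run in the lower-precision dtype.
    with torch.autocast(device_type=device_type, dtype=dtype):
        predictions = model(inputs)
    # Compute the loss in FP32 for numerical stability.
    loss = nn.functional.mse_loss(predictions.float(), targets)
    loss.backward()
    optimizer.step()
    return loss.item()


# Toy stand-in for the real model and data (assumption, for illustration only).
device = "cuda" if torch.cuda.is_available() else "cpu"
model = nn.Linear(8, 1).to(device)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
inputs = torch.randn(16, 8, device=device)
targets = torch.randn(16, 1, device=device)

loss_value = train_step(model, optimizer, inputs, targets, device_type=device)
print(f"loss after one mixed-precision step: {loss_value:.4f}")
```

BF16 keeps the FP32 exponent range, so it usually works without loss scaling. FP16 typically also needs gradient scaling (for example `torch.cuda.amp.GradScaler`). Either way, compare the final metrics against a full-precision run before relying on the speed-up.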
