I use an RTX 3090, so the GPU memory is 24 GB. Training speed on GPUs comes from parallelism across the batch, so training with a very small batch size is likely to be extremely slow. If you are training on a low-memory GPU, try gradient checkpointing. For more information on what checkpointing does (not to be confused with snapshots, which store training progress), see for example https://pytorch.org/docs/stable/checkpoint.html.
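Since the reply points at PyTorch's activation checkpointing docs, here is a minimal sketch of how `torch.utils.checkpoint` can wrap blocks of a model so their activations are recomputed during the backward pass instead of being stored; the `CheckpointedMLP` name, layer sizes, and depth are illustrative placeholders, not the model from this discussion.

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint


class CheckpointedMLP(nn.Module):
    """Illustrative model: each block is checkpointed, trading extra
    compute in backward for lower activation memory in forward."""

    def __init__(self, dim=1024, depth=8):
        super().__init__()
        self.blocks = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, dim), nn.ReLU())
            for _ in range(depth)
        )

    def forward(self, x):
        for block in self.blocks:
            # Activations inside `block` are not kept; they are
            # recomputed when backward reaches this block.
            x = checkpoint(block, x, use_reentrant=False)
        return x


model = CheckpointedMLP().cuda()
x = torch.randn(4, 1024, device="cuda", requires_grad=True)  # small batch
loss = model(x).sum()
loss.backward()  # checkpointed blocks re-run their forward here
```

The trade-off is roughly one extra forward pass of compute per checkpointed block, in exchange for not holding that block's intermediate activations in GPU memory.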
This model is huge and takes a lot of GPU memory, so the batch size can only be set to a small number. May I ask which GPU you use?