some questions about the model architecture #28

Marigod98 · 2022-06-16T08:41:01Z

Thanks for your wonderful work!
I am interested in the model architecture: two branches, optimize post and prior distributions with KL_divergence and use some other loss to optimize decoder, I try to use the architecture on some other works. but when I train the model, KL_divergence can‘t be optimized well, always meet error: NaN or Inf found in input tensor, did you ever meet the error？ could you share some experience about how to optimize KL_divergence?
Thank you very much and sorry for the inconvenience.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

some questions about the model architecture #28

some questions about the model architecture #28

Marigod98 commented Jun 16, 2022

some questions about the model architecture #28

some questions about the model architecture #28

Comments

Marigod98 commented Jun 16, 2022