Fix mixed precision torch compile bug #75

Skylion007 · 2023-10-07T18:31:57Z

This is solves the bug that torch compile encountered. It didn't know the module was in fp16 until it hit weights in the backward pass because it does not know that .half() has side-effects. Making the program adhere to a more functional paradigm completely solves this bug (by invalidating the dtype cache of the nn Module).

This is solves the bug that torch compile encountered. It didn't know the module was in fp16 until it hit weights in the backward pass because it does not know that .half() has side-effects. Making the program more functional completely solves this bug

mvpatel2000

Nice!!

jon-chuang · 2023-10-23T17:21:13Z

@Skylion007 could you give more context on:

Where the torch.compile was invoked?

jon-chuang · 2023-10-23T17:34:51Z

Something that I find extremely suspicious about this fix is the following:

self.text_encoder.requires_grad_(False)
self.vae.requires_grad_(False)
if self.encode_latents_in_fp16:
    self.text_encoder = self.text_encoder.half()
    self.vae = self.vae.half()

Since vae does not require grad, how can it throw an error:

RuntimeError: attempting to assign a gradient with dtype 'float' to a tensor with dtype 'c10::Half'. Please ensure that the gradient and the tensor have the same dtype

About the dtype of its grad?

Skylion007 requested review from A-Jacobson, mvpatel2000, jazcollins, Landanjs and coryMosaicML October 7, 2023 18:31

Skylion007 mentioned this pull request Oct 7, 2023

torch.compile does not know .half() has side-effects pytorch/pytorch#110797

Closed

mvpatel2000 approved these changes Oct 8, 2023

View reviewed changes

Skylion007 merged commit 3365aec into main Oct 8, 2023
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix mixed precision torch compile bug #75

Fix mixed precision torch compile bug #75

Skylion007 commented Oct 7, 2023 •

edited

Loading

mvpatel2000 left a comment

jon-chuang commented Oct 23, 2023

jon-chuang commented Oct 23, 2023

Fix mixed precision torch compile bug #75

Fix mixed precision torch compile bug #75

Conversation

Skylion007 commented Oct 7, 2023 • edited Loading

mvpatel2000 left a comment

Choose a reason for hiding this comment

jon-chuang commented Oct 23, 2023

jon-chuang commented Oct 23, 2023

Skylion007 commented Oct 7, 2023 •

edited

Loading