
Add HF Auth mixin to Stable Diffusion #1763

Closed · wants to merge 22 commits
Conversation

@msaroufim (Member) commented Jul 13, 2023

Right now stable diffusion and lit-llama are not actually running in CI because they get rate limited by Hugging Face. Since we've now added an auth token as a GitHub secret, we can move stable diffusion out of canary and do things like include it in the blueberries dashboard.

We also added some helpful errors so people running torchbench locally know they will need a token to run these models.

Auth is implemented as a mixin, which seems like the right abstraction.
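For context, a minimal sketch of what such an auth mixin could look like. This is illustrative, not the exact code in the PR: the class name, env var check, and error message are assumptions based on the description above.

```python
import os


class HuggingFaceAuthMixin:
    """Illustrative sketch (not the PR's exact code): fail fast with a
    clear message when no Hugging Face token is available, so local
    torchbench users know why gated model downloads would fail."""

    def __init__(self):
        # Assumed env var name; the token is set as a GitHub secret in CI.
        if "HUGGINGFACE_HUB_TOKEN" not in os.environ:
            raise NotImplementedError(
                "Set HUGGINGFACE_HUB_TOKEN to run this model; downloads "
                "from the Hugging Face Hub are otherwise rate limited."
            )
```

A model class would then inherit from this mixin and call its `__init__` before attempting to download weights.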

Some relevant details about the model

Torchbench has a function get_module() whose intent is to test an nn.Module on an actual torch.Tensor.

Unfortunately, a StableDiffusionPipeline is not an nn.Module; it is a composition of a tokenizer and three separate nn.Modules: a text encoder, a VAE, and a UNet.

text_encoder

    def get_module(self):
        batch_size = 1
        sequence_length = 10
        vocab_size = 32000

        # Generate random indices within the valid range
        input_tensor = torch.randint(low=0, high=vocab_size, size=(batch_size, sequence_length))

        # Make sure the tensor has the correct data type
        input_tensor = input_tensor.long()
        print(self.pipe.text_encoder(input_tensor))
        return self.pipe.text_encoder, input_tensor

The text encoder outputs a BaseModelOutputWithPooling, which is a structured result rather than a single tensor: https://gist.github.com/msaroufim/51f0038863c5cce4cc3045e4d9f9c399

======================================================================
FAIL: test_stable_diffusion_example_cuda (__main__.TestBenchmark)
----------------------------------------------------------------------
components._impl.workers.subprocess_rpc.ChildTraceException: Traceback (most recent call last):
  File "/home/ubuntu/benchmark/components/_impl/workers/subprocess_rpc.py", line 482, in _run_block
    exec(  # noqa: P204
  File "<subprocess-worker>", line 35, in <module>
  File "<subprocess-worker>", line 12, in _run_in_worker_f
  File "/home/ubuntu/benchmark/torchbenchmark/util/model.py", line 26, in __call__
    obj.__post__init__()
  File "/home/ubuntu/benchmark/torchbenchmark/util/model.py", line 126, in __post__init__
    self.accuracy = check_accuracy(self)
  File "/home/ubuntu/benchmark/torchbenchmark/util/env_check.py", line 469, in check_accuracy
    model, example_inputs = maybe_cast(tbmodel, model, example_inputs)
  File "/home/ubuntu/benchmark/torchbenchmark/util/env_check.py", line 424, in maybe_cast
    example_inputs = clone_inputs(example_inputs)
  File "/home/ubuntu/benchmark/torchbenchmark/util/env_check.py", line 297, in clone_inputs
    assert isinstance(value, torch.Tensor)
AssertionError
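One hedged workaround (an assumption on my part, not something this PR landed) would be a thin wrapper that unwraps the structured output so forward() returns a plain tensor, which is what torchbench's accuracy checks expect:

```python
import torch


class TextEncoderWrapper(torch.nn.Module):
    """Hypothetical adapter: unwrap BaseModelOutputWithPooling so the
    encoder's forward() returns a plain torch.Tensor."""

    def __init__(self, text_encoder):
        super().__init__()
        self.text_encoder = text_encoder

    def forward(self, input_ids):
        # last_hidden_state is a plain torch.Tensor
        return self.text_encoder(input_ids).last_hidden_state
```

get_module() could then return `TextEncoderWrapper(self.pipe.text_encoder), input_tensor` instead of the raw encoder.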

vae

    def get_module(self):
        print(self.pipe.vae(torch.randn(9,3,9,9)))

The same problem applies to the VAE:
https://github.com/huggingface/diffusers/blob/main/src/diffusers/models/vae.py#L27

unet

    def get_module(self):
        # This will only benchmark the unet since that's the biggest layer
        # Stable diffusion is a composition of a text encoder, unet and vae
        encoder_hidden_states = torch.randn(320, 1024)
        sample = torch.randn(4, 4, 4, 32)
        timestep = 5
        inputs_to_pipe = {'timestep': timestep, 'encoder_hidden_states': encoder_hidden_states, 'sample': sample}
        result = self.pipe.unet(**inputs_to_pipe)
        return self.pipe.unet, inputs_to_pipe

The UNet unfortunately does not take a single tensor input; its forward expects keyword arguments, so the example inputs end up as a dict rather than tensors.
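One possible workaround, sketched here as an assumption rather than the PR's actual fix, is an adapter that gives the UNet a tensors-only positional signature so torchbench's clone_inputs (which asserts isinstance(value, torch.Tensor)) can handle the example inputs:

```python
import torch


class UNetWrapper(torch.nn.Module):
    """Hypothetical adapter: expose the UNet through a positional,
    tensors-only signature."""

    def __init__(self, unet):
        super().__init__()
        self.unet = unet

    def forward(self, sample, timestep, encoder_hidden_states):
        return self.unet(sample=sample, timestep=timestep,
                         encoder_hidden_states=encoder_hidden_states)


# The example inputs then become a tuple of tensors only; the int
# timestep is promoted to a 0-d tensor:
# (torch.randn(4, 4, 4, 32), torch.tensor(5), torch.randn(320, 1024))
```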

For the VAE and the encoder, the test failure is particularly helpful:

(sam) ubuntu@ip-172-31-9-217:~/benchmark$ python test.py -k "test_stable_diffusion_example_cuda"
F
======================================================================
FAIL: test_stable_diffusion_example_cuda (__main__.TestBenchmark)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/home/ubuntu/benchmark/test.py", line 75, in example_fn
    assert accuracy == "pass" or accuracy == "eager_1st_run_OOM", f"Expected accuracy pass, get {accuracy}"
AssertionError: Expected accuracy pass, get eager_1st_run_fail

----------------------------------------------------------------------
Ran 1 test in 7.402s

FAILED (failures=1)

@msaroufim msaroufim requested a review from xuzhao9 July 13, 2023 01:42
@msaroufim msaroufim requested a review from xuzhao9 July 13, 2023 17:45
@msaroufim msaroufim changed the title Add Auth to Stable Diffusion Add HF Auth to Stable Diffusion Jul 13, 2023
@msaroufim msaroufim changed the title Add HF Auth to Stable Diffusion Add HF Auth mixin to Stable Diffusion Jul 13, 2023
@xuzhao9 (Contributor) commented Jul 14, 2023

Looks like we also need to set the env value in the Docker run command so that the Docker container can access it: https://github.com/pytorch/benchmark/blob/main/.github/workflows/pr-a10g.yml#L38

Use docker run -e HUGGINGFACE_HUB_TOKEN=${HUGGINGFACE_HUB_TOKEN} ... to set the env value in the docker container.
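For illustration, the workflow step might look something like the following. This is a sketch only; the step name and image variable are assumptions, and the real file is the pr-a10g.yml linked above:

```yaml
# Illustrative fragment: forward the repository secret into the container.
- name: Run benchmarks in Docker
  env:
    HUGGINGFACE_HUB_TOKEN: ${{ secrets.HUGGINGFACE_HUB_TOKEN }}
  run: |
    docker run -e HUGGINGFACE_HUB_TOKEN="${HUGGINGFACE_HUB_TOKEN}" ...
```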

@msaroufim (Member, Author) commented Jul 15, 2023

The last part that's tripping me up is how to make get_module() work for this PR. @xuzhao9 I posted some logs and an explanation above, let me know if you have any thoughts.

@msaroufim msaroufim requested a review from xuzhao9 July 17, 2023 16:08
@msaroufim (Member, Author) commented Jul 18, 2023

Thanks @xuzhao9 for the offline help; the example error was fixed locally.

I had to run python run.py "stable_diffusion" -d cuda --accuracy to see the real error

@xuzhao9 (Contributor) left a review comment:

Nice to see the CI is green on this PR. Great job!

@facebook-github-bot commented: @msaroufim has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot commented: @msaroufim merged this pull request in 411e388.
