Add Magvit-v2 (vqvae) #656

hqkate · 2024-09-10T04:09:09Z

What does this PR do?

Adds MAGVIT-v2 VQVAE training scripts, supporting both PyNative and graph mode.

Fixes # (issue)

Adds # (feature)

Lookup-Free Quantization (LFQ)
VQVAE-3d Training
VQGAN Training
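
For readers unfamiliar with LFQ, here is a minimal pure-Python sketch of the idea as described in the MAGVIT-v2 paper: each latent dimension is quantized to -1 or +1 by its sign, and the token index is read off as a binary number, so no codebook lookup is needed. This is an illustration only, not the code added in this PR; the function names `lfq_quantize` and `lfq_decode` are hypothetical, and training details such as the entropy and commitment losses are omitted.

```python
from typing import List, Tuple


def lfq_quantize(latent: List[float]) -> Tuple[List[int], int]:
    """Quantize one latent vector with lookup-free quantization (sketch).

    Each dimension is mapped to -1 or +1 by its sign. The token index is the
    integer whose binary digits mark the positive dimensions (bit i is set
    when latent[i] > 0).
    """
    codes = [1 if value > 0 else -1 for value in latent]
    index = sum(2 ** i for i, code in enumerate(codes) if code == 1)
    return codes, index


def lfq_decode(index: int, dim: int) -> List[int]:
    """Recover the -1/+1 code from a token index without a codebook lookup."""
    return [1 if (index >> i) & 1 else -1 for i in range(dim)]


codes, index = lfq_quantize([0.3, -1.2, 0.7, -0.1])
print(codes, index)          # [1, -1, 1, -1] 5
print(lfq_decode(index, 4))  # [1, -1, 1, -1]
```

With `dim` latent dimensions this gives an implicit vocabulary of `2 ** dim` tokens, which is why the quantizer scales to large vocabularies without storing an embedding table.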

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you make sure to update the documentation with your changes? E.g. record bug fixes or new features in What's New. Here are the documentation guidelines.
Did you build and run the code without any errors?
Did you report the running environment (NPU type/MS version) and performance in the doc? (better record it for data loading, model inference, or training tasks)
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@xxx

SamitHuang · 2024-09-12T06:43:29Z

examples/magvit/README.md

+
+For the pretraining of VQVAE-2d, we provide a pretrained model weights as follow:
+
+| Model | Dataset | Image Size | Weights | PSNR | SSIM |


What are the weights here?

I was about to upload the pretrained model weights and provide a download link in the README. But since this PR is not the final version of the code, I probably won't upload the model weights for now. The "Weights" column has been removed; please check. Thanks!

SamitHuang · 2024-09-12T06:49:28Z

examples/magvit/videogvt/models/quantization/lookup_free_quantization.py

@@ -0,0 +1,347 @@
+# Copyright 2024 Huawei Technologies Co., Ltd
+# Copyright (c) 2022-present, Kakao Brain Corp.


Remove these copyright notices.

CaitinZhao · 2024-09-13T03:45:47Z

examples/magvit/README.md

@@ -0,0 +1,119 @@
+# MAGVIT-v2: Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
+
+This folder contains the Mindspore implementation of [MAGVIT-v2](https://arxiv.org/pdf/2310.05737).


Add a link to the original code repo.

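The original source code is not open-sourced. I added a note about this in the README and also listed some of the code repositories used as references.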

hqkate added 4 commits September 10, 2024 11:50

add magvit vqvae

7cc1b17

remove tests scripts

9605bc7

rm unused files

3937dbd

minor fix

b01ac0f

hqkate marked this pull request as ready for review September 11, 2024 08:50

hqkate requested review from CaitinZhao, SamitHuang and zhanghuiyao as code owners September 11, 2024 08:50

fix pre-commit

f3294b1

SamitHuang reviewed Sep 12, 2024

hqkate added 2 commits September 13, 2024 09:36

update readme and rm copyrights

5c694c6

rm copyright

bea7f42

CaitinZhao reviewed Sep 13, 2024

hqkate added 5 commits September 27, 2024 11:08

update readme and minor fix

a275322

Merge pull request #1 from hqkate/magvit-dev

d3d3531

Update readme and minor fix

Fix format and some bugs (#2)

ee4c1cb

* name change
* bugfix inflate
* minor fix
* modify config

update readme

7651a47

Merge branch 'mindspore-lab:master' into magvit

a729089
