-
Notifications
You must be signed in to change notification settings - Fork 71
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Update] diffusers v0.29.2 Update #650
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
townwish4git
force-pushed
the
diffusersv0.29.2
branch
from
September 2, 2024 11:37
4fa41fc
to
0c87422
Compare
CaitinZhao
approved these changes
Sep 13, 2024
townwish4git
changed the title
[WIP] Prepare for diffusers v0.29.2 Update - Prerequisite Integration
[Updata] diffusers v0.29.2 Update
Sep 14, 2024
townwish4git
changed the title
[Updata] diffusers v0.29.2 Update
[Update] diffusers v0.29.2 Update
Sep 14, 2024
@vigo999 will take over this version upgrade. |
… from StableDiffusionPipeline
townwish4git
force-pushed
the
diffusersv0.29.2
branch
from
September 26, 2024 04:00
74c6f90
to
03187e5
Compare
vigo999
approved these changes
Sep 26, 2024
geniuspatrick
approved these changes
Sep 27, 2024
20 tasks
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
What does this PR do?
Description
This pull request serves as a preliminary submission for integrating the diffusers library to version v0.29.2. It is intentionally marked as a work-in-progress (WIP) and should not be merged into the main branch until specific criteria are met. This early merge request aims to streamline future development processes by initiating code review and allowing for parallel testing.
Merge Criteria:
Action Items:
Once these milestones are achieved, this PR will be ready for final review and formal integration, setting a solid foundation for the upcoming v0.29.2 release.
Please note, this PR is part of the preparatory phase and requires subsequent validation steps to ensure quality and stability before final acceptance.
Features
New models/pipelines
1. Marigold
Proposed in Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation, Marigold introduces a diffusion model and associated fine-tuning protocol for monocular depth estimation. It can also be extended to perform surface normals’ estimation.
2. PixArt-Sigma
PixArt Simga is the successor to PixArt Alpha. PixArt Sigma is capable of directly generating images at 4K resolution. It can also produce images of markedly higher fidelity and improved alignment with text prompts. It comes with a massive sequence length of 300 (for reference, PixArt Alpha has a maximum sequence length of 120)!
3. AnimateDiff SDXL
a-r-r-o-w contributed the Stable Diffusion XL (SDXL) version of AnimateDiff. However, note that this is currently an experimental feature, as only a beta release of the motion adapter checkpoint is available.
4. Hunyuan DiT
Hunyuan DiT is a transformer-based diffusion pipeline, introduced in the Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding paper by the Tencent Hunyuan.
5. StableDiffusion3
This release emphasizes Stable Diffusion 3, Stability AI’s latest iteration of the Stable Diffusion family of models. It was introduced in Scaling Rectified Flow Transformers for High-Resolution Image Synthesis by Patrick Esser, Sumith Kulal, Andreas Blattmann, Rahim Entezari, Jonas Müller, Harry Saini, Yam Levi, Dominik Lorenz, Axel Sauer, Frederic Boesel, Dustin Podell, Tim Dockhorn, Zion English, Kyle Lacey, Alex Goodwin, Yannik Marek, and Robin Rombach.
ControlNets
1. ControlNetXS
ControlNet-XS was introduced in ControlNet-XS by Denis Zavadski and Carsten Rother. Based on the observation, the control model in the original ControlNet can be made much smaller and still produce good results. ControlNet-XS generates images comparable to a regular ControlNet, but it is 20-25% faster (see benchmark with StableDiffusion-XL) and uses ~45% less memory.
ControlNet-XS is supported for both Stable Diffusion and Stable Diffusion.
2. SD3 CntrolNet
More
1. Massive Refactor of from_single_file
We have further refactored from_single_file to align its logic more closely to the from_pretrained method. The biggest benefit of doing this is that it allows us to expand single file loading support beyond Stable Diffusion-like pipelines and models. It also makes it easier to load models that are saved and shared in their original format.
2. Using Long Prompts with the T5 Text Encoder
We increased the default sequence length for the T5 Text Encoder from a maximum of 77 to 256! It can be adjusted to accept fewer or more tokens by setting the max_sequence_length to a maximum of 512. Keep in mind that longer sequences require additional resources and will result in longer generation times. This effect is particularly noticeable during batch inference.
Before submitting
What's New
. Here are thedocumentation guidelines
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@xxx