Flax: Fix img2img and align with other pipeline #1824
Conversation
skirsten commented Dec 24, 2022
- Fixes for Flax img2img
- Other misc changes
- Re-aligned the img2img pipe with the normal pipe (mostly copy paste); see the usage sketch below
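For orientation, here is a rough usage sketch of the Flax img2img pipeline this PR fixes. It assumes the post-merge API mirrors the Flax text-to-image pipeline (prepare_inputs, replicated params, a jitted call); the argument names and the checkpoint revision are assumptions, not a reference.

```python
import jax
import jax.numpy as jnp
from flax.jax_utils import replicate
from flax.training.common_utils import shard
from PIL import Image
from diffusers import FlaxStableDiffusionImg2ImgPipeline

# Load the Flax weights; the "flax" revision and bfloat16 dtype are assumptions.
pipeline, params = FlaxStableDiffusionImg2ImgPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4", revision="flax", dtype=jnp.bfloat16
)

num_devices = jax.device_count()
init_image = Image.new("RGB", (512, 512))  # placeholder init image
prompt = "A fantasy landscape, trending on artstation"

# prepare_inputs is assumed to tokenize the prompts and preprocess the init images.
prompt_ids, processed_image = pipeline.prepare_inputs(
    prompt=[prompt] * num_devices, image=[init_image] * num_devices
)

# Replicate the params across devices and shard the per-device inputs.
params = replicate(params)
prompt_ids = shard(prompt_ids)
processed_image = shard(processed_image)
rng = jax.random.split(jax.random.PRNGKey(0), num_devices)

images = pipeline(
    prompt_ids=prompt_ids,
    image=processed_image,
    params=params,
    prng_seed=rng,
    strength=0.75,           # fraction of the schedule that is re-noised and denoised
    num_inference_steps=50,
    height=512,
    width=512,
    jit=True,
).images
```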
Thanks a lot! This clearly improves the existing pipeline. I just left comments with a few suggestions and ideas for a potential improvement (making strength parallelizable too).
Thanks a lot for working on this! Left some comments
    width // self.vae_scale_factor,
)
if noise is None:
    noise = jax.random.normal(prng_seed, shape=latents_shape, dtype=jnp.float32)
Any specific reason to hardcode the dtype to jnp.float32? I think we should use self.dtype here as before for half-precision inference.
This was a copy paste from the normal pipeline. There are some other places where the dtype is hardcoded to jnp.float32 related to latents. I just remember setting every occurrence to self.dtype and losing all detail in the generated images. I would prefer if all occurrences of jnp.float32 could be removed at the same time, but I can also just revert this one change. Let me know what you think.
cc @pcuenca, do we need noise in float32 in JAX?
I investigated this a bit more and found the "problem". I asked the JAX maintainers for their thoughts here: jax-ml/jax#13798. Basically, to prevent losing detail in the image, the noise has to be generated in float32 and then cast to self.dtype:
noise = jax.random.normal(prng_seed, shape=latents_shape, dtype=jnp.float32).astype(self.dtype)
Still, I would prefer not to do that in this PR and instead create a new PR that fixes all of these occurrences at once.
That experiment was very cool and instructive, thanks a lot for taking the time to clarify the behaviour! I agree to deal with this in another PR.
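For context, a minimal standalone sketch of the two sampling paths discussed above. The dtype and latent shape are illustrative placeholders; the comparison only shows that the paths differ, not the downstream image quality.

```python
import jax
import jax.numpy as jnp

key = jax.random.PRNGKey(0)
latents_shape = (1, 4, 64, 64)  # illustrative latent shape

# Path A: draw the noise directly in half precision.
noise_direct = jax.random.normal(key, shape=latents_shape, dtype=jnp.bfloat16)

# Path B: draw in float32, then cast to the target dtype (the fix proposed above).
noise_cast = jax.random.normal(key, shape=latents_shape, dtype=jnp.float32).astype(jnp.bfloat16)

# The two paths generally do not produce identical samples; per the linked JAX
# issue, the direct low-precision draw is the one observed to wash out detail.
diff = jnp.max(jnp.abs(noise_direct.astype(jnp.float32) - noise_cast.astype(jnp.float32)))
print(diff)
```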
  # run with python for loop
- for i in range(t_start, len(scheduler_state.timesteps)):
+ for i in range(start_timestep, num_inference_steps):
I think it's safer to use len(scheduler_state.timesteps) than num_inference_steps because, depending on the scheduler, there could be some extra timesteps; for example, if using PNDMScheduler with PRK steps, it'll add some extra timesteps.
🤔 I was not aware of that. Thanks for letting me know. In that case it would also have to be changed here.
Hmm, it looks to me like here it is making sure that the length is always equal to num_inference_steps by dropping some PLMS timesteps. Or am I getting that wrong?
Actually you are right, let's use num_inference_steps here, because the timesteps are extended for 2nd-order schedulers like Heun.
Hmm, I'm not sure about this. Because the timesteps are extended for some schedulers, shouldn't we loop through the timesteps instead?
Those schedulers are not part of Flax yet, no? (Specifically the HeunScheduler.)
No, they are not.
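To make the loop-bound question above concrete, a small self-contained sketch; the helper and the numbers are illustrative stand-ins, not the pipeline code.

```python
def start_timestep_from_strength(num_inference_steps: int, strength: float) -> int:
    # img2img skips the beginning of the schedule: with strength=0.8 and 50
    # steps, denoising starts at index 10 and runs for 40 steps.
    init_timestep = min(int(num_inference_steps * strength), num_inference_steps)
    return max(num_inference_steps - init_timestep, 0)


num_inference_steps = 50
timesteps = list(range(981, -1, -20))  # stand-in for scheduler_state.timesteps (50 entries)
start_timestep = start_timestep_from_strength(num_inference_steps, strength=0.8)

# Looping to num_inference_steps matches len(timesteps) only as long as the
# scheduler does not append extra entries (e.g. PNDM PRK steps or 2nd-order
# schedulers like Heun), which is the trade-off discussed in this thread.
for i in range(start_timestep, num_inference_steps):
    t = timesteps[i]
    # ... one denoising step at timestep t would run here ...
```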
# 0. Default height and width to unet
height = height or self.unet.config.sample_size * self.vae_scale_factor
width = width or self.unet.config.sample_size * self.vae_scale_factor
We compute the height and width in __call__ as well, so here we could make them required.
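A hypothetical illustration of that suggestion; the class and attribute names below are stand-ins, not the pipeline's actual code.

```python
class PipelineSketch:
    def __init__(self, sample_size: int = 64, vae_scale_factor: int = 8):
        self.sample_size = sample_size
        self.vae_scale_factor = vae_scale_factor

    def __call__(self, prompt: str, height: int = None, width: int = None):
        # Resolve the height/width defaults once, in __call__ only.
        height = height or self.sample_size * self.vae_scale_factor
        width = width or self.sample_size * self.vae_scale_factor
        return self._generate(prompt, height, width)

    def _generate(self, prompt: str, height: int, width: int):
        # height/width are required here, so the fallback is not duplicated.
        return (height, width)


assert PipelineSketch()("a prompt") == (512, 512)
```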
All looks good now, thank you for addressing the comments! The quality checks are failing; run make style and make quality, push, and then it should be good to merge after a green light from @pcuenca.
The failing CI seems to be unrelated to these changes (doc_builder).
@pcuenca Thanks, I added your commit. Yes, I was missing some packages (and make did not resolve python to python3), so I just assumed that the problem was not in this branch 🙈
No worries!
@skirsten @patil-suraj is this ready to merge then?
- Flax: Add components function
- Flax: Fix img2img and align with other pipeline
- Flax: Fix PRNGKey type
- Refactor strength to start_timestep
- Fix preprocess images
- Fix processed_images dimen
- latents.shape -> latents_shape
- Fix typo
- Remove "static" comment
- Remove unnecessary optional types in _generate
- Apply doc-builder code style.

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>