
[training] feat: enable quantization for hidream lora training. #11494


Merged: 6 commits merged into main from quantized-hidream-training on May 5, 2025

Conversation

@sayakpaul sayakpaul (Member) commented May 5, 2025

What does this PR do?

This PR adds support for applying bitsandbytes quantization to the base model before we attach LoRA params to it and train them. This reduces memory consumption quite a bit:

(with quantization)
Memory (before device placement): 9.085089683532715 GB.
Memory (after device placement): 34.59585428237915 GB.
Memory (after backward): 36.90267467498779 GB.

(without quantization)
Memory (before device placement): 0.0 GB.
Memory (after device placement): 57.6400408744812 GB.
Memory (after backward): 59.932212829589844 GB.
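
For reference, memory readings like these can be collected from torch's CUDA memory stats; this is only a sketch, and the exact logging in the training script may differ:

# Hedged sketch: how "Memory (<stage>): <value> GB." lines could be produced.
import torch

def print_memory(stage: str) -> None:
    # Peak allocation observed on the current CUDA device so far.
    gb = torch.cuda.max_memory_allocated() / (1024 ** 3)
    print(f"Memory ({stage}): {gb} GB.")

print_memory("after device placement")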

With --offload, we can reduce memory further. The reason we see some memory before device placement in the quantized case is that bnb-quantized models are placed on the GPU by default.
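
Conceptually, the flow is: quantize the base transformer with bitsandbytes first, then attach trainable LoRA params on top. A minimal sketch of that idea (the subfolder name and LoRA target modules are illustrative assumptions, not necessarily what the script uses):

import torch
from diffusers import BitsAndBytesConfig, HiDreamImageTransformer2DModel
from peft import LoraConfig

# 4-bit NF4 quantization for the frozen base model (mirrors bnb_config.json below).
bnb_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_quant_type="nf4")

transformer = HiDreamImageTransformer2DModel.from_pretrained(
    "HiDream-ai/HiDream-I1-Dev",
    subfolder="transformer",          # assumed checkpoint layout
    quantization_config=bnb_config,
    torch_dtype=torch.bfloat16,
)
transformer.requires_grad_(False)

# Only the low-rank adapter params are trainable.
lora_config = LoraConfig(
    r=8,
    lora_alpha=8,
    init_lora_weights="gaussian",
    target_modules=["to_k", "to_q", "to_v", "to_out.0"],  # illustrative targets
)
transformer.add_adapter(lora_config)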

Quick test:

export MODEL_NAME="HiDream-ai/HiDream-I1-Dev"
export INSTANCE_DIR="linoyts/3d_icon"
export OUTPUT_DIR="trained-hidream-lora"

CUDA_VISIBLE_DEVICES=0 accelerate launch train_dreambooth_lora_hidream.py \
	--pretrained_model_name_or_path=$MODEL_NAME \
	--dataset_name=$INSTANCE_DIR  \
	--output_dir=$OUTPUT_DIR  \
	--bnb_quantization_config_path="bnb_config.json"  \
	--mixed_precision="bf16" \
	--instance_prompt="3d icon"  \
	--caption_column="prompt"  \
	--resolution=1024   --train_batch_size=1   \
	--gradient_accumulation_steps=4   \
	--use_8bit_adam   --rank=8   \
	--learning_rate=2e-4   --report_to="wandb" \
	--lr_scheduler="constant_with_warmup"   --lr_warmup_steps=100 \
	--max_train_steps=1000 \
	--cache_latents  --gradient_checkpointing  \
	--validation_epochs=25   --seed="0"   \
	--final_validation_prompt="a 3dicon, a llama eating ramen"

The bnb config JSON:

{
    "load_in_4bit": true,
    "bnb_4bit_quant_type": "nf4"
}
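
Presumably the new --bnb_quantization_config_path flag simply deserializes this JSON into a diffusers BitsAndBytesConfig; a minimal sketch under that assumption (the helper name here is made up):

import json
from diffusers import BitsAndBytesConfig

def load_bnb_config(path: str) -> BitsAndBytesConfig:
    # Keys in the JSON file map directly to BitsAndBytesConfig constructor arguments.
    with open(path) as f:
        return BitsAndBytesConfig(**json.load(f))

bnb_config = load_bnb_config("bnb_config.json")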

Results:
(result image omitted; see the W&B run below)

WandB: https://wandb.ai/sayakpaul/dreambooth-hidream-lora/runs/01l8vy12

TODO

  • Docs
  • Complete a full reasonable run

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@sayakpaul sayakpaul requested a review from linoytsaban May 5, 2025 14:34
@sayakpaul sayakpaul marked this pull request as ready for review May 5, 2025 14:34
@@ -179,7 +179,7 @@ class BitsAndBytesConfig(QuantizationConfigMixin):
     This is a wrapper class about all possible attributes and features that you can play with a model that has been
     loaded using `bitsandbytes`.

-    This replaces `load_in_8bit` or `load_in_4bit`therefore both options are mutually exclusive.
+    This replaces `load_in_8bit` or `load_in_4bit` therefore both options are mutually exclusive.
sayakpaul (Member, Author):
Harmless change.

@linoytsaban linoytsaban (Collaborator) left a comment:
nice! 🤏🤏🤏

@linoytsaban (Collaborator):

btw @sayakpaul - do we usually do similar quantization support with a json config?

@sayakpaul (Member, Author):

Not sure what you mean 👀

@sayakpaul sayakpaul merged commit 071807c into main May 5, 2025
16 checks passed
@sayakpaul sayakpaul deleted the quantized-hidream-training branch May 5, 2025 15:14