
fix: unscale fp16 gradient problem & potential error (#6086) #6231

Merged · 3 commits merged into huggingface:main on Dec 21, 2023

Conversation

@lvzii (Contributor) commented Dec 19, 2023

Referring to commit ,
this fixes #6086 in train_text_to_image_lora_sdxl.py, and also fixes #4619 in train_text_to_image_lora_sdxl.py, which came up while fixing the former error.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Comment on lines +1201 to +1203
# Make sure vae.dtype is consistent with the unet.dtype
if args.mixed_precision == "fp16":
    vae.to(weight_dtype)
Member
This is not needed in my opinion. We already set the torch_dtype in the pipeline when loading it.

Contributor Author

If no other pretrained_vae_model_name_or_path is set, the vae is cast to float32:

if args.pretrained_vae_model_name_or_path is None:
    vae.to(accelerator.device, dtype=torch.float32)

and the pipeline here does not reload the vae. So vae.dtype (float32) != unet.dtype (fp16), which in my tests causes RuntimeError: Input type (c10::Half) and bias type (float) should be the same.
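
A minimal sketch of this mismatch (the layer, names, and shapes below are invented for illustration; they only mimic an fp32 module receiving fp16 activations, which is what happens when the fp32 vae decodes fp16 latents):

# Hypothetical stand-ins: a float32 "vae-like" conv layer fed fp16 latents.
import torch

vae_like = torch.nn.Conv2d(4, 3, kernel_size=3, padding=1)   # parameters stay in float32
latents = torch.randn(1, 4, 8, 8, dtype=torch.float16)       # fp16, like activations under mixed precision

try:
    vae_like(latents)          # float32 weights vs. fp16 input
except RuntimeError as err:
    print(err)                 # a dtype-mismatch error of the same kind as above

vae_like.to(torch.float16)     # what vae.to(weight_dtype) achieves in the diff: matching dtypes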

Member

Ah okay got it!

Comment on lines +643 to +652
# Make sure the trainable params are in float32.
if args.mixed_precision == "fp16":
    models = [unet]
    if args.train_text_encoder:
        models.extend([text_encoder_one, text_encoder_two])
    for model in models:
        for param in model.parameters():
            # only upcast trainable parameters (LoRA) into fp32
            if param.requires_grad:
                param.data = param.to(torch.float32)
Member

Works for me!
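
As I understand the issue referenced in the PR title, torch.cuda.amp.GradScaler refuses to unscale gradients stored in fp16 ("Attempting to unscale FP16 gradients."), so under fp16 mixed precision the parameters handed to the optimizer must live in float32 while frozen weights can stay in fp16. A toy sketch of the pattern from the snippet commented on above (the two-layer model is invented; only the requires_grad/dtype handling matters):

# Hypothetical model standing in for frozen base weights plus a trainable LoRA adapter.
import torch

model = torch.nn.Sequential(
    torch.nn.Linear(8, 8),   # stands in for frozen base weights
    torch.nn.Linear(8, 8),   # stands in for a trainable LoRA layer
).half()                     # everything starts out in fp16

model[0].requires_grad_(False)   # freeze the "base" layer

# Only upcast trainable parameters into fp32, as the diff does:
for param in model.parameters():
    if param.requires_grad:
        param.data = param.to(torch.float32)

print({name: p.dtype for name, p in model.named_parameters()})
# Frozen params remain torch.float16, trainable ones are now torch.float32,
# so their gradients are fp32 and the scaler's unscale step no longer trips.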

@sayakpaul (Member) left a comment

TYSM!

@sayakpaul sayakpaul merged commit 6ca9c4a into huggingface:main Dec 21, 2023
@sayakpaul (Member)

Thank you for your contributions!

donhardman pushed a commit to donhardman/diffusers that referenced this pull request Dec 29, 2023
AmericanPresidentJimmyCarter pushed a commit to AmericanPresidentJimmyCarter/diffusers that referenced this pull request Apr 26, 2024