
[research_projects] add shortened flux training script with quantization #11743


Open
wants to merge 8 commits into main

Conversation

DerekLiu35

Adds a shortened script to be referenced in this blog post: huggingface/blog#2888.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@DerekLiu35
Author

@sayakpaul

if __name__ == "__main__":
    class Args:
        pretrained_model_name_or_path = "black-forest-labs/FLUX.1-dev"
        data_df_path = "embeddings_alphonse_mucha.parquet"
Member

Add a note on where this is coming from.
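One way to address this, as a rough sketch only (the provenance comment below is an assumption about how the parquet was produced and should be confirmed by the author):

class Args:
    pretrained_model_name_or_path = "black-forest-labs/FLUX.1-dev"
    # NOTE (assumption): precomputed embeddings for the Alphonse Mucha dataset,
    # generated ahead of training as described in the accompanying blog post.
    # Document the exact script/notebook that produces this file.
    data_df_path = "embeddings_alphonse_mucha.parquet"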

)

# Compute loss
weighting = compute_loss_weighting_for_sd3("none", sigmas)
Member

We have the weighting_scheme arg, so let's use it from there.
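A minimal sketch of that change, assuming args.weighting_scheme and sigmas are defined in the surrounding script:

from diffusers.training_utils import compute_loss_weighting_for_sd3

# Take the weighting scheme from the parsed CLI args instead of hard-coding "none".
weighting = compute_loss_weighting_for_sd3(
    weighting_scheme=args.weighting_scheme, sigmas=sigmas
)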

noise = torch.randn_like(model_input)
bsz = model_input.shape[0]

u = compute_density_for_timestep_sampling("none", bsz, 0.0, 1.0, 1.29)
@sayakpaul (Member) Jun 19, 2025

Use args.weighting_scheme here as well.

Let's also make constants for the magic numbers.
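A sketch of both suggestions; the constant names are illustrative and the values simply mirror the hard-coded ones above:

from diffusers.training_utils import compute_density_for_timestep_sampling

# Named constants for the previously hard-coded sampling parameters.
LOGIT_MEAN = 0.0
LOGIT_STD = 1.0
MODE_SCALE = 1.29

u = compute_density_for_timestep_sampling(
    weighting_scheme=args.weighting_scheme,
    batch_size=bsz,
    logit_mean=LOGIT_MEAN,
    logit_std=LOGIT_STD,
    mode_scale=MODE_SCALE,
)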

@sayakpaul left a comment (Member)

Thanks! Left some comments.

    model = accelerator.unwrap_model(model)
    return model._orig_mod if is_compiled_module(model) else model

def save_model_hook(models, weights, output_dir):
Member

If we don't have a load model hook, then I don't think it makes sense to have this either, no? Or do we have a utility to resume from a checkpoint in this script?

Author

I kept save_model_hook to save intermediate checkpoints, but I probably didn't need to save the optimizer states too. Though, yeah, I think adding back a load model hook to resume from checkpoints is a good idea (see the sketch below).
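For reference, a minimal sketch of such a load hook, modeled loosely on the full train_dreambooth_lora_flux.py script; transformer, unwrap_model, and accelerator are assumed to exist in the surrounding script:

from peft.utils import set_peft_model_state_dict
from diffusers import FluxPipeline
from diffusers.utils import convert_unet_state_dict_to_peft

def load_model_hook(models, input_dir):
    transformer_ = None
    while len(models) > 0:
        model = models.pop()
        if isinstance(model, type(unwrap_model(transformer))):
            transformer_ = model
        else:
            raise ValueError(f"unexpected save model: {model.__class__}")

    # Load the LoRA weights written by save_model_hook and push them back
    # into the PEFT adapter on the transformer.
    lora_state_dict = FluxPipeline.lora_state_dict(input_dir)
    transformer_state_dict = {
        k.replace("transformer.", ""): v
        for k, v in lora_state_dict.items()
        if k.startswith("transformer.")
    }
    transformer_state_dict = convert_unet_state_dict_to_peft(transformer_state_dict)
    set_peft_model_state_dict(transformer_, transformer_state_dict, adapter_name="default")

accelerator.register_load_state_pre_hook(load_model_hook)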

Member

Yeah, either we don't support intermediate checkpoints at all or we support them fully. I think it's okay to go without them, to keep things minimal.

cast_training_params([transformer], dtype=torch.float32) if args.mixed_precision == "fp16" else None

# Initialize tracking
accelerator.init_trackers("dreambooth-flux-dev-lora-alphonse-mucha", config=vars(args)) if accelerator.is_main_process else None
Member

Suggested change
- accelerator.init_trackers("dreambooth-flux-dev-lora-alphonse-mucha", config=vars(args)) if accelerator.is_main_process else None
+ if accelerator.is_main_process:
+     accelerator.init_trackers("dreambooth-flux-dev-lora-alphonse-mucha", config=vars(args))

Can we do it while creating the output folder?
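A sketch of that, assuming the script exposes args.output_dir:

import os

if accelerator.is_main_process:
    # Create the output folder and start the trackers in the same
    # main-process-only block.
    os.makedirs(args.output_dir, exist_ok=True)
    accelerator.init_trackers(
        "dreambooth-flux-dev-lora-alphonse-mucha", config=vars(args)
    )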

    init_lora_weights="gaussian",
    target_modules=["to_k", "to_q", "to_v", "to_out.0"],
)
transformer.add_adapter(transformer_lora_config)
Member

Should cast the LoRA params to FP32. Do you have a full run with this script that works without FP32 upcasting?

Author

I think I was already casting to FP32 below with
cast_training_params([transformer], dtype=torch.float32) if args.mixed_precision == "fp16" else None
(I'll probably move it over here and change it to match the original training script better; see the sketch below).

I do have a full run with this script that gives reasonable results without FP32 upcasting. But I noticed the loss curves are slightly different between the nano script (rare-voice-24 run) and the original script (fanciful-totem-2), so I will need to find where the discrepancy is coming from.
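A minimal sketch of that move, placing the upcast right after add_adapter as in the original training script (assumes args.mixed_precision is defined):

import torch
from diffusers.training_utils import cast_training_params

transformer.add_adapter(transformer_lora_config)

# Upcast only the trainable LoRA parameters to FP32 when training in fp16.
if args.mixed_precision == "fp16":
    cast_training_params([transformer], dtype=torch.float32)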

Member

If it doesn't affect results, it's probably okay.

@sayakpaul
Member

@bot /style

Contributor

Style fixes have been applied. View the workflow run here.
