
[Bug]: 1.7x DEV v1.7.0-329-g85bf2eb4 alphas_cumprod Downcast setting got lost and stuck on some models #14610

Open
1 of 6 tasks
ibrainventures opened this issue Jan 10, 2024 · 4 comments
Labels
bug Report of a confirmed bug

Comments

@ibrainventures
Contributor

ibrainventures commented Jan 10, 2024

Checklist

  • The issue exists after disabling all extensions
  • The issue exists on a clean installation of webui
  • The issue is caused by an extension, but I believe it is caused by a bug in the webui
  • The issue exists in the current version of the webui
  • The issue has not been reported before recently
  • The issue has been reported before but has not been fixed yet

What happened?

An image generated without downcast gets downcast after one run of the XYZ Plot (checkpoint change).

Steps to reproduce the problem

  1. Generate an image without downcast (default setting in the dev version)
  2. Save it
  3. Reuse it with the same subseed number
  4. Make an XYZ plot with this image and 2 models (including the origin model)
  5. Run the plot (it renders correctly)
  6. Run the generation without a script
  7. Now the image is generated WITH downcast
  8. Stop the webui service process / start the webui service process (not a UI reload)
  9. Generate the image -- now it is WITHOUT downcast

What should have happened?

The "without downcast" setting should also be respected after running an XYZ script.

What browsers do you use to access the UI ?

Mozilla Firefox

Sysinfo

sysinfo-2024-01-10-17-17.json

Console logs

Total progress: 100%|███████████████████████████████████████████████████████████████████████████████████████████████|

Additional information

No response

@ibrainventures ibrainventures added the bug-report Report of a bug, yet to be confirmed label Jan 10, 2024
@ibrainventures ibrainventures changed the title [Bug]: 1.7x DEV v1.7.0-329-g85bf2eb4 alphas_cumprod Downcast setting got lost after XYZ Plot [Bug]: 1.7x DEV v1.7.0-329-g85bf2eb4 alphas_cumprod Downcast setting got lost after a some checkpoints Jan 10, 2024
@ibrainventures ibrainventures changed the title [Bug]: 1.7x DEV v1.7.0-329-g85bf2eb4 alphas_cumprod Downcast setting got lost after a some checkpoints [Bug]: 1.7x DEV v1.7.0-329-g85bf2eb4 alphas_cumprod Downcast setting got lost and stuck on some models Jan 10, 2024
@ibrainventures
Contributor Author

After some tests, this issue only happens when starting with or switching to about 30% of my checkpoints:

some examples of problematic checkpoints (SD 1.5):

analogmadness_v70
juggernaut_reborn
cyberrealistic_v41

Starting with or switching to those models breaks the downcast function.

Starting with an "unproblematic" checkpoint (e.g. epiccartoon_v1), everything works fine (with or without downcast).
Switching to or starting with one of the "problematic" checkpoints breaks the downcast option. After that, the system is stuck in the "use_downcasted_alpha_bar true" mode. Only a restart of the webui process WITH an unproblematic checkpoint resolves the stuck state.

@ibrainventures
Contributor Author

ibrainventures commented Jan 10, 2024

Addendum:
It also gets stuck / frozen on the unproblematic checkpoints if I add the

"Downcast model alphas_cumprod to fp16 before sampling. For reproducing old seeds."

checkbox to the frontend and run an XYZ script with 2 models (checkpoint change) and the option set to true.

@Cyberbeing
Contributor

Cyberbeing commented Jan 11, 2024

I can confirm this issue with various other models as well.
I just ran into it when using Refiner with a couple of SD1.5-based models, no scripts.

I did some testing, and it seems this issue is caused by models without alphas_cumprod inheriting alphas_cumprod from the next model loaded, after #14145. Since some FP16-converted ckpt/safetensors models have alphas_cumprod in FP16 as well, this triggers the bug.
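The inheritance described above can be illustrated with a small, self-contained sketch (all names here are hypothetical and only simulate the load path with numpy arrays; the real webui keeps alphas_cumprod as a torch buffer on the model):

```python
import numpy as np

def load_checkpoint(state_dict, current_alphas=None):
    """Hypothetical sketch of the buggy load path: if the checkpoint
    has no alphas_cumprod key, the buffer left over from the previously
    loaded model is reused as-is, dtype and all."""
    if "alphas_cumprod" in state_dict:
        return state_dict["alphas_cumprod"]
    # Bug: inherit whatever the last model left behind (possibly fp16)
    return current_alphas

# Model A ships an FP16 alphas_cumprod (common in fp16-converted checkpoints)
model_a = {"alphas_cumprod": np.linspace(1.0, 0.01, 1000).astype(np.float16)}
# Model B ships no alphas_cumprod at all
model_b = {}

alphas = load_checkpoint(model_a)
print(alphas.dtype)  # float16

# Loading B after A: B silently inherits A's fp16 buffer,
# even though the downcast option is disabled
alphas = load_checkpoint(model_b, current_alphas=alphas)
print(alphas.dtype)  # float16
```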

I can reproduce the bug as follows:

  1. Disable Downcast model alphas_cumprod to fp16
  2. Load a model which does not contain alphas_cumprod in the ckpt/safetensors
  3. Exit Webui, Start Webui
  4. Do a generation (save the result for comparisons)
  5. Enable Downcast model alphas_cumprod to fp16
  6. Do a generation (save the result for comparisons)
  7. Disable Downcast model alphas_cumprod to fp16
  8. Do a generation (confirm the result matches 4.)
  9. Enable refiner with a model which contains alphas_cumprod with FP16 dtype
  10. Do a generation with refiner enabled
  11. Disable refiner
  12. Do a generation (the result now looks like 6. when it should look like 4., since the model has inherited FP16 alphas_cumprod from the refiner model)
  13. Enable refiner with a model which contains alphas_cumprod with FP32 dtype
  14. Do a generation with refiner enabled
  15. Disable refiner
  16. Do a generation (the result now looks like 4. since the model has inherited FP32 alphas_cumprod from the refiner model)

Another unrelated observation is that models containing an FP16 alphas_cumprod don't actually end up with an FP32-precision alphas_cumprod when Downcast model alphas_cumprod to fp16 is disabled (when enabled, only tiny details change, if any).

On the other hand, models containing an FP32 alphas_cumprod can show rather significant changes in output between Downcast model alphas_cumprod to fp16 enabled and disabled (i.e. I've seen major characteristics of people and objects change completely while maintaining nearly identical composition). It makes me realize I may need to go back and re-convert all my FP32 models to FP16 while keeping the true FP32-precision alphas_cumprod, to make the Downcast model alphas_cumprod to fp16 switch useful. Though if this bug is fixed, I could instead just delete the FP16 alphas_cumprod from the models. I can only assume LDM generates an FP32 alphas_cumprod automatically if it is missing from the model on load.
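For reference, regenerating alphas_cumprod from the noise schedule is straightforward and naturally produces full precision. A sketch, assuming the standard "scaled_linear" DDPM schedule that SD 1.5 configs use (beta_start=0.00085, beta_end=0.012, 1000 steps); it also shows why the fp16 downcast is lossy:

```python
import numpy as np

# Standard SD 1.5 noise-schedule parameters (from the LDM configs)
T, beta_start, beta_end = 1000, 0.00085, 0.012

# "scaled_linear" schedule: betas are linear in sqrt-space
betas = np.linspace(beta_start ** 0.5, beta_end ** 0.5, T, dtype=np.float64) ** 2
alphas_cumprod = np.cumprod(1.0 - betas).astype(np.float32)

print(alphas_cumprod.dtype)  # float32

# Round-tripping through fp16 (what the downcast option effectively does)
# does not recover the original fp32 values exactly:
roundtrip = alphas_cumprod.astype(np.float16).astype(np.float32)
max_err = float(np.max(np.abs(alphas_cumprod - roundtrip)))
print(max_err > 0)  # True
```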

@catboxanon catboxanon added bug Report of a confirmed bug and removed bug-report Report of a bug, yet to be confirmed labels Jan 12, 2024
@Cyberbeing
Contributor

Cyberbeing commented Jan 13, 2024

I did a bit more testing today, and discovered another way to trigger this issue.

  1. Disable Downcast model alphas_cumprod to fp16
  2. Load a model which does not contain alphas_cumprod in the ckpt/safetensors
  3. Exit Webui, Start Webui
  4. Do a generation (save the result for comparisons)
  5. Enable Downcast model alphas_cumprod to fp16
  6. Do a generation (save the result for comparisons)
  7. Disable Downcast model alphas_cumprod to fp16
  8. Switch to a model which does not contain alphas_cumprod with FP16 dtype
  9. Do a generation (ignore results, though at this point this model is also stuck with fp16 alphas_cumprod)
  10. Switch back to the original model
  11. Do a generation (the results will now match 6., when they should look like 4.)

Edit: It seems this switching issue occurs even with models containing an fp32 alphas_cumprod.

What this tells me is that, in this case, the dtype of alphas_cumprod is being inherited from the alphas_cumprod dtype of your last generation prior to switching models, rather than from the model itself. Expected behavior would be for alphas_cumprod to return to FP32 when Downcast model alphas_cumprod was disabled in step 7, even if you didn't perform a generation prior to switching models. Notably, the issue doesn't occur in this case if I do a generation between steps 7 and 8. It only occurs when the Downcast model alphas_cumprod to fp16 state has been changed but the effect of the change has not been triggered prior to the model switch.

It would appear this issue is caused in part by reuse_model_from_already_loaded() never calling load_model() when shared.opts.sd_checkpoints_limit is exceeded. So if you are unable to reproduce the bug, try setting shared.opts.sd_checkpoints_limit = 1. Without load_model() being called, alphas_cumprod can get stuck as float16 on model switch under certain conditions:

elif len(model_data.loaded_sd_models) > 0:
    sd_model = model_data.loaded_sd_models.pop()
    model_data.sd_model = sd_model

    sd_vae.base_vae = getattr(sd_model, "base_vae", None)
    sd_vae.loaded_vae_file = getattr(sd_model, "loaded_vae_file", None)
    sd_vae.checkpoint_info = sd_model.sd_checkpoint_info

    print(f"Reusing loaded model {sd_model.sd_checkpoint_info.title} to load {checkpoint_info.title}")
    return sd_model
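One possible direction for a fix, sketched below with entirely hypothetical names (this is not the actual webui patch): always derive the sampling buffer from a pristine FP32 copy and re-apply the current use_downcasted_alpha_bar setting whenever a model is reused, so a stale fp16 buffer can never survive an option toggle or a model switch.

```python
import numpy as np

def apply_alphas_cumprod_setting(alphas_fp32, use_downcast):
    """Hypothetical fix sketch: derive the buffer that sampling sees
    from an untouched fp32 master copy on every (re)load, instead of
    trusting whatever dtype the previous generation left behind."""
    target = np.float16 if use_downcast else np.float32
    return alphas_fp32.astype(target)

# fp32 source of truth, kept alongside the model
master = np.linspace(1.0, 0.01, 1000).astype(np.float32)

buf = apply_alphas_cumprod_setting(master, use_downcast=True)
print(buf.dtype)  # float16

# Reuse path: re-apply the current option rather than reusing the old buffer
buf = apply_alphas_cumprod_setting(master, use_downcast=False)
print(buf.dtype)  # float32
```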
