
make enable_sequential_cpu_offload more generic for third-party devices #4191


Merged

merged 2 commits into huggingface:main on Jul 21, 2023

Conversation

@ji-huazhong (Contributor) commented on Jul 21, 2023

What does this PR do?

This PR makes enable_sequential_cpu_offload more generic for third-party devices.


I noticed that in #4114, enable_sequential_cpu_offload was refactored to be more generic for other devices.
But inside enable_sequential_cpu_offload we call torch.cuda.empty_cache to release all unoccupied cached memory, which has no effect on other devices (such as XPU):

```python
if self.device.type != "cpu":
    self.to("cpu", silence_dtype_warnings=True)
    torch.cuda.empty_cache()  # otherwise we don't see the memory savings (but they probably exist)
```

We could replace torch.cuda.empty_cache() with another backend's implementation from outside the pipeline, like

```diff
+ torch.cuda.empty_cache = torch.xpu.empty_cache
  device = torch.device("xpu")
  pipeline.enable_sequential_cpu_offload(device=device)
```

but that looks a little weird.

I think a better way is to:

  • get the torch device module according to the device type first, and
  • then call that module's empty_cache method.

Now we can use enable_sequential_cpu_offload conveniently with XPU, like:

```python
device = torch.device("xpu")
pipeline.enable_sequential_cpu_offload(device=device)
```

Before submitting

Who can review?

@patrickvonplaten and @sayakpaul

@HuggingFaceDocBuilderDev commented on Jul 21, 2023

The documentation is not available anymore as the PR was closed or merged.

@patrickvonplaten (Contributor) left a comment

Works for me!

@pcuenca (Member) left a comment

Nice!

@sayakpaul sayakpaul merged commit e2bbaa4 into huggingface:main Jul 21, 2023
orpatashnik pushed a commit to orpatashnik/diffusers that referenced this pull request Aug 1, 2023
make enable_sequential_cpu_offload more generic for third-party devices (huggingface#4191)

* make enable_sequential_cpu_offload more generic for third-party devices

* make style
yoonseokjin pushed a commit to yoonseokjin/diffusers that referenced this pull request Dec 25, 2023
make enable_sequential_cpu_offload more generic for third-party devices (huggingface#4191)

* make enable_sequential_cpu_offload more generic for third-party devices

* make style
AmericanPresidentJimmyCarter pushed a commit to AmericanPresidentJimmyCarter/diffusers that referenced this pull request Apr 26, 2024
make enable_sequential_cpu_offload more generic for third-party devices (huggingface#4191)

* make enable_sequential_cpu_offload more generic for third-party devices

* make style

5 participants