
Can the coefficients of CogVideoX-5B be used in CogVideoX1.5-5B? #24

Closed
zishen-ucap opened this issue Jan 14, 2025 · 5 comments

@zishen-ucap
Contributor

Hello,

Thank you for your amazing work on the CogVideo series!

I noticed the coefficients for CogVideoX-5B (as shown in the attached image) and wanted to ask if they can be directly applied to CogVideoX1.5-5B, or if any adjustments are needed?
[Attached image: coefficients for CogVideoX-5B]

Looking forward to your response. Thanks again!

@LiewFeng
Collaborator

Hi @zishen-ucap, thank you for your interest in our work. I'm not sure about that. You can try it and share some results here. If it doesn't work well, you can follow issue #20 to obtain new coefficients.

@zishen-ucap
Contributor Author

Thanks for your suggestion! I tried it out, and with a negligible subjective quality drop, the sampling time decreased significantly, from 475 s to 260 s. Here are the results:

cogvideo15_teacache.-._20250114_15475606.mp4
final_output13_20250114_15485175.mp4

I tested five different prompt sets and noticed that the residual replacement always occurs at the same inference_steps (e.g., [2, 14, 19, ...]). I’m curious, in your experiments with other models, did you observe a similar pattern, or is this behavior unique to CogVideoX?

If residual replacement tends to happen at fixed inference_steps, would it be feasible to treat these steps as priors to further accelerate video generation?
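
For concreteness, here is a rough sketch of what I mean by treating fixed steps as a prior; the step indices, class, and function names below are hypothetical illustrations, not the actual TeaCache code:

```python
import torch

# Hypothetical prior: step indices whose transformer output would be replaced
# by the cached residual (values here are illustrative only).
CACHED_STEPS = {2, 14, 19}

class FixedScheduleCache:
    """Reuse the previous residual on a fixed set of steps instead of
    recomputing the indicator-based decision at every step."""

    def __init__(self, cached_steps):
        self.cached_steps = set(cached_steps)
        self.previous_residual = None

    def forward(self, step_idx: int, hidden_states: torch.Tensor, run_transformer_blocks):
        # On a cached step, skip the transformer blocks and re-apply the
        # stored residual; otherwise compute normally and refresh the cache.
        if step_idx in self.cached_steps and self.previous_residual is not None:
            return hidden_states + self.previous_residual
        output = run_transformer_blocks(hidden_states)
        self.previous_residual = output - hidden_states
        return output
```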

Looking forward to your insights!

@LiewFeng
Collaborator

For CogVideoX, we find that the timestep embedding shows a stronger correlation with the model output, so we use the timestep embedding to decide which steps to cache. Since the timestep embedding is the same for all prompts, the same timesteps will be cached for a given threshold; different thresholds will select different timesteps to cache. For most other models, we find that the timestep-embedding-modulated noisy input shows a stronger correlation with the model output, and we use that to decide which timesteps to cache.
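
As a rough illustration of the decision rule (a simplified sketch under stated assumptions, not the exact TeaCache implementation; the rescaling function and threshold value are placeholders):

```python
import torch

class CachingDecider:
    """Simplified sketch: accumulate the relative L1 change of an indicator
    (the timestep embedding for CogVideoX, or the timestep-embedding-modulated
    noisy input for most models) and skip the transformer blocks while the
    accumulated change stays below a threshold."""

    def __init__(self, threshold: float, rescale_fn=None):
        self.threshold = threshold
        # Optionally rescale the raw distance (e.g. with fitted polynomial
        # coefficients); identity by default, which is an assumption here.
        self.rescale_fn = rescale_fn or (lambda x: x)
        self.prev_indicator = None
        self.accumulated = 0.0

    def should_compute(self, indicator: torch.Tensor) -> bool:
        if self.prev_indicator is None:
            self.prev_indicator = indicator
            return True  # always compute the first step
        rel_l1 = ((indicator - self.prev_indicator).abs().mean()
                  / self.prev_indicator.abs().mean()).item()
        self.accumulated += self.rescale_fn(rel_l1)
        self.prev_indicator = indicator
        if self.accumulated < self.threshold:
            return False  # reuse the cached residual on this step
        self.accumulated = 0.0  # reset after a full computation
        return True
```

Because the indicator for CogVideoX does not depend on the prompt, this rule selects the same steps for every prompt at a fixed threshold.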

@zishen-ucap
Contributor Author

I see! Thank you for the detailed answer and for sharing.

@LiewFeng
Collaborator

Feel free to open a PR to support CogVideoX1.5-5B if it's convenient.
