
The 4th order coefficient of FLUX does not show a clear relationship between the output_diff and the predicted_output_diff #20

Closed

phyllispeng123 opened this issue Jan 7, 2025 · 8 comments

phyllispeng123 commented Jan 7, 2025

[Fig 1]
[Fig 2]

I used 400 prompts from https://huggingface.co/datasets/k-mktr/improved-flux-prompts to generate 400 pairs of (modulated_input_diff, output_diff); each has 49 values per prompt (one per pair of adjacent steps), since I use the following hyperparameters (a generation sketch follows):

num_inference_steps = 50, 
guidance_scale=3.5, 
max_sequence_length=256, 
generator=torch.Generator(device).manual_seed(42)
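
For context, the generation setup is roughly the following (a minimal sketch assuming the diffusers FluxPipeline and the FLUX.1-dev checkpoint; the original post names neither, and the diffs would be recorded inside the transformer's forward pass, e.g. via a hook):

import torch
from diffusers import FluxPipeline

device = "cuda"
# Assumed checkpoint; any FLUX checkpoint with this architecture would do for the measurement.
pipe = FluxPipeline.from_pretrained("black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16).to(device)

prompt = "..."  # one of the 400 prompts from the dataset above
image = pipe(
    prompt,
    num_inference_steps=50,          # 50 steps -> 49 adjacent-step diffs per prompt
    guidance_scale=3.5,
    max_sequence_length=256,
    generator=torch.Generator(device).manual_seed(42),
).images[0]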

The result is satisfying in that modulated_input_diff and output_diff from my 400 generated pairs consistently show a stable, close relationship across different prompts (Fig 2). However, I ran into some problems when I used the 4th-order coefficients provided in ./TeaCache4FLUX/teacache_flux.py:

  1. I don't see an obvious relationship in either log(output_diff) vs. log(predicted_output_diff) or output_diff vs. predicted_output_diff using my own data (Fig 1).
  2. I did the 4th-order polynomial fit with my own data and got different coefficients, [-34.84608751, -10.79323838, 16.39479138, -1.21976726, 0.12762022], but they also show a poor relationship.
  3. I find that the L1 loss between output_diff and predicted_output_diff decreases as the order of the fit increases (I tried orders 1 to 10; a sketch of this order sweep follows the code below).

The code is displayed below; I wonder if I am doing something wrong? (BTW, the TeaCache speed-up and quality are marvelous in both FLUX and HunyuanVideo!)

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

input_diff = pd.read_csv('./input_diff.csv')    #### modulated input diff csv, shape = (400, 49)
output_diff = pd.read_csv('./output_diff.csv')  #### output diff csv, shape = (400, 49)
x = input_diff.mean()   #### mean over the 400 prompts, one value per step
y = output_diff.mean()
coefficients = [4.98651651e+02, -2.83781631e+02, 5.58554382e+01, -3.82021401e+00, 2.64230861e-01]
rescale_func = np.poly1d(coefficients)
ypred = rescale_func(x)
plt.clf()
plt.figure(figsize=(8, 8))
plt.plot(np.log(x), np.log(y), '*', label='log original values', color='green')
plt.plot(np.log(x), np.log(ypred), '.', label='log polyfit values', color='blue')
plt.xlabel('4th order true fitting')
plt.legend(loc=4)
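
For point 3, the order sweep was along these lines (a minimal sketch; the CSV file names and the use of plain mean absolute error as the "L1 loss" are assumptions):

import numpy as np
import pandas as pd

input_diff = pd.read_csv('./input_diff.csv')    #### modulated input diff csv, shape = (400, 49)
output_diff = pd.read_csv('./output_diff.csv')  #### output diff csv, shape = (400, 49)
x, y = input_diff.mean(), output_diff.mean()

for order in range(1, 11):
    #### fit a polynomial of the given order and measure the L1 error of its predictions
    ypred = np.poly1d(np.polyfit(x, y, order))(x)
    print(f"order {order}: L1 = {np.abs(y - ypred).mean():.6f}")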
LiewFeng (Collaborator) commented Jan 7, 2025

There may be some differences between our implementations.

  1. We use relative L1 loss instead of plain L1 loss; not sure which you are using.
  2. The output in our setting is the residual output, i.e., output hidden states - input hidden states, rather than the output hidden states themselves, since we cache the residual output (sketched below).
  3. The coeff is calculated under the 28-step setting. It should work well for the 50-step setting.
  4. The output hidden states are the ones before being normed.
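
In code, the two quantities being related read roughly as follows (a minimal sketch; the variable names and dummy shapes are placeholders, not the exact code in teacache_flux.py):

import torch

def rel_l1(cur, prev):
    # Relative L1 distance: mean absolute change, normalized by the previous step's magnitude.
    return (cur - prev).abs().mean() / prev.abs().mean()

# Dummy tensors standing in for hidden states at two adjacent denoising steps
# (shapes are arbitrary placeholders, not FLUX's real dimensions).
inp, prev_inp = torch.randn(1, 256, 64), torch.randn(1, 256, 64)   # input hidden states
out, prev_out = torch.randn(1, 256, 64), torch.randn(1, 256, 64)   # output hidden states (before norm)
mod, prev_mod = torch.randn(1, 256, 64), torch.randn(1, 256, 64)   # modulated inputs

# Point 2: the quantity that is cached and fitted is the residual output,
# i.e. output hidden states minus input hidden states.
res, prev_res = out - inp, prev_out - prev_inp

input_diff = rel_l1(mod, prev_mod)     # x of the polynomial fit
output_diff = rel_l1(res, prev_res)    # y of the polynomial fit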

phyllispeng123 (Author) commented Jan 7, 2025

There may be some differences between our implementations.

  1. We use relative L1 loss instead of plain L1 loss; not sure which you are using.
  2. The output in our setting is the residual output, i.e., output hidden states - input hidden states, rather than the output hidden states themselves, since we cache the residual output.
  3. The coeff is calculated under the 28-step setting. It should work well for the 50-step setting.
  4. The output hidden states are the ones before being normed.

  1. I used equation (4) from the paper as my L1 loss, then computed output_diff = L1_loss(output hidden states, previous output hidden states) and input_diff = L1_loss(input hidden states, previous input hidden states).
  2. Does that mean the "model output diff" in the paper always refers to the residual output? Do Fig. 3 and Fig. 5 in the paper also show the relation between L1_loss(input hidden states, previous input hidden states) and L1_loss(output hidden states, input hidden states)? (With the loss denominators being the previous input hidden states and the input hidden states, respectively?) I can't find where you define the model output diff, and I had assumed it should be the difference between the output hidden states and the previous output hidden states.
  3. Thanks for your confirmation!
  4. Yes, I use the output hidden states as the model output, and they are before being normed.

LiewFeng (Collaborator) commented Jan 7, 2025

L1_rel(modulated input, previous modulated input) and L1_rel(residual output, previous residual output)

phyllispeng123 (Author) commented Jan 7, 2025

L1_rel(modulated input, previous modulated input) and L1_rel(residual output, previous residual output)

[Screenshot 2025-01-07 15:41:57]

OK!! Now I get the point!
I regenerated 100 pairs of (modulated input diff, residual output diff) as you described above, using prompts from https://huggingface.co/datasets/k-mktr/improved-flux-prompts, and got very different coefficients, [-76.48384686, 15.27823855, 11.35678576, -0.87895694, 0.12150872], whereas yours are [4.98651651e+02, -2.83781631e+02, 5.58554382e+01, -3.82021401e+00, 2.64230861e-01]. I wonder whether the difference is due to the dataset? Could you share which dataset, and how many prompts, you used to generate the residual output and modulated input? Did you make any further adjustments to the coefficients?

My way of getting the coefficients is shown below:

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

def find_coefficient():
    output_diff = pd.read_csv('./output_diff.csv')  ### residual output diff csv, shape=(100, 49): 100 prompts, 50 inference steps -> 49 diffs each
    input_diff = pd.read_csv('./input_diff.csv')    ### modulated input diff csv, shape=(100, 49)

    #### take the mean over the 100 prompts
    x = input_diff.mean()
    y = output_diff.mean()

    #### 4th order fit with my own data
    coefficients = np.polyfit(x, y, 4)
    rescale_func = np.poly1d(coefficients)
    ypred = rescale_func(x)
    plt.clf()
    plt.figure(figsize=(8, 8))
    plt.plot(np.log(x), np.log(y), '*', label='log residual output diff values', color='green')
    plt.plot(np.log(x), np.log(ypred), '.', label='log polyfit values', color='blue')
    plt.xlabel('log input_diff')
    plt.ylabel('log residual_output_diff')
    plt.ylim(-3, 1)
    plt.legend(loc=4)
    plt.title('4th order My Polynomial fitting')
    plt.tight_layout()
    plt.savefig('residual_polynomial_fitting_log.png')

    #### 4th order fit using the TeaCache coefficients
    coefficients = [4.98651651e+02, -2.83781631e+02, 5.58554382e+01, -3.82021401e+00, 2.64230861e-01]
    rescale_func = np.poly1d(coefficients)
    ypred = rescale_func(x)
    plt.clf()
    plt.figure(figsize=(8, 8))
    plt.plot(np.log(x), np.log(y), '*', label='log residual output diff values', color='green')
    plt.plot(np.log(x), np.log(ypred), '.', label='log polyfit values', color='blue')
    plt.xlabel('log input_diff')
    plt.ylabel('log residual_output_diff')
    plt.legend(loc=4)
    plt.ylim(-3, 1)
    plt.title('4th order TeaCache Polynomial fitting')
    plt.tight_layout()
    plt.savefig('residual_polynomial_fitting_loggt.png')

LiewFeng (Collaborator) commented Jan 7, 2025

70 prompts from here.

Maybe you can try with 28 inference steps.

LiewFeng (Collaborator) commented Jan 9, 2025

Closed due to inactivity. Feel free to reopen it if necessary.

hkunzhe commented Jan 20, 2025

L1_rel(modulated input, previous modulated input) and L1_rel(residual output, previous residual output)

The relative L1 distance should be

relative_l1_distance = (torch.abs(prev - cur).mean()) / torch.abs(prev).mean()

Is it correct? @LiewFeng

LiewFeng (Collaborator)

@hkunzhe Yes.
