There's an issue when training a Llava model on the latest transformers (4.49.0.dev0) where `requires_grad_pre_hook` fails on the `CLIPEncoder`, specifically on the module `model.base_model.model.vision_tower.vision_model.encoder`.

Upon inspection, this happens because the `inputs` passed to the hook are empty. That suggests a larger issue: the `CLIPEncoder` appears to never receive any input.

I tested bypassing the hook error, and the training results are very bad.
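For context, a PyTorch forward pre-hook registered without `with_kwargs=True` only receives positional arguments, so an empty `inputs` tuple does not necessarily mean the module was called with nothing; the inputs may be arriving as keyword arguments instead. Below is a minimal diagnostic sketch (the checkpoint name and the non-PEFT module path are assumptions for illustration, not taken from the report) that logs what the encoder actually receives:

```python
import torch
from transformers import LlavaForConditionalGeneration

# Assumed checkpoint for illustration; substitute the model actually being trained.
model = LlavaForConditionalGeneration.from_pretrained(
    "llava-hf/llava-1.5-7b-hf", torch_dtype=torch.float16
)

# Module path from the report, minus the PEFT `base_model.model` prefix,
# since no adapter is attached here.
encoder = model.vision_tower.vision_model.encoder

def debug_pre_hook(module, args, kwargs):
    # With with_kwargs=True the hook also sees keyword arguments, which is
    # where the hidden states may arrive even when `args` is empty.
    print("positional args:", [type(a).__name__ for a in args])
    print("keyword args:", list(kwargs.keys()))

handle = encoder.register_forward_pre_hook(debug_pre_hook, with_kwargs=True)

# Run one forward pass with real training inputs to see what the encoder
# actually receives, then detach the hook with handle.remove().
```

If the keyword arguments show up there while `args` stays empty, the problem is more likely in how the pre-hook reads its inputs than in the data flow into `CLIPEncoder` itself.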