Error while training #3
Comments
Hi, thanks for raising this issue.
Thanks!
In the training script, if you want to train the model with only 4 GPUs, start the deepspeed job with
Hi, dear author, I'm wondering whether there will be evaluation scripts for the other datasets reported in the paper, e.g. Refer-DAVIS-17 and your proposed ReasonVOS dataset. Thanks!
Hi @Lexarymade, we are actively working on organizing the data and evaluation scripts. As for Ref-DAVIS-17, its evaluation is a bit complicated because it relies on a separate evaluation toolkit. We will put together instructions on how to evaluate it soon.
Hi, have you tried training the model on A100s? I find that training may take 8 days on 8 A100s. Is flash-attention not used in this project?
Hi @yahooo-m, when developing this project, we did not investigate an implementation with flash-attn. It seems that Phi-3 series models do not automatically trigger flash-attn unless it is explicitly specified. This seems to be a typical issue according to this and this. To use flash-attn, you can modify the training script:
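A minimal sketch of one way to request flash-attn, assuming the model is loaded through Hugging Face transformers' `from_pretrained`; this is not necessarily the change the maintainers had in mind. The helper name `attn_kwargs` is hypothetical, while `attn_implementation="flash_attention_2"` is the standard transformers argument for this:

```python
import importlib.util


def attn_kwargs(prefer_flash_attn: bool = True) -> dict:
    """Build extra keyword arguments for a `from_pretrained(...)` call.

    Hypothetical helper: it only asks for FlashAttention-2 when the
    `flash_attn` package can be imported, and otherwise returns no
    override so the library keeps its default attention backend.
    """
    has_flash_attn = importlib.util.find_spec("flash_attn") is not None
    if prefer_flash_attn and has_flash_attn:
        # FlashAttention-2 also needs fp16 or bf16 weights on a supported GPU.
        return {"attn_implementation": "flash_attention_2"}
    return {}


if __name__ == "__main__":
    # Assumed usage inside the training script, e.g.:
    #   model = SomeModelClass.from_pretrained(model_path, **attn_kwargs())
    print(attn_kwargs())
```

Passing the argument explicitly avoids relying on the model picking flash-attn on its own, which is the behavior described above for the Phi-3 series.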
Hi, when running train_joint.py, I hit the error "TypeError: VideoLISAForCausalLM.model_forward() missing 1 required positional argument: 'dense_indices'". I checked input_dict, and it does not contain this key: ['image_paths', 'images', 'images_clip', 'input_ids', 'labels', 'attention_masks', 'masks_list', 'label_list', 'valid_indices', 'resize_list', 'offset', 'questions_list', 'sampled_classes_list', 'inference', 'conversation_list'].
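For anyone debugging the same error, here is a small sketch of how to list which required arguments a forward function expects but the batch does not provide. The `model_forward` below is a stand-in with an assumed signature, not the real `VideoLISAForCausalLM.model_forward`; only the batch keys are taken from the report above:

```python
import inspect


def missing_required_args(func, batch):
    """Return the names of required parameters of `func` that are absent from `batch`."""
    missing = []
    for name, param in inspect.signature(func).parameters.items():
        if name == "self":
            continue
        if param.kind in (param.VAR_POSITIONAL, param.VAR_KEYWORD):
            continue  # *args / **kwargs never count as missing
        if param.default is inspect.Parameter.empty and name not in batch:
            missing.append(name)
    return missing


# Stand-in for the real method; the signature is an assumption for illustration.
def model_forward(images, images_clip, input_ids, dense_indices, **kwargs):
    ...


# Keys reported in input_dict.
batch_keys = [
    "image_paths", "images", "images_clip", "input_ids", "labels",
    "attention_masks", "masks_list", "label_list", "valid_indices",
    "resize_list", "offset", "questions_list", "sampled_classes_list",
    "inference", "conversation_list",
]
batch = dict.fromkeys(batch_keys)

print(missing_required_args(model_forward, batch))  # ['dense_indices']
```

Running the same check against the real method and a real batch from the data loader would show whether the collate function is producing a batch without `dense_indices`.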