
crossencoder training on multigpu #2991

Open
riyajatar37003 opened this issue Oct 16, 2024 · 2 comments

@riyajatar37003

Hi,

I can see how to train an embedding model with the brand new trainer, but I couldn't find the same for the CrossEncoder.
Can you point me to it if I'm missing something?

Thanks
https://sbert.net/docs/sentence_transformer/training_overview.html
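For reference, training an embedding model with the trainer from that page looks roughly like this. A minimal sketch; the model name, dataset, and hyperparameters are placeholders for illustration, not taken from this thread:

```python
from datasets import load_dataset
from sentence_transformers import (
    SentenceTransformer,
    SentenceTransformerTrainer,
    SentenceTransformerTrainingArguments,
)
from sentence_transformers.losses import MultipleNegativesRankingLoss

# Placeholder model and dataset; any (anchor, positive) pair dataset works
model = SentenceTransformer("microsoft/mpnet-base")
train_dataset = load_dataset("sentence-transformers/all-nli", "pair", split="train")
loss = MultipleNegativesRankingLoss(model)

args = SentenceTransformerTrainingArguments(
    output_dir="output/mpnet-base-all-nli",
    num_train_epochs=1,
    per_device_train_batch_size=32,
    save_steps=1000,  # checkpoint every 1000 steps
)

trainer = SentenceTransformerTrainer(
    model=model,
    args=args,
    train_dataset=train_dataset,
    loss=loss,
)
trainer.train()
```

Because SentenceTransformerTrainer builds on the Hugging Face Trainer, multi-GPU training works by launching the same script with e.g. `torchrun --nproc_per_node=4 train.py`.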

@riyajatar37003
Author

https://sbert.net/docs/package_reference/cross_encoder/cross_encoder.html#training-inputs
In this code, how can I save checkpoints after every k training steps?

@tomaarsen
Collaborator

The CrossEncoder training hasn't been updated to use the new Trainer yet; that's still on my TODO list, I'm afraid.
There's currently no very convenient way to save checkpoints every k steps. The easiest option is to subclass CrossEncoder, copy the fit method, and add a line like `if training_steps % 1000 == 0: self.save(output_path + str(training_steps))`, or something along those lines.
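A minimal sketch of that workaround, inlining the relevant part of fit as a plain loop instead of subclassing. The model name, data, loss, and optimizer setup are illustrative assumptions; it assumes the pre-Trainer CrossEncoder API with num_labels=1:

```python
import torch
from torch.utils.data import DataLoader

from sentence_transformers import InputExample
from sentence_transformers.cross_encoder import CrossEncoder

# Illustrative data: (query, passage) pairs with a float relevance label
train_samples = [InputExample(texts=["example query", "example passage"], label=1.0)] * 10_000

device = "cuda" if torch.cuda.is_available() else "cpu"
model = CrossEncoder("distilroberta-base", num_labels=1, device=device)
model.model.to(device)  # fit() normally does this internally

train_dataloader = DataLoader(train_samples, shuffle=True, batch_size=16)
# fit() assigns this collate function to tokenize the text pairs
train_dataloader.collate_fn = model.smart_batching_collate

loss_fct = torch.nn.BCEWithLogitsLoss()  # fit() default for num_labels=1
optimizer = torch.optim.AdamW(model.model.parameters(), lr=2e-5)

output_path = "output/cross-encoder-checkpoints"
training_steps = 0
model.model.train()
for epoch in range(1):
    for features, labels in train_dataloader:
        logits = model.model(**features, return_dict=True).logits.view(-1)
        loss = loss_fct(logits, labels)
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()

        training_steps += 1
        # The suggested addition: save a checkpoint every 1000 optimization steps
        if training_steps % 1000 == 0:
            model.save(f"{output_path}/step-{training_steps}")
```

The full fit method additionally handles warmup scheduling, gradient clipping, evaluation, and mixed precision, which are omitted here for brevity; a subclass that copies fit would add the same `if training_steps % 1000 == 0` check inside its training loop.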

Apologies. I'll improve CrossEncoder support in the coming months.

  • Tom Aarsen
