Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Prostt5 ctranslate2 splitting long sequenes #327

Open
wants to merge 23 commits into
base: prostt5-ctranslate2
Choose a base branch
from

Conversation

mpjw
Copy link

@mpjw mpjw commented Aug 14, 2024

Functionality to predict AA sequences that are longer than ProstT5 attention with a split-wise approach.
Sequences will be split if longer than --prostt5-split-length (default 6000, deactivate with 0), and the prediction of splits will be concatenated.
Test cases include one file with a 500 split length and one with a 6000 split length.
Additionally, a conda environment was created for compiling this branch with CUDA.

Marcus Wagner added 23 commits August 6, 2024 21:13
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant