We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Currently, proteinlanguage raises an error if it encounters an unknown token at runtime:
torch.tensor(token_indexes, dtype=self.dtype, device=self.device) TypeError: an integer is required (got type str)
iterate_dataset
The text was updated successfully, but these errors were encountered:
fix: protein-language unknown token handling (fixes ##139)
e387bac
Fix proteinlanguage handling (#146)
9fa788f
* feat: crawlers now convert unicode to ascii [skip ci] * fix: protein-language unknown token handling (fixes ##139) * feat: protein_sequence dataset detects unknown tokens at construction time if iterate_dataset is passed * fix: codiga style * refactor: selfies>=2 version bump
Successfully merging a pull request may close this issue.
Currently, proteinlanguage raises an error if it encounters an unknown token at runtime:
iterate_dataset
is True, this issue should be detected at object construction (not the case currently)The text was updated successfully, but these errors were encountered: