Creating a TextDataset that contains None items should be prevented.
If you create such a dataset, which does not make sense but is currently possible, you end up with a strange error during SetFit training:
<...>
    encodings = self._tokenizer.encode_batch(
TypeError: TextEncodeInput must be Union[TextInputSequence, Tuple[InputSequence, InputSequence]]
This is not really a bug, but we can check for None items up front and raise a more helpful error.
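A minimal sketch of what such a check could look like, written as a standalone function. The name `check_texts_not_none` and the wording of the error message are assumptions made for illustration only; they are not taken from the library, and the actual fix may be implemented differently (for example, inside the TextDataset constructor).

```python
from typing import Optional, Sequence


def check_texts_not_none(texts: Sequence[Optional[str]]) -> None:
    """Raise a ValueError if any item in `texts` is None.

    Hypothetical helper: intended to run when a dataset is created, so the
    problem surfaces immediately rather than later inside the tokenizer.
    """
    none_indices = [i for i, text in enumerate(texts) if text is None]
    if none_indices:
        # Show at most the first five offending indices to keep the message short.
        shown = ", ".join(str(i) for i in none_indices[:5])
        suffix = ", ..." if len(none_indices) > 5 else ""
        raise ValueError(
            f"Texts must not contain None "
            f"(found {len(none_indices)} None item(s) at index: {shown}{suffix})."
        )


# Usage: valid input passes silently, invalid input fails early.
check_texts_not_none(["first text", "second text"])

try:
    check_texts_not_none(["first text", None])
except ValueError as error:
    print(error)
```

Failing at dataset creation points directly at the offending items, whereas the tokenizer's TypeError only appears during training and does not mention None at all.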
Reported by @eisioriginal
Prevent TextDataset objects from containing None (#73)
1d38af8
Signed-off-by: Christopher Schröder <chschroeder@users.noreply.github.com>