Skip to content

Include "<pad>" as possible padding token #15

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

DennisCraandijk
Copy link

@DennisCraandijk DennisCraandijk commented Jan 6, 2025

Various prominent models use "<pad>'' as padding token rather than "[PAD]". This change checks which of the two is used and gets the corresponding padding id before passing this to the StaticModelFineTuner

Various prominent models use "<pad>'' as padding token rather than "[PAD]". This change checks which of the two is used and gets the corresponding padding id before passing this to the StaticModelFineTuner
@Pringled
Copy link
Member

Pringled commented Jan 9, 2025

Hi @DennisCraandijk, thanks for creating this PR! We will do a bit of testing ourselves and then merge and release this feature (might take a bit longer than usual as we are finishing up a release for one of our other projects).

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants