Skip to content

Cleanxml #1259

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Merged
merged 5 commits into from
Mar 20, 2022
Merged

Cleanxml #1259

merged 5 commits into from
Mar 20, 2022

Conversation

AngledLuffa
Copy link
Contributor

Add cleanxml as an option to the tokenize annotator.

Will do ssplit next

@AngledLuffa AngledLuffa force-pushed the cleanxml branch 3 times, most recently from 7ca5fdc to 1742849 Compare March 16, 2022 21:49
@AngledLuffa AngledLuffa force-pushed the cleanxml branch 2 times, most recently from aa71022 to e57afc1 Compare March 16, 2022 22:30
@AngledLuffa AngledLuffa force-pushed the cleanxml branch 2 times, most recently from 55595d3 to 4011cfa Compare March 17, 2022 20:00
For the TokenizerAnnotatorITest, this uppdates the behavior of the test
to reflect that the newlines are now being consumed by the inner ssplit
@AngledLuffa AngledLuffa merged commit b0d1e46 into dev Mar 20, 2022
@AngledLuffa AngledLuffa deleted the cleanxml branch March 20, 2022 01:42
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant