Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Fix bugs in the normalize module #53

Merged
merged 11 commits into from
Jan 12, 2019
Merged

Fix bugs in the normalize module #53

merged 11 commits into from
Jan 12, 2019

Conversation

Changaco
Copy link
Member

Correction de divers bugs et lacunes dans le processus de normalisation des titres de textes.

A spellchecker can't tell that "EXPOSE DES MOTIFS" should be "Exposé des motifs" and not "Expose des motifs". A more advanced natural language processing algorithm could, but it couldn't guarantee 100% accuracy either.
Example: "Loi de finances rectificative pour 1964 (n° 64-1278 du 23 décembre 1964)"
"Décret du n° 2017-1681 du 13 décembre 2017 authentifiant […]"
"Décret n°2003-730 du 1 août 2003 2003-730 du 1er août 2003 modifiant […]"
@Changaco Changaco merged commit 7dacb05 into master Jan 12, 2019
@Changaco Changaco deleted the fix-normalize branch January 12, 2019 13:50
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant