Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Singularize english words (R pluralize) #177

Open
francoisfauteux opened this issue Aug 13, 2021 · 0 comments
Open

Singularize english words (R pluralize) #177

francoisfauteux opened this issue Aug 13, 2021 · 0 comments

Comments

@francoisfauteux
Copy link

Testing with R pluralize on vwr english.words from CELEX (~66K) returns ~600 inconsistencies:

library(pluralize)
library(vwr)
dat<-data.frame(word=english.words,sing=singularize(english.words))
dat<-dat[which(dat$word!=dat$sing & !dat$sing %in% english.words),]

Examples:

abdomen: abdoman
always: alway
amaryllis: amarylli
appendicitis: appendiciti
asbestos: asbesto
axis: axi

(...) and so on.

One possible workaround is to filter singulars that are not in a selected dictionary or lexicon.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant