Finds shared entries in the sitemaps of linguistically related dictionaries and builds a mapping of relations that lets each dictionary crosslink to related sources. Having the same word in dictionaries of different languages is usually not very helpful, but the Scandinavian languages from the 8th to the 16th century are closely enough related that shared entries are useful as cross-references.
Parses sources from the following dictionary projects:
- Old Norse Dictionary
- Old Icelandic Dictionary
- Old Norwegian Dictionary
- Old Swedish Dictionary
- Old Danish Dictionary
The parser finds over 1 000 entries that are present in all four dictionaries. There are also over 20 000 entries that appear in at least two different dictionaries, making them worth a crosslink.
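The idea of collecting entries that occur in at least two dictionaries can be sketched as follows. This is a minimal illustration, not the project's actual code: the function name `buildCrosslinks`, the dictionary keys, and the sample slugs are all assumptions made for the example.

```go
package main

import (
	"fmt"
	"sort"
)

// buildCrosslinks is a hypothetical helper. It takes a map from dictionary
// name to the entry slugs found in that dictionary, and returns a map from
// slug to the sorted names of the dictionaries containing it. Slugs found in
// fewer than minSources dictionaries are dropped.
func buildCrosslinks(entries map[string][]string, minSources int) map[string][]string {
	// Collect the set of dictionaries per slug; a set avoids double counting
	// when one dictionary lists the same slug twice.
	seen := make(map[string]map[string]bool)
	for dict, slugs := range entries {
		for _, slug := range slugs {
			if seen[slug] == nil {
				seen[slug] = make(map[string]bool)
			}
			seen[slug][dict] = true
		}
	}

	links := make(map[string][]string)
	for slug, dicts := range seen {
		if len(dicts) < minSources {
			continue
		}
		names := make([]string, 0, len(dicts))
		for name := range dicts {
			names = append(names, name)
		}
		sort.Strings(names) // stable output regardless of map iteration order
		links[slug] = names
	}
	return links
}

func main() {
	// Illustrative sample data only, not taken from the real sitemaps.
	entries := map[string][]string{
		"old-norse":     {"hestr", "konungr", "skip"},
		"old-icelandic": {"hestr", "konungr"},
		"old-swedish":   {"konungr", "sten"},
	}
	links := buildCrosslinks(entries, 2)
	fmt.Println(links["konungr"])
}
```

Raising `minSources` to the number of dictionaries would keep only the entries present in every source.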
Install the NPM module with `yarn add scandinavian-dictionary-crosslinker`.
Run `cargo run` in the `downloader` folder. Downloads the latest XML sitemaps to the `resources` folder.
Run `go run *.go` in the `crosslinks` folder. Generates the crosslinks JSON in the `resources` folder.
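Reading entry slugs out of a downloaded sitemap might look roughly like the sketch below. It assumes a standard `urlset` sitemap and assumes that the last path segment of each URL is the entry slug; the real URL layout of each dictionary may differ, and `slugsFromSitemap` is a hypothetical name.

```go
package main

import (
	"encoding/xml"
	"fmt"
	"net/url"
	"path"
	"strings"
)

// urlSet mirrors the parts of a sitemap that matter here: a list of <url>
// elements, each with a <loc> child holding the page address.
type urlSet struct {
	URLs []struct {
		Loc string `xml:"loc"`
	} `xml:"url"`
}

// slugsFromSitemap is a hypothetical helper that parses sitemap XML and
// returns the last path segment of every URL, in document order.
func slugsFromSitemap(data []byte) ([]string, error) {
	var set urlSet
	if err := xml.Unmarshal(data, &set); err != nil {
		return nil, err
	}
	slugs := make([]string, 0, len(set.URLs))
	for _, u := range set.URLs {
		parsed, err := url.Parse(strings.TrimSpace(u.Loc))
		if err != nil {
			continue // skip malformed addresses instead of failing the run
		}
		slug := path.Base(parsed.Path)
		if slug == "." || slug == "/" {
			continue // the site root has no entry slug
		}
		slugs = append(slugs, slug)
	}
	return slugs, nil
}

func main() {
	// Made-up sitemap for illustration; example.com is a placeholder host.
	data := []byte(`<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/word/hestr</loc></url>
  <url><loc>https://example.com/word/skip</loc></url>
</urlset>`)
	slugs, err := slugsFromSitemap(data)
	if err != nil {
		panic(err)
	}
	fmt.Println(slugs)
}
```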
Run `nimble build` and `./minifier` in the `minifier` folder. Generates minified and gzipped JSON outputs.
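The minifier itself is built with Nim. Purely to illustrate what "minified and gzipped" means for the output, here is a rough Go equivalent using only the standard library. The function names are hypothetical and this is not a port of the actual minifier.

```go
package main

import (
	"bytes"
	"compress/gzip"
	"encoding/json"
	"fmt"
	"io"
)

// minifyAndGzip strips insignificant whitespace from a JSON document and
// returns the gzip-compressed result.
func minifyAndGzip(raw []byte) ([]byte, error) {
	var compact bytes.Buffer
	if err := json.Compact(&compact, raw); err != nil {
		return nil, err
	}
	var out bytes.Buffer
	zw, err := gzip.NewWriterLevel(&out, gzip.BestCompression)
	if err != nil {
		return nil, err
	}
	if _, err := zw.Write(compact.Bytes()); err != nil {
		return nil, err
	}
	// Close flushes the remaining data and writes the gzip footer.
	if err := zw.Close(); err != nil {
		return nil, err
	}
	return out.Bytes(), nil
}

// gunzip reverses the compression step; it is here only to check the result.
func gunzip(data []byte) ([]byte, error) {
	zr, err := gzip.NewReader(bytes.NewReader(data))
	if err != nil {
		return nil, err
	}
	defer zr.Close()
	return io.ReadAll(zr)
}

func main() {
	// Illustrative input, not real crosslink data.
	raw := []byte(`{
  "konungr": ["old-icelandic", "old-norse"]
}`)
	packed, err := minifyAndGzip(raw)
	if err != nil {
		panic(err)
	}
	plain, err := gunzip(packed)
	if err != nil {
		panic(err)
	}
	fmt.Println(string(plain))
}
```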
Run `go run main.go` in the root folder to update the JSON and readme of the NPM module.