stscoundrel/scandinavian-dictionary-crosslinker

Scandinavian dictionary crosslinker

Finds shared entries in the sitemaps of linguistically related dictionaries and builds a mapping of relations that lets each dictionary crosslink to the related sources. Having the same word in dictionaries of different languages is usually not very helpful, but the Scandinavian languages of the 8th to 16th centuries are related closely enough for such entries to be useful as cross-references.

Parses sources from the following dictionary projects:

The parser finds over 1 000 entries that are present in all four dictionaries. There are also over 20 000 entries that appear in at least two different dictionaries, making them worth a crosslink.

Install

yarn add scandinavian-dictionary-crosslinker

Download sitemaps

Run `cargo run` in the downloader folder. This downloads the latest XML sitemaps to the resources folder.

Generate crosslinks

Run `go run *.go` in the crosslinks folder. This generates the crosslinks JSON in the resources folder.
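The crosslinking idea can be illustrated with a minimal Go sketch. It is not the code in the crosslinks folder: the type and function names, the source labels, the example URLs, and the assumption that the last URL path segment identifies an entry are all hypothetical. It only shows the general approach of parsing sitemaps and keeping entries that appear in at least two dictionaries.

```go
package main

import (
	"encoding/xml"
	"fmt"
	"net/url"
	"path"
	"sort"
	"strings"
)

// urlSet mirrors the standard sitemap layout: <urlset><url><loc>...</loc></url></urlset>.
type urlSet struct {
	URLs []struct {
		Loc string `xml:"loc"`
	} `xml:"url"`
}

// Crosslink points at one dictionary's page for a shared entry (hypothetical shape).
type Crosslink struct {
	Source string `json:"source"`
	URL    string `json:"url"`
}

// entrySlugs maps the last path segment of every <loc> to its full URL.
// Treating that segment as the entry identifier is an assumption.
func entrySlugs(sitemap []byte) (map[string]string, error) {
	var set urlSet
	if err := xml.Unmarshal(sitemap, &set); err != nil {
		return nil, err
	}
	slugs := make(map[string]string)
	for _, u := range set.URLs {
		loc := strings.TrimSpace(u.Loc)
		parsed, err := url.Parse(loc)
		if err != nil {
			continue // skip malformed URLs instead of failing the whole run
		}
		slug := path.Base(parsed.Path)
		if slug == "." || slug == "/" {
			continue // URL without a path, such as a front page
		}
		slugs[slug] = loc
	}
	return slugs, nil
}

// crosslinks keeps only the entries found in at least minSources dictionaries.
func crosslinks(sitemaps map[string][]byte, minSources int) (map[string][]Crosslink, error) {
	all := make(map[string][]Crosslink)
	for source, data := range sitemaps {
		slugs, err := entrySlugs(data)
		if err != nil {
			return nil, fmt.Errorf("parsing %s sitemap: %w", source, err)
		}
		for slug, loc := range slugs {
			all[slug] = append(all[slug], Crosslink{Source: source, URL: loc})
		}
	}
	shared := make(map[string][]Crosslink)
	for slug, links := range all {
		if len(links) < minSources {
			continue
		}
		// Sort by source name so the output is stable between runs.
		sort.Slice(links, func(i, j int) bool { return links[i].Source < links[j].Source })
		shared[slug] = links
	}
	return shared, nil
}

func main() {
	// Placeholder sitemaps; real ones would be read from the resources folder.
	sitemaps := map[string][]byte{
		"dictionary-a": []byte(`<urlset><url><loc>https://a.example/word/hestr</loc></url><url><loc>https://a.example/word/armr</loc></url></urlset>`),
		"dictionary-b": []byte(`<urlset><url><loc>https://b.example/word/hestr</loc></url></urlset>`),
	}
	shared, err := crosslinks(sitemaps, 2)
	if err != nil {
		panic(err)
	}
	for slug, links := range shared {
		fmt.Println(slug, len(links))
	}
}
```

Raising `minSources` to the number of dictionaries would keep only the entries present in every source.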

Minify output

Run `nimble build` and then `./minifier` in the minifier folder. This generates minified and gzipped JSON outputs.
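The minifier itself is built with Nim. As a rough illustration of what minifying and gzipping JSON involves, here is a Go sketch using only the standard library. It is an assumption about the general technique, not a port of the minifier, and the function names are hypothetical.

```go
package main

import (
	"bytes"
	"compress/gzip"
	"encoding/json"
	"fmt"
)

// minify strips insignificant whitespace from a JSON document.
func minify(pretty []byte) ([]byte, error) {
	var buf bytes.Buffer
	if err := json.Compact(&buf, pretty); err != nil {
		return nil, err
	}
	return buf.Bytes(), nil
}

// gzipBytes compresses data into the gzip format.
func gzipBytes(data []byte) ([]byte, error) {
	var buf bytes.Buffer
	zw := gzip.NewWriter(&buf)
	if _, err := zw.Write(data); err != nil {
		return nil, err
	}
	// Close flushes pending data and writes the gzip footer.
	if err := zw.Close(); err != nil {
		return nil, err
	}
	return buf.Bytes(), nil
}

func main() {
	// Placeholder input; real input would be the generated crosslinks JSON.
	pretty := []byte("{\n  \"hestr\": [\n    {\"source\": \"dictionary-a\"}\n  ]\n}")
	small, err := minify(pretty)
	if err != nil {
		panic(err)
	}
	packed, err := gzipBytes(small)
	if err != nil {
		panic(err)
	}
	fmt.Printf("pretty: %d bytes, minified: %d bytes, gzipped: %d bytes\n", len(pretty), len(small), len(packed))
}
```

On very small inputs the gzipped result can be larger than the minified one because of the fixed gzip header and footer; the savings show up on larger files.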

Update data in the NPM module

Run `go run main.go` in the root folder to update the JSON and the readme in the NPM module.