Wiktionary-Json-Parse

Wiktionary Json Parse is a Java program that parses a large JSON file of English dictionary entries, obtained from kaikki, into the preferred SQL databases, structuring the data and removing unneeded attributes along the way.

Important Steps

  • Download the rar file from here
  • Place it in the resources folder
  • Run the classes in the main folder to generate the preferred databases
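
The parsing step can be pictured roughly as follows. This is a minimal sketch, not the project's actual code: it assumes the extracted file holds one JSON object per line, and the class name `KaikkiLineSketch`, the kept attributes `word` and `pos`, and the sample lines are all hypothetical. It uses a regular expression only to stay dependency-free; a real run over the full file would more likely use a JSON library.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
import java.util.stream.Stream;

final class KaikkiLineSketch {
    // Hypothetical list of attributes to keep; everything else is dropped.
    static final String[] KEPT_FIELDS = {"word", "pos"};

    // Returns the first simple string value for the given key, or null if absent.
    // Simplification: this matches the first occurrence anywhere in the line
    // (including nested objects) and leaves escape sequences unprocessed.
    public static String stringField(String jsonLine, String name) {
        Pattern p = Pattern.compile(
                "\"" + Pattern.quote(name) + "\"\\s*:\\s*\"((?:[^\"\\\\]|\\\\.)*)\"");
        Matcher m = p.matcher(jsonLine);
        return m.find() ? m.group(1) : null;
    }

    // Reduces one JSON line to a row containing only the kept attributes.
    public static Map<String, String> parseLine(String jsonLine) {
        Map<String, String> row = new LinkedHashMap<>();
        for (String field : KEPT_FIELDS) {
            String value = stringField(jsonLine, field);
            if (value != null) {
                row.put(field, value);
            }
        }
        return row;
    }

    // Processes lines one at a time, skipping blank lines, so a large file
    // never needs to be held in memory as raw text.
    public static List<Map<String, String>> parseAll(Stream<String> lines) {
        List<Map<String, String>> rows = new ArrayList<>();
        lines.filter(line -> !line.trim().isEmpty())
             .forEach(line -> rows.add(parseLine(line)));
        return rows;
    }

    public static void main(String[] args) {
        // Made-up sample lines standing in for the downloaded file.
        Stream<String> sample = Stream.of(
                "{\"word\": \"cat\", \"pos\": \"noun\", \"lang_code\": \"en\"}",
                "{\"word\": \"run\", \"pos\": \"verb\", \"lang_code\": \"en\"}");
        for (Map<String, String> row : parseAll(sample)) {
            System.out.println(row);
        }
    }
}
```

For the real file, `java.nio.file.Files.lines(path)` yields the same kind of lazy `Stream<String>`, and each resulting row could then be bound to a JDBC `PreparedStatement` for whichever SQL database is preferred.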

Citations:

Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022.