Wikipedia-parser-indexer-tokenizer Code to parse Wikipedia dump, tokenize and index processed docs Files folder contains sample wikipedia dump containing 1600 docs. src folder contains the code to parse, tokenize and Index the processed docs.