Skip to content
This repository has been archived by the owner on Feb 27, 2020. It is now read-only.

Latest commit

 

History

History
9 lines (8 loc) · 321 Bytes

TODO.md

File metadata and controls

9 lines (8 loc) · 321 Bytes

Remove punctuation ';"{}[]/?.,! Lowercase the data since it is going to happen anyway also stopwords are lower case Try different embedding See if better performance with stopwords included Find out what words are not in the embeddings?

Include magic features Start recording statistics Try to merge less popular words?