Relevant tweets #5

audrism · 2019-02-14T14:15:43Z

No description provided.

abhidya · 2019-03-06T18:57:16Z

https://github.com/DisasterMasters/TweetAnalysis/blob/master/src/results/Relevance%20Preprocessing.ipynb
Best Text Preprocessing for Doc2vec is simply distributed bag of words + punctuation removal
Tried combos of
distributed memory
distributed bag of words
LowerCase
Removal of Stop Words
Rare words removal
Spelling correction
punctuation removal

audrism · 2019-03-13T18:27:38Z

@abhidya what are the datasets you train relevant/irrelevant tweets for irma? Also is the code link above the right one. @nwest13

audrism added this to the Sprint 1 milestone Feb 14, 2019

audrism assigned abhidya Feb 14, 2019

Provide feedback