word2vec_vs_glove

Simple python implementation of two popular word embedding algorithm: Word2vec and GloVe.

The existing dataset in this project is SST(Stanford Sentiment Treebank)
SST contain sentiment analysis labels which can be used to evaluating the pros & cons of each embedding model.

pip install -r requirement.txt

sh get_datasets.sh

python train.py -m word2vec --save-every=True --vector-path=./model/word2vec -s 10 --learning-rate=0.3 -w 5 --iterations=40000

python train.py -m glove -s 50 --learning-rate=0.05 --iterations=200 --save-every=True --vector-path=./model/glove

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.idea		.idea
utils		utils
README.md		README.md
get_datasets.sh		get_datasets.sh
glove.py		glove.py
requirements.txt		requirements.txt
sgd.py		sgd.py
train.py		train.py
word2vec.py		word2vec.py

Provide feedback