- ABC News article: Extracts the topic and keyboard from each news article (csv file) for over 10,000 articles. Two approaches used with the model: bag of words, TF-IDF
- temp.py and abc.csv file
- AAN Dataset: Extracts four keywords from each paper in the AAN (American Academy of Neurology) Conference for over 20,000 papers (json object). Implemented Gensim LDA and output the title and trained keywords as the final product.
- jsontest.py and aan.json file