uvl-analytics-concepts-seanmf

About

This is a microservice that can run in a docker container and perform SeaNMF topic detection when queried by a REST call.

REST API

See swagger.yaml for details. The tool at https://editor.swagger.io/ can be used to render the swagger file.

Method Parameter

alpha - Factorization weight for word-semantic correlations, higher alpha may increase coherence, but reduce interpretability of topics. Can be in (0,1] range. Default: 0.1

beta - Sparsity factor, increase beta (eg. 0.1) for SparseSeaNMF (SSeaNMF), for normal SeaNMF this parameter is not needed (=0). Can be in [0,1] range. Default: 0

n_topics - The number of topics that shall be detected. Higher topic coherence indicates a better n_topics. Can be any number > 0. Default: 10

max_iter - Maximum number of iterations that will be performed. Default: 500

max_err - Error threshold. The processing will stop when the error is smaller than max_err or max_iter is reached. Default: 0.1

fix_random - Set to true to fix random seed to 0. This will make the results reproducible. Default: false

vocab_min_count - Only words that occur more than vocab_min_count times will be added to vocabulary, if the dataset is small and the vocab_min_count to high, the processing will fail. Default: 3

Source

This repository is based on the SeaNMF implementation of the paper

Tian Shi, Kyeongpil Kang, Jaegul Choo and Chandan K. Reddy, "Short-Text Topic Modeling via Non-negative Matrix Factorization Enriched with Local Word-Context Correlations", In Proceedings of the International Conference on World Wide Web (WWW), Lyon, France, April 2018. PDF

Original Repository: (https://github.com/tshi04/SeaNMF)

The code has been modified to be dockerized and to offer a REST API via Flask.

License

Free use of this software is granted under the terms of the GPL version 3 (GPL 3.0).

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
__pycache__		__pycache__
data		data
seanmf_results		seanmf_results
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
README_seanmf.md		README_seanmf.md
__init__.py		__init__.py
config.json		config.json
data_process.py		data_process.py
model.py		model.py
requirements.txt		requirements.txt
results.py		results.py
starter.py		starter.py
swagger.yaml		swagger.yaml
train.py		train.py
utils.py		utils.py
vis_topic.py		vis_topic.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

uvl-analytics-concepts-seanmf

About

REST API

Method Parameter

Source

License

About

Releases

Packages

Contributors 3

Languages

License

feeduvl/uvl-analytics-concepts-seanmf

Folders and files

Latest commit

History

Repository files navigation

uvl-analytics-concepts-seanmf

About

REST API

Method Parameter

Source

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages