Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

pyldavis "individual terms...interpreting..current topic" v. gensim's print_topics #79

Open
husseinkohy opened this issue Dec 12, 2016 · 5 comments

Comments

@husseinkohy
Copy link

husseinkohy commented Dec 12, 2016

Hi,

I have noticed that the "individual terms" shown on the horizontal barchart of pyldavis prints different words from gensim's print_topics function (https://radimrehurek.com/gensim/models/ldamodel.html). in some cases I do not see one matching word between the two calls. changing lambda does not make a difference. any insight would be greatly appreciated. I will post the same question to gensim's mailing list. thanks

update: it seems that number of topic assigned to a text could be different. is there a way to match the topic number assigned by these two models?

@MCin22
Copy link

MCin22 commented Jan 24, 2017

Hi there! I just started to work using gensim and pyLDAvis and also noticed similar problem. Is there any way to match topics between gensim output and pyLDAvis visualization?

I wanted to manually name generated topics based on the most relevant terms and to give labels to documents that are included to corpora, but because of mismatching between outputs from gensim's print_topics function and pyLDAvis' visualization I can't do it. (I checked out whether the most relevant terms occurs in documents that belong to specific topic and got nothing)

@shah-p
Copy link

shah-p commented May 8, 2019

Hi there! I have observed the same issue. It's been a while since you have raised this issue.
Did you find any answer for this? It's not even been discussed widely. So, it's hard to find the answer.

Any help would be much appreciated. Thanks.

@prachimehta2705
Copy link

Hi,
I have been facing the same issue as well and have been stuck for a while now. Please share any possible solution.
Thanks

@husseinkohy
Copy link
Author

husseinkohy commented Jul 2, 2019 via email

@shah-p
Copy link

shah-p commented Jul 4, 2019

Hi,
I have been facing the same issue as well and have been stuck for a while now. Please share any possible solution.
Thanks

Hi,
I noticed that the topic numbers in pyLDAvis and LDA output are mismatching but results are matching. for example, words of topic 0 of LDA output (from sci-kit in my case) are in topic 2 in pyLDAvis and so on for all other topics.
Check this and let me know if you are getting the same way.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants