-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Biterm most frequent topic filtering #7
Comments
I tried to speed up the logic which I orignally implemented at Lines 232 to 233 in d0c40fb
Later on, I make sure only biterms with terms highly emitted by each topic are shown at Line 246 in d0c40fb
So a bug clearly but probably not occurring that much unless you really have completely overlapping topics. |
I think you are responding to my other issue but this one is different. I can send a reprex tomorrow |
Nevermind, I just saw your commit to fix this issue 👍 |
I pushed the package on CRAN just now. |
# for free
to join this conversation on GitHub.
Already have an account?
# to comment
textplot/R/textplot_biterms.R
Lines 230 to 231 in d0c40fb
Correct me if I'm wrong but these don't actually pick the best/most frequent topic.
topic_freq
gives the number of occurrences of each biterm in the whole corpus sincetopic
is not included in theby
argument of the first line.Hence second line picks the maximum of a variable that is constant within each group
The text was updated successfully, but these errors were encountered: