Phrases documentation for threshold argument is misleading #2111
Labels: bug, difficulty easy, documentation, good first issue, Hacktoberfest
https://github.com/RaRe-Technologies/gensim/blob/37e49971efa74310b300468a5b3cf531319c6536/gensim/models/phrases.py#L252-L255
It feels to me like this should have said the opposite: "Heavily depends on concrete scoring-function" rather than "Hardly depends on concrete socring-function".
For example, if you choose npmi instead of the default scoring function, the threshold has to be between -1 and 1, so the default threshold (10.0) makes no sense. The current documentation suggests that 10.0 will be OK.
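To illustrate the scale mismatch, here is a minimal sketch in plain Python. It assumes the two scoring formulas as I understand them from the gensim docs (the default Mikolov-style score and NPMI); the function names and all counts below are made up for illustration and are not gensim's API.

```python
import math


def default_score(count_a, count_b, count_ab, vocab_size, min_count):
    # Assumed form of the default score:
    # (count(ab) - min_count) / count(a) / count(b) * vocab_size
    # This value is unbounded above, so a threshold like 10.0 is meaningful.
    return (count_ab - min_count) / count_a / count_b * vocab_size


def npmi_score(count_a, count_b, count_ab, corpus_word_count):
    # Assumed form of NPMI: ln(p(ab) / (p(a) * p(b))) / -ln(p(ab)),
    # where p(x) = count(x) / corpus_word_count.
    # This value always lies in [-1, 1].
    p_a = count_a / corpus_word_count
    p_b = count_b / corpus_word_count
    p_ab = count_ab / corpus_word_count
    return math.log(p_ab / (p_a * p_b)) / -math.log(p_ab)


# Hypothetical counts for a strongly associated bigram.
count_a, count_b, count_ab = 50, 40, 30
vocab_size, corpus_word_count, min_count = 5000, 100000, 5

default_value = default_score(count_a, count_b, count_ab, vocab_size, min_count)
npmi_value = npmi_score(count_a, count_b, count_ab, corpus_word_count)

threshold = 10.0  # the documented default
print("default score passes threshold:", default_value > threshold)
print("npmi score passes threshold:", npmi_value > threshold)
```

With these counts the default score clears 10.0, while the NPMI score for the same bigram cannot, because it is bounded by 1. Under NPMI with the default threshold, no phrase would ever be accepted.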