Phrases documentation for threshold argument is misleading #2111

umangv · 2018-06-28T15:47:22Z

https://github.com/RaRe-Technologies/gensim/blob/37e49971efa74310b300468a5b3cf531319c6536/gensim/models/phrases.py#L252-L255

It feels to me like this should have said the opposite: "Heavily depends on concrete scoring-function" rather than "Hardly depends on concrete socring-function".

For example, if you choose npmi instead of the default, the threshold has to be between -1 and 1, so the default (10.0) makes no sense. The current documentation suggests that 10.0 will be OK.
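To make the mismatch concrete, here is a minimal sketch of the two scoring formulas as I understand them from the gensim docs. The function names `default_score` and `npmi_score` and all of the counts are hypothetical, chosen only for illustration; this is not the library's code.

```python
import math

# Hypothetical counts for one candidate bigram (illustration only).
worda_count = 50            # occurrences of the first word
wordb_count = 40            # occurrences of the second word
bigram_count = 30           # occurrences of the two words together
len_vocab = 10_000          # vocabulary size
min_count = 5               # minimum count used by the default scorer
corpus_word_count = 100_000 # total number of tokens in the corpus


def default_score(worda_count, wordb_count, bigram_count, len_vocab, min_count):
    """Assumed form of the default scorer: non-negative here, no upper bound of 1."""
    return (bigram_count - min_count) / worda_count / wordb_count * len_vocab


def npmi_score(worda_count, wordb_count, bigram_count, corpus_word_count):
    """Assumed form of normalized PMI: always falls in [-1, 1]."""
    pa = worda_count / corpus_word_count
    pb = wordb_count / corpus_word_count
    pab = bigram_count / corpus_word_count
    return math.log(pab / (pa * pb)) / -math.log(pab)


threshold = 10.0  # the default threshold discussed in this issue

default_value = default_score(worda_count, wordb_count, bigram_count, len_vocab, min_count)
npmi_value = npmi_score(worda_count, wordb_count, bigram_count, corpus_word_count)

print("default score:", default_value, "passes threshold:", default_value > threshold)
print("npmi score:", npmi_value, "passes threshold:", npmi_value > threshold)
```

With these made-up counts the default score clears 10.0 easily, while the npmi score for the same strongly associated pair stays below 1, so under npmi a threshold of 10.0 would never accept any phrase.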

gojomo · 2018-07-04T05:31:05Z

I suspect the intended meaning was, as you surmise, "heavily depends"... and the docs around the 'npmi' option should make that acceptable range clear.

jenishah · 2018-10-24T04:39:10Z

"Hardly" has already been replaced by "Heavily". Would it be a good idea to add the range of the score under "Returns" in the documentation of the scoring function?

piskvorky added the bug and documentation labels Jul 4, 2018

menshikh-iv added the difficulty easy, good first issue, and Hacktoberfest labels Sep 28, 2018

jenishah mentioned this issue Oct 26, 2018

Add documentation about ranges to scoring functions for Phrases #2242

Merged

menshikh-iv closed this as completed in #2242 Dec 13, 2018
