Calling CharTable converts "幺" to "么", which is unreasonable #1427

Closed
1 task done
tiandiweizun opened this issue Feb 18, 2020 · 1 comment

@tiandiweizun

tiandiweizun commented Feb 18, 2020

Describe the bug

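Calling CharTable converts "幺" to "么", which is unreasonable. Although 么 can also mean "one" and can also be pronounced yāo, 么 does not usually stand for 幺, and 幺 is already a normalized form, so there is no need to change it further. The entry 幺=么 should be removed from CharTable.txt.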

Code to reproduce the issue

 System.out.println(CharTable.convert("幺妹的手机号码是幺三二开头的"));

Describe the current behavior

么妹的手机号码是么三二开头的

Expected behavior

幺妹的手机号码是幺三二开头的
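
To illustrate why a single 幺=么 entry is enough to produce the output above, here is a minimal, self-contained sketch. It is not HanLP's implementation: the class `CharTableSketch` and its constructor are hypothetical stand-ins, and the only assumption taken from this report is that the table consists of `source=target` character pairs such as 幺=么.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a character normalization table.
// This is NOT HanLP's CharTable; it only mimics "source=target" entries.
final class CharTableSketch {
    private final Map<Character, Character> table = new HashMap<>();

    // Each entry is assumed to be exactly three chars: source, '=', target.
    CharTableSketch(String... entries) {
        for (String entry : entries) {
            if (entry.length() == 3 && entry.charAt(1) == '=') {
                table.put(entry.charAt(0), entry.charAt(2));
            }
        }
    }

    // Replace every character that has a mapping; leave all others unchanged.
    String convert(String text) {
        StringBuilder sb = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            char mapped = table.getOrDefault(c, c);
            sb.append(mapped);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String input = "幺妹的手机号码是幺三二开头的";

        // With the disputed entry, every 幺 is rewritten to 么.
        CharTableSketch withEntry = new CharTableSketch("幺=么");
        System.out.println(withEntry.convert(input));

        // Without the entry, the text passes through unchanged.
        CharTableSketch withoutEntry = new CharTableSketch();
        System.out.println(withoutEntry.convert(input));
    }
}
```

Under these assumptions the first call reproduces the reported output and the second reproduces the expected one, which is why removing the entry from the table file is sufficient.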

System information

  • OS Platform and Distribution (e.g., Linux Ubuntu 16.04): win10
  • Python version: java
  • HanLP version: 1.7.6

Other info / logs

  • I've completed this form and searched the web for solutions.
hankcs added a commit that referenced this issue Feb 18, 2020
@hankcs
Owner

hankcs commented Feb 18, 2020

Thanks for the feedback. This has been fixed; please see the commit above.
If you still run into problems, feel free to reopen the issue.

@hankcs hankcs closed this as completed Feb 18, 2020