Skip to content

unicode_chars.rs is outdated again #51080

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Closed
est31 opened this issue May 26, 2018 · 0 comments
Closed

unicode_chars.rs is outdated again #51080

est31 opened this issue May 26, 2018 · 0 comments
Labels
A-Unicode Area: Unicode C-feature-request Category: A feature request, i.e: not implemented / a PR. T-libs-api Relevant to the library API team, which will review and decide on the PR/issue.

Comments

@est31
Copy link
Member

est31 commented May 26, 2018

src/libsyntax/parse/lexer/unicode_chars.rs contains a list of confusable characters (see #25957). Its comments mention that it bases on a document. Of this document, a newer revision has been published (link). We should update unicode_chars.rs to include the newly listed confusables as well.

In the last update, in #43629, I've suggested manually going through the list. But maybe we can find a way to isolate the characters that are special to us and autogenerate the remaining list?

@sanxiyn sanxiyn added the A-Unicode Area: Unicode label May 27, 2018
@kennytm kennytm added T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. C-feature-request Category: A feature request, i.e: not implemented / a PR. labels May 27, 2018
@est31 est31 closed this as completed Oct 19, 2018
@rust-lang rust-lang locked and limited conversation to collaborators Oct 19, 2018
# for free to subscribe to this conversation on GitHub. Already have an account? #.
Labels
A-Unicode Area: Unicode C-feature-request Category: A feature request, i.e: not implemented / a PR. T-libs-api Relevant to the library API team, which will review and decide on the PR/issue.
Projects
None yet
Development

No branches or pull requests

3 participants