[wp-trac] [WordPress Trac] #44793: remove_accents() doesnt escape all versions of "i"
WordPress Trac
noreply at wordpress.org
Sun Sep 22 20:30:02 UTC 2019
#44793: remove_accents() doesnt escape all versions of "i"
------------------------------------------------+--------------------------
Reporter: bagosm | Owner:
| SergeyBiryukov
Type: defect (bug) | Status: reviewing
Priority: normal | Milestone: Future
| Release
Component: Formatting | Version:
Severity: normal | Resolution:
Keywords: has-patch dev-feedback 2nd-opinion | Focuses:
------------------------------------------------+--------------------------
Changes (by azaozz):
* milestone: 5.3 => Future Release
Comment:
As far as I understand `remove_accents()` is meant to replace accented
letters from Latin based languages with their non-accented equivalent (as
much as possible) only for the
[https://en.wikipedia.org/wiki/Unicode_block Latin-1 Supplement, Latin
Extended-A, and Latin Extended-B] Unicode blocks.
It is not meant to "reduce" a string containing chars from the above
Unicode blocks to an "ASCII equivalent", however in practice this is often
the case. Looking back, seems it was added to both sanitize and make
permalinks and user names easy to url-encode and look better (SEO
related?), although there may have been other reasons.
Looking at the questions @xkon raises, `remove_accents()` would probably
need better documentation, and in the longer run, a re-evaluation and
refactoring.
In that terms moving to "future release" for now. Please feel free to add
back to 5.3 if feasible.
--
Ticket URL: <https://core.trac.wordpress.org/ticket/44793#comment:21>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform
More information about the wp-trac
mailing list