[wp-trac] [WordPress Trac] #24661: remove_accents is not removing combining accents
WordPress Trac
noreply at wordpress.org
Mon Sep 5 17:16:57 UTC 2016
#24661: remove_accents is not removing combining accents
-------------------------------------+-----------------------------
Reporter: NumidWasNotAvailable | Owner:
Type: defect (bug) | Status: new
Priority: normal | Milestone: Future Release
Component: Formatting | Version: 1.2.1
Severity: normal | Resolution:
Keywords: has-patch needs-refresh | Focuses:
-------------------------------------+-----------------------------
Comment (by gitlost):
Refresh of the patch, modified to target only "Mn" non-spacing combining
marks, and only if they follow Latin characters. This follows the approach
taken by the ICU
[http://www.unicode.org/repos/cldr/trunk/common/transforms/Latin-ASCII.xml
Latin-ASCII.xml] transliteration rule file. The single-byte case (when
PCRE UTF-8/UCP is unavailable) uses regexs generated from the UCD data
files [http://www.unicode.org/Public/9.0.0/ucd/UnicodeData.txt
UnicodeData.txt] and [http://www.unicode.org/Public/9.0.0/ucd/Scripts.txt
Scripts.txt].
The patch includes unit tests drawn from the previous ones.
It would address the OP bug report in the parent ticket #22363, assuming
`remove_accents` is added as a filter on `sanitize_file_name`.
--
Ticket URL: <https://core.trac.wordpress.org/ticket/24661#comment:16>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform
More information about the wp-trac
mailing list