[wp-trac] [WordPress Trac] #24661: remove_accents is not removing combining accents

WordPress Trac noreply at wordpress.org
Mon Sep 5 17:16:57 UTC 2016


#24661: remove_accents is not removing combining accents
-------------------------------------+-----------------------------
 Reporter:  NumidWasNotAvailable     |       Owner:
     Type:  defect (bug)             |      Status:  new
 Priority:  normal                   |   Milestone:  Future Release
Component:  Formatting               |     Version:  1.2.1
 Severity:  normal                   |  Resolution:
 Keywords:  has-patch needs-refresh  |     Focuses:
-------------------------------------+-----------------------------

Comment (by gitlost):

 Refresh of the patch, modified to target only "Mn" non-spacing combining
 marks, and only if they follow Latin characters. This follows the approach
 taken by the ICU
 [http://www.unicode.org/repos/cldr/trunk/common/transforms/Latin-ASCII.xml
 Latin-ASCII.xml] transliteration rule file. The single-byte case (when
 PCRE UTF-8/UCP is unavailable) uses regexs generated from the UCD data
 files [http://www.unicode.org/Public/9.0.0/ucd/UnicodeData.txt
 UnicodeData.txt] and [http://www.unicode.org/Public/9.0.0/ucd/Scripts.txt
 Scripts.txt].

 The patch includes unit tests drawn from the previous ones.

 It would address the OP bug report in the parent ticket #22363, assuming
 `remove_accents` is added as a filter on `sanitize_file_name`.

--
Ticket URL: <https://core.trac.wordpress.org/ticket/24661#comment:16>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform


More information about the wp-trac mailing list