[wp-trac] [WordPress Trac] #30130: Normalize characters with combining marks to precomposed characters

WordPress Trac noreply at wordpress.org
Thu Jul 21 21:10:12 UTC 2022

#30130: Normalize characters with combining marks to precomposed characters
 Reporter:  zodiac1978               |       Owner:  SergeyBiryukov
     Type:  enhancement              |      Status:  reviewing
 Priority:  normal                   |   Milestone:  6.1
Component:  Formatting               |     Version:
 Severity:  normal                   |  Resolution:
 Keywords:  has-patch needs-refresh  |     Focuses:

Comment (by audrasjb):

 In [changeset:"53754" 53754]:
 #!CommitTicketReference repository="" revision="53754"
 Formatting: Normalize to Unicode NFC form before converting accented
 characters in `remove_accents()`.

 This changeset adds Unicode sequence normalization from NFD to NFC, via
 the `normalizer_normalize()` PHP function which is available with the
 recommended `intl` PHP extension.
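
 To illustrate what NFD-to-NFC normalization does, here is a minimal
 sketch in Python rather than PHP. It uses the standard library's
 `unicodedata.normalize()` as a stand-in for `normalizer_normalize()`.
 The helper name `to_nfc` is hypothetical and is not part of the
 changeset.

```python
import unicodedata


def to_nfc(text: str) -> str:
    """Return text in NFC, composing base letters with their combining marks."""
    # Illustrative stand-in for PHP's normalizer_normalize(); not WordPress code.
    return unicodedata.normalize("NFC", text)


# NFD form: base letter "e" followed by U+0301 COMBINING ACUTE ACCENT.
decomposed = "cafe\u0301"
# NFC form: the single precomposed code point U+00E9.
precomposed = "caf\u00e9"

composed = to_nfc(decomposed)
```

 Both strings render identically, but they compare as different until
 the decomposed one is normalized.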

 This fixes an issue where NFD characters were not properly sanitized. It
 also provides a unit test for NFD sequences (alternate Unicode
 representations of the same characters).
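
 The failure mode can be sketched as follows, assuming that accent
 removal relies on a lookup table keyed by precomposed characters. This
 is a Python illustration, not the actual `remove_accents()` code;
 `ACCENT_MAP` and `strip_accents` are hypothetical names.

```python
import unicodedata

# Hypothetical, deliberately tiny replacement table keyed by precomposed characters.
ACCENT_MAP = {"\u00e9": "e", "\u00fc": "u"}


def strip_accents(text: str, normalize: bool = True) -> str:
    """Replace accented characters using ACCENT_MAP, optionally normalizing to NFC first."""
    if normalize:
        # Compose base letter + combining mark pairs so the table lookup can match.
        text = unicodedata.normalize("NFC", text)
    return "".join(ACCENT_MAP.get(ch, ch) for ch in text)


# NFD input: "e" followed by U+0301 COMBINING ACUTE ACCENT.
nfd_word = "cafe\u0301"

without_nfc = strip_accents(nfd_word, normalize=False)  # combining mark is left in place
with_nfc = strip_accents(nfd_word)                      # mark is composed, then replaced
```

 Without the normalization step the combining mark passes through
 untouched, because no table key matches it. With the step, the
 sequence is composed first and the lookup succeeds.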

 Props NumidWasNotAvailable, targz, nacin, nunomorgadinho, p_enrique,
 gitlost, SergeyBiryukov, markoheijnen, mikeschroder, ocean90, pento,
 helen, rodrigosevero, zodiac1978, ironprogrammer, audrasjb, azaozz,
 laboiteare, nuryko, virgar, dxd5001, onnimonni, johnbillion.
 Fixes #24661, #47763, #35951.
 See #30130, #52654.

Ticket URL: <https://core.trac.wordpress.org/ticket/30130#comment:55>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform
