[wp-trac] [WordPress Trac] #30130: Normalize characters with combining marks to precomposed characters

WordPress Trac noreply at wordpress.org
Sat Feb 11 21:08:29 UTC 2023


#30130: Normalize characters with combining marks to precomposed characters
-------------------------+-----------------------------
 Reporter:  zodiac1978   |       Owner:  SergeyBiryukov
     Type:  enhancement  |      Status:  closed
 Priority:  normal       |   Milestone:  6.1
Component:  Formatting   |     Version:
 Severity:  normal       |  Resolution:  fixed
 Keywords:  has-patch    |     Focuses:
-------------------------+-----------------------------

Comment (by julianoe):

 Maybe I missed something, but this 6.1 fix only fixed the problem related
 to remove_accents, right?
 I stumbled onto those issues while working on a very old (8+ years) large
 french website and remove_accents works well now. so that's cool, thanks
 everyone who contributed.

 But MacOS users are still submitting content that, for example, can't be
 found by others through the search because the excerpt/title uses UTF-8
 NFD normalisation and not NFC. Accents are not normalized through the
 fields listed or in the editor
 https://core.trac.wordpress.org/ticket/30130#comment:28

 As a test I created a post simply titled "L’Araignée" (you can see here
 https://www.fontspace.com/unicode/analyzer#e=ZcyB that it's using a
 decomposed/combining accent).
 When I search "araignée" using standard UTF-8 NFC accents, it's absent
 from the search results.
 The W3C validator will still output warnings about "text run not in
 Unicode Normalization Form C".

 What should be the documented/recommended route for people having this
 issue (some of them might even stumble on this issue through searches,
 like I did)?
 - use @gitlost plugin?
 - recommend MacOS users to use Safari instead of Firefox. I can't test it,
 is Safari still the only browser converting the strings to NFC?
 - should we still think about on a fix in WordPress in a separate issue?

-- 
Ticket URL: <https://core.trac.wordpress.org/ticket/30130#comment:61>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform


More information about the wp-trac mailing list