[wp-trac] [WordPress Trac] #21212: MySQL tables should use utf8mb4 character set
WordPress Trac
noreply at wordpress.org
Wed Feb 11 08:45:08 UTC 2015
#21212: MySQL tables should use utf8mb4 character set
----------------------------+---------------------
Reporter: pento | Owner:
Type: task (blessed) | Status: closed
Priority: normal | Milestone: 4.2
Component: Database | Version: 3.4.1
Severity: normal | Resolution: fixed
Keywords: | Focuses:
----------------------------+---------------------
Comment (by masakielastic):
I think '''wp_encode_emoji''' is not suitable for both escaping 4-byte
characters and the standard api.
wp_encode_emoji function is [http://www.wikiwand.com/en/Leaky_abstraction
leaky abstraction] since new emoji characters will be added in Unicode
Standard every year. Thus the users of function are forced to check the
change of [http://www.unicode.org/Public/emoji/1.0/emoji-data.txt emoji-
data.txt], emoji skin tone modifiers (U+1F3FB..U+1F3FF, see
[http://www.unicode.org/reports/tr51/tr51-1.html Unicode Technical Report
51] ) and use their own function.
[http://en.wikipedia.org/wiki/Regional_Indicator_Symbol Regional indicator
symbols] (U+1F1E6..U+1F1FF) are not emoji themself though, they are used
for national flags. see
[http://www.unicode.org/Public/7.0.0/ucd/auxiliary/GraphemeBreakProperty.txt
GraphemeBreakProperty.txt] or [http://unicode.org/reports/tr29/ Unicode
Standard Annex 29] for the details.
Another reason why I do not vote for wp_encode_emoji function is that all
of 4-byte characters is not emoji.
A lot of 4-byte chinese characters are used for the names of places and
the family names. U+20BB7 is used for
[http://en.wikipedia.org/wiki/Yoshinoya Yoshinoya], which is Japanese fast
food chain and have more than 1800 stores.
A part of Variation selectors supplements are used for variant form of
chinese characters (U+E0100..U+E01EF). U+E0101 is used for Katsushika-ku
(U+845B U+E0101 U+98FE U+533A). TAKARA TOMY which is famous for Pokemon
has the head office in Katsushika-ku.
--
Ticket URL: <https://core.trac.wordpress.org/ticket/21212#comment:88>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform
More information about the wp-trac
mailing list