[wp-meta] [Making WordPress.org] #5874: Database character set default configuration
Making WordPress.org
noreply at wordpress.org
Sun Aug 22 04:51:40 UTC 2021
#5874: Database character set default configuration
---------------------+-----------------------
Reporter: anrghg | Owner: (none)
Type: defect | Status: new
Priority: high | Milestone:
Component: General | Keywords: has-patch
---------------------+-----------------------
The file **wp-config-sample.php** suggests setting the database
charset to `utf8`. The origin of the file seems unknown: its traceable
history starts 8 years ago, but it most probably predates 2010 and was
not updated when MySQL released support for actual UTF-8 in 2010.
In MySQL, `utf8` is a proprietary charset that supports only the Basic
Multilingual Plane of Unicode, because it is limited to 3 bytes per
character, whereas in UTF-8, characters from U+10000 onward take up
4 bytes (starting at F0 90 80 80).
So in MySQL, `utf8` is a misnomer for `utf8mb3`, while the real UTF-8
encoding, which `utf8` **should be** but //is not//, is labeled `utf8mb4`.
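To illustrate the byte-length boundary described above, here is a minimal Python sketch. The helper names `utf8_length` and `fits_in_utf8mb3` are hypothetical and not part of WordPress or MySQL. The check does not query a database: it models `utf8mb3` as "at most 3 bytes per character", following the description in this ticket.

```python
def utf8_length(code_point: int) -> int:
    """Return how many bytes UTF-8 needs to encode one code point."""
    return len(chr(code_point).encode("utf-8"))


def fits_in_utf8mb3(text: str) -> bool:
    """Assumption: utf8mb3 stores only characters of at most 3 UTF-8 bytes,
    i.e. code points up to U+FFFF (the Basic Multilingual Plane)."""
    return all(ord(ch) <= 0xFFFF for ch in text)


# Last code point of the Basic Multilingual Plane: 3 bytes.
bmp_last_length = utf8_length(0xFFFF)

# First code point beyond the BMP: 4 bytes, F0 90 80 80.
first_supplementary_hex = chr(0x10000).encode("utf-8").hex()

# An emoji such as U+1F600 lies beyond the BMP, so it would not fit.
emoji_fits = fits_in_utf8mb3("\U0001F600")
```

Under these assumptions, any text containing emoji or other supplementary-plane characters fails the `fits_in_utf8mb3` check, which is the class of input that a `utf8` database charset cannot hold.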
--
Ticket URL: <https://meta.trac.wordpress.org/ticket/5874>
Making WordPress.org <https://meta.trac.wordpress.org/>