[wp-meta] [Making WordPress.org] #5874: Database character set default configuration

Making WordPress.org noreply at wordpress.org
Sun Aug 22 04:51:40 UTC 2021


#5874: Database character set default configuration
---------------------+-----------------------
 Reporter:  anrghg   |      Owner:  (none)
     Type:  defect   |     Status:  new
 Priority:  high     |  Milestone:
Component:  General  |   Keywords:  has-patch
---------------------+-----------------------
 The file **wp-config-sample.php** suggests configuring the database
 charset as `utf8`. The origin of the file is unclear: its traceable history
 starts 8 years ago, but it most probably predates 2010 and was
 not updated when MySQL released support for actual UTF-8 in 2010.

 In MySQL, `utf8` is a proprietary charset that only supports the Basic
 Multilingual Plane of Unicode, as it is limited to 3 bytes per character,
 whereas in UTF-8, characters from U+10000 onward take up 4 bytes
 (starting at F0 90 80 80).
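
 The byte counts above can be checked with a short Python sketch. It is
 only an illustration: the helper name `utf8_hex` and the sample code
 points are chosen here and do not come from the ticket or from WordPress.

```python
# Encode a few code points on either side of the BMP boundary (U+FFFF)
# and print how many bytes each one needs in real UTF-8.

def utf8_hex(code_point: int) -> str:
    """Return the UTF-8 bytes of one code point as spaced uppercase hex."""
    return chr(code_point).encode("utf-8").hex(" ").upper()

# Sample code points (illustrative choice): ASCII, Latin-1, Euro sign,
# last BMP code point, first supplementary code point, an emoji.
for cp in (0x41, 0xE9, 0x20AC, 0xFFFF, 0x10000, 0x1F600):
    encoded = utf8_hex(cp)
    print(f"U+{cp:04X}: {len(encoded.split())} byte(s): {encoded}")
```

 Everything up to U+FFFF fits in at most 3 bytes, while U+10000 is the
 first code point that needs 4.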

 So in MySQL, `utf8` is a misnomer for `utf8mb3`, while the real UTF-8
 encoding, which `utf8` **should be** but //is not//, is labeled `utf8mb4`.
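
 As a rough sketch of the practical consequence, the following Python
 helpers report which of the two MySQL charsets a given string would
 need. The function names are hypothetical, and the check is a
 simplification that only compares code points against the BMP limit.
 It does not talk to a database.

```python
# Decide whether a string stays within the Basic Multilingual Plane,
# i.e. whether every character fits in the 3 bytes that utf8mb3 allows.

BMP_MAX = 0xFFFF  # highest code point representable in 3 UTF-8 bytes


def fits_in_utf8mb3(text: str) -> bool:
    """True if no character in `text` lies beyond the BMP."""
    return all(ord(ch) <= BMP_MAX for ch in text)


def required_charset(text: str) -> str:
    """Name the narrowest of the two MySQL charsets that can hold `text`."""
    return "utf8mb3" if fits_in_utf8mb3(text) else "utf8mb4"


print(required_charset("Gr\u00fc\u00dfe, \u4e16\u754c"))  # BMP-only text
print(required_charset("ok \U0001F600"))                  # contains an emoji
```

 Text containing emoji or other supplementary-plane characters falls into
 the second case, which is why the `utf8` default is a problem.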

-- 
Ticket URL: <https://meta.trac.wordpress.org/ticket/5874>
Making WordPress.org <https://meta.trac.wordpress.org/>