[wp-trac] [WordPress Trac] #13590: Inserting a tetragram (SMP/Plane 1) character truncates post fields

WordPress Trac wp-trac at lists.automattic.com
Fri May 28 06:53:42 UTC 2010


#13590: Inserting a tetragram (SMP/Plane 1) character truncates post fields
--------------------------+-------------------------------------------------
 Reporter:  sardisson     |       Owner:            
     Type:  defect (bug)  |      Status:  new       
 Priority:  normal        |   Milestone:  Unassigned
Component:  General       |     Version:  2.9.2     
 Severity:  normal        |    Keywords:            
--------------------------+-------------------------------------------------
 WordPress 2.9.2 on Apache 2.2.15, MySQL 5.1.46-log, PHP 5.2.13

 I was writing a post in which I used the "tetragram for advance" (U+1D319)
 in both the post title and in the body of the post (as a raw UTF-8 glyph
 rather than an entity).

 When I had WordPress save a draft of the post, both the title and the post
 body were truncated at the point where U+1D319 had been (U+1D319 was also
 removed).

 (In addition, in the permalink field, WP generated something that was
 represented by the glyph for "invalid codepoint" [black diamond with ?
 inside] on Mac OS X, rather than successfully percent-encoding the glyph
 as WordPress does for other non-ASCII characters in post titles when
 generating permalinks.  And, although I manually percent-encoded the glyph
 for the URL, the permalink ended up being 404.  I suspect there's a whole
 host of places where unexpected glyphs cause problems?)

 On the one hand, this is very much a dataloss issue (I lost 1/3 of my
 post), but on the other hand it's probably not likely to happen often in
 real-world usage, so I've left priority and severity set to default values
 ;)

-- 
Ticket URL: <http://core.trac.wordpress.org/ticket/13590>
WordPress Trac <http://core.trac.wordpress.org/>
WordPress blogging software


More information about the wp-trac mailing list