[wp-trac] [WordPress Trac] #11175: wp_check_invalid_utf8() should drop invalid utf-8 chars only instead of truncating string

WordPress Trac wp-trac at lists.automattic.com
Thu May 13 21:42:30 UTC 2010


#11175: wp_check_invalid_utf8() should drop invalid utf-8 chars only instead of
truncating string
--------------------------+-------------------------------------------------
 Reporter:  sirzooro      |       Owner:  hakre                            
     Type:  defect (bug)  |      Status:  new                              
 Priority:  normal        |   Milestone:  Future Release                   
Component:  Charset       |     Version:  2.9                              
 Severity:  normal        |    Keywords:  has-patch tested needs-unit-tests
--------------------------+-------------------------------------------------
Changes (by westi):

  * milestone:  3.0 => Future Release


Comment:

 This is too risky to do without a lot of testing.

 Truncating is usually a better response to invalid UTF-8 because it is very
 hard to determine what the real cause of the invalidity is and where it is
 safe to start parsing the byte stream again.

 Moving to Future Release until we have some detailed tests for this.
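
 For illustration only, the two strategies discussed in this ticket can be
 sketched as below. This is a minimal Python sketch, not the PHP code of
 wp_check_invalid_utf8(); the function names truncate_at_invalid and
 drop_invalid and the sample bytes are hypothetical.

```python
def truncate_at_invalid(data: bytes) -> str:
    """Keep only the prefix that decodes cleanly as UTF-8."""
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the offset of the first offending byte,
        # so everything before it is known to be valid.
        return data[:err.start].decode("utf-8")


def drop_invalid(data: bytes) -> str:
    """Skip undecodable bytes and resume decoding after them."""
    return data.decode("utf-8", errors="ignore")


# Hypothetical input: a stray 0xFF byte in the middle of valid text.
sample = b"caf\xc3\xa9 \xff ok"
truncated = truncate_at_invalid(sample)  # stops before the bad byte
dropped = drop_invalid(sample)           # keeps the text after it too
```

 Dropping keeps more of the input, but it relies on the decoder guessing
 where valid text resumes. That guess is the risk described in the comment
 above, and it is the behaviour the requested unit tests would need to
 cover.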

-- 
Ticket URL: <http://core.trac.wordpress.org/ticket/11175#comment:11>
WordPress Trac <http://core.trac.wordpress.org/>
WordPress blogging software

