[wp-trac] Re: [WordPress Trac] #6077: UTF-8 strings are sometimes cut in the middle of a character

WordPress Trac wp-trac at lists.automattic.com
Thu Mar 6 04:08:17 GMT 2008


#6077: UTF-8 strings are sometimes cut in the middle of a character
---------------------------------------------+------------------------------
 Reporter:  nbachiyski                       |        Owner:  anonymous
     Type:  defect                           |       Status:  reopened 
 Priority:  normal                           |    Milestone:  2.5      
Component:  General                          |      Version:           
 Severity:  normal                           |   Resolution:           
 Keywords:  unicode utf-8 excerpt has-patch  |  
---------------------------------------------+------------------------------
Comment (by tenpura):

 According to the PHP user notes, html_entity_decode() has a bug with
 UTF-8.
 Maybe we shoud create substitute function?

 bug should be reproduced with this code before PHP 5.0.1:
 {{{
 echo html_entity_decode('€', ENT_QUOTES, 'UTF-8');
 }}}

-- 
Ticket URL: <http://trac.wordpress.org/ticket/6077#comment:6>
WordPress Trac <http://trac.wordpress.org/>
WordPress blogging software


More information about the wp-trac mailing list