[wp-trac] [WordPress Trac] #5998: Invalid Unicode characters

WordPress Trac wp-trac at lists.automattic.com
Mon Feb 25 18:17:23 GMT 2008


#5998: Invalid Unicode characters
----------------------+-----------------------------------------------------
 Reporter:  shelleyp  |       Owner:  anonymous            
     Type:  defect    |      Status:  new                  
 Priority:  normal    |   Milestone:  2.5                  
Component:  General   |     Version:  2.3.3                
 Severity:  normal    |    Keywords:  unicode invalid xhtml
----------------------+-----------------------------------------------------
 Wordpress does not check for invalid Unicode characters, such as the
 following:

 U+FFFE
 U+FFFF

 When the pages are served up as XHTML, allowing these characters through
 generates an XML error.

 WordPress should filter out illegal Unicode code points.

 Please see http://www.w3.org/TR/REC-xml/#NT-Char

 Also, the regex
 [http://www.w3.org/International/questions/qa-forms-utf-8 here] is
 incorrect, see [http://intertwingly.net/blog/2008/01/02/Keeping-On-Your-
 Toes
 this page].

-- 
Ticket URL: <http://trac.wordpress.org/ticket/5998>
WordPress Trac <http://trac.wordpress.org/>
WordPress blogging software


More information about the wp-trac mailing list