[wp-trac] Re: [WordPress Trac] #3517: WordPress should be 100% UTF-8

WordPress Trac wp-trac at lists.automattic.com
Thu Jan 4 07:54:39 GMT 2007


#3517: WordPress should be 100% UTF-8
---------------------+------------------------------------------------------
 Reporter:  sehh     |        Owner:  anonymous
     Type:  defect   |       Status:  new      
 Priority:  normal   |    Milestone:  2.2      
Component:  General  |      Version:  2.0.5    
 Severity:  major    |   Resolution:           
 Keywords:  UTF-8    |  
---------------------+------------------------------------------------------
Comment (by sehh):

 I am sorry, but I believe the issue is not just about storing/retrieving.
 Storing and retrieving UTF-8 data with a Latin character set is buggy, to say
 the least. The issue here is that all character-related functions break.

 So if you store the character U+9AA8, you'll retrieve it correctly as
 U+9AA8.

 BUT if you try to sort the data or count its characters, you'll get wrong
 values.

 For example, the character count for U+9AA8 should be 1; instead it is 3,
 because the function sees each of the three UTF-8 bytes as a separate
 Latin character.
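
 To illustrate the character-function breakage described above, here is a
 minimal Python sketch. It is not WordPress or MySQL code; it only assumes
 that a layer configured for a Latin character set treats every byte as one
 character, which is modelled here by decoding UTF-8 bytes as latin-1. All
 variable names are made up for the illustration.

```python
# Hypothetical illustration only: models a latin1-configured layer by
# decoding UTF-8 bytes as latin-1 (one character per byte).

ch = "\u9aa8"                       # the single character U+9AA8
utf8_bytes = ch.encode("utf-8")     # three bytes: E9 AA A8

# What the latin1 layer "sees" when handed those bytes.
as_latin1 = utf8_bytes.decode("latin-1")

# Storing and retrieving is lossless, because latin-1 maps every byte
# value to exactly one code point and back again.
round_trip = as_latin1.encode("latin-1").decode("utf-8")

# Character functions are where it breaks: the length is counted in bytes.
char_count_utf8 = len(ch)             # counted as real characters
char_count_latin1 = len(as_latin1)    # counted as latin1 "characters"

# Case conversion breaks too: U+00E9 is stored as bytes C3 A9, and the
# latin1 view of those bytes contains no lowercase letter to convert.
e_acute = "\u00e9"
upper_utf8 = e_acute.upper()
upper_latin1 = (
    e_acute.encode("utf-8").decode("latin-1").upper()
    .encode("latin-1").decode("utf-8")
)

print(round_trip == ch)                        # storage round trip
print(char_count_utf8, char_count_latin1)      # character counts
print(upper_utf8 == "\u00c9", upper_latin1 == "\u00c9")
```

 The round trip succeeds while the length and the uppercase conversion do
 not, which is the difference between "storing/retrieving works" and
 "character functions work".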


 Now, about the issue with other character sets. I don't know anyone who
 would rather use anything other than UTF-8. Unicode is now a worldwide
 standard, and all current operating systems run in full Unicode
 (Windows, Mac OS X, Linux, *BSD, OS/2, even BeOS and other more obscure
 systems). Of course I don't know everything, so I'm open to suggestions
 here and to reasons why someone would want to use a non-Unicode encoding.

 Remember that only Unicode allows you to display multiple languages on the
 same page.

-- 
Ticket URL: <http://trac.wordpress.org/ticket/3517#comment:8>
WordPress Trac <http://trac.wordpress.org/>
WordPress blogging software

