[wp-trac] Re: [WordPress Trac] #3517: WordPress should be 100% UTF-8
WordPress Trac
wp-trac at lists.automattic.com
Thu Jan 4 07:54:39 GMT 2007
#3517: WordPress should be 100% UTF-8
---------------------+------------------------------------------------------
Reporter: sehh | Owner: anonymous
Type: defect | Status: new
Priority: normal | Milestone: 2.2
Component: General | Version: 2.0.5
Severity: major | Resolution:
Keywords: UTF-8 |
---------------------+------------------------------------------------------
Comment (by sehh):
I am sorry but i believe the issue is not just about storing/retreiving.
Storing and retreiving UTF-8 data with latin character set is buggy to say
the least. The issue here is that all character related functions break.
So if you store the character U+9AA8, you'll retreive it correctly as
U+9AA8.
BUT if you try to SORT() or COUNT() you'll get wrong values.
For example, COUNT() for U+0041 should return 1, instead it returns 6
because the function sees the character as "U", "+", etc.
Now, about the issue with other character sets. I don't know anyone who
would rather use anything other than UTF-8. Unicode is now a worldwide
standard for everything, all current operating systems run in full Unicode
(Windows, MacOSX, Linux, *BSD, OS/2 even BeOS and other more obscure
systems). Ofcourse i don't know everything, so i'm open to suggestions
here and reasons why someone would want to use non-unicode.
Remember that only unicode allows you to display multiple languages on the
same page.
--
Ticket URL: <http://trac.wordpress.org/ticket/3517#comment:8>
WordPress Trac <http://trac.wordpress.org/>
WordPress blogging software
More information about the wp-trac
mailing list