[wp-trac] [WordPress Trac] #11738: sanitize_text_field() issue with UTF-8 characters

WordPress Trac wp-trac at lists.automattic.com
Wed Jan 6 23:21:43 UTC 2010


#11738: sanitize_text_field() issue with UTF-8 characters
--------------------------+-------------------------------------------------
 Reporter:  hakre         |       Owner:  hakre      
     Type:  defect (bug)  |      Status:  new        
 Priority:  normal        |   Milestone:  2.9.2      
Component:  Charset       |     Version:  2.9.1      
 Severity:  major         |    Keywords:  needs-patch
--------------------------+-------------------------------------------------

Comment(by azaozz):

 Replying to [comment:3 Denis-de-Bernardy]:
 > per the php doc:
 >
 > http://php.net/manual/en/reference.pcre.pattern.modifiers.php
 >
 > > u (PCRE8)
 > ...
 > So it's available all the time given the current requirements. And we'd
 even get validity if we bump the requirement to php 4.3.5 or php 4.4.

 It seems it should be but as far as I remember we had some issues with it
 in the past. Also PCRE (the library) can be build without UTF-8 support,
 in fact it seems it's disabled by default:

   "If you want to make use of the support for UTF-8 Unicode character
 strings in
   PCRE, you must add --enable-utf8 to the "configure" command. Without it,
 the
   code for handling UTF-8 is not included in the library. Even when
 included,
   it still has to be enabled by an option at run time. When PCRE is
 compiled
   with this option, its input can only either be ASCII or UTF-8..."

 http://www.pcre.org/readme.txt

-- 
Ticket URL: <http://core.trac.wordpress.org/ticket/11738#comment:8>
WordPress Trac <http://core.trac.wordpress.org/>
WordPress blogging software


More information about the wp-trac mailing list