[wp-trac] [WordPress Trac] #19033: Problem with Hebrew letter "Nun" hiding search results

WordPress Trac wp-trac at lists.automattic.com
Thu Nov 3 14:22:51 UTC 2011


#19033: Problem with Hebrew letter "Nun" hiding search results
----------------------------------------+----------------------------------
 Reporter:  shirgans                    |       Owner:  shir.gans@…
     Type:  defect (bug)                |      Status:  new
 Priority:  normal                      |   Milestone:  3.3
Component:  Charset                     |     Version:  3.2.1
 Severity:  critical                    |  Resolution:
 Keywords:  has-patch needs-unit-tests  |
----------------------------------------+----------------------------------

Comment (by azaozz):

 Replying to [comment:2 SergeyBiryukov]:
 > Looks like this has to do with `\s` in the regexp, similarly to #11528
 and [12501].

 Yes, we should be careful not to use `\s` in regexp anywhere as it grabs
 parts of utf-8 chars (not only in Hebrew).

 In this case it seems we are looking for word separators in the search
 string that was entered in a `<input type="text"` field. Perhaps `\r\n\t`
 should be stripped completely or even the string should be rejected if any
 of these are found, then we could use `\b`.

 There should be many examples of search string sanitization and handling,
 maybe we should look around a bit. For example chars like `!@#$%^&` are
 usually ignored, etc.

-- 
Ticket URL: <http://core.trac.wordpress.org/ticket/19033#comment:3>
WordPress Trac <http://core.trac.wordpress.org/>
WordPress blogging software


More information about the wp-trac mailing list