[wp-trac] [WordPress Trac] #35293: Emoji Regex in wp_encode_emoji() is wildly inaccurate

WordPress Trac noreply at wordpress.org
Wed Aug 2 07:06:44 UTC 2017


#35293: Emoji Regex in wp_encode_emoji() is wildly inaccurate
--------------------------+-----------------------
 Reporter:  pento         |       Owner:  pento
     Type:  defect (bug)  |      Status:  reopened
 Priority:  normal        |   Milestone:  4.9
Component:  Emoji         |     Version:  4.2
 Severity:  normal        |  Resolution:
 Keywords:                |     Focuses:
--------------------------+-----------------------

Comment (by pento):

 == TEST 3

 Convert the regex to match against UTF-32 byte patterns. My theory is that
 fixed-width characters (4 bytes each) will be faster to match, as they're
 also faster to process in the `mb_*()` functions.
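
 As a rough illustration of what matching against UTF-32 byte patterns
 means, here is a minimal sketch in Python. It is not the PHP code that was
 benchmarked. The function name `encode_emoji_utf32` is hypothetical, and
 the pattern covers only the Emoticons block (U+1F600 to U+1F64F) as a
 simplified stand-in for the real emoji list. The second alternative in the
 regex consumes any other 4-byte character, which keeps every match aligned
 to a character boundary.

```python
import re

# Hypothetical, simplified stand-in for the real emoji list: only the
# Emoticons block (U+1F600-U+1F64F), written as UTF-32BE bytes.
# The second alternative consumes any other 4-byte character so that
# matches never start in the middle of a character.
UTF32_EMOJI = re.compile(rb"(\x00\x01\xf6[\x00-\x4f])|.{4}", re.DOTALL)


def encode_emoji_utf32(text: str) -> str:
    """Replace emoji in the simplified range with HTML hex entities."""
    # UTF-32BE is fixed width: 4 bytes per code point, no byte order mark.
    data = text.encode("utf-32-be")
    out = []
    for match in UTF32_EMOJI.finditer(data):
        code_point = int.from_bytes(match.group(0), "big")
        if match.group(1) is not None:
            # Emoji in the assumed range: emit an HTML entity.
            out.append(f"&#x{code_point:x};")
        else:
            # Any other character passes through unchanged.
            out.append(chr(code_point))
    return "".join(out)
```

 For example, `encode_emoji_utf32("hi \U0001F600")` returns
 `hi &#x1f600;`. Without the alignment alternative, the bytes of U+01F6
 followed by any other character would be mistaken for an emoji, because
 the pattern could match one byte into the string.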

 Data: https://travis-ci.org/pento/test-41501/builds/260073851

 Analysis:
 - The UTF-32 version is slower across all PHP versions, string lengths,
 languages, and levels of emoji usage.

 Conclusion: Test 3 should not be included.

--
Ticket URL: <https://core.trac.wordpress.org/ticket/35293#comment:29>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform

