[wp-trac] [WordPress Trac] #19368: UTF-8 characters truncated mid-byte sequence in excerpt in RSS2 feed

WordPress Trac wp-trac at lists.automattic.com
Sun Nov 27 06:05:59 UTC 2011


#19368: UTF-8 characters truncated mid-byte sequence in excerpt in RSS2 feed
--------------------------+------------------------------
 Reporter:  kurtmckee     |       Owner:
     Type:  defect (bug)  |      Status:  new
 Priority:  normal        |   Milestone:  Awaiting Review
Component:  Feeds         |     Version:
 Severity:  normal        |  Resolution:
 Keywords:                |
--------------------------+------------------------------
Changes (by dd32):

 * component:  General => Feeds


Comment:

 The template for the RSS feed is this one:
 http://core.trac.wordpress.org/browser/trunk/wp-includes/feed-rss2.php

 the description element uses [http://core.trac.wordpress.org/browser/trunk
 /wp-includes/feed.php#L168 the_excerpt_rss()] which ultimately uses
 [http://core.trac.wordpress.org/browser/trunk/wp-
 includes/formatting.php#L1879 wp_trim_excerpt] to generate the excerpt.

 That looks multi-byte safe to me, as it's only splitting on "\r\n\t "..
 I've not tested anything here, just traced it for you

-- 
Ticket URL: <http://core.trac.wordpress.org/ticket/19368#comment:1>
WordPress Trac <http://core.trac.wordpress.org/>
WordPress blogging software


More information about the wp-trac mailing list