[wp-trac] [WordPress Trac] #39190: RSS feed not valid: U+001A or 0x1a

WordPress Trac noreply at wordpress.org
Thu Dec 8 21:23:41 UTC 2016


#39190: RSS feed not valid: U+001A or 0x1a
--------------------------+-----------------------------
 Reporter:  pavelevap     |      Owner:
     Type:  defect (bug)  |     Status:  new
 Priority:  normal        |  Milestone:  Awaiting Review
Component:  TinyMCE       |    Version:
 Severity:  normal        |   Keywords:
  Focuses:                |
--------------------------+-----------------------------
 I am not sure how is it possible, but one of our users copy/paste text
 from another website into TinyMCE and content is now filled with strange
 non-visible Unicode characters `U+001A` / `0x1a` (`001a` in JSON
 response).

 It is not visible in TinyMCE or frontend, but it is saved in database and
 also RSS feed is considered as invalid (displayed without problem in
 browser, but characters are there).

 RSS feed is processed by other custom scripts and we are receiving
 following messages (different validators):

 `PCDATA invalid Char value 26`

 `Input is not proper UTF-8, indicate encoding! Bytes: 0x1A`

 I can replace those characters and add a check before saving content into
 database, but it should not be possible for users to break their feeds by
 copying content into TinyMCE?

--
Ticket URL: <https://core.trac.wordpress.org/ticket/39190>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform


More information about the wp-trac mailing list