[wp-trac] [WordPress Trac] #44958: &nbsp character in title generates a permalink (and slug) with space

WordPress Trac noreply at wordpress.org
Wed Oct 11 18:10:28 UTC 2023


#44958: &nbsp character in title generates a permalink (and slug) with space
-------------------------------------------------+-------------------------
 Reporter:  ace2_heart                           |       Owner:  (none)
     Type:  defect (bug)                         |      Status:  assigned
 Priority:  normal                               |   Milestone:  6.4
Component:  Permalinks                           |     Version:  4.8
 Severity:  major                                |  Resolution:
 Keywords:  needs-testing has-patch has-         |     Focuses:
  testing-info has-screenshots                   |
-------------------------------------------------+-------------------------

Comment (by dmsnell):

 the regex is replacing `%e280` and then even more sequences with an
 optional percent sign but none of those are valid URL-encodings. do we
 have any examples of any system producing those outputs? At a minimum that
 should be interpreted/rendered as `�80` but it would be valid to simply
 ignore decoding / leave it as `%e280`.

 given that `%e2` by itself is invalid UTF-8/invalid URL encoding, it's a
 bit easier to discuss here, but I'd be worried about consuming too much
 input because it looks like it shouldn't be there and then further
 corrupting data that's otherwise fine. in other words, if we //don't//
 have any examples of systems producing output like `%e280` then it might
 be a good call for us to limit unintentional breakage by only replacing
 `%e2%80` and successive sequences that all carry the percent-sign.

-- 
Ticket URL: <https://core.trac.wordpress.org/ticket/44958#comment:24>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform


More information about the wp-trac mailing list