[wp-trac] [WordPress Trac] #54471: WordPress accepts non-alpahbetical characters in the URL

WordPress Trac noreply at wordpress.org
Fri Nov 19 01:49:14 UTC 2021


#54471: WordPress accepts non-alpahbetical characters in the URL
--------------------------+-----------------------------
 Reporter:  lubosr        |      Owner:  (none)
     Type:  defect (bug)  |     Status:  new
 Priority:  normal        |  Milestone:  Awaiting Review
Component:  General       |    Version:
 Severity:  critical      |   Keywords:
  Focuses:                |
--------------------------+-----------------------------
 Hello,

 Originally I have created a forum post to address this issue as I thought
 that this problem is affecting only my website. Here is the original post:
 https://wordpress.org/support/topic/issue-with-urls-and-extra-characters
 /#new-topic-0

 The wordpress accepts any of the following ( but not limited to)  url's
 for the same post and does not throw 404 error. Examples:

 https://exmaple.com/my-awesome-article- ( note the trailing - )
 https://exmaple.com/-my-awesome-article ( note the leading - )
 https://exmaple.com/my-awesome.article ( note the . instead of - )
 https://exmaple.com/my-awesome-article, ( note the trailing , )

 This issue has a potential for double content as the URL is distinct for
 all pages.

 First I have spend some time to rectify this issue by changing and saving
 permalink settings, changing theme, checking to apache redirection and
 site config file etc but to no avail.

 However, then I have discovered that many of the wordpress websites are
 plagued with the same issue. Here are some examples:
 Correct URL:
 https://techcrunch.com/2021/11/18/webcams-and-microphones-for-better-
 video-calls/
 Broken URL:
 https://techcrunch.com/2021/11/18/webcams-and-microphones.------for-
 better-video-calls/
 The issue is not limited to only "." in the url. See the "=" and ",". Here
 is another example from microsoft website:
 Correct URL:
 https://news.microsoft.com/transform/novartis-empowers-scientists-ai-
 speed-discovery-development-breakthrough-medicines/
 Broken URL:
 https://news.microsoft.com/transform/novartis-empowers-scientists===.=ai-
 speed-discovery.-------development-breakthrough-medicines,,,/

 Here is the wget output for the above URL as you can see no 404 has been
 shown:

 {{{
 wget https://news.microsoft.com/transform/novartis-empowers-scientists
 ===.=ai-speed-discovery.-------development-breakthrough-medicines,,,/
 --2021-11-19 12:39:02--  https://news.microsoft.com/transform/novartis-
 empowers-scientists===.=ai-speed-discovery.-------development-
 breakthrough-medicines,,,/
 Resolving news.microsoft.com (news.microsoft.com)... 141.193.213.21,
 141.193.213.20
 Connecting to news.microsoft.com
 (news.microsoft.com)|141.193.213.21|:443... connected.
 HTTP request sent, awaiting response... 200 OK
 Length: unspecified [text/html]
 Saving to: ‘index.html
 }}}

 The list of the websites goes on and on as I have tested multiple portals.

-- 
Ticket URL: <https://core.trac.wordpress.org/ticket/54471>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform


More information about the wp-trac mailing list