[wp-hackers] Duplicate Content Issue - URLs work with dots and colons in it

Shea Bunge info at bungeshea.com
Sat Aug 31 13:51:31 UTC 2013


@Baxi
I agree that this is strange behaviour and that it should probably be fixed. I was only pointing out that the rel="canonical" link resolves the duplicate content issue as far as Google is concerned.
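
To make the idea concrete, here is a minimal Python sketch of how several URL variants can collapse to one canonical URL, which is then advertised in a <link rel="canonical"> tag. This is not WordPress code: the helper names (canonical_url, canonical_link_tag) are made up for illustration, and it assumes the only difference between variants is dots or colons at the end of a path segment.

```python
import re
from html import escape
from urllib.parse import urlsplit, urlunsplit

# Assumption for this sketch: variants differ only by dots/colons
# appended to the end of a path segment.
_TRAILING_JUNK = re.compile(r"[.:]+$")


def canonical_url(url: str) -> str:
    """Hypothetical helper: strip trailing dots and colons from each path segment."""
    parts = urlsplit(url)
    segments = [_TRAILING_JUNK.sub("", seg) for seg in parts.path.split("/")]
    # Query string and fragment are dropped to keep the sketch small.
    return urlunsplit((parts.scheme, parts.netloc, "/".join(segments), "", ""))


def canonical_link_tag(url: str) -> str:
    """Hypothetical helper: build the tag that names the preferred URL."""
    return '<link rel="canonical" href="%s" />' % escape(canonical_url(url), quote=True)


# Every variant from the thread yields the same tag.
for variant in (
    "http://ma.tt/2010/11/one-point-oh/",
    "http://ma.tt/2010/11/one-point-oh./",
    "http://ma.tt/2010/11/one-point-oh:/",
    "http://ma.tt/2010/11/one-point-oh...../",
):
    print(canonical_link_tag(variant))
```

The page is still served at every variant URL; the tag only tells a crawler which one to index.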


> Date: Sat, 31 Aug 2013 09:09:20 -0400
> From: david at ernsts.us
> To: wp-hackers at lists.automattic.com
> Subject: Re: [wp-hackers] Duplicate Content Issue - URLs work with dots and colons in it
> 
> You'd prefer your visitors see a 404 page? What's the benefit?
> 
> On Saturday, August 31, 2013, Baki Goxhaj wrote:
> 
> > @Shea,
> >
> > rel="canonical" or not, this seems like an undesirable behaviour to have in
> > WordPress. The urls should not work at all with those dots and colons -
> > shouldn't that be the case?
> >
> > Kindly,
> >
> > Baki Goxhaj
> > about.me/banago
> >
> >
> > On Thu, Aug 29, 2013 at 1:03 PM, Abdussamad Abdurrazzaq <abdussamad at abdussamad.com> wrote:
> >
> > > Yep but only on single posts. What about categories?
> > >
> > > http://wplancer.com/category/code..../
> > >
> > > I suppose we would have to install one of those SEO plugins.
> > >
> > >
> > > On 08/29/2013 02:58 PM, Shea Bunge wrote:
> > >
> > >> If you look at the source of that page, you will see that there is a
> > >> <link rel="canonical"> pointing to the original post URL. This tells Google
> > >> that the original URL is preferred, no matter what URL the page is accessed
> > >> from.
> > >>
> > >>
> > >>> Date: Thu, 29 Aug 2013 11:51:38 +0200
> > >>> From: banago at gmail.com
> > >>> To: wp-hackers at lists.automattic.com
> > >>> Subject: [wp-hackers] Duplicate Content Issue - URLs work with dots and colons in it
> > >>>
> > >>> Hi guys,
> > >>>
> > >>> URLs with a dot, a colon, or several of them appended still work, and Google
> > >>> treats them as duplicate content. Here is an example:
> > >>>
> > >>> Original: http://ma.tt/2010/11/one-point-oh/
> > >>> With a dot: http://ma.tt/2010/11/one-point-oh./
> > >>> With a colon: http://ma.tt/2010/11/one-point-oh:/
> > >>> With many dots: http://ma.tt/2010/11/one-point-oh...../
> > >>> <http://wplancer.com/a-beginners-guide-to-the-command-line........../>
> > >>>
> > >>> All of these URLs work, and if any of them mistakenly ends up on the site,
> > >>> Google will consider it duplicate content.
> > >>>
> > >>> Is this a bug? If not, why is this happening?
> > >>>
> > >>> PS: I wrote about this in the support forums, but the post is full of
> > >>> typos, which might be why I had no replies there:
> > >>> http://wordpress.org/support/topic/dublicate-content-url-works-with-tots-and-columns-in-it
> > >>>
> > >>> Kindly,
> > >>>
> > >>> Baki Goxhaj
> > >>> about.me/banago
> > >>> _______________________________________________
> > >>> wp-hackers mailing list
> > >>> wp-hackers at lists.automattic.com
> > >>> http://lists.automattic.com/mailman/listinfo/wp-hackers
> > >>>
 		 	   		  

