[wp-hackers] SEO, How to turn off paged comments for search engine crawlers?

Mike Schinkel mikeschinkel at newclarity.net
Sun Nov 8 21:31:01 UTC 2009


On Nov 8, 2009, at 11:50 AM, Joost de Valk wrote:
> 6 is not going to work: canonical is meant to distinguish the one  
> "true" (canonical) version of a document from copies of it. Giving  
> each comment its own page would make each comment completely  
> different from the original article, and thus wouldn't work with  
> the  canonical tag.

You misunderstood my suggestion. I apologize for not stating it  
clearly enough.  I'll use examples this time (these URLs are examples,  
not exact):

Assume a post called "My Post" with 25 comments and 10 comments per  
page.  You'd potentially have URLs of:

http://www.example.com/my-post/
http://www.example.com/my-post/comments/page-1/
http://www.example.com/my-post/comments/page-2/

Each of these would have a rel=canoncial that pointed to:

http://www.example.com/my-post/

In addition, you'd have 25 URL that look like this:

http://www.example.com/my-post/comments/1/
http://www.example.com/my-post/comments/2/
http://www.example.com/my-post/comments/3/
...
http://www.example.com/my-post/comments/25/

These pages would each have just one comment and would have a title  
being the first sentence of the comment and these pages would have  
links back to the post and they would NOT use rel=canonical.

There is significant benefit to this approach:

-- More pages means more SEO surface area,

-- More comments mean more URLs pointing back to the main post each  
passing a bit of page rank,

-- Unique pages for the comments so that really good comments can draw  
search engine traffic of their own,

-- Unique comment pages allow people on social media and blogs to link  
directly to specific comments without confusing the people following  
the links with the other comments, and 

-- In general, whenever possible, it's good to have a real link to  
each unique piece of content and a comment is a unique piece of content.

I hope that clarifies why my 6th option was not invalid as it appeared  
to be given my initial terse suggestion.


-Mike Schinkel
WordPress Custom Plugins
http://mikeschinkel.com/custom-wordpress-plugins/

P.S.  We could also <IFRAME> the canonical URL (i.e. http://www.example.com/my-post/ 
) and use HTTP_REFERER to known when to serve up only the raw content  
and thus allow all URLs shown above to have the content displayed for  
context but not have it duplicated anywhere.




More information about the wp-hackers mailing list