[wp-trac] [WordPress Trac] #18465: Prevent search engines from indexing wp-admin and wp-includes (was: robots.txt should tell Google to not index wp-admin and wp-includes)

WordPress Trac wp-trac at lists.automattic.com
Wed Dec 21 12:07:07 UTC 2011


#18465: Prevent search engines from indexing wp-admin and wp-includes
--------------------------+-----------------------
 Reporter:  Viper007Bond  |       Owner:  ryan
     Type:  enhancement   |      Status:  reopened
 Priority:  lowest        |   Milestone:  3.3
Component:  General       |     Version:  3.2.1
 Severity:  trivial       |  Resolution:
 Keywords:  has-patch     |
--------------------------+-----------------------
Changes (by joostdevalk):

 * status:  closed => reopened
 * resolution:  fixed =>


Comment:

 This is a valid problem but the "fix" doesn't actually fix it. While the
 addition to robots.txt blocks the crawler from opening the URL, a URL that
 cannot be opened CAN still be listed in the index if Google finds enough
 links pointing to it, see the note on
 [http://support.google.com/webmasters/bin/answer.py?hl=en&answer=156449
 this Google help page]. Example of this can be seen on my Dutch domain:

 https://www.google.com/search?q=site%3Ayoast.nl++inurl%3Awp-admin

 The solution is to not exclude the admin directory in robots.txt, but to
 send an X-Robots-Tag HTTP header of value noindex (the HTTP version of a
 robots meta tag) for the files in admin and for admin-ajax.php, will add a
 patch.

-- 
Ticket URL: <http://core.trac.wordpress.org/ticket/18465#comment:28>
WordPress Trac <http://core.trac.wordpress.org/>
WordPress blogging software


More information about the wp-trac mailing list