Google Published Detailed Specifications on Robots.txt

Nov 26, 2010 - 8:28 am 3 by

Webmasters and SEOs have new reading material for this weekend. Google has published a very comprehensive and detailed Robots.txt Specifications, Robots meta tag and X-Robots-Tag HTTP header specifications and how to control crawling and indexing by GoogleBot.

There are two threads that I know of covering this new document. One is at WebmasterWorld and the other is at Google Webmaster Help.

Tedster said he learned at least one new thing from this new resource. He said:

Google will look for and obey an FTP robots.txt file located at ftp://example.com/robots.txt

PageOneResults added a highlight on:

Redirects will generally be followed until a valid result can be found (or a loop is recognized). We will follow a limited number of redirect hops (RFC 1945 for HTTP/1.0 allows up to 5 hops) and then stop and treat it as a 404.

What did you learn?

Forum discussion at WebmasterWorld and Google Webmaster Help.

 

Popular Categories

The Pulse of the search community

Follow

Search Video Recaps

 
Video Details More Videos Subscribe to Videos

Most Recent Articles

Search Video Recaps

Search News Buzz Video Recap: Google Core Update Updates, Site Reputation Abuse Coming, Links, Ads & More

Apr 26, 2024 - 8:01 am
Google Search Engine Optimization

Google Publisher Center No Longer Allows Adding Publications

Apr 26, 2024 - 7:51 am
Google

Google Tests Placing The Snippet Date Next To URL

Apr 26, 2024 - 7:41 am
Google

Google Breaks Out Googlebot IP Ranges For User-Triggered Fetchers

Apr 26, 2024 - 7:31 am
Google News

Google Ad Revenue Up 13% & Bing Ads Revenue Up 12%

Apr 26, 2024 - 7:21 am
Google Ads

Google Ads Diagnostic Tool Low Keyword Quality Warning

Apr 26, 2024 - 7:11 am
Previous Story: Google Places Bulk Import Not Working