Google Published Detailed Specifications on Robots.txt

Nov 26, 2010 • 8:28 am | comments (3) by twitter Google+ | Filed Under Google Search Engine Optimization
 

Webmasters and SEOs have new reading material for this weekend. Google has published a very comprehensive and detailed Robots.txt Specifications, Robots meta tag and X-Robots-Tag HTTP header specifications and how to control crawling and indexing by GoogleBot.

There are two threads that I know of covering this new document. One is at WebmasterWorld and the other is at Google Webmaster Help.

Tedster said he learned at least one new thing from this new resource. He said:

Google will look for and obey an FTP robots.txt file located at ftp://example.com/robots.txt

PageOneResults added a highlight on:

Redirects will generally be followed until a valid result can be found (or a loop is recognized). We will follow a limited number of redirect hops (RFC 1945 for HTTP/1.0 allows up to 5 hops) and then stop and treat it as a 404.

What did you learn?

Forum discussion at WebmasterWorld and Google Webmaster Help.

Previous story: Google Places Bulk Import Not Working
 

Comments:

Rahul Batra

11/29/2010 08:08 am

Thanks for posting such a useful information. It is good to read about changes done by Google to be in touch and robots file are always a useful file to get the crawlers on the website.

seo packeges india

12/24/2010 02:32 pm

Hi this is an interesting post, thanks robots txt also known as the Robots Exclusion protocol or roborts.txt protocol, is a convention to prevent cooperating web spinders

Kaltech Solutions

06/27/2011 08:43 am

 robots.txt, meta tags and .htaccess plays a vital role in SEO.

blog comments powered by Disqus