
Multiple Robots.txt Files for Single Domain

A HighRankings Forum thread asks why some people use more than one robots.txt file to instruct search spiders how to crawl and access their content. That is a good question. Typically, spiders only honor the robots.txt file found at the root level. So technically, if you place a robots.txt in a subdirectory, the search engine will likely ignore it. I do not believe the same applies to subdomains, since each subdomain has its own root level.
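As a rough illustration of the root-level rule, here is a minimal Python sketch that works out which robots.txt location applies to a given page. The function name `robots_txt_url` and the example.com hostnames are made up for illustration; the sketch only assumes that a crawler looks for the file at the root of whatever host serves the page.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_txt_url(page_url: str) -> str:
    """Return the root-level robots.txt URL for the host serving page_url."""
    parts = urlsplit(page_url)
    # Keep the scheme and host (including any subdomain or port),
    # and discard the page's own path, query string, and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# Hypothetical pages: two on the main host, one on a subdomain.
for page in (
    "http://example.com/index.html",
    "http://example.com/blog/2009/04/post.html",
    "http://blog.example.com/archive/post.html",
):
    print(page, "->", robots_txt_url(page))
```

Both pages on example.com map to the same root file no matter how deep the directory is, while the page on blog.example.com maps to that subdomain's own root file.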

HighRankings administrator, Randy, said:

robots.txt anywhere but the Root level will be ignored by the spiders. In fact it would surprise me if it's ever even queried. robots.txt is not like .htaccess where you can control things on a per directory level.

The only way a subdirectory robots.txt might be valid is the rare case where someone has a domain name parked on a subdirectory of another domain. Or possibly if the subdirectory is really a subdomain, though that one too is questionable in my mind and isn't something I've tested to see if spiders look for a robots.txt for each subdomain.
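To illustrate Randy's point, per-directory control still has to come from path rules inside the single root file rather than from extra files dropped into each directory. The sketch below uses Python's standard `urllib.robotparser` to check a made-up root robots.txt; the rules, the `/private/` directory, and the example.com URLs are assumptions for illustration only.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical contents of the one robots.txt at the site root.
ROOT_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
# parse() takes the file's lines directly, so nothing is fetched over the network.
parser.parse(ROOT_ROBOTS_TXT.splitlines())

# The root file's path rule blocks one directory and leaves the other open.
blocked = parser.can_fetch("*", "http://example.com/private/report.html")
allowed = parser.can_fetch("*", "http://example.com/public/report.html")
print("private allowed:", blocked)
print("public allowed:", allowed)
```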

I love what Ron Carnell added:

FWIW, I almost always back up a file before modifying it. My ex-wife always said I had trust issues? At any rate, I probably have a few copies of robots.txt laying around on more than a few sites. I don't worry about it because, as you pointed out, the only one that counts is in the root.

I believe Google often uses individual sitemaps per subdomain, to control their content.

Forum discussion at HighRankings Forum.




Posted by rustybrick in Search Engine Optimization on April 1, 2009 at 8:13 AM. Comments (3)

Comments

Google does look at robots.txt on a subdomain. Remember that Google treats a subdomain as a new domain. JohnMu in Google Webmasters Help group too said this, but I am too lazy to find that post right now.

 

Yes, subdomain robots.txt is just as important as root robots.txt.

 

I had different robots.txt on subdomains than on root (blank) for years and Google never indexed forbidden sites.

 
