Yahoo's Crawler Not Listening To Robots.txt Directive?

Sep 19, 2011 - 9:04 am 3 by
Filed Under Yahoo SEO

Yahoo SlurpA WebmasterWorld thread reports that Yahoo may not be fully listening to the robots.txt directive to block their spider, Yahoo Slurp.

The thing is, Yahoo spider isn't all that active these days - because Bing is now powering much of Yahoo and thus BingBot is most active.

The webmaster said:

Depending on the Host and UA, the official Yahoo! Slurp apparently does whatever it wants to. Note the subtle differences in the subdomains and UAs...

This morning, the only Host to read/heed robots.txt was: [] Mozilla/5.0 (compatible; Yahoo! Slurp;

These retrieved graphics by the pageful, over 60 total: [] [] Mozilla/5.0 (compatible; Yahoo! Slurp/3.0;

I am not sure if this is a widespread issue or something that is just a smaller bug.

The main question is, should you care of Yahoo is crawling your site when Bing is? That discussion is also taking place in the forum thread. The answer is, it depends.

Forum discussion at WebmasterWorld.


Popular Categories

The Pulse of the search community


Search Video Recaps

Google Weekend Volatility, Google On Search Leak, Elizabeth Tucker Interview & Apple Intelligence - YouTube
Video Details More Videos Subscribe to Videos

Most Recent Articles

Google Updates

Google Father's Day Weekend Search Ranking Volatility

Jun 16, 2024 - 8:32 am
Search Forum Recap

Daily Search Forum Recap: June 14, 2024

Jun 14, 2024 - 10:00 am
Search Video Recaps

Search News Buzz Video Recap: Google Weekend Volatility, Google On Search Leak, Elizabeth Tucker Interview, Apple Intelligence & More

Jun 14, 2024 - 8:01 am

Google Tests Multiple Featured Snippets Under From Sources Across The Web

Jun 14, 2024 - 7:51 am
Google Search Engine Optimization

Google: Sometimes Search Experiments Conflict Causing Issues

Jun 14, 2024 - 7:41 am
Google Maps

Google Business Profiles Websites No Longer Load - 404

Jun 14, 2024 - 7:31 am
Previous Story: Bing Uses User Search History To Adapt Your Search Results