Entries from Search Engine Roundtable tagged with 'crawler'

GoogleBot Can Also Crawl Too Much & Be Nasty

The other day, I noticed a thread at Google Webmaster Help where a webmaster was complaining that he was being hit hard by GoogleBot. In short, Google's spider was crawling his site very aggressively. He said: After...

Google Crawls Robots.txt Files Daily

JohnMu from Google posted in a Google Webmaster Help thread that Google typically crawls a site's robots.txt file on a daily basis. This is the first time (at least that I can remember) I have seen a Googler make a...

Stop GoogleBot From Indexing You At Busy Times

A Google Webmaster Help thread has a member upset that Google is crawling his site during times when his server is overloaded. Is there a way to tell GoogleBot to stay away during these times? JohnMu of Google said, yes...

Google's Video Crawler Not Respecting Robots.txt Directives?

It seems like we have confirmed reports from a Googler in Google Groups that Google's video crawler, part of the GoogleBot family, is not playing nice. In short, even though you may be telling Google not to crawl your videos,...

Is Google's AdWords Spider Lowercasing Destination URLs?

A WebmasterWorld thread has an advertiser complaining that Google's AdWords spider appears to be lowercasing the destination URLs they have. The thing is, the lowercase URLs for this webmaster don't work with the site and they don't have the time...

Microsoft Live Search Adds HTTP Compression & Conditional Gets Support to Crawler

The Live Search Blog announced several updates to its crawler. The first is a name change to reflect the upgrade: previously named msnbot/1.0, it is now named msnbot/1.1. The bulk of the changes include the HTTP Compression and Conditional Get...

Ask.com Fixes Crawler Issue With Badly-Formed URLs

Earlier this week, we reported on "Ask.com Crawler Inserting Url-Encoded Spaces in URLs Causing 404 Errors." In short, Ask.com's crawlers were crawling badly formed URLs, causing tons of 404 errors in web server log files. Vivek Pathak, Ask.com's Infrastructure Product...

Ask.com Crawler Inserting Url-Encoded Spaces in URLs Causing 404 Errors?

In a WebmasterWorld thread, several webmasters report that Ask.com's crawler has recently been generating tons of 404 (file not found) errors on their sites. The issue appears to stem from Ask.com automatically inserting URL-encoded spaces into the URL. URL-encoded...
