Bing's MSNBot Crawling Fake File Names?

Dec 28, 2009 - 8:41 am 6 by
Filed Under Bing Search

A WebmasterWorld thread and an older Bing Forums thread has discussion from webmasters over the issue of Microsoft Bing's web crawler, MSNBot, crawling file names that do not exist on a specific site.

This reminders me of the ongoing issue of Bing creating fake referrals in webmaster log files. This has been going on for years, where Microsoft claims they have fixed it, but never really has.

In this specific case, it seems like Bing is creating file names on a specific site to crawl. Wel, they are not creating files, just trying to fetch pages that do not and never have existed on a specific site. I am not sure if this is a Bing issue or a webmaster issue.

A long time WebmasterWorld member explained the issue:

In what is apparently a rather old bad behavior, msnbot has a practice of regularly requesting totally manufactured URIs that appear to be designed to trigger 404 errors. Here are two sample log entries of the two styles of bogus URIs msnbot requests:

'65.55.207.126'¦Tue, 15 Dec 2009 20:39:49 -0500¦'msnbot/2.0b (+http://search.msn.com/msnbot.htm)'¦'*/*'¦'/ADBF3C7AB534E8356F30D8AC05291640_00000.temp019f.html'¦'' '65.55.207.28'¦Wed, 16 Dec 2009 05:46:22 -0500¦'msnbot/2.0b (+http://search.msn.com/msnbot.htm)'¦'*/*'¦'/000166709_00001.temp00be.html'¦''

The requests ALWAYS take on one of the formats above starting with either a 32byte GUID or a nine digit integer.

In the Bing thread, another person said:

For many many years, msnbot has been crawling my sites looking for files that have never existed... i'm trying to figure out why... the filenames have changed slightly in recent times but they have been similar in structure since the beginning... they are something like 000092601_00002.temp0001.htm... in other words, 9 numbers underscore 5 numbers dot temp 4 numbers dot htm... the search for these is all over my server's directory tree...

I'll emphasize once more that these files have never existed on my site and i have no clue how msnbot may have picked them up...

Honestly, I feel bad that I am always beating up on Microsoft. I know they are new to the game, when you compare them to Google. But I have to report these issues.

Forum discussion at WebmasterWorld & Bing Forums.

 

Popular Categories

The Pulse of the search community

Follow

Search Video Recaps

 
Google Core Update Flux, AdSense Ad Intent, California Link Tax & More - YouTube
Video Details More Videos Subscribe to Videos

Most Recent Articles

Google Updates

Google March Core Update Still Rolling Out & Heated SEO Chatter Continue

Apr 25, 2024 - 7:51 am
Google

Report: How Prabhakar Raghavan Killed Google Search

Apr 25, 2024 - 7:41 am
Google Search Engine Optimization

Google Favicon Documentation Adds Rel Attribute Value Definitions

Apr 25, 2024 - 7:31 am
Google Ads

Google Ads API Version 16.1 Now Available

Apr 25, 2024 - 7:21 am
Google Search Engine Optimization

Google: Splitting & Merging Sites Takes Longer Than Normal Site Migrations

Apr 25, 2024 - 7:11 am
Search Forum Recap

Daily Search Forum Recap: April 24, 2024

Apr 24, 2024 - 4:00 pm
Previous Story: 60% of U.S. Government's Data on Google Servers? Nope