Google Indexing Unlinked Pages?

Feb 19, 2008 - 9:26 am 12 by

In October, a WebmasterWorld member has been monitoring his website and has noticed that Google is beginning to index pages that seem unlinked from anywhere. He suspects that either Google is using Toolbar data, someone is linking to these pages deliberately, or he may have caused the pages to be linked somehow (but he's pretty sure that's not the case).

Others notice similar "creative" spidering, and as Tedster puts it, "googlebot [is] trying to eat almost anything that might be edible in even the least way."

After investigating further, this is what appears to be happening, according to the member who discovered the issue:

- Googlebot is spidering GET forms by getting the form variables and either leaving them blank or assigning values to them (sometimes taken from options in the form itself) - Google has a list of words present on the site - This list of words is being used to populate the form variables, and the URL requested via GET

It seems that it's now February and the same member found similar strange activity within Google. The site has over 519,000 spider requests from Google since the end of October. He believes that Googlebot is adding the GET data by itself, either by accident or to discover new content. What do you think it is?

Forum discussion continues at WebmasterWorld.

 

Popular Categories

The Pulse of the search community

Search Video Recaps

 
- YouTube
Video Details More Videos Subscribe to Videos

Most Recent Articles

Search Forum Recap

Daily Search Forum Recap: January 15, 2025

Jan 15, 2025 - 10:00 am
Google Search Engine Optimization

Google Third-Party Review Boxes Algorithmically Selected

Jan 15, 2025 - 7:51 am
Google

Google Search Generative AI For People Also Search For

Jan 15, 2025 - 7:41 am
Bing Ads

Bing Ads - See More Links (Sitelinks)

Jan 15, 2025 - 7:31 am
Google

Google Search Tests Sitelinks Carousels On Desktop

Jan 15, 2025 - 7:21 am
Google

Google Search Video Tab With Continuous Scroll

Jan 15, 2025 - 7:11 am
Previous Story: How Google May Treat Boilerplate Content