Does Google Index Content in "The Cloud" (Amazon S3, etc)

May 14, 2008 - 7:56 am 1 by

Cloud computing is becoming more and more popular amongst webmasters and site owners. In short, companies like Amazon, RackSpace, Google and others are offering hosting services where you upload your content (html, images, videos, pdfs, etc.) to a web server, that web server then replicates that content onto other web servers - so if you think about it, your content is not just on one server, with limited resources and bandwidth, but on dozens (or more) of servers with virtually unlimited bandwidth and resources.

Duplicate content issue? Nope. There is only one URL for that content (unless you generate multiple URLs for the same content yourself) but Amazon S3, for example, doesn't create a duplicate content issue.

One webmaster at WebmasterWorld is complaining that Google Image search doesn't seem to be indexing the images he has hosted over at Amazon S3. But honestly, I think it is just a timing issue for him.

If you conduct a site command on site:s3.amazonaws.com, the location of the S3 content, you will find hundreds of thousands of results returned. If you conduct the same site command search at Google Image search, you find many images from S3 included in the Google Image Search index.

So, it does appear Google is indexing content in the cloud. Specifically from Amazon S3. Does something have to happen on the Amazon side for Google to index your content? I personally cannot find any hints to Amazon blocking any content from search engines on the technical docs or the FAQs. So maybe it is just a timing thing?

Forum discussion at WebmasterWorld.

 

Popular Categories

The Pulse of the search community

Follow

Search Video Recaps

 
Video Details More Videos Subscribe to Videos

Most Recent Articles

Search Forum Recap

Daily Search Forum Recap: May 24, 2024

May 24, 2024 - 10:00 am
Search Video Recaps

Search News Buzz Video Recap: Google Ranking Volatility, Ads In Google AI Overviews, Sundar Pichai Interview, Heartfelt Helpful Content & More Ad News

May 24, 2024 - 8:01 am
Google Search Engine Optimization

Google: The Site Reputation Abuse Policy Enforcement Not Yet Algorithmic

May 24, 2024 - 7:51 am
Google Search Engine Optimization

Google Search Can Now Index Electronic Publication (EPUB)

May 24, 2024 - 7:41 am
Google

Directory Of Embarrassing Google AI Overviews

May 24, 2024 - 7:31 am
Web Analytics

Google Analytics Real-Time Reports Adds Users In Last 5 Minutes

May 24, 2024 - 7:21 am
Previous Story: In 2008, Is The NoArchive Tag a Red Flag in SEO?