How Does Google Crawl Pages & Index Them?

Dec 28, 2006 • 7:43 am | comments (4) | Filed Under Google Search Engine Optimization
 

A WebmasterWorld thread asks "How does Google determine which pages to crawl?" Google didn't always crawl and index pages the way it does now. It changed its crawl priorities with the Big Daddy update, around April 2006.

Google now sets crawl priorities based on several factors, one of which is PageRank. As far as I understand it, pages with higher PageRank will generally be crawled and indexed more quickly than pages with lower PageRank.
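To make the idea concrete, here is a minimal sketch of a crawl queue that visits higher-scored pages first. It is only an illustration of score-based prioritization, not Google's actual crawler: the function name `crawl_order`, the URLs, and the scores are all made up, and a real crawler would weigh many more factors than a single number.

```python
import heapq


def crawl_order(page_scores):
    """Return URLs in the order a score-prioritized queue would visit them.

    page_scores maps a URL to a hypothetical importance score
    (a stand-in for PageRank in this sketch).
    """
    # heapq is a min-heap, so negate each score to pop the highest first.
    frontier = [(-score, url) for url, score in page_scores.items()]
    heapq.heapify(frontier)

    order = []
    while frontier:
        _, url = heapq.heappop(frontier)
        order.append(url)
    return order


# Hypothetical pages and scores, for illustration only.
pages = {
    "example.com/": 0.6,
    "example.com/products": 0.3,
    "example.com/archive/2004/page-17": 0.1,
}
print(crawl_order(pages))
```

In this toy version the deep archive page is always visited last, which matches the general rule above: the lower the score, the longer a page waits for the bot.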

That is one of the reasons people recommend placing links to your most important pages on your highest PageRank pages (i.e., your homepage). First, it will increase the PageRank of those pages; second, it will give the bot easier (higher-level) access to them.
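The effect of a homepage link can be sketched with the simplified PageRank formula from the original published description, run on a tiny made-up site. This is an assumption-laden toy: the four page names, the link structure, the damping value of 0.85, and the `pagerank` helper are all hypothetical, and whatever Google runs in production is certainly more involved.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank by repeated iteration.

    links maps each page to the list of pages it links to.
    Every linked page must also appear as a key in links.
    """
    pages = list(links)
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Every page starts each round with the small baseline share.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page, outlinks in links.items():
            # A page with no outlinks spreads its rank over all pages.
            targets = outlinks if outlinks else pages
            share = damping * rank[page] / len(targets)
            for target in targets:
                new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical site: "deep" is only reachable through the blog page.
before = pagerank({
    "home": ["about", "blog"],
    "about": ["home"],
    "blog": ["home", "deep"],
    "deep": ["home"],
})

# Same site, but the homepage now links straight to "deep".
after = pagerank({
    "home": ["about", "blog", "deep"],
    "about": ["home"],
    "blog": ["home", "deep"],
    "deep": ["home"],
})

print(before["deep"] < after["deep"])
```

In this toy graph the deep page's score goes up once the homepage links to it directly, and the page is also one click closer to the site's entry point.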

In the old days, it was easier to get Google to index and rank all the pages on your huge dynamic site, as long as the pages were search engine friendly. Now even indexing requires page popularity and trust factors. And don't even get the SEO community started on pages that are indexed but stuck in the supplemental index. :)

Forum discussion at WebmasterWorld.

 

Comments:

Michael Martinez

12/28/2006 07:32 pm

Toolbar PR is a poor guide to frequency of crawling with Google. The actual number of links, regardless of whether they pass value, seems to have a greater impact on crawl rates than anything related to Toolbar PR. For example, Matt Cutts has said at least a couple of times that sites with low crawling activity appear to "be on the fringe". I've got pages in my network that have relatively few links pointing to them and they are crawled about once a month. I have other pages with many links pointing to them and they are crawled every 1-2 days. There is no correlation between Toolbar PR and crawling activity.

Ryan W

01/01/2007 04:16 pm

Getting links to your site from a high PR site is the quickest way for a site to get indexed in Google. Higher site PageRank ensures that your site gets crawled frequently by Google.

Rachel G

05/26/2007 11:01 am

Google crawls are taking longer and longer, but content is still king. The more content, the better the chance of getting pages in the crawl.

kostas

10/12/2009 05:06 pm

I have the domain name www.ufobet.com, but sometimes www.ufobet.com becomes ufobet.com without the www. What is the problem?
