Block (Passage) Level Link Analysis by MSN

Jul 30, 2004 - 8:34 am
Filed Under Bing Search

With all the discussion about the problems with PageRank and HITS, Microsoft recently released a paper describing its solution to their faults. The basic premise of the paper, which can be downloaded here, is that not all links on a single page are equal. By breaking the page up into "blocks" or "passages" (as Orion likes to call them in the thread at Search Engine Watch), you can work out semantically which sections of the page are about which topics. Then, based on the computed location of each link within those blocks, you can determine the weight and relevancy of that link.

Very interesting idea, though of course this can be abused as well. I for one would love to see this working at MSN Search. For discussion, please join the Search Engine Watch thread. Here is a passage from the paper:

Link Analysis has shown great potential in improving the performance of web search. PageRank and HITS are two of the most popular algorithms. Most of the existing link analysis algorithms treat a web page as a single node in the web graph. However, in most cases, a web page contains multiple semantics and hence the web page might not be considered as the atomic node. In this paper, the web page is partitioned into blocks using the vision-based page segmentation algorithm. By extracting the page-to-block, block-to-page relationships from link structure and page layout analysis, we can construct a semantic graph over the WWW such that each node exactly represents a single semantic topic. This graph can better describe the semantic structure of the web. Based on block-level link analysis, we proposed two new algorithms, Block Level PageRank and Block Level HITS, whose performances we study extensively using web data.
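To make the idea a bit more concrete, here is a rough Python sketch of what a block-weighted PageRank could look like. This is my own illustration, not the code or the exact formulas from the Microsoft paper. It assumes each page has already been split into blocks, that every block carries an importance weight (the numbers below are invented), and that a block divides its weight evenly among the pages it links to. The function name `block_level_pagerank` and the input layout are hypothetical.

```python
from collections import defaultdict


def block_level_pagerank(pages, damping=0.85, iterations=50):
    """Rank pages where each link's vote depends on the block it sits in.

    pages maps a page name to a list of (block_weight, [target_page, ...]).
    The block weights are assumed inputs, e.g. from a page segmenter.
    """
    names = sorted(pages)
    n = len(names)

    # Build page -> page transition weights by going page -> block -> page.
    trans = {p: defaultdict(float) for p in names}
    for p in names:
        # Keep only blocks with positive weight and at least one known target.
        blocks = []
        for weight, targets in pages[p]:
            known = [t for t in targets if t in pages]
            if weight > 0 and known:
                blocks.append((weight, known))
        total = sum(weight for weight, _ in blocks)
        for weight, known in blocks:
            for t in known:
                # The block's share of the page, split evenly over its links.
                trans[p][t] += (weight / total) / len(known)

    # Standard PageRank power iteration over the block-weighted graph.
    rank = {p: 1.0 / n for p in names}
    for _ in range(iterations):
        # Pages with no usable out-links spread their rank uniformly.
        dangling = sum(rank[p] for p in names if not trans[p])
        new = {p: (1 - damping) / n + damping * dangling / n for p in names}
        for p in names:
            for t, w in trans[p].items():
                new[t] += damping * rank[p] * w
        rank = new
    return rank


# Hypothetical three-page site: "home" has a main content block (weight 0.8)
# and a small sidebar block (weight 0.2) that holds the sponsor link.
example = {
    "home": [(0.8, ["article"]), (0.2, ["sponsor", "article"])],
    "article": [(1.0, ["home"])],
    "sponsor": [(1.0, ["home"])],
}
ranks = block_level_pagerank(example)
for name in sorted(ranks, key=ranks.get, reverse=True):
    print(name, round(ranks[name], 3))
```

In this toy example the link in the main content block passes along far more weight than the sidebar link, so "article" ends up ranked above "sponsor" even though both are linked from the same page. With plain PageRank, every link on "home" would count the same.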

