Yahoo!'s Concept Network & SuperUnits

Apr 21, 2005 • 9:21 am | comments (5) | Filed Under Yahoo Search Engine Optimization
 

Bill Slawski started a thread over at Cre8asite Forums titled "Yahoo! Superunits: of signatures and co-occurence". In that thread, he discusses a new Yahoo! Search patent application, released on April 14th, titled "Systems and methods for search processing using superunits".

Here is the patent's abstract:

In a search processing system, a concept network is generated from a set of queries by parsing the queries into units and defining various relationships between the units based in part on patterns of units that appear together in queries. Units in the concept network that have some similar characteristic(s) are grouped into superunits. For each superunit, there is a corresponding signature that defines the similar characteristic of the group. A query is processed by identifying constituent units, determining the superunit membership of some or all of the constituent units, and using that information to formulate a response to the query.

Bill tells us to look for "some new vocabulary words - a concept network, a unit, a superunit, and signatures." I only skimmed the patent briefly, but the concept network idea seems very interesting. The patent describes it this way: "A concept network is generated from a set of queries by parsing the queries into units and defining various relationships between the units, e.g., based on patterns of units that appear together in queries."
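To make the vocabulary a little more concrete, here is a rough Python sketch of the idea as the abstract describes it. This is not Yahoo!'s method. Several things here are my own assumptions for illustration: each word in a query is treated as one unit, the only relationship tracked is "appeared together in a query", and a signature is a hand-picked set of units that every member of a superunit must have co-occurred with. The function names and sample queries are made up.

```python
from collections import defaultdict
from itertools import combinations


def build_concept_network(queries):
    """Map each unit to the set of units it appears with in some query.

    Assumption: a unit is a single lowercase word. The patent's units
    may well be multi-word phrases.
    """
    neighbors = defaultdict(set)
    for query in queries:
        units = set(query.lower().split())
        for a, b in combinations(sorted(units), 2):
            neighbors[a].add(b)
            neighbors[b].add(a)
    return neighbors


def build_superunits(neighbors, signatures):
    """Group units that share the characteristic named by each signature.

    Assumption: a signature is a set of units, and a unit joins the
    superunit if it has co-occurred with every unit in that set.
    """
    return {
        name: {unit for unit, nbrs in neighbors.items() if signature <= nbrs}
        for name, signature in signatures.items()
    }


def superunits_for_query(query, superunits):
    """Split a query into units and look up each unit's superunits."""
    return {
        unit: sorted(name for name, members in superunits.items() if unit in members)
        for unit in query.lower().split()
    }


# Hypothetical query log and signatures, for illustration only.
QUERIES = [
    "beatles lyrics", "beatles tour",
    "madonna lyrics", "madonna tour",
    "paris hotels", "paris map",
    "london hotels", "london map",
]
SIGNATURES = {
    "musician": {"lyrics", "tour"},
    "place": {"hotels", "map"},
}

network = build_concept_network(QUERIES)
superunits = build_superunits(network, SIGNATURES)

print(superunits_for_query("madonna tour", superunits))
# {'madonna': ['musician'], 'tour': []}
```

With the superunit membership in hand, a search engine could in principle use it to shape the response, for example by treating "madonna" as a musician. How the patent actually does that step is beyond this sketch.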

 

Comments:

Dazzlindonna

04/21/2005 02:16 pm

You said, Here is the patent's abstract - and boy, is that abstract. I need much more coffee in me this morning to make sense out of that paragraph. In fact, it sounds more like something you would see being produced by one of those auto-spam generators, where it uses the keyword phrases chosen in a random meaningless way - but in a way that fools the engines into thinking it makes sense. :)

Barry Schwartz

04/21/2005 02:28 pm

No one can fool Yahoo!

Orion

04/21/2005 04:43 pm

"Concept network, a unit, a superunit, and signatures" are not new things. If you read between the lines, co-occurrence theory of set units and degrees of membership can be seen all over the patent. Orion

Bill

04/21/2005 04:57 pm

My thoughts exactly, Orion. I expected to see the word "co-occurrence" somewhere within the patent after reading through part of it, and yet was surprised to see it, too. These are not new things. But, they are concepts that people should start learning more about. We're going to try to break that patent application down into a little more accessible English, Donna. Might take a day or more, but it's probably one that people should get a good understanding of.

Teddie

04/25/2005 12:00 pm

That patent sounds very similar to the 'shard' or cluster organisation of data by Google that I saw in a recent presentation. Check out this presentation around the 45min mark. http://www.search-engine-war.co.uk/2005/04/google_behind_t.html
