Latent Semantic Analysis (LSA) - Crawl into the Google Algorithm?

Feb 3, 2005 - 5:19 pm 3 by
Filed Under Google

Earlier today, I started a little theory on the sandbox dieing, well, there is a ton of smart forum discussion going on in a thread at SEW Forums that I renamed to Major Google Changes: Latent Semantic Analysis?. Now, bakedjack has been really driving the thread into a discussion on LSA.

First let me quote some of randfish post on what LSA is about:

The idea behind this is that by taking a huge composite (index) of millions of web pages, the search engines can "learn" which words are related and which noun concepts relate to one another.
For example, using LSA, a search engine would recognize that trips to the zoo often include viewing wildlife and animals, possibly as part of a tour.
Now, conduct a search at Google for ~zoo ~trips. Note the bolded words match the terms I italicized in the paragraph above. Google is bolding 'related' terms and recognizing which terms that frequently occur concurrently (together / on the same page / in close proximity) in their index.
Some forms of LSA are too computationally expensive. For example, Google isn't smart enough to 'learn' the way some of the newer learning computers do at MIT (see some news reports on this). They cannot, for example, learn through their index that Zebras and Tigers are both examples of striped animals, although they may realize that stripes and zebra are more semanticly connected then ducks and stripes.

Very well done.

Chatting with Ammon Johns earlier today that said that a search engine can perform LSA two ways (more then two but here are two): (1) The way Teoma does it with Hubs and Communities (2) Looking at the words on a page, around the links, and seeing how they are related. Well, its best explained by a my coverage of the Super Session: History of SEO/SEM Theory and Testing - WMW Conf 7, where Daron Babin (aka SEGuru) was reported saying, "He recommends writing a page of content and pulling out the keywords, then give it to someone and ask them to figure out what they keyword is. He said its about the other words on the page, its that important. If the keyword is "apple" is the page about computers or fruit?"

More is being looked at with this in the thread.

 

Popular Categories

The Pulse of the search community

Follow

Search Video Recaps

 
- YouTube
Video Details More Videos Subscribe to Videos

Most Recent Articles

Search Forum Recap

Daily Search Forum Recap: December 6, 2024

Dec 6, 2024 - 10:00 am
Search Video Recaps

Search News Buzz Video Recap: Google November Core Update Done, Chrome Site Engagement Metrics, Canonicals, 21 Years & More

Dec 6, 2024 - 8:11 am
Google Updates

Google November 2024 Core Update Finally Finished Rolling Out

Dec 6, 2024 - 8:01 am
Google Search Engine Optimization

Google Does Try To Handle Broken Canonicals

Dec 6, 2024 - 7:51 am
Google Search Engine Optimization

Google Search: How Clustering Works With Localization

Dec 6, 2024 - 7:41 am
Google Search Engine Optimization

Google Marauding Black Holes With Clustering & Error Pages

Dec 6, 2024 - 7:31 am
Previous Story: Is the Google Sandbox Over