Latent Semantic Analysis (LSA) - Crawl into the Google Algorithm?

Feb 3, 2005 - 5:19 pm
Filed Under Google

Earlier today, I floated a little theory about the sandbox dying. There is now a ton of smart forum discussion going on in a thread at SEW Forums, which I renamed to "Major Google Changes: Latent Semantic Analysis?". bakedjack has really been driving the thread toward a discussion of LSA.

First, let me quote some of randfish's post on what LSA is about:

The idea behind this is that by taking a huge composite (index) of millions of web pages, the search engines can "learn" which words are related and which noun concepts relate to one another.
For example, using LSA, a search engine would recognize that trips to the zoo often include viewing wildlife and animals, possibly as part of a tour.
Now, conduct a search at Google for ~zoo ~trips. Note that the bolded words match the terms I italicized in the paragraph above. Google is bolding 'related' terms and recognizing which terms frequently occur concurrently (together / on the same page / in close proximity) in their index.
Some forms of LSA are too computationally expensive. For example, Google isn't smart enough to 'learn' the way some of the newer learning computers do at MIT (see some news reports on this). They cannot, for example, learn through their index that zebras and tigers are both examples of striped animals, although they may realize that stripes and zebra are more semantically connected than ducks and stripes.

Very well done.
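
To make the co-occurrence idea in randfish's description a bit more concrete, here is a minimal sketch in Python. It is not how Google does it, and it is not full LSA either: real LSA would also run a truncated singular value decomposition over the term-by-page matrix, which this sketch skips. It only counts which terms show up on the same pages and compares them with cosine similarity. The five "pages" and the function names are made up for illustration.

```python
import math
from collections import Counter

# Hypothetical toy "index": each string stands in for one web page.
PAGES = [
    "zoo trips include wildlife and animals on a tour",
    "a zoo tour shows animals and wildlife",
    "zebra stripes at the zoo",
    "ducks swim in the pond",
    "the pond has ducks and fish",
]


def term_vectors(pages):
    """Map each term to a list holding its count on every page."""
    vectors = {}
    for i, page in enumerate(pages):
        for term, count in Counter(page.lower().split()).items():
            vectors.setdefault(term, [0] * len(pages))[i] = count
    return vectors


def cosine(u, v):
    """Cosine similarity of two equal-length count vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def relatedness(vectors, a, b):
    """How strongly two terms co-occur across pages, from 0.0 to 1.0."""
    return cosine(vectors[a], vectors[b])


vectors = term_vectors(PAGES)
print("zebra vs stripes:", relatedness(vectors, "zebra", "stripes"))
print("ducks vs stripes:", relatedness(vectors, "ducks", "stripes"))
print("zoo vs wildlife:", relatedness(vectors, "zoo", "wildlife"))
```

On this toy data, "zebra" and "stripes" score higher than "ducks" and "stripes" simply because they share a page, which is the kind of connection the quote describes. Nothing here would tell you that zebras and tigers are both striped animals.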

I was chatting earlier today with Ammon Johns, who said that a search engine can perform LSA in two ways (there are more than two, but here are two): (1) the way Teoma does it, with Hubs and Communities, and (2) looking at the words on a page and around the links, and seeing how they are related. Well, it's best explained by my coverage of the Super Session: History of SEO/SEM Theory and Testing - WMW Conf 7, where Daron Babin (aka SEGuru) was reported as follows: "He recommends writing a page of content and pulling out the keywords, then giving it to someone and asking them to figure out what the keyword is. He said it's about the other words on the page; it's that important. If the keyword is "apple," is the page about computers or fruit?"
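
The "pull out the keyword and see if someone can still tell what the page is about" test can be sketched in a few lines of Python. This is only an illustration under loud assumptions: the two topic word lists below are invented by hand, whereas a search engine would presumably learn such associations from its index, and `guess_topic` is a hypothetical helper, not anything from Google or Teoma.

```python
# Hypothetical, hand-picked topic vocabularies (an assumption for this sketch).
TOPIC_TERMS = {
    "computers": {"mac", "laptop", "software", "keyboard", "iphone", "processor"},
    "fruit": {"orchard", "pie", "juice", "tree", "harvest", "cider"},
}


def guess_topic(text, keyword):
    """Drop the keyword, then score each topic by how many remaining words match it.

    Returns (best_topic_or_None, scores_by_topic).
    """
    words = [w.strip(".,!?\"'").lower() for w in text.split()]
    context = [w for w in words if w and w != keyword]
    scores = {
        topic: sum(w in terms for w in context)
        for topic, terms in TOPIC_TERMS.items()
    }
    best = max(scores, key=scores.get)
    # If no context word matched any topic, the page gives no signal at all.
    return (best if scores[best] > 0 else None), scores


page_a = "The new Apple laptop ships with faster software and a better keyboard."
page_b = "We picked every apple in the orchard to bake a pie."

print(guess_topic(page_a, "apple"))
print(guess_topic(page_b, "apple"))
```

With "apple" removed, the first page still reads as being about computers and the second about fruit, purely from the other words on the page. A page whose remaining words match neither list comes back as `None`, which is roughly the situation Babin is warning about.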

More on this is being explored in the thread.

