Latent Semantic Analysis (LSA) - Crawl into the Google Algorithm?

Feb 3, 2005 - 5:19 pm 3 — by Barry Schwartz

Filed Under Google

Earlier today, I floated a little theory about the sandbox dying. Since then, a ton of smart forum discussion has been going on in a thread at SEW Forums that I renamed to "Major Google Changes: Latent Semantic Analysis?". bakedjack has been steering the thread toward a discussion of LSA.

First, let me quote some of randfish's post on what LSA is about:

The idea behind this is that by taking a huge composite (index) of millions of web pages, the search engines can "learn" which words are related and which noun concepts relate to one another.
For example, using LSA, a search engine would recognize that trips to the zoo often include viewing wildlife and animals, possibly as part of a tour.
Now, conduct a search at Google for ~zoo ~trips. Note the bolded words match the terms I italicized in the paragraph above. Google is bolding 'related' terms and recognizing which terms frequently occur concurrently (together / on the same page / in close proximity) in their index.
Some forms of LSA are too computationally expensive. For example, Google isn't smart enough to 'learn' the way some of the newer learning computers do at MIT (see some news reports on this). They cannot, for example, learn through their index that Zebras and Tigers are both examples of striped animals, although they may realize that stripes and zebra are more semantically connected than ducks and stripes.

Very well done.
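To make the co-occurrence idea in the quote a bit more concrete, here is a minimal sketch in Python. It is not Google's method and it is not full LSA (which would also apply a singular value decomposition to the term-document matrix); it only counts which terms appear on the same pages and compares them with cosine similarity. The PAGES list, and the function names term_vectors, cosine and relatedness, are all made up for illustration.

```python
import math
from collections import Counter

# Hypothetical toy "index": each string stands in for one web page.
PAGES = [
    "zoo trips include wildlife and animals on a guided tour",
    "our zoo tour shows animals and wildlife to visitors",
    "zebra stripes help the zebra hide from predators",
    "stripes on a zebra are unique like fingerprints",
    "ducks swim in the pond and ducks eat bread",
]

def term_vectors(pages):
    """Map each term to a list of per-page counts (one row of a term-document matrix)."""
    vectors = {}
    for i, page in enumerate(pages):
        for term, count in Counter(page.split()).items():
            vectors.setdefault(term, [0] * len(pages))[i] = count
    return vectors

def cosine(a, b):
    """Cosine similarity between two equal-length count vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def relatedness(vectors, term_a, term_b):
    """How strongly two terms co-occur across pages (0.0 means never together)."""
    return cosine(vectors[term_a], vectors[term_b])

VECTORS = term_vectors(PAGES)
```

On this toy data, relatedness(VECTORS, "stripes", "zebra") comes out higher than relatedness(VECTORS, "stripes", "ducks"), which mirrors the stripes/zebra versus ducks/stripes example in the quote. A real index would have millions of pages and far noisier counts.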

I was chatting earlier today with Ammon Johns, who said that a search engine can perform LSA in more than two ways, but here are two: (1) the way Teoma does it, with Hubs and Communities, and (2) by looking at the words on a page and around the links, and seeing how they are related. It's best explained by my coverage of the Super Session: History of SEO/SEM Theory and Testing - WMW Conf 7, where Daron Babin (aka SEGuru) was reported saying, "He recommends writing a page of content and pulling out the keywords, then give it to someone and ask them to figure out what the keyword is. He said it's about the other words on the page, it's that important. If the keyword is "apple" is the page about computers or fruit?"
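Babin's "pull out the keyword" test can be roughed out in a few lines of Python. This is only a sketch under loud assumptions: the TOPIC_TERMS vocabularies are hand-picked and hypothetical (a search engine would presumably learn such associations from its index), and guess_topic is an invented helper, not anything a search engine is known to run.

```python
# Hypothetical context vocabularies, hand-picked for this example only.
TOPIC_TERMS = {
    "computers": {"mac", "laptop", "software", "keyboard", "processor"},
    "fruit": {"orchard", "pie", "juice", "tree", "harvest"},
}

def guess_topic(text, keyword):
    """Drop the keyword, then pick the topic whose terms overlap most with the rest."""
    words = {w.strip(".,!?").lower() for w in text.split()} - {keyword}
    scores = {topic: len(words & terms) for topic, terms in TOPIC_TERMS.items()}
    best = max(scores, key=scores.get)
    # If no surrounding word matches any topic, the page fails the test.
    return best if scores[best] > 0 else None
```

A page such as "The apple laptop ships with new software and a faster processor." would be guessed as "computers" even with "apple" removed, while a page with no supporting words gives None, which is the situation Babin is warning about.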

More is being looked at with this in the thread.


The content at the Search Engine Roundtable are the sole opinion of the authors and in no way reflect views of RustyBrick ®, Inc
Copyright © 1994-2025 RustyBrick ®, Inc. Web Development All Rights Reserved.
This work by Search Engine Roundtable is licensed under a Creative Commons Attribution 3.0 United States License. Creative Commons License and YouTube videos under YouTube's ToS.
