Yandex Palekh Algorithm Catches The Long Tail With Machine Learning

Nov 3, 2016 - 8:22 am 2 by

Yandex Palekh

Yesterday, Yandex announced that they launched something similar to the Google RankBrain - well, they didn't say that, I am.

They launched what they call Palekh which is name of a Russian city, the flag of that city is of a firebird, which you can see in the image above. Why the firebird, well, it has a long tail and this algorithm aims at improving the quality of the results for long tail queries.

Yandex told us that they handle about 100 million queries per day fall under the "long-tail" classification within their search engine. That is about 40% of all the queries performed on that search engine.

So they wanted to make the results better by better understanding those queries. Yandex told me that basically," the technology allows us to understand the meaning behind every query, and not just look for similar words."

For that, we're starting to use neural networks as one of 1500 factors of ranking - we've managed to teach our neural networks to see the connections between a query and a document even if they don't contain common words. This has been made possible by converting the words from billions of search queries into numbers (with groups of 300 each) and putting them in 300-dimensional space - now every document has its own vector in that space. If the numbers of a query and numbers of a document are near each other in that space, then the result is relevant. This technology is called a "semantic vector".

They are using "billions of queries from logs and relying on documents' headlines and search queries, not documents' texts yet." "We also have many targets (long click prediction, CTR, "click or not click" models etc.) that are teaching our neural network - our research has showed that using more targets is more effective," they added. So this is a self learning, machine learning algorithm.

Yandex is a very very important search engine for Russian users.

Forum discussion at Twitter.


Popular Categories

The Pulse of the search community


Search Video Recaps

Video Details More Videos Subscribe to Videos

Most Recent Articles

Search Forum Recap

Daily Search Forum Recap: May 24, 2024

May 24, 2024 - 10:00 am
Search Video Recaps

Search News Buzz Video Recap: Google Ranking Volatility, Ads In Google AI Overviews, Sundar Pichai Interview, Heartfelt Helpful Content & More Ad News

May 24, 2024 - 8:01 am
Google Search Engine Optimization

Google: The Site Reputation Abuse Policy Enforcement Not Yet Algorithmic

May 24, 2024 - 7:51 am
Google Search Engine Optimization

Google Search Can Now Index Electronic Publication (EPUB)

May 24, 2024 - 7:41 am

Directory Of Embarrassing Google AI Overviews

May 24, 2024 - 7:31 am
Web Analytics

Google Analytics Real-Time Reports Adds Users In Last 5 Minutes

May 24, 2024 - 7:21 am
Previous Story: Webmasters React To Google's 200 Rankings Factors Claim While Googlers Look On