Google Shares Its Robots.txt Parser Code With Open Source World

Jul 2, 2019 - 7:46 am 0 by
Filed Under Google

Google Open Source

Google announced yesterday as part of its efforts to standardizing the robots exclusion protocol that it is open sourcing its robots.txt parser. That means how GoogleBot reads and listens to robots.txt files will be available for any crawler or coder to look at or use.

It is rare for Google to share anything they do in core search with the open source world - it is their secret sauce - but here Google has published it to Github for all to access.

Google wrote they "open sourced the C++ library that our production systems use for parsing and matching rules in robots.txt files. This library has been around for 20 years and it contains pieces of code that were written in the 90's. Since then, the library evolved; we learned a lot about how webmasters write robots.txt files and corner cases that we had to cover for, and added what we learned over the years also to the internet draft when it made sense."

Forum discussion at Twitter.

 

Popular Categories

The Pulse of the search community

Search Video Recaps

 
Video Details More Videos Subscribe to Videos

Most Recent Articles

Search Forum Recap

Daily Search Forum Recap: August 29, 2025

Aug 29, 2025 - 10:00 am
Search Video Recaps

Search News Buzz Video Recap: Google Spam Update, AI Mode Changes, ChatGPT Does Use Google, Search Ad News & More

Aug 29, 2025 - 8:01 am
Google Updates

Google August 2025 Spam Update Impact Felt Quickly

Aug 29, 2025 - 7:51 am
Bing SEO

Bing Webmaster Tools Sitemaps Index Coverage Button Missing For Some

Aug 29, 2025 - 7:41 am
Google

Google Tests Rounded Search Results Snippet Design Again

Aug 29, 2025 - 7:31 am
Google Maps

Google NMX Business Profiles May Show New Profiles Button

Aug 29, 2025 - 7:21 am
 
Previous Story: Search Google For Fireworks & Get A Fireworks Show