Google On Good Web Crawler Attributes

Aug 22, 2025 - 7:11 am 0 by

Googlebot Lizzi Image

Myriam Jessier asked Google about what would be good attributes of a web crawler. In which both Martin Splitt and Gary Illyes gave some responses to.

Myriam Jessier asked on Bluesky, "what are the good attributes? One should look into when picking a crawler to check things on a site for SEO and gen AI search?"

Martin Splitt from Google replied with this list of attributes:

  • support http/2
  • declare identity in the user agent
  • respect robots.txt
  • backoff if the server slows
  • follow caching directives*
  • reasonable retry mechanisms
  • follow redirects
  • handle errors gracefully*

Gary Illyes from Google forwarded the conversation to a new IETF document that talks about Crawler best practices. Gary wrote that this document was posted a few weeks ago.

It covers the recommended best practices including:

  • Crawlers must support and respect the Robots Exclusion Protocol.
  • Crawlers must be easily identifiable through their user agent string.
  • Crawlers must not interfere with the regular operation of a site.
  • Crawlers must support caching directives.
  • Crawlers must expose the IP ranges they are crawling from in a standardized format.
  • Crawlers must expose a page that explains how the crawled data is used and how it can be blocked.

Check out that full document over here - you can see that Gary Illyes co-authored it but not under Google's name.

Forum discussion at Bluesky.

Image credit to Lizzi

 

Popular Categories

The Pulse of the search community

Search Video Recaps

 
Video Details More Videos Subscribe to Videos

Most Recent Articles

Search Forum Recap

Daily Search Forum Recap: February 20, 2026

Feb 20, 2026 - 10:00 am
Search Video Recaps

Search News Buzz Video Recap: Google Volatility Heated All Week, Google Reviews Vanishing, AI Overview & AI Mode Links Updated, Google Ads News and more

Feb 20, 2026 - 8:01 am
Google Search Engine Optimization

Google Discourages Force Indexing Pages To Search

Feb 20, 2026 - 7:51 am
Google Maps

Google Updated The Business Profile Review Policies

Feb 20, 2026 - 7:41 am
Google Ads

Google Ads Updates Budget Pacing For Ad Scheduling

Feb 20, 2026 - 7:31 am
Other Search Engines

ChatGPT Ads From Expedia Spotted In The Wild

Feb 20, 2026 - 7:21 am
 
Previous Story: Google Gemini Cake Made Of AI