Preventing Google from Caching PDF Files

Sep 14, 2007 - 9:15 am 2 by
Filed Under Google

In July, I wrote asking for ideas on how to prevent "View as HTML" links from appearing on PDF files. In other words, authors of PDF files don't want them to be cached.

A DigitalPoint Forums member seems to have found the way to do this without implementing robots.txt. After all, he wants his page to be crawled but he doesn't want the HTML to be available.

He shares the following tidbit:

A special case is PDF files that should be indexed, but not cached. There is no way to directly include meta information in a PDF file, but if security is enabled for a PDF file it will be treated as if the noarchive tag was specified. Security settings can be controlled using Adobe Acrobat (not the free Reader).

Apparently, therefore, it's possible. More information can be found in this article that allows you to control caching of your pages.

Has anyone had success with this method?

Forum discussion continues at DigitalPoint Forums.

This post was written on September 11th and scheduled for publication on September 14th.

 

Popular Categories

The Pulse of the search community

Search Video Recaps

 
- YouTube
Video Details More Videos Subscribe to Videos

Most Recent Articles

Search Forum Recap

Daily Search Forum Recap: March 6, 2026

Mar 6, 2026 - 10:00 am
Search Video Recaps

Search News Buzz Video Recap: Google Heat Continues, AI Mode Recipe Link Cards, ChatGPT Web Search With Fewer Links & AI-Generated Search Landing Pages

Mar 6, 2026 - 8:01 am
Google Search Engine Optimization

Google: Most Sites Don't Need To Disavow Links But That's Not All Sites

Mar 6, 2026 - 7:51 am
Bing Search

Bing Search Tests Go To Shopping Button

Mar 6, 2026 - 7:41 am
Bing Ads

Bing With Asian Owned Labels On Microsoft Ads

Mar 6, 2026 - 7:31 am
Google Ads

Google Local Service Ads Won't Credit Calls For Existing Clients (Not Lead)

Mar 6, 2026 - 7:21 am
 
Previous Story: Does Word Position Matter On Keyword Phrases?