
Google's "Quick View" PDF Also Does OCR Conversion For Many Languages

Oct 9, 2009 - 8:22 am, by Barry Schwartz

Filed Under Google

Google recently announced "Quick View," a feature it rolled out in the search results just a couple of weeks ago. Quick View shows you a PDF in a web-based PDF viewer on Google. It takes the PDF from the host, typically the owner of the PDF, and does all the conversion on Google's servers.

The neat part is that this feature gives you OCR for virtually all of the languages Google has translation for. I'll get to that in a bit. First, let me show you a basic example of how Quick View works, and then I'll show you the OCR on a non-English document.

A search for [w4] returns the IRS's web site with the PDF of a W-4 form.

When you click on the Quick View link in the search results, you get this page:

Yes, you get a neat view of the PDF, plus the ability to download the file, print it, or convert it to plain HTML. Webmasters in a WebmasterWorld thread are not happy about this, because it bypasses your site and you get no traffic benefit from it. Tedster explains:

So it looks like one more way that Google Search can distribute a site's content without requiring a direct visit to the site itself - and in this case, it's an entire document, not just a snippet. And the intention is to roll this out for other file format types, too.

What makes things even worse from a copyright standpoint is the OCR technology. I can upload a book in almost any language, let Google index it as a PDF, then convert it to plain HTML and copy and paste from there.

For example, this Hebrew book in Quick View looks like this:

If you click the "Plain HTML" link, you are taken to a page where Google has OCRed the text into copy-and-paste-friendly Hebrew. Pretty neat! Well, neat to some, but not to those who might own the copyright on this text.

Forum discussion at WebmasterWorld.


The content at the Search Engine Roundtable is the sole opinion of the authors and in no way reflects the views of RustyBrick ®, Inc.
Copyright © 1994-2026 RustyBrick ®, Inc. Web Development All Rights Reserved.
This work by Search Engine Roundtable is licensed under a Creative Commons Attribution 3.0 United States License. Creative Commons License and YouTube videos under YouTube's ToS.
