Google Does Crawl & Index UTF-8 Encoded URLs

Sep 2, 2008 • 7:56 am | comments (2) by twitter Google+ | Filed Under Google Search Engine Optimization
 

A Google Groups thread asks if Google can crawl and index UTF-8 encoded URLs. The quick answer is yes, Google can crawl and index these URLs.

Google's JohnMu provided a link to the Google search results as proof, showing Google indexed and displayed the results:

UTF 8 Encoded URLs in Google

John explains that in some cases Google won't crawl them. He recommends "that you also use escaped URLs in your links, to make sure that your site is compatible with older browsers that don't understand straight UTF-8 URLs."

John added that he has seen issues with Google when "the URL uses non-ASCII characters but the server is expecting an encoding other than UTF-8."

There are plenty of URL encoding tools you can use to properly link to UTF-8 encoded URLs.

Forum discussion at Google Groups.

Previous story: Can Google Win The Browser Wars With Google Chrome?
 

Comments:

suzukik

09/02/2008 06:19 pm

I'm a Japanese webmaster. Google successfully indexes encoded URLs. Amazon.co.jp encodes a title of a book as its URL like Wikipedia. http://www.amazon.co.jp/Google%E3%82%92%E6%94%AF%E3%81%88%E3%82%8B%E6%8A%80%E8%A1%93-%E2%80%BE%E5%B7%A8%E5%A4%A7%E3%82%B7%E3%82%B9%E3%83%86%E3%83%A0%E3%81%AE%E5%86%85%E5%81%B4%E3%81%AE%E4%B8%96%E7%95%8C-WEB-DB-PRESS%E3%83%97%E3%83%A9%E3%82%B9%E3%82%B7%E3%83%AA%E3%83%BC%E3%82%BA/dp/4774134325/

Karla Koi

04/19/2012 11:30 am

I work with  non-ASCII /  UTF-8. Works fine on our system & Google crawls it.

blog comments powered by Disqus