I struggled with the title of this post, so please read through it so it all makes sense. In short, a person asked Gary Illyes from Google some time ago about duplicate content. The question on Twitter was if a PDF and web page have the exact same content, should one worry about the duplicate content of the page?
In which Gary replied, he wouldn't worry about duplicate content in general.
Here are the tweets:
@imprasadjoshi you need not to worry much about onsite duplication.— Gary Illyes (@methode) February 12, 2016
Now, Google always tells webmasters not to worry about duplicate content. Google handles it when webmasters do not. But most SEOs do the right thing and manage their duplicate content issues. Should they also worry about PDF versions? I wouldn't but hey.
Forum discussion at Twitter.