IHY: blocking referrer spam with htaccess
In Referrer Spam - Why you shouldn't publish your Web logs - Best Practices Search Engine Forums, an old thread originally posted by Alan Perkins, was recently revived by IHY member Dave B, who posts an htaccess file that looks like it could make a big dent in referrer spam for web sites that implement it.
Referrer spam is the practice of sending fake visitors with fake referrers to web sites, to have your URL appear in their log files. This is done in the hope that search engines will find the links and boost the spamming site's rankings.
It's not just a search engine spam problem, though: referrer spam can also interfere with traffic analysis. I checked my logs against Dave's htaccess list, and it looks like about 95% of the fake traffic would be blocked. Nice.
Like The Story? Vote For It On Yahoo Buzz! Or On Sphinn!
DanThies in at April 15, 2006 10:45 AM
Comments (2)

Comments
That .htacess "filtering" will demand a considerable amount of processing from your server. And just like Email subjects, they could get around the "keyword" filtering by using certain Modified Spellings of Misspellings.
Even if you do publish your Web Stats and decide to NOT make it a password protected directory .... you could just as easily "disallow it to be spidered in the robots.txt.
And with some highly developed Client Side trackers - you can ban IP addresses from being calculated as a visit, so as to focus on the REAL visits.
Posted by SEARCH ENGINES WEB at April 15, 2006 18:57