November 2005 Archives

Microsoft's Fremont (Google Base) is Live

Gary Price the other day wrote Microsoft's "Google Base" is Code Named Fremont & MS Receives Patent For Semi-Auto Annotation of Multimedia Objects. Basically, Microsoft built a similar application to Google Base, which went live on November 16th.

Today, Microsoft launched http://fremont.live.com/. According to a WebmasterWorld thread named MSN to launch Classified Ad Website, you will need "a @microsoft.com email address for verification."

More articles at eWeek and at RedHerring.

posted rustybrick in Microsoft MSN Search at November 30, 2005 2:59 PM Comments (0)

Methods to Get Indexed in Google without Links

A HighRankings forum thread named Can Google Index Unlinked Pages?, Can Google index unlinked pages? discusses ways to get indexed in Google without having any links that you know of.

Here are some of the answers in the thread:

  • Using Google Add URL Form
  • The URL appears up as a referrer in a pubic referrer log, or in trackbacks, etc.
  • Google crawls ISP logs
  • The Google Toolbar
  • Google Sitemap
  • Froogle
  • Google Web Accelerator
  • Google VPN
  • Port 443 (SSL Certificate Scan)
  • Register .gov, .mil, .edu domain.

Forum discussion at High Rankings Forums.

posted rustybrick in Google Optimization at November 30, 2005 11:33 AM Comments (1)

Goog Falls ~5% in One Day

A Reuters article named Google shares drop, biggest fall in year says;

Shares in Web-search leader Google Inc. slid 4.7 percent on Tuesday, their biggest decline in a year, on concern that the outlook for holiday sales may not be as strong as investors had hoped.

Expectations that U.S. retail sales activity following the Thanksgiving holiday may not be as strong as some analysts had predicted knocked Google down $19.94 to $403.54 in brisk turnover of 21.4 million.

Aren't people less likely these days to buy the day after Thanksgiving due to the ease of shopping online? I am no expert, so I will leave it at that.

Forum discussion at DigitalPoint Forums.

posted rustybrick in Google News & Press at November 30, 2005 9:29 AM Comments (0)

AdSense Publishers Remove Firefox Referral Ads

On November 4th Google released an AdSense & Firefox Referral Program and yesterday they allowed the Firefox Referral Program to Go International. A relatively new WebmasterWorld thread named Firefox referrals - experience of others? shows that all those in this thread, have tried and all have removed the firefox banner ads from their sites. The none contextually relevant ads have a low CTR and conversion rate.

I guess these ads don't work too well. :)



Forum discussion at WebmasterWorld.

posted rustybrick in Google AdSense at November 30, 2005 9:12 AM Comments (0)

Hubs in Jeopardy Due to NoFollow?

Hubs have played an important part of many search algorithms. If you want to know what is meant by a hub, I have an ok explanation from February 2004. A thread at Cre8asite Forums named asks the question Nofollow in forum links putting hub status in jeopardy?

A valid and important question to ask. If places like the Wikipedia Adds NoFollow Tags and others use the nofollow tag for all links, then the hub is in jeopardy. I know that the search engines say the nofollow should be used for links that can not be validated or verified (i.e. most blog comments, guest books, forum driven links and so on). But to deploy them site wide for authority sites, then where does the hub go?

Forum discussion at Cre8asite Forums.

posted rustybrick in Search Engine Optimization at November 30, 2005 8:59 AM Comments (0)

Google Helps Husband Snap Wife's Neck?

An article at the Register named Alleged techie killer Googled 'neck snap break' writes;

A Mac specialist on trial for the murder of his wife allegedly carried out a Google search for "neck snap break" and "hold" before her death, prosecutors in Durham, North Carolina, claimed last week.

He noted that the Google search information had just come to light after two years' investigation. Prosecutors are expected to present further computer forensic evidence before the trial concludes next week.

More information at WRAL.com.

Forum discussion at SEO Chat Forums.

posted rustybrick in Google News & Press at November 30, 2005 8:53 AM Comments (0)

Wikipedia Removes NoFollow Tags

I was a bit shocked to see that Wikipedia removed the nofollow link attributes from its pages. A thread at WebmasterWorld named Too many Wikipedia links?, which was primarily concerned with being penalized by Google for having too many links from the Wikipedia. But, as did some others in the thread, I thought Wikipedia used the nofollow attribute for links. It looks like they have removed them from all the links on the pages.

Take a look at the Yahoo! Wikipedia and then view the source and do a find on "nofollow". Nothing to be found.

So that does bring the question. Does one have to worry about having too many links from the Wikipedia?

posted rustybrick in Link Building at November 30, 2005 8:30 AM Comments (2)

MSN Search Pagination Bug

A WebmasterWorld thread reports a weird MSN pagination bug. Basically, do a search on United Health and then click on the page 2 link. You will notice, if you scroll back down, it will highlight that you are on page 3 and not page two. However, the URL looks correct for page two; &first=10, which denotes that you are starting at result 10. Same thing happens if you click on page 3, from page one, it will make it look like you are on page 4. It doesn't happen with all searches, seems to happen with United Health but not some others.

msn-search-pagination-bug.gif

Forum discussion at WebmasterWorld.

posted rustybrick in Microsoft MSN Search at November 29, 2005 9:34 AM Comments (1)

Google Space: Google Internet Cafe in London Airport

google-space.jpg

The t-shirts on the staff at the new Google Internet Cafe in Heathrow Airport say "Google Space" on them. ZDnet reports that Google turns Heathrow into testing lab and has pictures of the Google Space and the London GooglePlex launch. This news even got SlashDotted!

Google has taken its first foray into the physical world with the launch of an Internet cafe-style computing booth in London's Heathrow Airport.


The temporary installation, termed Google Space, consists of ten Samsung laptops in the public lounge of Terminal One at London's main airport.

The stand, launched on Tueday morning, will be staffed by at least two Google employees from 0700 to 1900 every day for the duration of the trial, which will run until 19 December. Google staff will be flown in from around the world to man the station.

Forum discussion at DigitalPoint Forums.

posted rustybrick in Google News & Press at November 29, 2005 9:14 AM Comments (2)

AdSense Firefox Referrals Go International

JenSense reports that Firefox Referrals now available for international AdSense publishers. JenSense, being from Canada, was upset that the firefox referral program was limited to only US publishers. There is also coverage at the Google AdSense Blog with links to the AdSense Referral FAQs section.

Forum discussion at WebmasterWorld & DigitalPoint Forums.

posted rustybrick in Google AdSense at November 29, 2005 9:07 AM Comments (2)

SEO Chats Does Away with Karate Belts

Remember when SEO Chat Added Karate Belts to the forum? Pretty much everyone I know made fun of them and thought they were ridiculous. Well, they grew on most of the members and just when everyone was getting used to them, SEO Chat pulled them. There is a thread at SEO Chat named My Belt? with the discussion. One member, rightly so, said; "It's funny. When they first intoruduced the belts, noone wanted them. Now, people get upset because they didn't."

Anyway, there is a new "belt" system, which is really a reputation system. Based on number of posts, you can get one of the following logos near your name, in this order.

seo-chat-rep.gif

Forum discussion at SEO Chat Forums.

posted rustybrick in SEO Forum News at November 29, 2005 8:41 AM Comments (0)

Meta Keywords; To Comma or Not?

Want to get old school today? Search Engine Watch Forums has a new thread asking one of the oldest questions out there in SEO. The thread is named META Keyword Strategies and asks;

I was wondering if anyone had any thoughts on the strategy of not implementing commas between the keywords in the META tags:

i.e.
{car accessories, auto parts, mufflers}
vs.
{car assecories auto parts mufflers}

Most say it doesn't make a difference but Danny Sullivan notes that the last time he spoke with Yahoo! about it, "they said to use commas."

posted rustybrick in Search Engine Optimization at November 29, 2005 8:35 AM Comments (1)

How to Contact Google; Maybe

A fun thread at Cre8asite forums started by Barry Welford; he asks; Why Is Google So Hard To Contact?

Basically, he found a typo on Google Blog search's home page and wanted to be a good citizen and notify them. Thing was, he couldn't find the appropriate email address or contact for to use to notify the blog search team at Google. He ultimately sent an email to press@google.com, and asked them to forward it on to the right people. Others in the forum thread linked to contact pages, which include all the following;

Join the discussion at Cre8asite Forums.

posted rustybrick in Google Search Engine at November 29, 2005 8:29 AM Comments (0)

Future of SEM: Danny Request Questions

Dannu Sullivan posted a thread at Search Engine Watch Forums named Need Your Questions For "Future Of SEM" Session. The session can be found on Day Three; 4:00pm - 5:15pm Slot.

n this roundtable discussion, a diverse panel of search marketers examines where search marketing may be heading in the years to come. Moderator: Danny Sullivan, Editor, SearchEngineWatch.com; Speakers: Dana Todd, President, SEMPO; Jill Whalen, Owner, High Rankings; Fredrick Marckini, CEO, iProspect; Greg Boser, President, WebGuerrilla LLC

Danny's request;

Next week at SES Chicago, I have a panel called "Future Of SEM," where I have a range of search marketers who will discuss where search marketing may be headed. I'm looking for your help. What questions do you have about the future of the industry? Please contribute them here, and I'll see about putting some of the best ones to the panel.

Post your questions at Search Engine Watch Forums.

posted rustybrick in Search Engine Strategies 2005 Chicago at November 29, 2005 8:20 AM Comments (0)

The SEO Myths Thread

A few days ago, Bill Slawski (aka bragadocchio) started an excellent thread that has really taken off named SEO Myths. In that thread he lists out 23 SEO myths off the top of his head. The thread, now four pages long, adds a lot to that list. Here are the top 10 from Bill;

1. SEO copywriting means writing strong copy, then inserting keywords within the copy a number of times.

2. keyword density is important, and you don't want too much or too little.

3. You should place content above menus to have it "crawled first"

4. Meta tags are the key to high rankings

5. The revisit meta tag can tell search engines to come back on a regular basis

6. is important

7. The more links the better

8. Pagerank is dead

9. There is a duplicate content penalty

10. There is a certain percentage of duplication that you can get away with before your page will be filtered in the results

If you read through the thread, you can see the professionalism in this forum setting. Each debated point listed in the list, is then backed up with evidence of all kinds (both academic and real life examples). It is a must see thread.

Forum discussion at Cre8asite Forums

posted rustybrick in Search Engine Optimization at November 29, 2005 8:13 AM Comments (0)

Ask Jeeves Missing Robots.txt File

In the past, I blamed MSN for Indexing Ask Jeeves SERPs but then today I saw that Google is also indexing Ask Jeeves SERPs. See the last result (#20) for tuxedos at Google, you will notice;

ask-tuxedo-google.gif

But then I spoke with Shawn and DigitalPoint and he took a quick look at Ask's robots.txt file, but couldn't find it at http://www.ask.com/robots.txt. So maybe that is the reason the other engines index Ask SERPs? Maybe Ask wants to be indexed? But why wouldn't Google manually exclude Ask SERP's from its index, since it may be duplicate results plus its linking to a direct competitor...

posted rustybrick in Ask.com at November 28, 2005 2:30 PM Comments (1)

Anchor Text with "Links" Discounted?

Some times it is funny to see how things in this or any industry get confused. Way back on May 12, 2004 we covered a topic named Links from links.html Pages Not Counted. A new HighRankings forum thread named asks The 'link' Word, the link word - undesirable?

Basically, if the word "links" is found in the anchor text, is it discounted?

Personally, I highly doubt it. Most people don't link with the word link, unless it describes what they are linking to. Or if they write, here is the link, for whatever reason.

Forum discussion at HighRankings Forum.

Side note; you can see I am having a hard time finding good threads today; slow day in the forums for some reason...

posted rustybrick in Link Building at November 28, 2005 11:40 AM Comments (1)

The Yahoo Ambassador Test

A WebmasterWorld forum thread named Yahoo Ambassador Test discusses the different levels of ambassador programs available at Yahoo! and how hard the tests are.

Nancy99 explains;

After you pay the non refundable $50 in the Ambassador sign up link, you have access to the training. There are 13 modules with about 4-8 items in each module. It does not seem as comprehensive as the AdWords training, but I have learned much new information already that has already helped with the Yahoo Ad accounts that I am managing now.

The discussion then moves into the types of programs available. There seems to be some confusion as to if there is a minimum spend to be part of any of the programs or not.

Forum discussion at WebmasterWorld.

posted rustybrick in Yahoo! Search Marketing at November 28, 2005 11:33 AM Comments (0)

Yahoo! Employees Spam Reporting Tool

I was invited once to Yahoo last year and I saw some things that I would not blog on. I can say that I did see this screen at Yahoo! and a bit past it. It is not a big deal, of course they want these types of quality controls at Yahoo!

An SEO Chat thread named Yahoo employees are manipulating the search results shows a screen capture of a yahoo SERPs page as if you were inside the Yahoo! office.

yahoo-internal-report.gif

I believe it is legit, but I can, of course, be wrong. Forum discussion at SEO Chat Forums.

posted rustybrick in Yahoo! Search Optimization at November 28, 2005 10:51 AM Comments (1)

Triple Play SES Chicago 2005 Coverage

Like we did at SES San Jose 05, here at the Search Engine Roundtable, we will providing triple coverage of the Search Engine Strategies Conference in Chicago next week. Ben, Chris and I will be doing most of the coverage. Below is our tentative and not guaranteed schedule of coverage. Here is the key for who will be covering what; Chris Boggs (CB), Ben Pfeiffer (BP), and Barry Schwartz (BS).

Monday, December 5, 2005 - Day 1:
~ 9:00am - 10:30am
Video Search (CB)
Introduction to Search Engine Marketing (BS)
Searcher Behavior Research Update (BP)
~ 11:00am - 12:30pm
Podcast Search (CB)
Reputation Monitoring & Management (BS)
Earning From Search & Contextual Ads (BP)
~ 1:45pm - 3:15pm
Global Search Landscape (CB)
Targeting Search Ads By Demographics & Behavior (BS)
BP is a wild card for this session. :)
~ 3:45pm - 5:15pm
Search Advertising 101 (CB)
Book Search (BS)
Ads Beyond Search (BP)

Tuesday, December 6, 2005 - Day 2:
~ 9:00am - 9:30am
Keynote: The Search Marketing Community (BS)
~ 10:15am - 11:30am
Creating Compelling Ads (CB)
News Search SEO (BS)
Business to Business Tactics (BP)
~ 1:00pm - 2:15pm
CB Break
SEM Via Communities, Wikis & Tagging (BS)
Meet the News Search Engines (BP)
~ 2:45pm - 4:00pm
Link Building Basics: Speaker (CB)
BS Wildcard
RSS, Blogs, Search Marketing (BP)
~ 4:30pm - 6:00pm
Successful Site Architecture (CB)
Google Print & The Copyright Debate (BS)
BP Wildcard

Wednesday, December 7, 2005 - Day 3:
~ 9:00am - 10:30am
Linking Strategies: Q&A Speaker (CB)
BS Wildcard
Working With Clients (BP)
~ 11:00am - 12:30pm
CB Break
SEM Campaign & Project Management (BS)
Working As A Team (BP)
~ 2:00pm - 3:30pm
Retailer Forum (CB)
Search Engine Q&A On Links (BS)
Developing Your SEM Niche (BP)
~ 4:00pm - 5:15pm
Converting Visitors Into Buyers (CB)
Future Of SEM (BS)
BP Wildcard
~ 5:30pm - 6:30pm
Evening Forum With Danny Sullivan (CB)

Thursday, December 8, 2005 - Day 4:
~ 9:00am - 10:15am
Break (CB)
Local Search Tactics (BS)
Wildcard (BP)
~ 10:45am - 12:00pm
Search Head or Tail (CB)
Wildcard (BS)
Measuring Success Through Phone Calls (BP)
~ 12:30pm - 1:45pm
Measuring Success Case Studies & Tactics (CB)
SEO Overkill (BS)

Chris posted a thread at Search Engine Watch forums named SES Chicago 2005: "Triple Play Blog Coverage" where you can post comments and questions in a forum setting.

Thanks.

posted rustybrick in Search Engine Strategies 2005 Chicago at November 28, 2005 9:01 AM Comments (1)

Brett Tabke Interviewed on Bot Banning

Brett Tabke, owner of WebmasterWorld, has given me the privilege to ask him a bunch of questions on the recent news that WebmasterWorld Bans Search Engine Bots from Crawling. So here it is....

Barry: Brett...Thank you for taking the time during this hectic period at WebmasterWorld to answer several questions about the recent changes you have made, to disallow spiders from accessing your site.

Barry: The big change was that you, on November 18th, changed your robots.txt file to disallow all bots from accessing your Web site. In a thread you started in the Foo forum at WebmasterWorld named lets try this for a month or three... you elegantly linked to your robots.txt file to show people. And subtitled the thread, "last recourse against rogue bots." Why was this the last course of action? I have spoken with dozen of site owners who run sites as large as yours. Most tell me that you can fight off these rogue bots one by one, but you need to factor in the costs of these bots into your hosting prices. How would you respond to that?

Brett: It is difficult to talk about issues that brush shoulders with security related matters. Once you talk about something and your actions to counter that problem in public, you give rise to an invertible counter measure. That said, we have been saying for many years that it was our number one problem on the site. I made a plea in the forums five years ago for a robots Inclusion standard (instead of an exclusion standard).

One thing that sets WebmasterWorld apart from all other similar sites, is the ease with which we can be crawled. There are no CGI parameters on url strings and all off-the-shelf bots can index the site. I can write a 15 line perl program in 5 minutes that will download the entire site - even with cookie support. That same thing can not be said about sites that are not freely crawlable (like other forums and auction sites with cgi based or non standard urls).

The change was for us to require cookie support via member login. That action mandates either allowing the approved big search engine crawlers to feast on a login page instead of page viewing several million pages before they realized the site was 100% different than before. The easiest solution to that is to set a robots.txt ban on all crawlers.

I knew it would be a controversial action. In such cases, it is always better to bring up the subject yourself or least people get the wrong impression that it was by no action of your own. I just threw up the post as a marker so that people knew we'd taken the action ourselves and I would come back later with more information after things settled down a bit. We had started down this road about mid-july when we began blocking many of the major crawlers.

> Why was this the last course of action?

We've tried every thing to stop the bots. Once we got up to several thousand ip's in the system ban list, it was having a serious effect on system performance. We also were occasionally into a situation where we would ban an IP and then that ip would get recycled to another member that had nothing to do with a download attack. It is hard to block an IP such as an AOL ip, because you block several million users using that IP via the AOL proxy cache.

> I have spoken with dozen of site owners who run sites as large as yours.


Size is not the only issue. The ease with which WebmasterWorld can be crawled is first up. I've been studying offline browsers for about a week. All of the site rippers or offline browsers available from Tucows, are able to download WebmasterWorld in it's entirety. Only 6 were able to successfully download part of a Vbulletin site. One would also choke on weird urls (like caps in filenames, or extremely long filenames).

> Most tell me that you can fight off these rogue bots one by one,

Ya, we were spending about an hour or two a day on this problem. I was to the point of hiring one person full time to address it.

Barry: Part of this process, you made a change that now requires cookie support, something that most bots can not support. As a side affect, all members had to relogin to WebmasterWorld. First question, do you have any stats on how many times the "forgot my password" function was used over the past 5 days? :) And my second question is; wouldn't it have been more affective to spend money on a full time server guy to fight off these bots then to lose the search engine traffic completely?

Brett: The majority of people are using browsers such as Opera or IE that auto remember passwords. We also have switched our cookies about once every 60 days for this very reason. That keeps people from leaving cookies laying around in an internet cafe, or on their work machine.

> affective to spend money on a full time server guy to fight off these bots then to lose the search engine traffic completely?

Even hiring a full time guy at this point wouldn't fix the problem. All the tools we have used are only a bandaid solution at trying to cure cancer. We have tried: page view throttling, bandwidth throttling, agent name parsing, cookie requirements from selected ISP's (over 500 including all of Europe/China), IP banning, link poisoning, various auto banning, and various forms of cloaking and site obfuscation to make the site uncrawlable to non-se bots.

The biggest issue, is the massive amount of overhead system and time it takes to manage all that. The totality of it all is staggering. From raw parsing of log files, to code, to server setup, to managing it all takes an inordinate amount of time. It is very easy to make mistakes in all that. (like the time we banned New Zealand visitors because we banned the big ISP's proxy server there) Our site is here for the members - not the rogue bots.

Barry: On that note; almost all the big names in the industry were shocked that you would take this action. They pretty much laughed that you thought you wouldn't be delisted within 30 days, let alone 60 days. Danny Sullivan said;

Brett figures he's got 60 days until pages drop from places like Google to get an alternative search solution in place. That seems optimistic to me. WebmasterWorld is a prominent site and should get getting revisited on a sub-daily basis. If search engines are hitting that robots.txt ban repeatedly, they ought to be dropping those pages in short order, or they aren't very good search engines. I mean, can you imagine the irony of Google and Yahoo getting pilloried on WebmasterWorld for taking so long to drop pages after they were told to do so after the ban was put into place.

Search experts like DaveN, Oilman, SEGuru and others all felt the same way. Why did you really feel it would not happen so fast?

Brett: It has been over 180 days since we blocked GigaBlast, 120 days since we blocked Jeeves, over 90 days since we blocked MSN, and almost 60 days since we blocked Slurp. As of last Tuesday we were still listed in all but Teoma. MSN was fairly quick, but still listed the urls without a snippet.

Google will hang onto a site up to 90 days after you put up a robots.txt ban. Even if the site is completely unreachable, we have seen sites still listed as url only sites up to six months later. It is only via the Google url removal utility where that process will be faster. It is a feature I had not used on Google in many years, and completely overlooked it.

Barry: Also in that summary thread, listed above, you expressed your frustration with the engines for "changing a perfectly good and accepted internet standard." Can you expand on that, and what steps you think they should take to get the robots.txt syntax the way it should be for 2005?

Brett: Without webmaster input, changing the robots.txt standard only encourages others to also play with the standard. Of the offline browser bots I looked at from Tucows, the majority of them can be set to ignore robots.txt. Why, because the standard has not been appreciated, endorsed, or adhered to by the engines as will as well as by the offline browser or site ripper programmers. The engines have fostered an era of robots.txt disrespect.

The engines changing the standard to suit their own needs, is exactly the same as Netscape and Microsoft playing around with the HTML standards during the browser wars. Only by adhering and endorsing standards can we together keep the net from becoming more chaotic than it is now. The enormity of what a webmaster has to already know is already too much for one person. The last thing the internet needs is every big search engine coming out with it's version of robots.txt standard. We need them to support the standard or form an open commission of theirs and our peers to come up with a new one (Which I have been endorsing for 5 years).

That said, as the author of the first robots.txt validator in 1998, I do take the standard very seriously. Hardly a day goes by when I don't get a email from someone asking why their robots.txt with an "Allow" line was marked as bad by the robots.txt validator.

Barry: Due to the fact you are an SEO expert, people came up with wild theories as to why you really did this. Some people said you were banned for cloaking. Some people said that you had a crazy PR stunt in mind. One PR stunt was that the search engines were coming out with a uniform site submission tool and you wanted to be the first to use it. Others said that you wanted to show the search engines that you do not need them. I am sure you heard of many other theories. Which do you find the most funny? Which do you find the most outrageous? And how would you respond to some of them?

Brett: I often forget the scale of how huge WebmasterWorld has become and how many people look to us on issues like this leadership. I have given up trying to disabuse people of notions to contrary why we do things. Not every hat is tin foil and not every helicopter black.

> Some people said you were banned for cloaking.

In order to address many of the rogue site ripper issues, we do openly cloak on the agent level some things. We have to be able to determine what is a good se bot and what isn't. If we randomly go throwing around poison links that lead to autobans without knowing what bot was what bot - we would be banning the se bots left and right. We also use it to keep random ad served content off the page where the only difference is the filename of the image file. That would encourage massive amounts of respidering.

We do everything we can to try to out fox the rogue bots. SE bots were always served the same content as members, and we never IP cloak so it is clear to just about everyone what we are doing. You could always check by a simple agent name switch to slurp. Sometimes we will trip and make a mistake ourselves as there are a few thousand lines of code dedicated to the issues we are talking about.

The number of things needed to address rogue bots is absurd. It was when I was trying to trim down the htaccess ban list to a few thousand IP's after getting hit for 12m page views in a week, that I threw my hands in the air and turned on required login and blocked all the bots. It wasn't a spur of the moment decision, but it was a spur of the moment reaction. If I had it to do over again, the only thing different I would do, is have the new site search engine debugged and ready to go.

> Some people said that you had a crazy PR stunt in mind.

I knew there would be an interest in to to WebmasterWorld members. Some of the other speculation by other noted webmasters was flat out wrong, self interested competitors, and showed a complete lack of understanding of the tech issues involved. One major blogger suggested that we could address all this with a couple of bans in the httaccess list. I laughed when I listened to it, because we had close to 4000 IP's in there and were on the very of banning entire C blocks and all of the AOL proxy servers. Clearly, the tech issues were well beyond his knowledge.

> Others said that you wanted to show the search engines that you do not need them.

Yes, a hundred thousand targeted referrals a day are just plain wrong. Lets cut to the chase; I adore search engine traffic, but my first duty as a webmaster is to the visitors and members of our site. Anything that interferers with that to the degree that rogue spiders, downloaders, offline browser, monitoring services, site rippers, or whatever you call it - I have to take action.

> I am sure you heard of many other theories. Which do you find the most outrageous?

That I was starting a bot busting service that I had talked about 4 years ago in the forums.

> And how would you respond to some of them?

I would not respond to it. My first and only duty is to the members and visitors of WebmasterWorld. Anything I can do to enhance their experience at the site is our goal. That viewpoint is what built WebmasterWorld and what will sustain it. Take care of your members first, and everything else will take care of itself. The more transparent we can make the tech, the better it works for everyone.

Barry: Do you expect support from Google, Yahoo, MSN and Ask Jeeves to get back into their indexes quickly? Have they offered you any support or advice?

Brett:
> Do you expect to get back into their indexes quickly?

No different in that regard than any other website.

> Have they offered you any support or advice?

Yes, they have been great off-the-record. It isn't something they can talk about in public either. I am saddened by that fact, but I do understand the big sites simply can ill afford to talk about security or tech issues that can have a negative effect on their own system in public. I was asked in Vegas why we had banned so many engines - clearly, they had taken notice - but no one had a answer except to ask why G was still allowed on the site.

Barry: What are your plans for the next 7 days? And then the next month or two in terms of these rogue spiders and non rogue spiders?

Brett: There has to be 5 pounds of turkey in the fridge and I think the last half of the pumpkin pie will be done by the end of the day ;-)

Other than that, I have a site search engine to finish debugging and then we have an open house at our new offices, Christmas travek, PubCon Australia, new employees training, and a spring PubCon in Boston to plan and flush out. Interesting times indeed!

> And then the next month or two in terms of these rogue spiders and non rogue spiders?

We have made alot of changes to the core bot detection architecture this last week. The members have been so helpful and giving with new ideas and new ways we can address the problem. There is no one magic bullet that is going to fix the problem, but a more polished approach using all the techniques is what we are working on. People have gone so far as to write custom code for us to use free of charge.

The one thing I would like to leave people with, is to download a few of the site ripper programs and run them against their own site. Test how easily their site can or can not be crawled. There will be site owners that will be shocked to see their site is either completely crawlable without regard for robots.txt, or uncrawlable because of various site architecture problems. There is something there to be learned by every site owner.

Barry: Well thank you for spending the time answering my questions. I wish you all the best and I hope everything works out in the long run.

Brett: Thank you.

posted rustybrick in SEO Forum News at November 28, 2005 8:08 AM Comments (9)

Engagement Party & Sandbox ;)

Remember how I proposed? Well, it drove lots of natural links from quality sites all within a few days. Yahoo!'s linkdomain command brings back 223 links, whereas the Yahoo! Site Explorer tool brings back 129 links to the domain name. Point being, I know "yisha" isn't a competitive term, but that has little to do with ranking number one in a matter of months at Google.

barry-yisha.gif

Shows you what a creative idea can do for ones search rankings.

On a related note; the engagement party is this weekend.

Have a good weekend all!

posted rustybrick in Miscellaneous at November 25, 2005 1:15 PM Comments (2)

Faster Way to Login to YPN

For some reason, whenever you go to https://publisher.yahoo.com/ it won't remember your password or auto fill, like AdSense. They programmed not to allow it for some reason. A thread at WebmasterWorld named Is there a direct link to the login? gives an other option.

I am not sure how secure it is to have this on your computer, you make that call.

<form method=post action="https://publisher.yahoo.com/portal/login.php" name=verify_form>
<input type=hidden name=username value=your_username>
<input type=hidden name=password value=your_passwd>
<input type=submit>
</form>
<body>
<script language="javascript">
document.verify_form.submit();
</script>
</body>

You can just save it locally or put it onto a https server

Forum discussion at WebmasterWorld.

posted rustybrick in Yahoo! Publisher Network at November 25, 2005 8:49 AM Comments (1)

Number One at Yahoo! But Where is the Traffic?

A thread I can relate to at WebmasterWorld is named #1 in Yahoo but No Traffic. I hold the number one position for a few top keyword phrases that, according to Overture's keyword suggestion tool, gets over 3,000 searches per day. And guess what, I may get five to ten referrals from it per day. The member at the thread reports the same;

We've jumped up to between #1 to #3 in Yahoo for the last week for a two word phrase that has 100k+ searches per month according to Overture, #*$!, and other sources. But our logs show almost no change in traffic.

Not only that, other members report the same thing.

In theory, I believe the more technology related the search phrase the less likely people are going to search on it at Yahoo! The less likely they will click on your result. An other member reports the same for niche travel industries.

Moderator Tigger adds;

I've been ranking at 2nd for years on Y for some of my main money terms and the problem I think is the SE just doesn't get used much. The same keyword when I was ;0( ranking on G pulled in 300-400 searches a day (same place 2nd) on Y I'm lucky if it pulls in 10 - its just a numbers game and Y just doesn't have the traffic.

Forum discussion at WebmasterWorld.

posted rustybrick in Yahoo! Search Optimization at November 25, 2005 8:39 AM Comments (4)

What If Google Said Thank You?

I'll take some blogger leeway here and expand on a WebmasterWorld thread named Google's way to say "Thank You" in the AdSense forum. Basically what this particular member noticed was an increase in earnings per click on his birthday and on thanksgiving. Specifically;

the day before my birthday --> + 94% compared to the two weeks average before and after that day!

Thanksgiving so far --> + 80%
compared to the two weeks before Thanksgiving

Now of course this is ridiculous but if you have read The Search you would think twice. Yes, this is most likely not happening now but can/will it happen? Yes it can.

Why can not Google give you a higher percentage of the pie, on special days? Why not? Would it make for good marketing? Would users love Google if they offered up special discounts on products on the week of their birthday? Would publishers love a higher EPC on special days? Hey some states offer tax free days? Tons of retailers have holiday sales. The future of search...

posted rustybrick in Google AdSense at November 25, 2005 8:21 AM Comments (1)

PageRank for Pages with NoIndex

A cre8asite member asks in a thread named PR on noindex pages?

Why Would goolge bother to include a PR for page that you don't want in the index... Seems odd to me...

Three different answers, all pretty valid, are brought up in response to the question.

(1) The toolbar is way out of date and inaccurate as a real time measurement.

(2) Ammon responds that this is typical of "Ghost PR" and links to two threads at Cre8asite, one from January 2003 and the other from July 2003.

(3) Then you got a detailed explanation of how the how Google indexes pages.

posted rustybrick in Google Optimization at November 25, 2005 8:15 AM Comments (0)

AdBright Conflicting with YPN Ads?

Some Yahoo! Publishers in the YPN program are receiving warning emails from Yahoo! that they can not run other contextual ad programs with YPN. The only thing is, these publishers are running AdBright in conjunction with YPN. And we all know AdBright is not contextually driven.

One member said;

I called Yahoo and they said that AdBrite wasn't specifically forbidden or specifically allowed. They said either it's a mistake by the review staff (which seems unlikely) or else it's another ad program that caused the problem.

Forum discussion at WebmasterWorld.

posted rustybrick in Yahoo! Publisher Network at November 24, 2005 8:50 AM Comments (0)

Google Bowling Supporters Thread

Have you ever had the Temptation to Google-Bowl? Well if you had, this Cre8asite Forum Thread is perfect for you. Upcoming star, randfish, writes;

Noting that Jagger brought back the ability to have spammy links knock sites back in the SERPs, I'm having a very hard time resisting asking some friends to give it a shot on a particularly hated competitor.

After that, he gets support from the forum - telling him to hold tight and be strong. Do not succumb to temptation, good will always prevail over evil. Yada yada. :)

Anyway, it turns into a fun thread, so if you got the time, check it out.

posted rustybrick in Google Optimization at November 24, 2005 8:40 AM Comments (0)

WebmasterWorld Reset Cookies: Login Required

If you try accessing a thread today at WebmasterWorld you will notice that you will have to re-login. It looks like the staff at WebmasterWorld has reset cookies, requiring all members (and rogue bots) to re-login in order to access any thread. I hope you remember your password, if not, I am sure there is a forgot your password link that will be used often today and tomorrow.

posted rustybrick in SEO Forum News at November 24, 2005 8:37 AM Comments (0)

Search Engines and Thanksgiving 2005

Last year we had creative logos from all Google, Yahoo and Ask - oh lets not forget gmail's logo. We even had a Turkey Day Google Backlink Update! Today, on Thanksgiving 2005, we have some new logos to share with you.

We have a logo from Yahoo! which links to the Yahoo! Holiday Guide 2005.

yahoo-thanksgiving05.gif

We have a logo from Ask Jeeves, which links to one of those nice smart answers on Thanksgiving. I also decided to post a thread on Ask's Turkey day logo at SEW Forums.

sdj_jeeves_thanksgiving[1].gif

We do not yet have an official Google Holiday logo for Thanksgiving yet. But Gmail does have a logo for the day.

gmailthanksgiving05.gif

Update: Google uploaded it's Thanksgiving logo, which links to a search on thanksgiving.

thanksgiving05.gif

Happy Thanksgiving All!

posted rustybrick in Miscellaneous at November 24, 2005 8:22 AM Comments (1)

Google AdWords at Bottom of Google SERPs!

That is right, Jeff Martin, over at Search Engine Watch Forums found Google testing Google AdWords ads, right under the listings in the Google results. He named the thread Google running ads beneath SERP results now - Very Yahooish and posted screen captures. He is the screen capture of the adwords at the bottom of the results.

adwords-bottom-thumb.gif

The search looks to be on click fraud detection, but it must be one of those geo tests, because I do not see the ads.

Forum discussion at Search Engine Watch Forums.

posted rustybrick in Google AdWords at November 24, 2005 8:07 AM Comments (4)

Sergey Brin All Dressed Up

This may be too funny for me not to post on. A thread elegantly named Sergey Brin -- Google Founder at Digital Point Forums shows Sergey "like you have (hopefully) never seen him before..."

sergey-dress.jpg

You can click on the image above for a larger version, shows more details. Some quotes from the thread;

He shows some characteristics of Michael Jackson
I always wondered what rich folks did to pass the time of day (or night).
Looks like it must have been casual friday at the Googleplex.
He really makes one ugly woman!

Shawn pinged me with the source of the image at http://www.db.stanford.edu/~sergey/photos/drag96.jpg.

posted rustybrick in Other Google Topics at November 23, 2005 2:55 PM Comments (4)

Froogle Feeds Work with Google Base

A WebmasterWorld thread named Froogle Feeds to Google Base reports that if you currently have a Froogle feed, you can simply reuse it, with no change, on Google Base.

So in short, just upload your current Froogle formatted files to BASE and they will work, you will NOT have to change over the required Fields as BASE states.

I did not try this myself, but it may work.

posted rustybrick in Other Google Topics at November 23, 2005 12:59 PM Comments (0)

Microsoft's Search Result Clustering Toolbar (SRC)

Nuclei, who just launched Article Distribution Center today, informed me of a new search engine out by Microsoft named Search Result Clustering Toolbar in Microsoft Research Asia. If you take a look at the how to use the search engine page, it will describe that you can use it for;

  • Query disambiguation
  • Sub-topics discovery
  • Fact finding of peoples
  • Relationship finding of peoples
  • Q&A

They also have a toolbar.

Forum discussion at Web Work Shop Forums.

posted rustybrick in Microsoft MSN Search at November 23, 2005 10:16 AM Comments (0)

Building a Large Site Map

A HighRanking's thread named Site Map Depth Question goes over the basics of building out a large site map. As you can see from the thread, you want to have less then a h