I enjoy stumbling onto new things, so I changed my default Firefox homepage from Google’s personalized homepage to Yahoo’s random-URL redirect (random.yahoo.com/bin/ryl) just to shake things up. After landing on content spam pages (MFA, made-for-AdSense) a few times when opening the browser in the morning, I began to wonder about their prevalence. After all, the web is a huge haystack, and those bogus pages must be occasional needles, right?

Curious, I tried 50 random pages from random.yahoo.com/bin/ryl. I’m assuming (big assumption) that Y! isn’t filtering all that much, save for language. That’s my guess because (a) all the results were in English, (b) three of the 50 were broken links, and (c) three of the 50 were porn sites.

Of the 50, four were clearly junk pages designed solely to generate search revenue. All four URLs were concatenations of two common dictionary words that didn’t make much sense together, strongly suggesting they were purchased by a ’bot. (The most amusing of the four was dochunter.com, which can’t seem to decide if the page is about hunting moose, choosing an MD, or, gasp, hunting doctors.)

This survey is decidedly unscientific, is based on a tiny sample, and depends critically on the randomness of random.yahoo.com/bin/ryl, which isn’t known.

But still, 4 in 50 is 8%, which is amazingly high, in my opinion. The web is well over 11.5 billion pages (and that estimate is over 18 months stale); 8% of 11.5 billion is over 900 million junk pages.

Even if this estimate is off on the high side by an order of magnitude, that still suggests roughly 90 million bogus content pages siphoning value from advertisers to spammers. Scary.
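For anyone who wants to check the arithmetic, here is a minimal Python sketch of the extrapolation. The Wilson score interval is not part of the original back-of-the-envelope estimate; it is added here only to show how much uncertainty a 50-page sample carries, and it assumes the 50 pages are a truly random sample, which, as noted above, isn't known. The function and constant names are made up for illustration.

```python
import math


def wilson_interval(successes, trials, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 is roughly 95%)."""
    p = successes / trials
    denom = 1 + z**2 / trials
    center = (p + z**2 / (2 * trials)) / denom
    half = z * math.sqrt(p * (1 - p) / trials + z**2 / (4 * trials**2)) / denom
    return center - half, center + half


# Figures taken from the post; the names are illustrative only.
SAMPLE_SIZE = 50    # random pages visited
JUNK_PAGES = 4      # pages judged to be content spam
WEB_PAGES = 11.5e9  # stale estimate of total web size

rate = JUNK_PAGES / SAMPLE_SIZE
low, high = wilson_interval(JUNK_PAGES, SAMPLE_SIZE)

print(f"point estimate: {rate:.0%}, about {rate * WEB_PAGES / 1e6:,.0f} million pages")
print(f"rough 95% range for the rate: {low:.1%} to {high:.1%}")
print(f"rough 95% range in pages: {low * WEB_PAGES / 1e6:,.0f} to "
      f"{high * WEB_PAGES / 1e6:,.0f} million")
```

The range that comes out is wide, which is just another way of saying the sample is tiny. It also says nothing about whether Yahoo’s random link is actually representative of the web.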
