Stuntdubl Business Search Marketing Consulting

Echoeing Click Stream as an Algorithm Validator

Fun commentary, but technology sometimes sucks…Graywolf and I talked with Greg about click stream analysis and its’ potential impact on search engine results positions. Most people that talk about search engine rankings sometimes forget to realize that there are 100’s if not 1000’s variables to tweak in the search algorithms. Disclaimer: generally when I ramble on the radio, it is nearly all pure speculation.

Grab the podcast download of the show at Webmasterradio.fm

There are at minimum a good 100 + prominent variables or more for influence and rankings.

Qualifying for search click stream validation:
I think there may be the potential need to pass certain variable threshholds in order to validate the findings that a site should be in the top 10,20,50, etc.
Variables I would validate with toolbar data:

Top 8 Ideas for tracking Clickstream to Validate Quality Indicators
What I would do if I search relevancy was my goal:
-track clickthroughs on serps
-link clickthrough
-bookmarks
-history
-user data
-freshness
-community data
-social trend data

Graywolf, GoodROI, and I talked on the implications of click data in the mp3 download here for GoodKarma.

From threadwatch - clickstreams are dirty

Google patent

Rand on the historical patent

Notes:
Clickstream data is used to validate quality indicators
Example: influx of links from 10k sites clickstream data must validate that x% of the links are clicked on by users

Top most likely uses of toolbar data
1· Validation that links are for users (monitoring clickthrough)
2· Validation of site size to detect cloaking page filesize etc.
3· Understanding different types of sites different verticals have different behavior
4· Users will spend more time on a reviews site and visit periodically vs. less time on a directory type site
5· Number of times results are clicked

1 - history data relevant to:
2 -

  • The “number of times that a document is selected from a set of search results
  • The “amount of time one or more users spend accessing the document”
  • The relative “amount of time” compared to an average that users spend on a particular site/page

Statdubl says…stat I missed in the radio show.
MSN messenger is the MS community data at 26 - 28 min. range.

Dumbest thing out of my mouth: “it’s always gettin’ tougher and tougher…”.

Sources Cited:
Google historical data patent
Roger on community loyalty

What I learned…
MG is much smarter than I am.

Thanks for a great discussion guys.

I love Social Media! - Votes are noticed and appreciated:These icons link to social bookmarking sites where readers can share and discover new web pages.
  • del.icio.us
  • digg
  • Fark
  • Reddit
  • YahooMyWeb

4 Comments Leave a comment »

The URI to TrackBack this entry is: http://www.stuntdubl.com/2006/04/29/echoeing-click-stream-as-an-algorithm-validator/trackback/

graywolf
April 29th, 2006,
11:13 pm

nah I’m just much older, yep that was fun.

Phantombookman
May 1st, 2006,
12:56 pm

A great show, very interesting, as are your posts.

Many thanks

Barry Fish
May 5th, 2006,
6:17 am

If they use click data then how would they separate that from AdWords or Yahoo! clicks?

Todd Malicoat Interview - SEO Buzz Box
June 26th, 2006,
11:35 am

[…] I agree with your SEO proof concept, and I refer to it as quality validation. For instance, if a fantastic new site like Rand and Kat’s Web 2.0 Awards site is launched, you don’t want to keep it in the trustbox forever. I would be willing to bet that user data (toolbar, desktop search, firefox, etc.) are being used to validate the links that the site picks up. Just like link popularity, quality can be faked for a little while. That’s why there’s the need for quality validation. When the user data (whichever criteria are used) validates that the site is, in fact, a quality site, it is given a reprieve from the trustbox. The idea of quality validation is based loosely around the ideas of quality indicators being used. A quality indicator such as user volume or time on site could certainly be used to validate immense link growth within a given timeframe. There are probably plenty of other correlations that could be made with a large enough data set. Michael Gray (aka Graywolf) and I discussed this idea of quality validators and the use of clickstream data extensively with Greg Niland (GoodROI) on GoodKarma if people would like to hear more on the subject. […]

Leave a Reply