{} A short history of website analytics | Textor Web Development | London
 
2 June 2008:A short history of website analytics

In this issue

Featured article
This month I try and explain the complex topic of website analytics.
Tips
Do you user Google adwords? If so you need to be aware of keyword Insertion.
From my Blog
The e-conveyancing juggernaut heads cliffwards - but very slowly.
News from the web
This month in the internet world

 


A short history of website analytics

No sooner had the first website been created, then I am sure the owner asked - is anyone looking?

Everyone wants to know how many people are visiting their website, how long they are staying, where did they come from and so on. This is the realm of website analytics.

Log analysis

All web servers keep logs of activity and the first analytics programs processed the logs to give reports.

The first thing we are interested in is how many visitors there were in a given time period.  This is not a simple question to answer because there are four different numbers.

  • how many server requests (aka hits)
  • how many web pages have been looked at (page views)
  • how many different times someone has come onto your web site and looked at pages before going away (visitor sessions)
  • how many different people have come onto the website (unique visitors)

So a statement like 'we had a million hits last month' doesn't mean a million people visited the web site, it might equate to:

  • 50,000 page views
  • in 20,000 visitors sessions
  • by 10,000 unique visitors.

In fact 'hits' is the least useful measure because it depends on how many server requests there are per page, which depends on the page design and construction.  A modern web page will often require 10-20 server requests to load it up.

What else can we find out?

We can also find out

  • what web page the user came to the site from
  • and, if that web page was a search engine, what keywords were searched for 
  • what type of computer and operating system 
  • and what web browser
  • what internet address and from that (sometimes)
  • what company or what location

So there is a lot of information here, which can be sliced and diced in a number of ways.

Is log analysis the only way?

The problem with log analysis is that it involves processing very large log files and this needs a lot of computer power.  It often takes time, and to get decent numbers, money. But this is the internet, this is the 21st century, we want answers NOW.

So as the technology developed it became possible to produce results in real time. This approach depends on inserting a small amount of code into every page on your web site (tags). These tags send a message to a central server somewhere which is recording every page view. By logging into the server you can get results in real time - or close to it.

Log analysis vs Tags

The two aproaches, log analysis and tag-based analysis measure different things and estimate things in different ways, so will never give identical results, in fact they will often be quite different.  For example:

  • tag-based analysis will never give a figure for hits. No big deal because it is a meaningless number anyway.
  • tag-based analysis is a bit short on server performance measurement, for example it can't work out how much bandwidth the site has been using.
  • tag-based analysis will not record visits by non-browser visitors, particularly search engine spiders. So you won't know when Google is indexing you.
  • log analysis will not record visits when web pages are cached locally, by the browser or by the visitor's network

However trends should be the same and with enough work you can make the results reasonably consistent.

If you want to a really top-notch job you need to do both and then spend a lot of time correcting and rationaising the data. You really have to want to do this because it is not easy.

Google wades in

There have always been a number of free analytics programs. The most well-known is Webalizer. The two most well-known of the commercial systems in the late 90's were Webtrends and Urchin. Both offered log anlysis and tag based systems.

Then in 2005 Google bought Urchin. They did (or didn't do) two things of note:

  • they continued to develop the tag-based service and offered it for free (it was previously $495 dollars a month)
  • work on the log analysis product seemed to come to a halt. Finally, the first new release since 2004 was announced in April 2008.

Don't let the 'free' price-tag fool you, it is a significant product and gives pretty good results. It was worth $495 a month, and for most users it is all they will ever need.

When your main competitor offers their product for free there is only one way to go. Upmarket. This is what Webtrends have done. A product that was once $495 dollars to puchase is now thousands of pounds plus a hefty annual maintenance fee. This cut out their previous mass market but opened up a new market - the corporate users who don't feel they have bought a proper piece of technology unless they have handed over a significant wedge.

Major players

In a recent survey 66% of responders used Google Analytics. Over 50% used Webtrends or Omniture. The rest were around 5% each or less.

There were over 50 vendors at Internet World this year listed under the Analytics/Reporting heading - which did not include Webtrends or Omniture. 
 

There is a very interesting dicsussion of the strengths and weaknesses of several of the products here.

Tips

Keyword insertion not as painful as it sounds

When you write an ad for Google adwords, it is important to include the keywords in the text of the ad. So if a user types in  'Motorola V8' you want your ad to say 'Motorola V8' not 'Mobile Phone'. This significantly improves your click-through rate.

  • Google will put the keywords in bold making them stand out
  • Having searched for 'Motorola V8' the user will scan the page for the phrase 'Motorola V8' and tend to ignore ads that do not have that phrase.
  • It makes you look like you are more focused on the user's needs

So does this means that if you have 100 keywords you need 100 ads? Not any more. Using keyword insertion you can place the keywords right there in the ad. If the keywords are too large to fit, you can specify a default to be placed in the ad instead.

If your default is 'Mobile Phones', just use this format {KeyWord:Mobile Phones}. If someone searches for 'Motorola V8' that is what they will see. If their phrase is too long (e.g. 'I want a motorola V8 now now now!') then they will get the default (Mobile Phones)

Check out the help file for this for more details. You would be surprised how few Google advertisers use this really valuable feature.

From my blog

The e-conveyancing juggernaut continues towards the cliff edge

The story so far.

The land registry has decided to get into the 21st century. Which is a 'good thing', except they have gone the extra mile by replacing signed deeds with electronic documents containing a digital signature. In my belief this will not work - see my August 2003 newsletter on e-conveyancing for reasons why. 

Final completion of the project has in the last year been put back by four years and the current completion date is 2014-15 a staggering 16-17 years into the project (it all started in 1998). Who knows how many millions will have been spent, and the decision to go for digital signatures may well bring the whole project to its knees.

The most recent project was to introduce a system to create what is called a 'chain matrix'. This would provide buyers and sellers with a web page that tracked the progress of the buying chain typical in house purchase transations. This has been quietly shelved after a £4.6m pilot showed little interest in it.

This summer the digital signature (Private Key Encryption or PKI) part will start to come on-stream. This means lawyers will be signing everything on behalf of their customers. A recent isssue of the Law Gazette quotes David Parton, a very senior conveyancing professional: "‘I also think lawyers will be reluctant to sign on behalf of their clients". Duh!

This is a disaster waiting to happen.

News

Apple is to release Films on iTunes (in the US) as soon as the DVDs hit the stores.

Boeing tried to create an in-flight broadband service, which I would think would be a no-brainer as a business proposition. However they contrived to kill the business in 2006. Now Panasonic is taking a shot.

If you had to guess which computer manufacturer would rate badly in the green stakes, you probably wouldn't guess it was Apple.

If you are a Microsoft office 2007 user, expect to see service pack 1 on your machine next month.

A new search engine, that just searches Wikipedia has been released. The USP is that it uses conversational phrasing instead of keywords. I searched 'who acted in casablanca' and the results were impressive. A list of actors and links to their CVs. However I asked it questions about who our allies were at the battle of Waterloo and the Prussians were not mentioned.  The other big USP is that it doesn't have a wacky imposssible-to-spell name - www.powerset.com. Once the concept is proven with Wikipedia, expect this technology to move out to the rest of the web.

If you want an iPhone - forget it for now. Supplies are being run down prior to the new 3G iPhone which is going to be faster, better, more open and generally more exciting.

If you are interested in what is 'out there' then the new astronomy service from Microsoft is for you. Awsome.

search engine rewards program?  Microsoft are launching one.

German police are investigating a mother who put her baby on ebay. She says it was only a joke, but the child has been taken into care and she is facing psychiatric tests.