Share

Wednesday, January 25, 2012

Standard Metrics to calculate Time on Page & Time on Site


I was merrily using Time on Page and Time on Site metrics for quite some time before I actually realized how they were being measured.
It was a real Doh (!) moment.
Turns out we have not rfid'ed every visitor and they don't rub their head against their monitor before starting their session on my website (and of course another head rub when they decide they have had enough and exit). This would allow us to capture the time stamps accurately and have exact measures.
What a disappointment! Joking, just joking!! :)
I find that few people understand correctly how the Average Time on Site calculation is made.
That's regardless of source: weather they use the religious truth from a Competitive intelligence tool or from their website web analytics solution. For the latter it does not matter what data capture methodology you use on your site, WebLogs or JavaScript Tags.
This post is my humble attempt at explaining how Time on Page and Time on Site are computed.
In order to make life easy I am going to assume the following session happening on a website:
typical website session 2
Someone requests your home page, your web analytics tool starts a session for the visitor, two more pages are requested before the visitor decides to leave your website (close browser, type in a url of a different site, click on a link on your site to go to different site….).
What we want to compute is…..
how to compute time on site
Tp = Time spent on a page.
Ts = Time spend on the website.
Someone visits your website at 10:00……
start of website session
There is a entry in your log file (weblog or javascript tag, does not matter) that says, in English, "someone has requested the website homepage file at 10:00".
[It actually looks more like this....
111.111.111.111 - - [08/Oct/2007:11:17:55 -0400] "GET / index.html HTTP/1.1" 200 10801 "http://www.google.com/search?q=avinash+kaushik&ie=utf-8&oe=utf-8 &aq=t&rls=org.mozilla:en-US:official&client=firefox-a" "Mozilla/5.0 (Windows; U; Windows NT 5.2; en-US; rv:1.8.1.7) Gecko/20070914 Firefox/2.0.0.7"
Notice the time stamp there?
Why spoil the fun with technical stuff! But if you want more the tech stuff is explained very nicely here: Log file sample explained.
]
So far all your analytics program knows is when a page was requested, hence:
Tp = N/A [Not Available]
Ts = N/A
Next more fun happens on your site: someone clicks on a link to Page 2 from your home page. Hurray no bounce! :)
time on page one
Now there is a new entry in your log file that says: "The same Visitor requested page two at 10:01"
Finally your web analytics program can compute some time metrics!
It knows how long the Visitor spent on the home page. It subtracts 10:01 from 10:00 and gets one minute. Hence:
Tp (Home Page) = 1 minute.
Notice something important: the only way your program knows how long someone spent on one page is by looking at the two time stamps. One from the request for the first page and one from request of second page.
Next the blinking "get $200 rebate on a $210 product" link on Page 2 entices the person to click on to Page 3 to buy the product. More sweet success! "Engagement!!"
time on page two
The magical math outlined above happens (10:05 minus 10:01) and for Page 2:
Tp (Page 2) = 4 minutes.
The Visitor reaches Page 3 and notices that the rebate offer only applies to people who live in Antarctica who can show they currently own refrigerators (!!). As you can reasonably imagine this happens on the site on Page 3…..
time on page three
Exit!
How long did it take to find and read the rebate fine print? You could reasonably guess if you knew how long the Visitor spent on Page 3.
The problem is that your logfile is missing one time stamp to do the magic math.
Tp (Page 3) = Time of page request (10:05) minus Time of next page request (N/A).
Hence:
Tp (Page 3) = 0 minutes (Since N/A!)
The program has no idea how long the Visitor spent on the last page on your site.
This is true for pretty much all web analytics programs in terms of default behavior.
Let's wrap this puppy up….
time spent on site 1
Tp (Home Page) = 1 minute.
Tp (Page 2) = 4 minutes.
Tp (Page 3) = 0 minutes.
Ts = 5 minutes. (Time on Site, also known as Session Length)
Makes sense?
I mean it probably does not make sense that you don't know how long Visitors spend on the last page in the session, but the explanation of how it works makes sense?
The Case of Bounce / Single Page Sessions:
Now I am sure your are asking yourself: "I wonder what happens in cases of sessions where there is only one page?"
Good question. This is what you are wondering about….
time on page for bounce
Bounce!
This is what is computed….
time on site for bounce
Tp = 0 minutes.
Ts = 0 minutes.
For bounced sessions (single page view sessions) your web metrics program can't compute how long people have spent on the page or on your website. It certainly records the page request (10:00) and it records start of the session but it does not know how long someone was there.
In case the Visitor came at 10:00 and leaves the browser open and rushes as the Visitors Wife / Husband / Boss yells at them to do the dishes and cleaning the dishes takes and hour…. the session is terminated at the end of 29 mins (default setting in most session based web analytics tools). It is important to note that the web metrics program will still say:
Tp = 0 minutes.
Ts = 0 minutes.
The Case of Tabbed Browsing:
Firefox gets credit for popularizing tabbed browsing – I love it and I have no idea how we survived for years without it! The latest versions of IE also support tab browsing, so the masses are also now using this delightful feature.
But what happens to the Time on Page and Time on Site computations when people open a link on a site in another tab and browse the site via two tabs at the same time? I do this all the time! :)
It messes up the time computations.
Here is a common scenario that we'll use to understand the impact…..
time on site impact tabbed browsing
A Visitor comes to your home page. From there opens the first link in a new tab, but continues to scan the home page. The clicks onto a link to Page 2 from the home page, then onto Page 3 and then closes the tab (or moves away and forgets about it).
The Visitor goes to the tab opened from the home page to Page 4 of your site, spends time there, goes on to Page 5 in that tab. Then exits.
How is time on site computed? There are two ways in which I have seen web analytics tools report on this customer behavior.
Dealing with tab browsing # 1:
The web analytics tool takes the above picture literally and this happens…..
Outcomes: Two Sessions, one for each tab in the browser.
Session One (top): [referrer -> Google]
      Tp (Home Page) = 2 minutes
      Tp (Page 2) = 3 minutes
      Tp (Page 3) = 0 minutes
      Ts (session duration) = 5 minutes
Session Two (bottom): [referrer -> your site/homepage]
      Tp (Page 4) = 6 minutes
      Tp (Page 5) = 0 minutes
      Ts (session duration) = 6 minutes
Net net: 2 Visits. 1 Unique Visitor. Also notice the impact on referrers (for the second one you'll see your site referring to itself).
Really interesting outcome!
Dealing with tab browsing # 2:
Some web analytics tools "collect" all the "hits" (entries in the log files) and they will "linearize" the hits and construct one session from all the tabbed browsing Visitor behavior.
So keeping our use case exactly as above, visually this is what happens when data is processed……
time on site impact tabbed browsing linearized
[Pretty high resolution image: Click: Time on site impact tabbed browsing -linearized]
Outcomes: One Sessions, visit “reorganized” by time stamps.
Session One: [referrer -> Google]
      Tp (Home Page) = 1 minute
      Tp (Page 4) = 1 minute
      Tp (Page 2) = 3 minutes
      Tp (Page 3) = 2 minutes
      Tp (Page 5) = 0 minutes
      Ts (session duration) = 7 minutes
Net net: 1 Visit. 1 Unique Visitor.
Neither one perfectly captures exactly what the Visitor is actually doing on your site.
Which one do you prefer?
Be sure to ask your web analytics vendor which of the above two (or a different method) do they use to deal with tabbed browsing when it comes to computing time one site.
Given the increasing popularity of tabbed browsing the impact on your numbers could be big.
Google Analytics uses the second method ("linearization").
Take a deep breath now!
time on site explained
Extra Credit Section:
As always there are hacks that might allow you to capture the time spent on the last page (or even on the last event if you are using event logging).
One of the most common (and when I say most common I mean used by the 0.001%) is to add extra script / code that would capture the fact that the page was "unloaded" in the browser. Technically it is often called "onbeforeunload event".
In this case for each page you have a page request time stamp as well as a page unload time stamp. Hence you can do clean magic math required.
hacks extra coolYou will have to script this yourself or ask your vendor to help you create such code. You would then also have to ask your Vendor to modify how time on page (and time on site) are computed in the web analytics tool to take advantage of the additional time stamp.
If you are doing your own logfile parsing then you can more easily modify the log file to parse and data and then compute the metric for you.
There are other hacks as well.
Some people have also ventured that if your analytics tool has outbound link tracking (also often called exit tracking) then the web analytics tool has the time stamp of that click and can use it to compute time spent on the last page.
I am not a big fan of this because most people on your site will not exit the site by clicking on a link on your site (because most links on your site don't link to other sites, how inconvenient!). So if you use the outbound/exit link time stamps you'll end up computing Time on Site / Page for some Visitors to your site one way and everyone else another way.
It muddies waters, it becomes mixing apples with watermelons.
I am a fan of consistency, even if you consistently measure something imprecisely.
I hope that this post has helped you understand how time on website and time on a page is computed. Next time you see Average Time on Site or Average Time on Page you'll know what data is being included and what is not.
Also next time you compare numbers between a external tool, one of the competitive intelligence tools, and those reported by your internal web analytics tool, you'll know the questions to ask before you jump to conclusions!
Closing Thought : The purpose of this post is not to imply, overtly or covertly, that Time on Site is not a good metric. Far from it. For many types of businesses it can be a critical metric. My hope here is to educate you about how it is computed so that you can make more informed decisions. Computation of no web analytics metric is without flaws (look no further than Unique Visitors!!), time on site is perhaps one of the less flawed ones. :)

Wednesday, January 18, 2012

The Rise of Brands in Google's Relevancy Algorithms

Google Brand Promotion.

Local Search Ranking Factors.....


PURE LOCAL FACTORS

GENERAL SIGNALS

1 → 4 0.58 → 0.86 
  1. Place Page Criteria
    1.39 0.58 NC
  2. Off-PlacePage/Off-Site Criteria
    2.45 0.86 NC
  3. Review Criteria
    3.03 0.72 NC
  4. Website Criteria
    3.77 0.86 NC

SPECIFIC FACTORS

79 → 1 3.11 → 32.34 
  1. Proper Category Associations(PLACE PAGE)
    69.6915.63↑ 2
  2. Physical Address in City of Search(PLACE PAGE)
    69.5722.69--
  3. Manually Owner-verified Place Page(PLACE PAGE)
    64.0317.99↓ 2
  4. Volume of Traditional Structured Citations (IYPs, Data Aggregators)(OFF-PLACE/OFF-SITE)
    62.2423.67--
  5. Volume of Unstructured Citations (Newspaper Articles, Blog Posts)(OFF-PLACE/OFF-SITE)
    56.9622.39↑ 6
  6. Overall Volume of Reviews of Place(REVIEWS)
    54.6924.2↑ 3
  7. Local Area Code on Place Page(PLACE PAGE)
    52.5427.2↑13
  8. Proximity of Address to Centroid(PLACE PAGE)
    51.8730.74↑4
  9. Product / Service Keyword in Business Title(PLACE PAGE)
    49.5431.69↓ 9
  10. Product / Service Keyword in Place Page Description(PLACE PAGE)
    47.627.17↑4
  11. Crawlable Address Matching Place Page Address(WEBSITE)
    47.1828.61↑19
  12. Location Keyword in Business Title(PLACE PAGE)
    46.3632.29↑17
  13. Crawlable Phone Number Matching Place Page Phone Number(WEBSITE)
    45.8126.24↑24
  14. Product/Service Keywords in Reviews(REVIEWS)
    44.5126.28↑9
  15. Location Keywords in Reviews(REVIEWS)
    44.0628.07↑21
  16. Volume of Native Google Places Reviews (w/text)(REVIEWS)
    43.7530.39↑8
  17. Product / Service Keywords in Place Page Custom Fields(PLACE PAGE)
    43.6926.66↑16
  18. Association of Photos with Place Page(PLACE PAGE)
    41.3925.58↑1
  19. Location Keyword in Place Page Description(PLACE PAGE)
    40.6630.56↑12
  20. Volume of Third-Party Unstructured Reviews(REVIEWS)
    37.929.95↑27
  21. Association of Videos with Place Page(PLACE PAGE)
    35.7226.6↑14
  22. Product/Service Keywords in Anchor Text of Inbound Links to Website(OFF-PLACE/OFF-SITE)
    35.6930.26↑2
  23. Quality of Inbound Links to Website(OFF-PLACE/OFF-SITE)
    34.8730.04↓ 7
  24. Volume of GeoTagged Media Associated with Your Business (e.g. Panoramio, Flickr, YouTube)(OFF-PLACE/OFF-SITE)
    34.329.62↑4
  25. City, State in Places Landing Page Title(WEBSITE)
    33.7830.53↑7
  26. Volume of Native Google Places Ratings (no text)(REVIEWS)
    32.2727.64n/a
  27. Product / Service Keyword in Website URL(WEBSITE)
    32.2128.01↑11
  28. Overall Velocity of Reviews (Native + Third-Party)(REVIEWS)
    3230.9↓11
  29. Age of Place Page(PLACE PAGE)
    31.9629.66↑14
  30. PageRank / Authority of Landing Page Specified in Places(WEBSITE)
    31.9630.25↑18
  31. Numerical Percentage of Place Page Completeness(PLACE PAGE)
    31.9331.16n/a
  32. Location Keywords in Anchor Text of Inbound Links to Website(OFF-PLACE/OFF-SITE)
    31.929.42↓ 6
  33. PageRank / Authority of(WEBSITE)Homepage / Highest Ranked Page(WEBSITE)
    31.6930.04↑10
  34. Volume of MyMaps / MapMaker References to Your Business(OFF-PLACE/OFF-SITE)
    30.0626.8↓16
  35. Volume of Third-Party Structured / hReview Formatted Reviews(REVIEWS)
    29.932.34↓26
  36. Quantity of Inbound Links to Website(OFF-PLACE/OFF-SITE)
    28.9327.01↓9
  37. Location Keywords in Place Page Custom Fields(PLACE PAGE)
    28.928.81↑26
  38. Geographic Keyword in Website URL(WEBSITE)
    28.1528.48↑1
  39. Matching Google Account Domain to Places Landing Page Domain(PLACE PAGE)
    27.8726.02↑ 7
  40. Places Business Title in Anchor Text of Inbound Links to Website(OFF-PLACE/OFF-SITE)
    27.5430.89↓ 6
  41. Bulk Owner-verified Place Page(PLACE PAGE)
    26.631.3n/a
  42. Quantity of Inbound Links to Places Landing Page URL(OFF-PLACE/OFF-SITE)
    25.927.83n/a
  43. Marginal Category Associations(PLACE PAGE)
    25.4530.03↓ 22
  44. Quality of Inbound Links to Places Landing Page URL(OFF-PLACE/OFF-SITE)
    24.7831.03n/a
  45. High Numerical Ratings of Place by Google Users (e.g. 4-5)(REVIEWS)
    24.7228.1↓ 3
  46. Number of Actions Taken by Searchers on a Place Page (e.g. Driving Directions, Mobile Phone Calls)(PLACE PAGE)
    24.5126.91n/a
  47. Velocity of Native Google Places Reviews(REVIEWS)
    24.0928.24↓30
  48. Location Keywords in Anchor Text of Inbound Links to Places Landing Page URL(OFF-PLACE/OFF-SITE)
    23.4827.61n/a
  49. High Numerical Third-Party Ratings (e.g. 4-5)(REVIEWS)
    23.1528.91↓7
  50. Product/Service Keywords in Anchor Text of Inbound Links to Places Landing Page URL(OFF-PLACE/OFF-SITE)
    22.625.12n/a
  51. Domain Diversity of Inbound Links to Website(OFF-PLACE/OFF-SITE)
    22.4828.15n/a
  52. Verified KML File on Domain Name(WEBSITE)
    22.1826.35↓7
  53. Places Business Title in Anchor Text of Inbound Links to Places Landing Page URL(OFF-PLACE/OFF-SITE)
    21.2727.75n/a
  54. Domain Diversity of Inbound Links to Places Landing Page URL(OFF-PLACE/OFF-SITE)
    20.8126.42n/a
  55. Positive Sentiment in Reviews(REVIEWS)
    20.5727.27↓11
  56. Velocity of New Inbound Links to Website(OFF-PLACE/OFF-SITE)
    20.0926.5n/a
  57. City, State in Most/All Title Tags(WEBSITE)
    19.8728.28↓32
  58. Specifying Service Areas in a List for Place Page(PLACE PAGE)
    19.4525.01↓1
  59. Popularity (# of Views) of MyMaps References to Your Business(OFF-PLACE/OFF-SITE)
    18.6324.63↓18
  60. City, State in Places Landing Page H1/H2 Tags(WEBSITE)
    17.926.44n/a
  61. Velocity of Third-Party Reviews(REVIEWS)
    17.2127.37↓44
  62. Crawlable Contact Information in hCard Microformat(WEBSITE)
    15.1524.13↓12
  63. Matching, Public WHOIS Information(OFF-PLACE/OFF-SITE)
    1422.97↓7
  64. Specifying a Service Area via Polygon for Place Page(PLACE PAGE)
    13.7821.46↓7
  65. City, State in Most/All H1/H2 Tags(WEBSITE)
    13.6921.49n/a
  66. Inclusion of Coupon on Place Page(PLACE PAGE)
    13.3622.73↓14
  67. Velocity of New Inbound Links to Places Landing Page URL(OFF-PLACE/OFF-SITE)
    1221.99n/a
  68. Loadtime of Places Landing Page(WEBSITE)
    11.6619.02n/a
  69. Volume of Check-Ins on Popular Services (e.g. Foursquare, Gowalla, Facebook, Twitter)(OFF-PLACE/OFF-SITE)
    10.617.38↓ 15
  70. Volume of Crawlable Testimonials(WEBSITE)
    10.1217.8n/a
  71. Volume of hReview Testimonials(WEBSITE)
    7.4516.2520
  72. Velocity of Check-Ins on Popular Services (e.g. Foursquare, Gowalla, Facebook, Twitter)(OFF-PLACE/OFF-SITE)
    6.2414.83↓17
  73. Number of Likes on Matching Facebook Page(OFF-PLACE/OFF-SITE)
    4.4211.31n/a
  74. OTHER (Please specify in comments)
    3.9315.88n/a
  75. Number of Followers on Matching Twitter Account(OFF-PLACE/OFF-SITE)
    3.8411.3n/a
  76. Authority of Followers on Matching Twitter Account(OFF-PLACE/OFF-SITE)
    3.6312.1n/a
  77. Velocity of New Facebook Likes(OFF-PLACE/OFF-SITE)
    2.610.92n/a
  78. Participation in Boost or Adwords(OFF-PLACE/OFF-SITE)
    1.548.74↓20
  79. Velocity of New Followers on Matching Twitter Account(OFF-PLACE/OFF-SITE)
    0.723.11n/a

BLENDED LOCAL FACTORS

GENERAL SIGNALS

1 → 4 0.75 → 1.08 
  1. Website Criteria
    1.90 0.94 ↑3
  2. Place Page Criteria
    2.32 0.84 ↓1
  3. Off-PlacePage/Off-Site Criteria
    2.65 1.08 ↓1
  4. Review Criteria
    3.65 0.75 ↓1

SPECIFIC FACTORS

79 → 1 11.13 → 33.51 
  1. Physical Address in City of Search(PLACE PAGE)
    61.2429.31↑ 1
  2. PageRank / Authority of Website Homepage / Highest Ranked Page(WEBSITE)
    53.3627.61↑38
  3. Proper Category Associations(PLACE PAGE)
    52.3331.01--
  4. City, State in Places Landing Page Title(WEBSITE)
    51.1829.33↑28
  5. Manually Owner-verified Place Page(PLACE PAGE)
    49.7231.32↓4
  6. Volume of Traditional Structured Citations (IYPs, Data Aggregators)(OFF-PLACE/OFF-SITE)
    48.4228.17↓2
  7. Quality of Inbound Links to Website(OFF-PLACE/OFF-SITE)
    48.1528.68↑9
  8. Location Keywords in Anchor Text of Inbound Links to Website(OFF-PLACE/OFF-SITE)
    46.1528.65↑18
  9. Product / Service Keyword in Website URL(WEBSITE)
    46.0926.81↑29
  10. PageRank / Authority of Landing Page Specified in Places(WEBSITE)
    45.9630.43↑38
  11. Quantity of Inbound Links to Website(OFF-PLACE/OFF-SITE)
    45.8730.2↑16
  12. Overall Volume of Reviews of Place(REVIEWS)
    43.5127.86↓ 3
  13. Quality of Inbound Links to Places Landing Page URL(OFF-PLACE/OFF-SITE)
    42.6930.1n/a
  14. Product/Service Keywords in Anchor Text of Inbound Links to Website(OFF-PLACE/OFF-SITE)
    42.2729.01↑10
  15. Location Keywords in Anchor Text of Inbound Links to Places Landing Page URL(OFF-PLACE/OFF-SITE)
    42.1829.26n/a
  16. Crawlable Phone Number Matching Place Page Phone Number(WEBSITE)
    41.6629.76n/a
  17. Volume of Unstructured Citations (Newspaper Articles, Blog Posts)(OFF-PLACE/OFF-SITE)
    39.8129.38↓6
  18. Domain Diversity of Inbound Links to Website(OFF-PLACE/OFF-SITE)
    39.5428.66n/a
  19. Product / Service Keyword in Business Title(PLACE PAGE)
    38.5731.13↓12
  20. Geographic Keyword in Website URL(WEBSITE)
    38.0929.64↑19
  21. Domain Diversity of Inbound Links to Places Landing Page URL(OFF-PLACE/OFF-SITE)
    38.0630.43n/a
  22. Places Business Title in Anchor Text of Inbound Links to Website(OFF-PLACE/OFF-SITE)
    36.8730.05↓12
  23. Proximity of Address to Centroid(PLACE PAGE)
    36.7533.51↓11
  24. Crawlable Address Matching Place Page Address(WEBSITE)
    35.9630.9↑6
  25. Places Business Title in Anchor Text of Inbound Links to Places Landing Page URL(OFF-PLACE/OFF-SITE)
    35.7228.12n/a
  26. Local Area Code on Place Page(PLACE PAGE)
    35.5431.24↑11
  27. City, State in Places Landing Page H1/H2 Tags(WEBSITE)
    35.4830.04n/a
  28. Volume of Native Google Places Reviews (w/text)(REVIEWS)
    33.1229.85↓6
  29. City, State in Most/All Title Tags(WEBSITE)
    32.7231.88↓4
  30. Volume of Third-Party Unstructured Reviews(REVIEWS)
    32.629.1817
  31. Quantity of Inbound Links to Places Landing Page URL(OFF-PLACE/OFF-SITE)
    32.48