<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Slicksurface - Tech, Design &#38; SEO Blog &#187; Web Analytics</title>
	<atom:link href="http://www.slicksurface.com/blog/category/seo/web-analytics/feed" rel="self" type="application/rss+xml" />
	<link>http://www.slicksurface.com/blog</link>
	<description>Observations about technology, design, and search engine optimization by the staff of Slicksurface LLC - a design and technology company located in New York.</description>
	<lastBuildDate>Wed, 28 Apr 2010 16:09:54 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Google Images Now Also Served From WWW</title>
		<link>http://www.slicksurface.com/blog/2010-04/google-images-now-also-served-from-www</link>
		<comments>http://www.slicksurface.com/blog/2010-04/google-images-now-also-served-from-www#comments</comments>
		<pubDate>Wed, 28 Apr 2010 16:09:54 +0000</pubDate>
		<dc:creator>Jay Harper</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Image Search]]></category>

		<guid isPermaLink="false">http://www.slicksurface.com/blog/?p=337</guid>
		<description><![CDATA[A couple weeks ago I got quite worried about the Google Images traffic on one of my sites...

That's a pretty dramatic drop. I started going through all the scenarios of what might be happening. Then a couple days later I noticed this...

In other words, at the same time there was a drop in traffic from [...]]]></description>
			<content:encoded><![CDATA[<p>A couple weeks ago I got quite worried about the Google Images traffic on one of my sites...</p>
<p style="text-align: center;"><img class="aligncenter size-full wp-image-339" title="google images traffic down" src="http://www.slicksurface.com/blog/resources/2010/04/google-images-traffic-down.jpg" alt="google images traffic down" width="590" height="231" /></p>
<p>That's a pretty dramatic drop. I started going through all the scenarios of what might be happening. Then a couple days later I noticed this...</p>
<p style="text-align: center;"><img class="aligncenter size-full wp-image-338" title="google www traffic up" src="http://www.slicksurface.com/blog/resources/2010/04/google-www-traffic-up.jpg" alt="google www traffic up" width="590" height="228" /></p>
<p>In other words, at the same time there was a drop in traffic from images.google.com, there was an uptick in traffic from www.google.com/imgres. This means Google is now serving Google Images content off www.google.com.</p>
<p>So I didn't do anything wrong, and my Google Images traffic is just fine... Now, if <a href="http://www.slicksurface.com/blog/2008-06/google-images-google-analytics-frames">Google Analytics could only learn how to properly count Google Images traffic</a> we'd be set.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.slicksurface.com/blog/2010-04/google-images-now-also-served-from-www/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Images, Google Analytics &amp; Frames</title>
		<link>http://www.slicksurface.com/blog/2008-06/google-images-google-analytics-frames</link>
		<comments>http://www.slicksurface.com/blog/2008-06/google-images-google-analytics-frames#comments</comments>
		<pubDate>Fri, 06 Jun 2008 20:48:19 +0000</pubDate>
		<dc:creator>Jay Harper</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Web Analytics]]></category>

		<guid isPermaLink="false">http://www.slicksurface.com/blog/?p=161</guid>
		<description><![CDATA[Google Analytics severely underreports traffic from sites that frame the pages on your site including Google Images, Google Video and others.]]></description>
			<content:encoded><![CDATA[<p>Over the past week or so I've had a big change in my thinking regarding Google Analytics. I still think it's a wonderful service (you can't beat the price - for now...), but I'm starting to see some incredibly big flaws...</p>
<p>It all started with discrepancies between Google Analytics and Unica's NetInsight - big discrepancies for one particular site where NetInsight was showing about 60% more visits and visitors. You always expect some variation between analytics packages - but not 60%.</p>
<p>We first noticed that the pages with the biggest discrepancies were those that were most likely to get Google Images traffic. Then I noticed that many of the referral sources were roughly similar, but Google Images was way out of whack.</p>
<p><strong>Google Analytics Doesn't Report Pages Framed By Another Site</strong></p>
<p>I had a theory - that Google Analytics wasn't counting pageviews if the page was framed by another site. So I did a test. I set up a Google Analytics profile and made the tags only fire if they were in a frameset. The code looked something like this:</p>
<blockquote><p><code>if (top.location != location) {<br />
var framesTracker = _gat._getTracker("UA-287XXX-4");<br />
framesTracker._initData();<br />
framesTracker._trackPageview();<br />
}</code></p></blockquote>
<p>The first day I thought my theory had been proven wrong - traffic was showing up in the frames profile, but then it quickly dropped off to zero.</p>
<p style="text-align: center;"><a href="http://www.slicksurface.com/blog/resources/2008/06/ga-framed-blip.png"><img class="size-full wp-image-162" title="ga-framed-blip" src="http://www.slicksurface.com/blog/resources/2008/06/ga-framed-blip.png" alt="Google Analytics graphic showing framed content showing up the first day a profile is active, then nothing" width="500" height="108" /></a></p>
<p>In other words, the first day the profile existed framed pages were counted as page hits. The next day a filter kicked in and, with a few rare exceptions, framed pages never showed up again.</p>
<p>It's not just Google Images traffic that's not being reported, it's any site that frames pages on your site - so things like Yahoo! Image Search and MSN/Live Image Search are affected as well.</p>
<p>Why they would do this, I just don't understand. It's truly baffling. If your web server serves the page, then it should appear in the analytics for your site. Period. This means traffic from things like image and video search is grossly under reported in Google Analytics.</p>
<p>But it gets worse...</p>
<p><strong>Google Reports Some Traffic From Google Images As 'Direct' Traffic</strong></p>
<p>Let's take the example of Google Images (though it happens in other cases). You do a query and get a search results page with a bunch of thumbnails on it (<a href="http://images.google.com/images?hl=en&amp;q=harrier+hound&amp;btnG=Search+Images&amp;gbv=2" target="_blank" rel="nofollow">example</a>).</p>
<p style="text-align: center;"><a href="http://www.slicksurface.com/blog/resources/2008/06/gi-serps.png"><img class="alignnone size-medium wp-image-163" title="gi-serps" src="http://www.slicksurface.com/blog/resources/2008/06/gi-serps-300x220.png" alt="Google Images Search Results" width="300" height="220" /></a></p>
<p>You then click on an image and and get a page that's displayed using a frameset (<a rel="nofollow" href="http://images.google.com/imgres?imgurl=http://l.yimg.com/us.yimg.com/i/us/pets/brgd/harrier.jpg&amp;imgrefurl=http://pets.yahoo.com/dogs/breed-guide/hound-group/74/harrier/&amp;h=210&amp;w=210&amp;sz=10&amp;hl=en&amp;start=1&amp;sig2=rx5SfhFHClu5mhEvwhW1jQ&amp;tbnid=Qr2qtDMSo315ZM:&amp;tbnh=106&amp;tbnw=106&amp;ei=AZJJSO3nH5SypgSR--TcBA&amp;prev=/images%3Fq%3Dharrier%2Bhound%26gbv%3D2%26hl%3Den%26safe%3Doff%26sa%3DG" target="_blank">example</a>).</p>
<p style="text-align: center;"><a href="http://www.slicksurface.com/blog/resources/2008/06/gi-detail.png"><img class="size-medium wp-image-164" title="gi-detail" src="http://www.slicksurface.com/blog/resources/2008/06/gi-detail-300x264.png" alt="Google Images Image page" width="300" height="264" /></a></p>
<p>In the top frame is Google's information on the image. In the bottom frame is the page from the site with the image.</p>
<p>Now, if you click on any link in the bottom frame (that's being served by the site with the image), it does not remove the frameset. The frameset stays in place. Technically, I understand where this is coming from, but it's horrible from a user interface perspective. But remember what we established above - no pages that are framed by another site get reported in Google Analytics. So the person could spend a half hour looking at your site this way and you would never know it. They could purchase something from you and it wouldn't register as an e-commerce transaction in Google Analytics. This is bad - very bad...</p>
<p>Realizing this, I decided to add target="_top" to all of the links on the site to ensure that if the person clicked on a link on our page the frameset would be removed.</p>
<p>That lead to some confusion the following days when we saw a spike in direct traffic and didn't understand where it was coming from...</p>
<p style="text-align: center;"><a href="http://www.slicksurface.com/blog/resources/2008/06/ni-direct-traffic-spike.png"><img class="size-full wp-image-165" title="ni-direct-traffic-spike" src="http://www.slicksurface.com/blog/resources/2008/06/ni-direct-traffic-spike.png" alt="Spike in direct traffic in Google Analytics" width="500" height="109" /></a></p>
<p>After a while we realized it was coming from Google Images (or more precisely, from framed pages). Let me explain...</p>
<p>Because Google Analytics doesn't count the first framed page as traffic on your site (even though it's served by your site), that page just doesn't exist in Google Analytics. But it's that page that has Google Images as a referrer. So, by losing that page you lose the fact that Google Images sent you the traffic because the first page that Google Analytics counts is the one that the person went to when they clicked on the link on your framed page. Hence, the first initial referrer according to Google Analytics was the framed page on your site. That appears (in Google Analytics) as direct traffic (not a referral or organic traffic) since your own site can't be a referrer for your site.</p>
<p>Google's help pages are inaccurate <a rel="nofollow" href="http://www.google.com/support/analytics/bin/static.py?page=troubleshooter.cs&amp;problem=tracking&amp;selected=tracking_frames&amp;ctx=tracking_tracking_frames_55598&amp;aw_referral=" target="_blank">when they say</a>:</p>
<blockquote><p>If frames on the site reside on different domains, the referral information is likely to be inaccurate, since one frame may be recorded as the referring source of another, instead of a previous site being recorded as the referring source.</p></blockquote>
<p>It's more than the referral information being inaccurate - they're removing much of the traffic completely.</p>
<p>This does not mean you lose a record of all the traffic from Google Images. If the user clicks on a link on a page controlled by Google Images, for example the "remove frame" or "original context" links in the top frame, then since that action took place on a Google Images page, it will come through as a Google Images referral. But that's just a fraction of your actual Google Images traffic. If someone looks at the one framed page and leaves - they'll never show up in Google Analytics at all.</p>
<p><strong>Almost Impossible To Get Search Keywords For Image Search</strong></p>
<p>To make matters worse, even though Google Images is a search engine, it's pretty much impossible to treat it as an organic traffic source. Beyond the fact that you don't see all the traffic, the problem is one of URLs. Let me explain...</p>
<p>For starters it would be nice if we could just add something like this to the tracking code:</p>
<blockquote><p><code>pageTracker._addOrganic("images.google.com","q");</code></p></blockquote>
<p>In theory that's supposed to take any referral from images.google.com, find the 'q' query parameter and record the contents of that as the search keyword.</p>
<p>But the 'q' parameter is only found in the URL of the search results page. But the search results page is the referrer for the frameset that shows the page from your site at the bottom. The referrer for your page is the frameset, not the search results page. The frameset does, sorta, have the q parameter in it, but it's hidden inside the 'prev' parameter and looks something like this...</p>
<blockquote><p><code>&amp;prev=/images%3Fq%3Dharrier%2Bhound%26gbv%3D...</code></p></blockquote>
<p>Notice the q%3Dharrier%2Bhound... That's not nearly as neat as it was on the search results page where it was &amp;q=harrier+hound.</p>
<p>Now, some people have written some fairly complex routines to parse the q parameter out of that string, but it only works some of the time. When you consider the fact that the Google Images traffic you see is only a fraction of the real traffic it's questionable whether it's worth it to even bother since it's just bad data and you'll be mixing it with your good organic data from regular web search. Mixing good with bad makes everything bad.</p>
<p><strong>It Doesn't Have To Be This Way</strong></p>
<p>Google Analytics has painted themselves in a corner to an extent by not properly handling framed documents. If they decided to change how they handled framed documents it would significantly impact the analytics of many sites. But I'm not sure that a good excuse for continuing the way things are...</p>
<p>What should happen is that if a page's referrer is a frameset of a known search engine, then the query parameter should be pulled out of the referrer for the frameset (should be possible with Javascript). And it goes without saying that things like Google Images and Google Video should be known to be search engines (they are not at the moment).</p>
<p>You would think Google would at least want to properly reflect the traffic they drive to sites. Right now they're serverly underreporting traffic from Google Images and Google Video.</p>
<p>Hopefully this will get better in the future. I should do a test to see how IndexTools handles the situation since, in time, it will be Yahoo! free competitor to Google Analytics.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.slicksurface.com/blog/2008-06/google-images-google-analytics-frames/feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Ways To Identify Bad Bots That Execute Analytics Tags</title>
		<link>http://www.slicksurface.com/blog/2008-05/ways-to-identify-bad-bots-that-execute-analytics-tags</link>
		<comments>http://www.slicksurface.com/blog/2008-05/ways-to-identify-bad-bots-that-execute-analytics-tags#comments</comments>
		<pubDate>Thu, 15 May 2008 22:03:38 +0000</pubDate>
		<dc:creator>Jay Harper</dc:creator>
				<category><![CDATA[Bad Bots]]></category>
		<category><![CDATA[Web Analytics]]></category>

		<guid isPermaLink="false">http://www.slicksurface.com/blog/?p=150</guid>
		<description><![CDATA[Bad bots are affecting web analytics that are based on Javascript tagging. Learn about two ways to combat these user-mimicking spiders. Knowing they're crawling your site is the first step in limiting their damage.]]></description>
			<content:encoded><![CDATA[<p>Since I wrote the post on<a href="http://www.seomoz.org/ugc/bad-bots-confound-web-analytics-by-executing-javascript-tags" target="ym"> how bad bots can wreak havoc on web analytics by executing the Javascript-based tags</a> that many analytics packages use to track users, I've been giving some thought to how one might identify analytics-executing spiders to lessen their impact...</p>
<p><strong>Segmentation</strong></p>
<p>One possible way is by using segmentation. Basically segmentation lets you set a variable for each user that you can use to group different types of users. You might have a segmentation variable with values like "employee", "customer", and "general public".</p>
<p>You set the segmentation variable for Google Analytics by adding a line to your tracking code that looks something like:</p>
<p><code>pageTracker._setVar("segmentation value");  [using new GA tracking code]<br />
__utmSetVar('segmentation value');          [using old GA tracking code]<br />
</code></p>
<p>The other tag-based analytics programs have a similar feature (often more advanced with the ability to set multiple variable values).</p>
<p>The theory behind this approach is that the bots may not actually execute the Javascript, they may see the tracking code on your page, recognize it as a tracking code, grab your Google Analytics ID and do a programmatic call to Google Analytics (<a href="http://www.vdgraaf.info/google-analytics-without-javascript.html" target="pvdg">as described by Peter van der Graff</a>). In this case, if you've set a segmentation value, they most likely won't be programmed to detect that and as a result they'll show up on your reports as "not set".</p>
<p>That said, at least in some of the cases I've seen, the bots are able to set segmentation variables, which makes me think the bots use embedded browsers in the same manner as <a rel="nofollow" href="http://www.websitescreenshots.com/" target="ws">the screen capture program WebShots</a> (which is controllable from the command line).</p>
<p><strong>Segmentation + A Honey Pot</strong></p>
<p>A better option is to use segmentation in addition to a honey pot. Traditionally, a honey pot is link a real user would never click on, which is excluded by robots.txt, so no well-behaved spider would click on it either. Only bad bots load honey pot pages. In this case robots.txt exclusion is not necessary, since no "good bot" currently executes Javascript-based analytics tags, and if a good bot ever did do that you could tell it was a good bot by how it identifies itself in it's 'user-agent' string.</p>
<p>Essentially, you're setting up a page and hoping the bad bot goes to the page. When the bad bot goes to the page you set the segmentation variable to something like "bad bot" and you can then see the bad bot in your analytics reports. Because you're using a segmentation variable, just about any report can be broken down by the presence of this value.</p>
<p>You'll want to make sure you don't then overwrite the segmentation variable. If you use a honey pot you may want to only set the segmentation variable on the honey pot page, that way you won't wipe out the value on a subsequent page. Or at least test the cookie with the segmentation value in it before setting other segmentation values.</p>
<p>This solution is far from perfect. For starters, the bad bot may not choose to go to your honey pot page. Some may and some may not. These bad bots are not spiders - they're not trying to find pages on your site. They're programmed to act like humans, so they'll just click on a few links and leave. However, if you see "bad bot" in your segmentation values, it will tell you you may have bigger problems.</p>
<p><strong>A Honey Pot Without Segmentation</strong></p>
<p>You can also use the honey pot concept without segmentation. In this case you'd simply look for the honey pot page in your content reports and see examine entrance paths to the page to determine who's sending you bad traffic.</p>
<p>This may be sufficient. While segmentation variables can be used on many different types of reports, because the segmentation variable won't be set every time a bad bot crawls your site, they may not be as helpful as you might think. The general knowledge that there are bad bots crawling your site and their entrance paths may be as good as it gets.</p>
<p>But do realize that just because a bot says it's coming from xyz.com, doesn't mean it's actually come from xyz.com - that can be faked too. The particulars of your situation will determine how best to interpret what you see in the data.</p>
<p><strong>Segmentation + Honey Pot, Take 2 (best option)<br />
</strong></p>
<p>Using both of the approaches above separately is probably the best solution.</p>
<p>To find bad bots that are executing the analytics tags programmatically, set a segmentation variable for all users (bad bots or not), and use the lack of a segmentation value to identify programmatic bad bots. Then you use path analysis on a honey pot to find the bad bots that are executing the Javascript with an embedded browser.</p>
<p>That gives you coverage on both types of possible bad bot implementations.</p>
<p><strong>Wrap Up</strong></p>
<p>Let me reiterate that there's no way to catch every bad bot - at least I can't think of a way. But if you use some of the strategies I've mentioned here and see that bad bots are crawling your site and affecting your analytics, then you're ahead of the game. The worst case scenario is that bad bots cause you to make bad decisions by changing your data and leading you to false conclusions. If you know your data have been affected, you're more likely to think twice and not rely on faulty data to make decisions.</p>
<p>And also remember, as I've discussed in a previous post, there are <a href="http://www.slicksurface.com/blog/2008-05/proving-the-value-of-seo-when-botnets-corrupt-your-analytics">ways to use segmentation to set up bot-free zones in your web analytics</a>. Segmentation on things like "Paying customers" or "registered users" can all be relatively immune to bot traffic.</p>
<p>Good luck!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.slicksurface.com/blog/2008-05/ways-to-identify-bad-bots-that-execute-analytics-tags/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Proving The Value Of SEO When Botnets Corrupt Your Analytics</title>
		<link>http://www.slicksurface.com/blog/2008-05/proving-the-value-of-seo-when-botnets-corrupt-your-analytics</link>
		<comments>http://www.slicksurface.com/blog/2008-05/proving-the-value-of-seo-when-botnets-corrupt-your-analytics#comments</comments>
		<pubDate>Thu, 15 May 2008 21:52:58 +0000</pubDate>
		<dc:creator>Jay Harper</dc:creator>
				<category><![CDATA[Bad Bots]]></category>
		<category><![CDATA[Hackers]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[SEO/SEM]]></category>
		<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Search Marketing]]></category>
		<category><![CDATA[Web Analytics]]></category>
		<category><![CDATA[fraud]]></category>
		<category><![CDATA[hacking]]></category>

		<guid isPermaLink="false">http://www.slicksurface.com/blog/?p=153</guid>
		<description><![CDATA[If distributed attacks on web analytics become common place, those of us doing SEO are in for a world of hurt... After all, how can you prove your worth if your statistics can't be relied on? But, while painful, botnet analytics hacking really isn't the end of SEO - let me explain...
Insuring that pages are [...]]]></description>
			<content:encoded><![CDATA[<p>If <a href="http://www.seomoz.org/ugc/bad-bots-confound-web-analytics-by-executing-javascript-tags" target="_blank">distributed attacks on web analytics</a> become common place, those of us doing SEO are in for a world of hurt... After all, how can you prove your worth if your statistics can't be relied on? But, while painful, botnet analytics hacking really isn't the end of SEO - let me explain...</p>
<p><strong>Insuring that pages are built correctly</strong></p>
<p>At it's heart, one of the core practices of SEO is to ensure that pages are built properly - that their focus and content can be easily understood by search engine spiders. "Web designers" just want pages to look pretty. "Web programmers" just want pages to be connected to data sources in cool ways or do some other cool trick. Neither of those groups really cares about how the pages perform. SEO will always need to be there to make sure the page can be understood by search engines. With the advent of botnet web analytics corruption, it may be difficult for an SEOer to prove s/he's done her/his job, but it's not impossible...</p>
<p><strong>E-commerce</strong></p>
<p>Sites that have e-commerce as their primary goal will be far less affected than "marketing" sites trying to get market exposure because e-commerce transactions will be difficult to fake. Conversion ratios may be corrupted, but the actual sales (in dollars, pounds, euros, etc.) will still be a concrete measure of success. Further, segmentation by things like "paying customer" will become invaluable.</p>
<p><strong>Registered users</strong></p>
<p>Building on segmentation possible with e-commerce, segmentation based on whether the user is registered will be invaluable as well. Using segmentation, you can look just at the activity of registered users which should be bot free provided you use a good captcha, and possibly e-mail confirmation. (At the very least it will take a very targeted attack to mess with your user segmentation data - it won't be affected by random attacks).</p>
<p><strong>Ranking in SERPs</strong></p>
<p>Your actual ranking in the SERPs is also something that can't be faked. Programs like <a rel="nofollow" href="http://www.webposition.com/" target="_blank">WebPosition</a>, while hated by the search engines, will be critical in measuring success and validating organic traffic. [If you're getting traffic off the keyword 'art', but don't rank for 'art', then you know you have a bot problem.]</p>
<p><strong>The cost of good analytics<br />
</strong></p>
<p>The analytics company that works the hardest to combat botnets will be the big winner here. Since Google Analytics, Microsoft Gatineau, and now IndexTools (Yahoo!), are free packages it's unclear whether Google, Microsoft or Yahoo! really have the incentive to combat the effects of botnets. I mean, look at Google's track record with click fraud in AdWords. Even if you accept the fact that they try to stop it, there's still plenty of it that goes undetected. If the tool that is best able to effectively detect and squash botnets is a paid service, there will be a very real increase in the cost of quality analytics.</p>
<p>It will also be more expensive to interpret analytics since you can't just take the numbers at face value. You'll have to jump through more hoops to come to a conclusion and it's not likely to be able to be done by people without significant experience. Those experts will come at a price.</p>
<p>I can also see 3rd party verification services cropping up that may not engage in SEO, but validate the results of SEO companies. Again, an added cost, though probably only something that would be done for very high-end projects.</p>
<p><strong>Conclusion</strong></p>
<p>Botnets aren't going to destroy the SEO industry. If anything they'll increase the level of professionalism. Word will get out that fraud and deception are possible with SEO and reputation will become absolutely critical. I'm guessing it will knock some smaller players (both customers and providers) out of the game - or at least knock them down to a place where they don't even try to validate results. But in cases where money talks, money will be able to buy at least "decent" analytics and the people who are best qualified to interpret the data.</p>
<p>Still, analytics hacking by botnets will become a much bigger problem in the future, and it will radically change the SEO industry...</p>
]]></content:encoded>
			<wfw:commentRss>http://www.slicksurface.com/blog/2008-05/proving-the-value-of-seo-when-botnets-corrupt-your-analytics/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Red Text Isn&#8217;t Always Bad In Google Analytics</title>
		<link>http://www.slicksurface.com/blog/2008-05/red-text-isnt-always-bad-in-google-analytics</link>
		<comments>http://www.slicksurface.com/blog/2008-05/red-text-isnt-always-bad-in-google-analytics#comments</comments>
		<pubDate>Fri, 09 May 2008 19:51:45 +0000</pubDate>
		<dc:creator>Jay Harper</dc:creator>
				<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Web Analytics]]></category>
		<category><![CDATA[search engine traffic]]></category>

		<guid isPermaLink="false">http://www.slicksurface.com/blog/?p=149</guid>
		<description><![CDATA[Red text isn't always bad in Google Analytics. Just because something went down relative to other things, doesn't mean it's not improving on it's own...]]></description>
			<content:encoded><![CDATA[<p>Take a look at the following graphic... What does it tell you about search engine traffic?</p>
<p style="text-align: center;"><img src="http://www.slicksurface.com/blog/resources/2008/05/ga-red-text.png" alt="Red text in Google Analytics looks like a problem, but isn't..." width="255" height="129" /></p>
<p>It would appear that your search engine traffic has gone down by 2.76%. But that's not actually the case at all. If you drill into search engines you see the following....</p>
<p style="text-align: center;"><img src="http://www.slicksurface.com/blog/resources/2008/05/ga-green-text.png" alt="Google Analytics showing improved search engine traffic" width="133" height="52" /></p>
<p>Organic traffic has actually gone up 5.7% even though the first graphic said it had gone down 2.76%. The reason is simple - search engine traffic went down <em>relative to the other types of traffic</em>. In other words, direct traffic went up more than 5.7%. The first graphic is the product of a zero sum game - if one area increases, the others have to decrease, even though, on their own they're increasing.</p>
<p>Let's put it another way. If direct traffic and referring sites had gone up 5.7% as well, then the percentages in the first graphic would all have been zero even though all of them went up 5.7%.</p>
<p>The take away is that when interpreting web analytics stats, be careful in your interpretations. Digging deeper can clarify initial misconceptions.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.slicksurface.com/blog/2008-05/red-text-isnt-always-bad-in-google-analytics/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IndexTools To Become Yahoo! Analytics?</title>
		<link>http://www.slicksurface.com/blog/2008-04/indextools-to-become-yahoo-analytics</link>
		<comments>http://www.slicksurface.com/blog/2008-04/indextools-to-become-yahoo-analytics#comments</comments>
		<pubDate>Wed, 09 Apr 2008 12:56:27 +0000</pubDate>
		<dc:creator>Jay Harper</dc:creator>
				<category><![CDATA[IndexTools]]></category>
		<category><![CDATA[Web Analytics]]></category>
		<category><![CDATA[Yahoo!]]></category>

		<guid isPermaLink="false">http://www.slicksurface.com/blog/2008-04/indextools-to-become-yahoo-analytics</guid>
		<description><![CDATA[Yahoo! is acquiring IndexTools. So, what exactly will be the outcome? Will IndexTools become free? Will it get shut down completely in favor of Gatineau if Yahoo! and Microsoft merge? Let's hope for the best...]]></description>
			<content:encoded><![CDATA[<p>I got an e-mail this morning from <a href="http://www.indextools.com">IndexTools</a> saying they've been acquired by Yahoo! and more details will be coming on April 15th. I'm guessing this means Yahoo! is trying to compete with Google Analytics which means IndexTools will become free.</p>
<p>That's both good and bad... One of the wonderful things about IndexTools was how responsive they were - they'd do all sorts of customization/integration for you at a reasonable price. I'm hoping that's not going to go away, but I'm afraid it will... On the other hand they were the analytics tool I really wanted to work with becaue,  like Google Analytics, they just seemed to "get" the issues and have a sensible, but powerful interface. I've had some exposure now to both WebTrends and Unica's NetInsight and can't say the same for either of them.</p>
<p>The bottom line is that it's possible we'll see another free competitor to Google Analytics that has many of the same qualities of Google Analytics in that it has a clear, no-nonsense, to-the-point interface. That's quite exciting.</p>
<p>It is curious timing however. In the middle of a possible merger between Yahoo! and Microsoft, Yahoo! goes and buys an analytics package when Microsoft is already working on <a href="http://www.slicksurface.com/blog/2007-07/would-you-trust-microsoft-with-analytics-data">Gatineau</a>. What would be horrible is to see a merger where IndexTools gets shut down completely in favor of Gatineau. Let's hope that never happens...</p>
]]></content:encoded>
			<wfw:commentRss>http://www.slicksurface.com/blog/2008-04/indextools-to-become-yahoo-analytics/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Google Analytics&#8217; New Weekly &amp; Monthly Graphs Should Be Used Carefully</title>
		<link>http://www.slicksurface.com/blog/2008-04/google-analytics-new-weekly-monthly-graphs-should-be-used-carefully</link>
		<comments>http://www.slicksurface.com/blog/2008-04/google-analytics-new-weekly-monthly-graphs-should-be-used-carefully#comments</comments>
		<pubDate>Fri, 04 Apr 2008 16:32:11 +0000</pubDate>
		<dc:creator>Jay Harper</dc:creator>
				<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Web Analytics]]></category>
		<category><![CDATA[graphing]]></category>

		<guid isPermaLink="false">http://www.slicksurface.com/blog/2008-04/google-analytics-new-weekly-monthly-graphs-should-be-used-carefully</guid>
		<description><![CDATA[Google Analytics' new weekly and monthly graphing is a wonderful enhancement. We find that monthly graphs usually smooth the data too much and should be avoid, however the weekly graphing enhances the understanding of trends even on sites with relatively even daily traffic.]]></description>
			<content:encoded><![CDATA[<p>This morning I logged into <a href="http://www.google.com/analytics/" target="_blank">Google Analytics</a> and found that there was <a href="http://analytics.blogspot.com/2008/04/new-graphing-options-by-week-and-by.html" target="_blank">a new feature on the dashboard - weekly and monthly graphing</a>.</p>
<p>On sites that have the typical business week roller coaster appearance when you look at a lot of data, say over a year, it can be difficult to see trends. For example here's a busy e-commerce site:</p>
<p align="center"><img src="http://www.slicksurface.com/blog/resources/2008/04/hh-daily.png" alt="Daily Google Analytics graph with a year of data" height="148" width="550" /></p>
<p>You can generally get a feel for what's going on in that graph, but it's much clearer in the graph below which has one data point for each week. (It looks like there were some problems in December)...</p>
<p align="center"><img src="http://www.slicksurface.com/blog/resources/2008/04/hh-weekly.png" alt="Graph of weekly visits in Google Analytics" height="146" width="550" /></p>
<p>Yes, you lose some detail, but when looking at a lot of data you're able to better see the forest rather than the trees. The roller coaster trends you see in analytics tend to be weekly trends with traffic high during the work week and low on the weekend. So a weekly graph smooths out those normal ups and downs and lets you see very clearly how you did on a particular week.</p>
<p>When you get to monthly summary graphs the trends aren't quite as detailed...</p>
<p align="center"><img src="http://www.slicksurface.com/blog/resources/2008/04/hh-monthly.png" alt="Monthly summary graph of a year’s worth of visits from Google Analytics" height="148" width="550" /></p>
<p>I suppose that is useful for businesses who think in terms of performance month-to-month, but thinking that way will hide downturns that are spread over two months and exaggerate ones that are contained completely within a single month. In other words, unlike the weekly graph, it's not smoothing out routine variations - it's just removing details (that may be important).</p>
<p>The weekly and monthly graphs are useful even on sites which don't see routine weekly variation. Here are the graphs from a blog site that has pretty consistent traffic (weekends are about the same as weekdays). We start with the standard daily graph...</p>
<p align="center"><img src="http://www.slicksurface.com/blog/resources/2008/04/rt-daily.png" alt="Daily graph from Google Analytics showing one year of data" height="150" width="550" /></p>
<p>That's a much smoother graph than its counter part from the e-commerce site above (the first graph), but this next graph (the weekly graph) makes the trends very clear without losing much detail...</p>
<p align="center"><img src="http://www.slicksurface.com/blog/resources/2008/04/rt-weekly.png" alt="Weekly graph from Google Analytics showing one year of data" height="149" width="550" /></p>
<p>Even though you can see the trends in the daily graph, that weekly graph really does a much better job of smoothing the line out and showing the general trend, though it does hide the very last spike in traffic.</p>
<p>The monthly graph however, once again smooths things out so much that it loses important details...</p>
<p align="center"><img src="http://www.slicksurface.com/blog/resources/2008/04/rt-monthly.png" alt="Monthly graph from Google Analytics showing one year of data" height="147" width="550" /></p>
<p>According to that graph the growth rate of the site is nearly exponential. It loses the recent spikes and makes thing seem a little too optimistic. The accurate message from the data should be "growth is good, but if spikes in traffic aren't maintained traffic may go down somewhat". The monthly graph makes it seem like growth into the future is unlimited.</p>
<p>Theh bottom line is that Google has added another useful tool, but as with all statistical tools you need to be careful that the conclusions drawn from these tools don't hide important trends in the data. In this particular case I believe the weekly graphs are a huge asset, but the monthly graphs aggregate too much data to accurately describe the underlying trends. Of course, as you get <em>years</em> worth of data accumulated, the monthly graphs will be more appropriate to show long-term tending.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.slicksurface.com/blog/2008-04/google-analytics-new-weekly-monthly-graphs-should-be-used-carefully/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>How To Know If All Of Prior Day&#8217;s Data Is In On Google Analytics</title>
		<link>http://www.slicksurface.com/blog/2008-03/how-to-know-if-all-of-prior-days-data-is-in-on-google-analytics</link>
		<comments>http://www.slicksurface.com/blog/2008-03/how-to-know-if-all-of-prior-days-data-is-in-on-google-analytics#comments</comments>
		<pubDate>Mon, 31 Mar 2008 15:51:38 +0000</pubDate>
		<dc:creator>Jay Harper</dc:creator>
				<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Web Analytics]]></category>

		<guid isPermaLink="false">http://www.slicksurface.com/blog/2008-03/how-to-know-if-all-of-prior-days-data-is-in-on-google-analytics</guid>
		<description><![CDATA[Google Analytics is not exactly a real-time service. Numbers filter in from a variety of sources and then when everything is in end-of-day processing is done and the numbers are finalized. However, this can happen much later than you think... This article tells you a quick way to know if the data have been finalized.]]></description>
			<content:encoded><![CDATA[<p>If you use Google Analytics you've noticed that a few hours after the day ends the prior day starts showing up in the default date range. And you may have also noticed that the number of visits for the prior day can change for hours after the day has actually ended.</p>
<p><img src="http://www.slicksurface.com/blog/resources/2008/03/google-analytics-data-not-in.png" alt="What Google Analytics looks like when not all of the prior day’s data are in" align="right" height="306" hspace="4" vspace="6" width="280" />There's nothing on the default dashboard that will tell you if Google is done processing data for the previous day. However, it's not hard to figure out...  Click on "Visitors" in the nav bar on the left side of the screen. You'll see an overview of visitor data that will include a section that looks something like the one on the right...</p>
<p>If the graphs for Absolute Unique Visitors and New Visits drop to zero on the last day, then Google has not done end-of-day processing, which means it's likely that all of the data are not in and your numbers for the previous day may change.  The screen grab to the right was taken at 11:30am the following day - so it can take Google quite a while to actually finalize the numbers for the previous day.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.slicksurface.com/blog/2008-03/how-to-know-if-all-of-prior-days-data-is-in-on-google-analytics/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Analytics Reporting Visits From Sites That No Longer Exist</title>
		<link>http://www.slicksurface.com/blog/2007-08/google-analytics-reporting-visits-from-sites-that-no-longer-exist</link>
		<comments>http://www.slicksurface.com/blog/2007-08/google-analytics-reporting-visits-from-sites-that-no-longer-exist#comments</comments>
		<pubDate>Tue, 21 Aug 2007 14:40:00 +0000</pubDate>
		<dc:creator>Jay Harper</dc:creator>
				<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Web Analytics]]></category>

		<guid isPermaLink="false">http://new.slicksurface.com/blog/?p=81</guid>
		<description><![CDATA[It's weird to see traffic from a site that no longer exists in your web analytics reports, but that actually is the sign of a good analytics package and a good site. This happens because the users who were referred by the site keep coming back and the original referrer keeps getting the credit for the traffic.]]></description>
			<content:encoded><![CDATA[<p>For a while now I've noticed that one on of my blog sites I'm getting referrals visits from a site that has been shut down for over three months...</p>
<p align="center"><img src="http://www.slicksurface.com/blog/2007/08/links-from-defunct-sites.png" alt="Google Analytics graph showing links from sites that no longer exist." height="385" width="540" /></p>
<p>This had me stumped until I noticed that the number of new visits was flatlined at 0%.</p>
<p>In other words, Google's referral data is based on either the first site a person came from or the most recent site. I'm guessing its the first site since that's a fair number of visits to still be having months after the site is shut down. You'd think they'd find the site some other way and if Google showed the most recent referrer the original referrer would be wiped out by now.</p>
<p>If you look at it from just before the site was shut down you can see that there was a fairly precipitous drop, but then the real converts have hung in there (and not cleared their cookies)...</p>
<p align="center"><img src="http://www.slicksurface.com/blog/2007/08/bingity-referrals.png" alt="links over 3+ months" height="147" width="540" /></p>
<p>So if you have links from sites that no longer exist - congratulations - it means you have some real converts to your site...</p>
]]></content:encoded>
			<wfw:commentRss>http://www.slicksurface.com/blog/2007-08/google-analytics-reporting-visits-from-sites-that-no-longer-exist/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Update on Google Analytics Outage</title>
		<link>http://www.slicksurface.com/blog/2007-07/update-on-google-analytics-outage</link>
		<comments>http://www.slicksurface.com/blog/2007-07/update-on-google-analytics-outage#comments</comments>
		<pubDate>Mon, 30 Jul 2007 20:01:00 +0000</pubDate>
		<dc:creator>Jay Harper</dc:creator>
				<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Web Analytics]]></category>

		<guid isPermaLink="false">http://new.slicksurface.com/blog/?p=77</guid>
		<description><![CDATA[As I found this morning, Google Analytics has been down for about two days. More precisely the data for the past two days is missing - the interface is working pretty normally otherwise. Other people are reporting all sorts of weird problems over the past few weeks (and particularly now) - everything from all their [...]]]></description>
			<content:encoded><![CDATA[<p>As I found this morning, <a href="http://www.slicksurface.com/blog/2007/07/google-analytics-down-for-nearly-two.html">Google Analytics has been down for about two days</a>. More precisely the data for the past two days is missing - the interface is working pretty normally otherwise. <a href="http://www.webmasterworld.com/analytics/3395441.htm">Other people</a> are reporting all sorts of weird problems over the past few weeks (and particularly now) - everything from all their traffic being reported as direct traffic, to big dips in traffic (compared to WebTrends), to the interface not working...</p>
<p>Having done a little research it seems the major problems started around 6pm (Eastern?) on Saturday the 28th , which means they're almost a full 2 days into the outage at this point.</p>
<p>Google finally put out <a href="http://analytics.blogspot.com/2007/07/update-temporary-reporting-delay-since.html">an announcement</a> telling people about the outage. It took Google nearly a day and a half to say anything about it, which is unfortunate - they usually do better. The announcement said that data from Saturday should be fine now, but that's not what I'm seeing at all - there's just as much missing as there always was.</p>
<p>The weird part is that a trickle of visits did manage to get through. But the really odd part is that it's not like one data center was up - the hits that got through are distributed all around the world, and there's a mix of organic, direct and referrals... Go figure.</p>
<p>I'd love to be a fly on the wall in their office right now, and would hate to be them...</p>
<p>Hopefully they'll get it fixed quickly.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.slicksurface.com/blog/2007-07/update-on-google-analytics-outage/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Would You Trust Microsoft With Analytics Data?</title>
		<link>http://www.slicksurface.com/blog/2007-07/would-you-trust-microsoft-with-analytics-data</link>
		<comments>http://www.slicksurface.com/blog/2007-07/would-you-trust-microsoft-with-analytics-data#comments</comments>
		<pubDate>Mon, 30 Jul 2007 14:06:00 +0000</pubDate>
		<dc:creator>Jay Harper</dc:creator>
				<category><![CDATA[MS Gatineau]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[privacy]]></category>
		<category><![CDATA[Web Analytics]]></category>

		<guid isPermaLink="false">http://new.slicksurface.com/blog/?p=75</guid>
		<description><![CDATA[Microsoft has announced its web analytics package - Gatineau. But the question is whether they will be as good as Google and the paid services at protecting the privacy of the data.]]></description>
			<content:encoded><![CDATA[<p>So it seems Microsoft has decided to compete with Google Analytics with it's upcoming <a href="http://www.google.com/search?q=gatineau+microsoft">"Gatineau" web analytics service</a>. So far, I'm not sure what I think of it. The graphics don't look as slick as <a href="http://www.slicksurface.com/blog/2007/05/google-analytics-redesigned-new.html">Google's new interface</a>, but my primary concern is privacy...</p>
<p>When you pay for analytics with services such as WebTrends and IndexTools you sign contracts and the data clearly belongs to you. With Google Analytics, since it's free the trade off is that Google gets to do pretty much what they want with your data, within the confines of their privacy policy. The same will most likely be true of Microsoft.</p>
<p>The thing is, Google seems to always defend user data from the prying eyes of governments. They've got a good track record. I don't think of Microsoft in the same way. Google may be the new Big Brother, but they're doing it in a very different way than Microsoft was Big Brother in the past - Google just seems more responsible about the obligations that come with power.</p>
<p>Analytics data is pretty sensitive information. I'm just not sure Microsoft will guard my data as vigorously as Google would. Not that I'm intending to start up any terrorist web sites, but it's more about the principle of it...</p>
]]></content:encoded>
			<wfw:commentRss>http://www.slicksurface.com/blog/2007-07/would-you-trust-microsoft-with-analytics-data/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Google Analytics Down For Nearly Two Days (So Far)</title>
		<link>http://www.slicksurface.com/blog/2007-07/google-analytics-down-for-nearly-two-days-so-far</link>
		<comments>http://www.slicksurface.com/blog/2007-07/google-analytics-down-for-nearly-two-days-so-far#comments</comments>
		<pubDate>Mon, 30 Jul 2007 10:21:00 +0000</pubDate>
		<dc:creator>Jay Harper</dc:creator>
				<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Web Analytics]]></category>

		<guid isPermaLink="false">http://new.slicksurface.com/blog/?p=74</guid>
		<description><![CDATA[Google Analytics appears to be down - for nearly two days now. All sorts of different sites show the same pattern - no data since 2 or 3am on the 29th.]]></description>
			<content:encoded><![CDATA[<p>There have been interruptions in Google Analytics in the past. Usually they're like <a href="http://analytics.blogspot.com/2007/07/notice-of-brief-processing-delay.html">this one</a> where they announce that they're collecting the data, but there will be a brief delay in having it show up in the reports. I've also seen cases where there's a slight delay in getting data into a particular account with others working just fine.</p>
<p>But what's happening now is far worse. From my estimations there's a huge delay and no notice posted on the <a href="http://analytics.blogspot.com/">Google Analytics blog</a>. Here's some evidence from a variety of sites...</p>
<p><span style="font-weight: bold">Slicksurface.com:</span><br />
<img src="http://www.slicksurface.com/blog/2007/07/slicksurface-analytics.png" alt="analytics for slicksurface.com" height="147" width="540" /></p>
<p><span style="font-weight: bold">AnimalHavenShelter.org:</span><br />
<img src="http://www.slicksurface.com/blog/2007/07/animalhavenshelter-analytics.png" alt="analytics for animal shelter web site" height="146" width="540" /></p>
<p><span style="font-weight: bold">A consumer health site:</span><br />
<img src="http://www.slicksurface.com/blog/2007/07/health-univ-analytics.png" alt="analytics for consumer health site" height="147" width="540" /></p>
<p><span style="font-weight: bold">The personal blog of a someone I know:</span><br />
<img src="http://www.slicksurface.com/blog/2007/07/bderek-analytics.png" alt="analytics for friend's blog" height="147" width="540" /></p>
<p><span style="font-weight: bold">Another personal blog:</span><br />
<img src="http://www.slicksurface.com/blog/2007/07/rt-analytics.png" alt="analytics for personal blog" height="146" width="540" /></p>
<p>These aren't sites with insignificant traffic and they're about as broad a spectrum of sites as you get and they're all showing exactly the same trend - no data since  2 or 3am on the 29th, and substantial under reporting starting at the morning of the 28th for some sites and the evening of the 28th for other sites.</p>
<p>The last day shown in each of those graphs is yesterday - 7/29. Which means they were down all day yesterday, a substantial portion of 7/28, and so far today...</p>
<p>What bothers me the most is the lack of communication on the part of Google. Hopefully they're still collecting data and it's just a matter of time before it all shows up again in the reports.</p>
<p>UPDATE (1pm ET): Google has <a href="http://analytics.blogspot.com/2007/07/update-temporary-reporting-delay-since.html">posted something on their blog</a> acknowledging the problem and saying it's just a reporting delay... Let's hope.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.slicksurface.com/blog/2007-07/google-analytics-down-for-nearly-two-days-so-far/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
