<?xml version="1.0" encoding="UTF-8"?> <rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:wfw="http://wellformedweb.org/CommentAPI/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
><channel><title>Praveen P.N &#187; hk</title> <atom:link href="http://praveenpn.com/blog/tag/hk/feed/" rel="self" type="application/rss+xml" /><link>http://praveenpn.com/blog</link> <description></description> <lastBuildDate>Mon, 16 Jan 2012 06:40:48 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3</generator> <item><title>Google.cn and http headers</title><link>http://praveenpn.com/blog/2010/03/23/google-cn-and-http-headers/</link> <comments>http://praveenpn.com/blog/2010/03/23/google-cn-and-http-headers/#comments</comments> <pubDate>Tue, 23 Mar 2010 12:42:35 +0000</pubDate> <dc:creator>Praveen</dc:creator> <category><![CDATA[Google]]></category> <category><![CDATA[china]]></category> <category><![CDATA[headers]]></category> <category><![CDATA[hk]]></category> <category><![CDATA[http]]></category> <category><![CDATA[statistics]]></category><guid
isPermaLink="false">http://praveenpn.com/blog/?p=136</guid> <description><![CDATA[Google has shut down their Chinese site(http://google.cn). I saw something interesting while looking at the HTTP headers used by Google for the redirect. Both the URLs google.com and google.com.cn use a 302 redirect to redirect traffic to Google HK and &#8230; <a
href="http://praveenpn.com/blog/2010/03/23/google-cn-and-http-headers/">Continue reading <span
class="meta-nav">&#8594;</span></a>]]></description> <content:encoded><![CDATA[<p>Google has shut down their Chinese site(<a
href="http://google.cn" target="_blank">http://google.cn</a>). I saw something interesting while looking at the HTTP headers used by Google for the redirect.</p><p>Both the URLs google.com and google.com.cn use a 302 redirect to redirect traffic to Google HK and that&#8217;s perfectly fine. The redirected URL doesn&#8217;t look clean, not only is it NOT clean, I have a feeling they use URL GET parameters to keep track of some statistics.</p><p>Here&#8217;s how a sample header looked like:</p><pre style="overflow: auto;">
URL: http://www.google.cn/

GET / HTTP/1.1
Host: www.google.cn
User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9.2) Gecko/20100115 Firefox/3.6
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: en-us,en;q=0.5
Accept-Encoding: gzip,deflate
Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7
Keep-Alive: 115
Connection: keep-alive
Cookie: --removed for good--

HTTP/1.1 302 Found
Location: http://www.google.com.hk/url?sa=p&amp;cki=PREF%3DID%3D024bad9e5afabff3:U%3D957ffb38d60fef2f:FF%3D2:LD%3Dzh-CN:TM%3D1269345133:LM%3D1269345324:S%3DokxQi9JZRpNOay9b&amp;q=http://www.google.com.hk/&amp;ust=1269345354859186&amp;usg=AFQjCNGD11Zf8ak_X-V_y6RPXiFMeHqUQg
Cache-Control: private
Content-Type: text/html; charset=UTF-8
Date: Tue, 23 Mar 2010 11:55:24 GMT
Server: gws
Content-Length: 459
X-XSS-Protection: 0
</pre><p>Most of it is pretty boring stuff, but check the Location URL again. I tried hitting the URL a couple of times and it looks like Google is using a very lazy way to keep track of the users redirected from Google china to Google HK.</p><p>ust = User Stats?</p><div
style="overflow: auto;">Location: http://www.google.com.hk/url?sa=p&amp;amp;cki=PREF%3DID%3D024bad9e5afabff3:U%3D957ffb38d60fef2f:FF%3D2:LD%3Dzh-CN:TM%3D1269345133:LM%3D1269345324:S%3DokxQi9JZRpNOay9b&amp;amp;q=http://www.google.com.hk/&amp;amp;ust=<strong>1269345354859186</strong>&amp;amp;usg=AFQjCNGD11Zf8ak_X-V_y6RPXiFMeHqUQg</div><p>I looked at the traffic for a few minutes and here are the numbers I got. All of them in pretty sweet increasing order.</p><p>1269345<strong>163308698</strong><br
/> 1269345<strong>240966267</strong><br
/> 1269345<strong>306123236</strong><br
/> 1269345<strong>354859186</strong><br
/> 1269345<strong>574993411</strong></p><p><strong><span
style="font-weight: normal;">This could be real or just something I got completely wrong <img
src='http://praveenpn.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </span></strong></p> ]]></content:encoded> <wfw:commentRss>http://praveenpn.com/blog/2010/03/23/google-cn-and-http-headers/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> </channel> </rss>
<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk: basic
Page Caching using disk: enhanced

Served from: praveenpn.com @ 2012-02-05 13:13:42 -->
