<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Yet another blog again</title>
	<atom:link href="http://govindkanshi.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://govindkanshi.wordpress.com</link>
	<description></description>
	<lastBuildDate>Sat, 21 Jan 2012 22:01:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='govindkanshi.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Yet another blog again</title>
		<link>http://govindkanshi.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://govindkanshi.wordpress.com/osd.xml" title="Yet another blog again" />
	<atom:link rel='hub' href='http://govindkanshi.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Hadoop Meetup at InMobi</title>
		<link>http://govindkanshi.wordpress.com/2011/12/10/hadoop-meetup-at-inmobi/</link>
		<comments>http://govindkanshi.wordpress.com/2011/12/10/hadoop-meetup-at-inmobi/#comments</comments>
		<pubDate>Sat, 10 Dec 2011 09:42:02 +0000</pubDate>
		<dc:creator>govindkanshi</dc:creator>
				<category><![CDATA[hadoop]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[InMobi]]></category>

		<guid isPermaLink="false">http://govindkanshi.wordpress.com/?p=435</guid>
		<description><![CDATA[Attended well conducted event at InMobi on Hadoop. All the kudos to InMobi folks to open up and share not just their work(yoda) but awesome food/drinks. Rarely have I seen a platter more generous than at yesterday&#8217;s event. Event &#8211; Event hosters Vinayak Hegde(Data platform owner), Sharad Agarwal (ex-yahoo,hadoop,yarn committer,  present platform head at InMobi) were [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=435&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Attended well conducted event at InMobi on Hadoop. All the kudos to InMobi folks to open up and share not just their work(yoda) but awesome food/drinks. Rarely have I seen a platter more generous than at yesterday&#8217;s event.</p>
<p>Event &#8211; Event hosters Vinayak Hegde(Data platform owner), Sharad Agarwal (ex-yahoo,hadoop,yarn committer,  present platform head at InMobi) were punctual, humble and kind. Vinayak and Sharad provided needed time checks, context and hoped to continue the effort with help from community. Turnout was varied right from recent newhires to people with multiple decades in industry. It had lot of yahoo (no-surprise) folks, Nokia (100 node cluster), Huawei (apparenly built a HA and have deployed cluster of x nodes), NetApp (Bejoy and team with y node cluster) , Mu-sigma(evaluating and using various pieces of hadoop).  <a href="http://jsensarma.com/">Joydeep </a>(ex facebook, hive creator) came around to see/meetup folks. I was looking for raptor folks from Sungard though.</p>
<p><strong>Sessions</strong></p>
<p>Sonal on <a href="https://github.com/sonalgoyal/crux">crux </a>- This talk had two pieces &#8211; how crux allows uses api based interaction with mapping and reporting of data inside HBase. Her intention was to get people to contribute and help out build the other moving pieces.Crux at present goes directly rather than in between translation from SQL like query language. She explained how one needs to design backend carefully to ensure efficient/performant data access is done. Crux allows composite keys, filters  but eod secondary indexes and like need to be thought about by the system designer. Since Crux is just a reporting tool &#8211; it can only do so much(idea is to be nice to get/range operations &#8211; how much a db guy likes these operators &#8211; seek vs scan(killl)). Kudos to her for getting something out and talking about it.</p>
<p><a href="http://sharadag.tumblr.com/">Sharad</a>&#8216;s <a href="http://www.slideshare.net/ydn/apache-hadoop-india-simmit-2011-p-the-next-generation-of-hadoop-mapreduce">talk </a>on next generation Hadoop clarified the present constraints and hence the goals of .23 world. HA &#8211; Restart of Namenode vs DataNode, Scalability of NameNode &#8211; sheer footprint of everything it needs to keep track of and respond to, Need to support alternative parallelizable algo-with no force fitting into MR. His talk was succint and filled with great depth. Idea of containers getting resource fullfillment from ResourceManager to get themselves created via NodeManager and then spawning off AppMaster to look after the applifecycle independently is the key.   Application lifecycle of either MR/Iterative/MPI could be managed independently and Resources can still be managed centrally. Important takeaway for older installations &#8211; no change. Their world remains the same.  Data Affinity based container spawning is possible &#8211; this looks interesting from perspective of reducing network io.</p>
<p>Yoda talk by Gaurav from InMobi was about the inhouse Datawarehousing/Reporting tool they built with few resources over short period of time. It was slick.  He explained the pragmatism to do custom development compared to using Hive or other tools. One of the important pivot for decision making was documentation, community support and &#8220;in-ordinate&#8221; spawning of Jobs without taking into picture metadata and layout of data. Looks like good solution to their issues and allows them evolution according to their needs.  It is niether designed as generic framework nor does it aim to be one.  This honesty from the data framework team was refreshing as they were not trying to boil the ocean and focused on their constraints (lack of massive clusters) , needs of analysts (inhouse/publishers).  It would have been great to see how they choose plans for execution, is it cost based or .., which operators they push up or down and on what basis &#8211; if it based on metadata of the data &#8211; how do they keep updating it?</p>
<p>From my perspective &#8211; it was also great meeting up folks like abhinasha from bizosys &#8211; thanks buddy for the beer and leading the assault on food counter.  For old person like me  - I am still looking at easier ways to adopt as end user.</p>
<p>- SQL DSL front end (for loading PIG is ok but presence of sqoop, scribe is explosion of choices &#8211; lot of time is spent in evaluation)</p>
<p>- debugging the performance of a given Query , how many combiners, partitioners, which operator gets mapped to how many jobs and how it takes care of affinity to data location (ideally less I get to know, much productive I will be), Relational DBs have made me lazier &amp; biased , Way to extract only given amount of data.</p>
<p>- Monitoring/Prioritizing and concurrent access for read/write are what will get us relational folks in easier way to that world.</p>
<p>The day MapR, Hadapt combine &amp; provide statspack(ish), DMVish, monitoring support, is the day real revolution will become. (For record I have not had time to look at much appreciated cloudera distribution).</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/govindkanshi.wordpress.com/435/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/govindkanshi.wordpress.com/435/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/govindkanshi.wordpress.com/435/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/govindkanshi.wordpress.com/435/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/govindkanshi.wordpress.com/435/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/govindkanshi.wordpress.com/435/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/govindkanshi.wordpress.com/435/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/govindkanshi.wordpress.com/435/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/govindkanshi.wordpress.com/435/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/govindkanshi.wordpress.com/435/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/govindkanshi.wordpress.com/435/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/govindkanshi.wordpress.com/435/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/govindkanshi.wordpress.com/435/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/govindkanshi.wordpress.com/435/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=435&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://govindkanshi.wordpress.com/2011/12/10/hadoop-meetup-at-inmobi/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/db06b5079c28bedc44c03860515c8e11?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">govindkanshi</media:title>
		</media:content>
	</item>
		<item>
		<title>Javascript frameworks/libraries/tools</title>
		<link>http://govindkanshi.wordpress.com/2011/08/19/javascript-frameworkslibrariestools/</link>
		<comments>http://govindkanshi.wordpress.com/2011/08/19/javascript-frameworkslibrariestools/#comments</comments>
		<pubDate>Fri, 19 Aug 2011 05:34:02 +0000</pubDate>
		<dc:creator>govindkanshi</dc:creator>
				<category><![CDATA[javascript]]></category>
		<category><![CDATA[Javascript libraries]]></category>
		<category><![CDATA[Javscript frameworks]]></category>

		<guid isPermaLink="false">http://govindkanshi.wordpress.com/?p=427</guid>
		<description><![CDATA[This is set of notes I have taken down over period of some time. I am just jotting down set of frameworks/libraries which can be possibly used in  js world. There are sort of two parts of this post 1. Why Framework of anykind for js based development 2. List of Interesting frameworks/libraries/tools in no [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=427&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This is set of notes I have taken down over period of some time. I am just jotting down set of frameworks/libraries which can be possibly used in  js world.</p>
<p>There are sort of two parts of this post<br />
1. Why Framework of anykind for js based development<br />
2. List of Interesting frameworks/libraries/tools in no particular order</p>
<p>So let us see what is the issue we are talking about. Let us say server side is going to be an endpoint (native app listener)<br />
or a controller (whether hosted natively for native app-thick client app  or zend/cakePHP/asp.net mvc/spring/seam etc), or a thrift/wcf endpoint we need</p>
<p>1. Logic to render the view in client browser/container or  at server via view (controller endpoint of either kind)<br />
2. Logic for serializing/deserializing, validation (basic vs domain specific),  routing<br />
3. Clean difference between layout &amp; data</p>
<p><strong>Options we have</strong><br />
1. Use of server-mvc heavy framework with js only used for  ajaxified/json-strigified talking back to ajax endpoint<br />
2. Use of mixed environment where client side mvc is used to organize  various moving parts and use server end points for backend interaction<br />
3. Create thin server with validation and thicker client side mvc</p>
<p>If we go down the path of jquery or similar tool to do client side  interaction, we have potential of large amount of code getting developed<br />
over period of time. Jquery/ExtJS also has lot of &#8220;browser&#8221; specific code and  this needs to be weaned out.</p>
<p>So one needs to choose a set of helper libraries/best practices to achieve the goal.<br />
<strong></strong></p>
<p><strong>Factors</strong><br />
a. Size of the framework/goals of the framework<br />
b. Dependencies required<br />
c. Updates and support<br />
d. Ease of testing &#8211; selenium support &#8211; at least for the main logic<br />
e. Tools support &#8211; perf/testing, profiler<br />
f. Familiarity &amp; comfort with patterns &#8211; pure mvc vs mvvm<br />
g. Duplicate code at server/client side</p>
<p><strong>Library/Framework requiring no introduction</strong><br />
1. jquery/prototype/dojo/mootools/ExtJS -<br />
Use base on comfort factor of the team.  No further explanations reqd.<br />
Restrict to specific usage and use dependency management system to load<br />
right modules.</p>
<p><strong>MVC Frameworks</strong><br />
1. SproutCore<br />
Apologies I have tried but not very successful at picking this up in  actual project &#8211; my bad &#8211; it is on my todo list one of these years.</p>
<p>http://sproutcore.com/</p>
<p>2. javascriptMVC &#8211; is a real heavyweight framework in real terms and does a &#8220;lot&#8221; of things. I am still a journeyman here but looks like in terms of  above 7 points this library comes closest to what can be used. It does take  time to grow on you. One thing which will comes as surprise is its dependency on java for compression, funcunit tests &amp; code generators(I guess).</p>
<p>http://javascriptmvc.com/</p>
<p>3. Spine<br />
Another mvc framework which looks like backbone but is more minimalistic. It is certainly lighter than javascriptMVC but both have different audiences. Key piece is the way inheritence implemented &#8211; it uses native prototype  based inheritence. I am at present not comfortable with its model creating/requiring those IDs and its persistence back to real db. Becuase they are mentioned in the main docs &#8211; some folks think of them as main actors in spine usage. Another important piece is spine can be much better integrated with backend rest endpoint.  One needs to be careful while using it&#8217;s ajax calls &#8211; as by default everything is asynchronous. Compared to javascriptMVC I find the idioms it follows to organize/extend very organic. (I could not formulate a better sentence to convey what it means to see x.controller.create {}) . Spine  has advantage in terms of api for html5 localstorage &amp; async operation. Folks well versed in backbone will find it simpler to adopt. One thing I do not like in terms of simplicity &#8211; init after every create(partly bec of prototypal inheritence).</p>
<p>http://maccman.github.com/spine/</p>
<p>4. Backbone<br />
Mentioning backbone after spine could be sacriligeous for some. But here is  another beautiful framework bit loaded towards model(eod data is god). It can be used with jquery or zepto.  It&#8217;s views are more of view-controllers(my own term). It gives a structure<br />
without tying models to the end dom &#8211; this is a big plus as you move away from jquery(trying to do many things there). Backbone uses<br />
underscore for handling collections of models.</p>
<p>http://documentcloud.github.com/backbone/</p>
<p>5. Cappucino &#8211; Another framework out of my league, ObjectiveJ is as seductive as ObjectiveC. Serious lack of motivation has prevented me<br />
from doing anything with it. I hear Mac only developers like the familiarity  &#8211; but again it is second hand information. Biggest advantage is no need to learn html or use it, css or manipulate  DOM. Cross compilation gets new meaning.</p>
<p>http://cappuccino.org/</p>
<p>6. Google Web toolkit &#8211; EOD write code in java and use libraries to  maximize productivity. Has lot of tools &#8211; unfortunately I have not<br />
used them personally except bits and pieces.</p>
<p>http://code.google.com/webtoolkit/</p>
<p><strong>Not a MVC framework but a great library to know and work with </strong><br />
1.Node **<br />
Node.js is an evented I/O framework(more of library plus whole ecosystem)  for V8(the default engine).Easily the most happening event oriented server  side library for js. LinkedIn has claimed fantastic performance increase over ror  (but hey they were using ruby &#8211; not fair to compare if you are not using 1.9 at least)</p>
<p>http://nodejs.org/</p>
<p>Windows support &#8211; Node 0.5.4 supports windows build via VS using GYP http://blog.nodejs.org/2011/08/12/node-v0-5-4/</p>
<p><strong>Miscellaneous libraries</strong></p>
<p>1. Store.js &#8211; Resolve cross browser localstorage issues. Uses localStorage, globalStorage, and userData behavior.</p>
<p>https://github.com/marcuswestin/store.js</p>
<p>2. underscore &#8211; It gives you useful functions you might normally have in your own utililties js file like each, map, isEmpty, isElement and more. Also  described as &#8220;It&#8217;s the tie to go along with jQuery&#8217;s tux&#8221;. Everything is available through _ &#8211; no really <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> . Functional programming grokkers will like this tool.</p>
<p>http://documentcloud.github.com/underscore/docs/underscore.html</p>
<p>3. requirejs &#8211; Handle dependencies in clean manner.  Use it to minimize loading time, organize dependencies in clear fashion. LabJs too have something similar.</p>
<p>http://requirejs.org/</p>
<p>4. socket.io -  When you need to do &#8220;realtime&#8221; communication while websockets support  evolves. Really wished present browsers could be reused rather than  large re-install with new browser. It is not fun to test for regression  for all apps in large organization. This is where socket.io comes in good. It has support for realtime goes from sophisticated(shiny-new-browser) to legacy(IE5.5/safari3). It uses range of technologies to provide graceful degradation for experience across available browsers and devices.</p>
<p>http://socket.io/#how-to-use</p>
<p>Code to browse &#8211; https://github.com/LearnBoost/socket.io/blob/master/support/node-websocket-client/lib/websocket.js<br />
Line 442 onwards is where one should concentrate for understanding key stuff.</p>
<p>Various transports supported</p>
<p>https://github.com/LearnBoost/socket.io/tree/master/lib/transports</p>
<p>Silverlight support-http://groups.google.com/group/socket_io/browse_thread/thread/6f6d8575ae887829<br />
Great thing for SL team to implement as plugin.</p>
<p>5. coffeescript<br />
Another code generator for javascript &#8211; love child of python/ruby. I  personally stay away from more abstractions.Good stuff &#8211; all variables<br />
are in local lexical scope, no pollution of global scope http://jashkenas.github.com/coffee-script/</p>
<p>6. LabJS &#8211; Another asynchronous script loader, allows as much parallel  load limited by browser settings. One can easily specify which<br />
scripts have execution order dependencies and LABjs will ensure proper  load order. Small size comes in handy.</p>
<p>http://labjs.com/</p>
<p>7. Zepto &#8211; Mobile/jquery friendly framework for webkit browsers. Main reason  to suggest this would be that other frameworks have lot of code for browsers not available on client devices (IE on iOS/Android as of today). In spirit  of doing the right thing &#8211; suggested doing custom work or use Zepto kind of framework. It uses mobile webkit features and is very small &#8211; 2k, has support for major gestures.</p>
<p>http://github.com/madrobby/zepto</p>
<p>8. CacheProvider &#8211; from twitter&#8217;s Dustin Diaz &#8211; allows you to cache  data both in memory and using localStorage if it’s available.</p>
<p>http://www.dustindiaz.com/javascript-cache-provider</p>
<p>9. NowJS &#8211; One more node goodness to keep track of. Node has become the tsunami of  all good things. Aim of NowJS is to make real-time applications easier to develop by synchronising functions and variables in a namespace  between clients and a server.</p>
<p>http://nowjs.com/</p>
<p>10. jsFiddle &#8211; Docs http://doc.jsfiddle.net/</p>
<p>http://jsfiddle.net/MNGpr/</p>
<p><strong>Templates</strong><br />
This is a topic which ignites passion as people cling to them like their shadow. Do you notice no mention of server side templates <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  &#8211; it all<br />
depends on the choice of infra you want to make.</p>
<p>1. Moustache -http://mustache.github.com/,http://icanhazjs.com/<br />
2. Jquery Templates &#8211; http://api.jquery.com/category/plugins/templates/<br />
Inspired by John resig&#8217;s template plugin<br />
3a. jqote &#8211; http://aefxx.com/jquery-plugins/jqote/ ( No like &lt;![CDATA[&#8230; absolutely not)<br />
3b. render &#8211; https://github.com/atduskgreg/srender/<br />
4. closure template &#8211; http://code.google.com/closure/templates/</p>
<p><strong>Engines</strong></p>
<p>1. Rhino<br />
Reminds of script# effort and is basically java for javascript  (compiler &amp; interpreter). Allows javascript to interact on server side<br />
with java objects (pojo/bean&#8230;). I am not a big fan of mixing these  two worlds. Let them talk to each other at an end point and use common<br />
serialization mechanism. Much touted DynamicInvoke feature is here  in Java 7 but it&#8217;s usage in future for Rhino -last I checked &#8211; months<br />
before this writeup was written is not yet there.</p>
<p>2. SpiderMonkey &#8211; Compiler and Interpreter with mark and sweep gc  written in c. Used at Mozilla. The famous piece is the<br />
Tracing JIT  -https://developer.mozilla.org/En/SpiderMonkey/Internals/Tracing_JIT</p>
<p>3. V8 &#8211; Complete VM/runtime. Since it has become part of Node it has  spread its goodness allaround.</p>
<p>http://code.google.com/apis/v8</p>
<p>I have no direct experience in usage of V8/SpiderMonkey.</p>
<p>Rhino I have evaluated for usage and that is how I know miniscule piece of it.</p>
<p>I am not making a particular OS as the requirement as I think any<br />
programmer worth his salt can pick up and decide on usage based on his constraints.</p>
<p>** Node.js will be available from joyent on Windows pretty soon</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/govindkanshi.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/govindkanshi.wordpress.com/427/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/govindkanshi.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/govindkanshi.wordpress.com/427/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/govindkanshi.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/govindkanshi.wordpress.com/427/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/govindkanshi.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/govindkanshi.wordpress.com/427/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/govindkanshi.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/govindkanshi.wordpress.com/427/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/govindkanshi.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/govindkanshi.wordpress.com/427/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/govindkanshi.wordpress.com/427/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/govindkanshi.wordpress.com/427/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=427&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://govindkanshi.wordpress.com/2011/08/19/javascript-frameworkslibrariestools/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/db06b5079c28bedc44c03860515c8e11?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">govindkanshi</media:title>
		</media:content>
	</item>
		<item>
		<title>Phpcloud event roundup &#8211; personal view</title>
		<link>http://govindkanshi.wordpress.com/2011/07/10/phpcloud-event-roundup-personal-view/</link>
		<comments>http://govindkanshi.wordpress.com/2011/07/10/phpcloud-event-roundup-personal-view/#comments</comments>
		<pubDate>Sun, 10 Jul 2011 16:26:59 +0000</pubDate>
		<dc:creator>govindkanshi</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[event]]></category>
		<category><![CDATA[flipkart]]></category>
		<category><![CDATA[hasgeek.in]]></category>
		<category><![CDATA[phpcloud]]></category>
		<category><![CDATA[zynga]]></category>

		<guid isPermaLink="false">http://govindkanshi.wordpress.com/?p=418</guid>
		<description><![CDATA[After a long time I atttended well concieved event. [updated : 11th July 2011] : Urls of site/people/projects [updated : 11th July 2011] : Renaming and few corrections/disclaimer. This is not complete view as I could not attend other fabulous sessions around membase, memcache,choosing right php framework and gearman. But I did hear great things [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=418&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>After a long time I atttended well concieved event.</p>
<p>[updated : 11th July 2011] : Urls of site/people/projects</p>
<p>[updated : 11th July 2011] : Renaming and few corrections/disclaimer.</p>
<p>This is not complete view as I could not attend other fabulous sessions around membase, memcache,choosing right php framework and gearman. But I did hear great things about them, hoping to catch them on videos.</p>
<p>Good things-</p>
<p>1. <a title="phpcloud event" href="http://phpcloud.hasgeek.in/" target="_blank">Event website</a>,  <a title="Schedule" href="http://phpcloud.hasgeek.in/#schedule" target="_blank">content</a>, <a title="doAttend" href="http://phpcloud.doattend.com/" target="_blank">payment process</a>, organizers</p>
<p>Conceptualization of event was validated with potential speakers, their talks and votes of interest. Kiran took pains to look at public profile and <a title="Check out the gearman talk details (voting etc on funnel website of hasgeek)" href="http://github.com/kertz" target="_blank">github of speakers</a>. A very nicely designed  website for potential attendees was the first temptation &#8211; really a blowback to vendor backed agency cookie cutter output. Although not all talks were focused on php or deep php , the theme connected cloud hosted php website.</p>
<p>Smooth checkout/payment integration was another plus. Wished there was a button, come later details will be preserved, as I struggled to justify staying out for full day away from little kiddo and other stuff as i entered details multiple times.</p>
<p>Another good thing was no over hype of changing the world kind of thing. A bunch of professionals who wanted to provide a reasonable venue for exchange of thoughts. More importantly no marketing only types or speaking and no work types projecting themselves as gift to humanity. This was apparent in most of the speakers especially zynga, facebook, capillary, mobstac &amp; some yahoo folks.(after the event impression meeting/exceeding the expectation).</p>
<p>2. Event venue &#8211; Dharmaram college inside christ college with good parking and plenty of trees was idyllic setup. College had really good quality projectors in place. Food, tea, biscuits were good- how can I say anything bad about food which provides pineapple gojju <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> . Restrooms were plenty and clean. Water was available when needed. Only challenge was Internet, which I personally thought was a good riddance.  Then there was event photographer<a title="Kushal the man with eye for details" href="http://kushaldas.in/" target="_blank"> Kushal Das </a>omnipresent with his gadgetry.</p>
<p>3. Content &amp; people</p>
<p>My expectations were  around learning best practices across the technologies. It was wonderful to see pythonistas around prepping for pycon. There were lot of offline discussions around chef, puppet, ganglia, capstriano. I saw folks coding up in both django and a template I could not make out plus pure lamp stack or just plain cgi.</p>
<p>Flipkart continous deployment</p>
<p>I personally liked flipkart&#8217;s sessions &#8211; real people sharing real insights, best practices with humility previously not seen elsewhere. Flipkart&#8217;s sharing of how they do continous deployment &#8211; major 6 + releases everyday &#8211; I thought I saw 30 , although speaker could have done with double shot of coffee <img src='http://s1.wp.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> , a person who knows his stuff  but sort of shy in speaking up. There are people with less than ounce of his depth who create unfortunately much perception of depth &#8211; they can talk without single shipping application.</p>
<p>Anyway coming back to content &#8211; main take away was use of debian packages to do deployment. They provision bunch of new machines after netboot to deploy right os, puppet to do configuration. Then finally debian packages are used to deploy the  code base, dependencies etc. Easy to roll back, audit. Some times somethings are  still done by hand(schema change). They can do rolling deployments most of the times. They do use very heavy cache and <a href="http://lucene.apache.org/solr/" target="_blank">solr for search</a>. This sessiom focussed on how every  commit results in build, deployment. They replicate as much of production environment into dev, something we push many customers at least for db size.</p>
<p>45 minutes were too short into get details of what tests they run, how they do perf testing, how they profile, use monitoring ( gomez/keynote) vs pingdom etc). Or dig into usage of logs to raise alerts, or dynamic monitoring of resource usage across php, jvm, <a href="http://nginx.org/" target="_blank">nginx</a>, <a title="ha proxy" href="http://haproxy.1wt.eu/" target="_blank">ha proxy</a> or <a title="varnish" href="http://varnish-cache.org/" target="_blank">varnish</a>, network components. As they shared later in another session they have home made load balance running linux in ha mode with multiple network cards to allow 3 networks down. <del>Amol</del>  Amod (thanks Mekin) clarified lot of questions wrt deployment, again very humble down to earth folks.</p>
<p>Flipkart performance tips</p>
<p>This talk by Siddarth was worth its time in Gold. Flipkart uses <strong>php minus apache</strong> front ended by ha proxy, varnish and nginx. I am assuming they use memcache. Explanation of various options and why they chose what they had was well detailed.</p>
<p>I think/assume usage of solr takes off whole lot of load and thrifted services are invoked only for real work with db. Since they do not store any state anywhere, near horizontal scale is possible at web, services layer. They have interesting component in java as threaded service/deamon which runs locally on every web server. This component  is the one which does async, parallel calls, does logging etc. Imho this is really the guts and choke point of app and I really could not get the point of this abstraction as they could talk from php to thrift service(except parallel calls). Availability of this component, failover was something time did not permit to get into. Caching, performance tools across proxy,cache(static-dynamic), web server, php, jvm(any particular gc setting?), solr,cache, db was not delved in deep as time was less. It would would been awesome to get into discussions around why of templates or no templates, models in php vs abstracted behind service layer. Similar discussion around use of jquery vs other frameworks, usage of cdn would have been great. One thing I did not see was use of any queue or analytics(not really part of the topic so okay) or nosql, functional lang ( was expecting some scala vs clojure adoption) discussion. Very-2 pragmatic and generous. Almost too generous for sharing pretty much their core. Even today it is difficult to find how amazon does its website scaling inspite all talk of openness etc.  Flipkart&#8217;s another good thing was that they did not boast because they use a &#8220;tech/lang&#8221;.</p>
<p><a title="change the healthcare" href="http://www.practo.com/" target="_blank">Practo </a>and usage of <a href="http://kr.github.com/beanstalkd/" target="_blank">beanstalkd</a></p>
<p>This particular talk by practo co-founder <a title="abhinav" href="http://twitter.com/abhinavlal" target="_blank">Abhinav lal </a>went into how he made the choice in favor of beanstalkd over rabbitmq or expensive message buses. He rightly called his usage as job queue where he pushes priority based messages into tube and has workers to process those messages. He uses ttr and persistence ( not clear whether it is automatically purged  after some time) to pull messages and process. Usage of supervisord to scale out, monitor the worker process was a neat trick he shared. Looked like he used php on worker as well as client side so serialization of data was no big deal. Overall beanstalkd came out as decent job queue and excelling at it. There is no fancy publish/subscribe, esb, jms infra on top of it. Again a great speaker.  I did bump into beanstalkd via <a href="https://github.com/tobi/delayed_job" target="_blank">delayedjob </a>earlier. So good validation of &#8220;scale&#8221; vs &#8220;usage&#8221;. Again a great value for money discussion.</p>
<p>Failure as an option by <a title="Vijay" href="http://twitter.com/#!/t3rmin4t0r" target="_blank">Vijay </a>from zynga</p>
<p>This was the talk delivered by a person who had guts to say things as they are. Specifically around cloud &#8211; unbound resources, elasticity, provisioning etc&#8230;.basically burst lot of myths&#8230; importantly he had imp advice &#8211; it is not possible to know/control all unknowns. So at least control your destiny where you can. He adviced to have few reserved or passive instances to overcome the tyranny of &#8216;avalanche &#8216; when cloud provider faces downtime, all clients try to provision more resources. He added other ideas around why/how to failover completely for x userbase vs half baked response to everybody. Now this requires lot of work and thinking at lot of layers &#8211; it would be ideal to <em>apologize only to people whose service went down rather than fail whale for everybody</em>. His suggestion of usage of ha proxy intelligently, not failing over all load of 5 machines to remaining 5 ( out of 10)if those 5 machines are at 50% + usage already. This would start cycle of provisioning and more load on already stressed infra.</p>
<p><a href="http://twitter.com/ravipratap" target="_blank">Ravi pratap</a>&#8216;s talk turned out to be  ec101 rather than being true to title of scaling. He I am sure could have done justice to that content too as he is great speaker with lot of hands on work. His explanations were crisp and he shared one imp thing thing- they have their own ami as expected. Again time was too short for him to get into details of scaling as covering basics took longer time. But would expect at some other location he can share his tips.</p>
<p>Eucalyptus talk was big let down unfortunately as it would have been easier to start of with demo rather than explain cloud again&#8230;certainly not cool to run down valid competition as malware.</p>
<p>Python tool fabric as deployment option by capillary (Nigel Babu) was completely new subject for me.</p>
<p><a href="http://thrift.apache.org/">Thrift</a> talk by capillary(<a href="http://in.linkedin.com/in/piyushgoel1" target="_blank">Piyush Goel</a>) on how they use it for communication from php to java service  was good tutorial. Main adv over protocol buffers was lack of rpc in protobuf.</p>
<p>Again thanks to @hasgeek for conjuring up goood meet. http://phpcloud.hasgeek.in</p>
<p>Conducted on 9th july 2011</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/govindkanshi.wordpress.com/418/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/govindkanshi.wordpress.com/418/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/govindkanshi.wordpress.com/418/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/govindkanshi.wordpress.com/418/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/govindkanshi.wordpress.com/418/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/govindkanshi.wordpress.com/418/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/govindkanshi.wordpress.com/418/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/govindkanshi.wordpress.com/418/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/govindkanshi.wordpress.com/418/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/govindkanshi.wordpress.com/418/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/govindkanshi.wordpress.com/418/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/govindkanshi.wordpress.com/418/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/govindkanshi.wordpress.com/418/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/govindkanshi.wordpress.com/418/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=418&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://govindkanshi.wordpress.com/2011/07/10/phpcloud-event-roundup-personal-view/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/db06b5079c28bedc44c03860515c8e11?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">govindkanshi</media:title>
		</media:content>
	</item>
		<item>
		<title>Suresh Bhat at his best</title>
		<link>http://govindkanshi.wordpress.com/2010/11/20/suresh-bhat-at-his-best/</link>
		<comments>http://govindkanshi.wordpress.com/2010/11/20/suresh-bhat-at-his-best/#comments</comments>
		<pubDate>Sat, 20 Nov 2010 15:20:09 +0000</pubDate>
		<dc:creator>govindkanshi</dc:creator>
				<category><![CDATA[Reading]]></category>
		<category><![CDATA[Marathi Ghazal]]></category>
		<category><![CDATA[Suresh Bhatt]]></category>

		<guid isPermaLink="false">http://govindkanshi.wordpress.com/?p=413</guid>
		<description><![CDATA[केव्हा तरी पहाटे केव्हातरी पहाटे उलटून रात्र गेली मिटले चुकून डोळे हरवून रात्र गेली कळले मला न केव्हा सुटली मिठी जराशी कळले मला न केव्हा निसटून रात्र गेली सांगू तरी कसे मी वय कोवळे उन्हाचे? उसवून श्वास माझा फसवून रात्र गेली! उरले उरात काही आवाज चांदण्याचे.. आकाश तारकांचे उचलून रात्र गेली! स्मरल्या मला न तेव्हा [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=413&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><strong><a>केव्हा तरी पहाटे</a></strong></p>
<p>केव्हातरी पहाटे उलटून रात्र गेली<br />
मिटले चुकून डोळे हरवून रात्र गेली</p>
<p>कळले मला न केव्हा सुटली मिठी जराशी<br />
कळले मला न केव्हा निसटून रात्र गेली</p>
<p>सांगू तरी कसे मी वय कोवळे उन्हाचे?<br />
उसवून श्वास माझा फसवून रात्र गेली!</p>
<p>उरले उरात काही आवाज चांदण्याचे..<br />
आकाश तारकांचे उचलून रात्र गेली!</p>
<p>स्मरल्या मला न तेव्हा माझ्याच गीतपंक्ती<br />
मग ओळ शेवटाची सुचवून रात्र गेली!</p>
<p>आता कुशीत नाही ती चंद्रकोर माझी..<br />
हलकेच कूस माझी बदलून रात्र गेली</p>
<p>अजुनी सुगंध येई दुलईस मोगऱ्याचा..<br />
गजरा कसा फुलांचा विसरुन रात्र गेली?</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/govindkanshi.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/govindkanshi.wordpress.com/413/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/govindkanshi.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/govindkanshi.wordpress.com/413/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/govindkanshi.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/govindkanshi.wordpress.com/413/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/govindkanshi.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/govindkanshi.wordpress.com/413/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/govindkanshi.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/govindkanshi.wordpress.com/413/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/govindkanshi.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/govindkanshi.wordpress.com/413/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/govindkanshi.wordpress.com/413/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/govindkanshi.wordpress.com/413/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=413&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://govindkanshi.wordpress.com/2010/11/20/suresh-bhat-at-his-best/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/db06b5079c28bedc44c03860515c8e11?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">govindkanshi</media:title>
		</media:content>
	</item>
		<item>
		<title>An afternoon with Mangesh Padgaonkar&#8217;s Gypsy &#8211; collection of poems 1950-53</title>
		<link>http://govindkanshi.wordpress.com/2010/11/19/an-afternoon-with-mangesh-padgaonkars-gypsy-collection-of-poems-1950-53/</link>
		<comments>http://govindkanshi.wordpress.com/2010/11/19/an-afternoon-with-mangesh-padgaonkars-gypsy-collection-of-poems-1950-53/#comments</comments>
		<pubDate>Fri, 19 Nov 2010 06:19:17 +0000</pubDate>
		<dc:creator>govindkanshi</dc:creator>
				<category><![CDATA[Reading]]></category>
		<category><![CDATA[Gypsy]]></category>
		<category><![CDATA[Mangesh Padgaonkar]]></category>
		<category><![CDATA[Poem]]></category>

		<guid isPermaLink="false">http://govindkanshi.wordpress.com/?p=408</guid>
		<description><![CDATA[विस्मरणाचे   दान तुझे ते विस्मरणाचे   दान तुझे ते सरले आज, आठवणींचा घेउनी आलीस सुंदर साज   Gift of amnesia faded today  Bouquet of your memories walked in today शिशिरामधल्या   उत्तर्रात्त्री दुरून एकला तारा करितो गूढ इषारा गहन कोषांत तमाच्या उरी कुणाची वेदना स्फुरे ? शिशिरामधल्या   उत्तर्रात्त्री आकाश मनात झुरे   Far away lonely star does a sign amongst the veins of darkness whose pain flows in the autumn nights sky&#8217;s heart reduces a [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=408&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<h2>विस्मरणाचे   दान तुझे ते</h2>
<div>विस्मरणाचे   दान तुझे ते सरले आज,</div>
<div>आठवणींचा घेउनी आलीस सुंदर साज  </div>
<div>Gift of amnesia faded today</div>
<div> Bouquet of your memories walked in today</div>
<h2><strong>शिशिरामधल्या   उत्तर्रात्त्री</strong></h2>
<div>
<p>दुरून एकला तारा</p>
<p>करितो गूढ <span id="6_TRN_1c">इषारा</span></p>
<p>गहन कोषांत तमाच्या उरी कुणाची वेदना स्फुरे ?</p>
<p>शिशिरामधल्या   <span id="6_TRN_1r">उत्तर्रात्त्री आकाश मनात झुरे </span> </p>
<p>Far away lonely star does a sign</p>
<p>amongst the veins of darkness whose pain flows</p>
<p>in the autumn nights sky&#8217;s heart reduces a little</p>
<h2>काळी कड सावलीची</h2>
<p>काळी कड सावलीची</p>
<p>प्रकाशाला शिवू बघे</p>
<p>आणि थिजल्या ओठांनी</p>
<p>पुन्हा काळोखांत रिघे</p>
<p>Can&#8217;t translate&#8230;Can&#8217;t..</p>
<p>Sideline of shadow</p>
<p>tries to catch a little bit of light</p>
<p>it retracts sharply</p>
<p>with scalded, seared lips</p>
<p>मातीत गाढ निजलेले</p>
<p> जरि बीज न नयनांना दिसते </p>
<p>घन वळता आषाढाचे</p>
<p>मज नवखी चाहूल येते  </p>
<p>&#8230;..</p>
<p>पाकळ्यात दडले तरीहि</p>
<p>गंधातून गूढ उकळते  </p>
<p>Invisible to eyes</p>
<p>a seed sleeps</p>
<p>till the full clouds of August</p>
<p>whisper a new life&#8230;.</p>
<p>Although hidden underneath</p>
<p>petals, redolance unveils a secret</p>
</div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/govindkanshi.wordpress.com/408/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/govindkanshi.wordpress.com/408/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/govindkanshi.wordpress.com/408/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/govindkanshi.wordpress.com/408/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/govindkanshi.wordpress.com/408/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/govindkanshi.wordpress.com/408/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/govindkanshi.wordpress.com/408/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/govindkanshi.wordpress.com/408/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/govindkanshi.wordpress.com/408/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/govindkanshi.wordpress.com/408/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/govindkanshi.wordpress.com/408/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/govindkanshi.wordpress.com/408/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/govindkanshi.wordpress.com/408/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/govindkanshi.wordpress.com/408/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=408&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://govindkanshi.wordpress.com/2010/11/19/an-afternoon-with-mangesh-padgaonkars-gypsy-collection-of-poems-1950-53/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/db06b5079c28bedc44c03860515c8e11?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">govindkanshi</media:title>
		</media:content>
	</item>
		<item>
		<title>Movie log 2009-2010</title>
		<link>http://govindkanshi.wordpress.com/2010/11/04/movie-log-2009-2010/</link>
		<comments>http://govindkanshi.wordpress.com/2010/11/04/movie-log-2009-2010/#comments</comments>
		<pubDate>Thu, 04 Nov 2010 06:44:34 +0000</pubDate>
		<dc:creator>govindkanshi</dc:creator>
				<category><![CDATA[Movies]]></category>

		<guid isPermaLink="false">http://govindkanshi.wordpress.com/2010/11/04/movie-log-2009-2010/</guid>
		<description><![CDATA[Thanks Prakash for lending awesome old movies- none of them are dated or aged. 1. Deliverance 1971 &#8211; Awesome photography, editing with well rounded gritty characters. Movie becomes the river it talks about, sometimes meandering and others like its rapids but finally arrested by the mighty dam. 2. Americanization of Emily 1964 &#8211; Wonderful chirpy [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=402&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Thanks Prakash for lending awesome old movies- none of them are dated or aged.</p>
<p>1. Deliverance 1971 &#8211; Awesome photography, editing with well rounded gritty characters. Movie becomes the river it talks about, sometimes meandering and others like its rapids but finally arrested by the mighty dam.</p>
<p>2. Americanization of Emily 1964 &#8211; Wonderful chirpy ode to life and peace.   Some of the best quotes are at &#8211; http://www.imdb.com/title/tt0057840/quotes. Movie is filled with witty dialogues. James Garner&#8217;s dog robber portrayal is one of the best performances in terms of holding together viewer interest.</p>
<p>3. There will be blood 2007 &#8211; Never has a man looked so menacing on the screen after Jack Nicholson. Nope I do not care much about masks and gory makeup, it is the chilling closeup of man projecting evil. Daniel day lewis just overshadows everything I have seen from Jack N. One of the best movies ever &#8211; right up there with Coen brother&#8217;s other work.  The cinematography captures the hopes and futility of being an oil enterprenuer.  Daniel&#8217;s performance will be benchmark for many generations to come.</p>
<p>Update &#8211; 2010-Nov 4 -</p>
<p>4. Aura 2006 - An Argentinean movie shot entirely in patagonia region has stunning visuals. Supposedly a 2nd and last movie by the director. It is very lucid in its flow and its shot of trees swaying in forest is pure poetry. Only thing jarring was somewhat muffled sound recording &#8211; was it deliberate &#8211; because there is sudden change in movie flow/sound when the epileptic character starts going into fits. Anyway movie has heist premise which is possible but  can it be pulled off in new conditions? Again story telling of higher quality than usual fare. Another way of joining various threads than flashback/non linear story telling. Acting by the characters too is very fluid -nothing seems forced/dramatic at anytime.</p>
<p>Fabiàn Bielinsky (director) had another wonderful movie &#8211; Nine queens (I guess remake &#8211; criminal was not such a good film though).</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/govindkanshi.wordpress.com/402/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/govindkanshi.wordpress.com/402/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/govindkanshi.wordpress.com/402/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/govindkanshi.wordpress.com/402/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/govindkanshi.wordpress.com/402/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/govindkanshi.wordpress.com/402/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/govindkanshi.wordpress.com/402/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/govindkanshi.wordpress.com/402/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/govindkanshi.wordpress.com/402/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/govindkanshi.wordpress.com/402/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/govindkanshi.wordpress.com/402/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/govindkanshi.wordpress.com/402/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/govindkanshi.wordpress.com/402/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/govindkanshi.wordpress.com/402/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=402&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://govindkanshi.wordpress.com/2010/11/04/movie-log-2009-2010/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/db06b5079c28bedc44c03860515c8e11?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">govindkanshi</media:title>
		</media:content>
	</item>
		<item>
		<title>Making sense of public org on facebook.</title>
		<link>http://govindkanshi.wordpress.com/2010/07/31/making-sense-of-public-org-on-facebook/</link>
		<comments>http://govindkanshi.wordpress.com/2010/07/31/making-sense-of-public-org-on-facebook/#comments</comments>
		<pubDate>Sat, 31 Jul 2010 03:13:47 +0000</pubDate>
		<dc:creator>govindkanshi</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[developer policy]]></category>
		<category><![CDATA[Entity extraction]]></category>
		<category><![CDATA[Facebook]]></category>

		<guid isPermaLink="false">http://govindkanshi.wordpress.com/?p=399</guid>
		<description><![CDATA[Simple task &#8211; Find out who Delhi traffic police has investigated and in how much time. Where is the information &#8211; Right now on FB. How can one get it It should be simple, it is public data. You just get all the posts of Delhi traffic police spend few minutes in formulating a regular [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=399&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Simple task &#8211; Find out who Delhi traffic police has investigated and in how much time.</p>
<p>Where is the information &#8211; Right now on FB.</p>
<p>How can one get it</p>
<p>It should be simple, it is public data. You just get all the posts of Delhi traffic police spend few minutes in formulating a regular expression to extract the number plates. And follow it up by co-relating it to time it was reported and time it was &#8220;posted as action taken&#8221;. You can do fancier things by extracting entities and posting them back on map. But to start with access to reports and corresponding action taken post will be enough.</p>
<p>Challenge &#8211; FB&#8217;s dev policy unfortunately does not allow it <img src='http://s0.wp.com/wp-includes/images/smilies/icon_sad.gif' alt=':(' class='wp-smiley' />  or at least I could not make enough sense of the policy doc. I could not get a response from them and waited for reasonable time  before posting it.</p>
<p>Conclusion</p>
<p>Twitter is better friend here.</p>
<p>Search firms should be allowed to gather posts of &#8220;recognized&#8221; public  entities &#8211; in this case delhi traffic police. There need not be any access to its network of people who are &#8220;friends&#8221; with it or &#8220;like it&#8221;.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/govindkanshi.wordpress.com/399/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/govindkanshi.wordpress.com/399/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/govindkanshi.wordpress.com/399/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/govindkanshi.wordpress.com/399/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/govindkanshi.wordpress.com/399/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/govindkanshi.wordpress.com/399/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/govindkanshi.wordpress.com/399/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/govindkanshi.wordpress.com/399/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/govindkanshi.wordpress.com/399/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/govindkanshi.wordpress.com/399/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/govindkanshi.wordpress.com/399/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/govindkanshi.wordpress.com/399/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/govindkanshi.wordpress.com/399/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/govindkanshi.wordpress.com/399/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=399&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://govindkanshi.wordpress.com/2010/07/31/making-sense-of-public-org-on-facebook/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/db06b5079c28bedc44c03860515c8e11?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">govindkanshi</media:title>
		</media:content>
	</item>
		<item>
		<title>RTI (KILL OR ATTACK) &#8211; Cluster search results? Google Predict/Apache Mahout?</title>
		<link>http://govindkanshi.wordpress.com/2010/07/21/rti-kill-or-attack-cluster-search-results-google-predictapache-mahout/</link>
		<comments>http://govindkanshi.wordpress.com/2010/07/21/rti-kill-or-attack-cluster-search-results-google-predictapache-mahout/#comments</comments>
		<pubDate>Wed, 21 Jul 2010 07:42:42 +0000</pubDate>
		<dc:creator>govindkanshi</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[text analysis]]></category>
		<category><![CDATA[RTI]]></category>
		<category><![CDATA[search engine summation]]></category>

		<guid isPermaLink="false">http://govindkanshi.wordpress.com/?p=394</guid>
		<description><![CDATA[Heard another RTI activist was shot in Ahmedabad. Wanted to lookup # of people killed/attacked for RTI activity. Unfortunately lot of results talk about same incident. I am interested in unique incidents, I achieved the same by changing the date range &#8211; a kludge.  Crawling should not be the only thing search engines should do, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=394&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Heard another RTI activist was shot in Ahmedabad. Wanted to lookup # of people killed/attacked for RTI activity. Unfortunately lot of results talk about same incident. I am interested in unique incidents, I achieved the same by changing the date range &#8211; a kludge.  Crawling should not be the only thing search engines should do, they have enough information to cluster results into a &#8220;bunch&#8221; which refer to same incident.</p>
<p>I can see the challenge though where an article refers to older incident thus affecting the ability to cluster. Another challenge would be an article summarizing the attacks on RTI applicants over years but these will be very few anyway.</p>
<p>Apparently google predict account/apache mahout should enable this. What is involved?</p>
<p>1. Identifying groups/clusters</p>
<p>2. Cluster identification on common name, date, location (extract date of crawl, date of publishing of article, source, name of person involved in attack or death. Name extraction can be accomplished by phrase analysis of sentence mentioning the two words nearby each other and looking up subject. Similar thing that needs to be done in Delhi traffice police tweets. Location extraction also is part of earlier process. If trusted sources provide the data &#8211; a pattern can be assumed &#8211; Location. Date. &#8220;Phrase describing the incident&#8221; &#8211; followed by lot of extraneous information.</p>
<p>3. Disambiguating based on exclusive notifcation rather than summary/article on scores of similar incidents</p>
<p>Trusted source information &#8211; rather than looking at all results &#8211; search should be directed across 2-3 places and information extracted from them. reuters/bbc/pti/xinhua.</p>
<p>Must mention &#8211; YQL/Yahoo pipes are like dynamic languages &#8211; instant gratification and visible work.</p>
<p>BTW must mention &#8211; youtube&#8217;s &#8220;featured video&#8221; suggests &#8220;cannibalism video from bbc&#8221; while watching videos on Vitthal (God).  Could not explain to mom &#8211; exactly what was the science behind that one <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> .</p>
<div id="attachment_397" class="wp-caption alignleft" style="width: 310px"><a href="http://govindkanshi.files.wordpress.com/2010/07/broken-featurevideo-youtube.png"><img class="size-medium wp-image-397" title="broken-featurevideo-youtube" src="http://govindkanshi.files.wordpress.com/2010/07/broken-featurevideo-youtube.png?w=300&#038;h=244" alt="cannibalism video - featured while watching god's video" width="300" height="244" /></a><p class="wp-caption-text">Screwed up &quot;featured video&quot;</p></div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/govindkanshi.wordpress.com/394/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/govindkanshi.wordpress.com/394/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/govindkanshi.wordpress.com/394/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/govindkanshi.wordpress.com/394/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/govindkanshi.wordpress.com/394/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/govindkanshi.wordpress.com/394/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/govindkanshi.wordpress.com/394/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/govindkanshi.wordpress.com/394/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/govindkanshi.wordpress.com/394/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/govindkanshi.wordpress.com/394/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/govindkanshi.wordpress.com/394/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/govindkanshi.wordpress.com/394/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/govindkanshi.wordpress.com/394/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/govindkanshi.wordpress.com/394/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=394&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://govindkanshi.wordpress.com/2010/07/21/rti-kill-or-attack-cluster-search-results-google-predictapache-mahout/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/db06b5079c28bedc44c03860515c8e11?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">govindkanshi</media:title>
		</media:content>

		<media:content url="http://govindkanshi.files.wordpress.com/2010/07/broken-featurevideo-youtube.png?w=300" medium="image">
			<media:title type="html">broken-featurevideo-youtube</media:title>
		</media:content>
	</item>
		<item>
		<title>Delhi traffic police tweet analysis</title>
		<link>http://govindkanshi.wordpress.com/2010/07/15/delhi-traffic-police-tweet-analysis/</link>
		<comments>http://govindkanshi.wordpress.com/2010/07/15/delhi-traffic-police-tweet-analysis/#comments</comments>
		<pubDate>Thu, 15 Jul 2010 14:02:30 +0000</pubDate>
		<dc:creator>govindkanshi</dc:creator>
				<category><![CDATA[text analysis]]></category>
		<category><![CDATA[tweet analysis]]></category>
		<category><![CDATA[Delhi traffic police]]></category>

		<guid isPermaLink="false">http://govindkanshi.wordpress.com/?p=389</guid>
		<description><![CDATA[Unfortunately there is no normalized data for locations, places, landmarks. This prevents gettin the geodata for poi and plotting them on map. Thus giving better picture of accident/prone, maximum diversion locations across temporal extract. Results can be little better with analysis of sentence like verb(obstruction/diversion) at location. One can also find out when things cleared [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=389&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Unfortunately there is no normalized data for locations, places, landmarks. This prevents gettin the geodata for poi and plotting them on map. Thus giving better picture of accident/prone, maximum diversion locations across temporal extract. Results can be little better with analysis of sentence like verb(obstruction/diversion) at location. One can also find out when things cleared and look at average time taken to clear up incidents.</p>
<p><strong>Words which come together :</strong></p>
<p>Traffic obstruction; broken down.; may contact; specific suggestions/complaints; broke down.; area specific; Raja Garden; Range</p>
<p>having&#8230;; DHAULA KUAN; red light; truck broke; concerned DCP/Traffic; given below:&#8230;; DTC bus; one truck; HTV broke; bus broke; flyover towards; website http://delhitrafficpolice.nic.in; Traffic heavy;  floor Bus; towards Gurgaon; towards Ashram; 1700 Hrs; Akshardham mandir; Budha Jayanti; CHOWKI NO.; G.C. Dwivedi; Jal Board</p>
<p><strong>Word frequency</strong></p>
<p>traffic                   -&gt;                       157<br />
obstruction               -&gt;                       122<br />
towards                   -&gt;                        99<br />
down.                     -&gt;                        63<br />
broken                    -&gt;                        46<br />
truck                     -&gt;                        44<br />
broke                     -&gt;                        42<br />
there                     -&gt;                        31<br />
flyover                   -&gt;                        30<br />
heavy                     -&gt;                        17<br />
nagar                     -&gt;                        12<br />
chowk                     -&gt;                        11<br />
contact                   -&gt;                        11<br />
garden                    -&gt;                        11<br />
specific                  -&gt;                        11<br />
suggestions/complaints,   -&gt;                        11<br />
container                 -&gt;                        10<br />
dhaula                    -&gt;                        10<br />
wazirabad                 -&gt;                        10<br />
ashram                    -&gt;                         9<br />
delhi                     -&gt;                         9<br />
vihar                     -&gt;                         9<br />
naraina                   -&gt;                         8<br />
cleared                   -&gt;                         7<br />
gurgaon                   -&gt;                         7<br />
hospital                  -&gt;                         7<br />
airport                   -&gt;                         6<br />
between                   -&gt;                         6<br />
having&#8230;                 -&gt;                         6<br />
mayapuri                  -&gt;                         6<br />
range                     -&gt;                         6<br />
staff                     -&gt;                         6<br />
aiims                     -&gt;                         5<br />
below:&#8230;                 -&gt;                         5<br />
break                     -&gt;                         5<br />
cleared.                  -&gt;                         5<br />
concerned                 -&gt;                         5<br />
dcp/traffic               -&gt;                         5<br />
e-mail                    -&gt;                         5<br />
flyover.                  -&gt;                         5<br />
given                     -&gt;                         5<br />
lajpat                    -&gt;                         5<br />
light                     -&gt;                         5</p>
<p><strong>Tweets with &#8220;red light&#8221; (shows I need to use new api to get time and complete text with help of tweet id) &#8211; 7 matches<br />
</strong></p>
<p>has been cleared at Kalindi kunj red light   Traffic obstruction at kalandi kun<br />
ffic obstruction at kalandi kunj red light due to one truck broken down   Traff<br />
rk    Traffic heavy at Wazirabad red light from Burari towards ISBT due to heav<br />
ffic obstruction at SHASTRI PARK red light due to a truck broken down  http //b<br />
J  Traffic obstruction at ASHRAM RED light due to a LGV broke down  http //bit<br />
both carriageway at Subroto park red light  http //bit ly/d8Wilc  Traffic jam a<br />
Rohini West Metro station due to red light point fault  http //bit ly/c0fXIg  T</p>
<p><strong> </strong><strong>Tweets with &#8220;obstruction&#8221; </strong>25 of 122 matches:<br />
rgaon  has been removed   Traffic obstruction at MADHUBAN CHOWK TOWARDS PEERA G<br />
GAS TANKER broken down   Traffic obstruction between naraina flyover to mayapu<br />
ue to a HTV broken down   Traffic obstruction at Subroto Park towards Gurgaon a<br />
crane has been directed   Traffic obstruction at Ladosarai T-point towards Khan<br />
oved traffic normalized   Traffic obstruction at Ladosarai T-point towards Khan<br />
due to a tree uprooted   Traffic obstruction from Naraina to mayapuri has been<br />
oved traffic normalized   Traffic obstruction at Majnu ka tilla towards Wazirab<br />
irabad  has been removed  Traffic obstruction from Naraina to mayapuri due to a<br />
a container broken down   Traffic obstruction at Majnu ka tilla towards Wazirab<br />
a container broken down   Traffic obstruction has been cleared at Dhaula Kuan<br />
n  21 19 PM 07-July-2010  Traffic obstruction at RTR flyover underpass towards<br />
irport has been cleared   Traffic obstruction at  RTR vacant vihar  IIT towards<br />
rport  has been cleared   Traffic obstruction at savitri flyover towards Nehru<br />
u place has been removed  Traffic obstruction at RTR flyover underpass towards<br />
damper has broken down   Traffic obstruction at  RTR vacant vihar  IIT towards<br />
damper has broken down   Traffic obstruction at savitri flyover towards Nehru<br />
ntainer has broken down   Traffic obstruction at Birtanya choke towards Punjabi<br />
ue to one truck has broken down   obstruction has been cleared at Wazirabad fly<br />
ed at Wazirabad flyover   Traffic obstruction at Wazerabad flyover due to one t<br />
e truck has broken down   Traffic obstruction at Raja Garden chowk from Tilak n<br />
Tanker has broken down   Traffic obstruction has been cleared at Kalindi kunj<br />
Kalindi kunj red light   Traffic obstruction at kalandi kunj red light due to<br />
over construction work    Traffic obstruction at NAGIA PARK  CHOWKI NO  2 SHAKT</p>
<p>Challenge is normalized names (no spelling differences etc) and then ability to pull their geolocation from one of the apis(mostly google) and plot them back with heat map kind of thing to point out bottlneck across a day.</p>
<p><strong>Looks like Naraina, Dhaula kuan, RTR flyover,Wazirabad red light, Kalindi kunj red light ,Raja Garden,NAGIA PARK are choke points. </strong>More detailed analysis to follow on another day with locations and kind of obstruction, accidents and hopefully geolocations on map.</p>
<p>Finally must get out the blogger crawler for solr <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  what else. And also finish getting data about dtp&#8217;s fb postings of people who are booked and for hopefully for what offence. This data can be far easily posted by delhi police themselves in form of odata for future usage rather than just fb posts or tweets.</p>
<p>Overall Delhi police must be congratulated for following up on citizen requests and posting action back to fb. Nothing expensive stuff to maintain <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  or buy except mobile phones equipped with fb/twitter app. Sure there is some nic app which is doing the storage of offenders etc. But interaction through twitter/fb is outright simple.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/govindkanshi.wordpress.com/389/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/govindkanshi.wordpress.com/389/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/govindkanshi.wordpress.com/389/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/govindkanshi.wordpress.com/389/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/govindkanshi.wordpress.com/389/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/govindkanshi.wordpress.com/389/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/govindkanshi.wordpress.com/389/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/govindkanshi.wordpress.com/389/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/govindkanshi.wordpress.com/389/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/govindkanshi.wordpress.com/389/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/govindkanshi.wordpress.com/389/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/govindkanshi.wordpress.com/389/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/govindkanshi.wordpress.com/389/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/govindkanshi.wordpress.com/389/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=389&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://govindkanshi.wordpress.com/2010/07/15/delhi-traffic-police-tweet-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/db06b5079c28bedc44c03860515c8e11?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">govindkanshi</media:title>
		</media:content>
	</item>
		<item>
		<title>Irreverant post of irreverant task &#8211; tweet analysis</title>
		<link>http://govindkanshi.wordpress.com/2010/07/15/irreverant-post-of-irreverant-task-tweet-analysis/</link>
		<comments>http://govindkanshi.wordpress.com/2010/07/15/irreverant-post-of-irreverant-task-tweet-analysis/#comments</comments>
		<pubDate>Thu, 15 Jul 2010 13:30:27 +0000</pubDate>
		<dc:creator>govindkanshi</dc:creator>
				<category><![CDATA[python]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[nltk]]></category>
		<category><![CDATA[tweet analysis]]></category>

		<guid isPermaLink="false">http://govindkanshi.wordpress.com/?p=385</guid>
		<description><![CDATA[This post was planned at the end of last year with some weekend hours on nltk and intention of comparing with Lingpipe/Gate. Unfortunately the latter exercise never finished. In general I have gone ahead and cleaned up some adjectives, articles, preposition etc which do not add any value in analysis. Idea was to see if [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=385&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This post was planned at the end of last year with some weekend hours on nltk and intention of comparing with Lingpipe/Gate. Unfortunately the latter exercise never finished.</p>
<p>In general I have gone ahead and cleaned up some adjectives, articles, preposition etc which do not add any value in analysis. Idea was to see if it is possible to find out more about a person from tweets and. This was done at end of 2009 but got around to writing an analysis just now.</p>
<p><strong>What are the pair of words which get most mention?</strong></p>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="182" valign="top"><strong>Tweetid</strong></td>
<td width="776" valign="top"><strong>Word pair</strong></td>
</tr>
<tr>
<td width="182" valign="top">Gulpanag <sup>&amp;</sup></td>
<td width="776" valign="top">please visit; visit   http://gulpanag.net; Col Shamsher; thank you:; happy birthday; Sub Himalayan;   Shamsher Singh; Whatever happened; welcome aboard; long term; highly   motivated; #skoda #yeti; cotton farmers; district #Fatehgarhsaheb; films   please; #ndtvtechlife awards; sub committee; right wing; Looking forward;   last night;</p>
<p>#mulitpolar #balanceofpower; &#8216;Chad   Gaya; @pavanduggal: #indiancyberlaw; @viveksmisra HTC; Berlin wall; Gaya   upar; Hindi-vear; Hot Tub; Rajya Sabha</td>
</tr>
<tr>
<td width="182" valign="top">Iamsrk <sup>*</sup></td>
<td width="776" valign="top">insha allah; dont know; looking   forward; red chillies; feel awful; low fat; human being.; take care; six   pack; sleep now.; really nice.; access version; good enuff.; happy birthday;   really nice; many things; keep awake; mani sirs; medicinal properties;   pradeep dhoot; push ups</td>
</tr>
<tr>
<td width="182" valign="top">Shashitharoor</td>
<td width="776" valign="top">Lok Sabha; Rajya Sabha; Chandran   Tharoor; copy jacob@tharoor.in; 140 characters; Fareed Zakaria; last night;   old friend; don&#8217;t know; Irfan Pathan; Passport Seva; yrs ago; Shihab Thangal;   Foreign Minister; Indian community; Rashtrapati Bhavan; Seva Kendras; next   year; Kapil Sibal; Indira Gandhi; Excellent mtg; Indian Ocean; haven&#8217;t seen;</p>
<p>latest story; Tharoor Foundation; long   day; roster duty; Youth Congress; pls write; foreign policy</td>
</tr>
<tr>
<td width="182" valign="top">Virsanghvi <sup>#</sup></td>
<td width="776" valign="top">via @addthis; Many thanks; Rude Food;   @addthis Rude; @addthis Counterpoint; Bal Thackeray; Parallax View; couldn&#8217;t   agree; Vir Sanghvi:;; Manu Sharma; Shiv Sena; Raj Thackeray;Amar Singh; agree   entirely; good one; foreign policy; @addthis Pursuits; Discovery Travel; Buck   Stops; don&#8217;t think; next time; Fair enough.; Air India; Shashi Tharoor;   Indira Gandhi; Custom Made; much. Glad; upset stomachs; good idea.</td>
</tr>
<tr>
<td width="182" valign="top">Sardesairajdeep<sup>~</sup></td>
<td width="776" valign="top">cnn ibn; write in.; shiv sena; amar   singh; don’t miss; per cent; karan thapar; devils advocate; shashi tharoor;   aam admi; sri lanka; lalit modi; news docu; ibn lokmat; prime time; real   heroes.; three idiots; womens bill; big story:; must watch; looks like;   breaking news:; big story; must confess; indo-pak talks; udhav thackeray;   mani shankar; phone tapping</td>
</tr>
<tr>
<td width="182" valign="top">Rajivmakhni</td>
<td width="776" valign="top">gadget guru; Kareena Kapoor; ndtv 24&#215;7;   Steve Waugh; Xperia X10; ndtv profit; iPhone 3GS; notion ink; size zero; next   week; Gadget Guru; much appreciated; NDTV 24&#215;7.; Windows Phone; #ndtvtechlife   awards; number portability; sea link; NDTV 24&#215;7; Newsnet today; quick review;   @ankitv @abhishektelang; proud moment; ink adam; awards jury; ndtv india;   newsnet today; Apple iPad; 100 invites; @RahulDX @ankitv; super thin</td>
</tr>
<tr>
<td width="182" valign="top">Lintool</td>
<td width="776" valign="top">#hadoop #mapreduce; Maryland:   http://bit.ly/caJi3h; looking forward; @brandynwhite talking; compute vision;</p>
<p>#Hadoop #MapReduce; vision   applications; language processing; natural language; object creation; #hadoop   #MapReduce; #mapreduce HUG; &#8220;explode&#8221; mapper; @cloudera @philz42;   Marginal Relevance; Maximal Marginal; [pwsim algo]; interactive Maximal; key space;   scale natural; 8GB input; FAST 2010; large scale; @kevinweil</p>
<p>@abdur; input key; seem like; #Hadoop   #MapReduce:; @deliprao @ChrisDiehl; it&#8217;s like</td>
</tr>
<tr>
<td width="182" valign="top">Jboner<sup>$</sup></td>
<td width="776" valign="top">Looking forward; Scala Days; available:   <a href="http://bit.ly/9HzbUn">http://bit.ly/9HzbUn</a>; &#8216;receive&#8217; method.;   @RayRoestenburg: Blogged; Message Routing:; Patterns&#8217;: http://is.gd/c5itt;   Routing: Part; art: http://bit.ly/civmvk; highly concurrent; main diff; last   night; Great talk; Actor&#8217;s &#8216;receive&#8217;; Actor&#8217;s logic; Stability Patterns&#8217;:;   That&#8217;s true.; @remeniuk Akka; Akka  HotSwap;   real state; @pavlobaron @mknittig; @ManoMarks thanks; #Akka Message; ;   @djspiewak Multiverse; good idea.; much better; would love</td>
</tr>
<tr>
<td width="182" valign="top">Tunkuv</td>
<td width="776" valign="top">Carl Levin; David Frum; Daily Beast;   Charles Leerhsen; Hurt Locker; Parties <a href="http://www.thedailybeast.com/blogs-and-stories/2010-04-15/at-">http://www.thedailybeast.com/blogs-and-stories/2010-04-15/at-</a>last-the-truth-about-tea-partiers/;   Tea Parties; all-volunteer army;</p>
<p>Paul Stevens; dear old; foreign policy;   illegal immigration; turns readers; vituperative morons; online medium; John   Batchelor; New York; Milton Friedman; doesn&#8217;t mean; #healthcare reform; John   Paul; can&#8217;t decide; Show tonight</td>
</tr>
<tr>
<td width="182" valign="top">Abdur<sup>%</sup></td>
<td width="776" valign="top">Red Dwarf; Anyone else; Looking   forward; next time; pretty cool; Happy Birthday; Indian food; new years;   pretty sure; taxi magic; Pretty cool; sit next; cant wait; late night; Jack   Falstaff; Steve Blank; Stinson Beach; Web 2.0; fresh wasabi; pie chart; last   night; San Francisco; Wine tasting; alarm clock; dim sum; kitty hospital;   Great dinner; screwing us.; rental cars;</td>
</tr>
</tbody>
</table>
<p>&amp;removed- SJOBA Sub</p>
<p>*removed &#8211; kwon doh; lahe wah; minal lahe; nasrun minal; taekwon; wah fatahun; artemis fowl; dash hulk; hulk kai;</p>
<p>#removed &#8211; @addthis Parallax</p>
<p>~removed &#8211; cnn ibn:; cnn ibn.;</p>
<p>%removed-&#8221;The Art</p>
<p>$removed- I&#8217;ll add; I&#8217;m sure; blog post</p>
<p><strong>Strawman Analysis :</strong></p>
<p>Abdur/Lintool sort of belong to one extreme of folks analyzed. They could not be more different. Abdur – coming from twitter-posts very little information processing/analysis stuff and it is mostly about food, wine and personal stuff. Lintool clearly comes out as the hadoop-mapreduce man. Abdur also comes out harassed by united, taxi.</p>
<p>Jboner undoubtedly provides credential to his scala /akka background.  He is passionate about akka, scala and his friends @debashishg, @viktorlang, @sbtourist.  Akka goes nicely with scalability, availability, actor, concurreny.  He also points out lot of presentations and slide decks in general.</p>
<p>Tunkuv comes out swinging as conservative who is hooked to cricket and still has patience to go through cricinfo and blast them for shoddy writing.  Being a conservative he also has views on the healthcare and policies of the us administration. Off course tea party is the slogan of the opposition so makes an appearance.</p>
<p>Rajiv Makhni is no doubt “the gadget man”. He is also part of the media crowd who put “breadcrumbs” to their tv shows, writings or in general fellow folks. There is lot of link back to parent firm – NDTV and its various shows that is.  He pretty much looks obsessed with phones, devices like ipad/tablet, dimensions and features.  Good thing about Rajiv is that he does most of the hard work and in general not immediately vowed by apple, Microsoft. His latest coverage about “other” brands of phones is genuine attempt to provide voice to alternative (micromax).</p>
<p>Rajdeep Sardesai too talks about his channel, upcoming show(requests to watch with timings and all)  and then some comments on current happenings across amar singh, thackreys, shashitharoor(man did the media guys milk him?). He also points out Sagarika’s articles from time to time. This is apparent in urls that he points too. I have not shared them here but extraction of urls and crawl to them provided the base for that conclusion.</p>
<p>Virsanghvi is unique. Virsanghvi has hands full with pointers to his articles via @addthis. He does cover wider array of topics from Padma Bhushan fiasco , headley, china/Pakistan and love of thackrey/Pawar family. I wished I could do sentiment analysis of those tweets most of which are full of sarcasm.  He made it a point during  Tharoor gate to pin Tharoor followers with his unique insights on folly of being Tharoor and his followers. He followed that with one page writeup in of the magazines basically saying Tharoor was pretty much product of vanity. He also made it personal wrt Mumbai vs Bombay and gave two hoots to the fact that he or his sort who speak French or Italian in 2 day visit that they go to those countries but consider it below themselves to learn a local language(may be he will pick up a cue from Aamir Khan who is learning it or Rajnikant who learnt/mastered local language). In case of food  HOMP on “ndtv good times” guys take his happiness in terms of genuine freshness.</p>
<p>ShashiTharoor – the man who was brought down by twitter unfortunately was a judicious user of tweeter. He posted his meetings, addresses and visits to his constituency, state on tweets. One can get a glimpse of his work just through his tweets. Man was transparent. He did have cricket on his radar though. Unlike media folks he did not pre-judge and pass comments or forwarded to his blog/writing etc. Only time he opened about personal life was about his son and his achievements just like any other father would. His ministerial work too gets prominence though he did keep his boundary wrt to foreign affair. One can clearly see man had India on his mind(this is without stemming so could be really lot more).  He too comes out pretty savvy re-tweeter. But his focus is clear – India, Kerala, meetings, visits and foreign affairs.  Cricket does popup but just engulfed by all the meetings, briefings and mentions about generally issues that is tackling or good things he sees.</p>
<p>Shahrukh comes out as genuine person who wants to connect with his fans and shares tips to youngsters about good/bad habits and need to focus on work. He is extremely dedicated to his kids and family and that comes out pretty strongly. His faith also makes appearance through his invocation of insha allah phrase(basically as/if god wishes). His sleep pattern makes an appearance and one tweet will be about the need of sleep in early morning when he goes to sleep. Exercise, healthy food and fitness make way on the tweets at times. His firm red chillies, his team kkr make brief seasonal appearance. He is also the only one who dwells on his past – mother and father – again re-affirming his attachments.  His language also comes out pretty dominated with verbs and action reflecting his persona? Shahrukh as mentioned earlier comes out swinging for family(kids), friends(karan), cities(Mumbai, Delhi) and his present  disposition (Kolkata knight riders or movie ra.one).</p>
<p>Gulpanag comes across as refreshing change with interest in riding, biking, automobiles and across issues of the time. This ranges from “army scam”, “farmer plight”, “girl child”, eco friendliness and in general fitness (running,bike).</p>
<p>There is gentleman whose tweet analysis I have not done in detail here. But Pankaj Pachauri comes across as genuine person with nuanced interests across varied subjects. His tweets and his programmes are both high quality. He is very few of the journalist who presents facts and analyzes situation of price increase, local manufacturing,  Africa mining, gas price increase, education cost and relation to enterprenuer ship. But unfortunately as it would happen his shows are mistimed and like the regional newspapers and good folks there he is ignored.</p>
<p>The word collocation as the above exercise is called can be easily combined with actual word frequency across tweets. In this case I have not taken trouble to normalize the data with synonyms, homonyms or polynyms. I just intersected across corpus of wordnet and brown. There is slight difference between frequencies of regular text tokenization and custom tokenization. For our purpose we will ignore regular text tokenization.  Name and place extraction via alchemy, evri did not work very well, nor did sentiment analysis for singular tweets. The other challenge was lot of new “sms/tweeter” words like hv, 4, lrnt, mv etc. These words need to be in dictionary replaced with actual words. Another idea would be &#8211; do common factor search across a bunch of folks (via # tags first and then other simpler things)</p>
<p>NLTK is useful kit to do quick 2 hrs work. It has its quirks and I have not exploited its whole NLP side of the  world. One of the days J.</p>
<p>Tweets downloaded in Dec 2009 and appended in May 2010.</p>
<p>Twitter api &#8211; <a href="http://code.google.com/p/python-twitter/">http://code.google.com/p/python-twitter/</a> (python 2.6)</p>
<p><strong>How many times word was tweeted (rough and not complete for space)</strong></p>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="479" valign="top">Tweet Id</td>
<td width="479" valign="top">Word frequency</td>
</tr>
<tr>
<td width="479" valign="top">Gulpanag</td>
<td width="479" valign="top">@gulpanag                 -&gt;                        39</p>
<p>please                    -&gt;                        38</p>
<p>http://gulpanag.net       -&gt;                        37</p>
<p>don&#8217;t                     -&gt;                        35</p>
<p>about                     -&gt;                        32</p>
<p>visit                     -&gt;                        30</p>
<p>think                     -&gt;                        26</p>
<p>being                     -&gt;                        25</p>
<p>should                    -&gt;                        25</p>
<p>india                     -&gt;                        24</p>
<p>check                     -&gt;                        22</p>
<p>thanks                    -&gt;                        44</p>
<p>women                     -&gt;                        21</p>
<p>twitter                   -&gt;                        18</p>
<p>world                     -&gt;                        17</p>
<p>always                    -&gt;                        16</p>
<p>happy                     -&gt;                        16</p>
<p>welcome                   -&gt;                        16</p>
<p>great                     -&gt;                        15</p>
<p>lol!!                     -&gt;                        14</p>
<p>films                     -&gt;                        13</p>
<p>indian                    -&gt;                        13</p>
<p>bombay                    -&gt;                        12</p>
<p>chandigarh                -&gt;                        12</p>
<p>#fail                     -&gt;                        11</p>
<p>#fatehgarhsaheb           -&gt;                        11</p>
<p>#iphone                   -&gt;                        10</p>
<p>#punjab                   -&gt;                        10</p>
<p>phone                     -&gt;                        10</p>
<p>really                    -&gt;                        10</p>
<p>riding                    -&gt;                        10</td>
</tr>
<tr>
<td width="479" valign="top">Iamsrk</td>
<td width="479" valign="top">kids                      -&gt;                        101</p>
<p>thanx                     -&gt;                        70</p>
<p>life                      -&gt;                        69</p>
<p>think                     -&gt;                        69</p>
<p>sleep                     -&gt;                        61</p>
<p>should                    -&gt;                        53</p>
<p>great                     -&gt;                        48</p>
<p>being                     -&gt;                        47</p>
<p>really                    -&gt;                        45</p>
<p>happy                     -&gt;                        42</p>
<p>films                     -&gt;                        36</p>
<p>world                     -&gt;                        36</p>
<p>write                     -&gt;                        36</p>
<p>always                    -&gt;                        35</p>
<p>ra.one                    -&gt;                        35</p>
<p>shoot                     -&gt;                        35</p>
<p>people                    -&gt;                        31</p>
<p>start                     -&gt;                        31</p>
<p>everyone                  -&gt;                        30</p>
<p>never                     -&gt;                        30</p>
<p>still                     -&gt;                        29</p>
<p>better                    -&gt;                        26</p>
<p>friend                    -&gt;                        26</p>
<p>without                   -&gt;                        25</p>
<p>friends                   -&gt;                        24</p>
<p>later                     -&gt;                        23</p>
<p>reading                   -&gt;                        23</p>
<p>thought                   -&gt;                        23</p>
<p>doing                     -&gt;                        22</p>
<p>sometimes                 -&gt;                        22</p>
<p>time.                     -&gt;                        22</p>
<p>believe                   -&gt;                        21</p>
<p>film.                     -&gt;                        21</p>
<p>karan                     -&gt;                        21</p>
<p>looking                   -&gt;                        21</p>
<p>@kkriders                 -&gt;                        20</p>
<p>family                    -&gt;                        20</p>
<p>kolkata                   -&gt;                        20</p>
<p>mumbai                    -&gt;                        20</p>
<p>didnt                     -&gt;                        19</p>
<p>please                    -&gt;                        19</p>
<p>sleep.                    -&gt;                        19</p>
<p>sorry                     -&gt;                        19</p>
<p>them.                     -&gt;                        19</p>
<p>things                    -&gt;                        19</p>
<p>anyone                    -&gt;                        18</p>
<p>insha                     -&gt;                        18</p>
<p>thing                     -&gt;                        18</p>
<p>today                     -&gt;                        18</p>
<p>twitter                   -&gt;                        18</p>
<p>allah                     -&gt;                        17</p>
<p>early                     -&gt;                        17</p>
<p>makes                     -&gt;                        17</p>
<p>match                     -&gt;                        17</p>
<p>right                     -&gt;                        17</p>
<p>trying                    -&gt;                        17</p>
<p>watch                     -&gt;                        17</p>
<p>which                     -&gt;                        17</td>
</tr>
<tr>
<td width="479" valign="top">ShashiTharoor</td>
<td width="479" valign="top">indian                    -&gt;                       124</p>
<p>india                     -&gt;                       164</p>
<p>kerala                    -&gt;                       113</p>
<p>@shashitharoor            -&gt;                        82</p>
<p>delhi                     -&gt;                        64</p>
<p>great                     -&gt;                        64</p>
<p>tweet                     -&gt;                        60</p>
<p>visit                     -&gt;                        55</p>
<p>addressed                 -&gt;                        46</p>
<p>minister                  -&gt;                        45</p>
<p>write                     -&gt;                        44</p>
<p>dinner                    -&gt;                        43</p>
<p>foreign                   -&gt;                        43</p>
<p>always                    -&gt;                        42</p>
<p>people                    -&gt;                        42</p>
<p>world                     -&gt;                        41</p>
<p>today                     -&gt;                        40</p>
<p>cricket                   -&gt;                        39</p>
<p>public                    -&gt;                        37</p>
<p>state                     -&gt;                        36</p>
<p>sorry,                    -&gt;                        35</p>
<p>spoke                     -&gt;                        35</p>
<p>official                  -&gt;                        34</p>
<p>speech                    -&gt;                        32</p>
<p>congress                  -&gt;                        29</p>
<p>excellent                 -&gt;                        29</p>
<p>lunch                     -&gt;                        28</p>
<p>visited                   -&gt;                        27</p>
<p>office@tharoor.in         -&gt;                        26</td>
</tr>
<tr>
<td width="479" valign="top">VirSanghvi</td>
<td width="479" valign="top">thanks                    -&gt;                       349</p>
<p>@addthis                  -&gt;                       133</p>
<p>think                     -&gt;                        69</p>
<p>agree                     -&gt;                        64</p>
<p>indian                    -&gt;                        45</p>
<p>counterpoint              -&gt;                        43</p>
<p>india                     -&gt;                        43</p>
<p>liked                     -&gt;                        36</p>
<p>thackeray                 -&gt;                        30</p>
<p>point                     -&gt;                        24</p>
<p>enjoyed                   -&gt;                        23</p>
<p>guess                     -&gt;                        23</p>
<p>media                     -&gt;                        23</p>
<p>really                    -&gt;                        22</p>
<p>against                   -&gt;                        21</p>
<p>pakistan                  -&gt;                        21</p>
<p>still                     -&gt;                        19</p>
<p>@pritishnandy             -&gt;                        18</p>
<p>always                    -&gt;                        18</p>
<p>pilots                    -&gt;                        18</p>
<p>@vinkaycee                -&gt;                        17</p>
<p>delhi                     -&gt;                        17</p>
<p>times                     -&gt;                        17</p>
<p>legal                     -&gt;                        16</p>
<p>parallax                  -&gt;                        16</p>
<p>sorry                     -&gt;                        16</p>
<p>26/11                     -&gt;                        15</p>
<p>@thyagu2009               -&gt;                        15</p>
<p>bombay                    -&gt;                        15</p>
<p>fight                     -&gt;                        15</p>
<p>india&#8217;s                   -&gt;                        15</p>
<p>support                   -&gt;                        15</p>
<p>state                     -&gt;                        14</p>
<p>watching                  -&gt;                        14</p>
<p>@gulpanag                 -&gt;                        13</p>
<p>entirely                  -&gt;                        13</p>
<p>foreign                   -&gt;                        13</p>
<p>government                -&gt;                        13</p>
<p>great                     -&gt;                        13</p>
<p>police                    -&gt;                        13</p>
<p>politicians               -&gt;                        13</p>
<p>absolutely                -&gt;                        12</p>
<p>action                    -&gt;                        12</p>
<p>anyone                    -&gt;                        12</p>
<p>certainly                 -&gt;                        12</p>
<p>channels                  -&gt;                        12</p>
<p>headley                   -&gt;                        12</p>
<p>padma                     -&gt;                        12</p>
<p>problem                   -&gt;                        12</p>
<p>public                    -&gt;                        12</p>
<p>twitter                   -&gt;                        12</p>
<p>china                     -&gt;                        11</p>
<p>happy                     -&gt;                        11</td>
</tr>
<tr>
<td width="479" valign="top">Sardesairajdeep</td>
<td width="479" valign="top">india                     -&gt;                        107</p>
<p>indian                    -&gt;                        53</p>
<p>watch                     -&gt;                        53</p>
<p>tonight                   -&gt;                        47</p>
<p>story                     -&gt;                        43</p>
<p>mumbai                    -&gt;                        41</p>
<p>today                     -&gt;                        41</p>
<p>cricket                   -&gt;                        36</p>
<p>great                     -&gt;                        33</p>
<p>tharoor                   -&gt;                        33</p>
<p>write                     -&gt;                        33</p>
<p>special                   -&gt;                        31</p>
<p>breaking                  -&gt;                        30</p>
<p>hockey                    -&gt;                        30</p>
<p>political                 -&gt;                        29</p>
<p>channel                   -&gt;                        28</p>
<p>years                     -&gt;                        28</p>
<p>world                     -&gt;                        25</p>
<p>media                     -&gt;                        24</p>
<p>debate                    -&gt;                        23</p>
<p>against                   -&gt;                        22</p>
<p>sachin                    -&gt;                        22</p>
<p>still                     -&gt;                        21</p>
<p>11.30                     -&gt;                        20</p>
<p>report                    -&gt;                        20</p>
<p>truly                     -&gt;                        20</p>
<p>guess                     -&gt;                        19</p>
<p>needs                     -&gt;                        19</p>
<p>three                     -&gt;                        18</p>
<p>watching                  -&gt;                        18</p>
<p>indo-pak                  -&gt;                        17</p>
<p>terror                    -&gt;                        17</p>
<p>twitter                   -&gt;                        17</p>
<p>ibnlive.com               -&gt;                        16</p>
<p>justice                   -&gt;                        16</p>
<p>power                     -&gt;                        16</p>
<p>prices                    -&gt;                        16</p>
<p>sunday                    -&gt;                        16</p>
<p>indians                   -&gt;                        15</p>
<p>journalism                -&gt;                        15</p>
<p>stories                   -&gt;                        15</p>
<p>womens                    -&gt;                        15</p>
<p>karan                     -&gt;                        14</p>
<p>killed                    -&gt;                        14</p>
<p>pakistan                  -&gt;                        14</p>
<p>singh                     -&gt;                        14</p>
<p>thought                   -&gt;                        14</p>
<p>because                   -&gt;                        13</p>
<p>delhi                     -&gt;                        13</p>
<p>india&#8217;s                   -&gt;                        13</p>
<p>parliament                -&gt;                        13</p>
<p>party                     -&gt;                        13</p>
<p>rahul                     -&gt;                        13</p>
<p>violence                  -&gt;                        13</p>
<p>attack                    -&gt;                        12</p>
<p>budget                    -&gt;                        12</p>
<p>chief                     -&gt;                        12</td>
</tr>
<tr>
<td width="479" valign="top">RajivMakhni</td>
<td width="479" valign="top">cellguru                  -&gt;                        68</p>
<p>gadget                    -&gt;                        44</p>
<p>phone                     -&gt;                        43</p>
<p>newsnet                   -&gt;                        39</p>
<p>today                     -&gt;                        38</p>
<p>about                     -&gt;                        33</p>
<p>thank                     -&gt;                        32</p>
<p>great                     -&gt;                        31</p>
<p>review                    -&gt;                        26</p>
<p>first                     -&gt;                        25</p>
<p>6.30pm                    -&gt;                        24</p>
<p>iphone                    -&gt;                        24</p>
<p>kareena                   -&gt;                        23</p>
<p>phones                    -&gt;                        23</p>
<p>india                     -&gt;                        21</p>
<p>watch                     -&gt;                        21</p>
<p>coming                    -&gt;                        20</p>
<p>kapoor                    -&gt;                        18</p>
<p>apple                     -&gt;                        17</p>
<p>steve                     -&gt;                        17</p>
<p>10.30pm                   -&gt;                        16</p>
<p>mobile                    -&gt;                        16</p>
<p>price                     -&gt;                        16</p>
<p>people                    -&gt;                        15</p>
<p>12.30                     -&gt;                        13</p>
<p>twitter                   -&gt;                        13</p>
<p>24&#215;7.                     -&gt;                        12</p>
<p>android                   -&gt;                        12</p>
<p>nokia                     -&gt;                        12</p>
<p>pretty                    -&gt;                        12</p>
<p>profit                    -&gt;                        12</p>
<p>@vikramchandra            -&gt;                        11</p>
<p>market                    -&gt;                        11</p>
<p>better                    -&gt;                        10</p>
<p>content                   -&gt;                        10</p>
<p>notion                    -&gt;                        10</p>
<p>samsung                   -&gt;                        10</p>
<p>tablet                    -&gt;                        10</p>
<p>details                   -&gt;                         9</p>
<p>phone,                    -&gt;                         9</p>
<p>shows                     -&gt;                         9</p>
<p>start                     -&gt;                         9</p>
<p>xperia                    -&gt;                         9</p>
<p>amazing                   -&gt;                         8</p>
<p>garmin                    -&gt;                         8</p>
<p>microsoft                 -&gt;                         8</p>
<p>questions                 -&gt;                         8</p>
<p>quite                     -&gt;                         8</p>
<p>8.30pm                    -&gt;                         7</td>
</tr>
<tr>
<td width="479" valign="top">Lintool</td>
<td width="479" valign="top">#hadoop                   -&gt;                        63</p>
<p>#mapreduce                -&gt;                        57</p>
<p>about                     -&gt;                        19</p>
<p>dryadlinq                 -&gt;                        11</p>
<p>@kevinweil                -&gt;                         9</p>
<p>@abdur                    -&gt;                         8</p>
<p>@deliprao                 -&gt;                         7</p>
<p>@ian_soboroff             -&gt;                         7</p>
<p>paper                     -&gt;                         7</p>
<p>@brandynwhite             -&gt;                         5</p>
<p>parallel                  -&gt;                         5</p>
<p>processing                -&gt;                         5</p>
<p>university                -&gt;                         5</p>
<p>vision                    -&gt;                         5</td>
</tr>
<tr>
<td width="479" valign="top">Jboner</td>
<td width="479" valign="top">#akka                     -&gt;                        54</p>
<p>scala                     -&gt;                        23</p>
<p>great                     -&gt;                        21</p>
<p>#scala                    -&gt;                        12</p>
<p>clojure                   -&gt;                        12</p>
<p>concurrency               -&gt;                        11</p>
<p>presentation              -&gt;                         9</p>
<p>better                    -&gt;                         8</p>
<p>cool.                     -&gt;                         8</p>
<p>doing                     -&gt;                         8</p>
<p>looking                   -&gt;                         8</p>
<p>availability              -&gt;                         6</p>
<p>awesome.                  -&gt;                         6</p>
<p>forward                   -&gt;                         6</p>
<p>interesting               -&gt;                         6</p>
<p>looks                     -&gt;                         6</p>
<p>support                   -&gt;                         6</p>
<p>actors.                   -&gt;                         5</p>
<p>after                     -&gt;                         5</p>
<p>scalability               -&gt;                         5</p>
<p>slides                    -&gt;                         5</p>
<p>stability                 -&gt;                         5</p>
<p>starting                  -&gt;                         5</p>
<p>still                     -&gt;                         5</td>
</tr>
<tr>
<td width="479" valign="top">Tunkuv</td>
<td width="479" valign="top">#cricket                  -&gt;                        14</p>
<p>#goldman                  -&gt;                        11</p>
<p>#healthcare               -&gt;                        11</p>
<p>piece                     -&gt;                        11</p>
<p>@telegraphnews            -&gt;                         9</p>
<p>about                     -&gt;                         8</p>
<p>david                     -&gt;                         8</p>
<p>indian                    -&gt;                         8</p>
<p>#india                    -&gt;                         6</p>
<p>beast                     -&gt;                         6</p>
<p>cricket                   -&gt;                         6</p>
<p>right                     -&gt;                         6</p>
<p>@guardianbooks            -&gt;                         5</p>
<p>comment                   -&gt;                         5</p>
<p>human                     -&gt;                         5</p>
<p>interview                 -&gt;                         5</td>
</tr>
<tr>
<td width="479" valign="top">Abdur</td>
<td width="479" valign="top">great                     -&gt;                        81</p>
<p>twitter                   -&gt;                        91</p>
<p>about                     -&gt;                        76</p>
<p>thanks                    -&gt;                        59</p>
<p>should                    -&gt;                        46</p>
<p>happy                     -&gt;                        40</p>
<p>@kevinweil                -&gt;                        39</p>
<p>people                    -&gt;                        37</p>
<p>going                     -&gt;                        34</p>
<p>anyone                    -&gt;                        27</p>
<p>reading                   -&gt;                        25</p>
<p>think                     -&gt;                        25</p>
<p>there                     -&gt;                        23</p>
<p>pretty                    -&gt;                        22</p>
<p>still                     -&gt;                        22</p>
<p>watching                  -&gt;                        22</p>
<p>@goldman                  -&gt;                        21</p>
<p>because                   -&gt;                        21</p>
<p>check                     -&gt;                        20</p>
<p>today                     -&gt;                        19</td>
</tr>
</tbody>
</table>
<p><strong>Which people get lot of mention through tweets?</strong></p>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="479" valign="top"><strong>Twitter id</strong></td>
<td width="479" valign="top"><strong>Top mentions</strong></td>
</tr>
<tr>
<td width="479" valign="top">Gulpanag</td>
<td width="479" valign="top">@vkaul, @nitinsgr, @nithinkd, @angadc, @achitnis,   @reallybuffalo, @sonaliranade, @rwac48, @arifone     ,   @madhulata, @maheshmurthy,   @ssarbjit, @acorn…@sherbir</td>
</tr>
<tr>
<td width="479" valign="top">iamsrk</td>
<td width="479" valign="top">@kkriders, karan</td>
</tr>
<tr>
<td width="479" valign="top">virsanghvi</td>
<td width="479" valign="top">@addthis, @vikram_sood, @pritishnandy,   @vinkaycee, @thyagu2009,   @itssotweet, @kanchangupta, @gulpanag</td>
</tr>
<tr>
<td width="479" valign="top">Sardesairajdeep</td>
<td width="479" valign="top">@imangy; @visaraj; @arunraveen;   @RohanBhade; @swapsdailydose;</p>
<p>@jinglebells27; @sidharth_madhav;   @Varunrd; @St_Hill; @aurodip;</p>
<p>@MirzaSania; @jemin_p; @santheepnair;   @bhogleharsha;</td>
</tr>
<tr>
<td width="479" valign="top">Rajivmakhni</td>
<td width="479" valign="top">@vikramchandra, @ankitv, @gulpanag,   @sachin_malhotra, @achitnis, @mariagorettiz</td>
</tr>
<tr>
<td width="479" valign="top">shashitharoor</td>
<td width="479" valign="top">@shashitharoor, @23jacob, @ashwinsid:   @ramgandhi52; @jaipurprince:;</p>
<p>@karmadude @chrisbrogan;@cricketwallah;   @arpitamgupta:</p>
<p>@josephseb: @arungiri;   @khalidalkhalifa:; @PARVEZ89:;</td>
</tr>
<tr>
<td width="479" valign="top">Jboner</td>
<td width="479" valign="top">@jboner, @djspiewak, @debasishg, @viktorklang, @sbtourist,   @pavlobaron</td>
</tr>
<tr>
<td width="479" valign="top">Lintool</td>
<td width="479" valign="top">@kevinweil, @abdur, @deliprao,   @ian_soboroff, @brandynwhite,</td>
</tr>
<tr>
<td width="479" valign="top">abdur</td>
<td width="479" valign="top">@kevinweil, @goldman  ,   @gregpass, @elizabeth, @jayvirdy, @jess,   @evan,@pankaj</td>
</tr>
<tr>
<td width="479" valign="top">tunkuv</td>
<td width="479" valign="top">@telegraphnews, @guardianbooks, @prempanicker; @saliltripathi; @ultrabrown;</td>
</tr>
</tbody>
</table>
<p><strong>Is the user tweeter savvy and uses hash tags/lists?</strong></p>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="479" valign="top"><strong>Tweeter id</strong></td>
<td width="479" valign="top"><strong>Top # tags</strong><strong></strong></td>
</tr>
<tr>
<td width="479" valign="top">Abdur</td>
<td width="479" valign="top">#sgu; #justreturnzero;  #bestConfLunchOfAllTime; #chirp; #whereisbiz;   #SS.; #sfcabbies; #sfcabsareslow; #spoiledbynorthercalweather;   #copeyesightfail; #notquittingdayjobforstandup; #conspiracytheories; #2; #boycotunited;   #tsafail; #istlecture; #tokyoiswaycool; #SantaAbuse; #ihatedish   #faileddonotcalllist #endphonemarketers;</p>
<p>#zeroinboxfail; #mylaptopisscrewed; #3;   #mustfix;#goodfoodforthought; #foo09; #lifeleasons; #Retarded; #ssm09; #sfheatfail;   #fixfuckedsfcabcompanies; #TED; #7; #SQLputdowns; #cikm2008; #; #ceas</td>
</tr>
<tr>
<td width="479" valign="top">Gulpanag</td>
<td width="479" valign="top">#quote; #shatabdi; #CWG;   #Chandigarh;#amplifier; #trek; #Fatimabhutto; #ipad;</p>
<p>#Hockey; #t20; #delhi; #triathlon;   #race; #Womenempowerment; #BRICK; #lyrics;</p>
<p>#jetlag; #thiruvananthapuram;   #RoyalEnfield; #Kasauli; #MAC; #iPhone; #Twitter; #iPhone;</p>
<p>#Delhi; #AlQaeda #jem   #LET;#roadrunning; #strike; #HT; #Mumbai ; #respect; #Airtel;</p>
<p>#fail; #sin; #sine; #indianexpress;   #;#kasab; #newlylearntfact;  #orkutification; #olaytotaleffects; #Ladakh;   #raiddehimalya;  #NationalGeographic;   #migraine;</p>
<p>#wolframalpha; #LED; #Pvr;#reliance;   #iPhone; #blackberry; #Hot; #tatasky;    #Surya;#ipl; #wonderwhy; #ndtvtechlife; ; #unitedindiapak; #kindness;   #siemens; #stalker; #democracy; #china; #harleydavidson; #indiancyberlaw;   #KingOfGoodTimes;  #harassment #;   #childmarriage; #zeroaccuracy; #capitalstagnation; #africa.; #reva; #hockey</td>
</tr>
<tr>
<td width="479" valign="top">iamsrk</td>
<td width="479" valign="top">na</td>
</tr>
<tr>
<td width="479" valign="top">Jboner</td>
<td width="479" valign="top">#Scala; #Akka; #NodeJS; #MongoDB;   #AMQP; #REST; #UnitTesting;# #javaone;   #Terrastore; #jax2010; #clojure;   #playframework; #camel; #github; #assembla; #maven</td>
</tr>
<tr>
<td width="479" valign="top">Lintool</td>
<td width="479" valign="top">#SIGIR2010; #WWW2010; #MapReduce;  #Hadoop ;#1; #2; #cloudcomputing; #nlp;</p>
<p>#LHC; #pig ;#goog;#aws;</td>
</tr>
<tr>
<td width="479" valign="top">rajivmakhni</td>
<td width="479" valign="top">#nowwatching; #iPhone; #BBM #Appworld;   #tech-fairs;  #ndtvtechlife; #MycelebList; #ndtvgreenathon;</td>
</tr>
<tr>
<td width="479" valign="top">Shashitharoor</td>
<td width="479" valign="top">#TWITronym; #India; #HiFlyers2009;   #TED; #US; #nuclear; #NMST;#awesomeindianthings;   #TEDIndia; #beatcancer; #Nobel; #Gandhi; #uighurs;   #Jet2Kerala</td>
</tr>
<tr>
<td width="479" valign="top">tunkuv</td>
<td width="479" valign="top">#iPad; #DavidHockney; #WorldT20;   #cricket; #Scotus; #Euro #pakistan; #Beatles; #leadersdebate; #GordonBrown;   #Goldman; #UKelections; #Arizona; #Belgium; #SpongebobSquarepants #Heimlich; #Britain; #WallStreet; #Facebook; #Samaranch; #TeaParty; #NBC; #Pulitzer;</td>
</tr>
<tr>
<td width="479" valign="top">virsanghvi</td>
<td width="479" valign="top">na</td>
</tr>
</tbody>
</table>
<p>Code for analysis could be shared at git little later or shared on request.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/govindkanshi.wordpress.com/385/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/govindkanshi.wordpress.com/385/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/govindkanshi.wordpress.com/385/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/govindkanshi.wordpress.com/385/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/govindkanshi.wordpress.com/385/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/govindkanshi.wordpress.com/385/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/govindkanshi.wordpress.com/385/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/govindkanshi.wordpress.com/385/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/govindkanshi.wordpress.com/385/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/govindkanshi.wordpress.com/385/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/govindkanshi.wordpress.com/385/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/govindkanshi.wordpress.com/385/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/govindkanshi.wordpress.com/385/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/govindkanshi.wordpress.com/385/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=govindkanshi.wordpress.com&amp;blog=1090294&amp;post=385&amp;subd=govindkanshi&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://govindkanshi.wordpress.com/2010/07/15/irreverant-post-of-irreverant-task-tweet-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/db06b5079c28bedc44c03860515c8e11?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">govindkanshi</media:title>
		</media:content>
	</item>
	</channel>
</rss>
