<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>

<channel>
	<title>Tim Showers - Web Development, Design, and Data Visualization</title>
	<atom:link href="http://www.timshowers.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.timshowers.com</link>
	<description>Tutorials, Polemics, and Discussion about all things web-nerdy</description>
	<pubDate>Sat, 03 Jan 2009 03:53:20 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6</generator>
	<language>en</language>
			<item>
		<title>Just For Fun: Ice Cube Flowchart</title>
		<link>http://www.timshowers.com/2009/01/ice-cube-flowchart/</link>
		<comments>http://www.timshowers.com/2009/01/ice-cube-flowchart/#comments</comments>
		<pubDate>Sat, 03 Jan 2009 00:25:28 +0000</pubDate>
		<dc:creator>Tim Showers</dc:creator>
		
		<category><![CDATA[Data Visualization]]></category>

		<category><![CDATA[Flow Chart]]></category>

		<category><![CDATA[text processing]]></category>

		<guid isPermaLink="false">http://www.timshowers.com/?p=426</guid>
		<description><![CDATA[Reddit User Chknbone dropped this little New Year&#8217;s present the other day and I thought it was too cool to leave unshared.
It&#8217;s a flow chart of the Ice Cube song Today Was A Good Day (NSFW Language). Classic rap makes for some amusing grammar and linguistic feats, and seeing it mapped out here is a [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/chickenbone/2125243181/sizes/o/in/set-72157594286653785/"><img class="alignleft size-full wp-image-427" style="margin-right: 10px;" title="Today Was A Good Day Flow Chart" src="http://www.timshowers.com/wp-content/uploads/2009/01/ice-cube-flow-small.png" alt="" width="250" height="272" /></a><a href="http://www.reddit.com">Reddit</a> User <a href="http://www.reddit.com/user/Chknbone/">Chknbone</a> dropped <a href="http://www.flickr.com/photos/chickenbone/2125243181/sizes/o/in/set-72157594286653785/">this</a> little New Year&#8217;s present the other day and I thought it was too cool to leave unshared.</p>
<p>It&#8217;s a flow chart of the Ice Cube song <a href="http://www.youtube.com/watch?v=c4RY-eJgHHs">Today Was A Good Day</a> (NSFW Language). Classic rap makes for some amusing grammar and linguistic feats, and seeing it mapped out here is a good laugh, not to mention an amusing stretch of the flowchart metaphor.</p>
<p>So check out the <a href="http://www.flickr.com/photos/chickenbone/2125243181/sizes/o/in/set-72157594286653785/">original at flickr</a>, and here&#8217;s to more cool stuff in 2009!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timshowers.com/2009/01/ice-cube-flowchart/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Using Transparency for Feature Highlighting</title>
		<link>http://www.timshowers.com/2008/12/using-transparency-for-feature-highlighting/</link>
		<comments>http://www.timshowers.com/2008/12/using-transparency-for-feature-highlighting/#comments</comments>
		<pubDate>Mon, 29 Dec 2008 15:46:39 +0000</pubDate>
		<dc:creator>Tim Showers</dc:creator>
		
		<category><![CDATA[Data Visualization]]></category>

		<category><![CDATA[Geocoding &amp; Mapping]]></category>

		<guid isPermaLink="false">http://www.timshowers.com/?p=400</guid>
		<description><![CDATA[Axis maps blog is sharing an interesting technique for adding another dimension of data to maps by using transparency.  Somewhat like a cartogram, they&#8217;ve taken each county in the US and modified it based on population density, but instead of stretching it to a new size, they&#8217;ve increased the alpha transparency to make more populated [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.axismaps.com/blog/2008/12/election-map-follow-up/"><img class="alignleft size-full wp-image-401" style="margin: 15px; float: left;" title="US Election Map with Transparency scaled by Population" src="http://www.timshowers.com/wp-content/uploads/2008/12/cartogram-alternative-small.png" alt="" width="210" height="143" /></a></p>
<p><a href="http://www.axismaps.com/blog/2008/12/election-map-follow-up/">Axis maps blog</a> is sharing an interesting technique for adding another dimension of data to maps by using transparency.  Somewhat like a <a href="http://www.timshowers.com/2008/12/creating-effective-cartograms/">cartogram</a>, they&#8217;ve taken each county in the US and modified it based on population density, but instead of stretching it to a new size, they&#8217;ve increased the alpha transparency to make more populated areas brighter and less populated areas faded out.<span id="more-400"></span></p>
<p>It looks like a challenge to get the color mapping right, especially in light of data that has a few outliers compared to a large area of uniform low density like population.  Perhaps a logarithmic scale would be more useful for data like this.</p>
<p>Regardless it&#8217;s a novel highlighting method, and definitely one to keep in the toolbox in the future.  I could also see this being useful for highlighting in non-map visualizations such as <a href="http://en.wikipedia.org/wiki/Treemapping">treemaps</a>.</p>
<p>There are a few more examples over at the <a href="http://www.axismaps.com/blog/2008/12/election-map-follow-up/">original post</a>, so check it out!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timshowers.com/2008/12/using-transparency-for-feature-highlighting/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Visualization Strategies: Hierarchical Data</title>
		<link>http://www.timshowers.com/2008/12/visualization-strategies-hierarchical-data/</link>
		<comments>http://www.timshowers.com/2008/12/visualization-strategies-hierarchical-data/#comments</comments>
		<pubDate>Mon, 22 Dec 2008 12:49:56 +0000</pubDate>
		<dc:creator>Tim Showers</dc:creator>
		
		<category><![CDATA[Data Visualization]]></category>

		<category><![CDATA[deep data]]></category>

		<category><![CDATA[hierarchical data]]></category>

		<category><![CDATA[multi level data]]></category>

		<guid isPermaLink="false">http://www.timshowers.com/?p=292</guid>
		<description><![CDATA[One of the most challenging types of data to convert into a chart or visualization is also one of the most common: Multi-Level or &#8216;Hierarchical&#8217; data.
Perhaps every category of data is composed of sub-categories, or a change in one data point has a major effect on surrounding data, but regardless, the standard library of charts [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.timshowers.com/i/posts/multi-level-pie.png"><img class="alignleft size-full wp-image-297" style="margin-right: 20px;" title="Example of a Multi-level Pie Chart" src="http://www.timshowers.com/wp-content/uploads/2008/12/multi-level-pie-small.png" alt="Example of a Multi-level Pie Chart" width="210" height="194" /></a>One of the most challenging types of data to convert into a chart or visualization is also one of the most common: Multi-Level or &#8216;Hierarchical&#8217; data.</p>
<p>Perhaps every category of data is composed of sub-categories, or a change in one data point has a major effect on surrounding data, but regardless, the standard library of charts and graphs doesn&#8217;t offer much in the way of making hierarchical data clear, so here are a few alternatives:<span id="more-292"></span></p>
<h2 style="clear: both;">Nested Categories:</h2>
<h3>Treemaps:<a href="http://www.smartmoney.com/map-of-the-market/"><img class="alignright size-full wp-image-298" style="margin-left: 20px; margin-bottom: 20px;" title="Sample Treemap from Smart Money" src="http://www.timshowers.com/wp-content/uploads/2008/12/smartmoney-treemap.gif" alt="Sample Treemap from Smart Money" width="216" height="162" /></a></h3>
<p>Recently &#8220;Treemap&#8221; style charts have come out of their <a href="http://www.cs.umd.edu/hcil/treemap-history/">origins in academia</a> into common use.  A personal favorite of mine, they allow aggregate categories to show through without losing the smaller constituent data.</p>
<p>Each category is sized according to what percent of the total it takes up, and child categories can be placed inside parents in a similar manner. Interactive versions often allow for &#8216;drilling down&#8217; deeper into the data by clicking on a category to see its members full screen.</p>
<p><a href="http://www.jquery.info/The-TreeMap-plugin"><img class="alignright size-full wp-image-304" style="margin-left: 20px;" title="jQuery Treemap Example" src="http://www.timshowers.com/wp-content/uploads/2008/12/jquery-treemap-small.png" alt="" width="210" height="210" /></a>Treemaps seem to work best when the total number of categories at each level is fairly small (otherwise the hundreds or thousands of categories become tiny, undifferentiated squares), and each item fits neatly into a single sub-category.</p>
<p>As usual, <a href="http://manyeyes.alphaworks.ibm.com/manyeyes/">Many Eyes</a> leads the pack with excellent treemaps, and even offers a rather unique <a href="http://manyeyes.alphaworks.ibm.com/manyeyes/page/Treemap_for_Comparisons.html">treemap over time</a>.  Other solutions include a free (non-commercial) <a href="http://www.cs.umd.edu/hcil/treemap/">desktop program</a>, <a href="http://treemap.sourceforge.net/">java library</a>, <a href="http://www.jquery.info/The-TreeMap-plugin">jQuery plugin</a>, <a href="http://code.google.com/p/flex2treemap/">flex component</a>, and <a href="http://www.google.com/url?sa=t&amp;source=web&amp;ct=res&amp;cd=1&amp;url=http%3A%2F%2Fresearch.microsoft.com%2Fcommunity%2Ftreemapper%2F&amp;ei=kO5JSaHXAZ_4NIG6wOwE&amp;usg=AFQjCNGR2RrLj9NR-0jya-K1RE7Mq1jkvQ&amp;sig2=-W0sg8c98R5rB-RNWOYqdA">Excel add-in</a> from Microsoft.</p>
<h3 style="clear: both;">Multi-Level Pie:</h3>
<p><a href="http://www.fusioncharts.com/PowerCharts.asp"><img class="alignright size-full wp-image-301" style="margin-left: 10px;" title="Multi Level Pie Chart" src="http://www.timshowers.com/wp-content/uploads/2008/12/multi-level-pie-2-small.png" alt="" width="230" height="202" /></a>If the categories are a little more concise then those in a treemap, or deeper data is required at a glance, multi-level pie charts may be the solution.  As the concentric rings go &#8216;out&#8217;, each item is sized with respect to its contribution to the inner parent category, allowing for deep hierarchies to be understood at a glance.</p>
<p>Neoformix has a great illustrated example of <a href="http://www.neoformix.com/2006/MultiLevelPieChart.html">building a multi-level pie chart</a>, and Fusion Charts offers the best flash version that i&#8217;ve used in their <a href="http://www.fusioncharts.com/PowerCharts.asp?gMenuItemId=16">powercharts</a> package. <a href="http://hdgraph.objectis.net/">HDGraph</a> applies the concept to hard-drive space use, historically a big driver of multi-level visualization.</p>
<h3>Bubble Diagrams:</h3>
<p><a href="http://manyeyes.alphaworks.ibm.com/manyeyes/page/Bubble_Chart.html"><img class="alignleft size-full wp-image-325" style="margin-right: 20px;" title="bubble-chart-small" src="http://www.timshowers.com/wp-content/uploads/2008/12/bubble-chart-small.png" alt="" width="210" height="163" /></a>Bubble diagrams, which are generally used to show multi-dimensional data (x, y, size, color) instead of hierarchical data, can be adapted easily, either by coloring all members of a similar group the same (for example <a href="http://www.gapminder.org/">Gapminder</a> assigning members of each continent the same color), or by nesting the bubbles within another, to create an effect somewhat similar to a treemap.</p>
<p style="text-align: left;"><a href="http://www.timshowers.com/i/posts/gapminder-bubble.png"><img class="alignleft size-full wp-image-326" style="margin-right: 20px; margin-top: 10px; clear: left;" title="Bubble Chart from Gapminder World" src="http://www.timshowers.com/wp-content/uploads/2008/12/gapminder-bubble-small.png" alt="" width="210" height="163" /></a>Many Eyes offers a <a href="http://manyeyes.alphaworks.ibm.com/manyeyes/page/Bubble_Chart.html">fairly simple version</a>, mainly for highlighting relative sizes, as does Microsoft Excel, while <a href="http://www.gapminder.org/">Gapminder</a> offers more complex abilities. I&#8217;ve yet to see anyone take on the nested-bubble approach, but feel free to contact me if you know of a solution.</p>
<p style="text-align: left;">
<p style="text-align: left;">
<p style="text-align: left;">
<h2 style="clear: both; padding-top: 10px;">Organizational Hierarchies:</h2>
<h3>Vertical Hierarchy:</h3>
<p><img class="alignright size-full wp-image-334" style="margin-left: 10px;" title="A Sample Vertical Tree Interface" src="http://www.timshowers.com/wp-content/uploads/2008/12/vertical-tree.png" alt="" width="230" height="160" />Most commonly seen in the &#8216;organization chart&#8217;, vertical hierarchies are common for showing chains of membership or authority, but rarely have an added dimension to show actual numeric values.</p>
<p>The best examples use pictures effectively, and tend to represent data with each level&#8217;s members being of similar importance.  The <a href="http://blog.thejit.org/javascript-information-visualization-toolkit-jit/">JIT toolkit</a> offers a javascript based approach to this, as do any others.</p>
<h3 style="clear: both;">Horizontal Hierarchy:</h3>
<p><a href="http://blog.thejit.org/wp-content/jit-1.0a/examples/spacetree.html"><img class="alignleft size-full wp-image-336" style="margin-right: 10px; margin-bottom: 15px;" title="horizontal-tree-small" src="http://www.timshowers.com/wp-content/uploads/2008/12/horizontal-tree-small.png" alt="" width="210" height="140" /></a>In contrast to vertical hierarchy, these charts normally serve to hide large amounts of data that may be useless to a user, and allow fast drilling very deep into a hierarchy. Horizontal charts work best when the data is very deep, but each category only has a few subcategories, so the user can drill down very quickly.</p>
<p>My favorite horizontal hierarchy examples comes from the <a href="http://blog.thejit.org/wp-content/jit-1.0a/examples/spacetree.html">spacetree</a> at the <a href="http://blog.thejit.org/javascript-information-visualization-toolkit-jit/">JIT</a>.</p>
<h2>Relational Data:</h2>
<p>Sometimes data in hierarchies takes a more abstract form, where each element influences one or more elements, or just overlaps multiple categories.  These types of data sets tend to provide an additional challenge to visualize, since laying them out optimally involves a bit of heavy-lifting mathematically.</p>
<h3>Node-Link Diagrams:</h3>
<p><a href="http://www.timshowers.com/i/posts/node-chart.png"><img class="alignright size-full wp-image-338" style="margin-left: 20px;" src="http://www.timshowers.com/wp-content/uploads/2008/12/node-chart-small.png" alt="" width="210" height="135" /></a>Coming from a branch of mathematics, these diagrams offer different takes on data that is highly interrelated: one element may link to hundreds (or thousands) of additional data points, or none at all.</p>
<p>Node-Link charts (also called Network Diagrams) are best employed when data is related, but not necessarily in a clear hierarchy.  Different dimensions of data can be shown by the size of each node, color, or even position.</p>
<div id="attachment_342" class="wp-caption alignright" style="width: 220px"><img class="size-full wp-image-342" style="margin-left: 10px;" src="http://www.timshowers.com/wp-content/uploads/2008/12/3d-nodes-small1.png" alt="" width="210" height="227" /><p class="wp-caption-text">A network diagram projected in 3D to highlight hierarchies</p></div>
<p>Some toolkits even allow the links to be different lengths, thicknesses, or colors to show an additional dimension of how two points are related.</p>
<p>A huge number of solutions exist for different variations on network diagrams, a few selections include efforts from <a href="http://asterisq.com/">constellation</a>, fusion charts&#8217; <a href="http://www.fusioncharts.com/PowerCharts.asp">power charts</a>, <a href="http://manyeyes.alphaworks.ibm.com/manyeyes/page/Network_Diagram.html">many eyes</a>, <a href="http://blog.thejit.org/javascript-information-visualization-toolkit-jit/">JIT</a>, and <a href="http://www.jsviz.org/blog/">jsviz</a>.</p>
<p><a href="http://blog.thejit.org/javascript-information-visualization-toolkit-jit/">JIT</a> also offers an innovative hyperbolic network graph, allowing connections that are further down the chain to be hidden from the user.</p>
<h2>Traditional Charts:</h2>
<h3>Stacked Bar:</h3>
<p><a href="http://www.timshowers.com/i/posts/stacked-bar.jpg"><img class="alignright size-full wp-image-349" style="margin-left: 10px; margin-bottom: 10px;" title="stacked-bar-small" src="http://www.timshowers.com/wp-content/uploads/2008/12/stacked-bar-small.png" alt="" width="210" height="131" /></a>Probably a simplest and most familiar chart type on the list, at a glance this chart only allows two &#8216;levels&#8217; of depth, (parent category (Bar), and parent percentage (Inner bar)), but interactive versions allow the user to click on a bar or bar component and &#8216;zoom&#8217; to see categories that make up that bar.</p>
<p>This chart is excellent for comparing simple, broad categories with few constituents, and benefits from general familiarity, but fails to impart more than two levels of depth at a glance, and it&#8217;s often non-obvious that user interaction is possible.</p>
<p>Most common charting packages offer stacked bar charts, including Excel, and flash packages such as <a href="http://teethgrinder.co.uk/open-flash-chart-2/stacked-bar-chart.php">Open Flash Chart</a>, <a href="http://www.fusioncharts.com/">FusionCharts</a>, and <a href="http://anychart.com/home/">AnyChart</a>.</p>
<h3>Stacked Shapes/Area:</h3>
<p>Similar to stacked bar charts, many charts offer stacked Area or Shape (Pyramid, Cyclinder, etc) charts.</p>
<p><a href="http://www.timshowers.com/i/posts/stacked-pyramid.png"><img class="alignleft size-full wp-image-350" style="margin-right: 15px; margin-bottom: 10px;" src="http://www.timshowers.com/wp-content/uploads/2008/12/stacked-pyramid-small.png" alt="" width="210" height="104" /></a>Although these can be useful (particularly the pyramid chart, when one or two options in each &#8216;bar&#8217; take up an overwhelming percent of the total, but the lower percents must still be visible), they suffer from the pitfalls of stacked bar, plus the added difficulty of resolving the relative sizes of 3D objects.  Oftentimes relative proportions that are intuitive in rectangular dimensions (&#8221;<em>This</em> box is twice the area of <em>that</em> box&#8221;) are lost in other shapes or in 3D (&#8221;How much more volume is in the<em> bottom half</em> of the pyramid than the <em>top half</em>?&#8221;).</p>
<p>Again, these are supported by most common spreadsheet and graph libraries, but beware of obscuring data just to add eye-candy.</p>
<h2>Common Challenges &amp; Future Developments</h2>
<p>One of the largest problems with presenting hierarchical data is size, once the members of a tree grow beyond a few dozen, putting them all on screen at once can be overwhelming, but knowing which elements to hide can be a challenge as well.  This makes effective use of color, as well as choosing the right visualization for the job extremely important.</p>
<p>Hopefully in the future we&#8217;ll see more development of toolkits and frameworks for quick-deployment of interactive charts, and better support for deep hierarchical data.  Thanks to strides taken by a few of the javascript toolkits, flex/flare, and IBM&#8217;s Many Eyes, most of the solutions listed above are becoming accessible to the average developer, although custom deployments still abound.</p>
<p>With the uptick in machine readable statistics being made available at corporate, as well as national and international levels, we&#8217;re sure to see a rise in large, deep data sets in the future, and the corresponding need to understand that data intuitively.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timshowers.com/2008/12/visualization-strategies-hierarchical-data/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Creating Effective Cartograms</title>
		<link>http://www.timshowers.com/2008/12/creating-effective-cartograms/</link>
		<comments>http://www.timshowers.com/2008/12/creating-effective-cartograms/#comments</comments>
		<pubDate>Sat, 20 Dec 2008 05:22:25 +0000</pubDate>
		<dc:creator>Tim Showers</dc:creator>
		
		<category><![CDATA[Data Visualization]]></category>

		<category><![CDATA[Geocoding &amp; Mapping]]></category>

		<category><![CDATA[cartograms]]></category>

		<category><![CDATA[infographics]]></category>

		<guid isPermaLink="false">http://www.timshowers.com/?p=374</guid>
		<description><![CDATA[
Cartograms, or visualizations of an area skewed by some variable, are a powerful tool to control for disparities over a large area, especially with respect to politics.
A relatively large but sparsely populated area will dominate a standard projection, whereas a cartogram allows populated areas to be warped to show their true influence (See sidebar).
Applications of [...]]]></description>
			<content:encoded><![CDATA[<div style="width: 220px; float: right;">
<div id="attachment_375" class="wp-caption alignright" style="width: 220px"><br />
<a href="http://www-personal.umich.edu/~mejn/election/2008/"><img class="size-full wp-image-376" title="2008 Election State Map" src="http://www.timshowers.com/wp-content/uploads/2008/12/2008-election-standard-small.png" alt="" width="210" height="128" /></a><br />
<a href="http://www-personal.umich.edu/~mejn/election/2008/"><img class="size-full wp-image-375" title="2008 Presidential Election Cartogram" src="http://www.timshowers.com/wp-content/uploads/2008/12/2008-election-cartogram-small.png" alt="A Standard 2008 Election Map (Upper), and a Cartogram skewed by Population (Lower)" width="210" height="150" /></a><p class="wp-caption-text">A Standard 2008 Election Map (Upper), and a Cartogram skewed by Population (Lower)</p></div>
</div>
<p><a href="http://en.wikipedia.org/wiki/Cartogram">Cartograms</a>, or visualizations of an area skewed by some variable, are a powerful tool to control for disparities over a large area, especially with respect to politics.</p>
<p>A relatively large but sparsely populated area will dominate a standard projection, whereas a cartogram allows populated areas to be warped to show their true influence (See sidebar).</p>
<h3>Applications of Cartograms:</h3>
<p>Socio-economic data is the most obvious use case for cartograms, particularly data from the <a href="http://www-personal.umich.edu/~mejn/cartograms/">United Nations</a> and <a href="http://www-personal.umich.edu/~mejn/election/2008/">National Elections</a>.</p>
<p><span id="more-374"></span>The cartogram strategy can be applied to just about any area visualization, and works particularly well where there are major disparities between area size and overall effect.</p>
<h3>Algorithms and Tools:</h3>
<p><a href="http://www.style.org/iowacaucus/"><img class="size-medium wp-image-382 alignright" title="Iowa Election Cartogram" src="http://www.timshowers.com/wp-content/uploads/2008/12/style-iowa-small.png" alt="" width="210" height="139" /></a>A general outline of the cartogram creation process, as well as some excellent alternatives to the standard &#8217;skewed border&#8217; approach can be found at &#8220;<a href="http://www.style.org/iowacaucus/">Scaling Counties in a Checkerboard State</a>&#8221; over at <a href="http://www.style.org">style.org</a>.</p>
<p>Actual implementation of cartograms seems to often follow the algorithm first published <a href="http://www.pnas.org/content/101/20/7499.abstract">here</a>. Desktop versions in Java (with code) are available <a href="http://chorogram.choros.ch/scapetoad/index.php">here</a> and <a href="http://people.cas.sc.edu/hardistf/cartograms/home.htm">here</a>.  There&#8217;s a rundown of more methods at <a href="http://indiemaps.com/blog/2008/03/free-cartograms/">indiemaps</a>.</p>
<div id="attachment_386" class="wp-caption alignright" style="width: 220px"><a href="http://www.style.org/mappingvotes/"><img class="size-full wp-image-386" title="Style.org county election cartogram" src="http://www.timshowers.com/wp-content/uploads/2008/12/style-redesign-small1.png" alt="A rescaled election cartogram. Can you identify the state?" width="210" height="207" /></a><p class="wp-caption-text">A Rescaled Election Cartogram. Can You Identify the State?</p></div>
<p><a href="http://www.style.org/mappingvotes/">Another post</a> at style.org looks at alternate skewing methods, namely squarifying the areas in question, and then expanding them by a fixed ratio with respect to the data.</p>
<h3>Non-Standard Cartograms:</h3>
<p>Some of the most beautiful work in cartograms comes from the <a href="http://www.sasi.group.shef.ac.uk/publications/pandpexamples.htm">SASI group</a> and <a href="http://www.shef.ac.uk/geography/staff/dorling_danny/">Dr. Daniel Dorling</a>.</p>
<p><a href="http://indiemaps.com/blog/">Indiemaps blog</a> has a number of posts on cartograms, including the promise of python source code, and some great visuals.</p>
<p>I think cartograms are an untapped resource for commerical data, especially for geographically disbursed figures like sales or conversion rates. Hopefully in the future we&#8217;ll see more tools centered around this resource, as well as a decent online generator.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timshowers.com/2008/12/creating-effective-cartograms/feed/</wfw:commentRss>
		</item>
		<item>
		<title>O&#8217;Reilly on the Future of Massive Data Analysis</title>
		<link>http://www.timshowers.com/2008/11/future-of-data-analysis/</link>
		<comments>http://www.timshowers.com/2008/11/future-of-data-analysis/#comments</comments>
		<pubDate>Fri, 21 Nov 2008 01:28:02 +0000</pubDate>
		<dc:creator>Tim Showers</dc:creator>
		
		<category><![CDATA[Data Mining]]></category>

		<category><![CDATA[Data Visualization]]></category>

		<guid isPermaLink="false">http://www.timshowers.com/?p=269</guid>
		<description><![CDATA[There&#8217;s a post by Joseph Hellerstein worth a read over on O&#8217;Reilly Radar: The Commoditization of Massive Data Analysis.  It&#8217;s more enterprise focused then small-normal business focused, but that&#8217;s just a consequence of the target audience.

His primary point is becoming especially pertinent to web companies and smaller developers: The convergence of dropping hardware prices and [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://radar.oreilly.com/2008/11/the-commoditization-of-massive.html"><img class="aligncenter size-full wp-image-272" title="Comparison Chart of Data Analysis Methods" src="http://www.timshowers.com/wp-content/uploads/2008/11/datacomparison.png" alt="" width="500" height="192" /></a></p>
<p>There&#8217;s a post by <a href="http://radar.oreilly.com/joeh">Joseph Hellerstein</a> worth a read over on O&#8217;Reilly Radar: <a href="http://radar.oreilly.com/2008/11/the-commoditization-of-massive.html">The Commoditization of Massive Data Analysis</a>.  It&#8217;s more enterprise focused then small-normal business focused, but that&#8217;s just a consequence of the target audience.</p>
<p><span id="more-269"></span></p>
<p>His primary point is becoming especially pertinent to web companies and smaller developers: The convergence of dropping hardware prices and machine-readable APIs is making the storage and processing of vast amounts of information practical.</p>
<blockquote><p>We are at the beginning of what I call The Industrial Revolution of Data. We&#8217;re not quite there yet, since most of the digital information available today is still individually &#8220;handmade&#8221;: prose on web pages, data entered into forms, videos and music edited and uploaded to servers. But we are starting to see the rise of automatic data generation &#8220;factories&#8221; such as software logs, UPC scanners, RFID, GPS transceivers, video and audio feeds.</p></blockquote>
<p>It&#8217;s already reasonable for a site on a commodity web host to store every user and search interaction, or a database of tens of millions of data points, and in the future it will only get easier. The question is, what tools will we use to make sense of all of this?</p>
<p>His analysis reduces the field to SQL (via <a href="http://www.oracle.com/">Oracle</a>) and MapReduce (via <a href="http://hadoop.apache.org/core/">Hadoop</a>), but once we look beyond the enterprise, tools like <a href="http://www.erlang.org/">Erlang </a>(or functional programming in general) and the emerging <a href="http://incubator.apache.org/couchdb/">CouchDB </a>show promise, not to mention some of the cloud computing entries from <a href="http://aws.amazon.com/">Amazon</a> and others.</p>
<p>On the visualization side of things, tools like <a href="http://processing.org/">Processing</a> and the <a href="http://prefuse.org/">Prefuse Toolkit</a> are seeing quick uptake, as well as more focused commercial tools like <a href="http://www.fusioncharts.com/">FusionCharts</a>.</p>
<p>Whatever the toolchain turns out to be, those of us with an interest in understanding information have the opportunity to be on the forefront of the change, and if we don&#8217;t gain expertise in the available options early, we risk being left behind.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timshowers.com/2008/11/future-of-data-analysis/feed/</wfw:commentRss>
		</item>
		<item>
		<title>A Roundup of U.S. Election Visualizations</title>
		<link>http://www.timshowers.com/2008/11/a-roundup-of-us-election-visualizations/</link>
		<comments>http://www.timshowers.com/2008/11/a-roundup-of-us-election-visualizations/#comments</comments>
		<pubDate>Thu, 20 Nov 2008 00:19:50 +0000</pubDate>
		<dc:creator>Tim Showers</dc:creator>
		
		<category><![CDATA[Data Visualization]]></category>

		<category><![CDATA[Politics]]></category>

		<guid isPermaLink="false">http://www.timshowers.com/?p=262</guid>
		<description><![CDATA[Well Formed Data is offering this post full of offbeat election and presidential data visualizations.
It&#8217;s definitely worth a look, I&#8217;m particularly enamored with the New York Times&#8217; Presidential Physique Graph, although the data density is a bit low for being so large.
]]></description>
			<content:encoded><![CDATA[<p><a href="http://well-formed-data.net/archives/166/election-visualization-roundup"><img class="alignnone size-full wp-image-264" title="Infographic of Presidential Physiques" src="http://www.timshowers.com/wp-content/uploads/2008/11/measure.jpg" alt="" width="480" height="160" /></a></p>
<p><a href="http://well-formed-data.net/">Well Formed Data</a> is offering <a href="http://well-formed-data.net/archives/166/election-visualization-roundup">this post</a> full of offbeat election and presidential data visualizations.</p>
<p>It&#8217;s definitely worth a look, I&#8217;m particularly enamored with the New York Times&#8217; <a href="http://www.nytimes.com/interactive/2008/10/06/opinion/06opchart.html">Presidential Physique Graph</a>, although the data density is a bit low for being so large.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timshowers.com/2008/11/a-roundup-of-us-election-visualizations/feed/</wfw:commentRss>
		</item>
		<item>
		<title>A Collection of Real UI Patterns</title>
		<link>http://www.timshowers.com/2008/09/a-collection-of-real-ui-patterns/</link>
		<comments>http://www.timshowers.com/2008/09/a-collection-of-real-ui-patterns/#comments</comments>
		<pubDate>Wed, 10 Sep 2008 00:25:08 +0000</pubDate>
		<dc:creator>Tim Showers</dc:creator>
		
		<category><![CDATA[Data Visualization]]></category>

		<category><![CDATA[User Interface]]></category>

		<category><![CDATA[Case Studies]]></category>

		<category><![CDATA[Screenshots]]></category>

		<guid isPermaLink="false">http://www.timshowers.com/?p=239</guid>
		<description><![CDATA[Peter Morville has established this great flickr set of screenshots of the user interfaces from actual sites, broken down by purpose &#38; category.
Because they&#8217;re all from major sites, they do a great job illustrating both the prevailing UI paradigms, as well as the different spins on each approach.

]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/morville/collections/72157603785835882/"><img class="alignright size-full wp-image-240" title="Design patterns" src="http://www.timshowers.com/wp-content/uploads/2008/09/designpatterns.png" alt="" width="250" height="289" /></a></p>
<p><a href="http://semanticstudios.com/about/">Peter Morville</a> has established <a href="http://www.flickr.com/photos/morville/collections/72157603785835882/">this</a> great flickr set of screenshots of the user interfaces from actual sites, broken down by purpose &amp; category.</p>
<p>Because they&#8217;re all from major sites, they do a great job illustrating both the prevailing UI paradigms, as well as the different spins on each approach.<br />
<br style="clear: both;" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.timshowers.com/2008/09/a-collection-of-real-ui-patterns/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Implementing HTTPOnly in PHP</title>
		<link>http://www.timshowers.com/2008/08/implementing-httponly-in-php/</link>
		<comments>http://www.timshowers.com/2008/08/implementing-httponly-in-php/#comments</comments>
		<pubDate>Thu, 28 Aug 2008 23:55:35 +0000</pubDate>
		<dc:creator>Tim Showers</dc:creator>
		
		<category><![CDATA[PHP]]></category>

		<guid isPermaLink="false">http://www.timshowers.com/?p=230</guid>
		<description><![CDATA[Coding Horror has an article today about a little-known extension to the http cookie protocol: HTTPOnly.
Essentially, HTTPOnly makes any browser cookies from the site unreadable to javascript (in supported browsers anyway: IE7, Opera 9.5, FF3), thus raising the bar for XSS attacks considerably.
So how do we turn it on in PHP?

If you&#8217;re using a version [...]]]></description>
			<content:encoded><![CDATA[<p>Coding Horror has an <a href="http://www.codinghorror.com/blog/archives/001167.html">article</a> today about a little-known extension to the http cookie protocol: <a href="http://msdn.microsoft.com/en-us/library/ms533046.aspx">HTTPOnly</a>.</p>
<p>Essentially, HTTPOnly makes any browser cookies from the site unreadable to javascript (in supported browsers anyway: IE7, Opera 9.5, FF3), thus raising the bar for <a href="http://en.wikipedia.org/wiki/Cross-site_scripting">XSS</a> attacks considerably.</p>
<p>So how do we turn it on in PHP?<br />
<span id="more-230"></span></p>
<p>If you&#8217;re using a version of PHP pre5.2:</p>

<div class="wp_syntax"><table><tr><td class="line_numbers"><pre>1
</pre></td><td class="code"><pre class="php"><span style="color: #990000;">header</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;Set-Cookie: hidden=value; httpOnly&quot;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span></pre></td></tr></table></div>

<p>If you&#8217;re using a new version of PHP (5.2+):</p>

<div class="wp_syntax"><table><tr><td class="line_numbers"><pre>1
2
3
4
5
6
7
8
9
</pre></td><td class="code"><pre class="php"><span style="color: #666666; font-style: italic;">//Either of these options set the $_SESSION cookie into HTTPOnly mode</span>
<span style="color: #990000;">ini_set</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;session.cookie_httponly&quot;</span><span style="color: #339933;">,</span> <span style="color: #cc66cc;">1</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
<span style="color: #666666; font-style: italic;">// or</span>
<span style="color: #990000;">session_set_cookie_params</span><span style="color: #009900;">&#40;</span><span style="color: #cc66cc;">0</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">NULL</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">NULL</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">NULL</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">TRUE</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
&nbsp;
<span style="color: #666666; font-style: italic;">//Individual cookies can be set using:</span>
<span style="color: #990000;">setcookie</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;abc&quot;</span><span style="color: #339933;">,</span> <span style="color: #0000ff;">&quot;test&quot;</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">NULL</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">NULL</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">NULL</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">NULL</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">TRUE</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span> 
<span style="color: #666666; font-style: italic;">//or</span>
setrawcookie<span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;abc&quot;</span><span style="color: #339933;">,</span> <span style="color: #0000ff;">&quot;test&quot;</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">NULL</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">NULL</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">NULL</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">NULL</span><span style="color: #339933;">,</span> <span style="color: #000000; font-weight: bold;">TRUE</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span></pre></td></tr></table></div>

<p>And that&#8217;s it!<br />
One simple line of code (or function argument) that you can add to your header file that helps makes attacks on your site tougher to execute. </p>
<p>Code snippets courtesy of <a href="http://ilia.ws/archives/121-httpOnly-cookie-flag-support-in-PHP-5.2.html">Ilia Alshanetsky</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.timshowers.com/2008/08/implementing-httponly-in-php/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Visualization Strategies: Text &#038; Documents</title>
		<link>http://www.timshowers.com/2008/08/visualization-strategies-text-documents/</link>
		<comments>http://www.timshowers.com/2008/08/visualization-strategies-text-documents/#comments</comments>
		<pubDate>Thu, 21 Aug 2008 04:41:38 +0000</pubDate>
		<dc:creator>Tim Showers</dc:creator>
		
		<category><![CDATA[Data Visualization]]></category>

		<category><![CDATA[infographics]]></category>

		<category><![CDATA[text processing]]></category>

		<guid isPermaLink="false">http://www.timshowers.com/?p=167</guid>
		<description><![CDATA[Whether it&#8217;s a campaign speech by a presidential contender, or a 300 page bestselling novel, large bodies of text are among the most requested topics for condensing into an infographic.
The purpose can vary from highlighting specific relations to contrasting points or use of language, but all of the following methods focus on distilling a volume [...]]]></description>
			<content:encoded><![CDATA[<p>Whether it&#8217;s a campaign speech by a presidential contender, or a 300 page bestselling novel, large bodies of text are among the most requested topics for condensing into an infographic.</p>
<p>The purpose can vary from highlighting specific relations to contrasting points or use of language, but all of the following methods focus on distilling a volume of text down to a visualization.</p>
<p><span id="more-167"></span></p>
<h2>Volumetric Comparisons:</h2>
<h3>Tag Clouds &amp; Wordles:</h3>
<div id="attachment_168" class="wp-caption alignright" style="width: 220px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/tagcloud.gif"><img class="size-medium wp-image-168" title="A Sample Tag Cloud" src="http://www.timshowers.com/wp-content/uploads/2008/08/tagcloud-300x164.gif" alt="A Tag Cloud from Many Eyes" width="210" height="115" /></a><p class="wp-caption-text">A Tag Cloud from Many Eyes</p></div>
<p>Among the most common visualizations is the so called &#8216;<a title="Tag Cloud" href="http://services.alphaworks.ibm.com/manyeyes/page/Tag_Cloud.html">Tag Cloud</a>&#8216;, which is just a list of words or descriptions that are sized by some relevance measure, usually number of repetitions in the data set.</p>
<p>Tag clouds are often used to highlight and compare themes within the document, such as stripping down a U.S. State of the Union Speech to its major keywords such as Iraq, Budget, etc.</p>
<div id="attachment_173" class="wp-caption alignright" style="width: 245px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/wordle-large2.gif"><img class="size-thumbnail wp-image-173" title="A Sample Wordle" src="http://www.timshowers.com/wp-content/uploads/2008/08/wordle-small.gif" alt="A Sample Wordle from Wordle.net" width="235" height="114" /></a><p class="wp-caption-text">A Sample Wordle from http://www.wordle.net</p></div>
<p>Closely related to tag clouds are <a title="Wordles" href="http://services.alphaworks.ibm.com/manyeyes/page/Wordle.html">wordles</a>, which are more artistically arranged (and often vibrantly colored) versions of a text. They tend to be less directly insightful as an infographic, but often give a more personal feel to a document.</p>
<p>A number of free tools online exist to create Tag Clouds and Wordles, I&#8217;ve personally found the tag cloud tools at <a href="http://services.alphaworks.ibm.com/manyeyes/app">Many Eyes</a> to be excellent, as well as the versions at <a href="http://www.swivel.com/">Swivel</a>. Wordles originated at <a href="http://www.wordle.net">Wordle.net</a></p>
<h3>Word Spectrum Diagrams</h3>
<div id="attachment_181" class="wp-caption aligncenter" style="width: 409px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/american-chinese-dist5.jpg"><img class="size-full wp-image-181" title="A Sample Word Spectrum Diagram" src="http://www.timshowers.com/wp-content/uploads/2008/08/american-chinese-dist5.jpg" alt="World Spectrum Diagram from the Google Data Set" width="399" height="133" /></a><p class="wp-caption-text">World Spectrum Diagram from the Google Data Set</p></div>
<p>Chris Harrison introduced these <a href="http://www.chrisharrison.net/projects/wordspectrum/index.html">Word Spectrum Diagrams</a> with a project meant to show related word bi-grams, or common word pairs.</p>
<p>The technique could easily be adapted to show relations between other sets of words, such as common idiomatic phrases used by speakers, or the word preferences of an author.</p>
<h2>Structure and Document Flow</h2>
<h3>Document Contrast Diagrams</h3>
<div id="attachment_183" class="wp-caption alignright" style="width: 220px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/dcd_1_s.png"><img class="size-medium wp-image-183" title="Sample Document Contrast Diagram" src="http://www.timshowers.com/wp-content/uploads/2008/08/dcd_1_s-300x240.png" alt="A DCD from Jeff Clark" width="210" height="168" /></a><p class="wp-caption-text">A DCD from Jeff Clark</p></div>
<p>Document contrast diagrams use the familiar bubble technique and effective use of color to contrast topic usage in two bodies of text.</p>
<p>Unlike many of the other infographic techniques featured here, DCD&#8217;s help highlight key differences of the text as well as the similarities.</p>
<p>They&#8217;re discussed more in depth at <a href="http://www.neoformix.com/2008/DocumentContrastDiagrams.html">Neoformix</a>.</p>
<h3>Literary Organism Maps</h3>
<div id="attachment_186" class="wp-caption alignright" style="width: 128px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/literary-organism-poster-thumb-600x848.jpg"><img class="size-medium wp-image-186" title="Literary Organism Map for 'On the Road'" src="http://www.timshowers.com/wp-content/uploads/2008/08/literary-organism-poster-thumb-600x848-211x300.jpg" alt="Literary Organism Map for Kerouac's &quot;On the Road&quot;" width="118" height="168" /></a><p class="wp-caption-text">Literary Organism Map for Kerouac&#39;s &quot;On the Road&quot;</p></div>
<p>This next visualization technique comes from <a title="Website of Artist Stefanie Posavec" href="http://www.itsbeenreal.co.uk/">Stefanie Posavec</a>, and while a bit less obvious than the others, offers some intriguing possibilities.</p>
<p>It purports to offer a scaled &#8216;map&#8217; of where each chapter goes within a textual context. While not as intuitive as tag clouds or word maps, variations of this style could be used to track &#8216;threaded&#8217; text like conversation transcripts.</p>
<p>Her <a href="http://www.notcot.com/archives/2008/04/stefanie_posave.php">project page</a> has even more amazing work in the same vein, although most of it is more artistic than analytical.</p>
<h3 style="clear: both;">Word Trees</h3>
<div id="attachment_189" class="wp-caption alignright" style="width: 160px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/dreamtree-small.gif"><img class="size-thumbnail wp-image-189" title="A Sample Word Tree" src="http://www.timshowers.com/wp-content/uploads/2008/08/dreamtree-small-150x150.gif" alt="Word Tree for &quot;I Have a Dream&quot;" width="150" height="150" /></a><p class="wp-caption-text">Word Tree for &quot;I Have a Dream&quot;</p></div>
<p><a href="http://services.alphaworks.ibm.com/manyeyes/page/Word_Tree.html">Word Trees</a> are another document flow visualization from the folks at IBM Many Eyes.</p>
<p>They help provide context to unstructured text, showing the relations between major words and phrases and their follow-ups at a glance.</p>
<p>Font scaling helps to show importance and relative frequency among the parts, and easy searching lets a user follow a path from one concept to another throughout the text body.</p>
<p>By treating follow up phrases as links (almost like a web hyperlink), a user can easily navigate a speech by subject, which makes following a large body of text by theme a breeze.</p>
<p>An obvious application of this technique would be highlighting correspondence between two parties, whether by letters or telephone/email transcripts.</p>
<h3>Document Arc Diagrams</h3>
<div id="attachment_195" class="wp-caption alignright" style="width: 160px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/document_arc_diagrams.jpg"><img class="size-thumbnail wp-image-195" title="Document Arc Diagrams" src="http://www.timshowers.com/wp-content/uploads/2008/08/document_arc_diagrams-150x150.jpg" alt="A Snippet from a Document Arc Diagram" width="150" height="150" /></a><p class="wp-caption-text">A Snippet from a Document Arc Diagram</p></div>
<p>Another winner from the NeoFormix team is the Document Arc Diagram.</p>
<p>Hover over a text fragment highlights all the related text fragments, or &#8216;document arcs&#8217;. This allows for the relations to be visible at a glance when one sentence may be linked to many contexts or related words.</p>
<p>The site includes a handy <a href="http://www.neoformix.com/Projects/DocumentArcDiagrams/index.html">generator</a> so that anyone can build a custom interactive diagram.</p>
<h2>Large Corpus Techniques</h2>
<h3>Transcript Analysis</h3>
<div id="attachment_197" class="wp-caption alignright" style="width: 160px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/nyttextbreakdown.gif"><img class="size-thumbnail wp-image-197" title="New York Times Transcript Analysis" src="http://www.timshowers.com/wp-content/uploads/2008/08/nyttextbreakdown-150x150.gif" alt="Transcript Analysis focusing on the word &quot;Iraq&quot;" width="150" height="150" /></a><p class="wp-caption-text">Transcript Analysis focusing on the word &quot;Iraq&quot;</p></div>
<p>The New York Times offered <a href="http://www.nytimes.com/interactive/2007/10/30/us/politics/20071030_DEBATE_GRAPHIC.html#transcript">this visualization</a> after one of the 2008 Democratic primary debates.</p>
<p>Annotated speech blocks showed the frequency of custom words, and highlighted occurrences and context.</p>
<p>Although this required a highly annotated text, it allowed an in depth searching an comparison of the text without compare.</p>
<p>The biggest challenge to recreating this for other purposes would be the large amount of document tagging required (presumably by hand), although certain transcript type data could probably be automatically tagged and fed into a similar system.</p>
<h3>Directed Sentence Diagrams</h3>
<div id="attachment_199" class="wp-caption alignright" style="width: 160px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/neosentdraw_sotu2000.png"><img class="size-thumbnail wp-image-199" title="Directed Sentence Diagram" src="http://www.timshowers.com/wp-content/uploads/2008/08/neosentdraw_sotu2000-150x150.png" alt="A Sample Directed Sentence Diagram from Neoformix.com" width="150" height="150" /></a><p class="wp-caption-text">A Sample Directed Sentence Diagram from Neoformix.com</p></div>
<p>The final technique is a bit unconventional, but <a href="http://www.neoformix.com/2008/DirectedSentenceDiagrams.html">Directed Sentence Diagrams</a> (again from Neoformix) are designed to show the topic &#8216;flow&#8217; in a body of work via color and cartesian length.</p>
<p>While sentence drawings <a href="http://arbitrarian.wordpress.com/2008/04/13/bushs-legacy-speaks-for-itself/">aren&#8217;t</a> <a href="http://www.notcot.com/archives/2008/04/stefanie_posave.php">new</a>, the idea of &#8216;directing&#8217; them as well as color coding them to show sentence length and topic imparts much more information into the space.</p>
<p>The sparse filling and line lengths gives a great overall picture of the percent of the document (or speaking time) given over to a given topic, and a careful analysis allows the reader to follow the outline of an argument&#8217;s points or the meandering of a story&#8217;s subjects.</p>
<p>Perhaps even moreso than the other diagrams this one requires a well annotated text, since every sentence must be have meta data on its subject, but for certain sample sets (particularly persuasive speeches or editorial style arguments) the technique is particularly effective.</p>
<h2>Formatting the Data</h2>
<p>One of the things in common with all text visualization techniques is effective use of sizing and color. Gradients and gradual font sizes can show relative importance, while opposing colors and sharp contrast can highlight points of contention.</p>
<p>Other primary challenges for the designer include trimming the text down to its vital elements, and pre-processing the data to remove <a href="http://en.wikipedia.org/wiki/Stop_words">stop words</a> or do some sort of <a href="http://en.wikipedia.org/wiki/Stemming">stemming</a> to having a ensure a clean final product. At bare minimum, removal of particles such as &#8216;He&#8217; and &#8216;The&#8217; is generally necessary for any word prevalence visualization.</p>
<p>Automated text processing is often critical with larger bodies, whether it be simple techniques like using PHP&#8217;s str_replace function on particles and stop words, or more advanced methods like stemming with Python&#8217;s <a href="http://nltk.sourceforge.net/">Natural Language Toolkit</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timshowers.com/2008/08/visualization-strategies-text-documents/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Vizualizing Similarity with Circos</title>
		<link>http://www.timshowers.com/2008/08/vizualizing-similarity-with-circos/</link>
		<comments>http://www.timshowers.com/2008/08/vizualizing-similarity-with-circos/#comments</comments>
		<pubDate>Wed, 20 Aug 2008 23:44:34 +0000</pubDate>
		<dc:creator>Tim Showers</dc:creator>
		
		<category><![CDATA[Data Visualization]]></category>

		<category><![CDATA[online tools]]></category>

		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.timshowers.com/?p=157</guid>
		<description><![CDATA[The magazine American Scientist has a cover image this month featuring a circular vizualization of similarity between the human and canine genomes.

It&#8217;s done using the Circos project, a perl vizualization framework for linked data (it&#8217;s written with an eye towards genomics, but appears to work for any tabular data).
They even have an online generator that [...]]]></description>
			<content:encoded><![CDATA[<p>The magazine American Scientist has a <a title="American Scientist September 2008 Cover" href="http://mkweb.bcgsc.ca/circos/?American_Scientist_cover">cover image</a> this month featuring a circular vizualization of similarity between the human and canine genomes.</p>
<p><span id="more-157"></span></p>
<div id="attachment_158" class="wp-caption alignright" style="width: 220px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/circos-sample-11.png"><img class="size-medium wp-image-158" title="circos-sample-11" src="http://www.timshowers.com/wp-content/uploads/2008/08/circos-sample-11-300x300.png" alt="A Sample Image generated from Circos" width="210" height="210" /></a><p class="wp-caption-text">A Sample Image generated from Circos</p></div>
<p>It&#8217;s done using the <a title="Circos" href="http://mkweb.bcgsc.ca/circos/?">Circos</a> project, a perl vizualization framework for linked data (it&#8217;s written with an eye towards genomics, but appears to work for any tabular data).</p>
<p>They even have an <a title="Online Circos Generator" href="http://mkweb.bcgsc.ca/circos/tableviewer/">online generator </a>that takes tabular data and offers a range of settings. If you&#8217;re confused with regards to how to read the charts, the <a title="Circos 'About' Page" href="http://mkweb.bcgsc.ca/circos/tableviewer/index.mhtml/docs">about page</a> explains it.</p>
<p>The output seems a little busy, but that&#8217;s probably mainly a consequence of the input data. I&#8217;m planning to give it a try tonight and report back with my findings.</p>
<p><br style="clear: both;" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.timshowers.com/2008/08/vizualizing-similarity-with-circos/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
