<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Tim Showers - Web Development, Design, and Data Visualization &#187; infographics</title>
	<atom:link href="http://www.timshowers.com/tag/infographics/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.timshowers.com</link>
	<description>Tutorials, Polemics, and Discussion about all things web-nerdy</description>
	<lastBuildDate>Mon, 18 May 2009 21:30:42 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Creating Effective Cartograms</title>
		<link>http://www.timshowers.com/2008/12/creating-effective-cartograms/</link>
		<comments>http://www.timshowers.com/2008/12/creating-effective-cartograms/#comments</comments>
		<pubDate>Sat, 20 Dec 2008 05:22:25 +0000</pubDate>
		<dc:creator>Tim Showers</dc:creator>
				<category><![CDATA[Data Visualization]]></category>
		<category><![CDATA[Geocoding & Mapping]]></category>
		<category><![CDATA[cartograms]]></category>
		<category><![CDATA[infographics]]></category>

		<guid isPermaLink="false">http://www.timshowers.com/?p=374</guid>
		<description><![CDATA[
Cartograms, or visualizations of an area skewed by some variable, are a powerful tool to control for disparities over a large area, especially with respect to politics.
A relatively large but sparsely populated area will dominate a standard projection, whereas a cartogram allows populated areas to be warped to show their true influence (See sidebar).
Applications of [...]]]></description>
			<content:encoded><![CDATA[<div style="width: 220px; float: right;">
<div id="attachment_375" class="wp-caption alignright" style="width: 220px"><br />
<a href="http://www-personal.umich.edu/~mejn/election/2008/"><img class="size-full wp-image-376" title="2008 Election State Map" src="http://www.timshowers.com/wp-content/uploads/2008/12/2008-election-standard-small.png" alt="" width="210" height="128" /></a><br />
<a href="http://www-personal.umich.edu/~mejn/election/2008/"><img class="size-full wp-image-375" title="2008 Presidential Election Cartogram" src="http://www.timshowers.com/wp-content/uploads/2008/12/2008-election-cartogram-small.png" alt="A Standard 2008 Election Map (Upper), and a Cartogram skewed by Population (Lower)" width="210" height="150" /></a><p class="wp-caption-text">A Standard 2008 Election Map (Upper), and a Cartogram skewed by Population (Lower)</p></div>
</div>
<p><a href="http://en.wikipedia.org/wiki/Cartogram">Cartograms</a>, or visualizations of an area skewed by some variable, are a powerful tool to control for disparities over a large area, especially with respect to politics.</p>
<p>A relatively large but sparsely populated area will dominate a standard projection, whereas a cartogram allows populated areas to be warped to show their true influence (See sidebar).</p>
<h3>Applications of Cartograms:</h3>
<p>Socio-economic data is the most obvious use case for cartograms, particularly data from the <a href="http://www-personal.umich.edu/~mejn/cartograms/">United Nations</a> and <a href="http://www-personal.umich.edu/~mejn/election/2008/">National Elections</a>.</p>
<p><span id="more-374"></span>The cartogram strategy can be applied to just about any area visualization, and works particularly well where there are major disparities between area size and overall effect.</p>
<h3>Algorithms and Tools:</h3>
<p><a href="http://www.style.org/iowacaucus/"><img class="size-medium wp-image-382 alignright" title="Iowa Election Cartogram" src="http://www.timshowers.com/wp-content/uploads/2008/12/style-iowa-small.png" alt="" width="210" height="139" /></a>A general outline of the cartogram creation process, as well as some excellent alternatives to the standard &#8217;skewed border&#8217; approach can be found at &#8220;<a href="http://www.style.org/iowacaucus/">Scaling Counties in a Checkerboard State</a>&#8221; over at <a href="http://www.style.org">style.org</a>.</p>
<p>Actual implementation of cartograms seems to often follow the algorithm first published <a href="http://www.pnas.org/content/101/20/7499.abstract">here</a>. Desktop versions in Java (with code) are available <a href="http://chorogram.choros.ch/scapetoad/index.php">here</a> and <a href="http://people.cas.sc.edu/hardistf/cartograms/home.htm">here</a>.  There&#8217;s a rundown of more methods at <a href="http://indiemaps.com/blog/2008/03/free-cartograms/">indiemaps</a>.</p>
<div id="attachment_386" class="wp-caption alignright" style="width: 220px"><a href="http://www.style.org/mappingvotes/"><img class="size-full wp-image-386" title="Style.org county election cartogram" src="http://www.timshowers.com/wp-content/uploads/2008/12/style-redesign-small1.png" alt="A rescaled election cartogram. Can you identify the state?" width="210" height="207" /></a><p class="wp-caption-text">A Rescaled Election Cartogram. Can You Identify the State?</p></div>
<p><a href="http://www.style.org/mappingvotes/">Another post</a> at style.org looks at alternate skewing methods, namely squarifying the areas in question, and then expanding them by a fixed ratio with respect to the data.</p>
<h3>Non-Standard Cartograms:</h3>
<p>Some of the most beautiful work in cartograms comes from the <a href="http://www.sasi.group.shef.ac.uk/publications/pandpexamples.htm">SASI group</a> and <a href="http://www.shef.ac.uk/geography/staff/dorling_danny/">Dr. Daniel Dorling</a>.</p>
<p><a href="http://indiemaps.com/blog/">Indiemaps blog</a> has a number of posts on cartograms, including the promise of python source code, and some great visuals.</p>
<p>I think cartograms are an untapped resource for commerical data, especially for geographically disbursed figures like sales or conversion rates. Hopefully in the future we&#8217;ll see more tools centered around this resource, as well as a decent online generator.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timshowers.com/2008/12/creating-effective-cartograms/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Visualization Strategies: Text &amp; Documents</title>
		<link>http://www.timshowers.com/2008/08/visualization-strategies-text-documents/</link>
		<comments>http://www.timshowers.com/2008/08/visualization-strategies-text-documents/#comments</comments>
		<pubDate>Thu, 21 Aug 2008 04:41:38 +0000</pubDate>
		<dc:creator>Tim Showers</dc:creator>
				<category><![CDATA[Data Visualization]]></category>
		<category><![CDATA[infographics]]></category>
		<category><![CDATA[text processing]]></category>

		<guid isPermaLink="false">http://www.timshowers.com/?p=167</guid>
		<description><![CDATA[Whether it&#8217;s a campaign speech by a presidential contender, or a 300 page bestselling novel, large bodies of text are among the most requested topics for condensing into an infographic.
The purpose can vary from highlighting specific relations to contrasting points or use of language, but all of the following methods focus on distilling a volume [...]]]></description>
			<content:encoded><![CDATA[<p>Whether it&#8217;s a campaign speech by a presidential contender, or a 300 page bestselling novel, large bodies of text are among the most requested topics for condensing into an infographic.</p>
<p>The purpose can vary from highlighting specific relations to contrasting points or use of language, but all of the following methods focus on distilling a volume of text down to a visualization.</p>
<p><span id="more-167"></span></p>
<h2>Volumetric Comparisons:</h2>
<h3>Tag Clouds &amp; Wordles:</h3>
<div id="attachment_168" class="wp-caption alignright" style="width: 220px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/tagcloud.gif"><img class="size-medium wp-image-168" title="A Sample Tag Cloud" src="http://www.timshowers.com/wp-content/uploads/2008/08/tagcloud-300x164.gif" alt="A Tag Cloud from Many Eyes" width="210" height="115" /></a><p class="wp-caption-text">A Tag Cloud from Many Eyes</p></div>
<p>Among the most common visualizations is the so called &#8216;<a title="Tag Cloud" href="http://services.alphaworks.ibm.com/manyeyes/page/Tag_Cloud.html">Tag Cloud</a>&#8216;, which is just a list of words or descriptions that are sized by some relevance measure, usually number of repetitions in the data set.</p>
<p>Tag clouds are often used to highlight and compare themes within the document, such as stripping down a U.S. State of the Union Speech to its major keywords such as Iraq, Budget, etc.</p>
<div id="attachment_173" class="wp-caption alignright" style="width: 245px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/wordle-large2.gif"><img class="size-thumbnail wp-image-173" title="A Sample Wordle" src="http://www.timshowers.com/wp-content/uploads/2008/08/wordle-small.gif" alt="A Sample Wordle from Wordle.net" width="235" height="114" /></a><p class="wp-caption-text">A Sample Wordle from http://www.wordle.net</p></div>
<p>Closely related to tag clouds are <a title="Wordles" href="http://services.alphaworks.ibm.com/manyeyes/page/Wordle.html">wordles</a>, which are more artistically arranged (and often vibrantly colored) versions of a text. They tend to be less directly insightful as an infographic, but often give a more personal feel to a document.</p>
<p>A number of free tools online exist to create Tag Clouds and Wordles, I&#8217;ve personally found the tag cloud tools at <a href="http://services.alphaworks.ibm.com/manyeyes/app">Many Eyes</a> to be excellent, as well as the versions at <a href="http://www.swivel.com/">Swivel</a>. Wordles originated at <a href="http://www.wordle.net">Wordle.net</a></p>
<h3>Word Spectrum Diagrams</h3>
<div id="attachment_181" class="wp-caption aligncenter" style="width: 409px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/american-chinese-dist5.jpg"><img class="size-full wp-image-181" title="A Sample Word Spectrum Diagram" src="http://www.timshowers.com/wp-content/uploads/2008/08/american-chinese-dist5.jpg" alt="World Spectrum Diagram from the Google Data Set" width="399" height="133" /></a><p class="wp-caption-text">World Spectrum Diagram from the Google Data Set</p></div>
<p>Chris Harrison introduced these <a href="http://www.chrisharrison.net/projects/wordspectrum/index.html">Word Spectrum Diagrams</a> with a project meant to show related word bi-grams, or common word pairs.</p>
<p>The technique could easily be adapted to show relations between other sets of words, such as common idiomatic phrases used by speakers, or the word preferences of an author.</p>
<h2>Structure and Document Flow</h2>
<h3>Document Contrast Diagrams</h3>
<div id="attachment_183" class="wp-caption alignright" style="width: 220px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/dcd_1_s.png"><img class="size-medium wp-image-183" title="Sample Document Contrast Diagram" src="http://www.timshowers.com/wp-content/uploads/2008/08/dcd_1_s-300x240.png" alt="A DCD from Jeff Clark" width="210" height="168" /></a><p class="wp-caption-text">A DCD from Jeff Clark</p></div>
<p>Document contrast diagrams use the familiar bubble technique and effective use of color to contrast topic usage in two bodies of text.</p>
<p>Unlike many of the other infographic techniques featured here, DCD&#8217;s help highlight key differences of the text as well as the similarities.</p>
<p>They&#8217;re discussed more in depth at <a href="http://www.neoformix.com/2008/DocumentContrastDiagrams.html">Neoformix</a>.</p>
<h3>Literary Organism Maps</h3>
<div id="attachment_186" class="wp-caption alignright" style="width: 128px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/literary-organism-poster-thumb-600x848.jpg"><img class="size-medium wp-image-186" title="Literary Organism Map for 'On the Road'" src="http://www.timshowers.com/wp-content/uploads/2008/08/literary-organism-poster-thumb-600x848-211x300.jpg" alt="Literary Organism Map for Kerouac's &quot;On the Road&quot;" width="118" height="168" /></a><p class="wp-caption-text">Literary Organism Map for Kerouac&#39;s &quot;On the Road&quot;</p></div>
<p>This next visualization technique comes from <a title="Website of Artist Stefanie Posavec" href="http://www.itsbeenreal.co.uk/">Stefanie Posavec</a>, and while a bit less obvious than the others, offers some intriguing possibilities.</p>
<p>It purports to offer a scaled &#8216;map&#8217; of where each chapter goes within a textual context. While not as intuitive as tag clouds or word maps, variations of this style could be used to track &#8216;threaded&#8217; text like conversation transcripts.</p>
<p>Her <a href="http://www.notcot.com/archives/2008/04/stefanie_posave.php">project page</a> has even more amazing work in the same vein, although most of it is more artistic than analytical.</p>
<h3 style="clear: both;">Word Trees</h3>
<div id="attachment_189" class="wp-caption alignright" style="width: 160px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/dreamtree-small.gif"><img class="size-thumbnail wp-image-189" title="A Sample Word Tree" src="http://www.timshowers.com/wp-content/uploads/2008/08/dreamtree-small-150x150.gif" alt="Word Tree for &quot;I Have a Dream&quot;" width="150" height="150" /></a><p class="wp-caption-text">Word Tree for &quot;I Have a Dream&quot;</p></div>
<p><a href="http://services.alphaworks.ibm.com/manyeyes/page/Word_Tree.html">Word Trees</a> are another document flow visualization from the folks at IBM Many Eyes.</p>
<p>They help provide context to unstructured text, showing the relations between major words and phrases and their follow-ups at a glance.</p>
<p>Font scaling helps to show importance and relative frequency among the parts, and easy searching lets a user follow a path from one concept to another throughout the text body.</p>
<p>By treating follow up phrases as links (almost like a web hyperlink), a user can easily navigate a speech by subject, which makes following a large body of text by theme a breeze.</p>
<p>An obvious application of this technique would be highlighting correspondence between two parties, whether by letters or telephone/email transcripts.</p>
<h3>Document Arc Diagrams</h3>
<div id="attachment_195" class="wp-caption alignright" style="width: 160px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/document_arc_diagrams.jpg"><img class="size-thumbnail wp-image-195" title="Document Arc Diagrams" src="http://www.timshowers.com/wp-content/uploads/2008/08/document_arc_diagrams-150x150.jpg" alt="A Snippet from a Document Arc Diagram" width="150" height="150" /></a><p class="wp-caption-text">A Snippet from a Document Arc Diagram</p></div>
<p>Another winner from the NeoFormix team is the Document Arc Diagram.</p>
<p>Hover over a text fragment highlights all the related text fragments, or &#8216;document arcs&#8217;. This allows for the relations to be visible at a glance when one sentence may be linked to many contexts or related words.</p>
<p>The site includes a handy <a href="http://www.neoformix.com/Projects/DocumentArcDiagrams/index.html">generator</a> so that anyone can build a custom interactive diagram.</p>
<h2>Large Corpus Techniques</h2>
<h3>Transcript Analysis</h3>
<div id="attachment_197" class="wp-caption alignright" style="width: 160px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/nyttextbreakdown.gif"><img class="size-thumbnail wp-image-197" title="New York Times Transcript Analysis" src="http://www.timshowers.com/wp-content/uploads/2008/08/nyttextbreakdown-150x150.gif" alt="Transcript Analysis focusing on the word &quot;Iraq&quot;" width="150" height="150" /></a><p class="wp-caption-text">Transcript Analysis focusing on the word &quot;Iraq&quot;</p></div>
<p>The New York Times offered <a href="http://www.nytimes.com/interactive/2007/10/30/us/politics/20071030_DEBATE_GRAPHIC.html#transcript">this visualization</a> after one of the 2008 Democratic primary debates.</p>
<p>Annotated speech blocks showed the frequency of custom words, and highlighted occurrences and context.</p>
<p>Although this required a highly annotated text, it allowed an in depth searching an comparison of the text without compare.</p>
<p>The biggest challenge to recreating this for other purposes would be the large amount of document tagging required (presumably by hand), although certain transcript type data could probably be automatically tagged and fed into a similar system.</p>
<h3>Directed Sentence Diagrams</h3>
<div id="attachment_199" class="wp-caption alignright" style="width: 160px"><a href="http://www.timshowers.com/wp-content/uploads/2008/08/neosentdraw_sotu2000.png"><img class="size-thumbnail wp-image-199" title="Directed Sentence Diagram" src="http://www.timshowers.com/wp-content/uploads/2008/08/neosentdraw_sotu2000-150x150.png" alt="A Sample Directed Sentence Diagram from Neoformix.com" width="150" height="150" /></a><p class="wp-caption-text">A Sample Directed Sentence Diagram from Neoformix.com</p></div>
<p>The final technique is a bit unconventional, but <a href="http://www.neoformix.com/2008/DirectedSentenceDiagrams.html">Directed Sentence Diagrams</a> (again from Neoformix) are designed to show the topic &#8216;flow&#8217; in a body of work via color and cartesian length.</p>
<p>While sentence drawings <a href="http://arbitrarian.wordpress.com/2008/04/13/bushs-legacy-speaks-for-itself/">aren&#8217;t</a> <a href="http://www.notcot.com/archives/2008/04/stefanie_posave.php">new</a>, the idea of &#8216;directing&#8217; them as well as color coding them to show sentence length and topic imparts much more information into the space.</p>
<p>The sparse filling and line lengths gives a great overall picture of the percent of the document (or speaking time) given over to a given topic, and a careful analysis allows the reader to follow the outline of an argument&#8217;s points or the meandering of a story&#8217;s subjects.</p>
<p>Perhaps even moreso than the other diagrams this one requires a well annotated text, since every sentence must be have meta data on its subject, but for certain sample sets (particularly persuasive speeches or editorial style arguments) the technique is particularly effective.</p>
<h2>Formatting the Data</h2>
<p>One of the things in common with all text visualization techniques is effective use of sizing and color. Gradients and gradual font sizes can show relative importance, while opposing colors and sharp contrast can highlight points of contention.</p>
<p>Other primary challenges for the designer include trimming the text down to its vital elements, and pre-processing the data to remove <a href="http://en.wikipedia.org/wiki/Stop_words">stop words</a> or do some sort of <a href="http://en.wikipedia.org/wiki/Stemming">stemming</a> to having a ensure a clean final product. At bare minimum, removal of particles such as &#8216;He&#8217; and &#8216;The&#8217; is generally necessary for any word prevalence visualization.</p>
<p>Automated text processing is often critical with larger bodies, whether it be simple techniques like using PHP&#8217;s str_replace function on particles and stop words, or more advanced methods like stemming with Python&#8217;s <a href="http://nltk.sourceforge.net/">Natural Language Toolkit</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timshowers.com/2008/08/visualization-strategies-text-documents/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
	</channel>
</rss>
