<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Subluminal Messages &#187; jpmccusker</title>
	<atom:link href="http://subluminal.wordpress.com/author/jpmccusker/feed/" rel="self" type="application/rss+xml" />
	<link>http://subluminal.wordpress.com</link>
	<description>A science blog for all scientists, even amateurs.</description>
	<lastBuildDate>Sat, 29 Aug 2009 22:39:36 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='subluminal.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/aa34fc77cd0a704cf3ae7de17cd79449?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>Subluminal Messages &#187; jpmccusker</title>
		<link>http://subluminal.wordpress.com</link>
	</image>
			<item>
		<title>Why Java&#8217;s Date isn&#8217;t deprecated, even though it might look like it</title>
		<link>http://subluminal.wordpress.com/2009/04/09/why-javas-date-isnt-deprecated-even-though-it-might-look-like-it/</link>
		<comments>http://subluminal.wordpress.com/2009/04/09/why-javas-date-isnt-deprecated-even-though-it-might-look-like-it/#comments</comments>
		<pubDate>Thu, 09 Apr 2009 17:51:14 +0000</pubDate>
		<dc:creator>jpmccusker</dc:creator>
				<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[datetimes]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[programming]]></category>

		<guid isPermaLink="false">http://subluminal.wordpress.com/?p=100</guid>
		<description><![CDATA[This shouldn&#8217;t be news at this point (JDK 1.5 was released 5 years ago), but in Java 5 Sun decided to deprecate large swaths of java.util.Date&#8217;s methods and constructors and introduce the Calendar class, which can handle locale-specific conversions and manipulations of the units of time we humans are used to seeing. This has created [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=100&subd=subluminal&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>This shouldn&#8217;t be news at this point (JDK 1.5 was released 5 years ago), but in Java 5 Sun decided to deprecate large swaths of java.util.Date&#8217;s methods and constructors and introduce the Calendar class, which can handle locale-specific conversions and manipulations of the units of time we humans are used to seeing. This has created some confusion here and there about whether to represent datetimes as Dates or as Calendars, as it looks at a casual glance like Date has been more or less completely deprecated. It remains, however, as the preferred way of representing datetimes in Java. An explanation is after the jump.</p>
<p><span id="more-100"></span>Remember, it never says that Date itself is deprecated, only the methods of conversion are. Also, the javadoc for Calendar explicitly describes it as a way of converting between, not representing, different points in time relative to different calendar systems. The idiomatic usage pattern has a datetime being represented as a Date, but then converted to the locale-specific Calendar object to manipulate the date by saying &#8220;2 days from now&#8221; or &#8220;1 month ago&#8221;, which are locale-specific manipulations.</p>
<p>It seems like this is more complecated than it needs to be. Why not just keep datetimes as Calendars and be done with it? There&#8217;s a major problem with this approach, however. Since Calendar is abstract, one concrete calendar object may work very differently from another. For instance, say an entitiy (A) in Locale A computes and stores a Calendar object, and then another entity (B) in Locale B recieves the Calendar. If B attempts to say &#8220;one month from this point in time&#8221; using the Calendar from A, it will act according to A&#8217;s Locale, not B&#8217;s. Date does not have this problem, and is a representation of UTC. Datetimes are usually best represented as Date objects, and manipulated as Calendars, and then converted back to Dates for storage or communication between systems.</p>
<p>It would be nice in the future to be able to have convenience functions on Date to do Calendar-like things within whatever the current Locale is, but for now, we have to keep that distinction.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/subluminal.wordpress.com/100/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/subluminal.wordpress.com/100/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/subluminal.wordpress.com/100/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/subluminal.wordpress.com/100/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/subluminal.wordpress.com/100/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/subluminal.wordpress.com/100/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/subluminal.wordpress.com/100/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/subluminal.wordpress.com/100/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/subluminal.wordpress.com/100/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/subluminal.wordpress.com/100/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=100&subd=subluminal&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://subluminal.wordpress.com/2009/04/09/why-javas-date-isnt-deprecated-even-though-it-might-look-like-it/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a8455664b943ebd075df06b2c4954cbc?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jpmccusker</media:title>
		</media:content>
	</item>
		<item>
		<title>Go Out and Vote!</title>
		<link>http://subluminal.wordpress.com/2008/11/03/go-out-and-vote/</link>
		<comments>http://subluminal.wordpress.com/2008/11/03/go-out-and-vote/#comments</comments>
		<pubDate>Tue, 04 Nov 2008 04:24:18 +0000</pubDate>
		<dc:creator>jpmccusker</dc:creator>
				<category><![CDATA[Science]]></category>
		<category><![CDATA[vote]]></category>

		<guid isPermaLink="false">http://subluminal.wordpress.com/?p=88</guid>
		<description><![CDATA[Go vote, if you haven&#8217;t already.
Get out the vote. Tell your friends, family, co-workers, fans, enemies, frenemies, random people on the street.
Find out where to vote: http://maps.google.com/vote

Today is the day to make history.
       <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=88&subd=subluminal&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Go vote, if you haven&#8217;t already.</p>
<p>Get out the vote. Tell your friends, family, co-workers, fans, enemies, frenemies, random people on the street.</p>
<p>Find out where to vote: <a href="http://maps.google.com/vote">http://maps.google.com/vote</a></p>
<p><span style="text-align:center; display: block;"><a href="http://subluminal.wordpress.com/2008/11/03/go-out-and-vote/"><img src="http://img.youtube.com/vi/fX40RsSLwF4/2.jpg" alt="" /></a></span></p>
<p>Today is the day to make history.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/subluminal.wordpress.com/88/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/subluminal.wordpress.com/88/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/subluminal.wordpress.com/88/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/subluminal.wordpress.com/88/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/subluminal.wordpress.com/88/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/subluminal.wordpress.com/88/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/subluminal.wordpress.com/88/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/subluminal.wordpress.com/88/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/subluminal.wordpress.com/88/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/subluminal.wordpress.com/88/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=88&subd=subluminal&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://subluminal.wordpress.com/2008/11/03/go-out-and-vote/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a8455664b943ebd075df06b2c4954cbc?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jpmccusker</media:title>
		</media:content>

		<media:content url="http://img.youtube.com/vi/fX40RsSLwF4/2.jpg" medium="image" />
	</item>
		<item>
		<title>Obama is for Science, We&#8217;re for Obama</title>
		<link>http://subluminal.wordpress.com/2008/11/03/obama-is-for-science/</link>
		<comments>http://subluminal.wordpress.com/2008/11/03/obama-is-for-science/#comments</comments>
		<pubDate>Tue, 04 Nov 2008 04:13:43 +0000</pubDate>
		<dc:creator>jpmccusker</dc:creator>
				<category><![CDATA[Science]]></category>
		<category><![CDATA[carbon emissions]]></category>
		<category><![CDATA[education]]></category>
		<category><![CDATA[evidence based policy]]></category>
		<category><![CDATA[green energy]]></category>
		<category><![CDATA[obama]]></category>
		<category><![CDATA[politics]]></category>
		<category><![CDATA[science funding]]></category>

		<guid isPermaLink="false">http://subluminal.wordpress.com/?p=86</guid>
		<description><![CDATA[[Edited to add Obama's positions on NASA and Space Exploration]
Those of you who know me already know that I support Barack Obama for president. I&#8217;ve hesitated to post this here, due to the non-political nature of this blog. However, it is long past time for me to make the case for Obama and his policies [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=86&subd=subluminal&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>[Edited to add Obama's positions on NASA and Space Exploration]</p>
<p>Those of you who know me already know that I support Barack Obama for president. I&#8217;ve hesitated to post this here, due to the non-political nature of this blog. However, it is long past time for me to make the case for Obama and his policies on science and research. More after the cut on his key positions and why they are important.</p>
<p><span style="text-align:center; display: block;"><a href="http://subluminal.wordpress.com/2008/11/03/obama-is-for-science/"><img src="http://img.youtube.com/vi/fBTpZ650l4w/2.jpg" alt="" /></a></span></p>
<p><span id="more-86"></span></p>
<p>Here are Obama&#8217;s key positions on science policy, with my commentary:</p>
<p><strong>Restoring integrity to U.S. science policy to ensure that decisions that can be informed by science are made on the basis of the strongest possible evidence.</strong></p>
<p>This is essential to any sane science policy. If we can learn more about an issue through evidence that has been scientifically validated, we will be much more effective in creating positive change. Science can inform policy in many ways, and this has not happened for 8 years, and wouldn&#8217;t happen under a McCain presidency.</p>
<p><strong>Doubling over a 10 year period the federal investment in basic research by key science agencies, with a special emphasis on supporting young researchers at the beginning of their careers, and backing high-risk, high-return research. </strong></p>
<p>High risk science has been at a standstill for decades as funding for science agencies has dried up. When funding is eliminated, funding agencies become more risk averse and focus funding on researchers who have already proved themselves. This often (but not always) leads to incremental improvements and a risk-averse environment for researchers who need to propose grants to fund new breakthroughs. Science is inherently risky, and no one knows what advances can be made. By increasing the ability of researchers to perform more risky, cutting edge research, we broaden our horizon of the possible with the dollars we spend.</p>
<p><strong>Making a national commitment to science education and training by recruiting some of America’s best minds to teach K-12 math and science and by tripling the number of the National Science Foundation’s Graduate Research Fellowships.</strong></p>
<p>As a computer scientist, I have no incentive to teach math or science to K-12 classrooms. I&#8217;m qualified to teach most of the math curriculum, any programming courses, or any basic science class. The people who do science are not teaching science to our children, which means that the quality of that education suffers, because teaching science in K-12 inevitably falls back on teaching &#8220;science facts&#8221;, rather than teaching students what falsifiability is, or how to set up an experiment with proper controls.</p>
<p>Also, the NSF&#8217;s <a href="http://www.nsf.gov/grfp">Graduate Research Fellowship</a> is a program to fund graduate students for Masters and Doctoral studies in the sciences. I know people who are on this fellowship, and they are doing amazing work. We need more of them.</p>
<p><strong>Encouraging American innovation to flourish by making the R&amp;D tax credit permanent, streamlining our patent system, eliminating the capital gains tax on start-ups and small businesses, and promoting the deployment of next-generation broadband networks.</strong></p>
<p>I used to run a small technology startup. We need more R&amp;D in this country, it and basic science are the engines that drive our economy. We have seen the amazing advances we can make when we encourage innovation, and we have seen the collapses that occur when we rest on our laurals and don&#8217;t drive innovation forward.</p>
<p><strong>Addressing the “grand challenges” of the 21st century through accelerating the transition to a low-carbon, oil-free economy, enabling all Americans to live longer and healthier lives, and protecting our country from emerging threats to our national security. </strong></p>
<p>Our carbon impact could easily be one of the worst things that have ever happened to the planet, causing mass extinctions and untold damage to the world&#8217;s ecosystem. But it can also be one of our greatest triumphs. Almost since life began, oxidation of carbon = energy. By using petroleum we simply accellerated the pace of that equation. By switching to technologies that can directly harness the power of the sun and earth, using wind, solar, hydro, geothermal, and renewable biofuels, we can &#8220;farm&#8221; energy rather than depending on &#8220;wild&#8221; depleted reserves. This can be done, but we need incentives to do this.</p>
<p><strong>STRENGTHEN AMERICA’S LEADERSHIP IN SPACE SCIENCE </strong></p>
<p><strong>It is essential to maintain U.S. leadership in space.  President Bush set forth a bold mission for NASA, but failed to provide the leadership and funding to see it through.  As a result, in 2010, the United States will face a five-year gap in the ability to send astronauts to the International Space Station.  An Obama administration will restore America’s leadership in space science:<br />
</strong></p>
<ul>
<li><strong>Establish a robust program for human and robotic exploration that preserves our space workforce, engages international allies and draws on expertise in the private sector.</strong></li>
<li><strong>Close the gap in NASA access to lower earth orbit and better utilize the International Space Station.</strong></li>
<li><strong>Strengthen NASA’s missions in space science, weather, climate research, and aeronautical research.</strong></li>
<li><strong>Develop a new generation of space vehicles to replace the Space Shuttle scheduled to retire in 2010.</strong></li>
<li><strong>Improve NASA’s educational outreach and programs that promote spin-off consumer technologies. </strong></li>
</ul>
<p>There&#8217;s a lot more to Obama&#8217;s science plan. It is very comprehensive and evidence-driven. He will listen to scientists to find out what science can answer, and to find out what is needed to get those answers. I encourage you to read his plan, available <a href="http://www.barackobama.com/issues/additional/#science">here</a>. Please remember to vote!</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/subluminal.wordpress.com/86/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/subluminal.wordpress.com/86/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/subluminal.wordpress.com/86/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/subluminal.wordpress.com/86/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/subluminal.wordpress.com/86/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/subluminal.wordpress.com/86/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/subluminal.wordpress.com/86/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/subluminal.wordpress.com/86/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/subluminal.wordpress.com/86/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/subluminal.wordpress.com/86/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=86&subd=subluminal&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://subluminal.wordpress.com/2008/11/03/obama-is-for-science/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a8455664b943ebd075df06b2c4954cbc?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jpmccusker</media:title>
		</media:content>

		<media:content url="http://img.youtube.com/vi/fBTpZ650l4w/2.jpg" medium="image" />
	</item>
		<item>
		<title>Zen Neuroscience: Observing Mindfulness</title>
		<link>http://subluminal.wordpress.com/2008/09/24/zen-neuroscience-observing-mindfulness/</link>
		<comments>http://subluminal.wordpress.com/2008/09/24/zen-neuroscience-observing-mindfulness/#comments</comments>
		<pubDate>Wed, 24 Sep 2008 21:41:30 +0000</pubDate>
		<dc:creator>jpmccusker</dc:creator>
				<category><![CDATA[Science]]></category>

		<guid isPermaLink="false">http://subluminal.wordpress.com/?p=81</guid>
		<description><![CDATA[There are specific neural processes that take place when people meditate, which is demonstrated in the PLoS ONE paper &#8220;Thinking about Not-Thinking”: Neural Correlates of Conceptual Processing during Zen Meditation:
Recent neuroimaging studies have identified a set of brain regions that are metabolically active during wakeful rest and consistently deactivate in a variety the performance of [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=81&subd=subluminal&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>There are specific neural processes that take place when people meditate, which is demonstrated in the <a href="http://plosone.org">PLoS ONE</a> paper <a href="http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0003083">&#8220;Thinking about Not-Thinking”: Neural Correlates of Conceptual Processing during Zen Meditation</a>:</p>
<p style="padding-left:30px;">Recent neuroimaging studies have identified a set of brain regions that are metabolically active during wakeful rest and consistently deactivate in a variety the performance of demanding tasks. This “default network” has been functionally linked to the stream of thoughts occurring automatically in the absence of goal-directed activity and which constitutes an aspect of mental behavior specifically addressed by many meditative practices. Zen meditation, in particular, is traditionally associated with a mental state of full awareness but reduced conceptual content, to be attained via a disciplined regulation of attention and bodily posture. Using fMRI and a simplified meditative condition interspersed with a lexical decision task, we investigated the neural correlates of conceptual processing during meditation in regular Zen practitioners and matched control subjects. While behavioral performance did not differ between groups, Zen practitioners displayed a reduced duration of the neural response linked to conceptual processing in regions of the default network, suggesting that meditative training may foster the ability to control the automatic cascade of semantic associations triggered by a stimulus and, by extension, to voluntarily regulate the flow of spontaneous mentation.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/subluminal.wordpress.com/81/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/subluminal.wordpress.com/81/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/subluminal.wordpress.com/81/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/subluminal.wordpress.com/81/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/subluminal.wordpress.com/81/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/subluminal.wordpress.com/81/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/subluminal.wordpress.com/81/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/subluminal.wordpress.com/81/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/subluminal.wordpress.com/81/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/subluminal.wordpress.com/81/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=81&subd=subluminal&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://subluminal.wordpress.com/2008/09/24/zen-neuroscience-observing-mindfulness/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a8455664b943ebd075df06b2c4954cbc?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jpmccusker</media:title>
		</media:content>
	</item>
		<item>
		<title>Cross-Reference Visualizations</title>
		<link>http://subluminal.wordpress.com/2008/08/15/cross-reference-visualizations/</link>
		<comments>http://subluminal.wordpress.com/2008/08/15/cross-reference-visualizations/#comments</comments>
		<pubDate>Sat, 16 Aug 2008 04:32:32 +0000</pubDate>
		<dc:creator>jpmccusker</dc:creator>
				<category><![CDATA[Science]]></category>
		<category><![CDATA[Visualization]]></category>
		<category><![CDATA[cross-references]]></category>
		<category><![CDATA[religion]]></category>

		<guid isPermaLink="false">http://subluminal.wordpress.com/?p=74</guid>
		<description><![CDATA[This is a pretty cool visualization of bible cross-references that was developed by Chris Harrison to show bible cross references throughout the length of the bible. I can&#8217;t help but think of how interesting this could be for genome visualization: the cross references could be based on genes that are transcription factors for other genes. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=74&subd=subluminal&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><div class="wp-caption aligncenter" style="width: 570px"><a href="http://www.chrisharrison.net/projects/bibleviz/index.html"><img src="http://www.chrisharrison.net/projects/bibleviz/BibleVizArc7small.jpg" alt="The bar graph that runs along the bottom represents all of the chapters in the Bible. Books alternate in color between white and light gray. The length of each bar denotes the number of verses in the chapter. Each of the 63,779 cross references found in the Bible is depicted by a single arc - the color corresponds to the distance between the two chapters, creating a rainbow-like effect." width="560" height="336" /></a><p class="wp-caption-text">Bible Cross-References: The bar graph that runs along the bottom represents all of the chapters in the Bible. Books alternate in color between white and light gray. The length of each bar denotes the number of verses in the chapter. Each of the 63,779 cross references found in the Bible is depicted by a single arc - the color corresponds to the distance between the two chapters, creating a rainbow-like effect. </p></div>
<p>This is <a href="http://www.chrisharrison.net/projects/bibleviz/index.html">a pretty cool visualization of bible cross-references</a> that was developed by Chris Harrison to show bible cross references throughout the length of the bible. I can&#8217;t help but think of how interesting this could be for genome visualization: the cross references could be based on genes that are transcription factors for other genes. (activate or deactivate those genes). Chris has <a href="http://www.chrisharrison.net/projects/visualization.html">lots of other fascinating visualization projects</a> to look at, and is very stimulating to browse it.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/subluminal.wordpress.com/74/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/subluminal.wordpress.com/74/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/subluminal.wordpress.com/74/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/subluminal.wordpress.com/74/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/subluminal.wordpress.com/74/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/subluminal.wordpress.com/74/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/subluminal.wordpress.com/74/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/subluminal.wordpress.com/74/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/subluminal.wordpress.com/74/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/subluminal.wordpress.com/74/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/subluminal.wordpress.com/74/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/subluminal.wordpress.com/74/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=74&subd=subluminal&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://subluminal.wordpress.com/2008/08/15/cross-reference-visualizations/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a8455664b943ebd075df06b2c4954cbc?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jpmccusker</media:title>
		</media:content>

		<media:content url="http://www.chrisharrison.net/projects/bibleviz/BibleVizArc7small.jpg" medium="image">
			<media:title type="html">The bar graph that runs along the bottom represents all of the chapters in the Bible. Books alternate in color between white and light gray. The length of each bar denotes the number of verses in the chapter. Each of the 63,779 cross references found in the Bible is depicted by a single arc - the color corresponds to the distance between the two chapters, creating a rainbow-like effect.</media:title>
		</media:content>
	</item>
		<item>
		<title>Are Genius and Madness Even Closer Than we Think?</title>
		<link>http://subluminal.wordpress.com/2008/08/07/are-genius-and-madness-even-closer-than-we-think/</link>
		<comments>http://subluminal.wordpress.com/2008/08/07/are-genius-and-madness-even-closer-than-we-think/#comments</comments>
		<pubDate>Thu, 07 Aug 2008 19:13:21 +0000</pubDate>
		<dc:creator>jpmccusker</dc:creator>
				<category><![CDATA[Science]]></category>
		<category><![CDATA[evolution]]></category>
		<category><![CDATA[genetics]]></category>
		<category><![CDATA[neuroscience]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[schizophrenia]]></category>

		<guid isPermaLink="false">http://subluminal.wordpress.com/?p=48</guid>
		<description><![CDATA[&#8220;no great genius was without a mixture of insanity&#8221;
- Aristotle
&#8220;They say madness runs in our family. Some even call me mad! And why? Because I dared to dream …of my own race of atomic monsters! Atomic supermen with octagonal-shaped bodies that suck blood out of&#8230;&#8221;
-Prof. Hubert Farnsworth
To be clear, by genius I mean the ability [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=48&subd=subluminal&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p style="text-align:center;">&#8220;no great genius was without a mixture of insanity&#8221;<br />
- Aristotle</p>
<p style="text-align:center;">&#8220;They say madness runs in our family. Some even call me mad! And why? Because I dared to dream …of my own race of atomic monsters! Atomic supermen with octagonal-shaped bodies that suck blood out of&#8230;&#8221;<br />
-Prof. Hubert Farnsworth</p>
<p>To be clear, by genius I mean the ability for humans to think at the level we do. And by madness, I mean madness. A recent study suggests that we have our big brains at a high price: schizophrenia. It comes down to the massive metabolism needs that our brains have. A team led by evolutionary biologist Philipp Khaitovich of the Shanghai Institutes for Biological Sciences and the Max Planck Institute for Evolutionary Anthropology created an experiment to see how much genes that are involved in schizophrenia have evolved since humans split from chimpanzees.<span id="more-48"></span></p>
<p>From &#8220;<a href="http://sciencenow.sciencemag.org/cgi/content/full/2008/805/4">Madness: Price of a Big Brain?</a>&#8221; (<a href="http://sciencenow.sciencemag.org/">ScienceNOW</a>):
</p>
<p style="padding-left:30px;">First, the researchers looked at published databases of positively selected brain genes, which have been classified into 22 categories according to their function. They found that six of the categories included a high proportion of genes also implicated in schizophrenia; the genes in these six categories relate to energy metabolism.</p>
<p style="padding-left:30px;">So the team focused its search on energy pathways in the brain. Using a technique called nuclear magnetic resonance spectroscopy, the researchers measured the concentrations of 21 metabolites key to nerve function in the brains of 10 deceased schizophrenia patients and 12 normal human controls. Specifically, they examined an area of the prefrontal cortex implicated in social cognition. Nine of the metabolites, such as lactate, choline, and acetate, showed significantly different concentrations&#8211;some higher, some lower&#8211;in schizophrenics and normal humans. That finding, the authors say, confirms previous studies that brain metabolism is &#8220;substantially altered&#8221; in schizophrenia.</p>
<p style="padding-left:30px;">The researchers then looked at whether those nine metabolites might be important in human brain evolution. When they measured the concentrations in the same area in chimp brains, the team found that the differences between chimps and normal humans were much greater for those nine than for the 12 metabolites not implicated in schizophrenia, suggesting that energy pathways implicated in schizophrenia were also altered by human evolution, the team reports this week in <em>Genome Biology</em>. And 40 genes involved in these nine schizophrenia-related pathways also differed much more between chimps and humans than genes associated with the other 12.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/subluminal.wordpress.com/48/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/subluminal.wordpress.com/48/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/subluminal.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/subluminal.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/subluminal.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/subluminal.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/subluminal.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/subluminal.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/subluminal.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/subluminal.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/subluminal.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/subluminal.wordpress.com/48/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=48&subd=subluminal&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://subluminal.wordpress.com/2008/08/07/are-genius-and-madness-even-closer-than-we-think/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a8455664b943ebd075df06b2c4954cbc?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jpmccusker</media:title>
		</media:content>
	</item>
		<item>
		<title>Searching for Text in Images</title>
		<link>http://subluminal.wordpress.com/2008/08/01/searching-for-text-in-images/</link>
		<comments>http://subluminal.wordpress.com/2008/08/01/searching-for-text-in-images/#comments</comments>
		<pubDate>Sat, 02 Aug 2008 03:31:19 +0000</pubDate>
		<dc:creator>jpmccusker</dc:creator>
				<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Science]]></category>
		<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[image mining]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[Yale]]></category>

		<guid isPermaLink="false">http://subluminal.wordpress.com/?p=26</guid>
		<description><![CDATA[We (my lab, http://krauthammerlab.med.yale.edu) recently published on our new image search engine for biomedical images, called YIF, or Yale Image Finder. From our blog post on our web site:
We have recently released a new biomedical image search engine we call YIF. You can access it at:
http://krauthammerlab.med.yale.edu/imagefinder
You can search the actual image content of over 34,000 [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=26&subd=subluminal&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>We (my lab, <a href="http://krauthammerlab.med.yale.edu">http://krauthammerlab.med.yale.edu</a>) recently published on our new image search engine for biomedical images, called YIF, or Yale Image Finder. From our <a href="http://krauthammerlab.med.yale.edu/introducing-yif">blog post</a> on our web site:</p>
<p style="padding-left:30px;">We have recently released a new biomedical image search engine we call YIF. You can access it at:</p>
<p style="padding-left:30px;"><a href="http://krauthammerlab.med.yale.edu/">http://krauthammerlab.med.yale.edu/imagefinder</a></p>
<p style="padding-left:30px;">You can search the actual image content of over 34,000 <a href="http://www.pubmedcentral.nih.gov/about/openaccess.html">Open   Access</a> articles from <a href="http://www.pubmedcentral.nih.gov/">PubMed Central</a>. We use OCR with different levels of image correction (article and corpus) for highly accurate image text extraction.</p>
<p style="padding-left:30px;">For more details about our algorithms, we have a paper in Bioinformatics titled “<a href="http://bioinformatics.oxfordjournals.org/cgi/content/abstract/btn340v1">Yale Image Finder (YIF): a new search engine for retrieving biomedical images</a>“.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/subluminal.wordpress.com/26/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/subluminal.wordpress.com/26/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/subluminal.wordpress.com/26/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/subluminal.wordpress.com/26/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/subluminal.wordpress.com/26/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/subluminal.wordpress.com/26/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/subluminal.wordpress.com/26/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/subluminal.wordpress.com/26/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/subluminal.wordpress.com/26/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/subluminal.wordpress.com/26/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/subluminal.wordpress.com/26/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/subluminal.wordpress.com/26/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=26&subd=subluminal&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://subluminal.wordpress.com/2008/08/01/searching-for-text-in-images/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a8455664b943ebd075df06b2c4954cbc?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jpmccusker</media:title>
		</media:content>
	</item>
		<item>
		<title>Double Arm Transplant in Germany Considered Success</title>
		<link>http://subluminal.wordpress.com/2008/08/01/double-arm-transplant-in-germany-considered-success/</link>
		<comments>http://subluminal.wordpress.com/2008/08/01/double-arm-transplant-in-germany-considered-success/#comments</comments>
		<pubDate>Sat, 02 Aug 2008 03:27:25 +0000</pubDate>
		<dc:creator>jpmccusker</dc:creator>
				<category><![CDATA[Science]]></category>
		<category><![CDATA[medicine]]></category>

		<guid isPermaLink="false">http://subluminal.wordpress.com/?p=40</guid>
		<description><![CDATA[15 hours and 40 doctors later, a farmer has two new arms. The surgery replaced both limbs of a farmer who lost his arms in an accident in 2002. He lost both arms below the shoulder. From the Guardian article:
Christoph Höhnke, a surgeon on the transplant team, said that the complicated procedure was completed without [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=40&subd=subluminal&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>15 hours and 40 doctors later, <a href="http://www.guardian.co.uk/world/2008/aug/02/germany">a farmer has two new arms</a>. The surgery replaced both limbs of a farmer who lost his arms in an accident in 2002. He lost both arms below the shoulder. From the Guardian article:</p>
<p style="padding-left:30px;">Christoph Höhnke, a surgeon on the transplant team, said that the complicated procedure was completed without any unforeseen problems. It involved a team of 40 doctors, nurses and assistants working together, attaching one arm and then the other. &#8220;The whole thing went according to script,&#8221; he said.</p>
<p>With a surgery as complicated as this, though, many things can go wrong. The patient can reject the transplants, although close matching of blood and other immunological factors can minimize this, along with drugs that can dampen the immune system&#8217;s reaction to foreign bodies. The bones may not completely graft together. It&#8217;s like recovering from two severely broken arms on top of everything else.</p>
<p>Nerve regeneration has come a long way, however. The man who received a single arm transplant in 2006 was able to write with the transplanted hand within a year.</p>
<p><a href="http://krauthammerlab.med.yale.edu/imagefinder/Home,$Form.direct?formids=query%2CfigureText%2CfigureTextHP%2Ccaption%2Cabstract%2Ctitle%2Ctitle_0&amp;submitmode=&amp;submitname=&amp;query=%22nerve+regeneration%22&amp;figureText=on&amp;caption=on&amp;abstract=on&amp;title=on&amp;title_0=on">&#8220;nerve regeneration&#8221;</a> at <a href="http://krauthammerlab.med.yale.edu/imagefinder">Yale Image Finder</a>.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/subluminal.wordpress.com/40/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/subluminal.wordpress.com/40/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/subluminal.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/subluminal.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/subluminal.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/subluminal.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/subluminal.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/subluminal.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/subluminal.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/subluminal.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/subluminal.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/subluminal.wordpress.com/40/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=40&subd=subluminal&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://subluminal.wordpress.com/2008/08/01/double-arm-transplant-in-germany-considered-success/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a8455664b943ebd075df06b2c4954cbc?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jpmccusker</media:title>
		</media:content>
	</item>
		<item>
		<title>Running Standard Deviations</title>
		<link>http://subluminal.wordpress.com/2008/07/31/running-standard-deviations/</link>
		<comments>http://subluminal.wordpress.com/2008/07/31/running-standard-deviations/#comments</comments>
		<pubDate>Thu, 31 Jul 2008 19:53:11 +0000</pubDate>
		<dc:creator>jpmccusker</dc:creator>
				<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Science]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[centroid]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[eigenvectors]]></category>
		<category><![CDATA[math]]></category>
		<category><![CDATA[programming]]></category>

		<guid isPermaLink="false">http://subluminal.wordpress.com/?p=15</guid>
		<description><![CDATA[My friend Dan at Invisible Blocks came up with a great way to compute a long-running mean from the count and mean:
count += 1
mean += (x - mean) / count

I remembered that I had come up with a similar thing for standard deviation back when I was developing clustering algorithms that could use that value. It uses a power sum average, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=15&subd=subluminal&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>My friend Dan at <a href="http://invisibleblocks.wordpress.com">Invisible Blocks</a> came up with a great way to <a href="http://invisibleblocks.wordpress.com/2008/07/30/long-running-averages-without-the-sum-of-preceding-values/">compute a long-running mean</a> from the count and mean:</p>
<pre class="brush: python;">count += 1
mean += (x - mean) / count
</pre>
<p>I remembered that I had come up with a similar thing for standard deviation back when I was developing clustering algorithms that could use that value. It uses a power sum average, where you track the power sum as an average (divide the power sum by n) in a similar way.</p>
<p><span id="more-15"></span></p>
<pre class="brush: python;">n=0
mean=0
pwrSumAvg=0
stdDev=0

def update(x):
    n += 1
    mean += (x - mean) / n
    pwrSumAvg += ( x * x - pwrSumAvg) / n
    stdDev = sqrt( (pwrSumAvg * n - n * mean * mean) / (n - 1) )
</pre>
<p>We get this using the running sum formula for standard deviation and essentially simplifying away the sum by removing n:</p>
<p><a href="http://en.wikipedia.org/wiki/Standard_deviation"><img class="aligncenter" src="http://upload.wikimedia.org/math/e/0/2/e0287c48e7a1faa538f87de24c6789c0.png" alt="" width="263" height="70" /></a></p>
<p>In this formula, the core values for the standard deviation is the sum of the squares minus the square of the sum, which we can recreate from the mean and power sum average, respectively:</p>
<p>p: power sum average<br />
m: mean<br />
n: elements</p>
<p><img src='http://l.wordpress.com/latex.php?latex=s%3D%5Csqrt%7B%5Cfrac%7Bpn%5E2+-+%28mn%29%5E2%7D%7Bn%28n-1%29%7D%7D&#038;bg=e6e6e6&#038;fg=000000&#038;s=0' alt='s=\sqrt{\frac{pn^2 - (mn)^2}{n(n-1)}}' title='s=\sqrt{\frac{pn^2 - (mn)^2}{n(n-1)}}' class='latex' /><br />
<img src='http://l.wordpress.com/latex.php?latex=s%3D%5Csqrt%7B%5Cfrac%7Bpn%5E2+-+m%5E2n%5E2%7D%7Bn%28n-1%29%7D%7D&#038;bg=e6e6e6&#038;fg=000000&#038;s=0' alt='s=\sqrt{\frac{pn^2 - m^2n^2}{n(n-1)}}' title='s=\sqrt{\frac{pn^2 - m^2n^2}{n(n-1)}}' class='latex' /><br />
<img src='http://l.wordpress.com/latex.php?latex=s%3D%5Csqrt%7B%5Cfrac%7Bn%5E2%28p+-+nm%5E2%29%7D%7Bn%28n+-+1%29%7D%7D&#038;bg=e6e6e6&#038;fg=000000&#038;s=0' alt='s=\sqrt{\frac{n^2(p - nm^2)}{n(n - 1)}}' title='s=\sqrt{\frac{n^2(p - nm^2)}{n(n - 1)}}' class='latex' /><br />
<img src='http://l.wordpress.com/latex.php?latex=s%3D%5Csqrt%7B%5Cfrac%7Bpn+-+nm%5E2%7D%7Bn+-+1%7D%7D&#038;bg=e6e6e6&#038;fg=000000&#038;s=0' alt='s=\sqrt{\frac{pn - nm^2}{n - 1}}' title='s=\sqrt{\frac{pn - nm^2}{n - 1}}' class='latex' /></p>
<p>Sorry for all the math, but I wanted to make sure I got it right, and that I can prove this later. I&#8217;ve already had to reconstruct it once, so I don&#8217;t want to do it again. Also, I just figured out how to embed <img src='http://l.wordpress.com/latex.php?latex=%5CLaTeX&#038;bg=e6e6e6&#038;fg=000000&#038;s=0' alt='\LaTeX' title='\LaTeX' class='latex' /> equations in WordPress, which is pretty damn cool.</p>
<p>Another neat thing to do with this has to do with manipulating centroids of vectors. That is, dealing with lists of values that correspond to the same datum. For instance, a row of data can be considered an n-dimensional vector, where each value in the row is a value along that dimension of the vector. The traditional data mining definition of a centroid usually deals with only the mean, but I like to refer to centroid clouds when adding in standard deviations, because rather than a point in the n-dimensional space, we refer to a diffuse cloud of probability when adding in the standard deviation value.</p>
<p>To find a centroid of two value vectors, it is a simple matter to apply the formulas above to create the mean and standard deviations of the centroid by applying them to each value along the vector. Combining a centroid with a value works the same way, where the centroid is a vector of mean and power sum averages, and the count appies to the entire centroid.</p>
<p>A neat part comes in when you try to combine two centroids. The equations and code from before become  more symmetrical than when only adding in one value at a time. A given value, <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=e6e6e6&#038;fg=000000&#038;s=0' alt='x' title='x' class='latex' />, can be generalized into a mean of <img src='http://l.wordpress.com/latex.php?latex=m%3Dx&#038;bg=e6e6e6&#038;fg=000000&#038;s=0' alt='m=x' title='m=x' class='latex' /> and a count of <img src='http://l.wordpress.com/latex.php?latex=c%3D1&#038;bg=e6e6e6&#038;fg=000000&#038;s=0' alt='c=1' title='c=1' class='latex' />. By doing this, we can change our code to be:</p>
<pre class="brush: python;">class value:
    _init_(self, x=0):
        self.n=1
        self.mean=x
        self.pwrSumAvg=x*x
        self.stdDev=0

    def merge(self, val):
        self.mean = (val.mean*val.n + self.mean*self.n)
                    / (self.n + val.n)
        pwrSumAvg = ( val.pwrSumAvg * val.n + self.pwrSumAvg * self.n)
                    / (self.n + val.n)
        self.n += val.n
        stdDev = sqrt( (pwrSumAvg * n - n * mean * mean) / (n - 1) )
</pre>
<p>We lose our nice calculation of the mean, but we gain the ability to merge these two centroids together. Note that the calculation for the standard deviation is the same as before, but we change the way the power sum average and the mean are calculated.</p>
<p>Why go through all this trouble? You can now use this for data clustering and also to potentially visualize the centroid clouds with their data scattered through them. In bioinformatics, for example, data is often summarized at many levels. Gene expression probes are grouped into probe sets, and probe sets are often clustered or combined into functional units, or gene sets. Preserving standard deviation at these levels allows for greater clarity into the data.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/subluminal.wordpress.com/15/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/subluminal.wordpress.com/15/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/subluminal.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/subluminal.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/subluminal.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/subluminal.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/subluminal.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/subluminal.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/subluminal.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/subluminal.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/subluminal.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/subluminal.wordpress.com/15/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=15&subd=subluminal&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://subluminal.wordpress.com/2008/07/31/running-standard-deviations/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a8455664b943ebd075df06b2c4954cbc?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jpmccusker</media:title>
		</media:content>

		<media:content url="http://upload.wikimedia.org/math/e/0/2/e0287c48e7a1faa538f87de24c6789c0.png" medium="image" />
	</item>
		<item>
		<title>Data Mining: An Introduction</title>
		<link>http://subluminal.wordpress.com/2008/07/27/data-mining-intro/</link>
		<comments>http://subluminal.wordpress.com/2008/07/27/data-mining-intro/#comments</comments>
		<pubDate>Sun, 27 Jul 2008 16:56:19 +0000</pubDate>
		<dc:creator>jpmccusker</dc:creator>
				<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Science]]></category>
		<category><![CDATA[cross validation]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[feature selection]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[normalization]]></category>

		<guid isPermaLink="false">http://subluminal.wordpress.com/?p=12</guid>
		<description><![CDATA[Data mining is, in the most general terms, an attempt to extract patterns and knowledge from data using various types of software and techniques. Data mining is used to learn and predict. This is applied to biology, neuroscience, fraud detection, national security, and even sports.
Some of these are more successful than others. For instance, text [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=12&subd=subluminal&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Data mining is, in the most general terms, an attempt to extract patterns and knowledge from data using various types of software and techniques. Data mining is used to learn and predict. This is applied to biology, neuroscience, fraud detection, national security, and even sports.</p>
<p>Some of these are more successful than others. For instance, text mining has been very successful at extracting proper nouns (names, places, etc.) from text, and what might be considered the biggest success of data mining comes from text mining: internet search engines. But at the same time, text mining has been less successful at automated text summarization.<span id="more-12"></span></p>
<p>Typically, data mining algorithms are broken into two groups: supervised and unsupervised. A supervised algorithm attempts to learn labels that have been applied to the data via some external means. Supervised algorithms can be used as either classifiers or predictors, based on whether the output of the algorithm (the labels) are either discrete (as in a classifier), or continuous (as in a predictor). An unsupervised algorithm tries to make some sense of the data without the use of labels. There are also other sorts of algorithms that don&#8217;t fit easily into these groups, such as optimizers. An optimizer attempts to optimize the parameters of an algorithm to maximize some sort of score in a multidimensional search space. Evolutionary Algorithms, such as genetic algorithms and genetic programming, along with Monte Carlo simulation and simulated annealing, are among the most common of optimization algorithms.</p>
<p>The inputs used in data mining are usually called features, and outputs are called labels, even if they are continuous values, and even if they aren&#8217;t being trained. An example is a row of data in the data set. Each example is a set of features and possibly labels. A model is the set of data that is the output of the analysis of the training data, and represents the algorithm&#8217;s &#8220;understanding&#8221; (as much as it can be called that) of the training data. A typical processing pipeline used in data mining will look like this:</p>
<ul>
<li><strong>Feature Selection:</strong> Often done by the person running the analysis, it is critical to find features that have some chance of performing the task you set out. You can&#8217;t use a coin toss to predict the winner of a basketball game, no matter how many times you try. It can also be done at a later stage as an automated process called automated feature selection.</li>
<li><strong>Normalization:</strong> There are two kinds of normalization: corrective normalization, which is used to remove measurement artifacts (this is very common in bioinformatics), and algorithmic normalization, which is used to bring the values into a range that the data mining algorithm will understand. A common form of algorithmic normalization is to bring the mean of all values to 0 and the standard deviation to 1, or to bring the minimum to 0 and the maximum to 1. It really depends on the data and the algorithm. In fact, after feature selection, proper normalization is probably the most important factor to successful data mining.</li>
<li><strong>Training:</strong> Training is the creation of the data mining model for that data. In supervised cases, this is by comparing the training data with the associated labels, and attempting to predict those labels from the data. In unsupervised cases, a model is built up based on some sort of similarity of the cases.</li>
<li><strong>Testing:</strong> This usually only occurs in supervised learning. A set of examples are reserved for testing the accuracy of the data mining model. The classifier or predictor is run over the reserved examples, and the outputs are compared to the already-existing labels. The scoring can take a number of forms, which I hope to go over later. The most rigorous form of testing is called cross validation. It is done by randomly partitioning the data into N parts, training a model on N-1 parts, and then testing the model on the remaining part. This is repeated for all N parts, and the scores are combined across the data. The most extreme form of this is where N is the number of examples in the data set, and is called leave-one-out cross validation. Otherwise, it is called N-fold cross validation.</li>
</ul>
<p>A common criticism of data mining has been that it is hypothesis fishing. That is, rather than observing the data, making a hypothesis, and testing the hypothesis, we let the computer examine all possible hypotheses and then pick the highest scoring one. I see data mining more as automated hypothesis generation. It is very difficult to get valid results from data mining without knowing the domain you are applying it to, and ultimately the model that gets created will need to be vetted against unlabeled data in some sort of prospective study. Once that happens, the resulting model can go from being a hypothesis to a theory.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/subluminal.wordpress.com/12/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/subluminal.wordpress.com/12/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/subluminal.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/subluminal.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/subluminal.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/subluminal.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/subluminal.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/subluminal.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/subluminal.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/subluminal.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/subluminal.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/subluminal.wordpress.com/12/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=subluminal.wordpress.com&blog=315531&post=12&subd=subluminal&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://subluminal.wordpress.com/2008/07/27/data-mining-intro/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/a8455664b943ebd075df06b2c4954cbc?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jpmccusker</media:title>
		</media:content>
	</item>
	</channel>
</rss>