<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Concrete Nonsense</title>
	<atom:link href="http://concretenonsense.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://concretenonsense.wordpress.com</link>
	<description>A group blog about mathematics</description>
	<lastBuildDate>Sat, 21 Jan 2012 03:28:52 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='concretenonsense.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Concrete Nonsense</title>
		<link>http://concretenonsense.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://concretenonsense.wordpress.com/osd.xml" title="Concrete Nonsense" />
	<atom:link rel='hub' href='http://concretenonsense.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Some talks</title>
		<link>http://concretenonsense.wordpress.com/2012/01/18/some-talks/</link>
		<comments>http://concretenonsense.wordpress.com/2012/01/18/some-talks/#comments</comments>
		<pubDate>Thu, 19 Jan 2012 04:12:01 +0000</pubDate>
		<dc:creator>Steven Sam</dc:creator>
				<category><![CDATA[Algebraic Geometry]]></category>
		<category><![CDATA[Commutative Algebra]]></category>
		<category><![CDATA[Representation Theory]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1401</guid>
		<description><![CDATA[Unfortunately things have gotten more busy so I haven&#8217;t been writing on the blog. However, I found a new way to prepare talks that I really like: I type out my notes in a way which I hope is readable to someone who has not been to the talk. It is halfway between an expository [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1401&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Unfortunately things have gotten more busy so I haven&#8217;t been writing on the blog. However, I found a new way to prepare talks that I really like: I type out my notes in a way which I hope is readable to someone who has not been to the talk. It is halfway between an expository paper and lecture notes and the advantage is that I have a product which I can share on my webpage (and I find it more useful than slides &#8212; I am starting to dislike slides presentations for seminar talks). Anyway, I&#8217;ll advertise some of these here (I&#8217;ve never actually posted about my own research on this blog, so it&#8217;s a first!):</p>
<p>1. I&#8217;ve been working with Laurent Gruson and Jerzy Weyman on finding geometric interpretations for orbits in &#8220;Vinberg <img src='http://s0.wp.com/latex.php?latex=%5Ctheta&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;theta' title='&#92;theta' class='latex' />-representations&#8221;. I gave a talk on this at Princeton (<a href="http://math.mit.edu/~ssam/writings/talks/bhargavatalk.pdf">notes</a>) and Michigan (<a href="http://math.mit.edu/~ssam/writings/talks/michigan-GIT.pdf">notes</a>). The Princeton talk is more introductory in nature, and even though there is overlap, the two sets of notes should complement one another (for the record, both talks were approximately 1 hour, 45 minutes).</p>
<p>2. A separate project with Weyman involves trying to understand Koszul homology for certain classes of determinantal-like ideals. The motivation comes from trying to classify minimal free resolutions over quadric hypersurface rings and in trying to understand a certain result of Koike and Terada in combinatorial representation theory. I&#8217;m giving a talk on this tomorrow at Michigan (<a href="http://math.mit.edu/~ssam/writings/talks/michigan-koszulhomology.pdf">notes</a>).</p>
<p>It&#8217;s a bit time-consuming, but I think preparing notes like this can be very useful, especially for projects which haven&#8217;t been finished yet (it helps me gain direction). I hope more people try it!</p>
<p>-Steven</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1401/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1401/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/concretenonsense.wordpress.com/1401/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/concretenonsense.wordpress.com/1401/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/concretenonsense.wordpress.com/1401/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/concretenonsense.wordpress.com/1401/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/concretenonsense.wordpress.com/1401/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/concretenonsense.wordpress.com/1401/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/concretenonsense.wordpress.com/1401/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/concretenonsense.wordpress.com/1401/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/concretenonsense.wordpress.com/1401/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/concretenonsense.wordpress.com/1401/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/concretenonsense.wordpress.com/1401/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/concretenonsense.wordpress.com/1401/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1401&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2012/01/18/some-talks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">masnevets</media:title>
		</media:content>
	</item>
		<item>
		<title>(3+1)-avoiding posets</title>
		<link>http://concretenonsense.wordpress.com/2011/12/04/3-plus-1-avoiding-posets/</link>
		<comments>http://concretenonsense.wordpress.com/2011/12/04/3-plus-1-avoiding-posets/#comments</comments>
		<pubDate>Sun, 04 Dec 2011 14:00:53 +0000</pubDate>
		<dc:creator>JBL</dc:creator>
				<category><![CDATA[Combinatorics]]></category>
		<category><![CDATA[graph theory]]></category>
		<category><![CDATA[posets]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1301</guid>
		<description><![CDATA[Yan and I recently put a paper on the arXiv that enumerates the graded (3+1)-avoiding posets.  In this post, I kill the adjective &#8220;graded&#8221; and talk a bit about what (3+1)-avoiding posets are and why they&#8217;re interesting.  If you don&#8217;t know what a poset is, I&#8217;ve included the definition in Note 0 at the bottom of the post. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1301&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Yan and I recently put a paper on the arXiv that <a href="http://arxiv.org/abs/1106.5480">enumerates the graded (<strong>3</strong>+<strong>1</strong>)-avoiding <em></em>posets</a>.  In this post, I kill the adjective &#8220;graded&#8221; and talk a bit about what (<strong>3</strong>+<strong>1</strong>)-avoiding <em></em>posets are and why they&#8217;re interesting.  If you don&#8217;t know what a poset is, I&#8217;ve included the definition in Note 0 at the bottom of the post.</p>
<p>As with any object as general as posets, we are mostly interested not in results about all posets, but rather in finding particular families of posets with interesting or unexpected properties.  One such family of posets are the (<strong>3</strong>+<strong>1</strong>)-avoiding posets.  These are the posets that do not contain four elements, say <em>a</em>, <em>b</em>, <em>c</em>, and <em>d</em>, such that <em>a</em> &lt; <em>b</em> &lt; <em>c</em> and <em>d</em> is incomparable to the other three.  A short digression to explain the name &#8220;(<strong>3</strong>+<strong>1</strong>)-avoiding&#8221;: one natural class of posets are the <em>chains</em>, finite total orders like the first example in the previous paragraph.  A natural name for the chain with <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='n' title='n' class='latex' /> vertices is <strong><em>n</em></strong>, so the chain with three vertices is <strong>3</strong> and the chain with one vertex (the only poset with exactly one vertex) is <strong>1</strong>.  There are several natural operations that we can use to combine posets, including the disjoint union, denoted &#8220;+&#8221;.  Thus, <strong>3</strong>+<strong>1</strong> is the poset that you get if you add a single isolated vertex to a three-element chain, and a poset is (<strong>3</strong>+<strong>1</strong>)-avoiding if it has no four elements that induce the poset <strong>3</strong>+<strong>1</strong>.</p>
<p><span id="more-1301"></span></p>
<p>It&#8217;s a common surprise in combinatorics to find that important objects are characterized by avoiding certain induced subobjects; subposet-avoidance is one particular example of this, and in fact (<strong>3</strong>+<strong>1</strong>)-avoiding posets show up in a number of unexpected places.  The simplest and perhaps nicest appearance is in the characterization of <em>semiorders</em>.  Semiorders are posets that arise in the following way: we have a set of data (real numbers generated by some experiment) such that each datapoint has error bars of the same size.  Thus, if two values <em>a</em> and <em>b</em> are separated by at least a fixed distance then we know their true order; but if they are separated by less than this distance, we can&#8217;t be certain which value is truly larger.  The relations we can be certain of are the relations of our poset.  It&#8217;s not hard to see that this definition is equivalent to the following: a semiorder is a poset whose elements are unit intervals in the real numbers, with one element less than another if and only if it lies entirely to its left.  (Aside: an easy exercise is to show that it doesn&#8217;t matter whether our intervals are open or closed.)  The main result of interest is that semiorders are exactly those posets that avoid <strong>3</strong>+<strong>1</strong> and <strong>2</strong>+<strong>2</strong>.  (If we drop the &#8220;unit&#8221; in &#8220;unit intervals&#8221;, we get just (<strong>2</strong>+<strong>2</strong>)-avoiding posets.)  These posets have all sorts of nice properties; for example, the number of them with <em>n</em> unlabeled elements is exactly the <em>n</em>th Catalan number, so we immediately know we&#8217;re going to get nice combinatorics and lots of connections with other objects.  See <a href="http://en.wikipedia.org/wiki/Semiorder">Wikipedia</a> and the work of Peter Fishburn for much more information about them. <strong></strong></p>
<p>A second set of connections comes from the following simple observation: a poset <em>P</em> avoids <strong>3</strong>+<strong>1</strong> if and only if its <em>incomparability graph</em> (i.e., the graph <em>G</em> on the same vertex set such that <em>u</em> is connected to <em>v</em> in <em>G</em> if and only if <em>u</em> is incomparable to <em>v</em> in <em>P</em>) is <em>claw-free</em>, i.e., contains no four vertices <em>a</em>, <em>b</em>, <em>c</em>, <em>d</em> such that <em>d</em> is connected to <em>a</em>, <em>b</em> and <em>c</em>, none of which are connected to each other.  Claw-free graphs are quite interesting; for example, they make an appearance in some <a href="http://arxiv.org/abs/1105.0698">recent work of Fadnavis</a>, who proved the following pretty result:</p>
<blockquote><p>Suppose that we wish to color a graph <em>G</em> with <em>q</em> colors, choosing the color for each vertex at random.  If <em>G</em> is claw-free then to maximize the chance that the resulting coloring is a proper coloring (i.e., no two adjacent vertices have the same color), we should choose colors uniformly at random (i.e., with equal probabilities 1/<em>q</em>).</p></blockquote>
<p>(Quite surprisingly, this result is not true in general!  To 2-color a star graph with 4 or more points, you&#8217;re better off with a more lop-sided distribution.)  Actually this result is just one part of a bigger story; for example, it&#8217;s also related to the Stanley-Stembridge conjecture, which asserts that symmetric chromatic polynomial of the comparability graph of a (<strong>3</strong>+<strong>1</strong>)-avoiding poset is <em>e</em>-positive.  (For definitions, check the Fadnavis paper, which is really excellent and has a lot of interesting material.)</p>
<p>As an enumerative combinatorialist, all these nice features of (<strong>3</strong>+<strong>1</strong>)-avoiding posets make me want to count them.  And, in some sense one should expect this to be not too difficult: claw-free graphs and (<strong>3</strong>+<strong>1</strong>)-avoiding posets both have nice structural classifications (due to Chudnovsky &amp; Seymour and to Skandera, respectively), and the related (<strong>2</strong>+<strong>2</strong>)-avoiding posets have been enumerated by Bousquet-Mélou, Claesson, Dukes &amp; Kitaev.  But, unfortunately, it seems like none of this is actually directly relevant.  So at least for now, counting (<strong>3</strong>+<strong>1</strong>)-avoiding posets remains very much open.</p>
<p>Notes:</p>
<p>0: One of the fundamental objects of combinatorics is the <em>partially ordered set</em>, or <em>poset</em> for short.<sup>1</sup>  Posets are just what their name suggests: they are given by an order relation (usually denoted <img src='http://s0.wp.com/latex.php?latex=%3C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&lt;' title='&lt;' class='latex' />) that is transitive (i.e., <img src='http://s0.wp.com/latex.php?latex=a+%3C+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a &lt; b' title='a &lt; b' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=b+%3C+c&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='b &lt; c' title='b &lt; c' class='latex' /> means <img src='http://s0.wp.com/latex.php?latex=a+%3C+c&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a &lt; c' title='a &lt; c' class='latex' />) and antisymmetric (i.e., we never have <img src='http://s0.wp.com/latex.php?latex=a%2C+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a, b' title='a, b' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=a+%3C+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a &lt; b' title='a &lt; b' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=b+%3C+a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='b &lt; a' title='b &lt; a' class='latex' />), but it is partial in the sense that not every two elements are necessarily comparable (so we might have <img src='http://s0.wp.com/latex.php?latex=a+%5Cneq+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a &#92;neq b' title='a &#92;neq b' class='latex' /> such that neither <img src='http://s0.wp.com/latex.php?latex=a+%3C+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a &lt; b' title='a &lt; b' class='latex' /> nor <img src='http://s0.wp.com/latex.php?latex=a+%3E+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a &gt; b' title='a &gt; b' class='latex' /> hold).  Obviously posets are a very flexible family of objects, including things like the usual order on the integers <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C+2%2C+%5Cldots%2C+n%5C%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;{1, 2, &#92;ldots, n&#92;}' title='&#92;{1, 2, &#92;ldots, n&#92;}' class='latex' /> (a partial order than happens to be a total order) or the containment order on the subsets of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C+2%2C+%5Cldots%2C+n%5C%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;{1, 2, &#92;ldots, n&#92;}' title='&#92;{1, 2, &#92;ldots, n&#92;}' class='latex' /> (the Boolean lattice).</p>
<p>1: Does anyone know the history of how this came to be?  As far as I know, MacMahon didn&#8217;t do anything with posets, so my assumption is that one can trace it to Rota, but obviously this is not based on anything concrete at all.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1301/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1301/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/concretenonsense.wordpress.com/1301/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/concretenonsense.wordpress.com/1301/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/concretenonsense.wordpress.com/1301/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/concretenonsense.wordpress.com/1301/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/concretenonsense.wordpress.com/1301/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/concretenonsense.wordpress.com/1301/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/concretenonsense.wordpress.com/1301/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/concretenonsense.wordpress.com/1301/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/concretenonsense.wordpress.com/1301/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/concretenonsense.wordpress.com/1301/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/concretenonsense.wordpress.com/1301/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/concretenonsense.wordpress.com/1301/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1301&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/12/04/3-plus-1-avoiding-posets/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jblblog</media:title>
		</media:content>
	</item>
		<item>
		<title>On Being Naive</title>
		<link>http://concretenonsense.wordpress.com/2011/11/23/on-being-naive/</link>
		<comments>http://concretenonsense.wordpress.com/2011/11/23/on-being-naive/#comments</comments>
		<pubDate>Wed, 23 Nov 2011 20:13:59 +0000</pubDate>
		<dc:creator>yanzhang</dc:creator>
				<category><![CDATA[Probability]]></category>
		<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[conditional independence]]></category>
		<category><![CDATA[Jaynes]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[naive bayes]]></category>
		<category><![CDATA[noisy-or]]></category>
		<category><![CDATA[bayesian networks]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1358</guid>
		<description><![CDATA[I gave a talk at the MIT applied math seminar recently. Some people wanted a blog post, so here it is. Some of my mathematical hobbies include probability and machine learning. I have recently realized that many simple but effective ideas of these fields really all come from one thing: an independence assumption. It was [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1358&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I gave a talk at the MIT applied math seminar recently. Some people wanted a blog post, so here it is.</p>
<p>Some of my mathematical hobbies include probability and machine learning. I have recently realized that many simple but effective ideas of these fields really all come from one thing: an independence assumption. It was only until I saw the same example in several different guises, however, before I really caught on. As something Occam would surely approve of, the extreme naiveté this approach embraces can actually go a long way. Our key player is simple: we say that <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='B' title='B' class='latex' /> are <em>conditionally independent</em> given <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='C' title='C' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=P%28A%7CB%2C+C%29+%3D+P%28A+%7C+C%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(A|B, C) = P(A | C)' title='P(A|B, C) = P(A | C)' class='latex' />. This can be written in the more symmetric form <img src='http://s0.wp.com/latex.php?latex=P%28A%2C+B%7CC%29+%3D+P%28A%7CC%29+P%28B%7CC%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(A, B|C) = P(A|C) P(B|C)' title='P(A, B|C) = P(A|C) P(B|C)' class='latex' />. Now I will tell a few stories. Most of these should be old for a specialist, but I hope I&#8217;ve included some remarks that even they may appreciate.</p>
<p><span id="more-1358"></span></p>
<p><strong>Weighing Evidence</strong></p>
<p>(this is mostly <del>stolen from</del><strong></strong> inspired by Jaynes from <em>Probability Theory: the Logic of Science</em>) Suppose we have two hypotheses about the state of the world, say <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7BE%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;overline{E}' title='&#92;overline{E}' class='latex' />, and we know that exactly one of them is true. Now suppose we are getting consecutive pieces of data <img src='http://s0.wp.com/latex.php?latex=D_1%2C+D_2%2C+%5Cldots&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='D_1, D_2, &#92;ldots' title='D_1, D_2, &#92;ldots' class='latex' />.  How does this data update our belief in <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' /> or its complement?</p>
<p>A standard use of Bayes&#8217;s Theorem gives that <img src='http://s0.wp.com/latex.php?latex=P%28D_1%2C+D_2%2C+%5Cldots+%2C+E%29+%3D+P%28E%29+P%28D_1+%7C+E%29P%28D_2+%7C+D_1%2C+E%29+P%28D_3+%7C+D_1%2C+D_2%2C+E%29%5Ccdots&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(D_1, D_2, &#92;ldots , E) = P(E) P(D_1 | E)P(D_2 | D_1, E) P(D_3 | D_1, D_2, E)&#92;cdots' title='P(D_1, D_2, &#92;ldots , E) = P(E) P(D_1 | E)P(D_2 | D_1, E) P(D_3 | D_1, D_2, E)&#92;cdots' class='latex' /> Doing the same for <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7BE%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;overline{E}' title='&#92;overline{E}' class='latex' /> and dividing gives</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdfrac%7BP%28D_1%2C+D_2%2C+%5Cldots%2C+E%29%7D%7BP%28D_1%2C+D_2%2C+%5Cldots%2C+%5Coverline%7BE%7D%29%7D+%3D+%5Cdfrac%7BP%28E%29%7D%7BP%28%5Coverline%7BE%7D%29%7D+%5Cdfrac%7BP%28D_1+%7C+E%29%7D%7BP%28D_1+%7C+%5Coverline%7BE%7D%29%7D+%5Cdfrac%7BP%28D_2+%7C+D_1%2C+E%29%7D%7BP%28D_2+%7C+D_1%2C+%5Coverline%7BE%7D%29%7D%5Ccdots&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;dfrac{P(D_1, D_2, &#92;ldots, E)}{P(D_1, D_2, &#92;ldots, &#92;overline{E})} = &#92;dfrac{P(E)}{P(&#92;overline{E})} &#92;dfrac{P(D_1 | E)}{P(D_1 | &#92;overline{E})} &#92;dfrac{P(D_2 | D_1, E)}{P(D_2 | D_1, &#92;overline{E})}&#92;cdots' title='&#92;dfrac{P(D_1, D_2, &#92;ldots, E)}{P(D_1, D_2, &#92;ldots, &#92;overline{E})} = &#92;dfrac{P(E)}{P(&#92;overline{E})} &#92;dfrac{P(D_1 | E)}{P(D_1 | &#92;overline{E})} &#92;dfrac{P(D_2 | D_1, E)}{P(D_2 | D_1, &#92;overline{E})}&#92;cdots' class='latex' /></p>
<p>Here, let&#8217;s make the naive assumption that the <img src='http://s0.wp.com/latex.php?latex=D_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='D_i' title='D_i' class='latex' />&#8216;s are conditionally independent given <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7BE%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;overline{E}' title='&#92;overline{E}' class='latex' />. Not only are these different, we in fact want the slightly stronger assumption that the <img src='http://s0.wp.com/latex.php?latex=D_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='D_i' title='D_i' class='latex' />&#8216;s are conditionally independent of the intersection of the previous <img src='http://s0.wp.com/latex.php?latex=D_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='D_i' title='D_i' class='latex' />&#8216;s given <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7BE%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;overline{E}' title='&#92;overline{E}' class='latex' />. If so, on the right we just get a product of <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7BP%28D_i+%7C+E%29%7D%7BP%28D_i+%7C+%5Coverline%7BE%7D%29%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;frac{P(D_i | E)}{P(D_i | &#92;overline{E})}' title='&#92;frac{P(D_i | E)}{P(D_i | &#92;overline{E})}' class='latex' />. We want to take logs here and rewrite our equation as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=O%28E+%7C+D_1%2C+D_2%2C+%5Cldots%29+%3D+O%28E%29+%2B+%5Csum_i+%5Clog%28%5Cfrac%7BP%28D_i+%7C+E%29%7D%7BP%28D_i+%7C+%5Coverline%7BE%7D%29%7D%29%2C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(E | D_1, D_2, &#92;ldots) = O(E) + &#92;sum_i &#92;log(&#92;frac{P(D_i | E)}{P(D_i | &#92;overline{E})}),' title='O(E | D_1, D_2, &#92;ldots) = O(E) + &#92;sum_i &#92;log(&#92;frac{P(D_i | E)}{P(D_i | &#92;overline{E})}),' class='latex' /></p>
<p>where <img src='http://s0.wp.com/latex.php?latex=O%28E%7CD%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(E|D)' title='O(E|D)' class='latex' /> is the <em>odds</em> <img src='http://s0.wp.com/latex.php?latex=%5Clog%28%5Cfrac%7BP%28E%7CD%29%7D%7BP%28%5Coverline%7BE%7D%7CD%29%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;log(&#92;frac{P(E|D)}{P(&#92;overline{E}|D)})' title='&#92;log(&#92;frac{P(E|D)}{P(&#92;overline{E}|D)})' class='latex' />. We now make the natural guess we are in the world <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' /> exactly when this is positive (which corresponds to the <img src='http://s0.wp.com/latex.php?latex=P%28E%7CD_1%2C+%5Cldots%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(E|D_1, &#92;ldots)' title='P(E|D_1, &#92;ldots)' class='latex' /> having higher probability).</p>
<p>The cute thing about this situation is that we are really &#8220;weighing&#8221; our evidence, as on a balance! Each new piece of data just contributes a number, and we mentally keep a tally and just decide yes or no given the sign of the final sum. Our original &#8220;bias&#8221; is exactly the odds of <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' /> given no other information, which exactly corresponds to the Bayesian information contained in our prior knowledge of <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' />, so the entire prior information comes into play as a &#8220;head start&#8221; bias in one direction, <em>as if it were a single instance of data with some weight</em>! This is a very clean way to make decisions, and really makes binary hypothesis testing very intuitive (it is fun/frustrating to try to generalize this to more than <img src='http://s0.wp.com/latex.php?latex=2&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='2' title='2' class='latex' /> hypotheses, where there are quite a few unexpected pitfalls). The key, however, was our conditional independence assumption.</p>
<p><strong>Naive Bayes</strong></p>
<p>A very naive approach to spam filtering is the following (generative) model. Let nature choose whether a message is spam (call this hypothesis <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='H' title='H' class='latex' />) with some probability <img src='http://s0.wp.com/latex.php?latex=P%28H%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(H)' title='P(H)' class='latex' /> and then, for each word in the dictionary, pick whether each word <img src='http://s0.wp.com/latex.php?latex=w_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='w_i' title='w_i' class='latex' /> is in the message (abuse notation and call this the event <img src='http://s0.wp.com/latex.php?latex=w_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='w_i' title='w_i' class='latex' />) with probability <img src='http://s0.wp.com/latex.php?latex=P%28w_i+%7C+H%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(w_i | H)' title='P(w_i | H)' class='latex' /> (or <img src='http://s0.wp.com/latex.php?latex=P%28w_i%7C%5Coverline%7BH%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(w_i|&#92;overline{H})' title='P(w_i|&#92;overline{H})' class='latex' />). You then pick maximum likelihood over sample data to learn the probabilities, and on a new piece of data just get the <img src='http://s0.wp.com/latex.php?latex=w_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='w_i' title='w_i' class='latex' />&#8216;s and see whether <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='H' title='H' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7BH%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;overline{H}' title='&#92;overline{H}' class='latex' /> was more likely.</p>
<p>So what was the naive assumption? It was that the events <img src='http://s0.wp.com/latex.php?latex=w_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='w_i' title='w_i' class='latex' /> were conditionally independent given either hypothesis. Cutely, the final log odds <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7BP%28H%7CD%29%7D%7BP%28%5Coverline%7BH%7D%7CD%29%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;frac{P(H|D)}{P(&#92;overline{H}|D)}' title='&#92;frac{P(H|D)}{P(&#92;overline{H}|D)}' class='latex' /> is additive because of this (an exercise I leave to the reader) and extremely easy to calculate (and fast for computers, too!). The punchline is that this additivity is exactly the same additivity we had in the hypothesis testing. The expressions work out such that the inclusion/exclusion of every word will add some positive/negative number to a running total, biased in the beginning by some initial value determined by <img src='http://s0.wp.com/latex.php?latex=P%28H%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(H)' title='P(H)' class='latex' /> with no other information, and we choose to classify a message as spam based on whether this running total is positive or not.</p>
<p>Before continuing, I want to put in e-print my long-running gripe that &#8220;Naive Bayes&#8221; is a misnomer, since the only thing naive I can possibly think about doing with Bayes&#8217;s Theorem is conditional independence, and so every such algorithm should be called &#8220;naive.&#8221; The name offers no information about the particular algorithm that is associated with spam filtering, and the graphical network it corresponds to is not even the unique simplest graph (you can reverse all the arrows, for instance, and get Noisy-Or or any other ICI, but I admit those have another layer of complexity that I am not going into here). Unsurprisingly, there are actually many flavors of Naive Bayes depending on where you want to insert your naivete &#8212; see Metsis, Androutsopoulos and Paliouras, <em>Spam Filtering with Naive Bayes – Which Naive Bayes?</em><br />
<strong></strong></p>
<p><strong>The Unreasonable Effectiveness of Naive Bayes</strong></p>
<p>What&#8217;s the yoga here? Here&#8217;s my take: the conditional independence became a multiplicative condition and thus an additive condition, so the convenience of independence corresponds to the convenience of linearity. Thus, the hyperbolic punchline of this post is that &#8220;independence is linearity.&#8221;</p>
<p>I see a strange phenomenon (at least among pure mathematicians casually talking about applied math; I&#8217;m sure applied mathematicians have a better intuition) that people are very comfortable accepting linear approximations but not as comfortable accepting independence, whereas at least in my very simple setup they are <em>exactly </em>the same. I will audaciously extend my analogy to say that this intuition is inconsistent, and I don&#8217;t know why people seem to be completely fine with logistic regression (which really just says the log-odds is additive and is thus a third story equivalent to the two stories I&#8217;ve told in this blog post!) while careful about making disclaimers about Naive Bayes.</p>
<p>In fact, Naive Bayes, contrary to popular opinion, is actually also very good (and provably optimal, with certain definitions of optimal) when events are <em>very </em>dependent! It is only in the middle regions where it suffers, and it really doesn&#8217;t suffer by much. We also overestimate its problems because we like to think in terms of <img src='http://s0.wp.com/latex.php?latex=L%5E2&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='L^2' title='L^2' class='latex' /> but errors in classifying are frequently done under zero-one loss (this is a really interesting nuance that I would love to talk about some other time, but this post has gotten long enough). For a more in-depth look, see Domingos and Pazzani, <em>On the Optimality of the Simple Bayesian Classifier</em><em> under Zero-One Loss</em>.</p>
<p><strong>Appendix: Noisy-Or and Bayesian Networks<br />
</strong></p>
<p>When we talk about conditional independence, we really should take the setup of Bayesian networks, which gives a natural excuse to introduce Naive Bayes&#8217; much lesser well-known sister, Noisy-Or (that often does better than Naive Bayes!). I spent some time in my talk going over the basics of d-separation, Markov blankets, etc.. However, I realized that I had no real interesting observations so I won&#8217;t talk too much about it in blog format, where the reader is very close to Google and smarter people who know much more than I do. I did have one silly &#8220;original&#8221; contribution, however, so I share it here.</p>
<p>Here is an example that I thought was surprisingly clean and possibly helpful for someone interested in the basics of Bayesian networks: consider the events A(AC), B(Battery), and C(Computer), corresponding to whether the corresponding electronic gizmo is on or off (with the computer connected to both the AC and the battery). This corresponds to a Bayesian network with two edges <img src='http://s0.wp.com/latex.php?latex=A+%5Crightarrow+C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='A &#92;rightarrow C' title='A &#92;rightarrow C' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B+%5Crightarrow+C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='B &#92;rightarrow C' title='B &#92;rightarrow C' class='latex' />.</p>
<p>It is obvious that <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='B' title='B' class='latex' /> are independent until <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='C' title='C' class='latex' /> is observed, which makes them conditionally dependent; if you know the Computer is on or off, then the other two power sources&#8217; integrities are coupled. Otherwise, your blissful ignorance gives you no information. This is starkly different from <em>every other </em>orientation (3 possible) of the edges, where <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='B' title='B' class='latex' /> are dependent but <em>conditionally independent</em> given <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='C' title='C' class='latex' />! This quirkiness makes the weird d-separation criterion necessary, and I thought this example very mnemonically convenient for marking the &#8220;bad&#8221; edge-orientation.</p>
<p>-Yan</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1358/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1358/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/concretenonsense.wordpress.com/1358/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/concretenonsense.wordpress.com/1358/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/concretenonsense.wordpress.com/1358/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/concretenonsense.wordpress.com/1358/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/concretenonsense.wordpress.com/1358/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/concretenonsense.wordpress.com/1358/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/concretenonsense.wordpress.com/1358/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/concretenonsense.wordpress.com/1358/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/concretenonsense.wordpress.com/1358/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/concretenonsense.wordpress.com/1358/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/concretenonsense.wordpress.com/1358/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/concretenonsense.wordpress.com/1358/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1358&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/11/23/on-being-naive/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">KR</media:title>
		</media:content>
	</item>
		<item>
		<title>Positivity, Dynamics, Knots</title>
		<link>http://concretenonsense.wordpress.com/2011/08/20/positivity-dynamics-knots/</link>
		<comments>http://concretenonsense.wordpress.com/2011/08/20/positivity-dynamics-knots/#comments</comments>
		<pubDate>Sat, 20 Aug 2011 20:50:18 +0000</pubDate>
		<dc:creator>lewallen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1331</guid>
		<description><![CDATA[Some of my research last year had to do with notions of &#8220;positivity&#8221; in Heegaard Floer homology and knot theory (a phrase which I&#8217;m borrowing from a paper of Matt Hedden). As a simple example of positivity, a braid (and more generally, a knot) is called positive if it has a diagram containing only positive [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1331&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p style="text-align:left;"><strong></strong>Some of my research last year had to do with notions of &#8220;positivity&#8221; in Heegaard Floer homology and knot theory (a phrase which I&#8217;m borrowing from a paper of Matt Hedden). As a simple example of positivity, a braid (and more generally, a knot) is called positive if it has a diagram containing only positive crossings:</p>
<p><a href="http://concretenonsense.files.wordpress.com/2011/08/image1pos.jpg"><img class="size-medium wp-image-1335 aligncenter" title="Image1Pos" src="http://concretenonsense.files.wordpress.com/2011/08/image1pos.jpg?w=300&#038;h=79" alt="" width="300" height="79" /></a><span id="more-1331"></span></p>
<p>(Positive knots become negative knots if we switch either our nomenclature or the orientation on <img src='http://s0.wp.com/latex.php?latex=%7BS%5E%7B3%7D%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S^{3}}' title='{S^{3}}' class='latex' />, so there are <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathbb%7BZ%7D+%2F2%5Cmathbb%7BZ%7D%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mathbb{Z} /2&#92;mathbb{Z}}' title='{&#92;mathbb{Z} /2&#92;mathbb{Z}}' class='latex' /> confusions lurking everywhere in this business. Also, note that although we needed on orientation on our knot to define the sign of a crossing, the sign is actually independent of this orientation, and only depends on the embdedding of the knot in <img src='http://s0.wp.com/latex.php?latex=%7BS%5E3%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S^3}' title='{S^3}' class='latex' />).</p>
<p>A rather amazing theorem is that every positive braid is fibered. What this means is that if we take our positive braid and close it up into a knot <img src='http://s0.wp.com/latex.php?latex=%7BK%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{K}' title='{K}' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7BS%5E%7B3%7D%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S^{3}}' title='{S^{3}}' class='latex' />, then the &#8220;knot complement&#8221; <img src='http://s0.wp.com/latex.php?latex=%7BS%5E%7B3%7D%5Cbackslash+K%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S^{3}&#92;backslash K}' title='{S^{3}&#92;backslash K}' class='latex' />, which we can think of as an open 3-manifold, actually fibers over the circle, with fiber a punctured surface. (Alternatively, if we remove a small tubular neighborhood of the knot, we can think of <img src='http://s0.wp.com/latex.php?latex=S%5E3+%5Cbackslash+K&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='S^3 &#92;backslash K' title='S^3 &#92;backslash K' class='latex' /> as a compact 3-manifold with torus boundary). Here&#8217;s how to close the braid:</p>
<p><a href="http://concretenonsense.files.wordpress.com/2011/08/image2close.jpg"><img class="size-medium wp-image-1336 aligncenter" title="Image2Close" src="http://concretenonsense.files.wordpress.com/2011/08/image2close.jpg?w=300&#038;h=175" alt="" width="300" height="175" /></a></p>
<p>In other words, there is a punctured surface <img src='http://s0.wp.com/latex.php?latex=%7B%5CSigma%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;Sigma}' title='{&#92;Sigma}' class='latex' />, and a map <img src='http://s0.wp.com/latex.php?latex=%7Bf%3A%5CSigma+%5Crightarrow+%5CSigma%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f:&#92;Sigma &#92;rightarrow &#92;Sigma}' title='{f:&#92;Sigma &#92;rightarrow &#92;Sigma}' class='latex' />, fixing the puncture, so that the knot complement is the mapping torus for <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f}' title='{f}' class='latex' />, as pictured below. Such fibered 3-manifolds are very special.</p>
<p><a href="http://concretenonsense.files.wordpress.com/2011/08/image3torus.jpg"><img class="alignnone size-medium wp-image-1337 aligncenter" title="Image3Torus" src="http://concretenonsense.files.wordpress.com/2011/08/image3torus.jpg?w=300&#038;h=135" alt="" width="300" height="135" /></a>Note that it is precisely because <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f}' title='{f}' class='latex' /> fixes the puncture that the line <img src='http://s0.wp.com/latex.php?latex=%7BK%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{K}' title='{K}' class='latex' /> above closes up and becomes a knot. I should say, the proof that positive braids are fibered uses an even more amazing theorem of Stallings, which characterizes fibered knot complements in terms of a simple algebraic property of their fundamental group. This particular notion of positivity is one that appears in Matt&#8217;s paper. I recently read some work of Etienne Ghys talking about a related notion, and I thought it was so cool that I had to post about it.</p>
<p>Here&#8217;s the theorem, which Ghys attributes to Freed, Schwarzman, and Sullivan. Let <img src='http://s0.wp.com/latex.php?latex=%7BM%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{M}' title='{M}' class='latex' /> be a compact manifold with a non-vanishing, smooth vector field <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' />. First for the background: suppose there is a closed surface <img src='http://s0.wp.com/latex.php?latex=%7B%5CSigma+%5Csubset+M%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;Sigma &#92;subset M}' title='{&#92;Sigma &#92;subset M}' class='latex' /> which is transverse to the flow of <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> and meets the forward orbit of every point <img src='http://s0.wp.com/latex.php?latex=%7Bm%5Cin+M%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{m&#92;in M}' title='{m&#92;in M}' class='latex' /> at least once (therefore, it meets every orbit infinitely many times). We get a first return map <img src='http://s0.wp.com/latex.php?latex=%7Bf%3A%5CSigma+%5Crightarrow+%5CSigma%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f:&#92;Sigma &#92;rightarrow &#92;Sigma}' title='{f:&#92;Sigma &#92;rightarrow &#92;Sigma}' class='latex' />, simply by taking <img src='http://s0.wp.com/latex.php?latex=%7Bm%5Cin+%5CSigma%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{m&#92;in &#92;Sigma}' title='{m&#92;in &#92;Sigma}' class='latex' /> and flowing it forward until it hits <img src='http://s0.wp.com/latex.php?latex=%7B%5CSigma%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;Sigma}' title='{&#92;Sigma}' class='latex' /> again, say at <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{n}' title='{n}' class='latex' />, and defining <img src='http://s0.wp.com/latex.php?latex=%7Bf%28m%29%3Dn%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f(m)=n}' title='{f(m)=n}' class='latex' />. Then it&#8217;s not hard to check that <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> must be the mapping torus for <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f}' title='{f}' class='latex' />, as before, and that <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> is just the natural vector field pointing along the &#8220;time&#8221; direction of the mapping torus (up to scaling), as depicted below:</p>
<p><a href="http://concretenonsense.files.wordpress.com/2011/08/image4vfield.jpg"><img class="size-full wp-image-1338 aligncenter" title="Image4VField" src="http://concretenonsense.files.wordpress.com/2011/08/image4vfield.jpg?w=500" alt=""   /></a></p>
<p>In this case, <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> is called the suspension of <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f}' title='{f}' class='latex' />. The question addressed by the theorem is: given a non-vanishing vector field <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=%7BM%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{M}' title='{M}' class='latex' />, when is it the suspension of a map <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f}' title='{f}' class='latex' />? Note that if we have such a suspension, and therefore a fibration over <img src='http://s0.wp.com/latex.php?latex=%7BS%5E1%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S^1}' title='{S^1}' class='latex' />, we can pull back the form <img src='http://s0.wp.com/latex.php?latex=%7Bd%5Ctheta%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{d&#92;theta}' title='{d&#92;theta}' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=%7BS%5E1%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S^1}' title='{S^1}' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7B%5COmega%5E1%28M%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;Omega^1(M)}' title='{&#92;Omega^1(M)}' class='latex' /> to get a closed, NON-VANISHING 1-form which is positive on <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' />. It&#8217;s not so hard to figure out that having such a form is equivalent to being a suspension. The really cool theorem is an apparently much weaker condition which is also sufficient.</p>
<p>The key object is the set <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal%7BP%7D_X%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mathcal{P}_X}' title='{&#92;mathcal{P}_X}' class='latex' /> of probability measures on <img src='http://s0.wp.com/latex.php?latex=%7BM%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{M}' title='{M}' class='latex' /> which are invariant with respect to <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' />. Given any <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmu+%5Cin+%5Cmathcal%7BP%7D_X%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mu &#92;in &#92;mathcal{P}_X}' title='{&#92;mu &#92;in &#92;mathcal{P}_X}' class='latex' />, and any 1-form <img src='http://s0.wp.com/latex.php?latex=%7B%5Comega+%5Cin+%5COmega%5E%7B1%7D%28M%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;omega &#92;in &#92;Omega^{1}(M)}' title='{&#92;omega &#92;in &#92;Omega^{1}(M)}' class='latex' />, we can get a number by integrating:</p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cint_M+%5Comega%28X%29+d%5Cmu&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='&#92;displaystyle &#92;int_M &#92;omega(X) d&#92;mu' title='&#92;displaystyle &#92;int_M &#92;omega(X) d&#92;mu' class='latex' /></p>
<p>This associates to any <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmu%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mu}' title='{&#92;mu}' class='latex' /> a 1-chain, i.e. something dual to a 1-form. If <img src='http://s0.wp.com/latex.php?latex=%7B%5Comega%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;omega}' title='{&#92;omega}' class='latex' /> is exact, the above integral can be shown to be 0, using the <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' />-invariance of <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmu%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mu}' title='{&#92;mu}' class='latex' /> (use the invariance to rewrite the integrand, when <img src='http://s0.wp.com/latex.php?latex=%7B%5Comega%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;omega}' title='{&#92;omega}' class='latex' /> is an exact form, as a total differential). Therefore, we obtain a map</p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+S%3A+%5Cmathcal%7BP%7D_X+%5Crightarrow+H_1%28M%3B%5Cmathbb%7BR%7D%29&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='&#92;displaystyle S: &#92;mathcal{P}_X &#92;rightarrow H_1(M;&#92;mathbb{R})' title='&#92;displaystyle S: &#92;mathcal{P}_X &#92;rightarrow H_1(M;&#92;mathbb{R})' class='latex' /></p>
<p>whose image <img src='http://s0.wp.com/latex.php?latex=%7BS%28%5Cmathcal%7BP%7D_X%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S(&#92;mathcal{P}_X)}' title='{S(&#92;mathcal{P}_X)}' class='latex' /> is in fact compact and convex. Now, if <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> was a suspension, then <img src='http://s0.wp.com/latex.php?latex=%7BS%28%5Cmathcal%7BP%7D_X%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S(&#92;mathcal{P}_X)}' title='{S(&#92;mathcal{P}_X)}' class='latex' /> actually lies entirely in some positive half-space of <img src='http://s0.wp.com/latex.php?latex=H_1%28M%2C%5Cmathbb%7BR%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='H_1(M,&#92;mathbb{R})' title='H_1(M,&#92;mathbb{R})' class='latex' />. Why? Well, remember that in this case we have a closed non-vanishing one-form which is positive on <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' />, and by pairing with this form we get a map <img src='http://s0.wp.com/latex.php?latex=%7BH_1%28M%3B%5Cmathbb%7BR%7D%29%5Crightarrow+%5Cmathbb%7BR%7D%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{H_1(M;&#92;mathbb{R})&#92;rightarrow &#92;mathbb{R}}' title='{H_1(M;&#92;mathbb{R})&#92;rightarrow &#92;mathbb{R}}' class='latex' /> which is positive on <img src='http://s0.wp.com/latex.php?latex=%7BS%28%5Cmathcal%7BP%7D_X%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S(&#92;mathcal{P}_X)}' title='{S(&#92;mathcal{P}_X)}' class='latex' />. Therefore <img src='http://s0.wp.com/latex.php?latex=%7BS%28%5Cmathcal%7BP%7D_X%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S(&#92;mathcal{P}_X)}' title='{S(&#92;mathcal{P}_X)}' class='latex' /> lies in a positive half-space of <img src='http://s0.wp.com/latex.php?latex=%7BH_1%28M%3B%5Cmathbb%7BR%7D%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{H_1(M;&#92;mathbb{R})}' title='{H_1(M;&#92;mathbb{R})}' class='latex' />. The rad theorem of FSS is that this is actually sufficient:</p>
<p>Theorem: <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> is a suspension if and only if <img src='http://s0.wp.com/latex.php?latex=%7BS%28%5Cmathcal%7BP%7D_X%29%5Csubset+H_1%28M%3B%5Cmathbb%7BR%7D%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S(&#92;mathcal{P}_X)&#92;subset H_1(M;&#92;mathbb{R})}' title='{S(&#92;mathcal{P}_X)&#92;subset H_1(M;&#92;mathbb{R})}' class='latex' /> is contained in some halfspace.</p>
<p>One remark about the measures <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal%7BP%7D_X%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mathcal{P}_X}' title='{&#92;mathcal{P}_X}' class='latex' />: note that if we have a closed periodic orbit <img src='http://s0.wp.com/latex.php?latex=O&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O' title='O' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='X' title='X' class='latex' />, i.e. some closed loop which integrates the flow, then we get a natural set of measures which are concentrated near <img src='http://s0.wp.com/latex.php?latex=O&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O' title='O' class='latex' />. In this sense, <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal%7BP%7D_X%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mathcal{P}_X}' title='{&#92;mathcal{P}_X}' class='latex' /> should be thought of as a set of generalized periodic orbits. The measures associated to actual periodic orbits just get sent by <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='S' title='S' class='latex' /> to the class represented by these orbits in <img src='http://s0.wp.com/latex.php?latex=H_1%28M%3B%5Cmathbb%7BR%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='H_1(M;&#92;mathbb{R})' title='H_1(M;&#92;mathbb{R})' class='latex' />.</p>
<p>One way to think about the theorem, and this is how one of the proofs goes, is that just from this positive subset of homology, we can do some fancy functional analysis to create from this an actual dual form, not just a cohomology class, with the right non-vanishing and positivity properties. So somehow, we&#8217;re free to work just in homology without losing information, which seems very appealing.</p>
<p>I&#8217;m not yet sure exactly what this theorem useful for, but it&#8217;s so neat. In a soon to come follow up, I will talk more about positivity on the knot side, and bleg about a concrete question that I&#8217;d love to get everyone&#8217;s opinion on.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/concretenonsense.wordpress.com/1331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/concretenonsense.wordpress.com/1331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/concretenonsense.wordpress.com/1331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/concretenonsense.wordpress.com/1331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/concretenonsense.wordpress.com/1331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/concretenonsense.wordpress.com/1331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/concretenonsense.wordpress.com/1331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/concretenonsense.wordpress.com/1331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/concretenonsense.wordpress.com/1331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/concretenonsense.wordpress.com/1331/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/concretenonsense.wordpress.com/1331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/concretenonsense.wordpress.com/1331/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1331&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/08/20/positivity-dynamics-knots/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">lewallen</media:title>
		</media:content>

		<media:content url="http://concretenonsense.files.wordpress.com/2011/08/image1pos.jpg?w=300" medium="image">
			<media:title type="html">Image1Pos</media:title>
		</media:content>

		<media:content url="http://concretenonsense.files.wordpress.com/2011/08/image2close.jpg?w=300" medium="image">
			<media:title type="html">Image2Close</media:title>
		</media:content>

		<media:content url="http://concretenonsense.files.wordpress.com/2011/08/image3torus.jpg?w=300" medium="image">
			<media:title type="html">Image3Torus</media:title>
		</media:content>

		<media:content url="http://concretenonsense.files.wordpress.com/2011/08/image4vfield.jpg" medium="image">
			<media:title type="html">Image4VField</media:title>
		</media:content>
	</item>
		<item>
		<title>On Physical Units</title>
		<link>http://concretenonsense.wordpress.com/2011/08/03/on-physical-units/</link>
		<comments>http://concretenonsense.wordpress.com/2011/08/03/on-physical-units/#comments</comments>
		<pubDate>Thu, 04 Aug 2011 01:43:13 +0000</pubDate>
		<dc:creator>yanzhang</dc:creator>
				<category><![CDATA[Analysis]]></category>
		<category><![CDATA[Physics]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[physics]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1316</guid>
		<description><![CDATA[I&#8217;ve been learning some physics from Allan Adams and some physics students here at MIT, and I&#8217;ve suddenly realized that there are a lot of contextual tricks I take for granted when I&#8217;m doing physics, which make the physicists&#8217; lives easier and sometimes really irritates the mathematicians. The first two examples I can think of [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1316&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been learning some physics from Allan Adams and some physics students here at MIT, and I&#8217;ve suddenly realized that there are a lot of contextual tricks I take for granted when I&#8217;m doing physics, which make the physicists&#8217; lives easier and sometimes really irritates the mathematicians. The first two examples I can think of is differentiating under the integral and never checking convergence, though both of these really fall under the bigger umbrella of assuming everything is well-behaved (which probably accounts for 80% of the mathematical gripes I&#8217;ve seen against physicists). Now, I&#8217;m a happy supporter of this &#8220;wishful thinking&#8221; practice: to use a programming analogy, I think of this habit as the <a href="http://en.wikipedia.org/wiki/Lazy_evaluation">lazy evaluation</a> version of having good definitions, and as a lover of Python generators I totally appreciate the idea of saying &#8220;we&#8217;ll figure out the right definitions later since they actually exist.&#8221;</p>
<p>One of the most common physical tricks, however, is not of this category. It is the curiously natural framework: &#8220;we have a consistent idea of units.&#8221; Here&#8217;s a perfectly sound argument to get something that is not entirely obvious:<span id="more-1316"></span></p>
<p>Take the integral <img src='http://s0.wp.com/latex.php?latex=%5Cint_%7B-%5Cinfty%7D%5E%7B%5Cinfty%7D+e%5E%7B-ax%5E2%7D+dx&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;int_{-&#92;infty}^{&#92;infty} e^{-ax^2} dx' title='&#92;int_{-&#92;infty}^{&#92;infty} e^{-ax^2} dx' class='latex' />. There is a way to get some information about it without doing the real integral:</p>
<p style="padding-left:30px;">Do the substitution <img src='http://s0.wp.com/latex.php?latex=y+%3D+%5Csqrt%7Ba%7D+x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='y = &#92;sqrt{a} x' title='y = &#92;sqrt{a} x' class='latex' />. Then the integral becomes <img src='http://s0.wp.com/latex.php?latex=%5Cint+e%5E%7B-y%5E2%7D+%281%2F%5Csqrt%7Ba%7D%29+dy+%3D+%281%2F%5Csqrt%7Ba%7D%29+%5Cint+e%5E%7B-y%5E2%7D+dy+%3D+C+a%5E%7B-1%2F2%7D.&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;int e^{-y^2} (1/&#92;sqrt{a}) dy = (1/&#92;sqrt{a}) &#92;int e^{-y^2} dy = C a^{-1/2}.' title='&#92;int e^{-y^2} (1/&#92;sqrt{a}) dy = (1/&#92;sqrt{a}) &#92;int e^{-y^2} dy = C a^{-1/2}.' class='latex' /></p>
<p>Using a slightly physical language: f we don&#8217;t care about the actual constant, just the &#8220;order&#8221; of <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a' title='a' class='latex' /> (though it is a similar concept, we&#8217;re not exactly doing the order of growth of <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a' title='a' class='latex' />), we can deduce that the answer is in the &#8220;units&#8221; of <img src='http://s0.wp.com/latex.php?latex=a%5E%7B-1%2F2%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a^{-1/2}' title='a^{-1/2}' class='latex' /> (the complete answer is <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B%5Cpi+%2F+a%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;sqrt{&#92;pi / a}' title='&#92;sqrt{&#92;pi / a}' class='latex' />), by &#8220;isolating&#8221; the part of the integral with dependence on <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a' title='a' class='latex' />.</p>
<p>Even though this is already somewhat trick-sy, it is not quite as far as what a physicist would do. They would (confirmed by experience!) look at this and say something like:</p>
<p style="padding-left:30px;">&#8220;Let <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x' title='x' class='latex' /> have the units of <img src='http://s0.wp.com/latex.php?latex=%5B%5Ctext%7Blength%7D%5D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='[&#92;text{length}]' title='[&#92;text{length}]' class='latex' />. The exponential must be unit-less otherwise it doesn&#8217;t have a well-defined unit, which means <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a' title='a' class='latex' /> must have units of <img src='http://s0.wp.com/latex.php?latex=%5B%5Ctext%7Blength%7D%5E%7B-2%7D%5D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='[&#92;text{length}^{-2}]' title='[&#92;text{length}^{-2}]' class='latex' /> and the integrand itself must then be unit-less. When we integrate, we then pick up a single unit of <img src='http://s0.wp.com/latex.php?latex=%5B%5Ctext%7Blength%7D%5D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='[&#92;text{length}]' title='[&#92;text{length}]' class='latex' /> in terms of <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a' title='a' class='latex' />, so it must be <img src='http://s0.wp.com/latex.php?latex=C+a%5E%7B-1%2F2%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='C a^{-1/2}' title='C a^{-1/2}' class='latex' /> for some constant <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='C' title='C' class='latex' />.&#8221;</p>
<p>The problem is this makes perfect &#8220;sense&#8221; to me in a completely sound way (there is no approximation or heuristic here), yet I cannot argue it to my satisfaction in any mathematical matter. All I know is that most people with even elementary physics experience have picked up a very consistent language of &#8220;units&#8221; that we can use to make definite deductions, but I&#8217;m finding it hard to axiomatize them in a clear way. After trying for about half an hour, the only thing I&#8217;ve decided is that we really want some sort of valuation on a space of functions that is multiplicative, which I believe is enough to make the differentiation and integration instincts about units work, and that we limit all addition to be done with functions of homogeneous valuation. However, is that really it (for example, I don&#8217;t feel this is all that has gone into the logic above)? If so, what is the right way to formalize it? Also, I distinctly remember having seen usage of units to argue more sophisticated chains of logic than the example I&#8217;ve given here, though the exact examples don&#8217;t come to mind. If anyone has further insight and examples it would be really helpful.</p>
<p><strong>Update</strong>: after an unnecessarily long discussion w/ Qiaochu (the source of the un-necessity being my muddled thinking about something irrelevant), I now agree the formalism is &#8220;easy&#8221; and can be done in several ways (though I still find the intuition to be a clearer way to think than the formalism). The method that seems most natural to me is to just think of all functions we care about as lying in a graded algebra with grades indexed by powers of units; Qiaochu prefers to think of the &#8220;physical&#8221; attributes as living in one-dimensional representations / weight spaces. Pick whatever you like. My request for more &#8220;interesting&#8221; examples of using units still holds.</p>
<p>-Yan</p>
<p>(thanks to Allan, Yoni, and Josh for teaching me physics)</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1316/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1316/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/concretenonsense.wordpress.com/1316/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/concretenonsense.wordpress.com/1316/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/concretenonsense.wordpress.com/1316/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/concretenonsense.wordpress.com/1316/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/concretenonsense.wordpress.com/1316/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/concretenonsense.wordpress.com/1316/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/concretenonsense.wordpress.com/1316/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/concretenonsense.wordpress.com/1316/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/concretenonsense.wordpress.com/1316/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/concretenonsense.wordpress.com/1316/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/concretenonsense.wordpress.com/1316/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/concretenonsense.wordpress.com/1316/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1316&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/08/03/on-physical-units/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">KR</media:title>
		</media:content>
	</item>
		<item>
		<title>Complexity Classes: P, NP, co-NP, PSPACE</title>
		<link>http://concretenonsense.wordpress.com/2011/07/21/complexity-classes-p-np-co-np-pspace/</link>
		<comments>http://concretenonsense.wordpress.com/2011/07/21/complexity-classes-p-np-co-np-pspace/#comments</comments>
		<pubDate>Thu, 21 Jul 2011 08:02:19 +0000</pubDate>
		<dc:creator>Alan Guo</dc:creator>
				<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[complexity class]]></category>
		<category><![CDATA[np-complete]]></category>
		<category><![CDATA[p vs. np]]></category>
		<category><![CDATA[pspace]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1274</guid>
		<description><![CDATA[One of the seven Millennium Prize Problems stated by the Clay Mathematics Institute is the P vs. NP problem, which asks: is P = NP? What are P and NP exactly? In this post, I&#8217;ll go over some basic computational complexity classes: P, NP, co-NP, PSPACE, and known relationships between them, as well as intuition [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1274&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>One of the seven Millennium Prize Problems stated by the Clay Mathematics Institute is the P vs. NP problem, which asks: is P = NP? What are P and NP exactly? In this post, I&#8217;ll go over some basic computational complexity classes: P, NP, co-NP, PSPACE, and known relationships between them, as well as intuition behind each class, without getting too technical. After reading, you should understand what the P vs. NP problem and you should get an idea of how the basic complexity classes are interrelated.</p>
<p><span id="more-1274"></span></p>
<p>To start, we need to agree on a model of Turing machines. Usually, theorists work with a single-tape model. Actually, any Turing machine can simulate any other with at most polynomial time overhead. We also distinguish between <strong>deterministic</strong> Turing machines and <strong>non-deterministic</strong> Turing machines. The difference is that from each configuration, a non-deterministic Turing machine can go on to more than one possible configuration. The precise formulation of this is that the transition function of a non-deterministic Turing machine maps configurations to sets of configurations rather than single configurations. Furthermore, we can assume that every configuration leads to two possible configurations. This doesn&#8217;t really correspond to any real-world physical model, it&#8217;s just a theoretical construction. Moreover, a non-deterministic machine accepts if and only if <em>at least one</em> of its branches accepts. It turns out that non-determinism doesn&#8217;t give Turing machines any power to recognize more languages. This is because any non-deterministic Turing machine can be simulated by a Turing machine (with potentially exponential time overhead) by trying all branches of the non-deterministic machine &#8220;in parallel&#8221; by using breadth first search, for example.</p>
<p>Now I introduce some definitions and notation. A <img src='http://s0.wp.com/latex.php?latex=f%28n%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='f(n)' title='f(n)' class='latex' />-time (or -space) Turing machine is one that uses at most <img src='http://s0.wp.com/latex.php?latex=f%28n%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='f(n)' title='f(n)' class='latex' /> steps (or tape cells) on any input of length <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='n' title='n' class='latex' />.  Define <img src='http://s0.wp.com/latex.php?latex=%5Cmathbf%7BTIME%7D%28f%28n%29%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathbf{TIME}(f(n))' title='&#92;mathbf{TIME}(f(n))' class='latex' /> to be the set of languages decidable by an <img src='http://s0.wp.com/latex.php?latex=O%28f%28n%29%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(f(n))' title='O(f(n))' class='latex' />-time deterministic Turing machine, and <img src='http://s0.wp.com/latex.php?latex=%5Cmathbf%7BNTIME%7D%28f%28n%29%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathbf{NTIME}(f(n))' title='&#92;mathbf{NTIME}(f(n))' class='latex' /> to be the set of languages decidable by an <img src='http://s0.wp.com/latex.php?latex=O%28f%28n%29%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(f(n))' title='O(f(n))' class='latex' />-time non-deterministic Turing machine. Define <img src='http://s0.wp.com/latex.php?latex=%5Cmathbf%7BSPACE%7D%28f%28n%29%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathbf{SPACE}(f(n))' title='&#92;mathbf{SPACE}(f(n))' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Cmathbf%7BNSPACE%7D%28f%28n%29%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathbf{NSPACE}(f(n))' title='&#92;mathbf{NSPACE}(f(n))' class='latex' /> analogously. Now I can define P, NP, co-NP, PSPACE, and more:</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cmathbf%7BP%7D+%3D+%5Ccup_%7Bk+%5Cge+0%7D+%5Cmathbf%7BDTIME%7D%28n%5Ek%29+%5C%5C+%5Cmathbf%7BNP%7D+%3D+%5Ccup_%7Bk+%5Cge+0%7D+%5Cmathbf%7BNTIME%7D%28n%5Ek%29+%5C%5C+%5Cmathbf%7BPSPACE%7D+%3D+%5Ccup_%7Bk+%5Cge+0%7D+%5Cmathbf%7BDSPACE%7D%28n%5Ek%29+%5C%5C+%5Cmathbf%7BNPSPACE%7D+%3D+%5Ccup_%7Bk+%5Cge+0%7D+%5Cmathbf%7BNSPACE%7D%28n%5Ek%29+%5C%5C+%5Cmathbf%7BEXPTIME%7D+%3D+%5Ccup_%7Bk+%5Cge+0%7D+%5Cmathbf%7BDTIME%7D%282%5E%7Bn%5Ek%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathbf{P} = &#92;cup_{k &#92;ge 0} &#92;mathbf{DTIME}(n^k) &#92;&#92; &#92;mathbf{NP} = &#92;cup_{k &#92;ge 0} &#92;mathbf{NTIME}(n^k) &#92;&#92; &#92;mathbf{PSPACE} = &#92;cup_{k &#92;ge 0} &#92;mathbf{DSPACE}(n^k) &#92;&#92; &#92;mathbf{NPSPACE} = &#92;cup_{k &#92;ge 0} &#92;mathbf{NSPACE}(n^k) &#92;&#92; &#92;mathbf{EXPTIME} = &#92;cup_{k &#92;ge 0} &#92;mathbf{DTIME}(2^{n^k})' title='&#92;mathbf{P} = &#92;cup_{k &#92;ge 0} &#92;mathbf{DTIME}(n^k) &#92;&#92; &#92;mathbf{NP} = &#92;cup_{k &#92;ge 0} &#92;mathbf{NTIME}(n^k) &#92;&#92; &#92;mathbf{PSPACE} = &#92;cup_{k &#92;ge 0} &#92;mathbf{DSPACE}(n^k) &#92;&#92; &#92;mathbf{NPSPACE} = &#92;cup_{k &#92;ge 0} &#92;mathbf{NSPACE}(n^k) &#92;&#92; &#92;mathbf{EXPTIME} = &#92;cup_{k &#92;ge 0} &#92;mathbf{DTIME}(2^{n^k})' class='latex' /></p>
<p>Furthermore, for any class C, we define co-C to be the set of languages whose complements are in C, which gives us definitions for co-P, co-NP, co-PSPACE, and co-NPSPACE. So far, I&#8217;ve defined a lot of complexity classes, but it turns out a lot of these are the same.</p>
<p>First of all, let&#8217;s get some intuition for these. P is the set of problems that can be solved by polynomial-time deterministic algorithms. NP is the set of problems that can be solved by polynomial-time non-deterministic algorithms. What does this actually mean? A non-deterministic algorithm can &#8220;guess&#8221; the right step to take next. From every configuration, there are 2 configurations to go to, and as long as one path of configurations leads to an accept state, the algorithm accepts. What this means is that if we get lucky, if the algorithm guesses the &#8220;correct&#8221; configuration to go to every step, then it can reach the answer in polynomial time. An equivalent definition of NP is the set of problems for which a solution can be verified in polynomial time. Examples will come soon. PSPACE is the set of problems that can be solved using polynomial space. EXPTIME is the set of problems that can be solved using exponentially long time.</p>
<p>Why do we lump all the polynomials together? Why not distinguish between linear time problems, quadratic time, etc.? Well, like I mentioned at the beginning, we assumed one particular model of the Turing machine. However, if we change the Turing machine, our run-time and space may change by a polynomial amount. Therefore what&#8217;s linear on one machine could be quadratic on another. We want robust classes &#8212; classes that are the same no matter what model of Turing machine we choose, which is why we lump all the polynomials together.</p>
<p>Some relationships follow pretty straightforwardly from the definitions. For example, P is contained in NP, since any deterministic TM can be simulated by a non-deterministic one with basically no overhead (you can just view a DTM as a NDTM with one branch at each configuration). Also, PSPACE is contained in EXPTIME. The idea is that if the tape of a TM is bounded by some number, then the number of possible configurations it has is at most exponential in the space bound. The exponential part comes from counting the number of possible strings you could have on the tape (number of possible letters in the alphabet raised to the power of the number of cells). Anyway, if a DTM encounters the same configuration twice, then it must have entered an infinite loop. Therefore, any DTM that solves a PSPACE problem must halt, and hence can run for at most exponentially many steps, since that&#8217;s an upper bound on the number of possible configurations.</p>
<p>Moreover, for deterministic classes C like P, PSPACE, and EXPTIME, in fact C = co-C. This is because you can just construct a DTM which runs the original DTM and then spits out the opposite answer. If <img src='http://s0.wp.com/latex.php?latex=L+%5Cin+C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='L &#92;in C' title='L &#92;in C' class='latex' /> is a language and <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M' title='M' class='latex' /> decides <img src='http://s0.wp.com/latex.php?latex=L&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='L' title='L' class='latex' />, then construct <img src='http://s0.wp.com/latex.php?latex=M%27&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M&#039;' title='M&#039;' class='latex' /> to spit out the opposite answer of <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M' title='M' class='latex' />. Now, <img src='http://s0.wp.com/latex.php?latex=M%27&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M&#039;' title='M&#039;' class='latex' /> decides <img src='http://s0.wp.com/latex.php?latex=%5Coverline+L+%3D+%5C%7B+x+%3A+x+%5Cnotin+L%5C%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;overline L = &#92;{ x : x &#92;notin L&#92;}' title='&#92;overline L = &#92;{ x : x &#92;notin L&#92;}' class='latex' />, since it accepts <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x' title='x' class='latex' /> if and only if <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M' title='M' class='latex' /> rejects <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x' title='x' class='latex' /> if and only if <img src='http://s0.wp.com/latex.php?latex=x+%5Cnotin+L&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x &#92;notin L' title='x &#92;notin L' class='latex' />.</p>
<p>Note, however, that this trick doesn&#8217;t work for non-deterministic machines, and so we can&#8217;t deduce that NP is the same as co-NP. In fact, their relationship is still not known! The reason this trick doesn&#8217;t work is that a non-deterministic machine might have <em>some</em> accepting branches and <em>some</em> rejecting branches for certain inputs. In this case, constructing a machine that reverses the answer will have <em>some</em> rejecting branches, corresponding to the accepting branches of the original machine, and <em>some</em> accepting branches corresponding to the rejecting branches of the original machine. In other words, on a given input, if the original machine accepts, the reversed-answer machine might ALSO accept, which is why the trick doesn&#8217;t work.</p>
<p>Another neat fact, known as <strong>Savitch&#8217;s Theorem</strong>, is that non-deterministic Turing machines can be simulated by deterministic Turing machines with at most quadratic space overhead. This implies that PSPACE = NPSPACE! Finally, the <strong>time hierarchy theorem</strong> tells us that allowing a DTM more time (by more than a log factor) the set of problems it decides is strictly larger. This immediately tells us that P is strictly contained in EXPTIME.</p>
<p>To deduce that NP is contained in PSPACE, we use the concept of completeness. For a given complexity class C, a problem is C-hard if, intuitively, it&#8217;s at least as hard as every problem in C. Technically, a language L is C-hard if every language in C is <strong>polynomial-reducible</strong> to L. A is polynomial-reducible to B, written <img src='http://s0.wp.com/latex.php?latex=A+%5Cle_p+B&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='A &#92;le_p B' title='A &#92;le_p B' class='latex' />, if there exists a polynomial-time computable function <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='f' title='f' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=x+%5Cin+A+%5CLeftrightarrow+f%28x%29+%5Cin+B&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x &#92;in A &#92;Leftrightarrow f(x) &#92;in B' title='x &#92;in A &#92;Leftrightarrow f(x) &#92;in B' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x' title='x' class='latex' />. Intuitively, this means that B is as hard as A, since if we have a polynomial time algorithm for B, we can use it to solve A by first transforming the input using <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='f' title='f' class='latex' /> (in polynomial time), then feeding the result into our algorithm for B and returning the answer. A problem is C-<strong>complete</strong> if it is C-hard <em>and</em> it is itself in C. This intuitively means it is one of the hardest problems in C, and any polynomial time algorithm for a C-complete language gives us a polynomial time algorithm for <em>every</em> language in C.</p>
<p>The &#8220;canonical&#8221; NP-complete problem is SAT, the language of satisfiable boolean formulas. A boolean formula is just an expression involving variables and the boolean operators OR, AND, and NOT, and it is satisfiable if there is an assignment of truth values to the variables which makes the entire expression evaluate to TRUE. This is in NP, since given an assignment of truth values, we can verify that it satisfies the boolean formula in polynomial time why simply plugging in the values. This problem is also NP-complete, and the proof is way too technical to write out here, so I&#8217;ll just give the main idea. To reduce any NP problem L to SAT, you can construct a boolean formula which &#8220;simulates&#8221; the machine for L, in a manner reminiscent of how modern electronic circuits simulate Turing machines (and are really just evaluating boolean expressions). The boolean formula constructed will be satisfiable if and only if the simulated machine accepts the input.</p>
<p>It turns out SAT is in PSPACE. You can solve it using polynomial space (linear, in fact) by successively plugging in possible truth assignments. If there are <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='n' title='n' class='latex' /> variables, you just try all <img src='http://s0.wp.com/latex.php?latex=2%5En&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='2^n' title='2^n' class='latex' /> possible assignments, but at any given time you only need to keep track of <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='n' title='n' class='latex' /> values. But since SAT is NP-complete, the entire set NP is contained in PSPACE. In retrospect, this is not surprising at all. Intuitively, space should be more powerful than time, since we can re-use space but we can&#8217;t re-use time. So, in summary, we have</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cmathbf%7BP%7D+%5Csubseteq+%5Cmathbf%7BNP%7D+%5Csubseteq+%5Cmathbf%7BPSPACE%7D+%3D+%5Cmathbf%7BNPSPACE%7D+%5Csubseteq+%5Cmathbf%7BEXPTIME%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathbf{P} &#92;subseteq &#92;mathbf{NP} &#92;subseteq &#92;mathbf{PSPACE} = &#92;mathbf{NPSPACE} &#92;subseteq &#92;mathbf{EXPTIME}' title='&#92;mathbf{P} &#92;subseteq &#92;mathbf{NP} &#92;subseteq &#92;mathbf{PSPACE} = &#92;mathbf{NPSPACE} &#92;subseteq &#92;mathbf{EXPTIME}' class='latex' /></p>
<p>and moreover we know that <strong>at least one</strong> of the containments is strict, but we don&#8217;t know which! Most theoreticians believe that every one of them is strict.</p>
<p>The interesting about NP and co-NP is that intuitively they should be different, but we really don&#8217;t know for sure. The complement of SAT, the language of unsatisfiable boolean formulas, is co-NP-complete. We don&#8217;t know if it&#8217;s in NP. While it&#8217;s easy to verify that a formula is satisfiable by exhibiting a satisfying assignment, there seems to be no easy way of verifying that a formula is <em>not</em> satisfiable. A problem that is in both NP and co-NP is PRIMES, the language of all prime numbers. It&#8217;s in co-NP because it&#8217;s easy to verify that a number is <em>not</em> prime by simply exhibiting a proper divisor.</p>
<p>I hear most theoreticians believe that P is not equal to NP. But what if P = NP? What would happen? Well, for one, it would mean it takes barely any more time to find a solution than it does to check one. We would be able to automate many processes so far believed to be difficult. Pretty much every cryptography system would fall apart since it would not take much longer to crack a key than it would to create it. A lot of good would come along with some bad. But this speculation is most likely pointless. Although the relationship hasn&#8217;t been proven, <em>intuitively</em> it seems clear that the two sets should not be equal, the same way the Jordan curve theorem was intuitively clear but was not proven until the early 1900s.</p>
<p>This post is already running much longer than I planned, so I&#8217;m going to save discussion of the interesting-ness of PSPACE for a follow-up post. I find PSPACE particularly interesting because it naturally captures the difficulty of many two-player games.</p>
<p>As always, leave a comment if you have questions or if you catch a mistake. Or if you disagree with me about P vs. NP.</p>
<p>-Alan</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1274/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1274/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/concretenonsense.wordpress.com/1274/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/concretenonsense.wordpress.com/1274/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/concretenonsense.wordpress.com/1274/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/concretenonsense.wordpress.com/1274/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/concretenonsense.wordpress.com/1274/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/concretenonsense.wordpress.com/1274/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/concretenonsense.wordpress.com/1274/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/concretenonsense.wordpress.com/1274/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/concretenonsense.wordpress.com/1274/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/concretenonsense.wordpress.com/1274/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/concretenonsense.wordpress.com/1274/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/concretenonsense.wordpress.com/1274/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1274&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/07/21/complexity-classes-p-np-co-np-pspace/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">aguo777</media:title>
		</media:content>
	</item>
		<item>
		<title>Nuking mosquitoes: Commutative algebra and the Frobenius problem</title>
		<link>http://concretenonsense.wordpress.com/2011/07/13/nuking-mosquitoes-commutative-algebra-and-the-frobenius-problem/</link>
		<comments>http://concretenonsense.wordpress.com/2011/07/13/nuking-mosquitoes-commutative-algebra-and-the-frobenius-problem/#comments</comments>
		<pubDate>Wed, 13 Jul 2011 21:26:24 +0000</pubDate>
		<dc:creator>Alan Guo</dc:creator>
				<category><![CDATA[Commutative Algebra]]></category>
		<category><![CDATA[frobenius problem]]></category>
		<category><![CDATA[hilbert series]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1244</guid>
		<description><![CDATA[This is one of those things that convinced me that commutative algebra was cool and actually useful for solving combinatorial problems. The Frobenius problem (commonly known as the coin problem, of which a special case is the McNugget problem) asks: given positive integers with , what is the largest integer that cannot be written in [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1244&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This is one of those things that convinced me that commutative algebra was cool and actually useful for solving combinatorial problems. The Frobenius problem (commonly known as the coin problem, of which a special case is the McNugget problem) asks: given positive integers <img src='http://s0.wp.com/latex.php?latex=a_1%2C+a_2%2C+%5Cldots%2C+a_n&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a_1, a_2, &#92;ldots, a_n' title='a_1, a_2, &#92;ldots, a_n' class='latex' /> with <img src='http://s0.wp.com/latex.php?latex=%5Cgcd%28a_1%2C+a_2%2C%5Cldots%2Ca_n%29+%3D+1&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;gcd(a_1, a_2,&#92;ldots,a_n) = 1' title='&#92;gcd(a_1, a_2,&#92;ldots,a_n) = 1' class='latex' />, what is the largest integer <img src='http://s0.wp.com/latex.php?latex=m&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='m' title='m' class='latex' /> that cannot be written in the form <img src='http://s0.wp.com/latex.php?latex=m+%3D+c_1a_1+%2B+c_2a_2+%2B+%5Ccdots+c_na_n&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='m = c_1a_1 + c_2a_2 + &#92;cdots c_na_n' title='m = c_1a_1 + c_2a_2 + &#92;cdots c_na_n' class='latex' /> for nonnegative integers <img src='http://s0.wp.com/latex.php?latex=c_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='c_i' title='c_i' class='latex' />? For demonstrative purposes, I&#8217;ll show you how you can use commutative algebra to solve the case <img src='http://s0.wp.com/latex.php?latex=n+%3D+2&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='n = 2' title='n = 2' class='latex' />, but the principle generalizes to larger <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='n' title='n' class='latex' />. At the end, I&#8217;ll speculate on how one might use this to solve higher-dimensional versions of the Frobenius problem, whatever that might mean.</p>
<p><span id="more-1244"></span></p>
<p>Let <img src='http://s0.wp.com/latex.php?latex=Q+%3D+%5C%7Bc_1+a+%2B+c_2+b+%3A+c_1%2Cc_2+%5Cge+0%5C%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='Q = &#92;{c_1 a + c_2 b : c_1,c_2 &#92;ge 0&#92;}' title='Q = &#92;{c_1 a + c_2 b : c_1,c_2 &#92;ge 0&#92;}' class='latex' />. The problem statement implicitly assumes <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D+%5Csetminus+Q&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathbb{N} &#92;setminus Q' title='&#92;mathbb{N} &#92;setminus Q' class='latex' /> is finite, but this is not difficult to show, especially when you&#8217;re wielding the rocket launcher that is commutative algebra. Notice that <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='Q' title='Q' class='latex' /> is a <strong>affine semigroup</strong>, which just means that it&#8217;s isomorphic to a finitely generated subsemigroup of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D%5Ed&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathbb{Z}^d' title='&#92;mathbb{Z}^d' class='latex' /> for some <img src='http://s0.wp.com/latex.php?latex=d&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='d' title='d' class='latex' />. The <strong>saturation</strong> <img src='http://s0.wp.com/latex.php?latex=Q_%7B%5Ctext%7Bsat%7D%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='Q_{&#92;text{sat}}' title='Q_{&#92;text{sat}}' class='latex' /> of an affine semigroup <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='Q' title='Q' class='latex' /> is what you get when you take the group generated by <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='Q' title='Q' class='latex' /> intersected with the nonnegative cone <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='Q' title='Q' class='latex' /> generates. In symbols, <img src='http://s0.wp.com/latex.php?latex=Q_%7B%5Ctext%7Bsat%7D%7D+%3D+%5Cmathbb%7BZ%7D+Q+%5Ccap+%5Cmathbb%7BR_%2B%7D+Q&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='Q_{&#92;text{sat}} = &#92;mathbb{Z} Q &#92;cap &#92;mathbb{R_+} Q' title='Q_{&#92;text{sat}} = &#92;mathbb{Z} Q &#92;cap &#92;mathbb{R_+} Q' class='latex' />. The difference between an affine semigroup and its saturation is that the saturation &#8220;fills in the gaps&#8221; of the original affine semigroup so that it looks like it came from an actual group. In our case, <img src='http://s0.wp.com/latex.php?latex=Q_%7B%5Ctext%7Bsat%7D%7D+%3D+%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='Q_{&#92;text{sat}} = &#92;mathbb{N}' title='Q_{&#92;text{sat}} = &#92;mathbb{N}' class='latex' />. A basic result (Exercise 7.15 from <em>Combinatorial Commutative Algebra</em> by Miller-Sturmfels) says that any affine semigroup contains a translate of its saturation.  Well, that means <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='Q' title='Q' class='latex' /> contains a translate of <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathbb{N}' title='&#92;mathbb{N}' class='latex' />, so there&#8217;s some number after which every number is in <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='Q' title='Q' class='latex' />.</p>
<p>The generating function <img src='http://s0.wp.com/latex.php?latex=p%28t%29+%3D+%5Csum_%7Bi+%5Cnotin+Q%7D+t%5Ei&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='p(t) = &#92;sum_{i &#92;notin Q} t^i' title='p(t) = &#92;sum_{i &#92;notin Q} t^i' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BN%7D+%5Csetminus+Q&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathbb{N} &#92;setminus Q' title='&#92;mathbb{N} &#92;setminus Q' class='latex' /> is a polynomial, and furthermore it is <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7B1%7D%7B1-t%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;frac{1}{1-t}' title='&#92;frac{1}{1-t}' class='latex' /> minus the Hilbert series <img src='http://s0.wp.com/latex.php?latex=h_M%28t%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='h_M(t)' title='h_M(t)' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=M+%3D+%5Cmathbb%7BC%7D%5Bt%5Ea%2C+t%5Eb%5D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M = &#92;mathbb{C}[t^a, t^b]' title='M = &#92;mathbb{C}[t^a, t^b]' class='latex' />, since the monomials in <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BC%7D%5Bt%5Ea%2C+t%5Eb%5D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathbb{C}[t^a, t^b]' title='&#92;mathbb{C}[t^a, t^b]' class='latex' /> are precisely the ones whose exponents are in <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='Q' title='Q' class='latex' />. So now if we could find a nice form for <img src='http://s0.wp.com/latex.php?latex=h_M%28t%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='h_M(t)' title='h_M(t)' class='latex' />, then we&#8217;d be done.</p>
<p>We can view <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M' title='M' class='latex' /> as a module over <img src='http://s0.wp.com/latex.php?latex=R+%3D+%5Cmathbb%7BC%7D%5Bx%2Cy%5D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='R = &#92;mathbb{C}[x,y]' title='R = &#92;mathbb{C}[x,y]' class='latex' /> where <img src='http://s0.wp.com/latex.php?latex=%5Cdeg%28x%29+%3D+a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;deg(x) = a' title='&#92;deg(x) = a' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Cdeg%28y%29+%3D+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;deg(y) = b' title='&#92;deg(y) = b' class='latex' />. Then we have an exact sequence</p>
<p><img src='http://s0.wp.com/latex.php?latex=0+%5Cto+%28x%5Eb+-+y%5Ea%29+%5Cto+%5Cmathbb%7BC%7D%5Bx%2Cy%5D+%5Cto+%5Cmathbb%7BC%7D%5Bt%5Ea%2Ct%5Eb%5D+%5Cto+0&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='0 &#92;to (x^b - y^a) &#92;to &#92;mathbb{C}[x,y] &#92;to &#92;mathbb{C}[t^a,t^b] &#92;to 0' title='0 &#92;to (x^b - y^a) &#92;to &#92;mathbb{C}[x,y] &#92;to &#92;mathbb{C}[t^a,t^b] &#92;to 0' class='latex' /></p>
<p>where the second map from the left is inclusion and the third map is <img src='http://s0.wp.com/latex.php?latex=x+%5Cmapsto+t%5Ea%2C+y+%5Cmapsto+t%5Eb&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x &#92;mapsto t^a, y &#92;mapsto t^b' title='x &#92;mapsto t^a, y &#92;mapsto t^b' class='latex' />, which is a surjection, so it suffices to show that we have exactness in the middle. Let <img src='http://s0.wp.com/latex.php?latex=I+%3D+%28x%5Eb+-+y%5Ea%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='I = (x^b - y^a)' title='I = (x^b - y^a)' class='latex' /> and let <img src='http://s0.wp.com/latex.php?latex=K+%3D+%5Cker%28R+%5Cto+M%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='K = &#92;ker(R &#92;to M)' title='K = &#92;ker(R &#92;to M)' class='latex' />. Now, it is straightforward to see that <img src='http://s0.wp.com/latex.php?latex=I+%5Csubseteq+K&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='I &#92;subseteq K' title='I &#92;subseteq K' class='latex' />. On the other hand, since <img src='http://s0.wp.com/latex.php?latex=%5Cmathbb%7BC%7D%5Bx%2Cy%5D+%2F+K+%5Ccong+%5Cmathbb%7BC%7D%5Bt%5Ea%2Ct%5Eb%5D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathbb{C}[x,y] / K &#92;cong &#92;mathbb{C}[t^a,t^b]' title='&#92;mathbb{C}[x,y] / K &#92;cong &#92;mathbb{C}[t^a,t^b]' class='latex' /> which is not a field, we know that <img src='http://s0.wp.com/latex.php?latex=K&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='K' title='K' class='latex' /> is not a maximal ideal, hence <img src='http://s0.wp.com/latex.php?latex=%5Cdim+K+%5Cge+1&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;dim K &#92;ge 1' title='&#92;dim K &#92;ge 1' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%5Cdim&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;dim' title='&#92;dim' class='latex' /> here is the Krull dimension. We know that <img src='http://s0.wp.com/latex.php?latex=%5Cdim+%5Cmathbb%7BC%7D%5Bx%2Cy%5D+%3D+2&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;dim &#92;mathbb{C}[x,y] = 2' title='&#92;dim &#92;mathbb{C}[x,y] = 2' class='latex' /> and the <img src='http://s0.wp.com/latex.php?latex=0&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='0' title='0' class='latex' /> ideal is prime and <img src='http://s0.wp.com/latex.php?latex=0+%5Csubsetneq+I&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='0 &#92;subsetneq I' title='0 &#92;subsetneq I' class='latex' />, so <img src='http://s0.wp.com/latex.php?latex=%5Cdim+I+%5Cle+1&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;dim I &#92;le 1' title='&#92;dim I &#92;le 1' class='latex' />. But the containment <img src='http://s0.wp.com/latex.php?latex=I+%5Csubseteq+K&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='I &#92;subseteq K' title='I &#92;subseteq K' class='latex' /> implies that <img src='http://s0.wp.com/latex.php?latex=%5Cdim+I+%5Cge+%5Cdim+K&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;dim I &#92;ge &#92;dim K' title='&#92;dim I &#92;ge &#92;dim K' class='latex' />, so we have <img src='http://s0.wp.com/latex.php?latex=1+%5Cge+%5Cdim+I+%5Cge+%5Cdim+K+%5Cge+1&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='1 &#92;ge &#92;dim I &#92;ge &#92;dim K &#92;ge 1' title='1 &#92;ge &#92;dim I &#92;ge &#92;dim K &#92;ge 1' class='latex' />, hence <img src='http://s0.wp.com/latex.php?latex=%5Cdim+I+%3D+%5Cdim+K&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;dim I = &#92;dim K' title='&#92;dim I = &#92;dim K' class='latex' />. To show that <img src='http://s0.wp.com/latex.php?latex=I+%3D+K&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='I = K' title='I = K' class='latex' />, we just need to show that <img src='http://s0.wp.com/latex.php?latex=I&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='I' title='I' class='latex' /> is prime, but this follows from the fact that <img src='http://s0.wp.com/latex.php?latex=%5Cgcd%28a%2Cb%29+%3D+1&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;gcd(a,b) = 1' title='&#92;gcd(a,b) = 1' class='latex' />.</p>
<p>Now, note that the Hilbert series <img src='http://s0.wp.com/latex.php?latex=h_I%28t%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='h_I(t)' title='h_I(t)' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=I&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='I' title='I' class='latex' /> is just <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7Bt%5E%7Bab%7D%7D%7B%281-t%5Ea%29%281-t%5Eb%29%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;frac{t^{ab}}{(1-t^a)(1-t^b)}' title='&#92;frac{t^{ab}}{(1-t^a)(1-t^b)}' class='latex' /> since <img src='http://s0.wp.com/latex.php?latex=%5Cdeg%28x%5Eb+-+y%5Ea%29+%3D+ab&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;deg(x^b - y^a) = ab' title='&#92;deg(x^b - y^a) = ab' class='latex' />, so by rank-nullity,</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+h_M%28t%29+%3D+h_R%28t%29+-+h_I%28t%29+%3D+%5Cfrac%7B1+-+t%5E%7Bab%7D%7D%7B%281-t%5Ea%29%281-t%5Eb%29%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;displaystyle h_M(t) = h_R(t) - h_I(t) = &#92;frac{1 - t^{ab}}{(1-t^a)(1-t^b)}' title='&#92;displaystyle h_M(t) = h_R(t) - h_I(t) = &#92;frac{1 - t^{ab}}{(1-t^a)(1-t^b)}' class='latex' />.</p>
<p>We have</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+h_M%28t%29+%2B+p%28t%29+%3D+%5Csum_%7Bi+%5Cin+Q%7D+t%5Ei+%2B+%5Csum_%7Bi+%5Cnotin+Q%7D+%3D+%5Csum_%7Bi+%5Cin+%5Cmathbb%7BN%7D%7D+t%5Ei+%3D+%5Cfrac%7B1%7D%7B1-t%7D%2C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;displaystyle h_M(t) + p(t) = &#92;sum_{i &#92;in Q} t^i + &#92;sum_{i &#92;notin Q} = &#92;sum_{i &#92;in &#92;mathbb{N}} t^i = &#92;frac{1}{1-t},' title='&#92;displaystyle h_M(t) + p(t) = &#92;sum_{i &#92;in Q} t^i + &#92;sum_{i &#92;notin Q} = &#92;sum_{i &#92;in &#92;mathbb{N}} t^i = &#92;frac{1}{1-t},' class='latex' /></p>
<p>hence</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+h_M%28t%29+%3D+%5Cfrac%7B1%7D%7B1-t%7D+-+p%28t%29+%3D+%5Cfrac%7Bf%28t%29%7D%7B1-t%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;displaystyle h_M(t) = &#92;frac{1}{1-t} - p(t) = &#92;frac{f(t)}{1-t}' title='&#92;displaystyle h_M(t) = &#92;frac{1}{1-t} - p(t) = &#92;frac{f(t)}{1-t}' class='latex' /></p>
<p>where <img src='http://s0.wp.com/latex.php?latex=f%28t%29+%3D+1+-+p%28t%29+%2B+tp%28t%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='f(t) = 1 - p(t) + tp(t)' title='f(t) = 1 - p(t) + tp(t)' class='latex' /> is a polynomial since <img src='http://s0.wp.com/latex.php?latex=p%28t%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='p(t)' title='p(t)' class='latex' /> is. Now, the final answer to our problem is <img src='http://s0.wp.com/latex.php?latex=%5Cdeg+p+%3D+%5Cdeg+f+-+1&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;deg p = &#92;deg f - 1' title='&#92;deg p = &#92;deg f - 1' class='latex' />. But</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cfrac%7B1-t%5E%7Bab%7D%7D%7B%281-t%5Ea%29%281-t%5Eb%29%7D+%3D+%5Cfrac%7Bf%28t%29%7D%7B1-t%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;displaystyle &#92;frac{1-t^{ab}}{(1-t^a)(1-t^b)} = &#92;frac{f(t)}{1-t}' title='&#92;displaystyle &#92;frac{1-t^{ab}}{(1-t^a)(1-t^b)} = &#92;frac{f(t)}{1-t}' class='latex' /></p>
<p>so</p>
<p><img src='http://s0.wp.com/latex.php?latex=%281-t%29%281-t%5E%7Bab%7D%29+%3D+%281-t%5Ea%29%281-t%5Eb%29f%28t%29.&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='(1-t)(1-t^{ab}) = (1-t^a)(1-t^b)f(t).' title='(1-t)(1-t^{ab}) = (1-t^a)(1-t^b)f(t).' class='latex' /></p>
<p>Since the degrees of both sides must be equal, we have <img src='http://s0.wp.com/latex.php?latex=1+%2B+ab+%3D+a+%2B+b+%2B+%5Cdeg+f&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='1 + ab = a + b + &#92;deg f' title='1 + ab = a + b + &#92;deg f' class='latex' />, whence <img src='http://s0.wp.com/latex.php?latex=%5Cdeg+f+%3D+1+%2B+ab+-+a+-+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;deg f = 1 + ab - a - b' title='&#92;deg f = 1 + ab - a - b' class='latex' />. So our answer is <img src='http://s0.wp.com/latex.php?latex=%5Cdeg+p+%3D+ab+-+a+-+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;deg p = ab - a - b' title='&#92;deg p = ab - a - b' class='latex' />. Yay! And we solved it using purely algebra!</p>
<p>One can ask a similar question for higher dimensions. For example, the answer to the Frobenius problem also answers the question &#8220;Where does the translate of the saturation of <img src='http://s0.wp.com/latex.php?latex=Q&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='Q' title='Q' class='latex' /> start?&#8221; (after adding 1). There&#8217;s probably some similar question you can ask about higher dimensions, although now it&#8217;s not as clear what the &#8220;minimal&#8221; place where the saturation starts should be. I think it would make the most sense if an answer to such a question would take the form of a set of &#8220;generators&#8221;, where everything in the ideal generated by the &#8220;generators&#8221; is in the saturation. Anyway, if anyone knows the answer already, please let me know! I think the answer might lie in a paper of Maclagan and Smith titled <em>Multigraded Castelnuovo-Mumford Regularity</em>, but since I don&#8217;t even understand Castelnuovo-Mumford regularity in one variable, I have little hope of comprehending that paper.</p>
<p>Like always, if you catch any mistakes or have questions, leave a comment!</p>
<p>-Alan</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1244/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/concretenonsense.wordpress.com/1244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/concretenonsense.wordpress.com/1244/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/concretenonsense.wordpress.com/1244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/concretenonsense.wordpress.com/1244/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/concretenonsense.wordpress.com/1244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/concretenonsense.wordpress.com/1244/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/concretenonsense.wordpress.com/1244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/concretenonsense.wordpress.com/1244/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/concretenonsense.wordpress.com/1244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/concretenonsense.wordpress.com/1244/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/concretenonsense.wordpress.com/1244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/concretenonsense.wordpress.com/1244/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1244&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/07/13/nuking-mosquitoes-commutative-algebra-and-the-frobenius-problem/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">aguo777</media:title>
		</media:content>
	</item>
		<item>
		<title>Cantor&#8217;s diagonal argument and undecidability</title>
		<link>http://concretenonsense.wordpress.com/2011/07/12/cantors-diagonal-argument-and-undecidability/</link>
		<comments>http://concretenonsense.wordpress.com/2011/07/12/cantors-diagonal-argument-and-undecidability/#comments</comments>
		<pubDate>Tue, 12 Jul 2011 03:27:55 +0000</pubDate>
		<dc:creator>Alan Guo</dc:creator>
				<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[computer science]]></category>
		<category><![CDATA[turing machine]]></category>
		<category><![CDATA[undecidability]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1219</guid>
		<description><![CDATA[Probably every mathematician is familiar with Cantor&#8217;s diagonal argument for proving that there are uncountably many real numbers, but less well-known is the proof of the existence of an undecidable problem in computer science, which also uses Cantor&#8217;s diagonal argument. I thought it was really cool when I first learned it last year. To understand [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1219&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Probably every mathematician is familiar with Cantor&#8217;s diagonal argument for proving that there are uncountably many real numbers, but less well-known is the proof of the existence of an undecidable problem in computer science, which also uses Cantor&#8217;s diagonal argument. I thought it was really cool when I first learned it last year.</p>
<p><span id="more-1219"></span></p>
<p>To understand undecidability, you first have to know what a Turing machine is. A Turing machine is just a theoretical model of a computer, with a work tape, a tape head which points to a cell on the tape, and a set of states. These three things determine a <strong>configuration</strong> of the TM. In addition, the TM comes with a transition function which dictates how the TM changes its configuration. Lastly, one of its states is designated the <strong>initial state</strong>, which is the state that the TM starts in every time it runs on an input, and some of its states are designated <strong>accept states</strong>, while others are designated <strong>reject states</strong>. A precise definition of a Turing machine can be found in any textbook on theory of computation, but it&#8217;s not terribly important for our goal here. The thing to take away is that a TM is given an input (written on its tape), then its transition function dictates what it does (like an algorithm), and if the TM lands on an accept state, then it accepts the input, and if it lands on a reject state, then it rejects the input. Note that it is also possible for infinite loops to occur for certain inputs, in which case the TM is considered to not accept or reject.</p>
<p>I didn&#8217;t mention what kind of inputs are allowed for Turing machines. It can be any finite alphabet <img src='http://s0.wp.com/latex.php?latex=%5CSigma&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;Sigma' title='&#92;Sigma' class='latex' />, but I like to think of the allowed alphabet as <img src='http://s0.wp.com/latex.php?latex=%5C%7B0%2C+1%5C%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;{0, 1&#92;}' title='&#92;{0, 1&#92;}' class='latex' /> since that&#8217;s what we use in modern electronic computers, and two letters is enough to do everything you want to do, so from now on we&#8217;ll assume Turing machines take binary string inputs. A <strong>language</strong> is a set (possibly infinite) of binary strings. The language recognized by a Turing machine is the set of strings that it accepts. So, when we talk about &#8220;problems&#8221;, such as the primality problem (given an positive integer, determine if it&#8217;s prime) or the SAT problem (given a boolean formula, determine if it&#8217;s satisfiable), we are really talking about languages. For example, we can view the primality problem as the set of binary strings which represent prime numbers. Then the question of whether there exists an algorithm for deciding primality is actually whether there exists a Turing machine which decides the corresponding language.</p>
<p>I just used the term &#8220;decide&#8221; but I haven&#8217;t defined it yet. Whereas a machine recognizes a language if it accepts precisely those strings in it, a machine <strong>decides</strong> a language if it recognizes the language and halts on every input. The cool thing is that there are problems that are recognizable but not decidable, and showing that is the main goal of my post.</p>
<p>Consider the language</p>
<p><img src='http://s0.wp.com/latex.php?latex=L+%3D+%5C%7B%5Clangle+M%2C+x+%5Crangle+%3A+%5Ctext%7B+Turing+machine+%7D+M+%5Ctext%7B+accepts+%7D+x+%5C%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='L = &#92;{&#92;langle M, x &#92;rangle : &#92;text{ Turing machine } M &#92;text{ accepts } x &#92;}' title='L = &#92;{&#92;langle M, x &#92;rangle : &#92;text{ Turing machine } M &#92;text{ accepts } x &#92;}' class='latex' /></p>
<p>where <img src='http://s0.wp.com/latex.php?latex=%5Clangle+M%2C+x+%5Crangle&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;langle M, x &#92;rangle' title='&#92;langle M, x &#92;rangle' class='latex' /> just means a binary string encoding <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M' title='M' class='latex' /> concatenated with the binary string <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x' title='x' class='latex' />, with a delimiter in between so the Turing machine knows where the encoding of <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M' title='M' class='latex' /> ends. In other words, the problem is to determine if a given Turing machine <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M' title='M' class='latex' /> accepts a given binary string <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x' title='x' class='latex' />. This problem is recognizable. You can design a Turing machine <img src='http://s0.wp.com/latex.php?latex=U&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='U' title='U' class='latex' />, called the &#8220;universal Turing machine&#8221;, which can simulate any other Turing machine. Then, given input <img src='http://s0.wp.com/latex.php?latex=%5Clangle+M%2C+x+%5Crangle&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;langle M, x &#92;rangle' title='&#92;langle M, x &#92;rangle' class='latex' />, just simulate <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M' title='M' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x' title='x' class='latex' /> and accept if <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M' title='M' class='latex' /> accepts <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x' title='x' class='latex' /> and reject if <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M' title='M' class='latex' /> rejects <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x' title='x' class='latex' />. Clearly our machine will accept <img src='http://s0.wp.com/latex.php?latex=%5Clangle+M%2C+x+%5Crangle&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;langle M, x &#92;rangle' title='&#92;langle M, x &#92;rangle' class='latex' /> if and only if <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M' title='M' class='latex' /> accepts <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x' title='x' class='latex' />, so it recognizes <img src='http://s0.wp.com/latex.php?latex=L&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='L' title='L' class='latex' />. It doesn&#8217;t decide <img src='http://s0.wp.com/latex.php?latex=L&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='L' title='L' class='latex' />, however, since if <img src='http://s0.wp.com/latex.php?latex=M&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M' title='M' class='latex' /> goes into an infinite loop on <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x' title='x' class='latex' />, then so does our machine, but we want our machine to reject when this happens.</p>
<p>To show that <img src='http://s0.wp.com/latex.php?latex=L&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='L' title='L' class='latex' /> is undecidable, we prove it by contradiction. Suppose we had a machine <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='D' title='D' class='latex' /> which decides <img src='http://s0.wp.com/latex.php?latex=L&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='L' title='L' class='latex' />. Note that every Turing machine has some encoding as a finite length binary string, so we can enumerate them as <img src='http://s0.wp.com/latex.php?latex=M_1%2C+M_2%2C+M_3%2C+M_4%2C+%5Cldots&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M_1, M_2, M_3, M_4, &#92;ldots' title='M_1, M_2, M_3, M_4, &#92;ldots' class='latex' /> and construct the following table:</p>
<p><img src='http://s0.wp.com/latex.php?latex=%5Cbegin%7Barray%7D%7Bcccccc%7D+%26+M_1+%26+M_2+%26+M_3+%26+M_4+%26+%5Ccdots+%5C%5C+M_1+%26+1+%26+0+%26+0+%26+1+%26+%5Ccdots+%5C%5C+M_2+%26+0+%26+1+%26+0+%26+0+%26+%5Ccdots+%5C%5C+M_3+%26+1+%26+1+%26+0+%26+1+%26+%5Ccdots+%5C%5C+M_4+%26+0+%26+0+%26+1+%26+0+%26+%5Ccdots+%5C%5C+%5Cvdots+%26+%5Cvdots+%26+%5Cvdots+%26+%5Cvdots+%26+%5Cvdots+%26+%5Cddots+%5Cend%7Barray%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;begin{array}{cccccc} &amp; M_1 &amp; M_2 &amp; M_3 &amp; M_4 &amp; &#92;cdots &#92;&#92; M_1 &amp; 1 &amp; 0 &amp; 0 &amp; 1 &amp; &#92;cdots &#92;&#92; M_2 &amp; 0 &amp; 1 &amp; 0 &amp; 0 &amp; &#92;cdots &#92;&#92; M_3 &amp; 1 &amp; 1 &amp; 0 &amp; 1 &amp; &#92;cdots &#92;&#92; M_4 &amp; 0 &amp; 0 &amp; 1 &amp; 0 &amp; &#92;cdots &#92;&#92; &#92;vdots &amp; &#92;vdots &amp; &#92;vdots &amp; &#92;vdots &amp; &#92;vdots &amp; &#92;ddots &#92;end{array}' title='&#92;begin{array}{cccccc} &amp; M_1 &amp; M_2 &amp; M_3 &amp; M_4 &amp; &#92;cdots &#92;&#92; M_1 &amp; 1 &amp; 0 &amp; 0 &amp; 1 &amp; &#92;cdots &#92;&#92; M_2 &amp; 0 &amp; 1 &amp; 0 &amp; 0 &amp; &#92;cdots &#92;&#92; M_3 &amp; 1 &amp; 1 &amp; 0 &amp; 1 &amp; &#92;cdots &#92;&#92; M_4 &amp; 0 &amp; 0 &amp; 1 &amp; 0 &amp; &#92;cdots &#92;&#92; &#92;vdots &amp; &#92;vdots &amp; &#92;vdots &amp; &#92;vdots &amp; &#92;vdots &amp; &#92;ddots &#92;end{array}' class='latex' /></p>
<p>where the entry in row <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='i' title='i' class='latex' />, column <img src='http://s0.wp.com/latex.php?latex=j&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='j' title='j' class='latex' /> is a <img src='http://s0.wp.com/latex.php?latex=0&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='0' title='0' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=M_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M_i' title='M_i' class='latex' /> accepts <img src='http://s0.wp.com/latex.php?latex=%5Clangle+M_j+%5Crangle&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;langle M_j &#92;rangle' title='&#92;langle M_j &#92;rangle' class='latex' /> and is a <img src='http://s0.wp.com/latex.php?latex=1&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='1' title='1' class='latex' /> if not. I just made up the numbers for concreteness. We can construct this table since we have <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='D' title='D' class='latex' /> (to determine the value of an entry, just run <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='D' title='D' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=%5Clangle+M_i%2C+M_j+%5Crangle&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;langle M_i, M_j &#92;rangle' title='&#92;langle M_i, M_j &#92;rangle' class='latex' />. Now, consider the Turing machine <img src='http://s0.wp.com/latex.php?latex=M_%7B%5Ctext%7Bbad%7D%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M_{&#92;text{bad}}' title='M_{&#92;text{bad}}' class='latex' /> which, given input <img src='http://s0.wp.com/latex.php?latex=%5Clangle+M+%5Crangle&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;langle M &#92;rangle' title='&#92;langle M &#92;rangle' class='latex' />, does the opposite of what <img src='http://s0.wp.com/latex.php?latex=D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='D' title='D' class='latex' /> does on input <img src='http://s0.wp.com/latex.php?latex=%5Clangle+M%2C+M+%5Crangle&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;langle M, M &#92;rangle' title='&#92;langle M, M &#92;rangle' class='latex' /> (if the input happens to not be a proper encoding of any Turing machine, then it&#8217;s irrelevant what <img src='http://s0.wp.com/latex.php?latex=M_%7B%5Ctext%7Bbad%7D%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M_{&#92;text{bad}}' title='M_{&#92;text{bad}}' class='latex' /> does). That is, <img src='http://s0.wp.com/latex.php?latex=M_%7B%5Ctext%7Bbad%7D%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M_{&#92;text{bad}}' title='M_{&#92;text{bad}}' class='latex' /> accepts <img src='http://s0.wp.com/latex.php?latex=%5Clangle+M+%5Crangle&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;langle M &#92;rangle' title='&#92;langle M &#92;rangle' class='latex' /> if the entry corresponding to <img src='http://s0.wp.com/latex.php?latex=%5Clangle+M%2C+M+%5Crangle&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;langle M, M &#92;rangle' title='&#92;langle M, M &#92;rangle' class='latex' /> in our table is a <img src='http://s0.wp.com/latex.php?latex=0&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='0' title='0' class='latex' />, and rejects if the corresponding entry is a <img src='http://s0.wp.com/latex.php?latex=1&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='1' title='1' class='latex' />. Now, ask yourself: which <img src='http://s0.wp.com/latex.php?latex=M_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M_i' title='M_i' class='latex' /> is <img src='http://s0.wp.com/latex.php?latex=M_%7B%5Ctext%7Bbad%7D%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M_{&#92;text{bad}}' title='M_{&#92;text{bad}}' class='latex' />? For each <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='i' title='i' class='latex' />, our machine <img src='http://s0.wp.com/latex.php?latex=M_%7B%5Ctext%7Bbad%7D%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M_{&#92;text{bad}}' title='M_{&#92;text{bad}}' class='latex' /> differs from <img src='http://s0.wp.com/latex.php?latex=M_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M_i' title='M_i' class='latex' /> on the input <img src='http://s0.wp.com/latex.php?latex=%5Clangle+M_i+%5Crangle&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;langle M_i &#92;rangle' title='&#92;langle M_i &#92;rangle' class='latex' />, so it can&#8217;t be any of these. But we&#8217;ve enumerated all the Turing machines, so we have a contradiction! Therefore, <img src='http://s0.wp.com/latex.php?latex=L&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='L' title='L' class='latex' /> is not decidable.</p>
<p>That ends the proof. I hope you found this as cool as I did when I first saw it. Anyway, feel free to leave comments if you have any questions about the proof, or corrections to it if I made a mistake.</p>
<p>-Alan</p>
<p>P.S. It&#8217;s also straightforward to see that there are languages that aren&#8217;t even Turing-recognizable, since there are countably many Turing machines but uncountably many languages.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1219/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/concretenonsense.wordpress.com/1219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/concretenonsense.wordpress.com/1219/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/concretenonsense.wordpress.com/1219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/concretenonsense.wordpress.com/1219/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/concretenonsense.wordpress.com/1219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/concretenonsense.wordpress.com/1219/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/concretenonsense.wordpress.com/1219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/concretenonsense.wordpress.com/1219/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/concretenonsense.wordpress.com/1219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/concretenonsense.wordpress.com/1219/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/concretenonsense.wordpress.com/1219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/concretenonsense.wordpress.com/1219/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1219&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/07/12/cantors-diagonal-argument-and-undecidability/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">aguo777</media:title>
		</media:content>
	</item>
		<item>
		<title>New Character Unlocked: Alan</title>
		<link>http://concretenonsense.wordpress.com/2011/07/12/new-character-unlocked-alan/</link>
		<comments>http://concretenonsense.wordpress.com/2011/07/12/new-character-unlocked-alan/#comments</comments>
		<pubDate>Tue, 12 Jul 2011 00:12:55 +0000</pubDate>
		<dc:creator>Alan Guo</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[intro]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1215</guid>
		<description><![CDATA[Hi everyone! I&#8217;m Alan, and I hope to contribute posts on connections between math and computer science. I majored in math and minored in CS as an undergrad, and will be a first year grad student in the computer science department at MIT in the fall. My interests on the math side include combinatorics, combinatorial [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1215&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Hi everyone! I&#8217;m Alan, and I hope to contribute posts on connections between math and computer science. I majored in math and minored in CS as an undergrad, and will be a first year grad student in the computer science department at MIT in the fall. My interests on the math side include combinatorics, combinatorial games, and commutative algebra. On the computer science side, I&#8217;m interested in algorithms, complexity theory, quantum computing, and artificial intelligence. The underlying theme behind my interests is the application of mathematical knowledge to do things more efficiently or prove that they&#8217;re hard to do, and as such my posts will have particular emphasis on this theme.</p>
<p>My underdeveloped academic homepage can be found <a href="http://www.mit.edu/~aguo/">here</a>.</p>
<p>In my spare time, I devoutly follow hedonism by enjoying food, music, movies and video games.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1215/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1215/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/concretenonsense.wordpress.com/1215/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/concretenonsense.wordpress.com/1215/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/concretenonsense.wordpress.com/1215/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/concretenonsense.wordpress.com/1215/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/concretenonsense.wordpress.com/1215/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/concretenonsense.wordpress.com/1215/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/concretenonsense.wordpress.com/1215/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/concretenonsense.wordpress.com/1215/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/concretenonsense.wordpress.com/1215/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/concretenonsense.wordpress.com/1215/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/concretenonsense.wordpress.com/1215/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/concretenonsense.wordpress.com/1215/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1215&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/07/12/new-character-unlocked-alan/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">aguo777</media:title>
		</media:content>
	</item>
		<item>
		<title>PSA: &#8220;RandomTableau&#8221; on Mathematica is broken (but Sage works fine)</title>
		<link>http://concretenonsense.wordpress.com/2011/07/09/psa-randomtableau-on-mathematica-is-broken-but-sage-works-fine/</link>
		<comments>http://concretenonsense.wordpress.com/2011/07/09/psa-randomtableau-on-mathematica-is-broken-but-sage-works-fine/#comments</comments>
		<pubDate>Sun, 10 Jul 2011 03:12:21 +0000</pubDate>
		<dc:creator>JBL</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Combinatorics]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[young tableaux]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1207</guid>
		<description><![CDATA[This post is just a public service announcement: the function RandomTableau[p] in Mathematica is meant to give a random standard Young tableau of shape p. Though the documentation isn&#8217;t actually so helpful as to tell you explicitly what probability distribution it is allegedly generating, the only natural option is uniform over all tableaux of that [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1207&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This post is just a public service announcement: the function RandomTableau[<em>p</em>] in Mathematica is meant to give a random standard Young tableau of shape <em>p</em>.  Though the <a href="http://reference.wolfram.com/mathematica/Combinatorica/ref/RandomTableau.html">documentation</a> isn&#8217;t actually so helpful as to tell you explicitly what probability distribution it is allegedly generating, the only natural option is uniform over all tableaux of that shape.  However, the function RandomTableau[<em>p</em>] <strong>does not generate tableaux uniformly</strong>.  The rest of this post quickly outlines the evidence for this claim.</p>
<p><span id="more-1207"></span>Given a standard Young tableau <em>T</em>, we can define the <em>JDT path</em> on <em>T</em>, as follows: starting from the (1, 1) entry of <em>T</em> (which happens always to be filled with the number 1), move to the smaller of the entry directly below (1, 1) and the entry directly to the right of (1, 1).  (Here we work in English notation.)  Then repeat this process: always move down or to the right, to whichever box has the smaller entry, until you reach the boundary of the shape.</p>
<p>It&#8217;s possible to ask a variety of probabilistic questions related to this path, but I&#8217;m going to narrow in very quickly on the particular case of interest: given the shape <em>p</em>, we can choose a tableau <em>T</em> of shape <em>p</em> at random.  We may then ask which boxes of <em>p</em> lie on the JDT path of <em>T</em>.  For example, with the shape <img src='http://s0.wp.com/latex.php?latex=p+%3D+%284%2C+3%2C+2%2C+1%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='p = (4, 3, 2, 1)' title='p = (4, 3, 2, 1)' class='latex' />, it&#8217;s easy to show (e.g., by exhaustive enumeration; there are 768 tableaux of this shape) that it is equally likely that the (1, 3), (2, 2) and (3, 1) entries lie on the JDT path of a random tableau of shape <em>p</em>.  (Incidentally, the fact that these three cells are equally likely is just a convenient freak coincidence for this shape.)  However, after running 75,000 trials on Mathematica, I find that the command RandomTableau[{4,3,2,1}] generates a tableau in which the JDT path contains (2, 2) about 31.5% of the time.  For reference, the probability of a given side coming up only 31.5% of the time when flipping a fair three-sided coin 10,000 times is only about 0.00005, and this happened 7 times in a row.</p>
<p>After that (fairly conclusive) experiment, I also tried keeping track of which tableaux are most favored and which are least favored by Mathematica.  Running through about 150,000 random tableaux suggests that tableaux that look like</p>
<p style="padding-left:90px;">1  5  8  10<br />
2  6  9<br />
3  7<br />
4</p>
<p>are favored (appearing roughly 50% more often than expected) while those like</p>
<p style="padding-left:90px;">1 3 5 10<br />
2 4 8<br />
6 9<br />
7</p>
<p>are disfavored (appearing roughly 30% less often than expected).</p>
<p>I submitted a short description of the problem to the Mathematica support desk in February, got an e-mail back saying they&#8217;d look into it and let me know if they wanted to hear more, and haven&#8217;t heard anything since.</p>
<p>For those who are curious, the equivalent command in Sage works like a charm.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1207/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/concretenonsense.wordpress.com/1207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/concretenonsense.wordpress.com/1207/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/concretenonsense.wordpress.com/1207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/concretenonsense.wordpress.com/1207/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/concretenonsense.wordpress.com/1207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/concretenonsense.wordpress.com/1207/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/concretenonsense.wordpress.com/1207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/concretenonsense.wordpress.com/1207/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/concretenonsense.wordpress.com/1207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/concretenonsense.wordpress.com/1207/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/concretenonsense.wordpress.com/1207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/concretenonsense.wordpress.com/1207/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&amp;blog=2918042&amp;post=1207&amp;subd=concretenonsense&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/07/09/psa-randomtableau-on-mathematica-is-broken-but-sage-works-fine/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jblblog</media:title>
		</media:content>
	</item>
	</channel>
</rss>
