<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Concrete Nonsense</title>
	<atom:link href="http://concretenonsense.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://concretenonsense.wordpress.com</link>
	<description>A group blog about mathematics</description>
	<lastBuildDate>Thu, 13 Jun 2013 20:53:51 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='concretenonsense.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Concrete Nonsense</title>
		<link>http://concretenonsense.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://concretenonsense.wordpress.com/osd.xml" title="Concrete Nonsense" />
	<atom:link rel='hub' href='http://concretenonsense.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Defense Against the Dark Arts: the Reverse Regression Effect</title>
		<link>http://concretenonsense.wordpress.com/2013/02/16/defense-against-the-dark-arts-the-reverse-regression-effect/</link>
		<comments>http://concretenonsense.wordpress.com/2013/02/16/defense-against-the-dark-arts-the-reverse-regression-effect/#comments</comments>
		<pubDate>Sat, 16 Feb 2013 22:18:38 +0000</pubDate>
		<dc:creator>yanzhang</dc:creator>
				<category><![CDATA[Probability]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[causality]]></category>
		<category><![CDATA[reverse regression effect]]></category>
		<category><![CDATA[Simpson's Paradox]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1434</guid>
		<description><![CDATA[Much like our harmonious world, imagine a world populated with blue and green men living as brothers, ignoring the difference of the color of their skin. Much like our exciting world, both blue and green men get educated, go into the real world, and get jobs. One day, a green activist tells you that according [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1434&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Much like our harmonious world, imagine a world populated with blue and green men living as brothers, ignoring the difference of the color of their skin. Much like our exciting world, both blue and green men get educated, go into the real world, and get jobs. One day, a green activist tells you that according to a recent study, for every given education level, blue men receive much higher salary on average than green men, a clear sign of discrimination against the greens &#8212; why should equally educated green men not be paid as much as blue men? After leaving you indignant and disappointed, a blue activist tells you that according to another study, for every salary level, green men are less educated than blue men, a clear sign of discrimination against the blues &#8212; why should blue men study harder in school than green men to get the same rewards for their work?</p>
<p>The punchline, of course, is that the two studies come from the same data. Judea Pearl mentions it as a throwaway comment in a general talk &#8220;The Art and Science of Cause and Effect,&#8221; which I read in his great work <em>Causality</em>. In particular, the paper referenced was Goldberger&#8217;s 1984 paper <em>Reverse Regression and Salary Discrimination, </em>with actual data demonstrating the effect (though with a different color palette). I find it surprising that Pearl would dedicate an entire chapter in <em>Causality</em> to Simpson&#8217;s Paradox but only one comment in the talk to this effect. While it is somewhat similar to Simpson&#8217;s Paradox (which I may talk about some other time), they are decidedly different in origin and I find this phenomenon much more relevant, prominent, and chilling. Let me explain.<span id="more-1434"></span></p>
<h2>&#8220;You must have set it up wrong.&#8221;</h2>
<p>I&#8217;ve heard this comment several times already. The argument goes something like this: unless the data is really pathological, we can assume the data is something like a curve for each blue and green on the education vs. salary plot. Let&#8217;s say that education is the x-axis and salary the y-axis. The first report shows that in general we need to have the blue curve <strong>above</strong> the green curve at all vertical slices of the graph. But this means the blue curve must be <strong>left</strong> of the green curve at all horizontal slices of the graph, which contradicts the second report.</p>
<dl id="attachment_212">
<dt><a href="http://yanzhang.files.wordpress.com/2013/02/what2.png"><img alt="It seems we need the blue line to lie above the green line and to the right of the green line, which is impossible." src="http://yanzhang.files.wordpress.com/2013/02/what2.png?w=450&#038;h=337" width="450" height="337" /></a></dt>
<dd></dd>
</dl>
<p>Like all other examples of good math done in the wrong situations, the problem lies not with the logic above (which is correct) but with the assumptions. Instead, let&#8217;s make another very commonplace generative model:</p>
<ul>
<li><strong>assume the employers were just and higher education corresponds to higher salary</strong>, and there is no discrimination at all; (equivalently, have the education vs. salary plot be centered on the x=y line)</li>
<li>for whatever reason, <strong>have the blue men be more educated than the green men and thus get more salary</strong> (this could be, of course, due to prior discrimination in the history of the world, socioeconomic circumstances, etc. But the point is we don&#8217;t have that kind of information in our data. That&#8217;s why I stressed that the <strong>employers</strong> were just and not the <strong>world</strong>).</li>
<li>important: <strong>add some noise</strong>. So we don&#8217;t have well-defined lines, but some variation in ability and occasional misplacement of salary to ability.</li>
</ul>
<p>This gives you a graph that looks more like the following:</p>
<dl id="attachment_213">
<dt><a href="http://yanzhang.files.wordpress.com/2013/02/what.png"><img alt="A more &quot;realistic&quot; and equally natural model, where the main addition to the previous graph is noise." src="http://yanzhang.files.wordpress.com/2013/02/what.png?w=450&#038;h=337" width="450" height="337" /></a></dt>
<dd></dd>
</dl>
<p>And now you see it &#8212; on vertical slices, blue tend to be higher; on horizontal slices, blue tends to be on the right, something that we thought was very counter-intuitive due to the &#8220;lines&#8221; model. As Boris Alexeev pointed out to me, the <strong>noise</strong> is actually doing all of the work in this model!</p>
<p>The watchful reader will notice that this model makes very few assumptions and is very natural, and surely he/she can make some ideas of real-world data that follows this pattern. This is one main difference between the reverse regression effect and Simpson&#8217;s Paradox &#8212; by some fairly natural definition of &#8220;random,&#8221; a random set of data will get into the Simpson&#8217;s Paradox situation about 1/60 of the time (see Pearl). The RRE  does not depend on such coincidences.</p>
<h2>Interpretations and Discrimination</h2>
<p>So what is really going on? Some people get very impatient at this point: &#8220;is there discrimination or is there not?&#8221; Well, whether you believe my simple generative model or not, what is objectively going on is just that the data is telling a very simple story, which is that A) <strong>blue men are more educated </strong>and B) <strong>more educated people get paid more</strong>. Sadly, by trying to torture the data to make it talk, we are overstepping our bounds on what we can extract from the data. What the two activists are doing is trying to pull some complicated mechanism, like discrimination, out in a contorted way from very simple data.</p>
<p>An obvious question to ask at this point is: &#8220;<strong>how do we tell when discrimination exists</strong>?&#8221; Well, discrimination is a complicated object and it could come in different forms, including, say:</p>
<ul>
<li>discrimination-1: employers get equally qualified people, but then pay them less if they were green men</li>
<li>discrimination-2: employers are just, but the blue people discriminated against green people earlier and green men were put in tough socio-economical situations</li>
<li>discrimination-3: &#8230;</li>
</ul>
<p>The point is finding different kinds of discrimination is different, but we have an overloaded (and very emotionally charged!) word &#8220;discrimination.&#8221; <strong>To really find discrimination, you have to define each carefully and look for them separately. </strong>Using Pearl&#8217;s language, I would outline all different types of discrimination as different <em>causal mechanisms. </em>(formally, this means different graphical models using Pearl&#8217;s approach) I would then look at the world and see if those models make sense for the data. Qualitatively, it means something we knew intuitively all along: we need more history about this blue-green world and see how it compares with other worlds: maybe in most other worlds the green men are equally educated as blue men; maybe in this world the education of green men were repressed a century ago and education level has a hereditary effect. In our situation, a careful statement would be something like:</p>
<ul>
<li>discrimination-1: we don&#8217;t see discrimination-1, rather evidence against it;<strong> </strong></li>
<li>discrimination-2: we don&#8217;t see it, but we don&#8217;t see evidence against either. We don&#8217;t have the right kind of data to find discrimination-2.</li>
<li>discrimination-3: &#8230;<strong> </strong></li>
</ul>
<p>Yes, reductionism makes it more complicated. But it is necessary if we need to make precise judgments.</p>
<h2>Epilogue: the Dark Arts</h2>
<p>Why did I mention the Dark Arts? Can this information be maliciously used? <strong>Hell yes. I&#8217;ve literally just given you a recipe to finagle the politically charged idea of discrimination, GOING BOTH WAYS, from a single set of data</strong>. So if you feel like it, you can take basically any reasonable population story and create a perfectly-reasonable-sounding discrimination case against either side! I can even go a step further and see a use case when someone can point to this very effect to claim that no discrimination has taken place in a world where it really did (for example, maybe discrimination has actually caused the blue-green difference in education in the first place)! It is very important to stress that <strong>the RRE does not show that discrimination does not exist, rather that similar data like this example alone does not show discrimination</strong>. If this commonplace but nuanced situation is not Dark Arts (the knowledge of which is prerequisite to defend against it) of mathematics, I don&#8217;t know what is. The only thing I can do is to use this knowledge responsibly;  I hope it has similarly helped you.</p>
<p>Thanks to Boris Alexeev for first pointing me towards this effect and Pearl&#8217;s work on Causality. It was one of the best time investments I&#8217;ve ever made for mathematics.</p>
<p>-Yan</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1434/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1434/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1434&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2013/02/16/defense-against-the-dark-arts-the-reverse-regression-effect/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ff7abdf56e85a53563fc0e360de7e5dc?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">KR</media:title>
		</media:content>

		<media:content url="http://yanzhang.files.wordpress.com/2013/02/what2.png" medium="image">
			<media:title type="html">It seems we need the blue line to lie above the green line and to the right of the green line, which is impossible.</media:title>
		</media:content>

		<media:content url="http://yanzhang.files.wordpress.com/2013/02/what.png" medium="image">
			<media:title type="html">A more &#34;realistic&#34; and equally natural model, where the main addition to the previous graph is noise.</media:title>
		</media:content>
	</item>
		<item>
		<title>On Distances and Codes</title>
		<link>http://concretenonsense.wordpress.com/2012/12/09/on-distances-and-codes/</link>
		<comments>http://concretenonsense.wordpress.com/2012/12/09/on-distances-and-codes/#comments</comments>
		<pubDate>Sun, 09 Dec 2012 06:20:11 +0000</pubDate>
		<dc:creator>yanzhang</dc:creator>
				<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Analysis]]></category>
		<category><![CDATA[Combinatorics]]></category>
		<category><![CDATA[codes]]></category>
		<category><![CDATA[coding theory]]></category>
		<category><![CDATA[distances]]></category>
		<category><![CDATA[finite fields]]></category>
		<category><![CDATA[Gram matrix]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1428</guid>
		<description><![CDATA[I recently attended an interesting talk by Henry Cohn at the MIT combinatorics seminar on a certain class of (not necessarily linear) codes that include many of the classical creatures we know and love, such as the Hamming, Golay, and Reed-Solomon codes. This classification seeks to concretely capture the vague idea of &#8220;robustness&#8221; that people [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1428&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>I recently attended an interesting talk by Henry Cohn at the MIT combinatorics seminar on a certain class of (not necessarily linear) codes that include many of the classical creatures we know and love, such as the Hamming, Golay, and Reed-Solomon codes. This classification seeks to concretely capture the vague idea of &#8220;robustness&#8221; that people familiar with these codes would feel from working with them. A cute little discussion came up and I wish to record it here and possibly obtain some new ideas. Though the discussion was only tangentially related to the main ideas of Henry&#8217;s presentation, I find the question very natural.<span id="more-1428"></span></p>
<p>The following construction came up: given a code of <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='k' title='k' class='latex' /> codewords in <img src='http://s0.wp.com/latex.php?latex=F_q%5En&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='F_q^n' title='F_q^n' class='latex' />, let <img src='http://s0.wp.com/latex.php?latex=%28A_0%2C+A_1%2C+%5Cldots%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='(A_0, A_1, &#92;ldots)' title='(A_0, A_1, &#92;ldots)' class='latex' /> be the <em>distance enumerator, </em>(apologies to Henry for forgetting the actual name, but I think this is a natural name to use) vector of pairwise distances between ordered pairs of codewords, such that <img src='http://s0.wp.com/latex.php?latex=A_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='A_i' title='A_i' class='latex' /> is the number of pairs of points with distance <img src='http://s0.wp.com/latex.php?latex=i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='i' title='i' class='latex' /> weighted by <img src='http://s0.wp.com/latex.php?latex=1%2Fk&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='1/k' title='1/k' class='latex' /> (this way <img src='http://s0.wp.com/latex.php?latex=A_0+%3D+1&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='A_0 = 1' title='A_0 = 1' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Csum+A_i+%3D+k&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;sum A_i = k' title='&#92;sum A_i = k' class='latex' />). This weaker invariant is then used to analyze the code &#8212; this is actually a strictly weaker invariant, since it can be seen that not all such vectors actually come from actual codes, and some inequalities (using some orthogonal polynomials) describe some necessary (but not sufficient) conditions for these vectors to come from actual codes. The advantage of having these invariants is that they already tell a lot of information (Henry thinks possibly too much information) about the codes when they actually come from codes. Henry was wondering if there is a way to formally capture the idea that these invariants capture so much information.</p>
<p>(Small digression: I personally find this concept similar to the weight enumerators in codes, which are generating functions that capture the weights (number of <img src='http://s0.wp.com/latex.php?latex=1&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='1' title='1' class='latex' />&#8216;s) in a binary code. Not all these generating functions actually come from codes, so the ones that do not are often called <em>psuedocodes</em>, but it is surprising how much information one can get from weight enumerators alone to tell stories and constraints about codes (see, for example, the MacWilliams identities). This concept is mirrored somewhat in Henry&#8217;s discussion.)</p>
<p>Anyway, I suggested thinking about the following natural question: consider the map that sends a code to its distance enumerator. Maybe you can show that the fibers of this map are fairly small, especially if you constrain yourself to, say, linear codes. If you can do this, it shows you don&#8217;t have a lot of wiggle room and it is clear that the distances themselves are going to capture most of the information.</p>
<p>Afterwards, Richard Stanley, being Richard, quickly figured out that this wouldn&#8217;t work. The following calculation suffices: there are <img src='http://s0.wp.com/latex.php?latex=%7Bq%5En+%5Cchoose+k%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='{q^n &#92;choose k}' title='{q^n &#92;choose k}' class='latex' /> codes. Distance enumerators sum to <img src='http://s0.wp.com/latex.php?latex=k&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='k' title='k' class='latex' />, so an upper bound is something like <img src='http://s0.wp.com/latex.php?latex=k+%2B+n+-1+%5Cchoose+k&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='k + n -1 &#92;choose k' title='k + n -1 &#92;choose k' class='latex' />. But it is clear that the fibers are way too large if we, say, let <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='n' title='n' class='latex' /> grow. What if we limit to linear codes? Well, we can count these with our favorite q-analogue, <img src='http://s0.wp.com/latex.php?latex=%7Bn+%5Cchoose+%5Clog%28k%29%7D_q&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='{n &#92;choose &#92;log(k)}_q' title='{n &#92;choose &#92;log(k)}_q' class='latex' />. (I am taking base <img src='http://s0.wp.com/latex.php?latex=2&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='2' title='2' class='latex' /> here, as a <img src='http://s0.wp.com/latex.php?latex=t&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='t' title='t' class='latex' />-dimensional code has <img src='http://s0.wp.com/latex.php?latex=2%5Et&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='2^t' title='2^t' class='latex' /> elements). We have many fewer of these, but since <img src='http://s0.wp.com/latex.php?latex=q+%3E+1&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='q &gt; 1' title='q &gt; 1' class='latex' /> these are still huge compared to the upper bound for the distance enumerators we gave, so it can&#8217;t be that limiting.</p>
<p>That said, the fact remains that<strong> it is unclear that there is a good algorithm to test if a given distance enumerator actually comes from a code (linear or not)</strong>. This is definitely decidable since the problem is finite, but the naive algorithm is absolutely horrible, and I haven&#8217;t come up with any ideas. Anything is welcome! By the way, it is fun to try this in <img src='http://s0.wp.com/latex.php?latex=R%5En&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='R^n' title='R^n' class='latex' /> instead of <img src='http://s0.wp.com/latex.php?latex=F_q%5En&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='F_q^n' title='F_q^n' class='latex' />. Then the problem becomes: &#8220;given <img src='http://s0.wp.com/latex.php?latex=n%5E2&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='n^2' title='n^2' class='latex' /> pairwise distances, do we know if they came from <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='n' title='n' class='latex' /> points?&#8221; This seems like a ridiculously natural question (maybe even applied-worthy) but I don&#8217;t know what is known about it. The closest is this: if we had a guess that matched up all the distances with all the pairs of points (this is <img src='http://s0.wp.com/latex.php?latex=O%28n%5E2%21%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(n^2!)' title='O(n^2!)' class='latex' /> time in the most naive case), we can look at the matrix and see if it is positive semi-definite! Kind of miraculously, such a matrix is a Gram matrix (comes from inner products) if and only if it is positive semi-definite. So this at least gives some idea of how to go on &#8211; maybe split this problem into assigning the ordered pairs and then seeing if the matrix comes from inner products? I&#8217;m not aware of a similar condition over finite fields for the second part, so maybe this approach is useless in that case.</p>
<p>Happy end of semester, everyone.</p>
<p>-Yan</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1428/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1428/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1428&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2012/12/09/on-distances-and-codes/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ff7abdf56e85a53563fc0e360de7e5dc?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">KR</media:title>
		</media:content>
	</item>
		<item>
		<title>FindStat</title>
		<link>http://concretenonsense.wordpress.com/2012/08/12/findstat/</link>
		<comments>http://concretenonsense.wordpress.com/2012/08/12/findstat/#comments</comments>
		<pubDate>Sun, 12 Aug 2012 23:58:04 +0000</pubDate>
		<dc:creator>JBL</dc:creator>
				<category><![CDATA[Combinatorics]]></category>
		<category><![CDATA[Computer algebra]]></category>
		<category><![CDATA[FPSAC]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1418</guid>
		<description><![CDATA[In addition to the usual array of excellent talks and posters at FPSAC this year, there was a presentation by Chris Berg and Christian Stump on their new combinatorial statistic finder FindStat.  It&#8217;s intended to serve as a sort of OEIS but for combinatorial statistics, i.e., integer data associated to combinatorial objects like permutations or [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1418&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>In addition to the usual array of excellent <a href="http://www.math.nagoya-u.ac.jp/fpsac12/proceedings.html">talks and posters</a> at <a href="http://www.math.nagoya-u.ac.jp/fpsac12/">FPSAC</a> this year, there was a <a href="http://www.math.nagoya-u.ac.jp/fpsac12/download/slide/0731/software-demonstration.pdf">presentation</a> by Chris Berg and Christian Stump on their new combinatorial statistic finder <a href="http://www.findstat.org/">FindStat</a>.  It&#8217;s intended to serve as a sort of <a href="http://oeis.org/">OEIS</a> but for combinatorial statistics, i.e., integer data associated to combinatorial objects like permutations or partitions.  In fact, though, it&#8217;s potentially much more powerful: for example, if you give a statistic on permutations, it not only checks its database of permutation statistics but also pushes the statistic through various combinatorial maps (like RSK) and then checks whether the statistic arises from some statistic on other combinatorial objects (like partitions).  Like the OEIS, they solicit contributions from the community.  It&#8217;s still in a very rudimentary stage, but I think it has a lot of promise.  Or as Yan asked at the end of the presentation, &#8220;Why are you guys so awesome?&#8221;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1418/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1418/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1418&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2012/08/12/findstat/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ef7b55858ab2457c72dfc7b2f2bd95c2?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">jblblog</media:title>
		</media:content>
	</item>
		<item>
		<title>PRIMES 2012</title>
		<link>http://concretenonsense.wordpress.com/2012/06/19/primes-2012/</link>
		<comments>http://concretenonsense.wordpress.com/2012/06/19/primes-2012/#comments</comments>
		<pubDate>Tue, 19 Jun 2012 17:32:23 +0000</pubDate>
		<dc:creator>JBL</dc:creator>
				<category><![CDATA[Combinatorics]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Representation Theory]]></category>
		<category><![CDATA[adinkra]]></category>
		<category><![CDATA[Cherednik algebra]]></category>
		<category><![CDATA[chromatic polynomial]]></category>
		<category><![CDATA[high school research]]></category>
		<category><![CDATA[MIT]]></category>
		<category><![CDATA[pattern avoidance]]></category>
		<category><![CDATA[posets]]></category>
		<category><![CDATA[PRIMES]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1414</guid>
		<description><![CDATA[At the end of May there was a two-day conference at MIT for the PRIMES program.  About 30 high school students presented the research they&#8217;ve done over the last 6 months, as part of collaborative teams including grad students and sometimes undergrads, professors and post-docs at MIT and a few nearby universities. The quality of [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1414&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>At the end of May there was a two-day conference at MIT for the <a href="http://web.mit.edu/primes/index.shtml">PRIMES</a> program.  About 30 high school students presented the research they&#8217;ve done over the last 6 months, as part of collaborative teams including grad students and sometimes undergrads, professors and post-docs at MIT and a few nearby universities. The quality of the work overall was quite impressive, even without taking into consideration that it was primarily performed by high school students.  The talk abstracts are available <a href="http://web.mit.edu/primes/materials/2012/primesabstracts2012.pdf">on the PRIMES website</a>; the papers that result will go online some time in December or January.  You can read last years&#8217; papers <a href="http://web.mit.edu/primes/papers.shtml">here</a>.</p>
<p>Several contributors to this blog were mentors of PRIMES students: my students Ravi and Nihal presented some very nice work on generalizations of pattern avoidance in alternating permutations; this extends work in my thesis as well as work of Julian West and collaborators on shape-Wilf equivalence.  Steven&#8217;s student Sheela presented work that&#8217;s a continuation of a project she began last year on the representation theory of Cherednik algebras.  Yan&#8217;s student Aaron (who also is my <a href="http://arxiv.org/abs/1203.5804">coauthor</a> in work based on his PRIMES project from 2011) presented his work studying the number of ways to put a graded poset structure on a given graph.  As I understand it, this question comes from work of Yan on adinkras, and is both natural and apparently unstudied.  The cutest result Aaron presented was the following: if <em>G</em> is a graph all of whose cycles are generated by its 4-cycles then the number of graded poset structures on G is <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7B1%7D%7B3%7D+%5Cchi_G%283%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;frac{1}{3} &#92;chi_G(3)' title='&#92;frac{1}{3} &#92;chi_G(3)' class='latex' />, where <img src='http://s0.wp.com/latex.php?latex=%5Cchi_G&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;chi_G' title='&#92;chi_G' class='latex' /> is the <a href="https://en.wikipedia.org/wiki/Chromatic_polynomial">chromatic polynomial</a> of <em>G</em>.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1414/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1414/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1414&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2012/06/19/primes-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ef7b55858ab2457c72dfc7b2f2bd95c2?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">jblblog</media:title>
		</media:content>
	</item>
		<item>
		<title>Whitney Sum Formula and Exponentiation</title>
		<link>http://concretenonsense.wordpress.com/2012/03/16/whitney-sum-formula-and-exponentiation/</link>
		<comments>http://concretenonsense.wordpress.com/2012/03/16/whitney-sum-formula-and-exponentiation/#comments</comments>
		<pubDate>Fri, 16 Mar 2012 05:18:30 +0000</pubDate>
		<dc:creator>yanzhang</dc:creator>
				<category><![CDATA[Algebra]]></category>
		<category><![CDATA[Geometry & Topology]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1408</guid>
		<description><![CDATA[I&#8217;m alive, just a bit antisocial &#8211; I&#8217;ve been spelunking in the wonderful caves of classifying spaces and group cohomology, which really got me to revisit my horrible algebraic topology fundamentals. I&#8217;m still a toddler in this field, but I think I&#8217;m beginning to take baby steps so I hope Haynes Miller and my other [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1408&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>I&#8217;m alive, just a bit antisocial &#8211; I&#8217;ve been spelunking in the wonderful caves of classifying spaces and group cohomology, which really got me to revisit my horrible algebraic topology fundamentals. I&#8217;m still a toddler in this field, but I think I&#8217;m beginning to take baby steps so I hope Haynes Miller and my other topology teachers (Anatoly, Dustin, Nick, etc.) would be eventually proud.</p>
<p>I just wanted to share a very short and simple insight today that made me very happy thanks to the Whitney sum formula<strong>: taking total Stiefel-Whitney classes are like exponentiating bundles</strong> (because taking total Stiefel-Whitney classes of a direct sum of two bundles becomes multiplication of the total Stiefel-Whitney classes of the individual bundles. This is &#8220;well duh&#8221; type of information to seasoned topologists, but to me this is exciting for several reasons:<span id="more-1408"></span></p>
<ul>
<li>for storing information in my brain, which is among one of the least well-suited-for-math brains in the department. For some reason, this aphorism-ish idea suddenly made it feel like I could manipulate these guys a lot better. Thankfully too, as a few months ago they were completely mysterious to me and I need them to do some computations.</li>
<li>for computation, this allows us to divide-and-conquer. This is the obvious &#8220;better bombs and banks&#8221; reason for mathematicians. For me personally, it is using the folk idea that we can associate representations of a finite group with a real vector bundle and then calculate the Stiefel-Whitney classes of the bundle, in which case the decomposition of representations into direct sums exactly corresponds with taking direct sums of our bundles!</li>
<li>for crazy ideas, this means I can think of it as a kind of exponential generating function associated to my bundle, in the sense that we associate exponential generating functions to combinatorial structures in combinatorics. It may be interesting to think about what &#8220;inverse Stiefel-Whitney classes&#8221; may mean, or even proving combinatorial generating function formulae as &#8220;categorification&#8221; of playing with bundles! I haven&#8217;t quite seen any good examples of this, so I&#8217;d be happy to hear some, or maybe even make some.</li>
<li>for the cool idea that math &#8220;forces itself to happen,&#8221; it is a good mental experiment to consider what classes can possibly have this exponential property, in the sense you can think of defining the exponential function via <img src='http://s0.wp.com/latex.php?latex=e%5E%7Bx%2By%7D+%3D+e%5Ex+e%5Ey&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='e^{x+y} = e^x e^y' title='e^{x+y} = e^x e^y' class='latex' />. Of course we have to play with the constants a bit &#8211; in our analogue this is just making sure the classes take the value <img src='http://s0.wp.com/latex.php?latex=1&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='1' title='1' class='latex' /> on trivial bundles &#8211; but it ends up being quite a restrictive property. Segal and Stretch&#8217;s &#8220;Characteristic Classes for permutation representations&#8221; explores this kind of perspective.</li>
</ul>
<p>Just a short breath to catch some air &#8211; it is a busy year where I have some more logistical duties and side projects. Back to the topological caves I go, though I really hope this kind of thinking would be helpful for at least one of you.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1408/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1408/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1408&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2012/03/16/whitney-sum-formula-and-exponentiation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ff7abdf56e85a53563fc0e360de7e5dc?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">KR</media:title>
		</media:content>
	</item>
		<item>
		<title>Some talks</title>
		<link>http://concretenonsense.wordpress.com/2012/01/18/some-talks/</link>
		<comments>http://concretenonsense.wordpress.com/2012/01/18/some-talks/#comments</comments>
		<pubDate>Thu, 19 Jan 2012 04:12:01 +0000</pubDate>
		<dc:creator>Steven Sam</dc:creator>
				<category><![CDATA[Algebraic Geometry]]></category>
		<category><![CDATA[Commutative Algebra]]></category>
		<category><![CDATA[Representation Theory]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1401</guid>
		<description><![CDATA[Unfortunately things have gotten more busy so I haven&#8217;t been writing on the blog. However, I found a new way to prepare talks that I really like: I type out my notes in a way which I hope is readable to someone who has not been to the talk. It is halfway between an expository [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1401&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Unfortunately things have gotten more busy so I haven&#8217;t been writing on the blog. However, I found a new way to prepare talks that I really like: I type out my notes in a way which I hope is readable to someone who has not been to the talk. It is halfway between an expository paper and lecture notes and the advantage is that I have a product which I can share on my webpage (and I find it more useful than slides &#8212; I am starting to dislike slides presentations for seminar talks). Anyway, I&#8217;ll advertise some of these here (I&#8217;ve never actually posted about my own research on this blog, so it&#8217;s a first!):</p>
<p>1. I&#8217;ve been working with Laurent Gruson and Jerzy Weyman on finding geometric interpretations for orbits in &#8220;Vinberg <img src='http://s0.wp.com/latex.php?latex=%5Ctheta&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;theta' title='&#92;theta' class='latex' />-representations&#8221;. I gave a talk on this at Princeton (<a href="http://math.mit.edu/~ssam/writings/talks/bhargavatalk.pdf">notes</a>) and Michigan (<a href="http://math.mit.edu/~ssam/writings/talks/michigan-GIT.pdf">notes</a>). The Princeton talk is more introductory in nature, and even though there is overlap, the two sets of notes should complement one another (for the record, both talks were approximately 1 hour, 45 minutes).</p>
<p>2. A separate project with Weyman involves trying to understand Koszul homology for certain classes of determinantal-like ideals. The motivation comes from trying to classify minimal free resolutions over quadric hypersurface rings and in trying to understand a certain result of Koike and Terada in combinatorial representation theory. I&#8217;m giving a talk on this tomorrow at Michigan (<a href="http://math.mit.edu/~ssam/writings/talks/michigan-koszulhomology.pdf">notes</a>).</p>
<p>It&#8217;s a bit time-consuming, but I think preparing notes like this can be very useful, especially for projects which haven&#8217;t been finished yet (it helps me gain direction). I hope more people try it!</p>
<p>-Steven</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1401/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1401/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1401&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2012/01/18/some-talks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/9e382356962deed0afd623ac51cbe101?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">masnevets</media:title>
		</media:content>
	</item>
		<item>
		<title>(3+1)-avoiding posets</title>
		<link>http://concretenonsense.wordpress.com/2011/12/04/3-plus-1-avoiding-posets/</link>
		<comments>http://concretenonsense.wordpress.com/2011/12/04/3-plus-1-avoiding-posets/#comments</comments>
		<pubDate>Sun, 04 Dec 2011 14:00:53 +0000</pubDate>
		<dc:creator>JBL</dc:creator>
				<category><![CDATA[Combinatorics]]></category>
		<category><![CDATA[graph theory]]></category>
		<category><![CDATA[posets]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1301</guid>
		<description><![CDATA[Yan and I recently put a paper on the arXiv that enumerates the graded (3+1)-avoiding posets.  In this post, I kill the adjective &#8220;graded&#8221; and talk a bit about what (3+1)-avoiding posets are and why they&#8217;re interesting.  If you don&#8217;t know what a poset is, I&#8217;ve included the definition in Note 0 at the bottom of the post. [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1301&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Yan and I recently put a paper on the arXiv that <a href="http://arxiv.org/abs/1106.5480">enumerates the graded (<strong>3</strong>+<strong>1</strong>)-avoiding <em></em>posets</a>.  In this post, I kill the adjective &#8220;graded&#8221; and talk a bit about what (<strong>3</strong>+<strong>1</strong>)-avoiding <em></em>posets are and why they&#8217;re interesting.  If you don&#8217;t know what a poset is, I&#8217;ve included the definition in Note 0 at the bottom of the post.</p>
<p>As with any object as general as posets, we are mostly interested not in results about all posets, but rather in finding particular families of posets with interesting or unexpected properties.  One such family of posets are the (<strong>3</strong>+<strong>1</strong>)-avoiding posets.  These are the posets that do not contain four elements, say <em>a</em>, <em>b</em>, <em>c</em>, and <em>d</em>, such that <em>a</em> &lt; <em>b</em> &lt; <em>c</em> and <em>d</em> is incomparable to the other three.  A short digression to explain the name &#8220;(<strong>3</strong>+<strong>1</strong>)-avoiding&#8221;: one natural class of posets are the <em>chains</em>, finite total orders like the first example in the previous paragraph.  A natural name for the chain with <img src='http://s0.wp.com/latex.php?latex=n&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='n' title='n' class='latex' /> vertices is <strong><em>n</em></strong>, so the chain with three vertices is <strong>3</strong> and the chain with one vertex (the only poset with exactly one vertex) is <strong>1</strong>.  There are several natural operations that we can use to combine posets, including the disjoint union, denoted &#8220;+&#8221;.  Thus, <strong>3</strong>+<strong>1</strong> is the poset that you get if you add a single isolated vertex to a three-element chain, and a poset is (<strong>3</strong>+<strong>1</strong>)-avoiding if it has no four elements that induce the poset <strong>3</strong>+<strong>1</strong>.</p>
<p><span id="more-1301"></span></p>
<p>It&#8217;s a common surprise in combinatorics to find that important objects are characterized by avoiding certain induced subobjects; subposet-avoidance is one particular example of this, and in fact (<strong>3</strong>+<strong>1</strong>)-avoiding posets show up in a number of unexpected places.  The simplest and perhaps nicest appearance is in the characterization of <em>semiorders</em>.  Semiorders are posets that arise in the following way: we have a set of data (real numbers generated by some experiment) such that each datapoint has error bars of the same size.  Thus, if two values <em>a</em> and <em>b</em> are separated by at least a fixed distance then we know their true order; but if they are separated by less than this distance, we can&#8217;t be certain which value is truly larger.  The relations we can be certain of are the relations of our poset.  It&#8217;s not hard to see that this definition is equivalent to the following: a semiorder is a poset whose elements are unit intervals in the real numbers, with one element less than another if and only if it lies entirely to its left.  (Aside: an easy exercise is to show that it doesn&#8217;t matter whether our intervals are open or closed.)  The main result of interest is that semiorders are exactly those posets that avoid <strong>3</strong>+<strong>1</strong> and <strong>2</strong>+<strong>2</strong>.  (If we drop the &#8220;unit&#8221; in &#8220;unit intervals&#8221;, we get just (<strong>2</strong>+<strong>2</strong>)-avoiding posets.)  These posets have all sorts of nice properties; for example, the number of them with <em>n</em> unlabeled elements is exactly the <em>n</em>th Catalan number, so we immediately know we&#8217;re going to get nice combinatorics and lots of connections with other objects.  See <a href="http://en.wikipedia.org/wiki/Semiorder">Wikipedia</a> and the work of Peter Fishburn for much more information about them. <strong></strong></p>
<p>A second set of connections comes from the following simple observation: a poset <em>P</em> avoids <strong>3</strong>+<strong>1</strong> if and only if its <em>incomparability graph</em> (i.e., the graph <em>G</em> on the same vertex set such that <em>u</em> is connected to <em>v</em> in <em>G</em> if and only if <em>u</em> is incomparable to <em>v</em> in <em>P</em>) is <em>claw-free</em>, i.e., contains no four vertices <em>a</em>, <em>b</em>, <em>c</em>, <em>d</em> such that <em>d</em> is connected to <em>a</em>, <em>b</em> and <em>c</em>, none of which are connected to each other.  Claw-free graphs are quite interesting; for example, they make an appearance in some <a href="http://arxiv.org/abs/1105.0698">recent work of Fadnavis</a>, who proved the following pretty result:</p>
<blockquote><p>Suppose that we wish to color a graph <em>G</em> with <em>q</em> colors, choosing the color for each vertex at random.  If <em>G</em> is claw-free then to maximize the chance that the resulting coloring is a proper coloring (i.e., no two adjacent vertices have the same color), we should choose colors uniformly at random (i.e., with equal probabilities 1/<em>q</em>).</p></blockquote>
<p>(Quite surprisingly, this result is not true in general!  To 2-color a star graph with 4 or more points, you&#8217;re better off with a more lop-sided distribution.)  Actually this result is just one part of a bigger story; for example, it&#8217;s also related to the Stanley-Stembridge conjecture, which asserts that symmetric chromatic polynomial of the comparability graph of a (<strong>3</strong>+<strong>1</strong>)-avoiding poset is <em>e</em>-positive.  (For definitions, check the Fadnavis paper, which is really excellent and has a lot of interesting material.)</p>
<p>As an enumerative combinatorialist, all these nice features of (<strong>3</strong>+<strong>1</strong>)-avoiding posets make me want to count them.  And, in some sense one should expect this to be not too difficult: claw-free graphs and (<strong>3</strong>+<strong>1</strong>)-avoiding posets both have nice structural classifications (due to Chudnovsky &amp; Seymour and to Skandera, respectively), and the related (<strong>2</strong>+<strong>2</strong>)-avoiding posets have been enumerated by Bousquet-Mélou, Claesson, Dukes &amp; Kitaev.  But, unfortunately, it seems like none of this is actually directly relevant.  So at least for now, counting (<strong>3</strong>+<strong>1</strong>)-avoiding posets remains very much open.</p>
<p>Notes:</p>
<p>0: One of the fundamental objects of combinatorics is the <em>partially ordered set</em>, or <em>poset</em> for short.<sup>1</sup>  Posets are just what their name suggests: they are given by an order relation (usually denoted <img src='http://s0.wp.com/latex.php?latex=%3C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&lt;' title='&lt;' class='latex' />) that is transitive (i.e., <img src='http://s0.wp.com/latex.php?latex=a+%3C+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a &lt; b' title='a &lt; b' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=b+%3C+c&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='b &lt; c' title='b &lt; c' class='latex' /> means <img src='http://s0.wp.com/latex.php?latex=a+%3C+c&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a &lt; c' title='a &lt; c' class='latex' />) and antisymmetric (i.e., we never have <img src='http://s0.wp.com/latex.php?latex=a%2C+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a, b' title='a, b' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=a+%3C+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a &lt; b' title='a &lt; b' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=b+%3C+a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='b &lt; a' title='b &lt; a' class='latex' />), but it is partial in the sense that not every two elements are necessarily comparable (so we might have <img src='http://s0.wp.com/latex.php?latex=a+%5Cneq+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a &#92;neq b' title='a &#92;neq b' class='latex' /> such that neither <img src='http://s0.wp.com/latex.php?latex=a+%3C+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a &lt; b' title='a &lt; b' class='latex' /> nor <img src='http://s0.wp.com/latex.php?latex=a+%3E+b&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a &gt; b' title='a &gt; b' class='latex' /> hold).  Obviously posets are a very flexible family of objects, including things like the usual order on the integers <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C+2%2C+%5Cldots%2C+n%5C%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;{1, 2, &#92;ldots, n&#92;}' title='&#92;{1, 2, &#92;ldots, n&#92;}' class='latex' /> (a partial order than happens to be a total order) or the containment order on the subsets of <img src='http://s0.wp.com/latex.php?latex=%5C%7B1%2C+2%2C+%5Cldots%2C+n%5C%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;{1, 2, &#92;ldots, n&#92;}' title='&#92;{1, 2, &#92;ldots, n&#92;}' class='latex' /> (the Boolean lattice).</p>
<p>1: Does anyone know the history of how this came to be?  As far as I know, MacMahon didn&#8217;t do anything with posets, so my assumption is that one can trace it to Rota, but obviously this is not based on anything concrete at all.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1301/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1301/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1301&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/12/04/3-plus-1-avoiding-posets/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ef7b55858ab2457c72dfc7b2f2bd95c2?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">jblblog</media:title>
		</media:content>
	</item>
		<item>
		<title>On Being Naive</title>
		<link>http://concretenonsense.wordpress.com/2011/11/23/on-being-naive/</link>
		<comments>http://concretenonsense.wordpress.com/2011/11/23/on-being-naive/#comments</comments>
		<pubDate>Wed, 23 Nov 2011 20:13:59 +0000</pubDate>
		<dc:creator>yanzhang</dc:creator>
				<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Probability]]></category>
		<category><![CDATA[bayesian networks]]></category>
		<category><![CDATA[conditional independence]]></category>
		<category><![CDATA[Jaynes]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[naive bayes]]></category>
		<category><![CDATA[noisy-or]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1358</guid>
		<description><![CDATA[I gave a talk at the MIT applied math seminar recently. Some people wanted a blog post, so here it is. Some of my mathematical hobbies include probability and machine learning. I have recently realized that many simple but effective ideas of these fields really all come from one thing: an independence assumption. It was [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1358&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>I gave a talk at the MIT applied math seminar recently. Some people wanted a blog post, so here it is.</p>
<p>Some of my mathematical hobbies include probability and machine learning. I have recently realized that many simple but effective ideas of these fields really all come from one thing: an independence assumption. It was only until I saw the same example in several different guises, however, before I really caught on. As something Occam would surely approve of, the extreme naiveté this approach embraces can actually go a long way. Our key player is simple: we say that <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='B' title='B' class='latex' /> are <em>conditionally independent</em> given <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='C' title='C' class='latex' /> if <img src='http://s0.wp.com/latex.php?latex=P%28A%7CB%2C+C%29+%3D+P%28A+%7C+C%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(A|B, C) = P(A | C)' title='P(A|B, C) = P(A | C)' class='latex' />. This can be written in the more symmetric form <img src='http://s0.wp.com/latex.php?latex=P%28A%2C+B%7CC%29+%3D+P%28A%7CC%29+P%28B%7CC%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(A, B|C) = P(A|C) P(B|C)' title='P(A, B|C) = P(A|C) P(B|C)' class='latex' />. Now I will tell a few stories. Most of these should be old for a specialist, but I hope I&#8217;ve included some remarks that even they may appreciate.</p>
<p><span id="more-1358"></span></p>
<p><strong>Weighing Evidence</strong></p>
<p>(this is mostly <del>stolen from</del><strong></strong> inspired by Jaynes from <em>Probability Theory: the Logic of Science</em>) Suppose we have two hypotheses about the state of the world, say <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7BE%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;overline{E}' title='&#92;overline{E}' class='latex' />, and we know that exactly one of them is true. Now suppose we are getting consecutive pieces of data <img src='http://s0.wp.com/latex.php?latex=D_1%2C+D_2%2C+%5Cldots&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='D_1, D_2, &#92;ldots' title='D_1, D_2, &#92;ldots' class='latex' />.  How does this data update our belief in <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' /> or its complement?</p>
<p>A standard use of Bayes&#8217;s Theorem gives that <img src='http://s0.wp.com/latex.php?latex=P%28D_1%2C+D_2%2C+%5Cldots+%2C+E%29+%3D+P%28E%29+P%28D_1+%7C+E%29P%28D_2+%7C+D_1%2C+E%29+P%28D_3+%7C+D_1%2C+D_2%2C+E%29%5Ccdots&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(D_1, D_2, &#92;ldots , E) = P(E) P(D_1 | E)P(D_2 | D_1, E) P(D_3 | D_1, D_2, E)&#92;cdots' title='P(D_1, D_2, &#92;ldots , E) = P(E) P(D_1 | E)P(D_2 | D_1, E) P(D_3 | D_1, D_2, E)&#92;cdots' class='latex' /> Doing the same for <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7BE%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;overline{E}' title='&#92;overline{E}' class='latex' /> and dividing gives</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=%5Cdfrac%7BP%28D_1%2C+D_2%2C+%5Cldots%2C+E%29%7D%7BP%28D_1%2C+D_2%2C+%5Cldots%2C+%5Coverline%7BE%7D%29%7D+%3D+%5Cdfrac%7BP%28E%29%7D%7BP%28%5Coverline%7BE%7D%29%7D+%5Cdfrac%7BP%28D_1+%7C+E%29%7D%7BP%28D_1+%7C+%5Coverline%7BE%7D%29%7D+%5Cdfrac%7BP%28D_2+%7C+D_1%2C+E%29%7D%7BP%28D_2+%7C+D_1%2C+%5Coverline%7BE%7D%29%7D%5Ccdots&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;dfrac{P(D_1, D_2, &#92;ldots, E)}{P(D_1, D_2, &#92;ldots, &#92;overline{E})} = &#92;dfrac{P(E)}{P(&#92;overline{E})} &#92;dfrac{P(D_1 | E)}{P(D_1 | &#92;overline{E})} &#92;dfrac{P(D_2 | D_1, E)}{P(D_2 | D_1, &#92;overline{E})}&#92;cdots' title='&#92;dfrac{P(D_1, D_2, &#92;ldots, E)}{P(D_1, D_2, &#92;ldots, &#92;overline{E})} = &#92;dfrac{P(E)}{P(&#92;overline{E})} &#92;dfrac{P(D_1 | E)}{P(D_1 | &#92;overline{E})} &#92;dfrac{P(D_2 | D_1, E)}{P(D_2 | D_1, &#92;overline{E})}&#92;cdots' class='latex' /></p>
<p>Here, let&#8217;s make the naive assumption that the <img src='http://s0.wp.com/latex.php?latex=D_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='D_i' title='D_i' class='latex' />&#8216;s are conditionally independent given <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7BE%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;overline{E}' title='&#92;overline{E}' class='latex' />. Not only are these different, we in fact want the slightly stronger assumption that the <img src='http://s0.wp.com/latex.php?latex=D_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='D_i' title='D_i' class='latex' />&#8216;s are conditionally independent of the intersection of the previous <img src='http://s0.wp.com/latex.php?latex=D_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='D_i' title='D_i' class='latex' />&#8216;s given <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7BE%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;overline{E}' title='&#92;overline{E}' class='latex' />. If so, on the right we just get a product of <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7BP%28D_i+%7C+E%29%7D%7BP%28D_i+%7C+%5Coverline%7BE%7D%29%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;frac{P(D_i | E)}{P(D_i | &#92;overline{E})}' title='&#92;frac{P(D_i | E)}{P(D_i | &#92;overline{E})}' class='latex' />. We want to take logs here and rewrite our equation as</p>
<p style="text-align:center;"><img src='http://s0.wp.com/latex.php?latex=O%28E+%7C+D_1%2C+D_2%2C+%5Cldots%29+%3D+O%28E%29+%2B+%5Csum_i+%5Clog%28%5Cfrac%7BP%28D_i+%7C+E%29%7D%7BP%28D_i+%7C+%5Coverline%7BE%7D%29%7D%29%2C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(E | D_1, D_2, &#92;ldots) = O(E) + &#92;sum_i &#92;log(&#92;frac{P(D_i | E)}{P(D_i | &#92;overline{E})}),' title='O(E | D_1, D_2, &#92;ldots) = O(E) + &#92;sum_i &#92;log(&#92;frac{P(D_i | E)}{P(D_i | &#92;overline{E})}),' class='latex' /></p>
<p>where <img src='http://s0.wp.com/latex.php?latex=O%28E%7CD%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(E|D)' title='O(E|D)' class='latex' /> is the <em>odds</em> <img src='http://s0.wp.com/latex.php?latex=%5Clog%28%5Cfrac%7BP%28E%7CD%29%7D%7BP%28%5Coverline%7BE%7D%7CD%29%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;log(&#92;frac{P(E|D)}{P(&#92;overline{E}|D)})' title='&#92;log(&#92;frac{P(E|D)}{P(&#92;overline{E}|D)})' class='latex' />. We now make the natural guess we are in the world <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' /> exactly when this is positive (which corresponds to the <img src='http://s0.wp.com/latex.php?latex=P%28E%7CD_1%2C+%5Cldots%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(E|D_1, &#92;ldots)' title='P(E|D_1, &#92;ldots)' class='latex' /> having higher probability).</p>
<p>The cute thing about this situation is that we are really &#8220;weighing&#8221; our evidence, as on a balance! Each new piece of data just contributes a number, and we mentally keep a tally and just decide yes or no given the sign of the final sum. Our original &#8220;bias&#8221; is exactly the odds of <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' /> given no other information, which exactly corresponds to the Bayesian information contained in our prior knowledge of <img src='http://s0.wp.com/latex.php?latex=E&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='E' title='E' class='latex' />, so the entire prior information comes into play as a &#8220;head start&#8221; bias in one direction, <em>as if it were a single instance of data with some weight</em>! This is a very clean way to make decisions, and really makes binary hypothesis testing very intuitive (it is fun/frustrating to try to generalize this to more than <img src='http://s0.wp.com/latex.php?latex=2&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='2' title='2' class='latex' /> hypotheses, where there are quite a few unexpected pitfalls). The key, however, was our conditional independence assumption.</p>
<p><strong>Naive Bayes</strong></p>
<p>A very naive approach to spam filtering is the following (generative) model. Let nature choose whether a message is spam (call this hypothesis <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='H' title='H' class='latex' />) with some probability <img src='http://s0.wp.com/latex.php?latex=P%28H%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(H)' title='P(H)' class='latex' /> and then, for each word in the dictionary, pick whether each word <img src='http://s0.wp.com/latex.php?latex=w_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='w_i' title='w_i' class='latex' /> is in the message (abuse notation and call this the event <img src='http://s0.wp.com/latex.php?latex=w_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='w_i' title='w_i' class='latex' />) with probability <img src='http://s0.wp.com/latex.php?latex=P%28w_i+%7C+H%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(w_i | H)' title='P(w_i | H)' class='latex' /> (or <img src='http://s0.wp.com/latex.php?latex=P%28w_i%7C%5Coverline%7BH%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(w_i|&#92;overline{H})' title='P(w_i|&#92;overline{H})' class='latex' />). You then pick maximum likelihood over sample data to learn the probabilities, and on a new piece of data just get the <img src='http://s0.wp.com/latex.php?latex=w_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='w_i' title='w_i' class='latex' />&#8216;s and see whether <img src='http://s0.wp.com/latex.php?latex=H&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='H' title='H' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=%5Coverline%7BH%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;overline{H}' title='&#92;overline{H}' class='latex' /> was more likely.</p>
<p>So what was the naive assumption? It was that the events <img src='http://s0.wp.com/latex.php?latex=w_i&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='w_i' title='w_i' class='latex' /> were conditionally independent given either hypothesis. Cutely, the final log odds <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7BP%28H%7CD%29%7D%7BP%28%5Coverline%7BH%7D%7CD%29%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;frac{P(H|D)}{P(&#92;overline{H}|D)}' title='&#92;frac{P(H|D)}{P(&#92;overline{H}|D)}' class='latex' /> is additive because of this (an exercise I leave to the reader) and extremely easy to calculate (and fast for computers, too!). The punchline is that this additivity is exactly the same additivity we had in the hypothesis testing. The expressions work out such that the inclusion/exclusion of every word will add some positive/negative number to a running total, biased in the beginning by some initial value determined by <img src='http://s0.wp.com/latex.php?latex=P%28H%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='P(H)' title='P(H)' class='latex' /> with no other information, and we choose to classify a message as spam based on whether this running total is positive or not.</p>
<p>Before continuing, I want to put in e-print my long-running gripe that &#8220;Naive Bayes&#8221; is a misnomer, since the only thing naive I can possibly think about doing with Bayes&#8217;s Theorem is conditional independence, and so every such algorithm should be called &#8220;naive.&#8221; The name offers no information about the particular algorithm that is associated with spam filtering, and the graphical network it corresponds to is not even the unique simplest graph (you can reverse all the arrows, for instance, and get Noisy-Or or any other ICI, but I admit those have another layer of complexity that I am not going into here). Unsurprisingly, there are actually many flavors of Naive Bayes depending on where you want to insert your naivete &#8212; see Metsis, Androutsopoulos and Paliouras, <em>Spam Filtering with Naive Bayes – Which Naive Bayes?</em><br />
<strong></strong></p>
<p><strong>The Unreasonable Effectiveness of Naive Bayes</strong></p>
<p>What&#8217;s the yoga here? Here&#8217;s my take: the conditional independence became a multiplicative condition and thus an additive condition, so the convenience of independence corresponds to the convenience of linearity. Thus, the hyperbolic punchline of this post is that &#8220;independence is linearity.&#8221;</p>
<p>I see a strange phenomenon (at least among pure mathematicians casually talking about applied math; I&#8217;m sure applied mathematicians have a better intuition) that people are very comfortable accepting linear approximations but not as comfortable accepting independence, whereas at least in my very simple setup they are <em>exactly </em>the same. I will audaciously extend my analogy to say that this intuition is inconsistent, and I don&#8217;t know why people seem to be completely fine with logistic regression (which really just says the log-odds is additive and is thus a third story equivalent to the two stories I&#8217;ve told in this blog post!) while careful about making disclaimers about Naive Bayes.</p>
<p>In fact, Naive Bayes, contrary to popular opinion, is actually also very good (and provably optimal, with certain definitions of optimal) when events are <em>very </em>dependent! It is only in the middle regions where it suffers, and it really doesn&#8217;t suffer by much. We also overestimate its problems because we like to think in terms of <img src='http://s0.wp.com/latex.php?latex=L%5E2&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='L^2' title='L^2' class='latex' /> but errors in classifying are frequently done under zero-one loss (this is a really interesting nuance that I would love to talk about some other time, but this post has gotten long enough). For a more in-depth look, see Domingos and Pazzani, <em>On the Optimality of the Simple Bayesian Classifier</em><em> under Zero-One Loss</em>.</p>
<p><strong>Appendix: Noisy-Or and Bayesian Networks<br />
</strong></p>
<p>When we talk about conditional independence, we really should take the setup of Bayesian networks, which gives a natural excuse to introduce Naive Bayes&#8217; much lesser well-known sister, Noisy-Or (that often does better than Naive Bayes!). I spent some time in my talk going over the basics of d-separation, Markov blankets, etc.. However, I realized that I had no real interesting observations so I won&#8217;t talk too much about it in blog format, where the reader is very close to Google and smarter people who know much more than I do. I did have one silly &#8220;original&#8221; contribution, however, so I share it here.</p>
<p>Here is an example that I thought was surprisingly clean and possibly helpful for someone interested in the basics of Bayesian networks: consider the events A(AC), B(Battery), and C(Computer), corresponding to whether the corresponding electronic gizmo is on or off (with the computer connected to both the AC and the battery). This corresponds to a Bayesian network with two edges <img src='http://s0.wp.com/latex.php?latex=A+%5Crightarrow+C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='A &#92;rightarrow C' title='A &#92;rightarrow C' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B+%5Crightarrow+C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='B &#92;rightarrow C' title='B &#92;rightarrow C' class='latex' />.</p>
<p>It is obvious that <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='B' title='B' class='latex' /> are independent until <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='C' title='C' class='latex' /> is observed, which makes them conditionally dependent; if you know the Computer is on or off, then the other two power sources&#8217; integrities are coupled. Otherwise, your blissful ignorance gives you no information. This is starkly different from <em>every other </em>orientation (3 possible) of the edges, where <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='A' title='A' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=B&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='B' title='B' class='latex' /> are dependent but <em>conditionally independent</em> given <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='C' title='C' class='latex' />! This quirkiness makes the weird d-separation criterion necessary, and I thought this example very mnemonically convenient for marking the &#8220;bad&#8221; edge-orientation.</p>
<p>-Yan</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1358/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1358/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1358&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/11/23/on-being-naive/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ff7abdf56e85a53563fc0e360de7e5dc?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">KR</media:title>
		</media:content>
	</item>
		<item>
		<title>Positivity, Dynamics, Knots</title>
		<link>http://concretenonsense.wordpress.com/2011/08/20/positivity-dynamics-knots/</link>
		<comments>http://concretenonsense.wordpress.com/2011/08/20/positivity-dynamics-knots/#comments</comments>
		<pubDate>Sat, 20 Aug 2011 20:50:18 +0000</pubDate>
		<dc:creator>lewallen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1331</guid>
		<description><![CDATA[Some of my research last year had to do with notions of &#8220;positivity&#8221; in Heegaard Floer homology and knot theory (a phrase which I&#8217;m borrowing from a paper of Matt Hedden). As a simple example of positivity, a braid (and more generally, a knot) is called positive if it has a diagram containing only positive [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1331&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p style="text-align:left;"><strong></strong>Some of my research last year had to do with notions of &#8220;positivity&#8221; in Heegaard Floer homology and knot theory (a phrase which I&#8217;m borrowing from a paper of Matt Hedden). As a simple example of positivity, a braid (and more generally, a knot) is called positive if it has a diagram containing only positive crossings:</p>
<p><a href="http://concretenonsense.files.wordpress.com/2011/08/image1pos.jpg"><img class="size-medium wp-image-1335 aligncenter" title="Image1Pos" src="http://concretenonsense.files.wordpress.com/2011/08/image1pos.jpg?w=300&#038;h=79" alt="" width="300" height="79" /></a><span id="more-1331"></span></p>
<p>(Positive knots become negative knots if we switch either our nomenclature or the orientation on <img src='http://s0.wp.com/latex.php?latex=%7BS%5E%7B3%7D%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S^{3}}' title='{S^{3}}' class='latex' />, so there are <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathbb%7BZ%7D+%2F2%5Cmathbb%7BZ%7D%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mathbb{Z} /2&#92;mathbb{Z}}' title='{&#92;mathbb{Z} /2&#92;mathbb{Z}}' class='latex' /> confusions lurking everywhere in this business. Also, note that although we needed on orientation on our knot to define the sign of a crossing, the sign is actually independent of this orientation, and only depends on the embdedding of the knot in <img src='http://s0.wp.com/latex.php?latex=%7BS%5E3%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S^3}' title='{S^3}' class='latex' />).</p>
<p>A rather amazing theorem is that every positive braid is fibered. What this means is that if we take our positive braid and close it up into a knot <img src='http://s0.wp.com/latex.php?latex=%7BK%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{K}' title='{K}' class='latex' /> in <img src='http://s0.wp.com/latex.php?latex=%7BS%5E%7B3%7D%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S^{3}}' title='{S^{3}}' class='latex' />, then the &#8220;knot complement&#8221; <img src='http://s0.wp.com/latex.php?latex=%7BS%5E%7B3%7D%5Cbackslash+K%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S^{3}&#92;backslash K}' title='{S^{3}&#92;backslash K}' class='latex' />, which we can think of as an open 3-manifold, actually fibers over the circle, with fiber a punctured surface. (Alternatively, if we remove a small tubular neighborhood of the knot, we can think of <img src='http://s0.wp.com/latex.php?latex=S%5E3+%5Cbackslash+K&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='S^3 &#92;backslash K' title='S^3 &#92;backslash K' class='latex' /> as a compact 3-manifold with torus boundary). Here&#8217;s how to close the braid:</p>
<p><a href="http://concretenonsense.files.wordpress.com/2011/08/image2close.jpg"><img class="size-medium wp-image-1336 aligncenter" title="Image2Close" src="http://concretenonsense.files.wordpress.com/2011/08/image2close.jpg?w=300&#038;h=175" alt="" width="300" height="175" /></a></p>
<p>In other words, there is a punctured surface <img src='http://s0.wp.com/latex.php?latex=%7B%5CSigma%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;Sigma}' title='{&#92;Sigma}' class='latex' />, and a map <img src='http://s0.wp.com/latex.php?latex=%7Bf%3A%5CSigma+%5Crightarrow+%5CSigma%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f:&#92;Sigma &#92;rightarrow &#92;Sigma}' title='{f:&#92;Sigma &#92;rightarrow &#92;Sigma}' class='latex' />, fixing the puncture, so that the knot complement is the mapping torus for <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f}' title='{f}' class='latex' />, as pictured below. Such fibered 3-manifolds are very special.</p>
<p><a href="http://concretenonsense.files.wordpress.com/2011/08/image3torus.jpg"><img class="alignnone size-medium wp-image-1337 aligncenter" title="Image3Torus" src="http://concretenonsense.files.wordpress.com/2011/08/image3torus.jpg?w=300&#038;h=135" alt="" width="300" height="135" /></a>Note that it is precisely because <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f}' title='{f}' class='latex' /> fixes the puncture that the line <img src='http://s0.wp.com/latex.php?latex=%7BK%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{K}' title='{K}' class='latex' /> above closes up and becomes a knot. I should say, the proof that positive braids are fibered uses an even more amazing theorem of Stallings, which characterizes fibered knot complements in terms of a simple algebraic property of their fundamental group. This particular notion of positivity is one that appears in Matt&#8217;s paper. I recently read some work of Etienne Ghys talking about a related notion, and I thought it was so cool that I had to post about it.</p>
<p>Here&#8217;s the theorem, which Ghys attributes to Freed, Schwarzman, and Sullivan. Let <img src='http://s0.wp.com/latex.php?latex=%7BM%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{M}' title='{M}' class='latex' /> be a compact manifold with a non-vanishing, smooth vector field <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' />. First for the background: suppose there is a closed surface <img src='http://s0.wp.com/latex.php?latex=%7B%5CSigma+%5Csubset+M%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;Sigma &#92;subset M}' title='{&#92;Sigma &#92;subset M}' class='latex' /> which is transverse to the flow of <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> and meets the forward orbit of every point <img src='http://s0.wp.com/latex.php?latex=%7Bm%5Cin+M%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{m&#92;in M}' title='{m&#92;in M}' class='latex' /> at least once (therefore, it meets every orbit infinitely many times). We get a first return map <img src='http://s0.wp.com/latex.php?latex=%7Bf%3A%5CSigma+%5Crightarrow+%5CSigma%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f:&#92;Sigma &#92;rightarrow &#92;Sigma}' title='{f:&#92;Sigma &#92;rightarrow &#92;Sigma}' class='latex' />, simply by taking <img src='http://s0.wp.com/latex.php?latex=%7Bm%5Cin+%5CSigma%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{m&#92;in &#92;Sigma}' title='{m&#92;in &#92;Sigma}' class='latex' /> and flowing it forward until it hits <img src='http://s0.wp.com/latex.php?latex=%7B%5CSigma%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;Sigma}' title='{&#92;Sigma}' class='latex' /> again, say at <img src='http://s0.wp.com/latex.php?latex=%7Bn%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{n}' title='{n}' class='latex' />, and defining <img src='http://s0.wp.com/latex.php?latex=%7Bf%28m%29%3Dn%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f(m)=n}' title='{f(m)=n}' class='latex' />. Then it&#8217;s not hard to check that <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> must be the mapping torus for <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f}' title='{f}' class='latex' />, as before, and that <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> is just the natural vector field pointing along the &#8220;time&#8221; direction of the mapping torus (up to scaling), as depicted below:</p>
<p><a href="http://concretenonsense.files.wordpress.com/2011/08/image4vfield.jpg"><img class="size-full wp-image-1338 aligncenter" title="Image4VField" src="http://concretenonsense.files.wordpress.com/2011/08/image4vfield.jpg?w=500" alt=""   /></a></p>
<p>In this case, <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> is called the suspension of <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f}' title='{f}' class='latex' />. The question addressed by the theorem is: given a non-vanishing vector field <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=%7BM%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{M}' title='{M}' class='latex' />, when is it the suspension of a map <img src='http://s0.wp.com/latex.php?latex=%7Bf%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{f}' title='{f}' class='latex' />? Note that if we have such a suspension, and therefore a fibration over <img src='http://s0.wp.com/latex.php?latex=%7BS%5E1%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S^1}' title='{S^1}' class='latex' />, we can pull back the form <img src='http://s0.wp.com/latex.php?latex=%7Bd%5Ctheta%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{d&#92;theta}' title='{d&#92;theta}' class='latex' /> on <img src='http://s0.wp.com/latex.php?latex=%7BS%5E1%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S^1}' title='{S^1}' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%7B%5COmega%5E1%28M%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;Omega^1(M)}' title='{&#92;Omega^1(M)}' class='latex' /> to get a closed, NON-VANISHING 1-form which is positive on <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' />. It&#8217;s not so hard to figure out that having such a form is equivalent to being a suspension. The really cool theorem is an apparently much weaker condition which is also sufficient.</p>
<p>The key object is the set <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal%7BP%7D_X%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mathcal{P}_X}' title='{&#92;mathcal{P}_X}' class='latex' /> of probability measures on <img src='http://s0.wp.com/latex.php?latex=%7BM%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{M}' title='{M}' class='latex' /> which are invariant with respect to <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' />. Given any <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmu+%5Cin+%5Cmathcal%7BP%7D_X%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mu &#92;in &#92;mathcal{P}_X}' title='{&#92;mu &#92;in &#92;mathcal{P}_X}' class='latex' />, and any 1-form <img src='http://s0.wp.com/latex.php?latex=%7B%5Comega+%5Cin+%5COmega%5E%7B1%7D%28M%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;omega &#92;in &#92;Omega^{1}(M)}' title='{&#92;omega &#92;in &#92;Omega^{1}(M)}' class='latex' />, we can get a number by integrating:</p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cint_M+%5Comega%28X%29+d%5Cmu&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='&#92;displaystyle &#92;int_M &#92;omega(X) d&#92;mu' title='&#92;displaystyle &#92;int_M &#92;omega(X) d&#92;mu' class='latex' /></p>
<p>This associates to any <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmu%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mu}' title='{&#92;mu}' class='latex' /> a 1-chain, i.e. something dual to a 1-form. If <img src='http://s0.wp.com/latex.php?latex=%7B%5Comega%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;omega}' title='{&#92;omega}' class='latex' /> is exact, the above integral can be shown to be 0, using the <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' />-invariance of <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmu%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mu}' title='{&#92;mu}' class='latex' /> (use the invariance to rewrite the integrand, when <img src='http://s0.wp.com/latex.php?latex=%7B%5Comega%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;omega}' title='{&#92;omega}' class='latex' /> is an exact form, as a total differential). Therefore, we obtain a map</p>
<p align="center"><img src='http://s0.wp.com/latex.php?latex=%5Cdisplaystyle+S%3A+%5Cmathcal%7BP%7D_X+%5Crightarrow+H_1%28M%3B%5Cmathbb%7BR%7D%29&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='&#92;displaystyle S: &#92;mathcal{P}_X &#92;rightarrow H_1(M;&#92;mathbb{R})' title='&#92;displaystyle S: &#92;mathcal{P}_X &#92;rightarrow H_1(M;&#92;mathbb{R})' class='latex' /></p>
<p>whose image <img src='http://s0.wp.com/latex.php?latex=%7BS%28%5Cmathcal%7BP%7D_X%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S(&#92;mathcal{P}_X)}' title='{S(&#92;mathcal{P}_X)}' class='latex' /> is in fact compact and convex. Now, if <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> was a suspension, then <img src='http://s0.wp.com/latex.php?latex=%7BS%28%5Cmathcal%7BP%7D_X%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S(&#92;mathcal{P}_X)}' title='{S(&#92;mathcal{P}_X)}' class='latex' /> actually lies entirely in some positive half-space of <img src='http://s0.wp.com/latex.php?latex=H_1%28M%2C%5Cmathbb%7BR%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='H_1(M,&#92;mathbb{R})' title='H_1(M,&#92;mathbb{R})' class='latex' />. Why? Well, remember that in this case we have a closed non-vanishing one-form which is positive on <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' />, and by pairing with this form we get a map <img src='http://s0.wp.com/latex.php?latex=%7BH_1%28M%3B%5Cmathbb%7BR%7D%29%5Crightarrow+%5Cmathbb%7BR%7D%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{H_1(M;&#92;mathbb{R})&#92;rightarrow &#92;mathbb{R}}' title='{H_1(M;&#92;mathbb{R})&#92;rightarrow &#92;mathbb{R}}' class='latex' /> which is positive on <img src='http://s0.wp.com/latex.php?latex=%7BS%28%5Cmathcal%7BP%7D_X%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S(&#92;mathcal{P}_X)}' title='{S(&#92;mathcal{P}_X)}' class='latex' />. Therefore <img src='http://s0.wp.com/latex.php?latex=%7BS%28%5Cmathcal%7BP%7D_X%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S(&#92;mathcal{P}_X)}' title='{S(&#92;mathcal{P}_X)}' class='latex' /> lies in a positive half-space of <img src='http://s0.wp.com/latex.php?latex=%7BH_1%28M%3B%5Cmathbb%7BR%7D%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{H_1(M;&#92;mathbb{R})}' title='{H_1(M;&#92;mathbb{R})}' class='latex' />. The rad theorem of FSS is that this is actually sufficient:</p>
<p>Theorem: <img src='http://s0.wp.com/latex.php?latex=%7BX%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{X}' title='{X}' class='latex' /> is a suspension if and only if <img src='http://s0.wp.com/latex.php?latex=%7BS%28%5Cmathcal%7BP%7D_X%29%5Csubset+H_1%28M%3B%5Cmathbb%7BR%7D%29%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{S(&#92;mathcal{P}_X)&#92;subset H_1(M;&#92;mathbb{R})}' title='{S(&#92;mathcal{P}_X)&#92;subset H_1(M;&#92;mathbb{R})}' class='latex' /> is contained in some halfspace.</p>
<p>One remark about the measures <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal%7BP%7D_X%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mathcal{P}_X}' title='{&#92;mathcal{P}_X}' class='latex' />: note that if we have a closed periodic orbit <img src='http://s0.wp.com/latex.php?latex=O&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O' title='O' class='latex' /> for <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='X' title='X' class='latex' />, i.e. some closed loop which integrates the flow, then we get a natural set of measures which are concentrated near <img src='http://s0.wp.com/latex.php?latex=O&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O' title='O' class='latex' />. In this sense, <img src='http://s0.wp.com/latex.php?latex=%7B%5Cmathcal%7BP%7D_X%7D&amp;bg=ffffff&amp;fg=000000&amp;s=0' alt='{&#92;mathcal{P}_X}' title='{&#92;mathcal{P}_X}' class='latex' /> should be thought of as a set of generalized periodic orbits. The measures associated to actual periodic orbits just get sent by <img src='http://s0.wp.com/latex.php?latex=S&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='S' title='S' class='latex' /> to the class represented by these orbits in <img src='http://s0.wp.com/latex.php?latex=H_1%28M%3B%5Cmathbb%7BR%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='H_1(M;&#92;mathbb{R})' title='H_1(M;&#92;mathbb{R})' class='latex' />.</p>
<p>One way to think about the theorem, and this is how one of the proofs goes, is that just from this positive subset of homology, we can do some fancy functional analysis to create from this an actual dual form, not just a cohomology class, with the right non-vanishing and positivity properties. So somehow, we&#8217;re free to work just in homology without losing information, which seems very appealing.</p>
<p>I&#8217;m not yet sure exactly what this theorem useful for, but it&#8217;s so neat. In a soon to come follow up, I will talk more about positivity on the knot side, and bleg about a concrete question that I&#8217;d love to get everyone&#8217;s opinion on.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1331/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1331/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1331&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/08/20/positivity-dynamics-knots/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/74e30911bae0524315cab41b6bb91cec?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">lewallen</media:title>
		</media:content>

		<media:content url="http://concretenonsense.files.wordpress.com/2011/08/image1pos.jpg?w=300" medium="image">
			<media:title type="html">Image1Pos</media:title>
		</media:content>

		<media:content url="http://concretenonsense.files.wordpress.com/2011/08/image2close.jpg?w=300" medium="image">
			<media:title type="html">Image2Close</media:title>
		</media:content>

		<media:content url="http://concretenonsense.files.wordpress.com/2011/08/image3torus.jpg?w=300" medium="image">
			<media:title type="html">Image3Torus</media:title>
		</media:content>

		<media:content url="http://concretenonsense.files.wordpress.com/2011/08/image4vfield.jpg" medium="image">
			<media:title type="html">Image4VField</media:title>
		</media:content>
	</item>
		<item>
		<title>On Physical Units</title>
		<link>http://concretenonsense.wordpress.com/2011/08/03/on-physical-units/</link>
		<comments>http://concretenonsense.wordpress.com/2011/08/03/on-physical-units/#comments</comments>
		<pubDate>Thu, 04 Aug 2011 01:43:13 +0000</pubDate>
		<dc:creator>yanzhang</dc:creator>
				<category><![CDATA[Analysis]]></category>
		<category><![CDATA[Physics]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[physics]]></category>

		<guid isPermaLink="false">http://concretenonsense.wordpress.com/?p=1316</guid>
		<description><![CDATA[I&#8217;ve been learning some physics from Allan Adams and some physics students here at MIT, and I&#8217;ve suddenly realized that there are a lot of contextual tricks I take for granted when I&#8217;m doing physics, which make the physicists&#8217; lives easier and sometimes really irritates the mathematicians. The first two examples I can think of [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1316&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>I&#8217;ve been learning some physics from Allan Adams and some physics students here at MIT, and I&#8217;ve suddenly realized that there are a lot of contextual tricks I take for granted when I&#8217;m doing physics, which make the physicists&#8217; lives easier and sometimes really irritates the mathematicians. The first two examples I can think of is differentiating under the integral and never checking convergence, though both of these really fall under the bigger umbrella of assuming everything is well-behaved (which probably accounts for 80% of the mathematical gripes I&#8217;ve seen against physicists). Now, I&#8217;m a happy supporter of this &#8220;wishful thinking&#8221; practice: to use a programming analogy, I think of this habit as the <a href="http://en.wikipedia.org/wiki/Lazy_evaluation">lazy evaluation</a> version of having good definitions, and as a lover of Python generators I totally appreciate the idea of saying &#8220;we&#8217;ll figure out the right definitions later since they actually exist.&#8221;</p>
<p>One of the most common physical tricks, however, is not of this category. It is the curiously natural framework: &#8220;we have a consistent idea of units.&#8221; Here&#8217;s a perfectly sound argument to get something that is not entirely obvious:<span id="more-1316"></span></p>
<p>Take the integral <img src='http://s0.wp.com/latex.php?latex=%5Cint_%7B-%5Cinfty%7D%5E%7B%5Cinfty%7D+e%5E%7B-ax%5E2%7D+dx&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;int_{-&#92;infty}^{&#92;infty} e^{-ax^2} dx' title='&#92;int_{-&#92;infty}^{&#92;infty} e^{-ax^2} dx' class='latex' />. There is a way to get some information about it without doing the real integral:</p>
<p style="padding-left:30px;">Do the substitution <img src='http://s0.wp.com/latex.php?latex=y+%3D+%5Csqrt%7Ba%7D+x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='y = &#92;sqrt{a} x' title='y = &#92;sqrt{a} x' class='latex' />. Then the integral becomes <img src='http://s0.wp.com/latex.php?latex=%5Cint+e%5E%7B-y%5E2%7D+%281%2F%5Csqrt%7Ba%7D%29+dy+%3D+%281%2F%5Csqrt%7Ba%7D%29+%5Cint+e%5E%7B-y%5E2%7D+dy+%3D+C+a%5E%7B-1%2F2%7D.&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;int e^{-y^2} (1/&#92;sqrt{a}) dy = (1/&#92;sqrt{a}) &#92;int e^{-y^2} dy = C a^{-1/2}.' title='&#92;int e^{-y^2} (1/&#92;sqrt{a}) dy = (1/&#92;sqrt{a}) &#92;int e^{-y^2} dy = C a^{-1/2}.' class='latex' /></p>
<p>Using a slightly physical language: f we don&#8217;t care about the actual constant, just the &#8220;order&#8221; of <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a' title='a' class='latex' /> (though it is a similar concept, we&#8217;re not exactly doing the order of growth of <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a' title='a' class='latex' />), we can deduce that the answer is in the &#8220;units&#8221; of <img src='http://s0.wp.com/latex.php?latex=a%5E%7B-1%2F2%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a^{-1/2}' title='a^{-1/2}' class='latex' /> (the complete answer is <img src='http://s0.wp.com/latex.php?latex=%5Csqrt%7B%5Cpi+%2F+a%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;sqrt{&#92;pi / a}' title='&#92;sqrt{&#92;pi / a}' class='latex' />), by &#8220;isolating&#8221; the part of the integral with dependence on <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a' title='a' class='latex' />.</p>
<p>Even though this is already somewhat trick-sy, it is not quite as far as what a physicist would do. They would (confirmed by experience!) look at this and say something like:</p>
<p style="padding-left:30px;">&#8220;Let <img src='http://s0.wp.com/latex.php?latex=x&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='x' title='x' class='latex' /> have the units of <img src='http://s0.wp.com/latex.php?latex=%5B%5Ctext%7Blength%7D%5D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='[&#92;text{length}]' title='[&#92;text{length}]' class='latex' />. The exponential must be unit-less otherwise it doesn&#8217;t have a well-defined unit, which means <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a' title='a' class='latex' /> must have units of <img src='http://s0.wp.com/latex.php?latex=%5B%5Ctext%7Blength%7D%5E%7B-2%7D%5D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='[&#92;text{length}^{-2}]' title='[&#92;text{length}^{-2}]' class='latex' /> and the integrand itself must then be unit-less. When we integrate, we then pick up a single unit of <img src='http://s0.wp.com/latex.php?latex=%5B%5Ctext%7Blength%7D%5D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='[&#92;text{length}]' title='[&#92;text{length}]' class='latex' /> in terms of <img src='http://s0.wp.com/latex.php?latex=a&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='a' title='a' class='latex' />, so it must be <img src='http://s0.wp.com/latex.php?latex=C+a%5E%7B-1%2F2%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='C a^{-1/2}' title='C a^{-1/2}' class='latex' /> for some constant <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='C' title='C' class='latex' />.&#8221;</p>
<p>The problem is this makes perfect &#8220;sense&#8221; to me in a completely sound way (there is no approximation or heuristic here), yet I cannot argue it to my satisfaction in any mathematical matter. All I know is that most people with even elementary physics experience have picked up a very consistent language of &#8220;units&#8221; that we can use to make definite deductions, but I&#8217;m finding it hard to axiomatize them in a clear way. After trying for about half an hour, the only thing I&#8217;ve decided is that we really want some sort of valuation on a space of functions that is multiplicative, which I believe is enough to make the differentiation and integration instincts about units work, and that we limit all addition to be done with functions of homogeneous valuation. However, is that really it (for example, I don&#8217;t feel this is all that has gone into the logic above)? If so, what is the right way to formalize it? Also, I distinctly remember having seen usage of units to argue more sophisticated chains of logic than the example I&#8217;ve given here, though the exact examples don&#8217;t come to mind. If anyone has further insight and examples it would be really helpful.</p>
<p><strong>Update</strong>: after an unnecessarily long discussion w/ Qiaochu (the source of the un-necessity being my muddled thinking about something irrelevant), I now agree the formalism is &#8220;easy&#8221; and can be done in several ways (though I still find the intuition to be a clearer way to think than the formalism). The method that seems most natural to me is to just think of all functions we care about as lying in a graded algebra with grades indexed by powers of units; Qiaochu prefers to think of the &#8220;physical&#8221; attributes as living in one-dimensional representations / weight spaces. Pick whatever you like. My request for more &#8220;interesting&#8221; examples of using units still holds.</p>
<p>-Yan</p>
<p>(thanks to Allan, Yoni, and Josh for teaching me physics)</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/concretenonsense.wordpress.com/1316/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/concretenonsense.wordpress.com/1316/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=concretenonsense.wordpress.com&#038;blog=2918042&#038;post=1316&#038;subd=concretenonsense&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://concretenonsense.wordpress.com/2011/08/03/on-physical-units/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ff7abdf56e85a53563fc0e360de7e5dc?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">KR</media:title>
		</media:content>
	</item>
	</channel>
</rss>
