<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>XOR's Hammer</title>
	<atom:link href="http://xorshammer.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://xorshammer.com</link>
	<description>Some things in mathematical logic that I find interesting</description>
	<lastBuildDate>Sun, 14 Mar 2010 19:40:54 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='xorshammer.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/00a4693d80d58f6360006176a0d737dc?s=96&#038;d=http://s2.wp.com/i/buttonw-com.png</url>
		<title>XOR's Hammer</title>
		<link>http://xorshammer.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://xorshammer.com/osd.xml" title="XOR&#039;s Hammer" />
	<atom:link rel='hub' href='http://xorshammer.com/?pushpress=hub'/>
		<item>
		<title>Topology and First-Order Modal Logic</title>
		<link>http://xorshammer.com/2010/03/13/topology-and-first-order-modal-logic/</link>
		<comments>http://xorshammer.com/2010/03/13/topology-and-first-order-modal-logic/#comments</comments>
		<pubDate>Sat, 13 Mar 2010 07:23:02 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.com/?p=433</guid>
		<description><![CDATA[The normal square root function can be considered to be multi-valued. Let&#8217;s momentarily accept the heresy of saying that the square root of a negative number is , so that our function will be total. How can we represent the situation of this branching &#8220;function&#8221; topologically? One thing we could do is just take the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=433&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The normal square root function can be considered to be multi-valued.</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/03/sqrt.png"><img class="alignnone size-full wp-image-435" title="sqrt" src="http://xorshammer.files.wordpress.com/2010/03/sqrt.png?w=362&#038;h=362" alt="" width="362" height="362" /></a></p>
<p>Let&#8217;s momentarily accept the heresy of saying that the square root of a negative number is <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='0' title='0' class='latex' />, so that our function will be total.</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/03/sqrt_zero.png"><img class="alignnone size-full wp-image-437" title="sqrt_zero" src="http://xorshammer.files.wordpress.com/2010/03/sqrt_zero.png?w=362&#038;h=362" alt="" width="362" height="362" /></a></p>
<p>How can we represent the situation of this branching &#8220;function&#8221; topologically?</p>
<p><span id="more-433"></span>One thing we could do is just take the graph of the multivalued &#8220;function&#8221; itself in the subspace topology of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^2' title='\mathbb{R}^2' class='latex' />, which is topologically just like this:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/03/y.png"><img class="alignnone size-full wp-image-439" title="y" src="http://xorshammer.files.wordpress.com/2010/03/y.png?w=362&#038;h=362" alt="" width="362" height="362" /></a></p>
<p>This has the downside that, at the origin (the place where all three lines meet), it doesn&#8217;t really represent the fact that the graph of the original multivalued &#8220;function&#8221; was the union of two genuine single-valued functions:  There is no neighborhood of the origin which is functional in any way.</p>
<p>But what about this topological space?  (The two filled-in dots represent points which are present in the space, the non-filled-in dot represents a point missing from the space.  The border is not part of the space.)</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/03/sheaf11.png"><img class="alignnone size-full wp-image-445" style="border:1px solid black;" title="sheaf1" src="http://xorshammer.files.wordpress.com/2010/03/sheaf11.png?w=362&#038;h=218" alt="" width="362" height="218" /></a></p>
<p>Here, the open sets are given by a basis consisting of: open sets on either of the three branches: i.e., like this</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/03/sheaf1_branch1.png"><img class="alignnone size-full wp-image-446" style="border:1px solid black;" title="sheaf1_branch" src="http://xorshammer.files.wordpress.com/2010/03/sheaf1_branch1.png?w=362&#038;h=218" alt="" width="362" height="218" /></a></p>
<p>and this:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/03/sheaf1_branch21.png"><img class="alignnone size-full wp-image-448" style="border:1px solid black;" title="sheaf1_branch2" src="http://xorshammer.files.wordpress.com/2010/03/sheaf1_branch21.png?w=362&#038;h=218" alt="" width="362" height="218" /></a></p>
<p>and this:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/03/sheaf1_branch31.png"><img class="alignnone size-full wp-image-449" style="border:1px solid black;" title="sheaf1_branch3" src="http://xorshammer.files.wordpress.com/2010/03/sheaf1_branch31.png?w=362&#038;h=218" alt="" width="362" height="218" /></a></p>
<p>as well as open sets of this form:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/03/sheaf1_open1.png"><img class="alignnone size-full wp-image-452" style="border:1px solid black;" title="sheaf1_open1" src="http://xorshammer.files.wordpress.com/2010/03/sheaf1_open1.png?w=362&#038;h=218" alt="" width="362" height="218" /></a></p>
<p>and this form:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/03/sheaf1_open2.png"><img class="alignnone size-full wp-image-453" style="border:1px solid black;" title="sheaf1_open2" src="http://xorshammer.files.wordpress.com/2010/03/sheaf1_open2.png?w=362&#038;h=218" alt="" width="362" height="218" /></a></p>
<p>(Note that this space isn&#8217;t Hausdorff!).</p>
<p>This space (call it <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' />) <em>represents</em> the fact that the original graph was the union of two genuine single-valued functions in the following sense:  There is a function, <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' />, from <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> to the <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' />-axis (i.e., <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' />) such that for all points <img src='http://l.wordpress.com/latex.php?latex=y%5Cin+T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in T' title='y\in T' class='latex' />, there is an open set <img src='http://l.wordpress.com/latex.php?latex=y%5Cin+U%5Csubset+T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in U\subset T' title='y\in U\subset T' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> restricted to <img src='http://l.wordpress.com/latex.php?latex=U&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='U' title='U' class='latex' /> is a homeomorphism to an open set in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' />.  That is, every point in <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> has a neighborhood such that the inverse of <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> restricted to that neighborhood <em>is </em>a function.</p>
<p>The space <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> as defined above is (sort of) a sheaf over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' />.  More precisely, it&#8217;s an <a href="http://ncatlab.org/nlab/show/etale+space">étalé space</a>:  To restate the definition above more generally, an étalé space over a topological space <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> is a topological space <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> together with a function <img src='http://l.wordpress.com/latex.php?latex=p%5Ccolon+T%5Cto+S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p\colon T\to S' title='p\colon T\to S' class='latex' /> such that, for all <img src='http://l.wordpress.com/latex.php?latex=y%5Cin+T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in T' title='y\in T' class='latex' />, there is an open set <img src='http://l.wordpress.com/latex.php?latex=y%5Cin+U%5Csubset+S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in U\subset S' title='y\in U\subset S' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=p%28U%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p(U)' title='p(U)' class='latex' /> is open and <img src='http://l.wordpress.com/latex.php?latex=p%7C_U&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p|_U' title='p|_U' class='latex' /> is a homeomorphism.  In general, for any <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in S' title='x\in S' class='latex' />, the set <img src='http://l.wordpress.com/latex.php?latex=p%5E%7B-1%7D%28x%29%5Csubset+T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p^{-1}(x)\subset T' title='p^{-1}(x)\subset T' class='latex' /> is called the <em>stalk</em> over <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' />.  Note that, as a subspace of <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' />, any stalk is discrete.</p>
<p>As another example along similar lines, gluing the domains of the different branches of the complex logarithm gives rise to a sheaf over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}' title='\mathbb{C}' class='latex' />: (This image is from Wikipedia and is by <a href="http://commons.wikimedia.org/wiki/User:Jan_Homann">Jan Homann</a>.)</p>
<p><a href="http://en.wikipedia.org/wiki/File:Riemann_surface_log.jpg"><img class="alignnone" title="logarithm" src="http://upload.wikimedia.org/wikipedia/commons/thumb/4/41/Riemann_surface_log.jpg/220px-Riemann_surface_log.jpg" alt="" width="220" height="259" /></a></p>
<p>In this case, <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> is the space pictured, <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}' title='\mathbb{C}' class='latex' /> (or <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D%5E%2A+%3D+%5Cmathbb%7BC%7D-%5C%7B0%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}^* = \mathbb{C}-\{0\}' title='\mathbb{C}^* = \mathbb{C}-\{0\}' class='latex' />), and <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> is projection along the depicted vertical direction.  Note that this represents how the domains of the various branches of the complex logarithm fit together; this graph is <em>not</em> a depiction of the complex logarithm or any branch of the complex logarithm, which is a function from <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}' title='\mathbb{C}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}' title='\mathbb{C}' class='latex' />, and thus hard to draw!</p>
<p>The two examples I gave were of natural functions which happen to be multivalued, but there are much more general examples of étalé spaces.  For example, for any topological spaces <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=S%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S&#039;' title='S&#039;' class='latex' />, there is a sheaf of <em>all</em> continuous functions from <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=S%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S&#039;' title='S&#039;' class='latex' />!  The étalé space corresponding to this sheaf would have, for each <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in S' title='x\in S' class='latex' />, the stalk <img src='http://l.wordpress.com/latex.php?latex=p%5E%7B-1%7D%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p^{-1}(x)' title='p^{-1}(x)' class='latex' /> equal to the set of <a href="http://en.wikipedia.org/wiki/Germ_%28mathematics%29">germs</a> of continuous functions from <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=S%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S&#039;' title='S&#039;' class='latex' /> at <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' />. (See <a href="http://en.wikipedia.org/wiki/Sheaf_%28mathematics%29#The_.C3.A9tal.C3.A9_space_of_a_sheaf">here</a> for more).</p>
<h2>First-Order Modal Logic</h2>
<p>And now we&#8217;ll switch to a seemingly totally different topic.</p>
<p>A modal logic is a logic that contains an operator <img src='http://l.wordpress.com/latex.php?latex=%5CBox&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Box' title='\Box' class='latex' />, representing necessity, and <img src='http://l.wordpress.com/latex.php?latex=%5CDiamond&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Diamond' title='\Diamond' class='latex' />, representing possibility.  If <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> is a proposition, then <img src='http://l.wordpress.com/latex.php?latex=%5CBox+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Box p' title='\Box p' class='latex' /> should be interpreted as the proposition &#8220;It is necessary that <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' />&#8221;, or &#8220;<img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> is necessarily true&#8221;, and <img src='http://l.wordpress.com/latex.php?latex=%5CDiamond+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Diamond p' title='\Diamond p' class='latex' /> should be interpreted as the proposition &#8220;It is possible that <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> is true&#8221;.</p>
<p>The operators <img src='http://l.wordpress.com/latex.php?latex=%5CBox&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Box' title='\Box' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5CDiamond&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Diamond' title='\Diamond' class='latex' /> are dual to each other: <img src='http://l.wordpress.com/latex.php?latex=%5CBox+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Box p' title='\Box p' class='latex' /> is equivalent to <img src='http://l.wordpress.com/latex.php?latex=%5Cneg%5CDiamond%5Cneg+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\neg\Diamond\neg p' title='\neg\Diamond\neg p' class='latex' /> and similarly <img src='http://l.wordpress.com/latex.php?latex=%5CDiamond+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Diamond p' title='\Diamond p' class='latex' /> is equivalent to <img src='http://l.wordpress.com/latex.php?latex=%5Cneg%5CBox%5Cneg+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\neg\Box\neg p' title='\neg\Box\neg p' class='latex' /> (if you think about it, this actually makes real-life sense, as well as just sense in logic-land!).</p>
<p>There are a number of different <a href="http://en.wikipedia.org/wiki/Modal_logic#Axiomatic_systems">axiomatic systems</a> for propositional modal logic; here we&#8217;ll just consider S4, which was invented by <a href="http://en.wikipedia.org/wiki/C._I._Lewis">C. I. Lewis</a> in the early 20th century. It has the following rules:</p>
<ol>
<li>For all propositions <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5CBox+p%5Crightarrow+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Box p\rightarrow p' title='\Box p\rightarrow p' class='latex' /> is a theorem.</li>
<li>For all propositions <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5CBox+p%5Crightarrow+%5CBox%5CBox+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Box p\rightarrow \Box\Box p' title='\Box p\rightarrow \Box\Box p' class='latex' /> is a theorem.</li>
<li>For all propositions <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q' title='q' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5CBox+%28p%5Crightarrow+q%29%5Crightarrow+%28%5CBox+p+%5Crightarrow+%5CBox+q%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Box (p\rightarrow q)\rightarrow (\Box p \rightarrow \Box q)' title='\Box (p\rightarrow q)\rightarrow (\Box p \rightarrow \Box q)' class='latex' /> is a theorem.</li>
<li>For any proposition <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' />, if <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> is a theorem, then <img src='http://l.wordpress.com/latex.php?latex=%5CBox+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Box p' title='\Box p' class='latex' /> is a theorem.  (Note that this does <em>not </em>say that <img src='http://l.wordpress.com/latex.php?latex=p%5Crightarrow+%5CBox+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p\rightarrow \Box p' title='p\rightarrow \Box p' class='latex' /> is a theorem; if it did, the whole thing would be trivial!).</li>
</ol>
<p>In the middle of the 20th century, <a href="http://en.wikipedia.org/wiki/Saul_Kripke">Saul Kripke</a> invented possible-worlds semantics for modal logics.  The idea is that there is a set <img src='http://l.wordpress.com/latex.php?latex=W&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='W' title='W' class='latex' /> of possible worlds and at each possible world <img src='http://l.wordpress.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w' title='w' class='latex' />, each atomic proposition <img src='http://l.wordpress.com/latex.php?latex=A%2C+B%2C%5Cldots&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A, B,\ldots' title='A, B,\ldots' class='latex' /> may hold or not, independently of the other possible worlds.  Furthermore, there is a relation <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='R' title='R' class='latex' /> between possible worlds; <img src='http://l.wordpress.com/latex.php?latex=wRw%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='wRw&#039;' title='wRw&#039;' class='latex' /> should be interpreted as &#8220;World <img src='http://l.wordpress.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w' title='w' class='latex' /> considers world <img src='http://l.wordpress.com/latex.php?latex=w%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w&#039;' title='w&#039;' class='latex' /> possible&#8221;.  The whole setup is called a <em>Kripke frame</em>.</p>
<p>This gives a semantics for modal logic: For any proposition <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5CDiamond+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Diamond p' title='\Diamond p' class='latex' /> holds at a world <img src='http://l.wordpress.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w' title='w' class='latex' /> if there is some world <img src='http://l.wordpress.com/latex.php?latex=w%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w&#039;' title='w&#039;' class='latex' /> that <img src='http://l.wordpress.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w' title='w' class='latex' /> considers possible such that <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> holds at <img src='http://l.wordpress.com/latex.php?latex=w%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w&#039;' title='w&#039;' class='latex' />.  Similarly, <img src='http://l.wordpress.com/latex.php?latex=%5CBox+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Box p' title='\Box p' class='latex' /> holds at a world <img src='http://l.wordpress.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w' title='w' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> holds at <em>every</em> world <img src='http://l.wordpress.com/latex.php?latex=w%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w&#039;' title='w&#039;' class='latex' /> that <img src='http://l.wordpress.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w' title='w' class='latex' /> considers possible.</p>
<p>Somebody showed (probably Kripke, but I&#8217;m not sure) that the class of Kripke frames where <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='R' title='R' class='latex' /> is reflexive and transitive corresponds to S4, in the sense that all theorems of S4 hold in all such Kripke frames, and everything which holds in all such Kripke frames holds in S4.</p>
<p>Note that the accessibility relation is completely clear-cut and discrete: given a world <img src='http://l.wordpress.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w' title='w' class='latex' />, you know exactly what worlds <img src='http://l.wordpress.com/latex.php?latex=w%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w&#039;' title='w&#039;' class='latex' /> it considers possible.  But, interpreting <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='R' title='R' class='latex' /> as a measure of &#8220;closeness&#8221; of two worlds, observe that topology gives us a more nuanced version of what &#8220;closeness&#8221; means: we think of the topology on <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' />, for example, as defining what &#8220;closeness&#8221; means on <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' />, even though no two fixed real numbers are actually close to one another!  (At least, they&#8217;re not close to one another in any absolute sense.)</p>
<p>It turns out we can incorporate this sense of &#8220;closeness&#8221; into the semantics for modal logic as well.  We&#8217;ll define a topological Kripke model this way: We again have a set <img src='http://l.wordpress.com/latex.php?latex=W&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='W' title='W' class='latex' /> of possible worlds, but instead of defining a relation <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='R' title='R' class='latex' /> between them, we define a topology  on <img src='http://l.wordpress.com/latex.php?latex=W&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='W' title='W' class='latex' />.  As above, we say, for each possible world <img src='http://l.wordpress.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w' title='w' class='latex' />, which of the atomic propositions <img src='http://l.wordpress.com/latex.php?latex=A%2CB%2C%5Cldots&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A,B,\ldots' title='A,B,\ldots' class='latex' /> hold at that world.  Note that this is equivalent to defining arbitrary subsets <img src='http://l.wordpress.com/latex.php?latex=%5BA%5D%2C%5BB%5D%2C%5Cldots&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='[A],[B],\ldots' title='[A],[B],\ldots' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=W&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='W' title='W' class='latex' />.  In general, we&#8217;ll let <img src='http://l.wordpress.com/latex.php?latex=%5Bp%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='[p]' title='[p]' class='latex' /> be the set of worlds at which proposition <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> holds.  Then we can define <img src='http://l.wordpress.com/latex.php?latex=%5B%5CBox+p%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='[\Box p]' title='[\Box p]' class='latex' /> to be the interior of <img src='http://l.wordpress.com/latex.php?latex=%5Bp%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='[p]' title='[p]' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5B%5CDiamond+p%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='[\Diamond p]' title='[\Diamond p]' class='latex' /> to be the closure of <img src='http://l.wordpress.com/latex.php?latex=%5Bp%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='[p]' title='[p]' class='latex' />.  In other worlds, <img src='http://l.wordpress.com/latex.php?latex=%5CBox+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Box p' title='\Box p' class='latex' /> holds at a world <img src='http://l.wordpress.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w' title='w' class='latex' /> there is some open set <img src='http://l.wordpress.com/latex.php?latex=w%5Cin+U&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w\in U' title='w\in U' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> holds at every world <img src='http://l.wordpress.com/latex.php?latex=w%27%5Cin+U&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w&#039;\in U' title='w&#039;\in U' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=%5CDiamond+p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Diamond p' title='\Diamond p' class='latex' /> holds at a world <img src='http://l.wordpress.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w' title='w' class='latex' /> if for all open sets <img src='http://l.wordpress.com/latex.php?latex=w%5Cin+U&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w\in U' title='w\in U' class='latex' /> there is some <img src='http://l.wordpress.com/latex.php?latex=w%27%5Cin+U&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w&#039;\in U' title='w&#039;\in U' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> holds at <img src='http://l.wordpress.com/latex.php?latex=w%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='w&#039;' title='w&#039;' class='latex' />.</p>
<p>It turns out that this semantics, too corresponds to S4, in the sense that all theorems of S4 hold in all topological Kripke models and all propositions which hold in all topological Kripke models are theorems of S4.  (I&#8217;m not exactly sure who first came up with this.  I looked at the Wikipedia article and some of its links, but I couldn&#8217;t quite figure out who was the first.  It seems Tarski was involved somehow anyway.)</p>
<p>So far, the logics we&#8217;ve considered have all been propositional, but you can easily add first-order logic to S4 to get FOS4.  There have been a number of proposals for how to define a semantics for FOS4.  In <a href="http://www.andrew.cmu.edu/user/awodey/preprints/FoS4.phil.pdf">2008</a>, <a href="http://www.andrew.cmu.edu/user/awodey/">Steve Awodey</a> and <a href="http://www.pitt.edu/~kok6/">Kohei Kishida</a> proposed that the right generalization of topological semantics for S4 to first-order semantics for FOS4 was to use étalé spaces!</p>
<p>Here&#8217;s an example of how it works.  Consider the following étalé space:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/03/sheaf2.png"><img class="alignnone size-full wp-image-474" title="sheaf2" src="http://xorshammer.files.wordpress.com/2010/03/sheaf2.png?w=270&#038;h=362" alt="" width="270" height="362" /></a></p>
<p>The top space, <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' />, is homeomorphic to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E%2B+%3D+%5C%7By%5Cin%5Cmathbb%7BR%7D%5Cmid+y+%3E+0%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^+ = \{y\in\mathbb{R}\mid y &gt; 0\}' title='\mathbb{R}^+ = \{y\in\mathbb{R}\mid y &gt; 0\}' class='latex' />.  The bottom space, <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' />, is the circle <img src='http://l.wordpress.com/latex.php?latex=S%5E1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^1' title='S^1' class='latex' />.  The projection <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> is given by <img src='http://l.wordpress.com/latex.php?latex=p%28y%29+%3D+%5Clangle+%5Ccos%282%5Cpi+y%29%2C%5Csin%282%5Cpi+y%29%5Crangle&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p(y) = \langle \cos(2\pi y),\sin(2\pi y)\rangle' title='p(y) = \langle \cos(2\pi y),\sin(2\pi y)\rangle' class='latex' />.</p>
<p>We can put a first-order structure on this étalé space by considering each stalk <img src='http://l.wordpress.com/latex.php?latex=p%5E%7B-1%7D%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p^{-1}(x)' title='p^{-1}(x)' class='latex' /> for <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in S' title='x\in S' class='latex' /> to be a first-order universe, just as when defining regular semantics for first-order logic.</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/03/sheaf2_line.png"><img class="alignnone size-full wp-image-476" title="sheaf2_line" src="http://xorshammer.files.wordpress.com/2010/03/sheaf2_line.png?w=246&#038;h=362" alt="" width="246" height="362" /></a></p>
<p>We can define the interpretation of relation symbols, function symbols, and constant symbols more or less arbitrarily:  The only restrictions are that the selection of the interpretation of any particular constant symbol from each stalk must be done in a continuous manner, and there is a similar continuity condition on the interpretation of function symbols.</p>
<p>It will turn out that the interpretation of a formula <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> with free variables <img src='http://l.wordpress.com/latex.php?latex=v_1%2C%5Cldots%2Cv_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='v_1,\ldots,v_n' title='v_1,\ldots,v_n' class='latex' /> will be a subset of <img src='http://l.wordpress.com/latex.php?latex=T%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T^n' title='T^n' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=T%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T^n' title='T^n' class='latex' /> is the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />th fibered product over <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> with itself.  (This just means that, e.g., <img src='http://l.wordpress.com/latex.php?latex=T%5Ctimes+T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T\times T' title='T\times T' class='latex' /> is the étalé space where the stalk of any <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=p%5E%7B-1%7D%28x%29%5Ctimes+p%5E%7B-1%7D%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p^{-1}(x)\times p^{-1}(x)' title='p^{-1}(x)\times p^{-1}(x)' class='latex' />.).  In particular, as in the regular topological semantics, the interpretation of a sentence will be a subset of <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' />.</p>
<p>The crucial step, as before, is that the interpretation of <img src='http://l.wordpress.com/latex.php?latex=%5CBox+%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Box \phi' title='\Box \phi' class='latex' /> is the interior of the interpretation of <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' />, and the interpretation of <img src='http://l.wordpress.com/latex.php?latex=%5CDiamond+%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Diamond \phi' title='\Diamond \phi' class='latex' /> is the closure of the interpretation of <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' />, although now the interiors and closures are being taken in the topology of <img src='http://l.wordpress.com/latex.php?latex=T%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T^n' title='T^n' class='latex' />.</p>
<p>To see how this works, suppose that in our example we define the relation <img src='http://l.wordpress.com/latex.php?latex=%5Cleq&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\leq' title='\leq' class='latex' /> on each stalk by restricting the usual ordering <img src='http://l.wordpress.com/latex.php?latex=%5Cleq&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\leq' title='\leq' class='latex' /> on <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E%2B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^+' title='\mathbb{R}^+' class='latex' />.  Then it is true in our model that <img src='http://l.wordpress.com/latex.php?latex=%5Cexists+x%5Cforall+y%5C%2C+x%5Cleq+y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\exists x\forall y\, x\leq y' title='\exists x\forall y\, x\leq y' class='latex' /> (since it is true in every stalk), but it is not true that <img src='http://l.wordpress.com/latex.php?latex=%5Cexists+x%5CBox%5Cforall+y%5C%2C+x%5Cleq+y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\exists x\Box\forall y\, x\leq y' title='\exists x\Box\forall y\, x\leq y' class='latex' />.  The reason is that in this stalk:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/03/sheaf2_line2.png"><img class="alignnone size-full wp-image-480" title="sheaf2_line2" src="http://xorshammer.files.wordpress.com/2010/03/sheaf2_line2.png?w=226&#038;h=362" alt="" width="226" height="362" /></a></p>
<p>the red dot is the minimum element of the stalk, but if you push it to the right just a little bit, that&#8217;s no longer true.  Intuitively, it&#8217;s the minimum element, but not <em>necessarily </em>so.</p>
<p>Awodey and Kishida prove that, as in the other cases, this sheaf semantics corresponds to FOS4 in that every theorem of FOS4 holds in all sheaf models, and everything which holds in all sheaf models is a theorem of FOS4.  (They actually prove something a bit stronger than that.)</p>
<h3>Other Uses of Sheaves in Logic</h3>
<p>This relationship between sheaves and logic came up relatively recently, but there is a longer and more well-known relationship which I&#8217;ll just mention briefly, namely through <a href="http://en.wikipedia.org/wiki/Topos#Elementary_toposes_.28toposes_in_logic.29">toposes</a>.  Using topos theory, you can interpret the class of sheaves over a topological space <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> as a category similar to the category of sets.  The interpretation is actually fairly similar to the interpretation here, but instead of interpreting modal logic, higher-order (non-modal) logic is interpreted.  It turns out that the set of truth values in this interpretation is the set of open subsets of <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> (whereas in the modal interpretation just given, the truth values could be arbitrary subsets of <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' />).  One of the remarkable things about this is that the set of open subsets of <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> can <em>itself </em>be interpreted as an étalé space over <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' />, which is what allows the equivalent of power sets to be taken in this category.</p>
<p>For more information on this, see <a href="http://ncatlab.org/nlab/show/Sheaves+in+Geometry+and+Logic">Sheaves in Geometry and Logic</a>.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/433/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/433/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/433/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/433/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/433/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/433/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/433/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/433/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/433/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/433/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/433/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/433/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/433/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/433/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=433&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2010/03/13/topology-and-first-order-modal-logic/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/03/sqrt.png" medium="image">
			<media:title type="html">sqrt</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/03/sqrt_zero.png" medium="image">
			<media:title type="html">sqrt_zero</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/03/y.png" medium="image">
			<media:title type="html">y</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/03/sheaf11.png" medium="image">
			<media:title type="html">sheaf1</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/03/sheaf1_branch1.png" medium="image">
			<media:title type="html">sheaf1_branch</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/03/sheaf1_branch21.png" medium="image">
			<media:title type="html">sheaf1_branch2</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/03/sheaf1_branch31.png" medium="image">
			<media:title type="html">sheaf1_branch3</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/03/sheaf1_open1.png" medium="image">
			<media:title type="html">sheaf1_open1</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/03/sheaf1_open2.png" medium="image">
			<media:title type="html">sheaf1_open2</media:title>
		</media:content>

		<media:content url="http://upload.wikimedia.org/wikipedia/commons/thumb/4/41/Riemann_surface_log.jpg/220px-Riemann_surface_log.jpg" medium="image">
			<media:title type="html">logarithm</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/03/sheaf2.png" medium="image">
			<media:title type="html">sheaf2</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/03/sheaf2_line.png" medium="image">
			<media:title type="html">sheaf2_line</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/03/sheaf2_line2.png" medium="image">
			<media:title type="html">sheaf2_line2</media:title>
		</media:content>
	</item>
		<item>
		<title>Two Interesting Observations about Voting I Hadn&#8217;t Seen Until Recently</title>
		<link>http://xorshammer.com/2010/02/22/two-interesting-observations-about-voting-i-hadnt-seen-until-recently/</link>
		<comments>http://xorshammer.com/2010/02/22/two-interesting-observations-about-voting-i-hadnt-seen-until-recently/#comments</comments>
		<pubDate>Tue, 23 Feb 2010 01:32:43 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.com/?p=424</guid>
		<description><![CDATA[By &#8220;voting&#8221;, I mean the following general problem:  Suppose there are candidates and voters.  Each voter produces a total ordering of all candidates.  A voting procedure is a function which takes as input all orderings, and produces an output ranking of all candidates.  Arrow&#8217;s impossibility theorem states that there is really no satisfactory voting procedure [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=424&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>By &#8220;voting&#8221;, I mean the following general problem:  Suppose there are <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> candidates and <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' /> voters.  Each voter produces a total ordering of all <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> candidates.  A voting procedure is a function which takes as input all <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' /> orderings, and produces an output ranking of all <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> candidates.  <a href="http://en.wikipedia.org/wiki/Arrow%27s_impossibility_theorem">Arrow&#8217;s impossibility theorem</a> states that there is really no satisfactory voting procedure when the number of candidates is greater than 2 (majority rule is a good voting procedure when there are two candidates).</p>
<p><span id="more-424"></span></p>
<p><strong>Observation #1</strong> (which I read in Chapter 23 of David Easley and Jon Kleinberg&#8217;s book <a href="http://www.cs.cornell.edu/home/kleinber/networks-book/">Networks, Crowds, and Markets</a>): <strong>Voters will vote strategically (i.e., they will lie) even when they have a common goal.</strong></p>
<p>In the setup above where each voter has a set of personal preferences and voters are essentially competing with other voters who have different preferences, it is easy to come up with situations where it would be advantageous for a voter to lie.  For example, if a voter&#8217;s true rankings are <img src='http://l.wordpress.com/latex.php?latex=A+%3E+B+%3E+C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A &gt; B &gt; C' title='A &gt; B &gt; C' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='C' title='C' class='latex' /> are candidates, but <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> has a much better chance of winning than <img src='http://l.wordpress.com/latex.php?latex=C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='C' title='C' class='latex' /> does, it may be advantageous for the voter to submit a ranking of <img src='http://l.wordpress.com/latex.php?latex=A+%3E+C+%3E+B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A &gt; C &gt; B' title='A &gt; C &gt; B' class='latex' /> if she wants to maximize the chance that <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> comes out on top.</p>
<p>However, in a situation where every voter has the same goals, but they have different private information (and it&#8217;s impossible or infeasible for them to share their private information with each other), it seems like there&#8217;s never a reason for a voter to lie.  But there is, even when there are only two alternatives that are being voted on.</p>
<p>Consider the following game:  There is a vase filled with marbles.  Either it has 10 white marbles (call this state <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' />) or 5 white marbles and 5 green marbles (call this state <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' />).  Which of state <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> or state <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> holds was determined by flipping a fair coin before the game started (and this fact is common knowledge).  Each of three voters independently and without communication draws one marble at random from the vase, observes its color, puts it back, and then votes on whether or not <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> holds.  The voters win if a majority guesses right and lose otherwise.</p>
<p>As you can work out: if you draw a white marble, you believe state <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> holds with probability <img src='http://l.wordpress.com/latex.php?latex=2%2F3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2/3' title='2/3' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> holds with probability <img src='http://l.wordpress.com/latex.php?latex=1%2F3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1/3' title='1/3' class='latex' />.  If you draw a green marble, you believe state <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> holds with probability <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='0' title='0' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> holds with probability <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1' title='1' class='latex' /> (you are sure that state <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> holds).</p>
<p>However, voters will not vote their true beliefs:  Suppose they did, and consider whether a fixed voter has an incentive to deviate from this strategy (i.e., consider whether all voters voting their true beliefs is a Nash equilibrium).  When you draw a green marble, you should definitely vote for state <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' />.  But what should you do when you draw a white marble?  The key question is: when will your vote make a difference?  Only when one other voter votes <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> and the other voter votes <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' />.  But because they are voting sincerely, the voter who voted <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> must have drawn a green marble and, therefore she must be right!  So, you should vote <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> as well.  That is, if you think that the other two voters are voting sincerely, you should disregard the information you get from observing a marble, and always vote <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' />!</p>
<p>In the Easley-Kleinberg chapter, the authors also demonstrate a version of this with juries, where the vote must be unanimous to convict a defendant, and otherwise she will be acquitted.  A similar situation happens: assume that you are thinking of voting to acquit.  Under what circumstances will your vote make a difference?  Only when every other juror has voted to convict.  In that circumstance, it is quite likely that you are wrong, and the defendant was guilty, and thus you should have voted to convict no matter what!   (This shows that everyone voting sincerely is not an equilibrium but doesn&#8217;t show what the equilibria of this game are.  According to the authors, finding the equilibria is quite difficult.)</p>
<p><strong>Observation #2 </strong>(which I saw in a <a href="http://www.votingmatters.org.uk/ISSUE26/I26P3.pdf">paper</a> by Roger Sewell, <a href="http://www.inference.phy.cam.ac.uk/mackay/Decision.html">David Mackay</a>, and Ian McLean): <strong>Maximizing the Entropy of the Outcome of Voting Leads to Good Results</strong></p>
<p>As I alluded to above, Arrow&#8217;s impossibility theorem says that there&#8217;s no satisfactory way to provide an output ranking of candidates given the input rankings from each voter (here we will assume that we simply know each voter&#8217;s true ranking and not consider strategic voting).  However, this just applies to <em>deterministic </em>voting procedures: it is, in fact, quite easy to come up with a <em>probabilistic </em>voting procedure satisfying all of the hypotheses of Arrow&#8217;s impossibility theorem: just pick a voter uniformly at random and take their preferences to be the output ranking!</p>
<p>Aside from the fact that it seems unlikely that the general public would accept such explicit randomization in the voting procedure any time soon, this process (which is called Random Dictator) has a couple of other negative aspects.  First of all, it can easily lead to extreme outcomes:  Suppose that there are 20 candidates, and the top 10 in the output ranking will be given various positions in the government.  Suppose 10 candidates are from one political party, and 10 are from the other, and further that the populace is highly polarized: everyone ranks their party&#8217;s 10 candidates strictly better than each of the 10 candidates of the other party.  Then it is guaranteed that a single-party government will be the result. What might be better is 10 officials chosen from a mix of the two parties according to each party&#8217;s representation in the voting public.</p>
<p>Another way in which Random Dictator doesn&#8217;t compromise very well is the following:  Suppose there is heavy contention between candidates <img src='http://l.wordpress.com/latex.php?latex=A_1%2C%5Cldots%2C+A_%7B20%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_1,\ldots, A_{20}' title='A_1,\ldots, A_{20}' class='latex' /> from the top rank among voters, but there is a candidate <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> that is everyone&#8217;s second choice.   Then under Random Dictator there is zero chance that <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> will be top-ranked in the output-ranking, even though it intuitively seems that <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> is more generally liked by the population than any of <img src='http://l.wordpress.com/latex.php?latex=A_1%5Cldots%2CA_%7B20%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_1\ldots,A_{20}' title='A_1\ldots,A_{20}' class='latex' />.</p>
<p>The authors of the paper fix these problems by proposing the following procedure:  For each pair of candidates <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' />, record the proportion <img src='http://l.wordpress.com/latex.php?latex=p_%7BA%2CB%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p_{A,B}' title='p_{A,B}' class='latex' /> of the voting population which prefers <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' />.  Now, from all probability distributions over output rankings such that, for each pair of candidates <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' />, the probability of <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> being ranked higher than <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> in the output ranking is <img src='http://l.wordpress.com/latex.php?latex=p_%7BA%2CB%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p_{A,B}' title='p_{A,B}' class='latex' />, choose the one that has <a href="http://en.wikipedia.org/wiki/Maximum_entropy_probability_distribution">maximum entropy</a>.  Then choose an output ranking according to that distribution.</p>
<p>I won&#8217;t define maximum entropy here, but I will give a few examples.  The idea that there is a number called the entropy associated with every probability distribution, and furthermore that if you are looking for a probability distribution in a certain class, but don&#8217;t know anything about it except that it is in that class, then the &#8220;right&#8221; distribution to take is the one that maximizes the entropy (obviously this is an unprovable assertion).  In some sense, choosing the maximum entropy distribution from a class codifies the fact that you know nothing about it except that it is in that class.</p>
<p>For example, the maximum entropy probability distribution over the set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2C2%2C3%2C4%2C5%2C6%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{1,2,3,4,5,6\}' title='\{1,2,3,4,5,6\}' class='latex' /> is the uniform distribution, which assigns probability <img src='http://l.wordpress.com/latex.php?latex=1%2F6&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1/6' title='1/6' class='latex' /> to each number.  Fixing a <img src='http://l.wordpress.com/latex.php?latex=%5Cmu%5Cin%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mu\in\mathbb{R}' title='\mu\in\mathbb{R}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Csigma%5E2+%3E+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma^2 &gt; 0' title='\sigma^2 &gt; 0' class='latex' />, the maximum entropy probability distribution in the set of probability distributions over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' /> with mean <img src='http://l.wordpress.com/latex.php?latex=%5Cmu&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mu' title='\mu' class='latex' /> and variance <img src='http://l.wordpress.com/latex.php?latex=%5Csigma%5E2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma^2' title='\sigma^2' class='latex' /> is a Gaussian distribution.  Fixing <img src='http://l.wordpress.com/latex.php?latex=%5Cmu+%3E+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mu &gt; 0' title='\mu &gt; 0' class='latex' />, the maximum entropy distribution over the positive reals with mean <img src='http://l.wordpress.com/latex.php?latex=%5Cmu&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mu' title='\mu' class='latex' /> is the exponential distribution with mean <img src='http://l.wordpress.com/latex.php?latex=%5Cmu&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mu' title='\mu' class='latex' />.</p>
<p>Basically, what the authors have proposed is that the actual total ranking of each voter are not important (or else we would have the first problem mentioned above), and in particular which candidate each voter happened to place in the top rank is not important (or else we would have the second problem mentioned above); the only thing that&#8217;s really important is getting the correct proportions of the pairwise rankings right.  And the way to get a distribution on output rankings which reflects nothing except for the constraints on the pairwise rankings is to pick the maximum entropy distribution satisfying those constraints.</p>
<p>The beauty of this is that if you disagree with them, that&#8217;s fine: all you have to do is figure out what <em>you </em>think is the important information to preserve from the voters&#8217; rankings, then pick the output distribution which maximizes entropy among those satisfying those rankings, and that will be the &#8220;right&#8221; output for your choice of what&#8217;s important, in the sense that it will not &#8220;take into account&#8221; anything that you don&#8217;t think is important.  To take a trivial example, if you decide that the total ranking of each voter is important, then the maximum entropy distribution on output rankings will degenerate into the Random Dictator process.</p>
<p>The authors additionally ran simulations of elections using various voting procedures in order to verify that the maximum entropy voting scheme they propose is &#8220;better&#8221; (in some senses they define) than others.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/424/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/424/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/424/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/424/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/424/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/424/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/424/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/424/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/424/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/424/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/424/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/424/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/424/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/424/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=424&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2010/02/22/two-interesting-observations-about-voting-i-hadnt-seen-until-recently/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>Quantish Physics: A Discrete Model of Quantum Physics</title>
		<link>http://xorshammer.com/2010/02/17/quantish-physics-a-discrete-model-of-quantum-physics/</link>
		<comments>http://xorshammer.com/2010/02/17/quantish-physics-a-discrete-model-of-quantum-physics/#comments</comments>
		<pubDate>Wed, 17 Feb 2010 05:11:12 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.com/?p=373</guid>
		<description><![CDATA[In the book Good and Real, author Gary Drescher, who received his PhD from MIT&#8217;s AI lab, defends the view that determinism is a consistent and coherent view of the world.   In doing so, he enters many different arenas: ethics, decision theory, and physics. In his chapter on quantum mechanics, he defends the &#8220;many-worlds&#8221; interpretation [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=373&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In the book <a href="http://books.google.com/books?id=3wrXAAAAMAAJ&amp;q=good+and+real&amp;dq=good+and+real&amp;cd=1">Good and Real</a>, author Gary Drescher, who received his PhD from MIT&#8217;s AI lab, defends the view that determinism is a consistent and coherent view of the world.   In doing so, he enters many different arenas: ethics, decision theory, and physics.</p>
<p>In his chapter on quantum mechanics, he defends the &#8220;many-worlds&#8221; interpretation (although he doesn&#8217;t think the term accurately describes the concept) versus the Copenhagen interpretation.  In the process of doing so, he does something I thought was extraordinary:  he comes up with a simple model of quantum mechanics in which all of the standard concepts you read about: the two-slit experiment, the Heisenberg uncertainty principle, etc., are represented.  This model requires no prerequisites from physics and actually uses almost totally discrete mathematics!</p>
<p>(Edit: I somehow missed this when originally writing this post, but Drescher also outlines quantish physics in an <a href="http://dspace.mit.edu/bitstream/handle/1721.1/6486/AIM-1026a.pdf?sequence=2">online paper</a>.)</p>
<p>I&#8217;ll sketch it below.</p>
<p><span id="more-373"></span></p>
<p>The first step is to define the &#8220;classical&#8221; version of our physics, which we will then tweak to get the quantum version.  The &#8220;topology&#8221; of our universe will be given by a finite directed graph <img src='http://l.wordpress.com/latex.php?latex=%28V%2CE%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(V,E)' title='(V,E)' class='latex' /> where each vertex has three edges coming in, and three edges going out.  There is given a bijection <img src='http://l.wordpress.com/latex.php?latex=f%5Ccolon+E%5Cto+E&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f\colon E\to E' title='f\colon E\to E' class='latex' /> such that if edge <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e' title='e' class='latex' /> is directed in to vertex <img src='http://l.wordpress.com/latex.php?latex=v&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='v' title='v' class='latex' />, edge <img src='http://l.wordpress.com/latex.php?latex=f%28e%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(e)' title='f(e)' class='latex' /> is directed out of vertex <img src='http://l.wordpress.com/latex.php?latex=v&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='v' title='v' class='latex' />.  Given this bijection, you can think of each edge as actually a piece of a wire: a directed loop in the graph.  Finally, we require that each edge is labeled either <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bcontrol%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{control}' title='\mathrm{control}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bswitch%7D%281%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{switch}(1)' title='\mathrm{switch}(1)' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bswitch%7D%28-1%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{switch}(-1)' title='\mathrm{switch}(-1)' class='latex' /> so that each triple of in-edges to a given vertex gets a distinct label.</p>
<p>We can picture vertices like this:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/02/quantish1.png"><img class="alignnone size-medium wp-image-400" title="quantish" src="http://xorshammer.files.wordpress.com/2010/02/quantish1.png?w=300&#038;h=132" alt="" width="300" height="132" /></a></p>
<p>The vertex is represented by the big box in the center.  We will always put the control edge at the top, and the two switch edges at the bottom.  Note that the edge <img src='http://l.wordpress.com/latex.php?latex=f%28e%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(e)' title='f(e)' class='latex' /> need not have the same label as <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e' title='e' class='latex' />.</p>
<p>Particles inhabit edges.  If <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='P' title='P' class='latex' /> is the set of particles and <img src='http://l.wordpress.com/latex.php?latex=E&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E' title='E' class='latex' /> is the set of edges, then a point in <img src='http://l.wordpress.com/latex.php?latex=E%5EP&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E^P' title='E^P' class='latex' /> determines the position of each particle.  The set <img src='http://l.wordpress.com/latex.php?latex=E%5EP&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E^P' title='E^P' class='latex' /> is thus called (classical) configuration space.  Time in this universe is discrete; to describe how the system evolves, we just have to define the successor function <img src='http://l.wordpress.com/latex.php?latex=S%5Ec%5Ccolon+E%5EP%5Cto+E%5EP&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^c\colon E^P\to E^P' title='S^c\colon E^P\to E^P' class='latex' /> which tells how the system progresses one time step (the <img src='http://l.wordpress.com/latex.php?latex=c&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='c' title='c' class='latex' /> superscript stands for &#8220;classical&#8221;).</p>
<blockquote><p>For any edge <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e' title='e' class='latex' /> and label <img src='http://l.wordpress.com/latex.php?latex=l%5Cin%5C%7B%5Cmathrm%7Bcontrol%7D%2C%5Cmathrm%7Bswitch%7D%281%29%2C%5Cmathrm%7Bswitch%7D%28-1%29%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='l\in\{\mathrm{control},\mathrm{switch}(1),\mathrm{switch}(-1)\}' title='l\in\{\mathrm{control},\mathrm{switch}(1),\mathrm{switch}(-1)\}' class='latex' />, we let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bedge%7D%28e%2Cl%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{edge}(e,l)' title='\mathrm{edge}(e,l)' class='latex' /> be the edge with same destination vertex as <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e' title='e' class='latex' /> and with label <img src='http://l.wordpress.com/latex.php?latex=l&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='l' title='l' class='latex' />.</p>
<p>For a configuration <img src='http://l.wordpress.com/latex.php?latex=s%5Cin+E%5EP&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s\in E^P' title='s\in E^P' class='latex' /> and edge <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e' title='e' class='latex' />, we let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bhas%5C_particle%7D%28s%2Ce%29+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{has\_particle}(s,e) = 1' title='\mathrm{has\_particle}(s,e) = 1' class='latex' /> if edge <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e' title='e' class='latex' /> does not have a particle in configuration <img src='http://l.wordpress.com/latex.php?latex=s&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s' title='s' class='latex' />, and we let it be <img src='http://l.wordpress.com/latex.php?latex=-1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='-1' title='-1' class='latex' /> if it does.</p>
<p>Now we define <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> as follows: For every edge <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e' title='e' class='latex' /> labeled <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bcontrol%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{control}' title='\mathrm{control}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=S%28e%29+%3D+f%28e%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S(e) = f(e)' title='S(e) = f(e)' class='latex' />.  For every edge <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e' title='e' class='latex' /> labeled <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bswitch%28i%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{switch(i)}' title='\mathrm{switch(i)}' class='latex' />,</p>
<p><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle%7BS%28e%29+%3D+f%28%5Cmathrm%7Bedge%7D%28e%2C%5Cmathrm%7Bswitch%7D%28i%5Ccdot%5Cmathrm%7Bhas%5C_particle%7D%28%5Cmathrm%7Bedge%7D%28e%2C%5Cmathrm%7Bcontrol%7D%29%29%29%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\displaystyle{S(e) = f(\mathrm{edge}(e,\mathrm{switch}(i\cdot\mathrm{has\_particle}(\mathrm{edge}(e,\mathrm{control}))))}' title='\displaystyle{S(e) = f(\mathrm{edge}(e,\mathrm{switch}(i\cdot\mathrm{has\_particle}(\mathrm{edge}(e,\mathrm{control}))))}' class='latex' /></p></blockquote>
<p>In other words, particles on a control edge always go straight along whatever loop they are on.  However, particles on a switch edge may or may not cross over to the loop of the other switch edge, depending on whether or not there is a particle on the control edge (hence the names).</p>
<p>For example, this configuration:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/02/quantish2.png"><img class="alignnone size-medium wp-image-402" title="quantish2" src="http://xorshammer.files.wordpress.com/2010/02/quantish2.png?w=300&#038;h=132" alt="" width="300" height="132" /></a></p>
<p>turns into this configuration:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/02/quantish3.png"><img class="alignnone size-medium wp-image-403" title="quantish3" src="http://xorshammer.files.wordpress.com/2010/02/quantish3.png?w=300&#038;h=132" alt="" width="300" height="132" /></a></p>
<p>and this configuration:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/02/quantish4.png"><img class="alignnone size-medium wp-image-404" title="quantish4" src="http://xorshammer.files.wordpress.com/2010/02/quantish4.png?w=300&#038;h=132" alt="" width="300" height="132" /></a></p>
<p>turns into this configuration:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/02/quantish5.png"><img class="alignnone size-medium wp-image-405" title="quantish5" src="http://xorshammer.files.wordpress.com/2010/02/quantish5.png?w=300&#038;h=132" alt="" width="300" height="132" /></a></p>
<p>Now for the &#8220;quantum&#8221; variation, which Drescher calls quantish physics.  In this case, each particle now has a <em>sign</em> (<img src='http://l.wordpress.com/latex.php?latex=%2B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='+' title='+' class='latex' />/<img src='http://l.wordpress.com/latex.php?latex=-&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='-' title='-' class='latex' />) attached to it.  Furthermore, each vertex <img src='http://l.wordpress.com/latex.php?latex=v&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='v' title='v' class='latex' /> has an angle <img src='http://l.wordpress.com/latex.php?latex=v_%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='v_\theta' title='v_\theta' class='latex' /> associated with it, called <img src='http://l.wordpress.com/latex.php?latex=v&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='v' title='v' class='latex' />&#8216;s <em>measurement angle</em>.  The classical configuration space was <img src='http://l.wordpress.com/latex.php?latex=E%5EP&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E^P' title='E^P' class='latex' />; the quantum configuration space <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='Q' title='Q' class='latex' /> will be the set of all formal linear combinations <img src='http://l.wordpress.com/latex.php?latex=%5Csum_i+z_i+s_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sum_i z_i s_i' title='\sum_i z_i s_i' class='latex' /> of states <img src='http://l.wordpress.com/latex.php?latex=s_i+%5Cin+E%5EP&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s_i \in E^P' title='s_i \in E^P' class='latex' />.  Given a state <img src='http://l.wordpress.com/latex.php?latex=%5Csum_i+z_i+s_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sum_i z_i s_i' title='\sum_i z_i s_i' class='latex' />, the number <img src='http://l.wordpress.com/latex.php?latex=%7Cz_i%7C%5E2+%2F+%5Csum_j+%7Cz_j%7C%5E2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='|z_i|^2 / \sum_j |z_j|^2' title='|z_i|^2 / \sum_j |z_j|^2' class='latex' /> can reasonably be interpreted as the probability of being in state <img src='http://l.wordpress.com/latex.php?latex=s_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s_i' title='s_i' class='latex' /> (see Drescher for more comment on this).</p>
<p>The task now is to describe the successor function <img src='http://l.wordpress.com/latex.php?latex=S%5Eq%5Ccolon+Q%5Cto+Q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^q\colon Q\to Q' title='S^q\colon Q\to Q' class='latex' /> describing how the universe evolves through time. First some preliminary definitions:</p>
<blockquote><p>Given a nonzero complex number <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='z' title='z' class='latex' />, an angle <img src='http://l.wordpress.com/latex.php?latex=%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\theta' title='\theta' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=i%5Cin%5C%7B-1%2C1%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i\in\{-1,1\}' title='i\in\{-1,1\}' class='latex' />, let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bsplit%7D%28%5Ctheta%2Cz%2Ci%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{split}(\theta,z,i)' title='\mathrm{split}(\theta,z,i)' class='latex' /> be the component of <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='z' title='z' class='latex' /> which is parallel to <img src='http://l.wordpress.com/latex.php?latex=%5Carg%28z%29+%2B+%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\arg(z) + \theta' title='\arg(z) + \theta' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=i+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i = 1' title='i = 1' class='latex' /> and the component of <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='z' title='z' class='latex' /> which is perpendicular to <img src='http://l.wordpress.com/latex.php?latex=%5Carg%28z%29+%2B+%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\arg(z) + \theta' title='\arg(z) + \theta' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=i+%3D+-1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i = -1' title='i = -1' class='latex' />.</p>
<p>Note that <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%5Cin%5C%7B-1%2C1%5C%7D%7D%5Cmathrm%7Bsplit%7D%28%5Ctheta%2Cz%2Ci%29+%3D+z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sum_{i\in\{-1,1\}}\mathrm{split}(\theta,z,i) = z' title='\sum_{i\in\{-1,1\}}\mathrm{split}(\theta,z,i) = z' class='latex' /> and, due to the Pythagorean theorem, <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%5Cin%5C%7B-1%2C1%5C%7D%7D%7C%5Cmathrm%7Bsplit%7D%28%5Ctheta%2Cz%2Ci%29%7C%5E2+%3D+%7Cz%7C%5E2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sum_{i\in\{-1,1\}}|\mathrm{split}(\theta,z,i)|^2 = |z|^2' title='\sum_{i\in\{-1,1\}}|\mathrm{split}(\theta,z,i)|^2 = |z|^2' class='latex' />.</p>
<p>If, furthermore, <img src='http://l.wordpress.com/latex.php?latex=j%5Cin%5C%7B-1%2C1%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j\in\{-1,1\}' title='j\in\{-1,1\}' class='latex' />, let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bsplit%7D%28%5Ctheta%2Cz%2Ci%2Cj%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{split}(\theta,z,i,j)' title='\mathrm{split}(\theta,z,i,j)' class='latex' /> be the component of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bsplit%7D%28%5Ctheta%2Cz%2Ci%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{split}(\theta,z,i)' title='\mathrm{split}(\theta,z,i)' class='latex' /> which is parallel to <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='z' title='z' class='latex' /> (note: <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='z' title='z' class='latex' />, not <img src='http://l.wordpress.com/latex.php?latex=%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\theta' title='\theta' class='latex' /> as above) if <img src='http://l.wordpress.com/latex.php?latex=j+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j = 1' title='j = 1' class='latex' /> and perpendicular to <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='z' title='z' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=j+%3D+-1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j = -1' title='j = -1' class='latex' />.</p>
<p>As above <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%2Cj%5Cin%5C%7B-1%2C1%5C%7D%7D%5Cmathrm%7Bsplit%7D%28%5Ctheta%2Cz%2Ci%2Cj%29+%3D+w&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sum_{i,j\in\{-1,1\}}\mathrm{split}(\theta,z,i,j) = w' title='\sum_{i,j\in\{-1,1\}}\mathrm{split}(\theta,z,i,j) = w' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%2Cj%5Cin%5C%7B-1%2C1%5C%7D%7D%7C%5Cmathrm%7Bsplit%7D%28%5Ctheta%2Cz%2Ci%2Cj%29%7C%5E2+%3D+%7Cw%7C%5E2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sum_{i,j\in\{-1,1\}}|\mathrm{split}(\theta,z,i,j)|^2 = |w|^2' title='\sum_{i,j\in\{-1,1\}}|\mathrm{split}(\theta,z,i,j)|^2 = |w|^2' class='latex' />.  Similarly, <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bj%5Cin%5C%7B-1%2C1%5C%7D%7D%5Cmathrm%7Bsplit%7D%28%5Ctheta%2Cz%2Ci%2Cj%29+%3D+%5Cmathrm%7Bsplit%7D%28%5Ctheta%2Cz%2Ci%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sum_{j\in\{-1,1\}}\mathrm{split}(\theta,z,i,j) = \mathrm{split}(\theta,z,i)' title='\sum_{j\in\{-1,1\}}\mathrm{split}(\theta,z,i,j) = \mathrm{split}(\theta,z,i)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bj%5Cin%5C%7B-1%2C1%5C%7D%7D%7C%5Cmathrm%7Bsplit%7D%28%5Ctheta%2Cz%2Ci%2Cj%29%7C%5E2+%3D+%7C%5Cmathrm%7Bsplit%7D%28%5Ctheta%2Cz%2Ci%29%7C%5E2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sum_{j\in\{-1,1\}}|\mathrm{split}(\theta,z,i,j)|^2 = |\mathrm{split}(\theta,z,i)|^2' title='\sum_{j\in\{-1,1\}}|\mathrm{split}(\theta,z,i,j)|^2 = |\mathrm{split}(\theta,z,i)|^2' class='latex' />.</p>
<p>Finally, note that for a given <img src='http://l.wordpress.com/latex.php?latex=%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\theta' title='\theta' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j' title='j' class='latex' />, the split function is simply multiplication by a complex number independent of <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='z' title='z' class='latex' /> (and similarly for the two-argument split function).</p></blockquote>
<p>First off, <img src='http://l.wordpress.com/latex.php?latex=S%5Eq&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^q' title='S^q' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}' title='\mathbb{C}' class='latex' />-linear; it therefore suffices to define <img src='http://l.wordpress.com/latex.php?latex=S%5Eq%28s%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^q(s)' title='S^q(s)' class='latex' /> for classical configurations <img src='http://l.wordpress.com/latex.php?latex=s%5Cin+E%5EP&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s\in E^P' title='s\in E^P' class='latex' />.</p>
<p>If a particle is on edge <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e' title='e' class='latex' /> labeled <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bcontrol%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{control}' title='\mathrm{control}' class='latex' />, it always passes straight through to <img src='http://l.wordpress.com/latex.php?latex=f%28e%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(e)' title='f(e)' class='latex' />.</p>
<p>If a particle is on a switch edge, the successor state will be the sum of (up to) 4 non-zero classical configurations corresponding to whether or not it stays on its loop or crosses over to the loop of the other switch edge, and whether or not it changes sign.  Let <img src='http://l.wordpress.com/latex.php?latex=s%28i%2Cj%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s(i,j)' title='s(i,j)' class='latex' /> for <img src='http://l.wordpress.com/latex.php?latex=i%2Cj%5Cin%5C%7B-1%2C1%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i,j\in\{-1,1\}' title='i,j\in\{-1,1\}' class='latex' /> denote the classical state where the particle stays on its own loop iff <img src='http://l.wordpress.com/latex.php?latex=i+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i = 1' title='i = 1' class='latex' /> and the particle keeps the same sign iff <img src='http://l.wordpress.com/latex.php?latex=j+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j = 1' title='j = 1' class='latex' />.  Then the weight given to <img src='http://l.wordpress.com/latex.php?latex=s%28i%2Cj%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s(i,j)' title='s(i,j)' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bsplit%7D%28v_%5Ctheta%2C1%2Cij%2Cj%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{split}(v_\theta,1,ij,j)' title='\mathrm{split}(v_\theta,1,ij,j)' class='latex' /> (the <img src='http://l.wordpress.com/latex.php?latex=ij&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='ij' title='ij' class='latex' /> is not a typo).  The use of <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1' title='1' class='latex' /> is just because we are assuming that we are starting from a pure classical configuration; if it&#8217;s a classical configuration time a weight <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='z' title='z' class='latex' />, the <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1' title='1' class='latex' /> would be replaced with <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='z' title='z' class='latex' />.</p>
<p>When there are <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> particles in the classical configuration <img src='http://l.wordpress.com/latex.php?latex=s&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s' title='s' class='latex' />, each is split separately; <img src='http://l.wordpress.com/latex.php?latex=S%5Eq%28s%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^q(s)' title='S^q(s)' class='latex' /> may be the sum of up to <img src='http://l.wordpress.com/latex.php?latex=4%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='4^n' title='4^n' class='latex' /> classical configurations.  Since the splitting is simply multiplication by a complex number, it doesn&#8217;t matter in what order the splittings are performed.</p>
<p>I&#8217;ll now briefly describe some quantum phenomena which can be interpreted in the quantish world.  For much more insight, meaning, and many more examples, please see Drescher&#8217;s book!</p>
<p><strong>The Two-Slit Experiment</strong></p>
<p>Suppose we have the following configuration:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/02/quantish63.png"><img class="alignnone size-full wp-image-414" title="quantish6" src="http://xorshammer.files.wordpress.com/2010/02/quantish63.png?w=500&#038;h=118" alt="" width="500" height="118" /></a></p>
<p>where the measurement angles of <img src='http://l.wordpress.com/latex.php?latex=v_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='v_0' title='v_0' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=v_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='v_1' title='v_1' class='latex' /> are equal and oblique to the measurement angle of <img src='http://l.wordpress.com/latex.php?latex=v_2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='v_2' title='v_2' class='latex' />, and the sign of the particle is positive.  Then, it is always the case that after three timesteps, the particle is in the middle rightmost edge (i.e., <img src='http://l.wordpress.com/latex.php?latex=S%5Eq&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^q' title='S^q' class='latex' /> applied three times to the initial state yields the linear combination consisting of the sum of the single classical state where the particle is in the middle rightmost edge).  It is never in the bottom rightmost edge.</p>
<p>However, if we <em>remove</em> one of the ways the particle can get to the bottom edge:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/02/quantish72.png"><img class="alignnone size-full wp-image-415" title="quantish7" src="http://xorshammer.files.wordpress.com/2010/02/quantish72.png?w=500&#038;h=152" alt="" width="500" height="152" /></a></p>
<p>now the particle arrives at the bottom rightmost edge with positive probability (i.e., nonzero weight).  (The ellipsis simply means that this edge goes somewhere else; we don&#8217;t care what happens there.)  This is because of destructive interference in the first case, which was removed in the second.</p>
<p>Suppose we try to investigate what&#8217;s going on, and <em>observe</em> if the particle is on one particular edge:</p>
<p><a href="http://xorshammer.files.wordpress.com/2010/02/quantish8.png"><img class="alignnone size-full wp-image-418" title="quantish8" src="http://xorshammer.files.wordpress.com/2010/02/quantish8.png?w=500&#038;h=203" alt="" width="500" height="203" /></a></p>
<p>What&#8217;s going on here?  Well first of all, the way we observe things is by, e.g., having the particles we want to observe interact with particles in our eye.  In this example, we&#8217;ll take the red particle at the bottom to be a particle &#8220;in our eye&#8221; and observe the blue particle by having it interact with the red.  Second of all, there are various &#8220;delay&#8221; gates throughout (at the bottom left, the delay gate is explicit, at the top, two wires are labeled &#8220;delay&#8221;, which means that they pass through a delay gate, although I haven&#8217;t drawn it).  These aren&#8217;t really significant; they&#8217;re just to synchronize things.</p>
<p>Note that this gate has exactly the same behavior as our first setup, except that we are observing when the blue particle is on the bottom edge by having it interact with the red particle (and syncing things up).  However, the results are as in the second case: with nonzero probability, the red particle appears on the bottom rightmost edge!  The observation blocks the destructive interference of the first case.</p>
<p><strong>Heisenberg Uncertainty</strong></p>
<p>Say that a particle in a quantish state is definite with respect to a measurement angle <img src='http://l.wordpress.com/latex.php?latex=%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\theta' title='\theta' class='latex' /> if passing it through a switch wire of a vertex with measurement angle <img src='http://l.wordpress.com/latex.php?latex=%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\theta' title='\theta' class='latex' /> and no other particles entering the vertex will yield the particle always emerging on one specific edge.  Heisenberg uncertainty is represented in quantish physics by the fact that whenever a particle is definite with respect to some measurement angle <img src='http://l.wordpress.com/latex.php?latex=%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\theta' title='\theta' class='latex' />, it is always indefinite with respect to measurement angles oblique to <img src='http://l.wordpress.com/latex.php?latex=%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\theta' title='\theta' class='latex' />.</p>
<p><strong>The Einstein-Podolsky-Rosen Experiment</strong></p>
<p>Unfortunately, the diagrams for this setup are beyond my poor figure-making abilities.  However, I can substitute a poor description for poor figure-making.  It is possible to entangle two quantish particles by sending both of them through gates which have related measurement angles, then setting up two further gates through which a third particle is sent.  In the first of these gates, one of the switch wires from the first particle&#8217;s measurement is the control wire, and in the second of these gates, one of the switch wires from the second particle&#8217;s measurement is the control wire.  You then observe if the third particle emerges from the second of the two gates at the same place where it entered.</p>
<p>In that case, you can then measure the two particles both with respect to any fixed angle, and you will get the same results for both.  (Let me reiterate that this was a terrible description; see Drescher&#8217;s book for more).</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/373/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/373/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/373/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/373/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/373/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/373/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/373/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/373/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/373/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/373/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/373/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/373/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/373/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/373/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=373&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2010/02/17/quantish-physics-a-discrete-model-of-quantum-physics/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/02/quantish1.png?w=300" medium="image">
			<media:title type="html">quantish</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/02/quantish2.png?w=300" medium="image">
			<media:title type="html">quantish2</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/02/quantish3.png?w=300" medium="image">
			<media:title type="html">quantish3</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/02/quantish4.png?w=300" medium="image">
			<media:title type="html">quantish4</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/02/quantish5.png?w=300" medium="image">
			<media:title type="html">quantish5</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/02/quantish63.png" medium="image">
			<media:title type="html">quantish6</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/02/quantish72.png" medium="image">
			<media:title type="html">quantish7</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2010/02/quantish8.png" medium="image">
			<media:title type="html">quantish8</media:title>
		</media:content>
	</item>
		<item>
		<title>Functions with Very Low Symmetry and the Continuum Hypothesis</title>
		<link>http://xorshammer.com/2009/07/19/functions-with-very-low-symmetry-and-the-continuum-hypothesis/</link>
		<comments>http://xorshammer.com/2009/07/19/functions-with-very-low-symmetry-and-the-continuum-hypothesis/#comments</comments>
		<pubDate>Sun, 19 Jul 2009 20:32:03 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.com/?p=353</guid>
		<description><![CDATA[A function from to is called even if for all , .  We might call it even about the point if, for all , . Conversely, we can call a function strongly non-even if for all , , . Finding strongly non-even functions is easy, as any injective function provides a trivial example.  We can [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=353&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A function from <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' /> is called <em>even</em> if for all <img src='http://l.wordpress.com/latex.php?latex=h%5Cin%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='h\in\mathbb{R}' title='h\in\mathbb{R}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f%28-h%29+%3D+f%28h%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(-h) = f(h)' title='f(-h) = f(h)' class='latex' />.  We might call it <em>even about the point</em> <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> if, for all <img src='http://l.wordpress.com/latex.php?latex=h%5Cin%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='h\in\mathbb{R}' title='h\in\mathbb{R}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f%28x-h%29+%3D+f%28x+%2Bh%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(x-h) = f(x +h)' title='f(x-h) = f(x +h)' class='latex' />.</p>
<p>Conversely, we can call a function <em>strongly non-even</em> if for all <img src='http://l.wordpress.com/latex.php?latex=x%5Cin%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in\mathbb{R}' title='x\in\mathbb{R}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=h%3E0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='h&gt;0' title='h&gt;0' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f%28x-h%29%5Cne+f%28x%2Bh%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(x-h)\ne f(x+h)' title='f(x-h)\ne f(x+h)' class='latex' />.</p>
<p>Finding strongly non-even functions is easy, as any injective function provides a trivial example.  We can make things harder for ourselves by considering only functions from <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' />.  But now, it is just as easy to show that there are no strongly non-even functions.</p>
<p>Therefore, let&#8217;s make the following definition: Let a function <img src='http://l.wordpress.com/latex.php?latex=f%5Ccolon+%5Cmathbb%7BR%7D%5Cto%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f\colon \mathbb{R}\to\mathbb{N}' title='f\colon \mathbb{R}\to\mathbb{N}' class='latex' /> be <em>non-even of order</em> <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> if, for all <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%7C%5C%7Bh%3E0%5Cmid+f%28x-h%29+%3D+f%28x+%2B+h%29%5C%7D%7C%5Cleq+n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='|\{h&gt;0\mid f(x-h) = f(x + h)\}|\leq n' title='|\{h&gt;0\mid f(x-h) = f(x + h)\}|\leq n' class='latex' />.  Thus, a strongly non-even function is non-even of order <img src='http://l.wordpress.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='0' title='0' class='latex' />, and a function being non-even of order <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> implies that it&#8217;s non-even of order <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' /> for all <img src='http://l.wordpress.com/latex.php?latex=m%5Cgeq+n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\geq n' title='m\geq n' class='latex' />.</p>
<p>In <a href="http://front.math.ucdavis.edu/math.LO/9308222">this paper</a>, the set theorists Peter Komjáth and Saharon Shelah proved:</p>
<blockquote><p>The existence of a non-even function of order 1 is equivalent to the Continuum Hypothesis (i.e., the statement that <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B%5Caleph_0%7D+%3D+%5Caleph_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2^{\aleph_0} = \aleph_1' title='2^{\aleph_0} = \aleph_1' class='latex' />).</p></blockquote>
<p>Thus, if we assume that there is a non-even function of order 1, then we can conclude that <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B%5Caleph_0%7D+%3D+%5Caleph_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2^{\aleph_0} = \aleph_1' title='2^{\aleph_0} = \aleph_1' class='latex' />.  Can we weaken the hypothesis and still conclude something interesting?  We can, as they also proved:</p>
<blockquote><p>For any <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />, if there is a non-even function of order <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B%5Caleph_0%7D%5Cleq+%5Caleph_%7B%5Clceil%5Clog_2%7B%28n%2B1%29%7D%5Crceil%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2^{\aleph_0}\leq \aleph_{\lceil\log_2{(n+1)}\rceil}' title='2^{\aleph_0}\leq \aleph_{\lceil\log_2{(n+1)}\rceil}' class='latex' />.</p></blockquote>
<p><span id="more-353"></span>They showed this by showing the following (the statement above follows directly from this, given just a bit of thought):</p>
<blockquote><p>For any vector space <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' /> over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Q}' title='\mathbb{Q}' class='latex' /> of cardinality <img src='http://l.wordpress.com/latex.php?latex=%5Caleph_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\aleph_n' title='\aleph_n' class='latex' />, and any function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f' title='f' class='latex' /> from <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' />, there is an <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in V' title='x\in V' class='latex' /> and a set <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> of unordered pairs <img src='http://l.wordpress.com/latex.php?latex=%28y%2Cz%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(y,z)' title='(y,z)' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=y%2Bz+%3D+2x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y+z = 2x' title='y+z = 2x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=f%28y%29+%3Df%28z%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(y) =f(z)' title='f(y) =f(z)' class='latex' />, where the cardinality of <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=2%5En+-+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2^n - 1' title='2^n - 1' class='latex' />.</p></blockquote>
<p>I&#8217;ll show here how to prove the weaker statement obtained by replacing <img src='http://l.wordpress.com/latex.php?latex=2%5En+-+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2^n - 1' title='2^n - 1' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bn-1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2^{n-1}' title='2^{n-1}' class='latex' />.</p>
<p>Fix <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f' title='f' class='latex' />.  We will construct a set of basis elements <img src='http://l.wordpress.com/latex.php?latex=b_1%5E0%2C%5Cldots%2Cb_n%5E0%2Cb_1%5E1%2C%5Cldots%2Cb%5E1_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b_1^0,\ldots,b_n^0,b_1^1,\ldots,b^1_1' title='b_1^0,\ldots,b_n^0,b_1^1,\ldots,b^1_1' class='latex' /> with the property that for every set <img src='http://l.wordpress.com/latex.php?latex=S%5Csubseteq+%5C%7B1%2C%5Cldots%2Cn%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S\subseteq \{1,\ldots,n\}' title='S\subseteq \{1,\ldots,n\}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f%28%5Csum_%7Bi%5Cin+S%7Db_i%5E0+%2B+%5Csum_%7Bi%5Cnotin+S%7Db_i%5E1%29+%3D+f%28%5Csum_%7Bi%5Cin+S%7Db_i%5E1+%2B+%5Csum_%7Bi%5Cnotin+S%7Db_i%5E0%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(\sum_{i\in S}b_i^0 + \sum_{i\notin S}b_i^1) = f(\sum_{i\in S}b_i^1 + \sum_{i\notin S}b_i^0)' title='f(\sum_{i\in S}b_i^0 + \sum_{i\notin S}b_i^1) = f(\sum_{i\in S}b_i^1 + \sum_{i\notin S}b_i^0)' class='latex' />.  Taking <img src='http://l.wordpress.com/latex.php?latex=2x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2x' title='2x' class='latex' /> to be <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%5Cin%5C%7B0%2C1%5C%7D%2C+j%5Cin%5C%7B1%2C%5Cldots%2Cn%5C%7D%7D+b%5Ei_j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sum_{i\in\{0,1\}, j\in\{1,\ldots,n\}} b^i_j' title='\sum_{i\in\{0,1\}, j\in\{1,\ldots,n\}} b^i_j' class='latex' />, this will provide the required number <img src='http://l.wordpress.com/latex.php?latex=2%5E%7Bn-1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2^{n-1}' title='2^{n-1}' class='latex' /> of unordered pairs.  (As a slight bit of notational convenience, if <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> is a set of basis elements, I will write <img src='http://l.wordpress.com/latex.php?latex=f%28S%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(S)' title='f(S)' class='latex' /> for <img src='http://l.wordpress.com/latex.php?latex=f%28%5Csum+S%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(\sum S)' title='f(\sum S)' class='latex' />.)</p>
<p>Let <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bc_%5Calpha%5C%7D_%7B%5Calpha%3C%5Comega_n%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{c_\alpha\}_{\alpha&lt;\omega_n}' title='\{c_\alpha\}_{\alpha&lt;\omega_n}' class='latex' /> be a basis for <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' />. For <img src='http://l.wordpress.com/latex.php?latex=1%5Cleq+i%5Cleq+n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1\leq i\leq n' title='1\leq i\leq n' class='latex' />, let <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bc_%5Calpha+%5Cmid+%5Comega_%7Bi-1%7D+%3C+%5Calpha+%3C+%5Comega_i%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{c_\alpha \mid \omega_{i-1} &lt; \alpha &lt; \omega_i\}' title='\{c_\alpha \mid \omega_{i-1} &lt; \alpha &lt; \omega_i\}' class='latex' /> be called the <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' />th slice of basis elements.  For each <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' /> between <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1' title='1' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=j%5Cin%5C%7B0%2C1%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j\in\{0,1\}' title='j\in\{0,1\}' class='latex' />, we will pick <img src='http://l.wordpress.com/latex.php?latex=b_i%5Ej&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b_i^j' title='b_i^j' class='latex' /> to be in the <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' />th slice of basis elements.</p>
<p>Given <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' />, suppose that we have defined <img src='http://l.wordpress.com/latex.php?latex=b%5E0_%7Bi%27%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b^0_{i&#039;}' title='b^0_{i&#039;}' class='latex' /> for <img src='http://l.wordpress.com/latex.php?latex=i+%2B+1+%5Cleq+i%27+%5Cleq+n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i + 1 \leq i&#039; \leq n' title='i + 1 \leq i&#039; \leq n' class='latex' /> and we will define <img src='http://l.wordpress.com/latex.php?latex=b%5E0_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b^0_i' title='b^0_i' class='latex' /> as follows: pick it so that it is in the <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' />th slice of basis elements but is not any singleton set of the form: <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bx+%5Cmid+f%28s_1%5Ccup+%5C%7Bx%5C%7D%5Ccup%5C%7Bb_%7Bi%2B1%7D%5E0%2C%5Cldots%2C+b_%7Bn%7D%5E0%5C%7D%29+%3D+m_1%2C%5Cldots%2C+f%28s_t%5Ccup+%5C%7Bx%5C%7D%5Ccup%5C%7Bb_%7Bi%2B1%7D%5E0%2C%5Cldots%2Cb_n%5E0%5C%7D%29%3Dm_t%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{x \mid f(s_1\cup \{x\}\cup\{b_{i+1}^0,\ldots, b_{n}^0\}) = m_1,\ldots, f(s_t\cup \{x\}\cup\{b_{i+1}^0,\ldots,b_n^0\})=m_t\}' title='\{x \mid f(s_1\cup \{x\}\cup\{b_{i+1}^0,\ldots, b_{n}^0\}) = m_1,\ldots, f(s_t\cup \{x\}\cup\{b_{i+1}^0,\ldots,b_n^0\})=m_t\}' class='latex' /> for any finite sets <img src='http://l.wordpress.com/latex.php?latex=s_1%2C%5Cldots%2Cs_t%5Csubset+%5Comega_%7Bi-1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s_1,\ldots,s_t\subset \omega_{i-1}' title='s_1,\ldots,s_t\subset \omega_{i-1}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=m_1%2C%5Cldots%2Cm_t%5Cin+%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m_1,\ldots,m_t\in \mathbb{N}' title='m_1,\ldots,m_t\in \mathbb{N}' class='latex' />.  Since there are only <img src='http://l.wordpress.com/latex.php?latex=%5Caleph_%7Bi-1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\aleph_{i-1}' title='\aleph_{i-1}' class='latex' /> such singleton sets, and <img src='http://l.wordpress.com/latex.php?latex=%5Caleph_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\aleph_i' title='\aleph_i' class='latex' /> elements of the <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' />th slice of basis elements, we can find such a <img src='http://l.wordpress.com/latex.php?latex=b_i%5E0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b_i^0' title='b_i^0' class='latex' />.</p>
<p>Now we define the <img src='http://l.wordpress.com/latex.php?latex=b%5E1_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b^1_i' title='b^1_i' class='latex' />.  Given <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' />, assume that <img src='http://l.wordpress.com/latex.php?latex=b%5E1_1%2C%5Cldots%2C+b%5E1_%7Bi-1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b^1_1,\ldots, b^1_{i-1}' title='b^1_1,\ldots, b^1_{i-1}' class='latex' /> have already been defined and we will define <img src='http://l.wordpress.com/latex.php?latex=b%5E1_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b^1_i' title='b^1_i' class='latex' />: pick it from the <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' />th slice so that for all subsets <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bb%5E0_1%2C+b%5E1_1%2C%5Cldots%2C+b%5E0_%7Bi-1%7D%2Cb%5E1_%7Bi-1%7D%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{b^0_1, b^1_1,\ldots, b^0_{i-1},b^1_{i-1}\}' title='\{b^0_1, b^1_1,\ldots, b^0_{i-1},b^1_{i-1}\}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f%28S+%5Ccup+%5C%7Bb%5E0_i%5C%7D%5Ccup+%5C%7Bb%5E0_%7Bi%2B1%7D%2C%5Cldots%2Cb%5E0_n%5C%7D%29+%3D+f%28S%5Ccup+%5C%7Bb%5E1_i%5C%7D%5Ccup+%5C%7Bb%5E0_%7Bi%2B1%7D%2C%5Cldots%2Cb%5E0_n%5C%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(S \cup \{b^0_i\}\cup \{b^0_{i+1},\ldots,b^0_n\}) = f(S\cup \{b^1_i\}\cup \{b^0_{i+1},\ldots,b^0_n\})' title='f(S \cup \{b^0_i\}\cup \{b^0_{i+1},\ldots,b^0_n\}) = f(S\cup \{b^1_i\}\cup \{b^0_{i+1},\ldots,b^0_n\})' class='latex' /> (this is possible by the defining condition of <img src='http://l.wordpress.com/latex.php?latex=b%5E0_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b^0_i' title='b^0_i' class='latex' />.</p>
<p>It is now easy to prove the following proposition by induction on <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' />:</p>
<blockquote><p>For any set <img src='http://l.wordpress.com/latex.php?latex=S%5Csubseteq+%5C%7B1%2C%5Cldots%2Cm%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S\subseteq \{1,\ldots,m\}' title='S\subseteq \{1,\ldots,m\}' class='latex' /> (where <img src='http://l.wordpress.com/latex.php?latex=m%5Cleq+n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\leq n' title='m\leq n' class='latex' />), <img src='http://l.wordpress.com/latex.php?latex=f%28%5C%7Bb%5E0_i+%5Cmid+i%5Cin+S%5C%7D%5Ccup+%5C%7Bb%5E1_i+%5Cmid+i%5Cnotin+S%5C%7D+%5Ccup+%5C%7Bm_1%2C%5Cldots%2C+n%5C%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(\{b^0_i \mid i\in S\}\cup \{b^1_i \mid i\notin S\} \cup \{m_1,\ldots, n\})' title='f(\{b^0_i \mid i\in S\}\cup \{b^1_i \mid i\notin S\} \cup \{m_1,\ldots, n\})' class='latex' /> equals <img src='http://l.wordpress.com/latex.php?latex=f%28%5C%7Bb%5E1_i%5Cmid+i+%5Cin+S%5C%7D%5Ccup+%5C%7Bb%5E1_i%5Cmid+i%5Cnotin+S%5C%7D%5Ccup+%5C%7Bm_1%5Cldots%2Cn%5C%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(\{b^1_i\mid i \in S\}\cup \{b^1_i\mid i\notin S\}\cup \{m_1\ldots,n\})' title='f(\{b^1_i\mid i \in S\}\cup \{b^1_i\mid i\notin S\}\cup \{m_1\ldots,n\})' class='latex' />.</p></blockquote>
<p>(The proof simply uses the defining property of the <img src='http://l.wordpress.com/latex.php?latex=b%5E1_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b^1_i' title='b^1_i' class='latex' />.)  Now, taking <img src='http://l.wordpress.com/latex.php?latex=m%3Dn&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m=n' title='m=n' class='latex' />, the result follows.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/353/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/353/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/353/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/353/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/353/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/353/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/353/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=353&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2009/07/19/functions-with-very-low-symmetry-and-the-continuum-hypothesis/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>A Suite of Cool Logic Programs</title>
		<link>http://xorshammer.com/2009/05/14/a-suite-of-cool-logic-programs/</link>
		<comments>http://xorshammer.com/2009/05/14/a-suite-of-cool-logic-programs/#comments</comments>
		<pubDate>Fri, 15 May 2009 00:24:49 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.com/?p=320</guid>
		<description><![CDATA[You may have heard about the Tarski-Seidenberg theorem, which says that the first-order theory of the reals is decidable, that the first-order theory of the complex numbers is similarly decidable, or that the first order theory of the integers without multiplication is decidable. In the course of John Harrison&#8216;s logic textbook Handbook of Practical Logic [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=320&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>You may have heard about the <a href="http://en.wikipedia.org/wiki/Tarski%E2%80%93Seidenberg_theorem">Tarski-Seidenberg theorem</a>, which says that the first-order theory of the reals is decidable, that the first-order theory of the complex numbers is similarly decidable, or that the first order theory of the integers without multiplication is <a href="http://en.wikipedia.org/wiki/Presburger_arithmetic">decidable</a>.</p>
<p>In the course of <a href="http://www.cl.cam.ac.uk/~jrh13/">John Harrison</a>&#8216;s logic textbook <a href="http://www.cambridge.org/9780521899574">Handbook of Practical Logic and Automated Reasoning</a>, all three of these algorithms (and many more) are implemented.  Furthermore, you can download and play with them for free.  (However, I still recommend checking out the book: especially if you are looking for a good textbook for a course on logic with a concrete, computational bent.)</p>
<p>Below, I&#8217;ll describe how to install the programs and try them out.  There are many more interesting functions in this suite that I haven&#8217;t described.</p>
<h4><span id="more-320"></span>Installation:</h4>
<p>The software is written in OCaml and can be run interactively in an OCaml toplevel (don&#8217;t worry, you won&#8217;t actually need to know any OCaml).  Download and install <a href="http://caml.inria.fr/download.en.html">OCaml</a> as well as its preprocessor <a href="http://pauillac.inria.fr/~ddr/camlp5/">Camlp5</a> (which is used for formatting formulas nicely).</p>
<p>Then, download the code from <a href="http://www.cl.cam.ac.uk/~jrh13/atp/index.html">here</a> (under &#8220;All the code together&#8221;) and unzip it somewhere.</p>
<p>To run it, go to wherever you unzipped it and type <code>make interactive</code> in a shell.</p>
<p>(At least, that&#8217;s what worked for me on a Mac OSX.  Other systems may be different.)</p>
<h4>The Tarski-Seidenberg Theorem.</h4>
<p>The Tarski-Seidenberg theorem implies that there is a decision procedure which, given a first-order sentence over <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' /> using plus, times, 0, and 1, will tell you if it&#8217;s true or not.  The function <code>real_qelim</code> implements this.  Let&#8217;s try it out.  (The symbol # indicates the beginning of the prompt; don&#8217;t type that, just type in what&#8217;s after it.)</p>
<p>This function knows that not all quadratic polynomials have roots, but all cubics do.</p>
<pre># real_qelim &lt;&lt;forall b c. exists x. x^2 + b*x + c = 0&gt;&gt;;;
- : fol formula = &lt;&lt;false&gt;&gt;
# real_qelim &lt;&lt;forall b c d. exists x. x^3 + b*x^2 + c*x + d = 0&gt;&gt;
  ;;
- : fol formula = &lt;&lt;true&gt;&gt;</pre>
<p>Many geometric puzzles can, in theory, be solved automatically by this function.  Unfortunately, it is too slow for most interesting ones.  Harrison notes that there are open problems about kissing numbers of high-dimensional spheres which could be solved in theory by this algorithm, although in practice it is an unworkable approach.</p>
<p>This algorithm actually does something stronger than decide the truth of first-order sentences: it does quantifier-elimination, which means that if you give it a formula with free variables, it will give you a quantifier-free formula in those same free variables (in the case of a sentence, which has no free variables, that means either the formula &#8220;true&#8221; or the formula &#8220;false&#8221;).</p>
<p>For example, if you&#8217;ve forgotten the quadratic formula and want to know what the condition is for a quadratic polynomial to have a root:</p>
<pre># real_qelim &lt;&lt;exists x. x^2 + b*x + c = 0&gt;&gt;;;
- : fol formula =
&lt;&lt;(0 + c * 4) + b * (0 + b * -1) = 0 \/
 ~(0 + c * 4) + b * (0 + b * -1) = 0 /\
 ~(0 + c * 4) + b * (0 + b * -1) &gt; 0&gt;&gt;</pre>
<p>Note that there is no claim that the formula it gives you will be completely simplified, only that it will be correct.</p>
<h4>Deciding Sentences over the Complex Numbers</h4>
<p>We can similarly use the function <code>complex_qelim</code> to do quantifier elimination over the complexes.  The fact that this possible is easier to prove than the corresponding fact for the reals, and the algorithm is similarly faster.</p>
<pre># complex_qelim &lt;&lt;forall x. x^3 = 1 ==&gt; x = 1&gt;&gt;;;
- : fol formula = &lt;&lt;false&gt;&gt;</pre>
<p>The following sentence is also true over the reals (although for a different reason than why it&#8217;s true over the complexes), but it takes significantly longer for the real quantifier elimination algorithm to decide it.</p>
<pre># complex_qelim &lt;&lt;forall x1 x2 x3. (x1^3 = 1 /\ x2^3 = 1 /\
  x3^3 = 1 /\ ~(x1 = x2) /\ ~(x1 = x3) /\ ~(x2 = x3))
  ==&gt; x1 + x2 + x3 = 0&gt;&gt;;;
- : fol formula = &lt;&lt;true&gt;&gt;</pre>
<p>Suppose we read on wikipedia that the translation of the limaçon <img src='http://l.wordpress.com/latex.php?latex=r+%3D+b+%2B+a%5Ccos%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='r = b + a\cos\theta' title='r = b + a\cos\theta' class='latex' /> to rectangular coordinates is <img src='http://l.wordpress.com/latex.php?latex=%28x%5E2+%2B+y%5E2+-+ax%29%5E2+%3D+b%5E2%28x%5E2+%2B+y%5E2%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(x^2 + y^2 - ax)^2 = b^2(x^2 + y^2)' title='(x^2 + y^2 - ax)^2 = b^2(x^2 + y^2)' class='latex' />.  We can verify this (I&#8217;ve used <code>s</code> to represent <img src='http://l.wordpress.com/latex.php?latex=%5Csin%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sin\theta' title='\sin\theta' class='latex' /> and <code>c</code> to represent <img src='http://l.wordpress.com/latex.php?latex=%5Ccos%5Ctheta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\cos\theta' title='\cos\theta' class='latex' />):</p>
<pre># complex_qelim &lt;&lt; forall r s c x y. (x^2 + y^2 = r^2
  /\ r * c = x /\ r * s = y ==&gt;
  forall a b. (r = b + a * c ==&gt;
  (x^2 + y^2 - a * x)^2 = b^2 * (x^2 + y^2)))&gt;&gt;;;
- : fol formula = &lt;&lt;true&gt;&gt;</pre>
<h4>Presburger Arithmetic</h4>
<p>Finally, first-order sentences with plus and less-than over the integers and over the natural numbers are decidable.  The relevant functions are <code>integer_qelim</code> and <code>natural_qelim</code>.  Even though multiplication of variables is prohibited, we can still multiply by constants (since for example, instead of <img src='http://l.wordpress.com/latex.php?latex=2x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2x' title='2x' class='latex' /> we could have written <img src='http://l.wordpress.com/latex.php?latex=x+%2B+x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x + x' title='x + x' class='latex' /> anyway).</p>
<p>An example Harrison gives is:  There is an old (easy) puzzle which is to show that, with 3- and 5-cent stamps, you can make an <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-cent stamp for any <img src='http://l.wordpress.com/latex.php?latex=n%5Cgeq+8&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\geq 8' title='n\geq 8' class='latex' />.</p>
<pre># natural_qelim &lt;&lt;forall n. n &gt;= 8 ==&gt;
  exists x y. 3 * x + 5 * y = n&gt;&gt;;;
- : fol formula = &lt;&lt;true&gt;&gt;</pre>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/320/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/320/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/320/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/320/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/320/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/320/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/320/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/320/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=320&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2009/05/14/a-suite-of-cool-logic-programs/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>An Interesting Puzzle in Propositional Logic</title>
		<link>http://xorshammer.com/2009/04/09/an-interesting-puzzle-in-propositional-logic/</link>
		<comments>http://xorshammer.com/2009/04/09/an-interesting-puzzle-in-propositional-logic/#comments</comments>
		<pubDate>Thu, 09 Apr 2009 13:47:48 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.com/?p=299</guid>
		<description><![CDATA[Suppose that you&#8217;re translating an ancient text, and in this text you come across three words whose meaning you are unsure of: , , and .  So, you head down to the ancient language department of your local university. The first professor you come across, , knows what means, the second professor you come across, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=299&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Suppose that you&#8217;re translating an ancient text, and in this text you come across three words whose meaning you are unsure of: <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Blot%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{lot}' title='\mathit{lot}' class='latex' />.  So, you head down to the ancient language department of your local university.</p>
<p>The first professor you come across, <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+Bal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. Bal}' title='\hbox{Prof. Bal}' class='latex' />, knows what <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' /> means, the second professor you come across, <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+Kat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. Kat}' title='\hbox{Prof. Kat}' class='latex' />, knows what <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' /> means, and the third professor you come across, <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+Lot%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. Lot}' title='\hbox{Prof. Lot}' class='latex' />, knows what <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Blot%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{lot}' title='\mathit{lot}' class='latex' /> means.  So you have fortunately solved your problem.</p>
<p>But you&#8217;re now curious and decide to meet some other professors in the department.  The next professor you come across is named <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7D%28%5Cmathrm%7BBal%7D%5Crightarrow%5Cmathrm%7BKat%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }(\mathrm{Bal}\rightarrow\mathrm{Kat})' title='\hbox{Prof. }(\mathrm{Bal}\rightarrow\mathrm{Kat})' class='latex' />.  He doesn&#8217;t know what <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' /> means or what <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' /> means, but if you told him what <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' /> meant, he would be able to tell you what <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' /> means (for example, maybe he knows that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' /> is the noun form of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' />).</p>
<p>The next professor you meet is named <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7D%28%5Cmathrm%7BBal%7D%5Crightarrow%28%5Cmathrm%7BKat%7D%5Crightarrow%5Cmathrm%7BLot%7D%29%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }(\mathrm{Bal}\rightarrow(\mathrm{Kat}\rightarrow\mathrm{Lot}))' title='\hbox{Prof. }(\mathrm{Bal}\rightarrow(\mathrm{Kat}\rightarrow\mathrm{Lot}))' class='latex' />.  If you told him what <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' /> meant and what <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' /> meant, he would be able to tell you what <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Blot%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{lot}' title='\mathit{lot}' class='latex' /> means.</p>
<p>The next professor you meet is named <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7D%28%28%5Cmathrm%7BBal%7D%5Crightarrow%5Cmathrm%7BKat%7D%29%5Crightarrow%5Cmathrm%7BLot%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }((\mathrm{Bal}\rightarrow\mathrm{Kat})\rightarrow\mathrm{Lot})' title='\hbox{Prof. }((\mathrm{Bal}\rightarrow\mathrm{Kat})\rightarrow\mathrm{Lot})' class='latex' />.  If you told him a <em>method</em> for finding out the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' /> given the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' />, he would be able to tell you the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Blot%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{lot}' title='\mathit{lot}' class='latex' />.</p>
<p>In general, for any two professors <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7DP&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }P' title='\hbox{Prof. }P' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7DQ&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }Q' title='\hbox{Prof. }Q' class='latex' />, there is a professor <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7DP%5Crightarrow+Q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }P\rightarrow Q' title='\hbox{Prof. }P\rightarrow Q' class='latex' /> with the property that if you told him what <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7DP&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }P' title='\hbox{Prof. }P' class='latex' /> knew, he would be able to tell you what <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7DQ&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }Q' title='\hbox{Prof. }Q' class='latex' /> knows (but <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7DP%5Crightarrow+Q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }P\rightarrow Q' title='\hbox{Prof. }P\rightarrow Q' class='latex' /> doesn&#8217;t know any more than that).</p>
<p>Notice that some professors have essentially the same state of knowledge.  For example, <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+Bal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. Bal}' title='\hbox{Prof. Bal}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7D%28%28%5Cmathrm%7BKat%7D%5Crightarrow%5Cmathrm%7BKat%7D%29%5Crightarrow%5Cmathrm%7BBal%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }((\mathrm{Kat}\rightarrow\mathrm{Kat})\rightarrow\mathrm{Bal})' title='\hbox{Prof. }((\mathrm{Kat}\rightarrow\mathrm{Kat})\rightarrow\mathrm{Bal})' class='latex' /> have essentially the same knowledge, since to get the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' /> out of <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7D%28%28%5Cmathrm%7BKat%7D%5Crightarrow%5Cmathrm%7BKat%7D%29%5Crightarrow%5Cmathrm%7BBal%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }((\mathrm{Kat}\rightarrow\mathrm{Kat})\rightarrow\mathrm{Bal})' title='\hbox{Prof. }((\mathrm{Kat}\rightarrow\mathrm{Kat})\rightarrow\mathrm{Bal})' class='latex' /> you only have to tell him a method for finding out the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' /> given the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' />, which is something that you can do without any particular special knowledge concerning <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Blot%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{lot}' title='\mathit{lot}' class='latex' />.</p>
<p>A more nontrivial example is that <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7D%28%28%28%5Cmathrm%7BBal%7D%5Crightarrow%5Cmathrm%7BKat%7D%29%5Crightarrow%5Cmathrm%7BBal%7D%29%5Crightarrow%5Cmathrm%7BKat%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }(((\mathrm{Bal}\rightarrow\mathrm{Kat})\rightarrow\mathrm{Bal})\rightarrow\mathrm{Kat})' title='\hbox{Prof. }(((\mathrm{Bal}\rightarrow\mathrm{Kat})\rightarrow\mathrm{Bal})\rightarrow\mathrm{Kat})' class='latex' /> has the same state of knowledge as <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7D%28%5Cmathrm%7BBal%7D%5Crightarrow%5Cmathrm%7BKat%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }(\mathrm{Bal}\rightarrow\mathrm{Kat})' title='\hbox{Prof. }(\mathrm{Bal}\rightarrow\mathrm{Kat})' class='latex' />.  This is because each can &#8220;simulate&#8221; the other.  In one direction, suppose somebody told <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7D%28%28%28%5Cmathrm%7BBal%7D%5Crightarrow%5Cmathrm%7BKat%7D%29%5Crightarrow%5Cmathrm%7BBal%7D%29%5Crightarrow%5Cmathrm%7BKat%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }(((\mathrm{Bal}\rightarrow\mathrm{Kat})\rightarrow\mathrm{Bal})\rightarrow\mathrm{Kat})' title='\hbox{Prof. }(((\mathrm{Bal}\rightarrow\mathrm{Kat})\rightarrow\mathrm{Bal})\rightarrow\mathrm{Kat})' class='latex' /> the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' />.  He therefore knows a trivial &#8220;method&#8221; for getting the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' /> given any inputs, and so he knows the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' />.  In the other direction, suppose we told <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7D%28%5Cmathrm%7BBal%7D%5Crightarrow%5Cmathrm%7BKat%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }(\mathrm{Bal}\rightarrow\mathrm{Kat})' title='\hbox{Prof. }(\mathrm{Bal}\rightarrow\mathrm{Kat})' class='latex' /> a method for turning (methods for turning the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' /> into the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' />) into the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' />.  Well, <img src='http://l.wordpress.com/latex.php?latex=%5Chbox%7BProf.+%7D%28%5Cmathrm%7BBal%7D%5Crightarrow%5Cmathrm%7BKat%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\hbox{Prof. }(\mathrm{Bal}\rightarrow\mathrm{Kat})' title='\hbox{Prof. }(\mathrm{Bal}\rightarrow\mathrm{Kat})' class='latex' /> knows a method for turning the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' /> into the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' />, so he can use that find the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bbal%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{bal}' title='\mathit{bal}' class='latex' />.  He can then use his method a second time to turn that into the meaning of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathit%7Bkat%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathit{kat}' title='\mathit{kat}' class='latex' />.</p>
<p>The puzzle is then to prove that there are only finitely many professors with different states of knowledge.</p>
<p>This puzzle is equivalent to showing that intuitionistic implicational propositional logic over three variables has only finitely many logically inequivalent formulas.  Another formulation is: Let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}_n' title='\mathbb{C}_n' class='latex' /> be the free cartesian closed category over <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> objects.  Given objects <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=B%5Cin+%5Cmathbb%7BC%7D_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B\in \mathbb{C}_n' title='B\in \mathbb{C}_n' class='latex' />, say that <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> are equivalent if there is an arrow from <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> and an arrow from <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' />.  Then there are only finitely many equivalence classes in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D_3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}_3' title='\mathbb{C}_3' class='latex' />.  (The corresponding statements are also true with <img src='http://l.wordpress.com/latex.php?latex=3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='3' title='3' class='latex' /> replaced by <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />.)</p>
<p><span id="more-299"></span>This fact was first proved using algebraic methods by Arturo Diego in his Ph.D. thesis in the 1940&#8242;s.  It was subsequently reproved using semantic methods by various people including Nicolaas de Bruijn and Alasdair Urquhart.  A good overview of those results is in <a href="http://staff.science.uva.nl/~lhendrik/publications/DS199601.pdf">Lex Hendriks&#8217;s thesis</a>.  I proved it in a combinatorial way as part of <a href="http://www.math.cornell.edu/~shore/theses/MichaelSingle.pdf">my thesis</a>.</p>
<p>To rigorously state the problem: Let <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' /> be a finite set of propositional variables, and let <img src='http://l.wordpress.com/latex.php?latex=F&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='F' title='F' class='latex' /> be the smallest set containing <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' /> and such that if formulas <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cpsi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\psi' title='\psi' class='latex' /> are in <img src='http://l.wordpress.com/latex.php?latex=F&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='F' title='F' class='latex' />, then the formula <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%5Crightarrow%5Cpsi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi\rightarrow\psi' title='\phi\rightarrow\psi' class='latex' /> is in <img src='http://l.wordpress.com/latex.php?latex=F&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='F' title='F' class='latex' />.  We define a relation <img src='http://l.wordpress.com/latex.php?latex=%5Cvdash&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\vdash' title='\vdash' class='latex' /> between sets of formulas in <img src='http://l.wordpress.com/latex.php?latex=F&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='F' title='F' class='latex' /> and formulas in <img src='http://l.wordpress.com/latex.php?latex=F&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='F' title='F' class='latex' /> as follows:  We will let <img src='http://l.wordpress.com/latex.php?latex=%5Cvdash&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\vdash' title='\vdash' class='latex' /> be the smallest relation such that, for any <img src='http://l.wordpress.com/latex.php?latex=%5CGamma%5Csubset+F&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Gamma\subset F' title='\Gamma\subset F' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%2C%5Cpsi%5Cin+F&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi,\psi\in F' title='\phi,\psi\in F' class='latex' />:</p>
<ol>
<li><img src='http://l.wordpress.com/latex.php?latex=%5CGamma%5Ccup%5C%7B%5Cphi%5C%7D%5Cvdash%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Gamma\cup\{\phi\}\vdash\phi' title='\Gamma\cup\{\phi\}\vdash\phi' class='latex' />.</li>
<li>If <img src='http://l.wordpress.com/latex.php?latex=%5CGamma%27%5Csupset+%5CGamma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Gamma&#039;\supset \Gamma' title='\Gamma&#039;\supset \Gamma' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5CGamma%5Cvdash+%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Gamma\vdash \phi' title='\Gamma\vdash \phi' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=%5CGamma%27%5Cvdash+%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Gamma&#039;\vdash \phi' title='\Gamma&#039;\vdash \phi' class='latex' />.</li>
<li>If <img src='http://l.wordpress.com/latex.php?latex=%5CGamma%5Cvdash+%5Cphi%5Crightarrow%5Cpsi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Gamma\vdash \phi\rightarrow\psi' title='\Gamma\vdash \phi\rightarrow\psi' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5CGamma%5Cvdash%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Gamma\vdash\phi' title='\Gamma\vdash\phi' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=%5CGamma%5Cvdash+%5Cpsi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Gamma\vdash \psi' title='\Gamma\vdash \psi' class='latex' />.</li>
<li>If <img src='http://l.wordpress.com/latex.php?latex=%5CGamma%5Ccup%5C%7B%5Cphi%5C%7D%5Cvdash+%5Cpsi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Gamma\cup\{\phi\}\vdash \psi' title='\Gamma\cup\{\phi\}\vdash \psi' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=%5CGamma%5Cvdash+%5Cphi%5Crightarrow%5Cpsi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Gamma\vdash \phi\rightarrow\psi' title='\Gamma\vdash \phi\rightarrow\psi' class='latex' />.</li>
</ol>
<p>The relation <img src='http://l.wordpress.com/latex.php?latex=%5Cvdash&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\vdash' title='\vdash' class='latex' /> formalizes the notion of a formula being provable given a set of hypotheses.</p>
<p>We say that <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cpsi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\psi' title='\psi' class='latex' /> are equivalent if <img src='http://l.wordpress.com/latex.php?latex=%5C%7B%5Cphi%5C%7D%5Cvdash%5Cpsi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{\phi\}\vdash\psi' title='\{\phi\}\vdash\psi' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5C%7B%5Cpsi%5C%7D%5Cvdash%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{\psi\}\vdash\phi' title='\{\psi\}\vdash\phi' class='latex' />, and the proposition is then that there are only finitely many equivalence classes in <img src='http://l.wordpress.com/latex.php?latex=F&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='F' title='F' class='latex' />.</p>
<p>Unfortunately, a full solution using no other machinery (at least the one that I came up with) is a bit too notationally cumbersome for a blog entry, but the puzzle is by no means inaccessible to someone with no other knowledge of logic.</p>
<p>In any case, I will sketch the solution in an important special case: that of formulas which are <em>left-associated</em>, i.e., of the form <img src='http://l.wordpress.com/latex.php?latex=%28%5Ccdots%28%28%28A_1%5Crightarrow+A_2%29%5Crightarrow+A_3%29%5Crightarrow+A_4%29%5Ccdots%5Crightarrow+A_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\cdots(((A_1\rightarrow A_2)\rightarrow A_3)\rightarrow A_4)\cdots\rightarrow A_n)' title='(\cdots(((A_1\rightarrow A_2)\rightarrow A_3)\rightarrow A_4)\cdots\rightarrow A_n)' class='latex' /> where each <img src='http://l.wordpress.com/latex.php?latex=A_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_i' title='A_i' class='latex' /> is a propositional variable.  Since we know how to parenthesize such formulas, we can write them simply in the form <img src='http://l.wordpress.com/latex.php?latex=A_1A_2A_3A_4%5Ccdots+A_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_1A_2A_3A_4\cdots A_n' title='A_1A_2A_3A_4\cdots A_n' class='latex' />.</p>
<p>The crucial insight is the following:</p>
<blockquote><p>Let <img src='http://l.wordpress.com/latex.php?latex=A_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_i' title='A_i' class='latex' /> be a propositional variable for <img src='http://l.wordpress.com/latex.php?latex=1%5Cleq+i%5Cleq+n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1\leq i\leq n' title='1\leq i\leq n' class='latex' />.  Let <img src='http://l.wordpress.com/latex.php?latex=i%3Cj%3Ck&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i&lt;j&lt;k' title='i&lt;j&lt;k' class='latex' /> and suppose that <img src='http://l.wordpress.com/latex.php?latex=A_i%3DA_k&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_i=A_k' title='A_i=A_k' class='latex' />.  Then the formulas <img src='http://l.wordpress.com/latex.php?latex=A_iA_1A_2%5Ccdots+A_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_iA_1A_2\cdots A_n' title='A_iA_1A_2\cdots A_n' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=A_jA_1A_2%5Ccdots+A_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_jA_1A_2\cdots A_n' title='A_jA_1A_2\cdots A_n' class='latex' /> are equivalent.</p></blockquote>
<p>To see why this is so, imagine that you are trying to prove some <img src='http://l.wordpress.com/latex.php?latex=A_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_i' title='A_i' class='latex' /> and you have some hypothesis <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='P' title='P' class='latex' />.   What does <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='P' title='P' class='latex' /> allow you to do?</p>
<ul>
<li>If <img src='http://l.wordpress.com/latex.php?latex=P%3DA_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='P=A_i' title='P=A_i' class='latex' />, it allows you to complete the proof immediately.</li>
<li>If <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='P' title='P' class='latex' /> is of the form <img src='http://l.wordpress.com/latex.php?latex=A_j%5Crightarrow+A_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_j\rightarrow A_i' title='A_j\rightarrow A_i' class='latex' />, then it allows you to change the goal from <img src='http://l.wordpress.com/latex.php?latex=A_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_i' title='A_i' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=A_j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_j' title='A_j' class='latex' />.</li>
<li>If <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='P' title='P' class='latex' /> is of the form <img src='http://l.wordpress.com/latex.php?latex=%28Q%5Crightarrow+A_j%29%5Crightarrow+A_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(Q\rightarrow A_j)\rightarrow A_i' title='(Q\rightarrow A_j)\rightarrow A_i' class='latex' />, then it allows you to change the goal from <img src='http://l.wordpress.com/latex.php?latex=A_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_i' title='A_i' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=A_j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_j' title='A_j' class='latex' /> and give yourself <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='Q' title='Q' class='latex' /> as a hypothesis.</li>
</ul>
<p>In the third case above, suppose that <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='Q' title='Q' class='latex' /> is of the form <img src='http://l.wordpress.com/latex.php?latex=%28R%5Crightarrow+A_k%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(R\rightarrow A_k)' title='(R\rightarrow A_k)' class='latex' />.  The only way that <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='Q' title='Q' class='latex' /> can be of use is if you have some way to change the goal from <img src='http://l.wordpress.com/latex.php?latex=A_j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_j' title='A_j' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=A_k&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_k' title='A_k' class='latex' />.  In general, if you have <img src='http://l.wordpress.com/latex.php?latex=BA_1%5Ccdots+A_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='BA_1\cdots A_n' title='BA_1\cdots A_n' class='latex' /> as a hypothesis, if you get to use <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> you must have a method for turning the goal from <img src='http://l.wordpress.com/latex.php?latex=A_%7Bi%2B1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_{i+1}' title='A_{i+1}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=A_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_i' title='A_i' class='latex' /> for <img src='http://l.wordpress.com/latex.php?latex=1%5Cleq+i%5Cleq+n-1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1\leq i\leq n-1' title='1\leq i\leq n-1' class='latex' />.  By assumption in the boxed statement, we therefore have a method for turning the goal from <img src='http://l.wordpress.com/latex.php?latex=A_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_i' title='A_i' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=A_j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_j' title='A_j' class='latex' /> and vice versa. So <img src='http://l.wordpress.com/latex.php?latex=A_iA_1%5Ccdots+A_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_iA_1\cdots A_n' title='A_iA_1\cdots A_n' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=A_jA_1%5Cldots+A_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_jA_1\ldots A_n' title='A_jA_1\ldots A_n' class='latex' /> are equivalent.  (Actually, what this argument shows is that they can be used in equivalent ways as hypotheses in a proof, which turns out to be enough.)</p>
<p>Now also observe that</p>
<blockquote><p>For any formula <img src='http://l.wordpress.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='P' title='P' class='latex' /> and variable <img src='http://l.wordpress.com/latex.php?latex=A_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A_i' title='A_i' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%28%28%28P%5Crightarrow+A_i%29%5Crightarrow+A_i%29%5Crightarrow+A_i%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(((P\rightarrow A_i)\rightarrow A_i)\rightarrow A_i)' title='(((P\rightarrow A_i)\rightarrow A_i)\rightarrow A_i)' class='latex' /> is equivalent to <img src='http://l.wordpress.com/latex.php?latex=P%5Crightarrow+A_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='P\rightarrow A_i' title='P\rightarrow A_i' class='latex' />.</p></blockquote>
<p>From the two boxed facts, it pretty easily follows that every left-associated formula <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> is equivalent to one of length at most <img src='http://l.wordpress.com/latex.php?latex=3n%5E3+%2B2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='3n^3 +2' title='3n^3 +2' class='latex' /> (where <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> is the number of propositional variables):  Suppose that <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> is equivalent to no shorter formula.  Chopping <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> up into <img src='http://l.wordpress.com/latex.php?latex=%5Clfloor+%7C%5Cphi%7C%2F3%5Crfloor&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\lfloor |\phi|/3\rfloor' title='\lfloor |\phi|/3\rfloor' class='latex' /> triplets, I claim that no two triplets are the same: If so, and a triplet consisted of all the same variable, we could apply the second boxed fact to get a shorter formula.  Otherwise, we can apply the first.  Since there are only <img src='http://l.wordpress.com/latex.php?latex=n%5E3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n^3' title='n^3' class='latex' /> distinct triplets, that gives a length of at most <img src='http://l.wordpress.com/latex.php?latex=3n%5E3+%2B+2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='3n^3 + 2' title='3n^3 + 2' class='latex' />.</p>
<p>Therefore, there are at most <img src='http://l.wordpress.com/latex.php?latex=n%5E%7B3n%5E3%2B2%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n^{3n^3+2}' title='n^{3n^3+2}' class='latex' /> left-associated formulas.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/299/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/299/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/299/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/299/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/299/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/299/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/299/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/299/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/299/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/299/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/299/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/299/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/299/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/299/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=299&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2009/04/09/an-interesting-puzzle-in-propositional-logic/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>What Happens When You Iterate Gödel&#8217;s Theorem?</title>
		<link>http://xorshammer.com/2009/03/23/what-happens-when-you-iterate-godels-theorem/</link>
		<comments>http://xorshammer.com/2009/03/23/what-happens-when-you-iterate-godels-theorem/#comments</comments>
		<pubDate>Tue, 24 Mar 2009 02:38:30 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=278</guid>
		<description><![CDATA[Let be Peano Arithmetic.  Gödel&#8217;s Second Incompleteness Theorem says that no consistent theory extending can prove its own consistency. (I&#8217;ll write for the statement asserting &#8216;s consistency; more on this later.) In particular, is stronger than .  But certainly, given that we believe that everything proves is true, we believe that does not prove a [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=278&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> be Peano Arithmetic.  Gödel&#8217;s Second Incompleteness Theorem says that no consistent theory <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> extending <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> can prove its own consistency. (I&#8217;ll write <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BCon%7D%28T%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{Con}(T)' title='\mathrm{Con}(T)' class='latex' /> for the statement asserting <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' />&#8216;s consistency; more on this later.)</p>
<p>In particular, <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D+%2B+%5Cmathrm%7BCon%7D%28%5Cmathrm%7BPA%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA} + \mathrm{Con}(\mathrm{PA})' title='\mathrm{PA} + \mathrm{Con}(\mathrm{PA})' class='latex' /> is stronger than <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />.  But certainly, given that we believe that everything <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> proves is true, we believe that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> does not prove a contradiction, and hence is consistent.  Thus, we believe that everything that <img src='http://l.wordpress.com/latex.php?latex=%28%5Cmathrm%7BPA%7D+%2B+%5Cmathrm%7BCon%7D%28%5Cmathrm%7BPA%7D%29%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\mathrm{PA} + \mathrm{Con}(\mathrm{PA}))' title='(\mathrm{PA} + \mathrm{Con}(\mathrm{PA}))' class='latex' /> proves is true.  But by a similar argument, we believe that everything that <img src='http://l.wordpress.com/latex.php?latex=%28%5Cmathrm%7BPA%7D+%2B+%5Cmathrm%7BCon%7D%28%5Cmathrm%7BPA%7D+%2B+%5Cmathrm%7BCon%7D%28%5Cmathrm%7BPA%7D%29%29%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\mathrm{PA} + \mathrm{Con}(\mathrm{PA} + \mathrm{Con}(\mathrm{PA})))' title='(\mathrm{PA} + \mathrm{Con}(\mathrm{PA} + \mathrm{Con}(\mathrm{PA})))' class='latex' /> proves is true.  Where does this stop?  Once we believe that everything <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> proves is true, what, exactly, are we committed to believing?</p>
<p><span id="more-278"></span>This is from Chapters 13&#8211;15 of Torkel Franzén&#8217;s book <a href="http://www.sm.luth.se/~torkel/eget/progps.html">Inexhaustibility</a>, which is admirably clear and well-written.</p>
<p>First off, let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_0' title='\mathrm{PA}_0' class='latex' /> be <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />, and let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_%7Bn%2B1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_{n+1}' title='\mathrm{PA}_{n+1}' class='latex' /> be <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_n%2B%5Cmathrm%7BCon%7D%28%5Cmathrm%7BPA%7D_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_n+\mathrm{Con}(\mathrm{PA}_n)' title='\mathrm{PA}_n+\mathrm{Con}(\mathrm{PA}_n)' class='latex' />.  By the considerations above, we accept that each <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_n' title='\mathrm{PA}_n' class='latex' /> is sound.  (A theory being sound means that everything it proves is true.)</p>
<p>So, if we therefore let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_%5Comega+%3D+%5Cbigcup_%7Bn%5Cin%5Cmathbb%7BN%7D%7D%5Cmathrm%7BPA%7D_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_\omega = \bigcup_{n\in\mathbb{N}}\mathrm{PA}_n' title='\mathrm{PA}_\omega = \bigcup_{n\in\mathbb{N}}\mathrm{PA}_n' class='latex' />, then we accept that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_\omega' title='\mathrm{PA}_\omega' class='latex' /> is sound.  We could therefore define <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_%7B%5Comega+%2B+1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_{\omega + 1}' title='\mathrm{PA}_{\omega + 1}' class='latex' /> to be <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_%5Comega+%2B+%5Cmathrm%7BCon%7D%28%5Cmathrm%7BPA%7D_%5Comega%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_\omega + \mathrm{Con}(\mathrm{PA}_\omega)' title='\mathrm{PA}_\omega + \mathrm{Con}(\mathrm{PA}_\omega)' class='latex' /> and we would have to accept that as sound as well, but in making this definition we run up against our first snag.</p>
<p>The snag is this: In order to express a sentence of the form <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BCon%7D%28T%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{Con}(T)' title='\mathrm{Con}(T)' class='latex' /> in the language of number theory, we much choose some recursively enumerable presentation of <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' />, and which recursively enumerable presentation we choose matters.  For example, if we add to any given presentation of <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> the stipulation that we are adding, for all <img src='http://l.wordpress.com/latex.php?latex=x%2Cy%2Cz+%3E+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x,y,z &gt; 0' title='x,y,z &gt; 0' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n+%3E+2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n &gt; 2' title='n &gt; 2' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=x%5En+%2B+y%5En+%3D+z%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x^n + y^n = z^n' title='x^n + y^n = z^n' class='latex' /> the axiom <img src='http://l.wordpress.com/latex.php?latex=0+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='0 = 1' title='0 = 1' class='latex' />, then we haven&#8217;t actually added any new axioms, but if we construct the statement <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BCon%7D%28T%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{Con}(T)' title='\mathrm{Con}(T)' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> presented the second way, <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BCon%7D%28T%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{Con}(T)' title='\mathrm{Con}(T)' class='latex' /> will imply Fermat&#8217;s Last Theorem, while <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BCon%7D%28T%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{Con}(T)' title='\mathrm{Con}(T)' class='latex' /> constructed from the original presentation of <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> may not.</p>
<p>As you might guess from the above, we are going to want to construct <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_\alpha' title='\mathrm{PA}_\alpha' class='latex' /> for ordinals <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' />.  When <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> is a successor ordinal, it is clear how to get a &#8220;reasonable&#8221; presentation of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_\alpha' title='\mathrm{PA}_\alpha' class='latex' /> from a reasonable presentation of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_%5Cbeta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_\beta' title='\mathrm{PA}_\beta' class='latex' /> (where <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta+%2B+1+%3D+%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta + 1 = \alpha' title='\beta + 1 = \alpha' class='latex' />), but if <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> is a limit ordinal, in general it won&#8217;t be clear (although it is clear for <img src='http://l.wordpress.com/latex.php?latex=%5Calpha+%3D+%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha = \omega' title='\alpha = \omega' class='latex' />).</p>
<p>So how can we solve this problem?</p>
<p>The first step is to use a more computable representation of ordinals, namely ordinal <em>notations. </em>An ordinal notation is a number <img src='http://l.wordpress.com/latex.php?latex=n%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in\mathbb{N}' title='n\in\mathbb{N}' class='latex' /> with the following property: It is either 0, or for every <img src='http://l.wordpress.com/latex.php?latex=m%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\in\mathbb{N}' title='m\in\mathbb{N}' class='latex' />, the output of the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />th Turing machine on input <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' /> is an ordinal notation.  (What this recursive definition really means is that the set of ordinal notations is the smallest set satisfying the above property.)</p>
<p>Given an ordinal notation <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />, we let the ordinal it represents, <img src='http://l.wordpress.com/latex.php?latex=%7Cn%7C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='|n|' title='|n|' class='latex' />, be defined by <img src='http://l.wordpress.com/latex.php?latex=%7C0%7C+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='|0| = 0' title='|0| = 0' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%7Cn%7C+%3D+%5Csup_%7Bm%5Cin%5Cmathbb%7BN%7D%7D+%28%7C%5Cphi_n%28m%29%7C%2B1%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='|n| = \sup_{m\in\mathbb{N}} (|\phi_n(m)|+1)' title='|n| = \sup_{m\in\mathbb{N}} (|\phi_n(m)|+1)' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_n%28m%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_n(m)' title='\phi_n(m)' class='latex' /> denotes the output of the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />th Turing machine on input <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' />.</p>
<p>We can now uniformly pick presentations of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_a' title='\mathrm{PA}_a' class='latex' /> for ordinal notations <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> by letting <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_0+%3D+%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_0 = \mathrm{PA}' title='\mathrm{PA}_0 = \mathrm{PA}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_a' title='\mathrm{PA}_a' class='latex' /> be presented as the union of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_%7B%5Cphi_a%28m%29%7D+%2B+%5Cmathrm%7BCon%7D%28%5Cmathrm%7BPA%7D_%7B%5Cphi_a%28m%29%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_{\phi_a(m)} + \mathrm{Con}(\mathrm{PA}_{\phi_a(m)})' title='\mathrm{PA}_{\phi_a(m)} + \mathrm{Con}(\mathrm{PA}_{\phi_a(m)})' class='latex' /> over <img src='http://l.wordpress.com/latex.php?latex=m%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\in\mathbb{N}' title='m\in\mathbb{N}' class='latex' />, where the consistency statements are constructed using the presentations given by induction.</p>
<p>Unfortunately, this doesn&#8217;t prevent us from doing the trick mentioned above: For any true <img src='http://l.wordpress.com/latex.php?latex=%5CPi_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Pi_1' title='\Pi_1' class='latex' /> sentence <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' />, there is an ordinal notation <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%7Ca%7C+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='|a| = 1' title='|a| = 1' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_a' title='\mathrm{PA}_a' class='latex' /> proves <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' />.  The catch is that <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> will be quite an unusual notation for 1, and we&#8217;re not really justified in taking <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D_a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}_a' title='\mathrm{PA}_a' class='latex' /> to be a consistency extension of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> because <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> doesn&#8217;t &#8220;know&#8221; that <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> is an ordinal notation.</p>
<p>However, we can make a reasonable definition for what it means for <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> (or any extension) to prove that a number <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> is an ordinal notation.  (This is actually not trivial, since <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> can only talk about numbers, but the set of ordinal notations was defined to be the least <em>set</em> satisfying a certain property.)  We can then define an <em>autonomous</em> consistency extension of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> as follows: <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> is an autonomous consistency extension of itself, and if <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> is an autonomous consistency extension of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> proves that <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> is an ordinal notation, then <img src='http://l.wordpress.com/latex.php?latex=T_a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T_a' title='T_a' class='latex' /> is an autonomous consistency extension of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />.</p>
<p>The autonomous consistency extensions of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> have some claim to being exactly those that we recognize to be consistency extensions of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> solely on the basis that we accept <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />.  But that isn&#8217;t really completely satisfying.  There&#8217;s nothing stopping us from letting <img src='http://l.wordpress.com/latex.php?latex=T_A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T_A' title='T_A' class='latex' /> be the union of the autonomous consistency extensions of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> and considering <img src='http://l.wordpress.com/latex.php?latex=T_A+%2B+%5Cmathrm%7BCon%7D%28T_A%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T_A + \mathrm{Con}(T_A)' title='T_A + \mathrm{Con}(T_A)' class='latex' />.  Similarly, we got the set of autonomous consistency extensions of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> by starting with <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> and then closing under <em>finite</em> applications of a particular operation, but we could also have considered transfinite applications of that operation.</p>
<p>Does there exist a theory (which we believe is true) which will prove anything <em>any </em>reasonable iterated consistency extension of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> proves?  It turns out there is.  Let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D%5E%7B%5CSigma-REF%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}^{\Sigma-REF}' title='\mathrm{PA}^{\Sigma-REF}' class='latex' /> be the theory obtained by adding to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> the axiom that for any <img src='http://l.wordpress.com/latex.php?latex=%5CSigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma_1' title='\Sigma_1' class='latex' /> sentence <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> (that is, sentence of the form <img src='http://l.wordpress.com/latex.php?latex=%5Cexists+x_1%5Ccdots+%5Cexists+x_n+%5Cpsi%28x_1%2C%5Cldots%2C+x_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\exists x_1\cdots \exists x_n \psi(x_1,\ldots, x_n)' title='\exists x_1\cdots \exists x_n \psi(x_1,\ldots, x_n)' class='latex' /> where all of <img src='http://l.wordpress.com/latex.php?latex=%5Cpsi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\psi' title='\psi' class='latex' />&#8216;s quantifiers are bounded), if <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> proves <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> is true.</p>
<p>This property is called <img src='http://l.wordpress.com/latex.php?latex=%5CSigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma_1' title='\Sigma_1' class='latex' />-soundness, and the axiom formalizing it is called a reflection axiom.  If <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5CSigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma_1' title='\Sigma_1' class='latex' />-sound, then so is <img src='http://l.wordpress.com/latex.php?latex=T+%2B+%5Cmathrm%7BCon%7D%28T%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T + \mathrm{Con}(T)' title='T + \mathrm{Con}(T)' class='latex' />, since if <img src='http://l.wordpress.com/latex.php?latex=T+%2B+%5Cmathrm%7BCon%7D%28T%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T + \mathrm{Con}(T)' title='T + \mathrm{Con}(T)' class='latex' /> proved a false <img src='http://l.wordpress.com/latex.php?latex=%5CSigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma_1' title='\Sigma_1' class='latex' /> statement <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> would prove the false <img src='http://l.wordpress.com/latex.php?latex=%5CSigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma_1' title='\Sigma_1' class='latex' /> statement <img src='http://l.wordpress.com/latex.php?latex=%5Cneg+%5Cmathrm%7BCon%7D%28T%29+%5Cvee+%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\neg \mathrm{Con}(T) \vee \phi' title='\neg \mathrm{Con}(T) \vee \phi' class='latex' /> (false because <img src='http://l.wordpress.com/latex.php?latex=%5CSigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma_1' title='\Sigma_1' class='latex' />-soundness implies consistency).  Similarly, any union of a chain of <img src='http://l.wordpress.com/latex.php?latex=%5CSigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma_1' title='\Sigma_1' class='latex' />-sound theories must be <img src='http://l.wordpress.com/latex.php?latex=%5CSigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma_1' title='\Sigma_1' class='latex' />-sound.</p>
<p>Because we can formalize the above argument in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D%5E%7B%5CSigma-REF%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}^{\Sigma-REF}' title='\mathrm{PA}^{\Sigma-REF}' class='latex' /> proves that every autonomous consistency extension of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5CSigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma_1' title='\Sigma_1' class='latex' />-sound.  Therefore, it proves that every autonomous consistency extension of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> is consistent.  Therefore (since essentially autonomous consistency extensions of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> say nothing besides the fact that lower autonomous consistency extensions are consistent), <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D%5E%7B%5CSigma-REF%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}^{\Sigma-REF}' title='\mathrm{PA}^{\Sigma-REF}' class='latex' /> extends each autonomous consistency extension of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />.</p>
<p>Okay, so adding axioms asserting that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5CSigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma_1' title='\Sigma_1' class='latex' />-sound takes us beyond all the autonomous consistency extensions of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />.  But what happens if add to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D%5E%7B%5CSigma-REF%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}^{\Sigma-REF}' title='\mathrm{PA}^{\Sigma-REF}' class='latex' /> axioms asserting that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D%5E%7B%5CSigma-REF%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}^{\Sigma-REF}' title='\mathrm{PA}^{\Sigma-REF}' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5CSigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma_1' title='\Sigma_1' class='latex' />-sound?  This is called a <em>reflection </em>extension, and we can form autonomous iterated reflection extensions just as we can autonomous iterated consistency extensions.</p>
<p>Is there any theory (which we believe is true) which goes beyond all the autonomous reflection extensions the same way that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D%5E%7B%5CSigma-REF%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}^{\Sigma-REF}' title='\mathrm{PA}^{\Sigma-REF}' class='latex' /> goes beyond all the autonomous consistency extensions of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />?  There is. The theory <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D%5E%7B%5CSigma-REF%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}^{\Sigma-REF}' title='\mathrm{PA}^{\Sigma-REF}' class='latex' /> asserts that all <img src='http://l.wordpress.com/latex.php?latex=%5CSigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma_1' title='\Sigma_1' class='latex' /> sentences that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> proves are true.   But it&#8217;s actually the case that <em>all</em> sentences that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> proves are true.</p>
<p>By a result of Tarski&#8217;s we can&#8217;t define truth of an arithmetical formula in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />, but we can define it by adding a new predicate <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BTrue%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{True}' title='\mathrm{True}' class='latex' /> to the language of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />, together with suitable axioms.  The resulting theory <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D%5E%7B%5Cmathit%7BTrue%7D%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}^{\mathit{True}}' title='\mathrm{PA}^{\mathit{True}}' class='latex' />, extends every autonomous reflection extension of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />.</p>
<p>In terms of what arithmetical sentences they can prove, <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D%5E%7B%5Cmathit%7BTrue%7D%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}^{\mathit{True}}' title='\mathrm{PA}^{\mathit{True}}' class='latex' /> is an equivalent theory to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BACA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{ACA}' title='\mathrm{ACA}' class='latex' /> (edit: not <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BACA%7D_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{ACA}_0' title='\mathrm{ACA}_0' class='latex' />), which is the theory of second-order arithmetic, with a comprehension axiom for all arithmetic formulas.  This is essentially because sets of numbers in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BACA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{ACA}' title='\mathrm{ACA}' class='latex' /> are interchangeable with formulas in the language of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> with one free variable in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D%5E%7B%5Cmathit%7BTrue%7D%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}^{\mathit{True}}' title='\mathrm{PA}^{\mathit{True}}' class='latex' />.</p>
<p>And, of course, we then get autonomous iterated <em>truth </em>extensions of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' />, in analogy to the autonomous iterated reflection extensions and the autonomous iterated consistency extensions.  Here there is again a natural theory which extends all the autonomous iterated truth extensions, a theory called <img src='http://l.wordpress.com/latex.php?latex=%5CPi%5E1_1-%5Cmathrm%7BCA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Pi^1_1-\mathrm{CA}' title='\Pi^1_1-\mathrm{CA}' class='latex' />: it&#8217;s a theory of second-order arithmetic, like <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BACA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{ACA}' title='\mathrm{ACA}' class='latex' />, but it allows comprehension for <img src='http://l.wordpress.com/latex.php?latex=%5CPi%5E1_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Pi^1_1' title='\Pi^1_1' class='latex' />-formulas (formulas with a universal set quantifier in front), instead of just arithmetic formulas.</p>
<p>Of course, we can now start again, taking consistency or reflection extensions of <img src='http://l.wordpress.com/latex.php?latex=%5CPi%5E1_1-%5Cmathrm%7BCA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Pi^1_1-\mathrm{CA}' title='\Pi^1_1-\mathrm{CA}' class='latex' />.  But, as Franzén says:</p>
<blockquote><p>[E]xtending <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{PA}' title='\mathrm{PA}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5CPi%5E1_1-CA&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Pi^1_1-CA' title='\Pi^1_1-CA' class='latex' /> opens the door to a number of possible extensions that go beyond reflection.  In particular, we can extend a theory by introducing axioms about sets of higher type&#8212;meaning sets of sets of natural numbers, sets of sets of sets of natural numbers, and so on&#8212;and by introducing stronger comprehension principles for sets of a given type. &#8230; Axiomatic set theories like <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7BZF%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{ZF}' title='\mathrm{ZF}' class='latex' /> give powerful first-order theories which prove everything provable in such iterated autonomous extensions. &#8230; In this connection the term &#8220;reflection&#8221; reappears and takes on a new meaning. &#8230; [This] leads to a further indefinite sequence of extensions of set theory, and furthermore, &#8220;axioms of infinity&#8221; [i.e., large cardinal axioms], have been formulated which can be reasonably argued to be stronger, as far as arithmetical theorems are concerned, than any such extension by set-theoretic reflection.</p></blockquote>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/278/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/278/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/278/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/278/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/278/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/278/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/278/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/278/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/278/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/278/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/278/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/278/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/278/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/278/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=278&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2009/03/23/what-happens-when-you-iterate-godels-theorem/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>Trigonometric Series and the Beginnings of Set Theory</title>
		<link>http://xorshammer.com/2008/12/20/trigonometric-series-and-the-beginnings-of-set-theory/</link>
		<comments>http://xorshammer.com/2008/12/20/trigonometric-series-and-the-beginnings-of-set-theory/#comments</comments>
		<pubDate>Sat, 20 Dec 2008 13:38:17 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=253</guid>
		<description><![CDATA[Let be a -periodic function. It may or may not have a representation as a trigonometric series A natural question to ask is whether or not the representation of as a trigonometric series is unique, if it has one. It was the consideration of this question that led Cantor to the invention of set theory. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=253&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Let <img src='http://l.wordpress.com/latex.php?latex=f%5Ccolon%5Cmathbb%7BR%7D%5Cto%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f\colon\mathbb{R}\to\mathbb{R}' title='f\colon\mathbb{R}\to\mathbb{R}' class='latex' /> be a <img src='http://l.wordpress.com/latex.php?latex=2%5Cpi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2\pi' title='2\pi' class='latex' />-periodic function.  It may or may not have a representation as a trigonometric series</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle%7Ba_0%2B%5Csum_%7Bn%3D1%7D%5E%5Cinfty+a_n%5Csin%28nx%29+%2B+b_n%5Ccos%28nx%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\displaystyle{a_0+\sum_{n=1}^\infty a_n\sin(nx) + b_n\cos(nx)}' title='\displaystyle{a_0+\sum_{n=1}^\infty a_n\sin(nx) + b_n\cos(nx)}' class='latex' /></p>
<p>A natural question to ask is whether or not the representation of <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f' title='f' class='latex' /> as a trigonometric series is unique, if it has one.  It was the consideration of this question that led Cantor to the invention of set theory.</p>
<p>There is a nice writeup of this story in the first part of <a href="http://www.math.caltech.edu/papers/unique.ps">this article</a> by Alexander Kechris.  I&#8217;ll give part of the story below.</p>
<p><span id="more-253"></span></p>
<p>Cantor solved the problem in the affirmative; i.e., he proved:</p>
<blockquote><p>Suppose that a trigonometric series converges to zero everywhere in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' />.  Then all the coefficients of that series are zero.</p></blockquote>
<p>(By subtraction, this is equivalent to the problem stated above.)   He was also able to show (by a very similar method) the following, which I&#8217;ll call the Isolated Points Lemma:</p>
<blockquote><p>Suppose <img src='http://l.wordpress.com/latex.php?latex=a+%3C+b+%3C+c&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a &lt; b &lt; c' title='a &lt; b &lt; c' class='latex' /> and that a trigonometric series converges to zero on <img src='http://l.wordpress.com/latex.php?latex=%28a%2Cb%29%5Ccup+%28b%2Cc%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(a,b)\cup (b,c)' title='(a,b)\cup (b,c)' class='latex' />.  Then that series converges to zero at <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b' title='b' class='latex' /> as well.</p></blockquote>
<p>From these two results, we can immediately conclude the following:</p>
<blockquote><p>Suppose that a trigonometric series converges to zero at all but finitely many points.  Then the coefficients of that series are all zero.</p></blockquote>
<p>Call a set <img src='http://l.wordpress.com/latex.php?latex=S%5Csubset+%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S\subset \mathbb{R}' title='S\subset \mathbb{R}' class='latex' /> a set of uniqueness if whenever a trigonometric series converges to zero on <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D+-+S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R} - S' title='\mathbb{R} - S' class='latex' />, the coefficients of that series are all zero. Then the previous result may be stated: &#8220;All finite sets are sets of uniqueness.&#8221;</p>
<p>But we can use the Isolated Points Lemma to show more than that.  For example, we can show that the set <img src='http://l.wordpress.com/latex.php?latex=S+%3D+%5C%7B1%2Fn%5Cmid+n%5Cin+%5Cmathbb%7BN%7D%5C%7D%5Ccup+%5C%7B0%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S = \{1/n\mid n\in \mathbb{N}\}\cup \{0\}' title='S = \{1/n\mid n\in \mathbb{N}\}\cup \{0\}' class='latex' /> is a set of uniqueness.  The reason is that if a trigonometric series converges to zero on <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D+-+S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R} - S' title='\mathbb{R} - S' class='latex' />, then by the Isolated Points lemma, it also converges to zero on the points in <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2Fn+%5Cmid+n%5Cin%5Cmathbb%7BN%7D%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{1/n \mid n\in\mathbb{N}\}' title='\{1/n \mid n\in\mathbb{N}\}' class='latex' />.</p>
<p>But, <em>now that we know</em> that it converges to zero on the points in <img src='http://l.wordpress.com/latex.php?latex=%5C%7B1%2Fn%5Cmid+n%5Cin%5Cmathbb%7BN%7D%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{1/n\mid n\in\mathbb{N}\}' title='\{1/n\mid n\in\mathbb{N}\}' class='latex' />, we can apply the Isolated Points Lemma again to show that it converges to zero at 0 (since we now know that at converges to zero on, e.g., <img src='http://l.wordpress.com/latex.php?latex=%28-1%2C0%29%5Ccup+%280%2C1%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(-1,0)\cup (0,1)' title='(-1,0)\cup (0,1)' class='latex' />).</p>
<p>What we have actually shown by the above argument is the following:</p>
<blockquote><p>Given <img src='http://l.wordpress.com/latex.php?latex=A%5Csubset%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A\subset\mathbb{R}' title='A\subset\mathbb{R}' class='latex' />, let <img src='http://l.wordpress.com/latex.php?latex=A%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A&#039;' title='A&#039;' class='latex' /> be the set of limit points of <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> (also known as the Cantor-Bendixson derivative of <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' />).  If <img src='http://l.wordpress.com/latex.php?latex=A%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A&#039;' title='A&#039;' class='latex' /> is a set of uniqueness, then <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> is a set of uniqueness.</p></blockquote>
<p>For any <img src='http://l.wordpress.com/latex.php?latex=A%5Csubset+%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A\subset \mathbb{R}' title='A\subset \mathbb{R}' class='latex' />, let <img src='http://l.wordpress.com/latex.php?latex=A%5E%7B%28n%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A^{(n)}' title='A^{(n)}' class='latex' /> be the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />th Cantor-Bendixson derivative of <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' />.  Then, by iterating the above fact, we have the following:</p>
<blockquote><p>Suppose that for some <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=A%5E%7B%28n%29%7D+%3D+%5Cemptyset&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A^{(n)} = \emptyset' title='A^{(n)} = \emptyset' class='latex' />.  Then <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> is a set of uniqueness.</p></blockquote>
<p>This is as far as we can go as long as we merely iterate the Cantor-Bendixson derivative finitely often.  But, if we make the leap to iterating it transfinitely many times, we can go much further:</p>
<blockquote><p><strong>Theorem:</strong> All countable closed sets are sets of uniqueness.</p></blockquote>
<p><strong>Proof: </strong>First, define <img src='http://l.wordpress.com/latex.php?latex=A%5E%7B%28%5Calpha%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A^{(\alpha)}' title='A^{(\alpha)}' class='latex' /> for all ordinals and all closed sets <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> as follows:</p>
<ol>
<li><img src='http://l.wordpress.com/latex.php?latex=A%5E%7B%280%29%7D+%3D+A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A^{(0)} = A' title='A^{(0)} = A' class='latex' /></li>
<li><img src='http://l.wordpress.com/latex.php?latex=A%5E%7B%28%5Calpha+%2B+1%29%7D+%3D+%28A%5E%7B%28%5Calpha%29%7D%29%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A^{(\alpha + 1)} = (A^{(\alpha)})&#039;' title='A^{(\alpha + 1)} = (A^{(\alpha)})&#039;' class='latex' />.</li>
<li><img src='http://l.wordpress.com/latex.php?latex=A%5E%7B%28%5Calpha%29%7D+%3D+%5Cbigcap_%7B%5Cbeta+%3C+%5Calpha%7D+A%5E%7B%28%5Cbeta%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A^{(\alpha)} = \bigcap_{\beta &lt; \alpha} A^{(\beta)}' title='A^{(\alpha)} = \bigcap_{\beta &lt; \alpha} A^{(\beta)}' class='latex' />, when <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> is a limit ordinal.</li>
</ol>
<p>The Isolated Points Lemma says that if a trigonometric series converges to zero outside of <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' />, then it converges to zero outside of <img src='http://l.wordpress.com/latex.php?latex=A%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A&#039;' title='A&#039;' class='latex' />.  We will generalize this by showing the following lemma:</p>
<blockquote><p><strong>Lemma: </strong>If a trigonometric series converges to zero outside of <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' />, then it converges to zero outside of <img src='http://l.wordpress.com/latex.php?latex=A%5E%7B%28%5Calpha%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A^{(\alpha)}' title='A^{(\alpha)}' class='latex' /> for any <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' />.</p></blockquote>
<p><strong>Proof of Lemma: </strong>This is by transfinite induction. The successor step of the induction is just the Isolated Points Lemma again, so all we have to show is that, fixing a trigonometric series, if <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> is a limit ordinal and the series converges to zero outside of each <img src='http://l.wordpress.com/latex.php?latex=A%5E%7B%28%5Cbeta%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A^{(\beta)}' title='A^{(\beta)}' class='latex' /> for <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta+%3C+%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta &lt; \alpha' title='\beta &lt; \alpha' class='latex' />, then it converges to zero outside of <img src='http://l.wordpress.com/latex.php?latex=A%5E%7B%28%5Calpha%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A^{(\alpha)}' title='A^{(\alpha)}' class='latex' />.  But this follows simply because every point of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D-A%5E%7B%28%5Calpha%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}-A^{(\alpha)}' title='\mathbb{R}-A^{(\alpha)}' class='latex' /> must be in some <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D-A%5E%7B%28%5Cbeta%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}-A^{(\beta)}' title='\mathbb{R}-A^{(\beta)}' class='latex' /> for <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta+%3C+%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta &lt; \alpha' title='\beta &lt; \alpha' class='latex' /> by definition. <strong>End of proof of Lemma.</strong></p>
<p>To complete the proof of the theorem then, we just have to observe that for all countable closed <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=A%5E%7B%28%5Calpha%29%7D%3D%5Cemptyset&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A^{(\alpha)}=\emptyset' title='A^{(\alpha)}=\emptyset' class='latex' /> for some <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' />.  Clearly, for all closed <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' />, there is an <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%28A%5E%7B%28%5Calpha%29%7D%29%27%3DA%5E%7B%28%5Calpha%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(A^{(\alpha)})&#039;=A^{(\alpha)}' title='(A^{(\alpha)})&#039;=A^{(\alpha)}' class='latex' /> (this is because <img src='http://l.wordpress.com/latex.php?latex=%5C%7BA%5E%7B%28%5Cbeta%29%7D%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{A^{(\beta)}\}' title='\{A^{(\beta)}\}' class='latex' /> is a decreasing sequence).  But it is standard fact that a set <img src='http://l.wordpress.com/latex.php?latex=B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B' title='B' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=B%27+%3D+B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B&#039; = B' title='B&#039; = B' class='latex' /> is either empty or of cardinality <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B%5Caleph_0%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2^{\aleph_0}' title='2^{\aleph_0}' class='latex' />. (Such a set is called a perfect set and a reference for the cited fact is page 7 of David Marker&#8217;s <a href="http://www.math.uic.edu/~marker/math512/dst.ps">notes on descriptive set theory</a>.) <strong>End of proof.</strong></p>
<p>As a historical note, Kechris reports that while thinking about the above issues led Cantor to discover ordinals, he never actually wrote down a proof of the above theorem; that was finally done by Lebesgue in 1903.</p>
<p>Further, it was later proven by Bernstein and Young independently that arbitrary countable sets are sets of uniqueness, and by Bari that countable unions of closed sets of uniqueness are sets of uniqueness.</p>
<p>Edit: Simplified the proof.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/253/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/253/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/253/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/253/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/253/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/253/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/253/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/253/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/253/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/253/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/253/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/253/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/253/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/253/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=253&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/12/20/trigonometric-series-and-the-beginnings-of-set-theory/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>A Simple Introduction to Quantum Groups</title>
		<link>http://xorshammer.com/2008/12/11/a-simple-introduction-to-quantum-groups/</link>
		<comments>http://xorshammer.com/2008/12/11/a-simple-introduction-to-quantum-groups/#comments</comments>
		<pubDate>Fri, 12 Dec 2008 02:49:23 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=225</guid>
		<description><![CDATA[In the course of reading some background material for an article by James Worthington on using bialgebraic structures in automata theory, I was led to finally reading up on what a Hopf algebra (sometimes called a &#8220;quantum group&#8220;) is. Although it is not strictly related to logic, I&#8217;ll write up what I learned here. My [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=225&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In the course of reading some background material for <a href="http://arxiv.org/abs/0807.4553">an article</a> by James Worthington on using bialgebraic structures in automata theory, I was led to finally reading up on what a <a href="http://en.wikipedia.org/wiki/Hopf_algebra">Hopf algebra</a> (sometimes called a &#8220;<a href="http://en.wikipedia.org/wiki/Quantum_group">quantum group</a>&#8220;) is.</p>
<p>Although it is not strictly related to logic, I&#8217;ll write up what I learned here.</p>
<p><span id="more-225"></span></p>
<p>My main source for this is sigfpe&#8217;s <a href="http://sigfpe.blogspot.com/2008/08/hopf-algebra-group-monad.html">blog post</a> on this.  In fact, all I really did was take his post and remove the Haskell from it (and probably add some mistakes).  If you can read Haskell, I definitely recommend that post (and his blog in general).  I also read the article on quantum groups in the <a href="http://gowers.wordpress.com/2008/08/28/princeton-companion-latest/">Princeton Companion to Mathematics</a>, which is an amazing book.</p>
<p>We&#8217;ll take the ordinary definition of a group and turn it into the definition of a quantum group in two steps.</p>
<p><strong>Step 1: Groorgs.</strong></p>
<p>In the first step, we&#8217;ll &#8220;symmetrize&#8221; the definition of a group to get an object I call a groorg.  What does it mean to symmetrize a definition?  Well, for one thing, as part of the definition of a group we have a multiplication <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D%5Ccolon+G%5Ctimes+G%5Cto+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}\colon G\times G\to G' title='\text{mult}\colon G\times G\to G' class='latex' />.  Therefore, to make it symmetric, we should also have a comultiplication <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D+%5Ccolon+G%5Cto+G%5Ctimes+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult} \colon G\to G\times G' title='\text{comult} \colon G\to G\times G' class='latex' />.</p>
<p>Furthermore, this comultiplication should satisfy laws dual to those satisfied by multiplication.  For example, multiplication is associative, which means that for any <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b' title='b' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=c%5Cin+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='c\in G' title='c\in G' class='latex' />, the two ways of using multiplication to turn the triple <img src='http://l.wordpress.com/latex.php?latex=abc&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='abc' title='abc' class='latex' /> into a single element are the same.  Dually, then, comultiplication should be coassociative, meaning that for any element <img src='http://l.wordpress.com/latex.php?latex=a%5Cin+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a\in G' title='a\in G' class='latex' />, the two ways of using comultiplication to turn that single element into a triple should be the same.</p>
<p>OK, so we&#8217;ve seen that the dual of multiplication is comultiplication.  Another part of the definition of a group is the inverse function <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Binv%7D%5Ccolon+G%5Cto+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{inv}\colon G\to G' title='\text{inv}\colon G\to G' class='latex' />, which is no problem, since it can be its own dual.  But what about the identity element <img src='http://l.wordpress.com/latex.php?latex=e%5Cin+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e\in G' title='e\in G' class='latex' />?  This is puzzling until you think of the identity not as an element of <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' />, but as a map from a one element set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B%5Cbullet%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{\bullet\}' title='\{\bullet\}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' />, which, when written it that form, I&#8217;ll call <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bunit%7D%5Ccolon+%5C%7B%5Cbullet%5C%7D%5Cto+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{unit}\colon \{\bullet\}\to G' title='\text{unit}\colon \{\bullet\}\to G' class='latex' />.  Then it&#8217;s clear that the dual of <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bunit%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{unit}' title='\text{unit}' class='latex' /> should be <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcounit%7D+%5Ccolon+G%5Cto+%5C%7B%5Cbullet%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{counit} \colon G\to \{\bullet\}' title='\text{counit} \colon G\to \{\bullet\}' class='latex' />.  The astute reader will note that there is only one such function, so this seems to be a trivial addition, but let&#8217;s press on regardless.</p>
<p>We can now begin the definition of a groorg.</p>
<blockquote><p><strong>Definition of a Groorg, Part 1. </strong>A groorg is a set <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' /> together with:</p>
<ol>
<li>A map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D%5Ccolon+G%5Ctimes+G%5Cto+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}\colon G\times G\to G' title='\text{mult}\colon G\times G\to G' class='latex' />.</li>
<li>A map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D%5Ccolon+G%5Cto+G%5Ctimes+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}\colon G\to G\times G' title='\text{comult}\colon G\to G\times G' class='latex' />.</li>
<li>A map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bunit%7D%5Ccolon+%5C%7B%5Cbullet%5C%7D%5Cto+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{unit}\colon \{\bullet\}\to G' title='\text{unit}\colon \{\bullet\}\to G' class='latex' />.</li>
<li>A map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcounit%7D+%5Ccolon+G%5Cto+%5C%7B%5Cbullet%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{counit} \colon G\to \{\bullet\}' title='\text{counit} \colon G\to \{\bullet\}' class='latex' />.</li>
<li>A map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Binv%7D%5Ccolon+G%5Cto+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{inv}\colon G\to G' title='\text{inv}\colon G\to G' class='latex' />.</li>
</ol>
<p>These must satisfy the following properties:</p>
<ul>
<li>Multiplication must be associative and comultiplication must be coassociative.</li>
<li><img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bunit%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{unit}' title='\text{unit}' class='latex' /> should be a unit for multiplication.  This means that, given any <img src='http://l.wordpress.com/latex.php?latex=a%5Cin+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a\in G' title='a\in G' class='latex' />, if you form an ordered pair with <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bunit%7D%28%5Cbullet%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{unit}(\bullet)' title='\text{unit}(\bullet)' class='latex' />, then apply <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}' title='\text{mult}' class='latex' /> to that ordered pair, you get <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> back.</li>
<li><img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcounit%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{counit}' title='\text{counit}' class='latex' /> should be a counit for comultplication.  This means that, given any <img src='http://l.wordpress.com/latex.php?latex=a%5Cin+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a\in G' title='a\in G' class='latex' />, if you apply <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}' title='\text{comult}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> to get an ordered pair, then destroy one of the componenents with <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcounit%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{counit}' title='\text{counit}' class='latex' /> (i.e., just discard it), then you get <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> back.</li>
</ul>
</blockquote>
<p>Let&#8217;s pause here.  If you&#8217;ve followed along, you may have noticed that this last condition forces <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D%28a%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}(a)' title='\text{comult}(a)' class='latex' /> to be <img src='http://l.wordpress.com/latex.php?latex=%28a%2Ca%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(a,a)' title='(a,a)' class='latex' /> (which does satisfy the conditions so far).  This is obviously quite restrictive, but it has one benefit: it means that we can rewrite the inverse law using comultiplication.  To see what I mean, consider the following: If we let <img src='http://l.wordpress.com/latex.php?latex=f%28a%2Cb%29+%3D+%28%5Ctext%7Binv%7D%28a%29%2Cb%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(a,b) = (\text{inv}(a),b)' title='f(a,b) = (\text{inv}(a),b)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g%28a%2Cb%29+%3D+%28a%2C%5Ctext%7Binv%7D%28b%29%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='g(a,b) = (a,\text{inv}(b))' title='g(a,b) = (a,\text{inv}(b))' class='latex' />, then the usual inverse law for a group says that for any <img src='http://l.wordpress.com/latex.php?latex=a%5Cin+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a\in G' title='a\in G' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%28%5Ctext%7Bmult%7D%5Ccirc+f%29%28a%2Ca%29+%3D+%28%5Ctext%7Bmult%7D%5Ccirc+g%29%28a%2Ca%29+%3D+%5Ctext%7Bunit%7D%28%5Cbullet%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\text{mult}\circ f)(a,a) = (\text{mult}\circ g)(a,a) = \text{unit}(\bullet)' title='(\text{mult}\circ f)(a,a) = (\text{mult}\circ g)(a,a) = \text{unit}(\bullet)' class='latex' />.  Now, since we know that <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}' title='\text{comult}' class='latex' /> is forced to take each <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%28a%2Ca%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(a,a)' title='(a,a)' class='latex' />, we can rewrite the inverse law as: <img src='http://l.wordpress.com/latex.php?latex=%28%5Ctext%7Bmult%7D%5Ccirc+f+%5Ccirc+%5Ctext%7Bcomult%7D%29%28a%29+%3D+%28%5Ctext%7Bmult%7D%5Ccirc+g%5Ccirc+%5Ctext%7Bcomult%7D%29%28a%29+%3D+%5Ctext%7Bunit%7D%28%5Cbullet%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\text{mult}\circ f \circ \text{comult})(a) = (\text{mult}\circ g\circ \text{comult})(a) = \text{unit}(\bullet)' title='(\text{mult}\circ f \circ \text{comult})(a) = (\text{mult}\circ g\circ \text{comult})(a) = \text{unit}(\bullet)' class='latex' />.</p>
<p>We can make this completely symmetric by getting <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcounit%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{counit}' title='\text{counit}' class='latex' /> into the act:</p>
<blockquote><p><strong>Definition of a Groorg, Part 2.</strong> A groorg is also required to satisfy: <img src='http://l.wordpress.com/latex.php?latex=%28%5Ctext%7Bmult%7D%5Ccirc+f+%5Ccirc+%5Ctext%7Bcomult%7D%29%28a%29+%3D+%28%5Ctext%7Bmult%7D%5Ccirc+g%5Ccirc+%5Ctext%7Bcomult%7D%29%28a%29+%3D+%5Ctext%7Bunit%7D%28%5Ctext%7Bcounit%7D%28a%29%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\text{mult}\circ f \circ \text{comult})(a) = (\text{mult}\circ g\circ \text{comult})(a) = \text{unit}(\text{counit}(a))' title='(\text{mult}\circ f \circ \text{comult})(a) = (\text{mult}\circ g\circ \text{comult})(a) = \text{unit}(\text{counit}(a))' class='latex' /> for all <img src='http://l.wordpress.com/latex.php?latex=a%5Cin+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a\in G' title='a\in G' class='latex' /> (where <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f' title='f' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=g&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='g' title='g' class='latex' /> are as above.</p></blockquote>
<p>And we finally have a requirement which says that the two ways of computing <img src='http://l.wordpress.com/latex.php?latex=%28ab%2C+ab%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(ab, ab)' title='(ab, ab)' class='latex' /> by using comultiplication and multiplication are equal.</p>
<blockquote><p><strong>Definition of a Groorg, Part 3. </strong>A groorg is also required to satisfy: <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D%28%5Ctext%7Bmult%7D%28a%2Cb%29%29+%3D+%5Ctext%7Bmult%7D%28%5Ctext%7Bcomult%7D%28a%29%2C%5Ctext%7Bcomult%7D%28b%29%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}(\text{mult}(a,b)) = \text{mult}(\text{comult}(a),\text{comult}(b))' title='\text{comult}(\text{mult}(a,b)) = \text{mult}(\text{comult}(a),\text{comult}(b))' class='latex' />, where the second <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}' title='\text{mult}' class='latex' /> is the natural multiplication on <img src='http://l.wordpress.com/latex.php?latex=G%5Ctimes+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G\times G' title='G\times G' class='latex' />.</p></blockquote>
<p>This concludes the definition of a groorg.</p>
<p>But, what use is it?  As we&#8217;ve observed, in every groorg, we must have that <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}' title='\text{comult}' class='latex' /> sends <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%28a%2Ca%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(a,a)' title='(a,a)' class='latex' />, so that the groorg just reduces to an ordinary group.  Furthermore, every group becomes a groorg by defining comult in that way (and by defining counit in the only possible way).</p>
<p>What we&#8217;ve gained is that we now have a definition of a group which is equivalent to the old one and which is symmetric, which will lend itself well to our next step.</p>
<p><strong>Step 2.  Adding superpositions.</strong></p>
<p>How can we turn this concept of a group into one of a &#8220;quantum&#8221; group? If you&#8217;re like me, the only thing you know about quantum mechanics is that you often hear the word &#8220;superposition&#8221; used in conjunction with it. That&#8217;s not much, but it turns out to be enough in this case.</p>
<p>Instead of having the composition <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D%28g%2Ch%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}(g,h)' title='\text{mult}(g,h)' class='latex' /> of two group elements be another group element, let&#8217;s have it be a <em>superposition</em> of group elements.  It turns out that what this should mean is a <em>linear combination</em> of group elements. So, let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D%5BG%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}[G]' title='\mathbb{C}[G]' class='latex' /> be the <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}' title='\mathbb{C}' class='latex' />-vector space generated by taking <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' /> as a formal set of basis vectors. Instead of requiring that <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}' title='\text{mult}' class='latex' /> be a map from <img src='http://l.wordpress.com/latex.php?latex=G%5Ctimes+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G\times G' title='G\times G' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' />, we will let it be a map from <img src='http://l.wordpress.com/latex.php?latex=G%5Ctimes+G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G\times G' title='G\times G' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D%5BG%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}[G]' title='\mathbb{C}[G]' class='latex' />. So we have the following:</p>
<blockquote><p><strong>Provisional Definition of a Quantum Group.</strong> A quantum group is a set <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' /> together with</p>
<ol>
<li>A map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D%5Ccolon+G%5Ctimes+G+%5Cto+%5Cmathbb%7BC%7D%5BG%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}\colon G\times G \to \mathbb{C}[G]' title='\text{mult}\colon G\times G \to \mathbb{C}[G]' class='latex' />.</li>
<li>A map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D%5Ccolon+G%5Cto+%5Cmathbb%7BC%7D%5BG%5Ctimes+G%5D+%3D+%5Cmathbb%7BC%7D%5BG%5D%5Cotimes%5Cmathbb%7BC%7D%5BG%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}\colon G\to \mathbb{C}[G\times G] = \mathbb{C}[G]\otimes\mathbb{C}[G]' title='\text{comult}\colon G\to \mathbb{C}[G\times G] = \mathbb{C}[G]\otimes\mathbb{C}[G]' class='latex' />.</li>
<li>A map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bunit%7D%5Ccolon+%5C%7B%5Cbullet%5C%7D%5Cto+%5Cmathbb%7BC%7D%5BG%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{unit}\colon \{\bullet\}\to \mathbb{C}[G]' title='\text{unit}\colon \{\bullet\}\to \mathbb{C}[G]' class='latex' />.</li>
<li>A map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcounit%7D%5Ccolon+G%5Cto+%5Cmathbb%7BC%7D%5B%5C%7B%5Cbullet%5C%7D%5D+%3D+%5Cmathbb%7BC%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{counit}\colon G\to \mathbb{C}[\{\bullet\}] = \mathbb{C}' title='\text{counit}\colon G\to \mathbb{C}[\{\bullet\}] = \mathbb{C}' class='latex' /></li>
<li>A map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Binv%7D%5Ccolon+G%5Cto+%5Cmathbb%7BC%7D%5BG%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{inv}\colon G\to \mathbb{C}[G]' title='\text{inv}\colon G\to \mathbb{C}[G]' class='latex' />.</li>
</ol>
<p>satisfying &#8230;</p></blockquote>
<p>Before we can think about what properties these functions should satisfy, we have to settle a question: We know how to multiply two group elements to get a superposition of group elements, but how should we multiply two superpositions of group elements? For example, what should the product of <img src='http://l.wordpress.com/latex.php?latex=g_1+%2B+%5Cpi+g_2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='g_1 + \pi g_2' title='g_1 + \pi g_2' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=ig_3+%2B+2+g_4&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='ig_3 + 2 g_4' title='ig_3 + 2 g_4' class='latex' /> be? (Note that I am using the same symbol to stand for the group element and the formal basis vector corresponding to it.)</p>
<p>The quickest way to define the multiplication of superpositions is to notice that, since <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bg_1%5Cotimes+g_2%5Cmid+g_1%2Cg_2%5Cin+G%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{g_1\otimes g_2\mid g_1,g_2\in G\}' title='\{g_1\otimes g_2\mid g_1,g_2\in G\}' class='latex' /> is a basis for <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D%5BG%5D%5Cotimes+%5Cmathbb%7BC%7D%5BG%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}[G]\otimes \mathbb{C}[G]' title='\mathbb{C}[G]\otimes \mathbb{C}[G]' class='latex' />, the map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D%5Ccolon+G%5Ctimes+G%5Cto+%5Cmathbb%7BC%7D%5BG%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}\colon G\times G\to \mathbb{C}[G]' title='\text{mult}\colon G\times G\to \mathbb{C}[G]' class='latex' /> extends to a <em>linear</em> map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D%5Ccolon+%5Cmathbb%7BC%7D%5BG%5D%5Cotimes+%5Cmathbb%7BC%7D%5BG%5D%5Cto+%5Cmathbb%7BC%7D%5BG%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}\colon \mathbb{C}[G]\otimes \mathbb{C}[G]\to \mathbb{C}[G]' title='\text{mult}\colon \mathbb{C}[G]\otimes \mathbb{C}[G]\to \mathbb{C}[G]' class='latex' />, which we can use to multiply superpositions. In the above example, <img src='http://l.wordpress.com/latex.php?latex=%28g_1+%2B+%5Cpi+g_2%29%5Cotimes+%28ig_3+%2B+2+g_4%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(g_1 + \pi g_2)\otimes (ig_3 + 2 g_4)' title='(g_1 + \pi g_2)\otimes (ig_3 + 2 g_4)' class='latex' /> is equal to <img src='http://l.wordpress.com/latex.php?latex=ig_1%5Cotimes+g_3+%2B+2g_1%5Cotimes+g_4+%2B+i%5Cpi+g_2%5Cotimes+g_3+%2B+2%5Cpi+g_2%5Cotimes+g_4&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='ig_1\otimes g_3 + 2g_1\otimes g_4 + i\pi g_2\otimes g_3 + 2\pi g_2\otimes g_4' title='ig_1\otimes g_3 + 2g_1\otimes g_4 + i\pi g_2\otimes g_3 + 2\pi g_2\otimes g_4' class='latex' />, so the product of the two superpositions would be <img src='http://l.wordpress.com/latex.php?latex=i%5Ctext%7Bmult%7D%28g_1%2Cg_3%29+%2B+2%5Ctext%7Bmult%7D%28g_1%2Cg_4%29+%2B+i%5Cpi%5Ctext%7Bmult%7D%28g_2%2Cg_3%29+%2B+2%5Cpi%5Ctext%7Bmult%7D%28g_2%2Cg_4%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i\text{mult}(g_1,g_3) + 2\text{mult}(g_1,g_4) + i\pi\text{mult}(g_2,g_3) + 2\pi\text{mult}(g_2,g_4)' title='i\text{mult}(g_1,g_3) + 2\text{mult}(g_1,g_4) + i\pi\text{mult}(g_2,g_3) + 2\pi\text{mult}(g_2,g_4)' class='latex' />.</p>
<p>Now, our symmetric definition of a group above translates exactly, and we no longer need to mention the basis explicitly:</p>
<blockquote><p><strong>Definition of a Quantum Group (or Hopf Algebra)</strong>.  A quantum group is a <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}' title='\mathbb{C}' class='latex' />-vector space <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' /> together with:</p>
<ol>
<li>A linear map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D%5Ccolon+V%5Cotimes+V%5Cto+V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}\colon V\otimes V\to V' title='\text{mult}\colon V\otimes V\to V' class='latex' /></li>
<li>A linear map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D%5Ccolon+V%5Cto+V%5Cotimes+V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}\colon V\to V\otimes V' title='\text{comult}\colon V\to V\otimes V' class='latex' /></li>
<li>A linear map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bunit%7D+%5Ccolon+%5Cmathbb%7BC%7D%5Cto+V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{unit} \colon \mathbb{C}\to V' title='\text{unit} \colon \mathbb{C}\to V' class='latex' /></li>
<li>A linear map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcounit%7D+%5Ccolon+V%5Cto+%5Cmathbb%7BC%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{counit} \colon V\to \mathbb{C}' title='\text{counit} \colon V\to \mathbb{C}' class='latex' /></li>
<li>A linear map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Binv%7D%5Ccolon+V%5Cto+V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{inv}\colon V\to V' title='\text{inv}\colon V\to V' class='latex' /></li>
</ol>
<p>satisfying the analogues of the laws given in the definition of a groorg.</p></blockquote>
<p><strong>Some Combinatorial Examples</strong></p>
<p>I believe there are many examples of the usefulness of this concept in physics.  However because I don&#8217;t know any physics, I won&#8217;t give them.</p>
<p>Here are two combinatorial examples from <a href="http://sigfpe.blogspot.com/2008/08/hopf-algebra-group-monad.html">sigfpe&#8217;s blog post</a>:</p>
<p><strong>Example 1: A Quantum Group on Finite Strings.<br />
</strong></p>
<p>Let <img src='http://l.wordpress.com/latex.php?latex=%5CSigma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma' title='\Sigma' class='latex' /> be an alphabet, and let <img src='http://l.wordpress.com/latex.php?latex=%5CSigma%5E%2A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma^*' title='\Sigma^*' class='latex' /> be the set of all finite strings with characters from <img src='http://l.wordpress.com/latex.php?latex=%5CSigma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Sigma' title='\Sigma' class='latex' />.  We may put a quantum group structure on <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D%5B%5CSigma%5E%2A%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}[\Sigma^*]' title='\mathbb{C}[\Sigma^*]' class='latex' /> as follows:</p>
<ul>
<li>We let <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D%28%5Csigma%2C%5Ctau%29+%3D+%5Csigma%5Ctau&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}(\sigma,\tau) = \sigma\tau' title='\text{mult}(\sigma,\tau) = \sigma\tau' class='latex' />, the concatenation of <img src='http://l.wordpress.com/latex.php?latex=%5Csigma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma' title='\sigma' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Ctau&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\tau' title='\tau' class='latex' />.</li>
<li>We let <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bunit%7D%281%29+%3D+%5Cvarepsilon&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{unit}(1) = \varepsilon' title='\text{unit}(1) = \varepsilon' class='latex' />, the empty string.</li>
<li>We let <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcounit%7D%28%5Cvarepsilon%29+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{counit}(\varepsilon) = 1' title='\text{counit}(\varepsilon) = 1' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcounit%7D%28%5Csigma%29+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{counit}(\sigma) = 0' title='\text{counit}(\sigma) = 0' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%5Csigma+%5Cne+%5Cvarepsilon&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma \ne \varepsilon' title='\sigma \ne \varepsilon' class='latex' />.</li>
<li>We let <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Binv%7D%28%5Csigma%29+%3D+%28-1%29%5E%7B%7C%5Csigma%7C%7D%5Cbar%7B%5Csigma%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{inv}(\sigma) = (-1)^{|\sigma|}\bar{\sigma}' title='\text{inv}(\sigma) = (-1)^{|\sigma|}\bar{\sigma}' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=%7C%5Csigma%7C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='|\sigma|' title='|\sigma|' class='latex' /> is the length of <img src='http://l.wordpress.com/latex.php?latex=%5Csigma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma' title='\sigma' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cbar%7B%5Csigma%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\bar{\sigma}' title='\bar{\sigma}' class='latex' /> is the reverse of <img src='http://l.wordpress.com/latex.php?latex=%5Csigma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma' title='\sigma' class='latex' />.</li>
<li>We let <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D%28%5Csigma%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}(\sigma)' title='\text{comult}(\sigma)' class='latex' /> be the sum of <img src='http://l.wordpress.com/latex.php?latex=%5Csigma_0%5Cotimes+%5Csigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma_0\otimes \sigma_1' title='\sigma_0\otimes \sigma_1' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%5Csigma_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma_0' title='\sigma_0' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Csigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma_1' title='\sigma_1' class='latex' /> can be shuffled together to give <img src='http://l.wordpress.com/latex.php?latex=%5Csigma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma' title='\sigma' class='latex' />.  This means that the characters in <img src='http://l.wordpress.com/latex.php?latex=%5Csigma_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma_0' title='\sigma_0' class='latex' /> occur in <img src='http://l.wordpress.com/latex.php?latex=%5Csigma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma' title='\sigma' class='latex' /> in the same order, and when you remove them you get <img src='http://l.wordpress.com/latex.php?latex=%5Csigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma_1' title='\sigma_1' class='latex' />.</li>
</ul>
<p>For an example of the comultiplication, if <img src='http://l.wordpress.com/latex.php?latex=a&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a' title='a' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=b%5Cin+%5CSigma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b\in \Sigma' title='b\in \Sigma' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D%28ab%29+%3D+%5Cvarepsilon+%5Cotimes+ab+%2B+a+%5Cotimes+b+%2B+b+%5Cotimes+a+%2B+ab%5Cotimes+%5Cvarepsilon&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}(ab) = \varepsilon \otimes ab + a \otimes b + b \otimes a + ab\otimes \varepsilon' title='\text{comult}(ab) = \varepsilon \otimes ab + a \otimes b + b \otimes a + ab\otimes \varepsilon' class='latex' />.<br />
We can verify the inverse law in this case: If we apply <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Binv%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{inv}' title='\text{inv}' class='latex' /> on the right to <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D%28ab%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}(ab)' title='\text{comult}(ab)' class='latex' />, we get <img src='http://l.wordpress.com/latex.php?latex=%5Cvarepsilon%5Cotimes+ba+-+a+%5Cotimes+b+-+b+%5Cotimes+a+%2B+ab%5Cotimes+%5Cvarepsilon&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\varepsilon\otimes ba - a \otimes b - b \otimes a + ab\otimes \varepsilon' title='\varepsilon\otimes ba - a \otimes b - b \otimes a + ab\otimes \varepsilon' class='latex' />.  Applying <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}' title='\text{mult}' class='latex' /> to this, we get <img src='http://l.wordpress.com/latex.php?latex=ba+-ab+-ba+%2B+ab+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='ba -ab -ba + ab = 0' title='ba -ab -ba + ab = 0' class='latex' />.</p>
<p><strong>Example 2.  Another Quantum Group on Finite Strings.</strong><br />
This is also a quantum group on <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D%5B%5CSigma%5E%2A%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}[\Sigma^*]' title='\mathbb{C}[\Sigma^*]' class='latex' />.</p>
<ul>
<li>We let <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D%28%5Csigma%2C%5Ctau%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}(\sigma,\tau)' title='\text{mult}(\sigma,\tau)' class='latex' /> be the sum of all possible ways of shuffling <img src='http://l.wordpress.com/latex.php?latex=%5Csigma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma' title='\sigma' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Ctau&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\tau' title='\tau' class='latex' /> together.</li>
<li>We let <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bunit%7D%281%29+%3D+%5Cvarepsilon&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{unit}(1) = \varepsilon' title='\text{unit}(1) = \varepsilon' class='latex' />, the empty string.</li>
<li>We let <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcounit%7D%28%5Cvarepsilon%29+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{counit}(\varepsilon) = 1' title='\text{counit}(\varepsilon) = 1' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcounit%7D%28%5Csigma%29+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{counit}(\sigma) = 0' title='\text{counit}(\sigma) = 0' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%5Csigma+%5Cne+%5Cvarepsilon&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma \ne \varepsilon' title='\sigma \ne \varepsilon' class='latex' />.</li>
<li>We let <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Binv%7D%28%5Csigma%29+%3D+%28-1%29%5E%7B%7C%5Csigma%7C%7D%5Cbar%7B%5Csigma%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{inv}(\sigma) = (-1)^{|\sigma|}\bar{\sigma}' title='\text{inv}(\sigma) = (-1)^{|\sigma|}\bar{\sigma}' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=%7C%5Csigma%7C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='|\sigma|' title='|\sigma|' class='latex' /> is the length of <img src='http://l.wordpress.com/latex.php?latex=%5Csigma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma' title='\sigma' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cbar%7B%5Csigma%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\bar{\sigma}' title='\bar{\sigma}' class='latex' /> is the reverse of <img src='http://l.wordpress.com/latex.php?latex=%5Csigma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma' title='\sigma' class='latex' />.</li>
<li>We let <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D%28%5Csigma%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}(\sigma)' title='\text{comult}(\sigma)' class='latex' /> be the sum of all <img src='http://l.wordpress.com/latex.php?latex=%5Csigma_0%5Cotimes+%5Csigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma_0\otimes \sigma_1' title='\sigma_0\otimes \sigma_1' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%5Csigma+%3D+%5Csigma_0%5Csigma_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sigma = \sigma_0\sigma_1' title='\sigma = \sigma_0\sigma_1' class='latex' />.</li>
</ul>
<p>Again, we can verify the inverse law in a specific case: We have that <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D%28ab%29+%3D+%5Cvarepsilon%5Cotimes+ab+%2B+a%5Cotimes+b+%2B+ab%5Cotimes+%5Cvarepsilon&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}(ab) = \varepsilon\otimes ab + a\otimes b + ab\otimes \varepsilon' title='\text{comult}(ab) = \varepsilon\otimes ab + a\otimes b + ab\otimes \varepsilon' class='latex' />.  If we apply <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Binv%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{inv}' title='\text{inv}' class='latex' /> on the right, we get <img src='http://l.wordpress.com/latex.php?latex=%5Cvarepsilon%5Cotimes+ab+-+a%5Cotimes+b+%2B+ab%5Cotimes+%5Cvarepsilon&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\varepsilon\otimes ab - a\otimes b + ab\otimes \varepsilon' title='\varepsilon\otimes ab - a\otimes b + ab\otimes \varepsilon' class='latex' />.  Now applying <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}' title='\text{mult}' class='latex' />, we get <img src='http://l.wordpress.com/latex.php?latex=ab+-ab+-ba+%2B+ba+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='ab -ab -ba + ba = 0' title='ab -ab -ba + ba = 0' class='latex' />.</p>
<p><strong>Example 3: A Quantum Group on Finite Binary Trees.</strong><br />
This example is from <a href="http://www.math.tamu.edu/~maguiar/Loday.pdf">here</a>.  We think of a finite binary tree as a finite tree where each node has either zero or two children, and where we distinguish between left and right.  This picture from the above paper shows how, if you select a leaf of a finite binary tree, you may divide the tree into the tree to the left of the leaf and the tree to the right of the leaf:<br />
<img class="alignnone size-full wp-image-227" title="split_tree" src="http://xorshammer.files.wordpress.com/2008/12/split_tree.png?w=435&#038;h=100" alt="split_tree" width="435" height="100" /><br />
If you select a multiset of <img src='http://l.wordpress.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q' title='q' class='latex' /> leaves, you may similarly divide your tree into <img src='http://l.wordpress.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q' title='q' class='latex' /> trees.</p>
<p>We may now define a quantum group on <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D%5B%5Cmathcal%7BT%7D%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}[\mathcal{T}]' title='\mathbb{C}[\mathcal{T}]' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BT%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathcal{T}' title='\mathcal{T}' class='latex' /> is the set of finite binary trees as follows:</p>
<ul>
<li><img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bmult%7D%28T_1%2CT_2%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{mult}(T_1,T_2)' title='\text{mult}(T_1,T_2)' class='latex' /> is the sum of all trees generated as follows: Suppose that <img src='http://l.wordpress.com/latex.php?latex=T_2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T_2' title='T_2' class='latex' /> has <img src='http://l.wordpress.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q' title='q' class='latex' /> leaves.  Divide <img src='http://l.wordpress.com/latex.php?latex=T_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T_1' title='T_1' class='latex' /> into <img src='http://l.wordpress.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q' title='q' class='latex' /> trees as above, and stick them onto the leaves of <img src='http://l.wordpress.com/latex.php?latex=T_2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T_2' title='T_2' class='latex' />.  There are many graphical examples of this <a href="http://www.math.tamu.edu/~sottile/pages/HopfAlgebras/YSym/F-Product.html">here</a>.</li>
<li><img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bunit%7D%281%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{unit}(1)' title='\text{unit}(1)' class='latex' /> is the tree with a single node and no leaves.</li>
<li><img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcomult%7D%28T%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{comult}(T)' title='\text{comult}(T)' class='latex' /> is the sum of all <img src='http://l.wordpress.com/latex.php?latex=T_1%5Cotimes+T_2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T_1\otimes T_2' title='T_1\otimes T_2' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=T_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T_1' title='T_1' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=T_2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T_2' title='T_2' class='latex' /> are trees that <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> can be divided into.  There are many graphical examples of this <a href="http://www.math.tamu.edu/~sottile/pages/HopfAlgebras/YSym/F-Coproduct.html">here</a>.</li>
<li><img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcounit%7D%28T%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{counit}(T)' title='\text{counit}(T)' class='latex' /> is 1 if <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> is the tree with only one node, and zero otherwise.</li>
<li>For the definition of <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Binv%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{inv}' title='\text{inv}' class='latex' />, I refer you to the above paper.  However, there are many graphical examples of the antipode <a href="http://www.math.tamu.edu/~sottile/pages/HopfAlgebras/YSym/F-Antipode.html">here</a>.</li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/225/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=225&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/12/11/a-simple-introduction-to-quantum-groups/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2008/12/split_tree.png" medium="image">
			<media:title type="html">split_tree</media:title>
		</media:content>
	</item>
		<item>
		<title>Doing Calculus on the Rationals (with the help of Nonstandard Analysis)</title>
		<link>http://xorshammer.com/2008/11/14/doing-calculus-on-the-rationals-with-the-help-of-nonstandard-analysis/</link>
		<comments>http://xorshammer.com/2008/11/14/doing-calculus-on-the-rationals-with-the-help-of-nonstandard-analysis/#comments</comments>
		<pubDate>Sat, 15 Nov 2008 02:32:08 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=218</guid>
		<description><![CDATA[Nonstandard Analysis is usually used to introduce infinitesimals into the real numbers in an attempt to make arguments in analysis more intuitive. The idea is that you construct a superset which contains the reals and also some infinitesimals, prove that some statement holds of , and then use a general &#8220;transfer principle&#8221; to conclude that [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=218&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Nonstandard Analysis is usually used to introduce infinitesimals into the real numbers in an attempt to make arguments in analysis more intuitive.</p>
<p>The idea is that you construct a superset <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E%2A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^*' title='\mathbb{R}^*' class='latex' /> which contains the reals and also some infinitesimals, prove that some statement holds of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E%2A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^*' title='\mathbb{R}^*' class='latex' />, and then use a general &#8220;transfer principle&#8221; to conclude that the same statement holds of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' />.</p>
<p>Implicit in this procedure is the idea that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' /> is the <em>real</em> world, and therefore the goal is to prove things about it.  We construct a field <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E%2A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^*' title='\mathbb{R}^*' class='latex' /> with infinitesimals, but only as a method for eventually proving something about <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' />.</p>
<p>We can do precisely the same thing with <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Q}' title='\mathbb{Q}' class='latex' /> instead of with <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}' title='\mathbb{R}' class='latex' />.  But, in  <a href="http://www.andrew.cmu.edu/user/avigad/Papers/nonstandard2.pdf">Weak Theories of Nonstandard Arithmetic and Analysis</a>, Jeremy Avigad observed that if we don&#8217;t care about transferring the results back down to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Q}' title='\mathbb{Q}' class='latex' />, then we can get all the basic results of calculus and elementary real analysis just by working with <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D%5E%2A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Q}^*' title='\mathbb{Q}^*' class='latex' />, and without ever having to construct the reals.</p>
<p><span id="more-218"></span></p>
<p>Let me first differentiate two approaches to nonstandard analysis.  The first is the one I mentioned above, where you actually <em>construct</em> a field <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E%2A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^*' title='\mathbb{R}^*' class='latex' /> (although you need the axiom of choice to do it).  This is done entirely within ordinary mathematics.  Call this the semantic approach.</p>
<p>Another approach is the axiomatic approach.  A good example of this is <a href="http://www.math.princeton.edu/~Nelson/">Edward Nelson</a>&#8216;s <a href="http://www.math.princeton.edu/~Nelson/books/1.pdf">internal set theory</a>.  In this approach, you take an ordinary axiomatization of some part of mathematics (for example, ZFC), introduce a new predicate for being &#8220;standard&#8221; or &#8220;normal-sized&#8221;, and some axioms saying that there exist things which are not standard and how these things relate to everything else.  In the usual situation, a sentence which does not contain the predicate &#8220;standard&#8221; is provable in the new theory iff it&#8217;s provable in the old theory.  (This is the case with IST and ZFC.)</p>
<p>The axiomatic approach is the approach we&#8217;ll take here.  We&#8217;ll let our language <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BL%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathcal{L}' title='\mathcal{L}' class='latex' /> consist of a function symbol for each primitive recursive function and relation, together with a predicate <img src='http://l.wordpress.com/latex.php?latex=st&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='st' title='st' class='latex' /> and a constant <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' />.  Our axioms will be the following:</p>
<ol>
<li>If <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> is a true (in the natural numbers) first-order <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BL%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathcal{L}' title='\mathcal{L}' class='latex' />-sentence that does not include the new predicate <img src='http://l.wordpress.com/latex.php?latex=st&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='st' title='st' class='latex' />, then we take <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> as an axiom.</li>
<li>We take <img src='http://l.wordpress.com/latex.php?latex=%5Cneg+st%28%5Comega%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\neg st(\omega)' title='\neg st(\omega)' class='latex' /> as an axiom.</li>
<li>We take <img src='http://l.wordpress.com/latex.php?latex=%5Cforall+x%5C%2C+%28st%28x%29%5Crightarrow+%5Cforall+y%3Cx%5C%2Cst%28y%29%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\forall x\, (st(x)\rightarrow \forall y&lt;x\,st(y))' title='\forall x\, (st(x)\rightarrow \forall y&lt;x\,st(y))' class='latex' /> as an axiom.</li>
<li>We take <img src='http://l.wordpress.com/latex.php?latex=st%28x_1%29%5Cwedge+%5Ccdots+%5Cwedge+st%28x_k%29+%5Crightarrow+st%28f%28x_1%2C%5Cldots%2C+x_k%29%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='st(x_1)\wedge \cdots \wedge st(x_k) \rightarrow st(f(x_1,\ldots, x_k))' title='st(x_1)\wedge \cdots \wedge st(x_k) \rightarrow st(f(x_1,\ldots, x_k))' class='latex' /> to be an axiom for each <img src='http://l.wordpress.com/latex.php?latex=k&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='k' title='k' class='latex' />-ary primitive recursive function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f' title='f' class='latex' />.</li>
</ol>
<p>The interpretation of our sentences is that we are now quantifying over a domain which includes infinitely large natural numbers (of which <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' /> is an example) and that the predicate <img src='http://l.wordpress.com/latex.php?latex=st&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='st' title='st' class='latex' /> picks out those which are normal-sized.  However, since we are working within the axiomatic system, I will still refer to the domain we are quantifying over as <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' />.</p>
<p>Within the system, construct <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BZ%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Z}' title='\mathbb{Z}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Q}' title='\mathbb{Q}' class='latex' /> from <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' /> as usual.  We make the following definitions:</p>
<p>We say that an natural number <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> is unbounded if it is not standard (i.e., if <img src='http://l.wordpress.com/latex.php?latex=%5Cneg+st%28n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\neg st(n)' title='\neg st(n)' class='latex' />).  We say that an integer <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> is unbounded if <img src='http://l.wordpress.com/latex.php?latex=%7Cx%7C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='|x|' title='|x|' class='latex' /> is unbounded.  We say that a rational <img src='http://l.wordpress.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q' title='q' class='latex' /> is unbounded if the closest integer to it is unbounded.</p>
<p>Furthermore, we say that a rational number <img src='http://l.wordpress.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q' title='q' class='latex' /> is infinitesimal if it equals 0 or if <img src='http://l.wordpress.com/latex.php?latex=1%2Fq&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1/q' title='1/q' class='latex' /> is unbounded.  We say that <img src='http://l.wordpress.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q' title='q' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='r' title='r' class='latex' /> are infinitely close, written <img src='http://l.wordpress.com/latex.php?latex=q%5Csim+r&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q\sim r' title='q\sim r' class='latex' />, if <img src='http://l.wordpress.com/latex.php?latex=q-r&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q-r' title='q-r' class='latex' /> is infinitesimal.</p>
<p>Let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Q}&#039;' title='\mathbb{Q}&#039;' class='latex' /> be the set of rationals which are not infinite.  We can now do analysis on <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Q}&#039;' title='\mathbb{Q}&#039;' class='latex' />.  First of all, we can define continuity in a natural way: We say that <img src='http://l.wordpress.com/latex.php?latex=f%5Ccolon+%5Cmathbb%7BQ%7D%27%5Cto%5Cmathbb%7BQ%7D%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f\colon \mathbb{Q}&#039;\to\mathbb{Q}&#039;' title='f\colon \mathbb{Q}&#039;\to\mathbb{Q}&#039;' class='latex' /> is continuous if whenever <img src='http://l.wordpress.com/latex.php?latex=q+%5Csim+r&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q \sim r' title='q \sim r' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f%28q%29%5Csim+f%28r%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(q)\sim f(r)' title='f(q)\sim f(r)' class='latex' />.</p>
<p>We have the intermediate value theorem for <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Q}&#039;' title='\mathbb{Q}&#039;' class='latex' />: If <img src='http://l.wordpress.com/latex.php?latex=f%280%29+%3C0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(0) &lt;0' title='f(0) &lt;0' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=f%281%29+%3E0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(1) &gt;0' title='f(1) &gt;0' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f' title='f' class='latex' /> is continuous, then there is a <img src='http://l.wordpress.com/latex.php?latex=q%5Cin+%5B0%2C1%5D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q\in [0,1]' title='q\in [0,1]' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=f%28q%29%5Csim+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(q)\sim 0' title='f(q)\sim 0' class='latex' />. Proof:  Recall that <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' /> is a natural number.  Let <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j' title='j' class='latex' /> be the maximum natural number less than <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' />  such that <img src='http://l.wordpress.com/latex.php?latex=f%28j%2F%5Comega%29+%3C0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(j/\omega) &lt;0' title='f(j/\omega) &lt;0' class='latex' />.  (This is possible because there are only finitely many natural numbers less than any natural number, including <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' />!)  But then <img src='http://l.wordpress.com/latex.php?latex=f%28j%2F%5Comega%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(j/\omega)' title='f(j/\omega)' class='latex' /> must be infinitely close to <img src='http://l.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='{0}' title='{0}' class='latex' />, since by continuity <img src='http://l.wordpress.com/latex.php?latex=0+%3C+f%28%28j%2B1%29%2F%5Comega%29%5Csim+f%28j%2F%5Comega%29+%3C+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='0 &lt; f((j+1)/\omega)\sim f(j/\omega) &lt; 0' title='0 &lt; f((j+1)/\omega)\sim f(j/\omega) &lt; 0' class='latex' />.</p>
<p>We can also prove that any continuous function on <img src='http://l.wordpress.com/latex.php?latex=%7B%7D%5B0%2C1%5D%5Ccap%5Cmathbb%7BQ%7D%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='{}[0,1]\cap\mathbb{Q}&#039;' title='{}[0,1]\cap\mathbb{Q}&#039;' class='latex' /> attains a maximum (up to <img src='http://l.wordpress.com/latex.php?latex=%5Csim&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sim' title='\sim' class='latex' />) by essentially the same means: just consider the <img src='http://l.wordpress.com/latex.php?latex=j+%3C+%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j &lt; \omega' title='j &lt; \omega' class='latex' /> for which <img src='http://l.wordpress.com/latex.php?latex=f%28j%2F%5Comega%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(j/\omega)' title='f(j/\omega)' class='latex' /> is a maximum, which is again possible considering that there are only finitely many <img src='http://l.wordpress.com/latex.php?latex=j+%3C+%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j &lt; \omega' title='j &lt; \omega' class='latex' />.</p>
<p>Turning to differentiation, we may define <img src='http://l.wordpress.com/latex.php?latex=f%27%28x%29+%3D+y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f&#039;(x) = y' title='f&#039;(x) = y' class='latex' /> if for all non-zero infinitesimals <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='h' title='h' class='latex' />,</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle%7B%5Cfrac%7Bf%28x%2Bh%29+-+f%28x%29%7D%7Bh%7D%5Csim+y%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\displaystyle{\frac{f(x+h) - f(x)}{h}\sim y}' title='\displaystyle{\frac{f(x+h) - f(x)}{h}\sim y}' class='latex' /></p>
<p style="text-align:left;">(Note that the derivative is actually defined only up to <img src='http://l.wordpress.com/latex.php?latex=%5Csim&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sim' title='\sim' class='latex' />.)  We can then prove that the derivative of <img src='http://l.wordpress.com/latex.php?latex=x%5Ek&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x^k' title='x^k' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=kx%5E%7Bk-1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='kx^{k-1}' title='kx^{k-1}' class='latex' /> by letting <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='h' title='h' class='latex' /> be an arbitrary infinitesimal, expanding <img src='http://l.wordpress.com/latex.php?latex=%28x%2Bh%29%5Ek+-+x%5Ek&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(x+h)^k - x^k' title='(x+h)^k - x^k' class='latex' />, dividing by <img src='http://l.wordpress.com/latex.php?latex=h&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='h' title='h' class='latex' />, and noting that what results is <img src='http://l.wordpress.com/latex.php?latex=kx%5E%7Bk-1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='kx^{k-1}' title='kx^{k-1}' class='latex' /> plus an infinitesimal.</p>
<p style="text-align:left;">Avigad notes that we may continue by defining <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bexp%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{exp}' title='\text{exp}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Csin&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sin' title='\sin' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=%5Ccos&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\cos' title='\cos' class='latex' /> by taking an unbounded partial sum of the Taylor expansions, and that this is sufficient to prove all the basic properties.  He also cites an easy proof in this setting of the Cauchy-Peano theorem on the existence of solutions to differential equations.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/218/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/218/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/218/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/218/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/218/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/218/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/218/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/218/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/218/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/218/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/218/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/218/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/218/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/218/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=218&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/11/14/doing-calculus-on-the-rationals-with-the-help-of-nonstandard-analysis/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>Games Which are Impossible to Analyze</title>
		<link>http://xorshammer.com/2008/11/09/games-which-are-impossible-to-analyze/</link>
		<comments>http://xorshammer.com/2008/11/09/games-which-are-impossible-to-analyze/#comments</comments>
		<pubDate>Sun, 09 Nov 2008 14:38:35 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=214</guid>
		<description><![CDATA[In the last post, I mentioned the computational complexity of various games. To be explicit, we consider each &#8220;game&#8221; to actually be a sequence of games for . For example, would be checkers played on an board. The problem was then to analyze the computational complexity of the function which takes and tells you which [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=214&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In the last post, I mentioned the computational complexity of various games.  To be explicit, we consider each &#8220;game&#8221; <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' /> to actually be a sequence of games <img src='http://l.wordpress.com/latex.php?latex=%5C%7BG_n%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{G_n\}' title='\{G_n\}' class='latex' /> for <img src='http://l.wordpress.com/latex.php?latex=n%5Cin+%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in \mathbb{N}' title='n\in \mathbb{N}' class='latex' />.  For example, <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bcheckers%7D_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{checkers}_n' title='\text{checkers}_n' class='latex' /> would be checkers played on an <img src='http://l.wordpress.com/latex.php?latex=n%5Ctimes+n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\times n' title='n\times n' class='latex' /> board.   The problem was then to analyze the computational complexity of the function which takes <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> and tells you which player has a winning strategy and what the winning strategy is.  I&#8217;ll call that function the analysis function of <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' />.</p>
<p>Are there any games which can actually be played in the real world with an undecidable analysis function?  Robern Hearn, in the same <a href="http://www.swiss.ai.mit.edu/~bob/hearn-thesis-final.pdf">thesis</a> that I linked to last time, showed that the answer is yes.</p>
<p><span id="more-214"></span>In order to make sure that our games can be played in the real world, we&#8217;ll restrict our attention to games <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' /> where each <img src='http://l.wordpress.com/latex.php?latex=G_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G_n' title='G_n' class='latex' /> has only finitely many positions (that is, the board has only finitely many states it can be in).</p>
<p>First off, observe that if <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' /> is a game of perfect information, then it has a decidable analysis function.  By the fact that there are only finitely many positions in each <img src='http://l.wordpress.com/latex.php?latex=G_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G_n' title='G_n' class='latex' />, you can construct a finite game tree for <img src='http://l.wordpress.com/latex.php?latex=G_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G_n' title='G_n' class='latex' /> (finite because you can cut it off when positions repeat) and then induct up it to find out who has a winning strategy.  (You may want to think about this if you haven&#8217;t seen it before.  This is something like what is sometimes called Zermelo&#8217;s Theorem.)</p>
<p>OK, so let&#8217;s look at games <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' /> which are not of perfect information, i.e., each of the players have some private information.  Now it&#8217;s possible that no player has a winning strategy (for example, consider the two-player game where both players secretly choose either 0 or 1, and if the sum is odd then Player 1 wins, and if it&#8217;s even then Player 2 wins).  Even so, the question of which, if any, players have a winning strategy is well-defined.</p>
<p>It turns out that these also have decidable analysis functions.  I&#8217;ll also omit this proof, but it&#8217;s similar to the above except that, instead of a finite tree of positions, you can construct, for each player, the tree of subsets of positions that he considers possible at any given time.</p>
<p>So what hope is there?  I said that both games of perfect information and games of imperfect information have decidable analysis functions.  But there was an unstated assumption: that all players are playing against each other.  Hearn showed that a game with two players who are playing as a team against a third player but such that each of two team members has private information (which they are not allowed to communicate) can be undecidable.</p>
<p>The idea of the proof is as follows: Imagine that we have three players, Player 1, Player 2, and Player 3, and that Players 2 and 3 are playing as a team against Player 1.  In order to make the analysis function undecidable, we would like to do something like (say) in game <img src='http://l.wordpress.com/latex.php?latex=G_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G_n' title='G_n' class='latex' /> force Player 2 to emulate the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />th Turing machine and have his team wins if it halts.  But we can&#8217;t literally make the board an infinite tape and make Player 2&#8242;s legal moves be those simulating the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />th Turing machine, because then the board would have infinitely many positions.</p>
<p>But what if, instead of an infinite tape, the board was a single cell where Player 2 wrote out the computational history of the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />th Turing machine one character at a time?  That is, if <img src='http://l.wordpress.com/latex.php?latex=s_t&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s_t' title='s_t' class='latex' /> represents the (finite) contents of the tape at time <img src='http://l.wordpress.com/latex.php?latex=t&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='t' title='t' class='latex' />, he writes out the concatenation of all the <img src='http://l.wordpress.com/latex.php?latex=s_t&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s_t' title='s_t' class='latex' /> (with, say a special symbol # separating them).  This would be good, but how can we enforce this given that we want the set of legal moves to depend only on the board position?</p>
<p>The solution is the following:  We require both Player and Player 3 to write out the computational history of the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />th Turing machine in two separate &#8220;streams&#8221;.  Player 1 lets each of them know which stream he wants them to to write a character to at any given time.  Player 1 can then check them against each other by advancing one of them ahead of the other in one of the streams and checking, character by character, that the string that one of them is writing out is one step further along the computational history of the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />th Turing machine than the string that the other is writing out.  Since Player 2 and Player 3 do not know who is advanced relative to the other, they will not be able to cheat.</p>
<p>Hearn uses this argument to show that a game he calls Team Computation has an undecidable analysis function.  He then uses that to show that the team version of Constraint Logic (discussed in the previous post) is undecidable.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/214/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/214/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/214/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/214/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/214/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/214/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/214/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/214/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/214/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/214/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/214/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/214/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/214/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/214/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=214&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/11/09/games-which-are-impossible-to-analyze/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>How to Show that Games are Hard</title>
		<link>http://xorshammer.com/2008/11/03/how-to-show-that-games-are-hard/</link>
		<comments>http://xorshammer.com/2008/11/03/how-to-show-that-games-are-hard/#comments</comments>
		<pubDate>Tue, 04 Nov 2008 02:35:43 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=204</guid>
		<description><![CDATA[Peg Solitaire is a pretty popular game, often found in restaurants (including Cracker Barrel, if I remember correctly). It&#8217;s also NP-complete (by which I mean determining a winning strategy given the initial set-up is an NP-complete problem). You may have also heard of computational complexity results for Minesweeper (see here, for example). There are a [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=204&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/Peg_solitaire">Peg Solitaire</a> is a pretty popular game, often found in restaurants (including Cracker Barrel, if I remember correctly).  It&#8217;s also NP-complete (by which I mean determining a winning strategy given the initial set-up is an NP-complete problem).  You may have also heard of computational complexity results for Minesweeper (see <a href="http://for.mat.bham.ac.uk/R.W.Kaye/minesw/ordmsw.htm">here</a>, for example).  There are a number of other results showing that various popular games are complete for some complexity class.</p>
<p>But what if you come across a new game, which no computer scientist has heard of yet?  Well, you&#8217;re in luck, as <a href="http://www.dartmouth.edu/~rah/">Robert Hearn</a>, in his <a href="http://www.swiss.ai.mit.edu/~bob/hearn-thesis-final.pdf">thesis</a>, formulated a framework called Constraint Logic intended to make it easy to prove complexity results for games.</p>
<p><span id="more-204"></span>Games are classified by Hearn by whether they are zero-, one-, or two-player, and further by whether or not their length is polynomially bounded by their initial setup.  (Hearn also covers team games, which I&#8217;ll cover in a later post.)</p>
<p>Recall that by saying a game is in a complexity class I mean that the problem of, given an initial setup for the game, determining whether or not there is a winning strategy for a given player (if any) and, if so, what that strategy is, is in that complexity class.  Thus, when we talk about a game being in a particular complexity class, we are always implicitly talking about a way of generalizing the game to different initial setups.  For example, the statement that Go is in EXPTIME means that the problem of, given <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />, determining a winning strategy for Go on an <img src='http://l.wordpress.com/latex.php?latex=n+%5Ctimes+n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n \times n' title='n \times n' class='latex' /> board is in EXPTIME.</p>
<ul>
<li>An example of a zero-player bounded-length game is the game <a href="http://en.wikipedia.org/wiki/Clock_patience">Clock Solitaire</a>.  Once the cards are dealt, the game is completely deterministic (this makes it zero-player), and furthermore, each card is turned over at most once (this makes it bounded).  Games of this sort are in the complexity class P.</li>
<li>An example of a zero-player unbounded game is <a href="http://en.wikipedia.org/wiki/Conway%27s_Game_of_Life">Conway&#8217;s Game of Life</a>.  Games of this sort tend to be in PSPACE.  Conway&#8217;s game of life is PSPACE-complete.</li>
<li>An example of a one-player bounded-length game is Peg Solitaire, mentioned above.  It is bounded since each move removes a peg, and there is no way to get a peg back.  Games of this sort are in NP.  Peg Solitaire, as mentioned above, is NP-complete.</li>
<li>Examples of one-player unbounded games include <a href="http://en.wikipedia.org/wiki/Rubik%27s_cube">Rubik&#8217;s Cube</a> and <a href="http://www.webdoodles.org/pages/rush_hour.htm">Rush Hour</a>.  Games of this sort tend to be in PSPACE.  Rush Hour is PSPACE-complete.  (There are a couple of different ways of generalizing the Rubik&#8217;s Cube to higher dimensions, but I&#8217;m not aware of complexity results for any of them.  If you know, please leave a comment.)</li>
<li>Examples of two-player bounded-length games include <a href="http://en.wikipedia.org/wiki/Tic_tac_toe">Tic-Tac-Toe</a>, John Nash&#8217;s game <a href="http://en.wikipedia.org/wiki/Hex_(board_game)">Hex</a>, and Othello.  In both cases, the game is of bounded-length since once a player marks a space on the board, it can never be unmarked or marked again.  Games of this sort are in PSPACE.  Hex and Othello are PSPACE-complete.  (As with the Rubik&#8217;s cube, there are a couple of different ways to generalize Tic-Tac-Toe to higher dimensions, but I don&#8217;t know of any complexity results for them.  If you know, please leave a comment.)</li>
<li>Examples of two-player unbounded games include many familiar games such as checkers, chess, and go.  Games of this type tend to be in EXPTIME.  Each of the three games mentioned is EXPTIME-complete.</li>
<li>Hearn also defines &#8220;team&#8221; games, which I will skip in this post.</li>
</ul>
<p>So what is constraint logic?  It&#8217;s a single setup which naturally gives examples of games of each of the six types listed above, and furthermore is in each case complete with respect to the associated complexity class.  Thus, if you can reduce constraint logic to whichever game you&#8217;re interested in, you have a completeness result.</p>
<p>The setup for constraint logic is as follows:  The game board is an undirected graph with weights of 1 or 2 on the edges and non-negative integers assigned to the vertices (these non-negative integers are called the <em>minimum inflow</em> of the vertex).  A position of the board is an assignment of direction to each edge.  The position is legal if, for each vertex, the sum of all the weights of the edges directed towards the vertex is at least the minimum inflow of the vertex.  A move is generally reversing the direction of an edge (to give another legal position), and the goal of the game is generally to reverse the direction of a specified edge.</p>
<p>How does this give us each of the six types?</p>
<ul>
<li>To form a zero-player bounded-length game:  Suppose given a board position <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' />, and a goal edge <img src='http://l.wordpress.com/latex.php?latex=E&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E' title='E' class='latex' /> (i.e., <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' /> is a directed graph with weights assigned to the edges and non-negative integers assigned to the vertices, and <img src='http://l.wordpress.com/latex.php?latex=E&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E' title='E' class='latex' /> is an edge of <img src='http://l.wordpress.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G' title='G' class='latex' />, and the game is won if the direction of <img src='http://l.wordpress.com/latex.php?latex=E&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E' title='E' class='latex' /> is reversed).  Then let <img src='http://l.wordpress.com/latex.php?latex=R_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='R_0' title='R_0' class='latex' /> be the set of edges <img src='http://l.wordpress.com/latex.php?latex=E_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E_0' title='E_0' class='latex' /> such that it&#8217;s legal to reverse the direction of <img src='http://l.wordpress.com/latex.php?latex=E_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E_0' title='E_0' class='latex' />.  Reverse the direction of all of these edges to get a new position <img src='http://l.wordpress.com/latex.php?latex=G_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G_1' title='G_1' class='latex' />.  Let <img src='http://l.wordpress.com/latex.php?latex=R_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='R_1' title='R_1' class='latex' /> be the set of all edges <img src='http://l.wordpress.com/latex.php?latex=E_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E_1' title='E_1' class='latex' /> such that it&#8217;s legal to reverse the direction of <img src='http://l.wordpress.com/latex.php?latex=E_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E_1' title='E_1' class='latex' /> <em>and </em><img src='http://l.wordpress.com/latex.php?latex=E_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E_1' title='E_1' class='latex' /> <em>hasn&#8217;t been reversed before. </em>Reverse all of those, and so on.  Since an edge is reversed at most once, the game is bounded length.  (Notice that it might be the case that in reversing some edge in <img src='http://l.wordpress.com/latex.php?latex=R_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='R_i' title='R_i' class='latex' />, you make it so that some other edge in <img src='http://l.wordpress.com/latex.php?latex=R_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='R_i' title='R_i' class='latex' /> can no longer be legally reversed.  This can indeed happen, but it is possible to restrict to a subclass of games where this never happens).  This game is P-complete.</li>
<li>To form a zero-player unbounded length, you essentially do the same thing as above, but remove the restriction that an edge can be reversed at most once.  There are some technicalities however, which are too ensure that every time you try to reverse an edge, it is a legal move.  This game is PSPACE-complete</li>
<li>To form a one-player bounded game: On each turn the player reverses the direction of edge (to form a legal position).  Each edge can be reversed at most once.  The player wins if he is able to reverse the direction of the pre-specified goal edge.  This game is NP-complete.</li>
<li>To form a one-player unbounded game:  Just as above, except that each edge may be reversed as many times as necessary.  This game is PSPACE-complete.</li>
<li>To form a two-player bounded-game:  Each player has a set of edges which only they may reverse.  Each player also has their own target edge.  They take turns reversing edges and each edge may be reversed at most once.  The first player to reverse their target edge wins.  This game is PSPACE-complete</li>
<li>To form a two-player unbounded game:  Just as above, except that each edge may be reversed an unlimited number of times.  This game is EXPTIME-complete.</li>
</ul>
<p>Part II of Hearn&#8217;s thesis (linked above) contains a large number of complexity proofs for games based on these results.  Furthermore, he has results which make this endeavor easier: a priori, you would have to reduce any graph to the game in question to prove that it&#8217;s complete with respect to the appropriate complexity class.  Hearn proves that it suffices to reduce planar graphs which consist solely of AND vertices, which are those of minimum inflow 2 and with exactly three adjacent edges, one of which has weight 2 and two of which have weight 1, and OR vertices, which are those of minimum inflow 2 and with exactly three adjacent edges, all of which have weight 2.  (You can interpret games built up from these vertices as a kind of a non-deterministic circuit computation, which is where they get their name.)</p>
<p>For example, consider sliding block puzzles: In a sliding block puzzle, you are given a number of rectangular blocks in a rectangular box.  The goal is to slide the blocks around to get one particular block in one particular case.  Using Constraint Logic, Hearn, together with his advisor, <a href="http://erikdemaine.org/">Erik Demaine</a> (who is amazing, by the way), showed that this was PSPACE-complete.  The proof is shown in this figure, reproduced from <a href="http://erikdemaine.org/papers/CL_Complexity2008/">this paper by Demaine and Hearn</a>:</p>
<p><a href="http://xorshammer.files.wordpress.com/2008/11/constraint.png"><img class="aligncenter size-full wp-image-206" title="constraint" src="http://xorshammer.files.wordpress.com/2008/11/constraint.png?w=216&#038;h=124" alt="constraint" width="216" height="124" /></a>On the left is the translation of an AND vertex; on the riht is the translation of an OR vertex.  In each case, the three yellow blocks on the border represent the three edges adjacent to the vertex.  Reversing the direction of an edge corresponds to either pushing the block in to the square, or pulling it out.  The squares are designed so that it is possible to slide the blocks in the interior around so that pushing a yellow block inside is possible iff the corresponding edge reversal is legal.</p>
<p>For more information, see Hearn&#8217;s thesis and the paper by Demaine and Hearn linked above, as well as <a href="http://erikdemaine.org/papers/NCL_TCS/">this additional paper</a> by Demaine and Hearn on the subject.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/204/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/204/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/204/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=204&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/11/03/how-to-show-that-games-are-hard/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2008/11/constraint.png" medium="image">
			<media:title type="html">constraint</media:title>
		</media:content>
	</item>
		<item>
		<title>Another Puzzle in Recursion Theory: n-Enumerable Sets</title>
		<link>http://xorshammer.com/2008/10/26/another-puzzle-in-recursion-theory-n-enumerable-sets/</link>
		<comments>http://xorshammer.com/2008/10/26/another-puzzle-in-recursion-theory-n-enumerable-sets/#comments</comments>
		<pubDate>Sun, 26 Oct 2008 17:11:40 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Puzzles]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=198</guid>
		<description><![CDATA[We can think of a computably enumerable (or c.e.) set as a bag which some computer program puts more and more numbers into over time. The set then consists of all numbers which are in the bag from some point on (i.e., the numbers which are eventually put in the bag by the program). Suppose [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=198&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>We can think of a computably enumerable (or c.e.) set as a bag which some computer program puts more and more numbers into over time.  The set then consists of all numbers which are in the bag from some point on (i.e., the numbers which are eventually put in the bag by the program).</p>
<p>Suppose that we relax the restrictions on the program, and we allow it to take a number out of the bag that it has put in (but once the program has done that, the number stays out forever).  We call the set of numbers which are in the bag from some point on (i.e., the set of numbers which are put in the bag and never taken out) in a procedure of this sort a 2-c.e. set.</p>
<p>We can analogously let an <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-c.e. set be one given by a program which can, for each number, &#8220;toggle&#8221; that number&#8217;s status up to and including <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> times if it likes.</p>
<p>The puzzle is then to find, for each <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />, an example of a set which is <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-c.e. but not <img src='http://l.wordpress.com/latex.php?latex=n-1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n-1' title='n-1' class='latex' />-c.e.</p>
<p><span id="more-198"></span></p>
<p>Here is an example of a set which is <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-c.e. but not <img src='http://l.wordpress.com/latex.php?latex=n-1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n-1' title='n-1' class='latex' />-c.e.:  The program simulates all Turing machines simultaneously (by dovetailing, this can still be done with a finite algorithm).</p>
<p>For each <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' /> it does the following: If the <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' />th Turing machine halts, it puts <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' /> in the bag.  In this case, the program also checks to see what number the <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' />th Turing machine output when it halted; call it <img src='http://l.wordpress.com/latex.php?latex=m_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m_1' title='m_1' class='latex' />.  If eventually the <img src='http://l.wordpress.com/latex.php?latex=m_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m_1' title='m_1' class='latex' />th Turing machine halts, the program takes <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' /> out of the bag.  In this case, the program also checks to see what number the <img src='http://l.wordpress.com/latex.php?latex=m_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m_1' title='m_1' class='latex' />th Turing machine output when it halted; call it <img src='http://l.wordpress.com/latex.php?latex=m_2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m_2' title='m_2' class='latex' />.  If eventually the <img src='http://l.wordpress.com/latex.php?latex=m_2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m_2' title='m_2' class='latex' />th Turing machine halts, the program puts <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' /> back in the bag.  And so on, up to a maximum of <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> times.</p>
<p>Call that set <img src='http://l.wordpress.com/latex.php?latex=S_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S_n' title='S_n' class='latex' />.  Clearly it&#8217;s <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-c.e.  We have to show that it&#8217;s not <img src='http://l.wordpress.com/latex.php?latex=n-1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n-1' title='n-1' class='latex' />-c.e.</p>
<p>First observe that each <img src='http://l.wordpress.com/latex.php?latex=S_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S_n' title='S_n' class='latex' /> is complete with respect to <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-c.e. sets.  This means that if <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-c.e., then there is a computable function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f' title='f' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=m%5Cin+S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\in S' title='m\in S' class='latex' /> iff <img src='http://l.wordpress.com/latex.php?latex=f%28m%29%5Cin+S_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(m)\in S_n' title='f(m)\in S_n' class='latex' /> for all <img src='http://l.wordpress.com/latex.php?latex=m%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\in\mathbb{N}' title='m\in\mathbb{N}' class='latex' />.</p>
<p>Now observe that if <img src='http://l.wordpress.com/latex.php?latex=S_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S_n' title='S_n' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=n-1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n-1' title='n-1' class='latex' />-c.e., then its complement <img src='http://l.wordpress.com/latex.php?latex=%5Cbar%7BS_n%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\bar{S_n}' title='\bar{S_n}' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-c.e.  We can find an <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-c.e. presentation of <img src='http://l.wordpress.com/latex.php?latex=%5Cbar%7BS_n%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\bar{S_n}' title='\bar{S_n}' class='latex' /> by letting our computer program start by putting all numbers in the bag and then doing the opposite of what the <img src='http://l.wordpress.com/latex.php?latex=n-1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n-1' title='n-1' class='latex' />-c.e. presentation of <img src='http://l.wordpress.com/latex.php?latex=S_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S_n' title='S_n' class='latex' /> does.</p>
<p>Combining these two observations, there is a computable function <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f' title='f' class='latex' /> such that for all <img src='http://l.wordpress.com/latex.php?latex=m%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\in\mathbb{N}' title='m\in\mathbb{N}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=m%5Cin+S_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\in S_n' title='m\in S_n' class='latex' /> iff <img src='http://l.wordpress.com/latex.php?latex=f%28m%29+%5Cnotin+S_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(m) \notin S_n' title='f(m) \notin S_n' class='latex' />.  By Kleene&#8217;s Fixed Point Theorem (also known as <a href="http://en.wikipedia.org/wiki/Kleene%27s_recursion_theorem">Kleene&#8217;s Recursion Theorem</a>), there is an <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' /> such the <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' />th Turing machine behaves the same as the <img src='http://l.wordpress.com/latex.php?latex=f%28m%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(m)' title='f(m)' class='latex' />th Turing machine.  For this particular <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' /> we then have that <img src='http://l.wordpress.com/latex.php?latex=m%5Cin+S_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\in S_n' title='m\in S_n' class='latex' /> iff <img src='http://l.wordpress.com/latex.php?latex=m%5Cnotin+S_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\notin S_n' title='m\notin S_n' class='latex' />, which is a contradiction.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/198/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/198/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/198/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/198/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/198/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/198/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/198/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/198/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/198/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/198/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/198/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/198/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/198/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/198/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=198&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/10/26/another-puzzle-in-recursion-theory-n-enumerable-sets/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>Two Puzzles in Recursion Theory: Verbose Sets and Terse Sets</title>
		<link>http://xorshammer.com/2008/10/16/two-puzzles-in-recursion-theory-verbose-sets-and-terse-sets/</link>
		<comments>http://xorshammer.com/2008/10/16/two-puzzles-in-recursion-theory-verbose-sets-and-terse-sets/#comments</comments>
		<pubDate>Thu, 16 Oct 2008 23:16:05 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Puzzles]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=194</guid>
		<description><![CDATA[Let be the set of all such that the th Turing machine halts. (For these puzzles, we will assume that Turing machines are always run on a blank initial state, i.e., they take no input.) Recall that is computably enumerable, but not decidable. Puzzle #1. Describe a winning strategy for the following game: You are [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=194&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Let <img src='http://l.wordpress.com/latex.php?latex=K&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='K' title='K' class='latex' /> be the set of all <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> such that the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />th Turing machine halts.  (For these puzzles, we will assume that Turing machines are always run on a blank initial state, i.e., they take no input.) Recall that <img src='http://l.wordpress.com/latex.php?latex=K&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='K' title='K' class='latex' /> is computably enumerable, but not decidable.</p>
<p><strong>Puzzle #1.</strong> Describe a winning strategy for the following game: You are given three numbers <img src='http://l.wordpress.com/latex.php?latex=x_1%2C+x_2%2C+x_3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1, x_2, x_3' title='x_1, x_2, x_3' class='latex' />.  You must correctly say for each number whether or not it is in <img src='http://l.wordpress.com/latex.php?latex=K&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='K' title='K' class='latex' />.  You are allowed to ask (and receive a truthful answer to) two questions of the form &#8220;Is <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> in K?&#8221; for any <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />.</p>
<p><strong>Puzzle #2.</strong> Show that there is <em>no</em> winning strategy for the game which is the same as that in Puzzle #1 except you are given two numbers and may ask only one question.  (Even stronger, show that if <img src='http://l.wordpress.com/latex.php?latex=S%5Csubset+%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S\subset \mathbb{N}' title='S\subset \mathbb{N}' class='latex' /> is a set such that you <em>can</em> win that game, then <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> must be decidable.)</p>
<p><span id="more-194"></span></p>
<p>These puzzles are special cases of more general questions answered in <a href="http://knight.cis.temple.edu/~beigel/papers/bggo-queries-ic.html">Terse sets, superterse sets, and verbose sets</a> by Richard Beigel, William Gasarch, John Gill, and James Owings.  I also suggest looking at Richard Beigel&#8217;s <a href="http://www.cis.temple.edu/~beigel/long.html">page of online papers</a>, which has a lot of interesting stuff.</p>
<p>I read about this first in Piergiorgio Odifreddi&#8217;s book &#8220;Classical Recursion Theory, Volume 1.&#8221;</p>
<p><strong>Answer #1.</strong> The first thing to do is to find out how many of <img src='http://l.wordpress.com/latex.php?latex=x_1%2C+x_2%2C+x_3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1, x_2, x_3' title='x_1, x_2, x_3' class='latex' /> are in <img src='http://l.wordpress.com/latex.php?latex=K&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='K' title='K' class='latex' />.  To do this, let <img src='http://l.wordpress.com/latex.php?latex=n_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n_1' title='n_1' class='latex' /> be such that the <img src='http://l.wordpress.com/latex.php?latex=n_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n_1' title='n_1' class='latex' />th Turing machine simulates the <img src='http://l.wordpress.com/latex.php?latex=x_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1' title='x_1' class='latex' />th, <img src='http://l.wordpress.com/latex.php?latex=x_2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_2' title='x_2' class='latex' />th and <img src='http://l.wordpress.com/latex.php?latex=x_3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_3' title='x_3' class='latex' />th Turing machines in parallel, and halts after any two of them halt.</p>
<p>By asking if <img src='http://l.wordpress.com/latex.php?latex=n_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n_1' title='n_1' class='latex' /> is in <img src='http://l.wordpress.com/latex.php?latex=K&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='K' title='K' class='latex' />, you will find out either that there are two or more or that there are one or less of <img src='http://l.wordpress.com/latex.php?latex=x_1%2C+x_2%2C+x_3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1, x_2, x_3' title='x_1, x_2, x_3' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=K&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='K' title='K' class='latex' />.   Use a similar second question to find out exactly how many of <img src='http://l.wordpress.com/latex.php?latex=x_1%2C+x_2%2C+x_3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1, x_2, x_3' title='x_1, x_2, x_3' class='latex' /> are in <img src='http://l.wordpress.com/latex.php?latex=K&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='K' title='K' class='latex' />.</p>
<p>Once you know how many of <img src='http://l.wordpress.com/latex.php?latex=x_1%2C+x_2%2C+x_3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1, x_2, x_3' title='x_1, x_2, x_3' class='latex' /> are in <img src='http://l.wordpress.com/latex.php?latex=K&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='K' title='K' class='latex' />, just run the <img src='http://l.wordpress.com/latex.php?latex=x_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1' title='x_1' class='latex' />th, <img src='http://l.wordpress.com/latex.php?latex=x_2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_2' title='x_2' class='latex' />th, and <img src='http://l.wordpress.com/latex.php?latex=x_3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_3' title='x_3' class='latex' />th Turing machines in parallel, and wait until all the ones that are going to halt do halt.  Then you will know which of them are in <img src='http://l.wordpress.com/latex.php?latex=K&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='K' title='K' class='latex' />.</p>
<p><strong>Answer #2</strong>. Suppose that you have a winning strategy, and I&#8217;ll show how to compute <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' />.  Let <img src='http://l.wordpress.com/latex.php?latex=f%5Ex_%7B%5Ctext%7Bno%7D%7D%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f^x_{\text{no}}(x,y)' title='f^x_{\text{no}}(x,y)' class='latex' /> be the function which, given <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y' title='y' class='latex' />, returns the guess for whether or not <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> is in <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' />, supposing that you received a &#8220;no&#8221; answer to whatever question your strategy decided to ask.  Similarly define <img src='http://l.wordpress.com/latex.php?latex=f%5Ey_%7B%5Ctext%7Bno%7D%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f^y_{\text{no}}' title='f^y_{\text{no}}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f%5Ex_%7B%5Ctext%7Byes%7D%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f^x_{\text{yes}}' title='f^x_{\text{yes}}' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=f%5Ey_%7B%5Ctext%7Byes%7D%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f^y_{\text{yes}}' title='f^y_{\text{yes}}' class='latex' />.</p>
<p>Either it is the case that for all <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> there is a <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y' title='y' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=f_%7B%5Ctext%7Bno%7D%7D%5Ex%28x%2Cy%29+%3D+f_%7B%5Ctext%7Byes%7D%7D%5Ex%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f_{\text{no}}^x(x,y) = f_{\text{yes}}^x(x,y)' title='f_{\text{no}}^x(x,y) = f_{\text{yes}}^x(x,y)' class='latex' /> or it isn&#8217;t.  Suppose first that it is.  Then given <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' />, we may compute whether or not is in <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> by searching for such a <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y' title='y' class='latex' /> and computing <img src='http://l.wordpress.com/latex.php?latex=f_%7B%5Ctext%7Bno%7D%7D%5Ex%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f_{\text{no}}^x(x,y)' title='f_{\text{no}}^x(x,y)' class='latex' /> (which equals <img src='http://l.wordpress.com/latex.php?latex=f_%7B%5Ctext%7Byes%7D%7D%5Ex%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f_{\text{yes}}^x(x,y)' title='f_{\text{yes}}^x(x,y)' class='latex' />).  This must be the correct guess since either &#8220;yes&#8221; or &#8220;no&#8221; must be the correct answer to whatever question the strategy asked.</p>
<p>Now suppose that there is an <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> such that for all <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y' title='y' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f_%7B%5Ctext%7Bno%7D%7D%5Ex%28x%2Cy%29%5Cne+f_%7B%5Ctext%7Byes%7D%7D%5Ex%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f_{\text{no}}^x(x,y)\ne f_{\text{yes}}^x(x,y)' title='f_{\text{no}}^x(x,y)\ne f_{\text{yes}}^x(x,y)' class='latex' />.  Since <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> is just a single number, we can assume that we know whether or not it&#8217;s in <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' />.  Then, given any <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y' title='y' class='latex' />, since <img src='http://l.wordpress.com/latex.php?latex=f_%7B%5Ctext%7Bno%7D%7D%5Ex%28x%2Cy%29%5Cne+f_%7B%5Ctext%7Byes%7D%7D%5Ex%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f_{\text{no}}^x(x,y)\ne f_{\text{yes}}^x(x,y)' title='f_{\text{no}}^x(x,y)\ne f_{\text{yes}}^x(x,y)' class='latex' /> we know which of them is correct, hence we know whether &#8220;yes&#8221; or &#8220;no&#8221; is the correct answer to the question that the strategy would pose.  Hence we can compute its correct guess as to whether or not <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y' title='y' class='latex' /> is in <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' />.</p>
<p><strong>Additional Results.</strong> The paper linked above gives generalizations of both puzzles: You can find out whether any set of <img src='http://l.wordpress.com/latex.php?latex=2%5En+-+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2^n - 1' title='2^n - 1' class='latex' /> numbers is in <img src='http://l.wordpress.com/latex.php?latex=K&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='K' title='K' class='latex' /> by asking <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> questions (by the same strategy of determining how many of them are in <img src='http://l.wordpress.com/latex.php?latex=K&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='K' title='K' class='latex' />), and, for any <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' />, if you can determine whether any set of <img src='http://l.wordpress.com/latex.php?latex=2%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2^n' title='2^n' class='latex' /> numbers is in <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> by asking <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> questions then <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> is decidable.</p>
<p>The authors call a set <em>verbose</em> if it is such that you can determine whether any set of <img src='http://l.wordpress.com/latex.php?latex=2%5En+-1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2^n -1' title='2^n -1' class='latex' /> numbers is in that set by asking <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> questions.  They call it <em>terse</em> if you need <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> questions to determine whether or not <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> numbers are in the set.  They are show a number of interesting results about these, mainly along the lines that (very roughly) lots of both kinds exist, and in lots of different places.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/194/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/194/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/194/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/194/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/194/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/194/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/194/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/194/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=194&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/10/16/two-puzzles-in-recursion-theory-verbose-sets-and-terse-sets/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>When are the Real Numbers Necessary?</title>
		<link>http://xorshammer.com/2008/10/13/when-are-the-real-numbers-necessary/</link>
		<comments>http://xorshammer.com/2008/10/13/when-are-the-real-numbers-necessary/#comments</comments>
		<pubDate>Tue, 14 Oct 2008 02:44:18 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=184</guid>
		<description><![CDATA[The natural numbers can all be finitely represented, as can the rational numbers. The real numbers, however, cannot be so represented and require some notion of &#8220;infinity&#8221; to define. This makes it both computationally and philosophically interesting to determine for what purposes you need the real numbers, and for what purposes you need only the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=184&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The natural numbers can all be finitely represented, as can the rational numbers.  The real numbers, however, cannot be so represented and require some notion of &#8220;infinity&#8221; to define.  This makes it both computationally and philosophically interesting to determine for what purposes you need the real numbers, and for what purposes you need only the rationals.</p>
<p>It&#8217;s pretty clear that spatial concepts having to do with distances and rotation require the real numbers.  For example, if we took <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D%5E2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Q}^2' title='\mathbb{Q}^2' class='latex' /> as our model of the plane, the distance from <img src='http://l.wordpress.com/latex.php?latex=%280%2C0%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(0,0)' title='(0,0)' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%281%2C1%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(1,1)' title='(1,1)' class='latex' /> would not be rational, and we would not be able to rotate the point <img src='http://l.wordpress.com/latex.php?latex=%281%2C0%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(1,0)' title='(1,0)' class='latex' /> about the point <img src='http://l.wordpress.com/latex.php?latex=%280%2C0%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(0,0)' title='(0,0)' class='latex' /> by most angles.</p>
<p>But I always implicitly thought that spatial notions not depending on distances or angles required only the rationals. It turns out that I was wrong: there are spatial notions not depending on distances or angles which differ depending on whether you take space to be <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Q}^n' title='\mathbb{Q}^n' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^n' title='\mathbb{R}^n' class='latex' />.  The fact that I was wrong follows from a theorem of Micha Perles which is very famous in combinatorics, but which I only found out about recently.</p>
<p><span id="more-184"></span></p>
<p>I found out because the combinatorialist <a href="http://www.math.umn.edu/~armstron/">Drew Armstrong</a> told me about it, and he referred me to the online book <a href="http://www.math.umn.edu/~pak/book.htm">Lectures on Discrete and Polyhedral Geometry</a> by <a href="http://www.math.umn.edu/~pak/">Igor Pak</a>.</p>
<p>Actually, the fact that I was wrong follows just from a lemma in the proof of Perles&#8217;s result, which I will state before telling you what Perles&#8217;s main result was.</p>
<p>Consider the following system of points and lines (the image is stolen from Pak&#8217;s manuscript):</p>
<p><a href="http://xorshammer.files.wordpress.com/2008/10/reals1.png"><img class="aligncenter size-full wp-image-185" title="reals1" src="http://xorshammer.files.wordpress.com/2008/10/reals1.png?w=273&#038;h=195" alt="" width="273" height="195" /></a></p>
<p>The lemma is then that while there is a collinearity- and noncollinearity-preserving embedding of this diagram into <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^2' title='\mathbb{R}^2' class='latex' />, there is not one into <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BQ%7D%5E2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Q}^2' title='\mathbb{Q}^2' class='latex' />.  Note that the question of a collinearity- and noncollinearity-preserving embedding of the diagram says nothing about angles or distances.  The proof is simply to assume that there is a rational embedding, then to find a rational transformation of the configuration to one where you know that one of the points has an irrational coordinate.  This proof appears on page 108 of Pak&#8217;s book.</p>
<p>Perles&#8217;s main theorem is the following, and I think it&#8217;s quite striking: A <em>polytope</em> is the convex hull of a finite set of points in some <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^n' title='\mathbb{R}^n' class='latex' />, where we consider two polytopes equivalent if they are combinatorially equivalent: i.e., if there is a bijection between the two sets of vertices such that if one pair of vertices has an edge between them, the corresponding pair does as well, etc.  Then for all dimensions <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> greater than 3, there is an <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-dimensional polytope which is <em>not</em> equivalent to one which is the convex hull of a set of points with only rational coordinates.</p>
<p>The discussion of this in Pak&#8217;s book is in Part I, Section 12.5.</p>
<p>Edit: I removed a paragraph on planar graphs because it didn&#8217;t really fit the article, and I took out the phrase &#8220;purely combinatorial property,&#8221; which was misleading and probably incorrect.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/184/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/184/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/184/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/184/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/184/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/184/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/184/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=184&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/10/13/when-are-the-real-numbers-necessary/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2008/10/reals1.png" medium="image">
			<media:title type="html">reals1</media:title>
		</media:content>
	</item>
		<item>
		<title>What Would the World Look Like if Everything was Computable?: An Introduction to Hyland&#8217;s Effective Topos</title>
		<link>http://xorshammer.com/2008/10/13/what-would-the-world-look-like-if-everything-was-computable-an-introduction-to-hylands-effective-topos/</link>
		<comments>http://xorshammer.com/2008/10/13/what-would-the-world-look-like-if-everything-was-computable-an-introduction-to-hylands-effective-topos/#comments</comments>
		<pubDate>Tue, 14 Oct 2008 00:48:39 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Toposes]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=177</guid>
		<description><![CDATA[Suppose that we wanted to construct a mathematical universe where all objects were computable in some sense. How would we do it? Well, we could certainly allow the set into our universe: natural numbers are the most basic computational objects there are. (Notation: I&#8217;ll use to refer to when we&#8217;re considering it as part of [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=177&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Suppose that we wanted to construct a mathematical universe where all objects were computable in some sense.  How would we do it?</p>
<p>Well, we could certainly allow the set <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' /> into our universe: natural numbers are the most basic computational objects there are.  (Notation: I&#8217;ll use <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N' title='N' class='latex' /> to refer to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' /> when we&#8217;re considering it as part of the universe we&#8217;ll building, and just <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' /> when we&#8217;re talking about the set of natural numbers in the &#8220;real&#8221; world.)  What should we take as our set of functions <img src='http://l.wordpress.com/latex.php?latex=N%5EN&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N^N' title='N^N' class='latex' /> from <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N' title='N' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N' title='N' class='latex' />?  Since we want to admit only computable things, we should let <img src='http://l.wordpress.com/latex.php?latex=N%5EN&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N^N' title='N^N' class='latex' /> be the set of computable functions from <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' />, which we can represent non-uniquely by their indices (i.e., by the programs which compute them).</p>
<p>(For clarity, I&#8217;ll use the following notation for computable functions: <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e' title='\phi_e' class='latex' /> denotes the partial function from <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' /> computed by the <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e' title='e' class='latex' />th Turing machine. Given <img src='http://l.wordpress.com/latex.php?latex=n%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in\mathbb{N}' title='n\in\mathbb{N}' class='latex' />, it is possible that the computation <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(n)' title='\phi_e(n)' class='latex' /> never halts; in that case, I&#8217;ll write <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28n%29%7B%5Cuparrow%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(n){\uparrow}' title='\phi_e(n){\uparrow}' class='latex' /> and say that <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(n)' title='\phi_e(n)' class='latex' /> is undefined.  If it does halt, I&#8217;ll write <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28n%29%7B%5Cdownarrow%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(n){\downarrow}' title='\phi_e(n){\downarrow}' class='latex' /> and say that <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(n)' title='\phi_e(n)' class='latex' /> is defined.  If it halts, then it yields an output <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' />.  To indicate what it is, I&#8217;ll write <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28n%29+%3D+m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(n) = m' title='\phi_e(n) = m' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28n%29%7B%5Cdownarrow%7D+%3D+m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(n){\downarrow} = m' title='\phi_e(n){\downarrow} = m' class='latex' />.)</p>
<p>So, we&#8217;ve decided that <img src='http://l.wordpress.com/latex.php?latex=N%5EN&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N^N' title='N^N' class='latex' /> should equal <img src='http://l.wordpress.com/latex.php?latex=%5C%7Be+%5Cmid+%5Cforall+n%5Cin%5Cmathbb%7BN%7D%5C%2C+%5Cphi_e%28n%29%7B%5Cdownarrow%7D%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{e \mid \forall n\in\mathbb{N}\, \phi_e(n){\downarrow}\}' title='\{e \mid \forall n\in\mathbb{N}\, \phi_e(n){\downarrow}\}' class='latex' />.  What should <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B%5Cleft%28N%5EN%5Cright%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N^{\left(N^N\right)}' title='N^{\left(N^N\right)}' class='latex' /> equal?  At this point, there is a slight subtlety: It&#8217;s not simply the set of computable functions from <img src='http://l.wordpress.com/latex.php?latex=N%5EN&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N^N' title='N^N' class='latex' /> (considered as a subset of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' />) to <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N' title='N' class='latex' /> (considered as <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N' title='N' class='latex' />), because we would like to only admit those functions from <img src='http://l.wordpress.com/latex.php?latex=N%5EN&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N^N' title='N^N' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N' title='N' class='latex' /> that return the same number when given inputs which represent the same element of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D%5E%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}^\mathbb{N}' title='\mathbb{N}^\mathbb{N}' class='latex' />.</p>
<p>Therefore, we&#8217;ll let <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B%28N%5EN%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N^{(N^N)}' title='N^{(N^N)}' class='latex' /> be the set of <img src='http://l.wordpress.com/latex.php?latex=e%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e\in\mathbb{N}' title='e\in\mathbb{N}' class='latex' /> such that, for all <img src='http://l.wordpress.com/latex.php?latex=n%5Cin+N%5EN&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in N^N' title='n\in N^N' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28n%29%7B%5Cdownarrow%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(n){\downarrow}' title='\phi_e(n){\downarrow}' class='latex' />, and whenever <img src='http://l.wordpress.com/latex.php?latex=n%2Cm%5Cin+N%5EN&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n,m\in N^N' title='n,m\in N^N' class='latex' /> are such that for all <img src='http://l.wordpress.com/latex.php?latex=x%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in\mathbb{N}' title='x\in\mathbb{N}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_n%28x%29+%3D+%5Cphi_m%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_n(x) = \phi_m(x)' title='\phi_n(x) = \phi_m(x)' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28n%29+%3D+%5Cphi_e%28m%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(n) = \phi_e(m)' title='\phi_e(n) = \phi_e(m)' class='latex' />.</p>
<p>We can similarly define <img src='http://l.wordpress.com/latex.php?latex=%28N%5EN%29%5E%7B%28N%5EN%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(N^N)^{(N^N)}' title='(N^N)^{(N^N)}' class='latex' />, except that there are now two places where we should take into account that we consider <img src='http://l.wordpress.com/latex.php?latex=n%2Cm%5Cin+N%5EN&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n,m\in N^N' title='n,m\in N^N' class='latex' /> equivalent if for all <img src='http://l.wordpress.com/latex.php?latex=x%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in\mathbb{N}' title='x\in\mathbb{N}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_n%28x%29+%3D+%5Cphi_m%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_n(x) = \phi_m(x)' title='\phi_n(x) = \phi_m(x)' class='latex' />:  We&#8217;ll let <img src='http://l.wordpress.com/latex.php?latex=%28N%5EN%29%5E%7B%28N%5EN%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(N^N)^{(N^N)}' title='(N^N)^{(N^N)}' class='latex' /> be the set of <img src='http://l.wordpress.com/latex.php?latex=e&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e' title='e' class='latex' /> such that, whenever <img src='http://l.wordpress.com/latex.php?latex=n%2C+m+%5Cin+N%5EN&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n, m \in N^N' title='n, m \in N^N' class='latex' /> are equivalent in the aforementioned sense, <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(n)' title='\phi_e(n)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28m%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(m)' title='\phi_e(m)' class='latex' /> are defined and equivalent in the aforementioned sense.</p>
<p>In a similar fashion, we can define <img src='http://l.wordpress.com/latex.php?latex=%28N%5E%7B%28N%5EN%29%7D%29%5E%7B%28N%5EN%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(N^{(N^N)})^{(N^N)}' title='(N^{(N^N)})^{(N^N)}' class='latex' /> and so on; these sets are called the sets of hereditarily computable functions.</p>
<p>Can we generalize this construction to a category that incorporates all possible computable representations of real objects?  More ambitiously, can we generalize to a category that is a genuine mathematical <em>universe</em> in the sense that questions like &#8220;Does the Riemann Hypothesis hold in this category?&#8221; are meaningful?  The answer, due to Martin Hyland, is yes.</p>
<p><span id="more-177"></span></p>
<p>This material is from <a href="http://www.math.uu.nl/people/jvoosten/">Jaap van Oosten</a>&#8216;s book &#8220;Realizability: A Categorical Perspective&#8221; (<a href="http://www.math.uu.nl/people/jvoosten/boekbegin.pdf">link to Preface, Introduction and Table of Contents</a>).  Unfortunately, I don&#8217;t know of a freely available explanation of the effective topos on the web, which is part of the reason why I&#8217;m writing this blog entry.  (If you know of one, please leave a comment. Edit: <a href="http://www.lfcs.inf.ed.ac.uk/reports/92/ECS-LFCS-92-208/">Found one.</a>)  However, the Stanford Encyclopedia of Philosophy has a pretty good <a href="http://plato.stanford.edu/entries/logic-intuitionistic/#ReaSemForHeyAri">section</a> on the realizability interpretation of intuitionistic logic, on which Hyland&#8217;s effective topos is based.</p>
<p>Back to the math. Notice that what we did in the <img src='http://l.wordpress.com/latex.php?latex=N%5EN&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N^N' title='N^N' class='latex' /> case was the following: Although we represented the computable functions as a subset of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' />, we still kept the &#8220;real&#8221; set <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D%5E%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}^\mathbb{N}' title='\mathbb{N}^\mathbb{N}' class='latex' /> hiding around in the background: we used it to determine what the appropriate elements of <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B%28N%5EN%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N^{(N^N)}' title='N^{(N^N)}' class='latex' /> should be: If two elements of <img src='http://l.wordpress.com/latex.php?latex=N%5EN&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N^N' title='N^N' class='latex' /> represented the same element of the &#8220;real set&#8221; <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D%5E%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}^\mathbb{N}' title='\mathbb{N}^\mathbb{N}' class='latex' />, then an element of <img src='http://l.wordpress.com/latex.php?latex=N%5E%7B%28N%5EN%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N^{(N^N)}' title='N^{(N^N)}' class='latex' /> should assign the same number to both of them.</p>
<p>That suggests a generalization.  Let an<em> assembly</em> be a pair <img src='http://l.wordpress.com/latex.php?latex=%28X%2CE%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(X,E)' title='(X,E)' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X' title='X' class='latex' /> is a set, and <img src='http://l.wordpress.com/latex.php?latex=E&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E' title='E' class='latex' /> is a function from <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X' title='X' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BP%7D%28%5Cmathbb%7BN%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathcal{P}(\mathbb{N})' title='\mathcal{P}(\mathbb{N})' class='latex' />, the power set of <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' />.  We think of <img src='http://l.wordpress.com/latex.php?latex=E&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E' title='E' class='latex' /> as assigning to each element of <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X' title='X' class='latex' /> its computable representations.  We let a morphism between two assemblies <img src='http://l.wordpress.com/latex.php?latex=%28X%2CE%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(X,E)' title='(X,E)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%28Y%2CE%27%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(Y,E&#039;)' title='(Y,E&#039;)' class='latex' /> be a function from <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X' title='X' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='Y' title='Y' class='latex' /> such that there is an <img src='http://l.wordpress.com/latex.php?latex=e%5Cin+%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e\in \mathbb{N}' title='e\in \mathbb{N}' class='latex' /> such that, whenever <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in X' title='x\in X' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=n%5Cin+E%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in E(x)' title='n\in E(x)' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28n%29%7B%5Cdownarrow%7D+%5Cin+E%27%28f%28x%29%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(n){\downarrow} \in E&#039;(f(x))' title='\phi_e(n){\downarrow} \in E&#039;(f(x))' class='latex' />.</p>
<p>With these morphisms, the class of assemblies forms a category.  Let <img src='http://l.wordpress.com/latex.php?latex=A%3D%28X%2CE_0%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A=(X,E_0)' title='A=(X,E_0)' class='latex' />  and  <img src='http://l.wordpress.com/latex.php?latex=B%3D%28Y%2CE_1%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B=(Y,E_1)' title='B=(Y,E_1)' class='latex' /> be two  assemblies.  Then they have a direct product <img src='http://l.wordpress.com/latex.php?latex=A%5Ctimes+B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A\times B' title='A\times B' class='latex' /> given by <img src='http://l.wordpress.com/latex.php?latex=%28X%5Ctimes+Y%2C+E_2%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(X\times Y, E_2)' title='(X\times Y, E_2)' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=E_2%28x%2Cy%29+%3D+%5C%7B+%5Clangle+n%2Cm%5Crangle+%5Cmid+n%5Cin+E_0%28x%29%5Ctext%7B+and+%7D+m%5Cin+E_1%28y%29%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E_2(x,y) = \{ \langle n,m\rangle \mid n\in E_0(x)\text{ and } m\in E_1(y)\}' title='E_2(x,y) = \{ \langle n,m\rangle \mid n\in E_0(x)\text{ and } m\in E_1(y)\}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+%7B%5Ccdot%7D%2C+%7B%5Ccdot%7D%5Crangle+%5Ccolon+%5Cmathbb%7BN%7D%5E2+%5Cto+%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\langle {\cdot}, {\cdot}\rangle \colon \mathbb{N}^2 \to \mathbb{N}' title='\langle {\cdot}, {\cdot}\rangle \colon \mathbb{N}^2 \to \mathbb{N}' class='latex' /> is a <a href="http://en.wikipedia.org/wiki/Pairing_function">pairing function</a>.   They have an exponential <img src='http://l.wordpress.com/latex.php?latex=A%5EB&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A^B' title='A^B' class='latex' /> given by <img src='http://l.wordpress.com/latex.php?latex=%28X%5EY%2C+E_2%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(X^Y, E_2)' title='(X^Y, E_2)' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=E_2%28f%29+%3D+%5C%7Be%5Cmid+%5Cforall+y%5Cin+Y%5C%2C%5Cforall+m%5Cin+E_1%28y%29%5C%2C+%5Cphi_e%28m%29%7B%5Cdownarrow%7D%5Cin+E_0%28f%28y%29%29%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E_2(f) = \{e\mid \forall y\in Y\,\forall m\in E_1(y)\, \phi_e(m){\downarrow}\in E_0(f(y))\}' title='E_2(f) = \{e\mid \forall y\in Y\,\forall m\in E_1(y)\, \phi_e(m){\downarrow}\in E_0(f(y))\}' class='latex' />.</p>
<p>If we let <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N' title='N' class='latex' /> be the assembly <img src='http://l.wordpress.com/latex.php?latex=%28%5Cmathbb%7BN%7D%2C+E%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\mathbb{N}, E)' title='(\mathbb{N}, E)' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=E%28n%29+%3D+%5C%7Bn%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E(n) = \{n\}' title='E(n) = \{n\}' class='latex' />, then the iterated exponential objects of <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N' title='N' class='latex' /> correspond precisely to our initial definition of the sets of hereditarily computable objects above.</p>
<p>This is all great, but we still can&#8217;t call the category of assemblies a mathematical <em>universe</em>.  Why not? Well, in the real world, we ask questions like &#8220;Is the Riemann Hypothesis true?&#8221;,  &#8220;Is Goldbach&#8217;s conjecture true&#8221;, etc., but we don&#8217;t yet know how to ask questions like &#8220;Is the Riemann Hypothesis true <em>in the category of assemblies</em>?&#8221; any more than it makes sense to ask whether or not the Riemann Hypothesis is true in the group <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BZ%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{Z}' title='\mathbb{Z}' class='latex' /> or in the number 17.  What we need is a way to interpret statements being true or not in this category.</p>
<p>In turning the category of assemblies into one in which we can interpret logical statements, there are three considerations, each of which builds on the previous ones.</p>
<ol>
<li><strong>The object of truth values should have more than two elements.</strong> Let&#8217;s step back to the ordinary category of sets for a moment.  Say we have two sets <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' /> and an injection <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' /> from <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='T' title='T' class='latex' />.  Given <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in T' title='x\in T' class='latex' />, what is the truth value of the proposition that <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> is in the image of <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' />?  Well, I don&#8217;t know, but it&#8217;s clearly either <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Btrue%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{true}' title='\text{true}' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Bfalse%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{false}' title='\text{false}' class='latex' />.  But in the category of assemblies it&#8217;s more complicated due to the computational information we have lying around.  Say that we have an injective morphism from an assembly <img src='http://l.wordpress.com/latex.php?latex=%28S%2CE%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(S,E)' title='(S,E)' class='latex' /> to an assembly <img src='http://l.wordpress.com/latex.php?latex=%28T%2CE%27%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(T,E&#039;)' title='(T,E&#039;)' class='latex' />.  Given <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+T&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in T' title='x\in T' class='latex' />, now what is the truth value of the proposition that <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> is in the image of <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' />?  What if there is a <img src='http://l.wordpress.com/latex.php?latex=y%5Cin+S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in S' title='y\in S' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=i%28y%29+%3D+x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i(y) = x' title='i(y) = x' class='latex' /> but <img src='http://l.wordpress.com/latex.php?latex=E%28y%29%5Cne+E%27%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E(y)\ne E&#039;(x)' title='E(y)\ne E&#039;(x)' class='latex' />?  Without resolving the issue now, one plausible answer is that the truth value should be the set of indices of all computable functions taking <img src='http://l.wordpress.com/latex.php?latex=E%28y%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E(y)' title='E(y)' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=E%27%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E&#039;(x)' title='E&#039;(x)' class='latex' />, so that the more &#8220;alike&#8221; <img src='http://l.wordpress.com/latex.php?latex=E%28y%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E(y)' title='E(y)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=E%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E(x)' title='E(x)' class='latex' />, the more &#8220;true&#8221; the proposition is, and furthermore this &#8220;alikeness&#8221; is represented in a computational way.  So, a working hypothesis is that the set of truth values should be something like <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BP%7D%28%5Cmathbb%7BN%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathcal{P}(\mathbb{N})' title='\mathcal{P}(\mathbb{N})' class='latex' />.</li>
<li><strong>Objects should come equipped with an equivalence relation.</strong> In the category of assemblies, there is no question about whether or not two elements of a given object are equal.  If we are making a category where the object of truth values is something like <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BP%7D%28%5Cmathbb%7BN%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathcal{P}(\mathbb{N})' title='\mathcal{P}(\mathbb{N})' class='latex' />, however, we should allow that the proposition that different elements are equal has a truth value in that object, rather than in the classical set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B%5Ctext%7Btrue%7D%2C+%5Ctext%7Bfalse%7D%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{\text{true}, \text{false}\}' title='\{\text{true}, \text{false}\}' class='latex' /> of truth values.  Therefore, objects should be something like <img src='http://l.wordpress.com/latex.php?latex=%28X%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(X,\sim)' title='(X,\sim)' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X' title='X' class='latex' /> is a set and <img src='http://l.wordpress.com/latex.php?latex=%5Csim&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sim' title='\sim' class='latex' /> is a map from <img src='http://l.wordpress.com/latex.php?latex=X%5Ctimes+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X\times X' title='X\times X' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BP%7D%28%5Cmathbb%7BN%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathcal{P}(\mathbb{N})' title='\mathcal{P}(\mathbb{N})' class='latex' />.  (We can represent an assembly <img src='http://l.wordpress.com/latex.php?latex=%28X%2CE%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(X,E)' title='(X,E)' class='latex' /> as the object <img src='http://l.wordpress.com/latex.php?latex=%28X%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(X,\sim)' title='(X,\sim)' class='latex' /> in our new category where <img src='http://l.wordpress.com/latex.php?latex=x%5Csim+x+%3D+E%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\sim x = E(x)' title='x\sim x = E(x)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=x%5Csim+y%3D+%5Cemptyset&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\sim y= \emptyset' title='x\sim y= \emptyset' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=x%5Cne+y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\ne y' title='x\ne y' class='latex' />).</li>
<li><strong>Morphisms should be more general than functions.</strong> If we&#8217;re allowing objects to come equipped with some sort of equivalence relation, we will have to let morphisms be more general than functions: If <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f' title='f' class='latex' /> is a morphism from <img src='http://l.wordpress.com/latex.php?latex=%28X%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(X,\sim)' title='(X,\sim)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%28Y%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(Y,\sim)' title='(Y,\sim)' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=f%28x%29+%3D+y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(x) = y' title='f(x) = y' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=y%5Csim+y%27%5Cne+%5Cemptyset&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\sim y&#039;\ne \emptyset' title='y\sim y&#039;\ne \emptyset' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=f%28x%29+%3D+y%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(x) = y&#039;' title='f(x) = y&#039;' class='latex' /> is also true to some degree.  So morphisms should probably be some sort of relation on <img src='http://l.wordpress.com/latex.php?latex=X%5Ctimes+Y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X\times Y' title='X\times Y' class='latex' /> that resembles a function in some way.</li>
</ol>
<p>Now, after listing all those (somewhat vague) considerations, I&#8217;ll describe the category that takes them into account.  It&#8217;s called the Effective Topos and it was discovered/invented by Martin Hyland.</p>
<p><strong>Description of the category</strong></p>
<p><strong></strong>The objects of the effective topos are pairs <img src='http://l.wordpress.com/latex.php?latex=%28X%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(X,\sim)' title='(X,\sim)' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X' title='X' class='latex' /> is a set and <img src='http://l.wordpress.com/latex.php?latex=%5Csim&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sim' title='\sim' class='latex' /> is a map from <img src='http://l.wordpress.com/latex.php?latex=X%5Ctimes+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X\times X' title='X\times X' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BP%7D%28%5Cmathbb%7BN%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathcal{P}(\mathbb{N})' title='\mathcal{P}(\mathbb{N})' class='latex' />.  This map is required to satisfy the following properties:</p>
<ul>
<li>There must be a number <img src='http://l.wordpress.com/latex.php?latex=s%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s\in\mathbb{N}' title='s\in\mathbb{N}' class='latex' /> such that for all <img src='http://l.wordpress.com/latex.php?latex=x%2Cy%5Cin+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x,y\in X' title='x,y\in X' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=n%5Cin+%28x%5Csim+y%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in (x\sim y)' title='n\in (x\sim y)' class='latex' /> then <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_s%28n%29%7B%5Cdownarrow%7D+%5Cin+%28y%5Csim+x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_s(n){\downarrow} \in (y\sim x)' title='\phi_s(n){\downarrow} \in (y\sim x)' class='latex' />.</li>
<li>There must be a number <img src='http://l.wordpress.com/latex.php?latex=t%5Cin+%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='t\in \mathbb{N}' title='t\in \mathbb{N}' class='latex' /> such that for all <img src='http://l.wordpress.com/latex.php?latex=x%2Cy%2Cz%5Cin+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x,y,z\in X' title='x,y,z\in X' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=n%5Cin+%28x%5Csim+y%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in (x\sim y)' title='n\in (x\sim y)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=m%5Cin+%28y%5Csim+z%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\in (y\sim z)' title='m\in (y\sim z)' class='latex' /> then <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_t%28%5Clangle+n%2Cm%5Crangle%29%5Cin+%28x%5Csim+z%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_t(\langle n,m\rangle)\in (x\sim z)' title='\phi_t(\langle n,m\rangle)\in (x\sim z)' class='latex' />.</li>
</ul>
<p>(In the above, <img src='http://l.wordpress.com/latex.php?latex=s&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='s' title='s' class='latex' /> stands for &#8220;symmetric&#8221; and <img src='http://l.wordpress.com/latex.php?latex=t&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='t' title='t' class='latex' /> stands for &#8220;transitive.&#8221;)</p>
<p>A morphism from <img src='http://l.wordpress.com/latex.php?latex=%28X%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(X,\sim)' title='(X,\sim)' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%28Y%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(Y,\sim)' title='(Y,\sim)' class='latex' /> is represented by a function <img src='http://l.wordpress.com/latex.php?latex=F%5Ccolon+X%5Ctimes+Y%5Cto+%5Cmathcal%7BP%7D%28%5Cmathbb%7BN%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='F\colon X\times Y\to \mathcal{P}(\mathbb{N})' title='F\colon X\times Y\to \mathcal{P}(\mathbb{N})' class='latex' /> satisfying the following:</p>
<ul>
<li>There must be a number <img src='http://l.wordpress.com/latex.php?latex=st%5Cin+%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='st\in \mathbb{N}' title='st\in \mathbb{N}' class='latex' /> such that for all <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in X' title='x\in X' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=y%5Cin+Y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in Y' title='y\in Y' class='latex' /> if <img src='http://l.wordpress.com/latex.php?latex=n%5Cin+F%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in F(x,y)' title='n\in F(x,y)' class='latex' /> then <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_%7Bst%7D%28n%29+%3D+%5Clangle+m%2Cp%5Crangle&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_{st}(n) = \langle m,p\rangle' title='\phi_{st}(n) = \langle m,p\rangle' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=m%5Cin+%28x%5Csim+x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\in (x\sim x)' title='m\in (x\sim x)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=p%5Cin+%28y%5Csim+y%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p\in (y\sim y)' title='p\in (y\sim y)' class='latex' />.</li>
<li>There must be a number <img src='http://l.wordpress.com/latex.php?latex=rl%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='rl\in\mathbb{N}' title='rl\in\mathbb{N}' class='latex' /> such that for all <img src='http://l.wordpress.com/latex.php?latex=x%2Cx%27%5Cin+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x,x&#039;\in X' title='x,x&#039;\in X' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=y%2Cy%27%5Cin+Y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y,y&#039;\in Y' title='y,y&#039;\in Y' class='latex' />, if <img src='http://l.wordpress.com/latex.php?latex=n%5Cin+%28x%5Csim+x%27%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in (x\sim x&#039;)' title='n\in (x\sim x&#039;)' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=m%5Cin+F%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\in F(x,y)' title='m\in F(x,y)' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=p%5Cin+%28y%5Csim+y%27%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p\in (y\sim y&#039;)' title='p\in (y\sim y&#039;)' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_%7Brl%7D%28%5Clangle+n%2Cm%2Cp%5Crangle%29+%5Cin+F%28x%27%2Cy%27%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_{rl}(\langle n,m,p\rangle) \in F(x&#039;,y&#039;)' title='\phi_{rl}(\langle n,m,p\rangle) \in F(x&#039;,y&#039;)' class='latex' />.</li>
<li>There must be a number <img src='http://l.wordpress.com/latex.php?latex=sv%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='sv\in\mathbb{N}' title='sv\in\mathbb{N}' class='latex' /> such that for all <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in X' title='x\in X' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=y%2Cy%27%5Cin+Y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y,y&#039;\in Y' title='y,y&#039;\in Y' class='latex' />, if <img src='http://l.wordpress.com/latex.php?latex=n%5Cin+F%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in F(x,y)' title='n\in F(x,y)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=m%5Cin+F%28x%2Cy%27%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m\in F(x,y&#039;)' title='m\in F(x,y&#039;)' class='latex' /> then <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_%7Bsv%7D%28%5Clangle+n%2Cm%5Crangle%29%5Cin+%28y%5Csim+y%27%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_{sv}(\langle n,m\rangle)\in (y\sim y&#039;)' title='\phi_{sv}(\langle n,m\rangle)\in (y\sim y&#039;)' class='latex' />.</li>
<li>There must be a number <img src='http://l.wordpress.com/latex.php?latex=tl%5Cin+%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='tl\in \mathbb{N}' title='tl\in \mathbb{N}' class='latex' /> such that for all <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in X' title='x\in X' class='latex' />, if <img src='http://l.wordpress.com/latex.php?latex=n%5Cin+%28x%5Csim+x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in (x\sim x)' title='n\in (x\sim x)' class='latex' /> then <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_%7Btl%7D%28x%29%7B%5Cdownarrow%7D%5Cin+%5Cbigcup_%7By%5Cin+Y%7DF%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_{tl}(x){\downarrow}\in \bigcup_{y\in Y}F(x,y)' title='\phi_{tl}(x){\downarrow}\in \bigcup_{y\in Y}F(x,y)' class='latex' />.</li>
</ul>
<p>(In the above, <img src='http://l.wordpress.com/latex.php?latex=st&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='st' title='st' class='latex' /> stands for &#8220;strict,&#8221; <img src='http://l.wordpress.com/latex.php?latex=rl&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='rl' title='rl' class='latex' /> stands for &#8220;relational,&#8221; <img src='http://l.wordpress.com/latex.php?latex=sv&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='sv' title='sv' class='latex' /> stands for &#8220;single-valued&#8221; and <img src='http://l.wordpress.com/latex.php?latex=tl&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='tl' title='tl' class='latex' /> stands for &#8220;total.&#8221;)</p>
<p>We say that two such representations <img src='http://l.wordpress.com/latex.php?latex=F&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='F' title='F' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=F%27&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='F&#039;' title='F&#039;' class='latex' /> are equivalent if there exist <img src='http://l.wordpress.com/latex.php?latex=e_0%2Ce_1%5Cin+%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='e_0,e_1\in \mathbb{N}' title='e_0,e_1\in \mathbb{N}' class='latex' /> such that for all <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in X' title='x\in X' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=y%5Cin+Y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in Y' title='y\in Y' class='latex' />, if <img src='http://l.wordpress.com/latex.php?latex=n%5Cin+F%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in F(x,y)' title='n\in F(x,y)' class='latex' /> then <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_%7Be_0%7D%28n%29%7B%5Cdownarrow%7D%5Cin+F%27%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_{e_0}(n){\downarrow}\in F&#039;(x,y)' title='\phi_{e_0}(n){\downarrow}\in F&#039;(x,y)' class='latex' /> and conversely if <img src='http://l.wordpress.com/latex.php?latex=n%5Cin+F%27%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in F&#039;(x,y)' title='n\in F&#039;(x,y)' class='latex' /> then <img src='http://l.wordpress.com/latex.php?latex=%5Cphi_%7Be_1%7D%28n%29%7B%5Cdownarrow%7D%5Cin+F%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_{e_1}(n){\downarrow}\in F(x,y)' title='\phi_{e_1}(n){\downarrow}\in F(x,y)' class='latex' />.  (Thus, a morphism in the Effective Topos is actually an equivalence class of representations as above.)</p>
<p>Figuring out how to compose such morphisms is an exercise left to the websurfer.</p>
<p>Let <img src='http://l.wordpress.com/latex.php?latex=%28X%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(X,\sim)' title='(X,\sim)' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%28Y%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(Y,\sim)' title='(Y,\sim)' class='latex' /> be two objects.  Their direct product is given by <img src='http://l.wordpress.com/latex.php?latex=%28X%5Ctimes+Y%2C+%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(X\times Y, \sim)' title='(X\times Y, \sim)' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%28x%2Cy%29%5Csim+%28x%27%2Cy%27%29+%3D+%28x%5Csim+x%27%29%5Ctimes+%28y%5Csim+y%27%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(x,y)\sim (x&#039;,y&#039;) = (x\sim x&#039;)\times (y\sim y&#039;)' title='(x,y)\sim (x&#039;,y&#039;) = (x\sim x&#039;)\times (y\sim y&#039;)' class='latex' />.  To form the exponential <img src='http://l.wordpress.com/latex.php?latex=%28Y%2C%5Csim%29%5E%7B%28X%2C%5Csim%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(Y,\sim)^{(X,\sim)}' title='(Y,\sim)^{(X,\sim)}' class='latex' />, take the object <img src='http://l.wordpress.com/latex.php?latex=%28%5Cmathcal%7BP%7D%28%5Cmathbb%7BN%7D%29%5E%7BX%5Ctimes+Y%7D%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\mathcal{P}(\mathbb{N})^{X\times Y},\sim)' title='(\mathcal{P}(\mathbb{N})^{X\times Y},\sim)' class='latex' />, where in the definition of <img src='http://l.wordpress.com/latex.php?latex=%5Csim&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sim' title='\sim' class='latex' />, you emulate the definition of a morphism given above.</p>
<p>The object of truth values (often denoted <img src='http://l.wordpress.com/latex.php?latex=%5COmega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Omega' title='\Omega' class='latex' /> in any topos) is <img src='http://l.wordpress.com/latex.php?latex=%28%5Cmathcal%7BP%7D%28%5Cmathbb%7BN%7D%29%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\mathcal{P}(\mathbb{N}),\sim)' title='(\mathcal{P}(\mathbb{N}),\sim)' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=S%5Csim+T+%3D+%5C%7B%5Clangle+e_0%2Ce_1%5Crangle+%5Cmid+%5Cforall+n%5Cin+S%5C%2C%5Cphi_%7Be_0%7D%28n%29%7B%5Cdownarrow%7D%5Cin+T%5Ctext%7B+and+%7D%5Cforall+m%5Cin+T%5C%2C%5Cphi_%7Be_1%7D%28m%29%7B%5Cdownarrow%7D%5Cin+S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S\sim T = \{\langle e_0,e_1\rangle \mid \forall n\in S\,\phi_{e_0}(n){\downarrow}\in T\text{ and }\forall m\in T\,\phi_{e_1}(m){\downarrow}\in S' title='S\sim T = \{\langle e_0,e_1\rangle \mid \forall n\in S\,\phi_{e_0}(n){\downarrow}\in T\text{ and }\forall m\in T\,\phi_{e_1}(m){\downarrow}\in S' class='latex' />.</p>
<p>The object <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1' title='1' class='latex' /> playing the role of a singleton set is <img src='http://l.wordpress.com/latex.php?latex=%28%5C%7B0%5C%7D%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\{0\},\sim)' title='(\{0\},\sim)' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=0%5Csim+0+%3D+%5C%7B0%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='0\sim 0 = \{0\}' title='0\sim 0 = \{0\}' class='latex' />.</p>
<p>The map from <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1' title='1' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5COmega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Omega' title='\Omega' class='latex' /> representing the truth value <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Btrue%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{true}' title='\text{true}' class='latex' /> is given by the equivalence class of the map  <img src='http://l.wordpress.com/latex.php?latex=F%5Ccolon+%5C%7B0%5C%7D%5Ctimes+%5Cmathcal%7BP%7D%28%5Cmathbb%7BN%7D%29%5Cto%5Cmathcal%7BP%7D%28%5Cmathbb%7BN%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='F\colon \{0\}\times \mathcal{P}(\mathbb{N})\to\mathcal{P}(\mathbb{N})' title='F\colon \{0\}\times \mathcal{P}(\mathbb{N})\to\mathcal{P}(\mathbb{N})' class='latex' /> defined by <img src='http://l.wordpress.com/latex.php?latex=%280%2CS%29%5Cmapsto+%28S%5Cequiv+%5Cmathbb%7BN%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(0,S)\mapsto (S\equiv \mathbb{N})' title='(0,S)\mapsto (S\equiv \mathbb{N})' class='latex' />.</p>
<p>The natural numbers object of the effective topos is <img src='http://l.wordpress.com/latex.php?latex=N+%3D+%28%5Cmathbb%7BN%7D%2C%5Csim%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N = (\mathbb{N},\sim)' title='N = (\mathbb{N},\sim)' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=n%5Csim+n+%3D+%5C%7Bn%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\sim n = \{n\}' title='n\sim n = \{n\}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n%5Csim+m+%3D+%5Cemptyset&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\sim m = \emptyset' title='n\sim m = \emptyset' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=n%5Cne+m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\ne m' title='n\ne m' class='latex' />.</p>
<p><strong>Interpretation of logical formulas in the effective topos.</strong></p>
<p>I&#8217;ll now describe how logical formulas can be interpreted in Hyland&#8217;s effective topos.  If <img src='http://l.wordpress.com/latex.php?latex=x_1%2C%5Cldots%2C+x_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1,\ldots, x_n' title='x_1,\ldots, x_n' class='latex' /> are variables intended to range over the objects <img src='http://l.wordpress.com/latex.php?latex=X_1%2C%5Cldots%2C+X_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X_1,\ldots, X_n' title='X_1,\ldots, X_n' class='latex' /> respectively and <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%28x_1%2C%5Cldots%2C+x_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi(x_1,\ldots, x_n)' title='\phi(x_1,\ldots, x_n)' class='latex' /> is a formula with free variables from <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bx_1%2C%5Cldots%2Cx_n%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{x_1,\ldots,x_n\}' title='\{x_1,\ldots,x_n\}' class='latex' />, then I&#8217;ll show how to find a map <img src='http://l.wordpress.com/latex.php?latex=%5C%7C%5Cphi%5C%7C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\|\phi\|' title='\|\phi\|' class='latex' /> from <img src='http://l.wordpress.com/latex.php?latex=X_1%5Ctimes+%5Ccdots+X_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X_1\times \cdots X_n' title='X_1\times \cdots X_n' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5COmega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Omega' title='\Omega' class='latex' /> interpreting that formula.  If <img src='http://l.wordpress.com/latex.php?latex=n+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n = 0' title='n = 0' class='latex' />, and thus the formula has no free variables and is a sentence, then the interpretation will give a map from <img src='http://l.wordpress.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1' title='1' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5COmega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Omega' title='\Omega' class='latex' />.  We say that a sentence holds in the effective topos if its interpretation is equal to the map <img src='http://l.wordpress.com/latex.php?latex=%5Ctext%7Btrue%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\text{true}' title='\text{true}' class='latex' /> defined above.</p>
<p>The only atomic relation is equality, and the interpretation of atomic formulas is given by the <img src='http://l.wordpress.com/latex.php?latex=%5Csim&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sim' title='\sim' class='latex' /> component of the objects of the effective topos.</p>
<p>For clarity, assume that <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cpsi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\psi' title='\psi' class='latex' /> contain only one free variable, and that it ranges over the object <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X' title='X' class='latex' />. If we know the interpretations of <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cpsi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\psi' title='\psi' class='latex' /> already, then we have the following:</p>
<ul>
<li> <img src='http://l.wordpress.com/latex.php?latex=%5C%7C%5Cphi%5Cwedge%5Cpsi%5C%7C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\|\phi\wedge\psi\|' title='\|\phi\wedge\psi\|' class='latex' /> is represented by the map taking <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in X' title='x\in X' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5C%7B%5Clangle+n%2Cm%5Crangle+%5Cmid+n%5Cin+%5C%7Cphi%5C%7C%28x%29%2Cm%5Cin%5C%7C%5Cpsi%5C%7C%28x%29%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{\langle n,m\rangle \mid n\in \|phi\|(x),m\in\|\psi\|(x)\}' title='\{\langle n,m\rangle \mid n\in \|phi\|(x),m\in\|\psi\|(x)\}' class='latex' />.</li>
<li><img src='http://l.wordpress.com/latex.php?latex=%5C%7C%5Cphi%5Cvee%5Cpsi%5C%7C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\|\phi\vee\psi\|' title='\|\phi\vee\psi\|' class='latex' /> is represented by the map taking <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in X' title='x\in X' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5C%7B%5Clangle+0%2Cn%5Cmid+n%5Cin+%5C%7C%5Cphi%5C%7C%28x%29%5C%7D%5Ccup+%5C%7B%5Clangle+1%2Cm%5Cmid+m%5Cin+%5C%7C%5Cpsi%5C%7C%28x%29%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{\langle 0,n\mid n\in \|\phi\|(x)\}\cup \{\langle 1,m\mid m\in \|\psi\|(x)\}' title='\{\langle 0,n\mid n\in \|\phi\|(x)\}\cup \{\langle 1,m\mid m\in \|\psi\|(x)\}' class='latex' />.</li>
<li><img src='http://l.wordpress.com/latex.php?latex=%5C%7C%5Cphi%5Crightarrow+%5Cpsi%5C%7C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\|\phi\rightarrow \psi\|' title='\|\phi\rightarrow \psi\|' class='latex' /> is represented by the map taking <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5C%7Be%5Cmid+%5Cforall+n%5Cin+%5C%7C%5Cphi%5C%7C%28x%29%5C%2C+%5Cphi_e%28n%29%7B%5Cdownarrow%7D%5Cin%5C%7C%5Cpsi%5C%7C%28x%29%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{e\mid \forall n\in \|\phi\|(x)\, \phi_e(n){\downarrow}\in\|\psi\|(x)\}' title='\{e\mid \forall n\in \|\phi\|(x)\, \phi_e(n){\downarrow}\in\|\psi\|(x)\}' class='latex' />.</li>
</ul>
<p>Now suppose that <img src='http://l.wordpress.com/latex.php?latex=%5Cphi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi' title='\phi' class='latex' /> has two free variables ranging over <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X' title='X' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='Y' title='Y' class='latex' /> respectively, and I&#8217;ll show you how to interpret quantifiers.</p>
<ul>
<li><img src='http://l.wordpress.com/latex.php?latex=%5C%7C%5Cexists_%7By%5Cin+Y%7D%5Cphi%5C%7C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\|\exists_{y\in Y}\phi\|' title='\|\exists_{y\in Y}\phi\|' class='latex' /> is represented by the map taking <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in X' title='x\in X' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cbigcup_%7By%5Cin+Y%7D%5C%7C%5Cphi%5C%7C%28x%2Cy%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\bigcup_{y\in Y}\|\phi\|(x,y)' title='\bigcup_{y\in Y}\|\phi\|(x,y)' class='latex' />.</li>
<li><img src='http://l.wordpress.com/latex.php?latex=%5C%7C%5Cforall_%7By%5Cin+Y%7D%5Cphi%5C%7C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\|\forall_{y\in Y}\phi\|' title='\|\forall_{y\in Y}\phi\|' class='latex' /> is represented by the map taking <img src='http://l.wordpress.com/latex.php?latex=x%5Cin+X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x\in X' title='x\in X' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5C%7Be%5Cin+%5Cmathbb%7BN%7D%5Cmid+%5Cforall+n%5Cin+%5Cmathbb%7BN%7D%5Cforall+y%5Cin+Y%5C%2C+%5Cphi_e%28n%29%7B%5Cdownarrow%7D%5Cin+%5C%7C%5Cphi%5C%7C%28x%2Cy%29%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{e\in \mathbb{N}\mid \forall n\in \mathbb{N}\forall y\in Y\, \phi_e(n){\downarrow}\in \|\phi\|(x,y)\}' title='\{e\in \mathbb{N}\mid \forall n\in \mathbb{N}\forall y\in Y\, \phi_e(n){\downarrow}\in \|\phi\|(x,y)\}' class='latex' />.</li>
</ul>
<p>Now, once we observe that we can interpret the power &#8220;set&#8221; of an object <img src='http://l.wordpress.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='X' title='X' class='latex' /> in the effective topos as the exponential <img src='http://l.wordpress.com/latex.php?latex=%5COmega%5EX&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Omega^X' title='\Omega^X' class='latex' />, we know how to interpret all first- and higher-order sentences as holding or not in the effective topos.</p>
<p>Here are some interesting sentences given by Van Oosten that highlight some differences between the effective topos and the ordinary category of sets.</p>
<ul>
<li>Note that we may write the relation &#8220;<img src='http://l.wordpress.com/latex.php?latex=%5Cphi_e%28n%29+%3D+m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi_e(n) = m' title='\phi_e(n) = m' class='latex' />&#8221; as a relation on <img src='http://l.wordpress.com/latex.php?latex=N%5Ctimes+N%5Ctimes+N&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N\times N\times N' title='N\times N\times N' class='latex' /> in our language.  Then the sentence <img src='http://l.wordpress.com/latex.php?latex=%5Cforall+f%5Cin+N%5EN%5Cexists+e%5Cin+N%5C%2C+%5Cphi_e+%3D+f&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\forall f\in N^N\exists e\in N\, \phi_e = f' title='\forall f\in N^N\exists e\in N\, \phi_e = f' class='latex' /> is true in the effective topos.</li>
<li>For every formula <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%28S%2Cx%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi(S,x)' title='\phi(S,x)' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S' title='S' class='latex' /> is a variable ranging over <img src='http://l.wordpress.com/latex.php?latex=%5COmega%5E%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Omega^{N}' title='\Omega^{N}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> is a variable ranging over <img src='http://l.wordpress.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='N' title='N' class='latex' />, the sentence <img src='http://l.wordpress.com/latex.php?latex=%5Cforall+S%5Cexists+x%5Cphi%28S%2Cx%29%5Crightarrow+%5Cexists+x%5Cforall+S%5Cphi%28S%2Cx%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\forall S\exists x\phi(S,x)\rightarrow \exists x\forall S\phi(S,x)' title='\forall S\exists x\phi(S,x)\rightarrow \exists x\forall S\phi(S,x)' class='latex' /> holds in the effective topos.</li>
<li>We may construct the rationals <img src='http://l.wordpress.com/latex.php?latex=Q&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='Q' title='Q' class='latex' /> and the reals <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='R' title='R' class='latex' /> in the effective topos just as we do in the category of sets. However, they have different properties.  For example, in the effective topos the statement &#8220;There exists a bounded monotonic sequence in <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='R' title='R' class='latex' /> that does not converge to a limit.&#8221; holds, contradicting the Bolzano-Weierstrass theorem.  Intuitively, this is because we can find a bounded, monotonic sequence converging to a real number whose binary expansion encodes the halting problem but such that every member of the sequence has a decidable binary expansion.</li>
<li>The sentence <img src='http://l.wordpress.com/latex.php?latex=%5Cneg+%5Cforall+x%5Cin+R+%28x+%5Cleq+0+%5Cvee+x+%5Cgeq+0%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\neg \forall x\in R (x \leq 0 \vee x \geq 0)' title='\neg \forall x\in R (x \leq 0 \vee x \geq 0)' class='latex' /> holds in the effective topos.</li>
<li>Similar to the above, we may can show that the intermediate value theorem fails in the effective topos.</li>
<li>In the effective topos, the statement &#8220;All functions from <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='R' title='R' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='R' title='R' class='latex' /> are continuous&#8221; holds.</li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/177/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/177/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/177/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/177/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/177/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/177/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/177/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/177/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/177/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/177/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/177/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/177/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/177/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/177/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=177&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/10/13/what-would-the-world-look-like-if-everything-was-computable-an-introduction-to-hylands-effective-topos/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>A language which does term inference</title>
		<link>http://xorshammer.com/2008/10/07/term-inference/</link>
		<comments>http://xorshammer.com/2008/10/07/term-inference/#comments</comments>
		<pubDate>Wed, 08 Oct 2008 01:38:53 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Functional Programming]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=170</guid>
		<description><![CDATA[Many strongly typed languages like OCaml do type inference. That is, even though they&#8217;re strongly typed, you don&#8217;t have to explicitly say what the type of everything is since a lot of the time the compiler can figure it out by itself. For example, if you define a function which takes an x and adds [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=170&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Many strongly typed languages like OCaml do type inference.  That is, even though they&#8217;re strongly typed, you don&#8217;t have to explicitly say what the type of everything is since a lot of the time the compiler can figure it out by itself.  For example, if you define a function which takes an <code>x</code> and adds it to <code>3</code>, the compiler will figure out that <code>x</code> is an <code>int</code>.  (It couldn&#8217;t be a <code>float</code>, since it was added to <code>3</code> and not <code>3.0</code>.)</p>
<p>But it often seems like the compiler should be able to infer not just the types of expressions, but the expressions themselves!  For example, if the compiler infers that the type of some function <code>f</code> is <code>(int -&gt; int) -&gt; (int list) -&gt; int list</code> (i.e., <code>f</code> is a higher-order function which takes a function from <code>int</code> to <code>int</code>, a list of <code>int</code>s, and produces a list of <code>int</code>s), then <code>f</code> is very probably the <code>map</code> function, defined informally by</p>
<p style="text-align:center;"><code>map g [x_1;...;x_n] = [g x_1;...;g x_n]</code>.</p>
<p>Therefore, if the compiler determines that some expression has that type, and the user has somehow omitted the actual function definition, why not allow the compiler to infer what the expression is?</p>
<p>I made a stab at implementing this type of idea in a toy language I call <a href="http://www.box.net/shared/5og9ayke5u">TermInf</a> (apologies for the weird hosting: I don&#8217;t have another hosting service at the moment).  It&#8217;s a modification of the toy language <a href="http://math.andrej.com/2008/09/14/sub-and-poly-two-new-additions-to-the-pl-zoo/">Poly</a> from Andrej Bauer&#8217;s <a href="http://andrej.com/plzoo/">Programming Language Zoo</a>. You&#8217;ll need <a href="http://caml.inria.fr/ocaml/release.en.html">OCaml</a> to compile it.  Please feel free to alert me to any bugs or to tell me that my code is horrible.</p>
<p>More details below.</p>
<p><span id="more-170"></span></p>
<p>The basic idea is really simple: For any expression <code>e</code>, the expression <code>{e}</code> is also an expression.  The compiler will infer the type <code>t1</code> of <code>e</code> and the type <code>t2</code> that <code>{e}</code> has to be.  It will search for a sequence of coercions taking <code>t1</code> to t2 and if there is a unique one, it will replace <code>{e}</code> with that sequence of coercions applied to <code>e</code>.</p>
<p>Which functions are coercions is determined by the user; functions can be declared to be coercions or removed from the list of coercions at any point.</p>
<p>I can think of at least three ways this would be useful.</p>
<p><strong>1. Automatically coercing from one base type to another</strong></p>
<p>This is actually the least interesting of the three, but it serves to illustrate how TermInf works.</p>
<p>You can use <code>$show_coercions</code> to show all the current coercions.  The identity<br />
function <code>id</code> is always a coercion.</p>
<pre>TermInf. Press Ctrl-D to exit.
TermInf&gt; $show_coercions
id</pre>
<p>Let&#8217;s define a new coercion from <code>bool</code> to <code>int</code>.</p>
<pre>
TermInf&gt; let_coercion bool_to_int = fun x -&gt; if x then 1 else 0
val bool_to_int : bool -&gt; int
TermInf&gt; $show_coercions
bool_to_int
id
</pre>
<p>Now we can use the coercion.</p>
<pre>
TermInf&gt; {true} + 7
- : int = 8
</pre>
<p>In that instance, the interpreter could determine that the type of <code>{true}</code> had to be <code>int</code>, since it was added to <code>7</code>.  In the following instance, the interpreter can&#8217;t determine type of <code>{true}</code>.</p>
<pre>
TermInf&gt; {true}
Problem with term inference.
</pre>
<p>But we can always explicitly give a type to any expression, so we can use that to tell the type-inferer what the type of <code>{true}</code> is.</p>
<pre>
TermInf&gt; {true} : bool
- : bool = true
TermInf&gt; {true} : int
- : int = 1
</pre>
<p><strong>2.  Lifting functions</strong><br />
We can view the function <code>List.map</code> as a coercion, taking a function <code>'a -&gt; 'b</code> to a function <code>'a list -&gt; 'b list</code>.</p>
<pre>
TermInf&gt; let_coercion map = rec map is fun f -&gt; fun l -&gt; match l with [] -&gt; [] | x::ll -&gt; (f x)::(map f ll)
val map : ('a -&gt; 'b) -&gt; 'a list -&gt; 'b list
TermInf&gt; $show_coercions
map
bool_to_int
id
</pre>
<p>Now we can try it out.</p>
<pre>
TermInf&gt; let square = fun x -&gt; x * x
val square : int -&gt; int
TermInf&gt; ({square} 3) : int
- : int = 9
TermInf&gt; ({square} [1;2;3]) : int list
- : int list = 1 :: 4 :: 9 :: []
TermInf&gt; ({square} [[1;2];[5;6;7]]) : int list list
- : (int list) list = (1 :: 4 :: []) :: (25 :: 36 :: 49 :: []) :: []
</pre>
<p>Note that in our case, we had to explicitly tell the interpreter what the return type was, although presumably in practice the interpreter or compiler would usually be able to infer it.</p>
<p>The idea is that we can change the basic structure of the thing passed to <code>{square}</code>, and the term inferer will adapt.  Note that in the third case, the term inferer iterated map to produce the required <code>(int list list -&gt; int list list)</code> type.</p>
<p>We can similarly look inside the structure of pairs.</p>
<pre>
TermInf&gt; let_coercion map_pair = fun f -&gt; fun x -&gt; (f (fst x), f (snd x))
val map_pair : ('a -&gt; 'b) -&gt; 'a * 'a -&gt; 'b * 'b
TermInf&gt; ({square} [(1,2);(3,4)]) : (int * int) list
- : (int * int) list = (1, 4) :: (9, 16) :: []
TermInf&gt; ({square} ([1;2],[3;4])) : (int list) * (int list)
- : int list * int list = (1 :: 4 :: [], 9 :: 16 :: [])
</pre>
<p>Essentially all variants of map can be added.  For example, the function <code>mapi : ((int * 'a) -&gt; 'b) -&gt; 'a list -&gt; 'b list</code> where the function takes the index of the list element can be added.  Then the term-inferer will determine which version of map (or sequence of versions of map) is needed based on the function given to it.</p>
<p><strong>3. Term inference in conjunction with phantom types.</strong><br />
I put just enough type aliasing in TermInf to allow you to use phantom types.  (For a great introduction to phantom types, see <a href="http://till-varoquaux.blogspot.com/2007/06/phun-with-phantom-types.html">this blog post</a>).</p>
<p>Here&#8217;s an example of how type aliasing works in TermInf:</p>
<pre>
TermInf&gt; type hidden = int
TermInf&gt; let f = (fun x -&gt; x + 7) : hidden -&gt; hidden
val f : hidden -&gt; hidden
TermInf&gt; let x = 3 : hidden
val x : hidden
TermInf&gt; f x
- : hidden = 10
TermInf&gt; f 3
The types hidden and int are incompatible
</pre>
<p>Something we might like to do with phantom types is have the type system do a static dimensional analysis on our program.  Here&#8217;s an attempt to do that:</p>
<pre>
TermInf&gt; type meters
TermInf&gt; type gallons
TermInf&gt; type 'a units = int
TermInf&gt; let add = (fun x -&gt; fun y -&gt; x + y) : 'a units -&gt; 'a units -&gt; 'a units
val add : 'a units -&gt; 'a units -&gt; 'a units
TermInf&gt; let times = (fun x -&gt; fun y -&gt; x * y) : 'a units -&gt; 'b units -&gt; ('a * 'b) units
val times : 'a units -&gt; 'b units -&gt; ('a * 'b) units
TermInf&gt; let one_gal = 1 : gallons units
val one_gal : gallons units
TermInf&gt; let one_m = 1 : meters units
val one_m : meters units
</pre>
<p>Then we have the following correct behavior:</p>
<pre>
TermInf&gt; add one_gal one_gal
- : gallons units = 2
TermInf&gt; times one_gal one_m
- : (gallons * meters) units = 1
TermInf&gt; add one_gal one_m
The types gallons and meters are incompatible
</pre>
<p>But the following is not correct:</p>
<pre>
TermInf&gt; let x = times one_gal one_m
val x : (gallons * meters) units
TermInf&gt; let y = times one_m one_gal
val y : (meters * gallons) units
TermInf&gt; add x y
The types gallons and meters are incompatible
</pre>
<p>Of course, the problem is that the interpreter doesn&#8217;t know that units commute.<br />
But we can fix this with coercions.</p>
<pre>
TermInf&gt; let_id_coercion commute = id : ('a * 'b) units -&gt; ('b * 'a) units
val commute : ('a * 'b) units -&gt; ('b * 'a) units
TermInf&gt; add x {y}
- : (gallons * meters) units = 2
</pre>
<p>We&#8217;ve declared <code>commute</code> to be an identity coercion (by using <code>let_id_coercion</code> instead of <code>let_coercion</code>) to help the interpreter when it&#8217;s deciding if a term inference is unique or not.</p>
<p>Note that we don&#8217;t use term inference on both <code>x</code> and <code>y</code>, because then it couldn&#8217;t determine what type to give it.</p>
<pre>
TermInf&gt; add {x} y
- : (meters * gallons) units = 2
TermInf&gt; add {x} {y}
Problem with term inference.
</pre>
<p>This version of <code>commute</code> will just commute the two units at the top level, but there are a finite number of identity coercions that you can define that will give you associativity and commutativity (and inverses, if you want).  Thus, the type system will be able to perform a static dimensional analysis on your program.</p>
<p>Edit: I should note that I left out several details about how this actually works.  For example, the interpreter doesn&#8217;t search through <i>all</i> sequences of coercions, since there are infinitely many (and the problem of deciding if there is a unique one between any two given types is undecidable in general).  Instead it limits itself to sequences of coercions whose type is never &#8220;bigger&#8221; that the starting type or the goal type, where &#8220;bigger&#8221; is defined by a straightforward length function. </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/170/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/170/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/170/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/170/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/170/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/170/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/170/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/170/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=170&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/10/07/term-inference/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>Playing Games in the Transfinite: An Introduction to &#8220;Ordinal Chomp&#8221;</title>
		<link>http://xorshammer.com/2008/09/29/playing-games-in-the-transfinite-an-introduction-to-ordinal-chomp/</link>
		<comments>http://xorshammer.com/2008/09/29/playing-games-in-the-transfinite-an-introduction-to-ordinal-chomp/#comments</comments>
		<pubDate>Tue, 30 Sep 2008 03:14:52 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=163</guid>
		<description><![CDATA[Chomp is a two-player game which is played as follows: The two players, A and B, start with a &#8220;board&#8221; which is a chocolate bar divided into small squares. With Player A starting, they take turns choosing a square and eating it together with all squares above and to the right. The catch is that [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=163&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/Chomp">Chomp</a><em> </em>is a two-player game which is played as follows:  The two players, A and B, start with a &#8220;board&#8221; which is a chocolate bar divided into <img src='http://l.wordpress.com/latex.php?latex=n+%5Ctimes+m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n \times m' title='n \times m' class='latex' /> small squares.  With Player A starting, they take turns choosing a square and eating it together with all squares above and to the right.   The catch is that the square at the lower left-hand corner is poisonous, and the player who is forced to eat it loses.</p>
<p>This image from the Wikipedia article shows a typical sequence of moves for a <img src='http://l.wordpress.com/latex.php?latex=5%5Ctimes+3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='5\times 3' title='5\times 3' class='latex' /> chocolate bar:</p>
<p style="text-align:center;"><a href="http://xorshammer.files.wordpress.com/2008/09/chomp.png"><img class="aligncenter size-full wp-image-164" title="chomp" src="http://xorshammer.files.wordpress.com/2008/09/chomp.png?w=500&#038;h=86" alt="" width="500" height="86" /></a></p>
<p>At this point, Player A is forced to eat the poisoned square and hence loses the game.</p>
<p>Although the question of <em>what</em> the winning strategies are for this game is very much an open problem, the question of <em>who</em> has a winning strategy is not:  On the <img src='http://l.wordpress.com/latex.php?latex=1%5Ctimes+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1\times 1' title='1\times 1' class='latex' /> board, Player B wins (since Player A must eat the poison piece on his first move).  But for any other board, Player A has a winning strategy.</p>
<p>To see why, suppose not.  Then if Player A&#8217;s first move is to eat just the one square in the top right-hand corner, Player B must have a winning response (since we are supposing that Player B has a winning response to <em>any</em> move that Player A makes).  But if Player B&#8217;s response is winning, then Player A could have simply made that move to start with.</p>
<p>However, suppose we play Chomp not just on <img src='http://l.wordpress.com/latex.php?latex=n%5Ctimes+m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\times m' title='n\times m' class='latex' /> boards, but on <img src='http://l.wordpress.com/latex.php?latex=%5Calpha%5Ctimes%5Cbeta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha\times\beta' title='\alpha\times\beta' class='latex' /> boards, where <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta' title='\beta' class='latex' /> are arbitrary ordinals.  The game still makes sense just as before, and will always end in finite time, but Player A no longer wins all of the time (there will no longer be a top right-hand corner square if either <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta' title='\beta' class='latex' /> is a limit ordinal).</p>
<p>Scott Huddleston and Jerry Shurman investigated Ordinal Chomp in <a href="http://www.msri.org/publications/books/Book42/files/huddleston.pdf">this paper</a>, and showed that it has a number of interesting properties.  I&#8217;ll describe a few of them below.</p>
<p><span id="more-163"></span></p>
<p>First of all, to make things a bit easier to discuss, we will consider only <img src='http://l.wordpress.com/latex.php?latex=n%5Ctimes+%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\times \alpha' title='n\times \alpha' class='latex' /> games where <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> is finite rather than <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta%5Ctimes%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta\times\alpha' title='\beta\times\alpha' class='latex' /> games where <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta' title='\beta' class='latex' /> is arbitrary.  However, everything said will go through fine in that case as well.</p>
<p>Secondly, we will use the following game which is equivalent to Chomp:  At any point in the game, the board is described by a non-increasing sequence <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha_0%2C%5Cldots%2C+%5Calpha_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha_0,\ldots, \alpha_n)' title='(\alpha_0,\ldots, \alpha_n)' class='latex' />.  On each turn, the appropriate player picks an <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' /> and a <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta' title='\beta' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta+%3C+%5Calpha_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta &lt; \alpha_i' title='\beta &lt; \alpha_i' class='latex' /> and replaces the sequence with <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha_0%2C%5Cldots%2C+%5Calpha_%7Bi-1%7D%2C%5Cbeta%2C%5Cldots%2C+%5Cbeta%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha_0,\ldots, \alpha_{i-1},\beta,\ldots, \beta)' title='(\alpha_0,\ldots, \alpha_{i-1},\beta,\ldots, \beta)' class='latex' />.  We call this <em>taking a bite of height </em><img src='http://l.wordpress.com/latex.php?latex=%5Cbeta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta' title='\beta' class='latex' />. Playing on an <img src='http://l.wordpress.com/latex.php?latex=n%5Ctimes+m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\times m' title='n\times m' class='latex' /> chocolate bar as described above corresponds to playing with the sequence <img src='http://l.wordpress.com/latex.php?latex=%28m%2C%5Cldots%2C+m%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(m,\ldots, m)' title='(m,\ldots, m)' class='latex' /> consisting of <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' />&#8216;s.</p>
<p>For ease of discussion later, we make the convention that any sequence (not just a non-increasing one) describes a game position by stipulating that <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha_1%2C%5Cldots%2C+%5Calpha_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha_1,\ldots, \alpha_n)' title='(\alpha_1,\ldots, \alpha_n)' class='latex' /> is the same as the position <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha_1%2C%5Cmin%5C%7B%5Calpha_1%2C%5Calpha_2%5C%7D%2C%5Cldots%2C%5Cmin%5C%7B%5Calpha_1%2C%5Cldots%2C%5Calpha_n%5C%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha_1,\min\{\alpha_1,\alpha_2\},\ldots,\min\{\alpha_1,\ldots,\alpha_n\})' title='(\alpha_1,\min\{\alpha_1,\alpha_2\},\ldots,\min\{\alpha_1,\ldots,\alpha_n\})' class='latex' />.</p>
<p>We saw above that Player A wins all non-trivial finite Chomp games, so let&#8217;s start by looking at a transfinite chomp game that Player B wins: <img src='http://l.wordpress.com/latex.php?latex=2%5Ctimes+%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2\times \omega' title='2\times \omega' class='latex' />, or <img src='http://l.wordpress.com/latex.php?latex=%28%5Comega%2C%5Comega%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\omega,\omega)' title='(\omega,\omega)' class='latex' /> in our new notation.  What is Player B&#8217;s winning strategy?  Well, notice that Player A&#8217;s first move has to put the game either in the state <img src='http://l.wordpress.com/latex.php?latex=%28n%2Cn%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(n,n)' title='(n,n)' class='latex' /> for some <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=%28%5Comega%2C+n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\omega, n)' title='(\omega, n)' class='latex' /> for some <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />. In either case, Player B can then move the game into state of the form <img src='http://l.wordpress.com/latex.php?latex=%28m%2B1%2Cm%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(m+1,m)' title='(m+1,m)' class='latex' /> for some <img src='http://l.wordpress.com/latex.php?latex=m&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='m' title='m' class='latex' />.  From a position of that form, whatever Player A does, Player B can again move to a position of that form.  Eventually, Player B will move to the position <img src='http://l.wordpress.com/latex.php?latex=%281%2C0%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(1,0)' title='(1,0)' class='latex' />, and Player A will be forced to eat the poison piece.</p>
<p>So, Player B can win at least <em>some</em> transfinite Chomp games, although he still loses a lot of them: for example, he loses all games of the form <img src='http://l.wordpress.com/latex.php?latex=2%5Ctimes+%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2\times \alpha' title='2\times \alpha' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%5Calpha+%3E+%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha &gt; \omega' title='\alpha &gt; \omega' class='latex' />.  The reason is that in such a game Player A can win by first moving to the position <img src='http://l.wordpress.com/latex.php?latex=2%5Ctimes+%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2\times \omega' title='2\times \omega' class='latex' /> and then using Player B&#8217;s winning strategy!  Similarly, Player B loses all games <img src='http://l.wordpress.com/latex.php?latex=n+%5Ctimes+%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n \times \omega' title='n \times \omega' class='latex' /> for <img src='http://l.wordpress.com/latex.php?latex=n+%3E+2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n &gt; 2' title='n &gt; 2' class='latex' />.</p>
<p>In fact, for any ordinal <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' />, there is exactly one ordinal <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta' title='\beta' class='latex' /> such that Player B wins <img src='http://l.wordpress.com/latex.php?latex=%5Calpha%5Ctimes%5Cbeta&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha\times\beta' title='\alpha\times\beta' class='latex' />.  This is a consequence of what Huddleston and Shurman call the Fundamental Theorem of Transfinite Chomp in the above paper.  Another interesting consequence, which illustrates the style of reasoning used in their proof of the Fundamental Theorem, is the following:</p>
<p><strong>Theorem:</strong> For any sequence <img src='http://l.wordpress.com/latex.php?latex=%5Calpha_2%2C%5Cldots%2C+%5Calpha_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha_2,\ldots, \alpha_n' title='\alpha_2,\ldots, \alpha_n' class='latex' /> of ordinals, there is exactly one ordinal <img src='http://l.wordpress.com/latex.php?latex=%5Calpha_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha_1' title='\alpha_1' class='latex' /> such that Player B wins the game <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha_1%2C%5Cldots%2C%5Calpha_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha_1,\ldots,\alpha_n)' title='(\alpha_1,\ldots,\alpha_n)' class='latex' />.  (Remember the convention above about sequence which are not necessarily non-increasing.)</p>
<p><strong>Proof:</strong> The uniqueness is similar to the argument given above: If Player B has a winning strategy on the game <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha_1%2C%5Cldots%2C%5Calpha_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha_1,\ldots,\alpha_n)' title='(\alpha_1,\ldots,\alpha_n)' class='latex' /> then Player A has a winning strategy on all games <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha%27_1%2C%5Calpha_2%2C%5Cldots%2C%5Calpha_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha&#039;_1,\alpha_2,\ldots,\alpha_n)' title='(\alpha&#039;_1,\alpha_2,\ldots,\alpha_n)' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%5Calpha%27_1+%3E+%5Calpha_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha&#039;_1 &gt; \alpha_1' title='\alpha&#039;_1 &gt; \alpha_1' class='latex' />, since Player A can just move to the position <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha_1%2C%5Cldots%2C+%5Calpha_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha_1,\ldots, \alpha_n)' title='(\alpha_1,\ldots, \alpha_n)' class='latex' /> and then use Player B&#8217;s winning strategy.</p>
<p>The existence is by induction.   Fix  <img src='http://l.wordpress.com/latex.php?latex=%5Calpha_2%2C%5Cldots%2C+%5Calpha_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha_2,\ldots, \alpha_n' title='\alpha_2,\ldots, \alpha_n' class='latex' /> and suppose that for all <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta_2%2C%5Cldots%2C%5Cbeta_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta_2,\ldots,\beta_n' title='\beta_2,\ldots,\beta_n' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta_i+%5Cleq+%5Calpha_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta_i \leq \alpha_i' title='\beta_i \leq \alpha_i' class='latex' /> for all <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' /> and for at least one <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta_i+%3C+%5Calpha_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta_i &lt; \alpha_i' title='\beta_i &lt; \alpha_i' class='latex' /> we know that there is a <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta_1+%3D+h%28%5Cbeta_2%2C%5Cldots%2C%5Cbeta_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta_1 = h(\beta_2,\ldots,\beta_n)' title='\beta_1 = h(\beta_2,\ldots,\beta_n)' class='latex' /> such that Player B has a winning strategy.</p>
<p>For each ordinal <img src='http://l.wordpress.com/latex.php?latex=%5Cgamma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\gamma' title='\gamma' class='latex' />, let <img src='http://l.wordpress.com/latex.php?latex=B_%5Cgamma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='B_\gamma' title='B_\gamma' class='latex' /> be the set of all <img src='http://l.wordpress.com/latex.php?latex=%28%5Cbeta_2%2C%5Cldots%2C+%5Cbeta_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\beta_2,\ldots, \beta_n)' title='(\beta_2,\ldots, \beta_n)' class='latex' /> obtainable from <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha_2%2C%5Cldots%2C+%5Calpha_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha_2,\ldots, \alpha_n)' title='(\alpha_2,\ldots, \alpha_n)' class='latex' /> by taking a bite of height <img src='http://l.wordpress.com/latex.php?latex=%5Cgamma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\gamma' title='\gamma' class='latex' /> (this term was defined above, if you forgot what it means).</p>
<p>Let <img src='http://l.wordpress.com/latex.php?latex=H+%3D+%5C%7B+h%28%5Cbeta_2%2C%5Cldots%2C%5Cbeta_n%29%5Cmid+%28%5Cbeta_2%2C%5Cldots%2C%5Cbeta_n%29%5Cin+B_%5Cgamma%5Ctext%7B+and+%7Dh%28%5Cbeta_2%2C%5Cldots%2C%5Cbeta_n%29+%3E+%5Cgamma+%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='H = \{ h(\beta_2,\ldots,\beta_n)\mid (\beta_2,\ldots,\beta_n)\in B_\gamma\text{ and }h(\beta_2,\ldots,\beta_n) &gt; \gamma \}' title='H = \{ h(\beta_2,\ldots,\beta_n)\mid (\beta_2,\ldots,\beta_n)\in B_\gamma\text{ and }h(\beta_2,\ldots,\beta_n) &gt; \gamma \}' class='latex' />.  Let <img src='http://l.wordpress.com/latex.php?latex=%5Calpha_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha_1' title='\alpha_1' class='latex' /> be the minimal ordinal not in <img src='http://l.wordpress.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='H' title='H' class='latex' />.  I claim that Player B has a winning strategy for <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha_1%2C%5Cldots%2C%5Calpha_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha_1,\ldots,\alpha_n)' title='(\alpha_1,\ldots,\alpha_n)' class='latex' />.  We will show this by showing that, for any move Player A makes, Player B can make a move that we know leads to a winning strategy for B.</p>
<p>So suppose Player A moves to <img src='http://l.wordpress.com/latex.php?latex=%28%5Cbeta_1%2C%5Cldots%2C+%5Cbeta_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\beta_1,\ldots, \beta_n)' title='(\beta_1,\ldots, \beta_n)' class='latex' />.  Either <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta_1+%3C+%5Calpha_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta_1 &lt; \alpha_1' title='\beta_1 &lt; \alpha_1' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta_1+%3D+%5Calpha_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta_1 = \alpha_1' title='\beta_1 = \alpha_1' class='latex' />.  First suppose that <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta_1+%3D+%5Calpha_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta_1 = \alpha_1' title='\beta_1 = \alpha_1' class='latex' />.  This means that Player A moved by taking a bite of height (say) <img src='http://l.wordpress.com/latex.php?latex=%5Cgamma&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\gamma' title='\gamma' class='latex' /> out of <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha_2%2C%5Cldots%2C+%5Calpha_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha_2,\ldots, \alpha_n)' title='(\alpha_2,\ldots, \alpha_n)' class='latex' /> by moving to <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha_1%2C%5Cbeta_2%2C%5Cldots%2C+%5Cbeta_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha_1,\beta_2,\ldots, \beta_n)' title='(\alpha_1,\beta_2,\ldots, \beta_n)' class='latex' />.  But by construction, we know that  <img src='http://l.wordpress.com/latex.php?latex=+%5Calpha_1+%5Cne++h%28%5Cbeta_2%2C%5Cldots%2C+%5Cbeta_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt=' \alpha_1 \ne  h(\beta_2,\ldots, \beta_n)' title=' \alpha_1 \ne  h(\beta_2,\ldots, \beta_n)' class='latex' />, which means that the player who is to move (Player B in this case) has a winning strategy.</p>
<p>Now suppose <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta_1+%3C+%5Calpha_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta_1 &lt; \alpha_1' title='\beta_1 &lt; \alpha_1' class='latex' />.  This means by construction of <img src='http://l.wordpress.com/latex.php?latex=%5Calpha_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha_1' title='\alpha_1' class='latex' /> that <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta_1+%3D+h%28%5Cbeta%27_2%2C%5Cldots%2C+%5Cbeta%27_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta_1 = h(\beta&#039;_2,\ldots, \beta&#039;_n)' title='\beta_1 = h(\beta&#039;_2,\ldots, \beta&#039;_n)' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta%27_i+%5Cleq+%5Cbeta_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta&#039;_i \leq \beta_i' title='\beta&#039;_i \leq \beta_i' class='latex' /> for all <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Cbeta%27_i+%3C+%5Cbeta_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\beta&#039;_i &lt; \beta_i' title='\beta&#039;_i &lt; \beta_i' class='latex' /> for at least one <img src='http://l.wordpress.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='i' title='i' class='latex' />.  Thus, if Player B moves to the position <img src='http://l.wordpress.com/latex.php?latex=%28%5Cbeta_1%2C%5Cbeta%27_2%2C%5Cldots%2C+%5Cbeta%27_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\beta_1,\beta&#039;_2,\ldots, \beta&#039;_n)' title='(\beta_1,\beta&#039;_2,\ldots, \beta&#039;_n)' class='latex' /> he ensures himself a win. <img src='http://l.wordpress.com/latex.php?latex=%5Csquare&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\square' title='\square' class='latex' />.</p>
<p>Note that this proof is constructive.  This means that you can actually use it to compute that (as we already know), the unique <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> such that Player B wins <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha%2C%5Comega%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha,\omega)' title='(\alpha,\omega)' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' />.  As a puzzle, you might like to find the unique <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> such that Player B wins <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha%2C%5Comega%2C1%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha,\omega,1)' title='(\alpha,\omega,1)' class='latex' /> (or such that Player B wins <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha%2C%5Comega%2C%5Comega%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha,\omega,\omega)' title='(\alpha,\omega,\omega)' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=%28%5Calpha%2C%5Comega%5E%5Comega%2C%5Comega%5E2%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\alpha,\omega^\omega,\omega^2)' title='(\alpha,\omega^\omega,\omega^2)' class='latex' /> or whatever you like, although the last one will be hard).</p>
<p>The much-more-general Fundamental Theorem of Transfinite Chomp in the paper linked above is also constructive.  It allows you, in theory, to compute who will win the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-dimensional game (we have been considering <img src='http://l.wordpress.com/latex.php?latex=2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2' title='2' class='latex' />-dimensional games) <img src='http://l.wordpress.com/latex.php?latex=%5Calpha_1%5Ctimes+%5Ccdots+%5Ctimes+%5Calpha_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha_1\times \cdots \times \alpha_n' title='\alpha_1\times \cdots \times \alpha_n' class='latex' /> for any ordinals <img src='http://l.wordpress.com/latex.php?latex=%5Calpha_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha_i' title='\alpha_i' class='latex' />.  However, this is quite difficult in practice: according to the Wikipedia article, it is an open question who wins the <img src='http://l.wordpress.com/latex.php?latex=3&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='3' title='3' class='latex' />-dimensional game <img src='http://l.wordpress.com/latex.php?latex=%5Comega%5Ctimes%5Comega%5Ctimes%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega\times\omega\times\omega' title='\omega\times\omega\times\omega' class='latex' />.</p>
<p>As a final note, in the book <a href="http://books.google.com/books?id=wgVPu0YEOlQC&amp;printsec=frontcover&amp;dq=tracking+the+automatic+ant&amp;sig=ACfU3U3RQXwpKEMtvxFHn8WlyM9eSqRLPA">Tracking the Automatic Ant</a>, David Gale gives a very nice non-constructive proof that for all <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />, there is a unique <img src='http://l.wordpress.com/latex.php?latex=%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\alpha' title='\alpha' class='latex' /> such that Player B wins <img src='http://l.wordpress.com/latex.php?latex=n%5Ctimes+%5Calpha&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\times \alpha' title='n\times \alpha' class='latex' />.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/163/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/163/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/163/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/163/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/163/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/163/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/163/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/163/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=163&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/09/29/playing-games-in-the-transfinite-an-introduction-to-ordinal-chomp/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>

		<media:content url="http://xorshammer.files.wordpress.com/2008/09/chomp.png" medium="image">
			<media:title type="html">chomp</media:title>
		</media:content>
	</item>
		<item>
		<title>Avoiding Set-Theoretic Paradoxes using Symmetry</title>
		<link>http://xorshammer.com/2008/09/21/avoiding-set-theoretic-paradoxes-using-symmetry/</link>
		<comments>http://xorshammer.com/2008/09/21/avoiding-set-theoretic-paradoxes-using-symmetry/#comments</comments>
		<pubDate>Mon, 22 Sep 2008 02:34:08 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=158</guid>
		<description><![CDATA[Intuitively, for any property of sets, there should be a set which has as its members all and only those sets such that holds. But this can&#8217;t actually work, due to Russell&#8217;s Paradox: Let , and then you can derive a contradiction from both and . The standard solution to this is essentially to forbid [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=158&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Intuitively, for any property <img src='http://l.wordpress.com/latex.php?latex=P%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='P(x)' title='P(x)' class='latex' /> of sets, there should be a set <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bx+%5Cmid+P%28x%29%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{x \mid P(x)\}' title='\{x \mid P(x)\}' class='latex' /> which has as its members all and only those sets <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=P%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='P(x)' title='P(x)' class='latex' /> holds.  But this can&#8217;t actually work, due to <a href="http://en.wikipedia.org/wiki/Russell%27s_paradox">Russell&#8217;s Paradox</a>: Let <img src='http://l.wordpress.com/latex.php?latex=b+%3D+%5C%7Bx%5Cmid+x%5Cnotin+x%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b = \{x\mid x\notin x\}' title='b = \{x\mid x\notin x\}' class='latex' />, and then you can derive a contradiction from both <img src='http://l.wordpress.com/latex.php?latex=b%5Cin+b&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b\in b' title='b\in b' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=b%5Cnotin+b&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b\notin b' title='b\notin b' class='latex' />.</p>
<p>The standard solution to this is essentially to forbid the construction of any set which is too <em>big</em>.  This solves the problem since you can prove that there are many sets which are not members of themselves, making <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b' title='b' class='latex' /> too big to be a set.  But you also end up throwing out many sets which you might want to have: for example, the set of all sets, the set of all groups, etc.</p>
<p><a href="http://math.boisestate.edu/~holmes/">Randall Holmes</a> recently published a <a href="http://math.boisestate.edu/%7Eholmes/holmes/symmetry3.ps">paper</a> espousing another solution: instead of forbidding the construction of sets which are too <em>big</em>, forbid the construction of sets which are too <em>asymmetric</em>.  Details below.</p>
<p><span id="more-158"></span></p>
<p>Imagine you have some permutation <img src='http://l.wordpress.com/latex.php?latex=%5Cpi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\pi' title='\pi' class='latex' /> of the universe of sets.  Because any set is also a set of sets, we can also consider the related permutation <img src='http://l.wordpress.com/latex.php?latex=j%5Cpi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j\pi' title='j\pi' class='latex' /> defined by <img src='http://l.wordpress.com/latex.php?latex=j%5Cpi%28x%29+%3D+%5C%7B%5Cpi%28y%29+%5Cmid+y%5Cin+x%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j\pi(x) = \{\pi(y) \mid y\in x\}' title='j\pi(x) = \{\pi(y) \mid y\in x\}' class='latex' />.  That is, <img src='http://l.wordpress.com/latex.php?latex=j%5Cpi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j\pi' title='j\pi' class='latex' /> acts on a set <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> by applying <img src='http://l.wordpress.com/latex.php?latex=%5Cpi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\pi' title='\pi' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' />&#8216;s elements.  By iteration, we have <img src='http://l.wordpress.com/latex.php?latex=j%5En%5Cpi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j^n\pi' title='j^n\pi' class='latex' /> for any <img src='http://l.wordpress.com/latex.php?latex=n%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in\mathbb{N}' title='n\in\mathbb{N}' class='latex' />.</p>
<p>For <img src='http://l.wordpress.com/latex.php?latex=n%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\in\mathbb{N}' title='n\in\mathbb{N}' class='latex' />, say that a set <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-symmetric if <img src='http://l.wordpress.com/latex.php?latex=j%5En%5Cpi%28x%29+%3D+x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j^n\pi(x) = x' title='j^n\pi(x) = x' class='latex' /> for all permutations <img src='http://l.wordpress.com/latex.php?latex=%5Cpi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\pi' title='\pi' class='latex' /> of the universe of sets.  We say that a set is symmetric if it&#8217;s <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-symmetric for some <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />.  Holmes&#8217;s criterion is then to forbid the construction of any set which is not symmetric.  (You may have noticed that this discussion is not quite rigorous.  Holmes&#8217;s paper has a fully rigorous formalization of this.)</p>
<p>So which sets are symmetric?  First of all, notice that the empty set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{\}' title='\{\}' class='latex' /> is symmetric, as it&#8217;s 1-symmetric.  Therefore the set <img src='http://l.wordpress.com/latex.php?latex=%5C%7B%5C%7B%5C%7D%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{\{\}\}' title='\{\{\}\}' class='latex' /> consisting of solely the empty set is 2-symmetric and therefore symmetric.  Similarly any hereditarily finite set (this means a set which can be written down with a finite number of <img src='http://l.wordpress.com/latex.php?latex=%5C%7B&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{' title='\{' class='latex' />&#8216;s and <img src='http://l.wordpress.com/latex.php?latex=%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\}' title='\}' class='latex' />&#8216;s and <img src='http://l.wordpress.com/latex.php?latex=%2C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt=',' title=',' class='latex' />&#8216;s and nothing else) is symmetric, since it will be <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-symmetric where <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> is the maximum depth of the braces.</p>
<p>It&#8217;s also the case that the set of all sets is 1-symmetric, so that exists.  What about the set of all groups?  A group will be encoded as some ordered pair of a set and a binary operation on that set, and a binary operation will be further encoded as a set of ordered pairs.  The set of all groups will be <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-symmetric where <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> is large enough to &#8220;pierce&#8221; the encoding, so that it ends up just permuting the group elements (and thus permuting the groups and sending the set of all groups to itself).</p>
<p>Can we develop mathematics in this theory?  It seems that constructing the natural numbers will be a problem.  The usual (von Neumann) definition of the natural numbers is that:</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=0+%3D+%5C%7B%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='0 = \{\}' title='0 = \{\}' class='latex' /></p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=1+%3D+%5C%7B0%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='1 = \{0\}' title='1 = \{0\}' class='latex' /></p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=2+%3D+%5C%7B0%2C1%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2 = \{0,1\}' title='2 = \{0,1\}' class='latex' /></p>
<p style="text-align:left;">and, in general, each natural number is the set of all the preceding ones.  All of these sets exist, since the von Neumann definition of <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> will be <img src='http://l.wordpress.com/latex.php?latex=%28n%2B1%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(n+1)' title='(n+1)' class='latex' />-symmetric, but the set of all natural numbers is not symmetric.</p>
<p style="text-align:left;">However, we can go back instead to Frege&#8217;s original definition of the natural numbers: each <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> is represented as the set of all sets of cardinality <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />.  For each <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />, Frege&#8217;s definition of <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> is 2-symmetric, and the set of all natural numbers is 3-symmetric.  The rationals and reals can be constructed as usual.</p>
<p>So, how do we know that the set <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b' title='b' class='latex' /> is not symmetric?  We don&#8217;t, but an encouraging fact is the following: There is  no known way to prove that for any formula <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi(x)' title='\phi(x)' class='latex' />, the set <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bx%5Cmid+%5Cphi%28x%29%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{x\mid \phi(x)\}' title='\{x\mid \phi(x)\}' class='latex' /> exists.  Instead, one can prove that <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bx+%5Cmid+%5Cphi%28x%29%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{x \mid \phi(x)\}' title='\{x \mid \phi(x)\}' class='latex' /> exists if <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi(x)' title='\phi(x)' class='latex' /> is <em>stratified</em>: this means that one can assign a natural number to each variable in <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi(x)' title='\phi(x)' class='latex' /> so that for any occurrence of the formula <img src='http://l.wordpress.com/latex.php?latex=y+%5Cin+z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y \in z' title='y \in z' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi(x)' title='\phi(x)' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y' title='y' class='latex' /> is assigned the number one less than that assigned to <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='z' title='z' class='latex' />, and for any occurrence of the formula <img src='http://l.wordpress.com/latex.php?latex=y+%3D+z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y = z' title='y = z' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=%5Cphi%28x%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\phi(x)' title='\phi(x)' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y' title='y' class='latex' /> is assigned the same number as that assigned to <img src='http://l.wordpress.com/latex.php?latex=z&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='z' title='z' class='latex' />.  The formula defining <img src='http://l.wordpress.com/latex.php?latex=b&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='b' title='b' class='latex' /> is emphatically not stratified!</p>
<p>If you like working with universal sets, but it makes you uneasy to use a set theory which you don&#8217;t know is consistent, check out <a href="http://math.boisestate.edu/%7Eholmes/holmes/nf.html#Consistent">NFU</a>. It uses the concept of stratified formulas to avoid Russell&#8217;s paradox, allows the existence of the set of all sets (and set of all groups, etc.) and is known to be consistent relative to ZFC.  In fact, Randall Holmes proposed the system I&#8217;ve discussed here as a way of clarifying the semantics of a related set theory.  A book developing mathematics in NFU is <a href="http://math.boisestate.edu/%7Eholmes/holmes/head.ps">here</a>.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/158/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/158/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/158/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=158&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/09/21/avoiding-set-theoretic-paradoxes-using-symmetry/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>The Undecidability of Identities Involving Sine, Exponentiation, and Absolute Value</title>
		<link>http://xorshammer.com/2008/09/14/the-undecidability-of-identities/</link>
		<comments>http://xorshammer.com/2008/09/14/the-undecidability-of-identities/#comments</comments>
		<pubDate>Sun, 14 Sep 2008 17:45:27 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=155</guid>
		<description><![CDATA[In the book A=B, the authors point out that while the identity is provable (by a very simple proof!), it&#8217;s not possible to prove the truth or falsity of all such identities. This is because Daniel Richardson proved the following: Let denote the class of expressions generated by The rational numbers, and . The variable [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=155&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In the book <a href="http://www.math.upenn.edu/~wilf/AeqB.html">A=B</a>, the authors point out that while the identity</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle%7B%5Csin%5E2%28%7C10+%2B+%5Cpi+x%7C%29+%2B+%5Ccos%5E2%28%7C10+%2B+%5Cpi+x%7C%29+%3D+1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\displaystyle{\sin^2(|10 + \pi x|) + \cos^2(|10 + \pi x|) = 1}' title='\displaystyle{\sin^2(|10 + \pi x|) + \cos^2(|10 + \pi x|) = 1}' class='latex' /></p>
<p style="text-align:left;">is provable (by a very simple proof!), it&#8217;s not possible to prove the truth or falsity of all such identities.  This is because Daniel Richardson proved the following:</p>
<p style="text-align:left;">Let <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathcal{R}' title='\mathcal{R}' class='latex' /> denote the class of expressions generated by</p>
<ol>
<li>The rational numbers, and <img src='http://l.wordpress.com/latex.php?latex=%5Cpi&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\pi' title='\pi' class='latex' />.</li>
<li>The variable <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /></li>
<li>The operations of addition, multiplication, and composition.</li>
<li>The sine, exponential, and absolute value functions.</li>
</ol>
<p>Then the problem of deciding whether or not an expression <img src='http://l.wordpress.com/latex.php?latex=E&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E' title='E' class='latex' /> in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathcal%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathcal{R}' title='\mathcal{R}' class='latex' /> is identically zero is undecidable.  This means as well that the problem of deciding whether or not two expressions <img src='http://l.wordpress.com/latex.php?latex=E_1%2C+E_2%5Cin+%5Cmathcal%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E_1, E_2\in \mathcal{R}' title='E_1, E_2\in \mathcal{R}' class='latex' /> are always equal is also undecidable, since this is equivalent to deciding if <img src='http://l.wordpress.com/latex.php?latex=E_1+-+E_2+%3D+E_1+%2B+%28-1%29%5Ccdot+E_2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='E_1 - E_2 = E_1 + (-1)\cdot E_2' title='E_1 - E_2 = E_1 + (-1)\cdot E_2' class='latex' /> is identically zero.</p>
<p>A summary of Richardson&#8217;s proof (mostly from Richardson&#8217;s paper itself) is below.</p>
<p><span id="more-155"></span></p>
<p>The proof depends on <a href="http://en.wikipedia.org/wiki/Hilbert%27s_tenth_problem">the MRDP theorem</a>, which says that for any <a href="http://en.wikipedia.org/wiki/Recursively_enumerable_set">recursively enumerable set</a> <img src='http://l.wordpress.com/latex.php?latex=S%5Csubset+%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S\subset \mathbb{N}' title='S\subset \mathbb{N}' class='latex' />, there is a polynomial <img src='http://l.wordpress.com/latex.php?latex=p%28y%2Cx_1%2C%5Cldots%2C+x_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p(y,x_1,\ldots, x_n)' title='p(y,x_1,\ldots, x_n)' class='latex' /> such that</p>
<p style="text-align:center;">For all <img src='http://l.wordpress.com/latex.php?latex=y%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in\mathbb{N}' title='y\in\mathbb{N}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=y%5Cin+S&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in S' title='y\in S' class='latex' /> iff there exist <img src='http://l.wordpress.com/latex.php?latex=x_1%5Cldots%2C+x_n%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1\ldots, x_n\in\mathbb{N}' title='x_1\ldots, x_n\in\mathbb{N}' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=p%28y%2Cx_1%2C%5Cldots%2C+x_n%29+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p(y,x_1,\ldots, x_n) = 0' title='p(y,x_1,\ldots, x_n) = 0' class='latex' />.</p>
<p style="text-align:left;">At the time that Richardson proved his result, the MDRP theorem was not proven.  Instead, only the weaker result where <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' /> is allowed to be an exponential polynomial (i.e., the <img src='http://l.wordpress.com/latex.php?latex=x_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_i' title='x_i' class='latex' /> are allowed to appear as exponents in <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' />) had been proven, and so that&#8217;s what he used.  I haven&#8217;t read Richardson&#8217;s proof closely enough to determine if his result can be improved using the full MDRP theorem.</p>
<p style="text-align:left;">In any case, since there are recursively enumerable sets which are not decidable, we may let <img src='http://l.wordpress.com/latex.php?latex=p%28y%2Cx_1%2C%5Cldots%2C+x_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p(y,x_1,\ldots, x_n)' title='p(y,x_1,\ldots, x_n)' class='latex' /> be an exponential polynomial such that the problem of deciding, given <img src='http://l.wordpress.com/latex.php?latex=y%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in\mathbb{N}' title='y\in\mathbb{N}' class='latex' />, whether or not there are <img src='http://l.wordpress.com/latex.php?latex=x_1%2C%5Cldots%2C+x_n%5Cin+%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1,\ldots, x_n\in \mathbb{N}' title='x_1,\ldots, x_n\in \mathbb{N}' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=p%28y%2Cx_1%2C%5Cldots%2C+x_n%29%3D0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p(y,x_1,\ldots, x_n)=0' title='p(y,x_1,\ldots, x_n)=0' class='latex' /> is undecidable.</p>
<p style="text-align:left;">Therefore, the problem of deciding, given <img src='http://l.wordpress.com/latex.php?latex=y%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in\mathbb{N}' title='y\in\mathbb{N}' class='latex' />, whether or not there are <em>non-negative real numbers</em> <img src='http://l.wordpress.com/latex.php?latex=x_1%2C%5Cldots%2C+x_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1,\ldots, x_n' title='x_1,\ldots, x_n' class='latex' /> such that</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=%28p%28y%2Cx_1%2C%5Cldots%2Cx_n%29%29%5E2+%2B+%5Csum_%7Bi%3D1%7D%5En+%5Csin%5E2+%5Cpi+x_i+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(p(y,x_1,\ldots,x_n))^2 + \sum_{i=1}^n \sin^2 \pi x_i = 0' title='(p(y,x_1,\ldots,x_n))^2 + \sum_{i=1}^n \sin^2 \pi x_i = 0' class='latex' /></p>
<p style="text-align:left;">is undecidable.</p>
<p style="text-align:left;">Now, let <img src='http://l.wordpress.com/latex.php?latex=q%28y%2Cx_1%2C%5Cldots%2C+x_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q(y,x_1,\ldots, x_n)' title='q(y,x_1,\ldots, x_n)' class='latex' /> be an exponential polynomial which grows very fast (and such that <img src='http://l.wordpress.com/latex.php?latex=q%280%2C%5Cldots%2C0%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='q(0,\ldots,0)' title='q(0,\ldots,0)' class='latex' /> is very large).  Then, if <img src='http://l.wordpress.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y' title='y' class='latex' /> is a natural number and there are non-negative reals <img src='http://l.wordpress.com/latex.php?latex=x_1%2C%5Cldots%2C+x_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1,\ldots, x_n' title='x_1,\ldots, x_n' class='latex' /> such that</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle%7Bq%28y%2Cx_1%2C%5Cldots%2C+x_n%29%5Ccdot+%28%28p%28y%2Cx_1%2C%5Cldots%2C+x_n%29%29%5E2+%2B+%5Csum_%7Bi%3D1%7D%5En+%5Csin%5E2+%5Cpi+x_i%7D%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\displaystyle{q(y,x_1,\ldots, x_n)\cdot ((p(y,x_1,\ldots, x_n))^2 + \sum_{i=1}^n \sin^2 \pi x_i})' title='\displaystyle{q(y,x_1,\ldots, x_n)\cdot ((p(y,x_1,\ldots, x_n))^2 + \sum_{i=1}^n \sin^2 \pi x_i})' class='latex' /></p>
<p style="text-align:left;">is less than one, then both <img src='http://l.wordpress.com/latex.php?latex=%7Cp%28y%2Cx_1%2C%5Cldots%2C+x_n%29%7C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='|p(y,x_1,\ldots, x_n)|' title='|p(y,x_1,\ldots, x_n)|' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5Csum_%7Bi%3D1%7D%5En+%5Csin%5E2+%5Cpi+x_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\sum_{i=1}^n \sin^2 \pi x_i' title='\sum_{i=1}^n \sin^2 \pi x_i' class='latex' /> are small.  From this last fact, we conclude that each <img src='http://l.wordpress.com/latex.php?latex=x_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_i' title='x_i' class='latex' /> is close to a natural number.  Let <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+x_i%5Crangle&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\langle x_i\rangle' title='\langle x_i\rangle' class='latex' /> be the natural number closest to <img src='http://l.wordpress.com/latex.php?latex=x_i&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_i' title='x_i' class='latex' />.  Then, <img src='http://l.wordpress.com/latex.php?latex=%7Cp%28y%2C%5Clangle+x_1%5Crangle%2C+%5Cldots%2C+%5Clangle+x_n%5Crangle%29%7C&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='|p(y,\langle x_1\rangle, \ldots, \langle x_n\rangle)|' title='|p(y,\langle x_1\rangle, \ldots, \langle x_n\rangle)|' class='latex' /> will be small.  But then, because it&#8217;s an integer, it will be zero.</p>
<p style="text-align:left;">Therefore, we have an expression <img src='http://l.wordpress.com/latex.php?latex=G%28y%2Cx_1%5Cldots%2Cx_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G(y,x_1\ldots,x_n)' title='G(y,x_1\ldots,x_n)' class='latex' /> formed from sine and exponential functions (and rational numbers, addition, and multiplication) such that for each <img src='http://l.wordpress.com/latex.php?latex=y%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in\mathbb{N}' title='y\in\mathbb{N}' class='latex' />, there exist non-negative reals <img src='http://l.wordpress.com/latex.php?latex=x_1%2C%5Cldots%2C+x_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1,\ldots, x_n' title='x_1,\ldots, x_n' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=G%28y%2Cx_1%2C%5Cldots%2C+x_n%29+%3C+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G(y,x_1,\ldots, x_n) &lt; 1' title='G(y,x_1,\ldots, x_n) &lt; 1' class='latex' /> iff there exist natural numbers <img src='http://l.wordpress.com/latex.php?latex=x_1%2C%5Cldots%2C+x_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1,\ldots, x_n' title='x_1,\ldots, x_n' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=p%28y%2Cx_1%2C%5Cldots+x_n%29+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p(y,x_1,\ldots x_n) = 0' title='p(y,x_1,\ldots x_n) = 0' class='latex' /> (which is an undecidable problem).</p>
<p style="text-align:left;">By an argument which I won&#8217;t reproduce here, we can replace <img src='http://l.wordpress.com/latex.php?latex=G%28y%2Cx_1%2C%5Cldots%2C+x_n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G(y,x_1,\ldots, x_n)' title='G(y,x_1,\ldots, x_n)' class='latex' /> with <img src='http://l.wordpress.com/latex.php?latex=G_0%28y%2Cx%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G_0(y,x)' title='G_0(y,x)' class='latex' /> with the property that for each <img src='http://l.wordpress.com/latex.php?latex=y%5Cin%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='y\in\mathbb{N}' title='y\in\mathbb{N}' class='latex' /> there exists a real <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=G_0%28y%2Cx%29+%3C+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G_0(y,x) &lt; 0' title='G_0(y,x) &lt; 0' class='latex' /> iff there exist natural numbers <img src='http://l.wordpress.com/latex.php?latex=x_1%2C%5Cldots%2C+x_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1,\ldots, x_n' title='x_1,\ldots, x_n' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=p%28y%2Cx_1%2C%5Cldots%2C+x_n%29%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p(y,x_1,\ldots, x_n)= 0' title='p(y,x_1,\ldots, x_n)= 0' class='latex' />.  (Notice that <img src='http://l.wordpress.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x' title='x' class='latex' /> now ranges over all reals.)  But now, consider the sequence  of functions</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=%7CG_0%280%2Cx%29%7C+-+G_0%280%2Cx%29%2C+%7CG_0%281%2Cx%29%7C+-+G_0%281%2Cx%29%2C+%7CG_0%282%2Cx%29%7C+-+G_0%282%2Cx%29%2C+%5Cldots&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='|G_0(0,x)| - G_0(0,x), |G_0(1,x)| - G_0(1,x), |G_0(2,x)| - G_0(2,x), \ldots' title='|G_0(0,x)| - G_0(0,x), |G_0(1,x)| - G_0(1,x), |G_0(2,x)| - G_0(2,x), \ldots' class='latex' /></p>
<p style="text-align:left;">Each is identically zero iff the corresponding <img src='http://l.wordpress.com/latex.php?latex=G%28n%2Cx%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='G(n,x)' title='G(n,x)' class='latex' /> is ever less than zero, which is an undecidable problem.</p>
<p style="text-align:left;">The reference for Richardson&#8217;s paper is: Daniel Richardson, &#8220;Some unsolvable problems involving elementary functions of a real variable,&#8221; Journal of Symbolic Logic, Volume 33, 1968, pages 514&#8211;520</p>
<p style="text-align:left;">Another reference is: B.F. Caviness, &#8220;On canonical forms and simplification,&#8221; Journal of the ACM, volume 17, 1970, pages 385&#8211;396.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/xorshammer.wordpress.com/155/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/xorshammer.wordpress.com/155/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/155/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/155/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/155/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/155/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/155/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/155/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/155/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=155&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/09/14/the-undecidability-of-identities/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>A Geometrically Natural Uncomputable Function</title>
		<link>http://xorshammer.com/2008/09/04/a-geometrically-natural-uncomputable-function/</link>
		<comments>http://xorshammer.com/2008/09/04/a-geometrically-natural-uncomputable-function/#comments</comments>
		<pubDate>Fri, 05 Sep 2008 02:50:47 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=148</guid>
		<description><![CDATA[There are many functions from to that cannot be computed by any algorithm or computer program. For example, a famous one is the halting problem, defined by if the th Turing machine halts and if the th Turing machine does not halt. Another one in the same spirit is the busy beaver function. We also [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=148&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>There are many functions from <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' /> that cannot be computed by any algorithm or computer program.  For example, a famous one is the <a href="http://en.wikipedia.org/wiki/Halting_problem">halting problem</a>, defined by <img src='http://l.wordpress.com/latex.php?latex=f%28n%29+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(n) = 0' title='f(n) = 0' class='latex' /> if the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />th Turing machine halts and <img src='http://l.wordpress.com/latex.php?latex=f%28n%29+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f(n) = 1' title='f(n) = 1' class='latex' /> if the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />th Turing machine does not halt.  Another one in the same spirit is the <a href="http://en.wikipedia.org/wiki/Busy_beaver_function">busy beaver function</a>.</p>
<p>We also know a priori that there must be uncomputable functions, since there are <img src='http://l.wordpress.com/latex.php?latex=2%5E%7B%5Caleph_0%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2^{\aleph_0}' title='2^{\aleph_0}' class='latex' /> functions from <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{N}' title='\mathbb{N}' class='latex' /> but only <img src='http://l.wordpress.com/latex.php?latex=%5Caleph_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\aleph_0' title='\aleph_0' class='latex' /> computer programs.  But that is nonconstructive, and the two examples I gave above seem a bit like they&#8217;re cheating since their definitions refer to the concept of computability.  Is there a natural example of an uncomputable function that does not refer to computability?</p>
<p>In <a href="http://www.math.toronto.edu/alex/wpk7.ps">this paper</a>,  Alex Nabutovsky found what I think is a great example of such a function from geometry.  Details below.</p>
<p><span id="more-148"></span></p>
<p>For any <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />, let <img src='http://l.wordpress.com/latex.php?latex=S%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^n' title='S^n' class='latex' /> be the <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' />-dimensional unit sphere <img src='http://l.wordpress.com/latex.php?latex=%5C%7Bx+%5Cin+%5Cmathbb%7BR%7D%5En+%5Cmid+%7Cx%7C+%3D+1%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\{x \in \mathbb{R}^n \mid |x| = 1\}' title='\{x \in \mathbb{R}^n \mid |x| = 1\}' class='latex' />.  For all <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> there is an &#8220;equatorial&#8221; embedding of <img src='http://l.wordpress.com/latex.php?latex=S%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^n' title='S^n' class='latex' /> into <img src='http://l.wordpress.com/latex.php?latex=S%5E%7Bn%2B1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^{n+1}' title='S^{n+1}' class='latex' /> sending <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+x_1%2C%5Cldots%2C+x_n%5Crangle&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\langle x_1,\ldots, x_n\rangle' title='\langle x_1,\ldots, x_n\rangle' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%5Clangle+x_1%2C%5Cldots%2C+x_n%2C+0%5Crangle&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\langle x_1,\ldots, x_n, 0\rangle' title='\langle x_1,\ldots, x_n, 0\rangle' class='latex' />.  This is certainly the nicest way of embedding <img src='http://l.wordpress.com/latex.php?latex=S%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^n' title='S^n' class='latex' /> into <img src='http://l.wordpress.com/latex.php?latex=S%5E%7Bn%2B1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^{n+1}' title='S^{n+1}' class='latex' /> but there are other ways.</p>
<p>If <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j' title='j' class='latex' /> is an embedding of <img src='http://l.wordpress.com/latex.php?latex=S%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^n' title='S^n' class='latex' /> into <img src='http://l.wordpress.com/latex.php?latex=S%5E%7Bn%2B1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^{n+1}' title='S^{n+1}' class='latex' /> then let its <em>amount of wiggle room</em> be the maximum amount that <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j' title='j' class='latex' />&#8216;s image can be thickened before it intersects itself.  More precisely, it is the maximum <img src='http://l.wordpress.com/latex.php?latex=t&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='t' title='t' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=x_0+%2B+t_0+v_0+%5Cne+x_1+%2B+t_1+v_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_0 + t_0 v_0 \ne x_1 + t_1 v_1' title='x_0 + t_0 v_0 \ne x_1 + t_1 v_1' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=t_0%2C+t_1+%3C+t&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='t_0, t_1 &lt; t' title='t_0, t_1 &lt; t' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=x_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_0' title='x_0' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=x_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1' title='x_1' class='latex' /> are in the image of <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j' title='j' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=v_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='v_0' title='v_0' class='latex' /> is the unit normal to the image of <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j' title='j' class='latex' /> at <img src='http://l.wordpress.com/latex.php?latex=x_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_0' title='x_0' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=v_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='v_1' title='v_1' class='latex' /> is the unit normal to the image of <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j' title='j' class='latex' /> at <img src='http://l.wordpress.com/latex.php?latex=x_1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1' title='x_1' class='latex' />.  We let <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j' title='j' class='latex' />&#8216;s <em>crumbledness </em>be the reciprocal of its amount of wiggle room.</p>
<p>It is known (by a theorem of  Stephen Smale) that any embedding of <img src='http://l.wordpress.com/latex.php?latex=S%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^n' title='S^n' class='latex' /> into <img src='http://l.wordpress.com/latex.php?latex=S%5E%7Bn%2B1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^{n+1}' title='S^{n+1}' class='latex' /> can be isotoped to the equatorial embedding (up to reparameterization), but you may have to increase the crumbledness to do it.  Nabutovsky proves that, for any dimension <img src='http://l.wordpress.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n' title='n' class='latex' /> and crumbledness <img src='http://l.wordpress.com/latex.php?latex=c&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='c' title='c' class='latex' />, there is an <img src='http://l.wordpress.com/latex.php?latex=f_n%28c%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f_n(c)' title='f_n(c)' class='latex' /> such that  any embedding of crumbledness <img src='http://l.wordpress.com/latex.php?latex=%3Cc&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='&lt;c' title='&lt;c' class='latex' /> can be isotoped to the equatorial embedding going through only embeddings of crumbledness <img src='http://l.wordpress.com/latex.php?latex=%3Cf_n%28c%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='&lt;f_n(c)' title='&lt;f_n(c)' class='latex' />.  We lose nothing in terms of complexity by considering <img src='http://l.wordpress.com/latex.php?latex=f_n%5Ccolon+%5Cmathbb%7BN%7D%5Cto%5Cmathbb%7BN%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f_n\colon \mathbb{N}\to\mathbb{N}' title='f_n\colon \mathbb{N}\to\mathbb{N}' class='latex' />.</p>
<p>Nabutovsky shows that for any <img src='http://l.wordpress.com/latex.php?latex=n+%5Cgeq+5&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n \geq 5' title='n \geq 5' class='latex' />, any <img src='http://l.wordpress.com/latex.php?latex=f_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f_n' title='f_n' class='latex' /> satisfying the above condition is uncomputable, and, furthermore, the minimum <img src='http://l.wordpress.com/latex.php?latex=f_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f_n' title='f_n' class='latex' /> which works grows like the busy beaver function!</p>
<p>The proof depends on the fact that for <img src='http://l.wordpress.com/latex.php?latex=n%5Cgeq+5&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='n\geq 5' title='n\geq 5' class='latex' />, it is undecidable if a compact manifold (presented either as a simplicial complex, or as a zero-set of a polynomial with rational coefficients, or some such representation that a computer can handle) embedded in <img src='http://l.wordpress.com/latex.php?latex=S%5E%7Bn%2B1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^{n+1}' title='S^{n+1}' class='latex' /> is diffeomorphic to <img src='http://l.wordpress.com/latex.php?latex=S%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^n' title='S^n' class='latex' />.  (For a good summary of these types of undecidability results in group theory and topology, see Section 3 of Bob Soare&#8217;s <a href="http://www.people.cs.uchicago.edu/~soare/res/Geometry/geom.pdf">Computability and Differential Geometry</a>.)  Essentially, the idea is that if <img src='http://l.wordpress.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='f' title='f' class='latex' /> were computable, you could decide if a manifold were homeomorphic to <img src='http://l.wordpress.com/latex.php?latex=S%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^n' title='S^n' class='latex' /> by taking embedding it in <img src='http://l.wordpress.com/latex.php?latex=S%5E%7Bn%2B1%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='S^{n+1}' title='S^{n+1}' class='latex' />, measuring its crumbledness (say as <img src='http://l.wordpress.com/latex.php?latex=c&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='c' title='c' class='latex' />), then checking all possible isotopies of the manifold through embeddings of crumbledness <img src='http://l.wordpress.com/latex.php?latex=%3C+f_n%28c%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='&lt; f_n(c)' title='&lt; f_n(c)' class='latex' />.  The fact that the manifold&#8217;s crumbledness can be measured and that all possible isotopies going through embeddings of bounded crumbledess can be checked computably is related to the fact that often you can computably search over compact spaces, as I wrote about in <a href="http://xorshammer.wordpress.com/2008/08/21/compute-definite-integral/">this post</a>.</p>
<p>If you can get a hold of a copy, I highly recommend Shmuel Weinberger&#8217;s book <a href="http://books.google.com/books?id=CtvmQiuOKSEC&amp;printsec=frontcover&amp;dq=shmuel+weinberger&amp;sig=ACfU3U3ODHye2679NPO2AlELJZFR6WlVPg">Computers, Rigidity, and Moduli</a>, where he talks about this and other related results in a very lively and engaging manner.</p>
<p>Edit: Fixed some notation.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/xorshammer.wordpress.com/148/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/xorshammer.wordpress.com/148/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xorshammer.wordpress.com/148/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xorshammer.wordpress.com/148/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/xorshammer.wordpress.com/148/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/xorshammer.wordpress.com/148/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/xorshammer.wordpress.com/148/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/xorshammer.wordpress.com/148/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/xorshammer.wordpress.com/148/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/xorshammer.wordpress.com/148/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/xorshammer.wordpress.com/148/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/xorshammer.wordpress.com/148/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/xorshammer.wordpress.com/148/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/xorshammer.wordpress.com/148/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/xorshammer.wordpress.com/148/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/xorshammer.wordpress.com/148/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=148&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xorshammer.com/2008/09/04/a-geometrically-natural-uncomputable-function/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ca5529817f93ca2fff7634d05d46a358?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mkoconnor</media:title>
		</media:content>
	</item>
		<item>
		<title>Integrability Conditions (Guest Post!)</title>
		<link>http://xorshammer.com/2008/09/04/integrability-conditions-guest-post/</link>
		<comments>http://xorshammer.com/2008/09/04/integrability-conditions-guest-post/#comments</comments>
		<pubDate>Thu, 04 Sep 2008 23:38:13 +0000</pubDate>
		<dc:creator>mkoconnor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://xorshammer.wordpress.com/?p=149</guid>
		<description><![CDATA[Please enjoy the following guest post on differential geometry by Tim Goldberg. A symplectic structure on a manifold is a differential -form satisfying two conditions: is non-degenerate, i.e. for each and tangent vector based at , if for all tangent vectors based at , then is the zero vector; is closed, i.e. the exterior derivative [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xorshammer.com&amp;blog=4469928&amp;post=149&amp;subd=xorshammer&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Please enjoy the following guest post on differential geometry by <a href="http://www.math.cornell.edu/~goldberg/">Tim Goldberg</a>.</p>
<p>A <em>symplectic structure</em> on a manifold <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> is a differential <img src='http://l.wordpress.com/latex.php?latex=2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2' title='2' class='latex' />-form <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' /> satisfying two conditions:</p>
<ol>
<li> <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' /> is <em>non-degenerate</em>, i.e. for each <img src='http://l.wordpress.com/latex.php?latex=p+%5Cin+M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p \in M' title='p \in M' class='latex' /> and tangent vector <img src='http://l.wordpress.com/latex.php?latex=%5Cvec%7Bu%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\vec{u}' title='\vec{u}' class='latex' /> based at <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' />, if <img src='http://l.wordpress.com/latex.php?latex=%5Comega_p%28%5Cvec%7Bu%7D%2C%5Cvec%7Bv%7D%29+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega_p(\vec{u},\vec{v}) = 0' title='\omega_p(\vec{u},\vec{v}) = 0' class='latex' /> for all tangent vectors <img src='http://l.wordpress.com/latex.php?latex=%5Cvec%7Bv%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\vec{v}' title='\vec{v}' class='latex' /> based at <img src='http://l.wordpress.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='p' title='p' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=%5Cvec%7Bu%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\vec{u}' title='\vec{u}' class='latex' /> is the zero vector;</li>
<li><img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' /> is <em>closed</em>, i.e. the exterior derivative of <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' /> is zero, i.e. <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bd%7D+%5Comega+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{d} \omega = 0' title='\mathrm{d} \omega = 0' class='latex' />.</li>
</ol>
<p>In trying to come up with answers to questions like &#8220;what do you do?&#8221; and &#8220;what is symplectic geometry?&#8221; that would be accessible to an advanced undergraduate or beginning graduate student, I&#8217;ve tried to come up with fairly intuitive descriptions of what the two symplectic structure conditions really mean.</p>
<p>Non-degeneracy is pretty easy, because my intended audience is certainly familiar with the dot product in Euclidean space, and probably familiar with more general machinery like inner products and bilinear forms. A <em>bilinear form</em> on a vector space over a field <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{F}' title='\mathbb{F}' class='latex' /> is just an assignment of a number in <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BF%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{F}' title='\mathbb{F}' class='latex' /> to each pair of vectors, in such a way that the assignment is linear in each vector in the pair. A bilinear form is called <em>non-degenerate</em> if the only thing that pairs to zero with every single vector is the zero vector. A <img src='http://l.wordpress.com/latex.php?latex=2&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='2' title='2' class='latex' />-form on <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> is a collection <img src='http://l.wordpress.com/latex.php?latex=%5Comega+%3D+%5C%7B+%5Comega_p+%5Cmid+p+%5Cin+M+%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega = \{ \omega_p \mid p \in M \}' title='\omega = \{ \omega_p \mid p \in M \}' class='latex' /> of skew-symmetric bilinear forms, one for each tangent space of <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' />. Saying that <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' /> is non-degenerate is saying that each of these bilinear forms is non-degenerate.</p>
<p>It&#8217;s much less clear how to describe to the uninitiated what the closed condition means. It&#8217;s even a bit unclear why this condition is required in the first place. A pretty nice answer came up yesterday, in a reading group I attend that is trying to learn about generalized complex structure. We are going through the PhD thesis of Marco Gualtieri, titled &#8220;Generalized Complex Geometry&#8221;. It is available at the following websites:</p>
<p style="text-align:center;"><a href="http://front.math.ucdavis.edu/0401.5221">http://front.math.ucdavis.edu/0401.5221</a></p>
<p style="text-align:center;"><a href="http://front.math.ucdavis.edu/0703.5298">http://front.math.ucdavis.edu/0703.5298</a>.</p>
<p>This was the first meeting, and <a href="http://www.math.cornell.edu/People/Faculty/matsumura.html">Tomoo Matsumura</a> was the speaker. He suggested that the requirement that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bd%7D+%5Comega+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{d} \omega = 0' title='\mathrm{d} \omega = 0' class='latex' /> is an <em>integrability condition</em>. I had never thought of it this way, but I probably will from now on.<br />
<span id="more-149"></span><strong></strong></p>
<p><strong>Almost complex and complex structures</strong></p>
<p>Let me first describe what integrability means for an almost complex structure on a manifold. A <em>complex structure on a vector space</em> <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' /> is real and finite-dimensional, is a linear endomorphism <img src='http://l.wordpress.com/latex.php?latex=J%5Ccolon%7BV%7D%5Cto%7BV%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='J\colon{V}\to{V}' title='J\colon{V}\to{V}' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=J%5E2+%3D+-+%5Cmathrm%7Bid%7D_V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='J^2 = - \mathrm{id}_V' title='J^2 = - \mathrm{id}_V' class='latex' />. Taking the determinant of both sides, we have <img src='http://l.wordpress.com/latex.php?latex=%28%5Cdet+J%29%5E2+%3D+%28-1%29%5E%7B%5Cdim+V%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\det J)^2 = (-1)^{\dim V}' title='(\det J)^2 = (-1)^{\dim V}' class='latex' />. Since <img src='http://l.wordpress.com/latex.php?latex=%28%5Cdet+J%29%5E2+%5Cgeq+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\det J)^2 \geq 0' title='(\det J)^2 \geq 0' class='latex' />, we must have <img src='http://l.wordpress.com/latex.php?latex=%28-1%29%5E%7B%5Cdim+V%7D+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(-1)^{\dim V} = 1' title='(-1)^{\dim V} = 1' class='latex' />, so <img src='http://l.wordpress.com/latex.php?latex=%5Cdim+V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\dim V' title='\dim V' class='latex' /> must be even. Furthermore, since <img src='http://l.wordpress.com/latex.php?latex=%28%5Cdet+J%29%5E2+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(\det J)^2 = 1' title='(\det J)^2 = 1' class='latex' />, we know <img src='http://l.wordpress.com/latex.php?latex=%5Cdet+J+%5Cneq+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\det J \neq 0' title='\det J \neq 0' class='latex' />, so <img src='http://l.wordpress.com/latex.php?latex=J&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='J' title='J' class='latex' /> is a linear automorphism. The complex structure <img src='http://l.wordpress.com/latex.php?latex=J&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='J' title='J' class='latex' /> makes <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' /> into a complex vector space, by setting</p>
<p style="text-align:center;"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle%7B%28a%2Bb+%5Cmathrm%7Bi%7D%29+%5Cvec%7Bv%7D+%3A%3D+a+%5Cvec%7Bv%7D+%2B+b+%5C%2C+J%28%5Cvec%7Bv%7D%29%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\displaystyle{(a+b \mathrm{i}) \vec{v} := a \vec{v} + b \, J(\vec{v})}' title='\displaystyle{(a+b \mathrm{i}) \vec{v} := a \vec{v} + b \, J(\vec{v})}' class='latex' /></p>
<p>for <img src='http://l.wordpress.com/latex.php?latex=a%2C+b+%5Cin+%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='a, b \in \mathbb{R}' title='a, b \in \mathbb{R}' class='latex' />and <img src='http://l.wordpress.com/latex.php?latex=%5Cvec%7Bv%7D+%5Cin+V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\vec{v} \in V' title='\vec{v} \in V' class='latex' />.</p>
<p>The standard example is <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E%7B2n%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^{2n}' title='\mathbb{R}^{2n}' class='latex' /> with its usual ordered basis, labelled <img src='http://l.wordpress.com/latex.php?latex=x_1%2C+y_1%2C+%5Cldots%2C+x_n%2C+y_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1, y_1, \ldots, x_n, y_n' title='x_1, y_1, \ldots, x_n, y_n' class='latex' />, and complex structure <img src='http://l.wordpress.com/latex.php?latex=J_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='J_0' title='J_0' class='latex' /> defined by <img src='http://l.wordpress.com/latex.php?latex=J_0%28x_j%29+%3D+y_j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='J_0(x_j) = y_j' title='J_0(x_j) = y_j' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=J_0%28y_j%29+%3D+-+x_j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='J_0(y_j) = - x_j' title='J_0(y_j) = - x_j' class='latex' /> for all <img src='http://l.wordpress.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='j' title='j' class='latex' />. Putting <img src='http://l.wordpress.com/latex.php?latex=z_j+%3D+x_j+%2B+%5Cmathrm%7Bi%7D+y_j&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='z_j = x_j + \mathrm{i} y_j' title='z_j = x_j + \mathrm{i} y_j' class='latex' />, we obtain the usual ordered basis for <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BC%7D%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{C}^n' title='\mathbb{C}^n' class='latex' /> with its usual complex structure.</p>
<p>Let <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> be a <img src='http://l.wordpress.com/latex.php?latex=%282n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(2n)' title='(2n)' class='latex' />-dimensional manifold. An <em>almost complex structure</em> on <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> is a smoothly-varying collection <img src='http://l.wordpress.com/latex.php?latex=J+%3D+%5C%7B+J_p+%5Cmid+p+%5Cin+M+%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='J = \{ J_p \mid p \in M \}' title='J = \{ J_p \mid p \in M \}' class='latex' /> of complex structures, one for each tangent space of <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' />. (The existence of an almost complex structure forces <img src='http://l.wordpress.com/latex.php?latex=%5Cdim+M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\dim M' title='\dim M' class='latex' /> to be even.) An almost complex structure on <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> is just a bunch of complex structures on the tangent spaces glued together smoothly along <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' />. Recall that as a manifold, all tangent spaces to a vector space can be canonically identified with the vector space itself, so a choice of complex structure on the vector space induces an almost complex structure on the vector space as a manifold.</p>
<p>An almost complex structure <img src='http://l.wordpress.com/latex.php?latex=J&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='J' title='J' class='latex' /> on <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> is called a <em>complex structure</em> if the complex structures on the vector spaces fit together in an even nicer way. We require that there be a covering of <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> by coordinate neighborhoods such that on each such neighborhood <img src='http://l.wordpress.com/latex.php?latex=J&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='J' title='J' class='latex' /> is the pullback of the standard complex structure on <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E%7B2n%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^{2n}' title='\mathbb{R}^{2n}' class='latex' />. We require also that all transition maps for these coordinate charts be holomorphic with respect to the standard complex structure. This collection of coordinate charts form a complex atlas for <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' />, and give <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> the structure of a complex manifold. (Notice that&#8217;s it&#8217;s easy to choose coordinates so that a single <img src='http://l.wordpress.com/latex.php?latex=J_p&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='J_p' title='J_p' class='latex' /> looks like the standard one, <img src='http://l.wordpress.com/latex.php?latex=J_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='J_0' title='J_0' class='latex' />. We require that this hold not just at a single point, but in an entire neighborhood of the point.)</p>
<p>An almost complex structure <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> is called <em>integrable</em> if it is actually a complex structure. There are many integrability conditions for almost complex structures, such as the vanishing of the <em>Nijenhuis tensor</em> associated to an almost complex structure.</p>
<p><strong>Almost symplectic and symplectic structures</strong></p>
<p>Now we give a parallel discussion for symplectic structures. A <em>symplectic structure on a vector space</em> <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' /> is real and finite-dimensional, is non-degenerate and skew-symmetric bilinear form <img src='http://l.wordpress.com/latex.php?latex=%5COmega%5Ccolon%7BV+%5Ctimes+V%7D%5Cto%7B%5Cmathbb%7BR%7D%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Omega\colon{V \times V}\to{\mathbb{R}}' title='\Omega\colon{V \times V}\to{\mathbb{R}}' class='latex' />. Choose a basis for <img src='http://l.wordpress.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='V' title='V' class='latex' /> and represent <img src='http://l.wordpress.com/latex.php?latex=%5COmega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Omega' title='\Omega' class='latex' /> by a matrix <img src='http://l.wordpress.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A' title='A' class='latex' /> relative to this basis. Because <img src='http://l.wordpress.com/latex.php?latex=%5COmega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Omega' title='\Omega' class='latex' /> is non-degenerate we know <img src='http://l.wordpress.com/latex.php?latex=%5Cdet+A+%5Cneq+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\det A \neq 0' title='\det A \neq 0' class='latex' />, and because it is skew-symmetric we know that <img src='http://l.wordpress.com/latex.php?latex=A+%3D+-A%5E%7B%5Ctext%7Btr%7D%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='A = -A^{\text{tr}}' title='A = -A^{\text{tr}}' class='latex' />, so <img src='http://l.wordpress.com/latex.php?latex=%5Cdet+A+%3D+%5Cdet+%28-A%5E%7B%5Ctext%7Btr%7D%7D%29+%3D+%28-1%29%5E%7B%5Cdim+V%7D+%5Cdet+A%5E%7B%5Ctext%7Btr%7D%7D+%3D+%28-1%29%5E%7B%5Cdim+V%7D+%5Cdet+A&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\det A = \det (-A^{\text{tr}}) = (-1)^{\dim V} \det A^{\text{tr}} = (-1)^{\dim V} \det A' title='\det A = \det (-A^{\text{tr}}) = (-1)^{\dim V} \det A^{\text{tr}} = (-1)^{\dim V} \det A' class='latex' />. Hence <img src='http://l.wordpress.com/latex.php?latex=%28-1%29%5E%7B%5Cdim+V%7D+%3D+1&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(-1)^{\dim V} = 1' title='(-1)^{\dim V} = 1' class='latex' />, so <img src='http://l.wordpress.com/latex.php?latex=%5Cdim+V&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\dim V' title='\dim V' class='latex' /> must be even.</p>
<p>The standard example is <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E%7B2n%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^{2n}' title='\mathbb{R}^{2n}' class='latex' />with its usual ordered basis, labelled <img src='http://l.wordpress.com/latex.php?latex=x_1%2C+y_1%2C+%5Cldots%2C+x_n%2C+y_n&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='x_1, y_1, \ldots, x_n, y_n' title='x_1, y_1, \ldots, x_n, y_n' class='latex' />, and symplectic structure <img src='http://l.wordpress.com/latex.php?latex=%5COmega_0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Omega_0' title='\Omega_0' class='latex' /> defined by <img src='http://l.wordpress.com/latex.php?latex=%5COmega_0%28x_i%2Cy_j%29+%3D+%5Cdelta_%7Bij%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Omega_0(x_i,y_j) = \delta_{ij}' title='\Omega_0(x_i,y_j) = \delta_{ij}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%5COmega_0%28x_i%2Cx_j%29+%3D+%5COmega_0%28y_i%2Cy_j%29+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\Omega_0(x_i,x_j) = \Omega_0(y_i,y_j) = 0' title='\Omega_0(x_i,x_j) = \Omega_0(y_i,y_j) = 0' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=%5Cdelta_%7Bij%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\delta_{ij}' title='\delta_{ij}' class='latex' /> is the Kronecker delta.</p>
<p>Let <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> be a <img src='http://l.wordpress.com/latex.php?latex=%282n%29&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='(2n)' title='(2n)' class='latex' />-dimensional manifold. An <em>almost symplectic structure</em> on <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> is a smoothly-varying collection <img src='http://l.wordpress.com/latex.php?latex=%5Comega+%3D+%5C%7B+%5Comega_p+%5Cmid+p+%5Cin+M+%5C%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega = \{ \omega_p \mid p \in M \}' title='\omega = \{ \omega_p \mid p \in M \}' class='latex' /> of symplectic structures, one for each tangent space of <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' />. (The existence of an almost symplectic structure forces <img src='http://l.wordpress.com/latex.php?latex=%5Cdim+M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\dim M' title='\dim M' class='latex' /> to be even.) An almost symplectic structure on <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> is just a bunch of symplectic structures on the tangent spaces glued together smoothly along <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' />.  As before, a choice of symplectic structure on a vector space induces an almost symplectic structure on the vector space as a manifold.</p>
<p>An almost symplectic structure <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' /> on <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> is called a <em>symplectic structure</em> if the symplectic structures on the vector spaces fit together in an even nicer way. Analogous to the complex structure case, we require that there be a covering of <img src='http://l.wordpress.com/latex.php?latex=M&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='M' title='M' class='latex' /> by coordinate neighborhoods such that on each such neighborhood <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' /> is the pullback of the standard complex structure on <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E%7B2n%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^{2n}' title='\mathbb{R}^{2n}' class='latex' />. We require also that all transition maps for these coordinate charts be symplectic with respect to the standard symplectic structure. A manifold with a symplectic structure is a symplectic manifold.</p>
<p>Not much seems to be said about almost symplectic structures on manifolds, and so even less is said about <em>integrable</em> almost symplectic structures. But if one were to say something about them, surely the first thing would be to notice that, by Darboux&#8217;s Theorem, there is an extremely simple integrability condition. This is exactly that <img src='http://l.wordpress.com/latex.php?latex=%5Comega&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\omega' title='\omega' class='latex' /> be closed, i.e. that <img src='http://l.wordpress.com/latex.php?latex=%5Cmathrm%7Bd%7D+%5Comega+%3D+0&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathrm{d} \omega = 0' title='\mathrm{d} \omega = 0' class='latex' />.</p>
<p><strong>Summary</strong></p>
<p>To summarize, every manifold is locally isomorphic to some <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5En&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^n' title='\mathbb{R}^n' class='latex' />. An almost complex manifold is one equipped with a smoothly varying collection of complex  structures on its tangent spaces. An almost complex manifold is a complex manifold if it is locally isomorphic to some <img src='http://l.wordpress.com/latex.php?latex=%5Cmathbb%7BR%7D%5E%7B2n%7D&#038;bg=ffffff&#038;fg=444444&#038;s=0' alt='\mathbb{R}^{2n}' title='\mathbb{R}^{2n}' class='latex' /> with its standard complex structure. In this case, the almost complex structure is called integrable.  Every previous sentence in this paragraph holds with the word &#8220;complex&#8221; replaced with &#8220;symplectic&#8221;. There are many well-known conditions for an almost complex structure 