Posted by: Steven Sam | June 24, 2009

Pfaffians and Plücker ideals

In this post, I want to discuss Pfaffians, a topic which I wish I had learned about as an undergraduate. I’m very interested in syzygies of ideals and such, and every now and then Pfaffians come up, so if only I knew what they were! Now that I know, I want to explain what they are and how they’re related to Plücker ideals.

Everything will be over the field K. If an n x n matrix has rank < r, then this can be checked by showing that all of the r x r submatrices of it have determinant 0. In particular, since these r x r minors are polynomials in the entries of the matrix, this says that the set of all matrices of rank < r is an algebraic subset Y of the space of all matrices. That it's irreducible can be seen by the following argument: let X be the space of n x n matrices, and let Gr(r-1, n) be the Grassmannian of r-1 planes in n-dimensional affine space. Then consider the subset Z of Gr(r-1, n) x X given by \{(W, f) \mid \text{image}(f) \subseteq W \}. If R is the tautological subbundle on Gr(r-1, n), then Z = \mathrm{Hom}(K^n, R) is a vector bundle over Gr(r-1, n) and hence is irreducible. But the image of Z under the projection \text{Gr}(r-1, n) \times X \to X is Y, so Y is also irreducible.

It's not so clear that the ideal generated by the r x r minors of a generic (= entries are algebraically independent variables over K) n x n matrix is radical, but this turns out to be true (one way to show that this is true is to find an explicit Gröbner basis for it).

But what if we only care about skew-symmetric matrices? To check if a matrix has rank < r, we could do the same as above, but the ideal generated by the r x r minors of a generic skew-symmetric matrix will NOT be radical. One problem already is that the determinant of any skew-symmetric matrix is always a perfect square in the field K, and hence our ideal should contain these square roots, which are called Pfaffians.
Read More…

Posted by: Steven Sam | June 1, 2009

P-partitions and Gorenstein algebras

In this post I’d like to say what a P-partition is, what a Gorenstein ring is, and plan to discuss a chain of topics which will lead from one to the other. Roughly the first half of this post can be found in Section 4.5 of Richard Stanley’s Enumerative Combinatorics, Vol 1.

First, let’s start with P-partitions. Here P is a finite poset with p elements. The standard definition of partition is of course a way of decomposing a positive number into a sum of positive numbers, i.e., 5 = 1 + 3 + 1. Since we don’t care about the order, we just list them in descending order, so the example becomes (3,1,1). Another way to interpret this is in terms of posets: let [m] be the usual ordering on the set {1, 2, …, m}. Then a partition of n using at most m parts is the same as an order-reversing map \sigma \colon \left[m\right] \to {\bf N}, where {\bf N} is the natural numbers (including 0) under the usual order, such that \sum_{i=1}^m \sigma(i) = n. Now replace [m] with an arbitrary partition P and we can talk about P-partitions. Also, we’ll say that a P-partition is strict if \sigma from before is strictly order-reversing.
Read More…

Posted by: Steven Sam | May 16, 2009

Tannaka–Krein duality

In this post, I want to discuss to what extent a group’s character table determines it up to isomorphism.

First, let’s do the easy case of Abelian groups. The set of characters of an Abelian group G is itself a group called G^\vee given by pointwise multiplication. In fact, G is isomorphic to G^\vee (though not canonically) as follows: we can write G \cong {\bf Z}/q_1 \oplus \cdots \oplus {\bf Z}/q_n where the q_j are prime powers. Fix generators for each cyclic summand. For each j, the function f_j \colon G \to {\bf C} given by sending the generator of {\bf Z}/q_j to \text{exp}(2\pi i/q_j) gives an element of order q_j in G^\vee, and the map G \to G^\vee given by (a_1, \dots, a_n) \mapsto (f_1^{a_1}, \dots, f_n^{a_n}) is injective. Since the number of characters of G is equal to the order of G, we conclude that it is an isomorphism. To do this canonically (without picking the direct sum decomposition), we can just do it twice (picking the decomposition twice ends up cancelling the fact that we made a choice), and we get an isomorphism G \to (G^\vee)^\vee by sending x to the character of G^\vee defined by evaluation: \psi \mapsto \psi(x). This is a special instance of Pontryagin duality, which holds more generally for any locally compact Abelian group. So we know that the characters determine the group up to isomorphism in this case.

Now let’s look at the noncommutative case. The first value of n for which there exist two nonisomorphic noncommutative groups of order n is 8, in which case we have the dihedral group D_4 which is the symmetries of the square, and the quaternion group Q_8.
Read More…

Posted by: Steven Sam | April 28, 2009

q-analogues and homogeneous spaces

While I was working on my final paper for the course on quivers that I’m taking this semester, I came across the following result (the notation \mathbf{F}_{p^r} means the finite field with p^r elements, and \mathbf{C} is the field of complex numbers):

Theorem. Let X be a variety defined over Z and assume for some prime p, and all r>0, that the function \#X({\bf F}_{p^r}) is obtained by plugging in p^r into some polynomial P(t). Then P(t) has integral coefficients, and P(1) is the Euler characteristic of X({\bf C}) (computed using cohomology with compact support).

If you’re not comfortable with varieties, just think of the solution set of some collection of polynomials in multiple variables, and if you don’t know what the rest of the words mean, that won’t matter much for the stuff starting with the next paragraph. Here I’m using the notation X(F) to denote the solutions for X over F, and I’m thinking of X({\bf C}) as a complex analytic space. The proof uses \ell-adic cohomology (with compact support) and the Grothendieck–Lefschetz trace formula. I won’t get into that, but I’d like to use this as an excuse to talk about the q-analogues of the natural numbers. Read More…

Posted by: Steven Sam | April 15, 2009

The Borel–Weil–Bott theorem

In connection with my last post on the Boij–Söderberg conjectures, I mentioned constructing equivariant supernatural vector bundles and equivariant pure Cohen–Macaulay modules using the Borel–Weil–Bott theorem. So in this post, I’d like to say something about what this theorem says and next time discuss how it can be used. I learned the stuff on Bott’s theorem from Jerzy Weyman’s book Cohomology of Vector Bundles and Syzygies [warning: there are some mistakes in the statement of Bott's theorem for general reductive groups]. Bott’s theorem is usually stated for a reductive group, but for concreteness we’ll stick with the general linear group, since that’s all we’ll need.

The setup is as follows: k denotes some field of characteristic 0, G = \mathbf{GL}(n), B is the subgroup of upper triangular matrices, and T is the subgroup of diagonal matrices. Then G/B is the complete flag variety whose k-points correspond to maximal flags V_\bullet = (0 \subset V_1 \subset \cdots \subset V_n = k^n) where \dim V_i = i. We are interested in realizing representations of G as cohomology groups of line bundles over G/B. But first we’ll state the relative version of the theorem.
Read More…

Posted by: yanzhang | April 9, 2009

A Useful Symmetric Function Identity

A not often-mentioned skill that I have been trying to develop is the evaluation of the “usefulness” of statements. Given my horrible memory and degrading mental RAM, I must prioritize which facts to consciously absorb and which ones to skim, since I’m sure I forget at least one theorem every time I learn a new one. Luckily, books aid to an extent by labelling the important things “Theorems” and “Lemmas,” though sometimes the latter are just as important as (if not much more than) the former, and sometimes even more important items appear in the Exercises section. Something about Exercise 7.70 in Richard Stanley’s Enumerative Combinatorics 2 really struck me as “useful.” The statement is:

\sum_{\lambda \vdash n} H_\lambda^{k-2} \prod_i^k s_\lambda(x^{(i)}) = \frac{1}{n!}\sum_{\prod w_i = 1} \prod_i^k p_{\rho(w_i)}(x^{(i)}), where the x^{(i)} are sets of variables, H_\lambda the product of the hook-lengths, \rho(w) gives the cycle structure, and w_i are permutations of size n.

Two things make the intuition of “useful” more concrete:

One, the statement has a lot of flexibility; By the symmetry of the symmetric group and the fact that cycles are preserved under inverses, you can sum over, say, products of all ordered (k-1)-tuples of w_i by rewriting the sum on the right as over w_1 = w_2\ldots w_k, for example, or sum over pairs which give the same product, say w_1w_2 = w_3w_4. Furthermore, having the choice over the variables makes a lot of specialization techniques useful. For example, you can “pick out” cycles by specializing x^{(i)} = (1, \zeta, \ldots, \zeta^{n-1}, 0, 0, \ldots), where \zeta is an n-th root of unity, because the power sums then give p_c(x^{(i)}) = 0 unless c = n.

Two, the statement has a lot of power; the proof of this fact has a “Stanley difficulty” of [3] (out of 5) means it contains some nontrivial usage of the symmetric function machinery already. Thus, by using it we know we are backed with some complex stuff in the background. Note when k = 1, we get the amazing Hook-length formula directly; when k = 2, by observing that summing over w_1w_2 = 1 is really summing over a single w (a special case of our discussion above on “flexibility”), we get the familiar “inner product” identity\sum_{\lambda \vdash n} s_\lambda(x) s_\lambda(y) = \frac{1}{n!}\sum_w p_{\rho(w)}(x) p_{\rho(w)}(y). These are two nontrivial statements without an obvious connection with each other, a pleasant surprise.

This warm “finding a hidden gem” feeling is very nice.

-Y

P.S. Well, there is a third (and less glamorous) cue, which is that at least two problems in the problem sets came with the hint: “Look at Exercise 7.70.” The power of hindsight in all its glory, I suppose. However, I liked this statement too much that I needed an excuse to write it down, so shhh.

Posted by: Steven Sam | April 2, 2009

Boij–Söderberg theory III: proofs

It’s been a while since the last post, but I want to give an indication of how the Eisenbud and Schreyer proved the Boij–Söderberg conjectures. This fell into roughly 3 steps. There are some constructions that are involved which I will not mention, but may mention in a future post. In particular, I will take it as a given that for any given degree sequence d, there exists a Cohen–Macaulay module M whose Betti diagram is pure with degree sequence d. Recall the setup from the previous post: we picked degree sequences \overline{d} and \underline{d} to restrict our attention to a finite-dimensional space of Betti tables.

Step 1: Identify the exterior facets of the cone spanned by the pure diagrams.

This step was done by Boij and Söderberg in their original paper. Recall that we put a partial ordering on the set of degree sequences given by pointwise comparison and that the cone spanned by the pure diagrams offers a geometric realization of this poset. In particular, the facets are given by chains of degree sequences which are obtained uniquely by removing a single element \pi(d_i) from a maximal chain \pi = (\pi(d_1) < \cdots \pi(d_q)). The uniqueness part means that there do not exist two different maximal chains such that one can remove a single element from each to get the same resulting chain. Let i denote the index of the degree sequence that was removed. They classified the exterior facets into three types:

Case 1: We remove the maximal or minimal element.

Case 2: The degree sequences \pi(d_{i-1}) and \pi(d_{i+1}) differ in exactly one entry.

Case 3: The degree sequences \pi(d_{i-1}) and \pi(d_{i+1}) differ in exactly two entries.
Read More…

Posted by: Steven Sam | March 8, 2009

Boij–Söderberg theory II: the Cohen–Macaulay property

Last time in this post, I gave an introduction to minimal resolutions over polynomial rings and stated a theorem of Eisenbud and Schreyer. This time I want to describe the significance of the Cohen–Macaulay property, and in part III, I will start explaining the proof of the Boij–Söderberg conjectures.

The first point to address is the notion of a Cohen–Macaulay module. Let’s first assume that M is a module over a local ring R with maximal ideal P (either this is an actual local ring, or R is graded, and it has a unique homogeneous maximal ideal). An M-sequence x_1, \dots, x_n \in P satisfies 1) multiplication by x_i is an injective function M_{i-1} \to M_{i-1} where M_0 = M and M_{i-1} = M / (x_1, \dots, x_{i-1}) M for i>0, and 2) (x_1, \dots, x_n) M \ne M, and the depth of M, depth(P,M), is the longest length of an M-sequence. The dimension of M, denoted dim(M), is the Krull dimension of R / ann(M) where ann(M) denotes the annihilator ideal of M. In general, the inequality \text{dim}(M) \ge \text{depth}(P,M) holds, and we say that M is Cohen–Macaulay (CM from now on) in case of equality. The ring R is CM if it is CM as a module over itself, and we extend these definitions to the global case by saying that a ring / module is CM if its localization at every maximal ideal is CM. But actually, since we will be dealing with graded modules, we will think of the polynomial ring K[x_1, \dots, x_n] as a “local graded ring” because it has a unique homogeneous maximal ideal.
Read More…

Posted by: Steven Sam | February 26, 2009

Combinatorial approach to e

I plan to continue my discussion of Boij–Söderberg theory from last time, but I’d like to make a quick detour first. This came up today in the office when I was talking with yanzhang.

Question: If I randomly pick numbers (with a uniform distribution) from the unit interval [0,1], what is the expected number of numbers that I need to pick before their sum is at least 1?

The answer for some strange reason is the number e, and I’ll illustrate this with two approaches. Both give different expressions of e, so this problem will show an equivalence between two definitions of e.

The first approach is to use calculus. For 0 \le r \le 1, let f(r) be the expected number of randomly selected numbers in [0,1] so that their sum is at least r, and define f(r) = 0 if r is negative. Then we have the following integral equation

\displaystyle f(r) = 1 + \int_0^r f(r-t)\, dt = 1 + \int_0^r f(s)\, ds.

To see this, pick a number t randomly in [0,1], and then add f(r-t). Since we need to account for all such t, we just integrate over [0,r] (I could integrate over [0,1], but remember that f(r-t) = 0 if t>r), and divide this integral by the length of [0,1], which is 1. The second inequality follows from a change of variables s=r-t. Taking derivatives of both sides, we get f'(r) = f(r) by the fundamental theorem of calculus, so f(r) = ce^r for some constant c. Of course, f(0) = 1, so c=1. Our original question was for r=1, in which case f(1) = e, as we promised.

The second approach is the same, except we discretize everything. Let’s ask an analogous question. First fix k. If I randomly pick integers uniformly from [0,k], what is the expected number that I need before their sum is at least n, where n \le k? Let g(n) be this expected value. Then again, we have

\displaystyle g(n) = 1 + \frac{1}{k+1} \sum_{i=0}^n g(i)

by the same reasoning as before. Let \Delta be the first difference operator, i.e., \Delta f(x) = f(x) - f(x-1). Now apply \Delta to the above equation to get

\displaystyle g(n) - g(n-1) = \frac{1}{k+1} (g(n) - g(-1)).

Of course, g(-1) = 0, and rewriting this last equation gives

\displaystyle g(n) = \frac{k+1}{k} g(n-1)

assuming that n>0. Since g(0) = 1, this means that g(n) = \left( \frac{k+1}{k} \right)^n. Now set k=n: If we rephrase this question, we also see that g(n) is the expected number of random selections of rational numbers with denominator n from [0,1] such that their sum is at least 1. Letting n go to infinity, we should get the answer from the previous method since the rationals form a dense subset of the reals.

So these two approaches show the equivalence of two definitions of e: one as the unique number such that \displaystyle \frac{d}{dx} e^x = e^x, and the other as the limit \displaystyle \lim_{n\to \infty} \left( \frac{n+1}{n} \right)^n.

Do you know of any other solutions to the original question?

-Steven

Posted by: Steven Sam | February 24, 2009

Boij–Söderberg theory I: preliminaries

I wanted to say something about Boij–Söderberg theory. But before I can even explain what that means, I have to give some definitions in homological commutative algebra. I might as well also use this as an excuse to talk about some results in that area.

Fix a field K and let A = K[x_1, \dots, x_n] be a polynomial ring in n variables. Let M be a module over A. We say that a chain complex

\cdots \to F_i \to F_{i-1} \to \cdots \to F_1 \to F_0 \to M \to 0

is a free resolution of M if each F_i is a free module, and if the complex is exact in each degree, i.e., the image of each map is equal to the kernel of the one following it. It is easy to see that every module has a free resolution: pick some set of generators for M, and let F_0 be the free module generated by them, so that we have a surjection F_0 \to M. Now pick a set of generators for the kernel of this map, and let F_1 be the free module generated by them, so that we have a map F_1 \to F_0 \to M which is exact at F_0, and repeat as necessary (possibly forever).

But we won’t really be interested in this unless M is graded. The polynomial ring has a natural grading A = \bigoplus_{i \ge 0} A_i where A_i is the K-vector space generated by degree i monomials. By grading, I just mean that A_i \cdot A_j \subseteq A_{i+j} for all i and j. So a (positively) graded module will be one with a decomposition M = \bigoplus_{i \ge 0} M_i such that A_i \cdot M_j \subseteq M_{i+j} for all i and j. Elements in M_i for some i will be called homogeneous. The right notion of a homomorphism of graded modules M \to N will be degree 0 maps: those that send M_i to N_i for all i. We could also talk about degree d maps, but instead we will use the notation M(d) to denote a grading shift: M(d)_i = M_{d+i}. So what we might call a degree d map is just a degree 0 map M(-d) \to N. So now that our modules are graded, we will be interested in graded free resolutions. In general, these will look like

(*) \quad \cdots \to \bigoplus_d A(-d)^{\beta_{i,d}} \to \cdots \to \bigoplus_d A(-d)^{\beta_{0,d}} \to M \to 0

(we will see later why I use -d instead of d, where the \beta_{i,j} will denote the number of generators in each graded piece (most of these numbers will be 0 of course). The existence of graded free resolutions works as before: instead of just picking random generators each time, we restrict our attention to homogeneous ones.

Okay, so how about some examples. Read More…

Older Posts »

Categories