In this post we discuss local class field theory (specifically looking at -adic fields) with a focus on the broader picture, and the multiple approaches.
Goal of post
So, before I begin actually writing about local class field theory, I’d like to make clear what my goals for this post are. Namely, local class field theory is a subject which has been infinitely written about. Besides the various books on class field theory there are countless well-written, more conciseaccounts of class field theory (both local and global) available on the internet, covering the theory from essentially all angles.
For this reason, I feel obligated to explain why what I have written below adds anything, perhaps, to the canon of already available literature. My goal is, in some sense, to serve as a sort of ‘next step’ beyond the local portion of Bjorn Poonen’s valuable note A Brief Summary of the Statements of Class Field Theory. Namely, below I do a bit more than state the main statements of local class field theory. I try to motivate the need/desire for the subject, as well as try to explain where it sits in the larger picture of modern number theory. I then try to give vague outlines of the proofs of the subject. Focusing on trying to at least explain how the objects are defined, and deferring the proofs of the hard/non-instructive results to the actual books on the subject.
Being a ‘next step’, I assume a little more sophistication than Bjorn Poonen’s note. In particular, I assume below that the reader is well-acquainted with the standard material of a first course in algebraic number theory (including the theory of -adic local fields). I will also assume, particularly in the motivational sections, as well as the Lubin-Tate portion some familiarity with the language of algebraic geometry.
Because this is not at all a comprehensive account of the theory, I have included at the end a list of the books I have found most helpful in each of the various topics we’ll be discussing below.
Classical motivation to study local class field theory
Class field theory has its modern-historic origins in Hilbert’s 12th problem. Namely, at that point in history Kronecker and Weber had proven their famous theorem: all finite abelian extensions (Galois extensions with abelian Galois group) of were contained inside a cyclotomic extension (one of the form where is a primitive root of unity). Hilbert then asked for a similar characterization of the abelian extensions of other number fields and, as a natural extension, what happens for the non-abelian extensions.
The desire to understand general number fields, and their extensions, came up naturally in the solving of Diophantine equations (a topic of great historical interest). Indeed, the first attempts to prove Fermat’s last theorem involved factorizing in and attempting to leverage facts about the ring to prove that any integral solution cannot be of the form for . In fact, this technique can be made to work as long as is a PID. In fact, it works for a prime if , where is the class number of . Thus, suddenly the desire to solve Diophantine equations leads one to ask somewhat nuanced questions about various rings of integers and, in particular, an understanding of their class numbers.
It was a great insight of the movers and shakers of the time that there should be a connection between a number field having trivial class group (i.e. being a PID) and abelian extensions of . Namely, one expects non-trivial elements of to be witnessed by finite abelian extensions of . This can be made precise, for example, by the existence of the Hilbert class field of which is an abelian extension , unramified everywhere (including the infinite places!) and such that —it is characterized as being the maximal everywhere unramified abelian extension. But, this is only a posteriori motivation since one needs most of the class field theoretic machinery to justify the existence and basic properties of .
So, how can we give an intuition about why one should expect a relationship between and abelian extensions without assuming the contents of the subject we are trying to motivate? Well, one motivation, I think, could be the following. Let us fix a number field , and suppose (for simplicity), that . Consider then
(where is the usual scheme-theoretic non-vanishing set of ). Note then that, in fact, and that we have a surjection
Thus, a natural thing to do to see if is non-zero is to check whether is non-zero. But, note that has a very natural connection with abelian extensions of . Namely, by general Kummer theory the fact that is a scheme over implies that we have a surjection
But, corresponds exactly to cyclic extensions of unramified outside of with degree dividing . Thus, a non-trivial element of which comes from a non-trivial element of produces a non-trivial abelian extension with constrained ramification.
This gives an indication that the abelian extensions of (even those with constrained ramification) are capturing properties of the class number of and, intuitively, should bear witness to the non-PIDness of .
So, from the desire to study Diophantine equations we are pushed to study the abelian extensions of . But, more particularly, we are prompted to study the abelian extensions with a view towards their ramification, or, lack thereof. But, the ramification properties of an abelian Galois extension are captured entirely by local data.
More specifically, let us assume that is a finite abelian extension and is a prime of . Then, we can consider the decomposition group defined to be for any prime of lying over (in general two different choices of will result in conjugate subgroups of the Galois group, but this is unimportant since we’re abelian). Then, we know that and its inertia subgroup are abelian groups capturing all the ramification and inertial data at . But, by basic number theory we have the following commutative rectangle of isomorphisms
where are the residue fields, and is the Galois group of the completions.
Note then that is an abelian extension of local fields. Thus, our desire to understand Diophantine equations led us to consider the class group, which led us to consider abelian extensions of number fields with constrained ramification, which led us to study abelian extensions of local fields. This is the setting for local class field theory.
In particular, the goal of local class field theory is to try and understand the arithmetic of the abelian extensions (where now is local) by ‘internal to data’. This is not a term which I can define in a literal sense other than to say that one should be able to answer questions about the number theory of entirely in terms of objects involving objects sitting inside of/related to objects inside of . For a concrete example of this internal type description, at least in the global case, see this post.
Explicitly, we shall try to understand the abelianized Galois group of in terms of the topological group . Of course, we will give a very explicit and rigorous definition of what we mean by ‘understand in terms of ‘, but for now, let us start with an analogy.
Suppose that we wanted to understand the finite abelian extensions (or, for finite fields, equivalently, all finite extensions) of the field . Well, we know precisely what the lattice of finite subextensions of looks like. For each we have a unique extension of degree , and if and only if . In other words, the lattice looks precisely like lattice associated to the divisibility ordering on . But, this is precisely the same as the lattice of finite index open subgroups of (with the discrete topology) with corresponding to .
Thus, we have an ordering preserving bijection between the open finite index (here these adjectives are really unnecessary) subgroups of and the finite abelian extensions of . Moreover, we have the property that for any two comparable integers and , say , we know that
and thus this bijection between the lattice of finite abelian extensions of also preserves the relative ‘quotients’ of these objects.
That said, there is another group which has this precise property—the abelianized Galois group of ! Indeed, has a property that its open finite index subgroups correspond precisely with the finite abelian extensions of and in such a way which preserves quotients. Thus, we see, at least insofar as finite abelian extensions go, the group ‘approximates’ . In fact, we can make this more explicit. We have an injection
(here is the arithmetic Frobenius ) sending open finite index subgroups to open finite index subgroups, and in fact, inducing a bijection of such objects. This is great since the group is patently simpler than the group , and when it comes to understanding finite abelian extensions the above says we can use interchangeably.
Similarly, for a general (assume -adic for the sake of simplicity) local field we would like some sort of simplifying approximating subgroup which captures all of the data of finite abelian extensions. We would then like to relate this group to giving us, as desired, an ‘internal to ‘ characterization of the finite abelian extension (the group is certainly internal to !).
This is precisely what class field theory gives us. Namely, we will define a continuous group map such that induces a bijection on finite index open subgroups. We will then show that there is an isomorphism of topological groups, the Artin reciprocity map,
and then, finally, to round things off, we’ll give a characterization of the finite index open subgroups of .
Motivation from the Langlands program
One often times hears the Langlands program described as ‘non-abelian class field theory’. In other words, a desire to understand the non-abelian extensions of a number field, and their ramification behavior, in terms of some sort of ‘reciprocity data’ similar to the Artin reciprocity map described above (or, more correctly, its global analogue). How this manifests itself concretely is that we’re interested in understanding so-called Artin representations of . Namely, continuous Galois representations .
In what way does the representation theory of help us understand the ramification behavior of extensions of number fields? Here is a concrete, example. Consider the polynomial . This defines an -extension of , and thus we have no hope of trying to understand the ramification behavior in terms of the (abelian) class field theory we discussed in the last section, and no hope of trying to understand it in terms of something as simple as congruence conditions (see, again, this post). That said, there exists a representation which essentially dictates the splitting behavior of this extension. Sounds complicated? It’s not. Choose a faithful embedding of and consider the representation obtained by first quotienting to and then composing with . Then, note that for almost all primes we have that is unramified, and thus there is a well-defined element and splits in if and only its image under is trivial.
But, this is unsatisfying in some sense. We have not reduced the problem at all since, almost definitionally, understanding when is zero is equally as hard as understanding when (i.e. when is split). But, if we could understand the representation in terms of some other object, one in which we had a better grip on, then perhaps this representation theoretic point of view would be a little less vapid.
The idea is that, roughly, we want our ‘algebraic representation’ (as defined above) to be equal to an ‘analytic representation’. What one means by ‘algebraic’ and ‘analytic’ is very complicated, so let us suffice ourselves with an example. In the above case of , what type of ‘analytic representation’ one might equate to our defined ? Well, it can be shown (see, for example Buhler’s Icosahedral Galois Representations) that there exists a weight cuspidal eigenform whose associated Galois representation is equal to our . I won’t go into what the definition of is (it’s especially nuanced for weight forms), but let me explain the upshot of this conclusion. One can show that the character (nebentypus) of has conductor , and thus one can conclude from the statement that that for (all but finitely many) primes of , the prime splits in if and only if
where is the -coefficient of . Thus, we can understand/attack the splitting behavior of by appealing, essentially, to the analytic object which opens us up to a whole new universe of techniques.
Remark: The astute reader will notice a glaring issue in my above statements. Namely, I never fixed (I just said take to be some faithful representation). I am skipping over some important, but technical details. Namely, I am really considering a faithful representation of a -fold cover of , and specifying by its projectivization (along with some other properties). To see the full technical details, again, consult Buhler’s book.
Before we relate all of this back to (abelian) class field theory, let us quickly make a remark. Somewhere in the above progression of ideas we went from trying to understand the ramification behavior of (i.e. the ramification indicies/residual degrees of all primes) to just understanding which primes split (meaning split completely). Moreover, we were only able to do this for all but finitely many primes. Thus, we seem to have not fulfilled our goal. But, in fact, we have, in a very strong sense. Indeed, recall that for an extension of number fields one defines the set to be the set of primes of which split in . It is then essentially a consequence of Chebotarev density that for two Galois extensions one has that (literal equality) if and only if and agree at all but finitely many primes. Thus, we see that understanding the analytic set literally determines as a (Galois) extension of . So, we’ve lost nothing in restricting our attention to splitting behavior.
So, hopefully at this point one believes that understanding number fields, and their ramification data, can be well-understood by studying representations and when these representations are ‘analytic in nature’. But, it’s not at all obvious how this relates to global class field theory, let alone the local class field theory we are after. To make precise how class field theory is “the case of the Langlands conjecture” (where the Langlands conjecture is roughly the statement that all reasonable Galois representations are ‘analytic’) we would need to actually define the objects that come into play. So, instead, let us suffice ourselves with an understanding of how global/local class field theory is a statement about representation theory at all.
This is not nearly as difficult as it may sound. Namely, the “” part of the statement that local class field theory is part of the Langlands program refers to the dimension of representation. Namely, we should be looking at characters of the Galois group. So, continuous homomorphisms . But, as is the case with all Galois representations, this character comes bundled with a bunch of ‘local data’. Namely, for all primes of we have an induced character
(the first embedding is only defined up to conjugacy, but this is unimportant since our target is an abelian group).
But, there are two things to notice about this local data . First, since is abelian (and the homorphism is continuous) this character factors through . Moreover, since is profinite it’s not hard to see that the map must also factor through a finite quotient. Thus, our character actually factors through a finite abelian quotient of . But, by local class field theory, this should correspond to a character of . Thus, class field theory is morally (and, rigorously) equivalent to a bijection
which makes the connection between local class field theory and representation theory. Moreover, while we haven’t defined ‘analytic representation’ it’s believable that representations of might be ‘analytic’ in nature. Namely, is a locally profinite group, and so has well-defined Haar measure, as well as a reasonable theory of analytic functions. This makes believable the jump from connecting local class field theory to representaiton theory to connecting it with the Langlands program.
So, we’ve discussed the historical origins/desire for something like local class field theory in studying Diophantine equations, and how this fits into the much broader goal of the (local) Langlands program. But, despite all of this discussion, we haven’t indicated whether or not we have actually completed Hilbert’s goal. Namely, remember the wording: we want to describe abelian extensions in a way similar to the Kronecker-Weber theorem. For Hilbert this meant describing the abelian extensions in terms of values of special functions (e.g. are the rational values of ).
One places where this is possible is in the theory of complex multiplication, a key motivation for many aspects of the Langlands program and related areas. The theory of complex multiplication says that every abelian extension of an imaginary quadratic number field is contained inside one of the form where is an elliptic curve with complex multiplication by . Thus, again, we obtain abelian extensions by adjoining values of special functions (here the special function being the -function).
So, one might seek to extend this type of result to describe arbitrary abelian extensions of number fields and/or local fields. The great realization of Lubin and Tate was that one could actually create such a uniformization, at least in the local fields case, if one reinterprets what it should mean to analogize the Kronecker-Weber theorem, then one can, in fact, find a satisfactory solution. In particular, one can also think about the roots of unity as the torsion on the algebraic group . So, one might think that one can obtain the abelian extensions of a general local field by adjoining roots of other algebraic groups.
This is almost correct. What Lubin and Tate showed is that for every local field with uniformizer there is an associated formal Lie group with a ring map such that one obtains all the ‘interesting’ abelian extensions by adjoining to all the -torsion in .
There should be clarification on the phrase ‘interesting’. For any local field with residue field one has an easy way to build abelian extensions of . I speak, of course, of the unramified extensions. These are all obtained by adjoining roots of unity with . Thus, we have sitting inside the maximal abelian extension the maximal unramified extension
and by ‘interesting’ extensions we essentially mean the ‘complement in of ‘. In other words, the ‘ramified part’ of the maximal abelian extension.
The last sentence in the last paragraph brings up an important, and somewhat subtle point. Namely, if what we’re looking for is a ‘complement’ of in , why not just do the obvious? Namely, why not just define the ‘maximal abelian totally ramified extension’ of , call it and take this as a complement? While this seems reasonable, it fails from a simple issue. Namely, the compositum of two totally ramified extensions need not be totally ramified. So, this ‘maximal totally ramified abelian extension’ doesn’t literally make sense. So, we have to be more sophisticated to come up with a complement, and to describe it explicitly.
With this technical subtleties highlighted, let us explain what we mean by a ‘complement’ to the unramified piece. To make this precise note that implicitly in our discussion above, we have claimed that (the profinite completion of ). This should not be shocking— was supposed to be an analogy in of . But,
and since is already profinitely complete (we’ll remark more on this later), we may deduce that
Now, it stands to reason that since that is the fixed field of in under this isomorphism. Thus, the fixed field of should be a linearly disjoint subextension with and . It is the field for which Lubin and Tate describe in terms of the torsion of a formal group.
Remark: Why have we denoted this extension by —why ? Well, not shockingly, will stand for a uniformizer. The dependence on this uniformizer is contained in the fact that the decomposition
is not canonical. In fact, it depends precisely on the choice of a uniformizer of . This is where this mysterious comes from.
As an example of this procedure, let’s consider the case when . Then, it will turn out that and thus the -torsion is just . Thus, is and thus, from this Lubin-Tate theory, we recover the classic statement of the local Kronecker-Weber theorem:
where, as one expects, is the maximal cyclotomic extension of .
The last thing that should be commented on is how this Lubin-Tate theory fits into the general framework of the local Langlands program. It turns out that, despite the apparent ad hoc nature of (abelian) Lubin-Tate theory, it is precisely this approach to class field theory (opposed to the cohomological one described below) which generalizes to the non-abelian setting. Namely, the local class field theory is essentially the local Langlands conjecture for (in the ways indicated above). The local Langlands conjecture for is now a theorem (due, ultimately, to Michael Harris and Richard Taylor) in general. And, it turns out, that the main statement of their work is that the local Langlands conjecture for (and, incidentally, the Jacquet-Langlands correspondence) can be realized in the cohomology of the Lubin-Tate tower. Whatever this means, the appearance of the names Lubin and Tate is not a coincidence since this is a strict generalization of the Lubin-Tate theory for local class field theory.
The statements of local class field theory
Statement and general corollaries
Before we begin discussing some of the key elements to the proof of local class field theory, let us begin by stating rigorously its two main statements. For the remainder of the post we’ll be sticking, mostly for convenience, not necessity, to -adic local fields. So, let’s fix a -adic local field , with integer ring , maximal ideal , valuation , and residue field .
Theorem 1(Local class field theory):
- There exists a unique continuous homomorphism such that the composition
is the map and such that for every finite abelian extension the composition
is surjective with kernel .
- Every finite open subgroup of is of the form for some finite abelian extension .
We call the local Artin reciprocity map. Let us explain some consequences of this theorem.
First, let us explain some properties of the map which follow, not immediately, from its defining properties.
First, let us observe that has dense image. Indeed, it is an elementary excercise in topological group theory that for a profinite group , a continuous homomorphism (for a topological group) has dense image if and only if for all open finite index subgroups the composition is surjective. Now, all the finite index subgroups of are of the form for some finite abelian extension . Thus, the second property of 1. implies precisely that surjects onto all finite topological quotients, and thus has dense image.
But, a much stronger statement is true:
Corollary 2: The local Artin reciprocity map induces an isomorphism .
Before we prove this corollary, we need the following lemma:
Lemma 3: Let be a local field. Then, all finite index subgroups of are open.
Proof: Note that if is a finite index subgroup with , then . Thus, it suffices to prove that is open for all since a subgroup of a topological group is open if and only if it contains an open subgroup. To prove this result we note, again, that for , and so it suffices to prove this result for . Finally, note that since it suffices to show that is open for .
But, for the exponential function defines a topological group isomorphism between and . But, note that for this latter group is divisible by , and thus, since the exponential function is an isomorphism of topological groups, is contained in . But, is open, and thus is open as desired.
Thus, we see that the term ‘open’ in the local existence theorem is superfluous.
Remark: This is one place where we are using that is -adic. The result that all finite index subgroups are open fails in the case of function fields. Clearly the proof fails since there is no function field analogue of the exponential map, but the result is also false.
So, with this, we can prove the corollary:
Proof(Corollary 2): Let us recall that the profinite completion of a group is the unique group initial amongst those profinite groups with maps to . To this end, let us suppose that is a profinite group and is a group map. We need to show that there is a continuous group map . Now, note that
as runs over the finite index open subgroups of . Thus, our map corresponds to maps . Now, since is finite this map must factor through a finite quotient. Using the local existence theorem (together with Lemma 3) we see that, in particular, we have a factorization
for some finite abelian . Now, by the properties of the local Artin map we obtain a map
and thus a map . Moreover, note that this map is necessarily continuous since the map , with the discrete topology, is continuous.
Now, note that by combining the properties of the local Artin reciprocity map, and the local existence theorem, we can see that is an order reversing bijection between the finite abelian extensions of and the finite index subgroups of . From this, it’s easy to see that the maps we’ve constructed form an inverse system, and thus we get a continuous map factorizing the map . Moreover, since has dense image in it’s clear that this factorization is unique. The conclusion follows.
Let us note the following observation we made during the proof:
Corollary 4: The map is an order reversing bijection
such that is isomorphic to .
Now, before we continue, let us stop and wonder in amazement at Corollary 2. It tells us that if is an extension of degree , then there is an isomorphism of topological groups:
where denotes the -power roots of unity in , and the last two groups on the right have the discrete topology.
As an example, it tells us that
where we have used that and used the fact that is profinitely complete, as follows from Lemma 3. This is an astounding fact. There is no reason why one would hope, let alone expect, that we could have such a simple description of the group which dictates the abelian extensions of —wow!
One thing we have not used yet is the first property of the local Artin reciprocity map. But, that gets put to use, essentially, to prove the following:
Corollary 5: Under the isomorphism the fixed field of is . More generally, the filtration
corresponds to the filtration of by abelianized higher ramification groups.
Note, again, we’ve implicitly used the fact that is profinitely complete, which is true, again, by Lemma 3.
Proof: Note that the fixed field of is the same as the fixed field of the image of under . But, by assumption we have that the composition of with is the valuation map. In particular, the kernel is , which proves the proposition.
The second claim follows by precisely the same idea.
Thus, we are naturally led to ask what the fixed field of is—a field extension with .
The Weil group
We have certainly stated the fundamental results of local class field theory above, in particular having shown that ‘approximates’ in the same way that approximates (in particular classifying the finite quotients). But, we have yet to describe the ‘canonical subgroup’ of which really is the ‘approximating subgroup’ and which the Artin map just provides an isomorphism with.
So, let us attempt to define this group. As the notation suggests, we will actually define a subgroup (important in the general formulation of Langlands) for which our group is just the (Hausdorff) abelianization. We will take a somewhat lowbrow approach in defining , deciding to ignore the more general notion of such groups as defined in Tate’s Number Theoretic Background article in the Corvallis proceedings.
So, let us begin by thinking about what we might want to do. Namely, if is supposed to be the ‘approximating subgroup’ of , then we would hope/expect its image under any surjection to be an ‘approximating subgroup’ in the quotient. In particular, we’d hope that the image of in to be the approximating subgroup we’ve already mentioned for finite field Galois groups—the integers . Thus, we expect there to be a continuous surjection
and we expect the kernel to be the same as the kernel of the reduction map —the inertia group .
So, we expect to have a short exact sequence
which essentially forces to be the preimage of under . Ok, cool. But, we expect this above sequence to be a short exact sequence not just of groups, but of topological groups. So, in particular, since is discrete we should require that , the kernel of this map, to be open. Of course, in the subspace topology this is not the case. If it were open in then the map would have open kernel, making be discrete, which it isn’t.
Thus, we see that to make perform as desired, it will be a subgroup of , but will have a different topology than the one inherited from . In particular, we give the unique topology making continuous and for which has the topology inherited from . We call this topological group the Weil group of .
Note that while the map is not an embedding of topological groups, it’s an injective continuous group map with dense image in . Why does it have dense image? Well, recall that a subset is dense if and only if for all finite Galois extensions the image of in is everything. To see that this is true for merely note that while is not surjective (i.e. Frobenius doesn’t generate everything in infinite extensions) it is true that this map composed with any finite quotient is surjective with kernel .
Said slightly more rigorously, let be any finite Galois extension. Then, it’s clear that we have the following commutative diagram of maps
where is the inertia group in and is the residue field. This diagram allows us to easily conclude that surjects onto (since the outer upward pointing arrows are surjective) and thus prove that the image of in is dense.
Moreover, note that the local Artin reciprocity map takes its image in the image of . Indeed, note that the first property of the Artin map implies that the composition has image in the preimage of in which implies its image lies in .
But, even more is true:
Theorem 6: The local Artin reciprocity map provides an isomorphism of topological groups .
Note that one obtains, from the above observation, a group map from the local Artin reciprocity map. We need just explain why it’s continuous, bijective, and closed. Now, to see why it’s continuous, we need really only check that the preimage of is open since, after all, the map is continuous, and differs from the subspace topology only because is not open. But, note that the preimage of is the same thing as the kernel of the composition
but, by the defining properties of the local Artin reciprocity map, this preimage is just which is, in fact, open. The bijectivity follows by just considering the map on finite levels (quotients by open subtroups). Finally, we can conclude that the map is a homeomorphism since our continuous bijection corresponds to, when we decompose our groups as products of by closed subgroups, to continuous bijections on these closed subgroups. But, then we may conclude that the maps are homeomorphisms since continuous bijections between compact Hausdorff spaces are always such. The conclusion then easily follows.
Now, even though the above was instructive, it’s unclear as to why we went through so much effort. How does the observation that the local Artin reciprocity map is really providing an isomorphism helpful? In fact, why even define in the first place—it doesn’t provide that much more content than the statement of local class field theory by itself.
The reason is one of foresight. Namely, even though the Weil group plays only a minor theoretical role in the discussion of (abelian class field theory), the discussion of the character theory of , it plays a much more substantive role in higher dimensions. In fact, the true statement of the local Langlands conjecture for speaks not of Galois representations but, instead, of representations of (more rigorously, Weil-Deligne representations). So, having stated the above isomorphism is, once one understands what this really means, the proof that local class field theory is ‘local Langlands for ‘.
But, a less highfalutin reason is that it’s just plain nice to know what the image of is. To understand the approximating group without having to understand all of local class field theory. The content of Theorem 6, in this light, is just saying that the approximating group inside of does, in fact, have an ‘internal to ‘ description.
We begin with the more classical cohomological proof of local class field theory. But, it should be remarked that our discussion is certainly not historical in the following sense. The first proof of local class field theory actually, somewhat counterintuitively, reduced to the global case first. There people did not think in modern terms (i.e. in terms of adeles) and instead phrased everything in terms of class fields (thus the name of the subject).
To make the following more palatable, we give a brief outline of the steps needed to complete a cohomological proof of local class field theory:
- Show that there is a pairing , where is the Pontryagin dual of and is the Brauer group of (see below).
- Compute to be and, in the process, show that every element of is in the image of .
- From steps 1. and 2. we obtain a map , but the right hand object, by Pontryagin duality, is just . This map is the local Artin reciprocity map.
- Show that the local Artin reciprocity map has the desired properties, and show the existence theorem by hand.
The ‘cohomological’ part of the proof will come in step 2., in particular, in understanding the Brauer group of .
As stated in the introduction, our goal is really just to have an understanding of how the objects of local class field theory are defined. So, we’ll focus entirely on steps 1-3, deferring the check that the defined map ‘works’ to the more serious treatments of the subject.
Step 1: the cyclic algebra pairing
The Brauer group
The Brauer group is an object of deceivingly great importance in both number theory and algebraic geometry. It’s so deceiving, in fact, that upon first hearing its definition, one is liable to get glazed eyes, and completely forget the definitions. This is especially true if one only ever considers the Brauer group of a field, which belies the true geometric nature of the object. To get a sense of their more interesting geometric content see, for instance, this post.
Let us fix a field throughout this section. We do not assume that is local, or even of characteristic .
So, let us define a central simple algebra to be a finite-dimensional unital (but not necessarily commutative) algebra such that its center is , and which has no non-trivial two-sided ideals.
There is essentially only one example of a central simple algebra over . Let us recall that a central division algebra over is a unital finite-dimensional -algebra such that every non-zero element has a two-sided inverse. Then, for any central division algebra the algebra of -matrices over is a central simple algebra over . This comes from the basic fact that all two-sided ideals of (for any ring ) are of the form for a two-sided ideal .
Let us give some concrete examples:
- Over any field , we have the trivial example of a central division algebra. Namely, itself. This gives rise to the ‘family’ of trivial central simple algebras over given by .
- Over one has the Hamiltonian quaternions which gives rise to a non-trivial central division algebra over .
- Over , Wedderburn’s little theorem says that is the only central division algebra over .
- Much simpler, if is algebraically closed, then all central division algebras over are trivial. Indeed, if is a central division algebra and , then is a finite field extension of and so must be itself—contradiction.
- Generalizing the Hamiltonian quaternions, one can define for any two the quaternion algebra .
When we said that central simple algebras of the form were representative of the general case of central simple algebras over we weren’t exaggerating:
Theorem 7(Artin-Wedderburn): Let be a field, and a central simple algebra over . Then, for a unique central division algebra .
So, as a corollary, we see that if is a finite field or an algebraically closed field, then all central simple algebras are of the form .
Now, we can form a group , called the Brauer group of , as follows. We call two central simple algebras and equivalent if for some . Then, as a set, is just the equivalence classes of central simple algebras over .
To define the group structure, we need the following lemma which is, essentially, an exercise in non-commutative algebra:
Lemma 8: Let and be central simple algebras over . Then, is a central simple algebra over .
This endows with the obvious group operation of tensor product. This is well-defined since the following identity holds:
as one can easily check.
One can shift the complicatedness of the set to complicatedness of the group operation. Namely, by the Artin-Wedderburn theorem, every central simple algebra over is equivalent to a unique central division algebra over . Thus, as a set, is nothing but the central division algebras over . But, in this guise the group operation is not precisely obvious. Namely, given central simple algebras how should we define —a new central simple algebra.
Of course, the answer is not since this is not, in general, a central division algebra over . Instead, we use the fact that is a central simple algebra over , and thus of the form for some central division algebra . We then define . One can check that the map
from the isomorphism classes of central division algebras over to (where stands for the equivalence class of ) is an isomorphism of groups.
Note that our observation following the Artin-Wedderburn theorem now allows us to conclude that is trivial in case is either finite or algebraically closed.
A different perspective
We now seek to give a different description, a cohomological description, of the Brauer group. To this end, let us record the following theorem:
Theorem 9: If is a central simple algebra then is a central simple algebra over for any field extension .
Thus, we see that if is a central simple algebra, then is a central simple algebra over . Thus, from previous observation we may conclude that is of the form for a unique , called the degree of .
In fact, we actually have a converse to this observation:
Theorem 10: Let be an algebra over . Then, is a central simple algebra over if and only if one of the following equivalent properties holds:
- for some .
- There exists a finite extension such that .
- There exists a finite Galois extension such that .
This allows us to get a different perspective on central simple algebras. Namely, the above indicates, that in fact, is just . Thus, we can think about about central simple algebras in terms of Galois cohomology.
In particular, recall that for an object a twist of is an object such that . It then follows from the general yoga of torsors that the twists of (up to -isomorphism) are classified by .
While this is not totally rigorous (check the link for a more detailed account), this tells us that algebras which become isomorphic to , in other words degree central simple algebras over , are classified by (the action being the entry-wise one). Here we have used that the algebra automorphisms of are just —this is a special case of the Skolem-Noether theorem.
The rough intuition for this correspondence is as follows. Given a degree central simple algebra , we know from above that is . We want to be able to go back to by some sort of ‘quotient operation’, something similar to just taking -invariants. The data needed to perform such an operation turns out to be precisely the data of a -cocycle—an element of .
Now, there is a natural map
which takes a pair of -cocycles valued in -matrices and -matrices (up to scalar multiplication in both cases) and outputs the -cocycle valued in -matrices (up to scalar multplication) by taking the tensor product (i.e. the Kronecker product):
where the direct limit is taken with respect to the obvious inclusions
a group structure.
Now, one can soup up our observation about twists to the following theorem:
Theorem 11: The association taking a central simple algebra over to its associated -cocycle (being a twist of ) is an isomorphism of groups.
This theorem is nice because it opens up the study of the Brauer group to cohomological methods of attack. But, it has the serious downside that we’re taking cohomology of a non-commutative -module. In particular, the individual objects are not groups, they are just pointed sets. We only obtain the group operation only in the limit.
Thus, it would be nice if we could trade in this cohomology group for a cohomology group valued in some commutative -module. It turns out, somewhat surprisingly, that this is the case. The key being the following short exact sequence -modules:
which gives us the following exact sequence (coming from the long exact sequence in cohomology):
But, note that the first term of this is zero. This is essentially a strengthening of Hilbert’s theorem 90—this group should be classifying vector bundles on which become isomorphic to over , but there is only one such vector space: !
Thus, we have obtained an injection
The non-trivial fact is then then following:
Theorem 12: The direct limit map
from the map on finite pieces described above, is an isomorphism of groups.
Thus we have, as desired, reduced the study of the Brauer group to the study of the cohomology of an abelian -module. This isomorphism is the only practical way to compute for many fields .
As an example, let us explain why , with non-trivial element being represented by the Hamiltonian quaternions . By Theorem 12 we need only consider the following computation:
this being done by the usual resolution associated to a cyclic group.
This will also be the method by which we compute the Brauer group of a local field.
The pairing: cyclic algebras
Now that we have discussed the general theory of the Brauer group, we can describe the cyclic algebra pairing:
mentioned above. Again, here, denotes the Pontryagin dual of , meaning that it’s . But, since is profinite we can describe is an alternate and more useful way.
Namely, note that has an open neighborhood of which contains no non-trivial subgroups. Then, for any continuous homomorphism we have that contains an open subgroup (since the group is profinite). Now, is a subgroup of and so, by construction, trivial. Thus, factors through , a finite group.
Remark: This is a trivial incarnation of the no small subgroups argument. For an interesting read on this topic, I suggest this post of Terry Tao.
From this, we see that the image of any element of lies in the torsion subgroup of , which is the roots of unity. This is, as an abstract group, just . But, note that since factors through a finite quotient, it’s continuous with having the subspace topology of if and only if its continuous with endowed with the discrete topology.
So, summarizing, we may conclude that can be alternatively thought about as where is given the discrete topology. This is the form which will be most useful to us.
So, now, we want to construct for any pair a central simple algebra, denoted . To define , note that for some finite abelian extension . In fact, since the only finite subgroups of are cyclic, we actually have that produces an isomorphism
and we single out which maps, under , to .
We then define to be
where is the non-commutative polynomial algebra with commutation relation . This is clearly a central -algebra since if then (for obvious reasons!) and which forces . But, one can also show that is also simple, and thus defines an element . We call an algebra of the form a cyclic algebra.
It is this pairing
which will play the pivotal role in our discussion of local class field theory. Sticking to customary notation, we shall write just for the class of highlighting the fact that we’re dealing with a pairing .
Step 2: Brauer group computation
The next step in the cohomological proof of class field theory is to actually compute the group where is a (-adic) local field. Now, by what we discussed in the last section, this is equivalent to computing the Galois cohomology group or, equivalently, . This is a highly non-trivial task.
The first key observation one must make involves how ‘deep’ into one must go to split central simple algebras. More specifically, let us call a central simple algebra split if it’s isomorphic to a matrix algebra over . Or, said differently, if the class is trivial. Let us then say that a field extension splits if is split. Note that by Theorem 10 there is always a finite Galois extension splitting any central simple algebra .
Let us denote the set of central simple algebras over split by by . This is clearly a subgroup of and, in fact, is just the kernel of the map
Moreover, whereas , one can easily check that is just .
So, the aforementioned observation which serves as a first step in the computation of is:
Theorem 13: Every central simple algebra is split by . Or, said differently, .
While not entirely obvious, this part is essentially just some simple non-commutative algebra combined with a basic understanding of unramified extensions.
The true ‘meat’ of the computation comes from the following fact:
Theorem 14: For all unramified, .
Once one has these two theorems, the proof that is actually not overly complicated. Indeed, begin by noting that by Theorem 13 we are really just trying to compute
But, note that since is, well, unramified, we have a splitting of sequences -modules:
given by choosing a uniformizer. From this, we know that
where the second equality follows from Theorem 14.
Thus, we are reduced to proving that is . But, note that, again by unramifiedness, is a trivial -module. So, we’re just trying to compute the continuous group cohomology where has a trivial structure.
The key to this computation is the following short exact sequence of groups:
thought about as a short exact sequence of trivial -modules. Passing to the long exact sequence in cohomology we get the piece
But, since is uniquely divisible (i.e. is an isomorphism) the modules are uniquely divisible. But, for they’re totally torsion which, of course, implies they are zero. Thus, we obtain an isomorphism between and .
But, this gives us precisely what we want. Indeed, since is a trivial -module, we know that
But, since is pro-cyclic, any continuous map out of it is determined by a topological generator—namely, Frobenius. Thus, the map
is an isomorphism, as desired.
Of course, this doesn’t give one a very explicit description of . Namely, what is the division algebra associated to an element ? It turns out that there is a very explicit way of describing it, in fact. In particular, for such an with, say and the two coprime, we can create the cyclic algebra where is the character which comes from composing the quotient map , where is the unique unramified extension of degree , with the isomorphism
given by sending Frobenius to . One can check that the group of such cyclic algebras creates a copy which must be surjective since doesn’t contain any proper subgroups isomorphic to itself.
As an added bonus, we see that we have shown the the cyclic algebra pairing is actually surjective.
Step 3: construction of the local Artin reciprocity map
With the setup we have developed this step is not particularly difficult and, in fact, was essentially given in the outline. But, for completeness, we give a slightly more detailed account here.
So, combining steps 1 and 2 we have created a pairing
which, of course, gives us a map
But, notice that by construction the kernel of any map obtained in this way is actually open (consider that the image is finite). So, the image of actually lands in where the latter has the discrete (equivalently subspace ) topology.
Then, by an argument similar to what happened with we can see that
One may then appeal to Pontryagin duality to conclude that the group on the right is just . Thus, we have given a map .
Now, as stated before, we won’t check that this map is actually the local Artin reciprocity map, but the task is easier than one may think. Indeed, given the semi-explicit nature of the map, checking the desired properties reduces to understanding cyclic algebras which is fairly routine.
Proof using Lubin-Tate theory
We now move on to the proof of local class field theory using the work of Lubin and Tate in their seminal paper Formal Complex Multiplication in Local Fields.
Again, to make the following discussion clearer, we outline the steps of the proof as follows:
- Show that to any local field with chosen uniformizer there is an associated canonical isomorphism class of formal -modules .
- Study the properties of the -torsion points of and the fields they generate.
- Show that for , the compositum of the , one has the equality .
- Show using 2. and 3. that there is a natural map satisfying the properties of the local Artin reciprocity map.
Again, since our goal is just to explain the construction of the objects, not to show that they satisfy the desired properties, we only discuss steps 1 and 2.
Step 1: Lubin-Tate formal modules
In this section we carry out part 1. of our outline. We begin with the recollection of the notion of a formal module.
Let be an -algebra (thought of as a topological ring with the discrete topology). Then, a (one-dimensional) formal -module is a (one-dimensional) formal Lie group over together with a ring map such that the derivative map
coincides with the ring map defining the -algebra structure on .
Remark: Of course, there is no reason why one needs to assume one-dimensional in the above definition. We restrict ourselves to this case though, so the point is moot.
If one is willing to take the point of view of formal group laws, opposed to formal Lie groups, which is the view we’ll take here, one can phrase the above in more simple terms. Namely, the group will correspond to a one-dimensional formal group law . Then, an endomorphism of will be just an element such that
and the condition on the ring map reduces to the statement that if corresponds to the endomorphism then
or, said differently, .
We shall denote the element of associated to by or .
Now, let us give some examples:
- Consider the formal group law associated to —namely . Then, note that we can make this into a formal -module by just defining . Clearly this is an endomorphism of , and one can check easily that it defines the structure of a formal -module
- A much less trivial example is the following. Consider the multiplicative group corresponding to the group law . Then, we can define a formal group law associating to the series . Of course, one should explain what means for . For such we define it as where denotes the usual which actually lies in since this element of has non-negative -adic valuation. Once again, it’s a simple check that this really does define a formal -module structure on .
- It is shown in Silverman’s Arithmetic of Elliptic Curves that the formal group law associated to an elliptic curve has the structure of a formal -module.
But, there is, in fact, a natural way to build formal -modules which was first observed by Lubin and Tate in there aforementioned paper. Let us fix a uniformizer of and let be the set of power series subject to the following two conditions:
where, recall, was the size of the residue field of .
The first result towards a proof of local class field theory is then the following:
Theorem 15: Associated to any is a unique formal -module such that . Moreover, for any two the formal -modules and are isomorphic.
The key to this proposition is the following exercise in algebra:
Lemma 16: Let and let be a homogenous linear polynomial. Then, there is a unique such that
Proof: We build iteratively by constructing it modulo for increasing . Well, by assumption we must have that . So, assume that we have a so that modulo . We must build a unique satisfying the obvious property.
We begin by noting that modulo . Indeed, since this is merely the statement that
which is true since we’re in characteristic . Thus, we see that modulo we have that and must differ by for some homogenous polynomial of degree . Divide by (a unit in ) and call this polynomial . Finally, define to be .
We now must check that satisfies the desired property. Begin by noting that is just modulo that
Similarly, since one can see that
where we have used the fact that is homogenous of degree . So, finally we check that
Thus, we have defined power series of degree for all satisfying
Letting be the unique power series having modulo we see that is the desired power series. Moreover, it’s clear in each step of our construction that , and thus is unique.
So, now we can proceed with Theorem 15:
Proof(Theorem 15): Let . By Lemma 8 there exists a unique such that and . We claim that this is a formal group law. But, since we must only check associativity and commutativity. But, this follows from the unicity in Lemma 16.
Specifically, to see that note that modulo both sides are . Moreover, it’s easy to see that they commute with in the sense of Lemma 16 which implies, by unicity, that they are equal. A similar technique works to show that is commutative.
Now, to define the -module structure on note that, again, by Lemma 16 there exists a unique such that modulo it is , and commutes with . In particular, one sees that must be itself. Checking that this actually defines the structure of a formal -module is again, just an exercise in the unicity part of Lemma 8 and is left to anyone interested in writing it all out.
Finally, we need to explain why two give isomorphic formal -modules and . Again, by Lemma 16 there is a unique such that and . One can check, using unicity, that this is actually a homomorphism of formal -modules . Moreover, since it’s leading coefficient is , a unit, it must be an isomorphism.
Let us make the following observation. There is always a simple element of : the polynomial . That said, elements of can take much more complicated forms. For example, the power series is also an example of an element of (working with and ). Indeed,
so showing that . But, also modulo we evidently have that (using the fact that Frobenius is a ring map modulo . This shows, as a consequence of Theorem 7, that the formal -module associated to the choice is since this is a formal group law with .
Note that we cannot modify the case for and to the general setting. Namely, one is tempted to think that for any such pair that the polynomial (which can be given meaning similar to the case over is an element of . Unfortunately, this fails precisely because we have no ‘-Frobenius’, so that modulo there is no reason why this should be anything nice, let alone .
In fact, wanting to work for the pair one will quickly realize that the only hope of proving that it is in is for to be a power of . But, this would force to be exactly . In other words, we need to be an unramified extension of . But, then won’t work again, since modulo it’s where it needs to be . This peculiarity has semi-deep meaning that we will mention later.
Step 2: Studying the extensions
So, let us start putting these formal groups , with , to some use.
So, let’s fix a -adic local field, a uniformizer of , and let, . Denote the formal group law by and the associated formal Lie group by . We define then the extension (temporary) denote the field obtained by adjoining to the elements of the -torsion in .
Let us make this slightly more precise. Recall that the formal group law defines a group structure on the topologically nilpotent elements of the adic ring . But, these elements bear a simple description in this case. Namely, the topologically nilpotent elements of are just the elements of —the maximal ideal. This follows immediately from considering the fact that goes to zero if and only if its absolute value does.
Thus, we have that together with the group operation
is a group. Moreover, note that it actually carries the structure of an -module by defining
which commutes with the group operations on definitionally. This is the -module which we are denoting .
Thus, with this said, it’s reasonable to consider the set
and then consider the field extension . But, from what we said, there is no reason to assume that this field extension has any reasonable properties. In particular, it’s not even clear that is a finite extension. So, our first point of order should be determining its basic properties.
The key observation is that the -modules and and the field extensions do not actually depend on the chosen . One part of this is obvious. Namely, we showed that we have an isomorphism of formal -modules for any two . Thus, it follows that we have an isomorphism of -modules .
To see why the fields and are equal is slightly more complicated. The important fact is that even though the set was defined in terms of the zeros of a power series, namely the power series , they are actually the zeros of a polynomial. This follows from the -adic analogue of the Weierstrass preparation theorem (see Gouvea’s -adic Numbers: an Introduction Theorem 6.3.6).
Thus, and are the result of adjoining the zeros of these polynomials. But, since the modules and are isomorphic they will have polynomials with the same roots—being minimal polynomials as -modules. So, we will drop the in the notation and just denote the module As and as .
This is useful, not only psychologically (i.e. we don’t have to make an arbitrary choice of ), but practically since we can now, in the proof of our theorems, assume we’re working with the element in .
So, let us begin discovering the basic properties of the extensions :
Theorem 17: The module is isomorphic, as an -module, to .
Proof: As noted above, we can assume that we’re working with the polynomial . In particular, it’s clear that is finite. So, evidently a finitely generated torsion -module. So, by the structure theorem for modules over PIDs we know that is of the form
for some integers with .
Now, we note that has -distinct roots ( and the distinct roots of ). Thus, we see that has to have size , and so is evidently . Now, for we have a short exact sequence
where the second map is multiplication by . One should justify why the multiplication by map is actually surjective. It suffices to check that it’s surjective as a map on . But, this comes down to the statement that if we want a solution to . In other words, we want a solution to . But, it’s clear that the roots of this polynomial in have positive valuation and so lie in .
The fact that has size and is cyclic now follows by a simple induction.
Note that we can conclude from the above that
which has size .
We now seek to understand the structure of the field extension :
Theorem 18: The field extension is a totally ramified Galois extension with Galois group .
Proof: Begin by noting that any root of is a root of for . This follows, with our choice of since is the -fold composition of , which has zero constant term.
Thus, we can create a filtration of as follows. First take to be a no-zero root of and consider . Next, take a non-trivial root of . This root is contained in since
and thus we have the containment . Now, take a root of . Again, this is contained in since
Repeating this process we get a filtration
Note that evidently is degree . And, for we have that is degree . This follows since they were constructed from degree Eisenstein polynomials. Thus, we may conclude that .
On the other hand, note that acts faithfully on . Moreover, it acts -linearly since the coefficients of , are fixed by and this group acts continuously (so that we can distribute over the limit). Thus, we get an injection
So, since the right hand side has size we conclude by Galoisness , and thus from the above, we may conclude that and that .
Finally, to see that is totally ramified, note that certainly contains the roots of
(which can be proven easily by induction). But, this is a -Eisenstein polynomial of degree , and thus must be the adjunction to of a root of this polynomial. But, since it’s -Eisenstein, this implies that is totally ramified as desired.
In particular, we see that if we define to be the compositum(=union) of the ‘s then the above shows that is totally ramified extension with Galois group . This is precisely the type of field that we are after—the fixed field of .
Note that also implicit in the proof was the fact that is a norm from to . Working harder (doing an analysis of the units) one can, in fact, show the following:
Theorem 19: The image of the norm map from to is the subgroup of generated by and . The image of the norm map from to is .
The first part of this proposition is a serious amount of elbow grease, and so we omit it here. The second part follows from the first since the norms from are the intersection of the norms from as varies. In particular, we see that the fields are distinct for different .
Let us remark now why it’s not a coincidence that we had issue replicating the creation of a polynomial like that worked for other field extensions. Note that, as we observed before, the formal -module associated to this is . Thus, the -torsion is just and so we see that is just . So, we see that this choice of gives us is . And using the fact that we see that this gives us precisely the local Kronecker-Weber theorem. So, it’s not shocking that we had trouble replicating the creation of such an since it’s existence is integrally linked to the Kronecker-Weber theorem, which shouldn’t hold for other !
So, as stated above, I provide below what I believe to be good sources to learn the above material (more in-depth) from:
Background on number theory:
- J.P. Serre’s Local Fields
- Jürgen Neukirch’s Algebraic Number Theory
Background on non-commutative algebra:
- Tamás Szamuely and Phillipe Gille’s Central Simple Algebras and Galois Cohomology
- Falko Lorenz’s Algebra: Volume II: Fields with Structure, Algebras and Advanced Topics
- T.Y. Lam’s A First Course on Non-Commutative Rings
- J.P. Serre’s Galois Cohomology (for non-commutative group cohomology)
Local class field theory from cohomological point of view
- Kazuya Kato, Takeshi Saito, and Nobushige Kurokawa’s Number Theory 2: an Introduction to Class Field Theory
- J.P. Serre’s Local Fields
- Georges Gras’s Class Field Theory
- James Milne’s Class Field Theory
- Jürgen Neukirch’s Class Field Theory
Local class field theory from Lubin-Tate point of view
- Kenkichi Iwasawa’s Local Class Field Theory
- Emily Riehl’s undergraduate thesis Lubin-Tate Formal Groups and Local Class Field Theory
- James Milne’s Class Field Theory