GTLect1 AbstractGroupTheory 2021
GTLect1 AbstractGroupTheory 2021
Gregory W. Moore
1. Introduction 5
1.1 Equivalence Relations 5
–1–
8.3.3 Torsors And Principal Bundles 121
8.4 More About Induced Group Actions On Function Spaces 127
–2–
11.16.1Some Useful Formulae For Working With Exponentials Of Operators 222
11.16.2Lie Algebras 229
11.16.3The Classical Matrix Groups Are Lie Groups 232
11.16.4Representations Of Lie Algebras 240
11.16.5Finite Dimensional Irreducible Representations Of sl(2, C),sl(2, R),
and su(2) 242
11.16.6Casimirs 247
11.16.7Lie Algebra Operators In The Induced Representations Of SU (2) 249
–3–
15.5.7 Lagrangian Subgroups And Induced Representations 381
15.5.8 Automorphisms Of Heisenberg Extensions 383
15.5.9 Coherent State Representations Of Heisenberg Groups: The Bargmann
Representation 395
15.5.10Some Remarks On Chern-Simons Theory 395
15.6 Non-Central Extensions Of A General Group G By An Abelian Group A:
Twisted Cohomology 395
15.6.1 Crystallographic Groups 400
15.6.2 Time Reversal 403
15.6.3 T2 = (−1)2j and the Clebsch-Gordon Decomposition 409
15.7 General Extensions 410
15.8 Group cohomology in other degrees 414
15.8.1 Definition 415
15.8.2 Interpreting the meaning of H 0+ω 419
15.8.3 Interpreting the meaning of H 1+ω 419
15.8.4 Interpreting the meaning of H 2+ω 420
15.8.5 Interpreting the meaning of H3 420
15.9 Some references 422
–4–
1. Introduction
Historically, group theory began in the early 19th century. In part it grew out of the
problem of finding explicit formulae for roots of polynomials. 1 . Later it was realized that
groups were crucial in transformation laws of tensors and in describing and constructing
geometries with symmetries. This became a major theme in mathematics near the end of
the 19th century. In part this was due to Felix Klein’s very influential Erlangen program.
In the 20th century group theory came to play a major role in physics. Einstein’s 1905
theory of special relativity is based on the symmetries of Maxwell’s equations. The general
theory of relativity is deeply involved with the groups of diffeomorphism symmetries of
manifolds. With the advent of quantum mechanics the representation theory of linear
groups, particularly SU (2) and SO(3) came to play an important role in atomic physics,
despite Niels Bohr’s complaints about “die Gruppenpest.” One basic reason for this is the
connection between group theory and symmetry, discussed in chapter ****. The theory of
symmetry in quantum mechanics is closely related to group representation theory.
Since the 1950’s group theory has played an extremely important role in particle theory.
Groups help organize the zoo of subatomic particles and, more deeply, are needed in the
very formulation of gauge theories. In order to formulate the Hamiltonian that governs
interactions of elementary particles one must have some understanding of the theory of Lie
algebras, Lie groups, and their representations.
In the late 20th and early 21st century group theory has been essential in many areas
of physics including atomic, nuclear, particle, and condensed matter physics. However,
the beautiful and deep relation between group theory and geometry is manifested perhaps
most magnificently in the areas of mathematical physics concerned with gauge theories
(especially supersymmetric gauge theories), quantum gravity, and string theory. It is with
that in the background that I decided to cover the topics in the following chapters.
Finally, the author would like to make two requests of the reader: First, much of what
follows is standard textbook material. But some is nonstandard. If you make use of any of
the nonstandard material in something you write, please give proper acknowledgement to
these notes. Second, if you find any mistakes in these notes please do not hesitate to send
me an email. (But please, first do check carefully it really is a mistake.) These notes are
a work in progress and are continually being updated. Thank you - Gregory Moore.
–5–
3. a ∼ b and b ∼ c ⇒ a ∼ c
Example 1.1.1 : The notion of equality satisfies these axioms of an equivalence
relation. So a ∼ b iff a = b is an equivalence relation. The main point, however, is that an
equivalence relation is a more flexible notion than equality, and yet captures many of the
important aspects of equality.
Example 1.1.2 : X = Z, a ∼ b if a − b is even.
Example 1.1.3 : More generally, let X = Z, and choose a positive integer N . We
can define an equivalence relation by saying that a ∼ b iff a − b is divisible by N .
Example 1.1.4 : At the other extreme from equality we could say that every element
of the set X is equivalent to every other element. This would be the coarsest possible
equivalence relation on the set.
Definition 1.1.2: Let ∼ be an equivalence relation on X. The equivalence class of
an element a is
[a] ≡ {x ∈ X : x ∼ a} (1.1)
Example 1.1.4’ : If we take the coarsest possible equivalence relation then there is
only one equivalence class, namely the full set X itself.
Here is a simple, but basic, principle:
For example, the integers are the disjoint union of the even and odd integers, and the
corresponding equivalence relation is the one mentioned above: a ∼ b iff a − b is even.
–6–
2. Groups: Basic Definitions And Examples
1. G is a set.
2.
∀g ∈ G m(g, e) = m(e, g) = g (2.2)
3.
∀g ∈ G m(I(g), g) = m(g, I(g)) = e (2.3)
The above notation is unduly heavy, and we will not use it. Thus, we give the definition
again, but more informally:
∀a, b ∈ G there exists a unique element in G, called the product, and denoted a · b ∈ G
in other words, we streamline notation by writing a · b := m(a, b).
The product is required to satisfy 3 axioms:
1. Associativity: (a · b) · c = a · (b · c)
∀a ∈ G a·e=e·a=a (2.4)
Remarks
1. We will often denote e by 1, or, when discussing more than one group at a time, we
denote the identity in a particular group G by 1G . The identity element is also often
called the unit element, although the term “unit” can have other meanings when
dealing with more general mathematical structures such as rings.
–7–
2. Also, we sometimes denote the product of a and b simply by ab.
3. We can drop some axioms and still have objects of mathematical interest. For ex-
ample, a monoid is a set M with a multiplication map m : M × M → M which is
associative. And that’s all. If there is an identity element e ∈ M which functions as
the identity for this multiplication then we speak of a unital monoid. The further as-
sumption of inverses turns the monoid into a group. The definition of a group seems
to be in the Goldilocks region of having just enough data and conditions to allow a
deep theory, but not having too many constraints to allow only a few examples. It is
just right to have a deep and rich mathematical theory.
4. We can also put further mathematical structures on the data (G, m, I, e). For exam-
ple, if G is a topological space and m and I are both continuous maps, then we have
a topological group. If G is furthermore a manifold and m and I are real analytic in
local coordinates, then we have a Lie group.
Exercise
a.) Show that e unique. 2
b.) Given a is a−1 unique?
c.) Show that axioms 2,3 above are slightly redundant: For example, just assuming
a · e = a and a · a−1 = e show that e · a = a follows as a consequence.
m(a, b) := a + b (2.5)
–8–
Exercise Subgroups
a.) Z ⊂ R ⊂ C with operation +, define subgroups.
b.) Is Z − {0} a monoid (with m given by standard multiplication) ?
c.) Is Z − {0} ⊂ R∗ a subgroup?
d.) Let R∗>0 and R∗<0 denote the positive and negative real numbers, respectively.
Using ordinary multiplication of real numbers, which of these are subgroups of R∗ ?
e.) Consider the negative real numbers R<0 with the multiplication rule:
Show that this defines a group law on R<0 , but that (R<0 , m, ...) is not a subgroup of R∗ .
3
Definition 2.3: The order of a group G, denoted |G|, is the cardinality of G as a set.
Roughly speaking this is the same as the “number of elements in G.” A group G is called
a finite group if |G| < ∞, and is called an infinite group otherwise.
Already, with the simple concepts we have just introduced, we can ask nontrivial
questions. For example:
Does every infinite group necessarily have proper subgroups of infinite order?
This is of course true of the examples we have just discussed. It is actually not easy to
think of counterexamples, but in fact there are infinite groups all of whose proper subgroups
are finite. 4
µN = {1, ω, . . . , ω N −1 } (2.8)
3
Answer : The point is that the multiplication of (2.7) is not the restriction of the multiplication on R∗
to the negative reals.
4
One example are the Prüfer groups. These are subgroups of the group of roots of unity. They are
defined by choosing a prime number p and taking the subgroup of roots of unity of order pn for some
natural number n. Even wilder examples are the “Tarski Monster groups” (not to be confused with the
Monster group, which we will discuss later). These are infinite groups all of whose subgroups are isomorphic
to the cyclic group of order p.
5
The natural numbers are the same as the positive integers.
–9–
where ω = exp[2πi/N ]. This is a finite group with N elements, as is easily checked.
Exercise
Does µ137 have any nontrivial subgroups? 6
Exercise
In example 4 show that if N is even then the subset of classes of even integers forms
a proper subgroup of µN . What happens if N is odd?
Example 2.5: The residue classes modulo N , also called “The cyclic group of
order N.” Let N be a positive integer. Recall that we can put an equivalence relation
on Z defined by a ∼ b iff a − b is divisible by N , and we denoted the class of an integer
n by n̄. (One could identify the set of equivalence classes with the set {0, 1, . . . , N − 1}.)
We take G to be the set of equivalence classes of integers modulo N , using the notation
of equivalence relation of §7.1 above. We need to define m(r̄1 , r̄2 ). To do this we choose a
representative r1 , r2 from each equivalence class and take
The main thing to check here is that the equation is well-defined, since we chose represen-
tatives for each equivalence class. This group, which appears frequently in the following,
will be denoted as Z/N Z or ZN . For example, telling time in hours is arithmetic in Z12 ,
or in Z24 in railroad/military time. The reader should note that it “resembles” closely the
group µN . We will make that precise in the next section.
So far, all our examples had the property that for any two elements a, b
a·b=b·a (2.10)
Definition 2.5: When equation (2.10) holds for two elements a, b ∈ G we say “a and
b commute.” If a and b commute for every pair (a, b) ∈ G × G then we say that G is an
Abelian group:
If a, b commute for all a, b ∈ G we say “G is Abelian.”
Note: Note that our abbreviated notation a · b for the group multiplication m(a, b) would
actually be quite confusing when working with ZN . The reason is that it is also possible
to define a ring structure (see Chapter 2) where one multiplies r1 and r2 as integers and
then takes the residue. This is NOT the same as m(r1 , r2 ) !! For example, if we take
N = 5 then m(2, 3) = 0 in Z5 because 2 + 3 = 5 is congruent to 0 modulo 5. Of course,
6
Answer : We will give an elegant answer below.
– 10 –
multiplying as integers 2 × 3 = 6 and 6 is congruent to 1 mod 5. When considering Abelian
groups we often prefer to use the abbreviated notation
a + b := m(a, b) (2.11)
When we use this additive notation for Abelian groups we will write the identity element
as 0 so that a + 0 = 0 + a = a. (Writing “a + 1 = a” would look extremely weird.) Note
that we will not always use additive notation for Abelian groups! For example, for µN the
multiplicative notation is quite natural.
Since we defined a notion of “Abelian group” we are implicitly suggesting there are
examples of groups which are not Abelian. If one tries to use the group axioms to prove
that m(g1 , g2 ) = m(g2 , g1 ) one will fail. The only way we can know conclusively that
one will fail is to provide a counterexample. The next example gives a set of examples of
nonabelian groups:
Definition 2.5: The center Z(G) of a group G is the set of elements z ∈ G that commute
with all elements of G:
– 11 –
b.) Show that the center of GL(n, κ) is the subgroup of matrices proportional to the
unit matrix with scalar factor in κ∗ . 8
Another natural class are the unitary and special unitary groups:
J = J ∗ = −J tr = −J −1 (2.19)
Atr JA = J (2.20)
8
Answer : It is obvious that matrices of the form z1n×n with z ∈ κ∗ are in the center. What is not
immediately obvious is that there are no other elements of the center. Here is a careful proof that this is
indeed the case: Consider the matrix units: eij . The matrix eij has a 1 in the ith row and j th column and
zeroes elsewhere. Note that for any matrix A we have eii Aejj = Aij eij with no sum on i, j here. On the
RHS Aij is a matrix element, not a matrix. Now let z be in the center. Check that for any pair ij the
matrix 1 + eij is invertible. Therefore, if z is in the center then z must commute with 1 + eij and hence z
must commute with eij for all i, j. Now, as we observed above, eii zejj = zij eij holds for any matrix, but
P
since z is also central eii zejj = eii ejj z = δij ejj z So z is diagonal. But for any diagonal matrix z = k zk ekk
we have (zA)ij = zi Aij and (Az)ij = Aij zj . As long as there are matrices with Aij 6= 0 and invertible we
can conclude that zi = zj .
– 12 –
We define the symplectic groups:
Remarks:
1. As an exercise you should show from the definition above that the most general
element of SO(2, R) must be of the form
!
x y
x2 + y 2 = 1 (2.22)
−y x
where the matrix elements x, y are real. Thus we recognize that group elements in
SO(2, R) are in 1-1 correspondence with points on the unit circle in the plane. We
can even go further and parametrize x = cos φ and y = sin φ and φ is a coordinate
provided we identify φ ∼ φ + 2π so the general element of SO(2, R) is of the form:
!
cos φ sin φ
R(φ) := (2.23)
− sin φ cos φ
This is familiar from the implementation of rotations of the Euclidean plane in Carte-
sian coordinates. Note that the group multiplication law is
2. Let us consider the group U (1): This is simply the group of 1 × 1 unitary matrices.
They are not hard to diagonalize. The general matrix can be written as z(φ) = eiφ
with multiplication z(φ1 )z(φ2 ) = z(φ1 + φ2 ) where φ ∼ φ + 2π yield identical group
elements. Again, as with µN and ZN the groups look like they are “the same”
although strictly speaking they are different sets and therefore have different m’s.
We will make this idea precise in the next section.
3. One of the most important groups in both mathematics and physics is SU (2). We
claim that the general element of SU (2) looks like
!
z −w∗
g= (2.25)
w z∗
– 13 –
One can prove this by studying the 4 equations for the matrix elements in the identity
gg † = 1. Another way to proceed makes use of some concepts from the linear algebra
chapter below and goes as follows: Since g is unitary it follows that the basis
! !
1 0
g g (2.27)
0 1
for some phase λ. But now if we impose detg = 1 we discover that λ = 1. Note that,
writing out the real and imaginary parts of z, w we see that the equation (2.26) is
just the equation for a 3-dimensional sphere: Just the way SO(2) and U (1) are, as
manifolds, S 1 , we see that SU (2) also has the structure of a manifold, namely the
three-dimensional sphere.
4. Some of what we have just said can be generalized to all the classical matrix groups:
They can be identified with manifolds (although no other matrix groups are spheres
- not an obvious fact.) There are parametrizations of these manifolds, so the group
multiplication and inverse are smooth operations. Moreover, the groups “act” nat-
urally on various linear spaces. (See Section 4.1 for the notion of “group action.”)
This is part of the theory of Lie groups. Lie groups have vast applications in physics.
For example, G = SU (3) is the gauge group of a Yang-Mills theory that describes the
interactions of quarks and gluons, while G = SU (3) × SU (2) × U (1) is related to the
standard model that describes all known elementary particles and their interactions.
The general theory of Lie groups will be discussed in Chapter 8(?) below, although
we will meet many many examples before then.
– 14 –
If we want to stress the role of X and/or G we write F[X → G] for F. We claim that F is
also a group. The main step to show this is simply giving a definition of the group multi-
plication and the inversion operation. The product mF (f1 , f2 ) of two functions f1 , f2 ∈ F
must be another function in F. We define this function by giving a formula for the values
at all values of x ∈ X:
mF (f1 , f2 )(x) := mG (f1 (x), f2 (x)) (2.31)
It is the only sensible thing we could write given the data at hand. In less cumbersome
notation:
(f1 · f2 )(x) := f1 (x) · f2 (x) (2.32)
Similarly inverse of f is the function that maps x → f (x)−1 , where f (x)−1 ∈ G is the
group element in G inverse to f (x) ∈ G.
If both X and G have finite cardinality then F[X → G] is a finite group. If X or G has
an infinite set of points then this is an infinite order group. If X is a positive dimensional
manifold and G is a Lie group (notions defined below) this is an infinite-dimensional space.
In the special case of the space of maps from the circle into the group:
LG = F[S 1 → G] (2.33)
we have the famous “loop group” whose representation theory has many wonderful prop-
erties, closely related to the subjects of 2d conformal field theory and string theory. In
some cases if X is a manifold and G is a classical matrix group then, taking a subgroup
defined by suitable continuity and differerentiability properties, we get the group of gauge
transformations of Yang-Mills theory. As a simple example, you are probably familiar with
the gauge transformation in Maxwell theory:
Aµ → Aµ + ∂µ (2.34)
f : x 7→ ei(x) (2.35)
and this is a function from spacetime, M1,3 to U (1), so f ∈ F[M1,3 → U (1)]. This is a
better point of view because it generalizes in interesting ways to other spacetimes. Note
that the gauge transformation law can be written as:
– 15 –
leaves every element unchanged. The inverse of a permutation is a permutation. Thus,
composition defines a group operation on the permutations of any set. This group is
designated SX . It is an extremely important group and we will be studying it a lot. In the
case where X = M is a manifold we can also ask that our permutations φ : M → M be
continuous or even differentiable. If φ and φ−1 are differentiable then φ is a diffeomorphism.
The composition of diffeomorphisms is a diffeomorphism by the chain rule, so the set of
diffeomorphisms Diff(M ) is a subgroup of the set of all permutations of M . The group
Diff(M ) is the group of gauge symmetries in General Relativity. Except in the case where
M = S 1 is the circle, remarkably little is known about the diffeomorphism groups of
manifolds. One can ask simple questions about them whose answers are unknown.
Example 2.10: Power Sets As Groups.
Let X be any set and let P(X) be the power set of X. It is, by definition, the set of all
subsets of X. If Y1 , Y2 ∈ P(X) are two subsets of X then define the symmetric difference:
This defines an abelian group structure on P(X). The identity element 0 is the empty set
∅ and the inverse of Y is Y itself: That is, in this group
2Y := Y + Y = ∅ = 0 (2.38)
Definition 2.4 Let G1 , G2 be two groups. The direct product of G1 , G2 is the set G1 × G2
with product:
mG1 ×G2 (g1 , g2 ), (g10 , g20 ) = (mG1 (g1 , g10 ), mG2 (g2 , g20 ))
(2.39)
– 16 –
Exercise Apparent Asymmetry In The Definitions
In (2.15) we used AAtr = 1 but we could have used Atr A = 1. Similarly, in (2.16)
we used AA† = 1 rather than A† A = 1. Finally, in (2.21) we could, instead, have defined
Sp(2n, κ) to be matrices in M2n (κ) such that AJAtr = J. In all three cases, writing things
the other way defines the same group: Why?
(Careful: Just taking the transpose or hermitian conjugate of these equations does not
help.) 9
b.) Show that no matrix in O(2, R) is simultaneously of the form (2.22) and (2.40).
Conclude that, as a manifold, O(2, R) is a disjoint union of two circles.
9
Answer : Hint: Remember that in a group the inverse matrix is in the group. Consider replacing
g → g −1 in the definition.
– 17 –
Suppose we define new coordinates and momenta Qi , Pi to be linear combinations of
the old:
Q1 q1
. .
.. ..
Qn a11 · · · a1,2n
q n
.. . . .
.
= . · (2.44)
. .
P1 p1
..
a2n,1 · · · a2n,2n ..
. .
Pn pn
where A = (aij ) is a constant 2n × 2n matrix.
b.) Show that
{Qi , Qj } = {Pi , Pj } = 0 {Qi , Pj } = δji (2.45)
if and only if A is a symplectic matrix.
c.) Show that J ∈ Sp(2n, R). Note that it exchanges momenta and coordinates.
d.) What are the conditions on the n × n matrix B so that
!
1B
{ } (2.46)
0 1
is a subgroup. 10
e.) What are the conditions on the n × n matrix C so that
!
1 0
{ } (2.47)
C 1
is a subgroup. 11
f.) Show that ! !
1 0 1B
=J J −1 (2.48)
C 1 0 1
for C = −B.
– 18 –
!
3 1 0
σ := (2.51)
0 −1
a.) Show that they satisfy the identity, valid for all 1 ≤ i, j ≤ 3:
σ i σ j = δ ij + iijk σ k (2.52)
forms a subgroup of order 8 of SU (2) ⊂ GL(2, C). It is known as the quaternion group.
c.) Show that the set of matrices
forms a subgroup of U (2) ⊂ GL(2, C) of order 16. It is known as the Pauli group.
Remark: 12 The Pauli group is often used in quantum information theory. If we think
of the quantum Hilbert space of a spin 1/2 particle (isomorphic to C2 with standard inner
product) then there is a natural basis of up and down spins: v1 = | ↑i and v2 = | ↓i.
Thinking of these as quantum analogs |0i and |1i of classical information bits 0, 1 we see
that X = σ 1 acts as a “bit flip,” while Z = σ 3 acts as a “phase-flip.” Y = iσ 2 flips both
bits and phases. These are then quantum error operators. Note that if we have a chain of
N spin 1/2 particles then the N th direct product
PN = P
| × ·{z
· · × P} (2.55)
N times
acts naturally on this chain of particles. 13 This group is useful in quantum information
theory. For example if H ⊂ P N is a subgroup such that (−1, ...., −1) is not in H then
we can study the subspace of Hilbert space {ψ|gψ = ψ, ∀g ∈ H}. For astutely chosen
subgroups these are useful quantum code subspaces, known as stabilizer codes.
12
Many terms used here will be more fully explained in Chapter 2.
13
See 4.1 for the formal definition of a group action on a space.
– 19 –
3. Homomorphism and Isomorphism
Remarks
product in G0
z }| {
ϕ( g1 g2 ) = ϕ(g1 )ϕ(g2 ) (3.2)
|{z}
product in G
r0
ϕ(r̄) := exp 2πi (3.4)
N
There is a crucial thing to check here: We need to check that the map is actually well-
defined. We know that any two representatives r10 and r20 for r̄ must have the property
that r10 − r20 = 0 modN , that is r10 − r20 = `N for some
integer
N and
nowby standard
r0 r0
properties of complex numbers we see that indeed exp 2πi N1 = exp 2πi N2 .
Next we check that
ϕ(r̄1 + r̄2 ) = ϕ(r̄1 )ϕ(r̄2 ) (3.5)
If you unwind the definitions you should find this follows from a standard property of the
exponential map.
Equation (3.5) implies that (3.3) is a homomorphism. In fact one easily checks:
a.) If ϕ(r̄) = 1 then r̄ = 0̄. Thus ϕ is 1-1 (a.k.a. “injective”).
– 20 –
b.) Every element of µN is of the form ϕ(r̄) for some r̄. Thus, ϕ is onto (a.k.a.
“surjective”). Note that this is equivalent to saying that every element in µN is of the form
ω j where ω = e2πi/N .
Thus, ϕ is in fact an isomorphism. As we mentioned above, the two groups “seemed
to be the same.” We have now given precise meaning to that idea.
by
pk (z) = z k (3.7)
where z is any N th root of unity. Note that z k is also an N th root of unity. Moreover
(z1 z2 )k = z1k z2k by elementary properties of complex numbers, so pk is a homomorphism.
Note that it is not always injective or surjective. For example, if k is a multiple of N it is
the stupid homomorphism. In fact pk+N = pk .
mk : ZN → ZN (3.8)
by the equation:
mk (r̄) := kr (3.9)
where on the right hand side kr is defined by choosing a representative r for the class r̄
and then using ordinary multiplication of integers k × r (e.g. 2 × 3 = 6) and then reducing
modulo N . Again, one needs to check the equation is well-defined. Note that mk+N = mk .
pk (ω j ) = (ω j )k = ω jk . (3.10)
ϕ ◦ mk = pk ◦ ϕ (3.11)
– 21 –
In mathematics one often uses commutative diagrams to express identities such as
(3.11). In this case the diagram looks like
mk
ZN / ZN (3.13)
ϕ ϕ
pk
/ µN
µN
We say a diagram commutes if the following condition holds: The diagram describes a
graph with sets associated to vertices and maps associated with oriented edges. Consider
following the arrows around any two paths on the graph with the same beginning and
final points. We compose the maps associated with those arrows to get two maps from the
initial set to the final set. The diagram commutes iff any pair of maps obtained this way
are equal.
Remark We will discuss in detail later on that when k is an integer relatively prime
to N the map pk is an automorphism of µN and mk is an automorphism of ZN . For
example in Z/3Z = {0̄, 1̄, 2̄} if we take k = 2, or any even integer not divisible by 3, then
µ exchanges 1̄ and 2̄. (Check that such an exchange is indeed a homomorphism!) We will
discuss this kind of example in greater detail in Section §13 below.
One kind of homomomorphism is especially important:
T : G → GL(n, κ) (3.14)
for some positive integer n and field κ. (One can also have matrix representations in
GL(n, R) where R is a ring.)
More generally, if V is a vector space over a field κ let GL(V ) denote the group of all
invertible linear transformations from V → V . (The group multiplication is composition.)
If G is a group then a homomorphism T : G → GL(V ) defines a representation of G.
Sometimes V is referred to as the carrier space.
*******************************************
PUT EXAMPLE OF HOMOMORPHISM π : SU (2) → SO(3) HERE. IT GIVES A
3-dimensional MATRIX REP OF SU (2).
CURRENTLY IT IS IN SECTION 10.1
BUT SOME OF THAT SHOULD BE PUT HERE
*******************************************
– 22 –
Exercise The Stupid Homomorphism
Consider the map µ : G → G0 defined by µ(g) = 1G0 . Show that this is a homomor-
phism.
– 23 –
Construct a homomorphism
µ : Q → Z2 × Z2 (3.18)
Exercise Subgroups of ZN
a.) Show that the subgroups of ZN are isomorphic to the groups ZM for M |N .
b.) For N = 8, M = 4 write out H.
Exercise
Let S2 be any set with two elements
a.) Show that there are exactly two possible group structures on S2 , and in each case
construct an isomorphism of S2 with µ2 ∼ = Z2 .
b.) Consider the matrix group of two elements:
! !
10 01
Ŝ2 = { , } (3.19)
01 10
pk (ω j ) = ω jk (3.21)
– 24 –
b.) Show that !
2πj cos( 2πj 2πj
N ) sin( N )
µ : ωj →
7 R( ) := (3.22)
N − sin( 2πj 2πj
N ) cos( N )
defines a two-dimensional matrix representation of ZN .
c.) Let P be the N × N “shift matrix” all of whose matrix elements are zero except
for 1’s just below the diagonal and P1,N = 1. See equation (10.19) below. Show that
µ(ω j ) = P j (3.23)
is an N × N dimensional representation of µN .
– 25 –
Show that the fiber product is in fact a subgroup of G1 × G2 , where G1 × G2 has the direct
product group structure.
We would also like x 7→ φ(1G , x) to be the identity map. Now, equation (4.1) implies that
which is compatible with, but does not quite imply that φ(1G , x) = x. Thus in defining a
group action we must also impose the condition:
φ(1G , x) = x ∀x ∈ X. (4.3)
Exercise
Give an example of a map φ : G × X → X that satisfies (4.1) but not (4.3). 15
Third Way: Yet another way to say this is the following: Define the map Φ : G → SX
that takes g 7→ φ(g, ·). That is, for each g ∈ G, Φ(g) is the function X → X taking
15
Answer : As the simplest example, choose any element x0 ∈ X and define φ(g, x) = x0 for all g, x. For
a slightly less trivial example consider G = S2 and let φ(e, x) = φ(σ, x) = f (x). Then if f ◦ f (x) = f (x)
the condition (4.1) will be satisfied, but there certainly exist functions with f ◦ f = f which are not the
identity map.
– 26 –
x 7→ φ(g, x). Clearly Φ(g1 ) ◦ Φ(g2 ) = Φ(g1 g2 ) because of (4.1). In order to make sure it
is a permutation we need to know that Φ(g) is invertible and therefore we need to impose
that Φ(1G ) is the identity transformation. This follows from (4.3). Then Φ(g) ∈ SX . So,
to say we have a group action of G on X is to say that Φ is a homomorphism of G into the
permutation group SX . We will discuss G-actions on sets and their properties extensively
in Chapter 3.
Notation: Again, our notation is overly cumbersome because we want to stress the con-
cept. Usually one writes a left G-action as
g · x := φ(g, x) (4.4)
g1 · (g2 · x) = (g1 g2 ) · x
(4.5)
1G · x = x
The notion of orbits is very important in geometry, gauge theory and many other subjects.
The set of orbits is denoted X/G. We will discuss many examples below.
Examples
2. Consider rotations around the origin by multiples of 2π/3. Check that this group is
isomorphic to Z3 . Consider an equilateral triangle centered on the origin. Then the
rotations act on the triangle preserving it. Thus, Z3 acts as a group of symmetries
of the equilateral triangle. Intuitively, group theory is the theory of symmetry. We
have just illustrated how that idea can be formalized through the notation of group
action on a set. ♣FIGURE HERE!
♣
– 27 –
3. Let G = GL(n, κ) and X = κn , the n-dimensional vector space over κ. Then the
usual linear action on vectors defines a group action of G on X. One can check that
there are only two orbits: The zero vector gives one orbit.
where p + q = n + 1. Note that this action preserves the equation of the sphere
P i 2 n
i (x ) − 1 = 0 and hence descends to a Z2 -action on the sphere S . The case
p = 0, q = n + 1 is the antipodal map. The set of orbits is known as RPn ∼
= S n /h−1i.
There are many other natural actions of Z2 on S . n
5. Let the group be G = C∗ . This acts on X = Cn by scaling all the coordinates. Note
that scaling a nonzero vector by a nonzero scalar gives a nonzero vector so G = C∗
also restricts to act on X̃ = Cn − {0}. The set of orbits in these two cases are very
different. One can put a natural topology on the set of orbits. One finds that Cn /C∗
is not a Hausdorff space while (Cn − {0})/C∗ is a nice manifold. This important
manifold is often denoted CPn−1 . See Chapter 3 for more discussion. We will often
denote elements of CPn−1 by [X 1 : · · · : X n ]. This stands for the equivalence class of
a vector (X 1 , . . . , X n ) ∈ Cn − {0}.
6. Now consider a set of integers (q1 , . . . , qn ) ∈ Zn . Then for each such set of integers
there is a C∗ -action on CPn−1 defined by
via
aτ + b
g · τ := (4.10)
cτ + d
where !
a b
g= (4.11)
c d
The reader should check that indeed:
– 28 –
9. Actions of Z on any set X. Let us consider Z to be the free group with one generator
g0 . Then, given any invertible map f : X → X we can define a group action of Z on
X by
f ◦ · · · ◦ f (x) n>0
| {z }
n times
n
g0 · x = x n=0 (4.13)
−1 −1
f ◦ · · · ◦ f (x) n < 0
| {z }
|n| times
Conversely, any Z-action must be of this form since we can define f (x) := g0 · x. Thus
the orbit of a point x ∈ X is the set of all images of successive actions of f and f −1 .
********************************
SHOULD DEFINE A MAP, OR MORPHISM, OF G-SPACES HERE. INTRODUCE
THE GENERAL IDEA OF AN EQUIVARIANT MAP.
GIVE SOME EXAMPLES.
*********************************
Note the inverse of g on the RHS. It is there so that the group law works out:
– 29 –
and hence φ̃(g1 , φ̃(g2 , F )) = φ̃(g1 g2 , F ) as required for a group action. It should also be
clear that φ̃(1G , F ) = F .
In less cumbersome notation we would simply write
(g · F )(x) := F (g −1 · x) (4.16)
In the above discussion we could impose various conditions, on the functions in F[X →
Y ]. For example, if X and Y are manifolds we could ask our maps to be continuous,
differentiable, etc. The above discussion would be unchanged.
As just one (important) example of this general idea: In field theory if we have fields
on a spacetime, and a group of symmetries acting on that spacetime then that group also
acts on the space of fields.
The symmetric group is an important example of a finite group. As we shall soon see, all
finite groups are isomorphic to subgroups of the symmetric group.
Recall from section 2 above that for any set X we can define a group SX of all permu-
tations of the set X. If n is a positive integer the symmetric group on n elements, denoted
Sn , is defined as the group of permutations of the set X = {1, 2, . . . , n}.
In group theory, as in politics, there are leftists and rightists and we can actually define
two group operations:
meaning: φ(1) = p1 , φ(2) = p2 , . . . , φ(n) = pn . Note that we could equally well write the
same permutation as: !
a1 a2 · · · an
φ= (5.3)
pa1 pa2 · · · pan
– 30 –
where a1 , . . . , an is any permutation of 1, . . . , n. With this understood, suppose we want
to compute φ1 ·L φ2 . We should first see what φ1 does to the ordered elements 1, . . . , n,
and then see what φ2 does to the ordered output from φ1 . So, if we write:
!
1 ··· n
φ1 =
q1 · · · qn
! (5.4)
q1 · · · qn
φ2 =
p1 · · · pn
Then !
1 ··· n
φ1 ·L φ2 = (5.5)
p1 · · · pn
On the other hand, to compute φ1 ·R φ2 we should first see what φ2 does to 1, . . . , n and
then see what φ1 does to that output. We could write represent this as:
!
1 ··· n
φ2 =
q10 · · · qn0
! (5.6)
q10 · · · qn0
φ1 =
p01 · · · p0n
and then !
1 ··· n
φ1 ·R φ2 = (5.7)
p01 · · · p0n
Exercise
a.) Show that the order of the group is |Sn | = n!.
b.) Show that if n1 ≤ n2 then we can consider Sn1 as a subgroup of Sn2 .
c.) In how many ways can you consider S2 to be a subgroup of S3 ? 16
d.) In how many ways can you consider Sn1 to be a subgroup of Sn2 when n1 ≤ n2 ?
17
– 31 –
1 2 3 4 5 6 7 8
1 2 3 4 5 6 7 8
φ1 ·L φ2 = (φ−1 −1 −1
1 ·R φ2 ) (5.9)
c.) Interpret (5.9) as the simple statement that φ1 ·R φ2 puts φ2 in the past while
φ1 ·L φ2 puts φ1 in the past.
The rearrangement lemma: Consider a totally ordered group, that is, we can list the group
elements in order
G = {g1 , g2 , . . . , } (5.10)
16
Answer : There are three subgroups of S3 isomorphic to S2 . They are the subgroups that fix 1, 2, 3
respectively.
17
Answer : for any subset T ⊂ {1, . . . , n2 } of cardinality n2 −n1 we can consider the subset of permutations
that fix all elements of T . This subset of permutations will be a subgroup isomorphic to Sn1 . So there are
n2
n1
distinct subgroups isomorphic to Sn1 .
– 32 –
Consider this as an ordered set, with all the gi distinct. The set can be finite or infinite.
Then, for any h ∈ G consider the ordered set:
h · G = {h · g1 , h · g2 , . . . , }. (5.11)
With a little thought you can (and should) convince yourself that (5.11) is a a permutation
of (5.10): No two elements coincide (since the gi 6= gj for i 6= j) and every element of G
must appear in the list h · G.
L(h) : g 7→ h · g ∀g ∈ G. (5.12)
This map is one-one and invertible so L(h) ∈ SG , the group of permutations of the set G.
(In fact, there is no need to assume G is totally ordered.) Now note that
L : G → SG (5.14)
R(a) : g 7→ g · a (5.15)
18
Answer : Choose any cyclic permutation of length n (Cyclic permutations are defined in Section 5.2
below.) Then it generates a subgroup of SN of length n for any N ≥ n.
– 33 –
a.) Show that R(a) permutes the elements of G.
b.) Show that R(a1 ) ◦ R(a2 ) = R(a2 a1 ). Thus, a 7→ R(a) is not a homomorphism of
G into the group SG of permutations of G.
c.) Show that a 7→ R(a−1 ) is a homomorphism of G into SG .
a1 → a2 → · · · → a` → a1 (5.16)
holding all other n − ` elements fixed. Such a permutation is called a cycle of length `. We
will denote such permutations as:
φ = (a1 a2 . . . a` ). (5.17)
Bear in mind that with this notation the same permutation can be written in ` different
ways:
Let us write out the elements of the first few symmetric groups in this notation:
1. S2 is abelian.
2. S3 is NOT ABELIAN 19
It is not true that all permutations are just cyclic permutations, as we first see by
considering S4 :
– 34 –
Now a key observation is:
Any permutation σ ∈ Sn can be uniquely written as a product of disjoint cycles. This
is called the cycle decomposition of σ.
For example
σ = (12)(34)(10, 11)(56789) (5.23)
is a cycle decomposition in S11 . There are 3 cycles of length 2 and 1 of length 5.
The decomposition into products of disjoint cycles is known as the cycle decomposition.
Exercise
a.) Let φ be a cyclic permutation of order `. Suppose we compose φ with itself N
times. Show that the result is the identity transformation iff ` divides N .
b.) Suppose φ has a cycle decomposition with cycles of length k1 , . . . , ks . What is the
smallest number N so that if we compose φ with itself φ ◦ · · · ◦ φ for N times that we get
the identity transformation?
5.3 Transpositions
A transposition is a permutation of the form: (ij). These satisfy some nice properties:
Suppose i, j, k are distinct. You can check as an exercise that transpositions obey the
following identities:
– 35 –
=
i j k k
i j
Figure 2: Pictorial illustration of equation (4.21) line one for transpositions where i < j < k. Note
that the identity is suggested by “moving the time lines” holding the endpoints fixed. Reading time
from bottom to top corresponds to reading the composition from left to right in the ·R convention.
1 → a1 , 2 → a2 , 3 → a3 , · · · , k → ak (5.26)
For our purposes, it won’t really matter what it does to the other integers greater than k.
Choose any such permutation and call it φ. Note that
so now multiply the above identity by φ on the left and φ−1 on the right to get:
φ(1, k)φ−1 φ(1, k − 1)φ−1 · · · φ(1, 4)φ−1 φ(1, 3)φ−1 φ(1, 2)φ−1 = (a1 , a2 , . . . , ak ) (5.28)
– 36 –
Although permutations can be written as products of transpositions in different ways,
the number of transpositions in a word modulo 2 is always the same, because the identities
(5.24) have the same number of transpositions, modulo two, on the LHS and RHS. Thus
we can define even, resp. odd, permutations to be products of even, resp. odd numbers of
transpositions.
Exercise
a.) What is the order of An ? 22
b.) Write out A2 , A3 , and A4 . Show that A3 is isomorphic to Z3 . 23
Exercise
When do two cyclic permutations commute? Illustrate the answer with pictures, as
above.
A4 = {1, (123), (132), (124), (142), (134), (143), (234), (243), (12)(34), (13)(24), (14)(23)}.
24
Answer : No. Just multiply a few elements to find a counterexample. For example (123)(134) = (234)
but (134)(123) = (124).
– 37 –
Show that, in fact, Sn can be generated by just two elements: (12) and (1 2 · · · n). 25
Exercise Center of Sn
What is the center of Sn ? 26
: Sn → Z2 (5.29)
where we identify Z2 as the multiplicative group {±1} of square roots of 1. The rule is:
: σ → +1 if σ is a product of an even number of transpositions.
: σ → −1 if σ is a product of an odd number of transpositions.
Put differently, we could define (ij) = −1 for any transposition. This is compatible
with the words defining the relations on transpositions. Since the transpositions generate
the group the homomorphism is well-defined and completely determined.
In physics one often encounters the sign homomorphism in the guise of the “epsilon
tensor” denoted:
i1 ···in (5.30)
Its value is:
1. i1 ···in = +1 if !
1 2 ··· n
(5.31)
i1 i2 · · · in
is an even permutation.
25
Answer : Conjugate (12) by the n-cycle to get (23). Then conjugate again to get (34) and so forth.
Now we have the set of generators of the previous exercise.
26
Answer : If n = 2 then Sn is Abelian and the center is all of S2 . If n > 2 then the center is the trivial
group. To prove this suppose z ∈ Z(Sn ). If z is not the trivial element then it moves some i to some j.
WLOG we can say it moves 1 to i 6= 1. Then z(i) 6= i. If z(i) = 1 then z is the transposition (1, i). If n > 2
there will be some other j 6= 1, i and z will not commute with (1, j). If z(i) = j with j 6= 1, i then φ = (1, i)
does not commute with z because zφ takes 1 → j and φz takes 1 → 1.
27
Hint: Use the pictorial interpretation mentioned above.
– 38 –
2. i1 ···in = −1 if
!
1 2 ··· n
(5.32)
i1 i2 · · · in
is an odd permutation.
3. i1 ···in = 0 if two indices are repeated. (This goes a bit beyond what we said above
since in that case we are not discussing a permutation.)
123 = 1
132 = −1
(5.33)
231 = +1
221 = 0
and so forth.
Exercise
Show that
X
i1 i2 ···in j1 j2 ···jn = (σ)δi1 jσ(1) δi2 jσ(2) · · · δin jσ(n) (5.34)
σ∈Sn
This formula is often useful when proving identities involving determinants. An im-
portant special case occurs for n = 3 where it is equivalent to the rule for the cross-product
of 3 vectors in R3 :
~ × (B
A ~ × C)
~ = B(
~ A~ · C)
~ − C(
~ A~ · B)
~ (5.35)
The next two exercises assume some familiarity with concepts from linear algebra. See
Chapter 2 below if they are not familiar.
– 39 –
and now extend by linearity so that
n
X n
X n
X
T (φ) : xi ei 7→ xi eφ(i) = xφ−1 (i) ei (5.37)
i=1 i=1 i=1
28
Answer : A(φ)i,j = δi,φ(j) = δφ−1 (i),j .
29
Answer : Show that the linear subspace spanned by the “all ones vector” v0 = e1 + · · · + en is preserved
under the action of T (φ) for all φ ∈ Sn : T (φ)(λv0 ) = λv0 , for all λ ∈ R.
– 40 –
5.4 Diversion and Example: Card shuffling
Another way to express this is that the Out-shuffle defines a permutation of {0, 1, . . . , 2n−
1}. If we let Cx , 0 ≤ x ≤ 2n − 1 denote the cards in the original order then the new ordered
set of cards Cx0 are related to the old ones by:
0
CO(x) = Cx (5.43)
where (
2x x≤n−1
O(x) = (5.44)
2x − (2n − 1) n ≤ x ≤ 2n − 1
Note that this already leads to a card trick: Modulo (2n − 1) the operation is just
x → 2x, so if k is the smallest number with 2k = 1mod(2n − 1) then k Out-shuffles will
restore the deck perfectly.
For example: For a standard deck of 52 cards, 28 = 5 × 51 + 1 so 8 perfect Out-shuffles
restores the deck!
We can also see this by working out the cycle presentation of the Out-shuffle:
O = (0)(1, 2, 4, 8, 16, 32, 13, 26)(3, 6, 12, 24, 48, 45, 39, 27)
(5, 10, 20, 40, 29, 7, 14, 28)(9, 18, 36, 21, 42, 33, 15, 30) (5.45)
(11, 22, 44, 37, 23, 46, 41, 31)(17, 34)(19, 38, 25, 50, 49, 47, 43, 35)(51)
– 41 –
Now observe that if we have a deck with 2n cards D(2n) := {0, 1, . . . , 2n − 1} and we
embed it in a Deck with 2n + 2 cards
by the map x → x + 1 then the Out-shuffle on the deck D(2n + 2) permutes the cards
1, . . . , 2n amongst themselves and acts as an In-shuffle on these cards! ♣Explain this some
more, e.g. by
Therefore, applying our formula for the Out-shuffle we find that the In-shuffle is given illustrating with a
pack of 6 cards. ♣
by the formula
(
2(x + 1) − 1 x+1≤n
I(x) = (5.47)
2(x + 1) − (2n + 1) − 1 n ≤ x ≤ 2n − 1
for x ∈ D(2n).
For 2n = 52 this turns out to be one big cycle!
(0, 1, 3, 7, 15, 31, 10, 21, 43, 34,16, 33, 14, 29, 6, 13, 27, 2, 5,
11, 23, 47, 42, 32, 12, 25, 51, 50,48, 44, 36, 20, 41, 30, 8, 17, (5.49)
35, 18, 37, 22, 45, 38, 24,49, 46, 40, 28, 4, 9, 19, 26)
where aj ∈ {0, 1}. Interpret the coefficients 1 as In-shuffles and the coefficients 0 as Out-
shuffles. Then, reading from left to right, perform the sequence of shuffles given by the
binary expression: 1ak−1 ak−2 · · · a1 a0 .
To see why this is true consider iterating the functions o(x) = 2x and i(x) = 2x + 1.
Notice that the sequence of operations given by the binary expansion of ` are
0→1
→ 2 · 1 + ak−1
→ 2 · (2 · 1 + ak−1 ) + ak−2 = 22 + 2ak−1 + ak−2
(5.51)
→ 2 · (22 + 2ak−1 + ak−2 ) + ak−3 = 23 + 22 ak−1 + 2ak−2 + ak−3
.. ..
. .
→ 2k + ak−1 2k−1 + · · · + a1 21 + a0 = `
– 42 –
For an even ordered set we can define a notion of permutations preserving central
symmetry. For x ∈ D2n let x̄ = 2n − 1 − x. Then we define the group W (Bn ) ⊂ S2n to be
the subgroup of permutations which permutes the pairs {x, x̄} amongst themselves.
Note that there is clearly a homomorphism
φ : W (Bn ) → Sn (5.52)
Moreover, both O and I are elements of W (Bn ). Therefore the shuffle group, the group
generated by these is a subgroup of W (Bn ). Using this one can say some nice things about
the structure of the group generated by the in-shuffle and the out-shuffle. It was completely
determined in a beautiful paper (the source of the above material):
“The mathematics of perfect shuffles,” P. Diaconis, R.L. Graham, W.M. Kantor, Adv.
Appl. Math. 4 pp. 175-193 (1983)
It turns out that shuffles of decks of 12 and 24 cards have some special properties. In
particular, special shuffles of a deck of 12 cards can be used to generate a very interesting
group known as the Mathieu group M12 . It was, historically, the first “sporadic” finite
simple group. See section §16.4 below.
To describe M12 we need to introduce a Mongean shuffle. Here we take the deck of
cards put the top card on the right. Then from the deck on the left alternatively put cards
on the top or the bottom. So the second card from of the deck on the left goes on top of
the first card, the third card from the deck on the left goes under the first card, and so on.
If we label our deck as cards 1, 2, . . . , 2n then the Mongean shuffle is:
m : {1, 2, . . . , 2n} → {2n, 2n − 2, . . . , 4, 2, 1, 3, 5, . . . , 2n − 3, 2n − 1} (5.53)
In formulae, acting on D(2n)
m(x) = Min[2x, 2n + 1 − 2x] (5.54)
In particular for 2n = 12 we have
{1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12} → {12, 10, 8, 6, 4, 2, 1, 3, 5, 7, 9, 11} (5.55)
which has cycle decomposition (check!)
(3 8) · (1 12 11 9 5 4 6 2 10 7) (5.56)
Now consider the reverse shuffle that simply orders the cards backwards. In general
for a deck D(2n) with n = 2mod4 Diaconis et. al. show that r and m generate the entire
symmetric group. However, for a pack of 12 cards r and m generate the Mathieu group
M12 . It turns out to have order
|M12 | = 26 · 33 · 5 · 11 = 95040 (5.57)
Compare this with the order of S12 :
12! = 210 · 35 · 52 · 7 · 11 = 479001600 (5.58)
So with the uniform probability distribution on S12 , the probability of finding a Mathieu
1
permutation is 5040 ∼ 2 × 10−4 .
We mention some final loosely related facts:
– 43 –
1. There are indications that the Mathieu groups have some intriguing relations to string
theory, conformal field theory, and K3 surfaces.
2. In the theory of L∞ algebras and associated topics, which are closely related to string
field theory one encounters the concept of the k-shuffle...
FILL IN.
Definition 6.1 A subset S ⊂ G is a generating set for a group if every element g ∈ G can
be written as a “word” or product of elements of S. That is any element g ∈ G can be
written in the form
g = si1 · · · sir (6.1)
where Ri are words in the letters of S which will be set to 1. ALL other relations,
that is, all other identities of the form W = 1 are supposed to be consequences of these
relations.
Remark: It is convenient to exclude the unit 1 from S. When we write our words it
is understood that we can raise a generator s to any integer power sn where, s0 = 1 and,
if n < 0, this means (s−1 )|n| . Alternatively, we can, for each generator s introduce another
30
See Jacobsen, Basic Algebra I, sec. 1.11 for a more precise definition.
– 44 –
generator t, which will play the role of s−1 and then impose another relation st = ts = 1G .
A generating set that contains s−1 for every generator s is said to be symmetric.
Example 2.2: The most general group with one generator and one relation must be of
the form:
ha|aN = 1i (6.3)
Example 2.3: Free groups. If we impose no relations on the generating set S then we
obtain what is known as the free group on S, denoted F (S). If S consists of one element
then we just get Z, as above. However, things are completely different if S consists of two
elements a, b. Then F (S) is very complicated. A typical element looks like one of
where ni , mi are nonzero integers (positive or negative). Three nice general results on free
groups are:
1. Two free groups F (S1 ) and F (S2 ) are isomorphic iff they have the same cardinality.
3. Every group has a presentation in terms of generators and relations. For the group
G we can consider the free group F (G) with S = G as a set. There is then a
natural homomorphism ϕ : F (G) → G where we take a word in elements of G and
map concatenation of letters to group multiplication in G. As we will see in Section
*** below, the kernel of the homomorphism is a normal subgroup K(G) so that
G∼= F (G)/K(G) and K(G) are the relations in this presentation. This presentation
can be incredibly inefficient and useless. (Think, for example, of Lie groups.)
Combinatorial group theorists use the notion of a Cayley graph to illustrate groups
presented by generators and relations. Assuming that 1 ∈/ S the Cayley graph is a graph
whose vertices correspond to all group elements in G and the oriented edges are drawn
between g1 and g2 if there is an s ∈ S with g2 = g1 s. We label the edge by s. (If S is
symmetric we can identify this edge with the edge from g2 to g1 labeled by s−1 .) For the
free group on two elements this generates the graph shown in Figure 3.
– 45 –
Figure 3: The Cayley graph for the free group on 2 generators a and b.
Example 2.4: Coxeter groups: Let mij by an n × n symmetric matrix whose entries are
positive integers or ∞, such that mii = 1, 1 ≤ i ≤ n, and mij ≥ 2 or mij = ∞ for i 6= j.
Then a Coxeter group is the group with generators and relations:
Quite generally, a group element that squares to 1 is called an involution. So all the
generators of a Coxeter group are involutions. It then follows that if mij = 2 then si and
sj commute. If mij = 3 then the relation can also be written:
si sj si = sj si sj (6.7)
A theorem of Coxeter’s from the 1930’s gives a classification of the finite Coxeter
groups. 31 Coxeter found it useful to describe these groups by a diagrammatic notation:
We draw a graph whose vertices correspond to the generators si . We draw an edge between
vertices i and j if mij ≥ 3. By convention the edges are labeled by mij and if mij = 3 then
the standard convention is to omit the label.
It turns out that the finite Coxeter groups can be classified. The corresponding Coxeter
diagrams are shown in Figure 4.
31
For a quick summary see the expository note by D. Allcock at
https://web.ma.utexas.edu/users/allcock/expos/reflec-classification.pdf.
– 46 –
Figure 4: Coxeter’s list of finite Coxeter groups. They are finite groups of reflections in some
Euclidean space.
The finite Coxeter groups turn out to be isomorphic to concrete groups of reflections
in some Euclidean space. That is, finite subgroups of O(N ) for some N . That is, there is
some vector space RN and collection of vectors vi ∈ RN with inner products
π
vi · vj = −2 cos( ) (6.8)
mi,j
so that the group generated by reflections in the plane orthogonal to the vectors vi :
2v · vi
Pvi : v 7→ v − vi (6.9)
vi · vi
is a finite group isomorphic to the Coxeter group with matrix mi,j . (Note that since
mi,i = 1 we have vi2 = 2 and Pvi (v) = v − (v · vi )vi .)
Note that, if Pv is the Euclidean reflection in the plane orthogonal to v then Pv1 ◦ Pv2
is just rotation in the plane spanned by v1 , v2 by an angle 2θ where the angle between
v1 and v2 is θ. To prove this, note that Pv1 ◦ Pv2 clearly leaves all vectors in the plane
orthogonal to v1 , v2 fixed. Now represent vectors in a 2-dimensional Euclidean plane by
complex numbers, but view C as a real vector space. WLOG take v = eiθ . Then Pv is the
transformation:
Pv : z 7→ −e2iθ z̄ (6.10)
Note that this is a linear transformation of real vector spaces: Pv (a1 z1 +a2 z2 ) = a1 Pv (z1 )+
a2 Pv (z2 ) if a1 , a2 ∈ R. To check this formula note that if z = eiθ then Pv (z) = −z and if
z = ieiθ is in the orthogonal hyperplane to v then Pv (z) = z.
Now if va = eiθa , a = 1, 2, it is an easy matter to compute:
– 47 –
to the symmetric group Sn+1 . 32 In this case we have seen that the elementary generators
σi = (i, i + 1), 1 ≤ i ≤ n indeed satisfy the Coxeter relations:
σi2 = 1
(σi σi+1 )3 = 1 1≤i≤n−1 (6.12)
2
(σi σj ) = 1 |i − j| > 1
Now consider the standard basis ei for Rn+1 , 1 ≤ i ≤ n + 1 and consider the vectors:
αi = ei − ei+1 (6.13)
which have the inner products: αi2 = 2 and αi · αi±1 = −1 (so they are at angle 2π/3) and
all other inner products vanish. This is summarized in the matrix:
Then the map si → Pαi is an isomorphism of the Coxeter group An with a subgroup of
O(n + 1). Moreover, one computes that
ej
j 6= i, i + 1
Pαi (ej ) = ei+1 j=i (6.15)
e j =i+1
i
So, referring to equation (5.36) we see that this is just the permutation action of σi on
the standard basis of Rn+1 . This makes clear that the Coxeter group is isomorphic to the
symmetric group Sn+1 . ♣A presentation of
the Monster in
terms of generators
and relations is
Remarks known.(Atlas) Give
it here? ♣
1. One very practical use of having a group presented in terms of generators and relations
is in the construction of homomorphisms. If one is constructing a homomorphism
φ : G1 → G2 , then it suffices to say what elements the generators map to. That is, if
gi are generators of G1 we can fully specify a homomorphism by choosing elements
gi0 ∈ G2 (not necessarily generators) and declaring
However, we cannot choose the gi0 arbitrarily. Rather, the gi0 must satisfy the same
relations as the gi . This puts useful constraints on what homomorphisms you can
write down. For example, using this idea you can prove that there is no nontrivial
homomorphism φ : ZN → Z.
32
The notation here is standard but exceedingly unfortunate and confusing!!! Here An does NOT refer to
the alternating group! It refers to Cartan’s classification of simple Lie groups and the Coxeter group with
this label is in fact isomorphic to Sn+1 .
– 48 –
2. In general it is hard to say much about a group given a presentation in terms of
generators and relations. For example, it is not even obvious, in general, if the
group is the trivial group! This is part of the famous “word problem for groups.”
There are finitely presented groups where the problem of saying whether two words
represent the same element is undecidable! 33 However, for many important finitely
presented groups the word problem can be solved. Indeed, the word problem was first
formulated by Max Dehn in 1911 and solved by him for the surface groups discussed
below. ♣It would be more
effective here to
give an example of
3. Nevertheless, there are four Tietze transformations (adding/removing a relation, a set of generators
and relations that is
adding/removing a generator) which can transform one presentation of a group to a actually isomorphic
to the trivial group
different presentation of an isomorphic group. It is a theorem [REF!] that any two - but not obviously
so. ♣
presentations can be related by a finite sequence of Tietze transformations. How is
this compatible with the previous remark? The point is that the number f (n) of
such transformations needed to transform a presentation of the trivial group with n
relations into the trivial presentation grows faster than any recursive function of n.
4. It turns out that the case of Coxeter groups Bn = Cn are isomorphic to the group
of symmetric permutations WBn ⊂ S2n discussed in card-shuffling. The Coxeter
diagrams are very similar to the Dynkin diagrams that are used to label finite dimen-
sional simple Lie algebras over the complex numbers except that Hn and In do not
correspond to Lie algebras.
33
The Wikipedia article on “Word problem for groups,” is useful.
34
Answer : Since ZN can be generated by one element, say 1̄, it suffices to say what the value of φ(1̄)
is. The trivial homomorphism takes the generator to zero: φ(1̄) = 0 ∈ Z and hence takes every element
to zero. On the other hand, if φ(1̄) = k is a nonzero integer, then N k = N φ(1̄) = φ(N 1̄) = φ(0̄) = 0, a
contradiction. So there is no nontrivial homomorphism.
35
Answer : Let the generator be g. The relations must be of the form g k1 = 1, . . . , g kn = 1 for some
integers k1 , . . . , kn , and WLOG we can assume they are positive. Then it is not hard to see, using the
Chinese remainder theorem (see below) that the group is isomorphic to ZN where N = gcd(k1 , ..., kn ).
– 49 –
Exercise Simple Roots Of SU (n + 1)
a.) Verify equation (6.14). The matrix Cij is known as a Cartan matrix of SU (n + 1).
b.) Show that the vectors αi are all orthogonal to the all-one vector: v = (1, . . . , 1)
and that they span the orthogonal complement of v.
c.) Show that the permutation representation of Sn+1 separately preserves v and the
orthogonal complement of v. Thus, Rn+1 gives what is known as a reducible representation
of Sn+1 .
d.) Compute the action of Pαi on αj . Give the matrix representation relative to the
ordered basis {α1 , . . . , αn }. 36
is a presentation of S3
Exercise
Consider the group with presentation:
– 50 –
Remark: Denote the smallest cardinality of a set of generators of G by d(G). If G is
a finite and transitive permutation subgroup of Sn (meaning it acts transitively on some
set X) then there is a constant C such that
n
d(G) ≤ C √ (6.19)
logn
and if G is a primitive permutation group, meaning that it acts on a set X such that it
does not preserve any nontrivial disjoint decomposition of X, then there is a constant C
so that if n ≥ 3:
logn
d(G) ≤ C √ (6.20)
loglogn
Moreover, these results are asymptotically the best possible. For a review of such results
see. 37
These define paths in X with beginning and ending point fixed at x0 . The path must be
traveled in time 1.
37
F. Menegazzo, “The Number of Generators of a Finite Group,” Irish Math. Soc. Bulletin 50 (2003),
117128.
38
Answer : If G1 = hgi |Ri i and G2 = hha |Sa i then G1 × G2 = hgi , ha |Ri , Sa , gi ha gi−1 h−1
a = 1i.
– 51 –
Figure 5: Two loops f, g with basepoint at x0 .
Figure 6: The concatenation of the looops f ? g. Note that the “later” loop is written on the right.
This is generally a more convenient convention when working with homotopy and monodromy. In
order for f ? g to be a map from [0, 1] into X we should run each of the individual loops at “twice
the speed” so that at time t = 1/2 the loop returns to x0 . However, in homotoping f ? g there is
no reason why the point at t = 1/2 has to stay at x0 .
We say that two such paths f0 , f1 are homotopic if there is a continuous map
such that
2. F (s, 0) = F (s, 1) = x0
If we define fs (t) := F (s, t) and consider fs (t) as a path in t at fixed s then, as we vary
s we are describing a path of paths.
Now, homotopy of paths in X is an equivalence relation. 39 We denote by [f ] the
equivalence class of a path f and we denote the set of such equivalence class by π1 (X, x0 ).
We will see that this set has a natural and beautiful group structure.
39
See section 1.1 above for this notion.
– 52 –
Figure 7: The homotopy demonstrating that loop concatenation is an associative multiplication
on homotopy equivalence classes of closed loops. The blue line is s = 4t − 1 and the red line is
s = 4t − 2.
Figure 8: The homotopy demonstrating that the loop g(t) = f (1 − t) provides a representative for
the inverse of [f (t)].
Remarks
1. Note that we are composing successive paths on the right. This is slightly nonstandard
but a nice convention when working with monodromy and path ordered expontentials
of gauge fields - one of the main physical applications.
2. Note well that (f1 ? f2 ) ? f3 is NOT the same path as f1 ? (f2 ? f3 ). This observation
ultimately leads to the notion of A∞ spaces.
– 53 –
3. For the moment we simply notice that if we mod out by homotopy then we have a
well-defined product on homotopy classes in π1 (X, x0 )
5. A connected space such that π1 (X, x0 ) is the trivial group is called simply connected.
π1 (S 1 , x0 ) ∼
=Z (6.27)
which just measures the number of times the path winds around the circle. The sign of
the integer takes into account winding clockwise vs. counterclockwise.
Let us amplify a little on how one proves this basic fact. We will just give the main
idea. For a thorough and careful proof see A. Hatcher’s book on Algebraic Topology, or
Section 13.2 of
– 54 –
http://www.physics.rutgers.edu/∼gmoore/511Fall2014/Physics511-2014-Ch2-Topology.pdf
We have a standard map
p : R → S1 (6.28)
given by p(x) = e2πix . Note that the inverse image of any phase e2πis with s ∈ R is the
set of real numbers s + Z. So p is the map that identifies the orbits of the Z-action by
translation on R with S 1 . That is S 1 ∼
= R/Z. Now, suppose we have a map f¯ : [0, 1] → S 1 .
We claim that there is a map f : [0, 1] → R so that f¯ “factors through p,” meaning that
there is a map f : [0, 1] → R such that:
f¯ = p ◦ f (6.29)
The problem of finding such a map f is nicely expressed in terms of diagrams. One is
trying to complete the following diagram to make it a commutative diagram by finding a
suitable map f to use on the dashed line in:
=R (6.30)
f
p
f¯
[0, 1] / S1
π1 (X × Y, (x0 , y0 )) ∼
= π1 (X, x0 ) × π1 (Y, y0 ) (6.32)
– 55 –
𝒰𝒰+ −
𝑔𝑔𝑖𝑖+ −
𝒰𝒰+ 𝒰𝒰−
Figure 9: Illustrating the Seifert-VanKampen theorem. The green curve has a homotopy class
in U +− that is one of the generators of π1 (U +− ). Now it must separately be a word Wi+ in the
generators of π1 (U + ) and Wi− in the generators of π1 (U − ) so in π1 (X) there must be a relation of
the form Wi+ = Wi− .
3. In algebraic topology books a major result which is proved is the Seifert-van Kam-
pen theorem. This is an excellent illustration of defining groups by generators and
relations. The theorem can be useful because it allows one to compute π1 (X, x0 ) by
breaking up X into simpler pieces. Specifically, suppose that X = U + ∪ U − is a
union of two open path-connected subsets and that U +− := U + ∩ U − is also path-
connected and contains x0 . See Figure 9. Now suppose we know presentations of the
fundamental groups of the pieces U + , U − , U +− in terms of generators and relations:
π1 (U + , x0 ) ∼ +
= hg1+ , . . . , gn++ |R1+ , . . . , Rm +i
− ∼ − − −
π1 (U , x0 ) = hg , . . . , g − |R , . . . , R − i −
(6.33)
1 n 1 m
π1 (U +− , x0 ) ∼
= +− +− +− +−
hg1 , . . . , gn+− |R1 , . . . , Rm+− i
Then the recipe for computing π1 (X, x0 ) is this: Denote the injection ι+ : U +− → U +
and ι− : U +− → U − . Then the generators of π1 (U +− , x0 ) push forward to words in
gi+ or gi− , respectively:
+− +
ι+
∗ (gi ) := Wi i = 1, . . . , n+−
−
(6.34)
ι− +−
∗ (gi ) := Wi i = 1, . . . , n+−
π1 (X, x0 ) ∼
= hg1+ , . . . , gn++ , g1− , . . . , gn−− |Rα i (6.35)
– 56 –
and a set of new relations:
It is obvious that these are relations on the generators. What is not obvious is that
these are the only ones. Note that in the final presentation the generators gi+− and
the relations Ri+− have dropped out of the description.
Figure 10: The house that Bing built. Taken from M. Freedman and T. Tam Nguyen-Pham,
“Non-Separating Immersions Of Spheres and Bing Houses,” which describes nice mathematical
properties of this house.
Exercise
Show that if X = S n with n > 1 then π1 (X, x0 ) is the trivial group.
Exercise
Does the fundamental group depend on a choice of basepoint x0 ?
Exercise
What is the fundamental group of Serin Physics Laboratory?
– 57 –
Exercise The house that Bing built
Show that the house in Figure 10 can be shrink-wrapped with a single balloon so that
the complement of the balloon in R3 is connected and simply connected.
Figure 11: Right: Cutting a torus along the A and B cycles the surface falls apart into a rectangle,
shown on the left. Conversely, gluing the sides of the rectangle together produces a torus with
distinguished closed curves A, B.
Figure 12: Illustrating the Seifert-VanKampen theorem for the torus: The “tubular neighborhood”
- the green region - of the cutting curves, shown in (b) is homotopy equivalent to a one-point union
of two circles. The latter space has a π1 which is a free group on two generators. The boundary
of the green region contracts into the remainder of the surface - which can be deformed to a disk.
Therefore the group commutator [a, b] = 1.
– 58 –
Figure 13: A collection of closed paths at x0 which generate the fundamental group of a two-
dimensional surface with two handles and three (green) holes.
Definition: In general, in group theory an expression of the form g1 g2 g1−1 g2−1 is known as
a group commutator and is sometimes denoted [g1 , g2 ]. It should not be confused with the
commutator of matrices [A1 , A2 ] = A1 A2 − A2 A1 .
Returning to the fundamental group of the torus, the group commutator [a, b] it can
be contracted inside the rectangle to a point. Therefore, the generators a, b satisfy the
relation:
aba−1 b−1 = 1 (6.38)
– 59 –
so this means
ab = ba (6.39)
In fact, this is the only relation and therefore:
π1 (T 2 , x0 ) ∼
= Z ⊕ Z. (6.40)
Figure 14: Illustrating the Seifert-VanKampen theorem for a genus two surface: The “tubular
neighborhood” - the green region - of the cutting curves, shown in (b) is homotopy equivalent to
a one-point union of 4 circles. The latter space has a π1 which is a free group on four generators
which we can call a1 , b1 , a2 , b2 . The boundary of the green region is a single circle homotopic to
[a1 , b1 ][a2 , b2 ]. But it contracts into the remainder of the surface - which can be deformed to a disk.
Therefore by the Seifert-van Kampen theorem the presentation of π1 of the genus two surface has
a single relation [a1 , b1 ][a2 , b2 ] = 1.
The above ideas generalize nicely. Let us consider the case of a genus 2, or 2-handled,
surface shown in Figure 14. The fundamental group can be presented as a group with four
generators a1 , b1 , a2 , b2 and one relation:
Now let us consider a more complicated surface, perhaps with punctures as shown in
Figure 15. By cutting along the paths shown there the surface unfolds to a presentation
by gluing as in Figure 16:
From these kinds of constructions one can prove 40 that the fundamental group of an
orientable surface with g handles and p punctures will be
g
Y p
Y
π1 (S, x0 ) = hai , bi , cs | [ai , bi ] cs = 1i (6.42)
i=1 s=1
40
See, for example, W. Massey, Introduction to Algebraic Topology, Springer GTM
– 60 –
Figure 15: A collection of closed paths at x0 which generate the fundamental group of a two-
dimensional surface with two handles and three (green) holes.
There is only one relation so this is very close to a free group! In fact, for p ≥ 1 we can
solve for one generator cs in terms of the rest so the group is just a free group on 2g + p − 1
generators. When there are no punctures the group is not a free group. Groups of the
form (6.42) are sometimes called surface groups.
As mentioned above, a flat connection amounts to a representation of this group - so
one is searching for matrices Ai , Bi , Cs such that
Y Y
(Ai Bi A−1 −1
i Bi ) Cs = 1 (6.43)
i s
– 61 –
Figure 16: When the directed edges are identified according to their labels the above surface
reproduces the genus two surface with three punctures. Since the disk is simply connected we
derive one relation on the curves shown here.
Exercise
Use the Seifert-van Kampen theorem to relate the fundamental group of a torus to
that of a torus with a disk cut out.
– 62 –
Figure 17: Pictorial illustration of the generator σi of the braid group Bn .
obtain a group called the nth braid group. It is generated by the overcrossing σ̃i of strings
(i, i + 1), for 1 ≤ i ≤ n − 1 and may be pictured as in Figure 17. Note that σ̃i−1 is the
undercrossing.
Now one verifies the relations
– 63 –
and
σ̃i σ̃i+1 σ̃i = σ̃i+1 σ̃i σ̃i+1 (6.47)
where the relation (6.47) is illustrated in Figure 18.
The braid group Bn may be defined as the group generated by σ̃i subject to the relations
(6.46)(6.47):
Bn := hσ̃1 , . . . , σ̃n−1 |σ̃i σ̃j σ̃i−1 σ̃j−1 = 1, |i − j| ≥ 2; σ̃i σ̃i+1 σ̃i = σ̃i+1 σ̃i σ̃i+1 i (6.48)
The braid group Bn may also be defined as the fundamental group of the space of
configurations of n unordered points on the disk D. We first consider the set:
Then we observe that there is a group action of Sn on this set. Note that this set is not
simply connected: For example if we let x1 loop around x2 holding all other xi fixed it
should be intuitively clear that the loop cannot be deformed to the trivial loop. That is
even more clear if you view the looping process as taking place in time on particles in a
plane.
Now we consider the space of orbits under this group action:
There are new nontrivial loops here where, for example, xi and xj exchange places, all ♣Since we must
quotient by Sn this
other xk staying fixed. needs to be moved
to the section on
Note that the “only” difference from the presentation of the symmetric group is that group actions on
spaces. ♣
we do not put any relation like (σ̃i )2 = 1. Indeed, Bn is of infinite order because σ̃in keeps
getting more and more twisted as n → ∞.
Remarks
1. In the theory of integrable systems the relation (6.47) is closely related to the “Yang-
Baxter relation.” It plays a fundamental role in integrable models of 2D statistical
mechanics and field theory.
– 64 –
41 In mathematical terms, there is a group action of Sn on a set of n indistinguish-
able particles leaving the physical system “the same.” In quantum mechanics this
translates into the statement that the Hilbert space of a system of n indistinguishable
particles should be a representation of (a central extension of) Sn . There are many
different representations of Sn (we have already encountered three different ones).
Most of them are higher dimensional. Particles transforming in higher-dimensional
representations are said to satisfy “parastatistics.” (This idea goes back to B. Green
in the 1950’s and Messiah and Greenberg in the 1960’s.) However, remarkably, in
relativistically invariant theories in spacetimes of dimension larger than 3 particles
are either bosons or fermions. This is related to the classification of the projective rep-
resentations of SO(d, 1), where d is the number of spatial dimensions, for relativistic
systems and to representations of SO(d) for nonrelativistic systems. (We will discuss
projective representations in section **** below.) Now, when discussing projective
representations the fundamental group of SO(d, 1) and SO(d) becomes important.
In fact π1 (SO(d, 1)) ∼= π1 (SO(d)) for d ≥ 2. However, there is a fundamental differ-
ence between d ≤ 2 and d > 2. The essential point is that the fundamental group
π1 (SO(2)) ∼ = Z is infinite while π1 (SO(d)) ∼
= Z2 for d ≥ 3. A consequence of this,
and other principles of physics is that in 2 + 1 and 1 + 1 dimensions, particles with
“anyonic” statistics can exist. 42 Anyons are defined by the property that, if we just
consider the wavefunction of two identical such particles, Ψ(z1 , z2 ) where z1 , z2 are
points in the plane and then we adiabatically switch their positions using the kind of
braiding that defines σ̃ then
Unlike bosons and fermions where θ = 0, πmod2π, respectively, for “anyons” the
phase can be anything - hence the name. There are even physical realizations of
this theoretical prediction in the fractional quantum Hall effect. Moreover, quantum
wavefunctions should transform in representations of the braid group. The law (6.51)
leads to the 1-dimensional representation σ̃ → eiθ but there can also be more interest-
ing “nonabelian representations.” That is, there can be interesting irreducible rep-
resentations of dimension greater than one, and if wavefunctions transform in such
representations there can be nonabelian statistics. The particles should be called
nonabelions. There are some theoretical models of fractional quantum Hall states in
which this takes place. 43 Nonabelions are of potentially great importance because of
their possible use in quantum computation, an observation first made by A. Kitaev.
41
See Chapters 24 and 25 in the book Einstein and the Quantum by A.D. Stone for a nice historical
account of the importance of this discovery in the development of quantum mechanics.
42
The possible existence of anyons was pointed out by Leinaas and Myrheim in 1977. Another early
reference are the papers of Goldin, Melnikof and Sharp. The term “anyon” was invented in F. Wilczek,
”Quantum Mechanics of Fractional-Spin Particles”. Physical Review Letters 49 (14): 957959. For comments
on the early history of these ideas see “The Ancestry of the ‘Anyon’ ” in Physics Today, August 1990, page
90.
43
Important work on the compatibility of nonabelions with spin-statistics theorems was done by Jurg
Fröhlich. Perhaps the first concrete proposal of a physical system with nonabelions is that of G.W. Moore
– 65 –
Here are some sources for more material about anyons: ♣Need to keep
updating and
refining this
reference list. ♣
1. There are some nice lecture notes by John Preskill, which discuss the potential rela-
tion to quantum computation and quantum information theory: http://www.theory.caltech.edu/˜preski
2. A. Stern, ”Anyons and the quantum Hall effectA pedagogical review”. Annals of
Physics 323: 204; arXiv:0711.4697v1.
and N. Read, “Nonabelions in the fractional quantum hall effect.” Nuclear Physics B360, 1991. It was
inspired, in part, by the work of G. Moore and N. Seiberg, “Polynomial Equations For RCFT” and “Clas-
sical and Quantum Conformal Field Theory,” Commun.Math.Phys. 123 (1989) 177 which gave the first
description of a modular tensor category.
– 66 –
there are topologically interesting diffeomorphisms of the torus that are used when gluing
together the two bagels: Note that if you glue with the identity transformation you get
S 2 × S 1 , not S 3 .
Indeed a vast number of new constructions comes of three-manifolds comes from ideas
of “surgery.” An important point is that there are typically many nontrivial diffeomor-
phisms of a surface. Let us see this with the case of a torus.
We consider the torus to be the space of orbits R2 /ZxZ with coordinates (σ 1 , σ 2 ) with
σ ∼ σ i + 1. Then consider the transformation
i
σ 1 → aσ 1 + bσ 2
(6.52)
σ 2 → cσ 1 + dσ 2
– 67 –
there are a lot of nontrivial diffeomorphisms of a surface to itself, and it can be hard to
recognize two equivalent 3-folds constructed from different Heegaard splittings.
Thus, any three-manifold Y , π1 (Y, y0 ) admits a presentation where the number of
generators is equal to the number of relations. The existence of such a presentation is not
possible for a general finitely generated group. So, at least in three dimensions it is not
true that any finitely generated group is the fundamental group of some three-dimensional
manifold.
Theorem Any finitely generated group G is the fundamental group of some four-manifold.
G∼
= hg1 , . . . , gn |R1 , . . . , Rm i (6.54)
We aim to produce a four-manifold M4 with fundamental group G. First, consider the free
group on one generator hgi ∼
= Z. A good manifold that has this as a fundamental group is
1 3
X4 = S × S . Now let us consider M̃4 := X4 # · · · #X4 . Then
π1 (M̃4 ) ∼
= hg1 , . . . , gn i (6.55)
is the free group with generators gi corresponding to the simple loops around the S 1 factor
in each summand (extended to some common basepoint). Now, each relation Rα is a ♣Need to explain
why it is a free
word in the gi and thus can be represented by some closed based loop `α ⊂ M̃4 . We can group and not a free
abelian group! ♣
take the `α to be nonintersecting, by simple codimension arguments. Now, take a tubular
neighborhood N (`α ) of `α . By our discussion above of the local picture of submanifolds it
is diffeomorphic to N (`α ) ∼ = S 1 × D3 , where D3 is the 3-dimensional ball. The boundary is
thus ∂N (`α ) ∼= S 1 × S 2 . This is also the boundary of D2 × S 2 . So, glue in a copy of D2 × S 2
along the boundary of N (`α ). This procedure is known as surgery. Now the loop S 1 in
S 1 × D3 (which was representing the word Rα ) becomes contractible! Thus it is a relation
on the generators gi in the new manifold. We can choose the tubular neighborhoods around
the different loops `α to be nonintersecting, and hence we can perform surgeries on each
of these loops without interference. If we do this for all the loops we produce our manifold
M4 . By the Seifert-van Kampen theorem it follows that the fundamental group of M4 is
exactly G. ♠
Since finitely presented groups cannot be classified it follows that four-manifolds cannot
be classified, even up to homotopy type.
– 68 –
Definition 7.1.1: Let H ⊆ G be a subgroup. The set
gH ≡ {gh|h ∈ H} ⊂ G (7.1)
is called a left-coset of H.
1 · H = {1, (12)}
(12) · H = {(12), 1} = {1, (12)} = H
(13) · H = {(13), (123)}
(7.2)
(23) · H = {(23), (132)}
(123) · H = {(123), (13)} = {(13), (123)} = (13) · H
(132) · H = {(132), (23)} = {(23), (132)} = (23) · H
Claim: Two left cosets are either identical or disjoint. Moreover, every element g ∈ G
lies in some coset. That is, the cosets define an equivalence relation by saying g1 ∼ g2 if
there is an h ∈ H such that g1 = g2 h. Here’s a proof written out in excruciating detail. 44
First, g is in gH, so every element is in some coset. Second, suppose g ∈ g1 H ∩ g2 H.
Then g = g1 h1 and g = g2 h2 for some h1 , h2 ∈ H. This implies g1 = g2 (h2 h−1
1 ) so g1 = g2 h
for an element h ∈ H. (Indeed h = h2 h−1 1 , but the detailed form is not important.) By
the rearrangement lemma hH = H, and hence g1 H = g2 H.
The basic principle above leads to a fundamental theorem:
Theorem 7.1.1 (Lagrange) If H is a subgroup of a finite group G then the order of
H divides the order of G:
|G|/|H| ∈ Z+ (7.3)
Proof : If G is finite G = qm1 gi H for some set of gi , leading to distinct cosets. Now
note that the order of any coset is the order of H:
Proof : Choose a nonidentity element g ∈ G and consider the subgroup generated by g i.e,
{1, g, g 2 , g 3 , . . . } (7.5)
44
In general, the reader should provide these kinds steps for herself or himself and we will not spell out
proofs in such detail.
– 69 –
The order of this group must divide |G| so if |G| = p is prime it must be the entire group.
♠
Definition 7.1.2: If G is any group and H any subgroup then the set of left cosets of H
in G is denoted G/H. It is the set of orbits under the right H action on G. A set of the
form G/H is also referred to as a homogeneous space. The order of this set is the index of
H in G, and denoted [G : H].
Theorem 7.1.2: (Sylow’s (first) theorem). Suppose p is prime and pk divides |G| for a
nonnegative integer k. Then there is a subgroup H ⊂ G of order pk .
Herstein’s book, sec. 2.12, waxes poetic on the Sylow theorems and gives three proofs.
We’ll give a proof as an application of the class equation in section 9 below. Actually,
Sylow has a bit more to say. We will explain some more about this in the next section.
45
Answer: One possible example is A4 , which has order 12, but no subgroup of order 6. By examining the
table of groups below we can see that this is the example with the smallest value of |G|. Sylow’s theorem
(discussed below) states that if a prime power pk divides |G| then there is in fact a subgroup of order pk .
This fails for composite numbers - products of more than one prime. Indeed, the smallest composite number
is 6 = 2 · 3. Thus, in regard to a hypothetical converse to Lagrange’s theorem, as soon as things can go
wrong, they do go wrong.
– 70 –
Definition: Thus far we have repeatedly spoken of the “order of a group G” and of various
subsets of G, meaning simply the cardinality of the various sets. In addition a common
terminology is to say that an element g ∈ G has order n if n is the smallest natural number
such that g n = 1.
Exercise Subgroups of A4
Write down all the subgroups of A4 . Draw a diagram indicating how these are sub-
groups of each other.
Exercise
Suppose a finite group G has subgroups Hi , i = 1, . . . , s of order hi where the hi are
Q
all mutually relatively prime integers. Show that i hi divides |G|.
7.2 Conjugacy
Now introduce a notion generalizing the idea of similarity of matrices:
46
One possible answer : Take Z or Zn or ....
47
One possible answer : Z × ZN . Another possible answer is G = U (1).
48
One possible answer : Regard U (1) as the group of complex numbers of modulus one. Let G be the
subgroup of complex numbers so that z N = 1 for some integer N . This is the group of all roots of unity of
any order. It is clearly an infinite group, and by its very definition every element has finite order. Using
the notation of the next section, this group is isomorphic to Q/Z.
– 71 –
Definition 7.2.1 :
a.) A group element h is conjugate to h0 if ∃g ∈ G h0 = ghg −1 .
b.) Conjugacy defines an equivalence relation and the conjugacy class of h is the
equivalence class under this relation:
Example 7.2.1 : We showed above that all cyclic permutations in Sn are conjugate.
However, once we have taken this into account we are done: The set of conjugacy classes in
U (N ) is the set of unordered N -tuples of phases.. We can make this assertion plausible by
noting that all the traces Tr(uk ) are invariant under conjugation, so two conjugate diagonal
matrices must provide simultaneous solutions to
N
X
zik = wk (7.10)
i=1
for k ∈ Z. This can only be the case if they are related by permutation.
The set of conjugacy classes is therefore a space of orbits U (1)N /SN .
Now suppose we have two commuting unitary matrices. Again, basic linear algebra
(explained in Chapter two) shows that they can be simultaneously diagonalized. That is, if
u1 , u2 ∈ U (N ) and [u1 , u2 ] = 1 (group commutator) then there is a single g ∈ U (N ) with
gui g −1 = Di (7.11)
– 72 –
Example 7.2.3 : Now consider G = GL(n, C). We must stress that not all matrices are
diagonalizable, so that the full description of conjugacy classes is more complicated. For
any matrix A ∈ Mn (C) we can define its characteristic polynomial
It is very important to note that the eigenvectors might not form a basis,. Here is a simple
and basic example: !
01
A= (7.15)
00
Then one easily checks pA (x) = x2 . So an eigenvector would satisfy Av = 0. But if there
were a basis of eigenvectors then A = 0, a contradiction. The general statement is that any
matrix is conjugate to its Jordan canonical form. See section 10.4 of Chapter 2. Briefly,
we define the Jordan block with eigenvalue λ;
(k)
Jλ = λ1 + N (k) (7.16)
ϕ : G → GL(n, κ) (7.20)
– 73 –
We say two matrix representations are equivalent representations if the two homomorphisms
are conjugate.
Definition: A class function on a group is a function f on G (it can be valued in any
set) such that f takes the same values on conjugate group elements:
is an example of a class function. This function is called the character of the representation.
Note that two equivalent representations must have the same character.
– 74 –
b.) Show that the set of conjugacy classes in SU (2) can be identified with S 1 /Z2 =
[0, π].
c.) Show that the most general continuous homomorphism ϕ : U (1) → SU (2) looks
like !
|α|2 z + |β|2 z −1 αβ(z −1 − z)
ϕ:z→ (7.24)
α∗ β ∗ (z −1 − z) |α|2 z −1 + |β|2 z
where (α, β) ∈ C2 satisfy |α|2 + |β|2 = 1 and z ∈ U (1). (Hint: Show that all continuous
homomorphisms ϕ : U (1) → SU (2) are conjugate.)
d.) Show that the conjugacy class of a matrix A ∈ M2 (C) is not uniquely determined
by the values of Tr(Ak ), but it is if A is diagonalizable. (k = 1, 2 will suffice.)
ϕ2 (u) = u ⊗ u (7.27)
Of course θ ∼ θ + 2π. However conjugation by iσ 1 shows that θ and θ + π define conjugate elements. So,
T r(u) = 2 cos θ determines θ up to θ ∼ θ + π.
50
Answer : They are equivalent: Conjugation by iσ 2 is equivalent to complex conjugation in SU (2):
51
Answer : They are inequivalent. By the spectral theorem u ∈ U (2) can be conjugated to a diagonal
matrix Diag{z1 , z2 } with z1 , z2 ∈ U (1). The character of ϕ1 is χ1 (u) = z1 + z2 . The character of ϕ2 is
χ2 (u) = z1−1 + z2−1 . For general elements of U (2) these are different so the character functions are different.
– 75 –
Are these representations equivalent or inequivalent? 52
gN g −1 = N ∀g ∈ G (7.28)
Sometimes this is denoted as N / G.
Warning! Equation (7.28) does not mean that gng −1 = n for all n ∈ N !
There is a beautiful theorem associated with normal subgroups. In general the set of
cosets of a subgroup H in G, denoted G/H, does not have any natural group structure. 53
However, if H is normal something special happens:
Theorem 7.2.1. If N ⊂ G is a normal subgroup then the set of left cosets G/N =
{gN |g ∈ G} has a natural group structure with group multiplication defined by:
Remarks:
1. All subgroups N of Abelian groups A are normal, and moreover the quotient group
A/N is Abelian.
2. Groups of the form G/N are known as quotient groups. A very common source of
error and confusion is to mix up quotient groups and subgroups. They are very
different!
– 76 –
Example 7.2.1 Cyclic Groups For example nZ ⊂ Z is normal, and the quotient group
is Z/nZ. This is isomorphic to the cyclic group we have previously denoted as µn or Zn .
So r̄ is the equivalence class of an integer r ∈ Z:
r̄ = r + nZ (7.30)
r̄ + s̄ = (r + s) + nZ (7.31)
Xd
H := { ni fi |ni ∈ Z} (7.33)
i=1
H is clearly a subgroup of G, so we can form the quotient group G/H. If detA 6= 0 then
in fact G/H is a finite group. One way to see this easily is to consider G as a subgroup of
Qd , so that we can write
ei = A−1
ij fj (7.34)
with A−1 ∈ GL(d, Q). Recall that A−1 = (detA)−1 Cof (A) where the cofactor matrix
Cof (A) is a matrix of minors, and therefore is a matrix of integers. Therefore (detA)ei ∈ H
and hence detA[ei ] = 0 in the quotient group so every element of G/H has a representative
P
of the form [ i xi ei ] with |xi | < |detA|.
Actually, if we invoke a nontrivial theorem we can say much more: The matrix A can
be put into Smith normal form. This means that there are matrices S, T ∈ GL(d, Z), ♣Should give a
sketch of a proof of
representing change of generators (i.e. change of basis of the Z-module ) of H and G so SNF in a
supplementary
that section. ♣
with αi = di /di−1 where d0 = 1 and di for i > 0 is the g.c.d. of the i × i minors. (For a
proof see the Wikipedia article.) Then
G/H ∼
= Zα1 × · · · Zαd (7.36)
Note that it has order |G/H| = detA. Here is a good example of the difference between a
quotient group and a subgroup: No nontrivial finite group will be a subgroup of Zd .
– 77 –
simplicity we will assume it is full rank, that is, the {vi } form a basis for Rd over R. We
denote it by Λ, so
Xd
Λ := { ni vi |ni ∈ Z} ⊂ Rd (7.37)
i=1
We define the dual lattice (closely related to the “reciprocal lattice” in solid state physics)
as the set of vectors w ∈ Rd such that w · v ∈ Z for all v ∈ Λ:
Λ∨ := {w ∈ Rd |∀v ∈ Λ v · w ∈ Z} (7.38)
Now assume that Λ is an integral lattice. This means that the matrix of inner products
Gij = vi ·vj is a d×d matrix of integers. (Note it is symmetric and of nonzero determinant.)
Then it follows that Λ ⊂ Λ∨ is a sublattice. The discriminant group of Λ is the finite group
D := Λ∨ /Λ (7.39)
Note that Λ∨ has a basis fi with vi = Gij fj so one can work out D as a product of cyclic
groups using the Smith normal form of Gij .
but conjugation by (12) induces a nontrivial permutation of the set A3 . The group S3 /A3
has order 2 and hence must be isomorphic to Z2 .
Example 7.2.5. Of course, in any group G the subgroup {1} and G itself are normal
subgroups. These are the trivial normal subgroups. It can happen that these are the only
normal subgroups of G:
Remarks
2. The term is a bit of a misnomer: Some simple groups are pretty darn complicated.
What it means is that there is no means of simplifying it using something called
the Jordan-Holder decomposition - discussed below. Simple groups are extremely
important in the structure theory of finite groups. One example of simple groups are
the cyclic groups Z/pZ for p prime. Can you think of others?
– 78 –
3. Sylow’s theorems again. Recall that Sylow’s first theorem says that if pk divides |G|
then G has a subgroup of order pk . If we take the largest prime power dividing |G|,
that is, if |G| = pk m with m relatively prime to p then a subgroup of order pk is called
a p-Sylow subgroup. Sylow’s second theorem states that all the p-Sylow subgroups
are conjugate. The third Sylow theorem says something about how many p-Sylow
subgroups there are.
4. WARNING!: In the theory of Lie groups you will find the term “simple Lie group.”
A simple Lie group is NOT a simple group in the sense we defined above ! For example
SU (2) is a simple Lie group. But it has a nontrivial center namely the two diagonal
SU (2) matrices {±12×2 }.
Example 7.2.6. Recall that SL(n, κ) ⊂ GL(n, κ), SO(n, κ) ⊂ O(n, κ), and SU (n) ⊂
U (n) are all subgroups defined by the condition detA = 1 on a matrix. Note that, since
det(gAg −1 ) = detA for any invertible matrix g these are in fact normal subgroups. The
quotient groups are
GL(n, κ)/SL(n, κ) ∼
= κ∗
O(n, R)/SO(n, R) ∼= Z2 (7.42)
U (n)/SU (n) ∼
= U (1)
Lines 1 and 3 follow since every element in GL(n, κ) can be written as zA with z ∈ κ∗ and
A ∈ SL(n, κ). For line 2 take P to be any reflection in any hyperplane orthogonal to some
vector v, then O(n, R) = SO(n, R) q P SO(n, R) because detP = −1. Recall that Pv1 Pv2 is
a rotation in the plane spanned by v1 , v2 , so it doesn’t matter which hyperplane we choose.
Example 7.2.7. Let G be a topological group. Let G0 be the (path-) connected component
of the identity element 1G ∈ G. We claim that G0 is a normal subgroup: If g0 ∈ G0 there
is a continuous path of group elements γ : [0, 1] → G with γ(0) = 1G and γ(1) = g0 .
Then if g ∈ G is any other group element gγ(t)g −1 is a continuous path connecting 1G to
gg0 g −1 . The quotient group G/G0 is the group of components, sometimes denoted π0 (G).
In general for a topological space X, the set of connected components is denoted by π0 (X),
but it carries no natural group structure.
For some examples:
– 79 –
connected: The sign of the determinant is the only obstruction to deformation to the
identity. 54
3. Very similar considerations hold for O(n, R). One can show that SO(n, R) is the
connected component of the identity and π0 ∼ = Z2 . For example, consider O(2). In
an exercise above you showed that as a manifold it has two components, each of which
can be identified with a circle. The connected component of the identity is SO(2).
We have O(2) = SO(2) q SO(2)P where P is any O(2) matrix of determinant = −1.
So π0 (O(2)) ∼
= Z2 . Similarly, π0 (O(n)) ∼
= Z2 .
∼ GL(2, Z). Indeed there is a subgroup of Dif f (T 2 )
4. One can show that π0 (Dif f (T 2 )) =
isomorphic to GL(2, Z) of diffeomorphisms
! ! !
σ1 a b σ1
7→ (7.43)
σ2 c d σ2
Example 7.2.8. The center of U (N ) consists of matrices proportional to the unit matrix.
See the exercise below. Elements in the center of SU (N ) must also be diagonal. However,
now if z1N ×N is to be in SU (N ) then z N = 1 (why?) so Z(SU (N )) ∼ = µN ∼= ZN . Since
this subgroup is normal we can take a quotient and get another group. It is known as
P SU (N ) := SU (N )/ZN (7.44)
54
The theory of fiber bundles shows that if H is a Lie subgroup of G so that G/H is a topological space
then if H is connected the set of components of G can be identified with the set of components of G/H.
We will see later from the stabilizer orbit theorem that SO(n + 1)/SO(n) = S n , so we can prove SO(n)
is connected by induction on n. By Gram-Schmidt procedure SL(n, R) is a product of SO(n) and upper
triangular matrices with unit diagonal - and this space is connected. So SL(n, R) is connected. Then the
set of components of GL(n, R) is that of R∗ , and this is measured by the determinant.
55
Answer : The main thing to check is that the product law defined by (7.29) is actually well defined.
Namely, you must check that if g1 N = g10 N and g2 N = g20 N then g1 g2 N = g10 g20 N . To show this note that
g10 = g1 n1 and g20 = g2 n2 for some n1 , n2 ∈ N . Now note that g10 g20 = g1 n1 g2 n2 = g1 g2 (g2−1 n1 g2 )n2 . But,
since N is normal (g2−1 n1 g2 ) ∈ N and hence (g2−1 n1 g2 )n2 ∈ N and hence indeed g1 g2 N = g10 g20 N . Once
we see that (7.29) is well-defined the remaining checks are straightforward. Essentially all the basic axioms
are inherited from the group law for multiplying g1 and g2 . Associativity should be obvious. The identity
is 1G N = N and the inverse of gN is g −1 N . etc. ♠
– 80 –
b.) Consider the right cosets. Show that N \G is a group.
Exercise
Look at the 3 examples of homogeneous spaces G/H in section 7.1. Decide which of
the subgroups H is normal and what the group G/H would be.
Exercise
Show that if the center Z(G) is such that G/Z(G) is cyclic then G is Abelian. 58
56
Answer : Suppose G = H q g0 H. Then take any h ∈ H. The element g0 hg0−1 must be in H or g0 H.
But if it were in g0 H then there would be an h0 ∈ H such that g0 hg0−1 = g0 h0 but this would imply g0 is
in H, which is false. Therefore, for all h ∈ H, g0 hg0−1 ∈ H, and hence H is a normal subgroup. Therefore
G/H ∼ = Z2 .
57
Answer : Note that the order of |An | is even and hence 12 |An | is a divisor of |An |. However, a subgroup
of order |An |/2 would have to be a normal subgroup, and hence does not exist, since An is simple. More
generally, a high-powered theorem, known as the Feit-Thompson theorem states that a finite simple non-
abelian group has even order. Therefore if G is a finite simple nonabelian group there is no subgroup of
order 21 |G|, even though this is a divisor.
58
Answer : Every element of G would be of the form g0n z with z ∈ Z. But then it is easy to check:
g0 zg0m z 0 = g0m z 0 g0n z so G is Abelian. So G = Z(G), and in fact the cyclic subgoup must be trivial.
n
– 81 –
Exercise Sylow subgroups of A4
Write down the 2-Sylow and 3-Sylow subgroups of A4 .
– 82 –
a.) Show that the subgroup of W (Bn ) of diagonal matrices is a normal subgroup
isomorphic to Zn2 .
b.) Show that every signed permutation matrix can be written in the form D · Π where
D is a diagonal matrix of ±1’s and Π is a permutation matrix.
c.) Conclude that the quotient of W (Bn ) by the normal subgroup of diagonal matrices
is isomorphic to Sn .
d.) Show that every signed permutation can also be written as Π0 · D0 . How is this
decomposition related to writing it as D · Π.
♣This exercise
should go to the
first section when
we introduce the
matrix groups. ♣
63
Answer : Only {1}, G1 × {1G2 } , {1G1 } × G2 and G1 × G2 .
64
Answer : There are many proofs but one nice one is to use induction on N . First establish the result for
U (2) - here the matrix multiplication is easy and this can be done by hand. Now suppose that ζ ∈ U (N + 1)
is in the center. Decompose it as follows !
A v
ζ=
w D
where A ∈ M2 (C), v ∈ M2×N −1 (C), w ∈ MN −1×2 (C), and D ∈ MN −1×N −1 (C). Now insist that it commute
with !
u 0
0 1
with u ∈ U (2) to show that uAu−1 = A, uv = v and wu−1 = w for all u ∈ U (2). These equations imply A
is diagonal and v, w = 0.
– 83 –
Exercise The Normalizer Subgroup
If H ⊂ G is a subgroup then we define the normalizer of H within G to be the largest
subgroup N of G such that H is a normal subgroup of N . Note that H is normal inside
itself so such subgroups exist. If N1 , N2 ⊂ G are subgroups and H is normal in both then
they generate a subgroup in which H is normal. In fact we have:
Exercise An Important Class of Quotient Groups: The Weyl Groups Of Lie Groups
a.) Let D ⊂ SU (2) be the subgroup of diagonal matrices. Note that D ∼ = U (1).
Compute
NSU (2) (D) (7.46)
explicitly. 65
b.) Compute the quotient group NSU (2) (D)/D. 66
c.) Show that conjugation by elements of the normalizer act by a permutation of the
diagonal elements and the permutation only depends on the projection to the quotient.
d.) Show that there is no subgroup of NSU (2) (D) whose conjugation on D induces the
permutation action.
For example, in SU (n) any maximal torus is conjugate to the subgroup D ⊂ SU (n) of
diagonal matrices. In this case, conjugation by NSU (n) (D) acts on D by permutation of
the diagonal elements and in fact
– 84 –
Note that the Weyl group is defined as a quotient of a subgroup of G. (Often this is
abbreviated to “subquotient of G.”) In general there is no subgroup of G whose conjugation
action induces the Weyl group action on T . (It is a common mistake to confuse W (G)
with a subgroup of G.)
𝑅𝑅
𝑄𝑄
𝑃𝑃
−𝑅𝑅 = 𝑃𝑃 + 𝑄𝑄
Figure 19: In a suitable range of real values of f, g the real points on the elliptic curve have the
above form. Then the elliptic curve group law is easily pictured as shown.
– 85 –
subgroup of complex numbers of the form n1 + τ n2 where n1 and n2 are integers. Since C
is abelian we can form the Abelian group C/Z + τ Z. Note that Z + τ Z is a rank two lattice
in the plane so that this quotient space can be thought of as a torus. As an Abelian group
this group is isomorphic to U (1) × U (1). The explicit isomorphism is
A remarkable fact is that this torus (minus one point) can be thought of as the space
of solutions of the algebraic equation
y 2 = x3 + f x + g (7.50)
69
We can restore the point at infinity using projective geometry. The equation ZY 2 = X 3 + f XZ 2 + gZ 3
makes sense for a point [X : Y : Z] ∈ CP2 . Indeed note that the equivalence relation says that [X : Y :
Z] = [λX : λY : λZ] and the equation is homogeneous and of degree three. The equation (7.50) is the
equation we get in the patch Z 6= 0 where we can fix the scaling degree of freedom by choosing λ so that
Z = 1. We then define x, y by [x : y : 1] = [X : Y : Z] = [X/Z : Y /Z : 1] which makes sense when Z 6= 0.
The point at infinity has Z = 0. Therefore, by the equation X = 0, and since we have a point in CP2 we
must have Y 6= 0, which can therefore be scaled to y = 1. So, the point at infinity is [0 : 1 : 0] ∈ CP2 .
– 86 –
are absolutely convergent and hence holomorphic functions of τ for k ≥ 1 when Imτ 6= 0.
They are famous functions known as Eisenstein functions and are basic examples of a
fascinating set of functions known as modular forms. In order to produce equation (7.50)
we will take x = ℘(z|τ ) and define
∂
y := ℘(z|τ )
∂z (7.55)
−2
= 3 + 6G4 z + 20G6 z 3 + · · ·
z
Now a small amount of algebra shows that we have the series expansion
So the combination y 2 − 4x3 + 60G4 x is entire, i.e., holomorphic in the entire complex
plane, and doubly-periodic. But then by Liouville’s theorem it must be constant! Thus all
the higher terms in the series vanish! Thus we have:
x3 − s2 x2 + (f − 2sd)x + (g − d2 ) = 0 (7.61)
xR = s2 − xP − xQ (7.63)
– 87 –
so we have xR explicitly as a function of xP , xQ , yP , yQ . Now the point (xR , yR ) must lie
on the line y = sx + d so we can also say that
yR = yP + s(xR − xP ) (7.64)
– 88 –
∞
4k X n2k−1 q n
E2k (τ ) = 1 − (7.72)
B2k 1 − qn
n=1
where B2k is the Bernoulli number (as defined in Wikipedia).
Example In S3 there are two cycles of length 3 and they are indeed conjugate:
Now recall that any element in Sn can be written as a product of disjoint cycles.
Examples:
Note that these can be conjugated into each other by suitable transpositions. So this
conjugacy class is determined by
`1 = 0 `2 = 2 `3 = 0 `4 = 0 (7.77)
– 89 –
In general we can denote a conjugacy class in Sn by:
Then, since we must account for all n letters being permuted we must have:
n
X
n = 1 · `1 + 2 · `2 + · · · n · `n = j · `j (7.79)
j=1
Therefore:
Example For n = 4, 5 p(4) = 5 and p(5) = 7 and the conjugacy classes of S4 and S5 are:
70
This is a term in number theory. It is not to be confused with the “partition function” of a field theory!
– 90 –
Cycle decomposition |C(g)| Typical g Order of g
(1)5 1 1 1
5
(1)3 (2)
2 = 10 (ab) 2
(1)2 (3) 2 · 53 = 20 (abc) 3
6 · 54 = 30
(1)(4) (abcd) 4
(1)(2)2 5 · 12 42 = 15
(ab)(cd) 2
2 · 52 = 20
(2)(3) (ab)(cde) 6
(5) 4! = 24 (abcde) 5
1
H = (p2 + ω 2 q 2 ) (7.80)
2
(we have scaled away the mass). The classical Poisson bracket {p, q} = 1 is quantized by
postulating there are operators p̂, q̂ with
– 91 –
and there is a Hilbert space representing this operator algebra. One forms the linear
combinations:
1
a := √ (ω q̂ + ip̂)
2~ω
(7.82)
1
ā := √ (ω q̂ − ip̂)
2~ω
so that
[a, ā] = 1 (7.83)
The operator algebra is a ∗-algebra so there is a C-antilinear map with q̂ ∗ = q̂ and p̂∗ = p̂.
So then ā = a∗ .
If we just consider the operator algebra without considering H then there are a number
of different ways to represent it. We could postulate there is a vector |0i with a|0i = 0.
Then the Hilbert space is spanned by ān |0i and ā = a† . But note that we could make a
linear transformation to
b = αa + βā
(7.84)
b̄ = γa + δā
and this will preserve the commutation relations: [b, b̄] = 1, if αδ − βγ = 1. If we preserve
the ∗ structure so that b̄ = b∗ then δ = α∗ and γ = β ∗ and we find transformations by
SU (1, 1).
If we represent p̂, q̂ on the Hilbert space L2 (R) with
This choice of quantization is also preferred when we consider the oscillator Hamiltonian:
The quantum Hamiltonian is H = ω(a† a + 21 ). The states ān |0i are eigenstates of H
with eigenvalue ω(n + 21 ). Assuming |0i has unit norm the normalized eigenstates √1n! ān |0i
form a complete ON basis for the Hilbert space.
In quantum statistical mechanics a very important quantity is the (physics) partition
function defined to be
1
−βH e− 2 βω 1
TrHsingle e = −βω
= (7.87)
h.o.
1−e 2 sinh 12 βω
Here β has the physical interpretation of 1/(kT ) where k is Boltzmann’s constant and T
is the temperature above absolute zero. Using the (physics) partition function one derives
thermodynamic quantities when the oscillator is connected to a heat bath.
– 92 –
Now, suppose we have a system which is described by an infinite collection of harmonic
oscillators:
Suppose they have frequencies which are all a multiple of a basic harmonic which we’ll de-
note ω, so the frequencies are associated with the oscillators a1 , a2 , a3 , . . . are ω, 2ω, 3ω, ....
The motivation for choosing all frequencies to be multiples of a basic frequency comes from
the theory of strings, as we will explain below.
If we write the standard sum of harmonic oscillator Hamiltonians we get, formally,
∞
1
jω(a†j aj + )
X
formal
H = (7.89)
2
j=1
This is formal, because on the usual lowest weight module of the system defined by saying
the vacuum line satisfies:
aj |vaci = 0 ∀j (7.90)
the groundstate energy is infinite. This is typical of the divergences of quantum field theory
72 An infinite number of degrees of freedom typically leads to divergences in physical quan-
tities. However, there is a very natural way to regularize and renormalize this divergence
by using the Riemann zeta function:
∞ ∞
X j ωX 1 ω ω
ω= −1
→ ζ(−1) = − (7.91)
2 2 j 2 24
j=1 j=1
73 This can be justified much more rigorously and indeed it gives the correct Casimir ♣We now arranged
the string
energy for a massless scalar field on an interval. Multiplying by two we get a similar result explanation below
so this remark is
for a scalar field on a circle. If we restore proper units so that the radius of the circle so out of place. ♣
– 93 –
In any case, things work out very nicely if we take the Hamiltonian to be:
∞
ω
jωa†j aj −
X
H= (7.93)
24
j=1
The dimension of the space of states of energy nω above the groundstate is p(n). A natural
basis of this space is labeled by partitions of n:
and hence the vectors in this basis are in 1-1 correspondence with the conjugacy classes of
Sn . This turns out to be significant in the boson-fermion correspondence in 1+1 dimen-
sional quantum field theory.
The quantum statistical mechanical partition function of this collection of oscillators
has a truly remarkable property, as we now explain:
Let q be a complex number with |q| < 1. Notice that:
1
Q∞ j)
= (1 + q + q 2 + · · · )(1 + q 2 + q 4 + · · · )(1 + q 3 + q 6 + · · · ) · · ·
j=1 (1 − q
∞
X (7.95)
n
=1+ p(n)q
n=1
Indeed, note that this is almost exactly the same as the physical partition function of our
system of oscillators!
Therefore, taking into account the full system of harmonic oscillators we have:
1
Z osc (β) = TrHall the h.o.s
e−βH = Q∞ , (7.96)
q 1/24 n=1 (1 − qn)
where we trace over the Hilbert space of states of our collection of oscillators. Here we
identify q = e−βω . Expanding out (7.95) gives the first few values of p(n):
and one can easily generate the first few 100 values using Maple or Mathematica or Sage
or ...
It turns out the generating series has a remarkable “modular transformation property”
relating Z(β) to Z(1/β):
β −1/4 Z osc (β) = β̃ −1/4 Z osc (β̃) (7.98)
2
2π
β β̃ = (7.99)
ω
This is a kind of high-low temperature duality.
The property (7.98) and (7.99) is proven in textbooks on analytic number theory. In
this context one often uses the variable
βω
τ =i (7.100)
2π
– 94 –
so that q = e−βω = e2πiτ . Although the physical motivation starts with β > 0, one
can analytically continue so that τ is in the upper half complex plane. (See below for
the physical interpretation of this continuation.) Analytic number theorists define the
Dedekind eta function :
∞
2πiτ Y
η(τ ) = exp (1 − q n ) (7.101)
24
n=1
It turns out that (7.98) and (7.99), or equivalently, (7.102) has a beautiful interpreta-
tion from physics. The string is a circle with coordinate σ ∼ σ + 2π and the displacement
of the string X(σ, t) is a real number depending on position and time. The action is:
Z Z 2π
1
dσ (∂t X)2 − ∂σ X)2
S= dt (7.103)
4π`2s 0
where `s has units of length, or inverse mass, and we have temporarily set ~ = c = 1 by
choice of units. So T = `−2
s is the tension of the string. Then the general solution of the
classical equation of motion (the wave equation) (∂t2 − ∂σ2 )X = 0 is:
2 `s X αn in(t+σ) α̃n in(t−σ)
X(t, σ) = X0 + `s pt + i √ e + e (7.104)
2 n n
n6=0
with complex numbers αn = (α−n )∗ and α̃n = (α̃−n )∗ and X0 , p are real.
We can think of this as a 1 + 1 dimensional field theory. Then spacetime is a cylinder
1
S × R with Lorentzian metric. The αn are the amplitudes of waves moving at the speed
of light to the left, while the α̃n are the amplitudes of waves moving at the speed of light
to the right.
The Hamiltonian computed from the action is
Z 2π
1
dσ (∂t X)2 + ∂σ X)2
H= (7.105)
4π`2s 0
and evaluating this on the general solution of the equation of motion gives:
1 1 X
H = `s p2 + `−1 (α−n αn + α̃−n α̃n ) (7.106)
2 2 s
n6=0
– 95 –
If we represent the Heisenberg algebras with vacua so that αn |0i = α̃n |0i = 0 for n > 0
√ √
then we get standard Harmonic oscillators by defining an = αn / n and a†n = α−n / n for
n > 0, and similarly for the right-moving modes α̃n . Therefore, the Hamiltonian becomes:
"∞ #
1 2 X 2
H = `s p̂ + `−1
s n a†n an + ã†n ãn − (7.108)
2 24
n=1
where ψn is a basis of the space of states. But now hψ1 |e−βH |ψ2 i is an analytic contin-
uation of the transition amplitude hψ1 |e−itH |ψ2 i to imaginary time. On the other hand
hψ1 |e−itH |ψ2 i can be written as a path integral with initial and final conditions ψ2 , ψ1 ,
respectively. If we set ψ1 = ψ2 and sum over a complete basis then the domain of the path
integral becomes that of all field configurations on the circle of Euclidean time. For more
on this, see the classic book by Feynman and Hibbs.
If we apply the above principle to the case of our 1 + 1 dimensional QFT, the quantum
field X is already a map from the circle to the real line so altogether we have a path integral
on a torus S 1 × S 1 with metric
" 2 #
β β
ds2 = (dσ)2 + ( )2 (dσ 2 )2 = (2π)2 (dσ 1 )2 + (dσ 2 )2 (7.112)
`s 2π`s
– 96 –
R 2π
where P is the total momentum of the field ∼ 0 ∂t X∂σ X. Note that H = HL + HR .
Explicitly, one finds: 74
"∞ #
1 2 −1
X
† 1
HL = `s p̂ + `s nan an −
4 24
n=1
"∞ # (7.114)
1 2 −1
X
† 1
HR = `s p̂ + `s nãn ãn −
4 24
n=1
We set
βω
q = e2πiτ = e−βω ⇔ τ =i . (7.115)
2π
Then the proper analytic continuation to consider is
One of the virtues of (7.116) is that it still has a nice interpretation in terms of a path
integral on a torus: After we propagate in Euclidean time by Re(βω) we shift in the σ
coordinate by Im(βω) before gluing, because P is the generator of translations in the σ
direction. The net result is that we can identify Z(β) with the pathintegral on a torus with
metric:
ds2 = (2π)2 |dσ 1 + τ dσ 2 |2 = (2π)2 |dz|2 (7.118)
We can identify the torus with our friend: C/(Z + τ Z) with a flat metric ds2 = |dz|2 where
τ is a complex number in the upper half-plane. One can easily (and rigorously) compute
that path integral and show that it is
where
∞
Y
1/24
η(τ ) = q (1 − q n ), (7.120)
n=1
– 97 –
z = x + iy then the pull-back is ds2 = |τ |2 |dx0 + τ 0 dy 0 |2 with x0 = y and y 0 = −x0 and
τ 0 = −1/τ .
What about the overall factor of |τ |2 in front of the metric? But the massless scalar
field above has a beautiful property known as conformal invariance. Let us return to the
action and restore the 1 + 1 dimensional Minkowskian metric:
Z
1 p
S=− 2
d2 σ |detη|η αβ ∂α X∂β X (7.121)
4π`s S 1 ×R
we choose ηtt = −1. The generalization to an arbitrary metric hαβ dσ α dσ β on the “world-
sheet” S 1 × R is clear:
Z
1 p
S=− 2
d2 σ |deth|hαβ ∂α X∂β X (7.122)
4π`s S 1 ×R
This is the standard minimal coupling in general relativity. But note that the action is
invariant under conformal transformations
One has to be very careful about the quantum theory: In this case the partition function
is not quite invariant but rather scales by an overall functional of Ω. But for a flat metric
and constant Ω, the overall scaling factor is just one.
We finally conclude that (Imτ )−1/2 |η(τ )|−2 is invariant under τ → −1/τ and since
η(τ ) is holomorphic one can deduce the very important result (7.102) above.
As an interesting application of (7.102), when combined with the method of stationary
phase, one can derive the Hardy-Ramanujan formula giving an asymptotic formula for large
values of n:
r
1 1 3/4 −1 n
p(n) ∼ √ n exp 2π (7.124)
2 24 6
Note that this grows much more slowly than the order of the group, n!. So we conclude
that some conjugacy classes must be very large! (See discussion in the next section on the
class equation if this is not obvious.)
Analogs of equation (7.124) for a class of functions known as modular forms plays
an important role in modern discussions of the entropy of supersymmetric (and extreme)
black hole solutions of supergravity.
– 98 –
and use the above transformation formula, together with the stationary phase method to
derive (7.124). 75
n = λ1 + λ2 + · · · + λk (7.126)
as a sum of positive integers with λ1 ≥ λ2 ≥ · · · ≥ λk . The nonzero λi are called the parts
of the partition. The above is a partition of n with k parts.
(4)
– 99 –
One way to associate a conjugacy class in Sn with a partition is as follows. We let
denote the multiplicity of i in λ. In terms of the Young diagram it is the number of rows
with i boxes. If there are mi rows with i boxes then that accounts for i × mi boxes.
Summing over i gives n, the total number of boxes in the diagram. So we can associate
the conjugacy class:
(1)m1 (2)m2 · · · (7.128)
to a partition/Young diagram.
It is worth noting that there is another associated partition and conjugacy class known
as the conjugate partition λ. It corresponds to the partition obtained by flipping on the
main diagonal: We exchange rows and columns. Let λ0i be the number of boxes in the ith
column. By the inverted L-shape we see that this is another partition
λ01 +λ02 +· · ·+λ0n = (λ01 −λ02 )+2(λ02 −λ03 )+3(λ03 −λ04 )+· · ·+(n−1)(λ0n−1 −λ0n )+nλ0n (7.133)
– 100 –
statistical theory of partitions. 76 Without going into a lot of complicated asymptotic
formulae, the main upshot is that, for large n, as a function of k, pk (n) indeed is sharply
peaked with a maximum around
√
6√
k̄(n) := nlogn (7.135)
2π
See Figure 21 for a numerical illustration. Moreover,
√ √ and again√speaking very roughly, the
∼
number of terms in the partition λj with λj = 2π6 n is order 6n/π.
Remarks:
2. A Young tableau is a Young diagram with n boxes where the boxes have been filled
in with integers drawn from {1, . . . , n} so that no integer is repeated. Note that the
symmetric group Sn acts on Young tableau. For a given tableau T we can define
two subgroups of the symmetric group: R(T ) are the permutations that only move
numbers around within the rows and C(T ) are the permutations that only move
numbers around within the columns. Young tableau and these subgroups are used
in constructing the irreducible representations of the symmetric group.
This is a partition of n − k as a sum of integers drawn from {1, . . . , k}. Enumerating those is clearly given
by kj=1 (1 − xj )−1 .
Q
– 101 –
1.5 1017
1.0 1017
5.0 1016
20 40 60 80 100 120
Figure 21: Showing the distribution of pk (n) √as a function of k for n = 400 and 1 ≤ k ≤ 120. Note
that the Erdös-Lehner mean value of k is k̄ = 2π6 20log(20) ∼
= 46.7153 is a very good approximation
to where the distribution has its sharp peak. The actual maximum is at k = 45.
Let X be any set (possibly infinite). Recall the definition we gave in Section 4.1.
A permutation of X is a 1-1 and onto mapping X → X. The set SX of all permutations
– 102 –
forms a group under composition. A transformation group on X is a subgroup of SX .
Equivalently, a G-action on a set X is a map φ : G × X → X compatible with the
group multiplication law as follows:
A left-action satisfies:
φ(g1 , φ(g2 , x)) = φ(g1 g2 , x) (8.1)
A right-action satisfies
φ(g1 , φ(g2 , x)) = φ(g2 g1 , x) (8.2)
In addition in both cases we require that
φ(1G , x) = x (8.3)
for all x ∈ X.
Remarks:
1. If φ is a left-action then it is natural to write g · x for φ(g, x). In that case we have
3. A given set X can admit more than one action by the same group G. If one is working
simultaneously with several different G actions on the same set then the notation g · x
is ambiguous and one should write, for example, φg (x) = φ(g, x) or speak of φg , etc.
A good example of a set X with several natural G actions is the case of X = G
itself. Then there are the actions of left-multiplication, right-multiplication, and
conjugation. The action of g on the group element g 0 is:
L(g, g 0 ) = gg 0
L̃(g, g 0 ) = g −1 g 0
R(g, g 0 ) = g 0 g
(8.6)
R̃(g, g 0 ) = g 0 g −1
C(g, g 0 ) = g −1 g 0 g
C̃(g, g 0 ) = gg 0 g −1
where on the RHS of these equations we use group multiplication. The reader should
work out which actions are left actions and which actions are right actions.
– 103 –
There is some important terminology one should master when working with G-actions.
First here are some terms used when describing a G-action on a set X:
Definitions:
1. A group action is effective or faithful if for any g 6= 1 there is some x such that
g · x 6= x. Equivalently, the only g ∈ G such that φg is the identity transformation
is g = 1G . A group action is ineffective if there is some g ∈ G with g 6= 1 so that
g · x = x for all x ∈ X. The set of g ∈ G that act ineffectively is a normal subgroup
of G.
In summary:
1. Effective: ∀g 6= 1, ∃x s.t. g · x 6= x.
2. Ineffective: ∃g 6= 1,s.t. ∀x g · x = x.
4. Free: ∀g 6= 1, ∀x, g · x 6= x
is called the isotropy group at x. It is also called the stabilizer group of x. It is often
denoted Gx . The reader should show that Gx ⊂ G is in fact a subgroup. Note that a
group action is free iff for every x ∈ X the stabilizer group Gx is the trivial subgroup
{1G }.
The fixed point set of g is often denoted by X g . Note that if the group action is free
then for every g 6= 1 the set FixX (g) is the empty set.
– 104 –
4. We repeat the definition from section **** above. The orbit of G through a point x
is the set of points y ∈ X which can be reached by the action of G:
Remarks:
3. If x, y are in the same orbit then the isotropy groups Gx and Gy are conjugate
subgroups in G. Therefore, to a given orbit, we can assign a definite conjugacy class
of subgroups.
Definition If G acts on X a stratum is a set of G-orbits such that the conjugacy class of
the stabilizer groups is the same. The set of strata is sometimes denoted X k G.
Exercise
Recall that a group action of G on X can be viewed as a homomorphism φ : G → SX .
Show that the action is effective iff the homomorphism is injective.
Exercise
Suppose X is a G-set.
a.) Show that the subset H of elements which act ineffectively, i.e. the set of h ∈ G
such that φ(h, x) = x for all x ∈ X is a normal subgroup of G.
78
Answer : L, R̃, C̃ are left-actions, while L̃, R, C are right-actions.
– 105 –
b.) Show that the group G/H acts effectively on X.
Exercise
Let G act on a set X.
a.) Show that the stabilizer group at x, denoted Gx above, is in fact, a subgroup of G.
b.) Show that the G action is free iff the stabilizer group at every x ∈ X is the trivial
subgroup {1G }.
c.) Suppose that y = g · x. Show that Gy and Gx are conjugate subgroups in G. 79
Exercise Derangements
A permutation in Sn which acts on {1, . . . , n} without fixed points is called a derange-
ment. Show that the number of derangements in Sn is given by
n
X (−1)k
Dn = n! (8.10)
k!
k=0
for a 1 − 1 map ψ.
ψ −1 (gGx ) ≡ g · x (8.13)
– 106 –
Corollary: If G acts transitively on a set X then the isotropy groups Gx for all the
points x ∈ X are conjugate subgroups of G, and for any x ∈ X, there is a 1 − 1
correspondence between X and the set of cosets G/Gx . If H is any one of these
isotropy groups we can therefore identify X with the set of left-cosets G/H.
Remark: Sets of the type G/H are called homogeneous spaces. This theorem is the
beginning of an important connection between the algebraic notions of subgroups and cosets
to the geometric notions of orbits and fixed points. Below we will show that if G, H are
topological groups then, in some cases, G/H are beautifully symmetric topological spaces,
and if G, H are Lie groups then, in some cases, G/H are beautifully symmetric manifolds.
80
Answer : Write X X
|X g | = |{(x, g)|g · x = x}| = |Gx | (8.15)
g∈G x∈X
Now use the stabilizer-orbit theorem to write |Gx | = |G|/|OG (x)|. Now in the sum
X 1
(8.16)
x∈X
|OG (x)|
– 107 –
Any two points are
SO(3)-related
this
orbit
is a
point
Figure 24: Notice not all orbits have the same dimensionality. There are two qualitatively different
kinds of orbits of SO(2, R).
Examples
1. Let G be any group and consider the group action defined by φ(g, x) = x for all
g ∈ G. This is as ineffective as a group action can be: For every x, the istropy group
is all of G, and for all g ∈ G, Fix(g) = X. In particular, this situation will arise if
X consists of a single point. This example is not quite as stupid as might at first
appear, once one takes the categorical viewpoint, for pt// G is a very rich category
indeed.
– 108 –
2. Let X = {1, · · · n}, so SX = Sn as before. The action is effective and transitive, but
not free. Indeed, the fixed point of any j ∈ X is just the permutations that permute
j ∼
everything else, and hence SX = Sn−1 . Note that different j have different stabilizer
subgroups isomorphic to Sn−1 , but they are all conjugate.
The stabilizer group for all other nonzero vectors will be conjugate to this one. The
stabilizer group of the origin is the entire group GL(n, R).
4. If we restrict from GL(n, R) to SO(n, R) the picture changes completely. For sim-
plicity consider the case n = 2. The left- action is:
! ! !
x1 cos φ sin φ x1
R(φ) : → (8.18)
x2 − sin φ cos φ x2
The group action is effective. It is not free, and it is not transitive. There are now
infinitely many orbits of SO(2), and they are all distinguished by the invariant value
of x2 +y 2 on the orbit. From the viewpoint of topology, there are two distinct “kinds”
of orbits acting on R2 . One has trivial isotropy group and one has isotropy group
SO(2). See Figure 24. These give two strata.
5. Orbits of O(2). We have seen that O(2) can be written as a disjoint union:
where P is not canonical and can be taken to be reflection in any line through the
origin. The orbits of SO(2) and O(2) are the same. We will find a very different
picture when we consider the orbits of the Lorentz group.
6. Now consider a fixed SO(2, R) subgroup of SO(3, R), say, the subgroup defined by
rotations around the z-axis, and consider the action of this group on a sphere in R3
of fixed radius. The action is not not transitive. The G-orbits are shown in Figure
23. It is also not free: The north and south poles are fixed points.
– 109 –
7. Now consider the action of SO(3, R) on a sphere of positive fixed radius in R3 .
(WLOG take it to be of radius one.) The action is then transitive on the sphere.
Now the isotropy subgroup StabSO(3) (n̂) ⊂ SO(3) of any unit vector n̂ ∈ S 2 is
isomorphic to SO(2):
StabSO(3) (n̂) ∼
= SO(2) (8.20)
But, for different choices of n̂ we get different subgroups of SO(3). For example, with
usual conventions, if n̂ = e3 is on the x3 -axis then the subgroup is the subgroup of
matrices of the form
cos φ sin φ 0
R12 (φ) = − sin φ cos φ 0 (8.21)
0 0 1
but if x is on the x1 -axis the subgroup is the subgroup of matrices of the form
1 0 0
R23 (φ) = 0 cos φ sin φ (8.22)
0 − sin φ cos φ
and so on. For any n̂ ∈ S 2 let SO(2)n̂ ⊂ SO(3) denote the subgroup, isomorphic
to SO(2), which stabilizes n̂. According to the stabilizer-orbit theorem there is a
natural one-one correspondence
S2 ∼
= SO(3)/SO(2)n̂ (8.23)
is the set of rotations which can be (noncanonically!) put in 1-1 correspondence with
elements of SO(2). That is because if k̂ = R1 n̂ and k̂ = R2 n̂ then R1−1 R2 n̂ = n̂ and
therefore R2 = R1 R0 where R0 ∈ StabSO(3) (n̂) ∼= SO(2).
So, for each point k̂ ∈ S 2 we can associate a copy of SO(2) inside SO(3), which is
topologically a circle, and clearly every element of SO(3) will be captured this way
as k̂ ranges over S 2 . One might think that this means that, as manifolds, SO(3) is
diffeomorphic to S 2 × SO(2) = S 2 × S 1 , but this turns out to be quite false. For
example the homotopy groups of SO(3) and S 2 × S 1 are completely different.
– 110 –
Nevertheless, we can try to parametrize the general rotation by using this idea: We
choose n̂ = e3 to be the basepoint. Then the standard polar angles of a point on the
sphere are defined by
0 sin θ sin φ
R12 (φ)R23 (θ) 0 = sin θ cos φ (8.27)
1 cos θ
But this does NOT mean every rotation matrix is of the form R12 (φ)R23 (θ) ! It only
gives us a parametrization of the cosets SO(3)/SO(2)12 . The general element can be
written as
R = R12 (φ)R23 (θ)R12 (ψ) (8.28)
with range φ ∼ φ + 2π, ψ ∼ ψ + 2π and 0 ≤ θ ≤ π. These are the famous Euler
angles But they are not global coordinates. They can’t be: (θ, φ) are not global
coordinates on S 2 (they go bad at the north and south poles) and S 2 × S 1 is not the
same manifold as SO(3).
8. GL(2, C) And SU (2) Act On CP1 . Recall that CP1 can be identified with equivalence
classes of points (z1 , z2 ) ∈ C2 − {0} with equivalence relation (z10 , z20 ) ∼ (λz1 , λz2 ).
We denote equivalence classes by [z1 : z2 ].
Note that CP1 can also be thought of as the space of states of a single Qbit: [z1 : z2 ]
always has a representative with |z1 |2 + |z2 |2 = 1 and the representative is unique up
to multiplication by a phase. We can use such a normalized representative to define
a Qbit state: !
z1
ψ= (8.29)
z2
♣THIS REMARK
BELONGS
There is a well-defined action of GL(2; C) on CP1 : EARLIER, WHEN
WE FIRST
INTRODUCED
CPn . ♣
!
a b
: [z1 : z2 ] 7→ [az1 + bz2 : cz1 + dz2 ] (8.30)
c d
(The reader should carefully check that this is a well-defined group action. Since the
GL(2, C) action on C2 − {0} is transitive, the action on CP1 is transitive. Therefore
choosing a point p ∈ CP1 we have an identification of CP1 as a homogeneous space:
GL(2, C)/B ∼
= CP1 (8.31)
Note that by the equivalence [z1 : z2 ] = [λz1 : λz2 ] we can always find a representative
so that (z1 , z2 ) is a unit vector in the Hilbert space C2 . Therefore, the restriction of
– 111 –
the GL(2, C) action on CP1 to SU (2) is still transitive. Now the stabilizer of [1 : 0]
is the subgroup of diagonal SU (2) matrices and is isomorphic to U (1). Therefore,
there is also an identification
CP1 ∼
= SU (2)/U (1) (8.33)
map:
π : S3 → S2 (8.35)
whose fibers are copies of S 1 . This is a famous map in mathematics and physics
known as the Hopf map and has many beautiful properties. It appears in the physics
of magnetic monopoles and in several other related contexts. It is very closely related
to the map π : SO(3) → S 2 defined above.
Another way of thinking about CP1 is that it is the space of lines through the origin
in C2 . This leads us to our next example:
The condition T (W0 ) = W0 is then the condition that C = 0 for the matrix of T
relative to such a basis. So the stabilizer group is isomorphic to the subgroup P of
GL(n, κ) of matrices of the form:
!
AB
∈ GL(n, κ) (8.38)
0 D
– 112 –
In fact, the Grassmannian is a manifold and the representation in terms of homoge-
neous coordinates helps us to find local coordinates. Suppose that g ∈ GL(n, κ) is
“not too far” from the identity matrix then we will try to find a representative g0
in the coset gP so that if g0 6= g00 then they must be in different cosets. The idea is
that we view right-multiplication as a “gauge freedom” and try to “fix the gauge” by
choosing g0 to be of the form:
!
1k×k 0
g0 = (8.39)
γ(n−k)×k 1(n−k)×(n−k)
for some matrix γ(n−k)×k ∈ M at(n−k)×k (κ). Note that if we right multiply by an
element of P and ask that
! ! !
1 0 AB 1 0
= (8.40)
γ 1 0 D γ0 1
then α and δ are invertible and α − βδ −1 γ is also invertible. These are the conditions
so that we can solve the equation
! !
AB 1 0
g = (8.42)
0 D γ̃ 1
for some γ̃ ∈ M at(n−k)×k (κ). Indeed, one finds that we must set: A = α−1 , D =
δ −1 (1 − γB) and and B = −(α − βδ −1 γ)−1 βδ −1 .
– 113 –
Exercise C∗ Actions On CPn−1
Consider the action of G = C∗ on CPn−1 defined by (4.8).
a.) For which values of (q1 , . . . , qn ) is the action effective?
b.) For which values of (q1 , . . . , qn ) is the action transitive?
c.) What are the fixed points of the C∗ action?
d.) What are the stabilizers at the fixed points of the C∗ action?
Exercise
Using GL(2, C)/B ∼
= CP1 show that GL(2, C) has a natural action on the Riemann
sphere given by
az + b
z 7→ (8.45)
cz + d
Exercise
Since there is a left-action of G × G on X = G there is a left-action of the diagonal
subgroup ∆ ⊂ G × G where ∆ = {(g, g)|g ∈ G} is a subgroup isomorphic to G.
a.) Show that this action is given by a 7→ I(a), where I(a) is the conjugation by a.
b.) Show that the orbits of ∆ are the conjugacy classes of G.
c.) What is the stabilizer subgroup of an element g0 ∈ G?
82
Hint: Show that Im(g · τ ) = |cτImτ
+d|2
.
83
The isotropy group is the subgroup SO(2, R) ⊂ SL(2, R). To see this set ai+b
ci+d
= i and conclude that
a = d and b = −c. Then since ad − bc = 1 we have a2 + b2 = 1 but this implies that the group element is
in SO(2, R).
– 114 –
Exercise Spheres As Homogeneous Spaces
a.) Show that there is a transitive action of SO(n + 1) on S n , considered as a sphere
of fixed radius in Rn+1 .
b.) Show that S n ∼
= SO(n + 1)/SO(n).
c.) Give an inductive proof that SO(n) is a connected manifold for n ≥ 2.
Figure 25: The distinct kinds of orbits of SO(1, 1, R) are shown in different colors. If we enlarge
the group to include transformations that reverse the orientation of time and/or space then orbits
of the larger group will be made out of these orbits by reflection in the space or time axis.
i.e. the quadratic form is (x, x) = −(x0 )2 + (x1 )2 . The two-dimensional Lorentz group is
defined by
O(1, 1) = {A|Atr ηA = η} (8.47)
– 115 –
x1 → sinh θ x0 + cosh θ x1 (8.49)
that is: !
cosh θ sinh θ
SO0 (1, 1; R) ≡ {B(θ) = | − ∞ < θ < ∞} (8.50)
sinh θ cosh θ
In the notation the S indicates we look at the determinant one subgroup and the subscript
0 means we look at the connected component of 1. This is a group since
so SO0 (1, 1) ∼
= R as groups. Indeed, note that
" !#
01
B(θ) = exp θ (8.52)
10
x± := x0 ± x1 (8.53)
x± → e±θ x± (8.54)
Then (λ, sign(x+ ), sign(x− )) is a complete invariant of the orbits. That is, given this triple
of data there is a unique orbit with these properties.
It is now easy to see what the different types of orbits there are. They are shown in
Figure 25: They are: ♣Actually, the
lightrays and
hyperbolas have
trivial stabilizer and
1. hyperbolas in the forward/backward lightcone and the left/right of the lightcone hence are in the
same strata. This is
a problem with
2. 4 disjoint lightrays. using strata. ♣
3. the origin: x+ = x− = 0.
84
√
Some authors will define these with a 1/2 or 1/ 2. One should exercise care with this choice of
convention.
– 116 –
It is now interesting to consider the orbits of the full Lorentz group O(1, 1) and its
relation to the massless wave equations. But there are clearly elements of O(1, 1) not
continuously connected to the identity such as:
! !
1 0 −1 0
P = T = (8.56)
0 −1 0 1
O(1, 1) = SO0 (1, 1) q P · SO0 (1, 1) q T · SO0 (1, 1) q P T · SO0 (1, 1) (8.57)
with
The P and T operations map various orbits of SO0 (1, 1) into each other: P is a
reflection in the time axis, i.e., a reflection of the spatial coordinate, while T is a reflection
in the space axis, i.e. a reflection of the time coordinate. Thus the orbits of the groups
SO(1, 1), SO0 (1, 1) q P T · SO0 (1, 1), and O(1, 1) all differ slightly from each other. ♣Should give more
details here, or form
As an example of a physical manifestation of orbits let us consider the energy-momentum an exercise. ♣
1. Massive particles: m2 > 0 have (E, p) along an orbit in the upper quadrant:
2. Massless particles move at the speed of light. In 1+1 dimensions there is an interesting
refinement of the massless orbits: Left-moving particles with positive energy have
support on 85 p+ = 21 (E + p) = 0 and p− = 12 (E − p) 6= 0. Right-moving particles
with positive energy have support on p− = 0 and p+ 6= 0. In d + 1 dimensions
with d > 1 the orbits of SO0 (1, d) consisting of the forward and backward lightcones
(minus the origin) are connected.
3. Tachyons have E 2 −p2 = m2 < 0 and have their support on the left or right quadrant.
If i(k0 x0 +k1 x1 ) then k 2 =
p we try to expand a solution to the wave-equation with e 0
k12 + m2 and so if the spatial momentum k1 is sufficiently small then k0 is pure
imaginary and the wave grows exponentially, signaling and instability. This tells us
our theory is out of control and some important new physical input is needed.
– 117 –
b.) Show that the group of components of the Lorentz group is O(1, 1)/SO0 (1, 1) ∼
=
Z2 × Z2
Figure 26: Illustrating orbits of the connected component of the identity in O(1, 3). In (a) the top
and bottom hyperboloids are separate orbits, and if we include time-reversing transformations the
orbits are unions of the two hyperboloids. In (b) there are three orbits shown with x0 > 0 x0 < 0
(the future and past, or forward and backward light cones), and the orbit consisting of the single
point. In (c), once x2 has been specified, there is just one orbit, for d > 2.
is in O(1, 1) iff
a2 − c2 = 1
d 2 − b2 = 1 (8.60)
ab = cd
a = κ1 cosh θ
c = sinh θ
(8.61)
d = κ2 cosh θ0
b = sinh θ0
where κi ∈ {±1} and θ, θ0 ∈ R. Now impose the third equation. The solutions split into two cases: If
κ1 /κ2 = 1 then θ = θ0 . This gives two components. If κ1 /κ2 = −1 then θ = −θ0 , giving the other two
components.
– 118 –
with quadratic form
η = Diag{−1, +1d−1 } (8.62)
and
O(1, d − 1) = {A|Atr ηA = η} (8.63)
The nature of the orbits is slightly different because of the zero-dimensional sphere S 0
is disconnected but the higher dimensional spheres are connected.
3.
O± = {x|x2 = 0 & sign(x0 ) = ±1} (8.68)
Vectors in this orbit are of the form (x0 , |x0 |n̂) where n̂ ∈ S d−2 ⊂ Rd−1 and the sign
of x0 is invariant under the action of the identity component of O(1, 3). (Show this!).
Note that, for d = 2 the sphere S 0 has two disconnected components, leading to
left- and right-movers. But for d > 2 there is only one component. We can think of
n̂ ∈ S d−2 as parametrizing the directions of light-rays. That is, the point where the
light ray hits the celestial sphere. In one spatial dimension, a light ray either moves
left or right, and this is a Lorentz-invariant concept. In d − 1 > 1 spatial dimensions,
we can rotate any direction of light ray into any other. See Figure 26(b). One can
show that these orbits too are homogeneous spaces: 87
O± ∼
= SO0 (1, d − 1)/I (8.69)
87
The isotropy group of a light ray is I ∼ = ISO(d − 2), where ISO(d − 2) is the Euclidean group
on Rd−2 . The easiest way to show this is to use the Lie algebra of so(1, d − 1) and work with light-cone
coordinates. Choosing a direction of the light ray along the xd−1 axis and introducing light-cone coordinates
x± := x0 ± xd−1 , and transverse coordinates xi , i = 1, . . . , d − 2 if the lightray satisfies x− = 0 then we
have unbroken generators M +i and M ij .
– 119 –
4. The final orbit is of course {x = 0}.
Remarks
1. Discrete symmetries in nature. The higher dimensional Lorentz groups also have four
connected components. The Lorentz group in d+1 spacetime dimensions is O(d−1, 1).
In QFT it is clear that if a theory is invariant under infinitesimal Lorentz symmetries,
then it is invariant under the connected component of the identity SO0 (d − 1, 1).
However, it turns out that such theories can nevertheless not be invariant under the
disconnected components. For a long time it was assumed that parity is a symmetry
of all the laws of physics. This means that, if you watch a video of a physical process,
then you cannot tell whether you are looking at that process in a mirror, or not.
However, in 1956 T.D. Lee and C.N. Yang carefully reviewed the evidence for parity
conservation in nature and pointed out that there was no careful experimental test for
the weak interactions (nuclear beta decay, etc.). They proposed some experimental
tests and C.S. Wu and collaborators discovered experimentally in 1957 that parity is
indeed violated.
So, what about time reversal invariance? If you run a movie backwards, is the
resulting process physically possible (however unlikely)? There is a famous theorem
in QFT stating that the product of parity and time reversal, the P T -component of
O(d − 1, 1) will be a symmetry if the connected component is a symmetry. This is
usually called the CPT theorem. In 1964 V. Fitch and J. Cronin discovered that
certain very rare processes in nature actually do violate time-reversal invariance.
That is a good thing, because, as noted by Sakharov, if there were no time-reversal
invariance in the laws of physics it would be impossible to understand why there is
matter/anti-matter asymmetry in the context of the big bang theory.
where !
1p×p 0
dp,q := (8.71)
0 −1q×q
forms a group, generalizing the Lorentz and rotation-reflection groups and, when
both p > 0 and q > 0 it has four connected components. 88
**************************
SHOW CONTRACTION TO O(p) × O(q) USING THE MATRICES
√ √
† π sinh( π † π)
!
0 π cosh( ππ √
exp[ † ] = sinh(√ππ† ) π† π
√
π 0 π † √ cosh( π † π
ππ †
88
The proof proceeds by showing that the group contracts to O(p) × O(q) and then recall that O(p) has
two connected components.
– 120 –
SINCE
Atr A = 1 + C tr C
(8.72)
Dtr D = 1 + B tr B
𝑥𝑥
Figure 27: The torsors Z + x, plotted in the y direction are the points above a point x on the
Z-axis. Note that the union of the red lines has a symmetry under the action (x, y) → (x + n, y)
for n ∈ Z. If we quotient by this action then the projection to the x-axis becomes the projection
R → S 1 given by the exponential map. This gives us a principal bundle for the group Z.
One can set up a 1-1 correspondence between a torsor and elements of G, but in general
there is no natural correspondence: A torsor has no distinguished element we can call the
identity.
Example 2: Imagine that the surface of the earth is flat and of infinite extent. Is this
a copy of R2 ? Yes and no. We can identify it with R2 , but not in any natural way: R2
– 121 –
is a vector space with a distinguished vector ~0. Where should we put the origin? Rome?
Beijing? Moscow? London? New York? Piscataway? Wuhan? If the UN tried to assign
an origin there would be endless disputes. However, there would never be any dispute
about the vector in R2 needed to translate from New York to London. So, the difference
of London minus New York is well-defined, but the sum is not. If London and New York
represented vectors in a vector space then one could both add and subtract these vectors.
The infinite flat earth is an example of two-dimensional affine Euclidean space E2 .
More formally: An affine space Ed modeled on Rd is a space of points with an action of
Rd that translates the points so that nonzero vectors always move points and one can get
from one point to any other by the action of a vector. But there is no natural choice of
origin. In equations:
2. If p + v = p then v = 0.
Example 3: Let V be a finite-dimensional vector space over a field κ. The set X = B(V ) of
all ordered bases for V is a GL(n, κ) torsor: g ·{v1 , . . . , vn } := {ṽ1 , . . . , ṽn } where ṽi = gji vj .
Any two such bases are related by some g, but, if we are just given an abstract vector space,
– 122 –
there is no natural basis. In the case κ = R the torsor B(V ) has two connected components.
A choice of connected component is known as an orientation of V .
P = P̃ /(x, y) ∼ (x + 1, y) (8.77)
π : G → G/H (8.78)
π −1 (gH) = gH (8.79)
The subset gH ⊂ G can be put into 1-1 correspondence with H, but not in any natural
way. While it is identified with H as a set, it is not identified as a group because there is
no natural element in gH to identify with the unit in H.
The above examples are special cases of an extremely important idea in mathematics
- that of a principal fiber bundle. In the language of this section, a principal G-bundle is a
continuous family of G-torsors. Here is the formal definition:
Technically, we need it to have a local triviality property: For all x ∈ X there is a neigh-
borhood Ux ⊂ X with a smooth map φx : π −1 (Ux ) → Ux × G such that
φx
π −1 (Ux ) / Ux × G (8.81)
π
π
%
Ux
commutes. Moreover, we require that the right G-action on π −1 (Ux ) is equivalent to the
natural right G-action on Ux × G, where g0 acts by (y, g) 7→ (y, gg0 ), i.e. if p ∈ π −1 (Ux )
and φx (p) = (y, g) then
φx (p · g0 ) = (y, g · g0 ) (8.82)
– 123 –
P is called the total space of the principal bundle and X is called the base space. The
preimage π −1 (x) is called the fiber above x.
A consequence of the local triviality property is that if we cover the base space X with
charts {Uα } with φα : π −1 (Uα ) → Uα × G then
φαβ := φα ◦ φ−1
β : (x, g) 7→ (x, gαβ (x)g) x ∈ Uαβ (8.83)
Examples:
1. Let X be any topological space and G any topological group. Then P = X × G with
π : (x, g) 7→ x is a principal bundle. A bundle of this form is known as the trivial
bundle.
5. G-bundles over the circle. Let G be a discrete group. Then R × G is the trivial
principal G- bundle over R. We can make a more interesting bundle by considering
the left Z-action defined by choosing an element g0 ∈ G and defining
so equivalence classes satisfy: [(x, g)] = [(x + n, g0n g)]. The quotient P = (R × G)/Z
by this action is a principal G-bundle over S 1 . Note that there is still a well-defined
free right G-action. Call this bundle π : Pg0 → S 1 . We could also describe it as
([0, 1] × G) where we glue (0, g) to (1, g0 g). We will return to it below.
6. A nice special case of the bundle Pg0 described above is obtained by considering the
map of the unit disk in the complex plane π : z 7→ z n . Restricting this map to the
boundary of the disk we get a map from S 1 → S 1 , but notice that the inverse image
of any point is a torsor for multiplication by elements of the group µn . So π describes
a principle Zn bundle over the circle.
A map of G-torsors, or, more properly, a morphism of G-torsors is a map the preserves
the mathematical structure of being a G-torsor. So, if X1 and X2 are two G-torsors then a
morphism of G-torsors is a map
ψ : X1 → X2 (8.86)
– 124 –
such that
ψ(y · g) = ψ(y) · g (8.87)
ψ
P1 / P2 (8.88)
π1 π2
~
X
commutes. That is π2 (ψ(p1 )) = π1 (p1 ) for all p1 ∈ P . Moreover, the map must be G
equivariant, or a morphism of torsors on the fibers, so that 89
This implies that ψh descends to a well-defined bundle map Pg0 → Phg0 h−1 . Clearly ψh−1
defines the inverse bundle map.
Therefore, the isomorphism classes of principal G bundles over the circle are labeled
by conjugacy classes of elements of G.
Finally, let π : P → X be a principal G bundle and let Y be any G-space with a left
G-action. Then we can define a G action on P × Y :
φg (p, y) 7→ (p · g −1 , g · y) (8.91)
Notice this is a left G-action. The quotient space, usually denoted P ×G Y has a well-defined
continuous map
π̃ : P ×G Y → X (8.92)
defined by π̃([p, y]) = π(p). The fibers of π̃ can be identified with the space Y . (See the
exercise below.) This is an example of a more general fiber bundle. The map π̃ : P ×G Y →
89
The reader familiar with the general theory of fiber bundles will note that (8.88) alone serves as the
definition of a bundle map for a fiber bundle. But a principal bundle has more structure, and for a morphism
of principal bundles we require the additional condition (8.89).
– 125 –
X defines what is called an associated bundle to the principal bundle π : P → X by the
G-set Y .
Sections Of Bundles
π ◦ s(x) = x ∀x ∈ X (8.94)
ψ:X→G→P (8.95)
be defined by ψ(x, g) := s(x)g. One checks this is a bundle morphism. Conversely, suppose
there is a bundle morphism ψ : P → G × X. Then for each x ∈ X there is a unique
s(x) ∈ P so that ψ(s(x)) = (x, 1G ). ♠
– 126 –
b.) Extend this to show that any morphism of principal G-bundles is an isomorphism
of principal G-bundles.
1. If G is a left-action on X then
2. If G is a left-action on X then
3. If G is a right-action on X then
4. If G is a right-action on X then
Example: Consider a spacetime S. With suitable analytic restrictions the space of scalar
fields on S is Map(S, κ), where κ = R or C for real or complex scalar fields. If a group
G acts on the spacetime, there is automatically an induced action on the space of scalar
fields. To be even specific, suppose X = M1,d−1 is d-dimensional Minkowski space time, G
is the Poincaré group, and Y = R. Given one scalar field Ψ and a Poincaré transformation
g −1 · x = Λx + v we have (g · Ψ)(x) = Ψ(Λx + v).
90
Answer : Let us define a map f : π̃ −1 (x) → Y . To do this we must choose an element p0 ∈ π −1 (x).
Then if [p, y] ∈ π̃ −1 (x) it follows that π(p) = x and hence p = p0 · g0 for some g0 . Then we define
f [p, y] = g0−1 · y. The reader needs to check that this map is well-defined. Since we can choose any y ∈ Y
the map is clearly surjective. Finally note that [p1 , y1 ] = [p1 , y2 ] implies that y1 = y2 since the G-action on
P is free. Therefore the map is also injective. Note that the map does depend on a choice of p0 for each x,
so there is no canonical identification of the fibers with Y .
– 127 –
Similarly, suppose that X is any set, but now Y is a G-set. Then again there is a
G-action on Map(X, Y ):
We can now combine these two observations and get the general statement: We assume
that both X is a G1 -set and Y is a G2 -set. We can assume, without loss of generality, that
we have left-actions on both X and Y . Then there is a natural G1 ×G2 -action on Map(X, Y )
defined by:
φ((g1 , g2 ), Ψ)(x) := g2 · (Ψ(g1−1 · x)) (8.102)
note that if one writes instead g2 · (Ψ(g1 · x)) on the RHS then we do not have a well-defined
G1 × G2 -action (if G1 and G2 are both nonabelian). In most applications X and Y both
have a G action for a single group and we write
This is a special case of the general action (8.102), with G1 = G2 = G and specialized to
the diagonal ∆ ⊂ G × G.
Example: Again let X = M1,d−1 be a Minkowski space time. Take G1 = G2 and let
G = ∆ ⊂ G × G be the diagonal subgroup, and take G to be the Poincaré group. Now
let Y = V be a finite-dimensional representation of the Poincaré group. Let us denote the
action of g ∈ G on V by ρ(g). Then a field Ψ ∈ Map(X, Y ) has an action of the Poincaré
group defined by
g · Ψ(x) := ρ(g)Ψ(g −1 x) (8.104)
This is the standard way that fields with nonzero “spin” transform under the Poincaré
group in field theory. As a very concrete related example, consider the transformation of
electron wavefunctions in nonrelativistic quantum mechanics. The electron wavefunction
is governed by a two-component function on R3 :
!
ψ+ (~x)
Ψ(~x) = (8.105)
ψ− (~x)
Then, suppose G = SU (2). Recall there is a surjective homomorphism π : G → SO(3) ♣IT HASN’T
BEEN DEFINED
defined by π(u) = R where YET. IT IS IN
SECTION 10.1.
u~x · ~σ u−1 = (R~x) · ~σ (8.106) SOME OF THAT
MATERIAL
SHOULD BE
Then the (double-cover) of the rotation group acts to define the transformed electron MOVED EARLIER
wavefunction u · Ψ by !
SO WE CAN
RECALL IT HERE.
ψ+ (R−1 ~x) ♣
(u · Ψ)(~x) := u (8.107)
ψ− (R−1 ~x)
In particular, u = −1 acts trivially on ~x but nontrivially on the wavefunction.
– 128 –
9. Centralizer Subgroups And Counting Conjugacy Classes
Definition 9.1: Let g ∈ G, the centralizer subgroup of g, (also known as the normalizer
subgroup ), denoted, Z(g), is defined to be:
Recall that C(g) denotes the conjugacy class of g. Using the Stabilizer-Orbit theorem
we can establish a 1-1 correspondence between C(g) and the cosets of G/Z(g). As in the
proof of that theorem we have a map ψ : G/Z(g) → C(g) by
The sum is over distinct conjugacy classes. What is g in this formula? For each class we
may choose any representative element from that class.
Now, if G is finite, then by the above 1-1 correspondence we may write:
|G|
|C(g)| = (9.5)
|Z(g)|
– 129 –
which allows us to write the above decomposition of |G| in a useful form sometimes called
the class equation:
X |G|
|G| = (9.6)
|Z(g)|
conj. classes
Again, we sum over a complete set of distinct non-conjugate elements g. Which g we choose
from each conjugacy class does not matter since if g1 = hg2 h−1 then Z(g1 ) = hZ(g2 )h−1
are conjugate groups, and hence have the same order. So, for each distinct conjugacy class
we just choose any element we like.
where the sum is over isomorphism classes of principal G bundles over the circle. and F
is a function on the set of such bundles, or equivalently, a function on all the principal
G-bundles that only depends on the isomorphism class. We call this a gauge invariant
Boltzman factor. The standard physical Boltzman factors involve curvature and holonomy.
In this setting there is no curvature, so F should be proportional to a character of g in
some representation.
In gauge theory we must also divide by the “volume” of the group of automorphisms
of the bundle so
X χρ (g)
Z(S 1 ) = (9.8)
cc
|Z(g)|
for some character χρ . We will see later that the sum is zero unless ρ contains some copies
of the trivial representation, so we might as well take χρ (g) = 1.
91
See section 18 below for a discussion of gauge theory that uses minimal prerequisites and is sufficient
to understand this remark.
– 130 –
Now notice that we can use the stabilizer-orbit theorem to rewrite this as:
X 1 1 X
Z(S 1 ) = = 1 (9.9)
cc
|Z(g)| |G|
Pg
In the second equality we have summed over all the G-bundles Pg weighted by 1 and divided
by the full “volume” |G| of the gauge group. So
1 X
Z(S 1 ) = 1=1 (9.10)
|G|
g∈G
The Hilbert space is one-dimensional with zero Hamiltonian so indeed this is Tre−βH = 1.
Proof : Observe that an element g is central if and only if C(g) = {g} has order 1. Now
let us use the class equation. We can usefully split up the sum over conjugacy classes as a
sum over the center and the rest:
X
|G| = |Z(G)| + |Ci | (9.11)
i
where the sum over i is a sum over the distinct conjugacy classes more than one
element. As we noted above, by the stabilizer orbit theorem
|G|
|Ci | = (9.12)
|Z(gi )|
where gi is any element of the conjugacy class Ci . But, for these conjugacy classes |Z(gi )| <
|G| and by Lagrange’s theorem, and the assumption that p is prime, |Z(gi )| = pn−ni for
some ni < n. Therefore, the second term on the RHS of (9.11) is divisible by p and hence
p||Z(G)|. ♠
X = {(g1 , . . . , gp )|g1 · · · gp = 1} ⊂ Gp
rather it is an
(9.13) application of
stabilizer-orbit. ♣
Note that the cyclic group Zp acts on this set with the standard generator acting by
– 131 –
A fixed point of the Zp -action corresponds to an element of the form (g, . . . , g) such that
g p = 1. If g 6= 1 then this corresponds to an element of order p. Now, by the stabilizer-orbit
theorem, the orbits of any Zp action (on any set) have cardinality either 1 or p. Let N1 be
the number of orbits of length one and let Np be the number of orbits of length p. Note
that the order of X is just |G|p−1 since one can always solve for gp in terms of g1 , . . . , gp−1 .
Then, by the counting principle we have:
It follows that p divides N1 . Also N1 > 0 because (1, ..., 1) is a fixed point of the Zp action.
Therefore N1 = kp > 1 and hence there are other fixed points, i.e. there are group elements
of order p. In fact, there must be at least (p − 1) of them. ♠
Proof 2 : We can also prove Cauchy’s theorem using induction on the order of G, dividing
the proof into two cases: First we consider the case where G is Abelian and then the case
where it is nonabelian.
Case 1: G is Abelian:
If |G| = p then G is cyclic and the statement is obvious: Any generator has order p.
More generally, note that if G is a cyclic group Z/N Z with N > p and p divides N then
N/p ∈ Z/N Z has order p. This establishes the result for cyclic groups.
Now suppose our Abelian group has order |G| > p. Choose an element g0 6= 1 and
suppose that g0 does not have order p. Let H = hg0 i. If H = G then G would be cyclic
but then as we just saw, it would have an element of order p. So now assume H is a proper
subgroup of G. If p divides |H| then H (and hence G) has an element of order p by the
inductive hypothesis. If p does not divide |H| then we consider the group G/H. But this
has order strictly less than |G| and p divides the order of G/H. So there is an element aH
of order p meaning ap = g0x for some x. If g0x = 1 we are done. If not then there is some
smallest positive integer y so that g0xy = 1 but then ay has order p. We have now proved
Cauchy’s theorem for abelian groups.
If p divides the order of the centralizer Z(G) then we can apply our previous result about
Cauchy’s theorem for Abelian groups. If p does not divide Z(G) then there must be some gi
|G|
so that p does not divide |Z(g i )|
but this means p divides |Z(gi )|, but now by the inductive
hypothesis Z(gi ), and hence G has an element of order p. This completes the proof. ♠
– 132 –
Proof 1 : The first proof is again an application of the stabilizer-orbit theorem. 92 Suppose
|G| = pk+r u with gcd(u, p) = 1 and r ≥ 0 and k > 0. We will show that G has a subgroup
of order pk . Consider the power set P(G), namely the set of all subsets of G, and consider
the subset of P(G) of all subsets (not subgroups!) of G of cardinality pk . Call this set of
subsets P(G, pk ). The cardinality of P(G, pk ) is clearly:
k −1
pY
pk+r u pk+r u/j − 1
k r
|P(G, p )| = = p u (9.17)
pk pk /j − 1
j=1
In the product we have a ratio of rational numbers of the form pk+r u/j−1 (the denominator
is a special case of this form). Any rational number r can be expressed as a product of
prime powers r = p̃ prime p̃vp̃ (r) where the vp̃ (r) ∈ Z is known as the valuation of r at p̃
Q
and the product runs over all primes p̃. Now, given a specific prime p, note that if a, b are
relatively prime to p then
pk a pk a − b
−1= (9.18)
b b
and hence for such rational numbers r the integer vp (r) = 0 for the prime p. It follows that
pr divides P(G, pk ) and that it is the maximal power which does so.
Now note that G acts on P(G, pk ) via:
φg : S 7→ g · S := {gh|h ∈ S} (9.19)
where O(S) is the G-orbit through S. Now pk+r divides |G| and if GS is divisible by at
most pk−1 then ps divides |O(S)| for s > r. But now
X
|P(G, pk )| = |O(Si )| (9.22)
distinct orbits
If all the orbits on the RHS were divisible by ps with s > r then |O(Si )| would be divisible
by ps with s > r. But this is not true. Therefore, some orbit is divisible by pr and no
higher power. Therefore some |GS | is divisible by pk , therefore |GS | = pk . Since GS is a
stabilizer group it is a subgroup of G. ♠
92
We are following the nice article on Wikipedia here.
– 133 –
Proof 2 : The more conventional proof is similar to that of Cauchy’s theorem. We work by
induction on |G|, and divide the proof into two cases:
Case 1: p divides the order of Z(G).: By Cauchy’s theorem Z(G) has an element of order
p and hence a subgroup N ⊂ Z(G) of order p. N is clearly a normal subgroup of G
(being a subgroup of the center of G) so G/N is a group. It is clearly of order pk−1 m.
So, by the inductive hypothesis there is a subgroup H̄ ⊂ G/N of order pk−1 . Now let
H = {g ∈ G|gN ∈ H̄}. It is not hard to show that H is a a subgroup of G containing N
and in fact H/N = H̄. Therefore |H| = pk , so H is a p-Sylow subgroup of G.
Case 2: p does not divide the order of Z(G).: In this case, by the class equation p must
not divide |C(g)| = |G|/|Z(g)| for some nontrivial conjugacy class C(g). But that means
that for such an element g we must have that pk divides |Z(g)| < |G|. So Z(g) has a
p-Sylow subgroup which can serve as a p-Sylow subgroup of G. ♠ ♣Again, there is a
nice proof using the
orbit-stabilizer
theorem. See
Wikipedia article.
Give this in the
Exercise section on
Orbit-Stabilizer
If pk divides |G| with k > 1 does it follow that there is an element of order pk ? 93 below? ♣
Exercise
Write out the class equation for the groups S4 and S5 .
Exercise
Find the centralizer Z(g) ⊂ Sn of g = (12 . . . n) in Sn .
Exercise
93
Answer : NO! Zkp is a counterexample: It has order pk and every element has order p.
– 134 –
Prove that if |G| = 15 then G = Z/15Z.
µ : G → G0 (10.1)
q − 1 = rp + sq
But this is impossible: If s ≥ 1 the RHS is too large. So s = 0 but then p would have to divide q − 1.
95
Answer : Consider p = 2 and q = 3 and note that S3 is not isomorphic to Z6 .
– 135 –
Definition 10.1:
a.) The kernel of µ is
The sequence can be as long as you like. It is said to be exact at Gi if im(fi−1 ) = ker(fi ).
A short exact sequence is a sequence of the form
f1 f2
1 −→ G1 −→ G2 −→ G3 −→ 1 (10.6)
which is exact at G1 , G2 , and G3 . Here 1 refers to the trivial group with one element.
There is then a unique homomorphism 1 → G1 and G3 → 1 so we don’t need to specify it.
Thus, the meaning of saying that (10.6) is a short exact sequence is that
1. Exactness at G1 : The kernel of f1 is the image of the inclusion {1} ,→ G1 , and hence
is the trivial group. Therefore f1 an injection of G1 into G2 .
– 136 –
In particular, note that if µ : G → G0 is any group homomorphism then we automati-
cally have a short exact sequence:
µ
1 → K → G → im(µ) → 1 (10.7)
It follows by Theorem 7.2.1, that G/K has a group structure. Note that µ(G) is also
naturally a group.
Theorem 10.2:
µ(G) ∼
= G/K (10.9)
Proof : We associate the coset gK to the element µ(g) in G0 .
ψ : gK 7→ µ(g) (10.10)
Claim: ψ is an isomorphism. You have to show three things:
where on the LHS we have the product in the group G/K and on the RHS we have
the product in G0 . We leave this as an exercise for the reader.
97
Answer : Definitely not! Any subgroup H ⊂ G is the image of the inclusion homomorphism. In general,
subgroups are not normal subgroups.
– 137 –
3. ψ is one-one, i.e. ψ is onto and invertible. The surjectivity should be clear. To prove
injectivity note that:
Remarks:
Example 1: Consider the group of fourth roots of unity, Res(4) and the homomorphism
π : Res(4) → Res(2) given by π(g) = g 2 . The kernel is {±1} = Res(2) and so we have:
1 → Z2 → Z4 → Z2 → 1 (10.15)
As an exercise the reader should also describe this extension thinking of Z4 additively as
Z/4Z, and generalize it to
1 → Zp → Zp2 → Zp → 1 (10.16)
where p is prime.
rN : Z → Z/N Z (10.17)
– 138 –
where ιN (x) = N x.
QP = ωP Q (10.21)
with ω = e2πi/4 . The group of matrices generated by P, Q and ω1N ×N is a finite subgroup
of GL(N, C) isomorphic to a finite Heisenberg group, denoted HeisN . It is an extension
π
1 → ZN → HeisN →ZN × ZN → 1 (10.23)
and has many pretty applications to physics and we will return to this group several times
below. See, for example section 11.11 below for a physical interpretation.
Exercise
Give a formula for π in the exact sequence (10.23).
Exercise An
Use Theorem 7.1 to show that An is a normal subgroup of Sn .
– 139 –
a.) Show that µ−1 (H2 ) ⊂ G1 is a subgroup.
b.) If H1 ⊂ µ−1 (H2 ) is a subgroup show that there is an induced map µ̄ : G1 /H1 →
G2 /H2 .
c.) Show that if H1 and H2 are normal subgroups then µ̄ is a homomorphism.
d.) In this case there is an exact sequence
Exercise
Let A, B be abelian groups and A1 ⊂ A and B1 ⊂ B subgroups, and suppose φ : A → B
is a homomorphism such that φ takes A1 into B1 .
a.) Show that φ induces a homomorphism
ker{φ : A → B}
ker{φ̄ : A/A1 → B/B1 } ∼
= (10.26)
ker{φ : A1 → B1 }
Exercise
Let n be a natural number and let
Exercise
Let G = Z × Z4 . Let K be the subgroup generated by (2, ω 2 ) where we are writing Z4
as the multiplicative group of 4th roots of 1. Note (2, ω 2 ) is of infinite order so that K ∼
= Z.
Show that G/K ∼ = 8Z .
– 140 –
Exercise The Finite Heisenberg Groups
a.) Using the matrices of (10.19) and (10.20) show that the word
Exercise
Let Bn be a braid group. Compute the kernel of the natural homomorphism φ : Bn →
Sn and show that there is an exact sequence
1 → Zn−1 → Bn → Sn → 1 (10.29)
1 → T → N (T ) → SN → 1 (10.31)
– 141 –
10.1 The Relation Of SU (2) And SO(3)
There is a standard homomorphism
To define it we note that for any u ∈ SU (2) there is a unique R ∈ SO(3) such that, for all
~x ∈ R3 we have:
u~x · ~σ u−1 = (R~x) · ~σ (10.33)
where R ∈ SO(3).
To prove (10.33) we begin by noting that, since u−1 = u† and ~x is real, the 2×2 matrix
u~x · ~σ u−1 is hermitian, and traceless, and hence has to be of the form ~y · ~σ , where ~y ∈ R3 .
Moreover, ~y depends linearly on ~x. So the transformation ~x 7→ ~y defined by u~x ·~σ u−1 = ~y ·~σ
is a linear transformation of R3 . In fact it is a norm-preserving transformation. One way
to prove this is to note that (see exercise below)
!
2 2 1 0
(~x · ~σ ) = ~x (10.34)
01
From either formula we conclude that (see exercise below) ~x2 = ~y 2 . We therefore conclude
that ~y = R~x with R ∈ O(3).
We define π(u) = R by using this equation. To be totally explicit
It should be clear from the definition that π(u1 u2 ) = π(u1 )π(u2 ), that is that π is a
homomorphism of groups. Now to show that actually R ∈ SO(3) ⊂ O(3) note that
2i = tr σ 1 σ 2 σ 3
and hence detR = 1. Alternatively, if you know about Lie groups, you can use the fact
that π is continuous, and SU (2) is a connected manifold.
We will now prove that:
– 142 –
Thus we have the extremely important extension:
ι π
1 → Z2 → SU (2) → SO(3) → 1 (10.38)
SO(3, R) ∼
= SU (2)/Z2 (10.39)
where the Z2 we quotient by is the center {±12×2 }. This is arguably the most important
exact sequence in physics.
To prove the above two claims we will need to get to know SU (2) a bit better.
First we claim that, as a manifold, SU (2) can be identified with the unit three-
dimensional sphere. One way to see this is to consider the unit sphere in R4 as the space
of unit vectors in a two-dimensional complex Hilbert space (the space of states of “one
Qbit”):
S3 ∼
= {~z|~z†~z = 1} ⊂ C2 (10.40)
and decomposing z1 , z2 into their real and imaginary parts. Next, we note that SU (2) has
a transitive action on the unit sphere:
φu : ~z 7→ u~z (10.42)
The action is transitive because, given any unit vector we can find another orthogonal unit
vector. But any two ON bases are related by some unitary transformation. By changing
the phase of the second vector we can arrange that they are related by a special unitary
transformation.
Therefore, we should invoke the stabilizer-orbit theorem and compute the stabilizer of,
say !
1
~z0 = . (10.43)
0
These are the upper triangular SU (2) matrices with one on the diagonal: The stabilizer is
trivial. So
SU (2) ∼
= S3 (10.44)
u−1 = u† (10.46)
– 143 –
we solve for the other two matrix elements and conclude that every SU (2) element is of
the form !
α β
u= (10.47)
−β̄ ᾱ
where
|α|2 + |β|2 = 1 (10.48)
This makes the identification of the group as a manifold quite clear.
There are many ways to parametrize S 3 . One is to introduce a polar angle and stratify
S 3 by two-dimensional spheres. Viewed this way, we can write the general SU (2) element
as
u = cos χ + i sin χ~n · ~σ (10.49)
where 0 ≤ χ ≤ π and ~n ∈ S 2 . From this form it is easy to check that u only with all the
σ i if sin χ = 0 so cos χ = ±1. From this we conclude
Now let us study the restriction of the homomorphism π to some special U (1) subgroups
of SU (2). First consider the subgroup of diagonal matrices D of the form:
!
ξ 0
(10.51)
0 ξ −1
If we write
ξ = e−iφ/2 (10.53)
for some angle φ then π maps the diagonal matrix to R ∈ SO(3) that is a rotation around
the x3 axis. It is a counterclockwise rotation by φ in the x1 − x2 plane with the orientation
dx1 ∧ dx2 .
Another obvious subgroup of SU (2) is SO(2), the real unitary matrices. We parametrize
the group by !
cos(θ/2) − sin(θ/2) −i θ σ2
R(θ/2) = e 2 (10.54)
sin(θ/2) cos(θ/2)
where
1
S = √ (1 − iσ 1 ) ∈ SU (2) (10.56)
2
– 144 –
(all you have to check is Sσ 2 S −1 = σ 3 . ) We find that
cos(θ/2) 0 sin(θ/2)
π(R(θ/2)) = 0 1 0 (10.57)
− sin(θ/2) 0 cos(θ/2)
is rotation by θ around the x2 axis. By the Euler angle parametrization we therefore learn
that π is onto. In fact, we can parametrize all SU (2) elements by
3 2 3
u = eφT eθT eψT (10.58)
where
i
T i = − σi 1≤i≤3 (10.59)
2
The range of Euler angles that covers SO(3) once is 0 ≤ θ ≤ π with φ and ψ identified
modulo 2π. Because SU (2) is a double cover we should extend the range of φ or ψ by a
factor of 2 if we want to cover the group SU (2) once. For example, taking:
0≤θ≤π
φ ∼ φ + 2π (10.60)
ψ ∼ ψ + 4π
with
1 1
α = ei 2 (φ+ψ) cos(θ/2) β = −ei 2 (φ−ψ) sin(θ/2) (10.62)
The Euler angle coordinates on SU (2) break down at θ = 0, π. At θ = 0 the product only
depends on (φ + ψ) even though we have a three-dimensional manifold Similarly at θ = π
the product only depends on (φ − ψ).
Remark: As we will discuss later, a good parametrization near the identity would be
u = exp[θk T k ] (10.63)
where we are exponentiating the general element of the Lie algebra su(2)
– 145 –
Exercise Polar Angle Decomposition Of SU (2)
a.) Prove that every element of SU (2) can be written in the form of (10.49).
b.) Express α, β in terms of χ and n̂.
c.) The coordinates χ, n̂ cover a product of an interval and a two-dimensional sphere.
Prove that [0, π]×S 2 is not topologically the same as SU (2). Where does the χ, n̂ coordinate
system go bad?
[T i , T j ] = ijk T k (10.64)
Exercise
Show that in the Euler angle parametrization the shift
ψ → ψ + 2π (10.65)
takes u → −u.
One of the main motivations from physics for studying representation theory stems from
Wigner’s theorem discussed below. The basic upshot of Wigner’s theorem is that, in
quantum mechanics, if G is a group of symmetries of a physical system then the Hilbert
space of the theory will be a representation space of G and in fact will define a unitary
representation: 100 For every symmetry operation g ∈ G there is a unitary operator U (g)
acting on the Hilbert space H so that
The use of representation can be very powerful and have far-reaching consequences. A
few examples:
100
We will need to amend this in two ways to be completely accurate: First, classical symmetries in
general are represented projectively on a quantum Hilbert space. Second, one must allow symmetries to be
represented by both unitary and anti-unitary operators in general.
– 146 –
Figure 28: Roots of unity on the unit circle in the complex plane. Here ω = e2πi/8 is a primitive
eighth root of 1.
1. The use of representation theory greatly aids in the diagonalization of physical ob-
servables, such as Hamiltonians.
2. Quantum states can be classified according to their symmetry types. This has im-
portant applications to selection rules governing what kind of transition amplitudes
can be nonzero.
4. The very formulation of Lagrangians and actions makes heavy use of representation
theory. For example, in relativistically invariant field theory the fields form a rep-
resentation of the Poincaré group (induced from the action on spacetime) and one
wishes to make a Lorentz invariant density when forming a Lagrangian.
Let V be a vector space over a field κ and recall that GL(V ) denotes the group of all
invertible linear transformations V → V . It is also denoted as Aut(V ) since it is the group
of linear automorphisms of V with itself.
T : g 7→ T (g)
(11.2)
G → GL(V )
– 147 –
V is sometimes called the representation space or the carrier space. We will abbreviate the
action of T (g) on a vector v ∈ V from T (g)(v) to T (g)v for readability.
Put differently, in terms of group actions, a representation of G is a G action on a
vector space that respects the linear structure:
g · (α1 v1 + α2 v2 ) = α1 g · v1 + α2 g · v2 (11.3)
If the vector space has an ordered basis then we get a matrix representation. For
example, if V is finite-dimensional then we can choose an ordered basis {v1 , . . . , vn } and
the corresponding matrix representation g 7→ M (g) ∈ GL(n, κ) is defined by:
X
T (g)vi = M (g)ji vj (11.4)
j
One easily checks that T (g1 ) ◦ T (g2 ) = T (g1 g2 ) implies M (g1 )M (g2 ) = M (g1 g2 ).
V1
A / V2 (11.5)
T1 (g) T2 (g)
V1
A / V2
commutes. Equivalently,
T2 (g)A = AT1 (g) (11.6)
– 148 –
for all g ∈ G. Put differently: A is a morphism of G actions, and put yet another way: A
is an equivariant linear map of G spaces. We denote the vector space of all intertwiners by
HomG (V1 , V2 ).
Note that if an intertwiner is invertible then T −1 is also an intertwiner. So we have:
for all g ∈ G.
Examples:
1. The general linear group GL(n, κ) with κ = R, C always has a family of one-
dimensional real representations, labeled by µ ∈ C given by
1. The direct sum ⊕ of representations. The direct sum of (T1 , V1 ) and (T2 , V2 ) is the
rep (T1 ⊕ T2 , V1 ⊕ V2 ) where the representation space is V1 ⊕ V2 and the operators
are:
((T1 ⊕ T2 )(g)) v1 ⊕ v2 := (T1 (g))(v1 ) ⊕ (T2 (g))(v2 ) (11.11)
– 149 –
2. Similarly, for the tensor product, the carrier space is V1 ⊗ V2 . See Chapter 2 for a
proper definition of the tensor product. For finite-dimensional vector spaces we can
say that if {v1 , . . . , vn } is a basis for V1 and {w1 , . . . , wm } is a basis for V2 then the
set of vectors of the form vi ⊗ wa form a basis for V1 ⊗ V2 and we impose the rules
(α1 v1 + α2 v2 ) ⊗ w = α1 v1 ⊗ w + α2 v2 ⊗ w (11.13)
v ⊗ (α1 w1 + α2 w2 ) = α1 v ⊗ w1 + α2 v ⊗ w2 (11.14)
The tensor product can be defined without reference to a basis (See Chapter 2) and
so can the tensor product of representations. We set:
((T1 ⊗ T2 )(g)) v ⊗ w := (T1 (g)v) ⊗ (T2 (g)w) (11.15)
for all v∈ V1 and w ∈ V2 and then extend by κ-linearity.
If {v1 , . . . , vn } is an ordered basis for V1 and {w1 , . . . , wm } is an ordered basis for V2
then the matrix elements of (T1 ⊗ T2 )(g) will be of the form
(M1 ⊗ M2 )(g)ia,jb = (M1 (g))ij (M2 (g))ab (11.16)
Note that while the set {vi ⊗ wa } forms a basis for V1 ⊗ V2 it does not in any natural
way define an ordered basis: One needs to make a further choice of how to order this
basis and there are several choices. Once one has done this, there will be an ordering
on pairs (i, a) and (11.16) will define the matrix relative to this ordered basis.
4. As a corollary of the previous remark note that the vector space of linear transforma-
tions from V to W , denoted Hom(V, W ), is canonically isomorphic to V ∨ ⊗ W and
hence naturally becomes a representation of G. In concrete terms, if φ ∈ Hom(V, W )
is a linear transformation φ : V → W then the G-action on φ is determined by our
general remarks about induced actions on function spaces:
(T̃ (g) · φ)(v) := TW (g) · φ TV (g −1 )v
(11.19)
– 150 –
Exercise New Matrix Representations From Old Ones
Given a matrix representation of a group g → M (g) show that
a.) g → (M (g))tr,−1 is also a representation.
b.) Check the claim (11.19) above.
c.) If M is a matrix representation in GL(n, C) then g 7→ M (g)∗ is also a representa-
tion.
d.) If T is a real representation, then there exists an S ∈ GL(n, C) such that for all
g ∈ G:
M ∗ (g) = SM (g)S −1 (11.21)
Warning: The matrix elements M (g)ij of a real representation might not be real
numbers:
e.) Show that the defining two-dimensional representation of SU (2) acting on C2 is a
real representation, but there is no basis in which the matrix elements are all real.
As we will explain later, real representations can be further distinguished as totally
real and quaternionic (a.k.a. pseudoreal).
♣Some duplication
in this section with
previous material ♣
11.2 Characters
For any finite-dimensional representation T : G → Aut(V ) of any group G we can define
the character of the representation, denoted χT . It is a function on the group:
χT : G → κ (11.23)
and it is defined by
χT (g) := TrV (T (g)) (11.24)
Some useful general remarks about characters:
– 151 –
3. χT (h−1 gh) = χT (g) for all g, h ∈ G. In other words, χT (g) only depends on g via its
conjugacy class. In general, a function F : G → C that only depends on conjugacy
class, that is, that satisfies F (h−1 gh) = F (g) for all g, h ∈ G is known as a class
function. Such functions “descend” to functions on the set of conjugacy classes of G.
Definition 11.3.2. If a rep (V, T ) is equivalent to a unitary rep then such a rep is said to
be unitarizable.
Example. A simple example of non-unitarizable reps are the detµ reps of GL(n, κ) with
κ = R, C and the “spin-s” representations ρs of the Lorentz group SO0 (1, 1) described in
section *****
Exercise
a.) Show that if T (g) is a rep on an inner product space then T (g −1 )† is a rep also.
101
See Chapter 2, section 12
– 152 –
b.) Suppose T : G → GL(V ) is a unitary rep on an inner product space V . Let {vi }
be an ordered orthonormal basis for V . Show that the corresponding matrix rep M (g)ij is
a unitary matrix rep. That is:
M : G → U (dimV ) (11.26)
is a homomorphism.
c.) Show that for a unitary matrix rep the transpose-inverse and complex conjugate
representations are equal.
χρ (g −1 ) = χρ (g)∗ . (11.27)
1 X
f→ f (g) := hf i (11.28)
|G|
g∈G
R
The map f 7→ G f (g)dg on complex-valued functions is clearly linear in the function
f . So, it defines an element of the dual space of the space of functions, i.e. a map from the
functions to C. There are lots of other elements of the dual space, such as evaluation on a
particular group element g0 ,
evg0 : f 7→ f (g0 ) (11.30)
As a measure this is i.e. the Dirac measure supported at g0 (a.k.a. the evaluation map at
g0 ). Of course we could take various linear combinations of functionals of the form evg0 to
get others. What is special about the measure (11.29) is that it satisfies the left invariance
property: Z Z
f (hg)dg = f (g)dg (11.31)
G G
for all h ∈ G.
– 153 –
The left-action on G induces an action L∗h on the functions on G. Let L∗h (f ) denote
the function on G defined by:
(L∗h f )(g) := f (hg) (11.32)
Then hL∗h (f )i = hf i.
Note that, in this case of a finite group, the measure is also right-invariant:
Z Z
f (gh)dg = f (g)dg (11.33)
G G
For a finite group left-invariant and right-invariant measures are unique up to overall
scale. Indeed, the most general measure will be of the form
X
ρ(g)f (g) (11.34)
g∈G
Now apply the statement of left-invariance to the “Dirac function” at g0 , a.k.a. the char-
acteristic function at g0 : 102 (
1 g = g0
δg0 (g) := (11.36)
0 else
Then ρ(g0 ) = ρ(h−1 g0 ) for every g0 and every h. Therefore ρ(g) = c is just some constant
function on the group. It is now easy to check that the measure is also right-invariant.
The idea of a left- or right-invariant measure extends to continuous groups. For exam-
ple for G = R the general measure is given by
Z +∞
f (x)ρ(x)dx (11.37)
−∞
for some measure ρ(x)dx. By a similar argument to the above we learn that ρ(x + y)dx =
ρ(x)dx for all x, y, and hence ρ(x) is a constant.
Remark: Haar’s Theorem: The existence of left and right-invariant measures on topo-
logical groups is very general. The topological group G must be “locally compact” and
“Hausdorff” - two mild topological conditions on G. 103
102
Do not confuse this function with the Dirac measure evg0 .
103
A topological space X is locally compact if, for every x ∈ X there is a compact neighborhood of x.
That is, there is a compact subspace K ⊂ X so that x ∈ U ⊂ K for some open neighborhood U of x. A
topological space X is Hausdorff if, for all pairs of distinct points x1 , x2 ∈ X there exist neighborhoods
x1 ∈ U1 and x2 ∈ U2 such that U1 ∩ U2 = ∅. In other words, open sets separate points.
– 154 –
Then one can define a set B of measurable subsets of G and a measure µ : B → R
is “left-invariant” if µ(gS) = µ(S) for all measurable subsets S ∈ B. There is a similar
definition of “right-invariant.” If one imposes a few more technical conditions then Haar’s
theorem states that such measures are unique up to multiplication by a positive constant.
Given a Haar measure one can define integrals of some class of functions - known as
measureable functions. For example, for G = R clearly we can only integrate functions so
that Z +∞
f (x)dx (11.38)
−∞
exists.
In general, the left- and right- invariant measures on a topological group need not
coincide, even up to scale. However, in the case of compact groups the left- and right-
invariant measures, are unique and coincide up to scale. For compact Lie groups the
essential observation is that left-invariance shows the volume form must be proportional
to hg −1 dg ∧ · · · g −1 dg, vi where v is a volume form on g∨ . 104
Examples:
where c is a constant.
where c is a constant.
3. Now let G = R∗>0 be the multiplicative group of positive real numbers. The most
general Haar measure is of the form
Z Z ∞
dx
f (g)dg := c f (x) (11.41)
G=R∗>0 0 x
where c is a constant.
104
See Chapter **** for a discussion of the Maurer-Cartan form g −1 dg and the notion of left- and right-
invariant differential forms on Lie groups. Another, more sophisticated, but also more elegant proof was
explained to me by Dan Freed: Recall that for a finite dimensional real vector space V of dimension n
there is a GL(n, R) torsor B(V ) of the bases. Consider the character |det| on GL(n, R). There is a one-
dimensional line of equivariant functions B(V ) → R transforming according to this character. There is a
natural orientation on this line given by the positive functions and a nonzero function corresponds to a
measure on V . It is unique up to scalar multiplication by a positive constant. Now let V be the vector
space of left-invariant vector fields on G. A measure on V corresponds to a left-invariant measure on the
group. Now right translation acts by a positive scalar, and this gives a homomorphism G → R>0 . But G is
compact so the only possible homomorphism is the trivial one. Therefore the measure is also right-invariant.
– 155 –
4. Similarly, consider G = GL(n, R). We can take the matrix elements gij to be co-
ordinates on the open domain {g|detg 6= 0} ⊂ Mn (R) ∼
2
= Rn . The usual Euclidean
Q
measure ij dgij changes under g → g0 g by
Y Y
dgij → |detg0 |n dgij (11.42)
ij ij
where c is a constant.
where g(θ) = f (eiθ ) and the range of the last integral is over any interval of length
2π. Here the scale of the measure has been chosen so that the “volume” of the group
is 1.
6. For the important case of G = SU (2) we can write it as follows. First, every element
of SU (2) can be written as: ♣Definition of β
! here is backwards
α −β ∗ from what we use
g= (11.45) when we describe
β α∗ reps using
homogeneous
polynomials below.
for 2 complex numbers α, β with ♣
In this way we identify the group as a manifold as S 3 . That manifold has no globally
well-defined coordinate chart. The best we can do is define coordinates that cover
“most” of the group but will have singularities are some places. (It is always impor-
tant to be careful about those singularities when using explicit coordinates!) We can
always write:
α = ζ1 cos θ/2
(11.47)
β = ζ2 sin θ/2
where ζ1 , ζ2 are phases, and the magnitude is parametrized in a 1-1 fashion by taking
0 ≤ θ ≤ π. Next it is standard to parametrize the phases by:
1
α = ei 2 (ψ+φ) cos θ/2
1
(11.48)
β = iei 2 (ψ−φ) sin θ/2
– 156 –
The virtue of this definition is that we can then write:
! ! !
eiφ/2 0 cos θ/2 i sin θ/2 eiψ/2 0
g=
0 e−iφ/2 i sin θ/2 cos θ/2 0 e−iψ/2 (11.49)
i 21 φσ 3 i 21 θσ 1 i 12 ψσ 3
=e e e
and under the standard homomorphism π : SU (2) → SO(3) the angles θ, φ, ψ become
the Euler angles.
We need to be a little careful about φ and ψ since they are not defined at θ = 0, π
and we also need to be careful about their ranges. The above expression is invariant
under the transformations:
(φ, ψ) → (φ + 4π, ψ)
(φ, ψ) → (φ, ψ + 4π) (11.50)
(φ, ψ) → (φ + 2π, ψ + 2π)
1
[dg] = dψ ∧ dφ ∧ sin θdθ (11.52)
16π 2
For much more about this, see Chapter 5 below.
We give formulae for the Haar measures on the classical compact matrix groups in
Chapter 6 below.
Proposition If (T, V ) is a rep of a compact group G and V is an inner product space then
(T, V ) is unitarizable.
Proof We make essential use of the Haar measure. If T is not already unitary with
respect to the inner product h·, ·i1 then we can define a new inner product by:
Z
hv, wi2 := hT (g)v, T (g)wi1 dg (11.53)
G
105
We have chosen an orientation so that, with a positive constant, this is Tr2 (g −1 dg)3 ).
– 157 –
For a compact group hT (g)v, T (g)wi1 will be a nice continuous function, hence bounded,
and therefore integrable. If v 6= 0 then T (g)v 6= 0 hence hT (g)v, T (g)vi1 > 0 and therefore
0 < hv, vi2 < ∞. Here we used the fact that for a compact group the volume is finite. So
h·, ·i2 will be a good inner product. Then using the properties of the Haar measure it is
easily checked that
hT (g)v, T (g)wi2 = hv, wi2 (11.54)
and hence T (g) is unitary w.r.t. the inner product h·, ·i2 ♠
Remark: We saw that the representations detµ of GL(n, R), GL(n, C) and ρs of SO0 (1, 1)
are not unitarizable. What fails in the above argument is the infinite volume of these
noncompact groups.
and hence an induced action on Map(G, Y ) for any Y . Now let Y = C. Then Map(G, C)
is a representation of G × G because the induced left-action:
– 158 –
converts the vector space of functions Ψ : G → C into a representation space for G × G.
If we equip G with a Haar measure then we can speak of L2 (G), namely the Hilbert
space based on the complex-valued functions such that
Z
hf, f i := |f (g)|2 dg < ∞ (11.60)
G
Note that the G × G action preserves the L2 -property thanks to left- and right- invariance,
and in fact the G × G action is unitary.
defines the right regular representation. Both actions are left- actions on the functions
space, so the terminology is slightly confusing.
Suppose that (T, V ) is a representation of G. As we explained above the vector space
of linear transformations End(V ) := Hom(V, V ) of V to itself is then also a representation.
In fact, it is a representation of G × G because if S ∈ End(V ) then we can define a linear
left-action of G × G on End(V ) by:
That is
ι(S) := ΨS (11.66)
– 159 –
You are asked to prove this in an exercise below. Put differently, denoting by TEnd(V ) the
representation of G×G on End(V ) and TReg.Rep. the representation of G×G on Map(G, C)
we get a commutative diagram:
End(V )
ι / Map(G, C) (11.68)
TEnd(V ) TReg.Rep.
End(V )
ι / Map(G, C)
If we choose an ordered basis {vi } for V then the operators T (g) are represented by
matrices: X
T (g) · vi = M (g)ji vj (11.69)
j
If we take S = eij to be the matrix unit in this basis then ΨS is the function on G given
by the matrix element M (g −1 )ji = M tr,−1 (g)ij . So the ΨS ’s are linear combinations of
matrix elements of the representation matrices of G. (Replacing V by its dual V ∨ we will
get the representation matrices M (g)ij .) The advantage of (11.65) is that it is completely
canonical and basis-independent.
See section 11.9 below for more about the regular representation.
Exercise
a.) Let δ0 , δ1 , δ2 be a basis of functions in the regular representation of Z3 which are
1 on 1, ω, ω 2 , respectively, and zero elsewhere. Show that ω is represented as
010
L(ω) = 0 0 1 (11.71)
100
b.) Show that
L(h) · δg = δh·g
(11.72)
R(h) · δg = δg·h−1
106
Answer : This is a straightforward computation:
(h1 , h2 ) · ΨS (g) = ΨS (h−1
1 gh2 )
– 160 –
and conclude that for the left, or right, regular representation of a finite group the repre-
sentation matrices in the δ-function basis are permuation matrices.
Examples
3. Consider the canonical representation of Sn on κn . The line through the all ones
vector is an invariant subspace.
4. We saw above that spaces of functions of the form (11.65) for a fixed V define invariant
subspaces of the regular representation with action (11.59). Let us write this out in
more detail.
Let M : G → GL(n, κ) be any matrix n-dimensional representation of G. For a
fixed i, j consider the matrix element Mij as a κ-valued function on G: So Mij is the
– 161 –
function on G whose value at g ∈ G is just M (g)ij ∈ κ. Now consider the linear span
of functions where we fix i:
is an invariant subspace under the action of G × G under the action (11.59) on L2 (G).
Remarks:
Remarks:
– 162 –
1. Given any nonzero vector v ∈ V , the linear span of {T (g)v}g∈G is an invariant sub-
space. In an irrep this will span all of V . Such a vector is called a cyclic vector.
Caution!! The existence of a cyclic vector does not imply the representation is irre-
ducible. Consider the vector e1 in the permutation representation of Sn on Rn .
{w1 , . . . , wk } (11.79)
for V . Let us write wi , i = 1, . . . , k for the basis vectors for W and ua , a = k+1, . . . , n
for our choice of a set of complementary ordered basis vectors for V . The matrix
representation associated with such a choice of basis is defined by:
where
3. If W ⊂ V is an invariant subspace then the quotient vector space (see Chapter two)
V /W is a representation of G in a natural way:
The reader can check this is well-defined. The vectors ua above define a basis for
V /W of the form ua + W , and relative to this basis the representation will look like
M22 .
– 163 –
M11 (g) 0 0 ···
0 M22 (g) 0 ···
M (g) = (11.86)
···
0 0 M33 (g)
··· ··· ··· ···
Examples
for z ∈ U (1). This is clearly an irreducible representation. We will argue below that
these are the only irreducible representations.
so if V ∼
= Cd is the carrier space we would write
V ∼
= ρn1 ⊕ · · · ⊕ ρnd (11.89)
3. G = Z2
!
10
1→
01
! (11.90)
01
(12) →
10
– 164 –
for R3 . As we have noted, the all ones vector u0 := e1 + e2 + e3 spans an invariant
subspace L ⊂ R3 . We can choose basis vectors for the orthogonal complement
u1 := e1 − e2
(11.92)
u2 := e2 − e3
and the subspace spanned by u1 , u2 is also an invariant subspace. In fact, one easily
computes that relative to this basis:
! ! !
−1 1 1 0 0 −1
M ((12)) = M ((23)) = M ((13)) =
0 1 1 −1 −1 0
! !
0 −1 −1 1
M ((123)) = M ((132)) =
1 −1 −1 0
(11.93)
√ !
−√21 − 23
M ((13)) = (11.95)
− 23 12
√ !
1 3
−
√2 2
M ((23)) = 3 1
(11.96)
2 2
√ !
2π −
√2
1
− 23
M ((123)) = R( ) = 3
(11.97)
3 2 − 12
√ !
−2π −√12 23
M ((132)) = R( )= (11.98)
3 − 23 − 21
generates a unitary representation.
– 165 –
5. Consider the representation of Sn on Rn . Then the one-dimensional subspace L =
{(x, · · · , x)} is a subrepresentation. Moreover we can take
X
L⊥ = {x1 , · · · xn | xi = 0}. (11.99)
Example 3 Let G = GL(n, κ) and H = Matn (κ). We can define a group which, as a set
is H × G, but it has a “twisted” group multiplication law:
One can check that this really does define a group structure. It is a special case of the
semi-direct product structure that we will study in more detail in Section **** . Now check
that the following is a matrix representation:
!
g hg tr,−1
T (h, g) := (11.103)
0 g tr,−1
– 166 –
R_G
W^\perp
y ∈ W ⊥ ⇔ ∀x ∈ W, hy, xi = 0 (11.104)
Let g ∈ G, y ∈ W ⊥ . Compute
T (g)y, x = y, T (g)† x
(11.105)
= y, T (g −1 )x
1. Finite dimensional unitary representations are always completely reducible. This fol-
lows from the above proposition together with induction on the dimension.
3. In particular, for a finite group the regular representation L2 (G) is completely re-
ducible as a representation of G or of G × G.
4. The complete reduction L2 (G) is given by the Peter-Weyl theorem described below.
The beautiful aspect of this theorem is that it holds not just for finite groups but for
all compact groups.
– 167 –
Isotypical Components: For each isomorphism class of irreducible representation of
G choose a representative (T (µ) , V (µ) ) where µ runs over the set of distinct irreducible
representations. If V is a completely decomposable representation then we can write
aµ
V ∼
= ⊕µ ⊕i=1 V (µ) (11.106)
where V (µ) is the carrier space of an irreducible representation of G, we are summing over all
irreps in (11.106), and aµ is the number of times that irrep appears in the decomposition.
(We understand that if a particular irrep does not appear at all then we take aµ = 0.)
When aµ 6= 0 we can write
aµ
⊕i=1 V (µ) ∼
= κaµ ⊗κ V (µ) (11.107)
as vector spaces where κ is the ground field. We can moreover interpret this as an inter-
pretation as representations where T (g) acts as the identity operator on κaµ . With this
understood, the summand κaµ ⊗κ V (µ) is called the isotypical component of V belonging
to µ. Note that here If we abbreviate κaµ ⊗ V (µ) to aµ V (µ) then the decomposition into
isotypical components can be written as:
Schur’s Lemma: Let V1 , V2 be vector spaces over any field κ such that they are
carrier spaces of irreducible representations of any group G. If A : V1 → V2 is an
intertwiner between these two irreps then A is either zero or an isomorphism of
representations.
Proof : Note that the kernel and image of A are invariant subspaces of V1 and V2 , respec-
tively. Recall these are defined by:
– 168 –
Note that if V1 = V2 = V is an irrep then the set of intertwiners V → V is not only a
linear space but is also an algebra (see Chapter 2). Namely, A1 ◦ A2 is also an intertwiner.
In the case of the algebra of self-intertwiners of a representation one can say more. But
now the choice of field κ becomes important.
Proof : Since we are working over the complex field A has a nonzero eigenvector Av = λv.
That follows because the characteristic polynomial pA (x) = det(x1 − A) is a polynomial in
the complex field and has a root in the complex numbers. The eigenspace C = {w : Aw =
λw} is therefore not the zero vector space. But it is also an invariant subspace. Therefore,
it must be the entire carrier space. ♠.
Remarks:
HomG (V (µ) , V ) ∼
= ⊕ν HomG (V (µ) , κaν ⊗ V (ν) )
∼
= ⊕ν κaν ⊗ HomG (V (µ) , V (ν) ) (11.111)
= κaµ ⊗ HomG (V (µ) , V (µ) )
In the second line we used the fact that G acts trivially on κaν . If we work over κ = C
then we just showed HomG (V (µ) , V (µ) ) ∼
= C and hence we have a better interpretation
of the degeneracy space: It is the linear space of G-invariant maps from V (µ) → V .
Indeed, note that there is a canonical equivariant map:
– 169 –
∼ Dµ ⊗ V (µ) where V (µ) is an irreducible representation and Dµ is the
That is H(µ) =
degeneracy space. The sum in (11.114) is over the irreps of G. Suppose now that H
is a Hermitian operator, such as a Hamiltonian, that commutes with the G-action.
That is
H:H→H (11.115)
is an intertwiner. This is typically what happens when we have a symmetry of some
dynamics. By Schur’s lemma we therefore have under this isomorphism
H∼
= ⊕µ H (µ) ⊗ 1V (µ) (11.116)
In terms a basis compatible with the isotypical decomposition it means that H (µ)
acts only on the degeneracy space, and hence the Hamiltonian has been partially (or
sometimes completely) diagonalized. This also leads to selection rules. If ψ1 , ψ2 are
two states in different isotypical components and O is an operator that commutes
with the G-action then the transition amplitude
********************
ABOVE DISCUSSION NEEDS TO BE SUPPLEMENTED IN MORE CONCRETE
TERMS:
D⊗V with G acting as 1⊗T (g) commutes with A⊗1, with A any linear transformation
on D. If V is irreducible
HomG (D ⊗ V, D ⊗ V ) ∼
= Hom(D, D) (11.118)
3. Schur’s lemma over other fields can lead to more complicated possibilities. All we
can say in general is that HomG (V (µ) , V (µ) ) is a division algebra over κ, meaning
that it is an algebra in which all nonzero elements are invertible. (This follows
immediately from our proof of Schur’s lemma above.) When we take κ = R we get a
division algebra over R and there are three possibilities: R, C, H. As an example of
the case R just consider the real linear transformations between the irreducible real
representations of Z2 . The algebra of intertwiners is clearly just R. If we consider
the defining representation C2 ∼= R4 as a representation of SU (2) over R, so that
SU (2) elements are represented as 4 × 4 real matrices, then there are more R-linear
transformations on R4 that can commute with the representation matrices T (g). (If
we tried to express them as transformations of C2 they would typically be a linear
combination of C-linear and C-antilinear transformations.) This is an example where
the algebra of intertwiners is H. For more on this see the section in chapter two on
quaternions.
– 170 –
11.8 Pontryagin Duality
In this section we introduce the beautiful idea of the Pontryagin dual of an Abelian group.
107 We will use it in section 11.11, and again we will use it to give a very general construction
of interesting Heisenberg extensions of Abelian groups.
Remarks:
1. The Pontryagin dual group Sb can also be thought of as the group of all complex
one-dimensional unitary representations of S. It follows from Schur’s lemma that
all irreducible finite dimensional complex representations of an Abelian group are
one-dimensional.
Note that the adjective complex is essential here. After all the defining representation
of the Abelian group SO(2) is R2 and is irreducible as a representation over R.
ŝ : χ 7→ χ(s) (11.120)
Note that ŝ is just the evaluation map evs discussed previously. The map s 7→ ŝ is a
homomorphism S → S. The main theorem is:
bb
Sb ∼
=S (11.121)
b
For a proof see, for example, the book on representation theory by A.A. Kirillov.
107
The transliteration from the Cyrillic to the Latin alphabets takes various forms. Another common one
is Pontrjagin.
– 171 –
Example 1: Consider S = Z/nZ, thought of additively. To determine χ ∈ Hom(S, U (1))
¯ = χ(1̄)` for any ` ∈ Z. Put χ(1̄) = ω ∈ U (1). But
it suffices to determine χ(1̄), since χ(`)
now we need to impose the relation χ(n̄) = χ(0̄) = 1. This implies ω n = 1, so ω is an nth
\ is
root of unity. So the most general element of Z/nZ
¯ = χω (`)
χ(`) ¯ := ω ` (11.122)
for some real number k. However, for this to be well-defined we must have k = 2πn with
n ∈ Z. Therefore χ must be of the form
χn (x + Z) = exp[2πinx] (11.129)
χn (ξ) := ξ n (11.130)
– 172 –
Comparing (11.127) and (11.131) we verify the general result (11.121).
Example 5: Tori. Consider the group G = Zd . It will be useful to consider a free G action
on affine Euclidean space Ed . This defines a subset Γ ⊂ Ed known as a lattice (sometimes
called an embedded lattice). The quotient space Ed /Γ has a natural basepoint, namely the
coset of Γ and, as a group it is isomorphic to U (1)d . By the same arguments as above its
Pontryagin dual will be isomorphic to Zd .
There is a nice way to think about the Pontryagin duality between lattices and tori.
Suppose Γ is a lattice in Rd . Using the Euclidean norm we can define another lattice, the ♣Really we should
do this using the
dual lattice dual vector space...
♣
Γ∨ = {g ∈ Rd |g · γ ∈ Z ∀γ ∈ Γ} (11.132)
The unitary irreps of Γ are represented by points in the torus T ∨ := Rd /Γ∨ . Note that
T ∨ is a torus as a manifold and is isomorphic to the group U (1)d , as an Abelian group.
For any k̄ ∈ T ∨ we can
χk̄ (γ) = exp[2πik · γ] (11.133)
where k is any representative of k̄, that is k̄ = k + Γ∨ . Note that the above formula is
well-defined because if we choose any two lifts k1 and k2 of k̄ then k1 = k2 + g with g ∈ Γ∨ ,
and then g · γ ∈ Z for all γ ∈ Γ. So
b∼
Γ = Rd /Γ∨ ∼
= U (1)d (11.134)
Conversely, the Pontryagin dual of the torus Rd /Γ∨ can naturally be identified with Γ by
the same formula:
χγ (k̄) = exp[2πik · γ] (11.135)
108
Answer : Use the isomorphism U (1) ∼ = R/Z. One needs to say what is the image of p−n . So χ( p1n ) =
exp[2πian /p ], for some integer an because χ(p−n ) must itself be a (pn )th root of unity. Note we can regard
n
– 173 –
Figure 31: Example of a bandstructure. (For silicon.) On the horizontal axis the structure is
plotted as a function of k along lines inside the Brillouin torus. The letters refer to points where
the (cubic) crystallographic group has fixed points. Γ denotes the identity element k̄ = 0 where the
full cubic symmetry group is restored.
But for the statement we are going to make all we need is that U (x + γ) = U (x) for γ ∈ Γ.
Now the group Γ acts on the Hilbert space through unitary operators commuting with H.
Explicitly:
ρ(γ) = exp[iγ · p̂/~] (11.139)
~ as is standard in quantum mechanics. Note that
where p̂ = −i~∇
ρ(γ1 )ρ(γ2 ) = ρ(γ1 + γ2 ) (11.140)
– 174 –
so the Hilbert space will be a representation of Γ. The Hamiltonian cannot make transitions
between different irreps in an isotypical decomposition.
We have classified the one-dimensional representations above so if ψ ∈ H were to be
in a one-dimensional representation then it would have to be quasi-periodic:
where k̄ ∈ Rd /Γ∨ . In this context the Pontryagin dual torus T ∨ is known as the Brillouin
torus.
Note that if ψ obeys (11.141) then we can always write (noncanonically!!!) as
Now, the Hamiltonian acts within Hk̄ . It is useful to write the eigenvalue problem as
where
Hk = e−2πik·x He2πik·x (11.146)
Note that we had to make a choice of k that projects to k̄ to write the Hamiltonian Hk .
However, if k 0 = k + g where g ∈ Γ∨ then Hk0 is unitarily equivalent to Hk . Indeed
U = e2πix̂·g is a nice unitary operator on the wavefunctions on the torus Rd /Γ.
Hk is an Hermitian elliptic operator acting on the functions on a compact manifold.
Explicitly, it works out to
~2 2 ~2 ~
Hk = − ∇ − 4π k · (i∇) + (U + 4π 2 k 2 ) (11.147)
2m 2m 2m
and it acts on L2 functions on the torus T = Rd /Γ. This operator should be viewed as a
perturbation of a Laplace operator on functions on a compact manifold. The latter has a
– 175 –
discrete spectrum of eigenvalues. The spectrum of h = −∇2 is {4π 2 g 2 }g∈Γ∨ . The theory
of elliptic Hermitian operators on compact manifolds shows that the lower order terms in
(11.147) do not change this property and hence the operator (11.147) has a discrete set of
eigenvalues {En (k)}. Note that while Hk depends on k, the spectrum itself only depends
on k̄. This is not obvious from (11.147) but it follows from the unitary equvalence between
Hk1 and Hk2 where k1 and k2 are two representatives of k̄.
The eigenvalues vary continuously as functions of k̄ ∈ Rd /Γ∨ to give what is called a
band structure. See Figure 31.
L2 (G) ∼
= ⊕µ End(V (µ) ) (11.148)
where we sum over the isomorphism class of each irreducible representation exactly once,
and for each irrep we choose a representative (T (µ) , V (µ) ). ♣Notation is too
heavy: Replace
The key to proving the Peter-Weyl theorem are the orthogonality relations for matrix V (µ) by V µ etc.
below. ♣
elements of irreps:
Theorem: Let G be a compact group, and define an Hermitian inner product on L2 (G)
by Z
hΨ1 , Ψ2 i := Ψ∗1 (g)Ψ2 (g)dg (11.149)
G
where, WLOG we normalize the Haar measure so the volume of G is one. Let {V (µ) }
be a set of representatives of the distinct isomorphism classes of irreducible unitary rep-
(µ)
resentations for G. For each representation V (µ) choose an ON basis wi , i = 1, . . . , nµ
with
nµ := dimC V (µ) . (11.150)
Then the matrix elements Mijµ (g) defined by
nµ
(µ) µ (µ)
X
T (g)wi = Mji (g)wj (11.151)
j=1
Proof : The proof is based on linear algebra and Schur’s lemma. For any linear transfor-
mation A : V (µ) → V (ν) we can average using the Haar measure
Z
à := T (ν) (g)AT (µ) (g −1 )dg (11.153)
G
– 176 –
And then a small computation shows that à is an intertwiner:
Z
(ν)
T (h)Ã = T (ν) (hg)AT (µ) (g −1 )dg
ZG
= T (ν) (g)AT (µ) (g −1 h)dg (11.154)
G
= ÃT (µ) (h)
Equation (??) holds for the matrix elements relative to any ordered bases for the V (µ) . If
we specialize to ON bases then the matrices M (µ) (g) are unitary. It is now useful to define
functions φµij ∈ L2 (G) by
(µ) √ (µ)
φij : g 7→ nµ Mij (g) (11.158)
so that we have Z
(µ) (ν)
[dg](φij (g))∗ φk` (g) = δµ,ν δik δj` . (11.159)
G
Now recall the definition of the map ι : End(V ) → L2 (G) in (11.66). This is easily
generalized: If we have any collection of finite-dimensional representations {Vλ } of G then
we have
ι : ⊕λ End(Vλ ) ,→ L2 (G) (11.160)
P
where we just add the functions ι(⊕i Si ) := i ΨSi . Thanks to the equivariance, the image
of (11.160) is a G × G-invariant subspace, i.e. a subrepresentation of L2 (G). It follows
from the orthogonality of matrix elements that if we choose the collection {Vλ } to be the
set of distinct irreps of G, namely {V (µ) } then ι is injective. We claim it is also surjective:
The orthogonal complement would have to be a representation of G, and hence would
be isomorphic to a direct sum of representations V (µ) . But then the matrix elements are
(µ)
already accounted for by the image of ι. Thus the functions φij are in fact an orthonormal ♣Be more explicit
and careful about
basis for L2 (G). ♠ this step. ♣
– 177 –
When working with finite groups then we have the following beautiful:
Proof : On the one hand L2 (G) clearly has a basis of delta-functions δg so the dimension
is |G| on the other hand End(V (µ) ) has dimension n2µ
This fact is extremely useful: If you are trying to list the dimensions of the irreps of a
finite group it is always a very useful check to verify (11.161). ♣SHOULD WE
CHANGE D TO M
FOR MATRIX
Example 1: Let G = Z2 = {1, σ} with σ 2 = 1. Then the general complex valued-function ELEMENTS
BELOW? ♣
on G is specified by two complex numbers (ψ+ , ψ− ) ∈ C2 :
(Here and in the next examples when working with 1×1 matrices we drop the µν subscript!)
The reader can check they are orthonormal, and they are complete because any function
Ψ can be expressed as:
ψ+ + ψ− + ψ+ − ψ− −
Ψ= M + M (11.165)
2 2
Note that if V is any representation of Z2 with nontrivial element represented by T (σ)
then we can form orthogonal projection operators P± = 21 (1 ± ρ(σ)) onto direct sums of
isotypical components. Note that we can write these projectors as:
Z
P± = (M ± (g))∗ T (g) (11.166)
G
– 178 –
mj
M (m) (ω j ) = ω mj = e2πi n (11.167)
The decomposition of a function Ψ on the group G is known as the discrete Fourier trans-
form: If Ψ : Zn → C is any function we can write it as
X
Ψ= Ψ̂m M (m) (11.169)
m
Z
Ψ̂m = (M (m) (g))∗ Ψ(g)dg (11.170)
Zn
Example 3: The theorem applies to all compact Lie groups. For example, when G =
U (1) = {z||z| = 1} then the invariant measure on the group is just −i dz dθ
z = 2π where
z = eiθ : Z 2π
dθ
hΨ1 , Ψ2 i = (Ψ1 (θ))∗ Ψ2 (θ) (11.171)
0 2π
Now, again since G is abelian the irreducible representations are 1-dimensional and the
unitary representations are (ρn , Vn ) where n ∈ Z, Vn ∼
= C and
ρn (z) := z n (11.172)
Now, the orthonormality of the matrix elements is the standard orthonormality of einθ and
the Peter-Weyl theorem specializes to Fourier analysis: An L2 -function Ψ(θ) on the circle
can be expanded in terms of the matrix elements of the irreps:
X
Ψ= Ψ̂n M (n) (11.173)
Irreps ρn
We stress that the Peter-Weyl theorem applies to all compact groups, not just Abelian
ones. For this reason it is an important defining subject of the subject of “nonabelian
Fourier analysis.” Here is a simple nonabelian example:
Example 4: So far all our examples have been Abelian groups. Let us consider G = S3 . It
has order 6 so that L2 (G) is a six-dimensional vector space. So far, we have discussed three
different irreps of dimensions 1, 1, 2. Are there any others? Note that 6 = 12 + 12 + 22 . So
we conclude that there are no other irreps. The matrix elements are:
1. The trivial representation: M + (g) = 1 for all g ∈ S3
– 179 –
2. The sign representation:
and so on for the other 3 matrix elements in the 2-dimensional representation. The reader
should check that the function M112 is orthogonal to the functions M + and M − and that
√ 2 has unit norm.
2M11
Remark: This leaves the separate question of actually constructing the representations
of the finite group G. Until recently there has been no general algorithm for doing this.
Recently there has been a claim that it can be done. See
Vahid Dabbaghian-Abdoly, ”An Algorithm for Constructing Representations of Finite
Groups.” Journal of Symbolic Computation Volume 39, Issue 6, June 2005, Pages 671-688
Exercise How Many One Dimensional Represenations Does An Arbitrary Finite Group
Have?
Show that the number of distinct one-dimensional representations of a finite group G
is the same as the index of the commutator subgroup [G, G] in G. 109
– 180 –
b.) Show that, for any vector ψ ∈ V we have
nµ
T (h)Pijµ ψ µ µ
X
= Mki (h)Pkj ψ (11.178)
k=1
This beautiful fact is extremely powerful. For example, as we have seen, any represen-
tation (T, V ) of G is completely reducible so
V ∼
= ⊕µ aµ V (µ) (11.182)
Moreover, by using the characters we can easily determine the decomposition into
isotypical components.
Recall that a class function on G is a function f : G → C whose value only depends on
the conjugacy class: f (g) = f (hgh−1 ) for all g, h ∈ G. The class functions form a subspace
L2 (G)class ⊂ L2 (G). The characters of the irreps provide an ON basis:
Theorem. {χµ } is an ON basis for the vector space of class functions L2 (G)class .
– 181 –
Proof : This is a corollary of the orthogonality relations for matrix elements. Any
function f ∈ L2 (G) be expanded
X µ µ
f (g) = fˆij Mij (g) (11.184)
µ,i,j
X fˆµ
= ii
χµ (g)
nµ
µ,i
We already knew that the χµ were ON and now we see that they span the subpsace of
class functions. ♠
Theorem The number of conjugacy classes of G is the same as the number of irreducible
representations of G.
Since the number of representations and conjugacy classes are the same we can define
an r × r array known as a character table:
m 1 C1 m 2 C2 ··· ··· mr Cr
χ1 χ1 (C1 ) ··· ··· ··· χ1 (Cr )
χ2 χ2 (C1 ) ··· ··· ··· χ2 (Cr )
(11.188)
··· ··· ··· ··· ··· ···
··· ··· ··· ··· ··· ···
χr χr (C1 ) ··· ··· ··· χr (Cr )
– 182 –
Here mi denotes the order of Ci .
Generally speaking, most of what you typically want to know about a group is con-
tained in its character table.
For a finite group we can rewrite the orthogonality relations of the characters, equation
(11.181), more explicitly as:
1 X
mi χµ (Ci )χν (Ci )∗ = δµν (11.189)
|G|
Ci ∈C
The proof of (11.190) is very elegant: Note that equation (11.189) can be interpreted as
the statement that the r × r matrix
r
mi
Sµi := χµ (Ci ) µ = 1, . . . , r i = 1, . . . , r (11.191)
|G|
satisfies
r
X
∗
Sµi Sνi = δµν (11.192)
i=1
Therefore, Sµi is a unitary matrix. The left-inverse is the same as the right-inverse, and
hence we obtain (11.190).
Example 1: For the simplest group there are two conjugacy classes C1 = [1] and C2 =
[(12)]. They both have cardinality mi = 1. There are two irreps 1+ and the sign represen-
tation 1− . We thus have the character table
[1] [(12)]
1+ 1 1
1− 1 −1
– 183 –
[1] 3[(12)] 2[(123)]
1+ 1 1 1
1− 1 −1 1
2 2 0 −1
Now let us see how we can use the orthogonality relations on characters to find the decom-
position of a reducible representation.
Example 1 Consider the 3 × 3 rep generated by the \ action of the permutation group S3
on R3 . We’ll compute the characters by choosing one representative from each conjugacy
class:
100 010 010
1 → 0 1 0 (12) → 1 0 0 (132) → 0 0 1 (11.193)
001 001 100
From these representatives the character of V = R3 is easily calculated:
1 3 2
a1+ = (χ1+ , χ) = 3 + 1 + 0 = 1 (11.195)
6 6 6
1 3 2
a1− = (χ1− , χ) = 3 + (−1) · 1 + 0 = 0 (11.196)
6 6 6
1 3 2
a2 = (χ2 , χ) = 3 · 2 + 0 · 1 + (−1) · 0 = 1 (11.197)
6 6 6
Therefore:
χV = χ1+ + χ2 (11.198)
showing the decomposition of R3 into irreps, and confirming what we showed above.
χV ⊗2 (1) = d2
(11.199)
χV ⊗2 ((12)) = d
– 184 –
To check the second line we choose a basis {vi } for V so that {vi ⊗ vj } is a basis for V ⊗ V .
In this basis we have (12) · vi ⊗ vj = vj ⊗ vi and hence only the basis elements vi ⊗ vi
contribute to the trace. Therefore, we compute the degeneracies of the trivial and sign
representation from:
1
a1+ = hχ+ , χV ⊗2 i = (χV ⊗2 (1) + χV ⊗2 ((12)))
2
1
= d(d + 1)
2 (11.200)
1
a1− = hχ− , χV ⊗2 i = (χV ⊗2 (1) − χV ⊗2 ((12)))
2
1
= d(d − 1)
2
so we have isotypical decomposition:
1 1
V ⊗2 = d(d + 1)1+ ⊕ d(d − 1)1− (11.201)
2 2
General elements of V ⊗2 are of the form Tij vi ⊗vj and the components are often referred to
as 2-index covariant tensors. The above decomposition is a decomposition into symmetry
types of tensors: We can choose a basis of symmetric tensors:
1
(ei ⊗ ej + ej ⊗ ei ) 1≤i≤j≤d (11.202)
2
and anti-symmetric tensors
1
(ei ⊗ ej − ej ⊗ ei ) 1≤i<j≤d (11.203)
2
Example 3: The previous example is the beginning of a very beautiful story called Schur-
Weyl duality. Let us consider the next case: Consider S3 acting by permuting the various
factors in the tensor space V ⊗ V ⊗ V for any vector space V . Now, if dimV = d then we
have
χ([1]) = d3
χ([(ab)]) = d2 (11.204)
χ[(abc)] = d
– 185 –
Thus, as a representation of S3 , we have
d(d + 1)(d + 2) d(d − 1)(d − 2) d(d + 1)(d − 1)
V ⊗3 ∼
= 1+ ⊕ 1− ⊕ 2 (11.208)
6 6 3
Note that the first two dimensions are those of S 3 V and Λ3 V , respectively, and that the
dimensions add up correctly. It is not supposed to be obvious, but it turns out that the
last summand corresponds to tensor of mixed symmetry type that satisfy
together with
Tijk = −Tkji (11.210)
Why this is so, and the generalization to the Sn action on V ⊗n is best discussed in the
context of Young diagrams, representation theory of Sn and Schur-Weyl duality. See section
*****
Exercise
Show that right-multiplication of the character table by a diagonal matrix produces a
unitary matrix.
Exercise
Using the orthogonality relations on matrix elements, derive the more general relation
on characters:
1 X δµν (ν)
χµ (g)χν (g −1 h) = χ (h) (11.211)
|G| nµ
g∈G
– 186 –
so the trace in this basis is just:
X
TrRG [L(g1 ) ⊗ R(g2 )] = χµ (g1 )∗ χµ (g2 ) (11.214)
µ
On the other hand, we can use the delta-function basis: δg . Note that
So in the delta function basis we get a contribution of +1 to the trace iff g = g1 gg2−1 , that
is iff g2 = g −1 g1 g, that is iff g1 and g2 are conjugate, otherwise we get zero.
Now when g1 and g2 are conjugate we might as well take them to be equal. Then the
functions δg contributing to the trace are precisely those with g ∈ Z(g1 ). But then
|G| |G|
|Z(g1 )| = = (11.216)
|C(g1 )| m1
Prove this again by viewing L2 (X) as a G-representation and using the orthgonality rela-
tions on characters. 110
Exercise
Write out the unitary matrix Sµi for G = S3 . 111
110
Answer : Note that one basis for L2 (X) is given by δx . Here g · δx = δg·x so |X g | is just the character of
g in this representation. Therefore the average number of fixed points is the degeneracy of the trivial repre-
sentation in the isotypical decomposition. On the other hand, if f : X → C is invariant under the g action
then f takes the same value on any two x1 , x2 in the same G-orbit. Therefore, the subspace corresponding
to the trivial representation in the isotypical decomposition has a basis given by the characteristic function
for each orbit.
111
Answer : The unitary matrix Sµi for S3 is:
1 1 1
√ √ √
6 2 3
Sµi = √16 − √12 √13 (11.218)
2 1
√
6
0 − √3
– 187 –
Exercise
a.) Suppose we tried to define a representation of S3 by taking (12) → 1 and (23) → −1.
What goes wrong? ♣this is really
about generators
b.) Show that for any n there are only two one-dimensional representations of Sn and relations and
should be moved
up. ♣
Therefore, given a function ψ ∈ L2 (S), we can define its Fourier transform, quite generally,
as Z
ψ̂(χ) := χ(s)∗ ψ(s)ds (11.221)
S
– 188 –
This is a result known as either the Plancherel theorem or the Parseval theorem. 112
An important special case of the above is the case S = Γ ⊂ Rn , an embedded lattice.
Then we have: X
χk̄1 (γ)∗ χk̄2 (γ) = δΓ̂ (k̄1 − k̄2 ) (11.223)
γ∈Γ
Γ̂ = Rn /Γ∨ (11.224)
This is one version of the Poisson summation formula. Since the PSF is an important
result we will unpack this a bit in the next section.
exists. Then F (x) is clearly periodic of period one and therefore defines a function F :
R/Z → C. Since R/Z ∼ = U (1) we can decompose in terms of irreducible representations:
X
F (x) = F̂ (ρm )e2πimx (11.229)
ρm
Z 1
F̂ (ρm ) = e−2πimt F (t)dt (11.230)
0
Now note that
Z 1 Z 1 X
−2πimt
e F (t)dt = e−2πimt f (m + t)dt
0 0 m
(11.231)
Z +∞
−2πimt
= e f (t)dt
−∞
112
The original statements by Plancherel and Parseval concerned special cases and the two terms are not
consistently used in the literature.
– 189 –
Putting x = 0 we learn that for suitably rapidly decaying functions:
X X
f (m) = fb(w) (11.232)
m∈Z w∈Z
3. The Fourier transform fb decays rapidly enough so that the sum on the RHS converges.
Note that this can also be understood as a special case of the orthogonality relations
for characters on Z:
0
X X
e2πint e−2πint = δR/Z (t − t0 ) = δR (t − t0 − k) (11.234)
nZ k∈Z
That is:
X X
f (~v ) = fˆ(~l) . (11.236)
~v ∈Γ ~l∈Γ∨
Remarks
1. Since people have different conventions for the factors of 2π in Fourier transforms
it is hard to remember the factors of 2π in the PSF. The equation (11.234) has no
factors of 2π. One easy way to see this is to integrate both sides from t = −1/2 to
t = +1/2.
2. One application of this is the x-ray crystallography: The LHS is the sum of scattered
waves. The RHS constitutes the bright peaks measured on a photographic plate.
– 190 –
3. Another application is to analytic number theory. If τ is in the upper half com-
plex plane, and θ, φ, z are complex numbers define the Riemann theta function with
characteristics θ, φ
θ X 2
ϑ[ ](z|τ ) := eiπτ (n+θ) +2πi(n+θ)(z+φ) (11.237)
φ
n∈Z
Exercise
a.) Show that
r
X
−πan2 +2πibn 1 X − π(m−b)2
e = e a (11.239)
a
n∈Z m∈Z
11.11 The Finite Heisenberg Group And The Quantum Mechanics Of A Par-
ticle On A Discrete Approximation To A Circle
*******************************
SOME OF THE MATERIAL IN THIS SUBSECTION IS NOW REDUNDANT WITH
MATERIAL ABOVE AND NEEDS TO BE REMOVED
*******************************
It is very illuminating to interpret the group HeisN in terms of the quantum mechanics
of a particle on a discrete approximation to a circle. ♣This sub-section
assumes some
Recall that if G acts on a set X then it acts on the functions F[X → Y ] for any Y . knowledge of linear
algebra and
Moreover, G always acts on itself by left-translation. Let us apply this general idea to quantum mechanics
explained in chapter
G = ZN , thought of as the N th roots of unity and Y = C, the complex numbers. So we are 2. ♣
studying complex-valued functions on the group G. We can picture the group as a discrete
set of points on the unit circle so we can think, physically, of F[X → Y ] as the space of
wavefunctions of a particle moving on a discrete approximation to a circle.
Now, as a vector space it is clear that F[ZN → C] is isomorphic to CN . To specify
a function is to specify the N different complex values Ψ(ω k ) where ω is a primitive N th
root of one, say, ω = exp[2πi/N ] for definiteness, and k = 0, . . . , N − 1. (We will not try to
normalize our wavefunctions Ψ, but we could. It would make no difference to the present
considerations.)
– 191 –
Another way to see we have an isomorphism is to choose a natural basis, the delta-
function basis:
δj (ω k ) = δj̄,k̄ (11.240)
where j̄, k̄ ∈ Z/N Z, viewed additively. So our isomorphism is δj 7→ ~ej . Put differently,
every wavefunction can be uniquely expressed as
N
X −1
Ψ= zj δj (11.241)
j=0
and we have normalized the measure so that the group has “volume 1.” This is an example
of an important idea called a Haar measure that we will discuss more later.
Now, recall the general definition from section 4.1. G acts naturally on X (which
happens to be G itself) by left-multiplication. The induced action of G on the complex-
valued functions on G in this case is such that the generator ω of ZN acts on the space of
functions via:
φ̃(ω, Ψ)(ω k ) := Ψ(φ(ω −1 , ω k ))
(11.244)
= Ψ(ω k−1 )
So the generator ω of the group ZN acts linearly on the functions F[ZN → C]. We call
this linear operator P . We can therefore rewrite (11.244) as
Note that with respect to the inner product (11.242) P is clearly a unitary operator.
The operator P can be viewed as translation operator around the discrete circle by
one step in the clockwise direction. Recall that in the quantum mechanics of a particle on
the line translation by a distance a is
This equation makes sense also for a particle on the circle, that is, with x, a considered
periodically. So our P is T (a) for translation by 2π/N times around the circle clockwise.
– 192 –
Remark: In the quantum mechanics of a particle on a line or circle we could also
write
d
(T (a) · Ψ)(x) = Ψ(x − a) = (exp[iap̂])Ψ(x) = (exp[−a ] · Ψ)(x) (11.247)
dx
so the momentum operator p̂ generates translations. In the finite Heisenberg group there is
no analog of the infinitesimal translations generated by p̂, but only of a finite set of discrete
translations.
Q ◦ P = ωP ◦ Q (11.251)
Given a linear transformation and ordered bases for domain and range we can associate
a matrix. (See Chapter two for details if you do not know this.) Now, let us choose the
ordered basis δ1 , . . . , δN . Then we easily compute
P · δj = δj+1 (11.252)
and therefore the matrix for P relative to the basis {δj }, is the matrix with matrix elements
so, for N = 3 it is
001
P = 1 0 0 (11.254)
010
Similarly, in the basis δj we have
Qi,j = ω j δi,j (11.255)
and since j = 0, 1, . . . , N − 1 we have for N = 3:
1 0 0
Q = 0 ω 0 (11.256)
0 0 ω2
– 193 –
Thus, we have recovered the N × N clock and shift matrices we discussed above. The
group of unitary operators generated by discrete position and translation operators is the
finite Heisenberg group.
It is interesting to study the operators P and Q in a different basis. We can introduce
a “plane wave basis” of functions Ψj ∈ H, with j = 0, . . . , N − 1 defined by
Ψj (ω k ) = ω jk (11.257)
P · Ψj = ω −j Ψj
(11.258)
Q · Ψj = Ψj+1
The roles of P and Q have been exchanged! P (as is P −1 ) is now representated by “clock
matrix” and Q is represented by a “shift matrix”. Indeed, what we have done is perform
a transformation from a position representation to a momentum representation in the
language of quantum mechanics.
The basis Ψj represents a very general and beautiful fact: The Ψj are actually group
homomorphisms Ψj : ZN → U (1) ⊂ GL(1, C) because
Ψj (ω k1 ω k2 ) = Ψj (ω k1 ) · Ψj (ω k2 ) (11.259)
L2 (G) ∼
might want to leave
= ⊕j (dimVj )Vj (11.260) this out and bring
back the example in
those later sections.
where Vj runs over the distinct irreducible representations. This is part of an important ♣
general theorem about compact Lie groups known as the Peter-Weyl theorem.
Moreover, the exchange of clock and shift matrices by passing from the δj basis to the
basis of characters is again an instance of a general fact:
Let Gb be the set of homomorphisms χ : G → U (1). For an Abelian group G the set G b
is also a group, using the product law:
The reader should be able to check that this defines a group law on G.
b The group G
b is
known as the Pontryagin dual group
For reasonable 114 Abelian groups we have
bb ∼
G =G (11.262)
113
See Chapter four for a detailed discussion of the idea of reducible and irreducible representations. In
brief, a representation ρ : G → GL(V ) is reducible if there is a nonzero linear subspace W ⊂ V which is
preserved by all the operators ρ(g). Note that one-dimensional representations are trivially irreducible.
114
e.g. locally compact Abelian groups. See the book by Kirillov on representation theory.
– 194 –
If χ is a homomorphism we can define the Fourier transform
1 X
Ψ(χ)
b := p χ(g)Ψ(g) (11.263)
|G| g∈G
These equations are special cases of the famous orthogonality relations for the matrix
elements of irreducible representations of compact groups.
Now, for G = ZN we have G b∼ = ZN . The passage from the δj basis to the Ψj basis,
which diagonalizes P is just the finite Fourier transform.
In more concrete terms: The trace of all the powers of P less than N is also obviously
zero and P N = 1 and no smaller power of P is the identity. So P must be unitarily
equivalent to Q. Now we can easily check that
SP S −1 = Q (11.266)
– 195 –
Exercise Orthogonality For ZN
Write out the equations (11.265) for the case G = ZN
Vµ⊗Vν ∼
= ⊕λ HomG (V λ , V µ ⊗ V ν ) ⊗ V λ (11.270)
The dimensions
λ
Nµν := dimκ HomG (V λ , V µ ⊗ V ν ) (11.271)
are known as the fusion coefficients and they give the isotypical decomposition:
µνN λ times
z }| {
V (µ) ⊗ V (ν) = ⊕λ V (λ)
⊕ ··· ⊕ V (λ)
(11.272)
λ
= ⊕λ Nµν V (λ)
Recall that
χV1 ⊗V2 (g) = χV1 (g)χV2 (g) (11.273)
Therefore, taking the trace of (11.272) we get a formula for the fusion coefficients:
X
λ
χµ (g)χν (g) = Nµν χλ (g) (11.274)
λ
λ
Nµν = hχλ , χµ χν i (11.275)
There is a very beautiful explicit formula for the fusion coefficients in the case of finite
groups. In this case we can write:
1 X
Nµνλ
= χµ (g)χν (g)χλ (g −1 ) (11.276)
|G|
g∈G
– 196 –
This can be written in a different way. For simplicity choose unitary irreps, and recall
that the orthogonality relations on characters is equivalent to the statement that
r
mi
Sµi := χµ (Ci ) µ = 1, . . . , r i = 1, . . . , r (11.277)
|G|
is a unitary q
matrix. Let 1 denote the trivial representation or the trivial conjugacy class.
mi
Then S1i = |G| . Thus we can write
X Sµi Sνi S ∗
λ λi
Nµν = (11.278)
S1i
i
Example 1: Consider the irreps ρm of Z/N Z. We then have the fusion rules:
ρm ⊗ ρn ∼
= ρm+n (11.279)
Recall that ρm only depends on the integer mmodN . The S-matrix above is the finite
Fourier transform matrix: s
1 2πiµi/N
Sµi = e (11.280)
|G|
from which one easily verifies (11.278).
Remarks
Note that we can decompose the triple product V µ ⊗ V ν ⊗ V λ in two natural ways
leading to the isomorphism
ρ κ ∼ κ ρ
⊕ρ Vµν ⊗ Vρλ = ⊕ρ Vµρ ⊗ Vνλ (11.282)
If we choose bases for the spaces of intertwiners then the components of the isomor-
phism relative to these bases are called fusion matrices and they satisfy some nice
identities. ♣Explain pentagon
and hexagon? ♣
2. The considerations of this section lead rather naturally to the beautiful subject of
Frobenius algebras and 2d topological quantum field theory. See Chapter 4 section
**** for more about this.
– 197 –
3. The various identities satisfied by the fusion matrix and the S-matrix have remarkable
analogues in the subject of 2d rational conformal field theory. For more about this
see:
a.) G. Moore and N. Seiberg, “Classical and Quantum Conformal Field Theory,”
Commun. Math. Phys. 123(1989)177
b.) G. Moore and N. Seiberg, “Lectures on Rational Conformal Field Theory,” in
Strings ’89,Proceedings of the Trieste Spring School on Superstrings, 3-14 April 1989,
M. Green, et. al. Eds. World Scientific, 1990. Available on G. Moore’s home page.
c.) Philippe Di Francesco, Pierre Mathieu, David Senechal, Conformal Field Theory,
Springer
d.) Jurgen Fuchs, Affine Lie Algebras And Quantum Groups ♣Need to add other
references ♣
V+⊗Vµ ∼
=Vµ
V−⊗V− ∼
=V+
(11.283)
V−⊗V2 ∼
=V2
∼V+⊕V−⊕V2
V2⊗V2 =
X
∗ λ Sµj
Sνi Nµν Sλj = δij (11.285)
S1j
ν,λ
– 198 –
11.13 Induced Representations
Let G be a group and H a subgroup. Suppose that ρ : H → Aut(V ) is a representation
of the subgroup H. Using this data we are going to produce, canonically, a representation
of G known as an induced representation. Note that, in general, there is no way to extend
ρ : H → Aut(V ) to a homomorphism ρ : G → Aut(V ). As a counterexample, we will see
later that the only one-dimensional representation of SU (2) is the trivial representation.
However, there are nontrivial representations of the subgroup of diagonal matrices. So, in
general, a representation ρ : H → Aut(V ) of a subgroup H ⊂ G is not just the restriction
of a representation of G on V .
Then, as we have seen, Map(G, V ) is canonically a G × H-space. The left-action of
G × H is defined by declaring that for (g, h) ∈ G × H and Ψ ∈ Map(G, V ) the new function
φ((g, h), Ψ) ∈ Map(G, V ) is the function G → V defined by:
φ((g, h), Ψ)(g0 ) := ρ(h) · Ψ(g −1 g0 h) (11.287)
for all g0 ∈ G. Or, in slightly lighter notation:
(g, h) · Ψ(g0 ) := ρ(h)Ψ(g −1 g0 h) (11.288)
Now, we can consider the subspace of functions fixed by the action of 1 × H. That is,
we consider functions which satisfy
– 199 –
1. The irreducible unitary representations of space groups in condensed matter physics.
Example: Let us take V = C with the trivial representation of H, i.e. ρ(h) = 1. Then
the induced representation is the vector space of functions on G which are invariant under
right-multiplication by H. This is precisely the vector space of C-valued functions on the
homogeneous space G/H. Recall that for G = SU (2) and H = U (1) we have seen that
G/H = CP1 ∼ = S 2 so we can get a nice basis of orthogonal functions on S 2 from the
functions on SU (2). These are called spherical harmonics and we will discuss them more
below.
Exercise H-Equivariance
Show that a function Ψ → V satisfying (11.289) fits in a commutative diagram:
G
Ψ /V (11.293)
R(h) ρ(h−1 )
G
Ψ /V
we can therefore form the quotient space of orbits. In this case it is usually denoted
G ×H V , but it is just the set of equivalence classes under the above right H-action. There
is a natural map
π : G ×H V → G/H (11.295)
115
Answer (F (A)(Ψ))(g) = A(Ψ(g)).
– 200 –
given by π : [(g, v)] 7→ gH. Referring back to the discussion of equation (8.92) we see that
this is the associated bundle to the principal H bundle π : G → G/H.
When G, H are Lie groups and ρ is a continous representation the map π is continuous.
Moreover, the fiber above any coset gH is the vector space V . We therefore have an
example of a vector bundle over G/H with fiber V . The sections of the vector bundle are, ♣WE SHOULD
EXPLAIN THE
by definition, continuous maps NOTION OF
SECTIONS IN THE
DISCUSSION OF
s : G/H → G ×H V (11.296) BUNDLES ABOVE.
♣
for some vector v(g) associated to g. But now gH = g̃H when g̃ = gh for h ∈ H. So it
must be that
[(g, v(g))] = s(gH) = s(g̃H) = [(g̃, v(g̃)] = [(gh, v(g̃)] = [(g, ρ(h)v(g̃))] (11.298)
and hence
This must hold for all g ∈ G, and hence g 7→ v(g) is an equivariant function: Thus,
the space of sections of the homogeneous vector bundle π : G ×H V → G/H is canoni-
cally identified with the space of H-equivariant functions G → V satisfying (11.289).
(We denoted this by IndGH before but H, G will be fixed in what follows so we simplify the
notation.) On the other hand, there is an obvious functor going the other way, since any
G-rep W is a foriori an H-rep, by restriction. Let us denote this “restriction functor”
How are these two maps related? The answer is that they are “adjoints” of each other!
This is the statement of Frobenius reciprocity:
– 201 –
HomG (W, Ind(V )) = HomH (R(W ), V ) (11.302)
We can restate the result in another way which is illuminating because it helps to
answer the question: How is IndG
H (V ) decomposed in terms of irreducible representations
of G? Let Wα denote the distinct irreps of G. Then Schur’s lemma tells us that
IndG ∼ G
H (V ) = ⊕α Wα ⊗ HomG (Wα , IndH (V )) (11.303)
But now Frobenius reciprocity (11.302) allows us to rewrite this as
IndG ∼
H (V ) = ⊕α Wα ⊗ HomH (R(Wα ), V ) (11.304)
where the sum runs over the unitary irreps Wα of G, with multiplicity one.
The statement (11.304) can be a very useful simplification of (11.303) if H is “much
smaller” than G. For example, G could be nonabelian, while H is abelian. But the
representation theory for abelian groups is much easier! Similarly, G could be noncompact,
while H is compact. etc.
In order to prove (11.304) we note that it is equivalent (see the exercise below) to the
statement that the character of IndG
H (V ) is given by
X
χ(g) = χ̂(x−1 gx) (11.305)
x∈G/H
where x runs over a set of representatives and χ̂ is the character χV for H when the
argument is in H and zero otherwise.
On the other hand, (11.305) can be understood in a very geometrical way. Think of
the homogeneous vector bundle G ×H V as a collection of points gj H, j = 1, . . . , n with
a copy of V sitting over each point. Now, choose a representative gj ∈ G for each coset.
Having chosen representatives gj for the distinct cosets, we may write:
Geometrically, this is a section whose support is located at the point gi H. The equivariant
function is then given by
ψi,a (gj h) := ρ(h−1 )va δi,j (11.308)
– 202 –
Now let us compute the action of g ∈ G in this basis:
Fortunately, we are only interested in the trace of this G-action. The first key point is
that only the fixed points of the g-action on G/H contribute. Note that the RHS above
is supported at j = g · i, but if we are taking the trace we must have i = j. But in
this case ggi = gi h(g, i) and hence g −1 gi = gi h(g, i)−1 so for fixed points we can simplify
h(g −1 , i) = h(g, i)−1 , and hence when we take the trace the contribution of a fixed point
ggi H = gi H is the trace in the H-rep of h(g, i) = gi−1 ggi , as was to be shown ♠
(23)
g H
1
(123) (123)
(13) (12)
g H (23) g H
3 2
(123)
(12) (13)
Figure 32: The left action of G = S3 on G/H. In fact, this picture should be considered as a
picture of a category, in this case, a groupoid.
Remark: The Ind map does not extend to a ring homomorphism of representation
rings.
– 203 –
Example A simple example from finite group theory nicely illustrates the general idea.
Let G = S3 be the permutation group. Let H = {1, (12)} ∼ = Z2 be a Z2 subgroup. G/H
consists of 3 points. The left action of G on this space is illustrated in (??).
There are two irreducible representations of H, the trivial and the sign representation.
These are both 1-dimensional. Call them V (), with = ±. Accordingly, we are looking
at a line bundle over G/H and the vector space of sections of G ×H V () is 3-dimensional.
A natural basis for the space of sections is given by the functions which are “δ-functions
supported at each of the three points”:
si (gj H) = δij
g1 H = (13)H = {(13), (123)}
(11.310)
g2 H = (23)H = {(23), (132)}
g3 H = (12)H = {1, (12)}
These sections correspond to equivariant functions on the total space. The space of all
functions F : G → R is a six-dimensional vector space. The equivariance condition:
F (12) = F (1)
F (123) = F (13) (11.311)
F (132) = F (23)
cuts this six-dimensional space down to a three-dimensional space.
We can choose a basis of equivariant functions by choosing 3 representatives g1 , g2 , g3
for the cosets in G/H and setting F i (gj ) = δij . Using such a basis the representation of
the group is easily expressed as a permutation representation.
In our example of G = S3 it is prudent to choose g1 = (13), g2 = (23), g3 = (12) so
that
(12)g1 = g2 (12)
(12)g2 = g1 (12)
(12)g3 = g3 (12)
(13)g1 = g3 (12)
(13)g2 = g2 (12) (11.312)
(13)g3 = g1 (12)
(23)g1 = g1 (12)
(23)g2 = g3 (12)
(23)g3 = g2 (12)
From this one easily gets the induced representation
0 0
ρind (12) = 0 0 (11.313)
00
– 204 –
00
ρind (132) = 0 0 (11.314)
0 0
and so forth.
Now let us look at Frobenius reciprocity. The irreducible representations of G are
W () defined by (ij) → , and W2 defined by the symmetries of the equilateral triangle,
embedded into O(2):
!
−1 0
ρ2 (12) =
0 1
√ ! (11.315)
−√12 23
ρ2 (123) =
− 23 − 12
As H = Z2 representations, we have W () ∼
= V () and
W2 ∼
= V (+1) ⊕ V (−1) (11.316)
Therefore
HomH (W (), V (0 )) = δ,0 R
(11.317)
HomH (W2 , V ()) = R
By Frobenius reciprocity we have
To find out how to decompose the representation IndSS32 (V ()) in terms of G = S3 irreps
it suffices to compute the character for g = (12) and g = (123). Now, g = (12) has exactly
one fixed point, namely g3 H and h(g, 3) = (12) for this element. Therefore,
On the other hand, g = (123) clearly has no fixed points, and therefore the character is
zero. It follows immediately that we have the decomposition (11.318).
********************
MORE MATERIAL IN GTLect4-IntroRepTheory-2020
*******************
************************************************
– 205 –
11.14 Representations Of SU (2)
11.14.1 Homogeneous Polynomials
We now use the idea of induced representations to construct representations of SU (2). In
fact, we will construct all the irreducible representations. ♣Using ρ rather
than T for
We take G = SU (2) and we take the subgroup H ⊂ SU (2) to be a “maximal torus” representations
here. Uniformize?
namely the subgroup of diagonal SU (2) matrices. We will denote it by T . As a group ♣
T ∼
= U (1). For V we choose an irreducible representation ρk of U (1) where k ∈ Z and the ♣This conflicts with
our notation T for
carrier space is V ∼= C. Thus we choose the one-dimensional representation of T defined the homomorphism
in a rep. So we use
by: !! ρ here. ♣
z 0
ρk := z k (11.321)
0 z −1
We can abbreviate !
u −v̄
F( ) ⇒ F (u, v) (11.324)
v ū
so we just have
F (ueiθ , veiθ ) = e−ikθ F (u, v) (11.325)
Of course g ∈ SU (2) implies |u|2 + |v|2 = 1. So we can also view this as the space of
equivariant functions S 3 → C for the right-action of U (1).
Actually, we don’t want all functions - that space is too big. One useful subspace to
restrict to is L2 (S 3 ).
The induced representation is infinite dimensional and very far from being irreducible.
For example, the functions uk |u|` for ` any positive integer ≥ −k are all smooth functions
on SU (2) and are independent vectors in L2 (SU (2)).
It turns out we can cut down the ρk -equivariant functions L2 (SU (2)) down to a finite
dimensional irrep of SU (2) by imposing one more condition. It is a condition of holomorphy
as we now explain.
One way to view this restriction is first to view S 3 ⊂ C2 and consider instead the space
of all functions of two variables !
u
∈ C2 (11.326)
v
which are equivariant in the sense of equation (11.325). This larger space of functions
is also a representation of SU (2) and also we can restrict functions to give equivariant
– 206 –
functions on SU (2). But now we can look at the subspace of equivariant functions which
are also holomorphic functions on C2 . This will be a finite-dimensional space of functions.
Holomorphy requires that −k ≥ 0. Such homogeneous holomorphic functions must
be polynomials in u, v. It is convenient to set −k = 2j ∈ Z+ . The restriction to the
holomorphic equivariant functions gives the finite dimensional space H2j of homogeneous
polynomials in u, v of degree 2j. In physics this is called the spin-j representation for
reasons explained below. In general we will denote the isomorphism class of the spin-j
representation by Vj .
There is a second, very useful, way to think about the restriction to holomorphic
functions. Recall that:
SU (2)/U (1) ∼
= SL(2, C)/B (11.327)
Both quotients are CP1 . But the latter description is useful because we can introduce
the idea of holomorphy. Note that the representation ρk of U (1) extends uniquely to a
holomorphic representation of B:
!
z w
ρk ( ) = zk (11.329)
0 z −1
Now consider
SL(2,C)
IndB (ρk ) (11.330)
and any three matrix elements will serve as holomorphic coordinates as long as we can
solve for the fourth. However we have matrices
!
1x
∈B (11.333)
0 1
– 207 –
and hence B-equivariant functions on SL(2, C) are again functions of just the vector
!
u
∈ C2 − {0} . (11.335)
v
such that ! !
u −k u
F( z) = z F ( ) (11.336)
v v
for z ∈ C∗ . Such functions restrict to the functions on the 3-sphere |u|2 + |v|2 = 1 and a
function equivariant for B by ρk restricts to a function on SU (2), equivariant for T by ρk
since
B ∩ SU (2) = T (11.337)
SU (2)
Conversely we can use the equivariance to extend the functions in IndU (1) (ρk ) to functions
SL(2,C)
in IndB (ρk ) so they are really the same representation:
SL(2,C) SU (2)
IndB (ρk ) ∼
= IndU (1) (ρk ) (11.338)
SL(2,C)
The great advantage of the description IndB (ρk ) is that in this description it is
manifest that there is a subspace of holomorphic equivariant functions: Such functions
extend uniquely to holomorphic functions on all of C2 (by a theorem known as Hartog’s
theorem) so we are simply considering holomorphic functions on C2 . In this way we have
SL(2,C)
found a nice finite dimensional subspace of IndB (ρk ) of equivariant holomorphic func-
tions.
Since the SU (2) action is holomorphic, the restriction to the finite-dimensional sub-
space of holomorphic functions in fact forms a representation of SU (2). It is the space of
homogeneous polynomials of degree 2j = −k in u, v. (The space is empty if k > 0.)
A basis for H2j is
we compute the matrix elements for this representation of SU (2) relative to this basis via:
– 208 –
Note that the basis f˜j,m diagonalizes the action of the diagonal SU (2) matrices with
β = 0:
g · f˜j,m = ᾱj+m αj−m f˜j,m = α−2m f˜j,m (11.342)
That is,
j
D̃m L ,mR
(g) = δmL ,mR α−2mR (11.343)
when β = 0.
j
More generally, we can derive an explicit formula for the matrix elements D̃m 0 m (g)
as functions on SU (2) by expanding out the two factors in (11.341) using the binomial
theorem and collecting terms:
j
X j + mj − m
D̃m0 m (g) = ᾱs αj−m−t β̄ j+m−s (−β)t (11.344)
0
s t
s+t=j+m
a
Here the sum is over integers s ≥ 0 and t ≥ 0 and we recall that b = 0 when b > a so
this is a finite sum. We recover the functions f˜m , up to scale from
j j−m 2j
D̃−m,−j (g) = (−1) αj+m β j−m (11.345)
j−m
We claim that the representations H2j are irreducible, and moreover give a full set
of irreducible representations of SU (2). We will prove this using characters in the next
section.
Remarks
– 209 –
When k is nonpositive we can set k = −2j0 and the span of the set of functions
j0
Dm L ,−j0
where m ranges over −j0 , −j0 + 1, . . . , j0 − 1, j0 transforms under the left
regular representation in a representation isomorphic to Vj . These are the func-
tions with a holomorphic interpretation. When k is positive there is no holomorphic
interpretation.
j
3. It is instructive to evaluate the functions D̃m 0 m (g) where g is parametrized by Euler
angles. We find:
j
D̃m L ,mR
(g) = e−iφmL P̃m
j
L ,mR
(θ)e−iψmR (11.348)
j
where P̃m L ,mR (θ) is a polynomial in cos(θ/2) and sin(θ/2):
X j + mR j − mR
j
P̃m L ,mR
(θ) 2j+mL +mR
= (−i) (−1)s
(cos θ/2)j−mR +s−t ((sin θ/2)j+mR +t−s
s t
s+t=j+mL
(11.349)
j
It is closely related to an associated Legendre polynomial and the functions D̃m 0 m are
closely related to Wigner functions. We will explain more about this below.
4. Looking ahead to the relation to Lie algebras, diagonal SU (2) matrices can be written
as g = exp[−iσ 3 φ] and then
g · f˜m = ei2mφ f˜m (11.350)
so the basis f˜m is proportional to the physicist’s basis |j, mi. Note that
!
eiφ
i 3
= exp[−2φ − σ ] (11.351)
e−iφ 2
b.) Show that for j = 1, relative to the ordered basis {f˜1 , f˜0 , f˜−1 } we have
ᾱ2 −ᾱβ β2
D̃j=1 (g) = 2ᾱβ̄ |α|2 − |β|2 −2αβ (11.353)
β̄ 2 αβ̄ α2
– 210 –
11.14.2 Chararacters Of The Representations Vj
Every g ∈ SU (2) is diagonalizable so we can say
!
z
g ∼ d(z) := (11.354)
z −1
where |z| = 1 and both z and z −1 define the same conjugacy class 116
For the spin j representation denote the character by χj . In the basis described above
D̃j (d(z)) is diagonal and given by
Therefore
z 2j+1 − z −2j−1
χj (g) = z −2j + z −2j+2 + · · · + z 2j−2 + z 2j = (11.356)
z − z −1
Now we write the orthogonality relations for the characters. We have written the Haar
measure above in Euler angles, but this is not the most convenient form for integrating
class functions. Given the Euler angles (φ, θ, ψ) of u ∈ SU (2) it is not so evident how to
find the value of the angle ξ such that
The conjugacy class with angle ξ is a two-dimensional sphere in S 3 with radius sin ξ. Using
this we see that the properly normalized measure on SU (2) is such that on a class function
F we have:
1 2π
Z Z I
1 dz
F (u)[du] = 2
f (θ) sin θdθ = − g(z)(z − z −1 )2 (11.358)
SU (2) π 0 4πi z
where !
eiθ
f (θ) = F ( ) = g(z) (11.359)
e−iθ
with z = eiθ .
Now, the space of class functions L2 (SU (2))class can be identified with the (completion
of) the space of Laurent polynomials in z which is even under the involution z → 1/z.
This space is clearly spanned by z 2j + z −2j , and that basis is related to the χj by an upper
triangular matrix. Moreover, one easily confirms the general relation
From this we learn that we have a complete set of representations and moreover these are
all irreducible representations.
Remarks
116
d(z) = vd(z −1 )v −1 where, for example, we can take v = iσ 1 .
– 211 –
1. If we write n = 2j and z = eiθ then
sin((n + 1)θ)
χj (u) = = Un+1 (cos θ) (11.361)
sin θ
where Un+1 (x) are known as the Chebyshev polynomials of the second kind.
2. Weyl Density Formula. For more details about the following see Chapter 5, Survey
of Matrix Groups. The formula for integrating class functions with the Haar measure
has a nice analog for all the classical groups. For example, if F : SU (n) → C is a
class function then F (u) only depends on the eigenvalues of u. Write the unordered
set of eigenvalues as {eiθ1 , . . . , eiθn }. Then define
1 2π
Z Z Y Y dθi
F (u)du = f (θ1 , . . . , θn ) |eiθi − eiθj |2 (11.363)
SU (n) n! 0 2π
1≤i<j≤n i
and there are similar formulae for SO(n) and U Sp(2n). In SO(n) we can conjugate
any matrix to the form
Diag{R(θ1 ), . . . , R(θr )} (11.364)
when n = 2r is even and
2 π n
2n
Z Z Y Y Y
2
f [dg] = n (cos θj −cos θk ) sin2 (θj /2)f (θ1 , . . . , θn ) dθi
SO(2n+1) π n! −π 1≤j<k≤n j=1
(11.367)
Where we have normalized the Haar measure to have volume one. Finally, for the
unitary symplectic group U Sp(2r), any matrix can be conjugated to
– 212 –
3. Weyl-Kac Character Formula The character formula for SU (2) can be written as
w(λ+ρ)
P
−ρ w∈W (w)e
χj = e Q −α )
(11.370)
α>0 (1 − e
eλ (d(z)) = z 2j (11.371)
and the sum on w is a sum over the Weyl group, with (w) ∈ {±1} a homomorphism
to µ2 .
11.14.3 Unitarization
As we have seen, the finite-dimensional representations of SU (2) are unitarizable, and
unitarity is very important in physics and mathematics.
One might first be tempted to define a unitary structure on H2j by declaring the inner
product of two homogeneous polynomials to be the integral of ψ1∗ ψ2 . But this will not
converge. Instead we take
Z
1 2 2
hψ1 , ψ2 iH2j := ψ1 (u, v)∗ ψ2 (u, v)e−|u| −|v| d2 ud2 v (11.373)
π(2j + 1)! C2
This will give finite overlaps on H2j . Note that since du ∧ dv and |u|2 + |v|2 are SU (2)
invariant the inner product is SU (2)-invariant, i.e. it is unitary:
The reason for the funny normalization constant will be clearer below.
Exercise ON Basis
Show that an ON basis is given by
s
1 (2j + 1)!
fj,m = √ uj+m v j−m (11.375)
π (j + m)!(j − m)!
– 213 –
On the other hand we can identify CP1 with S 2 and then we can stereographically
project CP1 to C ∪ {∞}. See the exercise below for the precise formulae. The projection
from CP1 to the Riemann sphere is just:
(
z := z1 /z2 z2 6= 0
[z1 : z2 ] 7→ (11.377)
∞ z2 = 0
Since the SU (2) action on S 2 factors through the usual SO(3) rotations it is not
surprising that the round volume form is invariant under SU (2). We can verify this with
the volume form
1 idz ∧ dz̄
ω := (11.380)
2π (1 + |z|2 )2
directly since if z̃ = g · z then one easily checks:
dz
dz̃ = (11.381)
(βz + α∗ )2
1 |βz + α∗ |2
= (11.382)
(1 + |z̃|2 )2 (1 + |z|2 )2
and therefore the measure ω is invariant. Indeed, as you show in an exercise below, it is
just the usual round measure on S 2 evaluated in stereographic coordinates.
Now let us return to the space H2j of homogeneous polynomials in z1 , z2 . This vector
space is canonically isomorphic to the vector space of polynomials in z = z1 /z2 of degree
≤ 2j since we can identify a homogenous polynomial F with a polynomial p by
The map F → p defines a map H2j → P2j and we can make this an SU (2)-equivariant
map if we declare the action of SU (2) on polynomials p to be 117
∗
α z + β∗
2j
(g · p)(z) := (−βz + α) p (11.384)
−βz + α
The model for the spin j representation Vj as the space of polynomials of degree ≤ 2j
can be turned into a Hilbert space by defining
– 214 –
One checks that this is invariant under the SU (2) action so we integrate:
Z
hψ1 , ψ2 iP2j = h(ψ1 , ψ2 )ω (11.386)
C
and then
hg · ψ1 , g · ψ2 i = hψ1 , ψ2 i (11.387)
so g acts as a unitary operator on P2j in this inner product.
Using the integrals
Z ∞
xn1 Γ(n1 + 1)Γ(n2 − n1 − 1)
n
dx = − 1 < n1 & 1 < n2 − n1 (11.388)
0 (1 + x) 2 Γ(n2 )
we check that an ON basis for P2j is given by:
ψ` := N` z ` ` = 0, . . . , 2j (11.389)
s
2j
N` := (2j + 1) (11.390)
`
In physics this is usually written as:
s
(2j + 1)!
ψj,m = z j+m = Nj,m z j+m (11.391)
(j + m)!(j − m)!
Now the Wigner functions are defined by the matrix elements relative to this ON basis:
X
j
g · ψj,mR = Dm L ,mR
(g)ψj,mL (11.392)
mL
In physics notation ψj,m would usually be written as the ket vector |j, mi, the spin j
representation might be written as operators T j (g) and we would have:
j 0 j
Dm 0 m (g) := hj, m |T (g)|j, mi (11.393)
Remarks:
1. There is an isometry of Hilbert spaces between H2j with inner product h·, ·iH2j defined
in equation (11.373) above and h·, ·iP2j . Indeed, in the integral (11.373) we can change
coordinates to z = u/v and v. Then du ∧ dv = vdz ∧ dv and one can do the v integral
explicitly to show the isometry.
j j
2. To relate the Wigner function Dm 0 m to our functions D̃m0 m defined above we compute
j
Dm 0 ,m (g) := hψj,m0 , g · ψj,m i
d(r2 )
Z
0 dφ
= Nj,m0 Nj,m (z j+m )∗ (−βz + α)j−m (α∗ z + β ∗ )j+m
C 2π (1 + r2 )2j+2
Z ∞
d(r2 )
X j + m j − m s j−m−t j+m−s t 0
= Nj,m Nj,m
0 ᾱ α β̄ (−β) r2j+2m
s t 0 (1 + r2 )2j+2
s+t=j+m0
s
(j + m0 )!(j − m0 )! j
= D̃ 0 (g)
(j + m)!(j − m)! m ,m
(11.394)
– 215 –
3. As we noted before, if we induce from the trivial representation then we get functions
on G/H which in this case is G/H = S 2 . The trivial representation of U (1) is mR = 0.
In this case j = ` must be an integer. The functions
`
Dm L ,0
(g) = e−imL φ Pm
`
L
(θ) = Y`,m (θ, φ) (11.395)
are known as spherical harmonics. From what we have said they form a complete
orthonormal set of functions on S 2 and are widely used in electromagnetism and
quantum mechanics.
5. The Wigner functions have many applications in physics. As we will see, from group
theory, they satisfy some nice differential equations and hence appear in the wave-
functions of atoms. They also appear in the study of the quantum Hall effect on S 2
and in the study of wavefunctions of electrons in the field of a magnetic monopole of
magnetic charge 2j.
– 216 –
b.) Also show that the standard round metric of the unit sphere can be written as
1 |dz|2
ds2 = (11.399)
π (1 + |z|2 )2
where z = α/β.
– 217 –
If we think of CP1 as the space of equivalence classes [z1 : z2 ] then on the “patch” UN
where z2 6= 0 we can define a map
Note that the point [0 : 1] corresponds to a “point at infinity” in this mapping. Again it is
manifest that zN zS = 1 on the patch overlap US ∩ UN where both functions are defined.
Now, the same homogeneous function F could be used to define a different polynomial
or equally well
pN (zN ) = zS−2j pS (zS ) (11.412)
Equations (11.411) and (11.412) are known as gluing conditions and zS−2j and zN−2j
are
known as transition functions. We can glue together the total space of the line bundle Lk
by taking two trivial line bundles
US × C (11.413)
UN × C (11.414)
−2j
and identifying (zS , vS ) with (1/zN , zN vN ) on the patch overlaps.
Note the the function h is invariant under this description:
– 218 –
because, on patch overlaps we have:
(1 + |zS |2 )−2j |pS (zS )|2 = (1 + |zN |2 )−2j |pN (zN )|2 (11.416)
as the reader should carefully check. Thus this is a globally defined notion of the length-
square of a section. It is called a Hermitian metric on the line bundle.
Remarks:
1. Recalling that 2j = −k the transition rules for all sections of the line bundle Lk can
be written
ψN (zN , z̄N ) = zSk ψS (zS , z̄S ) (11.417)
where there is now no condition of holomorphy. The space of all sections can be
SU (2)
identified with the space of all functions in IndU (1) (ρk ). This formulation makes
sense for any integer k. When k > 0 there are no holomorphic sections, but there are
plenty of C ∞ sections.
3. We will see later that SL(2, C) is isomorphic to the Lorentz group in 3+1 dimensions.
The Lorentz group acts on the set of light rays through the origin of Minkowski
space, and we can identify a light ray with its point on the celestial sphere. Under
this identification, the action of the Lorentz group on the set of light rays is just the
Mobius action on the sphere.
4. Representations Of Lorentz, Poincaré, and Affine Euclidean Groups. Also, the con-
struction of the irreducible unitary representations of Lorentz groups, and affine
Euclidean and Poincaré groups proceeds using this method. (That observation goes
back to Wigner and Bargmann.) Briefly, for a representation of the Poincaré group
one induces from a representation of the translation group (or the translation group
semidirect product with a compact rotation group). For a representation of the
Lorentz group one considers the homogeneous spaces from orbits of SO(1, d) in mo-
mentum space. For example, the mass shell p2 = m2 with p0 > 0 can be identified
with SO(1, d)/SO(d). Then one induces from a representation of SO(d) to produce
a unitary representation of SO(1, d). One source that approaches the subject from
the present viewpoint is M. Carmeli, Group Theory And General Relativity.
– 219 –
11.15 The Clebsch-Gordon Decomposition For SU (2)
In physics the spin j representation Vj shows up almost universally. Among other applica-
tions it appears in the quantum mechanical theory of spin. (See the relation to Lie algebras
below for an explanation of this.)
The combination of two systems of spins naturally leads to the question of how to give
an isotypical decomposition of Vj1 ⊗ Vj2 . This is known as the Clebsch-Gordon decompo-
sition. The general formula is:
V (j1 ) ⊗ V (j2 ) ∼
= V (|j1 − j2 |) ⊕ V (|j1 − j2 | + 1) ⊕ · · · · · · ⊕ V (j1 + j2 ) (11.418)
Note that every representation on the RHS has the same parity of (−1)2j .
Let us give a proof of (15.610) using characters. Because a representation is uniquely
determined by its character we can consider the character of V (j1 ) ⊗ V (j2 ). If we can write
this as a linear combination of characters χj with nonnegative integer coefficients we can
uniquely determine the decomposition into irreps.
The easiest thing to do is prove that
V 1 ⊗ Vj ∼
= Vj+1/2 ⊕ Vj−1/2 (11.419)
2
z 2j+1 − z −2j−1
χ1/2 (z)χj (z) = (z + z −1 )
z − z −1
z 2j+2 − z −2j−2 z 2j − z −2j (11.420)
= +
z − z −1 z − z −1
= χj+1/2 + χj−1/2
− z −2j1 −1 − z −2j2 −1
2j1 +1 2j2 +1
z z
χj1 (z)χj2 (z) = ·
z − z −1 z − z −1
! (11.421)
1 z 2j1 +2j2 +2 − z 2(j1 −j2 ) z −2j1 −2j2 −2 − z −2(j1 −j2 )
= +
z − z −1 z − z −1 z − z −1
z a+2 − z b z a−b+2 − 1
−1
= z b+1 = z b+1 + z b+3 + · · · + z a+1 (11.422)
z−z z2 − 1
for each of the two terms in the sum above, then realize that the two terms are related by
z → 1/z and we directly obtain:
– 220 –
It is also instructive to give a proof using the orthogonality of characters. One can
check directly by contour integration that
(
+1 |j1 − j2 | ≤ j ≤ j1 + j2 & 2j = 2j1 + 2j2 mod2
I
1 dz
− χj1 (z)χj2 (z)χj (z)(z−z −1 )2 =
4πi z 0 else
(11.424) ♣Explain this
better ♣
Definition Choose an ON basis ψj,m of Vj . Let Pj the the orthogonal projector onto
the subspace of Vj1 ⊗ Vj2 transforming in the representation Vj . Then
V1/2 ⊗ V1/2 ∼
= V0 ⊕ V1 (11.426)
In particular, as noted above in equation (11.180) the projector to the isotypical component
of the trivial representation is always:
Z
P = T (g)dg (11.428)
G
In our example,
T (g)|βi ⊗ |δi = gαβ gγδ |αi ⊗ |γi (11.429)
Tradition demands that an ON basis for the fundamental representation be denoted {|+i, |−i}
with
d(z) · |+i = z|+i d(z)· = z −1 |−i (11.430)
We denote the basis vectors |αi with α, .... ∈ {+, −}. If we look at the projector to the
trivial representation then we need to evaluate
Z
gαβ gγδ dg (11.431)
SU (2)
0 0
Now recall that gαβ = αα0 ββ 0 (g ∗ )α β so we can use the orthogonality relations to say
Z
1
gαβ gγδ dg = αγ βδ (11.432)
SU (2) 2
– 221 –
It follows that an ON basis for the spin 0 (“singlet”) isotypical component of V1/2 ⊗V1/2
is
1
√ (|+i ⊗ |−i − |−i ⊗ |+i) (11.433)
2
This state is also called a Bell pair in quantum information theory. It is the simplest
example of a nontrivially entangled state and was used in the famous EPR paper - the first
paper on quantum information theory.
The orthogonal complement must be the spin one states, and a basis that diagonalizes
the action of the diagonal unitary matrices is
******************************
In physics a standard example of a general selection rule is the Wigner-Eckart theorem:
Suppose Oj,m a collection of operators transforming in the spin j rep
EXPLAIN.
**********************
11.16 Lie Groups And Lie Algebras And Lie Algebra Representations
11.16.1 Some Useful Formulae For Working With Exponentials Of Operators
In this section we will discuss some formulae that are very useful for working with exponen-
tials of matrices (and linear operators). In particular we will derive the Baker-Campbell-
Hausdorff formula.
Let us recall that if A is a matrix or an operator then eA is the matrix, or operator,
defined by the exponential series. The following three identities are easily shown by direct
use of the exponential series:
1.
eαA eβA = e(α+β)A (11.436)
2.
d tA
e = AetA = etA A (11.437)
dt
– 222 –
3.
A Be−A
eA eB e−A = ee (11.438)
Now we prove some identities that are not directly obvious from the exponential series:
We also denote:
m times
z }| {
(Ad(A))m B = [A, [A, · · · [A, B] · · · ] (11.440)
in other words:
eA Be−A = eAd(A) B
1
= B + Ad(A)B + (Ad(A))2 B + · · · (11.442)
2!
1
= B + [A, B] + [A, [A, B]] + · · ·
2!
To prove this define B(t) := etA Be−tA . So B(0) = B and B(1) = eA Be−A is the quantity
we want. Now it is easy to derive the differential equation:
d
B(t) = Ad(A)B(t) (11.443)
dt
so
B(t) = etAd(A) B(0) (11.444)
Now set t = 1.
Combining with (11.438) we now have the somewhat less trivial identity:
Ad(A) B
eA eB e−A = ee (11.445)
All these identities follow from a much more nontrivial formula, known as the Baker-
Campbell-Hausdorff formula that expresses the operator C defined by
eA eB = eC (11.446)
C = A + B + s(A, B) (11.447)
where s(A, B) is an infinite series and every term involves nested commutators.
– 223 –
We will give a complete statement and proof of the BCH formula below. In order to
do that we first state the extremely useful
Lemma : Let
ez − 1 z z2
f (z) = =1+ + + ··· (11.448)
z 2! 3!
Then
d A(t) −A(t) d
e e = −eA(t) e−A(t) = f (Ad(A(t))) · Ȧ(t) (11.449)
dt dt
where A(t) is any differentiable matrix function of t.
Note that this is nontrivial because Ȧ(t) does not commute with A(t) in general!
Indeed, from the exponential series you can easily show that
d A(t) 1
e − Ȧ(t)eA(t) = [A(t), Ȧ(t)] + · · ·
dt 2 (11.450)
d A(t) A(t) 1
e −e Ȧ(t) = − [A(t), Ȧ(t)] + · · ·
dt 2
Proof : Introduce a matrix function of two variables and take derivatives wrt s:
d −sA(t)
B(s, t) := esA(t)e
dt
∂B d d −sA(t)
= A(t)esA(t) e−sA(t) − esA(t)
e A(t)
∂s dt dt (11.451)
= Ad(A(t))B(s, t) − Ȧ(t)
∂j B
= (Ad(A(t)))j B(s, t) − (AdA(t))j−1 Ȧ(t)
∂sj
1 ∂j
B(s, t) |s=0 = −Ad(A(t))j−1 Ȧ(t) j≥1 (11.452)
j! ∂sj
So
∞
X sj (Ad(A(t)))j−1
sA(t) d −sA(t)
e (e )=− Ȧ(t) (11.453)
dt j!
j=1
because:
– 224 –
Z 1
d A(t)
e = esA(t) Ȧ(t)e(1−s)A(t) ds
dt 0
Z 1
= esAd(A(t)) dsȦ(t)eA(t) (11.455)
"0 #
eAd(A(t)) − 1
= Ȧ(t) eA(t)
Ad(A(t))
up to terms of order (∆s)2 . Next we sum over these terms and take N → ∞ to get (11.454).
– 225 –
satisfies C = log(eA eB ).
Proof :
Introduce the matrix-valued function C(t) via:
and note that C(0) = B, and C(1) is the matrix we want. We derive a differential equation
for C(t). By our lemma we have:
d −C(t)
eC(t) e = −f (AdC(t))Ċ(t) (11.462)
dt
with
ez − 1
f (z) = (11.463)
z
On the other hand, plugging in the definition (11.461) we compute directly the simple
result
d d
eC(t) e−C(t) = etA e−tA = −A (11.464)
dt dt
Therefore we get a differential equation:
f (AdC(t))Ċ(t) = A (11.465)
Let us make this more explicit: Using the power series g(w) above with w = ez note
that
ez − 1 z
f (z)g(ez ) = · z =1 (11.467)
z e −1
regarded as an identity of power series in z. Now we can substitute for z any operator O,
and use
Ċ(t) = f (Ad(C(t)))−1 · A
(11.469)
= g(exp(Ad(C(t))) · A
where we applied (11.468) with O = Ad(C(t). This hardly seems useful, since we still
don’t know C(t), but now since we have power series we can say
– 226 –
To prove (11.470) note that for all H we have:
Therefore:
Remarks:
1 1 1 1
C = A + B + [A, B] + [A, [A, B]] + [B, [B, A]] + [A, [B, [A, B]]] + · · ·
2 12 12 24
(11.475)
5
where the next terms are order if we scale A, B by .120
3. For suitable operators A, B on Hilbert space the BCH formula continues to hold. But
the series has a finite radius of convergence: See the exercises below.
Exercise
Work out the BCH series to order 5 in A, B.
119
It is useful to note that [A, [B, [A, B]]] = −[B, [A, [B, A]]] = B 2 A2 − A2 B 2
120
One can find an algorithm for generating the higher order terms in Varadarajan’s book on group theory.
– 227 –
Exercise
Show that we can also write:
Z 1
C = log(eA eB ) = A + g(e−Ad(s B) −Ad(A)
e )B ds (11.476)
0
AdB
C=B+ ()
eAdB
−1 (11.477)
1 1 1
= B + − [B, ] + [B, [B, ]] − [B, [B, [B, [B, ]]]] + · · ·
2 12 720
Note:
x 1 1 2 x4 x6 x8 x10
= 1 − x + x − + − +
ex − 1 2 12 720 30240 1209600 47900160
691
− x12 + · · · (11.478)
1307674368000
∞
X Bn xn
≡
n!
n=0
is an important expansion in classical function theory -the numbers Bn are known as the
Bernoulli numbers
There are many applications of this formula. One in particle physics is to spontaneous
symmetry breaking where the formula above gives the chiral transformation law of the pion
field. Here B = π(x) is the pion field and is the chiral transformation parameter.
– 228 –
11.16.2 Lie Algebras
We recall two basic definitions - the definition of an algebra and of a Lie algebra. See
Chapter 2, Linear Algebra User’s Manual for more discussion and examples. The relation
between Lie groups and Lie algebras is covered in much more detail in Chapter 8. But here
is a preview.
Definition An algebra over a field κ is a vector space A over κ with a notion of multipli-
cation of two vectors
A×A→A (11.479)
denoted:
a1 , a2 ∈ A → a1 a2 ∈ A (11.480)
which has a ring structure compatible with the scalar multiplication by the field. Con-
cretely, this means we have axioms:
i.) (a1 + a2 ) a3 = a1 a3 + a2 a3
ii.) a1 (a2 + a3 ) = a1 a2 + a1 a3
iii.) α(a1 a2 ) = (αa1 ) a2 = a1 (αa2 ), ∀α ∈ κ.
Example: A basic example of an algebra is the vector space of n × n matrices over a field
κ. The vector addition is simply addition of matrices. The algebra product is matrix
multiplication.
Definition A Lie algebra over a field κ is an algebra A over κ where the multiplication of
vectors a1 , a2 ∈ A, satisfies in addition the two conditions:
1. ∀a1 , a2 ∈ A:
a2 a1 = −a1 a2 (11.481)
2. ∀a1 , a2 , a3 ∈ A:
Now, tradition demands that the product on a Lie algebra be denoted not as a1 a2
but rather as [a1 , a2 ] where it is usually referred to as the bracket. So then the two defining
conditions (11.481) and (11.482) are written as:
1. ∀a1 , a2 ∈ A:
[a2 , a1 ] = −[a1 , a2 ] (11.483)
– 229 –
2. ∀a1 , a2 , a3 ∈ A:
[[a1 , a2 ], a3 ] + [[a3 , a1 ], a2 ] + [[a2 , a3 ], a1 ] = 0 (11.484)
Remark: Note that if we consider the Lie algebra product as the algebra product on a
vector space, then the algebra is non-associative. If we have a Lie algebra product and use
it to define an algebra product: a1 a2 := [a1 , a2 ] then
[a1 , a2 ] := a1 a2 − a2 a1 (11.486)
However, it should be stressed that not all Lie algebras arise in this way. A good example
would be first order smooth differential operators on, say, Rn , (or more generally, smooth
vector fields on a manifold). If
∂
V = v µ (x) µ (11.487)
∂x
then the product of the differential operators - which makes sense as a second order operator
- is not a vector field, but we can define
µ ∂ ν µ ∂ ν ∂
[V1 , V2 ] = v1 µ v2 − v2 µ v1 (11.488)
∂x ∂x ∂xν
Example 2: An important example arises by applying the general remark above to the al-
gebra Mn (κ). In this case the Lie algebra is often denoted gl(n, κ). Now Mn (κ) and gl(n, κ)
are precisely the same as sets, and are precisely the same as vector spaces. The notation
simply puts a different stress on the algebraic structure which the author is considering.
– 230 –
over C these Lie algebras have special names, and for good reasons the even and odd cases
are considered as separate families. They are denoted Bn = so(2n + 1) and Dn = so(2n).
Example 6: Finally sp(2n, κ) ⊂ Mn (κ) is the Lie subalgebra such that (Ja)tr = Ja where
!
0 1
J= (11.489)
−1 0
Example 7: The intersection of two Lie algebras is a Lie algebra and a particularly
important example is usp(2n) = su(2n) ∩ sp(2n, C) ⊂ M2n (C). ♣say why ♣
Let g ⊂ gl(n, κ) be a sub-Lie algebra with κ = Q, R, C. The BCH formula gives one
way to understand a relation between Lie algebras and Lie groups: Provided the BCH
series converges it follows that if A, B ∈ g then
eA eB = eC (11.490)
exp : g → G (11.491)
is indeed surjective, although it will not be injective. A good example is the case U (1).
Here the Lie algebra is g = iR (and the commutator is zero). Then all of 2πiZ is in the
kernel of the exponential map.
Conversely, from a matrix Lie group one can recover the Lie algebra by considering the
general one-parameter subgroups g(t) with g(0) = 1 and computing g −1 (t) dt d
g(t) at t = 0.
We will elaborate on this idea by first closing a gap in our discussion at the very beginning
of the course and prove that the classical Lie groups are manifolds as follows:
– 231 –
Exercise Structure Constants
If g is a Lie algebra over κ then we can choose a basis T i for g and necessarily we have
an expansion X ij
[T i , T j ] = fk T k fkij ∈ κ (11.492)
k
from which one can construct all commutators. The constants fkij are known as structure
constants.
a.) Show that
fkij = −fkji (11.493)
f`ij fm
`k
+ f`jk fm
`i
+ f`ki fm
`j
=0 (11.494)
b.) Conversely, show that given tensors fkij ∈ κ satisfying equations (11.493) and
(11.494) one can define a Lie algebra over κ.
Remark: Sometimes Lie algebras are presented by giving a list of structure constants. If
someone tries to sell you a Lie algebra by giving you a list of structure constants don’t buy
it until you have checked equations (11.493) and (11.494).
df : Tp M1 → Tf (p) M2 (11.495)
is surjective. In this case f (p) is said to be a regular value. Using results from the calculus of
many variables one can show that if f is a submersion at p then there are local coordinates
so that in a neighborhood of p it has the form:
That is, locally, in suitable coordinate systems f is literally the map f : Rn1 → Rn2 keeping
only the first n2 coordinates. Then, every point in the target Rn2 is a regular value of f
and the inverse image of a regular value is
is a submanifold of dimension n1 − n2 .
Therefore we have
– 232 –
is, the preimage f −1 (q) is a submanifold of M1 of codimension dimM2 . The tangent space
to f −1 (q) at any point p is ker(dfp ).
are called level sets. If ~c is a regular value then the level set is a submanifold of M of
codimension `. Note that for each i, f i : M → R so df i : Tp M → Tci R ∼ = R, and hence df i
is a linear functional on Tp M . The regularity condition is the condition that these linear
functionals are all linearly independent. We say that the functions f i are independent.
We can apply these ideas to give easy proofs that the classical matrix groups are in
fact Lie groups. First we prove they are all manifolds:
M ∈ TA Mn (κ) ∼
= Mn (κ) (11.499)
we have
dfA (M ) = detATr(A−1 M ) (11.500)
This is usually written as the (very useful) identity 121
δlogdetA = Tr A−1 δA
(11.501)
for A invertible. When A is invertible the kernel of dfA is the linear subspace of n × n
matrices M such that A−1 M is traceless, which is linearly equivalent to the linear subspace
of traceless matrices, and therefore has dimension (over κ) equal to n − 1. Therefore the
rank of dfA is 1, and f = det is a submersion. So the inverse image is a manifold.
1
Example 3: For O(n; κ) we define f : Mn (κ) → Sn (κ) where Sn (κ) ∼
= κ 2 n(n+1) is the
vector space over κ of n × n symmetric matrices. We take f to be
– 233 –
Then O(n) = f −1 (0). We aim to show it is a manifold. Note that dfA is a linear operator
Mn (κ) → Sn (κ). It is just
dfA (M ) = M Atr + AM tr (11.503)
Therefore ker(dfA ) is the linear subspace of Mn (κ) of matrices such that M Atr is anti-
symmetric. When A is invertible this subspace is isomorphic to the linear subspace of
anti-symmetric matrices and hence has dimension 21 n(n − 1). It follows that 0 is a regular
value of f and O(n, κ) is a manifold.
Example 4: For Sp(2n; κ) we define f : M2n (κ) → A2n (κ) where A2n (κ) is the set of
(2n) × (2n) matrices over κ such that (Jm) is antisymmetric. This is isomorphic to the
vector space over κ of dimension 21 (2n)(2n − 1) = n(2n − 1). Now we take f to be
so that Sp(2n; κ) = f −1 (0). Again we claim that 0 is a regular value of f . Now dfA is the
linear operator Mn (κ) → A2n (κ). It is just
Therefore ker(dfA ) is the linear subspace of Mn (κ) of matrices such that M JAtr is symmet-
ric. When A is invertible this subspace is isomorphic to the linear subspace of symmetric
matrices and hence has dimension 21 2n(2n + 1), which is complementary to the dimension
of the image 12 (2n)(2n − 1) and hence dfA is surjective. It follows that 0 is a regular value
of f and Sp(2n; κ) is a manifold.
Example 5: Finally, for U (n) consider f : Mn (C) → Hn where Hn is the real vector space
of n × n Hermitian matrices in Mn (C). This has real dimension n + 2 × 12 n(n − 1) = n2 .
We now take f (A) = AA† − 1. Then
dfA (M ) = M A† + AM † (11.506)
When A is invertible the kernel is the subspace of Mn (C) of matrices such that M A† is
anti-hermitian. This is again a real vector space of real dimension n + 2 × 21 n(n − 1) = n2 .
Since Mn (C) is a real vector space of real dimension 2n2 it follows that dfA is surjective
and hence 0 is a regular value of f . Therefore U (n) is a manifold.
Example 6: It is useful to combine the previous two examples and define the Lie group:
Now we have a map: f : M2n (C) → A2n (C) ⊕ H2n defined by taking the direct sum. Again,
one must check that the real linear map dfA at an invertible matrix in the preimage of 0
has a kernel of the correct dimension. 122
122
We have not covered quaternions yet, but a superior viewpoint is to view U Sp(2n) as the group of
n × n unitary matrices over the quaternions. In this viewpoint we should define f : Mn (H) → Hn (H)
– 234 –
All the examples above are submanifolds of GL(n, κ). The group operation of mul-
tiplication is polynomial in the matrix elements and hence certainly a C ∞ function. The
group operation of inversion is a rational function of the matrix elements and is also C ∞ on
GL(n, κ). It follows from the exercise of Section §?? that the group operations of multipli-
cation and inversion are C ∞ maps in all the cases of the matrix subgroups. This concludes
the argument that the above examples are all Lie groups.
In general, the Lie algebra of a Lie group G is defined, as a vector space, to be the
tangent space at the identity:
Lie(G) := T1 G (11.508)
2. SL(n, κ):
T1 SL(n, κ) ∼
= {M ∈ Mn (κ)|Tr(M ) = 0} = sl(n, κ) (11.510)
3. O(n, κ):
so(n; κ) := T1 O(n, κ) ∼
= {M ∈ Mn (κ)|M tr = −M } = o(n, κ) = so(n, κ) (11.511)
4. Sp(2n, κ):
T1 Sp(2n, κ) ∼
= {M ∈ M2n (κ)|(M J)tr = +M J} = sp(2n, κ) (11.512)
5. U (n):
T1 U (n) ∼
= {M ∈ Mn (C)|M † = −M } = u(n) (11.513)
6. SU (n):
T1 SU (n) ∼
= {M ∈ Mn (C)|M † = −M & Tr(M ) = 0} = su(n) (11.514)
7. U Sp(2n):
T1 U Sp(2n) ∼
= {M ∈ M2n (C)|M † = −M & (M J)tr = M J}
(11.515)
∼
= {M ∈ Mn (H)|M † = −M } := usp(2n)
where Hn (H) is the space of n × n quaternionic Hermitian matrices. The above arguments work in the
same way: dimR Mn (H) = 4n2 , while dimR Hn (H) = n + 4 12 n(n − 1) = 2n2 − n. As before, the kernel of
dfA , for A invertible is the space of n × n quaternionic-antihermitian matrices. This has real dimension
3n + 4 21 n(n − 1) = 2n2 + n. (The 3n is there because one can have an arbitrary imaginary quaternion on
the diagonal.) Therefore 0 is a regular value, and U Sp(2n) is a manifold. Many authors denote this group
simply as Sp(n).
– 235 –
We now recognize that the above vector spaces of matrices are in fact the Lie algebras
we introduced earlier. We can get back the groups by exponentiation. It is a good exercise
to check, from the defining relations of T1 G above that the exponentiated matrix indeed
satisfies the defining relations of the group. Thus, for example, one should check that if M
is anti-Hermitian, i.e. if M † = −M then exp(M ) is unitary.
We can argue more generally that they must be Lie algebras as follows: Given a matrix
V ∈ T1 G ⊂ MN (κ) we can form the family of group elements
∞
X (tV )n
gV (t) = exp[tV ] := (11.516)
n!
n=0
In general, in differential geometry, there is an exponential map from the tangent space
Tp M of a manifold M to a neighborhood of p in M . It is not as explicit as an exponential
series of a matrix, but involves using the vector to define a differential equation. An
important property of the tangent spaces T1 G for the various groups above is that:
If V1 , V2 ∈ T1 G then the matrix commutator [V1 , V2 ] is also in T1 G.
This can be verified by directly checking each case. For example, in the case of so(n, κ),
if V1 , V2 are antisymmetric matrices over κ then neither V1 V2 , nor V2 V1 is antisymmetric,
but [V1 , V2 ] is. The reader should check the other cases in this way. Nevertheless, this
fact also follows from more general principles, and that is important because as we will see
not every Lie group is a classical matrix group. In fact, it is not true that every finite-
dimensional Lie group is a subgroup of GL(N, R) for some N . 123 Given V1 , V2 we can
consider the path through g = 1 at t = 0 given by the group commutator:
√ √
λ(t) = [gV1 ( t), gV2 ( t)] (11.518)
Now, usin the BCH formula one can show that for t1 , t2 small we have
1
gV1 (t1 )gV2 (t2 ) = exp[t1 V1 + t2 V2 + t1 t2 [V1 , V2 ] + O(ta1 tb2 )] (11.519)
2
where the higher order terms have a + b > 2, and therefore the tangent vector to the path
through λ(t) is the matrix commutator.
Therefore, just based on group theory and manifold theory, one can deduce that the
tangent space at the identity g = T1 G is indeed a Lie algebra.
One of the main theorems about the relation of Lie groups and Lie algebras is the
following:
Theorem:
123
A counterexample is the metaplectic group, a group which arises as a central extension of the symplectic
group when one tries to implement symplectic transformations on a the quantum mechanics of a system of
free particles.
– 236 –
a.) Every finite dimensional Lie algebra g over κ = R arises from a unique (up to
isomorphism) connected and simply connected Lie group G.
b.) Under this correspondence, Lie group homomorphisms f : G1 → G2 are in 1 − 1
correspondence with Lie algebra homomorphisms µ : T1 G1 → T1 G2 .
For more about this see Chapter 8. The best statement makes use of the language of
categories. What is described here is an equivalence of categories. See Section **** below.
Examples:
You should check the structure constants, taking careful note of signs:
Note that sl(2, R) is qualitatively different: The elements e, f ∈ sl(2, R) have non-
trivial Jordan form and cannot be diagonalized (even within GL(2, C)) So this Lie
algebra is inequivalent to su(2). It is not hard to show that if A ∈ sl(2, R) then it is
conjugate (via SL(2, R) conjugation) to one of three distinct forms
!
−1 1x
SAS = xe = x∈R (11.526)
0 1
!
−1 −x 0
SAS = xh = x∈R (11.527)
0 x
– 237 –
!
−1 0 x
SAS = x(e − f ) = x∈R (11.528)
−x 0
and hence there are three distinct maximal Abelian subgroups (up to conjugation):
!
1x
exp[xe] = x∈R (11.529)
0 1
!
e−x 0
exp[xh] = x∈R (11.530)
0 ex
!
cos x sin x
exp[x(e + f )] = x∈R (11.531)
− sin x cos x
Note that the last subgroup is compact and is just a copy of SO(2, R) and x ∼ x + 2π
parametrize the same group element.
It is easy to show these subgroups are not conjugate by considering the trace - see the
exercise below. It is now easy to see that the exponential map cannot be surjective
onto SL(2, R). Consider the group elements
!
−1 x
x 6= 0 (11.532)
0 −1
If this were of the form exp[A] then it would be in the one-parameter group exp[tA].
But the trace is −2 and the only way one of the above one-parameter groups can
have trace = −2 is to take (11.531) with x = π(2n + 1) with n ∈ Z. Nevertheless,
via the Gram-Schmidt procedure (see chapter 2) one can prove the so-called KAN
decomposition: Every g ∈ SL(2, R) can be uniquely written in the form
! ! !
cos θ sin θ λ 0 1x
g= · · (11.533)
− sin θ cos θ 0 λ−1 0 1
with λ > 0 and x ∈ R. So the products of the one-parameter groups generate all
group elements.
Remark: The group manifold SL(2, R) as anti-deSitter space: Every 2 × 2 real matrix
can plainly be written as: !
T1 + X1 X2 + T2
g= (11.534)
X2 − T2 T1 − X1
for real variables T1 , T2 , X1 , X2 . Restricting to matrices with detg = 1 gives the hyperboloid
in R2,2 :
−T12 − T22 + X12 + X22 = −1 (11.535)
– 238 –
Thus one picture of SL(2, R) can be given as a hyperboloid in R2,2 . The induced metric
has signature (−1 , +2 ). Indeed, a global set of coordinates is:
T1 = cosh ρ cos t
T2 = cosh ρ sin t
(11.536)
X1 = sinh ρ cos φ
X2 = sinh ρ sin φ
These coordinates smoothly cover the manifold once for 0 ≤ ρ < ∞, t ∼ t + 2π, φ ∼ φ + 2π.
Substituting into
ds2 = −(dT1 )2 − (dT2 )2 + (dX1 )2 + (dX2 )2 (11.537)
the induced metric becomes
This is one form of the anti-deSitter metric of constant curvature −1. Note that the time
coordinate t is periodic. What is usually meant by anti-deSitter space is the universal cover
of SL(2, R).
the group commutator, g1 g2 g1−1 g2−1 corresponds to the Lie algebra commutator:
b.) We say a Lie algebra is “Abelian” if [A1 , A2 ] = 0 for all A1 , A2 ∈ g. That that
such a Lie algebra exponentiates to form an Abelian group.
d
c.) If Ai = dt |0 gi (t) then [A1 , A2 ] is the Lie algebra element associated to the curve
√ √ √ √
g12 (t) = g1 ( t) · g2 ( t) · g1−1 ( t) · g2−1 ( t) (11.541)
– 239 –
Exercise Inequivalent One-Parameter Subgroups Of SL(2, R)
a.) Show that the three maximal Abelian subgroups (11.529), (11.530), (11.531) are
non-conjugate. 124
b.) Consider the one-parameter subgroup exp[xf ]. Is it conjugate to one of the above
three?
ρ : g → End(V ) (11.542)
for some vector space V (again sometimes called the carrier space) such that
for all x, y ∈ g.
Note that any Lie algebra g has a canonical representation with V = g and
This follows because, as one easily checks, the equation [ρ̇(x), ρ̇(y)] = ρ̇([x, y]) is equivalent
to the Jacobi identity. This representation is known as the adjoint representation.
Given a representation ρ̇ of a Lie algebra g we get a representation of a corresponding
(connected) Lie group G so that g = T1 G. Recall a representation of a Lie group is a group
homomorphism
ρ : G → GL(V ) (11.545)
For g = gl(n, κ) the group representation associated to the adjoint representation has
carrier space g and the group action is:
Remarks:
124
Answer : Consider the traces of these matrices. They are = 2, ≥ 2, and ≤ 2, respectively.
125
We are assuming there is a corresponding Lie group. Some Lie algebras can, in fact, not be exponenti-
ated to form Lie groups.
– 240 –
1. Adjoint Representation Of su(2) And Rotations. We have already seen this in
a slightly different form when we defined the natural double-cover homomorphism
π : SU (2) → SO(3). We can identify su(2) ∼ = R3 as a vector space by identifying
3
~x ∈ R with
M~x := i~x · ~σ (11.549)
The adjoint representation is ρadj (u) · M = uM u−1 , so ρadj acts linearly on ~x ∈ R3 .
Note that M~x2 = −~x2 12×2 , so this linear action preserves the norm and therefore
ρadj (u) ∈ O(3). Since
So, in a manner similar to the previous example SL(2, R) double covers the connected
component of the 2 + 1 dimensional Lorentz group SO0 (1, 2).
3. sl(2, C) And Lorentz Transformations For completeness we note that 2×2 Hermitian
matrices can be identified with 3 + 1 dimensional Minkowski space M1,3 via
Note g ∈ SL(2, C) acts linearly via ρ(g) · M = gM g † since this preserves Hermiticity.
However
detMxµ = (x0 )2 − ~x2 (11.554)
and the determinant is preserved by this actions so g 7→ ρ(g) describes SL(2, C) as a
double cover of the connected component SO0 (1, 3) of the 3 + 1 dimensional Lorentz
group.
4. su(2)⊕su(2) And Rotations In R4 Finally, a very similar construction gives the double
cover π : SU (2) × SU (2) → SO(4). But this is best discussed in the context of the
quaternions. See the section on quaternions in Chapter 2.
– 241 –
Show that for G = GL(n, κ) the corresponding Lie algebra is gl(n, κ) by considering
the 1-parameter subgroups exp[teij ] where eij are the matrix units.
Exercise
Interpret equation (11.441) as a special case of (11.546) for the adjoint representation.
Use this to derive (11.470) from the group homomorphism property of ρ.
Show that if G is a Lie gorup then the corresponding Lie algebra representation on V1 ⊗ V2
representations X ∈ g = T1 G by
e = iT 1 − T 2
h = −2iT 3 (11.557)
1 2
f = −iT − T
– 242 –
Suppose we have a finite-dimensional complex vector space V and linear operators
ρ(e), ρ(f ), ρ(h) on V satisfying the commutation relations:
As shown in Chapter two, any linear operator on a complex vector space has at least one
eigenvector. (It might have only one eigenvector.)
Suppose we choose an eigenvector v of ρ(h) and suppose the eigenvalue is λ. Then, we
claim that, so long as ρ(e)n v 6= 0 the vector ρ(e)n v has eigenvalue λ − 2n. To prove this
apply the general identity
n−1
X
[A, B n ] = B i [A, B]B n−1−i (11.559)
i=0
to conclude:
[ρ(h), ρ(e)n ] = −2nρ(e)n (11.560)
and the result follows. Now, it is general fact of linear algebra that if we have nonzero
vectors v1 , . . . , vn with distinct eigenvalues λ1 , . . . , λn for some operator then the v1 , . . . , vn
are linearly independent. (Prove this as an exercise.) Therefore, if ρ(e)n v 6= 0 then the
vectors v, ρ(e)v, . . . , ρ(e)n v are linearly independent. Therefore, since we have a finite-
dimensional representation there must be a nonnegative integer n so that ρ(e)n v 6= 0 but
ρ(e)n+1 v = 0. Let us denote v0 := ρ(e)n v. So ρ(e)v0 = 0 with ρ(h)v0 = λ0 v0 and v0 6= 0.
Now, using (11.559) again we get:
and therefore
ρ(h)(ρ(f )k v0 ) = (λ0 + 2k)ρ(f )k v0 (11.562)
By a similar argument to that above, we know that if ρ(f )n v0 is nonzero then the vectors
v0 , ρ(f )v0 , . . . , ρ(f )n v0 are linearly independent, and therefore there must exist an integer
N so that ρ(f )N v0 6= 0 but ρ(f )N +1 v0 = 0. Therefore
Now, using
N
X N
X
N −i
[ρ(e), ρ(f ) N +1
]= i
ρ(f ) [ρ(e), ρ(f )]ρ(f ) = ρ(f )i ρ(h)ρ(f )N −i (11.564)
i=0 i=0
– 243 –
But ρ(f )N v0 6= 0, by the definition of N so
N
!
X
(λ0 + 2(N − i)) =0 (11.566)
i=0
– 244 –
As mentioned above, we automatically get a representation of su(2), and hence of the
group SU (2). Therefore V is completely reducible. Therefore, we have recovered - this
time at the Lie algebra level - that there is one irreducible representation of SU (2) of
dimension n for each positive integer n. In fact, identifying N = 2j the representation WN
gives another model for the spin j representation Vj .
Remarks
[J a , J b ] = iabc J c (11.574)
which, unfortunately, obscures the fact that we are working with a real Lie algebra.
Group elements are of the form exp[iθ~ · J] ~ where θ~ is real. When working with
representations physicists generally define
J + = J 1 + iJ 2 = e
(11.575)
J − = J 1 − iJ 2 = −f
so that
[J 3 , J ± ] = ±J ±
(11.576)
[J + , J − ] = 2J 3
More generally, in a physical system with SU (2) symmetry the J a are the conserved
“Noether charges” for that symmetry. Recall that we called the irreducible represen-
tations of SU (2) of dimension n = 2j + 1 the “spin j representations.” This is the
origin of that terminology.
– 245 –
with m = −j, −j + 1, . . . , j − 1, j. We can relate this to our basis by identifying
N = 2j and
ρ(f )s v0 = Cj,s |j, j − si (11.578)
for a suitable normalization factor Cj,s .
Exercise Unitarizing Vj
– 246 –
Unitarizing Vj means equipping it with an inner product so that the ρ(J a ) are Hermi-
tian operators. Equivalently, ρ(h) is Hermitian and
11.16.6 Casimirs
We have stressed that if two elements a, b in a Lie subalgebra of a matrix Lie algebra are
multiplied as matrices ab then in general the result is not in the Lie algebra. Nevertheless,
if we have a representation ρ(a), ρ(b) ∈ End(V ) nothing stops us form multiplying the
operators ρ(a)ρ(b). Certain relations among the algebra of the operators ρ(a) for a ∈ g are
universal and independent of the representation. They can be expressed in terms of tensor
algebra using the universal enveloping algebra. See Chapter two for a description. Here we
just look at one important aspect of such universal relations.
In any representation of su(2) the operator
3
X
C2 (ρ) := ρ(T a )2 (11.587)
a=1
commutes with all the operators ρ(T a ). This operator is known as a quadratic Casimir.
It is a theorem in the theory of universal enveloping algebras that any representation, any
operator that commutes with all operators made by multiplying and adding the ρ(T a ) is
a polynomial in the operator C2 (ρ). This fact generalizes to all simple Lie algebras: The
center of the universal enveloping algebra is a polynomial in the Casimirs and there are
r independent Casimirs where r is the rank. For SU (N ) there are N − 1 independent
Casimirs.
Returning to SU (2), we can express C2 (ρ) in terms of the representations of the basis
for sl(2, R) using
1
2(ρ(e)ρ(f ) + ρ(f )ρ(e)) − ρ(h)2
C2 (ρ) = (11.588)
4
In any irreducible representation the Casimir operator must be a multiple of the iden-
tity operator. We can easily compute the value by acting on any convenient vector. For
example,
1
2(ρ(e)ρ(f ) + ρ(f )ρ(e)) − ρ(h)2 v0
C2 (ρ)v0 =
4 (11.589)
N (N + 2)
=− v0
4
– 247 –
which the physicists will prefer to write as
~ 2 = j(j + 1)1(2j+1)×(2j+1)
ρj (J) (11.590)
Examples:
We can easily use representation theory to find the spectrum of this Hamiltonian.
We note that
1 a
H= (J1 ⊗ 1 + 1 ⊗ J2a )2 − J~12 ⊗ 1 − 1 ⊗ J~22 (11.592)
2
But the Hilbert space is V1/2 ⊗ V1/2 ∼
= V0 ⊕ V1 . On the one-dimensional subspace
∼ V
= 0 we use (11.590) to compute
1 3 3 3
H|V0 = (0 − − ) = − (11.593)
2 4 4 4
∼
But on the three-dimensional subspace = V1 we have
1 3 3 1
H|V1 = (2 − − ) = + (11.594)
2 4 4 4
126
Answer : We can write:
1 a
H= (J1 ⊗ 1 ⊗ 1 + 1 ⊗ J2a ⊗ 1 + 1 ⊗ 1 ⊗ J3a )2 − J~12 ⊗ 1 ⊗ 1 − 1 ⊗ J~22 ⊗ 1 − 1 ⊗ 1 ⊗ J~32 (11.596)
2
But we have the isotypical decomposition:
– 248 –
11.16.7 Lie Algebra Operators In The Induced Representations Of SU (2)
Using the above models for the irreducible representations of SU (2) and the relation be-
tween elements of the Lie algebra and infinitesimal group elements we get representations
of the Lie algebra in terms of differential operators.
In terms of homogeneous polynomials ψ(u, v) of two variables in the space H2j recall
that ! !!
u u
(ρ(g) · ψ)( ) := ψ g −1 (11.600)
v v
Now suppose g is infinitesimally close to the identity so that
! !!
u u
(ρ(1 + X) · ψ)( ) := ψ (1 − X) (11.601)
v v
up to order 2 . Since ρ(X) is linear in X we can form complex linear combinations, and
therefore the same formula will apply to any X ∈ sl(2, C). In particular, we deduce:
∂
ρ(e) · ψ = −v ψ
∂u
∂ ∂
ρ(h) · ψ = u −v ψ (11.602)
∂u ∂v
∂
ρ(f ) · ψ = u ψ
∂v
The reader should check the operators really satisfy ρ([X, Y ]) = [ρ(X), ρ(Y )]. Note that
the Casimir is:
∂ 2 1
1 ∂ ∂ ∂
C2 (ρ) = − u +v − u +v (11.603)
4 ∂u ∂v 2 ∂u ∂v
so acting on H2j we immediately get that C2 (ρj ) acts as the scalar operator of multiplication
by j(j + 1).
It is also interesting to consider the action in the inhomogeneous representation. As
above, we extend the action to all of SL(2, C) so that if
!
a b
g= ∈ SL(2, C) (11.604)
c d
then
2j dz − b
ρ(g) · p(z) = (−cz + a) p (11.605)
−cz + a
and if g = 1 + X + O(2 ) then ρ(g) = 1 + ρ(X) + O(2 ), and in this way we derive:
∂
ρ(e) · p = − p
∂z
∂
ρ(h) · p = −(2z + 2j)p (11.606)
∂z
∂
ρ(f ) · p = −(z 2 − 2jz)p
∂z
– 249 –
For example:
2j z
(ρ(1 + f ) · p)(z) := (z + 1) p
z + 1
= (1 + 2jz)p(z − z 2 ) + O(2 ) (11.607)
∂
= p(z) − z 2 p(z) + 2jzp(z) + O(2 )
∂z
Similarly, acting on the Wigner functions themselves we learn that, after complexifi-
j
cation ρ(e) translates the group element in the argument of Dm L ,mR by
! !
α −β̄ β ᾱ
g= →g− (11.608)
β ᾱ 0 0
∂ ∂
ρ(e) = ᾱ −β
∂ β̄ ∂α
∂ ∂ ∂ ∂
ρ(h) = ᾱ −α − β̄ +β (11.610)
∂ ᾱ ∂α ∂ β̄ ∂β
∂ ∂
ρ(f ) = α − β̄
∂β ∂ ᾱ
We can specialize this to get the standard identities on spherical harmonics which are ♣NEED TO GIVE
MORE DETAILS
frequently used in mathematical physics: 127 ON THIS LAST
REMARK ♣
iφ ∂ ∂
L+ = e + icotθ
∂θ ∂φ
−iφ ∂ ∂
L− = e − + icotθ (11.612)
∂θ ∂φ
∂
L3 = −i
∂φ
127
See, for example, J.D. Jackson, Classical Electrodynamics, page 743.
– 250 –
p
L+ Y`,m = (` − m)(` + m + 1)Y`,m+1
p
L− Y`,m = (` + m)(` − m + 1)Y`,m−1
(11.613)
L3 Y`,m = mY`,m
~ 2 Y`,m = `(` + 1)Y`,m
L
Remark: We have seen above that group theory gives a nice perspective on many identi-
ties satisfied by the family of special functions associated with Legendre polynomials and
spherical harmonics. This viewpoint extends very nicely to many other special functions
in mathematical physics. Two references that explain this in some detail are:
1. J.D. Talman, Special Functions: A Group Theoretic Approach Based on Lectures
by Eugene P. Wigner
2. N. Ja. Vilenkin, Special Functions and the Theory of Group Representations ♣The following
section on Kernel,
Image, Exact
Sequence should be
moved to be just
before the
12. Group Theory And Elementary Number Theory representation
theory section, but
the quantum
mechanics on the
circle should be
In this chapter we review some very elementary number theory that has a strong connection moved either to the
Heisenberg section
to group theory. The facts here can be very useful in thinking about many physics problems. or to the Pontryagin
duality section. ♣
Two general references are
Hardy and Wright, An Introduction To The Theory Of Numbers
Ireland and Rosen, A Classical Introduction to Modern Number Theory
A = qB + r 0≤r<B (12.1)
for unique nonnegative integers q and r known as the quotient and the residue, respectively.
Next, let (A, B) = (±A, ±B) = (±B, ±A) denote the greatest common divisor of A, B.
Then we can find it using the Euclidean algorithm by looking at successive quotients. If
– 251 –
A = qB with r = 0 we are done! Then (A, B) = B. If r > 0 then we proceed as follows:
A = q1 B + r1 0 < r1 < B
B = q2 r1 + r2 0 < r2 < r1
r1 = q3 r2 + r3 0 < r3 < r2
r2 = q4 r3 + r4 0 < r4 < r3 (12.2)
.. ..
. .
rj−2 = qj rj−1 + rj 0 < rj < rj−1
rj−1 = qj+1 rj
Note that B > r1 > r2 > · · · ≥ 0 is a strictly decreasing sequence of nonnegative integers
and hence must terminate at r∗ = 0 after a finite number of steps.
Examples
A = 96 and B = 17:
96 = 5 · 17 + 11
17 = 1 · 11 + 6
11 = 1 · 6 + 5 (12.3)
6=1·5+1
5=5·1
A = 96 and B = 27:
96 = 3 · 27 + 15
27 = 1 · 15 + 12
(12.4)
15 = 1 · 12 + 3
12 = 4 · 3
Note well: In (12.1) the remainder might be zero but in the first j lines of the Euclidean
algorithm the remainder is positive, unless B divides A, in which case rather trivially
(A, B) = B. The last positive remainder rj is the gcd (A, B). Indeed if m1 , m2 are integers
then the gcd satisfies:
for any integer x. Applying this to the Euclidean algorithm above we get:
A corollary of this algorithm is that if g = (A, B) is the greatest common divisor then
there exist integers (x, y) so that
Ax + By = g (12.7)
– 252 –
In particular, two integers m1 , m2 are relatively prime, that is, have no common integral
divisors other than ±1, if and only if there exist integers x, y such that
m1 x + m2 y = 1. (12.8)
Of course x, y are not unique. Equation (12.8) is sometimes known as “Bezout’s theorem.”
We can prove these statements from the Euclidean algorithm as follows. ♣Putting this
discussion here
For an integer n define ! makes part of the
section on SL(2, Z)
1n and continued
T (n) := = Tn (12.9) fractions a little
0 1 redundant. ♣
where !
11
T := T (1) = . (12.10)
01
Now let us write the first line of the Euclidean algorithm as a matrix identity as
! !
A r1
T (−q1 ) = (12.11)
B B
Thus we have ! !
A rj−1
σ 1 T (−qj ) · · · σ 1 T (−q2 )σ 1 T (−q1 ) = (12.14)
B rj
and in the final step:
! !
A rj
σ 1 T (−qj+1 )σ 1 T (−qj ) · · · σ 1 T (−q2 )σ 1 T (−q1 ) = (12.15)
B 0
(Note that x, y, u, v are polynomials in the qi . See comments on continued fractions below.)
Remarks:
1. The Euclidean algorithm is fast: A theorem of Lamé asserts that the Euclidean
algorithm is very efficient. It should be completely obvious to you that the number
of steps cannot exceed B. (Recall that A > B.) However, Lamé asserts that in fact
the number of steps never exceeds 5log10 B. This is important for RSA (see below).
– 253 –
2. Relation to continued fractions: Note that from equation (12.15) we can also write
! !
A rj
= T (q1 )σ 1 T (q2 )σ 1 · · · T (qj )σ 1 T (qj+1 )σ 1 (12.17)
B 0
Let us write: !
q 1
M (q) := T (q)σ 1 = (12.18)
10
(The reader should check that this is a consistent definition for all n.) One easily
generates:
N1 (q1 ) = q1
N2 (q1 , q2 ) = 1 + q1 q2 (12.20)
N3 (q1 , q2 , q3 ) = q1 + q3 + q1 q2 q3
D1 (q1 ) = 1
D2 (q1 , q2 ) = q2 (12.21)
D3 (q1 , q2 , q3 ) = 1 + q2 q3
These polynomials are closely related to continued fractions, defined as:
1
[q1 , q2 , q3 , · · · , qj ] := q1 + 1 (12.22)
q2 + q3 +···+ q1
j
Pn (q1 , . . . , qn )
[q1 , q2 , q3 , · · · , qn ] := (12.25)
Qn (q1 , . . . , qn )
– 254 –
we see that
1
[q1 , q2 , q3 , · · · , qn+1 ] = q1 +
[q2 , . . . , qn+1 ]
(12.26)
q1 Pn (q2 , . . . , qn+1 ) + Qn (q2 , . . . , qn+1 )
=
Pn (q2 , . . . , qn+1 )
So, Pn , Qn satisfy the same recursion relations as Nn , Dn , respectively, and since the
initial values are also the same we conclude that Pn = Nn and Qn = Dn .
Exercise
Check the Lamé bound for the two examples above.
Exercise
Given one solution for (12.7), find all the others.
A 1
= q1 + 1 := [q1 , q2 , q3 , · · · , qj ] (12.27)
B q2 + q +···+ 1
3 qj
The fractions [q1 ], [q1 , q2 ], [q1 , q2 , q3 ], . . . are known as the convergents of the continued
fraction.
b.) Show that 128
128
Answer : Write M (q1 ) · · · M (qn+1 ) = (M (q1 ) · · · M (qn )) · M (qn+1 ).
– 255 –
12.2 Application: Expressing elements of SL(2, Z) as words in S and T
The group SL(2, Z) is generated by
! !
0 −1 11
S := & T := (12.30)
1 0 01
as a word in S and T .
First, note the following simple
Then
gh = T n (12.33)
for some integer n ∈ Z.
The proof is almost immediate by combining the criterion that gh ∈ SL(2, Z) has
determinant one and yet must have the first column (1, 0).
Now, suppose h is such that A > C > 0. Then (A, C) = 1 and hence we have the
Euclidean algorithm to define integers q` , ` = 1, . . . N + 1, where N ≥ 1, such that
A = q1 C + r1 0 < r1 < C
C = q2 r1 + r2 0 < r2 < r1
r1 = q3 r2 + r3 0 < r3 < r2
.. .. (12.34)
. .
rN −2 = qN rN −1 + rN 0 < rN < rN −1
rN −1 = qN +1 rN
with rN = (A, C) = 1. (Note you can interpret r0 = C, as is necessary if N = 1.) Now, ♣N = 0 here? ♣
write the first line in the Euclidean algorithm in matrix form as:
! ! !
1 −q1 A r1
= (12.35)
0 1 C C
We would like to have the equation in a form that we can iterate the algorithm, so we need
the larger integer on top. Therefore, rewrite the identity as:
! ! !
1 1 −q1 A C
σ = (12.36)
0 1 C r1
– 256 –
We can now iterate the procedure. So the Euclidean algorithm implies the matrix identity:
! !
A 1
g̃ = (12.37)
C 0
We can easily modify the equation to obtained a desired element g. We divide the argument
into two cases:
1. Suppose first that N + 1 = 2s is even. Then we group the factors of g̃ in pairs and
write
(σ 1 T −q2` )(σ 1 T −q2`−1 ) = (σ 1 σ 3 )(σ 3 T −q2` σ 3 )(σ 3 σ 1 )T −q2`−1
(12.40)
= −ST q2` ST −q2`−1
2. Now suppose that N + 1 = 2s + 1 is odd. Then we rewrite the identity (12.37) as:
! !
1 A 1
σ g̃ = (12.42)
C 0
– 257 –
while !
1 0
= ST m S −1 (12.47)
−m 1
Thus, if A > 0 we can use this operation to shift C so that 0 ≤ C < A. In case A < 0 we
can multiply by S 2 = −1 to reduce to the case A > 0. Finally, if A = 0 then
!
0 ±1
h= (12.48)
∓1 n
and we write !
0 −1
ST n = (12.49)
1 n
♣Need to
summarize the
result in a useful
12.3 Products Of Cyclic Groups And The Chinese Remainder Theorem way ♣
We will consider the direct product of cyclic groups. According to our general notation
we would write this as Zm1 × Zm2 . However, since Zm is also a ring the notation Zm1 ⊕ Zm2
also often used, and we will use it below, especially when we write our Abelian groups
additively.
Let us begin with the question: Is it true that
?
Zm1 × Zm2 ∼
=Zm1 m2 . (12.51)
In general (12.51) is false!
Exercise
a.) Show that Z4 is not isomorphic to Z2 ⊕ Z2 . (There is a one-line proof.) 129
Write g = gcd(m1 , m2 ) and ` = lcm(m1 , m2 ). Then there are two natural exact
sequences:
1 → Zg → Zm1 × Zm2 → Z` → 1 (12.52)
0 → Z/`Z → Z/m1 Z ⊕ Z/m2 Z → Z/gZ → 0 (12.53)
129
Answer : Every element in Z2 ⊕ Z2 is of order two. But some elements of Z4 have order four. The but
the order of a group element is preserved under isomorphism.
– 258 –
In fact, we will show below that
Zm1 × Zm2 ∼
= Zg × Z` (12.54)
Remarks:
1. The sequence (12.52) is easier to write down multiplicatively, while (12.53) is easier
to write down additively. See the discussion below. (Of course, both are true in
either formulation!)
2. If g = 1 since Z1 = Z/Z is the trivial group we can indeed conclude that Zm1 ×Zm2 ∼
=
Zm1 m2 but otherwise this is false. We will return to this point.
Now, let us prove (12.52) and (12.53).
Recall that
m1 m2 = g` (12.55)
a fact that will be useful momentarily. (If you do not know this we will prove it below.)
It will also be useful to write m1 = µ1 g and m2 = µ2 g where µ1 , µ2 are relatively prime.
Thus there are integers ν1 , ν2 with
µ1 ν1 + µ2 ν2 = 1 (12.56)
and hence
m1 ν1 + m2 ν2 = g. (12.57)
To prove (12.52) think of Zm as the multiplicative group of mth roots of 1, so they are
all subgroups of U (1). Now define a group homomorphism:
π : Zm1 × Zm2 → Z` (12.58)
by:
π : (ξ1 , ξ2 ) → ξ1 ξ2 (12.59)
That is, we merely multiply the two entries. (This makes it clear that it is a group homo-
morphism since the group law is multiplication of complex numbers and that multiplication
is commutative.) Here ξ1 is an mth th
1 root of unity and ξ2 is an m2 root of unity. The only
thing you need to check is that indeed then ξ1 ξ2 is an `th root of unity, so π indeed maps
into Z` .
2πi 2πi
Now we prove that π is surjective: Let ω1 = e m1 and ω2 = e m2 . These are generators
of Zm1 and Zm2 . Choose integers ν1 , ν2 so that ν1 m1 + ν2 m2 = g then π maps
π : (ω1ν2 , ω2ν1 ) 7→ ω1ν2 ω2ν1
ν2 ν1
= exp 2πi +
m1 m2
m2 ν2 + m1 ν1
= exp 2πi
m1 m2 (12.60)
g
= exp 2πi
m1 m2
1
= exp 2πi
`
– 259 –
But exp 2πi 1` is a generator of the multiplicative group of `th roots of unity, isomorphic
to Z` , and hence the homomorphism π is onto. Thus, we have checked exactness of the
sequence at Z` .
On the other hand the injection map
is defined by identifying Zg with the multiplicative group of g th roots of unity and just
sending:
ι(ξ) = (ξ, ξ −1 ) (12.62)
Note that a g th root of unity ξ has the property that ξ ±1 is also both an mth
1 and an m2
th
root of unity. So this makes sense. It is now easy to check that indeed the kernel of π
is the image of ι. Since π takes the product of the two entries it is immediate from the
definition (12.62) that im(ι) ⊂ ker(π). On the other hand, if π(ξ1 , ξ2 ) = ξ1 ξ2 = 1 then
clearly ξ2 = ξ1−1 , so this must be in the image of ι. Now we have checked exactness at the
middle of the sequence. Exactness at Zg is trivial. This concludes the proof of (12.52) ♠
It is worth noting that we can write “additive” version of the maps ι and π as:
ι(x) = µ1 x ⊕ (−µ2 x)
(12.63)
π(x1 ⊕ x2 ) = µ2 x1 + µ1 x2
You should check that written this way it is well defined, and the sequence is exact.
Exercise
a.) Show that there is an exact sequence
ι π
0 → Z/`Z→Z/m1 Z ⊕ Z/m2 Z→Z/gZ → 0 (12.64)
where
π : x1 ⊕ x2 7→ (x1 − x2 )modg . (12.65)
b.) Show that if we think of these groups as groups of roots of unity then we have
π(ξ1 , ξ2 ) = ξ1µ1 ξ2−µ2 and ι(ω) = (ω µ2 , ω µ1 ) with m1 = µ1 g and m2 = µ2 g.
Zm1 × Zm2 ∼
= Zg × Z` (12.67)
First, it follows from either of the two exact sequences we proved above that if
(m1 , m2 ) = 1 then indeed
Zm1 m2 ∼
= Zm1 × Zm2 (12.68)
– 260 –
Next, recall that any integer can be decomposed into its prime factors:
Y
m= pvp (m) (12.69)
p
where vp (m) ∈ Z+ , known as the valuation of m at p is zero for all but finitely many
primes. (So we have an infinite product of 1’s on the RHS of the above equation.)
Now in terms of the prime factorizations of m1 , m2 we can write:
Y
g = gcd(m1 , m2 ) = pmin[vp (m1 ),vp (m2 )]
p
Y (12.70)
` = lcm(m1 , m2 ) = pmax[vp (m1 ),vp (m2 )]
p
Applying this to each of the two factors in Zm1 × Zm2 and using G1 × G2 ∼
= G2 × G1 to
arrange the factors so the minimum power is on the left and maximum on the right and
regrouping gives (12.67). In equations:
Zm1 × Zm2 ∼
Y Y
= Zpνp (m1 ) × Zpνp (m2 )
p p
∼
Y Y
= Zpmin[νp (m1 ),νp (m2 )] × Zpmax[νp (m1 ),νp (m2 )] (12.72)
p p
∼
= Zg × Z`
A second proof gives some additional insight by providing an interesting visual picture
of what is going on, as well as relating this fact to lattices. It is related to the first by
“taking a logarithm” and involves exact sequences of infinite groups which induce sequences
on finite quotients.
Consider the sublattice of Z ⊕ Z given by
!
m1 α
Λ = m1 Z ⊕ m2 Z = { |α, β ∈ Z} (12.73)
m2 β
– 261 –
! ! !
x0 µ2 µ1 x
= (12.76)
y0 −ν1 ν2 y
that is ! ! !
x ν2 −µ1 x0
= (12.77)
y ν1 µ2 y0
which we prefer to write as:
! ! !
x ν2 −µ1
= x0 + y0 (12.78)
y ν1 µ2
We interpret this as saying that x0 , y 0 are the coordinates of the vector (x, y) ∈ Z2 relative
to the new basis vectors for Z2 .
! !
ν2 −µ1
v1 = v2 = (12.79)
ν1 µ2
The good property of this basis is that the smallest multiple of v1 that sits in Λ is `v1
(prove this) 130 Similarly, the smallest multiple of v2 in Λ is gv2 . Thus, we have a way of
writing Z2 as Zv1 ⊕ Zv2 such that the projection of Λ to the v1 axis is the group `Z while
the kernel is the subgroup of Zv2 that maps into Λ, and that is just ∼ = gZ.
2
Put differently, there is a homomorphism ψ : Z → Z that takes
!
x
ψ: 7→ x0 . (12.80)
y
This is the projection on the v1 axis. This defines a surjective homomorphism onto Z.
(Explain why.) On the other hand, using (12.76) and µ1 µ2 g = ` we see that the image of
Λ under ψ is `Z. Therefore, using the exercise result (10.24) ψ descends to a map
ψ̄ : Z2 /Λ → Z/`Z (12.81)
is in the kernel of ψ̄, and moreover it generates a cyclic subgroup of order g in Z2 /Λ. By
counting, this cyclic subgroup must be the entire kernel of ψ̄. Therefore we have an exact
sequence
0 → Zg → Z2 /Λ → Z` → 0 (12.83)
**************************
AND THERE IS A MAP TO y 0 AND TOGETHER THESE GIVE ISOMORPHISM
TO Z/`Z ⊕ Z/gZ.
**************************
– 262 –
This concludes our second proof. ♠ ♣Should really add
a figure to illustrate
this. Unfortunately
the first really
nontrivial case is
m1 = 2 · 3 and
m2 = 2 · 5 so
` = 30. ♣
Exercise
Using the Kronecker theorem show that if a finite Abelian group G is not a cyclic
group then there is a nontrivial divisor n of |G| so that g n = 1 for all g ∈ G.
where M = m1 m2 · · · mr .
Proof :
The fastest proof makes use of the previous result and induction on r.
A second proof offers some additional insight into solving simultaneous congruences:
We construct a homomorphism
by
ψ(x) = (xmodm1 , xmodm2 , . . . , xmodmr ) (12.86)
We first claim that ψ(x) is onto. That is, for any values a1 , . . . , ar we can solve the
simultaneous congruences:
x = a1 modm1
x = a2 modm2
.. .. (12.87)
. .
x = ar modmr
– 263 –
Q
To prove this note that m̂i := M/mi = j6=i mj is relatively prime to mi (by the
hypothesis of the theorem). Therefore there are integers xi , yi such that
xi mi + yi m̂i = 1 (12.88)
Therefore if we set
r
X
x= ai gi (12.90)
i=1
Remarks
1. Equation (12.67) is used implicitly all the time in physics, whenever we have two
degrees of freedom with different but commensurable frequencies. Indeed, it is used
all the time in everyday life. As a simple example, suppose you do X every other day.
You will then do X on Mondays every other week, i.e., every 14 days, because 2 and 7
are relatively prime. More generally, consider a system with a discrete configuration
space Z/pZ thought of as the multiplicative group of pth roots of 1. Suppose the time
evolution for ∆t = 1 is ωpr → ωpr+1 where ωp is a primitive pth root of 1. The basic
period is T = p. Now, if we have two oscillators of periods p, q, the configuration
space is Zp × Zq . The basic period of this system is - obviously - the least common
multiple of p and q. That is the essential content of (??).
2. Our second proof shows that in fact equation (12.84) is a statement of an isomorphism
of rings.
3. One might wonder how the theorem got this strange name. (Why don’t we refer
to the “Swiss-German theory of relativity?”) The theorem is attributed (see, e.g.
Wikipedia) to Sun-tzu Suan-ching in the 3rd century A.D. (He should not be con-
fused with Sun Tzu who lived in the earlier Spring and Autumn period and wrote The
Art of War.) For an interesting historical commentary see 131 which documents the
historical development in India and China up to the definitive treatments by Euler,
Lagrange, and Gauss who were probably unaware of previous developments hundreds
of years earlier. The original motivation was apparently related to construction of
calendars, and this is certainly mentioned by Gauss in his renowned book Disqui-
sitiones Arithmeticae. The Chinese calendar is based on both the lunar and solar
cycles. Roughly speaking, one starts the new year based on both the winter solstice
– 264 –
and the new moon. Thus, to find periods of time in this calendar one needs to ♣Some students
with Chinese
solve simultaneous congruences. I suspect the name “Chinese Remainder Theorem” background say this
is wrong. Check it
is an invention of 19th century mathematicians. Hardy & Wright (1938) do not call out. ♣
Exercise
a.) Show that the Chinese Remainder theorem is false if the mi are not pairwise
relatively prime.
b.) Show that the obstruction to finding a solution x to x = ai modmi is given by the
reductions (ai − aj )mod(mi , mj ) over all pairs i 6= j. That is, a solution exists iff all of
these vanish.
– 265 –
defines an automorphism of G. Indeed I(a) ◦ I(b) = I(ab) and hence I : G → Aut(G)
is a homomorphism. The subgroup Inn(G) of such automorphisms is called the group of
inner automorphisms. Note that if a ∈ Z(G) then I(a) is trivial, and conversely. Thus we
have:
Inn(G) ∼
= G/Z(G). (13.2)
Remarks
1. In practice one often reads or hears the statement that an element ϕ ∈ Aut(G)
is an “outer automorphism.” What this means is that it projects to a nontrivial
element of Out(G). However, strictly speaking this is an abuse of terminology and
an outer automorphism is in the quotient group (13.4). These notes might sometimes
perpetrate this abuse of terminology.
2. Note that for any abelian group G all nontrivial automorphisms are outer automor-
phisms.
Example 13.1: Consider Aut(Z3 ). This group is Abelian so all automorphisms are outer.
Thinking of it multiplicatively, the only nontrivial choice is ω → ω −1 . If we think of
A3 ∼
= Aut(Z3 ) then we are taking
So: Aut(Z3 ) ∼
= Z2 .
Example 13.2: Consider Aut(Z4 ). Think of Z4 as the group of fourth roots of unity,
generated by ω = exp[iπ/2] = i. A generator must go to a generator, so there is only
one possible nontrivial automorphism: φ : ω → ω 3 . Note that ω → ω 2 is a nontrivial
homomorphism of Z4 → Z4 , but it is not an automorphism. Thus Aut(Z4 ) ∼
= Z2 .
– 266 –
Example 13.3: Consider Aut(Z5 ). Think of Z5 as the group of fifth roots of unity,
generated by ω = exp[2πi/5]. Now there are several automorphisms: φ2 defined by its
action on the generator ω → ω 2 . Similarly, we can define φ3 , by ω → ω 3 and φ4 , by
ω → ω 4 . Letting φ1 denote the identity we have
So Aut(Z5 ) ∼
= Z4 . The explicit isomorphism is
φ2 → 1̄
φ4 → 2̄ (13.9)
φ3 → 3̄
Example 13.4: Consider Aut(ZN ), and let us think of ZN multiplicatively as the group
of N th roots of 1. An automorphism φ of ZN must send ω 7→ ω r for some r. On the
other hand, ω r must also be a generator of ZN . Automorphisms must take generators to
generators. Hence r is relatively prime to N . This is true iff there is an s with
rs = 1modN (13.10)
where a, b, c are all distinct. We claim that x, y, z, w must comprise precisely three distinct
letters. We surely can’t have (xy) = (zw) because φ is 1-1, and we also can’t have (xy)
and (zw) commuting because the group commutator of (ab) and (ac) is (abc). Therefore
we can write
φ((ab)) = (xy) φ((ac)) = (xz) (13.13)
Therefore, we have defined a permutation a → x and φ is the inner automorphism associ-
ated with this permutation.
– 267 –
Now let us consider the size of the conjugacy classes. This was computed in exercise
*** above. The size of the conjugacy class of transpositions is of course
n n!
= (13.14)
2 (n − 2)!2!
The size of a conjugacy class of the form (1)k (2)` with k + 2` = n is
n!
(13.15)
(n − 2`)!`!2`
Setting these equal results in the identity
(n − 2)!
= `!2`−1 n ≥ 2` (13.16)
(n − 2`)!
For a fixed ` the LHS is a polynomial in n which is growing for n ≥ 2` and therefore
bounded below by (2` − 2)!. Therefore we consider whether there can be a solution with
n = 2`:
(2` − 2)! = `!2`−1 (13.17)
For ` = 3, corresponding to n = 6, there is a solution, but for ` > 3 we have (2` − 2)! >
`!2`−1 . The peculiar exception n = 6 is related to the symmetries of the icosahedron. For
more information see
1.http://en.wikipedia.org/wiki/Automorphisms of the symmetric and alternating groups
2. http://www.jstor.org/pss/2321657
3. I.E. Segal, “The automorphisms of the symmetric group,” Bulletin of the American
Mathematical Society 46(1940) 565.
Example 13.6: Automorphisms Of Alternating Groups. For the group An ⊂ Sn there
is an automorphism which is not obviously inner: Conjugation by any odd permutation.
Recall that Out(G) = Aut(G)/Inn(G) is a quotient group so conjugation by any odd
permutation represents the same element in Out(G). If we consider A3 ⊂ S3 then
(12)(123)(12)−1 = (132) (13.18)
is indeed a nontrivial automorphism of A3 and since A3 is abelian this automorphism must
be an outer automorphism. In general conjugation by an odd permutation defines an outer
automorphism of An . For example suppose conjugation by (12) were inner. Then there
would be an even permutation a so that conjugation by a · (12) centralizes every h ∈ An .
But a · (12) together with An generates all of Sn and then a · (12) would have to be in
the center of Sn , a contradiction. Thus, the outer automorphism group of An contains a
nontrivial involution. Again for n = 6 there is an exceptional outer automorphism.
The above example nicely illustrates a general idea: If N / G is a normal subgroup of
G and g ∈/ H then conjugation by g defines an automorphism H → H which is, in general,
not an inner automorphism.
Example 13.7: Consider G = GL(n, C). Then A → A∗ is an outer automorphism: That
is, there is no invertible complex matrix S ∈ GL(n, C) such that, for every invertible matrix
A ∈ GL(n, C) we have
A∗ = SAS −1 (13.19)
– 268 –
Exercise Outer Automorphisms Of Some Matrix Groups
a.) Prove (13.19). 133
b.) Consider maps of GL(n, C) given by A → Atr , A → A−1 and A → Atr,−1 . Which
of these are automorphisms? Which of these are outer automorphisms?
c.) Consider G = SU (2). Is A → A∗ an outer automorphism? 134
d.) Consider the automorphism of G = SO(2)
Exercise Automorphisms of Z
Show that Aut(Z) ∼
= Z2 . 135
Exercise
Although Z2 does not have any automorphisms the product group Z2 ⊕ Z2 certainly
does.
a.) Show that an automorphism of Z2 ⊕ Z2 must be of the form
133
Hint: Consider the invertible matrix A = i1n×n .
134
Answer : No! Note that (iσ k )∗ = −i(σ k )∗ = (iσ 2 )(iσ k )(iσ 2 )−1 . But iσ 2 ∈ SU (2) and every SU (2)
matrix is a real linear combination of 1 and iσ k . This has an important implication for the representation
theory of SU (2): Every irreducible representation is either real or “pseudoreal” (quaternionic).
135
Answer : The most general homomorphism Z → Z is the map n 7→ an for some integer a. But for an
automorphism a must be mutliplicatively invertible in the integers. Therefore a is +1 or −1.
136
Hint: Consider what the group does ! to the! three nontrivial
! elements (1, 0), (0, 1), and (1, 1).! The
0 1 1 1 1 0 1 1
three transpositions correspond to , , and the two elements of order 3 are and
1 0 0 1 1 1 1 0
!
0 1
.
1 1
– 269 –
c.) Now describe Aut(Z4 × Z4 ). 137
Exercise Automorphisms of ZN p
(Warning: This is hard and uses some other ideas from algebra.)
Let p be prime. Describe the automorphisms of ZN p , and show that the group has
order 138
N −1
|Aut(ZN N N N 2 N
p )| = (p − 1)(p − p)(p − p ) · · · (p − p ) (13.23)
– 270 –
b.) Any two isomorphisms Ψ, Ψ0 ∈ Isom(G1 , G2 ) are related by Ψ0 = φ ◦ Ψ where
φ ∈ Aut(G2 ).
The set Isom(G1 , G2 ) with G1 , G2 not equal but isomorphic is a good example of what
is called a torsor. A torsor for a group G is a set X with a free transitive action.
In general, even if we omit 0̄, Z/N Z is not a group with respect to the multiplication
law (find a counterexample). Nevertheless, Z/N Z with +, × is an interesting object which
is an example of something called a ring. See the next chapter for a general definition of a
ring.
Let us define the group of units in the ring Z/N Z:
where (m, N ) is the greatest common divisor of m and N . We will also denote this group
as Z∗N .
Then, (Z/N Z)∗ is a group with the law (13.26) ! Clearly the multiplication is closed
and 1̄ is the unit. The existence of multiplicative inverses follows from (12.8).
Moreover, as we have seen above, we can identify
Aut(Z/N Z) ∼
= (Z/N Z)∗ (13.28)
– 271 –
The order of the group (Z/N Z)∗ is denoted φ(N ) and is called the Euler φ-function or
Euler’s totient function. 140 One can check that
φ(2) = 1
φ(3) = 2 (13.31)
φ(4) = 2
What can we say about the structure of Z∗N ? Now, in general it is not true that
Aut(G1 × G2 ) and Aut(G1 ) × Aut(G2 ) are isomorphic. Counterexamples abound. For
example Aut(Z) ∼ = Z2 but Aut(Z ⊕ Z) ∼ = GL(2, Z). Nevertheless, it actually is true that
∼
Aut(Zn × Zm ) = Aut(Zn ) × Aut(Zm ) when n and m are relatively prime. To prove this,
let v1 be a generator of Zn and v2 a generator of Zm and let us write our Abelian group
additively. The general endomorphism of Zn ⊕ Zm is of the form
v1 → αv1 + βv2
(13.32)
v2 → γv1 + δv2
with α, β, γ, δ ∈ Z. Now impose the conditions nv1 = 0 and mv2 = 0 and the fact that n̄ is
multiplicatively invertible in Zm and m̄ is multiplicatively invertible in Zn to learn that in
fact an endomorphism must have β = 0mod m and γ = 0mod n. Therefore βv2 = 0 and
γv1 = 0. Therefore, an automorphism of Zn ⊕ Zm is determined by v1 → αv1 with ᾱ ∈ Z∗n
and v2 → δv2 with δ̄ ∈ Z∗m and hence Aut(Zn ⊕ Zm ) ∼ = Aut(Zn ) × Aut(Zm ) when n and m
are relatively prime. (The corresponding statement is absolutely false when they are not
relatively prime.) So we have:
Z∗nm ∼
= Z∗n × Z∗m (13.33)
In particular, φ is a multiplicative function: φ(nm) = φ(n)φ(m) if (n, m) = 1. Therefore,
if N = pe11 · · · perr is the decomposition of N into distinct prime powers then
(Z/N Z)∗ ∼
= (Z/pe11 Z)∗ × · · · (Z/perr Z)∗ (13.34)
Moreover, (Z/pe Z)∗ is of order φ(pe ) = pe − pe−1 , as is easily shown 141 and hence
1
(pei i − piei −1 ) = N
Y Y
φ(N ) = (1 − ) (13.35)
p
i p|N
Remark: For later reference in our discussion of cryptography note one consequence
of this: If we choose, randomly - i.e. with uniform probability density - a number between
1 and N the probability that it will be relatively prime to N is
φ(N ) Y 1
= (1 − ) (13.36)
N p
p|N
140
Do not confuse φ(N ) with the φa above!
141
Proof: The numbers between 1 and pe which have gcd larger than one must be of the form px where
1 ≤ x ≤ pe−1 . So the rest are relatively prime.
– 272 –
This means that, if N is huge and a product of just a few primes, then a randomly chosen
number will almost certainly be relatively prime to N .
is cyclic but
when e ≥ 3.
In fact, more generally, it turns out that (Z/nZ)∗ is cyclic iff n ∈ {1, 2, 4, pk , 2pk } where
p runs over odd primes and k > 0.
Note that if we take a product of two distinct odd prime powers then
But φ(pk11 ) and φ(pk22 ) are both even, being divisible by p1 − 1 and p2 − 1, respectively, and
hence are not relatively prime, and hence (Z/(pk11 pk22 Z)∗ is not cyclic.
Examples
31 = 3, 32 = 2, 33 = 6, 34 = 4, 35 = 5, 36 = 1 mod7 (13.40)
51 = 5, 52 = 4, 53 = 6, 54 = 2, 55 = 3, 56 = 1 mod7 (13.41)
– 273 –
800
600
400
200
Figure 33: A plot of the residues of 2x δx modulo N = 2 · 3 · 5 · 7 · 11 = 2310, for 1 ≤ x ≤ 500. Here
δx = 0 if gcd(x, N ) > 1 so that we only see the values in (Z/N Z)∗ . Notice the apparently random
way in which the value jumps as we increase x.
3. (Z/8Z)∗ = {1, 3, 5, 7} ∼
= Z2 × Z2 . Note that 32 = 52 = 72 = 1mod8 and 3 · 5 = 7mod8,
so we can take 3 and 5 to be the generators of the two Z2 subgroups.
– 274 –
4. (Z/15Z)∗ ∼
= Z2 × Z4 is not cyclic.
Remarks
1. When (Z/nZ)∗ is cyclic a generator is called a primitive root modulo n, and is not
to be confused with a primitive nth root of one. It is trivial to find examples of the
latter and highly nontrivial to find examples of the former.
2. The values of f (x) = ax modN for (a, N ) = 1 appear to jump about randomly as a
function of x, as shown in Figure 33. Therefore, finding the period of this function,
that is, the smallest positive integer r so that f (x + r) = f (x) is not easy. This is
significant because of the next remark.
But now, N divides the LHS so b ∓ 1 = 0modN which implies b = ±1modN , again
contrary to assumption. Thus, d± are nontrivial divisors of N .
To give a concrete example, take N = 3 · 5 · 7 = 105, so φ(N ) = 48. Then the
period of f (x) = 2x is r = 12, and b = 212/2 = 64. Well gcd(64 + 1, 105) = 5 and
gcd(64 − 1, 105) = 21 are both divisors of 105. In fact 105 = 5 · 21.
4. Artin’s Conjecture: Finding a generator is not always easy, and it is related to some
deep conjectures in number theory. For example, the Artin conjecture on primitive
roots states that for any positive integer a which is not a perfect square there are an
infinite number of primes so that ā is a generator of the cyclic group (Z/pZ)∗ . In
fact, if a is not a power of another integer, and the square-free part of a is not 1mod4
then Artin predicts the density of primes for which a is a generator to be
Y 1
1− = 0.37.... (13.44)
p(p − 1)
Artin primes
According to the Wikipedia page, there is not a single number a for which the con-
jecture is known to be true. For example, the primes p < 500 for which a = 2 is a
– 275 –
generator of (Z/pZ)∗ is
{3, 5, 11, 13, 19, 29, 37, 53, 59, 61, 67, 83,101, 107, 131, 139, 149, 163, 173, 179, 181,
197, 211, 227, 269, 293, 317, 347, 349, 373, 379,389, 419, 421, 443, 461, 467, 491}
(13.45)
5. A good reference for this material is Ireland and Rosen, A Classical Introduction to
Modern Number Theory Springer GTM
Note that a special case of this is Fermat’s little theorem: If a is an integer and p is
prime then
ap = amodp (13.47)
Remark: This theorem has important practical applications in prime testing. If we
want to test whether an odd integer n is prime we can compute 2n modn. If the result
is 6= 2modn then we can be sure that n is not prime. Now 2n modn can be computed
much more quickly with a computer than the traditional test of seeing whether the primes
√
up to n divide n. If 2n modn is indeed = 2modn then we can suspect that n is prime.
Unfortunately, there are composite numbers which will masquerade as primes in this test.
They are called “base 2 pseudoprimes.” In fact, there are numbers n, known as Carmichael
numbers which satisfy an = amodn for all integers a. The good news is that they are rare.
The bad news is that there are infinitely many of them. According to Wikipedia the first
few Carmichael numbers are
The first Carmichael number is 561 = 3 · 11 · 17 and Erdös proved that the number C(X)
of Carmichael numbers smaller than X is bounded by
κlogXlogloglogX
C(X) < Xexp − (13.49)
loglogX
– 276 –
Exercise Periodic Functions
a.) Consider the function
f (x) = 2x mod N (13.50)
for an odd integer N . Show that this function is periodic f (x + r) = f (x) for a minimal
period r which divides φ(N ).
b.) Compute the period for N = 15, 21, 105. 142
c.) More generally, if (a, N ) = 1 show that f (x) = ax mod N is a periodic function.
b.) Is f (m) = (3m)mod26 a valid code? By adding symbols or changing the alphabet
we can change the value of N above. Is f (m) = (3m)mod27 a valid code?
The RSA public key encryption system is a beautiful application of Euler’s theorem
and works as follows. The basic idea is that with numbers with thousands of digits it is
relatively easy to compute powers an modm and greatest common divisors, but it is very
difficult to factorize such numbers into their prime parts. For example, for a 1000 digit
number the brute force method of factorization requires that we sample up to
√
101000 = 10500 (13.53)
142
Answer : r = 4, 6, 12 divides φ(N ) = 8, 12, 48.
– 277 –
divisors. Bear in mind that our universe is about π × 107 × 13.79 × 109 =∼ 4 × 1017 seconds
old. 143 There are of course more efficient algorithms, but all the publicly known ones are
still far too slow.
Now, Alice wishes to receive and decode secret messages sent by any member of the
public. She chooses two large primes (thousands of digits long) pA , qA and computes nA :=
pA qA . These primes are to be kept secret. How does she find her secret thousand-digit
primes? She chooses a random thousand digit number and applies the Fermat primality
test. By the prime number theorem she need only make a few thousand attempts, and she
will find a prime. 144
Next, Alice computes φ(nA ) = (pA − 1)(qA − 1), and then she chooses a random
thousand-digit number dA such that gcd(dA , φ(nA )) = 1 and computes an inverse dA eA =
1modφ(nA ). All these steps are relatively fast and easy, because Euclid’s algorithm is very
fast. Thus there is some integer f so that
dA eA − f φ(nA ) = 1 (13.54)
That is, she solves the congruence x = 1modφ(nA ) and x = 0moddA , for the smallest
positive x and then computes eA = x/dA .
Finally, she publishes for the world to see the encoding key: {nA , eA }, but she keeps
the numbers pA , qA , φ(nA ), dA secret. This means that if anybody, say Bob, wants to send
Alice a secret message then he can do the following:
Bob converts his plaintext message into a number less than nA by writing a ↔ 01,
b ↔ 02, . . . , z ↔ 26. (Thus, when reading a message with an odd number of digits we
should add a 0 in front. If the message is long then it should be broken into pieces of
length smaller than nA .) Let Bob’s plaintext message thus converted be denoted m. It is
a positive integer smaller than nA .
Now to compute the ciphertext Bob looks up Alice’s numbers {nA , eA } on the public
site and uses these to compute the ciphertext:
Bob sends the ciphertext c to Alice over the internet. Anyone can read it.
Then Alice can decode the message by computing
Thus, to decode the message Alice just needs one piece of private information, namely the
integer dA .
143
There are π × 107 seconds in a year, to 0.3% accuracy.
144
The prime number theorem says that if π(x) is the number of primes between 1 and x then as x → ∞
we have π(x) ∼ logxx
. Equivalently, the nth prime is asymptotically like pn ∼ nlogn. This means that
the density of primes for large x is ∼ 1/logx, so if x ∼ 10n then the density is 1/n so if we work with
thousand-digit primes then after about one thousand random choices we will find a prime.
– 278 –
Now Eve, who has a reputation for making trouble, cannot decode the message without
knowing dA . Just knowing nA and eA but not the prime factorization nA = pA qA there is
no obvious way to find dA . The reason is that even though the number nA is public it is
hard to compute φ(nA ) without knowing the prime factorization of nA . Of course, if Eve
finds out about the prime factorization of nA then she can compute φ(nA ) immediately
and then quickly (using the Euclidean algorithm) invert eA to get dA . Thus, the security
of the method hinges on the inability of Eve to factor nA into primes.
In summary,
1. The intended receiver of the message, namely Alice in our discussion, knows
3. The sender of the message, namely Bob in our discussion, takes a secret message mB
and computes the ciphertext c = meBA modnA .
4. Alice can decode Bob’s message by computing mB = cdA modnA using her secret
knowledge of dA .
5. The attacker, namely Eve in our discussion, knows (nA , eA , c) but will have to work
to find dA or some other way of decoding the ciphertext.
Remarks
1. Note that the decoding will fail if m and nA have a common factor. However,
nA = pA qA and pA , qA are primes with thousands of digits. The probability that
Bob’s message is one of these is around 1 in 101000 .
145
Factor the integer n = 541 ∗ 1223. Then you know p, q and hence φ(n) = 659880. Now take e and
compute d by using the Chinese Remainder theorem to compute x = 1modφ and x = 0mode. This
gives x = 735766201 = de and hence d = 2257. Now you can compute the message from the ciphertext
m = cd modn.
– 279 –
13.2.1 How To Break RSA: Period Finding
The attacker, Eve, can read the ciphertext cmodnA . That means the attacker can try to
compute the period of the function
Suppose (as is extremely likely when nA is a product of two large primes) that c is
relatively prime to nA . Then the cyclic group hci ∈ (Z/nA Z)∗ generated by c must coincide
with the cyclic group generated by the message mB and in particular they both have the
same period r, which divides φ(nA ). Suppose Eve figures out the period r. Since the
published value eA is relatively prime to φ(nA ) it will be relatively prime to r and therefore
there exists a new decoding method: Compute dE such that
eA dE = 1 mod r (13.61)
Then
cdE = meA dE modnA = m1+`r
B modnA = mB modnA (13.62)
decodes the message.
Thus, if the attacker can find the period of f (x) the message can be decoded.
Another way in which finding the period leads to rapid decoding is through explicit
factoring:
We saw in our discussion of Z∗N that, if one has an element ā ∈ Z∗N with even period
r and b̄ = ār/2 6= ±1 then d± = gcd(b ± 1, N ) are nontrivial factors of N . Suppose there
were a quick method to find the period r. Then we could quickly factor N as follows:
1. Choose a random integer a and using Euclid check that (a, N ) = 1. If N is a product
of two large primes you will only need to make a few choices of a before succeeding.
2. Compute the period r of the function f (x) = ax modN .
3. If r is odd go back and choose another a until you get one with r even.
4. Then check that b = ar/2 6= −1modN . Again this can be done quickly, thanks to
Euclid. If you get b = −1modN go back and choose another a, until you find one that
works. The point is that, with high probability, if you pick a at random you will succeed.
So you might have a try a few times, but not many.
So, the only real bottleneck in factoring N is computing the order r of ā in Z∗N . Equiv-
alently, this is computing the period of the function f (x) = ax mod N where (a, N ) = 1.
This is where the “quantum Fourier transform” and “phase estimation” come in. Quan-
tum computers give a way to compute this period in polynomial time in N , as opposed to
classical computers which take exponential time in N . We will come back to this.
f (x) = ax modN where (a, N ) = 1. This is just a sketch. A nice and clear and elementary
account (which we used heavily) can be found in D. Mermin’s book Quantum Computer
Science and more details and a more leisurely discussion can be found there.
– 280 –
Quantum computation is based on the action of certain unitary operators on a system
of n Qbits, that is, on a Hilbert space
equipped with the standard inner product. For each factor C2 one chooses a basis {|0i, |1i},
which one should think of as, for example spin up/down eigenstates of an electron or photon
helicity polarization states. Then there is a natural basis for Hn :
Here, for each i, xi ∈ {0, 1}. One can identify the vector ~x ∈ Fn2 , the n-dimensional vector
space over the field F2 . In our discussion we will only use its Abelian group structure, so
one can also think of it as Z2 ⊕ · · · ⊕ Z2 with n summands. The basis of states (13.64) is
known as the computational basis or, the Classical basis. Now to each computational basis
vector we can assign an integer by its binary expansion:
Now, let N := 2n . We can also define the Hilbert space L2 (ZN ) of functions on the
group ZN with the natural Haar measure. Of course Hn is isomorphic to L2 (ZN ) and the
isomorphism we choose to use is the one which identifies the computational basis vector
|~xi with the delta function supported at N (~x)mod N . We will denote the latter states as
|N (~x)iiN , where the subscript indicates which Abelian group ZN we are working with.
In quantum computation one works with a Hilbert space decomposed as
The two factors have dimension Nin = 2nin and Nout = 2nout , respectively. The quantum
gates are unitary operators and, moreover, under identification of H as a tensor product
of Qbits there should be a notion of “locality” in the sense that they only act nontrivially
on “a few” adjacent factors. The locality reflects the spatial locality in some realization in
the lab in terms of, say, spin systems. Moreover, we should only have to apply “ a few”
quantum gates in a useful circuit. With an arbitrary number of gates we can construct
any unitary out of products of local ones to arbitrary accuracy. The above notions can be
made precise, but that is beyond the scope of this section. ♣Still, it is essential
to explain more
Now suppose we have a function about the notion of
“local quantum
gate” and quantum
f : (ZN )∗ → (ZN )∗ (13.67) circuit and
illustrate a few
examples of simple
(such as f (x) = ax modN for (a, N ) = 1). We would like to convert this to a map gates. ♣
We now choose a fundamental domain which is a subset of {1, 2, . . . , N − 1} for (ZN )∗ with
N < Nout and N < Nin (in fact we will eventually assume N Nout and N Nin ) so
that we can view elements of (Z/N Z)∗ as elements of the set {1, 2, . . . , N − 1} which is, in
– 281 –
turn, a subset of Z/Nin Z and Z/Nout Z. We use the function N (~x) above to define fˇ such
that
f (N (~x)) = N (fˇ(~x))mod2nout (13.69)
This does not uniquely specify fˇ but the ambiguity will not affect the discussion. To read
this equation, suppose you want to compute fˇ(~x) for some ~x ∈ Fn2 in . Then you compute
N (~x) which is a nonnegative integer between 0 and Nin . Then you reduce it modulo N .
If it is relatively prime to N you can compute f (N (~x)) and considerate the result as a
number between 1 and 2nout − 1. The above equation then pins down fˇ(~x). Using fˇ we ♣And what if N (~ x)
is not relatively
can define a unitary operator Uf by its action on the computational basis: prime to N ? Of
course, we are
thinking this is rare,
Uf : |~xi ⊗ |~y i → |~xi ⊗ |~y + fˇ(~x)i (13.70) but it can happen.
What is the best
way to extend the
where on the right-hand side addition is in the Abelian group (Z2 )nout . We will say that the function? ♣
So, n X
⊗n 1
H |~y i = √ (−1)~x·~y |~xi (13.73)
2 x∈Fn
~ 2
and in particular n X
⊗n 1
H |~0i = √ |~xi (13.74)
2 x∈Fn
~ 2
Therefore, (recall that double brackets |jii refer to the position basis in L2 (ZN )) we
have
1 nin X
⊗nin
Uf H ⊗ 1 : |0iin × |0iout → √ |jiiNin ⊗ |f (j)iiNout (13.75)
2 j∈Z Nin
Now we wish to apply this to the function f (x) = ax mod N with (a, N ) = 1 and
N is the number we would like to factorize. So, in particular we don’t want N = 2n for
some n. Nobody will be impressed if you can factor a power of two! Rather, we identify
the group Z∗N , as a set, with the integers {1, . . . , N − 1} so that it can be considered to
be a subset of the natural fundamental domain {0, . . . , Nin − 1} for ZNin , and similarly for
– 282 –
ZNout . Then to compute f we compute ax , take the residue modulo N to get an integer in
the fundamental domain, and then consider that number modulo Nout . Hence, we should
choose Nout = 2nout to be some integer larger than N . A key claim, explained in textbooks
on quantum information theory is that such a function f is nice. That is, it makes sense
to compute it with a quantum circuit.
So, we conclude that a suitable quantum circuit can implement:
1 nin X
⊗n
|kiiNin ⊗ |ak iiNout
Uf H ⊗ 1 : |0iNin × |0iNout → √
2 k∈Z Nin
0
1 nin X
= √ (|j0 iiNin + |j0 + riiNin + |j0 + 2riiNin + · · · ) ⊗ |f0 iiNout
2 f ∈Z ∗
0 N
(13.76)
In the second line we are considering Z∗N as a subset of ZNout as explained above and the
prime on the sum means that we are just summing over the values that are in the image of
f (x) = ax mod N . This will be all the values in Z∗N if a is a generator of Z∗N but in general
might be smaller. Also, j0 is some solution of f0 = aj0 mod N , and r is the period of f (x),
that is, the smallest positive integer so that f (x + r) = f (x) for all x. We can choose j0 so
that 0 ≤ j0 < r and write the RHS of (13.76) as
O−1
!
1 nin X
X
Ψ= √ |j0 + sriiNin ⊗ |aj0 iiNout (13.77)
2 0≤j <r s=0
0
It is some kind of plane wave state in L2 (ZN ), so measuring position will give no useful
information on r. Of course, we should therefore go to the Fourier dual basis to learn about
the period. In terms of the position basis of L2 (ZN ) we can apply VF T to get:
1 X O−1
X 2πi (k0 +sr)p
V F T P f0 Ψ = √ e Nin
|piiNin (13.80)
Nin O p∈Z s=0
Nin
Note that this Fourier transform is quite nontrivial and nontransparent in the compu-
tational basis because of the nontrivial isomorphism between Hn and L2 (ZN ) with N = 2n .
– 283 –
Nevertheless, and this is nontrivial and part of the magic of Shor’s algorithm, the Fourier
transform operator VF T can be implemented nicely with quantum gates in the computa-
tional basis. Again, the textbooks on quantum information theory give explicit construction
of VF T as a quantum circuit in the computational basis. It is exactly at this point that the
exponential speed-up of the period finding takes place:
Now to find the period we make a measurement of the amplitudes for the various
Fourier components |piiNin . (Here we are using the isomorphism between ZNin and its
unitary dual.) The probability to measure p is
O−1
1 X 2πi N p
P rob(p) = | (e in /r )s |2
Nin O
s=0
2
Or
(13.81)
1 sin π·p· Nin
=
Nin O sin2 π · p · r
Nin
The basic idea is that the probability, as a function of p, will be peaked near values of p
from which we can deduce the crucial number r, but we need to be a bit careful at this
point.
Let us ask what is the probability that we will measure a value p of the form
Nin
pj = j + δj |δj | ≤ 1/2 (13.82)
r
If pj is of this form with any value j = 1, . . . , r−1 then we can extract r. Thus, substituting
such a value for pj into the formula for the probability we have
2 Or
1 sin π · δj · Nin
(13.83)
Nin O sin2 π · δ · r
j Nin
Now recall that Or/Nin is equal to 1 to excellent accuracy. Suppose we also choose a
number of Qbits so that
Nin N > r (13.84)
– 284 –
Then the argument of the sign in the denominator is extremely small and we can replace
sin(x) by x. So we get:
1sin2 (πδj )
P rob(pj ) ∼
= 2
Nin O
π · δj · Nrin
Nin 1 sin πδj 2
= · (13.85)
rO r πδj
2
∼ 1 sin πδj
=
r πδj
1.0
0.8
0.6
0.4
0.2
-3 -2 -1 1 2 3
Figure 34: A plot of the function sin2 (πx)/(πx)2 as a function of x. This function, very familiar
from the theory of diffraction, is symmetric in x → −x and monotonically decreasing in the interval
0 ≤ x ≤ 1/2. It therefore takes its minimal value in this interval at x = 1/2 where it is about
∼
= 0.405.
sin x 2
Now for 0 ≤ δ ≤ π/2 we have x ≥ π so
X r−1 4 ∼
P rob(pj ) ≥ = 0.4 (13.86)
r π2
j
Now, using various tricks one can raise this 40% value to near 100%. For these tricks
consult Mermin’s book. Two other useful textbooks on quantum information theory and
quantum computing where one can look these things up include:
We have seen a few examples of direct products of groups above. We now study a more
subtle notion, the semidirect product. The semidirect product is a twisted version of the
direct product of groups H and G which can be defined once we are given one new piece
of extra data. The new piece of data we need is a homomorphism
α : G → Aut(H). (14.1)
– 285 –
For an element g ∈ G we will denote the corresponding automorphism by αg . The value
of αg on an element h ∈ H is denoted αg (h). Thus αg (h1 h2 ) = αg (h1 )αg (h2 ) because αg
is a homomorphism of H to itself while we also have αg1 g2 (h) = αg1 (αg2 (h)) because α is
a homomorphism of G into the group of automorphisms Aut(H). We also have that α1 is
the identity automorphism. (Prove this!)
Using the extra data given by α we can form a more subtle kind of product called the
semidirect product H o G, or H oα G when we wish to stress the role of α. In the math
literature on group theory the notation H : G is also used. This group is the Cartesian
product H × G as a set but has the “twisted” multiplication law:
Example 14.1: Infinite dihedral group. Let G = {e, σ} ∼ = Z2 with generator σ, and let
H = Z, written additively. Then define a nontrivial α : G → Aut(H) by letting ασ act on
x ∈ H as ασ (x) = −x. Then Z o Z2 is a group with elements (x, e) and (x, σ), for x ∈ Z.
Note the multiplication laws:
(x1 , e)(x2 , e) = (x1 + x2 , e)
(x1 , e)(x2 , σ) = (x1 + x2 , σ)
(14.4)
(x2 , σ)(x1 , e) = (x2 − x1 , σ)
(x1 , σ)(x2 , σ) = (x1 − x2 , e)
and hence the resulting group is nonabelian with this twisted multiplication law. Since
Aut(Z) ∼
= Z2 this is the only nontrivial semidirect product we can form. This group is
known as the infinite dihedral group sometimes denoted D∞ . It has a presentation:
Z o Z2 ∼
= hr, s|s2 = 1 srs = r−1 i (14.5)
146
Answer : [(h1 , g1 ), (h2 , g2 )] = h1 αg1 (h2 )αg1 g2 g−1 (h−1 −1 −1 −1
1 )αg1 g2 g −1 g −1 (h2 ), g1 g2 g1 g2 .
1 1 2
– 286 –
(e.g. take s = (0, σ) and r = (1, e))
Remark: Taking x = s and y = rs we see that D∞ also has a presentation as a
Coxeter group:
Z o Z2 ∼ = hx, y|x2 = 1 y 2 = 1i (14.6)
Indeed it is the Weyl group for the affine Lie group LSU (2).
Example 14.2: Finite dihedral group. We can use the same formulae as in Example 1,
retaining G = {e, σ} ∼
= Z2 but now we take H = Z/N Z. We still have
ασ : n̄ → −n̄ (14.7)
where we are writing Z/N Z additively. The semi-direct product of Z/N Z with Z2 using
this automorphism gives one definition of an important group, the finite dihedral group DN .
Observe that, when we write ZN = µN multiplicatively, the automorphism is ασ (ω j ) = ω −j .
In this way we can obtain a presentation of DN of the form:
ZN oα Z2 ∼
= hr, s|s2 = 1, srs = r−1 , rN = 1i (14.8)
Note that here we have switched to a multiplicative model for the group ZN . The group
has the order given by the cardinality of the Cartesian product ZN × Z2 so it has order
2N . Note that using the relations, every word in the r, s can be converted to the form rx
or rx s with 0 ≤ x ≤ N − 1, thus accounting for all 2N elements.
3 4 3
1 2 1 2
Figure 35: A regular 3-gon and 4-gon in the plane, centered at the origin. The subgroup of O(2)
that preserves these is D3 and D4 , respectively.
– 287 –
rotation around the origin: φr = R(2π/N ). This generates a group isomorphic to ZN and
in this context it is usually denoted CN . If P is any reflection through a line through the
origin then φs = P will satisfy all the relations. The resulting group of transformations of
the plane generated by φr and φs is isomorphic to DN . Moreover, if we consider the regular
N -gon centered at the origin of the plane R2 then the subgroup of O(2) that maps it to
itself is isomorphic to DN , although to preserve the polygon we must choose P carefully
so that it is a reflection through an axis of symmetry. For example, if we consider the
regular triangle illustrated in 35 then reflection in the y-axis is a symmetry, as is rotation
by integral amounts of 2π/3. So we have a two-dimensional matrix representation of D3 :
!
−1 0
s→P =
0 1
! √ ! (14.10)
cos(2π/3) sin(2π/3) 1 −1 3
r → R(2π/3) = = √
− sin(2π/3) cos(2π/3) 2 − 3 −1
Note that, if we label the vertices of the triangle 1, 2, 3 as shown in the figure then the
various symmetries are in 1-1 correspondence with permutations of {1, 2, 3}. So in fact, we
have an isomorphism
D3 ∼= S3 (14.11)
with P → (12) and R(2π/3) → (123). Similarly, D4 is the group of symmetries of the
square. We can take s → P as before and now r → R(2π/4). Again we can label
the vertices of the square 1, 2, 3, 4. Again transformations are uniquely determined by
permutations of {1, 2, 3, 4}. However, the group of permutations we get this way is only a
subgroup of the permutation group S4 .
Clearly there is something quite different about the groups DN when N is even and
odd. This is nicely seen in the set of conjugacy classes. As you show in the exercise below
the conjugacy classes in DN are of the form
and
C(rx s) = {rx+2y s|y ∈ Z} (14.13)
Now, thinking in terms of symmetry actions on the plane, rx correspond to rotations by
2πx/N , whereas rx s correspond to reflections. Now note that for N odd, since (2, N ) = 1
the conjugacy class C(rx s) will contain all the elements of the form rz s, in other words all
the reflections. However, if N is even then there are two distinct conjugacy classes: C(rx s)
for x even and odd are distinct. This is nicely in accord with symmetries of the N -gon:
For N odd it is clear that all the symmetry axes can be mapped to each other by using the
symmetries of the N -gon. Whereas for N even there are two distinct kinds of reflection
axes: Those that go through vertices and those that go through edges.
– 288 –
Show that under the map to S4 described above the dihedral group D4 maps to the
subgroup containing just the identity, the cyclic permtuations (the rotations)
Exercise
Show that DN is a quotient of the infinite dihedral group. 147
Example 14.3: In equations (2.22) and (2.40) we found the most general form of a matrix
in O(2). It is a disjoint union of two circles, each circle being the elements of determinant
det(A) = ±1. As a group we have an isomorphism
O(2) ∼
= SO(2) o Z2 (14.17)
α : Z2 → Aut(SO(2)) (14.18)
is given by
α(σ) : R(φ) → R(−φ) (14.19)
Now choose any nonzero vector v ∈ R2 and define an isomorphism
by
Ψv : (1, σ) → Pv
(14.21)
Ψv : (R(φ), 1) → R(φ)
147
Answer : Note that N = {(x, e)|x = 0modN } ⊂ Z o Z2 is a normal subgroup and
(Z/N Z) o Z2 ∼
= (Z o Z2 )/N . (14.16)
– 289 –
Here Pv is the reflection in the line orthogonal to v. We need to check that this is a well-
defined homomorphism by checking that the images we have specified are indeed compatible
with the relations in the semidirect product. This amounts to checking that Pv2 = 1, which
is obvious, and
Pv R(φ)Pv−1 = R(−φ) (14.22)
Thanks to (14.22) the relations are indeed satisfied and now it is an easy matter to check
that Ψv is injective and surjective, so it is an isomorphism.
Note that there are many different such isomorphisms, Ψv , depending on the choice of
v. If v 0 is another nonzero vector in the plane then recall from (6.11) that Pv Pv0 = R(2θ)
where θ is the angle between v and v 0 . Then
So
Ψv0 = I(R(θ)) ◦ Ψv0 (14.24)
1. When d is odd
O(d) ∼
= SO(d) × Z2 (14.25)
2. When d is even
O(d) ∼
= SO(d) o Z2 (14.26)
In the case when d is odd the element −1d×d has determinant −1, so it is in the
nontrivial component of O(d), and yet it is also central: so conjugating elements R ∈ SO(d)
acts trivially. The semi-direct product is isomorphic to a direct product in this case. On
the other hand, when d is even −1d×d has determinant +1 and is an element of SO(d).
However, it is still true that if Ψv is reflection in the hyperplane orthogonal to a nonzero
vector v then
αv (R) := Pv RPv (14.27)
The same discussion as above shows that the dependence on v is through composition with
an inner automorphism of SO(d).
Example 14.4: Affine Euclidean Space. Recall the discussion of Affine Euclidean space
in section ******** above. We defined there the Euclidean group.
– 290 –
Some natural examples of isometries are the following: Given any vector v ∈ Rd we
have the translation operator
Tv : p → p0 (14.29)
Moreover, if R ∈ O(d) then, if we choose a point p, we can define an operation:
Rp : p + v → p + Rv (14.30)
It turns out (this is a nontrivial theorem) that all isometries are obtained by composing
such transforamtions. A simple way to express the general transformation, then, is to
choose a point p as the “origin” of the affine space thus giving an identification Ed ∼= Rd .
d
Then, to a pair R ∈ O(d) and v ∈ R we can associate the isometry: 148
{v|R} : x 7→ v + Rx ∀x ∈ Rd (14.31)
which makes clear that there is a nontrivial automorphism used to construct the semidirect
product of the group of translations, isomorphic to Rd with the rotation-inversion group
O(d).
Put differently: O(d) acts as an automorphism group of Rd :
αR : v 7→ Rv (14.33)
so we can form the abstract group Rd oα O(d). Then, once we choose an origin p ∈ Ed we
can write an isomorphism
Ψp : Rd oα O(d) → Euc(d) (14.34)
To be concrete:
Ψp (v, R) : p + x 7→ p + (v + Rx) (14.35)
As in our example of O(d) we now have a family of isomorphisms. If Ψp0 is another
such based on a different origin p0 = p + v0 then the two isomorphisms are related by a
translation. See the exercise below.
– 291 –
b.) Show that N = {(h, 1)|h ∈ H} is a normal subgroup of H o G, while Q =
{(1, g)|g ∈ G} in general is not a normal subgroup. 149 This explains the funny product
symbol o that looks like a fish: it is a combination of × with the normal subgroup symbol
/.
c.) Show that we have a short exact sequence:
1 → N → H oα G → Q → 1 (14.37)
d.) Show that H o G = N Q = QN and show that Q ∩ N = {1}. Here N Q means the
sent of elements nq where n ∈ N and q ∈ Q. 150
e.) Conversely, show that if G̃ = N Q where N is a normal subgroup of G̃ and Q is a
subgroup of G̃, (that is, every element of G̃ can be written in the form g = nq with n ∈ N
and q ∈ Q and N ∩ Q = {1} ) then G̃ is a semidirect product of N and Q. Show how
to recover the action of Q as a group of automorphisms of N by defining αq (n) := qnq −1 .
Note that αq in general is NOT an inner automorphism of N .
It is useful to think about the Euclidean group in terms of the “internal” character-
ization of semi-direct products explained in the exercise above. Here we have a normal
subgroup N := {{v|1}|v ∈ Rd } and a subgroup Q given by the set of elements of the form
{0|R}. To check that N is normal a short computation using the group law reveals
and hence:
{v|R}{w|1}{v|R}−1 = {Rw|1} (14.39)
Note that, again, thanks to the group law, π : {v|R} → R is a surjective homomorphism
Euc(d) → O(d). Thus there is an exact sequence:
150
The notation is slightly dangerous here: We are considering the group Q both as a subgroup of G and,
in equation (14.37), as a quotient. In general, as we will see below in the chapter on exact sequences, there
is no way to view a quotient of a group G as a subgroup of G. Failure to appreciate this point has led to
many, many, many errors in the physics literature.
151
Answer: Note that n1 q1 n2 q2 = n1 n2 (n−1 −1
2 q1 n2 q1 )q1 q2 . However, if both N and Q are normal subgroups
−1 −1
then (n2 q1 n2 q1 ) ∈ N ∩ Q = {1}. Therefore n1 q1 n2 q2 = n1 n2 q1 q2 is the direct product structure.
– 292 –
Almost identical considerations show that the Poincaré group is isomorphic to the
semidirect product of the translation and Lorentz groups:
Poincare(M1,d−1 )) ∼
= M1,d−1 o O(1, d − 1) (14.41)
2. If a group G2 acts on X and Y is any set then G2 actions on the function space
F[X → Y ] in a natural way.
We can combine these two ideas as follows: Suppose that G2 acts on a set X and
Y = G1 is itself a group. Then let
α : G2 → Aut(F[X → G1 ]) (14.42)
αg : F 7→ αg (F ) ∈ F[X → G1 ] (14.43)
where we define
αg (F )(x) = F (φ(g −1 , x)) ∀g ∈ G2 , x ∈ X (14.44)
Then we can form the semidirect product
F[X → G1 ] o G2 (14.45)
This is a generalized wreath product. The traditional wreath product is a special case where
G2 = Sn for some n and Sn acts on X = {1, . . . , n} by permutations in the standard way.
Note that the group F[X → G1 ] ∼ = Gn1 . The traditional wreath product G1 wrSn , also
denoted G1 o Sn , is then F[X → G1 ] o Sn . To be quite explicit, the group elements in
G1 o Sn are
(h1 , . . . , hn ; φ) (14.46)
with hi ∈ G1 and φ ∈ Sn and the product is
(h1 , . . . , hn ; φ)(h01 , . . . , h0n ; φ0 ) = (h1 h0φ−1 (1) , h2 h0φ−1 (2) , . . . , hn h0φ−1 (n) , φ ◦ φ0 ) (14.47)
Example 14.6: Kaluza-Klein theory. The basic idea of Kaluza-Klein theory is that we
study physics on a product manifold X × Y and partially rigidify the situation by putting
some structure on Y. We then regard Y as “small” and study the physics as “effectively”
taking place on X .
The idea is intuitively understood by imagining a 2 + 1 dimensional world where space
is a cylinder of radius R. If we imagine beings in this flatland of a fixed lengthscale,
– 293 –
and shrink R → 0 then the beings will end up perceiving themselves as living in a 1 + 1
dimensional world. ♣SUITABLE
FIGURE NEEDED
Historically, Kaluza-Klein theory arose from attempts to unify the field theories of HERE ♣
general relativity with Maxwell’s theory of electromagnetism. The basic idea is that pure
general relativity on X × Y appears, when Y is “small” to be a theory of several fields,
including electro-magnetism, in X . As originally conceived the idea is very beautiful, but
now regarded as too naive and simplistic. Nevertheless, the idea that there might be extra
dimensions of spacetime in a compact manifold survives to this day and models that make
use of it come astonishingly close to describing the standard model of particle physics and
gravity, in the context of “string compactification.”
The canonical example of Kaluza-Klein theory is the case where X = M1,d−1 is d-
dimensional Minkowski space and Y = S 1 is the circle. We rigidify the situation by
putting a metric on the circle S 1 so that the metric on space-time is
In Quantum Field Theory one makes a huge leap: The quantization of the field leads to
quantum states which are interpreted as the states of a system of particles. An essential
step in this feat of magic is that one makes a Fourier-decomposition of the field. The
Fourier modes of the field are interpreted as creation/annihilation operators of particle
states. For the massless scalar field the Fourier modes are
M µ
eipM x = eipµ x eipθ θ (14.50)
n2
E 2 − p~2 = (14.51)
R2
where pµ = (E, p~). From the viewpoint of a d-dimensional field theory, Fourier modes with
n 6= 0 describe massive particles with m2 = n2 /R2 .
Now consider the case that R is very small compared to the scale of any observer.
Then that observer will perceive only a d-dimensional spacetime. If R is very small the
single massless particle in d+1-dimensions is percieved as an infinite set of different massive
particles with mass |n|/R in d dimensions. As R → 0 the masses of the particles ∼ |n|/R
goes to infinity. So, except for the n = 0 modes, the particles are very massive and therefore
will not be created by low energy processes, and are hence in general unobservable. For
– 294 –
example, if R is on the order of the Planck scale then the nonzero Fourier modes are fields
that represent particles of Planck-scale mass.
In a similar spirit, one finds that the Einstein-Hilbert action in (d + 1) dimensions
describing gravity in (d + 1) dimensions is equivalent, upon keeping only the n = 0 Fourier
modes, to the action of d-dimensional general relativity together with the Maxwell action
and the action for a scalar field. In a little more detail, suppose that Y = S 1 and we use
coordinates X M , M = 0, . . . , d + 1 on X × S 1 and coordinates xµ with µ = 0, . . . , d on X .
So that X M = (xµ , θ) where θ is an angular coordinate around S 1 . Then we consider the
metric:
where the metric gµν , the “warp factor” Ω2 and the “gauge connection” Aµ are only
functions of xµ (that is, we make the restriction to zero Fourier modes).
Note that this means the metric tensor looks like
!
µ gµν (x) + Ω2 (x)Aµ (x)Aν (x) Ω2 (x)Aµ (x)
gM N (x , θ) = (14.53)
Ω2 (x)Aν (x) Ω2 (x)
1. The KK ansatz also leads to a scalar field logΩ2 (x), known as the “dilaton” because
it can dilate, in a space-time dependent way, the size of the “internal dimensions” Y.
Note that in electricity and magnetism the coupling constant enters via
√ 1
Z
SMaxwell = g 2 Fµν F µν (14.56)
4e
so the presence of the dilaton can lead to space-time variation of the fine structure
constant. In physically viable models one must explain why the dilaton does not
– 295 –
fluctuate wildly. The discovery of the naturally occurring nuclear reactor in Oklo,
Africa, has led to the bound
α̇
| | < f ew × 10−17 yr−1 (14.57)
α
2. By considering internal spaces Y equipped with metric with isometry group H similar
considerations lead gauge theory with gauge group H in X .
GD (14.60)
– 296 –
then g(x) will take θ → θ + (x), so
Aµ → Aµ + ∂ µ (14.65)
and thus, these special diffeomorphisms appear as gauge transformations of the Maxwell
field!
Exercise
Show that S3 ∼= Z3 o Z2 where the generator of Z2 acts as the nontrivial outer auto-
morphism of Z3 .
152
Answer D4 .
– 297 –
Exercise Symmetries Of The Cube
a.) Consider a perfect cube. By considering the action of proper rotations on the four
diagonal axes through vertices show that the symmetry group is isomorphic to S4 .
b.) Centering the cube on the origin with vertices (±1, ±1, ±1) show that the symmetry
group is (Z2 × Z2 ) o S3 . Deduce that
S4 ∼
= (Z2 × Z2 ) o S3 (14.66)
– 298 –
You can easily see that there are at least four components since detA ∈ {±1} and moreover
A200 = 1 + di=1 A20i so that we can independently have A00 ≥ 1 or A00 ≤ −1.
P
Exercise Holomorph
Given a finite group G a canonical semidirect product group is G o Aut(G) known as
the holomorph of G.
a.) Show that this is the normalizer of the copy of G in the symmetric group S|G| given
by Cayley’s theorem.
b.) Show that the affine Euclidean group Euc(d) is the holomorph of the Abelian
group Rd .
Ψ : H oα G → H × G (14.72)
b.) Show that the subgroup of pure translations, that is, the subgroup of elements of
the form {v|1} with v ∈ Rd is a normal subgroup of Euc(d).
154
Answer : Ψ(h, g) = (hφ(g), g).
– 299 –
c.) Can you construct a homomorphism O(d) → Euc(d)?
d.) Consider the group G = L o Z2 where L is a lattice and the nontrivial element in
Z2 acts on L by v → −v. Compute the conjugacy classes in G. 155
for
v 0 = v + (1 − R)v0 (14.75)
ι π
1→N → G → Q→1 (15.1)
There is a notion of homomorphism of two group extensions
ι
1 π
1→N → G1 →1 Q → 1 (15.2)
ι
2 π
1→N → G2 →2 Q → 1 (15.3)
This means that there is a group homomorphism ϕ : G1 → G2 so that the following diagram
commutes:
ι1
1 /N / G1 π 1 / Q /1 (15.4)
O O
Id ϕ Id
ι2 π2
1 /N / G2 /Q /1
To say that a “diagram commutes” means that if one follows the maps around two
paths with the same beginning and ending points then the compositions of the maps are
the same. Thus (15.4) is completely equivalent to the pair of equations:
π1 = π2 ◦ ϕ
(15.5)
ι2 = ϕ ◦ ι1
155
Answer : C({v|1}) = {{±v|1}} has two elements while C({v| − 1}) = {{±v + 2v1 | − 1}|v1 ∈ L} has
infinitely many elements.
– 300 –
However, drawing a diagram makes the relations between maps, domains and codomains
much more transparent. Sometimes a picture is worth a thousand equations. This is why
mathematicians like commutative diagrams.
When there is a homomorphism of group extensions based on ψ : G2 → G1 such that
ϕ ◦ ψ and ψ ◦ ϕ are the identity then the group extensions are said to be isomorphic.
It can certainly happen that there is more than one nonisomorphic extension of Q by
N . Classifying all extensions of Q by N is a difficult problem. We will discuss it more in
section 15.7 below.
We would encourage the reader to think geometrically about this problem, even in
the case when Q and N are finite groups, as in Figure 37. In particular we will use the
important notion of a section, that is, a right-inverse to π: It is a map s : Q → G such that
π(s(q)) = q for all q ∈ Q. Such sections always exist.156 Note that in general s(π(g)) 6= g.
This is obvious from Figure 37. The set of pre-images, π −1 (q), is called the fiber of π over
q. The map π projects the entire fiber over q to the single element q. A choice of section
s is a choice, for each and every q ∈ Q, of just one single point in the fiber above q.
In order to justify the picture of Figure 37 let us prove that, as a set, G is just the
product N × Q. Note that for any g ∈ G and any section s:
g(s(π(g)))−1 (15.6)
g = ι(n)s(q) (15.8)
for some n ∈ N and some q ∈ Q. In fact, this decomposition is unique: Suppose that:
– 301 –
Then we rewrite this as
ι(n2−1 n1 ) = s(q2 )s(q1 )−1 (15.10)
Now, applying π we learn that 1 = q2 π(s(q1 )−1 ) = q2 (π(s(q1 )))−1 = q2 q1−1 , so q1 = q2 .
But that implies n1 = n2 . Therefore, as a set, G can be identified with N × Q.
and since the decomposition is unique (given a choice of section) the map ϕ is 1 − 1.
ω : Q → Aut(N ) (15.12)
denoted by
q 7→ ωq (15.13)
where the definition of ωq is given by
Because ι(N ) is normal the RHS is again in ι(N ). Because ι is injective ωq (n) is well-defined.
Moreover, for each q the reader should check that indeed ωq (n1 n2 ) = ωq (n1 )ωq (n2 ), and ωq
is one-one on N . Therefore we really have a map of sets (15.12). Note carefully that we
are not saying that q 7→ ωq is a group homomorphism. In general, it is not.
Remark: Clearly the ι is a bit of a nuisance and leads to clutter and it can be safely
dropped if we consider N simply to be a subgroup of G, for then ι is simply the inclusion
map. The confident reader is encouraged to do this. The formulae will be a little cleaner.
However, we will be pedantic and retain the ι in most of our formulae.
Let us stress that the map ω : Q → Aut(N) in general is not a homomorphism and in
general depends on the choice of section s. We will discuss the dependence on the choice of
section s below when we have some more machinery and context. For now let us see how
close ω comes to being a group homomorphism:
– 302 –
We want to compare this to ι (ωq1 q2 (n)). In general they will be different unless s(q1 q2 ) =
s(q1 )s(q2 ), that is, unless s : Q → G is a homomorphism. In general the section is not a
homomorphism, but clearly something nice happens when it is:
Proof :
First suppose it splits. Choose a splitting s. Then from (15.15) we know that
where ·ω stresses that we are multiplying with the semidirect product rule.
Thus, we have shown that a split extension is isomorphic to a semidirect product
G∼= N o Q. The converse is straightforward. ♠
In §15.7 below we will continue the general line of reasoning begun here. However, in
order to appreciate the formulae better it is a good idea first to step back and consider a
simple but important special case of extensions, namely, the central extensions. These are ♣Do the general
case first and then
extensions such that ι(N ) is a subgroup of the center of G. Here is the official definition: specialize? ♣
– 303 –
Definition A central extension of G by A, 157 is a group G̃ and an extension such
that
ι π
1→A → G̃ → G→1 (15.22)
such that ι(A) ⊂ Z(G̃).
We stress again that what we called G in the previous discussion is here called G̃, and
what we called Q in the previous discussion is here called G.
Example And Remark:Sections of group extensions vs. continuous sections of prin-
cipal fiber bundles. Let us return to the very important exact sequence (10.38):
ι π
1 → Z2 → SU (2) → SO(3) → 1 (15.23)
where α, β are complex numbers with |α|2 + |β|2 = 1. Writing this equation in terms
of the real and imaginary parts of α, β we recognize the equation of the unit three
dimensional sphere. Now recall that all the spheres of dimension ≥ 2 are simply
connected. Therefore π1 (SU (2)) = 1 is simply connected.
2. But SO(3) is not simply connected! In fact, using a coffee cup you can informally
demonstrate that π1 (SO(3)) ∼
= Z2 . [Demonstrate].
3. If there were a continuous section then s∗ : π1 (SO(3)) → π1 (SU (2)) would be a well-
defined group homomorphism and s ◦ π = Id would imply that on the fundamental
groups Id∗ = s∗ π∗ in
But Id∗ takes the nontrivial element of Z2 to the nontrivial element, not to the trivial
element. This is impossible if you factor through the trivial group.
– 304 –
∼ Z2 , (this
the possible 2-fold covers of the manifold M . It turns out that H1 (SO(3); Z2 ) =
∼
is closely related to π1 (SO(3)) = Z2 . so there are two double covers of SO(3). One is
O(3) = Z2 × SO(3) and the other is SU (2), the nontrivial double cover.
The extension (15.1) generalizes to
ι π
1 → Z2 → Spin(d) → SO(d) → 1 (15.26)
we discuss these in Section *** below. Again, in these cases there is no continuous section.
Thus, these examples are nontrivial as fiber bundles. Moreover, even if we allow ourselves
to choose a discontinuous section, we cannot do so and make it a group homomorphism.
In other words these examples are also nontrivial as group extensions.
Exercise
If s : Q → G is any section of π show that for all q ∈ Q,
for some n, n0 ∈ N .
1 / H0 ι /H π / H 00 /1 (15.29)
and a homomorphism
ρ : G → H 00 (15.30)
one can define another extension of G by H 0 known as a pullback extension. We are trying
to fill in the diagram:
G (15.31)
ρ
1 / H0 ι /H π / H 00 /1
– 305 –
We have an extension of the form
1 / H0 ι / G̃ π̃ /G /1 (15.33)
where π̃(h, g) := g. Show that this extension fits in the commutative diagram
1 / H0 / G̃ π̃ / G00 /1 (15.34)
ρ̃ ρ
1 / H0 /H π / H 00 /1
1 / H0 ι /H π / H 00 /1 (15.35)
ρ
H̃
to get an extension of H 00 by H̃. This is not as universal as the pullback. But one can
construct it if ρ : H 0 → H̃ is surjective and ι(ker(ρ)) / H. Give the construction. 158
/ H0 ι
/H π
/ H 00 /1
1 O (15.36)
ρ ϕ Id
1 / H̃ ι̃ / G̃ π̃ / H 00 /1
– 306 –
Exercise Another form of splitting
Show that an equivalent definition of a split exact sequence for a central extension is
that there is a homomorphism t : G̃ → A which is a left-inverse to ι, t(ι(a)) = a.
(Hint: Define s(π(g̃)) = ιt(g̃ −1 ))g̃.)
– 307 –
15.2 Projective Representations
We have already encountered the notion of a matrix representation of a group G. This is
a homomorphism from G into GL(d, κ) for some field κ. In many contexts in mathematics
and physics (especially in quantum physics) one encounters a generalization of the notion
of matrix representation known as a projective representation. The theory of projective
representations is closely related to the theory of central extensions.
Recall that a matrix representation of a group G is a group homomorphism
ρ : G → GL(d, κ) (15.41)
ρ : G → GL(d, κ) (15.42)
? b(g1 g2 )
f (g1 , g2 )= (15.45)
b(g1 )b(g2 )
that attempts to describe the effects of a rotation on - say - a spin 1/2 particle. In fact,
there is no such thing as the “spin representation of the rotation group SO(3).” There is
a spin projective representation of SO(3). 161
We saw above that there is a very natural group homomorphism from SU (2) to SO(3),
but there is no group homomorphism back from SO(3) to SU (2): There is no splitting.
The so-called “spin representation of SO(3)” is usually presented by attempting to con-
struct a splitting ρ : SO(3) → SU (2) using Euler angles. Indeed, under the standard
161
However, there is a perfectly well-defined spin representation of the Lie algebra so(3).
– 308 –
homomorphism π : SU (2) → SO(3) one recognizes that exp[iθσ i ] maps to a rotation by
angle 2θ around the the ith axis. For example,
!
e iθ 0
u = exp[iθσ 3 ] = = cos θ + i sin θσ 3 (15.47)
0 e−iθ
acts by
!
−1 z x − iy
u~x · ~σ u =u u−1
x + iy −z
! (15.48)
z e2iθ (x − iy)
=
e−2iθ (x + iy) −z
One can represent any rotation in SO(3) by a rotation around the z-axis, then around
the x-axis, then around the z axis. Call this R(φ, θ, ψ). So one attempts to define ρ by
assigning
φ 3 θ 1 ψ 3
ρ : R(φ, θ, ψ) → ei 2 σ ei 2 σ ei 2 σ . (15.49)
Clearly, we are going to have problems making this mapping well-defined. For example,
φ 3
R(φ, 0, 0) would map to ei 2 σ , but this is not well-defined for all φ because R(2π, 0, 0) = 1
2π 3
and ei 2 σ = −1. The problem is that the Euler angle coordinates on SO(3) are sometimes
singular. So, we need to restrict the domain of φ, θ, ψ so that (15.49) is well-defined
for every R ∈ SO(3). However, when we make such a restriction we will spoil the group
homomorphism property, but only up to a phase. So, in this way, we get a two-dimensional
projective representation of SO(3).
As an exercise you can try the following: Every SU (2) matrix can be written as
u = cos(χ) + i sin(χ)n̂ · ~σ and this maps under π to a rotation by 2χ around the n̂ axis. But
again, you cannot smoothly identify every SO(3) rotation by describing it as a rotation by
2χ around an axis.
Z2 × Z2 = {1, g1 , g2 , g1 g2 } (15.50)
ρ : Z2 × Z2 → GL(2, C) (15.51)
defined by
ρ(1) = 1
ρ(g1 ) = σ 1
(15.52)
ρ(g2 ) = σ 2
ρ(g1 g2 ) = σ 3
– 309 –
a.) Show that this defines a projective representation of Z2 × Z2 .
b.) Try to remove the phase to get a true representation.
c.) Show that ρ defines a section of an exact sequence with G given by the Pauli group.
1. Physical states are identified with traceclass positive operators ρ of trace one. They
are usually called density matrices. We denote the space of physical states by S. 163
2. Physical observables are identified with self-adjoint operators. We denote the set of
(bounded) self-adjoint operators by O.
3. The Born rule states that when measuring the observable O in a state ρ the prob-
ability of measuring value e ∈ E ⊂ R, where E is a Borel-measurable subset of R,
is
Pρ,O (E) = TrPO (E)ρ. (15.53)
where P (λi ) is the projection operator onto the eigenspace with eigenvalue λi then
X
PO (E) = P (λi ) (15.55)
λi ∈E
– 310 –
4. There are further axioms regarding time-development, and so on, but the above is
all we need for the present discussion.
Given this setup up the natural notion of a general “symmetry” in quantum mechanics
is the following:
This set of mappings forms a group which we will call the group of quantum automorphisms. ♣Need to state
some appropriate
While this is conceptually straightforward, it is an unwieldy notion of symmetry. We continuity
properties. ♣
will now simplify it considerably, ending up with the crucial theorem known as Wigner’s
theorem.
We begin by noting that the space of density matrices is a convex set. The convexity
means that if ρ1 , ρ2 are density matrices then for all 0 ≤ t ≤ 1
is a density matrix. Given a convex set on defines an extremal point to be a point in the
set which cannot be written in the above form with 0 < t < 1. By definition, the pure
states are the extremal points of S. The pure states are just the dimension one projection
operators.
Pure states are often referred to in the physics literature as “rays in Hilbert space” for
the following reason:
If ψ ∈ H is a nonzero vector then it determines a line
`ψ := {zψ|z ∈ C} := ψC (15.58)
Note that the line does not depend on the normalization or phase of ψ, that is, `ψ = `zψ
for any nonzero complex number z. Put differently, the space of such lines is projective
Hilbert space
PH := (H − {0})/C∗ (15.59)
Equivalently, this can be identified with the space of rank one projection operators. Indeed,
given any line ` ⊂ H we can write, in Dirac’s bra-ket notation: 164
|ψihψ|
P` = (15.60)
hψ|ψi
164
We generally denote inner products in Hilbert space by (x1 , x2 ) ∈ C where x1 , x2 ∈ H. Our convention
is that it is complex-linear in the second argument. However, we sometimes write equations in Dirac’s
bra-ket notation because it is very popular. In this case, identify x with |xi. Using the Hermitian structure
there is a unique anti-linear isomorphism of H with H∗ which we denote x 7→ hx|. Sometimes we denote
vectors by Greek letters ψ, χ, . . . , and scalars by Latin letters z, w, . . . . But sometimes we denote vectors
by Latin letters, x, w, . . . and scalars by Greek letters, α, β, . . . .
– 311 –
where ψ is any nonzero vector in the line `.
It is possible to argue (see the above reference) that such a symmetry maps pure states
to pure states, and is completely determined by this map. So we can view the group of
quantum automorphisms as the group of transformations of one-dimensional projection
operators (or rays) that preserves overlaps
Example: Let us consider this case of a single Qbit, namely H = C2 . First we write the
most general general density matrix. Any 2 × 2 Hermitian matrix is of the form a + ~b · ~σ
where ~σ is the vector of “Pauli matrices”:
!
1 01
σ =
10
!
0 −i
σ2 = (15.62)
i 0
!
1 0
σ3 =
0 −1
a ∈ R and ~b ∈ R3 . Now a density matrix ρ must have trace one, and therefore a = 12 . Then
the eigenvalues are 12 ± |~b| so positivity means it must have the form
1
ρ = (1 + ~x · ~σ ) (15.63)
2
where ~x ∈ R3 with ~x2 ≤ 1.
The extremal states, corresponding to the rank one projection operators are therefore
of the form
1
P~n = (1 + ~n · ~σ ) (15.64)
2
where ~n is a unit vector. This gives the explicit identification of the pure states with
elements of S 2 . Moreover, we can easily compute:
1
TrP~n1 P~n2 = (1 + ~n1 · ~n2 ) (15.65)
2
and ~n1 · ~n2 = cos(θ1 − θ2 ) where |θ1 − θ2 | (with θ’s chosen so this is between 0 and π) is
the geodesic distance between the two points on the unit sphere.
There is another viewpoint which is useful. Nonzero vectors in C2 can be normalized
to be in the unit sphere S 3 . Then the association of projector to state given by
1
|ψi → |ψihψ| = (1 + ~n · ~σ ) (15.66)
2
– 312 –
defines a map π : S 3 → S 2 known as the Hopf fibration.
The unit sphere is a principal homogeneous space for SU (2) and we may coordinatize
SU (2) by the Euler angles:
φ 3 θ 2 ψ 3
u = e−i 2 σ e−i 2 σ e−i 2 σ (15.67)
with range 0 ≤ θ ≤ π and identifications:
k uψ k=k ψ k (15.72)
– 313 –
The composition of unitary operators is clearly unitary. The composition of unitary and
antiunitary is antiunitary, and the composition of antiunitaries is unitary so we have an
exact sequence
φ
1 → U (H) → Aut(H)−→Z2 → 1 (15.74)
where (
+1 g unitary
φ(g) = (15.75)
−1 g anti − unitary
π(u) : P 7→ uP u† (15.76)
for anti-unitary operators a. See the footnote above for the definition of the adjoint of an
anti-unitary operator. In both cases u−1 = u† and a−1 = a† and these operations preserve
the overlap function.
The fiber of the map π can be thought of as possible c-number phases which can
multiply the operator on Hilbert space representing a symmetry operation:
π
1 → U (1) → Aut(H) −→Aut(QM ) → 1 (15.78)
Here the U (1) is the group of phases acting on quantum states: ψ → zψ for z ∈ U (1).
The upshot is that, given a classical symmetry group G of a physical system, for each
g ∈ G we can associate a unitary, or antiunitary, operator U (g) acting on a Hilbert space.
Quantum mechanics only guarantees that
for some phase factor c(g1 , g2 ), which, in general, cannot be removed by a redefinition of
U (g) by a phase Ũ (g) = b(g)U (g). So we have a projective representation of the classical
symmetry G.
Put differently: In a given physical system we will not consider the full group Aut(QM )
as a group of symmetries because. For example, the time-dynamics of a nonrelativistic
quantum system is governed by the Schrödinger equation:
∂ψ
i~ = Hψ (15.80)
∂t
where H is a self-adjoint operator, the Hamiltonian. For time-independent Hamiltonians
the unitary time evolution is governed by
t
U (t) = exp[−i H] (15.81)
~
– 314 –
Only a subgroup of Aut(QM ) will commute with the resultant flows on the space of states.
If we think of G as embedded in Aut(QM ) then we have
G (15.82)
ρ
1 / U (1) ι / Aut(H) π / Aut(QM ) /1
The group which acts on the Hilbert space will be a central extension of G given by the
pullback construction.
One thing we can note is that finite dimensional matrices are always associative! 166
So for all g1 , g2 , g3 ∈ G
and hence
c(g1 , g2 )c(g1 g2 , g3 ) = c(g1 , g2 g3 )c(g2 , g3 ) (15.84)
We note that projective representations are quite pervasive in modern physics:
3. Projective representations are very important in conformal field theory. The Virasoro
group, and the Kac-Moody groups are both nontrivial central extensions of simpler
objects.
Now, as we will explain near the end of section 15.3 below projective representations
are very closely connected to central extensions. So in the next section we turn to a deeper
investigation into the structure of central extensions.
Remark: The fact that the symmetry operators U (g) should commute with the Hamil-
tonian has an important implication. Suppose that H has a complete set of eigenvectors
Ψλ where λ ∈ Spec(H) is a discrete set of eigenvalues. Then we can restrict U (g) to the
different eigenspaces Hλ of H, for if
Hψλ = Eλ ψλ (15.85)
and U (g) commutes with H then U (g)ψλ also is an eigenvector of eigenvalue Eλ . This
means that the eigenspaces Hλ are each projective representations of G. This can be
extremely useful in diagonalizing H and simplifying computations. ♣Need to work in
some examples ♣
166
And the same holds for linear operators on infinite-dimensional Hilbert spaces provided the domains
are such that the composition of the three operators is well-defined.
– 315 –
Exercise Kernel Of π In The Wigner Sequence
Show that the kernel of π in the exact sequence (15.78) is precisely the group of phases
times the identity operator. 167
π(s(g)) = g ∀g ∈ G (15.86)
As we have stressed, in general s is not a homomorphism. In the case when the
sequence splits, that is, when there exists a section which is a homomorphism, then we can
say G̃ is isomorphic to a direct product G̃ ∼= A × G via
– 316 –
(Note that simply substituting (15.89) into (15.91) is not obviously going to work
because G̃ need not be abelian.)
Definition: In general
1. A 2-cochain on G with values in A is a function
f :G×G→A (15.94)
Remarks:
1. The fancy terminology is introduced for a good reason because there is a topological
space and a cohomology theory underlying this discussion. See Section §15.8 and
Section §17.2 for further discussion.
169
Since we are working with central extensions we could put the ι(t(g)) on either side of the s(g). However,
when we discuss non-central extensions later the order will matter.
– 317 –
for some function t : G → A. It is easy to check that
Definition: In general two 2-cochains f and fˆ are said to differ by a coboundary if there
exists a function t : G → A such that
for all g1 , g2 ∈ G.
One can readily check, using the condition that A is Abelian, that if f is a cocycle
then any other fˆ differing by a coboundary is also a cocycle. Moreover, being related by a
cocycle defines an equivalence relation on the set of cocycles f ∼ fˆ. Thus, we may define:
Definition: The group cohomology H 2 (G, A) is the set of equivalence classes of 2-cocycles
modulo equivalence by coboundaries.
Now, the beautiful theorem states that group cohomology classifies central extensions:
170
Proof : Let E(G, A) denote the set of all extensions of G by A, and let E(G, A) denote the
set of all isomorphism classes of extensions of G by A.
We first construct a map:
is well-defined, because the RHS does not depend on the choice of section s.
Now we claim that this map descends to a map ΨE→H : E(G, A) → H 2 (G, A). Indeed,
if we have an isomorphism of central extensions:
1 /A ι /G
e π /G /1 (15.100)
Id ψ Id
/A ι0 /G
e0 π0 /G /1
1
– 318 –
where ψ : G̃ → G̃0 is an isomorphism such that the inverse also leads to a commutative
diagram, then ψ can be used to map sections of π : G̃ → G to sections of π 0 : G̃0 → G by ♣where do we use
this condition in the
s 7→ s0 where s0 (g) = ψ(s(g)). Then proof? ♣
and hence we assign precisely the same 2-cocycle f (g1 , g2 ) to the section s0 . Hence the
map ΨE→H only depends on the isomorphism class of the extension. This defines the map
ΨE→H .
Conversely, we can define a map ΨH→E : Z 2 (G, A) → E(G, A) as follows: Given a
cocycle f ∈ Z 2 (G, A) we may define G̃ = A × G as a set and we use f to define the
multiplication law:
(a1 , g1 )(a2 , g2 ) := (a1 a2 f (g1 , g2 ), g1 g2 ) (15.102)
You should check that this does define a valid group multiplication: The associativity
follows from the cocycle relation. Note that if we use the trivial cocycle: f (g1 , g2 ) = 1 for
all g1 , g2 ∈ G then we just get the direct product of groups.
Now suppose that we use two 2-cocycles f and f 0 which are related by a coboundary
as in (15.97) above. Then we claim that the map ψ : G̃ → G̃0 defined by
is an isomorphism of central extensions as in (15.100). This means that the map ΨH→E :
Z 2 (G, A) → E(G, A) actually descends to a well-defined map
We leave it to the reader to check that ΨH→E and ΨE→H are inverse maps. ♠
Remarks:
1 → U (1) → G
e→G→1 (15.105)
– 319 –
The evil failure of ρ(g) to be a true representation of G now becomes a virtuous
fact that allows ρe to be a true representation of G.
e This is the typical situation in
quantum mechanics, where G is a group of classical symmetries and G e is the group
that is implemented quantum-mechanically. A good example is the spin-1/2 system
where G = SO(3) is the classical group of rotations but for a quantum rotor the
proper symmetry group is G e = SU (2). There are many many other examples.
2. Group Structure on E(G, A). The set H 2 (G, A) carries a natural structure of an
Abelian group. Indeed, as we remarked above C 2 (G, A), being a set of maps with
target space a group, A, is naturally a group. Then, because A is Abelian, we can
define a group structure on Z 2 (G, A) by the rule:
where we are writing the product in A multiplicatively. Again using the fact that ♣It might be
clearer to write A
A is abelian this descends to a well-defined multiplication on cohomology classes: additively... ♣
[f1 ] · [f2 ] := [f1 · f2 ]. Therefore H 2 (G, A) itself is an abelian group. The identity
element corresponds to the cohomology class of the trivializable cocycles, which in
turn corresponds to the split extension A × G.
It is natural to ask whether one can give a more canonical description of the abelian
group structure on the set of equivalence classes of central extensions of G by A.
Indeed we can: We pull back the Cartesian product to the diagonal of G × G and
then push forward by the multiplication map µ : A × A → A. That is, suppose we
have two central extensions:
1ι π
E1 : 1→A → G̃1 →1 G → 1 (15.108)
2ι π
E2 : 1→A → G̃2 →2 G → 1 (15.109)
The Cartesian product E1 × E2 is the extension of G × G by A × A using the group
G̃1 × G̃2 with the Cartesian product of the group homomorphisms. We want an
extension of G by A, corresponding, under the 1-1 correspondence of the above
theorem to the natural group structure on H 2 (G, A). To construct it, let
∆:G→G×G (15.110)
be the diagonal homomorphism: ∆ : g 7→ (g, g). Then we claim that the product
extension E1 · E2 can be identified as
E1 · E2 = µ∗ ∆∗ (E1 × E2 ) (15.111)
– 320 –
where
G 12 := {(g̃1 , g̃2 )|π1 (g̃1 ) = π2 (g̃2 )} ⊂ G̃1 × G̃2 (15.113)
d
We can define π12 (g̃1 , g̃2 ) := π1 (g̃1 ) = π2 (g̃2 ). Now consider the “anti-diagonal”
3. Trivial vs. Trivializable. Above we defined the trivial cocycle to be the one with
f (g1 , g2 ) = 1A for all g1 , g2 . We define a cocycle to be trivializable if it is cohomologous
to the trivial cocycle. Note that a trivializable cocycle f could be trivialized in
multiple ways. Suppose both b and b̃ trivialize f . Then you should show that b̃ and
b “differ” by a group homomorphism φ : G → A in the sense that
There are situations where a cohomological obstruction vanishes and the choice of
trivialization has physical significance. ♣Should give some
examples: H 3 is
obstruction to
4. An analogy to gauge theory: Changing a cocycle by a coboundary is strongly anal- orbifolding CFT
and choice of
ogous to making a gauge transformation in a gauge theory. In Maxwell’s theory we trivialization is H 2
- hence discrete
can make a change of gauge of the vector potential Aµ by torsion. There are
bundle examples.
Find example where
A0µ (x) = Aµ (x) − ig −1 (x)∂µ g(x) (15.119) class in H 2 is zero
but trivialization
has physical
where g(x) = ei(x) is a function on spacetime valued in U (1). In the case of electro- meaning. ♣
– 321 –
One such quantity is the fieldstrength tensor Fµν := ∂µ Aν − ∂ν Aµ , but this is not
a complete gauge invariant. The isomorphism class of a field is completely specified
H
by the holonomies expi γ A around all the closed cycles γ in spacetime. Even when
Aµ (x) is not trivializable, it is very often useful to use gauge transformations to try
to simplify Aµ . In the next remark we do the same for cocycles.
5. Simplifying Cocycles Using Coboundaries. Using a coboundary one can usefully sim-
plify cocycles. Since this topic will be unfamiliar to some readers we explain this in
excruciating detail. Those who are familiar with cohomology can safely skip the rest
of this remark. To begin, note that a coboundary modification takes a cochain f to
f (1) satisfying:
t(1)t(1)
f (1) (1, 1) = f (1, 1) = f (1, 1)t(1) (15.120)
t(1 · 1)
so by choosing any function t such that t(1) = f (1, 1)−1 we get a new cochain
satisfying f (1) (1, 1) = 1. Choose any such function. (The simplest thing to do is set
t(g) = 1 for all other g 6= 1. We will make this choice, but it is really not necessary.)
Now recall that if f is a cocycle then a modification of f by any coboundary produces
a new cochain f (1) that is also a cocycle. So now, if f is a cocycle and we have set
f (1) (1, 1) = 1 then, by (15.92) we have f (1) (g, 1) = f (1) (1, g) = 1 for all g. Now, we
can continue to make modifications by coboundaries to simplify further our cocycle
f (1) . In order not to undo what we have done we require that the new coboundaries
we use, say, t̃ satisfy t̃(1) = 1. We may say that we are “partially choosing a gauge”
by choosing representatives so that f (1) (1, 1) = 1 and then the further coboundaries
t̃ must “preserve that gauge.” Now suppose that g 6= 1. Then (using our particular
choice of t above):
1
f (1) (g, g −1 ) = f (g, g −1 ) = f (g, g −1 )f (1, 1) (15.121)
t(1)
is not particularly special. (Remember that we are making the somewhat arbitrary
choice that t(g) = 1 for g 6= 1.) So we have not simplified these quantities. However,
we still have plenty of gauge freedom left and we can try to simplify the values as
follows: Suppose, first, that g 6= g −1 , equivalently, suppose g 2 6= 1 so g is not an
involution. Then we can make another “gauge transformation” by a coboundary
function t̃ to produce:
t̃(g)t̃(g −1 )
f (2) (g, g −1 ) = f (1) (g, g −1 ) = fˆ(g, g −1 )t̃(g)t̃(g −1 ) (15.122)
t̃(g · g −1 )
where in the second equality we used the “gauge-preserving” property that t̃(1) = 1.
Now, in any way you like, divide the non-involution elements of G into two disjoint
sets S1 q S2 so that no two group elements in S1 are related by g → g −1 . Then, if
g ∈ S2 we have g −1 ∈ S1 and vice versa. Then we can choose a function t̃ so that for
every g ∈ S2 we have
t̃(g) = (t̃(g −1 ))−1 (f (1) (g, g −1 ))−1 (15.123)
– 322 –
Consequently:
f (2) (g, g −1 ) = 1 ∀g ∈ S2 (15.124)
It doesn’t really matter what we choose for t̃ on S1 . For definiteness we choose it to
be = 1. But if we had made another choice the above procedure would still lead to
equation (15.124). Now recall from (15.93) that any cocycle f satisfies f (g, g −1 ) =
f (g −1 , g) for all g. Since f (2) is a cocycle (if we started with a cocycle f ) then we
conclude that for all the non-involutions:
Note that there is still a lot of “gauge freedom”: We have not yet constrained t̃(g) for
g ∈ S1 , nor have we constrained t̃(g) for the involutions, that is, the group elements
g with g 2 = 1. What can we say about f (2) (g, g) for g an involution? we have
t̃(g)2
f (2) (g, g) = f (1) (g, g) = f (1) (g, g)(t̃(g))2 (15.126)
t̃(g 2 )
Now, it might, or might not be the case that f (1) (g, g) is a perfect square in the
group. If it is not a perfect square then we are out of luck: We cannot make any
further gauge transformations to set f (2) (g, g) = 1. Now one can indeed check that
the property of f (g, g) being a perfect square, or not, for an involution g is a truly
“gauge invariant” condition. Therefore we have proven: If f (g, g) is not a perfect
square for some nontrivial involution g then we know that f is not “gauge equivalent”
- that is, is not cohomologous to - the trivial cocycle. That is, [f ] is a nontrivial
cohomology class. Such cocycles will define nontrivial central extensions.
Ψ : (σ1 , 1) → ω 2 = −1
(15.128)
Ψ : (1, σ2 ) → ω
– 323 –
where ω is a primitive 4th root of 1 defines an isomorphism with the group of fourth roots
of unity. In conclusion, the nontrivial central extension of Z2 by Z2 is:
1 → Z2 → Z4 → Z2 → 1 (15.129)
Recall that Z4 is not isomorphic to Z2 × Z2 . The square of this extension is the trivial
extension.
In this example we will write our cyclic groups multiplicatively. Now, using methods of
topology one can show that 171
H 2 (Zp , Zp ) ∼
= Zp . (15.131)
The result (15.131) should puzzle you. After all we know that G must be a group of
order p2 , and we know from the class equation and Sylow’s theorems that there are exactly
two groups of order p2 , up to isomorphism! How is that compatible with the p distinct
extensions predicted by equation (15.131) !? The answer is that there can be nonisomorphic
extensions (15.22) involving the same group G̃. Let us see how this works in the present
example by examining in detail the possible extensions:
ι π
1 → Zp → Zp2 → Zp → 1 (15.132)
Zp = hσ1 |σ1p = 1i
2
Zp2 = hα|αp = 1i (15.133)
Zp = hσ2 |σ2p = 1i
respectively.
For the injection ι we have
ι(σ1 ) = αx (15.134)
and therefore px = 0 mod p2 and therefore x = 0 mod p. But since ι must be an injection
it must be of the form
ιk (σ1 ) := αkp (15.136)
171
You can also show it by examining the cocycle equation directly. We will write down the nontrivial
cocycles presently.
– 324 –
where k is relatively prime to p. We can take
σ2xr = σ2 (15.141)
so that
xr = 1mod p (15.142)
Recall that r ∈ Z∗p and let r∗ be the integer 1 ≤ r∗ ≤ p − 1 such that
Then we have that x = r∗ + `p for any `. That is, s(σ2 ) could be any of
∗ ∗ +p ∗ +2p ∗ +(p−1)p
αr , αr , αr , . . . , αr (15.144)
∗
Here we will make the simplest choice s(σ2 ) = αr . The reader can check that the discussion
is not essentially changed if we make one of the other choices. (After all, this will just change
our cocycle by a coboundary!)
∗
Now that we have chosen s(σ2 ) = αr , if s were a homomorphism then we would be
forced to take:
s(1) = 1
∗
s(σ2 ) = αr
∗
s(σ22 ) = α2r (15.145)
.. ..
. .
∗
s(σ2p−1 ) = α(p−1)r
– 325 –
But now we are stuck! The property that s is a homomorphism requires two contradictory
things. On the one hand, we must have s(1) = 1 for any homomorphism. On the other
∗
hand, from the above equations we also must have s(σ2p ) = αpr . But because 1 ≤ r∗ ≤ p−1
∗
we know that αpr 6= 1. So the conditions for s being a homomorphism are impossible to
meet. Therefore, with this choice of section we find a nontrivial cocycle as follows:
(
x y x+y −1 1 x+y ≤p−1
s(σ2 )s(σ2 )s(σ2 ) = ∗
(15.146)
αr p p ≤ x + y
Here we computed:
∗ ∗ ∗ (x+y−p) ∗p
αr x αr y α−r = αr (15.147)
where you might note that if p ≤ x + y ≤ 2p − 2 then 0 ≤ x + y − p ≤ p − 2. Therefore,
our cocycle is fk,r where
(
x y 1 x+y ≤p−1
fk,r (σ2 , σ2 ) := ∗ ∗
(15.148)
σ1k r p ≤ x + y
since
∗ r∗ ∗ r ∗ kp ∗p
ιk (σ1k ) = αk = αr (15.149)
and here we have introduced an integer 1 ≤ k ∗ ≤ p − 1 so that
kk ∗ = 1 mod p (15.150)
Although it is not obvious from the above formula for fk,r , we know that fk,r will satisfy
the cocycle equation because we constructed it from a section of a group extension.
Now, we know the cocycle is nontrivial because Zp × Zp is not isomorphic to Zp2 . But
let us try to trivialize our cocycle by a coboundary. So we modify our section to
for some function τ (x) valued in Z/pZ. We are trying to find a function τ (x) so that the
new cocycle fs̃ is identically 1. We certainly need s̃(1) = 1 and hence τ (0̄) = 0̄. But now,
because f (σ2x , σ2y ) = 1 already holds for x + y ≤ p − 1 don’t want to undo that so we learn
that
τ (x) + τ (y) − τ (x + y) = 0modp (15.153)
for x + y ≤ p − 1. This means we must take
So, our coboundary is completely fixed up to a choice of τ (1). But now let us compute for
x + y ≥ p − 1:
∗ τ (x)+τ (y)−τ (x+y) ∗p
s̃(σ2x )s̃(σ2y )s̃(σ2x+y )−1 = αr p ι(σ1 ) = αr (15.155)
– 326 –
So, we cannot gauge the cocycle to one, confirming what we already knew: The cocycle is
nontrivial.
Now let us see when the different extensions defined by k, r ∈ Z∗p are actually equivalent.
To see this let us try to construct ϕ so that
ϕ(α) = αy (15.157)
k1 r1 = k2 r2 modp (15.160)
Note that the cocycles fk,r constructed in (15.148) indeed only depend on krmodp. Equiv-
alently, we can label their cohomology class by (kr)∗ = k ∗ r∗ modp.
The conclusion is that kr ∈ Z∗p is the invariant quantity. Extensions with the same
group G̃ = Zp2 in the middle, but with different kr ∈ Z∗p , define inequivalent extensions of
Zp by Zp .
Now let us examine the group structure on the group cohomology. Just multiplying
the cocycles we get:
(
1 x+y ≤p−1
(fk1 ,r1 · fk2 ,r2 ) (σ2x , σ2y ) = (k r )∗ +(k2 r2 )∗
(15.161)
σ1 1 1 p≤x+y
Thus if we map
[fk,r ] 7→ (kr)∗ modZp (15.162)
we have a homomorphism of H 2 (G, A) to the additive group Z/pZ, with the trivializable
cocycle representing the direct product and mapping to 0̄ ∈ Z/pZ.
In conclusion, we describe the group of isomorphism classes of central extensions of Zp
by Zp as follows: The identity element is the trivial extension
1 → Zp → Zp × Zp → Zp → 1 (15.163)
– 327 –
and then there is an orbit of (p − 1) nontrivial extensions of the form
1 → Zp → Zp2 → Zp → 1 (15.164)
Example 3:Prime Powers. Once we start to look at prime powers things start to get more
complicated. We will content ourselves with extensions of Z4 by Z2 . Here it can be shown
that
H 2 (Z4 , Z2 ) ∼
= Z2 (15.165)
so there should be two inequivalent extensions. One is the direct product and the other is ♣Do this more
systematically so
show that there are
precisely two
1 → Z2 → Z8 → Z4 → 1 (15.166) extensions of Z4 by
Z2 . ♣
s(ω) = α
s(ω 2 ) = α2 (15.167)
s(ω 3 ) = α3
but now we have no choice - we must set s(ω 4 ) = s(1) = 1. On the other hand, if s were
to have been a homomorphism we would have wanted to set s(ω 4 ) = s(ω)4 = α4 , but, as
we just said, we cannot do this. With the above choice of section we get the symmetric
cocycle whose nontrivial entries are
f (ω, ω 3 ) = f (ω 2 , ω 2 ) = f (ω 2 , ω 3 ) = f (ω 3 , ω 3 ) = α4 = σ. (15.168)
♣Probably should
just describe all
extensions with
Example 4.. Products Of Cyclic Groups. Another natural generalization is to consider Q = Zn ♣
products of cyclic groups. For simplicity we will only consider the case
G = Zp ⊕ · · · ⊕ Zp (15.169)
where there are k summands, but p is prime. We will think of our group additively
and moreover we will think of Zp as a ring in this example. If we write elements as
~x = (x1 , . . . , xk ) with xi ∈ Zp and our cocycle f (~x, ~y ) is also valued in Zp , so that we are
considering central extensions:
e → Z⊕k → 0
0 → Zp → G (15.170)
p
– 328 –
then the cocycle condition becomes:
Notice a linear term cancels out. It we want to restrict attention to expressions which are
quadratic then we can modify
Example 5.. As a special case of the above, consider extensions of Z2 ⊕Z2 by Z2 . This will
be a group of order 8. As we will see, there are five groups of order 8 up to isomorphism:
Z8 , Z2 × Z4 , Z32 , Q, D4 (15.176)
where Q and D4 are the quaternion and dihedral groups, respectively. Now Z8 cannot sit
in an extension of Z2 × Z2 . (Why not? 172 ) This leaves 4 isomorphisms classes of groups
which do fit in extensions of Z2 × Z2 by Z2 and it happens they are all central extensions.
They are:
1 → Z2 → Z2 × Z2 × Z2 → Z2 × Z2 → 1
1 → Z2 → Z2 × Z4 → Z2 × Z2 → 1
(15.177)
1 → Z2 → Q → Z2 × Z2 → 1
1 → Z2 → D4 → Z2 × Z2 → 1
172
Answer : Because Z2 × Z2 would have to be a quotient of Z8 . But we can easily list the subgroups of
Z8 and no quotient is of this form.
– 329 –
where Q is the quaternion group and D4 the dihedral group. We have already met Q and
D4 above. One can define a homomorphism π : Q → Z2 ⊕ Z2 by
π(±1) = 0 = (0, 0)
π(±iσ 1 ) = v1 = (1, 0)
(15.178)
π(±iσ 2 ) = v2 = (0, 1)
π(±iσ 3 ) = v1 + v2 = (1, 1)
s(v1 ) = iσ 1
s(v2 ) = iσ 2 (15.179)
s(v1 + v2 ) = iσ 3
and, computing the cocycle we find that it is given by the bilinear form (see the previous
exercise): !
11
AQ = (15.180)
01
Similarly, we can define a homomorphism π : D4 → Z2 × Z2 by
π(±1) = 0 = (0, 0)
π(±R(π/2)) = v1 = (1, 0)
(15.181)
π(±P ) = v2 = (0, 1)
π(±P R(π/2)) = v1 + v2 = (1, 1)
s(v1 ) = R(π/2)
s(v2 ) = P (15.182)
s(v1 + v2 ) = P R(π/2)
and, computing the cocycle we find that it is given by the bilinear form (see the previous
exercise): !
11
AD 4 = (15.183)
00
Now, on the other hand, using methods of topology one can prove that
H 2 (Z2 × Z2 , Z2 ) = Z2 ⊕ Z2 ⊕ Z2 (15.184)
We can understand this group in terms of the bilinear quadratic forms mentioned in the
previous example. Under Aij → Aij − (qij + qji ). Note that A11 and A22 are invariant,
but we can modify the off-diagonal part of Aij by a symmetric matrix. Thus, there are 8
possible values for A11 , A22 , A12 ∈ Z2 .
– 330 –
In a way analogous to our discussion of extensions of Zp by Zp , while there are only four
different isomorphism classes of groups, there can be different extensions. An extension
with group cocycle Aij xi1 xj2 defines a group of elements (z, ~x). If we only care about the
isomorphism class of the group we are free to consider an isomorphism
where S ∈ GL(2, Z2 ). This maps A → SAS tr . In general that will produce an isomorphic
group, but a different extension. ♣Explain in detail
how this takes us
In all our examples up to now the group G̃ has been Abelian, but in this example we from 8 extensions to
four isomorphism
have produced two nonisomorphic nonabelian groups Q and D4 of order 8. classes of groups
using the explicit
transforms by
Example 5.. Nonabelian groups can also have central extensions. Indeed, we already saw elements of
GL(2, Z2 ) ∼
= S3 . ♣
this for G = SO(3). Here is an example with G a nonabelian finite group. We take G to ♣Should add an
exercise showing
be the symmetric group Sn . It turns out that it has one nontrivial central extension by that f (g, g) for g an
involution
Z2 : determines the
H 2 (Sn ; Z2 ) ∼
= Z2 (15.186) entire cocycle in
this case. There are
three nontrivial
To define it we let σi = (i, i + 1), 1 ≤ i ≤ n − 1 be the transpositions generating Sn . Then involutions, again
giving 8 possible
nonisomorphic
Ŝn is generated by σ̂i and a central element z satisfying the relations: cocycles. ♣
♣Should also
consider central
extensions of Z2 by
z2 = 1 Z2 ⊕ Z2 . ♣
σ̂i2 = z
(15.187)
σ̂i σ̂i+1 σ̂i = σ̂i+1 σ̂i σ̂i+1
σ̂i σ̂j = zσ̂j σ̂i j >i+1
Remarks:
1. One generally associates cohomology with the subject of topology. There is indeed
a beautiful topological interpretation of group cohomology in terms of “classifying
spaces.”
2. In the case where G is itself abelian we can use more powerful methods of homological
algebra to classify central extensions.
3. The special case H 2 (G, U (1)) (or sometimes H 2 (G, C∗ ), they are the same) is known
as the Schur multiplier. It plays an important role in the study of projective repre-
sentations of G. We will return to this important point.
– 331 –
These are different mathematical structures! For example, for finite groups the bundle
is of course trivial because any global section is also continuous. However, as we have
just seen the extensions might be nontrivial. It is true, quite generally, that if a central
extension is trivial as a group extension then G̃ = A × G and hence π : G̃ → G is
trivializable as an A-bundle. ♣In general a
central extension by
U (1) is equivalent
to a line bundle
over the group and
you should explain
that here. ♣
Exercise
Suppose that the central extension (15.22) is equivalent to the trivial extension with
G̃ = A × G, the direct product. Show that the possible splittings are in one-one correspon-
dence with the set of group homomorphisms φ : G → A.
Exercise
Construct cocycles corresponding to each of the central extensions in (15.177) and
show how the automorphisms of Z2 × Z2 account for the the fact that there are only four
entries in (15.177) while (15.184) is order 8.
Exercise D4 vs. Q
a.) Show that D4 and Q both fit in exact sequences
1 → Z4 → D4 → Z2 → 1 (15.188)
1 → Z4 → Q → Z2 → 1 (15.189)
b.) Are these central extensions?
c.) Are D4 and Q isomorphic? 173
Exercise
Choosing the natural section s : σi → σ̂i in (15.187) and find the corresponding cocycle
fs .
173
Answer : No. D4 has 5 nontrivial involutions: The reflections in the four symmetry axes of the square
and the rotation by π, while Q has only one nontrivial involution, namely −1.
– 332 –
Exercise Due Diligence
Show that the associative law for the twisted product (15.102) is equivalent to the
cocycle condition on the 2-cochain f .
– 333 –
Figure 38: Spectrum of a particle on a circle as a function of B = eB/2π. The upper left shows
the low-lying spectrum for B = 0. It is symmetric under m → −m. The upper right shows the
spectrum for B = 0.2. There is no symmetry in the spectrum. The lower figure shows the spectrum
for B = 1/2. There is again a symmetry, but under m → 2B − m = 1 − m. In general there will be
no symmetry unless 2B ∈ Z. If 2B ∈ Z the spectrum is symmetric under m → 2B − m.
– 334 –
We are going to analyze the symmetries of this system and compare their realization
in the classical and quantum theories.
Classical Symmetries:
We begin by analyzing the classical symmetries. Because the θ-term does not affect
the classical dynamics the classical system has O(2) symmetry. We can rotate: R(α) :
eiφ → eiα eiφ , or, if you prefer, translate φ → φ + α (always bearing in mind that α and φ
are only defined modulo addition of an integral multiple of 2π). If we think of the circle
in the x − y plane centered on the origin, with the solenoid along the z-axis then we could
also take as usual: !
cos α sin α
R(α) = . (15.194)
− sin α cos α
Also we can make a “parity” or “charge conjugation” transformation P : φ → −φ.
The second term in the Lagrangian is not invariant but this “doesn’t matter” because it is
a total derivative. Put differently: φ → −φ is a symmetry of the equations of motion, and
hence it is a classical symmetry.
Note that these group elements in O(2) satisfy
R(α)R(β) = R(α + β)
P2 = 1 (15.195)
P R(α)P = R(−α)
with ω : hP i ∼
= Z2 → Aut(SO(2)) ∼= Z2 acting by taking the nontrivial element of Z2 to
the outer automorphism that sends R(α) → R(−α).
Now let us consider the quantum mechanics with the “θ-term” added to the La-
grangian. Our goal is to see how that term affects the quantum theory.
We will first analyze the quantum mechanics in the Hamiltonian approach. See the
remark below for some remarks on the path integral approach. The conjugate momentum
is
eB
L = I φ̇ + (15.197)
2π
We denote it by L because it can be thought of as angular momentum.
Note that the coupling to the flat gauge field has altered the usual relation of angular
momentum and velocity. Now we obtain the Hamiltonian from the Legendre transform:
Z Z
1 eB 2
Lφ̇dt − S = (L − ) dt (15.198)
2I 2π
– 335 –
∂
Upon quantization L → −i~ ∂φ , so the Hamiltonian is
2
~2
∂
HB := −i −B (15.199)
2I ∂φ
eB
where B := 2π~ .
The eigenfunctions of the Hamiltonian HB are just
1
Ψm (φ) = √ eimφ m∈Z (15.200)
2π
They give energy eigenstates with energy
~2
Em = (m − B)2 (15.201)
2I
There is just one energy eigenstate for each m ∈ Z.
Before moving on with the analysis of the symmetries in this quantum mechanical
problem let us take the opportunity to make a long list of:
Remarks:
1. The action (15.193) makes good sense for φ valued in the real line or for φ ∼ φ + 2π,
valued in the circle. Making this choice is important in the choice of what theory
we are describing. Where - in the above analysis - did we make the choice that the
target space is a circle? 176
2. Taking φ ∼ φ + 2π, even though the θ-term is a total derivative it has a nontrivial
effect on the quantum physics as we can see since B has shifted the spectrum of the
quantum Hamiltonian in a physically observable fashion: This is how we see that
topological terms matter.
3. Note that when 2B is even the energy eigenspaces are two-fold degenerate, except
for the ground state at m = B. On the other hand, when 2B is odd all the energy
eigenspaces are two-fold degenerate, including the ground state. If 2B is not an
integer all the energy eigenspaces are one-dimensional. See Figure 38.
and
U HB U −1 = HB+1 (15.203)
176
Answer : If we took the case where φ is valued in R and not the circle then there would be no quantization
on m and the spectrum of the Hamiltonian would be continuous. In this case the Chern-Simons term would
not affect the physics in the quantum mechanical version as well.
– 336 –
5. The quantum mechanics problem (15.193) and the spectrum (15.201) arise in the
discussion of the “Coulomb blockade” in physics of quantum dots. See Yoshimasa
Murayama, Mesoscopic Systems, Section 10.10.
6. Viewing the system as a field theory. We have introduced this system as describing
the quantum mechanics of a particle. However, it is important to note that it can
also be viewed as a special case of a quantum field theory. In general, in a field theory
177 we have a spacetime M and the fields φ are functions on M valued in some target
space X . (So the term “target space” means nothing more or less than the codomain
of the fields.) An important example is that of a nonrelativistic particle of mass m
moving on a Riemannian manifold X with metric ds2 = gµν (x)dxµ ⊗ dxν . The action
would be Z
m
S = dt gµν (x(t))ẋµ ẋν dt (15.204)
2
If, in addition, the particle has charge e and there is an electromagnetic potential
Aµ (x)dxµ on X then the action is
Z Z
m
S = dt gµν (x(t))ẋ ẋ dt + eAµ (x(t))ẋµ dt
µ ν
(15.205)
2
Here M is the manifold of time. It could be M = R if we describe the entire history of
the particle, or M = [tin , tf in ] if we describe only the motion in a finite time interval.
As we will soon see, it can also be interesting to let M = S 1 . The “field” is a suitably
differentiable map
x:M →X (15.206)
describing the position of the particle as a function of time. This is an example of a
“0 + 1 dimensional field theory.” A generalization would be a theory of maps from a
d-dimensional spacetime with metric hab dσ a dσ b and action
Z p 1
S = dd+1 σ |deth|hab (σ) mgµν (x(t))∂a xµ ∂b xν (15.207)
2
and the “field” would be a suitably differentiable map:
x:M →X (15.208)
Equations (15.205) and (15.207) are examples of what is known as a “nonlinear sigma
model.” 178 In our case our fields are maps
eiφ : M → S 1 (15.212)
177
As traditionally conceived. The topic of topological field theory generalizes the next few lines consid-
erably.
178
For the mathematically sophisticated reader we note that, for general nonlinear sigma models
dx : Tσ M → Tx(σ) X (15.209)
†
is a linear map between two inner-product spaces. We can use the inner products to define (dx) and then
the kinetic term is Z
Tr((dx)(dx)† )vol (h) (15.210)
M
– 337 –
We have been referring to φ → −φ as “parity” because that is the appropriate term
in the context of the quantum mechanics of a particle constrained to a circle in the
plane. The parity operation is just reflection around some line in the plane. However,
if we take the point of view that we are discussing a 0 + 1 dimensional “field theory”
then it would be better to refer to the operation as “charge conjugation” because it
complex conjugates the U (1)-valued field eiφ .
In addition there are (in the field theory interpretation) “worldvolume symmetries”
of time translation invariance and time reversal. These form the group R o Z2 . We
will put those aside. (Note that time reversal is not a symmetry of the second term
in the Lagrangian but is a symmetry of the space of solutions of the equations of
motion.)
7. Relations to higher dimensional field theories and string theory. The θ-term we have
added has a very interesting analog in 1 + 1 dimensional field theory, where it is
known as a coupling to the B-field. It can also be obtained from a Kaluza-Klein
reduction of 1 + 1 dimensional Maxwell theory:
Z Z
1 θ
S = 2 dx0 dx1 F01 2
+ F01 dx0 ∧ dx1
e 2π
Z Z (15.213)
1 θ
= 2 F ∗F + F
e 2π
In 1 + 1 dimensional theory we can choose A0 = 0 gauge and gauge away the x1
dependence so that on S 1 × R the only gauge invariant quantity is
A1 dx1
H H
eiφ(t) = ei S1 A
= ei S1 (15.214)
With this in mind we can say
θ = 2πB (15.215)
The theta term also has a close analog in 3 + 1-dimensional gauge theory. In the case
of 3 + 1 dimensional Maxwell theory we can write
Z Z
4 1 µν θ µνλρ
S = d x 2 Fµν F + Fµν Fλρ d4 x
4e 8π
Z Z (15.216)
1 θ
= F ∗F + F ∧F
2e2 (4π)
More to the point dx is a section of (T M )∨ ⊗ f ∗ (T X) and we can use the metric on this bundle to write
k dx k2 . In the case of the charged particle moving on the Riemannian manifold X, there is also the
R
M
data of a principal U (1) bundle with connection d + A and the topological term is based on the holonomy
of the pulled back connection: Z I
m
S= k dx k2 dt + ex∗ (A) (15.211)
2
There are similar topological terms for the d > 0 sigma models.
– 338 –
In fact, in the effective theory of electromagnetism in the presence of an insulator a
very similar action arises with a θ term. If a parity- and/or time-reversal symmetry
is present then θ is zero or π, corresponding to our case 2B ∈ Z. The difference
between a normal and a topological insulator is then, literally, the difference between
2B being even (normal) and odd (topological), respectively. Finally, in the 3+1-
dimensional Yang-Mills theories that describe the standard model of electro-weak
and strong interactions one can add an analogous θ-term. Topological terms matter,
and in this case the topological term for the strong gauge field leads to the prediction
of an intrinsic electric dipole moment of the neutron. However, to excellent accuracy
it is known that if the neutron dipole moment it is very small and
One of the great unsolved mysteries about nature is why the (effective) theta angle
for the strong interactions in the standard model is so small. 179
Now let us get back to the symmetries of the particle on the ring. We have seen that the
classical “internal” symmetry group - the “internal” symmetry group of the equations of
motion - is O(2). Now let us analyze how the symmetries are implemented in the quantum
theory:
In quantum mechanics the SO(2) shift symmetry φ → φ+α is realized by a translation
operator ρ(R(α)) = R(α) and acting on Ψm we have
Can we also represent ρ(P ) = P on the Hilbert space? Classically, parity symmetry
P just takes φ → −φ. If we make this substitution in the Hamiltonian HB we see that the
naive parity operation takes
PHB P −1 = H−B (15.219)
For general values of B the operator HB is not unitarily equivalent to H−B . However,
thanks to (15.203) it is clear that when 2B ∈ Z they are unitarily equivalent and the naive
operation of taking φ → −φ, which takes m → −m on eigenvectors of HB , should be
accompanied by U 2B . Therefore P should map the eigenspace associated with m to that
associated with with 2B − m. As a sanity check note that indeed Em = E2B−m . Therefore
we should define a parity operation:
P · Ψm = ξm Ψ2B−m (15.220)
where ξm is a phase which we can take to be 1. Note that the operator P so defined ♣You should allow
the possibility of a
commutes with the Hamiltonian: Indeed, it takes eigenvectors to eigenvectors with the phase in the
definition of P and
same eigenvalue. show in detail it
doesn’t matter. ♣
If 2B is not an integer the parity symmetry is broken and the quantum symmetry
group is just SO(2).
179
For much more about this see M. Dine’s TASI lectures https://arxiv.org/pdf/hep-ph/0011376.pdf.
– 339 –
Now consider the case when 2B ∈ Z and let us study the relations obeyed by the
operators R(α) and P and compare them with the classical relations (15.195). We still
have R(α)R(β) = R(α + β) and P 2 = 1 but now the third line of (15.195) is modified to:
We now consider the group of operators generated by the operators P, R(α), and z1H
where z ∈ U (1). (Do not forget that we identify α ∼ α + 2π. This will be quite important
in what follows.) Denote this group of operators by GB . Naively we might have expected
this group of operators on Hilbert space to be isomorphic to U (1) × O(2) where O(2) is our
classical symmetry group and U (1) is just the group of phases acting on wavefunctions by
scalar multiplication. However, equation (15.221) is not satisfied by a direct product. So,
how is GB related to U (1) and O(2)? General principles tell us it will be an extension
But it is called the spin group because it comes with a nontrivial double cover:
the double covering is given by restricting our standard projection π : SU (2) → SO(3) to
the subgroup of SU (2) in (15.224). In this way we get a double cover of the rotation group
around the z axis:
π : R̂(α̂) 7→ R(2α̂) (15.226)
– 340 –
See equation (15.48) above.
Now, taking Z2 to act on Spin(2) by the nontrivial outer automorphism. So, denoting
the nontrivial element of Z2 by P̂ we use the homomorphism α : Z2 → Aut(Spin(2)) defined
by
α(P̂ ) : R̂(α̂) → (R(α̂))−1 = R̂(−α̂) (15.227)
Then, one definition of the group Pin+ (2) is that it is the semidirect product:
Pin+ (2) ∼
= Spin(2) oα Z2 (15.228)
(We will give a slightly different definition below.) There is a generalization of Spin and
Pin groups to higher dimensions. They double cover SO(d) and O(d), respectively. See
the remark below for a brief description and Chapters *** and **** for full details.
Now, when 2B is an odd integer the group GB is generated by
z1H
ρ(R̂(α̂)) := e−i(2B)α̂ R(2α̂) 0 ≤ α̂ < 2π (15.229)
ρ(P̂ ) := P
where we take P̂ to be the nontrivial element in Z2 in the semidirect product that defines
Pin+ (2), so that R̂(α̂) and P̂ generate Pin+ (2). One checks that ρ is a homomorphism and
the image under ρ is an isomorphic copy of Pin+ (2) inside GB , and we have:
where Z2 ∼
= {±1} ⊂ U (1). When 2B is odd ρ has no kernel. (When 2B is even there is a
kernel.)
In conclusion:
2. In the quantum theory when 2B is not an integer the symmetry is broken to SO(2).
3. In the quantum theory, when 2B is an even integer the theory still has O(2) symmetry.
The sequence
1 / U (1) / GB / O(2) /1 (15.231)
splits and GB ∼
= U (1) × O(2).
does not split. When we try to realize the classical O(2) symmetry on the Hilbert
space we are forced to implement the pin double cover Pin+ (2), a central extension
of O(2) by Z2 . It is related to GB as in (15.230).
– 341 –
We conclude with some remarks:
1. We stress that the particle we put on the ring did NOT have any intrinsic spin!!
2
Having said that, if we define an angular momentum L so that H = L2I then indeed
when B is half-integral the angular momentum has half-integral eigenvalues, as one
expects for a spin representation. So, what we are finding is that the half flux quantum
is inducing a half-integral spin of the system so that the classical O(2) symmetry of
the classical system is implemented as a Pin+ (2) symmetry in the quantum theory.
This is an intriguing phenomenon appearing in quantum symmetries with nontrivial
gauge fields and topological terms: The statistics and spins of particles can be shifted
from their classical values, often in ways that involve curious fractions.
2. Spin And Pin Groups. Enquiring minds will wonder about the definition of Pin± (d).
These groups are defined using Clifford algebras. For much more detail and moti-
vation see the two chapters on Clifford algebras and Spin groups. In brief, consider
the Clifford algebra generated by {γi , γj } = 2Qij where Qij is an invertible d × d
symmetric matrix. For a vector v i define γ(v) := v i γi . Assume that Qij = δij . Then
Pin+ (d) is the group of expressions of the form
±γ(v1 ) · · · γ(vr ) (15.233)
−
for some r. The group Pin (d) is similarly defined with Qij = −δij . The group
Spin(d) is the subgroup of such expressions where r is even. The projection π :
Pin± (d) → O(d) is defined by the equation:
γ(π(g) · w) = (−1)r gγ(w)g −1 (15.234)
The key idea here is that
−γ(v)γ(w)γ(v)−1 = γ(Rv (w)) (15.235)
where Rv (w) is the reflection of w through the plane orthogonal to v, as the reader
can easily check in an exercise below. Then use the fact that all elements of O(d) are
products of reflections. The restriction to Spin(d) defines π : Spin(d) → SO(d). This
is a generalization of our standard double-covering π : SU (2) → SO(3). Although
Spin(3) ∼ = SU (2) for d > 3 Spin(d) is not isomorphic to a unitary or orthogonal
group. The difference between Pin+ (d) and Pin− (d) is whether the lift of a reflection
will square to +1 or −1 respectively. As a group Pin− (d) is isomorphic to (Spin(2) o
Z4 )/Z2 .
– 342 –
We stress the appearance of α̂ in the representation matrix. This transformation cor-
responds to a translation of φ by α = 2α̂. Had we tried to express the representation
in terms of α we would encounter the phase e±iα/2 which is not well-defined because
α is only defined modulo α ∼ α + 2π.
4. As pointed out in a recent paper 180 this extension of the symmetry group at half-
integral θ is an excellent baby model for how one can learn about nontrivial dynamics
of quantum systems (in particular, QCD) by thinking carefully about group exten-
sions. For example, if we were to add a potential U (φ) to the problem we just dis-
cussed we could no longer solve exactly for the eigenstates. Also, a generic potential
would be of the form
X X
U generic (φ) = cn cos(nφ) + sn sin(nφ) (15.237)
n∈Z n∈Z
Potentials with generic coefficients will explicitly break all of the O(2) symmetry.
Suppose however, that we can restrict attention to a special class of potentials with
only cosine Fourier coefficients that are 0 mod 2:
X
U special (φ) = un cos(2nφ) (15.238)
n
Then, even though we cannot solve the spectrum of the Hamiltonian exactly we
can make an interesting statement about it. For such potentials the classical O(2)
symmetry is explicitly broken to Z2 × Z2 generated by P : φ → −φ and r : φ → φ + π.
We have shown that when 2B is odd and the potential is zero the O(2) symmetry
is centrally extended and realized as the double-cover Pin+ (2) on the Hilbert space.
The double cover of the subgroup group hP, ri ⊂ O(2) acting on Hilbert space is
described by the pullback diagram:
π1
1 / Z2 ι / D4 / Z2 × Z2 /1 (15.239)
Id ι ι
π2
1 / Z2 / Pin+ (2) / O(2) /1
To check this note that P lifts to the operators ±P and r = R(π) lifts to the operators
One checks that hP, r̂i generates a group isomorphic to D4 . Indeed, these operators
satisfy the defining relations of D4 : P 2 = 1, r̂4 = 1, and P r̂P = r̂−1 . Note that,
in addition r̂2 = −1 on the entire Hilbert space. The representation on the Qbit
groundstate in (15.236) (with α̂ = ±π/2) is a two-dimensional irrep of D4 . In fact
all the doubly degenerate energy eigenspaces are two-dimensional irreps of D4 .
180
D. Gaiotto, A. Kapustin, Z. Komargodski and N. Seiberg, “Theta, Time Reversal, and Temperature,”
https://arxiv.org/pdf/1703.00501.pdf
– 343 –
It is reasonable to assume that when we turn on a weak potential of the form (15.238)
the classically preserved Z2 × Z2 subgroup again lifts to a D4 action on the Hilbert
space, even though we can no longer construct the operators in the D4 group explic-
itly. The cocycle is discrete: So if it is a continuous function of the parameters un at
un = 0 (this is an assumption) then the classical Z2 × Z2 symmetry must be realized
by D4 on the Hilbert space.
Now, we saw that, in the absence of the potential, there is a Qbit giving a two-
dimensional representation of Pin+ (2). This representation restricts to an irreducible
two-dimensional representation of D4 . (See equation (15.236) with α̂ = ±π/2.) Now,
D4 has four one-dimensional irreducible representations 1±,± and, we will show later,
exactly one two-dimensional irreducible representation. In particular, the set of repre-
sentations of D4 is discrete. Again, it is reasonable to suppose that the representation
is a continuous function of un . Again, this is an assumption. But granting this, turn-
ing on a weak potential cannot change the decomposition of the energy eigenspaces
into irreducible representations. This leads to a striking prediction: The two-fold
groundstate degeneracy is not broken by potentials of the form (15.238) when 2B is
odd! This is remarkable when one compares to the standard discussion of the double-
well potential of one-dimensional quantum mechanics. In that standard case one has
a two-fold classical degeneracy broken by tunneling (instanton) effects so that there
is a unique ground state. For potentials of the form (15.238) there are (generically)
four stationary points of the potential, at φ = 0, ±π/2, π. Generically, two will be
maxima and two will be minima. So, classically, and perturbatively in quantum me-
chanics, for a generic potential of the form (15.238) there will be a two-fold degenerate
groundstate. However, unlike the textbook discussion of the double-well potential,
the degeneracy will not be lifted by nonperturbative tunneling effects.
Exercise
Show that the ground state energy is
~2
Eground = M inm∈Z (m − B)2 (15.241)
2I
and, using the floor function, give a formula for Eground directly in terms of B (without
requiring minimization).
181
Answer : The key equation to check is (15.235). To check this consider the two cases that w is parallel
to v and that w is perpendicular to v. Then note that every element in O(d) is a product of reflections.
– 344 –
b.) Show that the general definition of Pin+ (d) specializes to the definition of Pin+ (2)
as a semidirect product. 182
β
This is in fact a very interesting function of 2I and B. Some immediate facts we can note
are
182
Answer : First reproduce Spin(2) using the general definition. We can represent the d = 2 Clifford
algebra by σ 1 , σ 2 and then the product of two vectors of the form γ(v) with v 2 = 1 is a matrix of the form
(x1 σ 1 + x2 σ 2 )(y 1 σ 1 + y 2 σ 2 ) where v1 = (x1 , x2 ) and v2 = (y 1 , y 2 ) are unit vectors in R2 . Multiplying this
out we get
cos θ + sin θσ 1 σ 2 (15.242)
where θ is the angle between v1 and v2 . Now let P̂ be represented by σ 1 (or γ(w) for any unit vector w).
183
Answer : D4 has generators x, y with x2 = 1 and y 4 = 1 and xyx = y −1 . In a one-dimensional
representation x, y will be represented by complex numbers. So we solve the above equations with x, y ∈ C.
Clearly x ∈ {±1} and then y 2 = 1 so y ∈ {±1} and there is no correlation between the choice of sign for x
and the choice of sign for y.
184
Hint:It is important to think about which group the cocycle and coboundaries take values in.
– 345 –
1. The expression is manifestly periodic under integer shifts of B, illustrating the general
claim above that the theory is invariant under integral shifts of the “theta angle” B.
2. Moreover, at low temperature, β → ∞ there is a single dominant term from the sum,
unless B is a half-integer, in which there are two equally dominant terms - this reflects
the double degeneracy of the ground state when 2B is odd: The ground state is a
Qbit. A standard technique in field theory is to study the IR behavior of a partition
function to learn about the ground states of the system.
We are going to see that this system in fact has a very interesting high/low temperature
duality and use this to understand better the θ-dependence of the previous example in terms
of path integrals.
To relate Z to a path integral we observe that we can write:
Z 2π
Z= dφhφ|e−βH |φi (15.245)
0
Now, we can interpret hφ|e−βH |φi as a specialization of the matrix elements of the Euclidean
time propagator
1 X − tE (m−B)2 +im(φ1 −φ2 )
hφ2 |e−tE HB |φ1 i = e 2I (15.246)
2π
m∈Z
Under good conditions this family of operators for t ∈ R has an “analytic continuation”
to part of the complex plane. What this means is that there is a well-defined family of
operators
e−izH/~ (15.248)
where z takes values in a region R ⊂ C. The region R should, at least contain the real axis
of time on its boundary (or closure). To see that there might be restrictions on R suppose
that R = C. Then we can consider the restriction to the imaginary axis, setting
z = −itE tE ∈ R (15.249)
– 346 –
Feynman’s argument proceeds just as well with e−tE H/~ . In fact, formally, it is better since
the integral has better (formal) convergence properties for Euclidean actions whose real
part is bounded below.
Now, by setting φ1 = φ2 = φ and integrating over φ we are making Euclidean time
periodic, with period β and computing the path integral on a compact spacetime, namely,
the circle. The path integral for φ1 = φ2 is done with boundary conditions on the fields so
that φ(0) = φ(β). This is precisely the kind of boundary condition that says that φ(t) is
defined on a circle. More details on path integrals are available in many textbooks. See,
for examples:
1. Feynman and Hibbs, Quantum Mechanics and Integrals
2. Feynman, Statistical Mechanics
3. C. Itzykson and J.B. Zuber, Quantum Field Theory,
4. J. Zinn-Justin, Quantum Field Theory and Critical Phenomena,
In this subsubsection we will henceforth drop the subscript E on tE and just use t for
the real Euclidean time coordinate.
H
In the Wick rotation to Euclidean space the “θ-angle” eA remains real so the matrix
elements of the Euclidean time propagator have the path integral representation:
Z
φ(β)=φ 1 β 1 2
R R
−βH
hφ2 |e |φ1 i = Z(φ2 , φ1 |β) := [dφ(t)]φ(0)=φ12 e− ~ 0 2 I φ̇ dt−i Bφ̇dt (15.250)
(One must be careful with the sign of the imaginary term, and it matters.)
Viewed as a field theory, this is a free field theory and the path integral can be done
exactly by semiclassical techniques:
The equation of motion is simply φ̈ = 0. Again, the θ-term has not changed it.
Thus, the classical solutions to the equations of motion with boundary conditions
φ(0) = φ1 , φ(β) = φ2 are:
φ2 − φ1 + 2πw
φcl (t) = φ1 + t w∈Z (15.251)
β
These are solutions of the Euclidean equations of motion, and are known as “instantons”
for historical reasons. Notice that because of the compact nature of the spacetime on which
we define our 0 + 1 dimensional field theory there are infinitely many solutions labeled by
w ∈ Z. There are two circles in the game: The spacetime of this 0 + 1-dimensional field
theory is the Euclidean time circle. Then the target space of the field theory is also a circle.
The quantum field eiφ(t) is a map M → X . M , which is spacetime is S 1 and X , which is the
target space is also X . As we saw: π1 (S 1 ) ∼
= Z. There can be topologically inequivalent
field configurations. That is the space of maps M ap(M → X ) has different connected
components. The different topological sectors are uniquely labelled by the winding number
of the map (15.252). In the path integral we sum over all field configurations so we should
sum over all these instanton configurations.
– 347 –
We now write
φ = φcl + φq (15.253)
where φcl is an instanton solution, as in (15.251), and φq is the quantum fluctuation with
φq (0) = φq (β) = 0, and, moreover, φq (t) is in the topologically trivial component of
M ap(S 1 , X ). The action is a sum S(φcl ) + S(φq ), precisely because φcl solves the equation
of motion. Indeed the integral factorizes and we just get:
X − 2π2 I (w+ φ2 −φ1 )2 +2πiB(w+ φ2 −φ1 )
Z(φ2 , φ1 |β) = Zq e β 2π 2π (15.254)
w∈Z
The summation runs over classical solutions, weighted by the value of the classical action
on that solution.
Zq is the path integral over φq :
Z Rβ 1 2
φ(β)=0
Zq = [dφq (t)]φq (0)=0 e− 0 2 I φ̇ dt (15.255)
We are integrating over the space of “all” maps φq : [0, 1] → U (1) with φq (0) = φq (1) = 0
that are homotopically trivial. We can do it by noticing that this is a Gaussian integral.
Now in finite dimensions we have the integral
n
dxi
Z Y
1 i j i 1 1 −1 ij
√ e− 2 x Aij x +bi x = √ e 2 bi (A ) bj (15.256)
i=1
2π detA
where Re(A) > 0 is a symmetric matrix. When A can be diagonalized by a real orthogonal
transformation we can replace
n
Y
detA = λi (15.257)
i=1
where the product runs over the eigenvalues of A. Thus, we need to generalize this expres-
sion to the determinant of an infinite-dimensional “matrix”
Z 1
I d2
Z
−1/2
[dφq ]exp[− φq (− 2
)φq ] = (2π)Det0 (O) (15.258)
0 2β dτ
Here the prime on the determinant means that we have omitted the zero-mode and the
I d2
analog of A is the operator O = − 2~β dτ 2
.
One way to make sense of DetO for an operator O on Hilbert space is known as
“ζ-function regularization.” (It will only work for a suitable class of operators.) Note that
d
|s=0 λ−s = −logλ (15.259)
ds
So if we define X
ζO (s) := λ−s (15.260)
λ
where we take the sum over the spectrum of O (and we assume O is diagonalizable with
discrete spectrum) then, formally:
Y
0
λ = exp[−ζO (0)] (15.261)
λ
– 348 –
For good operators O the spectrum goes to infinity sufficiently fast that ζO (s) exists as an
analytic function of s in a half plane Re(s) > N for some N . Moreover, ζO (s) also admits
an analytic continuation in s to an open region around s = 0. In this case, we can define
the determinant by the RHS of (15.261).
I d2 Iπ 2 s
For O = − 2~β dτ 2
we have ζO (s) = 2 2~β ζ(2s) where ζ(s) is the standard Riemann
ζ-function, and since
1 1
ζ(s) = − + slog √ + O(s2 ) (15.262)
2 2π
we have
β
Det0 (O) := exp[−ζO
0
(0)] = (15.263)
I
(There are some factors of 2 and π that need to be fixed in this equation.)
We can understand this result nicely as follows. Let us study the β → 0 behavior of
the path integral. Then for |φ1 − φ2 | < π,
I
− 2β (φ2 −φ1 )2 +iB(φ2 −φ1 ) −κ/β
Z → Zq e 1 + O(e ) (15.264)
where κ > 0. In plain English: the instantons are only important at large β. This is
intuitively very satisfying: At very small times β it must cost a lot of action for φ(t) to make
a nonzero number of circuites around the circle because the velocity must then be large, and
large velocity means large action. So for physical quantities based on such small fluctuations
the topologically nontrivial field configurations must contribute subleading effects. Let us
therefore compare Z(φ2 , φ1 |β) as β → 0 with the standard quantum mechanical propagator.
For small φ we can remove the phase from the B-field via ψ(φ) → e−iBφ ψ(φ), so Zq should
not depend on B. (Note this transformation is not globally defined in φ for generic B so
we cannot use it to remove B from the problem when we treat the full quantity Z exactly.)
After this transformation we expect to recover the standard propagator of a particle of
mass M = I on the line. Rotated to Euclidean space this would be:
s
2
M − M (φ2~β2 −φ1 )
e (15.265)
2π~β
so s
I
Zq = (15.266)
2π~β
The net result is that
s
I X − 2πβ2 I (w+ φ22π
−φ1 2 φ −φ
) −2πiB(w+ 22π 1 )
Z(φ2 , φ1 |β) = e (15.267)
2πβ
w∈Z
Now compare (15.246) (with tE = β) with (15.267). These expressions look very dif-
ferent! One involves a sum of exponentials with β in the numerator and the other with β in
the demoninator. One is well-suited to discussing the asymptotic behavior for β → ∞ (low
temperature) and the other for β → 0 (high temperature), respectively. Nevertheless, we
have computed the same physical quantity, just using two different methods. So they must
– 349 –
be the same. But the mathematical identity that says they are the same appears somewhat
miraculous. We now explain how to verify the two expressions are indeed identical using a
direct mathematical argument.
The essential fact is the Poisson summation formula discussed in section
In our case, the expression computed directly from the diagonalization of the Hamil-
tonian is
1 X − β (m−B)2 +im(φ1 −φ2 )
Z(φ2 , φ1 |β) = e 2I
2π
m∈Z (15.268)
1 iB(φ1 −φ2 ) θ
= e ϑ[ ](0|τ )
2π φ
where we have written it in terms of the Riemann theta function (11.237) with:
β
τ =i
2πI
θ = −B (15.269)
φ1 − φ2
φ=
2π
On the other hand, the expression that emerges naturally from the semiclassical eval-
uation of the Euclidean path integral is
s
I X − 2πβ2 I (w+ φ22π
−φ1 2 φ −φ
) −2πiB(w+ 22π 1 )
Z(φ2 , φ1 |β) = e
2πβ
w∈Z
s (15.270)
I θ 0
= ϑ[ 0 ](0|τ 0 )
2πβ φ
Note that the modular transformation law of the Riemann theta function, equation (??) is
equivalent to the relation between the expressions naturally arising from the Hamiltonian
and Lagrangian approaches to evaluation of the matrix elements of the Euclidean time
propagator!
Note in particular that for the partition function proper we have, as β → 0:
1/2 X
1 2πI I
−2π 2 n2 β~ +2πinB
Z(S ) = e
β~
n∈Z
1/2 (15.272)
2πI I
−2π 2 β~
∼β→0 1 + 2e cos(2πB) + · · ·
β~
– 350 –
The overall factor of β −1/2 gives the expected divergence. The first correction term to the
factor in parentheses is an instanton effect.
Note that in the Hamiltonian version the only thing that is manifest about the high-
temperature, β → 0, limit is that Z diverges. Note that for β → 0 all the terms in the sum
contribute about equally and the sum diverges. The modular transformation reveals an
interesting duality: Once we factor out this multiplicative divergence we discover another
theta function.
b.) Note that this expression is not invariant under φ → −φ. But in Pin+ (2) there is
an element P which corresponds to φ → −φ. How is this compatible with our argument
that Pin+ (2) is a valid symmetry of the quantum theory? 185
15.4.3 Gauging The Global SO(2) Symmetry, Chern-Simons Terms, And Anoma-
lies
When a theory has a symmetry one can implement a procedure called “gauging the sym-
metry.” This is a two-step process:
2. Integrate over “all possible” gauge fields consistent with the symmetry.
It is not necessary to proceed to step (2) after completing step (1). In this case, we say
that we are coupling to nondynamical external gauge fields. It makes perfectly good sense
to introduce nondynamical, external gauge fields for a symmetry. We do this all the time
in quantum mechanics courses where we couple our quantum system to an electromagnetic
field, but do not try to quantize the electromagnetic field.
For the more mathematically sophisticated reader the two-step process can be sum-
marized, somewhat more concisely and precisely, as saying that we:
1. Identify the symmetry group with the structure group of a principal bundle and we
change the bordism category in the domain of the field theory functor to include
G-bundles with connection (where G is the symmetry we are gauging).
185
Answer : Show that
P · |φi = e−iφ | − φi (15.274)
P
You can prove this by expanding |φi = m∈Z hΨm |φiΨm . Now, using this expression check that the
propagator indeed transforms correctly.
– 351 –
2. Sum over isomorphism classes of principal bundles and integrate over the isomorphism
classes of connections on those bundles.
In the present simple example of the charged particle on a ring surrounding a solenoid
we can “gauge” the global SO(2) symmetry φ → φ + α that is present for all values of
B. It is then interesting to see how coupling to the external gauge field tells us about
the subtleties of combining SO(2) symmetry with charge conjugation symmetry that we
studied above. (The following discussion was inspired by Appendix D of. 186 )
So in our simple example we implement Step 1 above as follows: We seek to make the
shift symmetry local, that is, we attempt to make
into a symmetry where α(t) is not a constant but an “arbitrary” function of time. When
α(t) is time dependent the action ∼ φ̇2 is not invariant under such transformations.
R
To compensate for this we introduce an extra function of time into the problem, call it
A(e) (t). Here the superscript e - for “external” - reminds us that this is an “external” or
“background” field: We will not do a path integral over these functions (unless we proceed
to Step 2 above). By contrast, we will do a path integral over the “dynamical” field φ(t)
or Φ(t) = eiφ(t) .
The gauged action is
Z I
1
S= I(φ̇ + A ) dt + B(φ̇ + A(e) )dt
(e) 2
2
Z 2 I (15.276)
1 −1 d (e) −1 d (e)
= I Φ(t) (−i + A )Φ(t) dt + BΦ(t) (−i + A )Φ(t)dt
2 dt dt
This action is a functional of both the nondynamical field A(e) (t) and the dynamical field
φ(t). Note that the action is invariant under the gauge transformation:
φ(t) → φ(t) + α(t)
(15.277)
A(e) (t) → A(e) (t) − ∂t α(t)
where, for the moment, we ignore boundary terms.
It turns out that it is better to regard the function A(e) (t) as a component of a 1-form:
and, better still, A(e) is the local one-form associated to a connection on a (locally trivial-
ized) principal SO(2) bundle over the time manifold M . We stress that the gauge field A(e)
is NOT the gauge field of electromagnetism. (That field has already produced our theta
term.) Rather, it is a new field in our system: It is a gauge field for the shift symmetry of
the field φ(t). 187
186
D. Gaiotto, A. Kapustin, Z. Komargodski and N. Seiberg, “Theta, Time Reversal, and Temperature,”
https://arxiv.org/pdf/1703.00501.pdf
187
The physical interpretation of the gauging process in terms of our original charged particle on a ring is
not completely clear to the author. But the mathematical structure makes sense and is a good toy model
for other field theoretic systems where the physical interpretation of the gauging is clear.
– 352 –
Note that we could also write the gauge transformation in the form:
This is better because is captures better the geometric content, Consequently, it makes
more sense when working on topologically nontrivial spacetimes, such as the Euclidean
time circle. One can make choices of gauge group so that it becomes important that eiα(t)
can be single-valued even when α(t) is not.
What about charge conjugation symmetry?
First consider the classical theory. In the absence of the external gauge field we noted
that there is an O(2) symmetry of the equations of motion, even though under φ(t) → −φ(t)
the theta term in the action flips sign. In the presence of the external gauge field the
equations of motion are modified, however, as we will see below, we can gauge A(e) (t) to
be a constant, and in this case they are not modified. So we still have an O(2) symmetry.
Now consider the quantum theory. One can show that, appropriately defined, the
quantum Hamiltonian is still HB . Under charge conjugation we must flip B and then we
change the Hamiltonian (unless B = 0). But, as noted above, if 2B ∈ Z in that case HB
is unitarily equivalent to H−B and we can implement a unitary operator P corresponding
to the charge conjugation operation. In the path integral the value of the action matters.
The action (15.276) is invariant under the charge conjugation transformation if we take
φ(t) → −φ(t)
A(e) → −A(e) (15.280)
B → −B
and consequently if we change B → −B we must also take A(e) → −A(e) as noted above.
We will return to the quantum implementation of charge conjugation symmetry.
Now let us re-examine the periodicity of the physics as a function of B. In the absence
of the external gauge field A(e) we found that physical quantities are periodic functions of
B with period one. However, in the presence of a nonzero A(e) , the term BA(e) (t)dt spoils
R
the periodicity in B, because the value of the action matters in the quantum theory.
We can restore a kind of periodicity in B by adding a Chern-Simons term to the action.
We will comment in detail on the Chern-Simons term below. In Euclidean space the new
action is:
1 (e) (e) (e)
I(φ̇+At )2 dt−i
R H R
e−S = e− 2
B(φ̇+At )dt ik
e At dt (15.281)
and the last factor is the Chern-Simons term. By introducing the Chern-Simons term we
have introduced yet another parameter, the level k, into our theory. Classically the action
with (B, k) is equivalent to the action with (B + r, k + r) where r ∈ R is any real number.
Now that we have restored some kind of periodicity we can ask about quantum im-
plementation of charge conjugation symmetry. We must take B → −B, but quantum
mechanically the theory with B is only equivalent to that with −B when 2B ∈ Z. So, in
– 353 –
the quantum theory we can only hope to have charge conjugation invariance if there is an
integer N so that
(B + N, k + N ) = (−B, −k) (15.282)
In other words k = B = N/2 ∈ Z/2.
The introduction of the Chern-Simons term raises a new issue: When one sees gauge
potentials in an action that do not enter through fieldstrengths or covariant derivatives it
is important to ask about gauge invariance. In order to discuss the gauge invariance of
the Chern-Simons term properly we need first to discuss more carefully the space of gauge
fields and the group of gauge transformations.
In our simple setting the space A of gauge fields can be identified with the space of
single-valued, continuous real-valued functions A(e) (t) on M .
We now must choose a gauge group G. This will be a Lie group - typically finite-
dimensional, although not necessarily connected. In our case, there are two natural choices:
We could take G = R or we could take G = U (1). Then the group of gauge transformations
is a group of maps
G = M ap[M → G] (15.283)
If M and G have positive dimension the group of gauge transformations will be an infinite-
dimensional Lie group.
Of particular interest in gauge theory is the quotient space A/G, the space of gauge
orbits, or, equivalently, the space of gauge-inequivalent field configurations.
Let us examine a few examples of A/G:
1. Let us first consider what happens when G = R and M is an interval or the real
line. Then the space A of gauge fields can be identified with the space of real-valued
continuous functions on M . The group G is the space of real-valued C 1 functions on
M , t 7→ α(t) ∈ R. The group G acts on A via
If M = [t1 , t2 ] with free boundary conditions on A and G then we can always solve
for some α(t) and hence we can always gauge A(e) (t) to zero. So A/G is just a point.
188
For the discussion below it is useful to note here that the expression
Z t2
(e)
exp[i At (t0 )dt0 ] (15.286)
t1
188
Actually, this is too naive: A/G is more properly thought of as a stack. See the section below on
groupoids.
– 354 –
is in general not gauge invariant. Rather:
Z t2 Z t2
(e) 0 0 (e)
exp[i At (t )dt ] → e iα(t1 )
exp[i At (t0 )dt0 ]e−iα(t2 ) (15.287)
t1 t1
is gauge invariant. There are no “local” invariants. From the previous discussion we
see that we can gauge A(e) (t) to zero in any compact region. In this case
A/G ∼
=R (15.289)
and the integral (15.288) fully determines the gauge equivalence class.
3. Now let us consider the case G = U (1), and let us also take M to be the Euclidean
time circle so
1
G = Map(Ss.t. → U (1)) (15.290)
Now, just viewing the gauge transformation as a set of continuous maps S 1 → S 1
there is a winding number. If this winding number is nonzero there is an obstruction
to finding a single-valued function α(t) so that g(t) = eiα(t) .
There is a normal subgroup G0 of small gauge transformations for which g(t) admits
a well-defined logarithm. That is, the gauge transformations g(t) ∈ G0 are of the
form g(t) = eiα(t) where α(t) is single-valued. Then
π
1 → G0 → G → Z→1 (15.291)
where the map π can be viewed as the winding number. Gauge transformations in
G0 are known as small gauge transformations. Those which have nonzero winding
numbers are known as large gauge transformations. It is worth noting that the above
sequence splits: For w ∈ Z we can take s(w) = gw to be the gauge transformation:
and gw (t)gw0 (t) = gw+w0 (t). Note that for these transformations if we tried to define
α(t) it would be α(t) = 2πwt/β and would not be single-valued when M is the circle.
Now, referring to (15.295) it is clear that if α(t) is single valued then
I
(e)
exp[i At (t0 )dt0 ] (15.293)
S1
is gauge invariant. However when we have a large gauge transformation gw (t) we can
cut the circle say at t = 0 and t = β and then, with α(t) = 2πwt/β with w ∈ Z the
– 355 –
holonomy is still gauge invariant. Note that the large gauge transformations gw (t)
take
(e) (e)
At (t0 ) → At (t0 ) + w/β (15.294)
so that on the circle the holonomy (15.293) is gauge invariant. Equation (15.295)
generalizes nicely to the case of nonabelian groups on arbitrary spacetimes.
We can ask if there are other independent gauge invariant functions of A(e) (t) be-
sides the holonomy. Since A(e) (t) is periodic we can decompose A(e) (t) in a Fourier
(e)
expansion. Write Ãt (t) for the sum of the nonzero frequency modes. Then we can
solve the differential equation
(e)
∂t α(t) = Ãt (t) (15.296)
is constant. Put differently, A/G0 can be identified with the space of real numbers,
given by the constant µ. We will denote Ared = A/G0 . Then the “large” gauge
transformations gw (t) shift µ → µ + 2πw, with w ∈ Z. Therefore we have
A/G ∼
= Ared /Z ∼
= {[µ] = [µ + 2πw] w ∈ Z} ∼
= U (1) (15.298)
and the holonomy eiµ around the circle is a complete gauge invariant.
4. Finally, let us take the gauge group to be G = O(2) = SO(2) o Z2 . Let GSO(2) be the
group of gauge transformations when the gauge group is SO(2) and GO(2) be the group
of gauge transformations when the gauge group is O(2). Then GO(2) = GSO(2) o Z2
and we have the exact sequence
1 → G0 → GO(2) → Z o Z2 ∼
= D∞ → 1 (15.299)
σ :µ → −µ
(15.300)
s :µ → µ + 2π
– 356 –
Now we are ready to discuss the gauge invariance of the Chern-Simons term. The
Chern-Simons term on M = S 1 is invariant under G0 for any value of k. However, under
the large gauge transformations gw (t) with w 6= 0:
I I
(e) 0 0 (e)
exp[ik At (t )dt ] → e 2πiwk
exp[ik At (t0 )dt0 ] (15.301)
S1 S1
and therefore, if we are going to allow our theory to make sense on a circle with the gauge
group SO(2) ∼ = U (1) then k should be quantized to be an integer. Note there would be no
such quantization of k if the gauge group is taken to be R.
The above observation is related to two extremely important conceptual points that
are essential to all discussions of the use of Chern-Simons terms in quantum physics:
It is not necessary for the action to be invariant. All that is necessary for a well-defined
path integral is that the exponentiated action must be invariant.
Gauge invariance of the “Chern-Simons term” under large gauge transformations implies
that the level k, one of the couplings of the theory, is quantized: k ∈ Z.
In our case the action in equation (15.281) is not gauge invariant! Under large gauge
transformations with winding number w we have S → S + 2πikw with w ∈ Z. However,
having a well-defined measure in the path integral only requires e−S to be well-defined,
and this will be the case if, and only if, k ∈ Z.
Note that I
(e)
exp[ik At (t0 )dt0 ] = eikµ (15.302)
S1
and, since k ∈ Z is quantized this is properly periodic under µ → µ + 2π and the expo-
nentiated Chern-Simons term descends to a well-defined function on A/G. For the group
O(2) we would have to consider cos(µ). (Of course, cos(nµ) is a Tchebyshev polynomial
of the basic invariant cos(µ).)
Anomalies
We can now discuss, very generally, the notion of anomalies. In quantum systems we
typically have both “dyanmical variables” such as dynamical fields, degrees of freedom,
etc. as well as “external” or “background” or “control” variables. We will denote generic
“background fields” by φbck and generic “dynamical fields” by φdyn . Any parameter of the
theory should be considered a “field.” The space of all fields is then fibered:
F dyn /F (15.303)
F bck
– 357 –
In the very simple situation we are discussing here the fibration is just a Cartesian product.
In the computation of physical quantities we will typically integrate over F dyn thus
producing a function (or, more generally, a section of a bundle) on F bck . We then study
the physical quantity as a function on F bck .
In our example F dyn can be taken to be the set of functions Φ(t) : M → U (1) and F bck
can be taken to be the set of functions A(e) (t), or better, the connections on a principal
G-bundle over M where in our present examples G = R, SO(2) or O(2). 189
Now suppose that there is a group G acting on F so that physical quantities are formally
invariant. For example, if we have an invariant action S[φdyn ; φbck ] and a formally invariant
measure, then the path integral will be formally invariant. Then, physical quantities such
as the partition function:
Z
dyn bck
bck
Z[φ ] = e−S[φ ;φ ] vol (φdyn ) (15.304)
F dyn
will, formally define a G-invariant function on F bck . However, it can happen that when
one defines the path integral carefully the partition function fails to be G-invariant. In
that case we say that there is a potential anomaly. Sometimes potential anomalies can be
removed by physically unimportant redefinitions. When this cannot be done we say there
is an anomaly.
If we tried to consider the Chern-Simons term for k ∈ / Z we would say it has an
anomaly. For any value of k it descends to a function on A/G0 . However, only when k ∈ Z
does it desend to a well-defined function on A/G.
It can happen that there can be different subgroups H1 ⊂ G and H2 ⊂ G such that
there are different definitions of the path integral so that it is invariant either under H1 or
under H2 but there is no definition so that it is invariant simultaneously under both H1
and H2 . In this case we say there is a mixed anomaly.
The Partition Function As Function On A And Its Behavior Under The Action Of G
Let us now illustrate some of the above ideas about anomalies by examining the par-
tition function in our example of the gauged particle on a ring.
There will not be any interesting anomalies under G0 . As we have explained we can
always use G0 to gauge A(e) to be a constant 1-form, and we will henceforth take our gauge
field to be constant. Then the equation of motion is the same as before, and performing
the path integral just as in the previous section we find
2 µ 2 µ
− 2πβ I (w+ 2π ) −2πiB(w+ 2π )
X
Z(µ) = eikµ Zq e
w∈Z (15.305)
µ/2π
= eikµ Zq ϑ[ ](0|τ )
−B
189
One can also promote B to be a field for fun and profit. This has been discussed in many places. See
e-Print: 1905.09315 for a recent discussion.
– 358 –
with τ = i 2πI
β . All we need to do here is replace the value of the classical action for solutions
with φ̇ = 2πw/β by making the substitution w → w + µ/2π.
As in the case without the external gauge field there is a Hamiltonian interpretation.
Performing the Poisson summation (or using the modular transformation law of the theta
function) we get:
1 X − β (m−B)2 −i(m−B)µ
Z(µ) = ei(k−B)µ e 2I (15.306)
2π
m∈Z
It can be shown that the Euclidean path integral with action (15.193) is in fact equal to
where Q is the operator measuring the charge of the SO(2) symmetry we gauged. In our
case QΨm = mΨm . 190 As noted above we still have
2
1 ∂
HB = −i −B (15.308)
2I ∂φ
What happens if we try to extend the gauge group to gauge the full O(2)? Then, as
we have seen, the quotient group G/G0 is the infinite dihedral group generated by σ and s
defined in equation (15.300) above.
If 2B is even then we can take k = B ∈ Z. The partition function is invariant under
µ → −µ and has the expected periodicity µ ∼ µ + 2π. In other words Z(µ) is invariant
under the group of large gauge transformations isomorphic to D∞ and generated by σ and
s, so it descends to a function on A/G and there is no anomaly.
Things are much more subtle when 2B is odd. As we saw, we can only expect charge
conjugation symmetry when
1
k =B ∈Z+ (15.309)
2
But this clashes with the constraint k ∈ Z. So we see an example of a mixed anomaly.
It is interesting to see how the mixed anomaly is manifested in the partition function.
The main point can be seen most easily by considering the leading term in the β → ∞
expansion which is (taking B = 1/2 for simplicity):
– 359 –
If k = 0 then
e−βEground
1 + e−iµ + · · ·
Z→ (15.311)
2π
the expression is properly periodic in µ, but not invariant under the analog of charge
conjugation: µ → −µ. This is not surprising since k 6= B.
As we will discuss below, by changing the physical system (yet again!) there is a
way to make sense of the half-integral level Chern-Simons term. If we just go ahead and
mindlessly substitute k = B = 1/2 in the above formula for Z(µ) we get:
e−βEground iµ/2
Z→ e + e−iµ/2 + · · · (15.312)
2π
The action is now invariant under the generator σ : µ → −µ of D∞ but it is no longer
invariant under the generator s : µ → µ + 2π.
Provided we view the different choices of Chern-Simons terms as different definitions
of the theory, we can define the theory to be invariant under the group generated by s, but
with that definition σ is anomalous, or we can define the theory to be invariant udner the
group generated by σ, but then with that definition s is anomalous. So in this sense there
is a mixed anomaly of Z and Z2 in the D∞ subgroup of global gauge transformations.
where F (e) = dA(e) . The RHS makes sense even if k is not an integer, but now the
expression depends on details of the gauge field in the “bulk” of the 1 + 1 dimensional
spacetime Σ.
A very analogous phenomenon is observed in real condensed matter systems where the
boundary theory of a 3+1 dimensional topological insulator is described by a Chern-Simons
theory with half-integral level. (That is, half the level allowed by naive gauge invariance.)
Such half-integral level Chern-Simons terms come up in many interesting physical
systems. For example, half-integral (spin) Chern-Simons theory is needed to describe the
topological features of the fractional quantum Hall effect. In supersymmetric field theories
and string theories many of the supergravity effective actions and brane effective actions
involve half-integrally quantized Chern-Simons terms.
Exercise Puzzle
Warning: This exercise requires some knowledge of topology.
– 360 –
Resolve the following paradox:
We first argued that, if k ∈
/ Z then the LHS of (15.313) is not invariant under large
gauge transformations. Then we proceeded to define the LHS by the expression on the
RHS which is manifestly gauge invariant.
How can these two statements be compatible? 191
1→A→G
e→G→1 (15.314)
In many of the examples above we had G Abelian and G e was also Abelian. However, as
our examples with Q and D4 have shown, in general G e need not be Abelian. (See equation
(15.177).) In this section we focus on an important class of examples where G is Abelian
and Ge is non-Abelian. They are known as Heisenberg groups and Heisenberg extensions. In
fact in the literature closely related but slightly different things are meant by “Heisenberg
extensions” and “Heisenberg groups.” These kinds of extensions show up all the time in
physics, in many different ways. They are very basic in quantum field theory and other
areas of physics, so we are going to dwell upon them a bit.
U (α) := exp[iαp̂]
(15.317)
V (β) := exp[iβ q̂]
These are unitary when α, β are real. When α is real U (α) implements translation in
position space by ~α. When β is real V (β) implements translation in momentum space by
−~β.
191
Answer : The gauge transformation eiα(t) must extend to a continuous map Σ → U (1). If Σ is a smooth
manifold whose only boundary is S 1 , as we have tacitly assumed in writing equation (15.313), then such
maps always restrict to small gauge transformations on the bounding S 1 .
– 361 –
The group of operators {U (α)|α ∈ R} is isomorphic to R because U (α1 )U (α2 ) =
U (α1 + α2 ). A similar statement holds for the group of operators V (β). But when we take
products of both U (α) and V (β) operators we do not get the group R ⊕ R of translations
in position and momentum, separately. Rather, one can show in a number of ways that:
Comparing (15.319) with (15.320) we arrive at (15.320). The reader should compare this
with our discussion of quantum mechanics with a finite number of degrees of freedom,
especially the derivation of (11.251).
Here is a third derivation of (15.318): Using (11.441) it follows that
With one (nice) choice of cocycle we can write the group law as:
i
(z1 , (α1 , β1 )) · (z2 , (α2 , β2 )) = (z1 z2 e 2 ~(α1 β2 −α2 β1 ) , (α1 + α2 , β1 + β2 )) (15.324)
where !
α
v= (15.326)
β
– 362 –
and the matrix !
0 1n
J= (15.327)
−1n 0
was used at the very beginning of the course (see equation (2.21)) to define the symplectic
group Sp(2n, κ) as the group of matrices such that Atr JA = J.
ω is called a symplectic form Note that
ω(A~v1 , A~v2 ) = ω(~v1 , ~v2 ) (15.328)
for A ∈ Sp(2, R). We say that ω is invariant under symplectic transformations.
Exercise
Referring to equations (15.318) and (15.316) et. seq.
a.) Show that the choice of section
s(α, β) = U (α)V (β) (15.329)
leads to the cocycle
f ((α1 , β1 ), (α2 , β2 ) = ei~(α1 β1 +α2 β2 +α1 β2 ) (15.330)
b.) Show that the choice of section
s(α, β) = exp[i(αp̂ + β q̂)] (15.331)
leads to the cocycle
i
f ((α1 , β1 ), (α2 , β2 ) = e 2 ~(α1 β2 −α2 β1 ) (15.332)
c.) Find an explicit coboundary that relates the cocycle (15.330) to (15.332). ♣NEED TO
PROVIDE
ANSWER HERE. ♣
– 363 –
15.5.2 Example: The Magnetic Translation Group For Two-Dimensional Elec-
trons
In the presence of an electromagnetic field the group of translations acting on charged
particles definitely becomes centrally extended. This shows up naturally when discussing a
charged nonrelativistic particle confined to two spatial dimensions and moving in a constant
magnetic field B. In one convenient gauge the Hamiltonian is
1 eBx2 2 eBx1 2 1 2
H= (p1 + ) + (p2 − ) = (p̃ + p̃22 ) (15.337)
2m 2 2 2m 1
where the gauge invariant momenta are p̃i := pi − eAi are
eB
p̃1 = p1 + x2
2 (15.338)
eB
p̃2 = p2 − x1
2
Ordinary translations are generated by p1 , p2 and do not commute with the Hami-
tonian: We have lost translation invariance. Nevertheless we can define the magnetic
translation operators:
eBx2 eBx1
π1 := p1 − π2 := p2 + (15.339)
2 2
Compare this carefully with the definitions of p̃i . Note the relative signs! These operators
satisfy [πi , p̃j ] = 0. In particular they are translation-like operators that commute with the
Hamiltonian: [πi , H] = 0. Hence the name. While they are called “translation operators”
note that they do not commute:
If we are interested in quantized values of a1 , a2 (as, for example, if the charged particle is
moving in a lattice, or is confined to a torus) then we obtain the basic relations (15.522).
Note that
exp[ieBa1 a2 /~] = exp[2πiΦ/Φ0 ] (15.343)
where Φ = Ba1 a2 is the flux through an area element a1 a2 and Φ0 = h/e is known as the
magnetic flux quantum. 192
192
One should be careful about a factor of two here since in superconductivity the condensing field has
charge 2e and hence the official definition of the term “flux quantum” used, for example, by NIST is
Φ0 = h/2e, half the value we use.
– 364 –
Remark: The group generated by the operators U = U (a1 ) and V = V (a2 ) is a
Heisenberg group, but it is also interesting to consider the algebra of operators generated by
U, V . This algebra admits a C∗-algebra structure and is sometimes referred to as the algebra
of functions on the noncommutative torus or the irrational rotation algebra. Abstractly it
is the C ∗ algebra generated by unitary operators U, V satisfying U V = e2πiθ V U for some θ.
The properties of the algebra are very different for θ rational and irrational. The algebra
Aθ figures prominantly in applications of noncommutative geometry to the QHE and in
applications of noncommutative geometry to toroidal compactifications of string theory.
1. The commutator function is gauge invariant, in the sense that it does not change
under the change of 2-cocycle f by a coboundary. (Check that! This uses the property
that G is abelian). It is therefore a more intrinsic quantity associated with the central
extension.
3. The extension G̃ is abelian iff κ(g1 , g2 ) = 1, that is, iff there exists a symmetric
cocycle f . 193
4. κ is skew :
κ(g1 , g2 ) = κ(g2 , g1 )−1 (15.346)
5. κ is alternating:
κ(g, g) = 1 (15.347)
6. κ is bimultiplicative:
κ(g1 g2 , g3 ) = κ(g1 , g3 )κ(g2 , g3 ) (15.348)
κ(g1 , g2 g3 ) = κ(g1 , g2 )κ(g1 , g3 ) (15.349)
193
Note that in our example of Q, D4 as extensions the cocycle we computed was not symmetric.
– 365 –
All of these properties except, perhaps, the last, are obvious. To prove the bimulti-
plicative properties (it suffices to prove just one) we rewrite (15.348) as
Now multiply the equation by f (g1 , g2 ) and use the fact that A is abelian to write
(f (g1 , g2 )f (g1 g2 , g3 ))f (g3 , g2 )f (g3 , g1 ) = f (g2 , g3 )f (g1 , g3 )(f (g1 , g2 )f (g3 , g1 g2 )) (15.351)
We apply the cocycle identity on both the LHS and the RHS (and also use the fact that
G is abelian) to get
which is the cocycle identity. This proves the bimultiplicative property (15.348). ♠
We now define the Heisenberg extensions. The function κ is said to be nondegenerate
if for all g1 6= 1 there is a g2 with κ(g1 , g2 ) 6= 1. When this is the case the center of G̃ is
precisely A:
Z(G) e ∼ =A. (15.355)
This follows immediately from equation (15.344). If κ is degenerate the center will be
larger. In the extreme case that κ(g1 , g2 ) = 1 for all g1 , g2 we get the direct product
Ge = A × G and
Z(G) e =G e. (15.356)
In general, we will have an intermediate situation and A will be a proper subgroup of Z(G̃).
One definition which is used in the literature is
– 366 –
Exercise Commutator function for Heis(R ⊕ R)
a.) Show that both of the cocycles (15.330) and (15.332) defining groups isomorphic
to Heis(R ⊕ R) have the same commutator function
where
ω(v1 , v2 ) = v1tr Jv2 (15.359)
1. Finitely generated Abelian groups. As we will prove in sections 16.2 and 16.3 below,
these can be (noncanonically) written as products of cyclic groups Zn , for various n,
and a lattice Zd for some d (possibly d = 0).
3. Tori. These are isomorphic to V /Zd where V is a d-dimensional real vector space.
Remark: This class of groups can be characterized as the set of Abelian groups A which
are topological groups so there is an exact sequence:
Theorem Let G be a topological Abelian group of the above class. The isomorphism
classes of central extensions of G by U (1) are in one-one correspondence with continuous
bimultiplicative maps
κ : G × G → U (1) (15.361)
which are alternating (and hence skew).
194
Topological, separable.
– 367 –
I do not know who originally proved this theorem, but one proof can be found in 195
Remarks
1. In other words, given the commutator function κ one can always find a corresponding
cocycle f . This theorem is useful because κ is invariant under change of f by a
coboundary, and moreover the bimultiplicative property is simpler to check than the
cocycle identity. (In fact, one can show that it is always possible to find a cocycle f
which is bimultiplicative. This property automatically ensures the cocycle relation.)
3. In this theorem we can replace U (1) by any subgroup of U (1), such as Zn realized as
the group of nth roots of unity.
1 → U (1) → Heis(S × S)
b → S × Sb → 1 (15.363)
at least up to isomorphism.
Remarks
1. Note that one natural cocycle giving the commutator function (15.362) is
1
f ((s1 , χ1 ), (s2 , χ2 )) := (15.364)
χ1 (s2 )
– 368 –
For our key examples we have
Z
hψ1 , ψ2 i = ψ1∗ (x)ψ2 (x)dx S=R
R
X
= ψ1∗ (n)ψ2 (n) S=Z
n∈Z
1
Z 2π (15.365)
= ψ1∗ (eiθ )ψ2 (eiθ )dθ S = U (1)
2π 0
n−1
1X ∗
= ψ1 (k̄)ψ2 (k̄) S = Z/nZ
n
k=0
Then one checks that this does not define a representation of the direct product S × Sb
but rather we have the operator equation:
Then, Ts in our general discussion is the operator U (s) of the basic motivating exam-
ple, and Mχk is the operator V (k) of the basic motivating example and the general
identity (15.368) becomes our starting point:
– 369 –
4. Stone-von Neumann-Mackey Theorem. Up to isomorphism (equivalence) there is a
unique irreducible unitary representation of Heis(S × S) b such that A = U (1) acts by
scalar multiplication. That is, if ξ ∈ U (1) then we require ρ(ξ) = ξ1V . In addition we
need some further technical hypotheses. 197 This is called the Stone-von Neumann
theorem or sometimes the Stone-von Neumann-Mackey theorem. For a relatively
short proof see. 198 The main idea is to consider the maximal Abelian subgroups
of Heis(S × S).
b One such subgroup is isomorphic to U (1) × S, another is U (1) × S. b
Let us consider U (1) × S.
b Over the subgroup {1} × Sb we can split the sequence and
consider the elements:
Mχ := ρ(1, (0, χ)) (15.372)
where 1 ∈ U (1), 0 ∈ S, and χ ∈ S.
b These operators commute for different choices of χ
and are simultaneously diagonalizable so we have a complete basis ψα of simultaneous
eigenvectors for the representation with eigenvalues: 199
Mχ ψα = λα (χ)ψα (15.373)
Choose some particular α0 and consider the vectors Ts ψα0 for s ∈ S. Using the
Heisenberg relations and the fact that the central U (1) group just acts by scalars we
know that:
Mχ (Ts ψα0 ) = χ(s − sα0 ) (Ts ψα0 ) (15.376)
Therefore the span of {Ts ψα0 }s∈S , which is the span of {Ts+sα0 ψα0 }s∈S is a copy of
the representation constructed above. So if (V, ρ) is irreducible, it must be equiva-
lent to our representation constructed above. This demonstrates uniqueness of the
irreducible representation.
5. Simple Proof Of Irreducibility for S = Rn . For v = (α, β) ∈ R2n we introduced the ♣This remark
assumes at least a
section little bit of
knowledge from the
s(v) := exp[i(αq̂ + β p̂)] (15.377) linear algebra
chapter 2 and the
Now, consider the standard representation of Heis(Rn × R
cn ) on H = L2 (Rn ). For idea of an
irreducible
representation, from
197 chapter 4. ♣
We need the representation to be continuous in the norm topology and we need S to have a translation-
♣The α, β here are
invariant measure so that L2 (A) makes sense. Then we replace V above by the Hilbert space L2 (S). reversed from the
198
A. Prasad, “An easy proof of the Stone-von Neumann-Mackey Theorem,” arXiv:0912.0574. convention in
199 previous section. ♣
It is exactly at this point that we are using unitarity, and a careful discussion requires more functional
analysis.
– 370 –
any two vectors ψ1 , ψ2 ∈ H define the Wigner function:
Now, an elementary computation shows that, on L2 (R2n ) with the standard measure
we have
k W (ψ1 , ψ2 ) k2 =k ψ1 k2 k ψ2 k2 (15.381)
Here are some details ♣Need to clean up
some 2π’s here... ♣
Z
2 2 2n
k W (ψ1 , ψ2 ) k = |W (ψ1 , ψ2 )| d v
2n
ZR
= dαdβdq1 dq2 eiα(q1 −q2 ) ψ1 (q1 + β/2)ψ2∗ (q1 − β/2)ψ1∗ (q2 + β/2)ψ2 (q2 − β/2)
Z
= dqdβψ1∗ (q + β/2)ψ1 (q + β/2)ψ2∗ (q − β/2)ψ2 (q − β/2)
=k ψ1 k2 k ψ2 k2
(15.382)
Now with the key result (15.381) we can show that H is irreducible. 200 Suppose that
H0 ⊂ H is preserved preserved by the Heisenberg group. Suppose there is a vector
ψ⊥ ∈
/ H0 . WLOG we can take ψ⊥ to be perpendicular to H0 (hence the notation).
But then,
W (ψ, ψ⊥ )(v) = hs(v)ψ, ψ⊥ i = 0 (15.383)
for all v ∈ R2n and all ψ ∈ H0 , because s(v)ψ ∈ H0 . But then by (15.381) we know
that k ψ k2 k ψ⊥ k2 = 0. Therefore either ψ = 0 or ψ⊥ = 0. If H0 is not the zero
vector space then we can always choose ψ 6= 0 and hence ψ⊥ = 0. But if H0 were
proper then there would be a nonzero choice for ψ⊥ . Therefore, there is no nonzero
and proper subspace H0 ⊂ H preserved by the group action of Heis. Therefore H is
irreducible.
6. Stone von-Neumann And Fourier Now let us combine the Stone-von-Neumann the-
orem with Pontryagin duality. Because Sb ∼
= S we can write
b
b ∼
Heis(S × S) = Heis(Sb × S)
bb
(15.384)
200
See chapter 4 for a thorough discussion of reducible vs. irreducible representations. Briefly - if a
representation H has a nonzero and proper subspace H0 preserved by the group action then it is said to be
reducible. A representation which is not reducible is said to be irreducible.
– 371 –
so, we could equally well give a unitary representation of the group by taking Vb :=
F un(Sb → C) with inner product
Z
hψ̂1 , ψ̂2 i := dχψ̂1∗ (χ)ψ̂2 (χ) (15.385)
S
b
S : V → Vb (15.388)
mapping
STs0 S −1 = M̂s0
(15.389)
SMχ0 S −1 = T̂χ0
– 372 –
Exercise Irreducibility Of The Stone-von Neumann Representation Of Heis(S × S)
b
n
Generalize the proof of irreducibility for S = R to other locally compact Abelian
groups. 201
Example 1: Suppose R is a commutative ring with identity. Then we can consider the
group of 3 × 3 matrices over R of the form
1ac
M (a, b, c) := 0 1 b (15.391)
001
and
ι : c 7→ M (0, 0, c) (15.394)
we have an extension
0 → R → Heis(R × R) → R ⊕ R → 0 (15.395)
with cocycle f ((a, b), (a0 , b0 )) = ab0 . Note that we are writing our Abelian group R addi-
tively so the cocycle identity becomes
In the literature one will sometimes find the above class of groups defined as the
“Heisenberg groups.” It is a special case of what we have defined as general Heisenberg
groups.
201
Answer For the general case define the Wigner function as the function W (ψ1 , ψ2 ) : S × Sb → C
by W (ψ1 , ψ2 )(s, χ) := hTs Mχ ψ1 , ψ2 i. Show that (15.381) continues to hold. You will need to use the
orthogonality relation for characters.
– 373 –
As a special case of the above construction let us take R = Z/nZ. We will now show
that we recover the group Heis(Zn × Zn ) discussed in section 11.11 in the context of a
particle on a discrete approximation to a circle.
First consider Zn written additively. So if a ∈ Z, then ā ∈ Z/nZ is just ā = a + nZ is
the coset. Then we define
1̄ 1̄ 0 1̄ 0 0 1̄ 0 1̄
U = 0 1̄ 0 V = 0 1̄ 1̄ q = 0 1̄ 0 (15.398)
0 0 1̄ 0 0 1̄ 0 0 1̄
0 → Z → G̃ → R ⊕ R → 0 (15.403)
by the law
The corresponding group cocycle is f ((a, b), (a0 , b0 )) = c(a, b0 ). The cocycle relation is
satisfied simply by virtue of c being bilinear. It will be a Heisenberg extension if κ :
(R × R) × (R × R) → Z given by κ((a, b), (a0 , b0 )) = c(a, b0 ) − c(a0 , b) is nondegenerate. In
particular, if we take Z = R and c(a, b0 ) = ab0 using the ring multiplication then we recover
(15.391).
– 374 –
Such a set of matrices can indeed be constructed, by taking suitable tensor products of Pauli
matrices. They are called “gamma matrices.” They form a matrix representation of what
is called a Clifford algebra and we will study them in more detail and more abstractly in
chapter ****. For the moment the reader should be content with the explicit representation:
202
γ1 = σ 1
(15.406)
γ2 = σ 2
for n = 2,
γ1 = σ 1
γ2 = σ 2 (15.407)
γ3 = σ 3
for n = 3,
γ1 = σ 1 ⊗ σ 1
γ2 = σ 1 ⊗ σ 2
(15.408)
γ3 = σ 1 ⊗ σ 3
γ4 = σ 2 ⊗ 1
for n = 4,
γ1 = σ 1 ⊗ σ 1
γ2 = σ 1 ⊗ σ 2
γ3 = σ 1 ⊗ σ 3 (15.409)
γ4 = σ 2 ⊗ 1
γ5 = σ 3 ⊗ 1
for n = 5,
γ1 = σ 1 ⊗ σ 1 ⊗ σ 1
γ2 = σ 1 ⊗ σ 1 ⊗ σ 2
γ3 = σ 1 ⊗ σ 1 ⊗ σ 3
(15.410)
γ4 = σ 1 ⊗ σ 2 ⊗ 1
γ5 = σ 1 ⊗ σ 3 ⊗ 1
γ6 = σ 2 ⊗ 1 ⊗ 1
for n = 6, and so on. So for the Clifford algebra with n generators we have constructed a
representation by 2[n/2] × 2[n/2] matrices.
Of course, the above choice of matrices is far from a unique choice of matrices satisfying
the Clifford relations (15.405). If the γi ∈ Matd (C) then for any S ∈ GL(d, C) we can
202
See Chapter 2, section 5.3 for a detailed discussion of tensor product ⊗.
– 375 –
change γi → Sγi S −1 . These give equivalent representations of the Clifford algebra. We
could also modify γi → i γi where i ∈ {±1} and still get a representation, although it
might not be an equivalent one.
For example, note that for n = 3
γ1 γ2 γ3 = i12×2 (15.411)
and for n = 5
γ1 · · · γ5 = −14×4 (15.412)
We cannot change the sign on the RHS by conjugating with S. So in this case we conclude
that there are at least two inequivalent representations of the Clifford algebra.
The general story, proved in detail in Chapters 11-12 is that
In Chapter 11 these statements, and more are generalized to real Clifford algebras for
a quadratic form of any signature,
For w ∈ Zn2 (where we will think of Z2 = Z/2Z as a ring in this example) we define
Then
γ(w)γ(w0 ) = (w, w0 )γ(w + w0 ) = κ(w, w0 )γ(w0 )γ(w) (15.414)
where
wi wj0
P
(w, w0 ) = (−1) i>j (15.415)
wi wj0
P
κ(w, w0 ) = (−1) i6=j (15.416)
– 376 –
We conclude that for the even degree Clifford algebras κ is nondegenerate and the
group generated by taking products of the matrices ±γ(w) defined by an irreducible rep-
resentation in fact defines a Heisenberg extension:
1 → Z2 → E2m → Z2m
2 →1 (15.417)
In finite group theory this group is an example of what is known as an “extra-special group”
and is denoted E2m = 21+2m
+ . 203
In the case when n is odd then in fact the Clifford volume form γ1 · · · γn commutes
with all the γi and the cocycle is degenerate.
Ω:Γ×Γ→Z (15.418)
It is shown in the Linear Algebra User’s Manual that we can choose an ordered basis
{γ1 , . . . , γn } for Γ so that the matrix Ω(γi , γj ) is of the form
! !
0 d1 0 d2
⊕ ··· (15.419)
−d1 0 −d2 0
V = Γ ⊗Z R ∼
= R2m
Ω:V ×V →R (15.420)
If all the di are nonzero then this is a symplectic form. Using an invertible matrix S we
can bring S tr ΩS to the standard form J. We can define a commutator function on Rn :
– 377 –
where we make the explicit choice of cocycle f (v1 , v2 ) = eiπΩ(v1 ,v2 ) . So, Heis(V, Ω) is the
group of pairs (z, v) with z ∈ U (1) and v ∈ V with multiplication:
We stress that this gives a Heisenberg group and in particular the sequence does not split.
Now consider the pullback of the sequence under the inclusion ι : Γ → V . We claim
that the pulled-back sequence splits: Let us try to choose a section
In other words, to split the sequence over Γ we need to find a function : Γ → U (1) so
that
γ1 γ2 = e−iπΩ(γ1 ,γ2 ) γ1 +γ2 (15.426)
Explicitly P (T, Ω, ) is the quotient (U (1) × V )/Γ with the equivalence relation
for all γ ∈ Γ.
Note that there is a continuous map
π : P (T, Ω, ) → T (15.429)
whose fiber is U (1). This space, together with its projection map is an example of a
principal U (1) bundle over the torus T : Each fiber is a principal homogeneous space for
the group U (1), under the natural action of U (1) on P (T, Ω, ). (Since the U (1) is central we
can consider it either as a left- or right- action.) Our construction of the bundle depended
on a choice of splitting γ , but a change of splitting defines isomorphic bundles.
Remarks
– 378 –
2. Note that while s(Γ) is a subgroup of Heis(V, Ω) it is not a normal subgroup, so that,
while, P (T, Ω, ) is a bundle, it is not a group.
Now, if one considers parallel transport around a small square loop starting at v0 by
composing paths
℘v, w1 ,w2 := ℘v0 ,w1 ? ℘v0 +w1 ,w2 ∗ ℘v0 +w1 +w2 ,−w1 ∗ ℘v0 +w2 ,−w2 (15.431)
Exercise
We illustrated how Q and D4 are the only two non-Abelian groups that sit in an
extension of Z2 × Z2 by Z2 . Which one is the Heisenberg extension?
1 → Zn → Heis(Zn × Zn ) → Zn × Zn → 1 (15.433)
where we think of Zn as the multiplicative group of nth roots of unity. Let ω = exp[2πi/n].
We distinguish the three Zn factors by writing generators as ω1 , ω2 , ω3 .
a.) Show that one natural choice of cocycle is:
0 0
0
f (ω1s , ω2t ), (ω1s , ω2t ) := ω3st (15.434)
– 379 –
c.) Connect to our general theory of extensions by defining U := (1, (ω1 , 1)), V :=
(1, (1, ω2 )) and computing
or in other words, since the center is generated by q = (ω3 , (1, 1)) we can write:
U V = qV U (15.437)
– 380 –
Exercise Two Dimensions
a.) Suppose T = C/(Z + τ Z) with Imτ > 0. So Γ = Z + τ Z. Choose
1 → Z → G̃ → G → 0 (15.441)
with (15.363) and (15.403) with Z any subgroup of U (1). The difference from the general
case is that in these examples G is explicitly presented as a product of subgroups G = L×L0
where L and L0 are maximal Lagrangian subgroups. A subgroup L ⊂ G is said to be a
Lagrangian subgroup if κ(g1 , g2 ) = 1 for all pairs (g1 , g2 ) ∈ L and similarly for L0 .
When discussing Heisenberg groups of the form Heis(S × S) b the group S × Sb has two
canonical Lagrangian subgroups, namely S and S. b With the choice of κ we made above
these are maximal Lagrangian subgroups.
The case of G = S × Sb should be contrasted with other examples where G is R2n or Z2n 2 .
These groups certainly can be presented as products of maximal Lagrangian subgroups,
but there is no canonical decomposition. Consider, for example the Heisenberg extension
Heis(Z2n2 ) constructed using the gamma-matrices. Note that Z2 = F2 is a field, and we can
consider F2n2 to be a vector space over κ = F2 . We could take L to be any half-dimensional
Lagrangian subspace.
In the general situation, with no canonical choice of L one would often like to construct
an explicit unitary representation of the Heisenberg group. One way to do this is the
following:
Choose a maximal Lagrangian subgroup L ⊂ G. The inverse image L̃ ⊂ G̃ is a maximal
commutative subgroup of G̃.
We now choose a character of L̃ such that ρ(z, x) = zρ(x). Note that such a character ♣This is bad
notation, because
must satisfy ρ(z, x) is later used
for the full SvN
ρ(x)ρ(x0 ) = f (x, x0 )ρ(x + x0 ) ∀x, x0 ∈ L (15.442) representation. The
letter here should
be changed and it
Note that f need not be trivial on L, but it does define an Abelian extension L̃ which must propagates through
the discussion. ♣
therefore be isomorphic to a product L × U (1), albeit noncanonically. Different choices of
ρ are different choices of splitting. Indeed note that (15.442) says that, when restricted to
L × L the cocycle is trivialized by ρ.
– 381 –
The carrier space of our representation will be the space F of functions ψ : G̃ → C
such that
ψ (z, x)(z 0 , x0 ) = ρ(z 0 , x0 )−1 ψ (z, x) ∀(z 0 , x0 ) ∈ L̃
(15.443)
Setting (z 0 , x0 ) = (z −1 , 0) we note that equation (15.443) implies ψ(z, x) = z −1 ψ(1, x), so
defining Ψ(x) := ψ(1, x) we can simplify the description of F by identifying it with the
space of functions Ψ : G → C such that:
f (x, x0 )
Ψ x + x0 = Ψ(x) ∀x0 ∈ L. (15.444)
ρ(x0 )
If our group is continuous or noncompact we should state an L2 condition. We take ρ
to be a unitary character so that |Ψ(x)|2 descends to a function on G/L and we demand:
Z
|Ψ(x)|2 dx < +∞. (15.445)
G/L
The group action is simply left-action of G̃ on the functions ψ(z, x). When written in terms
of Ψ(x) the representation of (x, z) ∈ G̃ is:
(ρ(z, x) · Ψ)(y) = (ρ(z, x) · ψ)(1, y)
= ψ (z, x)−1 · (1, y)
(15.446)
= ψ (z −1 f (x, −x)−1 f (−x, y), y − x)
= zf (x, y − x)Ψ(y − x)
where in the last line we assumed that f is a normalized cocycle so that f (0, y) = 1.
Let us see how we recover the standard Stone-von Neumann representation of Heis(S ×
S) from this viewpoint. Let us choose L = S.
b b Then the equivariance condition (15.444)
becomes
1
Ψ(s, χχ0 ) = Ψ(s, χ) (15.447)
ρ(χ0 )
Now set χ0 = 1/χ and conclude that
1
Ψ(s, χ) = Ψ(s, 1) (15.448)
ρ(χ)
on L2 (S). Working through the above definitions it should not be surprising that one
recovers:
(Ts ψ̃)(s0 ) = ψ̃(s0 − s)
ρ(χ) (15.450)
(Mχ ψ̃)(s0 ) = ψ̃(s0 )
χ(s0 )
– 382 –
0
P
Example. Consider F2m
2 with κ(w, w0 ) = (−1) i6=j wi wj . We can give a Lagrangian de-
composition
F2m ∼
2 =L⊕N (15.451)
in many different ways. For special values of m there are special Lagrangian subspaces
provided by classical error correcting codes. A Heisenberg representation can be given by
taking V to be the space of functions ψ : L → C. This has complex dimension 2m . N can
be identified with the group of characters on L since we can set
The usual translation and multiplication operators T` and Mn generate an algebra iso-
morphic to M atd (C). V is also a representation of the Clifford algebra (and hence the
extra-special group). So the Clifford representation matrices γi can be expressed in terms
of these, and vice versa.
Remarks
1. The representation is, geometrically, just the space of L2 -sections of the associated
line bundle G̃ ×L̃ C defined by ρ. The representation is independent of the choice of
ρ, and any two choices are related by an automorphism of L given by the restriction
of an inner automorphism of G̃.
Exercise
Construct explicit Lagrangian subspaces of F2m 2 for small values of m and write out
the matrices of the Heisenberg representation. 207
– 383 –
In the first line we defined the general notion of pullback κ → α∗ κ and the second line
is the invariance condition. In physics, such symplectic transformations are relevant to
canonical transformations. We can ask whether such automorphisms of “phase space”
actually lift to automorphisms of the full Heisenberg group, and then whether and how
this lifted group acts on the representations of the Heisenberg group. This would be the
“quantum mechanical implementation of symplectic transformations.” In this section we
will investigate those questions from the group-theoretical viewpoint.
e→G
Quite generally, (there is no need for G to be Abelian in this paragraph), if π : G
is a homomorphism and α ∈ Aut(G) is an automorphism of G we say that α lifts to an
automorphism of G e if there is an automorphism α e ∈ Aut(G)
e that completes the diagram:
G
e π /G (15.454)
α α
e
G
e π /G
π(e
α(e
g )) = α(π(e
g )) (15.455)
for all ge ∈ G.
e
1→A→G
e→G→1 (15.456)
where A and G are Abelian and we write the group operations on both A and G additively
so G
e is the group of pairs (a, g) with group multiplication
and f (g1 , g2 ) is a cocycle satisfying the additive version of the cocycle identity:
Now suppose α ∈ Aut(G). If α preserves the commutator function then we can hope
to lift it to an automorphism Tα of G.
e As explained above, “lifting” means that
– 384 –
where ξα is some function ξα : A × G → A. We can write constraints on this function
from the requirement that Tα must be an automorphism with the group law defined by the
cocycle f . In particular Tα must be a group homomorphism:
Now, specialize this equation by putting g1 = 0 and assuming (WLOG) that we have a
normalized cocycle, so that f (g, 0) = f (0, g) = 0. Then equation (15.462) simplifies to
On the other hand, putting g = 0 in (15.463) we now learn that a 7→ ξα (a, 0) is just
an automorphism of A. Composing lifts of α with such automorphisms is an inherent
ambiguity in lifting α. Thus, it is useful to make the simplifying assumption that ξα (a, 0) =
a, since we can always arrange this by composition with an automorphism of A. Therefore
we can write equation (15.464) in the general form
If we restrict to automorphisms of the type (15.465) then one easily checks that Tα is
indeed a group homomorphism iff
where
α∗ f (g1 , g2 ) := f (α(g1 ), α(g2 )) (15.467)
is known as the “pulled-back cocycle.” The conceptual meaning of this equation is the
following: α ∈ Aut(G) is a “symmetry of G.” We are asking how badly the cocycle f
breaks that symmetry. We say that “f is invariant under pullback” if α∗ f = f and in
that case we can take τα = 0 and we can easily lift the group Aut(G) to a group of
automorphisms of G. e The more general condition (15.466) says that the amount by which
f is not symmetric under α, that is α∗ f − f , must be a trivializable cocycle. Put this way,
it is clear that the condition is unchanged under shifting f by a coboundary. Indeed, if we
change f by a coboundary so
– 385 –
so the existence of a solution to (15.466) is gauge invariant. Put more simply, the co-
homology class [f ] ∈ H 2 (G, A) must be invariant under the action of α∗ on H 2 (G, A).
♣The above
paragraph should
Thus, in general we cannot lift all automorphisms of G, only those for which (15.466) be said more
succinctly. ♣
holds. The set of such automorphisms forms a subgroup of Aut(G) that we will denote as
Aut0 (G). Note that, in our additive notation we have
that is,
τα1 ◦α2 = α2∗ τα1 + τα2 (15.472)
then in fact Tα1 ◦ Tα2 = Tα1 ◦α2 generate a subgroup of Aut(G)
e isomorphic to Aut(G). ♣ Relate this to
equation (15.542))
In general, even if we can find a solution to (15.466) the criterion (15.472) will not the condition for a
twisted
hold. Nevertheless, the automorphisms Tα will generate a subgroup of Aut(G). e To see homomorphism
τ : Aut(G) → A. ♣
what subgroup it is we introduce, for ` ∈ Hom(G, A), the automorphism
Therefore, we can write any word in P ’s and T ’s in the form P`0 ◦ Tα0 for some (`0 , α0 ).
Altogether, equations (15.474), (15.475), and (15.476) mean that Tα generate a subgroup
of Aut(G). ^ of Aut(G)
e Including all transformations P` defines a subgroup Aut(G) e which
fits in an exact sequence:
^ → Aut0 (G) → 1
1 → Hom(G, A) → Aut(G) (15.477)
Finally, restoring Hom(G, A) → Hom(G, A) o Aut(A) gives the group of (possible) lifts of
automorphisms of G to automorphisms of G̃.
– 386 –
Example: Heis(R ⊕ R)
Let us consider the basic example from quantum mechanics based on a phase space
R ⊕ R with symplectic form defined by J. In this case there is a very nice way of thinking
of the matrix group Sp(2, R). We note that if
!
a b
A= (15.478)
c d
then
! ! !
a c 0 1 a b
Atr JA =
b d −1 0 c d (15.479)
= (ad − bc)J
Therefore, A ∈ Sp(2, R) iff ad − bc = 1. But this is precisely the condition that defines
SL(2, R). Therefore
SL(2, R) = Sp(2, R) (15.480)
are identical as matrix groups. The same argument applies if we replace R by any ring R.
This kind of isomorphism is definitely not true if we consider higher rank groups SL(n, R)
and Sp(2n, R).
Now, as we have seen, the section s(α, β) = exp[i(αp̂ + β q̂)] leads to the choice of
section, written additively
~
f ((α1 , β1 ), (α2 , β2 )) = (α1 β2 − α2 β1 ) (15.481)
2
This is symplectic invariant, so we can take τα (g) = 0 in the equation (15.466) and we
conclude that the symplectic group acts as a group of automorphisms on the Heisenberg
group Heis(R2n ).
It is interesting however, that the symplectic group only acts projectively on the Stone-
von-Neumann representation of the Heisenberg group. We now explain this point.
As we will discuss in detail in the chapter on Lie groups, SL(2, R) and Sp(2, R) are
examples of Lie groups. It is useful to look at group elements infinitesimally close to the
identity matrix. These can be written as
A = 1 + m + O(2 ) (15.482)
Tr(m) = 0 (15.483)
is required to satisfy the defining conditions to order . (Exercise: Prove this using both
the definition of Sp(2, R) and of SL(2, R).)
The infinitesimal group elements are thus characterized by the vector space sp(2, R) =
sl(2, R), which is the vector space of 2 × 2 real traceless matrices. These form a Lie algebra
with the standard matrix commutator. Generic (but not all) group elements are obtained
– 387 –
by exponentiating such matrices. In particular, in a neighborhood of the identity all group
elements are obtained by exponentiating elements of the Lie algebra.
Recall the standard basis of sl(2, R):
! ! !
01 −1 0 0 0
e= h= f= (15.484)
00 0 1 −1 0
From this one can in principle multiply exponentiated matrices using the BCH formula.
We now consider the quantum implementation of these operators on L2 (R) with ρ(e) =
ê etc. with:
i 2 i i 2
ê := p̂ ĥ := (q̂ p̂ + p̂q̂) fˆ := q̂ (15.486)
2~ 2~ 2~
Now, using the useful identities: 209
This is in fact a maximal compact subgroup of SL(2, R). (See chapter **** on 2x2 matrix
groups.) Note carefully that it has period θ ∼ θ + 2π.
The quantum implementation of e + f is just the standard harmonic oscillator Hamil-
tonian!
i 1
ê + fˆ = (p̂2 + q̂ 2 ) = i(āa + ) (15.491)
2 2
where
1
a = √ (q + ip)
2
(15.492)
1
ā = √ (q − ip)
2
209
These identities are very useful, but a bit hard to remember. If you are on a desert island you can
easily reconstruct them from the special cases:
– 388 –
Now, in the Stone-von-Neumann representation 2i (p̂2 + q̂ 2 ) has the spectrum i(n + 21 ),
n = 0, 1, 2, . . . . Therefore, the one-parameter subgroup exp[θ(ê + fˆ)] has period θ ∼ θ + 4π.
We see that the group generated by ê, fˆ, ĥ is at least a double cover of Sp(2, R). In fact, it
turns out to be exactly a double cover, and it is known as the metaplectic group.
One very interesting aspect of the metaplectic group is that this is a Lie group with no
finite-dimensional faithful representation. We now explain that fact, 210 and a few other
important things in the following remarks:
Remarks
1. Application to the metaplectic group. The Lie algebra of the metaplectic group is
sl(2, R). Any finite dimensional representation of the metaplectic group would give
a finite-dimensional representation of the Lie algebra sl(2, R) and we discussed in
detail what these are above. Note that e + f corresponds to iJ 1 . So: In any finite-
dimensional complex representation of sl(2, R), the operator ρ(e) + ρ(f ) is diagonaliz-
able and all the eigenvalues are of the form i`, where ` is an integer. So exp[θρ(e + f )]
has period θ ∼ θ + 2π. Therefore no finite dimensional representation of M pl can
be faithful. In particular, M pl is an example of a Lie group which is not a matrix
group: It cannot be embedded as a subgroup of GL(N, C) for any N .
2. It is very interesting to consider the action of the one-parameter family exp[θ(ê + fˆ)]
d
in the standard “position space” representation L2 (R) with p̂ = −i dq . Let us compute
the integral kernel:
hx|exp[θ(ê + fˆ)]|yi (15.493)
since Z +∞
ˆ
eθ(ê+f ) ψ (x) = hx|exp[θ(ê + fˆ)]|yiψ(y)dy (15.494)
−∞
d n −x2
x2
Hn (x) = e − e (15.496)
dx
d2
2
− 2 + x ψn = (2n + 1)ψn (15.497)
dx
210
Our demonstration of this surprising fact follows the discussion in G. Segal in Lectures On Lie Groups
and Lie Algebras.
211
What follows here are completely standard facts. We used Wikipedia.
– 389 –
and we have normalized them so that so that
Z +∞
ψn (x)ψm (x)dx = δn,m (15.498)
−∞
Now we have
∞
" 2 #
x−y 2
X
n 1 1−u x+y 1+u
u ψn (x)ψm (x) = p exp − −
2
π(1 − u ) 1+u 2 1−u 2
n=0
(15.499)
To prove this write
d n −x2 d n
Z +∞
x2 x2 1 − 14 s2 +isx
Hn (x) = e − e =e − √ e ds (15.500)
dx dx 4π −∞
Apply this to both ψn (x) and ψn (y) apply the derivatives, exchange integration and
sum and get
∞ 1 2 2 Z
X e 2 (x +y ) − 14 (s2 +t2 )− 12 stu+isx+ity
un ψn (x)ψm (x) = dsdte
n=0
4π 3/2
" #
1−u x+y 2 1+u x−y 2
1
=p exp − −
π(1 − u2 ) 1+u 2 1−u 2
(15.501)
Now we have
∞
X
hx|exp[θ(ê + fˆ)]|yi = eiθ/2 einθ ψn (x)ψn (y) (15.502)
n=0
so we apply the above identity with u = eiθ . We should be careful about convergence:
The Gaussian integral in (15.501) has quadratic form
!
1 1 u
A= (15.503)
4 u 1
The second line follows easily from the first since the parity of ψn (x) as a function of
x is the parity of n.
The quadratic form A has a positive definite real part for |u| ≤ 1 except for u = ±1.
The values for |u| > 1 have to be defined by analytic continuation and there is a
branch point at u = ±1. Note that at θ = π/2 we have
π eiπ/4
hx|exp[ (ê + fˆ)]|yi = √ eixy (15.505)
2 2π
– 390 –
and we recognize the kernel for the Fourier transform:
π ˆ
e 2 (ê+f ) ψ = eiπ/4 F(ψ) (15.506)
where F : L2 (R) → L2 (R) is the Fourier transform. This is the quantum implemen-
tation of the canonical transformation exchanging position and momenta. Note that
θ = π is the square of the Fourier transform (up to a scalar multiplication by i) but
this is not a scalar operator on the space of functions. Indeed at θ = π we have
ˆ
eπ(ê+f ) ψ (x) = iψ(−x) (15.507)
The Fourier transform is of order four not order two. Also note that at θ = 2π the
operator is just multiplication by −1.
3. We note that there are beautiful general formulae for expectation value operators
defined by exponentiating general quadratic forms in the p̂i and q̂ i , or, equivalently
in a’s and a† ’s. This is useful when working with coherent states and squeezed states.
But it is best presented in the Bargmann or geometric quantization formalism. ♣say more? ♣
Exercise
a.) Check that, for 2 × 2 matrices the condition Tr(m) = 0 is identical to the condition
(mJ)tr = mJ.
b.) Show that for any n × n matrix, the infinitesimal version of the condition detA = 1
is that A = 1 + m + O(2 ) with Tr(m) = 0.
c.) Show that for any n×n matrix, the infinitesimal version of the condition Atr JA = J
is that A = 1 + m + O(2 ) with (mJ)tr = mJ.
d.) Show that the conditions Tr(m) = 0 and (mJ)tr = mJ are inequivalent different
for n > 2.
♣Following exercise
belongs in the
Linear Algebra
chapter. ♣
212 P
Answer : Suppose s cs vs = 0 for some coefficients cs . Applying powers of A we determine that
cs λks vs = 0. If all the cs are nonzero then we learn that the matrix the matrix V C must have determinant
P
zero where Vij = λji and C is the diagonal matrix with entries c1 , . . . , cn . If some of the cs are nonzero
then we have a minor of the matrix V times the diagonal matrix of the nonzero cs . In any case, none of
the minors of Vij have zero determinant, provided the λi are distinct. Therefore, V C (or the appropriate
minor) has nonzero determinant and hence no kernel. So {vs } is a linearly independent set.
– 391 –
Exercise su(2) vs. sl(2, R)
A basis for the real Lie algebra of 2 × 2 traceless anti-Hermitian matrices is
i
T i = − σi i = 1, 2, 3 (15.508)
2
with Lie algebra
[T i , T j ] = ijk T k (15.509)
Can one make real linear combinations of T i to produce the generators e, h, f of sl(2, R)
above?
c.) Put a unitary structure on the vector space V so that ρ(T j ) are anti-Hermitian
matrices and relate the above bases to the standard basis |j, mi, m = −N/2, −N/2 +
1, . . . , N/2 − 1, N/2 appearing in quantum-mechanics textbooks. That is, find a rescaling
of the vectors ρ(f )k v0 so that in the new basis, after defining ρ(T i ) using (11.557) one
obtains anti-Hermitian matrices for ρ(T i ).
– 392 –
So there is a symmetry between momentum and position. This is part of a kind of
symplectic symmetry in this discrete system related to SL(2, Zn ). All such matrices arise
from reduction modulo n of matrices in SL(2, Z). Recall from Section [**** 8.3 ****] that
SL(2, Z) is generated by S and T with relations
(ST )3 = S 2 = −1 (15.512)
Therefore, S and T (reduced modulo n) will generate SL(2, Zn ), although there will be
further relations, such as T n = 1. The SL(2, Zn ) symmetry plays and important role in
string theory and Chern-Simons theory and illustrates nicely some ideas of duality.
We now take the cocycle for Heis(Zn × Zn ) to be
Although the cocycle is not invariant under S nevertheless the difference S ∗ f −f can indeed
be trivialized by
τS (a, b) = −ab (15.518)
Now consider the transformation
T : (a, b) → (a + b, b) (15.519)
We compute
(T ∗ f − f )((a1 , b1 ), (a2 , b2 )) = b1 b2 (15.520)
This can be trivialized by
1
τT (a, b) = b2 (15.521)
2
– 393 –
PROVIDED we are able to divide by 2!! This is possible if n is odd, but not when n is
even. Indeed, when n = 2 the cocycle (T ∗ f − f ) is not even a trivializable cocycle! (Why
not? Apply the triviality test described in [ **** Remark 5, section 11.3 ****] above.)
One way to determine the lifted group, and how the group lifts when n is even is the
following. Suppose we have any two operators U, V that satisfy
U V = e2πiθ V U (15.522)
Ue n = e−iπθn(n−1)ac = e−iπ(n−1)ac
(15.526)
Ve n = e−iπθn(n−1)bd = e−iπ(n−1)bc
Now, when n is odd, the conditions (15.526) place no restriction on A. In that case,
the group SL(2, Z) acts on Heis(Zn ⊕ Zn ) as a group of automorphisms, but the normal
subgroup
Γ(n) := {A ∈ SL(2, Z)|A = 1modn} (15.527)
acts trivially so that
SL(2, Z)/Γ(n) ∼
= SL(2, Zn ) (15.528)
indeed acts as a group of automorphisms.
However, when n is even, we must consider the subgroup of SL(2, Z) with the extra
conditions ac = bd = 0mod2. Only this subgroup acts as a group of automorphisms. Again
a finite quotient group acts effectively.
– 394 –
https://arxiv.org/pdf/hep-th/0605200.pdf
IN PARTICULAR EXPLAIN THE RELATION OF Aut0 (G) to the kernel of a homo-
morphism Aut(G) → Hom(G, H 1 (G, A)).
****************************************
****************************************
so upon quantization A2 ∼ 4π δ
k δA1 . The consequence is that Wilson lines along the a- and
b-cycles generate a finite Heisenberg group with q = e2πi/k . The Hilbert space of states is
a finite-dimensional irreducible representation of this group.
************************
EXPLAIN MORE. THETA FUNCTIONS AND METAPLECTIC REPRESENTA-
TION.
***************
Here G can be any group, not necessarily Abelian. We continue to assume that N = A is
Abelian, but now we no longer assume ι(A) is central in G̃. So we allow for the possibility
of non-central extensions by an Abelian group.
Much of our original story goes through, but now the map
ω : G → Aut(A) (15.531)
of our general discussion (defined in equations (15.12) and (15.14)) is canonically defined
and is actually a group homomorphism. As we stressed below (15.14), in general it is not
a group homomorphism. There are two ways to understand that:
– 395 –
1. G̃ acts on A by conjugation on the isomorphic image of A in G̃ which, because the
sequence is exact, is still a normal subgroup. In equations, we can define
But now ω̃g̃ only depends on the equivalence class [g̃] ∈ G̃/ι(A) beccause
2. Or you can just choose a section and define ωg exactly as in (15.14). To stress the
dependence on s we write
However, now if we change section so that 214 ŝ(g) = ι(t(g))s(g) is another section
then we compute
= ι(ωg,s (a))
and since ι is injective ωg,s is independent of section and we can just denote it as ωg .
Note carefully that only in the very last line did we use the assumption that A is
Abelian. We will come back to this when we discuss general extensions in section
15.7.
Moreover, given a choice of section we can define fs (g1 , g2 ) just as we did in equation
(15.90). This definition works for all group extensions:
= ι (ωg1 g2 (a))
and again notice that only in the very last line did we use the hypothesis that A is
Abelian. Again, since ι is injective, we conclude that ωg1 ◦ ωg2 = ωg1 g2 so that the
map ω is a group homomorphism.
214
Note that here the order of the two factors on the RHS matters, since ι(A) is not necessarily central in
G
e
– 396 –
Now, computing s(g1 )s(g2 )s(g3 ) in two ways, just as before, we derive the twisted
cocycle relation:
ωg1 (fs (g2 , g3 ))fs (g1 , g2 g3 ) = fs (g1 , g2 )fs (g1 g2 , g3 ) (15.538)
The reader should check that this really does define a valid group law on the set A × G.
Now suppose that we change section from s to ŝ(g) := ι(t(g))s(g) using some arbitrary
function t : G → A. Then one can compute that the new cocycle is related to the old one
by
fŝ (g1 , g2 ) = t(g1 )ωg1 (t(g2 ))fs (g1 , g2 )t(g1 g2 )−1 (15.540)
Note that since A is Abelian the order of the factors on the RHS do not matter, but in
the analogous formula for general extensions, equation (15.620) below, the order definitely
does matter.
We say two different twisted cocycles are related by a twisted coboundary if they are
related as in (15.540) for some function t : G → A. One can check that if f is a twisted
cocycle and we define f 0 as in (15.540) then f 0 is also a twisted cocycle. We again have
an equivalence relation and we define the twisted cohomology group H 2+ω (G, A) to be the
abelian group of equivalence classes. It is again an Abelian group, as in the untwisted case,
as one shows by a similar argument.
The analog of the main theorem of section 15.3 above is:
Theorem: Let ω : G → Aut(A) be a fixed group homomorphism. Denote the set of
isomorphism classes of extensions of the form
1 → A → G̃ → G → 1 (15.541)
which induce ω by Extω (G, A). Then the set Extω (G, A) is in 1-1 correspondence with the
twisted cohomology group H 2+ω (G, A).
The proof is very similar to the untwisted case and we will skip it. Now the trivial ele-
ment of the Abelian group H 2+ω (G, A) corresponds to the semidirect product determined
by ω.
Now we can observe an interesting phenomenon which happens often in cohomology
theory: Suppose that a twisted cocyle f is trivializable so that [f ] = 0. Then our group
extension is equivalent to a semidirect product. Nevertheless, the sequence (15.530) can
be split in many different ways: There are many distinct trivializations and the different
– 397 –
trivializations have meaning. Equivalently, there are many different coboundary transfor-
mations that preserve the trivial cocycle. A glance at (15.540) reveals that this will happen
when
t(g1 g2 ) = t(g1 )ωg1 (t(g2 )) (15.542)
This is known as a twisted homomorphism. Of course, in the case that ω : G → Aut(A)
takes every g ∈ G to the identity automorphism of Aut(A) (that is, the identity element
of Aut(A), the condition specializes to the definition of a homomorphism.
For the later discussion of group cohomology is useful:
A 1-cochain t ∈ C 1 (G, A) is simply a map t : G → A.
A twisted homomorphism is also known as a twisted one-cocycle. That is, a 1-cocycle
t ∈ Z 1+ω (G, A) with twisting ω is a 1-cochain that satisfies (15.542).
To define group cohomology H 1+ω (G, A) we need an appropriate notion of equivalence
of one-cocycles. This is motivated by noting that if s : G → G̃ is a section that is also a
homomorphism (that is, a splitting) then for any a ∈ A we can produce a new splitting
This corresponds to the change of section s̃(g) = ι(t(g))s(g) where the function t(g) is:
s̃(g) = ι(a)s(g)ι(a)−1
= ι(a) · s(g)ι(a)−1 s(g)−1 · s(g)
(15.545)
= ι(a) · ι(ωg (a−1 )) · s(g)
Theorem: When the sequence (15.530) splits, that is, when the cohomology class of the
twisted cocycle is trivial [f ] = 0, then the inequivalent splittings are in one-one corre-
spondence with the inequivalent trivializations of a trivializable cocycle, and these are in
one-one correspondence with the cohomology group H 1+ω (G, A).
Recall that if v ∈ Rd then ι(v) = Tv is the translation operator on affine space Ad . We have
Tv (p) = p + v. As we saw in (14.74) and (14.75) and the discussion preceding that exercise,
for any p ∈ Ad we have a section R 7→ sp (R) ∈ Euc(d) where sp (R) is the transformation
that takes
sp (R) : p + v 7→ p + Rv (15.547)
– 398 –
In other words, we define rotation-reflections by choosing p as the origin. Then from
we compute that
ωR (v0 ) = Rv0 (15.549)
thus ωR ∈ Aut(Rd ), and indeed R 7→ ωR is a group homomorphism. If we have two
difference sections sp0 and sp then
where
t(R) = (1 − R)(p0 − p) = (1 − R)w (15.551)
where we have put p0 = p + w, w ∈ Rd .
Note that One easily checks that for fixed w ∈ Rd
0 → Zd → G → {1, σ} → 1 (15.553)
for some ~n0 ∈ Zd . Indeed one checks s~n0 (σ)2 = 1. Now for ~a ∈ Rd we have
So not all splittings are equivalent! The equivalent ones have ~n0 − ~n00 ∈ 2Zd . Therefore
H 1+ω (Z2 , Zd ) ∼
= Zd /2Zd ∼
= (Z2 )d (15.556)
Remarks
– 399 –
2. An analogy to bundle theory might help some readers: Let G be a compact Lie group.
Then the isomorphism classes of principal G-bundles over S 3 are in 1-1 correspon-
dence with π2 (G) and a theorem states that π2 (G) = 0 for all compact Lie groups.
Therefore, every principal G-bundle over S 3 is trivializable. Distinct trivializations
differ by maps t : S 3 → G and the set of inequivalent trivializations is classified by
π3 (G), which is, in general nontrivial. This can have physical meaning. For example,
in Yang-Mills theory in 3 + 1 dimensions on S 3 × R the principal G-bundle on space
S 3 is trivializable. But if there is an instanton between two time slices then the
trivialization jumps by an element of π3 (G).
Exercise
Suppose that a twisted cocycle f (g1 , g2 ) can be trivialized by two different functions
t1 , t2 : G → A. Show that t12 (g) := t1 (g)/t2 (g) is a trivialization that preserves the trivial
cocycle. That is, show that t12 is a twisted 1-cocycle.
Remark: In solid state physics when the sequence (15.557) does not split the crystallo-
graphic group G(C) is said to be nonsymmorphic.
{δ|σ} : n 7→ δ − n = −n + δ
(15.558)
: n + δ 7→ δ − (n + δ) = −n
– 400 –
Figure 39: A portion of a crystal in the two-dimensional plane.
0 → L(C) ∼
= Z → G(C) → O(1) ∼
= Z2 → 1 (15.559)
But we can split this sequence by choosing a section s(σ) = {δ| − 1}. Note that
so s : O(1) → G(C) is a homomorphism. Another way of thinking about this is that s(σ) is
just reflection, not around the origin, but around the point 12 δ. So, by a shift of origin for
defining our rotation-inversion group O(1) we just have reflections and integer translations.
In any case we can recognize G(C) as the infinite dihedral group.
– 401 –
which exchanges L with (L+~δ). This group is symmorphic (because {~δ|σ} is an involution).
In fact, this operation is just inversion about the new origin 12 ~δ:
1 1
{~δ|σ} : ~δ + ~y → ~δ − ~y (15.563)
2 2
♣NEED TO HAVE
A FIGURE HERE.
THIS WOULD
HELP. ♣
C = L q (L + ~δ) ⊂ R2 (15.564)
where
L = a1 Z ⊕ a2 Z ⊂ R2 (15.565)
As we have just discussed, for generic a1 , a2 and ~δ the symmetry group will be isomorphic
to the semidirect product Z2 o Z2 .
However, now let 0 < δ < 21 and specialize ~δ to ~δ = (δa1 , 12 a2 ). Then the crystal has
more symmetry and in particular the point group is enhanced from Z2 to Z2 × Z2 :
1 → Z2 → G(C) → Z2 × Z2 → 1 (15.566)
– 402 –
where ~v = (n1 a1 , n2 a2 ) ∈ L where n1 , n2 ∈ Z. Now consider the square:
Since n2 ∈ Z there is no lifting that makes this an involution. Therefore, there is no section.
Therefore the sequence (15.566) does not split.
C = L q (L + ~δ) ⊂ R2 (15.574)
has a point group symmetry D4 . So this becomes a symmorphic crystal. In fact, this is
just a square lattice in disguise! We can take basis vectors δ and R(π/2)δ.
***************************
NEED TO RELATE THE ABOVE FACTS MORE DIRECTLY TO THE PREVIOUS
DISCUSSION OF GROUP COHOMOLOGY. SHOULD DO MORE ON CASE WHERE
THE SEQUENCE SPLITS BUT THERE ARE INEQUIVALENT SPLITTINGS: PROB-
ABLY A good example is Zincblend structure with tetrahedral symmetry. For example
GaAs has this structure. There are two tetrahedra around the Ga and As but they are
rotated.
*****************************
Exercise
Why does the argument of example 3 fail in the special case of example 4? 215
Exercise Honeycomb
Consider a honeycomb crystal in the plane. Discuss the crystal group, the point group,
and decide if it is symmorphic or not.
♣Need to provide
answer in a footnote
♣
15.6.2 Time Reversal
A good example of a physical situation in which it is useful to know about how twisted
cocycles define non-central extensions is when there are anti-unitary symmetries in a quan-
tum mechanical system. A typical example where this happens is when there is a time-
orientation-reversing symmetry. In this case there is a homomorphism
τ : G → {±1} ∼
= Z2 (15.575)
215
Answer : The wrong step is in equation (15.572). When δ takes the special form 12 (a, a) this is not the
most general lifting. One has now translation symmetry by multiples of ~δ, so there is an obvious lifting of
σ1 .
– 403 –
telling us whether the symmetry g ∈ G preserves or reverses the orientation of time.
In quantum mechanics it is often (but not always! - see below) the case that time-
reversal is implemented as an anti-unitary operator (see Chapter 2 below for a precise
definition of this term) and therefore when looking at the way the symmetry is implemented
quantum mechanically we should consider the nontrivial automorphism of U (1) defined by
complex conjugation.
Recall that
Aut(U (1)) ∼
= Out(U (1)) ∼= Z2 (15.576)
where: (
z τ (g) = +1
ω(g)(z) = (15.578)
z −1 τ (g) = −1
To prove this we look at the twisted cocycle identity. Exactly the same arguments as in
Remark 5 of section 15.3 show that we can choose a gauge with f (g, 1) = f (1, g) = 1 for
all g. This leaves only f (T̄ , T̄ ) to be determined. Now, the twisted cocycle relation for the
case g1 = g2 = g3 = T̄ says that
and since f (T̄ , T̄ ) ∈ U (1) this means f (T̄ , T̄ ) ∈ {±1}. We need to check that we can’t
gauge f (T̄ , T̄ ) to one using a twisted coboundary relation. That relation says that we can
gauge f to
f˜(T̄ , T̄ ) = t(T̄ )ωT̄ (t(T̄ ))f (T̄ , T̄ )/t(1) (15.581)
Now t(1) = 1 since we want to preserve the gauge f (g, 1) = f (1, g) = 1 and t(T̄ )ωT̄ (t(T̄ )) =
|t(T̄ )|2 = 1 so f (T̄ , T̄ ) is gauge invariant.
– 404 –
(Note that T̄ is an involution so our old criterion from Remark 5 of section 15.3 would
ask us to find a square root of f (T̄ , T̄ ) = −1. Indeed such a square root exists, it is ±i,
but our old criterion no longer applies because we are in the twisted case.)
So, choosing ω to be the nontrivial homomorphism M2 → Aut(U (1)) there are two
extensions:
1 / U (1) / M ± π̃ / M2 /1 (15.582)
2
– 405 –
So, in this example, T̃ 2 = −1. More generally, in analogous settings for spin j particles
T̃ 2 = (−1)2j . See section 15.6.3 below for an explanation. The fact that T̃ 2 = (−1)2j in
the spin j representation has a very important consequence known as Kramer’s theorem:
In these situations the energy eigenspaces must have even degeneracy. For if Ψ is an energy
eigenstate HΨ = EΨ and we have a time-reversal invariant system then T̃ · Ψ is also an
energy eigenstate. We can prove that it is linearly independent of Ψ as follows: Suppose
to the contrary that
T̃ · Ψ = zΨ (15.591)
for some complex number z. Then act with T̃ again and use the fact that it is anti-unitary
and squares to −1:
−Ψ = z ∗ T̃ · Ψ (15.592)
but this implies that z = −1/z ∗ which implies |z|2 = −1, which is impossible. Therefore,
(15.591) is impossible. Therefore Ψ and T̃ · Ψ are independent energy eigenstates. A slight
generalization of the argument shows that the dimension of the energy eigenspace must
be even. A more conceptual way of understanding this is that the energy eigenspace must
be a quaternionic vector space because we have an anti-linear operator on it that squares
to −1. See the discussion of real, complex, and quaternionic vector spaces in Chapter 2
below.
Example 2: In general a system can have time-orientation reversing symmetries but the
simple transformation t → −t is not a symmetry. Rather, it must be accompanied by
other transformations so that the symmetry group is not of the simple form G = G0 × Z2
where G0 is a group of time-orientation-preserving symmetries. (Such a structure is often
assumed in the literature.) As a simple example consider a crystal
where ~δ is generic so, as we saw above we have a symmorphic crystal with P (C) ∼ = D4 .
π
The action of D4 is just given by rotation around the origin {0|R( 2 )} which we will denote
by R and reflection, say, in the y-axis, which we will denote by P . So R4 = 1, P 2 = 1, and
P RP = R−1 . We have
G(C) = Z2 o D4 (15.594)
But now suppose there is a dipole moment, or spin S. We model this with a set of two
elements S = {S, −S} for dipole moment up and down and now our crystal with spin is a
subset of R2 × S. This subset is of the form
C b+ q C
b=C b− (15.595)
with
b+ = Z2 + (δ1 , δ2 ) × {S} q Z2 + (−δ1 , −δ2 ) × {S}
C (15.596)
but a spin −S on points of the complementary sub-crystal
– 406 –
Figure 40: In this figure the blue crosses represent an atom with a local magnetic moment pointing
up while the red crosses represent an atom with a local magnetic moment pointing down. The
magnetic point group is isomorphic to D4 but the homomorphism τ to Z2 has a kernel Z2 × Z2
(generated by π rotation around a lattice point together with a reflection in a diagonal). Since D4
τ
is nonabelian the sequence 1 → Pb0 → Pb→Z2 → 1 plainly does not split.
Now let Z2 = {1, σ} act on R2 × S by acting trivially on the first factor and σ : S → −S
on the second factor. Now reversal of time orientation exchanges S with −S. So the
\ ⊂ Euc(2) × Z2 known as the
symmetries of the crystal with dipole is a subgroup G(C)
magnetic crystallographic group. The subgroup of translations by the lattice is still a normal
subgroup and the quotient by the lattice of translations is the magnetic point group. In
the present example:
0 → Z2 → G(C)
\→P \(C) → 1 (15.598)
\
The elements in P (C) are
{(1, 1), (R, σ), (R2 , 1), (R3 , σ), (P, σ), (P R, 1), (P R2 , σ), (P R3 , 1)} (15.599)
This magnetic point group is isomorphic to D4 but the time reversal homomorphism takes
τ (R, σ) = −1 and τ (P, σ) = −1 so that we have
\ τ
1 → Z2 × Z2 → P (C) → Z2 → 1 (15.600)
The induced automorphism on Z2 × Z2 is trivial so clearly this sequence does not split,
\
since P (C) ∼
= D4 is nonabelian.
Remarks:
– 407 –
1. With the possible exception of exotic situations in which quantum gravity is impor-
tant, physics takes place in space and time. Except in unusual situations associated
with nontrivial gravitational fields we can assume our spacetime is time-orientable.
Then, any physical symmetry group G must be equipped with a homomorphism
τ : G → Z2 (15.601)
Let us - for simplicity - make the assumption that our physical system has time-
translation invariance so that U (t1 , t2 ) = U (t1 − t2 ) is a strongly continuous group of
unitary transformations. 217
By Stone’s theorem, U (t) has a self-adjoint generator H, the Hamiltonian, so that ♣There is an
obvious
we may write generalization of
this statement for
it U (t1 , t2 ). Is it
U (t) = exp − H (15.603) proved rigorously
~ somewhere? ♣
– 408 –
Now, substituting (15.603) and paying proper attention to φ we learn that the con-
dition for a symmetry of the dynamics (15.604) is equivalent to
in other words,
ρ(g)Hρ(g)−1 = φ(g)τ (g)H (15.606)
Thus, the answer to our question is that φ and τ are unrelated in general. We should
therefore define a third homomorphism χ : G → Z2
Note that
φ·τ ·χ=1 (15.608)
implies that if any group element has χ(g) = −1 then the spectrum of H must be
symmetric around zero. In particular, if the spectrum is bounded below but not
above this condition must fail. In many problems, e.g. in the standard Schrödinger
problem with potentials which are bounded below, or in relativistic QFT with H
bounded below we must have χ(g) = 1 for all g and hence φ(g) = τ (g), which is
what one reads in virtually every physics textbook: “A symmetry is anti-unitary iff
it reverses the orientation of time.” Not true, in general.
3. However, there are physical examples where χ(g) can be non-trivial, that is, there
can be symmetries which are both anti-unitary and time-orientation preserving. An
example are the so-called “particle-hole” symmetries in free fermion systems.
V (j1 ) ⊗ V (j2 ) ∼
= V (|j1 − j2 |) ⊕ V (|j1 − j2 | + 1) ⊕ · · · · · · ⊕ V (j1 + j2 ) (15.610)
Note that every representation on the RHS has the same parity of (−1)2j . Also note
the triangular structure of the Clebsch-Gordon decomposition of V ( 21 )⊗n allowing for an
– 409 –
inductive proof. Finally T̃ 2 on V ( 12 )⊗n is just (−1)n , so it is (−1)n on the highest summand
V (n/2).
Let us give a proof of (15.610). We need some general facts about representation
theory. See Chapter 4 for full explanations:
1. ωs : Q → Aut(N )
2. fs : Q × Q → N
Recall that for a ∈ N , I(a) ⊂ Aut(N ) denotes the inner automorphism given by conjugation
by a. The proof of (15.613) follows exactly the same steps as (15.537), except for the very
last line.
Moreover, using (15.612) to relate s(q1 )s(q2 )s(q3 ) to s(q1 q2 q3 ) in two ways gives a
twisted cocycle relation:
Note this is the same as (15.538), but unlike that equation now order of the terms is very
important since we no longer assume that N is abelian.
To summarize: Given a general extension (15.1) there exist maps (ωs , fs ), associated
with any section s and defined by (15.611) and (15.612). The maps (ωs , fs ) automatically
satisfy the identities (15.613) and (15.614).
We now consider, more generally, functions satisfying identities (15.613) and (15.614).
That is, we assume we are given two maps (not necessarily derived from some section):
– 410 –
1. A map f : Q × Q → N
2. A map ω : Q → Aut(N )
This is very similar to (15.539) but we stress that since N might be nonabelian, the order
of the factors in the first entry on the RHS matters!
With a few lines of algebra, using the identities (15.615) and (15.616) one can check
the associativity law and the other group axioms. We have already seen this simultaneous
generalization of the semidirect product (14.2) and the twisted product of a central exten-
sion (15.102) in our discussion of the case where N = A is abelian. (See equation (15.539)
above.) The new thing we have now learned is that this is the most general way of putting
a group structure on a product N × Q so that the result fits in an extension of Q by N .
Now, suppose again that we are given a group extension. As we showed, a choice of
section s gives us a pair of functions (ωs , fs ) satisfying (15.615) and (15.616). Any other
section s̃ is related to s by a function t : Q → N . Indeed that function t is defined by:
fs̃ (q1 , q2 ) = t(q1 )ωs,q1 (t(q2 ))fs (q1 , q2 )t(q1 q2 )−1 (15.620)
The proof of (15.619) follows exactly the same steps as (15.535). To prove (15.620) we
patiently combine the definition (15.618) with the definition (15.612). ♣We also skipped
this proof for
These formulae for how (ωs , fs ) change as we change the section now motivate the N = A abelian.
Probably should
following: show the steps. ♣
Suppose we are given a pair (ω, f ) satisfying (15.615) and (15.616) and an arbitrary
function t : Q → N . We can now define a new pair (ω 0 , f 0 ) by the equations:
– 411 –
The relations (15.621) and (15.622) define an equivalence relation on the set of pairs
(ω, f ) satisfying (15.615) and (15.616). Moreover, if (ω, f ) and (ω 0 , f 0 ) are related by
(15.621) and (15.622) then we can define a group structure on the set N × Q in two ways
using the equation (15.617) for each pair. Nevertheless, there is a morphism between these
two extensions in the sense of (15.4) above where we define
Then note that ϕ−1 (n, q) = (nt(q), q) is an inverse morphism of extensions, and hence we
have an isomorphism of extensions.
Now we would like to state all this a little more conceptually. The first point to
note is that a map q 7→ ωq ∈ Aut(N ) that satisfies (15.615) in fact canonically defines a
homomorphism ω̄ : Q → Out(N ) of Q into the group of outer automorphisms of N . This
homomorphism is defined more conceptually as the unique map that makes the diagram:
1 /N ι /G π /Q /1 (15.625)
I ψ ω
1 / Inn(N ) / Aut(N ) / Out(N ) /1
Here I : N → Inn(N ) is the map that takes n to the inner automorphism I(n) : n0 7→
nn0 n−1 and ψ is the map from G → Aut(N ) defined by
– 412 –
for some function t : Q → N . Note, please, that while this equation is formally very similar
to (15.619) it is conceptually different. Nothing has been said about the relation of the two
extensions, other than that they induce the same ω̄.
Now we try to relate the corresponding functions f (1) (q1 , q2 ) and f (2) (q1 , q2 ). To do
that we compute
−1
ωq(1)
1
◦ ω (1)
q2 (n) = t(q1 )ω (2)
q1 t(q 2 )ω (2)
q2 (n)t(q 2 ) t(q1 )−1
= t(q1 )ωq(2)
1
(t(q 2 )) ω (2)
q 1
◦ ω (2)
q2
(n) ωq(2)
2
(t(q2 )−1 )t(q1 )−1
−1
= t(q1 )ωq(2)
1
(t(q 2 )) f (2)
(q 1 , q 2 )ω (2)
q q
1 2
(n)f (2)
(q 1 , q2 ) ωq(2)
2
(t(q2 )−1 )t(q1 )−1
−1
(2) (2) −1 (1) (2) (2) −1
= t(q1 )ωq1 (t(q2 ))f (q1 , q2 )t(q1 q2 ) · ωq1 q2 (n) t(q1 )ωq1 (t(q2 ))f (q1 , q2 )t(q1 q2 )
where we define
fˆ(2) (q1 , q2 ) := t(q1 )ωq(2)
1
(t(q2 ))f (2) (q1 , q2 )t(q1 q2 )−1 (15.629)
On the other hand, we know that
ωq(1)
1
◦ ωq(1)
2
(n) = f (1) (q1 , q2 )ωq(1)
1 q2
(n)f (1) (q1 , q2 )−1 (15.630)
(1)
Can we conclude that fˆ(2) (q1 , q2 ) = f (1) (q1 , q2 ) ? Certainly not! Provided ωq1 q2 (n) is
sufficiently generic all we can conclude is that
for some function ζ : Q × Q → Z(N ). These two functions are not necessarily related by a
coboundary and the extensions are not necessarily equivalent!
What is true is that if fˆ(2) and f (1) satisfy the twisted cocycle relation then ζ(q1 , q2 ) in
(15.631) also satisfies the twisted cocycle relation. (This requires a lot of patient algebra....)
It follows that
ζ ∈ Z 2+ω̄ (Q, Z(N )) (15.632)
Moreover, going the other way, given one extension and corresponding (ω (1) , f (1) ), and
a ζ ∈ Z 2+ω̄ (Q, Z(N )) we can change f as in (15.631). If [z] ∈ H 2+ω̄ (Q, Z(N )) is nontrivial
we will in general get a new, nonequivalent extension.
All this is summarized by the theorem:
Theorem: Let Extω̄ (Q, N ) be the set of inequivalent extensions of Q by N inducing ω̄.
Then either this set it is empty or it is a torsor 219 for H 2+ω̄ (Q, Z(N )).
************************
219
A torsor X for a group G is a set X with a G-action on it so that given any pair x, x0 ∈ X there is a
unique g ∈ G that maps x to x0 . In this chapter we have discussed an important example of a torsor quite
extensively: Affine space Ad is a torsor for Rd with the natural action of Rd on Ad by translation.
– 413 –
NEED SOME EXAMPLES HERE. AND NEED SOME MORE INTERESTING EX-
ERCISES.
*************************
Exercise
a.) Check that (15.623) really does define a homomorphism of the group laws (15.617)
defined by (ω, f ) and (ω 0 , f 0 ) if (ω 0 , f 0 ) is related to (ω, f ) by (15.621) and (15.622).
b.) Check that the diagram (15.4) really does commute if we use (15.623).
Motivations:
a.) The word “cohomology” suggests some underlying chain complexes, so we will
show that there is such a formulation.
b.) There has been some discussion of higher degree group cohomology in physics in
1. The theory of anomalies (Faddeev-Shatashvili; Segal; Carey et. al.; Mathai et. al.;
... )
4. Condensed matter/topological phases of matter (Kitaev; Wen et. al.; Kapustin et.
al.; Freed-Hopkins;....)
– 414 –
15.8.1 Definition
Suppose we are given any group G and an Abelian group A (written additively in this
sub-section) and a homomorphism
ω : G → Aut(A) (15.633)
(dφ3 )(g1 , g2 , g3 , g4 ) = ωg1 (φ3 (g2 , g3 , g4 ))−φ3 (g1 g2 , g3 , g4 )+φ3 (g1 , g2 g3 , g4 )−φ3 (g1 , g2 , g3 g4 )+φ3 (g1 , g2 , g3 )
(15.638)
Next, one can check that for any φ, we have the absolutely essential equation:
d(dφ) = 0 (15.639)
We will give a simple proof of (15.639) below but let us just look at how it works for
the lowest degrees: If φ0 = a ∈ A is a 0-cochain then
– 415 –
if φ1 is any 1-cochain then we compute:
(d2 φ1 )(g1 , g2 , g3 ) = ωg1 (dφ1 (g2 , g3 )) − (dφ1 )(g1 g2 , g3 ) + (dφ1 )(g1 , g2 g3 ) − (dφ1 )(g1 , g2 )
= ωg1 (ωg2 (φ1 (g3 )) − φ1 (g2 g3 ) + φ1 (g2 ))
− (ωg1 g2 (φ1 (g3 )) − φ1 (g1 g2 g3 ) + φ1 (g1 g2 ))
+ (ωg1 (φ1 (g2 g3 ) − φ1 (g1 g2 g3 ) + φ1 (g1 ))
− (ωg1 (φ1 (g2 )) − φ1 (g1 g2 ) + φ1 (g1 ))
=0
(15.641)
where you can check that all terms cancel in pairs, once you use ωg1 ◦ ωg2 = ωg1 g2 .
The set of (ω-twisted) n-cocycles is defined to be the subgroup Z n+ω (G, A) ⊂ C n (G, A)
of cochains that satisfy dφn = 0.
Thanks to (15.639) we can define a subgroup B n+ω (G, A) ⊂ Z n+ω (G, A), called the
subgroup of coboundaries:
(dφ0 )(1) = 0
(
0 ω=T (15.645)
(dφ0 )(σ) = ωσ (n0 ) − n0 =
−2n0 ω=I
– 416 –
Now the cocycle condition implies φ1 (1) = 0, making the first three lines of (15.646) vanish.
Using this the fourth line becomes:
(
2φ1 (σ) ω = T
(dφ1 )(σ, σ) = ωσ (φ1 (σ)) + φ1 (σ) = (15.647)
0 ω=I
Now, when is φ1 a cocycle? When ω = T is trivial then we must take φ1 (σ) = 0 and
hence φ1 = 0 moreover, there are no coboundaries. We find H 1+T (Z2 , Z) = 0 in this case,
reproducing the simple fact that there are no nontrivial group homomorphisms from Z2 to
Z.
On the other hand, when ω = I we can take φ1 (σ) = a to be any integer a ∈ Z.
The group of twisted cocycles is isomorphic to Z. However, now there are nontrivial
coboundaries, as we see from (15.645). We can shift a by any even integer a → a − 2n0 .
So
H 1+I (Z2 , Z) ∼
= Z2 (15.648)
In addition to the interpretation in terms of splittings, this has a nice interpretation in
topology in terms of the unorientability of even-dimensional real projective spaces.
Remarks:
1. Previously we were denoting the cohomology groups by H n+ω (G, A). In the equations
above the ω is still present, (see the first term in the definition of dφ) but we leave
the ω implicit in the notation. Nevertheless, we are talking about the same groups
as before, but now generalizing to arbitrary degree n.
2. Remembering that we are now writing our abelian group A additively, we see that
the equation (dφ2 ) = 0 is just the twisted 2-cocycle conditions, and φ02 = φ2 + dφ1
are two different twisted cocycles related by a coboundary. See equations (15.538)
and (15.540) above. Roughly speaking, you should “take the logarithm” of these
equations.
3. Homological Algebra: What we are discussing here is a special case of a topic known
as homological algebra. Quite generally, a chain complex is a sequence of Abelian
groups {Cn }n∈Z equipped with group homomorphisms
∂n : Cn → Cn−1 (15.649)
– 417 –
4. Homogeneous cocycles: A nice way to prove that d2 = 0 is the following. We define
homogeneous n-cochains to be maps ϕ : Gn+1 → A which satisfy
Let C n (G, A) denote the abelian group of such homogeneous group cochains. (Warn-
ing! Elements of C n (G, A) have (n + 1) arguments!) Define
by
n+1
X
δϕ(g0 , . . . , gn+1 ) := (−1)i ϕ(g0 , . . . , gbi , . . . , gn+1 ) (15.654)
i=0
=0
That is, when φn and ϕn are related this way we say φn = ψ(ϕn ). Now one can check
that the simple formula (15.654) becomes the more complicated formula (15.634).
Put more formally: there is a unique d so that dψ = ψδ, or even more formally, there
is a unique group homomorphism d such that we have a commutative diagram:
C n (G, A)
δ / C n+1 (G, A) (15.657)
ψ ψ
C n (G, A)
d / C n+1 (G, A)
For example, if
φ1 (g) = ψ(ϕ1 )(g) = ϕ1 (1, g) (15.658)
– 418 –
then we can check that
5. Where do all these crazy formulae come from? The answer is in topology. We will
indicate it briefly in our discussion of categories and groupoids below.
6. The reader will probably find these formulae a bit opaque. It is therefore good to
stop and think about what the cohomology is measuring, at least in low degrees.
Exercise
Derive the formula for the differential on an inhomogeneous cochain dφ2 starting with
the definition on the analogous homogeneous cochain ϕ3
Exercise
If (Cn , ∂n ) is a chain complex show that one can define a cochain complex with groups:
C n := Hom(Cn , Z) (15.660)
ωg (a) = a (15.661)
There are no coboundaries to worry about, so H 0 (G, A) is just the set of fixed points of
the G action on A.
– 419 –
15.8.4 Interpreting the meaning of H 2+ω
Again, we have interpreted H 2+ω (G, A) as Extω (G, A), the set of equivalence classes of
extensions
0 → A → G̃ → G → 1 (15.663)
inducing a fixed ω : G → Aut(A). The trivial element of the cohomology group corre-
sponds to the semi-direct product and the set of inequivalent trivializations is the group
H 1+ω (G, A) of splittings of the semi-direct product.
More generally, Extω̄ (Q, N ) is a torsor for H 2+ω̄ (Q, Z(N )).
ω̄ : Q → Out(N ) (15.664)
then q 7→ ξq will not be a group homomorphism. But we do know that for all q1 , q2 ∈ Q
i.e.
ξq1 ◦ ξq2 = I(f (q1 , q2 )) ◦ ξq1 q2 (15.668)
Of course, the choice of f (q1 , q2 ) is ambiguous by an element of Z(N )!
Equation (15.668) is of course just (15.615) written in slightly different notation.
Therefore, as we saw in §15.7, if f (q1 , q2 ) were to satisfy the the “twisted cocycle con-
dition” (15.616) then we could use (15.617) to define an extension inducing ω̄.
Therefore, let us check if some choice of f (q1 , q2 ) actually does satisfy the twisted
cocycle condition (15.616). Looking at the RHS of (15.616) we compute:
– 420 –
On the other hand, looking at the LHS of (15.616) we compute:
We cannot conclude that f satisfies the twisted cocycle equation from this identity because
inner transformations are trivial for elements in the center Z(N ). Rather, what we can
conclude is that for every q1 , q2 , q3 there is an element z(q1 , q2 , q3 ) ∈ Z(N ) such that
Examples: As an example 221 where a degree three cohomology class obstructs the exis-
tence of an extension inducing a homomorphism ω̄ : Q → Out(N ) we can take N to be the
generalized quaternion group of order 16. It is generated by x and y satisfying:
Using these relations every word in x±1 and y ±1 can be reduced to either xm , or yxm , with
m = 0, . . . , 7, and these words are all different. One can show the outer automorphism
group Out(N ) ∼ = Z2 × Z2 with generators α, β acting by
1 → N → G → Z2 → 1 (15.675)
221
I learned these nice examples from Clay Cordova. They will appear in a forthcoming paper with
Po-Shen Hsin and Francesco Benini.
– 421 –
inducing the homomorphism ω̄ : Z2 → Out(N ) defined by ω̄(σ) = α ◦ β where σ is the
nontrivial element of Z2 . One way to prove this is to look up the list of groups of order 32
and search for those with maximal normal subgroup given by N . 222 There are five such.
Then one computes ω̄ for each such extension and finds that it is never of the above type.
A similar example can be constructed by taking N to be a dihedral group of order 16. ♣Really should
explain four-term
Remark There is an interpretation of H 3 (Q, Z(N )) as a classification of four-term sequences and
crossed modules
exact sequences, and there are generalizations of this to higher degree. See here.... ♣
222
See, for example, B. Shuster, “Morava K-theory of groups of order 32,” Algebr. Geom. Topol. 11
(2011) 503-521.
– 422 –
But this is too rough, it does not give us a good feeling for what the examples really
are.
Once we have a “good” criterion we often can make a nontrivial statement about the
general structure of objects in a given class. Ideally, we should be able to construct all
the examples algorithmically, and be able to distinguish the ones which are not isomor-
phic. Of course, finding such a “good” criterion is an art. For example, classification of
infinite nonabelian groups is completely out of the question. But in Chapter *** we will
see that an important class of infinite nonabelian groups, the simply connected compact
simple Lie groups, have a very beautiful classification: There are four infinite sequences
of classical matrix groups: SU (n), Spin(n), U Sp(2n) and then five exceptional cases with
names G2 , F4 , E6 , E7 , E8 . 223
One might well ask: Can we classify finite groups? In this section we survey a little of
what is known about this problem.
and has order 2n. There is a special isomorphism D3 ∼ = S3 with the symmetric group on
three letters.
The next nontrivial case is |G| = 8. Here we can invoke Sylow’s theorem: If pk ||G|
then G has a subgroup of order pk . Let us apply this to 4 dividing |G|. Such a subgroup
has index two and hence must be a normal subgroup, and hence fits in a sequence
1 → N → G → Z2 → 1 (16.2)
1 → Z4 → G → Z2 → 1 (16.3)
223
Spin(n) double covers the classical matrix group SO(n).
– 423 –
then we have α : Z2 → Aut(Z4 ) = ∼ Z2 and there are exactly two such homomorphisms.
Moreover, for a fixed α there are two possibilities for the square σ̃ 2 ∈ Z4 where σ̃ is a lift
of the generator of Z2 . Altogether this gives four possibilities: ♣Need to explain
more here. ♣
1 → Z4 → Z2 × Z4 → Z2 → 1 (16.4)
1 → Z4 → Z8 → Z2 → 1 (16.5)
1 → Z4 → D4 → Z2 → 1 (16.6)
1 → Z4 → D
e 2 → Z2 → 1 (16.7)
Here we meet the first of the series of dicyclic or binary dihedral groups defined by
224
See, however, M. Wild, “Groups of order 16 made easy,” American Mathematical Monthly, Jan 2005
– 424 –
Order Presentation name
1 ha|a = 1i Trivial group
2 ha|a2 = 1i Cyclic Z/2Z
3 ha|a3 = 1i Cyclic Z/3Z
4 ha|a4 = 1i Cyclic Z/4Z
4 ha, b|a2 = b2 = (ab)2 = 1i Dihedral D2 ∼= Z/2Z × Z/2Z, Klein
5 ha|a5 = 1i Cyclic Z/5Z
6 ha, b|a = 1, b2 = 1, bab = ai
3 Cyclic Z/6Z ∼= Z/2Z × Z/3Z
6 ha, b|a3 = 1, b2 = 1, bab = a−1 i Dihedral D3 ∼ = S3
7 ha|a7 = 1i Cyclic Z/7Z
8 ha|a8 = 1i Cyclic Z/8Z
8 ha, b|a = 1, b4 = 1, aba = bi
2 Z/2Z × Z/4Z
8 ha, b, c|a2 = b2 = c2 = 1, [a, b] = [a, c] = [b, c] = 1i Z/2Z × Z/2Z × Z/2Z
8 ha, b|a4 = 1, b2 = 1, bab = a−1 i Dihedral D4
8 ha, b|a4 = 1, a2 = b2 , b−1 ab = a−1 i f2 ∼
Dicyclic D = Q, quaternion
9 ha|a9 = 1i Cyclic Z/9Z
9 ha, b|a3 = b3 = 1, [a, b] = 1i Z/3Z × Z/3Z
10 ha|a10 = 1i Cyclic Z/10Z ∼ = Z/2Z × Z/5Z
10 ha|a5 = b2 = 1, bab = a−1 i Dihedral D5
11 ha|a11 = 1i Cyclic Z/11Z
12 ha|a12 = 1i Cyclic Z/12Z ∼ = Z/4Z × Z/3Z
12 ha, b|a2 = 1, b6 = 1, [a, b] = 1i Z/2Z × Z/6Z
12 ha, b|a6 = 1, b2 = 1, bab = a−1 i Dihedral D6
12 ha, b|a6 = 1, a3 = b2 , b−1 ab = a−1 i Dicyclic D
f3
12 ha, b|a3 = 1, b2 = 1, (ab)3 = 1i Alternating A4
13 ha|a13 = 1i Cyclic Z/13Z
14 ha|a14 = 1i Cyclic Z/14Z ∼= Z/2Z × Z/7Z
14 ha, b|a7 = 1, b2 = 1, bab = a−1 i Dihedral D7
15 ha|a15 = 1i Cyclic Z/15Z ∼= Z/3Z × Z/5Z
Remarks:
1. J.A. Gallan, “The search for finite simple groups,” Mathematics Magazine, vol.
49 (1976) p. 149. (This paper is a bit dated.)
– 425 –
Figure 41: A plot of the number of nonisomorphic groups of order n. This plot was taken from
the book by D. Joyner, Adventures in Group Theory.
2. There are also nice tables of groups of low order, in Joyner, Adventures in Group
Theory, pp. 168-172, and Karpilovsky, The Schur Multiplier which go beyond the
above table.
4. The number of isomorphism types of groups jumps wildly. Apparently, there are
49, 487, 365, 422 isomorphism types of groups of order 210 = 1024. (Besche et. al.
loc. cit.) The remarkable plot of Figure 41 from Joyner’s book shows a plot of the
number of isomorphism classes vs. order up to order 100. Figure 42 shows a log plot
of the number of groups up to order 2000.
5. There is, however, a formula giving the asymptotics of the number f (n, p) of isomor-
phism classes of groups of order pn for n → ∞ for a fixed prime p. (Of course, there
are p(n) Abelian groups, where p(n) the the number of partitions of n. Here we are
– 426 –
Figure 42: A logarithmic plot of the number of nonisomorphic groups of order n out to n ≤ 2000.
This plot was taken from online encyclopedia of integer sequences, OEIS.
talking about the number of all groups.) This is due to G. Higman 225 and C. Sims
226 and the result states that:
2 3
f (n, p) ∼ p 27 n (16.10)
1/2
Note that the asymptotics we derived for p(n) before had a growth like econst.n so,
unsurprisingly, most of the groups are nonabelian.
225
G. Higman, “Enumerating p-Groups,” Proc. London Math. Soc. 3) 10 (1960)
226
C. Sims, “Enumerating p-Groups,” Proc. London Math. Soc. (3) IS (1965) 151-66
– 427 –
Show that D
e n is a double-cover of Dn which fits into the exact sequence:
Z2 Z2 (16.11)
1 / Z2n /D
en / Z2 /1
1 / Zn / Dn / Z2 /1
Z/pZ ⊕ Z/qZ ∼
= Z/pqZ. (16.13)
then
Z/nZ ∼ e
= ⊕i Z/pi i Z (16.15)
Kronecker Structure Theorem. Any finite abelian group is a direct product of cyclic
groups of order a prime power. That is, we firstly have the decomposition:
G = G2 ⊕ G3 ⊕ G 5 ⊕ G 7 ⊕ · · ·
(16.16)
= ⊕p prime Gp
where Gp has order pn for some n ≥ 0 (n can depend on p, and for all but finitely many p,
Gp = {0}.) And, secondly, each nonzero factor Gp can be written:
Gp = ⊕i Z/(pni Z) (16.17)
– 428 –
Proof : The proof proceeds in two parts. The first, easy, part shows that we can split G
into a direct sum of “p-groups” (defined below). The second, harder, part shows that an
arbitrary abelian p-group is a direct sum of cyclic groups.
For part 1 of the proof let us consider an arbitrary finite abelian group G. We will
write the group multiplication additively. Suppose n is an integer so that ng = 0 for all
g ∈ G. To fix ideas let us take n = |G|. Suppose n = m1 m2 where m1 , m2 are relatively
prime integers. Then there are integers s1 , s2 so that
s1 m1 + s2 m2 = 1 (16.18)
G = m1 G ⊕ m2 G (16.20)
– 429 –
that H ⊂ hg0 i. Since G is a p-group the subgroup hg0 i is a p-group and hence contains a
subgroup of order p (by Cauchy) but (by hypothesis) there is a unique such subgroup in
G and any subgroup of hg0 i is a subgroup of G, so H ⊂ hg0 i. But now take any element
g ∈ G. On the one hand it must project to an element [g] ∈ G/H. Thus must be of the
form [g] = kg0 + H, since g0 + H generates G/H. That means g = kg0 + h, h ∈ H, but
since H ⊂ hg0 i we must have h = `g0 for some integer `. Therefore G = hg0 i is cyclic.
The final step proceeds by showing that if G is a finite abelian p-group and M is a
cyclic subgroup of maximal order then G = M ⊕ N for some subgroup N . Once we have
established this the desired result follows by induction.
So, now suppose that that G has a cyclic subgroup of maximal order M . If G is cyclic
then N = {0}. If G is not cyclic then we just proved that there must be at least two
distinct subgroups of order p. One of them is in M . Choose another one, say K. Note that
K must not be in M , because M is cyclic and has a unique subgroup of order p. Therefore
K ∩ M = {0}. Therefore (M + K)/K ∼ = M . Therefore (M + K)/K is a cyclic subgroup
of G/K. Any element g + K has an order which divides |g|, and |g| ≤ |M | since M is a
maximal cyclic subgroup. Therefore the cyclic subgroup (M + K)/K is a maximal order
cyclic subgroup of G/K. Now the inductive hypothesis implies G/K = (M + K)/K ⊕ H/K
for some subgroup K ⊂ H ⊂ G. But this means (M +K)∩H = K and hence M ∩H = {0}
and hence G = M ⊕ H. ♠
For other proofs see
1. S. Lang, Algebra, ch. 1, sec. 10.
2. I.N. Herstein, Ch. 2, sec. 14.
3. J. Stillwell, Classical Topology and Combinatorial Group Theory.
4. Our proof is based on G. Navarro, “On the fundamental theorem of finite abelian
groups,” Amer. Math. Monthly, Feb. 2003, vol. 110, p. 153.
One class of examples where we have a finite Abelian group, but it’s Kronecker de-
composition is far from obvious is the following: Consider the Abelian group Zd . Choose
a set of d vectors vi ∈ Zd , linearly independent as vectors in Rd .
d
X
L := { ni vi |ni ∈ Z} (16.23)
i=1
is a subgroup. Then
A = Zd /L (16.24)
is a finite Abelian group. For example if vi = kei where ei is the standard unit vector
in the ith direction then obviously A ∼ = (Z/kZ)d . But for a general set of vectors the
decomposition is not obvious.
So, here is an algorithm for giving the Kronecker decomposition of a finite Abelian
group:
– 430 –
2. You need only consider the elements whose order is a prime power. (By the Bezout
manipulation all the others will be sums of these.)
3. Focusing on one prime at a time. Take the element g1 whose order is maximal. Then
Gp = hg1 i ⊕ N . ♣Have to say how
to get N . ♣
4. Repeat for N .
Exercise
Show that an alternative of the structure theorem is the statement than any finite
abelian group is isomorphic to
Exercise p-groups
a.) Show that Z4 is not isomorphic to Z2 ⊕ Z2 .
b.) Show more generally that if p is prime Zpn and Zpn−m ⊕ Zpm are not isomorphic if
0 < m < n.
c.) How many nonisomorphic abelian groups have order pn ?
Exercise
Suppose e1 , e2 ∈ Z2 are two linearly independent vectors (over Q). Let Λ = he1 , e2 i ⊂
Z2 be the sublattice generated by these vectors. Then Z2 /Λ is a finite abelian group.
Compute its Kronecker decomposition in terms of the coordinates of e1 , e2 .
– 431 –
where we are writing the group G additively, so ng = g + · · · + g.
One can show that any finitely generated abelian group fits in an exact sequence
0 → Tors(G) → A → Zr → 0 (16.28)
Zr ⊕ Tors(G) (16.29)
The integer r, called the rank of the group, and the finite abelian group Tors(G) are
invariants of the finitely generated abelian group. Since we have a general picture of the
finite abelian groups we have now got a general picture of the finitely generated abelian
groups.
Remark:
Remarks
1. The groups C, R, Q under addition are abelian but not finitely generated. This is
obvious for C and R since these are uncountable sets. To see that Q is not finitely
generated consider any finite set of fractions { pq11 , . . . , pqss }. This set will will only
generate rational numbers which, when written in lowest terms, have denominator at
most q1 q2 · · · qs .
2. Note that a torsion abelian group need not be finite in general. For example Q/Z is
entirely torsion, but is not finite.
3. A rich source of finitely generated abelian groups are the integral cohomology groups
H n (X; Z) of smooth compact manifolds.
4. We must stress that the presentation (16.29) of a finitely generated abelian group
is not canonical! There are many distinct splittings of (16.28). They are in 1-1
correspondence with the group homomorphisms Hom(Zr , Tors(G)). For a simple
example consider Zd /Λ where Λ is a general sublattice of rank less than d.
5. In a nonabelian group the product of two finite-order elements can very well have
infinite order. Examples include free products of cyclic groups and simple rotations
by 2π/n around different axes in SO(3). So, there is no straightforward generalization
of Tors(G) to the case of nonabelian groups.
Exercise
– 432 –
Consider the finitely generated Abelian group 227
X
L = {(x1 , x2 , x3 , x4 ) ∈ Z4 | xi = 0mod2} (16.30)
i
v1 = (1, 1, 1, 1)
(16.31)
v2 = (1, 1, −1, −1)
Exercise
Given a set of finite generators of an Abelian group A try to find an algorithm for a
splitting of the sequence (16.28).
1 = Gs+1 / Gs / · · · / G2 / G1 = G (16.32)
which have the property that Gi+1 is a maximal normal subgroup of Gi . (Note: Gi+1 need
not be normal in G. Moreover, there might be more than one maximal normal subgroup
in Gi . ) As a simple example we shall see that we have ♣should give an
example of this....
♣
1 = G4 / G3 = Z2 × Z2 / G2 = A4 / G1 = S4 (16.33)
but
G 3 = 1 / G 2 = An / G 1 = S n n≥5 (16.34)
Not every group admits a composition series. For example G = Z does not admit
a composition series. (Explain why!) However, it can be shown that every finite group
admits a composition series. ♣Give a reference.
♣
227
It is the root lattice of so(8).
– 433 –
It follows that in a composition series the quotient groups Gi /Gi+1 are simple groups:
By definition, a simple group is one whose only normal subgroups are 1 and itself. From
what we have learned above, that means that a simple group has no nontrivial homomorphic
images. It also implies that the center is trivial or the whole group.
Let us prove that the Gi /Gi+1 are simple: In general, if N / G is a normal subgroup
then there is a 1-1 correspondence:
Subgroups H between N and G: N ⊂ H ⊂ G ⇔ Subgroups of G/N
Moreover, under this correspondence:
Normal subgroups of G/N ⇔ Normal subgroups N ⊂ H / G. If H/Gi+1 ⊂ Gi /Gi+1 ♣Make this an
exercise in an
were normal and 6= 1 then Gi+1 ⊂ H ⊂ Gi would be normal and and properly contain earlier section. ♣
1 = Gs+1 / Gs / · · · / G2 / G1 = G (16.35)
1 → Gi+1 → Gi → Si → 1 (16.37)
We have discussed the extension problem thoroughly above. One of the great achievements
of 20th century mathematics is the complete classification of finite simple groups, so let us
look at the finite simple groups:
First consider the abelian ones. These cannot have nontrivial subgroups and hence
must be of the form Z/pZ where p is prime.
So, now we search for the nonabelian finite simple groups. A natural source of non-
abelian groups are the symmetric groups Sn . Of course, these are not simple because
An ⊂ Sn are normal subgroups. Could the An be simple? The first nonabelian example
is A4 and it is not a simple group! Indeed, consider the cycle structures (2)2 . There are
three nontrivial elements: (12)(34), (13)(24), and (14)(23), they are all involutions, and
and therefore together with the identity they form a subgroup K ⊂ A4 isomorphic to
Z2 × Z2 . Since cycle-structure is preserved under conjugation, this is obviously a normal
subgroup of A4 !. After this unpromising beginning you might be surprised to learn:
– 434 –
Theorem An is a simple group for n ≥ 5.
We see that σ(1) = σ 0 (1) = 2, while σ(2) = 3 and σ 0 (2) = 4. We leave the case r = 2 to
the reader.
Now we proceed by induction: Suppose Aj is simple for 5 ≤ j ≤ n. Consider An+1 and
let N / An+1 . Then choose σ ∈ N and using the lemma consider σ 0 ∈ An+1 with σ 0 6= σ
and σ 0 (i) = σ(i) for some i. Let Hi ⊂ An+1 be the subgroup of permutations fixing i. It is
isomorphic to An . Now, σ 0 ∈ N since it is a conjugate of σ ∈ N and N is assumed to be
normal. Therefore σ −1 σ 0 ∈ N , and σ −1 σ 0 6= 1. Therefore N ∩ Hi 6= 1. But N ∩ Hi must
be normal in Hi . Since Hi ∼ = An it follows that N ∩ Hi = Hi . But Hi contains 3-cycles.
Therefore N contains 3-cycles and hence N ∼ = An+1 . ♠
Remark: For several other proofs of the same theorem and other interesting related
facts see
http://www.math.uconn.edu/kconrad/blurbs/grouptheory/Ansimple.pdf.
Digressive Remark: A group is called solvable if the Gi /Gi+1 are abelian (and hence
Z/pZ for some prime p). The term has its origin in Galois theory, which in turn was the
original genesis of group theory. Briefly, in Galois theory one considers a polynomial P (x)
with coefficients drawn from a field F . (e.g. consider F = Q or R). Then the roots of the
polynomial θi can be adjoined to F to produce a bigger field K = F [θi ]. The Galois group
of the polynomial Gal(P ) is the group of automorphisms of K fixing F . Galois theory
– 435 –
sets up a beautiful 1-1 correspondence between subgroups H ⊂ Gal(P ) and subfields
F ⊂ KH ⊂ K. The intuitive notion of solving a polynomial by radicals corresponds to
finding a series of subfields F ⊂ F1 ⊂ F2 ⊂ · · · ⊂ K so that Fi+1 is obtained from Fi
by adjoining the solutions of an equation y d = z. Under the Galois correspondence this
translates into a composition series where Gal(P ) is a solvable group - hence the name. If
we take F = C[a0 , . . . , an−1 ] for an nth order polynomial
then the roots θi are such that aj are the j th elementary symmetric polynomials in the θi
(See Chapter 2 below). The Galois group is then Sn . For n ≥ 5 the only nontrivial normal
subgroup of Sn is An , and this group is simple, hence certainly not solvable. That is why
there is no solution of an nth order polynomial equation in radicals for n ≥ 5.
Returning to our main theme, we ask: What other finite simple groups are there? The
full list is known. The list is absolutely fascinating: 228
4. 26 “sporadic oddballs”
We won’t explain example 3 in great detail, but it consists of a few more infinite
sequences of groups, like 1,2 above. To get a flavor of what is involved note the following:
The additive group Z/pZ where p is prime has more structure: One can multiply elements,
and if an element is nonzero then it has a multiplicative inverse, in other words, it is a finite
field. One can therefore consider the group of invertible matrices over this field GL(n, p),
and its subgroup SL(n, p) of matrices of unit determinant. Since Z/pZ has a finite number
of elements it is a finite group. This group is not simple, because it has a nontrivial center,
in general. For example, if n is even then the group {±1} is a normal subgroup isomorphic
to Z2 . If we divide by the center the we get a group P SL(n, p) which, it turns out, is
indeed a simple group. This construction can be generalized in a few directions. First,
there is a natural generalization of Z/pZ to finite fields Fq of order a prime power q = pk .
Then we can similarly define P SL(n, q) and it turns out these are simple groups except
for some low values of n, q. Just as the Lie groups SL(n, C) have counterparts O(n), Sp(n)
etc. one can generalize this construction to groups of type B, C, D, E. This construction
can be used to obtain the third class of finite simple groups. ♣Double check.
Does this figure
leave out a
228
See the Atlas of Finite Simple Groups, by Conway and Norton subgroup relation?
♣
– 436 –
Figure 43: A table of the sporadic groups including subgroup relations. Source: Wikipedia.
It turns out that there are exactly 26 oddballs, known as the “sporadic groups.” Some
relationships between them are illustrated in Figure 43. The sporadic groups first showed
up in the 19th century via the Mathieu groups
Mn is a subgroup of the symmetric group Sn . M11 , which has order |M11 | = 7920 was
discovered in 1861. We met M12 when discussing card-shuffling. The last group M24 , with
order ∼ 109 was discovered in 1873. All these groups may be understood as automorphisms
of certain combinatorial objects called “Steiner systems.”
It was a great surprise when Janko constructed a new sporadic group J1 of order
175, 560 in 1965, roughly 100 years after the discovery of the Mathieu groups. The list of
sporadic groups is now thought to be complete. The largest sporadic group is called the
Monster group and its order is:
– 437 –
but it has only 194 conjugacy classes! (Thus, by the class equation, it is “very” nonabelian.
The center is trivial and Z(g) tends to be a small order group.)
The history and status of the classification of finite simple groups is somewhat curious:
229
1. The problem was first proposed by Hölder in 1892. Intense work on the classification
begins during the 20th century.
2. Feit and Thompson show (1963) that any finite group of odd order is solvable. In
particular, it cannot be a simple group.
3. Janko discovers (1965) the first new sporadic group in almost a century.
4. Progress is then rapid and in 1972 Daniel Gorenstein (of Rutgers University) an-
nounces a detailed outline of a program to classify finite simple groups.
5. The largest sporadic group, the Monster, was first shown to exist in 1980 by Fischer
and Griess. It was explicitly constructed (as opposed to just being shown to exist)
by Griess in 1982.
Compared to the simple and elegant proof of the classification of simple Lie algebras
(to be covered in Chapter **** below) the proof is obviously terribly unwieldy.
It is conceivable that physics might actually shed some light on this problem. The
simple groups are probably best understood as automorphism groups of some mathemat-
ical, perhaps even geometrical object. For example, the first nonabelian simple group,
A5 is the group of symmetries of the icosahedron, as we will discuss in detail below. A
construction of the monster along these lines was indeed provided by Frenkel, Lepowsky,
Meurman, (at Rutgers) using vertex operator algebras, which are important in the descrip-
tion of perturbative string theory. More recently the mystery has deepened with interesting
experimental discoveries linking the largest Mathieu group M24 to nonlinear sigma models
with K3 target spaces. For more discussion about the possible role of physics in this subject
see:
1. Articles by Griess and Frenkel et. al. in Vertex Operators in Mathematics and
Physics, J. Lepowsky, S. Mandelstam, and I.M. Singer, eds.
2. J. Harvey, “Twisting the Heterotic String,” in Unified String Theories, Green and
Gross eds.
229
Our source here is the Wikipedia article on the classification of finite simple groups. See also: Solomon,
Ronald, “A brief history of the classification of the Finite simple groups,” American Mathematical Society.
Bulletin. New Series, 38 (3): 315-352 (2001).
– 438 –
3. L.J. Dixon, P.H. Ginsparg, and J.A. Harvey, “Beauty And The Beast: Superconfor-
mal Symmetry In A Monster Module,” Commun.Math.Phys. 119 (1988) 221-241
4. M.C.N. Cheng, J.F.R. Duncan, and J.A. Harvey, “Umbral Moonshine,” e-Print:
arXiv:1204.2779 [math.RT]
230
Break up the cases into d = 0 and d 6= 0. When d = 0 you can solve ad − bc = 1 for a. When d = 0
you can have arbitrary a but you must have bc = −1.
– 439 –
c.) Note that the scalar multiples of the 2 × 2 identity matrix form a normal subgroup
of SL(2, Z/pZ). Show that the number of such matrices is the number of solutions of
x2 = 1modp. Dividing by this normal subgroup produces the group P SL(2, Z/pZ). Jordan
proved that these are simple groups for p 6= 2, 3.
It turns out that P SL(2, Z5 ) ∼= A5 . (Check that the orders match.) Therefore the
next simple group in the series is P SL(2, Z7 ). It has many magical properties.
d.) Show that P SL(2, Z7 ) has order 168.
A rather abstract notion, which nevertheless has found recent application in string theory
and conformal field theory is the language of categories. Many physicists object to the high
level of abstraction entailed in the category language. Some mathematicians even refer to
the subject as “abstract nonsense.” (Others take it very seriously.) However, it seems to be
of increasing utility in the further formal development of string theory and supersymmetric
gauge theory. It is also essential for reading any of the literature on topological field theory.
We briefly illustrate some of that language here. Our main point here is to introduce a
different viewpoint on what groups are that leads to a significant generalization: groupoids.
Moreover, this point of view also provides some very interesting insight into the meaning of
group cohomology. Related constructions have been popular in condensed matter physics
and topological field theory.
(ψ1 , ψ2 ) 7→ ψ2 ◦ ψ1 (17.3)
Such that
1. A morphism φ uniquely determines its source X and target Y . That is, hom(X, Y )
are disjoint for distinct ordered pairs (X, Y ).
2. ∀X ∈ Ob(C) there is a distinguished morphism, denoted 1X ∈ hom(X, X) or
IdX ∈ hom(X, X), which satisfies:
1X ◦ φ = φ ψ ◦ 1X = ψ (17.4)
for all morphisms φ ∈ hom(Y, X) and ψ ∈ hom(X, Y ) for all Y ∈ Ob(C). 231
231
As an exercise, show that these conditions uniquely determine the morphism 1X .
– 440 –
3. Composition of morphisms is associative:
which is sometimes denoted C1p1 ×p0 C1 and called the fiber product. The composition law
takes C2 → C1 and may be pictured as the composition of arrows. If f : x0 → x1 and
g : x1 → x2 then the composed arrow will be denoted g ◦ f : x0 → x2 . The composition
law satisfies the axioms
Remarks:
232
We take an appropriate collection of sets and maps to avoid the annoying paradoxes of set theory.
– 441 –
2. TOP: The category of topological spaces and continuous maps.
4. MANIFOLD: The category of manifolds and suitable maps. We could take topo-
logical manifolds and continuous maps of manifolds. Or we could take smooth man-
ifolds and smooth maps as morphisms. The two choices lead to two (very different!)
categories.
6. GROUP: the category of groups and homomorphisms of groups. Note that here
if we took our morphisms to be isomorphisms instead of homomorphisms then we
would get a very different category. All the pairs of objects in the category with
nontrivial morphism spaces between them would be pairs of isomorphic groups.
8. Fix a group G and let G-SET be the category of G-sets, that is, sets X with a
G-action. For simplicity let us just write the G-action Φ(g, x) as g · x for x a point in
a G-set X. We take the morphisms f : X1 → X2 to satisfy satisfy f (g ·x1 ) = g ·f (x1 ).
9. VECTκ : The category of finite-dimensional vector spaces over a field κ with mor-
phisms the linear transformations.
One use of categories is that they provide a language for describing precisely notions
of “similar structures” in different mathematical contexts. When discussed in this way it
is important to introduce the notion of “functors” and “natural transformations” to speak
of interesting relationships between categories.
In order to state a relation between categories one needs a “map of categories.” This
is what is known as a functor:
Definition A functor between two categories C1 and C2 consists of a pair of maps Fobj :
Obj(C1 ) → Obj(C2 ) and Fmor : M or(C1 ) → M or(C2 ) so that if
f
x / y ∈ hom(x, y) (17.7)
then
Fmor (f )
Fobj (x) / Fobj (y) ∈ hom(Fobj (x), Fobj (y)) (17.8)
– 442 –
and moreover we require that Fmor should be compatible with composition of morphisms:
There are two ways this can happen. If f1 , f2 are composable morphisms then we say F is
a covariant functor if
Fmor (f1 ◦ f2 ) = Fmor (f1 ) ◦ Fmor (f2 ) (17.9)
and we say that F is a contravariant functor if
We usually drop the subscript on F since it is clear what is meant from context.
Exercise
Using the alternative definition of a category in terms of data p0,1 : X1 → X0 define
the notion of a functor writing out the relevant commutative diagrams.
so for every morphism f ∈ homC (Y, X) we associate f opp ∈ homC opp (X, Y ) such that
a.) Show that if F : C → D is a contravariant functor then one can define in a natural
way a covariant functor F : C opp → D.
b.) Show that if F : C → D is a covariant functor then we can naturally define another
covariant functor F opp : C opp → Dopp
Example 1: Every category has a canonical functor to itself, called the identity functor
IdC .
Example 2: There is an obvious functor, the forgetful functor from GROUP to SET.
This idea extends to many other situations where we “forget” some mathematical structure
and map to a category of more primitive objects.
233
Although we do have Fmor (IdX ) ◦ Fmor (f ) = Fmor (f ) for all f ∈ hom(Y, X) and Fmor (f ) ◦ Fmor (IdX ) =
Fmor (f ) for all f ∈ hom(X, Y ) this is not the same as the statement that Fmor (IdX ) ◦ φ = φ for all
φ ∈ hom(F (Y ), F (X)), so we need to impose this extra axiom.
– 443 –
Example 3: Since AB is a subcategory of GROUP there is an obvious functor F :
AB → GROUP.
Example 4: In an exercise below you are asked to show that the abelianization of a group
defines a functor G : GROUP → AB.
Example 5: Fix a group G. Then in the notes above we have on several occasions used
the functor
FG : SET → GROUP (17.14)
FG (f )
FG (X1 ) o FG (X2 ) (17.15)
The map FG (f ) is usually denoted f ∗ and is known as the pull-back. To be quite explicit:
If Ψ is a map of X2 → G then f ∗ (Ψ) := Ψ ◦ f is a map X1 → G.
This functor is used in the construction of certain nonlinear sigma models which are
quantum field theories where the target space is a group G. The viewpoint that we are
studying the representation theory of an infinite-dimensional group of maps to G has been
extremely successful in a particular case of the Wess-Zumino-Witten model, a certain two
dimensional quantum field theory that enjoys conformal invariance (and more).
Example 6: Now let us return to the category G-SET. Now fix any set Y . Then in the
notes above we have on several occasions used the functor
[g · (f ∗ Ψ)](x1 ) = (f ∗ Ψ)(g −1 · x1 )
= Ψ(f (g −1 · x1 ))
= Ψ(g −1 · (f (x1 ))) (17.17)
= (g · Ψ)(f (x1 ))
= (f ∗ (g · Ψ))(x1 )
Example 7: Fix a nonnegative integer n and a group G. Then the group cohomology we
discussed above (take the trivial twisting ωg = IdA for all g) defines a covariant functor
H n (G, •) : AB → AB (17.18)
– 444 –
To check this is really a functor we need to observe the following: If ϕ : A1 → A2 is a
homomorphism of Abelian groups then there is an induced homomorphim, usually denoted
ϕ∗ : H n (G, A1 ) → H n (G, A2 ) (17.19)
You have to check that Id∗ = Id and
(ϕ1 ◦ ϕ2 )∗ = (ϕ1 )∗ ◦ (ϕ2 )∗ (17.20)
Strictly speaking we should denote ϕ∗ by H n (G, ϕ), but this is too fastidious for the present
author.
Example 8: Fix a nonnegative integer n and any group A. Then the group cohomology
we discussed above (take the trivial twisting ωg = IdA for all g) defines a contravariant
functor
H n (•, A) : GROUP → AB (17.21)
To check this is really a functor we need to observe the following: If ϕ : G1 → G2 is a
homomorphism of Abelian groups then there is an induced homomorphim, usually denoted
ϕ∗
ϕ∗ : H n (G2 , A) → H n (G1 , A) (17.22)
Example 9: Topological Field Theory. The very definition of topological field theory is
that it is a functor from a bordism category of manifolds to the category of vector spaces
and linear transformations. For much more about this one can consult a number of papers.
Two online resources are
http://www.physics.rutgers.edu/∼gmoore/695Fall2015/TopologicalFieldTheory.pdf
https://www.ma.utexas.edu/users/dafr/bordism.pdf
Note that in example 2 there is no obvious functor going the reverse direction. When
there are functors both ways between two categories we might ask whether they might be,
in some sense, “the same.” But saying precisely what is meant by “the same” requires
some care.
– 445 –
Example 1: The evaluation map. Here is another tautological construction which never-
theless can be useful. Let S be any set and define a functor
It is defined by τX (f, s) := f (s). This is known as the “evaluation map.” Then we need to
check
τX
FS (X) /X (17.29)
FS (ϕ) ϕ
τY
FS (Y ) /Y
The first functor F1 maps a ring R to the multiplicative group U (R) of multiplicatively
invertible elements. This is often called the group of units in R. If ϕ is a morphism of
rings and r ∈ U (R1 ) then ϕ(r) ∈ U (R2 ) and the map ϕ∗ : U (R1 ) → U (R2 ) defined by
ϕ∗ : r 7→ ϕ(r) (17.31)
234
This example uses some terms from linear algebra which can be found in the “User’s Manual,” Chapter
2 below.
– 446 –
makes perfect sense for g ∈ GL(n, R). Moreover,
Now we claim that the determinant defines a natural transformation τ : F1 → F2 . For each
object R ∈ Ob(COMMRING) we assign the morphism
defined by τR (g) := det(g). Thanks to (17.33) this is indeed a morphism in the cate-
gory GROUP, that is, it is a group homomorphism. Moreover, it satisfies the required
commutative diagram because if ϕ : R1 → R2 is a morphism of rings then
Example 2: The basic relation between Lie groups and Lie algebras the statement that the
functor which takes a Lie group G to its tangent space at the identity, T1 G is an equivalence
of the category of connected and simply-connected Lie groups with the category of finite-
dimensional Lie algebras. One of the nontrivial theorems in the theory is the existence of
a functor from the category of finite-dimensional Lie algebras to the category of connected
and simply-connected Lie groups. Intuitively, it is given by exponentiating the elements of
the Lie algebra.
– 447 –
the category of topological spaces with an action of the group π1 (X, x0 ). Closely related
to this, Galois theory can be viewed as an equivalence of categories.
In physics, the statement of “dualities” between different physical theories can some-
times be formulated precisely as an equivalence of categories. One important example of
this is mirror symmetry, which asserts an equivalence of (A∞ )-) categories of the derived
category of holomorphic bundles on X and the Fukaya category of Lagrangians on X ∨ .
But more generally, nontrivial duality symmetries in string theory and field theory have a
strong flavor of an equivalence of categories.
– 448 –
Exercise Freyd’s theorem
A “practical” way to tell if two categories are equivalent is the following:
By definition, a fully faithful functor is a functor F : C1 → C2 where Fmor is a bijection
on all the hom-sets. That is, for all X, Y ∈ Obj(C1 ) the map
is a bijection.
Show that C1 is equivalent to C2 iff there is a fully faithful functor F : C1 → C2 so that
any object α ∈ Obj(C2 ) is isomorphic to an object of the form F (X) for some X ∈ Obj(C1 ).
Exercise
As we noted above, there is a functor AB → GROUP just given by inclusion.
a.) Show that the abelianization map G → G/[G, G] defines a functor GROUP →
AB.
b.) Show that the existence of nontrivial perfect groups, such as A5 , implies that this
functor cannot be an equivalence of categories.
In addition to the very abstract view of categories we have just sketched, very concrete
objects, like groups, manifolds, and orbifolds can profitably be viewed as categories.
One may always picture a category with the objects constituting points and the mor-
phisms directed arrows between the points as shown in Figure 44.
Figure 44: Pictorial illustration of a category. The objects are the black dots. The arrows are
shown, and one must give a rule for composing each arrow and identifying with one of the other
arrows. For example, given the arrows denoted f and g it follows that there must be an arrow
of the type denoted f ◦ g. Note that every object x has at least one arrow, the identity arrow in
Hom(x, x).
– 449 –
As an extreme example of this let us consider a category with only one object, but
we allow the possibility that there are several morphisms. For such a category let us look
carefully at the structure on morphisms f ∈ M or(C). We know that there is a binary
operation, with an identity 1 which is associative.
But this is just the definition of a monoid!
If we have in addition inverses then we get a group. Hence:
Definition A group is a category with one object, all of whose morphisms are invertible.
To see that this is equivalent to our previous notion of a group we associate to each
morphism a group element. Composition of morphisms is the group operation. The in-
vertibility of morphisms is the existence of inverses.
We will briefly describe an important and far-reaching generalization of a group af-
forded by this viewpoint. Then we will show that this viewpoint leads to a nice geometrical
construction making the formulae of group cohomology a little bit more intuitive.
*************************************
CONSTRUCT EXERCISE HERE EXAMINING HOW CONCEPTS OF FUNCTORS
AND NATURAL TRANSFORMATIONS TRANSLATE INTO GROUP THEORY LAN-
GUAGE WHEN SPECIALIZED TO THE CATEGORIES CORRESPONDING TO GROUPS
*************************************
17.1 Groupoids
Note that for any object x in a groupoid, hom(x, x) is a group. It is called the auto-
morphism group of the object x.
Example 1. Any equivalence relation on a set X defines a groupoid. The objects are the
elements of X. The set Hom(a, b) has one element if a ∼ b and is empty otherwise. The
composition law on morphisms then means that a ∼ b with b ∼ c implies a ∼ c. Clearly,
every morphism is invertible.
Example 3. Let X be a topological space. The fundamental groupoid π≤1 (X) is the
category whose objects are points x ∈ X, and whose morphisms are homotopy classes of
paths f : x → x0 . These compose in a natural way. Note that the automorphism group of
a point x ∈ X, namely, hom(x, x) is the fundamental group of X based at x, π1 (X, x).
– 450 –
exotic (abelian) gauge theories of higher degree forms which arise in supergravity and string
theories.
Example 5. In the theory of string theory orbifolds and orientifolds spacetime must be
considered to be a groupoid. Suppose we have a right action of G on a set X, so we have
a map
Φ:X ×G→X (17.37)
such that
Φ(Φ(x, g1 ), g2 ) = Φ(x, g1 g2 ) (17.38)
Φ(x, 1G ) = x (17.39)
for all x ∈ X and g1 , g2 ∈ G. We can just write Φ(x, g) := x · g for short. We can then
form the category X//G with
Ob(X//G) = X
(17.40)
M or(X//G) = X × G
Note that (x, 1G ) ∈ hom(x, x) is the identity morphism, and the composition of morphisms
makes sense because we have a group action. Also note that pt//G where G has the trivial
action on a point realizes the group G as a category, as sketched above.
Example 6. In the theory of string theory orbifolds and orientifolds spacetime must be
considered to be a groupoid. (This is closely related to the previous example.)
Exercise
For a group G let us define a groupoid denoted G//G (for reasons explained later)
whose objects are group elements Obj(G//G) = G and whose morphisms are arrows defined
by
g1
h / g2 (17.44)
iff g2 = h−1 g1 h. This is the groupoid of principal G-bundles on the circle.
Draw the groupoid corresponding to S3 .
– 451 –
Exercise The Quotient Groupoid
a.) Show that whenever G acts on a set X one can canonically define a groupoid: The
objects are the points x ∈ X. The morphisms are pairs (g, x), to be thought of as arrows
g
x → g · x. Thus, X0 = X and X1 = G × X.
b.) What is the automorphism group of an object x ∈ X.
This groupoid is commonly denoted as X// G.
– 452 –
An interesting construction that applies to any category is its associated simplicial
space |C|.
This is a space made by gluing together simplices 236 whose simplices are:
• 0-simplices = objects
• and so on. There are infinitely many simplices of arbitarily high dimension because
we can keep composing morphisms as long as we like.
And so on. See Figures 45 and 46. The figures make clear how these simplices are
glued together:
∂∆1 (f ) = x1 − x0 (17.47)
∂∆2 (f1 , f2 ) = ∆1 (f1 ) − ∆1 (f1 f2 ) + ∆1 (f2 ) (17.48)
and for Figure 46 view this as looking down on a tetrahedron. Give the 2-simplices of
Figure 45 the counterclockwise orientation and the boundary of the 3-simplex the induced
orientation from the outwards normal. Then we have
Note that on the three upper faces of Figure 46 the induced orientation is the ccw orien-
tation for ∆(f1 , f2 f3 ) and ∆(f2 , f3 ), but with the cw orientation for ∆(f1 f2 , f3 ). On the
bottom fact the inward orientation is ccw and hence the outward orientation is −∆(f1 , f2 ).
Clearly, we can keep composing morphisms so the space |C| has simplices of arbitrarily
high dimension, that is, it is an infinite-dimensional space.
Let look more closely at this space for the case of a group, regarded as a category with
one object. Then in the above pictures we identify all the vertices with a single vertex.
For each group element g we have a one-simplex ∆1 (g) beginning and ending at this
vertex.
For each ordered pair (g1 , g2 ) we have an oriented 2-simplex ∆(g1 , g2 ), etc. We simply
replace fi → gi in the above formulae, with gi now interpreted as elements of G:
∂∆(g) = 0 (17.50)
– 453 –
∂∆(g1 , g2 , g3 ) = ∆(g2 , g3 ) − ∆(g1 g2 , g3 ) + ∆(g1 , g2 g3 ) − ∆(g1 , g2 ) (17.52)
See Figure 46.
Let us construct this topological space a bit more formally:
We begin by defining n + 1 maps from Gn → Gn−1 for n ≥ 1 given by
d0 (g1 , . . . , gn ) = (g2 , . . . , gn )
d1 (g1 , . . . , gn ) = (g1 g2 , g3 , . . . , gn )
d2 (g1 , . . . , gn ) = (g1 , g2 g3 , g4 , . . . , gn )
······ (17.53)
······
n−1
d (g1 , . . . , gn ) = (g1 , . . . , gn−1 gn )
n
d (g1 , . . . , gn ) = (g1 , . . . , gn−1 )
Now, there are also (n + 1) face maps which map an (n − 1)-simplex ∆n−1 into one of the
(n + 1) faces of the n-simplex ∆n :
di embeds the (n − 1) simplex into the face ti = 0 which is opposite the ith vertex ti = 1
of ∆n .
Now we identify 237
(q∞ n
n=0 ∆n × G ) / ∼
via
(di (~t), ~g ) ∼ (~t, di (~g )). (17.56)
The space we have constructed this way has a homotopy type denoted BG. This
homotopy type is known as the classifying space of the group G. It can be characterized
as the homotopy type of a topological space which is both contractible and admits a free
G-action.
Note that for all g ∈ G, ∂∆1 (g) = 0, so for each group element there is a closed loop.
On the other hand
∆1 (1G ) = ∂(∆2 (1G , 1G )) (17.57)
237
This means we take the set of equivalence classes and impose the weakest topology on the set of
equivalence classes so that the projection map is continuous.
– 454 –
so ∆1 (1G ) is a contractible loop. But all other loops are noncontractible. (Show this!)
Therefore:
π1 (BG, ∗) ∼
=G (17.58)
Moreover, if G is a finite group one can show that all the higher homotopy groups of BG
are contractible. So then BG is what is known as an Eilenberg MacLane space K(G, 1).
Even for the simplest nontrivial group G = Z/2Z the construction is quite nontrivial
and BG has the homotopy type of RP ∞ .
Now, an n-cochain in C n (G, Z) (here we take A = Z for simplicity) is simply an
assignment of an integer for each n-simplex in BG. Then the coboundary and boundary
maps are related by
hdφn , ∆i = hφn , ∂∆i (17.59)
and from the above formulae we recover, rather beautifully, the formula for the coboundary
in group cohomology.
Remarks:
2. A Lie group is a manifold and hence has its own cohomology groups as a manifold,
H n (G; Z). There is a relation between these: There is a group homomorphism
n+1 n
Hgroup cohomology (G; Z) → Htopological space cohomology (G; Z) (17.60)
3. One can show that H n (BG; Z) is always a finite abelian group if G is a finite group.
[GIVE REFERENCE].
4. The above construction of BG is already somewhat nontrivial even for the trivial
group G = {1G }. Indeed, following it through for the 2-cell, we need to identify
the three vertices of a triangle to one vertex, and the three edges to a single edge,
embedded as a closed circle. If you do this by first identifying two edges and then try
to identify the third edge you will see why it is called the “dunce’s cap.” It is true,
but hard to visualize, that this is a contractible space. Things only get worse as we
go to higher dimensions. A better construction, due to Milnor, is to construct what
is known as a “simplicial set,” and then collapse all degenerate simplices to a point.
This gives a nicer realization of BG, but one which is homotopy equivalent to the
one we described above. For the trivial category with one object and one morphism
one just gets a topological space consisting of a single point. 238
– 455 –
BZ is as the humble circle R/Z = S 1 . This generalizes to lattices BZd = T d , the
d-dimensional torus. On the other hand BZ2 must be infinite-dimensional but it can
be realized as RP∞ , the quotient of the unit sphere in a real infinite-dimensional
separable Hilbert space by the antipodal map. Similarly, BU (1) is CP∞ , realized as
the quotient of the unit sphere in a complex infinite-dimensional separable Hilbert
space by scaling vectors by a phase: ψ → eiθ ψ.
As an application of some of the general concepts of group theory we discuss briefly lattice
gauge theory.
Lattice gauge theory can be defined on any graph: There is a set of unoriented edges
Ē. Each edge can be given either orientation and we denote the set of oriented edges by E.
The set of vertices is denoted V and source and target maps that tells us the vertex at the
beginning and end of each oriented edge:
We will view the union of edges Ē (i.e. forgetting the orientation) as a topological space
and denote it as Γ.
The original idea of Ken Wilson was that we could formulate Yang-Mills theory on
a “lattice approximation to Euclidean spacetime” which we visualize as a cubic lattice in
Rd for some d. Then, the heuristic idea is, that as the bond lengths are taken to zero we
get a good approximation to a field theory in the continuum. Making this idea precise is
highly nontrivial! For example, just one of the many issues that arise is that important
symmetries such as Euclidean or Poincaré symmetries of the continuum models we wish to
understand are broken, in this formulation, to crystallographic symmetries.
– 456 –
and so on.
In lattice field theory we attempt to go the other way: We assume that we have fields
defined on a sequence of lattices Λa ⊂ Rd and try to take an a → 0 limit to define a
continuum field theory.
Here is a simple paradigm to keep in mind: 239 Consider the one-dimensional lattice
Z, but it is embedded in the real line so that bond-length is a, so Λa = {an|n ∈ Z} ⊂ R.
Our degrees of freedom will be a real number φ` (n) at each lattice site n ∈ Z, and it will
evolve in time to give a motion φ` (n, t) according to the action:
Z X m k 2
2
S= dt φ̇` (n, t) − (φ` (n, t) − φ` (n + 1, t)) (18.6)
R 2 2
n∈Z
We can think of this as a system of particles of mass m fixed at the vertices of Λa with
neighboring particles connected by a spring with spring constant k. For the action to
have proper units, φ` (n, t) should have dimensions of length, suggesting it measures the
displacement of the particle in some orthogonal direction to the real line. The equations
of motion are of course:
d2
m φ` (n, t) = k(φ` (n + 1, t) − 2φ` (n, t) + φ` (n − 1, t)) (18.7)
dt2
Now we wish to take the a → 0 limit. We assume that there is some differentiable function
φcont (x, t) such that
φcont (x, t)|x=an = φ` (n, t) (18.8)
so by Taylor expansion
d2
φ` (n + 1, t) − 2φ` (n, t) + φ` (n − 1, t) = a2
φcont |x=an + O(a3 ) (18.9)
dx2
Now suppose we scale the parameters of the Lagrangian so that
v2T
m = aT k= (18.10)
a
then, if the limits really exist, the continuum function φcont (x, t) must satisfy the wave
equation:
d2 2 d
2
φ cont − v φcont = 0 (18.11)
dt2 dx2
whose general solution is
Φlef t (x + vt) + Φright (x − vt) (18.12)
The general solution is described by arbitrary wavepackets traveling to the left and right
along the real line. (We took v > 0 here.) We can also see this at the level of the Lagrangian
since if φ` (n, t) is well-approximated by a continuum function φcont (x, t) then
" 2 2 #
v2 d
Z Z
1 d
S→T dt dx φcont − φcont + O(a) (18.13)
R R 2 dt 2 dx
Remarks
239
Here we will just latticize the spatial dimension of a 1 + 1 dimensional field theory. In the rest of the
section we latticize spacetime with Euclidean signature.
– 457 –
1. In the lattice theory there will certainly be sequences of field configurations φlattice (n, t)
that have no good continuum limit. The idea is that these are unimportant to the
physics because they have huge actions whose contributions to the path integral is
unimportant in the continuum limit.
U : P → G, (18.15)
which satisfies the composition law: If we concatenate two paths γ1 and γ2 to make a path
γ1 ◦ γ2 , so that the concatenated path begins at γ1 (0) and ends at γ2 (1) and such that
γ1 (1) = γ2 (0), that is, the end of γ1 is the beginning of γ2 , then we must have:
so, really, in lattice gauge theory it suffices to know the U(e) for the edges. If e−1 is the
edge e with the opposite orientation then
– 458 –
We will denote the space of all connections by A(Γ).
Remark: Background heuristics: For those who know something about gauge fields in
field theory we should think of U(e) as the parallel transport (in some trivialization of
our principal bundle) along the edge e. From these parallel transports along edges we can
recover the components of the gauge field. To explain more let us assume for simplicity
that G = U (N ) is a unitary group, or some matrix subgroup of U (N ).
Recall some elementary ideas from the theory of Lie groups: If α is any anti-Hermitian
matrix then exp[α] is a unitary matrix. Moreover, if α is “small” then exp[α] is close to the
identity. Conversely, if U is “close” to the identity then it can be uniquely written in the
form U = exp[α] for a “small” anti-Hermitian matrix α. Put more formally: The tangent
space to U (N ) at the identity is the (real!) vector space of N × N anti-Hermitian matrices.
(This vector space is a real Lie algebra, because the commutator of anti-Hermitian matrices
is an anti-Hermitian matrix.) Moreover, the exponential map gives a good coordinate chart
in some neighborhood of the identity of the topological group U (N ).
The poor man’s way of understanding the relation between Lie algebras and Lie groups
is to use the very useful Baker-Campbell-Hausdorf formula: If A, B are n × n matrices then
the formula gives an expression for an n × n matrix C so that
eA eB = eC (18.21)
The formula is a (very explicit) infinite set of terms all expressed in terms of multiple
commutators. The first few terms are:
1 1 1 1
C = A + B + [A, B] + [A, [A, B]] + [B, [B, A]] + [A, [B, [A, B]]] + · · · (18.22)
2 12 12 24
The series is convergent as long as A, B are small enough (technically, such that the char-
acteristic values of Ad(A) and Ad(B) are less than 2π in magnitude). See Chapter 8 for a
full explanation. Note in particular that if we expand in small parameters 1 , 2 then
Now, returning to lattice gauge theory: In the usual picture of “approximating” Eu-
clidean Rd by Zd with bond-length a we can write a fundamental edge eµ (~n) as the straight
line in Rd from ~n to ~n+aêµ . If a is small and we have some suitable continuity then U(eµ (~n))
will be near the identity and we can write:
for some anti-Hermitian matrix Aµ (a~n). In lattice gauge theory, the connections with a
good continuum limit are those such that there is a locally defined 1-form valued in N × N
anti-Hermitian matrices Acont
µ (~x)dxµ so that Acont
µ (a~ n) = Alattice
µ (~n).
Now, the gauge field U has redundant information in it. The reason it is useful to
include this redundant information is that many aspects of locality become much clearer
– 459 –
when working with A(Γ) as we will see when trying to write actions below. The redundant
information is reflected in a gauge transformation which is simply a map
f :V→G (18.25)
The idea is that if γ is a path then the gauge fields U and U0 related by the rule
are deemed to be gauge equivalent, i.e. isomorphic. We denote the set of gauge transfor-
mations by G(Γ). Note that, being a function space whose target is a group, this set is a
group in a natural way. It is called the group of gauge transformations. 240 The group of
gauge transformations G(Γ) acts on A(Γ). The moduli space of gauge inequivalent fields is
the set of equivalence classes: A(Γ)/G(Γ). Mathematicians would call these isomorphism
classes of connections.
It might seem like there is no content here. Can’t we always choose f (s(γ)) to set U0 (γ)
to 1? Yes, in general, except when s(γ) = t(γ), that is, when γ is a closed loop based at a
vertex, say v0 . For such closed loops we are stuck, all we can do by gauge transformations
is conjugate:
U0 (γ) = gU(γ)g −1 (18.27)
where g is the gauge transformation at v0 . Moreover, if we start the closed loop at another
vertex on the loop then the parallel transport is again in the same conjugacy class. Thus
there is gauge invariant information associated to a loop γ: The conjugacy class of the
U(γ). That is: The holonomy function:
In fact, one can show that HolU is a complete invariant, meaning that we have the
converse: If HolU0 = HolU then U0 is gauge equivalent to U. Put informally:
Exercise
240
AND IS NOT TO BE CONFUSED WITH THE gauge group G!!!!
– 460 –
Show that if γ is a closed loop beginning and ending at v0 and if v1 is another vertex
on the path γ then if γ 0 describes the “same” loop but starting at v1 then U(γ) and U(γ 0 )
are in the same conjugacy class in G.
Exercise
Consider a graph Γ which forms a star: There is one central vertex, and r “legs” each
consisting or Ni edges radiating outward, where i = 1, . . . , r.
a.) Show explicitly that any gauge field can be gauged to U = 1.
b.) What is the unbroken subgroup of the group of gauge transformations? (That is,
what is the automorphism group of the gauge field U = 1? )
Exercise
Consider a d-dimensional hypercubic lattice with periodic boundary conditions, so that
we are “approximating a torus” which is a product of “circles” of length N a.
What is the maximal number of edges so that we can set U(e) = 1?
If Γ is finite and G is finite this sum is just a finite sum. If Γ is finite and G is a finite-
dimensional Lie group then A(Γ)/G(Γ) is a finite-dimensional topological space and the
“sum” needs to be interpreted as some kind of integral. Since a connection on Γ is com-
pletely determined by its values on the elementary edges (for a single orientation) we can,
noncanonically, identity the space of all connections as
A(Γ) ∼
= G|Ē| . (18.32)
Similarly
G(Γ) ∼
= G|V| (18.33)
Now we need a way of integrating over the group. If G is a finite group and F : G → C
is a function then Z
1 X
F dµ := F (g) (18.34)
G |G|
g∈G
– 461 –
This basic idea can be generalized to Lie groups. A Lie group is a manifold and we define
a measure on it dµ. (If G is a simple Lie group then there is a canonical choice of measure
up to an overall scale.) As a simple example, coonsider U (1) = {eiθ } then the integration
is Z 2π
dθ
F (eiθ ) (18.35)
0 2π
In all cases, the crucial property of the group integration is that, for all h we have
Z Z Z
F (gh)dµ(g) = F (hg)dµ(g) = F (g)dµ(g) (18.36)
G G G
This property defines what is called a left-right-invariant measure. It is also known as the
Haar measure.
In general the Haar measure is only defined up to an overall scale. In the above
examples we chose the normalization so that the volume of the group is 1.
Now, choosing a left-right-invariant measure we can define:
Z
1
Z= F̂ (U)dµA(Γ) (18.37)
vol (G(Γ)) A(Γ)
a~n → a~n + aêµ → a~n + aêµ + aêν → a~n + aêν → a~n (18.38)
where we have summed over all plaquettes in the exponential to make this look more like a
discrete approximation to a field theory path integral, and the action S(p) of a plaquette p
is some class function applied to U(p). If G is a continuous group then we need to interpret
the sum over A(Γ)/G(Γ) as some kind of integral, as discussed above.
– 462 –
Figure 47: A small plaquette, centered on a surface element in a tangent plane with coordinates
(x, y) and centered on a point with coordinates (x, y). The holonomy around the plaquette, to
leading order in an expansion in small values of bond-length a is governed by the curvature tensor
evaluated on that area element.
Remark: More background heuristics: For those who know something about gauge fields
in field theory we should think of the parallel transport U(p) around a plaquette p as
defining the components of the curvature on a small area element dxµ ∧ dxν at some point
~x0 = a~n (in some framing). Indeed, using the idea that
we can try to take a “limit” where a → 0. The plaquette pµν (~n) is two-dimensional so,
temporarily choosing coordinates so that µ = 1 and ν = 2 we can write the plaquette gauge
group element as
1 1 1 1
eaA1 (x,y− 2 a) eaA2 (x+ 2 a,y) e−aA1 (x,y+ 2 a) e−aA2 (x− 2 a,y) (18.41)
See Figure 47. Now, using the BCH formula 241 we define the fieldstrength of the gauge
field or, equivalently, the curvature of the connection by
241
Warning: If you are not careful the algebra can be extremely cumbersome here! Taylor expansion in
to order a2 gives:
a2 2 2 2
∂2 A1 aA2 + a2 ∂1 A2 −aA1 − a2 ∂2 A1 −aA2 + a2 ∂1 A2
eaA1 − 2 e e e (18.42)
2
We only need to keep the first commutator term in the BCH formula if we are working to order a so we
get
2 3
ea (∂1 A2 −∂2 A1 +[A1 ,A2 ])+O(a ) (18.43)
– 463 –
A standard action used in lattice gauge theory in the literature is constructed as follows:
First, choose a finite-dimensional unitary representation of G, that is, a group homo-
morphism
ρ : G → U (r) (18.46)
Next, define the action for a plaquette to be
for some constant K. Note that the trivial gauge field has action S(p) = 0. Moreover,
every unitary matrix can be diagonalized, by the spectral theorem, with eigenvalues eiθi (p) ,
i = 1, . . . , r and then
r
X X
S(p) = K (1 − cos θi (p)) = 2K sin2 (θi (p)/2) (18.48)
i=1 i=1
is clearly positive definite for K > 0. This is good for unitarity (or its Euclidean counterpart
- “reflection positivity.”)
Remarks:
1. Correlation Functions: The typical physical quantities we might want to compute are
expectation values of products of gauge invariant operators. In view of our discussion
of gauge equivalence classes of gauge fields above one very natural way to make such
gauge invariant operators is via Wilson loop operators. For these one chooses a matrix
representation R : G → GL(N, C) of G (totally unrelated to the choice we made in
defining the action) and a particular loop γ and defines:
2. Yet more background heuristics: For those who know something about gauge fields
in field theory we can begin to recognize something like the Yang-Mills action if we
use (18.44) and write ♣There is a bit of a
cheat here since you
did not work out
X 1 X X
(r − Re[Trρ(U(pµν (~n)))]) → − Ka4 Trρ(Fµν (a~n))2 (18.52)
the plaquette to
S(p) = K order a4 . ♣
2
pµν (~
n) d n∈Z µ6=ν
~
– 464 –
The heuristic limit (18.52) is to be compared with the Yang-Mills action
Z
1 p
SY M = − 2 dd x detgg µλ g νρ TrFµν Fλρ (18.53)
2g0 X
where here we wrote it in Euclidean signature on a Riemannian manifold M . The
trace is in some suitable representation and the normalization of the trace can be
absorbed in a rescaling of the coupling constant g0 . If we use the representation
ρ : G → U (r) then
1 4−d ad−4
= Ka ⇒ K = (18.54)
g02 g02
The constant K must be dimensionless so that d = 4 dimensions is selected as special.
For d = 4 the Yang-Mills coupling g02 is dimensionless. It has dimensions of length
to a positive power for d > 4 and length to a negative power for d < 4. To take the
continuum limit we should hold g02 fixed and scale K as above as a → 0.
3. Very important subtlety in the case d = 4 Actually, if one attempts to take the
limit more carefully, the situation becomes more complicated in d = 4, because in
quantum mechanics there are important effects known as vacuum fluctuations. What
is expected to happen (based on continuum field theory) is that, if we replace K by
g −2 (a) and allow a-dependence then we can get a good limit of, say, correlation
functions of Wilson loop vev’s if we scale g 2 (a) so that
8π 2 8π 2 a1
= + βlog + O(g 2 (a2 )) (18.55)
g 2 (a1 ) g 2 (a2 ) a2
where there are higher order terms in the RHS in an expansion in g 2 (a2 ). Here β
is a constant, depending on the gauge group G and other fields in the theory. For
G = SU (n) we have the renowned result of D. Gross and F. Wilczek, and of D.
Politzer that
11
β=− n (18.56)
3
As long as β < 0 we see that g 2 (a2 ) → 0 as a2 → 0. This is known as asymptotic
freedom. It has the good property that as we attempt to take a2 → 0 the higher
order terms on the RHS are at least formally going to zero.
4. One can therefore ask, to what extent is this continuum limit rigorously defined and
how rigorously has (18.55) been established from the lattice gauge theory approach.
My impression is that it is still open. Two textbooks on this subject are:
1. C. Itzykson and J.-M. Drouffe, Statistical Field Theory, Cambridge
2. M. Creutz, Quarks, gluons, and lattices, Cambridge
5. Phases and confinement. Many crucial physical properties can be deduced from
Wilson loop vev’s. In Yang-Mills theory a crucial question is whether, for large
planar loops γ hW (R, γ)i decays like exp[−T Area(γ)] or exp[−µP erimeter(γ)]. If it
decays like the area one can argue that quarks will be confined. For a nice explanation
see S. Coleman, Aspects Of Symmetry, for a crystal clear explanation.
– 465 –
6. Including quarks and QCD. The beta function is further modified if there are “mat-
ter fields” coupling to the gauge fields. If we introduce nf Dirac fermions in the
fundamental representation of SU (n) then (18.56) is modified to:
11 2
β=− n − nf (18.57)
3 3
The theory of the strong nuclear force between quarks and gluons is based on n = 3
and nf = 6. Actually, there is a strong hierarchy of quark masses so for low energy
questions nf = 2 (for “up” and “down” quarks) is more relevant.
7. There are very special situations in which β = 0 and in fact all the higher terms
on the RHS of the “renormalization group equation” (18.55) vanish. These lead to
scale-invariant theories, and in good cases, to conformal field theories. In the modern
viewpoint on field theory, these conformal field theories are the basic building blocks
of all quantum field theories.
– 466 –
classes of field theories that are independent of distances in spacetime. They focus on the
topological aspects of physics. A formal mathematical definition is that it is a functor from
some bordism category to, say, the category Vectκ .
If G is a finite group and we are working on a smooth manifold then there can be no
curvature tensor, so all gauge fields are “flat.” They can still be nontrivial since U(γ) can
still be nontrivial for homotopically nontrivial loops. The simplest example would be 0 + 1
dimensional Yang-Mills theory on a circle. If the action is literally zero then the partition
function is just
1 X
Z= 1 (18.58)
|G|
g∈G
Recalling our discussion of the class equation we recognize that the partition function can
be written as: X 1
Z= (18.59)
c.c.
|Z(g)|
where we sum over conjugacy classes in the group and weight each class by one over the
order of the centralizer of some (any) representative of that class. This second form of the
sum can be interpreted as a sum over the isomorphism classes of principal G-bundles over
the circle, weighted by one over the automorphism group of the bundle.
For those who know something about gauge theory note that this illustrates a very
general principle: In the partition function of a gauge theory we sum over all the iso-
morphism classes of bundles with connection: We weight the bundle with connection by a
gauge invariant functional divided by the order of the automorphism group of the bundle
with connection.
It is also worth remarking that, quite generally in field theory, the partition function
on a manifold of the form X × S 1 can be interpreted as a trace in a Hilbert space. With
proper boundary conditions for the “fields” around S 1 we simply have
where δ(g) is the Dirac delta function relative to the measure dµ(g) we chose on G, and is
concentrated at g = 1G . Here we take the product over all plaquettes that are meant to be
“filled in” in the continuum limit. That means that the parallel transport around “small”
loops defined by plaquettes will be trivial. This does not mean that the gauge field is trivial!
For example if we consider a triangulation of a compact surface or higher dimensional
manifold with nontrivial fundamental group then there can be nontrivial holonomy around
– 467 –
homotopically nontrivial loops. In general, a connection, or gauge field, such that U(γ) = 1
for homotopically nontrivial loops (this is equivalent to the vanishing of the curvature 2-
form Fµν ) is known as a flat connection or flat gauge field. In topological gauge theories
we sum over (isomorphism classes of) flat connections.
Note that (18.61) is just part of the definition of a topological gauge theory. We want
to do this so that physical quantities only depend on topological aspects of the theory.
In standard Yang-Mills theory hW (R, γ)i will depend on lots of details of γ. Indeed,
one definition of the curvature is how W (R, γ) responds to small deformations of γ. In
topological gauge theories we want
Y
h W (Ri , γi )i (18.62)
i
Remark: In general, flat gauge fields for a group G on a manifold M are classified,
up to gauge equivalence by the conjugacy classes of homomorphisms Hom(π1 (M, x0 ), G).
For a flat gauge field, the standard Wilson action we discussed above will simply vanish.
We can get a wider class of models by using group cocycles. This was pointed out in the
paper
R. Dijkgraaf and E. Witten, “Topological Gauge Theory And Group Cohomology,”
Commun.Math.Phys. 129 (1990) 39.
and topological gauge theories that make use of group cocycles for the action are now
known as Dijkgraaf-Witten models.
For simplicity we now take our group G to be a finite group. Let us start with a
two-dimensional model. We can view Γ as a triangulation of an oriented surface M as in
Figure 48. We want a local action, so let us restrict to a flat gauge field on a triangle as in
Figure 45. We want to assign the local “Boltzman weight.” It will be a function:
W : G × G → C∗ (18.63)
(If we wish to match to some popular physical theories we might take it to be U (1)-valued.
The distinction will not matter for anything we discuss here.) Now referring to Figure 45
we assign the weight
W (g1 , g2 ) (18.64)
to this triangle. But now we have to decide if we are to use this, or W (g2 , (g1 g2 )−1 ) or
W ((g1 g2 )−1 , g1 ). In general these complex numbers will not be equal to each other. So we
number the vertices 1, . . . , |V| and then for any triangle T we start with the vertices with
the two smallest numbers. Call this W (T ). This will define an orientation that might or
might not agree with that on the surface M . Let (T ) = +1 if it agrees and (T ) = −1 if
– 468 –
it does not. Then the Boltzman weight for a flat gauge field configuration U on the entire
surface is defined to be
Y
W (U) := W (T )(T ) (18.65)
T
Now, if this weight is to be at all physically meaningful we definitely want the depen-
dence on all sorts of choices to drop out.
Now, one thing we definitely want to have is independence of the choice of triangulation.
A theorem of combinatorial topology states that any two triangulations can be related by a
sequence of local changes of type I and type II illustrated in Figure 49 and 50, respectively.
We see that the invariance of the action under type I requires:
and this is the condition that W should be a 2-cocycle. Similarly, the change of type II
doesn’t matter provided
which is again guaranteed by the cocycle equation! This strongly suggests we can get a
good theory by using a 2-cocycle, and that is indeed the case. But we need to check some
things first:
– 469 –
Figure 50: A local change of triangulation of type II.
1. The dependence on the labeling of the vertices drops out using an argument based
on topology we haven’t covered. This can be found in the Dijkgraaf-Witten paper.
Similarly, if W is changed by a coboundary then we modify
t(g1 )t(g2 )
W (g1 , g2 ) → W (g1 , g2 ) (18.68)
t(g1 g2 )
that is, we modify the weight by a factor based on a product around the edges. When
multiplying the contributions of the individual triangles to get the total weight (18.65)
the edge factors will cancel out from the two triangles sharing a common edge.
2. The action is not obviously gauge invariant, since it is certainly not true in general
that W (g1 , g2 ) is equal to
for all group elements h(v1 ), h(v2 ), h(v3 ) ∈ G. The argument that, nevertheless, the
total action (18.65) is invariant is given (for the d = 3 case) in the Dijkgraaf-Witten
paper around their equation (6.29). ♣Cop out. Give a
better argument.
Explain that
3. The idea above generalizes to define a topological gauge theory on oriented manifolds Chern-Simons
actions change by
in d-dimensions for any d, where one uses a d-cocycle on G with values in C∗ (or boundary terms and
it is too much to
U (1)). These topological gauge theories are known as “Dijkgraaf-Witten theories.” hope for exact local
gauge invariance.
The Boltzmann weight W represents a topological term in the action that exists and ♣
– 470 –
4. The invariance under the change of type II in Figure 50, which can be generalized
to all dimensions is particularly interesting. It means that the action is an “exact
renormalization group invariant” in the sense reminiscent of block spin renormaliza-
tion. 242 This fits in harmoniously with the alleged the metric-independence of the
topological gauge theory.
5. The case d = 3 is of special interest, and was the main focus of the original Dijkgraaf-
Witten paper. In this case we have constructed a “lattice Chern-Simons invariant,”
and the theory with a cocycle [W ] ∈ H 3 (BG, U (1)) = Hgroupcohomology
3 (G, U (1)) is a
Chern-Simons theory for gauge group G. In the case of G finite one can show that
H 3 (BG, U (1)) ∼
= H 4 (BG; Z). In general the level of a Chern-Simons theory is valued
in H 4 (BG; Z) for all compact Lie groups G.
242
The idea of block spin renormalization, invented by Leo Kadanoff, is that we impose some small lattice
spacing a as a UV cutoff and try to describe an effective theory at ever larger distances. So, we block spins
together in some way, define an effective spin, and then an effective action
X
e−Sef f := e−S(spins) (18.70)
f ixed−ef f ective−spins
The hope is that at long distances, with ever larger blocks, the “relevant” parts of Sef f converge to a useful
infrared field theory description.
– 471 –