MTTS 2004, May 17 - June 12, 2004 Level I - Group Theory B.Sury I.S.I., Bangalore
MTTS 2004, May 17 - June 12, 2004 Level I - Group Theory B.Sury I.S.I., Bangalore
0
§0 Introduction
The presence of group theory can often be felt even when it is not seen. Let
me explain the meaning of the first statement. Generally, what one sees is
some geometrical object and the underlying group of symmetries is always
there, guiding the whole structure but, to be seen, it has to take an avataar.
Of course, the significance of a notion lies in the number of diverse situations
in which it plays a substantial role. In that respect, the notion of a group is
one of the most - if not the most - important in mathematics as well as in
other sciences. We digress for a moment to ask a question. We have been
saying that a group may be invisible but may be at work in a situation.
Here is a problem which exemplifies this.
Consider all integers of the form 14x + 20y and all integers of the form
24x + 34y where x, y run through integers. Then, look at the lattice points
(14x + 20y, 24x + 34y) in the plane. This set intersects the Y -axis and the
X-axis at times. What is the smallest natural number n for which both
(n, 0) and (0, n) are in this set? What is the proportion of lattice points on
the plane which lie on this set?
These questions can be easily and naturally answered by recognising the
groups figuring here. We shall do so later but you are urged to think about
it. I offer a small prize to whoever solves either question first.
Group as Latin square
All of you have probably already learnt a lot of group theory but let us
start with some easy and familiar things. Let us start with a visual way
of defining groups. This idea is due to Uri (not Sury !) Rimon of Hebrew
University, Jerusalem.
View multiplication table of group as a Latin square with a distinguished
vertex. That is to say, there is a square array of symbols (to represent the
elements of the group) with each symbol occurring exactly once in each row
and in each column. There is a distinguished symbol e such that whenever
one draws a rectangle eacb cornered at this symbol with ea along a column
and eb along a row, then the symbol c diagonally opposite to it can be taken
to represent the product ab. It is a nice exercise to deduce associativity and
verify that such a table is equivalent to a group structure on the symbols.
It is also a nice exercise to prove statements like (ab)−1 = b−1 a−1 .
A word of warning : Not every Latin square with a distinguished vertex
arises from a group. It is possibly true that if the operation using rectangles
as above is well-defined, then the Latin square indeed comes from a group.
1
as it tells us some new aspect of its personality. One could think of a
group as a special equivalence relation which is often how it arises. By
the way, continuing in the same vein of visual viewing, one may think of
a general equivalence relation in the following nice way. Given a set of d
objects, a relation is just a d × d matrix R of 0’s and 1’s associated in an
obvious manner. Reflexivitiy and symmetricity correspond, respectively, to
the relation matrix having diagonal entries 1 and being symmetric.
Ex : What does transitivity correspond to?
Ans. Rij 6= 0 implies (R2 )ij 6= 0.
Without further ado, let us now start with the examples of groups that
one comes across. We shall then study them in different ways; study them
according to their size, study them according to the way they ‘act’, study
them according to their internal structure etc. Needless to say, there are
still many things to be properly understood. For instance, the finite simple
groups have been classified but it cannot be said that the classification of
finite simple groups (CFSG) is well-understood (indeed, no proof which is
5000 pages long can be so accepted).
2
• SO(n) = {g ∈ SL(n, R) : gg t = In }, is known as the special orthogonal
group of degree n.
• SU (n) = {g ∈ SL(n, C) : ḡg t = In }, is known as the special unitary group
of degree n.
t 0 In 0 In
• Sp(n) = {g ∈ SL(2n, R) : g g= }, is known as
−In 0 −In 0
the symplectic group of degree n; here 0 and In denote n × n block matrices.
These are nonabelian groups when n > 1.
• B(n, Q), the upper triangular rational matrices with nonzero diagonal
entries, U (n, Q), the subset consisting of those upper triangular matrices
which have all diagonal entries equal to 1 and T (n, Q), the diagonal matri-
ces with all diagonal entries nonzero rational numbers, are groups.
Note that T (n, Q), for any n and, U (2, Q) are abelian groups.
• GL(n, ZZ) = {g an integral matrix : det(g) = ±1}.
• GL(n, ZZ[1/p]) where p is a prime number.
Here, ZZ[1/p] denotes the set of rational numbers whose denominators are
divisible only by p and, the above group consists of all matrices whose de-
terminants are ± a power of p.
Note that
GL(n, ZZ) ≤ GL(n, ZZ[1/p]) ≤ GL(n, Q)
where we have used the notation H ≤ G to denote the fact that H is a
subgroup of G. We shall also write G ≥ H at times. Also, H < G for
groups would mean that H is a proper subgroup.
Let us note that the set of matrices
a a
G={ : a ∈ R∗ }
0 0
3
the center of G.
If µ(n) denotes the complex n-th roots of unity, we have a chain of subgroups
[
µ(n) ≤ µ(n) ≤ S 1 ≤ C∗ .
n
For any subset S ⊂ G, one denotes by < S >, the subgroup generated by
S. In concrete terms, it consists of all finite products of elements of S and
their inverses. For instance, for any group G and any positive integer n, the
subset Gn := {g n : g ∈ G} gives a subgroup < Gn >. Note that, in any
abelian group G, we have < Gn >= Gn .
Recall that a subgroup N is normal in G if gxg −1 ∈ N for all g ∈ G, x ∈ N .
This notion comes from Galois theory. Evidently, for any G and any n, the
subgroup < Gn > is normal in G; we write < Gn > EG.
Given a normal subgroup N , one can give a group structure to the set G/N
of all (left or right - both are same) cosets of the subgroup. There is a natural
onto homomorphism from G to G/N whose kernel is N and, conversely, the
kernel of any homomorphism from G to some group, is a normal subgroup
of G.
Examples of homomorphisms :
• i : H → G; inclusion of a subgroup H in G.
• For any n, Z → Z; a 7→ na.
• Z → Z/n; a 7→ a mod n.
• GL(n, C) → C∗ ; g 7→ det(g).
• Sn → P erm(n); σ 7→ Pσ .
• The composition of two homomorphisms.
Note that the composite of the last-mentioned homomorphism with the de-
terminant homomorphism is the ‘sign’ of the permutation.
• G → G; g 7→ g n if G is abelian.
• If 0 6= λ ∈ R, Z → R∗ ; n 7→ λn .
• R → R∗ ; t 7→ et .
1 t
• R → U (2, R); t 7→ .
0 1
1 t iθ Cosθ Sinθ
• S → SO(2) = {g ∈ SL(2, R) : gg = 1}; e 7→ .
−Sinθ Cosθ
• SO(2) → S 1 ; g 7→ last column.
• SU (2) → S 3 ; g 7→ last column. Pn
Here S n−1 = {(z1 , · · · , zn ) ∈ Rn : 2
i=1 zi = 1}, the unit sphere in n
dimensional space. The above two are isomorphisms. Some of the other
homomorphisms above are isomorphisms as well.
4
One says that a group G is finitely generated if there exists a finite subset
S ⊂ G such that G =< S >.
Given a subset S ⊂ G, one also defines the normal subgroup generated by S
as the smallest normal subgroup < S >N of G which contains S. In concrete
terms, it consists of all finite products of conjugates of elements of S and
their inverses.
For any subset S ⊂ G, we define the normaliser
Observe < S > ENG (S) ≤ G and NG (S) is the largest subgroup of G
containing S in which < S > is normal.
For any group G, and x, y ∈ G, we denote [x, y] = xyx−1 y −1 . Such elements
are called commutators in G. If S is the subset of all commutators of G,
one obtains the commutator subgroup of G, denoted by [G, G]. Sometimes,
one also denotes [G, G] by D(G) and refers to it as the derived group of G.
More generally, for H, K ≤ G, [H, K] denotes the subgroup of G generated
by {[h, k] : h ∈ H, k ∈ K}.
A group G is said to be solvable - the terminology comes from Galois theory
- if the chain
becomes the trivial group in a finite number of steps. Evidently, any abelian
group is solvable. A nonabelian example is the group B(n, Q), the upper
triangular rational matrices with nonzero diagonal entries for any n ≥ 2.
A complementary concept is that of a perfect group; a group G is said to be
perfect if G = [G, G].
A group G is said to be nilpotent if the lower central series
becomes trivial in a finite number of steps where C n+1 (G) = [G, C n (G)].
It is not only evident that abelian groups are nilpotent but also that nilpo-
tent groups are solvable because, it is seen by induction on n that Dn (G) ≤
C n (G) for any G. The group U (n, Q) is nilpotent but nonabelian when
n > 2.
In any group G, let xy denote yxy −1 and [x, y] denote the commutator
xyx−1 y −1 . It is also convenient to define x−y to be yx−1 y −1 . Also, one
defines inductively
5
Free groups and presentations
Fn , the free group of rank n, is defined to be the set of all reduced (fi-
nite) words in n symbols x1 , · · · , xn and their ‘inverse symbols’ denoted by
x−1 −1
1 , · · · , xn . Here, a word is said to be reduced if it does not contain two
symbols xi and x−1 i in juxtaposition. The empty word is also counted and
is the identity element. The group multiplication is by concatenation of two
reduced words and cancelling off all consecutive symbols of the form xi x−1 i
or x−1
i x i .
Fn is nonabelian if n ≥ 2. It is evident that any group which can be gen-
erated by n elements can be identified upto isomorphism with a quotient
group of Fn .
A group G is said to be finitely presentable if G ∼ = Fn /K for some n where
N
K =< R > for some finite subset R of Fn . The choice of finite sets X ⊂ G
with |X| = n and R ⊂ Fn with Fn / < R >N ∼ = G is called a finite presenta-
tion of G.
If G =< X|R > and H =< Y |S >, then their free product is defined to
be the group < X t Y |R ∪ S > and is denoted by G ∗ H. Note that Fn is
a free product of ZZ with itself n times (taking free products is an associa-
tive operation). We shall see via some problems that familiar groups like
SL(2, ZZ)/{±I} are free products.
Group actions
Let us recall the concept of group actions which makes groups one of the
most powerful mathematical tools.
A group G is said to act on a set S if there is a homomorphism from G to
the permutation group Sym(S) of S. The action is said to be faithful if the
homomorphism is injective. It is customary to write g.s for the element of S
that s ∈ S is sent to by the permutation corresponding to g ∈ G. For s ∈ S,
the subset G.s := {g.s : g ∈ G} is called the orbit of s. For any g ∈ G, the
subset S g := {s ∈ S : g.s = s} is called the set of fixed points under g. For
s ∈ S, the subgroup Gs := {g ∈ G : g.s = s} is called the isotropy subgroup
at s. Note that, for s ∈ S, the map g 7→ g.s is a well-defined bijection from
the set G/Gs of left cosets of Gs to the orbit G.s of s. Thus, orbits under a
finite group action have cardinalities which are divisors of the order of the
group.
In the particular case of a group acting on itself by conjugation, we denote
the orbit of an element g (this is the conjugacy class of g) by G(g). One
also writes g ∼ h to mean that g and h are in the same orbit; that is, they
are conjugate.
6
One calls an action of G on a set S transitive if each orbit is the whole set
S.
For instance, Sn acts transitively on {1, 2, · · · , n}.
The group GL(n, R) acts transitively on Rn − (0).
A group G is said to act r-transitively on a set S if G acts transitively on
the set
{(s1 , · · · , sr ) ∈ S r : gi distinct }.
Automorphism group
For an element g in a group G, one usually denotes the automorphism
x 7→ g −1 xg of G as Int (g). The group Aut (G) of all automorphisms of G is
defined by means of the composition in the following order: g(στ ) = (g(σ))τ ;
this is consistent with our convention of multiplying permutations. In this
convention, the set of automorphisms x 7→ Int(x) can be identified with a
subgroup of Aut G because
Jordan-Hölder theorem
The last important notion we recall is that of composition series and the
Jordan-Holder theorem. Recall that a normal series of a group G is simply
a finite sequence of subgroups
{1} = G0 C G1 · · · C Gn = G.
{1} = H0 C H1 · · · C Hm = G
{1} = G0 C G1 · · · C Gn = G
7
refinements. Note that this is equivalent to the successive factors being
simple groups.
For instance, any normal series for ZZ will have a proper refinement because
the smallest nontrivial term in such a series is an infinite cyclic group and will
have proper nontrivial (normal) subgroups. Therefore, ZZ has no composition
series.
The theorem of Jordan-Holder asserts that for a finite group, each normal
series admits of a refinement which is a composition series and that any two
composition series are isomorphic.
Finally, we recall Zorn’s lemma which is equivalent to the axiom of choice.
The use of Zorn’s lemma becomes unvoidable when we need to prove facts
like every vector space has a basis. Let (S, ≤) be any non-empty partially
ordered set. This means the three properties : (i) s ≤ s for all s ∈ S; (ii)
s ≤ t, t ≤ s ⇒ s = t and, (iii) s ≤ t ≤ u ⇒ s ≤ u.
A chain is a subset T of S in which, for any two elements s, t ∈ T , either
s ≤ t or t ≤ s. Zorn’s lemma asserts that if every chain T in S has an upper
bound (that is, an element s0 ∈ S such that t ≤ s0 for all t ∈ T ), then S has
a maximal element (that is, an element m ∈ S which is not ≤ any element
of S other than itself).
Finally, we recall that a group is said to have exponent d if d is the smallest
natural number for which each element g satisfies g d = 1.
One word of convention - a subgroup M of a group G is maximal if M is a
proper subgroup which is not contained in any other proper subgroup of G.
Also, the identity element is always denoted by 1 unless there is an abelian
group written additively when it is denoted by 0. Also, we write O(g) for
the order of an element g of a group G while the order of the group itself
is written as |G|. The notation pr kn means that pr is the highest p-power
dividing n.
Now, we begin with the systematic study of groups. After studying cyclic
and abelian groups, we discuss group actions and permutation groups. Then,
we shall study nilpotent and solvable groups. Following that we study groups
of matrices and groups figuring in geometry. Finally, we discuss miscella-
neous results on groups which do not fit exactly into any of the above titles.
Each of these discussions will be interspersed with several related results
and miscellaneous comments. Throughout the notes, we use the defi-
nitions and notations introduced in section §0.
§1 Cyclic groups
8
A group generated by a single element is called cyclic.
The only infinite cyclic group, upto isomorphism, is Z and for any n, the
only finite cyclic group of order n is, upto isomorphism, the set Z/n of
integers modulo n considered under addition modulo n. Indeed, for finite
cyclic G, g 7→ 1̄ is an isomorphism onto ZZ/|G|.
Evidently, Subgroups and quotient groups of cyclic groups are cyclic as well
but it is not true that a subgroup of an n-generated group is n-generated
although a quotient of an n-generated group is n-generated.
Lemma 1.1
(i) The direct product G×H of nontrivial cyclic groups G, H is cyclic if, and
only if, both G, H are finite and their orders m, n are coprime. (ii) A finite
group is cyclic if, and only if, it has aPunique subgroup of each order dividing
the order of the group. Hence n = d|n φ(d) for each natural number n.
Lemma 1.2
Let G be a finite group in which, for every n/|G|, the set {g : g n = e} has
at most n elements. Then, G is cyclic and the set {g : g n = e} has exactly
n elements for each n||G|. In particular, any finite subgroup of K ∗ is cyclic
for any field K.
Note that for the application to field theory, we need only prove the result
for abelian G. Later, we will discuss this again and prove it as an application
of Sylow’s theorems.
Proof of 1.2
Let |G| = n and d|n.
Consider N (d) = #{g ∈ G : O(g) = d}.
If N (d) 6= 0, look at some element g with O(g) = d. As e, g, g 2 , · · · , g d−1
are distinct and are solutions of xd = e, these are all the solutions of the
equation xd = e. As elements of order d in G are among these and are
φ(d) in number, we have proved that N (d) = φ(d) if N (d) P 6= 0. As every
element
P of G has some
P order d dividing n, we have n = d|n N (d). Since
n = d|n φ(d) ≥ d|n N (d) = n, we must have the equality N (d) = φ(d)
for all d|n. In particular, N (n) = φ(n) 6= 0.
All of us are familiar with the division algebra H of Hamilton’s quaternions
H := {a0 + a1 i + a2 j + a3 k : ai ∈ R}
where the mulltiplication is defined by i2 = j 2 = k 2 = −1, and ij = k = −ji.
Note that H is a 4-dimensional vector space over R and is central over R.
In general, a finite-dimensional division algebra D over a field K is a finite-
dimensional vector space D over K such that D has a multiplication under
9
which the set D∗ of all non-zero elements forms a group; the mutliplication
is also required to satisfy x(y + z) = xy + xz and (x + y)z = xz + yz and,
λ(xy) = (λx)y for all λ ∈ K and all x, y, z ∈ D. Note that D is a field if,
and only if, D∗ is an abelian group.
Further, D is said to be central over K if the centre of D∗ is K ∗ .
A very similar construction to H above produces many examples of central
division algebras over Q.
Remark 1.3
Let D be any finite-dimensional central division algebra over a field F of
characteristic p > 0. Then, every finite subgroup of D∗ is again cyclic.
On the other hand, the above Hamilton quaternion algebra
H := {a0 + a1 i + a2 j + a3 k : ai ∈ R}
10
To prove this, let θ : (ZZ/p2 )∗ → (ZZ/p)∗ be the canonical homomorphism.
Considering a as an element of (ZZ/p2 )∗ , it follows that an ∈ ker θ. Clearly,
Ker θ has order p and an is a nontrivial element of (ZZ/p2 )∗ since p2 6 |(an −1)
by hypothesis. Therefore, an has order p in (ZZ/p2 )∗ . This means p divides
the order of a in (ZZ/p2 )∗ and, thus ap−1 6≡ 1 modulo p2 .
Theorem 1.5
(ZZ/n)∗ = {a ≤ n : (a, n) = 1} is cyclic if, and only if, n = 2, 4, pr or 2pr for
some odd prime p.
Any generator for the group for such n is called a primitive root modulo n
in number-theoretic parlance.
Exercise 1.6
Are ZZ[1/2] and ZZ[1/3] isomorphic?
§2 Abelian groups
Lemma 2.1
A finite abelian group A is isomorphic to the direct sum of cyclic p-groups
for various primes p dividing its order.
A free abelian group of rank n is defined to be a group isomorphic to Zn ,
the set of integral n-tuples under co-ordinatewise addition. Equivalently,
P it
is an abelian group G with a basis of n elements g1 , · · · , gn i.e., i ai gi = 0
implies ai = 0 for all i and G = {a1 g1 + · · · + an gn : ai ∈ Z}. The rank n is
uniquely determined since the number of homomorphisms from G to Z/2 is
2n .
One of the most important results on abelian groups is the next theorem
which, in turn, follows from the one following it :
Theorem 2.2 (Structure theorem for finitely generated abelian
groups)
A finitely generated abelian group is isomorphic to Zm × Zd1 × · · · × Zdr for
some m ≥ 0 and di dividing di+1 . The integer m as well as all the di ’s (upto
sign) are uniquely determined.
Theorem 2.3 (Invariant factor theorem)
If H is a subgroup of a free abelian group G of rank n, then H is free abelian
of rank r ≤ n. Further, there are bases {e1 , · · · , en } of G and {d1 e1 , · · · , dr er }
of H respectively where di divides di+1 for i < r. The integers di are uniquely
determined upto sign and are called the invariant factors of H.
11
The proof is carried out by induction on n using the division algorithm as
follows. It is clear for n = 1. Assume n > 1 and that the theorem holds
for m < n. Corresponding to any basis of G, there is a positive integer
with the property that it is the smallest positive integer that occurs as a
coefficient in the expression of elements of H in terms of this basis. This
positive integer can depend on the basis and let l1 be the smallest such
with respect to all basesP of G. Let v1 , · · · , vn be a corresponding basis for
G such that v = l1 v1 + ni=2 ai vi ∈ H. Dividing all P the ai by l1 ,Pwe have
ai = qi l1 +Pri with 0 ≤ ri < l1 . Evidently, v = l1 (v1 + ni=2 qi vi ) + ni=2 ri vi
and v1 + ni=2 qi vi , v2 , · · · , vn is another basis of G. By the minimality Pn of
l1 , we must have ri = 0 for all i ≥ 2. Thus, writing w1 for v1 + i=2 qi vi ,
v = l1 w1 ∈ H. Look at the subset H0 of H which have coefficients of w1 to
be zero in terms of the basis w1 , v2 , · · · , vn of G. Clearly, H0 is aP subgroup
of H such that H0 ∩ Zv = {0}. Also, if h ∈ H, write h = b1 w1 + ni=2 bi vi .
Once again, dividing the bi ’s P by l1 , say, bi = mi l1 + si with 0 ≤ si < l1 ,
we have h − m1 v = s1 w1 + ni=2 bi vi ∈ H. Thus, by the minimality of
l1 we get s1 = 0 i.e., h − m1 v ∈ P H0 . Thus, H = H0 ⊕ Zv. Now, H0 is
n
contained in the subgroup G0 = i=2 Zvi . By induction hypothesis, G0
has a basis w2 , · · · , wn and there exists r ≤ n such that H0 has a basis of
the form d2 w2 , · · · , dr wr with d2 |d3 | · · · |dn . Clearly, therefore, H itself has
rank r ≤ n and l1 w1 , d2 w2 , · · · , dn wn is a basis for H. We have only to show
that l1 |d2 . Once again, writing d2 = cl1 + d with 0 ≤ d < l1 , we notice
l1 w1 + d2 w2 = l1 (w1 + cw2 ) + dw2 ∈ H where w1 + cw2 , w2 , · · · , wn is a basis
of G. Thus, minimality of l1 forces d = 0 i.e., l1 |d2 . The proof is complete.
Exercise
(i) Deduce 2.2 from 2.3.
(ii) Give an example of a group G and a subgroup H such that G can be
generated by some number n of elements while H cannot be n-generated.
(iii) Solve the prize problem posed in the beginning.
Lemma 2.4
The existence of bases as in the invariant factor theorem is equivalent to the
following statement about matrices :
Given any A ∈ Mm,n (ZZ) of maximum possible rank, there exist P ∈ GL(m, ZZ)
and Q ∈ GL(n, ZZ) such that P AQ is a matrix whose‘diagonal’ entries are
d1 , d2 , · · · where di |di+1 .
Further, GL(n, ZZ) is generated by elementary matrices I + Eij .
Proof
Suppose the matrix statement holds. Let H be a subgroup of a free abelian
group G of rank n. Then, H is also free abelian of rank m ≤ n (this we
12
are assuming known through other arguments). Let α : ZZm → H and
β : G → ZZn be isomorphisms. If i : H ≤ G denotes the inclusion map, we
have the composite map β ◦ i ◦ α corresponds to a matrix A ∈ Mn,m (ZZ) with
respect to the canonical ordered bases of ZZm and ZZn . The matrix statement
gives us P ∈ GL(n, ZZ) and Q ∈ GL(m, ZZ) such that
d1 ··· 0
... ..
.
..
.
AQ = P .
0 ··· dm
0 ··· 0
0 ··· 0
where di |di+1 .
Hence, the bases
{v1 , · · · , vn } = {P e1 , · · · , P en }
of ZZn and
{w1 , · · · , wm } = {Qe1 , · · · , Qem }
of ZZm are so that
{β −1 (v1 ), · · · , β −1 (vn )}
is a basis for G and {α(w1 ), · · · , α(wm )} is a basis for H.
Now, note that the matrix identity above implies that AQ(ei ) = P (di ei )
where ei on the left side are in ZZm and those on the right side are in ZZn .
That is, βα(Qei ) = di P (ei ).
So, we have βα(wi ) = di vi , which means that the bases {β −1 (v1 ), · · · , β −1 (vn )}
of G and {α(w1 ), · · · , α(wm )} of H are as asserted in the invariant factor
theorem.
Conversely, let us assume that the invariant factor theorem holds. Consider
any A ∈ Mn,m (ZZ) of rank max(m, n). Without loss of generality, we shall
take m ≤ n for, otherwise, we could take the transpose. Now, A defines a
homomorphism
TA : ZZm → ZZn ; v 7→ Av.
Now the image of TA is a free abelian group generated by the n vectors
Ae1 , · · · , Aem .
Since the matrix A has rank m, the vectors Ae1 , · · · , Aem are linearly in-
dependent vectors over Q. Therefore, they are linearly independent over ZZ
also. In other words, Image TA is free abelian subgroup of ZZn of rank m.
By the invariant factor theorem, let us choose bases {v1 , · · · , vn } of ZZn and
{d1 v1 , · · · , dm vm } of Image TA such that di |d+1 . Call Awi = di vi for all
13
i ≤ m.
Let P ∈ GL(n, ZZ) denote the matrix effecting the change of basis from the
canonical basis to the vi ’s. Similarly, let Q ∈ GL(m, ZZ) be the matrix ef-
fecting the change of basis from the canonical basis to the wi ’s.
Then, P −1 AQ(ei ) = di vi for all i ≤ m. In other words, P −1 AQ has the
form asserted.
The above proof of the invariant factor theorem clearly shows the generation
of GL(n, ZZ) by the elementary matrices.
Exercise 2.5
Prove that SL(n, ZZ) is perfect for n ≥ 3.
Lemma 2.6
For any A ∈ Mm,n (ZZ) define hi (A) to be the GCD of all i × i minors of
A. If A has maximal rank, then for any P ∈ GL(m, ZZ) and Q ∈ GL(n, ZZ),
the numbers hi (A) = hi (P A) = hi (AQ) for all i. The invariant factors of a
matrix A ∈ Mm,n (ZZ) are h1 (A), hh12 (A)
(A) h3 (A)
, h2 (A) , · · · etc.
We know that GL(n, ZZ) is generated by the matrices of the form Xij =
I + Eij ; i 6= j and the matrices diag(±1, · · · , ±1). elsewhere. We shall check
for each r that
hr (AXij ) = hr (Xij A)
for all i 6= j ≤ n.
By the previous lemma, we need to consider only A of the ‘diagonal’ form
with non-zero entries d1 , · · · , dm with di |di+1 .
Therefore, it is clear that hr (AD) = hr (DA) for D = diag(±1, · · · , ±1).
Now, for such A, we have, if i > m that AXij = A and, if i ≤ m, AXij =
A + A0 where A0 is a matrix whose only nonzero entry is di at the (i, j)-th
place.
Clearly, hr (AXij ) = hr (A).
Similarly, we see also that hr (Xij A) = hi (A). Therefore, we have the first
assertion. For the second, we merely note that for ‘diagonal’ matrices A
as above, with di |di+1 , the numbers hi (A) = d1 · · · di . Thus, the invariant
factors are successive quotients of the hi ’s.
Exercise 2.7
If S is any set of generators of the additive group of Q, then S contains a
proper subset of generators. Further, show Q does not have proper maximal
subgroups.
14
One can use group actions to prove many of the basic results on groups as
follows.
Cauchy’s & Fermat’s little theorems 3.1
Let G be any group of order n and let p any prime number. Consider the
subset S of G × · · · × G (p times) defined by
S = {(g1 , · · · , gp ) : g1 g2 · · · gp = e}.
Evidently, |S| = np−1 . For each tuple (g1 , · · · , gp ) in S, there are exactly
p distinct tuples (g2 , · · · , gp , g1 ), (g3 , · · · , gp , g1 , g2 ) etc. in S unless g1 =
g2 = · · · = gp (here is where we use the fact that p is prime). Note that
g1 = g2 = · · · = gp if, and only if, g1p = e. Thus, we have |S| ≡ #{g : g p = e}
mod p.
If p|n, then p divides np−1 = |S| ≡ #{g : g p = e} mod p. In this case,
(since ep = e), there are at least p − 1 elements of order p in G. This proves
Cauchy’s theorem.
If p 6 |n, then one has g p = e for some g if, and only if, e = g (n,p) = g. Thus,
|S| ≡ 1 mod p. This proves Fermat’s little theorem.
We have the following strikingly novel proofs of Fermat’s famous theorem
asserting that a prime number of the form 4n + 1 is a sum of two squares.
The proofs are due to Zagier, HeathBrown and Mohan Nair and are variants
of the same argument.
Lemma 3.2
Let p any prime number of the form 4n + 1. Consider the finite set
(x + 2z, z, y − x − z) if x≤y−z ,
15
the general fact that for any permutation τ of order 2 on a set, the non-
fixed points can be paired off and thus the number of fixed points is of
the same parity as |S|. Applying this to σ, we have that |S| must be odd.
Turning this around and applying the above observation to the permutation
τ : (x, y, z) 7→ (x, z, y), it follows that τ must have an odd number of fixed
points. Therefore, τ does have at least one fixed point. Any fixed point of
τ is a tuple (x, y, y) which means that p = x2 + 4y 2 .
Exercise 3.3
(i) Let p be as before but define
S1 = {(x, y, z) ∈ ZZ × N × N : x2 + 4yz = p}
and S2 = {(x, y, z) ∈ S1 : z > x + y}. Consider the Nair maps
α : S2 → S2 ; (x, y, z) 7→ (−x − 2y, y, z − x − y)
and β : S2 → S2 ; (x, y, z) 7→ (−x, y, z) or (x, z, y) according as to whether
z > y − x or z < y − x.
Then, prove α is an involution with a unique fixed point and draw the
conclusion about p using β.
(ii) Let p, S1 and S2 be as above. Consider the subset S3 = {(x, y, z) ∈ S1 :
z < x + y}. Prove that if no element of S1 is of the form (x, y, y), then all
the elements of S1 can be collected in groups of 4 where exactly 2 are in S2
and two in S3 . Consider the Heathbrown-Nair map
θ : S2 → S2 ; (x, y, z) 7→ (−x − 2y, y, z − x − y)
to conclude that |S2 | is odd and arrive at a contradiction.
Exercise 3.4
(i) For any n, prove that the permutations σ = (1 2) and τ = (1 2 · · · n)
generate the whole of Sn .
Further, if p is a prime, show that any transposition and any p-cycle gener-
ate Sp .
(ii) For general n, and for a transposition σ and any n-cycle τ , find a neces-
sary and sufficient condition for Sn to be generated by σ and τ .
Since any group acts faithfully on its underlying set by left multiplications,
one has Cayley’s theorem asserting the fact that any group is a group of
permutations. More generally, if H is a subgroup of G, then G acts by left
translations on the set of left cosets of H.
Lemma 3.5
If H has some finite index n in G, then H contains a normal subgroup N
16
whose index is a divisor of n!. In particular, if G is a finite group and p
is the smallest prime dividing O(G) and, if H ≤ G has index p, then it is
normal in G. Thus, subgroups of index 2 are normal.
Proof.
Take N to be the kernel of the action. Thus N ≤ H and G/N is isomorphic
to a subgroup of Sym(G/H) = Sn .
Now, ∃N ≤ H ≤ G with N normal in G and [G : N ]|p! Since [G : N ]|O(G)
as well, [G : N ] divides the GCD of O(G) and p! which is p since p is the
smallest prime dividing O(G). But, H contains N , which implies p|[G : N ].
Thus, [G : N ] = p = [G : H].
Group actions are naturally used to prove Sylow’s theorems and their gen-
eralisations as follows.
Theorem 3.6 (Frobenius)
Let G be a finite group and pr be a prime power dividing the order of G.
Then, there exist subgroups of order pr in G and that these are ≡ 1 mod p
in number.
Theorem 3.7 (Snapper)
Let G be a finite group and p be a prime such that pn ||G|. Suppose H ≤ G
has order pm ≤ pn . Then, :
(i) there exist subgroups of order pn containing H and,
(ii) the number of subgroups of order pn which contain H, is ≡ 1 modulo p.
Proof of 3.6
Consider the action by left multiplication of G on the set Ω of all subsets
of G with pr elements. Let us write |G| = pn d with n ≥ r and p 6 |d.
pn d
The cardinality of Ω is the binomial coefficient pr . It can be shown by
elementary number theory that |Ω| ≡ pn−r d mod pn−r+1 but as we observe
below this also follows from some group-theoretic counting. Break up Ω
into disjoint orbits, say, Ω = ti=1 G.Si where Si ∈ Ω. We claim that for
S
any S ∈ Ω, G = ∪g∈G gS. To see that this claim holds, take any S ∈ Ω
and any s ∈ S. Now, 1 = s−1 s ∈ s−1 S ⊂ G.S. Further, if g ∈ G, then
g = g.1 ∈ g.S. Thus, the claim is true. Hence, in any orbit, the number of
subsets is at least pn−r d and divides |G| = pn d. This means that exactly
one of the two things happen: either an orbit has exactly pn−r d elements
or it has a multiple of pn−r+1 number of elements. Note that |Ω| 6≡ 0 mod
pn−r+1 . Hence, not each orbit can have cardinality a multiple of pn−r+1 .
This already proves that there are orbits with exactly pn−r d elements. Note
that, obviously, the orbit of a set S in Ω has exactly pn−r d elements if, and
17
only if, the corresponding isotropy subgroup has order pr . But, we see that
each orbit with exactly pn−r d elements corresponds to exactly one subgroup
of order pr and conversely. This is because in each such orbit G.A, there is
exactly one group (that gA which contains the identity) since G = ∪g gA,
and so, the orbit is the set of left cosets of that group. Suppose the number of
such orbits with exactly pn−r d elements is t, then |S| ≡ tpn−r d mod n−r+1
p n−r .
pn d
Although, one can prove by elementary number theory that pr ≡ p d
mod pn−r+1 , one could obtain this already from the above sentence as it is
valid for any G of order pn d and applying this to the corresponding cyclic
pn d
group, one gets t = 1. Hence, pr ≡ tp d ≡ pn−r d mod pn−r+1 . Thus,
n−r
Examples 3.10
1. The group of rotations of a cube which leave it invariant are
(I) 90 degree (clockwise or anti-clockwise) rotations about the axes joining
the centres of the opposite faces - there are 6 such;
(II) 180 degree rotations about each of the above axes - there are 3 such;
18
(III) 120 degree (clockwise or anti-clockwise) rotations about the axes join-
ing the opposite vertices - there are 8 such;
(IV) 180 degree rotations about the axes joining the midpoints of the oppo-
site edges - there are 6 such and;
(V) the identity.
2. The cyclic group Cn can be regarded as the group of permutations of the
vertices of a regular n-gon. That is, it is the subgroup of Sn generated by
an n-cycle (1, 2, · · · , n).
3. For n > 2, the dihedral group Dn is defined as the group of rotations of
the regular n-gon given by n rotations about an n-fold axis perpendicular to
the plane of the n-gon and reflections about the n two-fold axes in the plane
of the n-gon like the spokes of a wheel, where the angle between consecutive
spokes is 2π π
n or n according as n is odd or even. It has order 2n.
It can be regarded as a subgroup of Sn as follows. The n rotations corre-
sponding to the powers of σ = (1, 2, · · · , n) and the group Dn is the subgroup
19
of n.
(vi) An application of (ii) to ZZ2 yields the well-known partition identity
n−1
X
np(n) = σ(i)p(n − i) + σ(n).
i=1
Proof
(i) If ρ : G → Sn is any transitive representation, then the subset H := {g ∈
G : ρ(g)(1) = 1} is a subgroup of G, of index n. Conversely, if H ≤ G is a
subgroup of index n in G, the set G/H of left cosets has n elements and G
acts transitively on it. There are (n − 1)! ways to identify G/H with the set
{1, 2, · · · , n} where the coset H is identified
with 1. Thus, an = tn /(n − 1)!.
n−1
(ii) For each 1 ≤ k ≤ n, there are ways to choose the orbit of 1,
k−1
tk transitive actions on it, and hn−k actions on its complement. This proves
the relation
n−1
Xn − 1
hn = t h + tn .
k − 1 k n−k
k=1
20
2 a 0
subgroup H of finite index is of the form gZZ where g = ∈ M (2, ZZ),
c d
with a, d > 0 and 0 ≤ c < d. Clearly, the index of His ad. We claim
2 2 a1 0
finally that gZZ = hZZ for another matrix h = ∈ M (2, ZZ), with
c1 d1
a1 , d1 > 0 and 0 ≤ c1 < d1 if, and only if, g = h. To see this, suppose
g −1 h ∈ GL(2, ZZ). This gives a1 = a, d1 = d and d|(c1 − c). As 0 ≤ c1 , c < d,
we get c1 = c as well. Therefore, the numberof subgroups of index n is
n/d 0
the number of matrices of the form , with d|n and 0 ≤ c < d.
c d
Thus,P for each divisor d|n, there are exactly d subgroups of index. Therefore,
an = d|n d = σ(n).
(vi) We claim that hn (ZZ2 ) = n!p(n). The reason is as follows. x can be
arbitrarily
P chosen in Sn , P and y chosen in its centraliser CSn (x), so that
hn = x |CSn (x)| = |Sn | 1/|[x]| = |Sn ||[x]|1/|[x]| = |Sn |p(n) = n!p(n).
This yields us the identity
n−1
X
np(n) = σ(i)p(n − i) + σ(n).
i=1
Exercise 3.12
Let G be a group such that the set S of all torsion elements of G is a finite
set. Then, prove that S is a group.
§4 Nilpotent groups
Lemma 4.1
(i) A group G is nilpotent if, and only if, there exists a series
G = G0 ⊇ G1 ⊇ · · · ⊇ Gn = {1}
for some n where each Gi is normal in G and Gi−1 /Gi is contained in the
center of G/Gi .
(ii) The center of any nontrivial nilpotent group is nontrivial.
(iii) All p-groups are nilpotent.
(iv) Subgroups and quotient groups of nilpotent groups are nilpotent.
(v) B(n, Q) is not nilpotent for n ≥ 2 whereas its normal subgroup U (n, Q)
and the quotient group B(n, Q)/U (n, Q) are.
(vi) A group G is nilpotent if, and only if, G/Z(G) is nilpotent where Z(G)
is the center of G.
21
Proof
(i) If G is nilpotent, then by our definition, the lower central series
for some n. Evidently, C i+1 (G) C C i (G) and C i (G)/C i+1 (G) is contained
in the center of G/C i (G) by the very definition.
Conversely, let
G = G0 ⊇ G1 ⊇ · · · ⊇ Gn = {1}
for some n where each Gi is normal in G and Gi−1 /Gi is contained in the
center of G/Gi .
Note that C 0 (G) = G = G0 . We prove by induction that for any r ≤ n,
C r (G) ≤ Gr . If we assume C m (G) ≤ Gm for m < n, then C m+1 (G) =
[G, C m (G)] ≤ [G, Gm ] ≤ Gm+1 as Gm /Gm+1 is contained in the centre of
G/Gm+1 . Therefore, the inductive proof follows, and shows C n (G) = {1}.
(ii) The penultimate term C n−1 (G) in the lower central series
Clearly, each C i (G)N/N C G/N and let us check that any successive factor
C i (G)N/N
C i+1 (G)N/N
is contained in the centre of C i+1G/N
(G)N/N
. Now, [G/N, C i (G)N/N ] =
[G, C i (G)]N/N ≤ C i+1 (G)N/N . Thus, the above series shows that G/N is
nilpotent by (i).
(v) Call B = B(n, Q) and U = U (n, Q) for simplicity.
An easy matrix computation shows that [U, U ] ≤ U and that [U, U ] consists
of matrices which have the entries (1, 2), (2, 3), · · · , (n − 1, n) to be zero.
By induction, one can easily show that C r (U ) consists of matrices whose
(i, j)-th entries for i < j ≤ i + r − 1 are all zero. Hence, C n (U ) = {1}.
22
Clearly, B = T U with T ∩ U = {1} where T ≤ B is the subgroup of diagonal
matrices. Also, U = Ker(det : B → Q∗ ) is a normal subgroup. Since T is
abelian, it is nilpotent.
However, B is not nilpotent since [B, B] = U and [B, U ] = U by the same
computation as above.
(vi) We already know that quotient groups of nilpotent groups are nilpotent.
So, we assume that G/Z(G) is nilpotent and show that G must be nilpotent.
Using (i) for G/Z(G), we have normal subgroups Gi of G containing Z(G)
such that
where Gi−1 /Gi is contained in the center of G/Gi . Consider the series
Evidently, Gn /Gn+1 is contained in (in fact, equal to) the center of G/Gn+1 .
Thus, G is nilpotent.
Here is a general result interesting in its own right and to be used in the
next proof.
Prime factorisation in groups 4.2
Let G be any finite group and let g ∈ G. Write O(g) = pk11 · · · pkr r where
pi ’s are distinct primes. Then, prove that every element g ∈ G can be
uniquely expressed as a product g = g1 · · · gr where gi ’s commute pairwise
and O(gi ) = pki i for i = 1, · · · , r.
Proof.
Let n1 = O(g) k1
k1 . Then, since p1 6 |n1 , we may write ap1 + bn1 = 1 for some
p1
integers a, b.
k1
Take g1 = g bn1 and g2 = g ap1 .
Of course, g1 g2 = g2 g1 = g. We shall check that g1 , g2 have orders pk11 and
n1 respectively.
k1 k1
p
Now, g1 1 = g bn1 p1 = g bO(g) = 1; so O(g1 )|pk11 .
Moreover, if g1d = 1, then we get g bn1 d = 1; so O(g)|bn1 d. This gives pk11 |bd;
however (p1 , b) = 1 as seen from apk11 + bn1 = 1. Thus, pk11 |d which proves
that O(g1 ) = pk11 .
k1 k1
Similarly, g2n1 = g ap1 n1 = 1 so that n1 |O(g2 ). If g2l = 1, then g ap1 l = 1 so
that O(g) = pk11 n1 |apk11 l.
Hence n1 |l as (n1 , a) = 1. This implies that O(g2 ) = n1 .
Now, we may proceed by induction with g replaced by g2 . Ultimately, we
23
will have elements xi , i ≤ r which are powers of g and have orders pki i such
that g = x1 · · · xr .
Finally, we show that such an expression is unique. If g = y1 · · · yr is another
expression where yi has order pki i and commute pairwise, then they commute
with g. As xi ’s are powers of g, the xi ’s and the yj ’s all commute pairrwise.
So,
x−1 −1 −1
1 y1 = x2 · · · xr y2 · · · yr
has order dividing pk11 as seen from the left side and at the same time has
order dividing pk22 · · · pkr r as seen from the right side. Thus, x1 = y1 . Pro-
ceeding by induction, we get xi = yi for all i.
Lemma 4.3
For a finite group G, the following are equivalent:
(i) G is nilpotent,
(ii) every subgroup is subnormal,
(iii) every proper subgroup H is properly contained in NG (H),
(iv) all maximal subgroups are normal,
(v) all p-Sylow subgroups are normal,
(vi) elements of coprime order commute and,
(vii) G is the direct product of its Sylow subgroups.
Proof.
Let G be nilpotent and H ≤ G. Suppose
G ≥ C 1 ≥ C 2 ≥ · · · C n = {1}
be its lower central series. Since, we have that C i /C i+1 is contained in the
centre of G/C i+1 , it follows that
G = GH ≥ C 1 H ≥ · · · C n H = H
is a chain of subgroups where each term is normal in the previous one. Thus,
H is subnormal in G; that is, (i) implies (ii).
Now, suppose (ii) holds. Let H < G. We have a series
H = H0 C H1 C · · · Hr = G.
24
M for some maximal subgroup M . Since M is normal, we get for any g ∈ G,
gP g −1 is again a p-Sylow subgroup of gM g −1 = M . Thus, by Sylow’s second
theorem applied to M , there is some m ∈ M with gP g −1 = mP m−1 i.e.,
m−1 g ∈ NG (P ) ≤ M . Hence g ∈ M , a contradiction, since a maximal
subgroup, by definition, is a proper subgroup. Hence (iv) implies (v).
Assume (v) holds; that is, the Sylow subgroups are normal. We shall show
first that elements of orders powers of different primes commute. Let O(x) =
pr and O(y) = q s where p 6= q are primes. If P, Q are the unique p-Sylow
subgroup and the unique q-Sylow subgroup, then x ∈ P and y ∈ Q. As
xyx−1 y −1 ∈ P ∩ Q = {1}, we have xy = yx.
Now, we deal with the general case. We saw in 4.2 that in any finite group,
every element can be written as commuting elements of prime power orders
dividing the order of the element. Let g, h ∈ G be elements in our group
which have coprime orders m, n. Then, g = x1 · · · xr , h = y1 · · · , ys where
xi ’s commute among themselves, yj ’s commute among themselves and each
of them has prime power order. Also O(xi )|O(g) = m and O(yj )|O(h) = n
which are coprime. Thus, each xi and each yj commute. Hence gh = hg.
Thus, (v) implies (vi).
Suppose now that elements of coprime orders commute. We write
25
(iv) For any p-group G, Φ(G) = [G, G] < Gp >.
(v) For a finite abelian p-group A, A/Φ(A) is an elementary abelian group
of order pd(A) where d(A) is the minimal number of generators needed to
generate A.
Proof
(i) Let G =< S > and s ∈ S ∩ Φ(G). It suffices to show that s ∈< S \ {s} >.
Now, if H =< S \ {s} >< G, then there is some maximal subgroup M of G
containing H. But then S \ {s} is contained in M as well as s ∈ M . This
means that G =< S >≤ M , which is a contradiction. Thus, we have shown
that elements of Φ(G) can be dropped from any generating set for G.
Conversely, suppose g ∈ G be such that whenever < S ∪ {g} >= G, we have
< S >= G. Let, if possible, M be a maximal subgroup not containing g.
Then, < M ∪ {g} >= G. By the hypothesis, this gives < M >= M = G, a
contradiction. Hence all maximal subgroups contain g and (i) is proved.
(ii) Since Φ(G) is the intersection of all maximal subgroups of G, and since
any automorphism permutes maximal subgroups of G, it follows that Φ(G)
is a characteristic subgroup of G.
(iii) By the previous problem, it suffices to prove that the p-Sylow subgroups
of Φ(G), for any p, are normal in it. By the Frattini argument (problem 15
(ii)), we have G = Φ(G)NG (P ).
In particular, NG (P ) ∪ Φ(G) generate G which implies that NG (P ) =<
NG (P ) >= G; that is, P C G.
In particular, P E Φ(G).
(iv) Let M1 · · · , Mr be the (proper) maximal subgroups of G. Consider
their images in the elementary abelian group G/[G, G] < Gp >. They are
maximal subgroups and every maximal subgroup of G/[G, G] < Gp > is
the image of one of these. Of course, two different Mi ’s may have the same
image. Note that the intersection of all maximal subgroups in an elementary
abelian p-group is trivial. This is so because an elementary abelian p-group
is isomorphic to ZZ/p × · · · ZZ/p, and in this group, the subproducts with one
factor being the trivial groups are maximal subgroups and intersect in the
identity. Therefore,
r
\
Φ(G) := Mi ≤ [G, G] < Gp > .
i=1
26
p) and so, [G, G] ≤ M . Thus, [G, G] ≤ Φ(G) as well. This proves that
27
(i) The function F (g) := g −1 T (g) is a bijection on G.
(ii) For any g in G, the product g T (g) T 2 (g) . . . T p−1 (g) is the identity.
(iii) |G| is congruent to 1 modulo p.
(iv) For any prime q dividing the order of G, there is a q-Sylow subgroup Q
of G such that T (Q) = Q.
(v) For any prime q, prove there is a unique q-Sylow subgroup Q of G such
that T (Q) = Q.
(vi) Let q be a prime. Then, the q-Sylow subgroup Q fixed by T contains
any q-subgroup of G fixed by T .
(vii) For p = 2, G is abelian.
(viii) For p = 3, G is nilpotent.
The analogue of (viii) was proved for any prime p by the Fields medalist
J.Thompson in his thesis.
Proof
(i) Now, x−1 T (x) = y −1 T (y) for some x, y if, and only if T fixes yx−1 ; that
is, when x = y. Thus, the map x 7→ x−1 T (x) is 1-1. Being a map of finite
sets, this is onto as well.
(ii) From (i), an arbitrary element of g can be written as g = x−1 T (x) so
that we get
gT (g) · · · T (g)p−1 = 1.
(vii) Taking p = 2, this gives T (g) = g −1 for all g. As T is an automorphism,
G is abelian.
(iii) Consider the subgroup Z of Aut(G) generated by T . This is a cyclic
group of order p acting on G. Each orbit has cardinality dividing p and,
as T fixes only the identity element, each orbit other than that of 1 has p
elements. So, |G| has 1 mod p number of elements.
(iv) Look at the action of Z on the set S of all q-Sylow subgroups of G.
Again, the orbits which are not fixed points under Z, have order p. So, if
there is no fixed point in S, then the number of elements in S would be
a multiple of p. But, we know that the number of elements in S (i.e. the
number of q-Sylow subgroups) is a divisor of |G|. Thus, p would divide |G|,
a contradiction of (iii).
(v) Note that if T (Q) = Q, then T (N ) = N where N is the normaliser of
Q in G. Now, if T fixes another q-Sylow subgroup gQg −1 , then rewriting
T (gQg −1 = gQg −1 , we have g −1 T (g) belongs to N . But, applying (i) to
N itself, any element of N is of the form x−1 T (x) for some x in N . So,
g −1 T (g) = x−1 T (x), which gives g = x belongs to N . So, gQg −1 = Q.
(vi) Let R be any q-group in G which is fixed by T . Let S be a q-group which
contains R and is maximal with respect to this property (i.e., S is fixed by
28
T , contains R and is not contained in a strictly larger q-group which is T -
fixed). We claim that R = Q. For this, first look at the normaliser N (S)
of S in G. Since T (S) = S, we have also T (N (S)) = N (S) clearly. by
(v) applied to the subgroup M of N (S) which is T -fixed. Now, S is a q-
subgroup of N (S), say, yM y −1 where y is in N (S). But then S = y −1 Sy
is contained in M . By maximality property of S, we get S = M i.e., S is
a q-Sylow subgroup of N (S). But, any Sylow subgroup of a subgroup in
any group is the intersection of the subgroup with a Sylow subgroup of the
big group. So, we have a q-Sylow subgroup L of G such that S = N (S)
intersection L. In other words, S = NL (S), the normaliser of S in L. But,
in a q-group (indeed, in any nilpotent group), the normaliser of a proper
subgroup contains the subgroup properly. So, we have S = L. In other
words, S is a q-Sylow subgroup of G itself. By uniqueness of the T -fixed
q-Sylow subgroup of G itself. Thus, we have shown that R is contained in
S = P.
(viii) Finally, we prove the result for p = 3.
Now x T (x) T 2 (x) = e for any x in G. So, T (x−1 x−1 = (x T (x))−1 = T 2 (x)
from the above. Putting y = x−1 , we get T (y)y = T 2 (y −1 ) = (T 2 (y))−1 .
Thus, both yT (y) and T (y)y are equal to (T 2 (y))−1 . In other words, every
element x in G commutes with the element T (x). Similarly, x commutes
with T 2 (x) also.
Now, to prove the result, let us consider any prime q and the unique q-Sylow
subgroup Q of G which is T -fixed. Let g be any element of G which has
q-power order. Consider the subgroup Q0 is generated by the three elements
g, T (g) and T 2 (g). Clearly, Q0 is T -fixed. Also, since g, T (g), T 2 (g) commute
among themselves, Q0 is a q-group. Therefore, by (vi), Q0 is contained in Q.
Thus, Q contains all elements of G of q-power order. Hence, Q is the unique
q-Sylow subgroup of G. So, it is normal. This proves the result for p = 3.
§5 Solvable groups
Lemma 5.1
(i) A group G is solvable if, and only if, there exists a series
G = G0 ⊇ G1 ⊇ · · · ⊇ Gn = {1}
29
Proof
(i) Evidently, the derived series is a series as asserted and, so any solvable
group has such a series.
Conversely, suppose G admits a finite series as above. We prove by induction
that Dr (G) ≤ Gr for all r. That would prove Dn (G) = {1}. The assertion
holds for r = 0. Supposing Dr (G) ≤ Gr , we have
N = N0 ⊇ N1 ⊇ · · · ⊇ Nr = {1}
and
G/N = G0 /N ⊇ G1 /N ⊇ · · · ⊇ Gs /N = {1}
be series for N and G/N as in (i). But then
G = G0 ⊇ · · · ⊇ Gs = N ⊇ N1 · · · Nr = {1}
§6 Matrix groups
30
As we pointed out earlier, the usefulness of a group in a situation depends
on its avataar. A group is especially useful when it appears as a subgroup
of GL(n, CI ).
Lemma 6.1
SU (2) ∼
= H 1 , the group of unit quaternions; that is, the quaternions a + bi +
cj + dk which satisfy a2 + b2 + c2 + d2 = 1.
Further, there exists a homomorphism from SU (2) to SO(3) whose kernel
is {±I2 }.
Proof.
a + ib c + id
Any element of SU (2) looks like with a, b, c, d ∈ R such
−c + id a − ib
that a2 + b2 + c2 + d2 = 1.
This reminds us of Hamilton’s quaternions. Therefore, let us define the map
a + ib c + id
θ : SU (2) → H ; 7→ a + ib + cj + dk.
−c + id a − ib
Note that the image is contained in the group H 1 of ‘unit’ quaternions; that
is, a + ib + cj + dk for which a2 + b2 + c2 + d2 = 1 or, equivalently,
(a + ib + cj + dk)−1 = a − bi − cj − dk.
ρ : SU (2) → GL(V ).
To explicitly evaluate
it, we usethe isomorphism θ to view SU (2) as H 1 .
a + ib c + id
For any g = ∈ SU (2), we may compute qiq −1 , qjq −1
−c + id a − ib
and qkq −1 where q = θ(g)a + bi + cj + dk. We arrive at the following matrix
with respect to the ordered basis {i, j, k} :
2
a + b2 − c2 − d2
2(bc − ad) 2(ac + bd)
ρ(g) = 2(ad + bc) a2 − b2 + c2 − d2 2(cd − ab) .
2(bd − ac) 2(ab + cd) 2 2
a −b −c +d 2 2
31
Note that if g ∈ Kerρ, then the corresponding quaternion q commutes with
i, j, k and, hence, with the whole of H.
In particular, this gives g ∈ Z(SU (2)) = {±I}.
Of course, it is clear that −I does indeed belong to Ker ρ.
The final assertion left is to show that the determinant of ρ(g) is always 1
is proved by direct (but messy) calculation. A better way would be to use a
little bit of topology which shows that SU (2) is connected, ρ on SU (2) and
the determinant function on GL(3, R) are continuous, that the image of det
◦ρ is a connected subset of R contained in {±1} and containing I.
Bruhat lemma 6.2
For any field K, GL(n, K) is the disjoint union of double cosets BwB over
permutation matrices w. Here B := B(n, K), the group of invertible upper
triangular matrices with entries from K.
Minkowski lemma and more 6.3
(i) For any odd prime number p, an n×n integral matrix A 6= Id with entries
congruent to the corresponding entries of the identity matrix modulo p is
of infinite order. In other words, the ‘principal congruence subgroup’ Ker
(GL(n, ZZ) → GL(n, ZZ/p) is torsion-free for an odd prime p.
(ii) For p = 2, any matrix A in Ker (GL(n, ZZ) → GL(n, ZZ/p)) of finite
order, has order 1 or 2 and is expressible as
α−1 |α| + 1 2
| |≤ ≤ < 1.
p p p
Thus, all the roots of χB (t) have absolute values < 1. Since χB (t) is an
integral polynomial whose top coefficient is 1, this implies that χB (t) = tn ;
that is, all the eigenvalues of B are zero. In other words, all eigenvalues of
A are 1. As A has finite order, it must be the identity matrix.
Proposition 6.4
0 −1 1 1
(i) The matrices S = and X = generate SL(2, ZZ).
1 0 0 1
32
(ii) P SL(2, ZZ) is isomorphic to the free product ZZ/2 ∗ ZZ/3.
(iii) SL(2, ZZ) =< x, y|x2 y −3 , x4 >.
(iv) The abelianisation of SL(2, ZZ) is the cyclic group of order 12.
Proof
(i) We shall prove, equivalently, −1 −1
that S and A := S X generate SL(2, ZZ).
1 0
Note that AS = Y := .
1 1
a b
Now, start with any g = ∈ SL(2, ZZ). We shall show that left
c d
and right mutliplications by powers of X and Y lead to ±I by the usual
Euclidean division algorithm.
l a + lc b + ld
For any integer l, we have X g = . This shows us that one
c d
can divide a by c and replace a by its residue mod c.
Similarly, one can see that by left multiplication by some Y l , one can reduce
c mod a. Repeating these finitely many times, the division algorithm implies
that one of a and c becomes zero; the other has to be ±1 as the determinant
is always 1.
0 ±1 ±1 b
So, g becomes g1 = or g2 = .
∓1 d 0 ±1
Now,
±d 0 ±1
g1 X = = S ∓1
∓1 0
and g2 = X b or −X −b .
Since −I = S 2 , the assertion follows.
(ii) Since S 2 = −I also represents the identity element in P SL(2, ZZ), the
image s of S has order 2 in P SL(2,ZZ).
0 −1
Also, the image b of B := SX = in P SL(2, ZZ) has order 3 as
1 1
(SX)3 = −I.
We know that the elements s, b generate the whole group; so, we need only
show that no matrix
SB a1 SB a2 · · · SB ar
with each ai either 1 or 2, can be the matrices I, −I.
Since SB = −X and SB 2 = Y , it follows that
any word in the positive
2 a b
powers of SB and SB is a matrix g = in which a, b, c, d are of the
c d
same sign. Therefore, if b 6= 0, then the corresponding entry −b − d of SBg
and b of SB 2 g are non-zero as well. Similarly, if c 6= 0, the corresponding
entries of SBg and SB 2 g are nonzero. Since SB and SB 2 have the property
33
that either the (1, 2)-th entry or the (2, 1)-th entry is non-zero, any word g
in their positive powers has this property; hence g can not be the identity
matrix.
(iii) follows from (ii) by sending x to S and y to SX.
(iv) Now,
§7 Miscellaneous results
Lemma 7.1
The groups C I ∗ and S 1 are isomorphic.
Proof.
First of all, the polar co-ordinates provide an isomorphism C∗ ∼
= R>0 ×S 1 ∼
=
1 1
R × S . So, it suffices to show that R × S = S .∼ 1
34
Ore’s covering lemma 7.2
Let H, K ≤ G be subgroups of the same finite index. Then, there exist
g1 , · · · , gn such that
G = tni=1 gi H = tni=1 Kgi .
Even the case H = K is interesting.
Proof.
Writing G as a union of the double cosets KgH, it is necessary and sufficient
to write each double coset in the form asserted since each double coset KgH
is a union of right cosets of K in G and also a union of left cosets of H in
G.
We note the two bijections :
say
ki (K ∩ gHg −1 ) 7→ (H ∩ g −1 Kg)hi
where i = 1, · · · , r, then we see that
35
and g −1 Kg of G. Thus, it suffices to show that the subgroup g −1 Kg ∩ H
has finite index. But, this is true since both H and g −1 Kg do.
Frobenius’s theorem 7.3
Let G be a finite group and, for any natural number n, denote by fn (G) the
cardinality of the set {x ∈ G : xn = 1}. Then, :
(i) The number of elements of order n is a multiple of φ(n). In particular,
for a prime p such that pk k|G|, one has fpk (G) − fpr (G) ≡ 0 mod pr for all
r < k. More particularly, if fpk (G) ≡ 0 mod pk , then fpr (G) ≡ 0 mod pr for
all r < k.
(ii) Let pk kn and let Q be a set of representatives of conjugacy classes of ele-
k
ments y such that y |G|/p = 1. Then, fn (G) = y∈Q [G : CG (y)]fpk (CG (y)).
P
36
X
= |Q ∩ Z(G)|fpk (G) + [G : CG (y)]fpk (CG (y)).
y∈Q\Z(G)
Look at any of the proper subgroups CG (y) for y ∈ Q\Z(G). If p` k|CG (y)|,
with ` ≤ k, then [G : CG (y)] ≡ 0 mod pk−` and, p` |fp` (CG (y)) by induction
hypothesis.
Note fp` (CG (y)) = fpk (CG (y)). Therefore each term in the sum above is a
multiple of pk . As pk ||G| we have pk ||Q ∩ Z(G)|fpk (G). But p 6 ||Q ∩ Z(G)|
for, if it did then it would have an element y of order p which would also
n
satisfy y pk = 1. This is a contradiction, as ( pnk , p) = 1. Therefore pk |fpk (G).
(iv) Let n||G|. We write = pk11 . . . pkr r for distinct primes pi . Then, it suffices
to show that pki i /fn (G). Now, by (ii), fn (G) is expressible as a sum of terms
of the form [G : H]fpki (H) for subgroups H. By (iii), each such H (including
i
G itself) satisfies pri /fpri (H) where pri k|H|. Therefore pki i = pki i −r . pri divides
[G : H]fpri (H) ∀ H. Thus pki i divides the sum which is fn (G).
G = hG | x.y.(xy)−1 : x, y ∈ Gi.
S3 = hr, s | r3 , s2 , srsri.
Dn = hr, s | rn , s2 , (sr)2 i.
8. Let D∞ be the infinite dihedral group consisting of the
motions of IR which map the integers to integers, i.e. the transformations
IR → IR, x 7→ ±x + k, with k ∈ ZZ.
37
D∞ = hs, t | s2 , ststi.
K1 = H, K2 = K1 x1 , K3 = K2 x2 , . . . , Kn = Kn−1 xn−1
a−1 2
1 a2 a1 = a2 ;
38
a−1 2
2 a3 a2 = a3 ,
a−1 2
3 a1 a3 = a1
is the identity group.
14. The group G generated by a1 , a2 , a3 , a4 subject to the defining relations
a−1 2
1 a2 a1 = a2 ;
a−1 2
2 a3 a2 = a3 ,
a−1 2
3 a1 a3 = a4 ,
a−1 2
4 a1 a4 = a1
has no proper normal subgroup of finite index. Since every finitely generated
group has at least one maximal normal subgroup, it follows that there exists
a finitely generated infinite simple group, namely the quotient group G/N
where N is any maximal normal subgroup of G.
15. Let p be an odd prime. The subgroup Gp of the symmetric group on R
generated by x 7→ x + 1 and x 7→ xp is free of rank 2. This was proved by
S.White in J.Algebra 118 (1988) but a more transparent proof by S.D.Cohen
& A.M.W.Glass appears in Journal of the London Math. Society 55 (1997).
39
The idea is to slide the coins utilising the empty square and to find out
what kind of arrangements are possible. Sam Lloyd offered 1000 dollars to
anyone who could get the arrangement with 14 & 15 switched above. He
knew his money would be safe with him. Analyse how to decide whether a
given position of coins in 15 of the 16 places is ‘good’ (i.e., can be brought
back to the beginning position by the sliding process. Do this for a square
of general size.
40
To construct G, one starts with a presentation < Xi |Ri > of each Gi and
takes < X|R > as a presentation of G where X is the disjoint union of the
Xi ’s and R is the union of the Ri ’s. The homomorphisms φ : Gi → G are,
therefore, simply inclusions. The uniqueness of such a free product G up to
isomorphism follows from the uniqueness property of φ above.
One writes G = ∗i∈I Gi . If I is a finite set, say, I = {1, 2, · · · , n}, then it is
customary to write G = G1 ∗ G2 ∗ · · · ∗ Gn .
For example,
The free group of rank r is the free product ZZ ∗ · · · ∗ ZZ of r copies of ZZ.
More generally, a free group F (X) on a set X is the free product ∗x∈X < x >.
The group P SL(2, ZZ) is the free product of a cyclic group of order 2 and a
cyclic group of order 3.
Recall that if A is a group, Gi , i ∈ I is a family of groups and αi : A → Gi
(i ∈ I) are injective homomorphisms, then a group G is said to be the
free product of Gi ’s amalgamated along A, if there are homomorphisms
φi : Gi → G satisfying φi ◦ αi = φj ◦ αj for all i, j ∈ I such that the fol-
lowing universal property holds: for every group H and homomorphisms
θi : Gi → H with θi ◦ αi = θj ◦ αj for all i, j ∈ I, there is a unique homo-
morphism θ : G → H with φ ◦ φi = θi .
One denotes G by ∗A Gi if there is no confusion as to what the maps αi are.
Sometimes, the maps αi are taken to be not necessarily injective and still
the above definition can be carried out. Note that, if αi are trivial, then
∗A Gi = ∗Gi , the free product.
The construction is as follows. If Gi =< Xi |Ri >, i ∈ I, then
G :=< ti∈I Xi | ∪ Ri ∪ ∪i,j Rij >
where Rij = {αi (a)αj (a)−1 ; a ∈ A} > .
The uniqueness of G up to isomorphism is clear once again by the unique-
ness of θ.
For instance, SL(2, ZZ) = ZZ/4 ∗ZZ/2 ZZ/6.
The fundamental group of the Möbius strip is isomorphic to ZZ ∗2ZZ ZZ. A
free product with amalgamation could be the trivial group even if the groups
αi (A) are not.
For example, let α1 : ZZ → P SL(2, Q I ) be an injective homomorphism and
let α2 : ZZ → ZZ/2 be the natural homomorphism. Then, G1 ∗ZZ G2 = {1}.
Finally, we recall the notion of HNN extensions named after G.Higman,
B.H.Neumann & H.Neumann. The construction is akin to adjoining ele-
ments to fields to get field extensions.
Let G =< X|R > be a group and let A be a subgroup. For an injective
homomorphism φ : A → G, the HNN extension of G with respect to φ is the
41
group G∗ =< X ∪ {t}|R ∪ {tat−1 φ(a)−1 } >.
It is a fact that G∗ is independent of the presentation of G chosen and that
G embeds naturally into G∗ . It can be shown that, given two elements a, b of
equal order in a group G, this construction enables one to embed the group
G into a bigger group in which a, b are conjugate. The HNN construction
also finds a natural topological interpretation.
For, suppose V and W are open, path-connected subspaces of a path-
connected space X and suppose that there is a homeomorphism between
V and W inducing isomorphic embeddings of π1 (V ) and π1 (W ) in π1 (X).
One constructs a space Y by attaching the handle V ×[0, 1] to X, identifying
V × {0} with V and V × {1} with W . Then, the fundamental group π1 (Y )
of Y is the HNN extension of π1 (V ) relative to the isomorphism between its
subgroups π1 (V ) and π1 (W ).
Claim: τ is surjective.
Coset maps: For i ∈ I and u ∈ F associate an element uWi ∈ F −1 as
follows:
F −→ F̂ u 7−→ uWi
as follows;
Wi −1
1Wi = 1, xWi = yix , and (x−1 ) = (xWi x )−1 (x ∈ X).
42
Generally, if u = vy in reduced form with y ∈ X ∪ X −1 , define uWi by
induction on the length of u by means of the equation
uWi = v Wi y Wi v .
χ = τ ψ,
an endomorphism of F̂ .
The group W has a presentation τ : F̂ −→ W with generators yix and
−1 χ
defining relators yix yix (i ∈ I, x ∈ X).
The elements uW , where u runs over the set of nontrivial elements in the
transversal, form a set of defining relators for the presentation τ : F̂ −→ W .
A transversal for W is called a Schreier transversal if it has the following
property:
43
transversal to W in F .
Example 8.5
Consider the free group F = hx, y | φi. The homomorphism ϕ : F −→
ZZn = {0, 1, 2, . . . , n − 1}, n ≥ 2 defined by x, y 7→ 1 and let U be the kernel
of ϕ. As coset representatives we take 1, y, y 2 , . . . , y n−1 ; clearly this system
satisfies the Schreier property. Then the Reidemeister-Schreier generators
for U are the non-trivial elements of the following set:
{y i .x.(y i x)−1 , y i .y(y i+1 )−1 | i = 0, 1, 2, . . . , n − 1}
We obtain the non-trivial elements
y n , xi = y i .x.y −(i+1)
for i = 0, . . . , n − 2 and xn−1 = y n−1 .x. The rank of this group is n + 1.
Note that this shows that: The free group of rank 2 contains a free group of
rank n as a subgroup of index n − 1.
Exercise 8.6
1. Show that the modular group P SL(2, ZZ) contains a free subgroup of
rank 2 and index 6.
2. If F is a free group of rank two, then prove that the commutator
subgroup of F is an infinitely generated free group.
3. Prove that the group
a b p
Γ={ ∈ GL(2, IR) | a = 2n and b = q f or some n, p, q ∈ ZZ}
0 1 2
2 0 1 1
is generated by the two matrices and , while
0 1 0 1
1 b p
Γ0 = { ∈ GL(2, IR) | b = q f or some p, q ∈ ZZ}
0 1 2
is not finitely generated.
44