Notes On The Symmetric Group
Notes On The Symmetric Group
H = {σ ∈ Sn : σ({1, . . . , n1 }) = {1, . . . , n1 }}
σ({n1 + 1, . . . , n2 }) = {n1 + 1, . . . , n2 },
                                          1
Note that #(Dn ) = 2n, and hence that Dn is a proper subgroup of Sn if
n ≥ 4. We shall see (Cayley’s theorem) that, if G is a finite group, then
there exists an n such that G is isomorphic to a subgroup of Sn (and in
fact one can take n = #(G)). Thus the groups Sn are as complicated as all
possible finite groups.
    To describe a function σ : {1, . . . , n} → {1, . . . , n}, not necessarily a per-
mutation, we can give a table of its values, recording i and then σ(i), as
follows:
                         i      1           2    ...       n
                        σ(i) σ(1) σ(2) . . . σ(n)
    Of course, we could describe the same information by a 2 × n matrix:
                                                       
                             1       2       ...    n
                                                            .
                            σ(1) σ(2) . . . σ(n)
The condition that σ is a permutation is then the statement that the integers
1, . . . , n each occur exactly once in the second row of the matrix. For
example, if σ is given by the matrix
                                                 
                            1 2 3 4 5 6 7 8
                                                     ,
                            2 4 8 1 5 7 3 6
then σ(1) = 2 and σ(5) = 5.
     This notation is however cumbersome and not well suited to calculation.
We describe a much more efficient way to write down elements of Sn by
first writing down some special ones, called cycles, and then showing that
very element of Sn can be factored into a product of cycles, in an essentially
unique way if we are careful.
Definition 1.1. Let {a1 , . . . , ak } be a subset of {1, . . . , n} with exactly k
elements; equivalently, a1 , . . . , ak are distinct. We denote by (a1 , . . . , ak ) the
following element of Sn :
                     (a1 , . . . , ak )(ai ) = ai+1 ,      if i < k;
                    (a1 , . . . , ak )(ak ) = a1 ;
                      (a1 , . . . , ak )(j) = j,        if j 6= ai for any i.
We call (a1 , . . . , ak ) a k-cycle or cycle of length k. For k > 1, we define
the support Supp(a1 , . . . , ak ) to be the set {a1 , . . . , ak }. Note that, again
for k > 1, the k-cycle (a1 , . . . , ak ) moves i ⇐⇒ i ∈ Supp(a1 , . . . , ak ). Two
cycles (a1 , . . . , ak ) and (b1 , . . . , b` ) are disjoint if
                       Supp(a1 , . . . , ak ) ∩ Supp(b1 , . . . , b` ) = ∅,
i.e. the sets {a1 , . . . , ak } and {b1 , . . . , b` } are disjoint.
                                                   2
Remark 1.2. 1) A 1-cycle (a1 ) is the identity function 1, no matter what
a1 is. For this reason, we will generally only consider cycles of length at
least 2. If σ = (a1 , . . . , ak ) with k ≥ 2, then σ is never the identity, since
σ(a1 ) = a2 6= a1 .
2) A 2-cycle (a1 , a2 ) is also called a transposition. It is the unique permu-
tation of {1, . . . , n} which switches a1 and a2 and leaves all other elements
alone.
3) From the description in 2), it is clear that (a1 , a2 ) = (a2 , a1 ). But for
k ≥ 3, the order of the elements a1 , . . . , ak is important: for example σ1 =
(1, 3, 2) 6= σ2 = (1, 2, 3), because σ1 (1) = 3 but σ2 (1) = 2.
4) However, there are a few ways to change the order without changing the
element of Sn : clearly
In other words, you can start the cycle anywhere, at ai , say, but then you
have to list the elements in order: the next one must be ai+1 , and so on, with
the understanding that once you reach ak , the next one has to be a1 , then
a2 , and then so on up to ai−1 . Clearly, this the only way you can change
the order. By convention, we often start with the smallest ai . Of course,
after that, there is no constraint on the sizes of the consecutive members of
the cycle.
5) It is easy to see that the inverse (a1 , a2 , . . . , ak )−1 = (ak , ak−1 , . . . , a1 ). In
other words, the inverse of the k-cycle (a1 , a2 , . . . , ak ) is the k-cycle where the
elements are written in the opposite order. In particular, for a transposition
(a1 , a2 ), (a1 , a2 )−1 = (a2 , a1 ) = (a1 , a2 ), i.e. a transposition has order 2.
6) Generalizing the last line of 5), it is easy to see that the order of a
k-cycle σ = (a1 , a2 , . . . , ak ) is exactly k. In fact, σ(ai ) = ai+1 , so that
σ r (ai ) = ai+r , with the understanding that the addition i + r is to be taken
mod k, but using the representatives 1, . . . , k for addition mod k instead of
the more usual ones (in this course) of 0, . . . , k − 1. In particular, we see
that σ r (ai ) = ai for all i ⇐⇒ r is a multiple of k, and since σ r (j) = j for
j 6= ai , we see that k is the smallest positive integer r such that σ r = 1.
     Note however that, if σ is a k-cycle, its powers σ r need not be k-cycles.
For example,
                               (1, 2, 3, 4)2 = (1, 3)(2, 4),
and (1, 3)(2, 4) is not a k-cycle for any k.
                                                      3
7) Suppose that σ1 = (a1 , . . . , ak ) and σ2 = (b1 , . . . , b` ) are two disjoint cy-
cles. Then it is easy to see that σ1 σ2 = σ2 σ1 , i.e. “disjoint cycles commute.”
To check this we have to check that, for all j ∈ {1, . . . , n}, σ1 σ2 (j) = σ2 σ1 (j).
First, if j = ai for some i, then σ1 (ai ) = ai+1 (with our usual conventions
on adding mod k) but σ2 (ai ) = ai since ai 6= br for any r. For the same
reason, σ2 (ai+1 ) = ai+1 . Thus, for all i with 1 ≤ i ≤ k,
whereas
and similarly σ2 σ1 (j) = j. Thus, for all possible j ∈ {1, . . . , n}, σ1 σ2 (j) =
σ2 σ1 (j), hence σ1 σ2 = σ2 σ1 .
    Note that non-disjoint cycles might or might not commute. For example,
whereas
for every j ∈ {1, . . . , n}. First, if j is of the form σ(ai ) for some i, then by
definition (σ(a1 ), . . . , σ(ak ))(σ(ai )) = σ(ai+1 ), with the usual remark that if
i = k then we interpret k + 1 as 1. On the other hand,
                                                4
Thus both sides agree if j = σ(ai ) for some i. On the other hand, if j 6= σ(ai )
for any i, then (σ(a1 ), . . . , σ(ak ))(j) = j. But j 6= σ(ai ) ⇐⇒ σ −1 (j) 6= ai ,
so
Hence σ·(a1 , . . . , ak )·σ −1 (j) = (σ(a1 ), . . . , σ(ak ))(j) for every j ∈ {1, . . . , n},
proving the formula.
Example 1.5. For the group S4 , the only possibilities for products of dis-
joint cycles are: 1) the identity 1; 2) a transposition, i.e. a 2-cycle; 3) a
3-cycle; 4) a 4-cycle; 5) a product of two disjoint 2-cycles.          There is just
                                                             4
one identity 1. The number of transpositions is                  = 6. For a 3-cycle
                                                             2
(a1 , a2 , a3 ), there are 4 choice for a1 , then 3 choices for a2 , then 2 choices for
a3 , giving a total of 4 · 3 · 2 = 24 choices for the ordered triple (a1 , a2 , a3 ).
However, as an element of S4 , (a1 , a2 , a3 ) = (a2 , a3 , a1 ) = (a3 , a1 , a2 ), so the
total number of different elements of S4 which are 3-cycles is 24/3 = 8. Like-
wise, the total number of 4-cycles, viewed as elements of S4 , is 4·3·2·1/4 = 6.
Finally, to count the number of products (a, b)(c, d) of two disjoint 2-cycles,
note that, as above, there are 6 choices for (a, b). The choice of (a, b) then
determines c and d since {c, d} = {1, 2, 3, 4} − {a, b}. But since (a, b)(c, d) =
(c, d)(a, b), we should divide the total number by 2, to get 6/2 = 3 elements
                                                 5
of S4 which can be written as a product of two disjoint 2-cycles. As a check,
adding up the various possibilities gives 1 + 6 + 8 + 6 + 3 = 24, as expected.
    In this notation, D4 is the subgroup of S4 given by
D4 = {1, (1, 2, 3, 4), (1, 3)(2, 4), (1, 4, 3, 2), (2, 4), (1, 3), (1, 2)(3, 4), (1, 4)(2, 3)}.
   The proof of the theorem gives a procedure which is far easier to under-
stand and implement in practice than it is to explain in the abstract. For
example, suppose we are given a concrete permutation corresponding to a
2 × n matrix, for example
                                                   
                         1 2 3 4 5 6 7 8 9
                    σ=                                ,
                         3 6 5 7 1 4 9 8 2
here is how to write σ as a product of disjoint cycles of lengths at least 2:
beginning with 1, we see that σ(1) = 3, σ(3) = 5, σ(5) = 1, so we have
returned to where we started. Write down the cycle (1, 3, 5). Searching for
elements not in the support of (1, 3, 5), we see that the first such is 2. Then
σ(2) = 6, σ(6) = 4, σ(4) = 7, σ(7) = 9, σ(9) = 2 so that we are again where
we started. Write down the cycle (2, 6, 4, 7, 9). Search again for elements
not in the support of (1, 3, 5) or (2, 6, 4, 7, 9). The only remaining element
is 8, and σ(8) = 8. As we omit 1-cycles, we are left with the factorization
                  σ = (1, 3, 5)(2, 6, 4, 7, 9) = (2, 6, 4, 7, 9)(1, 3, 5).
To describe this procedure in general, we introduce a new idea.
Definition 1.6. Let σ ∈ Sn . We define a relation ∼σ on {1, . . . , n} as
follows: for i, j ∈ {1, . . . , n}, i ∼σ j if there exists an r ∈ Z such that
σ r (i) = j. The orbit of i under σ is the set
                               Oσ (i) = {σ r (i) : r ∈ Z}.
    Before giving examples, we note the following:
Proposition 1.7. The relation ∼σ is an equivalence relation and the equiv-
alence class [i] of i for ∼σ is the orbit Oσ (i).
Proof. Clearly i = σ 0 (i), so that i ∼σ i. Hence ∼σ is reflexive. If i ∼σ j, then
by definition there exists an r ∈ Z such that σ r (i) = j. Thus i = σ −r (j),
so that j ∼σ i and ∼σ is symmetric. To see that it is transitive, suppose
that i ∼σ j and j ∼σ k. Thus there exist r, s ∈ Z such that σ r (i) = j and
σ s (j) = k. Then σ r+s (i) = σ s (σ r (i)) = σ s (j) = k. Thus i ∼σ k and so ∼σ
is transitive, hence an equivalence relation. By definition, the equivalence
class [i] is Oσ (i).
                                             6
    By general facts about equivalence relations, the orbits Oσ (i) partition
{1, . . . , n} into disjoint subsets, in the sense that every integer j ∈ {1, . . . , n}
is in exactly one orbit Oσ (i). Note that Oσ (i) = {i} ⇐⇒ σ fixes i. For
example, the identity permutation has the n orbits {1}, {2}, . . . , {n}. A k-
cycle (a1 , . . . , ak ) with k ≥ 2 has one orbit {a1 , . . . , ak } with k elements, and
the remaining orbits are one element orbits {i} for the i ∈ {1, . . . , n} such
that i 6= ar for any r. There are then n − k one element orbits and one orbit
with k elements, for a total of n − k + 1 orbits. For another example, given
the σ ∈ S9 described above, σ = (1, 3, 5)(2, 6, 4, 7, 9), the orbits of σ are
{1, 3, 5}, {2, 4, 6, 7, 9}, and {8}. More generally, if σ is a product of disjoint
cycles ρ1 · · · ρ` of lengths at least 2, then the orbits Oσ (i) are the supports of
the cycles ρi in addition to one element orbits for each i ∈ {1, . . . , n} which
is not in the support of any ρi .
    Note that, while σ determines the orbits, the orbits do not completely
determine σ. For example, σ 0 = (1, 5, 3)(2, 9, 7, 6, 4) has the same set of
orbits as σ. On the other hand, the orbits do determine the “shape” of σ, in
other words they tell us in this case that σ is a product of a disjoint 3-cycle
and a 5-cycle, and they tell us that the support of the 3-cycle is {1, 3, 5} and
the support of the 5-cycle is {2, 4, 6, 7, 9}.
    Let us make two remarks about the equivalence relation ∼σ above. First,
since Sn is finite, every element σ has finite order, say σ d = 1 with d > 0.
Then σ d−1 = σ −1 , so we can write every negative power of σ as a positive
power as well. Thus
                                 Oσ (i) = {σ r (i) : r ∈ N}.
Second, given i ∈ {1, . . . , n}, the orbit of i is the set {i, σ(i), σ 2 (i), . . . } by
the above discussion. Inductively, set a1 = i and define ak+1 = σ(ak ), so that
ak = σ k−1 (i). Thus the orbit Oσ (i) is just {a1 , a2 , . . . }, with σ(ak ) = ak+1 .
Note that the set {a1 , a2 , . . . } is finite, so there have to exist r, s ≥ 1 with
r 6= s and ar = as . Let k ∈ N be the smallest integer ≥ 1 such that ak+1 = a`
for some ` with 1 ≤ ` ≤ k, i.e. ak+1 is the first term in the sequence which
is equal to a previous term. Equivalently, k is the largest positive integer
such that a1 , . . . , ak are all distinct. We claim that ak+1 = a1 = i, in other
words that the sequence starts to repeat only when we come back to the
starting point a1 = i. For suppose that ak+1 = a` with 1 ≤ ` ≤ k. Then
σ k (i) = σ `−1 (i), so that σ k−`+1 (i) = i. Hence ak−`+2 = a1 , in other words
there is a repetition at stage k − ` + 2. Since k + 1 was the smallest positive
integer greater than 1 for which some repetition occurs, k − ` + 2 ≥ k + 1, so
that ` ≤ 1. But also ` ≥ 1, so that ` = 1 and ak+1 = a1 , as claimed. Once
we have ak+1 = a1 , i.e. σ k (i) = i, then ak+2 = σ(i) = a2 , ak+3 = a3 , . . . . In
                                           7
general, given r ∈ Z, writing r = kq + ` with 0 ≤ ` ≤ k − 1, it follows that
                σ r (i) = σ ` (σ kq (i)) = σ ` ((σ k )q (i)) = σ ` (i) = a`+1 ,
where 1 ≤ ` + 1 ≤ k, and hence that
               Oσ (i) = {i, σ(i), σ 2 (i), . . . , σ k−1 (i)} = {a1 , . . . , ak }.
    Using the above, let us show that an arbitrary σ ∈ Sn can be factored
into a product of disjoint cycles as in the statement of Theorem 1.3. First,
given i ∈ {1, . . . , n}, either σ(i) = i, which happens ⇐⇒ Oσ (i) = {i}, or
Oσ (i) has at least 2 elements. In this second case, write Oσ (i) = {a1 , . . . , ak }
as above, where k = #(Oσ (i)) and σ(aj ) = aj+1 for 1 ≤ j ≤ k − 1 and
σ(ak ) = a1 . Thus, if ρ is the k-cycle (a1 , . . . , ak ), then ρ(aj ) = aj+1 = σ(aj )
for all aj , and ρ(r) = r if r ∈ / {a1 , . . . , ak } = Oσ (i) = Supp ρ.
    Now list the orbits of σ as O1 , . . . , ON , say, where O1 , . . . , OM are the
orbits with at least two elements and OM +1 , . . . , ON are the one-element
orbits. For each orbit Or with r ≤ M , we have found a cycle ρr of length at
least 2, with Supp ρr = Or , ρr (i) = σ(i) if i ∈ Or , and ρr (i) = i if i ∈     / Or .
Then the cycles ρ1 , . . . , ρM are disjoint, and by inspection σ = ρ1 · · · ρM .
This proves that σ is a product of disjoint cycles, and the uniqueness up to
order is easy to check from the construction.                                       
    The proof shows the following:
Corollary 1.8. Let σ ∈ Sn , and suppose that the orbits of σ are O1 , . . . , ON ,
where #(Oi ) = ki ≥ 2 if i ≤ M and #(Oi ) = 1 if i > M . Then
                                        σ = ρ1 · · · ρM ,
where each ρi is a ki -cycle, Supp ρi = Oi , and hence, if i 6= j, ρi and ρj are
disjoint.
                                                 8
Hence every k-cycle is a product of k − 1 transpositions. Since every per-
mutation σ ∈ Sn is a product of k-cycles and every k-cycle is a product of
transpositions, we conclude:
                                         9
of Nn is not important (exercise: show that Nn = (n − 1)!(n − 2)! · · · 2!1!);
clearly, Nn is just a large positive integer, and the main point is that it is
                                    Y σ ∈ Sn , consider what happens when
nonzero since no factor is zero. Given
we consider instead the product          (σ(j)−σ(i)). This is again a product
                                     1≤i<j≤n
of all possible differences of pairs of distinct positive integers between 1 and
n, but since σ mixes up the order we don’t always subtract the smaller of the
pair from the larger. Thus, there exists a sign, i.e. there exists an element
ε(σ) ∈ {±1}, such that
               Y                                         Y
                     (σ(j) − σ(i)) = ε(σ)Nn = ε(σ)             (j − i).
            1≤i<j≤n                                        1≤i<j≤n
                                           10
then τ (j) − τ (a) = j − b < 0, and there are b − a − 1 such j, so these
contribute a factor of (−1)b−a−1 . Likewise, when we look at the difference
b − i, then τ (b) − τ (i) = a − i. The sign is unchanged if i < a but becomes
negative if a < i < b. Again, there are b − a − 1 such i, so these contribute
another factor of (−1)b−a−1 which cancels out the first factor of (−1)b−a−1 .
The only remaining pair of integers that we have not yet considered is the
pair where i = a and j = b. Here τ (b) − τ (a) = a − b = −(b − a), so there
is one remaining factor of −1 which is not canceled out. Thus, the total
number of sign changes is
                      ε(τ ) = (−1)b−a−1 (−1)b−a−1 (−1) = −1.
    Finally, we note a fact which is useful in Modern Algebra II: it didn’t
matter that we chose the integers between 1 and n and looked at their
differences. We could have started with any sequence t1 , . . . , tn of, say,
real
   Ynumbers such that the ti are all distinct and looked at the product
        (ti −tj ). Comparing this product with the product where we permute
1≤i<j≤n
                    Y
the pairs, i.e.          (tσ(i) − tσ(j) ), the above analysis shows that
                  1≤i<j≤n
                     Y                                     Y
                            (tσ(i) − tσ(j) ) = ε(σ)                (ti − tj ),
                   1≤i<j≤n                               1≤i<j≤n
                                             11
Hence P (σ1 σ2 ) and P (σ1 )P (σ2 ) have the same value on each basis vector,
hence on all vectors, and thus P (σ1 σ2 ) = P (σ1 )P (σ2 ). Now, to convert
the matrix P (σ) to a number, we have the determinant det. Define ε(σ) =
det P (σ). As it stands, ε(σ) is just a real number. However, using the
multiplicative property of the determinant, for all σ1 , σ2 ∈ Sn , we have
ε(σ1 σ2 ) = det(P (σ1 σ2 )) = det(P (σ1 )P (σ2 )) = det P (σ1 ) det P (σ2 ) = ε(σ1 )ε(σ2 ).
                                           12
Claim 2.4. If ρ ∈ Sn and τ ∈ Sn is a transposition, then
Proof of Claim 2.4. Label the orbits of ρ (including the one-element orbits)
as O1 , . . . , ON , where N = N (ρ). As usual, we can assume that O1 , . . . , OM
have at least two elements and that OM +1 , . . . , ON are the one-element
orbits. We must show that ρτ has N ± 1 orbits. Since τ is a transposition,
we can write τ = (a, b) for some a, b ∈ {1, . . . , n}, a 6= b.
Case I. There is some orbit Oi of ρ, necessarily with at least two elements,
such that a, b ∈ Oi . In this case, we claim that N (ρτ ) = N (ρ) + 1. For
each j ≤ M , let Oj correspond to the kj -cycle µj , so that ρ = µ1 · · · µM ,
where the µj are disjoint and hence commute. For the cycle µi , we know
that a, b ∈ Supp µi = Oi , and a, b are not in the support of any µj , j 6= i.
Then (a, b) commutes with all of the µj , j 6= i, and hence
ρτ = µ1 · · · µM τ = µ1 · · · (µi τ ) · · · µM .
We see that a is sent to is+1 , is+1 is sent to is+2 , . . . , iki −1 to iki , iki to
i1 ,. . . , and finally it−1 is sent back to a. As for b, it is sent to it+1 , it+1 is
sent to it+2 , . . . , and finally is−1 is sent back to b. In other words,
say, where µ0 and µ00 are two cycles, disjoint from each other and from all
of the other µj and the remaining one-element orbits. (Note: it is possible
that one or both of µ0 , µ00 is a one-cycle. How could this happen?) It follows
that the orbits of ρτ are the Oj for j 6= i together with the two new orbits
O0 , O00 coming from µ0 and µ00 . Counting them, we see that there are N + 1
orbits in all, hence N (ρτ ) = N (ρ) + 1.
Case II. In this case, we assume that a and b are in different orbits, say
a ∈ Oi and b ∈ Oj with i 6= j. We let µi , µj be the corresponding cycles.
                                                     13
Note that it is possible that either Oi or Oj is a one-element orbit, in which
case we will simply define µi or µj to be the corresponding one-cycle (in
particular, as an element of Sn , it would then be the identity). Then
ρτ = µ1 · · · µM τ = µ1 · · · (µi µj τ ) · · · µM ,
where we have simply moved the µi and µj next to each other since all of the
µi commute. In this case, we claim that N (ρτ ) = N (ρ) − 1. Write as before
µi = (i1 , . . . , it−1 , a, it+1 , . . . , iki ) and µj = (`1 , . . . , `s−1 , b, `s+1 , . . . , `kj ). A
calculation similar to that in Case I shows that
say. Thus the orbits of ρτ are the Ok for k 6= i, j together with O = Supp µ.
Counting these up, we see that there are N − 1 orbits in all, hence N (ρτ ) =
N (ρ) − 1.
                                                      14
even, i.e. 1 ∈ An , either formally (1 is the product of 0 transpositions and
0 is even) or by writing 1 = (a, b)(a, b), which is possible as long as n ≥ 2.
Finally, if σ ∈ An , then σ = τ1 · · · τk is the product of k transpositions,
where k is even. Then
f (σ) = ρ · σ,
                                           15
function g : Sn − An → An such that f ◦ g and g ◦ f are both the identity
(on their respective domains). Define g : Sn − An → An by the rule:
g(σ) = ρ−1 σ.
If ρ is odd, then ρ−1 is odd, hence, if σ is also odd, then ρ−1 σ is even.
Thus g is really a function from Sn − An to An . Finally, if σ ∈ An , then
g ◦ f (σ) = ρ−1 ρσ = σ, and if σ ∈ Sn − An , then f ◦ g(σ) = ρρ−1 σ = σ. Thus
f ◦ g = IdSn −An and g ◦ f = IdAn , so that g = f −1 and f is a bijection.
16