Combinepdf 13
Combinepdf 13
edited by
M.E.Noz
Department ofRadiology,
New York University, U.S.A.
and
y. S. Kim
Department of Physics and Astronomy,
University of Maryland, U.S.A.
  ISBN-13: 978-94-010-7872-6
           978-94-010-7865-8     e-ISBN-13 978-90-277-2782-4
  001: 10.1007/978-90-277-2782-4                                                 CIP
All RightS
     Rights Reserved
© 1988 by Kluwer Academic Publishers
No part ofof the material protected by this copyright notice
                                                       not ice may be reproduced or
ulilized
ulilired in any form or by any means, electronic
                                         clectroni~ or mechanical
                                                       mechanical
including photocopying, recording or by any information storage and
retrieval system, without written permission from the copyright owner.
retrie~al
Table of Contents
Preface ix
Introduction xi
Special relativity and quantum mechanics are likely to remain the two most
important languages in physics for many years to come. The underlying language
for both disciplines is group theory. Eugene P. Wigner's 1939 paper on the Unitary
Representations of the Inhomogeneous Lorentz Group laid the foundation for
unifying the concepts and algorithms of quantum mechanics and special relativity.
In view of the strong current interest in the space-time symmetries of elementary
particles, it is safe to say that Wigner's 1939 paper was fifty years ahead of its time.
This edited volume consists of Wigner's 1939 paper and the major papers on the
Lorentz group published since 1939.                                              .
This volume is intended for graduate and advanced undergraduate students in
physics and mathematics, as well as mature physicists wishing to understand the
more fundamental aspects of physics than are available from the fashion-oriented
theoretical models which come and go. The original papers contained in this
volume are useful as supplementary reading material for students in courses on
group theory, relativistic quantum mechanics and quantum field theory, relativistic
electrodynamics, general relativity, and elementary particle physics.
This reprint collection is an extension of the textbook by the present editors entitled
"Theory and Applications of the Poincare Group." Since this book is largely
based on the articles contained herein, the present volume should be viewed as a
continuation of and supplementary reading for the previous work.
We would like to thank Professors J. Bjorken, R. Feynman, R. Hofstadter, J.
Kuperzstych, L. Michel, M. Namiki, L.Parker, S. Weinberg, E.P. Wigner, A.S.
Wightman, and Drs. P. Hussar, M. Ruiz, F. Rotbart, and B. Yurke for allowing us to
reprint their papers. We are grateful to Mrs. M. Dirac and Mrs. S. Yukawa for
giving us permission to reprint the articles of Professors P.A.M. Dirac and H.
Yukawa respectively.
We wish to thank the Annals of Mathematics for permission to reprint Professor
Wigner's historic paper. We thank the American Physical Society, the American
Association of Physics Teachers, The Royal Society of London, il Nuovo Cirnento
and Progress in Theorectical Physics for permission to reprint the articles which
appeared in their journals and for which they hold the copyright. The excerpt from
Albert Einstein: Historical and Cultural Perspective: The Centennial Symposium in
Jerusalem is reprinted with permission of Princeton University Press; that from
                                          ix
x                                                                       Preface
High Energy Collisions is reprinted with pennission of Gordon and Breach Science
publisher, Inc. and that from Aspects of Quantum Theory with pennission of
Cambridge University Press.
Introduction
One of the most fruitful and still promising approaches to unifying quantum
mechanics and special relativity has been and still is the covariant formulation of
quantum field theory. The role of Wigner' s work: on the Poincare group in quantum
field theory is nicely summarized in the fourth paragraph of an article by V.
Bargmann et al. in the commemorative issue of the Reviews of Modem Physics in
honor of Wigner's 60th birthday [Rev. Mod. Phys. 34, 587 (1962)], which
concludes with the sentences:
    "Those who had carefully read the preface of Wigner's great
    1939 paper on relativistic invariance and had understood the
    physical ideas in his 1931 book on group theory and atomic
    spectra were not surprised by the tum of events in quantum field
    theory in the 1950' s. A fair part of what happened was merely a
    matter of whipping quantum field theory into line with the
    insights achieved by Wignerin 1939".
It is important to realize that quantum field theory has not been and is not at present
the only theoretical machine with which physicists attempt to unify quantum
mechanics and special relativity. Indeed, Dirac devoted much of his professional
life to this important task, but, throughout the 1950's and 1960's, his form of
relativistic quantum mechanics was overshadowed by the success of quantum field
theory. However, in the 1970's, when it was necessary to deal with quarks confined
permanently inside hadrons, the limitations of the present form of quantum field
theory become apparent. Currently, there are two different opinions on the
difficulty of using field theory in dealing with bound-state problems or systems of
confined quarks. One of these regards the present difficulty merely as a
complication in calculation. According to this view, we should continue developing
mathematical techniques which will someday enable us to formulate a bound-state
problem with satisfactory solutions within the framework of the existing form of
quantum field theory. The opposing opinion is that quantum field theory is a model
that can handle only scattering problems in which all particles can be brought to
free-particle asymptotic states. According to this view we have to make a fresh start
for relativistic bound-state problems.
These two opposing views are not mutually exclusive. Bound-state models
developed in these two different approaches should have the same space-time
symmetry. It is quite possible that independent bound-state models, if successful in
                                                 Xl
xii                                                                      Introduction
explaining what we see in the real world, will eventually complement field theory.
One of the purposes of this book is to present the fundamental papers upon which a
relativistic bound-state model that can explain basic hadronic features observed in
high-energy laboratories could be build in accordance with the principles laid out by
Wigner in 1939.
Wigner observed in 1939 that Dirac's electron has an SU(2)-like internal space-time
symmetry. However, quarks and hadrons were unknown at that time. Dirac's form
of relativistic bound-state quantum mechanics, which starts from the representations
of the Poincare group, makes it possible to study the O(3)-like little group for
massive particles and leads to hadronic wave functions which can describe fairly
accurately the distribution of quarks inside hadrons. Thus a substantial portion of
hadronic physics can be incorporated into the O(3)-like little group for massive
particles.
Another important development in modern physics is the extensive use of gauge
transformations in connection with massless particles and their interactions.
Wigner's 1939 paper has the original discussion of space-time symmetries of
massless particles. However, it was only recently recognized that gauge-dependent
electromagnetic four-potentials form the basis for a finite-dimensional non-unitary
representation of the little group of the Poincare group. This enables us to associate
gauge degrees of freedom with the degrees of freedom left unexplained in Wigner's
work. Hence it is possible to impose a gauge condition on the electromagnetic
four-potential to construct a unitary representation of the photon polarization
vectors.
Wigner showed that the internal space-time symmetry group of massless particles is
locally isomorphic to the Euclidian group in two-dimensional space. However,
Wigner did not explore the content of this isomorphism, because the physics of the
translation-like transformations of this little group was unknown in 1939. Neutrinos
were known only as "Dirac electrons without mass", although photons were known
to have spins either parallel or antiparallel to their respective momenta. We now
know the physics of the degrees of freedom left unexplained in Wigner's paper.
Much more is also known about neutrinos today that in 1939. For instance, it is
firmly established that neutrinos and anti-neutrinos are left and right handed
respectively. Therefore, it is possible to discuss internal space-time symmetries of
massless particles starting from Wigner's E(2)-like little group. Recently, it was
observed that the O(3)-like little group becomes the E(2)-like group in the limit of
small mass and/or large momentum.
Indeed, group theory has become the standard language in physics. Until the
1960's, the only group known to the average physicist had been the three-
dimensional rotation group. Gell-Mann's work on the quark model encouraged
physicists to study the unitary groups, which are compact groups. The Weinberg-
Salam model enhanced this trend. The emergence of supersymmetry in the 1970's
has brought the space-time group closer to physicists. These groups are non-
compact, and it is difficult to prove or appreciate mathematical theorems for them.
Introduction                                                                     xiii
        Relativistic                                                   Ideal
       Quantum Mech.                                                 Mechanics
                u
               +=
               .>
                II)
                                          Poincare'
               +=                I          Group             )
               .!2
                CI)
               0::
    FIG. I. Two different routes to the ideal mechanics. Covariance and detenninism are
    the two main problems. In approaching these problems, there are two different
    routes. In either case, the Poincare group is likely to be the main scientific language.
PERSPECTIVE VIEW OF QUANTUM SPACE-TIME SYMMETRIES                                                                                     3
                      Reprinted from REVIEWS OF MODER" PHYSICS, Vol. 29, No. J, 255-268, July, 1957
                                                          Pri.ted in U. S. A.
INTRODUCTION is perhaps irritating. It does not alter the fact that the
though it may be academic in the case of the mercury       ahout the number of polarizations of a particle and the
perihelion. A difference in the tacit assumptions which    principal purpose of the following paragraphs is to
fix the coordinate system is increasingly recognized to    illuminate it from a different point of view.' Instead of
be at the bottom of many conflicting results arrived at    the question: "Why do particles with zero rest-mass
in calculations based on the general theory of relativity. have only two directions of polarization?" the slightly
Expressing our results in terms of the values of co-       different question, "Why do particles with a finite
ordinates became a habit with us to such a degree that     rest-mass have more than two directions of polariza-
we adhere to this habit also in general relativity where   tion?" is proposed.
values of coordinates are not per se meaningful. In           The intrinsic angular momentum of a particle with
order to make them meaningful, the mollusk-like            zero rest-mass is parallel to its direction of motion,
coordinate system must be somehow anchored to              that is, parallel to its velocity. Thus, if we connect
space-time events and this anchoring is often done with    any internal motion with the spin, this is perpendicular
little explicitness. If we wllnt to put general relativity  to the velocity. In case of light, we speak of transverse
on speaking terms with quantum mechanics, our first        polarization. Furthermore, and this is the salient point,
 task has to be to bring the statements of the general      the statement that the spin is parallel to the velocity
theory of relativity into such form that they conform      is a relativistically invariant statement: it holds as
with the basic principles of the general relativity theory well if the particle is viewed from a moving coordinate
itself. It will be shown below how this may be attempted.  system. If the problem of polarization is regarded from
                                                            this point of view, it results in the question, "Why
RELATIVISTIC QUANTUM THEORY OF ELEMENTARY can't the angular momentum of a particle with finite
                        SYSTEMS
                                                            rest-mass be parallel to its velocity?" or "Why can't
  The relation between special theory and quantum a plane wave represent transverse polarization unless
mechanics is most simple for single particles. The it propagates with light velocity?" The answer is that
equations and properties of these, in the absence of the angular momentum can very well be parallel to
interactions, can be deduced already from relativistic the direction of motion and the wave can have trans-
invariance. Two cases have to be distinguished: the verse polarization, but these are not Lorentz invariant
partiCle either can, or cannot, be transformed to rest. statements. In other words, even if velocity and spin
If it can, it will behave, in that coordinate system, are parallel in one coordinate system, they do not
as any other particle, such as an atom. It will have an appear to be parallel in other coordinate systems.
intrinsic angular momentum called J in the case of This is most evident if, in this other coordinate system,
atoms and spin S in the case of elementary particles. the particle is at rest: in this coordinate system the
This leads to the various possibilities with which we
are familiar from spectroscopy, that is spins 0, !, 1,
!, 2, ... each corresponding to a type of particle.
If the particle cannot be transformed to rest, its
velocity must always be equal to the velocity of light.
Every other velocity can be transformed to rest. The
rest-mass of these particles is zero because a nonzero
rest-mass would entail an infinite energy if moving
                                                                                t
                                                                                                       v
with light velocity.
   Particles with zero rest-mass have only two directions
of polarization, no matter how large their spin is. This
contrasts with the 2S+ 1 directions of polarization for
particles with nonzero rest-mass and spin S. Electro-
magnetic radiation, that is, light, is the most familiar
example for this phenomenon. The "spin" of light is 1,
but it has only two directions of polarization, instead
of 2S+ 1 = 3. The number of polarizations seems to
                                                                               i
jump discontinuously to two when the rest-mass
decreases and reaches the value O. Bass and Schrodinger"      FIG. 1. The short simple arrows illustrate the spin, the double
followed this out in detail for electromagnetic radiation, arrows the velocity of the particle. One obtains the same state,
                                                            no matter whether one first lmp:arts to it a velocity in the direction
that is, for S= 1. It is good to realize, however, that of the spin, then rotates it (R(")A (0, ..)), or whether one first
this decrease in the number of possible polarizations is rotates It, then gives a velocity in the direction of the spin
purely a property of the Lorentz transformation and (A (", ..)R(")). See Eq. (1.3).
holds for any value of the spin.                              4 The essential point of the argument which follows is contained
   There is nothing fundamentally new that can be said in the present writer's paper, Ann. Math. 40, 149 (1939) and more
                                                                    explicitly in his address at the Jubilee of Relativity Theory,
  • L. Bass and E. Schriidinger, Proc. Roy. Soc. (London) A232, 1   Bern, 1955 (Birkhauser Verlag, Basel, 1956), A. Mercier and
(1955).                                                             M. Kervaire, editors, p. 210.
PERSPECTIVE VIEW OF QUANTUM SPACE-TIME SYMMETRIES                                                                                    5
                                   I )
                                                                          direction of its motion any more. In the nonrelativistic
                                                                          case, that is, if all velocities are small as compared with
           i         f                                                    the velocity of light, the spin will still be parallel to z
                                                                          and it will, therefore, enclose an angle with the particle's
                                                                          direction of motion. This shows that the statement that
                  1//
                                                                          the spin is parallel to the direction of motion is not
                                                                          invariant in the nonrelativisitic region. However, if
                                                                          the original velocity of the particle is close to the light
                                                                          velocity, the Lorentz contraction works out in such a
                                                                          way that the angle between spin and velocity is given by
also showed a similar tendency. It is not equally                      appeared to be a perfectly valid concept in spectroscopy
alarming because, while the increase in the number of                  and in nuclear physics. This concept could be explained
elementary particles complicates our picture of nature,                very naturally as a result of the reflection symmetry
that of the symmetry properties on the whole simplifies                of space-time, the mirror image of electrons being
it. Nevertheless the clear correspondence between                      electrons and not positrons. We are now forced to believe
the invariance properties of the laws of nature, and the               that this symmetry is only approximate and the
symmetry properties of space-time, was most clearly                    concept of parity, as used in spectroscopy and nuclear
breached by the operation of charge conjugation.                       physics, is also only approximate. Even more funda-
This postulated that the laws of nature remain the                     mentally, there is a vast body of experimental informa-
same if all positive charges are replaced by negative                  tion in the chemistry of optically active substances
charges and vice versa, or more generally, if all particles            which are mirror images of each other and which have
are replaced by antiparticles. Reasonable as this                      optical activities of opposite direction but exactly
postulate appears to us, it corresponds to no symmetry                 equal strength. There is the fact that molecules which
of the space-tiine continuum. If the preceding inter-                  have symmetry planes are optically inactive; there is
pretation of the Co experiments should be sustained,                   the fact of symmetry planes in crystals. T All these
the correspondence between the natural symmetry                        facts relate properties of right-handed matter to
elements of space-time, and the invariance properties                  left-handed maller, not of right-handed matter to
of the laws of nature, would be restored. It is true that              left-handed anlimaller. The new experiments leave no
the role of the planes of reflection would not be that to              doubt that the symmetry plane in this sense is not
which we are accustomed-the mirror image of an elec-                   valid for all phenomena, in particular not valid for
tron would become a positron-but the mirror image of                   (j decay, that if the concept of symmetry plane is at all
a sequence of events would still be a possible sequence                valid for all phenomena, it can be valid only in the
of events. This possible sequence of events would be                   sense of converting matter into antimatter.
more difficult to realize in the actual physical world                     Furthermore, the old-fashioned type of symmetry
than what we had thought, but it would still be possible.              plane is not the only symmetry concept that is only
   The restoration of the correspondence between the                   approximately valid. Charge conjugation was mentioned
natural symmetry properties of space-time on one                       before, and we are remainded also of isotopic spin,
hand, and the laws of nature on the other hand, is the                 of the exchange character, that is multiplet system,
appealing feature of the proposition. It has, actually,                for electrons and also of nuclei which latter holds so
two alarming features. The first of these is that a                    accurately that, in practice, parahydrogen molecules
symmetry operation is, physically, so complicated.                     can be converted into orthohydrogen molecules only
If it should tum out that the operation of time inversion,             by first destroying them. 8 This approximate validity
as we now conceive it, is not a valid symmetry operation               of laws of symmetry is, therefore, a very general
(e.g., if one of the experiments proposed by Treiman                   phenomenon-it may be the general phenomenon. We
and Wyld gave a positive result) we could still maintain               are reminded of Mach's axiom that the laws of nature
the validity of this symmetry operation by reinterpret-                depend on the physical content of the universe, and
ing it. We could postulate, for instance, that time                    the physical content of the universe certainly shows
inversion transforms matter into meta-matter which                     no symmetry. This suggests-and this may also be
will be discovered later when higher energy accelerators               the spirit of the ideas of Yang and Lee-that all
will become available. Thus, maintaining the validity                  symmetry properties are only approximate. The
of symmetry planes forces us to a more artificial view                 weakest interaction, the gravitational force, is the basis
of the concept of symmetry and of the invariance of                    of the distinction between inertial and accelerated
the laws of physics.                                                   coordinate systems, the second weakest known inter-
   The other alarming feature of our new knowledge                     action, that leading to (j d~y, leads to the distinction
is that we have been misled for such a long time to                    between matter and antimatter. Let me conclude this
believe in more symmetry elements than actually exist.                 subject by expressing the conviction that the discoveries
There was ample reason for this and there was ample                    of Wu, Ambler, Hayward, Hoppes, and Hudson,'
experimental evidence to believe that the mirror image                 and of Garwin, Lederman, and Weinreich '• will not
of a possible event is again a possible event with                     remain isolated discoveries. More likely, they herald a
electrons being the mirror images of electrons and not                 revision of our concept of invariance and possibly
of positrons. Let us recall in this connection first how                  T For the role of the space and time inversion operators in
the concept of parity, resulting from the beautiful                    classical theory, see H. Zocher and C. Torok, Proc. Nat!. Acad.
                                                                       Sci. U.S. 39, 681 (1953) and literature quoted there.
though almost forgotten experiments of Laporte,'                          I See A. Farkas, Orthol,ydrog ... , PIJI'ahydrogen and Heavy
                                                                       Hydrogen (Cambridge University Press, New York, 1935).
  • O. Laporte, Z. Physik 23, 135 (1924). For the interpretation         • Wu, Ambler, Hayward, Hoppes, and Hudson, Phys. Rev. lOS,
of Laporte's rule in terms of the quantum-mechanic;a.1 operation       1413(L) (1957).
of inversion, see the writer's GrupPenJ/zeoru urut ihri A nwendungen     10 Garwin, Lederman, and Weinreich, Phys. Rev. lOS, 1415(1.)
auf die Quantm_lumik tier Almosp<kwen (Friedrich Vieweg und            (1957); also, J. l.. l'';p'oman and V. L. Tc1egdi, ibid. lOS, 1681 (I.)
Sohn, Braunschw"ig, 1931), Chap. XVIII.                                (19571.
8                                                                                                                  CHAPTER!
EUGENE P. WIGNER
of other concepts which are even more taken for                        Appendix III.) This shows that the establishment of a
granted.                                                               close network of points in space-time requires a
                                                                       reasonable energy density, a dense forest of world
 QUANTUM LIMITATIONS OF THE CONCEPTS OF                                lines wherever the network is to be established. How-
           GENERAL RELATMTY                                            ever, it is not necessary to discuss this in detail because
  The last remarks naturally bring us to a discussion                  the measurement of the distances between the points of
of the general theory of relativity. Tbe main premise                  the network gives more stringent requirements than
of this theory is that coordinates are only labels to                  the establishment of the network.
specify spare-time points. Their values have no partic-                   It is often said that the distances between events
ular significance unless the coordinate system is                      must be measured by yardsticks and rods. We found
somehow anchored to events in space-time.                              that measurements with a yardstick are rather difficult
  Let us look at the question of how the equations of                  to describe and that their use would involve a great
the general theory of relativity could be verified.                    deal of unnecessary complications. The yardstick gives
The purpose of these equations, as of all equations of                 the distance between events correctly only if its marks
physics, is to calculate, from the knowledge of the                    coincide with the two events simultaneously from the
present, the state of affairs that will prevail in the                 point of view of the rest-system of the yardstick.
future. The quantities describing the present state are                Furthermore, it is hard to image yardsticks as anything
called initial conditions; the ways these quantities                   but macroscopic objects. It is desirable, therefore,
change are called the equations of motion. In relativity               to reduce all measurements in space-time to measure-
theory, the state is described by the metric which                     ments by clocks. Naturally, one can measure by
consists of a network of points in splce-time, that is                 clocks directly only the distances of points which are
a network of events, and the distances between these                   in time-like relation to each other. The distances of
events. If we wish to translate these general statements               events which are in space-like relation, and which
into something concrete, we must decide what events                    would be measured more naturally by yardsticks,
are, and how we measure distances between" evenls.                     will have to be measured, therefore, indirectly.
The metric in the general theory of relativity is a                        It appears, thus, that the simplest framework in
metric in space-time, its elements are distances between               space-time, and the one which is most nearly micro-
space-time points, not between points in ordinary space.               scopic, is a set of clocks, which are only slowly moving
   The events of the general theory of relativity are                   with respect to each other, that is, with world lines
coincidences, that is, collisions between particles.                   which are approximately parallel. These clocks tick
The founder of the theory, when he created this concept,               off periods and these ticks form the network of events
had evidently macroscopic bodies in mind. Coincidences,                 which we wanted to establish. This, at the same time,
tbat is, collisions between such bodies, are immediately               establishes the distance of those adjacent points which
observable. This is not the case for elementary particles;             are on the same world line.
a collision hetween these is something much more                           Figure 4 shows two world lines and also shows an
evanescent. In fact, the point of a collision between                  event, that is, a tick of the clock, on each. The figure
two elementary particles can be closely localized in                    shows an artifice which enables one to measure the
space-time only in case of high-energy collisions. (See                distance of space-like events: a light signal is sent out
                                                                        from the. first clock which strikes the second clock
                                                                       at event 2. This clock, in tum, sends out a light signal
                                                                        which strikes the first clock at time I' after the event 1.
                                                                       If the first light signal had to be sent out at time j
                                                                        before the first event, the calculation given in Appendix
                                                                        IV shows that the space-like distance of events 1 and 2
                                                                       is the geometric average of the two measured time-like
                                                                       distances j and 1'. This is then a way to measure
                                                                       distances between space-like events by clocks instead
                                                                       of yardsticks.
                                                                           It is interesting to consider the quantum limitations
                                                                       on the accuracy of the conversion of time-like measure-
                                                                        ments into space-like measurements, which is illustrated
                                                                        in Fig. 4. Naturally, the times / and /' will be well
                                                                        defined only if the light signal is a short pulse. This
   FIG. 4. Measurement of space-like distances by means of a            implies that it is composed of many frequencies and,
clock. It is assumed that the metric tensor is essentially constant     hence, that its energy spectrum has a corresponding
within the space-time region contained in the figure. The space-like    width. As a result, it will give an indeterminate recoil
distance between events 1 and 2 is measured by means of the light
signals which pass through event 2 and a geodesic which goes            to the second clock, thus further increasing the un-
through event I. Explanation in Appendi.lV.                             certainty of its momentum. All this is closely related
PERSPECfIVE VIEW OF QUANTUM SPACE-TIME SYMMETRIES                                                                       9
to Heisenberg's uncertainty principle. A more detailed For example, a clock, with a running time of a day and
calculation' shows that the added uncertainty is of an accuracy of 10-a second, must weigh almost a
the same order of magnitude as the uncertainty inherent gram-for reasons stemming solely from uncertainty
in the nature of the best clock that we could think of, principles and similar considerations.
so that the conversion of time-like measurements               So far, we have paid attention only to the physical
into space-like measurements is essentially free.           dimension of the clock and the requirement that it
    We finally come to the discussion of one of the be able to distinguish between events which are only
principal problems-the limitations on the accuracy a distance I apart on the time scale. In order to make
of the clock. It led us to the conclusion that the inherent it usable as part of the framework which was described
limitatioI1l! on the accuracy of a clock of given weight before, it is necessary to read the clock and to start it.
and size, which should run for a period of a certain As part of the framework to map out the metric of
length, are quite severe. In fact, the result in summary space-time, it must either register the readings at
is that a clock is an essentially nonmicroscopic object. which it receives impulses, or transmit these readings
In particular, what we vaguely call an atomic clock, to a part of space outside the region to be mapped out.
a single atom which ticks off its periods, is surely an This point was already noted by Schriidinger." How-
idealization which is in conflict with fundamental ever, we found it reassuring that, in the most interesting
concepts of measurability. This part of our conclusions case in which 1= ct, that is, if space and time inaccuracies
can be considered to be well established. On the other are about equal, the reading requirement introduces
hand, the actual formula which will be given for the only an insignificant numerical factor but does not
limitation of the accuracy of time measurement, a sort change the form of the expression for the minimum
of uncertainty principle, should be considered as the mass of the clock.
best present estimate.                                         The arrangement to map the metric might consist,
    Let us state the requirements as follows. The watch therefore, of a lattice of clocks, all more or less at rest
shall run T seconds, shall measure time with an accuracy with respect to each other. All these clocks can emit
of Tjn=l, its linear extension shall not exceed I, its light signals and receive them. They can also transmit
mass shall be below m. Since the pointer of the watch their reading at the time of the receipt of the light
must be able to assume n different positions, the system signal to the outside. The clocks may resemble oscil-
will have to run, in the course of the time T, over at lators, well in the nonrelativistic region. In fact, the
least n orthogonal states. Its state must, therefore, be velocity of the oscillating particle is about n times
 the superposition of at least n stationary states. It is smaller than the velocity of light where n is the .ratio
clear, furthermore, that unless its total energy is at of the error in the time measurement, to the dui-ation
least h/I, it cannot measure a time interval which is of the whole interval to be measured. This last quantity
 smaller than I. This is equivalent to the usual un- is the spacing of the events on the time axis, it is also
certainty principle. These two requirements follow the distance of the clocks from each other, divided by
directly from the basic principles of quantum theory; the light velocity. The world lines of the clocks from
they are also the requirements which could well have the dense forest which was mentioned before. Its
 been anticipated. A clock which conforms with these branches suffuse the region of space-time in which the
postulates is, for instance, an oscillator, with a period metric is to be mapped out.
 which is equal to the running time of the clock, if it        We are not absolutely convinced that our clocks
is with equal probabilty in any of the first n quantum are the best possible. Our principal concern is that we
states. Its energy is about n times the energy of the have considered only one space-like dimension. One
 first excited state. This corresponds to the uncertainty consequence of this was that the oscillator had to be a
principle with the accuracy t as time uncertainty. one-dimensional oscillator. It is possible that the size
Broadly speaking, the clock is a very soft oscillator, the limitation does not increase the necessary mass of the
oscillating particle moving very slowly and with a clock to the same extent if use is made of all three
 rather large amplitude. The pointer of the clock is spatial dimensions.
the position of the oscillating particle.                      The curvature tensor can be obtained from the
    The clock of the preceding paragraph is still very metric in the conventional way, if the metric is measured
 light. Let us consider, however, the requirement that with sufficient accuracy. It may be of interest, never-
 the linear dimensions of the clock be limited. Since theless, the describe a more direct method for measuring
 there is little point in dealing with the question in the curvature of space. It involves an arrangement,
great generality, it may as well be assumed here that illustrated in Fig. S, which is similar to that used for
 the linear dimension shall correspond to the accuracy obtaining the metric. There is a clock, and a mirror,
 in time. The requirement I=cl increases the mass of the at such a distance from each other that the curvature
clock by nl which may be a very large factor indeed:        of space can be assumed to be constant in the interven-
                                                             11 E. Schrlklinger, Ber. Preuss. Akad. Wiss. phys.-math. Kl.
                  m> n'ht/l'=n1h/c't.                (2)   1931,238.
10                                                                                                                CHAPTER I
ing region. The two clocks need not be at rest with             for the accuracy with which the curvature can be
respect to each other, in fact, such a requirement would        measured. The result is, as could be anticipated, that
involve additional measurements to verify it. If the            the curvature at a poinl in space-time cannot be
space is flat, the world lines of the clocks can be drawn       measured a t all j only the average curvature over a
straight. In order to measure the curvature, a light            finite region of space-time can be obtained. The error of
signal is emitted by the clock., and this is reflected by       the measurement1 is inversely proportional to the
the mirror. The time of return is read on the clock-it          two-thirds power of the area available in space-time,
is Ir-and the light signal returned to the mirror.              that is, the area around which a vector is carried,
The time which the light signal takes on its second trip        always parallel to itself, in the customary definition of
to return to the clock is denoted by I,. The process is         the curvature. The error is also proportional to the cube
repeated a third time, the duration of the last roundtrip       root of the Compton wavelength of the clock. Our
denoted by I,. As shown in Appendix V, the radius of            principal hesitation in considering this result as defini-
curvature, a, and the relevant componenl         ROID1    of    tive is again its being based on the consideration of
the   Ri~mann   tensor are given by                             only one space-like dimension. The possibilities of
                                                                measuring devices, as well as the problems, may be
                1,- 21,+1, 11                                   substantially different in three-dimensional space.
                ----=-=llnR OlOl)l.                      (2)
                  122 a                                            Whether or not this is the case, the essentially
                                                                nonmicroscopic nature of the general relativistic
   If classical theory would be valid also in the micro-        concepts seems to us inescapable. If we look at this
scopic domain, there would be no limit on the accuracy          first from a practical point of view, the situation is
of the measurement indicated in Fig. 5. If h is infinitely      rather reassuring. We can note first, that the measure-
small, the time intervals 11, I" I, can all be measured         ment of electric and magnetic fields, as discussed by
with arbitrary accuracy with an infinitely light clock.         Bohr and Rosenfeld,12 also requires macroscopic, in
Similarly, the light signals between clock and mirror,          fact very macroscopic, equipment and that this does
however short, need carry only an infinitesimal amount          not render the electromagnetic field concepts useless
of momentum and thus deflect clock and mirror                   for the purposes of quantum electrodynamics. It is
arbitrarily little from their geodesic paths. The quantum       true that the measurement of space-time curvature
phenomena considered before force us, 'however, to              requires a finite region of space and there is a minimum
use a clock with a minimum mass if the measurement              for the mass, and even the mass uncertainty, of the
of the time intervals is to have a given accuracy. In           measuring equipment. However, numerically, the
the present case, this accuracy must be relatively              situation is by no means alarming. Even in interstellar
high unless the time intervals I" I" I, are of the same         space, it should be possible to measure the curvature
order of magnitude as the curvature of space. Similarly,
the deflection of clock and mirror from their geodesic               "N. Bohr and L. Rosenfeld, Kg!. Danske Videnskab. Selskah
                                                                    Mat.-fys. Medd. 12, No. 8 (1933). See also further literature
paths must be very small if the result of the measure-          quoted in L. Rosenfeld's article in Niels Eo"" and lJu Development
ment is to be meaningful. This gives an effective limit         oj Physics (Pergamon Press, London. 1955).
PERSPECTIVE VIEW OF QUANTUM SPACE-TIME SYMMETRIES                                                                11
in a volume of a light second or so. Furthermore, the      axis, we look at the particle in the standard state from
mass of the clocks which one will wish to employ for       a coordinate system moving with the velocity v in
such a measurement is of the order of several micro-       the -. direction. If we wish to have a particle at rest
grams SO that the finite mass of elementary particles      but with its spin in the yz plane, including an angle a
does not cause any difficulty. The clocks will contain     with the • axis, we look at the standard state from a
many particles and there is no need, and there is not      coordinate system the y and z axes of which include an
even an incentive, to employ clocks which are lighter      angle a with the y and. axes of the coordinate system
than the elementary particles. This is hardly surprising   in which the standard state was defined. In order to
since the mass which can be derived from the gravita-      obtain a state in which both velocity and spin have the
tional constant, light velocity, and Planck's constant,    aforementioned direction (Le., a direction in the y.
is about 20 micrograms.                                    plane, including the angles a and ir-a with the y
   It is well to repeat, however, that the situation is    and z axes), we look at the standard state from the
less satisfaQ:ory from a more fundamental point of         point of view of a coordinate system in which the
view. It remains true that we consider, in ordinary        spin of the standard state is described as this direction
quantum theory, position operators as observables          and which is moving in the opposite direction.
without specifying what the coordinates mean. The            Two states of the system will be identical only if the
concepts of quantum field theories are even more           Lorentz frames of reference which define them are
weird from the point of view of the basic observation      identical. Under this definition, the relations which
that only coincidences are meaningful. This again is       will be obtained will be valid independently of the
hardly surprising because even a 20-microgram clock        properties of the particle, such as spin or mass (as
is too large for the measurement of atomic times or        long as the mass in nonzero so that the standard state
distances. If we analyze the way in which we "get          exists). Two states will be approximately the same if the
away" with the use of an absolute space concept, we        two Lorentz frames of reference which define them
simply find that we do not. In our experiments we          can be obtained from each other by a very small
surround the microscopic objects with a very macro-        Lorentz transformation, that is, one which is near
scopic framework and observe coincUknces between           the identity. Naturally, all states of a particle which
 the particles emanating from the microscopic system,      can be compared in this way are related to each other
 and parts of the framework. This gives the collision      inasmuch as they represent the same standard state
 matrix, which is observable, and observable in terms of   viewed from various coordinate systems. However, we
 macroscopic coincidences. However, the so-called          shall have to compare only these states.
 observables of the microscopic system are not only not       Let us denote by A (O,I") the matrix of the trans-
 observed, they do not even appear to be meaningful.       formation in which the transformed coordinate system
 There is, therefore, a boundary in our experiments        moves with the velocity -. in the • direction where
 between the region in which we use the quantum            .=c tanhl"
 concepts without worrying about their meaning in
 face of the fundamental observation of the general
 theory of relativity, and the surrounding region in
                                                                       A(O'¥'}=II~o smh¥,
                                                                                    c?~¥' coshl"
                                                                                          s~hl"ll·             (1.1)
 which we use concepts which are meaningful also in
 the face of the basic observation of the general theory   Since the x axis will play no role in the following
 of relativity but which cannot be described by means of   consideration, it is suppressed in (1.1) and the three
 quantum theory. This appears most unsatisfactory          rows and the three columns of this matrix refer to the
 from a strictly logical standpoint.                       y', .', d and to the y, " cl axes, respectively. The
                                                           matrix (1.1) characterizes the state in which the
                      APPENDIX I                           particle moves with a velocity v in the direction of the
  It will be necessary, in this appendix, to compare       z axis and its spin is parallel to this axis.
various states of the same physical system. These             Let us further denote the matrix of the rotation by
states will be generated by looking at the same state-     an angle I" in the yz plane by
the standard state-from various coordinate systems.
Hence every Lorentz frame of reference \ will define a
state of the system-the state as which the standard                                                            (1.2)
state appears from the point of view of this coordinate
system. In order to define the standard state, we          We refer to the direction in the yz plane which lies
choose an arbitrary but fixed Lorentz frame of reference   between the y and. axes and includes an angle {J with
and stipulate that, in this frame of reference, the        the z axis as the direction {J. The coordinate system
particle in the standard state be at rest and its spin     which moves with the velocity -. in the {J direction is
(if any) have the direction of the z axis. Thus, if we     obtained by the transformation
wish to have a particle moving with a velocity v in
the • direction and with a spin also directed along this                 A ({J,¥'}=R({J}A (O,¥,}R( -{J}.       (1.3)
12                                                                                                                       CHAPTER I
In order to obtain a particle wbich moveb in the direc-            This transformation does not have the form (1.4). In
tion {} and is polarized in this direction, we first rotate        order to bring it into that form, it has to be multiplied
the coordinate system counterclockwise by {} (to have              on the right by R(o), i.e., one bas to rotate the spin
the particle polarized in the proper direction) and                ahead of time. The angle 0 is given by the equation
impart it then a velocity - v in the {} direction. Hence,
it is the transformation                                                                tanh '1" v'
                                                                                    tan.=--=-(l-.'N)1                               (1.7)
                                                                                         sinh I" v
                        sin{} cosh I" sin{} sinh I"
                        cos{} cosh I" cos{} sinh I"
                                                      II   (1.4)
                                                                   and is called the angle between spin and velocity.
                                                                   For V«c, it becomes equal to the angle which the
                            sinh I"      coshl"                    ordinary resultant of two perpendicular velocities, v
which characterizes the aforementioned state of the                and v', includes with the first of these. However, •
particle. It follows from (1.3) that                               becomes very small if v is close to c; in this case it is
                                                                   hardly necessary to rotate the spin away from the z
           T({},I")=R({})A(O'I")=R(~)T(O,I")               (1.5)   axis before giving it a velocity in the z direction.
so that the same state can be obtained also by viewing             These statements express the identity
tbe state characterized by (1.1) from a coordinate
                                                                                  A (!r,I"/)A (O,I")R(.) = T({},I"")                (1.8)
system that is rotated by{}. It follows that the statement
"velocity and spin are parallel" is invariant under                which can be verified by direct calculation. The right
rotations. This had to be expected.                                side represents a particle with parallel spin and velocity,
   If the state generated by A (O,\O)=T(O,I") is viewed            the magnitude and direction of the latter being given
from a coordinate system which is moving with the                  by the well-known equations
velocity u in the direction of the z axis, the particle
will still appear to move in the z direction and its spin                        ."=c tanh 1"" = (.'+v"-.'v"/c')1                 (1.Sa)
will remain parallel to its direction of motion, unless            and
u>v in which case the two directions will become                                       sinh 1"/    v/
antiparallel, or unless u= v in which case the statement                            tan{}=--=----                                 (1.8b)
becomes meaningless, the particle appearing to be                                      tanhl" v(l-v"/c')1
at rest. Similarly, the otber states in which spin and             Equation (I) given in the text follows from (1.7) and
velocity are parallel, i.e., the states generated by the           (1.8b) for fY'Vc.
transformations T({},I") , remain such states if viewed              The fact that the states T(t'J,I")>{;o (where >{;o is the
from a coordinate system moving in the direction of the            standard state and 1"» 1) are approximately invariant
particle's velocity, as long as the coordinate system is           under all Lorentz transformations is expressed mathe-
not ·moving faster than the particle. This also had to             matically by the equations,
be expected. However, if the state generated by T(O,<P)
is viewed from a coordinate system moving with velocity                            R{{})· T(O,I")>{;o= T({},I")>{;o,              (1.Sa)
v' = c tanhl'" in the - y direction, spin and velocity will
nol appear parallel any more, provided Ihe velocily v                           A (0,'1")· T(O,I")>{;o= T(O,I"'+<P)>{;o,          (1.9a)
of Ihe parlicle is nol close 10 lighl velocily. This last          and
proviso is the essential one; it means tbat the bigh                                                                              (1.9b)
velocity states of a particle for which spin and velocity
are parallel (i.e., the states generated by (1.4) with a           which give the wave function of the state T(O,I")>{;o,
large 1") are states of this same nature if viewed from a          as viewed from other Lorentz frames of reference.
coordinate system which is not moving too fast in the              Naturally, similar equations apply to all T(a,I")>{;o.
direction of motion of the particle itself. In the limiting        In particular, (1.Sa) shows that the states in question
case of the particle moving with light velocity, the               are invariant under rotations of the coordinate system,
aforementioned states become invariant under all                    (1.9a) that they are invariant with respect to Lorentz
Lorentz transformations.                                           transformations with a velocity not too' high in the
   Let us first convince ourselves that' if the state              direction of motion (so that 1"'+ 1">>0, i.e., 1'" not too
(1.1) is viewed from a coordinate system moving in                 large a negative number). Finally, in order to prove
the - y direction, its spin and velocity no longer appear           (1.9b), we calculate the transition probability between
parallel. The state in question is generated from the               tbe states A (!r,I"')· T(O,I")>{;o and T({}"P")>{;o where
normal state by the transformation                                 t'J and 1"" are given by (l.8a) and (l.8b). For this,
                                                                    (1.8) gives
A <tr,<P')A (0,'1')
                                                                   (A (!r, 1"') . T(O,I")>{;o, T({},I"")>{;o)
       Il °
          cOSb<P' sinh<p sinhl'"
      =               cosh I"                              (1.6)                  = (T({},I"")R(.)-l,fo, T(IJ,I"")l/to)
          Sinhl'" sinh<p coshl'"                                                                           = (R(o)-l,fo,./to)-+(l/to,y,o).
PERSPECfIVE VIEW OF QUANTUM SPACE-TIME SYMMETRIES                                                                                  13
The second line follows because T(rJ,rp") represents a                The three operations I, T, C, together with their
coordinate transformation and is, therefore, unitary.               products TC (Liiders' time inversion of the second
The last member follows because .-.0 as <p->oo as                   kind), IC, IT, ITC and the unit operation form a
can be seen from (1.7) and R(O) = 1.                                group and the products of the elements of this group
   The preceding consideration is not fundamentally                 with those of the proper Poincare group were considered
new. It is an elaboration of the facts (a) that the                 to be the symmetry operations of all laws of physics.
subgroup of the Lorentz group which leaves a null-vector            The suggestion given in the text amounts to eliminating
invariant is different from the subgroup which leaves               the operations I and C separately while continuing to
a time like vector invariant' and (b) that the representa-          postulate their product IC as symmetry operation.
tions of the latter subgroup decompose into one                     The discrete symmetry group then reduces to the unit
dimensional representations if this subgroup is "con-               operation plus
tracted" into the subgroup which leaves a null-vector                                        IC, T, and ICT,                     (2.1)
invariant."
                                                                    and the total symmetry group of the laws of physics
                        APPENDIXD                                   becomes the proper Poincare group plus its products
   Before the hypothesis of Lee and Yang" was put                   with the elements (2.1). This group is isomorphic
forward, it was commonly assumed that there are, in                 (essentially identical) with the unrestricted Poincare
addition to the symmetry operations of the proper                   group, i.e., the product of all Lorentz transformations
Poincare group, three further independent symmetry                  with all the displacements in space and time. The
operations. The proper Poincare group consists of all               quantum mechanical expressions for the operations of
Lorentz transformations which can be continuously                   the proper Lorentz gl'oup and its product with IC are
obtained from unity and all translations in space-like              unitary, those for T and ICT (as well as for their
and time-like directions, as well as the products of all            products with the elements of the proper Poincare
these transformations. It is a continuous group j the               group) antiunitary. Liiders" has pointed out that,
Lorentz transformations contained in it do not change               under certain very natural conditions, ICT belongs to
the direction of the time axis and their determinant is             the symmetry group of every local field theory.
1. The three independent further operations which were
considered to be rigorously valid, were                                                          APPENDIX    m
     Space inversion I, that is, the transformation                   Let us consider, first, the collision of two particles
  x, y, z->- x, - y, - 3, without changing particles into           of equal mass m in the coordinate system in which the
  antiparticles.                                                    average of the sum of their momenta is zero. Let us
     Time inversion T, more appropriately described                 assume that, at a given time, the wave function of
  by Liiders" as Umkehr der Bewegungsrichtung, which                both particles is confined to a distance I in the direction
  replaces every velocity by the opposite velocity                  of their average velocity with respect to each other. If
  so that the position of the particles at +1 becomes               we consider only this space-like direction, and the time
  the same as it was, without time inversion, at -/.                axis, the area in space-time in which the two wave
  The time inversion T (also called time inversion of               functions will substantially overlap is [see Fig. 6(a)]
  the first kind by Liiders' ·) does not convert particles
  into antiparticles either.                                                                                                     (3.1)
     Charge conjugation C, that is, the replacement of
  positive charges by negative charges and more                     where Vmi. is the lowest velocity which occurs with
  generally of particles by antiparticles, without chang-           substantial probability in the wave packets of the
  ing either the position or the velocity of these par-             colliding particles. Denoting the average momentum
  ticles'" The quantum-mechanical expressions for the               by ji (this has the same value for both particles) the
  symmetry operations I and C are unitary, that for T               half-width of the momentum distribution by a, then
  is antiunitary.                                                   Vml.= (ji-a)(m'+(ji-a)'/c2)-I. Since I cannot be below
                                                                    k/a, the area (3.1) is at least
   u E. Inonu and E. P. Wigner, Proc. Nat!. Acad. Sci. U.S·
39,510 (1953).                                   .                                          h' (m'+(ji-a)'/c')1
   .. T. D. Lee and C. N. Yang, Phys. Rev. 104, 254 (1956). See                                                                 (3.1a)
also E. M. Purcell and N. F. Ramsey, Phys. Rev. 78, 807 (1950).
   .. G. Lilders, Z. Physik 133,325 (1952).                                                 U'        ji-a
   II G. Lilders, Kg!. Danske Videnskab. Selskab MatAys. Medd.
28, No.5 (1954).                                                    (Note that the area becomes infinite if 6>ji.) The
   "All three symmetry operations were first discussed in detail
by J. Schwinger, Phys. Rev. 74, 1439 (I94K). See also H. A.         the present writer, Z. Physik 43, 624 (1927) and Nachr. Akad.
Kramers, Proc. Acad. Sci. Amsterdam 40, 814 (1937) and W. Pauli'.   Wi... Gallingen, Math.-physik. 1932,546. See also T. D. Newton
article in NiJs Bolw aoul 'M v...Iop"..,., of Physia (Pergamon      and E. P. Wigner, Revs. Modem Phy •. 21, 400 (1949); S.
Press, London, 1955). The significance of the tint two symmetry     Watanabe, Revs. Modern Phys. 27, 26 (1945). The concept of
operations (and their connection with the concepts of parity and    charge conjugation is   based on the observation of W. Furry, Phys.
the Kramers degeneracy respectively), were first pointed out by     Rev. 51, 125 (1937).
14                                                                                                                           CHAPTER I
                                         ,,
                                                              L                x
                                              \
                                                  ,,
                                                   ,,                                              I
                                                     "
                                                                                               I
(b)
            FIG. 6. (al Localization of a collision of two particles of equal mass. The full lines indicate the effective boundaries
         of the wave packet of the particle traveling to the right, the broken lines the effective boundaries of the wave packet
         of the particle traveling to the left. The collision can take place in the shaded area of space-time. (bl Localization
         of a collision between a particle with finite mass and a particle with zero rest-mass. The full lines. at a distance
         A apart in the % direction, indicate the boundary of the particle with uro rest-mass, the broken lines apply to the
         wave packet of the particle with nonzero rest-mass. The collision can take place in the shaded area.
minimum of (3.1a) is, apart from a numerical factor                     ties. Hence p'" hiX. The kinetic energy of the particle
                                                                        with finite restmass will be of the order of magnitude
               t'                  h'c
          amln"'-(m'+P'Ic')I",            ,(3.2)                        !{m'C<+ (p+hll)'c')I+Hm'C<+ (p-hll)'c')I_ mc', (3.4)
               p'              EI(E+mc')1
                                                                        since hll is the momentum uncertainty. Since I ~ >.,
where E is the kinetic energy (total energy minus                       one can neglect p in (3.4) if one is interested only in the
rest-energy) of the particles.                                          order of magnitude. This gives for the total kinetic
   The kinetic energy E permits the contraction of the                  energy,
wave functions of the colliding particles also in direc-                            E",hc/;"+ (m'C<+h'c'/l')I_mc',            (3.5)
tions perpendicular to the average relative velocity,                   while the area in Fig. 6(b) is of the order of magnitude
to an area h'c'IE(E+2mc'). Hence, again apart from a
numerical factor, the volume to which the collision                                            a= (Xlc) (l+<lvXlc),                     (3.6)
can be confined in four dimensional space-time becomes                  where <lv is the uncertainty in the velocity of the second
                                                                        particle
                    v mio = - - - - -                          (3.3)
                                                                                        pHil                 ji-h/I
                             EI(E+mc')!                                                                    ------.                     (3.6a)
                                                                                (m'+(p+hll)'lc')! (m'+cp-hll)'Ic')1
E is the average kinetic energy of the particles in the
coordinate system i.n which their center of mass is,                    This can again be replaced by (hll)(m'+h'/l'c')-I.
on the average, at rest. Equation (3.3) is valid apart                     For given E, the minimum value of a is assumed if the
from a numerical constant of unit order of magnitude                    kinelic energies of the two particles are of the same
but this constant depends on Elmc'.                                     order of magnitude. The two terms of (3.6) then become
   Let us consider now the opposite limiting case,                      about equal and 1/>.", (EI(m+E»)I. The -minimum
the collision of a particle with finite rest-mass m with a              value of a, as far as order of magnitude is concerned,
particle with zero rest-mass. The collision is viewed                   is again given by (3.2). Similarly, (3.3) also remains
again in the coordinate system in which the average                     valid if one of the two particles has zero rest-mass.
linear momentum is zero. In this case, one will wish to                    The two-dimensional case becomes simplest if both
confine the wave function of the particle with finite                   particles have zero rest-mass. In this case the wave
rest-mass to a narrower region I than that of the particle              packets do not spread at all and (3.2) can be im-
with zero rest-mass. If the latter is confined to a region              mediately seen to be valid. In the four-dimensional case,
of thickness >., [see Fig. 6(b)], its momentum and                      (3.3) again holds. However, its proof by means of
energy uncertainties will be at least hlX and helX                      explicitly constructed wave packets (rather than
and these expressions will also give, apart from a                      reference to the uncertainty relations) is by no means
numerical factor, the average values of these quanti-                   simple. It requires wave packets which are confined in
PERSPECflVE VIEW OF QUANTUM SPACE-TIME SYMMETRIES                                                                                  15
The radial coordinates of the points I, 2, 3, 4 are            Similar expressions apply for the traveling times of the
denoted by rl, r" r., r.                                       first and third light signals; all </> can be expressed by
                     r,=a/co&/l,.                      (5.4)   means of (5.3a), (5.3b), (5.3c) in terms of </>. and 8.
                                                               This allows the calculation of the expression (3). For
Tbe proper time I, registered by the clock, can be             small a, one obtains
obtained by integrating tbe metric form (5.2) along
the world line </>=0 of the clock
                 I=a In[r+(r-a,)tJ.                    (5.5)                       ---"'-,                          (5.7)
                                                                                        I.'      a
Hence, the traveling time I, of tbe second light signal
becomes                                                        and Riemann's invariant R=2/a' is proportional to
             r.+ (r,'-a') I      co&/l.(l +sin</>.)            the square of (5.7). In particular, it vanishes if the
      I,=a In               a In                     . (5.6)
             r.+(r.·-a')t       coSr/>.(l+sin</>.)             expression (3) is zero.
PERSPECTIVE VIEW OF QUANTUM SPACE-TIME SYMMETRIES                                      17
P. A. M. Dirac
79
Reprinted from Albert Einstein: Historical and Cultural Perspectives: The Centennial
Symposium in Jerusalem, (1979).
18                                                                      CHAPTER!
P. A. M. DIRAC
                                 80
PERSPECTIVE VIEW OF QUANTUM SPACE-TIME SYMME1RIES                       19
about these things, and it was all completely new to us. Then we
had Eddington to explain things.
  Eddington was very good at popular exposition. He had a great
talent for it, and he applied his talent to explaining the founda-
tions of relativity theory to the general public. He told us about
the need to consider that we cannot communicate instanta-
neously with people at a distance. We could communicate only
with the help of light signals and make a note of the time the
light signals were sent out and the time we received an answer,
and we had to work on that. Eddington also told us about the
Michelson-Morley experiment. It was quite an old experiment,
but one whose importance had not been previously appreciated.
Alben A. Michelson and Edward W. Morley had attempted to
determine the velocity of the earth through the ether, but, sur-
prisingly, their experiment failed to give any definite answer. The
only way to account for this failure was to suppose a very peculiar
behavior of measuring rods and clocks. Moving measuring rods
had to be subject to a kind of contraction, called the Lorentz-
Fitzgerald contraction. Clocks had to have their rates slowed up
when they were moving.
   All this was very hard to explain to the general public, but still
Eddington and other people wrote innumerable articles about it,
doing the best they could. The engineering students were not very
much better off. We were told that in some way the absolute
scheme of things that we had been using in our engineering stud-
ies had to be modified, but there was no very definite way to make
the modification. We were not given any definite equations.
   Now, Eddington was an astronomer and was very interested in
the general theory of relativity, and especially in testing its as-
tronomical consequences. The theory predicts a motion of the
perihelion of Mercury. It was easy to work this out and to check
that the observations agreed with the Einstein theory. This was
the first big triumph of the Einstein theory.
   Then there was the question of observing whether light would
be deflected when it passed close by the sun, another requirement
of the Einstein theory. This is something that can be tested ob-
servationally only at a time of total eclipse. Eddington had heard
about these things already in 1916, and he immediately set to
work to find out if there was a favorable total eclipse coming up
soon. He found that there would be a very favorable one occurring
in May 1919-very favorable because the time of totality was long
and also because the sun was then in a very rich field of stars, so
                                 81
20                                                                      CHAPTER!
P. A. M. DIRAC
                                 82
PERSPECTIVE VIEW OF QUANTUM SPACE-TIME SYMMETRIES                    21
   Eddington's book did not give any information about the strug-
gles that Einstein had gone through in order to set up his theory.
It just gave us the completed result. I have been very interested
in the lectures given by historians at the various Einstein sym-
posia, which enable me to understand better both Einstein's strug-
gles and also his appreciation of the need for beauty in the math-
ematical foundation. Einstein seemed to feel that beauty in the
mathematical foundation was more important, in a very funda-
mental way, than getting agreement with observation.
   This was brought out very clearly in the early work about the
theory of the electron. Hendrik Lorentz had set up a theory for
the motion of electrons that was in agreement with Einstein's
principles, and experiments were made by Walter Kaufmann to
see whether this theory was in agreement with observation. The
resulting observations did not support the theory of Lorentz and
Einstein. Instead, they supported an older theory of the electron,
given by Max Abraham. Lorentz was completely knocked out by
this result. He bewailed that all his work had gone for nothing.
Einstein seems not to have reacted very much to it. I do not know
just what he said-that's a question for the historians to decide.
I imagine that he said, "Well, I have this beautiful theory, and
I'm not going to give it up, whatever the experimenters find; let
us just wait and see."
   Well, Einstein proved to be right. Three years later the exper-
iments were done again by someone else, and the new experiment
supported the Lorentz-Einstein view of the electron. And some
years after that, a fault was found in the apparatus of Kaufmann.
So it seems that one is very well justified in attaching more im-
portance to the beauty of a theory and not allowing oneself to be
too much disturbed by experimenters, who might very well be
using faulty apparatus.
   Let us return to the general theory of relativity. The observa-
tions of the eclipse expeditions supported the theory. Right from
the beginning there was agreement. We then had a satisfactory
basis for the development of relativity. At that time I was a re-
search student and very much enjoyed the new field of work that
was opened up by relativity. One could take some previous piece
of work that had been expressed in nonrelativ,istic language and
tum it into the relativistic formalism, get a better understanding
of it, and perhaps find enough material to publish a paper.
   It was about that time, I think in 1924, that A. H. Compton
visited Cambridge and spoke at the Cavendish Laboratory about
                               83
22                                                                      CHAPTER!
P. A. M. DIRAC
                                 84
PERSPECTIVE VIEW OF QUANTUM SPACE-TIME SYMMETRIES                      23
                                85
24                                                                      CHAPTER!
P. A. M. DIRAC
                                 86
PERSPECTIVE VIEW OF QUANTUM SPACE-TIME SYMMETRIES                         25
                                  87
26                                                                    CHAPTER!
P. A. M. DIRAC
to which one could refer it. But one can transport it hom one
location in space to another, and the equations that govern this
transport are such that, if you go around a closed loop and get
back to your starting point, the final distance does not agree with
the starting value. That led to a generalization of the geometry,
which was soon found to provide just what was needed in order
to bring in the electromagnetic field.
   This seemed to;be a very wonderful solution of the problem,
but then there w~ a difficulty. Atomic events do provide a natural
scale for measuring distances. You could refer all yow distances
to this atomic scale, and then there would be no poipt at all in
having the uncertainty in distances introduced by Weyl's geom-
etry. So this very beautiful theory of Weyl was reluctantly aban-
doned.
   Einstein worked for the rest of his life on trying to solve ~s
problem of unifying the gravitational and the electromagnetic
fields. He tried one scheme after another. All were unsatisfactory.
Other people have joined in this work without achieving any
greater success than Einstein did. I have been wondering whether
Einstein was limiting his ideas too much in these attempts to
unify the gravitational and the electromagnetic fields. It seems
to me that it is quite possible that one will have to bring in
cosmological effects to arrive at a satisfactory solution of this
problem. The way cosmological effects would show up would be
this.
   Let us accept the Einstein theory as it stands for all problems
involving just classical theory, and only when we go over to atomic
problems, let us require that some modification is needed. This
modification can be expressed by saying that Planck's constant
is not really a constant in the cosmological sense but must be
considered as varying with the epoch, that is, with the time since
the origin of the universe, the "Big Bang." If we have Planck's
constant h varying, then we must also have the charge on the
electron e varying, because e21hc is a dimensionless constant that
plays an important role in physics and is observed to have the
value 11137, and it seems to be really constant. Thus, if h is
varying, e must vary according to its square root. On this basis
one could set up a new theory.
   Before introducing such a drastic revision in basic ideas, it is
desirable to have some confirmation of it by observation. If you
have these atomic constants varying when referred to the Einstein
                                88
PERSPECTIVE VIEW OF QUANTUM SPACE-TIME SYMMETRIES                        27
picture, it will mean that atomic clocks will not keep the same
time as the time of the Einstein theory. The time of the Einstem
theory is the time that governs the motion of the planets around
the sun, what astronomers call ephemeris time. So one could look
to see whether there is any difference between atomic clocks and
clocks based on ephemeris time.
   Astronomers have been studying this question. In particular,
T. C. Van Flandern, working at the Naval Research Observatory
in Washington, has spent many years studying lunar motion, re-
ferred to both ephemeris time and atomic time. Lunar motion
has been observed with atomic clocks since 1955. However, Van
Flandern's results up to the present are still not conclusive. One
must wait a little longer to see whether a difference really exists
between the two ways of measuring time.
   There is another possibility of checking these new ideas about
variation of constants that has been followed up by I. I. Shapiro.
His method consists in sending radar signals to one of the planets
and observing the reflected radar signals, and then timing the to
and fro journey with an atomic clock. In effect, he is observing
distances in the solar system with atomic clocks, and there should
be some discrepancy showing up if the atomic clocks do not keep
the same time as ephemeris time.
   Lwas'talking to Shapiro just before I came to this Symposium
and asked him about the latest information concerning his work.
There is a very good chance of observing radar waves reflected
from Mars because of the Viking Lander that landed on Mars in
1976, which can be used to send back reflected radar waves. Sha-
piro said that the time base that he has for the observations with
the Viking expedition (just about two years) is not long enough
for him to give a definite answer to this question. There were also
observations of Mars made previously with a Mariner expedition,
which give him a considerably longer time base, eight years in-
stead of two. But he told me that another year or two of work
would be needed to evaluate his results before he could answer
the question of variation of the constants.
   That is the situation at the present time. I am sorry that I cannot
offer anything more definite than that. There ate hopes that Sha-
piro will come out with a definite result, and if it is a positive
result, we shall have a new basis for looking at the question of
the unification of the gravitational and the electromagnetic fields.
It might be possible to revive Weyl's geometry, which was aban-
                                 89
28                                                                  CHAPTER I
P. A. M. DIRAC
                                      90
Chapter II
This Chapter consists of the fundamental paper of Wigner on the Poincare group
and some of the resulting papers. In his original paper, Wigner constructed
subgroups of the Lorentz group whose transformations leave the four-momentum of
a given particle invariant. These subgroups are called the little groups. In their
1948 paper, Bargmann and Wigner formulated Wigner's little groups in terms of the
infinitesimal generators. The little groups for massive, massless, and imaginary-
mass particles are locally isomorphic to the three-dimensional rotation group, the
two-dimensional Euclidean group, and the (2 + I)-dimensional Lorentz group.
In his 1945 paper, Dirac suggested the four-dimensional harmonic oscillator to
construct representations of the Lorentz group. In their 1979 papers, Kim, Noz, and
Oh constructed a representation of the O(3)-like little group for massive hadrons in
the quark model.
Quantum field theory has its place in the development of relativistic quantum theory,
and its strength and weakness are well known. As is well known, the Poincare
group is the basic language for quantum field theory, and there are many papers
which will indicate this point. In one of his 1964 papers, Weinberg gives a lucid
treatment of the role of the representations of the Poincare group in Feynman
diagrams.
REPRESENTATIONS OF THE POINCARE GROUP                                      31
By E. Wigner
  IThe possibility of a future non linear character of the quantum mechanics must be
admitted, of course. An indication in this direction is given by the theory of the positron, as
developed by PAM. Dirac (proc. Camb. Phil Soc. 30, 150, 1934, cf. also W. Heisenberg,
Zeits. f. Phys. 90, 209, 1934; 92, 623, 1934; W. Heisenberg and H. Euler, ibid. 98, 714,
1936 and R. Serber, Phys. Rev. 48, 49, 1935; 49, 545, 1936) which does not use wave
functions and is a non linear theory.
   2Cf. PAM. Dirac, The Principles of Quantum Mechanics, Oxford 1935, Chapters I and
II; J. v. Neumann, Mathematische Grundlagen der Quantenmechanik, Berlin 1932, pages
19-24.
   3The wave functions represent throughout this paper states in the sense of the
"Heisenberg picture," i.e. a single wave function represents the state for all past and
future. On the other hand, the operator which refers to a measurement at a certain time t
contains this t as a parameter. (Cf. e.g. Dirac, 1.c. ref. 2, pages 115-123). One obtains the
wave function <ps(t) of the Schroedinger picture from the wave function <PH of the
Heisenberg picture by <ps(t) = exp(-iHUij<pH • The operator of the Heisenberg picture is Q(t)   =
exp(iHUijQexp(-iHUij, where Q is the operator in the Schroedinger picture which does not
depend on time. Cf also E. Schroedinger, Sitz. d. Koen. Preuss. Akad. p. 418, 1930.
The wave functions are complex quantities and the undetermined factors in them are
complex also. Recently attempts have been made toward a theory with real wave functions.
Cf. E. Majorana, Nuovo Cim. 14, 171, 1937 and P. A. M. Dirac, in print
REPRESENTATIONS OF THE POINCARE GROUP                                                33
  4E. Wigner, Gruppentheorie und ihre Anwendungen auf die Quantenmechanik det
Atoms-pektren. Braunschweig 1931, pages 251-254.
34                                                                         CHAPTER II
A. Previous treatments
  sp. A. M. Dirac, Proc. Roy. Soc. A. 155, 447, 1936; Ai. Proca, 1. de Phys. Rad. 7, 347,
1936.
  9K1ein, Arkiv f. Matern. Astr. och Fysik, 25A, No. 15, 1936. I am indebted to Mr.
Darling for an interesting conversation on this paper.
36                                                                        CHAPTER II
The difference between the present paper and that of Majorana and Dirac
lies-apart from the finding of new representations-mainly in its greater
mathematical rigor. Majorana and Dirac freely use the notion of
infinitesimal operators and a set of functions to all members of which
every infinitesimal operator can be applied. This procedure cannot be
mathematically justified at present, and no such assumption will be used in
the present paper. Also the conditions of reducibility and irreducibility
could be, in general, somewhat more complicated than assumed by
Majorana and Dirac. Finally, the previous treatments assume from the
outset that the space and time coordinates will be continuous variables of
the wave function in the usual way. This will not be done, of course, in the
present work.
It follows from the second conditionS that there either exists a unitary
operator S by which the wave functions <1>(2) of the second representation
can be obtained from the corresponding wave functions <1>(1) of the first
representation
     <1>(2)   =   S<1>(1)                                                   (4)
or that this is true for the conjugate imaginary of <1>(2). Although, in the
latter case, the two representations are still equivalent physically, we shall,
in keeping with the mathematical convention, not call them equivalent.
The first condition now means that if the states <1>(1), <1>(2) = S<1>(I)
corres]:>Ond to each other in one coordinate system, the states n(1)(L)<1>(I)
and D(2)(L)<1>(2) correspond to each other also. We have then
     D(2)(L)<1>(2)     =    SD(l)(L)<1>(I)   =   SD(1)(L)S-I<1>(2).       (4a)
As this shall hold for every <1>(2), the existence of a unitary S which
REPRESENTATIONS OF TIlE POINCARE GROUP                                           37
transforms 0(1) into 0(2) is the condition for the equivalence of these two
representations. Equivalent representations are not considered to be really
different and it will be sufficient to find one sample from every infinite
class of equivalent representations.
Two simple theorems shall be mentioned here which will be proved later
(Sections 7A and 8C respectively). The first one refers to unitary
representations of any closed group, the second to irreducible unitary
representations of any (closed or open) group.
The second class of representations will be called factorial. For these, the
center of the representation algebra still contains only multiples of the unit
operator. Clearly, the irreducible representations are all factorial, but not
conversely. For finite dimensions, the factorial representations may
  lOp. 1. Murray and 1. v. Neumann, Ann. of Math. 37, 116, 1936; J. v. Neumann, to be
published soon.
REPRESENTATIONS OF THE POINCARE GROUP                                                    39
  11M. H. Stone, Proc. Nat. Acad. 16, 173, 1930, Ann of Math. 33,643, 1932, also J. v.
Neumann, ibid, 33, 567,1932.
40                                                                        CHAPTERll
However, the Lorentz group has many one parametric subgroups, and the
corresponding infinitesimal operators HI' H2, ... are all unbounded. For
every H an everywhere dense set of functions <\> can be found such that Hi<\>
can be defined. It is not clear, however, that an everywhere dense set can
be found to all members of which every H can be applied. In fact, it is not
clear that one such <\> can be found.
Every infinitesimal operator can be applied to R<\> if they all can be applied
to <\>, and the same holds for sums of the kind
        a1R 1<\> + a2R2<\> + ... + a"Rn<\>'                                (6)
These form, however, an everywhere dense set of functions if the
representation is irreducible.
If the representation is not irreducible, one can consider the set No of such
wave functions to which every infinitesimal operator can be applied. This
set is clearly linear and, according to the previous paragraph, invariant
under the operations of the group (i.e. contains every R<\> if it contains <\».
The same holds for the closed set N generated by No and also of the set P
REPRESENTATIONS OF THE POINCARE GROUP                                              41
It will turn out again that the inhomogeneous Lorentz group has no
pathological representations. Thus this assumption of Majorana and Dirac
also will be justified a posteriori. Every unitary representation of the
inhomogeneous Lorentz group can be decomposed into normal irreducible
representations. It should be stated, however, that the representations in
which the unit operator corresponds to every translation have not been
determined to date (cf. also section 3, end). Hence, the above statements
are not proved for these representations, which are, however, more truly
representations of the homogeneous Lorentz group, than of the
inhomogeneous group.
While all these points may be of interest to the mathematician only, the
new representation of the Lorentz group which will be described in section
7 may interest the physicist also. It describes a particle with a continuous
spin.
  lZCf. e.g. E. Wigner, l.c. Chapter III. O. Veblen and J.W. Young, Projective Geometry,
Boston 1917. Vol 2, especially Chapter VII.
 14H. Weyl, Mathem. Zeits. 23, 271; 24, 328, 377, 789, 1925; O. Schreier, Abhandl.
Mathem. Seminar Hamburg, 4,15, 1926; 5,233, 1927.
REPRESENTATIONS OF TIlE POINCill GROUP                                                        43
Sections 5, 6, 7 will deal with the "restricted Lorentz group" only, i.e.
Lorentz transformations with determinant 1 which do not reverse the
direction of the time axis. In section 8, the representations of the extended
Lorentz group will be considered, the transformations of which are not
subject to these conditions.
   15G. Frobenius, Sitz. d. Koen. Preuss. Akad. p. 501, 1898, I. Schur, ibid, p. 164, 1906;
F. Seitz, Ann. of Math. 37, 17, 1936.
44                                                                            CHAPTER II
A.
From (11) and (10) it follows that {v, w} = {Av, Aw} for every pair of
vectors v, w. This will be satisfied for every pair if it is satisfied for all
pairs vO), v(k) of four linearly independent vectors. The reality condition is
satisfied if (Av(i))* = A(v(i)*) holds for four such vectors.
REPRESENTAnONS OF TIlE POINCARE GROUP                                                   45
The scalar product of two vectors x and y is positive if both lie in the
positive light cone or both in the negative light cone. It is negative if one
lies in the positive, the other in the ne~ative light cone. Since both x and y
are time-like IX412 > IXI12 + Ixi + IX31 ; IY4 12 > IY I 12 + IY2 12 + IY31 2. Hence,
by Schwarz's inequality Ix4*Y41 > IXtYI
                                     * + x2*Y2 + x3*Y31 and the sign of the
scalar product of two real time-like vectors is determined by the product of
their time components.
This formulation of the third condition shows that the third condition holds
for the product of two homogeneous Lorentz transformations if it holds for
both factors. The same is evident for the first two conditions.
From AFA' = F one obtains by multiplying with A-I from the left and A'-I
= (A-I)' from the right F = A-IF(A-l), so that the reciprocal of a
homogeneous Lorentz transformation is again such a transformation. The
homogeneous Lorentz transformations form a group, therefore.
                   °
[1] If {v, w} = and {v, v} > 0, then {w, w} < 0; if {v, w} = 0, {v, v} = 0,
then w is either space-like, or parallel to v (either {w, w} < 0, or w = cv).
Proof:
      *
    v 4w4     v *lWl + v 2*w 2 + v 3w3'
                                   *                                          (13)
By Schwarz's inequality, then
      1v 4121w412::;; (I v   i   + 1vi + 1v 312)(lwlI2 + IW212 + IW312).      (14)
For IV412 > IVll2 + IV212 + IV312 it follows that IW412 < IWll2 + IW212 + IW312. If
IV412 = IVll2 + IV212 + IV312 the second inequality still follows if the
inequality sign holds in (14). The equality sign can hold only, however, if
the first three components of the vectors v and w are proportional. Then,
on account of (13) and both being null vectors, the fourth components are
in the same ratio also.
[2] If four vectors v(l), v(2). v(3), v(4) are mutually orthogonal and linearly
independent, one of them is time-like, three are space-like.
Proof: It follows from the previous paragraph that only one of four
mutually orthogonal, linearly independent vectors can be time-like or a
null vector. It remains to be shown therefore only that one of them is
time-like. Since they are linearly independent, it is possible to express by
them any time-like vector
                  4
       v(t) =I,akv(k).
             k=l
The scalar product of the left side of this equation with itself is positive
REPRESENTAnONS OF THE POINCARE GROUP                                                 47
and therefore
or
     ~~)Xi{V(k),      vUp«k»} > 0                                            (15)
      k
and one {v(k), v(k)} must be positive. Four mutually orthogonal vectors
are not necessarily linearly independent, because a null vector is
perpendicular to itself. The linear independence follows, however, if none
of the four is a null vector.
[3] If 'A is a characteristic value, 'A*, 'A-I and 'A*-1 are characteristic values
also.
Proof: For 'A* this follows from the fact that A is real. Furthermore, from
IA - 'All = 0 also IA' - 'All = 0 follows, and this multiplied by the
determinants of AF and F-I gives
     IAFI·IA'-'All·lFr l = IAFA'r-l-'AAI = Il-'AAI = 0,
so that 'A-I is a characteristic value also.
From {v, v} = {Av, Av} = 1'A12 {v, v} the {v, v} = 0 follows immediately
          *
for I 'A I 1. If 'A were complex, 'A * would be a characteristic value also.
The characteristic vectors of 'A and A* would be two different null vectors
and, because of [4], orthogonal to each other. This is impossible on
48                                                                           CHAPTER II
account of [1]. ThUS'). is real and va real null vector. Then, on account of
the third condition for a homogeneous Lorentz transformation, '). must be
positive.
If '). were not real, ').* would be a characteristic value also. The
corresponding characteristic vector v* would be different from v, a null
vector also, and perpendicular to v on account of [4]. This is impossible
because of [1].
[8] There is not more than one pair of conjugate complex characteristic
values, if A has no elementary divisors. Similarly, under the same
condition, there is not more than one pair A, A-I of characteristic values
whose modulus is different from 1. Otherwise their characteristic vectors
would be orthogonal, which they cannot be, being null vectors.
                                                                             (16)
and also a pair of characteristic values "-:" ').4' the modulus of which is not
1. These must be real and positive:
                     "-:,   =
                                 *
                                ~>O.                                        (16a)
The characteristic vectors of the conjugate complex characteristic values
REPRESENTATIONS OF THE POINCARE GROUP                                           49
     V3 =      V3
                   *   V4 =
                                 *
                                { V3' V4} = 1
                                V4                                   (17a)
     { V 3' V 3}   =v 4' v 4} = o.
                       {
(b) There is a pair of complex characteristic values AI' "-2 = All     =   A~, Al :F-
A;, 111.11 = 1"-21 = 1. No pair with 1"-:,1 :F- 1, however. Then on account of [8],
still "-3 = ~* which gives with 1"-31 = 1, "-3 = ± 1. Since the product
11.1"-2"-311.4 = 1, on account of the second condition for homogeneous
Lorentz transformations, also 11.4 = "-3 = ± 1. The double characteristic
value ± 1 has two linearly independent characteristic vectors v3 and v4
which can be assumed to be perpendicular to each other, {v3' v4} = o.
According to [2], one of the four characteristic vectors must be time-like
and since those of Al and "-2 are space-like, the time-like one must belong
to ± 1. This must be positive, therefore "-3 = 11.4 = 1. Out of the time-like
and space-like vectors {v3' v3} = -1 and {v4' v4} = 1,0ne can build two
null vectors v4 + v3 and v4 - v3. Doing this, case (b) becomes the special
case of (a) in which the real positive characteristic values become equal "-3
  ,\-1
= 1\.4 =   1.
(c) All characteristic values are real; there is however one pair "-3         *
                                                                              11.4
                                                                           = ~,
= ~1, the modulus of which is not unity. Then {v3' v3} = {v4' v4} = 0 and
11.3 > 0 and one can conclude for Al and "-2, as before for "-3 and 11.4 that Al
= 11.2 = ± 1. This again is a special case of (a); here the two characteristic
values of modulus 1 become equal.
(d) All characteristic values are real and of modulus 1. If all of them are
+1, we have the unit matrix which clearly can be considered as a special
case of (a). The other case is Al = "-2 = -1, "-3 = 11.4 = +1. The
characteristic vectors of Al and "-2 must be space-like, on account of the
third condition for a homogeneous Lorentz transformation; they can be
assumed to be orthogonal and normalized to -1. This is then a special case
of (b) and hence of (a) also. The cases (a), (b), (c), (d), are illustrated in
Fig. 1.
that Ae is real, positive and ve' we can be assumed to be real also. The last
equation now becomes {ve' we} = A2e{ve' we} so that either Ae = 1 or {ve'
we} = O. Finally, we have
     {we' we}     =   {Aewe,Aewe}
                A2e {we' we} + 2Ae{we' Ve} + { Ve' Ve}'
                      =
From (19c) we see that the root of the elementary divisor is 1 and this is at
                                                                     '*
least a double root. If A had a pair of characteristic values Al 1, A2 =
A~l, the corresponding characteristic vectors VI and v2 would be
orthogonal to ve and therefore space-like. On account of [5], then IAll =
1"-21 = 1 and {VI' v2} = O. Furthermore. from {we' VI} = {Aewe' Aevl} =
AI{We• VI} + AI{Ve• VI} and from {ve, VI} = 0 also {we' VI} = 0 follows.
Thus all the four vectors VI' v2' ve' we would be mutually orthogonal.
This is excluded by [2] and (19).
Two cases are conceivable now. Either the fourfold characteristic root has
only one characteristic vector, or there is in addition to ve (at least) another
characteristic vector v l' In the former case four linearly independent
vectors ve' we' ze' xe could be found such that
     Ae v e   =       ve
     Atfe = ze + we             Aexe = xe + ze
However {ve, xe} = {Aeve' Aexe} = {ve, xe} + {ve, ze} from which {ve,
ze} = 0 follows. On the other hand
     {we,ze}      =   {Aewe,Aeze}
                  =   {We,Ze} + {we,we}+{v e,ze}+{v e'we },
This gives with (19a) and (19b) {ve' ze}        =   1 so that this case must be
52                                                                       CHAPTER II
excluded.
The Ae(Y) which have the invariant null vector ve and also we (and hence
also VI) in common and differ only by adding to we different multiples YVe
of ve' form a cyclic group with Y= 0, the unit transformation as unity:
     Ae(y)AlY) = Ae(Y +         i)·
The Lorentz transformation M( a) which leaves vIand we invariant but
replaces ve by aVe (and ze by a-1ze) has the property of transforming Ae(Y)
REPRESENTATIONS OF TIlE POINCARE GROUP                                                   53
into
       M( a)Ae(y)M( ar1 = Ae( ay).                                                 (+)
An example of Ae(Y) and M(a) is
                               1 0    0      0
                               o1 Y          Y
                       Ae(Y) = 0 -y 1-1ft( -1ft(
                                    oy      1fzr 1+1fzr
                                    10        o             o
                                    o1        o             o
                       M (ex)   =   0 0 1fz(ex+a.-1) 1fz(a.-a1)
                                    o 0 1fz(a.-a.-1)   1fz(a.+a.-1)
A behavior like (+) is impossible for finite unitary matrices because the
characteristic values of M(ar 1Ae(y)M(a) and ~(y) are the same-those of
Ae(ya) = Ae(Y)CX the a th powers of those of Ae(Y). This shows very simply
that the Lorentz group has no true unitary representation in a finite number
of dimensions.
The homogeneous Lorentz transformation is, from the point of view of the
physicist, a transformation to a uniformly moving coordinate system, the
origin of which coincided at t = 0 with the origin of the first coordinate
system. One can, therefore, first perform a rotation which brings the
direction of motion of the second system into a given direction-say the
direction of the third axis-and impdii. it a velocity in this direction, which
will bring it to rest. After this, the two coordinate systems can differ only
in a rotation. This means that every homogeneous Lorentz transformation
can be decomposed in the following way17
     A   =   RZS                                                          (21)
where Rand S are pure rotations, (i.e. ~4 = R4i = Si4 = S4i = 0 for i 4    *
and R44 = S44 = 1, also R' = R-1, S' = S-1) and Z is an acceleration in the
direction of the third axis, i.e.
                                    1 000
                                    o 100
                                 Z= OOab
                                    o0 b a
with a2 - b2 = 1, a> b > O. The decomposition (21) is clearly not unique.
It will be shown, however, the Z is uniquely determined, i.e. the same in
every decomposition of the form (21).
axis. Then we take 134 = (Ai4 + A~ + A~4)i and 144 = A44 for b and a to
form Z; they satisfy the equation I~ - 1~4 = 1. Hence, the first three
components of the fourth column of J = Z-1 I = Z-1 R-l A will become zero
                         2 2 2 2
and J44 = 1, because of J44 - J14 - J24 - J34 = 1. Furthermore, the first three
components of the fourth row of J will vanish also, on account of J~4 - J~l -
J~2 - J~3 = 1, i.e. J = S = Z-1 R -1 A is a pure rotation. This proves the
possibility of the decomposition (21).
The trace of AA' = RZ2R-l is equal to the trace of Z2, i.e. equal to 2a2 +
2b2 + 2 = 4a2 = 4b2 + 4 which shows that the a and b of Z are uniquely
determined. In particular a = 1, b = 0 and Z the unit matrix if AA' = 1, i.e.
A a pure rotation.
It is easy to show now that the group space of the homogeneous Lorentz
transformations is only doubly connected. If a continuous series A(t) of
homogeneous Lorentz transformations is given, which is unity both for t =
o and t = 1, we can decompose it according to (21)
     A(t)    =   R(t)Z(t)S(t).                                           (21a)
It is also clear from the foregoing, that R(t) can be assumed to be
continuous in t, except for values of t, for which A14 = A24 = A34 = 0, i.e.
for which A is a pure rotation. Similarly, Z(t) will be continuous in t and
REPRESENTATIONS OF TIlE POINCARE GROUP                                                       55
this will hold even where A(t} is a pure rotation. Finally, S           =   Z-IR-IA will
be continuous also, except where A(t} is a pure rotation.
We can form a new group14 from the Lorentz group, the elements of which
are the elements of the Lorentz group, together with a way A(t),
connecting A(l) = A with the unity A(O) = E. However, two ways which
can be continuously deformed into each other are not considered different.
The product of the element "A with the way A(t}" with the element "I
with the way I(t}" is the element AI with the way which goes from E along
A(t) to A and hence along AI(t} to AI. Clearly, the Lorentz group is
isomorphic with this group and two elements (corresponding to the two
essentially different ways to A) of this group correspond to one element of
the Lorentz group. It is well known,I8 that this group is holomorphic with
   18Cf. H. Weyl, Gruppentheorie und Quantenmechanik, 1st. ed. Leipzig 1928, pages
110-114, 2nd ed. Leipzig 1931, pages 130-133. It may be interesting to remark that
essentially the same isomorphism has been recognized already by L. Silberstein, l.c. pages
148-157.
56                                                                        CHAPTER II
rotations are contained in the invariant subgroup also, (21) shows that this
holds for all elements of the homogeneous Lorentz group.
It follows from this that the homogeneous Lorentz group has apart from
the representation with unit matrices only true representations. It follows
then from the remark at the end of part B, that these have all infinite
dimensions. This holds even for the two-valued representations to which
we shall be led in Section 5 equ. (520), as the group elements to which the
positive or negative unit matrix corresponds must form an invariant
subgroup also, and because the argument at the end of part B holds for
two-valued representations also. One easily sees furthermore from the
equations (52B), (52C) that it holds for the inhomogeneous Lorentz group
equally well.
A.
(a) From the point of view of the physicist, the natural definition of the
continuity of a representation up to a factor is as follows.              The
neighborhood 0 of a Lorentz transformation Lo = (b, I) shall contain all the
transformations L = (a, A) for which lak - bkl < 0 and IAik - Iikl <      The           o.
representation up to a factor D(L) is continuous if there is to every positive
number e, every normalized wave function <\> and every Lorentz
transformation Lo such a neighborhood 0 of Lo that for every L of this
neighborhood one can find an n of modulus> 1 (the n depending on L
and <\» such that (u<l>' u<l» < e where
        u<l>    =   (D(Lo) - QD(L)<\».                                                       (23)
Let us now take a point Lo in the group space and find a normalized wave
function <\> for which I(<\>, DLO<\»I > 1/6. There always exists a <\> with this
property, if I(<\>, D(~o)<\»1 < 1/6 then 'V = a<\> + ~D(Lo)<\> with suitably
chosen a and ~ will be normalized and I('V, D(Lo)'V)1 > 1/6. We consider
then such a neighborhood N ofLo for all L of which IC<\>, DCL)<\»I > 1112. It
is well known 19 that the whole group space can be covered with such
neighborhoods. We want to show now that the D(L)<\> can be multiplied
with such phase factors (depending on L) of modulus unity that it becomes
strongly continuous in the region N.
We shall chose that phase factor so that (<\>, D(L)<\» becomes real and
positive. Denoting then
        (D(L 1) - D(L»<\>           =   U<l>'
the (V<l>' V<l» can be made arbitrarily small by letting L approach
sufficiently near to L 1, if Ll is in N. Indeed, on account of the continuity,
as defined above, there is an n = eik such that (u, u) < e if L is sufficiently
     1"rhis    condition is the "separability" of the group. Cf. e.g. A. Haar, Ann. of Math., 34,
147,1933.
REPRESENTATIONS OF THE POINCARE GROUP                                             59
near to LI where
     u = (D(L I ) -     eikD(L))$.                                        (23')
Taking the absolute value of the scalar product of u with $ one obtains
because of Schwartz's inequality. If only ..fE < 1112, the k must be smaller
than rc/2 because the absolute value is certainly greater than the real part
and both ($, D(L I )$) and ($, D(L)$) are real and greater than 1112.
U\jI = (23a)
                                                                          (23b)
with suitaly chosen n's. According to the foregoing, it also is possible to
choose Land Ll so close that (Vql' V<)l) < E.
Subtracting (23') and (23a) from (23b) and applying D(L)-l on both sides
gives
    (n\jl- Q\jI+<)l)'!' + (1- ~+<)l)<\> = D(Lrl(u\jI+<)l- u\jI- U<)l)
The scalar product of the right side with itself is less than 9E. Hence both
                      1                            1         1
I~ - n\jl + <)ll < 3E2 and 11 - ~ + <)ll < 3£2 or 11 - ~I < 6£2. Because of V\jI
                                             1         1
= ~ - (1 - ~)D(L),!" the (V\jI' V\jI)2 < (u\jl' U\jl)2 + 11 - ~I and thus (V\jI'
V\jI) < 49£.
This completes the proof of the theorem stated under (b). It also shows
that not only the continuity of D(L),!, has been achieved in the
neighborhood of Lo by the normalization used in (a) but also that of D(L),!,
with every '!', i.e., the continuity ofD(L).
It is clear also that every finite part of the group space can be covered by a
finite number of neighborhoods in which D(L) can be made continuous. It
is easy to see that the ro of (22) will be also continuous in these
neighborhoods so that it is possible to make them continuous, apart from
regions of lower dimensionality than their variables have. In the following
only the fact will be used that they can be made continuous in the
neighborhood of any a, b, and A.
B.
(a) We want to show next that all T(a) commute. From (22B) we have
     T(a)T(b)T(a)-l        =   e(a, b)T(b)                                 (24)
where          c(a,       b) = ro(a, b)/ro(b, a)                     and hence
     e(a, b)   =   e(b, arlo                                              (24a)
  20G. Hamel, Math. Ann. 60, 460, 1905, quoted from H. Hahn, Theorie der reellen
Funktionen. Berlin 1921, pages 581-583.
62                                                                                     CHAPTERll
     d(A)T(b)d(A)-ld(A)T(a)d(A)-ld(A)T(b)-ld(A)-l
         = ro(A, b)T(Ab)ro(A, a)T(Aa)ro(A, b)-lT(Abr l
         = ro(A, a)c(Ab, Aa)T(Aa),
while the first line is clearly d(A)c(b, a)T(a)d(At l              =   ro(A, a)c(b, a)T(Aa)
whence
     c(b, a)     =       c(Ab, Aa)                                                    (31)
holds for every Lorentz transformation A. Combined with (30) this gives
     ~(fleA.a~A. - 2.JvIlAv';"JUI.a~A.)           =    n'(a, b),
where n'(a, b) is agliin an integer. As this equation holds for every a, b
It is well to remember that it was necessary for obtaining this result to use
the existence of d(A) satisfying (22C).
As the four linearly independent directions e(1), ... ,e(4) we shall take four
null vectors. If e is a null vector, there is, according to section 3, a
homogeneous Lorentz transformation21 Ae such that Aee = 2e.
  21The index e denotes here the vector e for which Aee = 2e; this Ae has no elementary
divisor.
REPRESENTATIONS OF TIlE POINCARE GROUP                                                63
It must be shown that if VI' v2' v3' ... is a sequence of dyadic fractions.
converging to 0, lim T(vie) = 1. From T(a)·T(O) = coCa, O)T(a) it follows
that T(O) is a constant. According to the theorem of part (A)(b), the T(ve),
if multiplied by proper constants Oy will converge to 1, i.e., by choosing
an arbitrary <\>, it is possible to make both (l-nvT(ve»<\> = u and
(I-11vT(ve»'d(Aet1<j> = u' arbitrarily small, by making v small. Applying
d(Ae) to the second expression, one obtains, for (36a), that (1 - OyT(2ve»<j>
= d(A£)u' is also small. On the other hand, applying T(ve) to the first
expression one sees that (T(ve) - OyT(2ve»<\> = T(ve)u approaches zero
also. Hence. the difference of these two quantities (1 - T(ve»<\> goes to
zero, i.e. T(vie)<j> converges to <\> if vI' v2' v3' ... is a sequence of dyadic
fractions approaching O.
                                                                                (39)
where Vo is a vector independent of A, the meA, a) wil~ become meA, a) =
exp(2ni{(A - l)a, vo}). Then meA, a) in (22C) will disappear if we replace
T(a) by exp(2ni{a, vo} )T(a).
                                   -S3 C3     0 0
                                    o    0 C 3 S3
                                    o    0 S3 C 3
Now
     f(X(a, y)R)   =   R-I(1-X(a, y)-I)vx+(I-J(I)vX
                =   (1-(X(a, y)R)-I)vX.
One easily concludes from (38) that the f(E) corresponding to the unit
operation vanishes and f(A-I) = - Af(A). Hence f(R-IX(a, yrI) = (1 -
X(a,yR)vx; and one concludes further that for all Lorentz transformations
A = RX( a,y)S, (39) holds with Vo = -vx if Rand S are rotations. However,
every homogeneous Lorentz transformation can be brought into this form
(Section 4C). This completes the proof of (39) and thus of ro(A, a) = 1.
D.
The quantities ro(a, b) and ro(A, a) for which it has just been shown that
they can be assumed to be 1, are independent from the normalization of
d(A). We can affix therefore an arbitrary factor of modulus 1 to all the
d(A), without interfering with the normalizations so far accomplished. In
consequence hereof, the ensuing discussion will be simply a discussion of
the normalization of the operators for the homogeneous Lorentz group and
the result to be obtained will be valid for the group also.
                                Figure 2:
because the procedure to be followed for the Lorentz group can be
especially simply demonstrated for this group, the three dimensional
rotation group shall be taken up first.
It is well known that the nomalization cannot be carried so far that meA, I)
=  1 in (22D) and there are well known representations for which meA, I) =
± 1. We shall allow this ambiguity therefore from the outset.
One can observe, first, that the operator corresponding to the unity of the
group is a constant. This follows simply from d(A)d(E) = meA, E)d(A).
The square of an operator corresponding to an involution is a constant,
therefore.
The operator corresponding to the rotation about the axis e by the angle 1t;
normalized so that its square be actually 1, will be denoted bye; e 2 = 1.
The e are-apart from the sign-uniquely defined.
                                                                         (44)
Now d(R) commutes with every deS) if S is also a rotation about v. This is
proved in equations (24)-(30). The fu in (30) must vanish on account of
(29).
                                 Figure 3:
(Also, both Rand S can be arbitrarily accurately represented as powers of
a very small rotation about v). Hence, transforming (44) by deS) one
obtains
                                                                        (44a)
For computing d(R)d(T) we can draw the planes perpendicular to the axes
of rotation of R and T and use for d(R)= eRe C such a development that the
axis ec of the second involution coincide with the intersection line of the
REPRESENTATIONS OF TIlE POINCARE GROUP                                             69
For the Lorentz group, the proof can be performed along the same line,
only the underlying geometrical facts are less obvious. Let A be a Lorentz
transformation without elementary divisors with the characteristic values
e 2iy, e-2iy, e2x, e-2X and the characteristic vectors VI' v2 = v~, v3' v4' as
described in section 4B.
Both Ns and Nt must satisfy the first and third condition for Lorentz
transformations (cf. 4A) and both determinants must be either 1, or -1.
Furthermore, the square of both of them must be unity.
If both determinants were +1, the Nt had to be unity itself, while Ns could
be the unity or a rotation by 1t in the vlv2 plane. Thus VI' v2' v3' v4 would
be characteristic vectors of N itself.
(46e)
     MtJ!    =   -Sh2X·(J!-Ch2X·tw
Thus M also becomes a product of two reflections, one in the v I v2 = s'vSv
the other in the v3v4 = t/J!tJ! plane. This completes the decomposition of A
into two involutions. One of the involutions can be taken to be a rotation
by 1t in an arbitrary space like plane, intersecting both the vIv2 and the
v3v4 planes, as the freedom in choosing v and Il allows us to fix the lines
sV' and \t arbitrarily in those planes. The involution characterized by (46)
will be called NVJ! henceforth. The other involution M is then a similar
REPRESENTATIONS OF TIlE POINCARE GROUP                                              71
The d(M) and deN) so normalized that their squares be 1 shall be denoted
by d l (My/!) and d l (Nv/!). We must show that the normalization for
     d(A)   =   ±d1(Mv/!)d1(Nv/!)                                           (47)
is independent of v and Jl. For this purpose, we transform
                                                                           (47 a)
with deAl) where Al has the same characteristic vectors as A but different
characteristic values, namely eiv , e-iv , e/! and e-/!. Since A1MooAl- 1 = Mv/!
and AINooAI-I = Nv/! we have d(AI)dl(Moo)d(Altl = rodiMy/!) where ro
= ± 1, as the squares of both sides are 1. Hence, (47a) becomes if
transformed with deAl) just
     d(A1)d(A)d(A1)-1    ±d1(Mv/!)d1(Nv/!).
                          =                                        (47b)
The normalization (47) would be clearly independent of v and Jl if deAl)
commuted with d(A).
In order to have the analogue of (45), we must show that, having. two
Lorentz transformations A = My/!Nv/! and I = Pa.~~~ we can choose v, Jl
and ex, ~ so that Nv/! = Pa.~ i.e. that the plane of rotation Sytf.L of NvlJ.
coincide with the plane of rotation ofPa.~. As the latter plane can be made
to an arbitrary space like plane intersecting both the wIw2 and the w3w4
planes (where WI' w2' w3' w4 are the characteristic vectors of I), we must
show the existence of a space like plane, intersecting all four planes vlv2'
v3v4' wlw2' w3w4. Both the first and the second pair of planes are
orthogonal.
72                                                                                        CHAPTER II
   22We first suppose the existence of a real plane p intersecting all four planes v I v2' V3V4'
WI wZ'  w3w4· If p intersects VI Vz the plane q perpendicular to p will intersect the plane v3v4
perpendicular to v I V2' Indeed, the line which is perpendicular to both p and v I v2 (there is
such a line as p and VI Vz intersect) is contained in both q and v3v4 . This shows that if there
.
is a plane intersecting all four planes, the plane perpendicular to this will have this property
also .
If the plane p-the existence of which we suppose for the time being--contains a time-like
vector, q will be space-like (Section 4B, [I)). Both in this case and if p contains only
space-like vectors, the theorem in the text is valid. There is a last possibility, that p is
tangent to the light cone, i.e. contains only space like vectors and a null vector Y. The
space-like vectors of p are all orthogonal to v, otherwise p would contain time-like vectors
also. In this case the plane q, perpendicular to p will contain v also. The line in which v I vz
intersects p is space-like and orthogonal to the vector in which v3 v4 intersects p. The latter
intersection must coincide with v, therefore, as no other vector p is orthogonal to any
space-like vector in it. Hence, v is the intersection of p and v3v4 and is either v3 or v4' One
can conclude in the same way that v coincides with either w3' or w4 also and we see that if
p is tangent to the light cone the two transformations A and I have a common null vector as
characteristic vector. Thus the theorem in the text is correct if we can show the existence of
an arbitrary real plane p intersecting all four planes VI vz• v3v4 • WI wZ' w3w4 ·
REPRESENTATIONS OF THE POINCARE GROUP                                                                73
Let us draw a coordinate system in our four dimensional space, the Xl Xz plane of which is
the v 1v2 plane, the x3 and x4 axes having the directions of the vectors v3 - V4 and v3 + v4'
respectively. The three dimensional manifold M characterized by x4 = 1 intersects all
planes in a line, the v1v2 plane in the line at infinity of the Xl x2 plane, the v3v4 plane in the
x3 axis. The intersection of M with the w 1w2 and w3w4 planes will be lines in M with
directions perpendicular to each other. They will have a common normal through the
origin of M, intersecting it at reciprocal distances. This follows from their orthogonality in
the four dimensional space.
If A and I have one common characteristic null vector, v3 = w3' the others,
v4 and w4 respectively, being different, one can use an artifice to prove
(49a) which will be used in later parts of this section extensively. One can
find a Lorentz transformation J so that none of the pairs I - J; A - U; AU -
J-l has a common characteristic null vector. This will be true, e.g. if the
characteristic null vectors of J are v4 and another null vector, different
from v3' w4 and the characteristic vectors of AI. Then (49a) will hold for
all the above pairs and
     d(A)d(I) = ±d(A)d(I)d(J)d(r1) = ±d(A)d(IJ)d(r 1)
               = ±d(A/J)d(r1) = ±deAl).
This completes the proof of (49a) for all cases in which A, I and AI have
no elementary divisors. It is evident also that we can replace in the
normalization (47) the d 1 by d. One also concludes easily that d(M)2 is in
the same representation either + 1 for all involutions M, or -1 for every
involution. The former ones will give real representations, the latter ones
representations up to the sign.
                                        oIIlhOi
                                           II0
                             Ae =       0 0 1 0
                                        o0   0 1
and can be written, in the same scheme, as the product· of two Lorentz
transformations with the square 1
                             1 -1 1f2 0            1 000
                             0-1 1 0               0-1 0 0
            Ae   = Mello =   0 0 1 0               o0 1 0
                             o0     0-1            o 0 0-1
and thus d(Mo)d(N o) = ± d(M 1)d(N 1). This shows also that even if AI is in
case (e), meA, I) = ± 1, since (49) leads to the correct normalization.
±d(M)d(N)d(IJ)d(J)-1 = ±d(M)d(NIJ)d(J)-1
                  d(AlJ)d(r 1).
The last product has, however, the normalization corresl'onding to two
involutions, as was shown in (49a), since neither AU, nor J- 1 is in case (e).
Lastly, we must consider the case when both A and I may have an
elementary divisor. In this case, we need a J such that neither J, r1, U
have one. Then, because of the generalization of (49a) just proved, in
which the first factor is in case (e)
     d(A)d(I) = ±d(A)d(I)d(J)d(r 1) = ±d(A)d(IJ)d(r1)
               = ±d(A/J)d(r1)
specifically, (22B) and (22C) with (\l(a, b)            =   (\l(A, a)   =   1 and (22D) with
(\l(A, I) = ± 1.
                                               E.
Lastly, it shall be shown that the renormalization not only did not spoil the
partly continuous character of the representation, attained at the first
normalization in part (A) of this section, but that the same holds now
everywhere, in the ordinary sense for T(a) and, apart from the ambiguity of
sign, also for d(A). For T(a) this was proved in part (B)(b) of this section,
for d(A) it means that to every AI' E and <\> there is such a 0 that one of the
two quantities
                                                                                        (51)
if A is in the neighborhood 0 of AI' The inequality (51) is equivalent to
For d(A), equations (46) show that as A approaches E (Le., as <\> and X
approach zero) both Moo and Noo approach the same involution, which we
shall call K. Let us now consider a wave function l.!, = <\> + d l (K)<\> or, if this
vanishes", = <\> - dl(K)<\>. We have dl(K)", = ± ",. If A is sufficiently near
to unity, d l (Noo)'" will be sufficiently near to Od l (K)", = ± 0", and all we
have to show is that 0 approaches ± 1. The same thing will hold for
d l (Moo)· Indeed from d 1(Noo)'" - 0", = u it follows by applying d 1(Noo)
on both sides", - 0 2", = (dl(Noo) + O)u. As (u, u) goes to zero, 0 must
go to ± 1, and consequently, also dl(Noo)'" goes to '" or to -",. Applying
d l (MOO) to this, one sees that d l (MoO)d l (NOO)'I' = d(A)'I' goes to ± 'I' as A
goes to unity. The argument given in (A) (b) shows that this holds not only
for 'I' but for every other function also, i.e. dA converges to ± I = deE) as
A approaches E. Thus d(A) is continuous in the neighborhood of E and
hence everywhere.
According to the last remark in part 4, the operators ± d(A) form a single
valued representation of the group of complex unimodular two
dimensional matrices C. Let us denote the homogeneous Lorentz
transformation which corresponds in the isomorphism to C by C Our task
of solving the equs. (22) has been reduced to finding all single valued
unitary representation[s] of the group with the elements [a,C] = [a,l] [O,C],
the multiplication rule of which is [a,C I ] [b,C 2] = [a + Cl b,C I C 2]. For the
representations of this group D[a, C] = T(a)d[C] we had
     T(a)T(b) = T(a+b)
     d[C]T(a) = T(Oz)d[C]                                                   (52a)
     d[C 1]d[C 2] = d[C 1C2]
It would be more natural, perhaps, from the mathematical point of view, to
use henceforth this new notation for the representations and let the d
depend on the C rather than on the C or A. However, in order to be
reminded on the geometrical significance of the group elements, it
appeared to me to be better to keep the old notion. Instead of the equations
(22B), (22C), (22D) we have, then
     T(a)T(b)    =   T(a+b)                                                 (52B)
     d(A) T(a)   =   T(Aa) d(A)                                             (52C)
     d(A) d(l) = ±d(Al).                                                    (52D)
This section, unlike the other ones, will often make use of methods, which
though commonly accepted in physics, must be further justified from a
rigorous mathematical point of view. This has been done, in the
meanwhile, by 1. von Neumann in an as yet unpublished article and I am
much indebted to him for his cooperation in this respect and for his
78                                                                          CHAPTER II
A.
Of course, the fact that the Lorentzian scalar product enters in the
exponent, rather than the ordinary, is entirely arbitrary and could be
changed by changing the signs of PI' P2' P3'
The unitary scalar product of two wave functions is not yet completely
defined by the requirements so far made on the coordinate system. It can
be a summation over S and an arbitrary Stieltjes integral over the
components of p:
            (\jf, q,)   =    tJ      \jf(p, S)*q,(p, S)df(p, /;).           (54)
This equation defines the function P(A)$, which is, at the point p, S, as
great as the function <\> at the point A-Ip, /;;. The operator peA) is not
necessarily unitary, on account of the weight factor in (54). We can easily
calculate
     P(A)T(a)$(p,             s)       = T(a)<\>(A-lp, S) = ei{A-lp, a}$(A-lp, S),
            = "£..,;Q(P,       Tl
                               A)1;l1<\>(A -Ip, n).
                Tl
As the exponentials form a complete set of functions, we can approximate
the operation of multiplication with any function of PI' P2' P3' P4 by a
linear combination
    f(P)<\> = LCnT(an)$·                                                                         (58)
                      n
of the group. It commutes evidently with the T(a) and the Q(p, A), and on
account of (56) and (58), (58a) also with peA). Thus the operation of (58)
belongs to the centrum of the algebra of our representation. Since,
however, we assume that the representation is factorial (cf. 2), the centrum
contains only multiples of the unity and
     !(P)$(P, ~)           =       c$(P, ~).                           (58b)
This can be true only if $ is different from zero only for such momenta p
which can be obtained from each other by homogeneous Lorentz
transformations, because f(p) needs to be equal to f(p') only if there is a A
which brings them into each other.
These representations can now naturally be divided into the four classes
enumerated in section 3, and two classes contain two subclasses. There
will be representations, the wave functions of which are defined for p such
that
             (l ){p, p}        =   P<0           (3) p = 0
     Jr
          df(p,   ~)   =       J   Ar
                                     dj(p, 11)                          (59)
Neither T(a) nor d(A) can have matrix elements between such 11 and /; for
which (59) does not hold.
Since no wave function of the other classes can remain invariant under all
translations, no representation of the third class can be contained in any
representation of one of the other classes. In the other classes, the
variability domain of p remains three dimensional. It is possible,
therefore, to introduce instead of PI' P2' P3' P4 three independent variables.
In the cases 1 and 2 with which we shall be concerned most, PI' P2' P3 can
be kept for these three variables. On account of (59), the Stieltjes integral
can be replaced by an ordinary integral24 over these variables, the weight
                                                      1
In fact, with the weight factor Ip4 1- 1 the weight of the domain r i.e., Wr =
f f fr 1P41-ldPldP2dP3 is equal to the weight of the domain W AT as
required 25 by (59). Having the scalar product fixed in this way, P(A)
becomes a unitary operator and, hence, Q(A) will be unitary also.
which comes out to be (P + pi +       P~ + p;)2(P + p'i + p'; + p';)~.         Equ. (59a) will not be
used in later parts of this paper.
82                                                                                            CHAPTER II
     lim
     h=O k
             I±h-2(T(2he k) -        2T(he k) + 1)'1'          =    (pi + p~ + pi - p~)'I'
                   = -P'l',                                                                   (61)
where t1c is a unit vector in (or opposite) the kth coordinate axis and the ±
is + for k = 4, and - for k = 1,2,3.
if it exists, would be different from -P<\>o Suppose the limit in (6Ia) exists
and is -P<\> + <\>'. Let us choose then a normalized '1', from the above set,
such that ('1',<\>') = 0 with 0 > 0 and an h so that the expression after the lim
sign in (6Ia) assumes the value -P<\> + <\>' + u with (u, u) < 0/3 and also the
expression after the lim sign in (61), with oppositely directed ek becomes
-P'I' + u' with (u', u') < 0/3. Then, on account of the unitary character of
T(a) and because ofT(-a) = T(at!
     ('I',    L±h- (T(2he 2T(he
                       2       k) -                  k)     + 1)<1»,
                 (L±h- (T(-2he
               k
                           2                k)   -        2T(-hek ) + 1)'1',<1»,
                       k
or
     -P('I',<\»    + ('1',<\>') + ('I',u)                  -P('I',<\»   + (u',<\»,
which is clearly impossible.
B.
c.
On account of (53) and (56), (57), the equs. (52B) and (52C) are
au~om~tical1y satisfied and the Q(p, A)~T\ must be determined by (52D).
ThIS gIves
     L Q(P,A)r.,TlQ(A-Ip,l)Tle<\>(I-1A-Ip,S)
     TIe
                    =    ±IQ(p,AI)r.,e<\>(I-1A-1p,S).                    (63)
                            e
Since this must hold for every <\>, one would conclude
84                                                                              CHAPTERll
Adopting this interpretation of (64), one can also see, conversely, that the
representation q('A,) of the little group, together with the class and P of the
representation of the whole group, determines the latter representation,
apart from a similarity transformation. In order to prove this, let us define
for every p a two-dimensional unimodular matrix a(p) in such a way that
the corresponding Lorentz transformation
                                                                                (65)
brings Po into p. The a(p) can be quite arbitrary except of being an almost
everywhere continuous function of p, especially continuous for p = Po and
a(po) = 1. Then, we can set
REPRESENTATIONS OF TIlE POINCARE GROUP                                                85
Assuming this transformation to be carried out, (66) will be valid and will
define, together with d(A), all the remaining Q(p, A) uniquely. In fact,
calculating d(A)~(p, S), we can decompose A into three factors
(67)
The second factor ~             =   a(pt1Aa(k 1p) belongs into the little group:
&.(prlfi(klp)po = &.(pr11klp = O(prlp = po. We can write, therefore
(kIp = p')
     d(A)<1>(P, S) = d(a(p))d(~)d(a(p'))-l~(p, S)
             = d(~)d(a(p'))-l<1>(po, S)                                       (67a)
             = Lq(~)~Ttd(a(p'rl)~(po' 11)          =   Lq(~)~Tt<1>(P', 11)·
                 Tt                                    Tt
On the other hand, if the representations of the whole group are equivalent,
the representations of the little group are equivalent also:             the
representation of the whole group determines the representation of the
86                                                                       CHAPTER II
The representation of the little group was defined as the set of matrices
Q(po' A)1;11 if the representation is so transformed that (53) and (66a) hold.
Having two equivalent representations D and SDS- I = DO for both of
which (53) and (66a) holds, the unitary transformation S bringing the first
into the second must leave all displacement operators invariant. Hence, it
must have the form (57a), i.e., operate on the Sonly and depend on p only
as on a parameter.
     S<j>(p,s)   =   LS(P)~l1<j>(P,ll).                                  (68)
                     '11
Denoting the matrix Q for the two representations by Q and QO, the
condition SD(A) = DO(A)S gives that
     LS(P)~'I1Q(P,A)119      =   LQ°(P,A)~l1S(A-lp)119
      11                          11
holds, for every A, for almost every p. Setting A = a(PI) we can let p
approach PI in such a way that (68a) remains valid. Since Q is a
continuous function of p both Q(p, A) and QO(p, A) will approach their
limiting value 1. It follows that there is no domain in which
     S(PI) = S(a(Plrlpl) = SCpo)                                         (69)
would not hold, i.e., that (69) holds for almost every Pl. Since all our
equations must hold only for almost every p, the S(P)1;1l can be assumed to
be independent of p and (68a) then to hold for every p also. It then follows
that the representations of the little group D and DO are transformed into
each other by S1;1'r
group.
                                          D.
Lastly we shall determine the constitution of the little group in the
different cases.
1+. In case 1+ we can take for Po the vector with the components 0, 0, 0, 1.
The little group which leaves this invariant obviously contains all rotations
in the space of the fIrst three coordinates. This holds for the little group of
all representations of the first class.
00. In case 00' little group is the whole homogeneous Lorentz group.
         (ab)
          cd
                   (X4+~3X1+ix2)
                      xClx2 x4-x3
                                      (a:<)
                                       b d
                                                =   (~4+~/3Xll+iX».
                                                     x Cix'2 x'4-x 3
                                                                           (70)
                °
         ( e-i~12 (x +el~/2
                       iy')ei~/2)
                                  '
                                                                           (71)
(71a)
These equations are analogous to the equations (52)-(52D) and show that
the little group is, in this case, isomorphic with the inhomogeneous
rotation group of two dimensions, i.e. the two dimensional Euclidean
group.
(72)
(72a)
This show also that Iq(R)u1 ~ 1 and the q(R~A. are therefore, as functions
of R, square integrable:
     J    Iq(R)k1.J2dR
(74)
We shall calculate now the integral over group space of the product of
D(j)(R)*kl and
     q(RS)/q.L    =     ~ q(R)ki..q(S)~.                                           (75)
~ (~lq(S)~12)2
     L DW(S-l):UJDV)(RS):nq(RS)/q.LdR                                              (77)
      m
.t.-q(S);.~J D(})(R)~(R)ki..dR.
In the invariant integral on the left of (77), R can be substituted for RS and
we obtain, for (74) and the unitary character
converges, for (74) and (72a). The integration of (78) yiel;!s thus
REPRESENTAnONS OF TIlE POINCARE GROUP                                                                          91
(80)
or
     C~    =       cj(;.                                                                                (81)
On the other hand q(E)kA. =                    Ou yields
     I,CJ:k        =        Ok)..                                                                      (82)
     jk
These formulas suffice for the reduction of q(R). Let us choose for every
finite irreducible representation D(j) an index k, say k = O. We define then,
in the original space of the representation q(R) vectors v(kjl) with the
components
      kl     fa,..k3
     Cjkl' Cjkl' L.jkl'                  ...       •
The vectors v(kjl) for different j or 1 are orthogonal, the scalar product of
those with the same j and 1 is independent of 1. This follows from (79) and
(81)
= ~ C~c7-~k = OiJ~,OlrC~.
The v(kjl) for all k, j, 1, form a complete set of vectors. In order to show
this, it is sufficient to form, for every v, linear combination from them, the
v component of which is 1, all other components O. This linear
combination is
     ~CVk          (Jcjl)                                                                               (84)
     £.J jlk V              •
     kjl
However, two v with the same j and t but different first indices k are not
orthogonal. We can choose for every j and t, say t = 0 and go through the
vectors v(ljO), v(2jO), . .. and, following Schmidt's method, orthogonalize
and normalize them. The vectors obtained in this way shall be denoted by
Then, since according to (83) the scalar products (V(kjl), V(Aj/) do not
depend on t, the vectors
(86a)
will be mutually orthogonal and normalized also and the vectors w(njl) for
all n, j, t will form a complete set of orthonormal vectors. The same holds
for the set of the conjugate complex vectors w(njl)*. Using these vectors as
coordinate axes for the original representation q(R), we shall find that q(R)
is completely reduced. The v component of the vector q(R)v(kjl)* obtained
by applying q(R) on v(kj/)* is
(87)
The right side is uniformly convergent. Hence, its product with (2h+l)
n(h)(R);n can be integrated term by term giving
               =            °hj.(),nCfi!·
Thus we have for almost all R
carried out only over a finite number of A...          We can write therefore
immediately
     q(R)w(njl)* = LDW(R)ilw(njl)*.                                      (89)
                       i
This completes the proof of the complete reducibility of all (finite and
infinite dimensional) representations of the rotation group or unimodular
unitary group. It is clear also that the same consideration applies for all
                                                        f
closed groups, i.e., whenever the invariant integral dR converges.
The result for the inhomogeneous Lorentz group is: For every positive
numerical value of P, the representations of the little group can be, in an
                                                  1
irreducible representation, only the 0(0), 0(2), 0(1), ... , both for P + and
for P_. All these representations have been found already by Majorana and
by Dirac and for positive P there are none in addition to these.
This group, as pointed out in section 6, has a great similarity with the
inhomogeneous Lorentz group. It is possible, again24 , to introduce
, 'momenta" , i.e. variables l;, 11 and v instead of 1; in such a way that
     t(x, y)cp(Po,   l;, 11, v)   =   ei(x~+JTl)cp(Po' l;, 11, v).       (90)
Similarly, one can define again operators R(~)
     R(~)cp(Po' ~, 11, v)     =   CP(Po, ~', 11', v),                    (91)
where
     ~'   =~cos~ - 11sin~,                                              (91a)
     11' = ~sin ~ + 11cos~.
Then O(~)R(~tl = S(~) will commute, on account of (71c), with t(x, y)
and again contain ~, 11 as parameter only. The equation corresponding to
(57a) is
     O(~)cp(Po' ~, 11, v)     =   LS(~) v CJ)CP(Po' ~', 11', 00).        (92)
94                                                                         CHAPTER II
One can infer from (90) and (92) again that the variability domain of S, 11
can be restricted in such a way that all pairs S, 11 arise from one pair So, 110
by a rotation, according (91a). We have, therefore two essentially
different cases:
     S2 + 112          ~   *0                                                a.)
Case b) can be settled very easily. The "little group" is, in this case, the
group of rotations in a plane and we are interested in one and two valued
irreducible representations. These are all one dimensional (eis~)
     S(~)   =   eis~                                                       (93)
where s is integer or half integer. These representations were also all
found by Majorana and by Dirac. For s = 0 we have simply the equation
                      1
1]<1> = 0, for s = ± '2 Dirac's electron equation without mass, for s = ± 1
Maxwell's electromagnetic equations, etc.
In case a) the little group consists only of the unit matrix and the matrix
(01_?) of the two dimensional unimodular group. This group has two
irreducible representations, as (1) and (-1) can correspond to the above
two dimensional matrix of the little group. This gives two new
representations of the whole inhomogeneous Lorentz group, corresponding
to every numerical value of 8. Both these sets belong to class 0+ and two
similar new sets belong to class 0_.
determined.
A.
As most wave equations are invariant under a wider group than the one
investigated in the previous sections, and as it is very probable that the
laws of physics are all invariant under this wider group, it seems
appropriate to investigate now how the results of the previous sections will
be modified if we go over from the "restricted Lorentz group" defined in
section 4A, to the extended Lorentz group. This extended Lorentz group
contains in addition to the translations all the homogeneous
transformations X satisfying (10)
     XFX' = F                                                          (10')
while the homogeneous transformations of section 4A were restricted by
two more conditions. From (10') it follows that the determinant of X can
be + 1 or -1 only. If its -1, the determinant of Xl = XF is +1. If the
four-four element of Xl is negative, that of X2 = -Xl is positive. It is
clear, therefore, that if X is a matrix of the extended Lorentz group, one of
the matrices X, XF, -X, -XF is in the restricted Lorentz group. For F2 =
1, conversely, all homogeneous transformations of the extended Lorentz
group can be obtained from the homogeneous transformations of the
restricted group by multiplication with one of the matrices
      1, F, -1, -F.                                                      (94)
The group elements corresponding to these transformations will be
denoted by E, F, I, IF. The restricted group contains those elements of the
extended group which can be reached continuously from the unity. It
follows that the transformation of an element L of the restricted group by
F, I, or IF gives again an element of the restricted group. This is,
therefore, an invariant subgroup of the extended Lorentz group. In order
to find the representations of the extended Lorentz group, we shall use
again Frobenius' method. lS
B.
Given a representation of the extended Lorentz group, one can perform the
transformations described in section 6A, by considering the elements of
the restricted group only. We shall consider here only such representations
of the extended group, for which, after having introduced the momenta, all
representations of the restricted group are either in class 1 or 2, i.e. P ;::: 0
but not 00, Following then the procedure of section 6, one can find a set of
wave functions for which the operators D(L) of the restricted group have
one of the forms, given in section 6 as irreducible representations. We
shall proceed, next to find the operator d(F). For the wave functions
belonging to an irreducible D(L) of the restricted group, we can introduce
a complete set of orthonormal functions 'l'1(P, ~), 'l'2(P, ~), .. '. We then
have
     D(L)wip, ~)        =    :LD(L)J.Ik'l'J.l(P, ~).                        (97)
                              J.l
REPRESENTATIONS OF TIlE POINCARE GROUP                                                           97
The infinite matrices D(L)/lk defined in (97) are unitary and form a
representation which is eqUIvalent to the representation by the operators
D(L). The D(L), d(F) are, of course, operators, but the D(L)Jlk are
components of a matrix, i.e. numbers. We can now form the wave
functions d(F)'VI' d(F)2' d(F)'V3' ... and apply D(L) to these. For (96a)
and (97) we have
     D(L)d(F)'Vk                   =         d(F)D(FLF)'Vk
                     d(F)ID(FLF)J.llc'VJ.l                                               (97a)
                                       J.l
                =    ID(FLF)J.llcd(F)'V                 w
                         J.l
For the case P+' we can choose Po in the direction of the fourth axis, with
components 0, 0, 0, 1. Then Fpo = Po and a(Fpo) = 1. The little group is
the group of rotations in ordinary space and FAF = A. Hence qO(A) = q(A)
and DO(A) is equivalent to D(A) in this case. The same holds for the
representations of class P_.
(101)
This is, however, clearly AO = A*. Thus the operators of qO(A) are obtained
from the operators q(A) by (cf. (71a»
       to(x, y) = t(x, -y)                                               (lOla)
       80(~) = 8(-~).
For the representations O+s with discrete s, the qO(A) and q(A) are clearly
inequivalent as 8o(~) = (e-is~) and 8(~) = (eis~), except for s = 0, when
they are equivalent. For the representations 0/3), 0'/3), the qO(A) and
q(A) are equivalent, both in the single valued and the double valued case,
as the substitution 11 ~ -11 transforms them into each other. The same
holds for representations of the class 0_. If DO(L) and D(L) are equivalent
       D-IDO(L)U    =       D(L),                                         (102)
the square of U commutes with all D(L). As a consequence of this, U2
REPRESENTATIONS OF TIlE POINCARE GROUP                                                          99
c.
Similarly
except that in this case (-UIlV ) corresponds to d(F). The 'l'v and d(F)'I'v
can be expressed by the <\> and <\>'. If the 'I' and d(F)'I' were linearly
independent, the <\> and <\>' will be linearly independent also. If the d(F)'I'
were linear combinations of the '1', either the <\> or the <\>' will vanish.
If D(L)llv and DO(L)IlV are inequivalent, the 'l'k and d(F)'I'v = 'I"v are
orthogonal. This is again a generalization of the similar rule for finite
unitary representation~.28 One can see this in the following way:
Denoting Mkv = ('I'k' 'l'v) one has
      Mkv         =    ('I'k' "'v)     =   (D(L)'I'/r.' D(L)'I"v)
= ~D(L);PO(L)A.vM,.v.;
      M           D(L)tMD (L).     °
Hence
                                                                                       (105)
From these, one easily infers that MMt commutes with D(L), and MtM
commutes with DO(L). Hence both are constant matrices, and if neither of
them is zero, M and Mt are, apart from a constant, unitary. Thus D(L)
would be equivalent DO(L) which is contrary to supposition. Hence MMt
= 0, M = 0 and the 'I' are orthogonal to the d(F)'I' = ",. Together, they give
a representation of the group formed by the restricted Lorentz group and
F. If they do not form a complete set, the reduction can be continued as
before.
One sees, thus, that introducing the operation F "doubles" the number of
dimensions of the irreducible representations in which the little group was
the two dimensional rotation group, while it does not increase the
underlying linear manifold in the other cases. This is analogous to what
happens, if one adjoins the reflection operation to the rotation groups
themselves. 29
D.
The operations d(l) can be determined in the same manner as the d(F) were
found. A complete set of orthonormal functions corresponding to an
irreducible representation of the group formed by the Land FL shall be
denoted by '1'1' '1'2' . .. . For this, we shall assume (97) again, although
the D(L) contained therein is now not necessarily irreducible for the
restricted group alone but contains, in case of 0 +s or O_s and finite s, both s
and -so We shall set, furthermore
         d(F)'I'k   =   L d (F)I1k'VW                                     (106)
                        fI.         .
We can form then the functions d(I)'Vl' d(I)'V2' .. '. The consideration,
contained in (97a) shows that these transform according to D(ILI)Jlk for the
transformation L of the restricted group:
         D(L)d(I)'Vk     =    LD(ILl)l1kd(l)'Vw                         (106a)
                              fI.
Choosing for L a pure translation, a consideration analogous to that
performed in (98) shows that the set of momenta in the representation L ~
D(ILI) has the opposite sign to the set of momenta in the representation
D(L). If the latter belongs to a positive subclass, the former belongs to the
corresponding negative subclass and conversely. Thus the adjunction of
As we saw before, the d(I)'I'l' d(I)'I'2' are orthogonal to the original set of
wave functions '1'1' '1'2' .. '. The result of the application of the operations
D(L) and d(F) to the '1'1' '1'2" .. (i.e., the representation of the group
formed by the L, FL) was given in part C. The D(L)d(I)'I'k are given in
(106a). On account of the normalization of d(l) we can set
      d(/)d(I)'I'k    =       'l'k'                                           (106b)
For d(F)d(I)'I'k we have two possibilities, according to the two possibilities
in (95). We can either set
      d(F)d(I)'I'k = d(/)d(F)'I'k = I.d(F)~(I)'I'w                             (107)
                                                   J.L
or
      d(F)·d(I)'I'k       =    -d(I)d(F)'I'k   =         -   I.d(F)~(I)'I'w   (107a)
                                                             J.L
Madison, Wis.
REPRESENTATIONS OF THE POINCARE GROUP                              103
Princeton University
Reprinted from Proc. Nat. Acad. Sci. (U.S.A.) 34, 211 (1948).
104                                                                       CHAPTERll
Since all Lorentz frames are equivalent for the description of our system, it
follows that, together with'll, U(L)'II is also a possible state viewed from
the original Lorentz frame I. Thus, the vector space V contains, with every
'V, all transforms U(L)'V, where L is any Lorentz transformation.
The operators U may also replace the wave equation of the system. In our
discussion, we use the wave functions in the "Heisenberg" representation,
so that a given 'V represents the system for all times, and may be chosen as
the "Schroedinger" wave function at time 0 in a given Lorentz frame I. To
find 'lit ' the Schroedinger function at time to' one must therefore transform
         o
to a frame I' for which t' = t - to' while all other coordinates remain
unchanged. Then'Vt = U(L)'V, where L is the transformation leading from
                       o
I to 1'.
                                                 -
product of two wave functions, and also to the transition from one Lorentz
                                                              --
frame to another. In fact, if VI' = U(L)VI' then ~, = VVr = Q:L)'ij. Thus,
one obtains classes of equivalent wave equations. Finally, it is sufficient
to determine the irreducible representations since any other may be built
up from them.
The present discussion is not based on any hypothesis about the structure
of the wave equations provided that they be Lorentz invariant. In
particular, it is not necessary to assume differential equations in
configuration space. But it is a result of the analysis in (L) that every
irreducible wave equation is equivalent (in the sense of (2)) to a system of
differential equations. For the relation of the present point of view to other
treatments of the subjects see reference 11.
                                                    °
over the manifold pkpk = P, with a constant value P. We confine ourselves
                            *'
to the cases in which P 0, and either P > or P = 0, because the
remaining cases are unlikely to have direct physical significance.5.
In all cases the operators Mid have the form M k/ + Sk/' where the
                                               o
momenta. Both Mid and Sid satisfy the commutation rules (3a). Since the
                 o
                                                                         (10)
or, introducing the three-dimensional vector operators,
I. Ps ' Particles of finite mass and spin s.-Here P = m 2 > O. In the rest
system of the particle, the momentum vector has only the one non-
vanishing component p4 = ± m, hence, by (lOa), W = m2S2. The operator
p-l W represents the square of the spin angular momentum, and has the
value s(s + 1) (s = 0, 112, 1, ... ) for an irreducible representation. For a
given momentum vector there are 2s + 1 independent states. The
representation U(L) is single €>r double valued according to whether s is
integral or half integral. The lowest cases (s = 0, 112, 1) correspond to the
Klein-Gordon, Dirac and Proca equations, respectively.
III. 0(8) and 0'(8). Particles of zero rest mass and continuous
spin.-Here, P = 0, W = 8 2, where 8 is a real positive number. For a given
momentum vector there exist infinitely many different states of
polarization, which may be described by a continuous variable. The
representation 0(8) is single valued, while 0'(8) is double valued.
4. The Class Ps.-(a) s = 0. Here, the variable ~ assumes only one value
and may therefore be omitted. Consequently, Q(P, A) = 1 (cf. reference 8),
and for the little group the trivial one-dimensional representation is
obtained. Hence, Ski = 0, and wk = 0. The wave equation reduces to pkpk
= m2 ; the inner product (<\l,W) is determined by the norm (W,W)of a wave
function,
                                                                         (11)
        W(x)    =
                         -3
                    (21t)""f   J
                               e-i(p.x)W(p)dn,                           (12)
where x stands for xl, x2 , x3, x4. It is well known that (W,W) cannot be
simply expressed in configuration space, because for the Klein-Gordon
                                                       °
equation the density is indefinite, and the integral over the density in
configuration space coincides with (11) only if W(P) = whenever p4 < 0.
            1
(b) s  'iN with N = 1, 2, 3, .. '. For particles of higher spin we use the
        =
equations first derived by Dirac7 in the form essentially given in reference
8. We use for ~ the N four-valued variables Sl' .... , SN in which the wave
110                                                                               CHAPTERll
function 'Jf{p; ~1' ... , ~N) is symmetric. We define for every ~v four-
dimensional matrices Yvk of the same nature as are used in Dirac's electron
theory:
    y/yJ + y}y/ = 2gkIi        (k,l = 1,2,3,4).                                   (13)
                                                                                  (15)
                v
where the
                                                                        (17)
where Av is a skew Hermitian matrix involving only the fIrst v of the     t
(and the p). We can prove (17) best by induction: applying P4YV+14 to
(17) gives, by means of(14),
     (P4)V+IYV+14yv4 ... Y24Y1 4'V = mVp4Yv+14'V+P4YV+14Av'V           (17a)
      =mV+l'V+ (-mvp{YV+lk+P4YV+14Av)'V           (k= 1,2,3).
The last bracket is Av+l: it is skew Hermitian and involves only the fIrst
v + 1 of the Y so that (17) is established by induction. Setting v = N in
(17), multiplying with yand summing over the 1; yields
    P4~'V*Y14Y24. ··YN4'V          =    ~1'V12+ L,'V*AN'V.             (17b)
          ~                                ~      ~
Because of the skew Hermitian nature of AN' the last term is imaginary.
Since the two other terms of (17b) are real, they must be equal. As a
result, we can write for (16) also
     ('V,'V)   =   JIm/p4~   1'V12dn.                                   (18)
At the same time, (18) permits us to give another form to the scalar
product,
     ('V,'V)   =   J1P4rN-ltl'V12dPldP2dP3'                            (18a)
which differs from (18) or (16) by the positive constant m-N . It may be
worth noting here that the absolute signs in (16), and in the defInition (11)
of dn (or in (18a», can be omitted in case of an odd N. This makes it
possible to defIne a simple positive defInite scalar product in coordinate
space by means of (12). In particular, for N = I, (16) (or (ISa» equals the
integral of 1'V21over ordinary space. In case of even N (integer spin s) no
simple positive defInite scalar product can be defined in coordinate space.
It is now established that the solutions of (14) form a Lorentz invariant set
in which a positive definite scalar product (16) or (18a) can be defined.
We shall now determine the representation of 2 to which the solutions
belong and will also calculate the invariants P and W.
two rows of 'Yv' There are 2N such components, the rest of the 4N
components of 'II must vanish. Even these components will not be
independent: as a result of the symmetry of the 'I' in the ~, all components
of 'II will be equal in which the same number 1C of the N indices ~
correspond to the first row of the 'Y, the N - K other indices to the second
row. Since K can assume any of the values between 0 and N, there are N +
1 such components. If P4 = -m, the same considerations will hold, except
that the last two rows of'Y will play the role which the first two rows play
in case ofP4 = m.
Because of (lOa), W becomes m 2(S23 2 + S31 2 + S122) or, since the S23'
S31' S12 are the infinitesimal operators ofD(s), we have W = m 2s(s + I) as
given9 in §3. The value ofP is m2 because of (14a).
The essential difference between finite and zero mass is that, in the latter
case, not only the infinitesimal operators but also the wave equation are
REPRESENTATIONS OF THE POlNCARE GROUP                                               113
It is now advantageous to assume that the --(f are diagonal, their diagonal
elements being 1, 1, -1, -1. Equation (20) then expresses the fact that", for
the Po in question is different from zero only if all 1; have values
corresponding to the first two rows of the y. Since the r commute with the
ff but are not identical with them, they may be also assumed to be
diagonal, with diagonal elements 1, -1, 1, -1. Hence, in the manifold
defined by (20) and (19a) all components of", vanish (for P = Po) unless
all 1; have values corresponding to the first rows of the     r.
                                                               the manifold
(20), (19a) is one dimensional for given p. The same holds for the
manifold defined by (20), (19b) except that in this case "'(Po; SI' ... , sn)
differs from zero only if all Shave values corresponding to the second row
of the y. For given momentum, '" has only two independent components.
(19b). One sees this most easily by applying 1I2iyv ly/ to (20) and
making use of (19). As a result, M12'1' = ± 1I2N'I' = ±S'l' for the two
manifolds in question: these indeed belong to the representation Os' of the
inhomogeneous Lorentz group.
The value of the invariant P is zero. The above also involves a calculation
of the w for the'll at P = Po: we have w3'1' = M12'1' = ± s'l', wI'll = (M42 +
M23 )'I' = 0, w2'1' = (M31 + M I4 )'I' = 0, - w4'1' = M12'1' = ± s'l'. It follows
that the value of the second invariant W = - (w4)2 + (w 1)2 + (w2)2 + (w3)2
is also zero for all the manifolds Os; these cannot be characterized by P
and W. However, these manifolds can be characterized by the equation P =
owith the additional set
     wk = sPk and wk = -sPk'                                          (21)
the + applying to (19a), the - to (19b). Both these equations are invariant
with respect to proper Lorentz transformations. If reflections are to be
included, one can combine them into wkwl = s2pII'I'
6. The Class O(S). - Here, the auxiliary is a space like four vector ~ of
length I, orthogonal to p. The scalar function 'I'(p,~) is determined by the
equations 11
      gklpII'I'l' = 0; gklpk~l'l' = 0; gkl~k~l'l' = -'1',                    (22)
      Pka'l'ra~k = -is'll,                                                  (22a)
with a real positive constant S. By (22a), for every real number p,
      'I'(p,~ + pp) =e-ip:::'I'(p,~).                                        (23)
The infinitesimal operators of displacement are the Pk' those for rotations
are the M of (9) plus the
        o
(24)
In order to find the invariant scalar product, we introduce, for every vector
P on the light cone, two real space like vectors u(l)(p) and u(2)(p) of length
one, orthogonal to p and to each other, so that
      {u(r)(p),p}   =   0, {u(r)(p),u(s)(p)}   =   8rs   (r,s   =   1,2).    (25)
Then ~ is a linear combination of p, u(l)(p), u(2)(p),
REPRESENTATIONS OF THE POINCARE GROUP                                                                     115
                                                                                                  (26)
where a and the ~ are real. {~,~} = -1 implies ~12 + ~l = 1, hence ~I +
i~2 = ei't with a suitable real angle 'to ,!,(p,~) is therefore a function of p, a,
't,
                                                                                                  (27)
The choice of the t/(p) is, of course, not unique. Let v(r)(p) be another
system of vectors which satisfy (25). They may be expressed in the form
(26), i.e.,
     v(r)(p)     =   K'p +LAsrU(s)(p)             (r,s   =   1,2).
                                s
      S23-S24
       =   -i~2(l.... + l....) + i(~3 - ~~.                             (30b)
               a~3 a~               a~2
Because of (22a), the first term gives, if applied to 'I' at P = Po just 5~1'Y
and 5~2'1', respectively. The second terms vanish because of the second
equation of (22). Hence", is not invariant under the "displacements" M13
- M14 and M23 - M24 in ~ space, and the sum of the squares of the
"momenta" is (~12 + ~l) 52 = 52 because of the last equation of (22).
This is also the value ofW, while P = O.
7. The Class O'(5).-Since the discussion of this last case follows the
pattern of the preceding section we confine ourselves to stating the main
results. We introduce, in addition to the vector ~, a discrete spin variable 1;
which can assume four values. The wave equations become
      Y'Pk'l' = 0;
      gldp61'l' = 0;      gld~k~''l' = -'I'.                                        (31)
                                                                                   (31a)
The parameters (l and 't are introduced as before. The norm is given by
                 J
      ('I', 'II) = P4-2I, IcI>(P,0,'t)1 2dP 1dP2 dP3d't.                            (32)
(Cf. (18a) and (29).) Again W", =          s2v, P'If = o.
It may be remarked that the scalar product has a simple positive definite
form in coordinate space for these equations. 11
1 All the essential results of the present paper were obtained by the two authors
independently, but they decided to publish them jointly.
2 Wigner, E.P., Ann. Math., 40, 149-204 (1939).
3 Fierz, M., Helv. Phys. Acta, XII, 3-37 (1939).
4 Garding, L., Proc. Nat. Acad. Sci., 33,331-332 (1947).
5 Gelfand L., and Neumark, M., J. Phys. (USSR), X, 93-94 (1946); Harish-Chandra, Proc.
Roy Soc. (London), A, 189, 372-401 (1947); and Bargmann, Y., Ann. Math., 48, 568-640
(1947), have determined the representations of the homogeneous Lorentz group. These are
representations also of the inhomogeneous Lorentz group. In the quantum mechanical
interpretation, however, all the states of the corresponding particles are invariant under
translations and, in particular, independent of time. It is very unlikely that these
REPRESENTATIONS OF THE POINCARE GROUP                                                          117
representations have immediate physical significance. In addition, the third paper contains
a determination of those representations for which the momentum vectors are space like.
These are not considered in the present article as they also are unlikely to have a simple
physical interpretation.
6 Lublmski, J. K. Physica, IX, 310-324 (1942).
7 Dirac, P. A., M., Proc. Roy. Soc., A, 155,447-459 (1936).
8 The literature on relativistic wave equations is very extensive. Beside the papers quoted
in reference 11, we only mention the book by the Broglie, L., Theorie generaie des
particuies b. spin (Paris, 1943), and the following articles which give a systematic account
of the subject: Pauli, W., Rev. Mod. Phys., 13, 203-232 (1941); Bhabha, H.J., Rev. Mod.
Phys., 17, 203-209 (1945); Kramers, H. A., Belinfante, F. 1., and Lubanski, J. K., Physica,
Vill, 597-627 (1941). In this paper, the sum of (14) over all v was postulated; (14a) then
has to be added as an independent equation (except for N = 1). Reference 11 uses these
equations in the form given by Kramers, Belinfante and Lublmski.
9 One may derive this result in a more elegant way, without specializing the coordinate
system. For the sake of brevity, we omit this derivation.
10 de Wet, 1.S., Phys. Rev., 58, 236-242 (1940), in particular, p. 242.
11 Wigner, E. P., ZPhysik, (1947).
118                                                                              CHAPTER II
   Certain quantities are introduced which are like tensors in space·time with an infinite
   Nlumerable number of eemponents and with an invariant positive definite quadratic form
   for their equ8red length. Some of the main properties of these quantities are dealt' with.
   and some applications to quantum mechanics are pointed out.
                                     1. INTRODUCTION
Given any group, an important mathematical problem is to get a matrix representa-
tion of it, which means to make each element of the group correspond to a matrix
in such a way that the matrix corresponding to the product of two elements is the,
product of the matrices corresponding to the factors. The matrices may be looked
upon as linear transformations of the co-ordinates of a vector and then each element
of the group corresponds to a linear transformation of a field of vectors. Of specia.l
interest are the unitary representations, in whlch the linear transforma.tions leAve
invariant a positive definite quadratic form in the co-ordinates of a vector.
   The Lorentz group is the group of linear transformations of four real varia:bles
;0' ;1' ;1' ;" such that ;~ -;~ - ;=.- ;~ is invariant. The finite representations of this
group, i.e. those whose matrices have a finite number of rows and columns, are all
well known, and are dealt with by the usual tensor analysis and its extension spinor
analysis. None of them is unitary. The group has also some infinite representations
which are unitary. These do not seem to ha.ve been studied much, in spite of their
possible importance for physical applications.
   The present paper gives a new method of attack on these representations, which
was suggeste9 by Fock's quantum theory of the harmonic oscillator. It leads to a
new kind of tensor quantity in space-time, with an infinite number of components
and a positive definite expression for its squared length.
                            2. THREE-t>IMENSIONAL THEORY
  This section will be devoted to some preliminary work applying to the rotation
group of three-dimensional Euclidean space. Take an ascending power series
                                                                                                (1)
in a real variable 61 with real coefficients ar' Consider these coefficients to be the
co-ordinates of a vector in a certain space of an infinite number of dimensions, and
define the. squared length of the vector to be
                                            l:ci r!a~,                                          (2)
The series (2) must converge for the vector to be a finite one.
                                            [ 284 ]
in the three variables EI' Ell' Ea. The s~uared length of such a vector is
                                              ~ci r! s!t! A~.,                               (4)
and two vectors with co-ordinates A,.., and B,.., have a scalar product
                                          ~or!s!t!A,..,B,..,.                                 (5)
  If the variables El' E., Sa are subjected to a linear transformation, going over into
          say, the power series (3) will go over into a power series in E~, E~, E~,
E~, E~, E~,
                                         p    = ~A:",sirs~·E~.
in which the coefficients A' are linear functions of the previous coefficients A. Thus
each linear transformation of the fs generates a. linear transformation of the
coefficients A.
    The theorem will now be proved: A li~r trans/ormation 0/ EI' EI' E. which leo.1Je8
EI + EI +EI invariant generate8 a linear trans/ormation 0/ the eoeJficientB A which
leo.fJe8 the sqwred length (4) invariant. Consider first the infinitesimal transformation
                                                                                              (6)
which leaves ~~ + ~I + ~~ invariant, e being a small quantity whose square is negligible.
Substituting into (3), one gets
                         p   = ~A""(sir~~·+reEir-IE~HI-seEir+lE~·-I)Et
Hence                   A;'" = A,.., + (r + I) eAr+1,a-1,I- (s + I) eA r - I ..+1,,'
in which A ret with a negative suffix is counted as zero. Thus
      ~r!s! t! A;':'   = ~r! s! t! [A:. + 2(r+ l)eA,..,A r+1,a_1,I- 2(s+ l)eAr_~1.tA,..,].
The last two terms in the [ ] here cancel, as may be seen by substituting r - I for r
in the former and s - I for 8 in the latter, and hence the squared length (4) is invariant
for the tra.ns:formation (6). Any linear transformation of E1, Ea, Ea whioh leaves
EJ+Q+EI invariant can be built up from the infinitesima.l transformation (6) and
Bimila.r infinitesimal transformations with E1, EI and Ea permuted, together with
possibly a reflexion El = - Ei, E. = Ei. E. = E~, which obviously leaves the squared
length (4) invariant, and hen,ce the theorem is proved.
  The group of transfon:n:ations of the E's which leave Ef + EI + Elinva.ria.nt is the
rotation group in three-dimensional Euclidean &p&oe, 80 the tra.ns:formations of the
120                                                                       CHAPTER II
286                                 P. A .. M. Dirac
coefficients A provide a representation of this rotation group. One may restrict the
function P to be homogeneous, of degree u say, and then the representation is a
finite one. The coefficients A then form the components of a symmetrical tensor
of rank u, the connexion with the usual tensor notation being effected by taking
Ara to be U!fr!8!t! times the usual tensor component with the suffix I occurring
r times, 2 occurring 8 times and 3 occurring t times, as may be seen from the
invariance of expreasion;(3) with (61,6,,63) transforming like a vector.
   One can make a straightforward generalization of the foregoing theory by illtro-
ducing other triplets of variables, 1/1' 1/" 1/3 and '1' 'I' '8 sa.y, which transform to-
gether with 61' 6a, 6a, and setting up a power series in all the variables. The trans-
formations of the coefficients of these more general power series will provide further
representations of the three-dimensional rotation group. If such a more general
power series is restricted to be homogeneOus, its coefficients will form the components
of an unsymmetrical tensor.
                             3. FOUR-DIMENSIONAL THEORY
  Take a descending power series
                              ko/6o+ kl/fJ+ka/sg+ ks/fJ+ ...                          (7)
 Multiply this vector space into the vector space of the preceding section. A general
 vector in the product space will have co-ordinates Anrst which can be represented
'a,s the coefficients in a power series
                                 Q=   ~o Anrst6on-1S~6;s~                             (9)
in the four variables SO'   SI' 6" Sa. The squared length of such a vector is
                                    ~cin!-lr!8!t!A:"'.,                              (10)
and two vectors with co-ordinates Anrae and Bnrae have a scalar product
                                 ~"'n'-lr'8't'
                                 *'0  •   • • • .an..
                                                 A      B nrae·                      (11)
  The series (7) may be extended baokwards to inoludesome terms with non-
negative powers of 60,80 that coefficients k" occur with negativen-values, lea.djng
to coeffici,nts Anrae with negative n-values. S~ce n I is infinite for n negative, these
new coefficients
      I'
                  do not contribute to the squared length of a veotor or the scalar
product of two vectors. Thus the terms with non-negative powers of €o should be
counted as corresponding J;o the vector zero, and whether they are present in an
expansion or not does not matter.
REPRESENT ATIONS OF THE POINCARE GROUP                                                      121
(13)
The last two terms in the [ ] here cancel, as may be seen by substituting n + 1 for
  in the former and r-l for r in the latter; and hence the squared length (10) is
1/,
invariant for the transformation (15). Any Lorentz transformation can be built up
from infinitesimal transformations like (15) and three-dimensional rotations like
those considered in the preceding section, together with possibly a reflexion, which
obviously leaves the squared length (10) invariant, and hence the theorem is proved.
   The transformations of the coefficients A thus provide a unitary representation
of the Lorentz group. The coefficients themselves form the components of a new kind
of tensor quantity in space-time. I propose for it the name expansor, because of its
connexion with binomial expansions. One may restrict the function (9) to be
homogeneous and one then gets a simpler kind of expansor, which may be called a
 122                                                                            CHAPfERII
288                                   P. A. M. Dirac
 lwmogeneoU8 exptJnBOr. The analogy with the three-dimensional case suggests that
 one should look upon a homogeneous expansor as a symmetrical tensor in space-
 time with the suffix 0 occurring in its components a negative number of times.
    The foregoing theory can be generalized, like the three-dimensional theory, by
the introduction of otl1er quadruplets of /Variables, 1/0' 1/1' 1/", 1/8 and '0' '1' '2' '8 say,
 which transform tog~ther with 60' 61' 62' 68' One can then set up a power series in
 all the variables, ascending in those variables with suffixes 1, 2 and 3, and descending
in those variables with suffix O. The transformations of the coefficients of these more
general power series will provide further unitary representations of the Lorentz
group, and the coefficients themselves will form the components of more general
 expansors.
    There is another generalization which may readily be made in the theory, namely,
to take the values of the index n in (9) to be not integers, but any set of real numbers
no, no+l, no+ 2, ... extending to infinity. In the formula (10) for the squared length
n! is then to be interpreted as r(n+ 1). The terms with negative n-values can no
longer be discarded. The expressi?n for the 'squared length is still positive definite
if the minimum value of n is greater than -1, which is the case if the function (9)
IS homogeneous and its degree is negative. The resulting representation is then still
unitary. If, however, the function (9) is homogeneous and its degree is positive,
there will be a finite number of negative terms in the expression for the squared
length. The resulting kind of representation may be called nearly 'Unitary.
   If the series (2) is convergent, then (1) is convergent for all values of 61' Similarly,
if (4) is c~nvergent, then (3) is convergent for all values of 61' 62' 68' On the other
hand, if (8) is convergent, (7) need not be convergent for any value of 60' in which
case, of course, it does not define a function of 60. Similarly, if (10) is convergent,
(9) need not be convergent for any values for the 6'S. Thus, corresponding to a general
expansor A ..... , there need not exist any function Q of the fs. However, it will now
be proved that if the 8eries (9) i8 Iwmogenwus and (10) i8 convergent, then (9) i8
ab80lutely convergent for all values of the   6'8 sati8fying
                                     ;~-;j- 61-;~ > O.                                     (16)
which shows that (17) is convergent when (16) is satisfied. One may take go, gl' g" g.
and the A's to be all positive without disturbing the argument, 80 (17), considered
as a quadruple series in n, r, 8, t, is ab80lutely convergent.
  ThU8 for any homogeneous expansor of finite length, there exists a function Q
given by (9) defined within the light-cone (16). In transforming this function with
a Lorentz transformation (12), it is legitimate to use the expansion of (13) in
ascending powers of g;, g~, g~ and descending powers of g~, for the following reason.
Suppose g" is within the light-cone and take for definiteness go> O. Then
                              ~~+~~+~~+~~>~                                           (n)
One can change the sign of any of the co-ordinates gi, g~, gi, leaving g~ unchanged
and ~p will still lie within the light-cone with ~o > 0, 80 (22) must still be satisfied.
Hence
                           ~g~-I~~; I...L I~g; I-I~gi 1 >0,
which shows that the expansion of (13) in the required manner is absolutely con-
vergent. The use of this expansion in the preceding section is thus justified for the
case when (9) is .homogeneous, the new coefficients A being determined by the
transformed function Q within the light-cone. The justification for (9) not homo-
geneous then follows since terms of different degree do not interfere. It should be
noted that all the foregoing arguments are valid also when the values of n are not
integers.
   There are some expansors which are invariant under all Lorentz transformations,
namely those whose components are the coefficients of (~-         gt- -
                                                                   ~ gl)-l expanded
in ascending powers of gl' gl' ga and descending powers of go' For such an expansion
to be possible, l must not be a negative integer or zero, but it can be any other real
number. Again, there are expansors which transform like ordinary tensors under
Lorentz transformations, namely those whose components are the coefficients in
the expansion of
                                                                                  (23)
 124                                                                                              CHAPTER II
290                                          P. A. M. Dirac
where 1 is any homogeneous integral polynomial in Eo, €l' €,' €., and I is restricted.
as before. (23) transforms like the polynomial I, and thus like a tensor of order
equaJ to the degree of I. One would expect a.ll these expansors to be of infinite
length, a.s otherwise one could set up a positive definite form for the squared length
of a tensor in space-time. A forma.l proof that they are is a.s follows.
  Suppose the 1 in (23) is of degree u and express it a.s
         1 = gu+€i,gu-J     +(€I- ~')gu_I+€o(€I-~2)gu_a+ (€~- ~1)lgu-4 + ... ,
where ~I = €f + €I + €land each of the g's is a polynomial in €1' fl' fa only, of degree
indicated by the suffix. The successive terms here contribute to (23) the amounts
                                   _   ~""
                                   - --"'-0
                                               l(l+ 1)(1+ 2) ... (1+ m-l) gu;8m
                                                                     m!
and so on. These expa.nsions show that, for large values of m, the terms arising from
gu-I' 9u-4., ... are of smaller order than the corresponding terms arising from gu,
and the terms arising from gu-a, flU-5' ... are of smaller order than the corresponding
terms arising from g_I' 80 that in testing for the convergence of the series which
gives the squared length, only the gu and g..-1 terms need. be taken into account.
(It will be· found that the convergence conditions are not sufficiently critical to be
affected by this neglect.) Now express gu and g..-1 a.s
         g. = 8.. +~·8u_I+;'8.. _, + ... , 9u-l = 8.._1+;·8U-3+~'8u-5+ ... ,
where the 8's are solid harmonic functions of El' fl' €., of degrees indicated by the
suffixes. Each of them gives a contribution to (23) of the form
                                                                      (m+l-l)I ~I(m~)
                     8u_Ir;Ir(f~-;')-l = 8"_11'1::_0 ~!(l-I)!· ffm-tll                                   (24)
                                                                        (m+l-l)I      ;I(m~)
or              8u_Ir_1 Eo;Ir(€I-;I)-l        =8   U - .......1   1::_0 m!(I-I)!· €i<m-tll- 1 '         (25)
Using the result (41) of the appendix, one finds for the squared lengths of the
expa.nsors whose components are the coefficients of (24) and (25), series of the form
                                (m+I-·l)114m-tr(m+r)! (m+u-r+l)!
                         c1:m   m!I(I-I)!~                  (2m+2l-1)!
                                                                                           (29)
say. This gives the function F of the x's which represents              Eon-1gf,g~.   A general
f,-function is now represented thus,
                       ~AnratEon-lg f,g~ == ~ .... F,.,..(xo,   Xl' X" x 8)·               (30)
In this way a general expansor .An,. gets connected with a function of the x's.
    Vol. 183.   A.                                                                        20
126                                                                                     CHAPTER II
292                                         P. A. M. Dirac
  The chief interest of this connexion is that the law for the scalar product of two
expansors becomes very simple when expressed in terms of the functions of the x's.
The scalar product is the inte{1ral oj the product oj the two Junctions oj the x' 8 over aU
Xo. Xl, XI and X 3• To prove this, first evaluate the integral
                                f
                                _aoao   (d)'"
                                         Z- dz e-1zl .(d)""
                                                       z - dz e-izl dz,                        (31)
where the dot has the meaning that operators to the left of it do not operate on
functions to the right of it. For m > O. (31) goes over by partial integration into
      f ao
      ,-ao
             (d)"-l
              z--
                  dz
                    e-lzl. (d)(   d)"" e-h1dz
                            z+~ z--
                                          dz      dz
              =   fao
                   -ao   (d)"'-l
                          z- dz e-hl .{(z- dzd)""(z+ dzd) +2m' (d)""-l)
                                                                z- dz   e-h1dz
Now substitute for z each of the variables xo, Xl' XI' X. in turn, with m equal to n, r, 8, t
and m' equal to n'. r'. 8'. t' respectively. and multiply the four equations 80 obtained.
The result, after dividing through by 7T1n !12''''''''H+I, is
   The four x's of the preceding section may be looked upon &8 the co-ordinates
 of a four-dimensional harmonic oscillator, the four operators ioj'iJz,. being the con-
 jugate.momenta. y. and the energy of the oscillator may be taken to be the Lorentz
 invJriant                                                                         (32)
 The 1, 2. 3 components of the oscillator thus have positive energies and the 0 com-
 ponent negative energy.
REPRESENTATIONS OF THE POINCARE GROUP                                                   127
(33)
The first of these is the usual equation for the motion of the particle as a whole. The
second shows that at eaoh point in space-time the wave function", is homogeneous
in the fs of degree -1. The third shows that the state for which the momentum-
energy four-vector of the particle has the value PI' is represented by the wave
function
(34)
For the state for which the particle is at rest, PI     = PI = Pa = 0,    Po   = m,   and '"
reduces to
128                                                                                CHAPTER II
294                               P. A. M. Dirac
This Vr is spherically symmetrical, showing that when the particle is at rest it has
no spin. But when the particle is moving, it is represented by the general Vr (34)
and has a finite probability of a non-zero spin. In fact, taking for simplicity
PI = Pa = 0, the particle has a probability (mpflp'G+l)lof being in a state of spin
corresponding to the transformations of ~f under three-dimensional rotations.
  This example shows there is a possibility of a particle having no spin when at rest
but acquiring a spin when moving, a state ofaffairs which was not allowed by previous
theory. It is desirable fihat the new spin possibilities opened up by the present theory
should be investigated to see whether they could in some ca.ses give an improved
description of Nature. The present theory of expansors applies, of course, only to
integral spins, but probably it will be possible to set up a corresponding theory of
two-valued representations of the Lorentz group, which will apply to half odd
integral spins.
APPENDIX
  The rules (5), (II) for forming scalar products are not always convenient for direct
use. There are various ways of transforming them and making them more suitable
for practical application. One such way has been given (Dirac 1942, equation 3·22)
for the ca.se of a single ~ with ascending power series. Another way, applicable to
the ca.se of, homogeneous functions of Sl' Sa, Sa, is provided by the following.
   By partial integration with respect to S', one gets, for m> 0,
If the integrals are made precise in the sense of Cesaro, which means neglecting
oscillating terms like f"ei £{' for S' infinite, this gives
Taking m ~ 110 and applying the partial integration process 110 times, one gets
                   II: ~"'fn e.
                        II)
                                iEf' df elS' =   i"n!II:",s"'-f&   elfE' df elS'
It follows that if A and B are homogeneous functions of ~l'              S,,' Sa of degree 'ft, their
scalar Ilroduct according to (5) is
Now suppose 8 ~ r and apply the procedure by which (37)                             W&8   changed to (38)        f'
times. The result is
            (AB) = (217)-3i-U+llr 4rr!(u-r+ i)! (u- 2r+ l) !-1
where n! means r(n+ 1) for n not an integer. H 8>r, the procedure can be applied
once more, and then shows that
                                        (AB) = 0 for              r=+=8.                                   (40)
H    8   = r, (39) shows that           (AB) = c4rr!(u-r+i)!,                                              (41)
where c depends only on u - 2r and on the two 8 functions.
REFERENOE
                     The explicit Feynrnan rules are given for massive particles of any spin j, in both a 2j+l-component and
                a 2(2j+l)-component formalism. The propagators involve matrices which transform like symmetric trace-
                less tensors of rank 2j; they are the natural generalizations of the 2X2 four-vector qjJ and 4X4 four-vector
                'YJJ. for j=i. Our calculation uses field theory, but only as a convenient instrument for the construction of a
                Lorentz-invariant S matrix. This approach is also used to prove the spin-statistics theorem, crossing sym-
                 metry, and to discuss T, C, and P.
invariance of the S matrix, withont any recourse to            TABLE I. The scalar matrix n(q) = (_)2iII'I1'2·· 'Ql'lQl/2 " •• for
                                                            spins j~3. In each case J is the usual 2j+l-dimensional matrix
separate postulates of causality or analyticity.'           representation of the angular momentum. The propagator for a
    Nowhere have we mentioned field equations or La- particle of spin j is Seq) ~ -i( -im)-,jTI(q)!q'+m'-i,.
grangians, for they will not be needed. In fact, our
refusal to get enmeshed in the canonical formalism has        lI(O)(q) ~ 1
a number of important physical (and pedagogical) TIO!2) (q) ~'1'-2 (q. J)
advantages:                                                   II''' (q) ~ -q'+2(q· J) (q. J _gO)
    (1) We are able to use a 2j+l-component field for a TIIs;" (q) = -q2(g"-2q· J)H[(2q- J)'-q'][3'1'-2q' J]
massive particle of spin j. This is often thought to be        n(2) (g) ~ (_q2)'- 2g'(q· J) (q- J -'1')
impossible, because such fields do not satisfy any free-                                         +j(q. J)[(q- J)'_q2J[q. J-2qoJ
field equations (besides the Klein-Gordon equation). TIl ,2) (q) ~ (_q2)2(q"_ 2q' J) - ,q'[(2q- J)'- q'][3'1'-2q· J]
The absence of field equations is irrelevant in our ap-                                1
proach, because the fields do satisfy (1.7) and (1.8); a                          +--{ (2q· J)'- q'][(2q· J)2-9 q2J[5'1'- 2q· JJ
free-field equation is nothing but an invariant record of                            120
which components are superfluous.                             II (3) (g) = (-q')'+2( -q')(q. J) (q. J-gO)
    The 2j+ I-component fields are ideally suited to weak                      - ~g'(q' J)[ (q. J)'-q'J[q' J - 2qoJ
interaction theory, because they transform simply                                   4
                                                                                 +-(q. J)[ (q- J)2_ q'J[(q- J)'-4q'J[q· J -3'1'J
under T and CP but not under CorP. In order to                                     4.1
discuss theories with parity conservation it is con-
venient to use 2 (2j+ I)-component fields, like the Dirac
field. These do obey field equations, which can be de-
rived as incidental consequences of (1.7) and (1.8).         symmetry. Section V is devoted to a statement of the
    (2) Schwinger" has noticed a serious difficulty in the Feynman rules. The inversions T, C, and P are studied
quantization of theories of spin j~ ~ by the canonical in Sec. VI. They suggest the use of a 2 (2j+ I)-com-
 method. This can be taken to imply either that particles ponent field whose propagator is calculated in Sec. VII.
with .i~ ~ cannot be elementary, or it might be inter- More general fields are considered briefly in Sec. VIII.
preted as a shortcoming of the Lagrangian approach.          The propagator for 2j+l- and 2(2j+I)-component
    (3) Pauli's proof' of the connection between spin and fields involves a set of matrices which transform like
 statistics is straightforward for integer j, but rather symmetric traceless tensors of rank 2j, and which form
 indirect for half-integer j. We take the particle inter- the natural generalizations of the 2X2 vector {a,l} and
 pretation of "'n(X) as an assumption, and are able to the 4X4 vector 'I" respectively. These matrices are
 show almost trivially that (1.8) makes sense only with discussed in two appendices, where we also derive the
 the usual choice between commutation and anticom- general formulas for a spin .i propagator. The 2.i+ I
 mutation relations. Crossing symmetry arises in the X2j+ I propagators for spin j~3 are listed in Table I,
 same way.                                                   and the 2(2j+1)X2(2j+1) propagators for j~2 are
     (4) By avoiding the principle of least action, we are listed in Table II.
 able to remain somewhat closer throughout our de-               This article treats a quantum field as a mere artifice
 velopment of field theory to ideas of obvious physical to be used in the construction of an invariant S matrix.
 significance.                                               It is therefore not ur.likely that most of the work pre-
    At any rate the ambiguity in choosing JC(x) is no sented here could be translated into the language of
 worse than for ",(x). The one place where the La- pure S-matrix theory, with unitarity replacing our
 grangian approach does suggest a specific interaction assumptions (I) and (3).
 is in the theory of massless particles like the photon and
 graviton. Our work in this paper will be restricted to         TABLE II. The scalar matrix <9(q) = _i              i qIlIQ"'2' .. qjJzi
                                                                                                                       2i-yjJJjJ2 ··jJ2
 massive particles, but we shall come back to this point for spins j ~ 2. In each case
 in a later article.
   The transformation properties of states, creation and                                             ~~[tj) ~(iJJ.
annihilation operators, and fields are reviewed in Sec.               The propagator for a particle of spin j is
II. The 2j+ I-component field is constructed in Sec. III                               Seq)   ~   -im-21[ rJ>(q)+m,jJI<f+m'-i,.
sO that it satisfies the transformation rule (1.7). The
"causality" requirement (1.8) is invoked in Sec. IV,                   rJ>(O)(q)~1
yielding the spin-statistics connection and crossing
                                                                      rJ>(l/"(q) ~'1'i3-2(q '~h5i3
                                                                       rJ>(l)(q) ~   -g'i3+2(q·~) (q'~i3-q"~5m
  2 In this connection, it is very interesting that a Hamiltonian
without particle creation and annihilation can yield a Lorentz-       rJ>(3!2l (q) ~ -q'('1'i3-2q· ~Y5i3)+![(2q. ~)'- q'][3'1'i3-2q ·~.h5i3]
invariant S matrix, but not if we use perturbation theory. See
                                                                        rJ>(2)(g) ~ (-q')'i3-2g'(q' ~)[w gji3-q"y..s]
R. Fong and J. Sucher, University of Maryland (to be pubhshed).
  3 J. Schwinger, Phys. Rev. 130, 800 (1963).                                                         +Hw,m[(q-gj)'-q'][q' ~i3-2q"y..s]
  4 W. Pauli, Phys. Rev. 58, 716 (1940).
132                                                                                                             CHAPfERII
U[A] I p,u)= [w(Ap)/w(p)]'" L:"IAp,u')                          the physical Hilbert space, or the ordinary complex conjugate of
                                                                a (; number or a c-number matrix. A dagger is used to indicate the
                     XD.,.W[L-I(Ap)AL(p)].             (2.8)    adjoint of a ,-number matrix. Other possible statistics than
                                                                allowed by (2.10) will not be considered here.
  • E. P. Wigner, Ann. Math. 40, 149 (1939).                       I Reference 6, Eq. (4.22) .
REPRESENTATIONS OF THE POINCARE GROUP                                                                                         133
   We speak of one particle as being the antiparticle        It follows from (2.3) that
of another if their masses and spins are equal, and all
their charges, baryon numbers, etc., are opposite. We                             [J i,J;] = i'ij,J k ,                      (2.24)
won't assume that every particle has an antiparticle,                           [Ji,K;] = i'ij,K.,                           (2.25)
since this is a well-known consequence of field theory,
which will be proved from our standpoint in Sec. IV.                            [Ki,Kj] = -i'i;.I..                          (2.26)
But if an antiparticle exists then its states will trans-    The J generate rotations and the K generate boosts.
form like those of the corresponding particle. In par-       In particular, the unitary operator for the finite boost
ticular, the operator b*(p,<T) which creates the anti-       (2.4) is
particle of the particle destroyed by a(p,<T) transforms                     U[L(p)J=exp(-ip·KO).               (2.27)
by the same rule (2.16) as a*(p,<T):
                                                               The co.nmutation rules (2.24)-(2.26) can be de-
U[Ajb*(p,<T)U-'[A]                                           coupled by defining a new pair of non-Hermitian
  = [w(Ap)/w(p) Jl/2 Lu' (CDUl[L-1( p)A-IL(Ap) ]C-l} uu'     generators:
                                  Xb*(Ap,<T'). (2.17)                          A=~[HiK],                 (2.28)
To some extent this is a convention, but it has the ad-                             B=![J-iKJ,                               (2.29)
vantage of not forcing us to use different notation for      with commutation rules
purely neutral particles and for particles with distinct
antiparticles.                                                                         AxA=iA,                               (2.30)
   It cannot he stressed too strongly that the trans-                                  BxB=iB,                               (2.31)
formation rules (2.12) and (2.17) have nothing to do
with representations of the homogeneous Lorentz group,                               [Ai,BjJ=O.                              (2.32)
but only involve the familiar representations of the
ordinary rotation group. If a stranger asks how the spin               +
                                                             The (2A 1) (2B+ I)-dimensional irreducible repre-
states of a moving particle with j = 1 transform under       sentation (A ,B) is defined for any integer values of
some Lorentz transformation, it is not necessary to ask      2A and 2B by
him whether he is thinking of a four-vector, a skew                         (a,bl   AI a',b')=Dbb'Jaa,(A),                   (2.33)
symmetric tensor, a self-dual skew symmetric tensor,
or something else. One need only refer him to (2.16) or                      (a,bIBla',b')=Daa,Jbb'(B),                      (2.34)
 (2.8), and hopc that he knows the j = 1 rotation
matrices.
                                                             where a and b run by unit steps from - A to A and           +
                                                             from - B to +B, respectively, and JW is the usual
   The complexities of higher spin enter only when we        2j+ I-dimensional representation of the rotation group:
try to use a(p,<T) and b*Cp,O') to construct a field which
transforms simply under the homogeneous Lorentz               (.T/iJ±iJ /I)u'u= Du' .u±l[ (j=F<T) (j±<T+ I)Jl/2,             (2.35)
group. We will need to usc only a little of the classic              (J,(j)u'u= Du'u<T.
theory of the representations of this group, but it will
be convenient to recall its vocabulary. Any representa-      The representations (A,B) exhaust all flnite dimen-
tion is specified by a representation of the infinitesimal   sional irreducible representations of the homogeneous
Lorentz transformations. These arc of the form               Lorentz group. None of them are unitary, except for
                                                             (0,0).
                                                    (2.18)      We will be particularly concerned with the simplest
                                                             irreducible representations (j,O) and (O,j). These are
where the w's form an infinitesimal "six-vector"
                                                             respectively characterized by
                                                    (2.19)             J -> JU),     K -> -iJU), for            (j,0)        (2.36)
The corresponding unitary operators are of the form          and
                                                                       J -> JW,      K -> +iJW,           for   (O,j),       (2.37)
               U[Hw J= H (i/2)J"w"',                (2.20)
                                                             where J<il is given as always by (2.35). We denote the
                                                             2j+ I-dimensional matrix representing a finite Lorentz
                                                             transformation A by DW[AJ and DW[AJ in the (j,O)
It is very convenient to group the six operators J" into     and (O,j) representations, respectively. The two repre-
two Hermitian three-vectors                                  sentations are related by
                                                    (2.22)                                                                   (2.38)
(2.27) and (2.36) or (2.37) by                                     section. It is clear that this is the most general linear
                                                                   combination of the a's and the b*'s which has the simple
               DCil[L(p)J=exp(      _po JCilO) ,         (2.39)
                                                                   Lorentz transformation property
               DCil[L(p)J=exp(+p. JCilO) ,               (2.40)
                                                                   U[A,a]I'.(x)U-l[A,a]
with sinhB= I pi/m. For pure rotations both D(j)[R]                                          =L' D••,(i)[A-l]I'.,(Ax+a).      (3.7)
and DCil[RJ reduce to the usual rotation matrices.
                                                                   [We choose to combine a and b*, so that I'.(x) also
                                                                   behaves simply under gauge transformations.]
             Ill. 2j+l-COMPONENT FIELDS
                                                                     In terms of the original creation and annihilation
   We want to form the free field by taking linear com-            operators, the field is
                                                                                        f
binations of creation and annihilation operators. The
transformation property under translations required by                                      d'p
                                                                   1'. (x) =   (2 ..)-3/' - - -
(1. 7) forces us to do this by setting the field equal to                                   [2W(p)Jl/2
some sort of Fourier transform of these operators. But
(2.12) and (2.17) show that each a(p,.-) and b*(p,.-)                                XL [W •• ,(il[L(p)]a(p,..')e'P"
behaves under Lorentz transformations in a way that                                    "
depends on the individual momentum p, so that the                                   +I1{DW[L(p)]C-1) ..,b*(p,..')e-'P"] , (3,8)
ordinary Fourier transform would not have a covariant
character. In order to construct fields with simple                We have already derived a formula [Eq. (2.39)J for
transformation properties, it is necessary to extend               the wave function appearing in (3.8):
D(j)[RJ to a representation of the homogeneous Lorentz
group, so that the p-dependent factors in (2.12) and                               D..,W[L(p)]= {exp( _po J(/)O) ..,.
(2.17) can be grouped' with the a(p,,-) and b*(p,.-).
There are as many ways of doing this as there are                    The field obeys the Klein-Gordon equation
representations of the Lorentz group, but for the present                                    (O'-m')I'.(x)=O,                 (3.9)
we shall use the (j,O) representation defined by (2.36)
and (2.35). [The (O,j) representation will be considered           but it does not obey any other field equations, As dis-
in Sec. VI, the (j,0)® (O,j) in Sec. VII, and the general          cussed in the introduction, we consider this to be a
case in Sec. VIII. J                                               distinct advantage of the (j,O) representation, because
   Having extended the definition of the 2j+1X2j+l                 any field equation [except (3,9)J is nothing but a con-
matrix DCil in this way, we can split the rotation matrix          fession that the fi,eld contains superfluous components,
appearlUg in (2.12) and (2.17) into three factors                     If a particle has no antiparticle (including itself)
                                                                   then we have to set '1=0 in (3.6) and (3,8), In the
D(j)[L-l(p)A-1L(Ap)]
                                                                   other extreme, a theory with full crossing symmetry
          =DCil-1[L(p)]D(i)[A-']DW[L(Ap)].                (3.1)    would have IIII = I~ I· We will now show that the choice
This allows us to write (2.12) and (2.17) as'                      of ~ and 11 is dictated by requirement (1.8), and hence
                                                                   essentially by the Lorentz invariance of the S matrix,
 U[A]a(p,.-)U-l[A]= :E., D•• ,W[A-l]a(Ap,.-') , (3.2)
 U[A]Ii(p,.-)U-l[A] = L, D•• , (/>[A-l]1i (Ap,.-') ,      (3.3)                     IV. CROSSING AND STATISTICS
with                                                                 We are assuming, on the basis of their particle in-
                                                                   terpretation, that the a's and b's satisfy either the usual
a (p,.-) = [2w(p)J'/' :E., D••,W[L(p)]a(p,u') ,           (3.4)
                                                                   Bose commutation or Fermi anticommutation rules:
Ii(p,.-)= [2w(p)J'/':E.' {DW[L(p) ]C-'} •• ,b'(p,.-'). (3.5)
                                                                                    [a(p, ..),a* (p, ..')J±= a(p- p')a.."
   The operators a and Ii transform simply, so the field                                                                      (4.1)
                                                                                    [b(p, .. ),b*(p, ..')J±= a(p- p')a.., ,
can be constructed now by a Lorentz invariant Fourier
                 f
transform                                                          with all others vanishing. It is then easy to work out
                      d'p                                          the commutation or anticommutation rule for the field
1'. (x) = (211')-'/' - -                                           defined by (3.8):
                     2w(p)
                                                                   [q>.(x),q>.,t(y)J±
                                                                         ".'If
                   X[~(p, .. )eip·'+'1Ii(p,..)e-ip"J,     (3.6)
where the matrix II (p) is given by                                         The field is now in its final form:
                m-2iII(p,w)=DW[L(p)]DW[L(p)]t                    (4.3)
II ..,(p) = (- )'jt ..,m· .. ·..jP •• P.,.·· P.,j, (4.5) The commutator or anticommutator is
        X   f   d'p
                --(\~I'exp[ip·(x-y)]
                2w(p)
                        ±(-),jh:'exp[-ip,(x-y)]l.                (4.6)
                                                                                          V. THE FEYNMAN RULES
  It is well known' that such an integral will vanish                        Suppose now that the interaction Hamiltonian is
outside the light-cone if, and only if, the coefficients of
                                                                          given as some invariant polynomial in the lOa(X) and
exp[ip· (x-y)] and exp[ -ip· (x-y)] are equal and
opposite, i.e.,                                                           their adjoints. For example, the only possible non-
                                                                          derivative interaction among three particles of spin it,
                                                                 (4.7)    j" and js would be
Thus the requirement of causality leads immediately
to the two most important consequences of field theory:
   (a) Statistics: Eq. (4.7) makes sense only if
(4.8)
so a particle with integer spin must be a boson, with a                   the "vertex function" being given here by the usual
(-) sign in (4.1), while a particle with half-integer spin                3j symbol.
must be a fermion, with a (+) sign in (4.1).10                              The S matrix can be calculated from (1.1) by using
   (b) Crossing: Eq. (4.7) also requires that                             Wick's theorem as usual to derive the Feynman rules:
                               I~I =   I'll·                      (4.9)      (a) For each vertex include a factor (-i) times
                                                                          whatever coefficients appear with the fields in X(x).
Thus every particle must have an antiparticle (perhaps                    For example, each vertex arising from (5.1) will con-
itself) which enters into interactions with equal coupling                tribute a factor
strength. There is no reason why we cannot redefine
the phase of a(p,O") and b*(p,O") and the phase and
normalization of lOa(X) as we like, so Eq. (4.9) allows
                                                                                                              j,) .              (5.2)
us to take
                                                                                                             0",
                                 i;='1/=1                        (4.10)      (b) For each internal line running from a vertex at
                                                                          x to a vertex at y include a propagator
without any loss of generality.
                                                                          (r{ 10. (x) lOa' tty)} )o=8(x-Y)(lOa(X)lOa' t(y»o
  10   As a demonstration that the causality requirement cannot be
satisfied with the wrong statistics, this is certainly inferior to the                            +(- )2i8(y-X)(IO.,t(y)IO.(X»0 (5.3)
more modern proof of P. N. Burgoyne, Nuovo Cimento 8, 007
(\958). Our purpose in this section i. to show that causality can            (c) For an external line corresponding to a particle
be satisfied, but only with the right statistics and with crossing
symmetry.                                                                 of spin j, J,=p., and momentum p, include a wave
136                                                                                                                                 CHAPTER II
function
                                     1
                             -----.D./j)[L(p)] exp(ip·x)                                           [particle destroyed],
                             [2w(p)]112(211")'/2
                                      1
                             - - - - - D•• w*[L(p)] exp( -ip·x)                                    [particle created],
                             [2W(p)]'/2(211-)'12
                                                                                                                                              (5.4)
                                           1
                            -----I[DW[L(p)JG-IJ,. exp(-ip·x) [antiparticle created],
                            [2w(p) J'12(211")'/'
                                    1
                             -----[DUl[L(p)]C-I].: exp(ip·x) [antiparticle destroyed].
                             [2w(p)J1I2(211")'I'
These wave functions can be calculated from Eq.                                         where -iAC(x-y) is the usual spin-zero propagator:
(2.39). In conjunction with (4.4), this tells us that
                                                                                        -iAC(x)=i8(x)<4(x)+i8( -x)<4( -x)
                     D(j)[L(p)]=m-'IJIW{p') ,                                   (5.5)                               =![~,(x)+iE(X)~(X)J       (5.9)
                                                                                        and, as usual,
where the 4-vector p' is defined to have 8' =8/2, i.e.,
                                                                                                          E(x)=8(x)-8( -x),
            p'= {ji[im(w-m)JI/2Hm(w+m)JI'}.                                     (5.6)                    ~,(x)=i[~+(x)+<4( -x)],             (5.10)
The matrix II(j) is calculated in the Appendix; see also                                                 ~(x)=<4(x)-<4( -x).
Table I.
  (d) Integrate over all vertex positions x, y, etc. and                                It is well known that ~C(x) is scalar, because E(X) is
sum over all dummy indices II, II', etc.                                                scalar unless x is spacelike, in which case ~(x)=O.
  (e) Supply a (-) sign for each fermion loop.                                          Using the tensor transformation rule (A.5) for the
  The problem still remaining is to calculate the pro-                                  t·,· .. we find that
pagator (5.3). An elementary calculation using (4.11)                                                 DW[A]S(x)D(j)[AJt=S(Ax).               (5.11)
and (4.3) gives
                                                                                        This is just the right behavior to guarantee a Lorentz-
(""(x)",,. try»~.                                                                       invariant S matrix.
               =                 f
                                 d3p
                    (2".)-'m-'; --IT ••• (p) exp[ip· (x-y)]
                                2w{p)
                                                                                           But unfortunately the propagator (5.3) arising from
                                                                                        Wick's theorem is not equal to the covariant propagator
                                                                                        Sex) defined by (5.8), except for j=O and j=i. The
(", •• t(y) ",.{x».                                                                     trouble is that the derivatives in (5.8) act on the E
                             f
                                                                                        function in ~C(x) as well as on the functions A and A,.
                                     d3p                                                This gives rise to extra terms proportional to equal-
           = (2".)-'m-2; - - I I ••• {p) exp[ -ip· (x-y)J.
                                 2w(p)                                                  time 6 functions and their derivatives. These extra
                                                                                        terms are not covariant by themselves, but are needed
Formula (4.5) for IT(p) lets us write this as                                           to make Sex) covariant; we must conclude then that
                                                                                        (5.3) is not covariant.
(",.(x) ", •• t(y»o                                                                        For example, for spin 1 Eq. (5.3) gives
     =i( -im)-';t••. •IPI "·· li8Pl8., . ·8"'i~+(X-Y),                          (5.7)
                                                                                        (T{ ",.(x) ", •. t(y)} )o=iim-'t.... ,
(- )';(1'., '(y)",,(x»o                                                                                     X [8.8,~, (x- y)+iE(X- y)8.8,~(x-y)],
      =   i( -im)-2it/lrl,,IIl,u2"    ·fJ2i0Jl.ldp,,· • . dIJ2iLl-t   (y- x),
                                                                                        while (5.8) gives
                                     J
where
                                                                                        S .., (x-y) = !im-'t••••'8.8,
                      1      d'p
               i<4(x)=-     --exp{ip·x).                                                                        X[~,(X-y)+iE(X-y)A(x-y)].
                    (211")' 2w(p)
                                                                                        The difference can be readily calculated by using the
  At this point we encounter an infamous difficulty.                                    familiar properties of ~(x). We find that
If the 8 function in (5.3) could be commuted past the
derivatives in (5.7), then the propagator (5.3) would be                                (T{ ",.(x)"", t (y)}).
                                                                                                         =S•• , (x-y)-2m-'t ••,oo8'(x-y) ,   (5.12)
S ••• (x-y) =      -it -im)-'it...• ..'··· ..i
                                     X8Pl8","       ·8pzi~C{X-y),               (5.8)   and the second term is definitely not covariant in the
REPRESENTATIONS OF THE POINCARE GROUP                                                                                           137
sense of Eq. (5.11). [This problem does not arise for spin       the covariant propagator
0, where there are no derivatives, nor for spin !, where                     -it -im)-2it••,·1••...•• j
                                                                 S••, (:1:-Y) =
there is just one derivative and the extra tenn is pro-
portional to                                                                            Xii.lii.,.· ·aP2j~C(x-y).               (5.8)
     t"~(x-y)ii•• (x-y)    = 2t"~(x-y)~(t'-)f1) =0.              Similar modifications are required when X(x) includes
                                                                 derivative interactions.
But it does occur for any j:?,1.]                                  The Feynman rules could also be stated in momentum
  This problem has nothing to do with our noncanonical           space. The propagator (5.8) would then become
approach or our use of 2j+ 1-component fields. For ex-
ample, in the conventional theory of spin 1 (using the
four-component (!,!) representation) the propagator is
                                                                 S •• , (q)=   f   d·xe-'··xS .., (x)
                                                                                      =   -i( -m)-2 i Il ••, (q)/q'+m'-i..     (5.13)
(T{A"(x)A,(y)})o
  = - (i/2)[(g".-m-2ii"ii')~I(X-Y)                               The monomials II(q) are calculated in the Appendix,
                       +i.(x-y)(g",-m-2iiiJ,)~(x-y)]             and presented explicitly for j~3 in Table 1.
  = -i(g",-m-2ii"ii,)~C(x-y)- 2m-2~"O~,O~'(x-y);
                                                                                             VI. T, C, AND P
so here also there appears a noncovariant tenn like                 The effect of time-reversal (T), charge-conjunction
that in (5.12). The general reason why the S matrix              (C), and space-inversion (P) on the free-particle states
turns out to be noncovariant is that condition (1.5) is          is well known. It can be summarized by specifying the
not really satisfied by an interaction like (5.1) if any of      transformation properties of the annihilation operators:
the spins are higher than !, because the commutators
(4.12) of such fields are too singular at the apex of the                      Ta(p,O')T-l=1JT L., C•• ,a( -p, 0"),             (6.1)
light cone.                                                                    Tb(p,O')T-l=~T       L., C •• ,b( -   p, 0"),    (6.2)
   The cure is well known. We must add noncovariant
"contact" tenns to x (x) in such a way as to cancel out                        Ca(p,O')C-l=1J cb(p,O'),                         (6.3)
the noncovariant terms in the propagator. If we used a                         Cb(p,O')C-l=~ca(p,O'),                           (6.4)
Lagrangian fonnalism, then such noncovariant contact
tenns would be generated automatically in the transi-                          Pa(p,O')p-l=1Jpa( - p, 0'),                      (6.5)
tion from £(x) to X(x), although the proofll of this
                                                                               Pb(p,O')P-l=~pb( -p,        0').                 (6.6)
general Matthews theorem is very complicated. For
our purposes it is only necessary to remark that we take         The ~'s and ~'s are phase factors 12 representing a degree
the invariance of the S matrix as a postulate and not a          of freedom in the definition of these inversions. The
theorem, so that we have no choice but to add contact            operator T is antiunitary, while C and P are unitary.
terms to X(x) which will just cancel the noncovariant            The matrix C••' was defined in Sec. II, and has the
parts of the propagator, such as the second term in              properties
(5.12).                                                                             CJWC-l=-JW*,                       (6.7)
   In summary, we are to construct the S matrix ac-
                                                                                          C*C=(-)2i; CtC=1                      (6.8)
cording to the Feynman rules (a)-(e), but with the
slight modifications:                                            where JW are the usual 2j+ 1- dimensional angular-
   (a') Pay no attention to the noncovariant contact             momentum matrices.
interactions; compute the vertex factors using only the            In order to describe the effect that C and P have on
original covariant part of X(x).                                 the field '1'. (x), it will be necessary to introduce a
   (b') Do not use (5.3) for internal lines; instead use         second 2j+1-component field:
This is the field that we would have constructed in-             the (O,j) representation of the Lorentz group:
stead of 'I'.(x) had we chosen to represent the "boost"
generators by
                                                                   U[A]X.(X)U-l[A] =            L., D••,W[A-l]X., (Ax) ,       (6.10)
(2.37) (6,11)
instead of Eq. (2.36). The field x.(x) transfonns under             12 For a general discussion of these phases, see G. Feinberg and
                                                                 S. Weinberg, Nuovo Cimento 14, 571 (1959). The discussion there
 II See, for example, H. Umezawa, Quantum Field Theory (North-   was limited to (0,0), (!,!), and (!,O)ffi(OJ) fields, but can be
Holland Publishing Company, Amsterdam, 1956), Chap. X.           easily adapted to the general case.
138                                                                                                                   CHAPTER II
                                                                                                                °]
                                                                       where
Any other choice of the fj would result in the creation
                                                                                                 DW[A]
and annihilation parts of 1". and X. transforming with
different phases, destroying the possibility of simple
transformation laws."      .
                                                                                    !I)W[AJ= [
                                                                                                .°          _,
                                                                                                            DW[A]
                                                                                                                                (7.3)
   If a particle is its own antiparticle then we call it               the representations DW and jjw being defined by
"purely neutral," and set                                              (2.36) and (2.37) respectively. The representation !I)w
                                                                       can be defined also by specifying that the generators
                          a(p,u)=b(p,u).                     (6.20)    of rotations are to be represented by
In this special case the (j 0) and (0 j) fields are related
by
              X.t (x) = L:., C.., 1".' (x) ,         (6.21)
                                                                                          3 w=   [ °°JW]
                                                                                                     JW'
                                                                                                                                (7.4)
                 I".t(x) = (- )'iL:., C••' X.' (x) .         (6.22)    and that the generators of boosts are represented by
The fields are not Hermitian, except of course for j=O.                                      ~W=-haw,                           ~~
Nevertheless, Eq. (6.20) requires the phases ~I to be                  where 'Y. is the 2(2j+1)-dimensional matrix:
equal to the corresponding'1I, and (6.19) then implies
that these phases can only take the real values ± 1,
                                                                                                   °-1
except that ~ p must be ±i for purely neutral fermions.                                      'Y5=[1       0J.                   (7.6)
  We see that the fields I"'(x) and x.(x) transform
separately under T, and also under the combined                        This satisfies (2.24)-(2.26) because 'Y.'= 1.
operation CP:                                                             The (j,0)® (OJ) representation (7.3) differs from the
CPI".(X)P-IC-I=~C~p          L:., C..,-II".,t(-x, tJ),       (6.23)    (j,O) and (O,j) representations in the important re-
                                                                       spect that !I)t is equivalent to !I)-I:
CPX. (x) p-IC-I
                                                                                         !I)(;)[A]!=/1!I)(;)[A-']B,             (7.7)
               =~~p(-      )'iL:., C ••,-IX.,t(-x, tJ).      (6.24)
                                                                       where
[Under CPT the transformation law is just that of a
                                                                                                                                (7.8)
  13   An important consequence is that a particle-antiparticle pair
has intrinsic parity
                            qpijp=(_)'i,
                                                                       [See Eq. (6.11).J This has the consequence that
a well-known result that would be inexplicable on the basis of
nonrelativistic quantum mechanics.                                             U[AJ~.(X)U-l[AJ=L:p ~p(Ax):l)p.(j)[AJ,           (7.9)
REPRESENTAnONS OF THE POINCARE GROUP                                                                                                              139
where if, is the covariant adjoint                                        where II (g) and fi(g) are defined by (A.10) and (A.41).
                                                                          In the 2(2j+ I)-dimensional matrix notation this reads
                          if,(x)=·P(x)fI.                        (7.10)
  The T, C, and P transformation properties of ",.(x)                               [-y.,.2···.,'8,,8. 2 · ·8. 2 ,+m'J)f(x) =0,               (7.19)
can be read off immediately from (6.13)-(6.18):
                                                                          where the generalized 'I matrices, '1.1• 2"', are defined by
          T"'(x)T-'=~Te",(X, -x")                                (7.11)
                         ~ce-I{3"'*(x)        (bosons),                                                                                       (7.20)
         C",(X)C-I =                                             (7.12)
                         ~ce-l'15f!l/t*(X)        (fermions),
          P"'(X)P-I=~p{3"'( -       x, x"),                      (7.13)   and are discussed and evaluated in Appendix B.
with                                                                         The field'" obviously obeys causal commutation re-
                                 C 0                                      lations, since 'P and X commute with both \Ot and xt
                          e= [ ]                                 (7.14)   at spacelike separations. Its homogeneous Green's
                                 o C                                      functions are
A purely neutral particle will have a field which satisfies
the reality condition                                                     ("'a(x)if,~(y)o= (271')-'m- 2]j.!!....M a~(p)
                        e-'fI"'*(x) (bosons)                                                                       2w(p)
              "'(x) =                                           (7.15)
                        e- I'I5{3"'*(X)   (fermions).                                                              Xexp{ip· (x-y)),           (7.21)
Its inversion phases ~T, ~C, ~ P must be real, except that
~ p = ±i for purely neutral fermions.
                                                                          (if,~(y)"'.(x)o= (271')-'1II-   2   ]j    d'p .Y a6 (p)
                                                                                                                   2w(p)
   The field "'(x) of course satisfies the Klein-Gordon
equation                                                                                                           Xexp{ip· (y-x)),           (7.22)
                   (OLm')",.(X) =0.                 (7.16)
                                                                          where
But "'(x) has twice as many components as the opera-
tors a(p,er) and b* (p,er), so it has a chance of also satis-
                                                                                                              m']        U(P)]
                                                                                            M(p)= [                              ,            (7.23)
fying some other homogeneous field equation. In fact,                                                     fi(p)          m']
it does. LTsing (A.12) and (:\.40), we can easily show
that the (j,O) and (O,j) fields are related by                                       (-m)2 j         U(p) ]
                                                                              X(p)= [ _                              =   (-)2]M(_p).          (7.2-1)
                  n(-i8)\O(x)=m2jx(x),                          (7.17)                U(p)          (-m)2]
                             . J3 p
              = (271')-3m-,jj --[O(x-ylM a6( -i8) exp{ip·                  (x-y)l+O(y-x)Ma~( -i8) exp{ip· (y-x))].                            (7.25)
                              2w(p)
As discussed in Sec. V, this is not the covariant propagator to be used in conjunction with the Feynman rules. We
must add certain noncovariant contact terms to (7.25) which allow us to move the derivatives in M (-i8) to the
left of the 0 functions. The true propagator is
where t:,.C(x) is the invariant j=O propagator (5.9).                     In momentum space we replace                   a. by iq., so that
This can be written in a more familiar form by using
(B.13) ; we find that
                                                                                   Seq) = -im-'J[ CP(q)+m 2j]N+m L                    i.,
                                                                          where
Sex) =im-2i['1","2 ". 2j8.,a.,.· ·8"j-m,j]t:,.C(x).             (7.33)
It is easy to see from (B.4) that this has the correct
                                                                          General formulas for CP(g) are given in Appendix B;
transformation property:
                                                                          the results for j ~ 2 are in Table 2. The wave functions
              ;I)Ul[A]S(x) ;I)(jH[A] = S(Ax) .                            for creation and annihilation of particles and anti-
140                                                                                                                   CHAPTER II
particles can be read off from (7.1), (4.11), and (6.9), or going back to a and b*
                                                                             f
or alternatively found from the solutions of (7.19).
This whole formalism reduces to the Dirac theory for
j=i·
                                                                 1/t.(x) =       d'p ~ [u.(p,u)a(p,u)e'P"
                                                                                                 +vn(p,u)b*(p,u)e-iP"J,          (8.8)
                VIII. GENERAL FIELDS
   We started in Sec. III by introducing a field I"(x)           where the "wave functions" in (8.8) are
which transforms according to the (j,0) representation.          u.(p,u)= (2 ... )-'/2[2w(p)j'/2 L D.m[L(p)]it..(u) , (8.9)
Then, in order to discuss parity conserving theories,
we introduced the (O,j) field x(x) in Sec. VI and used
it in Sec. VII to construct a field 1/t(x) which transforms      V.(p,u) = (2...)-·/2[2w(p)J-l/2     L Dnm[L(p)J
                                                                                                     m,tT'
under the (reducible) representation (j,0)~ (O,j). These
particular fields have the advantage of depending very                                                       XV.. (U')C.,.-l.   (8.10)
simply and explicitly on the particular value of j, but
1", x, and 1/t are certainly not otherwise unique. In fact,      This field transforms correctly
the usual tensor representation of a field with integer j        U[A,aJ1/t.(x)U-l[A,aJ= L mD.,.[A-IJ1/tm(Ax+a). (8.11)
is (j12,j/2), while the Rarita-Schwinger representation
for half-integer j is based on the (2j+ 1)2-dimensional          It obeys the Klein-Gordon equation, and mayor may
reducible representation:                                        not obey other field equations as well. The causality
                                                                 condition (1.8) can be satisfied if we choose
                                   2j -l 2j-l)
           m,O)~(O,t)J® ( -4-'-4- .                                              L   u.(u)Um*(u) = L v.(u)V,.*(u) ,             (8.12)
Our simpler fields agree with these conventional repre-          and if we use the usual connection between spin and
sentations only for the case j = t.                              statistics. We will not pursue these matters further here.
   We now consider the general case. Let D.m[AJ be                  The chief point to be learned from this general con-
any representation (perhaps reducible) of the Lorentz            struction is that the wave functions (8.9), (8.10) which
group. Assume that when A is restricted to be a rota-            enter into the Feynman rules are always determined by
tion R, the representation D[RJ contains a particular            the matrices Dnm[L(p)J representing a boost.
component DW[R]. By this we mean that there must
be a rotation basis of vectors u.(u), such that                                        ACKNOWLEDGMENTS
                f     d3p
1/t.(x) = (2...)-'/2 --[a.(p)e'P·'+fJ.(p)e-ip"J, (8.7)
                     2w(p)
                                                                    We shall instead show here that this construction
                                                                 of a vector out of two-dimensional matrices can be
                                                                 directly generalized to the construction of a tensor of
REPRESENTATIONS OF THE POINCARE GROUP                                                                                                          141
This can be rewritten to give z2fH as a polynomial of                         Setting this equal to - I'q, then gives (At).
order 2j in z. It follows then that IT(j)(q) must itself
be such a polynomial, since all powers of z beyond the                       To go through this sort of calculation for general j
2jth in the Taylor series for the exponential can be                       would be tedious and difficult. We shall approach the
reduced to polynomials in z of order 2j.                                   problem of representing exp( - 20z) as a polynomial in z
   For example, in the case of spin j=!, Eq. (A24)                         more directly. First split it into even and odd parts,
gives, z'= I, so that                                                                        exp( -Oz) =coshOz-sinhOz.         (A26)
  exp( -zO) = l-zO+!O'-tzO'+' .. =coshO-z sinhO.                            We consider separately the cases of j integer and
                                                                            half-integer.
Then (AI2) gives
                                                                                                    1. Integer Spin
ITO/2) (q) =m[coshO- 2(g· JO/2» sinhOJ                                       The eigenvalues 2j, 2j- 2, etc., of the Hermitian
                               =q'-2(q.J(lf2».                    (A2S)     matrix z= 2(g· J) are even integers. If follows thatl '
                                                              2. Half-Integer Spin
     The eigenvalues 2j, 2j- 2, etc., of z= 2(q· J) are now odd integers. It follows that"
                                                       i-l/'   (z2-1 2)(z'-32). "(Z'-(211-1)2)                 ]
                           coshzO= coshe[ 1+            L:                                             sinh 2"O ,              (A31)
                                                        n-l                    (Zn)!
                                                        ;-1/2    (z2-1')(z'-3 2) ... (Z2_ (2n-l)2)              ]
                           sinhzO= z sinhB[ 1+           L:                                              sinh2"O .             (A32)
                                                         n-l                   (ZII+I)1
   "For (A27) and (A31) see, for example, H. B. Dwight, Table oj Integrals and Other Mathematical Data (The Macmillan Company,
 New York, 1961), fourth edition, formulas 403.11 and 403.13, respectively. Equations (A28) and (A32) can be checked by
 differentiating with respect to 8; we get (A27) and (A31). I would like to thank C. Zemach for suggesting the existence of such ex-
 pressions and a method of deriving them.
REPRESENTATIONS OF THE POINCARE GROUP                                                                                                                                         143
or
                                                1
IIUI(q) = (_q2)H12[q'-2q. J]+-( _q2)H/2[(2q· J)'-q'][3q"-2q· J]
                             31
                                                                     1
                                                               +-( _q2)i- 5/ 2[(2q· J)2-q'][(2q· J)'- (3q)'][5q"-2q' J]+" '.                                                (A34)
                                                                51
The series (A34) cuts itself off after j+~ terms. The                                        It follows immediately from (A12) and (A40) that
terms we have listed suffice to caleula te II for j = ~, t ! ;                                           lI Ul (q)fi(j)(q) = fi(j) (q)IIU)(q) = (_q2)';,                    (A46)
the results are in Table I.
   Having calculated II(g), the coefficients lem'" may                                  Substitution of (A1O) and (A41) into (A46) gives
be determined by inspection, For example, in the case
j = 1, Eq. (A30) gives                                                                  tIl- I1l2 "   'Mit v1 1''/,-' 'P2 j qPIQ/J-2'   .• qP2iq~lq~2' •• q~2i
                                                                                              = lPIP2" 'P2it"1~2" '~2iq/JlqP2' •• Q/J2jQ"lQ"2' •• Q"2i
            JICl)(q) = -q'+2(q. J)(q. J _gil),                              (:\35)
                                                                                                                                                           =     (-q')';.   (A47)
Setting this equal to t.'q.q, gives
                                                                                        Since this holds true for any q, we can use it to derive
                     100=1                                                              formulas for any symmetrized product of t and t. For
                     tU'=lill=+J,                                           (A36)       j=!:
                      lij= {J,,1;} -O'j.
                                                                                                                                                                            (A48)
Observe that this is traceless, because                                                      APPENDIX B: DIRAC MATRICES FOR ANY SPIN
            V=[2J'-3]-1=2(J'-2)=O,                                          (A3i)          We will use the 2j+ l-dimensional matrices I·'" ,
\I'e won't bother extracting the Ie'''' for j> 1, because                               t"'''' discussed in Appendix A to construct a set of
it is II(q) that we really need to know.                                                2(2j+ 1)-dimensional matrices:
   We could have gone through this whole analysis using
the (O,j) instead of the (j,O) representation in (A5),                                                                                                                       (Ill)
In that case we should have defined a symmetric trace-
less object 1·'·'" '." which is a tensor in the sense that
DUI[A]lm'"'.'IDCi)[A]t
                      =.A~/l.\~l2 ... AII2jIJ.2itvIV2' "1I2j,               (A38)
                                                                                                                        ~,=G -~l                                             (B2)
                                                                                                                          ~=C ~l
where D(j)[A] is the matrix corresponding to A in the
                                                                                                                                                                             (B3)
(O,j) representation:
                   D(jI[A] = D())["~-IJt,                                   (:\39)
                                                                                        Their properties follow immediately from the work of
The fundamental formula (:\12) would then read                                          Appendix A,
fi(iI (q) =m2jD(jl[L(q)]2= m2iD(jJ[L( - q)]2
                                                                                                                    1. Lorentz Transformations
                            = m'i exp(20q· JUI) ,                           (A40)
where                                                                                     It follows from (AS), (A38), and (A39) that the                                     ~'s
                                                                                        are tensors, in the sense that
Hence                                                                                   !DU)[A]~.,.',' '.';!D(j)-l[A]
                 tIl1 fJ'2 ... /J.2i=(±)tJl.1iJ.2 ......2
                                                        j ,
                                                                            (A42)                                               =A~tlA~l2 .. ·A"2/2j'Y~1~2 "~2j,             (B4)
                +
the sign being 1 or -1 according to whether the /-I'S
                                                                                        where !D Ci) is the (j,O)!JJ (O,j) representation
contain altogether an even or an odd number of space-
like indices. There is another relation between barred
and unbarred matrices which follows from (6.i):                                                                                                                              (B5)
                 fiCiI(q)*= ClI(j)(q)C-l ,                                  (A43)
and so                                                                                  Obviously              ~5   is a scalar
                                                                                                                         :DCii[A }y,:DCi)-l[A]=~"                            (B6)
Equation (:\44) in conjunction with (0\42) yields the
reality condition                                                                       but     ~     is not, because
                                                                           (A45)                                                ~=       _i-2;,),00",o                       (1l7)
144                                                                                                                                CHAPTER II
   I have not studied the algebra generated by these 'Y                                                       X[(2n+I)q"/l-2q·~'Yr.,Il].    (BI7)
matrices in detail, but there is one simple relation that
can be derived very easily. It follows from (A47) that                               The results for j   ~2   are presented in Table II.
for any q:
-yJlIJl'J.·' 'Jl1i'Y"1Jl'A" ·,I1iq"'lQI'2· .. Q",2;Q'lQ'2' . 'q""i= (q2)2(   (Bi0)                          5. Spin   !   and 1
Cancellation of the q's gives the symmetrized product                                  Table II gives
of two 'Y's as a symmetrized product of g". For example,
it follows from (B lO) that                                                                     (pCI/2) (q)= -i'Y'q.=qo/l- 2 (q·3hr.,ll ,
                                                                                     so that
j=!:         {-Y','Y'}=2g",                                                  (Bll)
j= 1 : {-y'P,'Y'~} + {-y",'YP~}+{-y''''YP'}
                      = 2[g"gp~+g'Pg'~+g"~g'p],                              (BI2)
and so on.                                                                                                                                  (BIB)
                                  4. Evaluation
   Comparison with (AlO) and (A4I) shows that
(p(q)   =   -t~j'Y"""       ·"iq..q.. ' .. q",/
                                                                                     Tbis is just the standard representation of the Dirac
                                                                                     matrices with 'Y5 diagonal.
                                              = [_ 0         n(q)].          (BI3)     For spin I, Table II gives
                                                  II (g)       0                         (pCl)(q)='Y"q.q.= -1//l+2(q'3) (q. 3/l-q°'Yr.,Il) ,
The matrix n(q) was evaluated in Appendix A, and                                     so that
fi (q) is just                                                                                           'YOO=/l ,
                fi(q) =n( - q, qO).        (BI4)
                                                                                                         'Y'''='Y''=3;'Yr.,Il,              (B19)
It follows that we can calculate (P(q) from the formulae
(A29) and (A33) for n(q), by making the substitution                                                     'Yii= {3i,3i}/l-~'j-/l.
        JW-.3Cj)'Y5
                             ,
                                  where 3Cil=[JCj)
                                                0
                                                                   OJ
                                                                 JCi)
                                                                             (ElS)
                                                                                        Notes added in proof. (I) The external-line wave
                                                                                     functions are much simpler in the Jacob-Wick helicity
                                                                                     formalism. They are given for both massive and massless
and then multiplying the whole resulting formula on                                  particles in a second article on the Feynman rules for
the right by fl. We find that for integer j:                                         any spin (Phys. Rev., to be published). (We also give
(pCj) (q) = ( - q')i/l                                                               general rules for constructing Lorentz-invariant inter-
                                                                                     actions involving derivatives, field adjoints, etc.) (2)
                  i-l ( - 1/)'"-1-0
                                                                                     It is not strictly necessary to introduce 2 (2j+ I)-com-
               +L                        (2q·~)[(2q·~)2_(2q)2]
                                                                                     ponent fields in order to satisfy P and C conservation,
                  0-<)   (2n+2)!
                                                                                     because the x. fields in (6.15) and (6.17) may be ex-
               X[(2q·~)2-            (4q)'} .. [(2q·~)2- (2nq)2]                     pressed in terms of «3. by using (7.17). I would like to
                                  X[2q'~/l- (2n+2)q"'Yr.,Il],                (BI6)   thank H. Stapp for a discussion on this point.
REPRESENT ATIONS OF THE POINCARE GROUP                                                                                                        145
        Marilyn E. Noz
        Department of Radiology. New York University. New York, New York 10016
        S.H.Oh
        L'aboratory oj Nuclear Science and Department of Physics. Massachusetts Institute of Technology,
        Cambridge, Massachusetts 02139
        (Received 7 June 1978; revised manuscript received 23 October 1978)
        Representations of the Poincare group are comtructed from the relativistic harmonic oscillator wave
        functions which have been effective in descnhing the physics of internal quark motions in the relativistiC
        quark model. These wave functions are solutions of the Lorentz-invariant harmonic oscillator differential
        equation m the "cylindrical" coordmate system moving with the hadronic velocity in which the time-
        separation variable is treated separately. This result enables us to assert that the hadronic mass spectrum
        is generated by the internal quark level excltatlon, and that the hadronic spin is due to the internal
        orbital angular momentum. An addendum relegated to PAPS contains diSCUSSions of detailed calculational
        aspects of the Lorentz transformation, and of solutions of the oscillator equation which are diagonal in the
        Casimir operators of the homogeneous Lorent7 group. It is shown there that the representation of the
        homogeneous Lorentz group consists of solutions of the oscillator partial differential equation in a
        "spherical" coordinate sy~tem in which the Lorentz-mvariant Minkowskian distance between the
        constituent quarks is the radial variable.
1341 J. Math. Phys. 20(7), July 1979 0022·2488179/071341·04$01.00 © 1979 American Institute of PhySK;s 1341
       P        =i~                                         (10)
                                                                       I,(z') =   (V-;2'n!)        '12H ,(Z')   exp( - z"/2),
        •        ax·                                                   1,(1')=    (V-;2 k k!) -II2Hk (I')eXp( -1"/2).
generate space-time translations. Lorentz transformations,
                                                                   If the excitation numbers. b.... ,k are allowed to take all possi-
which include boosts and rotations, are generated by
                                                                   ble nonnegative integer values, the solutions in Eq. (16) form
       M." = L :,. + L,n'                                   (11)   a complete set. However, the eigenvalues A takes the form
where                                                                   A=b+s+n-k.                                              (17)
                                                                   Because the coefficient of k is negative in the above expres-
       L:.,=i(Xp~ -x,~),
            r                ax"            ax"                    sion, A has no lower bound, and there is an infinite degener-
                                                                   acy for a given value of A.
       L." =         i(X• ..!.... - X, ...!....).
                           ax"           ax"                            In terms of the primed coordinates, the subsidiary con-
      The translation operators p. act only on the hadronic        dition of Eq. (9) takes the simple form
coordinate, and do not affect the internal coordinate. The
                     ;n.
operators L and L,n' Lorentz-transform the hadronic and                 (:r,    + l}p(X) =0.                                    (18)
internal coordinates respectively. The above ten generators
                                                                   This limits/, (I ') to/.(1 '), and the eigenvalue A becomes
satisfy the commutation relations for the Poincare group.
                                                                        A=b+s+n,                                                (19)
     In order to consider irreducible representations of the
Poincare group, we have to construct wave functions which          The physical wave functions satisfying the subsidiary condi-
are diagonal in the invariant Casimir operators of the group,      tion ofEq. (9) or (18) have nonnegative values of A.
which commute with all the generators ofEqs. (10) and (II).             As far as the x'. y', z' coordinates are concerned, they
The Casimir operators in this case are                             form an orthogonal Euclidean space. and/.(x'),f,(y'),/,,(z')
       PllP'l          and     Wf'~j'                       (12)   form a complete set in this three-dimensional space. The
                                                                   Hermite polynomials in these Cartesian wave functions can
where                                                              then be combined to form the eigenfunctions of W' which, ill
                                                                   terms of the primed coordinate variables. takes the form
                                                                        W'=M'(L')'.                                             (20)
The eigenvalues of the aboveP' and W' represent respective-
ly the mass and spin of the hadron.                                where
1342             J. Malh. Phys.• Vol. 20. No.7, July 1979                                                  Kim, Noz, and Oh     1342
REPRESENTATIONS OF THE POINCARE GROUP                                                                                                            147
       ",,"'m(x) = (lhT)'''[exp( - t ")]R,,(r')Y'm(O ',,p '), (21)                      r'   =   x' + y' + z'.
where r', () " ,p , are the radial and spherical variables in the                  In terms of this form, it is very inconvenient, if not impossi-
three-dimensional space spanned by x', y', z'. R.,(r') is the                      ble, to describe functions which are localized in a finite
normalized radial wave function for the three-dimensional                          space-time region.
isotropic harmonic oscillator, and its form is well known.                              In contrast to the above hyperbolic case, the wave func-
The above wave function is diagonal in W' for which the                            tions which we constructed in this paper are well localized
eigenvalue is / (/ + I)M', and / represents the total spin of the                  within the region
hadron in the present case. The quantum number m corre-
sponds to the helicity.
                                                                                        (z"      + t ") < 2,                                     (26)
     Since the eigenvalue p' of the Casimir operator P' is                         due to the Gaussian factor appearing in the wave functions.
constrained to take the numerical values allowed by Eq. (8),                       This elliptic form was obtained from the covariant
the hadronic mass is given by                                                      expression
M' = m o' + (), + I). (22) _ x"x" + 2(x·p/M)' = x" + y" + z', + 1 ". (27)
     Ifwe relax the subsidiary condition ofEq. (18), we in-                        The x' and y' variables have been omitted in Eq. (26) because
deed obtain a complete set. In this case, A of Eq. (17) can                        they are trivial. In terms ofz and t, the above inequality takes
become negative for sufficiently large values of k. For A> 0,                      the form
                                                                                                                               )'j < 2.
the solutions become
       l/fii'''''(x) = [V-;':-2'k,j        ,/2H,(t')[exp(-t"/2)]                        [ 1 - /3 (z + t)' + 1 + /3 (z - t                        (28)
                                                                                          1+/3              1-/3
                                                                                   We are therefore dealing with the function localized within
                        )( RA , <.I(r') Y'm(fJ',,p ').                      (23)   an elliptic region defined by this inequality, and can control
For..l < 0, the solutions take the form                                            the 1 variable in the same manner as we do in the case of the
                                                                                   spatial variables appearing in nonrelativistic quantum me-
       ifI''''' (x) =   [V--;2"        "(k - A)! j        'I'H, _"(I ')            chanics. This localization property together with the hyper-
                                                                                   bolic case is illustrated in Fig. 1. .
                        X [exp( - t ")]Ru(r')Y'm«()',,p ').                 (24)
The eigenvalues of P' and W' are again me' + (A + I) and
                                                                                   IV. CONCLUDING REMARKS
/ (l + I)M' respectively. In both of the above cases, k is al-                          We have shown in this paper that the wave functions
lowed to take all possible integer values.                                         used in our previous papers are diagonal in the Casimir oper-
1343          J. Malh. Phys .. Vol. 20. NO.7. JuLy 1979                                                                      Kim, Noz, and Oh    1343
148                                                                                                                                CHAPTER II
ators of the Poincare group, which specify covariantly the                   Kim and M.E. Noz, Prog. Theor. Phys. 57, IJ73 (1977); 60, 801 (1978);
mass and total spin of the hadron. These wave functions are                  Y.S. Kim and M.E. Noz, Found. Phys. 9, 375 (1979); Y.S. Kim, M.E. Noz,
                                                                             and S.H. Oh, "Lorentz Deformation and the Jet Phenomenon," Found.
well localized in a space-time region, and undergoes elliptic                Phys. (to be published). For review articles written for teaching purposes,
Lorentz deformation.                                                         see Y.S. Kim and M.E. Noz, Am. J. Phys. 46, 480, 486 (1978). For a review
                                                                             written for the purpose of form,ulating a field theory of extended hadrons,
     An addendum to this paper containing a discussion of                    see T.J. Karr, Ph.D. thesis (University of Maryland, 1916).
Lorentz transformation of the physical wave function and a                   'L.e. Biedenham and H. van Dam, Phys. Rev. D 9, 471 (1974).
construction of the representation of the homogeneous Lo-                    'T. Takabayashi, Phys. Rev. 139, BI381 (1965); S.lshidaandl.Otokozawa,
rentz group is relegated to PAPS.' It is shown there that                     Prog. Theor. Phys. 47, 2117 (1972).
                                                                             'See AlP document no. PAPS lMAPA-20-I336-12 for twelve pages of
solutions of the oscillator equation diagonal in the Casimir                 discussions of the Lorentz transformation of the physical wave functions,
operators of the homogeneous Lorentz group are localized                     and of the representations of the homogeneous Lorentz group. Order by
within the Lorentz-invariant hyperbolic region illustrated in                PAPS number and journal reference from American Institute of Physics,
Fig. I.                                                                      Physics Auxiliary Publication Service, 335 East 45th Street, New York,
                                                                             N.Y. 10017. The price is $1.50 for each microfiche (98 pages), or $5 for
                                                                             photocopies of up to 30 pages with SO. 15 for each additiona1 page over 30
                                                                             pages. Airmail additional. Make checks payable to the American Institute
'E.P. Wigner, Ann. Math. 40,149 (1939).                                      of Physics. This material also appears in Current Physics Microfilm, the
'Y.S. Kim and M.E. Noz, Phys. Rev. D 8,3521 (1973), 12, 129 (1975); IS,      monthly microfilm edition of the complete set of journals published by
 335(1977); Y.S. Kim, 1. Korean Phys. Soc. 9, 54(1976); 11, I (1978); Y.S.   AlP, on the frames immediately following this journal article.
1344       J. Math. Phys., Vol. 20, No.7, July 1979                                                                     Kim, Noz, and Oh         1344
REPRESENT ATIONS OF THE POINCARE GROUP                                                                                                     149
A simple method for illustrating the difference between the homogeneous and
inhomogeneous Lorentz groups
        Y. S. Kim
        Center for Theoretical Physics, Department of Physics and Astronomy, University of Maryland. College
        Park. Maryland 20742
        Marilyn E. Noz
        Department of Radiology, New York University, New York. New York 10016
        S. H. Oh
        Laboratory for Nuclear Science and Department of Physics. Massachusetts Institute of Technology.
        Cambridge. Massachusetts 02139
        (Received 5 January 1979; accepted 12 June 1979)
892 Am. J. Phys. 47( 10). Oct. 1979 0002-9505/79 J100892-06$00.50 © 1979 American A~sociation of Physics Teachers 892
893    Am. J. Phys., Vol. 47, No. 10, October 1979                                                          Kim, Noz. and Oh    X93
REPRESENTATIONS OF THE POINCARE GROUP                                                                                                               151
                                                   LORENTZ                          where
                                                                                                     r' = (X'2 + y'2 + Z'2)1/2,
                                                                                    Rp,(r') is the normalized radial wave function for the os-
                                                                                    cillator, and its form is well known. The total wave func-
                                                                                    tion now takes the form
                                                                                               >/;~'mk(X) =   R p ,(r')Y'!'((J',1>')fdl'),      (23)
                      p-,_
                         -    nlo
                                    ,   + -I (()2
                                              --'-' -        ")
                                                          XI-'    .        (19)                       K = -i
                                                                                                          I
                                                                                                                (x'()t~ + t~)(}Xi'
                                                                                                                                                (27)
                                          2 ()x,t
   With the above forms of the Casimir operators, it is not                         where i = 1,2,3.
difficult to construct normalizable wave functions which                              The rotation around the n axis through an angle ~ is
are diagonal in Wl and p'. First, we observe that the os-                           represented by
cillator equation of Eq. (6) is Lorentz invariant and is sep-
arable also in the x'. y'. z'. t' variables. We can therefore                                         R(n.O = exp[ -i~o· L),                    (28)
choose the cylindrical coordinate system in which the ('
                                                                                    and its mathematics is well known. The Lorentz boost along
variable is treated separately, and write the wave function                         the direction n by '7 is
as
                           >/;(x) = f(t')g(x'),                            (20)                       T(o,1)) = exp[ -i'7o' K),                 (29)
g(x') = R",(r') Yf"(Ii',1>'). (22) If we apply these operators to the wave functions of Eq (23)
S94     Am. J. Ph}:>. .. Vol. 47.   ~o     10. October 1979                                                                 Kim. Noz, and Oh     894
152                                                                                                                                       CHAPIERII
  with {3        =0, we obtain                                                       It is shown in Ref. I that this expression can be simplified
       iK      .I,A' -
               3';'lm - 3
                         A((I +(21m ++ .1)(/- m + 1))'/2
                                       1)(21 + 3)
                                                                (J
                                                          YI+ I( .</»
                                                                        m
                                                                                     to
teach this point is to construct solutions of the same dif-                       The radial wave function in this case takes the form
ferential equation, Eq. (6), which are diagonal in the
Casimir operators of the Lorentz group given in Eqs. (I5)                                    R~..(p)   = pnLtn+il(p2)exp(-p2/2),                      (55)
and (16).                                                                         with f = 2(2/l + n), /l = 0,1,2,.... L~n+I)(p2) is the gen-
   I n terms of the rotation and boost generators, the Casimir                    eralized Laguerre function,13
operators take the form                                                              With this preparation, we now write the "angular"
                   C I =L2_K2,C2 =L.K.                                    (43)    function B as
If we evaluate C 2 using the explicit expression for Land K,                                     B~(a,O,</» = A~(a)Y7'(O,cP).                         (56)
this operator vanishes for the present spinless case. In order                    For the timelike region where t       II>"          we use the nota-
to construct solutions diagonal in C I, we use a hyperbolic                       tion
coordinate with the Lorentz invariant distance
                                                                                                        A~(a)   = T~(a),                              (57)
                                                                          (44)
                                                                                  and for the spacelike region,
where
                                                                                                        A~(a) = S~(a).                                (58)
   , = (x 2 + y2 + z2)1/2, t = ± pcosha, , = psinha,                              Then T~(a) and S~(a) satisfy the following differential
                                                                          (45)    equations, respectively:
for   It I >', and
                                                                                           ~   (sinh2aT~) -     [n(n   + 2) + 1(1 + l)lT~ =            0,
                     t   = p sinha, , = pcosha,                           (46)             (ja
for   It!
        <,. For both cases, we use the usual three-dimen-                                                                                             (59)
sional spherical coordinate for x,y,z:
                                                                                           ~   (cosh2aS~) -     [n(n    + 2) -        1(1 + I )lS~ =   o.
                             x = , sinO cos</>,                                            (ja
                             y = ,sinO sin</>,                            (47)                                                       (60)
                             Z   = 'cosO.                                         If 1 = 0, the solutions to the above equations take the
                                                                                  form
   In terms of p, a, 0, cP, the differential equation of Eq. (6)
takes the form                                                                                  T~(a)   = sinh(n + I )n/sinha,
                                                                                                                                                      (61)
                                                                                                S~(a)   = cosh(n + I )a/cosha .
where f = ±2().. + I) for the timelike and spacelike cases                                                      I   d)l T~(a),
                                                                                            T~(a) = (sinha)l (-'-h--d
respectively. The form of L is well known. The operator (Ll                                                       SIn    Q:'    a
                                                                                                                                                      (62)
- K2) takes the form
                                                                                                                         d)'
                                                                                                                I - - S~(a).
                                                                                            S~(a) = (coshn)l (' -
                                                                                                                  cosha dn
  (U - K') = -.-1- ~ (sinh 2a                 ~)   -     -.-'- L2, (49)
             slnh 2 a ()a                     ()lY       smh 2 a
                                                                                     The solutions given in Eqs. (61) and (62) become infinite
for   It I >'. and                                                                when a ~ 00. This means that the Lorentz harmonics are
                                                                                  singular along the light cones. At this point, we are tempted
              ,    ')    I   ( ()    h' () )                                      to make n imaginary in order to make T~(n) and Sn(a)
            (L- - K- = cosh'" ()" cos -" ()"                 + coshI 2"   I '
                                                                           ,-,
                                                                                  normalizable. In fact, this and other interesting possibilities
                                                                          (50)    have been extensively discussed in the literature. 14 However,
                                                                                  if n takes noninteger values, the radial wave function be-
for It I <,. We arc interested in representations which arc                       comes singular along the light cones. In either case, the
diagonal in the above operators.                                                  light-cone singularity is unavoidable.
   In order to construct the desired representation, we solve                        The wave functions which arc diagonal in the Casimir
the partial differential equation given in Eq. (4g) by sepa-                      operators ('I arc now
rating the variables
                                                                                               I/;'!;,(x) = R~(p)A;,(,,)Y7'(O.cP),                    (63)
                         I/;(x) = R(p)B(a./I.</»,                         (51 )
                                                                                  where R~, A~ arc given in Eqs. (55), and (57), and (58),
In terms of R(p) and B(n,(I,</», Eg. (48) is separated
                                                                                  respectively.
into
                                                                                     The localization property of the above solution is dictated
             -- p3_(») --,+p--l
             [ I (j (    ry ,  1R(p) =0,                                  (52)    by the Gaussian factor in the radial function R~(p), and is
                                                                                  illustrated in Fig. I. Unlike the case of wave functions
              ,,' (jp
                  (jp   p'
                                                                                   representing the Poincare group, the hyperbolic localization
and
                                                                                  region is independent of the hadronic velocity and is thus
                (L2 - K2)B(n,lI,</>1 = ryB(",1I,</».                      (53)    Lorentz invariant. We can of course carry out the mathe-
                                                                                  matics of the operators L, and K, applied to the wave
   In order that the radial equation have regular solu-
                                                                                  functions given in Eq. (63). However, it is not yet clear
tions,
                                                                                  whether this wave function carries any physical interpre-
                  ry = n(n       +   I). n = 0,' ,2,.                     (54)    tation.)
896     Am. J. Phys., Vol. 47. No. 10, OClober 1979                                                                            Kim.   NOI,   and Oh    896
154                                                                                                                   CHAPTER II
 897   Am . .I. Phys .• Vol. 47. No. 10. October 1979                                                       Kim, Noz. and Oh       R97
Chapter III
                                         155
THE TIME-ENERGY UNCERTAINTY RELA TION                                               157
244 P. A. M. Dirac.
When there is interaction between the field and the atom, it could be taken into
account on the classical theory by the addition of an interaction term to the
Hamiltonian (1), which would be a function of the variables of the atom and of
the variables En eT that describe the field. This interaction term would give
the effect of the radiation on the atom, and also the reaction of the atom on the
radiation field.
   In order that an analogous method may be used on the quantum theory,
it is necessary to assume that the variables E r, er are q-numbers satisfying
the standard quantum conditions erE, - Erer = ih, etc., where h is (21t')-1
times the usual Planck's constant, like the other dynamical variables of the
problem. This assumption immediately gives light-quantum properties to
THE TIME-ENERGY UNCERTAINTY RELATION                                                                159
   • Similar assumptions have been used by Bom a.nd Jordan [' Z. f. Physik,' vol. 34,
p. 886 (1925» for the purpose of taking over the classical formula. for the emission of radiation
by a dipole into the quantum theory, and by Bom, Heisenberg and Jordan [' Z. f. Physik,'
voL 35, p. 606 (1925» for calculating the energy fluctuations in a field of bla.ck-body
radiation.
   t ' Roy. Soc. Proc.,' A, voL 112, p. 661, § 5 (1926). This is quoted later by, loco cit., I.
   t ' Roy. Soc. Proc.,' A, voL 113, p. 621 (1927). This is quoted later by loco cit., II. An
6118entially equivalent theory has been obtained independently by Jordan [' Z. f. Physik.'
vol. 40, p. 809 (1927)]. See also. F. London •. Z. f. Physik,' vol. 40. p. 193 (1926).
160                                                                                   CHAPTERlll
246                                   P. A. M. Dirac.
a q-number, capable of being represented by a generalised" matrix" according
to many different matrix schemes, some of which may have continuous ranges
of rows and columns, and may require the matrix elements to involve certain
kinds of infinities (of the type given by the () functions·). A matrix scheme can
be found in which any desired set of constants of integration of the dynamical
system that commute are represented by diagonal matrices, or in which a set of
variables that commute are represented by matrices that are diagonal at a
specified time. t The values of the diagonal elements of a diagonal matrix
representing any q-number are the characteristic values of that q-number. A
Cartesian co-ordinate or momentum will in general have all characteristic values
from - 00 to + 00 , while an action variable has only a discrete set of character-
istic values. (We shall make it a rule to use unprimed letters to denote the
dynamical variables or q-numbers, and the same letters primed or multiply
primed, to denote their characteristic values. Transformation functions or eigen-
functions are functions of the characteristic values and not of the q-numbers
themselves, so they should always be written in terms of primed variables.)
   Iff(~, '1) is any function of the canonical variables ~b ''lk> the matrix repre-
sentingf at any time t in the matrix scheme in which the ~I: at time t are diagonal
 matrices may be written down without any trouble, since the matrices repre-
senting the ~I: and '1)1: themselves at time t are known, namely,
                                                                                           I
                                                                                           f.(2)
                                   ~k (r:'~") = ~k' () (~'r:"),
''l1(~'~'') =   -ih 0(~1' -~l") ... O(~k-l' -~k-l") 0' (~k' -~I:") 0 (~+l' -~l:+l") .. '
Thus if the Hamiltonian H is given as a function of the ~k and 'Y)b we can at
once write down the matrix H(~' ~"). We can then obtain the transformation
function, (~' /a.') say, which transforms to a matrix scheme (a.) in which the
Hamiltonian is a diagonal matrix, as (~' /«') must satisfy the integral equation
of which the characteristic values W(<<') are the energy levels. This equation
is just SchrOdinger's wave equation for the eigenfunctions (~' /«'), which becomes
an ordinary differential equation when H is a simple algebraic function of the
~"and lJk on     accoWlt of the special equations (2) for the matrices representing
~I:   and "'II:' Equation (3) may be written in the more general form
in which it can be applied to systems for which the Hamiltonian involves the
time explicitly.
    One may have a dynamical system specified by a Hamiltonian H which
cannot be expressed as an algebraic function of any set of canonical variables,
but which can all the same be represented by a matrix H(~'~"). Such a problem
can still be solved by the present method, since one can still use equation (3)
to obtain the energy levels and eigenfunctions. We shall find that the Hamilto-
nian which describes the interaction of a light-quantum and an atomic system is
of this more general type, so that the interaction can be treated mathematically,
although one cannot talk about an interaction potential energy in the usual
sense.
    It should be observed that there is a difference between a light-wave and the
de Broglie or SchrOdinger wave associated with the light-quanta. Firstly, the
light-wave is always real, while the de Broglie wave associated with a light-
quantum moving in a definite direction must be taken to involve an imaginary
exponential. A more important difference is that their intensities are to be
interpreted in different ways. The number of light-quanta per Wlit volume
associated with a monochromatic light-wave equals the energy per unit volume
of the wave divided by the energy (j7th)v of a single light-quantum. On the
other hand a monochromatic de Broglie wave of amplitude a (multiplied into
 the imaginary exponential factor) must be interpreted as' representing a 2 light-
quanta per unit volume for all frequencies. This is a special case of the general
rule for interpreting the matrix analysis, * according to which, if (~' fa') or
 ~.' (~,,') is the eigenfunction in the variables ~k of the state a' of an atomic
 system (or simple particle), I ~A' (~k')11 is the probability of each ~I: having the
 value ~k" [or I ~... (~') II d~l' d;2' .. , is the probability of each ~" lying between
 the values ~k' and ~k' + d;,,', when the ~k have continuous ranges of character-
 istic values] on the assumption that all phases of the system are equally probable.
  The wave whose intensity is to be interpreted in the first of these two ways
  appears in the theory only when one is dealing with an assembly pf the associated
  particles satisfying the Einstein-Bose statistics. There is thtis no such wave
  associated with electrons.
                                 • Loc. cit•• II, §§ 6, 7.
162                                                                                CHAPTERll
248 P. A. M. Dirac.
This transformation makes the new variables N, and                +, real, N, being equal
to apr· = I a,. the probable number of systems in the state r, and
                12,                                                                 +,1"
being the phase of the eigenfunction that represents them. The Hamiltonian
F 1 now becomes
                              F1_- ....
                                   6J" "
                                        V.N IN le,(+,-,,)/A
                                           r,               ,
and the equations that determine the rate at which transitions occur            hav~   the
canonical form
                              N     = _   aFl          .~
                                ,         a!fo,'      4>, = aN'
                                                              ,
   A more convenient way of putting the transition equations in the Hamiltonian
form may be obtained with the help 'of the quantities
W, being the energy of the state r. W~ have Ib, III equal to Ia, 12, the probable
number of systems in the state r. For b, we find
                            ihb,= W,b,    + iha,e-iW.l/     A
with the help of (4). If we put Yr. = v"ei(W,-w.) tilt, so that v" is a constant
when V does not involve the time explicitly, this reduces to
                               ih b, = W,b,        + r..Vrob.
                                      = ~.H,.,b"                                       (6)
where H,. = W, 8,. + v", which is a matrix element of the total Hamiltonian
H = Ho + V with the time factor ei(w.-w.)t/It removed, so that Hr. is a constant
when H does not involve the time explicitly. Equation (5) is of the same form
as equation (4), and may be put in the Hamiltonian form in the same way.
  It should be noticed that equation (5) is obtained directly if one writes down
the SchrOdinger equation in a set of variables that specify the stationary states
of the unperturbed system. If these variables are ~, and if H(~'~")denptes
   VOL. CXIV.-A.                                                                8
164                                                                            CHAPTERm
250                               P. A. M. Dirac.
a matrix element of the total Hamiltonian H in the (~) scheme, this
8chrOdinger equation would be
                          in OtV (~')fat = ~r' H (~'~.) tV (~"),                     (6)
like equation (3'). This difiers from the previous equation (5) only in the
notation, a single 'suffix r being there used to denote a stationary state instead
of a set of numerical values ~k' for the variables ~I:> and b, being used instead
of tV (~'). Equation (6), and therefore also equation (5), can still be used when
the Hamiltonian is of the more general type which cannot be expressed as an
algebraic function of a set of canonial variables, but can still be represented
by a matrix H (~'~") or H".
  We now take b, and in b,* to be canonically conjugate variables instead of
Or and ina,*.   The equation (5) and its conjugate imaginary equation will
now take the Hamiltonian form with the Hamiltonian function
                                  F=   ~,/J,. H,}J,.                                 (7)
to the new canonical variables N" 6" where N, is, as before, the probable
number of systems in the state r, and 6, is a new phase. The Hamiltonian F
will now become
                          F = 1:" H" N,l N,l ei(B,-',)/",
and the equations for the rates of change of N, and 6, will take the canonical
form
                              .        of        . aF
                             N,= - 06,'         0, = aN'
                                                       ,
  The Hamiltonian may be written
                        F=    ~,W,N,   + ~"v.. N,lN,le'(·'-'·)/1a.                   (9)
The first term 'f"W,N, is the total proper energy of the assembly, and the
second may be regarded as the additional energy due to the perturbation. If
the perturbation is zero, the phases 6, would increase linearly with the time,
while the previous phases "" would in this case be constants.
The transformation equations (8) must now be written in the quantum form
   t See § 8 of the author'8 paper , Roy. 800. Proo.,' A, vol. 111, p. 281 (1926). What are
 there called the c-number values that a q-number can take are here given the more precise
 D&IlUI of the characteristic values of that q-number.
                                                                                      8   2
166                                                                                                                   CHAPTERll
252                                                 P. A. M. Dirac.
  The Hamiltonian (7) now becomes
                             F =   "J:.,p,*H,p, =       "J:.,,N,tei8,/AH,,(N.       + l)ie-;S./h
                               = "J:.,.II,.,N,i (N,      +1-       8..). ei (S, -   9,)/h                               (11)
in which the H" are still c-numbers.                          We may write this F in the form corre-
8ponding to (9)
                                                                                                                        (11')
ih ~ tjI (Nt', N 2', N,' ... ) = FtjI (N1', N 2', Na' ... ), (12)
            =   l:,. H,.N,'i (N,'        + 1-:- 8,,)1 tjI (N1', N 2' ... N,' -1, ... N,' + 1, ... ).                     (13)
We see from the right-hand side of this equation that in the matrix repre-
senting F, the term in F involving eil90 - ',)fA will contribute only to
those matrix elements that refer to transitions in which N, decreases
by unity and N, increases by unity, i.e., to matrix elements of the type
F (N I ', N 2' ... N,' ... N,'; N l ', Nt' ... N,' - 1 ... N,' + 1 ... ). If we find a
solution ~(Nt', N 2' ... ) of equation (13) that is normalised [i.e., one for which
 "J:.N ." N.· ... I tjI (N1', N z' ... ) 12 = 1] and that satisfies the proper initial con-
 ditions, then 1tjI (N1', N 2' ... ) 12 will be the probability of that distribution in
 which Nt' systems are in state 1, N z' in state 2, ... at any time.
      Consider first the case when there is only one system in the assembly. The
  probability of its being in the state q is determined by the eigenfunction
   t We are supposing for definiteness that the label r of the stationary states takes the
 values I, 2, 3, ....
                                                              +
   t Whens = r, '" (N1', N 2 ' ••• N.' - 1. .. N: 1) is to be taken to mean'" (N1'N2 ' ... N: ... ).
TIlE TIME-ENERGY UNCERTAINTY RELATION                                                               167
q,(Nt ', N 2', ••• ) in which all the N"s are put equal to zero except N,,', which is
put equal to unity. .This eigenfunction we shall denote by 4{q}. When it ,.
substituted in the left-hand side of (13), all the terms in the summation on
the right-hand side vanish except those for which, = q, and we are left with
                               ih ; 4{q}     =   ~.4{8},
which is the same equation as (5) with 4{q} playing the part of btl. This estab-
lishes the fact that the present theory is equivalent to that of the preceding
section when there is only one system in the assembly.
   Now take the general case of an arbitrary number of systems in the assembly,
and assume that they obey the Einstein-Bose statistical mechanics. This
requires that, in the ordinary treatment of the problem, only those e~gen
functions that are symmetrical between all the systems must be taken into
account, these eigenfunctions being by themselves sufficient to give a complete
quantum solution of the problem.t We shall now obtain the equation for the
rate of change of one of these symmetrical eigenfunctions, and show that it is
identical with equation (13).
   If we label each system with a number n, then the Hamiltonian for the
assembly will be HA. = L"H (n), where H (n) is the H of §2 (equal to Ho + V)
expressed in terms of the variables of the nth system. A stationary state of
the assembly is defined by the numbers,1>           '2.... '" ...
                                                      which are the labels of the
stationary states in which the separate systems lie. The Schr5dinger equation
for the assembly in a set of variables that specify the stationary states will be
of the form (6) [with HA. instead of H), a.nd we can write it in the notation of
equation (5) thus :-
254                                     P. A. M. Dirac.
b('I'1 ... ) is also symmetrical at that time, 80 that b (r1'1 ... ) will remain
symmetrical.
  Let Nr denote the number of systems in the state ,. Then a stationary state
of the assembly describable by a symmetrical eigenfunction may be specified
by the numbers NI, Na ... N, ... just as well as by the numbers 'I' '2 ... '" ... ,
and we shall be able to transform equation (15) to the variables NI, Nz ... .
We cannot actually take the new eigenfunction b (NI' Na ... ) equal to the pre-
vious one b ('I'a ... ), but must take one to be a numerical multiple of the
other in order that each may be correctly normalised with respect to its
respective variables. We must have, in fact,
  If we make this substitution in equation (15), the left-hand side will become
ih (NI ! N2! ... (N !)l b (NI' Nz ... ). The term Hrm.mb (rlr2 ... rm-I Sm'm+l ... )
in the first summation on the right-hand side will become
where we have written r for rm and S for Sm. This term must be summed for
all values of S except" and must then be summed for, taking each of the values
'1' '2.... Thus each term (16) gets repeated by the summation process until
it occurs a total of Nr times, so that it contributes
 N,[N!! Nz ! .. , (N, - I)! ... (N.:t-1)! ... jN !]l H,,b (Nl' Nz ... N,-I ... N, + 1...)
              = Nri(N.+ l)i(NI! Nz ! ... jN !)lH,,b (N!, Nz ... N,-1 .. , N. + 1 ... )
to the right-hand side of (15). Finally, the term l::"H,,,,,,b('I' r 2 ... ) becomes
l::,N,lI".b(rl,z ... ) =l::,N.Hr,. (Nl! N2! ... (N !)i beN}> N z' ... J.
Hence equation (15) becomes, with the removal of the factor (Nl! N z L.. (N !)l.
which is identical with (13) [except for the fact that in (17) the primes have
been omitted from the N's, which is permissible when we do not require to refer
to the N's as q-numbers]. We have thus established that the Hamiltonian
(11) describes the effect of a perturbation on an assembly satisfying the Einstein-
Bose statistics.
which is the (J',"'; J"8.) matrix element (with the time factor removed) of
         +
H = Ho V, the proper energy plus the perturbation energy of a single
system of the assembly; while when every 8" equals f'", it has the value
               +
Hp (J') 8J 'J" L" H (J',,,; J",..). If, as before, we restrict the eigenfunctions
170                                                                                            CHAPTERll
256 P. A. M. Dirac.
§ 5. ,Thwry oj Transitions in a System Jrom One State to Others oj the Same Energy.
  Before applying the results of the preceding sections to light-quanta, we
shall consider the solution of the problem presented by a Hamiltonian of the
type (19). The essential feature of the problem is that it refers to a dynamical
system which can, under the influence of a perturbation energy which does
not involve the time explicitly, make transitions from one state to others of
the same energy. The problem of collisions between an atomic system and an
electron, which baa been treated by Born,· is a special case of this type. Born's
methOd is to find a periodic solution of the wave equation which consists, in
so far as it involves the co-ordinates of the colliding electron, of plane waves,
                        • Born, • Z. f. Phyaik,' vol. 38, p. 803 (1926).
THE TIME-ENERGY UNCERTAINTY RELA TION                                                                    171
representing the incident electron, approaching the atomic system, which are
scattered or diffracted in all directions. The square of the amplitude of the
waves scattered in any direction with any frequency is then assumed by Born
to be the probability of the electron being scattered in that direction with
the corresponding energy.
   This method does not appear to be capable of extension in any simple manner
to the general problem of systems that make transitions from one state to others
of the same energy. Also there is at present no very direct and certain way
of interpreting a periodic solution of a wave equation to apply to a non-periodic
physical phenomenon such as a collision. (The more definite method that
will now be given shows that Born's assumption is not quite right, it being
necessary to multiply the square of the amplitude by a certain factor.)
   An alternative method of solving a collision problem is to find a rwn-periodic
solution of the wave equation which consists initially simply of plane waves
moving over the whole of space in the necessary direction with the necessary
frequency to represent the incident electron. In course of time waves moving
in other directions must appear in order that the wave equation may remain
satisfied_ The probability of the electron being scattered in any direction with
any energy will then be determine4l by the rate of growth of the corresponding
harmonic component of these waves. The way the mathematics is to be
interpreted is by this method quite definite, being the same as that of the
beginning of § 2_
   We shall apply this method to the general problem of a system which makes
transitions from one state to others of the same energy under the action of a
perturbation. Let Ho be the Hamiltonian of the unperturbed system and
V the perturbing energy, which must not involve the time explicitly. If we
take the case of a continuous range of stationary states, specified by the first
integrals, IX" say, of the unperturbed motion, then, following the method of
§2, we obtain
                                                                                                (21)
258 P. A. M. Dirac.
we may substitute for a (<<") in the right-hand side of (21) its initial value. This
gIves
              ih a(at') = aOY (at'«O) = at0V («'<<0) ei(W (a')-W (a")] 1/,\
where v(«'<<O) is a constant and W(<<') is the energy of the state IX'.            Hence
                         .                             ei(W(.')-W(.O)U/A - 1
                                            +
           ih a (<<') =;: aO 8 (<<' - «0) aOv (at'«O) i [W (<<') _ W (<<O)]/h·        (22)
For values of the «k' such that W (at') differs appreciably from W (<<0), a (IX')
is a periodic function of the time whose amplitude is small when the perturbing
energy V is small, so that the eigenfunctions corresponding to these stationary
states are not excited to any appreciable extent. On the other hand, for values
of the «/c' such that W (<<') = W (<<0) and atk' ~ «k° for some le, a (IX') increases
uniformly with respect to the time, so that the probability of the system being
in the state IX' at any time increases proportionally with the square of the time.
Physically, the probability of the system being in a state with exactly the same
proper energy as the initial proper energy W (at°) is of no importance, being
infinitesimal. We are interested only in the integral of the probability
through a small range of proper energy values about the initial proper energy,
which, as we shall find, increases linearly with the time, in agreement with the
ordinary probability laws.
   We transform from the variables «I' at2 ... 0Cu to a set of variables that are
arbitrary independent functions of the «'s such that one of them is the proper
energy W, say, the variables W, YI' Y2' ... Y,-I. The probability at any time
of the system lying in a stationary state for which each Yk lies between Yt' and
Yk'   + dyJr.' is now (apart from the normalising factor) equal to
                  dYl' . dy,,' ... dYv-l'
                                 .
                                            r      a
                                     I a (at') 12 ~;,I" «,2' ... «v'~) dW'.
                                                      , YI ... Yv-I
                                                                                    (23)
For a time that i~ large compared with the periods of the system we shall find
that practically the whole of the integral in (23) is contributed by values of
W' very close to WO = W (<<0). Put
   a (<<') = a (W', Y') and          a
                              (<<I', at2' ... «v')/o (W', YI' ... Yv-I') = J (W', y').
Then for the integral in (23) we find, with the help of (22) (provided Yt' ~ Ykf)
for some k)
JI    a (W', Y') 12   J (W', Y') dW'
              f
  ="1 aO 12 IV (W', y'; WO, yO) 12 J (W', Y')
                                                  [e
                                                    i(W'-WO)t/A    1] [-i(W'-WO)t/A 1]
                                                                  ~,:.wo)"         -   aw'
  =             J
        21ao 12 I v (W',y'; WO,yO)\IJ(W',y')[I-cos(W'- W~tlh]/(W'- WO)".dW'
   = 21 aO I" t/h.       fIv (WO + hx/t, y'; WO, yO) IIJ (WO +hx/t, Y') {I-cos x)/zl. ax.
THE TIME-ENERGY UNCERTAINTY RELA TION                                                                             173
if one makes the substitution (W' - WO)tlh = x.                          For large values of t this. ....
reduces to
            21ao l'l.tlh.1 v (WO, y'; WO,          l) \2 J (WO, y') [         .. (I-COS x)/x2. dx
Thus the J (WO, y') of the expression (24) has the value
                                      J (WO, y') = E'P' sin 6' 1c2,                                        (25)
where E' and P' refer to that value for the energy of the scattered electron which
makes the total energy equal the initial energy WO (i.e" to that value required
by the conservation of energy).
  We must now interpret the initial value of a (ex'), namely, aO a(oc' - exO),
which we did not normalise. According to § 2 the wave function in terms of the
variables exk is b (oc')= a (ex') e-iW'/Ih, so that its initial value is
       aO   a(oc' -   exO) e-iW'ljh = aO 8(p",' - PzO) a(Pv' - PliO)            a(po' _    p.O)e-iW'/,h.
If we use the transformation function·
                                      (x'ip') = (2rch)-3/2 eiI.,.P.':r:'J",
and the transformation rule
we obtain for the initial wave function in the co-ordinates x, .y, z the value
                                      aO   (2rch)-3/2 eiI."ph'/h e-iW'I/h.
                        • The symbol x is used for brevity to denote x, y, z.
174                                                                               CHAPTERll
260                                P. A. M. Dirac.
This corresponds to an initial distribution of lao \II (27th)-3 electrons per unit
volume. Since their velocity is J>Oc2/Eo, the number per unit time striking a
unit surface at right-angles to their direction of motion is lao \2 J>Oc2 j(27th)3 EO.
Dividing this into the expression (24) we obtain, with the help of (25),
                               E'Eo                 P'
                  4~ (27th)2   7    1v (p';   pO) \II po sin 6' d6' dep'.             (26)
   This is the effective area that must be hit by an electron in order that it shall
be scattered in the solid angle sin 0' dO' dep' with the energy E'. This result
differs by the factor (21th)2j2mE' . P' fPo from Born's. * The necessity for the
factor P' fPo in (26) could have been predicted from the principle of detailed
balancing, as the factor 1v (p'; pO) 12 is symmetrical between the direct and
reverse processes.t
                          § 6. Application to Light-Quanta.
   We shall now apply the theory of §4 to the case when the systems of the
assembly are light-quanta, the theory being applicable to this case since light-
quanta obey the Einstein-Bose statistics and have no mutual interaction. A
light-quantum is in a stationary state when it is moving with constant momen-
tum in-a straight line. Thus a stationary state r is fixed by the three com-
ponents of momentum of the light-quantum and a variable that specifies its
state of polarisation. We shall work on the assumption that there are a finite
number of these stationary states, lying very close to one another, as it would
be inconvenient to use continuous ranges. The interaction of the light-quanta
with an, atomic system will be described by a Hamiltonian of the form (20),
in which Hp (J) is the Hamiltonian for the atomic system alone, and the
coefficients v,. are for the present unknown. We shall show that this form
for the Hamiltonian, with the V r• arbitrary, leads to Einstein's laws for the
emission and absorption of radiation.
   The light-quantum has the peculiarity that it apparently ceases to exist
 when it is in one of its stationary states, namely, the zero state, in which its
 momentum, and therefore also its energy, are zero. When a light-quantum
is absorbed it can be considered to jump into this zero state, and when one is
 emitted it can be considered to jump from the zero state to one in which it is
  *In a more recent paper (' Na.ohr. Gesell. d. Wisa.,' Gottingen, p. 146 (1926» Bom has
obtained a result in agreement with that of the present paper for non.relativity mechanics,
by using an interpretation of the analysis based on the oon.eervation theorems. I am
indebted to Prof. N. Bohr for seeing an advance copy of this work.
  t See Klein and Roeseland, ' Z. f. Physik,' vol. 4, p. (6, equation (4) (1921).
THE TIME-ENERGY UNCERTAINTY RELATION                                                  175
262                               P. A. M. Dirac.
range v, to v, + dv, and whose direction of motion lies in the solid angle dw,
about the direction of motion for state r will now be Av,2dv,dw,/c'. The energy
of the light-quanta in these stationary states is thus N,' . 21th'll,. Av,'-dv,dw,/c'.
This must equal Ac-1I,dv,dw" where I, is the intensity per unit frequency
range of the radiation about the state 'T. Hence
                                  I, = N,' (21th) '11,3 jc2,                          (28)
                                               +                           +
so that .N,' is proportional to I, and (N,' 1) is proportional to I, (21th)v.3 /cS.
We thus obtain that the probability of an absorption process is proportional to
I" the incident intensity per unit frequency range, and that of an emission
process is proportional to I, + (21th)v,3/c2, which are just Einstein's laws. *
In the same ~ay the probability of a process in which a light-quantum is scattered
                                                           +
from a state r to a state s is proportional to I, [I, (21th)v,3/cS], which is Pauli's
law for the scattering of radiation by an electron.t
area per unit time for the component r is t1tC-1 a,"v,'-. Hence the intensity
  • The ratio of stimulated to spontaneous emission in the present theory is just twice its
value in Einstein's. This is because in the present theory either polarised oomponent of
the incident radiation can stimulate only radiation poIa.rised in the Bame way, while in
Einstein's the two polarised oomponents are treated together. This remark applies al80
to the scattering Pl'OC6ll8.
  t Pauli, , Z. f. Physik: vol. 18, p. 272 (1923).
THE TIME-ENERGY UNCERTAINTY RELATION                                                              177
per unit frequency range of the radiation in the neighbourhood of the com·
ponent l' is I, = 17tc-1 a,"v,"a,. Comparing this with equation (28), we obtain
4, = 2 (hv,/ca,)IN,i, and hence
  The Hamiltonian for the whole system of atom plus radiation would now be,
according to the classical theory,
         F = Hp (J) +~, (27th v,) N, + 2c-1 t, (hv,/ca,)l X,N",l cos 8,/h, (29)
where Hp (J) is the Hamiltonian for the atom alone. On the quantum theory
we must make the variables N, and 8, canonical q-numbers like the variables
J", 'Wt that describe the atom. We must now replace the N,l cos 8,/h in (29)
by the real q-number
             1{N,l eilJr/h + e- iBr/1a N,l} = I {N,l eilJr/h + (N, + 1)1 e- ilJr/h}
so that the Hamiltonian (29) becomes
  F = Hp (J) + :E, (27th v,) N,+ hi c-;:E, (v,/a,)~ :X, {N,I eilJr /h + (N, + 1)1 e- ilJr/Ia }.
                                                                                       (30)
This is of the form (27), with
                            tJ, = tJ,* = hi c-; (v,/a,)1 X,                            (31)
and                             tJ" = 0     (1', S ~ 0).
The wave point of view is thus consistent with the light-quantum point of view
and gives values for the unknown interaction coefficient tJ" in the light-
quantum theory. These values are not such as would enable one to express
the interaction energy as an algebraic function of canonical variables. Since
the wave theory gives v" = 0 for 1', 8 ¢ 0, it would seem to show that there are
no direct scattering processes, but this may be due to an incompleteness in
the present wave theory.
   We shall now show that the Hamiltonian (30) leads to the correct expressions
for Einstein's A's and B's. We must first modify slightly the analysis of §5
ao as to apply to the case when the system has a large number of discrete station-
ary states instead of a continuous range. Instead of equation (21) we shall
now have
                           ih Ii (<<') = ~." V («'<</F) a (<</F).
If the system is initially in the state «0, we must take the initial value of a (<<')
to be 3....0, which is now correctly normalised. This gives for a first approxi.
mation
which leads to
178                                                                             CHAPTERll
264                                P. A. M. Dirac.
corresponding to (22). If, as before, we transform to the variables W, Y1'
Y2 ... Y_-1> we obtain (when Y' ~ y~
             a (W'y') = v (W', y'; WO, yO) [l_e,<w'-wO)I/h]/(W' - WO).
The probability of the system being in a state for which each YIt. equals Yt'
is ~w' Ia (W' y')\I, If the stationary states lie close together and if the time t
is not too great, we can replace this sum by the integral (~W)-lJ la (W'y/) \2 dW',
where ~W is the separation between the energy levels. Evaluating this integral
as before, we obtain for the probability per unit time of a transition to a state
for which each Ylt = Yk'
                          27t/h~W   . I v·(WO, y'; Wo, yO) 12.                     (32)
In applying this result we can take the Y's to be any set of variables that are
independent of the total proper energy Wand that together with W define
a stationary state.
   We now return to the problem defined by the Hamiltonian (30) and consider
an absorption process in which the atom jumps from the state JO to the state
J' with the absorption of a light-quantum from state r. We take the variable8
y' to be the variables J' of the atom together with variables that define the
direction of motion and state of polarisation of the absorbed quantum, but
not its energy. The matrix element v (WO, y'; WO,..,o) is now
                            h1!2c- 3/2 (v,/a,)l 12 X, (JoJ')N/,
where:X,. (JoJ') is the ordinary (JoJ') matrix element of i,.       Hence from (32) the
probability per unit .time of the absorption process is
                              ~ hv, I X· (JoJ') I IN 0
                               W . ff'a,'
                              h~                    ,.
To obtain the probability for the process when the light-quantum comes from
any direction in a solid angle dw, we must multiply this expression by the number
of possible directions for the light-quantum in the solid angle dw, which is
dw    a,~W/21th.   This gives
 I would like to express my thanks to Prof. Niels Bohr for his interest in this
work and for much friendly discussion about it.
                                   Summary.
  The problem is treated of an assembly of similar systems satisfying the
Einstein-Bose statistical mechanics, which interact with another different
system, a Hamiltonian function being obtained to describe the motion. The
theory is applied to the interaction of an assembly of light-quanta with an
ordinary atom, and it is shown that it gives Einstein's laws for the emission
and absorption of radiation.
  The interaction of an atom with electromagnetic waves is then considered,
and it is shown that if one takes the energies and phases of the waves to be
q-numbers satisfying the proper quantum conditions instead of c-numbers,
the Hamiltonian function takes the same form as in the light-quantum treat-
ment. The theory leads to the correct expressions for Einstein's A's and B's.
  VOL. CXIV.-A.                                                         T
180                                                                               CHAPTER III
710 P. A. M. Dirac.
   • In loco cit., § 6. it was in error assumed that V..... caused transitions from state m to
state., and consequently the information there obtained about an absorption (or emission)
process in terms of the number of light-quanta existing before the process should really
apply to an emission (or absorption) process in terms of the number of light-quanta in exist.
ence after the process. This change, of course, does not affect the results (namely the
proof of Einstein's laws) which can depend on 1v,"" I t = IV ..... I".
 ot Loc. cit., equation (4). In the present paper h is taken to mean just Planck's constant
[instead of (2".)-1 times this qua.ntity as in loco cit.] which is preferable when one has to deal
much with quanta. hv of radiation.
182                                                                               CHAPTERm
712                                P. A. M. Dirac.
total proper energy of. the state m. To solve these equations one obtains a
first approximation by substituting for the a's on the right-hand side their
initial values, a second approximation by substituting for these a's their values
given by the first approximation, and so on. One or two such approximations
will usually be sufficient to give a solution that is fairly accurate for times
that are small compared with the life time, but may all the same be large
compared with thelperiods of the atom. From the first approximation, namely,
            a", = am.   + l:"v",,,a,,,, (1 -   e2..i (W. - W.II/")/(W", - W,,),      (2)
where afll) denotes the initial value of a", one sees readily that when two states
m and n have appreciably different proper energies, the amplitude a", gets
changed only by a small extent, varying periodically with the time, on account
of transitions from state n. Only when twostates, m and m' say, have the same
energy does the amplitude a", of one of them grow continually at the expense
of that of the 9ther, as is necessary for physically recognisable transitions to
occur, and the rate of growth is then proportional to V mm"
   The interaction term of the Hamiltonian function obtained in Zoe. cit. [equation
(30)] does not give rise to any direct scattering processes, in which a light-
quantum jumps from one state to another of the same frequency but different
direction of motion (i.e., the corresponding matrix element V m ",' = 0). All
the same, radiation that has apparently been scattered can appear by a double
process in which a third state, n say, with difierent proper energy from m and
m', plays a part. If initially all the a's vanish except am', then a.. gets excited
on accoUnt of transitions from state m' by an amount proportional to V".',
and although it must itself always remain small, a calculation shows that it
will cause am to grow continually with the time at a rate proportional to
V",..V"","  The scattered radiation thus appears as the result of the two processes
m' -+ nand n -+ m,. one of which must be an absorption and the other an
emission, in neither of which is the total proper energy even approximately
conserved.
   The more accurate expression for the interaction energy obtained in § 3
of the present paper does give rise to direct scattering processes, whose eftect
is of the same order of magnitude as that of the double processes, and must
be added to it. The sum of the two will· be found to give just Kramers' and
Heisenberg's dispersion formula* when the incident frequency does not coincide
with that of an absorption or emission line of the atom. If, however, the
incident frequency coincides with that of, say, an absorption line, one of the
            • Kramera and Heisenberg, C Z. f. Phyaik,' vol. 31, p. 681 (1925).
THE TIME-ENERGY UNCERTAINTY RELATION                                                   183
714 P. A. M. Dirac.
                              § 2. Prelimirw.ry Formula!.
   We consider tJie electromagnetic field to be resolved into its components.
of plane, plane-polarised, progressive waves, each component r having a definite
frequency, direction of motion and state of polarisation, and being associated.
with a certain type of light-quanta. (To save writing we shall in future suppose
the words " directipn of motion" applied to a light-quantum or a component
                   I
of the field to imply also its state of polarisation, and a sum or integral taken
over all directions of motion to imply also the summation over both states of
polarisation for each direction of motion. This is convenient because the two
variables, direction of motion and state of polarisation, are alw!l-Ys treated
mathematically in the same way.) For an electromagnetic field of infinite
extent there will be a continuous three-dimensional range of these components.
As this would be inconvenient to deal with mathematically, we suppose it to be
replaced by a large number of discrete components. If there are a, components
per unit solid angle of direction of motion per unit frequency range, we can
keep a, an arbitrary function of the frequency and direction of motion of the
component r, provided it is large and reasonably continuous, and shall find
that it always cancels from the final results of a calculation, which fact appears
to justify our replacement of the continuous range by the discrete set.
   We can express a, in the form a, = (Av, AN,}-l, where Av, can be regarded
as the frequency interval between successive components in the neighbourhood
of the component r, and AN, is in the same way the solid angle of direction
of motion to be associated with this component. The quantities Av" AN, enable
one to pass directly from sums to integrals. Thus if f, is any function of the
frequency and direction of motion of the component r that varies only slightly
frOnl on~ component to a. neighbouring one, the sum of f, Av, ior all components.
having a specified direction of motion is
(3)
and the sum of f, AN, for all components having a specified frequency is
(3')·
716                                         P. A. M. Dirac.
If now there is a perturbing field of radiation, given by the magnetic vector
potential ICc, "., K. chosen so that the electric scalar potential is zero, the
Hamiltonian equation for the perturbed system will be
where CXzr is the angle between the electric vector. of the component r and the
z-axis. In the quantum theory, where the variables N" 0, are q-numbers,
the expression 2N,I cos 2rtO,Ih must be replaced by the real q-number
                +        +
N,le2ri1,/A (N, 1)1 e- 2friI./A• With this change one can take over the
TIlE TIME-ENERGY UNCERTAINTY RELA TION                                                                            187
where x, denotes the component of the vector (x, y, z) in the direction of the
electric vector of the component r, i.e.,
and IX,. denotes the angle between the electric vectors of the components r
and s, i.e.
   The terms in the first line of (13) are just those obtained in loco cit., equation
(30), and give rise only to emission and absorption processes. The remaining
terms (i.e., those in the double summation) were neglected in loco cit. These
terms may be divided into three sets :-
  (i) Those terms that are independent of the 6's, which can be added to the
proper energy Ho + };N,hv,. The sum of all such terms, which can arise
only when r = s, is
    e2h/4rtmc3   • };, v,/a,   . [N,le2lfiB,/h (N, + 1)1 e-Zlfi/l,/h
                                        + (N, + 1)te-        2 ui/"/1I   N;e2ri8,fI\)
                                                            = e2 },/4rtmc3        •   };,v,/a,. (2N,   + 1).
The terms e2h/4rtmc. "'i.v,/a, . 2N, are negligible compared with };N,hv"
owing to the very large quantity a, in the denominator, while the terms
e2h/4~. };v,/~, may be ignored since they do not involve any of the
dynamical variables, in spite of the fact that the smn 'J:..v,!a" equal to
Jv, dv, dw, from (3"), does not converge for the high frequencies.
  (ii) The terms containing a factor of the form cZri (8,- •• ).'li (r                  ~   s), whose sum. is
  e2h/4rtmc3 };,};."" cos IX,. (v, v./ap,)l [N,l (N, + 1)1 e2..i (B,-B.)/h
                                                               + (N,+ 1)1 N,l e- 2..i (B,-B.)/h]
         = e2hl2rtm~ "'i.,};.~, cos 0(" (v,v./a,rJ,)l N,l (N,+ 1)1 e2lfi(6,-B.)/h.                        (14)
These terms, which are the only important ones in the three sets, give rise to
transitions in which a light-quantum jumps directly from a state s to a state r_
                                                                3 D 2
188                                                                                                CHAPTERID
718                                   P. A. M. Dirac.
Such transitions may be called true seattering processes, to distinguish them
from the double scattering processes described in § l.
   (iii) The remaining temls, each of which involves a factor of one or other of
the forms e±4>ri8,''', e±2..i (8. +8.)'1,. These terms correspond to processes in
which two light-quanta are emitted or absorbed simultaneously, and cannot
arise in a light-qua~tum theory in which there are no forces between the light
quanta. The effects of these terms will be found to be negligible, 80 that the
disagreement with the light-quantum theory is not serious.
Thus
            WIll - Wk    =   Ho (J")    + hv, -      Ho(J')    =    h [v, - v (J' J")]
where v (J' J") = [Ho (J') - Ho (J")]/h is the transition frequency between
states J' and J", if one assumes J' to be the higher one. Hence from (15)
   The int.egral does not converge for the high frequencies. This is due, as
mentioned in § 1, to the non-legitimacy of taking only the dipole action of the
atom into account, which is what one does when one substitutes for the magnetic
potential in (10) its value given by (12), which is its value at some fixed point
such as the nucleus instead of its value where the electron is momentarily
situated. To obtain the interaction energy exactly, one should put cOs 27t
[6,/h - vr;r/c] instead of cos 2rt6r/h in (11), where ;r is the component of the
vector (x, y, z) in the direction of motion of the component r of radiation.
This will make no appreciable change for low frequencies Vr , but will cause a
llew factor cos 2rtvr;r/c or sin 2rtvr~r/c, whose matrix elements tend to zero
as Vr tends to infinity, to appear in the coefficients of (13). This will presumably
cause the integral in (16) to ~onverge when corrected, as its divergence when
uncorrected is only logarithmic.
   Assuming that the integrand in (16) has been suitably modified in the high
frequencies, one sees that for values of t large compared with the periods of the
atom (but small compared with the life time in order that the approximations
may be valid) practically the whole of the integral is contributed by values of v,
close to v (J' J"), which means physically that only radiation close to a transitio?
frequency can be spontaneously emitted. One fmds readily for the total
probability of .the emission, by performing t.he integration,
which leads to the correct value for Einstein's A coefficient per unit solid angle,
namely,
   We shall now determine the rate at which true scattering processes occur,
caused by the terms (14) in the Hamiltonian. We see at once that the frequency
of occurrence of these processes is independent of the nature of the atom,
and is thus the same for a bound as for a free electron. The true scattering
is the only kind of scattering that can occur for a free electron, so that we should
expect the terms (14) to lead to the correct formula for the scattering of radiation
by a free electron, with neglect of relativity mechanics and thus of the Compton
effect.
   Suppose that initially the atom is in the state J' and all the N's vanish except
one of them, N, say, which has the value Y.,'_ We label this state for the whole
system by k, and the state for which J = J' and N. = N,' - 1, Nr = 1 with
190                                                                                               CHAPTERm
720 P. A. M. Dirac.
all the other N's zero by m. In the first approximation a", is again
                                                                 . given by.
(15), where we now have
                        11",1:   = e2hJ2rcmc3 • cos 0:" (vrv.!O',a.)~ N.'~,                        (17)
       ~ ~ \a", 12 ~ ~ e4 2  ~ 1\."- ,
           - - - ow,~ cos IX" n,
       .,w~v
                                                              Jd
                                                              V,   V,
                                                                        1 - cos 21t (v; - v,) t
                                                                                          2.       (20)
               /J.w,              21t m c            cr,                      (V, - V,)
We again obtain a divergent integral, of the same form as before, which we may
assume becomes convergent in the more exact theory. We now have that
practically the whole of the integral is contributed by values of Vr close to v,
and the total probability for the scattering process is
from (6); where I is the' rate of flow of incident energy per unit area. The rate
of emission of scattered energy per unit solid angle is thus
where 0:" is the angle between the electric vectors of the incident and scattered
radiation, which is the correct classical formula.
   • The reason why tHere is 0. small probability for the scattered frequency II, ditJeriDg
by a finite amount from the incident frequency ". is because we are considering the scattered
radiation, after the scattering process has been acting for only a finite time I, resolved into
its Fourier components. One sees from the formula (20) that 88 the time' gets greater.
the scattered radiation gets more and more nearly monochromatic with the frequency •••
n one obtained a periodic solution of the SchrOdinger equation corresponding to permaneDt
physical conditiona, one would then find that the scattered frequency was exactly equal
to the incident frequency.
THE TIME-ENERGY UNCERTAINTY RELATION                                                                                                191
                                         § 5. Throry of Dispersion.
   We shall now work out the second approximation to the solution of equations
(1), taking the case when the system is initially in the state k, so that the fiplt
approximation, given by (2) with a"" = 0",,, reduces to
                       am   = Omk + vmd1 - e2•• (w* -                     w.)I/")/(W", -             W,J.
When one substitutes these values for the aA's in the right-hand side of (1).
one obtains
ih/2rc . tim =          e
                 tlmt 2.' (W. -         w.) I,'h
                      + L"Vm"tlnk (1 -               e2wi (W.-   W.)ll 1a ) e2 "';(w.- W.)II"           I (W" - Wk )
            _    (.         _   ~       V m.. Vn'·    ') 2".i(W. -W,) 1:"+· ~                    tiM" tiM   2"'; (W.-\\".)II"
            -         Vmk       .<J"   W" _ Wb e                                       .<J ..   W" _ Wk e                       ,
and hence when m ¢- k
            _    (                      V m"   v.. ,. ) 1 - e2wi (W. - w.)I.'Ia
        am -          Vmk -     ~"W _ W .                    W - W
                  \                      "k,                      m                k
                                                                      V        V           1 - e2,..;{w.-·W.)I/1I
                                                        + L" W" -         mA   "k
                                                                                   Wk            W'" - WA
                                                                                                                   •      (21)
 We may suppose the diagonal elements V"" of the perturbing energy to be zero.
since if they were not zero they could be included with the proper energy W".
There will then be no terms in (21) with vanishing denominators, provided all
the energy levels are different.
  Suppose now that the proper energy of the state m is equal to that of the
initial state k. Then the first term on the right-hand side of (21) ceases to be
periodic in the time, and becomes
722 P. A. M. Dirac.
may appear. One must now determine the total probability of the system
lying in anyone of these final states, which is
where 6.Wm iS'the interval between the energy levels. The second term in the
expression (21) for am may be neglected siuce it always remains small (except
in the case of resonance which will be considered later) and hence
If one assumes that the integral converges, so that for large values of t practically
the whole of it is contributed by values of Wm close to Wk> one obtains
                      l: Ia 12 =
                             m       kW
                                      6. m mk
                                             I
                                      47t2t v _ ~  V.."Vnl.: \2
                                                  "W-W'
                                                   "k
                                                                                                    (22)
where the quantities on the right refer to that final state that has exactly the
initial proper energy.
  We take the states k and m to be the sa.me as for the true sca.ttering process
considered in the preceding section, so that equations (17), (18) and (19) still
hold, and IlWIII = Allv, = AJa, 6.6>,. We can now take the state 11. to be either
the state J = J", N. = N,' - 1, N. = 0 (t =1= s) for any J", which would make
the process k-+ 11. an absorption of an s-quantum and 11.-+ m an emission of an
r-quantum, or the state J = J", N, = N,/, N, = 1, N, = 0 (t':;f! 8, f), which
would make k-+ 11. the emission and 11.-+ m the absorption. In the first case
we should'have
          _ e ( hv. \)1,. (J"J/) N 'I
            -c . -2-
                                                                 = ~(. hv,         )1 '. (J/J") ,
      V"k -
                  7tca. x.        •                      V .."       c 27tca, x,
and
      W" = Ho(J")          + (N,' -1) hv.         W" - W" = h[V(J"J/) - v,]·,
and in the second
          _ e (-nv,
       ,V"k--    -
                       )1., (J"J/)
                            iI"                          VIR..
                                                                 =   ~ ('2kv. ')1,.X. (J/J") N, '1,
              C   27tca,                                             C ,   7tca,
and
      W" = Ho (J")     + N.'hv, + hv,             w" -   WI.:    =   k[ v(J"J')        + v,l.
We shall neglect the other possible states 11., namely those for which the matrix
elements v"'", VIII: come from terms in the double summation in the Hamiltonian
                     • The frequency" (J"J') is not necess&Jily positive.
THE TIME-ENERGY UNCERTAINTY RELA TION                                                                                              193
  N't          e~v.21!
      , /lW r h2c6a, 1n cos OCr,
                                   _~ ,,{X,(J'J").;;,(J"J')
                                            J(J"J')
                                                            + X, (J'J") x, (J"J')} 12 (23}
                                                                   (J"J') +                '
                                                   'I           -      'Is             V                  V,
  The most convenient way of expressing this result is to find the amplitude
P (a vector) of the electric moment of that vibrating dipole of frequency V,
that would, according to the classical theory, emit the same distribution of
radiation as that actually scattered by the atom. The number of light-quanta
of the type r (with Vr = v,) emitted by the dipole P in time t per unit solid
angle is
where P, is the component of P in the direction of the electric vector of the light-
quanta 'T. Comparing this with (23) (which must first be divided by /lwr
to change it to the probability of a light quantum being scattered per unit
solid angle) one finds for Pr
   _ (81tN,')l I
              e2 h                               {i:, (J'(J"J')-
                                                          J") (J" J') ,v, (J' J") (J" J')} I        J:,
              1t
                                                                ,j',
P,- ~
    (I(;-v,a, 4
              2  -CQs(X'.-~J"
                 m                                      V
                                                                     +     (J"J')+
                                                                             V,.           V                     V,
   =   E~2-I_h_           _~ ,[ (J"J,)]:!Jxr(J'J")x,(J"J')
         h ~2 4~m
                2 cos X,.   J' V         l '(J"J')-    ~
                                                                                                 J')} I
                                            I
using (7), where E is the amplitude of the electric vector of the incident radiation.
  We can put this result in a different form by using the following relations,
which follow from the quantum conditions,
  ~J"    [x, (J' JII) x, (J" J') - x, (J' JII)          X,   (J" J')]        = [:t, x, -       x. x,] (J' J')          =0
                                                                                                                            (25)
and
  ~JII   [x, (J' JII) X. (J" J') - X, (J' 1") I, (1" J')] = [XT X. -                           x. x,) (J' J')
                                                                                   = ih,27t1n . cos       (Xm               (26)
which gives
  ~J"    [x, (J' JII)   X.   (J" J')   V   (J" J') + x. (J' JII) X T (J" J') '1(1" J')]
                                                                                    = hj41t2m . cos            'Y-".        (27)
194                                                                                              CHAPTER ill
724 P. A. M. Dirac.
where x without a suffix means the vector (x, y, z). This is identical with
Kramers' and Heisenberg's result.*
   In applying the formula (22), instead of taking the final state m of the system
to be one for which the atom is again in its initial state J = J', we can take a new
final state for the atom, J = J'" say. The frequency Vr for the scattered
radiation that gives no change of total proper energy is now
                        V, = v. - v (J"'J')   =   v.   + v (J"J''') -       v (J"J'),                (30)
which differs from the incident frequency V" so that we ebtain in this way
the non-coherent scattered radiation. (We assume that this V, is positive
as otherwise there would be no non-coherent scattered radiation associated
with the final state J = J'" of the atom.) In the present case we have Vml: = 0,
corresponding to the fact that the true scattering process does not contribute
to the non-coherent radiation. We now obtain for P" after a similar and almost
identical calculation to that leading to equation (24),
  This result can be put in the form corresponding to (29) with the help of
equations analogous to (25) and (26) referring to the non-diagonal (J"'J')
matrix elements of [x,x. - x,x,] and [xri. - :t,x,]. These equations give,
corresponding to (28),
~J" [x, (J"'J") x, (J"J') {v (J"J')        + vr} +x. (J"'J") x, (J"J') {v (J" Jill) -           v,}] =   o.
  • Kramers and Heisenberg, loco cit., equation (18). For previous quantum.theoref;ioal
deductions of the dispersion formula see Born, Heisenberg and Jordan, 'z. f. Physik,'
vol. 35, p. 557, Kap. 1, equation (40) (1926); SchrMinger, loco cit., § ·2, equation (23); and
Klein, loco cit., § 5, equation (82).
THE TIME-ENERGY UNCERTAINTY RELA TION                                                         195
When the left-hand side of this equation is subtracted from the summation in
{31) one obtains, on account of the relations
726 P. A. M. Dirac.
absorption frequency to the state of the atom J = J', say, then that intermediate
state of the system for which J = J' and for which the s-quantum has already
been absorbed will have very nearly the same proper energy as the initial state.
Calling this intermediate state l we have
W,-W,,=h('1o-v,) (33)
  '"
 fo
           1-      e2.n(v,-v.)1        {I -         e-:!lfi(v,-vu)1 _          1-    e-2 ..i<v,-v.)I} dv
       ('1,-'1,)('10-'1,)                        '1,-'10                             '1,-'1,                            ,
                   _ ') f 2.. (v, -              '10)  t - sin 2.. (v, -             '10)   t    + .1 -                cos 2.. (v, -    '10)    t)
                   - .o" l.                           (v, - '10)2                                       ~               ('I, - '10)2             I'
THE TIME-ENERGY UNCERTAINTY RELATION                                                                                                 197
~.,I an. I
                 2
                     =
                             I  Vm/.: -     Luil
                                                     Vm ..Vnl: 12 4,rt
                                                    W.. _ WI: h2~v.
        +I        Vml VI/': :!   ! 41t 21t (v, - vo) t - sin 27: (V, - vo)t
                         h'            !lv,                           (V,-Vo)S
        -.L      ')R (                _ L           V m .. 11..1.:   '.) Vkl'V'm ')      J21t (V, - Volt - sin 21t( v, - Volt
             I   ~
                         .
                             17m!.:         nil   W ft _ ur
                                                         n k'
                                                              h3 lJ.V,
                                                                  A    ... 1t            l            (Vr _ Vo )2
                                                                                       + .1 -       cos 21t (V, - volt)       (34)
                                                                                           ~         (v, - vo )2      f
 where the quantities on the right now refer to that incident light-quaBtum s
 for which v. = v" and R means the real part of all that occurs in the term after it.
    The first of these three t.erms is just the contribution of those terms of the
.dispersion formula (22) that remain finite, the second is that which replaces the
 contribution of the infinite term,· and the third gives the interference between
 the first two, and replaces the cross terms obtained when one squares the dis-
 persion electric moment. One can see the meaning of the second term more
 clearly if one sums it for all frequences v, of the scattered radiation in a small
 frequency range Vo -(:I.' to Vo + (:I." about the resonance frequency Vo (which
 frequency range must be large compared with the theoretical breadth of the
spectral line in order that the approximations may be valid). This is equivalent
to multiplying the term by (~v,)-l and integrating through the frequency range.
 If, for brevity, one denotes the quantity 41t I VmIV,1.: \2/h4 ~v, h.v, .by f (v,), the
result is, neglecting terms that do not increase indefinitely with t or that tend
 to zero as the (:I.'S tend to zero,
j..
 "0 -
     + ""f (v,) 21t (v, - vo) t - sin ~1t (v, - vo) t dVr
        .'                   (v, - vo)
        -f( ) JVo+ ·"21t (v, - vo) t - sin 21t (v r - vo) td
        -  Vo                                                v,
                                  ('V r - vol
                                              3
               "0 -                   Q'
                                                   +f' (vo) f"o + .." 21t (vr -                 vol t - sid ~1t (v, - vol t dV r
                                                                     • "0 -.'                      (v, - Vol
  Thus the contribution of the second term in (34) to the small frequency range
'10 -   a.' to '10 + a." consists of two parts, one of which increases proportionally
to tll and the other proportionally to t. The part that increases proportionally
to tll, na.mely,
is just that which would arise from actual transitions to the higher state of
the atom and down again governed by Einstein's laws, since the probability
that the atom has been raised to the higher state by the time 't' is*
(2 'It)1I1 'Ilk III/h2 flv, ..., and when it is in the higher state the probability per
unit time of its jumping down again with emission of a light-quantum in the
required direction is (27t)21 vm/12 /h2 Av" so that the total probability of the two-
transitions taking place within a time t is
The part that increases linearly with the time may be added to the contributions
of the first and third terms, which also increase according to this law. For
values of t large compared with the periods of the atom, the terms proportional
to t will be negligible compared with those proportional to t2, and hence the
resona.nce scattered radiation is due practically entirely to absorptions and
emissions according to Einstein's laws.
      • This result and the one for the emission follow at once from formula (32) of loco cit.
TIlE TIME-ENERGY UNCERTAINTY RELATION                                                199
238 E. P. WIGNER
                                                                        (3)
holds again for all values of to and Eo. However, whereas there are, for
any Xo and Po in (1), state vectors for which the equality sign is valid in
(1), namely those for which the x dependence of", is a Gaussian of
X-X o multiplied with exp ipoX, the inequality sign always holds in (3).
This is a consequence of the fact that the energy is a positive definite
operator (or has, in the non-relativistic case, a lower bound). The lower
limit of TE is, naturally, independent of to but does depend on Eo and
increases substantially as Eo approaches the lower bound of H from
above. Naturally, it increases even further as Eo crosses that bound and
decreases further. All this, as well as equation (2a), will be further
discussed below.
      3. A GENERALIZATION OF HEISENBERG'S
                  UNCERTAINTY RELATION
There are uncertainty relations for practically any two non-commuting
operators, but the generalization of Heisenberg's relation to be pointed
out here is a very special one and bears a close resemblance to the
original form of the relation. We denote, first, the variables of ,p by
202                                                                                       CHAPTER ill
240 E. P. WIGNER
x and T, the latter one standing for all variables different from x. The
relation (1) then remains valid if one replaces'" in (la) by
                                                                                    (4)
cfobeing any function of T and the integration is over all values of the
continuous coordinates implied by T and summation over the discrete
ones. This is, of cour!>e, a well-known fact, most commonly used with
a cfo which is a delta function of all coordinates different from x. The
derivation of the relation (1), given by Robertson,2 remains equally
valid, however, if T is assumed to involve also the time, with ",(x, r)
satisfying Schrodinger's equation and cfo depending on time in an
arbitrary fashion. The right side of (4) is then a generalized transition
amplitude which can be thought of as corresponding to a measurement
lasting a finite length of time, hut not affecting x.
   Accepting the generalization of Heisenberg's relation just outlined:
one sees that ~he time-energy uncertainty relation referring to the
life-time of resonance states, which was mentioned in the first paragraph
of this article, is not as different from (3) as it first appeared. It is in fact
included in the generalization of (3) which is the analogue of the
generalization of (1) just pointed to. The generalization replaces in this
case ",(x, y. z, t) by (ul"'(t» = X(t) where u is any state vector. The
time spread T is then the positive square root of
                   2_    II (ul"'(t»12 (t -
                                          t o)2 dt _   IIx(t)12 (t- t o)2 dt        (S)
               T    -        JI(ul",(t»12 dt       -       Ilx(t)\2 dt         .
In order to define the energy spread, we calculate, in analogy to (Zh)
                               cfo(E)=(271li)-i I ",(t)iEtl"dt.                    (Sa)
This is a stationary state of energy E. Hence, E2 will be
          lL       II (ul1>(E»1 2 (E-Eo)2 dE _         II7}(E)12(E-Eo)2dE
                                                                                   (Sb)
         E -              JI(u\cfo(E»\2 dE        -         JI7J{E)\2 dE
where
                        7J{E) = (ulcfo(E» = (27T1i)-i f(ul",(t»eiEt'''dt
                              = (27T1i)-i fx(t)iE11"dt                             (Sc)
is the Fourier transform of x. The factor (2711i)-i renders the denomi-
nators of (Sb) and (S) equal. With these definitions, and identifying u
with the state vector of the resonance, (3) will represent the uncertainty
relation giving the minimum energy spread of a resonance with given
life-time. This is the time-energy uncertainty relation mentioned in
the tfirst paragraph of the present article. On the other hand, if one
THE TIME-ENERGY UNCERTAINTY RELATION                                           203
242 E. P. WIGNER
                     J dg JJTJo(E) 7Jo(E')·(82/8E8E')e i
                     00
              211                        II7Jo(E)12 dE
Since 7Jo(E) and TJo(E')* both vanish at both ends of the integration, at
o and 00, partial integration with respect to E and E' gives
                          co
               1i2        J d~ JJ(Ur]o(E)loE)(Ur]o(E')·loE')ei{E'-ElfdEdE'
         2 ___        -00
        T -    211    ~-----:;JI-1Jo7.(E)=12-;'dE:::---·---
244 E. P. WIGNER
and this will be decreased if one sets of3/oE = O. It follows that the
minimum of T( will be assumed for a real 7Jo and such an 7Jo will be
assumed henceforth. The real nature of 7Jo could have been inferred
also from time inversion invariance.
   We are ready to obtain the minimum of T2(2 for given (2 (and Eo). To
obtain it, we set the variation of
                                                                         (20)
where
                                                                        (20a)
 has been introduced to make (20) somewhat more symmetric; >.. must
 be so determined that (lsa) become valid.
    Since 7Jo must vanish for both E = 0 and E = 00, (20) is essentially a
 characteristic value - characteristic function equation. It is well known,
 aQd can be easily verified, that its solution which approaches 0 for very
 large E approaches 0 as exp (- lcW) with c= >..IT/Ii(. This verifies the
 statement made about 7Jo at the end of the last section.
THE TIME-ENERGY UNCERTAINTY RELA nON                                            2fJ7
(21)
This does not quite satisfy the boundary condition at E = 0 but for
large Eo it satisfies it quite closely. Similarly, (15a) is satisfied closely
enough. In the present case, actually, the minimum of TE is independent
of E as long as this remains very much smaller than Eo.
   The otlier case in which (20) can be easily solved is Eo=O. In this
case the only solutions of (20) which satisfy the boundary conditions
have the form
                            'YJo=E exp( - lcW).                          (22)
                                                                         (23)
208                                                                                          CHAPTER III
246 E. P. WIGNER
the primes denoting differentiation with respect to the first variable and
T the value for which g(O, eo) = 0, and g tending to 0 for largt! E.
showing that the mean value of the energy is always larger than Eo.
THE TIME-ENERGY UNCERTAINTY RELA nON                                                 209
This last equation does not contain T, the former one gives an expres~ion
for Ar in terms of 710' Naturally, (25a) and (25b) can be combined in
various ways to eliminate A or the integral in (25a). These equations
play the role of the virial theorem which applies for the wave function
giving minimal position-energy spread and Heisenberg's uncertainty
relation. The last term on the left of (25b) vanishes in the simple cases
considered in the last section: Eo=O in the second case and the integral
in the numerator vanishes for Eo=O, giving A= 1 in both cases. In all
other cases, A< 1.
                               REFERENCES
1. P. A. M. Dirac, Proc. Roy. Soc. (London) A1l4, 243, 710 (19~7). The
   calculation was carried out by V. Weisskopf and E. Wigner, Z. Physik 63,
   54 (1930). See also the article of the same authors, ibid. 65, 18 (1930) and
   many subsequent discussions of the same subject and of resonance states
   decaying by the emission of particles rather than radiation.
2. W. Heisenberg, Z. Physik 43, 172 (1927). For a rigorous derivation, see
   H. P. Robertson, Phys. Rev. 34, 163 (1929). The derivation of section 5 is
   patterned on that of this article.
3. G. R. Allcock, Ann. Phys. (N.Y.) 53, 253, 286, 311 (1969). Section II of
   the first of these articles gives a very illuminating discussion of the ideas
   which underlie also the present section. It also contains a review of the
   literature of the time-energy uncertainty principle, making it unnecessary
   to give such a review here. The review also gives a criticism of some of the
   unprecise interpretations of the time-energy uncertainty relation which are
   widely spread in the literature. The later parts of the aforementioned articles
   arrive at a pessimistic view on th~ possibility of incorporating into the
   present framework of quantum mechanics time measurements as described
   by Allcock in section II or in the present section. This pessimism, which is
   not shared by the present writer, is expressed, however, quite cautiously
   and is mitigated by the various assumptions on which it is based.
4. The notation of chapter Xl of P. A. M. Dirac's The Principles of Quantum
   Mechanics (Oxford University Press, various editions) is used.
210                                                                                                                  CHAPTER III
143          Am. 1. Phys" Vol. 53, No.2, February 1985                                                     Hussar, Kim, and NOl      143
212                                                                                                              CHAPfERIII
As for the third commutation relation of Eq. (5) for the         and (intrinsic spin)' of the hadron."'" As a consequence,
longitudinal direction, we have to consider the Lorentz          the harmonic oscillator model constitutes a solution ofDir-
transformation of the coordinate system:                         ac's "Poisson bracket" equations for his "instant form"
  Z=   (ZO +Pl°)l(I_P,)II"                                       quantum mechanics. 24 .25
                                                                   While the exact form for the hadronic wave function is
                                                           (9)   somewhat complicated, the essential element of the wave
  1= (10 +pzO)/(1 _p')II'.                                       function takes the form 16
Likewise, the transformation equations for the momen-              ,pn,(xl = (I/lT2' + kn!k !)"'Hn(zO)H'(1 0)
tum-energy variables can be written as
  q, = (q;   + Pqt)/(1 -     p')I12,
                                                                             X exp{ - [(z»'      + (I O)']/2}.            (15)
                                                                 For simplicity, we assume here that the harmonic oscilla-
                                                          (10)
                                                                 tor has a unit strength. In the above expression, we have
  qo = (q~ +Pq:)/(1 _p')II'.                                     suppressed all the factors which are not affected by the
   In terms of the lab-frame variables, the uncertainty rela-    Lorentz transformation along the z axis. This is possible
tion [ZO,q:] = i ofEq. (5) and the TE relation ofEq. (6) can     because the oscillator wave functions are separable in both
be written as                                                    the Cartesian and spherical coordinate systems.
                                                                    In terms of the standard step-up and step-down opera-
  [z, q,] = i/(1 -P'),
                                                                 tors,
                                                          (11)
  [I, qo] = iP'/(I-P').
In addition, because the Lorentz transformation is not an
orthogonal transformation, the commutation relations                                                                       (16)
between z and qo and between I and q, do not vanish:
  [z,qo] = iP I( I _ P '),
                                                                   a~ = ~ (XI' + a~).
                                                          (12)
                                                                 and the oscillator wave function of Eq. (15) satisfies the
                                                                 differential equation
  [I,q,] = iP/(I-P').
The commutation relations of Eqs. (8), (II), and (12) can          a!a~,p(x) = (II   + I)¢(x),                             (17)
now be combined into a covariant form 14:                        where the eigenvalue II, together with transverse excita-
  [x~,qv]= -g~v +P~PvIM',                                 (13)   tions, determines the (mass)' of the hadron.23
                                                                    The operators given in Eq. (16) satisfy the algebraic rela-
with the covariant form of Eq. (7):                              tion
  [.<:I (P·xIM)][.<:I(P·qIM)]=eI,                         (14)                                                             (18)
where M is the hadronic mass. We use the convention              This commutation relation is Lorentz invariant. The time-
goo= -gil = -g,,= -g33=1.                                        like component of the above commutator is - I in every
   Although the uncertainty relations can be brought to the      Lorentz frame. This allows timelike excitations. Indeed, it
above convariant form, it is very difficult to give physical     is possible to construct a covariant Hilbert space of har-
interpretations to them. First, in Eq. (II), the right-hand      monic oscillator wave functions in which timelike excita-
side is dependent on the velocity parameter. Does this           tions are allowed in all Lorentz frames.'o
mean that Planck's constant becomes dependent on the                On the other hand, as we shall see in Sec. V, there is no
velocity? Second, the commutators ofEq. (12) do not van-         evidence to indicate the existence of such timelike excita-
ish while there are no conjugate relations between the var-      tions in the real world. 17 This is perfectly consistent with
iables involved. IS In order to resolve these puzzles, we have   the fact that the basic space-time symmetry of confined
to resort to an interpretation based on wave functions.          quarks is that of the o (3)-like little group of the Poincare
                                                                 group."·27 We can suppress timelike excitations in the ha-
III. USE OF THE HARMONIC OSCILLATOR                              dronic rest frame by imposing the subsidiary condition 10.23.
FORMALISM
                                                                                                                           (19)
   Traditonally, in nonrelativistic quantum mechanics, the
harmonic oscillator has been very useful in giving interpre-     Then only the solutions with k = 0 are allowed, and the
tations to the uncertainty relations. It is therefore not un-    commutator given in Eq. (18) is not consistent with the
reasonable to expect that a relativistic harmonic oscillator     above subsidiary condition.
model may prove useful in clarifying the questions raised at        How can we then construct a covariant commutator
the end of Sec. II. Is there then a model which can be used      consistent with the condition of Eq. (17)1 In order to attack
for this purpose?                                                this problem, let us divide the four-dimensional Minkow-
   It has been shown that the covariant harmonic oscillator      skian space-time into theone-dimensional timelike space 1 0
formalism introduced to this journal by the present authors      parallel to the hadronic four-momentum and to the three-
in 1978 serves many useful purposes. IO It can explain the       dimensional spacelike hyperplane perpendicular to the
basic hadronic features in the quark model, including the        four-momentum spanned by x', y', and zO. 15 This leads us
mass spectrum,l7 form factors,I8 parton picture,I •. 'o and      to consider the operator
the jet phenomenon. 2I In addition, the relativistic bound-
state wave functions in the oscillator formalism form a vec-        b~ =a~ -(P~pvIM')av'                                   (20)
tor space for the representations of the Poincare group di-      Then b~ is perpendicular to P ~ , and satisfies the constraint
agonal in the Casimir operators corresponding to (mass)'         condition
144      Am. 1. Phys., Vol. 53, No.2, February 1985                                                Hussar, Kim, and Noz    144
 THE TIME-ENERGY UNCERT AINTY RELATION                                                                                                     213
  P"b
     "
         = PI'b:' = o.                                       121)
bl-' and b It, satisfy the covariant commutation relation::'H
   [b",b;]    =    -glll.+P"P,./M'.                          122)
 The right-hand side ofthe above expression issymmteric in
f1 and v, and satisfies the relation
  PI'I. - gill'   + PpP,IM') = O.                            123)
Therefore the covariant commutation relation given in Eg.
121) is consistent with the subsidiary condition of Eg. 119).
   The covariant form given in Eg.I22) represents the usual
Heisenberg uncertainty relations on the three-dimensional
spacelike hypersurface perpendicular to the hadronic four-
momentum. This form enables us to treat separately the
uncertainty relation applicable to the timelike direction,
without destroying covariance. The existence of the t * dis-
tribution due to the ground-state wave function of Eg. lIS)
restricted to k = 0 by Eg.1I8) allows us to write the time-         Fig. 2. The Lorentz deformation in the light-cone coordinate system. The
energy uncertainty relation in the form                             major (minor) axis in the uv coordinate system is conjugate to the minor
                                                                    (major) axis in the q"q" coordinate system. Ifwe rotate these figures by
  ILlt 'IILlqti)~ I,                                         (24)   45~, they become Fig. 3 of Ref. 3 which explains the peculiarities observed
                                                                                              J
gated while the other goes through a contraction so that the
product uv will stay constant:
                                                                       4> (q"qo) =     C~)        exp[ i(q,z - qat)] ¢(z,t)dt dz
  uv = u*v*
      = (t 2 _ z')/2 = [(t *)' _ (z*)']l2.                  (27)
                                                                                     =(~)exp{ - [(q:)'+(q~)']/2},                         (33)
This transformation property is explained in detail in Figs.
145       Am. J. Phys., Vol. 53, No.2, February 1985                                                       Hussar, Kim, and Noz            145
214                                                                                                                     CHAffER III
which can also be written in a form similar to Eq. (32).                          QUARKS                  PARTONS
 The Fourier relations 14 between the space-time and mo-
mentum-energy coordinates are
(34) Fig. 3. Pictures of the proton in the quark and parton models. Suppose
  q, =q+ =     -{fu}                                             that a proton is sitting quietlY on the desk. According to the quark model,
                                                                 it appears like a bound state of three quarks to an observer who is sitting
                                                                 on the chair. However, to an observer who is on a jet plane with its speed
This means that the major and minor axes of the momen-           close to thal of light, the prolon would look like a collection offree parti-
tum-energy coordinates are the "Fourier conjugates" of           cles with a Wide momentum distribution. This is called Feynman's parton
the minor and major axes of the space-time coordinates,          picture.
respectively. This aspect is illustrated in Fig. 2. Thus we
have the following Lorentz-invariant reiations.JI
  (,ju)(,jq_) = (,ju*)(,jq*- )=1,                                   (bl The interaction time between the quarks becomes di-
                                                          (35)   lated, and the partons become free to allow an incoherent
  l,jv)(,jq+) = (,jv*)(,jq*,. )=1.                               sum of cross sections of the constituent particles.
This is indeed a Lorentz-invariant statement of the c num-          (c) The momentum distribution of partons becomes
ber time-energy uncertainty reiation combined with Hei-          widespread as the hadron moves very fast.
senberg's position-momentum uncertainty relations.                  (d) The number of partons appears to be much larger
                                                                 than that of quarks.
V. OBSERVABLE CONSEQUENCES                                          Because the hadron is believed to be a bound state of two
    In order that the time-separation variable be a c number,    or three quarks, each of the above phenomena appears as a
it is essential that there be no timelike oscillations. We im-   paradox, particularly Ib) and Ic) together. How can bound-
plement this concept by imposing the subsidiary condition        state quarks become free while the momentum distribution
ofEq. (19), which restricts the wave functions ofEq. (15) to     becomes widespread? This question has been discussed in
those with k = O.                                                Ref. 20. Peculiarities (a) and (b) have been addressed in Ref.
    On the other hand, if we allow timelike oscillations, the    19. According to Hussar's calculation," the ground-state
eigenvalue of the oscillator wave function which corre-          harmonic oscillator wave function leads to a parton distri-
sponds to the (mass)' spectrum will be"                          bution in good agreement with the experimental data. The
                                                                 parton phenomenon is therefore a direct manifestation of
  A = (n - k ).                                           (36)   the time-energy uncertainty relation combined covariantly
For a given value n of the longitudinal excitation, k can        with Heisenberg's position-momentum uncertainty rela-
take an arbitrarily large number. Thus (mass)' can take          tion.
negative numbers with no lower bound. This does not hap-
pen in nature. Furthermore, for a given value of A, there are
infinite possible combinations of nand k, resulting in infi-
nite degeneracy. There is no evidence to indicate the exis-      B)Present address: mjnDis Institute of Technology Research Institute. An-
tence of such a degeneracy in hadronic mass spectra. 17            napolis, Maryland 21401.
   We noted in Sec. IV that the Lorentz deformation prop-        IE. P. Wigner, in Aspects 0/ Quantum Theory, in Honour of P. A. M.
erty of the harmonic oscillator wave function enables us to        Dirac's70th Birthday, edited by A. Salam and E. P. Wigner (Cambridge
define the uncertainty products in a Lorentz-invariant             University, London, 1972).
                                                                 'Po A. M. DIrac, Proc. R. Soc. London Ser. A 114, 234, 710 (1927).
manner. Then the question is whether the deformation
                                                                  'E. P. Wigner and V. Weisskopf. Z. Phys. 63, 54(1930): 65, 18(1930).
property described in Fig. 2 manifests itself in the real        4E. P. Wigner, Phys. Rev. 10, IS. 609 (1946); E. P. Wigner and L. Etsen-
world. This question has been addressed in Refs. 19 and 20.        bud, ibid. 70. 29(1947); M. Moshinsky. Rev. Mex. Fis.l. 28(1952); Phys
The point is that Lorentz deformation given in Fig. 2 can be       Rev. 81, 347 (1951); 84. 525, 533 (1951); 88, 625(1952). P. T. Mathews
described in the zt and qzqo planes, which are simply 45'          and A. Salam, ibid. 115, 1979 (1959); F. T. Smith, ibid. 118. 349 (1960);
rotations of the figures in Fig. 2. Figure 2 of the present        Y. Aharonov and D. Bohm, ibld.1l2, 1649(1961); V. A. Fock, J. Exp.
paper and Fig. 3 of Ref. 20 are only two different forms of        Thear. Phys. (U.S.S.R.) 42, 1115 (1962): Sov. Phys. JETP 15, 784(1962):
the same figure. Figure 2 is designed to explain the Lorentz       Y. Aharonov and D. Bohm, Phys. Rev. 134, 1417 (1964); B. A. Lipp-
invariance of the uncertainty relation, while Fig. 3 of Ref.       mann, ibid. 151, 1023 (1966); G. R. Allcock. Ann. Phys. 53, 253 (1969);
                                                                   53,28611969); 53, 311 (1969); J. H. Eberly and L. P. S. Singh, Phys. Rev.
20 is designed to explain Feynman's parton picture.
                                                                   D7, 359 (1973). See also articles by J. Rayski and J. M. Rayski, Jr.; by E.
   It is by now a widely accepted view that hadrons such as
                                                                   Recami; and by E. W. R. Papp, in The Uncertainty Principles and Quan-
nucleons and mesons are bound states of quarks, if they do         tum Mechanics. edited by W. C. Price and S. S. Chissick (Wiley, New
not move rapidly. However, Feynman observed in 1969                York. 1977); M. Bauer and PA. Mello, Ann. Phys. (NY) 111, 38 (1978):
that, if a hadron moves with a velocity close to that oflight,     M. Bauer, ibid. 150, 1 (1983).
it appears as a collection of partons,32 as is illustrated in     'W. Heisenberg, Z. Phys. 43,172(1927); 45,172(1927).
Fig. 3. It is also believed that partons are quarks. The par-    ·W. Heisenberg, Am. J. Phys. 43, 389(1975).
ton picture, which has been a primary vehicle toward our          7The relation between the size of wave train and the linewidth was known
present understanding of high-energy hadronic interac-             in classical optics long before quantum mechanics was formulated.
tions, has the following peculiarities.                            However. this is only the reJation between the lifetime and jinewidlh of
                                                                   an unstable system mentioned in Ref. I. See W. Heitler, The Quantum
  la) The picture is valid only for hadrons moving with            Theory 0/ Radiation (O.ford University, London, 1954), 3rd ed.
velocity close to that of light.                                 'D. Han, Y. S. Kim, and M. E. Noz, Found. Phys. 11, 895 (1981).
146      Am. J. Phys., Vol. 53, No.2, February 1985                                                      Hussar, Kim, and Noz             146
  THE TIME-ENERGY UNCERTAINTY RELATION                                                                                                                215
<JFor papers dealing with the time-separation variable in bound systems,           1038 (1980); 48,1043 (1980).
 see H. Yukawa, Phys. Rev. 79, 416 11953); G. C. Wick, ibid. 96, 1124          I~For    papers dealing with form factor behavior, see K. Fujimura, T. Ko-
 (1954); M. Markov, Suppl. Nuovo Cimento3, 760 11956); T. Takabayasi,              bayashi, and M. Namiki, Prog. Theor. Phys. 43, 7J (1970); R. G. Lipes,
 Nuovo Cimento 33, 668 (1964); S. Ishida, Prog. Theor. Phys. 46, 1570.             Phys. Rev. D 5, 2849 (1972); Y. S. Kim and M. E Noz, ibid. 8, 3521
 1905 (1971); R. p, Feynman, M. Kislinger, and F. Ravndal, Phys. Rev.              (1973)
 D 3, 2706 1l972); G. Preparata and N. S. Craigie, Nuel. Phys. 8102,478         IQy. S. Kim and M. E. Noz, Phys. Rev. DIS, 335 (1977).
 (1976); Y. S. Kim, Phys. Rev. 0 14, 273 (1976); J. Lukierski and M.           "Y. S. Kim and M. E. Noz, Am. J. Phys. 51, 368 (1983).
 Oziewics, Phys. Lett. 69B, 3390977); D. Dominici and G. Longhi,               :' lFor papers dealing with the jet phenomenon, see T. Kitazoe and S.
   Nuovo Cimento A 42, 235 (1977); T. Goto, Prog. Theor. Phys. 58, 1635            Hama, Phys. Rev. D 19, 2006 (1979); Y. S. Kim, M. E. Noz, and S. H.
   (1977); H. Leutwyler and J. Stem, Phys. Lett. 73B, 75 (1978); and Nuel.         Oh, Found. Phys. 9, 947 (1979); T. Kitazoe and T Morii, Phys. Rev. D
   Phys. 8157, 327 (1979); I. Fujiwara, K. Wakita. and H. Yoro, Prog.              21,685(1980); Nuel. Phys. 8164, 76(1980).
  Theor. Phys. 64, 363 (1980); 1. lersak and D. Rein, Z_ Phys_ C 3. 339        "E. P. Wigner, Ann. Math. 40, 149 (1939).
   11980); I. Sogami and H. Yabukl, Phys. Lett. 94B, 15711980); M. Paun,       "~Yo S. Kim, M. E. Noz, andS. H.Oh,J. Math. Phys.l0, 1341 (1979);Am.
   in Group Theoretical Methods In Physics, Proceedings of the 9th Interna-        J. Phys. 47, 892 (1979); J. Math Phys. 21, 1224 (1980).
   tional Colloquium, Coeoyoe, Mexico, edited by K. B. Wolf (Springer-         "P. A. M. Dirac, Rev. Mod. Phys. 21, 392 (1949).
   Verlag, Berlin, 1980); G. Marehesini and E. Onofri, Nuovo Cimento           "D. Han and Y. S. Kim, Am. J. Phys. 49,1157 (1981).
   A6S, 298 (1981); E. C. G. Sudarshan, N. Mukunda, and C. C. Chiang,          ~t-.F. C. Rotbart. Phys. Rev. 023,3078 (1981). For a physical basis for
   Phys. Rev. D 25,3237 (1982).                                                    Rotbart'scalculation, see L. P. Horwitz and C. Piron, Helv. Phys. Acta
 IOFor a recent pedagogical paper on this problem, see C. H. Blanchard,            46,316 (1973). See also Ref. 16.
   Am. J. Phys. 50, 64211982).                                                 "D. Han, M. E. Noz, and Y. S. Kim, Phys. Rev. D 25,1740(1982).
I IFor a possible departure from the accepted view, see £. Prugovecki,         2MD. Han, M. E. Noz, Y. S. Kim, and D. Son, Phys. Rev. D 27, 3032
   Found. Phys. 12, 555 (1982); Phys. Rev_ Lett. 49, 1065 (1982).                  (1983).
12E. Prugovecki, Quantum Mechanics in Hilbert Space (Academic. New             2QThis commutator can be translated into wave-function fonnalism. For a
   York, 1981). 2nd ed.                                                            wave-function description of this commutation relation. see M. 1. Ruiz,
I1H. P. Robertson, Phys. Rev. 34,163 (1929).                                       Phys. Rev. D 10. 4306 (1974).
14The word "Fourier relation" was used earlier by Blanchard in Ref. 10.        lOFor a pedagogical treatment of the light-cone coordinate system, see Y.
   This word is necessary because the energy is not a variable canonically         S. Kim and M. E. Noz, Am. J. Phys. 50, 721 (1982).
   conjugate to the time variable                                              "Y. S. Kim and M. E. Noz, Found Phys. 9, 375 (1979); J. Math Phys. 22,
\~G. N. Flemmg, Phys. Rev. 137, B188 (1965:1; G. N. Fleming, 1. Math               2289 (1981).
   Phys.ll, 1959 (1966)                                                        '~R. P. Feynman, in High Energy Collisions, Proceedings of the Third
"Y. S. Kim and M. E. Noz, Am. J. Phys. 46, 48411978).                              International Conference, Stony Brook, NY, edited by C. N. Yang et al.
17For some of the latest papers on hadronic mass spectra, see N. lsgur and         (Gordon and Breach, New York, 1969); Pholon Hadron Interactions
  G. Karl. Phys. Rev. D 19, 2653 (1978); D. P. Stanley and D. Robson,              (Benjamin, New York, 1972). See also J. D. Bjorken and E. A. Paschos,
   Phys. Rev. Lett. 45. 235 (1980). For review articles written for teaching       Phys. Rev. 185, 1975 (1969).
   purposes, see P. E. Hussar, Y S. Kim, and M. E. Noz, Am. J. Phys. 48,       "~Po E. Hussar, Phys. Rev. D 23, 2781 (1981).
147         Am. J. Phys. 53 (2), February 1985                                         ® 1985 American Association of Physics Teachers               147
Chapter IV
The success of quantum field theory in the 1950's led a large number of physicists
to believe that field theory will solve all dynamical problems. Naturally, they
attempted to solve the hydrogen atom problem within the framework of field theory.
The Bethe-Salpeter equation is the most suitable field theoretic equation for
quantum bound-state problems. Although this equation offers us mathematical
challenge and generates some useful solutions, it is plagued with fundamental
difficulties, as Wick pointed out clearly in 1954. During the 1970's, field theoretic
bound state models have been proven to be ineffective in dealing with hadrons in the
quark model. We are thus fully justified to construct a relativistic model of bound
states consistent with special relativity and quantum mechanics, but not necessarily
within the framework of the mathematical framework of field theory, as is indicated
in Figure 2.
The question is then whether we can construct a relativistic bound-state model
without the fundamental difficulties contained in the Bether-Salpeter wave function.
In 1973, Kim and Noz investigated this possibility, and showed that the harmonic
oscillator model of Yukawa (1953) can satisfy this condition. Yukawa's 1953
papers are based on his earlier effort made in 1950 to formulate a field theory of
particles with space-time extension. Yukawa's aim was not different from that of
the present day string models.
The covariant harmonic oscillator model has been studied extensively by the present
authors and their associates. The orthogonality relation and Lorentz transformation
properties have also been studied by Ruiz (1974) and Rotbart (1981). In 1981, Han
and Kim showed that the covariant oscillator formalism can serve as a solution of
the Poisson-bracket equations for relativistic bound state formulated by Dirac in
1949.
The paper of Kim, Noz, and Oh in Chapter II shows that the same oscillator
formalism constitutes a representation of the Poincare group for relativistic extended
hadrons.
                                         217
218                                                                                 CHAPTER IV
COMET PLANET
NEWTON
              ><
                                                               GALILEO
                                           BOHR
HEISENBERG, SCHRODINGER
                   FEYNMAN            C§J)
                        1                     1                EINSTEIN
STEP 2
              For the purposes of atomic theory it is necessary to combine the restricted principle of relativity with
           the Hamiltonian formulation of dynamics. This combination leads to the appearance of ten fundamental
           quantities for each dynamical system, namely the total energy, the total momentum_ and the 6-vector
           which has three components equal to the total angular momentum. The usual form of dynamics expresses
           everything in terms of dynamical variables at one instant of time, which results in specially simple expres-
           sions for six or these ten, namely the components of momentum and of angular momentum. There are
           other forms for relativistic dynamics in which others of the ten are specially simple, corresponding to
           various sub-groups of the inhomogeneous Lorentz group. These forms are investigated and applied to a
           system of particles in interaction and to the electromagnetic field.
of the atom. In the atomic world the departure of            another. The imperfections may well arise from the use
space-time from flatness is so excessively small that        of wrong dynamical systems to represent atomic phe-
there would be no point in taking it into account at the     nomena, i.e., wrong Hamiltonians and wrong interaction
present time, when many large effects are still unex-        energies. It thus becomes a mailer of great importance to
plained. Thus one naturally works with the simplest          set up >lew dynamical syslems ana see if tlzey will better
kind of coordinate system, for which the tensor g"'          describe the alomic world. In setting up such a new
that defines the metric has the components                   dynamical system one is faced at the outset by the two
                                                             requirements of special relativity and of Hamiltonian
              Ff'= _gll= -g"= -Ff'= 1                        equations of motion. The present paper is intended to
              g"'=O for Jl;o!v.                              make a beginning on this work by providing the
Einstein's restricted principle of relativity is now of      simplest methods for satisfying the two requirements
paramount importance, requiring that physical laws           simultaneously.
shall be invariant under transformations from one such
coordinate system to another. A transformation of this            2. THE TEN FUNDAMENTAL QUANTITIES
kind is called an inhomogeneous Loren tz transforma-           The theory of a dynamical system is built up in
tion. The coordinates u" transform linearly according        terms of a number of algebraic quantities, called
to the equations                                             dynamical variables, each of which is defined with
                                                             respect to a system of coordinates in space-time. The
with                                                         usual dynamical variables are the coordinates and
                                                             momenta of particles at particular times and field
                                                             quantities at particular points in space-time, but other
the a's and {3's being constants.                            kinds of quantities are permissible, as will appear later.
   A transformation of the type (1) may involve a               In order that the dynamical theory may be expres-
reflectio'l of the coordinate system in the three spacial    sible in the Hamiltonian form, it is necessary that any
dimensions and it may involve a time reflection, the         two dynamical variables, ~ and ~, shall have a P.b.
direction duo in space-time changing from the future         (Poisson bracket) U, ~], subject to the following laws,
to the past. I do not believe there is any need for
physical laws to be invariant under these reflections,
although all the exact laws of nature so far known do
have this invariance. The restricted principle of rela-
tivity arose from the requirement that the laws of
nature should be independent of the position and             A number or physical constant may be counted as a
velocity of the observer, and any change the observer        special case of a dynamical variable, and has the
may make in his position and velocity, taking his            property that its P.b. with anything vanishes.
coordinate system with him, will lead to a transforma-         Dynamical variables change when the system of
tion (1) of a kind that can be built up from infinitesimal   coordinates with respect to which they are defined
transformations and cannot involve a reflection. Thus        changes, and must do so in such a way that P.b.
it appears that restricted relativity will be satisfied by   relations between them remain invariant. This requires
the requirement that physical laws shall be invariant        that with an infinitesimal change in the coordinate
under infinitesimal transformations of the coordinate        system (2) each dynamical variable ~ shall change
system of the type (1). Such an infinitesimal transfor-      according to the law
mation is given by
                                                                                   ~'=H[~,    F],                   (4)
394 P. A. M. DIRAC
396 P. A. M. DIRAC
It is then necessary that the ten fundamental quantities,                     M.,= L:.(q.P,-q,P.).               (24)
and indeed all physical variables, shall have zero P.b.     The Hamiltonian~ p. will be the sum of their values
with gPqp. The condition for this is that they should       for the particles separately plus interaction terms,
involve the p's only through the combinations g.P,
-g,P.·                                                                  p.=L:lp.+q.B(p·p.-m')I+V"                (25)
   The ten fundamental quantities may be obtained by        The V. must be chosen so as to make the p. satisfy
a method parallel to that of the preceding section, with    the correct P.b. relations. The relations for [M '" P,]
the subsidiary equation (21) taking the place of Eq.        are satisfied provided the V. are the components of a
(12). We again assume Eqs. (13), and now choose the         4-vector. The remaining relations, which require the
A's so as to make their right-hand sides have zero P.b.     p. to have zero P.b. with one another, lead to quadratic
with qPgp. The resulting expressions for the ten funda-     conditions for the V .. These cause the real difficulty in
mental quantities will again satisfy the P.b. relations     the problem of constructing a theory of a relativistic
(6), as may be inferred by a similar argument to the        dynamical system in the point form.
one given in the preceding section.
   We find at once                                                           5. THE FRONT FORM
  We may set up a dynamical theory in which the              that it shall be invariant under all transformations of
dynamical variables refer to physical conditions on a        the three coordinates u" u" u+ of the front except those
front. This will make specially simple those of the          for which du+ gets multiplied by a factor, and for the
fundamental quantities associated with infinitesimal         latter transformations V must get multiplied by the
transformations of coordinates that leave the front          same factor. The linear conditions for V; require it to
invariant, and will give a third form of dynamics,           be of the form
which may be called the fro1l! form.
   IC Ap is any 4-vector, put                                                      Vi=q.V+V;',                      (30)
398 P. A. M. DIRAC
four potentials A .(u) satisfying the subsidiary equation             The second of Eqs. (34) then leads to
                       aA ./Ou. "" O.
Their Fourier resolution is
                                                           (31)
                                                                      M.,=4ri   fI   At..(k.il/ilk'-k..a/ilk·)A.·
                                                                                             +At•• A .. -At..A•• }ko-'d'k. (37)
point and front forms, respectively. The subsidiary           Hamiltonians. The former are the components of a
Eq. (31) must be modified when a charge is present.           6-vector, the latter are the components of a 4-vector.
  The point form will be worked out as an illustration.       Thus the four Hamiltonians can easily be treated as a
In this case we have at once Ap.=O. We can get Ap con-        single entity. All the equations with this form can be
veniently by arranging that qp(P.- P.F)_q.(Pp_ P/)            expressed neatly and concisely in four-dimensional
and !Pp-ppF-eAP(q)I!Pp-PpF-eAp(q)1 shall have                 tensor notation.
zero P.b. with q'q,. The first condition gives A.= q.B.          The front form has the advantage that it requires
The second then gives                                         only three Hamiltonians, instead of the four of the
                                                              other forms. This makes it mathematically the most
           1+ 2"'qpB+q'q.B'("·7f.-m') =0.                     interesting form, and makes any problem of finding
Thus we get finally                                           Hamiltonians substantially easier. The front form has
                                                              the further advantage that there is no square root in
 Pp=p.F+P.+qpK-'![(7f·q.)'                       }            the Hamiltonians (28), which means that one can avoid
     _   F     _      - K'(7f·... -m')]I- .. •q,1 (40)        negative energies for particles by suitably choosing the
M •• -M•• +q.p. q,P•.                                         values of the dynamical variables in the front, without
                                                              having to make a special convention about the sign
  The above theory of point charges is subject to the
                                                              of a square root. It may then be easier to eliminate
usual difficulty that infinities will arise in the solution
                                                              negative energies from the quantum theory. This
of the equations of motion, on account of the infinite
                                                              advantage also occurs with the point form with <=0,
electromagnetic energy of a point charge. The present
                                                              there being no square root in (23).
treatment has the advantage over the usual treatment
                                                                 There is no conclusive argument in favor of one or
of the electromagnetic equations that it offers simpler
                                                              other of the forms. Even if it could be decided that one
opportunities for departure from the point-charge
                                                              of them is the most convenient, this would not neces-
model for elementary particles.
                                                              sarily be the one chosen by nature, in the event that
                                                              only one of them is possible for atomic systems. Thus
                      8. DISCUSSION
                                                              all three forms should be studied further.
  Three forms have been given in which relativistic              The conditions discussed in this paper for a relativistic
dynamical theory may be put. For particles with no            dynamical system are necessary but not sufficient. Some
interaction, anyone of the three is possible. For particles   further condition is needed to ensure that the inter-
with interaction, it may be that all three are still          action between two physical objects becomes small when
possible, or it may be that only one is possible, de-         the objects become far apart. It is not clear how this
pending on the kind of interaction. If one wants to set       condition can be formulated mathematiCally. Present-
up a new kind of interaction between particles in order       day atomic theories involve the assumption of local-
to improve atomic theory, the way to proceed would            izability, which is sufficient but is very likely too
be to take one of the three forms and try to find the         stringent. The assumption requires that the theory
interaction terms V, or to find directly the Hamil-           shall be built up in terms of dynamical variables that
tonians, satisfying. the required P.b. relations. The         are each localized at some point in space-time, two
question arises, which is the best form to take for this      variables localized at two points lying outside each
purpose.                                                      other's light-cones being assumed to have zero P.b. A
   The instant form has the advantage of being the one         less drastic assumption may be adequate, e.g., that
people are most familiar with, but I do not believe it        there is a fundamental length A such that the P.b. of
is intrinsically any better for this reason. The four         two dynamical variables must vanish if they are
Hamiltonians Po, M,o form a rather clumsy combina-            localized at two points whose separation is space-like
tion.                                                         and greater than A, but need not vanish if it is less
  The point form has the advantage that it makes a             than X.
clean separation between those of the fundamental                 I hope to come back elsewhere to the transition to
quantities that are simple and those that are the              the quantum theory.
COVARIANT PICTURE OF QUANTUM BOUND STATES                                                                                               227
                 The possibility of a theory of non-local fields, which is free from the restriction that field quantities are
              always point functions in the ordinary space, is investigated. Certain types of non-local fields, each satis-
              fying a set of mutually compatible commutation relations, which can be ob~ned by extending familiar
              field equations for local fields in conformity with the principle of reciprocity, are considered in detail. Thus
              a scalar non-local field is obtained, which represents an assembly of particles with the mass, radius and spin 0,
              provided that the field is quantized according to the procedure similar to the method of second quantization
              in the usual field theory. Non-local vector and spinor fields corresponding to assemblies of particles with the
              finite radius and the spins 1 and t respectively are obtained in the similar way.
respectively. Equations (7) and (8) are obviously com-          Thus the most general form of U(X., '.), which satisfies
patible with each other and the former implies that             all the relations (7), (8), and (14), is
U(X p, 'p) is, in general, a superposition of plane waves
of the form expik.Xp with k. satisfying the condition
                                                          (9)
                                                                U(X.,            '.)=   J... f      (dk)'u(k., ,.)o(k.k·+,')
moving with the velocity v., v" v•. Accordingly, we perform /irst                  tha Lorentz transformation
                                                               x,.' =a,.,x,                                                      (23)
with the transformation matrix
                                                               k1k,/K'          k1k./K'
                                                              H(kJK)'           k,kJK'
                                                                                                                                 (24)
                                                               k,k./K'         1+ (k./l/.)'
                                                                k,/K               k,/.
where K = (.(.- k,))I. Then the wave function for the                         m.   QUAN'rIZATION OF NON-LOCAL SCALAR
internal motion can be described by a function u'(8', ",')                                      FIELD
of the polar angle 8', ",' defined by                                     In order to show that the non-local field above con-
  ,
11,=01"r,.="  , SIn
                 . 8',COStp,"   T2,=a2rT,=r , SIn        'f (25)
                                               . 8" Stnly?,            sidered represents exactly the assembly of identical
                                                                       particles with the finite radius, we have to quantize the
'3 =aa,t'.. =r cosO,            '. =atr1".=k,.r P/K.
                                                                       field on the same lines as the method of second quan-
Incidentally, '.' as defined by the last expression in (25)            tization in ordinary field theory. For this purpose, it is
is nothing but the proper time multiplied by - e for the               convenient to write (16) in another form
particle, which is moving with the velocity Vx, v., v,.
Again, u'(8', ",') can be expanded into series of spherical
harmonics:                                                             U(X .. ")=         f··· f (dk)'(dl)'u(k., I.)
            u'{8', ",')= l: e(k .. I, m)pt M{8', ",').        (26)
                          ~M
                                                                                       x 6(k.k·+ ")0(1,)'- A')o(ki')
  Since the above arguments are in conformity with the                                               X exp(ik.X·)ll. 6(,.+1.),   (29)
principle of relativity perfectly, the non-local field in
question can be regarded as a field-theoretical represen-              where I. is a four vector. The integrand is different from
tation of a system of identical particles, each with the               zero only for those values of k., I.. which satisfy the
mass m, the radius and the spin 0, which can rotate as                 relations
the relativistic rigid sphere without any change in                                  k.k'+<'=O, I,I'-A'=O,             k,I'=O.   (30)
shape other than the Lorentz contraction associated
                                                                       Accordingly, the matrix elements for the operator U are
with the change of the proper time axis.
  The non-local field U given by (16) reduces to the
ordinary local scalar field in the limit A-->O, as it should           (x.' 1 U 1x.") =     f··· f (dk)'(dJ)'u(k.. I,)
be, provided that the rest mass m is different from zero.
Namely, (x.' 1Ulx.") is different from zero only for                                X 6(k,k'+ .')6(1.1'- A') exp(ik·x.' /2)
x.' = x.", because the only possible solution of the
simultaneous Eqs. (9), (11), and (15) with m#O and                                         xll. o(x.'-x."+I,) exp(ik'x."/2),     (31)
A=O is '1=,,=,,=,,=0. On the contrary, the case of
the zero rest mass m = 0 is exceptional in that the non-               which is equivalent to the relation
local field U does not necessarily reduce to the local field
in the limit A=O. This is because the simultaneous
Eqs. (9), (11), and (15) with m=O and A=O have
                                                                       U=      f· .. f (dk)'(dJ)'u(k.,I,) exp(ik,x'/2)
solutions of the form
                                                                                                   Xexp(u'p./Ii) exp(ik.x·/2),   (32)
           ,.=±(A')'k.. k,=±(kl'+k,'+k.')I,                    (27)
                                                                       between the operators         x', P.   and U, where
where A' is an arbitrary constant with the dimension of
length. More generally, the simultaneous equations                            u(k.,I.)=u(k .. I.)6(k.k"+.')6(l.Z·- A')6(k.I").   (33)
with m=O and A#O has the general solution of the form
                                                                       As the operators k.x" and I"P. in the same term on the
          ,.=,/±(A')'k.. k,=±(k.'+k,'+ka')I,                   (28)    right-hand side of (32) are commutative with each other
                                                                       on account of the relations (I) and (30), (32) can also
 where,.' is any particular solution of the same equa-                 be written in the form
. tions. Thus the radius of the particle without the rest
 mass cannot be defined so naturally as in the case of the
 particle with the rest mass, corresponding to the cir-                U=      f· .. f (dk)'(dJ)'u(k.,I.) exp(ik.x·)
 cumstance that there is no rest system in the former                                                            Xexp(u·p./h).   (32')
 case. Detailed discussions of this particular case will be
 made elsewhere.                                                       Similarly the operator          U·,    which is the Hermitian
COVARIANT PICTURE OF QUANTUM BOUND STATES                                                                                                          231
  =   -Ik.T
         k.
            II• oCk.-k.')oCI.-I.')· 6Ck.k·+K')                           uCk, I, m)=        JJ    uCk, El,   <1»
                                                                                            JJ
effects of non-Iocalizability of the field are negligible,
because they are confined to small regions very near
the surface of the cube." In this case, the integrations
                                                                        U'Ck, I, m)=              U*Ck, El,        <1»
with respect to k. on the right-hand side of Eqs. C32')                                                      xp,mce,          <1»   sin8dEld<l>
and (34) are replaced by the summations with respect
to k.. which take the values                                            assuming that the spherical harmonics p,"Ce, <1» and
                                                                        their complex conjugate p,mC8, <P) are normalized
k,=(h/L)n"          k,=(h/L)n" k,=C2-tr/L)n"                            according to the rule:
                                                                             fJ
                   k.=±Ck,'+k,'+k,'+,,)I,                        (38)
where nt, nt, ns are integers, either positive or negative,                           p,mCEl, <I»P,"CEl,           <1»   sinedEld4>m= 1.               (45)
including zero. The integrations with respect to I. with
fixed k. are replaced by those with respect to z.' defined              Similarly, U* is transformed into the form
by
                                                       (39)
                                                                        U*=     I:    L(~)'                 A            IvCk, I, m)
where the coefficients a" are given by (24). Further,                         ., •••• 'M      L     4.Ck'+ .') I
w~ intr:m~ce ~h.e polar ~ngl~ El, <1>, which are conn~cte?
With I, , I, ,I, Just as 8 , 'P are connected With T, , T, ,                               X U(k, I, mJ+u*(k,l, m)U*Ck,l, m) I,                        (46)
r,' by the relations (25). Thus we obtain                               where
U=     L
      .,.,',
               JJ( -2~)' A sin8dEld<l>
                      L       4KCk'+K')1                                u'Ck, I, m)=       JJ     u*Ck,   e, <I»P,M(El, <1»
                                                                                                                               X sin EldEld<l>,
                    X luCk,    e, <I»UCk, El, <1»                                                                                                      (47)
                             +v*Ck, El, <I»U*Ck, ~,   <1»
  "More precisely, L must be large compared withV(I-I!')I,
                                                            I,   C4O)    vCk, I, m)=       JJ     vCk,   e, <I»P,M(El, <1»
where (3c is the maximum velocity of particles in consideration.                                                               Xsin8dEld<l>.
232                                                                                                                            CHAPTER IV
By the same transformation, we obtain from Eq. (37)                                instead of Eq. (37), where
the commutation relations
                                                                                                     [A, BJ+=AB+BA                      (53)
[ark, I, m), a*[k', 1', m')J=o(k, k')W, l')o(m, m')'}
[b(k, I, m), bOCk', 1', m')J=o(k, k')o(/, norm, m'), (48)                          for any two operators A and B. However, in this case,
                                                                                   we arrive at the well-known contradiction in the limit
 [ark, I, m), b(k', I', m')J=O, etc.
                                                                                   of A-+O, which prohibits the elementary particles with
for the operators defined by                                                              °
                                                                                   spin from obeying Fermi statistics.
                 2... ),
 ark, I, m)= ( ( -
                             A                     )1   ·f/(k, I, m),
                                                                                              IV. NON-LOCAL SPINOR FIELD
 b(k, I, m)= (( -lr)'     A )1·v(k, I, m),                                         spinor operator", with four components, which trans-
                                                                                   form as the components of Dirac wave function. Each
                          L     4«k'+K')1                                          of these components can be considered as a non-local
                                                                                   operator just like the operator U in the case of the
b*(k,l,m)=((~)'                        A           )1.V*(k,l,m).                   scalar field. As an extension of Dirac's wave equations
                          L     4«k'+ <')1                                         for the local spinor field, we assume the relations
                                                                                   between the operators x', p, and",:
Hence, each of the operators defined by
                  n+(k, I, m)=a*(k, I, m)a(k, I, m);
                                                                            (50)
                                                                                                     r'[P .. "'J+   ='"= 0,             (54)
X U(k, I, m)+u*(k, I, m)U*(k, I, m) I. (52) which can be readily obtained by considering Eqs. (54)
                                                                        I
                                                                                   and (55), must have the same form, so that fl. must
   It should be noticed, further, that we could start                              satisfy an additional condition:
from the commutation relations                                                                                                          (62)
[u(k .. I.), u*(k:, I:)J+                                               I          This condition reduces to the form
   =   II. o(k,-k:)o(/,-I:)· o(k,k'+<')                                                                                                 (63)
                                            X 0(1.1'- A')o(k,I'), (37)             which is the same as the condition (12) or (13) for the
                                                                                   scalar field, if fl, are so chosen as to satisfy the com-
  ruCk"~ I,), u(k:,           I:)J+=O,                                             mutation relations
[u*(k., I.),u*(k:, I:)J+=O.                                                                                                              (64)
COY ARIANT PICTURE OF QUANTUM BOUND STATES                                                                                                        233
where C is a matrix with the determinant different                              where S is a matrix with four rows and four columns."
from zero. Equation (64) can be satisfied by matrices                           In our case, in which the spinor y, has eight components,
1", fl, which are expressed in the form                                         we assume the same form for S in Eq. (69) except that
                                                                                the numbers of rows and columns are doubled, when
)'l=ip'Pll    -y2=iP'P2)    -y3=ip1fTJ 1        Y'=PJ)                  (65)
                                                                                Eq. (70) is a proper Lorentz transformation with the
                                                                                determinant + 1, whereas we have to replace Eq. (69) by
fJl=P3CT l,   f32=P3(f21    i33   =   P3(f3!    f3.=-ip'l!              (66)
in terms of sets of mutually independent Pauli matrices
                                                                                                              >/I'=w,S>/I,                        (71)
CThCT2, CT, and Pl, P2, p,. It is well known that the matrices
as given by (66) do not form an ordinary vector, but a                          when Eq. (70) is an improper Lorentz transformation
pseudovector. Thus, if we confine our attention to the                          with the determinant -1. This guarantees the invari-
proper Lorentz transformation, the relations (54) and                           ance of the relation (68) with respect to improper as
(55) are both invariant. However, if we perform the                             well as proper Lorentz transformations.
improper Lorentz transformation, for which the deter-                               However, the above procedure is unsatisfactory, par-
minant of the transformation matrix has the value -1                            ticularly because it is difficult to give:a simple physical
instead of + 1, the form of the relation (55) changes into                      meaning to the new degree of freedom. As will be shown
                                                                                in the additional remark at the end'of this paper, there
                                                                        (67)
                                                                                is an alternative way, in which we have no need to
whereas the relation (54) is invariant. In other words,                         increase the number:of components of y, from 4 to 8.
the fundamental equations for the non-local spinor                                  Now, each component >/I; (i= 1,2,3,4) of the spinor
field, which has similar properties as the non-local scalar                     >/I can be represented as a matrix (x:l>/Idx:') in the
field considered in the preceding sections, can be con-                         representation, in which x, are diagonal. (x: Iy,d x:')
structed so as to be invariant with respect to the whole                        can be regarded, in turn, as a function >/I,(X., r,) of
group of Lorentz transformations including reflections,                         X., r., where X., r, are defined by Eq. (6). Therewith
only if both forms (55) and (67) are put together into                          the relations (Sol) and (68) can be represented by
one relation for one spinor field with the components
                                                                                             1"(a>/l(X., r,)/aX'l+i<>/I(X., r,)=O,                (72)
twice as many as the four components for the usual
spinor field. This is equivalent to introduce one morc
                                                                                                 fl,r'y,(X., r,)+ Xy,(X., r ,)= 0,                (73)
independent set of Pauli matrices WI, W2,          and to  w,                   respectively, where y,(.J(" r ,) is a spinor with four com-
assume that all of the matrices 'I', fl, have each eight                        ponents 1';(X., r,l (i= 1,2,3, ol). The simultaneous
rows and columns characterized by eight combinations                            Eqs. (72), (73) for >/I(X., r,) have a particular solution
of eigenvalues of (T" P3, w,. Therewith the spinor must                         of the form
have eight components, tirst four components and the
remaining four corresponding respectively to the eigen-
values + 1 and - 1 of W3.                                                       where u(k., r,) is a spinor with four components satis-
   In order to eStablish the invariance of fundamental                          fying
 laws for the non-local spinor tield with respect to the
whole group of Lorentz transformations, we assume                                                1"k,lI+<u=O,         fl,r'u+ AIL =   ().         (75)
further that w, and        W3     change sign under improper                    It follows immediately from (75) that ,t must satisfy
Lorentz transformation, whereas                Wl   does not.   ,,,re   call
no\\' adopt the relation                                                                  (k,k'+K')U=O,         (r,r'-h')u=tJ,        k,r'I'=O (76)
                                                                        (68)    so that " can be written in the form
in place of Eq. (55). It is clear from the above arguments                                  U= u(k .. r ,)o(k,k'+ <')o(r ,r'- h')o(k,r').          (ii)
that the fundamental Eqs. (Sol) and (68) are invariant                          Each of four components of " can be expanded in the
with respect to the whole group of Lorentz transforma-                          same way as the scalar operator u in the preceding
tions. However, for the purpose of proving it more                              sections. The second quantization can be performed by
explicitly, we consider the transformation properties of                        assuming commutation relations of the type (37)
>/I with respect to the Lorentz transformation, whereby                         between field quantities, so that the non-local field
we assume that the matrices "f'" fJ,. have prescribed                           represents an assembly of fermions with the mass til,
forms as defined by Eqs. (65), (66) independent of the                          the radius X and the spin ~. Further analysis of the
coordinate system. In the usual theory, in which the                            non-local spinor field will be made in Part II of this
spin or field y, has four components, we have the linear                        paper. At any rate it is now clear that there exist non-
transforma tion                                                                 local scalar, vector, and spinor fields, each corresponding
                                                                        (6(»)   to the assembly of particles with the mass, radius, and
associated with each of the Lorentz transformations for
                                                                                the spin 0, \, and ~.
the coordinates:                                                                  J2   Sec, for example, \\'. Pauli, lla'ldbll{}, deT Physik 24, Part J,
                                                                        (iO)    8J (\933).
234                                                                                                           CHAPTER IV
   Now the question, with which we are met first, when                 Tbis work was done during the author's stay at The
we go over to the case of two or more non-local fields Institute for Advanced Study, Princeton. The author is
interacting with each other, is whether we can start grateful to Professor J. R. Oppenheimer for giving him
from Schrodinger equation for the total system (or any the opportunity of staying there and also for stimulating
substitute for it), thus retaining the most essential discussions. He is also indebted to Dr. A. Pais and
feature of quantum mechanics. We know that Schro- Professor G. Uhlenbeck for fruitful conversation.
dinger equation in its simplest form is not obviously
relativistic in that it is a differential equation with the                 ADDITIONAL REMARKS ON NON-LOCAL
                                                                                            SPINOR FIELD
time variable as independent variable, space coordinates
being regarded merely as parameters. It can be extended                 The problem of invariance of the relation (55) with
to a relativistic form as in Dirac's many-time formalism respect to improper Lorentz transformation can be
or, more satisfactorily, in Tomonaga-Schwinger's super- solved without introducing extra components to the
many-time formalism, as long as we are dealing with spinor field. Namely, we take advantage of the anti-
local fields satisfying the infinitesimal commutation symmetric tensor of the fourth rank with the com-
relations. However, if we introduce the non-local fields ponents <,.,., which are + 1 or -1 according as
or the non-localizability in the interaction between (K, A,}J., v) are even or odd permutations of (1, 2, 3, 4)
local fields, the clean-cut distinction between space-like and 0 otherwise. Further we take into account the
and time-like directions is impossible in general. This relations
is because the interaction term in the Lagrangian or
                                                                                                                             (78)
Hamiltonian for the system of non-local fields contains
the displacement operators in the time-like directions where (K, A, lA, v) are even permutations of (1,2,3,4).
as well as those in the space-like directions. Thus, even Then (55) can be written in the form
if there exists an equation of SchrOdinger type, it cannot
be solved, in general, by giving the initial condition at                        i L <".-1"1"1"[1", ¢'J+iX¢,=O,              (i9)
                                                                                    a:).I'~
a certain time in the past. Under the3e circumstances,
we must have recourse to more general formalism such which is obviously invariant with respect to the whole
as the S-matrix scheme, which was proposed by Heisen- group of Lorentz transformation. The invariance can be
berg." In other words, we had better start from the proved more explicitly by associa ting a linear trans-
 integral formalism rather than the differential for- formation
 malism. In locallield theory, the integral formalism such                                                                   (80)
 as that, which was developed by Feynman, can be
 deduced from the ordinary differential formalism. H. "              with each of the Lorentz transformation (70), where S
 In non-local Iield theory, however, it may well happen is a matrix with four rows and columns satisfying the
 that we are left only with some kind of in tegral for· relations
 malism. In fact it will be shown in Part II that the non-
                                                                                                                             (81)
 local fields above considered can be Iitted into the
 S-matrix scheme.                                                       It should be noticed, however, that the relation (79)
                                                                     is a unification of the relations (55) and (67) rather than
   "W. Heisenberg, Zeits. f. Physik 120, 513, 673 (1943); Zeits. f.  the simple reproduction of (55), because (79) must be
 ~aturforsch. 1, 608 (1946); C. M~ller, Kg!. Danske Vid. Sels. Math.
 Fys. Medd. 23, Nr. 1 (1945); 22, Nr. 19 (1946) •                    identified witb (67) in the coordinate system, which is
   .. R. P. Feynman, Phys. Rev. 76, 749, un (1949).                  connected with the original coordinate system by an
   "F. J. Dyson, Phys. Rev. 75,486, 1736 (1949). See also man)'      improper Lorent7. transformation with the determinant
 papers by E. C. G. Stueckelberg. which appeared mainly in Relv.
 Phys. Acta.                                                         -1.
COVARIANT PICTURE OF QUANTUM BOUND STATES                                                                                             235
Quantum Theory of Non-Local Fields. Part II. Irreducible Fields and their Interaction·
                                                          HmEJ[1 ¥uv.wAf
                                             Columbia   U1li~sily.   New York. New York
                                                      (Received August 7. 1950)
               General properties of non-local OpentoB are considered in connection with the problem of invariance
            with respect to the group of inhomogeneous Lorentz transformations. It is shown that irreducible fields
            can be classi6ed by the eigenvalues of four invariant quantities. Three of these quantities can be interpreted.
            respectively. as the mass. radius. and magnitude of the internal angular momentum of the particles ass0-
            ciated with the quantized non-local field in question. Further. space-time displacement operatoB arc
            introduced as a particular kind of non-local operator. As a tentative method of dealing with the interaction
            of non-local fields, an invariant matrix is defined by the space-time integral of a certain invariant operator,
            which is a sum of products of non-local field operatoB and displacement operatoB. It is shown that the
            matrix thus constructed satisfies the requirements that it be unitary and invariant and that the matrix
            elements arc different from zero only if the initial and final .tates had the same energy and momentum.
            However. the remaining conditions of correspondence and convergence cannot be ful6lled simultaneously.
            in general, by the S-matrix for the non-local fields. It i. yet to be investigated whether all of these require-
            ments are satisfied by an appropriate change in the definition of the S-matrix.
decomposing a quantized field into its irreducible parts.             where x/ (1'= 1,2,3,4) denote this time the space-time
Accordingly, if the concept of the field itself is so                 operators in the new coordinate system. Therewith,
extended as to include the non-local field, the definition            two sets of parameters, X and " are transformed into
of the elementary particle will be altered in its turn.                                  X,.'=a",x .., r,.'c::a"",                     (5)
In Part I, t we confined our attention to certain types
of non-local fields which satisfied a set of operator                 and U(X, ,) becomes
equations and were supposed to represent assemblies of
elementary particles with finite radii. Our problem is
now to decompose more general non-local fields into
                                                                      U(X'. r')=    f··· f u'(k', I') exp(ik:X")
irreducible parts. Again we start from an arbitrary                                               XII6(,.'-I.') (dk.')'(dl.')',        (6)
unquantized non-local scalar field U, which can be
represented by an arbitrary matrix (x'l U I x"), where                where u'(k', I')=u(k, f). k', I' are connected with k, I
x' and x" stand for x-' and x." (1'= 1, 2, 3, 4), respec-             just as X', r' are connected with X, ,. In order that
tively. The matrix (x'l Ulx") can be regarded as a                    Eq. (6) retain the same form as Eq. (3) for an arbitrary
function U(X, ,) of two sets of real variables.                       Lorentz transformation (4), either one of the following
                                                                      two requirements must be satisfied:
             X.=i(x.'+x."), ,.=x.'-x."                        (1)
                                                                         (i) u(k. f) is a function of k and I, which retains its
as in Part I. Then an arbitrary function U(X,') can                   form under an arbitrary Lorentz transformation;
be expanded in the form                                                  (ii) u(k, f) is not a mere function of k and I, but is an
                                                                      ensemble of quantities, which are distinguished by the
       U(X,') =   f··· f u(k,') exp(ik,.x·)(dk.)'              (2)
                                                                      parameters k and I and which are to be subject to
                                                                      second quantization.
           f _.. f
                                                                                              u(k', I') = u(k, f)                      (7)
U(X, ,)=             u(k, f) exp(ik,.x·)                              for an arbitrary transformation
                             XII6(r.-I.) (dk.)'(dl,)',         (3)                                                                     (8)
where u(k,') and u(k, f) are arbitrary functions of SO that u(k, l) must be the function of invariant quan-
parameters k, ' and k, I, respectively.                             tities such as k,k', 1,.1> and k,.l· alone. In many cases,
    Now, if we perform an arbitrary homogeneous however, we can confine our attention to the suhgroup
____                                                                of the homogeneous Lorentz group which does not
   • Publication assisted by the Ernest ~em~ton Adams Fund.         include the reversal of the time, so that u(k, l) may
    t On leave of absence from Kyoto Uruve.... ty. Kyoto. Japan.    depend also on kJI k I provided that k is a time-like
   I H. Yukawa, Phys. Rev. 77. 219 (1950). See also B. Kwal.                       ..       "                    •
J. de phys. et rod. 11. 213 (195Oi.                                 vector, and similarly for l •. Thus U(X,') can be
                                                                 1047
system. Each of these ope !'ators thus obtained is not               tation relations for u(k, I) and u*Ck, f) as given by
yet irreducible in general, because it is a mixture of               Eq. (37) of Part I, for example. Of course, it must
two types of fields belonf;1ng to the same resultant                 always be kept in mind that the time-reversal is
(half-integral) spin. For instance, the operator corre-              associated with the interchange of the annihilation
sponding to the resultant spin 1/2 may have an internal              operator u(k, /) and the creation operator u*(k, l).
orbital angular momentum of eithet zero or unity. In                    These arguments can be applied to non-local spinor
the usual local field theory, however, a spinor field with           fields without essential change. In this way we amve
the spin 1/2, for example, is already irreducible. Thus              at the following suggestion: according to the non-local
the difference between the non-local spinor field and                field theory it is possible that there are only two kinds
the local spinor fields with arbitrary half-integral spins           of elementary particles, Bose-Einstein particles and
is apparent without taking into account the interaction              Fermi-Dirac particles, which are described by a scalar
between the fields.'                                                 field and a spinor field, respectively. The customary
   So far we have considered the problem of invariance               discrimination of particles with spins 0, 1, 2, etc.,
of non-local operators with respect to homogeneous                   among Bose-Einstein particles, for instance, may well
Lorentz transformations. Now we go over to the more                  be reduced to the difference in the quantum number 1
general inhomogeneous Lorentz transformation of the                  for the internal motion of the same kind of particles.
 type
                      x.' =a.,(x,+b,)                 (14)              n.   S-MATRIX IN NON-LOCAL FIELD THEORY'
or                                                                       Now we must undertake the problem of interaction
                                                      (15)           between non-local fields. In the usual field theory we
with   b: = a.,b"   X and r are transformed thereby into             could always start from the SchrOdinger equation for
                                                                     the total system. The Hamiltonian in the Schrooinger
               X.'=a.,(X,+b,), r.'=a.,r,.                   (16)     equation is derived from the Lagrangian which, in turn,
Accordingly, we have                                                 is so chosen as to give the correct field equations for
                                                                     unquantized fields, when the classical variation principle
                                                             (17)    was applied to the system consisting of unquantized
and                                                                  fields. In the non-local field theory, however, it is
               u'(k', 1')= exp( -ik.b·)u(k, I),              (18)    difficult to follow the same procedure as in local field
in order that U be invariant with respect to the trans-              theories for two reasons. Firstly, even in the case of
formation (14). The implication of the relation (18)                 the free field, it is difficult to deduce all of the field
must be considered for the cases (i) and (ii) separately.            equations, (4), (5), and (12), for example, for the scalar
   In case (i), relation (18) is compatible with the                 non-local field from an invariant operator which is
assumption that u(k, I) is an invariant function of k                supposed to correspond to the Lagrangian in the usual
and I, only if uCk, I) is zero for all values of k. except           theory. Moreover, the procedure of variation itself is
k.=O Cp.= 1,2,3,4). This is equivalent to the following              ambiguous.' Secondly, it is rather dubious whether the
statement:                                                           differentiation of the SchrOdinger function with respect
                                                                      to time will play an important role in non-local field
  (i)' A non-local operator U whiCh satisfies require-                theory because other operators, in general, are related
ment Cil is invariant with respect to the whole group                to two time instants, which differ from each other by a
of inhomogeneous Lorentz transformations only if                     finite amount. Even the existence of the SchrOdinger
U(X, r) is an invariant function of r alone.                         function in the same sense as in the local field theory is
It will be shown in the next section that some of the                 not at all certain.
invariant operators satisfying the requirement (iJ' will                 Although it is not yet clear whether these difficulties
be of importance in constructing the S-matrix for the                could be overcome without renouncing the fundamental
interacting non-local fields.                                        principles of quantum mechanics, there seems to exist
   In case (ii), relation (18) reflects the situation that           a tentative solution which retains many of the char-
the creation or annihilation operator u(k, I) or u*(k, I)            acteristics of the present field theory. Namely, we can
is defined unambiguously except for an arbitl-ary phase              start from the so-called interaction representation in
factor. In spite of this ambiguity or complication, the               the usual theory, laying aside for the moment the
                                                                      question of whether the free field equations in non-local
operator u*(k, I)u(k, 1)/1 k,l, which is to be identified
with the occupation operator for particles in the                     field theory can be deduced from the Lagrangian
quantum state characterized by k. and I. apart from                   formalism or not. Furthermore, we can adopt the
the purely numerical factor, is defioed uniquely and is               integral formalism of the usual theory, which has been
invariant with respect to the whole group of inhomo-
                                                                       • A preliminary account of the subject was published by H.
geneous Lorentz transformations. So are the commu-                   Yukawa, Phys. Rev. 77, 849 (1950).
                                                                       • Variation principles in the non·local field theory were di!!CUssed
   • Detailed di5cussioll!l of non-local spinor field will be made   by C. Bloch, Kg!. Danske V.d. 50!. Math.-Fys. Medd. See also
e1aewhere.                                                           C. Gregory, Phys. Rev. 78, 67, 479 (1950).
238                                                                                                                 CHAPTER IV
proved to be equivalent to the dilIerential forma1ism                Equation (21) can be writtt'n in the form
and in which the S-matrix, instead of the SchrOdinger
wave function, came in the foreground. Then the                      (n'ISln")=(II'llll1")
S-matrix for local fields can be transformed in the
following manner so as to be easily extended to the                        +(i/,hc) f I(II', ~IL'III", %")(Ih')«Ih")<
case of non-local fields. We consider a system of local
fields, for which the interaction Hamiltonian density
H'(%, ',', I) is invariant and is equal to - L'(%, "I, I),                 +(i/Ac)'I· .. I           E
                                                                                                _",•• IV
                                                                                                           (1I',~IL'ln"',~")
where L' is the interaction part in the Lagrangian
density for the system. In,the usual one-time forma1ism,
the SchrOdinger equation' has the form                                       X (n"',   ~"I flnlv,   %IV)(nIV, %IV!L'III", ~')
                                                                                        X (d%')«th")«d~")«lhIV)<+ . . • . (24)
                             ."
         i/ul'li(II', 1)/iJI= E(II'IO'(t) 11I")'li(II", I),
             -
  +(i/h) f+"~(II'IL'(I)III")dt''li(II'" - co)
invariant, in spite of the fact that the operator • as          for any values of rand r'. In order to prove this,
defined by Eq. (23) is not invariant with respect to            we have only to multiply S as given by Eq. (31) by
Lorentz transformations. This is due to the fact that           S*=l-(i/MHL'I+(i/M)'{L'D+*L'1
the Hamiltonian density B'(r) at a point r is com-                           -(i/M)'{L'D+*L'D+*L'I+···. (37)
mutative with the density B'(r') at any other point
r', which is located in a space-like direction with             Then the condition of unitarity
respect to r. It is not so, in general, in non-local field      L     (n'IS*I,,",)(nIllISI,,',)
theory. An obvious way of guaranteeing the invariance           .'"
of the S-matrix in such a case is to replace the operator                  =   L      (,,'ISIIIIII)(nllllS*III")=(n'llln")   (38)
• in Eq. (26) by a suitable invariant non-local operator                       ,,",
D+ such that conditions (i) and (ii) are still fulti11ed.       comes out by the help of Eq. (32) and the relation
Thus the S-matrix for the system of non-local fields                                      {AEB}={A }{Bf,                     (39)
takes the form
                                                                which holds for any two non-local operators A and B.
S=H(i/MHL'f+(i/M)'{L'D~'f                                         The operators D+* and D+ which satisfy all of these
             +(i/M)'{L'D~'D~'f+""                       (31)    conditions are given by matrices
The actual form of the opet;ltor D+ can be determined            (n', rID+III", r')=(~'llln"), 1(11'11111"), or 0;
in the following manner. If we assume that the invariant        (II',rID+*ln",r')=O,l<,,'llln"), or (II'llllI"), (4O)
operator L' is a sum of products of non-local field
                                                                according as r - r' is futvre-Iike, space-like, or past-
operators, condition (ii) is satisfied for any displacement
                                                                like.
operator D+ whose matrix element (rID+lr')=D+{X,
                                                                   This modification of the definition of S-matrix gives
r) is an invariant function of r. alone. The proof is simple.
                                                                rise to the new question: does it reduce to the usual
Any non-local operator A can be represented by a
                                                                definition (21) in the limit of local fields? This question
matrix (rIAlr') or a function A(X,r) and Eq. (25)
                                                                is very intimately connected with another, and probably
can be written alternatively in the form
                                                                the most important, question: is the S-matrix for non-
                                                                                            .'
of a particle the energy and momentum of its center of          (OI{L'D~'f 10)=ILCOI{L'f 1,,')(11'1 {L'f 10)
mass. Thus the energy of internal motion is supposed
to be included already in the mass Ii«/c. '. In other                                                  +1(01 {L'DL'}lO)       (44)
words, « must be, in general, a function' of other              on account of relations (32) and (36), where the
constants such as X and l. The problem of determining           operator D is defined by
the form of such a function is still completely open.
  The condition (i) is also fulti11ed, if we further                                                                          (45)
imply the condition                                             with the matrix element
                                                         (35)
                                                                (11', rlDIII", r')=(II'llllI"),          0,   -(n'lll""),     (46)
on D+, where D+* is the Hermitian conjugate of D+               according as r - r' is future-like, space-like, or past-
and E is an invariant displacement operator with the            like.' The first term on the right-hand side of (44)
matrix element
                                                                 , This operator was iDtroduced by Koba independently. See Z.
              (n', riEl,,", r')=(n'llln")                (36)   Koba, Prog. Tbeor. Phys. 5,139 (1950).
240                                                                                                        CHAPTER IV
vanishes on account of the fact thilt (n' I{L' flO) is          directioll-.of time is not reversed. Namely, we can write
zero provided that KU < 2KY and the second term also
                                                                  U=U++U_,         V=V++V_,         V*=V+o+V_·,         (49)
vanishes for the following reason: first we expand U,
V, V·, and D in Fourier series and integrate each of            where U +, U _ are positive and negative frequency parts
the terms of (OI{L'DL'f 10) with respect to all of the          of U, while V +, V +0 and V _, V _0 are corresponding
space-time parameters. Actually we have eight sets of           parts of V and V*. If we take the new interaction
such parameters. Then we are left with the expression           operator
of the form
                                                                L'=g{V+'"'UV++ V _UV _*+ V +oUV _+ V _*UV +1 (SO)
                                                                instead of Eiq. (39), the self-energy terms for the U-type
f f ff(k.(I), k.(l), k.(I»O/(K,.K·)
                                                                particle as 'well as the V - yO_type particles are con-
                                                                vergent, although there still remains an undesirable
                        X (dk. (I»·(dk. (I»·(dk. (I)',   (47)   feature, as discussed by Yennie.'
                                                                   Now, in order to remove the discrepancy between the
where K.= L; k/') and k.(I) , k.(l), k.(·) are the wave         present formalism and the usual formalism in the limit
vectors of the three particles created in the intermediate      of local fields, we may imagine that D-operator above
state. The first of them is a particle of U-type and the        defined is a limit of the operator with the matrix
other two are particles of V - V* -type. f( .•. ) is a func-    element, which is a function of r. and is different from
tion of k.''', k.(') , k.(3), which .could be determined by     zero in a narrow region outside the light cone in r-space.
elementary calculations, but it is not necessary for our        Then the correspondence between the present formalism
purpose to write it explicitly. 01. denotes the derivative      and the usual formalism in the limit of local fields is
of the a-function with respect to the argument, which           restored up to the second order, but the essential
comes from the Fourier transform of the operator D,             difference between .. and D-operators remains in the
as discussed in detail by Yennie.' Thus, (oI{L'DL'fIO)          third- and higher order terms. Moreover, the diver-
must be zero, unless the condition                              gences reappear in the case of non-local fields. It is
                                                                very difficult to construct an S-matrix which is con-
                         K.K·=O                          (48)
                                                                vergent and which reduces to the usual S-matrix in the
is fulfilled. The condition (48) can be satisfied by            limit of local fields. It is not yet clear whether the
certain sets of k. (I), k/'), k. (I) only if both types of      S-matrix formalism itself is not adequate for dealing
particles have the rest mass zero.                              with the problem of interaction of non-local fields. It
   The above arguments can be applied to local fields as        might be possible that the S-matrix as defined by Eq.
well as to non-local fields. According to the usual theory      (24) is invariant, if the interaction operator L' has an
of local fields, the third term on the right-hand side of       appropriate form, even in the case of non-local fields.
Eq. (41) must be the divergent self-energy of the               However, it is more probable that the clean-cut sepa-
vacuum, whereas it is actually zero according to our            ration of the free fields from their interaction is justified
formalism, except for the very particular case of               only if we are dealing with the weak coupling between
particles both with the rest mass zero. The same argu-          local fields. If so, we m,ust go back in search of the
ment can be applied to the case of charged particles            Lagrangian formalism for the whole system of non-local
interacting with the electromagnetic field, and according       fields interacting with one another. In any case, the
to our formalism the self-energy of the vacuum is zero,         compatibility conditions for the field equations or the
at least up to the second order, if we assume that              integrability conditions for any substitute for the
there is no charged particle with the rest mass zero.           Schriidinger equation will be of fundamental importance.
Thus, the discrepancy between our formalism and the                In this connection it should be noticed that so far
usual theory is already clear; they give different              we have not been able to find any relation between the
answers to the same problem for local fields.                   mass and other constants. It is clear that a relation
   Next we consider the matrix element (1ISI1) of S,            which connects the mass of an elementary particle with
where only one particle of the same type in the same            other constants such as the radius, the internal angular
state exists in the initial and final states. The second-       momentum, and the constants of coupling with other
order term of (11 S11) corresponds to the divergent             particles will be ·of vital importance in any future theory
self-energy of the particle in local field theory. As           of elementary particles. Again this is closely related to
discussed by Yennie in detail,' if we start from a              the problem of finding the Lagrangian operator for the
system of two non-local scalar fields of U-type and             whole system or any substitute for it.
V - YO_type with the interaction operator L' as given              The author wishes to express his appreciation to
by Eq. (42), the self-energy term is again divergent.           Columbia University, where this work was done, for
However, the fields U and V-V· can be decomposed                the hospitality shown to him, and to the Rockefeller
further into positive frequency and negative frequency          Foundation for financial support. He is indebted also to
parts without destroying the invariance with respect to         Mr. Yennie for his useful criticism and elucidation of
the subgroup of Lorentz transformation, in which the            some of the important consequences of the formalism.
COVARIANT PICTURE OF QUANTUM BOUND STATES                                                      241
                                Hideki Yukawa*
                    Columbia University, New York, New York
                            (Received May 25, 1953)
As discussed in previous papers, 1 the nonlocal field was introduced in order to
describe relativistically a system which was elementary in the sense that it could no
longer be decomposed into more elementary constituents, but was so substantial,
nevertheless, as to be able to contain implicitly a great variety of particles with
different masses, spins, and other intrinsic properties. However, the conclusions
reached so far were very unsatisfactory in many respects. 2 Among other things, the
masses of the particles associated with the irreducible nonlocal fields remained
completely arbitrary and simple and plausible assumptions concerning the
interaction between fields did not result in the expected convergence of self-
energies. It seems to the author that these disappointing consequences are not
inherent in nonlocal field theory, in general, but are rather related to the particular
type of field to which the author restricted himself. Instead, if we start anew from
less restricted nonlocal fields, a more promising aspect of possible nonlocal theories
is revealed, as shown in the following.
Let us take a scaler (or pseudoscalar) nonlocal field,
J..L being the separation constant. Thus, the masses of the free particles associated
with the nonlocal field <ll are given as the eigenvalues of J..L'h in Eq. (4) for the
internal eigenfunction X . If one chooses the operator p(r) such that the eigenvalues
J..Ln ==m; are all positive and discrete, one can expand an arbitrary nonlocal field <j>
into a series of internal eigenfunctions, Xn (r) :
                                                                                                   (5)
Now, the field equations for a scalar nonlocal field (x'I<lllx") interacting with a
local spinor field 'V(x '), for instance, can be deduced from an appropriate
Lagrangian and are
             {-aa;    XIlX Il
                                + p(r)} <I>(X, r) = -gL'Va(X + Yzr )'Va(X - Y2r),
                                                            a
                                                                                                   (6)
We insert (5) in (6), multiply both sides by the complex conjugate x:(r), and
integrate over the four-dimensional space of r 1, r 2, r 3, r 0 = -ir 4. The result is
[ ax,. ?~,," - m.'j u. (x'~ =f<il. (x', x ", x "n: .iii"(x 1",.(x "1<1x'<Ix "', (8)
where
                     <lln(x', x", x"') == gx:(x' -x"')o(llz(x'           + x"') -   x").           (9)
Similarly, we obtain from (7) the equation
If we com~are these equations with the corresponding equations (19) of Moller and
Kristensen in the theory of nonlocal interaction between a local scalar (or
pseudoscalar) field and a local spinor field, we notice that the internal eigenfunction
Xn (r) plays the role of a convergence factor. There is, however, an essential
difference between their equations and ours. Namely, in our theory, we are obliged
to take into account simultaneously all the particles with different masses mn which
were derived from an eigenvalue problem. Furthermore, the form function for each
of these particles is uniquely determined by the same eigenvalue problem.
In the following letter, the above general considerations will be illustrated and
further details will be examined.
       "Now at Kyoto University, Kyoto, Japan, on leave of absence from Columbia University (July,
1953  1,H. Yukawa, Phys. Rev. 77, 219 (1950); 80,1047 (1950).
       2D. R. Yennie, Phys. Rev. 80, 1053 (1950); 1. Rayski, Acta. Phys. Polonica 10, 103 (1950); Proc.
COY ARIANT PICTURE OF QUANTUM BOUND STATES                                                              243
Phys. Soc. (London) A64, 957 (1951); M. Fierz, Helv. Phys. Acta 23, 412 (1950); Z. Tokuoka and Y.
Katayama, Progr. Theoret. Phys. 6, 132 (1951); C. Bloch, Kg!. Danske Videnskab. Selskab, MatAys.
Medd. 24, No.1 (1950); Progr. Theoret, Phys. 5, 606 (1950); O. Hara and H. Shimazu, Prog. Theoret.
Phys.5, 1055 (1950); 7, 255 (1952); 9, 137 (1953).
    3 p. Kristensen and C. Moller, Kg!. Danske Videnskab. Selkab, Mat.-fys. Medd. 27, No.7 (1952); C.
Bloch, Kg!. Danske Videnskab. Seiskab, MatAys. Medd. 27, No.8 (1952). Y. Katayama, Progr.
Theoret. Phys. 8,381 (1952).
244                                                                                        CHAPTER IV
                                 Hideki Yukawa*
                     Columbia University, New York, New York
                             (Received May 25, 1953)
As an illustration of the general considerations on nonlocal fields in the preceding
letter, let us assume that the operator F has a very simple form
                                   2                    2
                 F ==            ax + Tt..2[ ar Jill
                            ax a1111            a
                                                 ar               1           J2
                                                              + . . 4 r Ji r Ji '
                                                                  /I,
                                                                                              (1)
where A is a small constant with the dimension of length. One may call this the
four-dimensional oscillator model for the elementary particle, which was considered
first by Born} in connection with his idea of a self-reciprocity. However, our model
differs from his model in that we have introduced internal degrees of freedom of the
particles which are related to the nonlocalizability of the field itself. The internal
eigenfunctions in our case are
           Xnl n2 n3 no (r)        = H,., (rill..) H,.. (r21A) H,., (r31A) H,.. (roll..)
                                  xexp {- (rr + ri + r; + rJ) 12A2}                           (2)
and the corresponding eigenvalues for the mass become
                 1nII1   "21'13   "0   = c-fi.1A I nl + n2 + n3 -       no + 1   I,           (3)
where r 0 = ir 4 is a real variable and n l,n 2,n3,n 0 are quantum numbers which can
take only zero or positive integer values. H,. (x) denotes the Hermite polynomial of
x of degree n. All these eigenfunctions (2) decrease rapidly in any direction
whatsoever in the four-dimensional r space. Furthermore, the Fourier transform of
each of these eigenfunctions has exactly the same form as the original function due
to the self-reciprocity. Thus, the form function (9) in the preceding letter seems to
be sufficient to cut off high energy-momentum intermediate states in such a way that
each term corresponding to each Feynman diagram in the expansion of the nonlocal
S-matrix according to the Bloch-Kristensen-M011er formulation is convergent.
However, since we have to take into account all of infinitely many of different mass
states of the nonlocal system, the number of terms in the S matrix increases very
rapidly with the increasing power of the coupling constant, so that we can claim
nothing for the moment concerning the convergence or divergence of the S matrix as
a whole.
The totality of the internal eigenfunctions (2) constitutes a complete set of
orthogonal and quadratically integrable functions in the four-dimensional, space
and can be regarded as the eigenvectors for an infinite-dimensional unitary
representation of the Lorentz group. The eigenvalues (3) for the mass are all
infinitely degenerate. For instance, all those values of n's which satisfy
nl + n2 + n3 - no = 0 give the same mass, mo = fitA.. This is not a peculiar feature
of the oscillator model; it is common to all those models for which the operator F is
separable, because there can be no unitary representation of finite dimensions for the
Lorentz group. Presumably, such an undesired degeneracy could be removed either
by introducing interaction with other fields or by first introducing the coupling
between the external and internal degrees of freedom. The latter possibility can be
illustrated by the addition of the coupling term,
                                    nar~r. ~r.-.r
to the expression (1) for F, where      ~    is a dimensionless real constant. The free field
equation becomes
[k l .+ +
in the eight-dimensional space k". and '"., where x (k".. ,,,.) is the Fourier transfonn
of cjI (X 11' ,,,.) as defined by
                                    J
                    cjI (X".. ,,,.) = exp (ik".X".) X (k".. ,,,.) (dk".).4                (6)
One can solve Eq. (5) in the coordinate system in which only one component of the
wave vector is different from zero. 2 Thus, one obtains the mass spectrum
                                       ..J2l n l     +n2+ n 3- n O+ 11
                     InIIln2n3nO   =             [                 In1                    (7)
                                            A.       1 - 2~2(no + Y2J
If we take, for instance, ~ = 1Ifi, only no =0 is allowed and the mass spectrum
reduces to
                                                                                          (9)
and the degree of degeneracy of the mass eigenvalues is now finite. In particular,
246                                                                                              CHAPTER IV
the lowest mass, mo = 21A, is free from degeneracy and the corresponding solution
of (5) is given by
              A boundary condition at t= ± 00 (t being the "relative" time variable) is obtained for the four-dimensional
            wave function of a two-body system in a bound state. It is shown that this condition implies that the wave
            function c~n ~e continued analytically to complex values of the "relative time" variable j similarly the
            wave functIOn In momentum space can be continued analytically to complex values of the Urelative energy"
            variable po. In particular one is allowed to consider the wave function for purely imaginary values of t or
            respectively po, i.e., for real values of x4=ict and p4.=ipo. A wave equation satisfied by this functio~ is
            obtained by rotation of the integration path in the complex plane of the variable Po, and it is further shown
            that the formulation of the eigenvalue problem in terms of this equation presents several advantages in
            that many of the ordinary mathematical methods become available.
              In an especially simple case (Uladder approximation" equation for two spinless particles bound by a
            scalar field of zero rest mass) an integral representation method is presented which allows one to reduce
            the problem exactly (and for arbitrary values of the total energy of the bound state) to an eigenvalue
            problem of the Sturm-Liouville type. A complete set of solutions for this problem is obtained in the sub-
            sequent paper by Cutkosky.
the analytic continuation of the wave function to                The sum extends in principle over all states, but in
complex values of the relative time (or relative energy)         fact the states n giving a nonzero contribution will
variable. As far as we can tell these properties cannot          belong to a rather special class. Consider for example
be obtained from the B-S equation itself. Vice versa,            the case where a and b are an electron and proton,
they can be used (Sec. 3) to transform the equation,             respectively. If 'fl. and 'fI bwere noninteracting fields, it
by rotation of the integration path in the complex               is obvious that only one-electron states would have to
plane, to an equation in which x. = ixo (respectively            be considered in the sum (3). In the presence of inter-
p. = ipo) is real. While the concept of an imaginary             action, the states n may also contain photons, electron-
relative time variable does not help physical intuition,         positron pairs and proton-antiproton pairs. But at any
it has mathematically several advantages. A discussion           rate the fundamental integrals of the motion N.
of the eigenvalue problem in terms of the transformed             (number of electrons-number of positrons) and Nb
equation will be given (Sec. 4), and the existence of            (number of protons- number of antiprotons) must have
solutions will be shown to follow, under fairly general          the same values,
assumptions, from considerations similar to those
commonly employed in the nonrelativistic case. No                                     N.=l, Nb=O,                         (4)
claim of completeness or rigor is made for this "proof."         as the one-electron states. This may be rigorously shown
Finally in Sec. 5 we shall merely itemize various                from the commutation properties of N. and N b with
approximation methods that have been studied, but                the field operators, (N.+l)'fI.='fI.N., etc.
will be reserved for another publication.                           In a similar manner, one can show that the total
   The second line of attack (Sec. 6), which is the subject      angular momentum quantum number J for a state n,
of a more extensive investigation in the subsequent              when measured in a system of reference in which the
paper by Cutkosky,· is rather different in nature. It is         total momentum p is zero, must be equal to !.
an attempt to make much more specific statements                    Now all states known to us in nature, and satisfying
about the exact solutions of the equation, by restricting        condition (4), also satisfy the inequality,
the character of the equation to an especially simple
type. It has not been possible so far to extend this                                                                      (5)
approach to any case of real practical interest. But the         E. and p being the total energy and momentum in the
fact that in one case, which is not entirely artificial,         state n. Furthermore, the equality sign holds true only
one can get a complete picture of all the solution (as is        for one-e1ectron states.
shown more completely in the following paper') is not              The inequality (5) means that among all the states
perhaps devoid of general interest. In particular the            having the same values of the fundamental constants
presence of "abnormal" solutions, which do not possess           of the motion p, N., N b, etc., as a one-electron state,
a nonrelativistic limit, and the circumstances under             the latter is the state of lowest energy. We shall refer
which they occur may well give a qualitative indication          to (5), therefore, as the stability condition for an
as to properties that will occur also in the cases of real       electron.                                        .
physical interest.                                                  In a similar way, when the relative time t is negative,
           Z. THE STABILITY CONDITIONS                           the wave function X may be shown to depend on the sum
   The relativistic wave function x(x) for a system of                      Lo,(OI'fl,(Xb) In')(n'I'fI.(x.) la),          (3')
two particles, a and b, bound together in a state Ia)
is defined' as the matrix element, between a and the             in which the contributing states n' must satisfy the
"true" vacuum state 10), of the time ordered product             condition,
                                                                                   N.=O; Nb=l,                    (4')
of the Heisenberg field operators .y. and .y. describing
the two kinds of particles. If, for example, the relative        and hence the inequality,
time t= t.= t. is positive,
                                                                                        E"2-p'~mb,                        (5')
            x(x)=e iP . X (OI'fl.(x.)'fI,(Xb) la),         (1)
                                                                 which shall be called the stability condition for a proton.
where X=X.-Xb, X= (maX.+moXb)/(m.+mb), and                          Summing up, we have three inequalities (2), (5), and
p. X is the four-dimensional scalar product of X with            (5'), which will form the basis of the following discus-
the total momentum P of the system in state a. If for            sion. It should be pointed out that the above considera-
simplicity we assume tbat the compound system is at              tions can be extended to other systems. If a and b were
rest, then P= (0, iE), E being the total energy. For a           a neutron and proton, bound together in the ground
bound state,                                                     state a of the deuteron by a meson field, with the
              E~m.+mb- B<m.+mb.                    (2)           customary assumptions, one would then have, as
Now the matrix element in (1) can be written                     integrals of the motion, the number of nucleons minus
                                                                 antinucleons N and the total electric charge Q. The
             L.(OI'fl.(x.) In)(nl'flb(xb) la).             (3)   states n could be shown to have values N = 1, Q= 0
  • R. Cutkosky, following paper [Phys. Rev. 96, 1135 (1954)].   and the states n' the values N = 1, Q= 1. In a theory
COVARIANT PICTURE OF QUANTUM BOUND STATES                                                                                        249
1126 G. C. WICK
which neglects the ,B-decay interaction, one has the              where. is an infinitesimal positive constant. We must
right to regard both neutron and proton as essentially            assume, of course, that the wave function exists for
stable particles. If there were states n(n') not satisfying       real values of po [i.e., that the integral (9) converges].
conditions (5) (5') the neutron (proton) could decay              From the theory of Stieltjes transforms, we then infer
into those states by emission of photons, without                 that (9) defines an analytic function of po in the whole
viola ting any of the known conservation theorems.                complex plane, in the region
Thus it is extremely reasonable to postulate that these
conditions must again be satisfied.                                                  2.->arg(pO-WmiP);;: o.                      (10)
   Now going back to (1) and using (3) with the                   Similarly q,2 is defined in the region
conditions (2) and (5), we see that for 1>0, and assum-
ing p= (O,iE), x(x) is of the form                                                  -.-<arg(po-wmox)<'-,                         (11)
                                                                  where
        x(x)=   f f. .
                  dp
                           -roo
                          Wmln
                                  dwl(p,w) exp(ip·x-iwt),   (6)
                                                                           -Wm",=BIl.+(m.'+p)!-m.>Bp..>O.
                                                                  Thus ¢(p) =q,(p,po) =q"+q,, is defined in the complex
where                                                             po plane with two cuts from Wmin to + 00 and from - 00
                                                                  to W mox (Fig. 1). In this case analytic continuation from
                                                                  the lower to the upper half-plane is ensured through
with I'a=m./(m.+m.). Thus, when 1>0, x(x) is a                    the gap between the two cuts. (B>O is essential for
superposition of positive frequency terms only.                   the existence of the gap.) Notice also that the sense of
   Similarly, from (2) and (5') it follows that, when             rotation implied by (10) and (11) is the opposite of
1<0, x(x) contains negative frequencies only. Thus we             that in the I plane. From the real po axis one goes
find that x (x) has properties with which we are familiar         continuously into the upper half-plane if po>Wmin>O,
in the case of Feynman propagation kernels. There is,             into the lower half-plane if po<Wmu<O.
of course, an analogy between the definition of these
kernels and Eq. (1).                                                 3. TRANSFORMATION OF THE B-S EQUATION
   Let us now consider I as a complex variable. Equation            We shall now use the analytic properties of the
 (6) shows that X(x) can be continued analytically in             wave function to transform the B-S equation by a
the lower half-plane, in the region 0;;: argl> -.-.               rotation of the axis of integration in the complex po
Similarly starting from the negative real axis, x(x) can          (respectively xo) plane.
be con tinued in the upper half-plane, in the region                The equation' may be written
.- ~ argl> 0. There is, of course, no analytic continuation
from one half-plane to the other; the two regions touch                                                                          (12)
one another at one point only, 1=0.
    It should be pointed out that the statements just             where q, is the wave function in momentum space, i.e.,
made are not dependent on the assumption that the                 the Fourier transform of x (x) ; it is a function of the
sta te a is bound; they follow from well-known properties         relative momentum p defined by
of the Laplace transform from the mere fact tha t w is
finite. If, however, B>O and hence Wmin>O, we can
further assert that X(x)-->O when t tends to 00 in any            P, Il. and 1'. being the total momentum and the mass
direction in the lower or upper half-plane different from         ratios previously defined. F. and F. are one-particle
 the real axis. This suggests that the eigenvalue problem         propagators, which, if one neglects radiative corrections
may take a more familiar and a simpler form if the
 wave function and the wave equation are considered on
                                                                                                             ",.",+ijl
the imaginary 1 axis (i.e., for x.=il real).
    fn order to examine this possibility carefully, it is
desirable to go over to momentum space. We write
x(X)=X+X2, where x,=O for 1<0 and x,=O for 1>0.
Let us calculate the Fourier transform of x,.                                                o
                                                                                                                            '"
                      2n
                             +00
                           "'min
                                   l(p,w) (w- po-i.)-'dw,   (9)
                                                                     FIG.!. The complex plane 01 the variable p•. The wave function
                                                                  is analytic everywhere, excluding the cuts (heavy Jines) on real
                                                                  axis.
250                                                                                                                           CHAPTER IV
reduce to
F.=-y.p.-im., Fb=-YbPb-imb,                  (Dirac particles)                            Co}
                                                              (14)
F.= p.'+m.',          Fb= Pb'+mb'.           (Klein-Gordon)
Finally, I. b is the interaction operator, which has
                                                                                                            ,
different forms, depending on the kind of theory. The
                                                                                                                \
                                                                                                                    'r   P01 ••
1128 G. C. WICK
of real rotations in four dimensions. 8 This is important                 Eq. (20) becomes a symmetric integral equation,
in the first place, because the group determines the
polar variables, which may be used with advantage.
In the Lorentz case integrals over a surface p' = const,
                                                                                           v(x)=X    f" K(x,y)v(y)dy,                   (22)
or x'= const are usually divergent; there are no                                                      •
orthogonality theorems for spherical harmonics, no                        with the finite kernel,
completeness theorems, etc. Here instead we have the
whole familiar machinery at our disposal.                                   K(x,y) = 2(st)I/{s+t+<'+[ (s+t+<')'-4stN,                   (23)
   Other advantages appear in the configuration space                     and the finite interval a= f( 00). Fredholm's theory can
formulation of the equation, as we shall presently see.                   then be applied, to conclude that (22) has a discrete
                                                                          eigenvalue spectrum. The case where q, is proportional
  4. DISCUSSION OF THE EIGENVALUE PROBLEM
                                                                          to a four-dimensional spherical harmonic can be
   We shall now examine several cases and show that                       similarly handled.
the transformed equation presents us with an eigenvalue                      It may be pointed out that if K=O, Eq. (20) can be
problem, to which many of the ordinary methods and                        reduced to a second order differential equation either
conclusions can be applied.                                               by differentiating twice, or by a parametric representa-
   We shall begin, like Goldstein,' with the extreme                      tion of the solution. Both methods will be used later,
case E=O, where the equation acquires full four-                          and especially in the subsequent paper by Cutkosky,·
dimensional symmetry in relative momentum space.                          to obtain more precise information about this case.
Unlike Goldstein, however, and for reasons to appear                         Let us now consider briefly Goldstein's Eq. (10),
later, we shall choose in Eq. (14) the K.G. (Klein-                       which applies to the case of two Dirac particles. When
Gordon) form of the factors Fa and F b. Tha t is, we                      written in our notation, the equation is quite similar
assume that a and b have zero spin. The equation for                      to (19) except that it contains only one quadratic
E=O thus has the form                                                     factor in p on the left. Goldstein manages to reduce
which presents a striking analogy to the ordinary                     condition on A. This will, in general, determine a
three-dimensional SchrOdinger equation. With X4 real,                 discrete spectrum of eigenvalues.
(26) is, of course, an elliptic difierential equation. This,            We shall see later that for K=O the analysis can be
together with the boundary condition x(x)->o at                       carried much further. Let us now turn to the more
infinity, allows a discussion of the eigenvalue problem               interesting general case E;>!O. Let us write (in the
along familiar lines.                                                 c.m. system)
   A special difficulty, also encountered by Goldstein,
is presented by the boundary condition at the origin                                 p= (O,iE)= i(m.+mb)'I,                 (28)
R=O, about which we have unfortunately no definite                    where 'I is the four vector
indication from general field-theoretic considerations.
The difficulty arises because of the Fuchsian singularity                           '1= (0,.),   E=Ej(m.+mb).
(2Sa) j if the potential were regular everywhere, there               Notice that
would be little doubt that x(x) must be regular too.                                       ,f="<1.                   (29)
   One can see at once, however, that the singularity                 The factor on the left of Eq. (19) now becomes, re-
of the potential affects (24) and (26) in a very different            membering (13):
manner. Consider, for example, spherically symmetric
solutions. The radial equation corresponding to (26),                 (m.'+p.') (mb'+pb') = p4+ (m.'+mb') (1-,f)P'
or                                                                                +4m.mbCP'l)'+m.'mb'(1-,f)'
      [cl'jdW+(3jR)(djdR)-m'+AV(R)]X=0,                     (26a)                      +2i(m.-mb)(P'-m.mb)CP'I).            (30)
has two solutions near the origin, of the type x =]?a                 It is at first sight rather puzzling that the equation now
                                                                      contains an imaginary term whose presence depends
X (HcIR+· .. ) with
                                                                      on m. being;>!mb. In configuration space this means
                     a= -1± (1-4X)I.                         (27)     that the operator corresponding to (30) is self-adjoint
                                                                      only when m.=mb. One can show that this feature is
Thus, if X<i, it is possible to make a distinction                    connected with the time-reversal properties of the
between the "regular" (less singular) and the "irreg-                 equation.
ular" solution. If X>i, it seems highly unlikely that a                  We shall point out, when the occasion arises, the
plausible condition to determine the right solution can               difierences produced by the term in m.-mb. For the
be found. In the case K= 0, moreover, the equation can                moment, we shall consider only the case m.=mb
be solved explicitly,' the "regular" solution being                   (=m, say). The analog of Eq. (24) then is
JrIJ.(iR), where n=+(1-4X)I. This solution, how-
ever, never satisfies the condition at infinity. We thus              ([ -D+m'(1-,f)]'-4mVo'jox.')x(x)
reach the conclusion that no value X<i is an eigenvalue.                                          =XV(R)x(x).               (31)
In our opinion, for 1I> t the eigenvalue problem becomes
                                                                      Since complete separation of variables is impossible, a
ill-defined. We shall not try to discuss further herell
                                                                      solution must now be a superposition x=E,J.(R)Y.
whether the limiting case X= i can actually be regarded
                                                                      of four-dimensional spherical harmonics Y. of difierent
as an eigenvalue.'
                                                                      orders. The radial functions !. satisfy a system of
    In Eq. (24), on the other hand, the singularity (2Sa)
                                                                      coupled fourth-order difierential equations, and it is
does not affect the indicial equation. The radial equation
                                                                      no longer possible to discuss the eigenvalue problem in
for a spherically symmetric solution, for example, has
                                                                      terms of a single radial function. This is a considerable
four independent solutions near origin, say Xl, X" X" X.,
                                                                      complication, but one may notice, nevertheless, that
behaving respectively like W, R", InR, and Jr'. If
                                                                      the term in (31) which produces the coupling is of
there were no potential, we would clearly say that the
                                                                      second order only, so that the indicial equation for
acceptable solution is a linear combination CIXI+C2X2
                                                                      each radial function !. is the same as in Eq. (24). If
of the two "regular" solutions. We shall make the same
                                                                      one writes !.(R)=R"(I+cIR+ ..• ) the possible
assumption when there is a potential'" Likewise we
                                                                      values for a are ±n, ±(n+2)j we may assume that
can define, for large R values, four solutions behaving
                                                                      only the positive values are allowed in a "regular"
respectively like Jrl exp(±p.R) and R .... exp(±p"R).
                                                                      solution, just as in Eq. (24). Thus there is no qualitative
The solution CIXl+C2X2 will be a linear combination of
                                                                      difierence between the two equations, with regard to
these four. In order to satisfy the condition x->O at
                                                                      the behavior of solutions near R=O.
infinity, two coefficients must be zero; that is, we have
                                                                         The asymptotic behavior of x(x) at infinity, on the
two conditions. One of these may be satisfied by a
                                                                      other hand, is more interesting. It will be shown below
suitable choice of c';C2 j the remaining one gives a
                                                                      that when x tends to infinity, X behaves asymptotically
   11 It may be remarked that in reference 6 Goldstein's eigenvalue   like exp[ - Rq>(8,)], i.e., it tends to zero exponentially
is also obtained from Eq. (19) in tbe limit m./m.--+O (and «=0).      but with a coefficient depending on the direction,
   II One can argue that x~lnR is not reaUy a solution of (24)
since it gives an additional term ~a.(x). x~l<' gives a term
                                                                      more specifically on the angle 8. with the "4" axis.
[]a.(x).                                                              For our present purpose, however, it is only interest-
COVARIANT PICTURE OF QUANTUM BOUND STATES                                                                                  253
1130 G. C. WICK
ing to notice that <p(O,) has a posItIve lower limit                 constructed in the Appendix, and it may be seen from
<p?;, 1-. so that, in a certain sense, there is again                Eqs. (A7) and (AS) there that G(x) has a very weak
no fundamental difference in behavior between the                    singularity at the origin (it is in fact finite at x=O) and
solutions of (31) and those of (24), and we may expect               tends to zero at infinity like
that in both cases the boundary conditions at R= 0
                                                                                                                           (36)
and R = 00 will determine a discrete A spectrum.
   The elementary considerations developed previously                where g is a factor which varies slowly compared to the
seemed of interest, because of the analogy with con-                 exponential and
siderations often made with regard to the ordinary
Schrodinger equation. In this sense we may say that                             <p(O,)=m(I-.cos8,) /cos8,/>.
 (31) presents an analogy to the Schrodinger equation                               =mO-.')!sinO, /cosO,/ <..              (37)
for a particle in an asymmetric field, where again the               If V(R)-->O sufficiently rapidly when R-->oo, the asymp-
reduction of the eigenvalue problem to a simple one-                 totic behavior of x (x) as given by the integral in Eq.
dimensional Sturm-Liouville problem is not feasible.                 (35) will reflect that of G(x), from which the conclusions
   In either case, a rigorous discussion of the eigenvalue           previously mentioned may be obtained. Incidentally it
problem can only be achieved by less elementary means,               may be noticed that in the nonrelativistic limit. <'" 1,
such as the reduction of the problem to an integral                  the lower form in Eq. (37) covers almost the whole
equation. We do not wish to carry out such a study here,             solid angle, and furthermore <P'" (mB)! sinO" R>p(O,)
but we may point out along what lines it could be                    = (mB)lr, where r=x,2+x22+ x}. We thus find the
carried out.                                                         typical exponential of the three-dimensional Schr6dinger
   We already have, of course, in Eq. (19) and its                   function. It is indeed rather remarkable that in this
generalization for E,eO, an integral formulation of the              region, i.e., with the exception of a narrow cone around
problem. In the case ma= mb corresponding to Eq. (31),               the x, axis, the asymptotic form of x(x) is not time-
 the equation can be reduced to the real symmetric form              dependent.
                   f
                                                                     Goldstein' already met this situation for the special
                       H'(p,k)[dp][dk] < 00,                  (34)   case E= O. It is, of course, also possible to formulate
                                                                     the problem in a form similar to (3S), namely,
which together with other similar inequalities, which the
mathematically inclined reader can readily discover,                          X(X)=Aj GD(x-X') V (R')x (X') [dx'],          (38)
may be used to show that (32) is "nonsingular" and                   where
thus possesses a discrete 'A spectrum. Furthermore all
eigenvalues are real. Finally, one can see that the                  GD(x) = ['Ya(a/iJx)-ma(l +M)]
kernel is positive-definite!' so that A> O.
  An alternative integral formulation can be obtained
                                                                                        x ['Yb(iJjiJx)+mb(Hn)]G(x).         (39)
as usual in configuration space. In fact, Eq. (31)                   In this case the singular character of the equation comes
together with the regularity condition at the origin                 about because Gv(x) has a much stronger singularity
and the boundary condition x(x)-->O at infinity, can                 than G(x), near x=O. When this is combined with the
be replaced by an integral equation,                                 IjR2 singularityof VCR) [Eqs. (25) and (25a)], Eq. (38)
coordinates, such questions can be attacked by ordinary        One then finds easily that
mathematical methods.
                                                                                   10<1>= (1!2m') (p'+m')-t,                (43)
            s.   APPROXIMATION METHODS                         showing that Eq. (40) is satisfied.
   It is also possible to show that our transformed               More generally, one can see that 10 applied to
equation has several advantages if one wants to employ         (p'+2p·q+M')-', where M' and the vector q are
approximate methods of solution. We have in mind,              constants, gives (p'+ 2p· q+ M')-I, apart from a
in particular; (a) a perturbation expansion in the             proportionality factor. This peculiar self-reproducing
neighborhood of E=O (see also reference 4), (b)                property of a quadratic form in p, under the operation
variational principles, (c) nonrelativistic approxima-         I, is characteristic of the case K=O.
tions, without special restrictions as to the form of V (R).      Consider now the equation for E","O. For simplicity
These questions will be discussed in a paper which the         let m= 1 from now on. The equation is
author hopes to present shortly in another periodical."           [p'+2ip·~+ 1-~'][p'- 2ip·~+ 1-~']q,=I 0<1>. (44)
            6. EXACT SOLUTIONS FOR ,,=0                        Clearly q, cannot be a function of p' alone; it must be
                                                               at least a function of p' and p.~ (for an S state). The
   A comparison of Eqs. (25) and (2Sa) suggests                above considerations suggest that we may be able to
that the problem of solving the B-S equation exactly           generalize solution (41) by writing q, as a superposition
may be far more elementary in the latter (K=O) case.           of termsl ' of the type (p'+ 2p· q+ M')-3 where q is
This is borne out by Goldstein's solution' for Eq. (26),       parallel to 71, say, q=iZ71. That is
                                                                              f dzdM'g(z,M')[p'+2izp·~+M'J3.
and we shall see in a moment that also Eq. (24) has
quite simple solutions if K=O and ma=m •. And, of
course, one will remember that the ordinary nonrelat-
                                                                    q,(p) =                                                  (45)
ivistic Schrodinger problem is far more elementary
with a Coulomb than with a Yukawa potential.                   One then sees immediately that
   At first, however, one would regard this analogy as
encouraging only for the special case E= 0, when the
B-S equation is separable. We were, therefore, quite
                                                                     Io<I>=!   f    dzdM'gl(z,M')[p'+2izp·71+M']-l,
surprised when we first realized that for K=O even the                                                                       (46)
                                                                               g, (z,M') = g(z,M')j (M'+z''1').
nonseparable Eq. (31) can be reduced to a one-dimen-
tional integral equation, or alternatively to a one-           Inserting on the right of (44) and dividing by the two
dimensional eigenvalue problem of the Sturm-Liouville          quadratic factors on the left, one then tries to reduce
type. We shall explain the basic idea for the simplest         the result again to the form (45) by reassembling the
type of solution and for ma = mb only. The extension to        three quadratic denominators into a cube [in a similar
other cases was carried out by eu tkosky and is described      way as in Eqs. (42) and (A3) in the Appendix]. One
in the accompanying paper.                                     sees at once that if M'= 1-~' the "mass term" repro-
   Choosing ma=m. (=m, say), let us first examine the          duces itself. Thus we set
separable case, Eq. (24). In momentum space, the
equation has the form                                                              g(z,M')=g(z)o(M'-H~').                    (47)
                                                               Carrying out the transformations indicated above and
                  (p'+m')'</>(p)=XI r!I>(p),            (40)
                                                               writing
which is very similar to the nonrelativistic hydrogen                                                           (48)
equation in momentum space. The latter, of course, is          we find
a three-dimensional equation and does not have the
square power on the left, but it will appear that the
analogy is closest when the two changes are made
                                                               q,(p)=!X   J Q-'(z)g(z)dz J+1 dy {
                                                                                              -I       0
simultaneously.                                                                         Xxdx[P'+2il"p·~+1-~']-·,             (49)
  In particular, the ground-state wave function of
hydrogen: </>(p) = (p'+po')-', is duplicated here by the                                l"=xy+(l-x)z.
solution                                                         14 An expression of this type has a certain resemblance to the
                    </>(p)= (p'+m')-3               (41)       parametric representations for SF' and tip' developed by M.
                                                               Gell-Mann and F. E. Low, Phys. Rev. 95, 1300 (1954). G. Kiillen
                                                               [Helv. Phys. Acta 25, 417 (1952)] has previously used similar
corresponding to the eigenvalue X= 2m'. That (41)              representations for other quantities that are a little less closely
satisfies Eq. (40) can be verified most easily if one          related to the B-S wave function, Eq. (I). In the case of these
first writes, a la Feynman;                                    quantities, and of the functions SF' Ap' J it is possible as the
                                l'
                                                               above-mentioned authors have shown, to derive the general
                                                               form of the parametric representation from the definition of the
(k'-2k·p+p')-I(k'+m')--"=             3(1-x)'dx                quantities, and from considerations of relativistic invariance.
                                  o                            The author has not been able to do the same for Eq. (1). Ntver-
                                                               theless the analogy with SF' and III was used to "guess" the
                  X[(k-xp)'+ (l-x) (m'+xp')]-'.         (42)   form of Eq. (45).
COY ARIANT PICTURE OF QUANTUM BOUND STATES                                                                               255
1132 G. C. WICK
Eliminating y in favor of I, and carrying out the                   The lowest eigenfunction simply develops a kink at
integrations over x and Z first, (49) acquires indeed the           Z= 0, while the behavior of the higher states is more
general form required by (45) and (47). Writing that the            complicated; if one inserts the approximation (55)
two expressions are identical gives an integral equation            into (52), one finds
for g(z).
   To this end notice that if z in (45) is allowed to vary                          g(t)=!... (I_~2)-lg(0)A(I-ltl)'      (56)
                       +
between -1 and 1, I will also vary between the same                 which requires
limits. Writing dl=xdy and noting that for given
z and \,                                                                                   A= (2/...)(1_~2)1.            (57)
   In the second interval we write Q(z) = z', again small values of u are of the form
neglecting terms of order (J _'1/')1 at most, and write •
                                                                                   a=au, .B=bu,                    (68)
     s=z', d'g/ds'+!s- I dg/ds+1>-g/(I-s)s'=0, (63) a and b being constants, whose precise value we shall
which again is of Riemann's type. The solution satisfy- not determine.
ing g=O at z= 1 is                                              Obviously (64/1) and (67') can be joined smoothly if
                                                              a-ti-!u In(I-'I')=n1r, where n is an integer. If 1-1/'
       g= (l-z')zi+ P.F,(ll+!p, t+!p; 2; l-z'). (64) is so small that -In(I-'I')>>I, the above equation
                                                              will have small roots u so that by using (68),
We will first show that if >-<1, the "internal" and
"external" solutions (62) and (64) cannot join smoothly                U= (;I.-i)!=n7r/[a-b-t In(I-1/')].          (69)
at Z=Zo, i.e., x=xo=1/zo(I-1/')-I. In fact, since xo»I,
we may evaluate (62) by means of the asymptotic               To  an even cruder  approximation,  one has
formula for the hypergeometric function. One finds,             q~-2...n/ln(I-1/'); >-= 1+[21rn/ln(I-1/')J'. (70)
omitting a proportionality factor,
                                                              Equation (70), for n=O, 1, 2,.·· gives an infinity of
        gint~xP+I(H"')+A (P)x-P+!(H"')'                 (62') eigenvalues all tending to >-=1 when 1/'->1. It should
                                                              be pointed out that these correspond to odd eigenfunc-
The dots indicate expansions in powers of X-I, and since tions. In a similar way one can show, however, that the
p <! it is consistent to keep the first term of the
                                                              same formula, with 2n replaced by 2n+ 1, gives the
second expansion, while neglecting the higher terms of
                                                              eigenvalues for the even eigenfunctions.
the first expansion. Furthermore,                                About the possible significance of these "abnormal"
          A (p) = 2'p tan(!?r-!1rp)r(2p)/r( -2p)         (65) solutions we shall not try to speculate here. Since they
                                                              occur only for finite values of A (>- ~ 1), it would be
is a negative quantity which varies from 0 to -1 as unwise to assume that they are a property of the com-
>- varies from 0 to 1.                                        plete B-S Equation. Certainly the ladder approxima-
   Similarly, (64) may be evaluated for small values of tion cannot be trusted to that extent. If the theory is
z by means of the known transformation of pea, b, c, used only for small values of the coupling constant,
 l-s) to hypergeometric functions of the variable s. One the abnormal solutions do not exist, in the case we have
finds                                                         studied, and no contradiction with known facts can be
          g.xt~zp+I(H· .. )+ B(p)zP+I(H' .. ),          (64') established. Nevertheless it would seem that these
                                                              solutions deserve further study.
where the dots now indicate expansions in powers of z',
and                                                                            ACKNOWLEDGMENTS
                                                                 The present work was begun while the author was a
                                                              guest of the Institute for Advanced Study, Princeton,
is a quantity which on the whole interval 0<>- < 1 New Jersey. The author is happy to acknowledge his
 (O<p<!) stays quite close to -1 (and is in fact <-1). indebtedness to the Director of the Institute, Professor
   Rewriting (62') in terms of the variable z and J. R. Oppenheimer, for the stimulation and encourage-
omitting again a proportionality factor, we find              ment he derived from a year's stay at the Institute.
                                                              Various members of the Institute, in particular,
               gint~zP+I+A (P)(1-1/')pz-P+I,             (67)
                                                              Professor F. Dyson, Professor G. Kiillen, Professor
which is of the same form as (64'), but with a coefficient A. Pais, Professor W. Pauli, and Professor R. Jost gave
for the second term which is smaller than B (P) in kind encouragement and invaluable criticism. Special
absolute value, for all values of p in the stated interval. thanks are due to Dr. Murray Gell-Mann for suggesting
Hence (64') and (67) can never join smoothly. In that the analogy discussed in reference 14 might be
addition it is easy to verify that the slope g'/ g is larger of help.
for (67) than for (64'), as one expects if >- is too low
                                                                                     APPENDIX
to be an eigenvalue.
   Let us now turn to the case >->1. One can see that            We shall construct here the Green's function G(x),
essentially the same formulae will hold, except that p which is a solution of (p.'+ma')(pb'+mb')G(x)=o(x),
will be a pure imaginary, say p=iu, u= (A-i)l. One pa and Pb being defined by Eqs. (13) and (28), with
sees, then, that (64') and (67) take the respective forms p= -i Grad. We shall calculate G for the general case
                                                              ma~mb, since this involves no additional difficulty.
                    g•.xt~ZI sin(u Inz+.B)             (64/1)
                                                              Using Fourier transforms, one sees at once that
and
            gint~zl sin(u lnz-!u In(I-'1')+a),          (67')
                                                              G(x) = (2'-)-'j [(Pa'+m.') (p.z+mb')J-1eipx[dPJ. (AI)
where a and .B are phases depending on u, which for
COVARIANT PICfURE OF QUANTUM BOUND STATES                                                                                                 257
1134 G. C. WICK
In the following we use !(m.+mb) as the unit of mass, XHO'(iz). The asymptotic behavior of (A7) when
setting                                               • R->oo in a specified direction (i.e., keeping xJ R
               m.=1+.<l, mb=1-.<l.               (A2) constant) is found noting that Ko(z)~(.../2z)I.-'. The
                                                        exponential part of (A7) is then
Furthermore we transform, d fa Feynman,
    [(p.'+m.')(Pb'+mb')J-1=!                    J    +l
                                                          [p,y,.<lJ-'dy,    (A3)
                                                                                               G(x)~· ..     J dy exp[ - Rf(y)J,
                                                    -1                             where
where
                                                                                      fey) = [(1- '1/') (1-y')+ (y+.<l)'JI- E(y+.<l)X.R-' .
[p,y,.<lJ= P'+ 2i(y+.<l) (p.~)
                                       +(1-~')(1+2y.<l+.<l').               (A4)   It is easy to see that f(Y»O in the whole interval
                                                                                   -l="y="+1. Hence G(x) satisfies the boundary
Furthermore, applying to Q= [p,y,.<l J the formula                                 condition G->O as R->oo in any direction. H Ym is the
                                                                                   point in the interval where fey) is a minimum, then the
                         (!'=      f'" e-aQada                                     strongest factor in the asymptotic dependence of G(x) is
                                        o                                                           G(x)~exp[ -Rf(ym)].                  (AS)
and inserting into (Al), the integration over p may be                             Notice that Ym depends on the direction. Consider, for
performed, with the result                                                         example, the simplest case .<l=O. Then if Ix.1 <ER, Ym
                    J f
                                                                                   is defined by the minimum condition
                        +1             '"
G(x) = (3211"')-1            dy             a-1da                                                  YmER=x,(l-,'+y'E')I;                  (A9)
                    -1             0
                                                                                   that is, writing x,/R=cosO" ymE=(1-o')lcotO,. If
               Xexp[ -aU-iR'a-1+(y+.<l)(x~)J,                               (AS)   IcosO,l > E the root (A9) is not inside the interval, so
with                                                                               the minimum of fey) occurs at y=±l, according as
           u= (1+2y.<l+.<l') (l-'1')+1]'(Y+.<l)'.                           (A6)   cosO,~O; summarizing, one has
Owing to (29), U is positive for Iyl ="1; hence the                                           IcosO.I>,      f(Ym)=l-ElcosO.1
                                                                                                                                       (AIO)
integral over a in (AS) is always meaningful.                                                 IcosO. I < E   f(ym) = (1- E')I sinO,.
   We then find that
                                                                                   Notice that in the latter case,
                                                           Marilyn E. Noz
                       Department of Physic~ Indiana University of Pennsylvanill, Indiana, Pennsylvanill 15701
                               (Received 22 March 1973; revised manuscript received 20 July 1973)
               An attempt is made to give a physical interpretation to the phenomenological wave function o~ Yukawa,
             which gives a correct nucleon form factor in the symmetric quark model. This wave function is first
             compared with the Bethe-Salpeter wave function. It is shown that they have similar Lorentz-contraction
             properties in the high-momentum limit. A hyperplane harmonic oscillator is then introduced. It is shown
             that the Yukawa wave function, which is defined over the entire four-dimensional Euclidean space, can be
             interpreted in terms of the three-dimensional hyperplane oscillators. It is shown further that this wave
             function satisfies a Lorentz-invariant differential equation from which excited harmonic-oscillator states can
             be constructed, and from which a gauge-invariant electromagnetic interaction can be generated.
8 3521
tion. IS
  As we increase i pi , this property holds for
Eq. (2) until the kinetic energy becomes larger
than the binding energy." For I pI larger than the
                                                                     '.t: -   z
binding energy, the Bethe-Salpeter wave function
is no longer normalizable in the above-mentioned
four-dimensional Euclidean space. The harmonic-
oscillator wave function of Eq. (1) does not suffer
from this effect and remains normalizable for
large values of I pI . This is expected because
particles bound by an oscillator potential have in-
finite binding energy.
  Let us rewrite the oscillator wave function as-
suming that p is in the z direction. We use E for
Po and p for P•. Then
    'iI(x,p) ~exp [ - !w(x' + l)l
                  xexp{(- w/4m')[ (E - pY'(t +z f
                                                                 FIG. 1. Lorentz-contracted wave func tions with two
                                       +(E+p)'(t -d]).   (3)   equal and opposite momenta. The form-factor integral
                                                               of Fujimura etal. receives contributions primarily from
For large p,                                                   the small overlapping region.
      w(E-PY' _~('!!)'
        4m'     16 p ,                                         shrinkage is responsible for the nonexponential
                                                         (4)
                                                               decrease of the form factor.
      w(E+p)' _ w(P..)'                                          In Eq. (6), the integral is performed over Eu-
        4m'        m                                           clidean space-time. We know clearly the physical
Thus                                                           meaning of the probability distribution over the
                                                               three-dimensional space, but we do not know what
      "'(x,p)-exp[ -~w(x '+y')l                                physics, if any, the time like probability distribu-
                   x exp[ - t.w(m/pY'(t +zl'J                  tion corresponds to. We shall discuss this problem
                                                               in Sec . III.
                   xexp[-w(pl m)'(t - z )' J.            (5)
The last factor becomes (f; Iw )(m/P)6(i- z) for                                Ill. HYPERPLANE FORMALISM OF
                                                                                     HARMONIC OSCILLATOR
large p, and the dependence on the variable (I +z)
becomes insensitive by the factor (mlp Y'. This                  Here we study Yukawa's phenomenological wave
contraction behavior is strikingly similar to that             function from the point of view of the nonrelativis-
of the Bethe-Salpeter equation. 14 The Bethe-                  tic-harmonic-oscillator wave function, generalized
Salpeter wave function is a model derivable from               to covariant hyperplanes .
field theory. The oscillator function is a phenom-                Let us start with the non relativistic harmonic
enological wave function giving correct form fac-              oscillator. The Hamiltonian is separable and the
tors. It is interesting to note that these two wave            wave function is Gaussian multiplied by the ap-
functions have the same Lorentz contraction prop-              propriate polynomials corresponding to excited
erties in the large-p limit.                                   energy levels. Because the ground-state wave
  We now restrict ourselves to the Yukawa wave                 function depends only on (x)' in the exponent, we
function . Let us analyze the form factor calcula-                                              x
                                                               can Lorentz- generalize to the three-vector on
tion of Fujimura et al. lo in the Breit system. We             the hyperplane which is perpendicular to the total
can sketch the initial and final "Lorentz-con-                 four-momentum of the system. We follow the
tracted" wave functions as in Fig. 1. The form-                standard method of constructing this three-vector
factor integral                                                and
where q is the momentum transfer, receives con-                When the momentum          p is   zero, l, becomes   x.
tributions only from the small overlapping region              For nonzero p,
indicated in Fig. 1. This region shrinks as the
momentum transfer increases, and this coherent
COY ARIANT PICTURE OF QUANTUM BOUND STATES                                                                        261
   There are two important differences between                Here again the integration measure d'x is hyper-
the above wave function and that of Eq. (1). First,           plane-independent and is good for both the Il, and
the coefficients of (p. xlm)2 are different. In Eq.           the ~,plane. The above expression becomes
(1), it is 2, while it is 1 in Eq. (11). Next, Eq. (1)        Eq. (12) when {3, and fl, are equal.
is integrated over the entire four-space while                  The next and most crucial question is whether
Eq. (11) is integrated only over the three-dimen-             the above inner product produces experimentally
sional hyperplane. The purpose of this section is             measurable effects. The answer is contained in
to point out that we can indeed give a hyperplane             the fact that because of the additional exponential
interpretation to the Yukawa wave function of Eq.             factor, the form factor calculation with this inner
(1).                                                          product becomes exactly the phenomenological
   The wave function given in Eq. (11), which de-             form of Fujimura el al. which we discussed in
pends explicitly on {3, is the ground-state wave              Sec. II. The Single-oscillator ground-state form
function. We can excite the harmonic oscillator               factor becomes in the Breit system
just as in the nonrelativistic case. If we multiply
cp or its excited form by exp[ -~w(l- jl')-'(t -{Jz)'],           F(q2)~ fd   4 xexp[ii'j.x]    exp[-w(x'+y')]
it does not change the hyperplane oscillator be-
cause the variable -(I-iJ')-'f'(t-pz) is perpen-                          xexp[   _-;,(m   2   +2q2)(t' +z')J.   (14)
dicular to the three hyperplane variables given in
Eq. (9). If we perform the integration over the               For large q', the time integral is like a I)-function
variable -(1- iJ,)-u'(t - pz) after this multiplica-          integral, and hence this form becomes that of
tion, this certainly leaves the hyperplane oscil-             Licht and Pagnamenta 12 who proposed the instant
lator intact. Therefore we can write the inner                (t ~O) probability integral.
product of two wave functions belonging to the                 We have thus generalized the time-independent
same hyperplane as                                            harmonic oscillator to covariant hyperplanes, and
understand the covariant harmonic oscillator.23                the Y variables which contain the P dependence.
In this paper, we used the hype rplane coordinates             Thus we have to use Eq. (17) to construct excited
to avoid time like excitations. The advantages                 states. Because of the Lorentz invariance of the
are similar to those in the Coulomb gauge case.                harmonic-oscillator operator, the excited-state
By eliminating completely the burden of handling               wave functions also satisfy the differential equa-
those unphysical excitations, we have been able to             tion of Eq. (18).
separate clearly what can be done and what can-                  We now write the excited-state solution as
not be done in the framework of non relativistic
quantum mechanics. We emphasize here that a                        If! ,Iy) =H'I IYI)H" (y,)H, 3(Y3)H,o(Yo)
relativistic measurement theory has yet to be con-                        xexp[ -~w(y' +Yo')] ,                   (19)
structed."
                                                               where
        IV. COVARIANT DIFFERENTIAL EQUATION                                                                       (20)
                 AND EXCITED STATES
                                                                 The above solution is possible because the start-
  In the preceding sections, we studied a possible
                                                               ing differential equation of Eq. (17) is separable
physical interpretation of the Gaussian factor
                                                               and remains separable as we change the value of
which corresponds to a ground-state harmonic                   the total four-momentump. The quantum numbers
oscillator. In order to construct excited states,              n, are separation constants. Our Lorentz trans-
we use the Lorentz-invariant differential equation
                                                               formation therefore preserves this separability.
which is needed in generating a gauge-invariant
                                                               Because of the minus sign in front of no, the eigen-
electromagnetic interaction of the harmonic-oscil-
                                                               values of Eq. (20) are infinitely degenerate. In
lator quarks. 11
                                                               order to remove this ambiguity, we set no=O;
  We rewrite here the ground-state solution
                                                               the physics of this procedure has been discussed
                                                               in Sec. III. Thus
                                                                   ,\=w(N+l),
as                                                             where                                              (21)
      l/Jo(X,p) = IMy)
                 = exp[ -~W(y,' + Y,' + Y3' + Yo')] ,   (15)      Since the separability is preserved, the no = 0
where                                                          condition is invariant under a Lorentz transforma-
                                                               tion. The covariant harmonic oscillator now has
     Yl =x 1 ,     Y2 =.l2,                                    three normal excitation variables, namely, Yu
     Y3 = (1- {l')-'/'(X 3    -   (3t),                 (16)   y" and y" and they are preCisely the hyperplane
                                                               variables mentioned in Sec. III. They are 0(3)-
     Yo = (1- (3')-l/'(t - (3x 3)'                             invariant within the hyperplane and generate co-
                                                               variant excited-state wave functions in exactly the
  The above linear transformation is a homoge-
                                                               same way as in the non relativistic oscillator.
neous Lorentz transformation of the original co-
                                                                  The eigenvalue A can serve as the mass of the
ordinate variables. Thus ,"o(x,p) satisfies the
                                                               covariant harmonic oscillator or as its mass
equation
                                                               squared. There have been many previous attempts
                                                             I
Klein-Gordon equation, and that the interaction
                                                                            I
                                                                                            stronge                      strange
                                                                                strange                    strange
can be manufactured in the usual way.                               ;
               Y. CONCLUDING REMARKS                         Ii    n= 0           A              A
                                                                                                      I
                                                                                                             A
                                                                                                                     I
                                                                                                                            A
                                                                                                                                   I
                                                                                                                                   I
  In this paper, we discussed, first, Lorentz
contraction properties of the covarient Gaussian
                                                             Ii    n =I            A             -
                                                                                                 A    I!     B
                                                                                                               -     I      c      iI
factor. We then proposed the use of the hyper-
plane technique to study possible relativistic in-
gredients in quantum mechanics. Finally, we in-                             I      A-
                                                                            I
                                                                   n=2                           C           D              D
troduced the normal-coordinate method in solving             I
                                                             l
the covariant harmonic-oscillator equation, and
showed that this method is technically equivalent                  n=3
to the hyperplane method.
  The normal-coordinate method is the most pow-
erful weapon in attacking harmonic-oscillator
                                                              FIG. 2. Summary of the present status of the multiplet
problems. It is a convenient way of describing              scheme in the symmetric quark model. A means "ex-
cQvariantly the orbital and radial quantum num-             cellent", B means "good", etc.
bers. Therefore we have studied in this paper a
possible theoretical tool which can link the basic
concepts of quantum mechanics to quantities that
                                                              In this paper, we have restricted ourselves to
can be measured experimentally.
                                                            nonstrange baryons. We realize that there are
  The most widely available numbers that can be
                                                            some difficulties in pionic form factors.;o As we
both calculated and measured are decay rates.'
                                                            see in the experimental summary of Fig. 2, we do
Since the decay rate calculations are not sensitive
                                                            not yet have enough experimental information from
to the exact shape of the wave function, the dec ay
                                                            which a linear mass spectrum can be derived for
rate alone does not force us to accept the harmonic-
                                                            the mesons. Therefore we cannot and do not in-
oscillator model.
                                                            sist on the simple harmonic oscillator for the
  The form factor study such as the one discussed
                                                            mesons. Consequently, we do not have to explain
in this paper strengthens our assertion on the
                                                            the above-mentioned difficulty at this time.
harmonic oscillator and enables us to relate the
observed curve to Lorentz contractions."
                                                                                   ACKNOWLEDGMENTS
  The most important characteristic of the har-
monic oscillator is, of course, the linearity of              This work was started when one of us (M. E. N.)
its eigenvalues. In order to study the linearity in         was visiting the UniverSity. of Maryland during the
the observed mass spectra, we need at least three           summer of 1972. This visit was supported by the
radial modes. For nonstrange baryons, we barely             National Science Foundation. She would like to
have these three levels, and the present authors            thank her colleauges at Maryland for the hospital-
studied this linearity. 7                                   ity extended to her during the summer.
*Work supported in part by the National Science Founda-       175, 2024 (1968). For the latest numerical analysis
  tion Grant No. NSF GP 8748.                                 ~he N:-=- 1 andN:-o 2 multiplets, see C. T. Chen-Tsai
1M. Gell-Mann, Phys. Lett. 8, 214 (1964).                     and T. Y. Lee, Phys. Rev. D 6, 2459 (1972).
'G. Zweig, CERN Report Nos. TH401 and TH412, 1964           7y. S. Kim and M. E. Noz, Nuo;o Cimento I1A, 513
  (unpublished) .                                              (1972). See also T. De, Y. S. Kim, and M.'E. Noz,
3J . J. J. Kokkedee, The Quark Model (Benjamin, New           ibid. 13A, 1089 (1973).
  York, 1969).                                              SR. P. Feynman, M. Kislinger. and F. Ravndal, Phys.
4For the latest attempt to keep quarks inside the hadron,     Rev. D~, 2706 (1971), and the references contained
  see K. Johnson, Phys. Rev. D 6,1101 (1972).                 therein.
'0. W .. Greenberg, Phys. Rev. Lett. 13, 598 (1964).        'So D. Drell, A. Finn, and M. Goldhaber, Phys. Rev. 157,
60. W. Greenberg and M. Resnikoff, Phys. Rev. 163,            1402 (1967).                                        -
  1844 (1967); D. R. Divgi and O. W. Greenberg, ibid.       10K. Fujimura, T. Kobayashi, and M. Namiki, Prog.
264                                                                                                  CHAPTER IV
                                     Michael J. Ruiz
       Center or Theoretical Physics. Department of Physics and Astronomy.
              University of Maryland. College Park. Maryland 20742
                                 (Received 20 May 1974)
               'I').(y)   =NHnt(Yl)Hn,(Y2)Hn.(Y3)exp[-zrof.Y
                                                       1.,-+2    2
                                                              + Yo)],               (4)
(6)
                                 F.C. Rotbart
  Department of Physics and Astronomy, Tel Aviv University, Ramat Aviv, Israel
                           (Received 12 June 1980)
Within relativistic quantum mechanics the complete orthogonality relations for the
covariant harmonic oscillator are derived. These relations include time-axis
excitations and are valid for wave functions belonging to different Lorentz frames.
In relativistic quantum mechanics (RQM), l the covariant harmonic oscillator
appears as a natural extension of the nomelativistic case. 2 As pointed out by Kim
and Noz3 in an earlier development of a covariant oscillator wave function, these
functions can be applied to a wide range of hadronic processes. The RQM
formalism in general, and the covariant oscillator wave functions in particular, allow
the probability interpretation to be extended to the relativistic domain. This raises
the important question of orthogonality between states in different Lorentz frames.
For spatial excitations, this was investigated by Ruiz.4 However, the RQM
oscillator functions differ from those of Kim and Noz, not only in that they derive
from a more general covariant formalism, but also in that time-axis excitation states
are not excluded from the ground state as they are in the formalism by Kim and
Noz. Of course, in both formalisms these time excitation states are necessary for
completeness of the Hilbert space. The purpose of this note is to calculate the
complete orthogonality relations for states in differing Lorentz frames, including the
time-axis excitations. We shall, to some extent, be following the paper of Ruiz.
The covariant harmonic oscillator in a stationary state is described bl
where m and k are positive constants and the x in'l'(x) denotes xf1= (xO, xl, xZ, x 3) .
Under the Lorentz transformation
Y3 = 1 (x 3 - /h0)' (2)
                 1 [
              [ 2m -
                         vz + aY5
                           y
                               iP ] + '2k<Y
                                       l.,-+z
                                              - YO)J 'I'(y) = K\j1(y),
                                                            2]                      (3)
(6)
/(s,r,u,v)= Jd 4 x exp[-s2+2sOY3-r2+2roy'3-u2+2uoy'0-v2
where y and y' are, by (2), functions of x. Introducing the variables                           ~   and 11, defined
by
(9)
            J'1ft   n3nO(y )'lfn' 3n' o(y' )d 4x     =   B:-: =~l,(1 -   ( 2)(no + n, + 1)/2   a n'o- no
                                  II"       (i a)2"'(1 - a 2r"'(n3!n' 3!nO!n' 0!)1I2
                               XL                                                                          (15)
                                 A.=O   (n3 - A)!(no - A)!(n' 0 - no + A)!A!
for n' 0 ~ no. Note that the requirement that A be summed over the smaller of
no or n3 is automatically taken care of by the factorials in the denominators. In the
case of no ~ n' 0 we simply interchange the primed quantities on the right-hand side
with the unprimed ones and replace a with (-a).
For no = n' 0 = 0, these orthogonality relations reduce to the relations obtained by
Ruiz. In addition, it can be seen from (15) that Lorentz transformations contract the
wave function not only along the z axis, but also along the time axis. This is due to
the fact that time in the wave function is on an equal footing with the space
coordinates. A hypothetical observer will measure the extent of the wave function in
time, in his frame, at equal z in essentially the same way that he would measure the
length at equal time. This extent in time is not to be confused with the time interval
between two events which occur at the same z only in one particular frame.
It is amusing to note that in evaluating the sum in expression (15), the exponent of
(1 - a) ranges from              ~ (no + n3 +            1) to   ~ (I no -    n31 +1). This behavior is
reminiscent of angular momentum representations and suggests that a "spherical"
coordinate representation for the covariant harmonic oscillator might be useful.
This research was partially supported by the Binational Science Foundation (BSF),
Jerusalem, Israel.
lL. P. Horwitz and C. Piron, Helv. Phys. 46,316 (1973).
2L. P. Horwitz and F. C. Rotbart (unpublished).
3y . s. Kim and M. E. Noz, Phys. Rev. D 8,3521 (1973); Found. Phys. 9 375 (1979); Y. S. Kim, M. E.
Noz, and S. H. Oh, ibid 9, 947 (1979).
4M. 1. Ruiz, Phys. Rev. D 104306 (1974).
272                                                                                                                   CHAPTER IV
         Y.S.Kim
         Center for Theoretical Physics, Department of Physics and Astronomy, University of Mary/and, College Park.
         Maryland 20742
         (Received 29 November 1979; accepted 22 January 1981)
IIS8    Am. J. Phys., Vol. 49. No. 12, December 1981                                                                P. Han and Y. S. Kim        11'8
274                                                                                                                CHAPTER IV
   The harmonic oscillator equation given in Eq. (10) is sep-      this requirement.
arable in many different coordinate systems. We are inter-            What Dirac wanted from his "conditional" equality was
ested here in the coordinate that is most convenient for           to freeze the motion along the time separation variable in a
constructing solutions that are diagonal in the Casimir op-        manner consistent with quantum mechanics and relativity.
erators ofEq. (7). These operators take the simplest form in       This means that we can allow a time-energy uncertainty
the Lorentz coordinate system in which the hadron is at            along this timelike axis without excitations, in accordance
rest:                                                              with Dirac's own "C-number" time-energy uncertainty re-
                           x' = x,   y' =y,                        lation. 7 This time-energy uncertainty without excitation is
                                                                   widely observed in the relation between the decay lifetime
                  z'=(z-/1I)/(I-/1'),I2,                    (13)   and energy width of unstable systems.' The C number in
                    I' =   (t -/1z)l(1 _ /1')'/'.                  the matrix language is one-by-one matrix, and is the
                                                                   ground state with no excitations in the harmonic oscillator
We assume here that the hadron moves along the z direc-
                                                                   system.
tion with velocity parameter /1.
   In te.-ms of the above coordinate variables, we can write          We have observed in Sec. III that the subsidiary condi-
the Casimir operators as                                           tion of Eq. (II) becomes Eq. (17) in terms of the coordinate
                                                                   variables in which the hadron is at rest. Eq. (17) restricts the
                      P'= -(alaX~)'                         (14)   I' dependence to that of the ground state. Equation (II)
and                                                                therefore eliminates all timelike excitations in the Lorentz
                                                                   frame where the hadron is at rest, and makes the uncertain-
                         w, = M '(L')',                     (15)
                                                                   ty associated with the t ' direction a C-number uncertainty
where                                                              relation. We can therefore conclude that the subsidiary
                     L:    =-   iEij,X;(alax').                    condition ofEq. (II) is a quantum-mechanical form of Dir-
                                                                   ac's "instant form" constraint given in Eq. (3).
We also have to take into account the fact that the hadron            In order that the dynamical system be completely consis-
(mass)' operators is constrained to take the eigenvalues de-       tent, the subsidiary condition should commute with the
termined by the oscillator equation for the internal wave          generators of the Poincare group:
function and that we have to consider p' of the form
                                                                                           [Pa , P"a,; 1= 0,
                      p' = m~     + H(x').                  (16)                                                                (19)
   In addition, we have to consider the form of the subsid-
iary condition given in Eq. (II). In terms of this moving
                                                                                          [Mao' P"a"+ 1= o.
coordinate system, the subsidiary condition takes the form         The above equations follow immediately from the fact that
                                                                   the operator P"a"+ is invariant under translations and Lo-
                   (I' +alal')¢(x,P) =0.                    (17)
                                                                   rentz transformations.
By using the moving coordinate system, we have achieved               Since the Casimir operators are constructed from the
considerable simplification in the expressions for the Casi-       generators of the Poincare group, we are tempted to con-
mir operator W' and the subsidiary condition, without              clude that the constraint operator commutes also with the
complicating the forms for p' given in Eqs. (14) and (16).         invariant Casimir operators. However, we have to note
This subsidiary condition restricts the t ' dependence to that     that the operator p' also takes the form of Eq. (16). There-
of the ground state and forbids excitations along the time-        fore, it should commute with H (x) given in Eq. (10). How-
separation variable in the Lorentz frame where the hadron          ever, a simple calculation gives
is at rest.
   As was discussed in Ref. I, the subsidiary condition of                           [H(x), P"a,;   1=    P"a"+.                (20)
Eq. (17) forbids timelike excitations that contribute nega-        This means that the right-hand side is not identically zero,
tively to the total eigenvalue. This condition therefore           but vanishes when applied to the wave functions satisfying
guarantees the existence of the lower limit in the (mass)'         the subsidiary condition of Eq. (II).
spectrum. The absence of such timelike excitations are per-           In his paper, Dirac considered also the commutation re-
fectly consistent with what we observe in the real world.          lations between dynamical quantities and the constraint
   After these preparations, it is a simple matter to write        condition that is only "approximately" zero. He asserted
down the solutions which form the desired representation           that the resulting "Poisson bracket" should also vanish in
of the Poincare group':                                            the same "approximate" sense. The commutatorofEq. (20)
                ¢(X,x) = ifJ(x, P)exp(    ± iPX),           (18)   indeed vanishes in the manner prescribed by Dirac'
                                                                      It is well known that Dirac concludes his paper by noting
with                                                               some difficulties associated with the potential term in mak-
       ifJ(x, P) = (1/1T)'I2[exp( - t "/2)]Rn(r')yrn(O', ¢ 'I,     ing his system of "Poisson brackets" completely consistent.
where r', 0', ¢' are the spherical coordinate variables in a       The crucial question is whether the harmonic oscillator
three-dimensional Euclidian space spanned by x, y, and z'.         formalism can resolve Dirac's "real difficulty." We shall
                                                                   attack this problem here by carrying out some explicit
IV. FURTHER CONSIDERATIONS OF THE                                  calculations.
                                                                      In formulating his scheme to solve the commutator
CONSTRAINT CONDITION
                                                                   equations for the generators of the Poincare group, Dirac
  As was pointed out in Sec. II, Dirac was interested in a         chose to adopt the view that each constituent particle in
possible quantum-mechanical form of his "instant form"             "atom" (bound or confined state) is on its mass shell, and
constraint of Eq. (3). The key question at this point is           that the total energy is the sum of all the free-particle ener-
whether the subsidiary condition given in Eq. (II) meets           gies and the potential energy. This potential term indeed
1159     Am. J. Phys., Vol. 49, No. 12, December 1981                                                    D. Han and Y. S. Kim   1159
  COVARIANT PICfURE OF QUANTUM BOUND STATES                                                                                           275
 causes the real difficulty in making the commutator system       and considered the constraint
 self-consisten t.                                                                        v = O.                                       (26)
    In the covariant oscillator formalism, we observe that
                                                                    The basic advantage of using the light-cone variables is
 the Casimir operators of the Poincare group clearly indi-
                                                                  that the Lorentz transformation takes a very simple form.
 cate that the mass of the hadron is a Poincare-invariant
                                                                  For instance, the transformation given in Eq. 113) can be
 constant, but they do not tell anything about the masses of
                                                                  written as
 constituent particles. Let us write down the four-momen-
 tum operators for the constituents in terms of the X and x                                                                            (27)
 variables:
                                                                                        v' = [(l-P)/[1 +P)l'!2 v.
1160   Am. 1. Phys., Vol. 49, No. 12. December 1981                                                         D. Han and Y. S. Kim     1160
 276                                                                                                                                     CHAPTER IV
                                                                                       special relativity.
                                                                                    'The most successful bound-state model in field theory is of course the
                                                                                       Bethe-&lpeter equation. However, the Bethe-Salpeter wave function
                                                                                       does not yet have proper quantum-mechanical interpretation. See Sec. I
                                                          "                            ofG. C. Wick, Phys. Rev. 96, 1124 (1954). The difficulty in giving a
                                                                                       physical interpretation to the relative time-separation variable between
                                                                                       two bound-state particles was mentioned earlier by Karplus and Klein.
                                                          "                            See R. Karplu. and A. Klein, Phys. Rev. 87, 848 (1952).
                                                                                   'Po A. M. Dirac, Rev. Mod. Phys. 21, 392 (1949).
                                                                                   'Yo S. Kim, M. E. Noz, and S. H. Oh, Am. J. Phys. 47, 892 (1979); 1.
                                                                                       Math. Phys.20, 1341(1979).
Fig. 2. Space-time geometry for the instant form in the light--cone coordi·        'E. P. Wigner, Ann. Math. 40, 149(1939).
nate system. The usual quantum excitations take place along the z' direc-          'P. A. M. Dirac, Proc. R. Soc. wndon A 114, 243, 710 (1927).
tion. The "C-number" uncertainty relation holds along the       I'   axis, and     HE. P. Wigner, in Aspects o!Quantum Field Theory, in HonourofP.A. M.
there are no excitations along this timelike direction.                                Dirac's 70th Birthday, edited by A. Salam and E. P. Wigner (Cambridge
                                                                                       University, wndon, 1972).
terms of thisLorentz deformation picture, to explain Feyn-                         9W. Heitler, The Quantum Theory of Radiation, 3rd ed. (Oxford Univer-
man's parton phenomenon I. both qualitatively'7 and                                   sity, wndon, 1954). See also D. Han, Y. S. Kim. and M. E. Noz, Found.
quantitatively. '8                                                                     Phys. (to be published).
                                                                                  '''Yo S. Kim, M. E. NOl, and S. H. Oh, J. Math. Phys. 21, 1224(1980).
   Finally, let us describe the "instant form" quantum me-                       11This exponential form is also derivable from Yukawa's work. See Eq.
chanics discussed in Secs. II, III, and IV using the light-                           (10) ofH. Yukawa, Phys. Rev. 91, 416(1953). For an interpretation of
cone coordinate system. As is seen in Eq. (29), the Gaussian                          this original paper, see D. Han and Y. S. Kim, Prog. Theor. Phys. 64,
form is diagonal also in the instant-form coordinate varia-                            1852(19801·
bles. The z' and t' axes in the uv coordinate system are                         '2The fact that the proton (one ofhadrons) is not a point particle and has a
shown in Fig. 2. There are usual quantum excitations along                            space-time extension was discovered by Hofstadter. See R. Hofstadter,
the z' axis. Along the t ' direction, Dirac's C-number time-                          Rev. Mod. Phys. 28, 214(1956).
                                                                                 I 'Since Hofstadter's discovery, there have been many attempts to con-
energy uncertainty relation holds.
                                                                                      struct theoretical models for relativistic extended hadrons. See. for in-
                                                                                      stance, V. N. Gribov, D. L. loWe, and I. Ya. Pomeranchuk, J. Nuel.
VI. CONCLUDING REMARKS                                                                Phys. (USSR) 2, 768 (1965) or5ov. J. Nucl. Phys. 2, 549(19661; N. Byers
   As we stated in Sec. I, quantum mechanics and relativity                           and C. N. Yang. Phys. Rev. 142. 796 (1966); J. D. Bjorken and E. A.
                                                                                      Paschos, ibid. 185, 1975 (1969); B. L. Iolfee, Phys. Lett. B 30, 123
are two of the most important subjects in the physics cur-
                                                                                      /1969); K. Fujimura, T. Kobayashi, and M. Namiki, Prog. Theor. Phys.
riculum. Because of its mathematical simplicity, the har-                             43,73(1970); A. L. Licht and A. Pagnamenla, Phys. Rev. D 2, 1150,
monic oscillator is one of the most effective teaching instru-                         1156(1970); S. D. Drell and T. M. Yan, Ann. Phys. (NYI60, 578 (19711.
ments. In their recent papers,"s Kim et al. emphasized that                           Y. S. Kim and R. Zaoui, Phys. Rev. D 4,1764 (1971); R. G. Lipes, ibid.
the oscillator model can serve an effective purpose in teach-                         5.2849(1972); S. IshidaandJ. Otokozawa, Prog. Theor. Phys. 47, 21\7
ing relativistic quantum mechanics and high-energy phys-                              (19721; T. D. Lee, Phys. Rev. D 5,1738 (19721; G. Feldman, T. Fulton,
ics at the level of the first- and second-year graduate                               andJ. Townsend, ibid. 7,1814(1973). Y. S. Kim and M. E. Noz. ibid. 8.
curriculum.                                                                           3521 (1973). See also Refs. 1,2, and the references contained therein.
   The wave functions in the oscillator formalism are com-                       '''Perhaps one of the current models of extended hadrons is the "MIT bag
patible with the known principles of quantum mechanics                                model," as is explained by K. Johnson in Sci. Am. 241 (1),112 (July
                                                                                       1979). One interesting question in this model is how "bags" would look
and relativity. However, what was missing in the past has
                                                                                      to moving observers.
been a broader theoretical base from which this specific                         '·'The quark confinement problem is regarded as one of the most impor-
model is derivable. In this paper, we have shown that this                            tant current problems in the particle theory front. The ultimate goal of
theoretical base had already been given by Dirac in his                               this program is to find a potential that confines the quarks Inside ha-
"instant" form quantum mechanics.                                                    drons within the field theoretic framework ofQCD (quantum chromo-
   Since the appearance of Driac's original paper in 1949,"                          dynamics). The batiic question is then this. What are we going to do with
                                                                                      this confining potential? The next step is naturally to construct bound-
many authors have made and are still making attempts to
                                                                                     state wave functions, which eventually leads to the question of their
construct solutions of the "Poisson brackets" given in Eq.                            Lorentz transformation properties. As was noted in QED (quantum
(6).'9 The point is that the commutator equations are basi-                          electrodynamics),l this does not as yet appear to be an easy problem.
cally differential equations without any specific form for                            For an introductory review article on QCD, see W. Marciano and H.
potentials. The final form of solutions therefore depends on                          Pagels, Phys. Rep. 36 C, 138(19781.
boundary conditions andlor forms of potentials. It is thus                       II'We have to say that the most important observation made on Lorentz-
possible to end up with solutions that are not covariant. 2"                         deformed hadronswas Feynman's parton model. See R. P. Feynman, in
In the present paper, we discussed a solution with a covar-                          High Energy Collisions, Proceedings of the 3rd International Confer-
iant form for potential satisfying a covariant space-time                            ence, Stony Brook, New York, edited by C. N. Yangetal. (Gordon and
boundary condition.                                                                   Breach. New York, 1969); Photon-Hadron intcractions {Benjamin,
                                                                                     Reading, MA, 19721.
'Yo S. Kim and M. E. Noz, Am. J. Phys. 46, 484(1978).                            '1For an explanation of the peculiarities in Feynman's parton picture, see
2R. P. Feynman, M. Kislinger. and F. Ravndal. Phys. Rev. D 3, 2706               Y. S. Kim and M. E. Noz, Phys. Rev. DIS, 335(19771. For a graphical
  (1971). The point of this paper is that the inventor of Feynman diagrams       interpretation of the formulas in this paper, see Y. S. Kim and M. E. Noz,
  stated that it is not practical, if not impossible, to use Feynman dia-        Found. Phys. 9, 375 (19791.
  grams for relativistic bound-state problems. Feynman et al. suggested          1KFor a calculation of the proton structure function, see P. E. Hussar.
  that the relativistic harmonic oscillator model, even if it is not totally     Phys. Rev. D 23. 2781(1981).
  consistent, can serve useful purposes. The point of Ref. I is that the         I~For one of the most re(..'Cnt papers on this subject, see A. Kihlberg, R.
  oscillator model does not have to be imperfect, and therefore that it can      Marnelius, and N. Mukunda, Phyc;. Rev. D 23, 2201 (1981).
  be made consistent with the known rules of quantum mechanics and               1HSee, for instance, R. Fong and J. Sucher, J. Math. Phys. 5,456 (19641.
1161     Am. J. Phys., Vol. 49, No. 12, December 1981                                                                        D. Han and Y. S. Kim        1161
Chapter V
In 1955, Hofstadter and McAllister observed that the proton is not a point particle.
Although several field theoretic approaches had been made immediately after this
discovery to explain the space-time extension of the proton, a satisfactory answer to
this question can be found in the quark model, in which the proton is a bound state
of three quarks.
In order to explain the high momentum-transfer behavior in the Hofstadter
experiment, we need a wave function for the proton which can be Lorentz boosted.
The covariant harmonic wave function discussed in the papers of Chapter IV is a
suitable wave function for this purpose. In 1970, Fujimura, Kobayashi, and Namiki
calculated the form factor of the proton, and showed that the asymptotic behavior of
the form factor is due to the Lorentz deformation of the wave function.
The most peculiar behavior in high-energy physics is Feynman's parton picture. In
1969, Feynman observed that a rapidly moving proton can be regarded as a
collection of an infinite number of partons whose properties appear quite different
from those of quarks. This model is clearly spelled out in the paper of Bjorken and
Paschos (1969). In 1977, using the covariant oscillator formalism, Kim and Noz
showed that the static quark model and Feynman's parton picture are two different
limiting cases of one covariant physics. Hussar in 1981 calculated the parton
distribution for the rapidly moving proton using the covariant harmonic oscillator
wave function.
LORENlZ-DIRAC DEFORMATION IN HIGH ENERGY PHYSICS                                        279
Nevertheless, if we make the naive assumption that the proton charge cloud and its
magnetic moment are both spread out in the same proportions we can calculate
simple form factors for various values of the proton "size". When these calculations
are carried out we find that the experimental curves can be represented very well by
the following choices of size. At 188 MeV, the data are fitted accurately by an rms
radius of (7.0±2.4lX10-14 cm. At 236 MeV, the data are well fitted by an rms radius
of (7.8±2.4)X10-1 cm. At 100 MeV the data are relatively insensitive to the radius
but the experimental results are fitted by both choices given above. The 100-MeV
data serve therefore as a valuable check of the app'aratus. A compromise value
fitting all the experimental results is (7.4±2.4)XlO-14 cm. If the proton were a
spherical ball of charge, this rms radius would indicate a true radius of 9.5xlO-14
cm, or in round numbers 1.0xlO-13 cm. It is to be noted that if our interpretation is
correct the Coulomb law of force has not been violated at distances as small as
7xlO- 14 cm.
\ E~ECTRONI SCATT~RING
                                     ,
                                              FROM HYDROGEN  r--
                            \                 188 MEV (LAB)
\ -- - -
                                                                     r--
                                         ~        ANOMALOUS MOMENT
                                                  CURVE ___
-~
                    I    MOTT CURVE      r        ~
                                                    '~
                                                     ,   ~
                                     EXPE~IMENTAL   CURV!>?'         ~   I
                                                             \
                                                                 \
                                                         -- ~
                                                                             \
                    2
                                50      70     90     110        130             150
                        LABORATORY ANGLE OF SCATTERING (IN DEGREES)
Fig. 1. The figure shows the experimental curve, the Mou curve, and the point-charge, point-magnetic-
moment curve. The experimental curve passes through the points with the auached margins of elTOr. The
margins of elTOr are not statistical; statistical error would be much smaller than the elTOrs shown. The
limits of elTOr are, rather, the largest deviations observed in the many complete and partial runs taken over
a period of several months. Absolute cross sections given in the ordinate scale were not measured
experimentally but were taken from theory. The radiative corrections of Schwinger have been ignored
since they affect the angular distribution hardly at all. The radiative corrections do influence the absolute
cross sections. Experimental points in the figure refer to areas under the elastic peaks taken over an
LORENIZ-DIRAC DEFORMATION IN HIGH ENERGY PHYSICS                                                              281
energy interval of ± 1.5 MeV centering about the peale. The data at the various points are unchanged in
relation to each other when the energy interval is increased to ± 2.5 MeV about the peak; the latter widths
include essentially all the area under the peale.
73
      Taking account of the Lorentz contraction effect of the extended nucleon core as a nucleon
  but not as a quark, it is shown that the Gaussian inner orbital wave function can produce
  the form factor very close to the dipole formula.
     Recent experiments show that the nucleon electromagnetic form factor are
empirically described by the "scaling law" e-1GEP=/1p-1GMP=IL,.-IGMn (~F) and
GEn=o and by the "dipole formula" F= (l+K-'[t[)-', where we have followed
the usual notations and K'=0.71 (GeV/c)'. The scaling law was already dis-
cussed on the theoretical basis of the nonrelativistic urbaryon (quark) model. 1).')
Ishida et a1.') and Drell et a1. 3) attempted to extract information about the inner
orbital wave function at short distances from the [t[-dependence of F in a wide
region of [t[ over M' (At being the nucleon mass), using nonrelativistic for-
mulas. In this note we show that if possible relativistic effects as a nucleon
 (not as a quark), especially the Lorentz contraction of the nucleon core, are taken
into account in a proper way, their conclusions become never true but the simple
Gaussian inner orbital wave function can produce the form factor very close to
the dipole formula.
      Those who are working with the nonrelativistic quark model have believed
that if It I<;Mq' (Mq being the quark mass), nonrelativistic formulas can be used
for everything. As for the form factor, therefore, they have used
(2)
where m~ is the mean square mass of p and (J) mesons and q the momentum
transfer. ¢ (x, ... ) stands for the inner orbital wave function, where independent
inner coordinates are denoted by x and         Assuming the simple Gaussian func-
tion for ¢, we have got
(3)
where (r'), is the mean square radius of the nucleon core. It is evident that
the simple Gaussian function never gives us the form factor consistent with the
dipole formula for It I?:..M'. This is the reason why Ishida et a1. introduced a
singular wave function and Drell et a1. discussed singular potentials among con-
stituent particles. It is, however, to be noted that Eq. (2) is a nonrelativistic
formula to be verified not only for It I<Mq' but also for It I<M'. Here we want
to emphasize that relativistic effects as a nucleon (but not as a quark) become
very important for ItI2M'. Indeed, we can see that the Lorents contraction ef-
fect as a nucleon for Itl>M' should reduce <r'), in Eq. (3) by the Lorentz
factor, r- l, approximately proportional to M'ltl- l • Hence Eq. (3) must be
modified essentially in its It I-dependence in the following way:
(4)
for the inner orbital wave function, *) where P stands for the center-of-mass
momentum of the composite system, i.e. the nucleon momentum, and (a/a)' is
the normalization costant determined by !! IcfJI'd'rd's"= 1. The constant a is related
to the mean square radius of the nucleon core through a-I = <r').l3. It may be
worth while to emphasize another reason why Eq. (5) is used here: Equation
 (5) represents the ground state eigenfunction of the Hamiltonian of a four-
dimensional harmonic oscillator consistent with the famous linearly raising tra-
jectory in an extended particle modeL') Our procedure should be regarded as
one theoretical attempt in an extended particle model represented by a trilocal field
based on the quark model, rather than one in the naive relativistic quark model.
284                                                                                              CHAPTER V
or symbolically
                                                                                                     (6')
where Pr and PF are, respectively, the initial and final momenta of the nucleon
and q = PF - Pr . a and b is one of the following pairs; (0, - -/'2,), (-/3/2, 1/ -/'2,)
and (- -/3/2, 1/ -/'2,). Note here that a' + b' = 2 for every pair. Inserting Eq. (5)
into Eq. (6), one obtains
(7)
Here the first factor is not other than the overlap integral
with (8)
*) It is to be noted that most of the infinite component field theories have identified the form
factor with the overlap function (¢F, ¢I) but never with (¢F, e iqx ¢1) itself. In fact, some authors
have derived our later result, (¢ F, ¢ 1) = (1 + Itl/2M2) -2 as the form factor, using the infinite component
field theory. See A. O. Barut's lecture given at the Colorado Summer School in 1967. We must em-
phasize here that the form factor should not be given by (¢r, ¢,).
LORENTZ-DIRAC DEFORMATION IN HIGH ENERGY PHYSICS                                               285
 modified by a constant factor for M q '>ltl2:M". Note that they contain only
 one free parameter <r'), to be adjusted. Let us first compare the theoretical
 form factor given by Eq. (9a) together with Eq. (7)--call it Case (i)--
 with experiment') in Fig. 1, in which we have used <r')o=7.50 (GeV/c)~'. From
 them one can see that the theoretical curve given by Eqs. (9a) and (7) is not
 inconsistent with the experimental plot but quite different from the nonrelativistic
 Gaussian form factor Woo Next we examine Case (ii) in which Eq. (9b) is
 combined with Eq. (7), namely, each quark has the vector meson cloud. Choos-
 ing <r'), = 1.82 (Ge V / c)~', we see in Fig. 2 that the theoretical curve can
 reproduce the experimental plot in a wide range of It I from zero to about 25
 (Ge V / c)'. It is repeatedly noted that this fit has been obtained by adjusting only
 one parameter <r')" and that the nonrelativistic Gaussian form factor is strongly
 modified in its essence.
                o
                                       It I (GeVid
         Fig. 1. Comparison of the theoretical form factors with experiments in Case (i)
             with <r')c=7.50 (GeV/c) -'. The dipole formula and the nonrelativistic Gaussian
             form factor are, respectively, shown by the broken and chain lines.
286                                                                                       CHAPTER V
0.9
                                   0.8
              I ci'
0.7
                                   os
                -2                    0~~0.~~~O~I~0--~O~15~~~~--~~~~
              10
                                                 II I (GeV/d
Case(iil
16 Case (iv)
                    o                     10                20                 30
                                               III {GeVid
         Fig. 2. Comparison of the theoretical form factors with experiments in Case (ii)
             with     <r
                      2 ), =1.82 (GeV/c) - 2, in Case (iii) with <r» , =8.81 (GeV/c) - 2 and
             .1=1.4, and in Case (iv) with <r» , =1.20 (GeV/c) - 2 and .1=0.9. The broken
              line shows the dipole formula.
    Here we want to introduce a new parameter, say A, into the lllner orbital
wave function as follows:
                        (a)" - -
      ¢,(r, s ; P) = ,- ;; v2A -1 exp            [a{
                                                  "2 r' + s'- M'
                                                              2A (P·r)'- M'
                                                                         2A (p·s)' }] .        (10)
It is easy to see that the parameter A distinguishes the time-like extension from
the space-like one of the inner orbital motion, and that A= 1 gives us the original
one Eq. (5). Using ¢., we have got
(12)
The form factor F, is obtained by Eq. (11) together with the modified formulas
(13a)
(13b)
 In case (iii) Eq. (13a) is combined with Eq. (11), and Case (iv) is given by
 Eq. (13b) together with Eq. (11). In both cases the form factor goes to one
 proportional to (22\,1'ltl- 1)' like the dipole formula. Figure 2 shows us that the
 experimental plot can be fitted by the theoretical curves with <r'), = 8.81 (Ge V / c)-'
 and A=1.4 in Case (iii) and with <r'),=1.20 (GeV/ct' and A=O.9 in Case (iv).
 The theoretical curves are in good agreement with experiment. Needless to say,
 Cases (iii) and (iv) include Cases (i) and (ii), respectively, as their special cases
 with l= l.
      From the above arguments we have inferred that the Lorentz contraction of
 the extended nucleon core can be a possible origin of the "dipole formula".
 The same effects will appear also in inelastic electron proton collisions leading
 to the isobar excitation. In the nonrelativistic quark model we have got the
 differential cross section for the inelastic collision in the following form :6)
(14)
using the simple Gaussian wave function, where L is an integral number deter-
mined by the type of transition, and A a numerical factor. If we take the
Lorentz contraction factor into account, then we can infer that Eq. (14) should
be replaced with
                                                                                 (15)
where Vr and OF! (q') are, respectively, the effective Lorentz contraction factor
and the onrlap integral in the inelastic collision. The similar structure of the
wave function suggests us that r ~q' and OF! (q') ~ (q')-' as q' goes over M', and
then that the inelastic cross section would behave like the dipole formula squared
for q'?:J1,1'. Indeed, it seems to us that recent experiments indicate such a be-
havior for the cross section.') Detailed discussions will be given in a forth-
coming paper in which the full nucleon and isobar wave functions and the quark
current to be valid for .~lq'';P It I?M' will be formulated.
     The earlier form of this work was done wheh one of the authors (M. N.) was
working in the Niels Bohr Institute in Copenhagen. He would like to express
his sincere gratitude to Professor A. Bohr for his kind hospitality and to Professor
z. Koba for many discussions. Be is also much indepted to Professor T. Taka-
bayashi for helpful discussions.
288                                                                                   CHAPTER V
References
      1) Y. Kinoshita, T. Kobayashi, S. Machida and M. Namiki, Prog. Theor. Phys. 36 (1966), 107.
         In the first several sections they derived the scaling law, assuming that a virtual photon
         couples directly to one point quark in the nucleon core subject to the 56-dimensional re-
         presentation of the SU(6) symmetry.
      2) S. Ishida, K. Konno and H. Shimodaira, Prog. Theor. Phys. 36 (1966),1243. They first derived
         the scaling law in a semi phenomenological way, assuming that a virtual photon couples to
         the nucleon only through p and (J) mesons. They have also got the quark-theoretical form
         factor, Eq. (lb), together with Eq. (2), assuming each quark to have a vector meson cloud.
      3) S. D. Drell, A. C. Finn and M. II. Goldhaber, Phys. Rev. 157 (1967), Bl57.
      4) T. Takabayasi, Phys. Rev. 139 (1965), B138L
      5) D. H. Coward, H. DeStalbler, R. A. Early, J. Litt, A. Minten, L. W. Mo, W. K. H. Panofsky,
         R. E. Taylor, M. Breidenbach, ]. I. Friedman, H. W. Kendall, P. N. Kirk, B. C. Barish,
         .r. Mar and J. Pine, Phys. Rev. Letters 20 (1968), 292.
         L. N. Hand, D. G. Miller and R. Wilson, Rev. Mod. Phys. 35 (1963), 335.
         T. Jansens, R. Hofstadter, E. B. Hughes and M. R. Yearian, Phys. Rev. 142 (1966), 922.
         W. Albrecht, H. ]. Behrend, W. Flauger, H. Hultshig and K. G. Steffen, Phys. Rev. Letters
         17 (1966). 1192.
         W. Albrecht, H. J. Behrend, H. Dorner, W. Flauger and H. Hultshig, Phys. Rev. Letters
         18 (1967), 1014.
      6) K. Fujimura, Ts. Kobayashi, Te. Kobayashi and M. Namiki, Prog. Theor. Phys. 38 (1967),
         210.
      7) For example, see W. K. H_ Panofsky's report presented at the XIVth International Con-
         ference on High Energy Physics in Vienna in 1968.
LORENTZ-DIRAC DEFORMATION IN lllGH ENERGY PHYSICS                   289
RICHARD P. FEYNMAN
Talk given at
but their total energy LiEi (where each energy Ei is calculated from the
mass ~ of the parton via Ei = "';~2 + Pi . Pi is not equal to that of the final
proton Eo' In fact, the amplitude to find this state contains, among many
other factors, one which is inversely proportional to this energy difference
          A - (Eo- ~E)-l                                                    (1)
                               i
Knowing this wave function completely for some Po' say at rest, how can
we find it at some other momentum? It is very difficult to do, and in fact
requires knowledge of the entire Hamiltonian operator H, for the wave
function is not a relativistic invariant. This is emphasized by the point that
the momentum is the sum of the momenta of the parts but the energy is
not. The wave functions that should be useful for us are those in which Po
is very large in the z-direction and finite in the directions perpendicular to
that. If we take Poz = Xo Wand measure the parton's momentum in the
z-direction in the same scale Piz = ~ W, then the wave function has a
definite limiting form as W ---7 00 for xo' Xi finite. (xo ' of course, is
arbitrary; it may be taken to be unity, for example.) We have
          xo   =   k.J
                   ""    x·I                                                (2)
(3)
     E - P = (m 2 + p2 + p2)1/2 _ P
           z                            z       z
           =   (m2 + Q2 + x 2WZ)1I2 - xW
           ,., I m2 +Q2
              2W x
where m is the mass of the particle. Therefore, the amplitude becomes
                               2        2       2        2
               mo + Qo "" mi + Qi ]-1                                       (4)
          A - 2W[-- k.J
                                   Xo       i       Xi
294                                                                                   CHAPTER V
   IThe statement is not precisely correct. What is meant is the density matrix has definite
limits.
LORENn-DIRAC DEFORMATION IN HIGH ENERGY PHYSICS                                     295
The equations for x not wee simplify if one concentrates on the small x
part. It is then seen that there is an approximate scaling law for small x
(the approximation improving as x decreases) so that solutions with special
distributions of partons with a power law scale dependence (x-<X) are
eigenfunctions natural to field theory.
It may help to give a few, nearly trivial examples. First, according to first
order perturbation theory in the expression (4) for the amplitude, the
numerator does not depend on x, Q for scalar partons (couplings involve
no momenta). If one of the partons has an especially low x, the the term
(J..l2 + ~)/x belonging to it dominates and we get an amplitude
proportional to x (times the scale dP/E, or dx/x, of relativistic phase space).
This corresponds to <X = 0 for the scalar meson. Likewise, it can be shown
that the amplitude for (longitudinally polarized) vector partons varies as
constant (times dxlx). In this case a factor 1/x comes from the numerator
couplings. For spin 1/2 particles coupled in the simplest ways, the
amplitude varies as xll2 (times dx/x). In general, <X equals the spin of the
particle. In perturbation theory, these agree with well-known results for
the energy dependence of x sections, in particular that vector meson
exchange as in electrodynamics lead to constant cross sections in
perturbation theory.
Bremsstrahlung.
The theory of Bremsstrahlung with strong coupling and with the "photons"
of the field carrying the very type of currents which are sources of further
Bremsstrahlung has not been worked out in detail. Nevertheless, we may
boldly try to guess. that certain analogies to electromagnetic weak
interaction Bremsstrahlung exist. Some hope for sense here comes from
noting that many features can be be seen from a classical view which takes
h ~ 0 so e 21f:c large. Therefore some properties are understandable both
for e2lf:clarge, and for e2{fc small, may have more general validity. This is
especially likely if we understand the reasons for them clearly.
Next, the energy in the field this radiated is some fraction of the energy of
the particle which radiates. Thus the particle may be found after the
radiation to have lost on the average some fixed fraction of its energy.
LOREN1Z-DlRAC DEFORMATION IN HIGH ENERGY PHYSICS                                     299
For weak coupling electrodynamics, the vector field particles are emitted
independently into a Poisson distribution with mean number n emitted.
The probability that none are emitted is e-n. The sum of the chance of
emitting none, one, two, etc., (that is, the total cross section) is much like it
would be without coupling to the photons. Here we know the total x
section is constant, and so can try to interpret the energy fall- off s2cxo-2 of
the pure two body charge exchange reaction as the factor e-n for the
probability of no emission, where i1 is the expected mean number of
primary particles emitted. This multiplicity n must rise logarithmically
with energy then as n= (2-20.0 ) In s. The particles we observe are not, of
course, the primary field particles emitted, but rather the observed particles
are secondary disintegration products of these unknown primaries. But if
each primary produces on the average a fixed number of secondaries, we
see that the expectation is that the multiplicity grows logarithmically with
E.
This is necessary if our various ideas are to fit together. Because we have
already suggested that the mean number of any kind of particle emitted is
to vary with x as c dxlx for small x and, for a given x, not to vary
otherwise with the energy W of the collision (so that c is a constant). The
                                                        J
total mean number emitted, then, is c dxlx. The upper limit of x is of
finite order (for the formula fails as x ~ 1 and x cannot exceed 1) but the
lower limit is of order of wee x, (i.e., order trw) where the dxlx fails.
Thus the mean number emitted to the right must vary as c(ln W + const).
(Actually we can do the integral all the way to zero, for we expect the
integrand to be c dP.J..J~2 + OZ + p/ where ~ is the mass and Q is the
transverse momentum of a typical particle. Putting P z = xW, this is
     J X!
            c
                        dx
                                       =   c in     2
                                                     2Wx1
                                                            2 112
      o         ..Jx2 + (~2 + Q2)/WZ              (J.L + Q )
  3Report on the Topical Conference on High Energy Collisions, CERN 68-7, February
1968, Turkot, p.316.
300                                                                   CHAPTER V
momentum distribution.)
I have not yet studied the regularities involving the transverse momenta
(items (4) and (5) in our Introduction) from the viewpoint being developed
here. In the meantime, we can take these as empirical facts to be included
in any expectations. In the same way we leave for further research
strangeness and isospin character of these effects. We should notice,
however, that, although we discussed a charge exchange arising from an
exchange of a particle of the quantum numbers of the p-, the exchange of
any current of the usual octet would have analogous effects on the possible
Bremsstrahlung of particles coupled to other (non-commuting) currents.
The quantum numbers exchanged may involve not only currents of unitary
symmetry, for baryon number (and possibly spin) may be exchanged, and
we do not know if there are special couplings to baryon currents (or spin
currents) which are also involved in determining ao' However, I should
like to hazard the guess that baryon number cannot be exchanged without
the transfer of a fundamental part of spin 112. Such an ideal part already
has a = 112 which would imply a l/Vs behavior of amplitudes before
corrections to Bremsstrahlung. Therefore, if we do an experiment which
freely allows the emission of wee mesons, except that the quantum
numbers are controlled so that a baryon must be exchanged between right
and left systems this cross section probably approaches a 1/s behavior,
instead of the constant expected for similar experiments in which no
baryons need be exchanged.
The probability that the total momentum of all the emitted right moving
p 's is less than y is proportional to yC so that (aside from diffraction
dissociation) the momentum distribution of the ongoing particle, when it
takes a fraction of momentum x close to 1 should vary as (l-x)C where I-x
is small.
TABLEt
  30                   7                  7.8
 470                 13 ± 1              14.1
 1500                18 ± 2              16.8
12300                24±4                21.6
LORENTZ-DIRAC DEFORMATION IN HIGH ENERGY PHYSICS                                                                                    305
               A model for highly inelastic electron-nucleon scattering at high energies is studied and compared with
            existing data. This model envisages the proton to be composed of pointlike constituents ("partons")
            from which the electron scatters incoherently. We propose that the model be tested by observing 'Y rays
            scattered inelastically in a similar way from the nucleon. The magnitude of this inelastic Compton-scat-
            tering cross section can be predicted from existing electron-scattering data, indicating that the experiment
            is feasible, but difficult, at presently available energies.
high energies and large momentum transfers is the                                    II. INELASTIC         e-p SCATTERING
possibility of obtaining detailed information about the                    The basic idea in the model is to represent the in-
structure, and about any fundamental constituents, of                   elastic scattering as quasifree scattering from pointlike
hadrons. We discuss here ,m intuitive but powerful                      constituents within the proton, when viewed from a
model, in which the nucleon is built of fundamental                     frame in which the proton has infmite momentum. Tbe
pointlike constituents. The important feature of this                   electron-proton center-of-mass frame is, at high energies,
model, as developed by Feynman, is its emphasis on                      a good approximation of such a frame. In the infinite-
the infinite-momentum frame of reference.                               momentum frame, the proton is Lorentz-contracted
   It is argued that when the inelastic scattering process              into a thin pancake, and the lepton scatters instan-
is viewed from this frame, the proper motion of the                     taneously. Furthermore, the proper motion of the con-
constituents of the proton is slowed down by the                        stituents, of partons, within the proton is slowed down
relativistic time dilatation, and the proton charge dis-                by time dilatation. We can estimate the interaction
tribution is Lorentz-contracted as well. Then, under                    time and the lifetime of the virtual 5t:ttes within the
appropriate experimental conditions, the incident lepton                proton. By using the notation of Fig. 1, we find the
scatters instantaneously and incoherently from the                      following.
individual constituents of the proton, assuming such a                     Time of interaction:
concept makes sense.
   We were greatly motivated in this investigation by                                        r'" 1j qo= 4P j(2M v-Q'),             (2.1)
Fcynman, who put the above ideas into a highly work-
                                                                        where 4" was uLlculatcd in the lepton-proton center-of-
able form. In Sec. II, we discuss the basic ideas and
                                                                        mass frame.
equations for the model as they apply to electron-proton
                                                                            Lifetime of virtual states:
scattering. Two models are then discussed in detail,
with interesting consequences for the ratio of clectron-                T={[(XP)2+!'12J1I 2
proton and electron-neutron scattering. For a broad
class of such models, we lInd a sum rule which indicates                      +[ (1_x)2P2+!'22],!2_ [P'+ M p2Jl12) ~l
that, although it is not difficult to fit the data within
~50%, it is more difficult to do beller; the observed
                                                                                                          21'
                                                                                                                                   (2.2)
cross section is uncomfortably small.
   In Sec. III, we look for stringent tests of F eynman's
picture. We propose that, under similar experimental                    If we now require that
conditions, inelastic Compton scattering can also be                                                    r«T,                       (2.3)
calculated within the model. It is shown that the ratio
of inelastic electron-proton to inelastic 'Y-proton scatter-            then we can consider the partons, contained in the
ing, under identical kinematical conditions, is model-                  proton, as free during the interaction. Furthermore, if
independent and of order unity, provided the proton                     we consider large momentum transfers -rf»M2, then
constituents (which Feynman calls "partons") possess                    we expect the scattering from tbe individual partons to
unit charge and spin 0 or !. We propose experiments                     be incoherent. The above conditions appear to be
which can measure inelastic Compton scattering. To                      satisfied in the high-energy, large-mom en tum-transfer
this end we have estimated the yield and background                     experiments at SLAC.
                                                                          The kinematics for e-p inelastic scattering have been
  * Work supported   by the U. S. Atomic Energy Commission.             discussed in many places, in as many different nota-
                                                                185    1975
                                                                                                xJ' ax jN(X)a(v-~).
rest remain undisturbed during the interaction. The
interaction with the parton is as if the parton were a                                                                          (2.14)
free, structureless particle. The cross section du I dlliE'                                        o                    2xM
is then a sum over individual electron-parton interac-
tions appropriately weighted by the parton charge and                 Here peN) is the probability of finding a configuration
momentum. For a free particle of any spin and unit                    of N partons in the proton, (L, Q")N equals the average
charge, elementary calculation yields                                 value of L' Ql in such configurations, and jN(X) is the
                                                                      probability of finding in such configurations a parton
        W,(v,rf) = a(v-Q"12M) = Ma(q·p-!Q') ,                 (2.8)   with longitudinal fraction X of the proton's momentum,
while for WI, we have                                                 that is, with four-momentum xP'.
                                                                         Upon integrating over x, we find
                         u,=O for spinO,
                                                                          vW,(v,rf) =L P(N)(L Q,')NXjN(X) = F(x) ,              (2.15)
                         0",=0 for spin     L                 (2.9)                       N
0.4
0.3
            ]0.2
                                                            2'
                                                             (M/dSl)        I+R        02
                                                                                         2
                                                  Flw)' /I d (TI SldE 11+2 ....!.... (I+~ )Ton 2 8
                                                                                                2
                                                                                                                       r   I
            "-                                                            MOTT
                                                                                                                1.20
                                                                                                                        ·
                                                                                                                       ·+
                                                  R '(T, I"'l
                                                                                              ..   R            1.40   v
                  0.1
                                                  e, 6°                      2 (8eV/c)2                 0
                                                                                                                LGO
                                                                                                                1.80   4
                                                                                                                       ··
                                                                                                                2.00
                                                                                              .
                                                                                    0.70      0
                                                                                    ()'80     ~                 2.20
                                                                                     1.00                       2.30
                   0
                        0                           2            3             4                   5              6                 7
                                                                W'   /1/ 0 2 in (BeV) - 1
where jN(XI, "', XN) is the joint probability of finding                                           A. Three-Quark Model
partons (irrespective of charge) with longitudinal frac-
tions X " • • . , XN . It follows that IN is a symmetric                      Assuming that the proton is made up of three quarks
function of its arguments. Therefore,                                       with the usual charges,' we obtain
                                                                                      vW,=j,(x)=2x(1-x) , x=Q'/2Mv.                             (2 .23)
 ['   xldx,jN(XI)=~ j           dXI" ·dXN(L, Xi)
Jo                          N                '                              While th e data support vW, -> const as p ->c<> (or
                                                                            x -> 0), the model predicts that pW, should vanish, a
                              XjN(.~ I, "',xN)o(l-L, Xi)
                                                                            result not dependent on the specific choice of j,(x), but
                        =1/N.                                    (2.18)     only on the fact that ja is normalizable. In fact, within
                                                                            our one-dimensional model, if the number of partons
Putting together (2 .18) and (2. 15), we obtain a sum rule                  is held finite, then the cross section vanishes as x -> O.
                                                                               1     (I-x) I
                                                                        F.(x)=-- ---+---
                                                                                              12
                                                                                               x
                                                                             1-lnl 9 (Z-xl 6 (I-x)'
                                                                                                      X[lnC:
                           02    O~        ~        ~    I~
                                  1.'11l Mv
  FIG. 3. Plot of the results for a model of three quarks in a sca
                                                                                                                  X
                                                                                                                      )-2(I-xl   Jl,   (2.31)
of quark-antiquark pairs. The dashed line is visual fit through the
experimental points of Ref. 4.
 l'
                                                                        are present.
       =c      r:
            N-a,5 .. ·
                         (=+":")X(I-X)lV-'
                          9 3.Y
                                                                                  IC~
g@\; \~~~
             Icf'"                                                                                                                          \'
                                                                                                                                                I"
                                                               -          PHOTONS fROM                                                           I
                                                                     CONPTOH                                                                         \
                                                                                                                                                         \
                                                               - - - A«:ITONS FROM
             10- 31                                                   .". DECAv                                                                              \-_. r;~~5 fR             ~ r·
                                                                          ... J8 rAWe.                                                                        \
                                                                                                                                                                  \
                                                                                                                                                                  .\
                                                                                                                                                                   I
 MI~
  Ee>
  u    :.
             10"
                                                                                                                                                                       I
 ~• b r~c:
       'D
 "     'D
             lOll
10"
             100 J)
                       0        2          6         lO        '2        14        16        16     20       FIG. 8. Inelastic Compton scattering for a 22-GeV incident
                                               k# GeV/(                                                    bremsstrahlung spectrum. The background curves are the same
                                                                                                           as in Fig. 6.
   FIG. 6. Double-difJerentiallaboratory cross section for inelastic
Compton scattering for an 18-GeV incident bremsstrahlung spec-
trum. The solid curve corresponds to the signal. The dotted curve                                          where for any operator O(N) we have
is the background of 'Y's from "'s using the data of Ref, 8 as dis-
cussed in the text.                                                                                                          (0)= EN P(N)O (N)fN (x) ,                                  (3.11)
                                                                                                             For our model of three quarks in a cloud of quark-
unity, In general,'
                                                                                                           antiquark pairs, there exist upper and lower limits for
         d") £j'd")
      (tKldE'
         -                 (LQ;')/(EQ,'),
              kk' tKldE' '. '
                           7P
                                =
                                   ,
                                       -                                                          (3,10)
                                                                                                           (L Q')! (E Q'), We note from (2,25) and (2,26), for
                                                                                                           the proton, that
                                                                                                                        11     2            1                                 2
                                                                                                           (E Q;')N = - +-(N - 3) =-(E Q;')N+-
                  IV
                                                                                                                        27    27           3 ,                              27
                                                                                                                                                5                       4
                                                                                                                                           = -(~                  Q" )N--N.             (3.12)
                  ·. t                                                                                                                          9 ,
                                                                                                           Therefore, for identical kinematical regions' we have
                                                                                                                                                                        81
      Mel~ lOll
        u    :.
                                                                                                           1 -,,'j dU)             ( du)                 5.' ( du )
      .
      ~I~"
       b "
       ~"
                  10''''                                                                                   "3 £i?,.,dfldE' '. < dfldE'     >P   <"9 EE'                    dfldE' '.
                                                                                                                                                                                        (3,13)
                                                                                                     It   :0   25 GeV/c
                                                                                                     - - PHOTONS FROM COMPTON
                                                                                                     ---- PHOTONS FROM   DECAYS   .,,0
                                                                           5             10                    15            20              25
                                                                                              k' GeV/c
                              -l
charged ".'s_ We define                                                                                             X (Avogadro No.)
                     d u eff  E electron dk du'tp                                  =8.2X10--"(yield) r+ cm'/ sr BeV.                       (4.4)
                     --                  --                     (4.1)
                     d'Jdk'  k'+I ' I/2M k d'Jdk ' -                    The terms in the denominator have the following
                                                                        origin: The thin-target bremsstrahlung spectrum is
We assume, optimistically, that the partons have unit                   tdk / k, where t is the thickness in radiation lengths (r.l.)
charge and spin!; from (3.3) and (4_1) we obtain                        (the target was 0.3 r.!. Be). The factor 0.7 is a thick.-
                                                                        target correction calculated by Tsai and Van Whitis.'
dqef f         4a2   J .\!(E -k')/ 4Ek' ,;in2 (!9)
-         =-                                         XdXF(X)            (Yield)r+ is taken from the SLAC User's Handbook'
d'Jdk'       M'k'      ' I'                                             and the 'Y-ray flux is obtained by folding the ,,-'-decay
                                                                        spectrum into (4.4):
                x[    1 4k'X     ~'GO) +8k"X';'(!8)] ,          (4_2)
                                                                        ( -dU)'
                                                                              - =2 lE(dur)
                                                                                       --        dk' 21~(dur)
                                                                                                -""-      - - dk'
                                                                          dfldk r'  k, dfldk' ,H k' k, k, dfldk' 'If
with
                                                                                   '" (2Eo/ k,8) (8.2 X 10--") (yield)r+
                                  X=M. / Q'.                    (4.3)
                                                                                                                (as function of k,),       (4.5)
   We have calculated this expression for several in-
cident electron energies as a function of k' and 8, using               where in the last step we have used the empirical ob-
for F(,,) the values given in Ref. 2. The results are                   servation that
shown as Figs. 6-9. In the same figures are shown our                                    dUr/df!dk'~e-k'''EO ,          (4.6)
estimates of the corresponding background from the                      with
decay of photoproduced ".o's into "Y rays. The estimate                                     E o",0.154 BeV.
was made by assuming that the yield of ".+ measured
in the SLAC beam survey experiment' equals the ,,-'                       In Figs. 6-9, the background is that from an 18-BeV
yield. We thereby obtain, for the effective cross section               bremsstrahlung beam. It is expected that this back-
                                                                        ground increases slowly with beam energy, and keeps
    8   SLAC User's Handbook, Sec. 0.1, Figs. 1 and 2 (unpublished).      • Y. S. Tsai and Van Whitis, Phys. Rev. 149, 1948 (1966).
312                                                                                                                   CHAPTER V
                                                                              1
inelastic Compton process, if it exists, is displaced up-
 ward in energy, so that for 20-BeV electrons and above,                                              (l-x),N-'e"N
 there is a region where the Compton signal dominates
                                                                 F(x)=2x            NP(2N+l)(L: Q")(2N+l).        dN.
                                                                                 t:           i           sln1rN
 the ,..0 noise. In any case, it will certainly be necessary
 to compare the 'Y-ray spectrum with that of the ,..+                                                                             (AI)
 under the same conditions as reassurance that Compton
'Y rays are indeed being seen.                                   The contribution of the semicircle at infinity is negli-
   A variation of this experiment is to consider the             gible. For what remains in the integral, we can use
inelastic Compton terms in I'-pair photoproduction               (for x --70)
 (see Fig. 10). We have not analyzed this process in                 x(l-x)'N-''''xe-('N-')x= _!M-('N-l)x/ON,                     (A2)
detail. The rate is diminished by a factor roughly'O
 ~ (2a/3,..)[In (Emax/m,) -3.5J~ 1/350 but if the charged        and integrate by parts:
pions are absorbed immediately downstream from the
                                                                                              iJ
                                                                          l
                                                                               ·+i~
target, the background muon flux from 7r± decay can              F(x) =               .('N-l)x_
be reduced by a factor ~ 1/700 as well. Furthermore,                          a-ioa              aN
the two muons are strongly correlated in angle, pro-
viding a quite unique signature. All this is encourage-
ment that perhaps the background is manageable. The                                                                               (A3)
"singles" background from the Bethe-Heitler diagrams,
for which the undetected muon predominantly goes in              In the limit of x --7 0, we have
the forward direction, is interesting, as well, and very
likely exceeds the singles rate from Compton I' pairs.                                    N P(2N +1) (L Q;)(2N+lle'rNI·+i~
But this is also of interest in testing I'-e universality at     F(x) ~                   --=----=-=--=-=--
                                                                                                    sinn-N
very high '1'. The "Bethe-Heitler" muons probably                                                                         4-iao
                                                         Marilyn E. Noz
                     Department of Radiology, New York University Medical Center. New York. New York 10016
                                                  (Received 23 February 1976)
             It is shown that the covariant~harmonic·oscillator wave function exhibits the peculiarities of the Feynman
            parton picture in the infinite· momentum frame.
  In our previous publications",2 we discussed both                  singularities which cause deviations from Bjorken
the conceptual and phenomenological aspects of                       scaling. lO Our oscillator model does not contra-
the covariant-harmonic-oscillator formalism.                         dict this physical picture.
Based on the Lorentz-invariant differential equa-                       Perhaps the most puzzling and irritating ques-
tion proposed by Yukawa in connection with Born's                    tions in Feynman's parton picture' have been the
reciprocity hypothesis,' our starting point was a                    following problems:
technical innovation over the work of Feynman                          (a) The picture is valid only in the infinite-mo-
et ai.' Our solutions to the same oscillator equa-                   mentum frame.
tion satisfy all the requirements of nonrelativistic                   (b) Partons behave as free independent particles.
quantum mechanics in a given Lorentz frame, and                        (c) While the hadron moves fast, there are wee
satisfy the requirement of Lorentz-contracted                        partons.
probability interpretation for different Lorentz                       (d) The longitudinal parton momenta are light-
frames. We contend that our oscillator model is                      like.
the first formalism since the invention of quantum                     (e) The number of partons seemS much larger
mechanics in which the wave functions carry a                        than the number of quarks inside the hadron.
covariant probability interpretation. 3                                The purpose of this paper is to provide qualita-
  The real strength of our oscillator model lies in                  tive answers to all of the above questions. Our
the fact that one and the same wave function can                     starting point is the system of two bound quarks
provide the languages for both slow and fast had-                    in the rest frame which can be described by a
rons. Our formalism can be applied to the quark-                     covariant-harmonie-oscillator wave function. We
model calculations til the low-, intermediate-, and                  shall then boost this covariant bound system to an
high-energy regions.I.,2,6 However, one of the                       infinite-momentum frame and show that the pecu-
most challenging questions in high-energy physics                    liarities of the covariant oscillator coincide ex-
has been how to explain Feynman's parton" 8 pic-                     actly with the parton properties mentioned above.
ture in terms of a formalism Which can also de-                         Following Feynman et al.' we call these two
scribe the static properties of the hadron.                          quarks a and b. In the harmonic-oscillator for-
  Another approach to this problem has been to                       malism,'·2,3 the quark momenta P. and p, are not
explain Bjorken scaling in terms of the light-cone                   on the mass shell, but the total hadronic momen-
commutators and the initial hadron in its rest                       tum
frame. 9 Here, one promising line of reasoning
                                                                                                                                  (1)
has been that the hadron is a composite particle
and that its distribution function eliminates all the                 is on the mass shell. It is convenient to use the
15 335
                                                                   G:~j-(~r '
      x=                    .
                                                                                                                      (10)
  The transverse variables play only trivial roles
in the harmonic-oscillator formalism and also in               where M is the mass of the hadron. As P o -"",
the parton picture. For this reason, we shall                  the width of the L (and q J distribution becomes
omit the transverse part of the wave function in the           vanishingly small. Consequently,
following discussion.
                                                                   c=O and q_=O.                                      (11)
  If the hadron moves along the z axis with veloc-
ity fl, the ground-state wave function for this two-           This means that both     ~   and q are lightlike vectors,
quark system can be written as'·ll                             and
                          - 2" "i+i3 ~+ 2
                    Wexp {W[(1-fJ)       + (1+/'
                                            1=131) L 2l}
                                                                   ~+=l2z=/2t ,
      iJ!(x,ll) =   2iT                              J     '      q+=/2q.=/2qo·
                                                                                                                      (12)
where
(7)
momentum of the constituent quark                          particles. According to Eq. (12), the ~+ axis is
        1        1
                                                           also the f2t axis. Therefore, the time duration is
     P'='iP-m q                                     (13)   of the order of (P';M rw). This interval increases
                       ·
Since the four-vector q is lightlike, and we are
                                                           as Po  becomes large. If this interval is much
                                                           larger than the characteristic time of electro-
considering here only longitudinal momenta, P. is          magnetic interaction, then the partons of the pres-
also lightlike.                                            ent paper will indeed behave as Feynman's partons.
  Considering the width of the Gaussian function             We have shown above qualitatively how the co-
for the q + distribution, which is also the f2 q 0         variant oscillator produces Feynman's parton pic-
distribution, we can say that the momentum of the          ture in the infinite-momentum limit. The next
constituent quark mostly lies in the interval de-          question then is how we can use this formalism to
fined by the following limits:                             carry out the parton-model calculations.
     Pm~ =PoG +
                                                             In order to answer this question, we note first
                     ;) ,                                  of all that the above two-body formalism can be
                                                    (14)   easily generalized to the three-quark nucleon sys-
     Pmm =Po(~   - ;).                                     tem. 4 In performing the parton-model calcula-
                                                           tions, we have to square the wave function to get
The quantity (rw/2M) is of the same order of               the probability-density function. The Gaussian
magnitude as ~. For this reason, the lightlike             form remains Gaussian during the squaring pro-
four-momentum P. can be written as                         cess. The Lorentz -contraction property of the
                                                           Gaussian probability distribution is identical to
     p.=aP,                                         (15)
                                                           that of the wave function except for the factor of 2
with a ranging apprOXimately from zero to one.             in the exponent. In fact, the width quoted in Eq.
This wide-spread distribution and division of the          (14) is derived from the width of the probability
four-momentum are exactly like those of the par-           function.
ton model.                                                    As was noted earlier in this paper, the proba-
  Let us go back to the C distribution, which is           bility function exhibits a 6 function in the (q. - qo)
also the f2z distribution. We noted above that the         variable in the infinite-momentum limit. We can
motion along this axis should be almost free. Then         now eliminate the q 0 dependence by integrating
the momentum has to be sharply defined, and the            over this variable. The resulting function be-
momentum cannot have a wide-spread distribution.           comes the parton distribution function in the three-
Therefore the momentum distribution we noted in            dimensional space.
Eqs. (14) and (15) should be regarded as a distri-            The immediate calculations we can do using the
bution of free particles which are lightlike. This         above-mentioned procedure have already been
is exactly what we have in the original form of            carried out by Le Yaouanc e tal. 12 Starting from
Feynman's parton model, as well as being charac-           the three-dimensional parton distribution function
teristic of the quantum-mechanical picture of              which we could obtain by following the procedure
blackbody radiation. In both cases, the number of          outlined above, Le Yaouanc et al. indeed carried
lightlike particles is not conserved.                      out a comprehensive phenomenological analysis of
   Finally, let us consider the time interval during       all interesting physical quantities in the inelastic
which the above-mentioned partons behave as free           electron-nucleon scattering.
!Y. S. Kim and M. E. Noz, Phys. Rev. D 8,3521 (1973).      7R. P. Feynman. in High Energy ColliSions, proceedings
'Yo S. Kim and M. E. Noz, Phys. Rev. D 12, 129 (1975).       of the Third International Conference, Stony Brook,
  For orthogonality and Lorentz-contractiOn properties       New York, edited by C. N. Yanget ai. (Gordon and
  of the harmonic-oscillator wave functions, see M. J.       Breach, New York, 1969).
  Rulz, Phys. Rev. D 10, 4306 (1974).                      8J. D. Bjorken and E. A. Paschos, Phys. Rev. 185, 1975
'H. Yukawa, Phys. Re-;'- 91, 416 (1953).                     (1969).                                      -
4R. P. Feynman, M. Kisllnger, and F. Ravndal, Phys.        'R. A. Brandt, Phys. Rev. Lett. 22, 1149 (1969); 23,
  Rev. D 3, 2706 (1971).                                     1260 (1969). For a review article, see Y. Frisb;;;an,
sp. A. M:-Dirac, The Development of Quantum Theory           in Proceedings of the XVI Internaiional Co1!ference on
  (Gordon and Breach, New York, 1971).                      High Energy Physics, Chicago-Batavia,    m., 1972,
'Yo S. Kim and M. E. Noz, Phys. Rev. D 12,122 (1975);        edited by J. D. Jackson and A. Roberts (NAL, Batavia,
  M. J. Ruiz, ibid. 12,2922 (1975); Y. S. Kim, ibid. 14,     Ill., 1973), Vol. 4, p. 119.
  273 (1976); Y. S. Kim and M. E. Noz, ibid. (to be p~_    lOS. D. Drell and T. D. Lee, Phys. Rev. D 5, 1738 (1972);
  Jished).                                                   C.lI. Woo, Phys. Rev. D.!!., 1128 (1972).-We would
316                                                                                               CHAPTER V
  like to thank C. H. Woo for explaining the content of     wave ftmctions. The contraction property is explained
  his paper.                                                in terms of the step-up operator which transforms
lIy. S. Kim, Univ. of Maryland CTP Tech. Report No.         like the longitudinal coordinate.
  76-008, 1975 (unpublished). This paper contains also    12A. Le Yaouanc, L. Oliver, O. P~ne, and J.-C. Raynal,
  a more precise explanation of the Lorentz-contrac-        Phys. Rev. D g, 2137 (1975).
  tlon property of excited-state barmonlc-oscillator
LORENTZ-DIRAC DEFORMATION IN HIGH ENERGY PHYSICS                                                                                    317
  Since the introduction of the parton model,' one                      smeared by the momentum-fraction distribution
of the central issues in high-energy physics has                        of the valons. The assumption is that the inter-
been the relationship between the partons and the                       action which confines the valons, that is, the
valence quarks which seem responsible for other                         gluons which are exchanged among them, will not
high-energy properties such as mass spectra                             play so large a role in scattering processes as
and form factors. It was once naively believed                          to make the above analysis insufficient. The con-
that the proton structure function could be cal-                        sistency with which the valon picture appears to
culated from the valence-quark distributions inside                     model the actual behavior of nucleon structure
the hadron.' However, it is by now firmly esta-                         functions' is a persuasive argument that the
blished that quantum-chromodynamics (QeD)                               assumption made here is a good one.
processes stand between the valence quarks and                            The nucleon structure functions in the valon
the observed structure functions.                                       model take the form'
  With this point in mind, Hwa recently developed
an appealing method for dealing with nucleon                                   FN(X,Q2)~ ~ { dY GVIN (y)FV(x/y,Q 2),                 (1)
structure functions. Hwa's approach separates
out a component of the structure functions which                         where FN(x, Q2) is a nucleon structure function
is completely determined by QeD renormalization,                         (either F 2 or xF ,), P(x, Q2) is the corresponding
and uses the data to calculate a momentum-fraction                       function for a valon v, and GVIN(y)dy is the prob-
distribution for three constituent quark clusters                        ability of the valon having momentum fraction
or "valons".' The purpose of this paper is to point                      between y and y + dy. The sum is taken over the
out that Hwa's valon distribution is close to the                        three valons which constitute the nucleon. From
valence-quark distribution derivable from the                            the definition, we must have
covariant harmonic-oscillator model which has
been effective in explaining the nucleon mass spec-                                                                                  (2)
tra and form factors.
  The valon picture'·' is basically an attempt to                        and
establish a connection between the quark model
in which hadrons are bound states of their con-                                                                                      (3)
stituent quarks and the parton model which seems
necessary if we are to explain the observed struc-                         In the expression of Eq. (1) the nucleon structure
ture functions. When probed at high Q2, each of                          functions FN(x, Q') are well known from experi-
the valence quarks will itself be resolved into                          ment. For high-Q' processes, the renormalization-
infinitely many constituents due to the fact that                        group methods in QeD allow a description of
each of the valence quarks will be accompanied                           FV(x, Q2) in terms of its moments. Making use of
by a cloud of quarks, antiquarks, and gluons                             data from neutrino and muon scattering, and known
produced in ongoing QeD processes. We can cal-                           results from QeD, Hwa has obtained
culate the evolution of the valence quarks in-
                                                                                                                                     (4)
volved here to leading order in QeD using the
renormalization-group methods. s The nucleon
structure functions in the valon model are then                                                                                      (5)
given by the corresponding functions for each of
                                                                                                                                     (6)
the valons (the valence quark plus its cloud)
2782 COMMENTS 23
where G./~(y) and G./~(y) are the momentum-                function can explain the peculiarities in the orig-
fraction distributions for the u valon and the d           inal version of Feynman's parton modeL'
valon in a proton, respectively, and Go/~(y) is the          The three-particle kinematics associated with
momentum-fraction distribution obtained assuming           the proton wave function has been studied exten-
that these distributions are flavor independent.           sively.' It is, then, a straightforward matter to
The G./~ and Go/~ functions are plotted in Fig. 1.         derive the momentum-fraction distribution func-
   In spite of QCD's effectiveness in dealing with         tion in this scheme.'o The result is
the Q2 evolution of the hadronic structure func-
tions, it has not yet been helpful in determining                G.se(Y) =[3m/(21rw)'/2] exp[ -(m 2 /2w){l_ 3y)"j.
what should really be the starting point of this                                                                (7)
evolution, that Is, the distribution of the valence
quarks inside the hadron. In the meantime, we              In the above expreSSion, m represents the nucleon
                                                           mass, while w is the oscillator spring constant.
are allowed to consider other models which are
                                                           G•• eIY) is also plotted in Fig. 1. The value of w
consistent with existing rules of quantum mech-
anics and special relativity. The covariant har-           is taken to be m 2/2, which is the most acceptable
monic oscillator is such a model.                          value in the calculations of the mass spectrum 11
   The relatiVistic oscillator model existed long          and the gAlg. ratio.'2
before the quark model was invented." The early               Noteworthy, perhaps, is the fact that G•• eIY) is
applications of the osc illator model in the quark         normalized over the whole real line, while Hwa's
picture of hadrons include the study of hadronic           distribution extended only from y =0 to y =1. The
mass spectra. Its effectiveness in the relativistic        oscillator wave function is not restricted to the
domain was demonstrated first by Fujimura et               region of physically observable constituent en-
al. in their successful calculation of the proton          ergies. However, this is nothing strange, inas-
form factor.7 Fujimura et al. used normalizable            much as the nonrelativistic oscillator exhibits the
relativistic wave functions for three valence quarks.      same property. It should also be pointed out that
 We propose to use the same relativistic wave              the integral given in Eq .. (1) which determines the
                                                           structure functions is taken only over values from
function to determine the valence-quark distri-
bution in the parton regime.
                                                           o to 1 so that a nonzero value of G outside of this
                                                           region will play no role. At the present time,
   The covariant harmonic oscillator has been
                                                           we do not have enough experimental accuracy to
discussed extensively in the literature. In par-
                                                           decide whether the curvature is of the polynomial
 ticular, it was shown by Kim and Noz· that a
                                                           type given by Hwa or of the GauSSian form.
 rapidly moving hadron in this model has a broad
                                                             The agreement between G (y) from the data and
longitudinal-momentum distribution while the
                                                           the momentum-fraction distribution function for
spring constant of the oscillator becomes weak
                                                           the constituent quarks in the covariant oscillator
 to the laboratory-frame observer. It was pointed
                                                           model is surprisingly good. Clearly for y > 0.25,
out in Ref. 8 that this behavior of the oscillator
                                                           the oscillator distribution is very close to the
                                                           phenomenological curve given by Hwa. For y
                                                           < 0.25, the numerical agreement is not as good
                                                           as in the larger-y region. However, we have to
                                                           accept the fact that there are still large experi-
      2.0                                                  mental uncertainties in this small-y region, and
                                                           it would be difficult to trust at this time any closer
                                                           agreement than that given in Fig. 1.
       1.5                                                    As for the flavor dependence indicated by Hwa's
                                                           valon curves, it is not yet clear to us to what
       1.0
                                                           extent the difference between G./~ and Go/, is
                                                           actually required by experimental evidence since
                                                           this feature is observed mostly at smaller values
      0.5                                                  of y where, again, the experimental uncertainties
                                                           are the greatest. In any case, the covariant oscil-
                                                           lator is not designed to account for such a dif-
                                            1.0   J        ference. If the difference really exists, there
                                                           must be additional dynamical effects, beyond
   FIG. 1. The experimental and calculated GIY) func-
 tions. The G function calculated in the covariant os-     what we can account for with the oscillator model
 cUlator model is compared with the valon distribution     and QCD, to explain the flavor dependence. This
 functions obtained by Hwa from experimental data.         is beyond the scope of this paper.
LORENTZ-DIRAC DEFORMATION IN HIGH ENERGY PHYSICS                                                            319
23 COMMENTS 2783
  This paper is based on a part of the anthor's          losophy. The author would like to thank Professor
dissertation to be submitted to the faculty of the       Y. S. Kim for suggesting this research. The serv-
University of Maryland in partial fulfillment of         ices of the University of Maryland Computer Sci-
the requirements for a degree of Doctor of Phi-          ence Center are also appreciated.
IR• P. Feynman, in Third Topical Conference on High      'i K.Fujimura, T. Kobayashi, and M. Namiki, Prog.
  Energy Collisions, edited by C. N. Yang (Gordon and      Theor. Phys. 43, 73 (1970). See also R. G. Llpes,
  Breach, New York, 1969).                                 Phys. Rev. D5.    2849 (1972).
'J. D. Bjorken and E. A. Paschos, Phys. Rev. 185, 1975
                                                         8
                                                          Y. S. Kim and -M. E. Noz, Phys. Rev. D 15, 335 (1977).
  (1969).                                      -           See also Y. S. Kim and M. E. Noz, Fou;;;). Phys.!!,
3R• C. Hwa, Phys. Rev. D 22, 759 (1980).                   375 (1979).
'R. C. Hwa and M. S. Zahlr,- Phys. Rev. D 23, 2539       9R. P. Feynman, M. Kislinger. and F. Ravndal, Phys.
  (1981).                                  -                Rev. D 3, 2706 (1971).
                                                         10        -
('. A. DeGrand, Nucl. Phys. B151, 485 (1970).               Y. S. Kim  and M. E. Noz, Prog. Theor. Phys. 60, 801
 H. Yukawa, Phys. Rev. 91, 461 (1953); M. Markov,           Qn~                                          -
  Nuovo eimento Suppl. 3~760 (1956). For the latest      Up. E. HUBsar, Y. S. Kim. and M. E. Noz, Am. J. Phys.
  reinterpretation of these early papers, see D. Han        48, 1043 (1980).
  and Y. S. Kim, Prog. Theor. Phys. &i, 1852 (1980).     I'M. Ruiz, Phys. Rev. D 11, 2922 (1975).
Chapter VI
It was shown in Wigner's 1939 paper that the little group for massless particles is
isomorphic to the two-dimensional Euclidean group consisting of rotations around
the origin and translations along the two perpendicular directions. It is not difficult
to associate the rotation with the helicity. However, the physieal interpretation of
the translation-like degrees of freedom had been an unsolved problem. Fortunately,
two of Weinberg's 1964 papers started breaking ground on this problem, and led to
the clue that the translation-like transformations are gauge transformations.
In 1982, Han, Kim, and Son used the Lorentz condition to reduce the complicated
transformation matrix of the little group into the transpose of the coordinate
transformation matrix on the two-dimensional plane. They confirmed therefore, that
the translation-like transformations are gauge transformations. These authors
extended their study to the symmetry of massless particles with spin 1/2 and
concluded that the polarization of neutrinos is due the requirement of gauge
invariance.
                                          321
MASSLESS PARTICLES AND GAUGE TRANSFORMATIONS                                                                                         323
                 The Feynman rules are derived for massless particles of arbitrary spin j. The rules are the same as those
              presented in an earlier article for m>O, provided that we let m --+ 0 in propagators and wave functions,'and
              provided that we keep to the (2j+1)-component formalism [with fields of the (j,O) or (O,j) type] or the
              2 (2j+ I)-component formalism [with (j,O) Ell (O,j) fields]' But there are other field types which cannot be
              constructed for m=O; these include the (j/2,j/2) tensor fields, and in particular the vector potential for
              j = 1. This restriction arises from the non-semi-sirnple structure of the little group for m = O. Some other
              subjects discussed include: T, C, and P for massless particles and fields; the extent to which chirality con-
              servation implies zero physical mass; and the Feynman rules for massive particles in the helicity formalism.
              Our approach is based on the assumption that the S matrix is Lorentz invariant, and makes no use of
              Lagrangians or the canonical formalism.
creation operator for the antiparticle with helicity - >.               The states 1>.) must furnish a representation of the
can only be used to form a field transforming as in (1.3)               little group. That is, the unitary operator U[ ffi] corre-
under those representations (A,B) of the homogeneous                    sponding to ffi'. does not change the momentum of the
Lorentz group such that >.=B-A. This limitation                         states 1>'), and thus must just induce a linear trans-
arises purely because of the non-semi-simple structure of               formation:
the little group for m = 0. The difficulties (indefinite
metric, negative energies, etc.) encountered in previous                               U[ffi]I>')=L dA'A[ffi]1 >.'),         (2.3)
                                                                                                   A'
attempts to represent the photon by a quantized vector
potential A ,(x) can therefore now be understood as due                 with
to the fact that such a field transforms according to the
(t,!) representation, which is not one of the repre-
sentations allowed by the theorem of Sec. III for                       Therefore, we can catalog the various possible spin
helicity X= ±1. On the other hand, the (j,O) and (O,j)                  states I>.) by studying the representations d[ffi] of the
representations used in this article (corresponding for                 little group.
j = 1 to the field strengths) are allowed by our theorem,                  This is most easily accomplished by examining the
and they cause no trouble.' In a future article we shall                infinitesimal transformations of the little group. They
show that it is in fact possible to evade our theorem,                  take the form
and that the Lorentz invariance of the S matrix then                                                                        (2.5)
forces us to the principle of extended gauge invariance.
   In Ref. 1 we gave the Feynman rules for initial and                  where !l'. is infinitesimal and annihilates k:
final states specified by the z components of the massive
particle spins. In order to facilitate the comparison with                                                                   (2.6)
the case of zero mass, and for the sake of completeness,
we present in Sec. VIII the corresponding F eynman                      In order that (2.5) be a Lorentz transformation we must
rules in the helicity formalism of Jacob and Wick! The                  also require that
external-line wave functions are much simpler, though                                                                      (2.7)
of course the propagators are the same.
                                                                        the index v being raised in the usual way with the metric
                                                                        tensor g", defined here to have nonzero components:
           II. TRANSFORMATION OF STATES
                                                                                       gl1=g"=g"'=l, g"0=-l.                 (2.8)
   The starting point in our approach is a statement of
the Lorentz transformation properties of massless par-                  Inspection of (2.6) and (2.7) shows that the general !l"
ticle states. The transformation rules have been com-                   is a function of three parameters 8, Xl, X2, with nonzero
pletely worked out by Wigner,' but it will be convenient                components given by
to review them here, particularly as there are" some little
known but extremely important peculiarities that are                                   !l12= _(l21=8,                        (2.9)
special to the case of zero mass.
   Consider a massless particle moving in the z direction
with energy K. It may have several possible spin states,
which we denote 1>'), the significance of the label>. to be                            !J20 = -!l"'=!J'3= -!J'2= X2 •       (2.11)
determined by examining the transformation properties
of these states. Wigner defines the "little group" as the               The Lie algebra generated by these transformations can
subgroup of the Lorentz group consisting of all homo-                   be determined by recalling the algebra generated by the
geneous proper Lorentz transformations ffi', which do                   full homogeneous Lorentz group, of which the little
not alter the four-momentum k' of our particle.                         group is a subgroup. An infinitesimal Lorentz trans-
                                                                        formationA'. can be written as in (2.5), with(l'. subject
                                                               (2.1)    only to (2.7). The corresponding unitary operator takes
                                                                        the form
                    kl=k'=O; k'=ko=K.                          (2.2)                   U[H!l]=H(i/2)(l'·J."                (2.12)
  • As • case in point, there does not seem to be any obstacle to the
construction of field theories for massless charged particles of                             J,.= -J.,=J,.I.                (2.13)
arbitrary spin j, provided that we use only proper field types, like
(j,0) or (O,j). The trouble encountered for j~ 1 by K. M. Case
and S. G. Gasiorowicz [Phys. Rev. 125, 1055 (1962)], can be
                                                                        It is conventional to group the six components of J.,
ascribed to their use of improper field types, such as (!.t). We plan   into two three-vectors:
to discuss this in more detail in a later article on the electro-
magnetic interactions of particles of any spin.
   'M. Jacob and G. C. Wick, Ann. Phys. (N. Y.) 7, 404 (1959).
                                                                                                                            (2.14)
   • E. P. Wigner, in Tlreorelical Physics (International Atomic
Energy Agency, Vienna, 1963), p. 59.                                                         K;=J;o=-Jo;,                   (2.15)
MASSLESS PARTICLES AND GAUGE TRANSFORMATIONS                                                                               325
   A general state containing several free particles will               tation D[AJ of the homogeneous proper orthochronous
transform like (2.38), with a factor [I pi I/ Ipi J1I'e ilH for         Lorentz group:
each particle. These states can be built up by acting on
the bare vacuum with creation operators a*(p,>') which                   ULA]1/In l+) (x; X)U[AJ-l
satisfy either the usual Bose or Fermi rules:                                                  = L Dnm[A-l]1/Im l +) (Ax; >.).   (3.2)
                                                              (2.39)
so the general transformation law can be summarized                       It is well known that the various representations
in the statement                                                        D[AJ can be cataloged by writing the matrices J and K,
                                                                        which represent the rotation generator J and the boost
 U[AJa*(p,>.)U-l[AJ= [I Api / Ipi J1I2                                  generator K as
         Xexp{i>'El[£-l(Ap)A£(p)])a*(Ap,>.).                  (2.40)
                                                                                        J=A+B; K=-i(A-B).                        (3.3)
 Taking the adjoint and using the property [see (2.30)J
                                                                        Since J and K satisfy the same commutation rules
                        El[R] = - El[R-IJ                     (2.41)    (2.16)-(2.18) as J and K, the A and B satisfy decoupled
gives the transformation rule of the annihilation                       commutation rules
operator                                                                                 AXA=iA; BXB=iB,
                                                                                                                                 (3.4)
U[AJa(p,>')U-I[AJ= [IAPI/ Ipi JI2                                                               [cti,lBjJ=O.
       X exp{i>.El[£-l(p)A-l£(Ap)J)a(Ap,>') .                 (2.42)
                                                                        The general (2A + 1)(2B+ l)-dimensional irreducible
   We speak of one massless particle as being the anti-                 representation (A ,B) is conventionally defined for inte-
particle of another if their spins j are the same, while                ger values of 2A and 2B by
all their charges, baryon numbers, etc., are equal and
                                                                                           Aab.a'b' = ~bb'] aa,(A) ,
opposi teo Whether or not every massless particle has                                                                            (3.5)
such an antiparticle is an open question, to be answered                                   Bab.a'b'   = Oaa'] bb,CB) ,
affirmatively in Sec. V. But if an antiparticle exists,
then its creation operator b*(p,>.) will transform just                 where a and b run by unit steps from -A to +A and
like a*(p,>'), and b*(p, ->.) will transform just like                  from - B to + B, respectively, and JI>l is the usual
a(p,>.) :                                                               2j+1-dimensional representation of the angular mo-
                                                                        mentum
U[AJb*(p, ->')U-l[AJ=[IAPl/lpIJU'
                                                                          [Jl(i)±iJ,!i)Jq'q=Oq'.q±l[U'FCT) (j±CT+ 1»)112,
    Xexp{i>.El[£-l(p)A-l£(Ap)J)b*(Ap, ->.).                  (2.43)                                                              (3.6)
                                                                                  [JaliJ].'q=CTOq'q.
If a particle is its own antiparticle, 6 then we just set
b(p,>') = a(p,>.).                                                         For massive particles of spin j, we have already seen
                                                                        in Sec. VIII of Ref. 1 that a field 1/1<+) (x) can be Con-
         III. A THEOREM ON GENERAL FIELDS                               structed out of the 2j+l annihilation operators a(p,CT),
                                                                        which will satisfy the transformation requirements (3.1)
   As a first step, let us try to construct the "annihilation
                                                                        and (3.2), for any representation (A,B) that "contains"
fields" 1/In l+) (x; >'), as linear combinations of the annihi-
                                                                        j, i.e., such that
lation operators a(p,>'), with fixed helicity >.. We require
that the 1/I n I+J transform as usual under translations                       j=A+BorA+B-10r .. · or IA-BI.                     (3.7)
              i[P .,1/In(+) (x; >.)J= 0.1/In(+) (x; >.)        (3.1)    [A spin-one field could be a four-vector a,!), a tensor
and transform according to some irreducible represen-                   (1,0) or (0,1), etc.J We might expect the same to be true
                                                                        for mass zero, but this is not the case. We will prove in
   \I It is not so obvious what is meant by a massless particle being   this section that a massless particle operator a(p,X) of
its own antiparticle. If charge conjugation were conserved, then        helicity >. can only be used to construct fields which
we would call a particle purely neutral if it were invariant (up to a
phase) under C. But if we take weak interactions into account then      transform according to representations (A ,B) such that
only CP and CPT are available, and they convert a particle into
the antiparticle with opposite helicity, For massless particles there                            B-A=>'.                         (3.8)
is no way of deciding whether a particle is the "same" as another
of opposite he1icity, since one cannot be converted into the other
by a rotation. This point has been thoroughly explored with regard      For instance, a left-circularly polarized photon with
to the neutrino by J. A. McLennan, Phys. Rev. 106, 821 (1957)           >.= -1 can be associated with (1,0), (!.t), (2,1), ...
and K. M. Case, ibid. 107, 307 (1957). See also C. Ryan and S.          fields but not with the vector potential (!,!), at least
Okubo, Rochester Preprint URPA-3 (to be published). Even if a
massless particle carries some quantum number (like lepton              until we broaden our notion of what we mean by a
number), we can still call it purely neutral if we let its quantum      Lorentz transformation. It will be seen that the restric-
number depend on the helicitYi however, in this case it seems more      tion (3.8) arises because of the non-semi-simple structure
natural to adopt the convention that the particle is different from
its antiparticle, with b(p,X) ;o'a(p,X).                                of the little group.
MASSLESS PARTICLES AND GAUGE TRANSFORMATIONS                                                                              327
  The condition (3.1) requires that ,pn(+) be constructed       matrix representatives J and K:
as a Fourier transform
                                                                D[ <R(O,x"x,)] = 1+iO,9,+ix,(XI- ,92)
                 1                                                                            +ix,(X2+,9I) , (3.16)
,p.(+)(Xj X)=--
              (2,..)3/'                                         or, using (3.3),
                 X   f      d'p
                          --e,p··a(p,X)un(p,X) , (3.9)
                          [2Ipl]'/'
                                                                D[ <R(O,X"x,)]= 1+iO( Cl 3+tB,)+ (x,+iX.) (Cl,-iCl.)
                                                                                        + (x,-iX,) (tB,+iCB,). (3.17)
exp{ iXEl[£-' (p)A-I£ (Ap) ]}L: Dnm[£(p)]um(h) [left] (j,O), (jH, !), (jH, 1), ... , (3.24)
                     j
                                                                   Using the wave functions (4.8) and (4.9) in (4.1) and
               1            d'p                                 (4.2), the annihilation fields now take the form
9'.<+)(x)=--             ---
       (2.-)'/2          [21 pi ]'/2
                                                                9'/+)(x)=_1_ jd'P[2 Ipl ]/-l/2
                    XeiND •. _;w[.e(p)Ja(p, - j),       (4.1)           (2.-)'/2
x.<+)(x)=--
               1    j    ---
                            d'p                                                            XeiP·'D •._Pl[R(p)]a(p, - j),   (4.10)
       (2.-)'/2          [21 pi Jl/2
                                                                x.<+)(x)=_1_ jd'P[2IpIJH/2
                           XeipxD./"[.e(p)Ja(p,j),      (4.2)                   (2.-)'12
                            .
                                                                rotation matrices' are needed; R(p) is the rotation that
U[AJX. C
       +)(X)U-l[AJ=L D ..,W[A-1JX.,C+) (Ax) . (4.4)             carries the z axis into the direction of p.
                              '
                                                                   If our particle has an antiparticle (perhaps itself), then
Here DW[AJ and DW[AJ are the nonunitary (2j+l)                  there is available another operator b*(p, -A) which
X (2j+l)-dimensional matrices corresponding to A in             transforms just like a(p,A) [see (2.43)J, and which
the (j,O) and (O,j) representations, respectively. They         carries the same charge, baryon number, etc. It is then
are the same as used in Ref. 1, and can be defined by           possible to define creation fields
taking Cl3=0 or a=o, or, equivalently, by representing
the generators J, K with                                        9'.H(X)=_l_ jd'P[2IpIJ/-l/2
                                                                       (211')'/2
                DC1l: J=JC1l, K=-iJC1l,                 (4.5)
                                                                                           Xe-iPXD •._P)[R(p)Jb*(p,j),     (4.12)
                fjw: l=JW, K=+iJW,                      (4.6)
where JW is the usual spin- j representation of the             x/-)(x)=_1_ jd3P[2Ipl]H/2
angular momentum, defined by (3.6). In particular, the                          (2.-)'12
transformation .e(p) defined by (2.33) is represented on                              Xe-ip,xD •. Pl[RCP)]b*(p, - j),      (4.13)
Hilbert space by
                                                                which satisfy (3.1), which transform according to (4.3)
           U[.e(p)J= U[RCP)] exp{ -i¢(1 pi )K,} ,       (4.7)   and (4.4), respectively, and which also transform like
                                                                 9'<+) and x c+) under gauge transformations of the first
            </>(lpl)=ln[lpl/K],                        (2.35)
                                                                kind. [For a "purely neutral" particle,' b*(p,A) is to be
and therefore the wave functions appearing in (4.1) and         replaced by a*(p,A).]
(4.2) are                                                           The most general fields satisfying all these conditions
                                                                are linear combinations of creation and annihilation
D•._Pl[.e(p) ]                                                  fields.
                                                                               1'. (x) = hl'.<+)(X)+~RI'.H (x),      (4.14)
      .'
  = L D ...</J[R(p)][exp{ -q,(lpl)J,W}].,._i
                                                                                  x.(x) =     ~RX.C+)(X)+~LX.C-)(x).       (4.15)
                                                                They again transform as in (4.3) and (4.4):
D•. pl[.e(p)]
                                                                       U[AJ9'.(x) U-l[A] = L D..,W[A-1J9'., (Ax) ,
                                                                                                 .'
                                                                                                                           (4.16)
      L D..,W[R (p)][exp{</> (I p I)J,W}].,.i
  =
      .'
                                                                                                  .'
                             =D •. PJ[R(P)](lpl/Kl/.   (4.9)           U[AJX.(x) U-l[AJ=         L D..   ,W[A]X., (Ax) .   (4.17)
Note that the matrices DW[R] and fjW[R] for a pure                IJ   See, for example, M. E. Rose 1 Elementary Theory of Angular
rotation R are both equal, being given by the familiar          Momentum        (J. Wiley & Sons, Inc., New York, 1957), p. 48 If.
MASSLESS PARTICLES AND GAUGE TRANSFORMATIONS                                                                           329
If these particles have no antiparticles (including them-        The matrices .. and if can be easily calculated by use
selves), then we have to take 'IL='1Il=O. We will see in      of the obvious formulas
the next section that, instead, requirement (1.4) (and
hence the Lorentz invariance of the S matrix) dictates                                    1           ;
full crossing symmetry, with 1'I1l1 = Ihi, I'ILI = 1~1l1·
                                                                        a•._,1i"._i=-[ IT (X-J,)] ••"                 (5.5)
                                                                                     (2j)!    X-i+1
   The fields obviously obey the Klein-Gordon equation
                                                                                          1       H
              D'<p.(x) = 0;   D'x.(x) = O.           (4.18)                a.".a.,,J=-[ IT (J,-A)] ••,.               (5.6)
                                                                                     (2j)!    X-i
However, they are (2j+ l)-component objects con-
structed out of just two independent operators a(p,A),        Applying the rotation matrix DW[R(P)] and multi-
b*(p, -A), and so they have a chance of obeying other         plying by 12pI'i gives
field equations as well. It is not hard to see from
(4.10)-(4.13) that they do indeed satisfy the additional                            22j       i
                                                                           .. (p)=- IT (XP-p·J)                       (5.7)
field equations                                                                  (2 j) ! X-;+l  '
                                                                                                          J d'p
just Maxwell's free-space equations for left- and right-
circularly polarized radiation:                                                         1
                                                                 [<p.(x) ,<p.,t (y)]±=---1I' ••' ( -ia)     -
          vX[E-iB]+i(O/at)[E-iB]=O,                  (4.21)                           (2".)3                21pI
The fact that these field equations are of first order for
any spin seems to me to be of no great significance, since
                                                                                  1
                                                                 [x.(x),x.,t(y)]±=-if.., (-ia)
                                                                                                          J d'p
                                                                                                            -
in the case of massive particles we can get along per-                          (2".)3                      21pI
fectly well with (2j+ 1)-component fields which satisfy              XC Itill 'e'P'(x-u)± (- )'i!'1LI 'e-"'(%-u)].   (5.10)
only the Klein-Gordon equation.
                                                                 In order that (5.9) and (5.10) vanish for x-y space-
           V. CROSSING AND STATISTICS                         like, it is necessary and sufficient that exp[ip· (x-y)]
                                                              and exp[ -ip· (x-y)] have equal and opposite coeffi-
   We are assuming that the a's and b's satisfy the usual     cients
commutation (or anticommutation) rules (2.39), so it is                          Ihl'='F(_)'il'7IlI',            (5.11)
easy to work out the commutators or anticommutators
of the fields 'P. and X. defined by (4.10)-(4.15):                              1~1l1'='F(_)'il'7LI2.                (5.12)
   ['P.(x),'P.,t(y)]±=-
                         1    J d'p
                                --1I' •• ,(p)
                                                              So we must have the usual connection between spin and
                                                              statistics
                    (2 ..)'     21pI
                                                                                 (±)=_(_)2i,                  (5.13)
              xCI hl'e"'(~u)± 1'1IlI'e-"'(~')],       (5.1)
   [x.(x),x.,t(y)]±=-
                  (2..)3
                         1    J d'p
                                - i f•• ,(P)
                                21pI
                                                              and furthermore, every left- or right-handed particle
                                                              must be associated, respectively, with a right- or left-
                                                              handed antiparticle (perhaps itself) which enters into
                                                              interactions with equal strength:
              x[1 til I'ei"(~u)± I'1L I'e-"'(~')],    (5.2)
where                                                                                 Ihl=I'7IlI,
                                                                                                                     (5.14)
 ....,(p) = I2pl'iD •._iW[R(p)]D.,._/I1'[R(p)] ,      (5.3)                          Itlll =1'7LI·
 if •• '(p) = 12pl'iD.ji)[R(P)]D.,./,l'[R(p)]'        (5.4)   By redefining the phases of the a's and b's, and the
                                                              normalization of 'P and x, we can therefore set
These are the only nonvanishing commutators (or
anticommutators) among the 'P, <pI, X, and Xl (except
for a "purely neutral" particle, in which case X is
proportional to 'P I; see Sec. IX).                           with no loss of generality. The fields are now in their
330                                                                                                                              CHAPTER VI
(7.16) to the left of the 0 functions in (7.11) and (7.12),                        (It should be kept in mind that the index 0', which is of
obtaining the propagators                                                          no direct physical significance, will appear on some
                                                                                   other wave function or propagator, and eventually be
S ••' (x-y)= -ill", (-iil)a'(x-y)
                                                                                   summed over.) The corresponding wave function for a
           =   -i 2 i+ 1ttTl1 ,J.llJ.l2· "J.l 2j dp./J iJ2 , ••                    particle of definite helicity A is
                                                       X il"ia' (x-y) ,   (7.18)
                                                                                            U.(x j p,A) = L Dp,lil[R(P)]u.(Xj p,p.),       (8.3)
S •• , (x-y)= -ifi ••, (-iil)a'(x-y)
           =   -i 2 i+ 1lur1 ,fJIJ.12"   . J.l   2id J.ll°J.l2 ,   ••
                                                                                   where R(p), as always, is the rotation that carries the z
                                                       X il p2i a'(x-y) , (7.19)   axis into the direction of p. Using (8.1) in (8.3) gives
where -ia'(x-y) is the usual propagator for spin zero
                                                                                   U .(Xj p,A) = (2W)-I!'(21r)-'!'
and mass zero
                                                                                                 X {exp( -p·](i)O)Dli)[R(p)]}.,e'p,
-ia'(x) =i8(x)a+(x)+iO( -x)~( -x)                                                              = (2W)-I!'(21r)-'!'
                        = +[l/47r'(x'+i<)].                               (7.20)                  X{DW[R(p)] exp(-],WO)}.,e'p,
Equations (6.3) and (6.4) show that these propagators                                          = (2w)-I!'(21r)-3I2D.,Ii)[R(p)]e-"e'P·'. (8.4)
are covariant in the sense that
                                                                                   Furthermore we see from (8.2) that
               D(i)[A]S(x)DW[A]t=S(Ax) ,                                  (7.21)
                                                                                                    e-A8=[w(p)+ Ipl/m]-'.                  (8.5)
               D(i)[A]S(x)DW[A]t=S(Ax).                                   (7.22)
                                                                                      In order to avoid m's appearing in the denominator of
  The propagators in momentum space are given by the                               U. for negative helicity, it will be convenient to re-
Fourier transforms of (7.18) and (7.19)                                            normalize all fields of mass m by multiplying them with
   According to the Feynman rules of Ref. 1, the wave                              antiparticle by 'P.t(x) is the complex conjugate
function for a particle of spin j, ],=p., momentum p,                              V. '(Xj p,A) = (2W)-I!' (21r)-'!2 (- )-i+AD •. _,W'[R(p)]
and mass m, destroyed by 'P.(x), is                                                                            XmH(w+lpl)'e+'p,. (8.9)
u.(Xj p,p.) = (2w)-I/'(21r)-3I'
                                                                                      A massive particle can be created or destroyed in any
                          X[exp(-p.]WO)].pe'P" ,                           (8.1)   helicity state by either the (j,O) field 'P.(x) or the (O,j)
where                                                                              field x.(x). Inspection of the field x.(x) given in Eq.
                              w= [p'+m']'!' ,                                      (6.9) of Ref. 1 shows that the wave functions corre-
                                                                           (S.2)
                       sinhO= Ipl/m.                                               sponding to (8.6)-(8.9) are given by replacing 0 by -0,
MASSLESS PARTICLES AND GAUGE TRANSFORMATIONS                                                                          333
 O,(x; p,A) = (2W)-1/'(21T)-3I2D"W[R(p)]                         Time-reversal (T) and space inversion (P) are classi-
                                                               cally defined as transforming a particle of momentum p
                  XmH(w+ Ipi Ye ipx
                                                               and helicity A into
                      [particle destroyed],          (S.10)
                                                                                     Tlp,A)a: I-p, A),                (9.1)
O,*(x; p,A) = (2W)-1/'(21T)-3I2D"W'[R(P)]
                  XmH(w+1pI )'e- ipx                                                 Plp,A)a:I-p,-A),                 (9.2)
                        [particle created],          (S.11)    while charge conjugation (e) just changes all particles
 V,(x; PA)= (2w)-1/2(2rr)-3/2( - )i+'D,._,U)[R(Pl]             into antiparticles, with no change in p and A. However,
                                                               in quantum mechanics there appear phases in (9.1) and
                 XmiH(w+ Ipi )-'e- ip . x
                                                               (9.2), which we shall see are necessarily momentum-
                           [antiparticle created],   (8.12)    dependent for massless particles. In order to get these
V.*(x; p,A)= (2W)-1/2(2rr)-3/2( - )iHD,,_,U)'[R(P)]            phases right it is necessary first to define the action of T
                                                               and P on our standard states IA) of momentum
                 XmiH(w+ Ipl )-Ae+ ip . x
                                                               k={O,O,K}, and then use the definition (2.31) of Ip,A).
                         [antiparticle destroyed].   (8.13)       We will define "standard phases" ~A(T) and ~A(P) by
   Now suppose that m -> 0, or, more precisely, that                       TIA)=~,*(T)U[R,]IA),                       (9.3)
Ipl/m-toc. The only wave functions among (8.6)-
(8.13) that survive in this limit are (8.6), (8.7), (8.12),                PI;\)= (- )i+A~A'(P)U[R,]I-A),             (9.4)
and (8.13) for A= - j, and (8.8), (8.9), (8.10), and           where R, is some fixed but arbitrary rotation such that
(8.11) for A= + j. This agrees with the situation for
m=O, in which case we know that <p, and <p/ can only                             R,{O,O,l) =   to, 0, -1),            (9.5)
create and destroy particles with A= - j and antipar-
                                                               so that U[R,] 1;\) is a state of momentum {O,O, -K).
ticles with A= + j, while x, and x,t only create and
                                                               [The factor (- )i+A is extracted from ~A *(P) for con-
destroy particles with A= + j and antiparticles with
                                                               venience later.] In order to calculate the effect of T and
A= - j. Furthermore, if we set A= - j in (8.6) or A= + j
                                                               P on Ip,A) we need the well-known formulas
in (8.10) we see that these wave functions reduce for
               °
Ipl/m-toc to the particle destruction wave function
given for m = by (7.7). The same agreement is ob-
tained on comparison of (8.7) and (8.11) with (7.8),
                                                                                       TJ iT-l=-J"
                                                                                       TKiT-l=K"
                                                                                                                      (9.6)
                                                                                                                      (9.7)
(8.8), and (8.12) with (7.9), and (S.9), and (S.13) with                               PJ,P-'=J"                      (9.8)
(7.10). [The observation that particles described only
by <p,(x) are difficult to create or destroy for Ipi »m in                             PK,P-'=-K,.                    (9.9)
any helicity state other than A= - j is very familiar for      [It is easy to check that (9.6)-(9.9) are consistent with
electrons in beta decay.]                                      the commutation relations (2.16)-(2.18), if we recall
   The propagators for an internal <p or X line are given      that T is antiunitary.] According to (2.31) and (4.7),
in Ref. 1 as                                                   the state Ip,;\) is
        S ... (x-y)= -iTI ... ( -ia)Ll'(X-y; m),     (8.14)    [p,A)=TK/[p[ JI/'U[R(P)] exp[-iq,([ p[ ) K a][;\) , (9.10)
        ,~ ... (x-y)=   -ifi ... (-iJW(x-y; m).      (S.1S)    so therefore
[Recall that we are now using fields renormalized by a         TI p,;\)=~, * (T)[</ Ip I ]'/'U[R(P) ]
factor mi, so the factor m- 2i in Eq. (5.7) of Ref. 1 is
                                                                                              Xexp[i</>(I pi )K,]U[R,]IA) ,
absent here.] We see that the propagators given for
m=O by (7.1S) and (7.19) are the limits respectively of        Plp,A)= (- )iH~, *(P)[K/lpl ]1/'U[R(P)]
(S.14) and (8.15) as m -t 0. For m~O there is also a                                  Xexp[i</>(I pi )K,]U[R,]I-A).
"transition propagator" between <p, and X,.t, but it is        But
                                  ° °
proportional to m'i and disappears as m -> 0.
   In contrast, the Feynman rules for m = could not be         and thus
obtained as the limit as m -> of the corresponding
                                                               TI p,A)=~' *(T)[</I pi ]'/2U[R(p)R,]
rules for m>O, if we used one of the field types like
(j/2,j/2) which are forbidden by the theorem of Sec.                                   Xexp[ -i</>(lpl)K,]IA),       (9.11)
III. For example, it is well known that the propagator         Plp,A)= (- )iH~A'(P)[K/lpl ]'/'U[R(P)R,]
for a vector field has a longitudinal part which blows up
                                                                                  Xexp[ -i</>(lp[)K,]I-A).           (9.12)
as m-' for m -t 0; this is just our punishment for
attempting to use the forbidden (t.!) field type for j= 1         The rotation R(p)R, carries the z axis into the direc-
particles of zero mass.'                                       tion of -p, and must therefore be the product of R( -p)
334                                                                                                                         CHAPTER VI
                                                                                               .'
        U[R(p)R,] = U[R( -p)] exp[iip(p)J,J. (9.13)                   T\".(x)T-'=~_;(T)E C"'\"., (x, -x") ,                        (9.26)
The angle ip (P) depends on how we standardize R, and
                                                                                             .'
R(P), but we will fortunately not need to calculate it, as             TX.(x)T-'=~;(T)E C ••,X., (x,                      -x") ,   (9.27)
it will cancel in the field transformation laws. Using
                                                                                               .'
(9.13) in (9.11) and (9.12), and recalling that J.                    C\".(x)C-l=~_;(C)E            C••,-lX.,I(x),                 (9.28)
commutes with K" we have at last
                                                                                                          .'
                                                                       CX.(X)C-I=~j(C)( -           )2; E C•• ,-l\".,I(X) , (9.29)
Tlp,A)=~,*(T)    exp[iAip(P)]I-p, A),                 (9.14)
Plp,A)= (- );H~,*(P) exp[ -iAip(P)]I-p,        -xl.   (9.15)          P\".(X)P-I=~_j(P)X.(-X,                      x"),            (9.30)
   These one-particle transformation equations can be
                                                                       pX.(X)P-l=~j(P)\".(-x,                  x").                (9.31)
translated immediately into transformation rules for the
annihilation operator:                                          In deriving (9.26)-(9.31) it is necessary to fix the
Ta(p,X)T-'=~,(T)     exp[ -iAip(p)]a( -p, A),         (9.16)    antiparticle inversion phases as
Pa(p,X)P-I= (- );H~,(P)                                                           ij,(T)=~_,*(T)               ,                   (9.32)
                   Xexp[iAip(p)]a(-p, -X).            (9.17)                      ij,(C)=~-,*(C)               ,                   (9.33)
The antiparticle operators will transform similarly, but                          ij,(P) = (- )2;~_,'(P),                          (9.34)
perhaps with different "standard" phases ij,(T) and
ij,(p) :                                                        because any other choice of the ij, would result in the
                                                                creation and annihilation parts of the field transforming
Tb(p,A)T-'=ij,(T) exp[ -iAip(P)]b( -p, A),            (9.18)    with different phases, and would therefore destroy the
Pb (p,A)P-1= (- );Hij,(P)
                                                                possibility of simple transformation laws.
                                                                   It is interesting that the tran~formation rules (9.26)-
                       X exp[t"Aip(p)]b ( -p, -A).    (9.19)    (9.31) tum out to be identical with those derived in
And, of course, C just changes a's into b's and vice versa.     Sec. 6 of Ref. 1 for the case of massive particles, though
                                                                the derivation has been different in many respects. The
                Ca(p,A)C-'=~,(C)b(p,X),               (9.20)    same is true of the phase relations (9.32)-(9.34), except
                Cb(p,X)C-'= ij,(C)a(p,X).             (9.21)    that the only correlated particle and antiparticle in-
                                                                version phases are those of opposite helicity. In par-
The phases ~,(T,C,P), ij,(T,C,P) are partly arbitrary,'         ticular, (9.34) tells us that a left- or right-handed
partly determined by the structure of the Hamiltonian,          particle plus a right- or left-handed antiparticle together
and partly fixed by the specifically field-theoretic con-       have intrinsic parity
sidera tions below.
   In order to calculate the effect of T, C, and P on the                         ~_,(P)ij,(P)=            (- )';,                 (9.35)
fields \".(x) and x.(x), it will be necessary to use the        while the intrinsic parity of a massless particle anti-
well-known reality property of the rotation matrices            particle pair of the same helicity is not fixed by these
                DW[R]*=CDW[R]C-1 ,                    (9.22)    general field-theoretic arguments.
                                                                  If a particle is its own antiparticle' then we must set
where, with the usual phase conventions,
                                                                                     b(p,X)=a(p,X).                                (9.36)
         C".= (- );+'6., ,_.= [exp(i.-J2(;»].,..      (9.23)
                                                                In this special case, the (j,0) and (O,j) fields are related
We shall fix the rotation R, introduced in Eq. (9.5) as a       by
rotation of 1800 about the y axis, such that
                 DW[R,]=C-'= (- )2;C.                 (9.24)
                                                                                        .'
                                                                               X.t(x)=E C•• ,\"., (x) ,
                                                                                                     .'
                                                                                                                                   (9.38)
Another needed relation then follows from (9.13).
                                                                Also (9.36) requires that the antiparticle inversion
D.,W[R(p)]                                                      phases ij, be equal to the corresponding ~" and therefore
      = (- );H exp[ -iAip(P)]D•. _,W[R( -P)].         (9.25)    (9.32)-(9.34) provide relations between ~, and ~_,:
  The effect of T, C, and P on the fields (5.16) and                              ~,(T)=~_,'(T)                ,                   (9.39)
(5.17) can now be easily determined by using (9.16)-
                                                                                  ~'(C)=~-' "(C),                                  (9.40)
 9 For a general discussion, see G. Feinberg and S. Weinberg,
Nuovo Cimento 14, 571 (1959).                                                     ~,(P)=     (- )2;~_,*(P).                        (9.41)
MASSLESS PARTICLES AND GAUGE TRANSFORMATIONS                                                                                 335
However, there is still no necessity for any of these               and we write the transformation (10.1) as
phases to be real.
   Observe that (9.17) and (9.19)-(9.21) make sense                                   "'(x) --> exp(ie/,6)"'(X).           (10.3)
                                                                                          /'6=[1 0].
only if both the particle and its antiparticle each exist in
both helicity states h= ±j. For a particle not identical
with its antiparticle, this is now a part of the assumption
of C or P invariance, whereas in the case of massive
                                                                                               °     -1
particles it followed directly from the Lorentz invariance          There are other possible discrete or continuous chirality
of the S matrix.                                                    transformations, but our discussion will apply equally
   In contrast, T conservation leaves open the possibility          to all of them.
that the particle exists in only one of the two helicity               The question, of whether chirality conservation im-
states, with an antiparticle of the opposite helicity.              plies zero physical mass, can be asked on two different
This is consistent with (9.26) and (9.27), which show               levels:
that T does not mix 'P. and x •. The same is true of the               (1) Suppose that Ho is chosen so the interaction
combined inversion CP.                                              representation fields 'P.(x) and/or x.(x) describe free
                                                                    particles of zero mass, and suppose that the interaction
CP'P.(X)P-1C-l                                                      density JC(x) is invariant under the transformation
                                                                    (10.1). Is the renormalized mass then zero in each order
             =~i(C)~-j(P)L C..,-l'P.' t( -x,       x"),   (9.42)    of perturbation theory?
                              .'                                       (2) Suppose that there exists a unitary operator
                                                                    which induces the transformation (10.1) on the Heisen-
                                                                    berg representation fields, and which leaves the physical
                                                                    vacuum invariant. Can we then prove anything about
                              .'
             =~-j(C)~j(P)L         C..,-lx"t( -x, x"),    (9.43)    the physical mass spectrum?
                                                                       Our answers to these two questions are (1) yes, and
and of course it is also true of CPT.                               (2) not necessarily. Let us consider perturbation theory
                                                                    first. The bare momentum-space propagator of the 'P.
                                                                    field is given by (7.23) as
      X. CHIRALITY AND RENORMALIZED MASS
   We have not made any distinction, either here or in                               S(g)= -iII(g)/ (q'-ie).               (10.4)
Ref. 1, between the mass characterizing the free field
                                                                    The exact propagator is
and the mass of the physical particles. This was
purposeful, because it is always possible and preferable                          S' (q) = S(q)+S(g)2:(*) (q)S' (q)
to arrange that the unperturbed and the full Hamil-
                                                                                         = [S-l(q)-2:(*) (q)J-l.           (10.5)
tonians have the same spectrum. But there still remains
the question: Under what circumstances will the physi-              The (2j+l)X(2j+l) matrix 2: 1*) (g) is the sum of all
cal particle mass in fact be zero? The classic conditions           proper diagrams with one 'P. line coming in and one
are gauge invariance or chirality [i.e., "/,6"J conserva-           going out, with no propagators on these lines. Stripping
tion. Gauge invariance is without content for the (j,O)             away its external propagators changes the Lorentz
and (O,j) fields discussed in this article, so we are led to        transformation behavior of 2: ..,1*) from that of 'P.'P.' * to
consider the implications of chirality conservation. Our            that of X.X., *, so Lorentz invariance dictates its form as
work in this section is entirely academic except for
j=t, but even in this familiar case our conclusions are                            2: ..,1*) (g) =ifi.., (q)F( _q2).       (10.6)
not quite in accord with public opinion.
   For definiteness we will understand chirality conser-            Using (6.8) now gives the exact propagator (10.5) as
vation as invariance under a continuous transformation
                                                                                                  -iII(q)
                                                                              S'(q)=-------                                (10.7)
                                                                                 [1- (_q2)iF( _q2)J[q2-ie]
In the 2(2j+l)-component formalism lO we unite the                  We have not used chirality yet. In general the self-
(j,O) and (O,j) fields 'P.(x) and x.(x) into a (j,0)(f) (O,j)       energy parUI*) (q), and hence the function F( _q2), may
field t(x):                                                         have a pole at q'=0, due to graphs with one intermedi-
                                                                    ate X.line. But under any form of chirality conservation
                                   'P(X)]
                        t(x)= [                           (10.2)    such graphs are forbidden. (For example, there is no
                                   X (x)                            neutrino X. field.) Hence F( -q') has no pole at q2=0,
  10 See Ref. 1. Many features of this formalism have been worked
                                                                    and therefore S' (q) does have such a pole, corresponding
out independently in unpublished work by D. N. Williams.            to a particle of zero renormalized mass.
336                                                                                                                       CHAPTER VI
FEY N MAN R U L E S FOR ANY S PIN. 11. MAS S L E SSP ART I C L E S B895
  Of course there may also be another particle with non-                  Unfortunately this theorem offers no proof that the
zero mass m given by                                                   accepted chirality-conserving weak interactions do not
                                                                       give a massive neutrino, with a distinct massive
                         1=m2iF(m2).
                                                                       antineutrino. It should be kept in mind that we cannot
But such a particle would have to be unstable so m                     decide just by looking at a Lagrangian whether the
would lie off the physical sheet.                                      physical one-particle states will be purely neutral or not.
   Now let us turn to the second question. We assume                   Of course, any massless particle can be called purely
that there exists a unitary chirality operator X(o) which              neutral, but this is not relevant if what we want is to
transforms the Heisenberg representation fields into                   prove the absence of massive particles.
                                                                          We can say somewhat more about the mass spectrum
              X(O)I".H(X)X-l(o)=ei'I".H(X) ,                (10.8)     if we are willing to assume parity conservation [which
              X(o)x.H(x)X-l(o)=e-i·X.H(x) ,                 (10.9)     links I".(x) with x.(x) by (9.30) and (9.31)J as well as
                                                                       chirality conservation. In this case the propagator of
and which leaves the physical vacuum invariant. It is                  I".(x) or x.(x) can receive no contribution from any
certain that this assumption alone is not sufficient, in               massive one-particle state that has no degeneracy, be-
itself, to allow us to prove anything about physical                   yond the (2j+ I)-fold degeneracy associated with its
particle masses, because we have not yet said anything                 spin, and an additional 2-fold degeneracy if it happens
to connect the fields I".(x) and x.(x) with each other.                to have a distinct antiparticle. For it would then be
For instance, we might choose I".(x) as (1+1'6)/2 times                possible to form a one-particle chirality eigenstate 1p,!,):
the electron field, and x.(x) as (1-1',)/2 times the
muon field. Then (10.8) and (10.9) are obviously                                        X(o) 1 p,!,) = exp(io~) 1 p,!,)        (10.14)
satisfied if we choose the chirality operator as                       by taking 1 P,I') as either the one-particle state itself or
X(o)=exp{io [electron number                                           some linear combination of it and its charge conjugate.
                        -muon numberJ).                    (10.10)     Lorentz invariance requires that
But we can hardly conclude from this that the electron                 (OII".H(X) 1 p,I')=N .(2w)-1/2D,.W[L(p)Je ip .%,       (10.15)
or muon is massless.                                                    (01 X.H (x) 1 p,I')=N x(2w)-1/2.ii,.W[L(p)Je iP ·%.    (10.16)
   Clearly, the only information that can be gleaned
solely from the existence of X (0) is just what would                  Parity conservation tells us further that
follow from any ordinary additive conservation law.                                                                            (10.17)
Namely, the propagator of I".(x) or x.(x) can receive no
contribution from any massive purely neutral one-                      This is just to say that the matrix element of the
particle state that has no degeneracy beyond the                       2(2j+1)-component field ,!(x) satisfies the generalized
(2 j+ 1)-fold degeneracy associated with its spinY For                 Dirac equation [Eq. (7.19) of Ref. 1J, which is to be
any such state Ip,!,) would have to be a chirality                     expected under the assumption of parity conservation.
eigenstate                                                             But (10.8) and (10.14) give N=O unless ~=+1, while
                                                                       (10.9) and (10.14) give N =0 unless ~= -1, so we may
        X(o) 1 p,!,)=eil'l p,l')   (I'=-j, "',j),          (1O.11)     conclude that N = O. Again, this proof does not apply for
and thus                                                               zero mass, because the two helicity states are uncon-
                                                                       nected by space rotations and hence may have differ-
                                                           (10.12)     ent es.
          (OII".Ht(x)lp,!,)=O unless          ~=-1.        (10.13)        [It might at first sight appear that the free fields
                                                                       constructed in Ref. 1 provide a counter-example to this
But CP or CPT conservation tells us that these two                     proof. In the absence of interactions they certainly
matrix elements are proportional to each other, and                    describe nondegenerate particles with nonvanishing bare
hence must both vanish. [Observe that we cannot                        and physical masses, and yet there is no coupling that
forbid a massless purely neutral particle from contrib-                violates either parity or chirality. The trouble with this
uting to the propagator of I".(x) or x.(x), since CP and               argument is that no operator X (0) can be constructed;
CPT reverse its helicity, and its two helicity states                  in fact Eqs. (7.23) and (7.25) of Ref. 1 show that
might have opposite chirality. This is consistent with
                                                                                         (T{ 1".(x),x.,t(y»))o,.oO.            (10.18)
the remark' that it is only a matter of convention
whether we call a massless particle purely neutral or                  This point is more transparent in the conventional
not.J                                                                  language in which we would just say that the free-field
  11 This is an abbreviated version of a proof given by B. Touschek,
                                                                       Lagrangian does not conserve chirality. As m -> 0,
in Lectures on Field Theory and the Many·Body Problem, edited by       (10.18) vanishes as m2i, and for m=O it is easy to con-
E. R. Caianiello (Academic Press Inc., New York, 1961), p. 173.        struct X (0) explicitly. J
It is not clear from Touschek's article whetber he feels that this
theorem implies that the neutrino cannot have finite mass. As             The last proof is of some interest, because it shows
indicated herein, I do not.                                            that unless the vacuum or electron is degenerate, the
MASSLESS PARTICLES AND GAUGE TRANSFORMATIONS                                                                                   337
mass of the electron cannot arise entirely from electro-            field ",(x), which transforms according to the reducible
magnetic interactions, which conserve both parity and               (j,O)(f) (O,j) representation; for j =! this yields the
chirality. But it is useless for the neutrino, and we are           Dirac formalism, while for j = 1 it corresponds to the
forced to conclude that only perturbation theory can                union of the irreducible fields E±iB into a six-vector
account for its zero mass.                                          {E,B}. Here again there is no distinction to be made
                                                                    between zero and nonzero mass, so we need not repeat
                    XI. CONCLUSIONS                                 here the details of the 2(2j+ 1)-component formalism lO
  The Feynman rules for massless particles in the                   constructed in Ref. 1.
(2 j+ 1)-component formalism are identical with those                  We have seen no hint of anything like gauge invari-
derived in Ref. 1 for particles of mass m>O. It is only             ance in our work so far. In fact, the really significant
necessary to pass to the limit m -. 0 to obtain the cor-            distinctions between field theories for zero and nonzero
rect propagators for internal lines, and wave functions             mass arise when we try to go beyond the (2j+ 1)- or
for external lines. Also, the various possible invariant            2(2j+ 1)-component formalisms. In particular, for
Hamiltonians JC(x) can be constructed out of the fields             m>O there is no difficulty in constructing tensor fields
",.(x) and x.(x), with no distinction between massive               transforming according to the (j/2,j/2) representations,
and massless particle fields.                                       while for m=O this is strictly forbidden by the theorem
   Furthermore, the transformation properties of ",,(x)             proven in Sec. III. We will see in a forthcoming article
and x.(x) under T, C, and P are the same for m>O and                that the attempt to evade this prohibition and yet keep
m=O. If P and/or C are conserved it is very convenient              the S matrix Lorentz-invariant yields all the results
to unite ",.(x) and x.(x) into a 2(2j+1).component                  usually associated with gauge invariance.
                                                           H. S.   MANIt
                                  Physics Department, Columbia Uni'lJersity, New York, New York
                                                   (Received 23 December 1963)
                In a previous paper, a simplified model was used to study the effects of strong interactions on the weak
             interaction theory of Feinberg and Pais. In this paper, we use a more general argument, a power count
             based upon the \Vard-Takahashi-Nishijima multimeson vertex function identity, to show that the same
             conclusion remains valid even when crossed ladder graphs are included. Our conclusion may not apply, how-
             ever, to the modified program of peratization where W - W scattering plays an essential role.
                We ~ve a pu~ely S-matrix-theoretic proof of the conservation of charge (defined by the strength of soft
              phot?D m~eractIons) and the equality of gravitational and inertial mass. Our only assumptions are the Lor-
              entz .mvanance and pole structure of the S matrix, and the zero mass and spins 1 and 2 of the photon and
              gravlton. We also prove that Lorentz invariance alone requires the S matrix for emission of a massless
              particle of arbitraryint~ger spin to satisfy a "mass-shell gauge invariance" condition, and we explain why
              there are no macroscopIc fields corresponding to particles of spin 3 or higher.
theory as a complete dynamical theory even for strong                The polarization '±'(q) is defined by
interactions alone, and the presence of massless particles
                                                                                           '±'(q)=R(q)",,±' ,                        (2.4)
will certainly add a formidable technical difficulty, since
every pole sits at the beginning of an infinite nnmber of            where R(q) is a standard rotation that carries the z axis
branch cuts. All such "infrared" problems are outside                into the direction of q, and '±' is the polarization for
the scope of the present work. We shall simply make                  momentum in the z direction:
believe that there does exist an S-matrix theory, and
that one of its consequences is that the S matrix has                                   ,±'=(1, ±i, 0, O}/V1.                        (2.5)
the same poles that it has in perturbation theory, with              Some properties of '±'(q) are obvious:
residues that factor in the same way as in perturbation
theory. (We will lapse into the language of Feynman                                      ,±:(q)'±'(q) = 1,                           (2.6)
diagrams when we do our 2.. bookkeeping in Sec. IV,                                       ,±,(q)'±'(q) = 0,                          (2.7)
but the reader will recognize in this the effects of our
                                                                                                ,±,* (<1) = ''I'' (<1) ,             (2.8)
childhood training, rather than any essential dependence
on field theory.)                                                                                ,±O(q)=O,                           (2.9)
    When we refer to the "photon" or the "graviton" in                                         q,,±'(tj)=O,                         (2.10)
 this article, we assume no properties beyond their zero
mass and spin 1 or 2. We will not attempt to explain why             L± f±'(q)'±'*(q) = 11"(<1)= g"'+ (ij'q'Hl'q')/ iqi 2 ,
 there should exist such massless particles, but may guess                                         [Ii'={-q,iqi}], (2.11)
 from perturbation theory that zero mass has a special               L± f±"(q)'±"(q)f±"*(q)'±'2*(<1)
 kind of dynamical self-consistency for spins 1 and 2,
 which it would not have for spin O.                                         = H II"" (<1)II""'(<1)+ II""(q)II""(q)
    Most of our work in the present article has a counter-                                       - II""(q)II"'2(<1)}.               (2.12)
 part in Feynrnan-Dyson perturbation theory. In a                    We also note the very important transformation rule,
 future paper we will show how the Lorentz invariance                proved in Appendix A,
 of the S matrix forces the coupling of the photon and
 graviton "potentials" to take the same fonn as required             (A,"-q"M/ i qi ),±'(A<1) =exp{±i8[q,A]}'±'(q) , (2.13)
 by gauge invariance and the equivalence principle.                  with 8 the same angle as in (2.1).
          II. TENSOR AMPLITUDES FOR MASSLESS
                                                                       If it were not for the q' term in (2.13), the polarization
                PARTICLES OF INTEGER SPIN                            "tensor" f±Pl.. "I::I./ i would be a true tensor, and the
                                                                     tensor transformation law (2.3) for M±""""i would be
   Let us consider a process in which a massless particle            sufficient to ensure the correct behavior (2.1) of the
is emitted with momentum q and helicity ±j. We shall                 S matrix. But '±' is not a vector,' and (2.3) and (2.13)
call theS-matrix element simply S±j(q,p), letting p stand            give the S-matrix transformation rule
for the momenta and helici ties of all other particles
participating in the reaction. The Lorentz transforma-               S±i(q,P) = (21 q I)-1/2 exp{±ij8 (q,A)}
tion property of S can be inferred from the well-known                          X[,±"(Aq)- (Aq)"A,O,±'(Aq)/ iqi]*' ..
transformation law for one-particle states'; we find that                       X [,±'i(Aq)- (Aq)"iA,',±'(Aq)/ i q i]*
S±j(q,p)~       (IAql/lql)li2                                                                    XM ±..,...,,(Aq,Ap). (2.14)
                      Xexp[±ij8 (q,A)]S±i(Aq,Ap).           (2.1)    For an infinitesimal Lorentz transformation A',= 0',
The angle 8 is given in Appendix A as a function of the              +w"" we can use (2.2) and the symmetry of M to put
momentum q and the Lorentz transformation A',.                       (2.14) in the form
  We prove in Appendix n that, in consequence of                     S±Jeq,p) = (IAq I/ I ql )1/2 exp{ ±ij8 (q,A) }S±i(Aq,Ap)
(2.1), it is always possible for integer j to write S±i as
the scalar product of a "polarization tensor" and what                          - je21 q 1')-'1/2(w,O,±'*eq))q'le±"*eq)' ..
Stapp' would call an "M function":                                                             X,±'i*(<1)M±." ... ,,(q,p). (2.15)
S±i(q,P) = e2[ q I)-1/2'±"*(q) ...                                    Hence the necessary and sufficient condition that (2.14)
                           X,,,ti*(q)M±"""i(q,P)            (2.2)     agree with the correct Lorentz transformation property
                                                                      (2.1), is that S± vanish when one of the '±" is replaced
 with M a symmetric tensor,' in the sense that                        with q':
        M ±'" ·,'i(q,p)=A,,'1. .. .\,j"iM ±'i"·'i(Aq,Ap).   (2.3)              q",±"'*(q)' .. 'x'i* (<1)k! ±." ... ,,(q,p) = O. (2.16)
   Ii    functions for massive particles \verc introduced by H.
        jf                                                              For .i = 1 this may be expressed            'IS    the conservation
 Stapp, Ph),s. Rev. 125, 2139 (1962). See also A. O. Barut, 1.
 Muzinich, and D. N. Williams, Ph),s. Rev. 130,442 (1963).              7The transformation rule (2.13) shows that EJ.IJ(q) transforms
                                             + + -}.
    e \Ve use a real metric, with signature {+         Indices are    according to one of the infinite-dimensional representations of the
 raised and lowered in the usual way. The inverse of the Lorentz      Lorentz group discllssed by V. Bargmann and E. P. \Vigner,
 transformation AJJ.~ is [A-l]"p=A/'.                                 Proc. Nat!. Acad. Sci. 34, 211 (1948).
340                                                                                                              CHAPTER VI
exchanged particle. Hence it is possible to give a purely      e± for photon helicities ±1. Parity conservation would normally
                                                               require that e+ = L (with an appropriate convention for the photon
S-matrix-theoretic definition of the vertex amplitude r        parity). However if space inversion takes some particle into its
for any set of physical particles, as a function of their      antiparticle then its "right chargeJJ e+ will be equal to the "left
                                                               charge" e_ of its antiparticle, and we will see in the next section
momenta and heIicities; the coupling constant or con-          that this gives e+=t_= - L . In this case we speak of a magnetic
stants define the magnitude of r. (As discussed in the         monopole rather than a charge. The same conclusions can be
introduction, we will not be concerned in this article         drawn from CP conservation. We will not consider magnetic
                                                               monopoles in this paper, though in fact none of our work in Sec. IV
with whether the above remarks can be proven rigor-            will depend on any relation between e+ and e_. Time-reversal
ously in S-matrix theories involving massless particles,       iuvarianl;e allows 1:IS to take e as real.
MASSLESS PARTICLES AND GAUGE TRANSFORMATIONS                                                                                        341
the factors 2, i, and .. being separated from e in obedi-                  On the other hand, if a is massless or extremely rela-
ence to convention. And in the same way we may define                      tivistic, then Ea»ma and (3.S) gives
a "gravitational charge" f, by the statement that the
j=2 vertex amplitude is"                                                                                                           (3.10)
                 2if(s..G) [/'(2..)'<l .., (p,'±'*(tJJ'                    [Formulas (3.8) or (3.10) should not of course be under-
                                                                  (3.5)    stood to mean anything more than already stated in
                     (2 .. )9/2[2E(p)J(2 Iq i )1/ 2                        (3.7). However, they serve to remind us that the re-
                                                                           sponse of a massless particle to a static gravitational
the extra factor (s..G) [/2 (where G is Newton's constant)
                                                                           field is finite, and proportional to J. J
being inserted to make f dimensionless.
                                                                              The presence of massless particles in the initial or
  In order to see howe and f are related to the usual
                                                                           final state will also generate poles in the S matrix,
charge and gravitational mss, let us consider the near
                                                                           which, like that in (3.7), lie on the edge of the physical
forward scattering of two particles with masses ma and
                                                                           region. It is therefore possible to measure the coupling
mb, spins J a and h, photon coupling constants eaand eb,                   constants e and J in a variety of process, such as
and graviton coupling constants fa and J.. As the in-
                                                                           Thomson scattering or soft bremsstrahlung, or their
variant momentum transfer 1= ~ (pa~ pa')2 goes to
                                                                           analogs for gravitons. All these different experiments
zero, the S matrix becomes dominated by its one-
                                                                           will give the same value for any given particle's e or f,
photon-exchange and one-graviton-exchange poles. An
                                                                           for purely S-matrix-theoretic reasons. The task before
elementary calculation" using (2.11) and (2.12) shows
                                                                           us is to show how the e's and 1's are related for different
that for t --+ 0, the S matrix becomes
                                                                           particles.
Ol1al1a'OUb"b'
---[eaeb(Pa' Pb)                                                            IV. CONSERVATION OF e AND UNIVERSALITY OF f
 4..'E aE bt
             +S..GfaJ.{ (pa' pb)'-ma'mb2/2}].                     (3.6)       Let Spa be the S matrix for some reaction a --> /3, the
                                                                           states a and /3 consisting of various charged and un-
If particle b is at rest, this gives                                       charged particles, perhaps including gravitons and
                                                                           photons. The same reaction can also occur with emission
                                                                  (3.7)    of a very soft extra photon or graviton of momentum
                                                                           q and helicity ± 1, or ±2, and we will denote the corre-
                                                                           sponding S-matrix element as Spa±l(q) or SPa±2(q).
Hence we may identify ea as the charge of particle a,                         These emission matrix elements will have poles at
while its effective gravitational mass is                                  q = 0, corresponding to the Feynman diagrams in which
                    ma= Ja{2Ea~ (m.','Ea)}.                       (3.8)    the extra photon or graviton is emitted by one of the
                                                                           incoming or outgoing particles in statesa or /3. The poles
If particle a is nonrelalivistic, then Ea~ma, and (3.8)                    arise because the virtual particle line connecting the
gives its gravitational rest mass as                                       photon or graviton vertex with the rest of the diagram
                                                                           gives a vanishing denominator
                                                                 (3.9)
  1I  Proper Lorentz invariance alone would not rule out different         1/[(pn+q)'+mn2J= 1j2pn·q
values for the gravitational charges f ± for gravitons of helicity ±2.                                   (particle n outgoing) ,
Parity conservation (with an appropriate convention for the                                                                         (4.1)
graviton parity) requires that i+= f-. This conclusion holds even          1/[(pn~q)2+mn2J= ~        1/2pn·q
for the magnetic monopole case discussed in footnote 10, since
then f+=J-, and we will see in Sec. IV that the antiparticle has                                         (particle n incoming) .
"left gravitational charge" f- equal to f-. The same conclusions
can be drawn from CP conservation. Time-reversal invariance                  For Iql sufficiently small, these poles will completely
allows us to take f as real.                                               dominate the emission-matrix element. The singular
   12 The residue of the pole at t=Ocan be mut easily calculated by
adopting a coordinate system in which q=pa,-pa=Pb-Pb' is a                 factor (4.1) will be multiplied by a factor ~i(2 .. )-'
finite real light-like four-vector, while p", Pl., pal, Pb' are on their   associated with the extra internal line, a factor
mass shells, and hence necessarily complex. Then the gradient
terms in (2.11) and (2.12) do not contribute, because q,p"~q'P'
 ~ 0, so that II" may be replaced by       g,,,yielding (3.6). We are                         2ie[pn' '±* (q) J(2 .. )'
justified in using (3.6) in the physical region (where pa, Ph, Pal, Ph'                                                             (·U)
are real and q is small, though not in the direction of the light cone)                          (2 .. )3/2(2 Iql )[/2
because Lorentz invariance tells us that the matrix element                or
depends only upon sand t. Lorentz invariance is actually far from
trivial in a perturbation theory based on physical photons and                           2if(S..G) [/2[pn' <± *(<1) J'(211")'
gravitons, since then the Coulomb force and Newtonian attraction                                                                    (4.3)
must be explicitly introduced into the interaction in order to get                               (2 .. )'/2(2Iql)1/2
the invariant S matrix (3.6). (Such a perturbation theory 'will be
discussed in an article now in preparation.) The Lorentz-invariant         arising frolll the vertices (3.4) or (3.5), and a factor Spa
extrapolation of (3.6) into the physical region of small t is the
analog, in S-matrix theory, of the introduction of the Coulomb             for the rest of the diagram. Hence the S matrix for soft
and Newton forces in perturbation theory.                                  photon or graviton emission is given in the limit
342                                                                                                                CHAPTER VI
                        X[   r:.• 'I/.f.[P"E±*(q)J']
                                           (p.-q)
                                                     S~.,        (4.5)
                                                                         The requirement that (4.12) vanish for all such po', can
                                                                         be met if and only if all particles have the same gravita-
                                                                         tional charge. The conventional definition of Newton's
the sign '1/. being + 1 or -1 according to whether                       constant G is such as to make the cornmon value of the
particle n is outgoing or incoming.                                      f. unity, so
   These emission matrices are of the general form (2.2),                                     f.=1 (alln)                   '(4.14)
i.e.,                                                                    and (3.8) then tells us that any particle with inertial
                                                                 (4.6)   mass m and energy E has effective gravitational mass
                                                                                                 iii=2E-m'/E.                 (4.15)
where M. and M •• are tensor M functions                                 In particular, a particle at rest has gravitational mass
                                                                         iii equal to its inertial mass m.
                                                                             It seems worth emphasizing that our proof also
                                                                         applies when some particle n in the initial or final state
M"(q, a --> fJ) = (2".)-3/2(&rG)'/2                                      is itself a graviton. Hence the graviton must emit and
                                                                         absorb single soft gravitons (and therefore respond to a
                        X[r:. 'I/.f.Pn"Pn'/(p.·q)JS~..           (4.9)   uniform gravitational field) with gravitational mass 2E.
                                                                         It would be conceivable to have a universe in which all
However, we have learned in Sec. II that the covariance                  f. vanish, but since we know that soft gravitonsinteract
of M. and M •• is not sufficient by itself to guarantee the              with matter, they must also interact with gravitons.
Lorentz invariance of the S matrix; Lorentz invariance                       Having reached our goal, we may look back, and see
also requires the vanishing of (2.2) when anyone E±'(q)                  that no other vertex amplitudes could have been used
is replaced with g'. For photons this implies (2.17), i.e.,              for q --> 0 except (3.4) and (3.5). A helicity-flip or
                                                                         helicity-dependent vertex amplitude could never give
        O=q'M,(q, a --> (j)= (2".)-"'[r:.         'I/.e.]S~.,   (4.10)   rise to the cancellations between different poles [as in
                                                                          (4.10) and (4.12) ] needed to satisfy the Lorentz in-
                                                                         variance conditions (2.17) and (2.19). It is also interest-
so if    S~.   is not to vanish, the transition a --> {j must
                                                                         ing that such cancellations cannot occur for massless
conserve charge, with                                                    particles of integer spin higher than 2. For suppose we
                                                                (4.11)   take the vertex amplitude for emission of a soft massless
                                                                         particle of helicity ±j (j=3, 4, ... ) as
For gravitons Lorentz invariance requires (2.18), which                                    2igW(2".)4(E± *(q). p);a ••,
                                                                                                                              (4.16)
   "Formula (4.4) is well known to hold to all orders in quantum                            (2".)'/2[2E(p)J(2J ql )1/2
electrodynamic perturbation theory. See, for example, J. M. Jauch
and F. Rohrlich, Theory of Photons and Electrons (Addison-               in analogy with (3.4) and (3.5), the S matrix S~«±;(q)
Wesley Publishing Company, Inc., Reading, Massachusetts,                 for emission of this particle in a reaction a --> {j will be
1955), p. 392, and F. E. Low, Ref. 14.
   If It has been shown by F. E. Low, Phys. Rev. 110, 974 (1958),
                                                                         given in the limit q --> 0 by
that the next term in an expansion of the S matrix in powers of
 Iq I is uniquely determined by the electromagnetic multipole            S~.±;(q)   .... (2".)-3/'(21 ql )-1/2
moments of the participating particles and by S~a.. However, this                   x[r:. 'I/.g.W[P,·E±*(q)Ji/(po·q)]S~..     (4.17)
next (zeroth-order) term is Lorentz-invariant for any values of the
multipole moments .
   .. Relations like (4.4) and (4.5) are also valid if S~a±l(q),         This is only Lorentz invariant if it vanishes when any
S,a",±2(q), and Sfja are interpreted as the effective matrix: elements
for the transition a ~ p, respectively, with or without one extra        one E±' is replaced with g', so we must have
soft photon or graviton of momentum q, plus any number of un-
observed soft photons or gravitons with total energy less than some                                                           (4.18)
small resolution .1.E. [For a proof in quantum-electrodynamic
perturbation theory, see, for example, D. R. Yennie and H. Suura,
Phys. Rev. 105, 1378 (1957). The same is undoubtedly true also           But there is no way that this can be satisfied for all
for gravitons, and in pure S-matrix theory.]                             momenta p. obeying (4.13), unless j=1 or j=2. This
MASSLESS PARTICLES AND GAUGE TRANSFORMATIONS                                                                          343
                                cosEl
                        61" = [ -sinEl
                                          sinEl
                                          cose
                                                  -XlcosEl-X.sine
                                                   XlsinEl-X,cosEl
                                                                           Xl cosEl+X, sine
                                                                          -XlsinEl+X,cosEl
                                                                                                1
                                                                                                                     (A3)
                          ,      Xl        X,          l-X'j2                     X'j2'
                                 Xl        X,           -X'j2                    1+X'j2
                        X'""XI'+X,'.
(The rows and columns are in order 1, 2, 3, 0.) Wigner' has noted that this group is isomorphic to the group of
rotations (by angle e) and translations (by vector (XI,X,}) in the Euclidean plane. In particular the "transla-
tions" form an invariant Abelian subgroup, defined by the condition El=O, and are represented on the physical
Hilbert space by unity. It is possible to factor any 61", into
                                     cose sinEl                          -Xl
                                                                                           1
                           61"-
                             ,-
                                [   -sine cose
                                      0
                                      o
                                              o
                                              o o
                                                    ~ ~l[i 1.
                                                       1
                                                                      -X.
                                                                    l-X'j2
                                                                            X,
                                                                            Xl
                                                                           X'j2
                                                              Xl X, -X'j2 1+X'j2
                                                                                 .                                   (A4)
Hence CR(q,A) does belong to the little group.                     or, recalling that E,:'=O,
  It was shown in Ref. 3 that, as a consequence of (AS),
the S matrix obeys the transformation rule (2.1), with                     (A/-AN.')e,:'(Aq)=exp[±i8(q,A)J<,:;(q).
EJ(q,A) given as the EJ angle of CR(q,A):                          This also incidentally shows that EJ(q,A) does not
                                                                   depend on Iql.
             EJ(q,A)= El[.e-1(q)A-1.e (Aq)].             (AU)
                                                                     We have not had to define the rotation R(~) any
  We now tum to the polarization "vectors" E;,t(~),                further than by just specifying that it carries the z axis
defined in Sec. II by                                              into the direction of q. However, the reader may wish to
                                                                   see explicit expressions for the polarization vectors, so
               E,:'(q)=R'.(~)E,:·,                        (A12)    we will consider one particular standardization of R(q).
                   <,:'",,{l,   ±i, 0, 0}/v1.             (A 13)   Write ~ in the form
                                                                                 <1= {-sin~ cos'Y, si~ sin'Y, co~}       (A22)
Observe that we could just as well write (A12) as
                                                                    and let R(~) be the rotation with Euler angles 0, fj, 'Y:
               E,:'(q)=.e'.(q)<,:·                       (A14)
                                                                                 co~ cos'y      sin'Y   - si~ cOS'Y 0]
since B ( Iq I) has no effect on E,:.
   An arbitrary CR'. of the form (A3) will transform E,:'           R"(~)= [ -C~?in'Y           cr       siU:i~n'Y ~.    (A23)
into
         CR''',:'=exp(±iEJ[CR]<,:'+ X,:[CR]K', (A15)
                                                                    Then (2.4) and (2.5) give
where
                                                                    .,:.(~)= (co~    cos'Y±i sin'Y,
                                                          (A16)                      -co~ sin'Y±i cos'y, sinfj, 0}/v1
                                                                                                      CI'= 1, 2, 3,0).   (A24)
If we let CR be the transformation (A6), and use (A14),             We can easily check (2.6)-(2.12) explicitly for (A24).
then (AIS) gives
                                                                               APPENDIX B: CONSTRUCTION OF
[.e-1(q)A-l]''',:·(Aq)=exp[±iEJ(q,A)},:·                                            TENSOR AMPLITUDES
                               +X,:(q,A)K',               (A17)        We consider a reaction in which is emitted a massless
where                                                               particle of momentum q andintegerhelicity±j,allother
X,: (q,A)                                                           particle variables being collected in the single symbol p.
                                                                    Let us first divide the set of all possible {q,p} into dis-
      X1[.e-1(q)A-1.e (Aq)]±iX,[.e-1(q)A-1.e (Aq)]                  joint equivalence classes, {q,p} being equivalent to
                                                                    {q',p'} if one can be transformed into the other by a
                            J(I/l
                                                          (AIS)     Lorentz transformation. (This is an equivalence rela-
                                                                    tion, because the Lorentz group is a group.) The axiom
Multiplying (AI7) by .e(q), we have the desired result              of choice allows us to make an arbitrary selection of one
                                                                    set of standard values {q,.P,} from each equivalence
A.·E,:·(Aq) = exp[±iEJ(q,A) J<,:'(q) + X,:(q,A)q·. (AI9)            class. so any {q.p} determines a unique standard {q,.P,}.
Note that it is the "translations" which at the same                such that for some Lorentz transformation L', we have
time make the little group non-semi-simple, and which                                     q=Lq"         p=Lp,.             (BI)
yield the gradient term in (AI9).
   The quantity X,:(p,A) may be found in terms of                   It will invariably be the case in physical processes that
E,:(q) by setting 1'=0 in (AI9):                                    the only AP, leaving both q and p invariant is the identity
                                                                    a••• so the L', in (BI) is uniquely determined by q and p.
                   X,:(q,A) Iql =A,'e,:'(Aq).             (A20)     (This is true. for instance. if p stands for two or more
                                                                    general four-momenta.) Hence the arguments {q.p}
Hence we may rewrite (AI9) as a homogeneous trans-
                                                                    stand in one-to-one relation to the variables {q"p,.L}.
formation rule:                                                        Now let us construct an M,J,"l···.;(q"p,) satisfying
(A.'-A,'q'/lql)<,:(Aq)=exp[±i8(q,A)h'(~)                  (A21)     (2.2) for each standard {q"p,}. A suitable choice is
                                                        D. Han
                        Systems and Applied Sciences Corporation, Riverdale, Maryland 20737
              +
              transpose of the coordinate transformation matrix in the E(2) plane. In the case of spin-
                 particles, it is shown that the polarization of neutrinos is a consequence of the require-
              ment of gauge invariance.
the rotation group where the rotation operator con-             pZ=Pf'P,... W z= Wf'Wf' '                      (2)
sists of the orbital and spin parts.
   We shall then show that the SL(2,C) part corre-       where
sponds to spin-T massless particles. From our                   Wf'=TEf'",ppvMaJ/ .
analysis, we conclude that neutrino polarization is
a consequence of the requirement of the invariance          We are considering in this paper massless parti-
under the translationlike transformation of the          cles, and assume without loss of generality that the
E(2)-like little group, which in the case of photons     momentum of a given particle is along the z direc-
is a gauge transformation.                               tion. Then
   In Sec. II, we reorganize Wigner's work l on                                                                (3)
massless particles into a form suitable for studying
the content of the isomorphism between the E(2)          where PI' are the eigenvalues of the operators Pf'"
and E(2)-like little groups. It is emphasized in Sec.    The generators of the little group which commute
m    that the case of the O(3)-like little group for     with PI' in this case are
massive particles will be helpful in understanding
                                                                NI =K1-JZ, Nz=Kz+JI> JJ'                       (4)
the relation between the little group and E(2). Sec-
tion IV contains a detailed discussion of the E(2)       where
group. It is pointed out that the E(2) representa-
tive given in our previous papei1 is adequate only              J j = TEjjkMik, K j =Mjo   •
for integer-spin particles, and that the E(2)-like       These operators satisfy the commutation relations
subgroup of SL(2,C) should be used for spin-T
massless particles.                                             [JJ,Nil=iN z ,
   In Sec. V, the representation of the E(2}-1ike lit-          [h,Nz)=-iN 1 ,                                 (5)
tle group is studied in detail. The four-by-four
little-group transformation matrix is reduced to a              [NI>Nz)=O,
form similar to the three-by-three regular represen-
tation of the E(2) group. The relationship between       which are like those for the generators of the two-
these two matrices is worked out in detail. In Sec.      dimensional Euclidean group which is often called
VI, the algorithm developed for photons is applied       the E(2) group. 1 The little group for massless par-
to the case of neutrinos. It is shown that the re-       ticles is therefore locally isomorphic to the E(2)
quirement of gauge invariance leads to the polari-       group.
zation of neutrinos.                                        The study of isomorphism does not stop at the
   In Appendix A, it is shown that the gauge             commutation relations. As in the case of the
transformation on photon or neutrino wave func-          O(3)-like little group for massive particles, the
tions is a transformation within an equivalence          study should include explicit construction of repre-
class defined by a given rotation angle in the E(2)      sentations, and this construction starts with the
plane. In Appendix B, the connection between the         choice of commuting operators.
photon wave function and the E(2) coordinate is             The above generators of the little group com-
discussed in detail. It is pointed out that, although    mute with N Z, where
there is a one-to-one correspondence between these              NZ=Nlz+Nzz,                                    (6)
two quantities, one cannot be transformed to the
other through a linear transformation.                   and the Casimir operator W Z of Eq. (2) takes the
                                                         form
operators are still the generators of Lorentz                 (12) are given in the literature. 4,7-10 Compared
transformations. However, since the commutation               with the 0(3) case, this matrix appears to be com-
relations of Eg. (5) are exactly like those for the           plicated, and this probably was the reason why not
generators of the E(2) group, we can learn lessons            many authors were encouraged in the past to study
from this simpler group.l                                     the E(2) problem for massless particles. In the fol-
   If we use the four-vector convention l ,4                  lowing sections, we shall examine whether this ma-
                                                              trix can be reduced to a transformation matrix in a
       x~=(x,y,z,t)   ,                                 (9)
                                                              two-dimensional Euclidean space.
then the generators of the coordinate transforma-
tion take the form
                                                                   III. LESSONS FROM THE O(3)-LIKE
             o
             0 -i                                               LITILE GROUP FOR MASSIVE PARTICLES
           0 0 0 0
       Nl=                                                       There are enough books and papers on the
             0 0 0                                            three-dimensional rotation group, and we are guite
             0 0 0                                            familiar with the language developed for studying
                                                              this group. Therefore, the most effective way to
            0 0 0 0                                           study the E(2) group and its isomorphism with the
              o
              0 -i                                            little group for massless particles is to organize the
       N z=                                         (10)
            0   0 0                                           material in a way parallel to the case of the 0(3)-
            0   0 0                                           like little group for massive particles.
                                                                 Also for the case of massive particles, the little
             o    -i 0 0                                      group consists of four-by-four Lorentz transforma-
                  o   0 0
                                                              tion matrices. However, in the Lorentz frame
        JJ= 0     0   0 0                                     where the particle is at rest, the four-by-four ma-
                                                              trix reduces to a three-by-three rotation matrix and
             o    0   0 0                                     a one-by-one unit matrix. From this reduced ex-
                                                              pression, we can immediately see the content of the
The above generators lead to the transformation
                                                              isomorphism between the 0(3) group and the little
matrices
                                                              group. In the E(2) case, with the form given in
                                                    (II)      Eg. (12), it is not easy to see the correspondence.
                                                                 The physical quantities associated with the 0(3)
where                                                         degrees of freedom are well known. The 0(3)
       Dl(u)=exp( -iuNl) '                                    group has three parameters. One of them is used
                                                              for the amount of rotation around a given axis,
       Dz(v)=exp(-ivN z ) ,                                   and two of them are for the orientation of the axis.
       DJ(O)=exp(-iOJ J ) .                                   The direction of the rotation axis is the direction
                                                              of the spin. All rotations with the same amount of
After a straightforward algebra, we can write the             rotation, but with different axis orientations, be-
D matrix as                                                   long to the same equivalence class. Therefore, the
                                                              reorientation of the axis, without changing the
                 cosO -sinO   -u                u             amount of the rotation, is a transformation within
                 sinO cosO    -v                v             an equivalence class. s Is there this kind of reason-
       D(u,v,O)=
                  u·    v·  l-rz/2             r Z/2          ing for the E(2)-like little group?
                      u·     v·       -r2/2   l+r z/2            Again for the 0(3)-like little group, both 0(3)
                                                              and SU(2) groups are needed. The SU(2) group is
                                                    (12)      needed for specifying particles with half-integer
                                                              spin, particularly the electron. The 0(3) group is
                                                              needed for the description of orbital motion of
                                                              quarks inside an extended hadron. 6 In studying
                TABLE I. Table of the little groups for massive and massless particles. Both the 0(3)
             and E(2)-like little groups are subgroups of the 0(3,1) or SL(2,C) groups depending on the
             spin. The little groups for electrons and hadrons have been studied in Refs. I and 6, respec-
             tively. The little group for photons has also been studied in Refs. 1-4 and 7 -10, but there
             is enough room for further investigation. The little group for neutrinos is expected to be a
             subgroup of SL( 2, C)
0(3,1) SL(2,C)
group, we can summarize what has been done and                  mutes with all three of the above generators. Thus
what to expect in Table I. In the following sec-                we have to solve the eigenvalue equation
tions, we shall use the above-mentioned parallelism
                                                                                                                (15)
with the familiar 0(3) group to exploit the contents
of the E(2)-like little group for massless particles.           where
   For this purpose, let us consider a two-                        The expression given in Eq. (16) leads us to the
dimensional Cartesian plane with coordinate vari-               temptation to say that m should take either integer
able u and v. In Ref. 2, we used the following                  or half-integer values, 2 in view of the form given
forms as the generators of the E(2) group:                      in Eq. (93) of Wigner's paper. I,ll However, the
                                                                continuity of the transformation requires that
       NI = -ia/au ,
                                                                group representatives be analytic. For this reason,
       N 2 =-ia/av,                                  (14)       we have to write the b'=O solution as
J) = -i(ua/av-va/au) . (17)
We noted in Ref. 2 that the operator N' com- only for integer values of m. Then, where are the
                 TABLE II. Solutions of Laplace's equation in two-dimensional space for E(2) representa-
              tives. Physical particles correspond to the finite-dimensional representation diagonal in J 3
              with b'=O.
             Commuting set                    Infinite-dimensional                    Finite-dimensional
              of operators                       representation                         representation
                                               exp[i(blu +b 2v)]
                                                                                                  I
      E 2 (v)=exp(-ivN 2 )           ,             (20)   (u 0, Vo, 1) takes the following matrix form:
                                                  (21)
                                                          It is easy to show, if not well known, that this ma-
                                                          trix is the three-parameter regular representation of
whose algebraic properties have been discussed in         the E(2) group. The geometrical properties of this
the literature." 7,8,10                                   E(2) matrix are discussed in AppendixA. Appen-
   For photons, only the "orbital" parts are needed       dix B contains a discussion of the vector spaces to
in Eq. (19), and we can still use the function given      which this matrix is to be applied.
in Eq. (17) as the state vector. Since m = I, ,p(r,lI)       It is also easy to calculate the inverse of the
of Eq. (17) is the coordinate variable in the two-        above form:
dimensional Euclidean space. The E(2) transfor-
mation in this case can be achieved through the                             cose sinll - u'
three-parameter three-by-three matrix for the regu-           E-1(u,v,lI)= -sinll cosll -v'                  (23)
lar representation of the E(2) group.                                              o       0
   The explicit form of this regular representation
is given in Sec. V. The similarity between the E(2)       where u' and v· are given in Eq. (13). This ma-
geometry and that of the familiar 0(3) group is           trix also has vanishing elements in the lower left
MASSLESS PARTICLES AND GAUGE TRANSFORMATIONS                                                               351
                                                                           l
                                                  (26)    Thus
                                                                                                  exp(~O/2) ].
                                                                           exp( -iO/2)
   Let us next consider antiparticles. Wigner's ori-          D(u,v,O)= (u+iv)exp(-iO/2)
ginal work I includes discussions of the little groups
for particles with negative energies. If the energy                                                        (30)
is negative, NI and N z of Eq. (10) should be re-           If this matrix is applied to the spin-up and
placed by their respective Hermitian conjugates.          spin-down states we get
J 3 is Hermitian. This replacement does not change
the E(2) commutation relations given in Eq. (5).
   Thus the little group for antiphotons is also iso-
                                                              D(u,v,O)   [~ 1
                                                                            =   l(u :~~)~:;;~~2) 1'        (31)
The spin-down state remains invariant under the u        theory based on a definite eigenvalue of rs is called
and v transformations, while the spin-up state un-       the "two-component theory of neutrinos." It is in-
dergoes spin flips.                                      teresting to note that this two-component theory is
   In the case of photons, the parameters u and v        a gauge-invariant theory.
generate gauge transformations. 2,4,10 Then, accord-
ing to Eq. (3 I), the gauge transformation changes
                                                                  VII. CONCLUDING REMARKS
the spin orientation if the spin is parallel to the
direction of momentum. However, according to                In this paper, it was noted first that the little-
Eq. (32), the gauge transformation does not change
                                                         group transformation matrix applicable to the pho-
the spin state if the spin is anti parallel to the
                                                         ton four-vector is somewhat complicated. We have
momentum.                                                reduced this unattractive form into a three-by-three
   The above analysis therefore leads us to the con-
                                                         matrix which can be compared with the regular
clusion that the spin of the spin-+ massless par-        representation of the E(2) group. This reduced
ticle should be antiparallel to the momentum in or-      form allows us to compare the little-group parame-
der that the spin state be gauge invariant. For          ters for massless particles with those in the well-
the case of spin- + antiparticles, we construct the E    known O(3)-like little group for massive particles.
matrix using the Hermitian conjugates of Tl> T 2 ,          The explicit construction of the isomorphism be-
and S3 given in Eq. (18), and calculate the D ma-        tween the little group and E(2) allows us to study
trix using again Eq. (29), Consequently, the spin        internal space-time parameters for neutrinos. We
of the antiparticle has to be parallel to the momen-     have shown in this paper that the polarization of
tum.                                                     neutrinos is a consequence of the requirement of
   However, it is important to realize that the          invariance under the translationlike transformation
above conclusions on the directions of neutrino and      of the little group of the Poincare group which, in
antineutrino spins depend on the choice of the E(2)      the case of photons, is a gauge transformation.
representation. The E(2) matrix which was given             As is well known, the subject of neutrino polari-
in Ref. 1 and which we used in the above analysis        zation has a stormy history. It was Weyl who first
has a vanishing element in the lower left comer,         proposed the two-component theory of neutrinos,
similar to the regular representation of the E(2)        but this suggestion was rejected by Pauli on the
group given in Eq. (22). However, this is not the        grounds that the theory does not preserve parity in-
only E(2) for which can be constructed as a sub-         variance. IS Since 1956,19 we have understood neu-
group of SL(2,C).14 We can also consider                 trino polarization as a manifestation of parity
      E'(u,v,O)=[E-I(u,v,O)jt.                    (33)   violation. The time has come for us to ask what
                                                         space-time invariance principle is responsible for
If we use the matrix in Eq. (28) and follow the          the polarization of neutrinos. In this paper, we
same reasoning as before, the spins of neutrinos         have provided an answer to this question.
and antineutrinos would be parallel and anti parallel
to the momentum, respectively.                                         ACKNOWLEDGMENT
   It is clear in either case that the polarization of
neutrinos is a conseqnence of the requirement of            We would like to thank Professor George A.
gauge invariance. Let us translate this conclusion       Snow for very helpful criticisms and stimulating
into the familiar language of the Dirac equation.        discussions.
It is easy to construct the NI and N2 operators ap-
plicable to the Dirac spinors for massless particles.
It then turns out that the Dirac spinors are invari-                        APPENDIX A
ant under the N I and N 2 transformations for both
polarizations. However, rs commutes with the                In this appendix, we discuss the similarity be-
Hamiltonian, and this allows us to choose a defin-       tween the E(2) and 0(3) groups using the concept
ite eigenvalue of rs. If the eigenvalue is -I, neu-      of equivalence class. s
trinos and antineutrinos are left- and right-handed,        Both the 0(3) and E(2) groups are thref-
respectively.ls,16 If the eigenvalue of rs is + 1,       parameter groups. We can obtain the E(2) group
then the polarizations are opposite to those for the     from 0(3) through a process of group contrac-
rs= -I case. Experimentally, the eigenvalue of rs        tion. 2o The group contraction process goes as fol-
is known to be - 1.17 As is well known, neutrino         lows. Every rotation can be regarded as a rotation
MASSLESS PARTICLES AND GAUGE TRANSFORMATIONS                                                                353
around a given axis. S The orientation of this rota-       effect of the E matrix given in Eq. (22) on the
tion axis can be specified by two angular variables.       column vector (uo,vo, 1) is well known. However,
This can also be achieved through the coordinate           an interesting case here is the action of E -I in
specification on a spherical surface. The reorienta-       view of Eq. (26):
tion of the rotation axis in this case can be speci-
fied by a movement of a point on the spherical                 E-1(-u,-v,IJ)=E(u·,v·,-0).                   (BI)
surface. If the radius of this sphere becomes suffi-
ciently large, and if the reorientation is sufficiently    In order to see the effect of the above matrix on
localized, the axis reorientation would appear like        the column vector (uo,vo, 1) let us carry out expli-
a motion of a point on a flat surface. As is               citly the following matrix multiplication:
described in the Iiterature,20 the translation on the
                                                                                        :: I[~ ~ ::
E(2) plane is the limiting case of the axis reorienta-
tion in the 0(3) group.
                                                                u'       cosO   sinlJ                   o
   The amount of rotation around a given axis is                v'      -sinO cosO                      o
an independent quantity. All rotations with the                           o     0  I        0 0
same amount of rotation, but not necessarily
                                                                                                            (B2)
around the same axis, belong to the same
"equivalence c1ass."s The traces of transformation         From this matrix algebra, u' and v' can be written
matrices belonging to the same equivalence class           out as
are known to be the same. Likewise, for photons
and neutrinos, it is easy to see from the expression           u'=(uo+u)coslJ+(vo+v)sinO,
given in this paper that the trace of the transfor-                                                         (B3)
                                                               v'=-(uo+u)sinO+(vo+v)cosO.
mation matrix is independent of the u and v vari-
able which only change the location of the rotation        The above E(2) geometry is easy to understand,
axis on the E(2) plane. Since u and v are the gauge        and does not require any further explanation.
transformation parameters, the gauge transforma-             If we insist on doing the same matrix algebra us-
tion is a transformation within the same equiva-           ing the D matrix, applicable to the photon polari-
lence class.                                               zation vector,
   Let us translate what we said above into formu-
las. As is well known, every rotation matrix R can
                                                                                           -sinO
be brought to the form
                                                                                            cosO
       R =A exp(-iaJz)A    -I,                      (AI)
                                                                                             v·
where a is the rotation angle, and A is the two-
parameter matrix which brings the rotation axis to                                                          (84)
the desired direction from the z axis. The trace R
is independent of the parameters of the A matrix.
   In the E(2) case, it is always possible to write           For convenience, we shall hereafter call the
                                                           column vectors, to which the E and D matrices are
       E (u,v,O)=E(5, 1/,O)E(O,O,O)E -1(5,1/,0) .          applicable, the E and D vectors, respectively. The
                                                    (A2)   geometry of the E vector is well known. Its third
                                                           component is trivial. Its first two components
E (0,0,0) is a rotation around the origin. The             specify the coordinate position in the two-
E(5,1/,0) in the above expression moves the rota-          dimensional space spanned by the u and v vari-
tion axis from the origin to (5,1/). The trace of the      ables.
above matrix depends only on the rotation parame-             The physics of the D vector is well known. Its
ter. The translation of the axis to (5,1/) is a            first two components specify the photon polariza-
transformation within the same equivalence class.          tion state, and its third component is parallel to
                                                           the direction of the momentum and is an un-
                                                           measurable gauge parameter. The question then is
                     APPENDIX B                            how the D vector is related to the geometry of the
                                                           E vector.
  We study in this appendix the vector spaces to              The matrix algebras of Eqs. (82) and (B4) allow
which the D and E matrices are to be applied. The          us to see the correspondence between the D and E
354                                                                                                CHAPTER VI
vectors. The matrix algebra for D given in Eq.              Hermitian conjugate, and this is one of the compli-
(B3) is simply the Hermitian conjugate of the alge-         cations in groups containing Abelian invariant sub-
bra of Eq. (B4) for the E matrix. However, it is            groupS.13 Therefore, the D vector is not a linear
not possible to construct a similarity transforma-          transformation of the E vector, although there is a
tion matrix which will bring the E matrix to its            one-to-one correspondence between them.
IE. P. Wigner, Ann. Math. ~, 149 (1939).                       of the present paper. Our statement in Ref. 2 about
20. Han, Y. S. Kim, and O. Son, Phys. Rev. 0 ~, 461            the integer values of m is correct. However, what we
   (1982).                                                    said there about the half-integer values is incorrect
3The fact that the compact 0(2) group is only a sub-        12The D matrix of Eq. (12) performs both Lorentz and
   group of the full noncom pact E(2) group was pointed       gauge transformations. Once reduced to the form of
   out by Gottlieb. See H. P. W. Gottlieb, Proc. R. Soc.      Eq. (24), D is nO longer a Lorentz transformation ma-
   London A368, 429 (1979).                                   trix. It only performs gauge transformations.
4S. Weinberg, Phys. Rev. 134, B882 (1964); 135, BI049       13G. Racah, CERN Report No. 1961-8 (unpUblished).
   (1964).                                                  14M. Flato and P. Hillion, Phys. Rev. 0 I, 1667 (1970).
5E. P. Wigner, Group Theory and Atomic Spectra,             1ST. O. Lee and C. N. Yang, Phys. Rev. 105, 1671
   translated by J. J. Griffin (Academic, New York,           (1957); A. Salam, Nuovo Cimento ~, 299 (1957); L. O.
    1959).                                                    Landau, Nuc!. Phys. 1, 127 (1957).
6y. S. Kim, M. E. Noz, and S. H. Oh, J. Math Phys.          16For a density matrix formulation of this problem, see
   2Q, 1341 (1979); O. Han and Y. S. Kim, Am. J. Phys.        A. S. Wightman, in Dispersion Relations and Elemen-
   ~, 1157 (1981); O. Han, M. E. Noz, Y. S. Kim, and          tary Particles, edited by C. OeWitt and R. Omnes
   O. Son, Phys. Rev. O~, 1740 (1982).                        (Hermann, Paris, 1960).
7E. P. Wigner, Z. Phys. 124, 665 (1948).                    17M. Goldhaber, L. Grodzins, and A. W. Sunyar, Phys.
8F. R. Halpern, Special Relativity and Quantum                Rev. li)2, 1015 (1958),
   Mechanics (Prentice-Hall, Englewood Cliffs, New Jer-     18W. Pauli, Ann. Inst. Henri Poi~care Q, 137 (1936).
   sey, 1968).                                              19The violation of parity invariance was discovered in
9J. Kupersztych, Nuovo Cimento ill, I (1976).                 1956. See C. S. Wu, E. Ambler, R. W. Hayward, O.
100. Han and Y. S. Kim, Am. J. Phys. ~,348 (1981).            O. Hoppes, and R. P. Hudson, Phys. Rev. lill., 1413
IIWhat we did in Ref. 2 was to replace Wigner's Eq.           (1957).
   (93) (Ref. D, which takes the form exp( tim f)) for      2oR. Gilmore, Lie Groups and Lie Algebras, and Some
   both integer and half-integer values of m, by Eq. (16)     oj Their Applications (Wiley, New York, 1974).
Chapter VII
Group Contractions
While the earth is like a sphere, its surface is like a flat plane in a reasonably
confined area. This raises the question of whether the symmetry governing the
sphere can be made that of a flat plane in certain limits. Indeed, in 1953, Inonu and
Wigner formulated this problem as the contraction of the three-dimensional rotation
group into the two- dimensional Euclidean group. In this case, the limiting
parameter is the radius of the sphere. When the radius becomes very large, the area
element on the surface becomes flat.
Since the little groups of massive and massless particles are locally isomorphic to
the three-dimensional rotation group and the two-dimensional Euclidean group
respectively, it is natural to suspect that the little group for massless particles is a
limiting case of that of massive particles. Then, what is the limiting parameter.
This has been shown to be the momentum/mass. The 1984 paper of Han, Kim, Noz,
and Son explains why the momentum/mass acts as the radius of the sphere in the
limiting process.
While the above-mentioned procedures constitute applications of the group
contraction, it is possible to achieve the same purpose by obtaining the little group
for massless particles from kinematical considerations. In 1986, Han, Kim, and Son
observed that the little group transformation is the transformation which does not
change the momentum, and constructed a set of non-colinear transformations whose
net effect is to leave the momentum invariant. This matrix is analytic in the (mass)2
variable in the neighborhood of (mass)2 = O. Therefore, it is possible to obtain the
zero-mass limit using the explicit expression for the little group transformation
matrix. The role of Wigner's little groups is summarized in Figure 3.
                                         355
356                                                                                               CHAPTERVll
                                              I                            I
                             Massive                                            Massless
                                                      between
                             Slow                                               Fast
                                              I
        Energy                        2       I      Einstein's
                            E=-P-                                                E=p
      Momentum                2m                  E=.Jm2+p2
       Helicity             SI         S2
                                              I Little Group I Gauge Trans.
Reprinted from Proc. Nat. Acad. Sci. (U.S.A.) 39, 510 (1953).
358                                                                       CHAPTER VII
I. CONTRACTION OF GROUPS
according to which the J are obtained by the same equation (1) as the
I, except that the et have to be replaced in it by a similar quantity, defined
with respect to the b.
  The above transformation may lead to a new group only if the matrix
U of (3) is singular. We shad call the operation of obtaining a new group
by a singular transformation of the infinitesimal elements of the old group
a contraction of the latter. The reason for this term will become clear
below. The singular matrix will be a limiting case of a non-singular matrix.
The latter will depend linearly on a parameter fi which will tend to zero:
                                                                        (4)
For 0 < E < EO the determinant of (4) is different from zero, it vanishes for
E = O.
                                                1 0                           v 0
                              u =                                    w =                                          (5)
                                                o        0                    o        1
The number of rows and columns in the unit matrix in u, and in v, is equal
to the rank r of u. It is advantageous to label the transformed I and I
with a pair of indices, the first referring to the subdivision of u given in
(5), the second specifying the various I and I within that subdivision.
Hence, (3) assumes the form
                                            r
                               + E L v,pb lp
                                            T
wherein a and (3 can assume the values 1 and 2. This gives for
(8)
i.e., the Ib must span a subgroup. On the other hand, if this happens
to be the case, the structure constants will converge to definite values
Ca •• 11/1"Y· as E - 0
                           h              1.               2.                2c
                  Cb.i/l        Cb.i/l         Cb.i/l           C b . l ,.        0
                           1.                              2.
                  Cb.2/1        0              Cb.2/1           C b • 2/12.           (9)
                                C2o.2/1
                                          1.
                                               C2 •• 2/1
                                                           2.
                                                                o.
These structure constants satisfy Jacobi's identities since the structure
constants for the J do this for non-vanishing E. We shall say that the
above operation is a contraction of the group with respect to the infinitesi-
mal elements Ib or that the infinitesimal elements 12/1 are contracted. We
then have, from (9).
   THEOREM 1. Every Lie group can be contracted with respect to any of its
continuous subgroups and only with respect to these. The subgroup with
respect to which the contraction is undertaken will be called- S. The con-
tracted infinite#mal elements form an abelian invariant subgroup of the con-
tracted group. The subgroup S with respect to which the contraction was under-
taken is isomorphic with the factor group of this invari.ant subgroup. Con-
versely, the existence of an abelian invariant subgroup and the possibility to
choose from each of its cosets an element so that these fot"m a subgroup S, is a
necessary condition for the possibility to obtain th, group from another group
by contraction.
   It is easy to visualize now the effect of the contraction oil the whole
group. The subgroup S with respect to which the contraction is undj!I"-
taken remains unchanged and it is advantageous to choose the group
parameters in such a way that afp = 0 throughout S. Then (6a) can be
replaced by
                                                                                      (6b)
and this can be assumed to be valid throughout the whole group, not only
in the neighborhood of the unit element. As E decreases, a fixed range of
the parameter b will describe an increasingly small surrounding of S. As
E tends to 0, the range of the bfp will become infinite and describe only
those group elements which differ infinitesimally from the elements of S.
The elements which are in the neighborhood of the unit element of the
original group but have finite parameters b20 will commute and form the
aforementioned commutative invariant subgroup. Naturally, the ele-
ments of this invariant subgroup will not commute, in general, with the
elements of the subgroup S: the change of the parameters a20 = Ebb will
be, upon transformation by finite elements of S, of the same order of
magnitUde as these parameters themselves. Naturally, the convergence
GROUP CONTRACTIONS                                                                   361
(10)
(lOa)
Hence ES. -II2S. not only converges to 12 = 12 but remains equal to it for
all E.
   It is more surprising, perhaps, to see that the same device is possible
also if one contracts the group with respect to the subgroup of the T({3).
The contracted group is, in this case, the two parametric abelian group.
We demand, in this case, that S, commute with I{J and hence, by (lOa),
that it be multiplication with a function of x. Because of Sf unitary na-
ture, we can give it the form exp (if(x, E)). Transformation of the Ela of
(lOa) with this gives
                  ES,-IIaS,   =     Ee-;/(z, ')('/2    +
                                                    x-d/dx)e;/(z, ,)
                              =     E('/2 +x d/dx)         +
                                                         Eix d/dx f(x, E).    (12)
The first part of this converges to 0 as it should since (12) should converge
to an operator which commutes with ix. The second part converges to
12 = ixJ'(x) = ig(x) if one sets
                                         f(x, E)   =   clf(x).               (12a)
Hence, the transformations of the contracted group corresponding to the
parameters cr, {3 is mUltiplication with
                                                                              (13)
GROUP CON1RACTIONS                                                                  363
of contraction with respect to Ra. Similarly, (13) could have been ob-
tained directly by trljnsforming O.a. (J = T({3)R.a with exp (t-1if(x».
   It is not clear how generally one can obtain a faithful representation
of the contracted group as a limit of an E dependent transform of a faithful
representation of the original group and the substitution (6b) of its param-
eters. Certainly, the procedure is not applicable to irreducible repre-
sentations of compact groups or, more generally, if the infinitesimal oper-
ators are bounded.
   (b) Representations of the Contracted Group from a Sequence of Repre-
sentations.-We shall now give a few examples for the second procedure,
i.e., obtaining a representation of the contracted group by choosing a
sequence of unitary representations D(I), D(2), ... , D(I), ... so that each
of the operators
                         (II = 1,2, .. , ri J.I = 1,2, ... , n - r)       (15)
It follows from the fact that there are only a finite number (two) non-
vanishing matrix elements in both I 2r and 12, that the operators satisfy
the commutation relations of the contracted group
                  [J2X1 J 2v ] = 0
It further follows from
                           (11(1»)2   +    (12r(l»)2   + (/2,(/»)2    = -let     + 1)
by multiplication with                E2   and going to the limit in the above way that
                                                                                                  (18a)
GROUP CONTRACTIONS                                                                           365
(19a)
One should keep i~ mind that J~z arose from 12z which is the infinitesimal
rotation about the x axis and corresponds to a displacement in the - y
direction. Similarly J~. corresponds to a displacement in the x direction.
  The function 'P(a, r) corresponds in the new Hilbert space to the vector
which has the components 'Pm in the Hilbert space of (l7a), (17b). It
further follows from (l8a) that
(20)
All integrations are from 0 to 211".          Expanding g(a -        a') into a Fourier
series of a - a', one finds
               'P(x,y)     =   fe-jErcosa',£gme-jm(a-a')da'                      (20b)
                                                           °
This permits an explicit determination of the T(t, '1)mm'. We shall not
carry this out completely but set only r = in (22a). Since ali Jm(O) =
except Jo(O) = 1, the summation over m disappears on the right side.
                                                                                                   °
The left side becomes, at the same time by (21)
                  \O(-t, -'1)    =    2r:E\Omi -m e -im(Il+-) Jm(Zp)                       (22b)
where tJ, p are the polar coordinates for             t    '1.   Comparing (22a) and (22b)
one finds, with t = 0, tJ = i/27f, '1 = P
                                     J m(:~P)   =   T(O, P),Om.                             (23)
The group relations and the form of the infinitesimal operators (19a) gives
at once the most important relations for Bessel functions, such as the addi-
tion theorem, differential equation (d. (20», etc. Up to this point the
argument is not new but merely a repetition, for the two dimensional
Euclidean group, of a similar reasoning given before 4 for the rotation
group. This led to the equation 4
                                     _
              DOl(O, tJO)Om = pm(cos {:J) =
                                                      (I +-
                                                          (l
                                                                  m)!)11 2
                                                                  m)!        P7 (cos (3)    (24)
This, together with (23), gives the asymptotic expression for the associated
Legendre functions 5
                             lim P~(cos (pll»)             =     Jm(P).                    (25a)
                             ,-~
GROUP CON1RACTIONS                                                                                        367
   Let us consider. first. the inhomogeneous Lorentz group with one space-
like. one time-like dimension. It is given by the transformations
                         x'        x cosh X + t sinh X + ar                                     (26)
                         t'   =    x sinh X+ t cosh X + a I.
We wish to contract it with respect to the subgroup of time displacements
t' = t + a l • The infinitesimal elements of (26) are: time displacement
It. space displacement I2r and "rotation" in space-time /2)". Their com-
mutation relations read
and
                   1"0 = - (p~   + p~ + pi + P)1/z ()/fJPt.            (2&)
It is useful to introduce new variables instead of the  Pl both in order to
simplify the definition domain of the variables and also to bring the opera-
tions of the subgroup S into a form which is independent of P. This can
GROUP CONTRACTIONS                                                                                    369
and
                                                                                           (30a)
If we now set 1.1: = EIh the 1t will converge to zero unless -P becomes
inversely proportional to E2, i.e., unless - f2p converges to a definite limit
P. If this is assumed, the second term of UtO will converge to zero and
the infinitesimal elements of the representation of the contracted group
become
                 1tl     = 0 1 C)/C)O.l: - Gt   ()/()G 1        10    = -ipo                 (31)
Since the Jl; are skew hermitean, their squares are positive definite
hermitean operators. Since E2P is also positive the expectation value
('P, iJo'P) of ilo is, for any state 'P, greater than E- l times the expectation
value of ill' It follows that if 11'P converges to a vector in Hilbert space
as E -+ 0, the vector Jo'P must grow beyond all limits. The same is true,
of course, for the other ll;. It follows that the representations considered
cannot be contracted in the sense discussed in the previous sections and
the same is true of all representations of the classes P ~ O.
   It is possible, however, to contract these representations to representa-
tions up to a factor of the Galilei group. The commutation relations of
the infinitesimal elements of representations up to a factor differ from the
commutation relations of real representations by the appearance of a
constant in the structural relations. Hence
                                                                                      (34)
where c~/J are the structure constants of the group to be represented (in
our case, the inhomogeneous Galilei group) and the balll are multiples·of
the unit operator. One will, therefore, obtain infinitesimal elements of
representations up to a factor if one sets, instead of (6)
                                                                                     (34a)
in which all a may depend on f. Since the additional terms in (34a)
commute with all other operators, these additional terms will not affect
the left side of (34). Hence, they must be compensated also on the right
side and this is done by the additional terms baill. The point of introduc-
ing the terms al in (34a), which then necessitates the introduction of the
bin (34), is that the right sides of (34a) may converge to finite non-vanish-
ing operators even if the I b , d 2• cannot be made to converge.
   The above generalization of the concept of contraction indeed allows
a contraction of the representations given by (28a), (28b), (2&) also for
p > O. As we let P go to infinity, 10 will also tend to infinity «Io'P, Io'P)
converges to infinity for all 'to). However, subtracting _ipl/2l from 10 ,
it will converge to
       10     lim -i( -Ii -      n - Ii + P)I/2 + ip         l/2
This shows that J o will converge to a finite operator if the J" = El" do and
if P·"E 2 converges to a finite constant m as P - to, E - O. Both can be
accomplished by assuming the representation in such a form that, instead
of (28)
                                                                                  (36)
This is indeed possibl~ because the variability domain of the PIt is un-
restricted and the abov~ form of the infinitesimal elements can be obtained
by unitary transformation of the operators given in (28a). Such a trans-
formation leaves the Iu of (28b) unchanged but transforms the ItO of
(2&) into
                                                 - (i/2m)(pi   + p~ + p!)
Ju     = Pi()/()Pt - P,,()/()P.l                                               (37.1)
The reader familiar with the transition from the Klein-Gordon to the
SchrOdinger equation will recognize the increase of the rest mass with
increasing c and the elimination of this rest mass by the subtraction of
 _iPI/21 from the infinitesimal operator of the time-displacement operator.
The infinitesimal operators (37.1), (37.2) for the contracted group are in
fact those of SchrOdinger's theory. It is likely that a similar contraction
is possible also for the other representations with positive rest mass (i.e.,
P> 0) but this and the behavior of the representations with P = 0 will
not be further discussed here.
  J  Segal, I. E., Duke Math. J., 18,221 (1951).
  2  Cf. e.g. Wigner, E., Gruppentheorie und ihre Auwendiengen etc., Friedr. Vieweg,
Braunschweig (1931) and Edwards Brothers, Ann Arbor (1944). Chapter XV.
   I Cf. Jahnke, E., and Emde, F., Tables of Functions, Dover Publications, 1943, p. 149;
or Watson, G. N., Treatise on Bessel Functions, Cambridge Univ. Press, 1922, p. 19ft.
   • Reference 2, Chapter XIX, particularly p. 230, 232. Cf. also Wigner, E. P., J.
Franklin Inst., 250, 477 (1950), and Godement, R., Trans. Amer. Math. Soc. 73, 496
(1952).
  I For m = 0 this is given on p. 65, of Watson's Bessel Functions (ref. 3).
  • Cf. e.g. Bargmann, V., and Wigner, E. P., these PROCEEDINGS, 34, 211 (1948) and
further literature quoted there.
  7 Inonu, .;;., and Wigner, E. P., NuOfJO cimento, 9, 705 (1952).
                                                                                                                    CHAPfERVII
372
Table I little groups for massive ami ma!.sless partlde~                     Why can we not formulate a group theory based on this
                                                                             daily experience?
  P: fOUf-momentum         Subgroup nf 0 13.11    Subgroup of SL (2,c)          (cl Strictly speaking, when we travel on the surface of the
                                                                             Earth, we are performing rotations around the center of the
          \1a:'S1H'        o (JI-Ilke ~uhgrour    SL' 121-hkt: ~ubgloup of
                                                                             Earth. Can the E (21 group be regarded as a limiting case of
           P     ,ll       nf 0 iJ.11 hadron\     Sr:2,(1 c/ecrron\
                                                                             the rotation group"
          Massle"s         El21-hke   ~ubgroup    EI2Htke subgroup of        We shall therefore start this paper with a discussion of the
           p 'll           ofO:3.1I.phOlonl       SL 12.cl neurnnor;         E (21 group.
                                                                                In Sec. II, transformations on the two-dimensional Eu-
                                                                             clidean plane are discussed. It is shown that solutions of the
                                                                             two-dimensional Laplace equation form the basis for finite-
                                                                             dimensional representations of the E (2) group. The 3 X 3
particles have internal space-time degrees offreedom, For                    matrices representing coordinate transformations on the
instance, a massive particle has rotational degrees of free-                 E (2) plane are discussed in detail. In Sec. III, the E (21 group
dom in the Lorentz frame in which the particle is at rest On                 is discussed as a contracted form of 0 (3). It is noted that we
the other hand, free massless particles have the helicity and                can achieve this purpose by looking into a small and almost
gauge degrees offreedom, We are therefore led to the ques-                   flat portion of a spherical surface whose radius becomes
tion of whether the internal symmetry for massless parti-                    large.
cles can be obtained as an infinite-momentumlzero-mass                          We discuss in Sec. IV the internal space-time symmetries
limit of the space-time symmetry for massive particles, as                   of massive and massless particles. It is shown that the E (2)-
in the case of the energy-momentum relation.                                 like symmetry of massless particles may be regarded as a
   In order to study internal space-time symmetries of rela-                 limiting case of the 0 (3)-like symmetry for massive parti-
tivistic particles, Wigner in 1939 formulated a method                       cles. In Sec. V, we discuss further applications of group
based on the little groups of the Poincare group. ' The little               contractions. It is shown in the Appendix that the E (21
group is a subgroup of the Lorentz group which leaves the                    group can serve as a useful example for illustrating the dif-
four-momentum of a given particle invariant. The little                      ference between active and passive transformations.
groups for massive and massless particles are locally iso-
morphic to the three-dimensIOnal rotation group and the
two-dimensional Euclidean group, respectively. For con-                      II. WHAT IS THE E (2) GROUP?
venience, we shall use the word "like" in order to indicate
that two groups have the same algebraic properties. The                        The two-dimensional Euclidean group, often called E (2),
internal symmetries of massive and massless parhcJes are                     consists of rotations and translations on a two-dimensional
dictated by the 0 (31-like and E (2)-like little groups, respec-             Euclidean plane. The coordinate transformation takes the
tively.                                                                      form
   The first step in obtaining a unified picture of both mas-                  x' = x cos IJ - y sin IJ        -t   u,
sive and massless particles is to gain thorough understand-
                                                                                                                                                (2)
ing of each of the four cases listed in Table I. The represen-
tation suitable for electrons and positrons was discussed in
                                                                               y'   =   x sin IJ   + Y cos IJ + u.
Wigner's original paper. I The representations for relativis~                This transformation can be written in matrix form as
tic extended hadrons in the quark model have been dis-                                                  - sin B
cussed extensively in the literature. '.l The representations
suitable for photons and neutrinos have also been worked
                                                                               (~'X') = (COS  IJ
                                                                                          Si~ B          cos B                                  (3)
OUt,4.5                                                                                                    o
   The purpose of the present paper is to discuss in detail                  The algebraic properties of the above transformation ma-
the problem of showing that the 0 131-like little group for                  trix have been discussed in Ref. 4.
                                                                                             =
massive particles becomes the E 12)-like little group for                       The 3 X 3 matrix in Eq. (3) can be exponentiated as
massless particles in the infinite-momentum/zero-mass                          D(B,u,v)        exp[ - i(uP,         + vP2 )]expl- iBL ,),       14)
limit. In order to deal with this problem, we have to show
first that the E 12) group can be regarded as a limiting case of             The generators in this case are
0(3). We are quite familiar with the three-dimensional ro-                                      -i
tation group. However, the E (21 group is largely unknown
to us, in spite of the fact that this group can serve as a good
illustrative example for many important aspects of geome-
                                                                               LJ=C            0
                                                                                               0        ~).
try and group theory.                                                                                                             15)
                                                                                         0                     0
                                                                                    =(~              ~} P2=(~                 D
   Indeed, from a pedagogical point of view, the E (2) group
has its own merit. When we study the three-dimensional                                   0                     0
rotation group, it is quite natural for us to ask the following
                                                                                P,
                                                                                         0                     0
questions.
                                                                             These generators satisfy the following commutation rela-
   (a) We discuss 0 (3) repeatedly in the established curricu-
                                                                             tions:
lum, because it describes an important aspect of physics,
and because it generates a beautiful mathematics. Then, is                      [P"P,] =0,
the rotation group the only interesting example in the exist-                   [L"P,] =iP2,                                                   (6)
ing curriculum?
   (b) When we commute from home to school, we are mak-                         [L"P 2 ] = - iP,.
ing translations and rotations on a two-dimensional plane.                   The transformation described in Eqs.12) and (3) and gener-
1038           Am. J. Phys., Vol. 52, No. 11, November 1984                                                                      Han et af.   1038
374                                                                                                                                CHAPfERVIl
ated by the matrices ofEq. (5) is "active" in the sense that it               and is suitable for exercise problems even in the undergrad-
transforms the object, as is described in Eq. (2).                            uate curriculum.
   Let us next consider transformations of functions of x                        If k ' = 0, the differential equation of Eq. (9) takes the
and y, and continue to use P, and P, as the generators of                     form
translations and L, as the generator of rotations. These
generators take the form                                                         [( ~)' + (~),I1b(xJ') =       O.                         (10)
  P                .   a                                                      This is a two-dimensional Laplace equation, and its solu-
       I   =    -I     ax'
                                                                              tions are quite familiar to us. The analytic solution of this
                  .a
  P
      ,=        -lay'                                                   (7)
                                                                              equation takes the form
                                                                                1b = r'" exp[ ± imq,] = (x ± IJr,                         (II)
  L, =          -i(X!!...-y~),                                                where
                ay       ax                                                     <6=tan- l (ylx).
The above operators satisfy the commutation relations of
                                                                              This is an eigenstate of L, or a rotation around the origin.
Eq. (6). The transformation using these differential opera-
                                                                              The effect of the rotation operator
tors is "passive" in the sense that it is achieved through a
coordinate transformation which is the inverse of that giv-                     R(O)=exp[-iOL,]                                           (12)
en in Eq. (2). Although they achieve the same purpose, the
active and passive transformations result in coordinate
                                                                                 1b
                                                                              on ofEq. (10) is well known. Ifwe translate the expression
                                                                              of Eq. (11) by applying the operator
transformations in the opposite directions. In the Appen-
dix, the effects of active and' passive transformations are                     T(u,u) = exp[ - i(uP, + uP,)],                           (13)
discussed in detail.                                                          then the translated form becomes
   As in the case of the rotation group, the standard method                    T(u,u),p(x,y) = [(x - u)    ± i(y -   u)]m.              (14)
of studying this group is to find an operator which com-
mutes with all three of the above generators. It is easy to                   This is an eigenstate of a rotation around the point x = u
check that p', defined as                                                     andy = u.
                                                                                If m = I, the basis vector for the representation diagonal
   P'=P;+Pi,                                                            (8)   inL, is
commutes with all three generators. Thus one way to con-
struct representations of the E (2) group is to solve theequa-                          X + iY )
tion                                                                             W, = ( X~iY .                                           (15)
                                                                        (9)
                                                                              The generators of the E (2) transformation matrices take the
using the differential forms of P, and P, given in Eq. (7).                   form
                                                                                              -I ~).
This partial differential equation can be separated in the
                                                                                             0
polar, Cartesian, parabolic, or elliptic coordinate system.·
   If we are interested in constructing representations diag-                   L,=G
onal in p' and P, and P" we use the Cartesian coordinate                                       0
system. On the other hand, if we are interested in represen-
                                                                                                -)
                                                                                                                                         (16)
tations diagonal in p, and L" the polar coordinate system                                    0                        0
is appropriate. These possibilities have been considered in
the past as summarized in Table II.
   Inonu and Wigner constructed infinite-dimensional uni-
                                                                                 PI=G        0
                                                                                             0
                                                                                                  -I ,
                                                                                                   0
                                                                                                          P2=G 0
                                                                                                                      0
                                                                                                                               ~I).
tary representations by solving the differential equation of                  There is a nonsingular matrix which transforms the col-
Eq. (9) with nonvanishing k '.7 On the other hand, the finite-                umn vector in Eq. (2) to W, of Eq. (15). The above 3 X 3
dimensional non unitary representations are based on the                      matrices are applicable to functions and not to the coordi-
solutions with k' = O. We are interested here in the repre-                   nates. For this reason, each of them is related to the nega-
sentations diagonal in L, with k 2 = 0, because they de-                      tive of its counterpart in Eq. (5) through a similarity trans-
scribe the wave functions for massless particles observed in                  formation.
the real world. 4 •s In addition, the mathematics required for                   The matrices of Eq. (16) satisfy the commutation rela-
studying this case is much easier than that for the general                   tions for the E (2) group given in Eq. (6), and
case discussed in the original paper of Inonu and Wigner, 7
                                                                                P;+P;=O,                                                 (17)
                                                                              which is a reflection ofEq. (5). In addition, P, andP, satisfy
                                                                                P; =P; =P,P,=O,                                          (18)
Table II. Representations of the E(2) group.
                                                                              because
  Diagonal
     in
                             Unitary infinite
                              dimensional
                                                   Nonunitary finite
                                                     dimensional
                                                                                (!    Y(X   ± iy) =   (~)'tx ± iy)
  P, and P,             Wigner in 1939 (Ref. I)         Trivial                                 =bx±iy)=O.                            (19)
           L)          Ioonu and Wigner in 1953    Han et 01. in 1982                              axiJy
                               (Ref. 7)                (Ref. 5)                  We expect that the procedure of constructing represen-
                                                                              tations for larger values of m will be similar to the m = 1
1039            Am. 1. Phys., Vol. 52, No. 11, November 1984                                                                  HanetaJ.   1039
GROUP CONTRACTIONS                                                                                                                  375
case. It would be an interesting exercise to construct explic-          For the present purpose, we can consider the case where
it matrices for an arbitrary integer value of m.' The basic          z is large and approximately equal to the radius of the
vector W, given in Eq. (15)is not unlike the spherical vector        sphere, and write
whose components are (x ± iy) and z in which z is replaced
by I.
                                                                                                                                 (27)
III. E (2) GROUP AS A LIMITING CASE OF 0 (3)
                                                                     The column vectors on the left- and right-hand sides are,
  The discussion given in Sec. II on E (2) is quite similar to
                                                                     respectively, the coordinate vectors on which the 0 (3) and
the case ofO (3). Like 0 (3). the E (2) group has three genera-
                                                                     E (2) transformations are applicable. We shall use A for the
tors. and its coordinate transformation matrices are 3 X 3.
                                                                     3 X 3 matrix on the right-hand side. Then, in the limit of
Then how are these two groups related? This fundamental
                                                                     large R,
question was addressed by Inonu and Wigner in their the-
ory of group contraction. 7.9                                          L, =A -IL,A,
   One way to define this problem is to consider a sphere              P, = (IIR IA -IL,A.                                       (28)
with a large radius. Imagine a football field on the surface
ofthe Earth covering the north pole. A player can run from
                                                                       P, = - (IIR IA-IL,A,
east to west, and from north to south. He can also turn              where L,. P" and P, are given in Eq. (5). This limiting
around at any point in the field. Indeed, the player can             procedure is called the contraction of 0 (3) to E(2).
performE (2) transformations on himself. Strictly speaking,             During the contraction process, L, remains invariant.
however, these translations and rotations are all rotations          For L, and L" the upper-right parts of the above matrices
on the spherical surface of the Earth.                               remain unchanged, except for a sign change in L, due to
   Let us start with the familiar 0 (3) rotation operator            Eq. (24). However, the lower-left parts become zero. In
which can be written as                                              terms of the spherical harmonics Y;"(O,,p ), the above limit-
  exp[ - i(aL , + (3L,          + OL,)].                      (20)   ing procedure is the same as replacing cos 0 by I, and
                                                                     (e ± '. sin 0 ) by (x ± iy).
We are interested in the effect of this rotation on the Hat
                                                                        Another interesting property of E (2) which is inherited
football field. L, generates rotations around the north pole.        from 0 (3) is the concept of equivalence class. 10.11 In 0(3),
L" which generates rotation around the y axis, takes the             rotations by the same angle around different axes belong to
form                                                                 the same equivalence class. This notion is translated to ro-
       - -i(Z~-xi.).
  L,=                                                         (21)   tations by the same angle around different points on the xy
             ax az                                                   plane forming an equivalence class. It is not difficult to
                                                                     form a geometrical visualization of the concept of equiv-
Therefore, for large values of the radius R, z = Rand
                                                                     alence classes applicable to 0 (3) and E(2).
  L = - iR ~ = RP, or P, = IIIR )L.                           (22)
       -              ax                             -
                                                                     IV. INTERNAL SPACE· TIME SYMMETRIES OF
If we rotate the system by angle {3 around the y axis, the           RELATIVISTIC PARTICLES
resulting translation on the E (2) plane is
                                                                         In describing a free relativistic particle, we specify first
  {3L,     = - i{3R ~ = uP,.                                  (23)   its mass, momentum, and energy. After determining its
                           ax                                        four-momentum, we should ask what other space-time de-
with                                                                 grees of freedom the particle has. This question was sys-
                                                                     tematically formulated by Wigner in his 1939 paper on re-
  u =(3R or{3= uiR.
                                                                     presentations of the inhomogeneous Lorentz group or the
Likewise                                                             Poincare group.' The subgroups of the Poincare group
  L, = - RP,.              and a = viR.                       (24)   governing internal space-time symmetries are called the
                                                                     little groups. '
The parameters u and v are discussed in Sec. II.
                                                                         The little group is generated by a maximal subset of J,
   Ifwe write the commutation relations for the 0 (3) group
                                                                     and K, which leaves the four-momentum invariant, where
as
                                                                     J, is the generator of rotations around the ith axis. and K, is
   [L"IIIR )L,] = i(IIR )L"                                          the boost generator along the ith axis. These generators
   [L,.IIIR )L,] = - i(IIR )L ,•                              (25)   satisfy the commutation relations
   III1R )L ,,(IIR )L,] = illlR )'L,.                                   [J,.J;) =iEijJ,.
it is easy to see that these expressions, in the large R limit,         [J"K}) =iE.,K"                                          (29)
become the commutation relations for the E (2) group given
                                                                        [K"K}) = -iE,},J,.
in Eq.12).
    Let us translate the above limiting procedure into the              The little groups for massive and massless particles are
language of matrices. 3 X 3 rotation matrices applicable to          locally isomorphic to 0 (3) and E (2), respectively, and they
coordinate variables (x.y,z) are well known. They are gener-         have been discussed in separate papers in this Journal. 2.'
                                                                     After studying the procedure of obtaining the E (2) group as
                                J
ated by L, of Eq. (5) and
                                                                     a limiting case of 0 (3), we are naturally led to consider the
               G~ ~                          ~ ~ D
                                                                     internal symmetry group of massless particles as a limiting
  L,       =                       L,   =(    i               (26)   case of the o (3)-like little group for massless particles.
                                                                        If a massive particle is at rest, the symmetry group is
1040           Am. J. Phys., Vol. 52, No. II, November 1984                                                      Haneto[.       1040
376                                                                                                                   CHAITER VlI
generated by the angular momentum operators J" J" and                     Let us start with a massive particle at rest with its mass
J,.' These operators do not change the four-momentum of                M. Then the little group is isomorphic to the 0(3) group
the particle at rest. If this particle moves along the Z direc-        generated by J" J" and J,. Ifwe boost this massive particle
tion, J l remains invariant, and its eigenvalue is the helicity.       along the z direction, its momentum and energy will be-
However, we have been avoiding in the past the question of             come Pand E = [P' + M ']'1', respectively. The boost ma-
                                                                          . (I~
what happens to J, and J" particularly in the infinite-mo-             trix is
mentum limit.
   There are no Lorentz frames in which massless particles
                                                                                             o        o
are at rest. The little group for a massless particle moving                                 I        o                         (34)
along the z direction is generated by fl' N" and N,,' where
                                                                          B(P) =             o       E/M
   N, =K,-J"                                                                                 o       PIM
                                                                (30)   Under this boost operation, J, given in Eq. (33) remains
   N,=K 2      +J,·                                                    invariant:
The four-momentum of the massless particle remains in-                    J; = BJ,B -, =J,.                                     1351
variant under transformations generated by these opera-                However, the boosted J, and J, become
tors. These generators satisfy the commutation relations
                                                                          J;     =IEIMiJ,-IPfM)K"
   [N"N,] =0,                                                                                                                   (36)
   [f"N,]=iN2 ,                                                 (31)      J;     =IEIMiJ,        + (P/M)K,.
                                                                       Because the Lorentz boosts in Eqs. (35) and (36) are similar-
   [J 3 ,N,] = - iN"
                                                                       ity transformations, the J' operators still satisfy the 0 (3)
which are identical to those for the E (2) group given in Eq.          commutation relations:
(6). J 3 is like the generator of rotation while N, and N2 are
                                                                                                                                (37)
like the generators of translations in the two-dimensional
plane. These translationlike operators are known to gener-             Since the quantities in Eq. (36) become very large as the
ate gauge transformations'                                             momentum increases. we introduce new operators:
   Einstein's energy-momentum relation of Eq. (I) clearly
                                                                          G,= -(MfEiJ;,
indicates that a massive particle becomes like a massless
                                                                                                                                138)
particle as the momentum/mass ratio becomes infinite. We                  G,=IM/EiJ;.
are thus led to the suspicion that the 0 (3)-like internal sym-
                                                                       In terms of these new operators, wecan write the 0 (3) com-
metry for massive particles will become the E (2)-like sym-
                                                                       mutation relations of Eq. 137) as
metry for massless particles, and that this limiting proce-
dure will be like the group contraction procedure discussed                [J"G,] = - iG"
in Sec. III.                                                              [J"G,] =iG"                                           (39)
   If we boost the massive particle along the z direction, J,
will remain invariant. The analysis of Sec. III leads us to               [G"G,] = - (M /E)'J,.
expect thatJ, andJ, for massive particle at rest will become             The quantity (M IE)' becomes vanishingly small if the
N, and - N" respectively, in the infinite-momentum/                    mass becomes small or the momentum becomes very large.
zero-mass limit, just like the contraction of 0 (3) resulting in       In this limit,
E(2)."                                                                    G,~N,        and G,~N"                                140)
   Let us carry out an explicit calculation to justify the
 above expectation starting with a massive particle at rest. If        where
 we use the four-vector convention                                                       0       -i
   X   e = Ix,y,z,t),     x e = (x,y,z, - II,
 with c = I, the generators of Lorentz transformations ap-
 plicable to this coordinate space are
                                                                (32)
                                                                          N'~(~          0
                                                                                         0
                                                                                         0
                                                                                                 0
                                                                                                 0
                                                                                                 0     1}                       (41)
                                                                                                       )
              0     0                           0   0
,~(!                         ~ K'~U                        1)
                                                                                         0 0
                                                                          N'~(:
              0     -i                          0   0
                                                                                         0 -i
                    0                           0   0
                                                                                           0    o .
,,{                          )
              0     0                           0   0
                                                                                 0         0    0
                0                             0 0
                                                           ~,
                         1
                                                                       The fact that the above N matrices generate gauge transfor-
                0        0 o       K _ 0 0 0                           mations has been extensively discussed in the litera-
                                     ,- ("
                                                              (33)     ture. 4 ,5,13
        -1      0        0 ~'              ~ 0 0
          0     0        0                        0                     Indeed, rotations around the axes perpendicular to the
                                                                       momentum become gauge transformations in the infinite-
             -i          0                    0 0
"~(l                        V K'~(:
                                                                       momentum/zero-mass limit.
              0          0                    0 0
             0
              0
 where J i and K,
                         0
                         0
                                              0 0
                                              6            V
                        generate rotations and boosts, respective-
                                                                       V. FURTHER PHYSICAL APPLICATIONS OF
                                                                       GROUP CONTRACfIONS
                                                                        The purpose of this section is to indicate that there are
 ly.                                                                   many other interesting applications of group contractions.
1041        Am. J. Phys., Vol. 52, No. ll, November 1984                                                         Han etal.     1041
GROUP CONTRAcrIONS                                                                                                                        377
In Sec. III, we studied the contraction ofO (3) toE(2) using     let
the notion of a plane tangent to a spherical surface. The
                                                                    f(l+ 1)"",(f+I)2                                                (47)
concept of this tangent plane plays a very important role in
many branches of physics and engineering dealing with               In their 1976 paper,15 Misra and Maharana made an
curved surfaces.                                                 interesting observation that, when the scattering angle is
   For example, let us consider the surface of the hyperbola     very small, we can replace (L ~ + L ;) by
in a three-dimensional space spanned by x, y, and t:                R'(P~ +P;)=(/+!)',                                              (48)
  (et)' - x' - y' = const,                                (42)   in the spirit of Eqs. (22) and (24) with suitable redefinitions
where e is a constant and may become very large. This is a       for R and Pi' In view of the discussion given in Sec. IV, we
description of the 0 (2,1) group consisting of Lorentz boosts    can readily letR be (Po/M). 15 Thus, for a given value of Po,
along the x and y directions and rotations on the xy plane.      the eigenvalue of
The pedagogical value of this group has been amply dis-
cussed in Ref. 14.
                                                                    (P~    + Pi)                                                    (49)
   We can now consider a plane tangent to the surface at         will give a measure of f. In view of the discussion given in
x = y = O. As the constant e becomes very large, the por-        Sec. II, thi~ new parameter will be that of the Bessel func-
tion of surface in which (x' + y') is finite becomes fiat and    tion. Thus, for large values of f, the Legendre polynomial
coincides with the tangent plane. It is then not difficult to    becomes the Bessel function"·:
imagine that transformations on this tangent plane are Ga-             PI (cos 0 )--Jo\aqO),                                        (50)
lilean transformations.
   In order to see this point, let us start with coordinate      where
transformations of the group 0 (2,1) generated by L, ofEq.          a=Po/M.
(5), and                                                         The parameter q now measures f, and becomes continuous
                                             o                   for large values of PD'
  KI=G ~i                                    o
                                             o
                                                          (43)      The above Bessel-function form is commonly used for
                                                                 studying high-energy data. 15.1. It is interesting to note that
                                                                 the transition from the use of the Legendre polynomials for
applicable to the column vector (x,y,et). 14 K, andK, are the    low-energy processes to that ofthe Bessel functions in high-
generators of Lorentz boosts along the x and y directions,       energy scattering is a group contraction of 0(3) to E(2).
respectively.
  The column vector (x,y,et) can be written as                   ACKNOWLEDGMENTS
  C)=G : DG)'
                                                                    We are grateful to O. W. Greenberg for providing the
                                                          (44)   following information. In 1962, Wigner gave a series of
                                                                 lectures on the representations of the Poincare group at
Then, as e becomes very large, the circumstance is identical     Trieste and Istanbul. At one of his lectures, the problem of
to the case of Eqs. (26) and (28). The resulting transforma-     obtaining the E (2)-like little group for massless particles as
tion matrix becomes                                              a limiting case of the 0 (3)-like little group for massive parti-
                                                                 cles was informally discussed as an unsolved problem, al-
  (y'X') = (COSO
                      - sinO                                     though this discussion was not included in Wigner's lecture
             smO       cos 0                              (45)   notes published in Ref. 17. We would like to thank M.
    t'        0          o                                       Parida for bringing Ref. IS to our attention and for explain-
This form is a rotation on the xy plane followed by Galilean     ing its content to us.
boosts along the x andy directions. Indeed, special relativi-
ty becomes Galilean relativity in the limit of large e.'··       APPENDIX
   Let us consider another example. In scattering processes         We study in this Appendix active and passive transfor-
in which two incoming particles collide with each other          mations on the E (2) plane. Let us define the transforma-
resulting in two particles moving in different directions, we    tions given in Eqs. (2) and (3) to be active. This transforma-
commonly use the Legendre polynomials Plicos 0 ) to de-          tion first rotates the coordinate point (x,y) by angle 0
scribe the dependence on the scattering angle. The quan-         around the origin. It then translates the rotated point by u
tum number f is the angular momentum around the scatter-         and v along the x and y directions, respectively.
ing center, and can be regarded as a measure of the                 On the other hand, if we perform the same rotation on
incoming momentum multiplied by the impact parameter.            the function
   When particles move slowly, it is sufficient to consider         g(x,y) = (x + iyr = r"'e im "',                         (AI)
only two or three lowest values f. On the other hand, when
the particles move with speed very close to that oflight, the    using L, given in Eq. (7),
scattering becomes predominantly forward and becomes               (e ~ i8L')g(X,y) = r"'eimIO ~ 81.                               (A2)
like the Fraunhofer diffraction. In this case, we have to deal
with large values of f. One way to approach this problem is      If we apply the translation operators on the above expres-
                                                                 sion,
to start from the operator
                                                                   (e ~ il"p, + 'P,le ~ iOL')g(X,y)   = (x" + iy")m = g(x" ,y"),   (A3)
                                                          (46)
                                                                 where
with the eigenvalue f (f + I). For large values of f, we can       x· = (x - u)cos 0 + (y - v)sin 0,
ignoreL, whose eigenvalue is usually not larger than I, and        y" = - (x - u)sin 0 + (y - v)cos O.
1042     Am. J. Phys., Vol. 52, No. 11, November 1984                                                              Hanetal.        1042
378                                                                                                                                    CHAPTER VII
The above linear transformation can also be written as                      'E. Innnu and E. P. Wigner. Proc. Natl. Acad. Sci. U. S. A. 39. 510119531.
                                                                            "d. Han, y, S. Kim. and D. Son. University of Maryland Physics Publi-
                         sin 0
                         cosO       ~U(:i~O; ~: ~O~~ ~)~)              .
                                                                             cation No. 83-141 (19831.
                                                                            "'For a pedagogical reformulation of the theory of group contraction.
                           o                                                 including a discussion of the E(2) group as a contraction of the 0(3)
                                                                             group, see R. Gilmore, Lie Groups and Lie Algebras. and Some o/Their
                                                                   (A4)
                                                                             Applications (Wiley. New York. 1974).
The matrix in this expression is precisely the inverse of that              lOA. S. Wightman, in Dispersion Relations and Elementary Particles, edit-
of the active transformation matrix of Eq. (3).                               ed by C. De Witt and R. Omnes (Hermann, Paris. 1960).
                                                                            liThe concept of equivalence class was discussed in detail for the 0 (3) case
                                                                              by Wigner. This concept survives in £12) after contraction. See E. P.
                                                                              Wigner, Group Theory, and Its Applications to the Quantum Theory of
IE. P. Wigner, Ann. Math. 149.40(1939). See also V. Bargmann and E. P.        AlOmicSpectra (Academic, New York, 19591.
  Wign". Proc. Natl. Acad. Sci. U. S. A. 34. 211 (1946).                    "D. Han. Y. S. Kim. and D. Son. Phy,. Lett. 131B. 327119831.
'V. S. Kim. M. E. Noz. and S. H. Dh. Am. J. Phy •. 47. 892 (1979); D. Han   "S. Weinberg. Phy,. Rev. 134. B882 11964); 135. 81049119641.
  and Y. S. Kim. ibid. 49.115711981).                                       "Y. S. Kim and M. E. Noz. Am. J. Phys. 51. 368119831.
 'V. S. Kim. M. E. Noz. and S. H. Dh. J. Math. Phy•. 20.1341 (1979); D.     I~S. P. Misra and J. Maharana. Phys. Rev. D 14.1330976).
  Han. M. E. Noz. Y. S. Kim. and D. Son. Phy•. Rev. D 25.1740 11982).       IhR. Blankenbecler and M. L. Goldberger, Phys. Rev. 126, 766( 1962). See
'D. Han and Y. S. Kim. Am. J. Phy •. 49. 348 11981).                          a1,o S. 1. Wallace. Phys. Rev. D 8. 1846119731 and D 9. 40611974).
'D. Han. Y. S. Kim. and D. Son. Phy,. Rev. D 25. 46111982); 26. 3717        17E. P. Wlgner, in Group Theoretical Concepts and Methods in Elementary
  (1982).                                                                     Particle Physics. edited by F. Glirsey (Gordon and Breach. New York.
'Po Wintemitz and I. Fri,. Yad. Fiz.l. 88911965) [Sov. J. Nucl. Phy,. I.      1962); and in Theoretical Physics, edited by A. Salam (International
  63611965)).                                                                 Atomic Energy Agency, Vienna. 19621.
           Am. J. Phys. S2 (11), November 1984                                      @ 1984 American Association of Physics Teachers               1043
GROUP CONTRACfIONS                                                                                                                  379
where
This matrix depends on the rotation angle 0 and the velocity parameter a, and becomes an identity matrix when the particle is
at rest with a = O.
     Indeed, the rotation R(O) followed by the boost S(a,O) leaves the four-momentum p ofEq. (2) invariant:
P = D(a,O)P, (7)
where
      D(a ,O) = S(a ,O) R(O) .
The multiplication of the two matrices is straightforward, and the result is
 2229        J . Math. Phys" Vol. 27. No. 9. September 1986                                                                       Han , Kim, and Son   2229
GROUP CONTRACfIONS                                                                                                                                               381
where                                                                              convenient for studying the relation between the Euler an-
                                                                                   gles and the parameters of the O(3)-like little group.
u = - 2(tan(e12))          and         T= I   + (I-a          2   )(tan(e/2))'.
                                                                                        We have so far discussed the transformations in the x-z
This complicated expression leaves the four-momentum Pof                           plane. It is quite clear that the same analysis can be carried
Eq. (Z) invariant. Indeed, if the particle is at rest with van-                    out in the y-z plane or any other plane containing the z axis,
ishing velocity parameter a, the above expression becomes a                        This means that we can perform rotations R,(¢» and
rotation matrix. As the velocity parameter a increases, this                       R, (,p), respectively, before and after carrying out the trans-
D matrix performs a combination of rotation and boost, but                         formations in the x-z plane. Indeed, together with the veloc-
leaves the four-momentum invariant.                                                                                                      e,
                                                                                   ity parameter a, the three parameters ¢>, and ,p constitute
     Let us approach this problem in the traditional frame-                        the Eulerian parametrization of the 0(3)-like little group.
work. 1 The above transformation is clearly an element of the
0(3 )-like little group that leaves the four-momentum P in-
variant. Then we can boost the particle with its four-momen-                       111. E(2)-LlKE LITTLE GROUP FOR MASSLESS
tum Pby A-I until the four-momentum becomes that ofEq.                             PARTICLES
(1), rotate it around the y axis, and then boost it by A until
                                                                                        Let us study in this section the D matrix ofEq. (8) as the
the four-momentum becomes PofEq. (2). It is appropriate
                                                                                   particle mass becomes vanishingly small, by taking the limit
to call this rotation in the rest frame the Wigner rotation."
                                                                                   of a~l. In this limit, the D matrix ofEq. (8) becomes
The transformation of the O( 3 )-like little group constructed
in this manner should take the form                                                                              o         -u
    D(a,e) =A(a)W(e')[A(a)]                       1
                                                                           (9)                                   I         o              ou       )           (12)
                                                                                                                 o    1- u'/2            u'/2          .
                 ceo
where W is the Wigner rotation matrix
                                                                                                                 o        -u'/2      1+ u2 /2
                                                          :)
                                       0    sin   e*
                                              0
     Wee') =
                                                  e* o .
                          0                                               (10)
                     -   Slll   e'     0    cos
                          0            0      0           I
                                                                                         8'
                e
We may call * the Wigner angle. The question then is                               ISoof----------
whether D ofEq. (9) isthesameasDofEq. (8). In order to                             170 0 1 - - - -_ __
answer this question, we first take the trace of the expression
given in Eq. (9). The similarity transformation of Eq. (9)
assures us that the trace of Wbe equal to that of D. This leads
to
     e* _
        - cos
                -I   (1-
                      I+
                         (l-a )(tan(e/2))')
                                        2
                            a')(tan(e 12))2 .
                                (I _
                                                                          (II)
 2230       J. Math. Phys., Vol. 27. No.9, September 1986                                                                           Han, Kim, and Son            2230
382                                                                                                                                 CHAPTER VII
After losing the memory of how the zero-mass limit was                         invariant, with a greater than I. Although particles with
taken, it is impossible to transform this matrix into a rotation               imaginary mass are not observed in the real world, the trans-
matrix. There is no Lorentz frame in which the particle is at                  formation group that leaves the above four-momentum in-
rest. If we boost this expression along the z direction using                  variant is locally isomorphic to 0(2, I) and plays a pivotal
the boost matrix                                                               role in studying noncompact groups and their applications
                   0               0                                           in physics. This group has been discussed extensively in the
                                                    PI(I~P""}
                                                                               literature. II
                                  0
                                                                                    We are interested here in the question of whether the D
                   0        1/(1 _ P 2) 1/2
                                                                               matrix constructed in Sees. II and III can be analytically
                   0        P/(I_P 2)1/2            1/(I_P2)1/2                continued to a > I, Indeed, we can perform the rotation and
                                                                    (13)       boost of Fig. I to obtain the D matrix of the form given in Eq,
D remains form-invariant:                                                      (8), if a is smaller than a o where
       D'(u) =B(P)D(u)[B(P)]-1 =D(u'),                              (14)           a~ = [I    + (tan(0/2»)2]/(tan(0 /2»)2.                 (17)
where                                                                          As a increases, some elements ofthe D matrix become singu-
       u' = [(I   + P)/(I - Pll 112 u .                                        lar when T vanishes or a = ao' Mathematically, this is a
                                                                               simple pole that can be avoided either clockwise or counter-
     The matrix of Eq. (12) is the case where the Kupersz-
                                                                               clockwise, However, the physics of this continuation process
tych kinematics is performed in the x-z plane. This kinema-                    requires a more careful investigation,
                                                                         )
tics also can be performed in the y-z plane. Thus the most
                                                                                    One way to study the D transformation more effectively
general form for the D matrix is                                               is to boost the spacelike four-vector of Eq, (16) along the z
                       0               -u                                      direction to a simpler vector
                                       -v                      ,v
D'' '-O
                                                                                   (O,O,im,O) ,                                            (18)
                              1- (u 2 + v2)/2           (u 2+ v2)/2        .
                       v·      _ (u 2   +   v2)/2     I + (u 2 + v2)/2
                                                                               using the boost matrix ofEq. (13) with the boost parameter
                                                                               P = I/a, Consequently, the D matrix is a Lorentz-boosted
                                                                    (15)
                                                                               form of a simpler matrix F:
The algebraic property of this expression has been discussed                       D=B(l/a)F(A)[B(I/a)j-I.                                 (19)
extensively in the literature. I.'" If applied to the photon
four-potential, this matrix performs a gauge transforma-                       Here F is a boost matrix along the x direction:
tion.'" The reduction of the above matrix into the three-by-
                                                                                                 cosh A   o    0
three matrix representing a finite-dimensional representa-
                                                                                                           I   0
                                                                                                                   Sin~A)
tion of the two-dimensional Euclidean group has also been
discussed in the literature.'
                                                                                   F(A) =
                                                                                             (
                                                                                                     ~    o    I     o          '          (20)
     Let us go back to Eq. (9). We have obtained the above                                       sinhA    o    0   cosh A
gauge transformation by boosting the rotation matrix W giv-                    where
en in Eq. (10). This means that the Lorentz-boosted rota-
                                                                                                 -2(a'-I)I/'tan(e/2)
tion becomes a gauge transformation in the infinite-momen-                         tanh'! =                                 ,
tum and/or zero-mass limit. This observation was made                                            I + (a' - I) (tan(e /2»)'
                                                                                                 I + (a: - I) (tan(e /2»)' ,
                                                                                                                                           (21)
earlier in terms of the group contraction of 0 (3) to E (2),9.10                   cosh'! =
which is a singular transformation. We are then led to the                                       1- (a- - 1) (tan(e /2»)'
question of how the method used in this section can be ana-
lytic, while the traditional method is singular.                               Ifwe add the rotational degree offreedom around the z axis,
     The answer to this question is very simple. The group                     the above result is perfectly consistent with Wigner's origi-
contraction is a language of Lie groups.9.10 The parameter a                   nal observation that the little group for imaginary-mass par-
we use in this paperis not a parameter of the Lie group. If we                 ticles is locally isomorphic to 0(2, I). I
use TJ as the Lie-group parameter for boost along the z direc-                       We have observed earlier that Ihe D matrix ofEq, (8)
tion, it is related toa by sinh TJ = a/(I - a 2) II'. However,                 can be analytically continued from a = I to I < a < ao, At
this expression is singular at a = ± I. Therefore, the con-                    a = ao, some of its elements are singular, If a> ao, cosh'! in
tinuation in a is not necessarily singular. We shall continue                  Eqs. (20) and (21) become negative, and this is not accepta-
the discussion of this limiting process in terms of the SL(2,c)                ble,
spinors in Sec. VI.                                                                  One way to deal with this problem is to take advantage
                                                                               of the fact that the expression for tanh A in Eq, (21) is never
                                                                               singular for real a greater than I. This is possible if we
IV. 0(2,1)-LlKE LITTLE GROUP FOR IMAGINARY-MASS                                change the signs of both sinh'! and cosh A when we jump
PARTICLES                                                                      from a < a o to a > a". Indeed, the continuation is possible if
                                                                               it is accompanied by the reflection of x and t coordinates,
    We are now interested in transformations that leave the                    After taking into account the reflection of the x and t coordi-
four-vector of the form                                                        nates, we can construct the D matrix by boosting F of Eq,
      P=im(0,0,a/(a'-1)1:2,I/(a'-I)1/2)                             (16)       (20). The expression for the D matrix for a> a" becomes
2231        J. Math. Phys .• Vol. 27, NO.9, September 1986                                                          Han, Kim, and 50n       2231
GROUP CONTRACfIONS                                                                                                                                                383
                             o              ulT
       D=(I_~/T               I               o
                                                                             -aulT           )
This expression cannot be used for the a-> I limit, but can be                  corresponding to WofEq. (10) is
used for the a-> 00 limit. In the limit a-> 00, P of Eq. (16)
                                                                                            0* _ (COS(O*/2)                  - Sin(O"/2»)
                                                                                                                                                               (25)
becomes identical to Eq. (18), and the above expression be-                            W(        ) - sin(O*/2)                cos(O*/2) •
comes an identity matrix. As for the question of whether D of
                                                                                where the rotation angle 0 * is given in Eq. (II).
Eq. (22) isananalyticcontinuationofEq. (8), the answer is
                                                                                    Using the formula of Eq. (9), we can calculate the D
"no," because the transition from Eq. (22) to Eq. (8) re-
                                                                                matrix for the SL(2,e) spinors. The D matrix applicable to
quires the reflection of the x and taxes.
                                                                                the undotted spinors is
                                                                                .+x;
The boost matrix, which brings the SL (2,e) spinors from the
zero-momentum state to that of p, is
A'   ± '(a)
                                                                                x_
       = (((   I   ± a)/ (\ =ta»)"·                0                 )                           x,
                        O
                                         ((I =t a)/(l      ± a»)"·       '            X~
                                                                     (24)
                                                                                FIG. 3. Lorentz-boosted rotation~ or the four SL( 2.e) spillors. Ifthe part i-
where the superscripts ( + ) and ( - ) are applicable to the                    deveiocilY is zero, all rlw!o.pinors rolate likc the Pauli !'Ipinors. A!'I the parti-
                                                                                clespeed approaches that nflight. two of the !o.pin .. linc up with the momen-
undotted and dotted spinars, respectively. In the Lorentz
                                                                                tum, while the remaining two refu!o.e 10 do!.o. Tho!.e !.pincmth4t line up are
frame in which the particle is at rest, there is only one rota-                 gauge-invariant spinOfs. Tho!o.e that do 1I0t are not gauge imariant. and they
tion applicable to both sets of spinors. The rotation matrix                    form the origin of the gauge degrees of freedom for photon four-potentials.
2232          J. Math. Phys .• Vol. 27. No.9. September t986                                                                     Han, Kim, and Son             2232
384                                                                                                                     CHAPTER VII
In the limit of a--+I, this angle becomes 0" where                         pure boost along the x axis:
      0, = tan-'(2(tan(0/2»)).                                  (31)
                                                                               F' ± '(A) = ( cosh(A 12)         ± sinh  (A 12»)
Indeed, the spins represented by X _ and X+ refuse to align                                   ±sinh(AI2)         cosh(A 12)        ,   (32)
themselves with the momentum. This result is illustrated in
Fig. 3.                                                                    where A is given in Eq. (21).
     There are D transformations for the a > I case. In the                    Fora<ao,wecancontinuetouse D'+'and D'-'given
special Lorentz frame in which the four-momentum takes                     in Eq. (26) and Eq. (27), respectively. However, fora >ao,
the form ofEq. ( 18), the D transformation becomes that of a               the D matrix is
                                                                       I
      D '±' (a,u)
               £I _ ( (a'-I)/'(tan(OI2WFT
                  -
                                                                  ±(a± 1)/(a=F I»)'/'/FT\
                                                                                                                                       (33)
                         ± (a=F I)/(a ±          I)))/'/~ - T     (a' - I) '/2(tan(O 12WFT           ) .
The above expression becomes an identity matrix when                       The four-potentials are gauge dependent, while the spinors
a-+ 00, as is expected from the result of Sec. IV. The D matri-            allowed in the Dirac equation are gauge invariant. There-
ces of Eq. (33) are not analytic continuations of their coun-              fore, it is not possible to construct four-potentials from the
terparts given in Eqs. (26) and (27), because the continu-                 Dirac spinors.
ation procedure, which we adopted in Sec. IV and used in                        On the other hand, there are gauge-dependent SL(2,c)
this section, involves reflections in the x and t coordinates.             spinors, which are given in Eq. (37). They disappear from
                                                                           the Dirac spinors because N _ vanishes in the a-+ I limit.
VI_ GAUGE TRANSFORMATIONS IN TERMS OF                                      However, these spinors can still play an important role if
ROTATIONS OF SPINORS                                                       they are multiplied by N +, which neutralizes N _. Indeed,
     It is clear from the discussions of Secs. III-V that the              we can construct unit vectors in the Minkowskian space by
limit a-+ I can be defined from both directions, namely from               taking the direct products of two SL(2,c) spinors
a< I and from a > I. In thelimita-+I,D1+landD'-'ofEq.                           - X +X + = (l,i,O,O),      X-X- = (I, -i,O,O),
(26) and Eq. (27) become                                                                                                               (38)
                                                                               X+X- = (0,0,1,1),      X.X+ = (0,0,1,-1).
                  1
                    u), D'.'=(      I
                                   -u        .   ~)    (34)                These unit vectors in one Lorentz frame are not the unit
                                                                           vectors in other frames. For instance, if we boost a massive
After going through the same procedure as that from Eq.                    particle initially at rest along the z direction, Ix +X +) and
(12) to Eq. (15), we arrive at the gauge transformation ma-                Ix -X -) remain invariant. However, Ix +X _) and Ix -X +)
trices·                                                                    acquire the constant factors [( I + a)/(1 - a) ]'/2 and
                     I               iV)                                   [(I-a)/(1 +a)]'/',respectively. We can therefore drop
      D1+'(u,v)   = (0
                           U -
                                 I         '                               Ix-X   +) when we go through the renormalization process of
                                           °
                                                                (35)       replacing the coefficient [(1 +a)/(1-a)])/'by I for par-
      D'-'(u,v) = (     I .                1),                             ticles moving with the speed of light.
                      -u-w                                                      The D(u,v) matrix for the above spinor combinations
applicable to the SL (2,c) spinors, where the D ' ± , are appli-           should take the form
cable to undotted and dotted spinors, respectively.
                                                                                                                                       (39)
     The SL(2,c) spinors are gauge invariant in the sense
that                                                                                 +'
                                                                           where D' and D 1-' are applicable to the first and second
                                                                           spinors ofEq. (38), respectively. Then
      D'+'(u,v)X + = X +,            D'-'(u,v)X _ = X _ .       (36)
On the other hand, the SL( 2,c) spinors are gauge dependent
                                                                               D(u,v)( -Ix+x+» = lX+x+)           + (u+iv)lx+x_)         ,
in the sense that                                                              D(u,v)lx_x_) = Ix-x-'>        + (u -iv)lX+x-) ,         (40)
2233       J. Math. Phys. Vol. 27. No.9. September '986                                                        Han, Kim, and Son       2233
GROUP CONTRACfIONS                                                                                                                                        385
                   E=1                                      E=p                These generators accommodate both signs of the boost gen-
                    2m
 Momentum
                                  I E=.Jm2+p2 I                                erators for the SL(2,c) spinors. In this representation, y, is
Spin, Gouge             S3        I Wigner's I S3                              diagonal, and its eigenvalue determines the sign of the boost
                                                                               generators,
2234         J. Math. Phys., Vol. 27, No.9, September t 986                                                                  Han, Kim, and Son           2234
386                                                                                                                                   CHAPTER VII
   role in the development of quantum mechanics and atomic spectra. See L.     100. Han, y'S. Kim,andD. Son,Phys. Lett. B131. 327 (1983); D. Han, Y.
   H.Thomas, Nature 117, 514 (1926); Philos. Mag. 3, I (1927).                   S. Kim, M. E. Noz, and D. Son, Am. I. Phys. 52, 1037 (1984).
 'D. Han and Y. S. Kim, Am. I. Phys. 49, 348 (1981); D. Han, Y. S. Kim,        lly. Bargmann, Ann. Math. 48, 568 (1947); L. Pukanszky, Trans. Am.
  and D. Son, Phys. Rev. D 31, 328 (1985).                                       Math. Soc. 100, 116 (1961); L. Serterio and M. Toller, Nuovo Cimento
 (,E. P. Wigner, Z. Phys. 124, 665 (1948); A. S. Wightman, in Dispersion         33,413 (1964); A. O. Barnt and C. Fronsdal, Proc. R. Soc. London Ser. A
  Relations and Elementary Particles, edited by C. De Witt and R. Dmnes          287,532 (1965); M. Toller, Nuovo Cimento 37, 631 (1968); W. I. Hol-
   (Hermann, Paris,1960); M. Hamermesh, Group Theory (Addison-Wes-               man and L. C. Biedenharn, Ann. Phys. (NY) 39, I (1966); 47, 205
   ley, Reading, MA, 1962); E. P. Wigner, in Theoretical Physics, edited by      (1968); N. Makunda, I. Math. Phys. 9, 50,417 (1968); 10, 2068, 2092
  A. Salam (LA.E.A., Vienna, 1962); A. Janoer and T. Jenssen, Physica 53,        (1973); K. B. Wolf, I. Math. Phys. 15, 1295, 2102 (1974); S. Lang,
   I (1971); 60, 292 (1972);1. L. Richard, NuovoCimento A8,485 (1972);           SL(2,r) (Addison-Wesley, Reading, MA, 1975).
  H. P. W. Gottlieb, Proc. R. Soc. London Ser. A 368, 429 (1979).              12M. A. Naimark, Am. Math. Soc. Transl. 6, 379 (1957); I. M.Gel'fand, R.
'So Weinberg, Phys. Rev. 134, B 882 (1964); 135, BI049 (1964).                   A. Minlos, and Z. Va. Shapiro, Representations o/the Rotation and Lor-
liD. Han, Y. S. Kim, and D. Son, Phys. Rev. D 26,3717 (1982).                    entz Groups and their Applications (MacMillan, New Yark, 1963).
9E. Inonu and E. P. Wigner, Proc. Nat!. Acad. Sci. (U.S.A.) 39, 510            nyu. V. Novozhilov, introduction to Elementary Particle Theory (Perga-
  (1953); D. W. Robinson, Helv. Phys. Acta 35, 98 (1962); D. Korff, I.           mon, Oxford, 1975).
  Math. Phys. 5, 869 ( 1964); S. Weinberg, in Lectureson Particles and Field   14S.1. Gates, M. T. Grisaru, M. Rocek, and W. Siegel, SuperJpaces (Benja-
  Theory, Brandeis 1964, Yol. 2, edited by S. Deser and K. W. Ford (Pren-        min/Cummings, Reading, MA, 1983). See also A. Chados, A. I. Hauser,
  tice-Hail, Englewood Cliffs, NJ, 1965); J. D. Talman, Special Functions,       and y. A. Kostelecky, Phys. Lett. B ISO, 431 (1985); H. van Dam, Y. J.
  A Group Theoretical Approach Based on Lectures by E. P. Wigner (Benja-         Ng, and L. C. Biedenharn, ibid. 158,227 (1985).
  mm, New York, 1968); S. P. MisraandJ. Maharana, Phys. Rev. 0 14,133          !5L C. Biedenharn, M. Y. Han, and H. van Dam, Phys. Rev. D 6, 500
  (1976).                                                                        ( 1972).
2235        J. Math. Phys., Vol. 27, NO.9, September 1986                                                                Han, Kim, and Son         2235
GROUP CONTRACTIONS                                                                                                                        387
                   0]
               [0 00,
                                                                                        then x and y can be written as
            -i
       L3=i                                                                                   x = cos ,p, y = sin ,p,                                              (2.14)
                               i]
          000                                                                           and the transformation ofEq. (2.12) takes the form
                                                                                                                                                                      ].
                                                                               (2.4)
                                                  oo     0]                                    a
                                                                                                     -:~:a ~][~:~]=[ :~:~~:;~
                                                                                            COS
              0           0          [0
       P, = [ 0           0    0 , P,O                    i,                            [   sma
                  o       0    0      0           o      0                                   u                         I         z         z+ucos,p+vsin,p
and satisfy the commutation relaiions                                                                                                                  (2.15)
[P"P,] = 0,               [L 3 ,P, ] = iP"       [L 3 ,P,] = - iP"             (2.5)    We shall see in the following sections how this cylindrical
which form the Lie algebra for E(2).                                                    group describes gauge transformations for massless parti-
     The above commutation relations are invariant under                                cles.
the sign change in P, and P,. They are also invariant under
Hermitian conjugation. Since L3 is Hermitian, we can re-
place P, and P, by                                                                      III. E(2)-L1KE LITTLE GROUP FOR PHOTONS
       Q,   =-        (P,)',       Q, =    -    (P,)t,                         (2.6)        Let us consider a single free photon moving along the z
                                                                                        direction. Then we can write the four-potential as
respectively, to obtain
                                                                                              A "'(x) =APeiw(z-t),                                                  (3.1)
       [Q"Q,J         = 0,     [L 3 ,Q'] = iQ"           [L 3 ,Q,J     = - iQ,.
                                                                               (2.7)    where
These commutation relations are identical to those for E (2)                               A ,. = (A I.A ,.A ,.A o).
given in Eq. (2.5). However, Q, and Q, are not the genera-                              The momentum four-vector is clearly
tors of Euclidean translations in the two-dimensional space.
Let us write their matrix forms:                                                              pi' = (O,O,UJ,UJ).                                                   (3.2)
                                                                                        Then, the little group applicable to the photon four-potential
       Q, =    [~ ~ ~],
                  i       0    0
                                         Q, =    [~ ~ ~].
                                                  0      i       0
                                                                               (2.8)
                                                                                        is generated by
                                                                                                              -i           0
HereL 3 is given in Eq. (2.4). As in the case ofE(2), we can
consider the transformation matrix                                                            J,-   [1        0
                                                                                                              0
                                                                                                              0
                                                                                                                           0
                                                                                                                           0
                                                                                                                           0    11
       C{u,v,a) = C(O,O,a)C{u,v,Q),                                            (2.9)
                                                                                                                                 il
                                                                                                                                                                   (3.3)
where C{ O,O,a) is the rotation matrix and takes the form
                                                                                                              0        -i                            0         0
                                                       Si~ a
                                                             a       -sina
                                                                      cos a
                                                                        o      ~]   ,
                                                                                              N'~[1           0
                                                                                                              0
                                                                                                              0
                                                                                                                       0
                                                                                                                       0
                                                                                                                       0
                                                                                                                                        N,~ [1       0
                                                                                                                                                     i
                                                                                                                                                     i
                                                                                                                                                           -i
                                                                                                                                                           0
                                                                                                                                                           0        11
                                                                              (2.10)
                                                                                        These matrices satisfy the commutation relations:
                                                                                              [J,.N,] = iN"                [J"N,] = - iN"          [N,.N,] = 0,
       C(u,V,Q) = exp[ - i(uQ, + vQ,) J =                            [~ ~ ~].           which are identical to those for E(2). From these genera-
                                                                                                                                                              (3.4)
The multiplication of the above two matrices results in the                                   D(u,v,a) = D(O,O,a)D(u,v,O),                                         (3.5)
most general form of C( u,v,a). If this matrix is applied to
                                                                                        where
the column vector (x,y,z) , the result is
                                                                                              D(u,v,O) = exp[ - i(uN , + vN,) J,
       [   c~sa
           sma        :o::n a ~][;]              = [:     :~:: ~;        ::::] .              D(O,O,a) =R(a) =exp[ -iaJ,].
           u          v              I     z                 z + ux + vy                We can now expand the above formulas in power series, and
                                                                              (2.12)
                                                                                        the results are
This transformation leaves (x' + y') invariant, while z can
vary from - 00 to + 00. For this reason, it is quite appro-
priate to call the group of the above linear transformation the                                              COS   a           -sina
cylindrical group. This group is locally isomorphic to E(2).                                             [   sin a              cos a
                                                                                                              ~                                                    (3.6)
     If, for convenience, we set the radius of the cylinder to be                             R(a)=                              o
unity,                                                                                                                           o
       (x'+y') = I,                                                           (2.13)    and
,,76          J. Math. Phys .• Vol. 28, No.5, May 1987                                                                            Y. S. Kim and E. P. Wign.,        1176
GROUP CONTRACTIONS                                                                                                                                389
D(u,v,O)                                                                                                                    o o
                                                                                                          0]o                       [0
                                                                                              0      0
                                                         +~V')/2]' N~ ~
                    o           -u                                                            0      0                N     o oI     0
                         1 - (u'
                          _
                                -v
                                  + v')/2
                              (u' + v')/2
                                                   (u'
                                                 1 + (u'   + v')/2
                                                                                        [:    0
                                                                                              0
                                                                                                     0
                                                                                                     0
                                                                                                          ~'          2=V2~   o
                                                                                                                            o o                11
                                                                                                                                              (3.12)
                                                                     (3.7)   As a consequence, D( u,v) takes the form
When applied to the four-potential, the above D matrix per-
                                                                                                                0         0
                                                                                  D("D){~,
forms a gauge transformation: while R(a) is the rotation
matrix around the momentum.                                                                                               0
                                                                                                                               1]
                                                                                                                                              (113 )
      The D matrices of Eq. (3.5) have the same algebraic                                                      vlV2       I
property as that for the Ematrices discussed in Sec. II. Why,                                                   0         0
then, do they look so different? In the case of the 0(3)-like                andR (a) remains the same as before. It is now clear that the
little group, the four-by-four matrices of the little group can              four-by-four representation of the little group is reduced to
be reduced to a block diagonal form consisting of the three-                 one three-by-three matrix and one trivial one-by-one matrix.
by-three rotation matrix and one-by-one unit matrix.' Is it                  If we use )3' lV" and lV, for the three-by-three portion of the
then possible to reduce the D matrices to the form which can                 four-by-four J" N" and N, matrices, respectively, then
be directly compared with the three-by-three E or C matrices
discussed in Sec. II?
                                                                                 .I, = L"    lV, =   (lIV2)Q"             lV, =    (lN2)Q,.   (3.14)
      One major problem in bringing the D matrix to the form                 Now the identification ofE(2)-like little group with the cy-
of the E matrix is that theD matrix is quadratic in the u and v              lindrical group is complete.
variables. In order to attack this problem, let us impose the
Lorentz condition on the four-potential:
                                                                             IV. THE CYLINDRICAL GROUP AS A CONTRACTION OF
       ~ (A "(x)) =P"A"
       axil
                                 (x) = 0,                            (3.8)   0(3)
                                                                                   The contraction of 0(3) to E(2) is well known and
resulting in A, = Ao. Since the third and fourth components
                                                                             discussed widely in the literature. ' The easiest way to under-
are identical, the N, and ;V, matrices of Eq. (3.3) can be
                                                                             stand this procedure is to consider a sphere with large radius,
replaced, respectively, by
                                                                             and a small area around the north pole. This area would
                                                                             appear likea flat surface. We can then make Euclidean trans-
                        o o                                                  formations on this surface, consisting of translations along
                        o o                                                  the x andy directions and rotations around any point within
                        o o                                                  this area. Strictly speaking, however, these Euclidean trans-
                                                                             formations are SO (3) rotations around the x axis, y axis, and
                        o o
                                                                             around the axis which makes a very small angle with the z
                                                                             axis.
At the same time, the D(u,v,O) ofEq. (3.7) becomes
                                                                                   Let us start with the generators of 0(3), which satisfy
       Di.,D) ~ [~ : ~ ;]
                                                                             the commutation relations:
                                                                                                                                               (4.1)
                                                                 (310)
                                                                             Here L3 generates rotations around the north pole, and its
                                                                             matrix form is given in Eq. (2.4). Also, L, and L, take the
This matrix has some resemblance to the representation of                    form
the cylindrical group given in Eq. (2.11).'
                                                                                             o
     In order to make the above form identical to Eq. (2.11),
we use the light cone coordinate system in which the combi-                      L, =   [~   o       ~   /],    L,    =   [~ ~ ~].      (4.2)
nationsx,y, (z + t)1V2, and (z - t)/V2 are used as the coor-                                         o                    0 0 -/
dinate variables." In this system the four-potential of Eq.                  For the present purpose, we can restrict ourselves to a small
(3.1) is written as                                                          region near the north pole, where z is large and is equal to the
                                                                             radius of the sphere R, and x and yare much smaller than the
       A I'   =   (A,A,,(A,   + A o )/V2,(A, -    Ao)IV2).       (3.11 )
                                                                             radius. We can then write
The linear transformation from the four-vector of Eq. (3.1)
to the above expression is straightforward. According to the
                                                                                                                                               (4.3)
Lorentz condition, the fourth component of the above
expression vanishes. We are thus left with the first three
components.                                                                  The column vectors on the left- and right-hand sides are,
     During the transformation into the light-cone coordi-                   respectively, the coordinate vectors on which the E(2) and
nate system, J, remains the same. Ifwe take into account the                 o (3) transformations are applicable. We shall use the nota-
fact that the fourth component of A I' vanishes, N, and N,                   tion A for the three-by-three matrix on the right-hand side.
become                                                                       In the limit oflarge R,
1177          J. Math. Phys .. Vol. 28. No.5. May 1987                                                          Y. S. Kim and E. P. Wigner      1177
390                                                                                                                         CHAPTER VII
       L3=ALy4 -',
       P, = (IIR)AL,A -',                                (4.4)
       P, = - (IIR)AL,A -'.
This procedure leaves L, invariant. However, L, and L, be-
come the P, and P, matrices discussed in Sec. II. Further-
more, in terms of P" P, and L" the commutation relations
for 0(3) given in Eq. (4.1) become
       [L 3 ,P,] = iP"      [L 3 'p,] = - iP"
                                                         (4.5)
       [P"P,] = -i(IIR)'L3.
In the large-R limit, the commutator [P" P,] vanishes, and
the above set of commutators becomes the Lie algebra for
E(2).
     We have so far considered the area near the north pole
                                                                  FIG. 1. Contraction of the three-dimensional rotation group to the two-
wherezis much larger than (x' + y') 1/2. Let us next consid-      dimensional Euclidean group and to the cylindrical group. The rotation
er the opposite case, in which (x' + y') ,/, is much larger       around the z axis remains unchanged as the radius becomes large. In the
than z. This is the equatorial belt of the sphere. Around this    case of E( 2), rotations around they and x axes become translations in the x
                                                                  and - y directions. respectively, within a flat area near the north pole. In
belt, x and y can be written as
                                                                  the case of the cylindrical group, the rotations around they andx axes result
       x=Rcos</J, y=Rsin</J.                             (4.6)    in translations in the negative and positive z directions, respectively, within
                                                                  a cylindrical belt around the equator.
We can now write
                                 o
       [C~s</J]
      SID</J =
                 [IIR
                    0     IIR                            (4.7)
         z          0       o                                     mation. Therefore if the boost matrix takes a diagonal form
                                                                  as in the case ofEq. (4.3) or Eq. (4.7), we should be able 10
to obtain the vector space for the cylindrical group discussed
in Sec. II. The three-by-three matrix on the right-hand side      obtain N, and N, by boosting J, and J" respectively, along
                                                                  the z direction. 7
of the above expression is proportional to the inverse of the
matrix A given in Eq. (4.3). Thus in the limit oflarge R,              Indeed, in the light-cone coordinate system, the boost
                                                                  matrix takes the form
       L3 =A -'Ly4,
       Q, = - (lIR)A -'L,A,                              (4.8)                               o o
                                                                                               o
                                                                                                           oo 1'
       Q, = (lIR)A -'L,A.                                                                                   o                             (5.1 )
In terms of L 3 , Q" and Q" the commutation relations for                                    o R
0(3) given in Eq. (4.1) become                                                               o o         11R
       [L 3 ,Q,] = iQ"      [L 3 ,Q,] = - iQ"
       [Q"Q,]     = -    i(lIR)'L 3,                     (4.9)           _( 1+11)'/'
                                                                       R-      1_11 '
which become the Lie algebra for E(2) in the large-R limit.
The contraction of 0(3) to E(2) and to the cylindrical            where 11 is the velocity parameter of the particle. Under this
group is illustrated in Fig. I.                                   boost, J 3 will remain invariant:
                                                                       Ji=BJ3B-'=J3 •                                                    (5.2)
                                                                  Here J, and J, in the light-cone coordinate system take the
                                                                  form
V. E(2)-LIKE LITTLE GROUP AS AN INFINITE-
                                                                                                               il
MOMENTUM/ZERO-MASS LIMIT OF THE O(3)-LIKE                                                    0        0
                                                                       J,- ~ [:
LITTLE GROUP FOR MASSIVE PARTICLES
                                                                                             0       -i
     If a massive particle is at rest, the symmetry group is
                                                                                                     0
generatedbytheangularmomenlumoperatorsJ"J"andJ3 •
                                                                                     0       -i      0
If this particle moves along thez direction, J 3 remains invar-
iant, and its eigenvalue is the helicity. However, what hap-                                                                             (5.3)
                                                                       J'-~[ :
                                                                                                 0
                                                                                                            ~']o .
pens to J, andJ" particularly in the infinite-momentum lim-
it?                                                                                              0   0
     In orJer to tackle this problem, let us summarize the                     v2     -I         0   0
results of the preceding sections. The generators of the E (2)-                          i       0   0       0
like little group can be reduced to those of the cylindrical
group. The cylindrical group can be obtained from the three-      If we boost this massive particle along the z direction, the
dimensional rotation group through a large-radius approxi-        boosted J, and J, become
1178          J. Math. Phys., Vol. 28, No.5, May 1987                                                    Y. S. Kim and E. P. Wign.,        1178
  GROUP CONTRACTIONS                                                                                                                                               391
J; =BJ,B          ,~ ,'2 1 [:
                                0
                                        0
                                       iR
                                                       i/R
                                                       0
                                                               ,;o1'                 London Ser. A 368, 429 (1979); H. van Dam, Y. J Ng, and L. C. Bieden-
                                                                                     ham. Phy,- Lett B 158, 227 ( 1Qg5) For a recent textbook on thiS subject,
                                                                                     see Y. S. Kim and M. E. Noz, Theory and AppiicatlOnx oj/he Poincare
                                                                                     Group (Reidel, Dordlecht, Holland, 1(86).
                                0     - ilR            0        0                   lE. P. W!gner, Rev. Mod. Phy~. 29, 255 (1957). See also D. W. Robinson,
                                                                        (SA)         He\v. Phys. Acta 35, 98 ( 1962); D. Korff,!. Math. Phy,. 5. 869 ( 1964); S.
                                                          Tl
                                                                                     Weinberg, in LeClurewn Particles and Field Theory, Brandeis 1964, edited
                                           0     ilR
                    ~~PR
                                                                                     by S. Deser and K. W. Ford (Prentice-Hall, Englewood Cliffs, NJ, 1965)
                                           0      0                                  Vol 2; S. P. MisraandJ. Maharana, Phys. Rev. D 14,133 (1976); O. Han,
J, =BJ,B                                                                             Y. S. Kim. and D. Son, J. Math. Phys. 27, 2228 (1986).
                                           0      0                                4S. Weinberg, Phys Rev. B 134, 882 (1964); B 135, 1049 (1964); 1. Ku-
                                ilR        0      0                                  perzstych, N uovo Omento B 31, I (1976); D. Han and Y> S. Kim, Am. 1.
                                                                                     Phys. 49, 348 (1(81); 1. J. van der Bij, H. van Dam, and Y. 1. Ng, Physica
                                                                                     A \16. 307 (1982). D. Han. Y. S. Kim, and D. Son, Phys. Rev. D 31, 328
 Because of the Lorentz condition, the iR terms in the fourth                        ( 1985).
 column of the above matrices can be dropped. Therefore, in                        ~D. Han, Y. S Kim, and D. Son, Phys. Rev. D 26,3717 (1982). For an
 the large-R limit which is the limit of large momentum,                             earlier effort to study the E( 2 )-like little group in terms of the cylindrical
                                                                                     group, sec L. J. Boya andJ. A. de Azcarraga, An. R. Soc. Esp. Fis. Quim. A
       N,= -(1IR)J;,                N 2 =I1IR)J;,                        (5.5)       63, 143 (1967). We are grateful to Professor Azcarraga for bringing this
                                                                                     paper to our attention.
 where N, and N, are given in Eq. (3.12). This completes the                       {'P. A. M. Dirac, Rev. Mod. Phys. 21, 392 (1949); L. P. Parker and O. M.
 proof that the gauge degrees of freedom in the E(2)-like                            Schmieg, Am. J. Phys. 38, 218,1298 (1970); Y. S. Kim and M. E. Noz, J.
 little group for photons are Lorentz-boosted rotational de-                         Math. Phy,. 22, 2289 (1981).
                                                                                   70. Han, Y. S. Kim, and D. Son, Phy~. Lett. B 131, 327 (1983); D. Han, Y.
 grees of freedom. The limiting process is the same as the
                                                                                    S. Kim, M. E. Noz, and D. Son, Am J. Phys. 52, 1037 (1984). These
 contraction of the three-dimensional rotation group to the                          authors studied the correspondence between the contraction of O( 3) to
 cylindrical group.                                                                  E(2) and the Lorentz boost of the O(3)-like little group.
1179        J. Math. Phys., Vol. 28, No.5, May 1987                                                                     Y. S. Kim and E. P. Wigner            1179
Chapter VIII
Localization Problems
                                         393
LOCALIZATION PROBLEMS                                                                                                           395
                     Reprinted from REVIEWS       OF MODERN PHYSICS,       Vol. 21, No.3, pp. 400-406, July. 1949
                                                             Printed in U. S. A.
                It is attempted to formulate the properties of localized states on the basis of natural invariance require-
              ments. Chief of these is that a state, localized at a certain point, becomes, after a translation, orthogonal
              to all the undisplaced states localized at that point. It is found that the required properties uniquely define
              the set of localized s~tes for elementary systems of nOD-zero mass and arbitrary spin. The localized func-
              tions belong to a continuous spectrum of an operator which it is natural to call the position operator. This
              operator has automatically the property of preserving the positive energy character of the wave function
              to which it is applied (and it should be applied only to such wave functions). It is believed that the develop-
              ment here presented may have applications in the theory of elementary particles and of the collision matrix.
on the basis of which operators for the position coordi-             the wave functions of those states which are, at time
nates can be found.                                                  1=0, localized at the origin of the coordinate system.
   If we restrict ourselves to an elementary system, the                We postulate that the states which represent a
physical interpretation of the operators to be found is              system localized at time 1=0 at x=y=z=O: (a) form
unique: they will correspond to the position of the                  a linear set So, i.e., that the superposition of two such
particle if we deal with an elementary particle. Other-              localized states be again localized in the same manner;
wise they may correspond to the center of mass of the                (b) that the set So be invariant under rotations about
system. If the system is not elementary, the interpreta-             the origin and reflections both of the spatial and of the
tion will not be unique and neither will our postulates              time coordinate; (c) that if a state'" is localized as
lead us to a uniquely determined set of operators.                   above, a spatial displacement of'" shall make it orthog-
   Before proceeding with our argument, we wish to                   onal to all states of So; (d) certain regularity conditions,
refer to other investigations with somewhat similar                  amounting essentially to the requirement that all the
objectives. The problem of the center of mass in                     infinitesimal operators of the Lorentz group be appli-
relativity theory has been treated particularly by                   cable to the localized states, will be introduced later.
Eddington' and by Fokker' on the basis of non-quantum                   It is to be expected that the states localized at a
mechanics. Their work was evaluated and a quantum                    certain point have the same properties as characteristic
mechanical generalization thereto given by Pryce.' We                functions of a continuous spectrum, i.e., they will not
shall have frequent occasion to refer to his results.                be square integrable but the limits of square integrable
Ideas related to Pryce's work have been first put                    functions. It seems to us that the above postulates are
forward by Schrodinger' and, more recently, by                       a reasonable expression for the localization of the system
Finkelstein' and also by Mpller. 7                                   to the extent that one would naturally call a system
   The present paper arose from a reinvestigation of the             unlocalizable if it should prove to be impossible to
irreducible representations' of de Sitter space which                satisfy these requirements.
was undertaken by one of us.' These representations                     We shall carry out our calculations in the realization
are in a one to one correspondence with relativistically             of the elementary systems which was described by
invariant wave equations for elementary systems in de                Bargmann and WignerlO and will proceed with the
Sitter space. At the conclusion of the investigation it              calculation.
appeared that the physical content of the equations
which were obtained could be understood much more                           Particle with no spin (Klein-Gordon particle)
readily if position operators could be defined on an
                                                                        The determination of the localized state is particu-
invariant theoretic basis. As an introduction to this,               larly simple in this case. It will be carried out in some
a similar investigation was undertaken in flat space                 detail in spite of this, because the same steps occur in
with the results given in the following sections.                    the consideration of systems with spin.
                                                                       The wave functions are defined, in this case, on the
      POSTULATES FOR LOCALIZED STATES AND
              POSITION OPERATORS                                     positive shell of a hyperboloid PO'=P,'+P,'+p,'+J.l'
                                                                     and we shall use P', p2, P' as independent variables. In
   The position operator could easily be written down                any formula, po is an abbreviation for (P,'+P,'+p,'
if the wave function of the state (or the states) were               +J.l')I. The invariant scalar product is
known for which the three space coordinates are zero
at t=O. If '" is such a function and T(a) the operator
of displacement by ax, a., a" a" the wave function
T(a)-l", represents a state for which the space coordi-
nates are ax, au, a, at time a,. Thus the knowledge of               The wave function <I> in coordinate space becomes
the wave functions corresponding to the state X= y= Z
=0 at t=O (and the knowledge of the displacement
operators) entails the knowledge of all localized states,
                                                                     <I>(x" x', x', x')= (2.,..)-1   f   q,(p"   P" p,)
i.e., of all characteristic functions of the position                                          Xexp( -i{x, p})dp,dp,dp,/ Po,         (2)
operators. From these, the position operators are easily             where
obtained. For this reason we concentrated on obtaining
                                                                     {x, p} =XOP'-X'p'-x'p2_x'p2
  'A. S. Eddington, Fundamental Theory (Cambridge University                               =XOPO-XlP,-X,P,-x,p"                      (3)
Press, London, 1946).
  'A. D. Fokker, Relativitatstheorie (Groningen, Noordhoff, 1929).   is the Lorentz invariant scalar product. Throughout
  • M. H. L. Pryce, Proe. Roy. Soc. 195A, 62 (1948).                 this paper, the covariant and contravariant components
  'E. Sehr5dinger, Ber!' Ber. 418 (1930); 63 (1931).                 are equal for the time (0) coordinate, oppositely equal
  'R. J. Finkelstein, Phys. Rev. 74, 1563A (1948).
  7 Chr. M~ller, Comm. Dublin Inst. for Adv. Studies A, No.5
                                                                     for the space (1, 2, 3) coordinates. This governs the
(1949); also A. Papapetrou, Acad. Athens 14, 540 (1939).
  8 L. H. Thomas, Ann. of Math. 42, 113 (1941).                        10   V. Bargmann and E. P. Wigner, Proc. Nat. Acad. Sci. 34,
  9 T. D. Newton, Princeton Dissertation (1949).                     211 (1948).
LOCALIZATION PROBLEMS                                                                                                         397
raising and lowering of all indices. Occasionally, we                 As far as postulates (a), (b), (e) are concerned, I{; could
shall use for the scalar product of two space-like vectors         be a discontinuous function, being +pol= (P'+I") I for
the notation (x,p) so that, e.g., {x,pi=xop'-(x·p).                some p, and -(P'+I")I for the remaining p. However,
  The linear manifolds which are invariant with respect            no matter how I{; is chosen, consistently with (7), there
to rotations about P.=P2=P,=0 are, for any integer j,              is, in this case, only one state localized at the origin
the 2j+ 1 functions                                                because if there were two, say I{;1, and I{;" the I{;1 would
                                                                   have to be orthogonal not only to I{;1 exp( -i(a, p)) but
         Pmi((}, cp)j(p)   (m= -j, -HI, .. oj-l,}), (4)
                                                                   also to I{;, exp( -i(a, p)) from which not only 1I{;1'~Po
where p, tJ and cp are polar coordinates for p., P" pa and         but also I{; 1*I{;,~ po and hence the proportionality of I{;1
j is an arbitrary function. The Pmi are the well-known             to I{;, follows.
spherical harmonics. The sets (4) are also invariant                  In order to eliminate the discontinuous I{; as localized
with respect to inversion, i.e., replacement of PI, p" pa          state, we introduce the further regularity condition that
by -PI, -P" -pa. Naturally, not only a single set
                                                                                                                               (8)
(4) has these properties of invariance but the sum of
an arbitrary number of such sets as long as one includes           shall remain finite as the normalizable wave functions
with one function (4) all 2j+ 1 functions and their                I{;n approach I{;. The Mo. is the infinitesimal operator of
linear combinations. The j(p) could be different for               a proper Lorentz transformation in the x!'x' plane, its
different j.                                                       operator is lO
   Under time reversal I{;(p.p,pa) goes over into\!
                                                                                         MOk = ip°i)/ iJp•.                   (8a)
              EJV;(p1, p" pa)=I{;(-p., -P" -palO.            (5)
                                                                   This further postulate eliminates all discontinuous I{;
We understand by time reversal the operation which                 and we obtain for the wave function of the only state
makes out of a wave function I{; the wave function IJif;           which is localized at the origin
on which every experiment, if carried out at - t, yields
same results as the same experiment carried out on I{;                                                                         (9)
at time t. Because of (5) and our postulate (b), if                The regularity requirement (d) actually asks for the
the Pmi(O, cp)j(p) are localized at the origin, the set            finiteness of (8) for all Mkl. However, if one substitutes
P ~i(O, cp)j(p)*, i.e., the P+mi(O, cp)j(p)* are also local-       M23, M31 or M12 for MOk in (8), the resulting expression
ized. The same is then true for the sum and difference             is automatically bounded-in fact their sum is j(j+ 1).
of the corresponding pairs of functions which shows                Hence requiring the applicability of the M", M31, M12
that the f(p) can be assumed to be real without loss               to I{; does not introduce a new condition.
of generality.                                                        The localized wave function in coordinate space is
   The displacement operator in momentum space is                  obtained by (2). It is, apart from a constant"
simply multiplication with exp( -i{a, pi);
                                                                                                                              (9a)
                    T(a)I{;= exp( -if a, pi )1{;.            (6)
                                                                   It goes to zero at r= 00 as e-.', at r= 0 it becomes
We shall have to consider purely space-like displace-
                                                                   infinite as r- 5I'. It is, of course, not square integrable
ments, i.e., assume that aO=O. It then follows from
our postulate (e) that, in particular, exp(i(a, P))f is            since it is part of a continuous spectrum.
                                                                      Applying the operator of displacement to (9) we
orthogonal to I{; if f is localized, or that
                                                                   obtain for the wave function of the state which is
can transform (10) in a well-known fashion The consistency of these was shown before. to The y.
                       P')
                                                                   apply to ~., two y with different first indices commute,
qkq,(p)= ( i a
             - -i- - (21r)-'                                       with the same first index they satisfy the well-known
           iifJ' 2 po'                                             relations
                                                                                                                      (13a)
                                                                   The great difference between the present case and that
                                                                   of zero spin consists in the limitation (13) of the
                                                                   permissible wave functions, in addition to the limitation
       =   -i(~+.f..)q,(P)'                                 (11)
                                                                   to the positive hyperboloid. This latter limitation can
                                                                   be taken care of by using only the PI, P" p. as inde-
                 ap. 2po'
                                                                   pendent variables, the former limitation cannot be
These expressions are valid for finite as well as for              taken care of in an equally simple fashion. We shall
vanishing rest mass. It is remarkable that the operator            make extensive use, however, of a device, most success-
rf can be transformed into coordinate space and retains            fully employed by Schrodinger' and define operators
a relatively simple form
                                                                                E.=~(pO)-I(LY.'p,+I')y.o.               (14)
                        1 fexP(-1'1 x-yl) a<J>(y)                  This is a projection operator: E.'=E. and E.¥t auto-
  qk<J>(x) =   x'<J>(x)+-                 -dy.              (12)
                       8..      Ix-yl      oy.                     matically satisfies the corresponding Eq. (13). Denoting
                                                                   the product of all the E. by E
x and y stand for the spatial part of the four vectors x·
and y. and dy indicates integration over yl,           t,
                                                   y'. The
                                                                                                                      (14a)
customary rf operator contains only' the first term of             any E¥t is a permissible wave function, satisfying all
(11),                                                              Eqs. (13).
   It may be well to remember at this point that the                 For the scalar product, we shall use the expression
position operators to which our postulates lead neces-
sarily commute with each other so that only Pryce's
case (e) can be used for comparison. In fact, our rf is
identical with his 'i/. It may be pointed out, second,
that a state which is localized at the origin in one               It follows from this at once, because of our postulate
coordinate system, is not localized in a moving coordi-            (e), and since (6) is valid in this case also, that every
nate system, even if the origins coincide at t=O.                  wave function which is localized at the origin satisfies
Hence our operators rf have no simple covariant mean-              the analogon of (i):
ing under relativistic transfarmations. This is not the
case for the customary operators rf either. Further-                                Lfl.yI'~(2")-'PoMl.                (16)
more, even though it appears that <J>(x) = <I (x) is in-           The operator for time reversal is
variant under relativistic transformations which leave
the origin unchanged, this is not much more than a
mathematical quirk. One sees this best by transforming             where C is a matrix which operates on the ~ coordinates
the <I-function to momentum space through the inver-               and satisfies the equations
sion of (2). The result, po, seems to have a simple
covariant meaning. However, it does not represent a                  Cy"* = y.oC ; (Ol= 1,2, .. ',25)
square integrable function and if one approximates it                Cy.'*=-y.'C. (0l=1, 2, ···,25;k=1,2,3). (lia)
by one, say by ¥t.= po exp( -a'po') , the Lorentz trans-           If yO, y', y' are real, yl imaginary
form of ¥t. will not approach ¥t. with decreasing Ol. In
fact, as soon as 0l1'«1, the scalar product of ¥t. and its                            ,.
transform will be independent of Ol and smaller than                             C=   II j'h.';   C'=(-)2..           (lib)
the norm of ¥t•.
                                                                                      ~I
           Particles with spin and finite mass                     Since C, as defined above, is a real matrix we also have
                                                                   8'= (- )" which is true independently of the chcice of
   We again use the description given in reference 10,             the y-matrices. The operator for the inversion of the
i.e., define wave functions on the positive hyperboloid            space coordinates is
po'= PI'+P,'+P,'+I" and use in addition to PI, P" P.
the 25 spin variables tl, t" ... t,. all of .them four-              N(pI, P" p,)=.yt'y,o .. 'y,N( -PI, -P" -p,); (Iii)
valued. The wave functions which describe the possible             it commutes with the E. of (14).
states of the system will be symmetric functions of the               In order to determine the sets of wave functions
t and satisfy the 25 equations                                     which are invariant under rotations, we first define the
               2:" y.'P,¥t = I'¥t Ol= 1, 2,   ,,·25.        (13)   analogue of the pure spin function for the relativistic
LOCALIZATION PROBLEMS                                                                                                                      399
Eqs. (13). For this purpose we define auxiliary functions                giving a totalj from wave functions with given "orbital
Vm which are independent of PI, P" p, and functions of                   momentum" / and "spin momentum" s.
the ~ only. They satisfy the equations                                      Since the polar angles tJ, q, are indeterminate for p=O,
                                                                         the fl(P) must vanish for p=O unless /=0. Otherwise,
                     y.oVm=Vm       (a=l,   2,' ", 2s)           (19)
and                                                                      the !/tjm would become singular at p=O and the MO.
                                                                         could not be applied to them in the sense of the bound-
!i~a 'Yct1"Ya2Vm=mvm                                                     edness of (8). (Actually it is necessary to postulate this
                           (m= -s, -s+ 1, .. " s-l, s).        (19a)     equation for the square of MOk instead only for MOk.)
                                                                         It follows that !/tim vanishes at p= 0 unless the series
Since the yO and the iy'y' commute, it is possible to
                                                                         (21) contains a term with /=0. However, (16) shows
assume temporarily that they are all diagonal. Equation
                                                                         that !/t cannot vanish at this point if the rest mass is
(19) then demands that they all belong to the character-
                                                                         finite and that, hence, every localizable wave function
istic value + 1 of each yO; there are 2" such functions.
                                                                         must have an /=0 term in its expansion. This happens
However, we are interested only in symmetric functions
                                                                         only if j = s and the wave function 'has even parity.
of the ~ and there are only 2s+ 1 of these. They are
                                                                         If the parity of !/t;m is odd, only /I, f3, etc., enter (21)
distinguished by the index m: the Vm has non-zero
                                                                         and these still vanish at p=O. It follows that the wave
components only for those t for which s+m of the
                                                                         functions which are localized at the origin all have
iyly' are +1, the remaining s-m are -1. For these ~
                                                                         angular momentum j = s and the form
the value of Vm is «s+m) !(s-m) !/(2s) !)I2-' so that
Vm is normalized in the sense
                                                                                        ,.
                                                                                 "'m=   L L S(/, S) •. m_m'.m'P'm-m,(tJ, q,)h Vm'.       (21a)
                                                                                        l=O m'
                                                               (19b)
Physically, m corresponds to the spin angular momen-                     We now skip the part of the calculation which deals
                                                                         with the determination of the fl and give only the
tum about the ~ axis, the parity of Vm is even because
of (19) and (18).                                                        result: The wave functions localized at the origin are
  The v.. are not permissible wave functions because                     the 2s+ 1 functions
they do not satisfy the wave Eqs. (13). We therefore                     !/tm= (211")-3"2'Po"+!(Po+IL)-'
define as spin functions                                                                             X Vm(PIP,P,; h,'"          .~.)    (21b)
V..(PI, P"      p"   ~l,   "',   t,.)=Ev,. (m= -s, "', s). (20)          (i.e., the /=0 term alone remains from (21a)). Actually,
                                                                         this result is far from being surprising."
They are permissible wave functions of even parity                          The operator for the position coordinate can be
and Vm represents a state of angular momentum mk                         calculated in exactly the same way as this was done in
about the third coordinate axis. Their normalization                     the case of zero spin, and gives
is, instead of (19b)
                       Lt 1Vml'= «Po+IL)/2po)".                (20a)
                                                                                    ,.      Po"+! (       Po-I
                                                                                <t=EII (1+1'.0)-- - i - - - E . (22)
                                                                                                                     a)
   The most general solution of (13) is a linear combi-
                                                                                    ~l     (Po+IL)' ap. (Po+IL)'
nation of the V.. multiplied with arbitrary functions of                 For s=! this again agrees with Pryce's result' for his
the PI, P" p,. A set of wave functions which is invariant                case (el, i.e., for his operator q.
under rotations and reflections contains wave functions                    The significance of the projection operators E in
of the form                                                              (22) is only to annihilate any negative energy part of
                                                                         the wave function to which it is applied and to produce
       I/;;m=    L S(l, S)I.~m'.m'P'~m,(tJ, q,)f,(P)Vm,.         (21)
                                                                         a purely positive energy wave function. Since q' is the
                l,m'
                                                                         position operator only for wave functions which are
The p, tJ, q, are again polar coordinates for pI, po, p3;
the f' are arbitrary unknown functions of the length of                    14   The proof runs as follows. One first shows, by considering
p. However, if one function of the form (21) occurs in                   >Pm±e.y-m that the J,    of (21a) can all be assumed to be real. One
                                                                         then subdivides >Pm into two parts: the 1= 0 part of the sum (21a)
the set, all others with different m but the same fl also                will be denoted by >P', tbe rest >p'. As we have seen, ",,"is finite at
occur. The summation over / is to be extended over all                   p=O, while ",r vanishes at that point. The proof then consists in
even values between [j-sl and j+s if the parity of                       showing that there can be no region in which ",r is finite but very
                                                                         much smaller than >p'. It then follows from the continuity of botb
the!/tl is to be even, over all odd values of / if the parity            >P' and ,p' that the latter vanishes everywhere. Inserting ,p'+,p'
of I/;i is odd. The SCI, s) are the customary coefficients"              for >P in (16), one can neglect in the aforementioned region the
                                                                         square of 1/Ir as compared with the other terms. The right side, as
   13 See e.g., E. P. Wigner, Gruppentheorie, etc. (V. Vieweg & Sobn,    well as the term from the square of 1/10, are independent of iJ and Ip.
Braunschweig, 1931). The composition of the V and the spherical          This must be true, therefore, also for the term arising from the
harmonics pi to the 1/11 is the same operation as the composition of     cross product of 1/10 and t/lr, This term is, however, a sum of expres-
the spin functions with a definite S and the space coordinate func-      sions Pm'({J,,,)J,J. whicb cannot be independent of {J,,, except if
tions with a definite L, to functions of both, with definite J. This     allJ. with 1>0 vanish (j, is finite by assumption). It tben follows
composition is explained in Chapter XXII. The coefficients of            that tbe J. vanish everywhere and (21a) reduces to a single term.
the composition, i.e., our S(l, s) are calculated p. 202 ff. (tbey are   This can be obtained from (16) by taking the square root on both
denoted by ,ILS').                                                       sides.
400                                                                                                   CHAPfER VIII
defined on the positive hyperboloid alone, the E on           (22) transforms positive energy functions into positive
the right could logically be omitted. Both E can be           energy functions.
omitted' if one calculates a matrix element between              It is often stated that a measurement of the position
two purely positive energy wave functions. The factors        of a particle, such as an electron, if carried out with a
involving po are necessary in order to make iiJ!iJp;          greater precision than the Compton wave-length, would
hermitian: because of the factor PO-·_l in the volume         lead to pair production and that it is, therefore, natural
element (15), an operator is hermitian if it looks            that the position operators do not preserve the positive
hermitian after multiplication with po<+1 on the right       energy nature of a wave function. Since a position
and division with the same factor on the left. The            measurement on a particle should result in a particle
         z,                                                  at a definite position, and not in a particle and some
operator II !(1+YaO) is a projection operator, i.e., it is    pairs, this consideration really denies the possibility of
         0.=1
identical with its square and could therefore be inserted     the measurement of the position of the particle. If this is
into (22) once more before the second E, thus making         accepted it still remains strange that pair creation ren-
(22) somewhat more symmetric. The position operator          ders the position measurement impossible to the same
(11) for the Klein-Gordon particle is a special case of      degree in such widely different systems as an electron, a
(22) and can be obtained from (22) by setting s=O.            neutron and even a neutriono. The calculations given
  If one displaces a state by a and measures its x·          above prove, at any rate, that there is nothing absurd in
coordinate afterwards, the result will be greater by a"      assuming the measurability of the position, and the
than the x" coordinate measured on the undisplaced           existence of localized states, of elementary systems of
state. This leads to the relation                            non-zero mass. Moreover, the postulates (a), (b), (c) and
                                                              (d), which are based on considerations of invariance, de-
                 T(-a)q'T(a) = q'+a'                 (23)    fine the localized states and position operators uniquely
for aO=O. Inserting the expression (6) for T(a) and          for all non-zero mass elementary systems.
going to the limit of very small ak, one obtains                No similarly unique definition of localized states is
                                                             possible for composite systems. Although it remains
                 (q'pl_ plqk)q,= -io.!t/>           (23a)    easy to show that definite total angular momenta j can
where q, is any permissible wave function. Actually          be attributed to localized states, one soon runs into
one obtains by direct calculation, using in particular       difficulties with the rest of the argument. In particular,
the identity                                                 the summation in (16) must be extended not only over
             Ea(1+'YaO)E.= Po-l(po+p.)Ea          (24)       the spin coordinates ~ but also over all states with
                                                             different total rest mass and different intrinsic spin.
the commutation relation                                     As a result, one can, e.g., find states which can coexist
                                                     (25)    as localized states in the sense of our axioms even
                                                             though their j values are different. This is also what
The commutation relations of the q' with po are natu-        one would expect on ordinary reasoning since, if the
rally also the usual ones as po is a function of the pk      system contains several particles, the states in which
alone. Since the q" are the components of a vector           anyone of them is localized at the origin satisfy our
operator in three-dimensional space, their commutation       postulates. This holds also for the states in which
relations with the spatial components of Mk! are also        another one of the particles is so localized or for states
the usual ones.                                              in which an arbitrary linear combination of the coordi-
  We wish to remark, finally, that a consideration,          nates is· zero. Asa result, not only is the number of
similar to the above, has been carried out also for the      localized states greatly augmented but, further, one
equations with zero mass. In the case of spin 0 and t,       must expect to find many such large sets for which our
we were led back to the expressions for localized systems    postulates hold, although no two sets can be considered
which were given in (9) and (21b). However, for higher       to be localized simultaneously. In oth·er words, each
but finite s, beginning with $= 1 (i.e., Maxwell's           set of localized states is not only much larger for
equations), we found that no localized states in the         composite systems but one also has to make a choice
above sense exist. This is an unsatisfactory, if not         between many sets all of which satisfy our postulates
unexpected, feature of our work. The situation is not        by themselves. It does not appear that one can proceed
entirely satisfactory for infinite spin either.              much further in the definition of localized states for
                                                             composite systems without making much more specific
                     DISCUSSION
                                                             assumptions. Naturally, one can define as localized
   One might wonder, first, what the reason is that our      states those, which, in any of the elementary parts of
localized states are not the o-functions in coordinate       the composite system, appear localizable. It appears
space which are usually considered to represent localized    reasonable to assume that this definition corresponds
states. The reason is, naturally, that all our wave          to the center of mass of the whole system.
functions represent pure positive energy states. This           One may wonder, even in the case of elementary
is not true of the o-function. Similarly, our operator       particles, whether the determination of the localized
LOCALIZATION PROBLEMS                                                                                            401
states and positIOn operators has much significance.        had gone straight to the scattering center and then
Such doubts might arise particularly strongly if one is     continued in the new direction without any delay." In
inclined to consider the collision matrix as the future     order to answer such questions in the relativistic region,
form of the theory. One must not forget, however,           one will need some definition of localized states for
that the customary exposition of this theory refers         elementary systems. From this point of view it is
only to questions about cross sections. There is another    satisfactory that the localized states could be defined
interesting set of questions referring to the position of   without ambiguity just for these systems.
the scattered particles: how much further back (i.e.,
closer to the scattering center) are they than if they        15   L. Eisenbud, Princeton Dissertation (1948),
402                                                                                               CHAPTER VIII
846 A. S. WIGHTMAN
operators for relativistic particles was left without a          sional space at a given time, must satisfy the follow-
 clear resolution. That does not mean that papers                ing axioms:
 were not written on the subject, but that those                    Ca) S. is a linear manifold;
papers had completely different objectives in mind:                 (b) S. is invariant under rotations about a, re-
 They permitted the particles in question to be in              flections in a, and time inversions;
 nonphysical (negative energy) states or they studied               (c) S. is orthogonal to all its space translates;
 operators which could not serve as position observa-               (d) certain regularity conditions.
 bles since their three components did not commute.                 The solutions of (a) ... Cd) for elementary systems,
    In the opinion of the present author, the decisive          i.e., for systems whose states transform according to
 clarification of the relativistic case occurs in a paper       an irreducible representation of the inhomogeneous
of Newton and Wigner.' These authors show that,                  Lorentz group, turn out to be continuum wave func-
 if the notion of localized state satisfies certain nearly       tions when they exist at all, i.e., according to the
inevitable requirements, for a single free particle it           usual definitions of Hilbert space, there is no mani-
is uniquely determined by the transformation law                fold S•. However, it is physically and mathematically
of the wave function under inhomogeneous Lorentz                clear that Newton and Wigner's formulation ought
transformations. The resulting position observables              to be regarded as the limiting case of a notion of
turn out, in the case of spin-!, to be identical with            localizability in a region.
the Foldy-Wouthuysen "mean position" operators.'                    In the present paper, I propose a reformulation of
An analogous investigation for the case of Galilean             the physical ideas of (a) ... Cd) in terms of a notion
relativity was carried out by Inonti and Wigner.'               of localizability in a region. When the ideas are so
    The essential result of Newton and Wigner is that           formulated, one sees that the existence and unique-
for single particles a notion of localizability and a           ness of a notion of localizability for a physical system
corresponding position observable are uniquely de-              are properties which depend only on the transforma-
termined by relativistic kinematics when they exist             tion law of the system under the Euclidean group,
at all. Whether, in fact, the position of such a particle       i.e., the group of all space translations and rotations.
is observable in the sense of the quantum theory of             The analysis of localizability in the Lorentz and
measurement is, of course, a much deeper problem;               Galilei invariant cases is then just a matter of dis-
that probably can only be decided within the context            cussing what representations of the Euclidean group
of a specific consequent dynamical theory of parti-             can arise there. To obtain uniqueness, one must add
cles. All investigations of localizability for relativistic     invariance under time inversion and an analogy of
particles up to now, including the present one, must            Newton and Wigner's regularity assumption. As
be regarded as preliminary from this point of view:             would be expected, all the results obtained earlier in
They construct position observables consistent with             the old formulation come out. One can ask what is
a·given transformation law. It remains to construct             the point of the present extended footnote to Newton
complete dynamical theories consistent with a given             and Wigner's paper. First, it seems worthwhile to me
transformation law and then to investigate whether              to have a math~matically rigorous proof of the
the position observables are indeed observable with             fundamental result of Newton and Wigner that a
the apparatus that the dynamical theories them-                 single photon is not localizable. Second, the work of
selves predict.                                                 Newton and Wigner can be regarded as a contribu-
    In Newton and Wigner's formulation, the set S.              tion to the general problem of determining what
of states localized at a point a of the three-dimen-            physical characteristics of a quantum mechanical
                                                                system are consistent with a given relativistic
                                                                transformation law. In this connection, it is inter-
  1 T. D. Newton and E. P. Wigner, Revs. Modern Phys. 21,
400 (1949).
                                                                esting to regard the axioms I ... V below for localiza-
   2 L. Foldy and S. Wouthuysen, Phrs. Rev. 78, 29 (1950).      bility in a region as a very special case of the notion
This paper was widely read because 0 its exceptional clarity.   of particle observables for a quantum theory. Else-
The mean position operators themselves were discussed before
by A. Papapetrou, Acad. Athens 14, 540 (1939); R. Becker,       where' I gave a set of axioms for the notion of a
Gott. Nach. p. 39 (1945); and M. H. L. Pryce, Proc. Roy. Soc.   particle interpretation which yield I ... V when
(London) A150, 166 (1935); A195, 62 (1948). For further
references and discussion see A. S. Wightman and S. Schweber,   specialized to the case of a single particle. One of the
Phys. Rev. 98, 812 (1955).                                      main reasons for giving full mathematical detail in
  'E. Inonii and E. Wigner, Nuovo cimento 9, 705 (1952).
The main point of this paper is that laws of transformation
of the states of a particle under the inhomogeneous Galilei
group other than those in the ordinary Schrodinger mechanics      4 See, Les problbMs mathbruJtiques de la theoTie quantique
are inconsistent with localizability.                           des champs, (CNRS, Paris, 1959), especially pages 36-38.
404                                                                                                       CHAFfER VIII
the present simple case is in preparation for the prob-         III. E(S, uS.) = E(S,)        + E(S.) -      E(S, n S.).
lem of determining particle interpretations.
  It turns out that the natural mathematical tool for                II S;,i = 1, 2, ... are disjoint Borel sets then
the analysis of localizability as understood here is the
theory of imprimitive representations of the Euclid-
                                                                     E(U S;) =       L;-' E(S;).
ean group. The notion of imprimitivity was intro-               IV. E(R3 )    = 1.
duced for finite groups early in the history of group
theory. It was generalized to the case of a large class         V~   E(RS    + a)    =   U(a,R)E(S)U(a,R)-',
of topological groups by Mackey.' From a mathe-
matical point of view, the present paper merely                  where RS + a is the set obtained from S by carrying
writes out Mackey's theory in detail for the case of            out the rotation R followed by the translation a, and
the Euclidean group. However, I decided to make                  U(a,R) is the unitary operator whose application
the exposition as self-contained as possible, and to            yields the wave function rotated by R and translated
incorporate certain elegant ideas of Loomis in the               bya.
proofs.· The purpose of this expository account is to              The notation S, n S. and S, u S. is used to
make it possible for the reader to understand how               indicate the common part and union, respectively,
the mathematical arguments go for the Euclidean                 of the sets S, and S.. U S; is the union of the
group without having to work through the general                sets 8..
case, however character building that experience                   The physical significance of these axioms is as
might be.                                                       follows.
                                                                   The Borel sets form the smallest family of sets
2. MATHEMATICAL FORMULATION OF THE AXIOMS                       which includes cubes and is closed under the opera-
    AND PRELIMINARY HEURISTIC DISCUSSION                        tions of forming complements and denumerable
                                                                unions. One might try to replace the Borel sets by
   The axioms are formulated in terms of projection             all sets obtained by forming complements and finite
operators E(S), where S is some subset of Euclidean             unions starting from cubes and require III only for
space at a given time. The E(S) are supposed to be              finite sums. However, it can be shown that any such
observables. They must be projection operators be-              E(S) could be extended to one defined on the Borel
cause they are supposed to describe a prope:rty of the          sets and satisfying III as it stands. (See Appendix I
system, the property of being localized in S. That              for further discussion of this point.) In fact, E(S)
is, if of> is a vector in a separable Hilbert space, x,         can be extended even further to all Lebesgue measur-
describing a state in which the system lies in S, then          able sets, but this extension will not be needed here. 8
E(S)of> = of>. If the system does not lie in S then                II states that a system which is in both S, and S.
E(S)if> = O. E(S) can therefore only have proper                is in S, n S •. It is immediately clear from II th"\t
values one or zero and, as an observable, must be               E(S,)E(S.) = E(S.)E(Sl).
self-adjoint. Thus, it is a projection operator.'                  III states that the set of states of the system for
   The axioms are:                                              which it is localized in S, u S. is the closed linear
I. For every Borel set, S, of three-dimensional                 manifold spanned by the states localized in S, and
Euclidean space, R3, there is a projection operator             those localized in S2.
E(S) whose expectation value is the probability of                 IV says that the system has probability one of
finding the system in S.                                        being somewhere.
 II. E(S, n S.)    =   E(S,)E(S.).                                 V' says that if of> is a state in which the system is
                                                                localized in S, then U(a,R) of> is a state in which the
  • G. W. Mackey, Proc. Nat!. Acad. Sci. U.S. 35, 537 (1949);   system is localized in RS + a.
Ann. Math. 55, 101 (1952); 58, 193 (1953); Acta Math. 99,
265 (1958). That Mackey's theory applies to localizability in      I venture to say that any notion of localizability
quantum mechanics was independently realized by Mackey          in three-dimensional space which does not satisfy
himself. I thank Professor Mackey for correspondence on the
subject. Mackey's treatment is summarized in his Colloquium     I . . . V' will represent a radical departure from
Lectures to the American Mathematical Society, Stillwater,      present physical ideas.
Oklahoma Aug. 29-Sept. 1, 1961. It is a part of a coherent
axiomatic treatment of quantum mechanics given in his un-          The E(S) define a set of commuting coordinate
published Harvard lectures 1960-61.
  6 L. H. Loomis, Duke Math. J. 27, 569 (1960).
  7 For a general discussion of observable. describing a
property see J. von Neumann, Mathematical Foundntitm. of          8 An argument that the Lebesgue measurable set. form a
Quantum Mechanics (Princeton University Press, Princeton,       physically natural class is contained in J. von Neumann, Ann.
New Jersey, 1955), pp. 247-254.                                 Math. 33, 595 (1932).
LOCALIZATION PROBLEMS                                                                                                        405
848 A. S. WIGHTMAN
operators q"q.,q. which form a vector in 3-space. In                   write A,a,. This will be done throughout the follow-
fact,                                                                  ing. Thus, V' is replaced by
T
    , [0 1J
     =   1 0 '        T' =   [? -iJo
                                 ,            ,T
                                                  3    =   [1 OJ
                                                           0   -1 .
                                                                       mere change of names M can be replaced by the
                                                                       space of left cosets, usually denoted GIG•. In the more
The mUltiplication law of 8, is                                        general case of a nontransitive system the space M
                                                                       will split into orbits and the points of an orbit can
         {a,A,) {a"A,}       =       {a,   + A,a.,A,A,} .              be labeled by the points of GIG. where x is any point
Here, for brevity, instead of writing R(A,)a. we                       of the orbit.
                                                                          In the problem of localizability considered here,
                                                                       the system of imprimitivity is transitive but for
  9 The argument (originally due to E. P. Wigner) is outlined          momentum observables and particle observables, in
in Disp<TSicm Relaticms and Ekmentary PaTticles (John Wiley
& Sons, Inc., 1961), pp. 176-18l.                                      general, the system of imprimitivity is not transitive.
  10 The argument (originally due to E. P. Wigner for the
rotation group and Poincare group) is given for the Euclidean
                                                                          Mackey's theory shows that the transitive system
group in V. Bargmann, Ann. Math. 59, 1 (1954).                         of imprimitivity and its associated representation
406                                                                                                            CHAPTER VIII
can be brought into a standard form by a suitably         To obtain V, one may note first that the operator
chosen unitary transformation, V:                         T(a) defined by
if and only if the unitary representations of G. are Clearly, the kind of nonuniqueness appearing in this
equivalent.                                              example may be expected to be absent only when one
   Detailed proofs of these assertions of Mackey's is dealing with a single particle. Theorem 4 obtained
theory for the special case of 8. will be offered in the below gives a precise criterion for uniqueness and a
following sections. For the moment, the results will parametrization of the possible answers when more
be taken for granted and used to discuss the unique- than one exists.
ness of E(S) for given U(g). Clearly, for U(g) given        The uniqueness of the notion of localizability for
the only unitary transformations, V, which can give given representation of the Euclidean group has
new VE(S) V-' -F- E(S) are ones which commute been discussed assuming Mackey's theory. Now I
with the given U(g) but not with the E(S).               attempt to give an intuitive idea of the circumstances
   That this possibility is actually realized in simple in which a notion of localizability exists.
physical examples can be seen by considering a com-         Since all the E(S) commute, diagonalize them.
pound system of two free spinless Schriidinger parti- Then the state vectors are represented by quantities
cles with wave function ",(x"x,). Let the correspond- <I>(x) defined on space and with a number of compo-
ing representation of the Euclidean group be U(a,R): nents which may vary with x. [In fact, these 4>(x)
                                                         for x = a are just Newton and Wigner's linear mani-
         ",(x"x,) -> (U(a,R)",)(x"x,)
                                                         fold S •. ] In this realization the scalar product of
            = ",(R-'(x, - a),R-'(X2 -           a» .     two vectors 4> and'll is
 Define the operators XCa) by
                                                                         (<1>,'1') =   !   dx(<I>(x),'1'(x» ,
               X Ca ) = ax~P   + (1   - a)x~P
                                                          where the scalar product appearing under the
where a is any real number, and by definition             integral sign is in the components of 4>(x) and ,y(x)
                                                          for fixed x. The operators E(S) take the form
               (x?4»(y"y,) = y,4>(y"y,)
                                 Y24>(y"y,) .                            (E(S)4>)(x) = x.(x)4>(x) ,
                                                                                                  °
               (x~P4»(y"Y2) =
Then, for each a, XCa) defines a possible position        where x.(x) = 1 if xES,          if x $ S. From the
operator (the spectral representation of x/a), j = 1,     transformation law of E(S) it is plausible that by a
2, 3 yields the projections appearing in (2.1), and       suitable choice of basis it can be arranged that
the general E(S) can be found from these). In par-                       (U(a,l)4>)(x)        =    4>(x - a) .
ticular, X CO) = X~P and XC') = xlP, are possible posi-
tion operators.                                           From this equation, it follows that the number of
   Now there exists a unitary operator, V, which          components of 4>(x) is the same for all x. It is also
commutes with the representation of the Euclidean         plausible that by a suitable choice of basis the
group                                                     transformation law under rotation can be made to
                                                          look the same for each x:
                    [V,U(a,R)l_ = 0
and carries X(a) into X(~)                                           (U(O,A)4»(x) = ~(A)4>(A -'x),
850 A. S. WIGHTMAN
each point. Once these results are accepted, one can            quence of this simple kinematical fact." For spin-O,
pass by Fourier transform to momentum space                     (iii) is satisfied and so the phonon is localizable. l'
amplitudes. There one has                                       It is an oddity that the same is not true for Wigner's
                                                                particles of infinite spin, I' as will be seen in Sec. 5,
         (U(a,A)<I»(p) = e-,p'a:n(A)<I>(A-lp)           (2.2)   even though in that case each angular momentum
with the scalar product                                         along p appears just once.
                                                                   There is one paradox to which the preceding dis-
              (<I>;w) =   f    dp(<I>(p),w(p)) .        (2.3)
                                                                cussion might appear to give rise. Suppose one
                                                                describes a photon by a real-valued three-component
                                                                field B(x) satisfying
The canonical form (2.2) is to be compared with
                                                                                        divB=O,                          (2.5)
        (U(a,A)<I>)(p) = e-'P'aQ(P,A)<I>(A-lp), (2.4)
                                                                defines a scalar product (this is a real Hilbert space)
where                                                           by
           Q(P,A)Q(A -lp,B) = Q(p,AB) ,
and the scalar product is
            (<I>,w)   =   f   dp.(p)(<I>(p),w(p)) ,
                                                                and a representation of the Euclidean group
                                                                          (U(a,R)B)(x) = RB(R-l(x - a)) .
a form which will be derived in Sec. 3.                         Attempt to define projection operators by the equa-
   The comparison shows:                                        tion
   (i) When the representation is in the canonical
form (2.4) the measure dp.(P) on momentum space is                             (E(S)B)(x) = xs(x)B(x) .
just Lebesgue measure dp.                                       Why does not this describe the photon as a localiza-
   (ii) The dimension of the vectors <I>(P) is the same         ble system? The answer is that the E(S) carry vectors
for all p.                                                      satisfying the condition (2.5) into vectors which do
   (iii) The operators Q(p,A) are of the form :n(A),            not satisfy it, so E(S) is not a well-defined operator
where A ----> :n(A) is a representation of the unitary          in the manifold of states and the x in B(x) has noth-
unimodular group.                                               ing to do with localizability.
   Intuitively (i) and (ii) are accounted for because,             The notion of localizability discussed here is con-
if one makes any state whose x dependence is a                  cerned with states localized in space at a given time.
a function one gets all momenta. Thus, one would                It is natural to inquire whether there exists a corre-
expect to have the same number of linearly inde-                sponding property in space-time. Then the E(S)
pendent states for each p. (iii) is essentially a conse-        would satisfy
quence of the rotational invariance of the states
localized at a point.                                                    U(a,A)E(S)U(a,A)-l = E(AS            + a) ,
   All three restrictions are nontrivial if applied to an       where S is a Borel set of space-time and {a,A I is an
arbitrary representation of 83 • However, as will be            inhomogeneous Lorentz transformation of space-
seen in Sees. 5 and 6, (i) and (ii) are always satisfied        time translation, a, and homogeneous Lorentz trans-
in any relativistic theory (provided one leaves out             formation, A. However, a requirement analogous to
the vacuum state). (iii) excludes a very important              (i) follows from Mackey's theory: All four-momenta
physical system, the single photon. One can see this            must occur in the theory. This is in flat violation of
immediately by looking at the Q(p,A) for those A                the physical requirement that there be a lowest
which leave p invariant. Such Q's have two eigen-
                                                                   11 That the photon was nonlocalizable was stated and be-
vectors corresponding to right-circularly and left-             lieved long before reference 1 was written. See, for example,
circularly polarized photons having angular mo-                 L. Landau and R. Peierls Z. Physik 62, 188 (1930); 69, 56
mentum along p, ± h, respectively. On the other                 (1931); especially p. 67 of the latter. While the arguments
                                                                given could possibly be regarded as plausible, they do not make
hand, in :n(A) one cannot have states with angular              clear what is the heart of the problem.
                                                                   12 If the neutrino had turned out to possess states of both
momentum ± h along p without also having states                 helicities, i.e., states with components ±!n. of the component
with zero component of angular momentum along p.                of angular momentum along p, then it too would be localizable.
The nonlocalizability of the photon (and all other              A neutrino of definite helicity is not localizable.
                                                                   13 E. P. Wigner, Ann. Math. 40, 149 (1939); Z. Physik 124,
particles of spin >! and mass zero) is a conse-                 665 (1947-8).
408                                                                                                            CHAPrER VIII
energy state. Thus, a sensible notion of localizability               of the three-dimensional translation group            :r,   is
in space-time does not exist.                                         unitary equivalent to one of the following form:
852 A. S. WIGHTMAN
J~:dl'(P)XA' where dI'A(P) = dl'(Ap). The unitary                   Since Q(A) is unitary Q(p,A) must be unitary for
equivalence criterion given in Theorem 1 then                       almost all p. Furthermore, the group multiplication
implies                                                             law implies
                             I' ==   P.A                   (3.5)          Q(A)T(A)Q(B)T(B) = Q(AB)T(AB) ,
pep) = p(Ap) for all p except possibly on a set of p.               which yields
measure zero.                                  (3.6)
                                                                               Q(p,A)Q(A-'p,B) = Q(p,AB)                     (3.10)
  Now in Appendix 2, it is shown that the only
measures on      :t:
                 satisfying (3.5) are equivalent to                 for each A and B and almost all p.
                                                                       At this point a measure-theoretic technicality
ones of the form
                                                                    arises. It is possible a priori, that the set of measure
                  p.o5(p)   + dp(ipi)d",(p) ,              (3.7)    zero on which (3.10) does not hold could depend on
where P.o ;;. 0, dw(p) is the area on the sphere of                 A and B in such a way that when one took the union
radius Ipi and dp is a measure on the positive real                 over all such sets one would get a set of measure
                   ==
axis. Since, if p. P.I, the unitary mapping                         greater than zero. Actually, one can show that one
                                                                    can alter Q(p,A) on a set of measure zero in p so
              (W<p) (p) = <p(p) [ dp.(p)JI/2                        that Q(A) is unaffected, but (3.10) holds for all
                                           dp.,(p)                  p,A,B and Q(P,A) is measurable in both variables.
carries the direct integral J~:dp.(P)Xp into                        This argument is deferred to Appendix IV, because
J~:dl"(P)X" one may for convenience choose p. in                    of its technical character. The result will be assumed
the form (3.7)." Later on I' will be taken in this form             in what follows.
but for the moment a general p. satisfying (3.5) will                  The representation has now been reduced to the
be carried along. Furthermore since any two Hilbert                 standard form
space of the same dimension can be mapped on one                    (U(a,A)<p)(p) = e-,p'aQ(p,A)
another by unitary transformation, there is no loss
in generality in taking X. = XAP for all A.                                            X <p(A -1     )   [dl'(A -l p )JI/2   (3.11)
   The next task is to put the operators U(O,A) in                                                  P       dp.(p)
standard form. They will be written as a product                       To understand the physical meaning of the Q(p,A)
U(O,A) = Q(A)T(A) where T(A) is defined by                          it is helpful to consider some elementary examples.
                                                                    For a single free particle in Schr6dinger theory, the
       (T(A)<p)(p) = <p(A-'p) [dp.(A- l p)JI 1 2                    wave function may be taken as a complex-valued
                                 dp.(p)
                                                                    function of p, the scalar product is
                                                                                               f
(Here the convention X. = X Ap has made it possible
to equate vectors from two different Hilbert spaces.)                              (<p,'Y) =       dp <p(p)*'Y(p)            (3.12)
It is easy to verify that T(A) is unitary, with an
adjoint given by                                                    and the representation of the Euclidean group is
 by unitary transformation of U(a,A) to a form              in [pi such that for all other p and all A, (3.17) holds.
 independent of p one can consider the case of a single       Next, it will be shown that, if there exists a V(P)
 photon described in Sec. 5. [One of the results of         for a single p which satisfies
 section 4 is that for a localizable system Q(p,A) can
 always be chosen independent of p.] Clearly, in all                       Ql(P,A) = V(P)Q.(p,A)V(p)*            (3.18)
 these examples the Q(p,A) gives the transformation         for all A in the little group of p, then V(q) can be
 law of the internal degrees of freedom of the system       extended to all q with [q[ = [pi so that (3.17) holds.
 under rotations.                                           (The statement holds trivially for p = 0 so p ~ 0 is
   A detailed analysis of the consequences of the           assumed.) Solved for V(A-lp), (3.17) reads
 multiplication law of the Q's, Eq. (3.10), will be
 undertaken shortly. For the moment, only the fact                    V(A-lp) = Ql(p,A)-lV(p)Q,(p,A).            (3.19)
 that for those A which satisfy Ap = p, (3.10)              This will be consistent as a definition of V at A -lp
 implies                                                    only if the right-hand side is constant on right cosets
                                                            of the little group of p, i.e., only if Al' = A.lA s'
              Q(P,A)Q(p,B) = Q(p,AB)              (3.14)    with A,l in the little group of p implies that the
 is needed. Such A form a group called the litae grQUp      right-hand side of (3.19) takes the same value for
 of p, and (3.14) means that A -> Q(p,A) defines a          A = Al and A,:
 continuous unitary representation of the little group
 of p. (Again see Appendix IV for a proof that every         Ql (p,Al)-lV(p)Q.(p,A ,)
 measurable unitary representation is continuous.)             =    [Ql (P,A.)Ql (A;lp,A.WlV(p)
 Evidently, when p = 0 the little group of p is the
                                                                    X [Q,(P,A.)Q.(A;lp,A.)]
 group of all A, i.e., the unitary unimodular group
 itself. On the other hand, when p ~ 0, the little             =    Ql (P,A,) -l[Ql (p,A.) -IV (P) Q,(P,A 3 )]Q. (P,A.)
 group is the two sheeted covering group of the group           =   Ql(p,A,)-lV(p)Q.(p,A.) .
 of rotations around a fixed axis. It is therefore
                                                            This defines V(q) for all q with [q[ = [pl. Next, it
 isomorphic to the multiplicative group of the com-
                                                            has to be verified that V so defined satisfies
 plex numbers e"/', 0 '" 8 < 471'.
    The problem of determining when two representa-                  Ql(q, A) = V(q)Q,(q,A)V(A-lq)-l. (3.20)
 tions of &. are unitary equivalent can now be reduced      Suppose that q      =   B-lp. Then, the right-hand side
 to a related problem for their Q(p,A). For, if {a,A}       of (3.20) is
 -> Ul(a,A) and {a,A}-> U.(a,A) are equivalent
 representations, Theorem 1 implies PI == P. and                    [Ql (p,B) -1 V (p) Q. (p,B)] Q. (B-lp,A)
 " =" almost everywhere. Thus, by a unitary                                 X [Q,(p,BA)-lV(p)Q.(p,BA)r l
 transformation one can bring Ul(a,A) into a form
 where Ul(a,l) = U.(a,1). Then Ul and U. differ                        =    Ql(P,B)-lQl(P,BA) = Ql(q,A) ,
 only in their Q(P,A). If                                   where, in the last step, the identity Ql (p,B)-l
                                                            = Ql(B-lp,B-l) which follows from (3.10), has been
               Ul(a,A) = VU.(a,A) V-I ,           (3.15)
                                                            used.
 where V is a unitary operator, then, applying                Therefore, a necessary and sufficient condition
 Theorem 1, one finds that V is of the form                 that Ul be unitary equivalent to U, is PI = P" '1
                                                             = " almost everywhere and the representations of
                 (V<J»(P) = V(P)<J>(p)            (3.16)
                                                            the little groups A -> Ql(p,A), A -> Q,(p,A) be
 and (3.15) reduces to                                      unitary equivalent for almost all [pi and at least one
                                                            p for each [pl.
          Ql(p,A) = V(P)Q.(p,A)V(A-lp)-l. (3.17)
                                                              Incidentally, in the course of the argument, it has
 If p rather than A -lp occurred in the last factor, this   been established that the little groups for p and q
 would describe unitary equivalence of Ql(P,A) and          have unitary equivalent representations if [pi = [q[.
 Q.(p,A). When A belongs to the little group of p,          Explicitly, if q = Bp and Aq = q, then B-lABp = P
 A -lp = P and that is indeed the case.                     and
    Again at this point a measure-theoretic technicality
 arises. Equation (3.17) holds for almost all p, for        Q(q,A) = Q(p,B-l)-lQ(p,B-lAB)Q(p,B-l ).              (3.21)
 each A. Again the reader is referred to Appendix IV        The mapping A -> B-lAB is an isomorphism be-
 for a proof that there is a fixed set of measure zero      tween the little groups of q and p and (3.21) displays
LOCALIZATION PROBLEMS                                                                                                        411
854 A. S. WIGHTMAN
the unitary equivalence of the corresponding repre-             and almost all Ipl
sentations.
   The classification of the unitary inequivalent                     no/') =   noP)   for all j   =       0, " 1, j,' ...
representations of the little groups is well known.                Any Euclidean invariant theory has a manifold
For p = 0, they are labeled by giving an intf~r                of states whose transformation law is unitary equiva-
valued multiplicity function no; for j = 0, + "                 lent to one of this form. It is to be expected (and may
l,j,' . '. no; is the number of times the irreducible           be seen in detail from the discussion of Sees. 6 and 7)
      °
representation of angular momentum j appears. For
p rf. the unitary inequivalent representations are
labeled by an integer or       +
                              infinity valued function,
                                                                that the imposition of requirements of relativistic
                                                                invariance will eliminate some of these representa-
                                                                tions.
n"".,m = 0, ± t, ± 1" ., where npm is the number                   Up to this point, the only assumption that has
of times the one-dimensional irreducible representa-            been made about the quantum mechanical system
tion </> -> e'm. occurs.                                        under consideration is its invariance under the
   All these results are collected in Theorem 2.                Euclidean group. Now the operation of time in-
                                                                version I, will be adjoined. It is well known that I,
   Theorem 2. Every continuous unitary representa-              has to be represented by an antiunitary operator,
tion of ea, the universal covering group of the                 U(I,), whose square is w(I,) = ± 1, and that by
Euclidean group, is unitary equivalent to one of the            suitable choice of phase it can be arranged that ' •
following form.
   Let Xp be a family of Hilbert spaces, one for each             U(I,)U(a,A)U(I,)-' = U(a,A)
pE    :r:
                    r
        identical for all p with the same Ipl. Let                U(a,A)U(l,) = U({a,A}I,)
Here K stands for complex conjugation. Only a spe-         (b), (c) also satisfies (a), (b), (c) so these functions
cial case will be considered here, namely, that in         form a vector space. If a scalar product of eI> and w
which Q(P,A) = :D(A) where A -> :D(A) is a continu-        is defined
                                                                                    f
ous unitary representation of the unimodular group.
As will be shown in the next section, for localizable                   (eI>,w) =       da (eI> (a,A) w(a,A»
systems this can always be arranged. A second spe-
cialization will be made. Only time inversion trans-       the vector space becomes a Hilbert space :IC.'. The
formation laws for which                                   representation [jD is defined in :IC by
will be considered. This amounts to considering the        This representation possesses a transitive system of
case of ordinary type." Time inversion invariance          imprimitivity defined by
will be used only to get Theorem 4 on the unique-                       (E(S)eI>)(a,A) = xs(a)eI>(a,A)
ness of the position observables.
                                                           defined for Borel sets S of :to where, as usual, Xs is
                                                           the characteristic function of S: xs(a) = 1 if a
 4. REPRESENTATIONS OF 8. WHICH POSSESS A
                                                           E S, 0 if a Et: S. It is easy to verify using (4.2) that
     TRANSITIVE SYSTEM OF IMPRIMITIVITY'
                                                           the E(S) transform correctly under U(a,A), i.e.,
   The discussion of this section is in three parts.       satisfy V.
First, Mackey's standard form of an imprimitive               Because of the smooth fashion in which A acts
representation is given and shown to be equivalent,        on a this representation can be put in a simpler form.
in the special case at hand, to a simpler form which       If, for the moment, attention is restricted to con-
will be more convenient for present purposes. Second,      tinuous functions eI>(a,A), Eq. (4.1) can be used to
for a given imprimitive representation a unitary           write
transformation is found which brings it into Mackey's
form. Third, the unitary transformations which                             eI>(a,A) = :D (A) eI>(A -'a,l)            (4.3)
commute with an imprimitive representation U(a,A)          which expresses eI>(a,A) for general values of A in
but not with its system of imprimitivity E(S) are          terms of its value for A = 1. Conversely, given any
parametrized. This yields a parametrization of the         continuous function eI>(a) with values in :IC(:D), one
nonuniqueness in the definition of a position opera-       can define a continuous eI>(a,A) by (4.3) and it will
tor.                                                       then satisfy (4.1). The scalar product of two such
   Suppose there is given a continuous unitary             eI>(a) and w(a), Jda(q;(a),'l'(a» is equal to that of
representation A -> :D(A) of the 2 X 2 unitary             the corresponding eI>(a,A),w(a,A) so the one to one
unimodular group in a Hilbert space :IC(:D). Then          correspondence can be extended by continuity to a
the representation of 8, iruiuced by :D(A) is denoted      unitary mapping between the Hilbert space :IC and
[jD and constructed as follows. Consider functions         the Hilbert space of the measurable square integrable
eI>(a,A) which are defined on 83, whose values lie         eI>(a).
in :IC(:D), and which satisfy                                 The representation (4.2) determines a correspond-
   (a) (eI>(a,A),x) is a measurable function of .Ia,A},    ing representation on the eI>(a) given by
for every X E :IC(:D). [The indicated scalar product
is in :IC(:D).]                                                       :D(B) (U (a,A)eI» (B-'b)
856 A. S. WIGHTMAN
theory except that there one has -a instead of a on                 since G is a group. The positivity of the quadratic
the right-hand side. That just means that one uses as               form then implies that the determinant of its matrix
representative of the function cI>( - b) instead of                 is positive, i.e.,
cI>(b). This will be done from this point on. Thus, in
the present context, Mackey's form of the imprimi-
                                                                                               1<p(g)1   < <p(e) .
tive representation induced by ~ may be taken as                    Any unitary representation of G, g --t U(g), yields
                                                                    examples of positive definite functions'·
        (U(a,A)cI>)(b) = ~(A)cI>(A-'(b - a))               (4.4)
                                                                                          <p(g) = (tJ.>,u(g)cI»
            (E(S)cI>)(b) = xs(b)cI>(b)                     (4.5)
                                                                    because, in this case,
with the scalar product
                                                                            I: a,*ak<P(g,'g.)      =     III: a,U(g,)cI>ll':> O.
                 (cI>,'l1)   =   Jdb (cI>(b),'l1(b)) .      (4.6)   If the representation is continuous then <p(g) is con-
                                                                    tinuous.
   Now, the second step of the argument is under-                      Conversely, given a continuous positive definite
taken; it is to be shown that for each pair consisting              function one can construct a continuous representa-
of a continuous unitary representation {a,A I                       tion of G. Let r, s be complex-valued functions on G
--t U(a,A) and a system of imprimitivity E(S), there                which are different from zero only at a finite number
exists a unitary operator V such that VU(a,A)V-'                    of points. (Such functions form a vector space.)
and VE(S)V-' are of the form (4.4) and (4.5), re-                   Introduce the form
spectively. Available to show this are several lines of
argument, not one of them trivial. Here the elegant                                (r,s) =       I: reg) <p(g -'h)s(h)             (4.9)
                                                                                                (J,hEG
proof of Loomis" will be written out for the present
                                                                    (r,s) is sesqui-linear, i.e.,
simple case.
   The first step in the argument is to express the                 (r,s!   + S2) =   (r,s,)   + (r,s,) ,      (r,as) = a(r,s) (4.10)
problem in terms of certain complex-valued functions
defined on the group. This is quite analogous to the                        (r,s) = (s,r)                                      (4.11)
study of general unitary representations in terms of                by virtue of (4.8), and
positive definite functions on the group. To motivate
Loomis' method, a brief sketch will first be given of                                            (r,r)    :> O.                (4.12)
the relation of positive definite functions and repre-              Now it may happen that there are some r for which
sentations.                                                                                      (r,r) = O.
   A function <p defined on a group G is positive
definite if for each n = 1,2" .. and all complex                    If so, it is easy to see that they form a linear subspace
numbers a, ... Un and g, ... gn E G                                 and the components orthogonal to this linear sub-
                                                                    space form a vector space on which (r,s) again
                                                                    satisfies (4.10), (4.11), and (4.12) but, in addition,
                                                                    (r,r) = 0 implies r = O. This space mayor may not
Clearly, taking n = 1, one gets                                     be complete. If not, complete it and get a Hilbert
                                                                    space H•. To get a continuous representation of G in
                                 <p(e):> O.                 (4.7)   H., define, first on functions with only a finite num-
For n   =   2,                                                      ber of values different from zero,
  ,. See reference 6. One of the main virtues of Loomis' treat-     ce:;:r~~~v~:.,~:. f~fc~:h:.~re .!"dd ~J:~l-R~~o~ }~~
ment is that it applies to nonseparable HilbertsJlaces. Since       Abelian and locally-eompact groups, respectively. A system-
separability is assumed here th,s advantage will not be ap-         atic account of their properties is found in R. Godement, Trans.
parent.                                                             Am. Math. Soc. 63, 1 (1948).
414                                                                                                      CHAPfER VIII
 so the subspace of those r for which (r,r) = 0 is left        is a cyclic vector for the representation defined above.
 invariant by U(g). Therefore, so is its orthogonal            Thus, what has been established in the preceding
 complement. Because U(g) is therefore defined and             paragraphs is that all cyclic representations are
 continuous on a dense subset of H. it can be extended         unitary equivalent to those of the form (4.9) and
 by continuity to be a unitary operator in H•. Clearly,        (4.13). Since any representation can be written as a
 on the original functions                                     direct sum of cyclic representations, it suffices for
                                                               many purposes to study cyclic representations.
                 U(gdU(g,) = U(g,g,) ,                            In the present case, there is a system of imprimi-
 so by continuity, g -> U(g) defines a representation.         tivity E(S) in addition to the group representation
 To prove U(g) is continuous in g, consider                    U (g) so one has to consider cyclic vectors and repre-
                                                               sentations of E(S) and U(g) together. This suggests
 II(Ug) - U(g'))rll' = II(U(g-'g') - 1)rll'                    studying the function (E(S)cf!,U(g)cf!) = 'Po (g) and
                          = 2[(r,r) - Re (U(g-'g')r,r)].       using it to construct a pair unitary equivalent to
                                                                {E(S),U(g) I and in Mackey's form.
 Clearly, this equation implies that it suffices to               Now return to the special case of 8•. When the
 verify (U(g)r,r) is continuous in g at g = e for all
                                                               representation and system of imprimitivity is in
 r E H•. For r of the special kind appearing in (4.9),         Mackey's form (4.4) and (4.5), the function 'Po(a,A)
 which only take values different from zero at a finite        is
 number of points the continuity is easy to verify:
                      =   L •.• r(h)*'P(gh)-'k)r(k) ,
 which clearly converges to (r,r) as g -> e because 'P
                                                                              =   f db(cf!(b),~(A)cf!(A-'(b
                                                                                   o
                                                                                                              - a))) (4.14)
 is continuous and there is only a finite number of            The next task is to show that 'Po(a,A) has a form
 terms in the sum. For a general r, there always exists        closely related to this for any representation and
 an 8 of the above form so that Ilr - 811 < ./3. By            transitive system of imprimitivity.
 the above argument a neighborhood of e can be                     Before the discussion can begin a preliminary re-
 found so that II U(g)s - sll < ./3. Then                      mark is necessary. Extensive use is going to be made
                                                               of the part of the Radon-Nikodym theorem which
 IIU(g)r -    Til <   IIU(g)r - U(g)sll   + IIU(g)s - sll      says that if, for two measures p" and p", p" (S) = 0
      + lis - rll,                                             implies P,.(S) = 0, then there exists a measurable
                                                               function p(x) such that dp.(x) = p(x)dp, (x). To make
 which completes the proof that g U(g) is a continuous         these applications it is essential to know that E(S)
 unitary representation of G.                                   = 0 for all Borel sets S of Lebesgue measure zero.
    Actually, if the continuous positive definite func-        To obtain this result, it is convenient to use the fact
 tion from which one starts is of the form (cf!, V(g)cf!),     that the E(S) possess a separating vector, i.e., a vector
 the representation constructed by the above process           cf! such that E(S)cf! = 0 implies E(S) = O. Although
 will be closely related to V itself. For, if the subspace     this is a standard result" a proof will be outlined.
 (of the Hilbert space X in which 'P lies) spanned by          Choose an arbitrary unit vector cf!" and let X, be
 vectors of the form V(g)cf! is denoted X, the con-            the subspace spanned by the E(S)cf!,. Choose a unit
 structed representation as unitary equivalent to the          vector cf!. orthogonal to X, and let X, be the sub-
 restriction of V to X. The required unitary equiva-           space spanned by the E(S) cf!•. Continuing in this
 lence is obtained by making ~ r(g)V(g) correspond             way one gets a family of orthogonal subspaces such
 to r, for r differing from zero only at a finite number       that X is the direct sum of the X, and cf!, is a cyclic
 of points. Equation (4.9) is just arranged to make            unit vector for X,. Take as separating vector cf!
 scalar products correspond. Clearly (U(g)r) corre-             = ~.2-·cf!•. Clearly, if E(T)cf! = 0 then E(T)cf!, = 0
 sponds to V(g) ~ r(h) V(h) cf!. The correspondence            for all i. Consequently, E(T) yields zero when
 can be extended by continuity to yield the required           applied to a dense set of vectors, the linear combina-
 unitary equivalence.                                          tions of the E(S) cf!,. It is therefore zero and cf! is a
    A representation V for which there is a vector cf!         separating vector. Note first that if E(S) = 0, then
 such that the V(g) cf! span the representation space          E(AS + a) = U(a,A)E(S)U(a,A)-' = O. Thus if cf!
 is called cyclic and cf! is then a cyclic vector. Note that
 the function which is one at g = e and zero elsewhere           21   See reference 14, p. 20.
LOCALIZATION PROBLEMS                                                                                                                   415
858 A. S. WIGHTMAN
is a separating vector, (<JI,E(S)<JI) = IIE(S)<JIII' is                 absolute value less than or equal to 1 such that
quasi-invariant under Euclidean transformation, i.e.,
for all {a,A), (<JI,E(S)<JI) = oif and only if (<JI,E{AS
+  a)<JI) = O. Furthermore, (<JI,E(S)<JI) defines a (1-
                                                                        fdadA\08(a,A) =            ffT    s
                                                                                                              dadAdbq(a,A;b)p(b). (4.17)
additive positive measure on the Borel sets S of R'.                    From (4.17), it follows that
Now in Appendix II it is shown that any measure
defined on the Borel sets of R' and quasi-invariant
under translations is equivalent to Lebesgue measure.                                   \Os(a,A) =        f   s
                                                                                                                  dbq(a,A;b)p(b)
That implies in particular that (<JI,E(S) <JI) and there-
fore E(S) = 0 whenever S is a Borel set of measure                      for almost all {a,A} which begins to look like (4.14).
zero. Thus, the Radon-Nikodymn theorem implies                          This completes the first stage of the proof.
that if <JI is any vector there exists a non-negative                     The next stage is the construction of the Hilbert
measurable function p such that                                         space of the <JI(a) which appears in (4:4) ... (4.6).
                                                                        This is done in close analogy with the construction
                        (<JI,E(S)<JI) =   fs
                                               p(b)db          (4.15)
                                                                        carried out in connection with (4.9) but for technical
                                                                        reasons which will appear in the proof it is convenient
                                                                        to consider continuous functions of compact support
p(b) is clearly integrable over all space.                              on &. rather than the functions differing from zero
   This equation can be used to get an expression                       only at a finite number of points, which were used
for \Os(a,A) which is the first step in proving that it                 there. Therefore let f and g be continuous complex-
can always be arranged to have the form (4.14).                         valued functions of compact support on &, and define
Note that
          f•
               \Os(a,A)dadA       <   f
                                      •
                                          dadA   f s
                                                       p(b)db. (4.16)   (E(S)UU)<JI,U(a,A)U(g)<JI) = f dbdB                  f   dcdCf(b,B)*
Then the form in the subspace spanned by the U(f)~ into a dense
                                f f
«U(f)~)(r),(U(g)~)(r)) = dbdB dedC
                                                              set of vectors in the Hilbert space spanned by the
                                                              functions of r: (U(f)~)(r) and preserves scalar prod-
                                                              ucts it can be extended by continuity to become a
  Xf(b + r,B)*g(e + r,C)q({b,Bj-' {e,C},           -B-'b)     unitary transformation V.
  X p( -B-'b)                                        (4.21)     All this discussion is collected in Theorem 3.
is suggested as the scalar product appearing in the
                                                                Theorem 3. Let {a,Aj-- U(a,A) be a continuous
integrand of (4.14).
                                                              unitary representation of s. with a transitive system
   With these definitions, one has
                                                              of imprimitivity, E(S), based on R'. Then there
(E(S)U(f)~,U(a,A)U(g)~)         =   f8
                                         dr
                                                              exists a unitary transformation V, such that
                                                              VU(a,A) V-' = W(a,A) and VE(S) V-' = F(S), .re-
  X    «U(f)~)(r),W(a,A)(U(f)~))(r)),                (4.22)
                                                              spectively, given by
scalar product (4.21) invariant.                                The remaining task of this section is to examine
   Now it has to be verified that (4.21) does indeed          the arbitrariness in the definition of the position
define a scalar product. First note that it is linear in      observable. For this purpose, one can bring the pair
g and conjugate linear in f. Furthermore, because             {E(S),U(a,A) j into the form (4.26) and (4.27), and
(4.20) holds for every Borel set S,                           then determine all unitary operators which commute
   (E(S)U(f)<I>,U(y)<I» = [(E(S)U(y)<I>,U(f)<I»]              with U(a,A) but not with E(S). It is convenient for
                                                              this purpose to rewrite (4.26) in momentum space
and
                                                                      (U(a,A)~)(p) = e-'P··:O(A)~(A-'p).
                   (E(S)U(f)~,U(f)~)     >0
imply                                                         If B is a unitary operator such that [B,U(a,l)] = 0,
                                                              Theorem 1 shows that B can be written in the form
             «U(f)~) (r), (U(g)~)   (r))
                                                                              (B~)(p) = B(p)~(p)
                  = [«U(g)~)(r),(U(f)~)(r))]*        (4.24)
                                                              where B(P) is a unitary operator in XP       =   X. The
and
                                                              commutativity with U(O,A) then implies
              «U(f)~)(r),(U(f)~)(r))          >0     (4.25)
                                                                          B(o):O(A) = :o(A)B(A -'p)             (4.28)
for almost all r. However, since f and yare continuous
and of compact support the integral appearing in r            for almost all p.
is continuous in r. Therefore (4.24) and (4.25) hold             This equation can be discussed along lines familiar
for all r. Now, just as in the case of (4.12), one can        from Sec. 3 and· Appendix IV. For those A which
introduce components of vectors orthogonal to the             satisfy Ap = p, i.e., for A in the little group of p,
subspace for which (4.25) is an equality, and com-            (4.28) reduces to
plete the resulting space to get a Hilbert space X
                                                                            B(p):o(A) = :o(A)B(p) .             (4.29)
the same for each r. A -- !D(A) is then a continuous
unitary representation inX. Since the correspondence          The set of all B(p) satisfying this equation is easy
U(f)~ -- (U(f)~)(r) carries a dense set of vectors            to compute. Supposing them known one gets the
LOCALIZATION PROBLEMS                                                                                                          417
860 A. S. WIGHTMAN
Here the summation over j is over integers if m is                       could be + 1 for some Ipi and -1 for others without
integral and half-odd integers if m is half an odd                       violating either Euclidean or time inversion in-
integer.                                                                 variance. It is here that Newton and Wigner's
  TheB(p) corresponding to a given set {nm},m = 0,                       assumption of regularity has the effect of making
± !, ± 1,.·· is a direct sum of unitary operators                        B a constant and F(S) = E(S). They require (in a
acting in the subspaces of vectors with a definite                       Lorentz invariant theory) that the infinitesimal
value of m, and any such defines a possible B(p).                        Lorentz transformation operators be applicable to
The number of real parameters free in an arbitrary                       localized states in the sense that if <I>,.is a sequence of
nm X n" unitary matrix is n!. so that B(p) contains                      vectors which converge to a state localized at a point
Lm n!. arbitrary real parameters, each of which could                    a, as n -> 00, then lim.~. 11M,,<1>.11/11 <1>.11 < 00.
be a function of Ipl.                                                    Since M."i = 1,2,3 are essentially differentiation op-
  Collecting the information acquired in the preced-                     erators this forces continuity on the momentum space
ing discussion one has Theorem 4.                                        representation of Newton and Wigner's localized
                                                                         (continuum) state. An analogous requirement in the
   Theorem 4. If E(S) is a system of imprimitivity                       present formulation has an analogous consequence.
for the unitary representation {a,A} -> U(a,A) of                        The details are as follows.
03 in the standard form (4.26), (4.27), then all other                      According to (3.14), the transformation law of
systems of imprimitivity consistent with U are given                     states under time inversion is of the form
by
                                                                                      (U(I.)<I>)(p) = ~(/)<I>( -p)*
                F(S) = BE(S)B- 1 ,
where B is a unitary operator given by                                   The requirement that B commute with U(I,) then
                                                                         forces
       (B<I»(p)      = ~(Aq+p)-lB(q)~(Aq+p)<I>(p)               (4.32)
so that
     (F(S)<I»(p)            = ~(Aq.p)-lB(q)-l~(A.+p)                     which is
         X (2 .. -   3 /2   jxs(p - r)ddl(A ••,)-'B(q)-'                            ~(/)~(Aq+ .... )-lB(q)~(Aq+ ....)
                                                            in X,
It is easy to choose p so that (p X q/lpllql'~) r' = 1;
then (4.36) follows. However p is chosen provided
q is along the 3 axis (q X p/lqllpl) '~T' leaves q in-
variant. This proves the second statement.]                 with
   A comparison of these statements with the discus-
sion just before Theorem 4 shows that the effect of                 dp.(p) = p.oo(p)dp     + dp+(m)dQm+(p)
time inversion invariance on the arbitrariness of
B(q) is to reduce the number of arbitrary real
                                                                                           + dp-(m)dQm-(p)
parameters from L n;' to L nm(n m - 1) each of                                             + dp(im)dQ'm(P) ,
which could be on a function of Iql. It is clear that       dQm~(p) =   d p/[m' + p']'/' being the invariant meas-
the position observable will be nonunique as long as        ure on the hyperboloids p' = m', pO ::: 0, respectively.
:D(A) is not irreducible. If :D(A) is irreducible and the   dQ'm(p) is the invariant measure on the hyperboloid
elements of the little group have :D(A) reduced to          p' = -m'. Q(p,A) is unitary and satisfies
diagonal form B(q) is diagonal with diagonal ele-
ments which are real functions of Iql of square 1; the                  Q(p,A)Q(A -'p,B) = Q(p,AB) .
position observable is still not unique. However,
unless B(q) is the constant matrix ±1, the formula
                                                            For the subrepresentations with m'         > 0, Q(p,A)      can
                                                            be chosen in the form
(4.33) will yield discontinuous functions of p. [Take
a compact set S, then the integral in (4.33) is dif-
ferentiable, so discontinuities in the function outside
the integral are discontinuities of (F(S)ol'»(p).] Such     where k = (m,O,O,O) and A,+k is given by
discontinuities will appear at any value of q where
B(lql) jumps so B([q[) must be constant in [qt. It
                                                             A,+k = [2(qO   + m)mr'l2[m1 + q] , q =            qO    + q.~
must be a constant multiple of the identity if it is        and A -> Q(A) is a continuous unitary representa-
LOCALIZATION PROBLEMS                                                                                               419
862 A. S. WIGHTMAN
tion of the unitary unimodular group. For m = 0,               system is localizable if the representation of the little
the Q(P,A) are a direct sum of two parts, the first of         group A -> Q, (k,O,A) is the restriction of a repre-
which contains all the finite spin constituents while          sentation of the unitary unimodular group. This
the second contains all infinite spin constituents. For        happens for the spin-zero case but for no other ir-
both of these (5.1) again holds but k is some standard         reducible representation. For the case of mass zero
light-like vector, say (1,0,0,1), and A ... , is a parame-     and S ,e 0, the representation of the little group is a
trization of the cosets of the little group of k. That         direct integral over irreducible representations which
little group is isomorphic to the two-sheeted covering         are determined by the value of [sf and the represen-
group of the Euclidean group of the plane and                  tation of the little group of the little group, A = ± 1.
A -> Q(A) is a continuous unitary representation of            The representatives of the state vectors, <I>(k,S) can
it. For the finite spin part this representation is trivial    be expanded in Fourier series on the circle [s [
for the "translations" while for the infinite spin part         = const. This corresponds to a decomposition into
it is not. The subspace of the mass zero representa-           irreducible representations of the subgroup of the
tions can be written as a direct integral over two-            unitary unimodular group that leaves k fixed. In case
dimensional S space                                            the little group of the little group is trivially repre-
                                                               sented, each integer angular momentum along k
            3Cp =    J'" dIT(S)3C z,
                                   p       p' = 0,             appears exactly once. In case it is nontrivially repre-
                                                               sented, each half odd integer angular momentum
with the scalar product                                        along k appears twice. Such representations can
                     J J
                                                               never be the restriction of a representation of the
  (<I>,>It)m-o =
where
                      dflo(p)    dIT(S) (<I>(p,S),>It(p,S))
                                                                          °
                                                               full unimodular group. Thus elementary systems
                                                               with S ,e are never localizable. Reducible systems
                                                               are localizable only if each representation [S [ ap-
                                                               pears with infinite mUltiplicity or not at all.
dIT(S) = ITo5(S)ds         + dIT, ([s[)d<p
with S      =   Z,   + iZ, =     Isle"~    and                    Theorem 6. Lorentz invariant systems of m' >
                                                               are always localizable. Their position observables
                                                                                                                      °
(Q(k,A)<I>)(k,S)       =   exp (is.t)Q,(k,S,A)<I>(k,e- i6 S)
                                                               are unique if the systems are elementary, i.e., their
for                                                            representations are irreducible.
          A = [1     +! tee, + ie')'~l                            For m = 0, the only localizable elementary ~;y~tem
                                                               has spin zero. For a reducible system to be localizable
                X [cos 0/2 - i sin (0/2)(k/kobl
                                       °
with ei = 1 = el, e,'e, = = e,·k = e.·k, t = t,
+ it,. Here Q,(k,O,A) may be expressed in terms of a
                                                               it is necessary and sufficient that each irreducible
                                                               representation of infinite spin appear with zero or
                                                               infinite multiplicity, and the finite spin parts con-
representation Q, of the above A leaving k fixed.              tribute states of angular momentum along a fixed
                                                               direction whose multiplicities coincide with those of
      Q, (k,O,A) = Q,[cos 0/2 - i sin (0/2)(k/ko) ."']
                      °
Q,(k,S,A), S ,e may be expressed in terms of a
representation Qof the two element groups A = ± 1,
                                                               the restriction of a representation of the unimodular
                                                               group.
                                                                  The identity representation for which p = can  °
which is the subgroup of those unitary unimodular A            not appear in the transformation law of any localiza-
which leave k and some s, say S, fixed:                        ble system.
           Q,(k,S,A) = Q(Az+z,AAA-'Z"z.)                              6. REPRESENTATIONS OF 8. ARISING
where A is a transformation of the form cos 0/2 - i                      IN GALILEI-INVARIANT SYSTEMS
sin (0/2) (k/kO).~ carrying S, into S.                           Unlike the case of Lorentz invariance where all
   The representations of imaginary mass and null              representations up to a factor are physically equiva-
four-momentum (apart from the identity represen-               lent to representations of the covering group, Galilei
tation) will be ignored here as being irrelevant to the        invariance leads to factors which cannot be got rid of
transformation properties of physical systems.                 by passing to the covering group. However, as Barg-
   Clearly, when {a,A I is restricted to lie in 8., the
subrepresentation which comes from mass is in
precisely the form (2.2) and Theorems 4 and 5 apply
                                                       °       mann showed," one can regard them as true repre-
sentations of a certain extension of the covering                       direct integral over the character group whose ele-
group of the Galilei group. The first task of this sec-                 ments are exp i[qt1 + h-'(ET - p·a)]. The states are
tion is to express this statement in explicit formulas                  then functions <I>(q,p) labeled by integers q and a real
and summarize the classification of the representa-                     four-component p = (E/h,p/h). The scalar product is
                                                                                                   f
tions.
   The Galilei transformations will be denoted (a,r)                                  (<I>,if) =       dp.(q,p) (<I>(q,p),if(q,p))
or in more detail (T,a,v,R) where (O,r) = (O,O,v,R)
(a,l) = (T,a,O,l) and                                                   and
864 A. S. WIGHTMAN
representation takes the form                                        (a). The measure on momentum space is equiva-
                                                                  lent to Lebesgue measure;
    Q(O,q,v,A)<I»(q,n) = e;v'DQI(q,n,A)<I>(q,A -'n) .                (b). The subrepresentation of the little group of
Here n is a two-component vector in the plane                     (O,q) for which the pure Galilei transformations r
perpendicular to q which labels the characters of the              = (v,1) are trivially represented is, for almost all
"translation" subgroup. The scalar product is                     Iql, the restriction to the group of A such that Aq
                           f
                                                                   = q of a fixed representation of the 2 X 2 unitary
         (<I>(q),v(q)) =       du(n)(<I>(q,n),v(q,n)) ,           unimodular group;
                                                                     (c). The subrepresentation of the little group of
where the measure u is equivalent to one of the form              (O,q) for which the pure Galilei transformations are
                                                                  non-trivially represented contains each irreducible
 du(n)    =   u0 8(n)dn + du, (inlld<p, n, + in.   =   Inle;' .   with multiplicity zero or infinity, the same for almost
The little group of the little group is the little group          all [q[.
itself if n = 0, while it is the two-element group:
                                                                                  ACKNOWLEDGMENTS
A = ±1 if n "e 0. In the former case A ...... Q,(q,O,A)
is any continuous unitary representation of the little              A substantial part of this paper was written in
group of q. In the latter case, ±1 ...... Q,(q,n, ± 1) is         1952, when the author was a National Research
any unitary representation of the 2-element group                 Council Post Doctoral Fellow in Copenhagen. He
and the QI of general argument is expressed in terms              thanks Professor Niels Bohr for the hospitality of
of the elements of the little group by                            the Institut for Theoretisk Fysik and Professor Lars
                                                                  Garding for the hospitality of Lunds Matematiska
Q,(q,n,A) = [QI(q,Ro,AD<-D,-')r'                                  Institution. The paper was completed in 1962 with
    X QI (q,Ro,AD+D,AAA-'D<-D.)QI (q,Ro,AA-'D+D. )                the support of the National Science Foundation. The
                                                                  author thanks Professor Robert Oppenheimer for the
  The irreducible representations of the little group
of the little group have either Uo > 0, du, = or Uo
= 0, du,(lnl) = 8(lnl - a)dlnl, for some a> 0. The
                                                        °         hospitality of the Institute for Advanced Study dur-
                                                                  ing the later period.
       °
    Theorem 7. Every Galilei invariant system with                interval is meant a set [a,b) of the form
M  > is localizable.                                              {y; al";; YI < b" a.";; y. < b., a3";; y. < b.j ,
  For M = 0, no elementary system is localizable
866 A. S. WIGHTM AN
ferred to as an algebra of sets because it is closed           If F is real valued it is said to be positively monotonic
under the operations of taking the complement of a             if t.F[a,b) ;;. 0 for all [a,b)" If the values of Fare
set and taking the union of a finite number of sets.           commuting projections the analogous requirement
A cr algebra of sets is one closed under complemen-            is that t.F[a,b) be a projection for all intervals [a,b).
tation and denumerable unions. A projection-valued             Notice that if E(S) is any finitely additive projection
finitely-additive measure on A is a function, E, with          valued measure defined on A, it yields such an F
values which are projections in a Hilbert space X,             from the definition
defined for all sets of A and satisfying II, and
                                                               F(X"X2,X3) = E({Y;Yl          < Xl,Y, < X2,Y3 < X31l . (AI)
 III'    E(S, uS,) = E(S,)         + E(S,) -     E(S, n S,)
                                                               Conversely, the following theorem holds.
for any S" S, E A.                                               Theorem Ai. Let F be a positively monotonic
A projection-valued finitely additive measure that             function defined on R3 with values which are com-
satisfies in addition                                          mutative projections. Suppose
for any sequence of S, E A,i = 1,2,' " such that                                            =   F(Xl,X" -      <XI)   =   O.   (A2)
S, n Si = 0, i rf j and US, E A is called completely           Then there exists a finitely additive projection
additive or cr additive. The precise statement of the          valued measure E on A satisfying (AI).
result of this Appendix is                                        The proof is completely elementary and will be
   Theorem A5. Any finitely-additive projection-               omitted.
valued measure on A which satisfies                               Now consider the increasing sequence of projec-
              E(S   + a)   =    U(a)E(S)U(a)-l
                                                               tions
for some continuous unitary representation of the                     F(Xl - 11k", 'X3 - 11k)            k     =   1,2" ...
translation group a -> U(a) is necessarily completely          It converges to a projection F-(Xl,·· 'X3) which
additive on A. It then possesses a unique completely           mayor may not be F(Xl," ·X3).
additive extension to the (J algebra of all Borel sets             Example. Consider the function Et defined on A
on R3.                                                         which is the projection E '" 0 for a set S if there is
   Variants of the last statement of the theorem are           an interval of the form {y;ll - • -< y, < 1"t2 - •
quite standard in various contexts in measure theory,
so it will not be proved here. (In Halmos' book,
                                                                -< Y, < t,,13 - • -< Y3 < t31 which lies in S and zero
                                                               otherwise. It is easy to see that E, is a finitely addi-
reference 15, p. 54, the theorem is stated: "If J.I is a       tive projection valued measure on A. It is not com-
cr finite measure on a ring R, then there is a unique          pletely additive because the interval Iy;tl - 1 -< y,
measure )l on the cr ring, S(R), generated by R such            < Il,t, - 1 -< y, < 12,t3 - 1 -< y3 < t31 can be
that for E in R, )l (E) = J.I(E); the measure )l is            written as a denumerable union of intervals for which
q finite." The assumptions of the present Appendix
                                                               the coordinates Yi lie in intervals where right-hand
are more general in that one has a projection-valued           end points are lcss than ti . For each such interval
measure rather than a real-valued measure, but                 E,(S) = 0 but for the union Et(S) = E. Clearly,
otherwise everything is more special: The ring of sets,        the F corresponding to Et does not satisfy F(tl' .. t3)
R, is an algebra because the whole space is in R, the           = L(tl" ·t3).
measure is finite rather than only (J finite.) The first           If for each x E R3, F(x) = F _(x), then the phe-
part of the theorem is a consequence of the following          nomenon occurring in the example cannot happen
chain of four theorems. The argument is a straight-            and the projection valued measure defined by F is
forward generalization of one due to Hewitt.25                 (J' additive on A.
   If F is any function on R 3 whose values can be                  Theorem A2. Every projection valued positively
added and subtracted and [a,b) is an interval, define          monotonic function F on R3 which satisfies (A2) and
Lh[a,b) = F(bl,b"b3) - F(al,b"b3) - F(b " a2,b3)               lim F(xl - 11k" . 'X3 - 11k) = F(Xl,' . ·X3).                   (A3)
              - F(bl,b"a,,)     + F(a " a2,b3) + F(al,b"a3)    k~w
defines a projection valued measure E on A which is            of discontinuity unless F(!) = O. Thus, there are no
(J additive.                                                   nontrivial purely finitely-additive projection valued
   Proof. Since each element of A is a finite union of         measures quasi-invariant under translations.
disjoint intervals and E is finitely additive according           Theorem A4. Every finitely additive projection
to Theorem A2, it suffices to consider the case of a           valued measure on A which is quasi-invariant under
denumerable union of sets in A whose union is an               translations is (T additive.
interval. But such a union defines a monotonically                From Theorem A4 and the result already cited
increasing sequence of projections which converges             that (J additive projection-valued measures on A
to the projection belonging to the interval by virtue          have unique extensions to the Borel sets of R',
of (A3). Therefore E is complctely additive.                   Theorem A5 follows.
   A finitely-additive projection-valued measure E                While the results \if this Appendix make it clear
is called purely finitely additive if there is no nontrivial   that the assumptions of I to V can be weakened
(J additive projection-valued measure which is zero            without impairing the results of the paper, it should
on every set S for which E(S) = o. (It is not difficult        be noted that the particular weakened assumptions
to see that the example E, is purely finitely additive.)       used have been chosen primarily for reasons of
   Theorem AS. Every finitely additive projection              mathematical elegance. A deeper physical analysis
valued measure on A is the sum of a purely finitely            would ask whether the existence of some kind of
additive part and a (J additive part. This decomposi-          approximate position measurement implied the exist-
tion is unique.                                                ence of precise position measurements in the sense
   Proof. The difference F(x) - F _(x) is a projection,        of I to V.
and two such, corresponding to distinct points x are
orthogonal. Because the Hilbert space is separable,             APPENDIX II. SKETCH OF THE DERIVATION OF
there can be at most a denumerable set of points x              THE CONTINUOUS UNITARY REPRESENTATIONS
 where F(x) - F_(x) ;c 0; call them t(k). Let E,(k)(S)                 OF THE TRANSLATION GROUP
 be the projection-valued measure given in the                    The result of Theorem 1 which describes all unitary
example above with E = F(tC'» - F_(t(k». Then                  representations of the translation groups has been
                  E(S) -    Lk E,(k,(S)                        used in physics since the beginning of quantum
                                                               mechanics, but explicit mathematical statements and
defines a finitely-additive projection-valued measure          proofs of it are relatively recent. The purpose of
whose F satisfies (A3) for all points x and so by              this Appendix is to outline some of the ideas involved
Theorem A2 is (J additive. Thus                                in the proofs.
           E(S) = L,E,(k)(S)        + E(2)(S)                     The translation group of n-dimensional real
                                                               Euclidean space Rn will here be denoted ;t with
defines a decomposition into a purely finitely additive        elements a. (The whole machinery works in the same
part and a (J additive part. For the case in which             way for any dimension n so the assumption n = 3 is
E(S) is purely finitely additive, E(2)(S) = 0 because          dropped.) The derivation of Theorem 1 can be
otherwise E(2)(S) would be a (J additive projection-           divided into three parts:
valued measure vanishing whenever E(S) does in                    (1) Determination of the character group ;to of ;t,
contradiction with the definition of a purely finitely-           (2) Derivation of the spectral representation
additive measure. This shows that the purely finitely-
additive part of any E is uniquely determined by
the discontinuities of the corresponding F.
                                                                             U(a)   =   r e-i•. adF(p) ,
                                                                                        lx·
    Now note that if E(S) is quasi-invariant under
translations in the sense that E(S + a) = 0 if and
                                                                  (3) Spectral multiplicity theory for the projection
                                                               valued measure F on ;t*.
only if E(S) = 0, then the same applies to the purely
                                                               These stages actually reflect the historical develop-
finitely-additive part, E(J)(S), of E(S) and the
                                                               ment of the theorem and I will follow them here at
(J additive part of S. [E(S) is surely quasi-invariant
                                                               least in part.
if there exists a representation a -> U(a) of the transla-
                                                                  A character of X is a one-dimensional continuous
tion group such that E(S + a) = U(a)E(S)U(a)-11
                                                               unitary representation of X, i.e., a complex-valued
Furthermore, if F(!) has a nonzero discontinuity
F(I)(X) - F _(!)(x) at x = t, it must also have a              continuous function X of modulus one, which
                                                               satisfies
nonzero discontinuity at x = t + a. This statement
is in conflict with the dcnumerability of the points                          x(a   + b)   = x(a)x(b} .         (A4)
LOCALIZATION PROBLEMS                                                                                                       425
868 A. S. WIGHTMAN
It is well known that any such X is of the form XJ"              for self-adjoint operators H         = f:~pdF(p)      so (A6)
where                                                            can be written
[The argument goes as follows. From (A4), x(O) = 1               Here F defines a projection valued measure via
and x(a) can be written                                          F(S) = f sdF(p). The extension of (A7) to arbitrary
                                                                 Abelian groups was carried out by a number of
 x(a) = x(a',O,···Olx(O,a',O,·· ·0)· ··x(O,O,,· ·an ) ,          authors." Since the step from the one-dimensional
                                                                 to n-dimensional translation group is easy, and excel-
where x(O·· ·a;···) is a charnci.er of thp, one-dimen-           lent textbook accounts of Stone's theorem are
sional translation group of ai . Thus the problem is             available," no more details of (2) will be given here.
reduced to finding all characters for the translation               The problem of determining when two represen-
group of the real line. By introducing i In X = f one            tations are unitary equivalent is reduced by the
reduces the problem to that of finding all real con-             SNAG theorem to the corresponding problem for
tinuous f(a) defined mod 2.. such that                           their F's. A solution of this problem is provided by
            f(a)   + f(b)   =   f(a   + b) mod 2..       (A.5)   (3), the theory of spectral multiplicity. It shows that
                                                                 the unitary equivalence class of an F can be charac-
To complete the proof it is convenient to specify                terized by two objects, a measure class on :to and a
f(a) completely instead of mod 2... Because X is con-            multiplicity function on :t*, which described, re-
tinuous, a unique specification is obtained in some              spectively (and roughly), tell which irreducible
neighborhood of a = 0 by requiring f(O) = 0 and                  representations of :t occur in a --> Ural and how
f(a) continuous in the neighborhood. From (A5), one              often. This theory is to the theory of (2) what the
then derives qf(rr'e) = f(e) for any c in the neighbor-          Hellinger-Hahn theory of a self-adjoint operator30 is
hood and any integer q. Thus, again using (A5),                  to the spectral resolution of a self-adjoint operator.
f«p/q)c) = (p/q)f(c) for any rational number p/q                    There are available nearly as many approaches to
 < 1. The continuity of f then implies frye) = yf(c)             the theory of spectral multiplicity as there are
for every real number < 1, i.e., f(y) = yf(e)/e for y            authors who have written on the subject. One may
in the neighborhood. Finally, using (A5) again, one              make a direct analysis of the commutative algebra
getsf(y) = yf(e)/e mod 2dor all y. Q.E.D.]                       of projections." This leads to a decomposition of the
   The characters clearly form a group under multipli-           Hilbert space into orthogonal subspaces X; on which
cation                                                           the projections are uniformly j-dimensional. That
                                                                 means that X; is a direct sum of j subspaces X( .. ·x;
               X., (alx., (a) = X., +', (a)                      such that the projections E take of the form
and, if the usual topology of Euclidean space is
                                                                             E(ip,,···ip;)   =   (E,ip,,···E;ip;)
introduced for the p's, the group operations are con-
tinuous. The set of all characters (or equivalently the          and on xi the E, are uniformly one dimensional.
set of all p' s) is denoted :to and called the character         Finally, a uniformly one-dimensional algebra of
group of :t".                                                    projections is one which is maximal Abelian, i.e.,
   The step (2) alone can be regarded as a decompo-              any projection which commutes with all the given
sition of an arbitrary continuous unitary represen-              projections is one of them. It is shown that a uni-
tation into irreducibles. This operation is familiar in
quantum mechanics for the one dimensional transla-                  28 Stone's original paper is Ann. Math. 33, 643 (1932). The
                                                                 ~xten8ion :0 any locally compact Abelian group is contained
tion group as Stone's theorem: Anyone-parameter                  m M. Naumark, Izvest. Akad. Nauk U.S.S.R. 7, 237 (1943);
continuous unitary group is of the form                          W. Ambrose, Duke Math. J. 11, 589 (1944); R. Godement,
                                                                 Compt. rend. 218, 901 (1944). It is sometimes referred to as
                                                                 the SNAG theorem.
                   U(a)     =   exp -iaH ,               (A6)       ,. See for example F. Riesz and B. Sz.-Nagy, Le~tm8 d'analyse
                                                                 fonctionelle (Budapest, 1953), p. 377.
where H is self-adjoint. Then by the spectral theorem               30 See M. H. Stone, Linear Transformations in Hilbert Space
                                                                 (American Mathematical Society, Providence, Rhode Island,
                                                                 1932), Chap. VII.
                                                                    31 See, for example, H. Nakano, Ann. Math. 42, 657 (1941);
  27 This construction of the character group can be carried     I. E. Segal, Memoirs Am. Math. Soc. 9 (1951), Secs. I and II;
out for an arbitrary locally compact Abelian group. See, for     P. R. Halmos, Introduction to Hilbert Space and the Theary of
example, L. Pontrjagin, Topological Groups (Princeton Uni-       Spectral Multiplicity (Chelsea Publishing Company, New
versity Press, Princeton, New Jersey, 1939), Chap. V.            York,1951).
426                                                                                                     CHAPTER VIII
      APPENDIX III. QUASI-INVARIANT MEASURES                     From this equality, the required equivalence can be
                                                                 deduced as follows. Note that I sdx = 0 if and only
   In this Appendix, the structure of quasi-invariant
                                                                 if I _sdx = O. Thus, from (A9) , I sdx = 0 implies
measures defined on the Borel sets of R' is determined
for two different situations. In the first, the group
                                                                 /J(S+    x) = 0 for almost all x. By the quasi-invari-
                                                                 ance of /J, this, in turn, implies /J(S) = O. Conversely,
acting on R' is R' itself. Then, every finite quasi-
invariant measure is equivalent to Lebesgue measure.
                                                                                                 +
                                                                 if /J(S) = 0 and therefore /J(S x) = 0, (A9) implies
                                                                 Isdx = O.
In the second, the group acting on R' is the rotation
                                                                     This completes the proof of the equivalence of /J
group. Then the most general finite quasi-invariant
                                                                 with Lebesgue measure. The Radon-Nikodym theo-
measure is equivalent to a measure of the form
                                r
                                                                 rem guarantees that d/J(x) = p(x)dx where p(x) is
         /J(8) = /JoXs(O)   +    o
                                     dp(a)   J
                                             d",.(p) , (A8)
                                                                 positive and measurable.
                                                                     In the second situation, one has a finite measure /J
                                       sn~p;lpl-ar               on the Borel sets of R' such that for every Borel set S
                                                                 and every rotation R, /J(S) = 0 if and only if p.(RS)
where 1'0 ;;. O,Xs is the characteristic function of the
                                                                  = O.
set 8, d",.(p) is the invariant surface element on the
                                                                     It is easy to see that any such quasi-invariant
sphere Ipi = a, and dp(a) is a measure on the positive
                                                                 measure is equivalent to an invariant measure. In
real axis.
                                                                 fact, consider the non-negative set function
   The result for the first situation is a special case
of the general result that any Borel measure on a
locally compact group quasi-invariant with respect                                ji(S) =   JdRp.(RS) ,
to the action of the group on itself is equivalent to
Haar measure" The proof of Loomis given in" is so                where the integration is over all the rotation group
simple that it will be repeated here in the special              and dR is the invariant measure on the rotation group
context of R'.                                                   for which I dR = 1. It is not difficult to verify that
   Let S be any Borel set in R'. Denote the finite               ji is (f additive. Furthermore, it is equivalent to /J,
measure quasi-invariant with respect to Lebesgue                 because ji(S) = 0 implies /J(RS) = 0 for almost all
measure by /J. Let S' be the set in R' defined by                R, which, because of the quasi-invariance of /J, yields
x - yES. (It is a Borel set because x - y is a con-              /J(S) = O. Conversely, /J(S) = 0 implies /J(RS) = 0,
tinuous function of x and y. Then the characteristic             which implies ji(S) = O. Thus, it suffices to consider
                                                                 invariant /J.
  32 R. Godement, Ann. Math. 53, 68 (1951); J. Dixmier,             It is convenient in completing the proof to use an
"Les Algebres d'Operateurs dans l'Espace Hilbertien,"            alternative characterization of a finite measure on R'
Algebres de von Neumann (Gauthier-Villars, Paris, 1957), Chap.
II; see especially pp. 216-224.                                  as a non-negative bounded linear functional on the
  " See, for example, G. W. Mackey, Duke Math. J. 16, 313        continuous functions of compact support, e(R').
(1949), Lemma 33; J. von Neumann, Bull. Amer. Math. Soc.
42, 343 (1936).                                                  That the functional, fI, is non-negative means p.(f)
LOCALIZATION PROBLEMS                                                                                                                         427
870 A. S. WIGHTMAN
; ;. ° f ;;;.
      for           0, fEe. That          110   is bounded means                 necessary, be altered on a set of p.-measure zero so
                                                                                 that it becomes measurable in both variables relative
    sup     1110 (f) I   <   cx),   where        If I =   sup If(x) I .          to the measure 110 X a, where a is the invariant
    Ifl""                                                 .ER·
                                                                                 measure on the 2 X 2 unitary unimodular group.
The relation between the functional                       110   and the corre-      Let ~;,j = 1,2,··· be a complete orthonormal
sponding measure 110 is simply                                                   set in X. Then it suffices to treat the functions
                                                                                 (~j(P), Q(P,A)~.(p)) separately because the general
                         p.(f) = jf(X)dp.(x) .                                   case then follows by the expansions
Since the measure is uniquely determined by the                                         'It,   =   L aj~"    'It,   =   L bj~j,        and
functional, to verify the equality of two measures it                                   ('It, (p),Q(p,A)'It,(p)) = L            a1b,
suffices to verify the equality of the corresponding                                                                    i,k-I
functionals. 34                                                                                X   (~j(p),Q(p,A)~.(p))    .
    Now, for an invariant measure                                                An ugly little lemma is necessary.
                                                                                   Lemma. Let f(p,A) be a complex-valued function
                                                                                 on R' X G which is 110 measurable and p. essentially
                                                                                 bounded on R' for each A E G, the 2 X 2 unitary
because the approximating sums to the integral                                   unimodular group. Suppose J f(p,A)xE(p)dp.(p) is a
f (Rf)(x)dR converge uniformly in x, and p.(f) is                                measurable on G for each p. measurable subset E of
continuous for uniform convergence of its argument.                              R' of finite measure. Here, a-measurability on Gis
Butf(x) ---> f (Rf)(x)dR = f f(Rx)dR maps the con-                               with respect to the invariant measure dA.
tinuous functions of compact support on R' onto the
continuous functions of compact support on
" Ixl <        and convergence in e(R') implies con-
              CX)
                                                                            °      Then there exists a function, g, 110 X a measurable
                                                                                 on R' X G, and such that for a certain p.-measurable
                                                                                 subset N of R' of zero measure
vergence in e([O, ro )). Thus, the functional 110 re-
garded as defined on e([O,     defines a finite measure
                                      CX) ) )
                                                                                  f(p,A) = g(p,A)         for all A E G and pEEN.
on the non-negative real axis. Splitting it into a con-                             This lemma is a special case of Lemma 3.1 of ref-
tribution with support at 0, and the rest, one has                               erence 33, and will not be proved here.
just the 1100 and dp of (AS). In fact, (AS) is just an                              The lemma shows that by a suitable redefinition
explicit form in terms of measure of                                             of Q(p,A) which does not affect the corresponding op-
                                                                                 erator Q(A), one can have Q(P,A),p. X A measurable.
                         p.(f) = p.URfdR ) .                                        The next step in the argument is to show that in the
                                                                                 equation
   APPENDIX IV. SOME MEASURE-THEORETIC                                            L1 (~j(p),Q(P,A)~I(P))(~'(P),Q(A -1p,B)~.(p))
NICETIES CONNECTED WITH EQS. (3.10) AND (3.17)
                                                                                     = (~j(p),Q(p,AB)~.(p))                                  (AI0)
   This Appendix is devoted to some fine points which
arise in the otherwise elementary derivation of                                  which holds for each A,B E G and pER' such that
Sec. 3.                                                                          pEE N , (A,B),A-'p EE N,(A,B) where N, and N, are
   Recall that Theorem 1 states that if                                          p.-measurable sets of p'-measure zero, the right- and
                                                                                 left-hand sides are p. X a X a measurable on R' X iG
                         [Q(A),U(a,l)] = 0,                                       X G. Because a Borel-measurable function of a Borel-
                                                                                 measurable function is Borel measurable, it suffices to
then Q(A) is of the form
                                                                                 prove that the mappings T , :{p,A,B} ---> {p,AB} and
                    (Q(A)~)(p)       =   Q(p,A)~(p)              ,               T,:[p,A,B} ---> {A-1p,B} are Borel-measurable func-
                                                                                 tions.
where for each unitary unimodular A, Q(P,A) is
                                                                                     Now T, and T, are continuous, and a set F which is
measurable in p in the sense that for each 'It " 'It,
                                                                                 110 X a measurable in R' X G differs from a Borel set
EX, ('It, (p), Q(p,A)'It,(p)) is 110 measurable. The first
                                                                                 by a subset of a Borel set of zero p. X a measure."
step in the argument is to prove that Q(p,A) can, if
                                                                                     Furthermore, a continuous function has the prop-
                                                                                 erty that the antecedent of any Borel set of its range
  34 See, for example, P. R. Halmos, Measure Theory (D. van
Nostrand Company, Inc., Princeton, New Jersey, 1950), pp.
243-9.                                                                            .. See reference 15, pp. 55-56.
428                                                                                                         CHAPTER VIII
872 A. S. WIGHTMAN
so that the required continuity follows from (A12)        sides of this equality are (p. X a) measurable func-
and the proof is complete.                                tions of p and A and the set on which the equality
   Finally, there is the matter of sets of measure zero   fails is of p. X a measure zero. It then follows that, for
in the criterion for unitary equivalence (3.17). Solved   fixed p, the set of A on which it fails is of a measure
for V(A-lp) it reads                                      zero. That in turn implies that the set of A -lp for
                                                          which it fails is of p. measure zero. Picking one p from
                                                          each orbit and altering yep) on the corresponding set
                                                          of measure zero one gets a new family V (p) which is
By an argument just like that used in the first few       also measurable andyields the same V but for which
paragraphs of this Appendix, one concludes that both       (3.17) always holds.
430                                                                                                                   CHAPTER VIII
                                                                D.Han
        National Aeronautics and Space Administration, Goddard Space Flight Center (Code 636), Greenbelt, Maryland 20771
                                                               Y.S.Kim
                     Department of Physics and Astronomy, University of Maryland, College Park, Maryland 20742
                                                           Marilyn E. Noz
                             Department of Radiology, New York University, New York, New York 10016
                                                   (Received 25 September 1986)
                    A Lorentz-covariant localization for light waves is presented. The unitary representation for the
                 electromagnetic four-potential is constructed for a monochromatic light wave. A model for covari-
                 ant superposition is constructed for light waves with different frequencies. It is therefore possible to
                 construct a wave function for light waves carrying a covariant probability interpretation. It is
                 shown that the time-energy uncertainty relation (~t)( ~w b 1 for light waves is a Lorentz-invariant
                 relation. The connection between photons and localized light waves is examined critically.
                                                                                #
other physical phenomena.
   In Sec. II, we start with the motion of free-particle
wave packets in the Schriidinger picture of nonrelativistic
quantum mechanics. For localized light waves, there is
no difficulty in giving a probability interpretation if                                 1_ BO#5T
                                                                                              1
Lorentz boosts are not considered. It is pointed out that
the basic problem for light waves is how to make the
probability interpretation Lorentz covariant.                             ib)                   z               z
   In Sec. III, we discuss Lorentz-transformation proper-
ties of the four-vector representation for photons. Section
                                                                                --                   -
                                                                                      r'
                                                                be given to light waves.
   If the momentum is not sharply defined, we have to
                                                                   For light waves, let us start with the usual expression
take the linear superposition
     1/llz,l)=   Jgik)exp[ilkz -k't!2m)jdk                i6)
                                                                     /lz,O=     12~        f g(k)ei,h-wtldk .               (8)
   (I) We like to have a quantal wave function for light        called the photon polarization vectors.
waves. However, it is not clear which component of the             In order that the four-vector be a helicity state, it is
Maxwell wave should be identified with the quantal wave         essential that the time.like and longitudinal components
whose absolute square gives a probability distribution.         vanish:
Should this be the electric or magnetic field, or should it
                                                                                                                          (13)
be the four-potential?
   (2) The expression given in Eq. (8) is valid in a given      This condition is equivalent to the combined effect of the
Lorentz frame. What form does this equation take for an         Lorentz condition
observer in a different frame?
   (3) Even if we are able to construct localized light             _a_[A"(x)]=O                                          (14)
waves, does this solve the photon localization problem?              ax"               '
   (4) The photon has spin I either parallel or anti parallel   and the transversality condition
to its momentum. The photon also has gauge degrees of
freedom. How are these related to the above-mentioned               V·A(x)=O.                                             (15)
problems?
                                                                As before, we call this combined condition the helicity
   Indeed, the burden on Eq. (8) is the Lorentz covariance.
                                                                gauge. IS
It is not difficult to carry out a spectral analysis and
                                                                  While the Lorentz condition of Eq. (14) is Lorentz in-
therefore to give a probability interpretation for the ex-
                                                                variant, the transversality condition of Eq. (15) is not.
pression of Eq. (8) in a given Lorentz frame. However,
                                                                However, both conditions are invariant under rotations
this interpretation has to be covariant. This is precisely
                                                                and under boosts along the direction of momentum. We
the problem we are addressing in the present paper.
                                                                call these helicity preserving transformations. The boost
                                                                along an arbitrary direction is illustrated in Fig. 2. This is
            III. UNITARY REPRESENTATION
                                                                not a helicity preserving transformation. However, ac-
                FOR FOUR-POTENTIALS
                                                                cording to Ref. 15, we can express this in terms of helicity
   One of the difficulties in dealing with the photon prob-     preserving transformations preceded by a gauge transfor-
lem has been that the electromagnetic four-potential could      mation.
not be identified with a unitary irreducible representation        Let us consider in detail a boost along the arbitrary
of the Poincare group.12-15 The purpose of this section is      direction specified in Fig. 2. This boost will transform
to resolve this problem. In Ref. 15 we studied unitary          the momentum to p', as is illustrated in Fig. 2;
transformations associated with Lorentz boosts along the                                                                  (]61
direction perpendicular to the momentum. In this section
we shall deal with the most general case of boosting along      However, this is not the only way in which p can be
an arbitrary direction.                                         transformed to p'. We can boost p along the z direction
   Let us consider a monochromatic light wave traveling         and rotate it around the y axis as is shown in Fig. 2. The
along the z axis with four-momentum p. The four-                application of the transformation [R(&IB,(S-I] on the
potential takes the form                                        four-momentum gives the same effect as that of the appli-
                                                                cation of B¢( 71). Indeed, the matrix
                                                          (9)
                                                                    D( 71)= [B.( 71 I]-IR W)B,(S)                         1171
with
            o   -i 0 0
                o   0 0
       S]= 0    0   0 0                                 (Ill
            o   0   0 0                                                    ----------__ p-6p
                                                                   FIG. 2. Lorentz boost along an arbitrary direction of the
 The four-vectors satisfying this condition are                 light wave. The four-momentum can be boosted either directly
       Ai=(I,±i,O,O) ,                                   (121   by B, or through the rotation Ry preceded by B, along the     l
leaves the four-momentum invariant, and is therefore an                   course, is a limitation of the model we present. However,
element of the E(2)-like little group for photons.                        our apology is limited in view of the fact that laser beams
   The effect of the above D matrix on the polarization                   these days can go to the moon and come back after reflec-
vectors A,± has been calculated in the Appendix, and the                  tionYl
result is                                                                    With this point in mind, we note first that the above-
                                                                          mentioned unitary transformation preserves the photon
                                                                (18)
                                                                          polarization. This means that we can drop the polariza-
where                                                                     tion index from A ~ assuming that the photon has either
                                                                          positive or negative polarization. A ~(x) can now be re-
                                                                          placed by A (x).
                                                                             Next, the transformation matrices discussed in this sec-
                                                                          tion depend only on the direction and the magnitude of
uly/,I)I= [
            cos~ 1 [cost 1J + [[ cos~cosh-i         ]'
                                                         -I
                                                              ]112        the boost but not on the photon energy. This is due to the
                                                                          fact that the photon is a massless partic1e 17 Indeed, the
                                                                          matrices in Sec. III remain invariant even if w in Eq. (91 is
                                                                          replaced by a different value. This means that for the su-
Thus D 1y/) applied to the polarization vector results in                 perposition of two different frequency states,
the addition of a term which is proportional to the four-
                                                                                                                                       (241
momentum. D (y/ 1 therefore performs a gauge transfor-
mation on A~. 14,15                                                       a Lorentz boost along an arbitrary direction results in a
   With this preparation, let us boost the photon polariza-               rotation preceded by a boost along the z direction. Since
tion vector                                                               neither the rotation nor the boost along the z axis changes
                                                                          the magnitude of Aili = 1,2), the quantity
                                                                (19)
                                                                                                                                       (25)
The four-vector      A~ satisfies the Lorentz condition
p'~ A±~ ~ 0, but its fourth component will not vanish.                    remains invariant under the Lorentz transformation. This
The four-vector .4~ does not satisfy the helicity condi-                  result can be generalized to the superposition of many dif-
tion.                                                                     ferent frequencies:
   On the other hand, if we boost the four-vector  after A'ie                 A(x)=~AkelWklz-t) ,
performing the gauge transformation D 11/ I,                                                                                           (26)
                                                                                        k
      A'f =B"Iy/IA,±                                                      with
          ~B"IY/)I [B"IY/I]-IRII)IB,(#;) IA,±                                    I A I'=.ll Ak   12 .                                  (27)
                                                                                            k
         =R(I)IB,(#;IA'±                                        (201
                                                                          The norm I A I' remains invariant under the Lorentz
Since B,(#;) leaves A,± invariant, we arrive at the con-                  transformation in the sense that it is invariant under rota-
clusion that                                                              tions and is invariant under the boost along the z direc-
                                                                          tion.
                                                                (21)
                                                                             Can this sum be transformed into an integral form of
This means                                                                Eq. (8)? From the physical point of view, the answer
                                                                          should be yes. Mathematically, the problem is how to
      A'±=Bx(y/JD(y/)A~     =lcose,+i,-sinl),O),                (22)      construct a Lorentz-invariant integral measure. It is not
which satisfies the helicity condition                                    difficult to see that the norm of Eq. (27) remains invariant
                                                                          under rotations, which perform unitary transformations
      A';'=O                                                              on the system. The problem is how to construct a mea-
                                                                          sure invariant under the boost along the z direction.
and                                                             (231
                                                                             For this purpose, we shall borrow the techniques
      p'·A·,=O.                                                           developed for the covariant harmonic-oscillator formalism
                                                                          which has been very effective in explaining the basic rela-
The Lorentz boost B (y/ 1 on A ~ preceded by the gauge                    tivistic features in the quark model,lO, 18-20 and which en-
transformation Diy/I leads to the pure rotation R(el.                     ables us to combine covariantly Heisenberg'S position-
This rotation j:.-, a finite-dimensional unitary transforma-              momentum uncertainty relations and the c-number time-
tion.                                                                     energy uncertainty relation.),9,21
   The abme result indicates, for a monochromatic wave,
that all we have to know is how to rotate. If, however,                                IV. LOCALIZATION PROBLEMS
rhe photol! momentum has a distribution, we have to deal                           IN THE RELA TIVISTIC QUARK MODEL
Wit il a linear superposition of waves with different nl()-
melaa. The phot~)n momentum can have both longitudi-                        We ~hall di~cu~.., in thi,;,; section the :.lSp~Ch d' ~he co-
nal anO tJ,lIl~\~r...,c distributions. In thi~) paper we shall Q\-        vuriant harmunic-t)~cillator formalism whi..:h a:'(..: ~[seful in
.\utf/e t!!at th~re [~onl)' longitudinal distribution. Thh,          or   converting the :-,lllll ,1f Eg. (26) into an llltefld form
434                                                                                                                      CHAPTER VIII
                                                                                           -,
jet phenomenon 2o                                                                      ,,·0 BOOS1'   ,'Oil
origin in the hadron-rest frame with the z' and /' vari-
ables. The ground-state harmonic-oscillator wave func-
                                                                    ~
                                                                    -
                                                                            ------------------------------
                                                                                Cl O                            ~o
tion takes the form
tively, then the standard procedure is to introduce P and q           is independent of {3. However, dz ~ or dz + alone is not.
                                                                      The     integration     over    z+     gIves   the   factor
                                                               (35)
                                                                      [i 1+f3I/( 1_{3)]'12, and this factor is compensated by its
where P is the four-momentum of the hadron, and q is                  inverse [( l-f3I/( I +{3)]'12 coming from the z~ integra-
the momentum-energy separation between the quarks.                    tion.
We are concerned here with the uncertainty relations be-                 We used in this section the Gaussian form of the wave
tween the x variables of Eq. (28) and the above q vari-               function purely for convenience. The above reasoning is
ables. The momentum-energy wave function is                           valid for all forms of distributions having the same
                                                                      space-time boundary condition as that of the Gaussian
     <!>iq"qo)=   [5.; 1f   exp[i(q,z-qot)]1/J(z,t)dtdz
                                                                      function. Indeed, if we give up the localization along the
                                                                      z + axis, then the integration measure along the z ~ axis
                                                                      should be compensated by the contraction or elongation
              =(l/V;)expl-[(q:)'+(q~ )']/21,                  (36)    along the z + direction. If the system is boosted along the
where q, and qo are the momentum and energy separation
                                                                      z direction, dz + and dz ~ are transformed as
                                                                                  V. COVARIANT LOCALIZATION
                                                                                        OF LIGHT WAVES
                                                                                          r'
                                                              (39)    age value like the internal momentum distribution.
                                                                        With this point in mind, let us rewrite Eq. (8) as
This means that        the major and minor axes of the
momentum-energy
of the minor and
                       coordinates are the Fourier conjugates
                       major axes of the space-time coordi-
                                                                          /(z,I)=   [2~          f g(k)e;(k'~wt)dk   .        (43)
nates, respectively.    Thus we have the following Lorentz-           We shall approach this problem using Dirac's light-cone
invariant uncertainty relations. 25                                   coordinate system discussed in Sec. IV. For convenience,
     (<lz+ i(f!.q~ )=(<lz: i(f!.q"- bl ,                              we shall define here the light-cone variables as
                                                              (40)
     (<lz~)(f!.q+)=(<lz*-)(f!.q:bl .                                      s=(z+I)/2, u=(z-t).                                 (44)
  These uncertainty relations are well understood when                sand u are different from z + and z _ of Eq. (31) by a
the hadron is at rest with {3=0. On the other hand, the               factor of Vl. but their Lorentz transformation property
limit {3~ I can teach us many interesting lessons. The                remains the same. We shall also define the new momen-
connection between this limit and Feynman's original                  tum variables as
form of the parton model" has been discussed repeatedly                    ku =(k +w)/2, k, =(k -wi .                         (45)
in the literature!' 10 As far as the localization of massless
particles is concerned, the distribution along one of the             In the case of light waves, k, vanishes and k. becomes k
                                                                                        r'
light-cone axes becomes so widespread that it loses its lo-           or w. In terms of the light-cone variables, the expression
calization along the axis.                                            of Eq. (43) becomes
                                                                                  [2~
   In Sec. V, we shall "give up" the localization along one
of the light-cone axes in order to study photons and light                /(u)=              f   g(k)eikUdk .                (46)
waves. In so doing, we will have the problem of normal-
izing the wave function by integration. The integration                  For a massive particle, the most convenient Lorentz
measure dz + dz       is a boost-invariant ·quantity. This            frame is the frame in which the particle is at rest, as was
means that the normalization integral,                                noted in Sec. IV. For a massless particle, as our study in
                                                                      Sec. III suggests,26 we can start with a specific Lorentz
     f   ¥,(z,ti,'dz +dz~ ,                                   (41)    frame in which the photon momentum has a given magni-
436                                                                                                                             CHAPTER VIII
 tude along the z direction. In this Lorentz frame, we as-            this in order to make the system covariant. The net result
 sume that the average photon frequency is 0 0                        is that
                                                                                                       '!4
                                                               (47)
                                                                            !(u)=
                                                                                       [
                                                                                           :~~     ]
                                                                                                             A(u)                              (57)
 with
                                                                      and
             f
                                                                                                                                Il-W-]'!2n 1I
        N=       I golk,Oo)     I'dk                           (48)
        0=*fklglk,0)12dk,                                      ISO)
                                                                            g(   k)=   I    I
                                                                                           7Tb
                                                                                                 1'/4 [       I ]'12 [1-.{3
                                                                                                             no
                                                                                                                            r
                                                                                                                      ]:1(3 J
                                                                                                                                 4
                                                                            (Jlu)(Jlkbl.                                                       (61)
 then 10/Oo)du is a Lorentz-invariant measure, and f( u)
 can take the form                                                    From the definition given in Eq. (45), Jlk = Jlu). From
                                                                      Eq. (44), Jlu = - JlI for a fixed value of z. This relation
                                                               (54)
                                                                      becomes Jlu = JlI when the symbol tJ. means the width of
 where A lu) is a scalar quantity. The integral form of Eq.           distribution. Thus the time-frequency relation IJlw)(JltJ
 (26)is                                                               is a Lorentz-invariant relation.
                                                               ISS)
                                                                                       VI. THE CONCEPT OF PHOTONS
 with
                                                                         We discussed in this paper Lorentz-covariant wave
 (!1!!1o) L~~     I Alu)    1
                                2du =(I!!1)   L~~ Ia Ik -0) 2dk .
                                                           1
                                                                      functions for light waves. It is possible to construct a lo-
                                                               (56)   calized wave function for light waves with a Lorentz-
                                                                      invariant normalization. The mathematics of this pro-
    Indeed, in Eq. (56), we have to multiply and divide the           cedure is not complicated. We are then led to the ques-
 right- and left-hand sides, respectively, by VQ. We did              tion of why the photon localization is so difficult, while it
LOCALIZATION PROBLEMS                                                                                                               437
is possible to produce photons in states narrowly confined            when we make the transition from localized Maxwell
in space and time.2J                                                  waves to photons through second quantization.
   Let us see how the mathematics for the light-wave lo-
calization is different from that of quantum electro-                                       ACKNOWLEDGMENT
dynamics (QED) where photons acquire a particle inter-
pretation through second quantization. In QED, we start                 We are grateful to Professor Eugene P. Wigner for ex-
with the Klein-Gordon equation with its normalization                 plaining to us the background of the photon localization
procedure. As a consequence, we use the expression 28                 problem.
     Ilu)=A(u)                                                 (63)
                                                                      with e'=cosh1)+lcosd>lsinh1). The rotation matrix is
in QED, without the factor 1fl/110)1/2 discussed in Sec.
V. As a consequence, the normalization condition is that                            cose       0 sine 0
the integral                                                                         0         I   0    0
                                                                          R(8)=                                                    IA3)
         f [A'(U)~A(U)-A(U)~A'IU) ]dZ
                                                                                   -sine 0 cose 0
     i                                                         (64)
                                                                                        0      0   0       1
IW. Heitler, The Quantum Theory of Radiation, 3rd ed. (Claren-            features of the parton picture, see Y. S. Kim and M. E. Noz,
   don, Oxford, 19541.                                                    Phys. Rev. D 15, 335 (1977); J. Math. Phys. 22, 2289 1198});
'E. P. Wigner, in Aspects of Quantum Theory, in Honour of P.              P. E. Hussar, Y. S. Kim, and M. E. Noz, Am. J. Phys. 53,
   A. M. Dirac's 70th Birthday, edited by A. Salam and E. P.               142 11 985). For earlier papers dealing with the dependence of
   Wigner (Cambridge University Press, London, 19721.                     the quark-model wave function on the time-separation vari-
Jp. A. M. Dirac, Proc. R. Soc. London, Ser. A 114, 243, 11927);           able, see G. Preparata and N. S. Craigie, Nucl. Phys. B \02,
    114,710 11927).                                                       478 11976); 1. Lukierski and M. Oziewics, Phys. Lett. 69B,
4T. D. Newton and E. P. Wigner, Rev. Mod. Phys. 21, 400                   339 (1977); D. Dominici and G. Longhi, Nuovo Cimento
   (19491; A. S. Wightman, ibid. 34, Ser. A 845 (19621; T. O.             42A, 235 119771; T. Goto, Prog. Theor. Phys. 58,1635 (19771;
    Philips, Phys. Rev. 136, B893 (19641.                                 H. Leutwyler and J. Stern, Phys. Lett. 73B, 75 (19781; Nucl.
5J. M. Jauch and C. Piron, Helv. Phys. Acta 40,559119671; W.              Phys. B 157, 327 119791; I. Fujiwara, K. Wakita, and H.
   O. Amrein, ibid. 42, 149 11 9691; K. Kraus, Am. J. Phys. 38,           Yoro, Prog. Theor. Phys. 64, 363 119801; 1. Jersak and D.
    1489 (1970); Ann. Phys. (NYI, 64, 311 119711; H. Neumann,             Rein, Z. Phys. C 3, 339 119801; I. Sogami and H. Yabuki,
   Commun. Math. Phys. 23, 100 119711; H. Neumann, lIelv.                 Phys. Lett. 94B, 157 (1980); M. Pauri, in Group Theoretical
   Phys. Acta 45, 881 119721; S. T. Ali and G. G. Emch, J.                Methods in Physics, Proceedings of the IX International Colla-
   Math. Phys. 15, 176 (19741; G. C. Hegerfeldt, Phys. Rev. D             quim, Cocoyoc, Mexico, edited by K. B. Wolf ISpringer-
    10, 3320 (1974); K. Kraus, in Foundations of Quantum                  Verlag, Berlin, 1980); G. Marchesini and E. Onofri, Nuovo
   Mechanics and Ordered Linear Space, Vol. 29 of Lecture                 Cimento A 65, 298 1198}); E. c. G. Sudarshan, N. Mukunda,
   Notes in Physics, edited by J. Ehlers et 01. (Springer-VerIag,         and C. C. Chiang, Phys. Rev. D 25, 3237 11982). The uncer-
   BerIin, 19741, p. 206.                                                 tainty relation applicable to the time~scparation between the
6For review papers on this subject, see R. Kraus, in The Uncer~           quarks is not inconsistent with the proposition that the time~
   tainty Principle and Foundations of Quantum Mechanics, edit-           energy uncertainty is applicable only to the time separation
   ed by W. C. Price and S. S. Chissick (Wiley, New York,                between two independent events. See L. D. Landau and E. M.
   19771; S. T. Ali, Riv. Nuovo Cimento 8, 1 (19851.                      Lifschitz, Quantum Mechanics, 2nd ed. (Pergamon, New
7For early papers on this subject, see P. A. M. Dirac, Proc. R.           York, 19581.
   Soc. London, Ser. A 183,284119451; H. Yukawa, Phys. Rev.            lOY. S. Kim and M. E. Noz, Theory and Applications of the
   91, 416 11953); M. Markov, Nuovo Cimento Suppl. 3, 760                Poincare Group IReidel, Dordrecht, 19861.
   (1956); T. Takabayasi, Nuovo Omento 33, 668 (1964): S.             11E. Inonu and E. p, Wigner, Nuova Omenta 9,705(952); M.
   Ishida, Prog. Theor. Phys. 46, 1570 119711; 46, 1905 119711.          Hamermesh, Group Theory (Addison~ Wesley, Reading,
8For some of the recent articles, see R. P. Feynman, M. Kis-             Mass., 19621.
   linger, and F. Ravnda1, Phys. Rev. D 3,2706119711; Y. S.           12£. p, Wigner, Ann. Math, 40,149 (1939); V. Bargmann and E.
   Kim and M. E. Noz, ibid. 8,3521 (1973); M. J. Ruiz, Phys.             P. Wigner, Proc. Natl. Acad. Sci. U.S.A. 34, 211 11948); E. P.
   Rev. 10,430611974); Y. S. Kim, Phys. Rev. D 14,273 (1976);            Wigner, Z. Phys. 124, 665 11948); E. P. Wigner, in Theoreti-
   Y. S. Kim and M. E. Noz, Found. Phys. 9, 375 119791; Y. S.            cal Physics. edited by A. Salam (International Atomic Energy
   Kim, M. E. Noz, and S. H. Oh, J. Math. Phys. 20, 1341                 Agency, Vienna, 19621.
   (19791; Am. J. Phys. 47, 892 119791; J. Math. Phys. 21 1224        "Chou Kuang-Chao and L. G. Zastavenco, Zh. Eksp. Teor.
  119801; D. Han, Y. S. Kim, and M. E. NOL, Found. Phys. 11,             Fiz. 35, 1417 119581 [Sov. Phys.~JETP 8, 990 (19591]; M.
   895 (1980); D. Han and Y. S. Kim, Am. J. Phys. 49, 1157               Jacob and G. C. Wick, Ann. Phys. INYI 7, 404 119591; A. S.
   (1981); 49, 348119811; D. Han, M. E. Noz, Y. S. Kim, and D.           Wightman, in Dlspenioll Relations and Elemclllary Particlcs,
  Son, Phys. Rev. D 25,1740119821; D. Han, Y. S. Kim. and                edited by C. DeWitt and R. Omnes (Hermann, Paris, 1960);].
   D. Son, ibid. 26,371711982); D. Han, M. E. Noz, Y. S. Kim,            Kuperzslych, Nuovo Cimenlo 31B, I 119761; Phys. Rev. D
  and D. Son, ibid. 27,3032119831. For review oriented arti-             17,629 (1978); A. Janner and T. Jenssen, Physica 53, I (19711;
  cles comparing various early approaches to this problem, see           60, 292 119721; J. L. Richard, Nuovo Cm]ento 8A, 485
  T. Takabayasi, Prog. Theor. Phys. Suppl. 67, I (19791; D.              119721; H. P. W. Gottlieb. Proc. R. Soc. London. Ser. A 368,
   Han and Y. S. Kim, Prog. Thcor. Phys. 64,1854 (19801.                 429 119791.
9The uncertainty rclation applicable to the timLO separation be-      145. Weinberg, Phys. Re\ 134, R 8~2 (19641; S. WClI1herg, Phys.
   tween the constituent quarks is responsible for the pl'ctliiari-      Rev. 135, B1049 119041; J. Kupersztych. Ph)s. Rev. D 17,629
   ties in Feynman's parton p:cture universally \.)hst'fved In           1.19781; D.Han. Y. S. Kiln. and D. Son, ibid. 26.3717119821;
  high-energy hadronic experiments.. Sec, P. E.     Hu~s"r,   Phys.      D. Han, Y S. Kim, and D. ')on, Phy". Let\. 13IB, :'27 (198.1);
  Rev. D 23, 2781 (19811. For papers dealing with qualitative            D. Han, Y. S. Kim, \1. E ~1)7, and D. Son. Am. J. Phys. 52,
LOCALIZATION PROBLEMS                                                                                                              439
     1037 (1984).                                                      Phys. 50, 642 119821. For some of the recent articles on the
ISD. Han, Y. S. Kim, and D. Son, Phys. Rev. D 31, 328 (1985).          subject, see E. A. Gislason, N. H. Sabelli, and J. W. Wood,
16C. O. Alley, in Quantum Optics, Experimental Gravity, and            Phys_ Rev. A 31, 2078 (1985); M. Hossein Partovi, Phys. Rev.
  Measurement Theory, edited by P. Meystre and M. O. Scully            Lett. 23, 2887(1986).
  (Plenum, New York, 1983).                                          22R. P. Feynman, in High Energy Collisions, Proceedings of The
17E. P. Wigner, Rev. Mod. Phys. 29, 255 11957); D. Han, Y. S.           Third International Conference, Stony Brook, New York,
  Kim, and D. Son, J. Math. Phys. 27, 2228([9861.                       1969, edited by C. N. Yang et al. (Gordon and Breach, New
18For some of the latest papers on hadronic mass spectra, see N.        York, 1969); J. D. Bjorken and E. A. Paschos, Phys. Rev.
  Isgur and G. Karl, Phys. Rev. D 19, 2653 ([978); D. P. Stan-         185,1975 (19691.
  ley and D. Robson, Phys. Rev. Lett. 45, 235 ([9801. For re-        23p. A. M. Dirac, Rev. Mod. Phys. 21, 392 (1949); D. Bohm,
  view articles written for teaching purposes, see P. E. Hussar,       The Special Theory of Relativity (Benjamin/Cummings, Read-
  Y. S. Kim, and M. E. Noz. Am. J. Phys. 48.1038,119801; 48,           ing, Mass., 1965); Y. S. Kim and M. E. Noz, Am. J. Phys.
  1043([980). See also O. W. Greenberg, Am. J. Phys. 50,1074           50, 721 (] 9821.
  (1982).                                                            24For earlier discussions on Lorentz-deformed hadrons, see N.
19For papers dealing with form factor behavior, K. Fujimura, T.        Byers and C. N. Yang, Phys. Rev. 142, 976 (1966); T. T.
  Kobayashi, and M. Namiki, Prog. Theor. Phys. 43, 73 (1970);          Chou and C. N. Yang, ibid. 170, 1591 (1968); J. D. Bjorken
  R. G. Lipes, Phys. Rev. D 5, 2849 ([972); Y. S. Kim and M.           and E. A. Paschos, ibid. 185, 1975 (1969); A. L. Licht and A.
  E. Noz, ibid. 8,3521 (19731. See also Ref. 10.                       Pagnamenta, Phys. Rev. D 2,1150, (1970); 2,1156 (1970). V.
2oFor papers dealing with the jet phenomenon, see T. Kitazoe           N. Gribov, B. L. Ioffe, and I. Ya. Pomeranchuk, J. Nucl.
  and S. Hama, Phys. Rev. D 19,2006 ([979); Y. S. Kim, M.              Phys. (USSR) J 2, 768 (1965) [Sov. J. Nucl. Phys. 2, 549
  E. Noz, and S. H. Oh, Found. Phys. 9,947 (1979); T. Kita-            (1966)]; B. L. Ioffe, Phys. Lett. B 30. 123 (19691; Y. S. Kim
  zoe and T. Morii, Phys. Rev. D 21, 685 ([980); Nucl. Phys. B         and R. Zaoui, Phys. Rev. D 4,1738119711; S. D. Drell and T.
  164, 76 ([980).                                                      M. Yan, Ann. Phys. (N.Y.) 60, 578 (1971).
21For continuing debates on the time-energy uncertainty rela-        2Sy. S. Kim and M. E. Noz, Found. Phys. 9, 375 ([979).
  tion, see Y. Aharonov and D. Bohm, Phys. Rev. 122, 1649            26Th is point has been extensively discussed in the literature. See
  (19611; V. A. Fock, Zh. Eksp. Teor. Fiz. 42, 1135 ([962)             Refs. 10, 12, 13, 14, 15, and 17.
  [Sov. Phys.-JETP 15,784119621]; J. H. Eberly and L. P. S.          27For a pedagogical discussion of the connection between pho-
  Singh, Phys. Rev. D 7, 359 (1973); M. Bauer and P. A. Mel-           tons and light waves, see E. Gordin, Waves and Photons (Wi-
  lo, Ann. Phys. (N.Y.) 11,38 ([978); M. Bauer, ibid. 150, I           ley, New Yack, 1982).
  (1975), For some review papers on the time-energy uncertain-       2RFor the role of the Klein-Gordon equation in the development
  ty relation, see articles by J. Rayski and J. M. Rayski, Jr., E.     of Schrooinger's form of quantum mechanics, see P. A. M.
  Recami, and E. W. R. Papp, in The Uncertainty Principles             Dirac, Development of Quantum Theory (Gordon and Breach,
  and Quantum Mechanics, edited by W. C. Price and S. S.               New York, 1972).
  Chissick (Wiley, New York, 1977); C. H. Blanchard, Am. J.
Chapter IX
Lorentz Transformations
Let us start with a massive particle at rest with its spin along the x direction. If we
boost this particle along the x axis, it will gain a momentum along the same
direction. If we boost this moving particle along the y direction, the direction and
the magnitude of the momentum will be changed. The resulting transformation will
be a Lorentz boost preceded by a rotation. This rotation does not change the
momentum of the particle at rest, but will change the direction of the spin. This is
called the Wigner rotation and, as was carefully analyzed by Han, Kim and Son in
1987, manifests itself as the Thomas precession in atomic physics.
The Lorentz group is useful also in studying charged particles in electromagnetic
fields. In 1959, Bargmann, Michel, and Telegdi studied the precession of the spin of
a charged particle in a homogeneous magnetic field. In 1976, Kuperzstych studied a
charged electron in a plane-wave electromagnetic field. He discussed the possible
origin of the Lorentz force in terms of the little group for photons.
The light-cone coordinate system is very useful in many physical applications. In
this system, Lorentz boosts are scale transformations in the light-cone variables. In
InO, Parker and Schmieg studied the fundamental hypotheses of special relativity
in terms of the light-cone variables.
The group of Lorentz transformations, while being the basic language for special
relativity, is becoming an indispensable theoretical tool many other branches of
physics. The (2 + I)-dimensional Lorentz group is locally isomorphic to the group
of homogeneous linear canonical transformations in phase space. As was discussed
by Han, Kim, and Noz in 1988, the group of canonical transformations is very
useful in studying coherent and squeezed states in terms of the Wigner distribution
function. In their 1986 paper Yurke, McCall and Klauder (1986) used the (2 + 1)-
dimensional Lorentz group very effectively in their discussion of a new
interferometer. Figure 4 illustrates the point that the Lorentz group is useful both in
special relativity and modern optics.
                                          441
442                                                                                           CHAPTER IX
Fo COSwt
mg
                 Special                                          Modern
                 Relativity                                       Optics
      FIG. 4. Analogy of analogies. The analogy between the forced hannonic oscillator
      and the driven LCR. circuit is well known. Since the Lorentz group is rapidly
      becoming one of the standard languages in optical sciences, there will be many
      instances in which one fonnuIa in the Lorentz group will describe one physics in
      optics and another physics in special relativity. This figure is from D. Han and Y.S.
      Kim, Univ. of Maryland P.P. #88-92 (to be published in Phys. Rev. A in 1988).
LORENTZ TRANSFORMATIONS                                                                        443
                                    V. Bargmann
                     Princeton University, Princeton, New Jersey
                                    Louis Michel
                          Ecole Poly technique, Paris, France
                                          and
                                     V.L. Telegdi
                       University of Chicago, Chicago, Illinois
                              (Received April 27, 1959)
                                                                                          (2)
We further assume (b) that st obeys in (R) the customary equation of motion
                           Uld'C     = (ge 12m )(stxii),               (R)                (3)
where it, e, and m have their standard meanings, while the gyromagnetic ratio g is
defined by this very equation. While sO vanishes by hypothesis in any instantaneous
rest-frame, ds Old 'C need not. In fact, (2) implies
                              ds°ld'C   =st·(tNld'C),               (R)                   (4)
as can be checked by reducing to the rest-frame. With (5), one has for
homogeneous fields
                  dsld'C   = (elm)[(gl2)F·s + (gl2 -                1) (s·p·u)u].         (7)
(5) and (7) constitute, for any value of g and arbitrary spin S, a consistent set of
equations of motion; they imply that s·s and s·u are constant, so that condition (2)
is maintained? For experiments of current interest, the main use of (7) is in the
computation of the rate n at which longitudinal polarization is transformed into a
transverse one (and vice versa). For this, we express s in the laboratory frame (L) in
terms of two unit polarization four-vectors, el and et:
                                sIS = e,coscj) + e,sincj),
where
                                            S   = (-s·s)~,
                         el='Y(v,V/v)='Y(v, v), e,=(O,Ii),
change, but the transverse polarization precesses around V in longitudinal fields with
an angular frequency ro = (ge 12m y)H = (g 12)roL' as follows readily from (8).
(C)9  E''; = E, iI = 0; Q = rop [-g 12y+ (g 12 - l)y], where rop=eE Imyv is the
angular frequency of the particle's motion in the laboratory.
expressed in tenns of the skew tensor M of Frenkel (which satisfies M·1l = 0), and vice versa:
a =M- ·Il,M- =aXll, i.e.,M- iIc =a i Ilk - akll i • For the quantum-mechanical applications of a see, e.g.,
C. Bouchiat and L. Michel, Phys. Rev. 106, 170 (1955).
                60ur notation is: c - I, It - 1 throughout; coordinate four-vector of components
xO=1 ,Xl, xl, x 3: x = (%0,1'>, ~ = {xa.} (11 = 1,2,3); metric of signature (+ --); t - proper time; a dot
between symbols, contraction of neighboring indices with the metric tensor, e.g., x·x = (x~l -    rz;skew
tensor of components Til; indicated as T = ct', f'), 1" = {TOm}, f· = {Tftr}, 11, p, Y= I, 2, 3; its
dual by T- = ct" - 1,).
                7Equations (5) and (I) can be integrated explicitly by reference to four orthonormal four-
vectors Il (i) such that each of them obeys (5), and 1l(0l:u.
                8Crane, Pidd, and Louisell, Bull. Am. Phys. Soc. Ser. 11,3,369 (1958).
                9H. Frauenfelder III al., Phys. Rev. 106, 386 (1957).
                lOp. E. Cavanagh III al., Phil. Mag. 2, 1105 (1957).
J. KUPERSZTYCH
Introduction.
   It is well known that the notion of spin is a notion which has no classical
equivalent. Historically, the spin appeared as a supplementary degree of freedom
(an intrinsic angular moment) necessary to explain the Zeeman effect. One of
the most remarkable results of Dirac's electron theory is, perhaps, the theoretical
derivation of the correct value of the electron magnetic moment, initially
postulated by KRONIG, UHLENBECK and GOUDSMIT from experimental data (1).
   The fact that the existence of spin can be revealed by seeking a manifestly
Lorentz-invariant theory seems to show that the spin is an essentially relativistic
notion. This idea is reinforced by the fact that electron spin occurs when
2 J. KUPERSZTYCH
(2)   V.   BARGMANN,   L.   MICHEL   and V. L.   TELEGDI:   Phys. Rev. Lett., 2, 435 (1959).
LOREN1Z TRANSFORMATIONS                                                                    449
(1) I E'(r:')
                                    H'(r:')
                                              = E'(r:) = E(r:) ,
                                              = H'(r:) = H(r:) ,
where E, Hand E', H' are respectively the electric and magnetic fields of the
plane wave, derived from the four-potentials             (~)   and   (~:) in the frame Land
in the transformed frame L'.
    If we use the definition of fields from potentials, the system of eqs. (1)
involves the following equation:
(2)
(3)   c.   MOLLER:   The Theory of Relativity, Second Edition, Subsect. 2.4 (London, 1972).
450                                                                                   CHAPTER IX
4. J. KUPERSZTYCH
where
      Because of the axial symmetry around the x-axis, we can choose                        ~   in the
(x, y)-plane without prejudicing the generality of the calculation. Let a: be the
angle between n and ~.
     A priori, the operator &I is a function of three independent parameters
(for instance the three Euler angles). However, in order to shorten the cal-
culation, the following must be kept in mind: since the Lorentz transformation
..It in question would be equivalent to a gauge transformation, the direction n
of propagation of the plane wave must not, of course, be modified. Now, owing
to the ~-transformation, the a:-angle will change in the (n, ~)-plane. Therefore
to compensate for this change, we can immediately conclude that it is neces-
sary for the desired rotation to be in the (n, ~)-plane, that is, around the z-axis.
Calling "P the angle of rotation, we can then write the operator ..It in the fol-
lowing form:
                               ..It = &I("P) ~(P, a:),
that is
o 0
0 0 0 1
   The Lorentz transformation ..It which will leave the fields of the plane
wave unaltered is obviously gauge invariant. We can therefore require the
potentials cp, A, cp', A', and the gauge function A(t, r) to be functions of 7: only.
Then, these requirements involves the following relations:
                                              aA = OA = 0           cp =
                                              ay      oz       '           A~,
LORENTZ TRANSFORMATIONS                                                                    451
which, inserted in the basic equation (2) with .A given by (4), now give the
system of two equations
[sin 1p + (I' -1) cos oe sin (oe + 1p) - yfJ sin (oe + 1p)] rp +
+ [- sin 1p + (I' -1) sin oe cos (oe + 1p) + yfJ sin oe] All = 0 .
    Since rp and All are linearly independent, all their coefficients must be equal
to zero. We finally obtain the solution to our problem in the form of the two
following relations:
                                                                                     1
(5)          oe = a.rc cos I' fJ1 = arc cos JL                or     1 - fJ cos oe = - ,
                             I'             1'+1                                     I'
(6) 1p=n-2oe.
0 0 0
and is dependent on the arbitrary parameter 1';;;.1. We can now obtain the
gauge function A(T) such that eq. (2) has a solution. It is
                                           I
                                           T
    Therefore, when the potentials are functions of the retarded time T like the
fields of the plane wave, the Lorentz transformation .A(y) given by (7) is equi-
valent to a gauge transformation whose gauge function is given by (8).
    Thus, we have shown that it is possible to find a Lorentz transformation
which will leave both the electric and magnetic fields of a plane wave unaltered.
452                                                                                 CHAPTER IX
J. KUPERSZTYCU
where e(1') represents the energy of the particle in the field, P(1') is its momentum,
v is its velocity, m is its rest mass and - e (e> 0) is its charge. j is the unit
vector of the y-axis, 11 = (e/m)(- AJ' A,i is a dimensionless parameter.
    If we use the same device as that employed for the definition of proper time
of a moving particle (6), the motion of a charged particle in the field of a plane
wave may be considered as uniform at each moment of time.
    Thus, at each moment of time we can use a Lorentz transformation in order
to pass from the frame L to the particle instantaneous rest frame R. It will
now be shown that the frame R can be deduced from the frame L using pre-
cisely a Lorentz transformation of type .A(y) which is equivalent to a gauge
transformation.
    In order to do this, the parameter y of the operator .A(y) given by (7), has
now to be determined. We have consequently to solve the following equation:
                                          e(1'))
(10)                              .A(y) ( P(1')    = (m)
                                                      0 '
                                                      ')12
                                     1+~2             2
                                                              -')I   0
                                      ')I"         ')12
                                                 1--          -')I   0
                                      2             2
(12)                .#(')I(T))   =
                                     -')I          ')I         1     0
0 0 0 1
    In other words, the equations of motion (9) of a charged particle in the field
of a plane wave can be written in the simple following form:
(13)
    It is clear that the operator 2(')1) is sufficient to bring about the passage
from the laboratory L to a frame where the particle is at rest at each moment
of time.
    We remember that 2 is a Lorentz transformation (without change in direc-
tion of the space co-ordinate axes) characterized by the following relation:
(5')
                                 I-v(T)·n    =   (1      + ')12)-1
                                                            2 '
which is exactly the relation (5). Moreover, it follows from this above relation
that T is also the proper time of the particle.
454                                                                                             CHAPTER IX
8 J. KUPERSZTYCH
                           1           0                     0      0
                                   1-'1'2/4            v
                           0                                        0
                                   1 + '1'2/4       1+'1'2/4
(14)       fJ?(V(i))   =
                                       v            1-'1'2/4
                           0                                        0
                                   1 + '1'2/4       1+ '1'2/4
                           0           0                     0      1
                                                    '1'2
                           1+~                                              -v              0
                                   2                  2
                                '1'2                  '1'4                   '1'3
                                                                                            0
(15)       2'(V(T))    =        2          1 + 8(1 + '1'2/4)            4(1 + '1'2/4)
                                                  '1'3                              '1'2
                               -v                                                           0
                                             4(1 + '1'2/4)          1 + 2(1 + '1'2[4)
                               0                  0                          0              1
                                                 ,13(T))
                                       2'(V(T)\P(i)
                                                                  (m)
                                                                 = 0 .
     Thus, the particle is also at rest in the frame K which is transformed from
the frame L by the Lorentz transformation 2'{v).
     We arrive now at the crucial point of the paper. The operator fJ?{v) given
by (14), which stems from the calculation, has been derived without any as-
sumption further than the theory of classical electrodynamics.
     In order to find its physical significance, we shall now consider a dynamic
quantity for the particle other than its four-momentum, namely its intrinsic
angular moment, that is to say, its spin (if it exists). In what follows it will
be shown that the motion of the spin of a charged particle, the gyromagnetic
ratio of which is g = 2 as for a Dirac particle, is shown by just the operator
fJ?{v) in question.
LORENTZ TRANSFORMATIONS                                                                          455
   Let us consider the particle in its instantaneous rest frame K. We will now
look for the solution to the problem of the behaviour of the spin of a charged
particle which is executing a given classical motion in the field of a plane wave.
   This solution will be given by the solution of the following equation (6),
which is derived from the BMT equation:
where ~ is the spin vector of the particle in its instantaneous rest frame K, and
where f-l' = f-l + e/2m is the anomalous part of the magnetic moment f-l of the
particle.
   If we take into account eqs. (9) and after a straightforward calculation,
the eq. (16) can be written in the form of the following equations:
                                        dCx       dv
(17)                                    dr = e(v) dr CII ,
                                        dCy                dv
(18)                                    dr =     -   e(v) dr Cx   ,
(19)                                    dC. _ 0
                                        dr -    ,
where
                                                V2)-1 m
                                 e{v)   = ( 1 + 4" -2 ef-l'           .
                                        dX         .  dv
                                        -    =   -te(v)- X.
                                        dr                 dr
10 J. KUPERSZTYCH
(20)
 where
                                                v         m
 (21)                       "P'(r) = 2 arctg--2 _,u' v,
                                           2    e
and where Co,., Co", Co. were the components of the vector t( r) when the potentials
were put equal to zero, that is before the field was switched on.
   Obviously eq. (19) gives
 (22)
                                          and        •
                                                    sm"P r
                                                              '()
                                                                    = 1 +V'1'2/4 •
The motion of spin of a classical Dirac particle in the field of a plane wave is
then given by
                            t(r)   = 8i(- v) to =   8i- 1 (v) to ,
or
                                     to = 8i(v)t(r),
4. - Conclusion.
    We have therefore shown that for the problem of a charged particle clas-
sically interacting with the field of a plane wave, it is possible to derive an
operator which cannot be physically understood without introducing the notion
of electron spin. The operator in question &?(v) represents exactly the motion
of the spin ?f a particle the magnetic moment of which is precisely that of a
Dirac particle. As was shown, another value of the magnetic moment would
provide another operator than the one which was previously derived.
    This result seems to indicate that there is a link between relativistic in-
variance, gauge invariance and electron spin on the classical level.
    However, we must not be too optimistic: the results presented in this paper
do not imply that the spin « must exist ». The Lorentz force law is a priori
valid for any charged particle (neglecting radiation reaction) and can therefore
be used in the problem of a charged pion which is classically interacting with the
field of a plane wave. But since this particle has no spin, a physical interpre-
tation of the operator ~(v) does not exist in the case of the pion.
    For such a particle, from an aesthetic point of view we can only come to
an irritating conclusion .
• RIASSUNTO (.)
Senza alcuna ipotesi oltre la teo ria dell'elettrodinamica classica applicata al problema
di una particella carica interagente con il campo di un'onda elettromagnetica piana,
si deduce un operatore che non ha altro significato che il moto dello spin di una par-
ticella di Dirac, dato dalla soluzione dell'equazione di Bargmann-Michel-Telegdi. Si
dimostra che il moto di un elettrone ed il motodel suo spin in un'onda piana sono dati
da un operatore del tipo di Lorentz che ha la lodevole propriet1t di essere equivalente
a trasformazioni di gauge.
('J   Tmduzione a cum della Redazione.
              Reprinted from   AMERICAN JOURNAL OF PHYSICS,        Vol. 38, No, 2, 218-222, February 1970
                                                   Printed in U. S. A.
            We discuss the form of the special Lorentz transformation, and the corresponding transforma-
            tion of the electromagnetic field, in which the transformation matrix is diagonal. We derive
            the diagonal form of the special Lorentz transformation directly, in a simple way, and show
            that it is sometimes more convenient to apply than the algebraically equivalent conventional
            form of the transformation. The convenience i, especially evident in deriving the linear
            Doppler effect, and the relativistic addition of more than two parallel velocities. By writing
            Maxwell's equations in terms of linear combinations of coordinates which have simple
            transformation properties, we arrive at the transformation eqnations of the Maxwell fields
            in a diagonal form, as well as at the plane wave solutions, in a natural manner. The derivations
            and applications described above should be of use in a comse on relativity because of their
            simplicity and directness.
One then readily finds, with the aid of Eqs.              One can easily show that Eqs. (17) are
 (17b) and (17d), that Eq. (Vi) is form invariant      equivalent to the more conventional form of the
if                                                     transformation equations by using Eqs. (G).
                                                       However, as with Eqs. (;"i), the transformation
                     Bx'=Bx.                 (l7f)
                                                       Eqs. (17) are often useful in their original form.
Equations (17a)-(17f) are analogous to Eqs. Ui)        For example, multiplication of Eq. (l7b) by
in that they only involve multiplication by the        Eq. (17c) , and Eq. (l7d) by Eq. (17e), followed
factor A"                                              by addition yields the invariance of E'-B'.
   The above considerations arrive at Eqs. (17)        Similarly, multiplication of Eq. (17c) by Eq.
in a natural and simple marmer, and are therefore      (17d), and Eq. (17b) by Eq. (17e) yields, upon
suitable for use in an introductory course. In         subtraction, the invari~1nce of E· B. The invariance
fact, analogous derivations of the more conven-        of these quantities can immediately be generalized
tional form of the transformation equations are        to arbitrary Lorentz transformations because the
given in the texts cited in Ref. 11. However, it       quantities are clearly invariant under three-dimen-
should be noted that the considerations given          sional rotations.
show only that form-in variance is satisfied if the
                                                                     B. Plane Wave Solutions
correct transformation equations of the Maxwell
fields hold. Those considerations alone are not           It is worth noting how simply one can extract
sufficient to arrive uniquely at the transformation    the general plane wave solution from Maxwell's
equations, since six field quantities arc involved,    equations written in terms of i; and~. We seek solu-
while less than six equations are used. Further-       tions of Maxwell's equations which are inde-
more, since the equations used were all homo-          pendent of y and z. It is then evident in the usual
geneous, the transformation equations arrived at       way that the x component of Eq. (12a) together
could obviously all be mUltiplied by a common          with Eq. (12b) implies thatBx is a constant, which
factor dependent on v. In the paper cited in Ref.      we set equal to zero. Similarly one finds that Ex
11, Einstein avoids the above difficulties by          is a constant, which we put equal to zero. It is easy
considering six of the Maxwell equations in the        to show that the remaining four components of
absence of charges, thereby deriving uniquely the      Maxwell's equations are equivalent to
transformation of the fields, to within a common
                                                       (a/a~)   (E,-B y) =0       (a/a~)   (E,+By) =0
factor", (v). He then shows by simple considera-
tions that "'(v) = 1.                                  (a/ao (Ey+B,) =0           (a/a.,,) (Ey-B,) =0.   (18)
   We can overcome the uniqueness difficulty in
                                                        The solutions, to within additive constants,
our derivation by noting that Eqs. (12a) and           which we put equal to zero, are
 (12b) become the remaining Maxwell equations
in the absence of charges, if one makes the sub-                E,-By=2jl(~)         E,+By=2g 1 W
stitutions E-+B and B-+-E. That substitution in
Eqs. (14) and (15) then gives three new equa-                   E y+B,=2j,(.,,)      Ey-B,= 2g2(~).      (19)
tions, which are form-invariant if Eqs. (17) with      Hence
E->B and B-+-E hold. However, the full set of
Eqs. (17) are simply interchanged among them-            E y=j2(.,,)+02W            E,=jl(~)+glW
selves by that substitution. Hence, if Eqs. (17)                                                         (20)
                                                         Bu= -!t(.,,)+g,W           B,=j2(~) -g2(~).
hold, then the six independent equations given by
Eqs. (14), (Li), and the equations obtained            Equations (20) involve four arbitrary differen-
from them under the substitution E-+B and              tiable functions, and represent the most general
B-+-E are form-invariant. Consequently, form           plane wave solution of Maxwell's equations (the
invariance of Maxwell's equations in the absence       x axis has been chosen to be along the direction of
of charges uniquely determines Eqs. (17) to            the wave motion, without significant loss of
within a common factor", (v). We refer the reader      generality). The solution clearly consists of a
to Einstein's paper cited in Ref. 11, for the          superposition of a wave traveling at velocity c in
proof that y;( v) = 1.                                 the positive x direction (the." dependence), with a
462                                                                                                        CHAP1ERIX
wave traveling at velocity c in the negative x                connection with the linear Doppler effect [see the deriva-
direction (the ~ dependence). By comparing                    tion of Eq. (7)]. Bondi emphasizes the reflection of radar
                                                              signals in his applications, whereas Ollr viewpoint is
Eqs. (19) with Eqs. (17) and Eqs. (5), it is                  generally quite different. Also, our considerations with
evident that the plane wave solutions in one                  regard to Maxwell's equations are unrelat~d to the k-cal-
inertial frame transform into similar plane wave              culus. For the k-calculus, see H. Bondi, Relativity and
solutions under special Lorentz transformations.              Comman Sense (Doubleday & Co., Inc., Garden City,
                                                              N. Y., 1964).
                 IV. CONCLUSIONS                                 • The use of null coordinates in treating radiation is well
                                                              known. See, for example, R. Penrose in Relativity, Groups
   We feel that the diagonal form of the special              and Topowgy, C. and B. DeWitt, Eds. (Gordon and
                                                              Breach Science Publ., Inc., New York, 1964), p. 565.
Lorentz transformation, and the corresponding                    • See, for example Ref. 1, or A. P. French, Speciat
transformation of the electromagnetic field, can              Relativity (W. W. Norton & Company, Inc., New York,
be advantageously employed when covering                      1968); R. Resnick, Introduction to Special Relativity
certain material in a relativity course. A derivation          (John Wiley & Sons, Inc., New York, 1968); W. G. V.
which arrives directly at Eqs. (5) is evidently               Rosser, An Introduction to the Theory of Relativity (Butter-
                                                              wortl", Scientific Publications Ltd., London, 1964), as well
somewhat simpler than the more conventional                   as many others.
derivations of the Lorentz transformation. Equa-                 • Or because A (v) must clearly reduce to unity as v
tions (5) can be applied directly to the Doppler              approaches zero.
effect, aberration, and the addition of any number               • Invariance of the phase follows from a consideration of
of parallel velocities. 14                                    the counting of wave crests. See for example, W. G. V.
                                                              Rosser, Ref. 4, p. 154, or C. Mpller, Theory of Relativity
   By expressing the vacuum form of Maxwell's                  (Oxford University Press, London, 1952), p. 7.
equations in terms of the variables x+ct and x- ct               , The derivation of Eq. (8) was suggested by a problem
we arrived at the transformation Eqs. (17).                   in Ref. 1, p. 54.
Equations (17) are clearly analogous to Eqs. (5)                 • The fact that Eqs. (9) have the form of a special
in that they involve only multiplication by factors           Lorentz transformation illustrates the group property of
                                                              the special Lorentz transformations.
of A. We also pointed out that the plane wave
                                                                 • The general form of Eq. (10) has been given in a
solutions follow naturally from thc above form of             group theoretic context by P. Malvaux, Compt. Rend.
Maxwell's equations.                                          236,1009 (1952).
   In group theoretic language, the variables                    10 D. Bohm, Special Relativity (W. A. Benjamin, Inc.,
x-ct, x+ct, y, and z form the basis of the diagonal           New York, 1965), p. 68; N. D. Mermin, Space and Time in
                                                              Special Relativity (McGraw-Hill Book Co., New York,
representation of the special Lorentz transforma-
                                                              19G8), p. 132.
tion, which is given by Eqs. (5). Similarly, the                 11 A. Einstein, Ann. Physik 17, 891 (1905); translation in
linear combinations of the fields appearing in                Principle of Relativity (Dover Publications, Inc., New
Eqs. (17) form the components of an antisym-                  York, 1923), p. 37; R. Resllick, Ref. 4, pp. 178-181;
metric tensor which transforms like the direct                W. G. V. Rosser, Introductory Relativity (Plenum Press,
                                                              Inc., New York, 1967), pp. 224-227.
product of two basis vectors of the diagonal
                                                                 "We use Heaviside-Lorentz units. To change the
representation of the special Lorentz transforma-             formulas in this paper into rationalized MKS units,
tion. To go more deeply into such matters would               simply replace B by cB. (That procedure works for the
take us beyond the scope of this article.                     formulas in this paper, but not in general.)
                                                                 13 We are not aware of any reference where it is pointed
            Reprinted from   AMERICAN JOURNAL OF PHYSICS,        Vol. 38, No. 11, 129S-1302, November 1970
                                                    Printed in U. S. A.
            We give a diagrammatic representation of the diagonal form of the special Lorentz trans-
            formation. The null coordinates z:!::ct are plotted along a single set of orthogonal axes. Special
            Lorentz transformations are then represented only by a change of scale along those orthogonal
            axes. This diagram, which we call a null coordinate diagram, and the Minkowski diagram are
            closely connected. To demonstrate the use of the null coordinate diagram, we apply it to the
            linear Doppler effect, time dilation, and Lorentz contraction.
II. APPLICATIONS
Thus, making use of Eqs. (1) and (6), we have                which is the familiar form of the time dilation
                                                             equation. The ease with which Eqs. (9) and (10)
         .:l1/AO' =A-l (v) .:l'1AO =A-I(V).:l'1BO.    (7)
                                                             were obtained from Fig. 4 illustrates the utility
This gives the linear Doppler effect:                        of null coordinate diagrams in problems involving
                      r'=A-'(v)r                             light pulses.
or
                                                                            C. Length Contraction
                                                      (8)
                                                                Suppose that a rod of proper length I is at rest
where v and v' are the frequencies relative to S
                                                             along the x axis of S, with its center at x=O. If a
and S', respectively.                                        light flash is emitted at the center of the rod, it
                     B. Time Dilation
   To illustrate further the usefulness of the null
coordinate diagram, consider a clock7 consisting
of two mirrors at the ends of a rod, with a light
pulse bouncing back and forth between the                                        :;""'_--1 8
mirrors. The period of the clock is the round trip
time of the light pulse. Figure 4 shows the world
line of the light pulse in one such round trip. The                     _ _ _ _ _ _ _+A"-_ _ '1''1'
rod is at rest in the inertial system S. The light
starts from the mirror at space-time point A, is
reflected from the other mirror at space-time
point B, and returns to the original mirror at               FIG. 4. Round trip of light pulse bouncing between two
space-time point C.                                          mirrors.
466                                                                                                       CHAPTER IX
FIG. 5. World lines L. and L, of ends of rod. World lines   Then Eq. (20) yields the length contraction
AB and Be of flash emitted from center of rod. World                         l=~[A-'(V) +A (v) Jl'.
lines AD and DE of flash emitted from position such that
events A and E are simultaneous in S'.                      The time dilation can be derived analogously.
LORENTZ TRANSFORMATIONS                                                                                               467
  The above examples should be sufficient to                vertically and horizontally, or rotated through a 45·
familiarize the reader with the use of the null             angle, is purely a matter of taste.
                                                              o The world lines of light pulses are always parallel to the
coordinate form of the Minkowski diagram.                   t or ~ axis.
                                                               o We use the convention that A~PQ denotes the positive
  1 L. Parker and G. M. Schmieg, Amer. J. Phys. 38, 218     change in ~ between events P and Q. Similarly, in our
(1970).                                                     notation A followed by any variable always denotes the
  • Although diagrams involving null coordinates are not    positive increment in that variable. An alternate procedure
new, we have not previously seen null coordinate diagrams   would be to use coordinates ct±x in Eq. (2). If this were
used to represent special Lorentz transformations, nor      done, many examples could be easily discussed in the first
have we seen them applied to elementary problems in         quadrant of the t, ~ plane.
special relativity.                                            7 Often called a "Feynman clock."
  • The Minkowski, Brehme, and Loedel diagrams are            • Since the rod is moving relative to S', the source at the
described in A. Shadowitz, Special Relativity (Saunders,    time of emission must be closer to the leading end of the rod.
Philadelphia, Pa., 1968).                                      o For convenience, in Fig. 5 both flashes are drawn so
  • The question of whether to draw the t and ~ axes        that they reach the end of the rod at x =1/2 simultaneously.
468                                                                                                                        CHAPTER IX
                                                                 where I in) is the state vector for the light before it has
                                                         (2.5)
                                                                 interacted with the beam splitter. Throughout this paper
                                                                 we will hop back and forth between the Heisenberg pic-
in fact, N itself commutes with all the operators of (2.2),      ture where J is rotated while the state vector remains
   Why one should want to characterize a lossless passive        fixed and the Schriidinger picture where J remains fixed
device with two input ports and two output ports with the        depending on which picture is most convenient for the
operators (2.2) and (2.3) will now be explained. Let a I in      discussion at hand.
and a2in denote the annihilation operators for the light            Another realizable scattering matrix for a beam splitter
entering the two input ports and similarly let a lout and        is
a20ut denote the annihilation operators for two light
beams leaving the two output ports. The scattering ma-                       cos fi        -sin fi
trix for the device will have the form                                             2              2
                                                                       U=                                                     (2.13)
                                                                             sin fi         cos fi
                                                         (2.6)                     2              2
                                                                 At radio frequencies devices with the scattering matrix
Since the creation and annihilation operators for the two        Eq. (2.8) and Eq. (2.13) would be distinguished, respec-
input beams and the two output beams must satisfy (2. I)         tively, as 90' and 180' couplers. The scattering matrix Eq.
the matrix                                                       (2.13) transforms J according to
                                                         (2.7)                                    o    sin/31 Jx
                                                                                                  I      0    Jy              (2.14)
                                                                       1
   Consider a beam splitter with the scattering matrix
                   a
                 cos-        -lsm~
                                  .. a                                 [
                                                                        Jx J =e,f3J,         1
                                                                                 . [Jx Je -,f3J,
                                                                                             . .                              (2.15)
                    2                                                    Z   out              Z
       U=       .. a                 a
                                                         (2.8)
              -lsmT               COST                           Hence in the Schriidinger picture where J remains fixed
                                                                 the state vector for the light after interacting with the
                                                                 beam splitter is
This transformation will transform J according to
                                                                       lout)=e- if3J, lin).                                   (2.16)
        Jx
        Jy         =
                       [I0 cosa
                            0        0
                                   -sma
                                             J Jx
                                               Jy        (2.9)
                                                                    How J transforms under a phase shift or change in op-
                                                                 tical path length is now determined. Let light beams I
        Jz   out       0 sina      cosa        Iz                and 2 incur a phase shift 1'1 and 1'2. respectively. The uni-
                                                                 tary matrix associated with this process is
That is, the abstract angular momentum vectors are rotat-
ed about the x axis by an angle a. This transformation                       e'O'Y 1
can be expressed in the form                                           U= [                                                   (2.17)
       [~Z
             J =/oJ,
             out
                           [~ Ie
                             Z
                                   -ial, ,              (2.10)   Under this transformation J transforms as
                                                                  Jx           COS(1'2-1'I) -sin(1'2-1'I) 0              Jx
                                                                  Jy           sin(l'2-1'I)           COS(1'2-1't)   0   Jy   (2.18)
where the angular momentum operators on the right-hand
side are evaluated for the input beams a I in and a 2in' The      Jz out               0                   0             Jz
equivalence of (2.9) and (2.10) can be checked using the
operator identity                                                This represents a rotation about the z axis by the angle
470                                                                                                                          CHAPTER IX
r2-rl corresponding to the relative phase shift between               depicted in Fig. 1. It consists of two 50-50 beam splitters
the two light beams. This transformation can be ex-                   Sl and S2. The relative phase shift <P=<P2 - <Pl is mea-
pressed as                                                            sured by observing the interference fringes in the light
                                                                      leaving S2. Here, as depicted in Fig. I, the case will be
       Jx                       Jx                                    considered where the photodetector is placed in each of
                =/(Y2- YI IJ,          -I(Yz-rI IJ,
       J,                       J, e                  .     (2.19)    the two output beams 0 t OUI and 02 oul' By counting the
                                                                      number of photoelectrons generated by each detector, D t
       J, oul                   J,                                    and    9  2, separately, onf measures the operators
                                                                      N1=010,,010uI and N2=02 0uI02 0uI' From Eqs. (2.2) and
Hence in the Schriidinger picture this represents a
                                                                      (2.3) one sees that this is equivalent to measuring both
transformation of the incoming state vector \ in) accord-
                                                                      N out and Jzouto
ing to
                                                                         A geometrical picture of the operation of the inter-
      \ out) =e - i lr,-r liJ, \ in) .                      (2.20)    ferometer will now be developed. For definiteness the
                                                                      beam splitters S 1 and S2 will be chosen to have scattering
It is worth noting that under the full transformation Eq.             matrices of the form (2 .8). For a 50-50 beam
(2.17)     the    incoming      state    transforms       as          splitter a must take on the value rr/2 or - rr/2. For the
 \ out)=edYI+y,IN/2e -dr,-rlIJ'\in) but since N com-                  beam splitter SI we take a= + rr/2, for the beam splitter
mutes with J the operator e' IYt+r,IN/l gives rise to phase           S2 we take a= -rr/2. Let \ in) denote the state vector
factors which do not contribute to the expectation values             for the light in the two light beams entering the inter-
or moments of number-conserving operators such as J                   ferometer. From Eq. (2.12) the state \ 1ft) of the light
and N. In fact, it is the insensitivity of photodetectors
(photon counters) to the extra phase ei lYI+r,INIl that
allows one to fully characterize an interferometer by the                                                              ,
                                                                                                                ........
SU(2) transformations described above. It has now been
                                                                                                                ~~ . _. {
shown that the transformations the beam splitters and
phase shifters perform on the two incoming light beams                                                           \ /
can be visualized as rotations of the vector J. Further,
                                                                                                                  \ .'
                                                                                                                   "
since the operator (0101 or 0 lOll characterizing the num-
ber of photons counted by a photodetector placed in one
of the light beams can be expressed in terms of the opera-
tor Nand J" interferometry can be visualized as the pro-
cess of measuring rotations of J. The operators giving
rise to the mode transformations of Eqs. (2.12), (2.16), and
(2.20) have also been recently discussed by Schumaker in
Ref. 6 where they are referred to as two-mode mixing
operators.
                                                                                       ICI
                                                                                                                       I.'
                                                                        FIG. 2. A rotation-group picture of the performance of a
                                                                      Mach-Zehnder interferometer. When light enters only one input
                                                                      port of the interferometer the input state has the form
                                                                      ij, m ) = Ij,j) in a fictitious (Jx.J"J,) space and can be
                                                                      represented by a cone centered along the z axis with height j (a).
                                                                      The first beam splitter performs a -1T/2 rotation about the x
                                                                      axis. The cone now lies along the y axis Ibl. The phase shifts
                                                                      accumulated by the two light beams in the interferometer corre-
                                                                      spond to a rotation -,p about the z axis Ic). The second beam
                                                                      splitter performs a 1T 12 rotation about the x axis (dl. Since J, is
                                                                      proportional to the difference in the number of photons counted
                                                                      by the two pholodetectors in the interferometer output beam the
  FIG. 1. A Mach-Zehnder interferometer. Light entering one           interferometer can resolve states whose overalJ rotation is suffi-
of the two input ports O lin or alin is split into two beams by       ciently far from the z axis so that on average the Jz measured
beam splitter S1. The two light beams bl and b, accumulate a          will differ from j by one. In order for this to be the case the
phase shift ,pI and ,p" respectively, before entering beam splitter   cone must be rotated by approximately the width of its base
S2. The photons leaving the interferometer are counted by             which is   VJ.  Hence the minimum detectable <P is of order
detectors D I and D2.                                                 IN}.
 LORENTZ TRANSFORMATIONS                                                                                                            471
upon leaving the S I is                                                   As shown in Fig. 2(d), the net result of this sequence of
                                                                          rotations is a rotation of the initial state vector about the
       11/1) =e -ii_/2IJ, I in)                                  (3.\)
                                                                          y axis by an amount "'.
which amounts to a rotation of the state vector about the                    As pointed out earlier, by placing photodetectors in
x axis by an amount -7T/2. This is depicted in Figs. 2(a)                 both of the output beams one can measure both N (the to-
and 2(b) where for definiteness I in) was chosen to be the                tal number of photons passing through the interferometer)
state I j,m = j), that is, J lies on the circle surrounding               and lz (the difference in the number of photons arriving
the base Llf the cone in Fig. 2(a). With a -7T /2 rotation                at each detector divided by 2), Because N commutes with
about the x axis this cone now lies along the y axis.                     J it by itself gives one no useful information about "'. It
   Upon reaching the input ports of S2 one light beam CI                  does, however, give one useful information about I in), in
has undergone a phase shift of "'I while the other C2 has                 particular the total number of photoelectrons n counted
undergone a phase shift "'2' Thus, from Eq. 12.19), upon                  after the light has passed through the interferometer tells
arriving at S2 the light is in the state I vi):                           one that I in) was in an eigenstate of N:
(3.7)
One can show                                                              From (3.6) and (3.11) one concludes that the incoming
                                                                          light beam was in an eigenstate of Ij ,m )
                                                                                !in)= Ij=n!2,m=n/2).                             (3.12)
                                      = -lsincP)l, +lcoscPiJz.    (3.8)   Hence the incoming light is in the eigenstate that was de-
Hence                                                                     picted in Fig. 2.
                                                                             Intuitively the smallest cP that can be measured is one
       (Jz ) = (out i lz lout)                                            where the cones of Fig. 2(d) do not appreciably overlap.
                                                                          The distance from the apex of one of the cones to a point
             =-sincP(inIJ, iin)+coscP(inIJ,iin)                   13.9)
                                                                          on the circle of the cone's base is the square root of the
and                                                                       eigenvalue of J2 or Vj (j + I ). The distance from the
                                                                          apex of one of the cones to the center of its base is the
       (J,2 ) = (out IJ,2 lout)
                                                                          eigenvalue of Jz or J. Hence the radius of the base of one
             = sin'cP (in IJ;: in)                                        of the cones is [j Ij + 1) - /]1/2 = vJ. The minimum
                                                                          detectable cP is thus of order cPm,"~j-I/2, and since from
                 -sincPcoscP(in iJ']z+J,}, I in)                          Eq. 13.12) j = n /2,
                 +cos'dJ(inIJ; in).                              13.10)        4>mm-:::::=.n   -1/2.                             (3.13)
    To proceed further one needs additional information on                Hence the sensitivity of an interferometer operated in the
 I in). Let us suppose the interferometer is operated in the              mode where light enters only one of the two input ports
usual manner where light enters the interferometer only                   has a sensitivity that goes as the square root of the num-
along one of the input beam paths, say a l' Then from the                 ber of photons passing through the interferometer.
total number of photons n counted by D I and D2 one                          Equation (3.13) is now made more rigorous by a direct
knows that there were n photons in the incoming light                     calculation from Eq. (3.9) and Eq. (3.10). For the state
beam. Hence I in) is an eigenstate of lz:                                 13.12) the mean value of Jz is
                                                                               -  n
       lz   in)=~ !in).                                          (3.11)        lZ=2coscP .                                       (3.14)
472                                                                                                               CHAPTER IX
The mean-square fluctuation (4Jz )2 about this value is              A Fabry-Perot interferometer is depicted in Fig. 3(a). It
                                                                  consists of semitransparent mirrors Ml and M2. This in-
      (4Jz)'=.7}-J;                                               terferometer measures the phase shift rf> suffered by light
                                                                  as it propagates from one mirror to the other. This device
                  =1"sin 2rf> •                          (3.15)
                                                                  has two input ports a I in and a, in' and two output ports
                                                                  al out and a'out. Although alin and a'out and a'in and
The mean-square noise in rf> is thus
                                                                  a lout are collinear they can be separated with optical cir-
      (4rf»
              2
                  =r- =-.
                     (4Jz )'
                      aJz J
                       arf>
                                  1
                                  n
                                                         (3.16)
                                                                  culators as shown in Fig. 3(b). In this manner one can
                                                                  place photodetectors in both beams a lout and a, out
                                                                  without obstructing the light injected into a I in or a, in'
                                                                  Hence one is allowed to measure N and Jz for the two
                                                                  output beams.
Hence the rms fluctuation of rf> due to photon noise goes
                                                                     An analysis of the Fabry-Perot interferometer is now
asn- l12 ,
                                                                  carried out. The mirrors M I and M2 will be taken to
                                                         (3.17)   have scattering matrices of the form Eq. (2.8). In particu-
                                                                  lar for the mirror M 1 we take
in agreement with the intuitive argument based on Fig. 2.
We will refer to (3.17) as the "standard noise limit" for an          bl=COS(+flJalin+isin(+flJb, ,
interferometer.                                                                                                              (4.1)
   Note that no assumption was made about the quantum                 a,out=+isin(+flJalin+COS(+{3)b, ,
statistics of the source of light entering the interferometer.
The total number of photons n entering the interferometer         and for the mirror M2
completely characterizes the ultimate sensitivity that can
be achieved with an interferometer in which light is fed              a lout =cos( +(3)cl -i sin( +(3)a'in ,
                                                                                                                             (4.2)
into only one input port. If instead of using photodetec-
tors in both output ports and measuring Nand Jz one
                                                                      c, = -i sin( +mci +cos( +(3)a'in .
chooses to use only one photodetector or to measure only          In writing (4. I) and (4.2) it has been assumed that both
J" then one is throwing away information. In this case            mirrors have the same transmission coefficient
knowledge about the photon statistics of the source be-
comes important. For this situation the performance of                                                                       (4.3)
the interferometer will generally degrade although for
some particular values of ci>1 -</>, the n 1/2 phase sensitivi-
                                                                                         e
                                                                  The phase shift sustained by the light as it propagates
                                                                  between the two mirrors is given by
ty can still be achieved.
   As will be pointed out in Sec. V, the sensitivity of an in-        Cj =ei(Jb\ ,
terferometer can be dramatically improved if photons are                                                                     (4.4)
allowed to enter both input ports provided the photons are            b 2 = ei9c 2   .
prepared in the right quantum state.
                                                                  Equations (4.1)-(4.4) can be solved to obtain a I out,a'out
                                                                  in terms of a 1 '" and alin' One finds
       IV. THE FABRY-PEROT INTERFEROMETER
                                                                                                        de Imax
The scattering matrix of Eq. (4.51 is unitary. Using Eq.
(2.21 one can determine how J transforms under this uni-                              For mirrors with a small transmission coefficient T, and
tary transformation. One finds                                                        using (4.91
                                                                                                         T
                                                                                                                                             (4.17)
                                              o sin</> 1 Jx                               flO min ---- 4n 1/2   .
                                              I    0     Jy                   (4.7)
                                                                                      Hence, as with the Mach-Zehnder, the sensivity of the in-
                                              o    cos</>       J,
                                                                                      terferometer scales as n -1/2 where n is the total number
where                                                                                 of photons entering the interferometer. As with the
                                                                                      derivation of (3.17), Eq. (4.17) is based on the assumption
    cos</> = 1Jl       i2-     'I   y   'I   2 ,                                      that light enters only one port of the interferometer.
                                                                              (4.8)      In the next section it is shown that the sensitivity of an
    sin</> =Jl'v+ y' Jl .                                                             interferometer can be greatly enhanced if light, prepared
So the Fabry-Perot interferometer, for the mirrors chosen,                            in a suitable quantum state, is allowed to enter both ports
performs a rotation of J about the y axis. Hence, follow-                             of the interferometer. Although the arguments will be ap-
ing the same line of reasoning as in the last section, if                             plied to the Mach-Zehnder, with the tools developed in
light enters the Fabry-Perot in only one input port the ul-                           this section, they can be applied to the Fabry-Perot inter-
timate phase sensitivity fl</> is given by                                            ferometer as well.
                                                                           (,v,)2=+[j(j+1)-1] •
                                                                                                                               (5.7)
                                                                           (,v,)2= t U (}+ I)- t ]·
la) (5.8)
                                                                      and consequently
                                                                           (inIJ,J,+J,J, l in)=tU(}+1ljl l2.                   (5.9)
(5.10)
                Ie)                                                   Substituting Eqs. (5.6) and (5.9) into Eq. 0.10) one has
                                                I"
                                                                           J;= tU(} + 1) - t ]sin2q, + tcos2q, .              15.11)
  FIG. 4. The perfonnance of a Mach-Zehnder interferometer
in which an input state, of length j when depicted in the             The mean-square fluctuation in Jz is then
(J~,J"Jt)   space, is a flattened cone whose width along the z axis
is of order unity. The sequence of rotations perfonned by the              (,vz)2= +U() + I )-I]sin 2q,+ +cos 2q, .           (5.12)
interferometer is the same as that of Fig. 2. In contrast to the
state depicted in Fig. 2, an overall rotation '" - II j can be
                                                                      The mean-square fluctuation in    q, is given by
resolved with the state depicted here.                                          2       (,v,)2
                                                                                                                              (5.13)
                                                                           (~q,)= [~r
                                                              (5.])
                                                                      or
Hence, by choosing the appropriate incoming state I in),
an interferometer's sensitivity can be greatly improved                    I~q,)'= U(j+I) - ljsin2f +cos'f .                  15.14)
over the n -112 sensitivity of Eq. (3.17) or Eq. (4.9).                             lUI} + I)f12cosq,+sinq,I'
  The ~bcve discussion is now made rigorous by explicitly
exhibiting a state with the properties described above.               This quantity has its minimum va1ue when sinq, =0. then
Consider the state                                                         1~.I.)2 _ _I_
                                                                             'I' mon-   }I} +1)
                                                                                                                              15.15)
Hence this state lies close to the x-y plane and has a                Hence when the state Eq. (5.2) is fed into the input ports
mean-square height of order unity,                                    of an interferometer a minimum rms fluctuation ~<Pmin in
      (,v,)2=+ .                                              (5.4)   the phase of order n - I can be achieved:
      (in IJ,in) = t[j(j + 1)]112 ,                                   This maximum sensitivity is however achieved only at
      {inIJ, lin)=O,
                                                              (5.5)
                                                                      particular values of q, satisfying sint,6 =0. For other values
                                                                      of <P the sensitivity of the interferometer is degraded.
and                                                                   Since q,=<PI-<P" q,1 may be tracked as a function of time
                                                                      with the precision Eq. (5.17) by controlling <P2 with a feed-
      (in IJi lin) =tU(j + 1)- t         l.                           back loop which maintains <PI -q,2 at zero. The error sig-
                                                              (5.6)
      {in IJ; I in)=tU(}+ 1)- t          1.                           nal for this loop is the differenced photodetector current
                                                                      2Jz . The use of feedback loops with be further discussed
So tbe mean-square uncertainties in J, and J, are                     in Sec. VII.
LORENTZ TRANSFORMAnONS                                                                                                   475
    A state i in) which allows an interferometer to achieve      where J, is given in Eq. (2.2). In fact, the operator J,
 phase uncertainty of order n ~ I has now been presented.        commutes with all the K i .
 How one prepares light in such a state, or a state similar         There has been a considerable amount of theoretical
 to it, is the topic of the next section. Here we simply         work, beginning with Yuen and Shapiro,1O on four-wave
 point out some properties of the state I in) of Eq. (5.2). It   mixers as possible sources of squeezed states. The reader
 is a superposition of the states I},O) and I}, 1 ). For the     is directed to Reid and Walls II and references therein for
 state I},O), N has the eigenvalue n =2} and J, has the          work that has been done on four-wave mixers. For the
 eigenvalue m =0. Equations (2.2) and (2.3) allow one to         purposes of this paper, a four-wave mixer will be regarded
 recognize this state as one in which exactly j photons          as a device with two input ports alin,02m and two output
 enter each of the two input ports of the interferometer.        ports Glou"G2o", which performs the mode transforma-
 For the state I}, 1 ), N has the eigenvalue n = 2} and J,       tion of the form 12.1l
 has the eigenvalue m = 1. This state can be recognized as
 one in which exactly} + 1 photons enter the input port
                                                                                                                          (6.7)
 G I '" while exactly} -1 photons enter the input port G, ,"'
        VI. THE TWO-MODE FOUR-WAVE MIXER                         Both backward degenerate four-wave mixing in which
                                                                 two counter propagating pump beams pass through the
    In the last section it was shown that the sensitivity of     nonlinear medium, and forward four-wave mixing, in
 an interferometer could be greatly improved provided one        which the pump beam propagates in only one direction
 could prepare the light delivered to the input ports of the     through the nonlinear medium, perform mode transforma-
 interferometer in a state which consists of a superposition     tions l4 of the form (6.71. Since the incoming and outgoing
 of two states, one in which exactly} photons enter each of      creation and annihilation operators must satisfy (2.1), the
 the two input ports in the interferometer and a state in        following restrictions are placed on the Sij:
 which} + 1 photons enter one port while} - 1 photons
 enter the other port. In this section it is shown that states        i   SIII'-ISI, ;'=1,
 similar to this can be generated with two-mode four-wave                                                                 (6.8)
 mixers. For the analysis of such a device it will be con-
 venient to introduce a set of operators whose commuta-              SIIS~I =S12S~2 .
 tion relations are those for the generators of the group
                                                                 From these relationships one cal' show
 SU(1,II.
    In particular we introduce the Hermitian operators                ISIII'=I S22!',
                                                                                                                          16.9)
       K x =+(a1ai+aj a2) ,                                           ISI,I'= IS,II'·
               itt                                       (6.1)   The phases of the Sij are controlled by the pump phase.
       KY=-2IaIG2-GIG,I,
                                                                   How the operators 16.1) transform under the scattering
                                                                 matrix (6.7)
       K,=+laial+a,Gil.
                                                                            [~:: ~:: I
 The commutation relations for these operators,
                                                                                                                         (6.10)
                                                                     S=
       [K"Kyl = -iK, ,
       [Ky,K,l=iK x   ,                                  (6.2)   will now be determined for some particular examples. A
                                                                 possible realization of S is
       [K"Kxl=iKy,
 can be recognized as those belonging to the group'"
 SUIl ,Ii. It is also useful to introduce the raising and
                                                                          _I
                                                                        cosh(+f3I             e~issinhl+f3I I'
                                                                                                                         16.11)
                                                                     S-s       I                       I
 lowering operators                                                    e'sinhl T f3I            cosh(T/3)
       K+=Kx+iKy=aiai,
                                                         16.3)   where 8 is controlled by the phase of the pump light rela-
       K _ =K x -iKy =a1 Q 2                                     tive to some master clock and /3 is related to the reflectivi-
                                                                 ty R of the four-wave mixer (when it is used as a phase-
 which satisfy the commutation relations'                        conjugating mirror) via sinh'( +/3I=R.
       [K~,K+l~2K,        ,                                         When the pump phase is set such that 8=1T/2 Eq.
                                                         16.4)   16.11) becomes
       [K"Ktl=±K±
                                                                                              -isinh~+/3) J
 The Casimir invariant K2 is
                                                         16.51
                                                                     S=     I  coshl +/3)
       ~ [~        =
                               o       o
                             cosh{3 sinh{3
                                               I   K.
                                                   Ky                     (6.13)
                                                                                        K.
                                                                                        Ky
                                                                                                                           K.
                                                                                                    =e -;(~I+~2)Kz Ky /(I/JI+t/lzlKz        .
                                                                                                                                                     (6.22)
                             sinh{3 cosh{3         K
             out                                           in                           Kz    out                          Kz
   •
                                                       Z
which represents a Lorentz boost along the y axis, where z                         In the Schriidinger picture the state vector is transformed
transforms as time. This transformation can be expressed                           as
in the form
                                                                                                                                                     (6.23)
                                          I
(6.11) becomes
                       +
       _ [COSh( {3) sinh( (3)        +                                                 S(/»=    [e~i5 ~),                                            (6.25)
      S- .                                                                (6.16)
                                                                                                                            +mI
                I         I '
          smh( ,(3) cosh( ,{3)
This transformation has the form of a Lorentz boost                                From (6.20) the transformation S( -/» can be recognized
along the x axis and can be expressed in the form                                  as a rotation about the z axis by an angle -/). S({3)
                                                                                   represents a Lorentz boost along the x axis, and S(/»
       K.                       K.                                                 represents a rotation about the z axis by the angle /). The
                        -iPK          ifJK y                                       product of transformations Eq. (6.24) thus represent a
       Ky          =e        ' Ky e            .                          (6.18)
                                                                                   Lorentz transformation along a direction making an angle
       Kz    out
                                Kz                                                 Ii with respect to the x axis. Hence, in the SchrOdinger
                                                                                   picture, after the incoming light I in) has passed through
In the Schriidinger picture the state vector is transformed
                                                                                   a four-wave mixer, it will be in the state
as
                                                                                                                                                     (6.28)
                                                                          (6.19)
                                                                                      It has now been demonstrated that a four-wave mixer
The operators performing the transformations of Eqs.
                                                                                   performs Lorentz transformations on the vector K, the
(6.15) and (6.19) are two-mode squeeze operators. 6.,z.IJ
                                                                                   direction of the Lorentz boost being determined by the
   At this point it will be useful to determine how K
                                                                                   pump phase which is at the experimenter's control. Since
transforms when the two input light beams sustain phase
                                                                                   Jz commutes with K, it remains unchanged under the
shifts. Letting a'in undergo a phase shift of q" and aZin
                                                                                   transformations performed by the four-wave mixer.
undergo a phase shift of q,z, then
                                                                                   From Eq. (2.2) one sees that this invariant is equal to half
            [eoi~t
                                                                                   the difference in the number of photons entering the input
                                                                          (6.20)   port of the four-wave mixer. This invariant has been not-
      S=
                                                                                   ed by Graham's and Reid and Walls.'6
                                                                                      Let us now consider the case when no light enters the
Under this transformation, K transforms as                                         input ports of the four-wave mixer. The state delivered to
                                                                                   the output is then given by Eq. (6.28) where Iin) is the
                   COS(q" +q,2)      sin(q,,+q,z) 0             K.                 vacuum state I0).
               -sin(q,,+q,2) cos(q" +q,2) 0                     Ky        (6.21)      The probability amplitude that n, photons will appear
                                                                                   in the output beam a, out and nz photons in the beam
                         o                     o                K z out            a20ut is
which can be recognized as a rotation about the z axis by                              (n"nzl out) = (n"n21 e -i6K'eiPK'ei5K, 10), (6.29)
an angle q,= -(q" +q,2)' This transformation may be ex-
pressed as                                                                         where the state      In "nz)       is
LORENTZ TRANSFORMATIONS                                                                                                                      477
                          t        nIt
                >       (a lout)
                                             n~
                                    (a 2 out) ~                                     n, photons and the beam a, out n 2 photons is thus
       In"n,        =         ~,                  10).                     (6.30)
                                                                                           P(n, ,n,) =On,.n,sech'l +(3)[tanh'( +(3)]n, .   (6.411
From (6.11 one sees that K, can be put in the fonn
                                                                                    From this equation one sees that P(n"n,) is zero if
      K,=+IN,+N,+lJ,                                                       (6.31)   n I ¥en,. For the vacuum state one has
                                                                                                                                           (6.42)
where Nj=aioutQloul and .""'2=a;out Q 2ouI are the number
operators for output beams I and 2, respectively. With                              Since J, is an invariant for the four-wave mixing process,
this equation it is readily apparent that                                           when there are n I photons in beam I there must be n,
                                                                           (6.32)   photons in the second beam as well, that is, the photons
                                                                                    are emitted in correlated pairs. These photons are in fact
and                                                                                 more highly correlated than allowed classically.""·
                                                                                       From (6.411 the mean (n) and the mean-square (n 2)
                                                                           (6.33)   number of photons emitted by the four-wave mixer can be
So Eq. (6.291 simplifies to                                                         computed:
                                                                                                     00
In order to simplify things further we make use of the                                          =2 sinh 2( +(3) ,                          (6.43)
identity'
                                                                                           (n')=     i         (n,+n,)'P(nl>n,)
exp(TK + -T*L )=exp                  I[~tanh          IT' ]K+         I                              1!]n Z
                                                                                                    i
                                                                              - t sin/)sinh{3(a ia +alal -aiai -a,a,)
                                                                                itt
                                                                              - Z"cosh{3(a la, -a 2a I) .                     (7.8)
                                                                    one has
                                                                          (aIJ.la)=+ laI 2sinh{3cosI20-/),
                                                                                                                             (7.10)
   FIG. 5. A method by which the state depicted in Fig. 4 can             (a IJy I a) = + Ia 1 2sinh{3sin(20-/) .
be generated and fed into an interferometer. The state is gen-
erated via a degenerate four-wave mixer IFWM) pumped via a          The mean-square fluctuation in Jx and Jy is independent
laser. A small fraction of the pump light is split off of the       of cf> and /):
pump beam, phase shifted by 9, attenuated by A and then fed
into one of the FWM inputs, a,. The input port a, is terminat-            (Mx )2=(My )2
ed with a cold blackbody absorber B. The two output ports b,
and b, of the four-wave mixer are fed into the input ports of the               =¥(Sinh2{3++)+tsinh'{3.                      (7.11)
Mach-Zehnder interferometer. 8 is a phase shifter for the pump
light before it enters FWMI.                                        One also has
LORENTZ TRANSFORMATIONS                                                                                                                  479
terized and in particular one knows the numbers (J,) and          .p1(O). It will be determined how rapidly .p2 approached
(Jx), which according to Eq. (7.4) and (7.10) are                 .p1(O) given the feedhack algorithm (S.8). Equation (8.S)
                                                                  iteratively substituted into itself yields
      (Jz)=+laI 2     ,
                                                                            II-I                       n-l    n-I
                                                          (S.l)
                                                                  .p( n)=    l:    (1 -AA k ).p(o)+A   l: BK 11      (1 -AAm) ,
                                                                            k=O                        k=O   m=k+1
The differenced photocurrent is measured at the output of                                                                    (8.9)
the interferometer, that is, the photodetectors measure
2Jz . A sequence of measurements will generate a string           where the product is defined in the usual way, except that
of numbers, each of which is an eigenvalue of 2Jz. One is              "-I
free to process these numbers and in particular one can                11 F(m)=I.                                           (8.10)
subtract (Jz) from them and divide them by -2(J.).
Then the sequence of numbers Idl>d2, ... 1are eigenstates         The mean value (.p(n» is, using Eq. (S.5),
of the operator D
                                                                       (.p(n»=(1-A)"(.p(O» .                                (8.11)
       (sin.plJx          (cos.p lJ, - (Jz )
      D=---                                               (8.2)
           (J. )                (Jx)                              It is apparent that the mean value of .p(n) will converge to
                                                                  zero only if 11- A 1< I. Hence the feedback parameter is
For simplicity it will be assumed that .p is small so that        restricted to the range
the approximations sin.p"",.p and cos.p"", I can be made.
Then one can write                                                                                                          (S.12)
                                                                                                  "-I
                                                          (8.4)                        +A2(B2)    l:    W_AA)2)k
                                                                                                  k=O
                                                                                       -A2(AB+BA ).p(O)
Since
      (A)=l,                                                                               "-I
                                                          (8.5)                           xl:  ((l-AA)2)"-I-k(l_AA)k.
      (B)=O                                                                                 k=O
                                                                                                                           (8.13)
it is immediately evident that                                    The sums can be evaluated to yield
                                                         (S.6)    ([.p(n)f) =      « I-AA )2)"[.p(O)f
that is, the sequence of numbers Id I>d 2 , ••• 1 are esti-
                                                                                   +A2(B2) 1-({\_AA2»"
mates of.p.
                                                                                           1- ((l-AA )2)
   The phase shifter .p2 of Fig. 5 will be taken to be con-
trollable. A feedback algorithm that will track .p 1 main-                         -A2(AB+BA) ((l-AA)2)"_(I-AA)"
taining .p=.p2-.p1 at zero will now be described. Let .p2(i)                                           ((l-AA )2) - (I-AA)
be the setting of .p2 during the ith measurement. The
                                                                                                                   (8.14)
measurement provides the estimate of .pU)=.p2(i)-.pI(i),
 d" which is an eigenvalue of                                     The expectation values « 1- AA )2) and (1- AA ) can be
                                                                  written, keeping in mind Eq. (8.5), as
    D,=.p(j)A,-B, .                                       (8.7)
                                                                      (I-AA)=I-A,
The feedback loop then adjusts .p2 to the new setting                                                                      (8.15)
    .p2(i + 1)=.p2(i)-Ad, ,
                                                                  Substituting these expressions into (8.14) one finally has
or in operator form
                                                                  ([.p(n)]2)
    .p2(j +1)=.p2(i)-AAM2(i)-.pM)]+AB, ,                  (S.S)
                                                                      =[( I_A)2+A2(~A )2]"[.p(O)f
where A is a feedback parameter.
  It is now assumed that the successive measurements are                 +A2(B2) 1-[(1 _A)2+A2(~A )2]"
performed on a time scale equal to the characteristic                            1-[( I_A)2+A2(~A )2]
coherence time of the four-wave mixer so that the ith
                                                                         -A2(AB+BA) [( I_A)2+A2(~A )2]"_( I-A)"
operators A, and B, are independent of the j operators Aj
                                                                                     [(I_A)2+A2(~A )']-(I-A)
and Bj • Then Eq. (8.8) can readily be iteratively substitut-
ed into itself. For convenience .pIU) will be held fixed to                                                                (S.l6)
LORENTZ TRANSFORMATIONS                                                                                                               481
   From this equation one sees that in order for ([~(n)]')         approximately 1.44 measurements in order for ~, to adjust
to converge one must, in addition to (8.12), have                  itself to the new ~l. Hence, on the average, the total num-
[(1-A)'+A'(..1A)'J"<1. This expression yields the re-              ber of photons NT used to detect this displacement is of
striction                                                          the order N T = 1.44N and ~~ in terms of the total num-
                                                                   ber of photons used is
                                                          (8.17)
  As a particular example, consider the case when A= I,                  ~~"",,: .                                                  (8.27)
                                                                                   NT
then
                                                                   Hence, by increasing the number of photons fed into the
      ([~(n)]2) =(~A )2"[~(O)]2+ (B2) I +(~A ),.                   four-wave mixer from I to 1a 1'",,2+ v'S ",,4. 24, the in-
                                                I-(~A)'            terferometer can be operated stably in a feedback mode
                                                                   and ~, can detect changes in ~l as small as that given by
                        -(AB+BA)(ilA)2.-1.                (B.18)
                                                                   (8.27).
                                                                      Consider now the case where, instead of choosing 1a I'
The mean-square value of'" converges to                            large enough so that the feedback loop would be stable
                                                                   with the feedback parameter A set to unity, one chose
      ("").-~
      'f' mm- I-(~A)'               .
                                                          (B.19)    I ai' = I as was done in Sec. VII in order to optimize the
                                                                   sensitivity (7.22) with N fixed. In this case
   As ~ is driven to zero, the characteristic number of
measurements il that must be made to reduce ~' to 1/e of                 (~A)'=+ ,
its original value is
                                                                         (B,) _ _  I-                                               (8.28)
      _        I                                                              - sinh'/3 '
                                                          (8.20)
      n = - In(~A )2 .                                                                              2
                                                                         (AB+BA) = sinh/3 .
  We now substitute the results of Sec. VII into these ex-
pressions. One has                                                 Since (~A)'> I it is apparent from (8.18) that the feed-
                                                                   back loop cannot be operated stably with the feedback pa-
                   21 a 1'(sinh'/3+ +)+ +sinh'/3
                                                                                                           +
      (~A )'= - - - - - : - - - : ; - - -                 (8.21)   rameter set to unity. In fact, from Eq. (8.17) it follows
                               Ia 14sinh'/3                        that A must be less than    if the feedback loop is to be
                                                                   operated stably. In order to make the e-folding time for
                                                          (8.22)   ~' as short as possible we choose the value of A which
                                                                   minimizes
and
                                                                         [(l-A)'+A'(~A )']
      (AB+BA)=_--:2~                                     (8.23)   of Eq. (8.16), that is,
                             Ia l'sinh/3
In the large-/3limit Eq. (8.21) reduces to                               A= _ _I__ =l                                               (B.29)
                                                                            I+(~A)' 7·
                   21     1'+ I
      (~A)2=            aT.                               (8.24)   If ~I(O) is held fixed (~') settles to a steady-state value
                        I a 14
                                                                         ( "") . =                      A'(B')                      (830)
In order that the sensitivity (~~)' not be degraded too                      'f'   mm      1_[(l_A)2+A2(~A )']                        .
much from its minimum value (il~)2=(B2) [see Eq.
(8.19)], let us choose (~A)2 = +. Then the characteristic          The characteristic number of measurements which must
number of measurements necessary to reduce ~' to 1/e of            be made to reduce ~2 to 1/ e of its initial value is
its original value is il = 1.44. From (8.24) one sees that               _                          I
                                                                                                                                    (B.31)
 lal' has the value lal'=2+v'S. Using (7.21), Eq.                        n=-ln[(l_A)'+A'(~A)'] .
(8.19) becomes, for large N,
      ("") .      ~2( la 1'+1)'
                                                                   For A= t and (ilA )'=                +one has, upon using Eq. (7.21),
       'f'   mm         la   I'N'                                        ( ",2) .
                                                                             "P    mln-
                                                                                        _.!.:..!!
                                                                                           N2                                       (8.32)
                    12.9
                                                          (8.25)   and
                  "" N' .
                                                                         il=3.0.                                                    (8.33)
Suppose ~ I is stationary so that ~, has settled down and ~
fluctuates with the mean-square value of (8.25), i.e.,                Again consider the case where ~l has remained constant
                                                                   for a long time so that the rms fluctuations in ~ have set-
                                                          (8.26)   tled down to the value determined by (8.32), il~",,1.07lN.
                                                                   Suppose now that ~ I is displaced instantaneously to a new
Then if a small disturbance should come along to displace          value a distance ~~ from its old value. It takes character-
~l by an amount ~~ from its quiescent value it will take           istically three measurements for ~, to adjust itself to the
482                                                                                                                CHAPTER IX
      A.I.=E                                          (8.34)
       'I' NT'
a number that is somewhat better than Eq. (8.27).
                                                                 FIG. 6. An SU(1,1) interferometer. The beam splitters of a
   In this section it has been shown that by using suitable    conventional interferometer have been replaced by the four·
feedback loops the interferometer of Sec. VII can track        wave mixers FWMI and FWM2. The light pumping FWM2 is
changes in 4>, in a stable manner and can achieve a phase      phase shifted from the light pumping FWMI by the angle.p.
sensitivity of order I! N. Hence the two problems en-
countered in Sec. VII, namely the fact that the inter-
ferometer achieves its optimum sensitivity only for a
                                                                                                                   I
small range of phases, 4> < I!N, and that the fluctuations     particular let FWMI have the scattering matrix
in Jz ou" the interferometer's output, are greater than
(lzou,) for 1a 12 set at its optimum value, can be over-                      cosh( t{3)         +i sinh( t{3)
come be operating the interferometer with 1a 12 slightly           S( -{3)= f . '      I                   ,             (9.1)
                                                                             - I smh( ,{3)        cosh( ,(3)
degraded or by choosing the response of the feedback loop
to be such that it averages enough successive measure-         As can be seen from Eq. (6.13), K transforms as a
ments of 4> that a useful error signal can be generated.       Lorentz boost L ( - (3,y) along the - y axis under this
   In the literature2 - 4 a number of schemes for achieving    scattering matrix:
interferometer sensitivities of II N have been described.
                                                                                    [~           -S~nh/3j
All of these schemes employ standard interferometers into
which light from degenerate-parametric amplifiers or               L(-{3,y)=             C:h/3                           (9.2)
four-wave mixers is injected. In the next section we will                           o - sinh/3    cosh{3
describe a novel set of interferometers which dispense
with beam splitters and use the SUIl,1) boosts to convert      The scattering matrix for FWM2 is
phase shifts into light amplitude changes rather than the
SU (2) rotations employed by a conventional interferome-                    cosh( t{3)      -i sinh( t/3) 1
                                                                   S({3)= fi sinh( t{3)
                                                                                                                         (9.3)
ter.                                                                                         cosh( t{3)
IX. AN SU(1,O MACH-ZEHNDER INTERFEROMETER                      This scattering matrix transforms K as a Lorentz boost
                                                               L ((3,y) along the + y axis. The transformation per-
   In S-c. III it was shown how the operation of a Ma~h       formed by the phase shifters 4>, and 4>2 is, from Eq. (6.20),
Zehnder mterferometer could be viewed in terms of rota-
tions of the vector I under the rotation group SU(2).
   In this picture relative phase shifts between two light         S(4))=
                                                                            eoi¢'
beams correspond to rotations about the z axis while pho-
                                                                            f                                            (9.4)
                   COSh Y    o   sinhy   j
    L (y, x) = [      0            o         .             (9.9)
                    sinhy    o   coshy
Equation (9.8) holds when 8 and yare chosen such that
       cos8=                 sin¢>
               [sin'¢ + (I -coscl> )'cosh'/3J 11l
                                                          (9.11)
                                                                        '.
                                                                                ..,                          Ib,
          +.
picted in Fig. 7(a) as a cone whose base intersects the z          accumulated by the light beams propagating in the interferome-
axis at     The Lorentz boost L ( - /3,y) is equivalent, in        ter result in a rotation in the xy plane. (d) The second four-
the Schriidinger picture, to a boost of the state vector in        wave mixer performs a Lorentz transformation along the nega·
the opposite direction. The Lorentz boost performed by             tive y axis. The total number of photons leaving the interferom·
the first four-wave mixer is depicted in Fig. 7(b)' The            etef is a linear function of K z .
mean value of K, in terms of the mean number of pho-
tons (N) emitted by the four-wave mixer is, from Eq.
(6.31),                                                            explicit calculation . From Eq. (9.6) and Eqs. (6.15) and
       (K,)=+((N)+I) .                                             (6.23) the incoming state vector 1in) is transformed as
The phase shifts cI> 1 and ¢, encountered by the two light                                                                   (9.17)
beams leaving the four-wave mixer then rotate the state            but from 19.8) this is equivalent to the transformation
vector about the z axis by an angle -¢ = ¢I +¢,. This is
depicted in Fig. 7(c). A second Lorentz boost with the                                                                      (9.18)
same rapidity, but in the opposite direction, is then per-
formed. If ¢ = 0the final state will be a vacuum state and         The operator Nd for the total number of photons detected
no photons will be detected by the photodetectors in the           by the photodetectors placed in the output beam is from
                                                                   Eq. (6.31)
output beams. If ¢ is nonzero the state of the light
delivered to the photodetectors will be a Lorentz-boosted              Nd=2K,-I.                                             (9.19)
vacuum, the rapidity parameter being determined by Eq.
(9.12) or (9.13).                                                  Hence in order- to evaluate (Nd ) and ll.Nd one needs to
   In Fig. 7(b) the projected ellipse lying in the x- y plane      evaluate (out 1K, l out) and (out IK; lout). From Eq.
has a width of      +
                    and the distance from the origin to its        (9.18) one has
center is (K,)";'      +(
                       (N) + I), Hence Fig. 7(c) suggests
                                                                   (out 1K z lout) = (in 1e i6K'e -iYK'K,eiyK'e -i6K, 1in) .
that the minimum detectable phase ¢min is of the order
                                                                                                                             (9.20)
             I
                                                          (9.16)
    ¢min= (N) +1 '                                                 Since I in) is the vacuum state, one has
that is, this detector can achieve a phase sensitivity ap-             e -i6K, 1in) =e - i612 1in) .                         (9.21)
proaching 1/ N.
  That this is the case will now be demonstrated with an           Equation (9.20) thus simplifies to
484                                                                                                                             CHAPTER IX
      (out IK. lout) = (0 Ie -yK,K./ YK, 10) .               (9.22)             The quantity    (11t/!)2 is minimized when t/!=O, then
From Eqs. (6.18) and (6.17),
                                                                                         2     I
                                                                                    (dt/!)min=-'-2- .                                     (9.30)
      e -iyK, K./ yK,      = (sinhy )K. + (coshy )K. ,       (9.23)                          smh ,B
                               r
                                                                                that is, with beam 1 at the frequency "'0+ d", and beam 2
                                                                                at the frequency "'0- d",; the scattering matrix 6•12. 13 for
                   [a(~d)                                                       the four-wave mixer will still have the form (6.11).
                                                                                    By using techniques similar to those used in deriving
                                                                                Eq. (6.41) one can show that the probability P(N) of
               sin 24>+( l-coS¢)2cosh2,B                                        detecting a total of N photons leaving the output ports of
                                                             (9.29)
                      sin 2t/! sinh2p                                           the interferometer is
                   o   ifNisodd
                                                                          NI2
      P(N)=                  2          [ (I-cost/!)sinh 2p                                                                               (9.32)
                                                                      ]
                                                                                 if N is even.
                   (l-cost/!)sinh 2,B+2 (l-cost/!)sinh 2,B+2
Let Ni denote the number of photons counted during the                          The sum has the form
ith measurement in a sequence of measurements. One is
free to take the square root of each of these numbers.
Hence it is meaningful to talk about the average and rms
value of v'N. The motivation for investigating the statis-
tics of v'N stems from the fact that (N) is an even func-                       which can be approximated by the integral
tion of t/!. Now
      (v'N)=        l:     v'NP(N)                           (9.33)                                                                      (9.35)
                   Neven
Since this approximation holds reasonably well for              Hence for N > 4 it has been shown that the uncertainty in
                                                                the inferred oorm of the phase 4' is to a good approxima-
     (l-cosp)sinh2p~ 4                                 (9.37)   tion
the logarithm can be approximated via        In(l+x)~    and          al4'1 =0.52214'1·                                         (9.44)
one has
                                                                It is also instruEtive to ask what the probability P( I4'J :
       (v'N)""   ~1r (l-co~)ll2sinhp.                  (9.38)   (l-a)I4>1 < 14>1 dl+a)I4>I) is that a measured 14>1
                                                                will lie in the range
                     If
Approximating (1- co~) as 4>2/2 one finally has
                                                                      (I-a)I4>1 < 14'1 <O+a)I4>1 .
       (v'N )=t            f1214> Isinhp .             (9.39)   From (9.40) this is equivalent to determining the probabil-
                                                                ity PIN: Nl <N <N 2) that N lies in the range
Hence the norm of the phase 4' inferred from a measure-         Nl <N <N2 where
mentof Nis
                                                                      N =~(I-a)24>2sinh2p
                                                                       1       8                         '
                                                                                                                                (9.45)
                                                                      N2=f(l+a)24>2sinh2p.
                                                       (9.40)
                                                                One can show rigorously
From (9.39) one has
                                                       (9.42)                      [
                                                                                        (I-co~)sinh 2    P ]N212]               (9.46)
                                                                           -           (l-co~)sinh2f:l+2            .
so
                                                       (9.43)   So from Eq. (9.45)
Approximating I-cos,p by 4>2/2, this expression can be          P(I4'I:(l-a)I<I>1 < 14'1 dl+a)I<I>I)
put into the form
                                                                                ""e -1".12111-aI2-e -1 ..I2IIl+al'              (9.50)
           I[                                                                                  +, then
P(lhO-a)I4>1 < 14'1 dl+a)I4>I)
                                                                As an example, let a =
       =     x :4 r16111-a12x - [ x :4 r16111+a12X     j,       p<lh       1<1>1/2< If I dl<l>I/2)
                                                                                               ""e - ../8_ e -9.-/8=0.646   .   (9.51)
                                                       (9.48)
where x = <l>2sinh2fJ.                                          Hence from a single measurement of 14' lone has 65%
  Now                                                           confidence that I <I> I /2 < If I d I <I> I /2.
                                                                   The interferometer described here suffers from draw-
     'x_co
             [x+4 j =e-
       lim _x_
                      x
                             4 •                       (9.49)
                                                                backs similar to those of the SU(2) interferometer of Sec.
                                                                VII. Maximum sensitivity occurs at 4>=0 and the sensi-
                                                                tivity rapidly degrades as 4> is adjusted away from zero. It
This limiting value is not a bad approximation for              was shown in Sec. VIII that such drawbacks can be over-
[x/Ix +4)]' even for x as low as 10, the level at which         come with feedback. However, implementing a feedback
on average five photons are counted in the interferometer       algorithm for the SUO,I) interferometer described here is
output beams. Hence                                             complicated by the fact that (N) is an even function of 4>
486                                                                                                                         CHAPTER IX
 and hence the sign of the error signal cannot be deter-                 [Lx>Lyl=-iL, ,
 mined from the ~umber of photons counted by the photo-
'detector during a single measurement.                                   [LyLzl=iLx'                                                 (10.3)
    The sign of the error signal can be generated by chang-              (L"Lxl=iLy.
 ing (dithering) </>, between successive measurements and
 constructing the derivative signal (N; + I -N;)IIl</>,.             Again, it is useful to introduce the raising and lowering
    Alternatively, one could impleIl]ent the feedback algo-          operators
 rithm which will now be described. Make repeated mea-
                                                                         L+ =Lx+iLy=fata t                  ,
 surements of Ii: until I</> I is determined to some                                                                                 110.4)
 predetermined precision: Ill</> I =a I</> I where a is a con-           L_ =Lx -iLy = Taa
 stant. Then move </>2 according to
                                                                     which satisfy the commutation relations
      cb,(new)=</>,Iold)+ Iii.                             (9.52)
                                                                         [L_,L + 1=2L, ,
                                        i
Make repeated measurements of I I at this new setting                                                                                (10.5)
so that a new i </> I can be inferred with the precision                 [L"L±l=±L± .
Ill</> I =a I</> I· If I</> I inferred for the new setting of </>,   The Casimir invariant
is less than I</> i inferred for the old setting one assumes
that one has moved in the right direction. If, on the other              L'=L;-L;-L; ,                                              110.6)
hand, the inferred value of I</> I for the new setting of </>,
                                                                     when expressed in terms of the operators a and at,
is greater than the inferred I</> I for the old setting one as-
                                                                     reduces to the number
sumes that one has moved </>2 in the wrong direction and
cb, is then readjusted so that                                                                                                      (10.7)
                                                           (9.53)
                                                                        It is useful to determine how L=(Lx>Ly,L,)
The process is then repeated.                                        transforms under specific cases of Eq. (10.1). Under the
   If I</> I is determined to sufficient precision this algo-        mode transformation
rithm will move one closer to <I> = 0 most of the time. On               aout=cosh(+(3)Qln+sinh(+f3)a~l ,                           110.8)
the occasions when this algorithm moves one in the wrong
direction it generally does not move <1>, very far in the            L transforms as a boost along the x axis:
wrong direction and the lost ground is regained during the
next few iterations of the feedback procedure. Further,                   Lx ]             [COSh{3 0 Sinh{3] Lx
since a single measurement already determines I cb I with                 Ly           =      0    1   0     Ly                     110.9)
a precision Il</> ",,0. 51</> I at a 65% confidence level, one
                                                                          Lz     out        sinh{3 0 cosh{3          Lz
does not have to make very many repeated measurements
of I '" I in order for the feedback algorithm to work.               Under the mode transformation
  In this section interferometers based on devices having            L transforms as a boost along the y axis:
the scattering matrix
                                                                          Lx
                                                                                                                j Lx
will be described. Such a single-mode device can be re-
                                                           (10.1)         Ly
                                                                          L,     out
                                                                                       =   [~     o       0
                                                                                                cosh{3 sinh{3
                                                                                                sinh{3 cosh{3
                                                                                                                  Ly
                                                                                                                     L,
                                                                                                                                   110.11)
       lout) =e -;(612)L,//JL'e'1612)L, Iin) ,           (10.14)   photons will be counted leaving the two-port four-mixer
                                                                   of Sec. VI. From Eq. (6.41)
where e'/JL, is a single-mode squeeze operator l9 and                                    O, n odd
e '(612)L, has been called a single-mode rotation opera-                                {                                   (10.25)
                                                                       PT(n)= sech 2( t(3)[tanh 2( t(3)f,         n even.
tor. 6• 12.13
    More generally one could consider a device which               With the identity
transforms a state vector according to
       lout) =e'<;L'e'/JL'e'9L, I in) .                 (10.15)        k~J: 1[2~n~:)l=22n                                   (10.26)
The probability distribution for the number n of photons
in the output beam will now be determined for the case             one can show
when the input consists of vacuum fluctuations. A more
general case, when the input consists of coherent states,
                                                                       PT(n)=              !        P(nl)P(n2)'             (10.27)
                                                                                          nl,n2
has been treated by Yuen. 20 A photodetector in the out-                                n l +n2=n
put beam measures N =a t a, which can, from Eq. (10.2),
be written in the form                                             This equation implies that the statistics of the total num-
                                                                   ber of photons leaving the two-mode four-wave mixer is
      N=2Lz    -t·                                      (10.16)    the same as the statistics of the total number of photons
                                                                   coming from two independent single-mode devices. This
The amplitude that n photons will be counted in the out-
                                                                   observation will allow a simplification of the discussion of
put beam is (n lout), hence the probability P(n) that n
                                                                   feedback loops for the interferometer discussed in this sec-
photons will be counted is
                                                                   tion, since the results of Sec. IX can be made to apply by
      P(n)=   I (n   lout) 12 .                         (10.17)    pairwise averaging successive measurements made with a
                                                                   single-mode device.
Now for an n-photon state In) one has                                 The interferometer to be considered in this section is
The probability distribution (10.17) thus reduces to               where Ii is proportional to the phase of the pump light
                                                                   entering OPA2. Letting I in) denote the state vector for
      P(n)=   I (n Ie'/JL,   10) 12 .                   (10.21)    the incoming light, the state vector lout) for light leav-
                                                                   ing the interferometer is
Hence P(n) is independent of the phase angles <P and 0,
i.e., P(n) depends only on the magnitude of the boost.                  lout) =e -'6L'e -'/JL'e'6L'e -'<;L'e'/JL, I in).    (10.28)
Again, using (6.35) one has
                                                                      The behavior of this device when I in) is the vacuum
      e '/J L, =exp(i tanh( t(3)L + Jexp[ -2Incosh( t(3)L z J      state will now be considered. Figures analogous to Fig. 7
                                                                   can be drawn to illustrate the behavior of the interferome-
                                                        (10.22)    ter. However, in this case the Casimir invariant Eq. (10.6)
Hence by using the techniques used to arrive at Eq. (6.41)
                                                                   bas the numerical value -               +.-.
                                                                                                    Hence L lies on a space-
                                                                   like hyperboloid instead of the light cone of Fig. 7. The
one can show                                                       vacuum state is an eigenstate of L,
0, n odd
      P(n)=
                ;n   [i j     cosh( t(3) ,
                                             n even      (10.23)
                                                                           PUMP
where
       [: l= (n-~)!m!
                                                                     FIG. 8. A single-mode SU(J,I) interferometer. The device
                                                         (10.24)   employs two degenerate-parametric amplifiers OPAl and
                                                                   OPA2. The output of the device is sensitive to the difference
  It is useful to compare this probability distribution with       between the phases '" and Ii accumulated by the signal and pump
distribution PT ( n) for the probability that a total of n         beam, respectively.
488                                                                                                            CHAPTER IX
                                                        110.29)     ter. Since the number operator N for the total number of
and hence could be represented as a circle drawn around             photons counted at the output of the detector is linear in
the hyperboloid at a height along the z axis of +.                  L" E~. (10.16), one would like to determine
  We now determine the mean and variance in the num-                (outlLz lout). One can readily show from Eq. (l0.28)
ber of photons counted at the output of the interferome-            that
where in analogy with Eqs. 19.12) and 19.13),                       taining 4>-0=0 by using the error-correcting signal to
                                                                    adjust the phase shifter Ii in the pump beam delivered to
coshy=[ l-cosl4>-0)]cosh 2/3+cosl4>-0) ,
                                                        (10.32)     DPA2. As was mentioned earlier the statistics of the total
                                                                    number of photons counted in two successive measure-
sinhy=sinhtJ! sin'I4>-o)+ [1-cos(4)-0)j2cosh'/3jll' .               ments of 4> are the same as for the total number of pho-
                                                                    tons leaving the interferometer of Sec. IX. Hence the
It is straightforward then to show that
                                                                    feedback algorithms discussed in Sec. IX will also work
      (N) "" (out IN lout) =+coshy ,                    110.33)     for the single-mode device discussed here.
Hence it has been shown that the device of Fig. 8 can                                   ACKNOWLEDGMENT
indeed achieve a phase sensitivity approaching lin. This
minimum sensitivity is achieved when 4>-15=0. Hence                    We would like to thank R. E. Slusher for stimulating
by implementing a feedback loop one can track 4> main-              discussions on the work presented here.
LORENTZ TRANSFORMATIONS                                                                                                      489
1M. Born and E. Wolf, Principles of Optics (Pergamon, Oxford,      IOH. P. Yuen and J. H. Shapiro, Opt. Lett. 4,334 (1979).
  1975).                                                           11M. D. Reid and D. F. Walls, Phys. Rev. A 31, 1622 (1985).
'C. M. Caves, Phys. Rev. D 23, 1693 119811.                        12c. M. Caves and B. L. Schumaker, Phys. Rev. A 31, 3068
3R. S. Bondurant and J. H. Shapiro, Phys. Rev. D 30, 2548             (1985).
  (1984).                                                          13B. L. Schumaker and C. M. Caves, Phys. Rev. A 31, 3093
4R. S. Bondurant, Ph.D. thesis, Massachusetts Institute of Tech-      (1985),
  nology, 1983.                                                    14P. Kumar and J. H. Shapiro, Phys. Rev. A 30, 1568 (1984).
lWei-Tou Ni (unpublished).                                         lsR. Graham, Phys. Rev. Lett. 52, 117 (1984).
6B. L. Schumaker, Phys. Rep. (to be published).                    16M. D. Reid and D. F. Walls, Phys. Rev. Lett. 53, 955 (1984).
'w. H. Steel, Interferometry, 2nd ed. (Cambridge University,       I'B. Yurke, Phys. Rev. A 32, 300 (1985).
  Cambridge, 1983).                                                18B. Yurke, Phys. Rev. A 29, 408 (1984).
8K. W6dkiewicz and J. H. Eberly, J. Opt. Soc. Am. B 2, 458         19J. N. Hollenhorst, Phys. Rev. D 19, 1669 (1979).
  (1985).                                                          2oH. P. Yuen, Phys. Rev. A 13,2226 (1976),
9D. R. Truax, Phys. Rev. D 31, 1988 (1985).
490                                                                                              CHAPTER IX
                 Abstract. The exact Lorentz kinematics of the Thomas precession is discussed in terms of
                 Wigner's 0(3 )-like little group which describes rotations in the Lorentz frame in which the
                 particle is at rest. A Lorentz-covariant form for the Thomas factor is derived. It is shown
                 that this factor is a Lorentz-boosted rotation matrix, which becomes a gauge transformation
                 in the infinite-momentum or zero-mass limit.
1. Introduction
To most physicists, the Thomas precession is known as an isolated event of the ! factor
in the spin-orbit coupling in the hydrogen atom [1-6]. The purpose of this paper is
to point out that the Thomas rotation plays a very important role in studying the
internal spacetime symmetries of massive and massless particles. Einstein's E = me 2 ,
which means E = (p2+ m 2)1/2, unifies the momentum-energy relations for massive and
massless particles, as is illustrated in table 1. We are interested in the question of
whether the internal spacetime symmetries can be unified in a similar manner.
     The Thoma~ precession is caued by the extra rotation the particle in a circular orbit
feels in its own rest frame [1,3, 5, 7]. In this paper, we point out first that Wigner's
little group [8] is the natural language for the Thomas effect and perform the exact
calculation of the precession angle. We shall then examine its implications.
Spin, gauge, helicity S3' S"S2 Wigner's little group S3, Gauge transformations
     The little group is the maximal subgroup of the Lorentz group which leaves the
4-momentum of a given particle invariant [8]. The little groups for massive and massless
particles are locally isomorphic to 0(3) and E(2) respectively. The 0(3)-like little
group for a massive particle becomes the three-dimensional rotation group in the
Lorentz frame in which the particle is at rest, which is therefore the Thomas frame.
     In the case of massless particles, the rotational degree of freedom of E(2) is
associated with the helicity [8,9] and the translational degrees of freedom correspond
to the gauge degrees of freedom [10-12]. It has been established that the O(3)-like
little group becomes the E(2)-like little group in the limit of infinite momentum and/ or
vanishing mass [13-16]. This means that both the 0(3)- and E(2)-like little groups
are two different manifestations of a single little group, just as E = p2/2m and E = cP
are two different limits of E = [( CP)2 + (mc 2f]\/2. We shall show in this paper that the
Thomas effect stands between these two limiting cases.
     In § 2, the Wigner rotation is constructed for the kinematics of the Thomas pre-
cession. In § 3, the Thomas effect is shown to be an element of the 0(3)-like little
group. In § 4, it is shown that the Thomas factor becomes a gauge transformation in
the large-momentum or zero-mass limit.
2. Wigner rotations
a 8, b
              Figure I. Closed Lorentz boosts. Initially, a massive particle is at rest with 4-momentum
              p•. The first boost B, brings p. to Pb' The second boost B2 transforms Pb to Pc. The
              third boost B3 brings Pc back to p•. The net effect is a rotation around the axis perpendicular
              to the plane containing these three transformations. We may assume for convenience that
              Ph is along the z axis, and Pc in the zx plane. The rotation is then made around the y axis.
492                                                                                         CHAPTER IX
The second boost B2 transforms Ph into Pc whose momentum has the same magnitude
as that of Ph but makes an angle 0 with the direction of Ph:
                                                                                                  (4)
From the triangular geometry of figure 1, the direction of the boost for B2 becomes
( 0 + 7T) /2 and its boost parameter is
                A =2tanh- I ([sin(0/2)] tanh 1)).                                                 (5)
The transformation matrix is
         1 + (sin 0/2)( cosh A -1)               ! sin 0(1- cosh A)         (cos 0/2) sinh  A)
B2 = (      ! sin 0(1- cosh A)              1 + (sin 0/2)2(cosh A -1)      -(sin 0/2) sinh A .    (6)
             (cos 0/2) sinh A                    -(sin 0/2) sinh A               cosh A
    This means that the above three successive boosts will leave the 4-momentum
Pa = (0, 0, m) invariant. Since all the boosts are made in the zx plane, the net effect
will be a rotation around the y axis, which does not change Pa:
                B3B2BI        =W            or         B3B2BI W- I    =I                          (9)
where I is the identity matrix. W is a one-parameter matrix representing a rotation
around the y axis in the Lorentz frame in which the particle is at rest and its form is
                              COS    a
                 W=   (
                          -s~n a                                                                 (10)
with
                          .   -I (            (sin O)(sinh 1)/2)2          )
                a =2sm                               .                       .                   (11)
                                     [(cosh 1))2 - (smh 1) )\sin 0/2)2]1/2
W is definitely a rotation matrix of Wigner's O(3)-like little group which leaves Pa
invariant. Since Wigner was the first to introduce the concept of this little group in
terms of rotations in the Lorentz frame in which the particle is at rest [8], it is quite
appropriate to call W(ll') the 'Wigner rotation' [17].
3. Thomas effect
The Thomas effect is caused by two successive boosts with the same boost parameter
in different directions [1-7]. The transformation needed for this case is (B3 )-I(B 1)-1
LORENTZ TRANSFORMATIONS                                                                                493
which brings Pb to Pa and then transforms Pa into Pc. However, this is not B 2 , but
requires an additional matrix T:
               (B 3 )-I(B 1)-I=B2 T                                                                 (12)
where T is the Thomas factor. According to (9)
               (B 3)-I(B1)-1      = B2BI W-'(B1)-'.                                                 (13)
If we compare (13) and (14):
               T=B,W-I(B,)           1                                                              (14)
While W is the rotation matrix whose form is given in (10), T is not a rotation matrix
but is a Lorentz-boosted rotation matrix. Its form is
      '    cos a             -(sin a) cosh 1/                   (sin a) sinh 1/           )
    (
T = (sin a) cosh 1/                  2
                        (cos a) cosh 1/ - sinh 1/2   (cosh 1/)[ cosh 1/ - (sinh 1/) cos a] .
      (sin a) sinh 1/ (sinh 1/)( cosh 1/)( cos a -1)     cosh 1 1/ - (cos a) sinhl1/
                                                                                                    (15)
Furthermore, T can be written as
             T = (B1 )-I(B 3 )-I(B 1) -I.                                                           (16)
This matrix leaves the 4-momentum Pb invariant and is therefore an element of the
O(3)-like little group for Pb.
    The right-hand side of (12) is different from the decomposition given in the existing
literature [2-6, 18-20]. (B 3 )-'(Bd- ' can also be written as a boost preceded by a
rotation:
                                                                                                    (17)
As is illustrated in figure 2, R represents a rotation in the Lorentz frame in which the
particle is not at rest and the boost B' is quite different from B2 . The rotation angle
for R is
                 ,         _I (          (sin O)[sinh( 1//2)f       )
                                                                                                    (18)
               a =2tan            [cosh(1// 2 )]2-[sinh(1// 2 )]2cosO.
a~----~----~----~~b
               Figure 2. Lorentz boosts which are not closed. Two successive boosts (B 3 )-'(B,)-' result
               in boost B' preceded by rotation R performed in the Lorentz frame where the particle is
               not at rest. It is quite clear that B' is not B2 •
494                                                                            CHAPTER IX
This angle has been reported in the literature [18-20] but is conceptually quite different
from a of equation (11). Figures 1 and 2 will illustrate this difference. a and a' are
numerically different.
    The Thomas precession of a charged particle in an electromagnetic field and its
connection with the Wigner rotation has been thoroughly discussed in a recent review
article by Chakrabarti [21]. Our main interest in this paper is the Thomas effect in
the large-momentum or zero-mass limit.
Let us go back to the T matrix of (15). In the low-energy (small-1) limit, T becomes
a rotation matrix and the rotation angle a becomes equal to a' in the same limit. The
above discussion therefore does not alter the existing treatment of the Thomas pre-
cession in the small-1) limit. If 1) is not small, (15) has a new meaning. In the limit
of large 1), T takes the form:
                                   -u
              T=(~ ~    u 0
                                                                                     (19)
u 0
with
For a finite value of u, a vanishes as 1) becomes very large. We have restored the y
coordinate in order to compare this form with those given in the literature [8, 10, 11].
The T matrix is now applicable to the 4-vector (x, y, z, t).
    The above expression for the T matrix represents an element of the E(2)-like little
group which leaves invariant the 4-momentum of a massless particle moving in the z
direction [8]. Furthermore, when applied to the 4-potential of a plane electromagnetic
wave, it performs a gauge transformation [10-12]. Therefore, the Lorentz-boosted
rotation given in (19) becomes a gauge transformation. Indeed, the Thomas rotation
stands between slow particles with spin degrees of freedom and massless particles with
the helicity and gauge degrees of freedom, as is illustrated in table 1.
    Using techniques different from the Thomas kinematics given in this paper, we
have reported in our earlier publications [15,16] that Lorentz-boosted rotations become
gauge transformations in the limit of large momentum and/ or small mass. What is
new in the present paper is that the Thomas effect is one concrete physical example
of the Lorentz-boosted rotation which stands between massive and massless particles.
    Let us illustrate what we did above using the representation of SL(2, c) for spino!
particles. The group of Lorentz transformations is generated by three rotation
generators Sj and three boost generators K j • They satisfy the Lie algebra:
                 BI
                   (±) _
                          -
                              (eX P(±1] 12)                                                    (24)
                                    1
Consequently, there are two different forms of T:
                 T(±)     = B\±) W-I(B\±»-I.                                                   (25)
In the limit of large 1]
(26)
This reflects the fact that, in the SL(2, c) regime, there are two different representations
of the E(2)-like little group for a given direction [22]. The 4 x 4 gauge transformation
matrix of (19) can be constructed from the direct product of the above 2 x 2 matrices
[16, 22].
Acknowledgment
We would like to thank Professor Eugene P Wigner for very illuminating discussions
on his little groups and table 1.
References
Linear canonical transformations of coherent and squeezed states in the Wigner phase space
                                                            D. Han
       /Vational Aeronautics and Space Administration, Goddard Space Fligh"t Center (Code 636), Greenbelt, Maryland 20771
                                                           Y. S. Kim
                  Department of Physics and Astronomy, Unh'ersity of lvlary/and, College Park, Maryland 20742
                                                       Marilyn E. Noz
                           Department of Radiology, New York University. New York, l\/cw York ]0016
                                                  (Received 8 September 19871
                    It i, ~h()wn that classical linear canonical transformations are possible in the Wigner phase
               . . pace. Coherent and squeezed states arc shown to be linear canonical transforms of the ground-
               state harmonic oscillator. It is therefore possihlc to evaluate the Wigner functions for coherent
               and squeezed states from that for the harmonic oscillator. Since the group of linear canonical
               transformations has a subgroup whose algebraic property is the same as that of the (2 + 0-
               dimensional Lorentz group, it may be possible to test certain properties of [he Lorentz group using
               optical devices. A possible experiment to measure the Wigner rotation angle is discussed.
       S.(7/)=
                  e V/2    0 0]
                 [ 0 e- v/2 0 .                                (4)
                                                                         [B I ,Nd=(i/2)NI' [B I ,N2]=(-i/2)N 2 ,
                  o        0             I                               [B 2,Nd=(i/2)N2, [B 2,N2]=(i/2)NI'
                                                                                                                              (13)
                                                                         [N I ,L]=(i/2)N2' [N I ,L]=(-i/2)N I ,
The elongation along the x axis is necessarily the con-
traction along the p axis.                                               [N I ,N2 ]=0.
   Since a canonical transformation followed by another
one is a canonical transformation, the most general form             These commutators, together with those of Eq. (12),
of the transformation matrix is a product of the above               form the set of closed commutation relations (or Lie
three forms of matrices. We can simplify these                       algebra) of the group of canonical transformations. This
mathematics by using the generators of the transforma-               group is the inhomogeneous symplectic group in the
tion matrices. If we use T(u,v) for the translation ma-              two-dimensional space or ISp(2).1I
trix given in Eq. (2), it can be written as
                                                                        The translations form an Abelian subgroup generated
                                                               (5)
                                                                     by NI and N 2 • Since their commutation relations with
                                                                     all the generators result in N I' N 2' or 0, the translation
                                                                     subgroup is an invariant subgroup. The translations and
                                                                     the rotation form the two-dimensional Euclidean group
                 oOil
                                                                     generated by N I ,N2 , and L, which have closed commu-
                   0 ,     N 2 = [0
                                  0 00 01
                                       i .                     (6)
                                                                     tation relations. This group also has been extensively
                 o0               0 0 0                              discussed recently in connection with the internal space-
                                                                     time symmetries of massless particles. 16. 17
The rotation matrix is generated by
                                                                        Indeed, it is of interest to see how the representations
                            ~ j,
                  -i/2                                               of the Lorentz group can be useful in optical sciences. It
                      o                                        (7)   is also of interest to see how the experimental resources
                      o                                              in optical science can be helpful in understanding some
                                                                     of the "abstract" mathematical identities in group
and                                                                  theory.
LORENTZ TRANSFORMATIONS                                                                                                 499
                                                                  [=±l[a~r-x'l
 If ",Ix) is a solution of the Schriidinger equation, the
Wigner distribution function in phase space is defined as                                                              (18)
These operators satisfy the commutation relations given       The interchange of the above two translations results in
in Eqs. (12) and (13). We can therefore derive the alge-      a mUltiplication of the wave function by a constant fac-
braic relations involving the above differential forms us-    tor of unit modulus.
ing the matrix representation discussed in Sec. II.              However, this factor disappears when the Wigner
  The rotation of the translation operators takes the         function W is constructed according to the definition of
form                                                          Eq. (14). Therefore, the translation along the x direction
                                                              and the translation along the p direction commute with
    RWIN1R( -/1)=         [cos~ 1NI-lsin~ 1N , ,              each other in the Wigner phase space. This means that
                                                              the commutation relation [N 1,N,1=O in the Wigner
                                                       (16)   phase space and the Heisenberg relation [N I' N ,1 = - i
    RI/lI\',RI    /11= [Sill   i   INI +   [cos~ IN,          are perfectly consistent with each other. The basic ad-
                                                              vantage of the Wigner phase-space representation is that
                                                              its canonical transformation property is the same as that
Under the same rotation, the squeeze generators become        of classical mechanics.
                                                                 We now have three sets of operators. The first set
    RW)B1R I -/l)=(cos/l)B 1+ (sin/l)B, '                     consists of the three·by-three matrices in Eqs. (6), (7),
                                                       (17)   (10), and (III, and this set is for classical mechanics.
    R 1!JIB,R I -!J)= -lsin!J)B 1+lcos8)B,                    The differential operators in two-dimensional phase
                                                              space form the second set, and they are for the Wigner
Likewise, we can derive all the algebraic relations using     function. The third set consists of the differential opera-
matrix algebra. The important point is that the group of      tors of Eq. (18) applicable to the Schriidinger wave func-
canonical transformations in the Wigner phase space is        tion. The first and second sets are the same. While both
identical to that for classical mechanics.                    the second set of double-variable operators and the third
   Next, let us consider the above transformations in         set of single-variable operators are extensively used in
terms of operators applicable to the Schriidinger wave        the literature,II,14,18 it is interesting to see that the con-
function. From the expression of Eq. (14) it is quite         nection between these two sets can be established
clear that the operation e ~h>x on the wave function leads    through the Wigner function.
to a translation along the P axis by u. The operation of         The transformations discussed in this section consti-
exp[ -u(a/ax)1 on the wave function leads to a transla-       tute the basic language for coherent and squeezed states
tion of the above distribution function along the x axis      in quantum optics. The relevance of the translation in
by u.                                                         phase space to coherent states has been noted before,l
   Likewise, the operation in the Wigner phase space of       The word squeeze comes from quantum optics. It has
ix(a/ap) and ip(a/ax) become x' /2 and f(a/ax )', re-         been also noted that its mathematics is like that of
spectively. Thus, the transformations in phase space can      (2 + II-dimensional Lorentz transformations. As was
be generated from the operators applicable to the wave        emphasized in the literature, I', 17 combining translations
function. The generators applicable to the wave func-         with Lorentz transformations is not a trivial problem.
tion are                                                      We shall discuss the problem in Secs. V and VI.
500                                                                                                             CHAPTER IX
                                                                                 f-
monic oscillator is the same as the study of a circle on
                                                                 L=± [[ :x
the two-dimensional plane. The canonical transforma-
tion consists of rotations, translations, and area-
preserving elliptic deformations of this circle. These
                                                                                      x2   j=+(-m.                          (29)
                                                 r
                                                             case as
a t =(I/1I'2)
                                                      (25)
                                                                     p
where T(r,O) and R(8) are the translation and rotation
operators. Because the circle of Eq. (23) is invariant un-
der rotations around the point where x =r and p =0, the
above Wigner function is the same as the translated
Wigner function,
                                                                                                       re7j/2
   Let us next elongate the translated circle of Eq. (24)
along the x direction. The circle will be deformed into
      e -V(x _r')2+e vp2= 1 ,                         (27)     FIG. 1. Coherent and squeezed states in the Wigner phase
                                                             space. The circle centered around the origin describes the
where
                                                             ground-state harmonic oscillator. The circle around   (r,O)   is for
      r'=re Tl !2   .                                        the coherent state. This coherent state can be squeezed to el-
                                                             lipse along the x axis, with a real value of the squeeze parame-
If we rotate this ellipse, the resulting Wigner function     ter. When the squeeze parameter becomes complex then the
will be                                                      ellipse is rotated around the origin in tbe Wigner pbase space.
LORENTZ TRANSFORMATIONS                                                                                                501
These operators serve two distinct purposes in physics.        We can obtain this state by applying the translation
They are step-up and step-down operators for the one-          operator to the ground state,
dimensional harmonic oscillator in nonrelativistic quan-
tum mechanics.                                                                                                         (38)
  On the other hand, in quantum-field theory, they serve
as the annihilation and creation operators. We are here        where
interested in the creation and annihilation of photons.
Then, what is the physics of the phase space spanned by               T(a)=exp(aa'-a*a) .
x and p variables? Indeed, the concept of creation and
                                                               The translation operator in the phase space depends on
annihilation comes from the commutation relation
                                                               two real parameters. In the above case, the parameter a
                                                        (31)   is a complex number containing two real parameters.
                                                                  It is possible to evaluate the Wigner function from the
This form of uncertainty relation states also that the         above expression to obtain the form given in Eq. (25),7,9
area element in phase space cannot be smaller than             with
Planck's constant. The area element in the Cartesian
coordinate system is (~x)( ~p). It is also possible to
write the area element in the polar coordinate system. If
this area is described in the polar-coordinate system, the     It is also possible to obtain the Wigner function starting
uncertainty relation is the relation between phase and in-     from a real value of a by rotation. From the rotation
tensity.20 This is the uncertainty relation we are discuss-    properties of the a and a t operators given in Sec. IV, the
ing in this paper. We are particularly interested in the       rotation of this operator becomes
minimum-uncertainty states.
   In both Eq. (25) and Eq. (28) the rotation plays the                                                                (40)
essential role. Let us see how the operators a and a' can
be rotated. For two operators A and B, we note the re-         with
lation 21
e ABe - A=B +[ A,B ]+H A,[A,Bll                                This means that we can make a complex starting from a
                                                               real number r by rotation.
            +HA,[A,[A,Blll+"              .             (32)
                                                                 The squeezed state ! s,a) is defined to be , ,3,S,IS,22
and
                                                                      !s,a)=S(s)!a)=S(s)T(a)!O) ,                      (41)
      [L,a]=-ta, [L,a']=ta'.                            (33)
                                                               where
Since R((J)=e- iOL ,
      R(lllaRI   !lIcit     iO"la,                                 S(s)=exp [fa'a'-f aa         1                      (42)
                                                        (34)
      R(IJla RI·-OJ=(e,",2)a'.
                                                               Here again the parameter S is complex and contains two
  In terms of the a and a' operators, the generators of        real numbers for specifying the direction and the
canonical transformations take the form                        strength of the squeeze.
                                                                  If 5 is real, it is possible to evaluate the Wigner func-
      N,=(-;/v2)(a-a'l, N 2 =(\/Vl)(a+a t J,                   tion by direct evaluation of the integral. If, on the other
      L =t(aa' +a 'a   J,                               (35)   hand,s is complex, the present authors were not able to
                                                               manage the calculation, We can, however, overcome
      B,=t(aa-ata\ B,=t(aa+ata').                              this difficulty by using the method of canonical transfor-
                                                               mation developed in this paper, We can make S com-
We can rotate these operators using Eq. (34). In particu-      plex starting from a real value of '1/ by rotating the above
lar, the rotations given in Eq. (17) can now be written as     squeeze operator using the rotation properties of the a
      R(IJ)aaR(-IJ)=e-iOaa,                                    and a t operators.
                                                        (36)      Let us start from a real value of S for which the evalu-
                                                               ation is possible. S For the real value '1/, the squeeze
                                                               operator becomes
These relations will be useful in evaluating the Wigner
function for the squeezed state.
may therefore be possible to design experiments in optics          This will deform the circle into the ellipse
to test the mathematical identities in the Lorentz group.                (e~")x2+(e")p2=1 .                                       (46)
The Wigner rotation is a case in point. Two successive
applications of Lorentz boosts in different directions is               If we squeeze the circle centered around the origin
not a Lorentz boost, but is a boost preceded by a rota-            along the 012 direction with the deformation parameter
tion which is commonly called the Wigner rotationy~26              '1/, the squeeze matrix is
      S((},A)=
                 cosh   + +]
                         + [sinh       cosO
                                                                                                                                 (47)
                        [sinh+ ]sino               A-
                                              cosh "2   [.smh"2A] cosO
                                                        r
                                                                   tana=       (sinO)[sinhA+(tanh'1/)(coshA-1 )cosO]
(51)
                                    r
                                                                  a!2 direction.
e- S [(x-alcosT+(y-bISinT
where
The parameters g and a can be measured or determined              work indicates that some of optical experiments may
from Eq. (491. The angle ¢ determined from the above              serve as analog computers for the (2 + I }-dimensional
expression can be compared with the angle calculated              Lorentz group.
from '1, J..., and a according to the expression given in Eq.
(50).
   Indeed, if the parameters of the coherent and squeezed
                                                                                VII. CONCLUDING REMARKS
states can be determined experimentally, the Wigner ro-
tation can be measured in optical laboratories. The
question is then whether this experiment can be carried              It is quite clear from this paper that the coherent and
out with the techniques available at the present time.            squeezed states can be described by circles and ellipses in
While the analysis presented in this section is based on          the Wigner phase space. One circle or ellipse can be
single-mode squeezed states, the squeezed states that             transformed into another by area-preserving transforma-
have been generated to date are two-mode states. 3.\8             tions. The group governing these transformations is the
Hence, in order to be directly applicable to experiment,          inhomogeneous symplectic group ISp(21.
the present work has to be extended to the two-mode                  We studied the generators of these transformations
case, unless the single-mode squeezed state can be gen-           both for phase space and for the Schrodinger representa-
erated in the near future. In the meantime, the present           tion. It has been shown that the connection between
504                                                                                                                 CHAPTER IX
 these two sets of operators can be established through            be possible to measure the Wigner rotation angle in opti-
 the Wigner function.                                              cal laboratories.
   We also studied in detail rotations in the Wigner
phase space and their counterparts in the Schriidinger                                ACKNOWLEDGMENTS
representation. It is now possible to evaluate the Wigner
function for a squeezed state with a complex parameter.               We are grateful to Professor Eugene P. Wigner for
   The correspondence (local isomorphism) between Sp(2)            very helpful discussions on the subject of canonical
 and the 12 + I)-dimensional Lorentz group allows us to            transformations in quantum mechanics and for main-
 study quantum optics using the established language of            taining his interest in the present work. We would like
 the Lorentz group. At the same time it allows us to               to thank Mr. Seng-Tiong Ho and Dr. Yanhua Shi for ex-
 look into possible experiments in optical science to study        plaining to us the experimental techniques available at
 some of mathematical formulas in group theory. It may             the present time.
I),  R, Klauder, Ann. Phys. IN.Y.) II, 123 11960); R. J.              Space, Col/ege Park, Maryland, 1986, edited by y, S. Kim
  Glauber, Phys. Rev. Lett, 10, 84 11963); F, T. Arechi, ibid.        and W, W. Zachary ISpringer-Verlag, Heidelberg, 1987),
  IS, 9\2 11965); E. Goldin, Waves and Photons IWiley, New         7p. Carruthers and F. Zachariasen, Rev. Mod. Phys. 55, 245
  York, 1982); J. R. Klauder and B. S. Skagerstam, Coherent           (19831.
  States (World Scientific, Singapore, 1985).                      BH. Goldstein, Classical Mechanics, 2nd ed. (Addison·Wesley,
'D, Stoler, Phys. Rev, D 1, 3217 II 970l; H. p, Yuen, Phys.           Reading, MA, 1980),
  Rev, A 13, 2226 11976),                                          9H. Weyl, The Theory of Groups and Quantum Mechanics, 2nd
3For some of the recent papers on the squeezed state, see C. M.       ed. (Dover, New York, 1950),
  Caves, Phys. Rev. D 23, 1693 1198]); D. F. Walls, Nature         10K. B. Wolf, Kinam 6, 141 (19861.
  ILondon) 306, 141 (1983); R, S. Bondourant and J. H.             llA. Perelomov, Generalized Coherent States (Springer-Verlag,
  Shapiro, Phys, Rev. D 30, 2584 II 984); M. D. Reid and D. F.        Heidelberg, 19861.
  Walls, Phys. Rev. A 31, 1622 (1985); B. Yurke, ibid. 32, 300     12E, p, Wigner, Ann. Math. 40, 149119391; V. Bargmann, ibid.
  (1985); 32, 311 119851; R, E. Slusher, L. W. Hollberg, B.           48, 568 119471; V, Bargmann and E, p, Wigner, Proc, Nat.
  Yurke, J. C. Mertz, and J. F. Valley, Phys. Rev, Lett. 55,          Acad, Sci. U,S,A, 34, 211 119481.
  2409119851; C. M. Caves and B. L. Schumaker, Phys, Rev.           11H. Weyl, Classical Groups, 2nd ed, (Princeton University,
  A 31, 3068 119851; B. L. Schumaker and C, M, Caves, ibid.           Princeton, N.J., 1946).
  31, 3093 119851; M. J, Collet and D, F, Walls, ibid, 32, 2887     !4V. Guillemin and S. Sternberg, Symplectic Techniques in
  11985); J, R. Klauder, S. L. McCall, and B. Yurke, ibid. 33,        Physics ICambridge University Press, Cambridge, England,
  3204 119861; S, T, Ho, P. Kumar, J, H, Shapiro, ibid. 34, 293       1984),
  119861; 36, 3982 119871; Z. Y. Ou, C. K. Hong, and L. Man-        lly. S, Kim and M, E, Noz, Am, J, Phys. 51, 368 119831.
  del, ibid, 36, 192 119871.                                       16 0. Han, O. Son, and Y. S. Kim, Phys. Rev. 0 26, 3717
4E. p, Wigner, Phys. Rev. 40, 749 11932), For review articles         11982),
  on this subject, see E. P. Wigner, in Perspective in Quantum      l7y. S. Kim and M. E. Noz, Theory and Applications of the
  Theory, edited by W. Yourgrau and A. van def Merwe (MIT              Poincare Group (Reidel, Dordrecht, 1986l.
  Press, Cambridge, MA, 1971); M. Hillery, R, F, O'Connell,        18B. Yurke, S. McCall, and J. R. Klauder, Phys. Rev, A 33,
  M. O. Scully, and E, P. Wigner, Phys. Rep,· 106, 121 119841;        4033 (19861.
  N. L. Balazs and B, K. Jennings, ibid. 104C, 347 (1984),         19S. Sh10mo, J, Phys, A 16, 3463 (1983).
5W. Schleich and J. A. Wheeler, in Proceedings of the First In·    'oW. Heit1er, Quantum Theory of Radiation, 3rd ed, (Oxford
  ternational Conference all the Physics of Phase Space, College      University Press, London, 1954),
  Park, Maryland 1986, edited by y, S. Kim and W. W. Za-           21W, Miller, Symmetry Groups and Their Applications
  chary ISpringer-Verlag, Heidelberg, 19871; Nature ILondon)          (Academic, New York, 1972); D. R, Truax, Phys, Rev, D 31,
  326,57411987).                                                      1988119851.
6W. H. Louisell, Quantum Statistical Properties of Radiation       "This definition is equivalent to !g,a) = T(a')S(~) 10), with a
   IWiley, :-lew York, 19731; B. V, K. Vijaya Kumar and C. W.         suitably adjusted value of a, since the translation subgroup is
   Carroll, Opt. Eng, 23, 732 11984); R, L. Easton, A. J, Tick-       an invariant subgroup which enables us to write and
  nor, and H. H. Barrett, ibid. 23, 738 (19841; R. Procida and        ST=STS- 1S with ST- 1 =T', which is another element in
   H. W. Lee, Opt. Commun, 49, 201 11984); N. S_ Subotic and          the translation subgroup.
   B. E. A. Saleh, ibid, 52, 259 119841; A. Conner and Y. Li,      2lV, I. Ritus, Zh, Eksp, Teor, Fiz, 40, 352 11961) [Sov. Phys,-
   App!. Opt. 24, 3825 (1985); S. W, McDonald and A, N.               JETP 13, 240 (19611],
   Kaufman, Phys, Rev, A 32, 1708 119851; O. T. Serima, ),         '4A. Chakrabarti, J. Math. Phys, 5,174711964),
   Javanainen, and S. Varro, ibid. 33, 2913 11986); K, J. Kim,     25D. Han, Y. S. Kim, and D. Son, Class. Quantum Grav. 4,
   Nuc!' Instrum, A 246, 71 119861; H. Szu, in Proceedings of         1777 11987).
   the First International Conference on the Physics of Phase      26A, Ben-Menahim, Am. ). Phys. 53, 62 11985),
                           Of Related Interest
              Theory and
           Applications of the
            Poincare Group
                                    by
                            Y. S. Kim
    Department of Physics and Astronomy, University of Maryland, U.S.A.
and
 Special relativity and quantum mechanics are likely to remain the two
 most important languages in physics for many years to come. The
 underlying language for both disciplines is group theory. Eugene P.
 Wigner's 1939 paper on the Poincare group laid the foundation for
 unifying the concepts and algorithms of quantum mechanics and
 special relativity. This book systematically presents physical examples
which can best be explained in terms of Wigner's representation
theory. The examples include the relativistic quark model, hadronic
mass spectra, the Lorentz-Dirac deformation of hadrons, the form
factors of nucleons, Feynman's parton picture and the proton structure
function, the kinematical origin of the gauge degrees of freedom for
massless particles, the polarization of neutrinos as a consequence of
the gauge invariance, and massless particles as the (small-
mass/large-momentum) limits of massive particles.
This book is intended mainly as a teaching tool directed toward those
who desire a deeper understanding of group theory in terms of
examples applicable to the physical world and/or of the physical world
in terms of the symmetry properties which can best be formulated in
terms of group theory. Each chapter contains problems and solutions,
and this makes it potentially useful as a textbook. Graduate students
and researchers interested in space-time symmetries of relativistic
particles will find the book of interest.