Invitation To QM
Invitation To QM
Invitation to
Quantum Mechanics
Daniel F. Styer
ii
Daniel F. Styer
Schiffer Professor of Physics, Oberlin College
The copyright holder grants the freedom to copy, modify, convey, adapt,
and/or redistribute this work under the terms of the Creative Commons
Attribution Share Alike 4.0 International License. A copy of that license is
available at http://creativecommons.org/licenses/by-sa/4.0/legalcode.
iii
iv
Synoptic Contents 1
Welcome 3
2.1 Quantization . . . . . . . . . . . . . . . . . . . . . . . . . 47
2.2 Interference . . . . . . . . . . . . . . . . . . . . . . . . . . 59
2.3 Aharonov-Bohm effect . . . . . . . . . . . . . . . . . . . . 67
2.4 Light on the atoms . . . . . . . . . . . . . . . . . . . . . . 70
2.5 Entanglement . . . . . . . . . . . . . . . . . . . . . . . . . 72
2.6 Quantum cryptography . . . . . . . . . . . . . . . . . . . 84
2.7 What is a qubit? . . . . . . . . . . . . . . . . . . . . . . . 88
v
vi Contents
7. Atoms 229
Welcome
1
2 Synoptic Contents
6. Identical Particles
7. Atoms
This book is an invitation. Where might you and quantum mechanics travel
together?
Welcome
3
4 Welcome
speed
c -
quantum classical
slow mechanics mechanics
0 - size
small big
Finally, is there a framework that applies to situations that are both fast
and small? There is: it is called “relativistic quantum mechanics” and is
closely related to “quantum field theory”. Ordinary non-relativistic quan-
tum mechanics is a good approximation for relativistic quantum mechanics
when applied to slow things. Relativistic mechanics is a good approxima-
tion for relativistic quantum mechanics when applied to big things. And
classical mechanics is a good approximation for relativistic quantum me-
chanics when applied to big, slow things.
6 Welcome
you write out for yourself, in your own words, a summary of the ideas and
equations that you consider most important and most difficult to remember.
The object of the biographical footnotes in this book is twofold: First, to
present the briefest of outlines of the subject’s historical development, lest
anyone get the misimpression that quantum mechanics arose fully formed,
like Aphrodite from sea foam. Second, to show that the founders of quan-
tum mechanics were not inaccessible giants, but people with foibles and
strengths, with interests both inside and outside of physics, just like you
and me.
Teaching tips
Acknowledgments
We are used to things that vary continuously: An oven can take on any
temperature, a recipe might call for any quantity of flour, a child can grow to
a range of heights. If I told you that an oven might take on the temperature
of 172.1 ◦ C or 181.7 ◦ C, but that a temperature of 173.8 ◦ C was physically
impossible, you would laugh in my face.
So you can imagine the surprise of physicists on 14 December 1900,
when Max Planck announced that certain features of blackbody radiation
(that is, of light in thermal equilibrium) could be explained by assuming
that the energy of the light could not take on any value, but only certain
discrete values. Specifically, Planck found that light of frequency f could
take on only the energies of
E = nhf, where n = 0, 1, 2, 3, . . ., (1.1)
and where the constant h (now called the “Planck constant”) is
h = 6.626 070 15 × 10−34 J s. (1.2)
That is, light of frequency f can have an energy of 3.0 hf , and it can have
an energy of 4.0 hf , but it is physically impossible for this light to have an
energy of 3.8 hf . Any numerical quantity that can take on only discrete
values like this is called “quantized”. By contrast, a numerical quantity
that can take on any value is called “continuous”.
The photoelectric effect (section 1.2) supplies additional evidence that
the energy of light comes only in discrete values. And if the energy of
light comes in discrete values, then it’s a good guess that the energy of
an atom comes in discrete values too. This good guess was confirmed
through investigations of atomic spectra (where energy goes into or out of
9
10 Light in thermal equilibrium
You know that the logs of a campfire, or the coils of an electric stove, glow
orange. You might not know that objects at higher temperatures glow
white, although blacksmiths and glass blowers are quite familiar with this
fact and use it to judge the temperature of the metal or the molten glass
they work with. Objects at still higher temperatures, like the star Sirius,
glow blue. A nuclear bomb explosion glows with x-rays.
Going down the temperature scale, the tables, chairs, walls, trees, and
books around us glow with infrared radiation. (Many people are unaware
of this fact because our eyes can’t detect infrared light.) In fact, our own
bodies glow in the infrared — at a somewhat shorter wavelength than our
books, because our bodies are slightly warmer than our books. And the
bitter cold of outer space glows with the famous 3 K cosmic microwave
background radiation.
All these situations are examples of electromagnetic radiation — light
— in thermal equilibrium. What does that mean? The light streaming
from, say, a red neon tube is not in thermal equilibrium: for one thing, it
has only one color, for another all the light streams in the same direction.
Just as a stream of nitrogen molecules, each one with the same speed and
each one moving in the same direction, is not in thermal equilibrium, so
the red light, all the same wavelength and all moving in the same direction,
is not in thermal equilibrium. But after that light is absorbed by matter
at a given temperature, then re-emitted, then reabsorbed and then re-
emitted again, several times, that light relaxes into equilibrium at the same
1 This book is about physics, not the history of physics. In order to present the physical
ideas clearly they are sometimes developed ahistorically. For example in the next section
Planck’s 1900 radiation law is developed as a refinement of the 1905 Rayleigh–Jeans law.
In section 1.5.2 Bohr’s 1913 atomic theory is developed as a consequence of de Broglie’s
1924 concept of matter waves.
“Something Isn’t Quite Right” 11
temperature as the matter with which it’s been interacting. This light in
thermal equilibrium has a variety of wavelengths, and it moves with equal
probability in any direction. In exactly the same way, a stream of nitrogen
molecules will, after many collisions, have a variety of molecular speeds,
and the molecules move with equal probability in any direction.
If you want to do a high-accuracy experiment with light in thermal equi-
librium, you will want light that has been absorbed and re-emitted many
times. The worst possible object for putting light into thermal equilibrium
would be a mirror, which reflects rather than absorbs most incoming light.
Somewhat better would be matter painted white, which reflects much in-
coming light. Better still would be matter painted black. Best of all would
be a cavity surrounded by matter, like a cave, so that the light in the cavity
is absorbed by the walls and re-emitted many times. For these experimental
reasons, light in thermal equilibrium is often called “blackbody radiation”
or “cavity radiation”.
Qualitative arguments explain a number of familiar features of black-
body radiation. You know that the atoms in matter oscillate: as the tem-
perature increases, the oscillations become both farther and faster. The
“farther” oscillations suggest that high-temperature objects should glow
brighter; the “faster” oscillations suggest that they should glow with higher-
frequency (hence shorter-wavelength) light. Turning these qualitative ar-
guments into a quantitative prediction requires an understanding of elec-
trodynamics and of statistical mechanics beyond the needs of this book.
Here I summarize the reasoning involved. First, three principles from elec-
trodynamics:
(1) In all cases, the state of light within a cavity can be expressed
as a sum over the “normal modes” of light within that cavity. Normal
modes come about when one half-wavelength fits within a cubical cavity;
or two half-wavelengths; or three; or any integer. This principle states that
~ n (~x, t) denotes the electric field due to the light of the normal mode
if E
indexed n, then the electric field of an arbitrary state of light is
X
~ x, t) =
E(~ ~ n (~x, t)
an E (1.3)
n
where an sets the amplitude of that mode in the particular state. Each
mode is characterized by a particular wavelength.
12 Light in thermal equilibrium
energy (mJ)
6
1.5
1.0
T = 6000 K
0.5
T = 5000 K
T = 4000 K
0.0 -
wavelength λ (nm)
(2) The energy of any arbitrary state of light is just the sum of the
energies due to each normal mode.
(3) There are more normal modes at shorter wavelength. In May 1905,
Lord Rayleigh2 calculated that the number of modes with wavelength falling
within the window between λ and λ + dλ was
64 πV
dλ,
λ4
where V is the volume of the cavity (assuming a cavity large in the sense
that V λ3 ). Two months later James Jeans3 pointed out that Rayleigh
had made a counting error: the correct result is
8πV
dλ. (1.4)
λ4
To these three electrodynamics principles, add one principle from clas-
sical statistical mechanics: In thermal equilibrium, the energy4 of each
mode is kB T , where T is the (absolute) temperature and kB is the Boltz-
mann constant. This principle is called “equipartition”: some modes have
short wavelengths and some have long, but the energy is equally partitioned
among the various different modes, independent of wavelength.
[[The Boltzmann5 constant kB comes up whenever drawing a connection
between energy and temperature. The U.S. National Institute of Standards
and Technology gives its value as
kB = 1.380 649 × 10−23 J/K, (1.5)
2 John William Strutt, the third Baron Rayleigh (1842–1919), of England, is usually
called just “Lord Rayleigh”. Although particularly interested in acoustics, he made con-
tributions throughout physics: he was the first to explain why the sky is blue and
how seabirds soar. The mode counting described here was published in Nature on
18 May 1905.
3 English physicist and astronomer (1877–1946). His 1930 popular book The Mysterious
Universe did much to introduce the new quantum mechanics to a wide audience. His
correct mode counting argument was published in Philosophical Magazine in July 1905.
Rayleigh acknowledged his error (“Mr. Jeans has just pointed out that I have introduced
a redundant factor 8. . . . I hasten to admit the justice of this correction.”) in Nature on
13 July 1905.
4 Technically the “average energy”, but in these situations the thermal fluctuations
about average energy are so small I’ll just call it the “energy”.
5 Ludwig Boltzmann (1844–1906) of Austria developed statistical mechanics, which ex-
plains the properties of matter in bulk (such as density, hardness, luster, viscosity, and
resistivity) in terms of the properties of atoms (such as mass, charge, and potential en-
ergy of interaction). His students include Paul Ehrenfest, who we will meet later, and
Lise Meitner, who discovered nuclear fission.
14 Light in thermal equilibrium
but this number is hard to remember. For one thing, the joule (J) is a huge
unit for measuring atomic energies — using it would be like measuring the
distance between screw threads in miles or kilometers. The energy unit
typically used in atomic discussions is instead the “electron volt”, where
1 eV = 1.60 × 10−19 J. (1.6)
Second, I like to remember kB through a product: at room temperature,
1
about 300 K, the value of kB T is close to 40 eV. (This knowledge has
rescued me several times during physics oral exams.) The famous ideal
gas constant R (as in pV = nRT ) is just kB times the Avogadro number,
6.02 × 1023 .]]
Putting these four principles together results in the “Rayleigh–Jeans
law”, which says that for light in thermal equilibrium at temperature T
within a volume V , the electromagnetic energy due to wavelengths from λ
to λ + dλ is
8πV
(kB T ) 4 dλ. (1.7)
λ
This formula has numerous admirable features: It is dimensionally consis-
tent, as required. Doubling the volume results in doubling the energy, as ex-
pected. Higher temperature results in higher energy, in agreement with our
previous expectation that “high-temperature objects should glow brighter”.
The very long wavelength modes are unimportant because there’s no signif-
icant amount of energy in them anyway, so we can disregard our previous
qualifier that the formula holds only when V λ3 .
On the other hand, there is nothing in this formula supporting our
expectation and common experience that high-temperature objects should
glow with shorter-wavelength light. To the contrary, at any temperature
the light spectrum should have exactly the same 1/λ4 character! Even
worse: What is the total energy in blackbody radiation? It is
Z ∞ ∞
1 1 1
(kB T )8πV 4
dλ = (kB T )8πV − 3
= +∞. (1.8)
0 λ 3 λ 0
Infinite energy! In fact, infinite energy at any finite temperature! If this
were true, then every book and table and wall — not to mention every
person — would be as deadly as an exploding nuclear bomb. The infinite
energy arises from the short wavelengths of the spectrum, so this disastrous
feature of the Rayleigh–Jeans prediction is called the “ultraviolet catastro-
phe”.
“Something Isn’t Quite Right” 15
energy (µJ)
6
0.6
d
d
d
d
d
0.4 d
Rayleigh–Jeans law
d
d
d
d
0.2
d d
d
d d
0.0 -
In the year 1900 Max Planck7 decided to pursue this third possibility.
He wondered what would happen if the energy of a normal mode with
frequency f couldn’t take on any possible energy, but only certain values
E = nhf, where n = 0, 1, 2, 3, . . . (1.9)
and where h is some constant to be determined by a fit to experiment.
(Except that, being a formal German professor, Planck didn’t say that he
“wondered”, he said that he “hypothesized”.) Because f λ = c for light,
Planck’s hypothesis is equivalent to
hc
E=n. (1.10)
λ
When Planck worked out the statistical mechanical consequences of his
hypothesis, he found that if it were correct then the energy of a mode
would not be the equipartition result kB T , but instead
hc/λ
. (1.11)
e(hc/λ)/(kB T )
−1
No longer would the energy be equally partitioned. . . instead it would de-
pend upon wavelength.
Before rushing into the laboratory to test Planck’s idea, let’s see if it
even makes sense. Planck’s hypothesis is that the energy is discrete, not
continuous, that it comes in packets of size hf . (Except that he didn’t call
them packets, he called them “quanta”, from the Latin word for “amount”.
The singular is “quantum”, the plural is “quanta”.) When the frequency is
small, that is, when the wavelength is long, these quanta are so small that
the continuous approximation ought to be excellent. What does Planck’s
formula say for for long wavelengths? If λ is large, then (hc/λ)/(kB T ) is
small. How does ex behave when x is small? Remember Taylor’s formula:
ex ≈ 1 + x when |x| 1.
Thus for long wavelengths
hc/λ
e(hc/λ)/(kB T ) ≈ 1 + when λ hc .
kB T kB T
7 Max Karl Ernst Ludwig Planck (1858–1947) was a German theoretical physicist par-
energy (µJ)
6
0.6
d
d
d
d
d
0.4 d
Rayleigh–Jeans law
d
d
d
d
0.2
d
Planck law d
d
d d
0.0 -
We saw that the Rayleigh–Jeans formula (1.7) could not represent reality,
because it said that any body in thermal equilibrium would contain an
infinite amount of radiant energy. Does the Planck formula (1.13) suffer
from the same excruciating defect?
“Something Isn’t Quite Right” 19
Solution: If the Planck radiation law holds, then the energy in blackbody
radiation of all wavelengths is of course
Z ∞
hc/λ 8πV
dλ. (1.14)
0 e hc/(kB T λ) − 1 λ4
You will be tempted to jump immediately into evaluating this integral,
but I urge you to pause for a moment and find a good strategy before
executing it. Integration is a mathematical, not a physical, operation, so I
will first convert to a mathematical variable — that is, to a variable without
dimensions. A glance suggests that the proper dimensionless variable is
hc/λ
x= . (1.15)
kB T
(Comparison to the crossover wavelength λ× in definition (1.12) shows that
this variable is just the ratio x = λ× /λ.) Converting the integral to this
variable shows that the total energy is
(kB T )4 ∞ x3
Z
8πV dx. (1.16)
(hc)3 0 ex − 1
Even without evaluating the integral, this equation gives us a lot of in-
formation. The integral on the right is just a number (unless it diverges)
independent of T (or h, or c), so the total energy of light in thermal equi-
librium is proportional to T 4 . This fact, called the Stefan–Boltzmann law,
is the formal result corresponding to our common experience that objects
glow brighter at higher temperatures.
20 Light in thermal equilibrium
x3 /(ex − 1)
6
1.5
1.0
0.5
0.0 - x
0 2 4 6
Solution: Like most equations in physics, the Planck radiation law (1.13
in our textbook) is concise, but this conciseness hides a lot of information.
This essay unpacks some of this information to see what the equation is
trying to tell us about nature. I address two points:
“Something Isn’t Quite Right” 21
Problems
1.1 How big is an atom?
How many times can a liter of water be cut in half until you’re left
with a single water molecule?
1.2 Units for atomic-sized energies
Physicists are fond of measuring typical atomic energies in
“eV/atom”, chemists are fond of measuring typical atomic energies
in “kJ/mole”. An energy of exactly 1 eV in an atom corresponds
to what energy, in kJ, in a mole of atoms? (In reporting your
numerical result, be sure to use significant figures and units: see
appendices A and B.)
1.2. Photoelectric effect 23
claimed that he had “thought a hundred times as much about the quantum problems as I
have about general relativity theory.” [Remark to Otto Stern, reported in Abraham Pais,
“Subtle Is the Lord. . . ”: The Science and the Life of Albert Einstein (Oxford University
Press, Oxford, UK, 1982) page 9.]
24 Photoelectric effect
the metal, some less so. Call the trapping energy of the least-well-bound
electron Ut .
(3) If an electron within the metal absorbs a certain amount of energy
from the light shining on it, then some of that energy will go into getting
the electron out of the metal. The rest will become kinetic energy of the
ejected electron. The electron ejected with maximum kinetic energy will
be the one with the minimum trapping energy, namely Ut . [The reasoning
up to this point has been purely classical.]
(4) Add to this the quantum hypothesis: the amount of light energy
available for absorption can’t take on any old value — it has to be hf .
Then the maximum kinetic energy of an ejected electron will depend upon
the frequency of the light shining on the metal:
KEmax = hf − Ut . (1.19)
If the light has low frequency, hf < Ut , then no electron will be ejected at
all.
This analysis is clearly oversimplified. A metal is a complex system
rather than a simple potential energy well, and it ignores the possibility
that the electron might absorb an energy of 2hf , or 3hf , or more, but it’s
worth an experimental test: Plot the measured maximum kinetic energy of
ejected electrons as a function of frequency f . Will the plot give a straight
line with slope h matching the slope determined through the completely
different blackbody radiation experiment?
The challenge was taken up by Robert A. Millikan.9 By 1916 he exper-
imentally verified Einstein’s prediction, writing that “Einstein’s photoelec-
tric equation has been subjected to very searching tests and it appears in
every case to predict exactly the observed results.” Nevertheless, he found
the quantum condition troubling: this same 1916 paper calls quantization
a “bold, not to say reckless, hypothesis” because it “flies in the face of the
thoroughly established facts of interference”, and in his 1923 Nobel Prize
acceptance speech he said that his confirmation of the Einstein equation
had come “contrary to my own expectation”.
9 American experimental physicist (1868–1953), famous also for measuring the charge
of the electron through his oil-drop experiment, and for his research into cosmic rays.
A graduate of Oberlin College and of Columbia University, he used his administrative
acumen to build the small vocational school called Throop College of Technology into
the research and teaching powerhouse known today as the California Institute of Tech-
nology. His photoelectric results were reported in “A direct photoelectric determination
of Planck’s ‘h’ ” Physical Review 7 (1 March 1916) 355–390.
“Something Isn’t Quite Right” 25
energy quantization in light is both wide and deep.10 It is worth your effort
to memorize that electromagnetic energy comes in lumps of magnitude
hc 1240 eV·nm
E= = . (1.20)
λ λ
[[This equation comes up frequently in the physics Graduate Record Exam
and in physics oral exams. I recommend that you remember the constant
hc in terms of the unit usually used for energy in atomic situations, namely
the electron volt, and the unit usually used for the wavelength of optical
light, namely the nanometer.]]
No one would write a computer program and call it finished without test-
ing and debugging their first attempt. Yet some approach physics problem
solving in exactly this way: they get to the equation that is “the solution”,
stop, and then head off to bed for some well-earned sleep without investi-
gating whether the solution makes sense. This is a loss, because the real
fun and interest in a problem comes not from our cleverness in finding “the
solution”, but from uncovering what that solution tells us about nature.
(Appendix D, “Problem-Solving Tips and Techniques”, calls this final step
“reflection”.) To give you experience in this reflection step, I’ve designed
“find the flaw” problems in which you don’t find the solution, you only test
it. Here’s an example.
This is a physics problem that you are not supposed to solve:
Four friends work this problem independently. When they get together
afterwards to compare results, they find that they have produced four dif-
ferent answers! Their candidate answers are
hc2
(a) λ̂ = 0.201
kB T
hc
(b) λ̂ = 0.201
kB T
hc
(c) λ̂ = 0.201 × 10−3
kB T
kB T
(d) λ̂ = 0.201
hc
Provide simple reasons showing that three of these candidates must be
wrong.
Solution: Candidate (a) does not have the correct dimensions for wave-
length. There are no problems with candidate (b), which is in fact the
correct “Wien displacement law”. Candidate (c) claims that at room tem-
1
perature (kB T = 40 eV), the dominant wavelength would be λ̂ = 10 nm,
deep in the ultraviolet, whereas the problem statement tells you that it’s
actually in the infrared. (Recall that hc = 1240 eV·nm.) Candidate (d)
not only has incorrect dimensions, it also shows the dominant wavelength
increasing, not decreasing, with temperature.
Problems
1.5 Visible light photons (recommended problem)
The wavelength of visible light stretches from 700 nm (red) to
400 nm (violet). (Figures with one significant digit.) What is
the energy range of visible photons?
28 Photoelectric effect
initial:
photon,
energy E0
c - s
stationary electron,
mass m
final:
3
c
scattered photon,
θ energy E
@ φ
@s
@
scattered electron,@
@
momentum p @@R
@
@
called “Prince de Broglie”, although I am told that he was actually a duke. He earned
an undergraduate degree in history, but then switched into physics and introduced the
concept of particle waves in his 1924 Ph.D. thesis.
“Something Isn’t Quite Right” 31
wavelength of
hc
λ= √
2Kmc2
1240 eV·nm
= p
2 × (100 eV) × (511 000 eV)
1240 nm
= p
2 × (102 ) × (0.511 × 106 )
1240 nm
= √
1.022 × 108
= 0.123 nm.
This is a very short wavelength — if it were an electromagnetic wave, it
would be in the x-ray regime (10 nm to 0.01 nm). In 1924, it was impossible
to manufacture slits — or anything else — with a size near 0.123 nm.
(Higher-energy electrons, or protons of the same energy, would have even
shorter wavelengths, and their wave character would be even harder to
discern.)
This difficulty was overcome by using, not human-manufactured slits,
but the rows of atoms within a crystal of nickel (with an atomic spacing of
0.352 nm). In a series of experiments executed from 1923 to 1927, Clinton
Davisson and Lester Germer12 showed that electrons scattering from nickel
exhibit interference as predicted by de Broglie’s strange hypothesis.
Since 1927, the wave character of particles, as demonstrated through in-
terference experiments, has been tested time and again. One breakthrough
came in 1987, when Akira Tonomura13 and his colleagues at the Hitachi
Advanced Research Laboratory in Tokyo demonstrated interference in elec-
trons thrown one at a time through a classic two-slit apparatus. In 2013,
Markus Arndt and his colleagues at the University of Vienna,14 building on
12 Davisson (1881–1958) and Germer (1896–1971) were experimenting at Bell Telephone
Laboratories in Manhattan with the ultimate goal not of testing quantum mechanics,
but of building better telephone amplifiers. Earlier, Germer had served as a World War I
fighter pilot. Later, he would become an innovative rock climber.
13 A. Tonomura, J. Endo, T. Matsuda, T. Kawasaki, and H. Ezawa, “Demonstration
Problems
1.10 de Broglie wavelengths for various particles
We have found the de Broglie wavelength of an electron with energy
100 eV. What about for a neutron with that energy? An atom of
gold? (A neutron’s mass is 1849 times the mass of an electron,
and a gold nucleus consists of 197 protons and neutrons.) If these
de Broglie waves had instead been electromagnetic waves, would
such wavelengths be characterized as ultraviolet, x-rays, or gamma
rays?
1.11 de Broglie wavelength for a big molecule
The 2013 experiments by Markus Arndt, mentioned in the
text, used the molecule C284 H190 F320 N4 S12 with mass 10 118 amu
(where the “atomic mass unit” is very close to the mass of a hydro-
gen atom), and with a velocity of 85 m/s. What was its de Broglie
wavelength?
1.12 Rephrasing the de Broglie relation (essential problem)
Rewrite the de Broglie relation, p = h/λ, in terms of the wavenum-
ber k = 2π/λ. Employ the shortcut notation ~ = h/2π and com-
pare your result to
p = ~k. (1.24)
of two-slit-type interference from a single atom” Physical Review A 100 (12 July 2019)
013407.
34 Wave character of electrons
scattering and nuclear structure determinations, II” Physical Review 95 (15 July 1954)
512–515.
1.4. How does an electron behave? 35
If the energy of light comes in quantized amounts, how about the energy
of an atom?
1.5.1 Experiments
to power. He went first to Denmark, then to the United States where he worked to build
the nuclear bomb. He authored a report recommending that U.S. nuclear bombs not be
used on Japanese cities without warning.
19 German physicist (1887–1975), nephew of Heinrich Hertz, for whom the unit of fre-
quency is named. Hertz was forced out of his career in Germany due to distant Jewish
ancestry. He went to the Soviet Union and there worked to build the Soviet nuclear
bomb.
“Something Isn’t Quite Right” 37
Niels Bohr20 decided in 1913 not just to accept the quantization of atomic
energies as an experimental fact, but to find a theoretical underpinning.
He started with the simplest atom: hydrogen. Hydrogen consists of an
electron (mass me , charge −e) and a far more massive proton (charge +e).
Bohr made21 two assumptions: first that the electron orbits the proton
only in circular (never elliptical) orbits, second that the circumference of
the circular orbit holds an whole number of de Broglie wavelengths (one
or two or three or more but never 2.7). If you remember F = ma, and
the formula for centripetal acceleration (v 2 /r), and Coulomb’s law, you’ll
realize that the circular orbit assumption demands
e2 1 v2 p2
= m e = . (1.25)
4π0 r2 r me r
And if you remember the de Broglie formula λ = h/p you’ll realize that the
assumption of a whole number of wavelengths demands
2πr = nλ = nh/p where n = 1, 2, 3, . . .. (1.26)
Here are two different formulas connecting the two variables r and p, so we
can solve for these variables in terms of n, h, me , and e2 /4π0 . (Don’t get
distracted by the quantities λ and v . . . if we decide we want them later on
we can easily find them once we’ve solved for r and p.) Once our objectives
are clear it’s not hard to achieve them. Solve equation (1.26) for p,
h/2π
p=n , (1.27)
r
and then plug this into equation (1.25) giving
e2 /4π0 2 (h/2π)
2
= n .
r2 me r3
Solve this equation for r giving
(h/2π)2
r = n2 , (1.28)
me (e2 /4π0 )
20 Danish physicist (1885–1962), fond of revolutionary ideas. In 1924 and again in 1929
he suggested that the law of energy conservation be abandoned, but both suggestions
proved to be on the wrong track. Father of six children, all boys, one of whom won the
Nobel Prize in Physics and another of whom played in the 1948 Danish Olympic field
hockey team.
21 The treatment in this book captures the spirit but not all the nuance of Bohr’s argu-
ment.
38 Quantization of atomic energies
1.5.3 Visualizations
As with the photon, it is the job of the rest of this book to come up with
a description of an electron that is correct, but one thing is clear already:
the visualization of an electron as a classical point particle — a smaller
and harder version of a marble — cannot be correct. At this stage in your
quantal education I cannot yet give you a perfect picture of an electron, but
“Something Isn’t Quite Right” 39
you can see that the picture of an electron as a point particle with a position,
a speed, and an energy — the picture that appeared in, for example, the
seal of the U.S. Atomic Energy Commission — must be wrong.
The electron is not a small, hard marble with a position, a speed, and an
energy, and any intuition you hold in your mind based on that mistaken
visualization is likely to lead you astray.
40 Quantization of atomic energies
Atoms larger than sodium required even more elaborate schemes, and
each new atom required a new set of assumptions. Furthermore, even
for simple hydrogen, Bohr could never explain how the quantized ener-
gies changed when the atom was placed in a magnetic field (the “Zeeman23
effect”).
We must conclude that the Bohr model, despite its impressive prediction
concerning the energies of a hydrogen atom in the absence of magnetic field,
is wrong.
22 Reproduced from K.A. Kramers and Helge Holst, The Atom and the Bohr Theory of
after earning his Ph.D. The prestigious journal Nature had earlier described his obser-
vations of the meteoroid of 17 November 1882, made at age 17 years.
“Something Isn’t Quite Right” 41
Problem
1.15 Characteristic quantities (recommended problem)
[[It’s a good idea to develop a sense of typical, or “characteristic”,
sizes: if a problem in classical mechanics asks you to calculate the
mass of a squirrel, and you find 937 kg, then you know you’ve made
a mistake somewhere. In classical mechanics this sense of typical
quantities comes from everyday experience. In atomic physics this
sense has to be built.24 Although the Bohr model is not correct, it
does provide a reasonable picture of typical sizes for atomic quan-
tities, and this problem is your first step toward such a “tangible
picture”.]]
The “characteristic length” for atomic systems is the so-called
“Bohr radius”, the radius of the smallest allowed orbit, which is
(see equation 1.28)
(h/2π)2
a0 ≡ . (1.31)
me (e2 /4π0 )
a. Evaluate the Bohr radius numerically in nanometers. Com-
pare to a wavelength of blue light.
The “characteristic time” for atomic systems is conventionally de-
fined not as the period of the smallest allowed Bohr orbit, but as
this period divided by 2π. (Not the time for the electron to make
one orbit, but the time for it to sweep out an angle of one radian.)
b. Derive a formula for this time and evaluate it numerically in
femtoseconds. Compare to a period of blue light.
[[Characteristic quantities for atomic systems will be explored fur-
ther in problem 7.7, “Characteristic quantities for the Coulomb
problem”, on page 246. That problem shows why it makes sense
to divide the period by 2π.]]
“discovered that if one gets a feeling for no more than a dozen. . . nuclear constants, one
can imagine the subatomic world almost tangibly, and manipulate the picture dimen-
sionally and qualitatively, before calculating more precise relationships.” [Stanislaw M.
Ulam, Adventures of a Mathematician (Charles Scribner’s Sons, New York, 1976) pages
147–148. (From the chapter “Life among the Physicists: Los Alamos”.)]
42 Quantization of magnetic moment
to both theory and experiment. He left Germany for the United States in 1933 upon
the Nazi ascension to power. Walter Gerlach (1889–1979) was a German experimental
physicist. During the Second World War he led the physics section of the Reich Research
Council and for a time directed the German effort to build a nuclear bomb.
“Something Isn’t Quite Right” 43
~
B
z
6 µ
~
Here are the results that I expect: I expect that an atom which happens
to enter the field with magnetic moment pointing straight up (in the z
direction) will experience a large upward force. Hence it will move upward
and stick high up on the glass-plate detector. I expect that an atom which
happens to enter with magnetic moment pointing straight down (in the −z
direction) will experience a large downward force, and hence will stick far
down on the glass plate. I expect that an atom entering with magnetic
moment tilted upward, but not straight upward, will move upward but
not as far up as the straight-up atoms, and the mirror image for an atom
entering with magnetic moment tilted downward. I expect that an atom
entering with horizontal magnetic moment will experience a net force of
zero, so it will pass through the non-uniform field undeflected.
Furthermore, I expect that when a silver atom emerges from the oven
source, its magnetic moment will be oriented randomly — as likely to point
in one direction as in any other. There is only one way to point straight up,
so I expect that very few atoms will stick high on the glass plate. There are
many ways to point horizontally, so I expect many atoms to pass through
undeflected. There is only one way to point straight down, so I expect very
few atoms to stick far down on the glass plate.26
In summary, I expect that atoms would leave the magnetic field in any of
a range of deflections: a very few with large positive deflection, more with a
small positive deflection, a lot with no deflection, some with a small negative
deflection, and a very few with large negative deflection. This continuity of
deflections reflects a continuity of magnetic moment projections.
26 To be specific, this reasoning suggests that the number of atoms with moment tilted
Distribution of µz
Expected: Actual:
µz µz
+µB
0 0
−µB
Problems
1.16 Force on a classical magnetic moment
The force on a classical magnetic moment is most easily calculated using
“magnetic charge fiction”: Consider the magnetic moment to consist of
two “magnetic charges” of magnitude +m and −m, separated by the
position vector d~ running from −m to +m. The magnetic moment is
then µ
~ = md.~
a. Use B+ for the magnitude of the magnetic field at +m, and B−
for the magnitude of the magnetic field at −m. Show that the net
force on the magnetic moment is in the z direction with magnitude
mB+ − mB− .
b. Use dz for the z-component of the vector d. ~ Show that to high
accuracy
∂B
B+ = B− + dz .
∂z
Surely, for distances of atomic scale, this accuracy is more than
adequate.
c. Derive expression (1.32) for the force on a magnetic moment.
27 “The important thing is not to stop questioning,” said Einstein. “Never lose a holy
curiosity.” [Interview by William Miller, “Death of a Genius”, Life magazine, volume 38,
number 18 (2 May 1955) pages 61–64 on page 64.]
Chapter 2
2.1 Quantization
47
48 Quantization
+µB
µ
~
µ
~
µ
~
What Is Quantum Mechanics About? 49
packaged into
An atom enters a vertical analyzer through the single hole on the left.
If it exits through the upper hole on the right (the “+ port”) then the
outgoing atom has µz = +µB . If it exits through the lower hole on the
right (the “− port”) then the outgoing atom has µz = −µB .
1 In general, the “pipes” will manipulate the atoms through electromagnetic fields, not
through touching. One way way to make such “pipes” is to insert a second Stern-Gerlach
apparatus, oriented upside-down relative to the first. The atoms with µz = +µB , which
had experienced an upward force in the first half, will experience an equal downward
force in the second half, and the net impulse delivered will be zero. But whatever their
manner of construction, the pipes must not change the magnetic moment of an atom
passing through them.
50 Quantization
µz = +µB
µz = −µB
all
µz = +µB
none
µz = −µB
(ignore these)
Atoms are fed into a vertical analyzer. Any atom exiting from the + port is
then channeled into a second analyzer, but this analyzer is oriented upside-
down. What happens? If the projection on an upward-pointing axis is +µB
(that is, µz = +µB ), then the projection on a downward-pointing axis is
−µB (we write this as µ(−z) = −µB ). So I expect that these atoms will
emerge from the − port of the second analyzer (which happens to be the
higher port). And this is exactly what happens.
What Is Quantum Mechanics About? 51
all
µz = +µB
none
µz = −µB
(ignore these)
Atoms are fed into a vertical analyzer. Any atom exiting from the + port is
then channeled into a second analyzer, but this analyzer is oriented horizon-
tally. The second analyzer doesn’t measure the projection µz , it measures
the projection µx . What happens in this case? Experiment shows that the
atoms emerge randomly: half from the + port, half from the − port.
z
y
x
(ignore these)
µz = −µB
turn left half the time and right half the time. (Don’t take this paragraph
literally. . . atoms have no personalities and they don’t “want” anything.
But it is a useful mnemonic.)
Perform the same experiment as above (section 2.1.4), except insert the
horizontal analyzer in the opposite sense, so that it measures the projection
on the negative x axis rather than the positive x axis. Again, half the atoms
emerge from the + port, and half emerge from the − port.
z
y
x
(ignore these)
µz = −µB
z
y
x
µx = −µB µz = +µB
µx = +µB
µz = −µB
Atoms are fed into a vertical analyzer. Any atom exiting from the + port
is then channeled into a horizontal analyzer. Half of these atoms exit from
the + port of the horizontal analyzer (see section 2.1.4), and these atoms
are channeled into a third analyzer, oriented vertically. What happens at
the third analyzer?
z
y
x
µx =−µB ?
µz =+µB
µx =+µB ?
µz =−µB
There are two ways to think of this: (I) When the atom emerged from
the + port of the first analyzer, it was determined to have µz = +µB .
When that same atom emerged from the + port of the second analyzer,
it was determined to have µx = +µB . Now we know two projections
of the magnetic moment. When it enters the third analyzer, it still has
µz = +µB , so it will emerge from the + port. (II) The last two analyzers
in this sequence are a horizontal analyzer followed by a vertical analyzer,
and from section 2.1.6 we know what happens in this case: a 50/50 split.
That will happen in this case, too.
54 Quantization
So, analysis (I) predicts that all the atoms entering the third analyzer
will exit through the + port and none through the − port. Analysis (II)
predicts that half the atoms will exit through the + port and half through
the − port.
Experiment shows that analysis (II) gives the correct result. But what
could possibly be wrong with analysis (I)? Let’s go through line by line:
“When the atom emerged from the + port of the first analyzer, it was
determined to have µz = +µB .” Nothing wrong here — this is what an
analyzer does. “When that same atom emerged from the + port of the
second analyzer, it was determined to have µx = +µB .” Ditto. “Now we
know two projections of the magnetic moment.” This has got to be the
problem. To underscore that problem, look at the figure below.
µ
~
+µB
x
+µB
If an atom did have both µz = +µB and µx = +µB , then the √ projection
◦
on an axis rotated 45 from the vertical would be µ45 = + 2 µB . But
◦
Because it’s easy to fall into misconceptions, let me emphasize what I’m
saying and what I’m not saying:
I’m saying that if an atom has a value for µx , then it doesn’t have
a value for µz .
I’m not saying that the atom has a value for µz but no one knows
what it is.
I’m not saying that the atom has a value for µz but that value is
changing rapidly.
I’m not saying that the atom has a value for µz but that value is
changing unpredictably.
I’m not saying that a random half of such atoms have the value
µz = +µB and the other half have the value µz = −µB .
I’m not saying that the atom has a value for µz which will be
disturbed upon measurement.
The atom with a value for µx does not have a value for µz in the same way
that love does not have a color.
This is a new phenomenon, and it deserves a new name. That name
is “indeterminacy”. This is perhaps not the best name, because it might
suggest, incorrectly, that an atom with a value for µx has a value for µz and
we merely haven’t yet determined what that value is. The English language
was invented by people who didn’t understand quantum mechanics, so it is
not surprising that there are no perfectly appropriate names for quantum
mechanical phenomena. This is a defect in our language, not a defect in
quantum mechanics or in our understanding of quantum mechanics, and it
is certainly not a defect in nature.2
How can a vector have a projection on one axis but not on another? It
is the job of the rest of this book to answer that question, but one thing
is clear already: The visualization of an atomic magnetic moment as a
classical arrow must be wrong.
2 In exactly the same manner, the name “orange” applies to light within the wavelength
range 590–620 nm and the name“red” applies to light within the wavelength range 620–
740 nm, but the English language has no word to distinguish the wavelength range
1590–1620 nm from the wavelength range 1620–1740 nm. This is not because optical
light is “better” or “more deserving” than infrared light. It is due merely to the accident
that our eyes detect optical light but not infrared light.
56 Quantization
θ
µθ = +µB
µz = +µB
µθ = −µB
µz = −µB
P+ (θ)
1
2
0
0◦ 90◦ 180◦ 270◦ 360◦
θ
What Is Quantum Mechanics About? 57
Problems
2.1 Exit probabilities (essential problem)
a. An analyzer is tilted from the vertical by angle α. An atom leaving
its + port is channeled into a vertical analyzer. What is the proba-
bility that this atom emerges from the + port? The − port? (Clue:
Use the “rotate as a unit” concept introduced in section 2.1.6.)
z
β
α γ
- or -
A C
Find the probability that it emerges from (a) the − port of analyzer
A; (b) the + port of analyzer B; (c) the + port of analyzer C; (d) the
− port of analyzer C.
2.3 Properties of the P+ (θ) function
a. An atom exits the + port of a vertical analyzer; that is, it has
µz = +µB . Argue that the probability of this atom exiting from
the − port of a θ analyzer is the same as the probability of it
exiting from the + port of a (180◦ − θ) analyzer.
b. Conclude that the P+ (θ) function introduced in section 2.1.8 must
satisfy
P+ (θ) + P+ (180◦ − θ) = 1.
c. Does the experimental result (2.1) satisfy this condition?
2.2. Interference 59
2.2 Interference
packaged into
performed exactly as described here, although researchers are getting close. [See Shi-
mon Machluf, Yonathan Japha, and Ron Folman, “Coherent Stern–Gerlach momentum
splitting on an atom chip” Nature Communications 4 (9 September 2013) 2424.] We
know the results that would come from these experiments because conceptually parallel
(but more complex!) experiments have been performed on photons, neutrons, atoms,
and molecules. (See page 33.)
4 If you followed the footnote on page 49, you will recall that these “pipes” manipulate
atoms through electromagnetic fields, not through touching. One way to make them
would be to insert two more Stern-Gerlach apparatuses, the first one upside-down and
the second one rightside-up relative to the initial apparatus. But whatever the manner of
their construction, the pipes must not change the magnetic moment of an atom passing
through them.
60 Interference
b
input ignore
µz = +µB output
a µz = −µB
If you perform this experiment, you will find that this analysis is correct
and that these results are indeed obtained.
5 To make sure that all of these atoms have µ = +µ , they are harvested from the
z B
+ port of a vertical analyzer.
What Is Quantum Mechanics About? 61
Here, I have not just one, but two ways to analyze the experiment:
Analysis I:
(1) An atom passes through the set-up either via path b or via path a.
(2) From section 2.2.1, the probability of passing through via path b is 14 .
(3) From section 2.2.2, the probability of passing through via path a is 14 .
(4) Thus the probability of passing through the entire set-up is 14 + 14 = 21 .
Analysis II:
“longitude” when it was thought that the Earth was flat. The discovery of the near-
spherical character of the Earth forced our forebears to invent new words to represent
these new concepts. Words do not determine reality; instead reality determines which
words are worth inventing.
What Is Quantum Mechanics About? 63
Consider the same set-up as on page 60, but now ignore atoms leaving the
− port of the vertical analyzer and consider as output atoms leaving the
+ port. What is the probability of passing through the set-up when path
a is blocked? When path b is blocked? When neither path is blocked?
1 1 1
Solution: 4; 4; 1. Because 4 + 14 < 1, this is an example of constructive
interference.
64 Interference
2a
1b
input output
µz = +µB
1a
2b
(a) 2a (d) 1b
(b) 2b (e) 1b and 2a
(c) 1a (f) 1a and 2b
Solution: Only two principles are needed to solve this problem: First,
an atom leaving an unblocked analyzer loop leaves in the same condition
it had when it entered. Second, an atom leaving an analyzer loop with
one path blocked leaves in the condition specified by the path that it took,
regardless of the condition it had when it entered. Use of these principles
gives the solution in the table on the next page. Notice that in changing
from situation (a) to situation (e), you add blockage, yet you increase the
output!
paths input path taken intermediate path taken output probability of
blocked condition through # 1 condition through # 2 condition input → output
none µz = +µB “both” µz = +µB a µz = +µB 100%
2a µz = +µB “both” µz = +µB 100% blocked at a none 0%
What Is Quantum Mechanics About?
Problems
2.4 Tilted analyzer loop (recommended problem)
z
θ
a
input
µz =+µB output
7 Background concerning “find the flaw” type problems is provided in sample prob-
2a
1b 3b
µz =+µB output
1a 3a
2b
If all paths are open, 100% of the incoming atoms exit from the out-
put. What percent of the incoming atoms leave from the output if the
following paths are blocked?
emerge in the same condition it had when it entered. We call this box a
“replicator”.
If you play with one of these boxes you’ll find that you can build any
elaborate set-up of sources, detectors, blockages, and analyzers, and that
inserting a replicator into any path will not affect the outcome of any exper-
iment. But notice that this apparatus list does not include interferometers
(our “analyzer loops”)! Build the interference experiment of page 60. Do
not block either path. Instead, slip a replicator into one of the two paths a
or b — it doesn’t matter which.
b
µz =+µB ignore
output
a µz =−µB
replicator
https://arxiv.org/abs/1304.4736.
What Is Quantum Mechanics About? 69
Problem
2.7 Bomb-testing interferometer9 (recommended problem)
The Acme Bomb Company sells a bomb triggered by the presence of
silver, and claims that the trigger is so sensitive that the bomb explodes
when its trigger absorbs even a single silver atom. You have heard sim-
ilar extravagant claims from other manufacturers, so you’re suspicious.
You purchase a dozen bombs, then shoot individual silver atoms at
each in turn. The first bomb tested explodes! The trigger worked as
advertised, but now it’s useless because it’s blasted to pieces. The sec-
ond bomb tested doesn’t explode — the atom slips through a hole in
the trigger. This confirms your suspicion that not all the triggers are
as sensitive as claimed, so this bomb is useless to you as well. If you
continue testing in this fashion, at the end all your good bombs will be
blown up and you will be left with a stash of bad bombs.
So instead, you set up the test apparatus sketched here:
b
µz =+µB ?
a
?
bomb with trigger
Our conclusion that, under some circumstances, the atom “does not have
a position” is so dramatically counterintuitive that you might — no, you
should — be tempted to test it experimentally. Set up the interference ex-
periment on page 60, but instead of simply allowing atoms to pass through
the interferometer, watch to see which path the atom takes through the
set-up. To watch them, you need light. So set up the apparatus with lamps
trained on the two paths a and b.
Send in one atom. There’s a flash of light at path a.
Another atom. Flash of light at b.
Another atom. Flash at b again.
Then a, then a, then b.
You get the drift. Always the light appears at one path or the other. (In
fact, the flashes come at random with probability 21 for a flash at a and 12
for a flash at b.) Never is there no flash. Never are there “two half flashes”.
The atom always has a position when passing through the interferometer.
“So much”, say the skeptics, “for this metaphysical nonsense about ‘the
atom takes both paths’.”
But wait. Go back and look at the output of the vertical analyzer.
When we ran the experiment with no light, the probability of coming out
the − port was 0. When we turn the lamps on, then the probability of
coming out the − port becomes 21 .
When the lamps are off, analysis II on page 61 is correct: the atoms
ambivate through both paths, and the probability of exiting from the − port
is 0. When the lamps are on and a flash is seen at path a, then the atom
does take path a, and now the analysis of section 2.2.2 on page 61 is correct:
the probability of exiting from the − port is 21 .
The process when the lamps are on is called “observation” or “measure-
ment”, and a lot of nonsense has come from the use of these two words.
The important thing is whether the light is present or absent. Whether
or not the flashes are “observed” by a person is irrelevant. To prove this
to yourself, you may, instead of observing the flashes in person, record the
flashes on video. If the lamps are on, the probability of exiting from the
− port is 12 . If the lamps are off, the probability of exiting from the − port
What Is Quantum Mechanics About? 71
is 0. Now, after the experiment is performed, you may either destroy the
video, or play it back to a human audience, or play it back to a feline au-
dience. Surely, by this point it is too late to change the results at the exit
port.
It’s not just light. Any method you can dream up for determining the
path taken will show that the atom takes just one path, but that method
will also change the output probability from 0 to 21 . No person needs to
actually read the results of this mechanism: as long as the mechanism is at
work, as long as it is in principle possible to determine which path is taken,
then one path is taken and no interference happens.
What happens if you train a lamp on path a but leave path b in the
dark? In this case a flash means the atom has taken path a. No flash means
the atom has taken path b. In both cases the probability of passage for the
atom is 12 .
How can the atom taking path b “know” that the lamp at path a is
turned on? The atom initially “sniffs out” both paths, like a fog creeping
down two passageways. The atom that eventually does take path b in
the dark started out attempting both paths, and that’s how it “knows”
the lamp at path a is on. This is called the “Renninger negative-result
experiment”.
It is not surprising that the presence or absence of light should affect an
atom’s motion: this happens even in classical mechanics. When an object
absorbs or reflects light, that object experiences a force, so its motion is
altered. For example, a baseball tossed upward in a gymnasium with the
overhead lamps off attains a slightly greater height that an identical baseball
experiencing an identical toss in the same gymnasium with the overhead
lamps on, because the downward-directed light beams push the baseball
downward. (This is the same “radiation pressure” that is responsible for
the tails of comets. And of course, the effect occurs whenever the lamps are
turned on: whether any person actually watches the illuminated baseball
is irrelevant.) This effect is negligible for typical human-scale baseballs
and tosses and lamps, but atoms are far smaller than baseballs and it is
reasonable that the light should alter the motion of an atom more than it
alters the motion of a baseball.
One last experiment: Look for the atoms with dim light. In this case,
some of the atoms will pass through with a flash. But — because of the
dimness — some atoms will pass through without any flash at all. For those
72 Entanglement
atoms passing through with a flash, the probability for exiting the − port
is 21 . For those atoms passing through without a flash, the probability of
exiting the − port is 0.
2.5 Entanglement
V V
120◦
I O I O
Up to now, our atoms have come from an oven. For the next experiments we
need a special source10 that expels two atoms at once, one moving to the left
and the other to the right. For the time being we call this an “EPR” source,
which produces an atomic pair in an “EPR” condition. The letters come
from the names of those who discovered this condition: Albert Einstein,
Boris Podolsky, and Nathan Rosen. After investigating this condition we
will develop a more descriptive name.
The following four experiments investigate the EPR condition:
(1) Each atom encounters a vertical Stern-Gerlach analyzer. The ex-
perimental result: the two atoms exit through opposite ports. To be precise:
with probability 21 , the left atom exits + and the right atom exits −, and
with probability 12 , the left atom exits − and the right atom exits +, but
it never happens that both atoms exit + or that both atoms exit −.
1
probability 2
1
probability 2
never
never
10 The question of how to build this special source need not concern us at the moment: it
is an experimental fact that such sources do exist. One way to make one would start with
a diatomic molecule with zero magnetic moment. Cause the molecule to disintegrate and
eject the two daughter atoms in opposite directions. Because the initial molecule had
zero magnetic moment, the pair of daughter atoms will have the properties of magnetic
moment described. In fact, it’s easier to build a source, not for a pair of atoms, but for
a pair of photons using a process called spontaneous parametric down-conversion.
What Is Quantum Mechanics About? 75
You might suppose that this is because for half the pairs, the left atom
is generated with µz = +µB while the right atom is generated with
µz = −µB , while for the other half of the pairs, the left atom is generated
with µz = −µB while the right atom is generated with µz = +µB . This
supposition seems suspicious, because it singles out the z axis as special,
but at this stage in our experimentation it’s possible.
(3) Repeat the above experiment with the two Stern-Gerlach analyzers
oriented at +120◦ , or with both oriented at −120◦ , or with both oriented
at 57◦ , or for any other angle, as long as both have the same orientation.
The experimental result: Exactly the same for any orientation!
(4) In an attempt to trick the atoms, we set the analyzers to vertical,
then launch the pair of atoms, then (while the atoms are in flight) switch
both analyzers to, say, 42◦ , and have the atoms encounter these analyzers
both with switched orientation. The experimental result: Regardless of
what the orientation is, and regardless of when that orientation is set, the
two atoms always exit through opposite ports.
Here is one way to picture this situation: The pair of atoms has a total
magnetic moment of zero. But whenever the projection of a single atom
on any axis is measured, the result must be +µB or −µB , never zero.
The only way to insure that that total magnetic moment, projected on
any axis, sums to zero is the way described above. Do not put too much
weight on this picture: like the “wants to go straight” story of section 2.1.4
(page 51), this is a classical story that happens to give the correct result.
The definitive answer to any question is always experiment, not any picture
or story, however appealing it may be.
These four experiments show that it is impossible to describe the con-
dition of the atoms through anything like “the left atom has µz = +µB ,
the right atom has µz = −µB ”. How can we describe the condition of the
pair? This will require further experimentation. For now, we say it has an
EPR condition.
76 Entanglement
A pair of atoms leaves the EPR source, and each atom travels at the same
speed to vertical analyzers located 100 meters away. The left atom exits the
− port, the right atom exits the + port. When the pair is flying from source
to analyzer, it’s not correct to describe it as “the left atom has µz = −µB ,
the right atom has µz = +µB ”, but after the atoms leave their analyzers,
then this is a correct description.
Now shift the left analyzer one meter closer to the source. The left atom
encounters its analyzer before the right atom encounters its. Suppose the
left atom exits the − port, while the right atom is still in flight toward its
analyzer. We know that when the right atom eventually does encounter
its vertical analyzer, it will exit the + port. Thus it is correct to describe
the right atom as having “µz = +µB ”, even though that atom hasn’t yet
encountered its analyzer.
Replace the right vertical analyzer with a flipping Stern-Gerlach ana-
lyzer. (In the figure below, it is in orientation O, out of the page.) Suppose
the left atom encounters its vertical analyzer and exits the − port. Through
the reasoning of the previous paragraph, the right atom now has µz = +µB .
We know that when such an atom encounters a flipping Stern-Gerlach an-
alyzer, it exits the + port with probability 21 .
Similarly, if the left atom encounters its vertical analyzer and exits the
+ port, the right atom now has µz = −µB , and once it arrives at its flipping
analyzer, it will exit the − port with probability 21 . Summarizing these two
paragraphs: Regardless of which port the left atom exits, the right atom
will exit the opposite port with probability 12 .
Now suppose that the left analyzer were not vertical, but instead in
orientation I, tilted into the page by one-third of a circle. It’s easy to see
that, again, regardless of which port the left atom exits, the right atom will
exit the opposite port with probability 21 .
What Is Quantum Mechanics About? 77
Finally, suppose that the left analyzer is a flipping analyzer. Once again,
the two atoms will exit from opposite ports with probability 21 .
The above analysis supposed that the left analyzer was one meter closer
to the source than the right analyzer, but clearly it also works if the right
analyzer is one meter closer to the source than the left analyzer. Or one
centimeter. One suspects that the same result will hold even if the two
analyzers are exactly equidistant from the source, and experiment bears
out this suspicion.
In summary: Each atom from this EPR source enters a flipping Stern-
Gerlach analyzer.
Suppose you didn’t know anything about quantum mechanics, and you
were told the result that “if the two analyzers have the same orientation,
the atoms exit from opposite ports.” Could you explain it?
I am sure you could. In fact, there are two possible explanations: First,
the communication explanation. The left atom enters its vertical analyzer,
and notices that it’s being pulled toward the + port. It calls up the right
atom with its walkie-talkie and says “If your analyzer has orientation I or O
then you might go either way, but if your analyzer has orientation V you’ve
got to go to the − port!” This is a possible explanation, but it’s not a local
explanation. The two analyzers might be 200 meters apart, or they might
be 200 light-years apart. In either case, the message would have to get from
the left analyzer to the right analyzer instantaneously. The walkie-talkies
would have to use not radio waves, which propagate at the speed of light,
but some sort of not-yet-discovered “insta-rays”. Physicists have always
been skeptical of non-local explanations, and since the advent of relativity
they have grown even more skeptical, so we set this explanation aside. Can
you find a local explanation?
78 Entanglement
Again, I am sure you can. Suppose that when the atoms are launched,
they have some sort of characteristic that specifies which exit port they will
take when they arrive at their analyzer. This very reasonable supposition,
called “determinism”, pervades all of classical mechanics. It is similar to
saying “If I stand atop a 131 meter cliff and toss a ball horizontally with
speed 23.3 m/s, I can predict the angle with which the ball strikes the
ground, even though that event will happen far away and long in the fu-
ture.” In the case of the ball, the resulting strike angle is encoded into the
initial position and velocity. In the case of the atoms, it’s not clear how the
exit port will be encoded: perhaps through the orientation of its magnetic
moment, perhaps in some other, more elaborate way. But the method of
encoding is irrelevant: if local determinism holds, then something within
the atom determines which exit port it will take when it reaches its ana-
lyzer.11 I’ll represent this “something” through a code like (+ + −). The
first symbol means that if the atom encounters an analyzer in orientation V,
it will exit through the + port. The second means that if it encounters an
analyzer in orientation O, it will exit through the + port. The third means
that if it encounters an analyzer in orientation I, it will exit through the
− port. The only way to ensure that “if the two analyzers have the same
orientation, the atoms exit from opposite ports” is to assume that when the
two atoms separate from each other within the source, they have opposite
codes. If the left atom has (+ − +), the right atom must have (− + −). If
the left atom has (− − −), the right atom must have (+ + +). This is the
local deterministic scheme for explaining fact (B) that “if the two analyzers
have the same orientation, the atoms exit from opposite ports”.
But can this scheme explain fact (A)? Let’s investigate. Consider first
the case mentioned above: the left atom has (+−+) and the right atom has
(− + −). These atoms will encounter analyzers set to any of 32 = 9 possible
pairs of orientations. We list them below, along with with exit ports taken
by the atoms. (For example, the third line of the table considers a left
analyzer in orientation V and a right analyzer in orientation I. The left
atom has code (+ − +), and the first entry in that code determines that
the left atom will exit from the V analyzer through the + port. The right
atom has code (− + −), and the third entry in that code determines that
the right atom will exit from the I analyzer through the − port.)
11 But remember that in quantum mechanics determinism does not hold. The infor-
mation can’t be encoded within the three projections of a classical magnetic moment
vector, because at any one instant, the quantum magnetic moment vector has only one
projection.
What Is Quantum Mechanics About? 79
Each of the nine orientation pairs (VV, OI, etc.) are equally likely, five of
the orientation pairs result in atoms exiting from opposite ports, so when
atoms of this type emerge from the source, the probability of these atoms
exiting from opposite ports is 59 .
What about a pair of atoms generated with different codes? Suppose the
left atom has (− − +) so the right atom must have (+ + −). If you perform
the analysis again, you will find that the probability of atoms exiting from
opposite ports is once again 95 .
Suppose the left atom has (−−−), so the right atom must have (+++).
The probability of the atoms exiting from opposite ports is of course 1.
There are, in fact, just 23 = 8 possible codes:
code probability
for of exiting
left atom opposite
+++ 1
−++ 5/9
+−+ 5/9
++− 5/9
+−− 5/9
−+− 5/9
−−+ 5/9
−−− 1
80 Entanglement
If the source makes left atoms of only type (−−+), then the probability
of atoms exiting from opposite ports is 59 . If the source makes left atoms
of only type (+ + +), then the probability of atoms exiting from opposite
ports is 1. If the source makes left atoms of type (− − +) half the time,
and of type (+ + +) half the time, then the probability of atoms exiting
from opposite ports is halfway between 95 and 1, namely 79 . But no matter
how the source makes atoms, the probability of atoms exiting from opposite
ports must be somewhere between 59 and 1.
But experiment and quantum mechanics agree: That probability is ac-
tually 12 — and 12 is not between 95 and 1. No local deterministic scheme
— no matter how clever, or how elaborate, or how baroque — can give the
result 12 . There is no “something within the atom that determines which
exit port it will take when it reaches its analyzer”. If the magnetic moment
has a projection on axis V, then it doesn’t have a projection on axis O or
axis I.
There is a reason that Einstein, despite his many attempts, never pro-
duced a scheme that explained quantum mechanics in terms of some more
fundamental, local and deterministic mechanism. It is not that Einstein
wasn’t clever. It is that no such scheme exists.
“source of entangled atom pairs” and describe the condition of the atom
pair as “entangled”.
The failure of local determinism described above is a special case of
“Bell’s Theorem”, developed by John Bell14 in 1964. The theorem has
by now been tested experimentally numerous times in numerous contexts
(various different angles; various distances between the analyzers; various
sources of entangled pairs; various kinds of particles flying apart — gamma
rays, or optical photons, or ions). In every test, quantum mechanics has
been shown correct and local determinism wrong. What do we gain from
these results?
First, they show that nature does not obey local determinism. To our
minds, local determinism is common sense and any departure from it is
weird. Thus whatever theory of quantum mechanics we eventually develop
will be, to our eyes, weird. This will be a strength, not a defect, in the
theory. The weirdness lies in nature, not in the theory used to describe
nature.
Each of us feels a strong psychological tendency to reject the unfamil-
iar. In 1633, the Holy Office of the Inquisition found Galileo Galilei’s idea
that the Earth orbited the Sun so unfamiliar that they rejected it. The
inquisitors put Galileo on trial and forced him to abjure his position. From
the point of view of nature, the trial was irrelevant, Galileo’s abjuration
was irrelevant: the Earth orbits the Sun whether the Holy Office finds that
fact comforting or not. It is our job as scientists to change our minds to fit
nature; we do not change nature to fit our preconceptions. Don’t make the
inquisitors’ mistake.
Second, the Bell’s theorem result guides not just our calculations about
nature but also our visualizations of nature, and even the very idea of
what it means to “understand” nature. Lord Kelvin15 framed the situation
perfectly in his 1884 Baltimore lectures: “I never satisfy myself until I can
14 John Stewart Bell (1928–1990), a Northern Irish physicist, worked principally in accel-
erator design, and his investigation of the foundations of quantum mechanics was some-
thing of a hobby. Concerning tests of his theorem, he remarked that “The reasonable
thing just doesn’t work.” [Jeremy Bernstein, Quantum Profiles (Princeton University
Press, Princeton, NJ, 1991) page 84.]
15 William Thomson, the first Baron Kelvin (1824–1907), was an Irish mathematical
physicist and engineer who worked in Scotland. He is best known today for establishing
the thermodynamic temperature scale that bears his name, but he also made fundamen-
tal contributions to electromagnetism. He was knighted for his engineering work on the
first transatlantic telegraph cable.
82 Entanglement
Robert Kargon and Peter Achinstein, editors, Kelvin’s Baltimore Lectures and Modern
Theoretical Physics (MIT Press, Cambridge, MA, 1987) page 206.
17 The first time I studied quantum mechanics seriously, I wrote in the margin of my
textbook “Good God they do it! But how?” I see now that I was looking for a mechanical
mechanism undergirding quantum mechanics. It doesn’t exist, but it’s very natural for
anyone to want it to exist.
18 Max Born (1882–1970) was a German-Jewish theoretical physicist with a particular in-
behave in some ways like small hard marbles, in some ways like classical
waves, and in some ways like a cloud or fog of probability. Atoms don’t
behave exactly like any of these things, but if you keep in mind both the
analogy and its limitations, then you can develop a pretty good visualization
and understanding.
And that brings us back to the name “entanglement”. It’s an important
name for an important phenomenon, but it suggests that the two distant
atoms are connected mechanically, through strings. They aren’t. The two
atoms are correlated — if the left comes out +, the right comes out −, and
vice versa — but they aren’t correlated because of some signal sent back
and forth through either strings or walkie-talkies. Entanglement involves
correlation without causality.
Problems
2.8 An atom walks into an analyzer
Execute the “similar analysis” mentioned in the sentence below equa-
tion (2.2).
2.9 A supposition squashed (essential problem)
If atoms were generated according to the supposition presented below
experiment (1) on page 74, then would would happen when they en-
countered the two horizontal analyzers of experiment (2)?
2.10 A probability found through local determinism
Suppose that the codes postulated on page 78 did exist. Suppose also
that a given source produces the various possible codes with these prob-
abilities:
code probability
for of making
left atom such a pair
+++ 1/2
++− 1/4
+−− 1/8
−−+ 1/8
If this given source were used in the experiment of section 2.5.3 with
distant flipping Stern-Gerlach analyzers, what would be the probability
of the two atoms exiting from opposite ports?
84 Quantum cryptography
We’ve seen a lot of new phenomena, and the rest of this book is devoted
to filling out our understanding of these phenomena and applying that
understanding to various circumstances. But first, can we use them for
anything?
We can. The sending of coded messages used to be the province of
armies and spies and giant corporations, but today everyone does it. All
transactions through automatic teller machines are coded. All Internet
commerce is coded. This section describes a particular, highly reliable
encoding scheme and then shows how quantal entanglement may someday
be used to implement this scheme. (Quantum cryptography was used to
securely transmit voting ballots cast in the Geneva canton of Switzerland
during parliamentary elections held 21 October 2007. But it is not today
in regular use anywhere.)
In this section I use names conventional in the field of coded messages
(called cryptography). Alice and Bob wish to exchange private messages,
but they know that Eve is eavesdropping on their communication. How
can they encode their messages to maintain their privacy?
The Vernam cipher or “one-time pad” technique is the only coding scheme
proven to be absolutely unbreakable (if used correctly). It does not rely on
the use of computers — it was invented by Gilbert Vernam in 1919 — but
today it is mostly implemented using computers, so I’ll describe it in that
context.
Data are stored on computer disks through a series of magnetic patches
on the disk that are magnetized either “up” or “down”. An “up” patch
is taken to represent 1, and a “down” patch is taken to represent 0. A
string of seven patches is used to represent a character. For example, by a
What Is Quantum Mechanics About? 85
convention called ASCII, the letter “a” is represented through the sequence
1100001 (or, in terms of magnetizations, up, up, down, down, down, down,
up). The letter “W” is represented through the sequence 1010111. Any
computer the world around will represent the message “What?” through
the sequence
Then Alice gives Bob a copy of that random number – the “key”.
Instead of sending the plaintext, Alice modifies her plaintext into a
coded “ciphertext” using the key. She writes down her plaintext and writes
the key below it, then works through column by column. For each position,
if the key is 0 the plaintext is left unchanged; but if the key is 1 the plaintext
is reversed (from 0 to 1 or vice versa). For the first column, the key is 0, so
Alice doesn’t change the plaintext: the first character of ciphertext is the
same as the first character of plaintext. For the second column, the key is
1, so Alice does change the plaintext: the second character of ciphertext
is the reverse of the second character of plaintext. Alice goes through all
the columns, duplicating the plaintext where the key is 0 and reversing the
plaintext where the key is 1.
Then, Alice sends out her ciphertext over open communication lines.
86 Quantum cryptography
Now, the ciphertext that Bob (and Eve) receive translates to some mes-
sage through the ASCII convention – in fact, it translates to “q[78c” — but
because the key is random, the ciphertext is just as random. Bob deciphers
Alice’s message by carrying out the encoding process on the ciphertext,
namely, duplicating the ciphertext where the key is 0 and reversing the
ciphertext where the key is 1. The result is the plaintext. Eve does not
know the key, so she cannot produce the plaintext.
The whole scheme relies on the facts that the key is (1) random and
(2) unknown to Eve. The very name “one-time pad” underscores that a
key can only be used once and must then be discarded. If a single key is
used for two messages, then the second key is not “random” — it is instead
perfectly correlated with the first key. There are easy methods to break the
code when a key is reused.
Generating random numbers is not easy, and the Vernam cipher de-
mands keys as long as the messages transmitted. As recently as 1992,
high-quality computer random-number generators were classified by the
U.S. government as munitions, along with tanks and fighter planes, and
their export from the country was prohibited.
And of course Eve must not know the key. So there must be some way
for Alice to get the key to Bob securely. If they have some secure method
for transmitting keys, why don’t they just use that same secure method for
sending their messages?
In common parlance, the word “random” can mean “unimportant, not
worth considering” (as in “Joe made a random comment”). So it may
seem remarkable that a major problem for government, the military, and
commerce is the generation and distribution of randomness, but that is
indeed the case.
atoms always exit from opposite ports, Alice and Bob end up with the
same random number, which they use as a key for their Vernam-cipher
communications over conventional telephone or computer lines.
This scheme will indeed produce and distribute copious, high-quality
random numbers. But Eve can get at those same numbers through the
following trick: She cuts open the atom pipe leading from the entangled
source to Alice’s home, and inserts a vertical interferometer.20 She watches
the atoms pass through her interferometer. If the atom takes path a, Eve
knows that when Alice receives that same atom, it will exit from Eve’s
+ port. If the atom takes path b, the opposite holds. Eve gets the key, Eve
breaks the code.
It’s worth looking at this eavesdropping in just a bit more detail. When
the two atoms depart from their source, they are entangled. It is not true
that, say, Alice’s atom has µz = +µB while Bob’s atom has µz = −µB
— the pair of atoms is in the condition we’ve called “entangled”, but the
individual atoms themselves are not in any condition. However, after Eve
sees the atom taking path a of her interferometer, then the two atoms are
no longer entangled — now it is true that Alice’s atom has the condition
µz = +µB while Bob’s atom has the condition µz = −µB . The key received
by Alice and Bob will be random whether or not Eve is listening in. To
test for evesdropping, Alice and Bob must examine it in some other way.
Replace Alice and Bob’s vertical analyzers with flipping Stern-Gerlach
analyzers. After Alice receives her random sequence of pluses and minuses,
encountering her random sequence of analyzer orientations, she sends both
these sequences to Bob over an open communication line. (Eve will in-
tercept this information but it won’t do her any good, because she won’t
know the corresponding information for Bob.) Bob now knows both the
results at his analyzer and the results at Alice’s analyzer, so he can test
to see whether the atom pairs were entangled. If he finds that they were,
then Eve is not listening in. If he finds that they were not entangled, then
he knows for certain that Eve is listening in, and they must not use their
compromised key.
Is there some other way for Eve to tap the line? No! If the atom pairs
pass the test for entanglement, then no one can know the values of their
20 Inspired by James Bond, I always picture Eve as exotic beauty in a little black dress
slinking to the back of an eastern European café to tap the diplomatic cable which
conveniently runs there. But in point of fact Eve would be a computer.
88 What is a qubit?
Problem
2.12 Questions (recommended problem)
Update your list of quantum mechanics questions that you started at
problem 1.17 on page 46. Write down new questions and, if you have un-
covered answers to any of your old questions, write them down briefly.
Chapter 3
When you walked into your introductory classical mechanics course, you
were already familiar with the phenomena of introductory classical mechan-
ics: flying balls, spinning wheels, colliding billiard balls. Your introductory
mechanics textbook didn’t need to introduce these things to you, but in-
stead jumped right into describing these phenomena mathematically and
explaining them in terms of more general principles.
The last chapter of this book made you familiar with the phenomena
of quantum mechanics: quantization, interference, and entanglement —
at least, insofar as these phenomena are manifest in the behavior of the
magnetic moment of a silver atom. You are now, with respect to quan-
tum mechanics, at the same level that you were, with respect to classical
mechanics, when you walked into your introductory mechanics course. It
is now our job to describe these quantal phenomena mathematically, to
explain them in terms of more general principles, and (eventually) to inves-
tigate situations more complex than the magnetic moment of one or two
silver atoms.
We’ve been talking about the state of the silver atom’s magnetic moment
by saying things like “the projection of the magnetic moment on the z axis
is µz = −µB ” or “µx = +µB ” or “µθ = −µB ”. This notation is clumsy.
First of all, it requires you to write down the same old µs time and time
again. Second, the most important thing is the axis (z or x or θ), and the
symbol for the axis is also the smallest and easiest to overlook.
89
90 What is a quantal state?
and it’s absurd to demand a specification for something that doesn’t ex-
ist. As we learn more and more quantum physics, we will learn better and
better how to specify states. There will be surprises. But always keep in
mind that (just as in classical mechanics) it is experiment, not philosophy
or meditation, and certainly not common sense, that tells us how to specify
states.
3.2 Amplitude
b
input
|z+i output
a
|z−i
a sinusoidal signal — in the function A sin(ωt), the symbol A represents the amplitude —
and this sinusoidal signal “amplitude” has nothing to do with the quantal “amplitude”.
One of my students correctly suggested that a better name for quantal amplitude would
be “proclivity”. But it’s too late now to change the word.
92 Amplitude
The first rule is a simple way to make sure that probabilities are al-
ways positive. The second rule is a natural generalization of the rule for
probabilities in series — that if an action happens through several stages,
the probability for the action as a whole is the product of the probabilities
for each stage. And the third rule simply restates the “desired property”
presented in equation (3.2).
We apply these rules to various situations that we’ve already encoun-
tered, beginning with the interference experiment sketched above. Recall
the probabilities already established (first column in table):
If rule (1) is to hold, then the amplitude to go from input to output must
also be 0, while the amplitude to go via a path must have magnitude 12
(second column in table). According to rule (3), the two amplitudes to
go via a and via b must sum to zero, so they cannot both be represented
by positive numbers. Whatever mathematical entity is used to represent
amplitude, it must enable two such entities, each with non-zero magnitude,
to sum to zero. There are many such entities: real numbers, complex
Forging Mathematical Tools 93
θ
|θ+i
|z+i
|θ−i
The amplitude that an atom entering the θ-analyzer in state |z+i exits in
state |θ+i is called3 hθ+|z+i. That phrase is a real mouthful, so the symbol
hθ+|z+i is pronounced “the amplitude that |z+i is in |θ+i”, even though
this briefer pronunciation leaves out the important role of the analyzer.4
From rule (1), we know that
|hθ+|z+i|2 = cos2 (θ/2) (3.3)
2 2
|hθ−|z+i| = sin (θ/2). (3.4)
You can also use rule (1), in connection with the experiments described in
3 The states appear in the symbol in the opposite sequence from their appearance in
the description.
4 The ultimate source of such problems is that the English language was invented by
people who did not understand quantum mechanics, hence they never produced concise,
accurate phrases to describe quantal phenomena. In the same way, the ancient phrase
“search the four corners of the Earth” is still colorful and practical, and is used today
even by those who know that the Earth doesn’t have four corners.
94 Amplitude
z
θ
a
input
|z+i output
|z−i
b
Rule (2), actions in series, tells us that the amplitude to go from |z+i to
|z−i via path a is the product of the amplitude to go from |z+i to |θ+i
times the amplitude to go from |θ+i to |z−i:
amplitude to go via path a = hz−|θ+ihθ+|z+i.
Similarly
amplitude to go via path b = hz−|θ−ihθ−|z+i.
And then rule (3), actions in parallel, tells us that the amplitude to go from
|z+i to |z−i is the sum of the amplitude to go via path a and the amplitude
to go via path b. In other words
hz−|z+i = hz−|θ+ihθ+|z+i + hz−|θ−ihθ−|z+i. (3.5)
5 The terms phase and magnitude are explained in appendix C, “Complex Arithmetic”.
Forging Mathematical Tools 95
amplitude magnitude
hz−|z+i 0
hz−|θ+i | sin(θ/2)|
hθ+|z+i | cos(θ/2)|
hz−|θ−i | cos(θ/2)|
hθ−|z+i | sin(θ/2)|
The task now is to assign phases to these magnitudes in such a way that
equation (3.5) is satisfied. In doing so we are faced with an embarrassment
of riches: there are many consistent ways to make this assignment. Here
are two commonly used conventions:
or the Swahili word “farasi”. The fact that language is pure human con-
vention, and that there are multiple conventions for the name of a horse,
doesn’t mean that language is unimportant: on the contrary language is
an immensely powerful tool. And the fact that language is pure human
convention doesn’t mean that you can’t develop intuition about language:
on the contrary if you know the meaning of “arachnid” and the meaning
of “phobia”, then your intuition for English tells you that “arachnopho-
bia” means fear of spiders. Exactly the same is true for amplitude: it is a
powerful tool, and with practice you can develop intuition for it.
When I introduced the phenomenon of quantal interference on page 62,
I said that there was no word or phrase in the English language that ac-
curately represents what’s going on: It’s flat-out wrong to say “the atom
takes path a” and it’s flat-out wrong to say “the atom takes path b”. It
gives a wrong impression to say “the atom takes no path” or “the atom
takes both paths”. I introduced the phrase “the atom ambivates through
the two paths of the interferometer”. Now we have a technically correct
way of describing the phenomenon: “the atom has an amplitude to take
path a and an amplitude to take path b”.
Here’s another warning about language: If an atom in state |ψi enters
a vertical analyzer, the amplitude for it to exit from the + port is hz+|ψi.
(And of course the amplitude for it exit from the − port is hz−|ψi.) This is
often stated “If the atom is in state |ψi, the amplitude of it being in state
|z+i is hz+|ψi.” This is an acceptable shorthand for the full explanation,
which requires thinking about an analyzer experiment, even though the
shorthand never mentions the analyzer. But never say “If the atom is in
state |ψi, the probability of it being in state |z+i is |hz+|ψi|2 .” This gives
the distinct and incorrect impression that before entering the analyzer, the
atom was either in state |z+i or in state |z−i, and you just didn’t know
which it was. Instead, say “If an atom in state |ψi enters a vertical analyzer,
the probability of exiting from the + port in state |z+i is |hz+|ψi|2 .”
Forging Mathematical Tools 97
1a
|ψi |φi
input output
1b
Solution: Because of rule (2), actions in series, the amplitude for the
atom to take the top path is the product
hφ|z+ihz+|ψi.
Similarly the amplitude for it to take the bottom path is
hφ|z−ihz−|ψi.
Because of rule (3), actions in parallel, the amplitude for it to ambivate
through both paths is the sum of these two, and we conclude that
hφ|ψi = hφ|z+ihz+|ψi + hφ|z−ihz−|ψi. (3.6)
1a
θ
2a
|ψi |φi
input output
1b
2b
Solution:
hφ|ψi = hφ|z+ihz+|ψi
+ hφ|z−ihz−|θ+ihθ+|z−ihz−|ψi (3.7)
+ hφ|z−ihz−|θ−ihθ−|z−ihz−|ψi
Problems
3.1 Talking about interference
An atom in state |ψi ambivates through a vertical analyzer. We say,
appropriately, that “the atom has an amplitude to take the top path
and an amplitude to take the bottom path”. For the benefit of students
in next year’s offering of this class (see page 20), find expressions for
those two amplitudes and describe, in ten sentences or fewer, why it is
not appropriate to say “the atom has probability |hz+|ψi|2 to take the
top path and probability |hz−|ψi|2 to take the bottom path”.
3.2 Other conventions
Two conventions for assigning amplitudes are given in the table on
page 95. Show that if hz−|θ+i and hz−|θ−i are multiplied by phase
factor eiα , and if hz+|θ+i and hz+|θ−i are multiplied by phase factor
eiβ (where α and β are both real), then the resulting amplitudes are
just as good as the original (for either convention I or convention II).
3.3. Reversal-conjugation relation 99
The “complex conjugate” of any complex number is the same number but
with every “i” changed to “−i”: if x and y are real numbers, then
z = x + iy has complex conjugate z ∗ = x − iy. (3.8)
A useful theorem says that the amplitude to go from state |ψi to state
|φi and the amplitude to go in the opposite direction are related through
complex conjugation:
∗
hφ|ψi = hψ|φi . (3.9)
The proof below works for states of the magnetic moment of a silver atom
— the kind of states we’ve worked with so far — but in fact the result holds
for any quantal system.
The proof relies on three facts: First, the probability for one state to
be analyzed into another depends only on the magnitude of the angle be-
tween the incoming magnetic moment and the analyzer, and not on the
sense of that angle. (An atom in state |z+i has the same probability of
leaving the + port of an analyzer whether it is rotated 17◦ clockwise or 17◦
counterclockwise.) Thus
|hφ|ψi|2 = |hψ|φi|2 . (3.10)
Second, an atom exits an interferometer in the same state in which it en-
tered, so
hφ|ψi = hφ|θ+ihθ+|ψi + hφ|θ−ihθ−|ψi. (3.11)
Third, an atom entering an analyzer comes out somewhere, so
1 = |hθ+|ψi|2 + |hθ−|ψi|2 . (3.12)
100 Reversal-conjugation relation
The proof also relies on a mathematical result called “the triangle in-
equality for complex numbers”: If a and b are real numbers with a + b = 1,
and in addition eiα a + eiβ b = 1, with α and β real, then α = β = 0. You
can find very general, very abstract, proofs of the triangle inequality, but
the complex plane sketch below encapsulates the idea:
imaginary
eiα a eiβ b
real
a b 1
From the first fact (3.10), the two complex numbers hφ|ψi and hψ|φi
have the same magnitude, so they differ only in phase. Write this statement
as
∗
hφ|ψi = eiδ hψ|φi (3.13)
where the phase δ is a real number that might depend on the states |φi and
|ψi. Apply this general result first to the particular state |φi = |θ+i:
∗
hθ+|ψi = eiδ+ hψ|θ+i , (3.14)
and then to the particular state |φi = |θ−i:
∗
hθ−|ψi = eiδ− hψ|θ−i , (3.15)
where the two real numbers δ+ and δ− might be different. Our objective is
to prove that δ+ = δ− = 0.
Apply the second fact (3.11) with |φi = |ψi, giving
1 = hψ|θ+ihθ+|ψi + hψ|θ−ihθ−|ψi
∗ ∗
= eiδ+ hψ|θ+ihψ|θ+i + eiδ− hψ|θ−ihψ|θ−i
= eiδ+ |hψ|θ+i|2 + eiδ− |hψ|θ−i|2
= eiδ+ |hθ+|ψi|2 + eiδ− |hθ−|ψi|2 . (3.16)
Compare this result to the third fact (3.12)
1 = |hθ+|ψi|2 + |hθ−|ψi|2 (3.17)
and use the triangle inequality with a = |hθ+|ψi|2 and b = |hθ−|ψi|2 . The
two phases δ+ and δ− must vanish, so the “reversal-conjugation relation”
is proven.
3.4. Establishing a phase convention 101
Problems
3.4 Other conventions, other peculiarities
Write what this section would have been had we adopted convention II
rather than convention I from page 95. In addition, evaluate the four
amplitudes of equation (3.18) for θ = +180◦ and θ = −180◦ .
3.5 Finding amplitudes (recommended problem)
Using the interference idea embodied in equation (3.22), calculate the
amplitudes hθ+|54◦ +i and hθ−|54◦ +i as a function of θ. Do these
amplitudes have the values you expect for θ = 54◦ ? For θ = 234◦ ?
Plot hθ+|54◦ +i for θ from 0◦ to 360◦ . Compare the result for θ = 0◦
and θ = 360◦ .
3.6 Rotations
Use the interference idea embodied in equation (3.22) to show that
hx+|θ+i = √12 [cos(θ/2) + sin(θ/2)]
hx−|θ+i = − √12 [cos(θ/2) − sin(θ/2)]
(3.23)
hx+|θ−i = √12 [cos(θ/2) − sin(θ/2)]
hx−|θ−i = √12 [cos(θ/2) + sin(θ/2)]
If and only if you enjoy trigonometric identities, you should then show
that these results can be written equivalently as
hx+|θ+i = cos((θ − 90◦ )/2)
hx−|θ+i = sin((θ − 90◦ )/2)
(3.24)
hx+|θ−i = − sin((θ − 90◦ )/2)
hx−|θ−i = cos((θ − 90◦ )/2)
This makes perfect geometric sense, as the angle relative to the x axis
is 90◦ less than the angle relative to the z axis:
x
3.5. How can I specify a quantal state? 103
We introduced the Dirac notation for quantal states on page 90, but haven’t
yet fleshed out that notation by specifying a state mathematically. Start
with an analogy:
We are so used to writing down the position vector ~r that we rarely stop
to ask ourselves what it means. But the plain fact is that whenever we
measure a length (say, with a meter stick) we find not a vector, but a single
number! Experiments measure never the vector ~r but always a scalar —
the dot product between ~r and some other vector, call it ~s for “some other”.
If we know the dot product between ~r and every vector ~s, then we know
everything there is to know about ~r. Does this mean that to specify ~r, we
must keep a list of all possible dot products ~s · ~r ? Of course not. . . such a
list would be infinitely long!
You know that if you write ~r in terms of an orthonormal basis {î, ĵ, k̂},
namely
~r = rx î + ry ĵ + rz k̂ (3.25)
where rx = î · ~r, ry = ĵ · ~r, and rz = k̂ · ~r, then you’ve specified the vector.
Why? Because if you know the triplet (rx , ry , rz ) and the triplet (sx , sy , sz ),
then you can easily find the desired dot product
rx
~s · ~r = sx sy sz ry = sx rx + sy ry + sz rz . (3.26)
rz
It’s a lot more compact to specify the vector through three dot products
— namely î · ~r, ĵ · ~r, and k̂ · ~r — from which you can readily calculate an
infinite number of desired dot products, than it is to list all infinity dot
products themselves!
Like the position vector ~r, the quantal state |ψi cannot by itself be mea-
sured. But if we determine (through some combination of analyzer exper-
iments, interference experiments, and convention) the amplitude hσ|ψi for
104 How can I specify a quantal state?
every possible state |σi, then we know everything there is to know about
|ψi. Is there some compact way of specifying the state, or do we have to
keep an infinitely long list of all these amplitudes?
This nut is cracked through the interference experiment result
hσ|ψi = hσ|θ+ihθ+|ψi + hσ|θ−ihθ−|ψi, (3.27)
which simply says, in symbols, that the atom exits an interferometer in the
same state in which it entered (see equation 3.11). It gets hard to keep
track of all these symbols, so I’ll introduce the names
hθ+|ψi = ψ+
hθ−|ψi = ψ−
and
hθ+|σi = σ+
hθ−|σi = σ− .
From the reversal-conjugation relation, this means
∗
hσ|θ+i = σ+
∗
hσ|θ−i = σ− .
In terms of these symbols, the interference result (3.27) is
∗ ∗ ∗ ∗
ψ+
hσ|ψi = σ+ ψ+ + σ− ψ− = σ+ σ− . (3.28)
ψ−
And this is our shortcut! By keeping track of only two amplitudes, ψ+ and
ψ− , for each state, we can readily calculate any amplitude desired. We
don’t have to keep an infinitely long list of amplitudes.
This dot product result for computing amplitude is so useful and so
convenient that sometimes people say the amplitude is a dot product. No.
The amplitude reflects analyzer experiments, plus interference experiments,
plus convention. The dot product is a powerful mathematical tool for com-
puting amplitudes. (A parallel situation: There are many ways to find the
latitude and longitude coordinates for a point on the Earth’s surface, but
the easiest is to use a GPS device. Some people are so enamored of this
ease that they call the latitude and longitude the “GPS coordinates”. But
in fact the coordinates were established long before the Global Positioning
System was built.)
Forging Mathematical Tools 105
For quantal states, we’ve seen that a set of two states such as
{|θ+i, |θ−i} plays a similar role, so it too is called a basis. For the magnetic
moment of a silver atom, two states |ai and |bi constitute a basis when-
ever ha|bi = 0, and the analyzer experiment of section 2.1.3 shows that
the states |θ+i and |θ−i certainly satisfy this requirement. In the basis
{|ai, |bi} an arbitrary state |ψi can be conveniently represented through
the pair of amplitudes
ha|ψi
.
hb|ψi
tional analysis, geometry, mathematical physics, and other areas. He formalized and
extended the concept of a vector space. Hilbert and Albert Einstein raced to uncover
the field equations of general relativity, but Einstein beat Hilbert by a matter of weeks.
106 How can I specify a quantal state?
Whereas (in light of equation 3.23) in the basis {|x+i, |x−i} that same
state |θ+i is represented by the different column matrix
!
√1 [cos(θ/2) + sin(θ/2)]
hx+|θ+i 2
= . (3.30)
hx−|θ+i − √12 [cos(θ/2) − sin(θ/2)]
N. Boyd, Organic Chemistry, second edition (Allyn & Bacon, Boston, 1966) page 318.
Forging Mathematical Tools 107
(“taking path a” and “taking path b”) that are familiar (to all of us steeped
in the classical approximation) but that do not happen.
In principle, any calculation performed using the Hilbert space rep-
resentation of states could be performed by considering suitable, cleverly
designed analyzer and interference experiments. But it’s a lot easier to use
the abstract Hilbert space machinery. (Similarly, any result in electrostatics
could be found using Coulomb’s Law, but it’s a lot easier to use the ab-
stract electric field and electric potential. Any calculation involving vectors
could be performed graphically, but it’s a lot easier to use abstract compo-
nents. Any addition or subtraction of whole numbers could be performed
by counting out marbles, but it’s a lot easier to use abstract mathematical
tools like carrying and borrowing.)
Because state vectors are built from amplitudes, and amplitudes have pe-
culiarities (see pages 95 and 101), it is natural that state vectors have
similar peculiarities. For example, since the angle θ is the same as the an-
gle θ + 360◦ , I would expect that the state vector |θ+i would be the same
as the state vector |(θ + 360◦ )+i.
But in fact, in the {|z+i, |z−i} basis, the state |θ+i is represented by
hz+|θ+i cos(θ/2)
= , (3.32)
hz−|θ+i sin(θ/2)
so the state |(θ + 360◦ )+i is represented by
hz+|(θ + 360◦ )+i cos((θ + 360◦ )/2)
= (3.33)
hz−|(θ + 360◦ )+i sin((θ + 360◦ )/2)
cos(θ/2 + 180◦ )
− cos(θ/2)
= = .
sin(θ/2 + 180◦ ) − sin(θ/2)
So in fact |θ+i = −|(θ + 360◦ )+i. Bizarre!
This bizarreness is one facet of a general rule: If you multiply any state
vector by a complex number with magnitude unity — a number such as
−1, or i, or √12 (−1 + i), or e2.7i — a so-called “complex unit” or “phase
factor” — then you get a different state vector that represents the same
state. This fact is called “global phase freedom” — you are free to set the
overall phase of your state vector for your own convenience. This general
108 How can I specify a quantal state?
rule applies only for multiplying both elements of the state vector by the
same complex unit: if you multiply the two elements with different complex
units, you will obtain a vector representing a different state (see problem 3.8
on page 110).
The vector ~r is specified in the basis {î, ĵ, k̂} by the three components
î · ~r
rx
ry = ĵ · ~r .
rz k̂ · ~r
Because this component specification is so convenient, it is sometimes said
that the vector ~r is not just specified, but is equal to this triplet of numbers.
That’s false.
Think of the vector ~r = 5î + 5ĵ. It is represented in the basis {î, ĵ, k̂} by
the triplet (5,√5, 0). But this is√not the only basis that exists. In the basis
{î0 = (î+√
ĵ)/ 2, ĵ 0 = (−î+ ĵ)/ 2, k̂}, that same vector is represented
√ by the
triplet (5 2, 0, 0). If we had said that ~r = (5, 5, 0)√ and that ~
r = (5 2, 0, 0),
then we would be forced to conclude that 5 = 5 2 and that 5 = 0!
ĵ
6
0
ĵ î0
@
I
@
@
@
@
@ - î
(5, 5, 0)” meaning “the vector ~r = 5î + 5ĵ is represented by the triplet
(5, 5, 0) in the basis {î, ĵ, k̂}”.
Vectors are physical things: a caveman throwing a spear at a mam-
moth was performing addition of position vectors, even though the caveman
didn’t understand basis vectors or Cartesian coordinates. The concept of
“position” was known to cavemen who did not have any concept of “basis”.
We’ve been specifying a state like |ψi = |17◦ +i by stating the axis upon
which the projection of µ~ is definite and equal to +µB — in this case, the
axis tilted 17◦ from the vertical.
Another way to specify a state |ψi would be to give the amplitude
that |ψi is in any possible state: that is, to list hθ+|ψi and hθ−|ψi for
all values of θ: 0◦ ≤ θ < 360◦ . One of those amplitudes (in this case
h17◦ +|ψi) will have value 1, and finding this one amplitude would give
us back the information in the specification |17◦ +i. In some ways this is a
more convenient specification because we don’t have to look up amplitudes:
they’re right there in the list. On the other hand it is an awful lot of
information to have to carry around.
The Hilbert space approach is a third way to specify a state that com-
bines the brevity of the first way with the convenience of the second way.
Instead of listing the amplitude hσ|ψi for every state |σi we list only the
two amplitudes ha|ψi and hb|φi for the elements {|ai, |bi} of a basis. We’ve
already seen (equation 3.28) how quantal interference then allows us to
readily calculate any amplitude.
Just as we said “the position vector ~r is represented in the basis {î, ĵ, k̂}
as (1, 1, 0)” or
.
~r = (1, 1, 0),
so we say “the quantal state |ψi is represented in the basis {|z+i, |z−i} as
. hz+|ψi
|ψi = .”
hz−|ψi
110 How can I specify a quantal state?
Problems
3.7 Superposition and interference (recommended problem)
On page 106 I wrote that “When an atom ambivates through an in-
terferometer, its state is a superposition of the state of an atom taking
path a and the state of an atom taking path b.”
a. Write down a superposition equation reflecting this sentence for
the interference experiment sketched on page 91.
b. Do the same for the interference experiment sketched on page 94.
3.8 Representations (recommended problem)
In the {|z+i, |z−i} basis the state |ψi is represented by
ψ+
.
ψ−
(In other words, ψ+ = hz+|ψi and ψ− = hz−|ψi.)
a. If ψ+ and ψ− are both real, show that there is one and only one
axis upon which the projection of µ
~ has a definite, positive value,
and find the angle between that axis and the z axis in terms of
ψ+ and ψ− .
b. What would change if you multiplied both ψ+ and ψ− by the same
phase factor (complex unit)?
c. What would change if you multiplied ψ+ and ψ− by different phase
factors?
This problem invites the question “What if the ratio of ψ+ /ψ− is not
pure real?” When you study more quantum mechanics, you will find
that in this case the axis upon which the projection of µ
~ has a definite,
positive value is not in the x-z plane, but instead has a component in
the y direction as well.
3.9 Addition of states
Some students in your class wonder “What does it mean to ‘add two
quantal states’ ? You never add two classical states.” For the Under-
ground Guide to Quantum Mechanics (see page 20) you decide to write
four sentences interpreting the equation
|ψi = a|z+i + b|z−i (3.34)
describing why you can add quantal states but can’t add classical states.
Your four sentences should include a formula for the amplitude a in
terms of the states |ψi and |z+i.
Forging Mathematical Tools 111
|ψi
| ↑↓ i
| ↓↑ i
| ↑↑ i
| ↓↓ i
Set up this EPR experiment with the left analyzer 100 kilometers from the
source, and the right analyzer 101 kilometers from the source. As soon as
the left atom comes out of its − port, then it is known that the right atom
will come out if its + port. The system is no longer in the entangled state
√1 (| ↑↓ i − | ↓↑ i); instead the left atom is in state | ↓ i and the right atom
2
is in state | ↑ i. The state of the right atom has changed (some say it has
“collapsed”) despite the fact that it is 200 kilometers from the left analyzer
that did the state changing!
This fact disturbs those who hold the misconception that states are
physical things located out in space like nitrogen molecules, because it
seems that information about state has made an instantaneous jump across
200 kilometers. In fact no information has been transferred from left to
right: true, Alice at the left interferometer knows that the right atom will
8 LeonardSusskind and Art Friedman, Quantum Mechanics: The Theoretical Minimum
(Basic Books, New York, 2014) page xii.
114 States for entangled systems
exit the + port 201 kilometers away, but Bob at the right interferome-
ter doesn’t have this information and won’t unless she tells him in some
conventional, light-speed-or-slower fashion.9
If Alice could in some magical way manipulate her atom to ensure that
it would exit the − port, then she could send a message instantaneously.
But Alice does not possess magic, so she cannot manipulate the left-bound
atom in this way. Neither Alice, nor Bob, nor even the left-bound atom
itself knows from which port it will exit. Neither Alice, nor Bob, nor even
the left-bound atom itself can influence from which port it will exit.
Back in section 2.4, “Light on the atoms” (page 70), we discussed the
character of “observation” or “measurment” in quantum mechanics. Let’s
bring our new machinery concerning quantal states to bear on this situation.
The figure on the next page shows, in the top panel, a potential mea-
surement about to happen. An atom (represented by a black dot) in state
|z+i approaches a horizontal interferometer at the same time that a photon
(represented by a white dot) approaches path a of that interferometer.
We employ a simplified model in which the photon either misses the
atom, in which case it continues undeflected upward, or else the photon
interacts with the atom, in which case it is deflected outward from the
page. In this model there are four possible outcomes, shown in the bottom
four panels of the figure.
After this potential measurement, the system of photon plus atom is
in an entangled state: the states shown on the right must list both the
condition of the photon (“up” or “out”) and the condition of the atom (+
or −).
If the photon misses the atom, then the atom must emerge from the +
port of the analyzer: there is zero probability that the system has final state
|up; −i. But if the photon interacts with the atom, then the atom might
emerge from either port: there is non-zero probability that the system has
9 If you are familiar with gauges in electrodynamics, you will find quantal state similar
to the Coulomb gauge. In the Coulomb gauge, the electric potential at a point in
space changes the instant that any charged particle moves, regardless of how far away
that charged particle is. This does not imply that information moves instantly, because
electric potential by itself is not measurable. The same applies for quantal state.
Forging Mathematical Tools 115
final state |out; −i. These two states are exactly the same as far as the
atom is concerned; they differ only in the position of the photon.
If we focus only on the atom, we would say that something strange has
happened (a “measurement” at path a) that enabled the atom to emerge
from the − port which (in the absence of “measurement”) that atom would
never do. But if we focus on the entire system of photon plus atom, then
it is an issue of entanglement, not of measurement.
b
|z+i
|ψi
a
|up; +i
a
|up; −i
a
|out; +i
a
|out; −i
a
116 What is a qubit?
Problem
3.13 Amplitudes for “Measurement and entanglement”
Suppose that, in the “simplified model” for measurement and entan-
glement, the probability for photon deflection is 15 . Find the four prob-
abilities |hup; +|ψi|2 , |hup; −|ψi|2 , |hout; +|ψi|2 , and |hout; −|ψi|2 .
At the end of the last chapter (on page 88) we listed several so-called “two-
state systems” or “spin- 12 systems” or “qubit systems”. You might have
found these terms strange: There are an infinite number of states for the
magnetic moment of a silver atom: |z+i, |1◦ +i, |2◦ +i, and so forth. Where
does the name “two-state system” come from? You now see the answer:
it’s short for “two-basis-state system”.
The term “spin” originated in the 1920s when it was thought that an
electron was a classical charged rigid sphere that created a magnetic mo-
ment through spinning about an axis. A residual of that history is that
Forging Mathematical Tools 117
people still call10 the state |z+i by the name “spin up” and by the symbol
| ↑ i, and the state |z−i by “spin down” and | ↓ i. (Sometimes the associa-
tion is made in the opposite way.) Meanwhile the state |x+i is given the
name “spin sideways” and the symbol | → i.
Today, two-basis-state systems are more often called “qubit” systems
from the term used in quantum information processing. In a classical com-
puter, like the ones we use today, a bit of information can be represented
physically by a patch of magnetic material on a disk: the patch magnetized
“up” is interpreted as a 1, the patch magnetized “down” is interpreted as
a 0. Those are the only two possibilities. In a quantum computer, a qubit
of information can be represented physically by the magnetic moment of a
silver atom: the atom in state |z+i is interpreted as |1i, the atom in state
|z−i is interpreted as |0i. But the atom might be in any (normalized) su-
perposition a|1i + b|0i, so rather than two possibilities there are an infinite
number.
Furthermore, qubits can interfere with and become entangled with other
qubits, options that are simply unavailable to classical bits. With more
states, and more ways to interact, quantum computers can only be faster
than classical computers, and even as I write these possibilities are being
explored.
In today’s state of technology, quantum computers are hard to build,
and they may never live up to their promise. But maybe they will.
Chapters 2 and 3 have focused on two-basis-state systems, but of course
nature provides other systems as well. For example, the magnetic moment
of a nitrogen atom (mentioned on page 45) is a “four-basis-state” system,
where one basis is
|z; +2i, |z; +1i, |z; −1i, |z; −2i. (3.38)
In fact, the next chapter shifts our focus to a system with an infinite number
of basis states.
10 The very most precise and pedantic people restrict the term “spin” to elementary
particles, such as electrons and neutrinos. For composite systems like the silver atom
they speak instead of “the total angular momentum J~ of the silver atom in its ground
state, projected on a given axis, and divided by ~.” For me, the payoff in precision is
not worth the penalty in polysyllables.
118 What is a qubit?
Problem
3.14 Questions (recommended problem)
Update your list of quantum mechanics questions that you started at
problem 1.17 on page 46. Write down new questions and, if you have un-
covered answers to any of your old questions, write them down briefly.
[[For example, one of my questions would be: “I’d like to see a proof
that the global phase freedom mentioned on page 107, which obviously
changes the amplitudes computed, does not change any experimentally
accessible result.”]]
Chapter 4
In the last two chapters we’ve studied the quantum mechanics of a silver
atom’s magnetic moment, and we got a lot out of it: we uncovered the
phenomena of quantization and interference and entanglement; we found
how to use amplitude as a mathematical tool to predict probabilities; we
learned about quantum mechanical states. If all of this makes you feel weak
and dizzy, that’s a good thing: Niels Bohr pointed out that “those who are
not shocked when they first come across quantum theory cannot possibly
have understood it.”1 As profitable as this has been, we knew from the start
(page 47) that eventually we would need to treat the quantum mechanics
of position. Now is the time.
Chapters 2 and 3 treated the atom’s magnetic moment but (to the extent
possible) ignored the atom’s position. This chapter starts off with the
opposite approach: it treats only position and ignores magnetic moment.
Section 4.12, “Position plus spin”, at the end of this chapter welds the two
aspects together.
A single particle ambivates in one dimension. You know the story of quan-
tum mechanics: The particle doesn’t have a position. Yet if we measure
the position (say, by shining a lamp), then we will find that it has a sin-
gle position. However, because the particle started out without a position,
1 Recalled by Werner Heisenberg in Physics and Beyond (Harper and Row, New York,
119
120 Probability and probability density: One particle in one dimension
0.2
0.1
0.0
00 20 40 60 80 100
position (cm)
Now you want more detail about the ant’s location. Instead of dividing
the ant farm into five conceptual bins each of width 20.0 cm, divide it into
ten bins each of width 10.0 cm. The probabilities now look like:
0.2
0.1
0.0
00 10 20 30 40 50 60 70 80 90 100
position (cm)
There are now ten probabilities, yet they still sum to 1, so the probabilities
are each smaller. (For example, the first graph shows a probability of 0.28
for the ant appearing between 0.0 cm and 20.0 cm. The second graph
shows probability 0.18 for the ant appearing between 0.0 cm and 10.0 cm
and probability 0.10 for the ant appearing between 10.0 cm and 20.0 cm.
Sure enough 0.28 = 0.18 + 0.10.)
122 Probability and probability density: One particle in one dimension
If you want still more detail, you can divide the ant farm into fifty bins,
each of width 2.0 cm, as in:
0.2
0.1
0.0
00 10 20 30 40 50 60 70 80 90 100
position (cm)
The fact that the bin probabilities, summed over all bins, is unity, or in
symbols
X
Pi = 1,
i
Problem
4.1 Mean and standard deviation for an ant (essential problem)
The mean2 ant position hxi is given by summing all the position mea-
surements and dividing that sum by the number of measurements (in
this case 9741). Using Pi for the probability of the ant appearing in
bin i and xi for the position of the center of bin i, argue that this mean
position is given approximately by
X
xi P i
bin i
and that the approximation grows better and better as the bins grow
narrower and narrower. The formula becomes exact when ∆x → 0, so
show that the mean value is given by
Z 100.0 cm
hxi = xρ(x) dx. (4.1)
0.0 cm
In the same way, argue that the standard deviation of ant position is
s
Z 100.0 cm
(x − hxi)2 ρ(x) dx. (4.2)
0.0 cm
2 The “mean value” is also called the “average value” and sometimes the “expectation
value”. The latter name is particularly poor. If you toss a die, the mean value of the
number facing up is 3.5. Yet no one expects to toss a die and find the number 3.5 facing
up!
124 Probability amplitude
The probability considerations for one ant walking in one dimension are
directly analogous to the probability considerations for one quantal parti-
cle ambivating in one dimension. A graph with five bins like the one on
page 120 approximates the quantal particle as a five-state system. A graph
with ten bins like the one on page 121 approximates the quantal particle
as a ten-state system. A graph with fifty bins like the one on page 122
approximates the quantal particle as a fifty-state system. You know the
drill of quantum mechanics: in all these cases the bin probability P0 will
be related to some sort of bin amplitude ψ0 through P0 = |ψ0 |2 . How does
bin amplitude behave as ∆x → 0? Because
P0 |ψ0 |2 ψ
= → ρ(x0 ), √ 0 → ψ(x0 ),
we will have
∆x ∆x ∆x
where, for any point x0 , the probability density is
ρ(x0 ) = |ψ(x0 )|2 . (4.3)
What would be a good name for this function ψ(x)? I like the name
“amplitude density”. It’s not really a density: a density
p would have di-
mensions 1/[length], whereas ψ(x) has dimensions 1/ [length]. But it’s
closer to a density than it is to anything else. Unfortunately, someone else
(namely Schrödinger3 ) got to name it before I came up with this sensible
name, and that name has stuck. It’s called “wavefunction”.
The normalization condition for wavefunction is
Z +∞
|ψ(x)|2 dx = 1. (4.4)
−∞
You should check for yourself that this equation is dimensionally consistent.
The global phase freedom described for qubit systems on page 107 ap-
plies for wavefunctions as well: If you multiply any wavefunction by a com-
plex number with magnitude unity — called a “complex unit” or “phase
3 Erwin Schrödinger (1887–1961) had interests in physics, biology, philosophy, and East-
ern religion. Born in Vienna, he held physics faculty positions in Germany, Poland, and
Switzerland. In 1926 he developed the concept of wavefunction and discovered the quan-
tum mechanical time evolution equation (4.12) that now bears his name. This led, in
1927, to a prestigious appointment in Berlin. In 1933, disgusted with the Nazi regime, he
left Berlin for Oxford, England. He held several positions in various cities before ending
up in Dublin. There, in 1944, he wrote a book titled What is Life? which stimulated
interest in what had previously been a backwater of science: biochemistry.
The Quantum Mechanics of Position 125
factor” — then you get a different wavefunction that represents the same
state. You must multiply by a number with magnitude unity, not a function
with magnitude unity: the wavefunctions ψ(x) and eiδ ψ(x) represent the
same state, but the wavefunction eiδ(x) ψ(x) represents a different state.
Keep in mind that to specify a quantal state we must know amplitudes
(wavefunction) rather than merely probabilities (probability density). Just
as in classical mechanics (see page 90) we say that we know a state when we
know all the information needed to describe the system now and to predict
its future. The probability density ρ(x) alone tells you a lot about the
state right now, but cannot predict how the state will change in the future.
Knowing probability density alone in quantum mechanics is like knowing
the position alone in classical mechanics: The probability density gives a
lot of information about now, but not enough information to predict the
future.
Problems
4.2 Mean and standard deviation for a quantal particle
(essential problem)
For any function f (x), define the mean value
Z +∞
hf (x)i = f (x)|ψ(x)|2 dx. (4.5)
−∞
Show that the mean position is hxi and that the standard deviation of
position ∆x is given through
2
(∆x)2 = hx2 i − hxi . (4.6)
I’m going to throw down three equations. First, the classical formula for
energy,
p2
E= + V, (4.7)
2m
where V is the potential energy. Second, the Einstein and de Broglie rela-
tions for energy and momentum (1.21) and (1.24)
E = ~ω and p = ~k. (4.8)
Third, the particular wavefunction
ψ(x, t) = Aei(kx−ωt) . (4.9)
I concede from the very start that this is a stupid argument, and that if
you had proposed it to me I would have gone ballistic. First, equation (4.7)
is a classical fact plopped mindlessly into a quantal argument. Second,
the Einstein relation (4.8) applies to photons, not to massive particles.
Third, there are many possible wavefunctions other than equation (4.9).
The unjustified change of potential energy value V in equation (4.11) to
potential energy function V (x) in equation (4.12) merely adds insult to
4.4. Wavefunction: Two particles 127
injury. The only good thing I can say about this equation is that it’s
dimensionally consistent.
Oh, and one more thing. The equation is correct. Despite its dubious
provenance, experimental tests have demonstrated to everyone’s satisfac-
tion that wavefunction really does evolve in time this way. (I must qualify:
wavefunction evolves this way in a wide range of situations: non-relativistic,
no magnetic field, no friction or any other non-conservative force, and where
the particle’s magnetic moment is unimportant.)
This equation for time evolution in quantal systems plays the same cen-
tral role in quantum mechanics that F~ = m~a does in classical mechanics.
And just as F~ = m~a cannot be derived, only motivated and then tested
experimentally, so this time-evolution result cannot be derived. The mo-
tivation is lame, but the experimental tests are impressive and cannot be
ignored.
This time evolution equation has a name: it is “the Schrödinger equa-
tion”.
We will soon work on solving the Schrödinger equation for one particle
in one dimension, but first we ask how to describe two particles in one
dimension.
Two particles, say an electron and a neutron, ambivate in one dimension.
As before, we start with a grid of bins in one-dimensional space:
- ∆x
- x
i j
We ask for the probability that the electron will be found in bin i and
the neutron will be found in bin j, and call the result Pi,j . Although
our situation is one-dimensional, this question generates a two-dimensional
array of probabilities.
128 Wavefunction: Two particles
bin of neutron
6
Pi,j
j
- bin of electron
i
?
To produce a probability density, we must divide the bin probability Pi,j
by (∆x)2 , and then take the limit as ∆x → 0, resulting in
Pi,j
→ ρ(xe , xn ).
(∆x)2
So the probability of finding an electron within a narrow window of width w
centered on xe = 5 and finding the neutron within a narrow window of width
u centered on xn = 9 is approximately ρ(5, 9)wu, and this approximation
grows better and better as the two windows grow narrower and narrower.
2
p Pi,j = |ψi,j | . To turn a bin amplitude
The bin amplitude is ψi,j with
2
into a wavefunction, divide by (∆x) = ∆x and take the limit
ψi,j
lim = ψ(xe , xn ). (4.13)
∆x→0 ∆x
This wavefunction has dimensions 1/[length].
The generalization to more particles and higher dimensionality is
straightforward. For a single electron in three-dimensional space, the wave-
function ψ(~x) has dimensions 1/[length]3/2 . For an electron and a neu-
tron in three-dimensional space, the wavefunction ψ(~xe , ~xn ) has dimensions
1/[length]3 . Note carefully: For a two-particle system, the state is speci-
fied by one function ψ(~xe , ~xn ) of six variables. It is not specified by two
functions of three variables, with ψe (~x) giving the state of the electron and
The Quantum Mechanics of Position 129
ψn (~x) giving the state of the neutron. There are four consequences of this
simple yet profound observation.
First, the wavefunction (like amplitude in general) is a mathematical
tool for calculating the results of experiments; it is not physically “real”.
I have mentioned this before, but it particularly stands out here. Even
for a system as simple as two particles, the wavefunction does not exist in
ordinary three-dimensional space, but in the six-dimensional “configuration
space”, as it is called. I don’t care how clever or talented an experimentalist
you are: you cannot insert an instrument into six-dimensional space in order
to measure wavefunction.
Second, wavefunction is associated with a system, not with a particle. If
you’re interested in a single electron and you say “the wavefunction of the
electron”, then you’re technically incorrect — you should say “the wave-
function of the system consisting of a single electron” — but no one will go
ballistic and say that you are in thrall of a deep misconception. However,
if you’re interested in a pair of particles (an electron and a neutron, for
instance) and you say “the wavefunction of the electron”, then someone
(namely me) will go ballistic because you are in thrall of a deep misconcep-
tion.
Third, it might happen that the wavefunction factorizes:
ψ(~xe , ~xn ) = ψe (~xe )ψn (~xn ).
In this case the electron has state ψe (~xe ) and the neutron has state ψn (~xn ).
Such a peculiar case is called “non-entangled”. But in all other cases the
state is called “entangled” and the individual particles making up the sys-
tem do not have states. The system has a state, namely ψ(~xe , ~xn ), but
there is no state for the electron and no state for the neutron, in exactly
the same sense that there is no position for a silver atom ambivating through
an interferometer.
Fourth, quantum mechanics is intricate. To understand this point, con-
trast the description needed in classical versus quantum mechanics.
How does one describe the state of a single classical particle moving
in one dimension? It requires two numbers: a position and a velocity.
Two particles moving in one dimension require merely that we specify the
state of each particle: four numbers. Similarly specifying the state of three
particles require six numbers and N particles require 2N numbers. Exactly
the same specification counts hold if the particle moves relativistically.
130 Wavefunction: Two particles
How, in contrast, does one describe the state of a single quantal par-
ticle ambivating in one dimension? Here an issue arises at the very start,
because the specification is given through a complex-valued wavefunction
ψ(x). Technically the specification requires an infinite number of numbers!
Let’s approximate the wavefunction through its value on a grid of, say, 100
points. This suggests that a specification requires 200 real numbers, a com-
plex number at each grid point, but global phase freedom means that we
can always set one of those numbers to zero through an overall phase factor,
and one number is not independent through the normalization requirement.
The specification actually requires 198 independent real numbers.
How does one describe the state of two quantal particles ambivating
in one dimension? Now the wavefunction is a function of two variables,
ψ(xe , xn ). The wavefunction of the system is a function of two-dimensional
configuration space, so an approximation of the accuracy established previ-
ously requires a 100×100 grid of points. Each grid point carries one complex
number, and again overall phase and normalization reduce the number of
real numbers required by two. For two particles the specification requires
2 × (100)2 − 2 = 19 998 independent real numbers.
Similarly, specifying the state of N quantal particles moving in one
dimension requires a wavefunction in N -dimensional configuration space
which (for a grid of the accuracy we’ve been using) is specified through
2 × (100)N − 2 independent real numbers.
The specification of a quantal state not only requires more real numbers
than the specification of the corresponding classical state, but that number
increases exponentially rather than linearly with particle number N .
The fact that a quantal state holds more information than a classical
state is the fundamental reason that a quantal computer can be (in prin-
ciple) faster than a classical computer, and the basis for much of quantum
information theory.
Relativity is different from classical physics, but no more complicated.
Quantum mechanics, in contrast, is both different from and richer than
classical physics. You may refer to this richness using terms like “splendor”,
or “abounding”, or “intricate”, or “ripe with possibilities”. Or you may
refer to it using terms like “complicated”, or “messy”, or “full of details
likely to trip the innocent”. It’s your choice how to react to this richness,
but you can’t deny it.
4.5. Solving the Schrödinger equation for the I.S.W. 131
V (x)
ψ(x)
x
0 L
The infinite square well potential energy function V (x) in olive green, and
a possible wavefunction ψ(x) in red.
It is reasonable (although not rigorously proven) that for the infinite
square well
0 for x ≤ 0
ψ(x, t) = something for 0 < x < L
for L ≤ x
0
and we adopt these conditions.
Strategy. The PDE is linear, so if we find some special solutions
f1 (x, t), f2 (x, t), f3 (x, t), . . . , then we can generate many more solutions
through
X
Dn fn (x, t),
n
where D1 , D2 , D3 , . . . represent constants. Because any Dn can be any
possible complex number, this is a big set of solutions; indeed it might be
a big enough set to be the most general solution. Once we have the most
general solution, we will need to find the values of Dn that correspond to
the particular initial condition ψ(x, 0).
The Quantum Mechanics of Position 133
When you solved the classical problem of a mass on a spring, you had to
supplement the ODE solution with the initial values f (0) = x0 , f 0 (0) = v0 ,
to find the constants C and D. This is called an “initial value problem”. For
the problem of a particle in a box, we don’t have an initial value problem;
instead we are given Xn (0) = 0 and Xn (L) = 0, which is called a “boundary
value problem”.
Plugging x = 0 into equation (4.18) will be easier than plugging in
x = L, so I’ll do that first. The result gives
Xn (0) = Cn cos(0) + Dn sin(0) = Cn ,
so the boundary value Xn (0) = 0 means that Cn = 0 — for all values of n!
Thus
p
Xn (x) = Dn sin(( 2mEn /~)x). (4.19)
or else
e+iθ − e−iθ
sin θ = ,
2i
whichever you like better. (Or you can look at problem 4.5, “Informal inte-
gration”, on page 138, for an informal but easily remembered treatment.)
To employ this fact, start with
∞
X
Dn sin(nπx/L) = ψ0 (x),
n=1
This looks even worse, until you realize that all but one of the terms on the
left vanish! Once you do make that realization, you find
Z L
Dm (L/2) = ψ0 (x) sin(mπx/L) dx
0
and you have a formula for Dm .
Pulling all things together. For a particle of mass m ambivating
in an infinite square well of width L, how does the quantal wave function
change (“evolve”) with time? If the initial wavefunction is ψ0 (x), then the
wavefunction at time t is
X∞
ψ(x, t) = Dn e−(i/~)En t sin(nπx/L), (4.21)
n=1
where
π 2 ~2
En = n2 (4.22)
2mL2
and
Z L
2
Dn = ψ0 (x) sin(nπx/L) dx. (4.23)
L 0
138 Solving the Schrödinger equation for the I.S.W.
Problem
4.5 Informal integration (recommended problem)
The integral (4.20) undergirds the Fourier sine series technique, and
it’s useful to remember. Here’s how I do it. If n 6= m the integrand is
sometimes positive, sometimes negative over its range from 0 to L, so
it’s plausible that the two signs cancel out and result in a zero integral.
If n = m the integrand is always positive, so it must not be zero. But
what is it?
a. Below is a graph of sin2 (3πx/L).
sin2 (3πx/L)
6
1
0 - x
0 L
What is the area within the dashed box? Does it look like the area
above the curve within the box is the same as the area below the
curve within the box? What can you conclude about the value of
the integral
Z L
sin2 (3πx/L) dx ?
0
b. Make the above argument rigorous using the relationship
sin2 θ + cos2 θ = 1.
Your result should be that the integral
Z L
sin2 (nπx/L) dx
0
has the same value whenever n is a non-zero integer.
4.6. What did we learn? 139
V (x)
η(x)
x
0 L
|η(x)|2
This particular wavefunction has two interior zeros, also called nodes. A
common question is “There is zero probability of finding the particle at
the node, so how can it move from one side of the node to the other?”
People who ask this question suffer from the misconception that the particle
is an infinitely small, infinitely hard version of a classical marble, which
hence has a definite position. They think that the definite position of
this infinitely small marble is changing rapidly, or changing erratically, or
changing unpredictably, or changing subject to the slings and arrows of
outrageous fortune. In truth, the quantal particle in this state doesn’t have
a definite position: it doesn’t have a position at all! The quantal particle
in the state above doesn’t, can’t, change its position from one side of the
node to the other, because the particle doesn’t have a position.
Investigating the solution technique. But I want to do more than
investigate the properties of the solution, I want to investigate the charac-
teristics of the solution technique. In his book Mathematics in Action,
O. Graham Sutton writes that “A technique succeeds in mathematical
The Quantum Mechanics of Position 141
(1) Finding many particular solutions of the PDE that happen to factorize:
f (x, t) = X(x)T (t) (“separation of variables”).
(2) Summing all of these particular solutions to find a more general (and,
P
as it turns out, the most general) PDE solution: Dn Xn (x)Tn (t)
(“superposition”).
(3) Finding the coefficients Dn that match up to initial value ψ0 (x)
(“Fourier6 sine series”).
Fourier sine series. Let’s look at the last step first. The technique of
Fourier sine series is generally powerful. Any function f (x) with f (0) = 0
and f (L) = 0 can be expressed as
∞
2 L
X Z
f (x) = fn sin(nπx/L) where fn = f (x) sin(nπx/L) dx.
n=1
L 0
This seems paradoxical: complete information about the function is ob-
tained through knowing f (x) at every real number 0 ≤ x ≤ L. Alterna-
tively, complete information about the function is obtained through know-
ing the coefficients fn for every positive integer n. But there are more real
numbers between 0 and L than there are positive integers! I have no res-
olution for this paradox — I’ll just remark that in knowing the function
through its Fourier coefficients fn , it seems that we’re getting something
for nothing.
6 Joseph Fourier (1768–1830) was French, so his name is pronounced “Four - e - a” with
a silent “r”. He arrived at the series which today bears his name through studies of heat
flow. He was the first to propose the phenomenon that we today call “the greenhouse
effect”. (So much for the climate-change denialist claim that the greenhouse effect is a
modern day liberal/Chinese hoax.)
142 What did we learn?
Well, there are lots of times when we want to get something for nothing!
Fourier sine series are useful in data compression. For example, suppose
you want to record a sound that starts with silence at time 0, proceeds
through several notes, then ends with silence at time L. You could do this
by keeping track of the air pressure f (t) at every instant from 0 to L, or
you could do it by keeping track of the corresponding Fourier coefficients
fn . In either case an infinite amount of data are required, so some will have
to be thrown out to let it fit within a finite computer. It is more efficient to
store this information in the form of fn than in the form of f (t): for a given
amount of storage space, the fn provide a more accurate reproduction of
the sound than the f (t). There are many schemes for the details of exactly
when the Fourier series should be truncated: one such scheme is called
“MP3”.
Or, for pictures rather than sounds: A black-and-white photograph is a
two-dimensional intensity function f (x, y). You could store the image on a
computer by breaking space (x, y) into a grid (“pixels”) and storing a value
for the intensity at each grid point (the so-called bitmap or BMP format)
or you could store the information through Fourier coefficients fn,m (the
so-called JPEG format). For a given level of image quality, the JPEG file
is considerably smaller than the BMP file.
Stationary states. Okay, this is fun and profitable, but it tells us
about how clever humans are; it doesn’t tell us anything about nature. I’m
going to probe in another direction: We see that, as far as time evolution is
concerned, functions like sin(nπx/L) play a special role. What if the initial
wavefunction ψ0 (x) happens to have this form? We investigate n = 3.
Once you see how things work in this case, you can readily generalize to
any positive integer n.
So the initial wavefunction is
ψ0 (x) = A sin(3πx/L).
We need the constant A so that the initial wavefunction will (1) have di-
mensions and (2) be normalized. For all wavefunctions, the probability of
being somewhere is 1, that is
Z +∞
|ψ(x)|2 dx = 1.
−∞
results in
Z L
A2 sin2 (3πx/L) dx = 1,
0
whence (remembering the sine-squared integral 4.20)
p
A2 (L/2) = 1 so A = 2/L.
Notice that then r
2
ψ0 (x) = sin(3πx/L)
L
has the proper dimensions.
Well, for this initial wavefunction, what are the values of
2 L
Z
Dn = ψ0 (x) sin(nπx/L) dx ?
L 0
They are
r Z
2 2 L
Dn = sin(3πx/L) sin(nπx/L) dx
L L 0
r
2 2 L/2 for n = 3
= ×
L L 0 for n 6= 3
r
2 1 for n = 3
= × ,
L 0 for n 6= 3
so r
2 −(i/~)E3 t
ψ(x, t) = e sin(3πx/L). (4.26)
L
That’s it! For this particular initial wavefunction, the system remains al-
ways in that same wavefunction, except multiplied by an time-dependent
phase factor of e−(i/~)E3 t . This uniform phase factor has no effect whatso-
ever on the probability density! Such states are called “stationary states”.
Generic states. Contrast the time evolution of stationary states with
the time evolution of generic states. For example, suppose the initial wave-
function were r r
4 2 3 2
ψ0 (x) = sin(3πx/L) + sin(7πx/L).
5 L 5 L
How does this state change with time? You should check two things: First,
the wavefunction ψ0 (x) given here is normalized. Second, it evolves in time
to r r
4 2 −(i/~)E3 t 3 2 −(i/~)E7 t
ψ(x, t) = e sin(3πx/L) + e sin(7πx/L).
5 L 5 L
(4.27)
144 What did we learn?
Although it takes a little effort to see exactly how the probability density
changes with time, it’s clear from a glance that it does change with time.
This is not a stationary state.
Let’s go back to the Einstein relation
E = ~ω.
Neither Einstein nor de Broglie was ever clear about what it was that was
“oscillating” with frequency ω, but now we have a better idea. In stationary
state (4.26), the amplitude at every point oscillates with frequency E3 /~.
Using the Einstein relation, we say this state has energy E3 .
In contrast, the amplitude in generic state (4.27) has no single oscilla-
tion: there’s a combination of frequency E3 /~ and frequency E7 /~. This
state doesn’t have an energy, in the same way that a silver atom with
µx = +µB doesn’t have a value of µz , in the same way that an atom in
state |z+i passing through a horizontal interferometer doesn’t have a posi-
tion, in the same way that love doesn’t have a color. Instead, this state has
amplitude 54 to have energy E3 and amplitude 35 to have energy E7 .
We have uncovered the “aspect of physical truth” expressed by the
separation constant En .
Energy eigenstates. How did the remarkable stationary states come
about? Remember how they arose mathematically: we looked for solutions
to
~2 d2 Xn (x)
− + V (x)Xn (x) = En Xn (x),
2m dx2
and the solutions we found (for the infinite square well) were those functions
Xn (x) = sin(nπx/L)
that we later used as building blocks to build up any wavefunction. These
now seem important enough that they warrant their own name. Because
each is associated with a particularly energy En we call them “energy
states”. Because wavefunctions are usually represented by Greek letters
we give them the name ηn (x) where the Greek letter η (eta) suggests “en-
ergy” through alliteration. We write
~2 d2 ηn (x)
− + V (x)ηn (x) = En ηn (x), (4.28)
2m dx2
and recognize this as one of those “unfair” problems where you must find not
only the ODE solution ηn (x), but you must also find the value of En . Such
The Quantum Mechanics of Position 145
This particle does not have an energy. In more detail, it has amplitude √12
to have energy E3 and amplitude − √12 to have energy E4 , but it doesn’t
have an energy in the same way that love doesn’t have a color.
Summary. Our journey into quantum mechanics started with the ex-
perimental fact of quantized energies of blackbody radiation or of an atom.
This inspired a search for quantized values of µz , which in turn prompted
discovery of the new phenomena of interference and entanglement. In-
terference experiments suggested the mathematical tool of amplitude, and
generalizing amplitude from magnetic moment to position prompted the
mathematical tool of wavefunction. We asked the obvious question of how
wavefunction changed with time, and answering that question brought us
back to energy quantization with deeper insight. As T.S. Eliot wrote,
Problems
4.6 Revival
In an infinite square well, any wavefunction comes back to itself after
the revival time given in equation (4.25) has passed. What happens
after one-half of this time has passed?
4.7 Normalized for all time
Show that the wavefunction (4.27) is normalized for all values of the
time t.
4.8 Zero-point energy
The lowest possible energy for a classical infinite square well is zero.
The lowest possible energy for a quantal infinite square well is E1 as
given in equation (4.22). This difference is called the “zero-point en-
ergy” or the “vacuum energy”.
Your grandparents are intelligent and thoughtful but have little back-
ground in science. They hold a bank trust fund for your eventual
benefit. They have learned that on 27 May 2008, U.S. Patent 7379286
for “Quantum Vacuum Energy Extraction” was issued to the Jovion
Corporation and they know that, if zero-point energy could be har-
nessed, it would produce enormous societal and financial gains. Your
grandparents are thinking of withdrawing the trust fund money from
the bank and investing it in the Jovion Corporation, but they want
your advice before making the investment. What do you tell them?
(Remember their intelligence: they want not just advice but concise,
cogent reasoning behind that advice.)
4.9 Fourier sine series for tent (recommended problem)
Suppose the initial wavefunction is a pure real tent:
0 x<0
Ax 0 ≤ x ≤ L/2
ψ0 (x) = .
A(L − x) L/2 ≤ x ≤ L
0 L<x
a. Sketch this initial wavefunction.
b. Show that, to insure normalization, we must use
r
2 3
A= .
L L
c. Verify that ψ0 (x) has the proper dimensions.
148 What did we learn?
r ∞
2 3 X (−1)n
Aldo: − sin(nπx/L)
π L3 n=1 n
r ∞
2 3 X (−1)n
Beth: − cos(nπx/L)
π L n=1 n
r
2 3 X (−1)n
Celine: − sin(nπx/L)
π L n=1,3,5,··· n
r ∞
2 3 X (−1)n
Denzel: − sin(nπx/L)
π L n=1 n
spent a decade managing a farm, which made him financially comfortable enough that
he could pursue mathematics research for the rest of his life as a private scholar without
university position.
150 Other potentials
where
Z +∞
cn = ηn∗ (x)ψ0 (x) dx. (4.35)
−∞
We say that the energy states “span the set of wavefunctions” or that
they constitute a “basis”, but don’t let these fancy terms bamboozle you:
they just mean that starting from the energy states, you can use linear
combinations to build up any wavefunction. The basis states in quantum
mechanics play the same role as building blocks in a child’s construction
set. Just as a child can build castles, roadways, or trees — anything she
wants — out of building blocks, so you can build any wavefunction you
want out of energy states.
Superposition is the mathematical reflection of the physical phenomenon
of interference. An example of quantal interference is “the atom passing
through an interferometer doesn’t take either path; instead it has amplitude
ca to take path a and amplitude cb to take path b”. An example of super-
position is “the particle with wavefunction ψ0 (x) given in equation (4.34)
doesn’t have an energy; instead it has amplitude cn to have energy En ,
with n = 1, 2, 3, . . .”. It requires no great sophistication to see that these
are parallel statements.
In the infinite square well, each energy eigenstate has a different en-
ergy. This is not always true: it might happen that two energy eigenvalues,
perhaps E8 and E9 , are equal. In this situation any linear combination
α η8 (x) + β η9 (x) (4.36)
is again an energy eigenstate with energy E8 = E9 . (Although, to insure
normalization, we must select |α|2 +|β|2 = 1.) The two eigenvalues are then
said to be “degenerate”. I don’t know how such a disparaging term9 came
to be attached to such a charming result, but it has been. If two eigenstates
9 According to George F. Simmons, “This terminology follows a time-honored tradition
in mathematics, according to which situations that elude simple analysis are dismissed
by such pejorative terms as ‘improper’, ‘inadmissible’, ‘degenerate’, ‘irregular’, and so
on.” [Differential Equations with Applications and Historical Notes, third edition (CRC
Press, Boca Raton, Florida, 2017) page 220.]
The Quantum Mechanics of Position 151
have the same energy, that energy is called “two-fold degenerate”. If three
have the same energy, it is “three-fold degenerate”. And so forth.
Finally, solving the energy eigenproblem opens the door to solving the
time evolution problem, because the wavefunction ψ0 (x) evolves in time to
∞
X
ψ(x, t) = cn e−(i/~)En t ηn (x). (4.37)
n=1
Because the energy eigenproblem (4.31) tells you the resulting quantized
energy values, and because energy quantization is one of the easiest aspects
of quantum mechanics to access experimentally, some people develop the
mistaken impression that finding energy values is all there is to quantum
mechanics. No. It’s actually just like a classical mechanics problem: “You
stand atop a cliff 97 meters tall and hurl a ball horizontally at 12 m/s.
(a) How far from the base of the cliff does it land? (b) At what speed does
it strike the ground?” Using energy techniques alone you can answer ques-
tion (b). But to answer both questions you need to solve the time evolution
problem. The same holds in quantum mechanics. Some questions can be
answered knowing only the energy eigenvalues, but to answer any question
you must solve the time evolution problem. Often the easiest way to do
this is by first solving the energy eigenproblem (finding both eigenvalues
and eigenfunctions) and then employing the time evolution equation (4.37).
The energy eigenvalues are important — no doubt about that — but they’re
not the full story.
Problem
4.11 Basis with degeneracy (essential problem) n o
You know that if the two orthonormal vectors î, ĵ constitute a basis
for position vectors in two dimensions, then rotating the pair by angle
θ produces two different orthonormal vectors
î0 = cos θ î + sin θ ĵ
0
ĵ = − sin θ î + cos θ ĵ (4.38)
that constitute a basis just as good as the original basis.
Show that if
ηn (x) n = 1, 2, 3, . . .
152 Energy loss
I said earlier [at equations (4.26) and (4.29)] that any energy eigenstate
η6 (x) is a “stationary state”: that if the system started off in state η6 (x),
it would remain in that state forever (with a time-dependent phase factor
in front). This seems to contradict the experimental fact that most of the
atoms we find lying about are in their ground states.10 Why don’t they
just stay in state η6 (x) for ever and ever?
Furthermore: If the system starts off in the state c3 η3 (x)+c7 η7 (x), then
for all time the probability of measuring energy E3 is |c3 |2 , the probability
of measuring energy E7 is |c7 |2 , and the probability of measuring the ground
state energy is zero. Again, how can this conclusion be consistent with the
experimental observation that most atoms are in the ground state?
The answer is that if time evolution were given exactly by equa-
tion (4.12),
~2 ∂ 2 ψ(x, t)
∂ψ(x, t) i
=− − + V (x)ψ(x, t) , (4.40)
∂t ~ 2m ∂x2
then the atom would stay in that stationary state forever. But real atoms
are subject to collisions and radiation meaning that the time-evolution
equation above is not exactly correct. These phenomena, unaccounted for
in the equation above, cause the atom to fall into its ground state.
Because collisions and radiation are small effects, an atom starting off
in state η6 (x) stays in that stationary state for a “long” time — but that
means long relative to typical atomic times, such as the characteristic time
10−17 seconds generated at problem 1.15 on page 41. If you study more
10 The energy eigenstate with lowest energy eigenvalue has a special name: the ground
state.
4.9. Mean values 153
quantum mechanics,11 you will find that a typical atomic excited state
lifetime is 10−9 seconds. So the excited state lifetime is very short by
human standards, but very long by atomic standards. (To say “very long”
is an understatement: it is 100 million times longer; by contrast the Earth
has completed only 66 million orbits since the demise of the dinosaurs.)
b xb = −w/2
a xa = +w/2
Say the interferometer has width w, so that path a has position xa = +w/2
while path b has position xb = −w/2.
You know the drill: the atom has amplitude ca of taking path a, am-
plitude cb of taking path b. If a lamp is turned on while an interference
experiment is proceeding, the probability of the atom appearing in path a
is |ca |2 , the probability of the atom appearing in path b is |cb |2 . In other
words, if the atom’s position is measured while the interference experiment
is proceeding, the result would be +w/2 with probability |ca |2 , and it would
be −w/2 with probability |cb |2 . Hence the mean position measured would
be
+ (w/2)|ca |2 − (w/2)|cb |2 . (4.41)
It seems weird to say “The atom doesn’t have a position but its mean
position is given by equation (4.41)” — sort of like saying “Unicorns don’t
exist but their mean height is 1.3 meters.” Indeed, it would be more accu-
rate to say “The atom doesn’t have a position but if a light were turned on
11 See for example David J. Griffiths and Darrell F. Schroeter, Introduction to Quan-
tum Mechanics, third edition (Cambridge University Press, Cambridge, UK, 2018) sec-
tion 11.3.2, “The Lifetime of an Excited State”.
154 Mean values
Once again, it seems weird to have a formula for the mean energy of
a particle that doesn’t have an energy. The meaning is that if the energy
were measured, hEi is the mean of the energy that would be found.
New expression for mean energy. The above expression for mean
energy is correct but difficult to use. Suppose a particle with wavefunction
ψ(x) is subject to a potential energy function V (x). To find the mean
energy you must: (1) Solve the energy eigenproblem to find the energy
eigenvalues En and eigenfunctions ηn (x). (2) Write the wavefunction ψ(x)
P
in the form ψ(x) = n cn ηn (x). (3) Now that you know the energies En and
the amplitudes (expansion coefficients) cn , execute the sum n |cn |2 En .
P
and
Z +∞
∂ψ(x, t)
ψ ∗ (x, t) dx (4.47)
−∞ ∂t
∞ X ∞ Z +∞
X
∗ i −(i/~)(En −Em )t ∗
= cm cn − En e ηm (x)ηn (x) dx.
m=1 n=1
~ −∞
But (see equation 4.32) the integral on the right is zero unless m = n, in
which case it is 1. Thus
Z +∞ ∞
∗ ∂ψ(x, t) X
∗ i i
ψ (x, t) dx = cn cn − En = − hEi (4.48)
−∞ ∂t n=1
~ ~
or
Z +∞
i ∂ψ(x, t)
− hEi = ψ ∗ (x, t) dx. (4.49)
~ −∞ ∂t
Problems
4.12 Mean position vs. “expected position”
For the infinite square well energy eigenstate η2 (x), what is the mean
position? What is the probability density at that point? Is this mean
position really the “expected position”?
4.13 Wavefunction vs. probability density (recommended problem)
The wavefunction ψ(x) is not directly measurable, but can be inferred
(up to an overall phase) through a number of position and interference
experiments. The probability density |ψ(x)|2 is measurable through
a number of position experiments alone. These facts lead some to
the misconception that the probability density tells the “whole story”
of a quantal state. This problem demonstrates the falsehood of that
misconception by presenting a series of wavefunctions, all with the same
probability density, but each with a different mean energy. (And hence
each with different behavior in the future.) The so-called Gaussian
wavefunctions are
2
/2σ 2 ikx
ψ(x) = Ae−x e ,
where A is a normalization constant.
a. (Mathematical preliminary.) Use integration by parts to show that
Z +∞ Z +∞
2 2
e−t dt = 2 t2 e−t dt,
−∞ −∞
where t is any dimensionless variable.
b. When the particle is free, V (x) = 0, find the mean energy. (In this
case the mean kinetic energy, since there is no potential energy.) If
you use the above result, you will not need to evaluate any integral
nor find the normalization constant.
I told you way back on page 4 that when quantum mechanics is applied
to big things, it gives the results of classical mechanics. It’s hard to see
how my claim could possibly be correct: the whole structure of quantum
mechanics differs so dramatically from the structure of classical mechanics
— the character of a “state”, the focus on potential energy function rather
than on force, the emphasis on energy eigenproblems instead of initial value
problems, the fact that the quantal time evolution equation involves a first
derivative with respect to time while the classical time evolution equation
involves a second derivative with respect to time.
This nut is cracked by focusing, not on the full quantal state ψ(x, t), but
on the mean position
Z +∞
hxi = ψ ∗ (x, t)xψ(x, t) dx, (4.55)
−∞
How does this mean position change with time?
The answer depends on the classical force function F (x) — i.e., the
classical force that would be exerted on a classical particle if it were at
position x. (I’m not saying that the particle is at x, I’m not even saying
that the particle has a position; I’m saying that’s what the force would be
if the particle were classical and at position x.)
The answer is that
d2 hxi
hF (x)i = m , (4.56)
dt2
a formula that certainly plucks our classical heartstrings! This result is
called Ehrenfest’s theorem.12 We will prove this theorem later (in sec-
tion 4.10.4 on page 162), but first discuss its significance.
Although the theorem is true in all cases, it is most useful when the
spread in position ∆x is in some sense small, so the wavefunction is rel-
ativity compact. Such wavefunctions are called “wavepackets”. In this
12 Paul Ehrenfest (1880–1933) contributed to relativity theory, quantum mechanics, and
hF (x)i
F (x)
F (hxi)
|ψ(x)|2
x
hxi
∆x
But if the force function varies rapidly on the scale of ∆x, then our
hopes are dashed: the spread in position is small, but the spread in force
is not, and the classical approximation is not appropriate.
F (x)
hF (x)i
F (hxi)
|ψ(x)|2
x
hxi
∆x
The Quantum Mechanics of Position 159
Because the correct classical limit was essential in producing this theory,
it was easy to fall into the misconception that an electron really did behave
classically, with a single position, but that this single position is disturbed
14 Werner Heisenberg, Physics and Beyond (Harper and Row, New York, 1971) page 37.
15 Physics and Beyond, page 61.
The Quantum Mechanics of Position 161
position. The “de Broglie–Bohm pilot wave” formulation of quantum mechanics can be
interpreted as saying that “measurement disturbs the system”, but the measurement at
one point in space is felt instantly at points arbitrarily far away. When this formulation is
applied to a two-particle system, a “pilot wave” situated in six-dimensional configuration
space somehow physically guides the two particles situated in ordinary three-dimensional
space.
19 Heisenberg himself, writing in German, called it the “Genauigkeit Beziehung” — ac-
curacy relationship. See “Über den anschaulichen Inhalt der quantentheoretischen Kine-
matik und Mechanik” Zeitschrift für Physik 43 (March 1927) 172–198.
162 The classical limit of quantum mechanics
Solution: For those of us who know and love classical mechanics, there’s
a band-aid, the idea that “measurement disturbs the system”. This idea is
that fundamentally classical mechanics actually holds, but that quantum
mechanics is a mask layered over top of, and obscuring the view of, the
classical mechanics because our measuring devices disturb the underlying
classical system. That’s not possible. It is no defect of our measuring
instruments that they cannot determine what does not exist, just as it is
no defect of a colorimeter that it cannot determine the color of love.
This idea that “measurement disturbs the system” is a psychological
trick to comfort us, and at the same time to keep us from exploring, fully
and openly, the strange world of quantum mechanics. I urge you, I implore
you, to discard this security blanket, to go forth and discover the new world
as it really is rather than cling to the familiar classical world. Like Miranda
in Shakespeare’s Tempest, take delight in this “brave new world, that has
such people in’t”.
Unlike most band-aids, this band-aid does not protect or cover up. In-
stead it exposes a lack of imagination.
I’m not going to kid you: this derivation is long, difficult, and, frankly,
unenlightening. It is necessary to show the coherence of the entire quantal
scheme we’ve been building, and if you follow it critically you will learn
some tricks of the trade, but if you decide to skip this section I won’t be
offended.
The Quantum Mechanics of Position 163
It seems odd that the two terms involving potential energy cancel, so no
explicit dependence on V (x) appears in this result, but we’ll just push on.
Can we say anything about integrals such as the second integral in
square brackets above? Surprisingly, the answer is yes. If we define
∂ψ
f (x) = xψ ∗ and g(x) = (4.65)
∂x
then
Z +∞ Z +∞
∂2ψ
xψ ∗ 2 dx = f (x)g 0 (x) dx (4.66)
−∞ ∂x −∞
which suggests integration by parts:
Z +∞ +∞ Z +∞
0
f (x)g (x) dx = f (x)g(x) − f 0 (x)g(x) dx. (4.67)
−∞ −∞ −∞
Now remember that the wavefunction is normalized, so it has to fall to
zero at both infinity and negative infinity. Typically the slope ∂ψ/∂x also
falls to zero at both infinity and negative infinity, and does so very rapidly
— much more rapidly than linearly. (There are exceptions to these typical
behaviors, such as scattering wavefunctions, and in these atypical cases this
argument has to be rethought.) The upshot is that in typical situations
+∞
f (x)g(x) =0 (4.68)
−∞
so
+∞ Z +∞
∂2ψ ∂(xψ ∗ ) ∂ψ
Z
xψ ∗2
dx = − dx. (4.69)
−∞ ∂x −∞ ∂x ∂x
We’ll use this trick several times. . . I’ll just call it the “integration-by-parts
trick”.
Applying this trick to both integrals of equation (4.64) gives
Z +∞ Z +∞
∂(xψ) ∂ψ ∗ ∂(xψ ∗ ) ∂ψ
dhxi ~
= −i − dx + dx
dt 2m −∞ ∂x ∂x −∞ ∂x ∂x
Z +∞ ∗ Z +∞
~ ∂ψ ∂ψ ∂ψ ∗
= −i − x dx − ψ dx
2m −∞ ∂x ∂x −∞ ∂x
Z +∞ Z +∞
∂ψ ∗ ∂ψ
∗ ∂ψ
+ x dx + ψ dx
−∞ ∂x ∂x −∞ ∂x
Z +∞ Z +∞
∂ψ ∗
~ ∂ψ
= −i − ψ dx + ψ∗ dx
2m −∞ ∂x −∞ ∂x
Z +∞
~ ∂ψ
= =m ψ∗ dx . (4.70)
m −∞ ∂x
The Quantum Mechanics of Position 165
Notice that dhxi/dt is pure real, as it must be. And notice that the di-
mensions are the same on both sides. (This isn’t proof that we’ve made no
algebra errors, but if our expression for dhxi/dt had been complex, or if it
had been dimensionally incorrect, then that would have been proof that we
had made algebra errors.)
All this is fine and good, but it takes us only part way to our goal.
This is clearly not a classical equation. . . it contains ~ right there! Since the
classical F = ma involves the second derivative of position with respect to
time, we take one more time derivative of hxi, finding
Z +∞ Z +∞
d2 hxi ∂ψ ∗ ∂ψ
~ ∗ ∂ ∂ψ
= =m dx + ψ dx . (4.71)
dt2 m −∞ ∂t ∂x −∞ ∂x ∂t
The second-order derivative on the right looks particularly grotesque, so we
use the integration-by-parts trick to get rid of it:
Z +∞ Z +∞
d2 hxi ∂ψ ∗ ∂ψ ∂ψ ∗ ∂ψ
~
= =m dx − dx
dt2 m −∞ ∂t ∂x −∞ ∂x ∂t
Z +∞ ∗
2~ ∂ψ ∂ψ
= − =m dx . (4.72)
m −∞ ∂x ∂t
Now use the Schrödinger equation:
Z +∞
d2 hxi ∂ψ ∗ ~2 ∂ 2 ψ
2~ i
= − =m − − +Vψ dx
dt2 m −∞ ∂x ~ 2m ∂x2
Z +∞
∂ψ ∗ ~2 ∂ 2 ψ
2
= <e − + V ψ dx . (4.73)
m −∞ ∂x 2m ∂x2
Look at that. . . two of the ~s have canceled out! We’re not home yet because
there’s still an ~ within the square brackets, but we’re certainly making
progress. We have that
Z +∞ Z +∞
d2 hxi ~2 ∂ψ ∗ ∂ 2 ψ ∂ψ ∗
2
= <e − dx + V ψ dx , (4.74)
dt2 m 2m −∞ ∂x ∂x2 −∞ ∂x
but let’s apply the integration-by-parts trick to the first integral:
Z +∞ Z +∞ 2 ∗
∂ψ ∗ ∂ 2 ψ ∂ ψ ∂ψ
2
dx = − dx. (4.75)
−∞ ∂x ∂x −∞ ∂x2 ∂x
Think about this for a minute: if the integral on the left is z, this equation
says that z = −z ∗ , whence z is pure imaginary or <e{z} = 0. Thus
Z +∞
d2 hxi ∂ψ ∗
2
= <e V ψ dx , (4.76)
dt2 m −∞ ∂x
166 Transitions induced by light
This section is more intricate than other sections of this book, and it takes
many steps to reach its conclusion. Furthermore, it is not needed as back-
ground for any following section, so you might want to skip over it. But the
steps are valuable and the conclusion itself is one of the most fascinating
and useful relations in all of physics.
The problem. An electron in the ground state of a symmetric potential
well is exposed to light. What is the probability that it transitions through
light absorption to a particular excited state?
The Quantum Mechanics of Position 167
transition probability
6
- t
0
0
Setup. Call the ground state wavefunction ηg (x) with energy Eg and
the excited state wavefunction ηe (x) with energy Ee . In light of the Einstein
relation (1.21) we define the frequency characteristic of this transition
ω0 = (Ee − Eg )/~. (4.81)
For most symmetric potential wells the mean position for any energy eigen-
state vanishes,
hxig = 0 and hxie = 0, (4.82)
and we will assume this. (If this assumption is wrong, the derivation needs
to be rethought.)
When no light shines, the Schrödinger time evolution equation is
~2 ∂ 2 ψ(x, t)
∂ψ(x, t) i
=− − + Vwell (x)ψ(x, t)
∂t ~ 2m ∂x2
i
≡ − [Hwell ψ(x, t)] . (4.83)
~
In this section we abbreviate the term in square brackets as “Hwell ψ(x, t)”.
For example, when ψ(x, t) = ηg (x) we have
Hwell ηg (x) = Eg ηg (x), (4.84)
168 Transitions induced by light
When light does shine, the electron is subject not only to the well’s
potential energy function, but also to the potential energy function due to
the light. If the electric field at the center of the well is E0 cos(ωt), then
that additional potential energy function is
eE0 cos(ωt) x, (4.88)
where the charge on an electron is −e. In this circumstance the wavefunc-
tion no longer evolves like (4.87), but instead like
ψ(t) = cg (t)e−(i/~)Eg t ηg (x) + ce (t)e−(i/~)Ee t ηe (x). (4.89)
Our job is to find the probability of starting in the ground state and
ending in the excited state. That is, assuming cg (0) = 1 and ce (0) = 0, we
need to find ce (t). The transition probability is then |ce (t)|2 .
Time evolution when the light shines. When light shines, the
potential energy function changes from that of the well alone, Vwell (x),
to Vwell (x) + eE0 cos(ωt) x. Thus the Schrödinger time evolution equation
changes from equation (4.83) to
d i
ψ(t) = − [Hwell ψ(t) + eE0 cos(ωt) xψ(t)] . (4.90)
dt ~
First, look at the left-hand side:
d d h i
ψ(t) = cg (t)e−(i/~)Eg t ηg (x) + ce (t)e−(i/~)Ee t ηe (x) (4.91)
dt hdt i
= ċg (t)e−(i/~)Eg t ηg (x) + ċe (t)e−(i/~)Ee t ηe (x)
ih i
− Eg cg (t)e−(i/~)Eg t ηg (x) + Ee ce (t)e−(i/~)Ee t ηe (x) .
~
Meanwhile, on the right-hand side
Hwell ψ(t) = cg (t)e−(i/~)Eg t Hwell ηg (x) + ce (t)e−(i/~)Ee t Hwell ηe (x)
= cg (t)e−(i/~)Eg t Eg ηg (x) + ce (t)e−(i/~)Ee t Ee ηe (x). (4.92)
The Quantum Mechanics of Position 169
Multiply the above equation by ηe∗ (x) and integrate over all values of x.
Because of orthonormality
Z +∞ Z +∞
ηe∗ (x)ηg (x) dx = 0 and ηe∗ (x)ηe (x) dx = 1, (4.94)
−∞ −∞
Hence we find
Z +∞
i
ċe (t)e−(i/~)Ee t = − eE0 cos(ωt)cg (t)e−(i/~)Eg t ηe∗ (x) x ηg (x) dx.
~ −∞
(4.96)
The integral on the right is just a (complex) number — not a function
of x, not a function of t — and we’ll call that number he|x|gi. Recalling
definition (4.81), we write
i
ċe (t) = − eE0 he|x|gi cos(ωt)e+iω0 t cg (t). (4.97)
~
and theoretical physics. He directed the building of the first nuclear reactor and produced
the first theory of the weak nuclear interaction. The Fermi surface in the physics of metals
was named in his honor. He elucidated the statistics of what are now called fermions in
1926. He produced so many thoughtful conceptual and estimation problems that such
problems are today called “Fermi problems”. I never met him (he died before I was
born) but I have met several of his students, and each of them speaks of him in that
rare tone reserved for someone who is not just a great scientist and a great teacher and
a great leader, but also a great human being.
The Quantum Mechanics of Position 171
is highly accurate.
Furthermore,
1 ei(ω0 −ω)t − 1 1 [ei(ω0 −ω)t/2 − e−i(ω0 −ω)t/2 ]ei(ω0 −ω)t/2
= (4.102)
2i ω0 − ω 2i ω0 − ω
which seems like a step backwards, until you remember that eiθ − e−iθ =
2i sin θ, so
1 ei(ω0 −ω)t − 1 sin[(ω0 − ω)t/2] i(ω0 −ω)t/2
= e . (4.103)
2i ω0 − ω ω0 − ω
So, at this excellent level of approximation,
i sin[(ω0 − ω)t/2] i(ω0 −ω)t/2
ce (t) = − eE0 he|x|gi e (4.104)
~ ω0 − ω
and the transition probability is
e2 E02 |he|x|gi|2 sin2 [(ω0 − ω)t/2]
|ce (t)|2 = . (4.105)
~2 (ω0 − ω)2
Given all the assumptions and approximations we introduced to derive this
result, you might think it’s an obscure equation of limited applicability.
You’d be wrong. It is used so often that Fermi called it the “golden rule”.
Reflection. The transition probability result, graphed below as a func-
tion of time, shows oscillatory behavior called “Rabi21 flopping”. This is
the beat at the heart of an atomic clock.
transition probability
6
e2 E02 |he|x|gi|2
~2 (ω0 − ω)2
0 - t
0 2π 4π 6π
|ω0 − ω| |ω0 − ω| |ω0 − ω|
21 Isidor Isaac Rabi (1898–1988), Polish-Jewish-American physicist. His fascinating life
I have made bad guesses in my life, but none worse than the difference
between my expectation graphed on page 167 and the real behavior graphed
above. It’s as if, while hammering a nail into a board, the first few strikes
drive the nail deeper and deeper into the board, but additional strikes make
the nail come out of the board. And one strike (at time 2π/|ω0 − ω|) makes
the nail pop out of the board altogether! Is there any way to account for
this bizarre result other than shrugging that “It comes out of the math”?
There is. This is a form of interference22 where the particle moves not
from point to point through two possible slits, but from energy state to
energy state with two possible intermediate states. The initial state is the
ground state and the final state is the ground state. The two possible inter-
mediates are the excited state and the ground state. There is an amplitude
to go from ground state to ground state via the excited state, and an am-
plitude to go from ground state to ground state via the ground state. At
time π/|ω0 − ω| those two amplitudes interfere destructively so there is a
small probability of ending up in the ground state and hence a large prob-
ability of ending up in the excited state. At time 2π/|ω0 − ω| those two
amplitudes interfere constructively so there is a large probability of ending
up in the ground state and hence a small probability of ending up in the
excited state.
Problem
4.15 Explore some more
There’s a lot more to say to flesh out the story told by equation (4.105),
but I’ll restrict myself to one question: The denominator vanishes when
ω = ω0 , so you might think that the probability goes to infinity there.
Bad idea. Show that the probability is instead
e2 E02 |he|x|gi|2 t2
.
~2 4
Quantum Mechanics and Path Integrals, emended edition (Dover Publications, Mineola,
NY, 2010).
The Quantum Mechanics of Position 173
Epilogue
Problems
4.16 Spin- 12 electron in a potential well, I (essential problem)
All electrons are spin- 21 particles. Using the χ+ , χ− notation, write
down the wavefunction for an electron ambivating in a potential well
with energy eigenfunctions ηn (x):
a. with amplitude 45 of being in the spatial ground state (n = 1) with
spin up and amplitude 35 of being in the spatial n = 3 state with
spin down;
b. with amplitude 54 of being in the spatial ground state (n = 1) with
spin up and amplitude 35 of being in the spatial n = 3 state with
negative spin projection on the x axis (see equation 3.19).
4.17 Spin- 21 electron in a potential well, II
An electron ambivates in a potential well with with energy eigenvalues
En and energy eigenfunctions ηn (x). The electron’s wavefunction is
3 χ
5 η3 (x) + + 45 η4 (x)χ− .
a. What is the mean energy?
b. If the vertical spin projection is measured, what is the probability
of finding +? (That is, of finding ms = + 21 .)
c. The vertical spin projection is measured and found to be +. Now
what is the mean energy?
4.18 Questions (recommended problem)
Update your list of quantum mechanics questions that you started at
problem 1.17 on page 46. Write down new questions and, if you have un-
covered answers to any of your old questions, write them down briefly.
[[For example, one of my questions would be: “What are the detailed
mechanisms for the energy loss outlined in section 4.8?”]]
Chapter 5
177
178 Sketching energy eigenfunctions
V (x)
Kc (x)
E
the slope becomes zero and then negative, but the curvature continues
negative. Because of the negative slope, η(x) eventually plunges through
η(x) = 0 (where its curvature is zero) and into regions where η(x) is negative
and hence the curvature is positive. The process repeats to produce the
following graph:
strong
negative
curvature
η(x) etc.
weak
negative zero
curvature curvature x
weak
positive
curvature
strong
positive
curvature
strong
positive
curvature
η(x)
weak
positive
zero
curvature curvature x
Suppose the wavefunction starts out on the left small and just above the
axis. The region is strongly prohibited, that is Kc (x) is strongly negative,
so η(x) curves strongly away from the axis. Then (at the dashed vertical
line) the solution moves into a classically allowed region. But Kc (x) is only
weakly positive, so η(x) curves only weakly toward the axis. By the time
the solution gets to the right-hand classically prohibited region at the next
dashed vertical line, η(x) has only a weakly negative slope. In the prohib-
ited region the slope increases as η(x) curves strongly away from the axis
and rockets off to infinity.
curve strongly
away from
axis
curve weakly
toward
curve strongly axis
away from
axis x
You should check that the curvatures and tangents of this energy eigen-
function strictly obey the rules set down at (5.4) and (5.6). What happens
when η(x) crosses a dashed vertical line, the boundary between a classically
prohibited and a classically allowed region?
If you have studied differential equations you know that for any value
of E, equation (5.2) has two linearly independent solutions. We’ve just
sketched one of them. The other is the mirror image of it: small to the
right and rocketing to infinity toward the left. Because of the “rocketing off
to infinity” neither solution is normalizable. So these two solutions don’t
correspond to any physical energy eigenstate. To find such a solution we
have to try a different energy.
184 Sketching energy eigenfunctions
So we try an energy slightly higher. Now the region on the left is not so
strongly prohibited as it was before, so η(x) curves away from the axis less
dramatically. Then when it reaches the classically allowed region it curves
more sharply toward the axis, so that it’s strongly sloping downward when
it reaches the right-hand prohibited region. But not strongly enough: it
curves away from the axis and again rockets off to infinity — although this
time not so dramatically.
Once again we find a solution (and its mirror image is also a solution), but
it’s a non-physical, unnormalizable solution.
As we try energies higher and higher, the “rocketing to infinity” happens
further and further to the right, until at one special energy it doesn’t happen
at all. Now the wavefunction is normalizable, and now we have found an
energy eigenfunction.
Solving the Energy Eigenproblem 185
weakly prohibited
curve weakly away from axis
strongly prohibited
curve strongly away from axis x
In some way it makes sense that the wavefunction tail should be longer
where the classical prohibition is milder.
188 Sketching energy eigenfunctions
Within the deep left side of the well, Kc is relatively high, so the tendency
for η to curve toward the axis is strong; within the shallow right side Kc is
relatively low, so the tendency to curve toward the axis is weak. Thus within
the deep side of the well, η(x) snaps back toward the axis, taking the curves
like an expertly driven sports car; within the shallow side η(x) leisurely
curves back toward the axis, curving like a student driver in a station
wagon. Within the deep side, wavelength will be short and amplitude will
be small; within the shallow side, wavelength will be longer and amplitude
will be large (or at least the same size). One finds smaller amplitude at
the deeper side of the well, and hence, all other things being equal, smaller
probability for the particle to be in the deep side of the well.
Solving the Energy Eigenproblem 189
Similar results hold for three-level square wells, for four-level square
wells, and so forth. And because any potential energy function can be
approximated by a series of steps, similar results hold for any potential
energy function.
Number of nodes. For the infinite square well, the energy eigen-
function ηn (x) has n − 1 interior nodes. The following argument1 shows
that same holds for any one-dimensional potential energy function V (x).
Imagine a modified potential
∞ x ≤ −a
Va (x) = V (x) −a < x < +a .
∞ +a ≤ x
When a is very small this is virtually an infinite square well, whose en-
ergy eigenfunctions we know. As a grows larger and larger, this potential
becomes more and more like the potential of interest V (x). During this
expansion, can an extra node pop into an energy eigenfunction? If it does,
then at the point xp where it pops in the wavefunction vanishes, η(xp ) = 0,
and its slope vanishes, η 0 (xp ) = 0. But the energy eigenproblem is a second-
order ordinary differential equation: the only solution with η(xp ) = 0 and
η 0 (xp ) = 0 is η(x) = 0 everywhere. This is not an eigenfunction. This can
never happen.
284–285.
Solving the Energy Eigenproblem 191
Summary
Problems
5.1 Would you buy a used eigenfunction from this man?
(recommended problem)
The four drawings below and on the next pages show four one-
dimensional potential energy functions V (x) (in olive green) along with
candidate energy eigenfunctions η(x) (in red) that purport to associate
with those potential energy functions. There is something wrong with
every candidate. Using the letter codes below, identify all eigenfunc-
tion errors, and sketch a qualitatively correct eigenfunction for each
potential.
The energy eigenfunction is drawn incorrectly because:
A. Wrong curvature. (It curves toward the axis in a classi-
cally prohibited region or away from the axis in a classi-
cally allowed region.)
B. Its wavy part has the wrong number of nodes.
C. The amplitude of the wavy part varies incorrectly.
D. The wavelength of the wavy part varies incorrectly.
E. One or more of the declining tails has the wrong length.
a.
E3
η3 (x)
x
Solving the Energy Eigenproblem 193
b.
E4
η4 (x)
x
c.
E5
η5 (x)
x
194 Sketching energy eigenfunctions
d.
E6
η6 (x)
x
d. If ηm (x) does not have a zero within x1 < x < x2 , then argue that
we can select ηm (x) always positive on the same interval, including
the endpoints.
The assumption that “ηm (x) does not have a zero” hence implies that
the left-hand side of (5.10) is strictly negative, while the right-hand
side is strictly positive. This assumption, therefore, must be false.
5.7 Parity
a. Think of an arbitrary potential energy function V (x). Now think
of its mirror image potential energy function U (x) = V (−x) Show
that if η(x) is an eigenfunction of V (x) with energy E, then σ(x) =
η(−x) is an eigenfunction of U (x) with the same energy.
b. If V (x) is symmetric under reflection about the origin, that is
U (x) = V (x), you might think that σ(x) = η(x). But no! This
identification ignores global phase freedom (pages 107 and 124).
Show that in fact σ(x) = rη(x) where the “overall phase factor”
r is a complex number with magnitude 1.
Solving the Energy Eigenproblem 197
V (x) U (y)
x y
The scaled problem has many advantages. Instead of five there are only
three parameters: Ẽ, V˜1 , and V˜2 . And those parameters have nicely sized
values like 1 or 0.5 or 6. But it has the disadvantage that you have to write
down all those tildes. Because no one likes to write down tildes, we just
drop them, writing the problem as
d2 η(x)
= −2[E − V (x)]η(x) (5.15)
dx2
where
V1 x<0
V (x) = 0 0<x<1 (5.16)
V2 1<x
and saying that these equations are written down “using scaled quantities”.
When you compare these equations with equations (5.11) and (5.12),
you see that we would get the same result if we had simply said “let ~ =
m = L = 1”. This phrase as stated is of course absurd: ~ is not equal to
1; ~, m, and L do have dimensions. But some people don’t like to explain
what they’re doing so they do say this as shorthand. Whenever you hear
this phrase, remember that it covers up a more elaborate — and more
interesting — truth.
200 Scaled quantities
Show that there is only one way to combine the quantities L, m, and ~ to
form a quantity with the dimensions of energy, and find an expression for
this so-called characteristic energy Ec .
Solution:
quantity dimensions
L [length]
m [mass]
2
~ [mass] × [length] /[time]
2 2
Ec [mass] × [length] /[time]
quantity dimensions
L [length]
m [mass]
2 4 2
~2 [mass] × [length] /[time]
2 2
Ec [mass] × [length] /[time]
But ~2 has too many factors of [mass] and [length] to make an energy.
There is only one way to get rid of them: to divide by m once and by L
twice.
quantity dimensions
2 2
~2 /mL2 [mass] × [length] /[time]
2 2
Ec [mass] × [length] /[time]
Problems
5.9 Characteristic time
Find the characteristic time for the square well problem by combining
the parameters L, m, and ~ to form a quantity with the dimensions
of time. Compare this characteristic time to the infinite square well
revival time found at equation (4.25).
5.10 Scaling for the simple harmonic oscillator
(recommended problem)
Execute the scaling strategy for the simple harmonic oscillator poten-
tial energy function V (x) = 12 kx2 . What is the characteristic length in
terms of k, ~, and m? What is the resulting scaled energy eigenprob-
lem? If you didn’t like to explain what you were doing, how would you
use shorthand to describe the result of this scaling strategy?
Now that the quantities are scaled, we return to our task of writing a
computer program to solve, numerically, the energy eigenproblem. In order
to fit the potential energy function V (x) and the energy eigenfunction η(x)
into a finite computer, we must of course approximate those continuum
functions through their values on a finite grid. The grid points are separated
by a small quantity ∆. It is straightforward to replace the function V (x)
with grid values Vi and the function η(x) with grid values ηi . But what
should we do with the second derivative d2 η/dx2 ?
202 Numerical solution of the energy eigenproblem
- x
i−1 i i+1
ηi+1 − ηi
ηi − ηi−1 ""a
6aa ∆
∆ "" aa
a
" 6
"
6
- x
u u
i−1 i i+1
so at point i we approximate
d2 η ηi+1 − 2ηi + ηi−1
2
≈ . (5.17)
dx ∆2
Problems
5.11 Program
a. Implement the shooting algorithm using a computer spreadsheet,
your favorite programming language, or in any other way. You
will have to select reasonable values for ∆ and η2 .
b. Check your implementation by solving the energy eigenproblem
for a free particle and for an infinite square well.
c. Find the three lowest-energy eigenvalues for a square well with
V1 = V2 = 30. Do the corresponding eigenfunctions have the
qualitative character you expect?
d. Repeat for a square well with V1 = 50 and V2 = 30.
204 Numerical solution of the energy eigenproblem
Identical Particles
x
5 8
And here is the situation when one particle is found in bin 8, the other in
bin 5:
x
5 8
205
206 Two or three identical particles
amplitudes ψi,j that describes the state of the system. Then the set of
amplitudes φi,j = ψj,i describes that state just as well as the original set
ψi,j . Does this mean that φi,j = ψi,j ? Not at all. Remember global phase
freedom (pages 107 and 124): If every bin amplitude is multiplied by the
same “overall phase factor” — a complex number with magnitude unity
— then the resulting set of amplitudes describes the state just as well as
the original set did. Calling that overall phase factor s, we conclude that
φi,j = sψi,j .
But, because φi,j = ψj,i , the original set of amplitudes must satisfy
ψj,i = sψi,j . The variable name s comes from “swap”: when we swap
subscripts, we introduce a factor of s. The quantity s is a number. . . not
a function of i or j. For example, the same value of s must work for
ψ8,5 = sψ5,8 , for ψ7,3 = sψ3,7 , for ψ5,8 = sψ8,5 , . . . . Wait. What was that
last one? Put together the first and last examples:
ψ8,5 = sψ5,8 = s(sψ8,5 ) = s2 ψ8,5 .
Clearly, s2 = 1, so s can’t be any old complex number with magnitude
unity: it can be only s = +1 or s = −1.
Execute the now-familiar program of turning bin amplitudes into am-
plitude density, that is wavefunction, to find that
ψ(xA , xB ) = +ψ(xB , xA ) or ψ(xA , xB ) = −ψ(xB , xA ). (6.1)
The first kind of wavefunction is called “symmetric under coordinate swap-
ping”, the second is called “antisymmetric under coordinate swapping”.
This requirement for symmetry or antisymmetry under coordinate swap-
ping is called the Pauli1 principle.
It might distress you to see variables like xA : doesn’t xA mean the
position of particle “A” while xB means the position of particle “B”? So
doesn’t this terminology label the particles as “A” and “B”, which would
1 Wolfgang Pauli (1900–1958), Vienna-born Swiss physicist, was one of the founders of
violate our initial requirement that the particles be identical? The answer
is that this terminology does not label one particle “A” and the other
particle “B”. Instead, it labels one point “A” and the other point “B”.
Look back to the figures on page 205: the numbers 5 and 8 label bins,
not particles, so when these bins shrink to zero the variables xA and xB
apply to points, not particles. That’s why I like to call these wavefunctions
“(anti)symmetric under swap of coordinates”. But you’ll hear people using
terms like “(anti)symmetric under particle swapping” or “. . . under particle
interchange” or “. . . under particle exchange”.
What if the two particles are in three-dimensional space, and what if
they have spin? In that case, the swap applies to all the coordinates: using
the sans serif notation of equation (4.106),
ψ(xA , xB ) = +ψ(xB , xA ) or ψ(xA , xB ) = −ψ(xB , xA ). (6.2)
from chemistry to school administration, but his signal contribution was elucidating the
statistics of photons. Remarkably, he made this discovery in 1922, three years before
Schrödinger developed the concept of wavefunction.
3 Named for Enrico Fermi. See footnote on page 170.
208 Symmetrization and antisymmetrization
gorithms, Part 1” (Addison-Wesley, Boston, 1997) section 7.2.1.2, “Generating all per-
mutations”.
Identical Particles 209
tions to our understanding of atoms, molecules, and solids. Also important as a teacher,
textbook author, and administrator.
6 I am not alone. See Sheldon Axler, “Down with determinants!” American Mathemat-
Problems
6.1 Product functions
Can a product function F (xA )G(xB ) ever be symmetric under swap-
ping? Antisymmetric?
6.2 Special case
Implement the symmetrization and antisymmetrization proce-
dures (6.7) and (6.8) for the garden-variety function f (xA , xB , xC ) =
xA x2B x3C . Evaluate the resulting functions s(xA , xB , xC ) and
a(xA , xB , xC ) first at xA = 1, xB = 2, and xC = 3, then at xA = 3,
xB = 2, and xC = 1. [[Results: s(1, 2, 3) = s(3, 2, 1) = 288,
a(1, 2, 3) = 12, a(3, 2, 1) = −12.]]
6.3 Antisymmetrizing the symmetric
a. There is one function that is both completely symmetric and com-
pletely antisymmetric. What is it?
b. Suppose the seed function is symmetric under a swap of the first
two coordinates
f (xA , xB , xC ) = f (xB , xA , xC )
and the antisymmetrization process (6.8) is executed. What is the
result?
c. Repeat part (b) for a seed function symmetric under a swap of the
last two coordinates.
d. Repeat part (b) for a seed function symmetric under a swap of the
first and third coordinates.
e. Suppose the seed function is a product as in equation (6.9), and
two of the functions happen to be equal. What is the result of the
antisymmetrization process?
214 Consequences of the Pauli principle
xB
xA
xA
xB xB
xA xA
xB xB
xA xA
This rule is not a theorem and you can find counterexamples,7 but such
exceptions are rare.
In everyday experience, when two people tend to huddle together or
spread apart, it’s for emotional reasons. In everyday experience, when
two particles tend to huddle together or spread apart, it’s because they’re
attracted to or repelled from each other through a force. This quantal
case is vastly different. The huddling or spreading is of course not caused
by emotions and it’s also not caused by a force — it occurs for identical
particles even when they don’t interact. The cause is instead the symme-
try/antisymmetry requirement: not a force like a hammer blow, but a piece
of mathematics!
Therefore it’s difficult to come up with terms for the behavior of identical
particles that don’t suggest either emotions or forces ascribed to particles:
congregate, avoid; gregarious, loner; attract, repel; flock, scatter. “Huddle
together” and “spread apart” are the best terms I’ve been able to devise,
but you might be able to find better ones.
Problem
6.4 Symmetric and antisymmetric combinations
Two identical particles ambivate in a one-dimensional infinite square
well. Take as a seed function the product of energy eigenstates
η2 (xA )η3 (xB ). Use your favorite graphics package to plot the proba-
bility densities associated with the symmetric and antisymmetric com-
binations generated from this seed. Does the “huddle together/spread
apart” rule hold?
7 See D.F. Styer, “On the separation of identical particles in quantum mechanics” Eu-
ψA (xA )ψB (xB ), the definition of entangled state on page 129 suggests that these states
should be called entangled. However the correct definition of entanglement for identical
particles remains unsettled, so I use the term “non-product state” instead.
6.6. Spin plus space, two electrons 219
Problem
6.5 Building three-particle basis states
Suppose you had three particles and three “building block” levels (say
the orthonormal levels η1 (x), η3 (x), and η7 (x)). Construct normalized
three-particle basis states for the case of
a. three non-identical particles
b. three identical bosons
c. three identical fermions
How many states are there in each basis? Repeat for three particles
with four one-particle levels, but in this case simply count and don’t
write down all the three-particle states.
rather than “level”. This term unfortunately suggests a circular Bohr orbit. An electron
with an energy does not execute a circular Bohr orbit at constant speed. Instead it
ambivates without position or velocity.
220 Spin plus space, two electrons
ηn (~x)χ+ and ηn (~x)χ− . Now the question: What are the energy eigenstates
for the two noninteracting electrons?
Well, what two-particle states can we build from the one-particle spatial
levels with, say, n = 1 and n = 3? (Once you see how to do it for n = 1 and
n = 3, you can readily generalize to any two values of n.) These correspond
to four levels:
η1 (~x)χ+ , (6.18)
η1 (~x)χ− , (6.19)
η3 (~x)χ+ , (6.20)
η3 (~x)χ− . (6.21)
What states mixing n = 1 with n = 3 can be built from these four levels?
The antisymmetric combination of (6.18) with itself vanishes. The an-
tisymmetric combination of (6.18) with (6.19) is a combination of n = 1
with n = 1, not of n = 1 with n = 3. The (unnormalzed) antisymmetric
combination of (6.18) with (6.20) is
η1 (~xA )χ+ (A)η3 (~xB )χ+ (B) − η3 (~xA )χ+ (A)η1 (~xB )χ+ (B). (6.22)
The antisymmetric combination of (6.18) with (6.21) is
η1 (~xA )χ+ (A)η3 (~xB )χ− (B) − η3 (~xA )χ− (A)η1 (~xB )χ+ (B). (6.23)
The antisymmetric combination of (6.19) with (6.20) is
η1 (~xA )χ− (A)η3 (~xB )χ+ (B) − η3 (~xA )χ+ (A)η1 (~xB )χ− (B). (6.24)
The antisymmetric combination of (6.19) with (6.21) is
η1 (~xA )χ− (A)η3 (~xB )χ− (B) − η3 (~xA )χ− (A)η1 (~xB )χ− (B). (6.25)
Finally, the antisymmetric combination of (6.20) with (6.21) is a combina-
tion of n = 3 with n = 3, not of n = 1 with n = 3.
All four of these states are energy eigenstates with energy E1 + E3 .
State (6.22) factorizes into a convenient space-times-spin form:
η1 (~xA )χ+ (A)η3 (~xB )χ+ (B) − η3 (~xA )χ+ (A)η1 (~xB )χ+ (B)
= η1 (~xA )η3 (~xB ) − η3 (~xA )η1 (~xB ) χ+ (A)χ+ (B). (6.26)
Before proceeding I confess that I’m sick and tired of writing all these
ηs and χs and As and Bs that convey no information. I always write the
η in front of the χ. I always write the As in front of the Bs. You’ll never
confuse an η with a χ, because the ηs are labeled 1, 3 while the χs are
labeled +, −. Dirac introduced a notation (see page 90) that takes all this
for granted, so that neither you nor I have to write the same thing out over
and over again. This notation usually replaces + with ↑ and − with ↓ (see
page 112). In this notation, equation (6.26) is written
|1 ↑, 3 ↑i − |3 ↑, 1 ↑i = |1, 3i − |3, 1i | ↑↑ i. (6.27)
In this new notation the states (6.22) through (6.25) are written
|1, 3i − |3, 1i | ↑↑ i (6.28)
|1 ↑, 3 ↓i − |3 ↓, 1 ↑i (6.29)
|1 ↓, 3 ↑i − |3 ↑, 1 ↓i (6.30)
|1, 3i − |3, 1i | ↓↓ i. (6.31)
Well, this is cute. Two of the four states have this convenient space-times-
spin form. . . and furthermore these two have the same spatial wavefunction!
Two other states, however, don’t have this convenient form.
One thing to do about this is nothing. There’s no requirement that
states have a space-times-spin form. But in this two-electron case there’s a
slick trick that enables us to put the states into space-times-spin form.
Because all four states (6.28) through (6.31) have the same energy,
namely E1 + E3 , I can make linear combinations of the states to form other
equally good energy states. Can I make a combination of states (6.29) and
(6.30) that does factorize into space times spin? Nothing ventured, nothing
gained. Let’s try it:
α |1 ↑, 3 ↓i − |3 ↓, 1 ↑i + β |1 ↓, 3 ↑i − |3 ↑, 1 ↓i
= |1, 3i α| ↑↓ i + β| ↓↑ i − |3, 1i α| ↓↑ i + β| ↑↓ i .
This will factorize only if the left term in square brackets is proportional
to the right term in square brackets:
α| ↑↓ i + β| ↓↑ i = c β| ↑↓ i + α| ↓↑ i ,
222 Spin plus space, two electrons
that is only if
α = cβ and β = cα.
Combining these two equations results in c = ±1. If c = +1 then the
combination results in the state
|1, 3i − |3, 1i α | ↑↓ i + | ↓↑ i , (6.32)
Putting all this together and, for the sake of good form, insuring normal-
ized states, we find that the two-electron energy states in equations (6.28)
through (6.31) can be recast as
√1 (|1, 3i − |3, 1i) | ↑↑ i (6.34)
2
√1 (|1, 3i − |3, 1i) √1 (| ↑↓ i + | ↓↑ i) (6.35)
2 2
√1 (|1, 3i − |3, 1i) | ↓↓ i (6.36)
2
√1 (|1, 3i + |3, 1i) √1 (| ↑↓ i − | ↓↑ i) . (6.37)
2 2
This abstract machinery might seem purely formal, but in fact it has
tangible experimental consequences. In the sample problem below, the
machinery suggests that the ground state of the hydrogen atom is two-fold
degenerate, while the ground state of the helium atom is non-degenerate.
And this prediction is borne out by experiment!
The first two states listed are both ground states, so the ground state is
two-fold degenerate.
(b) For the two electrons, we build states from levels just as we did
in this section. The first line below is the antisymmetrized combination of
η1 (~x)χ+ with η1 (~x)χ− . This state has energy 2E1 . The next four lines are
built up exactly as equations (6.34) through (6.37) were. Each of these four
224 Spin plus space, two electrons
states has energy E1 +E2 . The last line is the antisymmetrized combination
of η2 (~x)χ+ with η2 (~x)χ− . This state has energy 2E2 .
η1 (~xA )η1 (~xB ) √1 [χ+ (A)χ− (B) − χ− (A)χ+ (B)]
2
√1 [η1 (~xA )η2 (~xB ) − η2 (~xA )η1 (~xB )] [χ+ (A)χ+ (B)]
2
√1 [η1 (~xA )η2 (~xB ) − η2 (~xA )η1 (~xB )] √12 [χ+ (A)χ− (B) + χ− (A)χ+ (B)]
2
√1 [η1 (~xA )η2 (~xB ) − η2 (~xA )η1 (~xB )] [χ− (A)χ− (B)]
2
√1 [η1 (~xA )η2 (~xB ) + η2 (~xA )η1 (~xB )] √12 [χ+ (A)χ− (B) − χ− (A)χ+ (B)]
2
η2 (~xA )η2 (~xB ) √12 [χ+ (A)χ− (B) − χ− (A)χ+ (B)] .
The ground state of the two-electron system is the first state listed: it is
non-degenerate.
Problems
6.6 Combining a spatial one-particle level with itself
What two-particle states can we build from the one-particle spatial level
with n = 3? How many of the resulting states are ortho, how many
para?
6.7 Change of basis through abstract rotation
Show that, in retrospect, the process of building states (6.35) and (6.37)
from states (6.29) and (6.30) is nothing but a “45◦ rotation” in the style
of equation (4.39).
6.8 Normalization of singlet spin state
Justify the normalization constant √12 that enters in moving from equa-
tion (6.33) to equation (6.37). Compare this singlet spin state to the
entangled state (3.37). (Indeed, one way to produce an entangled pair
of electrons is to start in a singlet state and then draw the two electrons
apart.)
6.9 Ortho and para accounting
Show that in our case with M/2 spatial energy levels, the two-electron
energy basis has 21 M (M − 1) elements, of which
3
2 (M/2)[(M/2) − 1] are ortho
(antisymmetric in space and symmetric in spin) and
1
2 (M/2)[(M/2) + 1] are para
(symmetric in space and antisymmetric in spin).
6.7. Spin plus space, three electrons, ground state 225
Three electrons are in the situation described in the first paragraph of sec-
tion 6.6 (energy independent of spin, electrons don’t interact). The full
listing of energy eigenstates has been done, but it’s an accounting night-
mare, so I ask a simpler question: What is the ground state?
Call the one-particle spatial energy levels η1 (~x), η2 (~x), η3 (~x), . . . . The
ground state will be the antisymmetrized combination of the three levels
η1 (~xA )χ+ (A) η1 (~xB )χ− (B) η2 (~xC )χ+ (C)
226 Spin plus space, three electrons, ground state
Problems
6.11 A doomed attempt (essential problem)
Any linear combination of state (6.41) with state (6.42) has the form
|1, 1, 2i α| ↑↓↑i + β| ↑↓↓i
−|1, 2, 1i α| ↑↑↓i + β| ↑↓↓i
+|2, 1, 1i α| ↑↑↓i + β| ↓↑↓i
−|2, 1, 1i α| ↑↓↑i + β| ↓↓↑i
+|1, 2, 1i α| ↓↑↑i + β| ↓↓↑i
−|1, 1, 2i α| ↓↑↑i + β| ↓↑↓i . (6.43)
Show that this form can never be factorized into a space part times a
spin part.
6.12 Questions (recommended problem)
Update your list of quantum mechanics questions that you started at
problem 1.17 on page 46. Write down new questions and, if you have un-
covered answers to any of your old questions, write them down briefly.
Chapter 7
Atoms
During the months following these discussions [in the autumn of 1926] an
intensive study of all questions concerning the interpretation of quantum
theory in Copenhagen finally led to a complete and, as many physicists
believe, satisfactory clarification of the situation. But it was not a so-
lution which one could easily accept. I remember discussions with Bohr
which went through many hours till very late at night and ended almost
in despair; and when at the end of the discussion I went alone for a
walk in the neighboring park I repeated to myself again and again the
question: Can nature possibly be as absurd as it seemed to us in these
atomic experiments?
— Werner Heisenberg, Physics and Philosophy
(Harper, New York, 1958) page 42
All this is fine and good — lovely, in fact. But we have to apply quan-
tum mechanics to experimentally accessible systems, and while things like
carbon nanotubes exist, the most readily accessible systems are atoms.
229
230 Central potentials in two dimensions
The part in square brackets is called “the Laplacian of η(x, y)” and is
represented by the symbol “∇2 ” as follows
2
∂ f (x, y) ∂ 2 f (x, y)
+ ≡ ∇2 f (x, y). (7.3)
∂x2 ∂y 2
Thus the “mathematical form” of the energy eigenproblem is
2M
∇2 η(~r) + [E − V (~r)]η(~r) = 0. (7.4)
~2
This suggests that the true analog of the one-dimensional η(x) is not
R(r), but rather
√
u(r) = rR(r). (7.20)
Furthermore,
√
1 d dR(r) 1 1 u(r)
00
if u(r) = rR(r), then r =√u (r) + .
r dr dr r 4 r2
(7.21)
Using this change of function, the radial equation (7.16) becomes
d2 u(r) 1 u(r) 2M ~2 `2
+ + E − V (r) − u(r) = 0,
dr2 4 r2 ~2 2M r2
d2 u(r) 2M ~2 (`2 − 41 ) 1
+ 2 E − V (r) − u(r) = 0. (7.22)
dr2 ~ 2M r2
energy eigenvalue
Problem
7.1 Normalization condition
What is the normalization condition for un,` (r)? Be sure to distinguish
the cases ` = 0 and ` 6= 0.
236 Central potentials in three dimensions
The method used for central potentials in two dimensions works in three
dimensions as well. The details are (as expected) messier: you have to
use three spherical coordinates (r, θ, φ) rather than two polar coordinates
(r, θ), so you have to use separation of variables with a product of three one-
variable functions rather than a product of two one-variable functions. Thus
there are two separation constants rather than one. Instead of presenting
these messy details, I’ll just quote the result:
To solve the three-dimensional energy eigenproblem for a spherically
symmetric potential energy function V (r), namely
~2 2
− ∇ η(~r) + V (r)η(~r) = Eη(~r), (7.28)
2M
first solve the one-dimensional radial energy eigenproblem
~2 d2 u(r) ~2 `(` + 1) 1
− + V (r) + u(r) = Eu(r) (7.29)
2M dr2 2M r2
for ` = 0, 1, 2, . . .. For a given `, call the resulting energy eigenfunctions and
eigenvalues un,` (r) and En,` for n = 1, 2, 3, . . .. Then the three-dimensional
solutions are
un,` (r) m
ηn,`,m (r, θ, φ) = Y` (θ, φ) with energy En,` , (7.30)
r
where the “spherical harmonics” Y`m (θ, φ) are particular special functions
of the angular variables that you could look up if you needed to. The integer
separation constant m takes on the 2` + 1 values
−`, −` + 1, . . . , 0, . . . , ` − 1, `.
Notice that the 2` + 1 different solutions for a given n and `, but with
different m, are degenerate.
In addition, there’s a strange terminology that you need to know. You’d
think that the states with ` = 0 would be called “` = 0 states”, but in fact
they’re called “s states”. You’d think that the states with ` = 1 would be
called “` = 1 states”, but in fact they’re called “p states”. States with ` = 2
are called “d states” and states with ` = 3 are called “f states”. (I am told
that these names come from a now-obsolete system for categorizing atomic
spectral lines as “sharp”, “principal”, “diffuse”, and “fundamental”. States
with ` ≥ 4 are not frequently encountered, but they are called g, h, i, k, l,
m, . . . states. For some reason j is omitted. “Sober physicists don’t find
giraffes hiding in kitchens.”)
Atoms 237
energy eigenvalue
This graph shows only the four lowest energy eigenvalues for each value of `.
A single horizontal line in the “` = 0 (s)” column represents a single energy
eigenfunction, whereas a single horizontal line in the “` = 2 (d)” column
represents five linearly independent energy eigenfunctions, each with the
same energy (“degenerate states”).
238 The hydrogen atom
Problem
7.2 Dimensions of η(~r) and of u(r)
In equation (7.27) for the two-dimensional central potential problem,
what are the dimensions of η(~r) and of u(r)? In equation (7.30) for the
three-dimensional central potential problem, what are the dimensions
of η(~r) and of u(r)? [This result helps motivate the definitions u(r) =
√
rR(r) in two dimensions and u(r) = rR(r) in three dimensions.]
An electron (of mass M ) and a proton interact through the classical elec-
trostatic potential energy function — called the “Coulomb potential” —
1 e2
V (r) = − , (7.31)
4π0 r
so you might think that the energy eigenproblem for the hydrogen atom is
~2 2 1 e2
− ∇ η(~r) − η(~r) = Eη(~r). (7.32)
2M 4π0 r
That’s not exactly correct. This eigenproblem treats the proton as station-
ary while the electron does all the moving: in fact, although the proton is
almost 2000 times more massive than the electron, it’s not infinitely massive
and it does do some moving. This eigenproblem assumes the proton is a
point particle: in fact, although the nucleus is small compared to an atom,
it does have some size. This eigenproblem is non-relativistic and it treats
the electromagnetic field as purely classical: both false. This eigenproblem
ignores the electron’s spin. All of these are good approximations, but this
is a model for a hydrogen atom, not the exact thing.2
But let’s work with the approximation we have, rather than holding out
for an exact solution of an exact eigenproblem that will never come.3 What
happens if we solve the three-dimensional central potential problem with
the model potential energy function (7.31)? We don’t yet have the mathe-
matical tools to actually perform this solution, but we are in a position to
appreciate the character of the solution.
2 The corrections to the energy eigenvalues produced by equation (7.32) due to these
prediction that a hurricane has an 80% chance of arriving at about 7:00 pm than be
totally clueless about a hurricane bearing down on your home.
Atoms 239
n=4
n=3
n=2
n=1
energy eigenvalue
240 The hydrogen atom
formula empirically in 1888. Do not confuse the Rydberg energy Ry = 13.6 eV with the
Rydberg constant R∞ = 1.097 × 107 m−1 .
Atoms 241
It’s a triumph to know the energy eigenvalues, but we should know also
something about the energy eigenfunctions, which are labeled ηn,`,m (~r). A
terminology note is that an energy eigenfunction with n = 3, ` = 2, and
any value of m — that is η3,2,m (x) — is called a “3d state”.
To gain this knowledge we need to first understand the effective potential
energy function falling within square brackets in equation (7.29):
1 e2 ~2 `(` + 1) 1
Veff (r) = − + . (7.35)
4π0 r 2M r2
This function is sketched schematically on the next page. For large values
of r, to the right in the sketch, 1/r is bigger than 1/r2 , so Veff (r) is almost
the same as the 1/r Coulomb potential energy alone. For small values of
r, to the left in the sketch, 1/r is smaller than 1/r2 , so Veff (r) is almost
the same as the 1/r2 part alone. For intermediate values of r, the function
Veff (r) has to swing between these two limits, as sketched.
The result of this swinging will of course depend upon the value of `,
and the results for four values of ` are sketched schematically on page 243.
242 The hydrogen atom
Veff (r)
~2 `(` + 1) 1
+
2M r2
1 e2
−
4π0 r
Atoms 243
Veff (r)
`=3
`=2
`=1
r
`=0
244 The hydrogen atom
This graph suggests that for a given value of n, the states with larger
` will have larger mean values for r, the distance from the proton to the
electron.
7.3.4 Transitions
If the energy eigenequation (7.32) for the hydrogen atom were exactly cor-
rect, then a hydrogen atom starting in the excited energy state η3,2,−1 (~r)
would remain in that state forever. Furthermore, a hydrogen atom start-
ing in a linear combination with probability 0.6 of being in energy state
η3,2,−1 (~r) and probability 0.4 of being in energy state η2,1,0 (~r) would main-
tain those probabilities forever.
But the energy eigenequation (7.32) is not exactly correct. It ignores
collisions, it ignores external electromagnetic field (e.g., incident light), and
it ignores coupling to the electromagnetic field (e.g., radiated light). These
effects mean that the state η3,2,−1 (~r) is a stationary state of the model
eigenproblem, but it is not a stationary state of the exact eigenproblem. In
other words, these effects result in transitions between stationary states of
the model eigenproblem.
To understand these transitions you need to understand the transition-
causing effects, and at this point in your education you’re not ready to
do that. But I’ll tell you one thing right now: a transition involving a
single photon (either absorbing or emitting a single photon) will result in
a transition with ∆` = ±1. So, for example, a hydrogen atom in a 2p state
(that is, one with n = 2, ` = 1, and any legal value of m) could transition
to the 1s ground state by emitting a single photon. A hydrogen atom in a
2s state (that is, one with n = 2, ` = 0, and m = 0) cannot transition to the
ground state by emitting a single photon. (It could do so by emitting two
photons, or through a collision.) An atom in the 1s ground state, exposed
to a light source with photons of energy 43 Ry, can be excited to 2p state
by absorbing a single photon, but it cannot be excited to the 2s state by
absorbing a single photon.
I regard this fact (which, by the way, holds not only for the hydrogen
atom but for any central potential) as a picky detail appropriate for an
advanced course, but the people who write the Graduate Record Exam in
physics seem to think it’s important so you should probably remember it.
(Or at least review this page the evening before you take the physics GRE.)
Atoms 245
Problems
7.3 Counting hydrogen states
Show that the degeneracy for states characterized by n is n2 .
7.4 Dimensional analysis for energy eigenvalues
The eigenproblem (7.32) contains only two parameters:
~2 e2
and .
M 4π0
Use dimensional analysis to show that these two parameters can come
together to form an energy in only one way. [[I remember the Rydberg
energy as
1 (e2 /4π0 )2
Ry = (7.36)
2 ~2 /M
using this dimensional analysis trick.]]
7.5 Which states are distant, which are close? (essential problem)
Argue, on the basis of the graph on page 242, that for a given value of
`, states with larger n will have larger mean values for r.
7.6 Energy eigenvalues for the He+ ion (essential problem)
A helium atom with one electron stripped away is called a He+ ion.
This situation is simply one electron ambivating in the potential energy
established by a highly-massive nucleus: it is just like the hydrogen
atom, except that the nuclear change is +2e rather than +e. At the
level of approximation used in equation (7.32), the energy eigenproblem
for the He+ ion is
~2 2 1 2e2
− ∇ η(~r) − η(~r) = Eη(~r). (7.37)
2M 4π0 r
Show that (at this level of approximation) the energy eigenvalues for
the He+ ion are
Ry
En = −4 2 . (7.38)
n
The situation of a single electron ambivating in the potential energy
established by a highly-massive carbon nucleus of charge +6e is called
the C5+ ion. Show that (at this level of approximation) the energy
eigenvalues are
Ry
En = −36 2 . (7.39)
n
246 The hydrogen atom
Here’s the energy eigenproblem for the helium atom, at the same level of
approximation as the eigenproblem (7.32) for the hydrogen problem:
~2 2 ~2 2
− ∇A η(~rA , ~rB ) − ∇ η(~rA , ~rB )
2M 2M B
1 2e2 1 2e2 1 e2
− η(~rA , ~rB ) − η(~rA , ~rB ) + η(~rA , ~rB )
4π0 rA 4π0 rB 4π0 |~rA − ~rB |
= Eη(~rA , ~rB ). (7.43)
We have no chance whatsoever of solving this “two electron plus one nu-
cleus” problem exactly. Even the classical problem of three particles inter-
acting through 1/r potentials, first posed by Isaac Newton in 1687, has not
yet been solved exactly. (And probably never will be, because the result-
ing behavior is known to be chaotic.) Since classical mechanics is a subset
of quantum mechanics, an exact solution to this helium problem would
contain within it an exact solution to the unsolved classical “three-body
problem”.
Does this mean we should give up? Not at all. We should instead look
for approximate solutions that are not exact, but highly accurate for the
bound-state regime of interest.
Our approach will involve solving the one-electron problem for a differ-
ent potential, and then using those one-electron levels as building blocks
for the two-electron problem through the antisymmetrization machinery
of equation (6.11). The strategy may seem crude, but in practice it can
produce highly accurate results.
Instead of focusing on two electrons, interacting with the nucleus and
with each other, focus on one electron interacting with the nucleus and with
the average of the other electron. I don’t yet know exactly how the “other”
electron is averaged, but I assume it spreads out in a spherically symmetric
cloud-like fashion.
Finding the potential. Remember, from your electrostatics course,
the shell theorem for spherically symmetric charge distributions: When the
electron under focus is close to the nucleus, it feels only the electric field
due to the nucleus, so the potential energy is
1 2e2
for small r, V (r) ≈ − . (7.44)
4π0 r
248 The helium atom
Whereas when the electron under focus is far from the nucleus, it feels the
electric field due to the nucleus, plus the electric field due to the cloud
collapsed into the nucleus, so the potential energy is
1 e2
for large r, V (r) ≈ − . (7.45)
4π0 r
The potential energy felt by the electron under focus will interpolate be-
tween these two limits, something like the solid line graphed below.
V (r)
1 e2
−
4π0 r 1 2e2
−
4π0 r
n=4
n=3
n=2
n=1
energy eigenvalue
To the left, under the heading “He+ ion”, are the s state eigenvalues (7.49),
which are four times deeper than those under the heading “H atom”, the
s state eigenvalues (7.47). I’ve drawn a vertical line midway between them
and dashed lines connecting the the two sets of eigenvalues. If the eigenval-
ues for the shielded potential were exactly halfway between the eigenvalues
for the two limits, then they would fall where the dashed lines cross the ver-
tical line. But they don’t fall exactly there. For states that are mostly near
the nucleus, the energies are closer to (7.49). For states that are mostly far
from the nucleus, the energies are closer to (7.47). We have already seen
(problem 7.5 on page 245) that, for a given `, the eigenfunctions with larger
Atoms 251
n are farther from the nucleus. Chemists like to say that the eigenfunctions
that are mostly close to the nucleus — those with smaller n — have more
“penetration”.
This process can be repeated for p states, d states, and f states. Because,
for a given n, the eigenfunction with larger ` is farther from the nucleus
(less “penetration”, see page 244), the eigenfunction with larger ` will have
higher energy. Thus a shielded potential energy function will give rise to a
set of energy eigenvalues like this:
n=4
n=3
n=2
n=1
energy eigenvalue
252 The lithium atom
Problem
7.9 The hydrogen molecule ion
If a hydrogen molecule H2 is stripped of one electron, the result is
the “hydrogen molecule ion” consisting of two nuclei and one electron.
Show that if we had solved the helium problem exactly we would have
also solved the hydrogen molecule ion problem. (But we have not solved
the problem exactly: instead we found approximations appropriate for
the case of atomic helium. A completely different set of approximations
are appropriate for the hydrogen molecule ion.)
Atoms larger than lithium are difficult. Specialists have examined them in
exquisite detail, but in this book we’re not going to try to find the energy
spectrum, we’re not going to try to find the ground state degeneracy, we’re
not even going to try to write down a ground state. Instead, we’re only
going to list the one-electron levels that are thrown together through the
antisymmetrization machinery (6.11) to make the many-electron ground
state.
Try this buliding-up machinery for carbon, with a nucleus and six elec-
trons.6 In the figure on page 250, pertaining to helium, the left-hand ener-
gies are four times deeper than the right-hand energies. If I were to draw a
parallel figure for carbon (see equation 7.39), the left-hand energies would
be 36 times deeper than the right-hand energies! The net result is that,
while the figure on page 251 shows a modest increase in energy En,` for
a given n as you move right to higher values of `, for carbon the energy
increase will be dramatic: something like the figure on the next page. For
atoms bigger than carbon, the increase will be still more dramatic.
6 Carbon is a seven-body problem, not a three-body problem like helium, so of course
6d 5f
7s 6p 5d 4f
6s
5p 4d
5s
4p
4s 3d
3p
3s
2p
2s
1s
energy eigenvalue
Schematic energy levels for a generic atom, about the size of carbon.
planation of the periodic system: Review and update” Journal of Chemical Education
87 (April 2020) 435–443; Gregory Anderson, Ravi Gomatam, and Laxmidhar Behera,
256 The periodic table
Madelung sequence is 1s2 2s2 2p6 3s2 3p6 4s2 3d4 whereas experiment shows the
actual configuration ends instead with 4s1 3d5 .
Problem
7.10 Ground state degeneracy
At the level of approximation of the diagram on page 254, find the
degeneracy of the ground state of boron, of carbon, of nitrogen, of
oxygen, of fluorine, and of neon.
While the Madelung sequence is not perfect (few things are), it makes sense
to see what it has to say (“better to light a single candle, no matter how
faint, than to curse the darkness”).
Compare the levels that go into building up carbon (atomic number
6) with those that go into building up silicon (atomic number 14). For
carbon they are 1s2 2s2 2p2 ; for silicon they are 1s2 2s2 2p6 3s2 3p2 . Note the
similarities of those last, highest energy levels: carbon ends with 2s2 2p2 ,
silicon ends with 3s2 3p2 . It’s possible that for carbon the nucleus (charge
+6) and the 1s and 2s electron levels (charge −4) act together to form an
atom core of net charge +2. Meanwhile it’s just as possible that for silicon
the nucleus (charge +14) and the 1s, 2s, 2p, and 3s electron levels (charge
−12) similarly act together to form an atom core again with net charge +2.
If this possibility is correct, then you would expect carbon and silicon to
have similar chemical behavior. Sure enough each of them bonds with four
other atoms: methane (CH4 ) is chemically analogous to silane (SiH4 ).
Through parallel reasoning you would expect neon, 1s2 2s2 2p6 , to behave
similarly to argon, 1s2 2s2 2p6 3s2 3p6 . Again this expectation holds: both
neon and argon are noble gases that react reluctantly with other atoms.
Chemists have an ingenious system for showing these chemical similarities
through a graphic called “the periodic table”, shown on the next page.
1
H He
1 2
2
Li Be B C N O F Ne
3 4 5 6 7 8 9 10
3
Na Mg Al Si P S Cl Ar
11 12 13 14 15 16 17 18
4
K Ca Sc Ti V Cr Mn Fe Co Ni Cu Zn Ga Ge As Se Br Kr
19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36
5
Rb Sr Y Zr Nb Mo Tc Ru Rh Pd Ag Cd In Sn Sb Te I Xe
37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54
6
Cs Ba ∗ Lu Hf Ta W Re Os Ir Pt Au Hg Tl Pb Bi Po At Rn
55 56 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86
7
Fr Ra ∗ Lr Rf Db Sg Bh Hs Mt Ds Rg Cn
87 88 ∗ 103 104 105 106 107 108 109 110 111 112
∗ La Ce Pr Nd Pm Sm Eu Gd Tb Dy Ho Er Tm Yb
57 58 59 60 61 62 63 64 65 66 67 68 69 70
∗ Ac Th Pa U Np Pu Am Cm Bk Cf Es Fm Md No
∗ 89 90 91 92 93 94 95 96 97 98 99 100 101 102
comes from feeding a 1s spin up level, plus a 1s spin down level, plus now a 2s level, into
the “multiply and antisymmetrize” machinery at equation (6.11) in order to generate
a three-electron state. This is a real mouthful, so we simply write “add one 2s level”.
In some books you will see this written as “add one 2s electron”, but that language
reinforces the misconception that the electron in the 2s level can be distinguished from
the other two electrons, in which case the three electrons would not be identical. See
the warnings on page 252 and in problem 6.10, “Intersystem crossing”, on page 225.
258 The periodic table
energy level in scandium is not a p level at all, it’s a 3d level. There are
ten such levels, accounting for the ten elements from scandium through
zinc. The element beyond zinc is gallium, which sure enough shares a lot of
properties with its vertical neighbor aluminum. Row 4 continues by adding
4p levels until ending with the noble gas krypton.
Row 5 is very much like row 4.
Row 6 adds a new twist after barium, atomic number 56. The extra
level in lanthanum, atomic number 57, is a 4f level. The fourteen 4f levels
account for the fourteen elements from lanthanum through ytterbium. (The
page in this book is not wide enough to hold row 6 of the periodic table, so
I shoehorn in these fourteen elements using an asterisk.) Then the ten 5d
levels account for the ten elements from lutetium through mercury (symbol
Hg from the Greek “hydrargyrum”, meaning “liquid silver”). Finally the
six 6p levels account for the six elements from thallium through the noble
gas radon.
Row 7 is very much like row 6. Many of the elements in row 7 have
short-lived nuclei, but that’s a different story.
Any chemist will tell you, correctly, that this lightning tour of the pe-
riodic table leaves out a lot of fascinating detail. But its very briefness
means that you have not been distracted by detail and have kept sight of
the central fact that the entire structure of the periodic table — and hence
all of chemistry — follows from the Pauli requirement for antisymmetry
under fermion coordinate swaps.
Problem
7.11 Questions (recommended problem)
Update your list of quantum mechanics questions that you started at
problem 1.17 on page 46. Write down new questions and, if you have un-
covered answers to any of your old questions, write them down briefly.
This is the last chapter of the book, but this book itself is an invita-
tion only, so it is not the last chapter of quantum mechanics. There are
many fascinating topics that this book hasn’t even touched on. Quantum
mechanics will — if you allow it — surprise and delight and mystify you
for the rest of your life.
This book devotes two chapters to qubits, also called spin- 12 systems.
Plenty remains to investigate: “which path” interference experiments,
delayed-choice interference experiments, many different entanglement situ-
ations. For example, we developed entanglement through a situation where
the quantal probability was 21 while the local deterministic probability was
5
9 or more (page 80). Different, to be sure, but not dramatically differ-
ent. In the Greenberger–Horne–Zeilinger entanglement situation the quan-
tal probability is 1 and the local deterministic probability is 0. You can’t
find probabilities more different than that! If you find these situations as
fascinating as I do, then I recommend George Greenstein and Arthur G.
Zajonc, The Quantum Challenge: Modern Research on the Foundations of
Quantum Mechanics.
For many decades, research into qubits yielded insight and understand-
ing, but no practical applications. All that changed with the advent of
quantum computing. This is a rapidly changing field, but the essay “Quan-
tum Entanglement: A Modern Perspective” by Barbara M. Terhal, Michael
M. Wolf, and Andrew C. Doherty (Physics Today, April 2003) contains core
259
260 The territory ahead
insights that will outlive any transient. From the abstract: “It’s not your
grandfather’s quantum mechanics. Today, researchers treat entanglement
as a physical resource: Quantum information can now be measured, mixed,
distilled, concentrated, and diluted.”
Because quantum mechanics is both intricate and unfamiliar, a
formidable yet beautiful mathematical formalism has developed around
it: position wavefunctions, momentum wavefunctions, Fourier transforms,
operators, Wigner functions. These are powerful precision tools, so mag-
nificent that some confuse the tools with nature itself. Every quantum
mechanics textbook develops this formalism to a greater or lesser extent,
but I also recommend the cute book by Leonard Susskind and Art Fried-
man, Quantum Mechanics: The Theoretical Minimum.
Here are two formal problems to whet your appetite: Back on page 140
I quoted O. Graham Sutton that “A technique succeeds in mathematical
physics, not by a clever trick, or a happy accident, but because it expresses
some aspect of a physical truth.” Thus inspired, we asked about the mean-
ing of the separation constant (4.16), and that inquiry led us to the whole
structure of stationary states and the energy eigenproblem. (And to a poem
by T.S. Eliot, page 146.) But when we faced a similar separation constant
at equation (7.10) we were too busy to follow up and ask what it was telling
us. If you study more quantum mechanics, you will learn that this sepa-
ration constant is related to angular momentum, that angular momentum
is related to rotations, and that the conservation of angular momentum is
related to rotational symmetry!
The second problem involves the simple harmonic oscillator, that is,
the potential energy function V (x) = 12 kx2 . As with any one-dimensional
potential well there are energy eigenstates. If any one of these states is
shifted by a distance, it is of course no longer an energy eigenstate, so it
does change with time. The remarkable thing is how it changes with time:
the probability density does not spread, nor compact, nor change shape.
Instead it rigidly slides back and forth with the same period that a classical
particle would have in that same potential well.1 When I first did the math
to show that this is so, I was so astounded that I wrote a computer program
to check out the math. This remarkable fact is true, and I have the feeling
1 M.E. Marhic, “Oscillating Hermite-Gaussian wave functions of the harmonic oscilla-
tor” Lettere al Nuovo Cimento 22 (1978) 376–378, and C.C. Yan, “Soliton like solutions
of the Schrödinger equation for simple harmonic oscillator” American Journal of Physics
62 (1994) 147–151.
The Vistas Open to Us 261
Problem
8.1 Questions (recommended problem)
This is the end of the book, not the end of quantum mechanics. Write
down any questions you have concerning quantum mechanics. Perhaps
you will answer some of these through future study. Others might
suggest future research directions for you.
Appendix A
Significant Figures
263
264 Significant Figures
our lack of certainty? How do we work with (add, subtract, multiply, take
logarithms of, and so forth) quantities that aren’t certain?
Expressing uncertainty
The convention for expressing uncertain quantities is simple: any digit writ-
ten down is a significant digit. A plank measured to the nearest millimeter
has a length expressed as, say, 103.7 cm or 91.5 cm or 135.0 cm. Note
particularly the trailing zero in 135.0 cm: this final digit is significant.
The quantity “135.0 cm” is different from “135 cm”. The former means
“135.0?? cm”, the latter means “135.??? cm”. In the former, the digit in
the tenths place is 0, while in the latter, the digit in the tenths place is
unknown.
This convention gives rise to a problem for representing large numbers.
Suppose the distance between two stakes is 45.6 meters. What is this
distance expressed in centimeters? The answer 4560 cm is unsatisfactory,
because the trailing zero is not significant and so, according to our rule,
should not be written down. This quandary is resolved using exponential
notation: 45.6 meters is the same as 4.56 × 103 cm. (This is, unfortunately,
one of the world’s most widely violated rules.)
187.6
+ 2.3405
--------
189.9405
But no! This is treating the unknown digits in 187.6 cm as if they were
zeros, when in fact they’re question marks. The proper way to perform the
265
sum is
187.6?????
+ 2.3405??
----------
189.9?????
1243?
x 52?
--------
?????
2486?
6215?
--------
64????
Any measured number is uncertain, but counted and defined quantities can
be certain. If there are seven people in a room, there are 7.0000000 . . .
people. There are never 7.00395 people in a room. And the inch is defined
to be exactly 2.5400000 . . . centimeters — there’s no uncertainty in this
conversion factor, either.
Conclusions
Dimensions
269
270 Dimensions
spacecraft had survived perfectly the long and perilous trip from Earth to
Mars. How could it have failed so spectacularly in the final phase of its
journey? The manufacturer, Lockheed Martin Corporation, had told the
the spacecraft controllers, at NASA’s Jet Propulsion Laboratory, the thrust
that the probe’s rockets could produce. But the Lockheed engineers gave
the thruster performance data in pounds (the English unit of force), and
they didn’t specify which units they used. The NASA controllers assumed
that the data were in newtons (the SI unit of force).
Unit conversions
In short, the symbol “cm” can be manipulated exactly like the symbol
“`cm ”, because that’s exactly what it means!
272 Dimensions
Incompatible dimensions
If I walk for 4.00 m, and then for 59 cm, how far did I go? The answer is
459 cm or 4.59 m, but not 4.00 + 59 = 63 of anything!
If I walk for 4.00 m, and then pause for 42 seconds, how far did I go?
Certainly not 4.00 m + 42 s. The number 46 has no significance in this
problem. For example, it can’t be converted into minutes.
In general, you can’t add quantities with different units.
This rule can be quite helpful. Suppose you’re working a problem that
involves a speed v and a time t, and you’re asked to find a distance d.
Someone approaches you and whispers: “Here’s a hint: use the equa-
tion d = vt + 21 vt2 .” You know that this hint is wrong: The quantity
vt has the dimensions of [length], but the quantity 12 vt2 has the dimen-
sions of [length]×[time]. You can’t add a quantity with the dimensions
of [length]×[time] to a quantity with the dimensions of [length], any more
than you could add 42 seconds to 4.00 meters.
2 Details presented in Michael B.A. Deakin, “G.I. Taylor and the Trinity test” In-
Taylor realized that this sequence of photos showed a shock wave ex-
panding into an undisturbed medium, and he knew from his previous expe-
rience that the radius of the fireball r could depend upon only three things:
the density of undisturbed air ρ, the energy released through the explosion
E, and the time since the explosion t.
quantity dimensions
3
ρ [mass]/[length]
2 2
E [mass] × [length] /[time]
t [time]
r [length]
Taylor knew that to build r out of ρ, E, and t, he had to cancel out the
[mass] that appears in ρ and E but that cannot enter into r. Thus he had
to build r out of E/ρ and t.
quantity dimensions
5 2
E/ρ [length] /[time]
t [time]
r [length]
2
Now Taylor had to cancel the [time] from the denominator of E/ρ using
the variable t:
quantity dimensions
2 5
Et /ρ [length]
r [length]
Conclusions
Problems
Complex Arithmetic
√
The familiar numbers like 17, 35 , 2, −π and so forth are called “real
numbers”. The square of any real number is non-negative. But we can
imagine a new category of numbers that have negative squares. We first
imagine the number i, with i2 = −1. Then we can imagine the number 3
times i, with (3i)2 = −9. These are called “imaginary numbers”.
The names “real” and “imaginary” are unfortunate. Numbers are use-
ful abstractions that exist in our minds: you’ve seen two hands, and two
fingers, and two apples; you’ve seen the Arabic numeral “2” and the Roman
numeral “II”, which are made of ink; you’ve seen the English word “two”,
the German word “zwei”, and the Somali word “laba”, again made of ink;
but you’ve never seen the number 2, which is made of pure thought. In
the usual sense of the words “real” and “imaginary”, no number is real; all
numbers are imaginary.
The sum of a real number and an imaginary number is called a “complex
number” (another unfortunate name). Just as the real number x can be
profitably visualized as a point on the one-dimensional real line, so the
complex number z = x+iy can be profitably visualized as the point (x, y) on
the two-dimensional “complex plane”. This book assumes some background
in complex arithmetic. If your knowledge is rusty, these problem should
grease your mental gears.
Problems
277
278 Complex Arithmetic
A physicist can wax eloquent about concepts like interference and entangle-
ment, but can also use those concepts to solve problems about the behavior
of nature and the results of experiments. This appendix gives general ad-
vice on problem solving, then lists the problem-solving tools introduced and
elaborated upon in this book.
You have heard that “practice makes perfect”, but in fact practice makes
permanent. If you practice slouchy posture, sloppy reasoning, or inefficient
problem-solving technique, these bad habits will become second nature to
you. For proof of this, just consider the career of [[insert here the name of
your least favorite public figure, current or historical, foreign or domestic]].
So I urge you to start now with straight posture, dexterous reasoning, and
facile problem-solving technique, lest you end up like [[insert same name
here]].
Suppose you want to travel from San Diego to Boston. You start by deciding
whether to fly, take a bus, or drive a car. If you decide to fly, you then
make subsidiary decisions like choice of airline. If you decide to drive, you
make different subsidiary decisions: Should you first change your car’s oil?
Should you take a side trip to the Grand Canyon? Or to visit your friends
in Boulder, Colorado? What you don’t do is just step out of your front door
and walk northeast: you make a plan before taking that very first step.
And just as your journey begins before you take your first step, so it
extends after you take your last step. Any journey, properly considered,
includes reflection upon that journey. This might be merely technical (“I’ll
279
280 Problem-Solving Tips and Techniques
never travel on that bus line again.”) or it might open a door to future
travel (“The Grand Canyon was so spectacular! Next time I’ll hike from
the rim down to the Colorado River.”) or it might be deeper still (“My
friends in Boulder seem so happy together. I need to rethink my plan of
remaining single all my life.”).
As with travel, so with physics problem solving. The first step is to un-
derstand the problem. What is given? What is asked for? For problems
in classical mechanics and electromagnetism, one tool for understanding
the problem is to sketch the situation. In relativity, if often helps to
make two sketches: one for each reference frame. Some quantum mechan-
ics problems are so abstract that a sketch doesn’t help you understand the
problem, but often doodling plays that same role.
The next step is to select a strategy — a key idea to employ — before
rushing in to make detailed derivations. Is this a time evolution problem?
An energy eigenproblem? An interference or entanglement problem?
Once you pass on to implementing your strategy, keep your goal in
mind to avoid deriving endless numbers of equations that are true but that
don’t help you reach your goal.
Finally, once you’re reached that goal, reflect on your final result.
What is nature trying to tell you through this problem? Is the result in
accord with your expectations? A good example of this stage of problem
solving appears on page 171. Instead of reaching equation (4.105), saying
“That’s the end”, and heading to bed for some well-earned sleep, we spent
two paragraphs on the consequences of that equation, found them remark-
able and unexpected, and used them to illuminate the role of interference
in quantum mechanics. I.I. Rabi reflected on the consequences even more
deeply, and used those reflections to invent the atomic clock. Another ex-
ample concerns the Planck radiation law (1.13): pages 16–23, plus sample
problem 1.2.1, are ten pages of reflection upon that single brief equation.
As with travel, such reflection might be merely technical (“Why did I
have to work out that integral in detail? I should have seen from symmetry
that the result would be zero.”) or it might be deeper (“When Styer said
an atom might not have a position, I thought he was spouting bullshit. But
after working problem 2.7, ‘Bomb-testing interferometer’, I realize that I
have to rethink my ideas about how atoms behave.”).
Problem-Solving Tips and Techniques 281
Catalog of Misconceptions
283
284 Catalog of Misconceptions