Bholes
Bholes
Lecture notes
              IAP, Paris, March-April 2016
       CP3, UCL, Louvain-la-Neuve, November 2016
            DIAS-TH, BLTP, Dubna, May 2017
                 Les Houches, July 2018
           CSGC, IITM, Chennai, January 2022
               ENS, Paris, May-June 2023
              AEI, Potsdam, December 2023
                 IHP, Paris, March 2024
               ENS, Paris, April-June 2025
                         Éric Gourgoulhon
    Laboratoire d’étude de l’Univers et des phénomènes eXtrêmes
      CNRS, Observatoire de Paris - PSL, Sorbonne Université
                    eric.gourgoulhon@obspm.fr
https://relativite.obspm.fr/blackholes
                     — DRAFT —
                      version of 29 May 2025
   • at the Bogoliubov Laboratory of Theoretical Physics, in Dubna (Russia) in May 2017, within
     the framework of the Dubna International Advanced School of Theoretical Physics:
     http://www.jinr.ru/posts/lecture-course-geometry-and-physics-of-black-holes/
   • at the Summer School Gravitational Waves 2018, taking place at Les Houches (France) in
     July 2018:
     https://www.lkb.upmc.fr/gravitationalwaves2018/
   • remotely at the School on Black Holes and Gravitational Waves organized at the Centre for
     Strings, Gravitation and Cosmology of the Indian Institute of Technology Madras, Chennai
     (India) in January 2022:
     https://physics.iitm.ac.in/~csgc/events/sbhgw
   • at the École Normale Supérieure, Paris (France) in May-June 2023, as part of the PSL
     graduate programs in Physics and in Astrophysics:
     https://relativite.obspm.fr/blackholes/paris23
   • at the Institut Henri Poincaré, Paris (France) in March 2024, within the program Quantum
     and classical fields interacting with geometry:
     https://relativite.obspm.fr/blackholes/ihp24
   • at the École Normale Supérieure, Paris (France) in April-June 2025, as part of the PSL
     graduate program in Physics:
     https://relativite.obspm.fr/blackholes/paris25
4
         In complement to these notes, one may recommend various monographs devoted to black
    holes: O’Neill (1995) [391], Heusler (1996) [275], Frolov & Novikov (1998) [202], Poisson (2004)
    [416], Frolov & Zelnikov (2011) [204], Bambi (2017) [30], Chruściel (2020) [119], Grumiller &
    Sheikh-Jabbari (2022) [246] and King (2023) [316], as well as review articles by Carter (1987)
    [101], Wald (2001) [505], Chruściel (2002, 2005) [117, 118] and Chruściel, Lopes Costa & Heusler
    (2012) [123] and recent monographs with various chapters about black holes: Andersson (2020)
    [14] and Shibata (2016) [456]. In addition, let us point out other lecture notes on black holes:
    Hawking (1994) [265, 268], Townsend (1997) [483], Compère (2006, 2019) [136, 138], Dafermos
    and Rodnianski (2008) [149], Deruelle (2009) [161], Andersson, Bäckdahl & Blue (2016) [13]
    and Reall (2020) [426].
         The history of black holes in theoretical physics and astrophysics is very rich and fascinating.
    It is however not discussed here, except in some small historical notes. The interested reader is
    referred to Nathalie Deruelle’s lectures [161], to Kip Thorne’s book [479], to Carter’s article
    [105] and to Jean Eisenstaedt’s articles [180, 182].
         The web pages associated to these notes are
https://relativite.obspm.fr/blackholes
    They contain supplementary material, such as the SageMath notebooks presented in Ap-
    pendix D.
        I warmly thank Cyril Pitrou for having organized the Paris 2016 lectures, Fabio Maltoni and
    Christophe Ringeval for the Louvain-la-Neuve ones, Anastasia Golubtsova and Irina Pirozhenko
    for the Dubna ones, Bruce Allen, Marie-Anne Bizouard, Nelson Christensen and Pierre-François
    Cohadon for the Les Houches ones, Chandra Kant Mishra for the Chennai ones, Jean-François
    Allemand for the Paris 2023 and 2025 ones, Masaru Shibata and Karim Van Aeslt for the Potsdam
    ones and Dietrich Häfner, Frédéric Hélein and Michał Wrochna for the Paris 2024 ones.
        Besides, I am deeply indebted to Imène Belahcene, Jack Borthwick, Brandon Carter, Marc
    Casals, Udit Narayan Chowdhury, Stéphane Collion, Xiangyang Chen, Sumit Dey, Jean Eisen-
    staedt, Romain Gervalle, David Hirondel, Ted Jacobson, Michel Le Bellac, Alexandre Le Tiec,
    Jean-Philippe Nicolas, Jordan Nicoules, Micaela Oertel, Paul Ramond, Nicolas Seroux, Irène
    Urso and Frédéric Vincent for spotting mistakes, correcting typos and making nice suggestions
    in preliminary versions of the text.
I   Foundations                                                                                                                                 11
1 General framework                                                                                                                             13
  1.1 Introduction . . . . . . . . . . . . .    .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   13
  1.2 Spacetime . . . . . . . . . . . . . .     .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   13
  1.3 Worldlines . . . . . . . . . . . . . .    .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   17
  1.4 Quantities measured by an observer        .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   20
  1.5 Einstein equation . . . . . . . . . .     .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   22
    V    Appendices                                                                                                                                        709
    A Basic differential geometry                                                                                                                          711
      A.1 Introduction . . . . . . . . . .     .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   711
      A.2 Differentiable manifolds . . .       .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   711
      A.3 Pseudo-Riemannian manifolds          .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   721
      A.4 The three basic derivatives . .      .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   726
      A.5 Curvature . . . . . . . . . . .      .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   734
    B Geodesics                                                                                                                                            739
      B.1 Introduction . . . . . . . . . . . . . . .               .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   739
      B.2 Definition and first properties . . . . .                .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   739
      B.3 Existence and uniqueness of geodesics                    .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   744
      B.4 Geodesics and extremal lengths . . . .                   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   750
      B.5 Geodesics and symmetries . . . . . . .                   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   757
      B.6 Geodesics and curvature . . . . . . . .                  .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   .   759
Bibliography 787
Index                                                                                       819
10   CONTENTS
   Part I
Foundations
Chapter 1
General framework
Contents
        1.1   Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   13
        1.2   Spacetime . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .    13
        1.3 Worldlines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .     17
        1.4   Quantities measured by an observer . . . . . . . . . . . . . . . . . . . .         20
        1.5   Einstein equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .    22
1.1     Introduction
This chapter succinctly presents the spacetime framework used in these lectures (Sec. 1.2)
and recalls useful basic concepts, such as worldlines of particles and observers (Sec. 1.3 and
1.4). In an important part of these lectures (but not always!), we shall assume that the theory
of gravitation is general relativity; this means that the spacetime metric obeys the Einstein
equation, which is recalled in Sec. 1.5.
    This chapter is by no means an introduction to relativistic gravity. In this respect, we
recommend the textbooks [87, 109, 163, 257, 371, 464, 499], as well as [162, 229, 337] for
the French-speaking reader. The reader might also find useful to start by taking a look at
Appendix A, which recaps the concepts from differential geometry employed here.
1.2     Spacetime
1.2.1    General settings
In these lectures we consider a n-dimensional spacetime, i.e. a pair (M , g), where M is a
smooth real manifold of dimension n ≥ 2 and g is a Lorentzian metric on M . In some parts, n
will be set to 4 — the standard spacetime dimension — but we shall consider spacetimes with
n ̸= 4 as well.
14                                        General framework
     Figure 1.1: A smooth manifold M : the infinitesimal vector dx connects the nearby points p and q and thus
     can be thought as a displacement within the manifold, while the finite vector v does not correspond to any
     displacement within the manifold and “lives” in the tangent space Tp M .
          The precise definition and basic properties of a smooth manifold are recalled in Appendix A.
     Here let us simply say that, in loose terms, a manifold M of dimension n is a “space” (precisely
     a topological space) that locally resembles Rn , i.e. can be described by a n-tuple of coordinates
     (x0 , . . . , xn−1 ). However, globally, M can be very different from Rn , in particular regarding its
     topology. It could also happen that many coordinate systems are required to cover M , while a
     single one is obviously sufficient for Rn .
          The smooth structure endows the manifold with the concept of infinitesimal displace-
     ment vectors dx, which connect infinitely close points of M (cf. Fig. 1.1 and Sec. A.2.3 of
     Appendix A). However, for finitely separated points, there is no longer the concept of connect-
     ing vector (contrary for instance to points in Rn ). In other words, vectors on M do not live in
     the manifold but in the tangent spaces Tp M , which are defined at each point p ∈ M . Each
     Tp M is a n-dimensional vector space, which is generated for instance by the infinitesimal
     displacement vectors along the n coordinate lines of some coordinate system. Unless explicitly
     specified, we assume that M is an orientable manifold (cf. Sec. A.3.4).
          The full definition of the metric tensor g is given in Sec. A.3 of Appendix A. At each point
     p ∈ M , g induces a (non positive definite) scalar product on Tp M , which we shall denote by a
     dot:
                                  ∀(u, v) ∈ Tp M × Tp M , u · v := g(u, v).                           (1.1)
     Given a coordinate system (xα ), the components of g with respect to it are the n2 scalar fields
     (gαβ ) such that [cf. Eq. (A.40)]
                                                g = gαβ dxα dxβ ,                                         (1.2)
     where dxα is the differential 1-form of xα (cf. Sec. A.2.4) and the notation dxα dxβ stands for
     the symmetric tensor product defined by Eq. (A.39). The scalar square ds2 of an infinitesimal
     displacement vector dx = dxα ∂α is expressed in terms of the components (gαβ ) by the line
     element formula
                                    ds2 := g(dx, dx) = gαβ dxα dxβ .                           (1.3)
     This formula follows from Eq. (1.2) via the identity ⟨dxα , dx⟩ = dxα [Eq. (A.18)].
                                                1.2 Spacetime                                                  15
Figure 1.2: A Lorentzian manifold (M , g): at each point, the metric tensor g defines privileged directions:
those lying in the null cone at p.
Remark 1: In this book, we shall use the bilinear form identity (1.2) rather than the line element (1.3) to
present a metric (cf. Box. 3.2 of MTW [371] for a discussion). It turns out to be more convenient when
various metric tensors are involved, the notation ds2 being then ambiguous.
    The fact that the signature of g is Lorentzian, i.e.
implies that at each point p ∈ M , there are privileged directions, along which the line element
(1.3) vanishes; they form the so-called null cones or light cones (cf. Fig. 1.2). The null cones
constitute an absolute structure of spacetime, independent from any observer. A vector at a
point p ∈ M that is either timelike or null (cf. Sec. A.3.2) is said to be causal. It lies necessarily
inside the null cone at p (timelike vector) or along it (null vector).
     Proof. Without any loss of generality, we may assume that u is a unit vector: g(u, u) = −1.
     Let then (ei )1≤i≤n−1 be a family of n − 1 unit spacelike vectors such that (u, e1 , . . . , en−1 )
     is an orthonormal basis of Tp M . We may expand v on this basis: v = v 0 u + v i ei . We have
     necessarily v 0 ̸= 0, otherwise v = v i ei would be a spacelike vector, which is excluded by
     hypothesis. Moreover, the time-orientation of v is the same as that of u iff v 0 > 0. Since
     g(u, v) = −v 0 , this establishes (1.5a) and (1.5b).
     Proof. Without any loss of generality, we may find an orthonormal basis (eα )0≤α≤n−1 of Tp M
     such that u = e0 + e1 , where the timelike unit vector e0 is future-directed since u is. Let us
     expand v on this basis: v = v 0 e0 + v i ei , with v 0 ̸= 0 since v is not spacelike. We have then
     g(u, v) = −v 0 + v 1 . Now, since v is null or timelike, g(v, v) ≤ 0, which is equivalent to
                                                          n−1
                                                          X
                                                   0 2
                                                (v ) ≥          (v i )2 .                                  (1.7)
                                                          i=1
     This implies |v 0 | ≥ |v 1 |. If |v 0 | > |v 1 |, then v cannot be collinear with u (since this would
     imply v 0 = v 1 ) and g(u, v) = −v 0 + v 1 ̸= 0, with a sign identical to that of −v 0 . If |v 0 | = |v 1 |,
     Eq. (1.7) implies v 2 = v 3 = · · · = v n−1 = 0. We have then either v = v 0 (e0 + e1 ) = v 0 u or
     v = v 0 (e0 − e1 ). In the first case, v is collinear with u and g(u, v) = 0. In the second case,
     g(u, v) = −2v 0 ̸= 0. To summarize, the only case where g(u, v) = 0 is v being collinear with
     u. This establishes (1.6b). In all the other cases, v is not collinear with u and the sign of g(u, v)
     is that of −v 0 . Since v is future-directed if v 0 > 0 and past-directed if v 0 < 0, this establishes
     (1.6a) and (1.6c).
        Two useful properties are immediate consequences of the above lemmas. From Lemma 1.1,
     we get
                                                   1.3 Worldlines                                    17
Two null vectors are orthogonal if, and only if, they are collinear.
1.3         Worldlines
1.3.1        Definitions
In relativity, a particle is described by its spacetime extent, which is a smooth curve, L say,
and not a point. This curve is called the particle’s worldline and might be thought of as the set
of the “successive positions” occupied by the particle as “time evolves”. Except for pathological
cases (tachyons), the worldline has to be a causal curve, i.e. at any point, a tangent vector
to L is either timelike or null. This reflects the impossibility for the particle to travel faster
than light with respect to any local inertial frame. The dynamics of a simple particle (i.e. a
particle without any internal structure nor spin) is entirely described by its 4-momentum or
energy-momentum vector 1 , which is a vector field p defined along L , tangent to L at each
point and future-directed (cf. Fig. 1.3).
    One distinguishes two types of particles:
   1
       When n ̸= 4, energy-momentum vector is definitely a better name than 4-momentum!
18                                             General framework
        • the massive particles, for which L is a timelike curve, or equivalently, for which p is a
          timelike vector:
                                           g(p, p) = p · p < 0;                               (1.8)
        • the massless particles, such as the photon, for which L is a null curve, or equivalently,
          for which p is a null vector:
                                            g(p, p) = p · p = 0.                              (1.9)
∇p p = 0 , (1.11)
This implies that the worldline L must be a geodesic of the spacetime (M , g) (cf. Appendix B).
     Remark 1: The reverse is not true, i.e. having L geodesic and p tangent to L does not imply (1.11),
     but the weaker condition ∇p p = α p, with α a scalar field along L . In this case, one says that p is a
     pregeodesic vector (cf. Sec. B.2.2 in Appendix B).
        For massive particles, Eq. (1.11) can be derived from a variational principle, the action being
     simply the worldline’s length τ as given by the metric tensor:
                                                                   s
                                           Z   B          Z   λB                   
                                                                            dx dx
                                      S=           dτ =            −g         ,         dλ           (1.13)
                                             A            λA                dλ dλ
     (cf. Sec. B.4.2 for details). For photons, Eq. (1.11) can be derived from Maxwell equations within
     the geometrical optics approximation (see e.g. Box 5.6 of Ref. [418]), with the assumption that
     the photon energy-momentum vector is related to the wave 4-vector k by
                                                              p = ℏk.                                (1.14)
        2
            Unless specified, we use geometrized units, for which G = 1 and c = 1.
                                           1.3 Worldlines                                            19
                                                                                         (1.15)
                                  p                 p
                            dτ = −g(dx, dx) = −gµν dxµ dxν ,
u · u = −1. (1.17)
For a simple particle (no internal structure), the 4-momentum p is tangent to L ; it is then
necessarily collinear to u. Since both vectors are future-directed, Eqs. (1.10) and (1.17) lead to
p = mu . (1.18)
λ′ = aλ + b, (1.20)
with a and b two constants. Given that the photon energy-momentum vector p is a geodesic
vector [Eq. (1.11)], a natural choice of the affine parameter λ is that associated with p:
                                                   dx
                                            p=        .                                     (1.21)
                                                   dλ
This restricts the transformations (1.20) between the affine parameters to a = 1.
   3
       Cf. Sec. 1.2.2.
20                                      General framework
     Figure 1.4: Orthogonal decomposition of the energy-momentum vector p of a particle with respect to the
     4-velocity uO of an observer O, giving birth to the energy E and linear momentum P as measured by O.
                                           P = p + (uO · p) uO .                                     (1.24)
     The scalar square of Eq. (1.22) leads to
                                 p · p = E 2 uO · uO +2E uO · P +P · P ,                             (1.25)
                                 |{z}        | {z }      | {z }
                                  −m2           −1              0
     where we have used Eq. (1.10) to let appear the particle’s mass m. Hence we recover Einstein’s
     relation:
                                           E 2 = m2 + P · P .                                (1.26)
                              1.4 Quantities measured by an observer                                    21
  An infinitesimal displacement dx of the particle along its worldline is related to the energy-
momentum vector p by
                                       dx = p dλ,                                          (1.27)
where λ is the affine parameter along the particle’s worldline whose tangent vector is p [cf.
Eq. (1.21) for a massless particle and Eqs. (1.16) and (1.18) with λ := τ /m for a massive particle].
Substituting (1.22) for p in (1.27), we get the orthogonal decomposition of dx with respect to
LO :
                                      dx = Edλ uO + dλ P .                                    (1.28)
O’s proper time elapsed during the particle’s displacement is the coefficient in front of uO :
dτO = Edλ and the particle’s displacement in O’s rest frame is the part orthogonal to uO :
dX = dλ P . By definition, the particle’s velocity with respect to O is
                                              dX    dλ P
                                       V :=       =      .                                     (1.29)
                                              dτO   Edλ
Hence the relation
                                            P =EV .                                            (1.30)
By combining with (1.22), we get the following orthogonal decomposition of the particle’s
4-momentum:
                                    p = E (uO + V ) .                              (1.31)
    Relations (1.26), (1.31) and (1.30) are valid for any kind of particle, massive or not. For a
massive particle, the energy-momentum vector p is related to the particle’s 4-velocity u via
(1.18). Inserting this relation into (1.23), we obtain
                                             E = Γm ,                                          (1.32)
where
                                          Γ := −uO · u                                         (1.33)
is the Lorentz factor of the particle with respect to the observer. If we depart from units with
c = 1, Eq. (1.32) becomes the famous relation E = Γmc2 . Furthermore, combining (1.30) and
(1.32) yields the familiar relation between the linear momentum and the velocity:
                                           P = Γm V .                                          (1.34)
Finally, inserting (1.32) and (1.34) into (1.26) leads to the well-known expression of the Lorentz
factor in terms of the velocity:
                                       Γ = (1 − V · V )−1/2 .                                (1.35)
If we divide Eq. (1.22) by m and use Eqs. (1.18), (1.32) and (1.34) to express respectively p/m,
E/m and P /m, we get the following orthogonal split of the particle’s 4-velocity u with respect
to observer O:
                                      u = Γ (uO + V ) .                                     (1.36)
    For a massless particle (photon), inserting (1.30) into the Einstein relation (1.26) with m = 0
yields
                                           V · V = 1.                                          (1.37)
22                                       General framework
     This means that the norm of the velocity of the massless particle with respect to O equals the
     speed of light c (= 1 in our units). For a photon associated with a monochromatic radiation,
     the wave 4-vector k admits the following orthogonal decomposition:
k = ω (uO + V ) , (1.38)
     where ω = 2πν, with ν being the radiation frequency as measured by observer O. In view of
     Eq. (1.31) with p = ℏk [Eq. (1.14)], we get the Planck-Einstein relation:
E = hν . (1.39)
G + Λ g = 8πT . (1.41)
     Remark 1: The case n = 2 has been excluded since the Einstein equation would no longer involve the
     spacetime curvature, given that the Einstein tensor G is identically zero for any metric g if n = 2 (the
     trace of Eq. (A.112) in Appendix A yields R = (R/2)g). The exclusion of n = 2 also follows by noticing
     that the Einstein-Hilbert action, which gives birth to Eq. (1.40) for n ≥ 3, is proportional to the Euler
     characteristic of M for n = 2 (by virtue of the Gauss-Bonnet theorem), the Euler characteristic being a
     topological invariant independent of g.
         By taking the trace of (1.40) with respect to g, it is easy to show that the Einstein equation
     (1.40) is equivalent to
                                                           
                                      2                1
                                  R=     Λ g + 8π T −     Tg ,                                          (1.42)
                                     n−2              n−2
   The vacuum Einstein equation with cosmological constant is Eq. (1.42) with T = 0:
                                                 2
                                          R=        Λ g.                                        (1.43)
                                                n−2
In the mathematical literature, a solution to Eq. (1.43) is called an Einstein metric. The special
case Λ = 0 is called the vacuum Einstein equation:
                                               R=0.                                             (1.44)
It thus corresponds to the vanishing of the Ricci tensor. Solutions of Eq. (1.44) are sometimes
called Ricci-flat metrics.
     Taking the covariant divergence of the Einstein equation (1.41) and invoking the contracted
Bianchi identity (A.110) leads to
                                               →
                                               −
                                          ∇· T =0,                                         (1.45)
        →
        −
where T in the type-(1, 1) tensor associated by metric duality to T [cf. Eq. (A.48)]. In index
notation, the above equation reads [cf. Eq. (A.67)]
                                            ∇µ T µα = 0.
Equation (1.45) is often referred to as the equation of energy-momentum conservation.
     where g stands for the determinant of the components (gαβ ) of the metric tensor g with respect to the
     coordinates (xα ).
     Remark 4: The second Maxwell equation can be expressed in terms of differential forms, as the first one,
     by introducing the (n − 2)-form ⋆F (n being the spacetime dimension) and the (n − 1)-form ⋆j, which
     are respectively the Hodge dual of the 2-form F and the Hodge dual of the 1-form j (the metric dual of
     the vector field j, cf. Sec. A.3.3). We shall define the Hodge dual of a p-form in Chap. 5 [Eq. (5.38)]; here,
     let us simply state the Maxwell equations in terms of it and of the exterior derivative d:
        A source-free electromagnetic field is a 2-form F that obeys the Maxwell equations (1.47)
     with j = 0:
                                                                       ↠
                                         dF = 0         and        ∇· F = 0.                                (1.51)
     The energy-momentum tensor T of a source-free electromagnetic field F is
                                                             
                                   1               1
                            Tαβ =            µ            µν
                                        Fµα F β − Fµν F gαβ ,                                               (1.52)
                                  µ0               4
Contents
         2.1   Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   25
         2.2   Black holes and null hypersurfaces . . . . . . . . . . . . . . . . . . . .         25
         2.3   Geometry of null hypersurfaces . . . . . . . . . . . . . . . . . . . . . .         28
         2.4   Evolution of the expansion . . . . . . . . . . . . . . . . . . . . . . . . .       53
2.1     Introduction
In this chapter, we shall start from a naive “definition” of a black hole, as a region of spacetime
from which no particle can escape, and we shall convince ourselves that the black hole boundary
— the so-called event horizon — must be a null hypersurface (Sec. 2.2). We shall then study the
properties of these hypersurfaces (Secs. 2.3 and 2.4). The precise mathematical definition of a
black hole will be given in Chap. 4.
There are essentially two features in this definition: localization and inescapability. Let us for a
moment focus on the latter. It implies the existence of a boundary, which no particle emitted
26               The concept of black hole 1: Horizons as null hypersurfaces
Figure 2.1: The three causal types of hypersurfaces: spacelike (left), timelike (middle) and null (right).
     in the black hole region can cross. This boundary is called the event horizon and is quite
     often referred to simply as the horizon. It is a one-way membrane, in the sense that it can be
     crossed from the black hole “exterior” towards the black hole “interior”, but not in the reverse
     way. The one-way membrane must be a hypersurface of the spacetime manifold M , for it has
     to divide M in two regions: the interior (the black hole itself) and the exterior region. Let us
     recall that a hypersurface is an embedded submanifold of M of codimension 1 (cf. Sec. A.2.7
     in Appendix A).
         • spacelike iff g|Σ is positive definite, i.e. iff sign g|Σ = (+, · · · , +) (n − 1 plus signs), i.e.
           iff (Σ, g|Σ ) is a Riemannian manifold;
         • timelike iff g|Σ is a Lorentzian metric, i.e. iff sign g|Σ = (−, +, · · · , +) (1 minus sign
           and n − 2 plus signs), i.e. iff (Σ, g|Σ ) is a Lorentzian manifold;
• null iff g|Σ is degenerate1 i.e. iff sign g|Σ = (0, +, · · · , +) (1 zero and n − 2 plus signs).
     All these definitions are local, i.e. apply to a point p ∈ Σ. Of course, it may happen that Σ has
     not the same causal type among all its points.
          The causal type of the hypersurface Σ can also be deduced from any normal vector2 n to it
     (cf. Fig. 2.1):
         • Σ spacelike ⇐⇒ n timelike;
        1
           Cf. Sec. A.3.1 in Appendix A for the definition of a degenerate bilinear form; the degeneracy implies that the
     bilinear form g|Σ is not, strictly speaking, a metric on Σ.
         2
           The definition of a vector normal to a hypersurface is recalled in Sec. A.3.5 of Appendix A.
                                  2.2 Black holes and null hypersurfaces                                    27
1 2
Figure 2.2: A timelike hypersurface is a two-way membrane: L1→2 is a timelike worldline from Region 1 to
Region 2, while L2→1 is a timelike worldline from Region 2 to Region 1.
                                           1
Figure 2.3: A spacelike hypersurface is a one-way membrane: L1→2 is a timelike worldline from Region 1 to
Region 2, while there is no timelike or null worldline from Region 2 to Region 1.
• Σ timelike ⇐⇒ n spacelike;
• Σ null ⇐⇒ n null.
2 1
     Figure 2.4: A null hypersurface is a one-way membrane: L1→2 is a timelike worldline from Region 1 to
     Region 2, while there is no timelike or null worldline from Region 2 to Region 1.
     not going from Region 1 to Region 2 must stay on the hypersurface; an example of such null
     worldline is the one depicted in Fig. 2.4 as the thin black line tangent to the normal n.
         The limit case between two-way membranes (timelike hypersurfaces) and one-way ones
     being null hypersurfaces, it is quite natural to select the latter ones for the black hole boundary,
     rather than spacelike hypersurfaces. This choice will be fully justified in Chap. 4, where we
     shall see that the generic definition of a black hole implies that its boundary (the event horizon)
     is a null hypersurface as soon as it is smooth (Property 4.11). Note however that in Chap. 18,
     we shall see that spacelike hypersurfaces, called dynamical horizons, are involved in quasi-local
     approaches to black holes.
Figure 2.5: Null hyperplane H of equation t − x = 0 in Minkowski spacetime. The dimension along z has
been suppressed, so that H is pictured as a 2-plane.
Example 1 (null hyperplane): A very simple example of null hypersurface is a null hyperplane of the
4-dimensional Minkowski spacetime. The Minkowski spacetime is defined by M = R4 with g being a
flat Lorentzian metric. Natural coordinates are Minkowskian coordinates (t, x, y, z), i.e. coordinates
with respect to which the metric components are gαβ = diag(−1, 1, 1, 1). The scalar field
u(t, x, y, z) = t − x (2.3)
Example 2 (light cone): Another simple example of null hypersurface, still in the 4-dimensional
Minkowski spacetime, is the future sheet H of a light cone, also called future light cone. Note that we
have to take out the cone apex from H , in order to have a regular hypersurface. In the Minkowskian
coordinates (t, x, y, z), the choice of the “retarded time”
                                                        p
                                     u(t, x, y, z) = t − x2 + y 2 + z 2                            (2.4)
where m is a positive constant. We shall see in Chap. 6 that (M , g) is actually a part of Schwarzschild
spacetime, described in coordinates different from the standard Schwarzschild-Droste ones, (t̄, r, θ, φ)
   3
    Formula (2.5) follows the notation introduced in Eq. (1.2); see Sec. A.3.1 in Appendix A for an extended
discussion, in particular Eq. (A.39). When reading the metric components (gαβ ) from (2.5), keep in mind that
cross terms involve a factor 2: the 4m/r coefficient of dt dr implies that gtr = 2m/r.
30                    The concept of black hole 1: Horizons as null hypersurfaces
     Figure 2.6: Future sheet H of the light cone of equation t −     x2 + y 2 + z 2 = 0 in Minkowski spacetime.
                                                                        p
The dimension along z has been suppressed, so that H looks 2-dimensional, whereas it is actually 3-dimensional.
     say, the link between the two being t = t̄ + 2m ln |r/(2m) − 1|. The present coordinates are called the
     ingoing Eddington-Finkelstein coordinates and have the advantage over the standard ones to be
     regular on the event horizon, which is located at r = 2m. Indeed, the metric components (2.5) remain
     finite when r → 2m, as do those of the inverse metric, which are
                                                                             
                                          − 1 + 2m         2m
                                                     
                                                   r        r    0      0
                                                                             
                                               2m
                                                        1 − 2m  0      0     
                                 αβ
                                g =            r             r
                                                                              .                        (2.6)
                                                                             
                                                                  1
                                      
                                               0           0    r 2    0     
                                                                              
                                                                         1
                                                0           0    0 r2 sin2 θ
ℓ · ℓ = 0. (2.8)
Figure 2.7: Schwarzschild horizon H introduced in Example 3; The figure is drawn for θ = π/2 and is based
on coordinates (t, x, y) related to the ingoing Eddington-Finkelstein coordinates (t, r, θ, φ) by x = r cos φ and
y = r sin φ.
square equal to 1 or −1). It follows that there is no unique choice of ℓ. At this stage, any rescaling
ℓ 7→ ℓ′ = αℓ, with α some strictly positive (to preserve the future orientation of ℓ) scalar field on H ,
yields a normal vector field ℓ′ as valid as ℓ.
    The null normal vector field ℓ is a priori defined on H only and not at points p ̸∈ H .
However, it is worth to consider ℓ as a vector field not confined to H but defined in some
open subset of M around H . In particular this would permit to define the spacetime covariant
derivative ∇ℓ, which is not possible if the support of ℓ is restricted to H . Following Carter
[102], a simple way to achieve this is to consider not only a single null hypersurface H , but
a foliation of M (in the vicinity of H ) by a family of null hypersurfaces, such that H is an
element of this family. Without any loss of generality, we may select the value of the scalar
field u defining H to label these hypersurfaces and denote the family by (Hu ). The null
hypersurface H is then nothing but the element H = Hu=0 of this family [Eq. (2.1)]. The
vector field ℓ can then be viewed as being defined in the part of M foliated by (Hu ), such that
at each point in this region, ℓ is null and normal to Hu for some value of u.
Example 4: The scalar field u introduced in Example 1 (null hyperplane) does define a family of null
hypersurfaces (Hu ). A counter-example would be u(t, x, y, z) = (t − x)(1 + x2 ), since u = a does not
define a null hypersurface except for a = 0. Similarly, the scalar fields u of Example 2 (light cone) and
Example 3 (Schwarzschild horizon) do define a family of null hypersurfaces (Hu ). In the latter example,
this would not have been the case for the simpler choice u(t, r, θ, φ) = r − 2m. Some of these null
hypersurfaces are represented in Fig. 2.8
   Obviously the family (Hu ) is nonunique but all geometrical quantities that we shall intro-
duce hereafter do not depend upon the choice of the foliation Hu once they are evaluated at
H.
32               The concept of black hole 1: Horizons as null hypersurfaces
15
                                                            u=0   u= −1       u= −2
                                              10
                                        t/m
                                               0 u=1
u=2
-5
                                              -10
                                                    0   2     4    6      8   10 12 14
                                                                       r/m
Figure 2.8: Hypersurfaces Hu defined by u = const for the Schwarzschild horizon (Example 3).
                                                                                ∂u
                                              ∇α u = g αµ ∇µ u = g αµ               .                     (2.10)
                                                                                ∂xµ
                                 →
                                 −
     Property (2.9) means that ∇u is a normal vector field to H . By uniqueness of the normal
     direction to a hypersurface, it must then be collinear to ℓ. Therefore, there must exist some
     scalar field ρ such that
                                                         →
                                                         −
                                               ℓ = −eρ ∇u .                                      (2.11)
                                                    →
                                                    −
     We have chosen the coefficient linking ℓ and ∇u to be strictly negative (minus an exponential).
     This is always possible by a suitable choice of the scalar field u. The minus sign ensures that in
     the case of u increasing toward the future, ℓ is future-directed, as the following example shows:
     Example 5 (null hyperplane): We deduce from expression (2.3) for u in Example 1 that du = dt − dx.
                                                             →
                                                             −
     The gradient vector field obtained by metric duality is ∇u = −∂t − ∂x . Choosing for simplicity ρ = 0,
     formula (2.11) yields
                                                  ℓ = ∂t + ∂x .                                       (2.12)
     This vector field is depicted in Fig. 2.5.
                                   2.3 Geometry of null hypersurfaces                                        33
Example 6 (light cone): Regarding Example 2, we have, given expression (2.4) for u,
                            x      y      z                   p
                 du = dt − dx − dy − dz, with r := x2 + y 2 + z 2 .
                            r      r      r
Choosing for simplicity ρ = 0 in (2.11), we get the normal
                                                  x     y    z
                                       ℓ = ∂t +     ∂x + ∂y + ∂z .                                  (2.13)
                                                  r     r    r
This vector field is depicted in Fig. 2.6.
Example 7 (Schwarzschild horizon): We deduce from expression (2.7) for u in Example 3 that
                        1 (r−t)/(4m) h        r              r  i
                 du =     e           − 1−         dt − 1 +          dr .
                       4m                     2m              2m
The corresponding gradient vector field, is computed from (2.10) via expression (2.6) for g αµ :
                    →
                    −       1 (r−t)/(4m) h           r              r  i
                    ∇u =       e              − 1+        ∂t + 1 −         ∂r .
                           4m                       2m                2m
This time, we do not chose ρ = 0 but rather select ρ so that ℓt = 1:
                               1              t−r               r 
                      eρ = − t ⇐⇒ ρ =                − ln 1 +        + ln(4m).                      (2.14)
                             ∇u                4m               2m
Equation (2.11) leads then to
                                                     r − 2m
                                             ℓ = ∂t +        ∂r .                                   (2.15)
                                                     r + 2m
Given the metric (2.5), we check that g(ℓ, ℓ) = 0. Since ℓ ̸= 0, this proves that all hypersurfaces Hu ,
and in particular H , are null. The vector field ℓ is depicted on H in Fig. 2.7 and in all M in Fig. 2.11.
     Geodesic generators
     Let us contract the Frobenius identity (2.16) with ℓ:
                                   ℓµ ∇µ ℓα − ℓµ ∇α ℓµ = ℓµ ∇µ ρ ℓα − ℓµ ℓµ ∇α ρ.                                (2.18)
                                                                      |{z}
                                                                               0
        5
          A vector field that obeys the weaker condition (2.22), with κ possibly nonzero, is called a pregeodesic vector
     field, cf. Sec. B.2.2.
                                   2.3 Geometry of null hypersurfaces                                             35
  Any null hypersurface H is ruled by a family of null geodesics, called the null geodesic
  generators of H , and each vector field ℓ normal to H is tangent to these null geodesics.
Remark 2: The above result is not trivial: while it is obvious that the field lines of the normal vector
field ℓ are null curves that are tangent to H , one must keep in mind that not all null curves are null
geodesics. For instance, in Minkowski spacetime, the helix defined in terms of some Minkowskian
coordinates (xα ) = (t, x, y, z) by the parametric equation xα (λ) = (λ, cos λ, sin λ, 0) is a null curve, i.e.
it has a null tangent vector at each point, but it is not a geodesic, given that all geodesics of Minkowski
spacetime are straight lines.
    As a by-product of Eq. (2.25), we get the behavior of the non-affinity coefficient under a
rescaling of the null normal:
ℓ′ = αℓ =⇒ κ′ = ακ + ∇ℓ α . (2.26)
Example 8 (null hyperplane): It is clear on expression (2.12) for ℓ that the covariant derivative ∇ℓ
vanishes identically. In particular ∇ℓ ℓ = 0. Equation (2.22) then implies κ = 0, which is in agreement
with Eq. (2.21) and the choice ρ = 0 performed in Example 5. The null geodesic generators of H are the
straight lines defined by t = x, y = y0 and z = z0 for some constants (y0 , z0 ) ∈ R2 . They are depicted
as green lines in Fig. 2.5. Either t or x can be chosen as affine parameters of these generators.
Example 9 (light cone): From expression (2.13) for ℓ and the fact that ∇β ℓα = ∂β ℓα in the Minkowskian
coordinates (t, x, y, z), we get
                                                             
                              0    0        0           0
                                                             
                                 2   2
                            0 y +z
                         1               −xy         −xz              (α = row index;
                  α
             ∇β ℓ = 3                                                                            (2.27)
                                                              
                        r  0                                          β = column index).
                                                              
                                 −xy    x 2 + z2      −yz     
                                                             
                              0  −xz      −yz         2
                                                    x +y   2
We obtain then ℓµ ∇µ ℓα = 0. From Eq. (2.22), we conclude that κ = 0, which is in agreement with
Eq. (2.21) and the choice ρ = 0 performed in √   Example 6. The null geodesic generators of H are
the half-lines defined by x = at, y = bt, z = 1 − a2 − b2 t, with t > 0 and (a, b) ∈ R2 such that
a2 + b2 ≤ 1. They are depicted as green lines in Fig. 2.6. Since from (2.13) ∇ℓ t = 1 and κ = 0, λ = t is
an affine parameter along these null geodesic generators.
Example 10 (Schwarzschild horizon): The covariant derivative of the vector field ℓ as given by (2.15)
is (cf. Sec. D.4.1 for the computation)
                                                          
                        m        m 3r+2m
                       r2       r2 r+2m
                                            0        0
                                                          
                 m r−2m m 3r2 −4m(r+m)
                                                          
                 r2 r+2m r2 (r+2m)2        0        0               (α = row index;
           α
      ∇β ℓ =                                                                                  (2.28)
                                                           
                                                                    β = column index).
                                                           
                                         r−2m             
                       0           0    r(r+2m)     0     
                                                          
                                                   r−2m
                        0           0       0     r(r+2m)
36               The concept of black hole 1: Horizons as null hypersurfaces
     2.3.4     Cross-sections
     Let us now focus on the first aspect of the black hole definition given in Sec. 2.2.1: localization.
     This feature is crucial to distinguish a black hole boundary from other kinds of null hypersur-
     faces. For instance the interior of a future null cone in Minkowski spacetime is a region from
     which no particle may escape, but since the null cone is expanding indefinitely, particles can
     travel arbitrarily far from the center. Therefore, a null cone does not define a black hole. A
     key parameter is hence the expansion of null hypersurfaces, which we shall discuss in the next
     section, after having introduced cross-sections.
       Let H be a null hypersurface of a n-dimensional spacetime (M , g) with n ≥ 3. We define
       a cross-section of H as a submanifold S of H of dimension n − 2 such that (i) the null
       normal ℓ is nowhere tangent to S and (ii) each null geodesic generator of H intersects
       S at most once. If each null geodesic generator intersects S exactly once, we shall say
       that S is a complete cross-section (cf. Fig. 2.9).
       Notation: Indices relative to a cross-section will range from 2 to n − 1 and will be denoted
       by a Latin letter from the beginning of the alphabet: a, b, etc.
         To encompass the idea that an event horizon H delimitates some region of spacetime, we
     shall assume that its cross-sections are closed manifolds, i.e. are compact without boundary6 .
     The simplest example is the sphere, more precisely the (n − 2)-dimensional sphere Sn−2 , as
     illustrated in Fig. 2.9. This example is the one relevant for standard 4-dimensional stationary
     black holes, thanks a topology theorem that we shall discuss in Sec. 5.2.3. But for n > 4, other
     compact topologies, like that of a torus, are allowed for cross-sections of a stationary black
     hole.
         A first important property of cross-sections, independently of whether they are closed or
     not, is:
        6
          Scrictly speaking, a closed manifold is simply a compact manifold, since a topological manifold, as defined
     in Sec. A.2.1, has no boundary. Indeed, the concept of manifold with boundary requires a separate definition (cf.
     Sec. A.2.2), so that a manifold with boundary is not a manifold. Hence, we may consider the terminology closed
     manifold as stressing the no-boundary aspect of a compact manifold.
                                   2.3 Geometry of null hypersurfaces                                          37
Figure 2.9: The null hypersurface H and two (complete) cross-sections S and S ′ . The green curves represent
some null geodesic generators, with the null normal ℓ tangent to them.
  Any cross-section S of a null hypersurface is spacelike, i.e. all vectors tangent to S are
  spacelike.
  Every nonzero tangent vector v to a null hypersurface is either spacelike or null. Moreover,
  if it is null, v is tangent to a null geodesic generator, i.e. v is normal to the hypersurface.
Proof. Tangent vectors to a null hypersurface H are by definition vectors v such that g(ℓ, v) =
0, where ℓ is the normal to H . Since ℓ is null, it follows then from Property 1.3 (Sec. 1.2.2)
that v cannot be timelike. Besides, if v is null, Property 1.4 (Sec. 1.2.2) implies that it must be
collinear to ℓ.
Proof of Property 2.2. Let p ∈ S and v ∈ Tp M be a nonzero tangent vector to S . The above
lemma implies that v is either spacelike or tangent to the null geodesic generator L going
through p, but then L would be tangent to S , which is not allowed, given the definition of a
cross-section. We conclude that v is necessarily spacelike, which proves that S is a spacelike
submanifold.
Example 11 (light cone): The future sheet H of the Minkowski-spacetime light cone considered
in Examples 2, 6 and 9 has the topology R × S2 since we have excluded the cone apex from H . The
38              The concept of black hole 1: Horizons as null hypersurfaces
     complete cross-sections are then compact and it natural to choose them as the spheres defined by t = t0
     for some positive constant t0 : S = {p ∈ H , t(p) = t0 }. That S is a 2-dimensional sphere in the
     hyperplane t = t0 is clear on its equation in terms of the Minkowskian coordinates (t, x, y, z):
S : t = t0 and x2 + y 2 + z 2 = t20 ,
     which follows immediately from u = 0 [cf. Eq. (2.4)]. Moreover, this equation shows that the radius of
     the sphere is t0 .
     Example 12 (Schwarzschild horizon): The 3-dimensional cylinder H introduced in Example 3 has the
     topology R × S2 (cf. Fig. 2.7). Since it is defined by r = 2m in terms of the ingoing Eddington-Finkelstein
     coordinates (t, r, θ, φ), a natural coordinate system on H is xA = (t, θ, φ). Moreover, we have seen
     that the coordinate t is the (non-affine) parameter of the null geodesics generating H associated with
     the null normal ℓ. As in Example 11, a natural choice of cross-section is a sphere defined by t = t0 for
     some constant t0 : S = {p ∈ H , t(p) = t0 }. The equation of S in terms of the coordinates (t, r, θ, φ)
     is then
                                             S : t = t0 and r = 2m.
     Note that xa = (θ, φ) constitutes a coordinate system on S .
     Example 13 (binary black hole): Some cross-sections of the event horizon H in numerically generated
     binary black hole spacetimes are displayed in Figs. 4.21 and 4.22 of Chap. 4.
         Let us denote by q the metric induced by g on a cross-section S , i.e. the bilinear form
     defined at any point p ∈ S by
     Saying that S is spacelike (Property 2.2) is equivalent to saying that q is positive definite,
     i.e. q(v, v) ≥ 0 for all v ∈ Tp S , with q(v, v) = 0 ⇐⇒ v = 0. In other words, (S , q) is a
     Riemannian manifold (cf Sec. A.3.2 in Appendix A).
     Example 14 (Schwarzschild horizon): The metric induced by g on the cross-section S of the
     Schwarzschild horizon defined in Example 12 is readily obtained by setting t = const = t0 and
     r = const = 2m in Eq. (2.5), given that (xa ) = (θ, φ) is a coordinate system on S :
                                             Tp M = Tp S ⊕ Tp⊥ S                                       (2.32)
                                    2.3 Geometry of null hypersurfaces                                               39
Figure 2.10: The tangent space Tp S to the cross-section S and its 2-dimensional orthogonal complement
Tp⊥ S . Only the dimensionality of the latter is respected in the figure: S and Tp S are depicted as 1-dimensional
objects, while they are truly (n − 2)-dimensional ones.
Proof. As a timelike plane, Tp⊥ S has two independent null directions, which can be seen as
the two intersections of the null cone at p with the 2-plane Tp⊥ S (cf. Fig. 2.10). Let us denote
by k a future-directed null vector in the null direction of Tp⊥ S that is not along ℓ. By a proper
rescaling k 7→ αk, we may choose k so that k · ℓ = −1 (cf. Lemma 1.2). Since ℓ and k are
non-collinear vectors of Tp⊥ S and dim Tp⊥ S = 2, they constitute a basis of Tp⊥ S .
Remark 3: While ℓ is independent of the choice of the cross-section S of H through p, k does depend
on S .
40                The concept of black hole 1: Horizons as null hypersurfaces
       The bilinear form q, which has been defined only on Tp S via (2.30), can be extended to all
       vectors of Tp M by setting
                                       q := g + ℓ ⊗ k + k ⊗ ℓ ,                            (2.34)
       or, in index notation,
                                          qαβ := gαβ + ℓα kβ + kα ℓβ .                                   (2.35)
       It obeys
                              ∀(u, v) ∈ Tp M × Tp M ,            q(u, v) = q(u∥ , v ∥ ),                 (2.36)
       where u∥ and v ∥ are the tangent parts of u and v to S , which are uniquely defined by the
       direct sum (2.32):
u = u∥ + u0 ℓ + u1 k and v = v ∥ + v 0 ℓ + v 1 k.
     Example 15 (light cone): In continuation with Example 11, the null vector k orthogonal to the sphere
     S and obeying k · ℓ = −1 is
                                          1    x    y    z
                                       k = ∂t − ∂x − ∂y − ∂z .
                                          2    2r   2r   2r
     Evaluating q via (2.34), given expression (2.13) for ℓ, we get the following components of q with respect
     to the Minkowskian coordinates (xα ) = (t, x, y, z):
                                                                                 
                                               0      0          0        0
                                                   y 2 +z 2
                                                                             
                                            0
                                                      r2
                                                                − xy
                                                                  r2
                                                                        − xz
                                                                          r2 
                                                                              
                                    qαβ   =                                  .
                                           
                                                              x2 +z 2
                                            0
                                                  − xy r2      r2
                                                                        − yz
                                                                          r 2
                                                                              
                                                                              
                                                                        x2 +y 2
                                             0     − xz r2
                                                              − yz
                                                                 r2       r2
     If we consider the spherical coordinates (xα ) = (t, r, θ, φ) deduced from the Minkowskian ones via the
     standard formulas: x = r sin θ cos φ, y = r sin θ sin φ and z = r cos θ, the components of q become
                                   2.3 Geometry of null hypersurfaces                                             41
                                                                                      r
                                          2             4           6           8
Figure 2.11: Null vector fields ℓ (green) and k (red) corresponding to Example 16 (Schwarzschild horizon). The
plot is a 2-dimensional slice θ = const and φ = const of the spacetime M , with t and r labelled in units of m.
Note that since k diverges at r = 0 [cf. Eq. (2.39)], it is not represented there.
instead                                                                
                                                  0 0       0   0
                                                                       
                                            0 0 0     0                
                                    qαβ   =                            .                              (2.38)
                                                                       
                                            0 0 r2    0                
                                                                       
                                             0 0 0 r2 sin2 θ
and we recognize in qab = diag(r2 , r2 sin2 θ) the standard metric on the 2-sphere of radius r.
Example 16 (Schwarzschild horizon): For the Schwarzschild horizon case, we deduce from the metric
(2.5) and the expression (2.15) for ℓ that the null vector k obeying k · ℓ = −1 and orthogonal to the
sphere S introduced in Example 12 is
                                                              
                                         1 m              1 m
                                  k=        +      ∂t −     +      ∂r .                         (2.39)
                                         2     r          2    r
The vector field k is depicted in Fig. 2.11. We have (cf. the notebook D.4.1)
                                                                                 
                    2m − r                                   1 m              1 m
               ℓ=           dt + dr        and     k=−         +      dt −      +     dr,               (2.40)
                    2m + r                                   2    r           2   r
so that Eq. (2.34) leads to the following components of q in terms of the ingoing Eddington-Finkelstein
42                 The concept of black hole 1: Horizons as null hypersurfaces
         The pair (ℓ, k) forms a null basis of Tp⊥ S [cf. Eq. (2.33)]. One can construct from it an
     orthonormal basis (n, s) as follows:
                                                     
                            n = 1ℓ + k                ℓ = n+s
                                      2
                                                 ⇐⇒                                          (2.42)
                            s = 1 ℓ − k.              k = 1 (n − s) .
                                              2                            2
     In other words, (n, s) is an orthonormal basis of the Lorentzian plane (Tp⊥ S , g).
         If we substitute (2.42) for ℓ and k in Eq. (2.34), we get
q = g + n ⊗ n − s ⊗ s. (2.44)
       The metric duala of the bilinear form q defined by Eq. (2.34), i.e. the tensor of type (1, 1)
       defined by
                                      →
                                      −q := Id + ℓ ⊗ k + k ⊗ ℓ ,                              (2.45)
       or, in index notation,
                                              q αβ := δ αβ + ℓα kβ + k α ℓβ ,                 (2.46)
       is nothing but the orthogonal projector onto the cross-section S :
                                              ∀v ∈ Tp M ,      →
                                                               −
                                                               q (v) = v ∥ .                  (2.47)
          a
              See Eq. (A.48) of Appendix A for the arrow notation.
Figure 2.12: Lie dragging of the surface S along ℓ by the small parameter ε. S is drawn as a 1-dimensional
submanifold, while it is actually a (n − 2)-dimensional one, n being the spacetime dimension.
                                                       1 δAε − δA
                                          θ(ℓ) := lim             .                                       (2.49)
                                                   ε→0 ε    δA
  In the above formula, δAε stands for the area of the surface element δSε ⊂ Sε that is
  obtained from δS by Lie dragging along ℓ by the parameter ε (cf. Fig. 2.12).
     a
      We are using the words area and surface even if n − 2 ̸= 2, i.e. even if n ̸= 4, being aware that for n = 3
  the words length and line would be more appropriate, as well as volume for n ≥ 5.
Remark 4: The reader may wonder why the expansion is not denoted by something like θ(ℓ) (S ), since
its definition depends explicitly on S . We shall show below that, because H is a null hypersurface,
θ(ℓ) is actually independent of the choice of the cross-section S .
    In formula (2.49), the area δA is measured with respect to the metric q of S and similarly
the area δAε is measured by the Riemannian metric induced by g on Sε . For instance, if the
surface element δS ⊂ S is a (n−2)-dimensional parallelogram delimited by some infinitesimal
44               The concept of black hole 1: Horizons as null hypersurfaces
       The Levi-Civita tensor Sϵ associated with the metric q of S is expressible in terms of the
       spacetime Levi-Civita tensor ϵ as
                                          S
                                           ϵ = ϵ(n, s, . . .) = ϵ(k, ℓ, . . .) ,                            (2.51)
       where (n, s) is the orthonormal basis of Tp⊥ S associated to the null basis (ℓ, k) by
       Eq. (2.42).
where we have used the multilinearity of Sϵ and Eq. (A.52) for the components of the Levi-
Civita tensor Sϵ, q standing for the determinant of the metric q with respect to the coordinates
(xa ). By the very definition of the Lie dragging, the surface element δSε on Sε is defined by
the same values of the coordinates (xa ) as δS. In particular, the coordinate increments dx2 , . . .,
dxn−1 on Sε take the same values as on S . Therefore, the area of δSε is
                                                                                           (2.54)
                                          p
                                   δAε = q(ε) dx2 · · · dxn−1 ,
where q(ε) stands for the determinant of the components with respect to the coordinates (xa )
of the metric qε induced by g on Sε . Since Sε is spacelike (cf. above), qε is positive definite, so
that q(ε) > 0. In view of (2.53)-(2.54), the definition (2.49) of the expansion of S along ℓ can
be rewritten as                                p          p
                                             1 q(ε) − q(0)
                                 θ(ℓ) = lim          p           .
                                         ε→0 ε         q(0)
We recognize the derivative of the function ε 7→ ln q(ε) = 1/2 ln q(ε) at ε = 0:
                                                       p
                                                    1 d
                                           θ(ℓ) =        ln q.                                   (2.55)
                                                    2 dε
Given that Sε is deduced from S by Lie dragging along ℓ and ε is the value of the coordinate
v associated with ℓ [cf. Eq. (2.52)], we may rewrite this formula as the Lie derivative of ln q
along ℓ:
                                              1
                                        θ(ℓ) = Lℓ ln q .                                 (2.56)
                                              2
Example 17 (light cone): For the light cone in Minkowski spacetime, it is easy to evaluate θ(ℓ) by
means of the spherical coordinates introduced in Example 15, since these coordinates are adapted
to the surface S , the metric of S being q = r2 dθ2 + r2 sin2 θ dφ2 [cf. Eq. (2.38)]. We have then
q = det(qab ) = r4 sin2 θ. Moreover, the coordinate v can be chosen as v = t − t0 since t is an (affine)
parameter associated with ℓ (cf. Example 9). Given that t = r on H , we have ε = v = r − t0 , so that
Eq. (2.55) yields
                                    1 d           1 d
                             θ(ℓ) =       ln q =       (4 ln r + 2 ln sin θ) ,
                                    2 dr          2 dr
i.e.
                                                       2
                                                θ(ℓ) = .                                          (2.57)
                                                       r
Example 18 (Schwarzschild horizon): As above, we have q = r4 sin2 θ [cf. Eq. (2.5)], so that Eq. (2.56)
yields
          1         1   ∂         ∂                r − 2m ∂                 2 r − 2m
    θ(ℓ) = Lℓ ln q = ℓµ µ ln q =    ln(r2 sin θ) +           ln(r2 sin θ) =          ,            (2.58)
          2         2 ∂x         ∂t
                                 |     {z      }   r + 2m ∂r                r r + 2m
                                              0
where we have used the components ℓµ read on (2.15). The above expression is valid for any hypersurface
of the family Hu . For the case of the Schwarzschild horizon, r = 2m and Eq. (2.58) yields a vanishing
expansion:
                                               θ(ℓ) = 0.                                          (2.59)
46              The concept of black hole 1: Horizons as null hypersurfaces
     Note that for large r, Eq. (2.58) yields θ(ℓ) ∼ 2/r, i.e. we recover the flat spacetime result (2.57), which
     is consistent with the fact that for large r, Hu is close to a Minkowskian light cone (cf. Fig. 2.8). Note
     also that Eq. (2.58) yields θ(ℓ) < 0 for r < 2m and θ(ℓ) > 0 for r > 2m. These expansion values are
     in agreement with what can be inferred from Fig. 2.11, since r is directly related to the area of the
     cross-sections of H : A = 4πr2 from Eq. (2.41) and ℓ points towards decreasing (resp. increasing) values
     of r for r < 2m (resp. r > 2m).
         Using the general law of variation of a determinant, as given by Eq. (A.73) in Appendix A,
     Eq. (2.56) can be rewritten as
                                                    1
                                                      tr Q−1 × Lℓ Q ,
                                                                   
                                           θ(ℓ) =
                                                    2
     where Q is the matrix representing the components of q with respect to the coordinates
     (xa ) = (x2 , . . . , xn−1 ): Q = (qab ). Since Q−1 = (q ab ), there comes
                                                         1 ab
                                                θ(ℓ) =     q Lℓ qab .                                     (2.60)
                                                         2
     The Lie derivative along ℓ of the metric q of the cross-section S that appears in this formula is
     defined as follows. As in Sec. A.4.2 of Appendix A, let us denote by Φε the smooth map S → H
     that corresponds to the displacement of points of S by some infinitesimal quantity ε along
     ℓ. Using the notations of Fig. 2.12, we have then pε = Φε (p), qε = Φε (q) and Sε = Φε (S ).
     The Lie derivative along ℓ of q is then the field Lℓ q of bilinear forms on S defined by the
     following action on any pair of vectors (u, v) tangent to S at the same point p:
                                                    1
                              Lℓ q (u, v) := lim      [qε (Φε∗ u, Φε∗ v) − q(u, v)] ,                     (2.61)
                                                ε→0 ε
          One may wonder about the link between the Lie derivative Lℓ q defined by Eq. (2.61), which
     is a tensor field on S , and the Lie derivative along ℓ of the spacetime extension q introduced
     by Eq. (2.34). For the sake of clarity, let us denote here the latter by q̄. More precisely, we
     may
     S consider that q̄ is a field defined in some neighborhood of the portion of H sliced            by
        ε S ε via Eq. (2.34), with k defined at each point  p ∈ S ε as the  unique null vector of T ⊥
                                                                                                    p S ε
     obeying ℓ · k = −1. Let u and v be vector fields on H that are tangent to the cross-sections
     Sε . Applying the bilinear form Lℓ q̄ to them and using the Leibniz rule to expand Lℓ [q̄(u, v)]
     yields
                          Lℓ q̄ (u, v) = Lℓ [q̄(u, v)] − q̄ (Lℓ u, v) − q̄ (u, Lℓ v) .             (2.63)
                                  2.3 Geometry of null hypersurfaces                                          47
Now, since u and v are tangent to Sε , we may write q̄(u, v) = q(u, v). Moreover, by the very
definition of the Lie derivative of a vector field (cf. Sec. A.4.2) and the fact that the cross-sections
Sε are Lie-dragged along ℓ, the vectors Lℓ u and Lℓ v are also tangent to Sε . Therefore, we
have q̄ (Lℓ u, v) = q (Lℓ u, v) and q̄ (u, Lℓ v) = q (u, Lℓ v) as well. Thus, we may rewrite
(2.63) as
                    Lℓ q̄ (u, v) = Lℓ [q(u, v)] − q (Lℓ u, v) − q (u, Lℓ v) .
The right-hand side is identical to what would be obtained by expressing Lℓ q (u, v) via the
Leibniz rule. Hence we conclude that Lℓ q̄ (u, v) = Lℓ q (u, v). Since this identity holds
for a pair (u, v) of vectors tangent to Sε , we may express it for any pair of vectors, i.e. not
necessarily tangent to Sε by introducing the orthogonal projector → −q onto Sε [cf. Eq. (2.45)]:
                            Lℓ q̄ (→
                                   −
                                   q (u), →
                                          −
                                          q (v)) = Lℓ q (→
                                                         −
                                                         q (u), →
                                                                −
                                                                q (v)),                             (2.64)
where the second equality follows from q̄ ab = q ab . Hence we may rewrite (2.60) as
                                                     1 µν
                                           θ(ℓ) =      q Lℓ qµν .                                   (2.65)
                                                     2
Note that we have dropped the bar over q, i.e. we revert to the previous notation.
   Substituting (2.34) for qµν , and using the Leibniz rule, we get
                1 µν                                                       1
       θ(ℓ) =     q (Lℓ gµν + Lℓ ℓµ kν + ℓµ Lℓ kν + Lℓ kµ ℓν + kµ Lℓ ℓν ) = q µν Lℓ gµν ,
                2                                                          2
where the last equality follows from q µν ℓµ = 0 and q µν kµ = 0 [Eq. (2.48)]. Using the Killing
expression (A.87) of the Lie derivative of g: Lℓ gµν = ∇µ ℓν + ∇ν ℓµ , and the symmetry of q µν ,
we arrive at
                                        θ(ℓ) = q µν ∇µ ℓν .                               (2.66)
    We can transform this relation further by expressing q µν via (2.34):
             θ(ℓ) = (g µν + ℓµ k ν + k µ ℓν ) ∇µ ℓν = ∇µ ℓµ + k ν ℓµ ∇µ ℓν +k µ ℓν ∇µ ℓν
                                                                  | {z }
                                                                                 κℓν
                                        1
                    = ∇µ ℓµ + κ k ν ℓν + k µ ∇µ (ℓν ℓν ) = ∇µ ℓµ − κ,
                                |{z} 2 | {z }
                                     −1                0
where we have used respectively the properties (2.22), (2.19) and (2.33). Denoting the divergence
of ℓ by ∇ · ℓ = ∇µ ℓµ , we may write
                                            θ(ℓ) = ∇ · ℓ − κ .                                      (2.67)
48              The concept of black hole 1: Horizons as null hypersurfaces
     Remark 6: Contrary to θ(ℓ) or κ, the quantity ∇ · ℓ depends a priori on the extension of ℓ outside H
     (cf. the discussion in Sec. 2.3.2). For Eq. (2.67) to hold, we have supposed that ℓ remains null outside H ,
     so that k µ ∇µ (ℓν ℓν ), which is a derivative in a direction transverse to H , could be set to zero in the
     computation leading to (2.67).
     Example 19 (light cone): ∇ · ℓ is easily computed by taking the trace of (2.27) and we have κ = 0 (cf.
     Example 9), so that (2.67) yields
                                                    2(x2 + y 2 + z 2 )  2
                                           θ(ℓ) =                      = .
                                                          r3            r
     Hence we recover the result obtained in Example 17.
Example 20 (Schwarzschild horizon): Here also, ∇ · ℓ is easily computed by taking the trace of (2.28):
Given the value κ = 4m/(r + 2m)2 found in Example 10, formula (2.67) leads to
a point of S infinitely close to p and let q ′ be the point of S ′ located on the same null geodesic
                     −
                     →
generator as q, i.e. qq ′ = εℓ, with ε infinitely small. Let dx (resp. dx′ ) be the infinitesimal
vector connecting p to q (resp. p to q ′ ). We have then dx′ = dx + εℓ, the scalar square of
which is
                            dx′ · dx′ = dx · dx + 2ε dx             2
                                                         | {z· ℓ} +ε |{z}
                                                                      ℓ · ℓ,
                                                          0         0
where we have used the fact that ℓ is normal to any tangent vector to H , such as dx and ℓ
itself. Hence dx′ · dx′ = dx · dx. In other words, the lengths of all segments from p do not
depend on the cross-section in which they are taken, provided their second end lies on the
same null geodesic generator of H . It follows that all infinitesimal surfaces δS that (i) contain
p and (ii) are enclosed in a tube made of null geodesic generators have the same area δA. Hence
the expansion θ(ℓ) at p does not depend on the choice of δS. We conclude:
  The expansion θ(ℓ) at a point p of a null hypersurface H does not depend on the choice of
  the cross-section through p. It depends only on the null normal ℓ of H . Accordingly, from
  now on, we shall call θ(ℓ) the expansion of the null hypersurface H along ℓ.
  Two null normals ℓ and ℓ′ to a null hypersurface H are necessarily collinear: ℓ′ = αℓ,
  where α is a non-vanishing scalar field on H . We have then the following scaling law:
Proof. This follows immediately from expression (2.56) for θ(ℓ) , given that the metric q is
independent of ℓ and Lαℓ ln q = αLℓ ln q. One can also get the result from Eq. (2.66): θ(ℓ′ ) =
q µν ∇µ (αℓν ) = αq µν ∇µ ℓν = αθ(ℓ) , since q µν ℓν = 0 [Eq. (2.48)].
Remark 7: The reader may check that the rescaling laws (2.26) and (2.68) for respectively κ and θ(ℓ)
are compatible with expression (2.67) for θ(ℓ) , given that ∇ · ℓ′ = α∇ · ℓ + ∇ℓ α.
Let us gather all the expressions of the expansion θ(ℓ) obtained so far:
                  1 δAε − δA  1         1
       θ(ℓ) = lim            = Lℓ ln q = q µν Lℓ qµν = q µν ∇µ ℓν = ∇ · ℓ − κ ,               (2.69)
              ε→0 ε    δA     2         2
with the reminder that the last equality is valid insofar as the vector field ℓ is null in some
entire open neighborhood of H (and not only on H ), as stressed in Remark 6.
50             The concept of black hole 1: Horizons as null hypersurfaces
                   ∀(u, v) ∈ Tp M × Tp M , →    −q ∗ Lℓ q (u, v) := Lℓ q →
                                                                         −
                                                                         q (u), →
                                                                                −              (2.71)
                                                                                      
                                                                                q (v) .
     Since q is symmetric, it is clear from the above definition that Θ is a field of symmetric bilinear
     forms. Thanks to the identity (2.64), we can use for q the spacetime extension (2.34) and write
     the index-notation version of the definition (2.70) as
                                               1
                                          Θαβ = q µα q νβ Lℓ qµν .                                (2.72)
                                               2
     Substituting gµν + ℓµ kν + kµ ℓν for qµν [Eq. (2.34)], we get
              1                                                                 1
         Θαβ = q µα q νβ (Lℓ gµν + kν Lℓ ℓµ + ℓµ Lℓ kν + ℓν Lℓ kµ + kµ Lℓ ℓν ) = q µα q νβ Lℓ gµν ,
              2                                                                 2
     where the last equality follows from q µα ℓµ = 0 and q µα kµ = 0 [Eq. (2.48)]. Now, using the
     Killing expression (A.87) and the Frobenius identity (2.16), we may write
Lℓ gµν = ∇µ ℓν + ∇ν ℓµ = 2∇µ ℓν + ∇ν ρ ℓµ − ∇µ ρ ℓν .
Θαβ = q µα q νβ ∇µ ℓν . (2.73)
                         Θαβ = (δ µα + ℓµ kα + k µ ℓα ) δ νβ + ℓν kβ + k ν ℓβ ∇µ ℓν .
                                                                             
∇α ℓβ = Θαβ + ωα ℓβ − ℓα k µ ∇µ ℓβ , (2.74)
ωα := −k µ ∇ν ℓµ Πνα = −k µ ∇α ℓµ − k µ k ν ∇µ ℓν ℓα . (2.75)
                                            Παβ = δ αβ + k α ℓβ .                                 (2.76)
                                 2.3 Geometry of null hypersurfaces                                       51
Indeed, for any vector v tangent to H , one has Π(v) = v since Παµ v µ = v α + k α ℓµ v µ with
ℓµ v µ = 0, while Π(k) = 0 since Παµ k µ = k α + k α ℓµ k µ = k α − k α = 0. Consequently, the
action of ω on tangent vectors to H takes a simple form:
∀v ∈ Tp H , ⟨ω, v⟩ = −k · ∇v ℓ. (2.77)
⟨ω, ℓ⟩ = κ . (2.78)
Remark 8: Thanks to the projector Π involved in its definition, the 1-form ω does not depend on
the extension of the vector field ℓ away from H . The same property holds for Θ. On the contrary,
the tensor field ∇ℓ, which appears in the left-hand side of formula (2.74) and in the last term of its
right-hand side, depends on the extension of ℓ away from H .
     By comparing (2.65) and (2.72), we notice that the trace of Θ is nothing but the expansion
θ(ℓ) :
                               θ(ℓ) = g µν Θµν = q µν Θµν = Θµµ .                         (2.79)
                                                     1
                                      σ := Θ −          θ(ℓ) q ,                                 (2.80)
                                                    n−2
Remark 9: The 1/(n − 2) factor arises from the trace of q, which is n − 2. This follows immediately
from q being a metric tensor on the (n − 2)-dimensional manifold S ; this can also be recovered from
the spacetime extension (2.34) of q: q µµ = δ µµ + 2ℓµ k µ , with δ µµ = n and ℓµ k µ = −1.
   By construction, we have σ µµ = g µν σµν = q µν σµν = 0. Note that, as q, the tensor fields Θ
and σ are tangent to S , in the sense that
                                                 ∂        ∂
                                      Lℓ qab =      qab =    qab ,
                                                 ∂t       ∂r
52              The concept of black hole 1: Horizons as null hypersurfaces
     where the second equality follows from t = r on H . Given that qab = diag(r2 , r2 sin2 θ) [cf. Eq. (2.38)],
     we obtain                                                
                                                   2r        0
                                      Lℓ qab =                     = 2 qab .
                                                    0    2r sin2 θ     r
     Example 22 (Schwarzschild horizon): The Lie derivative of q, as given by Eq. (2.41), along ℓ is (cf.
     Appendix D for the computation):
                                          r − 2m                  2 r − 2m
                              Lℓ q = 2r          dθ2 + sin2 θ dφ2 =          q.
                                          r + 2m                    r r + 2m
     Since →
           −
           q ∗ q = q, Eq. (2.70) yields
                                                         r − 2m
                                                Θ=                q.
                                                        r(r + 2m)
     This formula is valid for any hypersurface of the Hu family. For the specific case of the Schwarzschild
     horizon H , r = 2m and it reduces to
                                                     Θ = 0.                                            (2.83)
       Property 2.11: rescaling of the deformation rate and shear upon a change of null
       normal
       Two null normals ℓ and ℓ′ to a null hypersurface H are necessarily collinear: ℓ′ = αℓ,
       where α is a non-vanishing scalar field on H . The deformation rates Θ and Θ′ and shear
       tensors σ and σ ′ of a given cross-section S of H along respectively ℓ and ℓ′ obey the
       following scaling law:
     Remark 10: The scaling law (2.84) shows explicitely that the deformation rate Θ and shear tensor
     σ depend on the choice of the null normal ℓ to H . Both tensors also depend on the choice of the
     cross-section S of H , contrary to the expansion θ(ℓ) (Property 2.9).
                                     2.4 Evolution of the expansion                                   53
Expanding the left-hand side and using again Eqs. (2.67) and (2.74) results in
      ∇µ Θµα + ℓµ ∇µ ωα − ∇α θ(ℓ) + κ + θ(ℓ) + κ ωα − Θαµ k ν ∇ν ℓµ
                                                      
Now, using Θµν ℓν = 0 [Eq. (2.82)] and Eq. (2.74) to express ∇µ ℓν , we can write
  ℓν ∇µ Θµν = ∇µ (Θµν ℓν ) − Θµν ∇µ ℓν = −Θµν ∇µ ℓν = −Θµν (Θµν + ωµ ℓν − ℓµ k σ ∇σ ℓν )
                  | {z }
                          0
              = −Θµν Θµν .
On the other side, thanks to Eqs. (2.78) and (2.22), we have
                    ℓν ℓµ ∇µ ων = ℓµ ∇µ (ων ℓν ) − ων ℓµ ∇µ ℓν = ℓµ ∇µ κ − κ2 .
                                         |{z}         | {z }
                                             κ           κℓν
                                                     1
                              ∇ℓ θ(ℓ) = κθ(ℓ) −         θ2 − σab σ ab − R(ℓ, ℓ) .                      (2.88)
                                                   n − 2 (ℓ)
If ℓ is future-directed, this is an evolution equation for θ(ℓ) along the null generators of H .
     Remark 1: Actually Eq. (2.88) is a particular case of what is generally called the null Raychaudhuri
     equation, namely the case where the vorticity of the vector field ℓ vanishes. This appends because ℓ is
     hypersurface-orthogonal, being normal to H . The general case regards a generic congruence of null
     geodesics, i.e. a family of null geodesics, one, and exactly one, through each point of M . A null vector
     field ℓ tangent to the geodesics of the congruence has a priori some vorticity w and a term +wab wab
     must be added to the right-hand side of Eq. (2.88) (see e.g. Eq. (4.35) of Ref. [266]).
     Remark 2: Since θ(ℓ) is a scalar field on H , ∇ℓ θ(ℓ) can be replaced by the Lie derivative Lℓ θ(ℓ) in the
     left-hand side of the Raychaudhuri equation.
        If the spacetime (M , g) is ruled by general relativity, i.e. if g obeys Einstein equation (1.42),
     we may express the term involving the Ricci tensor in terms of the total energy-momentum
     tensor T :
                           2                             1
                                                                                                         (2.89)
                                                                    
             R(ℓ, ℓ) =        Λ g(ℓ, ℓ) +8π T (ℓ, ℓ) −      T g(ℓ, ℓ) = 8πT (ℓ, ℓ).
                         n − 2 | {z }                  n − 2 | {z }
                                       0                                   0
                                                    1
                             ∇ℓ θ(ℓ) = κθ(ℓ) −         θ2 − σab σ ab − 8πT (ℓ, ℓ).                       (2.90)
                                                  n − 2 (ℓ)
     Remark 3: The cosmological constant Λ does not appear in the null Raychaudhuri equation (2.90),
     despite the latter involves the Einstein equation with Λ [Eq. (1.42)].
     Remark 4: We have stressed above that the shear tensor σ does depend on the choice of a cross-section
     of H , in addition to ℓ (cf. Remark 10 on p. 52). The null Raychaudhuri equation (2.88) shows that the
     “shear square” σab σ ab does not depend on the choice of a cross-section, for all the other terms in (2.88)
     are independent of it.
     Example 23 (light cone): Let us check the null Raychaudhuri equation on the light cone in Minkowski
     spacetime. From Example 9, we have κ = 0, while from Example 21, we have σ = 0, hence σab σ ab = 0.
     Moreover, the Ricci tensor of Minkowski spacetime vanishes identically. The null Raychaudhuri equation
     (2.88) reduces then to
                                                           1 2
                                              ∇ℓ θ(ℓ) = − θ(ℓ)   ,
                                                           2
                                    2.4 Evolution of the expansion                                          55
where we have set n = 4. Now, from Example 17, we have θ(ℓ) = 2/r. Since, in the present case
∇ℓ θ(ℓ) = Lℓ θ(ℓ) = ∂θ(ℓ) /∂r = −2/r2 , we conclude that the null Raychaudhuri equation is satisfied
(as it should!).
Example 24 (Schwarzschild horizon): For the Schwarzschild horizon H , the null Raychaudhuri
equation is trivially satisfied, i.e. each of its terms vanishes identically: θ(ℓ) = 0 on H [Eq. (2.59)],
which implies ∇ℓ θ(ℓ) = 0 since ℓ is tangent to H , σ = 0 since Θ = 0 [Eq. (2.83)] and the Ricci tensor
of the metric (2.5) is zero (cf. the notebook D.4.1).
Regarding the last term in the right-hand side of Eq. (2.91), it is non-positive if the Ricci tensor
obeys the so-called null convergence condition:
If general relativity is assumed, i.e. if the Einstein equation (1.40) is fulfilled, we have R(ℓ, ℓ) =
8πT (ℓ, ℓ) [Eq. (2.89)], where T is the total energy-momentum tensor of the matter and non-
gravitational fields. Accordingly, the null convergence condition is equivalent to the so-called
null energy condition:
where the index ‘GR’ stands for general relativity. The null energy condition (2.95) is a pretty
weak physical requirement: it is satisfied by
    • vacuum: T = 0;
    • any “reasonable” matter model, such as a perfect fluid with a proper energy density ρ
      and pressure p obeying ρ + p ≥ 0 (8 );
   8
    Indeed, for the perfect-fluid energy-momentum tensor T = (ρ+p)u⊗u+pg, one has T (ℓ, ℓ) = (ρ+p)(u·ℓ)2
with (u · ℓ)2 ≥ 0.
56               The concept of black hole 1: Horizons as null hypersurfaces
     Note also that the null energy condition is implied by the so-called weak energy condition,
     which states that
                               T (u, u) ≥ 0 for any timelike vector u.                    (2.96)
     The null energy condition follows from the weak energy condition by continuity. Selecting for
     u the 4-velocity of an observer, we see that the weak energy condition has a simple physical
     interpretation: the energy density ε = T (u, u) as measured by any observer is non-negative,
     hence the name energy condition.
         If the null convergence condition holds, all the terms in the right-hand side of the reduced
     Raychaudhuri equation (2.91) are non-positive and we may conclude:
       If the null convergence condition (2.94) is fulfilled on a null hypersurface H — for general
       relativity, this is equivalent to demanding that the null energy condition (2.95) holds on H
       —, then the expansion θ(ℓ) along any null normal ℓ associated to an affine parametrization
       of H ’s null generators cannot increase along ℓ, i.e. it obeys
∇ℓ θ(ℓ) ≤ 0. (2.97)
        This property justifies the name convergence condition given to R(ℓ, ℓ) ≥ 0: if θ(ℓ) < 0 in
     some part of H , so that nearby null generators are converging, Property 2.13 shows that the
     null generators cannot subsequently diverge.
        9
         This is readily seen from the electromagnetic energy-momentum tensor (1.52), which yields T (ℓ, ℓ) =
     µ−1
      0 E  · E where E α := F αµ ℓµ . Thanks to the antisymmetry of F , the vector E obeys E · ℓ = 0. It follows from
     Property 1.3 that E cannot be timelike. Hence E · E ≥ 0 and T (ℓ, ℓ) ≥ 0.
Chapter 3
Contents
        3.1   Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   57
        3.2   Non-expanding horizons . . . . . . . . . . . . . . . . . . . . . . . . . .         57
        3.3   Killing horizons . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .     63
        3.4   Bifurcate Killing horizons . . . . . . . . . . . . . . . . . . . . . . . . .       83
        3.5   Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .      89
3.1     Introduction
Having discussed in depth the geometry of null hypersurfaces in Chap. 2 we move forward to
distinguish a null hypersurface representing a black hole event horizon from, let us say, that
representing a mere future light cone. We do it here for black holes in equilibrium. Indeed, for
such objects, it is quite natural to assume a vanishing expansion. This leads us to the concept
of non-expanding horizon (Sec. 3.2). A special kind of these objects is that of Killing horizons,
which occur in spacetimes that are globally stationary (Secs. 3.3 and 3.4). Actually, we shall see
in Chap. 5 that, under some rather general hypotheses, the event horizon of a black hole in
equilibrium must be a Killing horizon.
     facet of the definition, we could demand that the cross-sections are closed (compact without
     boundary1 ) and have a constant area, i.e. a vanishing expansion. Hence the definition:
H ≃ R×S, (3.1)
       where S is a closed manifold of dimension n − 2, and such that the expansion of H along
       any null normal ℓ vanishes identically:
θ(ℓ) = 0. (3.2)
     Remark 1: Given the scaling law (2.68), if θ(ℓ) = 0 for some normal ℓ, then θ(ℓ′ ) = 0 for any other
     normal ℓ′ . Hence the definition of a non-expanding horizon does not depend on the choice of the null
     normal.
         The above definition captures only the event horizon of black holes in equilibrium, to be
     discussed in detail in Chap. 5. For a black hole out of equilibrium, one has generically θ(ℓ) > 0,
     as we shall see in Chap. 16.
     Example 1 (Schwarzschild horizon): In view of Eq. (2.59), we may assert that the Schwarzschild
     horizon considered in Examples 3, 7, 10, 12, 16, 18, 20, 22 and 24 of Chap. 2 is a non-expanding horizon.
     Example 2 (null hyperplane and light cone as counter-examples): The null hyperplane and light
     cone in Minkowski spacetime considered in the examples of Chap. 2 are excluded by the above definition,
     having non-compact cross-sections (null hyperplane) or nonzero expansion (light cone).
     Historical note : The concept of non-expanding horizon has been introduced by Petr Hájíček in 1973
     under the name of totally geodesic null hypersurface [251] or perfect horizon [252, 253]. The terminology
     non-expanding horizon is due to Abhay Ashtekar, Stephen Fairhurst and Badri Krishnan in 2000 [25]
     (see also [23]).
                                                                     √
                                Z                                  Z
                                    S
                           A=        ϵ(dx(2) , . . . , dx(n−1) ) =    q dx2 · · · dxn−1 ,   (3.3)
                                       S                          S
     where xa = (x2 , . . . , xn−1 ) is a coordinate system on S and q is the determinant with respect
     to these coordinates of the Riemannian metric q induced by g on S .
        1
            Cf. the discussion in Sec. 2.3.4.
                                   3.2 Non-expanding horizons                                         59
  The area A does not depend on the choice of the complete cross-section S of H . For this
  reason, A is called the area of the non-expanding horizon H .
If H is a non-expanding horizon, then θ(ℓ) = 0 and it follows that A(λ) is a constant function.
Hence the area of S ′ is equal to that of S . If S and S ′ have a non-empty intersecting part,
the argument can be repeated on each of the non-intersecting parts, given that the area of the
part S ∩ S ′ is obviously the same for S and S ′ .
Example 3 (Schwarzschild horizon): The area of the Schwarzschild horizon is readily computed from
                                                      √
the metric (2.31): q = 4m2 dθ2 + sin2 θ dφ2 , yielding q = 4m2 ; we get
A = 16πm2 . (3.4)
     where the expansion along k is defined in the same way as that along ℓ [cf. Eq. (2.69)]:
                                       (k)
                                    1 δAε − δA  1         1
                      θ(k)   := lim            = Lk ln q = q µν Lk qµν = q µν ∇µ kν ,                     (3.6)
                                ε→0 ε    δA     2         2
        (k)
     δAε being the area of the surface element that is deduced from the surface element of area
     δA on S by the Lie dragging along k by a parameter ε (cf. Fig. 3.1). The limit case θ(ℓ) = 0
     and θ(k) < 0 correspond to the so-called marginally trapped surface. In flat spacetime
     (Minkowski), given any closed spacelike surface, one has θ(ℓ) > 0 and θ(k) < 0 (cf. Fig. 3.1
     right), so there is no trapped surface. We shall discuss trapped surfaces in more detail in
     Sec. 18.2.1.
         Cross-sections of a non-expanding horizon are usually marginally trapped surfaces (cf. the
     example below). However, there exist some pathological situations for which θ(k) > 0 at some
     points of S [217].
     Example 4: Let us consider a cross-section S of the Schwarzschild horizon as defined in Example 12 of
     Chap. 2. Computing q µν ∇µ kν from the components kν given by (2.40), we get (cf. the notebook D.4.1 for
     the computation) θ(k) = −(r + 2m)/r2 . In particular, on S (r = 2m), θ(k) = −1/m. Hence θ(k) < 0.
     Since we had already θ(ℓ) = 0 [cf. Eq. (2.59)], we conclude that S is a marginally trapped surface.
     This could also have been inferred from Fig. 2.11, since according to the metric (2.41), the area of the
     cross-sections of H is nothing but 4πr2 and k points to decreasing values of r, while, on H , ℓ points
     to a fixed value of r.
R(ℓ, ℓ) = 0 . (3.8)
The identity σab σ ab = 0 is possible only if each of the terms σab in the sum (2.93) is zero. Hence
we have necessarily σ = 0. Since we had already θ(ℓ) = 0 (H non-expanding), this implies
that the full deformation rate tensor Θ vanishes identically [cf. Eq. (2.80)]. Moreover, given the
scaling law (2.84), this holds for any null normal ℓ. We can thus conclude:
Property 3.2: invariance of the cross-section metric along the null generators
  Provided that the null convergence condition (2.94) holds — which occurs in general
  relativity if the null energy condition (2.95) holds —, the deformation rate of any cross-
  section S of a non-expanding horizon H along any null normal ℓ is identically zero:
                                            1− ∗
                                        Θ := →q Lℓ q = 0 .                                        (3.9)
                                            2
  In other words, the whole metric q (and not only the area form Sϵ, as a mere θ(ℓ) = 0 would
  suggest) of any cross-section S is invariant along the geodesic generators of H .
Example 5 (Schwarzschild horizon): We had already noticed that, for the Schwarzschild horizon,
Θ = 0 [Eq. (2.83) in Example 22 of Chap. 2].
  ℓµ uν ∇ν v µ = uν ∇ν (ℓµ v µ ) − v µ uν ∇ν ℓµ = − Θνµ v µ uν − ων uν ℓµ v µ +v µ uν ℓν k σ ∇σ ℓµ = 0.
                        |{z}                        |{z}               |{z}        |{z}
                          0                          0                    0         0
  Let X(H ) be the space of vector fields tangent to the non-expanding horizon H . Under
62     The concept of black hole 2: Non-expanding horizons and Killing horizons
                            H
                                ∇ : X(H ) × X(H ) −→                     X(H )
                                                                                                 (3.10)
                                                                H
                                           (u, v)         7−→       ∇u v := ∇u v,
       is well-defined, i.e. H ∇u v belongs to X(H ). Moreover, this operator fulfills all the
       properties of an affine connection (cf. Sec. A.4.1), since ∇ does. We naturally call H ∇ the
       affine connection induced on H by ∇.
          As a consequence of the identity H ∇u v = ∇u v for tangent vector fields to H , (H , H ∇)
     is a totally geodesic submanifold of (M , g), i.e. any geodesic of (H , H ∇) is also a geodesic
     of (M , g) (cf. the historical note on p. 58).
       Property 3.4: horizon-intrinsic derivative of the null normal
       Under the hypotheses of Property 3.2, the covariant derivative of the null normal ℓ with
       respect to the affine connection H ∇ takes the form
                                              H
                                                  ∇ℓ = ℓ ⊗ H ω ,                                 (3.11)
       where H ω is the tensor field on H defined as the restriction of the 1-form ω introduced
       in Sec. 2.3.6 to tangent vectors to H [cf. Eq. (2.77)]. In other words, H ω is the pullback
       ι∗ ω of ω by the inclusion map ι : H → M (cf. Sec. A.2.8).
     Proof. By definition of the covariant derivative with respect to an affine connection [cf. Eq. (A.62)
     in Appendix A], H ∇ℓ is a tensor field of type (1, 1) on H , the action of which on any pair
     (a, u) formed by a 1-form a on H and a vector field u on H is H ∇ℓ(a, u) = ⟨a, H ∇u ℓ⟩.
     Let ā be the extension of a to a 1-form on M defined by ā := a ◦ Π, where Π is the projector
     onto H along k [Eq. (2.76)]. Since ∇u ℓ is tangent to H , we have ⟨a, H ∇u ℓ⟩ = ⟨ā, ∇u ℓ⟩. In
     view of (3.10) and using Eq. (2.74) to express ∇ℓ, we then get
                                                                                         
              H
                ∇ℓ(a, u) = ⟨ā, ∇u ℓ⟩ = āµ uν ∇ν ℓµ = āµ uν Θµν +ων ℓµ − ℓν k ρ ∇ρ ℓµ
                                                                |{z}
                                                                     0
                           = āµ ℓ ων u − ℓν u āµ k ∇ρ ℓ = ⟨ā, ℓ⟩ ⟨ω, u⟩ = ⟨a, ℓ⟩ ⟨H ω, u⟩.
                                 µ     ν         ν    ρ   µ
                                          |{z}
                                             0
     We can extend ξ to a vector field on all M , by varying p over M and setting ξ = 0 at fixed
     points of the group action. For any p ∈ M and dt infinitesimal, the (infinitesimal) vector
     connecting p to Φdt (p) is then dx = dt ξ (cf. Fig. 3.3), so that we may state:
         Given a spacetime (M , g) and a 1-dimensional Lie group G, one says that G is a symmetry
     group or a isometry group of (M , g), or equivalently that (M , g) is invariant under the
     action of G, iff there is an action Φ of G on M such that for any value of the parameter t of G,
     Φt is an isometry of (M , g), i.e. Φt preserves the “scalar products” (and hence the “distances”)
     in the following sense: for any p ∈ M and any pair of points (q, r) infinitely close to p, one has
                                    g|Φt (p) (dx′ , dy ′ ) = g|p (dx, dy),                       (3.15)
                                                                                        −−−−−−−→
     for the infinitesimal displacement vectors dx := →        −
                                                               pq, dy := → −
                                                                           pr, dx′ := Φt (p)Φt (q) and
              −−−−−−−→
     dy ′ := Φt (p)Φt (r) (cf. Sec. 1.2). Now, by definition, dx′ is nothing but the pushforward of the
     vector dx ∈ Tp M to the tangent space TΦt (p) M by the map Φt (cf. Sec. A.2.8), and similarly
     dy ′ is the pushforward of dy by Φt :
                                 dx′ = Φt∗ (dx)     and dy ′ = Φt∗ (dy).
                                           3.3 Killing horizons                                              65
Figure 3.3: Orbit Lp of a point p under the action Φ of a 1-dimensional Lie group, parameterized by t ∈ R.
The tangent vector ξ = dx/dt|Lp is the group generator associated with this parameter.
By rescaling by infinitely small parameters (using the bilinearity of g), it is clear that (3.15)
holds for finite vectors as well, so that we may say that Φt is an isometry of (M , g) iff
               ∀p ∈ M , ∀(u, v) ∈ (Tp M )2 ,        g|Φt (p) (Φt∗ u, Φt∗ v) = g|p (u, v),          (3.16)
where Φt∗ u (resp. Φt∗ v) is the pushforward of the vector u ∈ Tp M (resp. v ∈ Tp M ) to the
tangent space TΦt (p) M by Φt (cf. Sec. A.2.8). Given the definition (A.33) of the pullback of a
multilinear form, we may reexpress the isometry condition (3.16) in terms of the pullback of g
by Φt :
                                           Φ∗t g = g.                                     (3.17)
This is equivalent to the vanishing of the Lie derivative of g along the generators of G:
  Property 3.6: characterization of continuous spacetime isometries
  A 1-dimensional Lie group G is a symmetry group of the spacetime (M , g) iff the Lie
  derivative of the metric tensor along a generator ξ of G vanishes identically:
Lξ g = 0 . (3.18)
  The vector field ξ is then called a Killing vector of (M , g), Eq. (3.18) being equivalent to
  the so-called Killing equation:
∇α ξβ + ∇β ξα = 0 . (3.19)
The isometry condition (3.17) implies then Lξ g = 0. The reverse is true by integration. The
equivalence between Eqs. (3.18) and (3.19) immediately results from expression (A.87) for the
Lie derivative of g.
66     The concept of black hole 2: Non-expanding horizons and Killing horizons
       In terms of the components gαβ of g with respect to coordinates (xα ) = (t, x1 , . . . , xn−1 )
       adapted to the Killing vector ξ, i.e. such that ξ = ∂t , the isometry condition (3.18) is
       equivalent to
                                               ∂gαβ
                                                    = 0.                                        (3.20)
                                                ∂t
       t is then called an ignorable coordinate.
         Thus the existence of a Killing horizon requires that the spacetime (M , g) has some
     continuous symmetry (usually stationarity), namely that it is invariant under the action of a
     1-parameter group, as described in Sec. 3.3.1. Since the normal to a null hypersurface is tangent
     to its null geodesic generators and a Killing vector is tangent to the orbits of the isometry
     group, a definition equivalent to the above one is:
       Its immediately follows from the definition that the Killing vector ξ is non-vanishing on
     H (a normal to a hypersurface cannot vanish) and is null on H :
                                     ξ|H ̸= 0       and        ξ · ξ|H = 0.                            (3.21)
         We shall see in Chap. 5 that in a stationary spacetime, under some rather generic hypotheses,
     a (connected part of a) black hole event horizon must be a Killing horizon.
     Example 6 (null hyperplane as a translation-Killing horizon): Let us consider the null hyperplane
     of Minkowski spacetime H discussed in Examples 1, 5 and 8 of Chap. 2. H is defined by the equation
     t = x. The vector field
                                              ξ := ∂t + ∂x                                        (3.22)
     is a Killing vector of Minkowski spacetime: ξ is the generator of translations in the direction ∂t + ∂x ,
     and these translations constitute a 1-dimensional subgroup of the Poincaré group — the symmetry group
     of Minkowski spacetime. We note that ξ coincides with the null vector ℓ defined by Eq. (2.12). Since ℓ is
     normal to H , we conclude immediately that H is a Killing horizon with respect to ξ.
     Example 7 (null hyperplane as a boost-Killing horizon): Let us consider the same null hyperplane
     H as above, but with another Killing vector of Minkowski spacetime:
Figure 3.4: Null half-hyperplanes H + and H − as Killing horizons for the Killing vector field ξ = x∂t + t∂x
generating Lorentz boosts in Minkowski spacetime. The green lines are the null geodesic generators of H , while
the thick black line (actually a 2-plane) marks the location where ξ vanishes.
This vector is the generator of the 1-parameter group of Lorentz boosts in the (t, x) plane. On H we
have (cf. Fig. 3.4):
                                             H               H
                                           ξ = t(∂t + ∂x ) = t ℓ,
                                                                            H
where ℓ is the null normal to H defined by Eq. (2.12) and the notation = means that the equality holds
only on H . We conclude that ξ is a normal to the null hypersurface H as soon as t ̸= 0. Therefore, we
may split H \ {t = 0} in two open half-hyperplanes:
so that each of them is a Killing horizon with respect to ξ (cf. Fig. 3.4).
It follows that ξ is a null normal to H as soon as y ̸= 0. We may then split H \ {y = 0} in two open
half-hyperplanes:
     Figure 3.5: Null half-hyperplanes H1 and H2 as Killing horizons for the Killing vector field ξ = y(∂t + ∂x ) +
     (t − x)∂y generating null rotations in Minkowski spacetime. The green lines are the null geodesic generators of
     H , while the thick black line (actually a 2-plane) marks the location where ξ vanishes.
each of them being a Killing horizon with respect to ξ (cf. Fig. 3.5).
     Example 9 (light cone as a counter-example): The future light cone introduced in Example 2 of
     Chap. 2 is not a Killing horizon of Minkowski spacetime: it is invariant under the action of the Lorentz
     group, but its null generators are not invariant under the action of a single 1-dimensional subgroup of
     the Lorentz group. Actually the future light cone is an example of a more general structure, which Carter
     has termed a local isometry horizon [89, 92]: a null hypersurface that is invariant under some group
     G of isometries (here: the Lorentz group) and such that each null geodesic generator is an orbit of some
     1-dimensional subgroup of G, this subgroup being not necessarily the same from one null generator to
     the next (here: using Minkowskian spherical coordinates (t, r, θ, φ), the null geodesic generator through
     the point of coordinates (1, 1, θ0 , φ0 ) is the orbit of this point under the subgroup of boosts in the plane
     (θ, φ) = (θ0 , φ0 )). A Killing horizon is a local isometry horizon for which dim G = 1.
     Example 10 (Schwarzschild horizon): Given the expression (2.15) for the null normal ℓ of the family
     of hypersurfaces Hu and the fact that the Schwarzschild horizon H is defined by r = 2m, we have
                                                          H
                                                        ℓ = ∂t .                                             (3.26)
     Now the vector field ∂t is clearly a Killing vector of metric g as given by (2.5), since none of the metric
     components gαβ depends upon t. Hence (3.26) shows that the Schwarzschild horizon is a Killing horizon.
     By the way, Eq. (3.26) was our motivation for the choice of the null normal ℓ performed in Example 7 of
     Chap. 2.
     Historical note : The concept of Killing horizon has been introduced by Brandon Carter in 1966
     [88, 89] and developed in an article published in 1969 [92]. The properties of Killing horizons have been
     studied in detail by Robert H. Boyer, in an article prepared posthumously from his notes by J. Ehlers and
     J.L. Stachel and published in 1969 [71], leading to the concept of bifurcate Killing horizon, to be discussed
     in Sec. 3.4 (cf. the historical note on p. 87).
                                           3.3 Killing horizons                                             69
where the identifications with zero hold on S , the second one because u is constant (being
equal to zero) on S . Now thanks to the Killing equation (3.19), q µα q νβ ∇µ ξν is an antisymmetric
tensor field, while Θαβ is symmetric (cf. Sec. 2.3.6). It follows that Θαβ = 0. In view of the
scaling law (2.84), we can extend this result to the deformation rate of S along any null normal
to H and therefore state:
  Property 3.8: vanishing of the deformation rate of cross-sections of a Killing
  horizon
  On a Killing horizon H , the deformation rate Θ of any cross-section along any null normal
  ℓ is identically zero:
                                           Θ=0.                                        (3.27)
  This implies that the expansion of H along any null normal ℓ vanishes:
θ(ℓ) = 0. (3.28)
In view of the definition of a non-expanding horizon (cf. Sec. 3.2.1), an immediate corollary is
Remark 1: Θ vanishes for any Killing horizon [Eq. (3.27)], while to get the same result on a generic non-
expanding horizon, one has to assume that the null convergence condition holds on H (Property 3.2).
                                                                          H
     Let us consider the metric dual of this relation, i.e. ξ µ ∇µ ξα = κ ξα , and use the Killing equation
     (3.19) as ∇µ ξα = −∇α ξµ ; we get
                                                             H
                                                ξ µ ∇α ξµ = −κ ξα .
     Now ξ µ ∇α ξµ = 1/2 ∇α (ξµ ξ µ ). Hence
                                                             H
                                              ∇α (ξµ ξ µ ) = −2κ ξα .                                    (3.30)
Since ξµ ξ µ = ξ · ξ is a scalar field, we may replace the covariant derivative by the differential:
                                                             H
                                                d(ξ · ξ) = −2κ ξ .                                       (3.31)
     or equivalently
                                                             H
                                        ∇α ξβ − ∇β ξα = aα ξβ − aβ ξα .                                  (3.33)
     Remark 3: In the case of the vector ℓ, which is normal to H by definition, the Frobenius identity is
                                                                 H
     Eq. (2.16): ∇α ℓβ − ∇β ℓα = ∇α ρ ℓβ − ∇β ρ ℓα . Since ℓ = ξ, we may write
                                                         H
                                       ∇α ℓβ − ∇β ℓα = ∇α ρ ξβ − ∇β ρ ξα .
     But in general, ∇α ℓβ ̸= ∇α ξβ on H , since ℓ and ξ do not coincide outside H . Accordingly, one cannot
     identify the left-hand side of the above equation with the left-hand side of Eq. (3.33), so that the 1-form
     a is not ∇ρ.
         Thanks to the Killing equation (3.19), we may reshape (3.33) to
                                                     H
                                            2∇α ξβ = aα ξβ − aβ ξα .                                     (3.34)
In view of Eq. (3.29), the left-hand side of this equation is 2κξα . Hence we obtain
                                                          H
                                                   aµ ξ µ = 2κ.                                               (3.35)
                                                 H  1
                                              κ2 = − ∇µ ξν ∇µ ξ ν .                                           (3.36)
                                                    2
This is an explicit expression of κ in terms of the Killing vector field ξ. However, in actual
calculations, it is generally preferable to employ formula (3.31) to evaluate κ, because it does
not involve the computation of any covariant derivative, contrary to formula (3.36).
Lℓ κ = 0. (3.37)
This means that κ is constant along any field line of ℓ (i.e. any null geodesic generator of H ).
It could however vary from one field line to another. To prove that this is not the case, let us
consider a complete cross-section S of H and show that κ is constant on S . The starting
point is applying the 1-form identity (2.85) to a generic vector field v tangent to S , i.e. contract
(2.85) with v α . Since ℓµ v µ = 0, we get
Now, since H is a Killing horizon, we have Θ = 0 [Eq. (3.27)] and in particular θ(ℓ) = 0, so
that many terms in the above equation vanish. The term ∇µ Θµν requires some special care
                                                       →
                                                       −
though, because it potentially involves derivatives of Θ in directions transverse3 to H . Let us
then introduce in the vicinity of S a spacetime coordinate system (xα ) = (u, v, x2 , . . . , xn−1 )
   3
     Recall that, in our setting, tensor fields are extended beyond H by considering H as the element u = 0
of a family (Hu )u∈R of null hypersurfaces (cf. Sec. 2.3.2). We cannot assume that each Hu is a Killing horizon
                                                                                                        →
                                                                                                        −
(because typically the scalar ξ · ξ vanishes on a single hypersurface, not on an open subset of M ), so Θ is a priori
not zero outside H .
72     The concept of black hole 2: Non-expanding horizons and Killing horizons
     adapted to S , as the one introduced in Sec. 2.3.5 [cf. Eq. (2.52)], namely a coordinate system such
                                                                      →
                                                                      −
     that S is the set (u, v) = (0, 0). Then the components of Θ with respect to (xα ) necessarily
                                                       →
                                                       −
     verify Θ0α = 0 and Θ1α = 0. Indeed, since Θ is a tensor field tangent to S , on which u
     and v are constant, we have Θµα ∇µ u = 0 and Θµα ∇µ v = 0 with ∇µ u = ∂µ u = δ 0µ and
     ∇µ v = ∂µ v = δ 1µ by definition of (xα ). Expressing the covariant derivative via Eq. (A.64), we
     get
                                  ∇µ Θµν = ∂a Θaν + Γµµσ Θσν − Γσµν Θµσ ,
     where the sum of the partial derivatives has been limited to the indices a ∈ {2, . . . , n − 1}
                                                                                    →
                                                                                    −
     since Θ0ν = Θ1ν = 0. Given that the term ∂a Θaν involves only the variation of Θ in directions
                                                     →
                                                     −
     tangent to S , we conclude that ∇µ Θµν = 0 if Θ = 0 on S . Hence, for a Killing horizon,
     Eq. (3.38) reduces to
                             (ℓν ∇ν ωµ + κ ωµ )v µ − v µ ∇µ κ = Rµν ℓµ v ν .                   (3.39)
     The terms in the parentheses are related to the Lie derivative of ω along ℓ; indeed formula
     (A.86) gives:
                  Lℓ ωµ = ℓν ∇ν ωµ + ων ∇µ ℓν = ℓν ∇ν ωµ + ων Θµν + ωµ ℓν − ℓµ k σ ∇σ ℓν
                                                                                             
= ℓν ∇ν ωµ + κωµ − ων ℓµ k σ ∇σ ℓν ,
     where we have expressed ∇µ ℓν via Eq. (2.74) and have used Θµν = 0 and ων ℓν = κ [Eq. (2.78)].
     Given that ℓµ v µ = 0, Eq. (3.39) becomes
     where the idendities ⟨ω, v⟩ = ⟨H ω, v⟩ and ⟨ω, Lℓ v⟩ = ⟨H ω, Lℓ v⟩ hold by the very definition
     of H ω, since v and Lℓ v are tangent4 to H . It follows then from (3.41) that ⟨Lℓ ω, v⟩ = 0;
     hence the first term in Eq. (3.40) vanishes identically and there remains only:
    To go further, we shall assume the null dominance condition [421], namely that there
exists a scalar field f such that
                       →
                       −
      the vector W := − G(ℓ) − f ℓ         is zero or future-directed (null or timelike)
                                                                                                 .   (3.43)
                                           for any future-directed null vector ℓ
                        →
                        −
In the above equation, G stands for the type-(1, 1) tensor associated by metric duality to the
Einstein tensor G [Eq. (A.111)], so that the expression of W in index notation is [cf. Eq. (A.50a)]
W α := −Gαµ ℓµ − f ℓα . (3.44)
Note that the null dominance condition implies the null convergence condition (2.94) since
                            R
    R(ℓ, ℓ) = R(ℓ, ℓ) −       g(ℓ, ℓ) +f g(ℓ, ℓ) = G(ℓ, ℓ) + f g(ℓ, ℓ) = −W · ℓ ≥ 0,                 (3.45)
                            2 | {z }     | {z }
                                  0           0
the inequality holding because both W and ℓ are future-directed (cf. Lemma 1.2).
    If gravity is described by general relativity, i.e. if the metric g fulfills the Einstein equation
(1.41), then the null dominance condition with f = Λ is equivalent to the null dominant
energy condition:
                        →
                        −
       The vector W := − T (ℓ)        is zero or future-directed (null or timelike)
                                                                                             ,       (3.46)
                                      for any future-directed null vector ℓ
                                                                                        GR
Physically, the dominant energy condition states that, with respect to any observer (represented
by its 4-velocity u, which is future-directed timelike), the energy of matter and non-gravitational
fields does not move faster than light (see Ref. [104] for an extended discussion).
Remark 4: While the name null convergence condition for R(ℓ, ℓ) ≥ 0 [Eq. (2.94)] is standard in the
literature (e.g. [266, 452, 454]), the name null dominance condition for (3.43) is not standard. We are
using it to distinguish from the null dominant energy condition (3.46), which a condition on the matter
energy-momentum tensor, while (3.43) is a pure geometrical identity, independent of the Einstein
equation. In this way, null dominance condition is on the same footing as null convergence condition.
    Coming back to Eq. (3.42), we note that its right-hand side is nothing but the scalar product
of the vector W defined by Eq. (3.43) with v:
                                                                 
                      →
                      −                                   R
           W · v = −( G(ℓ) + f ℓ) · v = −R(ℓ, v) +           − f |{z}
                                                                    ℓ · v = −R(ℓ, v).
                                                          2
                                                                         0
74     The concept of black hole 2: Non-expanding horizons and Killing horizons
     If we assume the null dominance condition, the null convergence condition holds, so that
     R(ℓ, ℓ) = 0 on H [Eq. (3.8)]. Then, according to Eq. (3.45), ℓ · W = −R(ℓ, ℓ) = 0. This
     implies that the vector W is tangent to H . The latter being a null hypersurface, W must
     then be either collinear to ℓ or spacelike (cf. Lemma 2.3 in Sec. 2.3.4). Now, according to the
     null dominance condition (3.43), W cannot be spacelike. We conclude that W is collinear
     to ℓ. Consequently, we have W · v = 0. Hence the right-hand side of Eq. (3.42) vanishes
     identically and we are left with ∇v κ = 0. Since v is a generic vector field tangent to S and
     S is connected (for it is a complete cross-section of a Killing horizon, which is connected by
     definition), this implies that κ is constant over S . Given that κ is constant along each null
     geodesic generator of H , this completes the demonstration that κ is constant over H . More
     precisely, we have established the following property:
     In the context of Killing horizons, the non-affinity coefficient κ is called the horizon’s surface
     gravity, for a reason to be detailed in Sec. 3.3.7, and the result (3.48) is known as the zeroth
     law of black hole dynamics. More precisely, the zeroth law — to be discussed in detail in
     Chap. 16 — states that the surface gravity of a black hole in equilibrium is constant and we shall
     see in Chap. 5 that (any connected part of) the event horizon of a black hole in equilibrium is a
     Killing horizon.
     Remark 5: The standard proof of the zeroth law, based on taking the covariant derivative of Eq. (3.29)
     [40, 101, 499] or of Eq. (3.31) [275] and on an identity expressing the second order derivative of a Killing
     vector in terms of the Riemann tensor [Eq. (3.83) below], is quite long (see e.g. pp. 333-334 of Wald’s
     textbook [499] and the remark at the end of Sec. 5.5.2 of Poisson’s textbook [416]). The proof presented
     above is shorter but it relies on the concept of induced affine connection on a non-exanding horizon
     (Sec. 3.2.5) and the associated rotation 1-form H ω (cf. Property 3.4). This proof is actually adapted from
     a proof presented by Damour [151, 152] (cf. historical note below); see also Ref. [22] for a related proof
     regarding isolated horizons.
     Remark 6: The constancy of κ on a Killing horizon can also be proved without the null dominance
     condition, but at the price of additional hypotheses: either the Killing horizon H is part of a so-called
     bifurcate Killing horizon, as we shall see in Sec. 3.4.3 (Property 3.16) or the spacetime is axisymmetric,
     in addition to be stationary, and the two Killing vectors associated with stationarity and axisymmetry
     are orthogonal to (n − 2)-dimensional surfaces [96, 422].
Example 12 (null hyperplane as a boost-Killing horizon): Let us consider each of the null half-
hyperplanes H + and H − of Example 7, which are Killing horizons with respect to the boost Killing
vector ξ = x∂t + t∂x . On H + , the future-directed null normal coinciding with this Killing vector is
ℓ+ = t ℓ, ℓ being the geodesic null normal defined by ℓ := ∂t + ∂x [cf. Eq. (2.12)]. Using κℓ = 0 and
the scaling law (2.26), we get the non-affinity coefficient of ℓ+ as κ+ = ∇ℓ t = ∂t t + ∂x t, i.e.
κ+ = 1.
On H − , ξ is past-directed (cf. Fig. 3.4). Sticking to future-directed null normals, we shall then consider
H − as a Killing horizon with respect to the Killing vector field −ξ. The future-directed null normal
coinciding with −ξ on H − is then ℓ− = −t ℓ, from which we deduce the non-affinity coefficient of ℓ− :
κ− = ∇ℓ (−t) = ∂t (−t) + ∂x (−t), i.e.
                                                   κ− = −1.
We check that κ+ (resp. κ− ) is constant over the Killing horizon H + (resp. H − ), in agreement with
Property 3.10.
Example 14 (Schwarzschild and Kerr horizons): We have found in Example 10 of Chap. 2 [cf.
Eq. (2.29)] that on a Schwarzschild horizon κ = 1/(4m), which is clearly constant. But this last feature
is rather trivial since the Schwarzschild horizon is spherically symmetric, so that no dependence of κ on
θ nor φ could have been expected. A much less trivial example is that of the event horizon of a Kerr
black hole, which we shall discuss in Chap. 10. This horizon is only axisymmetric, so that a priori κ
could depend on θ. But it does not, as we shall see in Sec. 10.5.4:
                                                  √
                                                     m2 − a2
                                        κ=             √           ,
                                             2m(m + m2 − a2 )
where (m, a) are the two constant parameters of the Kerr solution. Note that for a = 0, we recover the
Schwarzschild value: κ = 1/(4m).
Example 15 (Cubic Galileon black hole as a counter-example): It has been found recently that the
surface gravity of rotating stationary black holes in a scalar-tensor theory of gravity known as the cubic
Galileon is not constant [241]. This evades the zeroth law (Property 3.10) because the null dominance
condition is not satisfied by these solutions.
Historical note : The constancy of κ for a Killing horizon has been proven by Stephen Hawking
in his lecture at the famous 1972 Les Houches Summer School [261] (p. 43). It has also been proven
without requiring the null dominance condition, but assuming axisymmetry and orthogonal transitivy
(cf. Remark 6 on p. 74) by Brandon Carter in his lecture at the same summer school [96] (Theorem 8,
p. 167). A third proof of the constancy of κ, using the null dominance condition, has also been given in
76     The concept of black hole 2: Non-expanding horizons and Killing horizons
     1973 by James Bardeen, Brandon Carter and Stephen Hawking in their seminal article The Four Laws of
     Black Hole Mechanics [40]. Yet another proof has been provided in 1979 by Thibault Damour [151, 152],
     who developed a fluid-bubble approach to the dynamics of an event horizon. In Damour’s framework,
     the pullback of the 1-form −ω/(8π) to the cross-section S is considered as the momentum surface
     density of the fluid bubble and Eq. (3.38) is turned into a 2-dimensional Navier-Stokes equation (see e.g.
     Sec. 6.3 of Ref. [230] for details), where κ/(8π) plays the role of the bubble’s surface pressure, so that it
     must constant in equilibrium. As mentioned in Remark 5, the proof presented above is derived from
     Damour’s one.
     Example 16 (Killing horizons in Minkowski spacetime): In Minkowski spacetime, the null hyper-
     plane as a translation-Killing horizon (Example 11) and the two half-hyperplanes as null-rotation-Killing
     horizons (Example 13) are degenerate Killing horizons, while the two half-hyperplanes as boost-Killing
     horizons (Example 12) are non-degenerate.
     Example 17 (Schwarzschild and Kerr horizons): From the values of κ given in Example 14, we see
     that the Schwarzschild horizon and the Kerr horizon for a < m are non-degenerate Killing horizons,
     while the Kerr horizon for a = m is a degenerate one.
         The next example regards the anti-de Sitter spacetime and will play some role in the study
     of the extremal (a = m) Kerr black hole in Chap. 13.
     Example 18 (Poincaré horizon in AdS4 ): The 4-dimensional anti-de Sitter spacetime (AdS4 ) is
     (M , g) with M ≃ R4 and g is the metric whose components in the so-called global static coordinates
     (τ, r, θ, φ) are given by
                                                         dr2
                                                                                   
                                 2              2    2            2  2     2    2
                                                                                                           (3.49)
                                                                                  
                           g=ℓ           −(1 + r ) dτ +        + r dθ + sin θ dφ      ,
                                                        1 + r2
     where ℓ is a positive constant. Note that τ spans R, r spans (0, +∞), while (θ, φ) are standard spherical
     coordinates on S2 : θ ∈ (0, π) and φ ∈ (0, 2π). The metric (3.49) is a solution to the vacuum Einstein
     equation (1.43) with the negative cosmological constant Λ = −3/ℓ2 . Using the so-called conformal
     coordinates (τ, χ, θ, φ) with χ := arctan r ∈ (0, π/2), one gets
                                       ℓ2 
                                           −dτ 2 + dχ2 + sin2 χ dθ2 + sin2 θ dφ2 .                         (3.50)
                                                                                
                             g=         2
                                     cos χ
                                                3.3 Killing horizons                                                   77
3 3
2 2
                   1                                                1
                                  P
                                                             τ
                   0                                                0
             τ
1 1
2 2
                   3                                                3
                       1.5 1.0 0.5 0.0 0.5 1.0 1.5                      1.5 1.0 0.5 0.0 0.5 1.0 1.5
                                   xχ                                               xχ
Figure 3.6: Left: 2-dimensional slice (x, y) = (0, 0) of the Poincaré patch MP of anti-de Sitter spacetime plotted
in terms of the (global) coordinates τ and xχ := χ cos φ (pale yellow region). From Eq. (3.53), (x, y) = (0, 0)
implies θ = π/2 and φ = 0 or π, so that xχ = χ in the right half of the plot (φ = 0) and xχ = −χ in the left half
(φ = π). The plotted slice of MP is thus spanned by the Poincaré coordinates (t, u). The red lines are curves
of constant u, i.e. integral curves of the coordinate vector field ∂t = ξ, with u increasing from 0 to +∞ from
the right to the left of the diagram. The grey lines are curves of curves of constant t, i.e. integral curves of the
coordinate vector field ∂u , with t increasing from −∞ to +∞ from the bottom to the top of the diagram. The
Poincaré horizon H is depicted in green. The two connected components H− and H+ of H appear as straight
line segments, since for θ = π/2 and φ ∈ {0, π}, u = 0 ⇐⇒ cos τ = ± sin χ (+ for φ = 0 and − for φ = π).
Note that the curves of constant u tend to H when u → 0, in agreement with the characterization of H by
u = 0. The curves of constant t tend to H when t → ±∞, which is expected as well since the first line of
Eq. (3.53) and Eq. (3.52) imply that u → 0 for t → ±∞. Right: Killing vector field ξ defined by Eq. (3.58) on AdS4 ;
ξ is timelike everywhere, except on the Poincaré horizon H , where it is null (and normal, and thus tangent, to
H ). [Figures generated by the notebook D.3.1]
78     The concept of black hole 2: Non-expanding horizons and Killing horizons
     Note that (χ, θ, φ) spans one half5 of the hypersphere S3 . Yet another set of coordinates commonly
     used in AdS4 is Poincaré coordinates (t, x, y, u). They cover only a subpart MP of M (cf. Fig. 3.6),
     usually called the Poincaré patch and defined by
     The last line is simply (3.52) restricted to MP , the scalar field u being viewed there as one of the Poincaré
     coordinates. The metric components with respect to Poincaré coordinates7 are (cf. the notebook D.3.1
     for the computation):
                                             u2                         ℓ2
                                       g = 2 −dt2 + dx2 + dy 2 + 2 du2 .                                      (3.54)
                                             ℓ                              u
     The Poincaré horizon is the hypersurface H bounding the Poincaré patch MP in M . In view of the
     definition (3.51) of the latter, H appears to be the level set u = 0:
     Note that H is not included in MP , so that the Poincaré coordinates are not defined on H (except
     for u, they actually diverge in the vicinity of H ). Note also that H has two connected components:
     H− , where τ ∈ (−π, 0), and H+ , where τ ∈ (0, π), since u = 0 cannot be achieved for τ = 0, as a
     consequence of formula (3.52) and | sin χ| < 1 on M . The Poincaré horizon is depicted in Fig. 3.6. Its
     normal is given by the gradient of u, so that the vector field defined in all M by k := ∇u
                                                                                              ⃗ is normal to
     H on H . Given expressions (3.50) and (3.52) for respectively g and u, the components k α = g αµ ∂µ u
     of k with respect to conformal coordinates are
                                                                                                  
              1                                                     cos θ cos φ        sin φ
         k=       sin τ cos χ∂τ + (cos τ sin χ − sin θ cos φ) ∂χ −              ∂θ +             ∂φ . (3.56)
               ℓ                                                       tan χ         tan χ sin θ
     The scalar square of k is k · k = kµ k µ = k µ ∂µ u; we obtain
                                                                  u2
                                                       k·k =         .                                            (3.57)
                                                                  ℓ2
        5
           It would span the whole hypersphere if χ would run in all of (0, π), instead of being limited to (0, π/2).
        6
           See e.g. Ref. [45].
         7
           The name Poincaré coordinates stems from a variant of these coordinates obtained by using z := ℓ2 /u instead
     of u, so that g = ℓ2 −dt2 + dx2 + dy 2 + dz 2 /z 2 , which is similar to the metric of the Poincaré half-space
     model of the hyperbolic space H4 , except for the signature (−, +, +, +) instead of (+, +, +, +).
                                             3.3 Killing horizons                                                79
                H
Hence k · k = 0, which implies that H is a null hypersurface.
   Since the metric components (3.54) do not depend on t, the vector field ξ := ∂t is a Killing vector of
(MP , g). By inverting the Jacobian matrix associated with the change of coordinates (3.53), we get the
expression of ξ = ∂t in terms of conformal coordinates:
      1 − cos τ sin χ sin θ cos φ       sin τ cos χ sin θ cos φ      sin τ cos θ cos φ       sin τ sin φ
ξ=                                ∂τ −                          ∂χ −                   ∂θ +               ∂φ .
                   ℓ                               ℓ                      ℓ sin χ           ℓ sin χ sin θ
                                                                                                        (3.58)
A priori, ξ is defined on MP only, but the right-hand side of the above expression is regular on all M .
Hence, we may use (3.58) to define ξ as a vector field on all M . It is depicted in the right panel of
Fig. 3.6. By analytical continuation, it is immediate that ξ obeys the Killing equation (3.19) everywhere
and not only on MP (cf. the notebook D.3.1 for an explicit check). Hence ξ is a Killing vector of the
entire anti-de Sitter spacetime (M , g). By comparing Eqs. (3.56) and (3.58), we get
                                sin τ     u
                           ξ=         k + 2 (cos τ cos χ ∂τ − sin τ sin χ ∂χ ) .                       (3.59)
                                cos χ    ℓ
            H
Since u = 0 [Eq. (3.55)], it follows immediately that, on H , ξ is collinear to the null normal to
        H
H : ξ = (sin τ / cos χ) k, with sin τ ̸= 0 (given that τ ̸= 0 on H ). Hence, ξ is normal to the null
hypersurface H . We therefore conclude that each of the connected components H− and H+ of the
Poincaré horizon H is a Killing horizon with respect to ξ. Let us evaluate the non-affinity coefficient κ
of ξ on H± via formula (3.31). First of all, we compute the scalar square of ξ by noticing that on MP ,
ξ = ∂t , so that ξ · ξ = gtt ; with gtt read on Eq. (3.54), we get
                                                          u2
                                               ξ·ξ =−        .                                         (3.60)
                                                          ℓ2
By means of the global components (3.50) and (3.58) of respectively g and ξ, one checks that formule
(3.60) holds in all M . Given that the right-hand side is negative wherever u ̸= 0, it follows that ξ is
timelike everywhere on M , except on H . Furthermore, formula (3.60) results in d(ξ · ξ) = −2ℓ−2 u du.
        H                            H
Since u = 0, this implies d(ξ · ξ) = 0, so that formula (3.31) yields κ = 0. We conclude that the two
connected components of the Poincaré horizon of AdS4 are degenerate Killing horizons.
Property 3.11
       In the vicinity of a non-degenerate Killing horizon H , the Killing vector field ξ is timelike
       on one side of H , null on H and spacelike on the other side.
Let us focus on the side of H where ξ is timelike. There we define the “norm” of ξ by
                                                                                                             (3.61)
                                                          p
                                               V :=           −ξ · ξ .
We have V > 0 and the square of the gradient of V provides a new expression for κ:
                                            κ2 = lim ∇µ V ∇µ V ,                                             (3.62)
                                                     H
     where limH stands for the limit as one approaches H from the timelike side, which implies
     V → 0.
ω := ξ ∧ dξ (3.63)
Killing equation (3.19) enables us to simplify each term inside parentheses in (3.64), yielding
ξµ ∇ν ξρ ξ µ ∇ν ξ ρ = ξµ ξ µ ∇ν ξρ ∇ν ξ ρ = −V 2 ∇ν ξρ ∇ν ξ ρ = −V 2 ∇µ ξν ∇µ ξ ν (3.66)
                                                                         1
     ξµ ∇ν ξρ ξ ν ∇ρ ξ µ = ξµ ∇ρ ξ µ ξ ν ∇ν ξρ = −ξµ ∇ρ ξ µ ξ ν ∇ρ ξν = − ∇ρ V 2 ∇ρ V 2 = −V 2 ∇ρ V ∇ρ V.
                                                                         4
                                                                                                    (3.67)
                                                 3.3 Killing horizons                                       81
Actually, we notice that each line is made of one term of type (3.66) and two terms of type
(3.67). Hence
                      ωµνρ ω µνρ = −12V 2 (∇µ ξν ∇µ ξ ν + 2∇µ V ∇µ V ) .              (3.68)
On H , each of the terms inside parentheses in Eq. (3.64) can be expressed thanks to the
Frobenius identity (3.33):
                         H
                   ωαβγ = ξα (aβ ξγ − aγ ξβ ) + ξβ (aγ ξα − aα ξγ ) + ξγ (aα ξβ − aβ ξα ) .
We notice that all terms in the right-hand side canceal two by two, yielding
                                                           H
                                                    ωαβγ = 0.                                      (3.69)
Equation (3.69) is actually nothing but a variant of Frobenius theorem, expressing the fact that
the vector field ξ is hypersurface-orthogonal on H (see e.g. Eq. (B.3.6) in Wald’s textbook
[499], taking into account that ωαβγ = 6ξ[α ∇β ξγ] ). Let us evaluate the gradient of the square
(3.68) and take the limit on H :
                                                                                     
                    µνρ               µνρ               2         µ ν             µ
         ∇α ωµνρ ω      + ω µνρ ∇ α ω     =  −12  ∇ α V     ∇ ξ ∇  ξ   +2∇    V ∇   V
                  |{z} |{z}                       | {z  } | µ ν{z }         µ
                        →0       →0                         →2κξα        →−2κ2
                                                           V 2 ∇α (∇µ ξν ∇µ ξ ν + 2∇µ V ∇µ V ) ,
                                                      −12 |{z}
                                                               →0
                                                                H
where we have used Eq. (3.31) in the form ∇α V 2 = 2κξα , as well as expression (3.36) of κ2 .
Hence we are left with
                         κ ∇µ V ∇µ V − κ2 ξα −→ 0 on H .
                                           
         a := ∇u u
            = ∇V −1 ξ V −1 ξ = V −1 ∇ξ V −1 ξ = V −1 −V −2 (∇ξ V ) ξ + V −1 ∇ξ ξ .
                                                                             
   8
       Were ξ past-directed, we could always consider the Killing field −ξ instead.
82     The concept of black hole 2: Non-expanding horizons and Killing horizons
     We have thus
                                                 1
                                                    ∇ξ ξ.
                                                     a=                                                         (3.71)
                                                 V2
     Thanks to Killing equation (3.19), we may rewrite this relation as
                     1 µ            1               1                 1
             aα =     2
                        ξ ∇µ ξα = − 2 ξ µ ∇α ξµ = − 2 ∇α (ξµ ξ µ ) =      ∇α V 2 = ∇α ln V,
                    V              V               2V                2V 2
     hence
                                                          →
                                                          −
                                                      a = ∇ ln V.                                               (3.72)
     The norm of a, which is always a spacelike vector (since u · u = −1 implies u · a = 0), is
                                                 √            1p
                                          a :=       a·a=       ∇µ V ∇µ V .                                     (3.73)
                                                              V
     Given the result (3.62), we get an expression of κ involving a:
                                                     κ = lim V a ,                                              (3.74)
                                                           O→H
     where O → H means that the limit is achieved by choosing the worldline of observer O
     arbitrarily close to H . Since V → 0 as one approaches H , it follows that
     This means that the acceleration felt by observer O (the “gravity”) diverges as O approaches
     H . In that sense, the physical surface gravity of H is infinite. But Eq. (3.74) shows that the
     rescaled acceleration V a remains finite as one approaches H , and tends to κ. It is this quantity
     that is named the surface gravity of the Killing horizon H .
     Remark 7: As stressed above, the surface gravity κ is not the actual gravity a measured locally, i.e. by an
     observer at rest with respect to H and infinitely close to it. However, κ can be interpreted as a physical
     force (per unit mass) measured by a distant observer, at least in the special case of a Schwarzschild
     black hole, for which ξ is timelike in the entire region outside the Killing horizon9 . In this case, one can
     identify κ with the magnitude of the force exerted by an observer “at infinity” to hold in place a particle
     of unit mass close to H by means of an infinitely long massless string (see e.g. Sec. 5.2.4 of Poisson’s
     textbook [416]).
        9
         This is not true for a rotating Kerr black hole: ξ becomes null at some “light-cylinder” outside H and is then
     spacelike away from it, cf. Eq. (10.68), where ξ is denoted by χ.
                                       3.4 Bifurcate Killing horizons                                             83
Figure 3.7: Bifurcate Killing horizon H1 ∪ H2 with respect to the Killing vector field ξ; S is the bifurcation
surface. L1 and L2 are null geodesic generators of respectively H1 and H2 , which cross each other at the point
p ∈ S.
• S := H1 ∩ H2 is a spacelike (n − 2)-surface;
      • each of the sets H1 \ S and H2 \ S has two connected components, which are
        Killing horizons with respect to ξ.
   Hence we may say that a bifurcate Killing horizon is formed by four Killing horizons, H1+ ,
H1− , H2+ and H2− say, which are glued together at the bifurcation surface S (cf. Fig. 3.7), in
such a way that
The Killing vector field vanishes on the bifurcation surface of a bifurcate Killing horizon:
ξ|S = 0 . (3.77)
     Proof. Let p ∈ S and let us assume that ξ|p ̸= 0. Let L1 (resp. L2 ) be the null geodesic
     generator of H1 (resp. H2 ) that intersects S at p (cf. Fig. 3.7). By definition of a Killing horizon,
     ξ is tangent to L1 ∩ H1+ and to L1 ∩ H1− , i.e. to L1 \ {p}. If ξ|p ̸= 0, then by continuity, ξ
     is a (non-vanishing) tangent vector field all along L1 . Similarly, ξ is tangent to all L2 . At their
     intersection point p, the geodesics L1 and L2 have thus a common tangent vector, namely ξ|p .
     The geodesic uniqueness theorem (Property B.10 in Appendix B) then implies L1 = L2 , so
     that L1 ⊂ H1 ∩ H2 = S . But since S is spacelike and L1 is null, we reach a contradiction.
     Hence we must have ξ|p = 0.
     Remark 1: Having a Killing vector field that vanishes somewhere (here S ) is not the sign of any
     pathology: it simply means that the points of S are fixed points of the isometries generated by ξ, since
     setting ξ = 0 in Eq. (3.14) leads to dx = 0, i.e. to Φdt (p) = p.
     Remark 2: Contrary to what the name may suggest, a bifurcate Killing horizon is not a Killing horizon,
     for the latter, as defined in Sec. 3.3.2, is a regular (i.e. embedded) hypersurface of M (cf. Sec. A.2.7 in
     Appendix A), while the union of two hypersurfaces is not in general a hypersurface. Moreover on a
     Killing horizon, the Killing vector field is nowhere vanishing [cf. Eq. (3.21)], while on a bifurcate Killing
     horizon, it is vanishing at the bifurcation surface.
     Example 19 (Lorentz-boost bifurcate Killing horizon): Let us consider Minkowski spacetime and
     the Killing vector given by Eq. (3.23): ξ := x∂t + t∂x , namely the generator of Lorentz boosts in the
     (t, x) plane. Let us take for H1 the null hyperplane t = x considered in Example 7 and denoted there by
     H . The two half-hyperplanes defined by Eq. (3.24) are then the Killing horizons H1+ and H1− . The
     union H1 ∪ H2 , where H2 is the null hyperplane t = −x is a bifurcate Killing horizon with respect
     to ξ, with the 2-plane (t, x) = (0, 0) as bifurcation surface (cf. Fig. 3.8). Note that on H1 , the Killing
     vector ξ points away from S , while on H2 , it points towards S , as in the generic figure 3.7.
Figure 3.8: Bifurcate Killing horizon H1 ∪ H2 with respect to the Killing vector ξ = x∂t + t∂x generating
Lorentz boosts in the plane (t, x) of Minkowski spacetime. The dimension along z having been suppressed, the
bifurcation surface S appears as a line, while it is actually a 2-plane.
  Let H be a Killing horizon (with respect to a Killing vector ξ) of constant nonzero surface
  gravity κ. Let t be the parameter of the null geodesic generators L of H associated to ξ
  (ξ = dx/dt along L ). The null vector field ℓ defined on H by
                                                   eκt
                                             λ=        + λ0 ,                                      (3.79)
                                                    κ
  where λ0 is constant along a given geodesic generator.
Proof. We have
Hence ℓ is a geodesic vector. Besides, along any null generator of H , one has [cf. Eq. (A.8)]
                                     dλ
                                        = ξ(λ) = eκt ℓ(λ) = eκt ,
                                     dt              |{z}
                                                           1
         Let us assume κ > 0 and consider a null geodesic generator L of H ; L can be param-
     eterized by t, the corresponding tangent vector being ξ. When t spans the whole interval
     (−∞, +∞), Eq. (3.79) implies that λ spans the interval (λ0 , +∞) only. Since λ is an affine pa-
     rameter of L , this means that L is an incomplete geodesic (cf. Sec. B.3.2). Moreover, Eq. (3.78)
     leads to
                                ξ → 0 when t → −∞              (κ > 0).                         (3.80)
     In other words, the Killing vector field ξ vanishes and the null geodesic L stops at the “edge”
     of H corresponding to t → −∞. If there is no obstacle (spacetime singularity or spacetime
     edge10 ), L can be extended to λ ∈ (−∞, λ0 ], giving rise to a complete null geodesic L˜. This
     operation can be performed for all the null geodesic generators of H and we have the freedom
     to choose the same value of λ0 in Eq. (3.79) for all of them. In this process, one gives birth to
     a null hypersurface, H˜ say, which contains H . Let S ⊂ H˜ be the set of points of affine
     parameter λ = λ0 along all the extended null geodesics L˜. S is clearly a cross-section of
     H˜ (cf. Sec. 2.3.4); it is then a spacelike (n − 2)-dimensional surface. Assuming that ξ is
     future-directed on H , S constitutes the past boundary of H , i.e. the boundary corresponding
     to t → −∞. Since ξ is a smooth vector field on M , Eq. (3.80) implies that ξ vanishes on S . In
     other words, S is a set of fixed points for the isometry group generated by ξ (cf. Remark 1
     above). Let us denote by H − the subset of H˜ generated by the segments λ < λ0 of the null
     geodesics L˜: H − = H˜ \ (H ∪ S ). H − is clearly a null hypersurface. Since S is spacelike
     and (n − 2)-dimensional, there are, at each point p ∈ S , only two null directions normal to
     S (cf. Sec. 2.3.4). One of them is along ℓ. The set of all null geodesics departing from S along
     the other null direction forms a null hypersurface, H2+ say, in the future of S and another
     null hypersurface, H2− say, in the past of S . By studying the behavior of a Killing vector field
     around the set of its fixed points (here S ), Boyer [71] has shown that in the current setting
     (i.e. S spacelike), ξ acts locally as the generator of Lorentz boosts in Minkowski spacetime
     and S is the bifurcation surface of a bifurcate Killing horizon similar to that of Example 19 (cf.
     Fig. 3.8). More precisely, Boyer proved the following:
The reasoning developed for κ > 0 can be applied mutatis mutandis, leading to a bifurcate
Killing horizon with a bifurcation surface S that is the future boundary of H . Hence we
conclude:
Remark 3: It could be that the generators of H cannot be extended simply because the spacetime
(M , g) is not been chosen “large enough”. I. Rácz and R.M. Wald [422] have shown that under some
mild hypotheses (in particular that (M , g) is globally hyperbolic), one can extend (M , g) to a larger
spacetime with a bifurcate Killing horizon containing H .
Remark 4: For a degenerate Killing horizon, the problem of extension disappears, since t is then an
affine parameter of the null generators. Consequently if t spans the whole interval (−∞, ∞), the null
generators are complete geodesics. One can still have ξ → 0 at some boundary of H , but this is a
null boundary, not a spacelike one, and it does not correspond to a bifurcation surface. An example is
the Killing horizon with respect to a null-rotation Killing vector in Minkowski spacetime, exhibited as
Examples 8 and 13, p. 67 and 75 respectively (cf. Fig. 3.5): ξ = 0 on the null 2-plane of equation t = x,
y = 0.
Historical note : The concept of bifurcate Killing horizons has been introduced by Robert H. Boyer
(1932-1966), a young American mathematical physicist who had just been appointed to the University of
Liverpool. Sadly, Boyer was killed, among 14 victims, in a mass murder that occurred in the University
of Texas at Austin on 1 August 1966. His last notes, containing the definition of a bifurcate Killing
horizon and the proof of Property 3.14, have been turned into an article by Jürgen Ehlers and John
Stachel, which has been published in 1969 [71].
  The surface gravity is a nonzero constant over any Killing horizon that is part of a bifurcate
  Killing horizon:
                                        κ = const ̸= 0.                                    (3.82)
For any Killing vector ξ, the following identity, often called the Kostant formula, holds:
∇γ ∇β ξ α = Rαβγµ ξ µ , (3.83)
       where Rαβγµ is the Riemann curvature tensor of g. The contraction on the indices α and γ
       lets the Ricci tensor appear:
                                        ∇µ ∇α ξ µ = Rαµ ξ µ .                             (3.84)
κ (or more precisely the limit to S of H ’s surface gravity κ) is uniform over S . Each null
generator L of H is an incomplete geodesic, which once extended to a complete geodesic,
intersects S at single point. The constancy of κ on S implies then that κ cannot vary from
one null generator to the other. κ is thus uniform over H .
Example 20 (Lorentz-boost bifurcate Killing horizon): Let us consider the bifurcate Killing horizon
Hˆ of Minkowski spacetime discussed in Example 19 and illustrated in Fig. 3.8. The corresponding Killing
vector is the Lorentz-boost generator ξ := x∂t + t∂x . An easy computation yields ∇ξ ξ = t∂t + x∂x .
On the components H1+ and H1− of Hˆ (cf. Fig. 3.8), one has t = x, so that ξ = t(∂t + ∂x ) and
∇ξ ξ = t(∂t + ∂x ). Hence ∇ξ ξ = ξ and κ = 1 there. On the components H2+ and H2− of Hˆ (cf.
Fig. 3.8), one has t = −x, so that ξ = t(−∂t + ∂x ) and ∇ξ ξ = t(∂t − ∂x ). Hence ∇ξ ξ = −ξ and
κ = −1 there. We conclude that κ is constant over any of the four Killing horizons constituting Hˆ .
Remark 5: As the above example shows, the surface gravity is not uniform over all the bifurcate Killing
horizon Hˆ ; it keeps the same value only on the subsets H − ∪ H + and H − ∪ H + .
                                                             1       1         2       2
Historical note : The proof of the zeroth law for Killing horizons that are part of a bifurcate Killing
horizon (Property 3.16) has been first presented by Bernard Kay and Robert Wald in 1991 [310]; these
authors mentioned that the proof had been suggested to them by Robert Geroch. The proof appears also
in the notes from a lecture given by Wald at a school held in Erice in May 1991 [501]. It is also presented
in Poisson’s textbook [416].
3.5 Summary
Here is an inheritance diagram summarizing the main results of this chapter. The vertical
arrow means “is a”, i.e. the item at the bottom of an arrow is a special case of the item at the
top of the arrow. “NCC” stands for “Null Convergence Condition” [Eq. (2.94)], “NDC” for “Null
Dominance Condition” [Eq. (3.43)] and “BKH“ for “Bifurcate Killing Horizon” (Sec. 3.4.1).
90   The concept of black hole 2: Non-expanding horizons and Killing horizons
                                   Null hypersurface
                                 null geodesic generators
                                        ∇ℓ ℓ = κℓ
                                 Non-expanding horizon
                              closed-manifold cross-sections
                                         θ(ℓ) = 0
                           area independent of the cross-section
                          NCC =⇒ Θ = 0
                                 =⇒ induced affine connection
                                     Killing horizon
                            with closed-manifold cross-sections
                                         Θ=0
                             NDC =⇒ κ = const (Zeroth Law 1)
                       part of BKH =⇒ κ = const ̸= 0 (Zeroth Law 2)
Chapter 4
Contents
         4.1   Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   91
4.1     Introduction
Having attempted in Chaps. 2 and 3 to characterize a black hole by the local properties of
its boundary, we turn now to the general definition of a black hole. As it could have been
anticipated from the naive “definition” given in Sec. 2.2.1, the mathematically meaningful
definition of a black hole cannot be local: it has to take into account the full spacetime structure,
in particular its future asymptotics. Indeed, to conclude firmly that a particle has escaped from
a given region, one has to wait until the “end of time” to make sure that the particle will never
be back...
    In this chapter, we therefore consider the global spacetime picture to arrive at the general
definition of a black hole in Sec. 4.4. This amounts to focusing on the spacetime asymptotics,
which can be seen as the region where the “distant observers” live and may, or may not, receive
light signal from some “central region”. This far-away structure is best described in terms of the
so-called conformal completion, which brings the spacetime infinity(ies) to a finite distance in
another manifold — a technique initiated by R. Penrose [403, 404] (see Refs. [198] and [382] for
a review). We start by investigating the conformal completion of the simplest of all spacetimes:
Minkowski spacetime.
92                        The concept of black hole 3: The global view
                                       t
                                 4.0
                                 3.5
                                 3.0
                                 2.5
                                 2.0
                                 1.5
                                 1.0
                                 0.5
                                                                                     r
                                           0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0
Figure 4.1: Lines of constant null coordinates u (solid) and v (dashed) in terms of the coordinates (t, r).
     and the associated spherical coordinates (t, r, θ, φ), which are related to the Minkowskian ones
     by
                                          
                                           x = r sin θ cos φ
                                          
                                          
                                              y = r sin θ sin φ                                  (4.2)
                                          
                                          
                                              z = r cos θ.
                                          
     The coordinates (t, r, θ, φ) span R × (0, +∞) × (0, π) × (0, 2π); they do not cover the whole
     manifold M as a regular chart (cf. Sec. A.2.1 of Appendix A), but only M \ Π, where Π is the
     closed half hyperplane defined by y = 0 and x ≥ 0. Once expressed in terms of the spherical
     coordinates, the Minkowski metric (4.1) takes the form
                                                         U
                                                   1.5
                                                   1.0
                                                   0.5
                                                                                                u
                6            4            2        0.5             2           4            6
                                                   1.0
                                                   1.5
    Let us introduce the null coordinate system (u, v, θ, φ) where u and v are respectively the
retarded and advanced time defined by (cf. Fig. 4.1)
                                                
                            u=t−r                t = 1 (v + u)
                                                          2
                                           ⇐⇒                                              (4.4)
                            v =t+r               r = 1 (v − u).
                                                                   2
One has then du dv = dt2 − dr2 and the metric tensor (4.3) takes the shape
                                         1
                             g = −du dv + (v − u)2 dθ2 + sin2 θ dφ2 .                               (4.5)
                                                                   
                                         4
The coordinates (u, v) span the half part of R2 defined by u < v. In order to have coordinates
within a finite range, let us consider their arctangents (cf. Fig. 4.2):
                                                     
                             U = arctan u             u = tan U
                                                 ⇐⇒                                       (4.6)
                             V = arctan v             v = tan V.
Given that arctan is a monotonically increasing function (cf. Fig. 4.2), the coordinates (U, V )
span the half part of (−π/2, π/2) × (−π/2, π/2) defined by U < V :
                                 π     π            π     π
                            −      <U < ,      −      <V < ,           and   U < V.                 (4.7)
                                 2     2            2     2
       Since
                        dU                 dV                                      sin(V − U )
                du =          ,    dv =                  and   tan V − tan U =                 ,
                       cos2 U             cos2 V                                   cos U cos V
the Minkowski metric (4.5) is expressed in terms of the coordinates (U, V, θ, φ) as1
                             1                        2             2       2     2
                                                                                                    (4.8)
                                                                                   
                g=                     −4 dU dV + sin   (V − U ) dθ   + sin   θ dφ     .
                      4 cos2 U cos2 V
   1
       See also the SageMath notebook D.2.1.
94                         The concept of black hole 3: The global view
     Remark 1: The retarded/advanced times u and v have the dimension of a time, or of a length in the
     c = 1 units used here. Therefore, one should introduce some length scale, ℓ0 say, before taking their
     arctangent and rewrite (4.6) as
                                                            
                                 U = arctan(u/ℓ )            u = ℓ tan U
                                                   0                 0
                                                       ⇐⇒
                                 V = arctan(v/ℓ0 )           v = ℓ0 tan V.
     The coordinates (U, V ) are dimensionless and a global factor ℓ20 should be introduced in the right-hand
     side of Eq. (4.8). However, the length scale ℓ0 plays no essential role, so that, to keep simple notations, it
     is omitted in what follows. In other words, we are using units for which ℓ0 = 1.
                                                      g̃ = Ω2 g ,                                             (4.9)
     where Ω is the scalar field M → R+ obeying
                                       Ω = 2 cos U cos V                                                   (4.10a)
                                                    2
                                         =√          √                                                     (4.10b)
                                              u2 + 1 v 2 + 1
                                                         2
                                         =p               p             .                                  (4.10c)
                                              (t − r)2 + 1 (t + r)2 + 1
     We notice on (4.10b) and (4.10c) that the function Ω never vanishes on M , so that the bilinear
     form g̃ defined by (4.9) constitutes a well-behaved metric on M . Moreover, since Ω2 > 0, g̃
     has the same signature as g, i.e. (−, +, +, +). The expression of g̃ is deduced from (4.8) and
     (4.10a):
                          g̃ = −4 dU dV + sin2 (V − U ) dθ2 + sin2 θ dφ2 .                     (4.11)
                                                                             
         In view of (4.9), one says that the metric g̃ is conformal to the metric g, or equivalently,
     that the metrics g and g̃ are conformally related, or that g̃ arises from g via a conformal
     transformation. The scalar field Ω is called the conformal factor.
         A key property of a conformal transformation is to preserve orthogonality relations, since
     (4.9) clearly implies, at any point p ∈ M ,
                        ∀(u, v) ∈ Tp M × Tp M ,          g̃(u, v) = 0 ⇐⇒ g(u, v) = 0.
     In particular, null vectors for g̃ coincide with null vectors for g:
                                 ∀ℓ ∈ Tp M ,      g̃(ℓ, ℓ) = 0 ⇐⇒ g(ℓ, ℓ) = 0.
     Consequently the light cones of (M , g) and (M , g̃) are identical, which implies that (M , g)
     and (M , g̃) have the same causal structure. Moreover, since Ω2 > 0, the spacelike and timelike
     characters of vectors is preserved as well:
It follows that a curve L is timelike (resp. null, spacelike) for g̃ iff L is timelike (resp. null,
spacelike) for g. Similarly, a hypersurface Σ is timelike (resp. null, spacelike) for g̃ iff Σ is
timelike (resp. null, spacelike) for g.
     What about geodesics? Let us first recall that a null curve is not necessarily a null geodesic
(cf. Remark 2 on p. 35 and Appendix B), so that one cannot deduce from the above results that
conformal transformations preserve null geodesics. However, this turns out to be true:
  Let g and g̃ be two Lorentzian metrics on a manifold M that are conformally related:
  g̃ = Ω2 g. A smooth curve L in M is a null geodesic for g̃ iff L is a null geodesic for g;
  any affine parameter λ̃ of L as a g̃-geodesic is then related to any affine parameter λ of
  L as a g-geodesic by
                                         dλ̃
                                             = a Ω2 ,                                 (4.13)
                                         dλ
  where a is a constant.
Sketch of proof. Write explicitly the geodesic equation [Eq. (B.10)] and express the Christoffel
symbols of g̃ in terms of those of g and the derivatives of Ω (see e.g. Appendix D of Wald’s
textbook [499] for details).
    On the contrary, conformal transformations preserve neither timelike geodesics nor space-
like ones.
    The coordinates (U, V ) are of null type; let us consider instead the “time+space” coordinates
(τ, χ) defined by2                                  
                           τ =V +U                   U = 1 (τ − χ)
                                                              2
                                              ⇐⇒                                             (4.14)
                           χ=V −U                    V = 1 (τ + χ).
                                                              2
                                               τ
                                               i+
                                           3
2 +
                                                                         i0 χ
                                                   0.5 1.0 1.5 2.0 2.5 3.0
                                                               −
                                           2
                                           3
                                               i−
     Figure 4.3: Conformal diagram of Minkowski spacetime. Constant-r curves are drawn in red, while constant-t
     ones are drawn in grey. [Figure generated by the notebook D.2.1]
E = R × S3 (4.19)
     and g̃ as the Lorentzian metric on E given by (4.18). The Lorentzian manifold (E , g̃) is nothing
     but the Einstein static universe, also called the Einstein cylinder, a static solution to
     the Einstein equation (1.40) with Λ > 0 and some pressureless matter of uniform density
     ρ = Λ/(4π). We have thus an embedding3 of Minkowski spacetime into the Einstein cylinder
     (cf. Fig. 4.4):
                                             Φ : M −→ E                                          (4.20)
        3
            Cf. Sec. A.2.7 of Appendix A
                          4.2 Conformal completion of Minkowski spacetime                                               97
Figure 4.4: Two views of the Einstein cylinder E , with the conformal embedding of Minkowski spacetime M
in it. Each view represents a 2-dimensional cut of E at θ = π/2 and φ = 0 (one half of the plotted cylinder) or
φ = π (the other half). As a standard minor abuse, we authorize here φ = 0 as a valid coordinate value, while we
had excluded it from the definition of the chart (t, r, θ, φ) on M given in Sec. 4.2.1. The S3 sections of E are then
depicted as horizontal circles, which are made of two half-circles corresponding to φ = 0 and φ = π respectively,
with χ running from 0 to π on both on them. The plotted piece of M is then the 2-dimensional cut (y, z) = (0, 0)
of M , which is a timelike plane spanned by the coordinates (t, x), with x ≥ 0 on the φ = 0 half cylinder and
x ≤ 0 on the φ = π half one. The red curves are the same constant-r curves as in Fig. 4.3, while the black curves
are the same constant-t curves as those drawn in grey in Fig. 4.3. [Figure generated by the notebook D.2.1]
and this embedding is a conformal isometry from (M , g) to (Φ(M ), g̃). In the following, we
shall identify Φ(M ) and M , i.e. use the same symbol M to denote the subset of E that is the
image of M via the embedding (4.20).
    Since E and M have the same dimension, M is an open subset of E . Its (topological)
closure M in E is (cf. Figs. 4.3 and 4.4)
                        M = M ∪ I + ∪ I − ∪ i0 ∪ i+ ∪ i− ,                             (4.21)
                                                            
where
     Figure 4.5: Conformal factor Ω as a function of (τ, χ) [cf. Eq. (4.17)]. Only the part above the yellow horizontal
     plane (Ω = 0) is physical. [Figure generated by the notebook D.2.1]
     Note that since the coordinates (τ, χ, θ, φ) are singular at χ = 0 and χ = π, each of the
     above conditions (τ, χ) = (0, π), (τ, χ) = (π, 0) and (τ, χ) = (−π, 0) do define a point and
     not a 2-surface of E , in the same way as θ = 0 defines a point (the North pole) on S2 , while
     θ = π/4 defines a circle. It is customary to pronounce I as “scri”, for script i. Note that in
     the above definitions, we have extended the coordinates (U, V ) to E by the transformations
     (4.14); (U, V, θ, φ) can be then considered as a chart on E with (U, V ) spanning the infinite
     strip 0 < V − U < π of R2 and the metric g̃ on E being given by expression (4.11).
     Remark 2: As we shall detail below (Sec. 4.3.1), it is precisely because Ω vanishes at the topological
     boundary of M in E (cf. Fig. 4.5),
                               M \ M = I + ∪ I − ∪ i0 ∪ i+ ∪ i− ,                                     (4.22)
                                                                        
that the conformal transformation (4.9) brings the infinity of Minkowski spacetime to a finite distance.
                                                           ∂ M˜ = I .                                           (4.25)
        4
            Cf. Sec. A.2.2 for the precise definition.
                        4.2 Conformal completion of Minkowski spacetime                                         99
                                          τ
                                          i+
                                      3
2 +
                                                                    i0 χ
                                              0.5 1.0 1.5 2.0 2.5 3.0
                                                          −
                                      2
                                      3
                                          i−
Figure 4.6: Null radial geodesics in the conformal diagram of Minkowski spacetime. The solid green lines are
null geodesics u = const for 17 values of u uniformly spanning [−8, 8], while the dashed green lines are null
geodesics v = const for 17 values of v uniformly spanning [−8, 8]. [Figure generated by the notebook D.2.1]
Remark 4: Because the closure M is self-intersecting at the point i0 (cf. Fig. 4.4), it is not a manifold
with boundary: no open neighborhood of i0 is homeomorphic to a neighborhood of H4 = R3 × [0, +∞),
as the definition of a manifold with boundary would require, cf. Sec. A.2.2. At the points i+ and i− , M
can be considered as a topological manifold with boundary, but not as a smooth manifold with boundary.
Hence, the three points i0 , i+ and i− are excluded from the definition of the manifold with boundary
M˜.
    The hypersurface I + is the location of M˜ where all radial null geodesics terminate, while
I − is the location of M˜ where all these geodesics originate (cf. Fig. 4.6). For this reason I + is
called the future null infinity of (M , g) and I − the past null infinity of (M , g). On the
other side, any timelike geodesic of (M , g) originates at i− and ends at i+ (cf. Fig. 4.3), while
any spacelike geodesic of (M , g) originates at i0 and terminates there (after having completed
a closed path on S3 , cf. Fig. 4.4). The point i+ is then called the future timelike infinity of
(M , g), i− the past timelike infinity of (M , g) and i0 the spacelike infinity of (M , g).
Property 4.2
      spanning (−π/2, π/2). The metric induced by g̃ on I + is easily obtained by setting V = π/2
      in Eq. (4.11):
                                 g̃|I + = cos2 U dθ2 + sin2 θ dφ2 .                          (4.26)
                                                                   
      It appears clearly that the signature of this metric is (0, +, +), i.e. it is degenerate; hence I + is
      a null hypersurface of (M˜, g̃). Similarly, I − being defined by U = −π/2, a valid coordinate
      system on I − is (V, θ, φ) with V spanning (−π/2, π/2) and the metric induced by g̃ on I −
      is obtained by setting U = −π/2 in Eq. (4.11):
         The null character of I + and I − appears also clearly in the conformal diagrams of Figs. 4.3
      and 4.6, since I + and I − are straight lines of slope ±1 in these diagrams.
      Historical note : The idea of using a conformal transformation to treat infinity as a boundary “at
      a finite distance” has been put forward by Roger Penrose in 1963 [403] and expanded in 1964 in the
      seminal paper [404], where Penrose constructed the conformal completion of Minkowski spacetime as a
      part of the Einstein cylinder. In particular, Fig. 3 of Ref. [404] is equivalent to Fig. 4.4.
2. on M , g̃ = Ω2 g;
3. on I , Ω = 0;
4. on I , dΩ ̸= 0.
        I is called the conformal boundary of (M , g) within the conformal completion (M˜, g̃).
            a
              As stressed in Remark 5 in Sec. A.2.2, the set ∂ M˜ is the boundary of M˜ as a manifold with boundary; it
        is not the boundary of M˜ as a topological space, the latter being ∅.
                             4.3 Conformal completions and asymptotic flatness                                  101
Condition 1 expresses that M has been endowed with some boundary. A rigorous formulation
of it would be via an embedding Φ : M → M˜, as in Eq. (4.20), so that M˜ = Φ(M ) ∪ I .
However, as above, we identify Φ(M ) with M and therefore simply write M˜ = M ∪ I .
Conditions 2 and 3 express that the boundary of M , which “lies at an infinite distance” with
respect to g, has been brought to a finite distance with respect to g̃. Indeed, in terms of length
elements [cf. Eq. (1.3)], condition 2 implies
                                                              1
                                                     ds2 =      2
                                                                  ds̃2
                                                              Ω
with 1/Ω2 → +∞ as one approaches I (condition 3). Finally, condition 4 ensures that I is a
regular hypersurface of M˜. It is of course fulfilled by Minkowski spacetime, as we can check
graphically on Fig. 4.5: the graph of Ω has no horizontal slope at I .
Remark 1: The statement that (M˜, g̃) is a Lorentzian manifold with boundary implies that g̃ is smooth
everywhere on M˜, including at the boundary I .
Remark 2: Since g̃ is a metric, it is by definition non-degenerate and condition 2 implies that Ω cannot
vanish on M . Being non-negative, we have necessarily Ω > 0 on M .
Remark 3: The conformal boundary I is not part of the physical spacetime M , but only of the
conformal completion M˜.
Remark 4: One often speaks about conformal compactification instead of conformal completion, but in
general M˜ is not a compact manifold. For instance, the completion M˜ of Minkowski spacetime defined
by Eq. (4.24) is not compact, because the points i+ , i− and i0 have been omitted in the construction of
M˜.
Example 1 (conformal completion of AdS4 spacetime): The 4-dimensional anti-de Sitter spacetime
(M , g) has been introduced in Example 18 of Chap. 3. The metric tensor expressed in the conformal
coordinates (τ, χ, θ, φ) is given by Eq. (3.50):
                                    ℓ2 
                                        −dτ 2 + dχ2 + sin2 χ dθ2 + sin2 θ dφ2 ,                        (4.28)
                                                                             
                            g=       2
                                  cos χ
where τ ∈ R, χ ∈ (0, π/2), θ ∈ (0, π), φ ∈ (0, 2π) and the constant length scale ℓ is related to the
negative cosmological constant Λ by ℓ2 = −3/Λ. Defining Ω := ℓ−1 cos χ, we notice that a conformal
completion of (M , g) is (M˜, g̃) where (i) M˜ is the part χ ≤ π/2 of the Einstein cylinder5 introduced
in Sec. 4.2.3 and (ii) g̃ is the metric (4.18). The boundary I = ∂ M˜ is then the hypersurface χ = π/2 of
the Einstein cylinder (cf. Fig. 4.7); I is spanned by the coordinates (τ, θ, φ) and its topology is that of a
3-dimensional cylinder: I ≃ R × S2 . We notice that conditions 3 and 4 of the definition of a conformal
completion are satisfied: Ω = ℓ−1 cos χ = 0 at I and dΩ = −ℓ−1 sin χ dχ = −ℓ−1 dχ ̸= 0 at I . The
metric induced by g̃ on I is obtained by setting χ = π/2 in (4.18): −dτ 2 + dθ2 + sin2 θ dφ2 . This
3-metric is clearly Lorentzian, which shows that I is a timelike hypersurface of (M˜, g̃).
   The above example shows that I is not necessarily a null hypersurface, as it is for
Minkowski spacetime (cf. Sec. 4.2.3). Actually the causal type of I is determined by the
cosmological constant, as follows:
   5
       Recall that on the Einstein cylinder the range of χ is (0, π), cf. Eq. (4.15).
102                              The concept of black hole 3: The global view
      Figure 4.7: Conformal completion of AdS4 spacetime, depicted on the Einstein cylinder. The conformal
      boundary I is shown in yellow, red lines are lines χ = const (uniformly sampled in terms of tan χ = sinh ρ),
      green curves are radial null geodesics and the purple curve is a radial timelike geodesic, bouncing back and forth
      around χ = 0. [Figure generated by the notebook D.3.2]
        If the spacetime dimension obeysa n ≥ 3 and g is a solution to the Einstein equation with
        a cosmological constant Λ [Eq. (1.40)] and the trace T of the energy-momentum tensor
        tends to zero in the vicinity of I (i.e. when Ω → 0), then
      Proof. It follows from g̃ = Ω2 g that the Ricci scalars R̃ and R of respectively g̃ and g are
      related by6                                                            
                           2                       µν ˜ ˜           µν
                          Ω R̃ = R − (n − 1) 2Ω g̃ ∇µ ∇ν Ω − n g̃ ∂µ Ω∂ν Ω ,                  (4.29)
This equation is a priori valid in M = M˜ \ I only. Taking the limit Ω → 0 and assuming
that T → 0 in that limit, we get, by continuity, an identity on I :
                                                  I             2
                                     g̃ µν ∂µ Ω∂ν Ω = −                  Λ.                        (4.30)
                                                          (n − 1)(n − 2)
Since I corresponds to a constant value of the scalar field Ω (Ω = 0), the left-hand side of
this equation is nothing but the scalar square g̃(n, n) of the vector n normal to I defined as
the dual with respect to g̃ of the 1-form dΩ: nα = g̃ αµ ∂µ Ω (remember that by hypothesis 4
in the definition of a conformal completion, dΩ is non-vanishing on I , so that n is a valid
normal vector to I ). Equation (4.30) implies that the sign of g̃(n, n) is the opposite of that of
Λ. Given the link between the causal type of a hypersurface and the causal type of its normal
(cf. Sec. 2.2.2), this completes the proof.
    One may distinguish two subparts of the conformal boundary:
Remark 5: The above definitions of I + and I − generalize those given for the Minkowski spacetime
in Sec. 4.2.3. Note that, contrary to the Minkowski case, I + and I − are not null for spacetimes
with a non-zero cosmological constant (cf. Property 4.3). In particular, the following examples exhibit
respectively timelike and spacelike I + and I − .
Example 2 (Future and past infinities of AdS4 spacetime): Let us consider the conformal completion
of AdS4 discussed in Example 1. It is evident from the behavior of radial null geodesics (the green curves
plotted in Fig. 4.7) that any point of I can be connected to M by a future-directed null geodesic as
well as by a past-directed one. It follows that I + = I − = I and both I + and I − are timelike
hypersurfaces.
      where ℓ is a positive constant. Note that t spans R while (χ, θ, φ) are standard polar coordinates on
      S3 : χ ∈ (0, π), θ ∈ (0, π) and φ ∈ (0, 2π). The metric (4.31) is a solution of the vacuum Einstein
      equation (1.43) with the positive cosmological constant Λ = 3/ℓ2 . Using coordinates (τ, χ, θ, φ) with
      τ := 2 arctan(tanh(t/2)) ∈ (−π/2, π/2), one gets
                                     ℓ2 
                                         −dτ 2 + dχ2 + sin2 χ dθ2 + sin2 θ dφ2 .                            (4.32)
                                                                              
                              g=      2
                                   cos τ
      Defining Ω := ℓ−1 cos τ = (ℓ cosh t)−1 , we notice that a conformal completion of (M , g) is (M˜, g̃)
      where (i) M˜ is the part −π/2 ≤ τ ≤ π/2 of the Einstein cylinder introduced in Sec. 4.2.3 and (ii)
      g̃ is the metric (4.18). The boundary I = ∂ M˜ has two connected components: I + , which is the
      hypersurface τ = π/2 of M˜, and I − , which is the hypersurface τ = −π/2. Both I + and I −
      are spanned by the coordinates (χ, θ, φ) and their topology is that of S3 . We notice that conditions
      3 and 4 of the definition of a conformal completion are satisfied: Ω = ℓ−1 cos τ = 0 at I and
      dΩ = −ℓ−1 sin τ dτ = ±ℓ−1 dτ ̸= 0 at I . The metric       induced by g̃ on I is obtained by setting
      τ = ±π/2 in (4.18): dχ + sin χ dθ + sin θ dφ . This 3-metric is clearly Riemannian (this is
                                2       2       2      2     2
      actually the standard round metric of S3 ), which shows that I is a spacelike hypersurface of (M˜, g̃).
      This of course agrees with Property 4.3, given that Λ > 0. Finally, it is clear that I + (resp. I − ) matches
      the definition of a future (resp. past) infinity given above. We conclude that (M˜, g̃) is a conformal
      completion at infinity of de Sitter spacetime.
that the full Riemann curvature tensor vanishes at I [cf. Eq. (A.114)], hence the qualifier
asymptotically flat.
   The following property holds:
Proof. Consider Eq. (4.29). Near I , we have R = 0 by the very definition of asymptotic flatness.
                                               I
The limit Ω → 0 results then in g̃ µν ∂µ Ω∂ν Ω = 0, which, following the argument in the proof
on p. 102, implies that I is a null hypersurface.
   • the chronological future of S as the set I + (S) of all points of M that can be reached
     from a point of S by a future-directed timelike curve of nonzero extent;
   • the causal future of S as the set J + (S) of all points that either are in S or can be
     reached from a point of S by a future-directed causal curve;
   • the chronological past of S as the set I − (S) of all points of M that can be reached
     from a point of S by a past-directed timelike curve of nonzero extent;
   • the causal past of S as the set J − (S) of all points that either are in S or can be reached
     from a point of S by a past-directed causal curve.
From the above definitions, one has always S ⊂ J ± (S) and I ± (S) ⊂ J ± (S).
Remark 1: One has not necessarily S ⊂ I ± (S). For instance, if M does not contain any closed timelike
curve, one has S ∩ I ± (S) = ∅ for S = {p} with p being any point of M .
    Here are some topological properties of the future and past sets defined above (see e.g. § 6.2
of [266] or Chap. 14 of [390] for proofs):
• I ± (S) is always an open subset7 of M , while J ± (S) is not necessarily a closed subset.
• It follows from (4.33) and (4.34) that both sets share the same (topological) boundary:
      The subset of the causal future (resp. past) of S formed by points that cannot be connected to
      S by a timelike curve is called the future horismos (resp. past horismos) of S and is denoted
      by E + (S) (resp. E − (S)):
      The horismos E ± (S) is formed by null geodesics emanating from points in S (cf. Proposi-
      tion 4.5.10 of Ref. [266]). One has E ± (S) ⊂ ∂J ± (S). The spacetime (M , g) is said to be
      causally simple iff for every compact set K ⊂ M , E ± (K) = ∂J ± (K). This is equivalent to
      saying that J + (K) and J − (K) are closed subsets of M .
B := M \ (J − (I + ) ∩ M ) . (4.37)
      The black hole region is thus the set of points of M from which no future-directed causal curve
      in M˜ reaches I + .
         8
         Let us recall that a geodesic is complete iff its affine parameters range through the whole of R, cf. Sec. B.3.2 in
      Appendix B. In particular, such a geodesic is inextendible.
                                                    4.4 Black holes                                          107
Figure 4.8: The black hole region B defined as the complement of the causal past of the future infinity,
J − (I + ).
Example 5: The Minkowski spacetime contains no black hole, for all future-directed null geodesics
terminate at I + (cf. Fig. 4.6). More generally, any asymptotically simple spacetime contains no black
hole (cf. Sec. 4.3.2).
Example 6: The prototype of a black hole is the Schwarzschild black hole; it will be shown in Sec. 6.4
that the Schwarzschild spacetime contains a region B that fulfills the above definition of a black hole
region.
Remark 2: If we release the assumption of I + -completeness in the above definition, we may end
up with unphysical or “spurious” black holes. For instance, let us consider the conformal completion
of Minkowski spacetime (M , g) resulting from its embedding in the Einstein cylinder (E , g̃), as in
Sec. 4.2.3, keeping the same I − but defining I + as the hypersurface of E given by τ = π − χ and
0 < τ < π/2, instead of 0 < τ < π in Sec. 4.2.3. The manifold with boundary M˜ := M ∪ I + ∪ I − ,
equipped with the Einstein cylinder metric g̃, is then a conformal completion of (M , g) at null infinity.
With such a I + , the black hole region defined by (4.37) is non-empty, as shown in Fig. 4.9.
Remark 3: Some authors (in particular Hawking and Ellis [266]) define a black hole as a connected
component of Σ(τ ) ∩ B, where Σ(τ ) is a spacelike hypersurface that is a slice of the future development
of a partial Cauchy surface9 Σ(0) such that the closure in M˜ of the domain of dependence of Σ(0)
contains I + . According to such a definition, a black hole is a (n − 1)-dimensional object, while the
black hole region B defined above is a n-dimensional object.
   If B ̸= ∅, the boundary H of the black hole region is called the future event horizon (or
simply the event horizon when no ambiguity may arise):
H := ∂B . (4.38)
                 H = M \ (J − (I + ) ∩ M ) ∩ (J − (I + ) ∩ M ) = ∂(J − (I + ) ∩ M ).
   9
       The concepts of partial Cauchy surface and future development are defined in Sec. 10.8.3.
108                         The concept of black hole 3: The global view
      Figure 4.9: Spurious black hole region B in Minkowski spacetime resulting from a conformal completion with
      an incomplete I + . Compare with Fig. 4.6.
H = ∂J − (I + ) ∩ M . (4.39)
      In words: the future event horizon H is the part of the boundary of the causal past of the
      future infinity I + that lies in M (cf. Fig. 4.8). Note that thanks to property (4.35), we can
      write as well
                                         H = ∂I − (I + ) ∩ M .                                 (4.40)
      Example 7: The event horizon of the Schwarzschild black hole (Example 6) is nothing but the
      Schwarzschild horizon H considered in the examples of Chaps. 2 and 3.
      White hole
      By inverting past and future in the black hole definition (4.37), one defines the white hole
      region of a spacetime (M , g) with a conformal completion at infinity as the complement
      within M of the causal future of the past infinity I − :
W := M \ (J + (I − ) ∩ M ) . (4.41)
      The white hole region is thus the set of points of M from which no past-directed causal curve
      in M˜ reaches I − . The boundary of white hole region is called the past event horizon:
H − := ∂W = ∂J + (I − ) ∩ M . (4.42)
      Remark 4: The name white fountain is sometimes used instead of white hole. Actually, this name
      may seem better suited to describe the time symmetric of a black hole: one can fall into a hole, while
      one is expelled by a fountain.
                                                 4.4 Black holes                                                   109
Example 8: We shall encounter an example of white hole in the maximal extension of Schwarzschild
spacetime, to be discussed in Chap. 9 (cf. Sec. 9.4.4).
    The domain of outer communications is the part ⟨⟨M ⟩⟩ of M that lies neither in the
black hole region nor in the white hole one:
                  ⟨⟨M ⟩⟩ := M \ (B ∪ W ) = J − (I + ) ∩ J + (I − ) ∩ M .          (4.43)
                                                                  
The last equality, which is a direct consequence of the definitions of B and W , shows that the
domain of outer communications is the set of points from which it is possible to send a signal
to and to receive a signal from arbitrarily far regions. It also follows immediately from the
definitions of the two event horizons that the boundary of the domain of outer communications
is their union:
                                      ∂⟨⟨M ⟩⟩ = H ∪ H − .                                (4.44)
Historical note : The term event horizon has been introduced by Wolfgang Rindler in 1956 [435] in the
context of a single observer moving in some cosmological spacetime. Regarding the name black hole, the
standard story tells that it has been coined by John A. Wheeler in the end of 1967, following a suggestion
shouted from the audience during one of his conferences (cf. the account by Wheeler himself in Chap. 13
of Ref. [520]). However, a recent study [272] reveals that the expression black hole circulated as early
as 1963 at the first Texas Symposium on Relativistic Astrophysics held in Dallas, while discussing the
discovery of quasars, and could have been forged by Robert Dicke in some lecture given in 1961. In
any case, after have been put forward by Wheeler, the term black hole rapidly superseded the previous
names frozen star, collapsed star, or astre occlus (the latter still appearing along black holes in the title of
the proceedings of the famous 1972 Les Houches Summer School [164]). For instance, “black hole” was
used abundantly, albeit with quotes, in a review article by Roger Penrose published in 1969 [408]. In
the same article, Penrose defined the absolute event horizon as ∂J − (I + ) (actually ∂I − (I + ), but both
coincide, cf. Eq. (4.35)), i.e. essentially the identity (4.39), but he did not provide any formal definition of
the black hole region as M \ (J − (I + ) ∩ M ) [Eq. (4.37)]. It seems that the latter appears first in an
“instant-of-time” version (cf. Remark 3) in a seminal article by Stephen Hawking published in 1972 [260].
The expression domain of outer communications has been introduced in 1971 by Brandon Carter [93].
  A black hole event horizon H is an achronal set, i.e. no pair of points of H can be
  connected by a timelike curve of M .
     Note that in the definition of an achronal set, it is not demanded that the timelike curve
lies entirely in the set (for instance, the set can be discrete, so that no curve whatsoever lies in
it). Accordingly, an equivalent statement of Property 4.5 is: no timelike curve of M meets H
at more than one point. The proof of Property 4.5 relies on the following lemma:
110                          The concept of black hole 3: The global view
      Figure 4.10: Lemma 4.6: moving slightly the ends p and q of a timelike curve L necessarily results in another
      timelike curve L ′ .
        One can “move the ends” of any timelike curve “a little bit” and still get a timelike curve.
        More precisely, if two points p, q ∈ M are connected by a timelike curve, there exists
        a neighborhood U of p and a neighborhood V of q such that any point p′ ∈ U can be
        connected to any point q ′ ∈ V by a timelike curve.
      Proof. This is more or less evident on a spacetime diagram (cf. Fig. 4.10) and a formal proof
      can be found as Lemma 3 in Chap. 14 of O’Neill’s textbook [390].
      Proof of Property 4.5. Let us assume the negation of Property 4.5, i.e. that there exist two points
      p and q in H that are connected by a timelike curve L , with q lying in the future of p (cf.
                                               4.4 Black holes                                                 111
Figure 4.12: Proving that, for any p ∈ H , I − (p) ⊂ M \ B (left) and I + (p) ⊂ B (right).
Fig. 4.11). Invoking Lemma 4.6, let U and V be the neighborhoods of respectively p and q
within which one can deform L to a timelike curve. Let us choose p′ ∈ U ∩ B (B being the
black hole region) and q ′ ∈ V ∩ J − (I + ). Such a choice is always possible since p and q lie on
the boundary between B and J − (I + ) (cf. Fig. 4.11). Since q ′ ∈ J − (I + ), the timelike curve
linking p′ and q ′ can then be extended to the future in a causal curve L ′ reaching I + . This
implies p′ ∈ J − (I + ), which contradicts p′ ∈ B.
Proof. Let us first show that the chronological past of any point p ∈ H , I − (p) (cf. Sec. 4.4.1),
lies entirely in M \ B (the black hole exterior). For any q ∈ I − (p), we have by definition
p ∈ I + (q) and since I + (q) is an open set (cf. Sec. 4.4.1), this means that I + (q) is an open
neighborhood of p (cf. Fig. 4.12 left). Given that p lies on the boundary of B, we have necessarily
I + (q) ∩ (M \ B) ̸= ∅. Let then a ∈ I + (q) ∩ (M \ B) = I + (q) ∩ J − (I + ). By definition,
there exists a future-directed timelike curve from q to a and a future-directed causal curve from
a to I + , hence a future-directed causal curve from q to I + , which proves that q ∈ J − (I + ),
i.e. q ∈ M \ B. We conclude that I − (p) ⊂ M \ B. On the other hand, let us show that the
chronological future of p, I + (p), lies entirely in B. For any q ∈ I + (p), we have p ∈ I − (q)
(cf. Fig. 4.12 right). Since I − (q) is open, it constitutes an open neighborhood of p. Given that
p ∈ ∂B, we have then I − (q) ∩ B ̸= ∅ and we may pick a point a ∈ I − (q) ∩ B. There exists a
future-directed timelike curve from a to q. If q ̸∈ B, there would exist a future-directed causal
curve from q to I + and hence a future-directed causal curve from a to I + , which would
contradict a ∈ B. We conclude that q ∈ B and thus that I + (p) ⊂ B.
     For p ∈ H , let us consider a coordinate chart Φ : U → (−1, 1) × V , q 7→ (x0 , x1 , . . . , xn−1 )
where U is an open neighborhood of p and V is an open subset of Rn−1 , such that x0 (p) = 0
and the coordinate vector ∂0 is future-directed timelike. By choosing U sufficiently small, we
may ensure that the slice x0 = −1/2 lies in I − (p) and the slice x0 = 1/2 lies in I + (p) (cf.
Fig. 4.13). Any coordinate line defined by (x1 , . . . , xn−1 ) = (a1 , . . . , an−1 ), where a1 , ..., an−1
are n − 1 constants, is a timelike curve which connects the point (xα ) = (−1/2, a1 , . . . , an−1 )
112                          The concept of black hole 3: The global view
      Figure 4.13: Coordinate chart (xα ) in the vicinity of p ∈ H , such that ∂0 is future-directed timelike and the
      slice x0 = −1/2 (resp. x0 = 1/2) lies in I − (p) (resp. I + (p)). Any coordinate line along which x0 varies, such as
      the purple line, is timelike and intersects H at a single point h.
      of I − (p) to the point (xα ) = (1/2, a1 , . . . , an−1 ) of I + (p) (cf. the purple curve in Fig. 4.13).
      Given that I − (p) ⊂ M \ B, I + (p) ⊂ B and H = ∂B, such a curve necessarily intersects
      H . Moreover the intersection is reduced to a single point h ∈ H for the curve is timelike and
      H is achronal (Property 4.5). Let us then give the coordinates (y i ) = (a1 , . . . , an−1 ) to h. By
      varying (a1 , . . . , an−1 ), we get a homeomorphism from U ∩ H to the open subset V of Rn−1 .
      This makes H a topological manifold of dimension n − 1 (cf. Sec. A.2.1).
      Remark 5: Generically, the topological manifold H is not a smooth manifold, for it contains some
      points (the crossovers defined below) at which it is not differentiable. Actually H is slightly more than
      a mere topological submanifold of M : it is a Lipschitz submanifold of M . The latter is intermediate
      between a topological submanifold, i.e. a submanifold of class C 0 (continuous), and a differentiable
      submanifold of class C 1q    . On U ∩ H , the coordinate x0 is a Lipschitz function of the coordinates (y i ):
       x0 (y i ) − x0 (y ′ i ) < K     i (y − y ) . This follows from the achronal character of H : the points of
                                               ′i 2
                                     P i
      coordinates (y i ) and (y ′ i ) cannot have a too large separation in terms of x0 , otherwise they would be
      timelike separated. Hence, one says that H is a Lipschitz submanifold of M . The notation C 1− (i.e.
      a kind of intermediate between C 0 and C 1 ) is generally used to denote Lipschitz submanifolds.
          Beside being achronal topological submanifolds of codimension 1, the main property of
      black hole horizons is:
        Property 4.8: the event horizon ruled by never leaving null geodesics (Penrose 1968
        [407])
        A black hole event horizon H is ruled by a family of null geodesics, called the generators
        of H , that (i) either lie entirely in H or never leave H when followed into the future
        from the point where they enter H , and (ii) have no endpoint in the future. Moreover,
        there is exactly one generator through each point of H , except at special points where
        null geodesics enter H , which are called crossovers. A special case of crossover, called
        caustic, is a point where neighboring null geodesics focus and converge while arriving in
                                                 4.4 Black holes                                                  113
Figure 4.14: Lemma 4.9: A causal curve L containing a timelike segment (between a and b on the figure) can
be deformed into an entirely timelike curve L ′ with the ends kept fixed (dashed curve).
  H.
In particular, once a null geodesic has merged with H (at a point where it may intersect other
null geodesics), it will stay forever on H and will never intersect any other generator. The set
of all crossovers is called the crease set [457, 458, 74].
    The following proof of Property 4.8 is adapted from that given in Box 34.1 of MTW [371].
In addition of Lemma 4.6, it relies on the following lemma.
Lemma 4.9
Proof. We shall only provide a graphical “proof”, based on the spacetime diagram of Fig. 4.14.
The causal curve L may have parts where it is null (segments pa and bq in Fig. 4.14); these
parts are drawn with the angle of incline θ = ±45◦ with respect to the horizontal direction. If
L contains a timelike segment (such as ab in Fig. 4.14), i.e. a segment with |θ| > 45◦ , it can be
deformed, while keeping the same ends, to a curve with |θ| > 45◦ everywhere, i.e. to a timelike
curve.
Proof of Property 4.8. Let p ∈ H and let U be some convex normal neighborhood10 of p. Since
p lies in the boundary of J − (I + ), it is always possible to consider a sequence of points
(pn )n∈N converging toward p and such that ∀n ∈ N, pn ∈ U ∩ J − (I + ) (cf. Fig. 4.15). Since
pn ∈ J − (I + ), there exists a future-directed causal curve Ln from pn to I + for each n ∈ N.
The neighborhood U being convex, each Ln intersects its boundary ∂U at a unique point, qn
say, in the future of pn : {qn } = Ln ∩ ∂U (cf. Fig. 4.15). Since ∂U is compact, the sequence
  10
    Basically, a convex normal neighborhood is an open subset U such that any two points of U can be connected
by a geodesic lying entirely in U , cf. Sec. B.3.4. There always exists a convex normal neighborhood around any
point of a pseudo-Riemannian manifold (cf. Proposition 7 p. 130 in O’Neill’s textbook [390]).
114                        The concept of black hole 3: The global view
      (qn )n∈N admits a subsequence, (qf (n) )n∈N say (f being an increasing function N → N), that
      converges to some limit point q. Since from any point pf (n) arbitrarily close to p, there is the
      causal curve Lf (n) to the point qf (n) arbitrarily close to q, one can show that there exists a
      future-directed causal curve L connecting p to q (cf. Fig. 4.15; see e.g. Lemma 6.2.1 of Hawking
      & Ellis’ textbook [266] for a precise demonstration).
           As the limit of points in J − (I + ), q lies in the closure J − (I + ) = J − (I + ) ∪ H , H being
      the boundary of J − (I + ). Let us show by contradiction that actually q ∈ H . If we assume
      q ̸∈ H , then necessarily q ∈ J − (I + ). There exists then an open neighborhood V of q such
      that V ⊂ J − (I + ) (cf. Fig. 4.16). Let us choose q ′ ∈ V such that q is connected to q ′ via a
      timelike curve. We may then extend L to a causal curve L˜ from p to I + via q and q ′ (cf.
      Fig. 4.16). Since L˜ contains a timelike segment (between q and q ′ ), we may invoke Lemma 4.9
      to deform it into a timelike curve L˜′ between p and I + . Then, by Lemma 4.6, one can “move
      the past end” of L˜′ to get a new timelike curve L˜′′ linking an event p′ ∈ B close to p to I +
      (dotted curve in Fig. 4.16), which is impossible by the very definition of the black hole region
      B. Hence q ∈ H .
           The causal curve L connecting p to q cannot be timelike since p and q are both in H ,
      which is achronal (Property 4.5). Actually, L cannot even contain a timelike segment: if it
      would, then by Lemma 4.9, it could be deformed into a timelike curve between p and q, which
      again would contradict the achronal character of H . Hence L is necessarily a null curve.
      Moreover, it is a geodesic. Indeed, let us assume it is not. There is then some non-geodesic
      null segment of L , ab say. Now, as shown in Sec. B.4.3 of Appendix B, a curve from a to b is
      a geodesic iff any of its parametrizations P : [λa , λb ] → M , λ 7→ P (λ) ∈ L is a stationary
      point of the action
                                                           Z λb
                                          E(a,b) (P ) :=        g(v, v) dλ,
                                                            λa
      where v = dx/dλ is the tangent vector associated with P . For the null segment ab of L ,
      we have E(a,b) (P ) = 0. Since ab is assumed to be not geodesic, it is not a stationary point
      of E(a,b) (P ), which implies that there exists a nearby curve from a to b with E(a,b) (P ) < 0,
                                             4.4 Black holes                                            115
i.e. there exists a curve from a to b with some timelike part. It follows that p and q can be
connected by a causal curve with a timelike segment. But this feature has been excluded above.
We conclude that L is a null geodesic.
     At this stage, we have shown that given p ∈ H , there exists a future-directed null geodesic
L connecting p to another point q ∈ H . There remains to show that L lies entirely in H .
Let us start by showing that L ⊂ J − (I + ). Let a be a generic point of L between p and
q. Since L is null, there exists a point a′ arbitrarily close to a such that a′ is connected to
q by a future-directed timelike curve (cf. Fig. 4.17). Thanks to Lemma 4.6 and the property
q ∈ J − (I + ), we may find a point q ′ ∈ J − (I + ) close to q such that a′ is connected to q ′ by a
future-directed timelike curve. Since q ′ ∈ J − (I + ), such a curve can be extended to a causal
curve to I + (the dashed curve in Fig. 4.17); hence a′ ∈ J − (I + ). Since a′ is arbitrarily close to
a, we conclude that a ∈ J − (I + ). Then, by repeating the same reasoning as that employed
above for proving q ∈ H , simply replacing q by a, we get that a ∈ H . Since a is a generic
point of L , we conclude that L lies entirely in H .
     Given a point p ∈ H , we have thus constructed a future-directed null geodesic L lying
entirely in H and connecting p to another point q ∈ H . One can repeat the construction
from the point q to get another future-directed null geodesic L ′ ⊂ H connecting q to another
116                      The concept of black hole 3: The global view
      point q ′ ∈ H . Now L and L ′ must be two segments of the same null geodesic L ∪ L ′ by
      the following lemma.
Lemma 4.10
      Proof of Lemma 4.10. Assume that L and L ′ have non-collinear tangent vectors at q. Then,
      in the vicinity of q, one can find a point a ∈ L and a point b ∈ L ′ such that a and b can be
      connected by a timelike curve (cf. Fig. 4.18). Since L ⊂ H and L ′ ⊂ H , we have a ∈ H
      and b ∈ H and therefore we get a contradiction with H being achronal.
      Thanks to Lemma 4.10, we conclude that L ′ extends L to the null geodesic L ∪ L ′ entirely
      lying in H . By iterating, we conclude that the null geodesic L through p can be extended
      indefinitely into the future. Moreover, it can never leave H . Indeed, if L would leave H at
      some point q, by the same procedure used above for p, one could construct a future-directed
      null geodesic L ′ ⊂ H starting from q; then Lemma 4.10 would imply that L and L ′ would
      have the same tangent at q, which is not compatible with L leaving H at q.
          Another direct consequence of Lemma 4.10 is that no two distinct null generators may
      intersect at a point p ∈ H , except if their segments in the past of p lie outside H . This
      completes the proof of Property 4.8.
           Some features of Property 4.8 are illustrated in Fig. 4.19, which displays the null geodesic
      generators in a numerical simulation of the head-on collision of two black holes by Matzner et
      al. (1995) [359]. Note that new null geodesics enter the event horizon at the “crotch” of the
      “pair of pants”.
           The head-on black hole merger has been also computed by Cohen et al. (2009) [133], with
      an increased numerical accuracy (cf. Fig. 4.20). Cross-sections of the event horizon H (cf.
      Sec. 2.3.4) are depicted in Fig. 4.21. The same figure also shows how some null geodesics are
      reaching H to become null generators.
                                                   4.4 Black holes                                                    117
Figure 4.19: Spacetime diagram of the event horizon corresponding to the head-on merger of two black holes
as computed by Matzner et al. (1995) [359]. The white curves are some null geodesic generators; the left picture is
a zoom of the merger region, with the crease set (source: Fig. 4 of Ref. [359]; ©1995 American Association for the
Advancement of Science).
Figure 4.20: Spacetime diagram showing the event horizon in the head-on merger of two black holes, as
computed by Cohen et al. (2009) [133]. The blue curves are null geodesics that will eventually become null
generators of the event horizon; those arising from regions close to the event horizon are marked by the arrow
and the black ellipse (source: Fig. 15 of Ref. [133]; ©2009 IOP Publishing Ltd).
118                          The concept of black hole 3: The global view
      Figure 4.21: Cross-sections (at various coordinate times t) of the event horizon H corresponding to the
      head-on merger of two black holes as computed by Cohen et al. (2009) [133] and displayed in Fig. 4.20. Each
      figure is a 2D cut of a hypersurface Σt defined by a constant value of the coordinate time t, expressed in units
      of the sum M of the initial irreducible masses of each black hole (to be discussed in Chap. 16). The whole 3D
      hypersurface Σt can be reconstructed by rotation around the collision axis. tCEH (for “Common Event Horizon”)
      is the coordinate time at which the cross-section of H becomes a connected 2-surface. The cross-sections of
      H are displayed in black, while the green dashed curves denote the set of the intersections with Σt of the null
      geodesics that will become null generators of H through the cusps in the “individual” event horizons. The red
      and blue dashed curves denotes apparent horizons (to be discussed in Chap. 18). (source: Fig. 1 of Ref. [133]; ©2009
      IOP Publishing Ltd).
          Finally, Fig. 4.22 shows a cross-section of the event horizon computed by Cohen et al.
      (2012) [132] in some inspiralling binary black hole merger. The black hole spacetime itself has
      been computed as a solution to the vacuum Einstein equation (1.44) by Scheel at al. [446]; it
      corresponds to 16 inspiralling orbits of an equal-mass binary black hole with vanishing initial
      spins.
          Generically, for a binary black hole merger, the crease set forms a 2-dimensional subset of
      the event horizon H and is bounded by the set of caustic points, which forms a 1-dimensional
      subset of H [457, 458, 286, 132].
      Proof. Let us assume that H is smooth in some open subset U . By Property 4.7, H is then
      a smooth hypersurface in U . According to Property 4.8, there is a null geodesic lying in H
      through any point of H ∩U . This implies null tangent vectors at any point of H ∩U , so that, in
      U , H must be either a null hypersurface or a timelike one. But H is achronal by Property 4.5
                                                  4.4 Black holes                                                    119
Figure 4.22: Cross-section of the event horizon H of the inspiralling merger of two black holes as computed
by Cohen et al. (2012) [132]. The x and y axes define the orbital plane. This cross-section is the first connected
one in the slicing of H by surfaces of constant coordinate time t (source: Fig. 2 of Ref. [132]; ©2012 American
Physical Society).
and therefore cannot be timelike. Hence, H is a null hypersurface in U . Finally, there is only
one congruence11 of null geodesics ruling a null hypersurface: the null generators defined in
Sec. 2.3.3. The generators invoked in Property 4.8 have thus to belong to that congruence.
   It can be shown that event horizons are smooth almost everywhere: the only location where
they are not differentiable is the crease set, i.e. the set of points where null geodesics cross each
other while entering H to become a null generator (cf. Fig. 4.19).
Remark 6: Properties 4.5 to 4.11 are not specific to black hole horizons: they are actually valid for the
boundary ∂J − (S) of the causal past of any set S ⊂ M , or S ⊂ M˜ (such as I + ) [407, 266, 209, 390].
Indeed, none of the specific features of I + (i.e. I + is part of the boundary of M˜ and each of its
points can reached by a future-directed causal curve originating from M ) has been used in the proofs
of Properties 4.5 to 4.11. These properties are also valid for the boundary ∂J + (S) of the causal future of
any set S, modulo the changes future ↔ past in Property 4.8. A set of the type ∂J − (S) and ∂J + (S) is
called an achronal boundary [266].
Historical note : The properties of generic achronal boundaries, which yield Properties 4.5 to 4.11 in the
particular case of a black hole event horizon H (cf. Remark 6), have been established by Roger Penrose
in a seminal lecture at the Battelle-Seattle Center at the summer of 1967, which has been published in
1968 [407]. Penrose used the term semispacelike boundary instead of achronal boundary. It seems that the
latter has been introduced by Stephen Hawking and George Ellis in 1973 in their famous textbook [266].
  11
    Let us recall that a congruence on a submanifold S of M is a family of curves tangent to S , one, and
exactly one, through each point of S .
120   The concept of black hole 3: The global view
Chapter 5
Contents
            5.1   Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121
            5.2   Stationary spacetimes . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122
            5.3   Mass and angular momentum . . . . . . . . . . . . . . . . . . . . . . . 128
            5.4 The event horizon as a Killing horizon . . . . . . . . . . . . . . . . . . 152
            5.5 The generalized Smarr formula . . . . . . . . . . . . . . . . . . . . . . 162
            5.6 The no-hair theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
5.1         Introduction
Having defined black holes in all generality in Chap. 4, we focus here on the steady state case,
i.e. black holes in stationary spacetimes. We have already discussed non-expanding horizons
and Killing horizons in Chap. 3 as possible models for the event horizon of a steady state black
hole. Actually, we shall see below that (each connected component of) the event horizon of a
black hole in a stationary spacetime has to be a Killing horizon, at least in the framework of
general relativity and within rather general hypotheses. We shall start by defining properly
the concept of stationary spacetime and investigating some first properties of a black hole in
such a spacetime (Sec. 5.2). Then, in Sec. 5.3, we discuss the concepts of mass and angular
momentum in asymptotically flat spacetimes, which are useful to characterize black holes. In
Sec. 5.4, we shall see that if the Killing vector ξ generating stationarity is null on a connected
event horizon H , the latter is a Killing horizon with respect to ξ. If H is non-degenerate
and the electrovacuum Einstein equation holds, this can only occur in a static spacetime
(staticity theorem, Sec. 5.4.1). On the contrary, if ξ is spacelike1 on some parts of H , then,
modulo the electrovacuum Einstein equation and some additional hypotheses, H is still a
Killing horizon, albeit with respect to a Killing vector distinct from ξ (strong rigidity theorem,
Sec. 5.4.2). Section 5.5 is devoted to an important relation between various global quantities
   1
       The timelike case is excluded for an event horizon is a null hypersurface.
122                                     Stationary black holes
      characterizing a stationary black hole: the Smarr formula. This is the opportunity to investigate
      electromagnetic fields on the horizon of a stationary black hole, in particular to define the black
      hole’s electric charge and electric potential, both of them being involved in the Smarr formula.
      The culmination point of this chapter is Sec. 5.6, which presents the famous no-hair theorem.
      Modulo some hypotheses, this theorem stipulates that in 4-dimensional general relativity, all
      isolated stationary electrovacuum black holes are necessarily Kerr-Newman black holes; in the
      pure vacuum case (the most relevant one for astrophysics), they are Kerr black holes, to be
      explored in Part. III.
      Remark 1: Some authors (e.g. Carter [96]) call pseudo-stationary the stationary spacetimes that obey
      (ii’), keeping the qualifier stationary for the strictly stationary case. As we are going to see, when M
      contains a black hole, ξ cannot be timelike everywhere, so only pseudo-stationarity in Carter’s sense is
      relevant for such spacetimes. Our terminology, namely keeping the qualifier stationary even for (ii’),
      follows that of Choquet-Bruhat [108], Chruściel, Lopes Costa & Heusler [123], Heusler [275] and Wald
      [505].
          The Killing vector field ξ is a priori not unique: the translation group (R, +) admits the
      reparametrization t 7→ t′ = αt, where α is a nonzero constant, which yields the rescalling
      ξ 7→ ξ ′ = α−1 ξ of the Killing vector field [cf. Eq. (3.14)]. When the spacetime admits a
      conformal boundary I — which is required if a black hole is present, given the definition (4.37)
      —, then one selects ξ by demanding that it is future-directed near I and obeys
                                             ξ · ξ → −1 near I .                                         (5.1)
      This determines ξ uniquely, since this fixes the rescaling constant α to ±1, with only +1 being
      acceptable to keep the time orientation.
          If the stationary spacetime is endowed with electromagnetic and/or matter fields, they must
      respect the stationarity as well. For the electromagnetic field F (cf. Sec. 1.5.2), this is expressed
      by the vanishing of the Lie derivative along ξ:
                                                    Lξ F = 0.                                            (5.2)
      This condition is similar to Killing-vector property Lξ g = 0 [Eq. (3.18)].
         A notion stronger than stationarity is that of staticity:
                                            5.2 Stationary spacetimes                                           123
  A spacetime (M , g) is called static iff (i) it is stationary and (ii) the Killing vector field ξ
  generating the stationary action is orthogonal to a family of hypersurfaces (one says that ξ
  is hypersurface-orthogonal). The spacetime (M , g) is called strictly static iff moreover
  ξ is timelike in all M .
Remark 2: The same comment as in Remark 1 can be made: some authors would call static only
spacetimes that are strictly static according to the above definition.
     In loose terms, a spacetime is stationary if “nothing changes with time”, while it is static
if, in addition, “nothing moves”. A prototype of a stationary spacetime that is not static is a
spacetime containing a steadily rotating body, be it a star or a black hole.
where ξ is the metric dual of ξ (cf. Sec. A.3.3) and W := g(ξ, ξ) = ⟨ξ, ξ⟩. The orthogonality
of ∂t and the hypersurfaces t = const translates to g0i = 0 for i ∈ {1, . . . , n − 1}, the gαβ ’s
being the metric components with respect to the coordinates (xα ). Hence one may write the
metric of a static spacetime of dimension n as
where the indices (i, j) range in {1, . . . , n − 1} and W and gij are functions of (x1 , . . . , xn−1 )
only. It is clear that the metric (5.5) is invariant2 in the transformation t 7→ −t. One says that a
static spacetime is time-reflection symmetric.
Example 1 (staticity of anti-de Sitter spacetime): The anti-de Sitter spacetime AdS4 considered in
Example 18 of Chap. 3 is strictly static, with the coordinates (τ, r, θ, φ) being adapted to staticity (hence
their name: global static). Indeed, the metric (3.49) is of the form (5.5) with t = τ and W = −ℓ2 (1 + r2 ).
The strict staticity follows from W < 0, which shows that the Killing vector ξ = ∂t is everywhere
timelike.
   2
       Would (5.5) have contained a non-vanishing g0i dt dxi term, this would not have been the case.
124                                      Stationary black holes
        The event horizon H of a black hole in a stationary spacetime is globally invariant under
        the action of the stationary group (R, +).
          The word globally stresses that H is invariant as a whole, not that each point of H is
      a fixed point of the group action. Let us assume that H is smooth (this sounds likely in a
      stationary context; a rigorous proof can be found in Ref. [120]); it is then a null hypersurface
      (Property 4.11 in Sec. 4.4.3). Now, H is globally invariant if, and only if, the generator ξ
      of the isometry group is tangent to H . Since a timelike vector cannot be tangent to a null
      hypersurface (cf. Lemma 2.3 in Sec. 2.3.4), we conclude:
        In a stationary spacetime containing a black hole, the stationary Killing vector field ξ is
        tangent to the event horizon H , which implies that ξ is either null or spacelike on H . Let
        H0 be a connected component of H (H0 = H if H is connected). If ξ is null on all H0 ,
        one says that H0 is non-rotating, while if ξ is spacelike on some part of H0 , one says
        that H0 is rotating.
along H (e.g. from small areas to larger ones), which would break the invariance of H under
the action of the stationary group. This of course results from H being part of the global
spacetime structure, since not any null hypersurface in a stationary spacetime has a vanishing
expansion: for instance, a future light cone in Minkowski spacetime (which is stationary!) has
θ(ℓ) > 0 (cf. Example 17 on p. 45), but the light cone is not invariant by any time translation
isometry. The rigorous proof that θ(ℓ) = 0 for stationary event horizons can be found in
Hawking & Ellis’ textbook (Proposition 9.3.1 of Ref. [266]). Here, we shall simply state:
In dimension 4, one can strongly constrain the topology of the horizon cross-sections:
displaced surface. Given that L−k θ(ℓ) = −Lk θ(ℓ) , we see that hypothesis (iii) implies
Lk θ(ℓ) ≤ 0. (5.7)
      A standard identity (cf. e.g. Eq. (3b) of [269], Eq. (3.1) of [68], Eq. (5.1) of [81] or Eq. (36) of
      [297]) expresses Lk θ(ℓ) as
                                              1S
                                Lk θ(ℓ) = −     R − SDa Ωa + Ωa Ωa + G(ℓ, k),                               (5.8)
                                              2
      where SR is the Ricci scalar of the Riemannian metric q on S induced by the spacetime
      metric g, SD is the Levi-civita connection associated to q, G is the Einstein tensor of the
      spacetime metric g and Ω is the 1-form on S that is the pullback of the 1-form ω defined by
      Eq. (2.75). We may also view Ω as a spacetime 1-form, defined as the composition of ω with
      the orthogonal projector →   −q onto S : Ω = ω ◦ →   −q . In view of Eq. (2.77), we may then write
      ⟨Ω, v⟩ = −k · ∇−  →
                        q (v) ℓ, or in index notation, Ω α  =  −kµ ∇ν ℓµ q να . Besides, using Eq. (3.43), we
                      →
                      −
      have G(ℓ, k) = G(ℓ) · k = −W · k − f ℓ · k = −W · k + f . Then, integrating (5.8) over the
      compact manifold S and setting the integral of the divergence term SDa Ωa to zero since S
      has no boundary, we get
                           S √
                    1
                      Z                    Z h                                       i√
                            R qd x = 2               a
                                                Ωa Ω −W       · k + f −Lk θ(ℓ)          q d2 x.          (5.9)
                    2 S                     S
                                                | {z } | {z } |{z} | {z            }
                    |       {z         }          ≥0       ≥0        ≥0        ≥0
                             2πχ
      In the left-hand side, we have invoked the Gauss-Bonnet theorem to express the integral of
      S
       R in terms of the Euler characteristic χ of the surface S , using the fact that in dimension 2,
      the Ricci scalar SR is twice the Gaussian curvature. The signs of the terms of the integrand in
      the right-hand side are justified as follows: Ωa Ωa = q ab Ωa Ωb ≥ 0 because q is a Riemannian
      metric, −W · k ≥ 0 by Lemma 1.2 in Sec. 1.2.2, given that k is a future-directed null vector
      and W is a future-directed causal vector thanks to the null dominance condition (3.43), f ≥ 0
      by hypothesis and −Lk θ(ℓ) ≥ 0 follows from Eq. (5.7). Now, the Euler characteristic χ is a
      topological invariant, which is 2 for the sphere S2 , 0 for the torus T2 and −2(g − 1) for a
      connected orientable surface of genus g, i.e. with g “holes”. Equation (5.9) yields χ ≥ 0. The
      only possibilities for S compact, connected and orientable are S2 (χ = 2) and T2 (χ = 0).
      However the torus case is very special. Indeed, setting χ = 0 in Eq. (5.9) implies that each term
      in the integrand of the right-hand side vanishes separately: Ωa Ωa = 0, W · k = 0, f = 0 and
      Lk θ(ℓ) = 0. The last condition is the marginal case in the inequality (5.7). Moreover, since q
      is positive definite, Ωa Ωa = 0 implies Ω = 0, which in turn implies SDa Ωa = 0. In addition,
      G(ℓ, k) = −W · k + f = 0. We then deduce immediately from Eq. (5.8) that SR = 0, i.e. the
      Ricci scalar of q is identically zero. Given that the Riemann curvature tensor of a 2-dimensional
      metric is proportional to its Ricci scalar [cf. Eq. (A.112)], it follows that q is a flat metric.
      Remark 3: The topology theorem 1, as stated above, is slightly different from the original version
      given by Hawking [260, 261, 266], for it adds the “outermost” hypothesis (iii). In Hawking’s version, (iii)
      is deduced from the non-existence of trapped surfaces in the black hole exterior J − (I + ), the latter
      property being proved by assumming that the spacetime is globally hyperbolic (Proposition 9.2.8 in
                                        5.2 Stationary spacetimes                                           127
Ref. [266]). Another difference with Hawking’s version, as stated in Proposition 9.3.2 of Ref. [266], is
that the torus topology is not totally excluded by Property 5.4. However, as discussed in the historical
note below, it seems that the exclusion of the torus in Hawking’s proof requires additional hypotheses,
which are not explicitely stated in Refs. [260, 266].
Remark 4: The topology theorem 1 is not specific to stationary black hole event horizons, for it relies
only on quasilocal properties. Indeed, the proof does not rely on H being the boundary of a black hole
region, if one agrees to define the “interior” region as that pointed towards by k. The only required
property is H being a hypersurface (not even a null one) sliced by spacelike compact surfaces S
with θ(ℓ) = 0 and Lk θ(ℓ) ≤ 0. The theorem is therefore valid for outer trapping horizons [269] and
(tubes of) apparent horizons, as noticed by Hawking himself [261] (p. 34), which are not necessarily null
hypersurfaces and which exist in non-stationary spacetimes. Both concepts of outer trapping horizon
and apparent horizon will be discussed in Chap. 18.
     Another version of the topology theorem relies on the null convergence condition, which
is weaker than the null dominance condition (hypothesis (ii) above), and replaces hypothesis
(iii) by other ones, which are more global. It also fully excludes the 2-torus topology:
A part U of spacetime is said globally hyperbolic iff it admits a Cauchy surface, i.e. a
spacelike hypersurface Σ such that every inextendible timelike curve of U intersects Σ exactly
once (cf. Sec. 8.3 of Wald’s textbook [499] for more details). We shall not give the proof of
Property 5.5 here; it can of course be found in the original article by Chruściel & Wald [130].
Remark 5: The topology theorems 1 and 2 regard only a connected component of a given horizon complete
cross-section. Generally, the latter is a connected 2-manifold, but there exist 4-dimensional stationary
(actually static) spacetimes containing a black hole, the event horizon of which has disconnected
complete cross-sections: the Majumdar-Papapetrou spacetimes [353, 399, 258]. They are solutions of the
electrovacuum Einstein equation (cf. Sec. 1.5.2) representing an arbitrary number of charged black holes,
which form a static configuration thanks to an exact balance between the gravitational attraction and
the electrostatic repulsion. The Majumdar-Papapetrou black holes will be discussed further in Sec. 5.6.1
[cf. Eq. (5.108)].
      S , called the Yamabe invariant, must be positive for f ≥ 0 (3 ). This result implies that S must
      admit metrics of positive scalar curvature; it is less stringent about the topology of S than the
      theorem for n = 4. Actually, the higher n, the less constraints on the topology are provided by
      the Yamabe invariant. For instance, for n = 5, the cross-sections of the Myers-Perry black holes
      [376, 187, 425], which generalize Kerr black holes to n > 4, have the topology of the sphere S3 ,
      but the topology S1 × S2 is allowed as well, as demonstrated by the black ring solution found
      by Emparan & Reall [186, 187, 425] (see also Sec. 5.3 of Ref. [119]).
      Historical note : The topology theorem for the spacetime dimension n = 4 has been formulated first
      by Stephen Hawking in 1992 [260] (see also p. 34 of Ref. [261] and Proposition 9.3.2 in Hawking &
      Ellis’ textbook [266]). However, in 1993, Gregory Galloway [208] (p. 119) pointed out some limitation
      in Hawking’s proof, namely that it cannot exclude the torus topology (χ = 0) for the horizon’s cross-
      sections without any extra hypothesis. For instance, for the proof given in Ref. [260], one shall require
      that the spacetime is analytic (cf. Remark 4 in Sec. A.2.1), which is a rather strong hypothesis. The proof
      presented above is based on that given by Sean Hayward in 1994 [269] for outer trapping horizons (see
      also the proof of Theorem 6.3 in Ref. [381]). The theorem obtained by Piotr Chruściel and Robert Wald
      in 1994 [130] (Property 5.5) relies on the topological censorship theorem established by John Friedman,
      Kristin Schleich and Donald Witt in 1993 [199]. It leads directly to the spherical topology, excluding the
      toroidal one.
g = f + h, (5.10)
      where f is a flat Lorentzian metric on M and h is small in the following sense: the components
      of h obey |hαβ | ≪ 1 in any f -Minkowskian coordinates, i.e. coordinates (xα ) on M such
      that4 (fαβ ) = η := diag(−1, 1, . . . , 1). In addition, we assume that g is ruled by the Einstein
      equation (1.40) with Λ = 0 and an energy momentum tensor T of compact support.
         3
            In brief, Eq. (5.9) holds for n > 4 as well, except that the integral of the Ricci scalar SR is no longer proportional
      to the Euler characteristic of S (no Gauss-Bonnet theorem for dim S ̸= 2 !) but is related to the Yamabe constant
      of the conformal class of the metric q.
          4
            We keep the notation η for the matrix diag(−1, 1, . . . , 1), so that (ηαβ ) stands for the components of f in
      Minkowskian coordinates only.
                                       5.3 Mass and angular momentum                                                129
     Given the flat metric f , f -Minkowskian coordinates are not unique: any Poincaré trans-
formation x̃α = Λαµ xµ + cα , where (Λαβ ) is a Lorentz matrix and the cα ’s are constant, leads
to coordinates (x̃α ) which are f -Minkowskian as well. Furthermore, the flat metric f itself,
and hence the decomposition (5.10), is highly nonunique. Indeed any change of coordinates of
the form x′ α = xα + ζ α (x0 , . . . , xn−1 ) where (xα ) are f -Minkowskian coordinates and ζ α are
infinitesimal functions, leads to the following components of the metric tensor with respect to
(x′ α ):
                            ∂xµ ∂xν
                g ′ αβ = gµν ′ α ′ β = (ηµν + hµν ) (δ µα − ∂α ζ µ ) δ νβ − ∂β ζ ν ,
                                                                                  
                            ∂x ∂x
where we have used xα = x′ α −ζ α and ∂ζ α /∂x′ β = ∂ζ α /∂xσ ×∂xσ /∂x′ β ≃ ∂ζ α /∂xβ =: ∂β ζ α
to the first order in ζ α . Expanding to the first order in ζ α and h, we get
                                      g ′ αβ = ηαβ + hαβ − ∂α ζβ − ∂β ζα ,
where ζα := ηαµ ζ µ . We may recast the above expression as g = f ′ + h′ , where f ′ is the metric
whose components in the coordinates (x′ α ) are ηαβ (hence f ′ is flat, as5 f ) and h′ has the
following components with respect to the coordinates (x′ α ):
                                         h′ αβ = hαβ − ∂α ζβ − ∂β ζα .                                     (5.11)
The above relation can be viewed as expressing some gauge freedom on h. This freedom actually
reflects the freedom in the choice of the flat background metric f in the decomposition (5.10).
  The gauge freedom (5.11) can be used to ensure that, in terms of f -Minkowskian coordinates
  (xα ), the metric perturbation h fulfills
                                               ∂µ η µν h̄αν = 0 ,                                       (5.12)
                                                           
  where
                                                   1
                                       h̄ := h − h f ,                                      (5.13)
                                                   2
  h standing for the trace of h with respect to f : h := η µν hµν . The choice (5.12) is referred
  to as the Lorenz gauge (sometimes Hilbert gauge [464]).
the d’Alembertian operator with respect to f . Accordingly, if the Lorenz gauge (5.12) is not
fulfilled, it suffices to solve the d’Alembert equation
                                         □f ζα = ∂µ η µν h̄αν                                   (5.14)
                                                              
and plug the solution ζα into Eq. (5.11) to get a metric pertubation h′ that obeys the Lorenz
gauge.
   5                                                                    α
       Note that the components of f with respect to the coordinates (x′ ) are not ηαβ but ηαβ − ∂α ζβ − ∂β ζα .
130                                               Stationary black holes
      Remark 1: The Lorenz gauge (5.12) is equivalent to the first-order expansion in h of the relation
      defining harmonic coordinates on M , i.e.
                                                       √
                                   □g xα = 0 ⇐⇒ ∂µ        −g g µα = 0.                            (5.15)
                                                                  
      Remark 2: The Lorenz gauge does not fully specify the pair (f , h). Indeed, the solutions of the
      d’Alembert equation (5.14) are nonunique: they depend on initial and boundary data.
          A standard computation (see e.g. Chap. 18 of [371] or Chap. 5 of [464], noticing that the
      computation is independent of the spacetime dimension n) shows that, at first order in h and
      in the Lorenz gauge, the Einstein tensor of g is
                                                       1
                                                  G = − □f h̄,                                 (5.16)
                                                       2
      where □f stands for the d’Alembertian operator relative to the metric f : in f -Minkowskian
      coordinates (xα ), □f h̄αβ = η µν ∂µ ∂ν h̄αβ .
          Let us now assume that (M , g) is a stationary spacetime, with Killing vector ξ. We may
      then choose f so that the f -Minkowskian coordinates (xα ) are adapted to stationarity, i.e. x0
      is an ignorable coordinate, or equivalently ∂0 = ξ. Let then Σ be a hypersurface x0 = const
      and γ the metric induced by f on Σ. The coordinates6 (xi )1≤i≤n−1 form a Cartesian coordinate
      system of (Σ, γ): γ = δij dxi ⊗ dxj = (dx1 )2 + · · · + (dxn−1 )2 . The operator □f reduces to
      the Laplace operator of γ, ∆ say, and, thanks to property (5.16), the Einstein equation (1.40)
      becomes a system of n(n + 1)/2 independent Poisson equations:
                                                          ∆h̄αβ = −16πTαβ .                                            (5.17)
      The solutions are obtained via the Green function of the (n − 1)-dimensional Laplace operator:
                                                                         Tαβ (x′ )
                                                                  Z
                                                   16π
                                 h̄αβ (x) =                                          dn−1 x′ ,             (5.18)
                                             (n − 3)Ωn−2 Σ |x − x′ |n−3
      where x := (x1 , . . . , xn−1 ), x′ := (x′ 1 , . . . , x′ n−1 ), |x − x′ |2 := n−1 i=1 (x − x ) and Ωn−2 is
                                                                                               i    ′i 2
                                                                                      P
      the area of the unit sphere Sn−2 ; the latter is given by the formula
                                                                                Z +∞
                                    2π (p+1)/2
                        Ωp =                     ,          with Γ(u) :=               tu−1 e−t dt,        (5.19)
                                 Γ((p + 1)/2)                                      0
      so that
                                                                          8
                                    Ω2 = 4π,        Ω3 = 2π 2 ,       Ω4 = π 2 ,         Ω5 = π 3 , . . .              (5.20)
                                                                          3
         6
             Latin indices i, j, k, . . . range in {1, . . . , n − 1}, while Greek ones range in {0, . . . , n − 1}.
                                5.3 Mass and angular momentum                                       131
  the metric tensor has the following behavior when r := (x1 )2 + · · · + (xn−1 )2 → +∞:
                                                          p
                                                         
                                          M             1
                         g00 = − 1 + αn n−3 + O n−1                                 (5.21a)
                                         r            r
                                n − 2 Jij xj
                                                         
                                                        1
                          g0i =      αn n−1 + O n−1                                 (5.21b)
                                  2     r            r
                                                              
                                       αn M                    1
                          gij = 1 +                δij + O n−2 ,                    (5.21c)
                                      n − 3 rn−3             r
  and                               Z
                                         xj T0i (x) − xi T0j (x) dn−1 x,                 (5.23)
                                                                
                           Jij :=
                                    Σ
  Σ being any hypersurface x0 = const. The quantities M and Jij are independent of Σ (i.e.
  of x0 ) and are called respectively the mass and the angular momentum of the central
  source. The coordinates (xα ) correspond to the central source rest-frame and take their
  origin at the center of mass, in the sense that
                   Z                                 Z
                                n−1
                       T0i (x) d x = 0        and       xi T00 (x) dn−1 x = 0       (5.24)
                    Σ                                       Σ
  The dominance of the energy-density over the spatial stresses is expressed in terms of the
  components of T with respect to the coordinates (xα ) as T00 ≫ |Tij |.
Proof. Let (f , h) obeys the Lorenz gauge and (xα ) be some corresponding f -Minkowskian
coordinates. By a Poincaré transformation (cf. p. 129), one can inforce (5.24). Far from the
source, one may expand the term 1/|x − x′ |n−3 in Eq. (5.18) in powers of x′ i /r:
                                                                 ′ 2 !
                         1          1               xj x′ j       |x |
                           ′ n−3
                                 = n−3 1 + (n − 3)          +O             ,
                    |x − x |      r                  r r           r2
where Einstein’s summation convention is assumed on the repeated index j ∈ {1, . . . , n − 1}.
Accordingly, Eq. (5.18) yields
                                                        xj
                                Z                         Z                                
            16π         1                 ′   n−1 ′             ′j     ′  n−1 ′            1
h̄αβ (x) =                          Tαβ (x ) d x + n−1        x Tαβ (x ) d x +O n−1 .
           Ωn−2 (n − 3)rn−3 Σ                         r     Σ                            r
                                                                                           (5.25)
For the component 00, the first integral in the right-hand side is nothing but M , as defined by
(5.22), while the second integral vanishes due to the second equation in (5.24). Hence, we get
                                                                   
                                          16π       M            1
                          h̄00 (x) =                    + O n−1 .                          (5.26)
                                     (n − 3)Ωn−2 rn−3         r
132                                            Stationary black holes
      Regarding the component 0i of Eq. (5.25), the first integral in the right-hand side vanishes due
      to the first equation in (5.24). There remains then
                    16π xj                                            8π Jij xj
                              Z                                                          
                                   ′j                       1                             1
        h̄0i (x) =                        ′   n−1 ′
                                  x T0i (x ) d x + O n−1 =                       + O n−1 , (5.27)
                    Ωn−2 rn−1 Σ                           r          Ωn−2 rn−1          r
      where the second equality follows from the identity
                                              Z
                                      Jij = 2 xj T0i (x) dn−1 x.                                                   (5.28)
                                                                Σ
      In view of the definition (5.23) of Jij , the identity (5.28) follows, since T 0i = −T0i at the 0th
      order in h.
          Let us now reconstruct h from h̄. Taking the trace of Eq. (5.13) with respect to f yields
      h̄ := η µν h̄µν = h − (h/2) × n = (2 − n)h/2. Hence we may invert Eq. (5.13) to
                                                      h̄
                                       h = h̄ −          f          with h̄ = −h̄00 + h̄ii .                       (5.29)
                                                     n−2
      The part7 h̄ii of the trace h̄ is deduced from Eq. (5.25):
                                                               xj
                                        Z                        Z                          
                  16π           1                 ′   n−1 ′           ′j     ′   n−1 ′       1
       h̄ii (x) =                           Tii (x ) d x + n−1       x Tii (x ) d x + O n−1 .
                  Ωn−2 (n − 3)rn−3 Σ                         r     Σ                       r
      The second integral vanishes identically, as it can be seen from the identity
                               
                         1 2 ij                                                               1
               k j ik
          ∂i x x T − r T          = δ ki xj T ik + xk δ ji T ik + xk xj ∂i T ik −r ∂i r T ij − r2 ∂i T ij
                         2                                              | {z }    |{z}        2 | {z }
                                                                                      0        xi /r           0
                                                 j   ii     j
                                            = x T = x Tii ,
      where ∂i T ij = 0 follows from the energy-momentum conservation law (1.45). Hence the
      integral of xj Tii over Σ is that of the divergence of a vector field that vanishes outside the
      source and the Gauss-Ostrogradsky theorem allows one to set it to zero. We are thus left with
                                                     Z                           
                                        16π      1                             1
                         h̄ii (x) =                          ′   n−1 ′
                                                       Tii (x ) d x + O n−1 .                   (5.30)
                                    (n − 3)Ωn−2 rn−3 Σ                       r
         7
             Note that Einstein’s summation convention is used: h̄ii = h̄11 + · · · + h̄n−1,n−1 .
                                 5.3 Mass and angular momentum                                         133
        where
                                                           16π
                                              αn :=                ,                            (5.32)
                                                       (n − 2)Ωn−2
        (Ωn−2 being the area of the unit sphere Sn−2 , cf. Eqs. (5.19)-(5.20)), so that
                                                     8             3             16
                               α4 = 2,       α5 =      ,   α6 =      ,   α7 =        ,          (5.33)
                                                    3π            2π            5π 2
        M is a constant and (Jij ) is a constant antisymmetric (n−1)×(n−1) matrix. Furthermore,
        by a suitable SO(n − 1) transformation of the coordinates (xi )1≤i≤n−1 , (Jij ) can be brought
        to the following block diagonal form, depending only on p := [(n − 1)/2] numbers J(1) ,
        . . ., J(p) (possibly equal to zero):
                                                                               
                                          0    J(1)
                                                                               
                                        −J     0                               
                                          (1)                                  
                                                                                                (5.34)
                                                                               
                                 Jij = 
                                                    0    J(2)                  ,
                                                                                
                                                                               
                                                   −J(2) 0                     
                                                                          ...
                                                                               
with the last raw and the last column containing only zeros if n is even.
      The last assertion follows from the fact that any real antisymmetric matrix J is similar to a
      matrix J ′ of the type (5.34) via J ′ = P JP −1 , where P is a special orthogonal matrix.
          Note that the difference between (5.31) and the asymptotic expansion (5.21) for a weakly
      relativistic system is O (1/rn−2 ) in Eq. (5.31a) versus O (1/rn−1 ) in Eq. (5.21a). As discussed
      above, this results from nonlinear terms in the expansion of the Einstein equation.
                                  5.3 Mass and angular momentum                                           135
where
           (i) ξ is the 1-form associated to ξ by metric duality (cf. Sec. A.3.3), i.e. the 1-form of
               components ξα = gαµ ξ µ ;
          (ii) dξ is the exterior derivative of ξ (cf. Sec. A.4.3, especially Eqs. (A.90b) and (A.91b)),
               namely the 2-form whose components are
         (iii) ⋆(dξ) is the (n − 2)-form that is the Hodge dual of the 2-form dξ. The Hodge dual
               of any p-form A is defineda as the (n − p)-form ⋆A given by
                                                                1 µ1 ...µp
                                              ⋆Aα1 ...αn−p :=      A       ϵµ1 ...µp α1 ...αn−p ,                  (5.38)
                                                                p!
where ϵ is the Levi-Civita tensor associated with the metric g (cf. Sec. A.3.4);
         (iv) the orientation of S (which is required to define the integral of ⋆(dξ) over S ) is
              set by the (n − 2)-formb
                                           S
                                             ϵ := ϵ(n, s, ., . . . , . ),                  (5.39)
                                                          | {z }
                                                                         n−2 slots
         We shall check below (Property 5.11) that the Komar mass (5.36) gives the coefficient M
      that appears in the asymptotic expansion (5.31) of the metric tensor.
      Remark 3: As the integral of a (n − 2)-form over an oriented (n − 2)-dimensional manifold, formula
      (5.36) is well posed. More precisely, on the (n − 2)-dimensional manifold S , the theory of integration
      is defined for (n − 2)-forms on S , while ⋆(dξ) is a (n − 2)-form on M . However, any (n − 2)-form ω
      on M yields canonically to a unique (n − 2)-form ι∗ ω on the submanifold S , by restricting the action
      of ω at each point p ∈ S to vectors tangent to S . One says that ι∗ ω is the pullback of ω to S via the
      embedding ι of S in M (cf. Sec. A.2.8). The pullback operator is thus implicit in Eq. (5.36).
      Remark 4: The numerical prefactor −(n − 2)/(16π(n − 3)) in the definition (5.36) is adjusted so that
      for weakly relativistic isolated objects, the Komar mass coincides with the volume integral of the matter
                                       5.3 Mass and angular momentum                                         137
energy density, as it will be shown in Property 5.11. For n = 4 (the standard spacetime dimension), this
prefactor is −1/(8π), while for n = 5, it becomes −3/(32π).
Remark 5: The Komar mass is not defined for n ≤ 3. In particular, formula (5.36) is ill-posed for n = 3.
Already in Newtonian gravity, the very concept of gravitational mass is well defined only for n ≥ 4.
Indeed, in spherical symmetry, a solution to the Poisson equation outside the sources (∆Φ = 0) that
decays with the radius r exists only for n ≥ 4: this solution is8 Φ = −K/rn−3 where the constant K is
proportional to the mass M of the central source. For n = 3, the solutions of ∆Φ = 0 are Φ = a ln r + b
(a and b being constant), while for n = 2, they are Φ = ar + b, none of them decaying as r → +∞.
Remark 6: We have given above a heuristic motivation for defining the Komar mass by Eq. (5.36). We
shall see in Chap. 17 a more fundamental approach, which makes the Komar mass appear as the Noether
charge associated with the invariance of the Einstein-Hilbert Lagrangian under the diffeomorphisms
generated by the vector field ξ (cf. Property 17.9).
    In view of Eq. (5.38) with p = 2 and the last equality in Eq. (5.37), the Komar mass formula
(5.36) can be written as
                                       n−2
                                                 Z
                           MS = −                    ∇µ ξ ν ϵµνα1 ...αn−2 ,                 (5.40)
                                     16π(n − 3) S
where ∇µ ξ ν ϵµνα1 ...αn−2 stands for the (n − 2)-form ω defined by
                                                                                    α
                            ω(u1 , . . . , un−2 ) = ∇µ ξ ν ϵµνα1 ...αn−2 uα1 1 · · · un−2
                                                                                       n−2
for any (n − 2)-tuple of vector fields (u1 , . . . , un−2 ) tangent to S (cf. Remark 3 above).
    Instead of integrals of (n − 2)-forms along the (n − 2)-surface S , as in (5.36) and (5.40),
one may express the Komar mass as a flux integral, i.e. the integral of a 2-form contracted with
some “area element”, which is normal to S . Let us introduce the latter first.
          (i) n and s are unit timelike and spacelike vector fields normal to S obeying the same
              properties as in item (iv) of the above definition of the Komar mass; in particular at
              each point p ∈ S , (n|p , s|p ) is an orthonormal basis of the timelike plane Tp⊥ S
              normal to S (cf. Fig. ??);
     8
         This is easy to get since in the Euclidean space of dimension n − 1 the Laplace operator is ∆Φ =
                   dr for spherically symmetric fields Φ(r).
 1        d    n−2 dΦ
r n−2     dr r
138                                       Stationary black holes
         (iii) dn−2 x := dx1 · · · dxn−2 , where (xa ) = (x1 , . . . , xn−2 ) is a coordinate system on S
               such that (∂1 , . . . , ∂n−2 ) is a right-handed basis for the orientation of S defined by
               S
                ϵ [Eq. (5.39)], i.e. ϵ(n, s, ∂1 , . . . , ∂n−2 ) > 0;
          (iv) q := det(qab ) is the determinant w.r.t. (xa ) of the metric q induced on S by the
               spacetime metric g.
        The bivector dS is independent of the choice of the coordinates (xa ) on S and of the
        orthonormal basis (n, s) of Tp⊥ S .
                                   √           √
      Proof. The combination q dn−2 x = q dx1 · · · dxn−2 with q := det(qab ) is the volume (area)
      element of the Riemannian manifold (S , q); it is thus invariant in any change of coordinates
      (xa ) 7→ (x′ a ). Besides, any two orthonormal bases (n, s) and (n′ , s′ ) of Tp⊥ S having the same
      orientation and with n and n′ both future-directed are necessarily related by a 2-dimensional
      Lorentz boost (cf. Fig. ??):        
                                           n′ = cosh ψ n + sinh ψ s
                                           s′ = sinh ψ n + cosh ψ s,
The area element normal bivector dS appears in the following useful lemma.
      where A♯ is the tensor field of components Aαβ , (e(α) ) is an orthonormal tetrad such that
      e(0) = n and e(1) = s, (e(α) ) is its dual cobasis and dx1 , . . ., dxn−2 are displacement vectors
      forming elementary parallelograms on S ; for instance dxa = dxa ∂a (no summation on a)
      for a ∈ {1, . . . , n − 2}. Note that the last equality in the above expression results from the
      very definition of the integral of a (n − 2)-form over a (n − 2)-surface. Given the definition of
                                      5.3 Mass and angular momentum                                                139
the tetrad (e(α) ), (e(2) , . . . , e(n−1) ) is a basis of the tangent space Tp S ; consequently dx1 , . . .,
dxn−2 are linear combinations of e(2) , . . ., e(n−1) . Thanks to the alternate character of ϵ, we
may then restrict the sum over the indices µ and ν to (µ, ν) = (0, 1) and (µ, ν) = (1, 0). Hence
Z          Z                                                           Z
                  ♯ (0)      (1)
    ⋆A =      A (e , e ) ϵ(e(0) , e(1) , dx1 , . . . , dxn−2 ) =          A(0)(1) ϵ(n, s, dx1 , . . . , dxn−2 ).
 S            S                                                           S
A(0)(1) = g (0)(µ) g (1)(ν) A(µ)(ν) = g (0)(0) g (1)(1) A(0)(1) = (−1) × 1 × A(0)(1) = −A(0)(1) ,
with
                                                                     1
     A(0)(1) = A(e(0) , e(1) ) = A(n, s) = Aµν nµ sν = −Aµν sµ nν = − Aµν (sµ nν − nµ sν ).
                                                                     2
On the other side, ϵ(n, s, . . .), which appears above, is the (n − 2)-form Sϵ defined by Eq. (5.39).
Actually, it is nothing but the volume (area) form of (S , q), i.e. the Levi-Civita tensor of the
metric q (see e.g. Sec. 16.4.3 of Ref. [228]). In particular, Sϵ is independent of the choice of the
orthonormal basis (n, s) of Tp⊥ S . Hence we may write, for dxa = dxa ∂a ,
                                                                                 √
             ϵ(n, s, dx1 , . . . , dxn−2 ) = Sϵa1 ...an−2 dxa11 · · · dxn−2
                                                                      a  n−2
                                                                             =    q dx1 · · · dxn−2 .
Gathering the above results and using Eq. (5.42) establishes Eq. (5.43).
     Thanks to Lemma 5.10, we may re-express the Komar mass (5.36) as
                                      n−2
                                               Z
                           MS = −                 (dξ)µν dS µν .                                        (5.44)
                                   32π(n − 3) S
Using Eq. (5.37), this becomes
                                             n−2
                                                            Z
                                  MS    =−                        ∇µ ξν dS µν .                         (5.45)
                                           16π(n − 3)         S
Alternatively, we may express the exterior derivative in terms of partial derivatives (first
equality in Eq. (5.37)) and get
                                                  √                                  √
   (dξ)µν dS µν = (∂µ ξν − ∂ν ξµ )(sµ nν − nµ sν ) q dn−2 x = 2∂µ ξν (sµ nν − nµ sν ) q dn−2 x.
Equation (5.44) yields then an expression of MS that can be used for explicit computations:
                                  n−2                                        √
                                                 Z
                       MS    =−                        ∂µ ξν (sµ nν − nµ sν ) q dn−2 x .                (5.46)
                                16π(n − 3)         S
Example 3 (Komar mass in Schwarzschild spacetime): Let us consider the Schwarzschild spacetime
(M , g) introduced in Example 3 of Sec. 2.3. In terms of the ingoing Eddington-Finkelstein coordinates
(t, r, θ, φ) used there, the stationary Killing vector is simply ξ = ∂t . We have n = 4 and let us choose
140                                       Stationary black holes
      S to be a 2-sphere defined by (t, r) = const. The coordinates (xa ) = (x1 , x2 ) spanning S are then
      (θ, φ). For the pair of normal vectors to S , we choose n = N −1 ∂t − 2mN/r∂r and s = N ∂r , where
      N := (1 + 2m/r)−1/2 . Given the metric (2.5), it is easy to check that g(n, n) = −1, g(n, s) = 0 and
      g(s, s) = 1. It is also immediate that g(n, ∂θ ) = g(n, ∂φ ) = 0 and g(s, ∂θ ) = g(s, ∂φ ) = 0, which
      proves that n and s are normal to S . Moreover, s points towards the exterior of S (since N > 0) and
      (n, s, ∂θ , ∂φ ) is a right-handed basis. All the hypotheses stated below Eq. (5.42) are thus fulfilled and
      we may use Eq. (5.46) to evaluate the Komar mass over S . The components ξα of ξ are easily evaluated
      via the components (2.5) of g: ξα = gαµ ξ ν = gαµ (∂t )µ = gαt = (−1 + 2m/r, 2m/r, 0, 0). The only
      non-zero derivatives ∂µ ξν are then ∂r ξt and ∂r ξr ; they are both equal to −2m/r2 . It follows that the
      integrand in Eq. (5.46) is reduced to only two terms:
                                                             2m          −1                    2m
              ∂µ ξν (sµ nν − nµ sν ) = ∂r ξt (sr nt − nr st ) = −
                                                                                          
                                                                  N ×  N    +  2mN  ×   0   =− 2 .
                                                             r2                                r
                                                                                              √
      We read on Eq. (2.5) that the metric induced on S is q = r2 dθ2 + r2 sin2 θdφ2 , so that q = r2 sin θ.
      Accordingly, Eq. (5.46) leads to
                                           Z                             Z
                                        1         2m 2                  m
                             MS = −             − 2 r sin θ dθ dφ =            dθ dφ.
                                       8π S        r                   4π S
      Since the remaining integral is nothing but 4π, we get simply
MS = m. (5.47)
      We note that MS does not depend on r, i.e. does not depend on the choice of the sphere S .
          More generally, we have:
        The coefficient M that appears in expression (5.31) of the metric tensor of a stationary
        asymptotically flat n-dimensional spacetime
                                                p in some asymptotically Minkowskian coor-
        dinates (x ) coincides in the limit r := (x1 )2 + · · · + (xn−1 )2 → +∞ with the Komar
                  α
        mass over the sphere S of constant (x0 , r). In particular, for a weakly relativistic system,
        the Komar mass over S is equal to the volume integral of the matter energy density T00 ,
        as given by formula (5.22).
      Proof. Without any loss of generality, the coordinate system (xα ) leading to the expansion
      (5.31) can be chosen to be adapted to stationarity, i.e. such that ∂0 = ξ, where ξ is the stationary
      Killing vector. Let us evaluate the Komar mass MS via the integral (5.46), denoting the surface
                        √                     √
      element of S by q dn−2 y instead of q dn−2 x to avoid any confusion with the asymptotically
      Minkowskian coordinates (xα ). We have ξα = gαµ ξ µ = gαµ δ µ0 = gα0 , with gα0 = g0α given by
      formulas (5.31) for large r, so that
                                          M xi                                                    xk xi
                                                                                                       
                                                                   n−2        Jjk
       ∂0 ξα = 0, ∂i ξ0 ≃ −(n − 3)αn n−2           and ∂i ξj ≃            αn n−1 δki − (n − 1) 2          .
                                         r    r                       2      r                     r
                                                                1          n−1
                                                                                
      On the other hand, for r → +∞, we have sα ≃ 0, xr , . . . , x r , nα ≃ (1, 0, . . . , 0) and
      √ n−2              √
         q d y = rn−2 q̄ dn−2 y, where q̄ stands for the determinant of the round metric of the unit
                                  5.3 Mass and angular momentum                                           141
sphere Sn−2 with respect to the coordinates (y 1 , . . . , y n−2 ). Given that ∂i ξj decays as 1/rn−1 ,
it follows that only the terms involving ∂i ξ0 contribute to the integral (5.46), which becomes
                                            i
                                                                 √
                                                            
                         n−2
                                 Z
                                             x
              MS   =−                ∂i ξ0       × 1 − 0 × 0 rn−2 q̄ dn−2 y
                      16π(n − 3) S            r
                                    i i √                            √ n−2
                     n−2                               n−2
                               Z                                 Z
                                   x x          n−2
                   =      αn M             q̄ d y =         αn M      q̄ d y .
                     16π        S |r{zr}                16π        S
                                                                 |    {z    }
                                         1                                   Ωn−2
Since αn Ωn−2 = 16π/(n − 2) [cf. Eq. (5.32)], we get MS = M . The statement about a weakly
relativistic system is an immediate consequence of expansion (5.21) being a particular case of
(5.31), given that M in (5.21) has the integral form (5.22).
We are going to derive a volume integral formula for MS , from which one can show the
independence of MS from the choice of S in the vacuum case, thereby generalizing the result
of Example 3. To this aim, we shall make use of the following lemma:
  where (x1 , . . . , xn−1 ) stands for a coordinate system on Σ and γ := det(γij ) is the deter-
  minant w.r.t. these coordinates of the metric γ induced on Σ by the spacetime metric g, so
      √
  that γ dx1 · · · dxn−1 is the volume element on Σ. Then, for any 2-form A, we have
                         Z                   Z                Z
                                ν
                        2 ∇ Aµν dV =     µ
                                                  Aµν dS −µν
                                                                   Aµν dS µν ,             (5.49)
                         Σ                   Sext                Sint
  where dS µν is defined by Eq. (5.42) with s pointing to the asymptotic flat end of (M , g)
  for both Sext and Sint .
Proof. Let (t, x1 , . . . , xn−1 ) be a local coordinate system adapted to Σ, i.e. such that Σ is the
hypersurface t = 0, and let V be the vector defined by V := A(., n), i.e. Vα := Aαµ nµ . Thanks
to the antisymmetry of A, V is tangent to Σ: n · V = A(n, n) = 0. We have then, using
142                                    Stationary black holes
      In the second line, we have expressed the derivative of n in terms of the extrinsic curvature
      tensor K of Σ and of the lapse function N := −n · ∂t , via the identity ∇α nβ = −K αβ −
      Dβ ln N nα (see e.g. Eq. (4.24) of Ref. [227]), where Dβ ln N = ∇β ln N + nβ nµ ∇µ ln N . To get
      the third line, we have used the symmetry of K to set Aνµ K νµ = 0 and the property Vµ nµ = 0
      to write Vµ Dµ ln N = Vµ ∇µ ln N = V µ ∂µ ln N = V i ∂i ln N (i ∈ {1, . . . , n − 1}). Let us now
      use formula (A.74) for the divergence of V , along with V 0 = 0 (V is tangent to Σ) and the
               √          √
      relation −g = N γ between the determinants of the metrics g and γ (see e.g. Eq. (5.55) of
      Ref. [227])
                                  1   √             1        √
                       ∇µ V µ = √ ∂µ    −gV µ = √ ∂i N γV i
                                                                  
                                  −g              N γ
                                 1   √ i
                                      γV + V i ∂i ln N = Di V i + V i ∂i ln N,
                                         
                              = √ ∂i
                                  γ
      where Di V i is the divergence of V considered as a vector field on (Σ, γ), D standing for the
      Levi-Civita connection of γ. Hence we get
                                                           √
                                       ∇ν Aµν dV µ = Di V i γ dn−1 x.
      We may then apply the Gauss-Ostrogradsky theorem to V and write the integral of the
      divergence of A as
                                             i√                     √
                     Z                Z                  Z
                          ν       µ               n−1
                         ∇ Aµν dV   =    Di V γ d x =         σ i Vi q dn−2 x
                       Σ              ZΣ                 Z ∂Σ
                                           i √    n−2              √
                                    =     s Vi q d x −        si Vi q dn−2 x,
                                               Sext                   Sint
      where σ is the unit vector normal to the boundary ∂Σ and outward-pointing with respect to
      Σ. In the present case, ∂Σ has two connected components: Sint and Sext , with σ = −s on
      Sint and σ = s on Sext , hence the last equality. Now, using again the antisymmetry of A,
                                                            1
                             si Vi = sµ Vµ = sµ Aµν nν = Aµν (sµ nν − nµ sν ).
                                                            2
                                                                 √
      In view of expression (5.42) for dS µν , we thus get 2si Vi q dn−2 x = Aµν dS µν , so that Eq. (5.49)
      follows.
         Let us apply Lemma 5.12 to the 2-form A := dξ, i.e. to the 2-form of components Aαβ :=
      ∇α ξβ − ∇β ξα = 2∇α ξβ [cf. Eq. (5.37)]. The 1-form ∇ν Aµν which appears in the left-hand
                                 5.3 Mass and angular momentum                                        143
side of Eq. (5.49) is then, up to a sign, the metric dual of a vector field J (ξ) called the Komar
current of ξ: ∇ν Aµν = −J (ξ)µ , where
The Komar current is actually a conserved current, independently of the Killing nature of ξ:
  For any vector field ξ defined on the spacetime (M , g) (not necessarily a Killing vector
  field), the Komar current defined by Eq. (5.50) is conserved:
∇µ J (ξ)µ = 0. (5.51)
Proof. We have, from the definition (5.50), J (ξ)α = −∇ν Aαν with Aαβ := ∇α ξ β − ∇β ξ α .
Let (xα ) be a coordinate system on M . Since A is antisymmetric, its divergence can be
                                                           √          √
expressed in terms of partial derivatives as ∇ν Aαν = ∂ν ( −gAαν ) / −g, where g is the
determinant of g with respect to (xα ). Similarly the divergence of J (ξ) can be written as
                 √             √
∇µ J (ξ)µ = ∂µ ( −gJ (ξ)µ ) / −g. Combining the above formulas leads to ∇µ J (ξ)µ =
        √          √
−∂µ ∂ν ( −gAµν ) / −g. Since ∂µ ∂ν = ∂ν ∂µ and Aµν = −Aνµ , we get immediately ∇µ J (ξ)µ =
0.
   Thanks to the identities Aαβ = 2∇α ξβ and J (ξ)α = −∇µ Aαµ , Lemma 5.12 yields
                    Z               Z                  Z
                                 µ                 µν
                  − J (ξ)µ dV =           ∇µ ξν dS −       ∇µ ξν dS µν .
                       Σ                   Sext                  Sint
                                               n−2
                                                            Z
                           MSext = MSint   +                    J (ξ)µ dV µ .                (5.52)
                                             16π(n − 3)     Σ
    Thanks to the Killing equation (3.19), expression (5.50) for the Komar current J (ξ) simplifies
to J (ξ)α = −2∇µ ∇α ξ µ . Now, since ξ is a Killing vector, Eq. (3.84) holds: ∇µ ∇α ξ µ = Rαµ ξ µ .
Hence we get
                                       J (ξ)α = −2Rαµ ξ µ .                                 (5.53)
Pluging this relation into formula (5.52) and using expression (5.48) for dV µ , we arrive at the
following property.
                                                     n−2                     √
                                                                 Z
                              MSext = MSint      +                    R(ξ, n) γ dn−1 x,                   (5.54)
                                                   8π(n − 3)      Σ
                                                                                       √
        where R is the Ricci tensor of g, n is the future-directed unit normal to Σ and γ dn−1 x is
        the volume element on Σ induced by g. If g obeys the Einstein equation (1.42) with some
        energy-momentum tensor T and Λ = 0, then the above formula may be rewritten as
                                                   Z                           
                                           n−2                            T       √ n−1
                      MSext = MSint      +            T (ξ, n) −             ξ·n   γ d x,                 (5.55)
                                           n−3         Σ                 n−2
Property 5.15: independence of the Komar mass from the integration surface
  Two (n − 2)-surfaces Sext and Sint that are connected by a spacelike hypersurface Σ
  in a vacuum region of an asymptotically flat stationary spacetime ruled by the Einstein
  equation with Λ = 0 share the same Komar mass:
    The concept of “total mass” of a stationary and asymptotically flat spacetime is well captured
by the Komar mass at infinity:
r := (x1 )2 + · · · + (xn−1 )2 . If the Komar mass MSr converges when r → +∞, we call
  the Komar mass at infinity or total mass of the spacetime (M , g). Moreover, if g obeys
  the Einstein equation with Λ = 0 and if the energy-momentum tensor T vanishes for r
  larger than some threshold r0 , one has M∞ = MSr for any r ≥ r0 .
Proof. Since for r ≥ r0 , Sr is located in a vacuum region of spacetime, the value of MSr is
independent of r by Property 5.15, hence M∞ = MSr .
Example 5 (Total mass of Schwarzschild spacetime): We have already noticed in Example 3 that in
the Schwarzschild spacetime, the Komar mass MS is equal to the mass parameter m of Schwarzschild’s
metric, whatever the 2-surface S [Eq. (5.47)]. We thus conclude that M∞ = m.
Example 6 (Vanishing total mass of cubic Galileon black holes): In a stationary and asymptotically
flat 4-dimensional spacetime, if dξ decays faster that 1/r2 , then M∞ = 0. This happens for black holes
in a scalar-tensor theory of gravity known as the cubic Galileon, as shown in Sec. 5.1 of Ref. [490].
Actually, for these black holes, the spacetime metric tends to Minkowsky metric as fast as 1/r4 when
r → +∞, which implies that dξ decays as 1/r5 . This rapid convergence to flat space can be interpreted
as a screening of the black hole’s gravity by the surrounding scalar field.
Historical note : The Komar current (5.50) associated to any vector field of spacetime has been
introduced by Arthur Komar in 1959 [319]; he showed that this current is conserved (divergence-free)
[Eq. (5.51)]. In a subsequent study published in 1962 [320], Komar showed that the Komar current
J (ξ) associated to a Killing vector ξ is, up to some factor, the Ricci tensor applied to ξ [Eq. (5.53)] and
that the converved integral representing the flux of J (ξ) through a one-ended hypersurface Σ can
be transformed into a surface integral on the boundary S of Σ, which is nothing but Eq. (5.45) with
n = 4. He proved that such an integral is independent of the surface S as soon as S is located outside
146                                               Stationary black holes
      the sources of the Einstein equation (Property 5.15). The generalization to dimensions n ≥ 4 has been
      performed by Robert Myers and Malcolm Perry in 1986 [376] [cf. their Eq. (4.1), where N = n − 1].
        Under the same hypotheses as Property 5.14, one can rewrite Eq. (5.55) as
                                            Z                                   
                                                            1            n−2         √ n−1
                    MSext = MSint +               N E+         S       −     ⟨p, β⟩   γ d x,        (5.62)
                                              Σ            n−3           n−3
      Proof. From Eq. (5.60), the trace of the energy-momentum tensor is T := g µν Tµν = Enµ nµ +
      nµ pµ + pµ nµ + S = −E + S, since nµ nµ = −1 and pµ nµ = 0. In view of Eqs. (5.59) and (5.60),
      we can then rewrite the integrand in the r.h.s. of Eq. (5.55) as
                      T                                S−E       N
      T (n, ξ)−          n·ξ = N T (n, n) + T (n, β) −     n·ξ =   [(n − 3)E + S]−⟨p, β⟩.
                     n−2         | {z } | {z } n − 2 |{z} n − 2
                                              E         −⟨p,β⟩              −N
Example 7 (Komar mass of a weakly relativistic star): Let us check that formula (5.62) reduces to
(5.22) for a weakly relativistic star in a n-dimensional spacetime. In this case, we have N ≃ 1 and β → 0.
Furthermore, the matter constituting the star shall be weakly relativistic as well, which implies that
the trace S of the stress tensor is negligible in front of the energy density E. Under these hypotheses,
Eq. (5.62) with MSint = 0 yields MSext = M , where M is given by (5.22), given that T00 ≃ E.
5.3.5     Link between the Komar mass and the ADM mass
For generic (i.e. not necessarily stationary) asymptotically flat spacetimes, there exists the
concept of ADM energy-momentum, ADM standing for Arnowitt, Deser and Misner [20]. It
arises from the Hamiltonian formulation of general relativity developed by these authors. For
a spacelike hypersurface Σ that is asymptotically Euclidean, the ADM energy of Σ is defined
as some flux over a closed (n − 2)-sphere S of quantities involving derivatives of the metric
γ induced by g on Σ, in the limit where the coordinate radius of S tends to infinity, while the
3 components of the ADM momentum of Σ are defined by similar integrals with quantities
involving the extrinsic curvature tensor K of Σ (see e.g. [298], Chap. 8 of [227], Sec. 4.3 of
[416], Sec. 3.7 of [464] or Sec. 3.2.1 of [471] for details). The ADM mass of Σ is then defined as
the Minkowskian norm of the ADM energy-momentum. For a vanishing ADM momentum,
the ADM mass coincides with the ADM energy. One may then ask the relation with the Komar
mass if the spacetime is stationary. The answer is very simple:
For n = 4, the proof can be found in R. Beig’s article [49] (see also Ref. [28] for a different proof),
while the proof for n > 4 has been obtained by H. Barzegar, P.T. Chruściel and M. Hörzinger
[43].
  A spacetime (M , g) is called axisymmetric iff (i) it is invariant under the action of the
  rotation group SO(2) (or equivalently U(1)) and (ii) the orbits of the group actiona are
  everywhere spacelike curves or (ii’) (M , g) admits a conformal completion (cf. Sec. 4.3)
  and the orbits of the group action are spacelike in the vicinity of the conformal boundary
  I . Since SO(2) is a one-parameter Lie group, its action is generated by a single Killing
  vector η, which is unique up to some scaling constant. The orbits of the SO(2) action being
  closed curves, we shall fix the scaling constant so that the group parameter φ associated
148                                             Stationary black holes
        with η has the standard periodicity 2π. If non-empty, the set of points of M that are left
        invariant by the SO(2) action is called the rotation axis. Equivalently, the rotation axis is
        the set of points where the Killing vector η vanishes. Usually, if the dimension of M is
        n, the rotation axis is a submanifold of M of dimension n − 2. Moreover, it is a timelike
        submanifold in the region where the SO(2) orbits are spacelike.
            a
                Cf. Sec. 3.3.1 for the concepts of group action and orbit.
(dη)αβ = ∂α ηβ − ∂β ηα = ∇α ηβ − ∇β ηα . (5.64)
      Remark 8: Besides the change ξ ↔ η, we notice a difference by a factor −(n − 2)/(n − 3) between
      the r.h.s. of (5.36) and (5.63). For n = 4, this factor is −2 and is known as Komar’s anomalous factor
      (see Ref. [309] for a discussion). Fundamentally, the independence of formula (5.63) from the spacetime
      dimension n stems from the fact that the SO(2) action generated by η is effective only in a 2-dimensional
      space: the orthogonal complement of the rotation axis.
      Remark 9: The Komar angular momentum is well defined for any spacetime dimension n ≥ 3, while
      the definition of the Komar mass requires n ≥ 4 (cf. Sec. 5.3.3).
          Formulas (5.45) and (5.46) for the Komar mass rely only on the fact that ξ is a Killing vector.
      In an axisymmetric spacetime, we may therefore replace ξ by η and the numerical prefactor by
      1/(16π) to get the following expressions for the Komar angular momentum:
                                                    Z
                                                 1
                                          JS =         ∇µ ην dS µν                                 (5.65)
                                                16π S
      and
                                                                                 √
                                                     Z
                                               1
                                       JS   =              ∂µ ην (sµ nν − nµ sν ) q dn−2 x .             (5.66)
                                              16π      S
          As for the Komar mass (cf. Property 5.11), one can read the Komar angular momentum in
      the asymptotic expansion of the metric tensor in appropriate coordinates:
        Property 5.19: Komar angular momentum in the asymptotic expression of the
        metric
  axisymmetry in the sense that η = −x2 ∂x1 +x1 ∂x2 (i.e. the rotation axis of the SO(2) action
  is the (n − 2)-dimensional surface (x1 , x2 ) = (0, 0)). Then
                                                              p the coefficient J(1) = J12 in
  the expansion (5.31)-(5.34) of g coincides in the limit r := (x1 )2 + · · · + (xn−1 )2 → +∞
  with the Komar angular momentum JS over the sphere S of constant (x0 , r). In particular,
  for a weakly relativistic system, JS = J12 , where J12 is given by the volume integral (5.23).
Proof. Let us use formula (5.66) to evaluate JS . From η = −x2 ∂x1 + x1 ∂x2 and the asymptotic
expression (5.31) of g, we get the following components of η with respect to the coordinates
(xα ) in the limit r → +∞: ηα = gαµ η µ = gα1 (−x2 ) + gα2 x1 ≃ (η0 , −x2 , x1 , 0, . . . , 0), with
where we have used Eq. (5.32) to express αn . The integrand in Eq. (5.66) is then, taking into
account that ∂0 = 0 (stationarity) and s0 = 0 (s tangent to the hypersurface x0 = const):
= n0 si ∂i η0 + 2(s1 n2 − s2 n1 ).
      S , i.e. coordinates such that all the θk ’s span (0, π), φ spans (0, 2π) and the asymptotically
      Minkowskian coordinates (xi )1≤i≤n−1 are expressible as10
                           
                            x1 = r sin θ1 sin θ2 · · · sin θn−4 sin θn−3 cos φ
                           
                           
                           
                           
                                x2 = r sin θ1 sin θ2 · · · sin θn−4 sin θn−3 sin φ
                           
                           
                           
                           
                           
                            x3 = r sin θ sin θ · · · sin θ
                           
                           
                                                1    2          n−4 cos θn−3
                           
                                x4 = r sin θ1 sin θ2 · · · cos θn−4
                                     ..
                           
                                      .
                           
                           
                           
                           
                           
                           
                              xn−2 = r sin θ1 cos θ2
                           
                           
                           
                           
                           
                           
                            xn−1 = r cos θ
                           
                                                 1
      Then (x1 )2 +(x2 )2 = r2 sin2 θ1 · · · sin2 θn−3 . Given that S is the (n−2)-sphere (x0 , r) = const,
                           √            √
      one has, for r large, q ≃ rn−2 q̄, where q̄ is the determinant with respect to the coordinates
      (y a ) of the round metric on Sn−2 :
                                √
                                  q̄ = sinn−3 θ1 sinn−4 θ2 · · · sin2 θn−4 sin θn−3 .                (5.67)
      It follows that
                        (n − 1)J(1)
                                    Z
                 JS   =                sinn−1 θ1 sinn−2 θ2 · · · sin4 θn−4 sin3 θn−3 dθ1 · · · dθn−3 dφ
                          2Ωn−2      S
                        (n − 1)J(1)
                      =             × 2πIn−1 In−2 · · · I3 ,                                            (5.68)
                          2Ωn−2
      where we have introduced                                Z    π
                                                      Ik :=            sink θ dθ.
                                                               0
        10
             Note that these formulas generalize the standard expression (4.2) with (x, y, z) = (x1 , x2 , x3 ) to n ≥ 4.
                                      5.3 Mass and angular momentum                                            151
Property 5.20: volume integral formula for the Komar angular momentum
                                               √
                                Z
                           1
        JSext = JSint   +             J (η) · n γ dn−1 x,       with J (η)α = −2Rαµ η µ ,          (5.69)
                          16π     Σ
                                                     √
  where n is the future-directed unit normal to Σ and γ dn−1 x is the volume element on Σ
  induced by g. The vector J (η) is the Komar current of η [compare Eq. (5.53)]. If g obeys
  the Einstein equation (1.42) with some energy-momentum tensor T and Λ = 0, then the
  above formula may be rewritten as
                                        Z                      
                                                          T       √ n−1
                     JSext = JSint −          T (η, n) −     η·n   γ d x,                          (5.70)
                                          Σ              n−2
Remark 10: Since the axisymmetry generator η is spacelike, the spacelike hypersurface Σ can be
chosen so that η is tangent to it; one has then η · n = 0 and the second term in the integrand of Eq. (5.70)
vanishes identically. Moreover, if one expresses T via the N +1 decomposition (5.60), one can write the
first term as T (η, n) = −⟨p, η⟩, so that formula (5.70) simplifies to
                                                              √
                                                     Z
                                    JSext = JSint + ⟨p, η⟩ γ dn−1 x.                                  (5.71)
                                                     Σ
As for the Komar mass (Property 5.15), it follows immediately from Eq. (5.70) that
  Property 5.21: independence of the Komar angular momentum from the integra-
  tion surface
  Two (n − 2)-surfaces Sext and Sint that are connected by a spacelike hypersurface Σ in
  a vacuum region of an axisymmetric spacetime ruled by the Einstein equation share the
  same Komar angular momentum:
   As for the total mass (cf. Property 5.16), the concept of “total angular momentum” of
an axisymmetric and asymptotically flat spacetime is well captured by the Komar angular
momentum at infinity:
152                                         Stationary black holes
        the Komar angular momentum at infinity or total angular momentum of the space-
        time (M , g). Moreover, if g obeys the Einstein equation with Λ = 0 and if the energy-
        momentum tensor T vanishes for r larger than some threshold r0 , one has J∞ = JSr for
        any r ≥ r0 .
      Remark 11: There is no concept of “ADM angular momentum” (due to the so-called supertranslation
      ambiguity at spatial infinity, cf. e.g. Sec. 6 of Ref. [523] or Sec. 8.5 of Ref. [227]). Consequently, there is
      no point in comparing the Komar angular momentum with an “ADM” counterpart, contrary to what
      was done in Sec. 5.3.5 for the mass.
        11
           In the most astrophysically relevant case, namely that of a stationary black hole in a 4-dimensional vacuum
      spacetime, the event horizon is a connected hypersurface. However, we shall see that disconnected event horizons
      exist for some stationary black holes in 5-dimensional spacetimes; they exist as well in 4 dimensions if one allows
      for an electric charge (Majumdar-Papapetrou black holes mentioned in Remark 5 in Sec. 5.2.3).
                                  5.4 The event horizon as a Killing horizon                           153
Proof. For simplicity, we give only the proof for the case of the vacuum Einstein equation
(1.44), referring to Sudarsky & Wald’s article [468] for the electrovacuum case. Since H is a
non-degenerate Killing horizon, it is contained in a bifurcate Killing horizon (cf. Property 3.15
and Remark 3 on p. 87). Let S be the corresponding bifurcation surface. Thanks to a theorem
by Chruściel & Wald (Theorem 4.2 in Ref. [129]), the domain of outer communications can
be foliated by a family of spacelike hypersurfaces (Σt )t∈R that are asymptotically flat, are
asymptotically orthogonal to ξ, have S as inner boundary, are Lie-dragged12 to each other
by ξ and are maximal. The last property means that the extrinsic curvature tensor K of
each hypersurface Σt has a vanishing trace: K i i = 0, or equivalently, that the unit normal n
to Σt is divergence-free : ∇µ nµ = 0 (cf. Sec. 10.2.2 of Ref. [227]). Let us consider the N + 1
decomposition with respect to Σt introduced in Sec. 5.3.4. The maximality property K i i = 0
allows one to simplify one of the Einstein equations in the N + 1 decomposition, namely the
momentum constraint (see e.g. Chap. 5 of Ref. [227]), which reduces to Dj Kij = 8πpi , where
D is the Levi-Civita connection associated to the metric γ induced by g on Σt and p is the
matter momentum density in the hypersurface Σt (cf. Sec. 5.3.4). Since we consider the vacuum
case, we have p = 0. Hence the momentum constraint reduces to Dj Kij = 0. Contracting
with the shift vector β arising from the orthogonal split (5.59) of ξ, we get
where the last equality results from the symmetry of Kij and the identity Dj β i +Di β j = 2N K ij ,
which holds for the shift vector β associated with the stationary Killing vector ξ (see e.g.
Eqs. (5.66) and (5.67) in Ref. [227] with ∂γij /∂t = 0). Let us integrate the above equation over
Σt and use the Gauss-Ostrogradsky theorem for the divergence term Dj (Kij β i ), taking into
account that the boundary of Σt is made of the bifurcation sphere S (inner boundary) and the
  12
       Σt+dt is the image of Σt by the infinitesimal displacement dt ξ.
154                                        Stationary black holes
      sphere “at infinity” S∞ (outer boundary), the latter being a (n − 2)-sphere in Σt of coordinate
      radius r → +∞ (r is defined from the asymptotic flatness property); we get
                           i√                     i√                           √
              Z                        Z                         Z
                     j          n−2        j             n−2
                   s Kij β q d x −        s Kij β q d x −            N Kij K ij γ dn−1 x = 0,
                S∞                             S                            Σt
      where s is the unit normal to S∞ and S in Σt , oriented towards the asymptotic flat end of
      Σt . Now, each of the two surface integrals vanishes identically: the first one because on the
      asymptotically flat slice Σt , K decays as O(r−2 ) for r → +∞ and, still in this limit, β → 0
      since ξ is asymptotically orthogonal to Σt [cf. the orthogonal decomposition (5.59)]. As for the
      integral over S , it is zero because β|S = 0. Indeed, since S is the bifurcation surface of a
      bifurcate Killing horizon with respect to ξ (Property 5.23), one has ξ|S = 0 (Property 3.12),
      which implies via Eq. (5.59) both N |S = 0 and β|S = 0. We therefore get
                                                      √
                                       Z
                                           N Kij K ij γ dn−1 x = 0.
                                              Σt
      Given that N > 0 on Σt \ S and Kij K ij ≥ 0 on Σt (for Kij K ij = γ ik γ jl Kij Kkl with
      γ being a Riemannian metric), the vanishing of the above integral implies Kij = 0. Now,
      K = −(2N )−1 Lm γ (cf. Eq. (4.30) in Ref. [227]), where m := N n is the evolution vector
      along the normal to Σt (cf. Sec. 4.3 of Ref. [227]). The vanishing of K implies then
                                  γ̇ := Lm γ = 0           and       π̇ := Lm π = 0,                           (5.74)
      where π is the momentum canonically conjugate to γ in the ADM Hamiltonian formulation of
      general relativity [20, 196]. π is a tensor density that is related to the extrinsic curvature tensor
                      √
      K by π ij = γ(K kk γ ij − K ij ). Hence, K = 0 implies π = 0 so that π̇ = 0 in Eq. (5.74) is
      trivially fulfilled. In the Hamiltonian formulation, the spacetime dynamics is entirely described
      by the “time development” (i.e. the variation across the Σt foliation) of (γ, π), which are
      the (q, p) pairs of Hamiltonian mechanics. Hamilton’s equations (5.74) shows that (γ, π) do
      not evolve at all. This means that m is a Killing vector of the spacetime metric g. Since
      m = N n is by construction timelike and hypersurface-orthogonal, this proves that the part of
      spacetime foliated by the (Σt )t∈R , i.e. the domain of outer communications, is strictly static.
      The connection between m and ξ is given by Eq. (5.59): ξ = m + β. Hence the Killing vectors
      ξ and m coincide iff the shift vector β vanishes. If β would not vanish identically, it would be
      a Killing vector itself, as the difference ξ − m of two Killing vectors. However, as seen above, β
      vanishes at the inner boundary S of Σt and at the “outer boundary”, i.e. for r → +∞. In most
      cases, these constraints will result in β = 0 (for instance, they exclude β being a rotational
      Killing vector), so that we conclude that m = ξ.
      Remark 1: In the original formulation by Sudarsky & Wald (Theorem 2 of Ref. [468]), the non-degeneracy
      of H is replaced by H being part of a bifurcate Killing horizon. However, as mentioned in the above
      proof, these two properties are essentially equivalent.
      Remark 2: Sudarsky & Wald stated the staticity theorem for the dimension n = 4 but, as pointed out
      in Refs. [118, 123, 280], the theorem straightforwardly generalizes to n ≥ 4. Actually, the proof given
      above13 does not rely on n = 4.
        13
         The first part of the proof, leading to K = 0, differs from Sudarsky & Wald’s one, which is based on integral
      mass formulas.
                                 5.4 The event horizon as a Killing horizon                                             155
                                                    (a)                                                        (b)
Figure 5.1: Two equivalent representations of an event horizon H with cross-sections of Sn topology and with
a stationary Killing vector field ξ that is spacelike on H : (a) Representation with the null geodesic generators of
H drawn as vertical lines; two of them are actually depicted, in dark green and light green respectively, with a
null normal ℓ along them; besides, two field lines of ξ (orbits of the stationary group) are depicted, in black and
brown respectively. (b) Representation with the field lines of ξ as vertical lines. The color code is the same as
in (a) and labelled points (a, b, etc.) help to identify the two figures. A few light cones are drawn in each figure;
note that ξ, being spacelike, lies outside of them, while the null normal ℓ is tangent to them. The strong rigidity
theorem (Property 5.25) states that ℓ is collinear to a Killing vector field χ, making H a Killing horizon.
Remark 3: Given the definition of a non-rotating horizon given in Property 5.2, the staticity theorem can
be summarized as follows: an electrovacuum spacetime containing a non-rotating and non-degenerate
black hole is static.
      However, modulo some additional hypotheses, it turns out that H is still a Killing horizon,
      albeit with respect to a Killing vector distinct from ξ. This result is due to S.W. Hawking (1972)
      [260, 266] and is known as the (strong) rigidity theorem. We give below a modern version of
      this theorem, due to Hollands, Ishibashi & Wald (2007) [282] and Moncrief & Isenberg (2008)
      [372] (see also Theorem 8.1 p. 470 of Choquet-Bruhat’s textbook [108]).
           1. M and H are (real) analytic manifolds (cf. Remark 4 in Sec. A.2.1), with g being an
              analytic field,
           2. g fulfills the vacuum Einstein equation (1.44) or the electrovacuum Einstein equation
              (1.54),
        then the spacetime (M , g) admits a second Killing vector, χ say, such that ξ and χ
        commute: [ξ, χ] = 0 and H is a Killing horizon with respect to χ.
      Sketch of proof. The proof of the strong rigidity theorem is pretty long and we give here only a
      sketch for n = 4, referring to Hawking & Ellis’ textbook [266] (Proposition 9.3.6) for full details
      and to Refs. [282, 280, 372] for n > 4. Let us thus assume n = 4 and that the (electro)vacuum
      Einstein equation holds. The null dominant energy condition is then satisfied and, since H is
      connected, the topology theorem 1 (Property 5.4) implies that the complete cross-sections of
      H have the topology of S2 , or equivalently that the topology of H is R × S2 .
          Let us first show that the spacetime stationary action Φ : R × M → M , (t, p) 7→ Φt (p)
      induces a SO(2) action Φ̂ on the set G of null geodesic generators of H . Each Φt being an
      isometry of (M , g), it maps any null geodesic L of H to a null geodesic Φ̂t (L ) := Φt (L ).
      Morever, since Φt leaves H globally invariant (Property 5.1), Φ̂t (L ) ⊂ H , i.e. Φ̂t (L ) ∈ G . Let
      check that Φ̂ obeys the laws of a group action: obviously Φ̂0 (L ) = L and for any (t1 , t2 ) ∈ R2
      and p ∈ L , we have Φt2 +t1 (p) ∈ Φ̂t2 +t1 (L ) but Φt2 +t1 (p) = Φt2 (Φt1 (p)) ∈ Φ̂t2 (Φ̂t1 (L )) as
      well, from which we conclude that Φ̂t2 +t1 (L ) = Φ̂t2 (Φ̂t1 (L )), given that there is only one null
      geodesic generator through the point Φt2 +t1 (p). Since the stationary generator ξ is assumed to
      be spacelike on some parts of H , the group action is not trivial: there exist some null generators
      L for which Φ̂t (L ) ̸= L (when ξ is null on a part of H , it is necessarily tangent to a null
      generator L , which is then a fixed point of the group action: ∀t ∈ R, Φ̂t (L ) = L ). Moreover,
      the group action Φ̂ is periodic: due to the topology R × S2 of H , there exists a smallest T > 0
                                5.4 The event horizon as a Killing horizon                                            157
Figure 5.2: Action Ψ induced by the spacetime stationary action Φ (Killing vector ξ) on a cross-section S
of a rotating event horizon H : for p ∈ S and t ∈ R, Ψt (p) is denoted by p1 on the figure, while p0 = Φ−t (p).
Similarly, for a point q ∈ S close to p, q0 = Φ−t (q) and q1 = Ψt (q). Note that this figure is drawn following the
same convention as Fig. 5.1a, namely the null geodesic generators of H appear as vertical lines.
such that for any null generator L , Φ̂T (L ) = L (cf. Fig. 5.1, where a′ = ΦT (a), b′ = ΦT (b),
c′ = ΦT (c) and d′ = ΦT (d), noticing that each pair (p, p′ ), where p stands for a, b, c or d, lie
on the same null generator). Hence Φ̂ is a SO(2) action on the set G of null generators of H ,
the standard angle parameter φ ∈ [0, 2π) of SO(2) being related to t ∈ [0, T ) by φ = 2πt/T .
Moreover, the SO(2) action must have exactly two fixed points, L1 and L2 say, for G has the
topology of S2 : one can identify G with an arbitrary complete cross-section Sˆ of H (each
L ∈ G intersects Sˆ at a single point and through each p ∈ Sˆ there is a single L ∈ G ) and,
as recalled above, all cross-sections have the topology of S2 .
    Let S be a complete cross-section of H such that the stationary Killing vector ξ is nowhere
tangent to S . Let q be the metric induced by g on S . We are going to show that the spacetime
stationary action Φ induces a SO(2) action Ψ : [0, T ) × S → S , (t, p) 7→ Ψt (p) such that
each Ψt is an isometry of (S , q). For t ∈ R and p ∈ S , the point Φ−t (p) lies in H since
H is globally invariant by Φt (Property 5.1). Let then L be the unique null generator of H
through Φ−t (p). Since S is a complete cross-section of H , L intersects S at a unique point,
which we define as Ψt (p) (cf. Fig. 5.2, where L is denoted Φ̂−t (Lp )). In other words, Ψt (p) is
obtained by displacing p by a parameter −t along the field lines of ξ and then going back to S
along a null generator of H . Equivalently, we may write
                                          Ψt (p) = Φ̂−t (Lp ) ∩ S ,                                         (5.75)
where Lp is the null geodesic generator through p (cf. Fig. 5.2). Let us check that Ψ is a
1-parameter group action on S , i.e. fulfills Ψ0 (p) = p and Ψt2 +t1 (p) = Ψt2 (Ψt1 (p)) for
158                                           Stationary black holes
      any p ∈ S (cf. Sec. 3.3.1) and (t1 , t2 ) ∈ R2 . The first property is trivial. Regarding the
      second one, Ψt1 (p) lies on the same null generator L as Φ−t1 (p) (by definition of Ψt1 ). It
      follows that both Φ−t2 (Ψt1 (p)) and Φ−t2 (Φ−t1 (p)) lie on the null generator Φ̂−t2 (L ). But
      Φ−t2 (Φ−1 (p)) = Φ−t2 −t1 (p), so that Ψt2 +t1 (p) = Φ̂−t2 (L ) ∩ S = Ψt2 (Ψt1 (p)). Hence Ψ :
      R × S → S , (t, p) 7→ Ψt (p) is 1-parameter group action on S . Moreover this group
      action inherits the T -periodicity from Φ̂ and thus is a SO(2) action. Indeed, Eq. (5.75) implies
      ΨT (p) = Φ̂−T (Lp ) ∩ S = Lp ∩ S = p (see also Fig. 5.2, where Φ−T (p) lies on the same null
      generator as p).
          Let us now show that for any t ∈ [0, T ), Ψt is an isometry of (S , g). Let p and q be
      infinitely close points of S . Because Φ−t is a spacetime isometry, we have
                                      q(→
                                        −
                                        pq, →
                                            −
                                            pq) = g|p (→
                                                       −
                                                       pq, →
                                                           −
                                                           pq) = g|p0 (−
                                                                       p−→ −−→
                                                                        0 q0 , p0 q0 ),                     (5.76)
      where p0 := Φ−t (p) and q0 := Φ−t (q) (cf. Fig. 5.2). Now, p0 and q0 are two points of S0 :=
      Φ−t (S ), which is a complete cross-section of H that has no intersection with S for t ̸= 0,
      given that ξ is transverse to S . One can then choose a parametrization λ of the null generators
      L of H such that λ = 0 on S0 and λ = 1 on S . Let ℓ = dx/dλ|L be the associated tangent
      vector. One may consider that S0 and S are the two ends of a foliation of H by a family
      (Sλ )λ∈[0,1] of complete cross-sections (same construction as in the proof of Property 3.1). By
      denoting by q the metric induced by g on each of the surfaces Sλ , we have g|p0 (−         p−→ −−→
                                                                                                  0 q0 , p 0 q0 ) =
      q(−p−→ −−→
          0 q0 , p0 q0 ). Now, since H is a non-expanding horizon (Property 5.3) and the null energy
      condition is fulfilled (since we are considering the (electro)vacuum Einstein equation), the
      metric q is preserved along ℓ: →        −q ∗ Lℓ q = 0 [Property 3.2]. Since p1 := Ψt (p) and q1 := Ψt (q)
      are obtained from p0 and q0 by the same displacement δλ = 1 along ℓ (cf. Fig. 5.2), it follows that
      q(−p−→ −−→             −−→ −−→                                                   →
                                                                                       − →−         −−→ −−→
          0 q0 , p0 q0 ) = q(p1 q1 , p1 q1 ). By combining with Eq. (5.76), we get q(pq, pq) = q(p1 q1 , p1 q1 ),
      which proves that Ψt is an isometry of (S , q).
          At this stage, we have shown that each cross-section of H admits an isometry group
      isomorphic to SO(2), i.e. is axisymmetric. One thus may say that, in addition to be stationary,
      H is axisymmetric, although the concept of axisymmetry as the result of the metric invariance
      under a group action (cf. Sec. 3.3.1) is ill-defined for H , given that, as a null manifold, H is
      not endowed with a proper (non-degenerate) metric tensor.
          The major step is to show that there exists a Killing vector field χ of (M , g) that, on
      H , is tangent to the null generators; it will then follow that the whole spacetime (M , g)
      is axisymmetric, the generator of the SO(2) isometry being T /(2π)(χ − ξ). We first define
      χ on H only, by demanding that χ is the tangent vector to the null generators L of H
      corresponding to a parameter v of L that is connected to the parameter t of the stationary
      action Φ, in the sense that for any point p of a given null generator14 L ,
      The above condition is not sufficient to fully specify the parameter v, and hence χ, along L .
      We require in addition that the non-affinity coefficient κ of χ is constant along L : ∇χ κ = 0.
                                                           →
                                                           −
      In view of Eq. (2.21), this is equivalent to χ = −eρ ∇u with d2 ρ/dv 2 = 0 along L , where u is
        14
             Recall that on H , ΦT (p) and p always lie on the same null generator.
                             5.4 The event horizon as a Killing horizon                                     159
a scalar field such that H is the hypersurface u = 0. By construction, the vector field χ on H
is invariant under the stationary action, i.e. Lξ χ = 0, so that ξ and χ commute: [ξ, χ] = 0.
Let then define the following vector field on H :
                                    1                                   2π
                            η :=      (χ − ξ) ,        with    ΩH :=       .                       (5.78)
                                   ΩH                                   T
By construction, η has closed field lines of period 2π. Indeed, since ξ and χ commute, moving
by a parameter 2π along η is equivalent to moving by a parameter t = −T along the field
lines of ξ and then by a parameter v = T along the field lines of χ; the property (5.77) ensures
that one is back to the starting point. The vector field η vanishes at points where ξ = χ, i.e.
at points where ξ is tangent to some null generator of H . This occurs along null generators
that are fixed points of the group action Φ̂ on G . As discussed above, there are only two such
fixed points, L1 and L2 . These two null generators of H define thus the common “axis” of
all the rotations generated by η. Let C be a curve in H from L1 and L2 , orthogonal to η
and such that the field lines of η from C form a smooth cross-section S of H . Let us then
denote by Sv the cross-section obtained by displacing S by a parameter v along χ. Finally,
let k be the future-directed null vector orthogonal to Sv and complementary to χ, normalized
such that χ · k = −1 (k is transverse to H and is pointing towards the black hole region).
In a neighborhood of H , we introduce a spacetime coordinate system (v, r, θ, φ), named
Gaussian null coordinates, as follows. First of all, we choose spherical coordinates (θ, φ)
on the cross-section S , such that η = ∂φ and θ is a parameter along the “meridional” curve
C . To any point p ∈ H , we assign then the coordinates (v, θ, φ) where v is such that p ∈ Sv
and (θ, φ) are the coordinates of the intersection of S and the null generator L through p.
Let us then consider the null geodesics L ′ of tangent vector k, which are transverse to H
and an affine parameter r along each of them such that, on H , r = 0 and k = −dx/dr. To
any point p ∈ L ′ in the vicinity of H , we assign the coordinates (v, r, θ, φ) such that r is the
affine parameter of p along L ′ and (v, θ, φ) are the coordinates of the intersection of L ′ and
H . We may then extend the vector fields χ, η and k away from H by setting
                                 χ := ∂v ,    η := ∂φ ,    k := −∂r .
By means of the (electro)vacuum Einstein equation, one can then show by induction that,
                                                                  H
                                   ∀N ∈ N,     Lk · · · Lk Lχ g = 0.                               (5.79)
                                               | {z }
                                                N times
This is the lengthy part of the computation (see p. 343 of Ref. [266]). Once (5.79) is established,
one can use the analyticity hypothesis to conclude that Lχ g = 0 in all M , i.e. that χ is a
Killing vector of (M , g). Since, on H , χ is normal to H , it follows that H is a Killing horizon
with respect to χ.
Remark 4: For n ̸= 4, the Killing horizon H is necessarily non-degenerate, given that H must have
an incomplete null geodesic generator in that case (third hypothesis of the theorem).
Remark 5: For n = 4, the stationary Killing vector ξ cannot be spacelike on the whole of H : it is
indeed null on the rotation axis, i.e. at the points where η = 0, given that Eq. (5.78) implies ξ = χ for
η = 0.
160                                         Stationary black holes
      Remark 6: In some formulations [282, 266], the hypothesis of ξ being transverse to some cross-sections
      is replaced by H lying in the future of the past infinity I − . Actually it can be shown that the latter
      implies the former (see e.g. the proof of Proposition 9.3.1 in Ref. [266]).
      Remark 7: The strong ridigity theorem may not hold in theories of gravity distinct from general
      relativity (hypothesis 2 of Property 5.25). For instance, the so-called disformed Kerr solution [16, 54]
      describes a n = 4 stationary black hole in a higher-order scalar-tensor theory (belonging to the so-called
      DHOST family, for Degenerate Higher-Order Scalar-Tensor [338]), the event horizon of which does not
      seem to be a Killing horizon [16].
      Historical note : Stephen Hawking first established the strong rigidity theorem in 1972 [260] for a
      spacetime of dimension n = 4 and assuming that there exists a past event horizon (i.e. a white hole
      region) in addition to the future event horizon H . The theorem was restated and proved without the
      white hole hypothesis in Hawking & Ellis’ famous textbook published in 1973 [266] (Proposition 9.3.6).
      An account by Hawking himself about this reformulation can be found in Sec. 8 of Ref. [261]. The
      demonstration presented in Ref. [266] relies on the vacuum Einstein equation, but it is noted there
      that “similar arguments hold in the presence of matter fields, like the electromagnetic or scalar fields,
      which obey well-behaved hyperbolic equations”. Some details in Hawking’s proof have been fixed by
      Piotr T. Chruściel in 1997 [116]. The extension to spacetimes of dimension n ≥ 4 has been performed
      independently by Stefan Hollands, Akihiro Ishibashi and Robert M. Wald in 2007 [282] and by Vincent
      Moncrief and James Isenberg in 2008 [372]. The treatment of the electrovacuum case is done explicitly
      in Hollands, Ishibashi & Wald’s work [282].
          The strong rigidity theorem relies on the rather strong assumption that the manifolds and
      fields are analytic. On physical grounds, it would be desirable to assume only smooth manifolds
      and fields. In 1999, H. Friedrich, I. Rácz, and R.M. Wald [200] could remove the analyticity
      hypothesis but only towards the interior of the black hole, i.e. they could prove that the vector
      χ normal to H can be extended to a Killing vector of the black hole interior. Another partial
      success has been achieved in 2014 by S. Alexakis, A.D. Ionescu and S. Klainerman [11], who
      proved the strong rigidity theorem without the analyticity assumption, but only for slowly
      rotating black holes. See Ref. [288] for a review of the progresses in removing the analyticity
      hypothesis.
          The strong ridigity theorem has been extended beyond general relativity (assumption 2
      in Property 5.25) by Hollands, Ishibashi and Reall in 2023 [281]. These authors have shown
      that, under some assumptions, the theorem holds in gravity theories with actions involving
      quadratic and higher orders combinations of the curvature tensor.
          It turns out that, for each connected component of the even horizon, the Killing vector χ is
      expressible as a linear combination of the stationary Killing vector ξ and some Killing vectors
      generating axisymmetries:
        Under the same hypotheses as in Property 5.25, there exist L Killing vectors η(1) , . . . , η(L)
        with 1 ≤ L ≤ [(n − 1)/2], which have closed orbits of period 2π, such that
  where Ω(1) , . . ., Ω(L) are L real constants, all of whose ratios are irrational.
     For n = 4, one has [(n − 1)/2] = 1, so that necessarily L = 1. Denoting η(1) simply by
  η and Ω(1) by ΩH , Eq. (5.81) reduces to
χ = ξ + ΩH η. (5.82)
  The constant ΩH is then called the black hole rotation velocity if H coincides with the
  whole event horizon (i.e. if the event horizon is connected).
    Since it has closed orbits, the isometry group generated by each Killing vector η(i) is SO(2).
Each isometry is thus an axisymmetry, according to the definition given in Sec. 5.3.6. For n = 4,
Eq. (5.82) is equivalent to Eq. (5.78). The constancy of ΩH in Eq. (5.82) can be interpreted
by saying that the event horizon is rotating rigidly (no differential rotation) and justifies the
name rigidity theorem. For n > 4, we refer to the original articles [282, 372] for the proof of
Property 5.26.
Remark 8: Property 5.26 is often considered as the second part of the strong rigidity theorem. In some
statements, it is even incorporated into the strong rigidity theorem (e.g. Proposition 9.3.6 of Ref. [266]).
Remark 9: If the event horizon has various connected components, the Killing vectors η(i) and the
constants Ω(i) may vary from one connected component to the other. In particular, the horizon-
generating Killing vector χ may not be the same for each connected component. For instance, in
dimension n = 5, the black saturn found by H. Elvang and P. Figueras in 2007 [184] is a stationary
black hole, the event horizon of which has two connected components: Hp (the “planet”) and Hr (the
“ring”), with topology R × S3 and R × S1 × S2 respectively. For both Hp and Hr , L = 1 and the
two components share the same axisymmetry Killing vector η(1) , but they may have different angular
             (1)      (1)
velocities: ΩHp ̸= ΩHr , and hence different normal Killing vectors: χHp ̸= χHr .
    In the literature, the strong rigidity theorem is sometimes called simply the rigidity theorem,
without the qualifier strong (e.g. Refs. [108, 280]). The terminology strong rigidity, employed
in Refs. [103, 123, 275], aims to distinguish from the so-called weak rigidity theorem proved
by Carter in 1969 (corollary to Theorem 1 in Ref. [92]; see also [94], Sec. 4.5 of Ref. [101] and
Sec. 6.3 of Ref. [275]). The latter asserts that the event horizon of a black hole in a stationary,
axisymmetric and circular15 spacetime must be a Killing horizon. The strong rigidity theorem is
stronger in that it assumes only the stationarity of the spacetime, the axisymmetry becoming a
consequence (Property 5.26). On the other hand, the weak rigidity theorem is stronger in so far
as it does not rely on the Einstein equation, nor on the assumption of analyticity.
    By the very definition of stationarity, the Killing vector field ξ is timelike in the vicinity of
I and I − . If ξ is spacelike on some parts of H , as assumed in this section, by continuity it
   +
must be spacelike in some part of the domain of outer communications ⟨⟨M ⟩⟩ near H . The
  15
     A stationary and axisymmetric spacetime is said to be circular iff the R×SO(2) group action is orthogonally
transitive, i.e. the 2-surfaces generated by the two commuting Killing vectors ξ and η are orthogonal to a family
of (n − 2)-dimensional surfaces [92].
162                                   Stationary black holes
          The surface gravity κ of an extremal black hole is identically zero and the null generators
      of its horizon are complete geodesics: the parameter t associated to the stationary Killing ξ is
      an affine parameter ranging in the whole of R. Standard examples are the extremal Reissner-
      Norström black hole (presented briefly in Sec. 5.6.1) and the extremal Kerr black hole (to be
      discussed in Chap. 13).
  Let us assume that the null dominance condition (3.43) is fulfilled on Hk or that Hk is
  part of a bifurcate Killing horizon; by the zeroth law of black hole dynamics (Property 3.10
  or Property 3.16), this implies that κk is constant. Then the Komar mass MHk over any
  cross-section of Hk , as defined by Eq. (5.36), obeys
                                                          Lk
                                                                       !
                                    n−2        κk Ak     X     (i) (i)
                          MHk =                      +2      ΩHk JHk ,                  (5.83)
                                   2(n − 3)     4π       i=1
             (i)
  where JHk is the Komar angular momentum with respect to the axisymmetric Killing
  vector ηHk (i) over any cross-section of Hk as given by Eq. (5.63). In Eq. (5.83), it is intended
  that Lk = 0 if χk = ξ, so that there is no sum over i in that case. Furthermore, the Komar
  mass at infinity (cf. Property 5.16) obeys the generalized Smarr formula:
                             K                  L
                                                              !
                                              k
       2(n − 3)                                                                      √
                                                                         Z
                     X             κk Ak    X    (i) (i)             1
                M∞ =                     +2     ΩHk JHk           +           R(ξ, n) γ dn−1 x ,         (5.84)
        n−2          k=1
                                    4π      i=1
                                                                    4π    Σ
  where Σ is any asymptotically flat spacelike hypersurface, the inner boundary of which
  is a cross-section of the event horizon, i.e. some union of cross-sections of the connected
  components Hk , R is the Ricci tensor of g and n is the future-directed unit normal to Σ.
  If g obeys the Einstein equation (1.42) with Λ = 0, this formula can be rewritten as
                                       K                 Lk
                                                                      !
                   2(n − 3)           X      κk Ak      X     (i) (i)
                            M∞ =                    +2      ΩHk JHk
                    n−2               k=1
                                               4π       i=1
                                          Z                       
                                                            T ξ · n √ n−1
                                      +2        T (ξ, n) −              γ d x,          (5.85)
                                           Σ                n−2
Proof. Let us evaluate the Komar mass MHk via formula (5.45). If we denote the cross-section
of Hk by Sk and we substitute ξ by its expression arising from Eq. (5.81), we get16
                                                   Lk
                                                                !
                           n−2
                                  Z               X
           MHk = −                     ∇µ χν −        Ω(i) η(i)ν dS µν
                        16π(n − 3) Sk             i=1
                                  "Z                   Lk
                                                                                    #
                           n−2                        X         Z
                  = −                   ∇µ χν dS µν −      Ω(i)      ∇µ η(i)ν dS µν .
                        16π(n − 3) Sk
                                    |     {z      } i=1         | Sk    {z        }
                                                    −2κk Ak                        16πJH
                                                                                        (i)
                                                                                           k
                                                    (i)
The second surface integral being 16πJHk follows directly from Eq. (5.65). As for the first
integral, the value −2κk Ak is obtained by introducing a null vector k normal to Sk such that
  16
    Here we drop the index k on χ and the label Hk on Ω(i) and η(i) for clarity, given that the integral regards a
single connected component Hk .
164                                         Stationary black holes
      χ · k = −1. At each point p ∈ Sk , the pair (χ, k) is then a null basis of the 2-plane Tp⊥ Sk (cf.
      Fig. 2.10 where ℓ stands for χ) and we may rewrite the area element normal bivector dS αβ as17
                                                               √
                                     dS αβ = (χα k β − k α χβ ) q dn−2 x.                         (5.86)
      We have then
                                                          √                          √
                ∇µ χν dS µν = χµ ∇µ χν k ν − k µ χν ∇µ χν     q dn−2 x = 2 χµ ∇µ χν k ν q dn−2 x.
                                                    | {z }                 | {z }
                                                        −∇ν χµ                      κk χν
          For vacuum black holes of general relativity, T = 0, the null dominance condition is trivially
      fulfilled and the generalized Smarr formula (5.85) simplifies since the integral term disappears.
      Example 8 (black saturn): For the black saturn solution discussed in Remark 9, one has n = 5, K = 2,
      L1 = L2 = 1 and formula (5.83) reduces to
                                              3         3
                                    MH k =       κk Ak + ΩHk JHk ,          k ∈ {1, 2},                 (5.88)
                                             32π        2
      where k = 1 corresponds to the “planet” component of the event horizon and k = 2 corresponds to the
      “ring” component. Equation (5.88) coincides with the two formulas given in Eq. (3.43) of the discovery
      article [184], which were obtained from the explicit expressions of MHk , Ak , JHk , κk and ΩHk in terms
      of the solution parameters. For the same solution, the Smarr formula (5.85) with T = 0 is recovered by
      combining Eqs. (3.36) and (3.43) of Ref. [184].
          It is worth to specialize the generalized Smarr formula to a 4-dimensional spacetime and
      to a connected event horizon, by setting n = 4, K = 1 and L1 = 1 (cf. Property 5.26) in
      Eqs. (5.83), (5.84) and (5.85):
        17
                                                         √                  √
             This follows readily by setting n = (χ + k)/ 2 and s = (χ − k)/ 2 in formula (5.42).
                                 5.5 The generalized Smarr formula                                        165
  is part of a bifurcate Killing horizon, the zeroth law of black hole dynamics (Property 3.10
  or 3.16) leads to κ = const. Then the Komar mass MH over any cross-section of H [cf.
  Eq. (5.36)] obeys
                                              κ
                                     MH =       A + 2ΩH JH ,                            (5.89)
                                             4π
  where A is the area of H (cf. Property 3.1) and JH is the Komar angular momentum
  over any cross-section of H [cf. Eq. (5.63)]. Furthermore, the Komar mass at infinity (cf.
  Property 5.16) obeys
                                                                     √
                                                         Z
                                κ               1
                       M∞    =    A + 2ΩH JH +                R(ξ, n) γ d3 x ,                (5.90)
                               4π              4π         Σ
                                                                      √
                                             Z
                          κ
                 M∞ =       A + 2ΩH JH + (2T (ξ, n) − T ξ · n) γ d3 x,                  (5.91)
                         4π                   Σ
Example 9 (Schwarzschild black hole): The Schwarzschild black hole will be discussed in details in
Chap. 6; however, we have sufficiently studied its event horizon in various examples of the preceding
chapters to check that the Smarr formula holds for it. From Example 10 of Chap. 2, we have κ = 1/(4m)
[Eq. (2.29)], while from Example 3 of Chap. 3 we have A = 16πm2 [Eq. (3.4)]. Hence κA/(4π) = m.
Since we have seen in Example 5 of the current chapter that the parameter m is nothing but the Komar
mass at infinity M∞ , we get
                                                       κ
                                            M∞ =         A.                                      (5.92)
                                                      4π
This is nothing but the Smarr formula (5.91) for a static (ΩH = 0) black hole in vaccum (T = 0).
equation is fulfilled by (g, F ) (cf. Sec. 1.5.2). Then the pseudo-electric fielda E defined by
        is an exact 1-form:
                                                       E = −dΦ,                                              (5.94)
        where the scalar potential Φ is expressible in terms of any electromagnetic potential A
        that obeys the χ-symmetry (i.e. any 1-form A such that F = dA and Lχ A = 0) by
        We shall choose the additive constant so that Φ → 0 in the asymptotic flat end of (M , g).
        This determines Φ uniquely and we shall call it the Killing electric potential associated
                                                    →
                                                    −
        to χ . Furthermore, on H , the vector field E is collinear to the null normal χ:
                                          →
                                          − H              H
                                          E = µ0 σχ ⇐⇒ E α = µ0 σχα ,                                        (5.96)
        where the scalar field σ defined on H can be interpreted as the effective electric charge
        density of any cross-section S of H (Eq. (5.98) below). Finally, the Killing electric potential
        Φ is constant on H :
                                             H
                                           Φ = ΦH = const.                                       (5.97)
        The constant ΦH is called the electric potential of the Killing horizon H .
           a
             If χ were a unit timelike vector, E would be a genuine electric field: the one measured by the observer
        of 4-velocity χ.
                                                      1                   H
                                      Fµρ χρ F µσ χσ − Fµν F µν gρσ χρ χσ = 0,
                                      | {z } | {z } 4           | {z }
                                        Eµ      Eµ                       0
                                    5.5 The generalized Smarr formula                                               167
     →
     − → − H              →
                          −
i.e. E · E = 0. Thus E is null vector on H . Being tangent to H , it is then necessarily
collinear to the null normal to H , namely χ, hence there exists a scalar field σ on H such
                                                →
                                                −
that Eq. (5.96) holds. Finally, since on H , E is collinear to H ’s null normal χ, we have
→
−
E · v = 0 for any vector field v tangent to H . This identity can be rewritten as ⟨E, v⟩ = 0 or,
in view of Eq. (5.94), as ⟨dΦ, v⟩ = 0. It follows that the scalar field Φ is constant on H , hence
Eq. (5.97).
       The fact that σ is an effective18 surface charge density follows from:
  Under the same assumptions as in Property 5.29, the electric charge within any cross-section
  S of the Killing horizon H is
                                                                   √
                            Z              Z                  Z
                          1             1
                  QH =         ⋆F =            Fµν dS = µν
                                                                  σ q dn−2 x.            (5.98)
                         µ0 S          2µ0 S                    S
  The quantity QH is independent of the choice of the cross-section S and is called the
  electric charge of the Killing horizon H .
Proof. The first equality in Eq. (5.98) is the Gauss-law expression of the electric charge (see
e.g. Eq. (8.201) of Ref. [464] or Eq. (18.40) of Ref. [228]), while the second equality results from
Lemma 5.10. Now, using expression (5.86) for dS µν , we have
                                                √                               √
           Fµν dS µν = Fµν (χµ k ν − k µ χν ) q dn−2 x = (−Eν k ν − k µ Eµ ) q dn−2 x
                                          √                   √
                       = −2µ0 σ χµ k µ q dn−2 x = 2µ0 σ q dn−2 x,                              (5.99)
                                   | {z }
                                        −1
where we have used successively Eqs. (5.93) and (5.96). This establishes the last equality in
Eq. (5.98). To show that QH is independent of the choice of S , it suffices to consider the
part W of H that is delineated by two cross-sections S and S ′ , as in Fig. 2.9. W is then
a (n − 1)-dimensional manifold with boundary ∂W = S ∪ S ′ and we may apply Stokes’
theorem (A.94) on W to the (n − 2)-form ⋆F ; taking into account the outward orientation of
∂W involved in Stokes formula (A.94), we get
                              Z         Z         Z
                                  ⋆F −     ⋆F =       d⋆ F .
                               S′        S         W
                              | {z } | {z }
                                       µ0 Q′H      µ0 QH
Now, thanks to the second source-free Maxwell equation, d ⋆ F = 0 [Eq. (1.50) with j = 0], so
that the right-hand side of the above equation identically vanishes. Hence Q′H = QH .
       We are now in position to state:
  18
     σ is not a genuine surface charge density since there are no electrically charged particles on H , given the
electrovacuum hypothesis (vanishing of the electric n-current density j, cf. Sec. 1.5.2); see Damour’s work [150]
for interpreting σ as the component along χ of an effective surface current 4-vector on H .
168                                           Stationary black holes
                                        K                                                  L
                 2(n − 3)      X                κk Ak 2(n − 3)                               X
                          M∞ =                       +         ΦHk QHk                  +2         Ω(i) J∞
                                                                                                         (i)
                                                                                                             ,   (5.100)
                  n−2          k=1
                                                 4π    n−2                                   i=1
                                                                  (i)
        where M∞ is the Komar mass at infinity, J∞ is the Komar angular momentum at infinity
        with respect to the axisymmetric Killing vector η(i) , κk is the surface gravity of Hk , Ak is
        the area of Hk , ΦHk is the electric potential of Hk , as defined by Property 5.29, and QHk
        is the electric charge of Hk , as defined by Property 5.30.
           a
             Note that contrary to the more general setting of Property 5.27, all the connected components Hk
        are assumed to be Killing horizons with respect to the same Killing vector χ; in other words, the rotation
        velocities Ω(i) and axisymmetric Killing vectors η(i) are the same for all the Hk ’s.
      Proof. The electromagnetic field energy-momentum tensor (1.52) obeys the null dominant
      energy condition (3.46) [321], so that the surface gravities κk are constant over each Killing
      horizon Hk (cf. Property 3.10). All the hypotheses of Property 5.27 are thus fulfilled and the
      Smarr formula (5.84) holds. Thanks to the independence of Ω(i) and η(i) from Hk , we may
      permute the sums over k and i in it and write
                               K            L         K
                2(n − 3)                                                   √
                                                                  Z
                              X   κk Ak    X
                                                 (i)
                                                     X    (i)   1
                         M∞ =           +2     Ω         JHk +      R(ξ, n) γ dn−1 x,
                 n−2          k=1
                                   4π      i=1       k=1
                                                               4π Σ
                             K
                                                                                       √
                                                                        Z
                             X      (i)        (i)         (i)      1
                                   JHk    =   JSint   =   J∞     +          R(η(i) , n) γ dn−1 x.
                             k=1
                                                                   8π   Σ
      Hence
                                    K                      L
               2(n − 3)                                                                     √
                             X κk Ak                                            Z
                                        X                1
                        M∞ =         +2     Ω(i) J∞
                                                  (i)
                                                      +                              R(χ, n) γ dn−1 x,             (5.101)
                n−2          k=1
                                 4π     i=1
                                                        4π                       Σ
                               5.5 The generalized Smarr formula                                    169
where we have used the identity ξ + Li=1 Ω(i) η(i) = χ to collect all the integrals on Σ. Let us
                                      P
evaluate the integral by using the electrovacuum Einstein equation (1.54), which yields
                                  2nµ
                                                                         
                    1                                   1
                      R(χ, n) =               σ ν
                                         Fσµ F ν χ −                 ρσ
                                                              Fρσ F χµ .                 (5.102)
                   4π              µ0                2(n − 2)
We recognize the pseudo-electric field E σ = F σν χν [cf. Eq. (5.93)] in the first term in the
right-hand side; thanks to Eq. (5.94), we may re-express it in terms of the Killing electric
potential Φ:
    Fσµ F σν χν = Fσµ E σ = −Fσµ ∇σ Φ = −∇σ ΦF σµ = −∇σ ΦF σµ = ∇σ (ΦFµσ ),             (5.103)
                                                                   
where the last but one equality stems from the source-free Maxwell equation ∇σ F ασ = 0
[Eq. (1.48)]. As for the second term in the integrand (5.102), let us rewrite it in terms of some
electromagnetic potential 1-form A that obeys Lχ A = 0 and such that Φ = −⟨A, χ⟩ (i.e.
such that the constant in Eq. (5.95) is zero). Then Fρσ = ∇ρ Aσ − ∇σ Aρ , so that, using the
antisymmetry of F ρσ and again the source-free Maxwell equation ∇σ F σρ = 0,
              Fρσ F ρσ χµ = 2∇σ Aρ F σρ χµ = 2 [∇σ (Aρ F σρ χµ ) − Aρ F σρ ∇σ χµ ] .
Now, expressing the property Lχ F = 0 via formula (A.86), we have Lχ F µρ = χσ ∇σ F µρ −
F σρ ∇σ χµ − F µσ ∇σ χρ = 0, from which
Aρ F σρ ∇σ χµ = Aρ F σµ ∇σ χρ − Aρ χσ ∇σ F ρµ = Aρ F σµ ∇σ χρ − ∇σ (χσ Aρ F ρµ ) + F ρµ χσ ∇σ Aρ
              = −∇σ (χσ Aρ F ρµ ) + F ρµ (χσ ∇σ Aρ + Aσ ∇ρ χσ ) = −∇σ (Aρ F ρµ χσ ),
                                          |        {z        }
                                                      0
where we have used ∇σ χσ = 0, as implied by the Killing equation for χ, in the first line and
Lχ Aρ = 0 in the second line. Hence, we get
        Fρσ F ρσ χµ = 2∇σ Aρ F σρ χµ + Aρ F ρµ χσ = 2∇σ Aρ F ρµ χσ − Aρ F ρσ χµ .
                                                                                
Together with Eq. (5.103), this expression allows us to rewrite Eq. (5.102) as
      1           2nµ ν                                      1
                                   with Ωµν := ΦFµν +           χµ Aρ F ρν − χν Aρ F ρµ .
                                                                                       
        R(χ, n) =     ∇ Ωµν
     4π            µ0                                       n−2
Note that Ωµν defines a 2-form, since Ωµν = −Ωνµ . It follows then from Lemma 5.12 that
                            √ n−1                      µ√
               Z                            Z                             Z
            1                             2       ν          n−1        2
   I :=           R(χ, n) γ d x =              ∇ Ωµν n γ d x = −            ∇ν Ωµν dV µ
           4π Σ                           µ0 Σ                         µ0 Σ
                  Z                 Z
              1                                    
       = −             Ωµν dS µν −        Ωµν dS µν
             µ0 S∞                   Sint
                  |      {z     }
                          0
We have set the integral on S∞ to zero because F decays to zero sufficiently fast at the
asymptotically flat end of Σ (otherwise (M , g) could not be asymptotically flat). Since Sint =
  k=1 Sk , we may write
SK
                                           K Z
                                        1 X
                                  I=              Ωµν dS µν .
                                       µ0 k=1 Sk
170                                       Stationary black holes
      where we have used two expressions of the electric charge of Hk given by Eq. (5.98). In view
      of Eq. (5.101), the above value for I proves the Smarr formula (5.100).
Property 5.32: 4-dimensional Smarr formula for a connected charged black hole
        For n = 4 (which implies L ≤ 1) and for a connected black hole event horizon H (K = 1)
        of surface gravity κ, area A, angular velocity ΩH , electric charge Q and electric potential
        ΦH , the electrovacuum Smarr formula (5.100) simplifies to
                                                   κ
                                          M∞ =       A + 2ΩH J∞ + ΦH Q .                                (5.104)
                                                  4π
      Historical note : Formula (5.104) has been first derived in 1972 (published: 1973) by Larry Smarr [459] in
      the case of the Kerr-Newman black hole. Actually, Smarr did not obtain it from integral formulas for M∞
      and J∞ , as presented here; rather he used an explicit expression of M∞ in terms of A, J∞ and Q, which
      holds for the Kerr-Newman solution. After noticing that this expression is a homogeneous function
      of degree 1/2 of (A, J∞ , Q2 ), he applied Euler’s homogeneous function theorem to get Eq. (5.104).
      The derivation of Smarr formula from generic Komar-type integral formulas for mass and angular
      momentum, possibly with some non-vacuum exterior, is due to James Bardeen, Brandon Carter and
      Stephen Hawking in 1973 [40]. They obtained Eq. (5.91) [their Eq. (13)]. The general (i.e. not assuming
      the Kerr-Newmann metric) n = 4 electrovacuum Smarr formula (5.104) has been derived by Brandon
      Carter in 1973 [96] [his Eq. (9.29)]. Actually, the formula obtained by Carter is more general than
      Eq. (5.104) since it allows for electric sources (non-vanishing electric 4-current j) and matter in the black
                                          5.6 The no-hair theorem                                                 171
hole exterior (see also Carter’s review articles [99] [Eq. (6.323)] and [101] [Eq. (4.68)]). The generalization
of the Smarr formula to dimensions n ≥ 4 has been obtained by Robert Myers and Malcolm Perry in
1986 [376] for connected event horizons in vacuum: they obtained Eq. (5.85) for K = 1 and T = 0
[their Eq. (4.6), where N := n − 1]. The n ≥ 4 generalization of the electrovacuum Smarr formula
has been obtained by Rabin Banerjee, Bibhas Majhi, Sujoy Modak, and Saurav Samanta in 2010 [32],
but only for electrically charged Myers-Perry black holes, which are approximate solutions for small
rotation velocities and a single angular momentum parameter, i.e. they obtained Eq. (5.100) for K = 1
(connected horizon) and L = 1 (single angular momentum) [their Eq. (66)].
      The proof can be found in Ref. [219], which relies on the assumption that the connected
      components of H are non-degenerate Killing horizons. The fact that this assumption is not
      necessary for n = 4 has been proven in Ref. [127]. The proof in Ref. [219] relies on the positive
      mass theorem. An elementary proof specific to the case n = 4 can be found in Sec. 8.2 of
      Straumann’s textbook [464] and well as in Chap. 9 of Heusler’s textbook [275] and in Robinson’s
      articles [438] and [439] (appendix). See also Ref. [389] for a recent alternative proof.
      Remark 1: The fact that H is connected means that there does not exist any static solution in vacuum
      general relativity with “multiple black holes”19 . Intuitively, two or more black holes in vacuum would
      attract each other, making a static configuration impossible.
      Historical note : The uniqueness theorem 5.33 has been first proven in the case n = 4 by Werner Israel
      in 1967 [289] under the additional hypotheses
                                           √         that (i) the spacetime is analytic (cf. Remark 4 in Sec. A.2.1),
      (ii) the isosurfaces t = const, V := −ξ · ξ = const are regular and topologically 2-spheres in ⟨⟨M ⟩⟩
      (in particular, dV ̸= 0 in ⟨⟨M ⟩⟩), (iii) the event horizon H is connected, (iv) H is non-degenerate
      and (v) H ’s cross-sections are topologically 2-spheres. Property (v) actually follows from the topology
      theorem for n = 4 (Properties 5.4 and 5.5), which has been established 5 years later by Stephen Hawking
      [260]; it can therefore be removed from the hypotheses of the theorem. The hypothesis (ii) has been
      shown unnecessary by David Robinson in 1977 [438] (see also the work [373]). The requirement (iii) (H
      connected) has been removed as unnecessary by Gary Bunting and Abul Masood-ul-Alam in 1987 [77],
      while the requirement (iv) (H non-degenerate) has been removed by Piotr Chruściel, Harvey Reall and
      Paul Tod in 2006 [127]. Finally the analyticity hypothesis (i), which was implicit in Israel’s study, has
      been removed in 2010 by Piotr Chruściel and Gregory Galloway [121]. The generalization to dimensions
      n ≥ 4 has been achieved by Seungsu Hwang in 1998 [287] and by Gary Gibbons, Daisuke Ida and
      Tetsuya Shiromizu in 2002 [219]. The generalization (5.105) of Schwarzschild solution to dimensions
      n ≥ 4 has been found by Frank Tangherlini in 1963 [473].
        19
          The quotes indicate that even when H has various connected components, there is formally a single black
      hole region in spacetime, albeit not connected.
                                     5.6 The no-hair theorem                                       173
   For the static electrovacuum uniqueness theorem, we shall distinguish the dimension n = 4
from n ≥ 5. Starting by the former, we have:
 where M is the Komar mass at infinity (or ADM mass), Q is the black hole electric charge
 (cf. Property 5.30) and P is its magnetic monopole charge
                                                       q       (cf. Remark 2 below). The three
 constants M , Q and P must fulfill Q2 + P 2 ≤ 4π            M and the lower bound of r on
                                         p
                                                          µ0
 ⟨⟨M ⟩⟩ is rH = M + M 2 − µ0 /(4π)(Q2 + P 2 ).
                         p
      The Majumdar-Papapetrou black hole is defined by (i) the event horizon being
 disconnected, with K ≥ 2 connected components (Hk )1≤k≤K , which are all degenerate
 Killing horizons, and (ii) the existence of a coordinate system (t, x, y, z) on ⟨⟨M ⟩⟩ such
 that
                                                               r
                                                                  µ0 −2
                −2   2     2     2
       g = −U dt + U (dx + dy + dz ),   2      2
                                                      F =±           U dU ∧ dt,       (5.108a)
                                                                  4π
                   K
                 X                       µk
       U := 1 +        p                                     .                        (5.108b)
                        (x  − x   )2 + (y − y )2 + (z − z )2
                  k=1           k            k           k
      of a black hole. If it is not fulfilled, Eqs. (5.107) still define a valid solution to the electrovacuum Einstein
      equation, but it corresponds to a naked singularity, not to a black hole.
      Remark 4: The Majumdar-Papapetrou solution (5.108) with K = 1 (connected event horizon) is still
      a valid static solution to the electrovacuum Einstein equation. It is actually a member of the first
      family invoked in the uniqueness theorem, namely the Reissner-Nordström one. More            pprecisely, it is
      an extremal Reissner-Nordström spacetime, i.e. a solution (5.107) for which |Q| = 4π/µ0 M and
      P = 0, or equivalently a Reissner-Nordström solution with P = 0 and a degenerate event horizon.
      To see this, it suffices to choose (x1 , y1 , z1 ) = (0, 0, 0), to move from coordinates (x, y, z) to spherical
      coordinates (r̄, θ, φ) via the standard formulas (in particular r̄2 = x2 + y 2 + z 2 ) and to introduce a new
      radial coordinate r := r̄ + µ1 . Then U = r/(r − µ1 ) and the Majumdar-Papapetrou             solution (5.108)
      coincides with the Reissner-Nordström solution (5.107) with M = µ1 , Q = ± 4π/µ0 µ1 and P = 0.
                                                                                          p
     • all the connected components of H are non-degenerate (i.e. have non-zero surface
       gravity),
                           16πM                                   8πµ0 Q2
                   µ=                      and       q2 =                       ,           (5.112)
                        (n − 2)Ωn−2                         (n − 2)(n − 3)Ω2n−2
  where Ωn−2 is the area of Sn−2 [Eqs. (5.19)-(5.20)]. The constants µ and q must fulfill the con-
                                                                                       1/(n−3)
  straint µ > 2q and the lower bound of r on ⟨⟨M ⟩⟩ is rH = µ/2 + µ2 /4 − q 2                    .
                                                                          p
For a proof, see Ref. [218] under the additional assumption that the magnetic field vanishes (i.e.
that ξ · ⋆F = 0) and Ref. [328] for relaxing this assumption.
Example 10: For n = 5, Ωn−2 = Ω3 = 2π 2 [Eq. (5.20)] and Eqs. (5.111)-(5.112) become
                                         −1
                   q2                 q2
                            
              µ           2         µ
                                             dr2 + r2 dθ12 + sin2 θ1 (dθ22 + sin2 θ2 dφ2 )
                                                                                          
   g = − 1 − 2 + 4 dt + 1 − 2 + 4
              r    r                r r
         µ0 Q
   F = − 2 3 dt ∧ dr
        2π r
       8M                   µ0 Q2
   µ=          and     q2 =       .
       3π                    3π 3
Remark 5: A static and asymptotically flat black hole spacetime cannot carry a nonzero magnetic
monopole P for n ≥ 5 [185]. Hence Property 5.35 is not a direct generalization of Property 5.34 to
n > 4, even if H is assumed to be made of non-degenerate components.
      1918 [385]. The generalization to P ̸= 0 can be found in Brandon Carter’s lecture at Les Houches (1972)
      [95]. The generalization to dimensions n > 4 with P = 0, i.e. the solution (5.111), has been obtained by
      Frank Tangherlini in 1963 [473]. The Majumdar-Papapetrou solution (5.108) has been found in 1947
      independently by Sudhansu Datta Majumdar [353] and Achilles Papapetrou [399], as a solution to the
      electrovacuum Einstein equation describing K charged point masses in gravito-electrostatic equilibrium.
      The extremal Reissner-Nordström solution seems to have been discussed in details first by Papapetrou
      in his 1947 article [399]. It was actually the starting point that lead him to the Majumdar-Papapetrou
      solution (cf. Remark 4 above). The black hole character of the Majumdar-Papapetrou solution has been
      truly recognized and analyzed by James Hartle and Stephen Hawking in 1972 [258]. The generalization
      of the Majumdar-Papapetrou solution to spacetime dimensions n > 4 is due to Robert Myers in 1987
      [375].
      Historical note (uniqueness theorems for static electrovacuum black holes): The Reissner-
      Nordström part (with P = 0) of the 4-dimensional uniqueness theorem 5.34 has been first proven by
      Werner Israel in 1968 [290], under the same additional hypotheses as the ones used in his proof of
      the vacuum theorem (cf. historical note on p. 172), notably that the event horizon H is connected,
      has the topology R × S2 and is non-degenerate. The connectedness hypothesis has been removed by
      Abul Masood-ul-Alam in 1992 [358]. The allowance for a magnetic charge (P ̸= 0) has been achieved
      by Markus Heusler in 1994 [274] (see also Sec. 9.4 of Ref. [275]). Three years later, Heusler obtained
      the Majumdar-Papapetrou part of the theorem by assuming that all the connected components of
      H are degenerate [276]. This assumption has been relaxed by Piotr Chruściel and Paul Tod in 2007
      [128], who could exclude configurations with both degenerate and non-degenerate components of H .
      The higher-dimensional uniqueness theorem 5.35 has been established by Gary Gibbons, Daisuke Ida
      and Tetsuya Shiromizu in 2002 [218] with the extra assumption of a vanishing magnetic field. This
      assumption has been relaxed by Hari Kunduri and James Lucietti in 2018 [328].
Property 5.36: Carter-Robinson theorem (Carter 1971 [93], Robinson 1975 [437])
            • H is connected,
                                              5.6 The no-hair theorem                                          177
• there is no closed causal curve in the domain of outer communications ⟨⟨M ⟩⟩,
  then ⟨⟨M ⟩⟩ is isometric to the domain of outer communications of the Kerr spacetimea .
     a
         We shall not define the Kerr spacetime here; this will be done in Chap. 10.
Remark 6: In their original works, Carter and Robinson assumed that H is a non-degenerate Killing
horizon (i.e. has non-zero surface gravity). However, this hypothesis can be relaxed [125] (see [123] for
an extended discussion).
   The proof of Property 5.36 can be found in Chap. 10 of Heusler’s textbook [275] (see also
Sec. 5 of Carter’s review article [101]) with the additional assumption that the horizon is
non-degenerate; see Ref. [125] for the degenerate case.
Remark 7: The causality condition (absence of closed causal (i.e. null of timelike) curves in the black
hole exterior), which is one of the assumptions of Carter-Robinson’s theorem (cf. [103] for a discussion),
does not appear in Israel’s theorem (Property 5.33) because a static spacetime, which by definition has
hypersurface-orthogonal timelike curves, cannot contain any closed causal curve.
    The electrovacuum version of the Carter-Robinson theorem is
Property 5.37: Bunting-Mazur theorem (Bunting 1983 [76], Mazur 1982 [361])
• H is connected,
• there is no closed causal curve in the domain of outer communications ⟨⟨M ⟩⟩,
The proof of Property 5.37 with the additional assumption that the horizon is non-degenerate
can be found in Chap. 10 of Heusler’s textbook [275] as well as in Carter’s articles [100, 101]
and Mazur’s review article [364]; the degenerate case is treated in Ref. [125].
Remark 8: Some of the Majumdar-Papapetrou solutions (5.108) are axisymmetric (those for which
all the points (xk , yk , zk ) are aligned). However, they are excluded from the conclusion of the above
theorem because they do not fulfill the hypothesis of H being connected.
178                                      Stationary black holes
      Historical note : The first stationary-axisymmetric uniqueness result has been obtained by Brandon
      Carter in 1971 [93]. Under the assumptions of a non-degenerate horizon with spherical cross-sections,
      Carter reduced the 4-dimensional stationary-axisymmetric vacuum Einstein equation to a 2-dimensional
      non-linear elliptic system of two partial differential equations, with boundary conditions depending
      on only two parameters: the mass M and c := κA/(4π), where κ is the surface gravity and A is the
      horizon area (κ ̸= 0 since the horizon was assumed non-degenerate). Then Carter could show that
      the solutions of this system form disjoint 2-parameter families and that within a given family, the
      members are fully specified by the pair (M, c). The Kerr family is such a family and is the only one
      containing the Schwarzschild solution, but Carter could not exclude that there exist other families. This
      step has been achieved by David Robinson in 1975 [437], who, via a tour de force [103], showed that
      the solution has to belong to the Kerr family. The electrovacuum case turned out to be too complicated
      to be dealt with the same techniques; only a result similar to Carter’s one (i.e. reduction to disjoint
      4-parameter families of solutions) could be achieved by Robinson in 1974 [436]. New techniques have
      been introduced independently by Paweł Mazur in 1982 [361] and Gary Bunting in 1983 [76], which
      enabled them to get the uniqueness theorem 5.37 under the non-degeneracy assumption. The case of
      a degenerate horizon has been dealt with only in 2010, when Piotr Chruściel and Luc Nguyen [125]
      showed that electrovacuum rotating black holes with a degenerate horizon necessarily belong to the
      Kerr-Newman family (they are the so-called extremal members of that family). For further details, see
      the historical accounts [103, 364, 439] by the actors themselves. As for the Kerr-Newman solution itself,
      it has been discovered by Ezra Newman and his collaborators in 1965 [380], two years after Roy Kerr
      found his famous solution [312]. A historical note about the discovery of the Kerr metric is to be found
      in Chap. 10, p. 326.
• is 4-dimensional,
• is analytic,
• is asymptotically flat,
• is stationary,
            • fulfills the electrovacuum Einstein equation (1.54) (with the vacuum Einstein equation
              (1.44) considered as a subcase),
            • contains a black hole with a connected event horizon that is either (i) rotatinga or (ii)
              non-rotating and non-degenerate,
            • does not contain any closed causal curve in the domain of outer communications
              ⟨⟨M ⟩⟩.
                                                 5.6 The no-hair theorem                                               179
• J = 0: Reissner-Nordström spacetime;
• Q = 0, P = 0: Kerr spacetime;
• Q = 0, P = 0, J = 0: Schwarzschild spacetime.
  In the last two cases, the electromagnetic field vanishes and g fulfills the vacuum Einstein
  equation (1.44).
        a
            Let us recall that the concepts of rotating and non-rotating horizons have been defined in Property 5.2.
Proof. Let H be the black hole event horizon. In case (i) (H rotating), we may invoke
the strong rigidity theorem 5.25 to get that (M , g) is axisymmetric, in addition to being
stationary. Then the Bunting-Mazur theorem 5.37 leads to ⟨⟨M ⟩⟩ being isometric to the
domain of outer communications of a Kerr-Newman spacetime. In case (ii) (H is non-rotating
and non-degenerate), we may invoke the staticity theorem 5.24 to assert that (M , g) is static.
Then the generalized Israel theorem 5.34 leads to ⟨⟨M ⟩⟩ being isometric to the domain of
outer communications of a Reissner-Nordström spacetime (the Majumdar-Papapetrou case in
theorem 5.34 is excluded since H is assumed non-degenerate). Since the Reissner-Nordström
family is a subfamily of the Kerr-Newman one, this completes the proof.
Remark 9: A stationary black hole spacetime with a non-rotating degenerate event horizon and not
belonging to the Kerr-Newman family is not excluded by the above no-hair theorem. Such a spacetime
cannot be static, otherwise the generalized Israel theorem 5.34 would imply that it belongs to the
Reissner-Nordström subfamily. The existence of such a black hole is not excluded by the staticity
theorem 5.24 due to the degeneracy hypothesis and is an open question.
Remark 10: The no-hair theorem does not hold for dimensions n > 4, even if one restricts oneself to
the vacuum Einstein equation. Indeed, for n > 4, the Kerr solution is generalized to the Myers-Perry
solution discussed in Sec. 5.2.3, but there exist other families of black holes solutions. For n = 5, some
of these families are the black rings (cf. Sec. 5.2.3) and the black saturns (cf. Remark 9 on p. 161).
    From an astrophysical perspective, the weaknesses of the no-hair theorem are the hy-
potheses of analyticity and of absence of any closed causal curve in the domain of outer
communications. The analyticity hypothesis is inherited from the strong rigidity theorem
(Property 5.25). In physics, one usually assumes that the fields are smooth, not necessarily ana-
lytic. As discussed in Sec. 5.4.2, the only successful attempt to date to get rid of the analyticity
requirement regards slowly rotating black holes [11]. As for the hypothesis of non-existence
of closed causal curves in ⟨⟨M ⟩⟩, which holds if ⟨⟨M ⟩⟩ is assumed globally hyperbolic (cf.
p. 127), it would be much more satisfactory if this would be a consequence20 of the theorem
and not one of its premises. In the current state, we cannot exclude that there is somewhere
  20
       The Kerr-Newmann spacetime has a globally hyperbolic domain of outer communications.
180                                      Stationary black holes
      in our Universe a stationary black hole that is distinct from a Kerr-Newman one and around
      which one can travel backward in time...
      Historical note : The first hints towards the “hairlessness” of black holes arised in 1964-65 from studies
      by Vitaly Ginzburg [222] and by Andrei Doroshkevich, Yakov Zeldovich and Igor Novikov [172] (cf.
      Chap. 7 of Thorne’s book [479] for some historical details). Ginzburg showed that the gravitational
      collapse of a (electrically neutral) magnetized star gives birth to an unmagnetized black hole, while
      Doroshkevich, Zeldovich and Novikov showed that the collapse of a non-rotating body slightly departing
      from spherical symmetry leads to a perfectly spherical (i.e. Schwarzschild) black hole; the deformations
      away from spherical symmetry (the “hairs”) in the external metric are decaying to zero as the black hole
      forms. These authors have also shown in the same article [172] that any static quadrupolar deformation
      of the Schwarzschild metric leads to a curvature singularity at the event horizon and hence cannot
      provide a regular black hole solution to the vacuum Einstein equation. This result is quoted as a
      motivation for proving the uniqueness of the Schwarzschild solution by Werner Israel in the famous
      1967 article [289] presenting the theorem that bears his name (Property 5.33). In the same article,
      Israel raised the question about the uniqueness of the Kerr solution. In 1968, Brandon Carter [90]
      conjectured that the Kerr-Newman family — which contains the Schwarzschild one — may represent all
      stationary black holes. This conjecture became known as the Carter-Israel conjecture [439, 266]. It has
      been popularized by the phrase “a black hole has no hair” by John Wheeler in a review article published
      in 1971 [443], as well as in the famous 1973 MTW textbook [371] (Box 33.1). In a 1996 interview,
      Wheeler [519] stated that Jacob Bekenstein (then his PhD student) coined that phrase. The conjecture
      made most of its way in becoming a theorem in 1972 when Stephen Hawking [260] established the
      first version of the strong rigidity theorem (Property 5.25). Indeed the latter basically states that all
      stationary black holes are necessarily axisymmetric and a year before, Brandon Carter [93] had shown
      that all axisymmetric stationary and 4-dimensional black holes belong either to the Kerr family or to a
      disconnected 2-parameter family, the latter possibility being eventually excluded by David Robinson in
      1975 [437] (cf. the historical note on p. 178). The no-hair theorem has been strengthened in the following
      years by making explicit and/or relaxing certain hypotheses (cf. the historical notes on p. 172, 176 and
      178). A critical review of the theorem as of 1994, distinguishing the folklore around it from what has
      been mathematically established, has been performed by Piotr Chruściel [115]. Further details about the
      theorem history can be found in the accounts by Carter [103] and Robinson [439].
      5.6.4     To go further
      See Heusler’s textbook (1996) [275] and the review articles by Chruściel, Lopes Costa & Heusler
      (2012) [123], Hollands & Ishibashi (2012) [280] and Ionescu & Klainerman (2015) [288]. The
      no-hair theorem has been extended from general relativity to scalar-tensor theories, under
      some assumptions (among which the scalar field obeys the same symmetries as the spacetime
      ones), see the article by Capuano, Santoni & Barausse (2023) [82] for details.
        Part II
Contents
        6.1   Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183
        6.2 The Schwarzschild-(anti-)de Sitter solution . . . . . . . . . . . . . . . 183
        6.3   Radial null geodesics and Eddington-Finkelstein coordinates . . . . 189
        6.4   Black hole character . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 201
6.1     Introduction
After having discussed stationary black holes in Chap. 5, we examine here the simplest of them:
the Schwarzschild black hole. Let us recall that the prime importance of this object in general
relativity stems from Israel uniqueness theorem (Property 5.33, Sec. 5.6), which states that any
static black hole in an asymptotically flat 4-dimensional vacuum spacetime ruled by general
relativity must be a Schwarzschild black hole.
    In this chapter, we derive the Schwarzschild metric as a solution to the Einstein equation,
possibly with a non-vanishing cosmological constant Λ (Sec. 6.2); we then focus on the case
Λ = 0 (the true Schwarzschild solution) and explore it by means of the Eddington-Finkelstein
coordinates, which have the advantage to be regular on the horizon (Sec. 6.3). Finally, in
Sec. 6.4, we check formally that the Schwarzschild spacetime has a region that obeys the general
definition of a black hole given in Sec. 4.4.2. The maximal extension of the Schwarzschild
spacetime and its bifurcate Killing horizon is discussed Chap. 9, after two chapters (Chap. 7
and 8) devoted to the timelike and null geodesics in Schwarzschild spacetime.
ξ(t) = 1 (6.3)
      The positive coefficient r2 in front of the standard spherical element must be constant over
      S , by virtue of spherical symmetry. The area of S is then A = 4πr2 . For this reason, r is
      called the areal radius of S . Letting S vary, r can be considered as a scalar field on M . If
      dr ̸= 0, we may use it as a coordinate1 . Since S ⊂ Σt , (r, θ, φ) is a coordinate system on each
      hypersurface Σt . The set (t, r, θ, φ), where t is adapted to ξ thanks to (6.3), is then a spacetime
      coordinate system and, by construction, the expression of the metric tensor with respect to
      this system is
                           g = −A(r) dt2 + B(r) dr2 + r2 dθ2 + sin2 θ dφ2 .                           (6.5)
                                                                                  
Note that this is a special case of the general static metric element (5.5) and that Eq. (5.4) holds:
                                                         ξ = ∂t .                                                (6.6)
         1
         An example of spherically symmetric spacetime with dr = 0 somewhere is the maximally extended
      Schwarzschild spacetime, to be studied in Chap. 9: dr = 0 on the so-called bifurcation sphere S (cf. Sec. 9.3.3).
      Hence r cannot be used as a coordinate in the vicinity of S ; one shall use Kruskal-Szekeres coordinates instead.
                            6.2 The Schwarzschild-(anti-)de Sitter solution                                   185
In particular, gtt = −A(r) and grr = B(r) do not depend on t as a result of the spacetime
stationarity, while gtr = gtθ = gtφ = 0 expresses the orthogonality of ξ and Σt , i.e. the
spacetime staticity. The coordinates (t, r, θ, φ) are called areal coordinates, reflecting the fact
that r is the areal radius.
which is exactly the solution corresponding to C = 1. Hence from now on, we set C = 1, i.e.
                                                                1
                                                     B(r) =         .                                          (6.11)
                                                               A(r)
Substituting this expression in Eq. (6.9) yields an ordinary differential equation for A(r):
                                                  dA
                                              r      + A − 1 + Λr2 = 0,
                                                  dr
                                                              −1
                          2m Λ 2          2          2m Λ 2
                                                                     dr2 + r2 dθ2 + sin2 θ dφ2 .
                                                                                                
            g =− 1−            − r dt + 1 −               − r
                           r     3                    r      3
                                                                                                 (6.13)
      It is called the Kottler metric (cf. the historical note below). The Schwarzschild metric is
      the particular case Λ = 0. If Λ > 0, (6.13) is called the Schwarzschild-de Sitter metric, often
      abridged as Schwarzschild-dS metric, while if Λ < 0, it is called the Schwarzschild-anti-de
      Sitter metric, often abridged as Schwarzschild-AdS metric.
           In the rest of this chapter, we will focus on the Schwarzschild metric, i.e. on the version
      Λ = 0 of Eq. (6.13):
                                          −1
                            2m            2m
                                   2
                                                 dr2 + r2 dθ2 + sin2 θ dφ2 .                                   (6.14)
                                                                          
                    g =− 1−      dt + 1 −
                             r             r
      The areal coordinates (t, r, θ, φ) are then called the Schwarzschild-Droste coordinates2 .
         Since A(r) = 1 − 2m/r and B(r) = (1 − 2m/r)−1 for the Schwarzschild metric, the
      non-vanishing Christoffel symbols (6.7) become3
                                    m                        m(r − 2m)                         m
              Γt tr = Γt rt =                      Γr tt =                     Γr rr = −
                                r(r − 2m)                        r3                       r(r − 2m)
                                                                                         1
              Γr θθ = 2m − r         Γr φφ = (2m − r) sin2 θ             Γθ rθ = Γθ θr =                       (6.15)
                                                                                         r
                                                               1                           1
              Γθ φφ = − sin θ cos θ         Γφrφ = Γφφr =               Γφθφ = Γφφθ =          .
                                                               r                         tan θ
         2
           In the literature they are often referred to as simply Schwarzschild coordinates; we follow here Deruelle &
      Uzan [162, 163].
         3
           See also the notebook D.4.3 for a check.
                           6.2 The Schwarzschild-(anti-)de Sitter solution                                   187
where Φ(r) := −m/r is the Newtonian gravitational potential outside a spherically symmetric
body of mass m.
    Another argument for identifying m with a mass is provided by Example 3 of Sec. 5.3.3,
where it has been shown that m is nothing but the Komar mass associated with the stationarity
of Schwarzschild spacetime.
Historical note : The Schwarzschild metric (6.14) is actually the first non-trivial (i.e. different from
Minkowski metric) exact solution to the Einstein equation ever found. It has been obtained by the
astrophysicist Karl Schwarzschild in the end of 1915 [449], only a few weeks after the publication of
the articles funding general relativity by Albert Einstein. It is also quite remarkable that Schwarzschild
found the solution while serving in the German army at the Russian front. Unfortunately, he died from
a rare skin disease a few months later. The way Schwarzschild proceeded was quite different from that
exposed above: instead of the coordinates (t, r, θ, φ) named today after him, he used the coordinates
(t, x1 , x2 , φ) where x1 = r∗3 /3, with r∗3 = r3 − 8m3 , and x2 = − cos θ. Such a choice was made to
enforce det(gαβ ) = −1, a condition prescribed by Einstein in an early version of general relativity,
which had been presented on 18 November 1915 and on which Schwarzschild was working. Only in
the final version, published on 25 November 1915, did Einstein relax the condition det(gαβ ) = −1,
allowing for full covariance. Schwarzschild however exhibited the famous metric (6.14), via what he
called the “auxiliary quantity” r = (r∗3 + 8m3 )1/3 . For him, the “center”, namely the location of the
“point mass” generating the field, was at r∗ = 0, i.e. at r = 2m. Independently of Schwarzschild,
Johannes Droste, then PhD student of Hendrik Lorentz, arrived at the solution (6.14) in May 1916 [175].
Contrary to Schwarzschild, Droste performed the computation with a spherical coordinate system,
(t, r̄, θ, φ), yet distinct from the standard “Schwarzschild-Droste” coordinates (t, r, θ, φ) by the fact
that the radial coordinate r̄ was not chosen to be the areal radius, but instead a coordinate for which
gr̄r̄ = 1. At the end, by a change of variable, Droste exhibited the metric (6.14). The generalization to
a non-vanishing cosmological constant, i.e. Eq. (6.13), has been obtained by Friedrich Kottler in 1918
[322] and, independently, by Hermann Weyl in 1919 [518]. We refer to Eisenstaedt’s article [180] for a
detailed account of the early history of the Schwarzschild solution.
      with the coordinate t spanning R, the coordinate r spanning (2m, +∞) on MI and (0, 2m) on
      MII , and the coordinates (θ, φ) constituting a standard spherical chart of S2 . Note that MSD
      is a disconnected open subset of the full spacetime manifold M (to be specified later), whose
      connected components are MI and MII .
      Remark 1: To cover entirely S2 in a regular way, one needs a second chart, in addition to (θ, φ); this is
      related to the standard singularities of spherical coordinates at θ = 0 and θ = π. It is fully understood
      that the metric g, as expressed by (6.14), is fully regular on S2 . The fact that det(gαβ ) = −r2 sin2 θ is
      zero at θ = 0 and θ = π reflects merely the coordinate singularity of the (θ, φ) chart there. We shall
      not discuss this coordinate singularity any further.
        Region (MI , g) is strictly static, in the sense defined in Sec. 5.2.1: the Killing vector ξ = ∂t
        is timelike on all MI and is orthogonal to the hypersurfaces t = const. Moreover, (MI , g)
        is asymptotically flat: the metric g tends to Minkowski metric when r → +∞.
      Proof. From expression (6.14), we see that gtt < 0 for r > 2m. Since gtt = g(∂t , ∂t ), this
      implies that ∂t is timelike. Moreover ∂t is orthogonal to any hypersurface Σt defined by a
      constant value of t, since we read on (6.14) that g(∂t , ∂i ) = gti = 0 for i ∈ {1, 2, 3} and by
      definition, the vectors (∂i ) = (∂r , ∂θ , ∂φ ) form a basis of the tangent planes to Σt . Finally, for
      r → +∞, all the metric components given by (6.14) tend to those of the Minkowksi metric
      (4.3) [see also Eq. (6.16)].
        In region MII , the Killing vector ξ = ∂t is spacelike. It follows that (MII , g) is not static: the
        translation group (R, +) is still an isometry group of (MII , g), but its orbits are spacelike
        curves (cf. Sec. 5.2.1). Besides, in MII , t is a spacelike coordinate (i.e. the hypersurfaces
        t = const are timelike, cf. Sec. A.3.2) and r is a timelike one (i.e. the hypersurfaces
        r = const are spacelike).
               6.3 Radial null geodesics and Eddington-Finkelstein coordinates                            189
Proof. We see from (6.14) that gtt > 0 for r < 2m, which implies that ∂t is spacelike there.
Moreover, we have g tt > 0 and g rr < 0 in MII , which, according to the criterion (A.56), implies
that t is a spacelike coordinate and r a timelike one.
Remark 2: In the region MII , grr < 0, so that the metric g keeps a Lorentzian signature, as it should!
Remark 3: As a consequence of Property 6.2 and of the diagonal character of the metric components
(6.14), the axes of the light cones are horizontal lines for r < 2m in Fig. 6.1 below.
                                                                dr2
                                 ds2 = 0 ⇐⇒ dt2 =                    .                          (6.18)
                                                                 2m 2
                                                           1−     r
                                                                  r
                               L(u,θ,φ)
                                 out
                                        :    t = r + 2m ln          −1 +u,                       (6.21)
                                                                 2m
                                                                   r
                              L(v,θ,φ)
                                in
                                       :     t = −r − 2m ln          −1 +v ,                     (6.22)
                                                                  2m
                               t/m
                                4
                                        1      2      3       4      5       6      7      8
                                                                                               r/m
-1
-2
-3
-4
      Figure 6.1:       Radial null geodesics of Schwarzschild spacetime, plotted in terms of Schwarzschild-Droste
      coordinates (t, r): the solid (resp. dashed) lines correspond to outgoing (resp. ingoing) geodesics L(u,θ,φ)
                                                                                                                 out
                                                                                                                       (resp.
      L(v,θ,φ) ), as given by Eq. (6.21) (resp. Eq. (6.22)). The interiors of some future light cones are depicted in yellow.
        in
      Note that along a given outgoing radial null geodesic, (u, θ, φ) are constant and that two
      outgoing radial null geodesics that have distinct (u, θ, φ) are distinct. We can thus use the
      triplet (u, θ, φ) to label the outgoing radial null geodesics, leading to the notation L(u,θ,φ)
                                                                                               out
                                                                                                      .
      Moreover, the family (L(u,θ,φ) ), with u ∈ R, θ ∈ [0, π] and φ ∈ [0, 2π), forms a congruence:
                                  out
      there is one, and only one, such curve through every point of MSD . Similar considerations
      apply to the ingoing radial null geodesics L(v,θ,φ)
                                                    in
                                                          .
          By introducing the tortoise coordinate
                                                                       r
                                                r∗ := r + 2m ln          −1 ,                                        (6.23)
                                                                      2m
      one may rewrite Eqs. (6.21)-(6.22) as respectively
                                               L(u,θ,φ)
                                                 out
                                                        :         t = r∗ + u                                         (6.24)
                                               L(v,θ,φ)
                                                 in
                                                        :         t = −r∗ + v.                                       (6.25)
and (6.22) obey the geodesic equation [Eq. (B.10) in Appendix B]:
                                         d2 xα           µ
                                                   α dx dx
                                                            ν
                                               + Γ  µν        = 0,                                      (6.26)
                                         dλ2           dλ dλ
where λ is an affine parameter (cf. Sec. B.2.1). Let us check that (6.26) is satisfied by choosing
λ = r. For the curves defined by (6.21), we have
                                                 r                    
                       (xα (r)) = r + 2m ln          − 1 + u, r, θ, φ .
                                                 2m
Hence
         dxα                                                  d2 xα
                                                                                            
                              r                                                    2m
                   =              , 1, 0, 0     and                       = −             , 0, 0, 0 .
          dr               r − 2m                              dr2              (r − 2m)2
Given the Christoffel symbols (6.15), it is then a simple exercise to show that Eq. (6.26) is
satisfied. The same property holds for the family (6.22). Hence we conclude
 The radial null geodesics in the Schwarzschild-Droste domain form two congruences,
    out
 (L(u,θ,φ) ) and (L(v,θ,φ)
                    in
                           ), obeying Eqs. (6.21)-(6.22). Moreover, the areal radius r is an affine
 parameter along them.
    The two congruences of radial null geodesics are depicted in Fig. 6.1. The singularity of
Schwarzschild-Droste coordinates at the Schwarzschild radius r = 2m appears clearly on this
figure.
Remark 1: Despite their name, geodesics of the outgoing family L(u,θ,φ)out  are actually ingoing in the
region r < 2m, in the sense that r is decreasing along them when moving towards the future. Indeed,
as noticed in Sec. 6.2.5, for r < 2m, ∂r is a timelike vector and we shall see in Sec. 6.3.6 that −∂r is
oriented towards the future (cf. the “tilted” light cone in Fig. 6.1).
                                                          dr
                                           dt = dv −            ,                                       (6.28)
                                                       1 − 2m/r
192                                   Schwarzschild black hole
      Substituting this expression for dt2 in Eq. (6.14) yields the metric components with respect to
      the coordinates (xα̂ ) := (v, r, θ, φ):
                                   
                                 2m
                                      dv 2 + 2 dv dr + r2 dθ2 + sin2 θ dφ2 .                           (6.29)
                                                                          
                         g =− 1−
                                  r
      Remark 2: Since dv dr := 1/2 (dv ⊗ dr + dr ⊗ dv) [cf. Eq. (A.39)], the metric component gvr is one
      half of the coefficient of the dv dr term in Eq. (6.29), i.e. gvr = 1.
          The coordinates (xα̂ ) = (v, r, θ, φ) are called the null ingoing Eddington-Finkelstein
      (NIEF) coordinates. The qualifier null stems from the fact that v is a null coordinate, i.e. the
      level sets v = const are null hypersurfaces (cf. Sec. A.3.2). This can be seen from g vv = 0 [cf.
      Eq. (A.56)].
          To deal with a timelike coordinate instead of a null one, let us set
t̃ := v − r ⇐⇒ v = t̃ + r (6.30)
      Remark 3: From (6.30), v appears as the “time” t̃ “advanced” by r, while from (6.25), v is the “time” t
      “advanced” by r∗ .
         The relation between the ingoing Eddington-Finkelstein coordinates (t̃, r, θ, φ) and the
      Schwarzschild-Droste ones (t, r, θ, φ) is obtained by combining Eqs. (6.27) and (6.30):
                                                                   r
                                               t̃ = t + 2m ln        −1 .                              (6.32)
                                                                  2m
      The hypersurfaces t = const are plotted in Fig. 6.2, in terms of the IEF coordinates.
         From (6.30), we have dv = dt̃ + dr. Substituting into (6.29) yields
                                             
                  2m         4m              2m
                          2
                                                  dr2 + r2 dθ2 + sin2 θ dφ2 .                          (6.33)
                                                                           
          g =− 1−      dt̃ +    dt̃ dr + 1 +
                   r          r               r
         To avoid any ambiguity, we shall denote by ∂r̃ the coordinate vector of the IEF frame and
      by ∂r the coordinate vector of the Schwarzschild-Droste frame:
                                          ∂                                    ∂
                                 ∂r̃ :=                     and        ∂r :=                .          (6.34)
                                          ∂r    t̃,θ,φ                         ∂r   t,θ,φ
                   6.3 Radial null geodesics and Eddington-Finkelstein coordinates                                                 193
                            t̃/m
                             4
                                    1     2        3       4        5           6       7       8
                                                                                                    r/m
-1
-2
-3
-4
Figure 6.2: Hypersurfaces of constant Schwarzschild-Droste coordinate t, drawn in terms of the ingoing
Eddington-Finkelstein coordinates (t̃, r). Since the dimensions along θ and φ are not represented, these 3-
dimensional surfaces appear as curves.
The relation between the two vectors is given by the chain rule:
     ∂                 ∂           ∂t         ∂          ∂r          ∂                    ∂θ           ∂         ∂φ
                   =                        +                      +                                +                      ,
     ∂r   t̃,θ,φ       ∂t   r,θ,φ ∂r t̃,θ,φ   ∂r   t,θ,φ ∂r t̃,θ,φ   ∂θ             t,r,φ ∂r t̃,θ,φ   ∂φ   t,r,θ ∂r t̃,θ,φ
                                  | {z }                 | {z }                           | {z }                | {z }
                                      r  −1                    1                            0                       0
                                 (1− 2m )
                                                          r −1
                                                           
                                          ∂r̃ = ∂r + 1 −        ∂t .                                                      (6.35)
                                                         2m
    On the other hand, we deduce from (6.32) that
                                               ∂                   ∂
                                                               =                ,
                                               ∂ t̃    r,θ,φ       ∂t   r,θ,φ
which implies:
                                                         ∂t̃ = ∂t .                                                       (6.36)
In particular, the vector ∂t̃ of the IEF frame coincides with the Killing vector ξ:
                                                         ∂t̃ = ξ .                                                        (6.37)
194                                       Schwarzschild black hole
      Remark 4: The result (6.37) is not surprising since the metric components (6.33) are independent from
      t̃. This implies that ∂t̃ is a Killing vector. t̃ being a timelike coordinate in MI , it follows that ∂t̃ = αξ,
      where α is a constant. Since t̃ ∼ t when r → +∞, we get α = 1.
      Remark 5: In region MII , the four vectors (∂t̃ , ∂r̃ , ∂θ , ∂φ ) are spacelike5 . There is nothing wrong
      about that; in particular this does not contradict the signature (−, +, +, +) of the metric. The latter is
      related to the type of the basis vectors only when the metric components take a diagonal form, which
      is not the case here, since gt̃r ̸= 0 [Eq. (6.33)]. All that is demanded to (t̃, r, θ, φ) for being (locally) a
      regular coordinate system is that (∂t̃ , ∂r̃ , ∂θ , ∂φ ) form a basis of the tangent space Tp M at each point
      p. This can be achieved with any causal type of the vectors (∂t̃ , ∂r̃ , ∂θ , ∂φ ).
      Remark 6: The IEF expression (6.33) for the metric can be recast in the following remarkable form:
                                                                   2m          2
                             g = −dt̃2 + dr2 + r2 dθ2 + sin2 θ dφ2 +    dt̃ + dr ,                                 (6.38)
                                 |               {z               }  r | {z }
                                                       f                                 k⊗k
      where the f is the (flat) Minkowski metric (expressed above in terms of the spherical coordinates
      (t̃, r, θ, φ)) and k = −d(t̃ + r) = −dv. The 1-form k is dual to a vector k, which is null, as it can be
      seen from g µ̃ν̃ kµ̃ kν̃ = 0. The latter property is easily deduced from kµ̃ = (−1, −1, 0, 0) and expression
      (6.40) for g µ̃ν̃ below. A metric of the type (6.38) is said to be a Kerr-Schild metric; these peculiar metrics
      are discussed in Appendix C.
      Remark 7: In the literature, the terminology Eddington-Finkelstein coordinates is often used for the
      coordinates (v, r, θ, φ) (or (u, r, θ, φ)), i.e. for what we have called the null Eddington-Finkelstein
      coordinates, and the regularity of the metric tensor at r = 2m is demonstrated by considering the
         5
            This follows from the diagonal components gαα read on (6.33) being positive for r < 2m, but this can also be
      seen graphically on Fig. 6.3 below: for r < 2m, both the t̃ and r coordinate lines, i.e. the vertical and horizontal
      lines, are outside the light cones.
          6
            The other differences with (6.32) are a constant additive term and a misprint in Eddington’s formula: the
      term ln(r − m) should be replaced by ln(r − 2m).
               6.3 Radial null geodesics and Eddington-Finkelstein coordinates                            195
components (6.29). However, neither Eddington [177] nor Finkelstein [195] considered this null version:
they used coordinates (t̃, r, θ, φ) and they exhibited (the outgoing version of) the metric components
(6.33). Hence our terminology is more faithful to history.
with the coordinate t̃ spanning R, the coordinate r spanning (0, +∞) and the coordinates
(θ, φ) forming the standard spherical chart of S2 . The components of the inverse metric with
respect to the IEF coordinates are
                                                                     
                                  − 1 + 2m        2m
                                           
                                         r          r
                                                          0     0
                                                                     
                                     2m               2m
                                                                     
                                
                                      r
                                               1 −     r
                                                          0     0     
                       g α̃β̃ =                                      .                (6.40)
                                                                     
                                                          1
                                
                                     0            0     r2
                                                                0     
                                                                      
                                                                     
                                                                1
                                      0            0      0 r2 sin2 θ
In particular, the components g α̃β̃ are regular at r = 2m. Moreover we notice that g t̃t̃ < 0 for
all r ∈ (0, +∞). In view of the criterion (A.56), we may assert
where H is the subset of MIEF defined by r = 2m. Note that H has the topology
H ≃ R × S2 (6.42)
and that (t̃, θ, φ) is a coordinate system on H . Actually H is nothing but what has been
called the Schwarzschild horizon in the examples of Chaps. 2 and 3. Indeed, the metric (6.33)
is nothing but the metric (2.5) introduced in Example 3 of Chap. 2 (p. 29), up to the change of
notation t̃ ↔ t (compare (2.6) and (6.40) as well). We have thus the fundamental result, the
proof of which is given in Example 10 of Chap. 3 (p. 68):
196                                    Schwarzschild black hole
Property 6.5
      In particular, H is a null hypersurface, whose null geodesic generators admit ξ = ∂t̃ as tangent
      vector. It is a non-expanding horizon, whose area, as defined in Sec. 3.2.2, is (cf. Example 3 of
      Chap. 3, p. 59)
                                                A = 16πm2 .                                      (6.43)
      H is depicted in Fig. 2.7. We shall see in Sec. 6.4 that H is actually a black hole event horizon
      in Schwarzschild spacetime.
      Historical note : The first author to recognize that the hypersurface r = 2m in Schwarzschild spacetime
      is a one-way membrane, i.e. a horizon, is David Finkelstein in 1958 [195]. Amazingly, Finkelstein stressed
      rather r = 2m as a white hole boundary in a time-reversed version of (MIEF , g). In particular, he wrote
      “causal influences propagating into the "future" can cross the Schwarzschild surface only in an outward
      direction” (see also his Fig. 1). The reason is that Finkelstein considered the extension of the MI region
      to r ≤ 2m constructed with the outgoing coordinate t̃˜ = t − 2m ln(r/(2m) − 1) [his Eq. (2.3) in our
      notations] instead of t̃ as given by Eq. (6.32). However, he noticed that another extension of MI can be
      obtained by time inversion, corresponding to a “surface that is permeable inwards”. As we shall see in
      Chap. 9, both black hole and white hole regions actually exist in the maximal extension of Schwarzschild
      spacetime.
                                                          48m2
                                                    K=         .                                         (6.45)
                                                           r6
      Hence K → +∞ when r → 0. Since K is a scalar field, its value is independent of any
      coordinate system used to express it. Hence the divergence of K reflects a pathology of the
                  6.3 Radial null geodesics and Eddington-Finkelstein coordinates                                          197
                         t̃/m
                         4
                                  1      2       3      4       5      6       7      8
                                                                                          r/m
-1
-2
-3
-4
Figure 6.3: Radial null geodesics of Schwarzschild spacetime, plottved in terms of ingoing Eddington-Finkelstein
coordinates (t̃, r): the solid (resp. dashed) lines correspond to outgoing (resp. ingoing) geodesics L(u,θ,φ)
                                                                                                            out
                                                                                                                  (resp.
L(v,θ,φ) ), as given by Eq. (6.51) (resp. Eq. (6.47)). The interiors of some future light cones are depicted in yellow.
   in
Note that the hypersurfaces of constant t̃ (horizontal lines) always lie outside the light cones, i.e. are spacelike, in
agreement with t̃ being everywhere a timelike coordinate (Property 6.4). Similarly the hypersurfaces of constant
r (vertical lines) lie outside the light cones for r < 2m, in agreement with r being a timelike coordinate in MII
(Property 6.2).
Riemann tensor per se: it is called a curvature singularity. Physically, this means that
unbounded tidal forces are felt by any system approaching r = 0. This interpretation holds
because tidal forces correspond to the acceleration of the separation vector between neighboring
timelike geodesics and that acceleration is governed by the Riemann tensor, via the geodesic
deviation equation, cf. Property B.22 (see Chap. 11 of MTW [371] for more details).
                                L(v,θ,φ)
                                  in
                                         :     t̃ = −r + v ,   (θ, φ) = const,                      (6.47)
      where the constant v ∈ R labels the null curve, along with (θ, φ). The simplicity of Eq. (6.47)
      reflects the construction of the IEF coordinates on the ingoing radial null geodesics. Note that
      in MSD , Eq. (6.47) is equivalent to Eq. (6.22), given relation (6.32) between t̃ and t.
          We have seen in Sec. 6.3.1 that r is an affine parameter along L(v,θ,φ)
                                                                             in
                                                                                  . It follows that λ = −r
      is an affine parameter as well, in terms of which the equation of L(v,θ,φ) deduced from Eq. (6.47)
                                                                            in
      is
                   t̃(λ) = λ + v, r(λ) = −λ, θ(λ) = θ = const, φ(λ) = φ = const.
      The tangent vector k associated with this parametrization of L(v,θ,φ)
                                                                       in
                                                                             has components k α =
      dxα /dλ = (1, −1, 0, 0) with respect to the IEF coordinates. We have therefore
      The IEF components of the 1-form k metric-dual to k are kα = gαµ k µ with gαµ given by
      Eq. (6.33). We get kα = (−1, −1, 0, 0), hence k = −dt̃ − dr, i.e.
k = −dv. (6.49)
                                                dr    r − 2m
                                                    =        .                                      (6.50)
                                                dt̃   r + 2m
      To proceed, we have to distinguish two cases. First, if r ̸= 2m, Eq. (6.50) can be rewritten as
                                                                   
                                       r + 2m                  4m
                                 dt̃ =        dr = 1 +                 dr,
                                       r − 2m               r − 2m
                                                       r
                       L(u,θ,φ)
                         out
                                :    t̃ = r + 4m ln      −1 +u,         (θ, φ) = const,             (6.51)
                                                      2m
      where the integration constant u ∈ R labels the null curve, along with (θ, φ). Since r ̸= 2m,
      Eq. (6.51) regards MSD and we actually recover Eq. (6.21), given relation (6.32) between t̃ and t.
      On can easily show that a curve defined by (6.51) never crosses H , i.e. it either lies entirely
      in MI or lies entirely in MII (cf. Fig. 6.3). We have thus two families of outgoing radial null
      geodesics, each labelled by (u, θ, φ): L(u,θ,φ)
                                                out,I
                                                      in MI and L(u,θ,φ)
                                                                  out,II
                                                                         in MII .
               6.3 Radial null geodesics and Eddington-Finkelstein coordinates                         199
    Let us now consider the case r = 2m in Eq. (6.50); the equation reduces then to dr/dt̃ = 0,
so that the equation of the radial null curve is r = const. The constant being necessarily 2m,
we get the family of curves L(θ,φ)
                               out,H
                                     defined by
                            L(θ,φ)
                              out,H
                                    :     r = 2m ,     (θ, φ) = const.                        (6.52)
These null curves, which form a family labelled by (θ, φ), are actually the null geodesic
generators of the Killing horizon H (cf. Secs. 6.3.3 and 2.3.3). Indeed, it follows from Eq. (6.52)
that a tangent vector to them is ∂t̃ = ξ, which, for r = 2m, is the null normal of H . The
null geodesics L(θ,φ)
                    out,H
                           had not been found in Sec. 6.3.1 for they don’t belong to MSD . They
extend the outgoing family L(u,θ,φ)
                                 out
                                      since they obey the same differential equation (6.50) as the
geodesics L(u,θ,φ) . Moreover, they correspond to the limiting case r = const separating the
                out
                     H
Note that, on H , ℓ = ∂t̃ = ξ.
We have then M = R × (0, +∞) × S2 [Eq. (6.39)]. Note that we shall extend this spacetime in
Chap. 9.
     The vector k defined by Eq. (6.48) is a nonzero null vector field defined on the whole manifold
M . It may therefore be used to set the time orientation of the Schwarzschild spacetime (M , g)
(cf. Sec. 1.2.2). Since for r → +∞, k clearly points towards increasing t̃, we declare that k
defines the future direction:
200                                 Schwarzschild black hole
        The time orientation of the Schwarzschild spacetime (M , g) is such that the null vector k
        defined by Eq. (6.48) is everywhere future-directed.
          The above choice induces a time orientation of the subdomains MI and MII of M . We
      read on the metric components (6.14) that grr < 0 for r < 2m, so that the coordinate vector ∂r
      of Schwarzschild-Droste coordinates is timelike in MII . According to Lemma 1.2 (Sec. 1.2.2)
      with u = k, its time orientation is given by the scalar product k · ∂r . Given Eqs. (6.35) and
      (6.36), we have                                 −1
                                                    2m
                                       ∂r = − 1 −           ∂t̃ + ∂r̃ .
                                                     r
      Via (6.48) and (6.33), we deduce then that
                                            r       r −1
                                  k · ∂r =      1−          > 0 in MII .
                                           2m       2m
      In view of Eq. (1.6c) in Lemma 1.2, we conclude:
      This explains why −∂r lies within the future null cones in Fig. 6.1 for r < 2m.
         A corollary is:
In region MII , r must decrease towards the future along any null or timelike worldline.
      Proof. Let L be a causal curve in region MII and λ a parameter along L increasing towards
      the future. The associated tangent vector v = dx/dλ is then future-directed. According to
      Property 6.8, −∂r is a future-directed timelike vector in MII , so that we can apply Lemma 1.1
      (Sec. 1.2.2) with u = −∂r and get g(−∂r , v) < 0. Now, using the Schwarzschild-Droste
      components (6.14), we have
                                                                              −1
                                         µ          r         dr       2m           dr
                      g(−∂r , v) = −grµ v = −grr v = −grr        =         −1          .
                                                              dλ        r          dλ
      Since 2m/r − 1 > 0 in MII , g(−∂r , v) < 0 is thus equivalent to dr/dλ < 0, which proves
      that r is decreasing along L as λ increases.
      Thus not only an observer in MII cannot cross MII ’s outer boundary H to visit MI , H being
      a null hypersurface, but he is forced to move to decreasing r until he reaches the curvature
      singularity at r → 0. We shall study this motion in detail in Sec. 7.3.2.
                                            6.4 Black hole character                                                 201
Figure 6.4: Manifold with boundary M ′ = MII ∪ H ∪ MI ∪ I ′ , drawn in terms of the coordinates x and
(a compactified version of) v. The dashed lines are the ingoing radial null geodesics (as in Fig. 6.3), the arrows
marking the future orientation.
      Since (v, x, θ, φ) is a global coordinate system on M (up to the trivial coordinate singularities
      of (θ, φ)), we can identify M with the following open subset of R2 × S2 :
M = R × (−∞, 1) × S2 , (6.60)
      with v spanning R, x spanning (−∞, 1) and (θ, φ) spanning S2 . We can then extend M to the
      manifold with boundary7
                                      M ′ := R × (−∞, 1] × S2 .                            (6.61)
      Notice the change (−∞, 1) → (−∞, 1] with respect to (6.60), which means that x = 1 is an
      allowed value on M ′ ; it actually defines the boundary of M ′ , I ′ say. According to (6.55), I ′
      corresponds to r → +∞. A view of the manifold M ′ is provided in Fig. 6.4. We note that the
      conformal metric (6.59) can be extended to the boundary I ′ , yielding a regular metric. Indeed,
      the determinant of the metric components (6.59) is
      which does not vanish at x = 1 (except at the trivial coordinate singularity θ = 0 or θ = π),
      showing that g̃ is a non-degenerate symmetric bilinear form at I ′ and hence a well-defined
      metric on all M ′ . Furthermore we have Ω > 0 on M and Ω = 0 at I ′ [set x = 1 in Eq. (6.57)],
      as well as
                                             dΩ = −dx ̸= 0.                                   (6.62)
      Hence (M ′ , g̃) obeys all the conditions listed in Sec. 4.3 to be a conformal completion at infinity
      of (M , g). However, it is not adapted to the black hole definition given in Sec. 4.4.2. Indeed
      I ′ does not include any future infinity (I + ). Actually, I ′ is entirely a past infinity: a generic
      point of I ′ has coordinates (v, x, θ, φ) = (v0 , 1, θ0 , φ0 ) and is the past end point of the ingoing
      radial null geodesic defined by (v, θ, φ) = (v0 , θ0 , φ0 ). Therefore, we shall extend M ′ to include
      some I + part. To achieve this, we shall construct I + as the set of endpoints of the outgoing
      radial null geodesics in MI . In terms of the null ingoing Eddington-Finkelstein coordinates
      (v, r, θ, φ), the equation of these geodesics is obtained by combining (6.51) and (6.47):
                                                                     r
                                               v = 2r + 4m ln          − 1 + u,                        (6.63)
                                                                    2m
      where u ∈ R is a constant parameter along a given geodesic. We notice that on MI , we may
      use (xα̌ ) = (u, r, θ, φ) as a coordinate system, naturally called the null outgoing Eddington-
      Finkelstein coordinates. Since (6.63) implies
                                                                     2
                                                  dv = du +               dr,
                                                                 1 − 2m/r
         7
             Cf. Sec. A.2.2 for the definition of a manifold with boundary.
                                             6.4 Black hole character                                                  203
Figure 6.5: Manifold with boundary MI′′ = MI ∪ I ′′ , drawn in terms of the coordinates x and (a compactified
version of) u. The green solid lines are the outgoing radial null geodesics (as in Fig. 6.3), the arrows marking the
future orientation. Note that H , which is drawn on this figure, is not part of MI′′ .
Remark 1: Contrary to (v, r, θ, φ), the coordinates (u, r, θ, φ) do not cover all M = MIEF , but only MI .
                                                                                       out,I
This is graphically evident from Fig. 6.3, where the outgoing radial null geodesics L(u,θ,φ)  accumulate
on H as u → +∞ from the MI side.
                                                                                               ′′
   On MI , let us perform the change of coordinates (xα̌ ) = (u, r, θ, φ) → (xα ) = (u, x, θ, φ),
where x is related to r by the same formula as (6.55), except that on MI , x’s range is (0, 1) only.
We deduce from (6.64) and (6.55) the expression of g in terms of the coordinates (u, x, θ, φ):
                                         4m              4m2
                  g = −x du2 −                                     2       2     2
                                                                                                             (6.65)
                                                                                   
                                               du dx +          dθ   + sin   θ dφ    .
                                      (1 − x)2         (1 − x)2
Let us identify MI with the following open subset of R2 × S2 :
                                           MI = R × (0, 1) × S2 ,                                            (6.66)
with u spanning R, x spanning (0, 1) and (θ, φ) spanning S2 . Similarly to what we did above
for M , we may then extend MI to the manifold with boundary
                                           MI′′ := R × (0, 1] × S2 .                                         (6.67)
The boundary of MI′′ , I ′′ say, lies at x = 1 (cf. Fig. 6.5). It shall not be confused with the
boundary of MI as a submanifold of M ′ , which is I ′ . The difference arises from the fact that
u diverges (to −∞) when one approaches I ′ in M ′ , so that u cannot be used as a coordinate
on M ′ . This is clear on the relation (6.63) between u, v and r, which, once re-expressed in
terms of x, becomes                                            
                                                1             x
                              u = v − 4m             + ln           .                       (6.68)
                                              1−x           1−x
For a fixed value of v in M ′ , this relation yields indeed diverging values of u at two places:
204                                   Schwarzschild black hole
      We notice that it is regular and non-degenerate in all MI′′ , including on I ′′ (x = 1), and that
      on the submanifold MI , it is related to the physical metric g by g̃ = Ω2 g, with the scalar field
      Ω taking the same expression in terms of x as that introduced in Eq. (6.57): Ω = 1 − x.
          The conformal completion of (M , g) including both I ′ (as I − ) and I ′′ (as I + ) is con-
      structed as follows. Let
                                              M˜ = M ′ ∪ MI′′ .                                   (6.70)
      We endow M˜ with two coordinate charts:
             Φ2 ◦ Φ−1
                   1 : R × (0, 1) × S
                                      2
                                         −→ R × (0, 1) × S2
                                                          1                                             (6.73)
                                                                                 x
                                                                                                  
                         (v, x, θ, φ)   7−→ u = v − 4m 1−x   + ln               1−x
                                                                                           , x, θ, φ ,
The above construction makes M˜ a manifold with boundary (cf. Fig. 6.6), the boundary being
I = I + ∪ I −, (6.74)
      with
                  I + := {p ∈ MI′′ , x(p) = 1}         and      I − := {p ∈ M ′ , x(p) = 1}.            (6.75)
          We then endow M˜ with a Lorentzian metric g̃, whose expression is given by (6.59) on M ′
      and by (6.69) on MI′′ . By construction, (M˜, g̃) is then a conformal completion at null infinity
      of the Schwarzschild spacetime (M , g), the conformal factor Ω being given by (6.57) in both
      charts (M ′ , Φ1 ) and (MI′′ , Φ2 ): Ω = 1 − x. In particular, it is clear that no past-directed causal
      curve originating in M intersects I + and that no future-directed causal curve originating in
      M intersects I − . We also check immediately that I + and I − are null hypersurfaces with
      respect to the metric g̃: both hypersurfaces are defined by x = 1, so that the induced metric on
      them, as deduced from (6.59) and (6.69), is
Figure 6.6: Schematic view of the manifold with boundary M˜, which defines a conformal completion at null
infinity of Schwarzschild spacetime (M , g). NB: contrary to Figs. 6.4 and 6.5, this figure is not drawn on some
specific coordinate system. As in Figs. 6.3, 6.4 and 6.5, the green solid (resp. dashed) lines are the outgoing (resp.
ingoing) radial null geodesics, the arrows marking the future orientation.
which is clearly degenerate (along the u direction for I + and along the v direction for I − ).
   As it is clear from Fig. 6.6, MI is the interior of the causal past of I + within M :
                                  MI = int J − (I + ) ∩ M .                              (6.77)
                                                              
  The Schwarzschild spacetime (M = MIEF , g) has a black hole region B, the interior of
  which is MII ; the event horizon is nothing but the Schwarzschild horizon H discussed in
  Sec. 6.3.3.
Remark 2: As stated at the beginning of this section, the null character of the boundary H between
MI and MII and the fact that MII never intersect the asymptotically flat region r → +∞, was sufficient
to claim that MII represents what by any means should be called a black hole region. Therefore, we can
view the above demonstration more as a “sanity check” of the formal definition of a black hole given in
Sec. 4.4.2: this definition would not have been acceptable if it would not apply to the Schwarzschild
spacetime.
Remark 3: The above construction of the conformal completion at null infinity (M˜, g̃) involves two
coordinate charts, (v, x, θ, φ) and (u, x, θ, φ), with two different domains, M ′ and MI′′ . As will be
discussed in Chap. 9, one may construct a conformal completion with a single chart, as in the Minkowski
case, but its relation with the coordinates introduced so far is quite involved. In particular the standard
206                                  Schwarzschild black hole
Contents
            7.1    Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207
            7.2    Geodesic motion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207
            7.3 Timelike geodesics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 214
7.1         Introduction
We have already investigated some geodesics of Schwarzschild spacetime in Chap. 6, namely
the radial null geodesics (Secs. 6.3.1 and 6.3.5). Here, we perform an extensive study. After
having established the main properties of generic causal (timelike or null) geodesics in Sec. 7.2,
we investigate timelike geodesics in Sec. 7.3. They are of great physical importance, since they
represent orbits of planets or stars around the black hole, as well as worldlines of intrepid
observers freely falling into the black hole. The study of null geodesics, which govern images
received by observers, is deferred to Chap. 8.
    Before studying this chapter, the reader might want to have a look at Appendix B, which
recaps the properties of geodesics in manifolds equipped with a metric. It could also be worth
to read again Sec. 1.3 about worldlines of particles.
      either massive (L timelike) or massless (L null). As recalled in Sec. 1.3, the worldline of a
      particle P is a geodesic if, and only if, P is submitted only to gravitation, i.e. P is in free fall.
        Denoting by p the 4-momentum of particle P that follows the geodesic L (cf. Sec. 1.3),
        the scalar products
                                           E := −ξ · p = −g(ξ, p)                                 (7.1a)
                                           L := η · p = g(η, p) ,                                 (7.1b)
        are constant along L . The scalar E is called P’s conserved energy or energy at infin-
        ity, while L is called P’s conserved angular momentum or angular momentum at
        infinity.
      Proof. The 4-momentum p is a tangent vector associated with an affine parameter of L , i.e. it
      obeys the geodesic equation (1.11). The constancy of E and L follow then from Property B.20
      (Sec. B.5).
      In coordinates (t, r, θ, φ) adapted to the spacetime symmetries, i.e. coordinates such that ξ = ∂t
      and η = ∂φ , for instance the Schwarzschild-Droste coordinates or the Eddington-Finkelstein
      ones, one can rewrite (7.1) in terms of the components pt = gtµ pµ and pφ = gφµ pµ of the
      1-form p associated to p by metric duality:
                                                   E = −pt                                          (7.2a)
                                                   L = pφ                                           (7.2b)
  The conserved energy E is a positive quantity as soon as the geodesic L has some part in
  MI , i.e. some part with r > 2m:
L ∩ MI ̸= ∅ =⇒ E > 0. (7.5)
Proof. In MI , the Killing vector ξ is timelike and future-directed. The 4-momentum p is either
timelike or null and always future-directed. By Eq. (1.5a) in Lemma 1.1 (Sec. 1.2.2), one has
then necessarily ξ · p < 0; hence Eq. (7.1a) implies E > 0 in MI . Since E is constant along L ,
it follows that E > 0 everywhere.
Remark 1: If the geodesic L is confined to MII , i.e. to the black hole region (cf. Sec. 6.4), where ξ
is spacelike (cf. Sec. 6.2.5), it is possible to have E ≤ 0, since the scalar product of p with a spacelike
vector can take any value.
Remark 2: The Killing vector η being always spacelike, the scalar product g(η, p) can a priori take
any real value, and thus there is no constraint on the sign of L.
                                        t=0
                                      p = pt ∂t + pr ∂r + pφ ∂φ .                                     (7.6)
Now, for t > 0, if the geodesic L were departing from θ = π/2, this would constitute some
breaking of spherical symmetry, making a difference between the “Northern” hemisphere and
210           Geodesics in Schwarzschild spacetime: generic and timelike cases
pθ = 0 . (7.7)
We conclude:
         Let us denote by µ the mass of particle P, with possibly µ = 0 if P is a photon. The scalar
      square of the 4-momentum p is then [cf. Eq. (1.10)]
                                                                dx
                                                          p=       ,                                               (7.9)
                                                                dλ
                      dt                      dr                       dθ                     dφ
              ṫ :=      = pt ,       ṙ :=      = pr ,        θ̇ :=      = pθ ,      φ̇ :=      = pφ .           (7.10)
                      dλ                      dλ                       dλ                     dλ
         3
          More rigorously, Eq. (7.7) can be derived from the geodesic equation (1.11): given the expression of the
      Christoffel symbols of g in Schwarzschild-Droste coordinates (cf. Sec. D.4.2), Eq. (1.11) yields
                                              dpθ  2                         2
                                                  + pr pθ − sin θ cos θ (pφ ) = 0,
                                              dλ   r
      where λ is the affine parameter of L associated with p, so that pθ = dθ/dλ. Whatever the values of r(λ), pr (λ)
      and pφ (λ), the solution to this ordinary differential equation with the initial conditions pθ = 0 and cos θ = 0 is
      pθ = 0 for all values of λ.
         4
           Let us recall that Schwarzschild spacetime is time-oriented, cf. Sec. 6.3.6.
                                           7.2 Geodesic motion                                      211
In the present case, where θ(λ) = π/2, we have of course θ̇ = 0, in agreement with Eq. (7.7).
Given the components (6.14) of Schwarzschild metric with respect to the Schwarzschild-Droste
coordinates, Eq. (7.1a) can be written as
                                                                 
                                   µ       t                  2m
                        E = −gtµ p = −gtt p = −gtt ṫ = 1 −          ṫ,
                                                               r
hence
                                                    −1
                                         dt       2m
                                            =E 1−        .                                 (7.11)
                                         dλ        r
Similarly, Eq. (7.1b) becomes
  Along any timelike or null geodesic of Schwarzschild spacetime, the azimuthal coordinate
  φ is either constant (L = 0) or increases (resp. decreases) monotonically (L > 0) (resp.
  L < 0).
      Eq. (7.13) is decoupled from the other two equations. The task is then to first solve this equation
      for r(λ) and to inject the solution into Eqs. (7.11) and (7.12), which can then be integrated
      separately.
          A constraint to keep in mind is that the 4-momentum vector p, whose components are
      related to the solution (t(λ), r(λ), φ(λ)) by Eq. (7.10), has to be a future-directed causal vector.
      In MI , as we have seen above, this is guaranteed by choosing E > 0 [cf. Eq. (7.5)]. In MII ,
      a future-directed timelike vector is −∂r (cf. Sec. 6.3.6). According to Eq. (1.5a) in Lemma 1.1
      (Sec. 1.2.2), we have then p future-directed iff −∂r · p < 0, i.e. iff
                                                      −1
                                                  2m
                                                     −1    pr < 0.
                                                   r
Since 2m/r − 1 > 0 in MII , this is equivalent to pr < 0, i.e. to dr/dλ < 0. Hence
        In the black hole region MII , i.e. for r < 2m, the solution r(λ) of Eq. (7.13) must be a
        strictly decreasing function of λ.
      Actually, we recover Property 6.9, which has been derived for any causal worldline (not
      necessarily a geodesic) in Sec. 6.3.6.
      Remark 3: We have derived the system of Eqs. (7.11), (7.12) and (7.13) without invoking explicitly the
      famous geodesic equation, i.e. Eq. (B.10) in Appendix B. This is because we had enough first integrals of
      the second-order differential equation (B.10) to completely reduce it to a system of first order equations.
L ̸= 0. (7.14)
      We have stressed above that φ is then a strictly monotonic function of λ, increasing (resp.
      decreasing) continuously along L for L > 0 (resp. L < 0). Consequently,
        Along any timelike or null geodesic with L ̸= 0, φ can be chosen as a parameter, provided
        one does not restrict its range to (0, 2π).
                                                 7.2 Geodesic motion                                                213
This differential equation determines entirely the (r, φ)-part of the geodesic L , which we shall
call the trajectory of L in the orbital plane. The term “orbital plane” is a slight abuse of
language for the 2-dimensional surface (t, θ) = (t0 , π/2), where t0 is a constant.
    In general Eq. (7.17) is not solvable in terms of elementary functions. The exceptions are
circular orbits (u = const, the constant being one of the roots of the cubic polynomial in u in
the right-hand side) and the critical null geodesics, which we shall discuss in Sec. 8.3.2. For the
generic case, exact solutions are expressible in terms of some non-elementary special functions;
there are basically two strategies:
   • The first one is to invoke the Weierstrass elliptic function5 ℘(z; ω1 , ω2 ), which is a doubly-
     periodic meromorphic function of the complex variable z, of periods ω1 ∈ C and ω2 ∈ C.
     Among the many properties of this function, the one relevant here is that ℘ is a solution
     to the differential equation
                                                   2
                                          (℘′ (z)) = 4℘(z)3 − g2 ℘(z) − g3 ,                               (7.18)
         where g2 and g3 are two constants entirely determined by the √ periods ω1 and ω2 of ℘.
         Indeed, via the change of variables v := u − 1/6 and φ̃ := φ/ 2, Eq. (7.17) is equivalent
         to
                2                      mµ 2             2
                  dv               1                       mE        4  mµ 2     1
                             3
                         = 4v −      −4            v+2             −           − ,           (7.19)
                 dφ̃               3        L               L        3 L          27
         which is obviously of type (7.18) (no square in the cubic polynomial). The solution is thus
                                                                  
                                        m          φ                    1
                                   u=       = ℘ √ + C; ω1 , ω2 + ,                             (7.20)
                                         r          2                   6
         where C ∈ C is a constant and ω1 and ω2 are determined by m, µ, E and L (see e.g.
         Ref. [220] for more details regarding this method applied to null geodesics).
   5
       The character ℘ is a kind of calligraphic lowercase p, which is standard to denote this function.
214           Geodesics in Schwarzschild spacetime: generic and timelike cases
         • The second approach consists in noticing that the method of separation of variables can
           easily be applied to Eq. (7.17), leading to
                                Z u
                                                         dū
                        φ=±          q                                      + φ0 ,          (7.21)
                                                       mµ 2    mE 2      mµ 2
                                                                 
                                 u0        3     2
                                        2ū − ū + 2 L ū + L        − L
            where u0 and φ0 are two constants, and the ± sign can be + on some parts of the geodesic
            L and − on some other parts of L . The integral in right-hand side is expressible in
            terms of the so-called incomplete elliptic integrals of the first kind. We shall detail such
            a technique for null geodesics in Sec. 8.3. Note that this approach leads to φ = φ(r),
            whereas the method involving the Weierstrass function leads to the “polar equation”
            form: r = r(φ). It is possible though to get the polar form by invoking the inverses of
            elliptic integrals, namely Jacobi elliptic functions.
         Once the solution r = r(φ) of Eq. (7.17) has been obtained, it can be injected into the
      equation for t = t(φ) that can be deduced from the equations of motions (7.11)-(7.12):
                                                dt   E r(φ)3
                                                   =             .                                  (7.22)
                                                dφ   L r(φ) − 2m
      This is an ordinary differential equation for t = t(φ), the solution to which amounts to finding
      a primitive with respect to φ of the right-hand side. Unfortunately, this is not an easy task in
      general, the function r(φ) being quite involved, except for circular orbits (r(φ) = const).
          In what follows, we discuss separately the resolution of the system (7.11)-(7.13) or of
      Eq. (7.17) for timelike geodesics (Sec. 7.3) and for null geodesics (Chap. 8).
τ = µλ, (7.23)
      up to some additive constant. This is of course a special case of the generic relation (B.3)
      between two affine parameters of the same geodesic. Equation (7.13) becomes then
                                                     2
                                                                       ε2 − 1
                                            
                                        1       dr
                                                          + Vℓ (r) =          ,                     (7.24)
                                        2       dτ                        2
      where
                                                    ℓ2
                                                                          
                                                 m                      2m
                                      Vℓ (r) := − + 2                1−                             (7.25)
                                                 r 2r                    r
                                          7.3 Timelike geodesics                                              215
and ε and ℓ are respectively the specific conserved energy and specific conserved angular
momentum of particle P:
                                E                                    L
                         ε :=     = −ξ · u          and       ℓ :=     =η·u,                        (7.26)
                                µ                                    µ
where u is the 4-velocity of P and the second equalities result from definitions (7.1) and the
relation p = µ u [Eq. (1.18)]. Note that ε is dimensionless (in units c = 1) and that it shares
the same positiveness property (7.5) as E:
  The specific conserved energy ε is positive as soon as the timelike geodesic L has some
  part in MI , i.e. some part with r > 2m:
L ∩ MI ̸= ∅ =⇒ ε > 0. (7.27)
On the contrary, ℓ can be either positive, zero or negative, depending on the variation of φ
along L , as was already noticed above for L.
    We note that Eq. (7.24) has the shape of the first integral of the 1-dimensional motion
of a non-relativist particle in the potential Vℓ (called hereafter the effective potential), the
term 1/2 (dr/dτ )2 being interpreted as the kinetic energy per unit mass, Vℓ (r) as the potential
energy per unit mass and the constant right-hand side (ε2 − 1)/2 as the total mechanical energy
per unit mass.
Remark 1: The effective potential (7.25) differs from its non-relativistic (Newtonian) counterpart only
by the factor 1 − 2m/r instead of 1. This difference plays an important role for small values of r, leading
to some orbital instability, as we shall see in Sec. 7.3.3.
  In MI , where the Killing vector ξ is timelike, we may introduce the static observer O,
whose 4-velocity uO is collinear to ξ:
                                                  −1/2
                                                2m
                                       uO = 1 −          ξ,                                         (7.28)
                                                 r
the proportionality coefficient ensuring that uO ·uO = −1 given that ξ·ξ = gtt = −(1−2m/r).
We have then, from (7.26),
                                        1/2                    1/2
                                     2m                       2m
                       ε=− 1−                 uO · u = Γ 1 −           ,              (7.29)
                                      r                        r
where Γ = −uO · u is the Lorentz factor of P with respect to O (cf. Sec. 1.4; in particular
Eq. (1.33)). We may express Γ in terms of the norm v of the velocity of P with respect to O,
according to Eq. (1.35): Γ = (1 − v 2 )−1/2 and get
                                                              1/2
                                               2 −1/2       2m
                                                                                                    (7.30)
                                                
                                  ε= 1−v                 1−         .
                                                             r
216           Geodesics in Schwarzschild spacetime: generic and timelike cases
V` ( r )
                                   2              4              6              8               10
                                                                                                     r/m
-0.5
                    -1
                                                                                    `/m = 0. 0000
                                                                                    `/m = 2. 0000
                                                                                    `/m = 3. 0000
                                                                                    `/m = 3. 4641
                                                                                    `/m = 3. 8000
                                                                                    `/m = 4. 2000
                                                                                    `/m = 4. 6000
                                                                                    `/m = 5. 0000
                  -1.5
      Figure 7.1:    Effective potential Vℓ (r) governing the r-part of the motion along a timelike geodesic in
      Schwarzschild spacetime via Eq. (7.24). The vertical dashed line marks r = 2m, i.e. the location of the event
      horizon. The numerical value ℓ/m = 3.4641 is that of the critical specific angular momentum (7.32).
      In the region r ≫ m, we may perform a first order expansion, assuming that P moves at
      nonrelativistic velocity with respect to O (v ≪ 1), thereby obtaining:
                                       1     m
                                ε − 1 ≃ v2 −                 (r ≫ m and        v ≪ 1) .                     (7.31)
                                       2     r
      We recognize in the right-hand side the Newtonian mechanical energy per unit mass of particle
      P with respect to observer O, who can then be considered as an inertial observer, v 2 /2 being
      the kinetic energy per unit mass and −m/r the gravitational potential energy per unit mass.
          The profile of Vℓ (r) for selected values of ℓ is plotted in Figs. 7.1 and 7.2 . Its extrema are
      given by dVℓ /dr = 0, which is equivalent to
mr2 − ℓ2 r + 3ℓ2 m = 0.
      This quadratic equation admits real roots iff |ℓ| ≥ ℓcrit , with
                                                    √
                                           ℓcrit = 2 3 m ≃ 3.464102 m .                                     (7.32)
              V` ( r )
             0.15                                                            `/m = 0. 0000
                                                                             `/m = 2. 0000
                                                                             `/m = 3. 0000
                                                                             `/m = 3. 4641
              0.1                                                            `/m = 3. 8000
                                                                             `/m = 4. 2000
                                                                             `/m = 4. 6000
                                                                             `/m = 5. 0000
             0.05
                                5              10          15           20               25
                                                                                              r/m
-0.05
-0.1
-0.15
Figure 7.2: Same as Fig. 7.1, but with a zoom in along the y-axis and a zoom out along the x-axis. The dots
mark the mimima of Vℓ , locating stable circular orbits.
      This equation is identical to that governing radial free fall in the gravitational field generated
      by a mass m in Newtonian gravity. The solution is well known and depends on the sign of the
      “mechanical energy” in the right-hand side, i.e. of the position of ε with respect to 1:
          • if ε > 1, the solution is given in parameterized form (parameter η) by
                                                  m
                                        τ= 2
                                                         (sinh η − η) + τ0
                                               (ε − 1)3/2
                                                                                                    (7.37)
                                        r = m (cosh η − 1) ,
                                       
                                               ε2 − 1
      In the above formulas, τ0 is a constant; for ε ≥ µ, τ0 is the value of τ for which r → 0, while
      for ε < 1, it is the value of τ at which r takes its maximal value.
             r/m
             6
                                                                                   r0 = 2. 10 m
             5                                                                     r0 = 3. 00 m
                                                                                   r0 = 4. 00 m
             4                                                                     r0 = 5. 00 m
                                                                                   r0 = 6. 00 m
             3
             2
             1
                                      5                       10                       15
                                                                                                  τ/m
Figure 7.3: Coordinate r as a function of the proper time τ for the radial free fall from rest, for various initial
values r0 of r.
   The solution for t = t(τ ) is obtained by combining dt/dτ as expressed by (7.34) and dτ /dη
deduced from (7.41):                 r                    r
                              dτ        r03                  r0
                                  =         (1 + cos η) =       r.
                              dη        8m                  2m
We get
                                          r      −1 r
                                                                     r2
                                      
                   dt   dt dτ     r0         2m             r0
                      =       =ε     r 1−            =         −1        ,
                   dη   dτ dη    2m           r            2m     r − 2m
where we have used (7.40) and ε > 0 [Eq. (7.27)] to write ε = 1 − 2m/r0 . Substituting r
                                                               p
This equation can be integrated to (cf. the SageMath computation in Sec. D.4.5)
                        (r                                  p r0           η
                                                                             )
                              r0       hr0            i          − 1 + tan
              t = 2m             −1 η+     (η + sin η) + ln p 2m
                                                              r0
                                                                           2
                                                                           η   ,                            (7.42)
                             2m        4m                     2m
                                                                 − 1 − tan 2
                      t/m                                               t̃/m
                     25                    r0 = 2. 10 m                25                      r0 = 2. 10 m
                                           r0 = 3. 00 m                                        r0 = 3. 00 m
                                           r0 = 4. 00 m                                        r0 = 4. 00 m
                                           r0 = 5. 00 m                                        r0 = 5. 00 m
                                           r0 = 6. 00 m                                        r0 = 6. 00 m
                     20                                                20
15 15
10 10
5 5
                           1 2 3 4 5 6 7 8
                                              r/m                           1 2 3 4 5 6 7 8
                                                                                                   r/m
      Figure 7.4: Radial free fall from rest, viewed in Schwarzschild-Droste coordinates (t, r) (left) and in the ingoing
      Eddington-Finkelstein coordinates (t̃, r) (right), for various values r0 of the coordinate r at τ = 0. The grey area
      is the black hole region MII .
      is precisely the value of η yielding r = 2m in Eq. (7.41) [to see it, rewrite the second part of
      Eq. (7.41) as r = r0 cos2 (η/2) = r0 /(1 + tan2 (η/2))]. This singularity of t(η) appears also
      clearly on Fig. 7.4 (left panel). On the other hand, the equation for r, Eq. (7.36), does not exhibit
      any pathology at r = 2m, nor its solution (7.42). Actually, had we started from the ingoing
      Eddington-Finkelstein (IEF) coordinates (t̃, r, θ, φ), instead of the Schwarzschild-Droste ones,
      we would have found6 exactly the same solution for r(τ ) (which is not surprising since r,
      considered as a scalar field on M , is perfectly regular at H ). The solution for t̃(τ ) can be
      deduced from that for t(τ ) by the coordinate transformation law (6.32). Noticing that and
      r/(2m) = cos2 (η/2)/ cos2 (ηh /2), we get
                               cos2 (η/2)
                                                                                     
                       r                             2              1           1
                         −1 =              − 1 = cos (η/2)                −
                      2m      cos2 (ηh /2)                    cos2 (ηh /2) cos2 (η/2)
                            = cos2 (η/2) tan2 (ηh /2) − tan2 (η/2) .                                               (7.44)
                                                                   
      Using this identity, as well as (7.43) to express       r0 /(2m) − 1 in the logarithm term of Eq. (7.42),
                                                             p
         6
             Exercise: do it!
                                                7.3 Timelike geodesics                                             221
                τ/m
                          τf
               15         τh
10
                0
                      2                 3                  4                  5           6
                                                                                              r0 /m
Figure 7.5: Elapsed proper time to reach the event horizon (τh , dashed curve) and the central singularity (τf ,
solid curve), as a function of the initial value of r for a radial free fall from rest.
From this expression, we have t̃ = 4m ln tan(ηh /2) for η = 0. Now, we can change the origin
of the IEF coordinate t̃ to ensure t̃ = 0 for η = 0, i.e. τ = 0. We get then
             r                                                 −1/2       
                     r0   h    r0            i          η  r0             η
   t̃ = 2m              −1 η+     (η + sin η) + 2 ln cos +     −1      sin      . (7.45)
                    2m        4m                        2  2m              2
This expression is perfectly regular for all values of η in [0, π], reflecting the fact that the
ingoing Eddington-Finkelstein coordinates cover all M in a regular way. The radial free fall
solution in terms of (t̃, r) is represented in the right panel of Fig. 7.4. We note the smooth
crossing of the event horizon H .
    In view of Eq. (7.41), we may say that the radial infall starts at η = 0, for which τ = 0
and r = r0 , and terminates at η = π, for which r = 0, which means that the particle hits the
curvature singularity (cf. Sec. 6.3.4). The final value of the particle’s proper time is obtained by
222           Geodesics in Schwarzschild spacetime: generic and timelike cases
∆τin /m
2.5
1.5
                                       5                  10                  15                 20
                                                                                                      r0 /m
      Figure 7.6: Proper time spent inside the black hole region as a function of the initial value of r for a radial free
      fall from rest. Note that r0 = 2m does not correspond to any asymptote but to the finite value ∆τin = π m with
      a vertical tangent. On the other side, there is an horizontal asymptote ∆τin → 4m/3 for r0 → +∞.
      As noticed above, the event horizon H is crossed at η = ηh ; via (7.41) and (7.43), this corre-
      sponds to the following value of the proper time:
                                      r       "        r         s             #
                                          r03
                                                                       
                                                          r0       2m      2m
                               τh =             arctan       −1+        1−        ,                                (7.48)
                                          2m             2m         r0      r0
      while (7.45) leads to the following value of the IEF coordinate t̃:
                                 r                r                          
                              r0     r0               r0      r0           r0
              t̃h = 2m 2 1 +             − 1 arctan       −1+     − 1 − ln       .                                 (7.49)
                             4m      2m               2m      2m           2m
      The variation of τh and τf with r0 are depicted in Fig. 7.5 and numerical values for r0 = 6m
      and standard astrophysical black holes are provided in Table 7.1.
                                               7.3 Timelike geodesics                                                223
m rS = 2m τh τf ∆τin
Table 7.1: Proper time to reach the event horizon (τh ) and the central curvature singularity (τf ), as well as
elapsed proper time inside the black hole region (∆τin ), when freely falling from rest at r0 = 6m. The numerical
values are given for various black hole masses m, corresponding to astrophysical objects: the stellar black hole
Cygnus X-1 [393, 225], the supermassive black hole at the center of our galaxy (Sagittarius A*) [215, 301] and the
supermassive black hole M87* in the nucleus of the galaxy Messier 87 [214, 6].
It varies between πm (r0 → 2m) and 4m/3 (r0 → +∞) (cf. Fig. 7.6 and Sec. D.4.5 for the
computation of limr0 →+∞ ∆τin ). Numerical values for astrophysical black holes are provided
in Table 7.1.
                      `/m
                      6
5.5
4.5
3.5
                            2            4             6              8             10            12
                                                                                                       r/m
      Figure 7.7: Specific conserved angular momentum ℓ = L/µ on circular orbits as a function of the orbit
                                                                                                        −
      circumferential radius r. The dashed part of the curve corresponds to unstable orbits (r = rcirc      (ℓ), as given
      by
       √ Eq. (7.53)), while the solid part corresponds to stable orbits (r = r +
                                                                              circ (ℓ)). The minimal value of ℓ is ℓcrit =
      2 3 m ≃ 3.46 m.
        Circular orbits in Schwarzschild spacetime exist for all values of r > 3m. Those with
        r < 6m are unstable and those with r > 6m are stable. The marginal case r = 6m is called
        the innermost stable circular orbit, often abridged as ISCO.
      Remark 2: In the Newtonian spherical gravitational field generated by a point mass m, there is
      no unstable orbit, and thus no ISCO. The existence of unstable orbits in the relativistic case can be
      understood by the extra term in the effective potential Vℓ (r) (cf. Remark 1 on p. 215), which adds the
      attractive part −ℓ2 m/r3 to the two terms constituting the Newtonian potential: −m/r (attractive)
      and ℓ2 /(2r2 ) (repulsive). The latter is responsible for the infinite “centrifugal barrier” at small r in the
      Newtonian problem, leading always to a minimum of Vℓ (r) and thus to a stable circular orbit. In the
      relativistic case, for r small enough, the attractive term, which is O(r−3 ), dominates over the centrifugal
      one, which is only O(r−2 ). Equivalently, we may say that the “centrifugal barrier” is weakened by
      the factor 1 − 2m/r (cf. the expression (7.25) of Vℓ (r)) and ceases to exist for small values of |ℓ| (i.e.
      |ℓ| < ℓcrit ).
          From Eq. (7.52), we can easily express ℓ as a function of r on a circular orbit:
                                                              r
                                                                     m
                                                    |ℓ| = r              .                                         (7.54)
                                                                  r − 3m
                                             7.3 Timelike geodesics                                                 225
                                                  r − 2m
                                             ε= p          .                                              (7.55)
                                                 r(r − 3m)
This function is represented in Fig. 7.8. The minimal value of ε is achieved for r = 6m, i.e. at
the ISCO:                                         √
                                                 2 2
                              min ε = εISCO =         ≃ 0.9428 .                         (7.56)
                                                  3
From Fig. 7.8, we notice that
This corresponds to bound orbits, i.e. to geodesics that, if slightly perturbed, cannot reach
the asymptotically flat region r ≫ 2m, since E ≥ µ there. Indeed, when r → +∞, the Killing
vector ξ can be interpreted as the 4-velocity of some asymptotically inertial observer (at rest
with respect to the black hole) and E is the particle energy measured by that observer; the
famous Einstein relation (1.32) is then E = Γµ, where Γ is the Lorentz factor of the particle
with respect to the observer. Since Γ ≥ 1 [Eq. (1.35)], we have obviously7 E ≥ µ. For this
reason, the circular orbit at r = 4m is called the marginally bound circular orbit. Note that
the marginally bound circular orbit is unstable, since it has r < 6m.
    The track of circular orbits in the (ℓ, ε) plane is depicted in Fig. 7.9. The ISCO, which is a
minimum for both ε and ℓ, appears as a cusp point.
    The angular velocity of a circular orbit L is defined by
                                                    dφ           uφ
                                             Ω :=            =      ,                                     (7.58)
                                                    dt   L       ut
where uφ = dφ/dτ and ut = dt/dτ are the only nonzero components w.r.t. Schwarzschild-
Droste coordinates of the 4-velocity u along the worldline L . It follows from (7.58) that Ω
enters into the linear combination of the two Killing vectors ξ and η expressing the 4-velocity
on a circular orbit according to
                                       u = ut (ξ + Ωη) .                                 (7.59)
We have the following nice physical interpretation:
  The quantity Ω defined by Eq. (7.58) is nothing but the angular velocity of the orbiting
  particle P monitored by a infinitely distant static observer O.
   7
     Similarly, the radial-motion solutions (7.37)-(7.38), which allow for r → +∞, have E ≥ µ, while the solution
(7.39), which is relevant for a free fall from rest, has E < µ.
226           Geodesics in Schwarzschild spacetime: generic and timelike cases
                            ε
                      1.2
1.15
1.1
1.05
0.95
                                    2       4          6            8             10           12
                                                                                                    r/m
      Figure 7.8: Specific conserved energy ε = E/µ on circular orbits as a function of the orbit circumferential
      radius r. The dashed part of the curve corresponds to unstable orbits, while the solid part corresponds to stable
      ones. The horizontal red line ε = 1 marks the limit of bound orbits.
1.04
1.02
0.98
0.96
0.94
                      0.92
                                        3   4           5           6             7            8
                                                                                                    `/m
      Figure 7.9: Circular orbits in the (ℓ, ε) plane. The solid (resp. dashed) curve corresponds to stable (resp.
      unstable) orbits. The ISCO is located at the cusp point.
                                             7.3 Timelike geodesics                                                 227
                                                          ΩISCO
                   m                rISCO = 6m                               TISCO          TP,ISCO
                                                           2π
           15 M⊙ (Cyg X-1)             133 km             147 Hz            6.80 ms         4.81 ms
         4.3 106 M⊙ (Sgr A*)        38.1 106 km       5.11 10−4 Hz       32 min 37 s      23 min 4 s
                                     (0.255 au)
           6 109 M⊙ (M87*)             355 au         3.66 10−7 Hz         31 d 15 h        22 d 9 h
Table 7.2: Values of various quantities at the ISCO for masses m of some astrophysical black holes (see Table 7.1
for details): areal radius r, orbital frequency ΩISCO /(2π), orbital period seen from infinity TISCO and orbital
period measured by the orbiting observer/particle TP,ISCO (proper time).
Proof. Suppose that O is located at fixed coordinates (r, θ, φ) = (rO , π/2, 0) with rO ≫ m and
that P emits a photon at the event (t1 , r, π/2, 0) along a radial null geodesic. This photon is
received by O at t = t′1 . After one orbit, at the event (t2 , r, π/2, 2π), P emits a second photon
in the radial direction, which is received at t = t′2 by O. According to the definition (7.58) of Ω,
we have
                                         2π = Ω(t2 − t1 ).
On the other hand, since rO ≫ m, the proper time of O is t, so that the angular velocity
measured by O is
                                           2π
                                   ΩO = ′         .
                                         t2 − t′1
Now, since t is the coordinate associated to the spacetime invariance by time translation
(stationarity), we have necessarily t′2 − t2 = t′1 − t1 (the increment in coordinate t for the
second signal is the same as for the first one), so that t′2 − t′1 = t2 − t1 . Accordingly, the above
two equations combine to ΩO = Ω.
      Remark 4: Ω is not the orbital angular frequency experienced by the particle/observer P on the
      circular orbit L , because the proper time of P is τ and not t. The actual orbital frequency measured by
      P is
                                                         dt
                                                  ΩP =      Ω = ut Ω,
                                                         dτ
      with ut = dt/dτ obtained from (7.34) and (7.55): ut = r/(r − 3m). Hence
                                                              p
                                                r                  r
                                                      r          1      m
                                        ΩP =               Ω=±                .                          (7.61)
                                                   r − 3m        r r − 3m
                                                                         √
      Note that |ΩP | > |Ω|; in particular, at the ISCO (r = 6m), ΩP = 2Ω. The orbital period measured by
      P is TP = 2π/|ΩP |. Some ISCO values of TP for astrophysical black holes are provided in Table 7.2.
          At the ISCO, r = 6m and formula (7.60) yields (for orbits with positive ℓ)
                                                         1
                                                ΩISCO = √   .                                           (7.62)
                                                       6 6m
      Numerical values of ΩISCO (actually the frequency ΩISCO /(2π), which is more relevant from
      an observational point of view) are provided in Table 7.2.
      The first condition ensures that the effective potential Vℓ (r) takes the shape of a well in the
      region r > 2m (cf. Fig. 7.10) and the second one that the particle P is trapped in this well.
      Indeed, 0 < ε < 1 makes the right-hand of Eq. (7.24) negative, so that the region r → +∞,
      where Vℓ (r) → 0, cannot be reached. We have also argued in Sec. 7.3.3 that ε < 1 is forbidden
      in the region r → +∞ on physical grounds [cf. the discussion below Eq. (7.57)].
           In the potential well, the r-coordinate along L varies between two extrema: a minimum
      rper , for periastron (or pericenter or periapsis), and a maximum rapo , for apoastron (or
      apocenter or apoapsis) (cf. Figs. 7.10 and 7.11). Being extrema of r(τ ), the values of rper and
      rapo are obtained by setting dr/dτ = 0 in Eq. (7.24), which leads to
                                                          ℓ2
                                                          
                                               2m
                                           1−          1 + 2 = ε2 .                              (7.64)
                                                r         r
      This is a cubic equation in r−1 , which has three real positive roots, corresponding to the three
      intersections of the curve Vℓ (r) with the horizontal line at (ε2 − 1)/2 in Fig. 7.10. However,
      the smaller root has to be disregarded as a periastron since it would lead to a motion with
      Vℓ (r) > (ε2 − 1)/2, which is forbidden by Eq. (7.24).
          We get, from Eqs. (7.24)-(7.25),
                                           s                           
                                   dr                     2m           ℓ2
                                               2
                                        =± ε − 1−                 1+ 2 .                          (7.65)
                                   dτ                      r           r
                                                      7.3 Timelike geodesics                                        229
0.03
0.02
0.01
                             0
               V` ( r )
-0.01
                          -0.02
             1 (ε 2 − 1)
             2
                     -0.03
                          -0.04
                                  0         5
                                                      rper10         15
                                                                     r/m
                                                                               20    25
                                                                                      rapo
                                                                                                 30
Figure 7.10: Effective potential Vℓ (r) for ℓ = 4.2m (one of the values displayed in Figs. 7.1 and 7.2). The
horizontal red line marks Vℓ (r) = (ε2 − 1)/2 with ε = 0.973, leading to rper = 9.058 m and rapo = 25.634 m.
The corresponding orbit is shown in Fig. 7.11.
y/m
20
10
                                      -20       -10                       10    20
                                                                                     x/m
-10
-20
Figure 7.11: Timelike geodesic with ε = 0.973 and ℓ = 4.2m (same values as in Fig. 7.10), plotted in terms of
the coordinates (x, y) := (r cos φ, r sin φ). The dotted circles correspond to r = rper (periastron) and r = rapo
(apoastron). The grey disk indicates the black hole region r < 2m. [Figure produced with the notebook D.4.6]
230           Geodesics in Schwarzschild spacetime: generic and timelike cases
                              y/m                                                       y/m
                             15
                             10
                                                                                        50
        -15     -10    -5              5      10     15
                                                          x/m                  -50                    50
                                                                                                                    x/m
-5
                                                                                        -50
                             -10
-15
      Figure 7.12: Timelike geodesics with the same value of ℓ as in Fig. 7.11 (ℓ = 4.2m), but for different values of ε:
      ε = 0.967 (left) and ε = 0.990 (right). Note that the left and right figures have different scales. [Figure produced
      with the notebook D.4.6]
          The equation governing the trajectory of L in the orbital plane is Eq. (7.17), which can be
      recast as                      r
                            du                     m 2      mε 2  m 2
                               = ± 2u3 − u2 + 2          u+           −         .               (7.66)
                            dφ                      ℓ           ℓ          ℓ
      Let us recall that u := m/r and that methods for solving this differential equation have been
      briefly discussed in Sec. 7.2.3.
      Remark 5: Far from the black hole, i.e. in the region r ≫ m, one can easily recover the Newtonian
      orbits from Eq. (7.66). Indeed, according to Eq. (7.31), ε = 1 + ε0 , where ε0 = v 2 /2 − m/r is the
      Newtonian mechanical energy per unit mass. It obeys |ε0 | ≪ 1, so that ε2 ≃ 1 + 2ε0 . Moreover, for
      r ≫ m, u ≪ 1 and we can neglect the u3 term in front of the u2 one in Eq. (7.66). Hence Eq. (7.66)
      reduces to                             r  
                                     du           m 2                m 2
                                        ≃± 2            ε0 − u2 + 2         u.                       (7.67)
                                     dφ            ℓ                  ℓ
      Let us introduce the constants
                                                                r
                                          ℓ2                           ε0 ℓ2
                                     p :=        and      e := 1 + 2 2 .                             (7.68)
                                          m                             m
      Then Eq. (7.67) can be rewritten as
                                                                     p
                                                   dφ        me
                                                      = ±r          ,
                                                   du        p
                                                                u−1 2
                                                          1−  m
                                                                e
                                            p
where φ0 is a constant. We have then        mu   = 1 + e cos(φ − φ0 ), or equivalently,
                                                           p
                                             r=                                                                (7.69)
                                                   1 + e cos(φ − φ0 )
Assuming a bound orbit, we have ε < 1, which implies ε0 < 0 and, via Eq. (7.68), e < 1. We recognize
then in (7.69) the equation of an ellipse of eccentricity e and semi-latus rectum p. Hence Keplerian orbits
are recovered for r ≫ m, as they should.
   Generic bound orbits differ from the Keplerian ellipses by the fact that the variation of φ
between two successive periastron passages, i.e. two events along the worldline of P for
which r = rper , is strictly larger than 2π. This phenomenon is called periastron advance and
causes the orbits to be not closed, as illustrated in Figs. 7.11 and 7.12.
Historical note : The equations of geodesic motion (7.11)-(7.13), as well as Eq. (7.17) for the trajectories
in the orbital plane, have been first given by Karl Schwarzschild himself in January 1916 in the very
same article [449] in which he presented his famous solution8 . Schwarzschild discussed only the weak
field limit of these equations, to recover the Mercury’s perihelion advance computed by Einstein.
     It is quite remarkable that the general solution to the geodesic motion in Schwarzschild spacetime9
has been given as early as May 1916 by Johannes Droste, in the same article [175] in which he derived
the Schwarzschild solution, independently of Karl Schwarzschild (cf. historical note on p. 187). Droste
derived the equations of geodesic motion in Schwarzschild-Droste coordinates (t, r, θ, φ), using t as the
parameter along the geodesics, as well as the equation governing the trajectories in the orbital plane10 .
He gave the solutions for the trajectories in terms of the Weierstrass elliptic function ℘ (cf. Sec. 7.2.3). He
classified the solutions in terms of the roots of the cubic polynomial in v that appears in the right-hand
side of Eq. (7.19). Droste noticed that it takes an infinite amount of coordinate time t for a particle to
reach r = 2m (cf. left panel of Fig. 7.4) and he concluded incorrectly that “a moving particle outside the
sphere r = 2m can never pass that sphere”. He missed that this is only a coordinate effect, reflecting
the pathology of Schwarzschild-Droste coordinates at the horizon. One shall keep in mind that in 1916,
general relativity was just in its infancy and disentangling coordinate artefacts from physical effects
was not so obvious, especially regarding time. One can be amused by the fact that the first analysis of
the static black hole of general relativity made the black hole appear, not as an object from which no
particle may escape, but as an object into which no particle may penetrate...
     Finally, it is worth mentioning two early detailed studies of geodesic motion of massive particles in
Schwarzschild spacetime: one by by Carlo De Jans in 1923 [160] and other one by Yusuke Hagihara in
1931 [249]. For a detailed account about the history of geodesic motion in Schwarzschild spacetime see
Ref. [181].
   8
     Equation (7.17) for the trajectories is Eq. (18) in the article [449], the link between Schwarzschild’s notations
and ours being R = r, x = u/m, c = L, 1 = E, h = µ2 .
   9
     More precisely: in the MI region of Schwarzschild spacetime.
  10
     The link between Droste’s notations and ours is α = 2m, x = 2u, z = 2u − 1/3, A = µ2 /E 2 and B = L/E.
232   Geodesics in Schwarzschild spacetime: generic and timelike cases
Chapter 8
Contents
        8.1   Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233
8.1     Introduction
Having investigated the properties of generic causal geodesics in Schwarzschild spacetime in
Chap. 7, we focus here on null geodesics. The main interest of null geodesics is of course that
they are the carriers of information from the surroundings of the black hole to some observer,
in particular in the form of images. We start by studying the general property of null geodesics
in Sec. 8.2, distinguishing the radial geodesics, from the non-radial ones and focussing on
the r-motion. Then in Sec. 8.3, we study of the φ-motion and show that the trajectory of a
given null geodesic in the equatorial plane is fully integrable, via elliptic integrals. In Sec. 8.4
we compute the asymptotic direction taken by a null geodesic emitted at a given point; this
prepares the discussion of images in Sec. 8.5. The study of black hole images has received a
tremendous boost after the release of the first observed image by the Event Horizon Telescope
collaboration in 2019, that of the black hole M87* [6]. However, we differ the discussion of this
image to the chapter regarding null geodesics around a rotating black hole (Chap. 12), since
the black hole spin plays some role in the images and M87* is expected to be a fast rotator.
234                 Null geodesics and images in Schwarzschild spacetime
                                             dφ   L
                                                = 2 ,                                                (8.2)
                                             dλ  r
                                       2
                                            L2
                                                      
                                       dr           2m
                                           + 2 1−        = E2 .                                      (8.3)
                                       dλ   r        r
      Let us recall that the variable λ with respect to which these differential equations hold is the
      affine parameter associated with the 4-momentum p of P: p = dx/dλ [Eq. (7.9)] and that λ
      increases towards the future, p being always future-directed (cf. Sec. 7.2.2).
r = ±Eλ + r0 , (8.5)
      where r0 is some constant. Moreover, writing dt/dλ = dt/dr × dr/dλ and combining Eqs. (8.1)
      and (8.4), we get
                                                         −1
                                        dt             2m
                                            =± 1−             .                             (8.6)
                                        dr              r
                                8.2 Main properties of null geodesics                                      235
Remark 1: Since λ is an affine parameter and E is constant, Eq. (8.5) shows explicitly that r is another
affine parameter along radial null geodesics of Schwarzschild spacetime — a feature that we had already
obtained in Sec. 6.3.1.
                                            out,H
Remark 2: The radial null geodesics L(θ,φ)         discussed in Sec. 6.3.5 have L = 0, but they are not
recovered from Eq. (8.6), since the latter is based on Schwarzschild-Droste coordinates (t, r, θ, φ) and
      out,H
the L(θ,φ)  ’s are the null geodesic generators of the event horizon H , which does not lie in the
Schwarzschild-Droste domain.
   Some radial null geodesics of Schwarzschild spacetime are plotted in Fig. 6.1 in terms of the
Schwarzschild-Droste coordinates and in Fig. 6.3 in terms of the ingoing Eddington-Finkelstein
coordinates.
λ̃ := |L|λ , (8.8)
instead of λ, the latter being the affine parameter associated with the 4-momentum p via
Eq. (7.9). Since L is a non-vanishing constant along a geodesic, the above formula does define a
new affine parameter. The absolute value ensures that λ̃ is increasing towards the future, as λ,
whatever the sign of L. Note that λ̃ has the same dimension as L, i.e. a squared length, given
that λ is dimensionless (cf. Sec. 7.2.2). In terms of λ̃, the differential system (8.1)-(8.3) becomes
                                                      −1
                                      dt            2m
                                          =b −1
                                                 1−        ,                                       (8.9)
                                      dλ̃            r
                                            dφ    ϵL
                                                = 2 ,                                             (8.10)
                                            dλ̃   r
                                        2
                                        dr              1
                                              + U (r) = 2 ,                                       (8.11)
                                        dλ̃            b
where
                                                     |L|
                                              b :=       ,                                        (8.12)
                                                      E
                                                  L
                                         ϵL :=       = sgn L                                      (8.13)
                                                 |L|
236                 Null geodesics and images in Schwarzschild spacetime
      and                                                      
                                                     1       2m
                                            U (r) := 2    1−      .                                   (8.14)
                                                    r         r
      As for the timelike case (cf. Eq. (7.24)), we note that Eq. (8.11) has the shape of the first integral
      of the 1-dimensional motion of a non-relativist particle in the effective potential U (r). The
      main difference is that U (r) does not depend on L, contrary to the effective potential Vℓ (r) for
      timelike geodesics. Actually, U (r) is a function of r and the black hole mass m only. We thus
      arrive at
      Proof. Let us consider that the massless particle P arises from a point of coordinates θ = π/2,
      r = r0 ≫ m and φ = φ0 . Let us introduce Cartesian coordinates (x, y) in the plane θ = π/2:
      x := r cos φ and y := r sin φ. Without any loss of generality, we may consider |φ0 | ≪ 1.
      For large values of r, the motion of P is then essentially along the x-axis, with φ remaining
      small: y ≃ y0 , where y0 = r0 sin φ0 ≃ r0 φ0 . The quantity |y0 | is the impact parameter. Since
      y = r sin φ ≃ rφ, we get from y ≃ y0 the following relation
                                                  y0
                                            φ≃            (r ≫ m).
                                                  r
      Deriving with respect to t leads to
                                      dφ   y0 dr  y0
                                         ≃− 2    ≃ 2             (r ≫ m),                             (8.15)
                                      dt   r dt   r
      where we have used dr/dt = −1 for r ≫ m. This last property is easy to admit since r ≫ m
      corresponds to the flat part of spacetime; however, it can be derived rigorously by combining
      Eqs. (8.9) and (8.11) to evaluate dr/dt. Now by combining Eqs. (8.9) and (8.10), we have
                                                              
                                          dφ       b        2m
                                              = ϵL 2 1 −          .                            (8.16)
                                           dt     r          r
      For r ≫ m, the right-hand side reduces to ϵL b/r2 , so that the comparison with (8.15) yields
      ϵL b = y0 , i.e. b = |y0 |, which proves that b is the impact parameter of the massless particle P
      with respect to the central black hole.
         The effective potential U (r) is plotted in Fig. 8.1. It has no minimum and a maximum at
      r = 3m, which is
                                                          1
                                              Umax =          .                             (8.17)
                                                       27m2
                                              8.2 Main properties of null geodesics                                         237
                         0.05                                                                   3
                                                                                                2
                         0.04
                                         4
                         0.03
            m 2 U( r )
                                                                                                1
                         0.02
0.01
                         -0.01
                                 0                 5 rper           10              15                  20
                                                                    r/m
Figure 8.1: Effective potential U (r) (rescaled by m2 to make it dimensionless) governing the r-part of the
motion along a null geodesic in Schwarzschild spacetime [Eq. (8.14)]. The vertical dashed line marks r = 2m,
i.e. the location of the event horizon. The horizontal lines marked “1” to “4” correspond to the r-motion of four
null geodesics (cf. Sec. 8.2.4 for details); their trajectories in the equatorial plane are depicted in Fig. 8.2. [Figure
produced with the notebook D.4.7]
                                                            y/m
                                                            6
                                                                             1
                                     3                      4            2
                                                            2
                                                                                                    x/m
              -10                        -5                                  5                10
                                                        -2
                                                                4
                                                        -4
Figure 8.2: Selected null geodesics in the equatorial plane of Schwarzschild spacetime, plotted in terms of the
coordinates (x, y) := (r cos φ, r sin φ), with the black hole region r < 2m depicted as a grey disk. The green
geodesic is the photon circular orbit at r = 3m. The red geodesic (label “1”) starts at r0 = 10m, φ0 = 0, with
the impact parameter b = 7.071 m (b−2 = 0.02 m−2 ). The brown geodesic (label “2”) starts at r0 = 10m, φ0 = 0
with b = 5m (b−2 = 0.04 m−2 ). The orange geodesic (label “3”) starts outward at r0 = 2.1m, φ0 = 0 with
b = 4.663m (b−2 = 0.046 m−2 ). The grey geodesic (label “4”) starts outward at r0 = 2.4m, φ0 = −π/2 with
b = 5.345m (b−2 = 0.035 m−2 ). The r-motion of these four gedeosics is depicted in Fig. 8.1. [Figure produced
with the notebook D.4.8]
238                     Null geodesics and images in Schwarzschild spacetime
               Actually, dr/dλ̃ changes sign at r = rp and the photon subsequently moves away from
               the black hole for ever (cf. geodesic no. 1 in Fig. 8.2). We may call such a worldline a
               scattering trajectory.
         • if rem < 3m (e.g. trajectory no. 4 in Fig. 8.1) or b is small enough to fulfill b−2 > Umax (e.g.
           trajectory no. 2 in Fig. 8.1) the photon is not halted by the potential barrier constituted
           by U (r). It then reaches arbitrarily small values of r and is eventually absorbed by the
           black hole (r < 2m) (cf. geodesic no. 2 in Fig. 8.2).
         For an initially outward photon, i.e. a photon emitted with dr/dλ̃ > 0, one has necessarily
      rem > 2m according to the result obtained in Sec. 7.2.2 and there are then two possible
      outcomes:
         1
             unless one counts as “bound” a worldline that terminates in the black hole region
                                     8.2 Main properties of null geodesics                              239
   • if rem > 3m (e.g. trajectory no. 1 in Fig. 8.1) or b is small enough to fulfill b−2 > Umax (e.g.
     trajectory no. 3 in Figs. 8.1), the photon escapes to infinity (cf. geodesic no. 3 in Fig. 8.2);
   • if 2m < rem < 3m and b−2 < Umax (e.g. trajectory no. 4 in Fig. 8.1), the photon “bounces”
     on the left side of the potential barrier, reaching a maximal value ra of r — the apoastron,
     which is given by U (ra ) = b−2 , or equivalently by
         The photon moves subsequently towards the black hole and is absorbed by it (cf. geodesic
         no. 4 in Fig. 8.2).
   The critical value of the impact parameter b separating the cases discussed above is deter-
mined by b−2
          c = Umax . Given the value (8.17) of Umax , we get
                                               √
                                         bc = 3 3 m ≃ 5.196152 m .                            (8.21)
  Along any null geodesic of Schwarzschild spacetime, the areal coordinate r either is a
  monotonic function or has a single turning point. In the latter case, if the turning point
  corresponds to a minimum of r (a periastron, which can occur only for r > 3m), the null
  geodesic escapes to infinity, while if the turning point corresponds to a maximum of r (an
  apoastron, which can occur only for 2m < r < 3m), the null geodesic terminates at the
  central singularity (r = 0).
Note that for r < 2m, i.e. in the black hole region, r is always a monotonic function, since
it has been demonstrated in Sec. 7.2.2 that r(λ) is strictly decreasing. The impossibility of a
turning point for r < 2m can also be graphically inferred from Fig. 8.1, which shows that
the effective potential U (r) is negative for r < 2m, preventing the turning point condition
U (r) = b−2 to hold.
    For b > bc , the explicit expression of the periastron radius rp (or the apoastron radius ra )
is obtained by solving the cubic equation (8.18) (or (8.20), which is the same cubic equation,
except for the range of the solution). Fortunately Eq. (8.18) is a depressed2 cubic equation,
which makes it simpler to solve. For b > bc , its discriminant −(4p3 + 27q 2 ) is positive, which
implies that it admits three distinct real roots. Two of them are positive and are precisely rp
and ra . The third root is negative, since the product of the roots is −2mb2 < 0; it has therefore
no physical significance. The roots of the generic depressed cubic equation x3 + px + q = 0
can be expressed via Viète’s formulas:
                     r                      r                
                          p       1           3q     3      2kπ
             xk = 2 − cos           arccos         −     +        ,     k ∈ {0, 1, 2}.       (8.22)
                          3       3           2p     p       3
   2
       A depressed cubic equation is a polynomial equation of the type x3 + px + q = 0.
240                   Null geodesics and images in Schwarzschild spacetime
                                                      rper
                                              8       rapo
                                              7
                                              6
                                        r/m
                                              5
                                              4
                                              3
                                              2
                                                  5      6   7     8      9     10
                                                              b/m
      Figure 8.3:     Radial coordinate of the periastron, rp , and of the apoastron, ra , along null geodesics with
      b > bc ≃ 5.196 m. Note that a given null geodesic has either a periastron or an apoastron, but not both. [Figure
      produced with the notebook D.4.9]
      In the present case, p = −b2 , q = 2mb2 and rp (resp. ra ) corresponds to k = 0 (resp. k = 2), so
      that we obtain
                                                                  
                                          2b      π 1               bc
                                    rp = √ cos      − arccos              .                      (8.23)
                                           3      3 3                b
                                                                
                                            2b      5π 1         bc
                                       ra = √ cos      − arccos      .                                         (8.24)
                                             3       3  3        b
                                                                           √
      As a√check, we note that for b ≫ bc , Eq. (8.23) yields rp ≃ 2b/ 3 cos (π/3 − 1/3 × π/2) ≃
      2b/ 3 cos(π/6) ≃ b, as expected. Indeed, b ≫ bc implies b ≫ m, so that the photon stays far
      from the black hole, a regime in which the impact parameter coincides with the distance of
      closest approach: b ≃ rp .
          The variation of rp and ra with b is plotted in Fig. 8.3. Note that rp = ra = 3m (the photon
      orbit) at the limit b = bc .
      Remark 3: It is clear on Fig. 8.3 that, for the same value of b, ra ≤ rp , which may seem contradictory
      with the apoastron corresponding to the largest distance from the center and the periastron to the
      distance of closest approach. However, one shall keep in mind that the apoastron and the periastron
      always refer to different null geodesics: geodesics with an apoastron lie below the photon sphere
      (r = 3m), while those with a periastron are always outside it. A null geodesic can of course cross the
      photon sphere (examples are geodesics 2 and 3 on Figs. 8.1 and 8.2), but then it has neither an apoastron
      nor a periastron.
                    8.3 Trajectories of null geodesics in the equatorial plane                           241
where u := m/r [Eq. (7.16)] and Pb (u) stands for the cubic polynomial
                                                             m2
                                     Pb (u) := 2u3 − u2 +       .                               (8.26)
                                                             b2
Remark 2: Of course, Eq. (8.25) can be recovered by combining Eqs. (8.10), (8.11) and (8.14).
    The graph of the polynomial Pb is shown in Fig. 8.4 for selected values of b. Note that along
a null geodesic, one must have u > 0 and Pb (u) ≥ 0. The first condition follows from the very
definition of u as m/r, while the second one is a direct consequence of Eq. (8.25).
    By Eq. (8.25), the zeros of the polynomial Pb correspond to points at which du/dφ = 0.
They are thus stationary points for u, and hence for r. This can only occur at the circular
photon orbit r = const = 3m (then b = bc ) or at the periastron or apoastron discussed in
Sec. 8.2.4 (then b > bc ). The zeros of Pb are governed by its discriminant, which is
                                             m4 b2
                                                         
                                     ∆=4 4           − 27 .                                 (8.27)
                                             b    m2
One may distinguish three cases:
                          √
  • ∆ > 0 ⇐⇒ b > 27m = bc : Pb has three distinct real zeros (cf. the green and
     cyan curves in Fig. 8.4); one of them, un say, is negative, and hence unphysical (since
     u := m/r > 0), while the two other ones are positive, being nothing but
                                        m                     m
                                  up =         and      ua = ,                         (8.28)
                                        rp                    ra
      where rp and ra are the periastron and apoastron radii given by Eqs. (8.23)-(8.24).
242                     Null geodesics and images in Schwarzschild spacetime
                                                 Pb (u)
                                                0.2
                            b=3m
                            b = 4pm
                            b=3 3 m            0.15
                            b=7m
                            b = 20 m            0.1
                                               0.05
                                                                                                u
                             -0.4      -0.2                  0.2      0.4       0.6       0.8
                                               -0.05
                                                -0.1
                                               -0.15
                                                -0.2
      Figure 8.4: Graph of the cubic polynomial Pb (u) = 2u3 − u2 + m2 /b2 [Eq. (8.26)]. [Figure produced with the
      notebook D.4.9]
          • ∆ < 0 ⇐⇒ b < bc : Pb has one real negative zero, un , and two complex zeros (cf. blue
            and magenta curves in Fig. 8.4), so that Pb (u) never vanishes for physical values of u,
            which are real positive.
          Since they will be required in what follows, let us find explicit expressions for the real zeros
      of the polynomial Pb . The equation Pb (u) = 0 can be brought to a depressed form (no square
      term) by the change of variable u = x + 1/6. We get
                                                       1    m2    1
                                              x3 −        x+ 2 −     = 0.                                  (8.29)
                                                       12   2b   108
Zeros of Pb for b ≥ bc
      For b > bc , we can express the solutions of Eq. (8.29) via Viète’s formulas (8.22) with p = −1/12
      and q = m2 /(2b2 ) − 1/108. We then get the zeros of Pb as
                                                 b2c
                                                        
                            1      1                   2kπ    1
                        uk = cos     arccos 1 − 2 2 +        + ,                  k ∈ {0, 1, 2}.
                            3      3             b      3     6
                      8.3 Trajectories of null geodesics in the equatorial plane                                  243
                                ua
                      0.4       up
                                un
0.2
u 0
-0.2
-0.4
                            0        2       4      6       8       10       12      14
                                                        b/m
Figure 8.5: Real zeros un , up and ua of the cubic polynomial Pb (u) = 2u3 − u2 + m2 /b2 as functions of b. The
vertical dashed line marks the critical value b = bc ≃ 5.196 m. [Figure produced with the notebook D.4.9]
Using the identity arccos(1 − 2x2 ) = 2 arcsin x and noticing that u1 < 0, which implies
u1 = un , and u0 ≥ u2 , which implies u0 = ua and u2 = up , we arrive at
                                                   
                           1      2         bc     2π     1
                    un = cos        arcsin      +       +         (b ≥ bc )       (8.30a)
                           3      3         b       3     6
                                                  
                            1      2         bc   4π     1
                        up = cos     arcsin     +      +                     (b ≥ bc )                 (8.30b)
                            3      3         b     3     6
                                            
                            1      2         bc    1
                        ua = cos     arcsin      +                    (b ≥ bc ).                       (8.30c)
                            3      3         b     6
These zeros are plotted in terms of b on Fig. 8.5. Note the ordering
                               1                      1          1
                             − ≤ un < 0 < up ≤ ≤ ua < ,                                                 (8.31)
                               6                      3          2
with the inequalities ≤ being saturated for b = bc . Note also that
                                                                                    1
                    lim un = 0,             lim up = 0,         and         lim ua = .                  (8.32)
                   b→+∞                    b→+∞                            b→+∞     2
    It is easy to express two of the zeros in terms of the third one. For instance, let us pick up .
From Pb (up ) = 0, we have immediately m2 /b2 = −2u3p + u2p . If we substitute this expression
for m2 /b2 into Pb (u), we get
              Pb (u) = 0        ⇐⇒       2u3 − u2 − 2u3p + u2p = 0
                                         (u − up ) 2(u2 + up u + u2p ) − (u + up ) = 0
                                                                                 
                                ⇐⇒
                                         (u − up ) 2u2 + (2up − 1)u + up (2up − 1) = 0.
                                                                                   
                                ⇐⇒
244                    Null geodesics and images in Schwarzschild spacetime
      The two zeros different from up of this equation, namely un and ua , must then obey
                                       2u2 + (2up − 1)u + up (2up − 1) = 0.
      Solving this quadratic equation leads to the sought expressions:
                                                                      
                                    1
                                                   q
                              un =      1 − 2up − (1 − 2up )(1 + 6up )                                       (8.33a)
                                    4
                                                                      
                                    1
                                                   q
                              ua =      1 − 2up + (1 − 2up )(1 + 6up ) .                                     (8.33b)
                                    4
      The last limit results from the expansion of Eq. (8.34), which yields un ∼ −(2bc /b)2/3 /6 when
      b → 0. Observe also from Fig. 8.5 the smooth transition with the value of un for b ≥ bc , which
      is given by Eq. (8.30a).
      Historical note : As mentioned in the historical note on p. 231, the equations of geodesic motion in
      Schwarzschild metric have been first derived by Karl Schwarzschild in January 1916 [449] and Johannes
      Droste in May 1916 [175], but these two authors focussed on the timelike case (orbit of a massive particle
      or a “planet”). It seems that the first explicit writing of the equations governing null geodesics is due to
      Ludwig Flamm in September 1916 [197]. In particular, Flamm derived3 Eqs. (8.1), (8.2) and a combination
      of Eqs. (8.1) and (8.3). He also derived Eq. (8.25) governing the trajectories in the equatorial plane, as well
      as Eq. (8.18) giving the periastron. Regarding the solutions of these equations, he got only approximate
      ones, to the second order in m/b and recover Einstein’s famous result for the deflection of light by the
      Sun. For details about early studies of null geodesics in Schwarzschild spacetime, see Ref. [181].
         3
             The link between Flamm’s notations and ours is α = 2m, R = r, x = u/m and ∆ = b.
                        8.3 Trajectories of null geodesics in the equatorial plane                                   245
                                                 r/m
                                                10
                                                                                       ϕ − ϕ0
                   -8      -6      -4      -2−ϕ∗       ϕ∗ 2        4         6     8
Figure 8.6:
         √ r as a function of φ along a null geodesic with an impact parameter equal to the critical one:
b = bc = 3 3 m; the blue curve is for a critical null geodesic with r > 3m [Eq. (8.40)], while the red one regards
r < 3m [Eq. (8.42)]. [Figure produced with the notebook D.4.10]
                                                √ 
                                   
                                        d
                                     2    artanh x                  for x ∈ (0, 1)
                                   
                                   
                            1      
                                       dx
                               √ =                                                                         (8.37)
                        (1 − x) x      d       √ 
                                   2
                                         arcoth x                 for x ∈ (1, +∞).
                                       dx
We thus perform the change of variable x = 2u + 1/3 and treat separately two cases : u <
1/3 ⇐⇒ x ∈ (1/3, 1) and u > 1/3 ⇐⇒ x ∈ (1, +∞). We call geodesics in the first case
external critical null geodesics, since u < 1/3 ⇐⇒ r > 3m, and those in the second
case internal critical null geodesics, since u > 1/3 ⇐⇒ r < 3m. Note that the qualifiers
external and internal refer to the photon sphere r = 3m discussed in Sec. 8.2.3, and not to the
black hole region (r < 2m).
For u < 1/3, x ∈ (1/3, 1), so that the first line of Eq.(8.37) is relevant and Eq. (8.36) is integrated
to                                                 r
                                                            1
                                φ = ±2 artanh 2u + + φ0 ,                                         (8.38)
                                                            3
where φ0 is some integration constant. This relation is easily inverted to
                                                                  
                                           1              φ − φ0        1
                                        u = tanh2                      − .                                 (8.39)
                                           2                2           6
246                  Null geodesics and images in Schwarzschild spacetime
                                  y/m
                                   5
                                   4
                                   3
                                   2
                                   1
                                                                                            x/m
                                                       5                 10
                                  -1
                                  -2
                                  -3
      Moving back to r = m/u, we obtain the equation of the external critical null geodesic in polar
      form:
                                                  2m
                                      r=       2 φ−φ0
                                                         1 .                                  (8.40)
                                           tanh     2
                                                         −3
      The constant φ0 can be related to the asymptotic value φ∞ of φ when r → +∞ by setting
      u = 0 in Eq. (8.39); we get, using the identity artanh x = 1/2 ln[(1 + x)/(1 − x)],
                                                             √      !
                                                               3+1
                        φ0 = φ∞ ± φ∗ , with φ∗ := ln √                 ≃ 1.316958.        (8.41)
                                                               3−1
           The function r(φ), as given by Eq. (8.40), is depicted on Fig. 8.6 (blue curve). The region
      φ0 − φ∗ < φ < φ0 + φ∗ is excluded, since Eq. (8.40) would yield r < 0. For φ > φ0 + φ∗ , r(φ)
      is a decaying function, which corresponds to the plus sign in Eq. (8.38) and to the minus sign in
      Eq. (8.41): φ0 = φ∞ − φ∗ . Figure 8.7 shows such a null geodesic with φ∞ = 0, which implies
      φ0 = −φ∗ and φ > 0. When φ → +∞, the geodesic rolls up indefinitely onto the photon orbit
      discussed in Sec. 8.2.3; this behavior corresponds to the horizontal asymptote at r = 3m in
      the right part of Fig. 8.6. Note that the geodesic approaches very fast the photon orbit, only
      a single path round to it being graphically visible in Fig. 8.7. This is because the asymptotic
      expansion of relation (8.40) is r ∼ 3m(1 + 6e−φ ) when φ → +∞.
           It is worth stressing that Fig. 8.7 describes both (i) the trace in the (x, y)-plane of a geodesic
      with L > 0 (so that φ increases towards the future, cf. Eq. (8.2)) arising from r → +∞ and
      spiralling inwards to the photon orbit and (ii) the trace of a geodesic with L < 0 (so that φ
      decays towards the future) arising from r = rem > 3m, φ = φem > 0, spiralling outwards
      and escaping to r → +∞ as φ → 0. Had we restored the t dimension perpendicular to the
      (x, y)-plane in a 3d plot, these two geodesics would have clearly appeared distinct.
           On the contrary, for φ < φ0 − φ∗ , r(φ) is an increasing function, as it is clear on the left
      part of the blue curve in Fig. 8.6. This corresponds to the minus sign in Eq. (8.38) and to the
      plus sign in Eq. (8.41): φ0 = φ∞ + φ∗ . A null geodesic of this type is depicted in Fig. 8.8. It
                          8.3 Trajectories of null geodesics in the equatorial plane                                        247
                                 y/m
                                  3
                                  2
                                  1
                                                                                                        x/m
                                                     5                           10
                                 -1
                                 -2
                                 -3
                                 -4
                                 -5
Figure 8.8: Same as Fig. 8.7, but for φ0 = φ∗ and φ ∈ (−∞, 0) (left branch of the blue curve in Fig. 8.6). [Figure
produced with the notebook D.4.10]
                                 y/m                                                      y/m
                                 3                                                        3
2 2
1 1
                                                    x/m                                                      x/m
                -3   -2     -1         1   2    3                  -3       -2       -1         1   2    3
                                 -1                                                       -1
-2 -2
-3 -3
                                                                                                         √
Figure 8.9: Trace in the equatorial plane of two internal critical null geodesics (b = 3 3 m ≃ 5.196m and
r < 3m). They both obey Eq. (8.42) with φ0 = 0. The left panel corresponds to φ ∈ (−∞, 0) (left part of the red
curve in Fig. 8.6), while the right one is for φ ∈ (0, +∞) (right part of the red curve in Fig. 8.6). [Figure produced
with the notebook D.4.10]
has φ∞ = 0, so that φ0 = φ∗ and φ < 0. As for Fig. 8.7, the curve depicted in Fig. 8.8 can be
interpreted as the trace in the (x, y)-plane of two distinct geodesics: one with L < 0 arising
from r → +∞ and spiralling inwards to the photon orbit when φ → −∞ and another one
with L > 0 arising from r = rem > 3m, φ = φem < 0, spiralling outwards and escaping to
r → +∞ as φ → 0.
Let us now consider the “internal” case: r < 3m ⇐⇒ u > 1/3. This implies x ∈ (1, +∞) in
Eq. (8.37), so that Eq. (8.36) is integrated to
                                                              2m
                                               r=         2   φ−φ0
                                                                                1
                                                                                     .                             (8.42)
                                                    coth       2
                                                                            −    3
248                  Null geodesics and images in Schwarzschild spacetime
      Again, φ0 is an integration constant. But this time, it cannot be determined by the value of φ
      when r → +∞ since we are in the case r < 3m. The function r(φ) given by Eq. (8.42) is plotted
      as the red curve in Fig. 8.6. Contrary to external critical null geodesics, there is no exclusion
      interval for φ. However, φ cannot range from −∞ to +∞ along a given geodesic. Indeed, for
      φ = φ0 , one gets r = 0, which corresponds to the curvature singularity of the Schwarzschild
      black hole. This is necessarily a termination point for any geodesic. The maximum range of φ
      along an internal critical null geodesic is therefore either (−∞, φ0 ) or (φ0 , +∞).
          An internal critical null geodesic with φ0 = 0 and φ ∈ (−∞, 0) is depicted in the left
      panel of Fig. 8.9. For φ → −∞, it rolls up indefinitely onto the photon orbit (r = 3m) from
      below. Again the approach to the photon orbit is exponentially fast, the asymptotic expansion
      of relation (8.42) being r ∼ 3m(1 − 6eφ ) when φ → −∞. The curve plotted in Fig. 8.9 actually
      represents the trace in the equatorial plane of two distinct geodesics: (i) a geodesic with L > 0
      arising from r = rem < 3m, φ = φem < 0 and inspiralling to the central singularity as φ → 0
      and (ii) a geodesic with L < 0 arising from r = rem ∈ (2m, 3m), φ = φem < 0 and spiralling
      outward to the photon orbit as φ → −∞. Note that for (ii), the emission point must fulfill
      rem > 2m, i.e. must lie outside the black hole region. Indeed, as shown in Sec. 7.2.2, along any
      geodesic, r must decrease towards the future in the black hole region. Hence no null geodesic
      can be spiralling outwards there.
      Remark 3: Actually, an outward spiralling critical null geodesic can emerge from the region r < 2m
      when the latter corresponds to the white hole region in the extended Schwarzschild spacetime that
      will be discussed in Chap. 9. This explains why nothing in Eq. (8.42) and Fig. 8.9 seems to prevent this
      behavior.
          The right panel of Fig. 8.9 corresponds to an internal critical null geodesic with φ0 = 0
      and φ ∈ (0, ∞). For φ → +∞, it rolls up indefinitely onto the photon orbit (r = 3m) from
      below. Again, the plotted curve describes two cases: (i) a geodesic with L < 0 arising from
      r = rem < 3m, φ = φem > 0 and inspiralling to the central singularity as φ → 0 and (ii) a
      geodesic with L > 0 arising from r = rem ∈ (2m, 3m), φ = φem > 0 and spiralling outward
      to the photon orbit as φ → +∞.
      Historical note : The photon circular orbit at r = 3m has been exhibited by David Hilbert in December
      1916 [277, 278]. Hilbert also discussed the critical null geodesics and their spirals around the photon orbit,
      interpreting the latter as a Poincaré limit cycle of the dynamical system governed by Eq. (8.25). Hilbert
      pointed out that ingoing null geodesics with an impact parameter b > bc are deflected (possibly looping
      around the photon orbit if b is close to bc ) and escape to infinity, while those with b < bc cross the photon
      orbit and terminate on “the circle r = 2m”. Hilbert claimed that null geodesics stop there, because
      their (coordinate!) velocity vanishes4 . We recover here the interpretation of the r = 2m (coordinate)
      singularity as an impenetrable sphere advanced by Droste while discussing timelike geodesics (cf. the
      historical note on p. 231). The distinction between coordinate effects and physical ones was definitely
      not clear in the early days of general relativity, even for great minds like Hilbert! The same analysis and
      conclusions are found in the general relativity treatise by Max von Laue published in 1921 [340].
         4
          The coordinate velocity for ingoing radial null geodesics is given by Eq. (8.6): dr/dt = −1 + 2m/r, which
      clearly vanishes at r = 2m.
                     8.3 Trajectories of null geodesics in the equatorial plane                              249
Since up is the function of b given by Eq. (8.30b), the above relation defines uniquely a function
of u and b, which we consider as a function of u parameterized by b. Note that, since u ≤ up
(by definition of the periastron), one has Φb (u) ≥ 0. To evaluate Φb (u), we rewrite it in terms
of the three zeros (un , up , ua ) of Pb :
                                      Z up
                                                        dū
                          Φb (u) =         p                              .                  (8.48)
                                        u   2(ū − un )(up − ū)(ua − ū)
Let us perform the change of variable5
                                 ū − un
                          t :=                ⇐⇒       ū = (up − un )t + un .
                                 up − un
   5
    It should be clear that the variable t introduced here has nothing to do with the Schwarzschild-Droste
coordinate t.
250                     Null geodesics and images in Schwarzschild spacetime
0.9
0.8
                              0.7
                         k
0.6
0.5
                                         6         8       10        12               14          16       18     20
                                                                        b/m
      Figure 8.10: Modulus k of the elliptic integrals F and K that are involved in expression (8.53) for Φb (u). [Figure
      produced with the notebook D.4.9]
      We get
                                                                Z       1
                                                       1                                   dt
                                    Φb (u) = p                                      p                    ,               (8.49)
                                              2(ua − un )            u−un
                                                                    up −un
                                                                                     t(1 − t)(1 − k 2 t)
      where k is the following constant:
                                    r                                       √
                                        up − un                                 2
                           k :=                 = q√                                                        (b > bc ),   (8.50)
                                        ua − un      3 cot          2
                                                                            arcsin      bc
                                                                                             
                                                                                                  +1
                                                                    3                    b
      the second equality following from the expressions (8.30) of un , up and ua in terms of b.
          We can simplify further the integral via a second change of variable:
                                                                 √
                                    t = sin2 ϑ ⇐⇒ ϑ = arcsin t.
      Note
       p that 0 < u ≤ up implies 0 < t ≤ 1, so that 0 < ϑ ≤ π/2. Since dt = 2 sin ϑ cos ϑ dϑ =
      2 t(1 − t) dϑ, we arrive immediately at
                                                           √        Z       π/2
                                                       2                                dϑ
                                         Φb (u) = √                                 p               ,                    (8.51)
                                                    ua − un          ϕb (u)          1 − k 2 sin2 ϑ
      where6
                                                                s
                                                                    u − un
                                             ϕb (u) := arcsin                                 (b > bc ).                 (8.52)
                                                                    up − un
         6
             Do no confuse the letter ϕ with the symbol used for the coordinate φ.
                          8.3 Trajectories of null geodesics in the equatorial plane                                        251
                                                    R π/2           R π/2       Rϕ
By splitting the integral according to                  ϕ
                                                              =         0
                                                                            −   0
                                                                                     , we rewrite (8.51) as
                                          √
                                       2
                          Φb (u) = √        [K(k) − F (ϕb (u), k)]                              (b > bc ),         (8.53)
                                    ua − un
where F (ϕ, k) is the incomplete elliptic integral of the first kind [79, 232, 3]:
                                                              Z     ϕ
                                                                            dϑ
                                          F (ϕ, k) :=                   p                                          (8.54)
                                                                0        1 − k 2 sin2 ϑ
The notation F (ϕ, k) is the most common one in the literature [79, 232], but one may encounter
as well F (ϕ|m) for F (ϕ, k) with m = k 2 [3]. The parameter k is called the modulus of the
elliptic integral. From its expression (8.50), we see that k is a function of b. It is plotted in
Fig. 8.10. Given the ordering (8.31) and the limits (8.32), we deduce from expression (8.50) that
with
                                       lim k = 1               and              lim k = 0.                         (8.57)
                                      b→b+
                                         c
                                                                                b→+∞
                                              un               π
                                         r
                              0 < arcsin           < ϕb (u) ≤ ,                                                    (8.58)
                                           un − up             2
                                                             b = 5.196153 m
                                                             b = 5.19616 m
                                 8                           b = 5.1962 m
                                                             b = 5.197 m
                                                             b = 5.2 m
                                                             b = 5.3 m
                                                             b = 6.0 m
                                 6                           b = 10.0 m
                                                             b = 20.0 m
                       Φ b (u)                               b = 100.0 m
                                 4
                                 0
                                     0   0.05      0.1       0.15        0.2     0.25       0.3
                                                                    u
      Figure 8.11: Function Φb (u) defined by Eq. (8.47) and evaluated via the elliptic integral expression (8.52)-(8.53),
      for selected values of b > bc . For each value of b, the range of u is (0, up ], with the inverse periastron radius
      up = up (b) given by Eq. (8.30b). [Figure produced with the notebook D.4.11]
      Remark 5: The trajectory of a null geodesic in the plane θ = π/2 as given by Eqs. (8.46) and (8.53)
      is of the form φ = φ(r) on each of the two arcs with respect to the periastron. One can invert
      this relation to express the trajectory in the polar form r = r(φ). This is performed thanks to the
      inverse of the incomplete elliptic integral F (ϕ, k), which is the Jacobi elliptic sine sn(x, k), defined by
      sin ϕ = sn(F (ϕ, k), k). We deduce then from Eqs. (8.60)-(8.61) that
                                                        √                     
                                       up − u                ua − un
                                                   2
                                               = k sn         √       Φb (u), k .                            (8.62)
                                       ua − u                   2
      We refer the reader to Refs. [80, 374] for more details.
   • L is outgoing: r increases all along L ; as shown in Sec. 7.2.2, this cannot happen in
     the region r < 2m (the black hole interior). Hence, we must have r varying from 2m to
     +∞, with λ̃ ranging from some finite value, λ̃1 say, to +∞:
                                                                      1
           lim r = 2m and lim r = +∞              ⇐⇒        lim u =     and lim u = 0.         (8.64)
          λ̃→λ̃1              λ̃→+∞                         λ̃→λ̃1    2    λ̃→+∞
That the range of λ̃ for outgoing geodesics is (λ1 , +∞) and not the whole real line can be
understood by considering the limit b → 0. We are then in the case of radial null geodesics,
for which it has been proved that r is an affine parameter (cf. Secs. 6.3.1 and 8.2.2). Since this
particular affine parameter obviously takes a finite value at r = 2m, any other affine parameter
must take a finite value as well. Actually, we shall see in Chap. 9 (cf. Sec. 9.3.1) that this r = 2m
limit in the past of outgoing null geodesics does not correspond to the black hole horizon but
to the event horizon of a white hole, which is a located in a part of the spacetime that is not
covered by the Schwarzschild-Droste coordinates considered here.
    Since Pb (un ) = 0, we have m2 /b2 = −2u3n + u2n , so that we may write
with
                            1 un
                                           and                                                 (8.66)
                                                           p
                    u0 :=     −                    u∗ :=    un (3un − 1) + un .
                            4   2
Note that u0 and u∗ are functions of b, via the expression (8.34) of un . They are plotted in
Fig. 8.12. It is clear from this figure that u∗ > u0 ; consequently the term inside the square
brackets in Eq. (8.65) is always positive, in agreement with un being the only real zero of Pb for
b < bc . Furthermore, according to the limits (8.35), we have
                                                                     1
                            lim u∗ = +∞          and        lim− u∗ = .                        (8.67)
                            b→0                            b→bc      3
254                   Null geodesics and images in Schwarzschild spacetime
                            2.5
                                                                                                              u∗
                                                                                                              u0
                                 2
                            1.5
                      u0 , u ∗
                                 1
0.5
                                 0
                                         0                1              2              3             4       5
                                                                                 b/m
      Figure 8.12: Parameters u0 and u∗ , defined by Eq. (8.66), as functions of b. The horizontal dashed line marks
      u = 1/3. [Figure produced with the notebook D.4.12]
      where                                   Z    u∗                Z     u∗
                                                          dū                               dū
                       Φb (u) :=                        p        =              p                          (b < bc )   (8.69)
                                               u         Pb (ū)       u            2ū3 − ū2 + (m/b)2
      and φ∗ is the value of φ at u = u∗ . Since
                                                                                1
                                                                lim− u∗ =         = lim up ,
                                                              b→bc              3 b→b+c
      there is a kind of continuity of (8.69) with the definition (8.47) of Φb (u) for b > bc , despite the fact
      that Φb (u) is not defined for b = bc (more precisely, as we shall see later, limb→b−c Φb (u) = +∞
      for u < u∗ and limb→b+c Φb (u) = +∞ for u < up ).
          To evaluate Φb (u), we shall use expression (8.65) for Pb (u):
                                  Z u∗
                               1                                   dū
                    Φb (u) = √          p                                                         .
                                2 u        (ū − un ) [(ū − u0 ) + (u∗ − un )2 − (u0 − un )2 ]
                                                                 2
The absolute value in the right-hand side of Eq. (8.72) is not necessary in the present case
since u ≥ u∗ . We keep it because the same expression will be used below for the case u > u∗ .
To let appear the incomplete elliptic integral of the first kind (8.54), let us rewrite the above
expression as
                                 "Z                                                    #
                                     π/2                     Z ϕb (u)
                         1                     dϑ                            dϑ
         Φb (u) = p                      p                −           p                  .
                     2(u∗ − un ) 0         1 − k 2 sin2 ϑ      0        1 − k 2 sin2 ϑ
In view of respectively (8.55) and (8.54), the first integral is K(k), while the second one is
F (ϕb (u), k). We have thus
                               1
                  Φb (u) = p            [K(k) − F (ϕb (u), k)]                      (u ≤ u∗ , b < bc ).   (8.74)
                            2(u∗ − un )
As for the case b > bc , the modulus k of the elliptic integrals K and F is a function of b only,
which is given by combining Eqs. (8.73), (8.66) and (8.34). It is plotted in Fig. 8.13. Note that its
range is pretty limited:
                                              √
                                        q
                                      1
                                          2 + 3 < k < 1.                                       (8.75)
                                     |2 {z }
                                                   ≃0.96592
    Let us now turn to the case u > u∗ . It implies −1 < t < 0, so that we perform the change
of variable t = − cos ϑ in order to keep ϑ ∈ (0, π/2). We obtain then
                                                Z ϕb (u)
                                       1                      dϑ
                        Φb (u) = p                       p                ,
                                    2(u∗ − un ) π/2        1 − k 2 sin2 ϑ
   8
       To get expression (8.73), we have used Eq. (8.66) to get rid of u0 .
256                   Null geodesics and images in Schwarzschild spacetime
                                      1
                                 0.995
                                  0.99
                                 0.985
                       k
                                  0.98
                                 0.975
                                  0.97
                                          0         1          2          3        4                5
                                                                    b/m
      Figure 8.13: Modulus k of the elliptic integrals F and K that are involved in expressions (8.74) and (8.76) for
      Φb (u). [Figure produced with the notebook D.4.12]
                                 6                                                        b = 5.196 m
                                                                                          b = 5.192 m
                                                                                          b = 5.19 m
                                 4                                                        b = 5.15 m
                                                                                          b = 5.0 m
                                                                                          b = 3.0 m
                                 2                                                        b = 1.0 m
                                                                                          b = 0.5 m
                       Φ b (u)
-2
-4
                                 -6
                                      0       0.2       0.4   0.6   0.8       1     1.2       1.4
                                                                    u
      Figure 8.14: Function Φb (u) defined by Eq. (8.69) and evaluated via the elliptic integral expressions (8.74) and
      (8.76), for selected values of b < bc . [Figure produced with the notebook D.4.12]
                   8.3 Trajectories of null geodesics in the equatorial plane                         257
where ϕb (u) and k are given by Eqs. (8.72) and (8.73). We conclude that
                          1
             Φb (u) = p            [F (ϕb (u), k) − K(k)]          (u > u∗ , b < bc ).       (8.76)
                       2(u∗ − un )
    The function Φb evaluated via Eqs. (8.74) and (8.76) is plotted in Fig. 8.14 for various values
of b. Note that, by construction [cf. Eq. (8.69)], Φb (u) = 0 ⇐⇒ u = u∗ . Note also that, as in
the case b > bc (Sec. 8.3.3), the closer b is from bc ≃ 5.1961452, the larger the amplitude of
Φb (u).
∆φ = φ∞ − φ−∞ , (8.77)
with φ∞ := limλ̃→+∞ φ and φ−∞ := limλ̃→−∞ φ. Given that limλ̃→±∞ u = 0, Eq. (8.46) with
respectively ϵin = +1 and ϵin = −1 leads to
Hence
                                        ∆φ = 2ϵL Φb (0).                                     (8.79)
    Note that one can have |∆φ| > 2π; in such a case, the null geodesic is winding around the
black hole before leaving to infinity. We therefore introduce the winding number n ∈ Z by
                                               (
                                                 ∆φ ∈ [0, 2π)      if L > 0
                  ∆φ =: ∆φ + 2πn with                                                   (8.80)
                                                 ∆φ ∈ (−2π, 0] if L < 0.
Note that n ≥ 0 for L > 0 and n ≤ 0 for L < 0. We then define the deflection angle Θ by
Θ := ∆φ − ϵL π. (8.81)
Let us recall that ϵL = ±1 is the sign of the conserved angular momentum L [cf. Eq. (8.13)].
The −ϵL π term in the above equation is chosen so that Θ = 0 in flat spacetime (no deflection
of light). By construction, the range of Θ is
−π ≤ Θ ≤ π. (8.82)
    The above concepts are illustrated in Figs. 8.15-8.16, which show the trajectories of null
geodesics arising from infinity along the direction φ = 0 (i.e. having φ−∞ = 0), for various
values of the impact parameter b decaying from b = 12m to bc . They have been computed by
means of the geodesic integrator of SageMath (cf. Appendix D), instead of making use of the
elliptic-integral expression (8.53). All these geodesics have L > 0 and hence ϵL = +1. For
258                    Null geodesics and images in Schwarzschild spacetime
                             y/m                                                        y/m
         b = 12.000000 m                                            b = 8.000000 m
                            10                                                         10
5 5
                                                        x/m                                                        x/m
             -10      -5                5        10                    -10       -5                5        10
-5 -5
-10 -10
                             y/m                                                        y/m
         b = 6.000000 m                                             b = 5.355000 m
                            10                                                         10
5 5
                                                        x/m                                                        x/m
            -10       -5                5        10                    -10       -5                5        10
-5 -5
-10 -10
      Figure 8.15: Null geodesics in the plane θ = π/2 of Schwarzschild spacetime, plotted in terms of the coordinates
      (x, y) := (r cos φ, r sin φ). All geodesics arise from x ≫ m with trajectories initially parallel to the x-axis; they
      differ by the value of the impact parameter b. The grey disk marks the black hole region r < 2m, while the dashed
      green circle indicates the photon orbit at r = 3m. [Figure produced with the notebook D.4.8]
                      8.3 Trajectories of null geodesics in the equatorial plane                                   259
                      y/m                                                     y/m
   b = 5.230000 m                                          b = 5.202500 m
                     10                                                       10
5 5
                                                x/m                                                     x/m
         -10   -5                5       10                   -10       -5               5       10
-5 -5
-10 -10
                      y/m                                                     y/m
   b = 5.196430 m                                          b = 5.196155 m
                     10                                                       10
5 5
                                                x/m                                                     x/m
         -10   -5                5       10                   -10       -5               5       10
-5 -5
-10 -10
Figure 8.16: Same as Fig. 8.15 but for values of b closer to bc ≃ 5.196152 m. [Figure produced with the notebook
D.4.8]
260                 Null geodesics and images in Schwarzschild spacetime
      b = 12m (upper left panel of Fig. 8.15), the geodesic suffers only some moderate bending: the
      deflection angle is Θ ≃ π/6 and the winding number is n = 0. We recover here the standard
      deflection of light by massive bodies in general relativity. For b = 8m and 6m, the bending is
      more pronounced, exceeding Θ = π/2 for b = 6m, still with n = 0. For b = 5.355 m (lower
      right panel of Fig. 8.15), the deflection angle is Θ ≃ π, i.e. the photon goes back in the direction
      from which it was coming.
           When the impact parameter becomes even closer to the critical value bc ≃ 5.196152 m
      [Eq. (8.21)], the null geodesic starts to wind around the black hole before escaping to infinity
      (Fig. 8.16). For b = 5.230 m (upper left panel of Fig. 8.16), the winding number is n = 1 and the
      deflection angle is Θ ≃ −π/2. For b = 5.2025 m (upper right panel of Fig. 8.16), one has n = 1
      and Θ = 0. For b = 5.19643 m (lower left panel of Fig. 8.16), one has n = 1 and Θ ≃ π and
      for b = 5.196155 m (lower right panel of Fig. 8.16), one has n = 2 and Θ ≃ π/2. Note that the
      winding is taking place almost at the photon circular orbit (r = 3m). That after a few turns
      the null geodesic departs to infinity corroborates the fact that the photon orbit is unstable (cf.
      Sec. 8.2.3).
           We can understand the winding phenomenon around the photon circular orbit for b close
      to bc without investigating the properties of elliptic integrals. Indeed, by combining Eqs. (8.79)
      and (8.48), we get
                                           √ Z up                du
                                ∆φ = ϵL 2          p                             .                   (8.83)
                                               0      (up − u)(ua − u)(u − un )
      Given the ordering (8.31), each of the three factors under the square root is positive on the
      integration range (0, up ). For b ̸= bc , one has ua ̸= up (cf. Fig. 8.5) and the only diverging
                                              √
      term in the integrand of (8.83) is 1/ up − u, which diverges at the integral boundary u = up .
      However, the integral                       Z up
                                                           du
                                                        √
                                                   0      up − u
                                   √
      is finite, being equal to 2 up , so that ∆φ remains finite. When b → bc , ua → up and the
      integral (8.83) has a behavior similar to
                                       Z up                   Z up
                                                  du                  du
                                             p             =               .
                                         0      (up − u) 2
                                                               0   up − u
      Since the latter is a diverging integral, we conclude that
                                        ∆φ → ±∞ when b → bc .                                       (8.84)
      We recover the behavior observed for b = bc in Sec. 8.3.2: for an external critical null geodesic,
      ∆φ is infinite, the geodesic spiralling indefinitely around the photon orbit (cf. Fig. 8.7). We
      shall refine (8.84) in Sec. 8.4.1 [Eq. (8.102)].
   If the emission point is located prior to the periastron, i.e. if λ̃em < λ̃p , φem is related to φp
by Eq. (8.46) with ϵin = +1:
                                    φem = φp − ϵL Φb (uem )                                      (8.90)
and we get, by combining with Eq. (8.78),
                                  φ∞ = φem + ϵL [Φb (0) + Φb (uem )] .                             (8.91)
Equation (8.53) then yields
                        √
                          2                                                       (b > bc )
    φ∞ = φem + ϵL √            [2K(k) − F (ϕb (uem ), k) − F (ϕb (0), k)]                     .
                       ua − un                                                  (λ̃em < λ̃p )
                                                                                            (8.92)
φ∞ − φem is plotted as a function of b in Fig. 8.17 for various values of rem = m/uem , with the
dashed curves corresponding to the case λ̃em > λ̃p . We notice that for b ≫ m (cf. the red curve
near b = 12m), one has either φ∞ − φem ≃ 0 (dashed red curve: λ̃em > λ̃p ) or φ∞ − φem ≃ π
(solid red curve: λ̃em < λ̃p ): null geodesics with large impact parameters suffer almost no
deflection, as expected. Another striking feature of Fig. 8.17 is the divergence of φ∞ − φem
when b tends to bc . Let us examine this in details.
262                   Null geodesics and images in Schwarzschild spacetime
                             7π                                                           rem → 2m
                             2                                                            rem = 3 m
                                                                                          rem = 4 m
                             3π                                                           rem = 6 m
                                                                                          rem = 10 m
                                                                                          rem = 20 m
                             5π                                                           rem = 100 m
                       ϕ∞ − ϕem
                             2                                                            rem → + ∞
                             2π
                             3π
                             2
                               π
                             1π
                             2
                                  0
                                      0.0     2.0          4.0    6.0        8.0        10.0        12.0
                                                                 b/m
      Figure 8.17: Total change φ∞ − φem in φ from the emission point at (r, φ) = (rem , φem ) as a function of
      the impact parameter b of the null geodesic, assuming ϵL = +1. The dashed curves correspond to trajectories
      with b > bc that do not pass through the periastron. For each value of rem > 3m, the maximum of b is given by
      rp (b) = rem , i.e. by b2 = rem
                                   3
                                      /(rem − 2m) [cf. Eq. (8.18) and Fig. 8.3]. [Figure produced with the notebook D.4.11]
      Limit b → b+
                 c
Note that the constraint (8.85) is equivalent to 1 − 3uem > 0, so that the above formula is well
posed.
    For the case λ̃em < λ̃p , φ∞ is given by Eq. (8.92) and we need to determine the behavior of
K(k) to conclude. Actually, when k → 1, K(k) is diverging with the following behavior9 :
                                                          
                                                       4
                                lim K(k) − ln √                = 0.                       (8.98)
                               k→1                   1 − k2
          √
To relate 1 − k 2 to b − bc , let us introduce the small parameter ε > 0 such that
                                                                   1
                                                          up =:      − ε.                                              (8.99)
                                                                   3
The relation Pb (up ) = 0, once expanded to second order in ε2 , leads to the following relation
between b − bc and ε:                          √
                                  b − bc     81 3 2
                                          =        ε + O(ε3 ).                               (8.100)
                                     m         2
Besides, from the expressions (8.33) of un and ua in terms of up , we get, still to the second order
in ε,
                       1                                    1
               un = − + 2ε2 + O(ε3 )         and       ua = + ε − 2ε2 + O(ε3 ).
                       6                                    3
Substituting these values in the expression (8.50) of the modulus k and expanding to the first
order in ε leads to
                                      k = 1 − 2ε + O(ε2 ),
from which                            √                    p                   √
                                          1 − k2 ∼             1 − (1 − 4ε) ∼ 2 ε.
                                                  ε→0                            ε→0
The property (8.98) then leads to
                                                                                  
                                                                   2                4
                                       2K(k) ∼ 2 ln                √          ∼ ln     .
                                                 ε→0                 ε       ε→0    ε
Using this result, as well as (8.96), in Eq. (8.92) yields
                                 "                √      √          !                               √         !#
                                        4              3 + 6uem + 1                                       3+1
       φ∞ ∼+ φem + ϵL × 2 ln                − ln      p                 − ln                              √
            b→bc                        ε                2(1 − 3uem )                                       2
                                                                     
                                  16      2(1 − 3uem )          2
             ∼ φem + ϵL ln             √     √               √          .
            b→b+
               c                  ε2 ( 3 + 6uem + 1)2 ( 3 + 1)2
Note that we have used the limits (8.93) to evaluate the prefactor in Eq. (8.92) as
                                                                                    p
                                                                                      2/(ua − un ) ∼
  2/(1/3 − (−1/6)) = 2. Expressing ε in terms of b − bc via Eq. (8.100) leads to the final
p                                           2
formula:
                                    √                                !
                              648(2 3 − 3)(1 − 3uem )           m
    φ∞ ∼+ φem + ϵL ln                      p               ×                  (λ̃em < λ̃p ). (8.101)
        b→bc                 2 + 3uem + 3(1 + 6uem ) b − bc
   9
                                                      √
       See e.g. Eq. (112.1) of Ref. [79] with k ′ =       1 − k 2 or Eq. (17.3.26) of Ref. [3] with m = k 2 and m1 = 1 − k 2 .
264                 Null geodesics and images in Schwarzschild spacetime
      The prefactor of m/(b − bc ) in the logarithm is plotted as a function of rem = m/uem in Fig. 8.22
      below.
          The value of the total change ∆φ along the complete geodesic history [cf. Eq. (8.77)] is
      obtained by taking the limit uem → 0 (i.e. rem → +∞) in this formula. We get
                                                            √            !
                                                      648(7 3 − 12)m
                                    ∆φ ∼+ ϵL ln                             .                    (8.102)
                                         b→bc               b − bc
      We recover the result (8.84): ∆φ is diverging when b → b+  c . Moreover, Eq. (8.102) specifies this
      divergence as logarithmic in b − bc . More generally, Eq. (8.101) shows that φ∞ − φem diverges
      logarithmically in b − bc when b → b+  c , whatever the position of the emission point.
          Let us express ∆φ is terms of the deflection angle Θ and the winding number n through
      Eqs. (8.80) and (8.81):
                                           ∆φ = Θ + ϵL π + 2πn.                                   (8.103)
      We deduce then from (8.102) that
                                               √
                                b − bc ∼+ 648(7 3 − 12)e−π m e−ϵL Θ−2π|n| .                           (8.104)
                                      b→bc |     {z      }
                                                     ≃3.482284
      In the above writing, we have used the fact that the sign of n is the same as that of L, so that
      ϵL n = |n|.
      Remark 1: A check of Eq. (8.102) is obtained by comparing it with Eq. (268) in Chap. 3 of Ref. [107],
      where δD = b−bc and Θ is the same as ours, or with Eq. (7.4.54) of Ref. [204], where ∆ℓ = (b−bc )/(2m).
      Historical note : Equation (8.104) has been first derived by Charles Galton Darwin – the grandson of
      the famous naturalist Charles Robert Darwin – in 1959 [155] (cf. Eqs. (31) and (32) of Ref. [155], where
      µ = Θ).
   If uem > u∗ , then Φb (uem ) is given by Eq. (8.76). Combining with Eq. (8.107), we can then
write Eq. (8.106) as
                  ϵL                                                                       (b < bc )
   φ∞ = φem + p            [2K(k) − F (ϕb (uem ), k) − F (ϕb (0), k)]                                 ,
               2(u∗ − un )                                                                (uem > u∗ )
                                                                                           (8.110)
where ϕb (uem ) is still given by Eq. (8.109). φ∞ − φem is plotted as a function of b in Fig. 8.17
for various values of rem = m/uem . For rem ≫ m (cf. the red curve rem = 100 m for b < bc ),
we note that φ∞ − φem ≃ 0, as expected. As for the case b > bc , we also note that φ∞ − φem is
diverging when b tends to bc . Let us quantify this diverging behavior:
Limit b → b−
           c
When b → b−
          c , we have the following limits [cf. Eqs. (8.35) and (8.67)]:
                                          1                            1
                            lim− un = −              and      lim− u∗ = .
                            b→bc          6                  b→bc      3
Moreover, from expression (8.73) for k and the above limits for u∗ and un , we have (see also
Fig. 8.13)
                                        lim− k = 1.
                                              b→bc
Given the values (8.111) and expression (8.95) for the elliptic integral F when k = 1, we get
                                                            p               !
                                               2 + 3uem + 3(1 + 6uem )
                   lim F (ϕb (uem ), k) = ln                                           (8.112a)
                  b→b−c                                 |1 − 3uem |
                                               √
                   lim− F (ϕb (0), k) = ln(2 + 3).                                     (8.112b)
                    b→bc
When uem < u∗ , inserting these formulas into Eq. (8.108) leads to
                                             p            !                              
                                   2 + 3uem + 3(6uem + 1)                             1
        φ∞ ∼− φem + ϵL ln                  √                                 uem    <         .           (8.113)
             b→bc                     (2 + 3)(1 − 3uem )                              3
We have written uem < 1/3 because u∗ → 1/3 when b → b−        c . For this reason, we also get
rid of the absolute value around 1 − 3uem . Note that the value of φ∞ given by Eq. (8.113) is
266                    Null geodesics and images in Schwarzschild spacetime
      identical to that given by Eq. (8.97), which was obtained for b → b+     c and an emission point
      beyond the periastron. This is not surprising if one invokes the continuity around b = bc of
      null geodesics as regards their part beyond the periastron for b > bc and outside r = 3m for
      b < bc . Indeed, the geodesics with b → b+c differ significantly from those with b → bc only on
                                                                                              −
      parts including the periastron. This continuity appears clearly on Fig. 8.17: the curves with
      b < bc and rem > 3m have a continuous prolongation with the dashed curves, which are the
      curves with b > bc without any periastrion on the path from the emission point to infinity.
          For uem > u∗ , we shall use formula (8.110), with K(k) having the diverging behavior (8.98)
      since limb→b−c k = 1. To express K(k) in terms of |b − bc |, let us introduce the small parameter
      ε > 0 such that
                                                        1
                                               un = − − ε.                                       (8.114)
                                                        6
      The relation Pb (un ) = 0, once expanded to first order in ε, leads then to the following relation
      between bc − b and ε:                                √
                                             bc − b     81 3
                                                     =         ε.                                (8.115)
                                               m           4
      Besides, by expanding formula (8.66) at first order in ε, we get
                                                    1
                                             u∗ =     + ε + O(ε2 ).
                                                    3
      Substituting this value, as well as (8.114), into expression (8.73) for k yields
                                                       ε
                                             k = 1 − + O(ε2 ),
                                                       4
      so that
                                   √
                                                   r                     r
                                                            ε              ε
                                       1 − k2 ∼        1− 1−    ∼
                                             ε→0             2 ε→0           2
      The property (8.98) then leads to
                                                        √ !        
                                                       4 2         32
                                   2K(k) ∼ 2 ln         √    ∼ ln     .
                                           ε→0            ε ε→0     ε
      Substituting this expression for K(k), as well as Eq. (8.112) for F (ϕb (uem ), k) and F (ϕb (0), k),
      into Eq. (8.110) results in
                                       "                                             !                #
                                                                                                    √
                                                                      p
                              ϵL            32          2 + 3uem + 3(1 + 6uem )
       φ∞ ∼− φem + q                  ln ε − ln                                          − ln(2 + 3)
           b→bc
                           2 31 + 16                              3uem − 1
                                            √                   !
                                 32 (2 − 3)(3uem − 1)
            ∼ φem + ϵL ln                      p                  .
           b→b−c                  ε 2 + 3uem + 3(1 + 6uem )
Figure 8.18: Link between the observation angle b̂ and the impact parameter b for an asymptotic observer O
located at rO ≫ m.
Remark 3: As a check, Eq. (8.117) agrees with Eq. (4) in Ref. [234].
8.5      Images
Being the worldlines of photons, null geodesics are the key ingredient in determining images
as seen by some observer of emitting material around a black hole. Computing such images is
of great interest, especially after the first image of a black hole vicinity obtained by the Event
Horizon Telescope team in 2019 [6, 83]. As stated in Sec. 8.1, we shall differ the discussion of
that image to the chapter dealing with rotating black holes (Sec. 12.5.3).
10
                                                                                                          1
                   5
                                                                                                          3
                                    S
            y/m
                   -5
                                                                                                          2
-10
                          -10           -5           0             5            10            15           20
                                                              x/m
      Figure 8.19: Null geodesics in Schwarzschild spacetime with φ∞ = 0 and various values of the impact
      parameter b. The green ones have b ranging from 0 to 12m, by steps of 0.8m, while the olive ones have b close to
      bc , namely b/m ∈ {5.0, 5.2, 5.4}. S is a luminous point source and three of its images for an observer at y = 0
      and x → +∞ are obtained by following the drawn null geodesics through S. [Figure produced with the notebook
      D.4.13]
      A second parameter characterizing any incoming null geodesic is a polar angle α ∈ [0, 2π) so
      that (b̂, α) are spherical coordinates on O’s celestial sphere, the North pole (b̂ = 0) of which
      coinciding with the direction to the black hole. The image on O’s screen at the telescope output
      may be then parameterized by polar coordinates (b̂, α), with b̂ representing the distance to
      the screen’s center. Other projections from the celestial sphere to the planar screen may be
      considered but this is rather unimportant here, given the small field of view of the telescope. In
      what follows, we shall use directly b, instead of b̂, as the radial coordinate in the screen plane.
      According to Eq. (8.118), this amounts to drop the constant scale factor 1/rO .
         Moreover, we shall consider the asymptotic observer limit: rO → ∞. In that limit, null
      geodesics that reach O must have
To understand the formation of images on O’s screen, a bunch of null geodesics with φ∞ = 0
is drawn in Fig. 8.19. O is located in the far right of this figure and all the drawn geodesics
eventually hit O’s screen. Let us consider a luminous point source S. From the drawing of
Fig. 8.19, three null geodesics of the φ∞ = 0 family goes through S. They give rise to three
distinct images of S on O’s screen, which are represented by the open circles labelled 1 to 3 in
the right part of the figure. These three geodesics can be distinguished by their deflection angle
Θ and their winding number n (cf. Sec. 8.3.6). The geodesic giving image 1 is the less deflected
one: it has L < 0, φ−∞ ≃ 5π/4, ∆φ ≃ −5π/4, n = 0 and hence, from Eq. (8.81), a deflection
angle Θ ≃ −π/4. The geodesic giving image 2 has L > 0, φ−∞ ≃ −4π/3, ∆φ ≃ 4π/3,
n = 0 and Θ ≃ π/3. Finally, the geodesic giving image 3 has L < 0, φ−∞ ≃ 2π + 7π/6,
∆φ ≃ −7π/6, n = −1 and Θ ≃ −π/6. It it the most deflected of the three, having a winding
number of n = −1, i.e. it makes a full turn around the black hole before reaching O. We note
that the more deflected the geodesic, the closer the image with respect to the circle b = bc on
O’s screen.
    For the sake of clarity, only three images of the source S have been depicted in Fig. 8.19,
but we are going to see that there is actually an infinite number of them: two per value of |n|
(the number of turns around the black hole) — one with L > 0 and one with L < 0.
Remark 1: In Fig. 8.19, the source S is rather close to the black hole but it is worth to stress that the
multiple character of the images of a given source is not due to the proximity between the source and
the black hole. Multiple images are formed for any source, even very far ones. For instance, the source
can lie behind the observer, i.e. one can have xS > xO . This is clear if we consider the lower-right panel
of Fig. 8.15 (case b = 5.355 m): a source with xS > xO ≫ m and y = 5.355 m gives an image on O’s
screen at b = 5.355 m.
In the above discussion, we have parameterized the null geodesics arriving on the observer’s
screen by the impact parameter b. Let us relate the latter to the emission angle in the source
frame. To this aim, we consider that the point source S, as depicted in Fig. 8.19, is actually the
trace in the plane of the figure of the worldline of a static observer equipped with a source of
light, who we shall call the emitter and denote by Oem . Furthermore,    we suppose that the rest
frame of Oem is an orthonormal tetrad uem , e(r) , e(θ) , e(φ) such that the first spacelike vector,
                                                              
e(r) , is always pointing towards the black hole (cf. Fig. 8.20). Note that the first vector of the
270                   Null geodesics and images in Schwarzschild spacetime
      Figure 8.20: Part (e(r) , e(φ) ) of the vector frame of the emitter Oem and emission angle η. The coordinates
      (x, y) are defined in terms of the Schwarzschild-Droste coordinates by x = r cos φ and y = r sin φ. P is the
      photon momentum as measured by Oem .
      an orthonormal tetrad with respect to the Schwarzschild metric (6.14). Moreover, having a
      4-velocity uem collinear to the Killing vector ∂t makes the observer Oem static [cf. Eq. (7.28)].
         The generic 4-momentum p of a photon evolving in the equatorial plane θ = π/2 is given
      by Eqs. (7.10) and (8.1)-(8.3):
                                    "         −1                                 #
                                           2m                                  b
                                                                                                (8.121)
                                                         p
                           p=E         1−           ∂t ± 1 − b2 U (r) ∂r + ϵL 2 ∂φ ,
                                            r                                 r
      where we have used Eq. (8.14) to let appear U (r) and Eqs. (8.12)-(8.13) to express10 L in terms
      of E and b. Taking the value of p at the emission point S and using (8.120) to express ∂t , ∂r
      and ∂φ in terms of Oem ’s orthonormal tetrad, we get
                                                  p = εem (uem + n) ,                                           (8.122)
        10
           Note that formula (8.121) assumes E ̸= 0, otherwise b := |L|/E would be ill-defined. As discussed in
      Sec. 8.2.3, the property E ̸= 0, and even E > 0, holds as soon as the photon travels in the black hole’s exterior.
                                                           8.5 Images                                                  271
²L b/m 0.0
−5.0
                          −10.0
                                  0                  1π
                                                     2
                                                                   π          3π
                                                                              2
                                                                                             2π
                                                                   η
Figure 8.21: Impact parameter b of null geodesics (multiplied by ϵL = ±1) as a function of the emission
angle η in the emitter’s rest frame, for various values of the r-coordinate rem of the static emitter. Dashed curves
correspond to values of rem lower than 3m. The two horizontal grey lines marks b = bc ≃ 5.196 m. [Figure
produced with the notebook D.4.14]
where                                                                     −1/2
                                                                2m
                                      εem   := −uem · p = E 1 −                                             (8.123)
                                                                rem
is the energy of the photon as measured by Oem (cf. Sec. 1.4) and
                                                                                                            (8.124)
                               p                         p
                       n := ± 1 − b2 U (rem ) e(r) − ϵL b U (rem ) e(φ)
                                                             and                                            (8.126)
                          p                                                         p
               cos η = ± 1 − b2 U (rem )                               sin η = −ϵL b U (rem ),
from which we can express the impact parameter b and in terms of the emission angle η:
                                                               sin η
                                                     b = −ϵL p         .                                    (8.127)
                                                              U (rem )
Remark 2: Equation (8.126) is well posed because U (r) ≥ 0 for r > 2m [cf. Eq. (8.14)] and the equation
of motion (8.11) implies b−2 − U (rem ) ≥ 0, from which we get 0 ≤ b2 U (rem ) ≤ 1.
272                  Null geodesics and images in Schwarzschild spacetime
80
60
                               40
                     A(rem )
                               20
                               -20
                                     0   5   10    15      20     25     30      35     40
                                                        rem /m
      Figure 8.22:
               √    Function A(rem ) defined by Eq. (8.130). The dashed line marks the horizontal asymptote
      A = 648(7 3 − 12) ≃ 80.58246. [Figure produced with the notebook D.4.14]
         The variation of b in terms of η and rem , as given by formula (8.127), is depicted in Fig. 8.21.
      We notice that the smallest range of b is [0, bc ) and is reached for rem = 3m, which is not
      surprising in view of (8.127) since the maximum of U precisely occurs at r = 3m (cf. Fig. 8.1).
ϵL = −1 by b−
            n:
                           b+
                            n = bc + mA(rem ) e
                                               φem −2πn
                                                        ,      n ∈ N∗                        (8.131a)
                           b−
                            n := bc + mA(rem ) e
                                                −φem −2πn
                                                          ,      n ∈ N∗ .                    (8.131b)
The function A(rem ) is plotted in Fig. 8.22. It varies monotonically between
                                  √                                           √
        lim A(rem ) = −648(26 3 − 45) and                  lim A(rem ) = 648(7 3 − 12) .
      rem →2m            |       {z         }           rem →+∞          |    {z     }
                              ≃−21.59201                                        ≃80.58246
One shall recall that formulas (8.131) have been derived under the assumption |b − bc | ≪ m.
Given the amplitude of A(rem ) shown in Fig. 8.22 and the value e−2π ≃ 1.9 × 10−3 , this is valid
for any n ≥ 1, except maybe for b+   n with n = 1 and φem large (i.e. close to 2π).
    Via Eq. (8.127), the two sequences (8.131) of the impact parameter b correspond to two
sequences of the emission angle η:
                                                                                           (8.132a)
                                 p
                     sin ηn+ := − U (rem ) bc + mA(rem )eφem −2πn , n ∈ N∗
                                                                   
                                                                                           (8.132b)
                                p
                     sin ηn− := U (rem ) bc + mA(rem )e−φem −2πn , n ∈ N∗ .
                                                                  
Note that 0 ≤ bc U (rem ) ≤ 1 and bc U (rem ) = 1 ⇐⇒ rem = 3m. As discussed above (cf.
                   p                    p
Fig. 8.21), for rem ̸= 3m, there are always some solutions for ηn+ and ηn− for n sufficiently large
(in practice, n ≥ 1). We therefore conclude:
  Property 8.3: images of a point source
 Any static point source S give birth to two infinite sequences of images on the screen of
 the asymptotic observer O, at the effective distances from the screen’s center (b+        n )n∈N∗
 and (b−  )
        n n∈N ∗ , as  given by  Eq. (8.131).  If S, O and the black  hole are  not aligned,  i.e. if
 φem ̸∈ {0, π}, all the images of S are aligned on O’s screen: they all lie in the plane defined
 by S, O and the black hole (plane of Fig. 8.19). If S is located outside the photon sphere
 (rem > 3m), all images lie outside the circle b = bc on O’s screen, while if S is located inside
 the photon sphere (rem < 3m), all images lie inside that circle. In both cases, the images of
 the sequence (b+  n )n∈N∗ (positive L) are located on the opposite side of those of the sequence
 (bn )n∈N∗ (negative L). The higher n, the closer the images b+
   −
                                                                   n and bn to the circle b = bc .
                                                                            −
 Actually, according to the laws (8.131), each sequence converge exponentially fast to that
274                     Null geodesics and images in Schwarzschild spacetime
                  10
                                                                                                             1
                   5
                                                                                                             3
                          A
            y/m
                   -5                                                                                        4
                                                                                                             2
                  -10
                          -10           -5             0             5            10            15            20
                                                                x/m
      Figure 8.23: Same as Fig. 8.19 but underlining four null geodesics from a point source A located on the x-axis,
      i.e. aligned with the black hole and the observer on the far right. [Figure produced with the notebook D.4.13]
circle. From n to n + 1, the distance to the circle is reduced by a factor e−2π ≃ 1.9 × 10−3 .
      Remark 3: The alignment of the images is a direct consequence of the spherical symmetry of Schwarzschild
      spacetime.
      Remark 4: The reader may be puzzled by the compatibility between an infinite number of images from
      a single source and the conservation of energy. There is actually no issue here because one can show
      that the images are fainter as n increases. So in practice, only a few images would be visible.
Fig. 8.23 shows only a limited number of images, but there is actually an infinite number of
such images: the full image of A on O’s screen is composed by a infinite sequence (Cn )n∈N
of concentric circles that accumulate exponentially fast onto the circle of radius b = bc . Each
circle Cn has a radius bn that is given by Eq. (8.131) with φem = 0 or π (case of Fig. 8.23), so that
 n = bn =: bn (case φem = 0) or bn = bn−1 =: bn (case φem = π). The outermost circle, C0 , is
        −                                   −
b+                                    +
called the Einstein ring of A. It is the only significant ring in astronomical images involving
gravitational lensing by a massive foreground source (not necessarily a black hole), which are
in the context |xA | ≫ m and b ≫ m. We may call Cn the nth Einstein ring of A. For n ≥ 1,
Cn is also called a relativistic Einstein ring of A. Indeed, these rings exist only in the case of
a very relativistic central object, so that photons can wind around it. On the contrary C0 exists
even for non-relativistic objects. In particular, in astronomical observations of Einstein rings
performed up to now, the central deflecting object is a galaxy or a galaxy cluster, both being
highly non-relativistic (very low compactness).
Historical note : The infinite sequence of images of a point source located at r > 3m, with accumulation
just outside the circle b = bc , has been predicted in 1959 by Charles Galton Darwin [155] (cf. historical
note on p. 264). Darwin has also derived the sequence of relativistic Einstein rings for aligned sources.
      Figure 8.24: Shadow of a Schwarzschild black hole. The dashed grey circle delimitates the dark disk that would
      appear on the screen of an observer looking at a black√ sphere of radius r = 2m in Minkowski spacetime. The
      ratio between the radii of the two disks is bc /2m = 3 3/2 ≃ 2.60.
        For a distant observer, the image of a uniformly bright sphere S surrounding the black
        hole and the observer is a bright area fulfilling the observer’s screen, except for a central
        black disk of radius b = bc (cf. Fig. 8.24). This disk is called the black hole shadow.
      Historical note : In his 1921 treatise [340], Max von Laue noted√that an emitting sphere of radius
      r0 such that 2m < r0 < 3m will appear as having a radius of 3 3m (= bc ) to a distant observer,
      independently of the value of r0 . Maybe the first mention of a shadow lies in the article by Charles
      Galton Darwin in 1959 [155]. Actually, Darwin imagined that the source of the Schwarzschild metric
      is a massive point particle, called by him the “sun”, and wrote “suppose that there is a roughly uniform
      star-field all round the sky, and consider what will be seen and mapped from the telescope... To the area
      inside this circle (the circle b = bc ) only the sun itself can contribute light. The most obvious assumption
      is that its rays would emerge in straight lines, so that there would be a brilliant point of light surrounded
      by blackness”. In the full black hole context, the first computation of the shadow has been presented
      by James M. Bardeen at the famous 1972 Les Houches Summer School [38]; the shadow is called the
      apparent shape of the black hole by Bardeen and the computation has been performed for the Kerr black
      hole, which generalizes the Schwarzschild black hole to the rotating case (cf. Chap. 10); this Kerr black
      hole shadow will be discussed in details in Sec. 12.4.
Figure 8.25: Image of an accretion disk around a Schwarzschild black hole, as seen by a distant observer, with
various inclination angles: ι = 0 (upper left), ι = π/6 (upper right), ι = π/3 (lower left) and ι = 3/2 (lower
right). [Figure produced by Gyoto with the input files given in Sec. E.2.1]
      [388] and Page & Thorne [396], lying in some plane around the black hole. It is geometrically
      thin but optically thick. The inner radius of the disk is the innermost stable circular orbit
      (ISCO) at r = 6m (cf. Sec. 7.3.3). The images shown in Fig. 8.25 have been computed by
      the open-source ray-tracing code Gyoto [493] (cf. Appendix E). This code integrates the null
      geodesic equations backward, starting from the observer’s screen. For Fig. 8.25, the observer is
      located at r = 1000 m and at various inclination angles ι with respect to the accretion disk,
      ranging from ι = 0 (disk seen face-on) to ι = 3/2 (disk seen almost edge-on). The colors in
      Fig. 8.25 encode the flux at a fixed wavelength.
          Let us first discuss the face-on view (upper-left panel of Fig. 8.25). To interpret it, some null
      geodesics generating it have been depicted in Fig. 8.26. One may distinguish three images of
      the accretion disk:
          • the primary image, which is formed by the outermost bright part; it is generated by null
            geodesics that suffers a small deflection from the disk to the observer; more precisely,
            they are arising from the side of the disk facing the observer and have ∆φ = ±π/2 in
            their trajectory plane (cf. orange curves in Fig. 8.26); the inner boundary of the primary
            image is the image of the ISCO under these geodesics; it is located at b = 6.932 m on the
            observer’s screen;
          • the secondary image, which is the bright yellow ring separated from the inner boundary
            of the primary image by the thick black annulus; this image is generated by null geodesics
            that perform half a turn around the photon sphere before reaching the observer; more
            precisely, they are arising from the side of the disk opposite to the observer and have
            ∆φ = ±3π/2 in their trajectory plane (cf. brown curves in Fig. 8.26); the inner boundary
            of the secondary image is located at b = 5.479 m on the observer’s screen;
          • the tertiary image, which is the thin faint innermost ring; this image is generated by
            null geodesics that perform a full turn around the photon sphere before reaching the
            observer; more precisely, they are arising from the side of the disk facing the observer
            and have ∆φ = ±5π/2 in their trajectory plane (cf. red curves in Fig. 8.26); the inner
            boundary of the tertiary image is located at b = 5.208 m on the observer’s screen.
      As discussed in Sec. 8.5.2, there are actually an infinite number of images of the accretion disk
      but they are more and more faint and only three of them are visible in Fig. 8.25. Moreover
      these images are formed by exponentially thinner rings and accumulate near the circle b = bc .
      In Figs. 8.25 and 8.26, the tertiary image corresponds to n = 2 for ϵL = 1 and n = 1 for
      ϵL = −1 in Eq. (8.128) 11 , so that Eqs. (8.131)   √ with φem = π/2 (b = b−    1 ) and φem = 3π/2
      (b = b+
            2 ) yield (b − b c )/bc = A(r em )e−5π/2
                                                     /(3  3). With r em = 6m, we  have  A(rem ) ≃ 30.3, so
      that (b − bc )/bc ≃ 0.0013 for the tertiary image. This we may conclude that, up to a relative
      accuracy of 10−3 , the interior of the tertiary image corresponds to the black hole shadow as
      defined in Sec. 8.5.4,
          For a nonzero inclination angle, the images shown in Fig. 8.25 exhibit some asymmetry
      between the left and the right. This is due to the rotation of the disk at relativistic speed, which
        11
          This asymmetry of n with respect to ϵL is due to the convention φem ∈ [0, 2π); had we chosen φem ∈ (−π, π],
      it would not be present.
                                                      8.5 Images                                                  279
                                 y/m
                                10
                                  8
                                  6
                                  4
                                  2
                                                                                  x/m
                                                  5            10            15
                                 -2
                                 -4
Figure 8.26: Formation of the primary (orange), secondary (brown) and tertiary (red) images of an accretion
disk when viewed face-on. The figure plane is orthogonal to the disk plane, the intersection between the two
planes being the y-axis, so that disk is depicted by the thick segments located at φ = π/2 and φ = 3π/2. As in
Fig. 8.25, the inner boundary of the disk is rin = 6 m (the ISCO), but the outer boundary has been truncated at
rout = 9 m for pedagogical purposes. The observer is located at x ≫ m and y = 0. The dashed green circle is the
photon orbit at r = 3 m. The lower panel shows the geodesics emerging from the inner and outer edges of the
upper part of the disk. [Figure produced with the notebook D.4.15]
280                  Null geodesics and images in Schwarzschild spacetime
      generates a strong Doppler boost, making the left part of the disk, which moves towards the
      observer, much brighter than the right part, which recedes from the observer. Note that for the
      three views with ι ̸= 0, the primary, secondary and tertiary images are still present but are
      no longer circular rings. Note that the almost edge-on view is drastically different from what
      would get in flat (Minkowski) spacetime, since for the latter the accretion disk would appear as
      a very thin ellipsoidal shape elongated along the horizontal axis.
      Historical note : The first computation of the image of an accretion disk around a Schwarzschild black
      hole has been performed by Jean-Pierre Luminet in 1979 [348], (see Ref. [349] for an historical account).
      The accretion disk was the same Page-Thorne model as that considered here and Luminet obtained an
      image pretty close to that of the last quadrant of Fig 8.25, the difference lying in the inclination angle:
      ι = 80◦ ≃ 1.40 rad versus ι = 1.50 rad in Fig 8.25. In 1991, Jean-Alain Marck computed the first movie
      of an observer plunging into a Schwarzschild black endowed with an accretion disk [355, 356]. In 2007,
      Alain Riazuelo computed very precise images of a Schwarzschild black hole in front of a realistic stellar
      field [432, 433], which illustrate magnificently the multiple character of images of stars (Sec. 8.5.2) and
      the concept of black hole shadow (Sec. 8.5.4) (cf. Appendix F).
Chapter 9
Contents
            9.1    Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 281
            9.2    Kruskal-Szekeres coordinates . . . . . . . . . . . . . . . . . . . . . . . 282
            9.3    Maximal extension . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 290
            9.4    Carter-Penrose diagram . . . . . . . . . . . . . . . . . . . . . . . . . . . 297
            9.5    Einstein-Rosen bridge . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307
            9.6    Physical relevance of the maximal extension . . . . . . . . . . . . . . 318
9.1         Introduction
In the preceding chapters, the Schwarzschild spacetime was considered to be (MIEF , g), where
the manifold MIEF is covered by the ingoing Eddington-Finkelstein coordinates. It turns out
that this spacetime can be smoothly (actually analytically) extended to a larger spacetime,
(M , g) say, which on its turn cannot be extended, i.e. it is maximal. To construct (M , g), we
first introduce the so-called Kruskal-Szekeres coordinates on MIEF in Sec. 9.2. These coordinates
restore some symmetry between the ingoing radial null geodesics and the outgoing ones: both
types of geodesics appear as straight lines in a spacetime diagram built on Kruskal-Szekeres
coordinates, with a slope +1 for the outgoing family and −1 for the ingoing one. Then it
appears clearly that the outgoing radial null geodesics are artificially halted1 at some past
boundary of MIEF , while no curvature singularity is located there. This calls for an extension
of the spacetime, which is performed in Sec. 9.3, thanks to Kruskal-Szekeres coordinates. This
extension is maximal; in particular, all incomplete geodesics are so because they either terminate
to or emanate from a curvature singularity. In Sec. 9.4, we construct the Carter-Penrose diagram
   1
       Technically, one says that they are incomplete geodesics, cf. Sec. B.3.2.
282                     Maximal extension of Schwarzschild spacetime
      of the maximal Schwarzschild spacetime (M , g), via two compactified versions of the Kruskal-
      Szekeres coordinates. We show that the simplest one, which is standard in the literature, does
      not lead to a regular conformal completion of (M , g), as defined in Sec. 4.3. The second version,
      built on Penrose-Frolov-Novikov compactified coordinates, achieves this goal. We use this
      completion to show explicitly that the maximal spacetime contains a white hole, in addition
      to the black one. In Sec. 9.5, we investigate the hypersurfaces of constant Kruskal-Szekeres
      time, which connect two asymptotically flat regions of (M , g) through the co-called Einstein-
      Rosen bridge. Finally, Sec. 9.6 discusses the physical relevance of the maximally extended
      Schwarzschild spacetime.
      Although r cannot be explicitly expressed in terms of (u, v), the function r 7→ r+2m ln 2m
                                                                                               r
                                                                                                 −1
      is invertible on (2m, +∞) (cf. Fig. 9.1), so that (9.1) does define a coordinate system on MI .
      The range of (u, v) is R2 .
          The above relations imply
                          dr                     dr                                      dr2
            du = dt −          ,   dv = dt +                and      du dv = dt2 −              .
                        1 − 2m
                             r
                                               1 − 2m
                                                    r                                  1−   2m 2
                                                                                             r
                                         r ∗ /m
                                         10
                                          9
                                          8
                                          7
                                          6
                                          5
                                          4
                                          3
                                          2
                                          1
                                               1 2 3 4 5 6 7 8
                                                                         r/m
                                          -1
                                          -2
                                          -3
                                          -4
                                          -5
                                          -6
                                          -7
                                          -8
Since the range of (u, v) is R2 , the range of U is (−∞, 0) and that of V is (0, +∞). We have
            1 −u/4m                        1 v/4m
  dU =        e     du,          dV =        e    dv              and        du dv = 16m2 e(u−v)/4m dU dV.
           4m                             4m
In view of Eq. (9.3), we get
                                                                          −1
                                                        32m3 −r/2m
                                       r   −1                    
                        2 −r/2m                                        2m
       du dv = 16m e                      −1    dU dV =     e       1−        dU dV.
                                       2m                 r             r
Substituting this expression in (9.2) yields the expression of the metric with respect to the
coordinates X α̂ := (U, V, θ, φ):
                                  32m3 −r/2m
                                             dU dV + r2 dθ2 + sin2 θ dφ2 .                                         (9.5)
                                                                        
                         g=−          e
                                    r
284                     Maximal extension of Schwarzschild spacetime
      We notice that the factor (1 − 2m/r) has disappeared in the metric components (9.5), which
      are thus perfectly regular as r → 2m.
          The coordinates U and V are null coordinates, since u and v are (cf. Sec. 6.3.2) and Eq. (9.4)
      shows that the hypersurfaces of constant U (resp. V ) obviously coincide with those of constant
      u (resp. v). To cope with a timelike-spacelike coordinate system instead, let us introduce on
      MI the pair (T, X) such that U is T retarded by X and V is T advanced by X:
                                                            
                            U =T −X                          T = 1 (U + V )
                                                                     2
                                                 ⇐⇒                                                (9.8)
                            V =T +X                          X = 1 (V − U )
                                                                      2
      Since the range of U on MI is (−∞, 0) and that of V is (0, +∞), the range of (T, X) is ruled
      by T < X, T > −X and X > 0. In other words, the coordinates (T, X) span the following
      quarter of R2 (cf. Fig. 9.2):
                             32m3 −r/2m
                                        −dT 2 + dX 2 + r2 dθ2 + sin2 θ dφ2 .                         (9.10)
                                                                         
                        g=       e
                               r
      This relation is a direct consequence of (9.6) and (9.8). We may rewrite it as F (r/2m) = X 2 −T 2 ,
      where F is the function defined by
      The graph of F is shown in Fig. 9.3. We see clearly that F is a bijective map. In particular, F
      induces a bijection between (1, +∞) (the range of r/2m on MI ) and (0, +∞) (the range of
                                     9.2 Kruskal-Szekeres coordinates                                          285
                                                     T
                                                 3
                                                                            I
                                                                                    X
                        3        2         1                  1         2       3
Figure 9.2: Submanifold MI in the Kruskal-Szekeres coordinates (T, X): MI is covered by the Schwarzschild-
Droste grid (in blue): the solid lines have t = const (spaced apart by δt = m), while the dashed curves have
r = const (spaced apart by δr = m/2). [Figure generated by the notebook D.4.16]
                                 F(x)
                                 7
                                 6
                                 5
                                 4
                                 3
                                 2
                                 1
                                                                                           x
                                              0.5         1.0            1.5         2.0
                                 1
1.0
                                0.5
                                                                                                   x
                        1                     1           2          3           4             5
Figure 9.4: Rescaled Lambert function W = F −1 , defined by (9.14) and obeying eW (x) (W (x) − 1) = x.
                                  p r                             
                                                    t                           X+T
                                                                                    
                    T = er/4m 2m
                                        − 1 sinh 4m                t = 2m ln X−T
                                                                   
          MI :                                            ⇐⇒                                    (9.18)
                    X = er/4m p r − 1 cosh t 
                                                                   r = 2m W (X 2 − T 2 ).
                                                                   
                                      2m            4m
                                   9.2 Kruskal-Szekeres coordinates                                        287
Note that we have used the identity artanh x = 1/2 ln [(1 + x)/(1 − x)]. The curves of constant
t and constant r in the (T, X) plane are drawn in Fig. 9.2. The fact that the curves of constant
t are straight lines from the origin follow immediately from Eq. (9.17).
Remark 1: Given the properties of the cosh and sinh functions, it is clear on these expressions that the
constraints (9.9) are satisfied.
Remark 2: In expression (9.16) the metric components gT T and gXX depend on both X and T ; this
shows that neither ∂T nor ∂X coincide with a Killing vector. In other words, the coordinates (T, X)
are not adapted to the spacetime symmetries, contrary to the Schwarzschild-Droste coordinates or to
the Eddington-Finkelstein ones.
                                   t̃ = 4m ln(T + X) − r                                        (9.20a)
                                           r     
                                    er/2m      − 1 = X 2 − T 2.                                 (9.20b)
                                    |      2m
                                           {z     }
                                         F (r/2m)
                                                              T
                                                          3
                                                                                      0
                                                                                 r=
                                                          2
                                                                                      t̃ = 2m
                                                          1
                                                                                     t˜= 0
                                                                                                   X
                               3         2         1                  1         2             3
                                                                                    t˜=
                                                                                          −
                                                          1                                   2m
      Figure 9.5: Domain of ingoing Eddington-Finkelstein coordinates, MIEF = MI ∪ H ∪ MII , depicted in terms
      of the Kruskal-Szekeres coordinates (T, X): the solid red curves have t̃ = const (spaced apart by δ t̃ = m), while
      the dashed red curves have r = const (spaced apart by δr = m/2). [Figure generated by the notebook D.4.16]
      Hence the relation between the ingoing Eddington-Finkelstein coordinates and the Kruskal-
      Szekeres ones on MIEF :
                   h                    i
        T = er/4m cosh 4m − 4m e−t̃/4m
                             t̃    r
                                                     
                                                     t̃ = 2m [2 ln(T + X) − W (X 2 − T 2 )]
                    h                    i ⇐⇒ 
        X = er/4m sinh t̃ + r e−t̃/4m
       
                            4m    4m
                                                        r = 2m W (X 2 − T 2 )
                                                                                              (9.23)
      The various subsets of MIEF correspond then to the following coordinate ranges (cf. Fig. 9.5):
                                       MI :  X > 0 and − X < T < X                                              (9.24a)
                                       H :   X > 0 and T = X                                                    (9.24b)
                                                      √
                                       MII : |X| < T < X 2 + 1.                                                 (9.24c)
          Since the relation between IEF coordinates and Kruskal-Szekeres ones is the same in MII
      as in MI (being given by (9.23) in both cases), we conclude that expressions (9.10) and (9.16)
      for the metric components with respect to Kruskal-Szekeres coordinates are valid in all MIEF .
          Let us determine the relation between the Kruskal-Szekeres coordinates and the Schwarz-
      schild-Droste ones in MII . Since r < 2m in MII , Eq. (6.32) gives
                                                            r
                                                                    r
                                     MII : e  t̃/4m
                                                    =e t/4m
                                                              1−      ,
                                                                   2m
                                        9.2 Kruskal-Szekeres coordinates                                          289
                                                          T
                                                      3
                                                                               0
                                                                            r=
                                                      2
                                                      1
                                                              II                I
                                                                                        X
                        3           2          1                    1      2        3
Figure 9.6: Schwarzschild-Droste coordinates in MSD = MI ∪ MII depicted in terms of the Kruskal-Szekeres
coordinates (T, X): the solid blue curves have t = const (spaced apart by δt = m), while the dashed blue curves
have r = const (spaced apart by δr = m/2). [Figure generated by the notebook D.4.16]
We obtain then
                                                                       
                                          r            t                                T +X
                                                                                              
                T = er/4m 1 −
                          p
               
                                         2m
                                               cosh   4m                 t = 2m ln
                                                                        
                                                                                        T −X
    MII :                                                          ⇐⇒                                   (9.25)
                                           r           t
                X = er/4m p1 −
                                                                       r = 2m W (X 2 − T 2 ).
                                                                        
                                          2m
                                               sinh   4m
This is to be compared with (9.18). The curves of constant t and constant r in the (T, X) plane
are drawn in Fig. 9.6, which extends Fig. 9.2 to MII .
    As discussed in Sec. 6.3.4, one approaches a curvature singularity as r → 0. According to
(9.23) or (9.25), this corresponds to X 2 − T 2 → −1 (see also√Fig. 9.3), with T > 0. Hence, in
the (T, X) plane, the curvature singularity is located at T = X 2 + 1, i.e. at the upper branch
of the hyperbola T 2 − X 2 = 1.
290                     Maximal extension of Schwarzschild spacetime
T = −X + V, (9.26)
            where V is a positive constant (the constraint V > 0 following from (9.21)), so that each
            geodesic of this family can be labelled by (V, θ, φ);
T = X + U, (9.27)
            where U is an arbitrary real constant, so that each geodesic of this family can be labelled
            by (U, θ, φ).
      In particular, the Schwarzschild horizon H is generated by the outgoing radial null geodesics
      having U = 0: Eqs. (9.27) and (9.11) clearly imply r = 2m for U = 0, i.e. X = T . The outgoing
      radial null geodesics not lying on H , which are denoted L(u,θ,φ)
                                                                  out
                                                                        in Sec. 6.3.5, have an equation
      in terms of the IEF coordinates given by Eq. (6.51): t̃ = r + 4m ln |r/2m − 1| + u, where the
      constant u is related to U by
                                            U = −e−u/4m on MI                                       (9.28a)
                                            U = 0 on H                                              (9.28b)
                                            U = e−u/4m      on MII .                                 (9.28c)
                                                        T
                                                    3
                                                                             0
                                                                          r=
                                                    2
                                                                                       X
                         3         2        1                  1         2         3
Figure 9.7: Radial null geodesics in MIEF = MI ∪ H ∪ MII depicted in terms of the Kruskal-Szekeres
coordinates (T, X): the solid lines correspond to the outgoing family, with u spanning [−6m, 8m] (with steps
δu = 2m), from the left to the right in MII and from the right to the left in MI ; the dashed lines correspond to
the ingoing family, with v spanning [−8m, 6m] (with steps δv = 2m) from the left to the right. [Figure generated
by the notebook D.4.16]
affine parameter r by extending the spacetime since r = 0 marks a spacetime singularity (cf.
Sec. 6.3.4).
    On the other hand, the outgoing radial null geodesics are limited by the constraint T +X > 0,
which corresponds to r > 2m in MI , with r increasing towards the future, and to r < 2m in
MII , with r decreasing towards the future. Thus all outgoing radial null geodesics terminate
towards the past at the finite value 2m of the affine parameter r (the left end point of the solid
lines in Fig. 9.7) and are therefore incomplete geodesics. However, contrary to ingoing radial
null geodesics, they can be extended since r = 2m does not mark any spacetime singularity.
More precisely, the limit at which outgoing radial null geodesics terminate is T = −X, which
by virtue of (9.11) yields r = 2m. This does not correspond to the Schwarzschild horizon H ,
since for the latter T = X, but rather to t̃ → −∞, as it is clear when comparing Fig. 9.7 with
Fig. 9.5.
    Another hint regarding the extendability of (MIEF , g) is the fact that the Killing horizon
H is non-degenerate, having a non-zero surface gravity (cf. Sec. 3.3.6); the latter has been
computed in Example 10 of Chap. 2: κ = 1/4m. Now, we have seen in Sec. 3.4 that non-
degenerate Killing horizons have incomplete null generators and, if they can be extended,
they must be part of a bifurcate Killing horizon (Property 3.15). In the present case, the null
generators of H are nothing but outgoing radial null geodesics. They are thus as incomplete
as those discussed above, i.e. those that admit r as an affine parameter.
292                       Maximal extension of Schwarzschild spacetime
           The possibility of spacetime extension beyond MIEF is clear on the metric element (9.16):
      it is invariant by the transformation
                                       Φ:     R2       −→              R2
                                                                                                         (9.29)
                                            (T, X) 7−→ (−T, −X).
      Thus we may include the part T + X < 0 by adding a copy of MIEF , symmetric to the original
      one with respect to the “origin” (T, X) = (0, 0). The whole spacetime manifold is then the
      following open subset of R2 × S2 :
      where (T, X, θ, φ) is the canonical coordinate system on R2 ×S2 , called in this context Kruskal-
      Szekeres coordinates. The metric g on the whole M is then defined by (9.16):
                                                                                                
                 2              4                  2     2
                                                                   2    2 2    2       2     2
                                                                                                
         g = 4m                                 −dT + dX + W (X − T ) dθ + sin θ dφ                ,
                     X 2 − T 2 + eW (X 2 −T 2 )
                                                                                                (9.31)
      where W is the rescaled Lambert function defined by (9.14) (cf. Fig. 9.4); it is the inverse of
      the function x 7→ ex (x − 1), which establishes a bijection from (0, +∞) to (−1, +∞). Note
      that the metric components gαβ appearing in Eq. (9.31) are regular functions of (T, X) in all
      M . In particular, the denominator X 2 − T 2 + eW (X −T ) of gT T and gXX never vanishes for
                                                             2 2
      X 2 − T 2 > −1, which is the range of variation of (T, X) on M according to Eq. (9.30). For
      |T | = |X|, and in particular at (T, X) = (0, 0), one has gXX = −gT T = 16m2 /e, given that
      W (0) = 1 (cf. Fig. 9.4),
           Let us define the following open subsets of M , which are respectively the images of MI
      and MII by the reflection through the origin (9.29):
      On MIII ∪ MIV , one may introduce coordinates (t′ , r′ , θ, φ) of Schwarzschild-Droste type; they
      are related to the Kruskal-Szekeres coordinates by formulas analogous to (9.18) and (9.25),
      simply changing T to −T and X to −X:
                                    q                                      
                              r′ /4m    r′             t′
                                                            
                                                                             t′ = 2m ln   X+T
                                                                                                  
                      T = −e
                     
                                       2m
                                           − 1 sinh   4m
                                                                            
                                                                                           X−T
        MIII :                       q                              ⇐⇒                                   (9.33)
                      X = −er′ /4m r′ − 1 cosh        t′                    r′ = 2m W (X 2 − T 2 ).
                                                                          
                                        2m            4m
                                    q                                      
                              r′ /4m       r′          t′
                                                            
                                                                             t′ = 2m ln   T +X
                                                                                                  
                      T = −e
                                     1 − 2m  cosh    4m
                                                                            
                                                                                           T −X
        MIV :                        q                             ⇐⇒                                    (9.34)
                      X = −er′ /4m 1 − r′ sinh        t′                    r′ = 2m W (X 2 − T 2 ).
                                                                          
                                           2m         4m
                                           9.3 Maximal extension                                                 293
                                                      T
                                                  3
                                                                            0
                                                                        r=
                                                  2
                              III                         II                  I
                                                                                      X
                        3           2      1                   1       2          3
                                                      IV
                                                  1
                                                                           r0 =
                                                                             0
Figure 9.8: Kruskal diagram: Schwarzschild spacetime M depicted in terms of Kruskal-Szekeres coordinates
(T, X). Each point in this diagram, including the one at (T, X) = (0, 0), is actually a sphere S2 , spanned by
the coordinates (θ, φ). Solid lines denote the hypersurfaces t = const in MI and MII and the hypersurfaces
t′ = const in MIII and MIV , whiles dashed curves denote the hypersurfaces r = const in MI and MII and the
hypersurfaces r′ = const in MIII and MIV . The bifurcate Killing horizon is marked by thick black lines, while
the singularities at r = 0 and r′ = 0 are depicted by the heavy dashed brown curve. [Figure generated by the
notebook D.4.16]
Remark 1: The extended manifold M is not just the union MI ∪ MII ∪ MIII ∪ MIV , since the latter
does not contain the hypersurfaces T = ±X (cf. the strict inequalities in Eqs. (9.24) and (9.32)), which
are parts of M according to the definition (9.30). Actually, we have
      The range of (U, V ) of M is deduced from the constraint T 2 − X 2 < 1 [cf. Eq. (9.30)]: since
      T 2 − X 2 = U V , we get:
         The expression of the metric with respect to the null coordinates X α̂ = (U, V, θ, φ) is
      deduced from (9.31):
                                                                                           
                                              4
                                2                                         2  2     2    2
                                                                                                       (9.38)
                                                                                          
                      g = 4m                             dU dV + W (−U V ) dθ + sin θ dφ      .
                                        U V − eW (−U V )
                                            32m3 −r/2m
                                                       dU dV + r2 dθ2 + sin2 θ dφ2 ,                   (9.39)
                                                                                  
                                 g=−            e
                                              r
r = 2m W (−U V ). (9.40)
      Remark 2: In Sec. 9.3.1, we have distinguished the coordinate r in MI ∪ MII from the coordinate r′ in
      MIII ∪ MIV . Here, r is the function (9.40) of (U, V ), which has the same expression in MI ∪ MII and
      MIII ∪ MIV . There is no need to make any distinction. Hence there is no mention of r′ in (9.39).
      Historical note : The Kruskal-Szekeres coordinates have been introduced in 1960 independently by
      Martin Kruskal [326] and George Szekeres [472]. Actually the coordinates introduced by Szekeres
                  √        √
      were2 (2T / e, 2X/ e). Both Kruskal and Szekeres have used these coordinates to construct the
      maximal extension of Schwarzschild spacetime. Its graphical representation in the (X, T ) plane (the
      Kruskal diagram, cf. Fig. 9.8) has been presented by Kruskal (Fig. 2 of Ref. [326]). However, the first
      author to obtain the maximal extension of Schwarzschild spacetime is John Synge in 1950 [469], i.e. 10
         2
             They are denoted by (v, u) in Szekeres’ article [472].
                                               9.3 Maximal extension                                                   295
years before Kruskal and Szekeres. Synge used coordinates (T ′ , X ′ ) whose relation to Schwarzschild-
Droste coordinates is more complicated than the Kruskal-Szekeres one: T ′ = R(r) sinh 4m          t
                                                                                                       and
                                                                                                     
                                           h                 q              i
X = R(r) cosh 4m , with R(r) := 2m arcosh 2m + 2m 2m − 1 ; compare with (9.18). Albeit
  ′                 t
                                                   p r          r   r
                                                                          
                                                                                                    −1/2
looking complicated, R(r) is nothing but the primitive vanishing at r = 2m of r 7→ 2m        r
                                                                                                −1        .
Interestingly, in his article [472], Szekeres says that the transformations (9.18) “are essentially due to
Synge”,
      pprobably because they differ only in the choice of the function R(r), the latter being RKS (r) =
er/4m    r/2m − 1 for Kruskal-Szekeres coordinates. For this reason, both coordinate systems share
some similarities: in Synge diagram (Figs. 8 and 9 in Ref. [469]), the bifurcate horizon appears as the
two bisector lines T ′ = ±X ′ and the singularity r = 0 as the hyperbola T ′2 − X ′2 = π 2 m2 (compare
with T 2 − X 2 = 1 for Kruskal-Szekeres coordinates). A major difference is that Synge diagram is not
“conformal”: the radial null geodesics are generally not lines with ±45◦ slope. Even, in some regions,
the coordinate X ′ ceases to be spacelike3 . The maximal extension of Schwarzschild spacetime has also
been found by Christian Fronsdal [205] in 1959, not via any explicit change of coordinates but rather via
an isometric embedding of the spacetime in the 6-dimensional Minkowski spacetime.
                               ∂T               ∂X              ∂θ                  ∂φ
                        ξT =      ,     ξX =       ,     ξθ =      = 0,      ξφ =      = 0.
                               ∂t               ∂t              ∂t                  ∂t
Given the coordinate transformation laws (9.18) and (9.25), we get in MI and MII :
                                         1                  1
                                ξT =       X,      ξX =       T,      ξ θ = ξ φ = 0.
                                        4m                 4m
Hence in MI ∪ MII ,
                                                   1
                                            ξ=       (X ∂T + T ∂X ) .                                         (9.42)
                                                  4m
Now, this formula defines a smooth vector field in all M . Moreover, in MIII ∪ MIV , this vector
coincides with ∂t′ since ξ T = ∂T /∂t′ and ξ X = ∂X/∂t′ , with the partial derivatives with
respect to t′ evaluated from (9.33)-(9.34). Hence the vector field ξ defined by (9.42) is a Killing
vector field of maximal extension (M , g). This vector field is depicted in Fig. 9.9.
    The bifurcate Killing horizon with respect to ξ that extends H is Hˆ = H1 ∪ H2 , where
                                                            T
                                                        3
                                                                                  0
                                                                                 r=
                                                        2
                                                                                            X
                              3        2         1                 1         2          3
                                                                                 r0 =
                                                                                   0
      Figure 9.9: Killing vector field ξ on the extended Schwarzschild manifold. [Figure generated by the notebook
      D.4.16]
      The bifurcate Killing horizon Hˆ is depicted in black in Fig. 9.9. The Schwarzschild horizon
      H is the part of H1 defined by X > 0. In terms of the null coordinates (U, V ) introduced in
      Sec. 9.3.2, we have, given (9.36),
                                           Hˆ : U = 0 or V = 0                                            (9.43a)
                                           H : U = 0 and V > 0.                                           (9.43b)
rS = 2m . (9.44)
      Moreover, setting (T, X) = (0, 0) in Eq. (9.42), we recover Property 3.12 (Sec. 3.4.1): the Killing
      vector field vanishes at the bifurcation sphere:
                                                      ξ|S = 0.                                             (9.45)
                                     9.4 Carter-Penrose diagram                                          297
Since for Û , V̂ ∈ (−π/2, π/2), we have cos Û > 0 and cos V̂ > 0, we may write
                                                                                  π            π
U V < 1 ⇐⇒ sin Û sin V̂ < cos Û cos V̂ ⇐⇒ cos(Û + V̂ ) > 0 ⇐⇒ −                  < Û + V̂ < .
                                                                                  2            2
    Since (9.46) yields dU = dÛ / cos2 Û and dV = dV̂ / cos2 V̂ , we deduce immediately from
(9.39) the expression of the metric in terms of the coordinates xα = (Û , V̂ , θ, φ):
                                                              T̂
                                                                   r=0
                                                        1.5
0 + 1.0 II +
                                                        0.5
                                  III                                                        I
                                                                                                              X̂
                 3                2           1                               1          2              3
                                                        0.5
                      0−                                                                          −
                                                        1.0           IV
                                                        1.5
                                                                   r0 = 0
      Figure 9.10: Carter-Penrose diagram of the Schwarzschild spacetime constructed with the compactified
      coordinates (T̂ , X̂). Solid curves denote hypersurfaces of constant Schwarzschild-Droste coordinate t: in region
      MI , from the X̂-axis to the top: t = 0, 2m, 5m, 10m, 20m and 50m, the last two being barely visible; in region
      MII , from the T̂ -axis to the right: t = 0, 2m, 5m, 10m, 20m and 50m, Dashed curves denote hypersurfaces of
      constant Schwarzschild-Droste coordinate r: in region MI , from the left to the right: r = 2.01m, 2.1m, 2.5m
      (almost vertical), 4m, 8m, 12m, 20m and 100m, the last three being barely visible; in region MII , from the bottom
      to the top: r = 1.98m, 1.9m, 1.7m, 1.5m, 1.25m, m, 0.5m and 0.1m. The color code is the same as in Fig. 9.8.
      [Figure generated by the notebook D.4.17]
          Via (9.46) and (9.36), the relation between (T̂ , X̂) and the Kruskal-Szekeres coordinates
      (T, X) is then the same as that between (τ, χ) and (t, r) for Minkowksi spacetime [Eq. (4.16)]:
                                                                  
                                                                             sin T̂
                                                                   T =
                                                                  
             T̂ = arctan(T + X) + arctan(T − X)                  
                                                                          cos T̂ + cos X̂
                                                            ⇐⇒                                 (9.52)
             X̂ = arctan(T + X) − arctan(T − X)                               sin X̂
                                                                   X=                    .
                                                                  
                                                                  
                                                                          cos T̂ + cos X̂
          The maximal extension of Schwarzschild spacetime is depicted with respect to the coordi-
      nates (T̂ , X̂) in Fig. 9.10. Such a plot is called a Carter-Penrose diagram (see the historical
      note p. 300). As the Kruskal diagram (Fig. 9.8), it has the feature of displaying radial null
      geodesics as straight lines with slope ±45◦ . This holds since Û (resp. V̂ ) is a function of U
      only (resp. V only), cf. Eq. (9.46), so that Û (resp. V̂ ) is constant on outgoing (resp. ingoing)
      radial null geodesics. In particular, the bifurcate Killing horizon and the Schwarzschild horizon
      are obtained for specific values of Û and V̂ :
                                             Hˆ : Û = 0 or                V̂ = 0                              (9.53a)
                                             H : Û = 0 and                 V̂ > 0.                            (9.53b)
      These relations follow immediately from (9.43) and (9.46).
                                    9.4 Carter-Penrose diagram                                        299
     We have seen in Sec. 6.4 that the future null infinity I + corresponds to v → +∞ and that
the past null infinity I − to u → −∞ (cf. Fig. 6.6). Since on MI , U = −e−u/4m and V = ev/4m
[cf. Eq. (9.4)], we may write equivalently:
                            I + : V → +∞ and U ∈ (−∞, 0)                                   (9.54a)
                            I − : U → −∞ and V ∈ (0, +∞).                                  (9.54b)
In view of (9.46), we get then:
                                         π             π 
                            I + : V̂ →       and Û ∈ − , 0                                 (9.55a)
                                         2             2 π
                                           π
                            I−    : Û → −     and V̂ ∈ 0,   .                             (9.55b)
                                           2               2
    By symmetry, the extension MIII ∪ MIV of Schwarzschild spacetime has the following null
infinity:
                             ′+        π              π 
                          I : Û →         and V̂ ∈ − , 0                           (9.56a)
                                       2                 2 
                              −          π                 π
                          I ′ : V̂ → −       and Û ∈ 0,       .                    (9.56b)
                                         2                  2
      the minus sign taking into account that sin 2Û sin 2V̂ approaches zero via negative values near
      I . A first issue is that the square root in (9.60) makes Ω not differentiable on I , where either
      sin 2Û = 0 or sin 2V̂ = 0. In other words, dΩ is diverging on I . Suppose we accept this and
      are ready to introduce a slight deviation (given that Ω2 , which is involved in (9.59), is smooth)
      from the definition given in Sec. 4.3. Then the conformal metric should be
                                         
                                       2m
                               2
                                             dÛ dV̂ − r2 sin 2Û sin 2V̂ dθ2 + sin2 θ dφ2 .       (9.61)
                                                                                           
                   g̃ = −64m 1 −
                                        r
      Near I , r → +∞ and we have g̃Û V̂ → −32m2 . On the contrary, g̃θθ is of the type “∞ × 0”; in
      order to determine its behavior, let us rewrite it as follows:
g̃θθ = −r2 sin 2Û sin 2V̂ = −4r2 sin Û sin V̂ × cos Û cos V̂ ,
                                                                             e−r/2m
                                          cos Û cos V̂ = − sin Û sin V̂            .
                                                                            r/2m − 1
      Hence
                                                                           re−r/2m
                                              g̃θθ = 8m sin2 Û sin2 V̂
                                                                          1 − 2m/r
      and (9.61) becomes
                                                            re−r/2m
                           
                         2m
                    2
                              dÛ dV̂ + 8m sin2 Û sin2 V̂          dθ2 + sin2 θ dφ2 .                    (9.62)
                                                                                    
        g̃ = −64m 1 −
                          r                                1 − 2m/r
      On this expression, we can read directly the value of the conformal metric at I , where r → +∞,
      2m/r → 0, re−r/2m → 0 and sin2 Û → 1 or sin2 V̂ → 1:
                                                            I
                                                      g̃ = −64m2 dÛ dV̂ .                                (9.63)
      This bilinear form is clearly degenerate (cf. Sec. A.3.1). Therefore, g̃ is not a regular metric
      on the whole manifold M ∪ I . We conclude that (9.60)-(9.61) does not define a conformal
      completion of (M , g).
      Historical note : The first compactified conformal diagram of the (maximal extension of) Schwarzschild
      spacetime has been constructed by Brandon Carter in 1966 [88], using the same coordinates (T̂ , X̂) as
      here4 : compare Fig. 9.10 with Fig. 1c of Ref. [88]. In his article, Carter notes that “the manner in which
         4
             (T̂ , X̂) are denoted (ψ, ξ) by Carter [88].
                                       9.4 Carter-Penrose diagram                                           301
the distant flat-space parts (...) are compressed into finite parts of the (ξ, ψ) plane by the coordinate
transformations recalls the conformal diagrams used by R. Penrose” in 1964 [404], the diagrams in
Penrose’s article [404] regarding Minkowski and de Sitter spacetimes only. This justifies the name
Carter-Penrose diagram used in the literature (e.g. [204]) for the graphical representation shown in
Fig. 9.10, while the mere Penrose diagram or conformal diagram is quite common; one encounters as
well the name Penrose-Carter diagram (e.g. [416]).
                          cosh(tan Ũ )                            cosh(tan Ṽ )
                   dU =                 dŨ        and     dV =                  dṼ ,
                             cos2 Ũ                                  cos2 Ṽ
so that the metric expression in terms of the coordinates xα = (Ũ , Ṽ , θ, φ) is easily deduced
from (9.39)-(9.40):
      As we did for (Û , V̂ ), let us rewrite (9.67) in a form that is better adapted to the null asymptotics.
      Given (9.68) and (9.15), we have
                                          r       
                                  er/2m        − 1 = − sinh(tan Ũ ) sinh(tan Ṽ ),                     (9.69)
                                          2m
      from which we get
                                               
                              2m −r/2m       2m                1
                                 e     =− 1−                                  .
                               r              r   sinh(tan Ũ ) sinh(tan Ṽ )
      Hence (9.67) becomes
                                      
                                   2m2                     dŨ dṼ
                     g = 16m 1 −
                                    r     tanh(tan Ũ ) tanh(tan Ṽ ) cos2 Ũ cos2 Ṽ
                         +r2 dθ2 + sin2 θ dφ2 .                                                         (9.70)
                                             
      Given the values (9.66) of Ũ and Ṽ near I , tanh(tan Ũ ) tanh(tan Ṽ ) does not vanish there. A
      natural choice of conformal factor is then
Note that the first line in (9.73) corresponds to M , identified as a subset of R2 ×S2 [cf. Eq. (9.65)]
and that the definitions of I + , I − , I ′ + and I ′ − are in agreement with (9.66). It is clear that
M˜ is a manifold with boundary and that
                                                        +     −
                             ∂ M˜ = I := I + ∪ I − ∪ I ′ ∪ I ′ .                                 (9.75)
Moreover, the scalar field Ω defined by (9.71) satisfies Ω ≥ 0 on M˜, along with Ω = 0 on I
and dΩ ̸= 0 on I . The last property follows from
which implies dΩ|I + = − cos Ũ dṼ ̸= 0, dΩ|I − = cos Ṽ dŨ ̸= 0, dΩ|I ′ + = − cos Ṽ dŨ ̸=
0 and dΩ|I ′ − = cos Ũ dṼ ̸= 0. Hence the conditions 1, 3 and 4 of the definition of a conformal
completion given in Sec. 4.3 are fulfilled. There remains to check condition 2, namely that
the tensor g̃ defined by (9.72) is a regular metric on the whole M˜. This was the main failing
point in the attempt of Sec. 9.4.2. Since Ω2 > 0 on M , g̃ is well-behaved on M . Let us thus
examine its behavior on I . We shall focus on I + , the behavior on the other parts of I
being obtained by some trivial symmetry. As one approaches I + , r → +∞, Ṽ → π/2 and
tanh(tan(Ṽ )) → 1; accordingly we read from (9.72) that
                                            I+       8m2
                                      g̃Ũ Ṽ =                 .
                                                  tanh(tan Ũ )
Besides, we have g̃θθ = r2 cos2 Ũ cos2 Ṽ , which is of the type “+∞ × 0” near I + . Noticing
that tan Ṽ ∼ 1/ cos Ṽ when Ṽ → π/2, we get from (9.69)
                                               rer/2m
                                 
                             1                                           π
                    sinh            ∼−                       when Ṽ → .
                           cos Ṽ        2m sinh(tan Ũ )                2
                          √
Since arsinh x = ln(x + x2 + 1) ∼ ln(2x) when x → +∞, we obtain
                                 rer/2m
                                            
            1                                       r      r                      
                 ∼ ln −                        =      + ln       − ln − sinh(tan Ũ )
          cos Ṽ            m sinh(tan Ũ )       2m         m
                       r                    π
                 ∼          when Ṽ → .
                      2m                    2
Hence cos2 Ṽ ∼ 4m2 /r2 and g̃θθ ∼ 4m2 cos2 Ũ . Gathering the above results, we have
                                                                              
                 I+              4
                       2                              2       2     2      2
                                                                                                 (9.76)
                                                                             
              g̃ = 4m                    dŨ dṼ + cos Ũ dθ + sin θ dφ          .
                           tanh(tan Ũ )
Since cos2 Ũ ̸= 0 on I + [cf. Eq. (9.74a)], this bilinear form is non-degenerate. Moreover, since
tanh(tan Ũ ) < 0 on I + [again by (9.74a)], its signature is (−, +, +, +). We conclude that g̃ is
a well-behaved metric on the whole manifold M˜. This completes the demonstration of the
following result:
304                       Maximal extension of Schwarzschild spacetime
        The pair (M˜, g̃), with M˜ defined by (9.73)-(9.74) and g̃ defined by (9.72) is a conformal
        completion of the maximally extented Schwarzschild spacetime (M , g), the conformal fac-
        tor being given by (9.71). The Penrose-Frolov-Novikov coordinates (Ũ , Ṽ , θ, φ) employed
        in this construction are related to the null Kruskal-Szekeres coordinates (U, V, θ, φ) by
        (9.64).
      Remark 1: At first sight, the metric g̃ given by (9.72) looks degenerate at the bifurcate Killing horizon
      H , since 1 − 2m/r = 0 there. But one shall not forget that on H , which is defined by (Ũ = 0 or
      Ṽ = 0), one has tanh(tan Ũ ) tanh(tan Ṽ ) = 0, which compensate the vanishing of 1 − 2m/r in the
      term g̃Ũ Ṽ . Actually, to deal with g̃ near H , it is more appropriate to use the form that is deduced from
      (9.67) and (9.71):
                32m3 −r/2m
                           cosh(tan Ũ ) cosh(tan Ṽ ) dŨ dṼ + r2 cos2 Ũ cos2 Ṽ dθ2 + sin2 θ dφ2 . (9.77)
                                                                                                    
       g̃ = −       e
                  r
      Remark 2: The conformal completion constructed above cannot be analytically extended “beyond” I ,
      because the function Ṽ 7→ 1/ tanh(tan Ṽ ), which appears in (9.72), is C ∞ but not analytic at Ṽ = π/2.
      It is possible to construct an analytic conformal completion, but it involves more complicated coordinate
      transformations. The latter start, not from the Kruskal-Szekeres coordinates (U, V ), but from the null
      coordinates (u, v) defined by (9.1). We refer to the article [254] for details.
          To depict M˜, let us introduce “time+space” coordinates (T̃ , X̃), which are related to (Û , V̂ )
      in exactly the same way as (T̂ , X̂) are related to (Û , V̂ ) [cf. Eq. (9.50)]:
                                                        
                                 T̃ = Ũ + Ṽ            Ũ = 1 (T̃ − X̃)
                                                                      2
                                                   ⇐⇒                                              (9.78)
                                 X̃ = Ṽ − Ũ            Ṽ = 1 (T̃ + X̃).
                                                                      2
      The picture of (M , g) in the (T̃ , X̃) plane is shown in Fig. 9.11. We shall call it a regular
      Carter-Penrose diagram of Schwarzschild spacetime. As the singular Carter-Penrose diagram
      of Fig. 9.10, it has the feature of displaying radial null geodesics as straight lines with slope
      ±45◦ , since Ũ (resp. Ṽ ) is a function of U only (resp. V only) [cf. Eq. (9.64)]. In particular, the
      bifurcate Killing horizon and the Schwarzschild horizon are defined by:
                          Hˆ : Ũ = 0 or         Ṽ = 0 ⇐⇒ T̃ = X̃            or T̃ = −X̃                  (9.80a)
                          H : Ũ = 0 and           Ṽ > 0 ⇐⇒ T̃ = X̃           and     T̃ > 0.             (9.80b)
                                                         T̃
                                                   1.5        r=0
0 + 1.0 II +
                                                   0.5
                          III                                                           I
                                                                                                         X̃
           3              2             1                              1            2              3
                                                   0.5
                0−                                                                           −
                                                   1.0           IV
                                                   1.5        r0 = 0
Figure 9.11: Carter-Penrose diagram of the Schwarzschild spacetime based on the Penrose-Frolov-Novikov
coordinates. Solid curves denote the same hypersurfaces of constant Schwarzschild-Droste coordinate t as in
Fig. 9.10: in region MI , from the X̂-axis to the top: t = 0, 2m, 5m, 10m, 20m and 50m; in region MII , from the
T̂ -axis to the right: t = 0, 2m, 5m, 10m, 20m and 50m, Dashed curves denote the same hypersurfaces of constant
Schwarzschild-Droste coordinate r as in Fig. 9.10: in region MI , from the left to the right: r = 2.01m, 2.1m,
2.5m, 4m, 8m, 12m, 20m and 100m; in region MII , from the bottom to the top: r = 1.98m, 1.9m, 1.7m, 1.5m,
1.25m, m, 0.5m and 0.1m. The color code is the same as in Figs. 9.8 and 9.10. Contrary to the Carter-Penrose
of Fig. 9.10, this one is associated to a regular conformal completion at null infinity of Schwarzschild spacetime.
[Figure generated by the notebook D.4.18]
    At first sight, the main difference with the “standard” Carter-Penrose diagram of Fig. 9.10
is the more complicated shape of the boundary around the T̃ -axis. This follows from the
third condition in (9.79), which is more involved than the third condition in (9.51). Actually
this boundary corresponds to the curvature singularity limit r → 0 or r′ → 0. Indeed, from
Eq. (9.41), we have
                                      r = 0 ⇐⇒ U V = 1.                                  (9.81)
In terms of the coordinates (Û , V̂ ), we have then [cf. Eq. (9.46)]
On the contrary, in terms of the coordinates (Ũ , Ṽ ), Eq. (9.81) becomes [cf. Eq. (9.64)]
                                r = 0 ⇐⇒ sinh(tan Ũ ) sinh(tan Ṽ ) = 1,
306                       Maximal extension of Schwarzschild spacetime
      This explains the more complex boundary of Fig. 9.11 diagram with respect to Fig. 9.10 diagram.
      Remark 3: The shape of the Carter-Penrose diagram in Frolov & Novikov’s book (Fig. 5.2 of Ref. [202];
      see also Fig. 10.6 of Ref. [204]) differs slightly from the diagram obtained here (Fig. 9.11). This is because
      the coordinates used by Frolov & Novikov are constructed from the Szekeres’ version (up to a factor
                                                           √               √
      2) of Kruskal-Szekeres coordinates: T ′ = T / e and X ′ = X/ e (cf. the historical note on p. 294).
      Accordingly, in Frolov & Novikov’s version, one shall replace the 1 in the right-hand side of Eq. (9.83)
      by 1/e, yielding to a different shape of the boundary r = 0.
      Remark 4: As noticed by Frolov and Novikov [202] (see their Sec. 5.1.3), one can perform some
      coordinate transformation from (Ũ , Ṽ ) to get a Carter-Penrose diagram with a straight line for the
      boundary r = 0.
            Besides the shape of the boundary r = 0, another difference between the Carter-Penrose
      diagram based on Penrose-Frolov-Novikov coordinates (Fig. 9.11) and the “standard” diagram
      of Fig. 9.10 is that the t = const hypersurfaces of the former (solid curves in Fig. 9.11) are all
      tangent to the horizontal axis when X̃ → ±π. On the contrary, the same hypersurfaces in
      Fig. 9.10 reach the point (T̂ , X̂) = (0, ±π) with a finite slope. We note that in this respect, the
      Carter-Penrose diagram of Fig. 9.11 is similar to the conformal diagram of Minkowski spacetime,
      as shown in Fig. 4.3, and therefore display correctly the asymptotic flatness structure. The
      failure of diagram of Fig. 9.10 to reproduce this behavior reflects the fact that the coordinates
      (T̂ , X̂) are singular on the boundary, as discussed in Sec. 9.4.2.
      Historical note : The transformation (9.64) from (U, V ) to (Ũ , Ṽ ) has been suggested by Roger Penrose
      in 1967 (p. 209 of Ref. [407]) to get a “version of the Kruskal diagram in which conformal infinity is
      represented”. The figure in Penrose’s article (Fig. 37 in Ref. [407]) ressembles Fig. 9.11, except that the
      curve r = 0 does not look tangent to the lines T̃ = ±X̃, as it is in Fig. 9.11. The same coordinate
      change (9.64) has been discussed in details by Valeri P. Frolov and Igor D. Novikov in their 1998 textbook
      [202] (Sec. 5.1.3), stressing that it leads to a regular conformal completion, contrary to the “standard”
      coordinate change (9.46) introduced by Brandon Carter [88] and used in Hawking & Ellis [266] and
      MTW [371] textbooks.
      with
                                                  +                                     −
                              I∗+ := I + ∪ I ′            and       I∗− := I − ∪ I ′ .                       (9.85)
      It is clear that any point of I∗+ is the furure end point of a future-directed null curve from M ,
      while any point of I∗− is the past end point of a future-directed null curve to M (cf. Fig. 9.11).
                                     9.5 Einstein-Rosen bridge                                       307
Hence the writing (9.84) matches the definition of a conformal completion at null infinity given
in Sec. 4.3.1. I∗+ is then the future null infinity of (M , g) and I∗− its past null infinity.
    We are thus in position to apply the definitions of a black hole and a white hole given in
Sec. 4.4.2. The causal past of I∗+ is J − (I∗+ ) = MI ∪ MIII ∪ MIV (cf. Fig. 9.11). In view of the
definition (4.37), we get
  The maximal extension of Schwarzschild spacetime admits a black hole region, the interior
  of which is MII . The black hole event horizon is the part of the bifurcate Killing horizon
  Hˆ (cf. Sec. 9.3.3) that has T̃ > 0.
The black hole region has thus the same interior MII as the black hole region of the Schwarzschild
spacetime (MIEF , g) considered in Chap. 6. It differs only a larger boundary: it is not reduced
to the Schwarzschild horizon H , but contains the part X̃ < 0 and T̃ > 0 of Hˆ .
    The novelty with respect to the original Schwarzschild spacetime (MIEF , g) is the existence
of a white hole region. Indeed, the causal future of I∗− is J + (I∗− ) = MI ∪ MII ∪ MIII (cf.
Fig. 9.11). In view of the definition (4.41), we get
  The maximal extension of Schwarzschild spacetime admits a white hole region, the interior
  of which is MIV . The corresponding past event horizon is the part of the bifurcate Killing
  horizon Hˆ (cf. Sec. 9.3.3) that has T̃ < 0.
Consequently
    • if |T0 | < 1, the hypersurface ΣT0 is connected and diffeomorphic to R×S2 , the coordinate
      X spanning R and (θ, φ) spanning S2 .
308                       Maximal extension of Schwarzschild spacetime
                                                            T
                                                        3
                                                                                 0
                                                                             r=
                                                        2                                T0 = 2.0
                                                                                         T0 = 1.5
                                                        1                                T0 = 1.0
                                                                                         T0 = 0.5
                                                                                         T0X= 0.0
                               3        2        1                    1     2          3
                                                                                       T0 = − 0.5
                                                        1                              T0 = − 1.0
                                                                                       T0 = − 1.5
                                                        2                              T0 = − 2.0
                                                                                r0 =
                                                                                  0
      Figure 9.12: Kruskal diagram with the hypersurfaces ΣT0 (defined by T = T0 = const) as blue horizontal
      lines. For |T0 | > 1, the dotted part of ΣT0 corresponds to a region that cannot be embedded isometrically in the
      Euclidean space. When T0 varies, the limit of these regions form the grey dotted curve. [Figure generated by the
      notebook D.4.19]
          • if |T0 p
                   | ≥ 1, ΣT0 has two connected components, defined by X < − T02 − 1 and
                                                                                    p
                   Σ+
                    T0 = {p ∈ ΣT0 ,         X(p) ≥ 0}           and   Σ−
                                                                       T0 = {p ∈ ΣT0 ,       X(p) < 0} .        (9.87)
      For |T0 | < 1, there is a slight asymmetry between the two parts: Σ+   T0 is a manifold with
      boundary (cf. Sec. A.2.2), the boundary corresponding to X = 0, while Σ−
                                                                             T0 is not. For |T0 | ≥ 1,
      ΣT0 and ΣT0 are nothing but the two connected components of ΣT0 .
       +         −
                                      32m3 −r/2m
                                                 dX 2 + r2 dθ2 + sin2 θ dφ2 ,                                   (9.88)
                                                                           
                                   γ=     e
                                        r
      where r is the function of X defined by
      The metric (9.88) is obtained by setting T = T0 and dT = 0 in (9.31). Since r > 0, the
      metric (9.88) is clearly positive definite, i.e. γ is a Riemannian metric and ΣT0 is a spacelike
      hypersurface.
                                                9.5 Einstein-Rosen bridge                                     309
r/m
                        T0 = 0.0
                        T0 = 0.5                           1
                        T0 = 1.0
                        T0 = 1.5
                        T0 = 2.0
                                                                                                X
                    4              3      2        1                    1     2         3   4
Figure 9.13: Function r = r(X) on the hypersurface ΣT0 , for the same values of T0 as in Fig. 9.12. [Figure
generated by the notebook D.4.19]
      The graph of the function r(X) is shown in Fig. 9.13. Once restricted to positive (resp.
negative) values of X, this function is a bijection (X0 , +∞) → (r0 , +∞) (resp. (−∞, −X0 ) →
(r0 , +∞)), where
                                                        
           0            if |T0 | < 1                     2m W (−T 2 ) if |T | < 1
                                                                      0         0
    X0 =                                     and r0 =                                   (9.90)
           T − 1 if |T0 | ≥ 1.                                          if |T0 | ≥ 1.
             p
                 2                                        0
                        0
   We may use the above bijection to introduce coordinates (r, θ, φ) instead of (X, θ, φ) on
each of the two regions Σ+T0 and ΣT0 . Differentiating (9.91) leads to
                                  −
                                                               rer/2m
                                       dX = ±          q                          dr.
                                                                    r
                                                                           
                                                8m2        er/2m   2m
                                                                        − 1 + T02
Substituting in (9.88) we get the expression of the metric on ΣT0 in terms of the coordinates
(xi ) = (r, θ, φ):
                                              −1 2
                                             
                          2m         2 −r/2m
                                                   dr + r2 dθ2 + sin2 θ dφ2 .           (9.92)
                                                                             
                γ = 1−         1 − T0 e
                           r
   5
       Let us recall that W −1 (x) = ex (x − 1).
310                      Maximal extension of Schwarzschild spacetime
      Remark 1: As a check of the above formula, we notice that for T0 = 0 it reduces to the metric of a
      slice t = const in Schwarzschild-Droste coordinates [set dt = 0 in Eq. (6.14)]. This is correct since the
      positive-X half of the hypersurface T = 0 in Kruskal-Szekeres coordinates, i.e. Σ+0 , coincides with the
      hypersurface t = 0 in Schwarzschild-Droste coordinates, as it can be seen by setting T = 0 in Eq. (9.18)
      (see also Fig. 9.12).
      Comparing (9.96) with (9.94), we see that a possible isometric embedding of (Σeq
                                                                                    T0 , q) into
      (R3 , f ) is
                            Φ : Σeq        −→ R3
                                 T0
                                                                                                        (9.97)
                                  (X, φ) 7−→ (r, z, φ) = (r(X), ±Z(r(X)), φ) ,
                                           9.5 Einstein-Rosen bridge                                     311
with the function r(X) is given by Eq. (9.89), the sign ± is + on Σ+,eq
                                                                   T0   and − on Σ−,eq
                                                                                  T0   and the
function Z(r) obeys
                                                           −1
                                                          
                                ′      2  2m      2 −r/2m
                          1 + Z (r) = 1 −    1 − T0 e          .
                                           r
The condition Z(r(0)) = 0 insures the continuity of the embedded surface Φ(Σeq             T0 ), while
Z (r(0)) = +∞ insures that it has a vertical tangent at the junction between Φ(Σ+,eq
  ′
                                                                                            T0 ) and
     −,eq
Φ(ΣT0 ), so that it is a smooth surface. Fortunately, the condition Z (r(0)) = ∞ is automati-
                                                                            ′
cally fulfilled from the second expression of Z ′ (r)2 in (9.98): Z ′ (r)2 clearly diverges at X = 0.
Moreover, in order for the isometric embedding Φ to be well-defined, the right-hand side of
(9.98) must be non-negative. Since the denominator of the last term is manifestly non-negative,
the sign is determined by the numerator. Hence the condition er/2m ≥ T02 , or equivalently,
r ≥ 4m ln |T0 |. (9.100)
For |T0 | ≤ 1, this condition is always fulfilled, since ln |T0 | ≤ 0 and r ≥ 0. For |T0 | > 1, it
implies the existence of a minimal value of r,
    Via (9.89) and the fact that the rescaled Lambert function W is an increasing function (cf.
Fig. 9.4), of inverse F (x) = ex (x − 1), the condition (9.100) can be turned into a condition on
X:
                                                                                           (9.102)
                                                         p
                               |X| ≥ Xemb (T0 ) := |T0 | 2 ln |T0 |.
This limit is shown as the grey dotted curve in Fig. 9.12.
312                        Maximal extension of Schwarzschild spacetime
      Figure 9.14: Flamm paraboloid: isometric embedding in the Euclidean R3 of the spacelike slice T = 0 and
      θ = π/2 of Schwarzschild spacetime. The (x, y) coordinates are the standard Cartesian coordinates of R3 related
      to (r, φ) via x = r cos φ and y = r sin φ. The labels are in units of m. [Figure generated by the notebook D.4.19]
           Summarizing, the minimal value of r on the embedded surface is r0 [cf. Eq. (9.90)] for
      |T0 | ≤ 1 or remb (T0 ) for |T0 | > 1:
                                               
                                                2m W (−T 2 ) if |T | ≤ 1
                                                             0      0
                                  rmin (T0 ) =                                            (9.103)
                                                4m ln |T0 |   if |T0 | > 1.
      Note that rmin (T0 ) is a continuous function, with the peculiar values rmin (0) = 2m and
      rmin (1) = 0. The embedding function Z(r) is found by integration of Z ′ (r), as given by (9.98),
      from rmin (T0 ) to r:                             s
                                           Z r
                                              2m            1 − T02 e−x
                                 Z(r) = 2m                                dx.                  (9.104)
                                             rmin (T0 )   T02 e−x + x − 1
                                                        2m
      The integral cannot be computed exactly in terms of elementary functions, except for T0 = 0,
      where it reduces to                 Z r
                                              2m    dx
                                Z(r) = 2m         √           (T0 = 0).
                                            1       x−1
      Hence                                 r
                                                 r
                                 Z(r) = 4m          −1       (T0 = 0).                     (9.105)
                                               2m
      According to (9.97), the whole surface Φ(Σeq
                                                T0 ) is obtained by considering z = −Z(r) as
      well. The surface equation in terms of the cylindrical coordinates (r, z, φ) of R3 is then
                                          9.5 Einstein-Rosen bridge                                               313
T0 = 0 T0 = 0.5 T0 = 0.9
T0 = 1 T0 = 1.5 T0 = 2
Figure 9.15: Sequence of isometric embeddings in the Euclidean space of spacelike slices of Schwarzschild
spacetime defined by T = T0 and θ = π/2. The slices are those shown in the Kruskal diagram of Fig. 9.12 (except
for T0 = 0.9). The first embedding (T0 = 0) is the Flamm paraboloid depicted in Fig. 9.14. In the disconnected
case (T0 = 1.5 and T0 = 2.0), the distance between the upper and lower parts is arbitrary (chosen here to be
∆z = 1). [Figure generated by the notebook D.4.19]
      Figure 9.16: Same Flamm paraboloid as in Fig. 9.14 but seen from farther away. Despite being more and more
      flat, none of the two sheets is asymptotic to a plane. [Figure generated by the notebook D.4.19]
          When T0 ̸= 0, the integral in (9.104) has to be computed numerically (see Sec. D.4.19 for the
      computation with SageMath). The resulting embedded surfaces Φ(Σeq     T0 ) are shown in Fig. 9.15
      for the values of T0 involved in the Kruskal diagram of Fig. 9.12. When T0 increases from 0, the
      “neck” becomes thinner and thinner. At T0 = 1, it ceases to be connected. As mentioned above,
      for T0 > 1, the surface Σeq
                                T0 can no longer be entirely isometrically embedded in the Euclidean
      3-space. Hence the holes in the central parts of the surfaces for T0 = 1.5 and T0 = 2. These
      holes correspond to the dotted segments in Fig. 9.12 and their radii are given by Eq. (9.101).
      Note that the tangents to the embedded surfaces at their inner boundaries are horizontal.
          The evolution of ΣT0 as T0 increases is not surprising if one remembers that the Kruskal-
      Szekeres time coordinate T is not associated with any timelike Killing vector of Schwarzschild
      spacetime. The sequence shown in Fig. 9.15 can be thought of as representing the dynamics
      of the Schwarzschild wormhole, in particular its “pinching-off” at T0 = 1, which forbids any
      traveler to go through it.
      Remark 3: We have restricted ourselves to slices T = const of Schwarzschild spacetime, with the
      isometric embedding limitation for |T | > 1. We refer the reader to Ref. [134] for more general slices
      and the corresponding embedding diagrams.
      Remark 4: There are many inexact plots of embeddings of spatial sections of Schwarzschild spacetime
      in the literature, including renown textbooks. A first common error is to draw the two ends of the
      embedded surface as asymptotic to flat planes, which a paraboloid is not (the vertical distance between
                                            √
      the two ends grows unbounded, as r, cf. Eq. (9.105) and Fig. 9.16). This is correct from a topological
      point of view, but not from the geometrical one, i.e. the embedding depicted in this way is not an
      isometry. Probably this results from some confusion with asymptotic flatness: it is true that the metric
      (9.94) tends to a flat metric when r → +∞, reflecting the asymptotic flatness of Schwarzschild spacetime,
      but the associated curvature does not decay fast enough to allow the embedded surface to be tangent
      to a plane. A second error regards the embeddings for |T0 | > 1, which are depicted as variants of that
      T0 = 1 (cf. Fig. 9.15), with two spikes at r = 0, simply pushed apart. However, as discussed above, the
      isometric embeddings with T = const cannot reach the region near r = 0 for |T0 | > 1.
                                               9.5 Einstein-Rosen bridge                                   315
Historical note : In 1916, very soon after the publication of Schwarzschild solution [449], the Austrian
physicist Ludwig Flamm (1885-1964) showed that the slice t = const and θ = π/2 in Schwarzschild-
Droste coordinates (t, r, θ, φ) can be isometrically embedded in the Euclidean space as a paraboloid of
revolution obeying Eq. (9.106) [197]. Let us recall that the positive-X part of the hypersurface T = 0
considered here coincides with the hypersurface t = 0 (cf. Remark 1 on p. 310). Although he draw
the whole paraboloid (actually a parabola in a 2-dimensional plot — Fig. 2 of Ref. [197]), Flamm did
not seem to have considered the negative-z part as physically relevant. In other words, he limited his
considerations to MI and did not contemplate any bridge to the extension MIII .
with X > 0, which implies that it is represented by a straight half-line from the origin in the
Kruskal diagram (cf. Fig. 9.8). Similarly, a hypersurface of constant t′ in MIII obeys an equation
identical to (9.107), except for t replaced by t′ and X < 0 [cf. Eq. (9.33)]. Accordingly, for t′ = t,
the union of these two hypersurfaces forms a hypersurface of M ruled by Eq. (9.107), with
X < 0 or X > 0. If we add the points (T, X) = (0, 0) to it (i.e. the bifurcation sphere S
(cf. Sec. 9.3.3), we obtain a connected hypersurface in which X takes all values in the range
(−∞, +∞). Let us call St this hypersurface. In other words, St is the hypersurface of M
defined by Eq. (9.107) with X ∈ R. Note that for t = 0, this hypersurface coincides with the
hypersurface T = 0 introduced in Sec. 9.5.1: S0 = Σ0 . But for t ̸= 0, St ̸= ΣT .
      There are two Schwarzschild-Droste coordinate systems on St : (r, θ, φ) on St ∩ MI and
(r′ , θ, φ) on St ∩ MIII , with both r and r′ ranging (2m, +∞). Let us introduce on St a third
coordinate system (xī ) = (r̄, θ, φ) as follows:
                              m                              m 2
    on St ∩ MI :       r̄ ∈           , +∞ ,        r = r̄ 1 +                                 (9.108a)
                                  2                             2r̄
                                                                1                   
                                                                                               (9.108b)
                                                                           p
                                                     ⇐⇒ r̄ =        r − m + r(r − 2m)
                                                                2
                                     m                        m 2
    on St ∩ MIII :     r̄ ∈ 0,             ,        r′ = r̄ 1 +                                (9.108c)
                                      2                         2r̄
                                                                1 ′                   
                                                                                               (9.108d)
                                                                           p
                                                     ⇐⇒ r̄ =                  ′ ′
                                                                    r − m − r (r − 2m)
                                                                2
                              m
    on St ∩ S :        r̄ =     .                                                              (9.108e)
                              2
The range of r̄ is thus (0, +∞). The graph of the function r̄ 7→ r̄(1 + m/(2r̄))2 is depicted
in Fig. 9.17. We can separate this graph in two parts: r̄ ∈ (0, m/2) (the MIII part) and
r̄ ∈ (m/2, +∞) (the MI part). In each of these part, there is a one-to-one correspondence
316                     Maximal extension of Schwarzschild spacetime
                                      r/m
                                      5
                                                  1       2         3         4
                                                                                  r̄/m
      This foliation is regular in both MI and MIII , but is singular at the bifurcation sphere S , since
      all the hypersurfaces St intersect there (cf. Fig. 9.8). We may then consider (xᾱ ) = (t, r̄, θ, φ)
      as a coordinate system on Miso , which is regular on MI and MIII , but is singular at S , i.e. at
      r̄ = m/2. This system is called isotropic coordinates.
           From (9.108a), we get
                                                                                              m     2
                                                                                         1−
                                                                                     
                            m    m                                     2m
                    dr = 1 +     1−     dr̄                   and       1−    =               2r̄
                                                                                              m          .
                             2r̄    2r̄                                     r            1+   2r̄
      It is then immediate to deduce from (6.14) the expression of the metric tensor in terms of the
      isotropic coordinates (xᾱ ) = (t, r̄, θ, φ):
                                m     2
                           1−
                                                    m 4  2
                                2r̄
                                            dt2 + 1 +       dr̄ + r̄2 dθ2 + sin2 θ dφ2 .                      (9.111)
                                                                                      
                g=−             m
                           1+   2r̄
                                                      2r̄
                                          9.5 Einstein-Rosen bridge                                                317
Since the relation between r′ and r̄ is identical to that between r and r̄ [cf. Eqs. (9.108a) and
(9.108c)], the above expression of g is valid on MI and MIII . Note that all metric coefficients
are regular on MI ∪ MIII (except for the standard coordinate singularity of the spherical
coordinates (θ, φ) for θ ∈ {0, π}). On the contrary Eq. (9.111) yields det(gᾱβ̄ ) = 0 for r̄ = m/2,
which reflects the fact that the isotropic coordinates are singular on the bifurcation sphere S .
    A remarkable feature of expresssion (9.111) is that the spatial part is proportional to the flat
metric f of the Euclidean 3-space:
In other words, the metric γ induced by g on St is conformal to the flat metric f (cf. Sec. 4.2.2):
γ = Ψ4 f , (9.113)
Contents
        10.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 323
        10.2 The Kerr solution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 324
        10.3 Extension of the spacetime manifold through ∆ = 0 . . . . . . . . . . 334
        10.4 Principal null geodesics . . . . . . . . . . . . . . . . . . . . . . . . . . . 340
        10.5 Event horizon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 347
        10.6 Global quantities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 351
        10.7 Families of observers in Kerr spacetime . . . . . . . . . . . . . . . . . 354
        10.8 Maximal analytic extension . . . . . . . . . . . . . . . . . . . . . . . . 364
        10.9 Further reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 371
10.1      Introduction
Having studied the Schwarzschild black hole in the preceding chapters, we turn now to its
rotating generalization: the Kerr black hole. The Kerr metric is arguably the most important
solution of general relativity, largely because of the no-hair theorem, according to which all
stationary black holes in the Universe are Kerr black holes (cf. Sec. 5.6).
    In this chapter, the Kerr solution is first presented in terms of the standard Boyer-Lindquist
coordinates and its basic properties are discussed (Sec. 10.2). Then Kerr coordinates are
introduced in Sec. 10.3; contrary to Boyer-Lindquist coordinates, they are regular on the two
Killing horizons of Kerr spacetime. Kerr coordinates are also tied to one of the two congruences
of null geodesics related to the spacetime conformal structure: the principal null geodesics,
which are introduced in Sec. 10.4. The second congruence, that of the so-called outgoing
principal null geodesics, provides the generators of the black hole event horizon, which is
studied in Sec. 10.5. Then Sec. 10.6 focuses on global quantities characterizing Kerr spacetime:
mass, angular momentum and horizon area. Section 10.7 presents various standard families of
324                                                Kerr black hole
      observers in Kerr spacetime. Finally, Sec. 10.8 discusses the maximal analytical extension of
      Kerr spacetime and the concept of Cauchy horizon.
• the mass parameter m > 0, to be interpreted in Sec. 10.6.1 as the spacetime total mass;
      postponing the case a = m to Chap. 13. The Kerr solution is usually presented in the so-called
      Boyer-Lindquist coordinates (t, r, θ, φ). Except for the standard singularities of the spherical
      coordinates (θ, φ) on S2 at θ ∈ {0, π}, we may consider that Boyer-Lindquist coordinates cover
      the manifold R2 × S2 , with t spanning R, r spanning1 R, θ spanning (0, π) and φ spanning
      (0, 2π). Hence (t, r) is a Cartesian chart covering R2 and (θ, φ) is the standard spherical chart
      of S2 .
           In this section, we choose the spacetime manifold to be the open subset MBL of R2 × S2
      formed by the disjoint union of the following three connected components (cf. Fig. 10.1):
      where                                   √                                     √
                                r+ := m +      m 2 − a2       and     r− := m −         m2 − a2                     (10.3)
      and R is the subset of R2 × S2 defined in terms of Boyer-Lindquist coordinates (t, r, θ, φ) by
                                 n                                      πo
                            R := p ∈ R × S , r(p) = 0 and θ(p) =
                                         2     2
                                                                           .                   (10.4)
                                                                        2
      Remark 1: By construction, the points of R2 × S2 obeying r = r− and r = r+ are excluded from the
      spacetime manifold MBL . In a latter stage (Sec. 10.3), we shall extend the spacetime manifold to include
      these points, so that the spacetime manifold will be M = R2 × S2 \ R. Even latter on, after having
         1
             This contrasts with r spanning only (0, +∞) for the standard spherical coordinates (r, θ, φ) on R3 .
                                             10.2 The Kerr solution                                               325
                                    I
                                        II
                                              III
plane
rotation axis
Figure 10.1: View of a section t = const of R2 × S2 in terms of the coordinates (R, θ, φ), with R := er , so
that the region r → −∞ is reduced to a single point at the centre of all the pictured spheres. Such coordinates
have been introduced for pictorial purposes by O’Neill [391].
noticed that (M , g) is not geodesically complete, we shall extend the spacetime manifold further and
discuss the maximal analytic extension (Sec. 10.8).
Remark 2: One shall stress that the points having r = 0 are not special points in R2 × S2 spanned
by the coordinates (t, r, θ, φ). In particular, (t, r) = (t0 , 0), where t0 is a constant, defines a regular
sphere S0 diffeomorphic to S2 . On the spacetime manifold MBL , the points r = 0 are part of MIII and
because of the exclusion of R in the definition (10.2d), (t, r) = (t0 , 0) defines a sphere minus its equator
(cf. Fig. 10.1, where the equator is the tick orange circle).
    Note that thanks to the constraint (10.1), r+ and r− are well-defined and obey
Note also that R is spanned by the coordinates (t, φ) and is diffeomorphic to the 2-dimensional
cylinder R × S1 :
                                        R ≃ R × S1 .                                      (10.6)
This is so because r = 0 is not a peculiar value of r in R2 × S2 (cf. Remark 2 above). In view of
Eqs. (10.2b)-(10.2d) and (10.4), it is clear that the various connected components of MBL are
defined in terms of the Boyer-Lindquist coordinates (t, r, θ, φ) by
                                                  4amr sin2 θ            ρ2 2
                                         
                                      2mr
                           g = − 1− 2       dt2 −             dt dφ  +     dr
                                       ρ              ρ2                 ∆
                                                                                                          (10.8)
                                                  2a2 mr sin2 θ
                                                               
                                 2  2     2    2
                               +ρ dθ + r + a +                    sin2 θ dφ2 ,
                                                       ρ2
        with
                                                 ρ2 := r2 + a2 cos2 θ                                     (10.9)
        and
                                    ∆ := r2 − 2mr + a2 = (r − r− )(r − r+ ) .                            (10.10)
      Remark 3: Since dt dφ := 1/2 (dt ⊗ dφ + dφ ⊗ dt) [cf. Eq. (A.39)], the metric component gtφ is one
      half of the coefficient of the dt dφ term in Eq. (10.8), i.e. gtφ = −2amr sin2 θ/ρ2 .
      Remark 4: On MBL , ρ ̸= 0 and ∆ ̸= 0 (by construction of MBL !), so that the metric components (10.8)
      are regular in MBL , except for the standard singularities of the spherical coordinates (θ, φ).
         By means of a computer algebra system (cf. the SageMath notebook D.5.1), one easily
      checks:
The Kerr metric g [Eq. (10.8)] is a solution of the vacuum Einstein equation (1.44) on MBL .
      Historical note : The Kerr solution has been found by the New Zealand mathematician Roy P. Kerr
      (then at the University of Texas at Austin) in the spring of 1963 [312]. Kerr was searching for algebraically
      special metrics, i.e. metrics whose Weyl conformal curvature tensor admits a doubly degenerate principal
      null direction (to be defined in Sec. 10.4 below), in the case where the principal null congruence has a
      non-vanishing twist (or “rotation”). The special case of vanishing twist (i.e. hypersurface-orthogonal
      congruence) had been treated by Ivor Robinson and Andrzej Trautman in 1962 [440]. Kerr used Cartan’s
      structure equations in a null tetrad to manipulate the Einstein equation; he obtained the solution in
      coordinates different from the Boyer-Lindquist ones, known today as Kerr coordinates and to be discussed
      in Sec. 10.3.1 (cf. the historical note page 336). For more details about this fantastic discovery, see the
      account by Kerr himself in Refs. [313, 324]. Boyer-Lindquist coordinates have been introduced in 1966
      by Robert H. Boyer (see the historical note on p. 87) and Richard W. Lindquist [72] (compare Eq. (2.13) of
      Ref. [72] with Eq. (10.8) above, keeping in mind that Boyer and Lindquist used −a instead of a, following
      Kerr’s convention in the discovery article [312], cf. the historical note on p. 336).
  The spacetime (MBL , g) has two asymptotically flat ends: one in MI for r → +∞, which
  is equivalent to the asymptotics of a Schwarzschild spacetime of mass m and the other one
  in MIII for r → −∞, which is equivalent to the asymptotics of a Schwarzschild spacetime
  of (negative!) mass −m.
  The spacetime (MBL , g) admits two isometries: stationarity and axisymmetry, generated
  respectively by the Killing vectors
ξ := ∂t and η := ∂φ . (10.12)
Proof. All the metric components gαβ in Eq. (10.8) are independent from t and φ. This implies
that the vector fields (10.12) are Killing vectors (cf. Sec. 3.3.1). Since t spans R, the isometry
group generated by ξ is clearly the translation group (R, +). Moreover, in view of (10.11), we
have ξ · ξ = gtt < 0 as r → +∞, which means that the Killing vector ξ is asymptotically
timelike. Given the definition of stationarity stated in Sec. 5.2.1, we conclude that the Kerr
spacetime is stationary. On the other side, since φ is an azimuthal coordinate on S2 , the
isometry group generated by η is the rotation group SO(2) = U(1). Moreover, ∂φ is spacelike
for r → +∞ (even for r > 0). Hence, the Kerr spacetime is axisymmetric (cf. the definition
given in Sec. 5.3.6).
  For a ̸= 0, as we have assumed in (10.1), the Kerr spacetime is not static (cf. the definition
  in Sec. 5.2.1): the stationary Killing vector ξ is not hypersurface-orthogonal.
Proof. From Eq. (10.8), we have ξ · η = gtφ ̸= 0 for a ̸= 0. Since η = ∂φ is tangent to the
hypersurfaces t = const, this implies that ξ is not normal to these hypersurfaces. That there
does not exist any hypersurface family orthogonal to ξ can be seen from the non-vanishing of
328                                        Kerr black hole
      the twist 3-form defined by Eq. (3.63): ω := ξ ∧ dξ. We have indeed (see the notebook D.5.1
      for the computation)
                2am sin θ h        2   2     2
                                                                                     i
             ω=             sin θ(a cos θ − r ) dt ∧ dr ∧ dφ + 2∆r cos θ dt ∧ dθ ∧ dφ .
                   ρ4
      Clearly ω ̸= 0 as soon as a ̸= 0. According to the Frobenius theorem (see e.g. Eq. (B.3.6) in
      Wald’s textbook [499]), it follows that ξ is not hypersurface-orthogonal.
        is made of two connected components, which are two flat open disks of radius a, corre-
        sponding respectively to θ < π/2 and θ > π/2, since the equator θ = π/2 is excluded by
        the very definition of MIII (cf. Remark 2 above).
          The set S0,t is depicted by the dotted red line in Fig. 10.1. It is also depicted in terms of the
      so-called Kerr-Schild coordinates in Figs. C.1 - C.4 of Appendix C.
                                        10.2 The Kerr solution                                             329
10.2.4      Ergoregion
Let us investigate the causal character of the stationary Killing vector ξ. We have, according to
(10.8) and (10.9),
                                                         2mr
                               ξ · ξ = gtt = −1 + 2                .
                                                     r + a2 cos2 θ
Thus
        ξ timelike ⇐⇒ r2 − 2mr + a2 cos2 θ > 0 ⇐⇒ r < rE − (θ) or                r > rE + (θ),
with                                              √
                                rE ± (θ) := m ±       m2 − a2 cos2 θ .                           (10.17)
Comparing with (10.3), we note that
                         0 ≤ rE − (θ) ≤ r− ≤ m ≤ r+ ≤ rE + (θ) ≤ 2m,                             (10.18)
with
                      rE − (0) = rE − (π) = r−    and         rE − (π/2) = 0                (10.19a)
                      rE + (0) = rE + (π) = r+    and         rE + (π/2) = 2m.              (10.19b)
   Given the definition of MI , MII and MIII , we conclude that:
  Property 10.7: ergoregion of Kerr spacetime
       • ξ is timelike in the region of MI defined by r > rE + (θ) and in the region of MIII
         defined by r < rE − (θ);
e r/m cosθ
I 4
                                                  II       2
                                                                              in                        +
                                                         III              −
                                                                                                            e r/m sinθ
                                   -5                                                  5
                                                                   r=0
                                                           -2
-4
      Figure 10.2: Meridional view of a section t = const of Kerr spacetime with a/m = 0.90 in O’Neill exponential
      coordinates x = er/m sin θ and z = er/m cos θ (cf. Fig. 10.1). The right (resp. left) half of the figure corresponds
      to φ = 0 (resp. φ = π). The Roman numbers I, II, III denote the components MI , MII and MIII of the manifold
      MBL . The dotted orange circle marks the location of r = 0, while the small black circle at the center of the figure
      corresponds to r → −∞. The two red dots marks the curvature singularity R. The ergoregion (cf. Sec. 10.2.4) is
      shown in grey, while the yellow part is Carter time machine (cf. Sec. 10.2.5).
e r/m cosθ
                                                               6
                               I
                                                               4
                                                 II            2
                                                                                           +
                                                                         in
                                                           III                                 e r/m sinθ
                                            -5                                     5
                                                                    r=0
                                                            -2
-4
-6
e r/m cosθ
                                                  4
                I
                                                  2          in
                                        II III                                       +
                                                            −
                                                                                         e r/m sinθ
                          -5                                            5
                                                       r=0
                                                 -2
-4
      • ξ is spacelike in all MII and in the region G + of MI defined by r < rE + (θ), as well
        as in the region G − of MIII defined by r > rE − (θ).
  According to the nomenclature introduced in Sec. 5.4.2, one calls E + (resp. E − ) the outer
  ergosphere (resp. inner ergosphere) and G + (resp. G − ) the outer ergoregion (resp. inner
  ergoregion). The part of MBL where ξ is spacelike, i.e. G = G + ∪ MII ∪ G − , is called the
  ergoregion.
   Following the standard notation in topology, we shall denote by G the closure of G , i.e. the
union of the ergoregion and the two ergospheres:
                                            G := G ∪ E − ∪ E + .                                      (10.20)
The locus where the Killing vector ξ is timelike is then MBL \ G .
    The ergoregion is depicted in Figs. 10.2–10.4. It is worth noticing that the locations of
the inner and outer ergospheres in the equatorial plane (θ = π/2) do not depend on the spin
parameter a: rE − (π/2) = 0 [Eq. (10.19a)] and rE + (π/2) = 2m [Eq. (10.19b)], a feature that
appears clearly in Figs. 10.2–10.4, given that e2 ≃ 7.39.
Remark 5: By taking the limit a → 0 in Eq. (10.17), we get rE + (θ) → 2m. Since r+ → 2m in the same
limit, we conclude that, for a Schwarzschild black hole, the outer ergosphere coincides with the event
horizon. Consequently, there is no outer ergoregion for such a black hole: G + = ∅.
Remark 6: Sometimes the word ergosurface is used instead of ergosphere. One may encounter as
well the names infinite redshift surface (e.g. [495]) and static limit (cf. Sec. 10.7.2 below) for ergosphere,
especially in old texts.
Remark 7: By construction, the Killing vector ξ is null on the ergospheres E + and E − . It is moreover
tangent to these hypersurfaces (see below). However, for a ̸= 0, E + and E − are not Killing horizons,
332                                           Kerr black hole
      as defined in Sec. 3.3.2, since ξ fails to be normal to them, or equivalently, E + and E − are not null
      hypersurfaces. They are actually timelike hypersurfaces, except on the rotation axis. This can be seen by
      considering E ± as the level set f = 0 of the scalar field f := r − rE ± (θ) = r − m ∓ (m2 − a2 cos2 θ)1/2
      [cf. Eq. (10.17)]. The differential of f is df = dr ± a2 sin θ cos θ(m2 − a2 cos2 θ)−1/2 dθ, so that
                                                                                                              →
                                                                                                              −
      ⟨df, ξ⟩ = 0, which shows that ξ is tangent to E ± . The scalar square of the gradient vector field ∇f
      associated to df by metric duality is easily evaluated via the inverse metric (10.16):
                                                          a4 sin2 θ cos2 θ E ±        a2 m2 sin2 θ
                                                                          
                 →
                 −      →
                        −        µν             1
                  ∇f · ∇f = g ∂µ f ∂ν f = 2 ∆ + 2                             =                       .
                                                ρ         m − a2 cos2 θ           ρ2 (m2 − a2 cos2 θ)
                                                                          →
                                                                          −      →
                                                                                 −
      Away from the rotation axis, i.e. for sin θ ̸= 0, we have clearly ∇f · ∇f > 0 for a ̸= 0, which shows
                       →
                       −
      that the normal ∇f to E ± is spacelike, making E ± a timelike hypersurface. Further details about the
      outer ergosphere E + , especially the 2-dimensional geometry of its time slices, are discussed in Ref. [295].
         We shall see in Sec. 11.3.2 that the outer ergoregion plays a key role in an energy extraction
      mechanism known as the Penrose process.
        The region T is called Carter time machine. This name stems from the fact that thanks
        to T , there is a future-directed timelike curve connecting any two points of MIII .
      See e.g. Proposition 2.4.7 of O’Neill’s textbook [391] for a demonstration, or Carter’s original
      article [90]) for the proof of the last sentence. The Carter time machine T is depicted in yellow
      in the meridional diagrams of Figs. 10.2-10.4.
                                            10.2 The Kerr solution                                           333
                                     ρ 2 (r 2 + a 2 ) + 2a 2 mr sin 2 θ
                                                                              θ=0
                                                                              θ = π/4
                                                                              θ = π/3
                                                    8                         θ = π/2
Figure 10.5: Graph of the function giving the sign of gφφ for a = 0.9m and various values of θ.
10.2.6    Singularities
The components gαβ of the Kerr metric as given by (10.8) are diverging at various locations:
   • when ρ2 → 0, which, given (10.9) and assuming a ̸= 0, is equivalent to approaching the
     cylinder R defined by (10.4);
  The curvature of Kerr spacetime, as measured by the Kretschmann scalar K := Rµνρσ Rµνρσ
  [cf Eq. (6.44)], diverges for ρ2 → 0, i.e. for r → 0 and θ → π/2. Accordingly the subset R
  of R2 × S2 defined by ρ2 = 0 [Eq. (10.4)] is called the ring singularity of Kerr spacetime;
  the word ring reflects that t = const sections of R are circles [cf. Eq. (10.6)].
Proof. The Kretschmann scalar expressed in terms of Boyer-Lindquist coordinates is (cf. the
notebook D.5.1 for the computation)
                          m2 6
                               r − 15r4 a2 cos2 θ + 15r2 a4 cos4 θ − a6 cos6 θ .                   (10.22)
                                                                              
                 K = 48     12
                          ρ
The value for θ = π/2 is thus K = 48m2 /r6 , which clearly diverges for r → 0 (i.e. ρ2 → 0).
334                                          Kerr black hole
                                                        r 2 + a2
                                              dv = dt +          dr                                    (10.23a)
                                                            ∆
                                                          a
                                              dφ̃ = dφ + dr .                                         (10.23b)
                                                         ∆
      If a = 0, we note that the advanced Kerr coordinates are nothing but the null ingoing Eddington-
      Finkelstein coordinates on Schwarzschild spacetime (cf. Sec. 6.3.2 and compare (10.23a) with
      (6.28)).
      Remark 1: In the literature, the advanced Kerr coordinates are often simply called Kerr coordinates (e.g.
      [371, 464]). However, we prefer to keep this short name for closely related coordinates to be introduced
      in Sec. 10.3.3. Both systems originate in Kerr’s seminal article [312] (cf. the historical notes on p. 336
      and p. 340). The advanced Kerr coordinates are also called Kerr-star coordinates by O’Neill [391].
       r 2 + a2
                                                                                                        
                      2m                r+     r−                   a      a                1      1
                =1+                         −                and      =                        −               ,
           ∆        r+ − r−           r − r+ r − r−                 ∆   r+ − r−          r − r+ r − r−
                      √
      with r+ − r− = 2 m2 − a2 , so that Eqs. (10.23) can be readily integrated to
                                                                                
                                    m                      r − r+         r − r−
                         v =t+r+ √                   r+ ln        − r− ln                              (10.24a)
                                   m2 − a2                   2m             2m
                                    a        r − r+
                         φ̃ = φ + √       ln        ,                                                 (10.24b)
                                    2
                                 2 m −a 2    r − r−
One gets (cf. Appendix D or Eq. (5.31) of Ref. [266], or Lemma 2.5.2 of [391]):
                                               4amr sin2 θ
                         
                      2mr
      g = − 1− 2             dv 2 + 2dv dr −                dv dφ̃
                       ρ                            ρ2
                                                                                             (10.25)
                                                        2a2 mr sin2 θ
                                                                     
                                  2   2      2     2
                     2
              −2a sin θ dr dφ̃ + ρ dθ + r + a +                         sin2 θ dφ̃2 .
                                                             ρ2
   We note that the metric components (10.25) do not show any divergence when ∆ → 0,
contrary to the Boyer-Lindquist ones. Hence, we may extend the Kerr metric to the points of
R2 × S2 where ∆ = 0, i.e. to the hypersurfaces (cf. Fig. 10.1)
                            H := p ∈ R2 × S2 , r(p) = r+                            (10.27)
                                   
and
                             Hin := p ∈ R2 × S2 ,                                            (10.28)
                                   
                                                        r(p) = r− .
The hypersurface H is actually the boundary between the regions MI and MII , while Hin is
the boundary between MII and MIII (cf. Eq. (10.7) and Fig. 10.2). We thus consider
  The symmetric bilinear form g given by Eq. (10.25) is regular and non-degenerate on the
  manifold M defined by (10.29) and thus (M , g) is a well-behaved spacetime — our Kerr
  spacetime from now on. We note that, contrary to MBL , M is a connected manifold.
336                                               Kerr black hole
      It follows from the chain rule that the advanced Kerr coordinate frame is related to the Boyer-
      Lindquist coordinate frame by
                                          ∂v = ∂t                                                                     (10.31a)
                                                             2     2
                                                            a +r     a
                                          ∂r̂ = ∂r −             ∂t − ∂φ                                              (10.31b)
                                                              ∆      ∆
                                          ∂θ = ∂θ                                                                     (10.31c)
                                          ∂φ̃ = ∂φ .                                                                  (10.31d)
      Note that we are using the notation ∂r̂ for the ∂/∂r vector of the advanced Kerr coordinates
      (xα̂ ) = (v, r, θ, φ̃), to distinguish it from the ∂/∂r vector of Boyer-Lindquist coordinates.
      Remark 2: Contrary to the Schwarzschild case (cf. Sec. 6.3.2), v is not a null coordinate for a ̸= 0,
      but rather a spacelike one, i.e. the hypersurfaces of constant v are timelike (cf. the definitions given in
      Sec. A.3.2). Indeed, we read on (10.26) that g vv = a2 sin2 θ/ρ2 . For a ̸= 0, this implies g vv > 0 outside
      the rotation axis. The criterion (A.56c) allows us to conclude that v is a spacelike coordinate.
      Historical note : The advanced Kerr coordinates are those in which Roy P. Kerr originally presented
      his solution in 1963 [312]. As noted by Kerr himself later on [313], he used −a instead of a, because
      he was “rather hurried in performing this calculation (angular momentum) and got the sign wrong”
      (footnote in Sec. 2.5 of Ref. [313]). Actually, it was shown in 1964 by Robert H. Boyer and T.G. Price [73]
      that the angular momentum about the rotation axis of the Kerr solution is J = −am, where a is Kerr’s
      one. Taking this into account, the correspondence between our notations and those of Kerr’s article
      [312] is v ↔ u and a ↔ −a. Then one can check that the metric (10.25) coincides with that given by an
      unnumbered (!) equation in Kerr’s article [312].
The Kerr spacetime (M , g) is time-orientable, and we choose its time orientation such that
k := −∂r̂ (10.32)
Remark 3: The minus sign in the above definition, along with Eq. (10.31b), ensures that
                                           k ∼ ∂t − ∂r           when r → +∞,
                   10.3 Extension of the spacetime manifold through ∆ = 0                               337
which shows that the time orientation set by k agrees asymptotically with that of ξ = ∂t . The latter
vector field could not have been chosen to set the time orientation of (M , g) since it is not causal
everywhere, being spacelike in the ergoregion.
    The field lines of k are future-directed null curves, which may be qualified of ingoing since,
by definition, −∂r̂ points towards decreasing values of r. Note that, by the very definition of
the coordinate vector ∂r̂ , the values of the coordinates (v, θ, φ̃) are fixed along each of these
null curves. We may therefore denote them by L(v,θ,    in
                                                          φ̃) . We shall see in Sec. 10.4 that each
L(v,θ,φ̃) is actually a null geodesic.
  in
Property 10.12
Proof. Applying Lemma 1.2 (Sec. 1.2.2) with u = k and v = ∂r , we get that ∂r is past-directed
iff g(k, ∂r ) > 0. Now, in terms of the Boyer-Lindquist components (10.8), g(k, ∂r ) = gµr k µ =
grr k r = (ρ2 /∆)k r . The Boyer-Lindquist component k r is given by Eq. (10.31b) where ∂r̂ = −k;
we get k r = −1. Hence g(k, ∂r ) = −ρ2 /∆ > 0 in MII , for ∆ < 0 there.
  In region MII , the coordinate r must decrease towards the future along any causal (i.e.
  timelike or null) worldline.
Proof. Let L be a causal curve in region MII and λ a parameter along L increasing towards
the future. The associated tangent vector v = dx/dλ is then future-directed. According to
Property 10.12, −∂r is a future-directed timelike vector in MII , so that we can apply Lemma 1.1
(Sec. 1.2.2) with u = −∂r and get g(−∂r , v) < 0. Now, using Boyer-Lindquist components,
we have
                                                             dr        ρ2 dr
                    g(−∂r , v) = −grµ v µ = −grr v r = −grr        =−        .
                                                             dλ         ∆ dλ
Since −ρ2 /∆ > 0 in MII , g(−∂r , v) < 0 is thus equivalent to dr/dλ < 0, which proves that r
is decreasing along L as λ increases.
338                                         Kerr black hole
                                                        2mr
                                              dt̃ = dt +    dr                                     (10.34a)
                                                         ∆
                                                         a
                                              dφ̃ = dφ + dr .                                      (10.34b)
                                                         ∆
      The integrated version is obtained by substituting (10.33) in Eq. (10.24):
                                                                            
                                       m               r − r+         r − r−
                           t̃ = t + √            r+ ln        − r− ln                              (10.35a)
                                      m2 − a2            2m             2m
                                      a        r − r+
                           φ̃ = φ + √       ln        .                                            (10.35b)
                                      2
                                   2 m −a 2    r − r−
          Since the transformation (10.33) leads to dv = dt̃ + dr, the expression of the metric with
      respect to the Kerr coordinates (t̃, r, θ, φ̃) is easily deduced from (10.25):
                                                             4amr sin2 θ
                                      
                                   2mr         4mr
                        g = − 1− 2       dt̃2 + 2 dt̃ dr −                 dt̃ dφ̃
                                    ρ           ρ                 ρ2
                                                           
                                   2mr                  2mr
                            + 1+ 2          2
                                         dr − 2a 1 + 2          sin2 θ dr dφ̃                       (10.36)
                                    ρ                    ρ
                                                2a2 mr sin2 θ
                                                             
                              2  2     2     2
                            +ρ dθ + r + a +                     sin2 θ dφ̃2 .
                                                     ρ2
         Since we kept r, θ and φ̃ and simply changed v to t̃ via (10.33) when moving from the
      advanced Kerr coordinates to the Kerr ones, we easily get the link between the two coordinate
      frames:
                                                 ∂t̃ = ∂v                                          (10.37a)
                                                 ∂r̃ = ∂v + ∂r̂                                    (10.37b)
                                                 ∂θ = ∂θ                                           (10.37c)
                                                 ∂φ̃ = ∂φ̃ .                                       (10.37d)
      Note that we have denoted by ∂r̃ the second vector of the coordinate frame associated to the
      Kerr coordinates (xα̃ ) = (t̃, r, θ, φ̃), in order to distinguish it from the coordinate vector ∂r̂ of
                    10.3 Extension of the spacetime manifold through ∆ = 0                                    339
the advanced Kerr coordinates (xα̂ ) = (v, r, θ, φ̃), as well as from the coordinate vector ∂r of
the Boyer-Lindquist coordinates (xα ) = (t, r, θ, φ).
    By combining (10.31) and (10.37), we get the relation between the Kerr coordinate frame
and the Boyer-Lindquist coordinate frame:
                                      ∂t̃ = ∂t                                                    (10.38a)
                                                   2mr     a
                                      ∂r̃ = ∂r −       ∂t − ∂φ                                    (10.38b)
                                                    ∆      ∆
                                      ∂θ = ∂θ                                                     (10.38c)
                                      ∂φ̃ = ∂φ .                                                  (10.38d)
We notice on (10.38a) and (10.38d) that the coordinate frame vectors ∂t̃ and ∂φ̃ coincide with
the Killing vectors ξ and η:
                                  ∂t̃ = ξ and ∂φ̃ = η .                                 (10.39)
That ∂t̃ and ∂φ̃ are Killing vectors is not surprising since the metric components (10.36) do not
depend on t̃ nor on φ̃.
   The determinant of the metric components (10.36) takes a very simple form:
                                      det gα̃β̃ = −ρ4 sin2 θ.                              (10.40)
                                               
The inverse metric also takes a rather simple form in terms of Kerr coordinates (see the
notebook D.5.2-2 for the computation):
                                                             
                                  −1 − 2mr ρ2
                                              2mr
                                               ρ2
                                                  0      0
                                                             
                                     2mr       ∆         a
                                                             
                               
                                       ρ 2     ρ2 0     ρ 2
                                                              
                          α̃β̃
                         g =                                 .                  (10.41)
                                                             
                               
                                      0       0 ρ12     0    
                                                              
                                                             
                                               a         1
                                       0       ρ2
                                                  0 ρ2 sin2 θ
    Comparing (10.36) with (10.8), we note that the metric components in Kerr coordinates are
slightly more complicated than those in Boyer-Lindquist coordinates, for they contain extra
off-diagonal terms: gt̃r and grφ̃ . However the determinant (10.40) and the inverse metric (10.41)
are pretty simple. Morever Kerr coordinates are as well adapted to the spacetime symmetries
as the Boyer-Lindquist ones, as (10.39) shows, and they have the great advantage to be regular
on the boundary hypersurfaces H and Hin , contrary to Boyer-Lindquist coordinates. The last
feature is all the more important that H is the future event horizon of Kerr spacetime, as we
are going to see. Therefore, we shall continue our study of Kerr spacetime, and especially of
the black hole feature, by means of Kerr coordinates.
Remark 4: The coordinate t̃ is not everywhere timelike, i.e. the hypersurfaces of constant t̃ are
not everywhere spacelike. Indeed, according to the criterion (A.56), t̃ is timelike iff g t̃t̃ < 0 and we
read on Eq. (10.41) that g t̃t̃ = −1 − 2mr/ρ2 . Hence t̃ is timelike iff ρ2 + 2mr > 0, or equivalently
r2 + 2mr + a2 cos2 θ > 0. This quadratic polynomial in r is positive everywhere except in the region
of MIII defined by
                                 p                             p
                      −m − m2 − a2 cos2 θ ≤ r ≤ −m + m2 − a2 cos2 θ.                                (10.42)
340                                            Kerr black hole
      Note that this region is contained in the negative-r part of MIII . We conclude that the coordinate t̃ is
      timelike in MI , MII and in the part of MIII outside the region defined by (10.42). Consequently, the
      Kerr coordinates (t̃, r, θ, φ̃) are related to a 3+1 slicing of spacetime only outside the region (10.42). By
      3+1 slicing, it is meant a foliation of M by spacelike hypersurfaces (see e.g. [227]).
      Historical note : The coordinate t̃ has been introduced by Roy Kerr in the 1963 discovery article [312],
      by exactly the same transformation as (10.33) (t̃ is denoted by t and v by u in Ref. [312]). Kerr considered
      t̃ along with Cartesian-type coordinates (x, y, z) deduced from (r, θ, φ̃) by spheroidal transformations,
      to form the coordinate system (t̃, x, y, z), which is known today as Kerr-Schild coordinates (cf. Ap-
      pendix C), despite they have been introduced first in Kerr’s article [312] and not in the subsequent
      article by Kerr and Schild [314] (1965). Accordingly, the Kerr coordinates (t̃, r, θ, φ̃) defined above are a
      mix of the coordinates (v, r, θ, φ̃) in which Kerr exhibited his solution (cf. the historical note on p. 336),
      called here advanced Kerr coordinates, and the Kerr-Schild coordinates (t̃, x, y, z). The Kerr coordinates
      (t̃, r, θ, φ̃) have been first explicitely considered by Robert Boyer and Richard Lindquist in 1966 [72],
      being called by them the “(E) frame” — “(E)” standing for Eddington, since these coordinates generalize
      Eddington-Finkelstein coordinates to the rotating case. We can check that the metric (10.36) coincides
      with Eq. (2.7) of Boyer and Lindquist’s article [72], keeping in mind that these authors were using −a
      for a, as Kerr in 1963 (cf. the discussion about the sign of a in the historical note on p. 336).
∇k k = 0. (10.45)
λ = −r. (10.46)
                                                         1 α
                                         ∗
                                             C αβγδ :=    C   ϵµν ,                                  (10.48)
                                                         2 βµν γδ
ϵ being the Levi-Civita tensor (cf. Sec. A.3.4). In view of (10.47), one says that the vector field k
constitutes a doubly degenerate, principal null direction of C (see e.g. Chap. 5 of O’Neill
textbook [391] for details). We note that the ingoing principal null geodesics form a congruence:
through each point of M , there is one, and only one, curve L(v,θ,  in
                                                                       φ̃) .
Remark 1: For the Kerr spacetime, as for any solution of the vacuum Einstein equation (R = 0), the
Weyl conformal curvature tensor C is equal to the Riemann curvature tensor Riem: setting R = 0 in
Eq. (A.114) yields Riem = C.
Remark 2: At the Schwarzschild limit, a = 0, φ̃ = φ and the ingoing principal null geodesics L(v,θ,   in
                                                                                                          φ̃)
reduce to the ingoing radial null geodesics L(v,θ,φ)
                                              in     discussed in Secs. 6.3.1 and 6.3.5 [compare Eqs. (6.48)
and (10.43)].
   The components of k with respect to the Boyer-Lindquist coordinate frame are immediately
deduced from Eq. (10.31b) and k = −∂r̂ :
                                              r 2 + a2          a
                                     k=                ∂t − ∂r + ∂φ .                                (10.49)
                                                  ∆             ∆
Substituting Eqs. (10.34) for dt̃ and dφ̃ in Eq. (10.44), we get the expression of the associated
1-form k with respect to the Boyer-Lindquist coordinate coframe:
                                                    ρ2
                                    k = −dt −          dr + a sin2 θ dφ.                             (10.50)
                                                    ∆
Remark 3: Expressions (10.49) and (10.50) are singular on the two horizons H and Hin , where ∆ = 0.
This reflects the singularity of Boyer-Lindquist coordinates on H and Hin , not any pathology of the
vector field k. Indeed, the expressions of k and k in terms of Kerr coordinates [Eqs. (10.43) and (10.44)]
are perfectly regular in all M .
                                                         t̃/m
                                                             in
                                                         4
                                                                                k
                                                         2                                 `
                                                                                                           r/m
              8           6          4          2                     2          4          6          8
      Figure 10.6: Principal null geodesics of Kerr spacetime viewed in terms of the Kerr coordinates (t̃, r) for
      a/m = 0.9. The solid (resp. dashed) curves correspond to outgoing geodesics L(u,θ,
                                                                                    out
                                                                                         ˜ (resp. ingoing geodesics
                                                                                        φ̃)
      L(v,θ,
        in
            φ̃) ), as given by Eq. (10.53) with u = const (resp. Eq. (10.33) with v = const). The increment in u between
      two depicted outgoing geodesics is δu = 2m; similarly, two depicted ingoing geodesics differ in their values of v
      by δv = 2m. Note that the outgoing principal null geodesics tend to become tangent to the horizon H (resp.
      Hin ) when r → r+ (resp. r → r− ). Actually, we shall see in Sec. 10.5.3 and 10.8.3 that the generators of these
      two horizons belong to the outgoing principal null congruence. [Figure generated by the notebook D.5.3]
                                              t̃/m
                                                     in
                                              4
                                                                    k
                                              2                                 `
                                                                                            r/m
        8         6         4         2                   2          4          6       8
    The fact that the Weyl tensor C admits two, and exactly two, congruences of principal
null geodesics means that the Kerr metric is an algebraically special solution of the Einstein
equation: it belongs to the so-called Petrov type D [391].
    Let us find the expression of the outgoing principal null geodesics in terms of Kerr coor-
dinates (which have been constructed on the ingoing principal null congruence). Combining
(10.51) with (10.34), we get
                                             r2 + 2mr + a2
                                   du = dt̃ −              dr                          (10.52a)
                                                   ∆
                                              2a
                               dφ̃˜ = dφ̃ −      dr.                                   (10.52b)
                                              ∆
These equations can be integrated (cf. the computation leading to Eq. (10.24)), yielding
                                                                          
                                 2m                r − r+         r − r−
                u = t̃ − r − √              r+ ln         − r− ln                      (10.53a)
                               m2 − a2                2m            2m
                              a           r − r+
                φ̃˜ = φ̃ − √           ln           ,                                  (10.53b)
                             m2 − a2      r − r−
The quantities u − t̃ and φ̃˜ − φ̃ are plotted in terms of r in Fig. 10.8. We see there that u takes
all values in the range (−∞, +∞) in each of the regions MI , MII and MIII . Accordingly, the
3-tuple (u, θ, φ̃)
               ˜ defines uniquely an outgoing principal null geodesic only in each of these
three regions. In other words, the congruence of outgoing principal null geodesics can be split
in three families L(u,θ,
                     out,I
                          ˜ , L(u,θ,φ̃)
                         φ̃)        ˜ and L(u,θ,φ̃)
                                out,II
                                                  ˜ , each family being labelled by (u, θ, φ̃). In what
                                              out,III                                      ˜
follows, we shall denote by L(u,θ, out
                                        ˜ any member of one of these three families.
                                        φ̃)
344                                                            Kerr black hole
                           8                                                             8
                           6                                                             6
                           4                                                             4
                           2                                                             2
              (u − t̃)/m
                                                                              ϕ̃˜ − ϕ̃
                           0                                                             0
                           2                                                             2
                           4                                                             4
                           6                                                             6
                           8                                                             8
                               8   6     4       2    0    2   4    6     8                  8   6     4      2       0   2      4   6   8
                                                     r/m                                                          r/m
      Figure 10.8: u − t̃ (left panel) and φ̃˜ − φ̃ (right panel) as functions of r given by Eqs. (10.53) for a/m = 0.9.
      The dashed vertical lines correspond to r = r− (Hin ) and r = r+ (H ) and delimitate the regions MIII , MII and
      MI (from the left to the right). [Figure generated by the notebook D.5.3]
           The outgoing principal null geodesics are depicted, along with the ingoing ones, in Figs. 10.6
      and 10.7. Note that the φ̃-motion of the outgoing geodesics, as expressed by Eq. (10.53b) with
      φ̃˜ held fixed, is not shown in these figures, which represent only traces in the (t̃, r) plane.
      Actually the geodesics L(u,θ,
                                 out
                                     ˜ are winding with respect the Kerr coordinates (t̃, r, θ, φ̃) at the
                                     φ̃)
      coordinate speed
                                         dφ̃                 2a
                                                   = 2                .                            (10.54)
                                         dt̃ L out    r + 2mr + a2
                                                                        ˜
                                                                   (u,θ,φ̃)
         Note that in both asymptotically flat regions, when r → ±∞, the winding speed (10.54)
      goes to zero and the two congruences of geodesics tend to ±45◦ lines in Figs. 10.6 and 10.7, as
      expected. Note also that, despite their name, the outgoing principal null geodesics are actually
      ingoing in MII (between Hin and H ), i.e. have r decreasing towards the future, in agreement
      with Property 10.13 (Sec. 10.3.2).
      Remark 4: In Fig. 10.7, the outgoing geodesics seem to go “backward in time” in the region −2m ≲ r ≲ 0.
      This is an artefact due to the hypersurfaces t̃ = const being not spacelike there, as discussed in Remark 4
      p. 339. Consequently it is possible to move to the future with decaying values of t̃ in this region. The
      same effect exists, but is less pronounced, for a/m = 0.9 (Fig. 10.6).
         Another view of ingoing principal null geodesics is provided by Figs. C.1, C.2 and C.4 of
      Appendix C, which depict them in terms of Kerr-Schild coordinates.
                                    10.4 Principal null geodesics                                      345
        Given that ℓ is a smooth vector field, the congruence of outgoing principal null geodesics
        can be smoothly extended to include the integral curves of ℓ on H and Hin , which we shall
        denote by L(θ,ψ)
                      out,H
                              and L(θ,ψ)
                                       out,Hin
                                               , given that θ and ψ [Eq. (10.59)] are constant along them.
        Consequently, the congruence of outgoing principal null geodesics on M is made of 5
        families: L(u,θ,
                    out,I
                         ˜ , L(u,θ,φ̃)
                         φ̃)       ˜ , L(u,θ,φ̃)
                               out,II
                                                ˜ , L(θ,ψ) and L(θ,ψ)
                                           out,III    out,H       out,Hin
                                                                          , each family covering a different
        subset of M .
      Remark 5: At the Schwarzschild limit, a = 0, φ̃˜ = φ, ψ = φ and the outgoing principal null geodesics
                      out,H                                                                             out,H
      L(u,θ,
         out
             ˜ and L(θ,ψ) reduce respectively to the outgoing radial null geodesics L(u,θ,φ) and L(θ,φ)
             φ̃)
                                                                                            out
      discussed in Secs. 6.3.1 and 6.3.5. This can be seen from the fact that, for a = 0, the tangent vector ℓ
      given by Eq. (10.57) reduces to the tangent vector ℓ given by Eq. (6.53).
          The 1-form ℓ associated to ℓ by g-duality is obtained from ℓα̃ = gα̃µ̃ ℓµ̃ , with gα̃µ̃ given by
      Eq. (10.36):
                                    ∆              2ρ2 − ∆         a∆ sin2 θ
                         ℓ=− 2             d t̃ +             dr +             dφ̃ .                (10.60)
                               2(r + a2 )         2(r2 + a2 )      2(r2 + a2 )
          The components of ℓ and ℓ with respect to the Boyer-Lindquist coordinate frame/coframe
      are obtained by combining Eq. (10.57) with Eq. (10.38) and Eq. (10.60) with Eq. (10.34):
                                     1          ∆              a
                                  ℓ = ∂t +    2   2
                                                     ∂r +            ∂φ .                              (10.61)
                                     2     2(r + a )      2(r + a2 )
                                                             2
                                  ∆               ρ2           a∆ sin2 θ
                           ℓ=− 2         dt +             dr +             dφ.                         (10.62)
                              2(r + a2 )      2(r2 + a2 )      2(r2 + a2 )
      Remark 6: The reader may have noticed a certain dissymmetry between the chosen tangent vector
      k of ingoing principal null geodesics, which obeys ∇k k = 0 [Eq. (10.45)] and the tangent vector ℓ of
      the outgoing ones, which obeys ∇ℓ ℓ ̸= 0 [Eq. (10.58)]. The last property implies that the parameter λ
      associated to ℓ is not affine, while the parameter −r associated to k is (cf. Sec. B.2.2). The non-affine
      choice is the price to pay to have a parametrization of the outgoing congruence well-defined everywhere
      in M , even where ∆ = 0. We shall see in Sec. 10.8 that in the maximal extension of the Kerr spacetime,
      there are other regions where these features are reversed, thereby restoring the symmetry between
      ingoing and outgoing principal null geodesics on the extended spacetime.
          Using either the Kerr components (10.43) and (10.57) or the Boyer-Lindquist components
      (10.49) and (10.61), one can easily evaluate the scalar product of k and ℓ:
                                                                ρ2
                                              g(k, ℓ) = −            .                                 (10.63)
                                                            r 2 + a2
      This implies that g(k, ℓ) < 0 everywhere (note that ρ2 = 0 would correspond to the ring
      singularity R, which has been excluded from the spacetime manifold M ). Using Lemma 1.2
      (Sec. 1.2.2) with u = k and v = ℓ, we conclude
                                              10.5 Event horizon                                                  347
  The tangent vector ℓ to the outgoing principal null geodesics is future-directed in all the
  Kerr spacetime M .
Remark 7: Instead of ℓ, another natural choice for the tangent vector to the outgoing principal null
geodesics would have been the tangent vector ℓ̂ such that g(k, ℓ̂) = −1. In view of (10.63), the two
tangent vectors are related by ℓ̂ = (r2 + a2 )/ρ2 ℓ As for ℓ, the tangent vector ℓ̂ does not correspond to
an affine parameter of the outgoing principal null geodesics, i.e. its non-affinity coefficient κℓ̂ is nonzero.
While ℓ̂ is used in many studies, we prefer ℓ here because it coincides with the Killing vector ξ + ΩH η
on the event horizon, as we shall see in Sec. 10.5 [Eq. (10.72)]. In particular, its non-affinity coefficient
κℓ yields the so-called black hole’s surface gravity (Sec. 10.5.4 below).
n · n = ρ2 ∆. (10.65)
      where we have used (10.39) and r± stands for r+ on H and r− on Hin . On H , we may rewrite
      this expression as
                                             n = 2mr+ χ,                                  (10.67)
      with
                                                 χ := ξ + ΩH η                                         (10.68)
      and
                                        a      a             a
                             ΩH :=         = 2     2
                                                     =       √        .                               (10.69)
                                      2mr+  r+ + a     2m m + m2 − a2
      ΩH being a constant, the vector field χ defined by (10.68) is a Killing vector field of (M , g).
      Moreover, Eq. (10.67) shows that this Killing vector is normal to the null hypersurface H . In
      view of the definition given in Sec. 3.3.2, we conclude:
      and
                                        a      a             a
                             Ωin :=        = 2     2
                                                     =       √        ,                               (10.71)
                                      2mr−  r− + a     2m m − m2 − a2
      thereby arriving at:
        The hypersurface Hin defined by r = r− is a Killing horizon with respect to the Killing
        vector field χin [Eq. (10.70)].
      We shall call Hin the inner horizon. We shall see in Sec. 10.8.3 that Hin is actually (part of) a
      so-called Cauchy horizon.
      Historical note : The identification of the hypersurfaces H and Hin as the only two null hypersurfaces
      of Kerr spacetime that are stationary (i.e. that are Killing horizons in the modern language) has been
      first performed in 1964 by Robert H. Boyer and T.G. Price [73], who claimed: “These are ‘horizons’ in
      the sense that there can be a flow of matter or radiation across them in only one direction. They are the
      analogues of the Schwarzschild null sphere or ‘singularity’.”
                                          10.5 Event horizon                                            349
r ≥ r0 + α(λ − λ0 ),
where λ0 is the value of L ’s parameter at p. When λ → +∞, we get r → +∞, which proves
that the null curve L reaches I + . Hence we conclude:
  Property 10.19: black hole region in Kerr spacetime
Incidentally, since we have already shown that H is a Killing horizon (Property 10.17), this
illustrates the strong rigidity theorem (Property 5.25 in Sec. 5.4.2): the black hole event horizon
of Kerr spacetime is a Killing horizon.
    According to the discussion in Sec. 5.4.2, we may then call the quantity ΩH introduced in
Eqs. (10.68)-(10.69) the black hole rotation velocity.
350                                          Kerr black hole
        The null generators of the event horizon H are the geodesics L(θ,ψ)
                                                                       out,H
                                                                             of the outgoing
        principal null congruence.
Similarly, we have
        The null generators of the inner horizon Hin are the geodesics L(θ,ψ)
                                                                         out,Hin
                                                                                 of the outgoing
        principal null congruence.
          The reader is referred to Fig. 5.1 for a pictorial view of the event horizon H spanned by
      the rotating null generators, as well as to Figs. 10.6 and 10.7, where it appears clearly that at
      r = r+ (resp. r = r− ) the outgoing principal null geodesics are tangent to H (resp. Hin ).
      Remark 1: Since ∆ = 0 on H , we read immediately from Eq. (10.60) that
                                                    H         ρ2
                                                   ℓ=              dr,
                                                          r 2 + a2
      3 which shows that, at any point of H , the vector ℓ is normal to the hypersurface r = const through
      this point. This hypersurface being nothing but H itself, we recover the fact that ℓ is normal to H .
          In view of expression (10.69) for ΩH , Eq. (10.59) for the parameter ψ labelling the outgoing
      principal null geodesics L(θ,ψ)
                                  out,H
                                         can be rewritten as
                                                   ψ = φ̃ − ΩH t̃.                                  (10.73)
      Hence ψ appears as a corotating azimuthal coordinate on H . Moreover, we verify that the
      winding speed of the outgoing principal null geodesics given by Eq. (10.54) tends toward ΩH
      when approaching the event horizon:
                                                    dφ̃
                                             lim                       = ΩH .                       (10.74)
                                             r→r+   dt̃   L out  ˜
                                                            (u,θ,φ̃)
                                         H
      This follows immediately from ℓ = χ = ξ + ΩH η = ∂t̃ + ΩH ∂φ̃ and the identity r+
                                                                                      2
                                                                                        + a2 =
      2mr+ (compare Eq. (10.54) with r → r+ and Eq. (10.69)).
                                       10.6 Global quantities                                         351
Substituting (10.3) for r+ , we get an expression involving the two basic Kerr parameters:
                                         √
                                           m 2 − a2
                                   κ=        √         .                                   (10.77)
                                      2m(m + m2 − a2 )
Given the strict inequality a < m assumed in this chapter [Eq. (10.1)], we have κ ̸= 0. According
to the classification introduced in Sec. 3.3.6, we may state:
In Sec. 3.3.7, we have seen that the non-affinity coefficient κ can be interpreted as a “rescaled”
surface gravity. Hence κ is called the black hole surface gravity. The fact that κ is a constant
(i.e. does not depend on θ) is an illustration of the zeroth law of black hole dynamics established
in Sec. 3.3.5 (cf. in particular Example 14 in that section).
Remark 2: As a check, if we let a → 0 in Eq. (10.77), we get κ = 1/(4m), i.e. we recover the
Schwarzschild horizon value computed in Example 10 of Chap. 2 [cf. Eq. (2.29)].
      and use the asymptotic flatness of Kerr metric to get simple expressions. The unit normals to
                                                                       √
      S are then n = ∂t and s = ∂r . Moreover, when r → +∞, q = r2 sin θ. Hence Eq. (5.46)
      with n = 4 yields                       Z
                                     1
                            M =−         lim     (∂r ξt − ∂t ξr )r2 sin θ dθ dφ,
                                    8π  r→+∞   S
                                                          |{z}
                                                                      0
      with
                                                                     2m
                                           ξt = gtµ ξ µ = gtµ δ µt = gtt ≃ −1 +
                                                                         ,
                                                                      r
      the last expression resulting from the expansion (10.11). We have then ∂r ξt = −2m/r2 , so that
      the above integral yields
                                                 M =m.                                        (10.78)
Indeed, given
            √ the metric components (10.8), we immediately check that n · s = 0, s · s = 1
and s = (ρ/ ∆) dr, which does imply that s is normal to S .
   Given the above expressions for ηα , nα and sα , we get, for r → +∞,
and
                                                      2am
                                 nµ ∂µ ην sν ∼ ∂t ηr + 3 ∂φ ηr = 0.
                                                 |{z}  r   |{z}
                                                     0               0
i.e.
                                                    J = am .                                            (10.80)
We conclude that the parameter a is nothing but the total angular momentum divided by the
total mass.
                                                      2
                                              A = 4π(r+ + a2 ) = 8πmr+ .                                           (10.82)
      Via (10.3), one may recast this result to let appear only m and a:
                                                                   √
                                               A = 8πm(m +             m2 − a2 ) .                                 (10.83)
                                                            κA
                                                    M=         + 2ΩH J .                                           (10.84)
                                                            4π
      This is actually the particular case T = 0 (vacuum) of the generic Smarr formula (5.91)
      established in Chap. 5, or the particular case QH = 0 (vanishing electric charge) of the
      electrovacuum Smarr formula (5.104).
      It follows that the worldline L of O is an orbit of the isometry group R × SO(2) of Kerr
      spacetime. In physical terms, this means that the spacetime geometry as perceived by observer
      O does not evolve, hence the name stationary observer.
         3
         In the definition (10.85), the coefficients α and β are required to be constant along a given observer’s worldline.
      When considering a family of observers, they may vary from one worldline to the other.
                           10.7 Families of observers in Kerr spacetime                                  355
                           dt           dr            dθ                dφ
                              = α,         = 0,          = 0,              = β,
                           dτ           dτ            dτ                dτ
where τ is the observer’s proper time. It follows that the stationary observer evolves at fixed
values of the coordinates r and θ.
    Outside the Carter time machine T (cf. Sec. 10.2.5), we have necessarily α ̸= 0, because
η is spacelike in M \ T and the 4-velocity u is necessarily timelike. Hence, by introducing
ω := β/α, we may rewrite (10.85) as
The coefficient ω gives the rate of variation of the azimuthal coordinate along the observer’s
worldline L in any coordinate system adapted to the spacetime symmetries, like the Boyer-
Lindquist ones (t, r, θ, φ), the advanced Kerr ones (v, r, θ, φ̃) or the Kerr ones (t̃, r, θ, φ̃), ac-
cording to
                                       dφ           dφ̃           dφ̃
                                  ω=            =             =             .                 (10.87)
                                       dt   L       dv    L       dt̃   L
Proof. Let us consider Boyer-Lindquist coordinates (t, r, θ, φ). Denoting by τ the proper time
of O, we have dφ/dt|L = dφ/dτ × dτ /dt. Now by the definition (1.16) of the 4-velocity,
dφ/dτ = uφ and dτ /dt = (ut )−1 . We have thus dφ/dt|L = uφ /ut . From Eq. (10.86) along
with ξ = ∂t and η = ∂φ , we read ut = α and uφ = αω. Hence dφ/dt|L = ω. The same
demonstration applies to Kerr and advanced Kerr coordinates because ξ = ∂v = ∂t̃ and
η = ∂φ̃ .
Remark 1: That the ratios dφ/dt, dφ̃/dv and dφ̃/dt̃ along L are all equal, as expressed in (10.87), is
not surprising if one considers the links between the various coordinates expressed by Eqs. (10.23a),
(10.23b) and (10.34). Setting dr = 0 in these equations, since r is constant along L , we get
    Equation (10.87) allows for a nice physical interpretation of ω. Indeed, we have seen in
Sec. 7.3.3 [cf. Eq. (7.58)] that dφ/dt|L is nothing but the angular velocity around the symmetry
axis as measured by an asymptotically distant inertial observer. The demonstration was
performed in the Schwarzschild case and for a circular orbit in the equatorial plane, but it used
only the stationarity of Schwarzschild spacetime and the fact that dφ/dt was constant along
L , so it applies to the present case as well.
356                                        Kerr black hole
      This is thus the region where the observer O is defined. Note that it does not contain the
      horizons H and Hin , which is not surprising since Boyer-Lindquist coordinates are singular
      there, notably in terms of the spacetime slicing by the hypersurfaces Σt , as illustrated in Fig. 6.2
      for the case a = 0.
                               10.7 Families of observers in Kerr spacetime                                            357
    In all MZAMO , the timelike vector N is future-directed with respect to the time orientation
chosen in Sec. 10.3.2. Indeed, the latter is set by the global null vector field k and we have,
using the Boyer-Lindquist components (10.49) of k:
                                                  r 2 + a2
                         k · N = Nµ k µ = −k t = −         < 0 on MZAMO ,
                                                      ∆
so that Lemma 1.2 [cf. Eq. (1.6a)] let us conclude that N is future-directed.
    Choosing the observer O as having his worldline orthogonal to Σt means that O’s 4-velocity
n is the unit timelike vector introduced by Eq (10.79). Equivalently, n is N rescaled to form a
unit vector:
                                                        →
                                                        −
                                      n = N N = −N ∇t,                                   (10.92)
with4 N := (−N · N )−1/2 . In view of Eq. (10.90), we get
                                                             −1/2
                              √                2a2 mr sin2 θ
                                  
                       N = ∆ r +a +  2     2
                                                                   .                                        (10.93)
                                                     ρ2
N is called the lapse function, for it relates the increment dτ in the proper time of O to the
change dt of the coordinate t when moving from Σt to Σt+dt via
                                                   dτ = N dt .                                              (10.94)
Proof. The infinitesimal vector that connects the point of proper time τ to that of proper
time τ + dτ along O’s worldline is dx = dτ n (by the very definition of the 4-velocity n,
compare Eq. (1.16)). The corresponding increment in the coordinate t is given by formula
                                                                  →
                                                                  −
(A.21): dt = ⟨dt, dx⟩; we have then, using Eq. (10.92) to express ∇t,
                                     →
                                     −            →
                                                  −         dτ           dτ
                     dt = ⟨dt, dx⟩ = ∇t · dx = dτ ∇t · n = − n    · n} =    ,
                                                            N  | {z      N
                                                                                 −1
           Each ZAMO can be characterized by its coordinates (r0 , θ0 , φ0 ) at some fixed value t0
      of the Boyer-Lindquist time t. The set of ZAMOs is thus a 3-parameter family of observers
      filling MZAMO . The coordinates (r, θ, φ) span each hypersurface Σt . Contrary to the ZAMO
      worldlines, the curves of fixed (r, θ, φ), the tangent vector of which is ∂t = ξ, are not orthogonal
      to Σt , except for a = 0. The orthogonal decomposition of ξ into a part along the normal n and
      a part tangent to Σt defines the shift vector β:
ξ =Nn+β , n · β = 0. (10.97)
      Remark 2: The terms lapse and shift vector are those used in the so-called 3+1 formalism of general
      relativity (see e.g. Refs. [10, 44, 227, 456]). In this context, the ZAMO is called the Eulerian observer,
      which is the generic denomination of the observer whose worldline is normal to the hypersurfaces Σt
      that constitute the 3+1 foliation of spacetime.
          The ZAMO rotation velocity seen from infinity ω is obtained by comparing Eqs. (10.86) and
      Eq. (10.97) rewritten as n = N −1 (ξ − β φ η). We thus get immediately ω = −β φ . Hence
                                         dφ                        2amr
                                  ω :=            =                                  .                  (10.99)
                                         dt   L        ρ2 (r2 + a2 ) + 2a2 mr sin2 θ
                                  10.7 Families of observers in Kerr spacetime                                           359
Note that ω = 0 for a = 0 (Schwarzschild black hole) and that ω decays quite rapidly with r:
                                                           2am   2J
                                                ω    ∼        3
                                                                = 3,                                          (10.100)
                                                    r→±∞    r     r
where Eq. (10.80) has been used to let appear the black hole angular momentum J.
    A ZAMO is not an inertial observer: it is not in free-fall, since its 4-acceleration a := ∇n n
is nonzero. Indeed, for any family of observers orthogonal to a spacelike foliation (Σt )t∈R , it
can be shown that a is the orthogonal projection onto Σt of the gradient of the logarithm of the
                                                        →
                                                        −
lapse function (see e.g. Eq. (4.19) in Ref. [227]): a = ∇ ln N + (∇n ln N ) n. In the present case,
this expression simplifies since ∇n ln N = nµ ∂µ ln N = 0, as a result of n = nt ∂t + nφ ∂φ . We
              →
              −
then get a = ∇ ln N , i.e. aα = g αµ ∂µ ln N , so that
                                                ∆ ∂N          1 ∂N
                                         a=            ∂ r +         ∂θ .
                                               ρ2 N ∂r       ρ2 N ∂θ
In the asymptotic regions, the non-relativistic gravitational field felt by the observer is g = −a,
so that we recover the standard Newtonian√expression for r → +∞ (MI side). In the asymptotic
region of MIII , i.e. for r → −∞, e(r) = ( ∆/ρ) ∂r is oriented towards the black hole, so that
g points outward, i.e. is a repelling field. This is of course in agreement with the negative mass
aspect of the Kerr metric in region MIII discussed in Sec. 10.2.2.
     Despite ω as given by Eq. (10.99) is nonzero for a ̸= 0, a ZAMO has a vanishing angular
momentum about the rotation axis, hence the name zero-angular momentum observer. Indeed,
the observer’s specific “angular momentum”, loosely defined by ℓ := η · n [cf. Eq. (7.26)], is
identically zero, the observer’s 4-velocity n being orthogonal to η = ∂φ . By losely defined, we
mean that the above definition of ℓ fully makes sense for a geodesic, for which it leads to a
conserved quantity along the worldline (cf. Sec. 7.2.1), and we are going to see that a ZAMO’s
worldline is not a geodesic. However, a ZAMO shares with a ℓ = 0 geodesic crossing his
worldline the same value ω of the angular velocity as seen from infinity, as we shall see in
Sec. 11.3.4 [cf. Eq. (11.72)].
     The name locally non-rotating observer, initially given to a ZAMO (cf. historical note below),
conveys other specificities of such a observer O:
   5
       See the notebook D.5.5; see also Eqs. (70)-(71) of Ref. [450] or Eqs. (A.9) and (A.10) of Ref. [61].
360                                                Kerr black hole
         • O does not measure any component along e(φ) for the 3-momentum P of any null or
           timelike geodesic that has a zero conserved angular momentum (L = 0):
               where p stands for the particle’s 4-momentum and we have used expression (1.24) for
               the 3-momentum, along with e(φ) · n = 0, formula (10.95d) for e(φ) and the definition
               L := η · p of the conserved angular momentum [cf. Eq. (7.1b), as well as Eq. (11.2b)
               below].
         • for O, the directions e(φ) and −e(φ) are equivalent, insofar as the proper time measured
           by O for a light signal to perform a full circuit on a circle at r = const, θ = const is the
           same for a forward motion (increasing φ) of the signal as for a backward one (decreasing
           φ) (see Ref. [37] for details).
          However, a ZAMO is not totally “non rotating”, for he has a nonzero 4-rotation as soon
      as a ̸= 0. Let us recall that the 4-rotation vector of an observer O of 4-velocity n and 4-
      acceleration a is defined as the spacelike vector ωrot orthogonal to n such that the evolution
      of O’s orthonormal frame (e(α) ) along O’s worldline L takes the form (see e.g. Sec. 13.6 of
      Ref. [371] or Sec. 4.5 of Ref. [228])
      where the cross product in the hyperplane orthogonal to n, ωrot ×n e(α) , is the unique vector
      orthogonal to both n and ωrot such that (ωrot ×n e(α) ) · v = ϵ(n, ωrot , e(α) , v) for any vector v,
      ϵ being the Levi-Civita tensor (cf. Sec. A.3.4). The ΩFW operator that appears in the right-hand
      side of Eq. (10.103) is called the Fermi-Walker operator. It appears as soon as the observer
      is accelerating, even if he is non-rotating. It corrects the parallel transport of (e(α) ) along L ,
      which would be realized by ∇n e(α) = 0, to ensure that (e(α) ) remains an orthonormal frame.
      A vector field v that obeys ∇n e = ΩFW (v) is said to be Fermi-Walker transported along L .
      Physically, Fermi-Walker transport is realized by a free gyroscope: its spin vector with respect
      to O is Fermi-Walker transported along L . Hence the 4-rotation ωrot of an observer, which
      is an absolute quantity (like the 4-acceleration a, it does not depend on any other observer),
      measures the motion of his spacelike triad (e(i) ) with respect to an orthonormal triad whose
      vectors are aligned along gyroscopes axes. For the ZAMO, the 4-rotation turns out to be6
                                √                              ρ2 2
                                                                                
                      ω
             ωrot   =− 3      2           2            2   2          2
                             a ∆ cos θ sin θ e(r) + r(r + a ) + (r − a ) sin θ e(θ) ,                (10.104)
                      ρ                                        2r
      where ω is given by Eq. (10.99). Hence, as soon as a ̸= 0, ωrot ̸= 0. Far from the black hole,
      ρ ∼ |r| and we get
                                                      3ω r                  3J
                                    ωrot     ∼    −          sin θ e(θ) = − 3 sin θ e(θ) .           (10.105)
                                           r→±∞        2 |r|               |r|
         6
             See the notebook D.5.5 for the computation; see also Eqs. (73)-(74) of Ref. [450].
                              10.7 Families of observers in Kerr spacetime                                           361
In particular, in the equatorial plane (θ = π/2), ωrot ∼ 3J/|r|3 ez , where ez = −e(θ) is parallel
the symmetry axis, with the same direction as the black hole spin.
Remark 3: As a family of observers, the ZAMOs form a zero-vorticity congruence. The vorticity
2-form of any congruence of timelike worldlines of 4-velocity u is defined as the “magnetic” part Ω in
the “electric/magnetic” decomposition7 of the 2-form du with respect to u (cf. Sec. A.4.3):
where a is the 1-form associated by metric duality to the 4-acceleration a := ∇u u of the worldlines.
That Ω = 0 for the ZAMO congruence follows from its orthogonality to a family of hypersurfaces,
namely (Σt )t∈R . Indeed, we deduce from (10.92) that dn = −d(N dt) = −dN ∧ dt, since ddt = 0 [cf.
Eq. (A.92)]. Using (10.92) again, we get dn = d ln N ∧ n. Comparing with (10.106) with u = n, we
conclude that Ω = 0 for the ZAMO congruence8 . Physically, this means that if each ZAMO sets up a
orthonormal spatial frame (e′i ) with axes aligned along the spin of free gyroscopes, instead of the frame
(e(i) ) defined by (10.95), then this frame will not rotate with respect to the frame defined similarly by a
nearby ZAMO.
Historical note : The concept of ZAMO has been introduced by James M. Bardeen in 1970 [37] under
the name of locally non-rotating observers and for generic stationary and axisymmetric spacetimes.
The application to the exterior part of Kerr spacetime has been performed by Bardeen, William H. Press
and Saul A. Teukolsky in 1972 [42]. The name ZAMO has been coined by Kip S. Thorne and Douglas
MacDonald in 1982 [480].
                 r 2 + a2                                                 a
       t(τ ) =      √ τ + const,       r(τ ) = const,           φ(τ ) = √ τ + const,
                                                           θ(τ ) = const,
                  ρ ∆                                                   ρ ∆
                                                                                   (10.107)
where τ is O’s proper time, and which is equipped with the following orthonormal frame:
                                             r 2 + a2      a
                                      ε(0) =    √ ∂t + √ ∂φ                                            (10.108a)
                                              ρ ∆        ρ ∆
                                             √
                                                ∆
                                      ε(r) =      ∂r                                                   (10.108b)
                                               ρ
                                             1
                                      ε(θ) = ∂θ                                                        (10.108c)
                                             ρ
                                             a             1
                                      ε(φ) = sin θ ∂t +         ∂φ .                                  (10.108d)
                                             ρ          ρ sin θ
   7
     This name stems from the decomposition of the electromagnetic field 2-form F (cf. Sec. 1.5.2) with respect to
an observer of 4-velocity u into the electric field E and the magnetic field B, both measured by that observer,
according to F = u ∧ E + ⋆(u ∧ B), where ⋆ stands for the Hodge dual, cf. Eq. (5.38).
   8
     Incidentally, we also get a = d ln N + α n and Eq. (10.101) shows that α is actually zero.
362                                                   Kerr black hole
      We note that the frame (ε(α) ) is well-defined because ∆ > 0 in MI ∪ MIII . It is an easy exercise
      to check that (ε(α) ) is an orthonormal frame (see the notebook D.5.6). In particular, we have
      ε(0) · ε(0) = −1 and, using the Boyer-Lindquist components (10.50) of k:
                                                       r 2 + a2            a      ρ
                             k · ε(0) =   kµ εµ(0)   =− √       + a sin2 θ √ = − √ < 0,
                                                        ρ ∆               ρ ∆      ∆
      which, by virtue of Lemma 1.2 (Sec. 1.2.2), shows that ε(0) is future-directed. Moreover, we
      notice from (10.107) and (10.108a) that εα(0) = dxα /dτ , which proves that the worldline L
      is timelike and ε(0) is the 4-velocity of observer O. Since r and θ are constant along L , we
      note also that expression (10.108a) for ε(0) agrees with the general form (10.85) of a stationary
      observer’s 4-velocity.
           The algebraic dual of the Carter frame, called the Carter coframe, is the 4-tuple of 1-forms
      (ε )0≤α≤3 such that ⟨ε(α) , ε(β) ⟩ = δ αβ (cf. Sec. A.2.4 in Appendix A). It is related to the
        (α)
      Substituting Eqs. (10.109) for the 1-forms ε(α) and using the notation ω 2 := ω ⊗ ω for the tensor-
      product square of a 1-form [cf. Eq. (A.39)], we get the following expression of the metric with respect to
      Boyer-Lindquist coordinates:
                      ∆                   2 ρ2 2              sin2 θ                   2
               g=−     2
                         dt − a sin2
                                     θ dφ   +   dr + ρ2
                                                        dθ 2
                                                             +     2
                                                                      (r2 + a2 )dφ − adt .                   (10.111)
                      ρ                       ∆                  ρ
      This expression, which is equivalent to (10.8), is often found in textbooks (cf. e.g. Eq. (33.2) in Ref. [371]).
      It is called the canonical form by Carter [95].
         The specificity of the Carter observer is to be linked to the principal null geodesic con-
      gruences of Kerr spacetime, which have been introduced in Sec. 10.4. Indeed, by adding and
      subtracting the first two vectors of the Carter frame, as given by Eqs. (10.108a) and (10.108b)
      and comparing the result with Eqs. (10.49) and (10.61), we can express the tangent vectors k
      and ℓ of respectively the ingoing and outgoing principal null geodesics as
                                                               √
                       ρ                                     ρ ∆
                                             and                                             (10.112)
                                                                                    
                k=√         ε(0) − ε(r)                ℓ=              ε (0) + ε (r)   .
                        ∆                                 2(r2 + a2 )
         9
             cf. the notebook D.5.6.
                                 10.7 Families of observers in Kerr spacetime                             363
In other words, the principal null vectors k and ℓ lie in Span(ε(0) , ε(r) ). Physically, this means
that the Carter observer see the principal null geodesics having a pure radial motion (no
component along ε(θ) or ε(φ) ). Since the principal null geodesics are related to the Weyl tensor
and to the algebraically special character of the Kerr metric (cf. Sec. 10.4), this makes the Carter
frame well tailored to the computation of the Riemann’s curvature tensor via Cartan’s formula
(see Refs. [95, 391]).
    By comparing expression (10.108a) for a Carter observer’s 4-velocity ε(0) with Eq. (10.86),
we get immediately the angular velocity around the symmetry axis of the Carter observer as
measured by an asymptotically distant static observer:
                                                      dφ                 a
                                            ωC :=               =             .                (10.113)
                                                      dt    L       r2   + a2
Remark 5: Expression (10.113) is much simpler than that of the angular velocity ω of a ZAMO, as given
by Eq. (10.99). In particular, ωC does not depend on θ, contrary to ω.
       The 4-acceleration aC := ∇ε(0) ε(0) of the Carter observer is10
                  m(r2 − a2 ) + a2 (m − r) sin2 θ       a2
                 aC =           √                 ε(r) − 3 sin θ cos θ ε(θ) .                  (10.114)
                              ρ3 ∆                      ρ
                                 √                            √
Far from the black hole, we have ∆ ∼ |r|, ρ ∼ |r| so that ρ3 ∆ ∼ r4 and we get
                                                                m
                                                 aC     ∼          ε(r) ,                      (10.115)
                                                      r→±∞      r2
i.e. the same Newtonian behavior as for the ZAMO 4-acceleration [Eq. (10.102)], with the
attractive character in MI and the repelling one in MIII .
     The 4-rotation vector ωrot
                             C
                                of the Carter observer is computed in a way similar to that of the
ZAMO [Eq. (10.104)], i.e. by subtracting the Fermi-Walker part from ∇ε(0) ε(i) for i ∈ {1, 2, 3}
and writing it as ωrot
                     C
                       ×ε(0) ε(i) . We get11
                                            a h√                            i
                                    C
                                   ωrot   = 3    ∆ cos θ ε(r) − r sin θ ε(θ) .                 (10.116)
                                           ρ
           √
Due to      ∆ ∼ |r| and ρ ∼ |r| when r → ±∞, the asymptotic behavior is
                                     a 
                           C
                                                                                               (10.117)
                                                                       
                          ωrot   ∼    2
                                        cos θ ε(r) − (sgn r) sin θ ε(θ) ,
                               r→±∞ r
with sgn r = +1 in MI and sgn r = −1 in the asymptotic region of MIII . Introducing in both
the region MI and in the asymptotic region of MIII a “Cartesian” coordinate system (x, y, z)
defined x = |r| sin θ cos φ, y = |r| sin θ sin φ and z = |r| cos θ, the above formula turns out to
be equivalent to
                                                        a
                                       C
                                     ωrot    ∼ (sgn r) 2 ∂z .                             (10.118)
                                           r→±∞         r
  10
       See the notebook D.5.6 for the computation; see as well Eqs. (90)-(91) of Ref. [450].
  11
       See the notebook D.5.6 for the computation; see as well Eq. (93) of Ref. [450].
364                                                 Kerr black hole
      note that the full expression of ωrot , Eq. (10.116), is much simpler than that of ωrot , Eq. (10.104). The same
                                        C
      remark holds for the 4-acceleration aC [Eq. (10.114)] versus the ZAMO 4-acceleration a [Eq. (10.101)].
      This reflects the fact that the Carter observer is tightly bound to the spacetime structure, being closely
      related to the principal null geodesics, as shown by Eq. (10.112).
      Historical note : The Carter frame has been introduced by Brandon Carter in 1968 [91], in the form of its
      dual, namely the Carter coframe, for a quite general class of spacetimes including the Kerr one (Eqs. (81)-
      (82) in Ref. [91]). The explicit form for Kerr spacetime, i.e. the system (10.109), can be found in Carter’s
      lecture at the 1972 Les Houches Summer School [95] (Eqs. (5.19)-(5.24) and (5.31)-(5.34); cf. footnote 41
      in the reprinted version [95]). The Carter frame has been popularized by Roman Znajek in 1977 [533] for
      studying electromagnetic processes around a black hole and has been heavily used in Barrett O’Neill’s
      monograph about Kerr black holes [391], where it is called√the Boyer-Lindquist frame field. In particular
      O’Neill’s canonical Kerr vector fields V and W are V = ρ ∆ ε(0) and W = ρ sin θ ε(φ) .
         3. the ingoing principal null geodesics are mapped to lines T = −X + const and the
            outgoing ones are mapped to lines T = X + const, with T increasing towards the future
            of (M , g) in both cases.
        12
             See the historical notes on p. 300 and p. 368.
                                     10.8 Maximal analytic extension                                            365
Property 1 implies that Π acts as a projection, leaving out any information in the non-ignorable
coordinate θ. Property 2 provides the “compactified” view13 , and Property 3 makes Π(M )
look “conformal” to the 2-dimensional Minkowski spacetime generated by the metric f =
−dT ⊗ dT + dX ⊗ dX on R2 . However, let us stress that Π is not a priori connected to a
proper conformal completion of (M , g), as defined in Sec. 4.3. Note that to fulfill Property 3, it
suffices to choose the functions T (t̃, r) and X(t̃, r) defining Π such that in each of the regions
MI , MII and MIII ,
where (i) U (u) is a function of the retarded Kerr time u = u(t̃, r) given by Eq. (10.53a) that is
monotonic increasing in MI and MIII and decreasing in MII (cf. Fig. 10.8) and (ii) V (v) is a
monotonic increasing function of the advanced Kerr time v = t̃ + r [Eq. (10.33)].
    This follows from u (resp. v) being constant along the outgoing (resp. ingoing) principal
null geodesics. Note also that Property 3 ensures that the two horizons H and Hin , which
are generated by outgoing principal null geodesics, appear as lines inclined by +45◦ with
respect to the X-axis, as for the black hole event horizon in the Carter-Penrose diagrams of
Schwarzschild spacetime (Figs. 9.10 and 9.11).
Remark 1: A related but distinct concept of projection diagram has been introduced in Ref. [126] (see
also Chap. 7 of Ref. [119]); it differs from that considered here by demanding, instead of Properties 1
to 3, that Π maps any timelike curve of14 (M \ T , g) to a timelike curve of the Minkowski spacetime
(R2 , f ) and that any timelike curve in Π(M \ T ) is the image of a timelike curve of (M \ T , g).
      Figure 10.9: Carter-Penrose diagram of the Kerr spacetime (M , g), with M = R2 × S2 \ R (cf. Eq. (10.29)
      in Sec. 10.3.1), spanned by the ingoing principal null geodesics (dashed green lines). The solid green lines are
      outgoing principal null geodesics, while the dotted curves mark some hypersurfaces r = const.
          As pointed out in Sec. 10.5.2, the asymptotic structure of region MI is identical to that of
      MI in Schwarzschild spacetime. It can thus be endowed with a conformal completion at null
      infinity, as constructed in Secs. 9.4.3 and 9.4.4. This allows us to add the future (resp. past)
      null infinity I + (resp. I − ) to the diagram of Fig. 10.9. Note that I + corresponds to the limit
      v → +∞ in MI , while I − corresponds to the limit u → −∞ in MI .
Figure 10.10: Carter-Penrose diagram of the minimal extension of MI to ensure complete outgoing principal
null geodesics (one of them is drawn as a solid green line).
spacetime.
    A spacetime extending (MI , g) “to the past”, so that the outgoing principal null geodesics
are complete, is shown in Fig. 10.10. It is made by attaching to (MI , g) a time-reversed copy15
of (MII , g), (MII∗ , g) say, and then attaching to the latter a copy of (MIII , g), (MIII
                                                                                        ′′
                                                                                           , g) say16 .
By construction, the outgoing principal null geodesics are complete in this extension, but the
ingoing ones are not: the affine parameter λ = −r of that denoted by v = v0 in Fig. 10.10 is
bounded from above by −r+ . Actually the situation is completely symmetric to that of the
original Kerr spacetime (M , g) (Fig. 10.9). In particular, the region MII∗ ∪ Hin′′ ∪ MIII ′′
                                                                                              , where
Hin is the (null) hypersurface r = r− , is a white hole, since it is the complement of J (I − ) (cf.
   ′′                                                                                    +
the definition (4.41) of a white hole). The hypersurface r = r+ (black thick line in Fig. 10.10) is
the corresponding past event horizon.
    By combining the diagrams of Figs. 10.9 and 10.10, one obtains a spacetime which still
contains incomplete null geodesics: the outgoing ones in regions MII and MIII , and the ingoing
ones in regions MII∗ and MIII   ′′
                                   . To go further, one should add new regions isometric to one
of the three blocks (MI , g), (MII , g) and (MIII , g) and iterate indefinitely, leading to the
Carter-Penrose diagram of Fig. 10.11. In this process, one shall make sure to have some analytic
continuation of the metric between the various blocks. This is done by introducing Kruskal-type
coordinates in the vicinity of the boundaries between the various blocks. We shall not do it
  15
     By copy, it is meant that (MII∗ , g) is a spacetime isometric to (MII , g) and by time-reversed that r is increasing
towards the future in MII∗ , while it is decreasing towards the future in MII .
  16
     The notation MIII ′′
                          instead of MIII′
                                             is for later convenience.
368                                               Kerr black hole
      here and refer the reader to the seminal articles by Boyer & Lindquist [72] and Carter [90], the
      famous Les Houches lectures by Carter [95] or the textbook by O’Neill [391].
         In the diagram of Fig. 10.11, it is clear that the event horizon H and the inner horizon Hin
      have been extended to bifurcate Killing horizons (cf. Sec. 3.4), the bifurcation surface of which
      being a 2-sphere depicted by a circular dot.
      Historical note : In 1966, Brandon Carter [88] obtained the maximal analytic extension of the 2-
      dimensional manifold constituted by the rotation axis17 A of the Kerr spacetime and drew a diagram
      similar to that of Fig. 10.11 (Fig. 1a of Ref. [88]). More precisely, Carter introduced on A a coordinate
      system (T, X) (denoted (ψ, ξ) by him) in which the metric induced by g on A is explicitely conformal
      the 2-dimensional Minkowski metric f = −dT ⊗ dT + dX ⊗ dX. Thus Fig. 1a of Carter’s article [88],
      which is reproduced as Fig. 28 of Hawking & Ellis’ textbook [266], is a true conformal representation of
      A , and not the mere “compactified projection” that we used to define a generic Carter-Penrose diagrams
      in Sec. 10.8.1. This was of course made possible because the rotation axis A is a 2-dimensional manifold.
      For the whole Kerr spacetime, the “projection” aspect is inevitable. In the same article [88], Carter
      suggested that the maximal analytic extension of the whole 4-dimensional manifold would be similar to
      that of A . This was proven rigorously a year later by Robert H. Boyer and Richard W. Lindquist [72]
      and generalized to Kerr-Newmann spacetimes by Carter himself in 1968 [90].
Figure 10.11: Carter-Penrose diagram of the maximal analytic extension of the Kerr spacetime. As in Figs. 10.9
and 10.10, the dotted curves mark some hypersurfaces r = const. The central black or light brown dots mark the
bifurcation spheres of bifurcate Killing horizons.
370                                            Kerr black hole
      Figure 10.12: The partial Cauchy surface Σ, its future Cauchy development D+ (Σ) (hatched) and the Cauchy
      horizon HC . As in Figs. 10.9-10.11, the dotted curves marks some hypersurfaces r = const.
boundary between MII and M ′ III , H ′ in say, which is a Killing horizon as well:
HC = Hin ∪ H ′ in . (10.120)
      Note that HC corresponds to a fixed value of the r coordinate of Kerr-type coordinate systems,
      namely r = r− , in agreement with the primary definition of Hin given in Eq. (10.28).
      Remark 2: There is no such Cauchy horizon in the Schwarzschild spacetime, even in its maximally
      extended version. For instance the hypersurface defined in terms of the Kruskal-Szekeres coordinates
      by T = 0 and whose equatorial section is depicted in Fig. 9.14 (Flamm paraboloid), is a Cauchy surface
      for the whole maximally extended Schwarzschild spacetime: by looking at Fig. 9.12, you may convince
      yourself that any inextendible causal curve in Schwarzschild spacetime must go through T = 0.
          Let us evaluate the non-affinity coefficient κin of the Killing generator χin of the part Hin of
      the Cauchy horizon [cf. Eq. (10.70)]. Since mr− /(r−   2
                                                               + a2 ) = 1/2, we notice, from Eqs. (10.57),
      (10.70) and (10.71), that the Killing vector χin coincides with the vector ℓ tangent to the outgoing
      principal null geodesics on Hin :
                                                            H
                                                       χin =in ℓ.                                     (10.121)
      Hence, as the event horizon H , the inner horizon Hin is generated by outgoing principal null
      geodesics [compare Eq. (10.72) and see Figs. 10.6 and 10.7]. Equation (10.58) implies then
                                                            H
                                                ∇χin χin =in κin χin ,                                (10.122)
                                       10.9 Further reading                                         371
with the non-affinity coefficient κin obtained by specializing Eq. (10.58) to r = r− : κin =
              2
κℓ |r=r− = m(r− − a2 )/(r−
                         2
                           + a2 )2 . Using the expression of r− in terms of m and a [Eq. (10.3)],
we get
                                               √
                                                 m2 − a2
                                κin = −            √           .                       (10.123)
                                         2m(m − m2 − a2 )
Given the assumption 0 < a < m, we have κin ̸= 0, which implies (cf. Sec. 3.3.6)
  As long as a < m, the part Hin of the Cauchy horizon HC is a non-degenerate Killing
  horizon.
We note that κin < 0. According to the results of Sec. 3.4.2, this implies
  Hin is contained in a bifurcate Killing horizon, the bifurcation surface of which being the
  future boundary of Hin (light brown small disk in Fig. 10.12). Actually, the whole Cauchy
  horizon HC is the past part of the bifurcate Killing horizon, the future part being formed
  by the future boundaries r = r− of MIII and M ′ III (light brown thick lines in Fig. 10.12).
Contents
        11.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 373
        11.2 Equations of geodesic motion . . . . . . . . . . . . . . . . . . . . . . . 374
        11.3 Main properties of geodesics . . . . . . . . . . . . . . . . . . . . . . . . 388
        11.4 Timelike geodesics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 410
        11.5 Circular timelike orbits in the equatorial plane . . . . . . . . . . . . . 418
        11.6 Going further . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 441
11.1      Introduction
In various occasions during our study of Kerr spacetime in Chap. 10, we have already encoun-
tered some peculiar geodesics, namely the principal null geodesics. In this chapter, we begin
the systematic study of causal (timelike or null) geodesics in Kerr spacetime. First, taking
advantage of an algebraic particularity of this spacetime, giving birth to a non-trivial Killing
tensor and the associated Carter constant, we shall see in Sec. 11.2 that Kerr geodesic motion is
fully governed by a system of first order equations, the solution of which can be obtained by
quadrature. The basic properties of geodesics that can be inferred from this system are derived
in Sec. 11.3. Then we focus on timelike geodesics (Sec. 11.4) and in particular on those that
are bound; they represent orbits of massive particles or bodies around a Kerr black hole. In
Sec. 11.5, we shall investigate the particular case of circular orbits in the equatorial plane and
discuss their stability. The detailed study of null geodesics is deferred to the next chapter.
    Of course, when the Kerr spin parameter a tends to zero, all results in this chapter reduce to
those obtained for Schwarzschild geodesics in Chap. 7. As already suggested in the introduction
of Chap. 7, before moving on, the reader might want to have a look at Appendix B, which
recaps the properties of geodesics in pseudo-Riemannian manifolds.
374                   Geodesics in Kerr spacetime: generic and timelike cases
                                      d 2 xα           µ
                                                 α dx dx
                                                          ν
                                             + Γ  µν        = 0,              0 ≤ α ≤ 3,                             (11.1)
                                      dλ2            dλ dλ
      where the Γαµν ’s are the Christoffel symbols of the metric g with respect to the coordinates (xα ),
      as given by Eq. (A.71). Equation (11.1) is the component expression of ∇p p = 0 [Eq. (1.11)]. It
      is a system of four coupled second-order differential equations, which are non-linear. We are
      going to see that it is actually not necessary to solve this system to compute the geodesics of
      Kerr spacetime. Indeed, as in the Schwarzschild case studied in Chap. 7, there exists enough
      first integrals of Eq. (11.1) to reduce the problem to four first-order equations of the type
      dxα /dλ = F α (x0 , x1 , x2 , x3 ).
          Three integrals of motion are similar to those of Schwarzschild geodesics: one is the
      particle’s mass µ (Sec. 11.2.3 below) and the two other ones are the conserved energy E and the
      conserved angular momentum L (Sec. 11.2.2), which arise from the common symmetries of Kerr
      and Schwarzschild spacetimes: stationarity (outside the ergoregion) and axisymmetry. In the
      Schwarzschild case, the fourth integral of motion was provided by spherical symmetry, which
      constrained all geodesics to be planar, so that a suitable choice of coordinates (t, r, θ, φ) makes a
      given geodesic confined to the hyperplane θ = π/2, yielding the first integral pθ = 0 [Eq. (7.7)].
      The Kerr spacetime with a ̸= 0 being not spherically symmetric, we loose this property here.
      Fortunately there exists another integral of motion, known as the Carter constant; it arises
      from a remarkable property of Kerr spacetime: the existence of a non-trivial Killing tensor
      (Sec. 11.2.4). This makes a total of four integral of motions, which makes the problem integrable
      (Secs. 11.2.5 to 11.2.8).
  The Killing vectors ξ and η of Kerr spacetime, associated respectively with stationarity
  and axisymmetry [cf. Eq. (10.12)], give birth to two conserved quantities along any (causal)
  geodesic L :
                                      E := −ξ · p = −g(ξ, p)                                   (11.2a)
                                      L := η · p = g(η, p) ,                                   (11.2b)
  where p is the 4-momentum of the particle P having L as worldline (cf. Sec. 1.3). For
  the same reasons as in Sec. 7.2.1, E is called the conserved energy or energy at infinity
  of P, while L is called the conserved angular momentum or angular momentum at
  infinity of P.
In particular, if L reaches the asymptotic region |r| → +∞, E is nothing but P’s energy as
measured by the asymptotic inertial observer introduced in Sec. 10.7.5, given that the 4-velocity
of the latter coincides with the Killing vector ξ [compare Eq. (1.23) with Eq. (11.2a)]. Similarly,
L is the component along the rotation axis of P’s (total) angular momentum vector Ltot as
measured by the asymptotic inertial observer [cf. Eq. (7.4)].
Remark 1: Because it represents only a component of the total angular momentum, L is sometimes
denoted by Lz .
    In coordinates (t, r, θ, φ) adapted to spacetime symmetries, i.e. coordinates such that ξ = ∂t
and η = ∂φ , for instance Boyer-Lindquist coordinates (Sec. 10.2.1), advanced Kerr coordinates
(Sec. 10.3.1) or Kerr coordinates (Sec. 10.3.3), one can rewrite (11.2) in terms of the components
pt = gtµ pµ and pφ = gφµ pµ of the 1-form p associated to p by metric duality:
                                               E = −pt                                           (11.3a)
                                               L = pφ                                            (11.3b)
      Example 2 (Ingoing principal null geodesics): For the ingoing principal null geodesics L(v,θ,
                                                                                               in
                                                                                                    φ̃)
      introduced in Sec. 10.4, the 4-momentum vector is
p = αk, (11.5)
      where α is a (positive) constant, since k is a geodesic vector [Eq. (10.45)], as p [Eq. (1.11)]. Equations (11.3)
      with the components relative to Kerr coordinates (t̃, r, θ, φ̃) lead then to
      where the components kt̃ and kφ̃ have been read on Eq. (10.44). Hence for any ingoing principal null
      geodesic L(v,θ,
                 in
                      φ̃) ,
                                           E>0          and        L = aE sin2 θ.                                 (11.6)
      Recall that θ is constant along L(v,θ,
                                         in
                                             φ̃) , so that the above formula does yield a constant value for L.
      Moreover, it fulfills L ≥ 0 with L = 0 only for a = 0 or θ ∈ {0, π} (rotation axis).
      Example 3 (Outgoing principal null geodesics): For the outgoing principal null geodesics L(u,θ,
                                                                                                 out
                                                                                                      ˜ ,
                                                                                                     φ̃)
        out,H       out,Hin
      L(θ,ψ)  and L(θ,ψ)    introduced in Sec. 10.4, the 4-momentum vector is
p = β(λ)ℓ. (11.7)
      where β(λ) is a function of the affine parameter λ associated to p that obeys β(λ) > 0, since both p and ℓ
      are future-directed (cf. Sec. 10.4). Contrary to the coefficient α in Eq. (11.5), β(λ) is not constant because
      ℓ is not a geodesic vector, but only a pregeodesic one: it fulfills ∇ℓ ℓ = κℓ ℓ with κℓ ̸= 0 [Eq. (10.58)]
      (cf. Remark 6 on p. 346). The geodesic equation ∇p p = 0 implies that β obeys the differential equation
      ∇p β + κℓ β 2 = 0, or equivalently
                                                         dβ
                                                             + κℓ β 2 = 0.                                         (11.8)
                                                         dλ
      The outgoing principal null geodesics either (i) are confined to one of the horizons H and Hin , generating
                        out,H           out,Hin
      them (case of L(θ,ψ)      and L(θ,ψ)      ), or (ii) never intersect them (case of L(u,θ,
                                                                                           out
                                                                                                ˜ , cf. the solid curves
                                                                                                φ̃)
      in Figs. 10.6 – 10.7). In the first case, κℓ is a constant on H and Hin , given by Eq. (10.58) with r = r+
      for H and r = r− for Hin . The solution of Eq. (11.8) is then
                                                                   1
                                                   β(λ) =                  ,
                                                              κℓ (λ − λ0 )
      where λ0 is a constant. Outside the horizons, κℓ is the function of r given by Eq. (10.58). We then search
      for a solution of Eq. (11.8) in the form β(λ) = B(r(λ)), where r(λ) is the function giving the coordinate
      r along the geodesic. Since the latter obeys dr/dλ = pr = βℓr with ℓr read on Eq. (10.57), we get the
      following linear differential equation for B:
                                                          dB     r2 − a2
                                       (r2 − 2mr + a2 )      + 2m 2      B = 0.
                                                          dr     r + a2
      The solution is
                                                         r2 + a2
                                          B(r) = 2                 B0 = β(λ),
                                                     r2 − 2mr + a2
      where B0 is a positive constant in MI and MIII and a negative constant in MII , to ensure that β(λ) > 0.
                                  11.2 Equations of geodesic motion                                        377
    Plugging (11.7) into Eqs. (11.3) with the components relative to Kerr coordinates yields
                                       ∆                                          a∆ sin2 θ
           E = −β(λ)ℓt̃ = β(λ)                    and      L = β(λ)ℓφ̃ = β(λ)                ,
                                  2(r + a2 )
                                     2                                           2(r2 + a2 )
where the components ℓt̃ and ℓφ̃ have been read on Eq. (10.60). Given the above results for β(λ), we get
E = 0 and L = 0 on H and Hin , since ∆ = 0 there, and E = B0 and L = aB0 sin2 θ elsewhere. We
conclude that for the outgoing principal null geodesics,
                          in H ∪ Hin ,       E=0        and      L=0                             (11.9a)
                         in MI ∪ MIII ,      E>0        and      L = aE sin θ2
                                                                                                 (11.9b)
                             in MII ,        E<0        and                  2
                                                                 L = aE sin θ.                   (11.9c)
In particular, (11.9a) is nothing but the result (11.4) already obtained in Example 1. We note also that
the relation between L and E is identical to that obtained in Example 2 for the ingoing principal null
geodesics [Eq. (11.6)].
     In what follows, we will use Boyer-Lindquist coordinates (xα ) = (t, r, θ, φ) as introduced in
Sec. 10.2.1. Given the components (10.8) of the metric tensor g in these coordinates, evaluating
E and L via E = −gtµ pµ and L = gφµ pµ yields
                                                     2amr sin2 θ φ
                                            
                                        2mr
                              E = 1− 2          pt +              p .                        (11.10)
                                          ρ               ρ2
                          2amr sin2 θ t                2a2 mr sin2 θ
                                                                    
                    L=−                       2
                                     p + r +a +    2
                                                                       sin2 θ pφ ,           (11.11)
                               ρ2                            ρ2
where ρ2 := r2 + a2 cos2 θ [Eq. (10.9)].
     Let us recall that the components (pα ) of the 4-momentum are related to the parametric
equation xα = xα (λ) of the geodesic L in terms of the affine parameter λ by pα = dxα /dλ
[cf. Eq. (1.21)], i.e.
                                dt          dr          dθ           dφ
                          pt =     , pr =      , pθ =      , pφ =       .                    (11.12)
                                dλ          dλ         dλ            dλ
      where ∆ := r2 − 2mr + a2 [Eq. (10.10)]. Note that we have expressed pr and pθ in terms of
      pr and pθ thanks to the relations pr = grµ pµ and pθ = gθµ pµ , which are very simple for the
      Boyer-Lindquist components (10.8) of g:
                                           ρ2 r
                                    pr =     p        and      pθ = ρ 2 pθ .                     (11.15)
                                           ∆
      This property can be seen as a generalization of the Killing equation (3.19) to tensors of valence
      2. That the tensor K defined by (11.16) obeys the Killing identity (11.18) is established in the
      SageMath notebook D.5.7.
          Killing tensors are discussed in Sec. B.5.2 of Appendix B. It is shown there that the Killing
      identity (11.18) implies that the following quantity is constant along any geodesic L (Prop-
      erty B.21):
                                        K := K(p, p) = Kµν pµ pν .                                (11.19)
      K is named Carter constant (cf. the historical note on p. 387). From the definition (11.16) of
      K, we have
                             K = 2(r2 + a2 )⟨k, p⟩⟨ℓ, p⟩ + r2 g(p, p).                       (11.20)
      Now by Eq. (11.13), g(p, p) = −µ2 . Besides, we have ⟨k, p⟩ = kµ pµ = k µ pµ . The last form lets
      appear the constants of motion pt = −E and pφ = L [Eq. (11.3)]. Using it with the components
      of k as given by Eq. (10.49), we get
                                                   r2 + a2         a
                                     ⟨k, p⟩ = −            E − pr + L.
                                                      ∆            ∆
      Similarly, from the components (10.61) of ℓ, we obtain
                                        1         ∆              a
                              ⟨ℓ, p⟩ = − E +    2   2
                                                       pr +            L.
                                        2    2(r + a )      2(r + a2 )
                                                               2
                               1 h 2             2          i
                        K =       (r + a2 )E − aL − ρ4 (pr )2 − r2 µ2 .                         (11.21)
                               ∆
As discussed above, E is P’s energy as measured by the asymptotic inertial observer O. Then,
according to Einstein’s formula (1.26), E 2 − µ2 = P · P , where P is P’s linear momentum as
measured by O. Given that asymptotically, pr ∼ P r [cf. Eq. (1.22)], Eq. (11.22) becomes
                   K ∼ r2 P · P − (P r )2 = r2 (P (θ) )2 + (P (φ) )2 ,
                                                                       
                         r→+∞
where P (θ) = rP θ ∼ rpθ and P (φ) = r sin θ P φ ∼ r sin θ pφ are the angular components of P
in the orthonormal basis (e(r) , e(θ) , e(φ) ) := (∂r , r−1 ∂θ , (r sin θ)−1 ∂φ ). Now the total angular
momentum of P measured by O is
Hence we may conclude that asymptotically, the Carter constant coincides with the square of
P’s angular momentum as measured by the inertial observer O:
   We shall see later that one has always K ≥ 0. For now, let us establish the following
characterization of null geodesics with K = 0:
  A null geodesic L has a vanishing Carter constant K if, and only if, L is a principal null
  geodesic:
                      K = 0 ⇐⇒ L = L(v,θ,    in
                                                φ̃) or L = L out ,                   (11.24)
  where L(v,θ,
           in
               φ̃) (resp. L
                            out
                                ) is one of the ingoing (resp. outgoing) principal null geodesic
  introduced in Sec. 10.4, with L out standing for L(u,θ,out
                                                             ˜ , L(θ,ψ) or L(θ,ψ)
                                                             φ̃)
                                                                   out,H     out,Hin
                                                                                     .
Proof. Consider expression (11.20) for K . If L is a null geodesic, the term g(p, p) vanishes
identically, so that one is left with
K = 0 ⇐⇒ k · p = 0 or ℓ · p = 0.
      The vectors k, ℓ and p are all null. Now, according to Property 1.4 in Sec. 1.2.2, two null vectors
      are orthogonal iff they are collinear. Hence K = 0 is equivalent to p = αk or p = αℓ with
      α > 0. Since k (resp. ℓ) is tangent to L(v,θ,
                                                in
                                                    φ̃) (resp. L
                                                                 out
                                                                     ), this completes the proof.
      Remark 2: Property 11.2 is consistent with the interpretation (11.23) of K , since asymptotically the
      principal null geodesics are purely radial5 , and hence have Ltot = 0.
      Example 4 (Generators of the event and inner horizons): We have seen in Example 1 in Sec. 11.2.2
      that the null geodesic generators of the event horizon H and the inner horizon Hin have E = 0 and
      L = 0. Since these generators are principal null geodesics (cf. Secs. 10.5.3 and 10.8.3), the above results
      shows that in addition K = 0. As µ = 0 by definition of a null geodesic, we see that the four integrals
      of motion µ, E, L and K all vanish for these geodesics.
             The converse is true:
        The null geodesic generators of the event horizon H and of the inner horizon Hin are the
        only geodesics of Kerr spacetime having their four integrals of motion vanishing:
      Proof. If a geodesic L has (µ, K ) = (0, 0), L is necessary null and (11.24) shows that L is a
      principal null geodesic. Moreover Eq. (11.21) with (µ, E, L, K ) = (0, 0, 0, 0) implies pr = 0,
      i.e. L lies at a constant value of r. If L is ingoing, then p ∝ k, with the Boyer-Lindquist
      components of k given by Eq. (10.49). Since k r = −1, this precludes pr = 0. Hence L is an
      outgoing principal null geodesic and one has p ∝ ℓ, with the Boyer-Lindquist components of ℓ
      given by Eq. (10.61). We read ℓr = ∆/(2(r2 + a2 )) with ∆ ̸= 0, except precisely on H and
      Hin 6 . Hences pr = 0 is possible only on H and Hin .
      Historical note : That K = 0 for principal null geodesics has been pointed out by Jiří Bičák and
      Zdeněk Stuchlík in 1976 [60].
pα has to be thought of as the first order derivative dxα /dλ [Eq. (11.12)]. Two first integrals,
namely E and L, are linear in the pα ’s, while the two others, namely µ2 and K , are quadratic.
Furthermore, Eqs. (11.10) and (11.11) constitute a decoupled subsystem for (pt , pφ ), which can
easily be solved7 , yielding
                                        1 2
                             ρ 2 pt =     [(r + a2 )2 E − 2amrL] − a2 E sin2 θ                      (11.26)
                                        ∆
                                        L      a
                                    ρ 2 pφ =
                                         2  + (2mrE − aL).                                          (11.27)
                                      sin θ ∆
Besides, Eq. (11.21) involves only p and can be recast as
                                    r
with
                                                     2
                             R(r) := (r2 + a2 )E − aL − ∆(r2 µ2 + K ) .                             (11.29)
                                    
Recall that ∆ is the function of r given by Eq. (10.10): ∆ := r2 − 2mr + a2 . All other
quantities appearing in Eq. (11.29) are constant. Accordingly, R(r) is a 4th order polynomial in
r. Equation (11.28) implies that this polynomial must be non-negative along the geodesic L :
R(r) ≥ 0 . (11.30)
    Finally, if we substitute pr by the value given by Eqs. (11.28)-(11.29) in the mass first integral
(11.14), we get, after simplification,
with
                                                                   2
                                                   L
                           Θ(θ) := K −                 − aE sin θ        − µ2 a2 cos2 θ .           (11.32)
                                                 sin θ
Equation. (11.31) imposes that Θ(θ) is non-negative along the geodesic L :
Θ(θ) ≥ 0 . (11.33)
Q := K − (L − aE)2 (11.34)
                                                              L2
                                                                  
                                                   2
                              Θ(θ) = Q + cos θ a (E − µ ) −2   2     2
                                                                     .                              (11.35)
                                                            sin2 θ
      7
   An intermediate step is combining Eqs. (11.10) and (11.11) to get aE − L/ sin2 θ = apt − (r2 + a2 )pφ and
(r + a2 )E − aL = ∆(pt − a sin2 θpφ ).
  2
382                  Geodesics in Kerr spacetime: generic and timelike cases
      Following the standard usage, we call Q the Carter constant as well. To distinguish between
      the two Carter constants, we shall specify Carter constant Q or Carter constant K . As we shall
      see in Sec. 11.3.7, Q is well adapted to the description of the θ-motion of geodesics. On the
      other hand, a nice property of K , which is not shared by Q, is to be always non-negative, as
      Eqs. (11.32) and (11.33) show:
                                                  K ≥0.                                       (11.36)
          If the particle P reaches the asymptotic region r ≫ m, we deduce from Eqs. (11.34) and
      (11.23) the following behavior of Q:
      Hence, if a = 0, Q can be interpreted as the square of the part of P’s angular momentum
      (measured by the asymptotic inertial observer) that is not in L.
      Example 5 (Carter constant Q of the principal null geodesics): As (11.24) shows, for a principal
      null geodesic, be it ingoing or outgoing, the Carter constant K vanishes identically. According to
      Eq. (11.34), the Carter constant Q is then Q = −(L − aE)2 . In view of the relation L = aE sin2 θ for
      these geodesics [Eqs. (11.6) and (11.9)], we get
      where θ is the constant value of the θ-coordinate along the principal null geodesic. Note that the above
      relation holds in all Kerr spacetime, including on the horizons H and Hin , where it reduces to Q = 0,
      for E = 0 there [Eq. (11.9a)]. Equation (11.38) implies
Q ≤ 0, (11.39)
      with Q = 0 only for principal null geodesics lying in the equatorial plane or for the outgoing principal
                       out,H        out,Hin
      null geodesics L(θ,ψ)  and L(θ,ψ)     generating the horizons H and Hin .
      Remark 3: As for K , one may derive the Carter constant Q from a Killing tensor. Indeed, from the
      Walker-Penrose Killing tensor K [Eq. (11.16)], let us form the tensor field
      Being a linear combination with constant coefficients of the Killing vectors η and ξ, η̃ is itself a Killing
      vector. It follows that η̃ ⊗ η̃ is a Killing tensor (cf. Example 3 in Sec. B.5.2), so that K̃ a Killing tensor,
      the Killing equation (11.18) being linear. Applying K̃ to the 4-momentum p, we get
      The Boyer-Lindquist components of the contravariant tensor associated to K̃ by metric duality have an
      expression particularly simple in terms of those of the inverse metric (cf. the notebook D.5.7):
   In view of the relation (11.12) between the pα ’s and the derivatives of the functions xα (λ),
we may collect Eqs. (11.26), (11.27), (11.28) and (11.32) as the first-order system
                          dt    1
                       ρ2    = [(r2 + a2 )2 E − 2amrL] − a2 E sin2 θ                      (11.43a)
                          dλ   ∆
                          dr
                                                                                         (11.43b)
                                  p
                       ρ2    = ϵr R(r)
                          dλ
                          dθ
                                                                                          (11.43c)
                                  p
                       ρ2    = ϵθ Θ(θ)
                          dλ
                            dφ     L   a
                       ρ2      =    2 + (2mrE − aL) ,                                    (11.43d)
                            dλ   sin θ ∆
where ϵr := sgn pr = ±1, ϵθ := sgn pθ = ±1 and the functions R(r) and Θ(θ) are defined by
Eq. (11.29) and Eq. (11.32) or (11.35). Since pr = dr/dλ, ϵr is +1 (resp. −1) in the parts of the
geodesic L where r increases (resp. decreases) with λ. Similarly, ϵθ is +1 (resp. −1) in the
parts of L where θ increases (resp. decreases) with λ.
    We may rewrite Eq. (11.29) for R(r) in terms of Q instead of K , via Eq. (11.34):
                                              2
                 R(r) := (r2 + a2 )E − aL − ∆ r2 µ2 + Q + (L − aE)2 ) .                   (11.44)
                                                                            
Note that, beside the constants of motion E, L, µ and Q, R(r) depends on both Kerr parameters
a and m (via ∆ = r2 − 2mr + a2 ), while Θ(θ) depends on a only [cf. Eq. (11.35)]. Along L ,
these functions must obey R(r) ≥ 0 [Eq. (11.30)] and Θ(θ) ≥ 0 [Eq. (11.33)].
          We deduce from the result (11.46) that at λ = λ0 , dr/dλ moves from positive to negative
      values or vice-versa (depending on the sign of R′ (r0 )), which means that the function r(λ)
      switches from increasing to decreasing or vice-versa, hence the name r-turning point. The
      factor ϵr = ±1 in Eq. (11.43b) necessarily changes sign at λ = λ0 .
      Remark 4: For a generic smooth function r(λ) with dr/dλ = 0 at λ0 , the condition d2 r/dλ2 ̸= 0 at λ0
      is sufficient but not necessary for r to change its direction of variation there. Indeed, the same property
      holds with d2 r/dλ2 = 0 and higher order derivatives vanishing up to some even order k for which
      dk r/dλk ̸= 0. However, in the present case, d2 r/dλ2 = 0 would imply R′ (r0 ) = 0 and we shall see
      Sec. 11.3.6 that a geodesic can reach such a point only asymptotically, i.e. for λ → +∞. Hence it cannot
      be a turning point.
         Similarly, if L is a geodesic that is not stuck at some constant value of the coordinate θ,
      we define a θ-turning point of L as a point p0 ∈ L , the θ-coordinate θ0 of which obeys
                                      dθ                      d2 θ            Θ′ (θ0 )
                                                = 0 and                   =            ̸= 0.                      (11.48)
                                      dλ   λ0                 dλ2    λ0        2ρ40
      This implies that at a θ-turning point, the function θ(λ) switches from increasing to decreasing
      or vice-versa. The factor ϵθ = ±1 in Eq. (11.43c) necessarily changes sign at such a point.
      Remark 5: A comment similar to Remark 4 can be made: a geodesic with varying θ that has dθ/dλ = 0
      for some finite value of λ cannot have d2 θ/dλ2 = 0 at the same point, since we shall see in Sec. 11.3.6
      that a value of θ with both Θ(θ) = 0 and Θ′ (θ) = 0 can only be reached asymptotically along a geodesic.
      Hence (11.47) is a necessary and sufficient condition for a θ-turning point.
                                                   dλ           dλ
                                           dλ′ =     2
                                                       =                     .                                    (11.49)
                                                   ρ     r(λ) + a2 cos2 θ(λ)
                                                             2
      Since ρ2 never vanishes on the spacetime manifold M (by construction of the latter, cf.
      Eq. (10.29)), the above relation leads to a well-defined parameter8 along L . Moreover, since
      ρ2 > 0, λ′ increases towards the future, as λ. A difference between the two parametrizations is
      that λ′ is not an affine parameter of L in general9 , contrary to λ. The parameter λ′ is called
      Mino parameter [368].
         8
           Note however that λ′ may blow up if L comes arbitrarily close to the ring singularity, i.e. if ρ → 0.
         9
           The only exception is for a circurlar orbit at θ = π/2, since then ρ2 is constant and Eq. (11.49) reduces to an
      affine relation between λ and λ′ .
                                         11.2 Equations of geodesic motion                                                385
                                dt      1 2
                                   ′
                                     =    [(r + a2 )2 E − 2amrL] − a2 E sin2 θ                                 (11.50a)
                                dλ     ∆
                                dr
                                                                                                              (11.50b)
                                          p
                                     = ϵr   R(r)
                                dλ′
                                dθ
                                                                                                               (11.50c)
                                          p
                                     = ϵθ   Θ(θ)
                                dλ′
                                dφ       L   a
                                   ′
                                     =    2 + (2mrE − aL) ,                                                   (11.50d)
                                dλ     sin θ ∆
where R is the quartic polynomial defined by Eq. (11.44) and Θ is the function defined by
Eq. (11.35). It is remarkable that Eqs. (11.50b) and (11.50c) are two fully decoupled equations:
Eq. (11.50b) is an ordinary differential equation (ODE) for the function r(λ′ ), while Eq. (11.50c)
is an ordinary differential equation for the function θ(λ′ ). This was not the case for Eqs. (11.43b)
and (11.43c) since ρ2 involves both r and θ.
      where r1,2 = r1 for M odd and r1,2 = r2 for M even. Note that each term in the above sum is
      positive, the sign of ϵr compensating the order of the integral boundaries.
          The right-hand side of Eq. (11.52) is actually a path integral and is often abriged by means
      of a slash notation:                            Z r
                                                            ϵr dr̄
                                            ′     ′
                                          λ − λ0 = − p             .                             (11.53)
                                                        r0   R(r̄)
          Similarly, if Θ(θ) ̸= 0 along L , except possibly at some θ-turning points, Eq. (11.50c) can
      be integrated as
                          Z θ
             ′                 ϵθ dθ̄
            λ −   λ′0   = − p
                           θ     Θ(θ̄)
                          0Z θ
                                   ϵ dθ̄
                                  pθ                                                   if N = 0
                          
                          
                          
                           θ0 Θ(θ̄)
                          
                        =    Z θ1                     Z θ2           Z θ
                                   ϵθ dθ̄                  ϵθ dθ̄          ϵ dθ̄
                                            + (N − 1)                      pθ          if N ≥ 1,
                          
                          
                                 p                        p       +
                                      Θ(θ̄)                  Θ(θ̄)           Θ(θ̄)
                          
                              θ0                       θ1             θ1,2
                                                                                                   (11.54)
      where θ0 := θ(λ′0 ), N is the number of θ-turning points between λ′0 and λ′ , θ1 (resp. θ2 ) is the
      value of θ at the first (resp. second) turning point, if any, θ1,2 = θ1 for N odd and θ1,2 = θ2 for
      N even.
          We are now in position to state the full expression of the general solution for geodesic
      motion:
        Let L be a null or timelike geodesic in Kerr spacetime, with conserved energy E, conserved
        angular momentum L, mass µ and Carter constant Q. We assume that the associated quartic
        polynomial R(r), as defined by Eq. (11.44) (see also Eq. (11.95) below), and the associated
        function Θ(θ), as defined by Eq. (11.35), do not vanish along L except possibly at some
        r-turning points or θ-turning points. Let λ be the affine parameter of L associated with
        the 4-momentum p and λ′ the Mino parameter. If for λ = λ0 L lies at the point of
        Boyer-Lindquist coordinates (t0 , r0 , θ0 , φ0 ), then at any value of λ, the Boyer-Lindquist
                                  11.2 Equations of geodesic motion                                     387
Proof. Equation (11.55a) is nothing but the gathering of Eqs. (11.53) and (11.54). For Eq. (11.55b),
it suffices to rewrite Eq. (11.49) as
and to substitute dλ′ by ϵr dr/ R(r) in the first term [cf. Eq. (11.50b)] and by ϵθ dθ/ Θ(θ) in
                               p                                                       p
the second term [cf. Eq. (11.50c)]. Similarly, by rewriting Eq. (11.50a) as
                                (r2 + a2 )2 E − 2amrL ′
                         dt =                        dλ − a2 E sin2 θ dλ′
                                    r2 − 2mr + a2
and performing the same substitutions for dλ′ as above, we get Eq. (11.55c). Finally, Eq. (11.55d)
is deduced in the same fashion from Eq. (11.50d).
    The system (11.55) shows that the geodesic motion can be fully solved in terms of the Mino
parameter λ′ . Indeed, the integral on r in Eq. (11.55a) can be evaluated by means of elliptic
integrals since R is a polynomial of degree 4. This provides λ′ = λ′ (r). Inverting this relation
via Jacobi elliptic functions (cf. Remark 5 on p. 252) yields r = r(λ′ ). The integral on θ in
Eq. (11.55a) can be evaluated by means of elliptic integrals as well since the change of variable
ζ := cos θ along with expression (11.35) for Θ(θ) results in
               Z θ           Z ζ
                   ϵθ dθ̄                               ϵθ dζ̄
               − p        = −− p                                                       ,
                θ0  Θ(θ̄)     ζ0 Q(1 − ζ̄ 2 ) + ζ̄ 2 [a2 (E 2 − µ2 )(1 − ζ̄ 2 ) − L2 ]
with the term under the square root in the right-hand side being a polynomial of degree 4 in
ζ̄. The use of Jacobi elliptic functions leads then to θ = θ(λ′ ). From r(λ′ ) and θ(λ′ ) one can
get the functions λ(λ′ ), t(λ′ ) and φ(λ′ ) by evaluating the integrals in the right-hand sides of
Eqs. (11.55b)–(11.55d). Again, these integrals are reducible to elliptic integrals. We shall not
give the details of all the elliptic integrals computations, referring the reader to Ref. [206] for
bound timelike geodesics and to Ref. [236] for null geodesics.
Historical note : The constant of motion K has been discovered by Brandon Carter in 1968 [90], in a
study about Kerr-Newman spacetimes, which generalize the Kerr ones to nonzero global electric charge.
Carter actually did not get K from the Killing tensor K, which was discovered only two years later
388                     Geodesics in Kerr spacetime: generic and timelike cases
      by Martin Walker and Roger Penrose [508]; he started instead from the Lagrangian (B.39) governing
      both timelike and null geodesics, derived the corresponding Hamiltonian as H = g µν pµ pν (for the
      uncharged case, i.e. Kerr) and discovered that the Hamilton-Jacobi equation is separable, i.e. can be
      solved by separation of variables. K appeared then as a separation constant. Note that Carter used
      the Kerr coordinates11 (v, r, θ, φ̃) described in Sec. 10.3.1 and not the Boyer-Lindquist ones. Carter also
      introduced the constant Q via Eq. (11.34). He obtained the equivalent of the first-order system (11.43)
      for the Kerr coordinates. Actually two of his equations, those for dr/dλ and dθ/dλ, are identical to
      Eqs. (11.43b) and (11.43c) of the Boyer-Lindquist system. This is not surprising since the coordinates r
      and θ are the same in both systems. Carter’s equations for the Kerr coordinates v and φ̃ are slightly
      more complicated than Eqs. (11.43a) and (11.43d) for the Boyer-Lindquist coordinates t and φ. It seems
      that the Boyer-Lindquist first-order system (11.43) has been first derived by Daniel Wilkins in 1972 [522],
      starting from Carter’s system and performing the transformation to Boyer-Lindquist coordinates. The
      integrated geodesic equations (11.55) have been obtained by Carter in 1968 [90]: Eq. (11.55a) (without
      the λ′ − λ′0 part) is Eq. (58) of Ref. [90] and Eq. (11.55b) is Eq. (59) of Ref. [90]. Regarding Eqs. (11.55c)
      and (11.55d), Carter obtained equivalent ones for the Kerr coordinates v and φ̃, which he was using [his
      Eqs. (60) and (61)].
        If a null or timelike geodesic L has some part lying in the exterior of the ergoregion G (cf.
        Sec. 10.2.4), then the conserved energy E defined by Eq. (11.2a) is necessarily positive:
L ̸⊂ G =⇒ E > 0. (11.56)
      Proof. By the very definition of the ergoregion G (cf. Sec. 10.2.4), the Killing vector ξ is
      timelike in the exterior of G , i.e. in M \ G . Moreover, it is future-directed there, given the
      time orientation defined in Sec. 10.3.2. The 4-momentum p is either timelike or null and
      always future-directed. By Eq. (1.5a) in Lemma 1.1, one has then necessarily ξ · p < 0; hence
      E := −ξ · p > 0 in M \ G . Since E is constant along L , it follows that E > 0 everywhere.
      In particular, any timelike or null geodesic that reaches one of the asymptotic regions r → ±∞
      has E > 0.
      Remark 1: Property 11.5 generalizes Property 7.2 obtained for Schwarzschild spacetime to the case
      a ̸= 0. Indeed, for Schwarzschild spacetime, the exterior of the ergoregion is nothing but the exterior of
      the black hole region.
         Inside the ergoregion, the Killing vector ξ is spacelike and E can be either positive, zero or
      negative. A particle with E < 0 (resp. E = 0) is called a negative-energy particle (resp. a
        11
             Carter’s u is our v.
                                     11.3 Main properties of geodesics                                            389
Figure 11.1: Projection in the equatorial plane t = const and θ = π/2 of the worldline of a particle and its
4-momentum p, which decay at event A in two particles: one with 4-momentum p′ , which leaves to infinity, and
one with 4-momentum p∗ , which falls into the black hole (black region). The grey zone is the outer ergoregion.
zero-energy particle). Note that according to Property 11.5, neither a negative-energy particle
nor a zero-energy one can exist outside the ergoregion.
Example 6 (Generators of the event and inner horizons): We have already encountered zero-energy
particles in Example 1 of Sec. 11.2.2: the photons whose worldlines are the null generators of the horizons
H and Hin [cf. Eq. (11.4)].
Example 7 (Outgoing principal null geodesics in MII ): We have seen in Example 3 (p. 376) that
in the region MII , the outgoing principal null geodesics L(u,θ,
                                                             out
                                                                 ˜ have E < 0 [Eq. (11.9c)]. This is
                                                                 φ̃)
consistent with Property 11.5 because MII is entirely contained in the ergoregion G (cf. Fig. 10.2).
Remark 2: It should be stressed that, for a particle lying in the ergoregion, the quantity E is not the
particle’s energy measured by some physical observer, as given by formula (1.23). Indeed, since ξ is a
spacelike vector in the ergoregion, it cannot be identified with any observer 4-velocity. In other words,
any local observer measuring the energy of a negative-energy particle or a zero-energy one, according
to the process described in Sec. 1.4, will find a positive value. Hence there is nothing mysterious about
such particles.
∆E = ξ · p∗ |A = −E∗ , (11.59)
                                         ξ · p∗ | +ΩH η · p∗ |B < 0,
                                         | {z B}      | {z }
                                            −E∗              L∗
                                                  E∗ > ΩH L∗ .                                       (11.60)
                                   11.3 Main properties of geodesics                                        391
Given that ΩH > 0 [cf. Eqs. (10.69) and (10.1)], we deduce that if a Penrose process occurs,
i.e. if E∗ < 0, then necessarily L∗ < 0. Similarly to the energy gain (11.57), we may define the
angular momentum gain of the process P → P ′ + P∗ by
where Lin (resp. Lout ) is the angular momentum of P (resp. P ′ ) as measured by an asymptotic
inertial observer, i.e. the conserved angular momenta of particles P and P ′ as defined in
Sec. 11.2.2: Lin = η · p|∞ and Lout = η · p′ |∞ [Eq. (11.2b)]. The same computation as above
for ∆E, except for ξ replaced by η, shows that the energy-momentum conservation law
p|A = p′ |A + p∗ |A leads to a result similar to (11.59):
∆L = − η · p∗ |A = −L∗ . (11.62)
Along with Eq. (11.59), this relation turns the inequality (11.60) into ΩH ∆L > ∆E. This shows
that any Penrose process yields ∆L > 0 in addition to ∆E > 0. Let us summarize:
  The Penrose process as described above and illustrated in Fig. 11.1 extracts both some
  energy ∆E > 0 and some angular momentum ∆L > 0 from a rotating black hole, in such
  a way that
                                  ΩH ∆L > ∆E > 0,                                  (11.63)
  where ΩH is the black hole rotation velocity [cf. Eq. (10.69)]. The secondary particle P∗
  falling into the black hole during the process has both a negative conserved energy E∗ and
  a negative conserved angular momentum L∗ , which obey Eq. (11.60): ΩH L∗ < E∗ < 0.
Remark 4: Setting ΩH = 0 in Eq. (11.63) shows that the Penrose process is not possible for a non-
rotating (Schwarzschild) black hole. This is not surprising since there is no outer ergoregion for such a
black hole, the ergosphere coinciding with the event horizon (cf. Remark 5 on p. 331).
    Ultimately, the energy and angular momentum extracted via the Penrose process are taken
from the mass m and the angular momentum J = am of the black hole, so that these two
quantities must decrease slightly (i.e. in a infinitely small manner, given that P, P ′ and P∗
are assumed to be “test particles” (geodesic worldlines), which implies ∆E ≪ m and ∆L ≪ J).
This will be discussed in details in Sec. 16.3.1.
    The Penrose process can be extended from particles to fields; it is then known as the
phenomenon of superradiance, also called superradiant scattering. This occurs when the
scattering of a (scalar, vector or tensor) field by a rotating black hole results in a outgoing
field energy larger than the ingoing one. The energy of the field is defined from its energy-
momentum tensor T by constructing the “energy density” vector P as P α := −T αRµ ξ µ (compare
with Eq. (11.2a)). The energy of the field through a hypersuface Σ is then EΣ := Σ ⋆P , where
⋆P is the the Hodge dual of the 1-form P , i.e. the 3-form defined by (⋆P )αβγ := P µ εµαβγ [cf.
Eq. (5.38) with n = 4 and p = 1]. Now, by combining the energy-momentum conservation
law ∇µ T µα = 0 [Eq. (1.45)] and the Killing equation for ξ, it is easy to show that P is a
392                   Geodesics in Kerr spacetime: generic and timelike cases
      Actually the same superradiance condition holds for the Fourier modes of electromagnetic
      waves [477, 99], as well as gravitational waves [477].
      Historical note : The Penrose process has first been suggested by Roger Penrose in 1969, in the review
      article [408] (cf. the footnote 7 in this article); the detailed calculation has been presented subsequently
      in an article with Roger Floyd published in 1971 [409]. In particular, Penrose and Floyd have derived the
      inequality (11.60) [Eq. (4) in Ref. [409], taking into account that a/(2mr+ ) = ΩH , cf. Eq. (10.69)]. The
      extension to fields (superradiance) was introduced by Yakov Zeldovich in 1971 [527, 528] and explicit
      computations for a scalar field in the Kerr metric have been performed by his doctoral student Alexei
      Starobinsky in 1972 (published: 1973) [460], as well as by William Press and Saul Teukolsky at the same
      time [420]. It seems that the superradiance of a scalar field scattered by a Kerr black hole has also been
      discovered around 1971 by Charles Misner: mention of unpublished results by him can be found in his
      article [369], as well as in a 1973 article by Jacob Bekenstein [51].
    As discussed in Sec. 10.3.2, the time orientation of the Kerr spacetime (M , g) is set by the
global null vector field k generating the ingoing principal null geodesics, cf. Eq. (10.32). By
virtue of Lemma 1.2 [cf. Eq. (1.6a)], we have then, for p not collinear with k (in particular for
L timelike),
                                                              r 2 + a2           a
  p future-directed ⇐⇒ g(k, p) < 0 ⇐⇒ k µ pµ < 0 ⇐⇒                    pt − pr + pφ < 0,
                                                                  ∆              ∆
where the last inequality follows from the Boyer-Lindquist components (10.49) of k. Given the
identities pt = −E, pφ = L [Eq. (11.3)] and pr = ϵr R(r)/∆ [Eqs. (11.15) and (11.43b)], we
                                                    p
conclude:
  Property 11.7
            L timelike or       1 h 2                       i
                                                                                           (11.66)
                                                       p
                             =⇒    (r + a2 )E − aL + ϵr R(r) > 0 .
          k not tangent to L    ∆
Given expression (11.44) for R(r) and ∆ := r2 − 2mr + a2 [Eq. (10.10)], we note that this
constraint involves (µ, E, L, Q) and r, but not θ.
   Another global future-directed null vector field on the Kerr spacetime (M , g) is the null
tangent ℓ to the outgoing principal null geodesics, introduced in Sec. 10.4 [Eq. (10.57)]. If we
apply Lemma 1.2 with ℓ instead of k, we get, using the Boyer-Lindquist components (10.61) of
ℓ,
  Property 11.8
                    L timelike or
                                                                                           (11.67)
                                                            p
                                    =⇒ (r2 + a2 )E − aL − ϵr R(r) > 0 .
                 ℓ not tangent to L
   In MI ∪ M
           pIII , ∆ > 0 and we can combine (11.66) and (11.67) to (r + a )E − aL >
                                                                      2      2
Property 11.9
    A related constraint in MI ∪ MIII is obtained from the vector ε(0) of the Carter frame
introduced in Sec. 10.7.4. As shown there, ε(0) is a linear combination of k and ℓ [cf. Eq. (10.112)]
and is a future-directed timelike vector in all MI ∪ MIII . We can then apply Lemma 1.1, using
expression (10.108a) of ε(0) in terms of ξ = ∂t and η = ∂φ :
p future-directed in MI ∪ MIII ⇐⇒ g(ε(0) , p) < 0 ⇐⇒ (r2 + a2 ) g(ξ, p) +a g(η, p) < 0.
                                                                | {z }     | {z }
                                                                             −E           L
Property 11.10
      Notep
          that if L is not a principal null geodesic, (11.69) is a mere consequence of (11.68), given
      that R(r) ≥ 0.
         An immediate corollary of Property 11.10 is:
Property 11.11
         Another constraint is obtained by considering the vector field N normal to the hyper-
      surfaces of constant Boyer-Lindquist coordinate t, which has been introduced in Sec. 10.7.3:
             →
             −
      N = −dt [Eq. (10.88)]. As shown in Sec. 10.7.3, N is future-directed timelike in the subpart
      MZAMO := MI ∪ (MIII \ T ) of Kerr spacetime. Lemma 1.1 [cf. Eq. (1.5a)] then yields
Property 11.12
      Let us recall that T is the Carter time machine (cf. Sec. 10.2.5), which occupies a very lim-
      ited portion of MIII . Since pt = dt/dλ and λ increases towards the future (cf. Sec. 11.2.1),
      Property 11.12 implies
Property 11.13
         Using Eq. (11.43a) with dt/dλ = pt and the fact that ∆ > 0 in MI ∪ MIII , we can rewrite
      Property 11.12 as
        Property 11.14
                                     11.3 Main properties of geodesics                                             395
                      y/m
                      3
                      2
                      1
                                                                                         x/m
                                2       4        6              8   10   12      14
                      -1
                      -2
Figure 11.2: Trajectory in the equatorial plane of an incoming timelike geodesic with E = µ, L = 0 and Q = 0,
plunging into a Kerr black hole with a = 0.998 m. The figure is drawn in terms of the Cartesian Boyer-Lindquist
coordinates (x, y) defined by Eq. (11.73) with θ = π/2 (equatorial plane) and the grey disk marks the black hole
region. [Figure generated by the notebook D.5.8]
Remark 5: In the outer ergoregion G + = G ∩ MI (i.e. the part of the ergoregion outside of the black
hole), E ≤ 0 is allowed, but Property 11.12 shows that pt ≤ 0 is not allowed.
Historical note : That E ≤ 0 in the outer ergoregion implies L < 0 (Property 11.11) has been shown
for equatorial geodesics by George Contopoulos in 1984 [141].
      Cartesian Boyer-Lindquist coordinates (t, x, y, z), which are defined in the r > 0 region of
      Kerr spacetime and are related to the Boyer-Lindquist coordinates (t, r, θ, φ) by the standard
      transformation from spherical to Cartesian coordinates:
         We note that, at a given point (r, θ), the angular velocity (11.72) coincides with that of the
      zero-angular momentum observer (ZAMO) at that point [compare Eq. (10.99)].
      11.3.5      Winding near the event horizon and the inner horizon
      Let us consider a null or timelike geodesic L in the vicinity of the black hole event horizon H .
      On H , r = r+ and ∆ = 0. Then, the term ∆−1 in Eq. (11.43a) makes dt/dλ diverge as r → r+ ,
      except in the very special where (r+  2
                                               + a2 )E − aL = 0, which is equivalent to E = ΩH L
      according to Eq. (10.69). Similarly, dφ/dλ, as given by Eq. (11.43d), diverges as r → r+ , except
      for E = ΩH L. These two divergences are not a pathology of L per se; they reflect merely
      the singularity of Boyer-Lindquist coordinates (t, r, θ, φ) at H (cf. Sec. 10.2.6). However, we
      read on Eqs. (11.43a) and (11.43d) that the ratio dφ/dt|L := dφ/dλ × (dt/dλ)−1 converges to
      a finite value:
                                             dφ           a
                                        lim         = 2        = ΩH ,                            (11.74)
                                       r→r+ dt
                                                 L     r+ + a2
      where the second equality follows from Eq. (10.69), making the black hole rotation velocity
      ΩH appear. Hence we conclude:
        Any null or timelike geodesic that approaches the event horizon H is winding around it
        in terms of the Boyer-Lindquist coordinates at exactly the black hole rotation velocity ΩH .
      Remark 6: For a timelike geodesic on a circular orbit, we have argued in Sec. 7.3.3 that dφ/dt|L is
      the angular velocity of as seen by an asymptotic inertial observer. The reasoning was given in the
      Schwarzschild spacetime context but it actually involved only the spacetime symmetry by translation in
      t, so it is applicable here.
      Remark 7: When a ̸= 0, a geodesic that starts far from the black hole with dφ/dt|L < 0 [according to
      Eq. (11.43d) with r ≫ m, this occurs for L < 0] must necessarily have a turning point in φ if it reaches
      the event horizon, in order to fulfill (11.74), where ΩH is positive. This is illustrated in Fig. 11.3 and is
      in sharp contrast with the Schwarzschild case, where φ is always a monotonic function of λ, as shown
      in Sec. 7.2.2.
      Remark 8: The winding property does not hold for the Kerr or advanced Kerr coordinates. Indeed,
      we have seen in Sec. 10.4 that the ingoing principal null geodesics L(v,θ,
                                                                             in
                                                                                 φ̃) are geodesics along which
      φ̃ is constant. They are therefore not winding in terms of neither the Kerr coordinates (t̃, r, θ, φ̃) nor
      the advanced Kerr ones (v, r, θ, φ̃). This difference of (coordinate) behavior is understandable if one
                                        11.3 Main properties of geodesics                                              397
                       y/m
                        1
                                                                                           x/m
                                    2          4            6        8          10    12
                       -1
                       -2
                       -3
Figure 11.3: Trajectory in the equatorial plane of an incoming null geodesic with L = −6E < 0 and Q = 0,
plunging into a Kerr black hole with a = 0.998 m. Note the turning point in φ and the final winding in the
direction of the black hole rotation (counterclockwise in the figure). The figure is drawn in terms of the Cartesian
Boyer-Lindquist coordinates (x, y) defined by Eq. (11.73) with θ = π/2 (equatorial plane) and the grey disk marks
the black hole region. [Figure generated by the notebook D.5.8]
considers the diverging behavior in ∆−1 of the relation dφ̃ = dφ + a/∆ dr [Eq. (10.23b)] between the
angular coordinates φ and φ̃.
    Regarding the Boyer-Lindquist coordinate behavior in the vicinity of the inner horizon Hin
(corresponding to the second root r− of ∆), we deduce from Eqs. (11.43a) and (11.43d) that
                                               dφ                a
                                        lim             =    2
                                                                      = Ωin ,                               (11.75)
                                        r→r−   dt   L       r−   + a2
where the second equality follows from Eq. (10.71); it involves the rotation velocity Ωin of the
inner horizon Hin . Hence
  Property 11.16: behavior of φ near the inner horizon
  Any null or timelike geodesic that approaches the inner horizon Hin is winding around it
  in terms of the Boyer-Lindquist coordinates at exactly the rotation velocity Ωin of Hin .
which is a divergent improper integral. If r1 is a higher order root of R, the divergence is even
worse, being triggered by a higher power of 1/(r1 − r̄). Consequently, Eq. (11.51) with r = r1
398                Geodesics in Kerr spacetime: generic and timelike cases
      implies that the Mino parameter diverges: λ′ → +∞. Since ρ2 > 0, provided that ρ2 does not
      tend to 0 (the ring singularity) as λ′ → +∞, this implies that the affine parameter λ tends to
      +∞ as well, given the relation (11.49) between the two parameters. Hence we conclude:
        Let L be a geodesic of Kerr spacetime that does not lie at a fixed value of r and let R(r)
        be the associated polynomial (11.44). A point away from the ring singularity R and of r-
        coordinate r1 such that both R(r1 ) = 0 and R′ (r1 ) = 0 can only be reached asymptotically
        by L , i.e. in the limit λ → ±∞ of the affine parameter λ. We call r1 an asymptotic
        r-value of the geodesic L .
      This property explains why R(r0 ) = 0 and R′ (r0 ) ̸= 0 is a necessary condition to have a
      r-turning point (cf. Remark 4 on p. 384).
          By the same reasoning, we have
        Let L be a geodesic of Kerr spacetime that does not lie at a fixed value of θ and let Θ(θ)
        be the associated function (11.32). A point away from the ring singularity R and of θ-
        coordinate θ1 such that both Θ(θ1 ) = 0 and Θ′ (θ1 ) = 0 can only be reached asymptotically
        by L , i.e. in the limit λ → ±∞ of the affine parameter λ. We call θ1 an asymptotic
        θ-value of the geodesic L .
      In particular, the θ-turning points (cf. Sec. 11.2.6) are characterized by V (θ) = Q and V ′ (θ) ̸= 0,
      while the asymptotic θ-values (cf. Sec. 11.3.6) correspond to V (θ) = Q and V ′ (θ) = 0.
          We recognize in Eq. (11.76) the first integral of a (non-relativistic) 1-dimensional motion
      in the potential V (θ), which we shall call the effective θ-potential. The discussion of the
                                      11.3 Main properties of geodesics                                    399
geodesic θ-motion is then based on the properties of that potential, Q in the right-hand side
of Eq. (11.76) playing the role of the constant “total energy”. Since (dθ/dλ′ )2 ≥ 0, Eq. (11.76)
implies
                                           V (θ) ≤ Q.                                     (11.79)
Accordingly, given a plot of V (θ), as in Figs. 11.4 and 11.5, the allowed range of θ is determined
by the part of the graph of V (θ) that lies below the horizontal line of ordinate equal to Q.
    We shall distinguish the case L = 0 from the case L ̸= 0, since they lead to different shapes
of the potential V (θ).
Geodesics with L = 0
If L = 0, the effective θ-potential reduces to V (θ) = a2 (µ2 − E 2 ) cos2 θ. We have then three
subcases:
             ◦ for Q = 0, the geodesic lies at a constant value of θ, within the range [0, π];
             ◦ for Q > 0, θ varies monotonically along the geodesic, which therefore crosses the
               rotation axis an infinite number of times.
                                                                                                 V(θ) /V0
                 0.6                                                                                         0.4
                 0.4                                                                                         0.6
                 0.2                                                                                         0.8
                                                       Q=0                                                   Q = Qmin                                                         Q = Qmin
                 0.0                                                                                         1.0
                             0       θm   1π
                                          4
                                                          1π
                                                          2
                                                                              3π
                                                                              4    π − θm    π                       0            1π
                                                                                                                                  4    θv           1π
                                                                                                                                                    2       π − θv34 π            π
                                                           θ                                                                                        θ
             Figure 11.4: Effective θ-potential V (θ) in the case L = 0 and a ̸= 0. V (θ) is plotted in units of V0 := a2 |µ2 −E 2 |.
             The left figure is for |E| < µ, with colored dots or horizontal lines corresponding to geodesic trajectories for four
             values of the Carter constant Q: 0, Q1 = 0.75V0 , a2 (µ2 − E 2 ) and Q2 = 1.2V0 . The right figure is for |E| > µ,
             with trajectories corresponding to four values of Q: Qmin = −a2 (E 2 − µ2 ), Q1 = −0.3V0 , 0 and Q2 = 0.2V0 .
             Dashed lines indicate trajectories with asympotic θ-values.
                                                L 0, L 2 ≥ a 2 (E 2 − µ 2 )                                                            L 0, L 2 < a 2 (E 2 − µ 2 )
                                                                                                             4
                             8                                                                                                                 Q = Q2
                                                                                                             2
                             6                                                                                                                  Q=0
                 V(θ) /L 2
V(θ) /L 2
                                                        Q = Q1                                               0
                             4                                                                                                Q = Q1                                 Q = Q1
                                                                                                             2
                             2
                                                         Q=0                                                 4            Q = Qmin                                   Q = Qmin
                             0
                                 0   θm    1π
                                           4
                                                            1π
                                                            2
                                                                              3π
                                                                              4
                                                                                    π − θm   π                   0       θm      1π
                                                                                                                                 4     θv          1π
                                                                                                                                                   2        π − θv 34 π π − θm    π
                                                               θ                                                                                   θ
             Figure 11.5: Effective θ-potential V (θ) in the case L ̸= 0. The left figure is for L2 ≥ a2 (E 2 − µ2 ), with colored
             dots or horizontal lines corresponding to geodesic trajectories for two values of the Carter constant Q: 0 and
             Q1 = 4L2 . The right figure is for L2 < a2 (E 2 − µ2 ), with trajectories corresponding to four values of Q: Qmin ,
             Q1 = −2L2 , 0 and Q2 = 2L2 . The dashed line (Q = 0) indicates the trajectory with an asympotic θ-value, which
             is π/2.
                                     ◦ −a2 (E 2 − µ2 ) < Q < 0: the geodesic oscillates about the rotation axis, without
                                       reaching the equator; one has
                                                                 p either θ ∈ [0, θv ] or θ ∈ [π − θv , π], with the turning
                                                   15
                                       point value θv := arccos |Q|/(a2 (E 2 − µ2 )) < π/2 (cf. the trajectories Q = Q1
                                       in Fig. 11.4, right).
                                     ◦ Q = 0: θ = π/2 is either an unstable position or an asymptotic θ-value, i.e. the
                                       geodesic approaches the equatorial plane when λ → ±∞.
                                     ◦ Q > 0: θ varies in all the range [0, π]; when the geodesic reaches the rotation axis,
                                       it crosses it (cf. the trajectory Q = Q2 in Fig. 11.4, right).
                       15
                             The index v in θv stands for vortical, the definition of which is given at the end of this section.
                                11.3 Main properties of geodesics                                   401
Geodesics with L ̸= 0
When L ̸= 0, we see from expression (11.77) that V (θ) → +∞ when θ → 0 or π. We conclude
immediately that the geodesic cannot reach the rotation axis for L ̸= 0. Moreover, we have
                                                               L2
                                                                   
                       dV                      2  2     2
                           = −2 cos θ sin θ a (µ − E ) +              .
                       dθ                                    sin4 θ
                     dV            π                                L2
                        = 0 ⇐⇒ θ =             or   sin4 θ =                  .           (11.80)
                     dθ            2                           a2 (E 2 − µ2 )
We are thus led to distinguish two cases, depending whether or not the equation involving
sin4 θ has solutions distinct from π/2:
                                                          q
    • Case L ≥ a (E − µ ) ⇐⇒ a = 0 or |E| ≤ µ2 + La2 : the only extremum solution
               2     2   2     2                                    2
     right part of Fig. 11.5), which are located at θ = θ∗ , π/2 and π − θ∗ with
                                       s
                                               |L|                        π
                          θ∗ := arcsin     p             and 0 < θ∗ < .               (11.81)
                                          a E 2 − µ2                      2
                                             p           2
                                  Qmin          2   2
                                         = − a E − µ − |L| .                              (11.82)
x := cos2 θ, (11.83)
                        a2 (E 2 − µ2 )x2 + L2 + Q − a2 (E 2 − µ2 ) x − Q = 0.                       (11.84)
                                                                 
      If a2 (E 2 − µ2 ) = 0, the solution is x = Q/(L2 + Q). Moreover, in this case one has necessarily
      Q ≥ 0, so that 0 ≤ x ≤ 1 and the unique solution in (0, π/2) is
                                                   s
                                                            Q
                                     θm = arccos               .                                    (11.85)
                                                       L2   +Q
                                                                   a2 (E 2 −µ2 )=0
      In the rest of this section, we assume that a2 (E 2 − µ2 ) ̸= 0. Equation (11.84) is then a quadratic
      equation in x, the discriminant of which is
                                                       2
                            ∆ = L2 + Q − a2 (E 2 − µ2 ) + 4Qa2 (E 2 − µ2 )
                               
                                                       2
                              = L2 + Q + a2 (E 2 − µ2 ) − 4L2 a2 (E 2 − µ2 ).
                               
      It turns out that ∆ is always non-negative. Indeed, from the second equality above, we have
      clearly ∆ ≥ 0 if a2 (E 2 − µ2 ) ≤ 0. In the complementary case, i.e. when a2 (E 2 − µ2 ) > 0,
      then ∆ ≥ 0 as soon as Q ≥ Q2 , where Q2 is the larger of the two roots of ∆ considered as
      a quadratic polynomial in Q. Again, considering
                                                    p the second equalitypin the expression of ∆,
      we have Q2 = −L2 − a2 (E 2 − µ2 ) + 2|L|a E 2 − µ2 = −(|L| − a E 2 − µ2 )2 . In view of
      Eq. (11.82), we realize that Q2 = Qmin , so that Q ≥ Q2 always holds and ∆ ≥ 0. Accordingly
      the two roots of Eq. (11.84) are real and are given by
                                             s                                  
                                  2                       2
                                                                 2
                    1           L +Q                    L +Q              4Q
              x± = 1 − 2 2             2
                                           ±     1− 2 2        2
                                                                    + 2 2         .        (11.86)
                    2        a (E − µ )               a (E − µ )      a (E − µ2 )
                                      11.3 Main properties of geodesics                                 403
Since x := cos2 θ and θ = 0 and θ = π/2 cannot correspond to a θ-turning point, acceptable
solutions must obey
                                       0 < x± < 1.                                  (11.87)
       For L = 0, Eq. (11.86) simplifies to16
                                                                       Q
                                x+ = 1        and       x− = −
                                                                 a2 (E 2 − µ2 )
Now, x+ = 1 is excluded by (11.87). There remains only x− , leading to the turning point value
found in the L = 0 cases discussed above, which we can combine into a single formula:
                                             s
                                                     Q
                           θm = θv = arccos                   .                        (11.88)
                                               a (µ − E 2 )
                                                2   2
                                                                           L=0
This formula assumes that 0 < Q/(a2 (µ2 − E 2 )) < 1 (as for the trajectories Q = Q1 in both
panels of Fig. 11.4), otherwise there is no θ-turning point (as for the trajectories Q = Q2 in
both panels of Fig. 11.4). More precisely, for Q > 0 and µ2 − E 2 > 0, it leads to θm , while for
Q < 0 and µ2 − E 2 < 0, it leads to θv .
   For L ̸= 0, one shall distinguish between two cases, since the case |E| = µ has been covered
above [Eq. (11.85)]:
   • if |E| < µ, then only x− fulfills the criterion (11.87), leading to the turning point value:
                      v                                                                      
                      u                            s                     2
                      u1
                      u               2
                                    L +Q                         2
                                                               L +Q                 4Q
         θm = arccos t 1 + 2 2                 −      1 +                   −                
                         2      a (µ − E 2 )               a2 (µ2 − E 2 )      a2 (µ2 − E 2 )
                                                                                              (11.89)
   • if |E| > µ, then x+ always fulfills the criterion (11.87), leading to the turning point value:
                      v                                                                     
                      u                            s                     2
                      u1
                      u              2
                                   L +Q                          2
                                                               L +Q                  4Q
         θm = arccos t 1 − 2 2              2
                                                +      1− 2 2          2
                                                                             + 2 2           .
                        2       a (E − µ )                  a (E − µ )          a (E − µ2 )
                                                                                              (11.90)
         If, in addition Q < 0, then x− also fulfills (11.87) and we get a second turning point value
         (cf. the case Q = Q1 in the right panel of Fig. 11.5):
                          v                                                                   
                          u                             s                   2
                          u1
                          u              2
                                       L +Q                         2
                                                                  L +Q                 4Q
              θv = arccos t 1 − 2 2             2
                                                    −      1− 2 2          2
                                                                                + 2 2          .
                             2       a (E − µ )                 a (E − µ )         a (E − µ2 )
(11.91)
Remark 9: At the limit L → 0, formulas (11.89) and (11.91) reduce both to (11.88).
  16
       A switch between x+ and x− is performed if 1 + Q/(a2 (E 2 − µ2 )) < 0.
404               Geodesics in Kerr spacetime: generic and timelike cases
      Summary
      We can summarize the above results by
           • A geodesic L of Kerr spacetime cannot encounter the rotation axis unless it has
             L = 0.
Q ≥ 0. (11.92)
           • The Carter constant Q can take negative values only if L2 < a2 (E 2 − µ2 ), which
             implies a ̸= 0 and |E| > µ; the range of Q is then limited from below:
                                                  p              2
                                     Q ≥ Qmin = − a E 2 − µ2 − |L| .                          (11.93)
A geodesic with Q < 0 is called vortical; it never encounters the equatorial plane.
           • If Qmin < Q < 0, L never encounters the equatorial plane, having a θ-motion
             entirely confined either to the Northern hemisphere (0 < θ < π/2) or to the
             Southern one (π/2 < θ < π); if L ̸= 0, L oscillates between two θ-turning points, at
             θ = θm and θ = θv (Northern hemisphere) or at θ = π − θv and θ = π − θm (Southern
             hemisphere), where θm and θv are given by Eqs. (11.90) and (11.91) respectively; if
             L = 0, L oscillates about the rotation axis, with a θ-turning point at θ = θv or
             θ = π − θv , where θv is given by Eq. (11.88).
                                       11.3 Main properties of geodesics                                       405
Remark 10: We could have deduced that L = 0 is a necessary condition for a geodesic to encounter
the rotation axis without studying the potential V (θ). Indeed, by definition, L = η · p [Eq. (11.2b)],
with the Killing vector η being zero on the rotation axis, since the latter is pointwise invariant under
the action of the rotation group SO(2). Hence L = 0 on the rotation axis. Since L is constant along the
geodesic, the result follows immediately.
Remark 11: That a vortical geodesic never intersects the equatorial plane has been obtained by
examining the various cases Q < 0. This result can be derived directly from the constraint Θ(θ) ≥ 0
[Eq. (11.33)], by noticing the identity Θ(π/2) = Q, which follows immediately from Eq. (11.35).
Example 8 (Schwarzschild geodesics): It is instructive to apply the above results to a = 0 and recover
the geodesics in Schwarzschild spacetime studied in Chaps. 7 and 8. There, spherical symmetry was
used to select the coordinates (t, r, θ, φ) so that the geodesic was confined to the hyperplane θ = π/2.
Consequently there was no θ-motion. Here, we keep the coordinates (t, r, θ, φ) fixed and do not assume
that they are adapted to the geodesic L under consideration. So θ may vary along L . In Schwarzschild
spacetime, the Carter constant Q is always non-negative, since the inequality L2 ≥ a2 (E 2 − µ2 ) is
trivially satisfied for a = 0 [cf. Eq. (11.92)].
If Q = 0, then for L = 0, L lies at a constant value of θ: this actually corresponds to a purely radial
geodesic. Indeed, the total angular momentum measured in the asymptotic region is Ltot = 0 in that
case [set Q = 0, L = 0 and a = 0 in Eq. (11.37)]. If L ̸= 0, still with Q = 0, L lies stably in the
equatorial plane (this is the only possibility for a = 0 among the Q = 0 cases listed above).
If Q > 0, then L = 0 is necessarily the subcase (ii) listed above: L crosses repeatedly the z-axis
(θ ∈ {0, π}); this corresponds to the case where the orbital plane contains the z-axis. All the angular
momentum measured asymptotically is then contained in Q [cf. Eq. (11.37)]. For Q > 0 and L ̸= 0, L
oscillates symmetrically about the equatorial plane: this is the case where the orbital plane is inclined by
an angle ι ∈ (0, π/2) with respect to the equatorial plane; the θ-turning point of L is then θm = π/2−ι.
(11.95)
      cannot be satisfied for large values of |r|. A geodesic with |E| < µ is therefore located within
      a bounded region in terms of the radial coordinate r. Moreover, |E| < µ implies necessarily
      L2 > a2 (E 2 − µ2 ), so that the results of Sec. 11.3.7 lead to Q ≥ 0 [cf. Eq. (11.92)]. Therefore,
      −a2 Q ≤ 0. It follows then that r cannot be negative. Indeed, we see on expression (11.95) that
      for |E| < µ (which implies µ > 0 and Q ≥ 0) and r < 0,
      R(r) = (E 2 − µ2 )r4 + 2mµ2 r3 + a2 (E 2 − µ2 ) − Q − L2 r2 + 2m Q + (L − aE)2 r −a2 Q,
                                                                                             
              |    {z      } | {z } |                   {z            } |            {z           } | {z }
                    <0            <0                    ≤0                            ≤0               ≤0
        Any geodesic L with |E| < µ is necessarily timelike (µ > 0), has a non-negative Carter
        constant Q, is confined to the region r ≥ 0 of Kerr spacetime and cannot reach arbitrarily
        large values of r. L is called a geodesic with a bound orbit, or in short, a bound geodesic.
      Remark 12: That L cannot reach the asymptotic region r → +∞ could have been found directly from
      the definition of E as the “energy at infinity” of the particle P having L as worldline [Eq. (11.2a)].
      Indeed, since L is timelike, the 4-momentum of P is p = µu [Eq. (1.18)], where u is the 4-velocity
      of P, so that Eq. (11.2a) yields E = −µ ξ · u. Now, when r → +∞, ξ tends to the 4-velocity of an
      inertial observer (cf. Sec. 10.7.5). If L could reach the asymptotic region, the scalar product of the
      two 4-velocities ξ and u would be necessarily lower or equal to −1, or equivalently Γ := −ξ · u ≥ 1
      (the proof lies in expression (1.35) of Γ with 0 ≤ V · V < 1), so that we would have E ≥ µ, which
      contradicts |E| < µ.
Property 11.21
        Any geodesic with |E| = µ has Q ≥ 0 and is necessarily confined to the region r ≥ 0 of
        Kerr spacetime.
      Proof. Q ≥ 0 follows directly from the criteria |E| ≤ µ2 + L2 /a2 [cf. Eq. (11.92)], which is
                                                              p
      evidently fulfilled for |E| = µ. Furthermore, for |E| = µ, expression (11.95) for R(r) simplifies
      to
                      R(r) = 2mµ2 r3 − (Q + L2 )r2 + 2m Q + (L − aE)2 r − a2 Q.
                                                                            
      For r < 0, all the four terms in the above sum are ≤ 0. If L is timelike, then µ ̸= 0 and the
      first term is < 0, so that r < 0 =⇒ R(r) < 0, which violates the constraint R(r) ≥ 0. Let us
      now assume that L is null. We have then µ = 0 and E = 0, so that
               R(r) = −(Q + L2 )r2 + 2m(Q + L2 )r − a2 Q = (Q + L2 )r(2m − r) − a2 Q.
                                  11.3 Main properties of geodesics                                       407
    A timelike geodesic L with E = µ can reach the asymptotic region r → +∞. It has then
a Lorentz factor with respect to the asymptotic static observer of 4-velocity ξ equal to one (cf.
Remark 12 above), which implies that p is collinear to ξ. In that sense, L is asymptotically “at
rest”. Such a geodesic is called marginally bound.
    On the opposite, a null geodesic with E = µ(= 0) cannot reach the asymptotic region
r → +∞. Actually, it cannot even exist outside the ergoregion, by virtue of Property 11.5.
Example 9: The null geodesics generating the horizons H and Hin considered in Example 1 on p. 375
have E = 0 [Eq. (11.4)] and are indeed fully located in the ergoregion G , since H ⊂ G and Hin ⊂ G
(cf. Fig. 10.2).
  Only geodesics with |E| > µ may have some part in the region r < 0 of Kerr spacetime,
  M− :
                              L ∩ M− ̸= ∅ =⇒ |E| > µ.                            (11.96)
  In region MII , the coordinate r must decrease towards the future along any timelike or
  null geodesic:
                                         dr
                                            <0      .                             (11.97)
                                         dλ     MII
408                   Geodesics in Kerr spacetime: generic and timelike cases
      Example 10 (Principal null geodesics): For the ingoing principal null geodesics L(v,θ,    in
                                                                                                   φ̃) , a future-
      directed tangent vector is k and we have k = dr/dλ = −1, since the affine parameter associated with
                                                 r
      k is λ = −r [Eq. (10.46)]. Hence for these geodesics, r is decreasing towards the future everywhere
      in M , and in particular in MII . For the outgoing principal null geodesics L(u,θ,
                                                                                     out
                                                                                         ˜ , a future-directed
                                                                                         φ̃)
      tangent vector is ℓ and the associated (non-affine) parameter obeys Eq. (10.56):
                                                    dr        ∆
                                                       =            .
                                                    dλ   2(r + a2 )
                                                            2
        A null or timelike geodesic L that approaches the ring singularity has a vanishing Carter
        constant Q.
      The constraints R(r) ≥ 0 [Eq. (11.30)] and Θ(θ) ≥ 0 [Eq. (11.33)] imply then respectively
      Q ≤ 0 and Q ≥ 0, from which we get Q = 0.
             The above property can be refined17 :
        17
          This is a refinement because not all geodesics with Q = 0 lie in the equatorial plane. For instance the null
      geodesic generators of the event horizon H have Q = 0 [Eq. (11.25)] but those with θ ̸= π/2 lie outside the
      equatorial plane.
                                11.3 Main properties of geodesics                                    409
Property 11.25: ring singularity reachable only from the equatorial plane
  A null or timelike geodesic cannot approach the ring singularity unless it lies entirely in
  the equatorial plane.
Proof. Let L be a causal geodesic that approaches R. From the previous result, L has Q = 0.
Reviewing the subcases Q = 0 amongp    all the cases considered in Sec. 11.3.7, we see that L can
approach θ = p
             π/2 iff (i) L has |E| ≤ µ2 + L2 /a2 and lies stably in the equatorial plane or (ii)
L has |E| > µ2 + L2 /a2 and approaches asymptotically θ = π/2 as the Mino parameter λ′
tends to ±∞. Let us show that (ii) is not compatible with r → 0. For Q = 0, expression (11.95)
for R(r) reduces to
with the constant term inside the square brackets 2m(L − aE)2 ̸= 0, since L = aE is not
compatible with |E| > µ2 + L2 /a2 . It follows that r = 0 is a simple root of R(r). If L
                       p
would reach both r = 0 and θ = π/2 when λ tends to some value λ∗ , then Eq. (11.55a) would
yield
                                     Z 0           Z π
                                          ϵr dr       2 ϵθ dθ
                            ′    ′
                          λ∗ − λ0 = − p         =− p          ,                      (11.98)
                                      r0   R(r)     θ0   Θ(θ)
where λ′∗ is the Mino parameter corresponding to λ∗ . Note that λ′∗ may be infinite even if λ∗ is
finite, due to the relation (11.49) with ρ2 → 0 when λ → λ∗ . Since r = 0 is a simple root of
R(r), the integral on r has a finite value. Now, for Q = 0, expression (11.35) for Θ reduces to
                                                              L2
                                                                  
                                        2     2   2    2
                             Θ(θ) = cos θ a (E − µ ) −               .
                                                            sin2 θ
The cos2 θ term, which behaves as (θ − π/2)2 for θ near π/2, makes the integral on θ in
Eq. (11.98) divergent, which is incompatible with the finite value of the integral on r in the
left-hand side. Hence only (i) is possible, which completes the proof.
    Property 11.25 has been obtained for a generic approach to the ring singularity, i.e. for
a geodesic L that hits R, emanates from R or asymptotically approach L in the future or
the past. If we apply it to null geodesics (light rays) emanating from R, we conclude that an
intrepid observer diving into the black hole would not see the ring singularity at all, except
when he crosses the equatorial plane. At this instant, the singularity would appear to him as
a 1-dimensional segment, and the image would disappear as soon as the observer leaves the
equatorial plane. In particular, the observer will never see a ring-like image.
      However, such a motion is possible only if R(0) ≥ 0 (condition (11.30) at the boundary r = 0).
      The case R(0) = 0 is excluded for it would correspond either to a r-turning point (Sec. 11.2.6)
      or to an asymptotic r-value (Sec. 11.3.6). In both cases, L would remain on a single side of
      the hypersurface r = 0. The necessary condition for r = 0 crossing is thus R(0) > 0. Now, in
      view of expression (11.95) of R, we have R(0) = −a2 Q, so that R(0) > 0 ⇐⇒ Q < 0. We
      thus conclude
        Only a vortical geodesic (Q < 0) can cross the hypersurface r = 0 and thus move from
        the positive-r region of spacetime (M+ ) to the negative-r one (M− ), or vice-versa.  In
        particular, such a geodesic must have a high energy, i.e. it must obey |E| > µ2 + L2 /a2 .
                                                                                    p
      The energy condition is simply the necessary condition for Q < 0 stated in Sec. 11.3.7. We note
      that it implies |E| > µ, which is consistent with the property (11.96) required to travel in M− .
      Example 11: The ingoing principal null geodesics with θ ̸= π/2 cross the hypersurface r = 0 (cf. the
      dashed green lines in Figs. 10.6 and 10.9, as well as the green lines for θ = π/6 in Fig. C.2) and are
      indeed vortical, since their Carter constants (11.38) obey Q < 0 for θ ̸= π/2. Moreover,p
                                                                                              they have µ = 0
      and L = aE sin θ [Eq. (11.6)], with sin θ < 1 for θ ̸= π/2, so that they fulfill |E| > µ2 + L2 /a2 .
                       2                        2
τ = µλ (11.99)
      and let us introduce the specific conserved energy ε, specific conserved angular momentum
      ℓ , and reduced Carter constant q:
                                       E               L                        Q
                                ε :=     ,      ℓ :=          and        q :=      .                (11.100)
                                       µ               µ                        µ2
      We shall refer to ε, ℓ and q as the reduced integrals of motion of the geodesic L . Note that
      in geometrized units (c = G = 1), ε is dimensionless, ℓ has the dimension of a length and q
      that of a squared length.
          The first-order equations of motion (11.50) can be rewritten in terms of the above quantities:
                                         11.4 Timelike geodesics                                       411
                                         dt
                                              = T1 (r) + T2 (θ)                           (11.101a)
                                         dτ ′
                                               2
                                           dr
                                                   − R(r) = 0                             (11.101b)
                                           dτ ′
                                               2
                                           dθ
                                                   − Θ̃(θ) = 0                            (11.101c)
                                           dτ ′
                                         dφ
                                              = Φ1 (r) + Φ2 (θ)                           (11.101d)
                                         dτ ′
τ ′ = µλ′ (11.102)
and
                              ε(r2 + a2 )2 − 2amℓr
                  T1 (r) :=                        ,        T2 (θ) := −a2 ε sin2 θ,        (11.103)
                                 r2 − 2mr + a2
                                  a(2mεr − aℓ)                      ℓ
                        Φ1 (r) :=                ,    Φ2 (θ) :=         ,        (11.104)
                                   2
                                 r − 2mr + a   2                 sin2 θ
      R(r) := (ε2 − 1)r4 + 2mr3 + a2 (ε2 − 1) − q − ℓ2 r2 + 2m q + (ℓ − aε)2 r − a2 q ,
                                                                         
                                                                                 (11.105)
                                                               2
                                                                  
                                                              ℓ
                          Θ̃(θ) := q + cos2 θ a2 (ε2 − 1) −           .          (11.106)
                                                            sin2 θ
Note that R(r) = R(r)/µ2 , Θ̃(θ) := Θ(θ)/µ2 and that expressions (11.105) and (11.106) follow
respectively from Eqs. (11.95) and (11.35).
    Using Eqs. (11.99) and (11.49), we can relate τ ′ to the proper time τ :
                                         dτ            dτ
                                dτ ′ =      =                        .                     (11.107)
                                         ρ2   r(τ )2 + a2 cos2 θ(τ )
                                                                        −R(r)/m 4
                              −R(r)/m 4
                                                                         1.5
                               30
                                                                           1
                                                                         0.5
                                                                                                      rp         r/m
                                 20                                     -0.5   0.5      1   1.5   2        2.5
10
                                                                                                       r/m
                                                   2 rp             4               6       ra
                                 -10
-20
      Figure 11.6: Effective potential −R(r) corresponding to a = 0.998 m, ε = 0.9, ℓ = 2m and q = 1.3 m2 . The
      black vertical line marks the black hole horizon at r = r+ ≃ 1.063 m. Note that the polynomial −R(r) has
      four real roots, one of them being very close to, but distinct from, r+ . The two largest roots give the periastron
      and apoastron of the bound orbit with the above values of (ε, ℓ, q); they are respectively rp ≃ 2.175 m and
      ra ≃ 6.853 m. [Figure generated by the notebook D.5.8]
As shown in Sec. 11.3.8, such a geodesic has necessarily a non-negative Carter constant:
q≥0 (11.109)
                                                           Z   ra
                                                                      dr
                                                  Λr = 2            p     .                                            (11.111)
                                                             rp      R(r)
                                        11.4 Timelike geodesics                                     413
Since R(r) is a polynomial of degree 4 [cf. Eq. (11.105)], the integral in the right-hand side of
Eq. (11.111) can be evaluated by means of elliptic integrals. We shall not give the detail here,
referring the interested reader to the article [206].
    Regarding the θ-motion, we are in the case L2 > a2 (E 2 − µ2 ) considered in Sec. 11.3.7,
since for bound geodesics E 2 − µ2 < 0. The effective θ-potential V (θ) = Q − Θ(θ) has then
the shape shown in the left panel of Fig. 11.4 for ℓ = 0 and in the left panel of Fig. 11.5 for
ℓ ̸= 0. We can then distinguish three types of bound orbits:
   • polar orbit: L crosses the rotational axis an infinite number of times; this occurs iff
     ℓ = 0 and q > a2 (1 − ε2 );
   • equatorial orbit: L is entirely contained in the equatorial plane θ = π/2; for a bound
     orbit, this occurs iff q = 0;
   • non-polar and non-equatorial orbit: L never crosses the rotational axis and oscillates
     symmetrically about the equatorial plane between two θ-turning points at θ = θm ∈
     (0, π/2) and θ = π − θm ; for a bound orbit, this occurs iff q > 0 and (ℓ ̸= 0 or q <
     a2 (1 − ε2 )).
Strictly speaking, there is also the exceptional case ℓ = 0 and q = a2 (1 − ε2 ), for which the
rotation axis is reached asymptotically (cf. Sec. 11.3.6 and the grey dashed curve in the left
panel of Fig. 11.4).
    Circular equatorial orbits will be discussed in Sec. 11.5. In the following, we focus on
non-polar and non-equatorial orbits, which constitute the generic category of bound timelike
orbits. One has then θm ≤ θ ≤ π − θm , with θm given by Eq. (11.89):
                 v                                                 
                 u                  s              2
                 u1
                 u         2
                          ℓ +q               2
                                            ℓ +q            4q
     θm = arccos t 1 + 2      2
                                  −    1+ 2      2
                                                       − 2          ,                  (11.112)
                   2   a (1 − ε )        a (1 − ε )     a (1 − ε2 )
   From the viewpoint of Eq. (11.101c) above, non-polar orbits have a periodic θ-motion in the
potential well −Θ̃(θ):
                              ∀n ∈ Z, θ(τ ′ + nΛθ ) = θ(τ ′ ).                        (11.114)
The period Λθ is the Mino time
                            q ∆τ spent in a round-trip between θm and π − θm . By rewriting
                                ′
                                   Z   π−θm                   Z   π/2
                                               dθ                        dθ
                          Λθ = 2              q      =4                 q      ,        (11.115)
                                   θm          Θ̃(θ)           θm        Θ̃(θ)
414                   Geodesics in Kerr spacetime: generic and timelike cases
      Figure 11.7: Meridional section of the torus occupied by a bound timelike geodesic. The dot-dashed line is the
      rotation axis and the solid line marks the equatorial plane.
      where the last equality results from the symmetry of Θ̃(θ) with respect to π/2. One naturally
      associates to the periods Λr and Λθ the Mino angular frequencies:
                                                  2π                           2π
                                         Υr :=              and        Υθ :=      .                        (11.116)
                                                  Λr                           Λθ
rp ≤ r ≤ ra and θm ≤ θ ≤ π − θm . (11.117)
      This means that in terms of the Cartesian Boyer-Lindquist coordinates (11.73), the geodesic is
      confined inside a torus (cf. Fig. 11.7), which is symmetric about the equatorial plane. Moreover,
      if the two frequencies Υr and Υθ are not commensurable, i.e. if Υθ /Υr ̸∈ Q, the geodesic fills
      the torus. Examples of orbits with Υθ /Υr ∈ Q, i.e. orbits with a closed trajectory in the (r, θ)
      plane, can be found in Ref. [244].
           As an illustration, Figs. 11.8–11.10 show a bound timelike geodesic L of reduced integrals
      of motion ε = 0.9, ℓ = 2m and q = 1.3 m2 orbiting around a Kerr black hole with a = 0.998 m.
      It has rp ≃ 2.175 m, ra ≃ 6.853 m and θm ≃ 1.060 rad ≃ 60.75◦ . The effective radial potential
      −R(r) of this geodesic is the one depicted in Fig. 11.6. Figures 11.8–11.10 show actually
      the segment 0 ≤ τ ≤ 600 m of L , starting at the point of Boyer-Lindquist coordinates
      (0, (rp + ra )/2, π/2, 0). That θ(τ ) lies in the interval [θm , π − θm ] ∼ [60◦ , 120◦ ] appears clearly
      in the right panel of Fig. 11.9. On Figs. 11.9 (left panel) and 11.10, we notice a generic feature of
      eccentric orbits in the Kerr metric, known as zoom-whirl18 : the particle falls from its apoastron
      to the central region (“zoom in”) and performs some quasi-circular revolutions at close distance
      to the black hole (it “whirls”), reaching the periastron, and finally goes back to the apoastron.
      The zoom-whirl behavior exists as well for eccentric orbits in Schwarzschild spacetime (cf.
      right panel of Fig. 7.12), but is less pronounced, i.e. the number of whirls is smaller (only one
      in Fig. 7.12).
           Let us now consider the evolution of the coordinates t and φ along the geodesic, as governed
      by Eqs. (11.101a) and (11.101d). Since r and θ are periodic functions of τ ′ , so are the terms
        18
          According to Ref. [223], the name “zoom-whirl” has been forged by Curt Cutler and Eric Poisson and may
      have been suggested by Kip Thorne.
                                            11.4 Timelike geodesics                                                  415
Figure 11.8: Trajectory with respect to the Cartesian Boyer-Lindquist coordinates (x, y, z) [Eq. (11.73)] of a
bound timelike geodesic of reduced integrals of motion ε = 0.9, ℓ = 2m and q = 1.3 m2 , orbiting around a Kerr
black hole with a = 0.998 m. These parameters are the same as for the −R(r) plot in Fig. 11.6. [Figure generated
by the notebook D.5.8]
                     y/m
                     6
                     4                                                       z/m
                     2                                                       3
                                                                             2
                                           x/m
    -6    -4    -2         2     4     6                                     1
                     -2                                                                                    x/m
                                                      -6      -4      -2               2       4       6
                                                                            -1
                     -4
                                                                            -2
                     -6                                                     -3
Figure 11.9: Projection in the (x, y) plane (equatorial plane; left panel) and the (x, z) plane (meridional plane;
right panel) of the bound timelike geodesic considered in Fig. 11.8. The grey disk depicts the black hole region.
On the left panel, there appears clearly that the minimal value of r is rp ≃ 2.2 m, while its maximal value is
ra ≃ 6.9 m. On the right panel, we check that θm ≤ θ ≤ π − θm with θm ∼ 60◦ , in agreement with the generic
behavior illustrated in Fig. 11.7. [Figure generated by the notebook D.5.8]
416                  Geodesics in Kerr spacetime: generic and timelike cases
      Figure 11.10: Spacetime diagram based on the Cartesian Boyer-Lindquist coordinates (t, x, y) [Eq. (11.73)]
      depicting the bound timelike geodesic considered in Figs. 11.8 and 11.9. The black cylinder is the black hole event
      horizon. [Figure generated by the notebook D.5.8]
      T1 (r), T2 (θ), Φ1 (r) and Φ2 (θ) that appear in the right-hand side of Eqs. (11.101a) and (11.101d).
      We can thus expand them in Fourier series:
                              +∞                                       +∞                     
                                                    ′                                  ′
                              X                                        X
                  T1 (r) =          T̂1k eikΥr τ = ⟨T1 (r)⟩ +              T̂1k eikΥr τ + c.c.                (11.118a)
                             k=−∞                                      k=1
                              +∞                                       +∞                              
                                             ikΥθ τ ′                                  ikΥθ τ ′
                              X                                        X
                  T2 (θ) =          T̂2k e              = ⟨T2 (θ)⟩ +          T̂2k e              + c.c.      (11.118b)
                             k=−∞                                      k=1
                              +∞                                        +∞                       
                                             ikΥr τ ′                                      ′
                              X                                         X
                  Φ1 (r) =          Φ̂1k e              = ⟨Φ1 (r)⟩ +           Φ̂1k eikΥr τ + c.c.            (11.118c)
                             k=−∞                                       k=1
                              +∞                                        +∞                       
                                             ikΥθ τ ′                                      ′
                              X                                         X
                  Φ2 (θ) =          Φ̂2k e              = ⟨Φ2 (θ)⟩ +           Φ̂2k eikΥθ τ + c.c. ,          (11.118d)
                             k=−∞                                       k=1
      where “c.c.“ stands for complex conjugate and ⟨T1 (r)⟩ = T̂10 , ⟨T2 (θ)⟩ = T̂20 , ⟨Φ1 (r)⟩ = Φ̂10
      and ⟨Φ2 (θ)⟩ = Φ̂20 are the mean values of the functions T1 (r(τ ′ )), T2 (θ(τ ′ )), Φ1 (r(τ ′ )) and
                                             11.4 Timelike geodesics                                      417
with
                                         Γ := ⟨T1 (r)⟩ + ⟨T2 (θ)⟩,                             (11.121)
and
                                        Υφ := ⟨Φ1 (r)⟩ + ⟨Φ2 (θ)⟩.                             (11.122)
Γ and Υφ represent the constant (or average) parts of respectively dt/dτ ′ and dφ/dτ ′ , while
the oscillatory parts are the terms involved in the sums over k ≥ 1. Since Γ is the average value
of dt/dτ ′ , we can define the average angular frequencies with respect to Boyer-Lindquist
time by [173, 206]
                                  Υr               Υθ             Υφ
                            Ωr :=     ,    Ωθ :=      ,    Ωφ :=      .                  (11.123)
                                   Γ                Γ              Γ
We may summarize the above results as follows:
  The motion of a generic bound timelike geodesic in Kerr spacetime is periodic in r and θ
  only in terms of the Mino time τ ′ , the corresponding angular frequencies being Υr and Υθ .
  In terms of the Boyer-Lindquist time t, the motion in r or θ is not periodic. In this respect,
  the “average frequencies” Ωr and Ωθ defined above are not true frequencies like Υr or Υθ .
  Regarding the motion in φ, it is not periodic, neither in terms of the Mino time, nor in
  terms of the Boyer-Lindquist time, so that both Υφ and Ωφ are average frequencies.
Quasi-Keplerian parametrization
Given a bound timelike geodesic L of Kerr spacetime, one may define its eccentricity e and
(dimensionless) semilatus rectum p by means of Keplerian-like formulas [447, 174, 461]:
                                         pm                                pm
                                rp =:                  and       ra =:        ,                (11.124)
                                        1+e                               1−e
418                    Geodesics in Kerr spacetime: generic and timelike cases
      or equivalently
                                            2ra rp                      ra − rp
                                       p=                 and      e=           .                    (11.125)
                                         m(ra + rp )                    ra + rp
      One may also introduce the inclination angle θinc by
                                                       π
                                               θinc := − (sgn ℓ)θm .                                 (11.126)
                                                       2
      The sign of ℓ appears in this formula to enforce θinc ∈ [0, π/2] for prograde orbits (ℓ > 0) and
      θinc ∈ [π/2, π] for retrograde orbits (ℓ < 0).
          Instead of (ε, ℓ, q), a bound timelike geodesic can be parameterized19 by (p, e, θinc ). Actually,
      there is a one-to-one correspondence between (ε, ℓ, q) and (p, e, θinc ): it is clear from formulas
      (11.125) and (11.125) that (p, e, θinc ) are functions of (ε, ℓ, q), because (i) ra and rp are some
      roots of the quartic polynomial R(r), whose coefficients are functions of (ε, ℓ, q) only (at fixed
      Kerr parameters (m, a)) [cf. Eq. (11.105)] and (ii) θm in the right-hand side of expression (11.126)
      for θinc is the function of (ε, ℓ, q) given by Eq. (11.112). Conversely, to express (ε, ℓ, q) in terms
      of (p, e, θinc ), one shall write            
                                                    R(rp ) = 0
                                                   
                                                   
                                                      R(ra ) = 0
                                                   
                                                   
                                                      Θ̃(θm ) = 0,
                                                   
      where rp and ra are the functions of (e, p) given by Eq. (11.124) and θm is the function of θinc
      given by Eq. (11.126). The above system is then a system of 3 equations for the 3 unknowns
      (ε, ℓ, q); see Appendix B of Ref. [447] or Appendix A of Ref. [174] for details.
      Example 12: The geodesic L depicted in Figs. 11.8–11.10 has ε = 0.9, ℓ = 2m and q = 1.3 m2 , from
      which we get rp = 2.175 m, ra = 6.853 m and θm = 1.060 rad = 60.75◦ . Then Eqs. (11.125) and
      (11.126) yield p = 3.302, e = 0.518 and θinc = 29.26◦ .
          The parametrization (p, e, θinc ) allows one to introduce along the geodesic L the radial
      phase angle ψ(τ ) and the meridional phase angle χ(τ ) as monotonically increasing functions
      of τ such that
                            pm
              r(τ ) =:                      and     θ(τ ) =: arccos (cos θm cos χ(τ )) .   (11.127)
                       1 + e cos ψ(τ )
      For numerical integration, ψ(τ ) and χ(τ ) are preferred to r(τ ) and θ(τ ) since the latter are
      tricky at the turning points, where dr/dτ and dθ/dτ vanish and change sign [173].
Remark 1: The above result follows as well from the analysis of the θ-motion performed in Sec. 11.3.7
and according to which the only possible value of the Cartan constant Q for a motion confined to the
equatorial plane is Q = 0, be the geodesic timelike or null. The converse is not true: there exist (timelike
or null) geodesics with Q = 0 outside the equatorial plane; they are asymptotically approaching the
equatorial from one side, except for the exceptional case L = 0 and a2 E 2 = a2 µ2 , for which they are
moving with a constant value of θ (cf. Sec. 11.3.7).
    When θ = π/2, we have sin2 θ = 1, ρ2 = r2 and dτ ′ = r−2 dτ [cf. Eq. (11.107)], so that the
system (11.101) reduces to
                                                                      
                     dt          1                       2am
                         = 2                  2   2
                                          ε(r + a ) +         (aε − ℓ)                (11.129a)
                     dτ    r − 2mr + a2                   r
                     2
                       dr
                            + V(r) = 0                                                (11.129b)
                      dτ
                                                               
                     dφ          1                2m        2amε
                         = 2               ℓ 1−          +          ,                 (11.129c)
                     dτ    r − 2mr + a2            r          r
with
                                           2m ℓ2 + a2 (1 − ε2 ) 2m(ℓ − aε)2
                     V(r) := 1 − ε2 −         +                −            .                        (11.130)
                                            r       r2              r3
Note that V(r) = −R(r)/r4 = −R(r)/(µ2 r4 ) [cf. Eqs. (11.105) and (11.95)].
Remark 2: At the Schwarzschild limit (a = 0) we recover the differential equations of Chap. 7, namely
Eq. (11.129a) reduces to Eq. (7.34), Eq. (11.129b) to Eq. (7.24) and Eq. (11.129c) to Eq. (7.35). In particular
we have V(r) = 1 − ε2 + 2Vℓ (r), where Vℓ (r) is the effective potential (7.25). Note however that when
a ̸= 0, V(r) cannot be considered as an effective potential for the radial motion, because it depends
on ε, in addition to ℓ, even if we subtract from it the constant term 1 − ε2 . This is why we did not
bother to add a 1/2 factor in Eq. (11.129b), as we did for Eq. (7.24) to make it match the first integral of a
1-dimensional motion in a potential well.
Historical note : The potential (11.130), which is ruling the radial motion of timelike geodesics confined
to the equatorial plane, has been exhibited first by Robert Boyer and Richard Lindquist in 1966 (published
1967) [72], in the form r3 V(r), cf. their Eq. (5.6), taking into account that they were using −a instead of
a (cf. the historical note on p. 340).
      The above condition is not sufficient to single out a geodesic worldline: there exist worldlines
      with constant r = r0 that obey the whole system (11.129) (which includes (11.132)) but that are
      not solution of the geodesic equation (11.1) (see Ref. [489] for concrete examples). One must
      add a second condition, which is obtained as follows. For any timelike equatorial geodesic (not
      necessarily circular), the following relation must hold:
                                            dr d2 r
                                                              
                                                           ′
                                                 2     + V (r) = 0.
                                           dτ dτ 2
      It is obtained by differentiating Eq. (11.129b) with respect to τ . If the geodesic is not circular, at
      any point where r is not stationary (i.e. excluding r-turning points), we have dr/dτ ̸= 0 and
      the above relation implies
                                                  ′          d2 r
                                                V (r) = −2 2 .
                                                             dτ
      Since in the vicinity of any circular geodesic there exist non-circular ones (those with small
      eccentricity e, as defined by Eq. (11.125)), by continuity, we shall ask that this relation holds for
      circular geodesics as well. For the latter ones, r = r0 and d2 r/dτ 2 = 0, so that it simplifies to
                                                      V ′ (r0 ) = 0 .                               (11.133)
      One can verify20 that this relation, in conjunction with Eq. (11.132), is sufficient to eliminate
      the non-geodesic circular worldlines.
          To summarize, circular timelike geodesics in the equatorial plane are obtained by solving
      the system (11.132)-(11.133). Given expression (11.130) for V, this system takes the form
                      (1 − ε2 )r03 − 2mr02 + ℓ2 + a2 (1 − ε2 ) r0 − 2m(ℓ − aε)2 = 0            (11.134a)
                                                            
                         2
                                                                                               (11.134b)
                               2     2      2
                                                                2
                      mr0 − ℓ + a (1 − ε ) r0 + 3m(ℓ − aε) = 0.
      This is a system of 2 nonlinear equations for 3 unknowns: r0 , ε and ℓ. We thus expect a
      1-parameter family of solutions. It is convenient to choose r0 as the parameter. We have thus
      to solve (11.134) for (ε, ℓ). The change of variable
                                                      ℓ̃ := ℓ − aε                                  (11.135)
      turns (11.134) into the system
                                                                                                   (11.136a)
                                  3 2
                                   r0 ε = mℓ̃2 + r02 (r0 − m)
                                   2ar0 εℓ̃ = (3m − r0 )ℓ̃2 + r0 (mr0 − a2 ).                      (11.136b)
      Let us consider the square of Eq. (11.136b):
                                             h                              i2
                                 2 2 2 2                  2              2
                               4a r0 ε ℓ̃ = (3m − r0 )ℓ̃ + r0 (mr0 − a ) ,                          (11.137)
      and substitute (11.136a) for ε2 in it; we get
         r0 (r0 − 3m)2 − 4a2 m ℓ̃4 + 2r02 (3m2 − a2 )r0 − m(r02 + a2 ) ℓ̃2 + r03 (mr0 − a2 )2 = 0.
                                                                   
      This is a quadratic equation for X := ℓ̃2 . Its discriminant turns out to have a simple form:
                                           ∆ = 16a2 mr03 (r02 − 2mr0 + a2 )2 .
      It follows immediately that ∆ ≥ 0 ⇐⇒ r0 ≥ 0. Hence, there is no solution for r0 < 0:
        20
             See Ref. [489] for details.
                             11.5 Circular timelike orbits in the equatorial plane                                      421
Property 11.28
  There does not exist any equatorial circular timelike orbit in the region r < 0 of Kerr
  spacetime.
Remark 3: For r0 < 0 with |r0 | ≫ m, the above result is not surprising since we have seen in Sec. 10.2.2
that under these conditions, the Kerr metric appears as a Schwarzschild metric with a negative mass.
The asymptotic gravitational field is then repulsive and certainly does not admit any circular orbit.
In the region r < 0 and |r| small, the above argument does not apply. However, the results obtained
in Sec. 11.3.8 show that if a circular orbit would exist there, it would necessarily have |E| > µ [cf.
Eq. (11.96)], i.e. would be unbound.
The case r0 = 0 is excluded as well, since in the equatorial plane, it would correspond to the
curvature singularity. In what follows, we therefore assume
                                                       r0 > 0 .                                            (11.138)
                      √        √
We have then           ∆ = 4ar0 mr0 |r02 − 2mr0 + a2 | and the solutions of the quadratic equation
are21
                                                              √
                         (a2 − 3m2 )r02 + mr0 (r02 + a2 ) ∓ 2a mr0 (r02 − 2mr0 + a2 )
             ℓ̃2±   = r0                                                              .                    (11.139)
                                           r0 (r0 − 3m)2 − 4a2 m
                              2      1 h
                                     2                2     √     2
                                                                    i 1
               r0 (r0 − 3m) − 4a m =    (r0 (r0 − 3m)) − (2a mr0 ) = A+ A− ,
                                     r0                               r0
with
                                                        √
                               A± := r0 (r0 − 3m) ± 2a mr0 .
                                                         √
In the numerator of (11.139), we may use the identity ∓2a mr0 = A∓ − r0 (r0 − 3m) and get,
after simplification,
               r02                                  r2
           ℓ̃2± =    A∓ (r02 − 2mr0 + a2 ) − A+ A− = 0 r02 − 2mr0 + a2 − A± .
                                                                          
             A+ A−                                  A±
                                      √                 √
Using r02 − 2mr0 + a2 − A± = a2 ∓ 2a mr0 + mr0 = (a ∓ mr0 )2 , we obtain
                                                            √
                                                    r02 (a ∓ mr0 )2
                                         ℓ̃2±   = 2            √    .                                      (11.140)
                                                 r0 − 3mr0 ± 2a mr0
                                                                                       √
Since obviously ℓ̃2± ≥ 0, this expression gives birth to the constraint r02 − 3mr0 ± 2a mr0 > 0,
which is equivalent to
                                    3/2       1/2       √
                                   r0 − 3mr0 ± 2a m > 0.                                 (11.141)
  21
       For future convenience, we have chosen the sign ∓ in the numerator to be the opposite of the label ± in ℓ̃2± .
422                Geodesics in Kerr spacetime: generic and timelike cases
                                                                                                  1/2
      The left hand-side of this inequality is a cubic polynomial in x := r0 and determining
      its sign amounts to compute its roots. Fortunately, this corresponds
                                                                       √              to a depressed cubic
      equation x + px + q = 0, with p := −3m and q := ±2a m, the discriminant of which is
                  3
      −(4p3 + 27q 2 ) = 108m(m2 − a2 ) ≥ 0. The roots (xk )k∈{0,1,2} are then all real√    and are given by
      Viète’s formula (8.22). They obey x0 + x1 + x2 = 0 and x0 x1 x2 = −q = ∓2a m, from which
      we deduce that for ± = + in Eq. (11.141), i.e. for ℓ̃2+ , there are two positive roots, which are x0
      and x2 as given by Eq. (8.22), while for ± = −, i.e. for ℓ̃2− , there is a single positive root, which
      is x0 as given by Eq. (8.22). Going back to r0 = x2 , with the roots x0 and x2 given by Viète’s
      formula (8.22), we get the following constraints for circular orbits in the equatorial plane:
                               for ℓ̃2+ :                   ∗
                                                  0 < r0 < rph               or            +
                                                                                     r0 > rph           (11.142a)
                               for ℓ̃2− :               −
                                                  r0 > rph ,                                            (11.142b)
      with                                                                           
                                                                     1        a
                                         ±
                                        rph   := 4m cos      2
                                                                       arccos ∓                          (11.143)
                                                                     3          m
      and                                                
                                                             1        a  4π 
                                   ∗
                                  rph   := 4m cos    2
                                                               arccos −   +     .                        (11.144)
                                                             3          m   3
      The index “ph” stands for photon, because we shall see in Sec. 11.5.5 and in Sec. 12.3 that these
      radii actually correspond to photon orbits (circular null geodesics). rph
                                                                             ±
                                                                                and rph
                                                                                     ∗
                                                                                         are plotted as
      functions of a in Fig. 11.11 (green curves). Note that m ≤ rph ≤ 3m and 3m ≤ rph ≤ 4m, with
                                                                  +                     −
      Besides, 0 ≤ rph
                    ∗
                       ≤ m, with
                                                 ∗
                                            lim rph = 0,                   ∗
                                                                      lim rph = m.                       (11.146)
                                            a→0                      a→m
      the first permitted range for circular orbits in (11.142a) lies entirely in the region MIII of Kerr
      spacetime, while the second range in (11.142a) lies in entirely in MI . Regarding the orbits for
      ℓ̃2− , the range (11.142b) lies in MI as well.
             Substituting expression (11.140) for ℓ̃2 in Eq. (11.136a), we get, after simplification,
                                                    2             √      2
                                                  r   − 2mr 0 ± a   mr 0
                                         ε2± = 2 0 2                √       .                       (11.149)
                                               r0 r0 − 3mr0 ± 2a mr0
                                11.5 Circular timelike orbits in the equatorial plane                                  423
4.0
3.5
3.0
                          2.5
                                                                                             rph+
                  r0 /m
                          2.0                                                                rph−
                                                                                             rph∗
                                                                                             rergo
                          1.5                                                                r+
                                                                                             r−
                          1.0
0.5
                          0.0
                                0.0     0.2          0.4         0.6    0.8         1.0
                                                           a/m
Figure 11.11: Domain of existence of circular equatorial timelike orbits in the (a, r0 ) plane: orbits of the
(ε+ , ℓ+ ) family exist in both the light and dark grey regions, while orbits of the (ε− , ℓ− ) family exist only in
the dark grey region. The orange line marks the location of the outer ergosphere in the equatorial plane, which
is rergo = rE + (π/2) = 2m [cf. Eq. (10.19b)], the black curve corresponds to the black hole event horizon H
(r = r+ ) and the maroon one to the inner horizon Hin (r = r− ). [Figure generated by the notebook D.5.9]
The ±’s in Eqs. (11.140) and (11.149) have to be consistent. In other words, we have two pairs
of solutions for (ε2 , ℓ̃2 ), namely (ε2+ , ℓ̃2+ ) and (ε2− , ℓ̃2− ). A priori,
                                                                            q ε+ leads to two values for
                                                                                2
ε, namely ± ε2+ , and ℓ̃2+ leads to two values for ℓ̃, namely ± ℓ̃2+ , so that the pair (ε2+ , ℓ̃2+ )
             p
would generate four solutions for (ε, ℓ̃), and similarly the pair (ε2− , ℓ̃2− ) would generate four
extra solutions for (ε, ℓ̃), leading to a total of eight solutions. However, by construction these
solutions satisfy the “squared” equation (11.137) but not all of them satisfy the original equation
(11.136b). To see this, let us consider the following square roots of Eqs. (11.149) and (11.140):
                                                            √
                                              r02 − 2mr0 ± a mr0
                                        ε± = p 2              √   ,                                       (11.150)
                                            r0 r0 − 3mr0 ± 2a mr0
                                     √                   √
                                       r0 (a ∓
                                       mr0 )        r0 (± mr0 − a)
                    ℓ̃± = − p 2            √    =p 2            √                                         (11.151)
                             r0 − 3mr0 ± 2a mr0   r0 − 3mr0 ± 2a mr0
and write the eight solutions of (11.137) as
                                     
              (ε, ℓ̃) = ϵ1 ε± , ϵ2 ℓ̃± ,   with ϵ1 = ±1                       and         ϵ2 = ±1.
      Hence Eq. (11.136b) is fulfilled iff 1 − ϵ1 ϵ2 = 0, i.e. iff ϵ1 ϵ2 = 1. This reduces the number of
      possible solutions from eight to four:
              (ε, ℓ̃) = (ε+ , ℓ̃+ ) or     (−ε+ , −ℓ̃+ ) or     (ε− , ℓ̃− )     or    (−ε− , −ℓ̃− ).   (11.152)
      A further reduction of the number of solutions is provided by the future-directed condition
      (11.66). Since R(r0 ) = −µ2 r04 V(r0 ) = 0 by virtue of Eq. (11.132), for circular equatorial orbits,
      Eq. (11.66) reduces to                                  
                                            1            a
                                                 ε− 2         ℓ > 0,
                                           ∆0        r 0 + a2
      where ∆0 := r02 − 2mr0 + a2 = (r0 − r+ )(r0 − r− ). Now, we have observed above that
      circular orbits lie either in MI or MIII , where ∆0 > 0. Therefore, we can further simplify the
      future-directed condition to
                                                       a
                                              ε− 2         ℓ > 0.
                                                   r0 + a2
      Once reexpressed in terms of ℓ̃ = ℓ − aε, it is equivalent to
                                                    r02 ε − aℓ̃ > 0.                                   (11.153)
      Let us check each of the four solutions (11.152):
                                                       r0 ∆0
                                r02 ε+ − aℓ̃+ = p 2            √    >0
                                                 r0 − 3mr0 + 2a mr0
                                r02 (−ε+ ) − a(−ℓ̃+ ) = −(r02 ε+ − aℓ̃+ ) < 0
                                                          r0 ∆0
                                r02 ε− − aℓ̃− = p 2                √        >0
                                                   r0 − 3mr0 − 2a mr0
                                r02 (−ε− ) − a(−ℓ̃− ) = −(r02 ε− − aℓ̃− ) < 0.
      We conclude that only two solutions remain:
                                          (ε, ℓ̃) = (ε+ , ℓ̃+ ) or     (ε− , ℓ̃− ).                    (11.154)
       Given that ℓ = ℓ̃ + aε [Eq. (11.135)] and expressions (11.150) and (11.151) for respectively ε±
      and ℓ̃± , these two solutions can be reexpressed in terms of (ε, ℓ) as
                                         (ε, ℓ) = (ε+ , ℓ+ )   or      (ε− , ℓ− ) ,                    (11.155)
      where
                                                                  √
                                                   m r02 + a2 ∓ 2a mr0
                                               r
                                    ℓ± = ±           p               √    .                            (11.156)
                                                   r0 r02 − 3mr0 ± 2a mr0
      The quantities ε± and ℓ± are plotted in terms of r0 in Figs. 11.12 and 11.13. Orbits with ℓ > 0
      (resp. ℓ < 0) are called prograde (resp. retrograde). We have immediately ℓ− < 0, while
                                                √
      the sign of ℓ+ is that of of r02 + a2 − 2a mr0 . Outside the event horizon, i.e. in MI , we have
                                             √
      r0 > m, which implies r02 + a2 − 2a mr0 > r02 + a2 − 2ar0 = (r0 − a)2 > 0. Hence
                                                   +
                                             r0 > rph     =⇒         ℓ+ > 0.                           (11.157)
      But for r0 < rph∗
                        , i.e. in region MIII , we may have ℓ+ < 0 (cf. Fig. 11.13). In view of this and
      the result (11.142), we introduce the following nomenclature:
                                  11.5 Circular timelike orbits in the equatorial plane                                           425
                                                                           1.02                                 a = 0.0 m
          2                                                                                                     a = 0.5 m
                                                                                                                a = 0.9 m
                                                                                                                a = 0.98 m
                                                                           1.00                                 a = 1.0 m
          1
                                                                           0.98
                                                                       ε±
 ε±
0 0.96
          1
                                                      a = 0.0 m
                                                      a = 0.5 m            0.94
                                                      a = 0.9 m
                                                      a = 0.98 m           0.92
                                                      a = 1.0 m
          2
                  0       1   2   3     4     5   6     7          8                    4   5   6     7     8   9            10
                                      r0 /m                                                         r0 /m
Figure 11.12: Specific conserved energy ε = ε+ (solid curves) or ε = ε− (dashed curves) along circular timelike
orbits in the equatorial plane, as a function of the orbital radius r0 [Eq. (11.150)], for selected values of the Kerr
spin parameter a. Curves in the inner region (r0 < m) terminate by a vertical asymptote at r = rph     ∗
                                                                                                         (a) given by
                                                                                ±
Eq. (11.144), while the other curves start along a vertical asymptote at r = rph (a) given by Eq. (11.143). Dots on
ε+ curves and open circles on ε− ones mark the ISCO: all configurations on the left of these points are unstable.
The right panel is a zoom on the region 4 m ≤ r0 ≤ 10 m. Note that the red curve (a = 0) is the same as in
Fig. 7.8. [Figure generated by the notebook D.5.9]
          10                                          a = 0.0 m
                                                                                4
                                                      a = 0.5 m                 3
                                                      a = 0.9 m
                                                      a = 0.98 m                                                a = 0.0 m
              5                                       a = 1.0 m                 2                               a = 0.5 m
                                                                                                                a = 0.9 m
                                                                                1                               a = 0.98 m
                                                                                                                a = 1.0 m
                                                                       ` ± /m
 ` ± /m
              0                                                                 0
                                                                                1
                                                                                2
              5
                                                                                3
                                                                                4
          10
                      0   1   2   3     4     5   6      7         8                4       5   6     7     8   9            10
                                      r0 /m                                                         r0 /m
Figure 11.13: Specific conserved angular momentum ℓ = ℓ+ (solid curves) or ℓ = ℓ− (dashed curves) along
circular timelike orbits in the equatorial plane, as a function of the orbital radius r0 [Eq. (11.156)], for selected
values of the Kerr spin parameter a. Curves in the inner region (r0 < m) terminate by a vertical asymptote at
                                                                                                     ±
      ∗
r = rph  (a) given by Eq. (11.144), while the other curves start along a vertical asymptote at r = rph  (a) given by
Eq. (11.143). Dots on ℓ+ curves and open circles on ℓ− ones mark the ISCO: all configurations on the left of these
points are unstable. The right panel is a zoom on the region 4 m ≤ r0 ≤ 10 m. [Figure generated by the notebook
D.5.9]
426                  Geodesics in Kerr spacetime: generic and timelike cases
1.2
1.0
0.8
0.6
                             0.4
                       ε
0.2
                             0.0
                                     a = 0.0 m
                             0.2     a = 0.5 m
                                     a = 0.9 m
                                     a = 0.98 m
                                     a = 1.0 m
                             0.4
                                           4           2            0             2            4
                                                                  `/m
      Figure 11.14: Circular timelike orbits in the (ℓ, ε) plane, ℓ being the specific conserved angular momentum
      and ε the specific conserved energy. Solid (resp. dotted) curves correspond to stable (unstable) orbits. Retrograde
      outer circular orbits are on the left side, inner circular orbits in the middle and prograde outer circular orbits on
      the right side. The cusps in the left and right curves correspond to ISCOs. [Figure generated by the notebook D.5.9]
• all orbits of the (ε− , ℓ− ) family are called retrograde outer circular orbits.
          We note from Figs. 11.12 and 11.13, or from formulas (11.150) and (11.156), that both ε and
      ℓ are diverging at the boundaries of the various domains of existence of circular orbits. We
      shall comment further on this behavior at the end of Sec. 11.5.5.
          Figure 11.14 shows ε in terms of ℓ for the three families of circular orbits, with the indication
      of the stability of the various branches, as determined in the next section.
        All equatorial circular timelike orbits are stable with respect to any perturbation away
        from the equatorial plane.
                         11.5 Circular timelike orbits in the equatorial plane                                       427
                                           r 0 = 2m
                              0.02         r 0 = 3m
                              0.01
                              0.00
                       V(r)
                              0.01
                              0.02
                              0.03
                              0.04
                                     1.0      1.5     2.0   2.5   3.0    3.5   4.0   4.5   5.0
                                                                  r/m
Figure 11.15: Function V(r), as defined by Eq. (11.130), with a = 0.9 m, ε = ε+ (r0 ) [Eq. (11.150)], ℓ = ℓ+ (r0 )
[Eq. (11.156)] for r0 = 2 m (red curve) and r0 = 3 m (blue curve). By definition of ε+ and ℓ+ , one has both
V(r0 ) = 0 and V ′ (r0 ) = 0. [Figure generated by the notebook D.5.9]
Proof. All equatorial orbits have a vanishing Carter constant Q [cf. Eq. (11.128)] and we have
seen in Sec. 11.3.7 that a Q = 0 geodesic stays stably at θ = π/2 iff a2 (E 2 − µ2 ) ≤ L2 , i.e. iff
                                                      a2 (ε2 − 1) ≤ ℓ2                                  (11.158)
for a timelike geodesic. Now, for a circular orbit of radius r0 , Eq. (11.134b) implies
                            ℓ2 − a2 (ε2 − 1) = mr0 + 3m(ℓ − aε)2 /r0 > 0,
where the inequality follows from r0 > 0 [Eq. (11.138)]. Hence the stability condition (11.158)
is always fulfilled.
Radial stability
Let us now investigate the stability with respect to a radial perturbation. A circular orbit at
r = r0 obeys both V(r0 ) = 0 and V ′ (r0 ) = 0 [Eqs. (11.132) and (11.133)]. The latter property
means that it corresponds to a local extremum of the function V. This extremum can be either
a local maximum (red curve in Fig. 11.15) or a local minimum (blue curve in Fig. 11.15). Now,
in view of Eq. (11.129b) rewritten as V(r) = −(dr/dτ )2 , any motion in the equatorial plane
must obey V(r) ≤ 0. If the circular orbit corresponds to a local minimum, then for r close to
r0 , but distinct from it, V(r) > 0, since V(r0 ) = 0 (cf. the blue curve in Fig. 11.15). This means
that no geodesic motion with the same values of the conserved quantities ε and ℓ is possible
in the vicinity of r0 except for precisely r = r0 . We conclude that the circular orbit at r0 is
stable in that case. On the contrary, when r0 is a local maximum of V, one has V(r) < 0 for r
close to r0 , but distinct from it (cf. the red curve in Fig. 11.15). Motion away from r0 is then
possible for the same values of ε and ℓ; we conclude that the circular orbit is unstable in that
case. Assuming that V ′′ (r0 ) ̸= 0, a local minimum (resp. maximum) is equivalent to V ′′ (r0 ) > 0
(resp. V ′′ (r0 ) < 0). We have then
428                Geodesics in Kerr spacetime: generic and timelike cases
From the definition (11.130) of V(r) and the relation ℓ = ℓ̃ + aε [Eq. (11.135)], we get
      Substituting Eq. (11.136b) for 2aεℓ̃, we get a simple expression, involving only ℓ̃:
                                                                   !
                                                                 2
                                                    2m         ℓ̃
                                        V ′′ (r0 ) = 3 1 − 3 2 .
                                                     r0        r0
      where ± is + (resp. −) for prograde (resp. retrograde) orbits and we have introduced the
      dimensionless variables           r
                                          r0                   a
                                   x :=          and     ā := .                       (11.161)
                                          m                    m
      The problem amounts to finding the range of x where the quartic polynomial P (x) is positive.
      This requires computing the roots of P . We shall do it via Ferrari’s method. The first step is to
      introduce a parameter Z1 and rewrite P (x) as
P (x) = (x2 − Z1 )2 − T (x), with T (x) := 2(3 − Z1 )x2 ∓ 8āx + Z12 + 3ā2 . (11.162)
      The above expression is an identity, which holds for any value of Z1 ; the core of Ferrari’s
      method it to find Z1 so that the quadratic
                                             p polynomial T (x) has a double root, x0 say. We will
      have then T (x) = S(x) , with S(x) := 2(3 − Z1 )(x−x0 ), and P (x) = (x −Z1 )2 −S(x)2 =
                             2                                                 2
      In other words, the four (possibly complex) solutions of the quartic equation P (x) = 0 are
      the two solutions of the quadratic equation x2 − Z1 − S(x) = 0 plus the two solutions of
                       11.5 Circular timelike orbits in the equatorial plane                                    429
3.0
2.5
2.0
                          Z1
                               1.5
                               1.0
                                     0.0        0.2       0.4         0.6   0.8   1.0
                                                                a/m
Figure 11.16: Function Z1 , as defined by Eq. (11.165). [Figure generated by the notebook D.5.9]
x2 − Z1 + S(x) = 0. A necessary and sufficient condition for T (x) to have a double root is
that its discriminant vanishes, which is equivalent to
                                           Z13 − 3Z12 + 3ā2 Z1 − ā2 = 0.                           (11.164)
We have thus to solve a cubic equation in Z1 . Let us reduce it to a depressed cubic equation (i.e.
an equation free of any square term) via the change of variable Z1 =: Z + 1:
                                     Z 3 + 3(ā2 − 1)Z + 2(ā2 − 1) = 0.
The discriminant of this cubic equation is ∆ = −(4p3 + 27q 2 ), where p := 3(ā2 − 1) and
q := 2(ā2 − 1). We get ∆ = −108ā2 (1 − ā2 )2 . Hence ∆ < 0: there exist only one real solution.
It is given by Cardano’s formula:
          v                  ! v                        !
                      r                           r
                                                            √         √
          u                       u
          u1
           3             −∆       3 1
                                  u                 −∆      3           3
                                                                                  √3
                                                                                           
     Z=   t      −q +          +  t     −q −              = 1 − ā  2     1 + ā + 1 − ā .
             2            27        2               27
Hence                                          √              √         √      
                                                                                                     (11.165)
                                               3               3         3
                             Z1 = 1 +              1−   ā2      1 + ā + 1 − ā .
Z1 is plotted in terms of ā = a/m in Fig. 11.16. In particular, we notice that 1 ≤ Z1 ≤ 3. When
Z1 takes the value (11.165), the double root of T (x) is x0 = ±2a/(3 − Z1 ), where the ± is the
opposite of the ∓ in the definition (11.162) of T (x), and therefore indicates which family of
circular orbits, among (ε+ , ℓ+ ) or (ε− , ℓ− ), is considered. We have then
                                                                √
                                        p                     2 2ā
                               S(x) = 2(3 − Z1 ) x ∓ √               .
                                                              3 − Z1
                            √
When ā → 0, the ratio ā/ 3 − Z1 is of the undetermined type “0/0”. We can rearrange it by
noticing the identity 8ā2 /(3 − Z1 ) = Z12 + 3ā2 , which is a direct consequence of Eq. (11.164).
We then write
                                                                    q
                    S(x) = 2(3 − Z1 ) x ∓ Z2 , with Z2 := Z12 + 3ā2 .
                             p
430                 Geodesics in Kerr spacetime: generic and timelike cases
The solutions of P (x) = 0 are obtained by solving the two quadratic equations [cf. Eq. (11.163)]
                                                                                                (11.166a)
                                           p
                                    x2 −  2(3 − Z1 ) x − Z1 ± Z2 = 0
                                                                                                (11.166b)
                                        p
                                    x2 + 2(3 − Z1 ) x − Z1 ∓ Z2 = 0.
      Moreover, physically acceptable solutions must obey x > 0 (recall that x := r0 /m). The
                                                                                     p
      where ± indicates which family among (ε+ , ℓ+ ) and (ε− , ℓ− ) is considered and ISCO stands
      for innermost stable circular orbit. Indeed, rISCO
                                                     ±
                                                          being the unique zero of r02 − 6mr0 ±
        √
      8a mr0 − 3a , (11.160) is equivalent to
                   2
      and conclude that all the inner circular orbits are unstable. We also see on Fig. 11.17 that in
      the limit a = 0, rISCO
                        +        −
                             = rISCO  = 6m, i.e. we recover the Schwarzschild ISCO discussed in
      Sec. 7.3.3.
                                11.5 Circular timelike orbits in the equatorial plane                                 431
                      8
                                                                                                rph+
                                                                                                rph−
                                                                                                rph∗
                      6                                                                         rergo
                                                                                                r+
                                                                                                r −+
              r0 /m
                                                                                                rISCO
                                                                                                rISCO
                                                                                                   −
                      4                                                                         rmb+
                                                                                                rmb−
                      0
                          0.0        0.2       0.4         0.6          0.8          1.0
                                                     a/m
Figure 11.17: Various critical radii for circular orbits as functions of the Kerr spin parameter a. The red curves
                                                     ±
correspond to the innermost stable circular orbit rISCO    [Eq. (11.167)], the light brown ones to the marginally
                      ±
bound circular orbit rmb [Eq. (11.186)], while the other curves are the same as in Fig. 11.11. [Figure generated by
the notebook D.5.9]
  The function  ±
               rISCO (ā) is easily inverted by solving the quadratic equation (rISCO
                                                                                 ±
                                                                                      )2 −
6mrISCO ± 8a mrISCO − 3a2 = 0 for a [cf. Eq. (11.160)]; one gets
   ±
            p    ±
                                             r                r            !
                                                  ±                 ±
                                         m       rISCO             rISCO
                                     a=±                 4−      3       −2 ,                            (11.170)
                                         3         m                 m
  22                                                                    −
       Note that for ± = −, formula (11.170) still yields a ≥ 0, since rISCO ≥ 6m.
432                Geodesics in Kerr spacetime: generic and timelike cases
        Among all prograde outer circular orbits, the ISCO is that for which the functions ε+ (r0 )
        and ℓ+ (r0 ) are minimal. Similarly, among all retrograde outer circular orbits, the ISCO is
        that for which the function ε− (r0 ) is minimal and the function ℓ− (r0 ) is maximal.
      Proof. In what precedes, we have considered V, defined by Eq. (11.130), as a function of r only.
      Let us consider it instead as a function of (r, ε, ℓ): V = V(r, ε, ℓ). What we have denoted by
      V ′ (r) is then ∂V/∂r and Eqs. (11.132) and (11.133) can be rewritten as
      where |0 means that the quantity is evaluated at (r, ε, ℓ) = (r0 , ε± (r0 ), ℓ± (r0 )) and the
      vanishing of the first term results from Eq. (11.171b). Similarly, deriving Eq. (11.171b) with
      respect to r0 yields
                               ∂ 2V           ∂ 2V                     ∂ 2V
                                          +              ε′± (r0 ) +                 ℓ′± (r0 ) = 0.
                               ∂r2    0       ∂ϵ∂r   0                 ∂ℓ∂r      0
      Now, at the ISCO, one has precisely ∂ 2 V/∂r2 |0 = 0 (V ′′ (r0 ) = 0 in the preceding notations).
      Hence, at the ISCO, the following two equations must hold:
                                  
                                   ∂V ε′ (r0 ) + ∂V ℓ′ (r0 ) = 0
                                     ∂ϵ 0 ±          ∂ℓ 0 ±
                                                                                              (11.172)
                                      2
                                   ∂ V ε′± (r0 ) + ∂ 2 V ℓ′± (r0 ) = 0
                                          ∂ϵ∂r                   ∂ℓ∂r
                                                 0                      0
      This constitutes a linear homogeneous system for the two unknowns ε′± (r0 ), ℓ′± (r0 ) . Since
                                                                                                
      there are no obvious     reason for its determinant to vanish, we deduce that the only solution is
       ε′± (r0 ), ℓ′± (r0 ) = (0, 0). Hence the ISCO realizes an extremum of both ε± (r0 ) and ℓ± (r0 ). A
                           
      quick look at Figs. 11.12 and 11.13, especially their bottom panels, enables us to conclude that
      the extremum is a minimum for ε± (r0 ) and ℓ+ (r0 ) and a maximum for ℓ− (r0 ).
          Property 11.32 explains why the ISCO is located at the cusp in the (ℓ, ε) curves shown in
      Fig. 11.14: this corresponds to an extremum of both ε(r0 ) and ℓ(r0 ). Property 11.32 has also a
      particular astrophysical significance, to be discussed in Sec. 11.5.4.
      Summary
                        11.5 Circular timelike orbits in the equatorial plane                                    433
  Equatorial circular timelike orbits exist only in the region r > 0 of Kerr spacetime. They
  are all stable with respect to perturbations away from the equatorial plane. They are of
  three kinds:
      1. The prograde outer circular orbits: they are located outside the black hole event
         horizon (i.e. in region MI ), having a Boyer-Lindquist radius r0 ranging from rph  +
                                                                                              =
         4m cos2 [arccos(−a/m)/3] [Eq. (11.143)] to +∞; the minimal radius rph      +
                                                                                       decreases
         with a monotonically from 3m (a = 0) to m (a = m). These orbits are radially
         unstable for rph+           +
                            < r0 ≤ rISCO  and stable for r0 > rISCO
                                                                  +
                                                                      , where rISCO
                                                                                +
                                                                                     is given by
         Eq. (11.167), decreasing monotonically from 6m (a = 0) to m (a = m). Their specific
         conserved energy and angular momentum are ε = ε+ (r0 ) > 0 and ℓ = ℓ+ (r0 ) > 0,
         with ε+ (r0 ) given by Eq. (11.150) and ℓ+ (r0 ) by Eq. (11.156), both being plotted as
         the solid curves in the region r0 ≥ m of Figs. 11.12 and 11.13. The functions ε+ (r0 )
         and ℓ+ (r0 ) are minimal at the ISCO.
      2. The retrograde outer circular orbits: they are located outside the black hole event
         horizon (i.e. in region MI ), having a Boyer-Lindquist radius r0 ranging from rph   −
                                                                                                =
         4m cos [arccos(a/m)/3] [Eq. (11.143)] to +∞; the minimal radius rph increases with
                 2                                                               −
      3. The inner circular orbits: they are located inside the inner horizon (in the part
         r > 0 of region MIII ), having a Boyer-Lindquist radius r0 ranging from 0 (the ring
         singularity) to rph
                          ∗
                             = 4m cos2 [arccos(−a/m)/3 + 4π/3] [Eq. (11.144)]; the maximal
         radius rph increases with a monotonically from 0 (a = 0) to m (a = m). These orbits
                  ∗
         are all radially unstable. Their specific conserved energy and angular momentum
         are ε = ε+ (r0 ) and ℓ = ℓ+ (r0 ), with ε+ (r0 ) given by Eq. (11.150) and ℓ+ (r0 ) by
         Eq. (11.156), both being plotted as the solid curves in the region r0 ≤ m of Figs. 11.12
         and 11.13.
In particular, there are no equatorial circular orbits in region MII of Kerr spacetime.
Historical note : The solutions for timelike circular orbits in the equatorial plane (outer cases only, i.e.
excluding the case 3 above) have been first published by James M. Bardeen, William H. Press and Saul A.
Teukolsky in 1972 [42]. In particular, they have obtained Eq. (11.150) for ε± (r0 ), Eq. (11.156) for ℓ± (r0 )
                      ±
and Eq. (11.167) for rISCO . In his lecture notes at the 1972 Les Houches School [38], Bardeen says that
these solutions have been derived by Teukolsky. In the article [42], it is mentioned that they have been
obtained by means of computer algebra techniques.
434                          Geodesics in Kerr spacetime: generic and timelike cases
                1.0
                                                                                       3
                0.9                                                                    2
                                                                                       1
                0.8
                                                                                  /m
                                                                                       0
        εISCO
                                                                                                                                     +
                                                                                                                                   `ISCO
                                                                            `ISCO
                                                                                                                                   `ISCO
                                                                                                                                     −
          ±
0.7 1
                                                                              ±
                                                                                       2
                0.6                                                                    3
                               +
                             εISCO
                             εISCO
                               −
                0.5                                                                    4
                      0.0       0.2   0.4         0.6         0.8    1.0                   0.0   0.2   0.4         0.6       0.8       1.0
                                            a/m                                                              a/m
      Figure 11.18: Specific conserved energy ε (left) and specific conserved angular momentum ℓ (right) at the
      prograde ISCO (solid curve) and retrograde ISCO (dashed curve), as functions of the Kerr spin parameter a. [Figure
      generated by the notebook D.5.9]
Property 11.34: specific conserved energy and angular momentum at the ISCO
        The values ε+
                    ISCO and ℓISCO (resp. εISCO and ℓISCO ) of the specific conserved energy and
                              +            −         −
        angular momentum of a particle at the prograde ISCO (resp. retrograde ISCO) are
                             s                                                                    r                      !
                                    2m                                         2m                   r±
           ε±
            ISCO =               1− ±                   and         ℓ±
                                                                     ISCO   =± √                 2 3 ISCO − 2 + 1            . (11.173)
                                   3rISCO                                     3 3                     m
                                                                          √     √
      Proof. Let us set u := rISCO
                              ±
                                   /m, so that Eq. (11.170) becomes a = ±m u(4 − 3u − 2)/3.
                      11.5 Circular timelike orbits in the equatorial plane                           435
Property 11.35: energy release from an accretion disk around a black hole
  Accretion onto a black hole through an accretion disk releases a significant fraction of the
  rest mass energy of matter: from 6% for a non-rotating black hole (Schwarzschild) up to
  42% for a maximally rotating black hole (extremal Kerr):
    It must be stressed that the above numbers are much larger than the energy release achieved
through thermonuclear reactions: the most efficient of such reactions is hydrogen fusion —
the main energy source for stars — and it releases only 0.8% of the rest mass! This explains
436                  Geodesics in Kerr spacetime: generic and timelike cases
      why accretion onto a black hole is invoked to account for the most energetic sources in the
      Universe, such as quasars and active galactic nuclei (AGNs) (cf. e.g. [135]).
      Remark 5: The energy-release mechanism described above is distinct from the Penrose process discussed
      in Sec. 11.3.2. In the former, the energy is extracted from the thermal energy of the disk, while in the
      latter, it is extracted from the “rotational energy” of the black hole, as we shall discuss in Sec. 16.3.1, and
      this requires the existence of an ergoregion, contrary to the disk accretion process.
      Historical note : In 1969, Donald Lynden-Bell [351] suggested that the large luminosity of quasars is
      due to an accretion disk surrounding a massive black hole and terminating at the ISCO. He focused on
      Schwarzschild black holes, for which the radiated energy per unit mass is Erad /µ = 0.057 [Eq. (11.176a)].
      Soon after, in 1970, James M. Bardeen [36] considered the rotating case, i.e. Kerr black holes. He obtained
      expression (11.170) for a, as well as formulas (11.173) for ε±              ±
                                                                     ISCO and ℓISCO , and he pointed out that
      Erad /µ can reach much higher values than in the non-rotating case, up to 0.423 for a = m [Eq. (11.176b)]
      (the number 0.432 given in Ref. [36] is a typo and should be 0.423). In 1974, Kip Thorne [478] declared
      that Bardeen’s article [36] is “a landmark paper — a paper that persuaded astrophysicists to stop using the
      Schwarzschild metric to describe black holes and start using the Kerr metric”.
                                                  u = ut (ξ + Ω± η) .                                      (11.178)
                          11.5 Circular timelike orbits in the equatorial plane                                           437
     2.0                                                               0.15
                                                a = 0.0 m                                                 a = 0.0 m
                                                a = 0.5 m                                                 a = 0.5 m
                                                a = 0.9 m
                                                a = 0.98 m             0.10                               a = 0.9 m
                                                                                                          a = 0.98 m
     1.5                                        a = 1.0 m                                                 a = 1.0 m
                                                                       0.05
     1.0
                                                                 mΩ±
 mΩ±
0.00
0.5 0.05
                                                                       0.10
     0.0
                                                                       0.15
           0   1     2    3     4     5     6     7          8                 4     5   6     7     8     9       10
                              r0 /m                                                          r0 /m
Figure 11.19: Angular velocity Ω := dφ/dt = Ω+ (solid curves) or Ω = Ω− (dashed curves) along circular
timelike orbits in the equatorial plane, as a function of the orbital radius r0 [Eq. (11.177)], for selected values of
the Kerr spin parameter a. Dots on Ω+ curves and open circles on Ω− ones mark the ISCO: all configurations on
the left of these points are unstable. The right panel is a zoom on the region 4 m ≤ r0 ≤ 10 m. [Figure generated
by the notebook D.5.9]
The component ut = dt/dτ is given by Eq. (11.129a). Substituting in it Eq. (11.150) for ε and
Eq. (11.151) for ℓ̃ := ℓ − aε, we get, after simplification:
                                                     p
                                              r0 ± a   m/r0
                                 ut = p 2                  √ .                        (11.179)
                                          r0 − 3mr0 ± 2a mr0
    Equation (11.178) shows that the 4-velocity is a linear combination of the two Killing vectors
ξ and η with coefficients that are constant along the worldline L . An observer on a circular
orbit in the equatorial plane is thus a stationary observer as defined in Sec. 10.7.1 [compare
Eq. (10.86)]. He does not notice any change in the spacetime geometry. We shall call him a
circular geodesic observer. Contrary to the families of stationary observers considered in
Sec. 10.7, a circular geodesic observer is in free fall: by construction, his worldline is a geodesic,
so that his 4-acceleration is zero.
    The orbital velocity measured by the circular geodesic observer himself is ΩP = dφ/dτ =
dφ/dt × dt/dτ = ut Ω. Using the values (11.177) and (11.179), we get
                                                        √                     −3/2
                                             m ± amr0
                         Ω±
                          P    =± √      √      p               √    .                                         (11.180)
                                 ( r0 ± a m/r0 ) r02 − 3mr0 ± 2a mr0
                                  1.0
                                                                                           a = 0.0 m
                                                                                           a = 0.5 m
                                                                                           a = 0.9 m
                                                                                           a = 0.98 m
                                                                                           a = 1.0 m
                                  0.5
                         V (±ϕ)
                                  0.0
0.5
                                  1.0
                                        0      1          2   3      4     5       6       7            8
                                                                   r0 /m
      Figure 11.20: Component V+(φ) (solid curves) or V−(φ) (dashed curves) of the velocity V = V±(φ) e(φ) of a
      particle moving along a circular orbit as measured by the ZAMO [Eq. (11.183)] for selected values of the Kerr spin
                              (φ)                               (φ)
      parameter a. Dots on V+ curves and open circles on V− ones mark the ISCO: all configurations on the left of
      these points are unstable. [Figure generated by the notebook D.5.9]
      where the second equality results from β = β φ ∂φ = β φ η = −ωη, ω being the rotation angular
      velocity of the ZAMO seen from infinity [Eq. (10.99)]. Since n · η = 0, the above formula
      constitute the orthogonal decomposition of u with respect to the ZAMO’s 4-velocity n, which
      we can compare to the generic formula (1.36) and thereby conclude that the velocity of P with
      respect to the ZAMO is
                                                 1
                                           V = (Ω± − ω) η                                    (11.181)
                                                N
      and the Lorentz factor of P with respect to the ZAMO is
Γ = (1 − V · V )−1/2 = N ut . (11.182)
      Let us expand V on the ZAMO’s orthonormal fame (e(α) ) [Eq. (10.95)]. Given relation (10.95d)
      between η = ∂φ and e(φ) we get, after substituting Eq. (10.93) (with θ = π/2) for N :
      Finally, let us substitute Eq. (11.177) for Ω± and Eq. (10.99) (with θ = π/2) for ω. We obtain,
                         11.5 Circular timelike orbits in the equatorial plane                          439
after simplification:
                                                           √          √         
                  (φ)                         (φ)           m r02 ∓ 2a mr0 + a2
         V =     V±     e(φ)    with         V±     = ± 3/2   √ p                 .         (11.183)
                                                       (r0 ± a m) r02 − 2mr0 + a2
As in all formulas of this section, the sign ± is + for a prograde outer circular orbit or an
                                                                                (φ)
inner circular orbit and − for a retrograde outer circular orbit. The velocity V± is depicted in
Fig. 11.20. We note that, except for a = m,
               (φ)                     (φ)                       (φ)             (φ)
      lim V+ = 1,              lim V+ = −1,
                                ∗
                                                        lim V+ = 1,     lim V− = −1.        (11.184)
      r0 →0                r0 →rph                          +
                                                       r0 →rph              −
                                                                       r0 →rph
   • the outer orbits rotate at the speed of light with respect to the ZAMO near the minimal
     radius of their domain of existence (r0 → rph ±
                                                     ).
Given that the velocity of a massive particle with respect to any observer cannot be larger
than the speed of light, this provides a physical explanation for the boundaries of the domains
of existence of circular orbits obtained in Sec. 11.5.2. We shall see in Sec. 12.3 that these
boundaries correspond to photon circular orbits. This also provides a physical explanation why
the specific conserved energy ε and the specific conserved angular momentum ℓ diverge at
these boundaries, as observed in Figs. 11.12 and 11.13: ε := E/µ and ℓ := L/µ must tend to
±∞ when the particle’s mass µ tends to 0 (the photon limit).
    The particular case a = m will be discussed in Sec. ??.
are bound. Similarly, but this time of all values of a ≤ m, the retrograde outer circular orbits
are bound iff r0 is larger than a critical value, rmb
                                                   −
                                                       say. The orbit at r0 = rmb ±
                                                                                      is called the
marginally bound circular orbit.
    We note from Fig. 11.12 that
                                        ±
                                       rph     ±
                                           < rmb      ±
                                                  < rISCO .                                (11.185)
Accordingly, all unbound orbits are unstable.
    To evaluate rmb±
                      , let us solve the equation ε± (r0 ) = 1 for r0 . Using expression (11.150) for
ε± (r0 ), taking the square and simplifying, we get
                                                   √
                                   r02 − 4mr0 ± 4a mr0 − a2 = 0,
440                   Geodesics in Kerr spacetime: generic and timelike cases
                                                                                                       (11.186)
                                          ±
                                                                p
                                         rmb = 2m ∓ a + 2        m(m ∓ a) .
       ±
      rmb is plotted as a function of a in Fig. 11.17.
      Similarly, the limiting value of a for the stability of circular orbits in the ergoregion
                                                                                        √       is obtained
      by setting r0 = 2m in Eq. (11.160) with the + sign: (2m)2 − 12m2 + 8am 2 − 3a2 > 0. The
      right-hand side being a quadratic polynomial in a, we get easily, given the constraint a ≤ m,
      Both in (11.187) and (11.188), the orbits referred to belong to the prograde outer family.
      Remark 7: Regarding the limit (11.187), the existence is specified to be in the outer ergoregion, because
      timelike circular orbits always exist, as soon as a > 0, in the inner ergoregion: they are the (unstable)
      inner circular orbits found in Sec. 11.5.2. Indeed, these orbits exist in the range 0 < r0 < rph
                                                                                                    ∗ , which is
      entirely contained in the inner ergoregion: the boundaries of the latter in the equatorial plane are r = 0
                                             11.6 Going further                                                441
[Eq. (10.19a)] and r = r− , with r− > rph ∗ . On the contrary, in the limit (11.188), we have dropped the
qualifier outer for the ergoregion, since there is no other place where stable circular orbits can be found,
the inner circular orbits being all unstable.
Remark 8: As discussed in Sec. 11.3.1, negative-energy or zero-energy particles can exist in the outer
ergoregion. However, Fig. 11.12 shows that none of them can follow a circular orbit.
Contents
        12.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 443
        12.2 Main properties of null geodesics . . . . . . . . . . . . . . . . . . . . . 443
        12.3 Spherical photon orbits . . . . . . . . . . . . . . . . . . . . . . . . . . . 460
        12.4 Black hole shadow and critical curve . . . . . . . . . . . . . . . . . . . 479
        12.5 Images . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 495
12.1      Introduction
Having investigated the properties of generic causal geodesics in Kerr spacetime in Chap. 11,
we focus here on null geodesics, with application to images of a Kerr black hole. First we
discuss the main properties of null geodesics in Sec. 12.2, in great part by taking the µ = 0
limit of results obtained for generic causal geodesics in Chap. 11. Then, in Sec. 12.3, we focus
on null geodesics evolving at a fixed value of the coordinate r — the so-called spherical photon
orbits. These geodesics play a crucial role in the formation of the images perceived by an
observer. In particular, they are related to the key concepts of critical curve and shadow in the
observer’s screen, which are investigated in Sec. 12.4. Finally, we discuss the images themselves
in Sec. 12.5, first by considering computed images from a simplified model of accretion disk
and then by analyzing the actual image of the surroundings of the black hole M87*, as obtained
recently by the Event Horizon Telescope [6].
      momentum L and the Carter constant Q by E, so that only two constants of motion become
      pertinent for the study: L/E and Q/E 2 . We thus treat first the particular case E = 0.
                                                       Q≥0          .                         (12.1)
                                                              E=0
      We shall see below that actually L ≤ 0 for all zero-energy null geodesics, as soon as a ̸= 0.
          The equations of geodesic motion expressed in terms of the Mino parameter λ′ [system
      (11.50)] simplify considerably for a geodesic L with µ = 0 and E = 0:
                                  dt        2amLr
                                        =−                                                   (12.3a)
                                  dλ′         ∆
                                  dr
                                                                                             (12.3b)
                                            p
                                        = ϵr R(r)
                                  dλ′
                                  dθ
                                                                                             (12.3c)
                                            p
                                        = ϵθ Θ(θ)
                                  dλ′
                                  dφ            L       2           2    2
                                                                                             (12.3d)
                                                                             
                                        =             r   − 2mr + a   cos  θ   ,
                                  dλ′        ∆ sin2 θ
      with [cf. Eqs. (11.95) and (11.35)]:
                                                                 L2
                                                 Θ(θ) = Q −           .                       (12.5)
                                                               tan2 θ
      By combining (12.3a) and (12.3d), we get
                                     dφ              2mr − r2 − a2 cos2 θ
                                                 =                                 .          (12.6)
                                     dt      L          2amr sin2 θ
                                                                             E=0
      It is remarkable that this expression does not depend on L or Q; it is therefore the same for
      all zero-energy null geodesics. Moreover, we note that the numerator of the right-hand side
      is always positive or zero in the closure G of the ergoregion, which is precisely defined by
                             12.2 Main properties of null geodesics                                445
2mr − r2 − a2 cos2 θ ≥ 0 (cf. Sec. 10.2.4) and where L is necessarily confined. Since moreover
r > 0 in G , we conclude that
                                            dφ
                                                 ≥ 0.                                    (12.7)
                                            dt L
   To proceed, we shall distinguish the subcases Q ̸= 0 and Q = 0.
Case Q ̸= 0
This case actually corresponds to Q > 0, since Q < 0 is forbidden by (12.1). We set
                                                   L
                                            L̄ := √                                       (12.8)
                                                    Q
Since 1 + L̄2 ̸= 0, this is a second-order polynomial in r, the two roots of which are
                      r                                          r
                                  a 2                                       a2
        rmin = m − m2 −                     and    rmax  =  m  +   m 2−           .      (12.10)
                                1 + L̄2                                   1 + L̄2
Since m2 ≥ a2 , the two roots are real. They are distinct except for a = m and L = 0. The
range of radial motion being determined by R(r) ≥ 0 [Eq. (11.30)], we get
with rmin = 0 for a = 0 or L̄2 → +∞, rmin = r− for L = 0, rmax = 2m for a = 0 or L̄2 → +∞
and rmax = r+ for L = 0. If L ̸= 0 and a ̸= 0, then rmax > r+ , so that L has a part in the
outer ergoregion and (12.2) implies that L < 0. Hence
a ̸= 0 =⇒ L ≤ 0. (12.13)
    Let us consider a zero-energy null geodesic L emitted outward (i.e. with ϵr = +1) from
a point A in the outer ergoregion G + . The coordinate r increases along L from rA to rmax ,
which corresponds to a r-turning point. Then r decreases to r+ , which means that L crosses
the black hole event horizon H and enters the region MII . In all MII , r keeps decreasing
and reaches r− . There L crosses the inner horizon Hin and enters the region MIII , where
r continues to decrease until it reaches rmin . The latter corresponding to a r-turning point,
r starts to increase and reaches r− again. There one might think that L crosses the inner
horizon Hin and enters into MII . But this is impossible since Hin is a 1-way membrane: it can
be crossed by a causal curve from MII to MIII but not in the reverse way. Moreover, r could
446                            Null geodesics and images in Kerr spacetime
      not continue to increase into MII since r must be decreasing towards the future in all this
      region (this follows from the hypersurfaces r = const being spacelike in MII , cf. Sec. 10.5.1).
      The solution to this apparent puzzle is immediate as soon as one realizes that the boundary
      r = r− of MIII is not entirely constituted by Hin : it also comprises a null hypersurface that
      separates MIII from a region distinct from MII in the maximally extended Kerr spacetime, cf.
      Fig. 10.11. This region is a “time-reversed” copy of MII and is denoted by MII∗ ′ in Fig. 10.11
      (cf. Sec. 10.8 for details). So actually, when it reaches r = r− , the null geodesic L enters MII∗ ′ .
      There r necessarily increases towards the future, at the opposite of MII . It reaches then r = r+ ,
      where L crosses a white hole horizon and emerges into the asymptotically flat region MI′′ , as
      illustrated in Fig. 12.1. The region MI′′ is similar to MI . In particular, L is confined into the
      outer ergoregion of MI′′ , having a r-turning point at r = rmax (same value (12.10) as in MI ).
      Then a new cycle begins, with L entering the future event horizon of MI′′ .
           The θ-motion of L is constrained by Θ(θ) ≥ 0 [Eq. (11.33)], which, given expression (12.5)
      for Θ, is equivalent to
      since µ = 0 and E = 0, yields θm = arccos 1/(1 + L̄2 ) = arctan |L̄|. Hence we recover the above
      formula.
          For L = 0, one has θm = 0, so that θ takes all values in the range [0, π], which means that
      L crosses repeatedly the rotation axis. For L < 0, one has 0 < θm < π/2 and L oscillates
      symmetrically about the equatorial plane, having two θ-turning points, at θm and π − θm . Of
      course, we recover the general results for Q > 0 of Sec. 11.3.7.
          We can obtain r as a function of θ along L by evaluating the integrals in the identity
      (11.55a):
                                       Z r             Z θ
                                             ϵr dr̄          ϵθ dθ̄
                                       − p          =− p
                                         r0   R(r̄)      θ0   Θ(θ̄)
      Using (12.9) and (12.5), we get on any portion of L where ϵr and ϵθ are constant,
                       Z   r                                                                  Z   θ
                                                       dr̄                                                     dθ̄
                  ϵr           p                                                       = ϵθ           p                       .
                       r0          −(1 +   L̄2 )r̄2   + 2m(1 +   L̄2 )r̄   −   a2             θ0          1 − L̄2 / tan2 θ̄
                                              r/m − 1
                                         x= q                              and            µ = cos θ
                                                     a2
                                             1 − m2 (1+ L̄2 )
      lead to                               Z   x                          Z   cos θ
                                    ϵr                   dx̄                                 dµ
                               √                      √         = −ϵθ                  p                 .
                                1 + L̄2       x0        1 − x̄2            cos θ0       1 − (1 + L̄2 )µ2
                                  12.2 Main properties of null geodesics                                 447
Figure 12.1: Trajectory in the extended Kerr spacetime of a null geodesic with E = 0, Q > 0 and L < 0,
emitted from a point A in the outer ergoregion.
                                                            √
The integration is then immediate: arcsin x = −ϵr ϵθ arcsin( 1 + L̄2 cos θ) + K, where K is a
constant, from which we get
                       s
                                     a2           h                p              i
         r =m+m            1−                 sin  K − ϵ ϵ
                                                        r θ arcsin   1 + L̄2 cos θ    .       (12.15)
                                m2 (1 + L̄2 )
        √
Since       1 + L̄2 cos θm = 1, we see that the constant K is related to the value of r at θ = θm by
                                                             
                                             r(θm )/m − 1             π
                                K = arcsin  q                 + ϵr ϵθ   .                    (12.16)
                                                      a2
                                              1 − m2 (1+               2
                                                         L̄2 )
448                          Null geodesics and images in Kerr spacetime
e r/m cosθ
4 r=r +
2 r=r −
                                                                A                      e r/m sinθ
                                           1     2    3     4       5   6    7     8
                                   r=0
                                   2
      Note that K is not a constant all along L , but only on portions where ϵr and ϵθ are constant.
      Expression 12.15 gives the trace of the zero-energy null
                                                             √ geodesic L in a meridional plane. It
      depends on Q and L only through the ratio L̄ := L/ Q. It depends as well on the value of
      r at√θm via K, as it appears in Eq. (12.16). An example is shown in Fig. 12.2 for a/m = 0.9,
      L/ Q = −1 and r(θm ) = 1.5 m. It has θm = π/4, rmin ≃ 0.229 m and rmax ≃ 1.771 m, while
      for a/m = 0.9, one has r− ≃ 0.564 m and r+ ≃ 1.436 m. For concreteness, the arrows indicate
      some direction of motion, but depending upon some initial conditions, the opposite direction
      is possible. In particular, one may consider that the geodesic is the same as that shown in
      Fig. 12.1, being emitted outward in the outer ergoregion from a point A in the equatorial plane
      (θ = π/2).
      Case Q = 0
      If the zero-energy null geodesic L has a vanishing Carter constant Q, Eq. (12.5) reduces to
      Θ(θ) = −L2 / tan2 θ, so that the constraint Θ(θ) ≥ 0 [Eq. (11.33)] implies L = 0 or θ = π/2 .
          In the first case, the four constants of motion µ, E, L and Q are zero. By virtue of the result
      (11.25), L is nothing but a null geodesic generator of the event horizon H or of the inner
                               12.2 Main properties of null geodesics                                     449
horizon Hin .
    In the second case (θ = π/2), L is confined to the equatorial plane. If L = 0, we are back
to the first case: L is null geodesic generator of H or Hin lying in the equatorial plane. If
L ̸= 0, the radial motion of L is governed by Eq. (12.3b) with the expression (12.4) of R(r)
reduced to
                                     R(r) = L2 r(2m − r).                               (12.17)
The constraint R(r) ≥ 0 [Eq. (11.30)] implies then that the motion is within the range 0 ≤ r ≤
2m, with r = 2m being a r-turning point, since it is a simple root of R(r) (cf. Sec. 11.2.6). It
corresponds to the outer edge of the ergoregion in the equatorial plane, cf. Eq. (10.19b). Hence
we have necessarily L ∩ MI ̸= ∅ and (12.2) applies: L < 0. The inner boundary of the radial
motion, r = 0, is the ring singularity. Accordingly, in the maximally extended Kerr spacetime,
L starts at the ring singularity in a MIII -type region (cf. Fig. 12.3), has r increasing, enters
a MII∗ -type region (time reversed copy of MII ), emerges in MI via the white hole horizon
at r = r+ and reaches a r-turning point at r = 2m, then r decreases continuously until L
terminates at the ring singularity of MIII , after having crossed the black hole horizon H and
the inner horizon Hin . This trajectory, depicted in Fig. 12.3, is similar to that for Q ̸= 0 shown
in Fig. 12.1, except that it is “blocked” by two ring singularities and cannot oscillate forever
between distinct MI -type regions.
Remark 2: For Q ̸= 0 and L ̸= 0, the limit Q → 0 corresponds to L̄2 → +∞ [cf. Eq. (12.8)], so that
Eq. (12.10) yields rmin → 0 and rmax → 2m. We recover then the range [0, 2m] for r obtained here for
Q = 0.
   We conclude:
  Property 12.1: null geodesic with E = 0 and Q = 0
  Any null geodesic with E = 0 and Q = 0 is either a null generator of one of the two Killing
  horizons H or Hin (in which case, it has L = 0) or it has L < 0, lies in the equatorial
  plane, emanates from a ring singularity, reaches the outer ergosphere (r = 2m), where it
  has a r-turning point, and terminates at a ring singularity.
   Regarding the sign of L, we can combine the above result with that obtained for Q ̸= 0
[Eq. (12.13)] to get:
Historical note : The zero-energy null geodesics in Kerr spacetime appear to have been first studied by
Zdeněk Stuchlík in 1981, in the appendix of the article [466]; some corrections and refinement of his
results have been performed by George Contopoulos in 1984 [141], who studied zero-energy timelike
geodesics as well.
450                         Null geodesics and images in Kerr spacetime
Figure 12.3: Trajectory in the extended Kerr spacetime of a null geodesic with E = 0, Q = 0 and L < 0.
                                                 L                          Q
                                          ℓ :=            and        q :=      .                            (12.19)
                                                 E                          E2
      Note that, in geometrized units (G = 1 and c = 1), ℓ has the dimension of a length and q that
      of a squared length.
      Remark 3: In the literature, ℓ is sometimes denoted by λ (e.g. Refs. [38, 236, 169]) or by ξ (e.g. Ref. [107]),
      while q is sometimes denoted by η (e.g. Refs. [38, 107, 236]). Moreover, in studies restricted to q ≥ 0,
      it may happen that the notation q stands for the square root of the quantity q defined by (12.19) (e.g.
      Refs. [165, 238, 169]).
                                 12.2 Main properties of null geodesics                                        451
Remark 4: Contrary to L and Q, the quantities ℓ and q are independent from the affine parametrization
of the geodesic L . Indeed, if instead of the affine parameter λ associated with the particle’s 4-momentum
p, one considers the affine parameter λ′ = αλ, where α is a positive constant, the tangent vector field
becomes p′ = α−1 p, so that the associated conserved quantities are E ′ = −ξ · p′ = α−1 E [cf.
Eq. (11.2a)], L′ = η · p′ = α−1 L [cf. Eq. (11.2b)] and Q′ = K̃(p′ , p′ ) = α−2 Q [cf. Eq. (11.41)], so that
ℓ′ := L′ /E ′ = ℓ and q ′ := Q′ /E ′ 2 = q.
    The non-negative property of Carter constant K [Eq. (11.36)], along with the identity
(11.34) leads to the following constraint on the parameters ℓ and q:
q + (ℓ − a)2 ≥ 0 . (12.20)
Example 1 (Principal null geodesic): A principal null geodesic has E ̸= 0 if, and only if, it does
not belong to the outgoing family generating the horizon H or Hin . This follows immediately from
Eqs. (11.6) and (11.9). One has then L = aE sin2 θ0 [Eqs. (11.6) and (11.9)] and Q = −a2 E 2 cos2 θ0
[Eq. (11.38)], where θ0 is the constant value of θ along the geodesic. We have thus
This yields
                                               q + (ℓ − a)2 = 0,                                    (12.22)
so that the inequality (12.20) is saturated for principal null geodesics.
   The equations of motion in terms of the Mino parameter λ′ , Eqs. (11.50) specialized to
µ = 0, can be rewritten as
                            1 dt    1  2
                                        (r + a2 )2 − 2amℓr − a2 sin2 θ                            (12.23a)
                                                          
                                ′
                                  =
                           E dλ     ∆
                             1 dr
                                                                                                  (12.23b)
                                        p
                                   = ϵr   R(r)
                           |E| dλ′
                             1 dθ
                                        q
                                   = ϵθ Θ̃(θ)                                                     (12.23c)
                           |E| dλ′
                           1 dφ      ℓ   a
                               ′
                                 =    2 + (2mr − aℓ) ,                                            (12.23d)
                           E dλ    sin θ ∆
with
                                  R(r)                          Θ(θ)
                             R(r) := 2
                                             and      Θ̃(θ) :=       .                              (12.24)
                                   E                             E2
From the general expressions (11.44), (11.95) and (11.35) specialized to µ = 0, we get
      and
                                                                     ℓ2
                                                                            
                                      Θ̃(θ) = q + cos θ a2 −
                                                       2
                                                                                 .                   (12.26)
                                                                   sin2 θ
         It suffices to use the parameter λ′′ := |E|λ′ to make E disappear from the system (12.23).
      We therefore conclude:
Property 12.3
        In Kerr spacetime, a null geodesic with E ̸= 0 is entirely determined by the two constants
        (ℓ, q), by the sign of E and by the values of the two signs ϵr = ±1 and ϵθ = ±1 at a given
        point.
      Example 2 (Principal null geodesic): Given the values (12.21) of ℓ and q for a principal null geodesic,
      Eq. (12.25b) reduces to a simple expression for the quartic polynomial R:
                                                                    2
                                           R(r) = r2 + a2 cos2 θ0        .                            (12.27)
      We note that R(r) = ρ4 , which makes sense because θ = θ0 is constant along such a geodesic. For any
      principal null geodesic lying in the equatorial plane, the polynomial simplifies even further:
                                                                    π
                                             R(r) = r4          θ0 =    .                            (12.28)
                                                                     2
Figure 12.4: Impact of a null geodesic L onto the screen of a remote observer. (x, y, z) are the Cartesian
Boyer-Lindquist coordinates defined by Eq. (11.73).
normal to the screen. The 4-momentum of the photon having L as worldline at the location
of O writes
where the second equality follows from the orthogonal decomposition (1.31) with respect to
O, ξ being the 4-velocity of O and the unit spacelike vector n being the photon’s velocity1
relative to O; it obeys ξ · n = 0 and n · n = 1 [Eq. (1.37)]. The appearance of the conserved
energy E in the above equation is a direct consequence of the definition (11.2a): E := −ξ · p,
with ξ · ξ = −1 for rO → +∞. The above identity implies pt = E and
                                   pr       pθ        pφ
                                n=    e(r) + rO e(θ) + rO sin θO e(φ) .
                                   E        E         E
With respect to O, the incoming direction of the photon is given by the vector −n and the
trace on the screen is indicated by the component of −n that is tangent to the screen, namely
                                                pθ          pφ
                                m = −n∥ = −        rO e(θ) − rO sin θO e(φ)                       (12.30)
                                                E           E
since the screen plane is spanned by e(θ) and e(φ) (cf. Fig. 12.4). Let us choose the screen
dimensionless Cartesian coordinates (α, β) so that the black hole’s rotation axis appears as the
β-axis (cf. Fig. 12.4), then
          We have defined (α, β) as dimensionless Cartesian coordinates on the screen, cf. Eq. (12.31),
      where m is dimensionless and (e(α) , e(β) ) is the screen’s orthonormal basis. In practice, their
      values are tiny, being exactly zero at the limit rO → +∞. (α, β) can thus be interpreted as
      angular coordinates, measuring the departure from the direction of the black hole “center” on
      the celestial sphere of observer O. We shall then call (α, β) the screen angular coordinates.
      Remark 5: When studying null geodesics in Schwarzschild spacetime in Chap. 8, we introduced the
      impact parameter b as b := |L|/E [Eq. (8.12)], hence b is related to ℓ by
                                                        b = |ℓ|.                                        (12.33)
      Moreover, thanks to spherical symmetry, we could reduce the study to the case where both the observer
      and the geodesic lie in the equatorial plane, which imply θO = π/2 and q = 0. Equations (12.32) yield
      then
                                                 b
                                        |α| =       = b̂    and    β = 0,                            (12.34)
                                                rO
      where b̂ is the angle introduced by formula (8.118).
      Remark 6: The angular impact parameters (α, β) depend on the geodesic L as a curve in spacetime
      and not on any affine parametrization of L . Their direct connection with (ℓ, q) expressed by (12.32) is
      thus in perfect agreement with the invariance of (ℓ, q) in any affine reparametrization of L , as noticed
      in Remark 4 p. 451.
          We deduce from Eqs. (12.32) a simple relation between the squared angular distance to the
      screen center, α2 + β 2 , and the constants of motion (ℓ, q) of the incoming geodesic:
                                                   1 2
                                     α2 + β 2 =      ℓ + q + a2 cos2 θO .                              (12.35)
                                                                       
                                                   2
                                                  rO
                              12.2 Main properties of null geodesics                                    455
Given that ℓ = L/E, we recover one of the properties listed in Sec. 11.3.7: a geodesic cannot
encounter the rotation axis unless it has L = 0.
    Moreover, on the rotation axis, the vectors e(θ) and e(φ) are not defined, due to the singularity
of spherical coordinates there. Consequently, the screen coordinates (α, β) cannot be defined
by (12.31). In particular, the rotation axis appears as single point on the screen, which forbids
to use it to define the β-axis. One has then to pick an arbitrary orthonormal frame (e(α) , e(β) )
in the screen plane to define (α, β). Formulas (12.88) do no longer hold, but formula (12.35)
is still valid, since the distance from the screen’s center is a quantity independent from the
orientation of the frame (e(α) , e(β) ). Another way to see that (12.35) is still valid is to notice
that it admits a well-defined limit for θO → 0 or π. Taking into account ℓ = 0, we obtain
                                      1
                        α2 + β 2 =      q + a2         (θO = 0 or θO = π).                   (12.37)
                                               
                                      2
                                     rO
      • A null geodesic L of Kerr spacetime cannot encounter the rotation axis unless it
        has ℓ = 0.
q ≥ 0. (12.38)
      • The reduced Carter constant q can take negative values only if |ℓ| < a (which implies
        a ̸= 0); its range is then limited from below:
               If q > 0 and ℓ = 0, L crosses repeatedly the rotation axis, with θ taking all values
               in the range [0, π].
            • If q = 0, L is stably confined to the equatorial plane for |ℓ| > a or |ℓ| = a ̸= 0; for
              |ℓ| < a, L either lies unstably in the equatorial plane or approaches it asymptotically
              from one side, while for ℓ = 0 and a = 0, L lies at a constant value θ = θ0 ∈ [0, π].
            • If qmin < q < 0, L never encounters the equatorial plane, having a θ-motion entirely
              confined either to the Northern hemisphere (0 < θ < π/2) or to the Southern one
              (π/2 < θ < π); if ℓ ̸= 0, L oscillates between two θ-turning points, at θ = θm
              and θ = θv (Northern hemisphere) or at θ = π − θv and θ = π − θm (Southern
              hemisphere), where θm is given by Eq. (12.41) above and θv is given by Eq. (11.91):
                                    v                                          
                                    u                  s              2
                                    u1
                                    u         2
                                             ℓ +q                 2
                                                                ℓ +q         4q
                        θv = arccos t 1 −        2
                                                    −       1−       2
                                                                          + 2 ;           (12.42)
                                      2         a                   a        a
      Example 3 (Principal null geodesic): A principal null geodesic moves at a constant angle θ = θ0 and
      has ℓ = a sin2 θ0 [Eq. (12.21)]. For θ0 ̸= π/2, we have |ℓ| < a and Eq. (12.39) yields qmin = −a2 cos4 θ0 .
      Comparing with the value of q given by Eq. (12.21), we note that
                                                      q = qmin ,                                         (12.45)
      which agrees with the last caseplisted above, i.e. motion at constant θ, with θ0 = θ∗ or θ0 = π − θ∗
      according to Eq. (12.44), since |ℓ|/a = sin θ0 .
                                                      12.2 Main properties of null geodesics                                                                                   457
            40                                   40                                        40                                        40
                 ` = 0.5 m                             ` = 0.863 m                              `=m                                       ` = 2mr + /a
 R(r)/m 4   30   q = 16 m 2                      30    q = 18.04 m 2                       30   q = 20 m 2                           30   q = 8 m2
            20                                   20                                        20                                        20
10 10 10 10
            0                                    0                                         0                                         0
                                                                   r0                                 r1                 r2                                   r0
            10                                   10                                        10                                        10
                 1.5   2.0     2.5   3.0   3.5         1.5   2.0         2.5   3.0   3.5        1.5        2.0     2.5   3.0   3.5        1.5   2.0     2.5        3.0   3.5
                              r/m                                       r/m                                       r/m                                  r/m
                        (a)                                        (b)                                           (c)                                  (d)
Figure 12.5: Quartic polynomial R(r) in the region MI for four values of (ℓ, q) and for a = 0.95 m. The grey
area marks the black hole region, with the vertical black line at the event horizon, at r = r+ ≃ 1.312 m. The
green part of the r axis corresponds to R(r) ≥ 0, i.e. to regions where the geodesic motion is allowed. [Figure
generated by the notebook D.5.13]
R(r) ≥ 0 . (12.46)
Example 4 (Principal null geodesic): Given the value (12.27) of R(r) for a principal null geodesic, we
note that R(r) > 0 in all Kerr spacetime. This is consistent with the fact that, for E ̸= 0 and θ0 ̸= π/2,
ingoing principal null geodesics travel from r = +∞ to r = −∞ (cf. the dashed green curve in Fig. 10.9)
and outgoing principal null geodesics travel from r = −∞ to r = +∞ (cf. the solid green curve in
Fig. 10.10).
    Since R(r) is a polynomial of degree 4 in r, its behavior can be relatively complicated.
However, it turns out that in the black hole exterior, its behavior is quite simple, according to
the following lemma:
  In region MI of Kerr spacetime, i.e. for r > r+ , the quartic polynomial R(r) associated to
  a given geodesic [Eq. (12.25)] has one of the following behaviors, depending on the value
  of (ℓ, q):
1. R(r) has no root in (r+ , +∞) and R(r) > 0 there (Fig. 12.5a);
             2. R(r) has a double root in (r+ , +∞), at r = r0 say, and R(r) > 0 iff r ∈ (r+ , r0 ) or
                r ∈ (r0 , +∞) (Fig. 12.5b);
             3. R(r) has two simple roots in (r+ , +∞), at r = r1 and r = r2 say, and R(r) > 0 iff
                r ∈ (r+ , r1 ) or r ∈ (r2 , +∞) (Fig. 12.5c);
458                        Null geodesics and images in Kerr spacetime
           4. R(r) has a unique simple root in (r+ , +∞), at r = r0 say; then necessarily ℓ =
              2mr+ /a, R(r+ ) = 0 and R(r) > 0 iff r ∈ (r0 , +∞) (Fig. 12.5d).
      Proof. First, we observe that R(r) is positive or zero at both ends of MI . This is clear for the
      asymptotic flat end since, according to expression (12.25b), R(r) ∼ r4 > 0 when r → +∞. At
      the inner end, namely for r → r+ , we have ∆ → 0 and expression (12.25a) yields the limit
      value R(r+ ) = (r+ 2
                           + a2 − aℓ)2 ≥ 0. Now, as a polynomial of degree four, R(r) has at most
      four real roots. Given the above boundary conditions, R(r) can have a unique simple root in
      (r+ , +∞) only if R(r+ ) = 0 (cf. Fig. 12.5d), which occurs for a very specific value of ℓ, namely
      ℓ = (r+2
               + a2 )/a = 2mr+ /a. This is case 4 above. Similarly, the case of three roots in (r+ , +∞),
      either three simple roots or one double and one simple root, is compatible with the boundary
      conditions only if R(r+ ) = 0, i.e. if r+ is a fourth root of R(r). But then the four roots of R(r)
      would be positive (since r+ > 0). Now, since there is no r3 term in the expression (12.25b) of
      R(r), the sum of the roots of R(r) is zero, which is impossible with all the roots being positive.
      Hence there cannot be three roots in (r+ , +∞). The same argument about the zero sum of
      the roots excludes as well the case of four roots of R(r) in (r+ , +∞). There remains then the
      cases of no root at all in (r+ , +∞) (case 1 of the lemma) and that of two roots (cases 2 and 3).
      These three cases are compatible with the boundary conditions and the vanishing of the sum
      of the roots.
      Remark 9: In case 4, R(r+ ) = 0 occurs for ΩH ℓ = 1, where ΩH is the black hole rotation velocity, as
      given by Eq. (10.69). Similarly, R(r− ) = 0 for Ωin ℓ = 1, where Ωin is the rotation velocity of the inner
      horizon, cf. Eq. (10.71).
      Remark 10: The region MIII shares with MI that R(r) is non-negative at each of its ends: R(r) ∼
      r4 > 0 when r → −∞ and R(r− ) = (r−     2 + a2 − aℓ)2 ≥ 0. However, the argument used to limit the
      number of roots in MI cannot be applied to MIII because the latter can accommodate for both positive
      and negative values of r and thus four roots of R(r) can be located in MIII .
         According to the definition given in Sec. 11.2.6, a r-turning of a null geodesic L is a point
      p0 ∈ L such that r0 := r(p) is a simple root of R(r). Lemma 12.5 leads then to:
      Proof. If a geodesic would have two turning points in MI , this would mean that there exist
      two simple roots in MI and that R(r) is positive between them (so that the motion is possible
      there). But this is excluded by Lemma 12.5. In region MII , we have ∆ < 0 and Eqs. (12.25a)
                              12.2 Main properties of null geodesics                                    459
and (12.20) show that R(r) is the sum of two non-negative terms. The only possibility to have
R(r) = 0 is then that each term vanishes separately: r2 + a2 − aℓ = 0 and q + (ℓ − a)2 = 0, i.e.
The second equation implies q ≤ 0. The case q = 0 would lead to ℓ = a and r2 = 0, which is
impossible in MII . There remains q < 0, but according to the results of Sec. 12.2.4, this implies
|ℓ| < a, so that ℓ − a < 0 and the first equation above would yield r2 < 0, which is impossible.
There is thus no r-turning point in MII .
Remark 11: That no r-turning point can exist in MII has been established above by some considerations
on R(r). This property can also be deduced as an immediate consequence of the result (11.97), namely
that r must be a strictly decreasing function of λ at any point in MII .
     Let us consider case 2 of Lemma 12.5 (double root of R(r) in MI ); it corresponds to two
distinct situations regarding the null geodesic L having R(r) as radial polynomial. First, L
can lie at a constant value of r, which is necessarily the double root r0 of R(r) by virtue of
Eq. (12.23b); L belongs then to the category of the spherical photon orbits, which will be studied
in Sec. 12.3. If r is not constant along L , then according to the definition in Sec. 11.3.6, L has
as an asymptotic r-value, which is the double root r0 . Given that r0 is the only root of R(r) in
(r+ , +∞), Eq. (12.23b) implies dr/dλ ̸= 0 all along L , so that r(λ) → r0 for λ → +∞ (future
asymptotic value) or λ → −∞ (past asymptotic value). Such a geodesic belong to the category
of the critical null geodesics, which will be studied in Sec. 12.4.1.
     In view of the above results, we can state:
  In the region MI of Kerr spacetime, any null geodesic L has one of the following behaviors.
  For generic values of the constant of motions (ℓ, q), the possibilities are:
     1. L arises from the past null infinity of MI , I − , has a r-turning point, which we may
        call the periastron, and terminates at the future null infinity of MI , I + ;
     2. L arises from the past null infinity I − , has r decreasing monotonically and crosses
        the black hole event horizon H ;
     3. L arises from the past event horizon H − separating MI from the white hole region
        MII∗ , has r increasing monotonically and terminates at the future null infinity I + ;
     4. L arises from the past event horizon H − , has a r-turning point, which we may call
        the apoastron, and crosses the black hole event horizon H ;
  For some specific values of the constant of motions (ℓ, q), forming a 1-dimensional (hence
  zero-measure) subset of the set of all possible valuesa , the possibilities are:
           6. L arises from the past null infinity I − and has r decreasing monotonically to a
              future asymptotic r-value at r0 > r+ ;
           9. L arises from the past event horizon H − , has r increasing monotonically to a future
              asymptotic r-value at r0 > r+ .
           a
               This subset is given in parametric form by Eqs. (12.56)-(12.57) below.
      Example 5 (principal null geodesics): That the principal null geodesics with E ̸= 0 belong to cases 2
      and 3 above is clear from their value (12.27) for R(r): R(r) = ρ4 > 0 in all M , which precludes the
      existence of any r-turning point nor any r-asymptotic value along these geodesics.
      Remark 13: As a sequel of Remark 10 above, a null geodesic can have two turnings points in region
      MIII . There can thus exist null geodesics that are trapped between two distinct values of r in MIII .
spherical orbits have E = 0 [cf. Eq. (11.25)]. From the results of Sec. 12.2.1, there is no other
null geodesic with E = 0 at constant r. Hence we conclude:
  In Kerr spacetime, the only spherical photon orbits with E = 0 are the null generators of
  the two Killing horizons H and Hin .
Remark 1: A spherical photon orbit of Kerr spacetime is an example of what has been called a
fundamental photon orbit in a generic stationary and axisymmetric spacetime [144, 145], namely a
null geodesic L of affine parameter λ such that (i) L is restricted a spatially compact region and (ii)
there exists Λ ∈ R+ such that
where p(λ) stands for the point of affine parameter λ along L and R × SO(2) is the symmetry group
implementing stationarity and axisymmetry. Basically (i) says that L is a bound orbit and (ii) that L
is periodic, of period Λ in terms of λ, up to some isometry. Note that the definition of a fundamental
photon orbit is coordinate-independent. Introducing coordinates (t, x1 , x2 , φ) adapted to the spacetime
symmetries, the requirement (ii) is equivalent to demanding that x1 and x2 are periodic functions of λ.
In the present case x1 = r and x2 = θ. For a spherical photon orbit, r is obviously a periodic function
of λ, being a constant function, while we shall see in Sec. 12.3.2 that θ is indeed a periodic function of λ.
    In the remainder of this section, we focus on spherical photon orbits with E ̸= 0. We shall
then describe these geodesics by the reduced constants of motion ℓ := L/E and q := Q/E 2
introduced in Sec. 12.2.2.
    For a spherical photon orbit L , the quartic polynomial R(r) associated to (ℓ, q) by
Eq. (12.25) must obey
                              R(r0 ) = 0     and      R′ (r0 ) = 0.                     (12.47)
Proof. Since r = r0 with r0 constant, we have dr/dλ′ = 0 where λ′ is the Mino parameter
along L , so that Eq. (12.23b) implies R(r0 ) = 0. If R′ (r0 ) ̸= 0, then r0 would correspond to a
r-turning point of L (cf. Eq. (11.46)), which would contradict r being constant.
    In other words, r0 is a double root the polynomial R(r). A spherical photon orbit in MI
correspond then to case 2 of Lemma 12.5 and an example of R(r) is shown for a = 0.95 m and
r0 = 2.2 m in Fig. 12.5b.
    In view of expression (12.25a) for R(r), the system (12.47) is equivalent to
                                               K
                                       q̃ :=      = q + (ℓ − a)2 ,                                   (12.49)
                                               E2
462                       Null geodesics and images in Kerr spacetime
      where the second equality follows from relation (11.34) between K and Q. The system (12.48)
      has 2 equations for 3 unknowns: (r0 , ℓ, q). We thus expect a one-parameter family of solutions.
      It is convenient to consider r0 as the parameter and to solve the system (12.48) for (ℓ, q). We
      shall distinguish the case r0 = m from r0 ̸= m. If r0 = m, the system (12.48) reduces to
                                                                 
                   (m2 + a2 − aℓ)2 − q̃(a2 − m2 ) = 0             ℓ = a + m2
                                                                             a
                                                            ⇐⇒
                       2     2
                   m + a − aℓ = 0                                 q̃ = 0 or a = m.
      Now, q̃ = 0 is equivalent to q = −(ℓ−a)2 = −m4 /a2 , which implies q < 0, which is impossible
                     2
      for ℓ = a + ma > a, due to the property |ℓ| > a =⇒ q ≥ 0 (cf. Sec. 12.2.4). There remains
      a = m, which implies ℓ = 2m. Hence we conclude
                                                    
                                                     a=m
                                        r0 = m ⇐⇒                                           (12.50)
                                                     ℓ = 2m.
      We shall discuss this case further in Sec. 12.4.4, as well as in Chap. 13, which is devoted to
      the extreme Kerr black hole (a = m). In the remainder of this section, we assume r0 ̸= m.
      Equation (12.48b) is then equivalent to
                                                2r0 (r02 + a2 − aℓ)
                                           q̃ =                     .                           (12.51)
                                                      r0 − m
      Substituting this relation into Eq. (12.48a), we get an equation involving ℓ only:
                                                                        
                               2     2          2    2           2r0
                             (r0 + a − aℓ) r0 + a − aℓ −              ∆0 = 0.
                                                               r0 − m
      The two solutions are immediate:
                                                            r02
                                                  ℓ=a+                                          (12.52)
                                                            a
      or
                                               r0
                                  ℓ=a+               [r0 (r0 − m) − 2∆0 ] .                     (12.53)
                                           a(r0 − m)
      The solution (12.52), once inserted in (12.51) leads to q̃ = 0, i.e. to q = −(ℓ − a)2 ≤ 0. Now,
      q < 0 is excluded since Eq. (12.52) implies |ℓ| ≥ a (cf. Sec. 12.2.4). There remains q = 0, which
      yields ℓ = a. Then Eq. (12.52) leads to r0 = 0. However, according to the results stated in
      Sec. 12.2.4, q = 0 and ℓ = a imply that the geodesic is confined in the equatorial plane, where
      r0 = 0 corresponds to the ring singularity. We thus conclude that the solution (12.52) is not
      permitted. The remains then the solution (12.53). Substituting it into Eq. (12.51), we get
                                                        4r02 ∆0
                                               q̃ =             ,                               (12.54)
                                                      (r0 − m)2
      so that Eqs. (12.49) and (12.53) yield, after simplifications,
                                   r03
                                                                                                (12.55)
                                           3        2     2          2
                                                                         
                            q= 2           −r0 + 6mr0  − 9m  r 0 + 4a   m  .
                              a (r0 − m)2
                                                               12.3 Spherical photon orbits                                                                                463
                                                                                              2.0
                   q                                                                                      q
              25   `                                                                                      `
                   |`| = a                                                                    1.5         |`| = a
              20
                                                                                              1.0
              15
                                                                                 `/m, q/m 2
 `/m, q/m 2
                                                                                              0.5
              10                                                                                                    rph∗ ∗                         rph∗             rph+
                                                                                              0.0
              5
                                                                                                                                           rphms
                         rph∗ ∗           rph∗     rph+                 rph−                  0.5
              0
                                                                                              1.0
               5
                                                                                              1.5
                   1              0         1             2     3      4                            1.0              0.5     0.0           0.5            1.0        1.5
                                                 r0 /m                                                                             r0 /m
Figure 12.6: Functions ℓc (r0 ) (in red) and qc (r0 ) (in blue) giving the reduced angular momentum and reduced
Carter constant of a spherical photon orbit of radius r0 , according to Eqs. (12.56) and (12.57) and for a = 0.95 m.
The right figure is a zoom on the part −m ≤ r0 ≤ 3m/2. The thick vertical lines mark the two horizons: H
(black) and Hin (light brown). The horizontal dotted lines mark the boundary of the region |ℓ| < a, where q can
take negative values. Values of q in the grey zone are unphysical, i.e. do not fulfill the conditions (12.58), so that
only values of r0 for which the blue curve lies above the grey zone correspond to spherical photon orbits. This
                                              −
occurs for rph
             ∗∗          ∗
                ≤ r0 ≤ rph  and rph
                                  +
                                     ≤ r0 ≤ rph  . The thick segments of the ℓ and q curves correspond to stable
orbits, for which 0 ≤ r0 ≤ rph . [Figure generated by the notebook D.5.11]
                             ms
Recasting expressions (12.53) and (12.55), we conclude that a spherical photon orbit at r =
r0 ̸= m has a reduced angular momentum ℓ and a reduced Carter constant q given by
                                                                     r03
                                                                                                                                                                 (12.57)
                                                                             2
                                                                             4a m − r0 (r0 − 3m)2 .
                                                                                                 
                                         q = qc (r0 ) :=         2        2
                                                                a (r0 − m)
For a solution to exist, the constraint (12.20) must be obeyed; it writes q̃ ≥ 0. Given expression
(12.54) for q̃, we see that it is equivalent ∆0 ≥ 0. Hence spherical photon orbits do not exist in
region MII of Kerr spacetime. This is in agreement with the general result (11.97).
    Equations (12.56) and (12.57) provide the general solution to the system R(r0 ) = 0 and
R (r0 ) = 0 [Eq. (12.47)], but not all of them correspond to spherical photon orbits. Indeed, we
  ′
do not expect spherical photon orbits to exist for any value of r0 , in particular for |r0 | ≫ m.
Actually, not all values of q given by Eq. (12.57) are permitted, but only those that fulfill the
constraints established in Sec. 12.2.4, namely
                                                              q≥0     if |ℓ| ≥ a                                                                                (12.58a)
                                                              q ≥ − (a − |ℓ|)    2
                                                                                              if |ℓ| < a.                                                       (12.58b)
The solutions ℓ and q given by Eqs. (12.56)-(12.57) are plotted as functions of r0 for a = 0.95 m
in Fig. 12.6, where the region excluded by (12.58) is colored in grey. Consequently spherical
photon orbits exist only for values of r0 for which the q curve (in blue) lies above the grey
region. We see that this occurs in three intervals:
                                             ∗∗
                                      r0 ∈ [rph , 0) ,                    ∗
                                                                r0 ∈ (0, rph ]                 and                           +
                                                                                                                      r0 ∈ [rph    −
                                                                                                                                , rph ],                         (12.59)
464                        Null geodesics and images in Kerr spacetime
      where rph∗
                 , rph
                    +
                       and rph
                            −
                                are the three (ordered) roots distinct from 0 of the equation qc (r0 ) =
      0 (boundary for condition (12.58a)) and rph    ∗∗
                                                        is the unique root of the equation qc (r0 ) =
      − (a − |ℓc (r0 )|) when |ℓc (r0 )| < a (boundary for condition (12.58b)). The above reason-
                        2
      ing is based on Fig. 12.6, which has been drawn for a = 0.95 m; however the conclusions (12.59)
      hold for any value of a (see the notebook D.5.11 for figures with a = 0.5 m or a = 0.998 m). We
      have excluded r0 = 0 in (12.59) because it would yield ℓ = a and q = 0 following Eqs. (12.56)-
      (12.57). But according to the results of Sec. 12.2.4, such an orbit would be confined to the
      equatorial plane, where r = 0 is not permitted (the ring singularity!).
          Given expression (12.57) for q, we see that rph∗
                                                           , rph
                                                              +
                                                                 and rph
                                                                      −
                                                                         are the three roots of the cubic
      equation
                                          r0 (r0 − 3m)2 − 4a2 m = 0.                               (12.60)
      We can solve this equation by bringing it to a depressed form in order to use Viète’s formu-
      las (8.22). However, we may rely on an already solved equation by noticing the following
      equivalences:
                                                              √                  √
               r0 (r0 − 3m)2 − 4a2 m = 0 ⇐⇒ r0 ≥ 0 and r0 |r0 − 3m| = 2a m
                                                              √                  √
                                         ⇐⇒ r0 ≥ 0 and r0 (r0 − 3m) ± 2a m = 0
                                                               3/2       1/2     √
                                         ⇐⇒ r0 ≥ 0 and r0 − 3mr0 ± 2a m = 0,
      where ± is + for r0 ≤ 3m and − for r0 ≥ 3m. We recognize the function of r0 which appears
      in the left-hand side of Eq. (11.141). As shown in Sec. 11.5.2, there are three real roots, rph
                                                                                                   ∗
                                                                                                      , rph
                                                                                                         +
      and rph
            −
              , with rph
                      ∗     +
                         , rph ≤ 3m (± = +) and rph −
                                                       ≥ 3m (± = −). They are given by Eqs. (11.144)
      and (11.143):
                                                             a  4π 
                                                 2 1
                                   ∗
                                  rph := 4m cos       arccos −       +       .                     (12.61)
                                                    3           m        3
                                                                a 
                                                    2 1
                                       ±
                                      rph := 4m cos       arccos ∓                                 (12.62)
                                                        3           m
      As for rph
              ∗∗
                 , since qc (r0 ) = −(a − |ℓc (r0 )|)2 with |ℓc (r0 )| < a occurs in a region where ℓc (r0 ) < 0
      (cf. Fig. 12.6), we get that rph∗∗
                                         is a solution of the equation qc (r0 ) = −(a + ℓc (r0 ))2 . Given
      expressions (12.56) and (12.57) for respectively ℓc (r0 ) and qc (r0 ), we get that rph ∗∗
                                                                                                 is a root of the
      cubic equation
                                              2r03 − 3mr02 + a2 m = 0.                                    (12.63)
      By the change of variable r0 =: x + m/2, we turn this equation into a depressed one:
                                                           m2
                                                             
                                    3   3 2     m     2
                                   x − m x+          a −        = 0,
                                        4       2           2
      i.e. x3 + px + q = 0, with p := −3m2 /4 and q := (m/2)(a2 − m2 /2). The discriminant is
      −(4p3 + 27q 2 ) = 27a2 m2 (m2 − a2 )/4 ≥ 0. The three roots (xk )k∈{0,1,2} are then all real and
      are given by Viète’s formula (8.22). Only the root x1 leads to a negative value of r0 , which is
      the value we are looking for (cf. Fig. 12.6). Viète’s formula (8.22) with k = 1 yields
                                               a2
                                                                             a  2π 
                             1                          2π              2
               x1 = m cos      arccos 1 − 2 2 +              = m cos      arcsin      +      ,
                             3                m          3              3         m      3
                                           12.3 Spherical photon orbits                                                    465
                       3
                                                                                                  rph+
                                                                                                  rph−
                                                                                                  rph∗
                       2                                                                          rph∗ ∗
               r0 /m
                                                                                                  rphms
                                                                                                  r+
                                                                                                  r−
                       1
Sign of E
      All E ̸= 0 spherical photon orbits lie in MI ∪ MIII . We may then apply the general result
      (11.69) to them. Since L = Eℓ, we get E(r02 + a2 − aℓ) > 0. Hence, using formula (12.56) for ℓ,
where the third line follows from ∆0 > 0 in MI ∪ MIII . In view of (12.65), we conclude:
        All outer spherical photon orbits have E > 0, as well as inner spherical photon orbits with
               ∗∗
        r0 ∈ [rph , 0), while inner spherical photon orbits with r0 ∈ (0, rph
                                                                           ∗
                                                                              ] have E < 0.
      We see on Fig. 12.6 that the reduced conserved angular momentum ℓ (red curve) vanishes at
      two places in the physically allowed range of r0 , i.e. where the blue curve lies above the grey
      region: r0 = rph pol,in
                              and r0 = rph
                                         pol
                                             with rph
                                                   ∗∗    pol,in
                                                      < rph     < 0 and rph
                                                                         +     pol
                                                                            < rph     −
                                                                                   < rph . The value rph
                                                                                                      pol,in
      (resp. rph
              pol
                  ) corresponds thus to inner (resp. outer) spherical orbits. The results of Sec. 12.2.4
      show that these orbits are the only ones that encounter the rotation axis; moreover, they cross
      it repeatedly. We therefore call them polar spherical photon orbits.
           The values of rph  pol,in
                                     and rph
                                          pol
                                              are obtained by solving ℓc (r0 ) = 0, with ℓc (r0 ) given by
      Eq. (12.56). We get the cubic equation r03 − 3mr02 + a2 r0 + a2 m = 0. Setting r0 =: x + m
      reduces it to the depressed cubic x3 + px + q = 0, with p := a2 − 3m2 and q := 2m(a2 − m2 ).
      The discriminant ∆ = −4p3 − 27q 2 being positive, the solutions are provided by Viète’s
      formulas, Eq. (8.22), with k = 0 and k = 1 (k = 2 leads to the third root of ℓc (r0 ) seen on
                                   12.3 Spherical photon orbits                                    467
Fig. 12.6, which lies in the unphysical range of r0 — blue curve in the grey region):
                          r              "                           !#
                                   2                        2    2
                                 a         1          m(m − a )
             pol
            rph   = m + 2 m2 − cos           arccos                                     (12.67a)
                                  3        3         (m2 − a2 /3)3/2
                          r              "                           !       #
                                   2                        2    2
                                 a         1          m(m     − a  )      2π
           pol,in
         rph      = m + 2 m2 − cos           arccos                    +       .        (12.67b)
                                  3        3         (m2 − a2 /3)3/2       3
As a function of a, rph
                     pol
                         decays monotonically from lima→0 rph
                                                           pol
                                                               = 3m to lima→m rphpol
                                                                                     =
     √
(1 + 2)m ≃ 2.414214 m, while rph decays monotonically from lima→0 rph
                                  pol,in                                 pol,in
                                                                                = 0 to
        pol,in
                     √
lima→m rph = (1 − 2)m ≃ −0.414214 m.
Sign of ℓ and L
For spherical orbits with ℓ ̸= 0, the sign of ℓ is deduced from expression (12.56), whose
numerator has a sign governed by the position of r0 with respect to the roots rph
                                                                               pol,in
                                                                                      and rph
                                                                                           pol
determined above and whose denominator is positive iff r0 > m, which occurs only for outer
spherical orbits. We get then
The sign of L is deduced from that of ℓ = L/E by combining with the sign of E obtained in
Sec. 12.3.1.
Latitudinal motion
By applying the results of Sec. 12.2.4, we get
     • All outer spherical photon orbits with r0 ̸∈ rph     , rph and the inner ones with
                                                         + −
       r0 ∈ (0, rph ) have q > 0 (cf. Fig. 12.6); they therefore cross the equatorial plane.
                 ∗
Moreover,
In addition, we have
        For a spherical photon orbit, θ is either a constant or a periodic function of the affine
        parameter λ, the period being
                                                         θmax
                                                                r02 + a2 cos2 θ
                                                     Z
                                               2
                                         Λθ =                       q           dθ,                  (12.68)
                                              |E|    θmin             Θ̃(θ)
       with (θmin , θmax ) = (0, π) (outer polar orbit), (0, θv ) (Northern inner polar orbit), (π −θv , π)
       (Southern inner polar orbit), (θm , π − θm ) (non-polar orbit with r0 ∈ (0, rph   ∗        +
                                                                                           ) ∪ (rph    −
                                                                                                    , rph )),
       (θm , θv ) (Northern non-polar vortical inner orbit) or (π − θv , π − θm ) (Southern non-polar
       vortical inner orbit).
      Proof. Switching from the Mino parameter λ′ to the affine parameter λ via Eq. (11.49) with
      r(λ) = r0 , we may rewrite the equation of motion (12.23c) as
                                  2
                                                                                 E 2 Θ̃(θ)
                         
                             dθ
                                       + V(θ) = 0,       with V(θ) := −                        .
                             dλ                                             (r02 + a2 cos2 θ)2
      This is the 1-dimensional equation of motion in the potential well V. It is then clear that θ(λ)
                                                                                                   q is a
      periodic function. The period Λθ is evaluated by integrating dλ = ϵθ (r02 +a2 cos2 θ)2 /(|E|        Θ̃(θ))
      over a “round-trip” to the same values of θ and dθ/dλ.
                                           12.3 Spherical photon orbits                                                469
Figure 12.8: Spherical photon orbit at r0 = 1.6 m around a Kerr black hole with a = 0.95 m, depicted in terms
of the Cartesian Boyer-Lindquist coordinates (x, y, z) defined by Eq. (11.73) and scaled in units of m. The geodesic
starts at λ = 0 and φ = 0 in the equatorial plane, in the direction of the Southern hemisphere (dθ/dλ > 0). The
left panel corresponds to 0 ≤ λ ≤ 7.7 m/E, while the right one extends the range to 0 ≤ λ ≤ 70 m/E. The grey
sphere is the black hole event horizon. [Figure generated by the notebook D.5.12]
Remark 2: The azimuthal coordinate φ is not a periodic function of the affine parameter λ in general,
nor of the Mino parameter λ′ . An exception regards polar spherical geodesics, since setting ℓ = 0 in the
equation of motion (12.23d) results in dφ/dλ′ = const, so that φ is a periodic function of λ′ (but still
not of λ).
Example 6 (Outer spherical photon orbits): Let us consider a Kerr black hole with a = 0.95 m (same
value as in Fig. 12.6). The event and inner horizon radii are r+ = 1.312 m and r− = 0.688 m and one
     ∗∗ = −0.478 m, r ∗ = 0.658 m, r + = 1.386 m and r − = 3.955 m. Some outer spherical photon
has rph                  ph              ph                   ph
orbits are plotted in Figs. 12.8–12.11 for various values of r0 :
       • r0 = 1.6 m (Fig. 12.8): this orbit has ℓ = 2.171 m, q = 5.976 m2 , θm = 0.706 rad and dφ/dλ > 0;
       • r0 = 2.8 m (Fig. 12.9): this orbit has ℓ = −1.089 m, q = 26.260 m2 and θm = 0.206 rad; φ(λ) is
         not a monotonic function (cf. the projection onto the xy-plane): dφ/dλ < 0 almost everywhere,
         except near the equator, where the Lense-Thirring effect (cf. Sec. 11.3.4) is the strongest2 and
         enforces dφ/dλ > 0;
       • r0 = 3m (Fig. 12.10): this orbit has ℓ = −1.9 m, q = 27 m2 , θm = 0.346 rad and dφ/dλ ≤ 0,
         with dφ/dλ = 0 in the equatorial plane, as it can be easily checked by plugging r = 3m, θ = π/2
         as well as the above values of ℓ and q into Eq. (12.23d); this orbit maximizes the value of q among
         all spherical photon orbits (cf. Fig. 12.6 and Eq. (12.83) below);
       • r0 = 3.9 m (Fig. 12.11): this orbit has ℓ = −6.574 m, q = 3.525 m2 , θm = 1.290 rad and
         dφ/dλ < 0.
   The next example is devoted to inner spherical photon orbits. The Cartesian Boyer-Lindquist
coordinates (t, x, y, z) are not well suited to depict these orbits, in particular those that have
   2
       This can be seen by considering the limit θ → π/2 in Eq. (12.23d).
470                          Null geodesics and images in Kerr spacetime
                                                                                        y/m
                                                                                        2
                                                                                                                    x/m
                                                                       2        1                 1       2
      Figure 12.9: Spherical photon orbit at r0 = 2.8 m around a Kerr black hole with a = 0.95 m, depicted in terms
      of the Cartesian Boyer-Lindquist coordinates (x, y, z) defined by Eq. (11.73) and scaled in units of m, with the
      grey sphere representing the black hole event horizon. The right panel depicts the projection of the orbit onto the
      xy-plane (the overlap with the grey area is a mere projection effect, since of course the orbit never crosses the
      event horizon). The geodesic starts at λ = 0 and φ = 0 in the equatorial plane, in the direction of the Southern
      hemisphere. The plotted range is 0 ≤ λ ≤ 38 m/E. [Figure generated by the notebook D.5.12]
                                                                                    y/m
                                                                                    2
                                                                                                                  x/m
                                                                            1                 1       2       3
      Figure 12.10: Same as in Fig. 12.9 but for a spherical photon orbit at r0 = 3m, with 0 ≤ λ ≤ 32 m/E. [Figure
      generated by the notebook D.5.12]
                                            12.3 Spherical photon orbits                                                471
                                                                                         y/m
                                                                                         4
                                                                                         3
                                                                                         2
                                                                                         1
                                                                                                               x/m
                                                                       4    3   2    1         1   2   3   4
                                                                                         1
                                                                                         2
                                                                                         3
                                                                                         4
Figure 12.11: Same as in Fig. 12.9 but for a spherical photon orbit at r0 = 3.9 m, with 0 ≤ λ ≤ 55 m/E.
[Figure generated by the notebook D.5.12]
Figure 12.12: Marginally stable spherical photon orbit inside a Kerr black hole with a = 0.95 m, depicted
in terms of the coordinates (x̂, ŷ, ẑ) defined by Eq. (12.69) and scaled in units of m. The geodesic is located
at r0 = 0.540 m and starts at λ = 0 and φ = 0 in the equatorial plane, in the direction of the Southern
hemisphere (dθ/dλ > 0). The left panel corresponds to 0 ≤ λ ≤ 3 m/|E|, while the right one extends the range
to 0 ≤ λ ≤ 20, m/|E|. The orange red circle is the ring singularity and the vertical black line is the rotation axis,
with a black dot marking the asymptotic end r → −∞. [Figure generated by the notebook D.5.12]
r < 0. We could use O’Neill exponential coordinates, as in Fig. 12.2. However, we are going
to use instead the radial coordinate r̂ and the associated Cartesian-type coordinates (x̂, ŷ, ẑ)
defined by
                               1        √          
                         r̂ :=      r + r2 + 4m2                                                           (12.69a)
                               2
                         x̂ := r̂ sin θ cos φ, ŷ := r̂ sin θ sin φ,        ẑ := r̂ cos θ.                (12.69b)
As O’Neill coordinates, this brings the whole range (−∞, +∞) for r to (0, +∞) for r̂, with
r̂ → 0 corresponding to r → −∞. Contrary to O’Neill coordinate er/m , the coordinate r̂ does
not enlarge too much the region exterior to the black hole, which makes it better suited for
plots covering both the black hole interior and exterior, as in Fig. 12.17 below. Note that in MI ,
r̂ is asymptotically equivalent to r: r̂ ∼ r as r → +∞.
472                         Null geodesics and images in Kerr spacetime
      Figure 12.13: Same as in Fig. 12.12 but for a vortical spherical photon orbit at r0 = −0.46 m, with 0 ≤ λ ≤
      20 m/E. Also shown is the Northern vortical circular photon orbit at r = rph
                                                                                ∗∗
                                                                                   (in light green). [Figure generated
      by the notebook D.5.12]
                                   6
                                   4
                                   2
                                   0                                                      `ph∗
                             `/m
                                                                                          `ph+
                                   2                                                      `ph−
                                   4
                                   6
                                   8
                                       0.0     0.2        0.4         0.6        0.8         1.0
                                                                a/m
      Figure 12.14: Reduced angular momentum ℓ of the three circular photon orbits in the equatorial plane, as a
      function of the Kerr spin parameter a. [Figure generated by the notebook D.5.11]
      Example 7 (Inner spherical photon orbits): We consider the same Kerr spacetime as in Example 6,
      i.e. a = 0.95 m, but this time, we focus on inner spherical photon orbits:
          • r0 = 0.540 m (Fig. 12.12): this orbit has ℓ = 1.539 m, q = 0.282 m2 , θm = 1.173 rad dφ/dλ > 0,
            E < 0 and L < 0; it is actually a marginally stable photon orbit (r0 = rph ms ), as we shall see in
Sec. 12.3.4;
          • r0 = −0.46 m (Fig. 12.13): this orbit has ℓ = −0.176 m, q = −0.461 m2 (hence it is vortical),
            θm = 0.282 rad, θv = 0.731 rad, dφ/dλ < 0, E > 0 and L < 0;
• r0 = −0.478 m = rph ∗∗ (Fig. 12.13, light green curve): this orbit has ℓ = −0.229 m, q = −0.519 m2
            (hence it is vortical), θm = θv = θph  ∗∗ = 0.514 rad, dφ/dλ < 0, E > 0 and L < 0.
                                       12.3 Spherical photon orbits                                           473
Note that the above definition is independent from any coordinate system. It is worth to
compare Eq. (12.70) with expression (11.178) for the 4-velocity u = µ−1 p of a circular timelike
orbit in the equatorial plane.
Remark 3: Circular photon orbits are sometimes called light rings [144].
    In Kerr spacetime, when using coordinates adapted to the spacetime symmetries, such as
Boyer-Lindquist coordinates (t, r, θ, φ), one has ξ = ∂t , η = ∂φ , α = pt , β = pφ and the
definition (12.70) is equivalent to pr = dr/dλ = 0 and pθ = dθ/dλ = 0. It follows immediately
that a circular photon orbit is any null geodesic along which both r and θ are constant. It is
thus a spherical photon orbit lying at a constant value of θ.
Remark 4: In Schwarzschild spacetime, the photon circular orbits forming the photon sphere at r = 3m
(cf. Sec. 8.2.3) do not have θ = const, except for the orbit in the equatorial plane. However, for a
given non-equatorial orbit on the photon sphere, one may use the spherical symmetry of Schwarzschild
spacetime to perform a change of coordinates (θ, φ) 7→ (θ′ , φ′ ) such that θ′ = const is constant for that
orbit. In Kerr spacetime with a ̸= 0, such “oblique” circular orbits cannot exist due to Lense-Thirring
precession.
Example 8: The spherical photon orbits with E = 0 discussed in Sec. 12.3.1, namely the null generators
of the horizons H and Hin are circular photon orbits, since they are the outgoing principal null
            out,H       out,Hin
geodesics L(θ,ψ)  and L(θ,ψ)    , which have θ = const.
                                                   √
                              lim ℓ+         −
                                   ph = lim ℓph = 3 3m ≃ 5.196 m,                                  (12.71)
                              a→0        a→0
0.5
0.4
0.3
                            θph∗ ∗
                                     0.2
0.1
                                     0.0
                                           0.0   0.2      0.4         0.6     0.8           1.0
                                                                a/m
      Figure 12.15: Angle θ of the Northern vortical circular photon orbit at r = rph
                                                                                   ∗∗
                                                                                      as a function of Kerr spin
      parameter a. [Figure generated by the notebook D.5.11]
      orbits at a fixed value of θ, which are symmetrical with respect to the equatorial plane. They lie
      at θ = θph∗∗
                   (Northern hemisphere) and θ = π − θph  ∗∗
                                                              (Southern hemisphere), where θph ∗∗
                                                                                                  is given
      by Eq. (12.44), using for ℓ the value (12.56) with r0 = rph . Since 3m(rph ) = 2(rph ) + a2 m by
                                                                 ∗∗             ∗∗ 2      ∗∗ 3
      photon orbits are actually circular. Such an orbit is shown in Fig. 12.13 (light green circle).
      The reduced angular momentum and Carter constant of these orbits, denoted by ℓ∗∗       ph and qph
                                                                                                     ∗∗
      respectively, are depicted in terms of a in Fig. 12.16. They tend to zero as a → 0 and obey
                                      m                                             9 2
                                     lim ℓ∗∗  and
                                          ph = −
                                                                           ∗∗
                                                                      lim qph =−       m.               (12.74)
                                      4
                                     a→m                          a→m               16
          We may summarize the above results by
        The geodesics at the boundaries of the domain of existence of spherical photon orbits
        [Eq. (12.59)] are circular orbits:
                                                0.0
                                                                                             `ph∗ ∗
                                                                                             qph∗ ∗
                                                0.1
0.3
0.4
0.5
  Moreover, there is no other circular photon orbit with E ̸= 0. The circular photon orbits
  with E = 0 are the null generators of the two Killing horizons H and Hin .
That the orbits listed above are the only E ̸= 0 circular orbits follows from the fact that all the
other E ̸= 0 spherical photon orbits have either q > 0 or qmin < q < 0 (cf. Fig. 12.6), which
imply that they have a varying θ (cf. Sec. 12.2.4).
The spherical photon orbit of radius r0 is stable ⇐⇒ R′′ (r0 ) < 0. (12.75)
Remark 5: The criterion (11.159) for timelike circular orbits involves V ′′ (r0 ) > 0 simply because
476                      Null geodesics and images in Kerr spacetime
      of the minus sign in the definition of V from R: V(r) := −R(r)/(µ2 r4 ), whereas we have here
      R(r) := R(r)/E 2 .
         An immediate consequence of Lemma 12.5 is
        Property 12.16: unstability of spherical photon orbits in the black hole exterior
All spherical photon orbits in the black hole exterior are unstable.
      Proof. The quartic polynomial R(r) of spherical photon orbits in MI belong to case 2 of
      Lemma 12.5, which states that R(r) > 0 for r ∈ (r+ , r0 ) ∪ (r0 , +∞), i.e. r0 corresponds to a
      minimum of R (cf. Fig. 12.5b).
          We are however going to see that there exist stable spherical photon orbits in region MIII .
      From expression (12.25b) for R, we get R′′ (r0 ) = 12r02 + 2(a2 − ℓ2 − q). Substituting the values
      (12.56) and (12.57) for respectively ℓ and q yields
                                                                           a2
                                                                            
                                           8r0
                              ′′
                            R (r0 ) =                       3      3
                                                    (r0 − m) + m 1 − 2            .               (12.76)
                                       (r0 − m)2                           m
      Hence the stability criterion (12.75) becomes
                                                                                
                                                                            a2
                                   0
                                  
                                    r  < 0   and  (r0 − m) 3
                                                              + m3
                                                                     1−     m2
                                                                                      >0
             R′′ (r0 ) < 0 ⇐⇒        or                                                          (12.77)
                                                                                
                                   r > 0 and (r − m)3 + m3 1 −
                                                                            a2
                                      0                0                    m2
                                                                                      < 0.
      Now
                                                      1/3
                          a2                       a2
                                             
                   3      3                                         ms
          (r0 − m) + m 1 − 2 > 0 ⇐⇒ r0 − m > −m 1 − 2      ⇐⇒ r0 > rph ,
                          m                        m
      where                                     "          1/3 #
                                                        a2
                                                    
                                       ms
                                      rph   := m 1 − 1 − 2        .                              (12.78)
                                                        m
  All spherical photon orbits are unstable with respect to radial perturbations, except for a
  subclass of inner orbits with negative energy (E < 0): those that have a radius r0 ∈ (0, rph
                                                                                             ms
                                                                                                ),
  where rph is given by Eq. (12.78). In particular, all spherical photon orbits in MI (the black
          ms
hole exterior) are unstable, as well as all circular photon orbits discussed in Sec. 12.3.3.
That all stable orbits have E < 0 follows from Property 12.10, given the inequality (12.80).
    The stable spherical photon orbits have q > 0 and ℓ > 0 (cf. right panel of Fig. 12.6).
According to the results of Sec. 12.2.4, they thus oscillate symmetrically about the equatorial
plane, between two θ-turning points, θm and π − θm , such that 0 < θm < π/2. In particular,
r0 = rms does not correspond to a unique orbit, but to a 1-parameter family of marginally stable
orbits; the parameter can be chosen to be the azimuthal coordinate φ at the first value θ = θm
(or θ = π/2) after t = 0. A marginally stable spherical photon orbit is shown in Fig. 12.12.
    We have the following property, which appears clearly on Fig. 12.6:
  Among all inner spherical photon orbits, the marginally stable orbits at r0 = rph
                                                                                 ms
                                                                                    are those
  for which the reduced angular momentum ℓ and reduced Carter constant q are maximal.
and r0 = 3m. The former regards the inner spherical orbits, while the latter regards the outer
ones. Again, from the graph shown in Fig. 12.6, it is clear that r0 = rph
                                                                        ms
                                                                           realizes a maximum of
qc among all inner spherical orbits.
Remark 6: Another proof can be given by using the same general argument as that employed in
Sec. 11.5.3 for showing that the ISCO realizes an extremum of the specific energy and specific angular
momentum of timelike circular equatorial orbits (Property 11.32). Indeed, considering R as a function
of ℓ and q, in addition to r, i.e. writing R = R(r, ℓ, q), the marginal stable orbit obeys
                                         ∂R                                  ∂2R
       R(r0 , ℓc (r0 ), qc (r0 )) = 0,      (r0 , ℓc (r0 ), qc (r0 )) = 0,       (r0 , ℓc (r0 ), qc (r0 )) = 0.
                                         ∂r                                  ∂r2
478                             Null geodesics and images in Kerr spacetime
       r̂
       m cosθ
                                                        r=0
        3                                               r = r−
                                                        r = r+
                                                                     r̂
                                                                    m cosθ
                                                                    1.0
        2
1 0.5
                                                      r̂                                                             r̂
                                                      m sinθ                                                         m sinθ
                   1        2        3         4                                0.5       1.0       1.5      2.0
1 0.5
2 1.0
      Figure 12.17: Trace of the photon region (pale and dark green areas) in a meridional plane (t, φ) = const
      of a Kerr spacetime with a = 0.95 m (as in Fig. 12.6). The right panel is a zoom on the inner       √ part of the left
      one. The meridional plane is described in terms of the coordinates (r̂, θ) where r̂ := (r + r2 + 4m2 )/2 [cf.
      Eq. (12.69)]. Some spherical photon orbits are shown as green circular arcs, with the dashed ones representing the
      polar spherical orbits, i.e. ℓ = 0 orbits. The black (resp. light brown) half-circle at r = r+ (resp. r = r− ) is the
      trace of the outer (resp. inner) Killing horizon. The dotted orange half-circle marks the locus of r = 0, with the
      red dot indicating the curvature singularity at r = 0 and θ = π/2. The ergoregion is shown in grey. The region
      of stable spherical orbits is colored in dark green, with its boundary at r = rph  ms
                                                                                             drawn in blue. Green dots mark
      photon circular orbits: from the left to the right, they are the two vortical circular orbits at r = rph
                                                                                                            ∗∗
                                                                                                               , the equatorial
      inner circular orbit at r = rph , the prograde outer circular orbit at r = rph and the retrograde outer circular orbit
                                    ∗                                               +
               −
      at r = rph . The thin dotted grey half-circle marks r = rph  ∗∗
                                                                      . [Figure generated by the notebook D.5.11]
      Deriving the first and second equations with respect to r0 and using the third equation, we get a
      homogeneous linear system for (dℓc /dr0 , dqc /dr0 ), similar to the system (11.172). The unique solution
      is then (dℓc /dr0 , dqc /dr0 ) = (0, 0), yielding the extremum in the functions ℓc (r0 ) and qc (r0 ) at
             ms .
      r0 = rph
      This is the maximum of the reduced Carter constant over all the spherical photon orbits (cf.
      Fig. 12.6).
                                12.4 Black hole shadow and critical curve                                            479
(12.56) for ℓ and formula (12.54) for q̃ := q + (ℓ − a)2 into expression (12.25a) for R(r) leads to
                                                                    a2 qc (r0 )
                                                                               
                                                   2   2
                                R(r) = (r − r0 ) r + 2r0 r −                      ,               (12.84)
                                                                        r02
      where qc (r0 ) is the function (12.57). This writing of R(r) clearly exhibits the double root at
      r = r0 . As discussed in Sec. 11.3.6 and 12.2.5, a consequence of the double-root behavior is that
      any critical null geodesic L of parameter r0 has r0 as an asymptotic r-value, i.e. r(λ) → r0
      when λ → +∞ or λ → −∞, where λ is the affine parameter along L .
          Let us consider a critical null geodesic L of parameter r0 that goes through a point A of
      Boyer-Lindquist coordinates (tA , rA , θA , φA ). By plugging (12.84) into the integrated equation
      of motion (11.55a), we get, at any point of coordinates (t, r, θ, φ) along L ,
                            Z r                                         Z θ
                                               ϵr dr̄                        ϵθ dθ̄
                            −             p
                                                                     2
                                                                       =− q          .            (12.85)
                                             2            2
                             rA |r̄ − r0 | r̄ + 2r0 r̄ − a qc (r0 )/r0   θA   Θ̃(θ̄)
      When r → r0 , it is clear that the integral in the left-hand side diverges logarithmically in terms
      of |r − r0 |. The path integral in the right-hand side must therefore diverge as well, which
      implies that L has an endless oscillatory θ-motion as r → r0 . Similarly the integrated equation
      of motion (11.55d) with expression (12.84) for R(r) becomes
                      Z r                                                            Z θ
                                             (2mr̄ − aℓ) ϵr dr̄                              ϵθ dθ̄
        φ − φem = a−                                 p
                                                                                2
                                                                                  + ℓ−           q        .
                                        2          2     2            2
                        rA |r̄ − r0 |(r̄ − 2mr̄ + a ) r̄ + 2r0 r̄ − a qc (r0 )/r0     θA sin2 θ̄   Θ̃(θ̄)
                                                                                                    (12.86)
      Again, the first integral diverges when r → r0 . The second one, which is a path integral on θ,
      diverges as well because the path integral in the right-hand side of Eq. (12.85) diverges. From
      expression (12.56) for ℓ it can be checked that 2mr0 − aℓ > 0 in the allowed range of r0 (i.e.
      in the range (12.59), where spherical orbits exist). Given that ϵr dr̄ > 0 and ϵθ dθ̄ > 0, we
      conclude that the integral on r and the path integral on θ both tend to +∞ when r → r0 . If
      ℓ ≥ 0, we get then immediately φ → +∞ when r → r0 . If ℓ < 0, the two diverging terms in
      the right-hand side of Eq. (12.86) have opposite signs. Disregarding some unexpected subtle
      compensation, one term (in practice the second one) dominates over the other one, so that we
      conclude that, whatever the sign of ℓ,
φ → ±∞ when r → r0 . (12.87)
Figure 12.18: Critical null geodesic of parameter r0 = 2.2 m in a Kerr spacetime with a = 0.95 m, depicted
in terms of the Cartesian Boyer-Lindquist coordinates (x, y, z) [Eq. (11.73)]. This geodesic has ℓ = 0.863 m,
q = 18.042 m2 and is emitted at λ = 0 from the point of Boyer-Lindquist coordinates (r, θ, φ) = (40 m, π/2, 0)
⇐⇒ (x, y, z) = (40 m, 0, 0), towards the black hole (ϵr = −1). The drawing is interrupted at λ = 80 m/E. The
grey sphere is the black hole event horizon at r = r+ = 1.312 m. [Figure generated by the notebook D.5.13]
Example 9: Figure 12.18 shows a critical null geodesic emitted from the equatorial plane at a large
distance from the black hole (the emission point is not shown on the figure that is truncated at r ∼ 10 m).
We note the winding around the sphere r = r0 = 2.2 m, in the same fashion as a spherical photon orbit
(compare with Figs. 12.8 and 12.9).
      Figure 12.19: Two null geodesics with constants of motion (ℓ, q) close to those of a critical null geodesic,
      (ℓc (r0 ), qc (r0 )) [Eqs. (12.56)–(12.57)]. Here, as in Fig. 12.18, r0 = 2.2 m and a = 0.95 m. The two geodesics depart
      from the same point at (x, y, z) = (40 m, 0, 0) (outside the scope of the figure) as the critical null geodesic shown
      in Fig. 12.18. Both geodesics have q = qc (r0 ), but the green one has ℓ = 1.0001 ℓc (r0 ) while the orange one has
      ℓ = 0.9999 ℓc (r0 ). [Figure generated by the notebook D.5.13]
      close to S0 . The geodesic L2 (orange curve) has ℓ = 0.9999 ℓc (r0 ); it is graphically indistinguishable
      from L1 until a full turn onto S0 (this is best seen in the interactive 3D view in the online notebook
      D.5.13), it then clearly departs from it and eventually terminates into the black hole. Hence L2 has no
      r-turning point.
          Let us consider an asymptotic inertial observer O (cf. Sec. 10.7.5), i.e. a static observer
      located at Boyer-Lindquist coordinates (t, rO , θO , φO ), where (rO , θO , φO ) are constant and
      rO ≫ m. As in the Schwarzschild case (Sec. 8.5.4), the concept of black hole shadow is
      defined by considering an emitting large sphere S of constant Boyer-Lindquist coordinate
      rS that encompasses both the black hole and observer O; this means that rS > rO . In a more
      astrophysical setting, one could image S as being made of many far-away light sources. We
      assume that O is equipped with a screen in the direction of the black hole, in the same set up
      as described in Sec. 12.2.3 (cf. Fig. 12.4). Given the relative position of O, the black hole and
      the emitting sphere S , any photon emitted from S that reaches O’s screen has necessarily a
      point of minimal approach to the black hole, i.e. the associated null geodesic has necessarily a
      r-turning point between S and O. Reciprocally, any null geodesic that impacts O’s screen and
      has no r-turning point in its past cannot have emerged from S and therefore results in a black
      dot on the screen. According to the above discussion, the boundary of the black area on O’s
      screen is made by the impact points of critical null geodesics and is called the critical curve
      [234, 235]; we shall denote it by C . The black area itself is called the black hole shadow. It
      is called so essentially because if the black hole were not present, it would not exist and O’s
      screen would be uniformly bright.
      Remark 1: The concept of black hole shadow is rather academic, since it requires the sources of light
                             12.4 Black hole shadow and critical curve                                     483
to be far from the black hole and to surround it, as well as the observer, from any direction. On the
contrary, astrophysical black holes are illuminated by close sources (e.g. an accretion disk) and we
shall see in Sec. 12.5 that the black area on astronomical images has little resemblance with the shadow
defined above. On the other hand, the critical curve has a true observational significance, as we shall
discuss in Sec. 12.5, and is definitely worth to study.
    In what follows, we shall assume that the observer O in not located on the black hole rotation
axis, i.e. θO ̸∈ {0, π}, leaving the special case θO = 0 or π to Sec. 12.4.3. Then sin θO ̸= 0 and
we have seen in Sec. 12.2.3 that the impact point of a null geodesic L on O’s screen, measured
by the screen angular coordinates (α, β), is related to the reduced angular momentum ℓ and
reduced Carter constant q of L by formulas (12.32). Given the above definition, the critical
curve C is obtained by using for (ℓ, q) in (12.32) the critical values (ℓc (r0 ), qc (r0 )) given by
Eqs. (12.56)–(12.57):
                                    ℓc (r0 )
                           α=−                                                                 (12.88a)
                                   rO sin θO
                                    s                                      
                              ϵθ                                  ℓc (r0 )2
                           β=          qc (r0 ) +   cos2       2
                                                           θO a −            .                 (12.88b)
                              rO                                  sin2 θO
This provides the equation of C in parametric form, the parameter being r0 — the radius of the
spherical orbits that have the same value of (ℓ, q) as the critical null geodesic that impacts O’s
screen at the point (α, β). Let us recall that (α, β) are angular coordinates and are therefore
dimensionless (cf. Sec. 12.2.3). The range of r0 for spherical orbits in the black hole exterior is
 +
rph ≤ r0 ≤ rph  −
                  [Eq. (12.59)], with rph+
                                            and rph
                                                  −
                                                     given by Eq. (12.62). The corresponding range of
ℓ = ℓc (r0 ) is then ℓc (rph ) ≤ ℓ ≤ ℓc (rph ) (cf. the red curve in Fig. 12.6) and the corresponding
                          −                +
range of q = qc (r0 ) is 0 ≤ q ≤ 27 m2 [cf. Eq. (12.83)]. However, not all these values of (ℓ, q)
are allowed, since in order for the corresponding geodesic to reach O, they have to obey the
constraint (12.29):
Except for θO = π/2 (observer in the equatorial plane), this constraint limits the range of
r0 to a subinterval [r0min , r0max ] of [rph
                                          +     −
                                             , rph ]. We note that the radius rphpol
                                                                                     of polar spherical
photon orbits, as given by Eq. (12.67a), lies necessarily in that subinterval; indeed by definition
     pol
ℓc (rph  ) = 0, which ensures that the constraint (12.89) is fulfilled for r0 = rph   pol
                                                                                          , whatever the
value of θO .
     The critical curve C in the observer’s screen is obtained by first solving Eq. (12.89) with
equality in the sign ≥ to determine r0min and r0max . Then, the upper half of C is computed
by means of formulas (12.88) with ϵθ = +1 and r0 ranging in [r0min , r0max ]. The lower half is
obtained similarly, but with ϵθ = −1 in Eq. (12.88b), so that it is the symmetric of the upper
half with respect to the α-axis.
     The result of the computation is shown in Fig. 12.20 for a Kerr black hole with a = 0.95 m
and various inclination angles of observer O. For θO = π/2 the parameter r0 of the critical
null geodesics forming C spans the full interval [rph    +    −
                                                           , rph ] ≃ [1.386 m, 3.955 m]. For θO = π/6,
the range is restricted to [r0 , r0 ] ≃ [1.768 m, 3.237 m], while for θO = 0 (to be discussed
                               min max
484                              Null geodesics and images in Kerr spacetime
                                            2
                                 (r /m) β
                                                                                                          1.39 m   r0   1.90 m
                                                                                                          1.90 m   r0   2.41 m
                                            0                                                             2.41 m   r0   2.93 m
                                                                                                          2.93 m   r0   3.44 m
                                                                                                          3.44 m   r0   3.96 m
                                            2
                                            6
                                                4       2       0        2     4    6             8
                                                                    (r /m) α
                         6                                                                    6
4 4
                         2                                                                    2
              (r /m) β
(r /m) β
0 0
2 2
4 4
                         6                                                                    6
                             4   2          0       2       4        6                                6   4        2      0      2   4   6
                                            (r /m) α                                                                (r /m) α
      Figure 12.20: Shadow of a Kerr black hole of mass m and spin parameter a = 0.95 m on the screen of an
      asymptotic inertial observer O located at r = rO and θ = θO , for three values of θO : θO = π/2 (O in the
      equatorial plane; upper panel), θO = π/6 (lower left) and θO = 0 (O on the rotation axis; lower right). The screen
      is spanned by the angular coordinates (α, β), rescaled by the inverse of the factor m/rO , whose values for some
      astrophysical black holes can be found in Table 12.1. The shadow is bounded by the critical curve C , which is
      depicted with colors indicating the range of the parameter r0 of the critical null geodesics forming it. [Figure
      generated by the notebook D.5.14]
                                  12.4 Black hole shadow and critical curve                                        485
Table 12.1: Scale factor m/rO for various astrophysical black holes: the supermassive black hole at the center
of our galaxy, Sagittarius A* (data taken from Table A.1 of Ref. [4]), the supermassive black hole M87* in the
nucleus of the galaxy Messier 87 (data from Refs. [214, 6]; see also Appendix I and Table 9 of Ref. [8]), the
supermassive black hole M31* in the nucleus of the Andromeda Galaxy (Messier 31) (data from Ref. [55]) and
the stellar black hole Cygnus X-1 (data from Refs. [393, 428, 225]). The last line gives m/rO in microarcseconds
(1 µas = 4.848 10−12 rad).
   4
       For comparison, the angular resolution of the Hubble Space Telescope is ∼ 0.1′′ = 105 µas!
486                         Null geodesics and images in Kerr spacetime
                                            5.20
                                            5.15
                                            5.10
                             (r /m) Rshad
                                            5.05
                                            5.00
                                            4.95
                                            4.90
                                            4.85
                                                   0.0        0.2          0.4         0.6    0.8     1.0
                                                                                 a/m
      Figure 12.21: Angular radius Rshad of the (circular) critical curve of a Kerr black hole as seen by an asymptotic
      inertial observer located at r = rO on the black hole’s rotation axis, as a function of the Kerr spin parameter a
      [Eq. (12.90)]. Rshad is given in units of m/rO (cf. Table 12.1). [Figure generated by the notebook D.5.14]
                                                          2      1 2   pol     2
                                                                                  
                                                         α + β = 2 qc (rph ) + a .
                                                                rO
      Given expression (12.57) of the function qc and the fact that rph
                                                                     pol
                                                                         obeys (rph
                                                                                 pol 4 pol
                                                                                    ) (rph − 3m)2 =
      a4 (rph + m)2 , as a solution of ℓc (r0 ) = 0, we get
           pol
                                                              s          q
                                                                   pol      pol 2
                         p             m                          rph     (rph ) − a2
              Rshad :=    α2 + β 2 = 2                                      pol
                                                                                             (θO = 0 or θO = π)   (12.90)
                                       rO                           m      rph  −m
      where rph
              pol
                  is the function (12.67a) of (m, a) and the scale factor m/rO is given for some
      astrophysical black holes in Table 12.1.
      Example 11: An example of shadow seen from the rotation axis is shown in the lower right panel of
                                                                                               pol
      Fig. 12.20. The Kerr spin parameter is a = 0.95 m, for which formula (12.67a) yields rph     ≃ 2.493 m,
      so that Eq. (12.90) results in Rshad ≃ 4.875 m/rO . We note that the critical curve C is depicted in the
                                12.4 Black hole shadow and critical curve                                          487
same color (green) as the part of the critical curve in the other panels that crosses the β-axis, which is
expected since the critical null geodesics that reach the screen along that axis have ℓ = 0 and thus the
                        pol
same parameter r0 = rph     as all the critical null geodesics forming C for θO = 0 or π.
    The shadow radius Rshad is a decreasing function of a, which is depicted in Fig. 12.21. Note
that its variation range is pretty limited, since the limits lima→0 rph
                                                                     pol
                                                                         = 3m and lima→m rphpol
                                                                                                =
 √
( 2 + 1)m (cf. Sec. 12.3.2) yield respectively
                √ m           m                                    √       m         m
   lim Rshad = 3 3    ≃ 5.196                 and    lim Rshad = 2( 2 + 1)    ≃ 4.828 .                 (12.91)
   a→0             rO         rO                     a→m                   rO        rO
The value for a = m is thus only 7% lower than the value for a = 0.
Remark 2: The limit a → 0 is in agreement with the result for the shadow of a Schwarzschild black
hole obtained in Sec. 8.5.4.
                                            r02
                                ℓc (r0 ) = − + 2r0 + m              (a = m)                             (12.92)
                                            m
                                      r03
                                          (4m − r0 )
                                 qc (r0 ) =             (a = m).                   (12.93)
                                     m2
Substituting these values into the system (12.88) leads to the screen angular coordinates
determining the critical curve C of an extremal Kerr black hole:
   5
     The values of rmin and rmax can be computed exactly as rmin = max(r1 , m) and rmax = r2 , where r1 and r2
are two roots of the quartic polynomial in r0 that appears under the square root in Eq. (12.94b). However, doing
so would lead to complicated expressions, while a computation by numerical root finding, as in the notebook
D.5.14, is sufficient in practice.
488                       Null geodesics and images in Kerr spacetime
      generic expression (12.57) for qc has a factor (r0 − m)2 in its denominator, so that taking the
      limits a → m and r0 → rph    +
                                       = m yields the indeterminate form “0/0”. As a consequence of
      qc (rph ) ̸= 0, one has β ̸= 0 for r0 = rph
           +                                     +
                                                   = m and |θO − π/2| sufficiently small. This implies
      that rmin = m for a finite range of values of θO around π/2. More precisely, according to (12.94),
      the screen coordinates for r0 = rph  +
                                              = m are
                                                   m
                                α|r0 =m = −2                                                       (12.96a)
                                               rO sin θO
                                              ϵθ m p
                                β|r0 =m   =             3 − 6 cos2 θO − cos4 θO .                  (12.96b)
                                            rO sin θO
      In particular, for θO = π/2 (observer O in the equatorial plane), we get
                                  m                            √ m               π
                     α|r0 =m = −2        and       β|r0 =m = ϵθ 3            θO =    .              (12.97)
                                  rO                               rO             2
      Equation (12.96b) shows that, for ϵθ = +1, β|r0 =m > 0 ⇐⇒ θcrit < θO < π − θcrit , where
      θcrit is such that cos2√θcrit is the positive root of the quadratic
                                                                     √ polynomial
                                                                          √       −x2 − 6x + 3 = 0.
      We get cos θcrit = 2 3 − 3, from which sin θcrit = 4 − 2 3 = ( 3 − 1) , hence
                  2                                    2                         2
                                                 √
                                θcrit = arcsin( 3 − 1) ≃ 0.8213 rad ≃ 47.06◦ .              (12.98)
      For θO = θcrit or π − θcrit , we have exactly β|r0 =m = 0, while for θO < θcrit or θO > π − θcrit ,
      β|r0 =m is imaginary. This means that for θcrit ≤ θO ≤ π − θcrit , the range of the parameter r0
      is [m, rmax ], while for for θO < θcrit or θO > π − θcrit , the range is [rmin , rmax ] with rmin > m,
      as for the case θO ̸= π/2 of the shadows with a < m discussed in Sec. 12.4.2. In this last case,
      the critical curve is a smooth curve with β|r0 =rmin = 0 (in addition to β|r0 =rmax = 0, which
      always holds).
           Let us focus on the first case, i.e. θcrit ≤ θO ≤ π − θcrit . The parametric curve defined by
      the system (12.94) terminates at r0 = m on the two points given by Eq. (12.96), or Eq. (12.97)
      in the particular case θO = π/2 (one point for ϵθ = +1, and the other one for ϵθ = −1). For
      θcrit < θO < π − θcrit , one has β ̸= 0 at these two points, so that the curve is not closed, as one
      can see on Fig. 12.22 (drawn for θO = π/2). One may be puzzled by this feature: the shadow
      boundary has to be closed! We are thus missing some critical null geodesics to complete the
      boundary. It is easy to find the missing ones as soon as we remember that in the special case
      a = m, we had found an extra family of spherical photon orbits in Sec. 12.3.1: those given
      by Eq. (12.50), namely the spherical photon orbits at r0 = m that have ℓ = 2m. Obviously,
      this family cannot be parameterized by r0 ; on the other hand, the reduced Carter constant
      q is a valid parameter. Since q is not constrained by Eq. (12.50), it can take all values in the
      range [0, +∞). Note that q < 0 is not permitted here since ℓ = 2m > a [cf. property (12.38)].
      However, not any value of q corresponds to a spherical photon orbit associated with a critical
      null geodesic that reaches the asymptotic inertial observer O. Indeed, q must give birth to a
                                      12.4 Black hole shadow and critical curve                                  489
                       (r /m) β
                                                                           1.00 m   r0   1.60 m
                                                                           1.60 m   r0   2.20 m
                                  0                                        2.20 m   r0   2.80 m
                                                                           2.80 m   r0   3.40 m
                                                                           3.40 m   r0   4.00 m
                                  2
                                  6
                                      4    2   0      2       4   6   8
                                                   (r /m) α
Figure 12.22: Part of the critical curve of an extremal Kerr black hole (a = m) in the screen of an asymptotic
inertial observer O in the equatorial plane (θO = π/2), as defined by the parametric equations (12.94),
                                                                                                      √ with
r0 ∈ [m, 4m]. The two endpoints are achieved at r0 = m and are given by Eq. (12.97): (ᾱ, β̄) = (−2, ± 3) ≃
(−2, ±1.732), where ᾱ := αrO /m and β̄ := βrO /m. [Figure generated by the notebook D.5.14]
radial polynomial R(r) positive for all r > m, so that the radial motion is possible between the
spherical orbit at r0 = m and the observer [condition (12.46)]. Specializing expression (12.25)
for R(r) to a = m and ℓ = 2m, we get
r = m appears as a double root of R, which confirms that we are dealing with spherical photon
orbits at r0 = m. There are two other roots, which depend on q:
                                          p
                                   rq± = ± m2 + q − m.                                (12.100)
Since q ≥ 0, we have rq+ ≥ 0 and rq− ≤ −2m. It is then clear (cf. Fig. 12.23) that
The critical value is rq+ = m, which correspond to q = 3m2 (red curve in Fig. 12.23). r = m is
then a triple root of R: q = 3m2 =⇒ R(r) = (r − m)3 (r + 3m). Since rq+ is an increasing
function of q, we conclude:
Property 12.21
  For a = m, a critical null geodesic of constants of motion (ℓ, q) can reach the asymptotic
  region r ≫ m from the vicinity of a spherical photon orbit at r0 = m iff
  Furthermore, such a critical null geodesic can reach the asymptotic inertial observer O,
who is located at θ = θO , only if the Θ̃ function associated with (ℓ, q) obeys Θ̃(θO ) ≥ 0. Setting
490                          Null geodesics and images in Kerr spacetime
                                        1.5
                                                    q = m2
                                                    q = 3m 2
                                                    q = 5m 2
                                        1.0
0.5
                             R(r)/m 4
                                        0.0
0.5
                                        1.0
                                              0.0              0.5         1.0        1.5       2.0
                                                                          r/m
      Figure 12.23: Radial polynomial R(r) for a = m, ℓ = 2m and three values of q. All the polynomials have a
      double root at r = m and the dots mark the root rq+ [Eq. (12.100)], the fourth root rq− ≤ −2m being out of the
      figure’s scope. The dotted horizontal lines indicate the range of r where a null geodesic motion to the asymptotic
      inertial observer is possible. [Figure generated by the notebook D.5.14]
                                              2                                                              r0 = m
                               (r /m) β
                                              6
                                                  4       2       0       2       4    6             8
                                                                      (r /m) α
                   6                                                                             6
4 4
                   2                                                                             2
        (r /m) β
(r /m) β
0 0
2 2
4 4
                   6                                                                             6
                       4   2              0           2       4       6       8                          4    2      0       2      4   6   8
                                              (r /m) α                                                                   (r /m) α
Figure 12.24: Critical curve C (colored curve) and shadow (grey area) of an extremal Kerr black hole on the
screen of an asymptotic inertial observer O located at r = rO and θ = θO , for three values√    of θO : θO = π/2
(O in the equatorial plane; upper panel), θO = π/3 (lower left) and θO = θcrit = arcsin( 3 − 1) ≃ 0.821
[Eq. (12.98)]. The NHEK line (plotted in maroon) is present in the first two images, while it is vanishing in the
third one (marginal case), since there is no NHEK line in C for θ < θcrit or θ > π − θcrit . As in Fig. 12.20, the
color code corresponds to some selected ranges for the parameter r0 of the critical null geodesics forming C .
[Figure generated by the notebook D.5.14]
492                      Null geodesics and images in Kerr spacetime
      Cardioid shape
      As seen from Fig. 12.24, the departure of the critical curve C from a perfect circle is maximal
      when the observer lies in the equatorial plane, i.e. when θO = π/2. The critical curve is often
      said to have a D-shape (from the letter D). It is interesting that this shape corresponds actually
      to a simple mathematical curve: the convex hull of a cardioid [193]. Indeed, for θO = π/2,
      equations (12.94), which govern the part of C parameterized by r0 , simplify to
      and substitute it into the square of Eq. (12.106b); we get (using ϵ2θ = 1)
                                            r     3   r
                               m2
                                                             
                                               rO          rO
                             2
                            β = 2         1+ 2+ α     3− 2+ α .                                 (12.107)
                                rO             m           m
      At this stage, it is worth to introduce explicitly the rescaled angular coordinates used in
      Figs. 12.22 and 12.24:
                                           rO                     rO
                                    ᾱ :=     α     and     β̄ :=    β.                   (12.108)
                                           m                      m
      Then, by expanding its right-hand side, we can rewrite Eq. (12.107) as
                                                         √
                                    β̄ 2 = −ᾱ2 + 2ᾱ + 8 ᾱ + 2 + 11.                    (12.109)
Let us introduce new screen coordinates (α̂, β̂) by shifting the origin to (ᾱ, β̄) = (−1, 0):
      The reader may have recognized the Cartesian equation of a cardioid. To put it in a more familiar
      form, let us introduce the polar coordinates (ρ, ϕ) defined by ρ2 := α̂2 + β̂ 2 , cos ϕ := α̂/ρ
      and sin ϕ := β̂/ρ. Then Eq. (12.112) becomes (ρ2 − 4ρ cos ϕ)2 = 16ρ2 , which is equivalent to
                                 12.4 Black hole shadow and critical curve                                              493
                                     β̄
                                          2
                                          6
                                                2    0      2        4   6    8
                                                                ᾱ
Figure 12.25: Cardioid defined by the polar equation ρ = 4 (1 + cos ϕ) (dotted blue curve) and critical curve of
an extremal Kerr black hole seen from the equatorial plane (red curve). The screen coordinates (p ᾱ, β̄) are defined
by Eq. (12.108), while (ρ, ϕ) are polar coordinates around the point (ᾱ, β̄) = (−1, 0), i.e. ρ := (ᾱ + 1)2 + β̄ 2
and sin ϕ := β̄/ρ. [Figure generated by the notebook D.5.14]
(ρ − 4 cos ϕ)2 = 16, i.e. to ρ = ±4 + 4 cos ϕ. Given that ρ ≥ 0, the ± sign must be +, so that
we end up with
                                      ρ = 4 (1 + cos ϕ) .                             (12.113)
We recognize the polar equation of a cardioid, generated by a circle of radius 2 rolling around a
circle of the same radius and centered at (α̂, β̂) = (2, 0) ⇐⇒ (ᾱ, β̄) = (1, 0).
    As it is clear from the starting point of the above calculation, the cardioid corresponds only
to the part of the critical curve parameterized by r0 , i.e. the part depicted in Fig. 12.22. It does
not reproduce the NHEK line. Actually, adding the NHEK line results in the convex hull of the
cardioid, as shown in Fig. 12.25.
Remark 3: It is amusing to note that the cardioid is involved in two optical phenomena of very distinct
origin: in classical optics, it appears as the caustic generated by reflection of light on a circular material,
such as a bowl or a coffee cup, and at the same time, it delineates the shadow of some black holes of
general relativity.
Remark 4: For θO ̸= π/2, the critical curve C of the extremal Kerr black hole is no longer a cardioid.
It is however still a quartic algebraic curve, i.e. (ᾱ, β̄) obey an algebraic equation of degree 4, as in
Eq. (12.112). Moreover, C is a classical elementary curve, namely a Cartesian oval, also known as
oval of Descartes, as shown in Ref. [237]. More precisely, C is exactly such an oval for θO ≤ θcrit or
θO ≥ π − θcrit . For θcrit < θO < π − θcrit , i.e. when the NHEK line is present, C is the convex hull of a
Cartesian oval.
                                                                     θ = π/2
                                                 6
                                    (r /m) β
                                                 0                                                                a=0
                                                                                                                  a = 0.5 m
                                                                                                                  a=m
                                                 2
                                                 6
                                                         6   4   2     0       2              4       6
                                                                 (r /m) α
                                               θ = π/6                                                                    θ =0
                        6                                                                         6
4 4
                        2                                                                         2
             (r /m) β
                                                                                   (r /m) β
                        0                                                                         0
2 2
4 4
                        6                                                                         6
                            6   4      2         0       2   4   6                                        6   4       2       0   2   4   6
                                       (r /m) α                                                                       (r /m) α
      Figure 12.26: Critical curves for different values of a at fixed observer’s inclination angle: θO = π/2 (top
      panel), θO = π/6 (lower left panel) and θO = 0 (lower right panel). [Figure generated by the notebook D.5.14]
      value of θO for various values of the black hole spin parameter a. For θO = 0, we recover the
      feature found in Sec. 12.4.3, namely that the critical curve depends very weakly on the spin.
      Historical note : The critical curve of a Kerr black hole has been computed first by James M. Bardeen
      in 1972 [38] (cf. the historical note on p. 276), in the form of the parametric system (12.88). He wrote:
      The rim of the “black hole”6 corresponds to photon trajectories which are marginally trapped by the black
      hole; they spiral around many times before they reach the observer. It is conceptually interesting, if not
      astrophysically very important, to calculate the precise apparent shape of the black hole. Fifty years
      later, with the first image from the Event Horizon Telescope [6, 521], this has become astrophysically
      important! For a = m and θO = π/2, Bardeen derived the system (12.106) governing the part of the
      critical curve parameterized by r0 . For the NEHK line, he simply noted7 : The non-uniform nature of
      the limit a → m allows q to range between 0 and 3m2 at r = m. Bardeen presented then a plot of the
      resulting critical curve (Fig. 6 in Ref. [38]) similar to that shown in the top panel of Fig. 12.24.
         6
             For Bardeen, “black hole” (with quotes) stands for the black spot on the observer’s screen.
         7
             q is denoted by η by Bardeen.
                                               12.5 Images                                               495
    The term black hole shadow for the interior of the critical curve has been coined by Heino Falcke,
Fulvio Melia and Eric Agol in 2000 [192], cf. the historical note on p. 501.
12.5      Images
12.5.1     Multiples images of a single source
The main features of images of a single luminous source on the screen of a remote observer
in Kerr spacetime are similar to those obtained for Schwarzschild spacetime in Sec. 8.5. In
particular, a single point-like source, wherever localized in the black hole exterior (region MI ),
gives birth to two infinite sequences of images on the screen of the asymptotic inertial observer,
both sequences converging to the critical curve C . Typically, one sequence is formed by null
geodesics having a reduced angular momentum ℓ > 0 and the other one by null geodesics
with ℓ < 0. Each image in a given sequence can be labeled by the number n of half-round trips
around the black hole and is dimmer and dimmer as n increases.
                                                                            (r /m) β
                                                                                       0
          θO = 0
                                                                                        5
                                                                                       10
                                                                                            10   5      0       5       10
                                                                                                     (r /m) α
                                                                                       10
                                                                            (r /m) β
                                                                                       0
         θO = π/6
                                                                                        5
                                                                                       10
                                                                                            10   5      0       5       10
                                                                                                     (r /m) α
                                                                                       10
                                                                                       5
                                                                            (r /m) β
                                                                                       0
         θO = π/3
                                                                                        5
                                                                                       10
                                                                                            10   5      0       5       10
                                                                                                     (r /m) α
                                                                                       10
                                                                                       5
                                                                            (r /m) β
                                                                                       0
         θO = π/2
                                                                                        5
                                                                                       10
                                                                                            10   5      0       5       10
                                                                                                     (r /m) α
      Figure 12.27: Images of a thick accretion disk around a Kerr black bole with a = 0.5 m for three observer’s
      inclination angles θO . The right column shows the critical curve (red dotted line) superposed on the image.
      [Images produced by Gyoto with the input files given in Sec. E.2.2; the addition of the critical curve in the right column
      has been performed by means of the SageMath notebook D.5.15]
                                                      12.5 Images                                                497
                                                                               10
                                                                    (r /m) β
                                                                               0
    θO = 0
                                                                                5
                                                                               10
                                                                                    10   5      0       5   10
                                                                                             (r /m) α
                                                                               10
                                                                    (r /m) β
                                                                               0
   θO = π/6
                                                                                5
                                                                               10
                                                                                    10   5      0       5   10
                                                                                             (r /m) α
                                                                               10
                                                                               5
                                                                    (r /m) β
                                                                               0
   θO = π/3
                                                                                5
                                                                               10
                                                                                    10   5      0       5   10
                                                                                             (r /m) α
                                                                               10
                                                                               5
                                                                    (r /m) β
                                                                               0
   θO = π/2
                                                                                5
                                                                               10
                                                                                    10   5      0       5   10
                                                                                             (r /m) α
Figure 12.28: Same as Fig. 12.27 but for a = 0.95 m and the images in the right column using a log scale to
reveal the faint parts. The critical curves are the same as in Fig. 12.20.
498                       Null geodesics and images in Kerr spacetime
      the tertiary image. The green circle would then mark the location of polar spherical photon
      orbits, at rph
                   pol
                       ≃ 2.883 m for a = 0.5 m, since only the ℓ = 0 critical null geodesics matter for
      θO = 0. There is actually an infinite sequence of images, the image of order n being generated
      by null geodesics that have performed n half-turns around the sphere r = rph     pol
                                                                                           before leaving
      toward the observer, with n = 0 corresponding to the primary image. A confirmation of this
      interpretation is provided by the superposition of the critical curve C onto the image, as the
      red dotted line in the right column of Fig. 12.27. The tertiary image (n = 2) is then almost
      indistinguishable from C , in agreement with the exponential convergence of high order images
      to C . This exponential convergence has been established for a = 0 by Eq. (8.131), but it holds
      for a ̸= 0 as well [235].
           Another common feature with the Schwarzschild images of Fig. 8.25 is the left part of the
      primary image being brighter than the right part as soon as θO ̸= 0. As discussed in Sec. 8.5.5,
      this is due to the Doppler boosting resulting from the rotation of the accretion disk, the left
      part moving towards the observer, while the right part is receding.
           A difference with the Schwarzschild images discussed in Sec. 8.5.5 is that in the current
      case, the disk is not confined to the equatorial plane, being geometrically thick, and moreover it
      is optically thin. This means that a given null geodesic touching the screen “carries” not only a
      photon from the disk’s surface but also photons from the disk interior, actually all the photons
      emitted along the path of the geodesic inside the disk. This cumulative property results in a
      enhanced brightness of the secondary image and makes it appear as a relatively bright ring, as
      it is clearly seen on the θO = 0 and θO = π/6 images of Fig. 12.27. Another consequence of
      optical thinness is that the bottom part of the secondary and tertiary images are not blocked
      by the part of the accretion disk standing in the foreground for θO close to π/2, as they are in
      the images of Fig. 8.25, which have been obtained for an optically thick disk model.
           Another difference between the two sets of images is that in Fig. 12.27 the inner boundary
      of the top part of the primary image is closer to the secondary image than in Fig. 8.25 (for
      θO = 0, the primary and secondary images in Fig. 12.27 seem even to touch each other). This is
      a mere consequence of the inner radius of the accretion disk being closer to the black hole in
      Fig. 12.27, since rISCO
                          +
                               ≃ 4.233 m for a = 0.5 m and rISCO
                                                               +
                                                                    = 6m for a = 0.
           In the images of Fig. 12.27, the interior of the critical curve, i.e. the black hole shadow
      according to the definition given in Sec. 12.4.2, appears black, except when crossed by the
      bottom of the primary image, which arises from the foreground part of the disk. This feature
      is similar to the Schwarzschild case presented in Fig. 8.25. However, it does not persist for
      highly spinning black holes, as we can see on Fig. 12.28, which shows images for the Kerr
      parameter a = 0.95 m. There, the boundary of the central dark spot is clearly distinct from
      the critical curve and lies within the latter. This is due to the inner part of the accretion disk
      being located well within some spherical photon orbits, including the polar ones. Indeed, for
      a = 0.95 m, we have rISCO+
                                   ≃ 1.937 m, while the radius of the retrograde circular photon orbit
      is rph ≃ 3.995 m and that of polar spherical orbits is rph
           −                                                   pol
                                                                   ≃ 2.493 m. Given that rph+
                                                                                              ≃ 1.386 m,
      we have thus rph < rISCO < rph < rph . More generally, using formulas (11.167), (12.62) and
                        +     +        pol    −
      (12.67a), we have
           +
          rISCO    −
                < rph ⇐⇒ a > 0.638 m            and       +
                                                         rISCO    pol
                                                               < rph  ⇐⇒ a > 0.853 m.           (12.114)
      These properties are illustrated in Fig. 12.29. Hence, for large spins, if the inner boundary of
                                                     12.5 Images                                                    499
                             6                                                          +
                                                                                      rISCO
                                                                                      rph+
                                                                                      rph−
                             5                                                        rphpol
                       r/m
                             3
                             1
                                 0.0     0.2         0.4         0.6            0.8        1.0
                                                           a/m
Figure 12.29: Radius of the timelike prograde ISCO, rISCO
                                                     +
                                                          [Eq. (11.167)], compared with the radii of the
                                                               −
prograde and retrograde outer circular photon orbits, rph
                                                       +
                                                          and rph [Eq. (12.62)] and the radius of polar spherical
                pol
photon orbits, rph  [Eq. (12.67a)]. [Figure generated by the notebook D.5.15]
the accretion disk is set by the prograde ISCO, a large part of the primary image lies strictly
inside the critical curve C . Let us for instance consider the image for θO = 0 in Fig. 12.28,
which is the easiest to understand since for θO = 0, C involves only the ℓ = 0 critical null
geodesics (cf. Sec. 12.4.3), i.e. those that are rolling indefinitely around the sphere r0 = rph pol
orbits at r0 = rph
                 pol
                     . The emitted photons encounter then the screen scritly inside C . We shall
not provide a full demonstration here, but simply recall that this was established rigorously
for a = 0 in Chap. 8: formula (8.131) along with A(rem ) < 0 for rem < 3m = rph      pol
                                                                                         (a = 0) (cf.
Fig. 8.22) yields impact parameters bn that are lower than bc , the latter being nothing but the
                                        ±
Property 12.22: black spot in the image and black hole shadow
  If the black hole is surrounded by some emitting material located within some of the
  spherical photon orbits, the central black spot in the images does not correspond to the
  shadow defined in Sec. 12.4.2 as the interior of the critical curve C . The black spot is
  actually smaller than the shadow, lying strictly inside C .
This holds for a = 0 as well, if the emitting matter is not limited inward by the ISCO as in
Fig. 8.25 (see e.g. Fig. 5 in Ref. [234] or Fig. 2 in Ref. [494]). Note however that in the case of a
purely radial inflow, the emitted photons suffer such a strong Doppler effect that the interior of
the critical curve is very dark, so that the effective shadow coincides with the academic shadow
of Sec. 12.4.2, as shown in Refs. [192, 377].
    A striking feature of the images shown in Fig. 12.28 is that while the central black spot
departs considerably from spherical symmetry at large inclination angle θO , the secondary
500                        Null geodesics and images in Kerr spacetime
      and higher order images are pretty circular. This is because they accumulate onto the critical
      curve, which is quite close to a circle, as shown in Fig. 12.20. The only major effect of a high
      inclination angle is to move the high order images the right.
          Besides, we note on Fig. 12.28 that the secondary image (n = 1) is brighter than the primary
      one (n = 0). Moreover, the tertiary one (n = 3), which is very thin and superposed onto the
      secondary image, is even brighter. This is because the higher the image order, the larger the
      number n of half-turns around the black hole of the involved geodesics, and thus the longer
      the path length of the geodesics inside the disk, resulting a larger number of photons “carried”
      by a given geodesic and accumulating on the same pixel of the screen. Note that this holds only
      because the disk is optically thin. Would it be optically thick, a null geodesic impacting the
      screen, even if very close to a critical geodesic, would carry only a single photon: the photon
      emitted at the first intersection of the geodesic with the disk’s surface when traced backward in
      time from the screen. In Sec. 8.5.5 (Schwarzschild case), we considered such an optically thick
      disk (Page-Thorne model) and the secondary and tertiary images were not brighter than the
      primary one (cf. Fig. 8.25). However in that case, the main reason was that the emitting region
      was too far from the black hole for a close-to-critical geodesic to visit various parts of the disk.
      For instance, in the lower part of Fig. 8.26, consider the red geodesic that intersects the disk at
      the point of coordinates (x, y) = (0, 6m); it contributes to the tertiary image and if one would
      extend it to the past beyond the point (0, 6m), it would never encounter the disk again, leaving
      the figure at the upper left corner in a more or less straight line. We shall therefore state:
        Property 12.23: photon ring
        The set constituted by the secondary image and higher order images, i.e. all images of
        order n ≥ 1, is called the photon ring. It appears as a bright feature on the observer’s
        screen under two conditions: the emitting material must (i) be optically thin and (ii) be
        located in the vicinity of some spherical photon orbits.
      As shown in Fig. 12.28, the photon ring generally appears as a single ring because the successive
      images are superposed on each other and converge exponentially to the the critical curve C .
      Only the n = 1 and n = 2 parts of the photon ring are visible in Fig. 12.28 by lack of resolution,
      since the thickness of the n-th image decays exponentially with n [235]. We note on Figs. 12.27
      and 12.28 that the n = 2 image (tertiary image) is already very close to C . We conclude:
        Property 12.24: photon ring and critical curve
        In practice the photon ring, and more precisely the part n ≥ 2 of it, materializes the critical
        curve.
      The photon ring is therefore a potentially observable feature that can provide information on
      the spacetime metric independently of the emission model [304]. For instance, by analyzing the
      photon ring, one could check that the metric is the Kerr metric and measure the spin parameter
      a.
      Remark 1: The reader is warned that the term photon ring is sometimes used in the literature for circular
                                                                                                    ±
      photon orbits, as the orbit at r = 3m in Schwarzschild spacetime and the orbits at r = rph       in Kerr
                                                  12.5 Images                                                   501
spacetime (see e.g. Ref. [84] for a recent example). We are using it here not for an orbit around the black
hole but for a feature on the screen of a remote observer, in agreement with Refs. [47, 302, 304, 235, 237].
Recently the term secondary ring. has been introduced [494] to denote the subpart of the photon ring
that contributes significantly to the flux in the image. Besides, some authors have distinguished the
secondary image (n = 1) by calling it the lensing ring [234].
Historical note : In 1973, Christopher T. Cunningham and James M. Bardeen [146] computed the
multiple images of a star orbiting an extremal Kerr black hole (a = m). In 1979, Jean-Pierre Luminet
[348] predicted that a Schwarzschild black hole illuminated
                                                         √         by a parallel light beam would appear to a
remote observer as a black disk of radius Rshad = 3 3m/rO (cf. Eq. (12.91)) surrounded by a sequence
of rings, called ghost rings by Luminet, converging exponentiallty to the rim of the black disk (which is
the critical curve). These rings are fainter and fainter, the brightest ring being the outermost one. In
the very same article [348], Luminet presented the first computed image of an accretion disk around a
Schwarzschild black hole (cf. historical note on p. 280). The first images of an accretion disk around a
Kerr black hole with a ̸= 0 have been computed by S.U. Viergutz in 1993 [491], for a = 0.998 m. In 1997,
Michał Jaroszyński and Andrzej Kurpiewski [299] studied an optically thin accretion disk extending
inwards down to the event horizon of a Kerr black hole with a = 0, 0.5 m and 0.9 m, with some nonzero
radial component of the accretion flow. They showed that the observed intensity just outside the critical
curve is enhanced by the long path length of geodesics orbiting many times around the black hole, while
the intensity inside the critical curve is low, even if there is some emitting material close to the black
hole. In 2000, Heino Falcke, Fulvio Melia and Eric Agol [192] (see also Ref. [191]) have computed images
of an optically thin accretion flow around a Kerr black hole with a = 0.998 m. The accretion flow was
assumed to extend down to the event horizon, infalling with constant angular momentum inside the
ISCO. In agreement with Jaroszyński and Kupriewski’s prediction, they obtained a large intensity just
outside the critical curve and low intensity inside it. For this reason, they introduced the term shadow of
the black hole for the interior of the critical curve, the latter being called by them the apparent boundary
of the black hole. Moreover, in the same article [192], they advanced that, in the case of Galactic Center
black hole Sgr A*, the shadow could be observed by means of very-long-baseline interferometry (VLBI),
thereby proposing what would become the Event Horizon Telescope two decades later [6].
15
10
5 jet
                                  (r /m) βs
                                              0
10
                                              15
                                                   15   10    5      0      5      10      15
                                                               (r /m) αs
      Figure 12.30: Image of the immediate vicinity of the supermassive black hole M87*, as released by the EHT
      collaboration in 2019 [6], with two critical curves superposed: that of a Schwarzschild black hole (magenta dotted
      circle) and that of an extremal Kerr black hole seen under the inclination θO = 163◦ (green dotted curve), with
      the projection of the spin axis onto the screen indicated by the green arrow (position angle Θ = 108◦ with respect
      to the βs -axis). The critical curves are scaled by assuming m/rO = 3.7 µas (cf. Table 12.1). (αs , βs ) are screen
      angular coordinates, closely related to celestial equatorial coordinates: αs is minus the (relative) right ascension
      and βs is the (relative) declination. The white circle in the bottom right corner indicates the EHT resolution
      (approx. 20 µas). [Figure generated by the notebook D.5.16; source of the EHT image: Fig. 3 of Ref. [6]]
      these three quantities can be estimated. Indeed, the distance from the Earth to M87* has been
      measured to rO ≃ 16.7 Mpc and the mass of M87* has been deduced from stellar dynamics
      in the inner part of the galactic nucleus, resulting in8 m ≃ 6.2 109 M⊙ (cf. Table 12.1 and
      references therein), hence the scale factor m/rO ≃ 3.7 µas. Regarding θO and Θ, both angles
      can be determined via reasonable assumptions, thanks to the large relativistic jet emanating
      from the close vicinity of the black hole9 , since most (all?) theoretical models predict that the
      jet is aligned with the black hole’s rotation axis. There are actually two jets, emitted from both
      sides of the black hole in opposite directions. Due to a strong Doppler beaming effect, the jet
      moving away from us, usually called the counterjet, is hardly visible, so that the large observed
      jet is the one moving towards us. The position angle of the jet in the plane of the sky, measured
      from the North axis, is Θjet ∼ −72◦ (see e.g. Fig. 1 in Ref. [509]); it is indicated by the light
      blue dashed arrow in Fig. 12.30 (the jet is not visible on the EHT image). The inclination ι of
      the jet with respect to the line of sight is estimated from observed motions of “nodes” within
      the jet, resulting in a value around ι ∼ 17◦ [365, 509]. In particular, the detection of apparent
      superluminal motions, up to vapp ∼ 6c, implies both relativistic velocities and ι being a small
         8
            Another method, based on the dynamics of the gas in the nucleus disk at r < 40 pc, yields a mass of only
      m = 3.5 109 M⊙ [514]; the discrepancy with the value arising from stellar dynamics has not been explained yet.
          9
            This jet has been discovered in 1918 [148], hence well before one suspects a black hole to lie in the core of the
      galaxy M87! It makes M87 belong to the category of galaxies with an active galactic nucleus (AGN). The jet engine
      is related to the black hole, possibly via the Penrose process discussed in Sec. 11.3.2, see e.g. Ref. [2].
                                             12.5 Images                                              503
angle (see e.g. Sec. 5.7.4 of Ref. [228]). Assuming that the jet axis is aligned with the black
holes’s rotation axis, we have either θO = ι or θO = π − ι.
     The main feature of the EHT image (Fig. 12.30) is a broad annular region, whose bottom
part is much brighter than its top part. It is natural to interpret this brightness discrepancy
as resulting from relativistic Doppler beaming: the bottom (resp. top) part corresponds to
emitting material moving towards us (resp. receding from us) [7]. Assuming that, so close to
the black hole, the emitting matter is rotating in the same sense as the black hole, due to the
Lense-Thirring effect (cf. Sec. 11.3.4), this implies that the spin of black hole is pointing away
from us, i.e. that θO = π − ι ∼ 163◦ and Θ = π + Θjet ∼ 108◦ . The projection of the resulting
black hole spin vector onto the plane of the sky is shown by the green arrow in Fig. 12.30.
To superpose the critical curves computed in Sec. 12.4 onto the EHT image, it suffices then
to rotate them by the angle Θ and to use the M87* scale factor m/rO = 3.7 µas determined
above. In Fig. 12.30, we have done it for the two extreme values that can be taken by the spin
parameter a: a = 0 (magenta dotted cirle) and a = m (green dotted curve). The two critical
curves are quite close to each other because the Earth observer is located almost on the axis of
rotation, θO being close to π. To have a stronger discrepancy among various possible values of
a, it would have been better to have θO close to π/2 instead, as illustrated in Fig. 12.26.
     The main conclusion that one can draw from Fig. 12.30 is that the overall scale of the
EHT image fits well with what could be expected from emitting matter in the close vicinity
of a black hole of mass around 6 109 M⊙ . Unfortunately, the resolution of the EHT image
(∼ 20 µas ∼ 1/6 of the image width!) is not sufficient to draw sharper conclusions. In
particular, one cannot assert whether the annular structure in the image is the photon ring
discussed in Sec. 12.5.2 blurred to the EHT resolution or the primary image of an accretion disk
or some more complicated accretion flow. It follows that it is not possible to infer the (currently
unknown) spin parameter a from that image, not speaking about performing any strong test of
general relativity (see Refs. [233, 224] for an extended discussion of this last point). However,
this is the very first image of this type and the next images, which will result from some EHT
upgrade or from space-based VLBI [305, 350], will undoubtedly be sharper and will certainly
revolutionize black hole astronomy.
Contents
        13.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 505
        13.2 Definition and basic properties . . . . . . . . . . . . . . . . . . . . . . . 506
        13.3 Maximal analytic extension . . . . . . . . . . . . . . . . . . . . . . . . 513
        13.4 Near-horizon extremal Kerr (NHEK) geometry . . . . . . . . . . . . . 523
        13.5 Going further . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 537
13.1      Introduction
The Kerr solution of the Einstein equation has been introduced in Sec. 10.2; it depends on
two parameters: the mass m > 0 and the spin parameter a ≥ 0. In Chaps. 10–12, we have
considered the Kerr solution with 0 < a < m and the case a = 0 (Schwarzschild solution) has
been treated in Chaps. 6–9. Here we focus on the case a = m, which is called the extremal Kerr
spacetime. This corresponds to the highest value of a for which the Kerr solution describes a
black hole. Indeed, for a > m, the Kerr metric is still an exact solution of the vacuum Einstein
equation (1.44), but it describes a naked singularity (cf. Sec. 9.6.1): the ring singularity is not
hidden by any horizon to asymptotic observers.
    The Kerr spacetime with a = m is not just the maximally spinning black hole of the Kerr
family. As we going to see, it has important properties that are not shared by Kerr spacetimes
with a < m. In particular, the black hole event horizon is degenerate, in the sense defined in
Sec. 3.3.6, i.e. it has a vanishing surface gravity κ. This corresponds to an extremal black hole,
according to the terminology introduced in Sec. 5.4.3. Besides, there is no internal horizon
and the maximal analytic extension, to be constructed in Sec. 13.3 below, is simpler than that
of the Kerr spacetime with a < m. Another specific property of the extremal Kerr spacetime
(actually related to the degeneracy of the horizon) regards the geometry near the horizon: it
admits an enlarged symmetry group, which is generated by four independent Killing vectors,
instead of two for the geometry of the global solution. This is called the NHEK geometry (for
506                                  Extremal Kerr black hole
      near-horizon extremal Kerr) and has received considerable attention in the recent literature. We
      shall discuss it in Sec. 13.4.
                                                             4m2 r sin2 θ
                                      
                                   2mr          4mr
                        g = − 1− 2       dt̃2 + 2 dt̃ dr −                dt̃ dφ̃
                                    ρ            ρ                ρ2
                                                           
                                   2mr                   2mr
                            + 1+ 2          2
                                         dr − 2m 1 + 2          sin2 θ dr dφ̃                        (13.3)
                                    ρ                     ρ
                                                    3     2
                                                            
                              2  2     2      2   2m r sin θ
                            +ρ dθ + r + m +                    sin2 θ dφ̃2 ,
                                                      ρ2
      with
                                            ρ2 := r2 + m2 cos2 θ.                                    (13.4)
      In this context, the coordinates (xα̃ ) = (t̃, r, θ, φ̃) are called Kerr coordinates and we recognize
      in (13.3) the limit a → m of expression (10.36) for the Kerr metric with a < m.
           The metric (13.3) is regular in all M , since the components gα̃β̃ are singular only for ρ = 0,
      i.e. for r = 0 and θ = π/2, which defines the set R that has precisely been excluded from M
      in the definition (13.1). The Kretschmann curvature invariant K := Rµνρσ Rµνρσ is given by
      Eq. (10.22) with a = m; it diverges for ρ → 0. Therefore, as for the Kerr spacetime with a < m
      (cf. Sec. 10.2.6), we shall call R the ring singularity of the extremal Kerr spacetime. Note that,
      formally, it is not part of the spacetime manifold M [cf. Eq. (13.1)].
           Moreover, the Ricci tensor of the metric (13.3) is identically zero in all M (see the note-
      book D.5.17 for the computation). Hence, we have:
Property 13.1: extremal Kerr metric as a solution of the vacuum Einstein equation
        The metric g of the extremal Kerr spacetime is a solution of the vacuum Einstein equation
        R = 0 [Eq. (1.44)].
                                  13.2 Definition and basic properties                                      507
[Eq. 10.28)], we notice that r+ = r− = m in the limit a → m. This implies that H and Hin
coincide when a → m and the region MII , which is bounded by H and Hin , disappears.
Accordingly, we shall split the extremal Kerr manifold M in two open regions only, MI and
MIII , separated by a single hypersurface H :
M = MI ∪ H ∪ MIII , (13.6)
with
Remark 1: We are using the notation MIII for the “second” region, and not MII , to be consistent with
Chaps. 10–12, i.e. with the limit a → m of the results obtained in these chapters.
   The quadratic polynomial in r introduced in Chap. 10, ∆ = r2 −2mr+a2 = (r−r+ )(r−r− ),
reduces to ∆ = (r − m)2 in the limit a → m. Its double root, r = m, defines the hypersurface
H.
   In the region MBL := M \ H = MI ∪ MIII , one may introduce the Boyer-Lindquist
coordinates (t, r, θ, φ) such that (r, θ) are the same coordinates as in Kerr coordinates, while t
and φ are related to the Kerr coordinates t̃, r and φ̃ by
                                            2m2         r−m
                                   t = t̃ +     − 2m ln                                           (13.8a)
                                           r−m           m
                                             m
                                  φ = φ̃ +      .                                                 (13.8b)
                                           r−m
Differentiating these relations leads to
                               2mr                                                    m
                dt̃ = dt +            dr            and         dφ̃ = dφ +                  dr.    (13.9)
                             (r − m)2                                              (r − m)2
508                                         Extremal Kerr black hole
                                                           t̃/m
                                                           4
                                                                                                  r/m
                            10               5                              5                10
      Figure 13.1: Trace of the hypersurfaces of constant Boyer-Lindquist time t in the plane (t̃, r). [Figure generated
      by the notebook D.5.17]
      Remark 3: As in Chap. 10, we have denoted by ∂r̃ the second vector of the coordinate frame associated
      to the Kerr coordinates (xα̃ ) = (t̃, r, θ, φ̃), in order to distinguish it from the coordinate vector ∂r of
      the Boyer-Lindquist coordinates (xα ) = (t, r, θ, φ).
           The metric components (gαβ ) with respect to the Boyer-Lindquist coordinates (xα ) =
      (t, r, θ, φ) are given by
                                                 4m2 r sin2 θ             ρ2
                                        
                                     2mr
                        g = − 1− 2         dt2 −              dt dφ +           dr2
                                      ρ              ρ2                (r − m)2
                                                                                         (13.11)
                                                  2m3 r sin2 θ
                                                              
                                2  2     2     2                    2     2
                              +ρ dθ + r + m +                    sin θ dφ .
                                                       ρ2
      This expression can be obtained either by taking the limit a → m of Eq. (10.8) or by using (13.9)
      to substitute dt̃ and dφ̃ in Eq. (13.3). We note that grr → +∞ when r → m, which reflects the
         1
             See also the limit a → m of Eq. (10.38).
                                       13.2 Definition and basic properties                                     509
singularity of Boyer-Lindquist coordinates on H and explains why the latter was excluded in
the definition of MBL . This singularity is clearly apparent in the coordinate transformations
(13.8), as well as in the spacetime slicing by the hypersurfaces t = const depicted in Fig. 13.1:
the slices accumulate onto H , without crossing it, so that the points on H do not belong to
any hypersurface t = const. That the t = const hypersurfaces do not provide a regular slicing
of the extremal Kerr spacetime (M , g) is also manifest on the Carter-Penrose diagram shown
in Fig. 13.4.
    The Boyer-Lindquist components of the inverse metric g −1 are
                                                                                        
                   1        2      2    2m3 r sin2 θ                          2m2 r
              − (r−m) 2   r   +  m    +     ρ2
                                                          0    0        −  ρ2 (r−m)2
                                                                                          
                                                      (r−m) 2                             
                                0                       ρ 2   0               0           
    αβ
  g =                                                                                      . (13.12)
                                                                                          
                                                               1
          
                                0                        0    ρ 2             0           
                                                                                           
                                                                                        
                                2m2 r                                 1              2mr
                           − ρ2 (r−m)2                    0    0 (r−m)2 sin2 θ 1 − ρ2
13.2.3         Symmetries
The extremal Kerr metric (13.3) is stationary2 and axisymmetric. The corresponding isometry
group is R × SO(2) ≃ R × U(1) and is generated by two commuting Killing vectors ξ and
η. Both the Kerr coordinates and the Boyer-Lindquist ones are adapted to the spacetime
symmetries, i.e. (t̃, φ̃) and (t, φ) are ignorable coordinates, as it is clear from the metric
components (13.3) and (13.11). Accordingly, one can normalize the Killing vectors so that
                               L(v,θ,
                                 in
                                      φ̃) :   (v, θ, φ̃) = const ∈ R × [0, π] × [0, 2π),            (13.14)
   2
       Cf. Sec. 5.2.1 for the definition of stationary and some discussion about the terminology.
510                                         Extremal Kerr black hole
t̃/m
                                                           4
                                                                           k
                                                                                    `
                                                           2
                                                                                                        r/m
                    8         6         4         2                   2         4         6         8
      Figure 13.2: Trace of the principal null geodesics in the plane (t̃, r). The dashed lines correspond to the ingoing
                                                                                                         out,I
      principal null geodesics L(v,θ,
                                 in
                                     φ̃) and the solid curves to the outgoing principal null geodesics L(u,θ,φ̃)
                                                                                                              ˜ for r > m
            out,III
      and L(u,θ, ˜ for r < m. [Figure generated by the notebook D.5.17]
                φ̃)
                         in MI : L(u,θ,
                                   out,I
                                        ˜ :
                                       φ̃)
                                                             ˜ = const ∈ R × [0, π] × [0, 2π),
                                                      (u, θ, φ̃)                                              (13.17a)
                        in MIII : L(u,θ,
                                    out,III
                                         ˜ :
                                        φ̃)
                                                             ˜ = const ∈ R × [0, π] × [0, 2π),
                                                      (u, θ, φ̃)                                              (13.17b)
                         on H : L(θ,ψ)
                                  out,H
                                        :             (θ, ψ) = const ∈ [0, π] × [0, 2π),                      (13.17c)
                                13.2 Definition and basic properties                                   511
                     (r + m)2            (r − m)2            m
                 ℓ=    2     2
                                 ∂t̃ +     2     2
                                                     ∂r̃ + 2     ∂φ̃                     (13.21a)
                    2(r + m )           2(r + m )         r + m2
                    1       (r − m)2                 m
                 ℓ = ∂t +      2       2
                                          ∂r +              ∂φ in M \ H .                (13.21b)
                    2      2(r + m )           2(r + m2 )
                                                   2
Proof. The second equality in Eq. (13.15) follows from Eq. (13.8a). Equation (13.16b) follows
from Eq. (13.16a) via Eq. (13.10). Equations (13.18) and (13.19) are the integrated version of the
system (10.52) with a = m. Equations (13.20), (13.21a) and (13.21b) are the a = m versions
of respectively Eqs. (10.59), (10.57) and (10.61). All the other statements follow from the limit
a → m of results of Sec. 10.4, except for t̃ being an affine parameter along L(θ,ψ)
                                                                                  out,H
                                                                                        , which is
peculiar to the extremal Kerr horizon and will be proven in Sec. 13.2.5.
    The principal null geodesic congruences are depicted in terms of the (t̃, r) coordinates in
Fig. 13.2. Note that the outgoing geodesics L(u,θ,
                                               out,I
                                                    ˜ and L(u,θ,φ̃)
                                                   φ̃)           ˜ tend to become tangent to H
                                                             out,III
for r → m; this agrees with H being generated by some members of the outgoing principal
null congruence, namely the geodesics L(θ,ψ)
                                          out,H
                                                , as we shall see in details in the next subsection.
Another view of the principal null geodesics is provided by the Carter-Penrose diagram of
(M , g) shown in Fig. 13.3, in which both families of geodesics appear as straight lines.
                                        − H 2m2
                                        →           m      H 2m
                                                                2
                                        ∇r = 2 ∂t̃ + 2 ∂φ̃ = 2 χ,                                        (13.22)
                                             ρ      ρ         ρ
      where χ is the Killing vector field
                                                                               1
                                   χ := ξ + ΩH η ,            with   ΩH :=       .                       (13.23)
                                                                              2m
      It follows immediately that H is a Killing horizon, i.e. a null hypersurface that admits a Killing
      vector as null normal (cf. Sec. 3.3.2).
           From expression (13.21a) for ℓ, we have immediately
                                                          H
                                                        ℓ = χ.                                           (13.24)
      This means that the null generators of H are the outgoing principal null geodesics L(θ,ψ)
                                                                                              out,H
                                                                                                    .
          The surface gravity κ of the Killing horizon H has been defined in Sec. 3.3.5 as the non-
                                                                         H
      affinity coefficient of the Killing-vector normal χ to H : ∇χ χ = κ χ [Eq. (3.29)]. Given
      the identity (13.24), κ coincides with the value on H of the non-affinity coefficient κℓ of the
      tangent ℓ to the outgoing principal null geodesics: ∇ℓ ℓ = κℓ ℓ. A direct computation (cf. the
      notebook D.5.17) reveals that
                                                     r2 − m2
                                            κℓ = m 2           .                              (13.25)
                                                    (r + m2 )2
      In particular, κℓ vanishes for r = m, i.e. on H . Hence κ = κℓ |H leads to3
κ=0. (13.26)
      According to the classification introduced in Sec. 3.3.6, it follows that H is a degenerate Killing
      horizon. The vanishing of the non-affinity coefficient κ means that ℓ is a geodesic vector on H ,
      and not only a pregeodesic one (cf. Remark 1 in Sec. 1.3.2). Equivalently, at any given point
      p ∈ H , ℓ is the tangent vector associated to an affine parameter λ of the null geodesic L(θ,ψ)
                                                                                                   out,H
      through p. Moreover, the affine parameter λ coincides with t̃, up to some additive constant.
      Indeed, Eqs. (13.24) and (13.23) imply
                                                        dt̃
                                                ℓt̃ =       = χt̃ = 1,
                                                        dλ
      from which λ = t̃ + const. Since the range of t̃ is (−∞, +∞), we conclude that L(θ,ψ) out,H
                                                                                                  is a
      complete geodesic. This constrasts with the null generators of a non-degenerate Killing horizon,
      which are incomplete, as shown in Sec. 3.4.2.
         Let us summarize the results obtained above:
         3
          The vanishing of κ can also be obtained by taking the limit a → m of expression (10.77), which has been
      derived for a < m.
                                         13.3 Maximal analytic extension                              513
  The extremal Kerr spacetime (M , g) can be endowed with a conformal completion at null
  infinity such that the future and past null infinities I + and I − are located at the boundary
  of MI . The region MIII is then the interior of a black hole, the event horizon of which is
  the Killing horizon H . Since H is degenerate, the black hole is extremal, according to the
  terminology introduced in Sec. 5.4.3.
    The future null infinity I + and the past null infinity I − relative to MI are depicted in
the Carter-Penrose diagram of Figs. 13.3 -13.4. In this diagram, it is clear that MIII is a black
hole region for MI and that H is the corresponding event horizon.
      Figure 13.3: Carter-Penrose diagram of the extremal Kerr spacetime (M , g) constructed via the projection
      map Π : M → R2 , (t̃, r, θ, φ̃) 7→ (T, X) defined by Eqs. (13.27)-(13.28). The grey curves represent hypersurfaces
      t̃ = const, with t̃ ∈ [−20m, 20m] and the increment δ t̃ = 2m between two successive hypersurfaces. The
      hypersurface t̃ = 0 is singled out by a larger thickness. The red dotted curves represent hypersurfaces r = const,
      with the increment δr between two successive hypersurfaces being δr = 2m for r < 0 and r > 3m and
      δr = 0.2 m for 0 ≤ r ≤ 3m. The hypersurface r = 0 is marked by the brown dashed curve. The green straight
      lines depict some selected principal null geodesics (dashed = ingoing, solid = outgoing, as in Fig. 13.2). iint is the
      internal infinity discussed in Sec. 13.4.1. [Figure generated by the notebook D.5.17]
                                      13.3 Maximal analytic extension                                                515
Figure 13.4: Same as Fig. 13.3 but for the time slicing associated to Boyer-Lindquist coordinates (t, r, θ, φ).
The blue curves represent hypersurfaces t = const, with t ∈ [−20m, 20m] and the increment δt = 2m between
two successive hypersurfaces. Note that the spacetime slicing by the hypersurfaces t = const is singular at H ,
contrary to the slicing by the hypersurfaces of constant Kerr time t̃ shown in Fig. 13.3. [Figure generated by the
notebook D.5.17]
with
                        u               v                                  v               u
 T0 (u, v) := arctan          + arctan           ,     X0 (u, v) := arctan            − arctan         , (13.28)
                          2                 2                                    2                 2
where u and v are the functions of (t̃, r) given by Eqs. (13.18) and (13.15) respectively. Some
outgoing principal null geodesics L(u,θ,  out,I
                                               ˜ and L(u,θ,φ̃)
                                              φ̃)          ˜ are plotted in Fig. 13.3 for selected values
                                                       out,III
of u (solid green lines): u = −4m, u = 0 and u = 4m. Since r is an affine parameter along the
null geodesics L(u,θ,
                   out,I
                        ˜ , it is clear that these geodesics are incomplete, for they all terminate
                       φ̃)
in the past at the finite value r = m (the South-West boundary of MI in Fig. 13.3), without
any possible extension into MIII from there. To extend MI so that all geodesics L(u,θ,          out,I
                                                                                                     ˜ are
                                                                                                    φ̃)
complete, let us introduce a coordinate system on MI that is adapted to the outgoing principal
null geodesics, as the Kerr coordinates (t̃, r, θ, φ̃) were adapted to the ingoing ones. We thus
516                                     Extremal Kerr black hole
                                             ˜       ˜ r, θ, φ̃)
      define the outgoing Kerr coordinates (xα̃ ) = (t̃,     ˜ by
                                                        4m2
                                                                                  
                                             ˜                             r−m
                                           t̃ = t̃ +           − 4m ln
                                          
                                          
                                                      r−m                     m
                                 on MI ,                                                            (13.30)
                                           φ̃˜ = φ̃ +
                                                         2m
                                                                .
                                                         r−m
         By construction, the tangent vector to L(u,θ,
                                                  out,I
                                                       ˜ associated with the affine parameter r is
                                                       φ̃)
      then
                                            ℓ′ = ∂t̃˜ + ∂r̃˜,                              (13.31)
      where ∂r̃˜ stands for the vector ∂/∂r of the coordinates (t̃,   ˜ r, θ, φ̃).
                                                                              ˜ Indeed ℓ′ α̃˜ = dxα̃˜ /dr =
      (1, 1, 0, 0) since along L(u,θ,     ˜
                                      ˜ , t̃ = r + u, with u constant, and both θ and φ̃ are constant. An
                                 out,I                                                ˜
                                      φ̃)
      explicit computation (cf. the notebook D.5.18) shows that ℓ′ is a geodesic vector:
                                                    ∇ℓ′ ℓ′ = 0,                                     (13.32)
      which confirms that r is an affine parameter along L(u,θ,out,I
                                                                    ˜ . ℓ is thus similar to k, which is
                                                                   φ̃)
                                                                          ′
In view of Eq. (13.13), we conclude that ∂t̃˜ and ∂φ̃˜ coincide with the Killing vectors ξ and η of
the Kerr metric:
                                 ∂t̃˜ = ξ      and        ∂φ̃˜ = η.                          (13.36)
    The link between the outgoing Kerr coordinates and the Boyer-Lindquist ones is obtained
by substituting Eq. (13.18) for u into t̃˜ = u + r [Eq. (13.29)]:
                                             2m2         r−m
                                  t̃˜ = t +      − 2m ln     .                                      (13.37)
                                            r−m           m
This relation is to be supplemented by Eq. (13.19) to fully specify the transformation from the
                                                                            ˜ r, θ, φ̃).
Boyer-Lindquist coordinates (t, r, θ, φ) to the outgoing Kerr coordinates (t̃,      ˜ Differenti-
ating Eqs. (13.37) and (13.19) leads to
                                  2mr                                         m
                  dt̃˜ = dt −            dr       and       dφ̃˜ = dφ −             dr.             (13.38)
                                (r − m)2                                   (r − m)2
Remark 1: Equation (13.38) differs from Eq. (13.9) only by the sign + changed to − in the right-hand
side. This reflects the complete symmetry between the Kerr coordinates (t̃, r, θ, φ̃) and the outgoing
                   ˜ r, θ, φ̃)
Kerr coordinates (t̃,      ˜ from the point of view of the Boyer-Lindquist coordinates (t, r, θ, φ) (cf. the
discussion at the beginning of Sec. 10.4.2).
    The expression of the metric tensor with respect to the outgoing Kerr coordinates is easily
obtained by substituting dt and dφ from Eq. (13.38) into the Boyer-Lindquist expression (13.11)
(see also the notebook D.5.18); we get
                                                 4m2 r sin2 θ ˜ ˜
                                                                                
                      2mr     ˜2    4mr ˜                                     2mr
    g = − 1− 2              dt̃ − 2 dt̃ dr −            2
                                                              dt̃ dφ̃ + 1 + 2        dr2
                       ρ             ρ                ρ                        ρ
                                                                       2m r sin2 θ
                                                                         3
                                                                                
                         2mr                ˜                                                   2
           +2m 1 + 2               2            2   2       2
                                sin θ dr dφ̃ + ρ dθ + r + m +      2
                                                                            2
                                                                                     sin2 θ dφ̃˜ .
                          ρ                                                ρ
                                                                                             (13.39)
These metric components are very similar to those in Kerr coordinates, as given by Eq. (13.3):
the only differences are gt̃r ˜ and grφ̃
                                       ˜ , which have a sign opposite to that of respectively gt̃r
and grφ̃ . Apart from the standard singularities of the spherical coordinates (θ, φ̃)˜ on the axis
θ ∈ {0, π}, the only singularity of the metric components (13.39) would occur at ρ = 0, which
does not happen in MI . In particular there is no divergence for r → m. This can be used to
extend smoothly the spacetime (MI , g) to r ∈ (−∞, m], so that the outgoing principal null
geodesics L(u,θ,
              out,I
                   ˜ with θ ̸= π/2 become complete. But the extension to r < m cannot be MIII
                  φ̃)
as it appears clearly on Fig. 13.3 that the end point of L(u,θ,
                                                           out,I
                                                                ˜ for r → m is not located at the
                                                                φ̃)
                                                                           +
boundary between MI and MIII . We thus introduce a spacetime (M ′ , g) from a new copy of
R2 × S2 with R2 spanned by the coordinates (t̃,   ˜ r) and S2 spanned by the coordinates (θ, φ̃),
                                                                                               ˜
such that (i) the manifold M is′
                                             n                                     πo
   M ′ := R2 × S2 \ R ′ where R ′ := p ∈ R2 × S2 , r(p) = 0 and θ(p) =                 , (13.40)
                                                                                   2
518                                   Extremal Kerr black hole
      (ii) MI is identified with the part r > m of M ′ and (iii) in all M ′ , g has the components given
      by expression (13.39). Furthermore, we define
      We have then MI = M ∩ M ′ . In M ′ III , one can define Kerr coordinates (t̃, r, θ, φ̃) from
       ˜ r, φ, φ̃)
      (t̃,       ˜ via formulas (13.18) (with u = t̃˜ − r) and (13.19). It appears then immediately that
      (M ′ III , g) is isometric to (MIII , g). It follows that g obeys the vacuum Einstein equation in all
      M ′ . The vector field ℓ′ := ∂t̃˜ + ∂r̃˜ [cf. Eq. (13.31)] is a smooth non-vanishing null vector field
      on M ′ . Since it is future directed in (MI , g) considered as a part of (M , g), we use it to set
      the time orientation in all M ′ .
           The tangent vector k to ingoing principal null geodesics has the following components
      with respect to the outgoing Kerr coordinates:
                                         (r + m)2                  2m
                                    k=            ∂t̃˜ − ∂r̃˜ +         ∂ ˜.                        (13.42)
                                         (r − m)2               (r − m)2 φ̃
      This follows immediately from k = ∂t̃ − ∂r̃ [Eq. (13.16a)] and using Eq. (13.35) to substitute
      ∂t̃ and ∂r̃ . Extending Equation (13.42) to all M ′ leads to a vector field that is singular on
      H ′ . To get a vector field everywhere regular on M ′ , we rescale it by the inverse of the factor
      connecting ℓ to ℓ′ in Eq. (13.33), i.e. we define
                                                       (r − m)2
                                              k′ :=               k.                                (13.43)
                                                      2(r2 + m2 )
      Hence
                                    (r + m)2         (r − m)2          m
                            k′ =             ∂ ˜  −           ∂r̃˜ + 2    ∂ ˜.                      (13.44)
                                      2    2
                                   2(r + m )   t̃      2    2
                                                    2(r + m )       r + m2 φ̃
      This vector field is clearly regular in all M ′ . Accordingly, it can be used to extend smoothly
      the family of ingoing principal null geodesics to H ′ . The price to pay is that k′ is only a
      pregeodesic vector field, while k was geodesic, being associated with the affine parameter −r.
      The pair (k, k′ ) plays actually the same role as the pair (ℓ′ , ℓ) (note the order!) regarding the
      outgoing principal null geodesics.
         As H in (M , g), H ′ is a degenerate Killing horizon of (M ′ , g). Indeed, by the same
      reasoning as in Sec. 13.2.5, we get that the null normal to H ′ is the Killing vector χ =
      ξ + 1/(2m) η [Eq. (13.23) extended to M ′ ]. This normal coincides with k′ on H ′ , as we can
      see by setting r = m in Eq. (13.44). This implies that the null geodesic generators of H ′
      belong to the ingoing principal null congruence. The non-affinity coefficient of k′ is (cf. the
      notebook D.5.18):
                                                         m2 − r 2
                                             κk′ = m 2            .                                (13.45)
                                                       (r + m2 )2
      We have thus κk′ = 0 on H ′ , so that H ′ is a degenerate Killing horizon.
         With M ′ , our extended extremal Kerr spacetime is thus (M0 , g) with
                             M0 := M ∪ M ′ = MI ∪ MIII ∪ MIII
                                                           ′
                                                              ∪H ∪H ′ .                             (13.46)
                                      13.3 Maximal analytic extension                                                 519
Figure 13.5: Carter-Penrose diagram of the (partially) extended extremal Kerr spacetime (M0 , g). The grey
curves represent hypersurfaces t̃ = const in M , with t̃ ∈ [−10m, 10m] and the increment δ t̃ = 2m between two
successive hypersurfaces. The hypersurface t̃ = 0 is singled out by a larger thickness. The purple curves represent
hypersurfaces t̃˜ = const in M ′ , with t̃˜ ∈ [−10m, 10m] and the increment δ t̃˜ = 2m between two successive
hypersurfaces. The hypersurface t̃˜ = 0 is singled out by a larger thickness. The red dotted curves represent
hypersurfaces r = const, with the increment δr between two successive hypersurfaces being δr = 2m for r < 0
and r > 3m and δr = 0.5 m for 0 ≤ r ≤ 3m. The hypersurfaces r = 0 are marked by brown dashed curves. The
dashed (resp. solid) green straight lines depict some selected ingoing (resp. outgoing) principal null geodesics.
[Figure generated by the notebook D.5.18]
520                                      Extremal Kerr black hole
      A Carter-Penrose diagram of M0 is shown in Fig. 13.5. The projection operator used to build
      this diagram (cf. Sec. 10.8.1) is Π : M0 → R2 , with R2 spanned by coordinates (T, X), such
      that Π is defined by Eqs. (13.27)-(13.28) on M , while on M ′ , Π is defined by
      Remark 3: The spacetime (M0 , g) is analytic: (i) M0 is an analytic manifold, given that the change of
      coordinates (13.30) between the two charts covering M0 is analytic (cf. Remark 4 in Sec. A.2.1) and (ii)
                                                                                                           ˜ r, θ, φ̃)
      the components (13.3) and (13.39) of g are analytic functions of the coordinates (t̃, r, θ, φ̃) and (t̃,     ˜
      respectively.
          The Killing horizon H ′ is actually a white hole horizon from the point of view of MI . More
      precisely, we can endow (M0 , g) with the same conformal completion at null infinity as that
      used for (M , g) in Sec. 13.2.6, i.e. such that the conformal boundary I is constituted by the
      same future and past null infinities I + and I − located at the boundary of MI (cf. Fig. 13.5).
      It appears then that
                                   MIII′
                                          ∪ H ′ = M0 \ (J + (I − ) ∩ M0 )                       (13.49)
      In view of the definition (4.41), we conclude:
Property 13.5: white hole region in the extended extremal Kerr spacetime
        MIII
          ′
             is the interior of a white hole region, the boundary of which is H ′ .
         Since MIII was shown in Sec. 13.2.6 to be the black hole region for the same conformal
      completion at null infinity, we may state, according to the terminology introduced in Sec. 4.4.2:
                                   13.3 Maximal analytic extension                                           521
is depicted via a Carter-Penrose diagram in Fig. 13.6. The region MI (resp. MIII ) of Mn is
                  (n)           (n)
denoted by MI (resp. MIII ). Similarly, the Killing horizon H (resp. H ′ ) of Mn is denoted
by H(n) +
           (resp. H(n)−
                        ). Note that H(n)
                                       +
                                          is a future event horizon (black hole horizon), while H(n)
                                                                                                   −
is a past event horizon (white hole horizon), for the conformal completion at null infinity with
the future and past null infinities I(n)  +
                                             and I(n)
                                                    −
                                                       as copies of I + and I − introduced for M0 .
    It is clear that (M∗ , g) is an analytic spacetime, since (M0 , g) is (cf. Remark 3 on p. 520).
    By construction, all principal null geodesics are complete in M∗ , except for those that
encounter the curvature singularity at r = 0 and θ = π/2. In particular, this holds for the
outgoing principal null geodesics generating H(n)       +
                                                          and for the ingoing ones generating H(n) −
                                                                                                     .
Contemplating the Carter-Penrose diagram of Fig. 13.6, we might thus perceive each (solid
or dashed) green straight line from an r = −∞ end to an r = +∞ end, as well as each thick
black straight line, as representing a complete principal null geodesic.
    It can be shown that actually all timelike or null geodesics of (M∗ , g), and not only the
principal null ones, are complete (see Carter’s article [90] for the proof), so that we can conclude:
Remark 4: The construction of the maximal extension of the extremal Kerr spacetime is simpler than
that of the Kerr spacetime with a < m presented in Sec. 10.8. Indeed, for the latter, if one uses only the
                                                                 ˜ r, θ, φ̃),
ingoing and outgoing Kerr coordinate patches (t̃, r, θ, φ̃) and (t̃,     ˜ one ends up with a manifold
522                                      Extremal Kerr black hole
      Figure 13.6: Carter-Penrose diagram of the maximal analytic extension (M∗ , g) of the extremal Kerr spacetime.
      The red dotted curves represent hypersurfaces r = const, with the increment δr between two successive
      hypersurfaces being δr = 2m for r < 0 and r > 2m and δr = 0.2 m for 0 ≤ r ≤ 2m. The hypersurfaces r = 0
      are marked by brown dashed curves. The dashed (resp. solid) green straight lines depict ingoing (resp. outgoing)
      principal null geodesics with v = 0 (resp. u = 0). [Figure generated by the notebook D.5.18]
                         13.4 Near-horizon extremal Kerr (NHEK) geometry                                           523
that is not maximal: the null geodesics generating the Killing horizons at r = r± are not complete,
because the bifurcation spheres (the central dots in Fig. 10.11), accross which these geodesics can be
extended, are missing. Indeed the bifurcation spheres are not covered by Kerr coordinates. To include
them and thus get the full bifurcate Killing horizons (cf. Sec. 3.4), one has to introduce Kruskal-Szekeres-
type coordinates in the vicinity of each bifurcation sphere, in a way similar to the Schwarzschild case
                                                                                        −
treated in Secs. 9.2 and 9.3. In the present case, each Killing horizon H(n)   +
                                                                                  or H(n) is degenerate and
                                                                               −
therefore made of complete null geodesics. In particular, H(n) and H(n) are not part of a bifurcate
                                                                   +
Killing horizon. In other words, there is no bifurcation sphere in the maximal extension of the extremal
Kerr spacetime and the ingoing and outgoing Kerr coordinate patches are sufficient to cover it entirely.
Historical note : In 1966, Brandon Carter [88] obtained the maximal analytic extension of the rotation
axis A of the extremal Kerr spacetime (cf. historical note on p. 368). In particular, he drew a diagram
(Fig. 1b of Ref. [88]) similar to that of Fig. 13.6, the difference being that Carter’s one is a true conformal
representation, owing to the fact that A is 2-dimensional, while the diagram of Fig. 13.6 is a mere
projection of the 4-dimensional manifold M∗ . In their 1967 classical study entitled Maximal Analytic
Extension of the Kerr metric [72], Robert H. Boyer and Richard W. Lindquist focused on the case a < m.
For a = m, they referred to Carter’s study [88] and stated simply that although his [Carter’s] work was
confined to the symmetry axis (θ = 0, π), it is clear that his conclusions apply with equal force to the full
metric. The detailed construction of the maximal analytic extension of the full extremal Kerr spacetime
was actually presented by Carter himself in 1968 [90], as the special case of vanishing electric charge of
the extremal Kerr-Newman spacetime. The construction was also performed in details in his famous
lectures at Les Houches Summer School in 1972 [95].
   5
     The induced metric would not be Riemannian in the Carter time machine (cf. Sec. 10.2.5), where gφφ < 0; but
this requires r < 0, while r > m in MI .
524                                         Extremal Kerr black hole
      On the other side, the Euclidean space R3 can be described by cylindrical coordinates (X i ) =
      (ϖ, z, φ), such that the Euclidean metric f takes the form
                                            Φ : Σt,θ       −→ R3
                                                                                                 (13.53)
                                                  (r, φ) 7−→ (ϖ(r), z(r), φ).
      Along Φ(Σt,θ ), we have then dϖ = ϖ′ (r) dr and dz = z ′ (r) dr, so that the metric h induced
      by f on Φ(Σt,θ ) is
                                h = ϖ′ (r)2 + z ′ (r)2 dr2 + ϖ(r)2 dφ2 .                      (13.54)
                                                      
      Φ performs an isometric embedding iff the two metrics (13.51) and (13.54) coincide6 , i.e. iff
                                           s
                                                          2m3 r sin2 θ
                               ϖ(r) = sin θ r2 + m2 + 2                                      (13.55a)
                                                         r + m2 cos2 θ
                                                 r2 + m2 cos2 θ
                                ′   2    ′   2
                               ϖ (r) + z (r) =                  .                            (13.55b)
                                                    (r − m)2
      We deduced from Eq. (13.55b) that
                             Z r
                                     1 p 2
                     z(r) =              r̄ + m2 cos2 θ − ϖ′ (r̄)2 (r̄ − m)2 dr̄,                (13.56)
                               2m r̄ − m
      up to some additive constant, which we set to zero by choosing arbitrarily z(2m) = 0. In the
      integrand, ϖ′ (r̄) should be substituted by the value obtained by differentiating Eq. (13.55a).
          The functions ϖ(r) and z(r), provided respectively by Eqs. (13.55a) and (13.56), define fully
      the isometric embedding (13.53) of the Riemannian 2-manifold (Σt,θ , q) into the Euclidean
      space (R3 , f ). The outcome is depicted in Fig. 13.7 for two values of θ: π/2 (left plot) and π/6
      (right plot). In both cases, it appears that the embedded surface is infinite in two limits: for
      r → +∞ and for r → m.
          The first limit, which implies ϖ → +∞ and z → +∞ according to Eqs. (13.55a) and (13.56),
      is towards the asymptotic flat end of MI . More precisely, since t is held fixed on Σt,θ , the
      asymptotic direction appears as the spacelike infinity point i0 at the right-most corner of the
      Carter-Penrose diagram of Fig. 13.4.
          The second limit, r → m, corresponds to ϖ → 2m √1+cos     sin θ
                                                                          2 θ and z → −∞, according to
      Eqs. (13.55a) and (13.56). This describes an infinite cylinder, since ϖ tends to a constant value,
      while z decays to −∞. The divergence of z for r → m is of course due to the factor (r̄ − m)−1
      in the integrand of (13.56), which leads to the following behavior:
                                      √
                                                             
                                                       r−m
                                               2
                            z(r) ∼ m 1 + cos θ ln                  when r → m.                    (13.57)
                                                          m
         6
             Formally, one says that the metric q is the pullback of h by Φ: q = Φ∗ h.
                         13.4 Near-horizon extremal Kerr (NHEK) geometry                                          525
Figure 13.7: Isometric embeddings of 2-surfaces Σt,θ of constant Boyer-Lindquist time t and constant θ of the
extremal Kerr spacetime in the Euclidean 3-space, for two values of θ: θ = π/2 (equatorial plane) (left figure)
and θ = π/6 (right figure). The drawings are truncated at r = (1 + 10−5 )m (bottom boundary) and at r = 10 m
(top boundary). The blue circle marks the ergosphere, which is located at r = 2m for θ = π/2 and r = 3m/2 for
θ = π/6. The five black circles correspond to r = 1.1 m, 1.01 m, 1.001 m, 1.0001 m and 1.00001 m, from top to
bottom. [Figure generated by the notebook D.5.19]
This logarithmic divergence is clearly apparent in Fig. 13.7: the circles at r − m = 10−1 m,
10−2 m, ..., 10−5 m look equally spaced, all the more than r is close to m. In the Carter-Penrose
diagram of Fig. 13.4, the limit r → m at fixed t appears as the point marked iint . This point is
actually the projection of a 2-sphere (spanned by (θ, φ)) called the internal infinity of the
extremal Kerr spacetime. Let us stress that, similarly to the spacelike infinity i0 and the null
infinities I + and I − , the sphere iint is not part of the physical spacetime (M , g).
    The property of iint being located infinitely far from any point at r = r0 > m on a
hypersurface Σt appears clearly on the metric (13.11). Indeed, according to the latter, the
distance along a curve (t, θ, φ) = const between the points at r = r0 and r = m is
                              Z r0              Z r0 √ 2
                                   √                   r + m2 cos2 θ
                          ℓ=         grr dr =                          dr.
                                m                m        r−m
Again, the factor (r − m)−1 makes the integral diverge, so that ℓ = +∞.
    Another perspective of iint being “infinitely far” is obtained by noticing that on the Carter-
Penrose diagram of Fig. 13.4, iint is located at the past end of the event horizon H . Since H is
a degenerate Killing horizon (cf. Sec. 13.2.5), it is generated by complete null geodesics: the
principal null geodesics L(θ,ψ)
                           out,H
                                    introduced in Sec. 13.2.4. Now, since L(θ,ψ)
                                                                              out,H
                                                                                    is complete, its
526                                         Extremal Kerr black hole
      past end at iint is obtained at the limit t̃ → −∞ of its affine parameter t̃ (see also the grey
      curves in Fig. 13.3, which reveal clearly that the limit t̃ → −∞ at r = m is iint ). Hence, in
      terms of affine length, iint is infinitely far in the past along any of the null geodesics L(θ,ψ)
                                                                                                   out,H
      generating H .
         The spacetime region in the vicinity of r = m is called the extremal Kerr throat. We are
      going to explore it in more details in the next sections.
                                                t           r−m                  t
                                       T =ε       ,    R=       ,      Φ=φ−        ,             (13.58)
                                               2m            εm                 2m
      the reverse transformation being
                                            T                                     T
                                      t = 2m ,        r = m(1 + εR),    φ=Φ+        .            (13.59)
                                            ε                                     ε
      Note that on MBL , the range of T is R, while that of R is (−∞, 0) ∪ (0, +∞), with R > 0 on
      MI and R < 0 on MIII . The coordinate choice (13.59) ensures, for a fixed value of (T, R, Φ)
      with T > 0,
                                                                         dφ      1
                          lim t = +∞, lim r = m, lim φ = +∞,                 =                   (13.60)
                          ε→0             ε→0           ε→0              dt     2m
      and is motivated as follows. For ε → 0, the coordinate R clearly implements the zoom on the
      region r ≃ m, i.e. the region close to the event horizon H . Regarding the change t → T , we
      may notice that the Boyer-Lindquist coordinate t diverges near H — more precisely, t diverges
      along the worldline of any particle that reaches H (this is clear on Figs. 13.1 and 13.4). So, we
      may keep T = εt/(2m) finite in the vicinity of H , while having t diverging. The division by
      2m makes T dimensionless, as R, with the factor 2 being chosen for later convenience. Similarly,
      the Boyer-Lindquist coordinate φ diverges along the worldline of any particle that reaches
      H . For geodesics, this divergence occurs at the rate dφ/dt = ΩH = 1/(2m) [Eqs. (11.74) and
      (13.23)], which is ensured at fixed Φ by the choice (13.59).
          Let us denote by Gαβ (T, R, θ, Φ, ε) the components of the metric tensor g with respect to
      Bardeen-Horowitz coordinates (X α ) := (T, R, θ, Φ). They depend on ε since the change of
      coordinates (13.59)-(13.58) does. The explicit expressions of the Gαβ (T, R, θ, Φ, ε)’s are rather
      complicated and can be found in the notebook D.5.20. We are concerned here only by the limit
      ε → 0. In other words, we consider the tensor field h on MBL whose components with respect
      to the Bardeen-Horowitz coordinates are the limits
         7
             Cf. the historical note on page 536.
                       13.4 Near-horizon extremal Kerr (NHEK) geometry                                      527
Evaluating the limits leads to the explicit expression of h (cf. the notebook D.5.20):
                                       dR2                4 sin2 θ
                                                                                   
           2         2         2   2
    h = m (1 + cos θ) −R dT + 2 + dθ +            2                               2
                                                                     (dΦ + R dT ) .              (13.62)
                                        R             (1 + cos2 θ)2
A priori the limits (13.61) lead only to a symmetric bilinear form. To be a proper metric tensor,
h has to be non-degenerate as well (cf. Sec. A.3.1 in Appendix A). This can be checked by
computing the determinant of the components (13.62); one gets (cf. the notebook D.5.20):
Hence det(hαβ ) vanishes only for sin θ = 0, i.e. on the rotation axis A . This merely signals
the standard coordinate singularity of spherical-type coordinates on A . Away from A , one
has det(hαβ ) < 0, which shows that h is non-degenerate and has the signature (−, +, +, +)
or (+, −, −, −). Since the (diagonal) (R, θ)-block has clearly the signature (+, +), we get that
only (−, +, +, +) is possible. Hence we may conclude:
  Equation (13.62) defines a Lorentzian metric tensor h on MBL = MI ∪ MIII . We shall call
  it the near-horizon extremal Kerr (NHEK) metric.
    The NHEK metric h has been obtained in Bardeen-Horowitz coordinates (T, R, θ, Φ). Let
us express it in terms of Boyer-Lindquist coordinates (t, r, θ, φ). From the transformation law
(13.58), we have
            r−m              dR    dr                                              r − 2m
 R dT =          dt,            =                 and      dΦ + R dT = dφ +               dt. (13.64)
             2m2              R   r−m                                                2m2
The components of h with respect to Boyer-Lindquist coordinates are then immediately deduced
from Eq. (13.62):
                                     "
                                         (r − m)2 2         dr2
                h = m2 (1 + cos2 θ) −               dt +           + dθ2
                                            4m4          (r − m)2
                                                                          2 #        (13.65)
                                            4 sin2 θ
                                                       
                                                              r − 2m
                                       +                 dφ +          dt
                                         (1 + cos2 θ)2          2m2
Remark 1: It is remarkable that, while the change of coordinates (t, r, θ, φ) ↔ (T, R, θ, Φ) depends on
the parameter ε [Eqs. (13.59)-(13.58)], the Boyer-Lindquist expression (13.65) of h is independent of ε.
This occurs because ε disappears in the combinations (13.64). In particular, the NHEK metric is unique,
i.e. it does not depend on the value of ε, contrary to the Bardeen-Horowitz coordinates on MBL .
Remark 2: As we have introduced it, the NHEK metric is defined on the part MBL of the Kerr manifold
M , where R ̸= 0. In Sec. 13.4.4, we shall consider instead that h is defined on a manifold distinct from
M , where it can be extended accross R = 0.
528                                  Extremal Kerr black hole
      where H stands for the Heaviside function: H(−R) = 1 for R < 0 and 0 for R > 0. Beside the
      AdS2 -type transformation for (τ, y) ↔ (T, R), the new azimuthal coordinate ψ is introduced
      to insure dΦ + R dT = dψ + y dτ . In terms of the coordinates (τ, y, θ, ψ), the NHEK metric
      h reads (cf. the notebook D.5.20):
                                                  dy 2              4 sin2 θ
                                                                                            
                  2        2             2   2               2                             2
           h = m (1 + cos θ) −(1 + y )dτ +              + dθ +                 (dψ + y dτ ) .
                                                 1 + y2          (1 + cos2 θ)2
                                                                                             (13.70)
                          13.4 Near-horizon extremal Kerr (NHEK) geometry                                               529
On the rotation axis (sin θ = 0), we recover the AdS2 metric expressed in global AdS coordinates
(τ, y). On the hypersurface θ = θ∗ considered above, we get
                             √                       dy 2
                                                                          
                h θ=θ∗               2       2   2
                         = 2( 3 − 1)m −(1 + y )dτ +                      2
                                                           + (dψ + y dτ ) .                                 (13.71)
                                                    1 + y2
This is the metric of AdS3 in some standard coordinates8 (compare e.g. with Eq. (17) of Ref. [57]
with y = sinh ω). On constant θ hypersurfaces, with θ ̸= θ∗ or π − θ∗ , the metric induced by h
is not that of AdS3 but that of a so-called warped AdS3 , the θ-term in front of (dψ + y dτ )2 in
Eq. (13.70) being the warp factor (see e.g. Refs. [137, 57, 247]).
                                     √
Remark 3: The value θ∗ = arcsin( 3 − 1) for which the hypersurface θ = θ∗ inherits AdS3 metric
from h is exactly the value θcrit considered in Sec. 12.4.4 [cf. Eq. (12.98)], namely the minimal value
of the colatitude θ of a distant observer for the critical curve (boundary of the black hole shadow) to
contain a vertical straight line segment (the so-called NHEK line; maroon line in Fig. 12.24). A priori,
this is a mere coincidence.
  The NHEK spacetime is the spacetime (N , h), where the manifold N is diffeomorphic
  to R2 × S2 and h is the Lorentzian metric defined by Eq. (13.70) in terms of the coordi-
  nates (τ, y, θ, ψ), with (τ, y) ∈ R2 and (θ, ψ) ∈ (0, π) × (0, 2π) being standard spherical
  coordinates on S2 .
By comparing the metrics (13.70) and (3.49), with the changes of notation y ↔ r and ψ ↔ φ,
we note that (N , h) bears some similarities with AdS4 , but is distinct from it. First of all, r
ranges in (0, +∞) only on AdS4 , while y ranges in9 (−∞, +∞) on N . Furthermore, there is
a global (1 + cos2 θ) term in (13.70), the dθ2 terms differ drastically in the two expressions, the
one in (13.70) being not multiplied by y 2 . The dφ2 and dψ 2 are also truly distinct. Moreover,
the NHEK metric contains the off-diagonal term hτ ψ , while the AdS4 one is purely diagonal.
   8
      These coordinates are linked to the property of AdS3 being a fiber bundle over AdS2 with fiber R (in the same
way as S3 is a fiber bundle over S2 with fiber S1 (Hopf fibration)); ψ is then the fiber coordinate and spans R.
Since ψ is 2π-periodic on the hypersurface θ = θ∗ , we see that the latter is actually not the whole of AdS3 , but a
quotient of it, obtained by identifying the points ψ and ψ + 2π.
    9
      Note that there is no coordinate singularity at y = 0 in (13.70), while there is one at r = 0 in (3.49) and the
latter is such that AdS4 cannot be extended analytically to r < 0.
530                                         Extremal Kerr black hole
         The inverse NHEK metric h−1 takes a rather simple form (cf. the notebook D.5.21 for the
      computation):
                                                                                       
                                     1                               y
                                − 1+y  2   0     0                1+y 2
                                                                                       
                                               2
                        1         0     1 + y   0                   0                  
              αβ
            h =                                                                         . (13.72)
                                                                                       
                           2
                    1 + cos θ  0
                              
                                          0     1                   0                  
                                                                                        
                                   y               (1+cos2 θ)2 +y 2 (cos4 θ+6 cos2 θ−3)
                                 1+y 2
                                           0     0           4(1+y 2 ) sin2 θ
          As AdS4 , the NHEK spacetime is not asymptotically flat. The metrics of both spacetimes
      are solutions of the Einstein equation with T = 0 [Eq. (1.43)], but with Λ < 0 for AdS4 metric
      and with Λ = 0 for h:
Ric(h) = 0, (13.73)
      Proof. h has been defined on MBL via the limit expression (13.61). Now, for ε > 0, the Gαβ ’s
      in the right-hand side of Eq. (13.61) are the components of a metric that has a vanishing Ricci
      tensor, since it is nothing but the extremal Kerr metric, expressed in the Bardeen-Horowitz
      coordinates (T, R, θ, Φ), which form a perfectly regular coordinate system on MBL for ε > 0
      (cf. the transformations (13.59)-(13.58)). By continuity at the limit ε → 0, it follows that h has
      a vanishing Ricci tensor as well. A priori this holds only on MBL , but since the components
      of h in the global NHEK coordinates (τ, y, θ, ψ) are identical on MBL and N , the result is
      immediately extended by analyticity to all the NHEK spacetime.
      Remark 4: For the reader skeptical about the limit process and the extension to the whole of N , a
      direct computation of the Ricci tensor of h from the global-coordinate components (13.70) is performed
      in the notebook D.5.21 and the outcome is indeed identically vanishing.
      for |y| → +∞, so that h(∂τ , ∂τ ) > 0 for sin θ > (1 + cos2 θ)/2.
                         13.4 Near-horizon extremal Kerr (NHEK) geometry                                         531
An alternative concept of mass, which does not require any Killing vector, is that of ADM
mass (cf. Sec. 5.3.5). It is however not applicable either since (N , h) is not asymptotically flat.
Incidentally, this last fact would anyway preclude any unambiguous definition of the Komar
mass, since asymptotic flatness is required to normalize the timelike Killing field at spatial
infinity (scalar square −1).
    On the contrary, the angular momentum J of the NHEK spacetime can properly be
defined from the axisymmetry Killing vector η = ∂φ = ∂Φ = ∂ψ , via the Komar formula (5.63).
The Hodge dual ⋆(dη) [cf. Eq. (5.38)], which appears in this formula, has of course to be taken
with respect to the metric h. Since h fulfills the vacuum Einstein equation [Eq. (13.73)], the
value of J does not depend upon the choice of the integration 2-surface S in formula (5.63)
(Property 5.21). One finds (cf. the notebook D.5.21 for the computation):
                                                 J = m2 .                                             (13.74)
This is exactly the value J = am = m2 of the angular momentum of the extremal Kerr
spacetime [Eq. (10.80)]. Hence we conclude:
  Property 13.10: angular momentum of NHEK spacetime
  The NHEK spacetime (N , h) has no well defined mass, but it has a well defined Komar
  angular momentum, whose value is nothing but the angular momentum J = m2 of the
  extremal Kerr spacetime (M , g) from which (N , h) arises.
Remark 5: For this reason, many authors replace the overall factor m2 of the NHEK metric in Eq. (13.62)
or (13.70) by J.
“Conformal” coordinates
The range of the coordinate y is the whole of R. For pictural purposes (e.g. drawing Carter-
Penrose-like diagrams), it would be convenient to have instead a coordinate spanning some
finite range. As in the AdS4 case presented in Example 18 of Chap. 3, it is quite natural to
introduce
                            χ := arctan y    ⇐⇒      y =: tan χ                      (13.75)
so that11 χ ∈ (−π/2, π/2). The relation (13.69) to the Bardeen-Horowitz coordinates (T, R, θ, Φ)
can be then rewritten as
                                      
                                         τ = arctan2 (2T R2 , R2 − T 2 R2 + 1) + πH(−R)
   
                sin τ
   
     T = cos τ +sin χ                
                                      
                                      
   
                                     
                                                                     
                                                       (1+T 2 )R2 −1
           cos
    R = cos χ
               τ +sin χ              
                                      
                                        χ =  arctan         2R
                                  ⇐⇒
   
   
     θ=θ                             
                                      
                                        θ=θ
   
                                     
                                                                           
                                     
                        cos(τ −χ)                             (1−T R)2 +R2
                                                          √
                                     
     Φ = ψ + ln sin τ +cos χ         
                                        ψ = Φ − ln               2    2 2 2
                                                                               ,
                                                                  [(1+T )R −1] +4R
                                                                                                      (13.76)
  11
    In Example 18, the range of χ was only (0, π/2) because r = tan χ ≥ 0 in AdS4 , while here y = tan χ spans
the whole of R.
532                                       Extremal Kerr black hole
                                      r=
                −
                                                            +∞
            r=
                                                          r=
                                                                                      4
                                                                                              R=−∞
                                                                                                     −
                                III                                                              P
                                                                                          R= −1
                                                                                      2
                                r= 0
r=
                                                                                                         0
                                 r=
                                  m
                                                                                          R=
             r=
                                                                                τ
                                                             +∞
                                                                                                              1
              −
                                                                                                             R=
                ∞
                                                                                                                      R=+∞
                                                                                      0                       +
                        i int                         I                                                       P
                 ∞
                                      r=
                −
                                                            +∞
            r=
                                                                                      2
                                        m
                                                  0
                                                          r=
                                                                                          2      1       0        1    2
                                                                                                         χ
      Figure 13.8: 2-dimensional views of the extremal Kerr spacetime (M , g) (left) and of the NHEK spacetime
                                                             (0)   (0)
      (N , h) (right). The left figure is a zoom on the (MI , MIII ) part of the Carter-Penrose diagram shown in
      Fig. 13.6. The right figure is a “compactified” view of N based on the coordinates (τ, χ). The near-horizon region
      in the Kerr spacetime is visualized by means of red curves, which are curves (actually hypersurfaces) of constant
      r, ranging from r = m/2 to r = 3m/2, i.e. within ∆r = m/2 of the horizon value r = m. The r-increment
      between two nearby curves is δr = m/10. Grey curves represents hypersurfaces of constant Boyer-Lindquist
      coordinate t in the near horizon region, with t varying by steps of δt = 2m in the range [−8m, 8m]. Each of these
      constant-r or constant-t curves is mapped to the NHEK spacetime (right figure) by means of the transformation
      (13.58) with ε = 1/2, giving birth to constant-R curves ranging from R = −1 to R = 1 (red curves) and to
      constant-T curves ranging from T = −2 to T = 2 (grey curves). [Figures generated by the notebooks D.5.18 (left)
      and D.5.21 (right)]
      These transformations can be used to map the Boyer-Lindquist domain MBL of Kerr spacetime
      to the region −π/2 < τ + χ < 3π/2 of the NHEK spacetime N , as illustrated in Fig. 13.8. This
      assumes a fixed value of ε > 0 in Eqs. (13.59)-(13.58), so that (T, R, θ, Φ) constitute a regular
      coordinate system on MBL = MI ∪ MIII . More precisely, MI , where R > 0, is mapped to the
      subregion NP+ defined by −χ − π/2 < τ < χ + π/2, which implies cos τ + sin χ > 0 and
      hence R > 0 via (13.76), since cos χ > 0 for χ ∈ (−π/2, π/2). On the other hand, MIII , where
      R < 0, is mapped to the subregion NP− defined by χ + π/2 < τ < −χ + 3π/2, which implies
      cos τ + sin χ < 0 and hence R < 0 via (13.76). In both NP+ and NP− , the coordinate T spans
      the whole of R. We shall call each of the regions NP+ and NP− a Poincaré patch of NHEK
      spacetime. They appear as the interiors of triangles having one vertical edge at the NHEK
                            13.4 Near-horizon extremal Kerr (NHEK) geometry                                             533
boundary in Fig. 13.8. Note the similarity12 with the Poincaré patch in AdS4 spacetime plotted
in Fig. 3.6.
    The NHEK metric components with respect to the coordinates (τ, χ, θ, ψ) are easily deduced
from (13.70):
                                                                 4 sin2 θ
                                                                                              
             2          2      1          2       2
                                                        2                                   2
    h = m (1 + cos θ)                 −dτ + dχ + dθ +                       (dψ + tan χ dτ ) .
                            cos2 χ                            (1 + cos2 θ)2
                                                                                             (13.77)
Note the factor 1/ cos2 χ in front of the −dτ 2 + dχ2 term, which is analogous to the conformal
factor 1/ cos2 χ of the AdS4 metric in Eq. (3.50), except that in the present case, 1/ cos2 χ does
not factor all the metric, so that it cannot be considered as a proper conformal factor. Since cos χ
never vanishes for χ ∈ (−π/2, π/2), the factor 1/ cos2 χ is regular in all N . It simply blows
up at the “boundaries” χ → ±π/2 of N , which correspond to y → ±∞ and to R → ±∞ in
the Poincaré patches NP+ and NP− .
    One can provide a pictoral view of h being the “near-horizon limit” of g as follows:
  The two red strips r ∈ [m/2, 3m/2] and R ∈ [−1, 1] in Fig. 13.8 have been drawn by
  setting ε = 1/2 in the transformation (13.58) linking the Boyer-Lindquist coordinates
  (t, r, θ, φ) to the Bardeen-Horowitz ones (T, R, θ, Φ). Now, if one maintains the red strip
  R ∈ [−1, 1] in the NHEK spacetime unchanged and let ε decays to 0, the red strip in the
  extremal Kerr spacetime shrinks to a more and more narrow strip around the horizon. As
  ε → 0, this narrow strip, endowed with the Kerr metric g, tends to be isometric to the
  fixed NHEK strip R ∈ [−1, 1] endowed with the metric h.
Remark 6: While the extremal Kerr spacetime and the NHEK one are similar in the near-horizon region,
it is clear on Fig. 13.8 that they have very different asymptotics for r → ±∞ and R → ±∞.
      are two Killing vectors of h. In the above equation, the second expression of ξ1 is that in the
      Boyer-Lindquist coordinate frame and follows directly from the transformation law (13.59).
      In particular, note that ξ1 is distinct from the Killing vector ξ = ∂t of the Kerr metric g. It is
      rather connected to the horizon-generating Killing vector χ defined by Eq. (13.23):
                                                                  2m
                                                           ξ1 =      χ.                                              (13.79)
                                                                   ε
      Besides, in Eq. (13.78), the vector field ∂Φ is identical to the Killing vector field η = ∂φ of the
      Kerr metric g as a consequence of the transformation law (13.59). Similarly, the transformation
      law (13.69) implies that η = ∂ψ .
          A third symmetry apparent on the metric components (13.62) is the invariance under
      the transformation (T, R) 7→ (αT, R/α), for any α > 0. Such transformations, which are
      called squeeze mappings (or hyperbolic rotations) of the (T, R) plane, form a 1-parameter
      group action, whose generator ξ2 is given by formula (3.14) with t = λ := α − 1 (so that the
      identity element is for λ = 0). By considering an infinitesimal transformation of parameter
      dλ, formula (3.14) leads to the components ξ2T = [(1 + dλ)T − T ]/dλ = T and ξ2R = [(1 −
      dλ)R − R]/dλ = −R. Hence the Killing vector field
                                                                                          t
                                   ξ2 = T ∂T − R ∂R = t ∂t + (m − r) ∂r +                   ∂φ .                     (13.80)
                                                                                         2m
      The second expression stems from the transformation law (13.59) and expresses ξ2 in terms
      of the Boyer-Lindquist coordinate frame. We note that this expression is independent of ε,
      contrary to the Boyer-Lindquist expression of ξ1 .
          Finally, a fourth Killing vector of h shows up clearly on the components (13.70) with respect
      to the global NHEK coordinates (τ, y, θ, ψ), since these components are independent of τ .
      Hence ∂τ is a Killing vector of h. In what follows, we will use instead the linear combination14
                                                                       1
                                                        ξ3 := ∂τ −       ξ1 ,                                        (13.81)
                                                                       2
      since it has slightly simpler components with respect to Bardeen-Horowitz coordinates. Given
      the transformation law (13.69), we get (cf. the notebook D.5.20)
                                                  
                                       1         1                   1
                                 ξ3 =      2
                                          T + 2 ∂T − RT ∂R − ∂Φ .                           (13.82)
                                       2       R                     R
         We have thus four Killing vectors of the NEHK metric h: η, ξ1 , ξ2 and ξ3 . Since none of
      them is a linear combination with constant coefficients of the three others, the isometry group
      G generated by these Killing vectors is 4-dimensional. This is 2 dimensions more than the
        14
             Let us recall that a linear combination with constant coefficients of Killing vectors is a Killing vector.
                            13.4 Near-horizon extremal Kerr (NHEK) geometry                                           535
isometry group of Kerr spacetime, which is R × U(1). In order to fully characterize G, let us
determine its Lie algebra by evaluating the commutator15 of each pair of the four generators η,
ξ1 , ξ2 and ξ3 . We get (cf. the notebook D.5.20)
and
                              [ξ2 , ξ1 ] = −ξ1 ,    [ξ2 , ξ3 ] = ξ3 ,    [ξ1 , ξ3 ] = ξ2 .                 (13.85)
Equation (13.84) shows that η commutes with all the other generators of G. Since η is the
generator of the axisymmetry group U(1), we may write G = G3 × U(1), where G3 is a
3-dimensional Lie group, generated by ξ1 , ξ2 and ξ3 . The commutation relations (13.85) show16
that the Lie algebra of G3 is the special linear algebra sl(2, R). At this stage, G3 can be one of
the following three Lie groups: SL(2, R), PSL(2, R) = SL(2, R)/Z2 or SL(2, R) (the universal
covering group of SL(2, R)). It cannot be PSL(2, R) because, as it appears clearly on (13.62),
the transformation (T, R) 7→ (−T, −R) is an element of G3 and, in PSL(2, R), this element
would be identified with the identity due to the quotient by Z2 = {Id, −Id}, where Id is the
identity element. G3 is actually the special linear group SL(2, R). We conclude:
  with U(1) generated by the axisymmetry Killing vector η and SL(2, R) generated by the
  three Killing vectors ξ1 , ξ2 and ξ3 given by Eqs. (13.78), (13.80) and (13.82).
   This property is often phrased as follows: the near-horizon region of the extremal Kerr
spacetime (the extremal Kerr throat) has some emergent symmetries, i.e. it has more symmetries
than the extremal Kerr spacetime itself. Let us stress that no such thing occurs for non-extremal
Kerr black holes.
Remark 7: SL(2, R) is isomorphic to the spin group Spin(2, 1), which is the isometry group of the
2-dimensional anti-de Sitter spacetime (AdS2 ). The spin group Spin(2, 1) is the double cover of the
pseudo-orthogonal group SO(2, 1), the latter being the isometry group of the “time-cyclic” 2-dimensional
anti-de Sitter spacetime, i.e. the quadric surface X 2 − U 2 − V 2 = −1 in R3 endowed with the flat
metric −dU 2 − dV 2 + dX 2 .
Remark 8: Having an enhanced symmetry group near the horizon is not specific to the extremal Kerr
black hole; this is actually a generic feature, which occurs near degenerate Killing horizons [327, 137].
  15
       The definition of the commutator of two vector fields is recalled in Appendix A [cf. Eq. (A.27)].
  16
       More precisely, inthe representation
                                       
                                             of sl(2,
                                                 
                                                      R) by
                                                         
                                                            the algebra
                                                                     
                                                                          of 2 × 2 real matrices with zero trace, a
                      0    1        0  0         1    0 
standard basis is E =
                              , F =
                                    
                                             , H =
                                                   
                                                              . This basis has the following commutation
                                                              
                     
                        0 0         
                                       1 0         
                                                      0 −1 
relations: [H,
            √ E] = 2E, [H, F ] = −2F and [E,√F ] = H. One recovers the last ones from (13.85) by identityfying
ξ1 with F/ 2, ξ2 with H/2 and ξ3 with −E/ 2.
536                                      Extremal Kerr black hole
      Pure geometric arguments suffice to show that a degenerate Killing horizon associated with some Killing
      vector ξ1 has necessarily a near-horizon geometry endowed with a second Killing vector, which is
      similar to ξ2 [327]. Axisymmetry provides the third Killing vector η and the Einstein equation leads to
      the existence of the fourth Killing vector ξ3 [329].
      Historical note : The NHEK metric h has been first exhibited by Brandon Carter in 1973 [95]: compare
      Eq. (5.63) of Ref. [95] (with Q = P = 0) with Eq. (13.62), taking into account the change of notations
      (τ, λ, φ) ↔ (T, −R, Φ). Actually the NHEK metric was found by Carter when searching for the
      most general stationary and axisymmetric solutions of the source-free Einstein-Maxwell equations (cf.
      Sec. 1.5.2), in order to get the Kerr-Newman solution. The latter appeared then as the generic case
      and NHEK-like metrics as special cases. The limit Q = P = 0 of Carter’s metric (5.63) corresponds
      to the case of vanishing electric charge Q and magnetic monopole P , so that it is a solution of the
      vacuum Einstein equation [Eq. (13.73) above]. Carter stated that the metric h has a 4-dimensional
      isometry group and that the hypersurfaces of constant θ are homogeneous and partially isotropic. Even
      if he did not get h by some near-horizon limit process, Carter stressed that in the neighborhood of
      the extremal Kerr horizon, the Kerr metric g can be approximated arbitrarily closely by h (cf. Fig. 6.3
      in Ref. [95], which is similar to our Fig. 13.8). In the related case of the static extremal black hole,
      constituted by thep (spherically symmetric) Reissner-Nordström solution with electric charge equal to
      the mass (Q = 4π/µ0 m, cf. Eq. (5.107)), Carter exhibited a change of coordinates similar17 to that
      between Boyer-Lindquist coordinates and Bardeen-Horowitz coordinates [Eq. (13.59)] and he showed
      that the near-horizon limit brings the Reissner-Nordström metric to the Bertotti-Robinson metric. The
      latter is the product metric of AdS2 × S2 and constitutes a static solution of the source-free Einstein-
      Maxwell equations that is highly symmetric, having the 6-dimensional group SL(2, R) × SO(3) as
      isometry group (cf. e.g. Ref. [213]). The NHEK metric has been rediscovered by James Bardeen and Gary
      Horowitz in 1999 [41]. They derived it from the Boyer-Lindquist expression of the extremal Kerr metric
      by performing the change of coordinates18 (13.58)-(13.59) and taking the limit ε → 0, as presented in
      Sec. 13.4.2. In the same article [41], Bardeen and Horowitz introduced the global NHEK coordinates19
      (τ, y, θ, ψ) via the transformation (13.69) and obtained expression (13.70) for h [their Eq. (2.9)]. They
      have analyzed the global properties of the NHEK spacetime, in particular its geodesic completeness.
      Bardeen and Horowitz have also shown that the NHEK isometry group is SL(2, R) × U(1) [Eq. (13.86)]
      and have given formulas (13.78), (13.80) and (13.82) for the four Killing vectors ξ1 , ξ2 , ξ3 and η [their
      Eq. (2.14)].
      13.4.6      Applications
      The NHEK geometry is at the core of the so-called Kerr/CFT correspondence, initiated by Monica
      Guica, Thomas Hartman, Wei Song and Andrew Strominger in 2009 [247], which conjectures
      that quantum gravity in the vicinity of the extremal Kerr horizon is dual to a two-dimensional
      conformal field theory (CFT) (see Ref. [137] for a review of this topic).
        17
            Carter defined the near-horizon coordinates (τ, λ) by t = m2 τ and r = λ + m and took the near-horizon
      limit as m → +∞ (cf. the paragraph surrounding Eq. (4.31) in Ref. [95]); this is equivalent to Eq. (13.59) with
      ε = 1/m.
         18
            The coordinates actually used by Bardeen and Horowitz were T̃ := 2mT and R̃ = mR, denoted respectively
      t and r by them.
         19
            ψ is denoted by ϕ in Ref. [41].
                                                 13.5 Going further                                                     537
    Another domain of application of the NHEK geometry regards the computation of signals
emanating from the neighborhood of an extremal Kerr black hole, either electromagnetic ones
(e.g. [238, 139, 308]) or gravitational-wave ones (e.g. [140, 239]).
  20
     The work [18] appeared on arXiv in 2012, but was published in 2015.
  21
     By transverse derivative it is meant ∂Ψ/∂r, ∂ 2 Ψ/∂r2 , etc. (recall that H is located at r = const and is
spanned by the coordinates (t̃, θ, φ̃)). If the initial data has support away from H , the divergence occurs only for
the third derivative.
538   Extremal Kerr black hole
       Part IV
Contents
        14.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 541
        14.2 Lemaître-Tolman equations and their solutions . . . . . . . . . . . . . 542
        14.3 Oppenheimer-Snyder collapse . . . . . . . . . . . . . . . . . . . . . . . 550
        14.4 Observing the black hole formation . . . . . . . . . . . . . . . . . . . . 568
        14.5 Going further . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 582
14.1      Introduction
After having investigated black holes in equilibrium, through the Schwarzschild and Kerr
solutions, we move to dynamical black holes, more specifically to the standard process of
black hole formation: gravitational collapse. To deal with analytical solutions, we simplify the
problem as much as possible. First we assume spherical symmetry, which is quite natural as a
first approximation for modeling the gravitational collapse of a stellar core or a gas cloud. A
drawback of this approach is that it forbids the study of gravitational waves, since by virtue
of the Jebsen-Birkhoff theorem (to be proven in Sec. 14.2.5), the exterior of any spherically
symmetric collapsing object is a piece of Schwarzschild spacetime, which does not contain any
gravitational radiation. The second major simplifying approximation is to consider pressureless
matter, commonly referred to as dust.
    We start by deriving from the Einstein equation the system of partial differential equations
yielding the metric for spherically symmetric dust (Lemaître-Tolman system) and write down
the most general solutions (Sec. 14.2). Then we specialize these solutions to the case of a
homogeneous ball of dust collapsing from rest (Oppenheimer-Snyder collapse, Sec. 14.3), which
we study in details, in particular regarding the birth of the event horizon and the existence
of trapped surfaces. Finally, we consider the observational appearance of the gravitational
collapse and the black hole formation to a remote observer (Sec. 14.4).
542                                  Black hole formation 1: dust collapse
                        g = −dτ 2 + a(τ, χ)2 dχ2 + r(τ, χ)2 dθ2 + sin2 θ dφ2 ,                 (14.1)
                                                                                
      where a(τ, χ) and r(τ, χ) are generic positive functions. These coordinates are called Lemaître
      synchronous coordinates, the qualifier synchronous meaning that τ is the proper time of
      a observer staying at fixed value of the spatial coordinates (χ, θ, φ). Note that the function
      r(τ, χ) gives the areal radius of the 2-spheres defined by (τ, χ) = const, which are the orbits
      of the SO(3) group action (cf. Sec. 6.2.2), i.e. the metric area of these 2-spheres is 4πr(τ, χ)2 .
          For simplicity, we consider only pressureless matter, in the form of a perfect fluid with zero
      pressure. The matter energy-momentum tensor is then
T = ρ u ⊗ u, (14.2)
      where u is the 1-form metric-dual to the fluid 4-velocity u, i.e. the 1-form of components
      uα = gαµ uµ [cf. Eq. (A.45)], and ρ is a scalar field that can be interpreted as the fluid energy
      density measured in the fluid frame, by virtue of the identity ρ = T (u, u), which follows from
      ⟨u, u⟩ = g(u, u) = −1. Let us recall that the energy-momentum tensor of a generic perfect
      fluid is T = (ρ + p)u ⊗ u + pg, where p is the fluid pressure. Expression (14.2) corresponds
      thus to the special case p = 0. Inside the matter, we link the coordinates (τ, χ, θ, φ) to the fluid
      by demanding that they are comoving with the fluid, i.e. that any fluid particle stays at fixed
      values of (χ, θ, φ). Because the 4-velocity obeys uα = dxα /dτf , where τf is the fluid proper
      time [cf. Eq. (1.16)], this amounts to uχ = uθ = uφ = 0, hence
u = ∂τ . (14.3)
      A priori, one should only have u = uτ ∂τ , but the synchronous coordinate condition gτ τ = −1
      along with the normalization g(u, u) = −1 implies uτ = 1. Since uτ = dτ /dτf , we get
      τ = τf (up to some additive constant), which provides the physical interpretation of Lemaître
      coordinate τ as the fluid proper time.
         1
             See Sec. 6.2.2 for the precise definition of spherically symmetric.
                            14.2 Lemaître-Tolman equations and their solutions                          543
  The worldlines of the fluid particles of the pressureless matter model (14.2) are timelike
  geodesics of (M , g).
Proof. If one plugs the energy-momentum tensor (14.2) in the energy-momentum conservation
law (1.45), one obtains ∇µ (ρuµ uα ) = 0, i.e.
Now the two terms in the left-hand side of this equation are orthogonal to each other, as an
immediate consequence of the normalization of the 4-velocity u [Eq. (1.17)]: u · ∇u u = 0. In
particular, u is a timelike vector, while the 4-acceleration ∇u u is a spacelike one. Thus the
only way for Eq. (14.4) to hold is that each term in the left-hand side vanishes separately:
∇µ (ρuµ ) = 0 and uµ ∇µ uα = 0.
The second equation is nothing but the geodesic equation [Eq. (B.1)] for the field lines of u, i.e.
the fluid worldlines.
Each fluid particle is thus in free-fall and moves independently of its neighbors, which is not
surprising since the pressure is zero. This justify the term dust given to the matter model
(14.2).
                                                                1 ∂r
                                                a(τ, χ) =              ,                       (14.5)
                                                              f (χ) ∂χ
where f (χ) is an arbitrary function of χ. Accordingly, we may rewrite the metric (14.1) as
                                                     2
                                  1              ∂r
                                2
                                                           dχ2 + r(τ, χ)2 dθ2 + sin2 θ dφ2 .   (14.6)
                                                                                          
                     g = −dτ +
                               f (χ)2            ∂χ
The metric (14.6) is a solution of the Einstein equation (1.40) with the energy-momentum
   2
       For Λ = 0, this case leads to Datt solution [157].
544                              Black hole formation 1: dust collapse
      Proof. Taking into account (14.5), the χχ and τ τ components of the Einstein equation yield
      respectively to (14.7a) and (14.7b), cf. the notebook D.6.1. There is no other independent
      component of the Einstein equation.
          The function M (χ) is known in the literature as the Misner-Sharp mass or Misner-
      Sharp energy, in reference of a study by Misner and Sharp in 1964 [370], although it has been
      introduced by Lemaître [346] more than 30 years earlier. This quantity is invariantly defined
      for any spherically symmetric spacetime from the areal radius r:
                                                                          
                                                r                      Λ 2
                                           M :=                    µ
                                                          1 − ∇µ r∇ r − r .                                                     (14.8)
                                                2                      3
      It is easy to check that the above relation holds in the present case: we have, thanks to (14.1),
                                                         2                     2                2                        2
                 µ   ∂r ∂r
                         µν                          ∂r                      ∂r                 ∂r             1           ∂r
         ∇µ r∇ r = g         = gτ τ                            +g   χχ
                                                                                       =−                 +
                     ∂xµ ∂xν                         ∂τ                      ∂χ                 ∂τ          a(τ, χ)2       ∂χ
      In view of the Lemaître-Tolman equation (14.7a), we conclude that (14.8) holds for M = M (χ).
      Historical note : The Lemaître-Tolman system (14.7) has been first derived in 1932 by Georges Lemaître
      [346]: Eqs. (14.6), (14.7a) and (14.7b) are respectively Eqs. (8.1), (8.2) and (8.3) of Ref. [346], up to some
      slight change of notations. The system (14.7) however became known as Tolman model or Tolman-Bondi
      model, in reference to posterior works by Richard Tolman (1934) [482] and by Hermann Bondi (1947)
      [66]. This happened despite Tolman fully acknowledging Lemaître’s work [346] in his article [482]
      (Tolman actually met Lemaître in 1932-33 during the latter’s trip to United States [182]) and Bondi [66]
      mentioned that “Lemaître studies a problem very closely related to ours and many equations given in the
      appendix can be found in the (Lemaître’s) paper”. We refer to Eisenstaedt’s article [182] for a detailed
      historical study of Lemaître’s paper [346] (see also Krasiński’s note [323]). We follow the suggestion of
      Plebański & Krasiński [414] to call the system (14.7) Lemaître-Tolman, and not merely Lemaître, in order
      to distinguish it from other Lemaître contributions to general relativity and cosmology.
                     14.2 Lemaître-Tolman equations and their solutions                             545
In the above formulas, τ0 (χ) is an arbitrary function of χ. For E > 0 and E = 0, it sets the
value of τ for which r = 0, while for E < 0, it sets the value of τ for which r takes its maximal
value (m/|E|).
Exercise: prove that each of formulas (14.11)-(14.13) provides a solution of Eq. (14.9).
    We may summarize the above results as follows:
  Property 14.3: solution for spherical dust collapse
           3. depending of on the value of E(χ), use (14.11), (14.12) or (14.13) to get the solution
              for r(τ, χ);
           4. plug this solution into the remaining Lemaître-Tolman equation, Eq. (14.7b), to get
              ρ(τ, χ) and into (14.6) to get the metric tensor.
      this relation expressing the identity ∂ t/∂r∂τ = ∂ t/∂τ ∂r. In it, χ has to be considered as a
                                                   2           2
      function of (τ, r), which is given implicitly by one of the three expressions (14.11)-(14.13) with
      M (χ) = m. Accordingly, the above condition becomes
                                f ′ (χ) ∂χ                 ε         f ′ (χ) ∂χ
                            −                    =p                              .
                                f (χ)2 ∂r    τ     f (χ)2 − 1 + 2m/r f (χ)2 ∂τ r
                      14.2 Lemaître-Tolman equations and their solutions                                547
                                     ∂χ           ∂χ       ∂r
                                              +                     = 0.
                                     ∂τ   r       ∂r   τ   ∂τ   χ
Now this condition is always fulfilled: by the chain rule, the left-hand side is nothing but the
partial derivative ∂χ
                   ∂τ χ
                        , which is trivially zero if one considers χ as a function of (τ, χ). We
conclude that there exists a function t obeying (14.17), which proves that g is Schwarzschild
metric, whatever the choice of the function f (χ). Since our starting point was the most general
metric for a (possibly non-stationary) spherically symmetric spacetime [Eq. (14.1)], we have
proven a famous theorem of general relativity:
  In vacuum, the unique spherically symmetric solution of the 4-dimensional Einstein equa-
  tion with Λ = 0 [Eq. (1.44)] is the Schwarzschild metric.
   In particular, outside any spherically symmetric body, the spacetime is a piece of Schwarz-
schild spacetime. Note that this implies that this part of spacetime is static, even if the central
body is not (for instance it may oscillate radially, keeping its spherical symmetry). In particular,
there are no gravitational waves in spherical symmetry.
Remark 1: The Jebsen-Birkhoff theorem can be viewed as a generalization of the shell theorem version
of Gauss’s law in Newtonian gravity: the gravitational field outside any spherical source is entirely
determined by the total mass m of the source, being identical to that generated by a point of mass m
located at the symmetry center.
Historical note : The Jebsen-Birkhoff theorem has been first proved by the Norwegian physicist Jørg
Tofte Jebsen in 1920 [300]. It was independently established by the American mathematician George
D. Birkhoff in 1923 [62] and became famous under the name Birkhoff’s theorem, the work of Jebsen
remaining largely unknown until recently (see Ref. [303] for details).
                                                 2m
                               g = −dτ 2 +             dχ2 + r(τ, χ)2 dθ2 + sin2 θ dφ2 ,                (14.21)
                                                                                      
                                               r(τ, χ)
      where r(τ, χ) is given by Eq. (14.18). This is the expression of the Schwarzschild metric in
      terms of the Lemaître coordinates (τ, χ, θ, φ).
          The relation between Lemaître coordinates and Schwarzschild-Droste ones is obtained by
      setting f (χ) = 1 and3 ε = −1 in Eq. (14.17):
                                                       q
                                                                        2m
                                                                         r
                                                  dτ = dt +                    dr.                      (14.22)
                                                                   1   − 2mr
      By integration, we get
                                              r                        p                 !
                                                   r                    r/2m − 1
                                τ = t + 4m           + 2m ln           p                     + const.   (14.23)
                                                  2m                    r/2m + 1
      We deduce easily from these formulas the expression of the stationarity Killing vector ξ
      of Schwarzschild spacetime in terms of the Lemaître coordinates. Since ξ = ∂t [Eq. (6.6)],
      and the above formulas imply ∂τ /∂t = 1 and ∂χ/∂t = 1, we get, applying the chain rule
      ∂/∂t = ∂/∂τ × ∂τ /∂t + ∂/∂χ × ∂χ/∂t,
ξ = ∂τ + ∂χ . (14.25)
      Remark 2: Although very simple, the above relation shows that Lemaître coordinates are not adapted
      to the spacetime symmetry generated by the Killing vector ξ: the latter does not coincide with any
      coordinate vector ∂α of Lemaître coordinates. This reflects the fact that the metric components (14.21)
      depend on τ (via the function r(τ, χ)), in addition to χ.
         3
             ε = −1 follows from Eqs. (14.14) and (14.18).
                       14.2 Lemaître-Tolman equations and their solutions                                  549
    The vacuum hypothesis implies that we can no longer interpret Lemaître coordinates as
comoving with some free-falling dust, as in Sec. 14.2.1. However, the geodesic character of
these coordinates remains. Indeed, the vector u := ∂τ is geodesic with respect to the metric
(14.1): ∇u u = 0, as a direct calculation shows (cf. the notebook D.6.1). This implies that the
curves (χ, θ, φ) = const are timelike geodesics. Moreover, the conserved energy per unit mass
along these geodesics is (cf. Sec. B.5)
                      ε = −ξ · u = −(∂τ + ∂χ ) · ∂τ = − gτ τ − gχτ = 1,
                                                       |{z} |{z}
                                                                 −1     0
where use has been made of Eq. (14.25). ε = 1 means that the geodesics are marginally bound:
they describe the free fall from rest at infinity.
    As it is clear on the metric components (14.21), a key feature of Lemaître coordinates is to
be regular at r = 2m, i.e. across the event horizon of Schwarzschild spacetime, contrary to
Schwarzschild-Droste coordinates.
Historical note : The Schwarzschild metric in the form (14.21) has been obtained in 1932 by Georges
Lemaître [346], as a vacuum solution of the Lemaître-Tolman system: cf. Eq. (11.12) of Ref. [346].
Remarkably, Lemaître pointed out that the metric components (14.21) are regular at r = 2m and was the
first author to conclude that the singularity of Schwarzschild’s solution at r = 2m is a mere coordinate
singularity. As pointed out in the historical note on p. 194, eight years before, Arthur Eddington [177]
exhibited a coordinate system that is regular at r = 2m but he did not mention this feature.
which is nothing but the 3-dimensional Euclidean metric expressed in spherical coordinates
(r, θ, φ). This proves that Schwarzschild spacetime can be sliced by a family of flat hypersurfaces.
The associated 3+1 decomposition of the metric tensor is revealed by rewriting (14.26) as
                                           r           !2
                                               2m
                   g = −dτ 2 +                              + r2 dθ2 + sin2 θ dφ2 .             (14.27)
                                                                                 
                                    dr +          dτ
                                                r
550                             Black hole formation 1: dust collapse
      One reads on this expression     that the lapse function (see e.g. Ref. [227]) is N = 1 and that the
      shift vector is β = 2m/r ∂r . Finding a lapse function equal to one simply reflects that the
                           p
      coordinate time τ is the proper time of some observer, namelly the observer following the
      marginally bound radial geodesics discussed above.
          Another interesting property of Painlevé-Gullstrand coordinates, which they share with
      Lemaître ones, is to be regular at r = 2m: despite the vanishing of gτ τ there, as read on (14.26),
      the determinant of the metric components (14.26) is not vanishing, thanks to the off-diagonal
      term gτ r . Indeed, det (gαβ ) = −r4 sin2 θ, which is clearly nonzero, except on the axis θ = 0 or
      π.
      Remark 3: Painlevé-Gullstrand coordinates (τ, r, θ, φ) can be seen as a timelike analog of the ingoing
      Eddington-Finkelstein (IEF) coordinates (t̃, r, θ, φ) introduced in Sec. 6.3.2. Both coordinate systems are
      based on ingoing radial geodesics of Schwarzschild spacetime, these geodesics being null for IEF and
      timelike for Painlevé-Gullstrand.
         The reader is referred to Ref. [357] for a more detailed discussion of Painlevé-Gullstrand
      coordinates.
      Historical note : Painlevé-Gullstrand coordinates have been introduced in 1921 by the French mathe-
      matician Paul Painlevé [397], as well as by the Swedish physicist and ophthalmologist Allvar Gullstrand
      (1911 laureate of the Nobel Prize in Medicine) in 1922 [248].
    Let us denote by rs (τ ) the areal radius of the star, i.e. the coordinate r of its surface, as a
function of the proper time τ of a matter particle located at the surface. If the origin of τ is set
to the start of the collapse, we have
                                            rs (0) = r0 .                                     (14.29)
Due to the pressureless hypothesis, the surface particles follow timelike radial geodesics of
Schwarzschild spacetime, as studied in Sec. 7.3.2. Since the collapse starts from rest, the function
rs (τ ) is given by the geodesic solution (7.41):
                                   r
                                      r03
                             τ=           (η + sin η)
                            
                            
                                       8m
                                                                  0 ≤ η ≤ π,                       (14.30)
                             rs (τ ) = r0 (1 + cos η)
                            
                            
                            
                                         2
where the parameter η is zero at the start of the collapse and π at its end. The function rs (τ ) is
plotted in Fig. 14.1 (right). Its graph is an arch of cycloid, the parameter η corresponding to the
angle of rotation of the rolling circle defining the cycloid.
Remark 1: The “cycloidal” evolution law rs (τ ), as given by Eq. (14.30), is identical to that of dust ball
in Newtonian gravity, with τ being then the (universal) Newtonian time. This coincidence follows from
the Lemaître-Tolman equation (14.9) being identical to the first integral of a purely radial motion in
Newtonian gravity.
     The IEF coordinate time t̃ corresponding to the surface proper time τ is given by Eq. (7.45):
           r                                                          −1/2        
                r0     h       r0            i            η  r0                   η
  t̃ = 2m          −1 η+          (η + sin η) + 2 ln cos +          −1         sin      , (14.31)
               2m            4m                           2     2m                 2
where t̃ = 0 is assumed at the start of the collapse. Note that by combining Eqs. (14.30) and
(14.31), one can obtain rs = rs (t̃), i.e. the evolution law of the stellar surface in terms of the
IEF coordinate time t̃. The right panel in Fig. 7.4 depicts it for various values of m/r0 .
      We note that τ depends only on η and not on χ, contrary to what happens for the general
      solution (14.13). Moreover, thanks to the choice (14.33), τ matches the proper time at the
      surface of the star as given by Eq. (14.30). The areal radii at the surface must match as well.
      Let us denote by χs the surface value pof χ. The matching of the surface areal radii given by
      Eqs. (14.30) and (14.34) implies r0 = r03 /2m sin χs , from which we get the relation between
      χs and (m, r0 ):
                                                       r
                                                         2m
                                              sin χs =         .                              (14.35)
                                                          r0
      This implies sin χs > 0 and hence χs ∈ (0, π). There are actually two solutions of Eq. (14.35)
      for χs . Given that arcsin is a one-to-one map [0, 1] → [0, π/2], they are
                                                          r
                                                             2m
                                              χs = arcsin                                        (14.36)
                                                              r0
      and χs = π − arcsin 2m/r0 . For the physical scenario we have in mind (gravitational
                              p
      collapse of an “ordinary” stellar core), we shall retain only the solution (14.36), which implies
      χs ∈ (0, π/2]. The second solution, for which χs ∈ [π/2, π), will be discussed in Remark 3 on
      p. 559 below4 . Equation (14.36) show that χs is a function of the dimensionless ratio m/r0 ,
      which is the compactness of the initial star (some numerical values can be found in Table 14.1).
         4
           For the moment, we can note that such a solution would lead to the areal radius r being a decreasing function
      of χ near χs , since Eq. (14.34) shows that r(τ, χ) behaves as sin χ, which is decaying on (π/2, π). Such a behavior
      is certainly not expected in an “ordinary” star.
                                                     14.3 Oppenheimer-Snyder collapse                                                                     553
              3.0        q
                        τ 2m/r03                                                         3.0          η
                        rs (τ)/r0 = a(τ) sinχs /r0                                                    rs (τ)/r0 = a(τ) sinχs /r0
              2.5       ρ(τ) πr03 /(6m)                                                  2.5          ρ(τ) πr03 /(6m)
              2.0                                                                        2.0
 r s , τ, ρ
                                                                            η, r s , ρ
              1.5                                                                        1.5
1.0 1.0
0.5 0.5
              0.0                                                                        0.0
                    0         1π               1π        3π         π                          0.0   0.2   0.4     0.6q 0.8   1.0   1.2   1.4    1.6
                              4                2         4
                                                η                                                                  τ     2m/r03
Figure 14.1: Evolution of the areal radius rs (τ ) the collapsing star [Eq. (14.30); red curve] and of the proper
matter density ρ(τ ) [Eq. (14.39); blue curve] in terms of the conformal time η (left) or the matter proper time τ
(right). The function τ = τ (η) is depicted by the purple dashed curve, while its inverse, η = η(τ ), is depicted by
the orange dashed one. As indicated in the legends, the plot of metric factor a(τ ) is the same as that of rs (τ ) up
to a rescaling by sin χs . [Figures generated by the notebook D.6.2]
The upper limit π/2 can be excluded from the range of χs since it would correspond to r0 = 2m,
which is the areal radius of the black hole horizon in Schwarzschild geometry. The range of
the comoving coordinate χ in the interior of the star is thus
                                                                                         π
                                                              0 ≤ χ ≤ χs <                 ,                                                    (14.37)
                                                                                         2
with χ = 0 locating the center, since it corresponds to r = 0 by virtue of Eq. (14.34).
    By combining Eqs. (14.32), (14.33) and (14.35), we note that M (χ) is an increasing function
of χ in the range (14.37), with M (0) = 0 and
M (χs ) = m. (14.38)
    The matter density ρ(τ, χ) is deduced from Eq. (14.7b), with the above values of M (χ) and
r(τ, χ). We get a function of τ only, via η = η(τ ) [cf. Eq. (14.34)], which we rewrite as ρ(τ ):
                                                                            6m
                                                        ρ(τ ) =                       .                                                         (14.39)
                                                                  πr03 (1   + cos η)3
The independence of ρ from χ confirms that the choice (14.32) leads to a uniform density
object, i.e. a homogeneous star. Let us stress that the homogeneity is maintained during all
the collapse: at each instant of matter proper time τ , the density is uniform is all the star. The
function ρ(τ ) is plotted in Fig. 14.1.
Remark 2: In view of Eq. (14.30), we may rewrite Eq. (14.39) as
                                                                             m
                                                              ρ(τ ) =   4         3
                                                                                    .                                                           (14.40)
                                                                        3 πrs (τ )
554                             Black hole formation 1: dust collapse
      This formula happens to be identical to that giving the mass density of a uniform ball of mass m and
      radius rs (τ ) in Newtonian physics (flat spacetime), but there is no profound physical significance in this
      coincidence.
         It is instructive to express the initial energy density, ρ0 := ρ(τ = 0), in terms of the
      geometrical mass density unit m−2 and the compactness angle χs by combining Eqs. (14.39)
      and (14.35):
                                                    3 sin6 χs
                                              ρ0 =            .                            (14.41)
                                                      32πm2
         The metric factor a(τ, χ), which appears in Eq. (14.1), is deduced from Eq. (14.5), with
      ∂r/∂χ computed from expression (14.34); as for ρ, we get a function of τ only, which we
      rewrite as a(τ ):
                                                   a0
                                          a(τ ) =      (1 + cos η) .                       (14.42)
                                                   2
      We may then rewrite the second equation in (14.34) as r(τ, χ) = a(τ ) sin χ, so that the metric
      tensor (14.1) inside the star takes the form
                             a20
                                 (1 + cos η)2 −dη 2 + dχ2 + sin2 χ dθ2 + sin2 θ dφ2 .                     (14.45)
                                                                                  
                       g=
                             4
      The term inside square brackets is nothing but the metric g̃ of the Einstein cylinder introduced
      in Sec. 4.2.3 [compare Eq. (4.18) with the change of notation η ↔ τ ]. Hence, inside the star, the
      metric g is conformal to g̃: g = Ω2 g̃, with the conformal factor Ω := a0 /2(1 + cos η) = a(τ ).
      For this reason, η is called the conformal time. Note that while g̃ is a static metric [it obeys
      Eq. (5.5)], g is not, since the conformal factor depends on η.
          On each hypersurface η = const, or equivalently τ = const, the 3-metric h induced by the
      conformal metric g̃ is the standard metric of the hypersphere S3 , expressed in terms of the
      hyperspherical coordinates (χ, θ, φ):
      On the whole S3 , χ ranges from 0 to π. Since in the present case, χ ∈ [0, χs ] with χs < π/2
      [Eq. (14.37)], we may view each constant η slice of the interior of the collapsing star as a piece
                                   14.3 Oppenheimer-Snyder collapse                                            555
of S3 , scaled by the factor a(τ ) given by Eq. (14.42). If one uses the conformal coordinates
(η, χ) to draw a 2-dimensional spacetime diagram of the interior, as in the left part of Fig. 14.2,
then the radial null geodesics appear as straight lines inclined by ±45◦ with the horizontal,
reflecting the Minkowskian part −dη 2 + dχ2 of the metric (14.45).
    To show that the FLRW interior (14.45) along with the Schwarzschild exterior (14.28) form
a regular spacetime, one shall check that the Darmois junction conditions are fulfilled at the
interface between the two regions, i.e. on the timelike hypersurface Σ defining the surface
of the collapsing star. These conditions, first enounced by Georges Darmois [154] (see also
Sec. 21.13 of MTW textbook [371], as well as Ref. [332]), are the continuity of the induced
3-metric of Σ and of its extrinsic curvature, when these two tensors are computed on each side
of Σ. These geometric conditions translate the absence of surface layers on Σ. We shall not
check them here; the computation can be found in Sec. 3.8 of Poisson’s textbook [416].
Historical note : As mentioned above, the solution originally obtained by J. Robert Oppenheimer
and Hartland Snyder in 1939 [392] regards pressureless matter evolving on marginally bound timelike
geodesics, i.e. matter in free fall from r → +∞, where it is at rest at τ → −∞. More precisely,
Oppenheimer and Snyder’s original solution corresponds to the choice
                                                                        r
                                               χ3                         2χ3s
                      f (χ) = 1       M (χ) = m 3      and     τ0 (χ) =                      (14.47)
                                               χs                         9m
for the freely specifiable functions determining the solutions of the Lemaître-Tolman system (cf.
Sec. 14.2.4), instead of the choice (14.32) adopted in this chapter. Choosing f (χ) = 1 leads to E(χ) = 0
[cf. Eq. (14.10)] and hence to the solution (14.12), instead of (14.13). This solution is still homogeneous,
since ρ, computed via Eq. (14.7b), is a function of τ only, but it is less adapted to a star of finite size
collapsing from an equilibrium state than the solution (14.13). The latter, which we are using here, has
been first presented by David Beckedorff in 1962 [46] and popularized by B. Kent Harrison, Kip Thorne,
Masami Wakano and John Archibald Wheeler in their 1965 monograph on gravitational collapse [256],
as well as by Misner, Thorne and Wheeler in their 1973 textbook [371].
                                                           16
                    π
                                                           14
                                      0.50
                                                           12
                  3π
                  4
                                      1.00                 10
                                       1.50
                                                    t̃/m
                  1π                                       8
            η
2 2.00
                                       2.50                6
                                       3.00
                  1π                                       4
                  4
                                       3.50
                                                           2
                    0                                      0
                        0     1π
                              8
                                        1π
                                        4                       0      2          4         6         8         10
                               χ                                                      r/m
      Figure 14.2: Spacetime diagrams of the Oppenheimer-Snyder collapse in conformal coordinates (η, χ) (left
      figure, depicting only the interior) and in Eddington-Finkelstein coordinates (t̃, r) (right figure), for the initial
      compactness m/r0 = 1/4, which corresponds to χs = π/4. In both figures, solid red lines are worldlines of matter
      particles, uniformly sampling [0, χs ] with δχ = π/32, while dashed red lines denote isosurfaces of constant matter
      proper time τ uniformly sampling [0, τend ] with δτ = τend /8. The thick red line is the star’s surface and the thick
      magenta one is the star at the initial instant τ = 0. The orange zigzag line marks the curvature singularity. Solid
      (resp. dashed) green lines are outgoing (resp. ingoing) radial null geodesics encountering the star’s surface at the
      above selected values of τ . The thick black line is the black hole event horizon H and the dotted blue line is the
      centered future inner trapping horizon T . In the left figure, thin solid grey to black lines mark isosurfaces of
      constant values of the areal radius r(τ, χ) (labeled in units of m). [Figures generated by the notebook D.6.2]
                                     14.3 Oppenheimer-Snyder collapse                                                           557
                                      3m
                                                           t̃=8m
                                     r=
    2
                             0
                        r=
                                                                              3.6
    0                                                      t˜= 6m             3.4                                         10m
T
                                                                                                                   t̃ =
                                                                              3.2
                                                                                                              3m
                                      r=                                      3.0
                                                                                                             r=
                                              4m
    2                                                                         2.8              0
                                                                          T
                                                           t˜=                            r=
                                                                 4m
                                                                              2.6
                                                                                                                      4m
                                                           r=
                                                                              2.4
                                                                                                                   r=
    4                                                         5m              2.2                      t̃ = 8m
                                                       t̃ =
                                                                              2.0
                                                           0
surface of the star (thick solid red line) is initially tangent to the hypersurface r = 4m (solide
grey line), which is the geometrical translation of the collapse starting from rest. The part of
the diagram that lies to the right of the thick solid red line is identical to a part of the diagram
shown in Fig. 9.5, which is expected since both represent a piece of Schwarzschild spacetime.
A major difference with the Kruskal-Szekeres diagram of Fig. 9.5 is that the spacetime diagram
shown in Fig. 14.3 is not conformal in the stellar interior: the radial null geodesics are not
straight lines inclined by ±45◦ . The outgoing ones even run backwards in T in the left part of
the diagram, which means that causal relations cannot be simply inferred there.
    Finally let us consider a compactified conformal diagram of the Oppenheimer-Snyder col-
lapse, i.e. a Carter-Penrose diagram (cf. Secs. 9.4 and 10.8.1). Thanks to the compactification,
such a diagram offers a view of the entire spacetime. It is then natural to consider maximal
spacetimes, i.e. spacetimes in which each geodesic is inextendible, being complete or termi-
nating at a singularity (cf. Sec. B.3.2). This is clearly not the case of the spacetime depicted in
the diagrams of Figs. 14.2 and 14.3: when run in the past direction, causal geodesics in the star
interior stop on the hypersurface τ = 0 (the magenta line). In particular, the free-falling matter
geodesics, along which τ is an affine parameter, are clearly incomplete and could a priori be
extended to τ < 0.
    There are actually two ways to maximally extend the Oppenheimer-Snyder spacetime
considered up to now. The first one corresponds to a star in hydrostatic equilibrium for τ < 0
558                               Black hole formation 1: dust collapse
      Figure 14.4: Carter-Penrose diagram of the Oppenheimer-Snyder collapse generated by switching off the
      pressure at τ = 0 in a star in hydrostatic equilibrium for τ ∈ (−∞, 0). The stellar surface is indicated by the red
      curve and some outgoing radial null geodesics are drawn as green straight lines. The black hole event horizon H
      is indicated by the black line, the shaded region being the black hole interior. The dashed blue line marks the
      trapping horizon T discussed in Sec. 14.3.6, while the orange zigzag line marks the curvature singularity. The
      maroon curve is the worldline of the remote static observer O considered in Sec. 14.4.
      and in which the pressure drops suddenly to zero at τ = 0. This extension is “astrophysical”
      insofar as it models the gravitational collapse of the core of a massive star. Of course the
      equilibrium for τ < 0 requires some nonzero pressure inside the star. Therefore it cannot
      be described by the Lemaître-Tolman equations discussed in Sec. 14.2; it should rather obey
      the Tolman-Oppenheimer-Volkoff equations, which govern hydrostatic equilibria in spherical
      symmetry (see e.g. Chap. 23 of Ref. [371]). The Carter-Penrose diagram corresponding to this
      extension is drawn schematically (i.e. not by means of explicit coordinates) in Fig. 14.4. The
      star seems to expand from the past timelike infinity i− , but this is an artifact of the drawing5 :
      the star surface (red line) follows actually a curve r = const (= r0 ) for τ < 0. For τ ≥ 0,
      the surface of the star appears as a vertical line, as in the left part of Fig. 14.2, while the
      radius is actually decreasing. Actually, one may consider that this part of the Carter-Penrose
      diagram is drawn by means of the conformal coordinates (η, χ) used in Fig. 14.2. Note that
      the Carter-Penrose diagram of Fig. 14.4 corresponds to a maximally extended spacetime: all
      geodesics are inextendible. Note as well that the exterior region is formed by parts of regions
      MI and MII of Schwarzschild spacetime (compare with Fig. 9.10).
          The second maximal extension of the Oppenheimer-Snyder spacetime is based on the
         5
           Recall that a Carter-Penrose diagram reflects the causal structure of spacetime but does not respect the metric
      distances.
                                     14.3 Oppenheimer-Snyder collapse                                                559
Figure 14.5: Carter-Penrose diagram of the Oppenheimer-Snyder collapse starting at the instant of time
symmetry (τ = 0) after an expanding phase from a past singularity. The legend is the same as in Fig. 14.4, with in
addition the past curvature singularity (bottom orange zigzag line) and the white hole region (bottom shaded
area), bounded by the past event horizon H − .
assumption that the proper time τ = 0 is an instant of time symmetry of a ball of pressureless
matter that was expanding from an initial past singularity. The spacetime for τ < 0 is then
exactly the time-reversed of that for τ > 0, as illustrated on the Carter-Penrose diagram of
Fig. 14.5. Such an extension is less “astrophysical” than the one considered above, but it has the
advantage to invoke only pressureless matter and hence to be entirely described by an exact
solution of the Einstein equation: the Friedmann-Lemaître-Robertson-Walker solution (14.45)
extended to η ∈ (−π, π) inside the star and matched to the Schwarzschild solution outside the
star. This maximal solution contains a white hole region, i.e. a region in which causal curves
arising from the past null infinity I − cannot penetrate (cf. Sec. 4.4.2). The white hole (null)
boundary is the past event horizon H − (cf. Fig. 14.5). The pressureless matter starts its life
by expanding from a past curvature singularity where r = 0, as a “mini big-bang”, the areal
radius being given by Eq. (14.34) with η ∈ (−π, 0) in the expansion phase and η ∈ (0, π) in the
collapse phase. Note that the exterior region is formed by parts of regions MI , MII and MIV
of the maximally extended Schwarzschild spacetime (compare with Fig. 9.10). Note also that
the star is not visible from the infinite past to the remote static observer O: it only appears
to O after the event denoted by a in Fig. 14.5. On the contrary, in the “astrophysical” model
discussed above, the star is always visible to O (cf. Fig. 14.4), despite it fades very rapidly after
the collapse phase has started to be observed, as we shall see in Sec. 14.4.
Remark 3: The time-symmetric extended solution considered           above opens the path to pressureless
solutions with χs ∈ (π/2, π), i.e. with χs = π − arcsin 2m/r0 as the solution of the junction equation
                                                         p
(14.35). We disregarded these solutions in Sec. 14.3.2, after having noticed that they lead to the areal
radius r(τ, χ) being a decreasing function of χ for χ ∈ (π/2, χs ]. In other words, starting from the
center of the star at χ = 0 and keeping τ fixed, r(τ, χ) increases up to the maximum value rs (τ )/ sin χs ,
560                             Black hole formation 1: dust collapse
Figure 14.6: Carter-Penrose diagram of a semiclosed world. See Fig. 14.5 for the legend.
      achieved for χ = π/2, and then decreases to rs (τ ) for χ = χs . Such a behavior is possible if the star’s
      surface meets Schwarzschild spacetime in a region of the latter where r decreases when moving away
      from the surface (i.e. “to the right” if one places the star at the left of a Carter-Penrose diagram as in
      Figs. 14.4-14.5). This happens in the region X̂ < 0 of the compactified Kruskal-Szekeres coordinates (cf.
      Fig. 9.10). We may then place the star there; the corresponding Carter-Penrose diagram is shown in
      Fig. 14.6. We note that the star exterior comprises the entire region MI of the extended Schwarzschild
      spacetime, as well as parts of regions MII , MIII and MIV . Contrary to the solution for χs < π/2
      shown in Fig. 14.5, it contains the bifurcation sphere S , where H meets H − (cf. Sec. 9.3.3). Such a
      configuration is called a semiclosed world (cf. p. 149 of Ref. [531], Sec. 2.7 of Ref. [387] or Sec. 2.7.2 of
      Ref. [202]), because at the limit χs → π, the matter region becomes an entire FRLW closed universe.
      Note that the collapse phase (τ > 0) cannot be seen by a remote static observer, like O in Fig. 14.6, for it
      is hidden below the event horizon H ; only some initial part of the expanding phase is visible to O on
      the segment a − i+ of his worldline.
      Historical note : The semiclosed world solutions have been introduced by Oscar Klein in 1961 [317]
      and Yakov B. Zeldovich in 1962 [526]. They have been further studied by Igor D. Novikov in 1963 [386].
where the last equality has been obtained by using Eq. (14.41) to express m/ sin3 χs in terms of
the initial proper energy density ρ0 . We note that τend is a function of ρ0 only.
Remark 4: The above formulas for the collapse proper time τend are identical to those giving the
collapse time of a spherical ball of dust of mass m, initial radius r0 and initial mean mass density
ρ0 = 3m/(4πr03 ) in Newtonian gravity. This coincidence arises from the same cycloidal law for rs (τ )
in both general relativity and Newtonian gravity, as already discussed in Remark 1 on p. 551.
    Similarly, setting η = π in Eq. (14.31), we get the value of the IEF coordinate t̃ at the end of
the collapse:                                 r
                                           r0     r0            r        
                                                                    0
                     t̃end = 2m π 1 +                  − 1 − ln        −1 .                   (14.49)
                                          4m      2m               2m
Example 2: Some numerical values of τend are given in Table 14.1 for various astrophysical objects, if
they happen to collapse by a sudden lack of force acting against gravitation. One sees that the Earth
would collapse in 15 minutes, while it would take half an hour for the Sun, 3 seconds for a white dwarf
and a tenth of millisecond for a neutron star. The similarity of the values for the Earth and the Sun,
which have very different masses and radii, is due to the closeness of their values of ρ0 .
   Equation (14.39) with η → π immediately yields an infinite proper energy density at the
end of the collapse:
                                   lim ρ(τ ) = +∞.                                 (14.50)
                                              τ →τend
Since ρ is a measurable quantity (by comoving observers), this signals some physical singularity.
This corresponds actually to the apparition of a curvature singularity in the part of spacetime
covering the interior of the star (cf. the discussion in Sec. 6.3.4). Indeed, from the interior metric
(14.45), one can evaluate6 curvature scalars like the Ricci scalar
                                                            24          6m
                                   R := Rµµ =                        =          ,                (14.51)
                                                  a20 (1   + cos η)3   rs (τ )3
the “square” of the Ricci tensor
                                                        576         36m2
                                    Rµν Rµν =                     =                              (14.52)
                                                 a40 (1 + cos η)6   rs (τ )6
and the Kretschmann scalar defined by Eq. (6.44):
                                                               960         60m2
                              K := Rµνρσ Rµνρσ =                         =          .            (14.53)
                                                        a40 (1 + cos η)6   rs (τ )6
Remark 5: Although the components of the Riemann and Ricci tensors with respect to the coordinates
(η, χ, θ, φ) depend on χ (cf. the notebook D.6.3), the above three scalars are independent of it, in full
agreement with the homogeneity of the stellar interior at each instant η.
   6
       See the notebook D.6.3 for the computations.
562                                Black hole formation 1: dust collapse
      Table 14.1: Numerical values for the gravitational collapse of various astrophysical bodies, assuming that all
      forces counterbalancing gravitation suddenly disappear at the proper time τ = 0. m is the gravitational mass
      (constant during the collapse), r0 is the initial areal radius, m/r0 is the initial compactness, χs is the (constant)
      value of the hyperspherical angle χ at the surface of the body [Eq. (14.36)], ρ0 is the initial proper energy density,
      assuming that the body is homogeneous [Eq. (14.40) with τ = 0, or Eq. (14.41)], τend is the matter proper time at
      the end of the collapse, when the central singularity appears [Eq. (14.48)], τhb is the matter proper time when the
      black hole event horizon forms at the center of the body [Eq. (14.58)], τ∗ is the proper time when the horizon
      encounters the infalling surface [Eq. (14.55)] and ρhb is the proper energy density at τ = τhb [Eq. (14.59)].
          It is clear that the above three curvature invariants diverge when η → π, or equivalently,
      when rs (τ ) → 0. Hence a curvature singularity appears in the stellar interior at the end of
      the collapse. It is depicted by the orange zigzag segment in left spacetime diagram of Fig. 14.2.
      This singularity is shrunk to the point (t̃, r) = (t̃end , 0) in the IEF diagram on the right part of
      the figure, because rs (τend ) = 0. It is then connected to the curvature singularity at r = 0 of
      Schwarzschild spacetime, depicted as the vertical orange zigzag segment.
      Remark 6: The reader might be puzzled by the Kretschmann scalar (14.53) being different from that
      of Schwarzschild spacetime, as given by Eq. (6.45): KSchwarz = 48m2 /r6 . Both have the same m2 /r6
      structure but they differ by the numerical prefactor: 60 versus 48. This actually reflects the discontinuity
      of the Riemann tensor between the stellar interior and the Schwarzschild exterior. This discontinuity is
      enforced on the Ricci part of the Riemann tensor via the Einstein equation and the uniform density of
      the star, the latter yielding a jump of the energy-momentum tensor T between the value (14.2) and zero
      (vacuum exterior). The difference is even more dramatic on the Ricci scalar (14.51) and the square of the
      Ricci tensor (14.52), since they are identically zero in the Schwarzschild exterior. In other words, the
      Einstein equation implies that the metric tensor of the Oppenheimer-Snyder solution is not C 2 . For an
      actual stellar core, there is a density gradient so that ρ = 0 at the surface and the metric tensor is at
      least C 2 .
                                   14.3 Oppenheimer-Snyder collapse                                              563
        3: For the compactness 1/4 considered in Figs. 14.2-14.3, one has χs = π/4, so that η∗ = π/2
Example √
and τ∗ = 2(π + 2)m ≃ 7.27 m ≃ 0.82 τend .
Example 4: Values of τ∗ for the various objects considered in Example 2 are given in Table 14.1. We
note that for the collapsing Earth, Sun and white dwarf, τ∗ is very close to τend , while for the collapsing
neutron star, τ∗ ≃ 0.9 τend . This reflects the smallness of χs for the first three objects, since Eq. (14.55)
implies τ∗ /τend ≃ 1 − 4χ3s /(3π) for χs ≪ 1.
    For τ < τ∗ , one has rs (τ ) > 2m and the event horizon must be located inside the collapsing
star. To determine it, let us consider the outgoing radial null geodesics, which are drawn as
solid green curves in Fig. 14.2. In view of the metric (14.45), the equation of such a geodesic is
very simple in terms of the coordinates (η, χ): η = χ − χs + ηs , where the constant ηs is the
value of η when the geodesic reaches the surface of the star (χ = χs ). Radial null geodesics
for which ηs < η∗ emerge out of the star in a part of Schwarzschild spacetime where r > 2m,
so that they can espace to infinity. On the contrary, geodesics having ηs > η∗ emerge in the
black hole region of Schwarzschild spacetime; they are thus trapped and eventually end on the
curvature singularity at r = 0. The event horizon is generated by the null radial geodesics of
the limiting case: ηs = η∗ . Hence, inside the star, the equation of the black hole event horizon
H is η = χ − χs + η∗ . Given the value (14.54) for η∗ , we get
                                       H :       η = χ + π − 3χs .                                    (14.56)
For a fixed value of (θ, φ), this is the equation of a straight line segment inclined by +45◦ with
respect to the horizontal (cf. Fig. 14.2, left).
    The equation of H in terms of the internal IEF coordinate is obtained in parametric form,
with parameter χ ∈ [0, χs ] by substituting (14.56) for η in Eq. (14.31) providing t̃ and in
Eq. (14.34) providing r. One obtains then a curved segment in the (t̃, r) plane, which is drawn
in black in Fig. 14.2, right.
    In view of Eq. (14.56), we may state:
564                                    Black hole formation 1: dust collapse
0.012
0.010
                                    0.008
                          ρhb m 2
                                    0.006
0.004
                                            0            1π
                                                         9
                                                                              2π
                                                                              9
                                                                                                   1π
                                                                                                   3
                                                                   χs
      Figure 14.7: Proper energy density ρhb at the instant of the black hole formation at the center of the star, in
      units of m−2 , as a function of the initial compactness angle χs . [Figure generated by the notebook D.6.2]
            • If χs > π/3, or equivalently if m/r0 > 3/8, the event horizon H is already present
               in the star at τ = 0, at the location χ = 3χs − π.
            In all cases, the horizon expands according to Eq. (14.56) until it reaches the surface of
        the star (χ = χs ) at η = η∗ = π − 2χs or τ = τ∗ given by Eq. (14.55).
      Example 5: Continuing with Example 3, i.e. with m/r0 = 1/4, or equivalently with χs = π/4, we
      get ηhb = π/4, which corresponds to√the origin of the black straight segment drawn in Fig. 14.2, left.
      Equation (14.58) gives τhb = (2 + π/ 2)m ≃ 4.22 m ≃ 0.48 τend .
      Example 6: Values of τhb for the various objects considered in Examples 2 and 4 are given in Table 14.1.
      As for τ∗ , we note that for the collapsing Earth, Sun and white dwarf, τhb is very close to τend . This
                                    14.3 Oppenheimer-Snyder collapse                                         565
follows from τhb /τend ≃ 1 − 9χ3s /(2π) for χs ≪ 1. On the contrary, for the collapsing neutron star,
τhb ≃ 0.7 τend .
   For χs ≤ π/3, the matter proper energy density at the horizon birth, i.e. at τ = τhb , is
obtained by substituting (14.57) for η into Eq. (14.39); using the identity (14.35) to express
m3 /r03 in terms of χs , we get
                                                   3      sin6 χs
                              ρhb   := ρ(τhb ) =                       .                          (14.59)
                                                 4πm2 (1 − cos(3χs ))3
It is instructive to take the limit of small initial compactness m/r0 ≪ 1, which corresponds to
χs ≪ 1 by virtue of relation (14.36):
                                                          
                                          2            5 2
                                ρhb ≃         2
                                                   1 + χs + O(χ4s ).                     (14.60)
                                       243πm           4
We deduce from Eqs. (14.59)-(14.60) that
                2                                                         81
  lim ρhb =           ≃ 2.62 10−3 m−2           and      lim ρhb =             ≃ 1.26 10−2 m−2 .
  χs →0      243πm2                                    χs →π/3         2048πm2
Between these two limits, ρhb has a monotonic behavior, as shown in Fig. 14.7. In particular,
one has ρhb < 0.02 m−2 . Hence
  Property 14.6: matter density at the horizon’s birth
  By selecting a sufficiently large mass m, the proper energy density of matter when the
  black hole forms at the center of the collapsing star can be made arbitrarily small.
Example 7: Let us determine the mass m for which ρhb is as small as the density of water: ρhb =
103 kg m−3 . By restoring the G’s and the c’s, the density m−2 becomes c6 /G3 m−2 . We then get
m ≃ 4 107 M⊙ for small initial compactness (χs ≪ 1) and m ≃ 9 107 M⊙ for large initial compactness
(χs ∼ π/3).
  The relative density increase at the horizon birth with respect to the initial value ρ0 =
m/(4/3 πr03 ) is obtained by setting η = π − 3χs in Eq. (14.39):
                                    ρhb           8
                                        =                   .                        (14.61)
                                     ρ0    (1 − cos(3χs ))3
For small compactness, we get
                                       ρhb       64
                                             ∼        .                                  (14.62)
                                        ρ0 s 729χ6s
                                           χ →0
Hence ρhb /ρ0 is diverging very rapidly when χs → 0. On the contrary, for χs large, ρhb /ρ0 can
be moderate. Even, for χs = π/3, Eq. (14.61) leads to ρhb /ρ0 = 1, in agreement with τhb = 0
for that value of χs .
Example 8: Values of ρhb for the various objects considered in Examples 2, 4 and 6 are given in
Table 14.1. These values are all larger than the nuclear density ρnuc = 2 1017 kg m−3 . For the Earth, ρhb
is even ∼ 1012 ρnuc , in agreement with the diverging behavior (14.62). On the contrary, for the neutron
star, for which χs = 0.63, we have only ρhb /ρ0 ≃ 3.7. By comparing with Example 7, we note that all
the examples considered in Table 14.1 have a mass much smaller to that required for a low value of ρhb .
566                               Black hole formation 1: dust collapse
13.0
          3π
                                                  0.50                 12.5
          4
                                                  0.65
                                                                       12.0
                                           0.70
                                                                       11.5
                                                  1.00
                                   0.75
                                                                t̃/m
          5π
                                                                       11.0
      η
10.5
                                                                       10.0
                                           1.50
          1π
          2
                                                                        9.5
                                                  2.00
                                                                        9.0
               0                 1π
                                 8
                                                         1π
                                                         4                 0.0   0.5    1.0    1.5     2.0    2.5      3.0
                                  χ                                                           r/m
      Figure 14.8: Spacetime diagrams of the Oppenheimer-Snyder collapse, as in Fig. 14.2, but zooming around
      the trapped sphere S located at (η, χ) = (11π/16, 3π/16). The left diagram is drawn in terms of the conformal
      coordinates (η, χ), while the right one2is
                                              .50based on the Eddington-Finkelstein coordinates (t̃, r). See the legend of
      Fig. 14.2 for the meaning of the various curves.
                                                    3.00
                                                         [Figures generated by the notebook D.6.2]
where q is the determinant of the metric q with respect to any coordinate system of S that is
carried along with ℓ or k, such that (θ, φ), since θ and φ are constant along the integral curves of
                                                              √
ℓ and k (the radial null geodesics). From Eq. (14.63), we get q = (a20 /4)(1+cos η)2 sin2 χ sin θ.
                                                            √             √                √
Furthermore, in view of the components (14.64), Lℓ ln q = ℓµ ∂ ln q/∂xµ = ∂ ln q/∂η +
    √                    √             √                √              √
∂ ln q/∂χ and Lk ln q = k µ ∂ ln q/∂xµ = ∂ ln q/∂η − ∂ ln q/∂χ. We then get easily
and the sign of θ(ℓ) is that of cos(χ + η/2). Given that the above ranges of η and χ imply
0 < χ + η/2 < π, we have θ(ℓ) < 0 ⇐⇒ χ + η/2 > π/2, i.e.
  At late times, i.e. for η > π − 2χ, the spheres S defined by (η, χ) = const, with η < π
  and χ > 0, are trapped surfaces (θ(ℓ) < 0 and θ(k) < 0). These spheres are marginally
  trapped (θ(ℓ) = 0 and θ(k) < 0) for η = π − 2χ and untrapped (θ(ℓ) > 0 and θ(k) < 0) for
  η < π − 2χ.
Example 9: The trapped surface S corresponding to (η, χ) = (11π/16, 3π/16) in the Oppenheimer-
Snyder collapse with r√  0 = 4m is plotted as a brown dot in Fig. 14.8. According to Eq. (14.34), the areal
radius of S is rS = 2 2 sin(3π/16)(1 + cos(11π/16)) ≃ 0.70 m. The trapped character of S can be
read on Fig. 14.8: moving to the future along either the ingoing radial null geodesic (dashed green line)
or the outgoing one (solid green line) leads to spheres of smaller areal radius, and hence smaller area.
This is clear on the left diagram when noticing that both geodesics encounter the curve r = 0.65 m in
the immediate future of S . It is even more immediate on the right diagram since r is read directly on
the x-axis.
    The set formed by all the marginally trapped spheres of fixed (η, χ) is the hypersurface T
defined by θ(ℓ) = 0. Its equation is obtained by saturating the inequalities in Eq. (14.68):
T : η = π − 2χ. (14.69)
As we shall see in Chap. 18, T is called a future inner trapping horizon. It is drawn as a
dotted blue line in Figs. 14.2 and 14.8. Since the slope of that line is −2 and (η, χ) are conformal
coordinates, it is clear from the left diagrams in these figures that T is a timelike hypersurface.
This is actually a generic feature of future inner trapping horizons, as we shall see in Chap. 18.
568                            Black hole formation 1: dust collapse
      We note from Figs. 14.2 and 14.8 that T forms at the surface of the collapsing star at the
      very same instant where the event horizon H reaches the surface. Indeed, searching for the
      intersection H ∩ T from the equations defining the hypersurfaces H and T , i.e. Eqs. (14.56)
      and (14.69), we get (η, χ) = (π − 2χs , χs ), which is the 2-sphere representing the surface of
      the collapsing star at η = π − 2χs .
          The trapped surfaces S are located “above” T , i.e. at η > π − 2χ. The region containing
      trapped 2-spheres with r = const actually extend through the collapsing star’s surface up to
      the event horizon H . Indeed, the spacetime there is a piece of the region MII of Schwarzschild
      spacetime (cf. Sec. 6.2.5), where every 2-sphere at t̃ = const and r = const is trapped (this is
      obvious since both ingoing and outgoing radial null geodesics have decreasing areal radius
      r towards the future in MII , cf. Fig. 6.3). Outside H , i.e. in the MI region of Schwarzschild
      spacetime, there is no trapped surface.
      Remark 7: The trapping horizon T is not the boundary of the region where there exist trapped surfaces
      in the Oppenheimer-Snyder collapse, but only the boundary for trapped 2-spheres S centered around
      χ = 0. Indeed, since the collapsing stellar interior is homogeneous, the trapped 2-spheres S can be
      translated in χ at fixed η, leading to other trapped surfaces. One can then show that such “translated”
      trapped 2-spheres exist in all the region η > π − χs − χ, which extends to the past of T , since for
      χ ∈ [0, χs ), π − χs − χ < π − 2χ. Furthermore, there exist more complicated trapped surfaces even
      below this limit, see Ref. [56] for details.
      where t̃rec
               i is the IEF coordinate t̃ of the reception event of signal no. i by observer O and t̃ (τi )
                                                                                                     em
      is the IEF coordinate t̃ of the emission event taking place at proper time τi ; t̃ (τi ) is given by
                                                                                         em
      Eq. (14.31) with η being the function of τi defined implicitly by Eq. (14.30). We deduce from the
                                 14.4 Observing the black hole formation                                             569
25
20
15
t̃/m 10
                                               0
                                                    0   2   4    6   8   10
                                                            r/m
Figure 14.9:       Spacetime diagram of the Oppenheimer-Snyder collapse, as in the right panel of Fig. 14.2,
illustrating the increasing delay in the reception by a remote static observer O (maroon worldline at r = 10 m) of
signals emitted radially from the surface of the collapsing star with a constant interval of matter proper time,
δτ = τend /8. See the caption of Fig. 14.2 for the legend of the various curves. [Figure generated by the notebook
D.6.2]
two reception events. The quantities t̃em (τ2 ) − t̃em (τ1 ) and rs (τ1 ) − rs (τ2 ) are always finite; on
the other side, the logarithm term diverges when rs (τ2 ) approaches 2m, i.e. when τ2 approaches
τ∗ [cf. Eq. (14.55)], so that
                                  t̃rec   rec
                                    2 − t̃1 → +∞                when τ2 → τ∗ .                           (14.71)
In particular, if a sequence of signals is emitted from the star’s surface at a constant rate in
terms of the comoving proper time τ , the signals are received by the remote observer O with an
increasing delay as the collapse proceeds. The delay between two successive signals becoming
infinite, there exists a very last signal received by O. All subsequent signals of the emitted
sequence do not cross the event horizon, i.e. are captured into the black hole.
570                              Black hole formation 1: dust collapse
Figure 14.10: Image of the collapsing star on the screen of the remote observer O, at a given instant tO .
      Example 10: Consider Fig. 14.9, which depicts the Oppenheimer-Snyder collapse with r0 = 4m, or
      equivalently χs = π/4. Starting from τ = 0, a uniform sequence of 8 signals is emitted from the     √ star’s
      surface, the proper time separation between two successive signals being δτ = τend /8 = π 2m/4
      (cf. Example 1 on p. 561). One sees graphically that the signals are received by O with an increasing
      separation δ t̃. The last received signal is the 7th one. The 8th signal is trapped in the black hole region
      and ends on the curvature singularity.
          We conclude from the above analysis:
        For a remote static observer, the collapse, as monitored by the reception of signals regularily
        emitted from the star’s surface, appears to slow down and even to freeze completely when
        the areal radius of the star tends towards 2m. In particular, the remote observer never
        sees the star’s surface crossing the event horizon and does not perceive at all the final
        singularity.
      Historical note : The above behavior explains why the name frozen star was given to black holes
      before the name black hole itself was coined (cf. the historical note on p. 109).
coordinates (t, r, θ, φ), as in Chap. 8. These coordinates are regular in the Schwarzschild exterior
where the null geodesics propagate to the remote observer O and they make computations
slightly simpler than IEF coordinates do.
    Due to spherical symmetry, the image of the star at a given instant tO of observer O’s proper
time is a disk and is invariant under rotation around the disk center. Since O is assumed to lie
far away in the weak-field region, tO coincides with the Schwarzschild-Droste time coordinate
at the location of O. As discussed in Sec. 8.5.1, the distance of a screen pixel from the disk center
is proportional to the impact parameter b of the null geodesic carrying the photon hitting this
pixel. Accordingly, all the properties of the image (redshift and intensity at a given point) are
functions of the pair (tO , b), where b can be identified with the radial coordinate in the screen
plane, centered on the image (cf. Fig. 14.10). The pixel at the disk center corresponds to a null
geodesic having b = 0, i.e. traveling radially in the direction θ = θO = π/2 and φ = φO = 0.
The image’s rim, i.e. the circle bounding the disk, is located at some maximal value of b, b⋆ (tO )
say. A great amount of information on b⋆ (tO ) can be inferred from the effective potential
analysis developed in Sec. 8.2.3. Indeed, along a given null geodesic L of impact parameter b,
the following constraint must hold in the exterior of the star:
                                          1         r
                                     b≤ p      =p         ,                                          (14.72)
                                         U (r)   1 − 2m/r
where U (r) is the effective potential (8.14): U (r) := (1 − 2m/r)/r2 . This follows directly from
(dr/dλ̃)2 ≥ 0 in Eq. (8.11). Prior to the collapse, let us assume that the surface of the star is
static and located at r = r0 . Let us then denote by t∗∗O the value of tO at which the disk image
starts to shrink. This corresponds to null geodesics emitted from the star’s surface at the matter
proper time τ = 0 arriving at the disk p boundary. For tO ≤ tO , one has then r ≥ r0 all along
                                                                 ∗∗
L . If we assume r0 > 3m, then 1/ U (r) is increasing with r all along L , given that U (r)
is a decreasing function p on the interval (3m, +∞) (cf. Fig. 8.1). The constraint  (14.72) is then
equivalent to b ≤ 1/ U (r0 ), so that the maximal value of b is b0 := 1/ U (r0 ) and we can
                                                                             p
write:
                                       r0
                 b⋆ (tO ) = b0 := p                for tO ≤ t∗∗O    and r0 > 3m.             (14.73)
                                    1 − 2m/r0
For tO ≤ t∗∗O , a null geodesic at the image’s rim, i.e having b = b⋆ (tO ), fullfils thus U (r0 ) = b ,
                                                                                                      −2
which implies dr/dλ̃ = 0 at r = r0 according to Eq. (8.11). This means that the emission point
(r = r0 ) is a periastron of the geodesic: it corresponds to the minimal value of r along L ,
even if the latter is extended infinitely far in the past. This implies that L is tangent to the
hypersurface r = r0 at the emission point, i.e. that the photon is emitted in a direction tangent
to the star’s surface.
Remark 1: In terms of the compactness angle χs defined by Eq. (14.36), formula (14.73) takes a simple
form:
                       b⋆ (tO ) = r0 cos χs for tO ≤ t∗∗
                                                      O    and r0 > 3m.                        (14.74)
In the non-relativistic limit, χs → 0 and we get b⋆ (tO ) = r0 , as expected (recall that the trully observed
quantity through a telescope is the small angle b̂ = b/rO [Eq. (8.118)], so that b̂ = r0 /rO at the disk’s
rim).
572                            Black hole formation 1: dust collapse
          One can also use the effective potential analysis to get the value of b⋆ (tO ) at late times,
      more precisely for images formed by photons emitted after the collapsing star has shrunk
      below the photon sphere  p introduced in Sec. 8.2.3, which corresponds to rs (τ ) < 3m. Since
      the function r 7→ 1/ U (r) is decreasing on (2m, 3m), reaches a minimum at r = 3m
      and increases pon (3m, +∞) (cf. the plot of  √ U (r) in Fig. 8.1), the constraint (14.72) becomes
      b ≤ min(1/ U (r)) = bc , where bc = 3 3 m [Eq. (8.21)]. An example of photon emitted
      at r < 3m and fulfilling this constraint is represented by the trajectory no. 3 in Fig. 8.1.
      Furthermore, the constraint is actually a sufficient condition for any photon emitted outward
      to reach infinity (cf. Fig. 8.1 once again). However, for b close to bc , photons perform a large
      number of orbits very close to the photon sphere, this number being actually infinite for b = bc
      (critical null geodesics), as discussed in Sec. 8.3.2 (cf. Figs. 8.7 to 8.9). As a consequence, the
      elapsed time coordinate t along the corresponding null geodesic diverges as b → bc , so that the
      image boundary b⋆ (tO ) = bc is reached only for tO → +∞:
                                                              √
                                             lim b⋆ (tO ) = 3 3 m.                                (14.75)
                                            tO →+∞
      However, one can show (see e.g. Ref. [524] for details) that b⋆ (tO ) converges to bc exponentially
      fast: b⋆ (tO ) − bc ∝ exp(−tO /bc ).
          To summarize:
            • If the initial star is not ultracompact, i.e. if its arealp radius r0 is larger than 3m,
              the radius
                       √   of the  disk image  shrinks from  b 0 =  r 0 /  1 − 2m/r0 [Eq. (14.73)] to
              b∞ := 3 3 m [Eq. (14.75)]. The latter value is reached only asymptotically,    √ i.e. for
              tO → +∞, but the convergence is exponentially fast, so that, b⋆ (tO ) ≃ 3 3 m is a
              very good approximation for tO > t∗∗  O + αm, where α ∼ 2bc /m ∼ 10.
            • Collapsing stars that are initially ultracompact (r0 < 3m) generate
                                                                          √       images of constant
              size: the radius of the disk image remains fixed at b⋆ = 3 3 m.
      Remark 2: The above results are not specific to pressureless collapse: they hold for any spherically
      symmetric gravitational collapse.
      Example 11: For our favorite√example with r0 =√4m (Figs. 14.2, 14.3, 14.8 and 14.9), the radius of
      the image disk evolves from 4 2 m ≃ 5.66 m to 3 3 m ≃ 5.20 m, which constitutes a pretty modest
      decrease: only 8%. This is of course because such a star has a large compactness. For the neutron star
      considered in Table 14.1, which has a smaller compactness: m/r0 = 0.17, the shrinking factor is higher:
      28%. For a concrete example of images of the gravitational collapse of an unstable neutron star into a
      black hole computed by means of numerical relativity, see Fig. 4 of Ref. [492], where the modest decrease
      of the disk image is very apparent.
          The photons that form the star’s image on O’s screen at a given instant tO have impact
      parameters b in the range 0 ≤ b ≤ b⋆ (tO ) and have been emitted from the star’s surface
      at various proper times τ , depending on the value of b. Indeed, let us evaluate the elapsed
                                14.4 Observing the black hole formation                                          573
By integrating Eq. (14.76a) between the emission and reception points, we get
                                Z   rO           −1
                                               2m                 −1/2
          tO − tem (b, tO ) =               1−        1 − b2 U (r)      dr + ∆,                      (14.79)
                              rsem (b,tO )      r
574                            Black hole formation 1: dust collapse
      where ∆ = 0 if there is no r-turning point (periastron) along the photon geodesic L and
                                    Z   rsem (b,tO )              −1
                                                              2m                     −1/2
                            ∆=2                            1−            1 − b2 U (r)      dr         (14.80)
                                     rp                        r
      if there is a one, located at r = rp < rsem (b, tO ). For a given observation instant tO and a
      given position on the screen, measured by b, Eq. (14.79) must be solved to get tem (b, tO ). In
      general, this is a difficult problem since the lower boundary of the integral, rsem (b, tO ), depends
      on tem (b, tO ) in a complicated way, via the parametric system (14.77). However, for b = 0
      (radial geodesic), one can compute the integral in Eq. (14.79) in an elementary way; taking into
      account that ∆ = 0 for b = 0 (no r-turning point along a radial geodesic), we obtain
                                                                     em                
                                                                     rs (0, tO ) − 2m
                                                 em
                       tem (0, tO ) = tO − rO + rs (0, tO ) + 2m ln                       .          (14.81)
                                                                         rO − 2m
      This equation must still be solved to get tem (0, tO ) since rsem (0, tO ) is a function of tem (0, tO )
      given parametrically by the system (14.77).
          Another simplification of the general formula (14.79) occurs for tem (b, tO ) < 0 (emission
      before the start of the collapse), since then rsem (b, tO ) = r0 [Eq. (14.78b)]. Equation (14.79)
      provides then directly the value of tem (b, tO ) and we see that this is a decreasing function of b.
                                                −1/2
      Indeed, at fixed r, b 7→ (1 − b2 U (r))        is an increasing function of b, given that U (r) > 0.
      Consequently, the first value of b for which tem (b, tO ) ≥ 0, i.e. for which the collapse starts to
      be perceptible, is b = 0. In other words, the collapse manifests itself first at the center of the
      disk image. Let us then denote by t∗O the corresponding value of tO . It is obtained by setting
      tem (0, tO ) = 0 and rsem (0, tO ) = r0 in Eq. (14.81):
                                                                        
                                                                 rO − 2m
                                         ∗
                                       tO = rO − r0 + 2m ln                 .                        (14.82)
                                                                 r0 − 2m
      Since r2 − b20 (1 − 2m/r) ≥ 0 for r ≥ r0 , this expression shows that the time delay t∗∗
                                                                                            O − tO is
                                                                                                 ∗
      positive.
                                14.4 Observing the black hole formation                                       575
                                                                                         (14.88)
                                                            
                                      us = Γ e(t) − V e(r) ,
with                      s                                                 s
                              2m     2m                                              2m
                                    −                         1                 1−
                   V :=        rs    r0
                                    2m       and       Γ := √     =                  r0
                                                                                     2m            (14.89)
                              1−    r0                       1−V2               1−    rs
By comparing with the generic formulas (1.35)-(1.36), we conclude that V is the norm of the
velocity V = −V e(r) of Oem with respect to the local static observer and Γ is the corresponding
Lorentz factor.
Remark 3: Formula (14.89) implies limrs →2m V = 1. We recover then a well known result: with respect
to a static observer located just outside the event horizon, a radially free-falling body is falling at the
speed of light.
                                                                                                  (t)
    Observer Oem is endowed with an orthonormal frame (us , ns , es2 , es3 ), where ns = ns e(t) +
 (r)
ns e(r) is a unit spacelike vector normal to the star surface and oriented towards the exterior,
576                            Black hole formation 1: dust collapse
      while the vectors (es2 , es3 ) are unit spacelike vectors tangent to the star surface. The components
        (t)  (r)
      (ns , ns ) of ns are obtained from the two conditions us · ns = 0 and ns · ns = 1, along with
      (14.88):
                                                                                                     (14.90)
                                                                      
                                               ns = Γ −V e(t) + e(r) .
      Remark 4: Equations (14.88) and (14.90) simply express that the orthonormal pairs (us , ns ) and
      (e(t) , e(r) ) are connected by a Lorentz boost of velocity parameter −V .
      where ϵr = ±1 and ϵL = ±1. Far from the star, one has of course ϵr = +1, but, as we shall
      see below, ϵr = −1 may occur close to the star, which corresponds to a photon emitted in the
      inward direction (decreasing r); in this case, the photon will reach a periastron, where ϵr is
      switched to +1.
          The scalar product in Eq. (14.92) is easily evaluated from expressions (14.88) and (14.93),
      owing to the orthonormality of the frame (e(α) ); one gets
                                                   −1/2 
                               Eem              2m                                 
                                                                                              (14.94)
                                                                    p
                      1+z =         =Γ 1−                   1 + ϵr V 1 − b2 U (rs ) .
                               E∞                rs
      Substituting expressions (14.89) for V and Γ, as well as (8.14) for U (r), yields
                             −1 "r                 r            s                    #
                          2m                2m         2m 2m              b2        2m
           1+z = 1−                    1−       + ϵr       −        1− 2 1−                .        (14.95)
                           rs                r0         rs    r0          rs         rs
      This formula gives the redshift of a photon that hits the screen of the remote observer O at the
      impact parameter b at the instant tO . The quantity rs has to be replaced by rsem (b, tO ) as given
      by the system (14.77) for the pressureless collapse, where tem (b, tO ) is given by Eq. (14.79).
                                14.4 Observing the black hole formation                                           577
Remark 5: Equation (14.94) is valid for any spherically symmetric gravitational collapse, provided that
V is the velocity of the surface with respect to the local static observer as defined by Eq. (14.88) and Γ
is the corresponding Lorentz factor. On the contrary, Eq. (14.95) holds only for a pressureless collapse
starting from rest.
Emission angle
The direction of emission in the frame of the comoving observer Oem is characterized by the
angle Θem between the photon’s linear momentum Pem and the normal to the surface ns .
Considering that the collapsing matter is opaque to radiation, the maximum inclination of an
emitted photon capable to reach the remote observer O is Θem = π/2 (emission tangent to the
surface). Hence any photon received by O has necessarily
                                                   π
                                       0 ≤ Θem ≤ .                                   (14.96)
                                                   2
By definition, Θem is given by
                                                    ns · Pem
                                         cos Θem = √          .                                        (14.97)
                                                    Pem · Pem
                                 √
Now, from Eq. (14.91), we have Pem · Pem = Eem (using that p is a null vector) and ns · Pem =
ns · p. Evaluating this last scalar product by means of Eqs. (14.90) and (14.93) and substituting
expression (14.94) for Eem /E∞ , we get
                                                   p
                                            V + ϵr 1 − b2 U (rs )
                                cos Θem =          p               .                      (14.98)
                                           1 + ϵr V 1 − b2 U (rs )
This relation can be easily inverted to get
                                                    cos Θem − V
                                                                                                       (14.99)
                                p
                              ϵr 1 − b2 U (rs ) =                 ,
                                                   1 − V cos Θem
which allows us to express b in terms of Θem :
                                         √
                                            1 − V 2 sin Θem
                                 b= p                           .                                    (14.100)
                                        U (rs )(1 − V cos Θem )
Remark 6: For V = 0, i.e. when the comoving observer coincides with the local static observer (prior
to the start of the collapse), Eqs. (14.98) and (14.100) reduce respectively to Eqs. (8.126) and (8.127) taking
into account the change of notations rs ↔ rem and Θem ↔ |η|.
    We may also substitute Eq. (14.99) into Eq. (14.94) to get an expression of the redshift in
terms of Θem :
                                            −1/2
                                         2m                  1
                          1+z = 1−                                     .                 (14.101)
                                          rs       Γ (1 − V cos Θem )
For a pressureless collapse, Γ and V are given by Eq. (14.89) and the above relation simplifies to
                                                              1
                              1+z = q                     q                            .             (14.102)
                                                 2m           2m        2m
                                     1−          r0
                                                      −        rs
                                                                    −   r0
                                                                             cos Θem
578                             Black hole formation 1: dust collapse
      Central redshift
      The redshift zc at the center of the image of a pressureless collapse is obtained by setting b = 0
      (which implies ϵr = +1) in Eq. (14.95):
                                           −1 "r             s                      #
                                    2m                  2m            2m          2m
              1 + zc = 1 − em                      1−       +                   −       ,        (14.103)
                                rs (0, tO )              r0       rsem (0, tO )    r0
      where we have used rsem (0, tO ) for rs . One obtains an equivalent expression if one set Θem = 0
      (radial emission) in Eq. (14.102), in agreement with the property b = 0 ⇐⇒ Θem = 0, which
      follows from relation (14.100).
          For tem (0, tO ) ≤ 0, rsem (0, tO ) = r0 and formula (14.103) reduces to 1+zc = (1−2m/r0 )−1/2 ,
      which is the standard expression for the redshift at infinity of a photon emitted by a static
      observer at r = r0 in the Schwarzschild spacetime.
          For rsem (0, tO ) → 2m, formula (14.103) yields
                                                   r                           −1
                                                          2m            2m
                                  1 + zc ∼ 2 1 −                1−                  ,            (14.104)
                                           rs →2m          r0       rs (0, tO )
      which implies z → +∞ for rsem (0, tO ) → 2m.
          The elapsed proper time of observer O since the reception of the central photon emitted at
      τ = 0 (start of the collapse) is tO − t∗O . We get, from Eqs. (14.81) and (14.82),
                                                             r                    em            
               ∗                    em                          0
                                                                                   rs (0, tO )
         tO − tO = tem (0, tO ) − rs (0, tO ) + r0 + 2m ln        − 1 − 2m ln                   −1 .
                                                              2m                         2m
      When rsem (0, tO ) → 2m, two terms in the right-hand side of the above equation diverge: the
      last logarithm and the first term, tem (0, tO ), due to the singularity of Schwarzschild-Droste
      coordinates at r = 2m. We can express the latter divergence in terms of a logarithm involving
      rsem (0, tO ) thanks to Eq. (6.32): tem (0, tO ) = t̃em (0, tO ) − 2m ln [rsem (0, tO )/2m − 1] + K, where
      t̃em (0, tO ) is the IEF coordinate of the emission event of the photon that reaches the center of
      O’s screen at the instant tO and K is a constant. t̃em (0, tO ) is finite (cf. Fig. 14.9) and we get
                                                                r                             em           
                ∗                   em                               0
                                                                                              rs (0, tO )
        tO − tO = t̃em (0, tO ) − rs (0, tO ) + r0 + 2m ln              − 1 + K −4m ln                     −1 .
                      |                           {z               2m             }               2m
                                                A
      The terms denoted by A are bounded when rsem (0, tO ) → 2m, so that
                                em             
               ∗                 rs (0, tO )                    2m                   ∗
         tO − tO ∼ −4m ln                    − 1 ⇐⇒ 1 − em                 ∼ e−(tO −tO )/(4m) .
                 rs →2m             2m                       rs (0, tO ) O
                                                                         t →+∞
      Combining with Eq. (14.104), we obtain the expression of the central redshift at late times:
                                               r
                                                      2m (tO −t∗O )/(4m)
                               1 + zc ∼ 2 1 −             e              .                    (14.105)
                                      tO →+∞           r0
      Hence we conclude:
                             14.4 Observing the black hole formation                                    579
  The photons arriving at the center of the image (b = 0) suffer an increasing redshift. At late
  times, the redshift grows exponentially with a characteristic time scale 4m. Equivalently,
  the energies of the central photons vanish exponentially fast.
  The image’s rim, or limb, at b = b⋆ (tO ) (cf. Sec. 14.4.2), is formed by photons emitted
  tangentially to the star’s surface. In other words, the image’s rim is formed by photons
  emitted in the direction Θem = π/2 with respect to the normal in the frame of the comoving
  observer Oem .
We have demonstrated this property for tO ≤ t∗∗   O , i.e. prior to the collapse, in Sec. 14.4.2. We
shall admit that it remains true for tO > tO .
                                            ∗∗
     One has then cos Θem = 0 for b = b⋆ (tO ) and Eq. (14.98) implies ϵr = −1 as soon as V > 0,
i.e. when the collapse has started. Hence these photons are emitted in the inward direction.
Their geodesics have then necessarily a r-turning point (periastron), after which r is increasing
monotonically (ϵr = +1) until reaching observer O.
     The redshift z⋆ at the image’s rim is obtained by setting Θem = π/2 in Eq. (14.101):
                                                               −1/2
                                                       2m
                              1 + z⋆ = Γ −1
                                              1 − em                  ,                      (14.106)
                                                  rs (b⋆ , tO )
where b⋆ stands for b⋆ (tO ). For pressureless collapse, Γ can be substituted by expression (14.89);
this makes the terms involving rsem (b⋆ , tO ) cancel out. Hence
  For a pressureless collapse starting from rest, the redshift at the image’s rim keeps a
  constant value:
                                                    −1/2
                                                 2m
                                 1 + z⋆ = 1 −              .                      (14.107)
                                                  r0
This rather surprising result can be understood by considering the general expression (14.106):
1+z⋆ is the product of two competing factors: the “static” gravitational redshift (1−2m/rs )−1/2 ,
which is always larger than one and diverges as rs → 2m, and the Doppler blueshift factor
Γ−1 , which is always lower than one and tends to zero as rs → 2m (since V → 1 in that limit).
The Doppler blueshift is of course due to the rim’s photons being emitted inward, which is the
direction of the matter motion.
580                              Black hole formation 1: dust collapse
      Redshift profile
      The redshift across the image, i.e. z as a function of b at fixed tO , is given by Eq. (14.94) for a
      generic collapse and by Eq. (14.95) for a pressureless collapse from rest. However, one has to
      solve for rs (b, tO ) to make these formulas fully explicit. Doing so (see e.g. Fig. 2 of Ref. [333]
      or Fig. 6 of Ref. [524]), one obtains a redshift which is maximal at the image’s center (b = 0),
      where it is given by formulas (14.103), and increases exponentially with time [Eq. (14.105)].
      The redshift decreases with b until a small annulus near the image’s rim (b = b⋆ (tO )), where it
      takes the constant value (14.107). The width of this annulus decays to zero with time.
      where z = z(b, tO ) is the redshift at the location b on the image at time tO as given by Eq. (14.94)
      or (14.95), Θem is the emission angle expressed in terms of (b, tO ) by Eq. (14.98), tem = tem (b, tO )
      is the emission coordinate time expressed via Eq. (14.79) and νem is the emission frequency. By
      the very definition of the redshift, νem is related to ν by νem = (1 + z)ν [cf. Eq. (14.85)]. The
         7
           The notation Iν follows a standard convention in the literature, the subscript ν being merely an indicator to
      distinguish this quantity from the intensity I, which is integrated over all frequencies.
                                14.4 Observing the black hole formation                                            581
function Iνem (Θem , tem , νem ) — the specific intensity at the source — depends on the model of
emission assumed for the surface of the collapsing star.
   The intensity I is the specific intensity integrated over all the frequencies:
                                               Z +∞
                                   I(b, tO ) =       Iν (b, tO , ν) dν                    (14.110)
                                                      0
                                          Z +∞
                    I(b, tO ) = (1 + z)−4
                                               Iνem (Θem , tem , νem ) dνem .                         (14.111)
                                                   0
In view of the central redshift behavior (14.105), we deduce from this formula that the intensity
at the center of the image decreases exponentially with a characteristic time equal to m:
                                                                      ∗
                                     I(0, tO )     ∼      I0 e−(tO −tO )/m .                          (14.112)
                                                 tO →+∞
    The specific flux of the image is obtained by integrating the flux element (14.108) over the
solid angle dΩ. Using dΩ = b̂ db̂ dϕ = b/rO2 db dϕ, we get
                                          2π b⋆
                                             Z
                             Fν (tO , ν) = 2      Iν (b, tO , ν) b db.                  (14.113)
                                          rO 0
    The flux of the image is the specific flux integrated over all the frequencies:
                               Z +∞
                                                      2π b⋆
                                                         Z
                     F (tO ) =      Fν (tO , ν) dν = 2       I(b, tO ) b db.                          (14.114)
                                0                     rO 0
It can be shown (see e.g. Refs. [12, 455]) that, independently of the emission model, the flux is
decaying exponentially according to
                                                                  ∗
                                                                          √
                                   F (tO )     ∼       F0 e−(tO −tO )/(3      3m)
                                                                                    .                 (14.115)
                                             tO →+∞
Historical note : The first computation of the appearance of a gravitational collapse as seen by a
remote observer has been performed by Mikhail A. Podurets in 1964 [415]. He obtained the exponential
decay formula (14.115) for the flux, up to an erroneous factor 2 in factor of tO − t∗O . This has been
corrected by William L. Ames and Kip S. Thorne in 1968 [12]. Podurets considered only the image flux,
which is relevant for a remote observer who cannot resolve the collapsing star. The intensity at the
center of the image has been derived by Yakov B. Zeldovich and Igor D. Novikov in 1964 [529], who
obtained the exponential decay formula (14.112). The computation of the intensity at each point of
the image has been achieved by Ames and Thorne in their 1968 study [12], who showed that (i) the
image is brightest at its rim, (ii) the redshift is constant there and (iii) the width of this peripheral bright
region is decaying exponentially with time. In 1969, Jack Jaffe [296] pointed out that Ames and Thorne
considered only photons with an emission angle Θ′em ≤ π/2 in the local static observer frame, while the
limiting condition should be Θem ≤ π/2 in the comoving observer frame [Eq. (14.96)]. This encompasses
extra photons: those that are emitted inward, as shown above. In 1979, Kayll Lake and R. C. Roeder
[333] recomputed the intensity with this hypothesis and concluded that the main properties found by
Podurets, Ames and Thorne are not significantly altered.
582                               Black hole formation 1: dust collapse
         8
            Note that there is no equivalent to the Jebsen-Birkhoff theorem (Property 14.4) for a non-spherically symmetric
      (e.g. flattened) body; in particular, the vacuum metric outside the body is not Kerr metric.
Chapter 15
Contents
         15.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 583
         15.2 The ingoing Vaidya metric . . . . . . . . . . . . . . . . . . . . . . . . . 583
         15.3 Imploding shell of radiation . . . . . . . . . . . . . . . . . . . . . . . . 588
         15.4 Configurations with a naked singularity . . . . . . . . . . . . . . . . . 602
         15.5 Going further . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 614
15.1      Introduction
Having investigated the gravitational collapse of a star, modeled as a ball of dust, in the
preceding chapter, we move to a much less astrophysical scenario: the formation of a black
hole by the collapse of a spherical shell of pure radiation (no matter!). Albeit quite academic,
this process illustrates various features of black hole birth and dynamics, in a way somewhat
complementary to the collapse of a dust ball. The model is based on an exact solution of the
Einstein equation sourced by a pure radial electromagnetic flux, known as Vaidya metric, which
we present first (Sec. 15.2). Then, we introduce the model describing the implosion of a radiation
shell and study the black hole formation via such a process (Sec. 15.3). Finally, we focus on a
subclass of models giving birth a naked singularity, i.e. models in which the central curvature
singularity is visible to remote observers located outside the black hole region (Sec. 15.4).
      coordinates on S2 and r being the areal radius associated with spherical symmetry (cf. Sec. 6.2.2).
      The ingoing Vaidya metric is the metric tensor
                                               
                                 2M (v)
                                                    dv 2 + 2 dv dr + r2 dθ2 + sin2 θ dφ2 ,         (15.1)
                                                                                        
                         g =− 1−
                                   r
      where M (v) is a real-valued function of v. We immediately notice that this expression strongly
      resembles that of the Schwarzschild metric expressed in the null ingoing Eddington-Finkelstein
      coordinates, as given by Eq. (6.29). Actually, the only difference is the constant m in Eq. (6.29)
      replaced by the function M (v) in Eq. (15.1). We may even say that the Schwarzschild metric is
      the special case M (v) = const of the ingoing Vaidya metric.
          A key property of the ingoing Vaidya metric is
        The hypersurfaces v = const are null (i.e. v is a null coordinate); a normal to them is the
        null vectora :
                                     −
                                     →        →
                                              −
                              k := −dv = −∇v            ⇐⇒       k := −dv.                   (15.2)
        Moreover, k is equal to minus the vector ∂r of coordinates (v, r, θ, φ):
                                                           ∂
                                                    k=−                 ,                       (15.3)
                                                           ∂r   v,θ,φ
        where we have rewritten ∂r as ∂/∂r|v,θ,φ to distinguish it from the vector ∂r of the IEF
        coordinates, to be introduced in Sec. 15.2.2.
           a
               in index notation: k α := −g αµ ∂µ v = −∇α v ⇐⇒ kα := −∂α v.
        Since dv is nowhere vanishing, k is a nonzero null vector field on M . We then set the time
        orientation of (M , g) by declaring that k is future-directed (cf. Sec. 1.2.2).
As shown in the notebook D.6.4, the Ricci tensor of g takes a simple form:
                                                      2M ′ (v)
                                               R=              dv ⊗ dv,                            (15.4)
                                                        r2
                                 15.2 The ingoing Vaidya metric                                    585
where M ′ (v) stands for the derivative of the function M (v). The Ricci scalar R = g µν Rµν =
(2M ′ (v)/r2 ) ∇µ v∇µ v = (2M ′ (v)/r2 ) kµ k µ is identically zero, since k is a null vector. A
consequence is
  The ingoing Vaidya metric (15.1) is a solution of the Einstein equation (1.40) with Λ = 0
  and with the energy-momentum tensor
                                            M ′ (v)
                                      T =           k⊗k .                               (15.5)
                                            4πr2
Remark 1: We have already noticed that Vaidya metric reduces to Schwarzschild metric for M (v) =
const. This corresponds to M ′ (v) = 0, so that Eq. (15.5) reduces to T = 0 and we recover that
Schwarzschild metric is a solution of the vacuum Einstein equation.
    The tensor (15.5) has the same structure as the energy-momentum tensor of the dust
model considered in Chap. 14: Tdust = ρ u ⊗ u [Eq. (14.2)]. The main difference is that u is
a timelike vector (the dust 4-velocity), while k is a null vector. For this reason, the energy-
momentum tensor (15.5) is sometimes referred to as a null dust model [416]. It corresponds
physically to the energy-momentum tensor of some monochromatic electromagnetic radiation
in the geometrical optics approximation (see Box 22.4 of Ref. [371]): Trad = q K ⊗ K, where
                  →
                  −
q ≥ 0 and K := ∇Φ is the wave vector, Φ being the rapidly-varying phase in the geometrical
optics decomposition A = Re(eiΦ a) of the electromagnetic potential 1-form A. The quantity
q is related to the energy density ε of the electromagnetic field as measured by an observer
O of 4-velocity u by ε = ω 2 q, where ω = −K · u is the frequency of the electromagnetic
radiation as measured by O. Maxwell equations imply that K is a null vector and that it is
geodesic: ∇K K = 0. The geodesic integral curves of K are nothing but the light rays of the
geometrical optics framework. The geodesic character also holds for k as defined by Eq. (15.2),
since k is normal to the null hypersurfaces v = const: the integral curves of k are the null
geodesic generators of these hypersurfaces (cf. Sec. 2.3.3); actually, we have exactly
                                          ∇k k = 0.                                       (15.6)
This follows from Eq. (2.22) with ℓ = k and κ = 0 by virtue of Eq. (2.21) with ρ = 0 implied by
                                         →
                                         −
Eq. (2.11) given that u = v and k = −∇v [Eq. (15.2)]. Equations (15.6) and (15.3) imply that
the curves (v, θ, φ) = const are null geodesics, λ := −r is an affine parameter along them and
k is the corresponding tangent vector.
    Another condition for the identification of T with Trad is that the coefficient in front
of k ⊗ k in (15.5) is non-negative, since q ≥ 0 in Trad . This constraint can also be seen
as the null energy condition (2.95) introduced in Sec. 2.4.2: for any null vector ℓ, we have
T (ℓ, ℓ) = M ′ (v)/(4πr2 ) (k · ℓ)2 and hence T (ℓ, ℓ) ≥ 0 ⇐⇒ M ′ (v) ≥ 0. In other words,
the function M (v) must be monotically increasing. Then, we can set k = αK, where α is a
constant, to have a perfect match of (15.5) with the electromagnetic radiation energy-momentum
tensor Trad . To summarize:
586                                  Black hole formation 2: Vaidya collapse
      Historical note : The Vaidya metric has been actually first derived by Henri Mineur in 1933 [367], as
      the solution of the Einstein equation for the exterior of a spherically symmetric body, the “mass” of
      which “is varying” due to the radiation of an “energy flux of light”. It thus corresponds to the outgoing
      version of the Vaidya metric, while the version (15.1) considered here is ingoing. More precisely, the
      solution given by Mineur reads1
                                                      du2 − dv 2
                                                                            
                                                                      2M (x)
                                         2
                                    −ds = 2 dx dr + r      2
                                                                 − 1−          dx2 .                                (15.7)
                                                         u2             r
      The coordinate x is the opposite of a retarded time (x = −u in our notations), while (u, v) are coordinates
      on the 2-sphere S2 (submanifold (x, r) = const) such that the standard (round) metric of S2 is q =
      u−2 −du2 + dv 2 (cf. the unnumbered equation      at the top of p. 35 of Mineur’s article [367]). This looks
      quite surprising, given that u−2 −du2 + dv 2 is rather the metric of the 2-dimensional anti-de Sitter
      space expressed in a variant of Poincaré coordinates (compare Eq. (13.66) with T = v and R = 1/u). In
      particular the signature is (−, +), while it should be (+, +) for S2 ! It turns out that the coordinate u
      used by Mineur is pure imaginary, i.e. Mineur wrote the metric of S2 via a kind of Wick rotation of the
      metric of the hyperbolic plane 2 2
                                     H , the latter being (dX + dY )/Y : setting v = X and u 2= iY , one
                                                                 2       2     2
      gets −q = u  −2   du − dv . If one restores the standard spherical coordinates (θ, φ) on S and uses
                           2      2
      the modern notation u = −x as well as the metric signature (−, +, +, +), one can rewrite Mineur’s
      solution as3                              
                                        2M (u)
                                                   du2 − 2 du dr + r2 dθ2 + sin2 θ dφ2 .                      (15.8)
                                                                                            
                          g =− 1−
                                            r
      This ressembles the metric (15.1). The only difference is the − sign in front of du dr, while there is a +
      sign in front dv dr in (15.1). This results from Mineur’s version being outgoing, given that he considered
      a radiating body. On the contrary, the version (15.1) is ingoing, since we are interested in gravitational
      collapse and black hole formation.
          Twenty years later, in 1953, Prahalad Chunnilal Vaidya presented the metric that bears his name in
      the form (15.8) [488]. Previously, in 1943 [486] and in 1951 [487], Vaidya presented the metric outside a
         1
            This is Mineur’s Eq. (21), p. 47 of Ref. [367]. The minus sign in the left-hand side, which is present Mineur’s
      article [367], is fortunate for comparison with the current text since Mineur is using the metric signature
      (+, −, −, −), i.e. the opposite of ours.
          2
            The spaces H2 and S2 are somehow connected as being the only 2-dimensional Riemannian manifolds of
      nonzero constant scalar curvature (negative for H2 and positive for S2 ).
          3
            As one can infer by comparing with other expressions of ds2 in Mineur’s article, the + sign in front of r2
      times the S2 line element in Eq. (15.7) is certainly a typo and should be replaced by a − sign; this correction is
      performed to get Eq. (15.8).
                                      15.2 The ingoing Vaidya metric                                             587
                                                 2M −1 2
                           2                   
          1            ∂M          2M     2
                                                      dr + r2 dθ2 + sin2 θ dφ2 ,                       (15.9)
                                                                              
   g=−                          1−      dT + 1 −
       f (M )2         ∂T           r             r
where f is an arbitrary function and M = M (T, r) fulfills the differential equation ∂M /∂r (1 −
2M /r) = f (M ). By introducing the scalar field u such that du = −f (M )−1 dM and promoting it as a
coordinate, one can bring this solution to Mineur’s form (15.8). In a foreword to Vaidya’s 1951 article
[487], Vishnu Vasudev Narlikar, who was Vaidya’s PhD advisor, wrote: “The treatment as given here is
essentially different from that of Professor H. Mineur as it appears in Ann. de l’Ecole Normal Superieure,
Ser. 3, 5, 1, 1933 (our Ref. [367]). Our attention was kindly drawn to it by Professor Mineur some years
ago.” In a common article by Narlikar and Vaidya published in 1947 [378], one can also read “The line
element (5) [our Eq. (15.9)] was first published by one of us some years ago. A line element equivalent to (5)
but not obviously so was obtained by Mineur several years earlier.” It would probably be fair to call the
metric (15.8) the Mineur-Vaidya metric; we shall however keep here the usual name of Vaidya metric.
The metric (15.8) was popularized and further studied by Richard W. Lindquist, Robert A. Schwartz and
Charles W. Misner in 1965 [347], who apparently were not aware of Mineur’s work.
t := v − r ⇐⇒ v = t + r . (15.10)
Remark 2: The coordinate t was denoted by t̃ in Chap. 6, where t was reserved for the Schwarzschild-
Droste time coordinate.
   We have dv = dt + dr, from which the IEF expression of the metric tensor is immediately
deduced from Eq. (15.1):
                                                                 
               2M (t + r)       2 4M (t + r)             2M (t + r)
     g = − 1−                dt +            dt dr + 1 +              dr2
                    r                 r                      r                                        (15.11)
         +r2 dθ2 + sin2 θ dφ2 .
                             
The expression of k in terms of the IEF coordinate frame is deduced from Eq. (15.3) and the
chain rule:
                                       k = ∂t − ∂r .                                 (15.12)
      The two solutions are V = −1 and V = [r − 2M (t + r)]/[r + 2M (t + r)]. The first solution
      gives back the ingoing null vector k introduced in Sec. 15.2.1 [cf. Eq. (15.12)]. The null vector ℓ
      corresponding to the second solution is
                                                     r − 2M (t + r)
                                          ℓ = ∂t +                  ∂r .                              (15.13)
                                                     r + 2M (t + r)
        The integral curves of the vector fields k and ℓ are null geodesics. Those of k are the
        ingoing radial null geodesics L(v,θ,φ)
                                        in
                                               already discussed in Sec. 15.2.1, while those of ℓ are
        called the outgoing radial null geodesics.
      Proof. A direct computation shows that ℓ is a pregeodesic vector field: ∇ℓ ℓ = κℓ, with
      κ = −4[rM ′ (t + r) − M (t + r)]/[r + 2M (t + r)]2 (cf. the notebook D.6.4). It follows that the
      integral curves of ℓ are geodesics (cf. Sec. B.2.2).
         The differential equation governing the outgoing radial null geodesics is obtained by
      demanding that ℓ is their tangent vector:
                                             dr   r − 2M (t + r)
                                                =                .                                    (15.14)
                                             dt   r + 2M (t + r)
      In what follows, we shall get exact solutions of this equation for M (v) piecewise linear.
      Remark 4: As in the Schwarzschild case (cf. Remark 1 on p. 191), the outgoing radial null geodesics are
      actually ingoing, i.e. have r decreasing towards the future, as soon as r < 2M (t + r). This corresponds
      to the region bounded by the red curve in Fig. 15.1, to be discussed in Sec. 15.3.5.
thickness. The function M (v) is thus constant outside the interval [0, v0 ]. In order to describe
the formation of a black hole, we choose M (v) = 0 for v < 0. This corresponds to a piece
of Minkowski spacetime, since the metric (15.11) clearly reduces to Minkowski metric (4.3)
wherever M (v) = 0. Denoting by m > 0 the constant value of M (v) for v > v0 , we have then
                 
                  0 for v < 0                       (Minkowski region, MMin )
                 
                 
       M (v) =       m S(v/v0 ) for 0 ≤ v ≤ v0 (radiation region, Mrad )                       (15.15)
                 
                     m for v > v0                    (Schwarzschild region, MSch ),
                 
                 
where S : [0, 1] → [0, 1], x 7→ S(x) is an increasing function obeying S(0) = 0 and S(1) = 1.
The region with v > v0 is qualified as Schwarzschild since for M (v) = m = const, the Vaidya
metric reduces to Schwarzschild metric, as noticed in Sec. 15.2.1. The three regions are shown
in terms of coordinates (t, r) on Fig. 15.1: Mrad (the imploding shell) is the yellow region,
MMin lies below it and MSch lies above it.
    The simplest example of a function M (v) obeying (15.15) is obtained for S(x) = x:
                                                            v
                       S(x) = x      ⇐⇒        M (v) = m         (0 ≤ v ≤ v0 ).                (15.16)
                                                            v0
It is shown as the blue curve in Fig. 15.2. This choice of S makes M (v) piecewise linear. The
resulting metric tensor (15.1) is continuous but not C 1 at v = 0 and v = v0 . A choice of S
that yields a C 2 metric tensor is S(x) = 6x5 − 15x4 + 10x3 . This choice is depicted by the red
curve in Fig. 15.2.
Historical note : The imploding shell model (15.15) has been introduced by William A. Hiscock, Leslie
G. Williams and Douglas M. Eardley in 1982 [279], as well as by Achilles Papapetrou in 1985 [400]. Both
studies regarded the specific case S(x) = x. Hiscock et al. considered the ingoing Vaidya metric in
the form (15.1) (coordinates (v, r, θ, φ)), while Papapetrou made use of the form (15.11) (coordinates
(t, r, θ, φ)).
                                                     2m
                                              α :=      .                                      (15.18)
                                                     v0
It is immediately apparent on (15.17) that for any λ > 0, the homothety Hλ : (v, r) 7→ (λv, λr)
maps g to λ2 g. Hence Hλ is a conformal isometry of (Mrad , g) with a constant conformal
factor λ2 . The homotheties (Hλ )λ∈R>0 form a 1-dimensional group, the generator of which is
590                            Black hole formation 2: Vaidya collapse
                              t/m
                              4
2 k `
1 A
                                                                                          r/m
                                         1        2          3      4        5        6
                              1
                                             B
                              2
                                  C
                              3
      Figure 15.1: Spacetime diagram of the Vaidya collapse based on the IEF coordinates (t, r) and for the linear
      mass function M (v) = mv/v0 with v0 = 3m (homothetic model with α = 2/3). The yellow area is the radiation
      region Mrad [cf. Eq. (15.15)], below it lies the Minkowski region MMin and above it, the Schwarzschild region
      MSch . The solid (resp. dashed) green curves are outgoing (resp. ingoing) radial null geodesics. The thick black
      line marks the event horizon H (Sec. 15.3.3) and the red one the future outer trapping horizon T (Sec. 15.3.5).
      Note that H and T coincide in MSch . The curvature singularity is indicated by the orange zigzag line. The part
      of the figure corresponding to MMin can be compared with Fig. 4.1, while that corresponding to MSch can be
      compared with Fig. 6.3. [Figure generated by the notebook D.6.4]
ξ = v ∂v + r ∂r |v,θ,φ = t ∂t + r ∂r . (15.19)
      The second equality follows form the change of coordinates (15.10). That ξ has the same
      expression with respect to (v, r) and (t, r) coordinates should not be surprising since the
      homothety Hλ has the same expression in both coordinate systems: Hλ : (t, r) 7→ (λt, λr),
      given that λv = λ(t + r) = λt + λr. The vector field ξ is called a homothetic Killing vector.
      The Lie derivative of g along ξ is twice g (cf. the notebook D.6.4 for the computation):
                                                          Lξ g = 2g.                                         (15.20)
                                           15.3 Imploding shell of radiation                                  591
                              1.0
                                    S(x) = x
                                    S(x) = 6x 5 − 15x 4 + 10x 3
                              0.8
0.6
                     M(v)/m
                              0.4
0.2
                              0.0
                                           2                 0                 2   v0   4   6
                                                                         v/m
Figure 15.2: Function M (v) for the imploding shell model, for v0 = 3m and two different choices of S(x) in
formula (15.15).
We shall refer to the choice M (v) piecewise linear as the homothetic radiation shell model.
Remark 1: The denomination self-similar, in place of homothetic, is also used in the literature (e.g.
[383, 384]).
Remark 2: A homothetic Killing vector is not a Killing vector, for the right-hand side of Eq. (15.20) would
be zero if ξ were a Killing vector [cf. Eq. (3.18)]. In other words, except for λ = 1, the homotheties
Hλ are not isometries, but only conformal isometries. Generally, vector fields generating conformal
isometries are called conformal Killing vectors. They fulfill Lξ g = σg, where σ is a scalar field.
Equation (15.20) constitutes the particular case σ = 2.
    Let us introduce the variable
                                                     v
                                                       ,          x :=                  (15.21)
                                                     r
which is invariant under the homotheties Hλ . The differential equation governing the outgoing
radial null geodesics, Eq. (15.14), can be rewritten as dt/dr = (1+αx)/(1−αx) [cf. Eq. (15.18)].
Given that t = v − r = r(x − 1) implies dt/dr = x − 1 + rdx/dr, we get the equivalent form
                                                       dx   αx2 − x + 2
                                                   r      =             .                          (15.22)
                                                       dr     1 − αx
Fortunately, this ordinary differential equation is separable, so that its solutions are easily
obtained by quadrature. They depend on whether the quadratic polynomial Pα (x) := αx2 −x+2
admits real roots or not. Let us first focus on the case where Pα has no real root. The discriminant
being 1 − 8α, this occurs if, and only if,
                                                         1
                                                α>         ⇐⇒ v0 < 16 m.                           (15.23)
                                                         8
By considering the energy-momentum tensor (15.5) with expression (15.16) substituted for
M (v), we get
                                       α
                                 T =       k ⊗ k.                                (15.24)
                                      8πr2
592                          Black hole formation 2: Vaidya collapse
      Consequently, we may say that the case α > 1/8 corresponds to a large radiation energy
      density. We shall discuss the low radiation energy density case (α < 1/8) in Sec. 15.4. For the
      moment, assuming (15.23), we have Pα (x) > 0 for any x ∈ R and we may rewrite Eq. (15.22) as
                                                        1 − αx
                                           d ln r =               dx,                               (15.25)
                                                      αx2 − x + 2
      the solution of which is r = r0 fα (x), where
                                                 Z x
                                                            1 − αx′
                                    ln fα (x) :=                         dx′                        (15.26)
                                                      0   αx′ 2 − x′ + 2
      and the integration constant r0 > 0 is the value of r at x = 0, or equivalently at v = 0, since
      fα (0) = 1. Explicitly,
                        √                                                        
                       2                 1               2αx − 1                1
        fα (x) = √            exp    √          arctan √            + arctan √             ,
                     2
                   αx − x + 2          8α − 1              8α − 1              8α − 1
                                                                                       (15.27)
      Introducing the dimensionless parameter u := − ln(r0 /m), we conclude:
Property 15.6: outgoing radial null geodesics for the α > 1/8 homothetic shell
        For the homothetic model with α > 1/8,    the outgoing radial null geodesics in Mrad form
        a 3-parameter family of curves L(u,θ,φ) , where the parameter u ∈ R is related to the
                                               out
        value r0 of r at the inner edge of the radiation shell (v = 0) by r0 = me−u . The parametric
        equation of L(u,θ,φ)
                        out
                              in terms of the IEF coordinates is
                              
                                      −u
                               t = me (x − 1)fα (x)
                              
                                r = me−u fα (x)                     0 ≤ x ≤ xmax ,               (15.28)
                              
                                θ = const, φ = const
                              
        where the function fα (x) is defined by Eq. (15.27) and either xmax = +∞ (L(u,θ,φ)
                                                                                      out
                                                                                             reaches
        r = 0 for some v < v0 ) or xmax is the solution of me xmax fα (xmax ) = v0 (L(u,θ,φ) reaches
                                                             −u                       out
      The function fα (x) is plotted in Fig. 15.3. It increases from 1 at x = 0 to some maximum
      reached for x = α−1 and then decreases to 0 as x → +∞. This behavior follows directly from
      the sign of the numerator 1 − αx in Eq. (15.25), given that the denominator αx2 − x + 2 is
      always positive for α > 1/8. Note that it could be that xmax < α−1 so that the maximum
      of fα (x) is actually not reached along L(u,θ,φ)
                                                   out
                                                            . In that case, r increases monotonically along
      L(u,θ,φ)
         out
               in Mrad , from r0 to r0 fα (xmax ).
          It appears clearly on Eq. (15.28) that the homothety Hλ : (t, r) 7→ (λt, λr) transforms
      the geodesic L(u,θ,φ)
                        out
                              into the geodesic L(uout ′ ,θ,φ) with u = u − ln λ, in agreement with the
                                                                       ′
                                       α = 0.25
                                       α = 0.50
                                       α = 1.00
                               4       α = 2.00
                                       α = 4.00
                                       α = 6.00
                               3
                      fα (x)
                               2
                                   0               1      2      3      4      5      6
                                                                 x
Figure 15.3: Function fα (x), defined by Eq. (15.27), for some selected values of α > 1/8. For each value of α,
the maximum of fα (x) is achieved for x = α−1 . [Figure generated by the notebook D.6.5]
    Some outgoing radial null geodesics are depicted as solid green lines in Fig. 15.1. In Mrad ,
they obey Eq. (15.28). Note that the homothetic symmetry appears clearly on the figure. If a
geodesic L(u,θ,φ)
           out
                  has a r-turning point in Mrad , it must located at x = α−1 , i.e. at t/r = α−1 −1,
or equivalently at                          v         
                                                0
                                        t=          − 1 r.                                    (15.29)
                                             2m
The above equation defines a straight line through (t, r) = (0, 0), whose intersection with
Mrad is depicted by a red segment in Fig. 15.1.
Remark 3: The turning point value (15.29) can be obtained directly by setting dr/dt = 0 in Eq. (15.14)
and using the value (15.16) for M (v).
    In the Minkowski region, the outgoing radial null geodesics are straight line segments
inclined at +45◦ in Fig. 15.1, while in the Schwarzschild region, they are curves obeying
Eq. (6.51) (with the change of notation t̃ ↔ t).
Property 15.7: black hole formation for the α > 1/8 homothetic shell
  The homothetic imploding radiation shell with α := 2m/v0 > 1/8 generates a black hole.
  The black hole event horizon H is the future light cone of the point of IEF coordinates
594                                 Black hole formation 2: Vaidya collapse
                                         0.0
                                         0.5
                                         1.0
                                         1.5
                                thb /m
                                         2.0
                                         2.5
                                         3.0
                                         3.5
                                         4.0
                                               0   2       4   6        8      10        12   14        16
                                                                    v0 /m
      Figure 15.4: Value thb of the coordinate t at the black hole birth [Eq. (15.30)] as a function of the radiation shell
      thickness v0 , for the homothetic shell collapse [Eq. (15.16)].
        Outside the radiation shell, H coincides with the Killing horizon of Schwarzschild space-
        time located at r = 2m.
            a
                As in Chap. 14, the subscript ‘hb’ stands for horizon birth.
      Proof. In the Schwarzschild region MSch , the Killing horizon is the hypersurface r = 2m.
      It is generated by the null geodesics L(θ,φ)  out,H
                                                          discussed in Sec. 6.3.5 [cf. Eq. (6.52)]. Let us
      consider one such geodesic, Lˆ say. It has a fixed value of (θ, φ) and, when followed in the
      past direction, it encounters the outer edge of the radiation region Mrad (hypersurface v = v0 )
      at the point A such that rA = 2m and tA = v0 − 2m (cf. Fig. 15.1, where Lˆ can be identified
      with the black curve). If Lˆ is prolonged into Mrad , still in the past direction, it encounters the
      inner edge of Mrad (hypersurface v = 0) at the point B (cf. Fig. 15.1). The portion AB of Lˆ
      coincides with the geodesic L(uout B ,θ,φ)
                                                 of the outgoing radial null family given by Eq. (15.28),
      with uB = − ln(rB /m) (since fα (xB ) = 1, given that xB = 0). We have then rA = rB fα (xA ).
      Now, by definition of x [Eq. (15.21)] and α [Eq. (15.18)], xA = vA /rA = v0 /(2m) = α−1 . We
      have thus 2m = rB fα (α−1 ). In view of expression (15.27) for fα (x), there comes
                                                                            
                                                      2                  1
                             rB = 2m exp − √                arctan √              .
                                                    8α − 1             8α − 1
      Since B is located on the hypersurface v = 0, we have tB = −rB . If the radial null geodesic Lˆ
      is prolonged further to the past in the Minkowski region MMin , it becomes the straight line of
      equation t = r − 2rB . Lˆ thus reaches r = 0 at some event C of coordinate t = thb = −2rB ,
      hence Eq. (15.30).
                                  15.3 Imploding shell of radiation                                     595
    There remains to prove that the null hypersurface generated by Lˆ when (θ, φ) varies, i.e.
the future light cone H of the point (t, r) = (thb , 0), is indeed the black hole event horizon.
To this aim, let us consider an outgoing radial null geodesic L in MMin such that L crosses
r = 0 at t < thb , i.e. outside H . L arrives then at the inner edge of Mrad with r = r0 > rB .
In Mrad , according to Eq. (15.28), L is homothetic to a part of Lˆ with a ratio r0 /rB > 1.
Hence it emerges at the outer edge of Mrad with r > rA = 2m. L is there in the exterior of
the Schwarzschild black hole, so that it will subsequently reach the future null infinity I + of
MSch . On the contrary, if L crosses r = 0 with t > thb , i.e. inside H , it encounters the inner
edge of Mrad with r = r0 < rB . A part of L is then homothetic to the segment BA of Lˆ with
a ratio r0 /rB < 1. Then either (i) L has a r-turning point and reaches r = 0 in Mrad or (ii) L
reaches the outer edge of Mrad (v = v0 ) at some point A′ . Given that xA = α−1 corresponds to
the maximum of fα (x), one has necessarily fα (xA′ ) ≤ fα (xA ) and thus r0 fα (xA′ ) < rB fα (xA ).
By Eq. (15.28), this implies rA′ < rA = 2m, so that L emerges in the black hole region of
Schwarzschild spacetime. So none of the two possible cases (i) or (ii) leads to L reaching I + .
We conclude that H is a black hole horizon.
      The black hole event horizon H is depicted as the thick black curve in Fig. 15.1. Since
thb   < 0 [cf. Eq. (15.30)], one immediately notices:
  The black hole forms in the Minkowski region of Vaidya spacetime, i.e. in a region where
  the spacetime curvature is zero.
This striking feature reflects the non-local character of black holes and will be discussed further
in Chap. 18.
    The dependency of thb on the width v0 of the radiation shell is shown in Fig. 15.4. One has
−4m < thb < 0, with
                          lim thb = −4m         and         lim thb = 0.                      (15.31)
                          v0 →0                           v0 →16m
Remark 4: The first limit in (15.31), which corresponds to α → +∞ in Eq. (15.30), is easily recovered
by a pure geometric construction: a zero-width shell implies the equality of the two points A and B
considered in the proof of (15.30), as well as tA = tB = −2m, hence tC = thb = −4m.
Historical note : The imploding radiation shell with M (v) piecewise linear (homothetic model) has
been extensively studied by Achilles Papapetrou in 1985 [400]. He obtained the solution (15.28) for
the outgoing radial null geodesics and determined the location of the event horizon, along the lines
presented above.
        The Vaidya shell collapse introduced in Sec. 15.3.1 generates a spacetime with a curvature
        singularity located at r = 0 and t ≥ 0.
      Remark 5: The Kretschmann scalar of Vaidya metric has the same structural form as that of the
      Schwarzschild metric, compare Eq. (6.45), while a priori K could have contained some term involving
      the derivative of M (v). Indeed M ′ (v) appears in some components of the Riemann tensor, since it is
      present in the components of the Ricci tensor, as given by Eq. (15.4).
      Remark 6: Since the Ricci tensor of the Vaidya metric is not identically zero [cf. Eq. (15.4)], other
      curvature invariants that one might have think of for tracking the curvature singularity are the Ricci
      scalar R := g µν Rµν and the Ricci “squared” Rµν Rµν . However, they are both identically zero for k is a
      null vector.
          The curvature singularity is depicted as the orange broken line in Fig. 15.1, which regards
      the homothetic radiation shell with α > 1/8. It is clear on this figure that the singularity bounds
      the future of both ingoing and outgoing radial null geodesics (this will be shown rigorously
      in Sec. 15.3.6). This implies that, for α > 1/8, the curvature singularity is spacelike, as in
      Schwarzschild spacetime. We shall see in Sec. 15.4 that for α < 1/8, the curvature singularity
      has a null segment, in addition to the spacelike one.
          For the homothetic collapse with α > 1/8 (v0 < 16 m) considered in Secs. 15.3.2 and 15.3.3
      and depicted in Fig. 15.1, one has thb < 0 [Eq. (15.30)], so that the curvature singularity is
      entirely located in the black hole region. It is therefore hidden to a remote observer. In Sec. 15.4,
      we will see that this is no longer the case for α < 1/8: the singularity is then naked.
  Obviously, the above criterion cannot be fulfilled in the Minkowski region, where M (t +
  r) = 0. On the contrary, trapped surfaces exist in the central part of the radiation re-
  gion, since M (t + r) > 0 there. They also exist in the black hole part (r < 2m) of the
  Schwarzschild region, where M (t + r) = m.
     Let us denote by T the hypersurface formed by all the marginally trapped spheres of fixed
(t, r) (vanishing expansion along ℓ). The equation of T is obtained by saturating the inequality
in Eq. (15.33):
                                   T :       r = 2M (t + r).                             (15.34)
As we shall see in Chap. 18, T is called a future outer trapping horizon. Equation (15.34)
implies dr/dt = 0 in Eq. (15.14), hence we get
Property 15.11: trapping horizon and r-turning points of radial null geodesics
  If an outgoing radial null geodesic L crosses T , the crossing point is a r-turning point of
  L.
This feature appears clearly in Figs. 15.1 and 15.5. Furthermore, T obeys the following
properties:
  In the radiation region, the future outer trapping horizon T is a spacelike hypersurface,
  while in the Schwarzschild region, T coincides with the event horizon H and hence is a
  null hypersurface there.
Proof. Let us consider (v, θ, φ) as coordinates on the 3-manifold T . We may rewrite Eq. (15.34)
for T as r = 2M (v), so that dr = 2M ′ (v) dv along T . Plugging this relation, as well as
2M (v)/r = 1, into Eq. (15.1) yields    immediately′ the metric h induced by g on T : h =
4M (v) dv + r dθ + sin θ dφ . In Mrad , M (v) > 0 [cf. Eq. (15.15)], which implies
     ′      2    2     2      2      2
Remark 7: The spacelike character of T is actually a generic feature of future outer trapping horizons
in non-stationary spacetimes (such as (Mrad , g)), as we shall see in Chap. 18.
598                                Black hole formation 2: Vaidya collapse
                              t/m
                               4
2 k `
                                                                                       r/m
                                         1       2       3        4        5       6
      Figure 15.5: Same as Fig. 15.1, but for the C 2 mass function M (v) corresponding to the choice S(x) =
      6x5 − 15x4 + 10x3 in Eq. (15.15) (cf. the red curve in Fig. 15.2). Note that the trapping horizon T (red curve) is
      tangent to the event horizon H (black curve) at the outer edge of radiation region ((t, r) = (m, 2m)). [Figure
      generated by the notebook D.6.4]
         For the homothetic shell model considered in Sec. 15.3.2, M (v) = mv/v0 = αv/2 and the
      equation of T in Mrad is vey simple:
      Example 1: For v0 = 3m, as in Fig. 15.1, α = 2/3 and we get r = 2v/3 = 2(t + r)/3 or equivalently
      t = r/2. Hence, in Mrad , T appears as the straight line segment of slope 1/2 drawn in red in Fig. 15.1.
      It appears clearly there that the outgoing radial null geodesics that cross T do it at a r-turning point. By
      considering the light cones delineated by the ingoing and outgoing radial null geodesics, it also appears
      clearly on Fig. 15.1 that T is a spacelike hypersurface in Mrad .
      Remark 8: As shown above, T coincides with H in MSch , so that the slope of T in Fig. 15.1 changes
      abruptly from 1/2 to +∞ at the outer edge of Mrad . This lack of smoothness of the hypersurface T
      reflects actually the lack of smoothness of the function M (v) of the homothetic shell model (cf. the
      blue curve in Fig. 15.2). Would M (v) be smooth, T would merge smoothly with H at v = v0 . This
      is shown in Fig. 15.5, where M (v) is of class C 2 . Besides, we can check on this figure that, while it
      has a shape more complicated that in Fig. 15.1, T always lies outside the null cones in Mrad , i.e. T is
      spacelike there.
                                  15.3 Imploding shell of radiation                                       599
where x := v/r and r is to be considered as a function of (u, v) defined implicitly by Eq. (15.36).
Since for α > 1/8, the polynomial αx2 − x + 2 never vanishes, the metric component guv read
on Eq. (15.37) is nonzero in all Mrad . This shows that the coordinates (u, v, θ, φ) are regular
on Mrad (except for the singularities inherent to the spherical coordinates (θ, φ)). Moreover,
they constitute a double-null coordinate system: both u and v are null coordinates. This has
already been shown for v (cf. Property 15.1) and for u, this follows from the component g uu of
the inverse metric deduced from (15.37) being identically zero [cf. Eq. (A.56b)]. Besides, we
read on Eq. (15.37) that guu = 0 and gvv = 0, which means that the coordinate vectors ∂u and
∂v are both null. The vector ∂u is actually tangent to the ingoing radial null geodesics L(v,θ,φ)
                                                                                             in
      Figure 15.6: Radiation region Mrad for the homothetic model (M (v) = mv/v0 ) with v0 < 16m, depicted in
      terms of the double-null coordinates (u, v). Solid (resp. dashed) green lines are outgoing (resp. ingoing) radial
      null geodesics. The orange zigzag curve corresponds to the curvature singularity at r = 0. The black segment
      marks the event horizon H , while the red curve marks the trapping horizon T [Eq. (15.42)]. The points A and B
      are the same as in Fig. 15.1.
      corresponds to the limit r → +∞. In view of Eq. (15.36) and fα (0) = 1 (cf. Eq. (15.26) or
      Fig. 15.3), this corresponds to u → −∞. The last part of the boundary of Mrad is set by the
      curvature singularity at r = 0 (cf. Sec. 15.3.4). Taking the limit r → 0 in Eq. (15.36), we found
      the equation ruling this boundary in terms of (u, v):
                   v                            r                                 
                      0                               α    1      π             1
          u = ln          + u0 ,       u0 := ln         +√          + arctan √           .                     (15.39)
                      v                               2   8α − 1 2             8α − 1
      We conclude that the range of the coordinates (u, v) on Mrad is (cf. Fig. 15.6)
                                                  
                                                   0≤v≤v
                                                           0
                                      Mrad :                                                                   (15.40)
                                                                         v0
                                                                              
                                                   −∞ < u < ln                   + u0 .
                                                                         v
Figure 15.7: Carter-Penrose diagram of Vaidya collapse for the homothetic shell model (M (v) = mv/v0 =
αv/2) with α > 1/8 (v0 < 16m). The Minkowski region MMin , radiation region Mrad and Schwarzschild region
MSch are depicted in respectively pale blue, yellow and grey. Solid (resp. dashed) green lines are outgoing (resp.
ingoing) radial null geodesics. The orange zigzag segment corresponds to the curvature singularity at r = 0. The
hatched area is the black hole region, delimited by the event horizon H . The red dot-filled area, delimited by the
future outer trapping horizon T (red curve), is the region where the spheres (t, r) = const are trapped surfaces.
The points A, B and C along H are the same as in Fig. 15.1.
    Since it is generated by outgoing null geodesics, the event horizon H crosses Mrad at a
fixed value of u, uH say. The value of uH is found by setting r = 2m and v = v0 in Eq. (15.36):
                                                            
                                    2                  1
                          uH = √          arctan √             − ln 2.                 (15.41)
                                  8α − 1             8α − 1
Given expression (15.39) for u0 , it is easy to check that uH < u0 , as drawn in Fig. 15.6.
    Regarding the future outer trapping horizon T introduced in Sec. 15.3.5, it obeys r = αv in
Mrad [Eq. (15.35)], so that the equation of T in terms of the double-null coordinates is easily
obtained by substituting α−1 for v/r in Eq. (15.36) and making use of expressions (15.27) and
(15.41); one gets                                   v 
                                                       0
                                  T :        u = ln       + uH .                             (15.42)
                                                      v
Note that this implies u → +∞ for v → 0 and u → uH for v → v0 ; the latter property agrees
with T coinciding with H in MSch , i.e. for v > v0 . T is drawn as a red curve in Fig. 15.6.
    The spacetime diagram of Mrad in Fig. 15.6 is already conformal since all radial null geodesics
are straight lines inclined at ±45◦ . To integrate it into a Carter-Penrose diagram, one needs
602                             Black hole formation 2: Vaidya collapse
                         t/m
                        20
                                      A
                        15                B
                                                   C
                        10
                                                                                     r/m
                             O    2       4    6       8    10     12    14     16
Figure 15.8: Spacetime diagram of the Vaidya collapse based on the IEF coordinates (t, r) and for the linear
mass function M (v) = mv/v0 with v0 = 18 m (homothetic shell model with α = 1/9, which corresponds to
x1 = 3 and x2 = 6). The legend is the same as in Fig. 15.1, with in addition the blue line marking the Cauchy
horizon HC induced by the naked singularity at (t, r) = (0, 0) (Sec. 15.4.4). In the radiation region, the straight
line segments OB and OC are the traces in the (t, r) plane of the homothetic Killing horizons x = x2 and x = x1 ,
the last one being a part of the Cauchy horizon. [Figure generated by the notebook D.6.4]
604                                  Black hole formation 2: Vaidya collapse
      Proof. Their equations being t = (x1 − 1)r and t = (x2 − 1)r, L(θ,φ)
                                                                         ∗1
                                                                             and L(θ,φ)
                                                                                     ∗2
                                                                                          are clearly
      orbits of the homothety group (Hλ )λ∈R>0 discussed in Sec. 15.3.2. The group generator ξ is
      thus tangent to the null curves L(θ,φ)
                                         ∗1
                                             on H1 and L(θ,φ)
                                                           ∗2
                                                               on H2 . Hence, ξ is null there. Given
      that the only null direction in a null hypersurface is the normal one, we conclude that ξ is
      normal to H1 and H2 .
          Let us now consider generic outgoing radial null geodesics, i.e. geodesics with x ̸= x1 and
      x ̸= x2 ; we may then rewrite Eq. (15.45) as
                                                               x1 + x2 − x
                                                  d ln r =                      dx.
                                                             (x − x1 )(x − x2 )
      where r0 is constant along the considered geodesic (but may vary from one geodesic to the
      other). If the geodesic intersects Mrad ’s inner edge (x = 0), then r0 represents the value of r at
      the intersection point.
      Example 2: For α = 1/9, one has x1 = 3 and x2 = 6 [cf. Eq. (15.44)] and Eq. (15.48) reduces to
      r = r0 |x/6 − 1|/(x/3 − 1)2 . Such a function of x is plotted in Fig. 15.9.
          Having determined the outgoing radial null geodesics, we are in position to establish the
      following results:
Property 15.14: black hole and naked singularity for the α < 1/8 model
        The homothetic imploding radiation shell with α := 2m/v0 < 1/8 generates a black hole,
        the event horizon of which originates at (t, r) = (0, 0), with a slope t/r = x2 −1. Moreover,
        the curvature singularity at (t, r) = (0, 0) is naked: it is connected to remote observers by
        the outgoing radial null geodesics L(θ,φ)
                                                ∗1
                                                    , as well as by the family of outgoing radial null
        geodesics that cross the outer edge of the radiation shell (v = v0 ) with x1 < x < α−1 ; all
        geodesics of this family, which contains L(θ,φ)∗2
                                                          , emanate from (t, r) = (0, 0) with a slope
        t/r = x2 − 1.
                                    15.4 Configurations with a naked singularity                                       605
3.0
2.5
2.0
                                                                        x = α −1
                                              x = x1
                                                           x = x2
                       r/r0
                              1.5
1.0
0.5
                              0.0
                                    0     2            4        6   8              10   12
                                                               x
Figure 15.9: The coordinate r as a function of x := v/r along outgoing radial null geodesics for the homothetic
Vaidya collapse with α = 1/9 (which implies x1 = 3 and x2 = 6). The graph of r(x) admits a local maximum for
x = α−1 , i.e. x = 9 in the present case, although this is barely noticeable on the figure. [Figure generated by the
notebook D.6.5]
Proof. As in Sec. 15.3.3, let us consider a null generator Lˆ of the Killing horizon of the
Schwarzschild region MSch , which is located at r = 2m. Lˆ intersects the outer edge v = v0
of Mrad at a point A such that rA = 2m and hence xA = α−1 (cf. Fig. 15.8). When prolonged
to the past in Mrad , i.e. to x < α−1 , Lˆ has r decreasing (cf. the graph of r(x) at the left of
x = α−1 in Fig. 15.9), until r = 0 is reached for x = x2 . Since the Kretschmann scalar (15.32) is
K = 12α2 x2 /r4 for the homothetic model, we get K → ∞ for r → 0 when x → x2 > 0. We
thus conclude that Lˆ hits the curvature singularity at x = x2 and cannot be extended further
in the past, contrary to the case α > 1/8 dealt with in Sec. 15.3.3.
    Let us now consider any outgoing radial null geodesic L in MSch that intersects the outer
edge v = v0 of Mrad with x such that x1 < x < α−1 , i.e. between the points C and A in
Fig. 15.8. L has r > 2m in MSch , hence it reaches the future null infinity I + . When L
is prolonged backward in Mrad , it starts with r decaying since Eq. (15.14) can be rewritten
dr/dt = (1 − αx)/(1 + αx) for the homothetic model, implying dr/dt < 0. The entire past
of L in Mrad can then be read on Fig. 15.9: if one follows the decaying r direction either for
x1 < x < x2 or x2 < x < α−1 , one ends up to r = 0 for x = x2 , as for Lˆ. Hence the same
conclusion holds: L hits the curvature singularity in the past direction at x = x2 . Given that
L extends to I + in the future, we conclude that the curvature singularity is naked.
    Finally, let us consider an outgoing radial null geodesic L in MSch that intersects the outer
edge v = v0 of Mrad with x < x1 , i.e. below the point C in Fig. 15.8. When L is prolonged
backward in Mrad , r decreases along it and we read on the left part of Fig. 15.9 that r reaches
the finite nonzero value r0 at x = 0, i.e. at the inner edge of Mrad . It can then be extended
backward to the Minkowski region MMin , where it reaches r = 0 at t = t0 := −2r0 . Since for
x → x− 1 , r/r0 → +∞ (cf. Fig. 15.9 or Eq. (15.48)), we conclude that r0 , and hence |t0 |, can be
made arbitrarily small by having x close enough to x1 at the outer edge of Mrad . This proves
that the whole of MMin can be connected to I + by this type of null geodesics. It follows that
606                              Black hole formation 2: Vaidya collapse
the black hole event horizon is the null hypersurface generated by Lˆ.
      Historical note : The formation of naked singularities in spacetimes with a Vaidya region has been
      put forward first by B. Steinmüller, Andrew R. King, and Jean-Pierre Lasota in 1975 [463], but this
      regards bodies radiating away all their masses, the exterior of which is described by the outgoing Vaidya
      metric (15.8). In the context of imploding radiating shells (ingoing Vaidya metric) considered here, the
      appearance of naked singularities has been shown first by William A. Hiscock, Leslie G. Williams and
      Douglas M. Eardley in 1982 [279] and has been further studied by Yuhji Kuroda in 1984 [330] and Achilles
      Papapetrou in 1985 [400]. In particular, the solution (15.48) for the outgoing radial null geodesics has
      been exhibited by Papapetrou: compare Eq. (18) in Ref. [400], where X1 = x2 and X2 = x1 .
      Since x2 > x1 , one has Mrad = NI ∪ NII . Furthermore, NI (resp. NII ) contains the homothetic
      Killing horizon H1 (resp. H2 ) and NI ∩ NII is the region between H1 and H2 .
      The NI region
      On NI , let us define the parameter u so that
      As r0 , u is constant along a given outgoing radial null geodesic. From Eq. (15.49), we get5
      |u| = |v/x1 − r|/(r − v/x2 )x1 /x2 . This equation determines u in terms of v and r up to some
         5
             We have made use of the identity |v/x2 − r| = r − v/x2 , which holds on NI .
                             15.4 Configurations with a naked singularity                                607
sign. We choose the latter so that u → +∞ at the inner boundary of NI (r → v/x2 ). This
yields
                                 v/x1 − r
                         u=                     , −∞ < u < +∞,                  (15.52)
                              (r − v/x2 )x1 /x2
with limr→+∞ u = −∞ (cf. Fig. 15.10). We may use (u, v, θ, φ) as a coordinate system on NI ,
instead of the IEF coordinates (t, r, θ, φ).
Example 3: As in Example 2, let us consider the case α = 1/9, for which x1 = 3 and x2 = 6 [cf.
Eq. (15.44)]. Equation (15.52) reduces then to
                                                  v/3 − r
                                               u= p        .                                   (15.53)
                                                   r − v/6
This relation can be inverted, yielding an explicit expression for r(u, v):
                                                       r          !
                                         v u                   2
                                    r= +          u − u2 + v .                                 (15.54)
                                         3 2                   3
     The metric components in coordinates (u, v, θ, φ) are deduced from those in coordinates
(v, r, θ, φ), i.e. Eq. (15.17), via the identity α = 2/(x1 x2 ). We get (see the notebook D.6.6 for
the computation):
                                   x1 /2                                             
           2x2                 v                                                     v
                                                         2     2    2    2
                                                                                              (15.55)
                                                                             
   g=−                      r−               du dv + r       dθ + sin θ dφ        r>      .
       (x2 − x1 )r             x2                                                    x2
In this expression, r shall be considered as the function of (u, v) defined implicitly by Eq. (15.52).
The metric component guv is regular and non-vanishing in all NI . Moreover we have a double-
null coordinate system, since v is a null coordinate (Property 15.1) and it can be infered from
(15.55) that g uu = 0, so that u is null as well by Eq. (A.56b). The coordinate vectors ∂u and ∂v
are both null, given that guu = 0 and gvv = 0. The vector ∂u is actually tangent to the ingoing
radial null geodesics, which are defined by (v, θ, φ) = const, and the vector ∂v is tangent to the
outgoing ones, which are defined by (u, θ, φ) = const. Furthermore, the homothetic Killing
horizon H1 , which is defined by v/r = x1 , is the hypersurface u = 0 of NI .
    Let us investigate the structure of the inner boundary v = 0 of the radiation shell in
the double-null coordinates (u, v, θ, φ). For r ̸= 0, the limit v → 0 in Eq. (15.52) leads
to u = −r1−x1 /x2 . This implies u < 0. In this part, which is the future boundary of the
Minkowski region of Vaidya spacetime, this relation defines a bijection between r ∈ (0, +∞)
and u ∈ (0, −∞). So we may say that r is a regular coordinate of NI for u < 0. On the other
side, the limit r → 0 in NI implies v → 0 since r > v/x2 in NI . Actually, the following relation
holds:
                                         v ∼ x2 r in NI .                                     (15.56)
                                               r→0
This follows directly from Eq. (15.48), which implies the equivalence
                                r → 0 ⇐⇒ (x → x2 or x → +∞)                                   (15.57)
608                            Black hole formation 2: Vaidya collapse
      Figure 15.10: Carter-Penrose diagrams of the subregions NI (right) and NII (left) of the radiation region Mrad .
      The region between the homothetic Killing horizons H1 and H2 is common to both NI and NII . Zigzag lines
      indicate a curvature singularity. The red curve in NII is the trapping horizon T [Eq. (15.72)].
      along any outgoing radial null geodesic, irrespective of the value of u (see also Fig. 15.9). Given
      that in NI , x → +∞ is excluded by the constraint x < x2 [Eq. (15.50)], there remains x → x2 ,
      which yields the equivalence (15.56). Now, Eq. (15.56) implies that the numerator of Eq. (15.52)
      is equivalent to (x2 /x1 − 1)r at the limit r → 0 and thus is positive, so that u > 0 on the part
      of the boundary of NI where r → 0. We conclude that the “point” (t, r) = (v, r) = (0, 0) in
      IEF coordinates becomes the hypersurface v = 0, u > 0 in the double-null coordinates. This
      means that the IEF coordinates are not adapted to describe the vicinity of (t, r) = (0, 0) in
      Vaidya spacetime when α < 1/8.
      Example 4: Let us perform a first order expansion in v of√ the explicit expression of r(u, v) found in
      Example 3 [Eq. (15.54)]. Taking into account the identity u2 = |u|, we get
                                       
                                        r = u2 + 1 v + O(v 2 )
                                       
                                                                        for u < 0
                                             1
                                                  3                                                           (15.58)
                                        r = v + O(v 2 )
                                                                       for u > 0.
                                             6
      The first equation agrees with the relation u = −r1−x1 /x2 found above for r ̸= 0 and v = 0, given that
      x1 /x2 = 1/2 in the present case, while the last equation is exactly (15.56), given that x2 = 6.
          As discussed above, the double null coordinates (u, v, θ, φ) are regular coordinates on
      NI , because the components (15.55) of the metric tensor are regular (except for the standard
      singularities of the spherical coordinates (θ, φ)). Another advantage of these coordinates is
      that they allow for an immediate drawing of Carter-Penrose diagrams. It suffices to bring u to
      a finite range by setting e.g. U = arctan u and use the coordinates T = U + v and X = v − U
      to get such a diagram of NI , as depicted in Fig. 15.10.
          Let us describe the (vicinity of the) curvature singularity in the double null coordinates. In
                             15.4 Configurations with a naked singularity                                        609
the present case, where M (v) = αv/2, the Kretschmann scalar (15.32) reduces to
                                                              v2
                                                   K = 12α2      ,                                     (15.59)
                                                              r6
where r is the function of (u, v) given implicitly by Eq. (15.52) (or explicitly by Eq. (15.54) in the
case α = 1/9). The above expression shows that K diverges only if r → 0. This limit occurs
only for v → 0 in the subregion u ≥ 0 of NI . There, Eq. (15.56) leads to K ∼ 12α2 x62 /v 4 ,
which actually diverges as v → 0. We conclude:
This curvature singularity corresponds to the zigzag segment in right part of Fig. 15.10.
Then Eq. (15.49) yields |u′ | = |v/x2 − r|/(v/x1 − r)x2 /x1 . Choosing the sign of u′ such that
u′ → −∞ at the outer boundary of NII (r → v/x1 ), we get
                                                       v/x2 − r
                                             u′ =                         .                            (15.61)
                                                    (v/x1 − r)x2 /x1
This relation can be inverted, yielding an explicit expression for r(u′ , v):
                                                   r                !
                                        v    1            2 ′
                                   r= + ′             1− uv−1 .                                        (15.63)
                                        3 2u              3
   As above, the metric components in coordinates (u′ , v, θ, φ) are deduced from those in
coordinates (v, r, θ, φ) [Eq. (15.17)] (cf. the notebook D.6.6 for the computation):
                                   x2 /2                                                      
           2x1              v                                                                 v
                                               ′         2      2             2   2
                                                                                                       (15.64)
                                                                                      
   g=−                         −r            du dv + r       dθ + sin θ dφ                 r<      ,
       (x2 − x1 )r          x1                                                                x1
610                           Black hole formation 2: Vaidya collapse
      where r = r(u′ , v) is defined implicitly by Eq. (15.61). Again, we get a double-null coordinate
      system adapted to the ingoing and outgoing radial null geodesics. Moreover, the coordinates
      (u′ , v, θ, φ) are regular since gu′ v is finite and non-vanishing in all NII . In addition, the homo-
      thetic Killing horizon H2 , which is defined by v/r = x2 , is the hypersurface u′ = 0 of NII . In
      NI ∩ NII , we have u > 0 and u′ < 0. The relation between the two coordinates is obtained by
      equating the right-hand sides of Eqs. (15.51) and (15.60); one gets
along with the following characterization of the two homothetic Killing horizons (cf. Fig. 15.10):
                                            H1 : u = 0, u′ → −∞                                    (15.66a)
                                            H2 : u → +∞, u′ = 0.                                   (15.66b)
  are
                                                                                  x /x
                                                                               x1 2 1
                       0 < v ≤ v0             and          − ∞ < u′ <                       .   (15.68)
                                                                             x2 v x2 /x1 −1
The second limit agrees with the upper boundary for u′ in (15.68), since the latter can be rewritten as
u′ < 3/(2v) for x1 = 3 and x2 = 6.
   As a particular case of Eq. (15.68), the maximal value of u′ along an ingoing radial null
geodesic with v = v0 (geodesic on the outer edge of the radiation shell, cf. Fig. 15.10) is
                                                            x /x1
                                                          x1 2
                                              u′0   =       x /x1 −1
                                                                       .                          (15.70)
                                                        x2 v0 2
From the expressions of x1 and x2 in terms of α [Eq. (15.44)], it is easy to show that u′H < u′0 .
Taking this property into account, one can draw H in the diagram of NII of Fig. 15.10 and
finally obtain the Carter-Penrose diagram of M shown in Fig. 15.11.
Example 8: For v0 = 18m, one has α = 1/9, x1 = 3 and x2 = 6, so that Eqs. (15.70) and (15.72) yield
respectively u′0 = 3/(2v0 ) = 1/(12m) and u′H = 3u′0 /4 = 9/(8v0 ) = 1/(16m).
    The Carter-Penrose diagram of Fig. 15.11 enables us to fully solve the puzzle of multiple
outgoing radial null geodesics emanating from the point (t, r) = (0, 0) raised at the beginning
of Sec. 15.4.2: (t, r) = (0, 0), or equivalently, (v, r) = (0, 0), does not define a single point, but
a full segment, denoted by O1 O3 in Fig. 15.11. From each point of this segment, there emanates
a single outgoing radial null geodesic (cf. the green solid lines in Fig. 15.11).
    It is instructive to add the future outer trapping horizon T introduced in Sec. 15.3.5 to
the Carter-Penrose diagram. To do so, we need the equation of T in terms of the double null
coordinates. First of all, we notice that in Mrad , T is entirely contained in NII . Indeed, T
612                             Black hole formation 2: Vaidya collapse
      Figure 15.11: Carter-Penrose diagram of the collapsing Vaidya spacetime with M (v) = αv/2 (homothetic
      radiation region) and α < 1/8 (low energy density case). The Minkowski region MMin , radiation region Mrad
      and Schwarzschild region MSch are depicted in respectively pale blue, yellow and grey. Solid (resp. dashed) green
      lines are outgoing (resp. ingoing) radial null geodesics. Zigzag lines indicate curvature singularities. The hatched
      area is the black hole region, delimited by the event horizon H . The red dot-filled area, delimited by the future
      outer trapping horizon T (red curve), is the region where the spheres (t, r) = const are trapped surfaces. The
      points A, B and C are the same as in Fig. 15.8, while the point O of Fig. 15.8 corresponds to the whole segment
      O1 O3 .
      obeying r = αv [Eq. (15.35)], the property α < x−1      2 excludes T from NI , by the definition
      (15.50) of the latter. It suffices then to express T in terms of the coordinates (u′ , v) of NII . This
      is achieved by substituting αv for r in Eq. (15.61) and making use of successively the identity
      α = 2/(x1 x2 ), Eq. (15.71) and Eq. (15.70); one gets
                                                                          v x2 /x1 −1
                                                                            0
                                       T ∩ Mrad :             ′
                                                            u =    u′H                    .                       (15.72)
                                                                           v
      Note that this implies u′ → +∞ for v → 0 and u′ → u′H for v → v0 (cf. Fig. 15.10).
      Example 9: For our favorite example α = 1/9 (x1 = 3 and x2 = 6), the equation of T ∩ Mrad reduces
      to u′ = u′H v0 /v, which corresponds to a branch of hyperbola in the (u′ , v) plane.
      Historical note : The split of Mrad in two regions, denoted here NI and NII , in order to get regular
      double-null coordinate systems, has been performed by B. Waugh and Kayll Lake in 1986 [516]. The
      double-null coordinates (u, v) and (u′ , v) introduced above are theirs, except for some constant factors.
      Four years before, William A. Hiscock, Leslie G. Williams and Douglas M. Eardley [279] exhibited a
                              15.4 Configurations with a naked singularity                                          613
Figure 15.12: Same as Fig. 15.11, with in addition the remote observer O, the partial Cauchy surface Σ and its
future Cauchy development D+ (Σ) (blue hatched area, cf. Sec. 10.8.3 for the definition), which is bounded in the
future by the Cauchy horizon HC and the future null infinity I + .
Carter-Penrose diagram similar to that of Fig. 15.11 except that the Cauchy horizon (to be discussed
below) and the event horizon coincide (Fig. 2 in Ref. [279]). Such a configuration is obtained by adding a
radiation surface layer (Dirac delta) at v = v0 . However, as discussed in Ref. [516], these authors have
not set up regular double-null coordinates in Mrad , so that their construction can be seen as heuristic.
In 1984, Yuhji Kuroda [330] exhibited a Carter-Penrose diagram again similar to that of Fig. 15.7, except
for the radiation region extending to I + (no pure Schwarzschild exterior but M ′ (v) → 0 for v → +∞)
(Fig. 3 in Ref. [330]). The case of a radiation region bounded by v = v0 , as here, can be found in Fig. 1 of
a 2007 article by Brien C. Nolan [384].
      called locally naked, while O1 O2 is called globally naked. For remote observers, one may say that the
      singularity O2 O3 is “clothed” by the event horizon H .
      Remark 3: The curvature singularity of the Vaidya spacetime with v0 < 16m (Fig. 15.7) is not even
      locally naked: observers falling into the black hole region never see it until they hit it at the end of their
      lives.
           The part (v, r) = (0, 0) of the curvature singularity, i.e. the segment O1 O3 in the diagram
      of Fig. 15.11 corresponds to a past null boundary of spacetime. Such a singularity is generically
      called a shell-focusing singularity. This terms has been coined by Eardley & Smarr [176], who
      discovered this type of singularity in spherically symmetric inhomogeneous dust collapse. Due
      to its past null character, a shell-focusing singularity is always locally naked: close observers
      can see it.
           Remote observers see the naked singularity as soon as they cross the null hypersurface HC
      generated by the outgoing radial null geodesics emanating from the birth of the singularity
      at u = 0 in NI (point O1 in Figs. 15.11 and 15.12 ). HC can be seen as the null hypersurface
      extending the homothetic Killing horizon H1 introduced in Sec. 15.4.1 beyond the radiation
      region, i.e. one has HC ∩ Mrad = H1 . The hypersurface HC is actually the future Cauchy
      horizon6 of any partial Cauchy surface Σ that encounters the Minkowkski region7 . In other
      words, the region of spacetime in the future of HC cannot be entirely predicted from initial data
      on Σ and the Einstein equation. In particular, data from the naked curvature singularity O1 O3
      is required. Since a curvature singularity marks a validity limit of general relativity, one cannot
      unambiguously prescribe data there. This seriously hampers the predictability of general
      relativity alone for describing the region beyond HC . For instance, a burst of radiation could
      come out at any time from the naked singularity and change the fate of observers travelling
      in this region. This regards all remote observers at a sufficiently advanced amount of their
      proper time, since any such observer necessarily crosses HC at some point (cf. observer O in
      Fig. 15.12, noticing that the worldlines of all remote observers terminate at the future timelike
      infinity i+ ).
                                                              2M (v)
                                                 α := lim+           .                                      (15.73)
                                                        v→0     v
         6
          Cf. Sec. 10.8.3 for the definition.
         7
          Since it has to be acausal without edge (cf. Sec. 10.8.3), Σ has to meet the spacelike infinity i0 in the
      Carter-Penrose diagram of Fig. 15.12.
                                          15.5 Going further                                            615
See Ref. [330, 194] for details. Of course, for M (v) = mv/v0 , the above definition of α reduces
to (15.18). For α < 1/8, a new possibility may occur: that of a shell-focusing singularity that
lies entirely in the black hole region, i.e. that is naked locally but not globally, contrary to what
happens for M (v) linear (Sec. 15.4). See e.g. Fig. 4 of Ref. [27] or Fig. 9.20 of Ref. [243] for the
corresponding Carter-Penrose diagram. Actually, the Vaidya collapse depicted in Fig. 15.5 lies
in this category: it has M (v) = m(v/v0 )3 [6(v/v0 )2 − 15v/v0 + 10], so that Eq. (15.73) yields
α = 0. The singularity is entirely located in the black hole region and appears to be locally
naked (note the outgoing radial null geodesic emerging from it near (t, r) = (0, 0) in Fig. 15.5).
     It is worth to stress that in the literature, many studies regard the outgoing Vaidya metric,
starting from the original studies (cf. the historical note on p. 586), because they are motivated
either by the modelling of radiating bodies or the study of black hole evaporation. Nonetheless,
most results are applicable to Vaidya collapse (which is based on the ingoing version of the
metric) by performing some time reversal. For instance, one can infer from the recent study
[143] that for an imploding Vaidya spacetime without any Minkowksi region (v spanning
the range (−∞, v0 ) in the radiation region), but such that limv→−∞ M (v) = 0, there is a
curvature singularity at the null boundary v → −∞ of spacetime, whatever the choice of the
non-decreasing mass function M (v).
616   Black hole formation 2: Vaidya collapse
Chapter 16
Contents
         16.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 617
         16.2 Towards the first law of black hole dynamics . . . . . . . . . . . . . . 617
         16.3 Evolution of the black hole area . . . . . . . . . . . . . . . . . . . . . . 635
         16.4 The laws of black hole dynamics . . . . . . . . . . . . . . . . . . . . . . 642
         16.5 Hawking radiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 646
         16.6 Black hole thermodynamics . . . . . . . . . . . . . . . . . . . . . . . . 650
         16.7 Concluding remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 653
16.1      Introduction
This chapter is in a draft stage.
      where the identifications of the black hole’s surface gravity κ and rotation velocity ΩH result
      from Eqs. (10.77) and (10.69) respectively. Hence we get
                                                  κ
                                          δM =      δA + ΩH δJ .                                  (16.1)
                                                 8π
  while if Hk is rotating (ξ spacelike on some parts of Hk ), this holds under the hypotheses
  of the strong rigidity theorem (Property 5.25), with
                                                       Lk
                                                       X       (i)
                                          χk = ξ +           ΩHk ηHk (i) ,                   (16.2)
                                                       i=1
                                               (i)
  where 1 ≤ Lk ≤ [(n − 1)/2], the ΩHk ’s are constants and the ηHk (i) ’s are axisymmetric
  Killing vectors (Property 5.26). We shall encompass both the non-rotating case and the
  rotating one by allowing Lk to take the value 0 in Eq. (16.2) so that χk = ξ is recovered
                                  (i)
  if Hk is non-rotating. Let JHk be the Komar angular momentum with respect to ηHk (i)
  over any cross-section of Hk [Eq. (5.63)]. Being a non-expanding horizon, each Hk has a
  well defined area Ak (Property 3.1). Let κk be the surface gravity of Hk , i.e. the coefficient
  such that ∇χk χk = κk χk on Hk [cf. Eq. (3.29)]. Let us assume that the null dominance
  condition (3.43) is fulfilled on Hk or that Hk is part of a bifurcate Killing horizon; by the
  zeroth law of black hole dynamics (Property 3.10 or 3.16), this implies that κk is constant.
  Finally let us assume that (M , g) is asymptotically flat with g having the same asymptotic
  behavior (5.31) as in general relativitya . Let (M , g + δg) be a nearby stationary spacetime
  sharing the same characteristics as (M , g). Then, the change δM∞ in Komar mass at
  infinity (cf. Property 5.16) between the two spacetimes is related to the changes δAk in
                                (i)
  area and to the changes δJHk in Komar angular momentum by
                                   K             Lk
                                                               !
                                   X   κk       X    (i)   (i)
                      δM∞      =          δAk +     ΩHk δJHk
                                 k=1
                                       8π       i=1
                                       Z                         Z
                                     1                       1
                                 +          µν
                                          G δgµν ξρ dV − ρ
                                                                δ Gµν ξ µ dV ν ,             (16.3)
                                   16π Σ                    8π Σ
  where (i) Σ is any asymptotically flat spacelike hypersurface, the inner boundary of which
  is an axisymmetric cross-section S of the event horizon, i.e. S = ∪K          k=1 Sk with Sk :=
  Σ ∩ Hk and ηHk (i) tangent to Sk for i ∈ {1, . . . , Lk }, (ii) G is the Einstein tensor of g and
  (iii) dV is the normal volume element vector of Σ, as defined by Eq. (5.48).
     a
         Note that g is not required to obey the Einstein equation in M .
Proof. We may consider that the black hole event horizon H is the same hypersurface in
both spacetimes (M , g) and (M , g + δg). If the horizons would differ, we could find a
diffeomorphism M → M that would map the original horizon to the perturbed spacetime’s
horizon. Similarly, there is no loss of generality in considering that the Killing vectors ξ and
ηHk (i) generating the stationarity and axisymmetries are identical, as vector fields on M :
                                      δξ = 0         and        δηHk (i) = 0.                  (16.4)
This amounts to identifying the orbits of the isometry group actions in the two spacetimes.
Given that H is globally invariant by these group actions, this requirement is compatible
with the identification of H in both spacetimes. Let us introduce the short-hand notation
h := δg, or in index notation hαβ = δgαβ . In what follows, indices are raised or lowered with
620                      Evolution and thermodynamics of black holes
      the unperturbed metric g. In particular, hαβ := g αµ g βν hµν . Note that hαβ ̸= δg αβ . Actually, by
      variation of the identity g αµ gµβ = δ αβ , one gets δg αβ = −hαβ . The starting point for proving
      (16.3) is the variation of the generalized Smarr formula (5.84):
                                   K
                                      "                              Lk 
                                                                                                      #
           2(n − 3)               X      1                          X      (i)   (i)     (i)    (i)
                                                                                                    
                      δM∞ =                (Ak δκk + κk δAk ) + 2         JHk δΩHk + ΩHk δJHk
             n−2                  k=1
                                        4π                          i=1
                                         Z
                                     1
                                  − δ Rµν ξ ν dV µ ,                                                   (16.5)
                                    4π Σ
      where Σ is any asymptotically flat spacelike hypersurface, the inner boundary of which is
      a cross-section of the event horizon, i.e. some union of cross-sections Sk of the connected
      components Hk and dV µ is the normal volume element vector of Σ defined by Eq. (5.48).
      Let us start by evaluating the term Ak δκk in the above formula. For the sake of brevity, we
      shall drop the index k in what follows, given that we temporarily focus on a single connected
      component Hk of the event horizon H . The surface gravity κ (= κk ) is given by Eq. (3.31):
      2κ = k µ ∂µ (χν χν ), where χ (= χk ) is the Killing vector normal to the Killing horizon Hk
      and k is the future-directed null vector field defined on Hk , transverse to Hk , normal to the
      cross-section Sk = Hk ∩ Σ and normalized by χ · k = −1. Note that (χ, k) is a null basis of
      the normal plane Tp⊥ Sk at each point p ∈ Sk , χ playing the role of the vector ℓ in Fig. 2.10.
      Varying the above expression of κ yields
      2δκ =      δk µ ∂µ (χν χν ) + k µ ∂µ (δχν χν + χν δχν )
          =      δk µ ∇µ (χν χν ) + k µ ∇µ (δχν χν + χν δχν )
          =      2δk µ χν ∇µ χν + k µ (χν ∇µ δχν + δχν ∇µ χν + δχν ∇µ χν + χν ∇µ δχν )
          =      2δk µ χν ∇µ χν + k µ [χν (∇µ δχν + ∇ν δχµ ) + 2δχν ∇µ χν + δχν ∇µ χν + χν ∇µ δχν ]
          =      2δk µ χν ∇µ χν + (χµ k ν + k µ χν )∇µ δχν + k µ (2δχν ∇µ χν + δχν ∇µ χν + χν ∇µ δχν ) .
      To get the last but one line, we have used the identity χν ∇ν δχµ +δχν ∇µ χν = 0, which expresses
      the vanishing of the Lie derivative of the 1-form δχ along χ [Eq. (A.86) with (k, ℓ) = (0, 1)]:
                                                   Lχ δχ = 0.                                          (16.6)
      The invariance property (16.6) holds because χ is a normal 1-form to Hk (i.e. a vector v is
      tangent to Hk iff ⟨χ, v⟩ = 0) and since Hk is the same hypersurface in the original and the
      perturbed spacetime, χ + δχ is a normal 1-form of Hk as well. Two normal 1-forms to a given
      hypersurface are necessarily collinear: there exists a scalar field λ such that χ + δχ = λχ.
      Setting δλ := λ − 1, we get
                                               δχ = δλ χ.                                    (16.7)
      Then Lχ δχ = (Lχ δλ) χ + δλ Lχ χ. But Lχ χ = 0 for χ is a Killing vector of (M , g) and,
      thanks to Eq. (16.2), Lχ δλ = Lξ δλ + Li=1 Ω(i) Lη(i) δλ = 0 + 0 = 0 because ξ and η(i) are
                                           P
      symmetry generators of both (M , g) and (M , g + δg) [cf. Eq. (16.4)]; this establishes (16.6).
      On the other side, we have
                              L
                              X                                             L
                                                                            X
                         α
                      δχ =           (i)
                                    δΩ      α
                                           η(i)   and     δχα = hαµ χ + µ
                                                                                   δΩ(i) η(i)α .       (16.8)
                              i=1                                            i=1
                             16.2 Towards the first law of black hole dynamics                                                          621
The first formula readily follows from Eqs. (16.2) and (16.4), while the second one follows
from χα = gαµ χµ , which implies δχα = δgαµ χµ + gαµ δχµ , where δgαµ =: hαµ . By means of
Eq. (16.8), we can rewrite the last two terms in the above expression of δκ as1
                                       L
                                       X                                                         L
                                                                                                 X
            ν                     ν                (i)    ν                       ν
                                                                                                          δΩ(i) χν ∇µ η(i)
                                                                                                                       ν
                                                                                        
         δχ ∇µ χν + χν ∇µ δχ =                δΩ         η(i) ∇µ χν    +   χν ∇µ η(i)       =2                             .
                                        i=1                                                      i=1
where Eq. (16.8) has been used to express δχµ and hµν ∇µ χν = 0 and ∇µ η(i)µ = 0 follow from
the Killing equation for respectively χ and η(i) , given the symmetry of h. Together with A = 0,
this result allows us rewrite Eq. (16.9) as
                                                               L
                                      1            X
                                δκ = − χµ ∇ν hµν +     δΩ(i) k µ χν ∇µ η(i)ν .                                             (16.10)
                                      2            i=1
   1
       Note that ∇µ δΩ(i) = 0 since both Ω(i) and Ω(i) + δΩ(i) are constant.
622                        Evolution and thermodynamics of black holes
                                                                                                            √
      Let us integrate this relation over the cross-section Sk , setting dS :=                                  q dn−2 x (the area
      element of Sk ). Since δκ and δΩ(i) are constant, we get
                       Z             Z                L       Z
                                   1                  X
                    δκ      dS = −           ν µ
                                        χµ ∇ h ν dS +   δΩ(i)
                                                                   ∇µ η(i)ν k µ χν dS .                                    (16.11)
                                   2 Sk
                       | S{z
                          k
                             }       |     {z     } i=1       | Sk      {z          }
                           Ak                       ISk                                        −8πJH
                                                                                                      (i)
                                                                                                        k
                                                                        (i)
      The identification of the last integral with −8πJHk readily follows from formula (5.65) for
      the Komar angular momentum, once the area element normal bivector to Sk is expressed as
      dS µν = (χµ k ν − k µ χν ) dS [Eq. (5.86)]. As for the integral denoted ISk in Eq. (16.11), it can be
      rewritten as an integral involving h and the stationary Killing vector ξ instead of the Killing
      vector χ (which depends on Hk , contrary to ξ), namely
                    Z
                                                        1                1
                         ωµν dS µν , with ωαβ :=                                                     (16.12)
                                                                  
            ISk =                                          ξ ∧ H αβ = (ξα Hβ − Hα ξβ ) ,
                      Sk                                2                2
      To prove (16.12), let us evaluate ωµν dS µν , using successively Eq. (5.86), ξµ χµ = 0 (ξ tangent to
      Hk ) and η(i)µ k µ = 0 (η(i) tangent to Sk ):
                            1
          ωµν dS µν =         (ξµ Hν − Hµ ξν ) (χµ k ν − k µ χν ) dS = (ξµ χµ Hν k ν − ξµ k µ Hν χν ) dS
                            2                                           |{z}
                                                                                         0
                                         L
                                          X                   
                      = − χµ k µ −              Ω(i) η(i)µ k µ Hν χν dS = χµ H µ dS
                           | {z }                    | {z }
                                          i=1
                                 −1                        0
                      =     (χµ ∇ν hµν   − χµ ∇µ hνν ) dS
                                                     X L
                      = (χµ ∇ν hµν       − Lξ hνν −       Ω(i) Lη(i) hνν ) dS = χµ ∇ν hµν dS.
                                           | {z }
                                                           i=1
                                                               | {z }
                                                0                             0
      In the last line, Lξ hνν = 0 follows from hνν = g µν hµν , Lξ g µν = 0 (ξ Killing vector of g)
      and Lξ hµν = 0, the latter being a consequence of ξ being a Killing vector of both g and
      g + δg = g + h (cf. Eq. (16.4)), so that Lξ h = Lξ (g + δg) − Lξ g = 0 − 0 = 0. Similarly,
      Lη(i) hνν = 0. In view of the above result and the definition of ISk in Eq. (16.11), we conclude
      that Eq. (16.12) holds. Therefore, Eq. (16.11) can be rewritten as
                                                    Z                             L
                                           1                                      X      (i)    (i)
                                Ak δκk = −                ωµν dS   µν
                                                                        − 8π            JHk δΩHk ,                         (16.14)
                                           2         Sk                           i=1
      where we have restored the index k on κ and the subscript Hk on Ω(i) . If we plug this relation
                       16.2 Towards the first law of black hole dynamics                              623
                            (i)   (i)
into Eq. (16.5), the terms JHk δΩHk cancel out and we are left with
                              K               Lk
                                                           !         K Z
         (n − 3)             X    κk        X     (i)  (i)       1 X
                   δM∞ =              δAk +      ΩHk δJHk −                ωµν dS µν
          n−2                k=1
                                  8π         i=1
                                                                16π k=1 Sk
                                   Z
                                1
                             − δ Rµν ξ ν dV µ .
                               8π Σ
We note that the inner boundary of the hypersurface Σ is Sint = S = ∪K       k=1 Sk , so that the
sum of the integrals over the Sk ’s is actually an integral over Sint and we may apply formula
(5.49) regarding the flux of a 2-form (here ω) to write
                                 K                Lk
                                                                 !
           (n − 3)
                                                                        Z
                               X      κk         X    (i)    (i)      1
                   δM∞ =                 δAk +       ΩHk δJHk −              ωµν dS µν
            n−2                 k=1
                                      8π         i=1
                                                                    16π   S∞
                                     Z                       Z
                                   1                      1
                               +         ∇ν ωµν dV µ −      δ Rµν ξ ν dV µ ,               (16.15)
                                  8π Σ                  8π Σ
where S∞ stands for the outer boundary of Σ, i.e. the limit r → +∞ of a sphere S of
constant value of (x0 , r), (xα ) being an asymptotically Minkowskian    coordinate system such
that Σ is a hypersurface x = const, ξ = ∂0 and r := (x ) + · · · + (xn−1 )2 . In view of
                             0
                                                           p
                                                                1 2
where (ya )1≤a≤n−2 is a coordinate system of S∞ and q is the determinant with respect to it of the
metric q induced by g on S∞ . Now, on S∞ , n = ξ and s = (xi /r)∂i , with ξµ nµ = ξµ ξ µ = −1
and ξµ sµ = 0. Moreover, since (xα ) is asymptotically Minkowskian, we may substitute the
covariant derivatives with partial ones. Hence we get, accounting for ∂0 hi0 = 0 (h = δg is
stationary, since both g and g + δg are),
                                             xi                   √
                       Z                Z
                                   µν
                            ωµν dS =            (∂j hij − ∂i hσσ ) q dn−2 y.
                        S∞               S∞ r
By hypothesis, the metric tensor has the asymptotic behavior (5.31); it follows that, for r → +∞,
                                                                                      
                 δM∞            1                          αn δM∞                    1
      h00 = αn n−3 + O n−2                 and     hij =                 δij + O n−2 ,
                 r            r                          n − 3 rn−3                r
where αn is given by Eq. (5.32). According to Property 5.11, δM∞ in the above formulas is the
same variation of the Komar mass at infinityPas in the left-hand side of Eq. (16.15). The trace of
h takes the asymptotic value hσσ = −h00 + n−1 i=1 hii = 2αn δM∞ /((n − 3)r
                                                                              n−3
                                                                                   ) + O (1/rn−2 ).
Hence we get
                                    xi ∂                                              √ n−2
                                                 
                                                       √
  Z                            Z                                                Z
              µν      αn δM∞                  1               n−2
      ωµν dS = −                         i    n−3
                                                         q d y = αn δM∞                 q̄ d y ,
   S∞                  n − 3 S∞ r ∂x r                |{z}
                                                         √                        S∞
                                    |      {z      } rn−2 q̄                    |      {z     }
                                        −(n−3)/rn−2                                   Ωn−2
624                        Evolution and thermodynamics of black holes
      where q̄ stands for the determinant of the round metric of the unit sphere Sn−2 with respect to
      the coordinates (y a ). Since αn Ωn−2 = 16π/(n − 2) [cf. Eq. (5.32)], there comes
                                        Z
                                                        16π
                                            ωµν dS µν =        δM∞ .
                                         S∞             n−2
      Let us make the Einstein tensor G appear in the last integral via Eq. (A.111): Rµν ξ ν dVµ =
      Gµν ξ ν dVµ + R ξ µ dVµ /2. Given that δξ µ = 0 [Eq. (16.4)], we have δ(R ξ µ dVµ ) = ξ µ (δR dVµ +
      R δ(dVµ )), so that we may write
                          K                Lk
                                                         !                 Z
                        X      κk        X      (i)  (i)       1        1
               δM∞ =              δAk +       ΩHk δJHk +          I−      δ Gµν ξ ν dVµ ,          (16.16)
                        k=1
                               8π         i=1
                                                              8π       8π Σ
      with                              Z
                                    1
                               I :=             ξ µ [∇ν H ν dVµ − (δR dVµ + R δ(dVµ ))] .
                                    2     Σ
                                           √
      Now from Eq. (5.48), dVα = −nα γ dn−1 x, where (xi )1≤i≤n−1 are coordinates on Σ, γ is the
      determinant with respect to (xi ) of the metric γ induced by g on Σ and the normal 1-form nα
      is collinear to the differential of t, the latter being a scalar field defining Σ as a hypersurface
                                                                     √           √
      t = const: nα = −N (dt)α = −N ∂α t. Given the identity −g = N γ (see e.g. Eq. (5.55) of
                                        √
      Ref. [227]), we have dVα = ∂α t −g dn−1 x. Since obviously δ(∂α t) = 0 and δ(dn−1 x) = 0, we
      get
                                                √                  1 √
                               δ(dVα ) = ∂α t δ −g dn−1 x = √ δ −g dVα .                           (16.17)
                                                                   −g
         2
           This is easy to see in a coordinate system (xα ) such that ξ = ∂0 , cf. Eq. (A.85); see also Appendix E2 of
      Ref. [424] for a covariant proof.
                           16.2 Towards the first law of black hole dynamics                                           625
Hence
                                                       √  µ
                               Z                            
                             1          ν     1
                        I=         ∇ν H − √ δ R −g ξ dVµ .
                             2 Σ              −g
                         3
Now, a standard identity , which is at the root of the derivation of the Einstein equation by
extremalizing the Einstein-Hilbert action, is
                               √                            √
                           δ R −g = (Gµν δg µν + ∇µ H µ ) −g.                          (16.18)
It’s rather straightforward to establish (16.18) from the Palatini identity 4 :
                                       δRαβ = ∇µ δΓµαβ − ∇β δΓµαµ ,                                        (16.19)
where the variations δΓγαβ of the Christoffel symbols are actually tensor fields on M (contrary
to the Christoffel symbols themselves), being expressible via the manifestly tensorial relation5
                               1
                      δΓγαβ = g γµ (∇α hµβ + ∇β hαµ − ∇µ hαβ ) .                                           (16.20)
                               2
Using R := g Rµν , we have indeed
            µν
                √                √               √              √
             δ R −g = δg µν Rµν −g + g µν δRµν −g + g µν Rµν δ −g,
where the second term can be expressed by combining Eqs. (16.19), (16.20) and (16.13):
                              g µν δRµν = ∇µ (∇ν hµν − ∇µ hνν ) = ∇µ H µ .
On the other side, formula (A.73) for the variation of the determinant g leads to
                √       1√                 1√                   1√
               δ −g =        −g δ ln |g| =    −g g µν δgµν = −      −g gµν δg µν .                         (16.21)
                        2                  2                    2
The above three equations establish (16.18). It immediately follows that
                              Z                        Z
                            1            µν ρ       1
                     I=−         Gµν δg ξ dVρ =           Gµν δgµν ξρ dV ρ ,
                            2 Σ                     2 Σ
where the second equality results from δg µν = −g µρ g νσ δgρσ . In view of Eq. (16.16), this
completes the proof of the mass variation formula (16.3).
Remark 1: Contrary to the generalized Smarr formula (5.84), the spacetime dimension n does not
appear in the mass variation formula (16.3).
   Let us apply the general mass variation formula (16.3) to a solution of the vacuum Einstein
equation (1.44). In this case, the Ricci tensor vanishes identically, and so does the Einstein
tensor: G = 0. Accordingly one gets rid of the integrals in the right-hand side of formula (16.3).
Moreover, the null dominance condition is automatically fulfilled by G = 0, which guarantees
that the surface gravities κk are constant on each connected component of the event horizon.
We may therefore state:
   3
     See e.g. Eq. (E.1.18) in Wald’s textbook [499], where v = H or Eqs. (4.18)-(4.19) and the unnumbered equation
below (4.20) on p. 437-438 of Deruelle & Uzan’s textbook [163], where V = H.
   4
     See e.g. Eq. (21.21) in MTW [371], Eq. (3.86) in Straumann’s textbook [464] or Eq. (4.62) in Carroll’s textbook
[87].
   5
     See e.g. Eq. (3.93) in Straumann’s textbook [464].
626                        Evolution and thermodynamics of black holes
Property 16.2: mass variation formula for black holes in vacuum general relativity
                                              K                    L
                                                                                  !
                                                                k
                                              X      κk       X    (i)  (i)
                                     δM =               δAk +     ΩHk δJHk            .                     (16.22)
                                              k=1
                                                     8π       i=1
                                                         κ
                                               δM =        δA + ΩH δJ .                                     (16.23)
                                                        8π
        In particular, one recovers formula (16.1), which was obtained for a Kerr black hole.
           a
             We have dropped the subscript ∞ on δM because, in the vacuum case, the Komar mass is independent
        of the integration surface (Property 5.15) and hence needs not to be taken at the asymptotically flat end of
        the stationary spacetime.
Property 16.3: mass variation formula for charged black holes in general relativity
and to the changes δQHk in electric charge of each Killing horizon (cf. Property 5.30) by
                                 K                                   L
                                 X   κk                              X
                       δM∞ =                 δAk + ΦHk δQHk +               Ω(i) δJ∞
                                                                                   (i)
                                                                                       ,            (16.24)
                                 k=1
                                        8π                            i=1
  where κk is the surface gravity of Hk and ΦHk is the electric potential of Hk (cf. Prop-
  erty 5.29).
      For a connected event horizon (K = 1) and a single axisymmetry (L = 1, which is
  required if n = 4), formula (16.24) reduces to
                                          κ
                               δM∞ =        δA + ΩH δJ∞ + ΦH δQH ,                                  (16.25)
                                         8π
  where we have dropped the indices k and i on the various quantities and have denoted
  ΩH the angular velocity of H with respect to the single axisymmetry.
     a
       Note that contrary to the more general setting of Property 16.1, all the connected components Hk
  are assumed to be Killing horizons with respect to the same Killing vector χ; in other words, the rotation
  velocities Ω(i) and axisymmetric Killing vectors η(i) are the same for all the Hk ’s.
Proof. First of all, we note that the null dominant energy condition (3.46) is fulfilled by the
electromagnetic field energy-momentum tensor T [Eq. (1.52)] [321]. By virtue of the zeroth
law (Property 3.10), this guarantees that the surface gravities κk are constant over each Killing
horizon Hk . All the hypotheses of Property 16.1 are thus fulfilled. We shall use the same setup
as in the proof of this property; in particular Eq. (16.4) holds: δξ = 0 and δη(i) = 0. By virtue of
Einstein’s equation (1.41) (with Λ = 0), we may substitute G in the last integral of the general
formula (16.3) by 8πT , where T is given by Eq. (1.52); we get
                     K                 Lk
                                                     !      Z
                    X     κk          X          (i)      1
          δM∞ =              δAk +          (i)
                                          Ω δJHk +             T µν hµν ξ ρ dVρ − δI,        (16.26)
                    k=1
                          8π          i=1
                                                          2  Σ
      respectively. Let us evaluate the variation of the integral over Σ. We have, using δξ ρ = 0
      [Eq. (16.4)],
                        δ(Fµν F µν ξ ρ dVρ ) = δ(Fµν F µν ) ξ ρ dVρ + Fµν F µν ξ ρ δ(dVρ ).
      Now, δ(dVρ ) = (1/2) g µν hµν dVρ [cf. Eqs. (16.17) and (16.21)] and
           δ(Fµν F µν ) = δFµν F µν + Fµν δ(g µρ g νσ Fρσ )
                        = δFµν F µν + Fµν δg µρ g νσ Fρσ + Fµν g µρ δg νσ Fρσ + Fµν g µρ g νσ δFρσ
                                          |{z}                      |{z}        | {z }
                                              −hµρ                      −hνσ              F ρσ
                                µν
                         =   2F δFµν −     2Fρµ F ρν hµν
                         =   2F µν δ(∂µ Aν − ∂ν Aµ ) − 2Fρ µ F ρν hµν
                         =   2F µν (∂µ δAν − ∂ν δAµ ) − 2Fσ µ F σν hµν
                         =   2F µν (∇µ δAν − ∇ν δAµ ) − 2Fσ µ F σν hµν
                         =   4F µν ∇µ δAν − 2Fσ µ F σν hµν .
      Hence
                                                                           
                     µν ρ                                           1
              δ(Fµν F ξ dVρ ) =                     µ σν
                                  4F ∇µ δAν − 2Fσ F hµν + Fστ F g hµν ξ ρ dVρ
                                     µν                               στ µν
                                                                    2
                                    µν               µν         ρ
                              = (4F ∇µ δAν − 2µ0 T hµν ) ξ dVρ
                              = 4∇ν Ωµν dVµ − 2µ0 T µν hµν ξ ρ dVρ ,          (16.28)
      where
                                       Ωµν := δAσ (F σµ ξ ν − F σν ξ µ ) .                         (16.29)
      To prove (16.28), we need to show that F νσ ∇ν δAσ ξ µ = ∇ν Ωµν . This is easily done by a
      direct computation of ∇ν Ωµν from (16.29), using ∇ν ξ ν = 0 (Killing equation) and ∇ν F σν = 0
      (source-free Maxwell equation):
                                       F σν ξ µ +δAσ (∇ν F σµ ξ ν +F σµ ∇ν ξ ν − ∇ν F σν ξ µ −F σν ∇ν ξ µ ).
      ∇ν Ωµν = ∇ν δAσ F σµ ξ ν −∇ν δAσ |{z}
                                         νσ
                                                                        | {z } | {z }
                                          −F                                   0      0
                                                   ↠
      Expressing the symmetry properties Lξ F = 0 and Lξ δA = 0 as respectively ξ ν ∇ν F σµ −
      F σν ∇ν ξ µ = F νµ ∇ν ξ σ and ξ ν ∇ν δAσ = −δAν ∇σ ξ ν [cf. Eq. (A.86)], we get
               ∇ν Ωµν = −F σµ δAν ∇σ ξ ν + F νσ ∇ν δAσ ξ µ + δAσ F νµ ∇ν ξ σ = F νσ ∇ν δAσ ξ µ .
      Hence Eq. (16.28) holds. We may then use expression (16.27) to write the variation of I as
                            Z                     Z                    Z
                       1           σ      µν    1      ν        µ    1
               δI =       δ    Aσ ξ Fµν dS −         ∇ Ωµν dV +           T µν hµν ξ ρ dVρ .
                     2µ0     S                 µ0 Σ                  2 Σ
      In view of the definition (16.29), Ωµν is antisymmetric, so that Ωµν describes a 2-form and
      we may invoke Lemma 5.12 with Sint := S to express the second integral as a flux integral
      through S (setting to zero the flux through S∞ due to the fast decay of A and F and hence
      of Ω). We thus get
                          Z                       Z                      Z
                     1            σ       µν    1          σ       µν  1
              δI =      δ     Aσ ξ Fµν dS +           δAσ F µ ξν dS +       T µν hµν ξ ρ dVρ ,
                   2µ0     S                   µ0 S                    2 Σ
                         16.2 Towards the first law of black hole dynamics                                                       629
where we have used Eq. (16.29) and the antisymmetry of dS µν to write Ωµν dS µν = δAσ (F σµ ξν −
F σν ξµ )dS µν = 2δAσ F σµ ξν dS µν . Substituting the above expression for δI into Eq. (16.26) leads
to                                                                !
                                K                   L
                               X       κk         X           (i)
                     δM∞ =                δAk +        Ω(i) δJHk + (δM∞ )EM ,                 (16.30)
                               k=1
                                       8π          i=1
Let us use ξ = χ − Li=1 Ω(i) η(i) to let appear the null normal χ of the event horizon. Taking
                    P
into account that η(i)ν dS µν = 0 for η(i) is tangent to S , we get
                           Z              L
                                                                      !                        Z
           EM       1                 σ
                                          X
                                                      (i)       σ                  µν     1
   (δM∞ )       =−     δ        Aσ χ −              Ω       Aσ η(i)       Fµν dS        −            δAσ F σµ χν dS µν .
                   2µ0      S                 i=1
                                                                                          µ0     S
Now, on each connected component Sk of S , Eq. (5.99) gives Fµν dS µν = 2µ0 σk dS, where σk
                                                                                √
is the effective electric charge density of Sk (cf. Property 5.29) and dS := q dn−2 x is the area
element of Sk . Furthermore, thanks to Eq. (5.86), χν dS µν = χν (χµ k ν − k ν χν )dS = −χµ dS.
Hence
                 K
                    " Z                       L    Z                            Z                 #
                X                           X                               1
 (δM∞ )EM =           −δ      Aσ χσ σk dS +      δ     Ω(i) Aσ η(i)
                                                                σ
                                                                    σk dS +         δAσ F σµ χµ dS .
                k=1        Sk                i=1    Sk                      µ 0  Sk
      Besides, the Komar angular momentum at infinity with respect to the axisymmetry generated
      by η(i) is given by formula (5.70):
                                                  Z
                                          (i) (i)        ν
                                       J∞ = JS + Tµν η(i)   dV µ ,
                                                            Σ
      where we have taken (5.48) into account and have set to zero the term involving the trace T in
      Eq. (5.70) since it is always possible to choose Σ so that all the η(i) ’s are tangent to it, ensuring
       µ
      η(i) dVµ = 0. Substituting the electromagnetic form (1.52) for T and using again η(i)           µ
                                                                                                        dVµ = 0,
      we get                                              Z
                                         (i)     (i)  1
                                        J∞ = JS +                      ν
                                                            Fρµ F ρν η(i) dV µ .
                                                      µ0 Σ
      Now since A obeys the spacetime symmetries, we have Lη(i) A = 0, from which we deduce
      that Fρµ F ρν η(i)
                      ν
                         = ∇ρ (Aν η(i)ν
                                        Fρµ ), in exactly the same way as that used above to establish
      Fρµ F ρν ξ ν = ∇ρ (Aν ξ ν Fρµ ) when evaluating the integral I. Hence
                                      Z                                        Z
                    (i)     (i)   1         ν      ρ        µ      (i)     1             ρ
                   J∞ = JS −             ∇ (Aρ η(i) Fµν ) dV = JS +                 Aρ η(i) Fµν dS µν ,
                                  µ0 Σ                                    2µ0 S
                                                                                           (i)           (i)
      where the second equality stems from Lemma 5.12. Since S = ∪K             k=1 Sk , JS =       k=1 JHk and
                                                                                                 PK
      Fµν dS µν = 2µ0 σk dS on Sk [Eq. (5.99)], we may rewrite the above expression as
                                              K 
                                              X        Z                            
                                                  (i)
                                      (i)
                                     J∞     =    JHk +                   ρ
                                                                     Aρ η(i) σk   dS .                    (16.32)
                                             k=1                Sk
                                                                                    (i)
      Then obviously, the term in factor of Ω(i) in Eq. (16.31) is δJ∞ , which ends the proof of
      formula (16.24).
      Remark 2: When contrasted with the electrovacuum Smarr formula (5.100), the mass variation formula
      (16.24) has the distinctive feature of being independent of the spacetime dimension n.
      Remark 3: In the limit of a vanishing electric charge (QH = 0), formula (16.25) reduces to the vacuum
      formula (16.23), given that M∞ = M and J∞ = J are independent of the integration surface in vacuum.
      Remark 4: An alternative derivation of the mass variation formulas (16.23) and (16.25) can be performed
      within the ADM Hamiltonian formulation of general relativity, as shown by Sudarsky and Wald (1992)
      [467, 468] (see also Sec. 6.2 of Ref. [503]). This derivation does not rely on the generalized Smarr formula
      (used in the proof of Property 16.1, from which formulas (16.23) and (16.25) are derived) but it requires
      that the black hole event horizon is part of a bifurcate Killing horizon. This last assumption has been
      removed by Gao and Wald in 2001 [212].
ε = ε(s, nb ), (16.34)
where s and nb are respectively the entropy density and the baryon density, both measured
in the fluid’s frame. The fluid’s thermodynamic temperature T and the baryon relativis-
tic chemical potentialb µb are then defined as the partial derivatives T := (∂ε/∂s)nb and
µb := (∂ε/∂nb )s . The resulting identity
dε = T ds + µb dnb (16.35)
expresses the first law of thermodynamics for a small fixed comoving volume. The fluid’s
pressure p is not an independent function of (s, nb ) but rather is given by the thermody-
namic identity
                                   p = T s + µb nb − ε,                              (16.36)
which, along with (16.35), implies the Gibbs-Duhem relation: dp = s dT + nb dµb .
    The law of baryon number conservation asserts that nb u is a conserved current: ∇ ·
(nb u) = 0. Furthermore, if we assume that g obeys the Einstein equation (1.41), the
                                                    →
                                                    −
energy-momentum conservation (1.45) holds: ∇ · T = 0. Its projection along u lead to
the conservation of the entropy current su, i.e. ∇ · (su) = 0 (no heat exchange between
neighboring elements of a perfect fluid). Given a spacelike hypersurface Σ, the conserved
currents nb u and su give rise to the concepts of number of baryons Nb in Σ and fluid
entropy Sf in Σ:
                            Z                              Z
                    Nb :=          µ
                              nb u dVµ       and      Sf =    suµ dVµ ,            (16.37)
                             Σ                                       Σ
where dV is Σ’s normal volume element vector [Eq. (5.48)]. If the spacetime (M , g) is
axisymmetric (Killing vector η), a third conserved current is j := −1/(16π)J [η], where
                                                                     →
                                                                     −
J [η] is the Komar current of η, as given by Eq. (5.69): J [η] = −2 R · η. Thanks to the
                              →
                              −
Einstein equation (1.42), j = T · η − (T µµ /2)η = (ε + p)(u · η) u + (ε − p)/2 η. The
fluid angular momentum in Σ is the integral of j · dV ; assuming that η is tangent to Σ (i.e.
η · dV = 0), we get                  Z
                                 Jf =       (ε + p)uν η ν uµ dVµ .                   (16.38)
                                        Σ
632                           Evolution and thermodynamics of black holes
        The three quantities Nb , Sf and Jf do not depend upon the choice of the hypersurface Σ
        as long as the latter is large enough to encompass all the fluid, i.e. Σ intersects each fluid
        line exactly once (cf. Fig. ??).
           a
               See Refs. [371] (Chap. 22), [97, 226] or [228] (Chap. 21).
           b
               By relativistic it is meant that the rest mass is included in the chemical potential.
                                                                                              →
                                                                                              −
      Proof. The entropy conservation ∇ · (su) = 0 follows straightforwardly from ∇ · T = 0
      projected along u, using the identities (16.35), (16.36) and ∇ · (nb u) = 0 (see e.g. Ref. [226]
      for details). Because they are fluxes of conserved currents, the quantities Nb , Sf and Jf take
      the same values on two hypersurfaces Σ1 and Σ2 that are connected by a (timelike or null)
      hypersurface W with all the currents vanishing on W , which happens if W is located outside
      the fluid worldtube.
          From an astrophysical point of view, it is natural to consider that the matter surrounding
      the black hole forms an accretion disk or torus, so that it can be modeled by a perfect fluid in
      circular motion around the black hole rotation axis. One then derives from the general law
      (16.3) the following mass variation formula:
Property 16.5: mass variation formula for a black hole surrounded by a fluid
u = ut (ξ + Ωη) , (16.39)
        where ut and Ω are two scalar fields. Ω is called the fluid angular velocity because
        Ω = dφ/dt along a fluid line, where t and φ are adapted coordinates: ∂t = ξ and
        ∂φ = η. The notation ut reminds that this factor is the component of u along ∂t in adapted
        coordinates. It is fully determined by Ω and the three scalar products formed from the two
        Killing vectors:
                                                                     −1/2
                                   ut = −ξ · ξ − 2Ωξ · η − Ω2 η · η        .                (16.40)
        Let (M , g + δg) be a nearby stationary spacetime sharing the same characteristics. The
        change δM∞ in Komar mass at infinity between the two spacetimes is given by
                                                     Z                   Z                    Z
                         κ                                T                   µb
               δM∞    =    δA + ΩH δJH +                     δ(dSf ) +           δ(dNb ) +          Ω δ(dJf ) ,   (16.41)
                        8π                            Σ   ut              Σ   ut                Σ
                           16.2 Towards the first law of black hole dynamics                               633
dSf := suµ dVµ , dNb := nb uµ dVµ and dJf := (ε + p)uν η ν uµ dVµ . (16.42)
Proof. Let us specialize the general formula (16.3) to the case n = 4, K = 1, L1 = 1 and g
obeying the Einstein equation (so that the Einstein tensor G can be replaced by 8πT ):
                                          Z                     Z
                   κ                    1
         δM∞ =        δA + ΩH δJH +          T δgµν ξ dVρ − δ T µν ξ ν dVµ .
                                              µν       ρ
                                                                                       (16.43)
                  8π                    2 Σ                       Σ
Let us rewrite the integrand of the last integral by means of the perfect fluid expression (16.33)
for T :
                           T µν ξ ν dVµ = [(ε + p)uµ uν ξ ν + pξ µ ] dVµ .
Substituting (ut )−1 u − Ωη for ξ [Eq. (16.39)] and using uν uν = −1 and η µ dVµ = 0 (since η is
tangent to Σ), we get
                                                ε
                               T µν ξ ν dVµ = − t uµ dVµ − Ω dJf ,
                                                u
where use has been made of Eq. (16.42) to let appear dJf . Let us perform the variation of this
expression and use δε = T δs + µb δnb in agreement with Eq. (16.35):
                        1                             ε δut µ        ε
δ (T µν ξ ν dVµ ) = −    t
                           (T δs + µ b δn b )uµ
                                                dVµ +    t 2
                                                             u dVµ − t δ (uµ dVµ ) − δΩ dJf − Ω δ(dJf ).
                        u                             (u )          u
Using Eq. (16.42) to let appear δ(dSf ) and δ(dNb ) and substituting p for T s + µb nb − ε
[Eq. (16.36)], we get
                        T            µb                        p                ε δut µ
δ (T µν ξ ν dVµ ) = −      δ(dSf ) −    δ(dN b ) − Ω δ(dJf ) +    δ(uµ
                                                                       dV µ ) +        u dVµ − δΩ dJf .
                        ut           ut                        ut               (ut )2
Now δ(uµ dVµ ) = δuµ dVµ + uµ δ(dVµ ), with, according to Eqs. (16.39) and (16.4),
and, according to Eqs. (16.17) and (16.21), δ(dVµ ) = (1/2)g ρσ δgρσ dVµ . Hence, using η µ dVµ = 0,
                                           t                
                                µ            δu     1 ρσ
                            δ(u dVµ ) =          + g δgρσ uµ dVµ .
                                              ut    2
The above expression of δ (T µν ξ ν dVµ ) becomes then
                                     T            µb
          δ (T µν ξ ν dVµ ) = −       t
                                        δ(dSf ) − t δ(dNb ) − Ω δ(dJf )
                                     u
                                      t         u                           
                                          δu           ν           1 p ρσ
                                   +       t 2
                                               − δΩuν η (ε + p) +      t
                                                                         g δgρσ uµ dVµ .
                                         (u )                      2u
634                       Evolution and thermodynamics of black holes
      Note that we have used Eq. (16.42) as δΩ dJf = (ε + p)δΩ uν η ν uµ dVµ . Now, from Eq. (16.44),
      δut /ut − ut δΩuν η ν = −uν δuν . Hence
                              T            µb                        hp                            i 1
      δ (T µν ξ ν dVµ ) = −      δ(dSf ) −    δ(dN b ) − Ω δ(dJf ) +    g ρσ
                                                                             δg ρσ − (ε + p)uν δuν
                                                                                                        uµ dVµ .
                              ut           ut                         2                              ut
Since gρσ uρ uσ = −1 implies uρ δuρ = −(1/2)δgρσ uρ uσ , we may rewrite the above relation as
                                         T            µb                      1
                 δ (T µν ξ ν dVµ ) = −     t
                                             δ(dSf ) − t δ(dNb ) − Ω δ(dJf ) + T ρσ δgρσ ξ µ dVµ ,
                                         u            u                       2
      where we have used Eq. (16.33) to let appear T ρσ and Eq. (16.39) along with η µ dVµ = 0 to write
      (ut )−1 uµ dVµ = ξ µ dVµ . Integrating the above expression on Σ and substituting into Eq. (16.43)
      yields formula (16.41).
      Historical note : The mass-variation formula (16.1) for Kerr black holes has been first derived by Jacob
      Bekenstein in 1972 [51, 52], using the same differentiation procedure as in Sec. 16.2.1. Actually, the
      formula derived by Bekenstein applies to Kerr-Newman black holes, being a special case of the generic
      formula (16.25) for electrically charged black holes. The extension to generic stationary 4-dimensional
      black holes surrounded by a perfect fluid in circular motion [Eq. (16.41)] has been obtained by James
      Bardeen, Brandon Carter and Stephen Hawking in 1973 [40]. A different derivation of the same formula
      (16.41) can also be found in Bardeen’s lecture at the famous 1972 Les Houches Summer School [39]. The
      generalization to include an electromagnetic field has been presented by Carter at the same summer
      school [96]. In the pure electrovacuum case (no fluid, no electric current), Carter’s formula (Eq. (9.64) in
      Ref. [96]) reduces to the case K = 1, L = 1 of formula (16.24). As for the general form (16.3) involving
      the Einstein tensor, it has been exhibited by Carter in 1979 [99] in the case n = 4 and K = 1 (connected
      event horizon). The mass variation formula with a fluid (16.41) has been extended to non-circular flows
      by Carter in 1979 [98].
         6
           In Newtonian mechanics, the work done by a torque τ on a body that is rotating by dφ is dW = τ dφ. Given
      that τ := dJ/dt, one gets dW = Ω dJ, where Ω = dφ/dt is the body’s angular velocity.
                               16.3 Evolution of the black hole area                                   635
  Under the accretion of particles of energy sufficiently small to neglect gravitational radia-
  tion, the area A of a Kerr black hole necessarily increases:
δA > 0 . (16.47)
Remark 1: The area increase holds even if the mass M decreases, which occurs for a Penrose process
(E ′ > E), since then E∗ < 0 and thus δM < 0 by virtue of Eq. (16.45).
    Any transformation of a black hole with δA > 0 is called an irreversible transformation .
For particle accretion, the irreversibility traces back to the inequality E∗ > ΩH L∗ [Eq. (11.60)],
636                      Evolution and thermodynamics of black holes
      which must hold if P∗ is to cross the event horizon, i.e. is to be actually accreted. Indeed, in view
      of Eq. (16.45), one might think naively that the transformation (M, J) → (M + δM, J + δJ)
      could be reverted by throwing into the black hole a particle of conserved energy E∗′ = −δM =
      −E∗ and conserved angular momentum L′∗ = −δJ = −L∗ . However this is not permitted
      because the constraint E∗′ > ΩH L′∗ is violated if E∗ > ΩH L∗ holds.
           A reversible transformation is a transformation such that δA = 0. For particle accretion,
      it is achieved in the limit case E∗ = ΩH L∗ [cf. Eq. (16.46)]. The particle P∗ reaches then the
      event horizon H with a vanishing radial affine velocity dr/dλ, λ being the affine parameter
      along P∗ ’s worldline associated with P∗ ’s 4-momentum (cf. Sec. 11.2.1). Indeed, the equation
      of geodesic motion (11.43b) yields, on H (r = r+ ),
                                                      2
                                 dr                  r+ + a2
                                             =−    2
                                                                 (E∗ − ΩH L∗ ).                    (16.48)
                                 dλ   r=r+        r+ + a2 cos2 θ
      Note that we have set ϵr = −1 in Eq. (11.43b) to account for the infall motionp         of P∗ and
      have specialized expression (11.44) for R(r) to ∆ = 0 (r = r+ ), which yields R(r+ ) =
      (r+2
           + a2 )|E∗ − ΩH L∗ | = (r+   2
                                         + a2 )(E∗ − ΩH L∗ ) thanks to Eq. (11.60) and the identity
      a/(r+ + a ) = ΩH [Eq. (10.69)]. Equation (16.48) clearly shows that dr/dλ → 0 in the
            2     2
      limit ΩH L∗ → E∗ . Moreover, √ from expression     (11.29) for R(r), one gets, for a geodesic with
      ΩH L∗ = E∗ , R′ (r+ ) = −2 M 2 − a2 (µ2∗ r+  2
                                                     + K∗ ), where µ∗ is the mass of P∗ and K∗ is the
      Carter constant of P∗ . Given that M 2 − a2 ̸= 0 (non-extremal Kerr) and K∗ ≥ 0 [Eq. (11.30)],
      we obtain R′ (r+ ) ̸= 0, except if µ∗ = 0 and K∗ = 0. According to Property 11.2, the last case
      would occur only if P∗ followed a principal null geodesic of Kerr spacetime. However, this is
      excluded since those geodesics do not fulfill ΩH L∗ = E∗ in the black hole exterior: they obey
      L∗ = aE∗ sin2 θ [Eqs. (11.6) and (11.9b)], so that ΩH L∗ = E∗ would imply ΩH a sin2 θ = 1, i.e.
      r+2
           + a2 cos2 θ = 0, which is impossible. Hence one has always R′ (r+ ) ̸= 0. We conclude that
      P∗ has necessarily a r-turning point on H (cf. Sec. 11.2.6). Hence a reversible transformation
      corresponds to the idealized case where the particle P∗ is on the verge of being accreted by
      the black hole but is actually not. This case is approached when P∗ crosses H with an affine
      velocity dr/dλ as small as possible.
           The square root of the horizon area A (rescaled by 1/(16π)) is called the black hole’s
      irreducible mass:
                                                           A
                                                   2
                                                Mirr  :=       .                                  (16.49)
                                                          16π
      Mirr has the dimension of a mass and Eq. (16.47) is equivalent to δMirr > 0, i.e. Mirr cannot
      decrease in any accretion process, hence its name. This contrasts with M , which may decrease
      (Penrose process). Formula (10.83) for A can be turned into an expression for Mirr in terms of
      M and J = aM (cf. Fig. 16.1):
                                                v           r           !
                                                u
                                                u1                  J 2
                                      Mirr = M t        1+ 1− 4 .                                 (16.50)
                                                   2                M
1.00
0.95
0.90
                      Mirr /M
                                0.85
0.80
0.75
Figure 16.1: Irreducible mass Mirr of a Kerr black hole of mass M , as a function of the black hole angular
momentum J [Eq. (16.50)] (J = 0 corresponds to a Schwarzschild black hole and J/M 2 = 1 to an extremal Kerr
one). The shadded area depicts the maximal energy that can be extracted from the black hole by a Penrose process
bringing J down to 0.
    Suppose some energy is extracted by a sequence of Penrose processes (cf. Sec. 11.3.2)
from a Kerr black hole of initial mass M and angular momentum J, leading to a final state
corresponding to a Kerr black hole of mass M0 and angular momentum J0 . The maximal energy
∆M = M − M0 that can be extracted is achieved when the final state is a Schwarzschild black
hole (J0 = 0), since then the Penrose process necessarily stops, due to the lack of an outer
ergoregion (cf. Sec. 11.3.2). Since the irreducible mass of a Schwarzschild black hole coincides
with its mass (set J = 0 in Eq. (16.50)), the irreducible mass of the final state is Mirr,0 = M0
and we have ∆M = M − Mirr,0 . The evolution law Mirr,0 ≥ Mirr provides then a upper bound
on the extracted energy:
                                         ∆M ≤ M − Mirr ,                                   (16.52)
with equality only for a sequence of reversible transformations. Figure 16.1 shows that ∆M/M
is maximal when the initial state is (close to) an extremal Kerr black hole (J = M 2 ). From
Eq. (16.50), one gets
                                      ∆M            1
                                max         = 1 − √ ≃ 0.293.                             (16.53)
                                       M             2
We conclude:
  Property 16.7: maximal energy extraction by a Penrose process
  A sequence of Penrose processes can extract up to 29% of the mass of a Kerr black hole.
  This upper bound is achieved for a maximally spinning black hole (extremal Kerr), by
  bringing it down to a non-rotating state (Schwarzschild) via reversible transformations.
Historical note : Demetrios Christodoulou introduced the concepts of irreversible transformation and
irreducible mass of a Kerr black hole in 1970 [112] and established formula (16.51). Actually, he did not
638                       Evolution and thermodynamics of black holes
      make any argument about the black hole area; rather he integrated the relation dM = ΩH dJ, which
      holds when the black hole is accreting a particle with E∗ = ΩH L∗ (cf. Eq. (16.45), i.e. at the saturation
      limit of the inequality (11.60). Indeed,
                                       √       by expressing ΩH as a function of (M,√  J) via Eq. (10.69), one
      gets dM = JdJ/(2M (M + M − J )), which is equivalent to d(M + M 4 − J 2 ) = 0. The
                                  2        4     2                                 2
irreducible mass√ Mirr then appears when writing the integration constant for this equation as 2Mirr 2 ,
      namely M + M 4 − J 2 = 2Mirr , which is equivalent to Eq. (16.50). The connection with the black
                 2                       2
      hole area [Eq. (16.49)] has been performed in a second study, with Remo Ruffini, published in 1971
      [114] and which extends Christodoulou’s results to Kerr-Newmann black holes. Meanwhile, Roger
      Penrose and Roger Floyd noticed at the end of 1970 (published 1971 [409]) that the area of a Kerr
      black hole necessarily increases under particle accretion, i.e. they established Property 16.6. In 1972,
      Brandon Carter [94] pointed out that the relation dM = ΩH dJ also holds for a rigidly rotating fluid
      star undergoing a thermodynamically reversible transformation (in that case, ΩH stands for the star’s
      angular velocity). This strengthened the analogy between irreversibility in black hole processes and
      standard thermodynamics.
θ(ℓ) ≥ 0. (16.54)
      Proof. Let ℓ be a future-directed null normal vector field of H . ℓ is necessarily tangent to the
      null geodesic geodesic generators of H (Property 2.1) and is thus a pregeodesic vector, i.e. it
      obeys Eq. (2.22): ∇ℓ ℓ = κ ℓ. If ℓ is not geodesic (κ ̸= 0), it is always possible to rescale it to
      ℓ′ = αℓ with α > 0 so that ℓ′ is a future-directed geodesic vector field: ∇ℓ′ ℓ′ = 0 [Eq. (2.24)].
      We have then θ(ℓ′ ) = αθ(ℓ) [cf. Eq. (2.68)], so that θ(ℓ) ≥ 0 ⇐⇒ θ(ℓ′ ) ≥ 0. Accordingly, for
      proving (16.54), there is no loss of generality in assuming that ℓ is a geodesic vector field. Let
      us consider a null geodesic generator L of H . Up to some additive constant, there is a unique
      affine parameter λ of L associated to ℓ, i.e. such that ℓ = dx/dλ along L . The evolution
      of θ(ℓ) along L is measured by dθ(ℓ) /dλ = ∇ℓ θ(ℓ) and is given by the null Raychaudhuri
                              16.3 Evolution of the black hole area                                  639
where σab σ ab ≥ 0 has been established in Sec. 2.4.2 [Eq. (2.92)] and R(ℓ, ℓ) ≥ 0 holds by virtue
of the null convergence condition on H . Hence
                                     dθ(ℓ)        1
                                           ≤−         θ2 .                                (16.55)
                                      dλ       n − 2 (ℓ)
Let us assume the negation of (16.54), i.e. that there exists a point p ∈ L ∩ H where
θ(ℓ) = θ0 < 0. By choosing the additive constant in the definition of the affine parameter λ, we
may ensure λ(p) = 0. Equation (16.55) implies then
                                   ∀λ ≥ 0,     θ(ℓ) (λ) ≤ θ̄(λ),                           (16.56)
where θ̄(λ) obeys
                           dθ̄        1 2
                               =−         θ̄     and        θ̄(0) = θ0 .
                           dλ       n−2
The unique solution of this differential equation is
                                                     θ0
                                   θ̄(λ) =                    .
                                             1 + θ0 λ/(n − 2)
It follows that θ̄ → −∞ as λ → −(n − 2)/θ0 > 0. The inequality (16.56) then implies
θ(ℓ) → −∞ as λ → λ∗ with 0 < λ∗ ≤ −(n − 2)/θ0 . Hence the point p∗ ∈ L of parameter λ∗
is a focusing point, i.e. a point where neighboring null geodesic generators of H intersect. But
according to Property 4.8 of black hole event horizons (cf. Sec. 4.4.3), this can happen only
if p∗ is a crossover point, i.e. a point at which the null geodesic L enters H ; however, this
situation is excluded since λ∗ > 0 implies that p∗ lies in the future of p, where L is already in
H . Hence the hypothesis θ0 < 0 leads to a contradiction. It follows that θ0 ≥ 0, i.e. at any
point p ∈ H , θ(ℓ) ≥ 0.
  Property 16.9: area theorem (Hawking 1971 [259], Chruściel, Delay, Galloway and
  Howard 2001 [120])
       (i) H is smooth between S1 and S2 , and S1 and S2 are cross-sectionsa of H that are
           intersected by the same null geodesic generators of H (cf. Fig. 16.2)
  or
640                        Evolution and thermodynamics of black holes
      Figure 16.2: Cross-sections S1 and S2 induced by the spacelike hypersurfaces Σ1 and Σ2 in a smooth part of
      the event horizon H , such that S1 and S2 are intersected by the same null geodesic generators of H (green
      curves).
      Proof. For n = 2, the theorem is trivial since S1 and S2 are each reduced to a single point, so
      that A(S1 ) = A(S2 ) = 0 and Eq. (16.57) is fulfilled. Let us then assume n ≥ 3. We consider
      first the case (i) (H smooth between S1 and S2 ). S1 and S2 are then cross-sections of H
      that are connected by null geodesic generators of H (cf. Fig. 16.2). One may introduce a
      1-parameter foliation (St )t∈[0,1] of H by cross-sections St such that S0 = S1 and S1 = S2
      (cf. the proof of Property 3.1 for details, λ playing the role of t there). The label t can then
      be considered as a parameter along the null geodesic generators of H connecting S1 to S2 .
      Let ℓ = dx/dt be the associated tangent vector. Given that S2 lies in the future of S1 , ℓ is
      future-directed. By the very definition of the expansion θ(ℓ) along ℓ [Eq. (2.49) combined with
      Eq. (2.53)], one has
                                                        √
                                                Z
                                    d
                                       A(St ) =     θ(ℓ) q dx2 · · · dxn−1 ,
                                   dt            St
      where (x2 , . . . , xn−1 ) is a coordinate system on St and q is the determinant with respect to
      these coordinates of the metric induced by g on St . Now, if the null convergence condition
      holds, Property 16.8 implies θ(ℓ) ≥ 0; hence
                                                       d
                                                          A(St ) ≥ 0.
                                                       dt
                                    16.3 Evolution of the black hole area                                             641
Figure 16.3: Surfaces S1 and S2 induced by the spacelike hypersurfaces Σ1 and Σ2 on the event horizon H
corresponding to a black hole merger. All solid dark green and light green curves are null geodesic generators of
H . The light green part of H is generated by null geodesics that entered H at some caustic points (three of
them are indicated as light green dots); the parts of these geodesics outside H are drawn as light green dot-dashed
curves. The dark green dashed curves depict other null geodesics that enter H at the same caustic points.
Example 1 (Oppenheimer-Snyder collapse): Let us consider the black hole formed by the collapse of
a ball of pressureless matter, as described by the Oppenheimer-Snyder model studied in Sec. 14.3. Let
St̃ be a cross-section of the event horizon at constant ingoing Eddington-Finkelstein coordinate t̃. The
area of St̃ is simply A = 4πr2 , where r is the areal radius (cf. Sec. 14.3.5). Its evolution can thus be
read directly on Fig. 14.2 (right): it is increasing from A = 0 (at t̃ ≃ 5.6 m) to the Schwarzschild value
A = 16πm2 (at t̃ ≃ 9.7 m). This fully agrees with the area theorem, given that the null energy condition
(2.95) is fulfilled by the energy-momentum tensor (14.2) of the collapsing matter: T (ℓ, ℓ) = ρ(u · ℓ)2 ≥ 0
since ρ ≥ 0.
642                       Evolution and thermodynamics of black holes
      Example 2 (Vaidya collapse): Similarly, we check on Figs. 15.1 and 15.5 that for the radiation shell
      collapse studied in Chap. 15, the area of cross-sections of the event horizon at constant ingoing Eddington-
      Finkelstein coordinate t is increasing towards the future, for r is again the areal radius [cf. the metric
      (15.11)]. As we have already noticed in Sec. 15.2.1, the radiation energy-momentum tensor (15.5) fulfills
      the null energy condition, given that M ′ (v) ≥ 0 (cf. Fig. 15.2).
      Historical note : As mentioned in the historical note on p. 637, Roger Penrose and Roger Floyd [409]
      have shown in 1970 (published 1971) that the area of a Kerr black hole always increases during particle
      accretion, even if its mass is decreasing, as in a Penrose process (Property 16.6). Soon after, in 1971,
      Stephen Hawking [259] established the area theorem for generic dynamical black holes and a detailed
      proof was given in Hawking & Ellis’ 1973 textbook (Proposition 9.2.7 in [266]). In 2001, Piotr Chruściel,
      Erwann Delay, Gregory Galloway and Ralph Howard [120] pointed out that the Hawking & Ellis proof
      is valid only for H piecewise smooth (cf. the discussion in Sec. 3.5.1 of Chruściel’s textbook [119]); they
      constructed a new proof that does not rely on the smoothness of H .
    Given that a black hole “in equilibrium” is modeled by a black hole in a stationary spacetime,
this property bears a strong resemblance with the zeroth law of thermodynamics, which states
that the temperature of a body in equilibrium is uniform over the entire body. This strengthens
the interpretation of the surface gravity κ as the temperature T (up to some factor) performed
in Eq. (16.58b).
However, as pointed out recently by Kehle and Unger [311], Israel’s proof is restricted to the case
where outermost trapped surfaces evolve smoothly, while, as we shall discuss in Chap. 18, their
evolution generically presents discontinuous jumps, even if the spacetime and all the matter
fields remain perfectly smooth. In particular, Kehle and Unger [311] have exhibited a regular
spacetime where the collapse of a charged scalar field (obeying the dominant, and hence the
weak, energy condition) turns a Schwarzschild black hole into an extremal Reissner-Nordström
644                         Evolution and thermodynamics of black holes
      black hole within a finite advanced time7 . Let us note that this result has been recently mitigated
      by Reall [427], who has shown that the collapse to a supersymmetric extremal black hole (such
      as the extremal Reissner-Nordström one, but not the extremal Kerr one) cannot occur in theories
      with an upper bound on the local charge to mass ratio.
          From an astrophysical point of view, Bardeen has shown in 1970 [36] that a Schwarzschild
      black hole of mass m0 , and hence of surface gravity κ = 1/(4m0 ) > 0 [Eq. (2.29)], can in
      principle be spun up to an extremal Kerr black hole (κ = 0) by accreting matter from an
      accretion disk terminating at the innermost stable circular orbit (ISCO, cf. Sec. 11.5.3). The total
      rest-mass of accreted matter is then 1.86 m0 and the mass of the final black hole is m ≃ 2.45 m0 .
      However, if one takes into account the electromagnetic emission of the accretion disk, the
      extremal Kerr state cannot be achieved. The reason is that a substantial part of the emitted
      photons carry a negative angular momentum ℓ and the black hole absorption cross-section for
      photons with ℓ < 0 is larger than for those with ℓ > 0. This clearly appears on the shadow
      picture of Fig. 12.20, where the part α > 0, which is due to photons with ℓ < 0 (cf. Eq. 12.88a),
      is much larger than the part α < 0 corresponding to ℓ > 0. Consequently, the black hole
      absorbs more photons with ℓ < 0 than with ℓ > 0; this diminishes the increase of the black hole
      spin a induced by accretion of matter from the disk (which has a positive angular momentum).
      Thorne has shown in 1974 [478] that the balance between the two processes leads to a final
      spin parameter a ≃ 0.998 m, quite insensitive to the details of the emission mechanism in the
      accretion disk. This is close to, but strictly lower than, the critical value a = m that would have
      yield a zero surface gravity. Hence, in this respect, we may say that astrophysical black holes
      absorbing matter from an accretion disk obey a kind of third law. But this does not appear to
      arise from some fundamental properties of black holes; it rather results from the properties of
      their environment.
          It must be stressed that in standard thermodynamics as well, the third law has not the same
      fundamental status as the other laws. In particular it can be violated by some rather simple
      systems (see Ref. [504] for a discussion and an example involving a boson (or fermion) gas
      confined to a circular ring).
          For all the above reasons, we shall no longer discuss the third law here.
Zeroth law: Under the hypotheses of Property 16.10, the surface gravity κ of the event
         7
          As shown in Sec. 5.5.6 of Poisson’s textbook [416], this cannot happen with a charged null dust (i.e. a charged
      generalization of Vaidya collapse discussed in Chap. 15) that obeys the weak energy condition.
                                 16.4 The laws of black hole dynamics                                          645
κ = const. (16.59)
  First law: Under the hypotheses of Property 16.3 (in particular, general relativity), the
        change δM in total mass between two nearby electrovacuum configurations of a
        black hole in equilibrium is related to the change δA in horizon area, the change δJ
        in total angular momentum and the change δQH in electric charge bya
                                              κ
                                    δM =        δA + ΩH δJ + ΦH δQH ,                               (16.60)
                                             8π
         where ΩH and ΦH are the horizon’s angular velocity and electric potential respec-
         tively.
  Second law: Under the hypotheses of Property 16.9, the area of cross-sections of a black
       hole event horizon cannot decrease in time:
Remark 1: The three laws are valid for any spacetime dimension n ≥ 4, and even n ≥ 2 for the second
law. If one assumes that the event horizon is a Killing horizon, then the zeroth law is valid for n ≥ 2 as
well.
Historical note : The above ensemble of laws, plus a tentative third law, has been first stated in 1973
by James Bardeen, Brandon Carter and Stephen Hawking in a seminal article entitled The Four Laws of
Black Hole Mechanics [40]. This article, which was mostly written during the famous 1972 Les Houches
Summer School [164], exhibits proofs of the zeroth and first laws for n = 4, relies on Hawking’s article
[260] for the second law, and presents the third law as a conjecture, saying that “there are strong reasons
for believing in it” (cf. Sec. 16.4.2).
    The three laws listed in Property 16.11 are named the laws of black hole dynamics and not
thermodynamics, because they still appear as a mathematical analogy without any definite
physical content. In particular, if we stick to a pure classical (i.e. non-quantum) point of view
and would like to attribute some thermodynamic temperature T to a black hole, it should be
T = 0. Indeed, if the black hole is placed in contact with a thermal reservoir of temperature
T0 > 0, energy will necessarily flow from the reservoir to the black hole and not in the reverse
way; this implies T < T0 , whatever T0 . For a stationary non-extremal black hole (κ ̸= 0),
T = 0 contradicts the tentative identification (16.58b) between T and κ. It is striking that
quantum field theory in curved spacetime restores the identification, thereby opening the path
to a genuine thermodynamics of black holes. The identification T ∝ κ arises from Hawking
radiation, which we discuss in the next section.
646                        Evolution and thermodynamics of black holes
                                                              ℏ κ
                                                     TH =          ,                                      (16.62)
                                                             kB 2π
        where ℏ = 1.054571817 10−34 J s is the reduced Planck constant and kB = 1.380649 10−23
         J K−1 is the Boltzmann constant. TH is called the Hawking temperature of the black
        hole.
          We shall not establish this property here; this is done in many textbooks, e.g. [499, 503, 202,
      87, 190], as well in some review articles, e.g. [75, 153, 293]. Rather, we shall limit ourselves to a
      few remarks:
      Remark 1: Hawking radiation is an outcome of quantum field theory within a fixed classical curved
      background (see e.g. [503]). It has not been derived from any quantum theory of gravity, i.e. the
      background spacetime (M , g) is not quantized. One says that Hawking radiation is a semiclassical
      effect.
      Remark 2: The phenomenon of Hawking radiation is essentially kinematic and does not depend on any
      field equation for the metric g. In particular it does not rely on the Einstein equation (see e.g. Ref. [496]).
      Moreover, the Hawking temperature (16.62) is independent from the spacetime dimension n (see e.g.
      Ref. [307]).
      Remark 3: Formula (16.62) is given for c = 1 units, which are used throughout the text. If one restores
      c and considers that κ has the dimension of an acceleration, the formula should read TH = kℏB 2πc
                                                                                                      κ
                                                                                                         .
      Remark 4: The Hawking temperature TH is the radiation temperature measured infinitely far from
      the black hole; it is not a “local” temperature measured in the vicinity of the event horizon H . More
      precisely, let us assume for simplicity that the black hole is non-rotating (cf. Property 5.2) and that its
      exterior is strictly static. The stationary Killing vector ξ is then null on H and timelike in the black
      hole exterior. Let us consider
                                √      static observers at a finite distance from H , i.e. observers of 4-velocity
      u = ξ/V , where V := −ξ · ξ [cf. Eqs. (3.70) and (3.61)]. The temperature of the Hawking radiation
                                          16.5 Hawking radiation                                                 647
Remark 5: The Hawking radiation is closely related to the Unruh effect, which is another prediction
of quantum field theory: in vacuum Minkowski spacetime, a uniformly accelerated observer measures a
                                                                                              √
black-body radiation of all particle species at the temperature TU = (ℏ/kB ) a/(2π), where a = a · a
is the norm of the observer’s 4-acceleration [484, 503, 87]. A static observer O close to the black
hole horizon H , as considered in Remark 4, feels an acceleration given by Eq. (3.74): a ∼ κ/V . By
combining formulas (16.63) and (16.62), we may then rewrite the radiation temperature T measured
by O as T ∼ (ℏ/kB ) a/(2π). Hence, it coincides with the Unruh temperature TU of the observer in
Minkowski spacetime sharing the same acceleration a as O. This is somehow expected if one assumes
that freely-falling observers near H perceive quantum states identical to the Minkowski vacuum (no
radiation).
Remark 6: In terms of elementary particles, the Hawking radiation is composed of all existing species
of massless and massive particles. For the latter ones, the contribution is significant only if kB TH > mp ,
where mp is the particle’s mass. For Schwarzschild black holes of mass M ≫ 2 1013 kg (the threshold
for kB TH being lower than the electron mass, see below), the Hawking radiation is composed quasi-
exclusively of neutrinos (87% of the radiated energy), photons (12%) and possibly gravitons (1%)
[394, 481].
Remark 7: The observed radiation spectrum at infinity is not exactly that of a black body of temperature
TH : it is corrected by greybody factors, which originate from the interaction of the radiation with the
spacetime curvature (backscattering).
    For a Kerr black hole, the surface gravity κ is given by formula (10.77), so that the Hawking
temperature can be expressed in terms of the black hole mass M = m and the dimensionless
spin parameter ā := a/m = J/M 2 (0 ≤ ā ≤ 1):
                                        ℏ c3          2
                               TH =                                             .                     (16.64)
                                       kB 8πGM 1 + (1 − ā2 )−1/2
                                                                         Kerr
Note that we have restored the gravitational constant G and the speed of light c (cf. Remark 3).
Numerically, we get
                                              
                                           M⊙            2
                        TH = 6.17 10  −8
                                                                  K,                     (16.65)
                                           M 1 + (1 − ā2 )−1/2
or, in units more adapted to small black holes,
                                                
                                            1 kg          2
                     kB TH = 1.057 10  10
                                                                      GeV.                            (16.66)
                                             M     1 + (1 − ā2 )−1/2
   8
    Cf. e.g. Eq. (7.2.10) in [503]; in view of Eq. (16.63), we may say that Hawking radiation obeys Tolman-
Ehrenfest law, which states that the temperature T of a medium in thermal equilibrium in a stationary gravita-
tional field obeys T V = const [444].
648                      Evolution and thermodynamics of black holes
      Hence, for a Schwarzschild black hole (ā = 0), TH = 6.17 10−8 (M⊙ /M ) K. For astrophysical
      black holes, this leads to tiny temperatures: TH ≃ 4 10−9 K for a 15 M⊙ stellar black hole
      (e.g. Cyg X-1, cf. Table 7.1), TH ≃ 1.6 10−14 K for the Milky Way central black hole Sgr A*
      (M = 4.1 106 M⊙ ) and TH ≃ 9 10−18 K for M87* (M = 6.5 109 M⊙ ). The value TH = 1 K is
      achieved for a black hole of mass 6.17 10−8 M⊙ , i.e. 2 % of the Earth’s mass, while the value
      kB TH = me c2 (me = 9.11 10−31 kg being the electron mass) is achieved for M = 2.07 1013 kg.
          Another conclusion that one can draw from formula (16.64) is that TH = 0 for an extremal
      Kerr black hole (ā = 1): such an object does not emit any Hawking radiation.
      Remark 8: Formula (16.64) shows that a Schwarzschild black hole (ā = 0) has a negative heat capacity:
      its temperature increases while its energy (M ) diminishes. This feature is shared with Newtonian
      self-gravitating systems, if one defines the temperature of such systems from the mean kinetic energy
      of their constituents.
      bc = 3 3M is the critical impact parameter for null geodesics introduced in Chap. 8 [cf.
      Eq. (8.21)]. Indeed, 4πb2c can be viewed as the black hole’s area “seen from infinity” (cf. Fig. 8.24).
      Moreover, it can be shown that the spectrum of Hawking radiation at high frequencies (i.e.
      ν ≫ M −1 , so that quantum fields can be treated within geometrical optics) is that of a black
      body of temperature TH and total area 4πb2c (see Fig. 1 of Ref. [394]). Hence A = 108πM 2 .
      Given that kB TH = ℏ/(8πM ) [Eq. (16.64) with ā = 0], Stefan-Boltzmann law yields
                                                          ℏ
                                                   L=C       ,                                       (16.67)
                                                          M2
      where C = 9/(20480π) ≃ 1.40 10−4 . Actually, a precise computation, taking into account
      the propagation of quantum fields in the Schwarzschild geometry and various particle species,
      leads to the same formula (16.67) with [481, 394, 395]
This value is valid for M ≫ 2 1013 kg (TH below the electron threshold, cf. Remark 6), so that
the Hawking radiation contains only photons, neutrinos and gravitons9 . The black hole mass
decreases according to dM/dt = −L, i.e.
                                                dM       ℏ
                                                    = −C 2 .                                                (16.69)
                                                 dt     M
Here t is the time measured by an inertial observer at infinity. This differential equation is
easily integrated into
                                                      1/3
                                M (t) = M03 − 3Cℏt         ,                           (16.70)
where M0 is the mass at t = 0. Defining the evaporation time tevap as the time to reach M = 0
from M = M0 , we deduce from Eq. (16.70) the following property:
where the value (16.68) of C has been used to get the numerical estimate.
    For an astrophysical black hole (M0 > 1M⊙ ), tevap is huge — more than 56 orders of
magnitude larger than the age of the Universe (tUniv ≃ 1.38 1010 yr)! We conclude that
Hawking radiation is ultra-negligible for astrophysical black holes and does not play any role
in their dynamics. Hawking radiation becomes relevant only when tevap < tUniv , say. From
Eq. (16.71), one obtains tevap < tUniv ⇐⇒ M0 < 4.15 1011 kg. Actually, this value is below
2 1013 kg, so that electrons, and even muons, should have been included into the Hawking
radiation (cf. Remark 6). However, the above estimate is pretty good, a precise computation
leading to [352]
                            tevap < tUniv ⇐⇒ M0 < 5.00 1011 kg.                        (16.72)
The upper bound is roughly the mass of an asteroid of half-kilometer size. Note that a black
hole of that mass has a Schwarzschild radius of only 0.74 fm (around the proton size!); it would
pertain to the hypothetical class of the so-called primordial black holes, i.e. black holes
of subsolar mass that could have been formed in the primordial universe (see e.g. [86] for a
review).
Remark 9: For a Schwarzschild black hole, the horizon area A is proportional to the square of the mass:
A = 16πM 2 [Eq. (3.4)], so that the evaporation law (16.69) implies dA/dt < 0. At first glance, this seems
   9
    The original computation, performed by Page in 1976 [394], resulted in C = 2.01 10−4 but it took into
account only the four species of neutrinos known at that time (νe , ν̄e , νµ and ν̄µ ), in addition to photons and
gravitons. It has been updated by adding the tau neutrinos (ντ and ν̄τ ) by Thorne, Zurek & Price in 1986 [481]. The
value (16.68) is also recovered by setting n1/2 = 3, n1 = 1 and n2 = 1 in Eq. (19) of Page’s review article [395].
650                        Evolution and thermodynamics of black holes
      to contradict the second law of black hole dynamics (Property 16.11), i.e. the area theorem 16.9. Actually
      there is no contradiction since one of the hypotheses of the theorem is not fulfilled for evaporating black
      holes, namely the null convergence condition. Indeed, the effective energy-momentum tensor10 of the
      quantum fields generating the Hawking radiation does not obey the null energy condition (2.95).
      Historical note : In 1971, while introducing the superradiant scattering of classical fields by a Kerr
      black hole (cf. Sec. 11.3.2 and the historical note on p. 392), Yakov Zeldovich [527] pointed out that the
      quantum analog of the field superradiance would be the spontaneous creation of particle/antiparticle
      pairs, with one particle falling into the black hole and the other one escaping to infinity. Zeldovich and
      his doctoral student Alexei Starobinsky presented a computation of this effect to Stephen Hawking
      while he was visiting Moscow in September 1973, accompanied by Kip Thorne (see Hawking’s account
      in Chap. 7 of [264] and Thorne’s one in Chap. 12 of [479]). Hawking tried to rederive the effect via a
      different approach of quantum field theory in curved spacetime and he realized that superradiance is not
      at all required: the spontaneous radiation takes place even for a non-rotating black hole, which has no
      outer ergoregion and thus cannot trigger any Penrose/superradiance process. This fantastic discovery
      was presented in short article published in early 1974 [262] and the computations were detailed in
      a subsequent one published in 1975 [263]. Both articles give formula (16.62) relating the radiation
      temperature to the surface gravity and discuss the black hole evaporation.
        10
           See e.g. Ref. [203] for the computation of the renormalized expectation value of the energy-momentum tensor
      of a quantum field near the black hole horizon.
                                 16.6 Black hole thermodynamics                                          651
Remark 1: The subscript “BH” in SBH is meant for Bekenstein-Hawking, not for black hole.
Remark 2: As for the Hawking temperature (16.62), formula (16.73) for the Bekenstein-Hawking
entropy does not depend on the spacetime dimension n. It is often summarized by SBH = A/4, which
holds in units such that kB = 1 and ℏ = 1.
                                                                                           3
Remark 3: For n = 4, if one restores the G’s and c’s, formula (16.73) reads SBH = kB 4Gℏ
                                                                                       c A
                                                                                            . One
recognizes the square of the Planck length: ℓP := Gℏ/c ≃ 1.62 10   −35  m, so that formula (16.73)
                                                     p
                                                            3
    For a Kerr black hole, the area A is related to the mass M and to the spin parameter a = āM
via Eq. (10.83), so that the Bekenstein-Hawking entropy (16.73) can be expressed as
                                                  √
                                          GM 2 1 + 1 − ā2
                             SBH   = 4πkB                               .                      (16.76)
                                           ℏc      2             Kerr
Note that, as in formula (16.64) for the Hawking temperature, we have restored G and c in the
above formula. Numerically, we get
                                                2     √
                                              M     1 + 1 − ā2
                          SBH = 1.05 10  77
                                                                 kB .                 (16.77)
                                              M⊙         2
Hence, for a Schwarzschild black hole (ā = 0), SBH = 1.05 1077 (M/M⊙ )2 kB . This yields huge
values for astrophysical black holes: SBH ≃ 2 1079 kB for a 15 M⊙ stellar black hole, SBH ≃
2 1090 kB for Sgr A* (M = 4.1 106 M⊙ ) and SBH ≃ 4 1096 kB for M87* (M = 6.5 109 M⊙ ).
These numbers are terribly large: besides black holes, the total entropy of the observable
Universe is about 1.1 1090 kB . It originates mostly from the cosmic microwave background and
the cosmic neutrino background (roughly one half each), largely dominating the entropy of the
interstellar and intergalactic medium (∼ 1082 kB ) and that of all the stars (∼ 1081 kB ) [178]. It
follows that the entropy of a single massive black hole, such as Sgr A* or M 87*, is larger than
the entropy of the whole observable Universe!
  Let us consider a “closed system” formed by a black hole and some matter and electromag-
652                       Evolution and thermodynamics of black holes
netic radiation in the black hole exterior. Define the generalized entropy Sgen by
        where Smat is the ordinary entropy of the matter and radiation and SBH is the Bekenstein-
        Hawking entropy (16.73) of the black hole. Then, any physical process makes Sgen increase,
        or at the very least, stay constant:
                                               ∆Sgen ≥ 0.                                  (16.79)
          The GSL should be considered more as a postulate ruling thermodynamics rather than a
      theorem that can be established. At most it can be proven in some specific cases. For instance,
      if one regards the process of Hawking radiation itself, one has ∆SBH < 0, since the black
      hole area diminishes during Hawking evaporation, as discussed at the end of Sec. 16.5.1. But
      one can show that the entropy of the Hawking radiation compensates for this loss, leading
      to ∆Smat ≥ |∆SBH |, so that ∆Sgen ≥ 0 is fulfilled (see Ref. [505] for details). Besides, one can
      also show that the GSL is actually obeyed in various thought experiments naively devised to
      violate it [85, 503, 505, 510].
      Historical note : In 1970-71, the discovery of the irreversible increase of the area of a Kerr black hole
      interacting with particles by Roger Penrose, Roger Floyd and Demetrios Christodoulou [409, 112], along
      with the general area theorem established by Stephen Hawking [259] (cf. the historical notes on p. 637
      and 642), suggested an analogy between the black hole area and the entropy in thermodynamics. In
      1972, Jacob Bekenstein [50, 52] was the first to go beyond this formal analogy by considering that a black
      hole is endowed with a genuine entropy S proportional to its area A, “as a measure of the inaccessibility
      of information (to an exterior observer) as to which particular internal configuration of the black hole is
      actually realized in a given case” [52]. Moreover, by means of some heuristic arguments, Bekenstein
      found that the relation between S and A should be S = ηkB A/(4πℏ), where η is a dimensionless
      coefficient of order unity [50, 52]. This is exactly formula (16.73) provided that η = π. This value of η
      was determined only after the black hole temperature was set to the Hawking temperature (16.62). This
      was done by Stephen Hawking in the 1975 article detailing the computation of Hawking radiation [263]
      (cf. historical note on p. 650). As for the generalized second law of thermodynamics, it was proposed
      by Bekenstein himself in the 1972-73 articles introducing the Bekenstein-Hawking entropy [50, 52].
      Further arguments for its validity have been provided in a subsequent article [53]. Previously, it was
      said that the (standard) second law of thermodynamics was transcended in presence of a black hole
      [40, 96], for it could not hold when some matter (and its entropy!) is thrown into the black hole.
                                                 SBH = kB ln N .                                         (16.80)
                                                 16.7 Concluding remarks                               653
Defining the microscopic states a priori requires a quantum theory of gravity, which we do not
have yet. However some partial results have been obtained, based on the two main current
approaches to quantum gravity: string theory (cf. e.g. [402] and Chap. 9 of [246]) and loop
quantum gravity (cf. e.g. [410, 442]). Regarding the former, Strominger and Vafa (1996) [465]
have first established the Bekenstein-Hawking formula (16.73) by counting the microstates
of an extremal supersymmetric black hole in dimension n = 5. By extremal, it is meant
that the event horizon is a degenerate Killing horizon (cf. Sec. 5.4.3), i.e. it has κ = 0, or
equivalently, a vanishing Hawking temperature. By supersymmetric, it is meant that these black
hole solutions have been obtained within a N = 4 supersymmetric Yang-Mills theory, which
corresponds to a low energy limit of some string theory. Further developments are discussed
in the review articles [85, 153, 285, 505, 513, 525]; they are still limited to extremal black holes
or near-extremal ones. Computations are indeed easier for such black holes because their
near-horizon geometries have additional symmetries, as we have seen for the extremal Kerr
case (cf. Sec. 13.4). For the latter, the Bekenstein-Hawking entropy of has been recovered via
some holographic duality11 [247, 137]. Let us stress that the Bekenstein-Hawking entropy of a
Schwarzschild black hole, which is non-extremal, has not been recovered by string theory yet.
    Regarding loop quantum gravity, first approaches, devised by Rovelli in 1996 [441] and by
Ashtekar, Baez, Corichi and Krasnov in 1997 [21], resulted in a statistical entropy proportional
to the black hole area, as for the Bekenstein-Hawking entropy (16.73). Subsequent studies lead
to the following formula for the entropy of a spherically symmetric macroscopic black hole of
area A (see e.g. Ref. [188] and Refs. [33, 85, 410] for reviews):
                                                                       
                                                         γ0 A  3        A
                                            S = kB            − ln          ,               (16.81)
                                                         γ 4ℏ 2         ℏ
where γ0 ≃ 0.274 and the dimensionless quantity γ is the so-called Barbero-Immirzi parameter.
Its value is not set by the theory (yet). This unknown parameter is not expected to appear at
the classical limit of loop quantum gravity, which should be general relativity. On the contrary,
in the quantum regime, γ plays a p   significant role by determining the
                                                                       √ quantum    of area a0 in
terms of the Planck length ℓP := Gh/c (cf. Remark 3) as a0 = 4 3πγℓP [442]. For large
                                             3                                 2
values of A, i.e. for A ≫ ℏ, the logarithmic term can be neglected in formula (16.81) and one
recovers the Bekenstein-Hawking entropy (16.73), provided that γ = γ0 .
    Other attempts to derive the Bekenstein-Hawking entropy from statistical mechanics
within quantum gravity are described in Sec. 7 of Carlip’s review [85]. See also Ref. [245] for
an historical review up to 2013.
Contents
         17.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 655
         17.2 Diffeomorphism-invariant theories of gravity . . . . . . . . . . . . . . 655
         17.3 Wald entropy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 666
         17.4 Covariant phase space formalism . . . . . . . . . . . . . . . . . . . . . 670
         17.5 First law for diffeomorphism-invariant theories . . . . . . . . . . . . 684
         17.6 What about the second law? . . . . . . . . . . . . . . . . . . . . . . . . 693
17.1      Introduction
This chapter is in a draft stage.
   This chapter relies heavily on exterior calculus, i.e. on calculus on differential forms. The
unfamiliar reader is urged to take a look at Sec. A.4.3 of Appendix A first.
                                                              √
                                    Z        Z           Z
                             S=         L=        Lϵ =       L −g dn x.                          (17.1)
                                    M         M          M
656                   Black hole thermodynamics beyond general relativity
      Here L is the Lagrangian n-form1 , L is the Lagrangian scalar and ϵ is the Levi-Civita
      tensor (cf. Sec. A.3.4). The second equality reflects the identity2 L = L ϵ and the third one
      provides an expression of the integral in terms of a coordinate system (xα ) on M , g being the
      determinant of g with respect to these coordinates [cf. Eq. (A.52)]. For simplicity we disregard
      any scalar or matter field, so that L is a function of the metric tensor g only. Actually all that
      follows can be easily generalized to account for extra fields, as in scalar-tensor theories for
      instance, where L is a function of g and some scalar field ϕ. We limit ourselves here to g,
      mostly in order to keep notations simple. More precisely, we assume that at a given point
      p ∈ M , L depends only on the values of g and its derivatives at p, up to a certain order. By
      derivative, it is meant derivative with respect to some affine connection on M , for instance the
      one formed by the partial derivatives with respect to given coordinates. We shall write L(g) to
      stress this dependency of L on g, according to the following convention:
        Notation: Boldface parentheses are used to denote the local (pointwise) dependency of a
        tensor field on M with respect to other ones: A(B, C) means that at a given point p ∈ M ,
        A depends only on the values of the tensor fields B and C and their derivatives at p.
          We shall consider only gravity theories whose formulation does not depend upon any
      specific coordinates on M , nor upon any extra-structure on M , like a privileged vector field.
      This amounts to demanding that the Lagrangian n-form L is covariant under the group of
      diffeomorphisms of M . More precisely, under the action of a diffeomorphism Φ : M → M ,
      the metric tensor is transformed to its pullback (Φ−1 )∗ g by Φ−1 (cf. Sec. A.2.8 for the definition
      of the pullback). The inverse of Φ is invoked here because, as g is a tensor of type (0, 2), Φ∗
      would carry it from3 Φ(M ) to M , while we are considering the transformation from M to
      Φ(M ) (cf. Remark 12 on p. 721). Let L((Φ−1 )∗ g) be the Lagrangian n-form constructed by
      keeping the same functional dependence in (Φ−1 )∗ g as L(g) has in g. The invariance of the
      theory under diffeomorphisms amounts to demanding that, as a n-form on M , L((Φ−1 )∗ g) is
      nothing but the n-form L(g) transformed under the action of Φ, i.e. the pullback of L(g) by
      Φ−1 : L((Φ−1 )∗ g) = (Φ−1 )∗ L(g). Given that the spacetimes (M , g) and (Φ(M ), (Φ−1 )∗ g) are
      physically equivalent, this means that L(g) and L((Φ−1 )∗ g) are basically the same object. In
      particular, the action S given by Eq. (17.1) takes the same value if L((Φ−1 )∗ g) is substituted
      for L(g). If one interprets the diffeomorphism action in a “passive” way, i.e. as a change of
      coordinates on M , the diffeomorphism-invariance condition simply means that the functional
      form of L expressed in Eq. (17.3) below does not depend upon the choice of coordinates. Since
      the condition L((Φ−1 )∗ g) = (Φ−1 )∗ L(g) must hold for any diffeomorphism, we may simplify
      it by introducing Φ′ := Φ−1 and renaming Φ′ to Φ:
         1
           As the integral of a n-form over a n-dimensional manifold, S is well-posed and coordinate independent.
         2
           Since the space of n-forms at any point p ∈ M is 1-dimensional, L and ϵ are necessarily proportional, so
      that there is no loss of generality in the writing L = L ϵ.
         3
           Since Φ is a diffeomorphism M → M , we have of course Φ(M ) = M ; however we keep the notation
      Φ(M ) to stress the image character by Φ.
                         17.2 Diffeomorphism-invariant theories of gravity                                     657
We have then
  Property 17.1: Lagrangian of diffeomorphism-invariant theories (Iyer & Wald 1994
  [292])
                                                                                       (17.3)
                                                                                  
                  L(g) = L gαβ , Rαβγδ , ∇λ Rαβγδ , . . . , ∇(λ1 · · · ∇λm ) Rαβγδ .
     There is no loss of generality in considering only symmetrized derivatives, since any antisymmetrized
     a
combination can be reexpressed in term of the Riemann tensor itself and lower order derivatives.
The reader is referred to the original article [292] for the proof.
Example 1 (Lagrangian of general relativity): For general relativity, L is the Einstein-Hilbert
Lagrangian:
         L = (16π)−1 R = (16π)−1 g µν Rµν = (16π)−1 g µν Rσµσν = (16π)−1 g µν g ρσ Rρµσν .            (17.4)
The last expression is clearly of the form (17.3), given that the components (g αβ ) are functions of the
components (gαβ ) only.
  Let L be a Lagrangian n-form that depends only of the metric tensor g and its derivatives,
  as in Eq. (17.3). The variation of L resulting from a generic variation δg of g is expressible
  as
                                      δL = E µν δgµν ϵ + dθ,                              (17.5)
  where E µν stands for the components of a symmetric type-(2, 0) tensor field E, called
  the Euler-Lagrange tensor, and θ is a (n − 1)-form, called the presymplectic potential
  (n − 1)-form (this name will be justified below). The n-form dθ is the exterior derivative
  of θ (cf. Sec. A.4.3). E depends on g but not on δg, while θ depends on both g and δg, in a
  linear way in terms of δg. All these dependencies are local, so if we want to stress them,
658                   Black hole thermodynamics beyond general relativity
we shall write E = E(g) and θ = θ(g, δg) according to the notation introduced on p. 656.
      Remark 1: The term dθ collects all the derivatives of δg, which have been gathered by means of the
      Leibniz rule, so that the term E µν δgµν ϵ in Eq. (17.5) does not contain any derivative of δg.
          It must be stressed immediately that the presymplectic potential (n − 1)-form θ is not
      unique. First of all, since d is nilpotent [Eq. (A.92)], θ in formula (17.5) can be substituted by
      θ ′ = θ + dY , where Y = Y (g, δg) is an arbitrary (n − 2)-form. Secondly, for a given gravity
      theory, the Lagrangian n-form itself is not unique. Indeed the n-form L′ = L + dµ, where
      µ = µ(g) is an arbitrary (n − 1)-form with compact support, leads to the same action (17.1)
      by virtue of Stokes’ theorem (A.94). The corresponding change in the presymplectic potential
      is θ → θ + δµ. From these two facts, we conclude:
        Property 17.3: ambiguities in the presymplectic potential form
        For a given gravity theory, the presymplectic potential (n − 1)-form θ is not unique: it can
        be substituted by
                                            θ ′ = θ + dY + δµ,                                (17.6)
        where (i) Y = Y (g, δg) is an arbitrary (n − 2)-form that is a local function of g and δg,
        with a linear dependence on δg and (ii) δµ is the variation triggered by δg of an arbitrary
        (n − 1)-form µ = µ(g) that is a local function of g and has compact support. The µ term
        in Eq. (17.6) arises from the allowed change
L′ = L + dµ (17.7)
      Example 2 (presymplectic potential form of general relativity): For general relativity, one has
      L = (16π)−1 Rϵ [Eq. (17.4)]. Given a coordinate system (xα ) on M , this can be rewritten as L =
                 √
      (16π)−1 R −g e, where g is the determinant of g with respect to (xα ) and e is the n-form whose
      components with respect to (xα ) are eα1 ···αn = 1 (resp. −1) if (α1 , . . . , αn ) is an even (resp. odd)
                                                                                                        √
      permutation of (0, . . . , n) and 0 otherwise [cf. Eq. (A.52)]. One has then δL = (16π)−1 δ(R −g) e,
                                                                                                          √
      since obviously (xα ), and thus e, is not altered by the variation δg. From the expression of δ(R −g)
      given by Eq. (16.18) and using Gµν δg µν = −Gµν δgµν , we get 16πδL = −Gµν δgµν ϵ + ∇µ H µ ϵ, where
      Gµν is the double metric dual of the Einstein tensor and H is the vector field defined in terms of δg by
      Eq. (16.13), where hαβ = g αµ δgµβ since hαβ := δgαβ . Now, the identity (A.93) yields ∇µ H µ ϵ = d(H·ϵ),
      so that 16πδL = −Gµν δgµν ϵ + d(H · ϵ). We conclude that the variation of the Einstein-Hilbert
      Lagrangian is indeed of the form (17.5) with
                      E αβ = −(16π)−1 Gαβ           and      θ = (16π)−1 H · ϵ = (16π)−1 ⋆H,                   (17.8)
      where the last equality stems from the definition of the Hodge dual of a 1-form [Eq. (5.38) with p = 1]. In
      index notation, θα1 ···αn−1 = (16π)−1 H µ ϵµα1 ···αn−1 . Substituting Eq. (16.13) for H µ , we get the explicit
      expression of θ(g, δg):
                            θα1 ···αn−1 = (16π)−1 g µν g ρσ (∇σ δgνρ − ∇ν δgρσ ) ϵµα1 ···αn−1 .                (17.9)
      We check that θ(g, δg) is linear in δg, as stated in Property 17.2.
                       17.2 Diffeomorphism-invariant theories of gravity                                  659
  The action S defined by (17.1) is extremal with respect to any metric variation δg that has
  compact support if, and only if,
                                          E(g) = 0.                                    (17.10)
  This constitutes the so-called Euler-Lagrange equations for the considered gravity theory,
  also called the field equations or equations of motion.
where Stokes’ theorem (A.94) has been used to let appear the integral of θ on ∂D, which has
been set to zero due to θ(g, δg)|∂D = 0. This last property results from the linearity of θ in δg,
given that δg and all its derivatives are vanishing on ∂D. From the above expression of δS it is
clear that having δS = 0 for any variation δg is equivalent to Eq. (17.10).
Example 3 (field equations of general relativity): For general relativity, E is given by Eq. (17.8).
Hence the field equations (17.10) become G = 0, which is equivalent to R = 0 — the vacuum Einstein
equation (1.44).
      Eq. (17.11) and, again, the definition (A.82) of the Lie derivative (which is applicable here since
      L is a tensor field of type (0, n)) leads to
                                                   δL = tLξ L.                                          (17.13)
      Remark 2: The variation δg is often defined as a variation rate rather than a genuine variation, namely
      as the derivative of a 1-parameter family of metric tensors ĝ(t) such that ĝ(0) = g. Using the symbol δ̂
      to distinguish from the variation considered here, the definition is δ̂g := dĝ/dt|t=0 . Since the metric
      family induced by the flow maps along ξ is ĝ(t) := Φ∗t g, one gets δ̂g = limt→0 t−1 (Φ∗t g − g). In view
      of the definition (A.82) of the Lie derivative, this yields
                                                     δ̂g = Lξ g.                                         (17.14)
      Similarly, one defines δ̂L := dL(ĝ(t))/dt|t=0 , which yields δ̂L = Lξ L. When comparing with
      Eqs. (17.12) and (17.13), we see that the derivative-based definition has the advantage of getting rid of
      the (not particularly significant) infinitesimal factor t.
         If one plugs Eqs. (17.12) and (17.13) into the generic Lagrangian variation formula (17.5),
      one gets tLξ L = tE µν Lξ gµν ϵ + d(θ(g, tLξ g)). Since θ is linear with respect to its second
      argument (cf. Property 17.2), one has d(θ(g, tLξ g)) = td(θ(g, Lξ g)), so that there comes
                                     Lξ L = E µν Lξ gµν ϵ + d(θ(g, Lξ g)).
      Now, the Cartan identity (A.95) gives Lξ L = ξ · dL + d(ξ · L) = d(ξ · L) since dL = 0, as
      for any (n + 1)-form on a n-dimensional manifold. Accordingly, we may rewrite the above
      equation as
                                d (θ(g, Lξ g) − ξ · L) = −E µν Lξ gµν ϵ.                 (17.15)
      If the field equations are fulfilled, i.e. if E = 0 [Eq. (17.10)], the right-hand side of Eq. (17.15)
      is zero, which implies that the argument of the exterior derivative in the left-hand side is a
      closed (n − 1)-form. This defines a conserved current, which is nothing but the expression
      of Noether’s theorem for the invariance of the Lagrangian by the 1-dimensional group of
      diffeomorphisms generated by the vector field ξ:
                                                    dJ (ξ) = 0 .                                      (17.17)
                         17.2 Diffeomorphism-invariant theories of gravity                                      661
  The Noether current (n − 1)-form is not unique, some ambiguity J → J ′ resulting from
  (17.6) and (17.7), with
  where Y (g, Lξ g) is any (n − 2)-form that is linear in Lξ g, µ(g) is any (n − 1)-form with
  compact support and ξ · µ is the (n − 2)-form µ(ξ, . . .).
Proof. That J (ξ) is a linear function of ξ follows immediately from the linearity of θ(., .) with
respect to its second argument. Similarly the locality of J (ξ) with respect of ξ is inherited from
that of θ(., .). If the theory diffeomorphism-invariant, Eq. (17.15) holds. Given the definition
of J (ξ), Eq. (17.15) is equivalent to dJ (ξ) = −E µν Lξ gµν ϵ, so that dJ (ξ) = 0 if E = 0 is
fulfilled or ξ is a Killing vector (Lξ g = 0). Finally, both L′ (g) and L(g) in Eq. (17.7) must be
covariant under diffeomorphisms. It follows that µ(g) must be covariant as well. Then the
variation of µ resulting from a diffeomorphism Φt generated by ξ is δµ = tLξ µ, by the very
same argument that led to formula (17.13) for δL. Accordingly, Eq. (17.6) yields θ ′ (g, Lξ g) =
θ(g, Lξ g) + dY (g, Lξ g) + Lξ µ(g). By combining this identity with L′ = L + dµ [Eq. (17.7)],
the definition (17.16) applied to J ′ leads to
                       J ′ (ξ) = J (ξ) + dY (g, Lξ g) + Lξ µ(g) − ξ · dµ(g).
The Cartan identity (A.95) gives Lξ µ(g) − ξ · dµ(g) = d (ξ · µ(g)) and we obtain (17.18).
Remark 3: From the definition (17.16), it is clear that J (ξ) is a local function of g (and its derivatives),
in addition to ξ. So, following the notation introduced in Sec. 17.2.1, we could have written J (g, ξ).
However for the sake of brevety, we use J (ξ), leaving the dependency on g implicit.
Example 4 (Noether current form of general relativity): From expression (17.9) for θ in general
relativity with δgµν = tLξ gµν = t(∇µ ξν + ∇ν ξµ )[Eq. (A.87)], we get
            16πθ(g, Lξ g)α1 ···αn−1   = g µν (∇σ ∇ν ξ σ + ∇σ ∇σ ξν − 2∇ν ∇ρ ξ ρ ) ϵµα1 ···αn−1
                                      = g µν (2Rνσ ξ σ − ∇σ ∇ν ξ σ + ∇σ ∇σ ξν ) ϵµα1 ···αn−1
                                      = (2Rµν ξ ν + ∇ν ∇ν ξ µ − ∇ν ∇µ ξ ν ) ϵµα1 ···αn−1 ,
where the contracted Ricci identity (A.107) has been used to get the second line. Plugging this formula,
as well as L = Rϵ (cf. Example 1) into the definition (17.16) of J (ξ) yields
                 J(ξ)α1 ···αn−1 = (16π)−1 (2Gµν ξ ν + ∇ν ∇ν ξ µ − ∇ν ∇µ ξ ν ) ϵµα1 ···αn−1 .
If the field equations are obeyed, then G = 0 (cf. Example 3) and the above expression reduces to
                      J(ξ)α1 ···αn−1 = (16π)−1 (∇ν ∇ν ξ µ − ∇ν ∇µ ξ ν ) ϵµα1 ···αn−1 .               (17.19)
We recognize in the term within parentheses the Komar current of ξ as defined by Eq. (5.50): J (ξ)µ :=
∇ν ∇ν ξ µ − ∇ν ∇µ ξ ν . Accordingly, we may write
                              J (ξ) = (16π)−1 J (ξ) · ϵ = (16π)−1 ⋆ J (ξ),                           (17.20)
where the last equality stems from Eq. (5.38) with p = 1. Hence, for general relativity, the Noether
current (n − 1)-form of ξ is nothing but the Hodge dual of the (metric dual of the) Komar current of
ξ, up to the factor (16π)−1 . That J (ξ) is closed in this case follows readily from the identity (A.93):
16π dJ (ξ) = d(J (ξ) · ϵ) = (∇ · J (ξ))ϵ, along with the conservation law (5.51): ∇ · J (ξ) = 0.
662                  Black hole thermodynamics beyond general relativity
        Q(ξ) is called the Noether potential (n − 2)-form of ξ. It is not unique, being subject to
        the ambiguity Q → Q′ , with
        where Y (g, Lξ g) is any (n − 2)-form that is linear in Lξ g, µ(g) is any (n − 1)-form with
        compact support and Z(g, ξ) is any (n − 3)-form that is linear in ξ.
      Proof. Since J (ξ) is a closed differential form [Eq. (17.17)], the Poincaré lemma states that it is
      locally exact (on a contractible open set, cf. Sec. A.4.3). However, because dJ (ξ) = 0 holds
      for any vector field ξ, it can be shown that J (ξ) is globally exact, i.e. takes the form (17.21)
      with Q(ξ) being a local function of ξ [500]. The linearity of Q(ξ) in terms of ξ arises from the
      linearity of J (ξ) (cf. Property 17.5). Finally, the Y and µ ambiguities in (17.22) are inherited
      from the ambiguities of J [Eq. (17.18)], while the Z ambiguity stems from Eq. (17.21) defining
      Q up to the addition of a closed (n − 2)-form (cf. Lemma 1 in Ref. [500]).
      Remark 4: As for J (ξ) (cf. Remark 3 on p. 661), Q(ξ) is a local function of g, in addition to ξ, although
      the dependency on g is not explicited in the notation Q(ξ).
      Remark 5: The (n − 2)-form Q(ξ) has been introduced by Wald (1993) [502] under the name Noether
      charge (n − 2)-form. However, as argued by other authors [294, 255], the name Noether potential
      (n − 2)-form is more relevant, since Q(ξ) is not conserved: a priori dQ(ξ) ̸= 0 and Eq. (17.21) shows
      clearly that Q(ξ) plays the role of a potential of a conserved current. Another name given to Q(ξ) in
      the literature is the Noether-Wald charge density [246, 250].
      Example 5 (Noether potential form of general relativity): For general relativity, J (ξ) is given by
      Eq. (17.19), or Eq. (17.20) in terms of the Komar current of ξ. Let us show that Q(ξ) is then nothing
      but the Hodge dual of the exterior derivative of the 1-form ξ (the metric dual to ξ), up to the factor
      −(16π)−1 :
                                              Q(ξ) = −(16π)−1 ⋆ (dξ).                                (17.23)
      Given the expression of the Hodge dual of a 2-form [Eq. (5.38) with p = 2], the identity (dξ)αβ =
      ∇α ξβ − ∇β ξα [Eq. (A.91b)] and the antisymmetry of ϵ, a formula equivalent to (17.23) is
      To prove that Q(ξ) as given by (17.23) obeys dQ(ξ) = J (ξ) [Eq. (17.21)], let us establish the Hodge
      dual of this relation: ⋆dQ(ξ) = ⋆J (ξ), which is an equality between 1-forms. From Eq. (17.23), we have
      ⋆dQ(ξ) = −(16π)−1 ⋆ d ⋆ (dξ). Now, the operator ⋆d⋆ is, up to a sign, the codifferential δ, which
                          17.2 Diffeomorphism-invariant theories of gravity                                            663
maps a p-form to a (p − 1)-form and is defined by4 δ := (−1)n(p+1)+1 ⋆ d⋆. Since dξ is a 2-form, we
get ⋆dQ(ξ) = (−1)n (16π)−1 δ(dξ). From the expression of the codifferential in a coordinate chart
(xα ) (cf. e.g. Eq. (14.37) in Ref. [464]), we may write, for any 2-form A:
                                        1    ∂ √
                               (δA)α = √         −gAµα = −∇µ Aαµ ,
                                                      
                                         −g ∂xµ
where the last equality follows from the identity (A.75). Applying the above formula to A := dξ,
we get δ(dξ)α = −∇µ (∇α ξ µ − ∇µ ξ α ) = J (ξ)α , i.e. δ(dξ) = J (ξ), where J (ξ) is the Komar
current of ξ defined by Eq. (5.50). Hence ⋆dQ(ξ) = (−1)n (16π)−1 J (ξ). On the other hand, we have
⋆J (ξ) = (16π)−1 ⋆ ⋆J (ξ) from Eq. (17.20). Since ⋆⋆ = (−1)n Id for a 1-form (cf. e.g. formula (14.26) in
Ref. [464]), we immediately get ⋆J (ξ) = ⋆dQ(ξ), which is equivalent to Eq. (17.21).
  where J (ξ) is the Noether current (n − 1)-form of ξ (Property 17.5) and the integral is
  taken with respect to Σ endowed with the future orientation, i.e. the orientation defined
  by n · ϵ, where n is the future-directed timelike unit normal to Σ. The Noether charge is
  a priori not unique, since any change J → J ′ allowed by (17.18) leads to QΣ,ξ → Q ′ Σ,ξ
  with                                   Z
                         Q Σ,ξ = QΣ,ξ +
                           ′
                                              [Y (g, Lξ g) + ξ · µ(g)] .             (17.26)
                                                    ∂Σ
  However, if ξ is a Killing vector and (i) ξ is tangent to ∂Σ or (ii) ∂Σ is located outside the
  support of µ(g), the Noether charge is uniquely defined: Q ′ Σ,ξ = QΣ,ξ .
     Let Σ1 and Σ2 be two spacelike hypersurfaces, such that Σ2 lies in the future of Σ1
  and both hypersurfaces are connected by a hypersurface W , i.e. Σ1 ∪ Σ2 ∪ W is a closed
  hypersurface consituting the boundary of a n-dimensional manifold D ⊂ M (cf. Fig. ??).
  Then the following conservation law holds:
                                                               Z
                                        QΣ2 ,ξ = QΣ1 ,ξ +               J (ξ) ,                          (17.27)
                                                                W↙
  where the integral over W is taken with respect to the inward orientation (indicated by the
  arrow), i.e. the orientation defined by v · ϵ, where v is any vector field pointing into D.
Proof. Equation (17.26) readily follows from (17.18) and Stokes’ theorem (A.94). If ξ is a Killing
vector, then Lξ g = 0 by definition (cf. Property 3.6); this implies Y (g, Lξ g) = 0 in the
   4
     See e.g. Eq. (14.35) of Straumann’s book [464], whose sign convention is followed here. Since the same letter
δ is used to denote the codifferential and the variation operator, we are using a boldface symbol to distinguish the
former.
664                 Black hole thermodynamics beyond general relativity
      right-hand side of Eq. (17.26), given that Y is linear with respect to Lξ g. Moreover, if ξ is
      tangent to ∂Σ (case (i)), the (n − 2)-form ξ · µ = µ(ξ, . . .) restricted to vectors tangent to ∂Σ
      is necessarily zero, for µ is a (n − 1)-form. Obviously, this holds as well in the case (ii). Hence
      Q ′ Σ,ξ = QΣ,ξ . As for Eq. (17.27), given that ∂D = Σ1 ∪ Σ2 ∪ W , Stokes’ theorem (A.94) and
      dJ (ξ) = 0 [Eq. (17.17)] lead to
                                  Z               Z              Z
                                        J (ξ) +        J (ξ) +        J (ξ) = 0,
                                   Σ↗
                                    1             Σ↗
                                                   2             W↗
      where the arrows mean that Σ1 , Σ2 Rand W are oriented according to the outwardRconvention
      (with respect to D). Noticing that Σ↗ J (ξ) = −QΣ1 ,ξ , Σ↗ J (ξ) = QΣ2 ,ξ and W ↗ J (ξ) =
                                                              R
                                            1                   2
      − W ↙ J (ξ) yields (17.27).
        R
          The integral over W in Eq. (17.27) can be interpreted as an “inward flux”, especially if W is
      a timelike hypersurface (cf. the electromagnetic analogy detailed in Remark 7 below). If this
      inward flux vanishes, Eq. (17.27) expresses the invariance of the Noether charge with respect to
      the choice of the hypersurface (conservation during “time evolution”). This occurs if J (ξ) = 0
      on W or if W is at some asymptotically flat end of M with J (ξ) decaying sufficiently fast so
      that its integral on W is zero.
        where Q(ξ) is the Noether potential (n − 2)-form of ξ (Property 17.6) and the orientation
        of S for the above integral is the future-outward orientation defined by Eq. (5.39). The
        Noether charge is a priori not unique, since any change Q → Q′ obeying (17.22) leads to
        QS ,ξ → Q ′ S ,ξ with
                                                Z
                              Q S ,ξ = QS ,ξ +
                                ′
                                                    [Y (g, Lξ g) + ξ · µ(g)] .             (17.29)
                                                      S
       However, if ξ is a Killing vector and (i) ξ is tangent to S or (ii) S is located outside the
       support of µ(g), the Noether charge is uniquely defined: Q ′ S ,ξ = QS ,ξ .
           Let Σ be a compact spacelike hypersurface bounded by two closed (n − 2)-surfaces: an
       “internal” one, Sint , and an “external” one, Sext , the exterior direction being that of the
       asymptotic flat end of (M , g). Then
Proof. Equation (17.29) readily follows from (17.22) along with S dZ = 0 as a result of Stokes’
                                                                   R
where the arrows indicate that Sint and  R Sext are outward-oriented
                                                                   R with respect to Σ (which
is itself future-oriented). Noticing that S ↗ Q(ξ) = −QSint ,ξ and S ↗ Q(ξ) = QSext ,ξ yields
                                           int                       ext
(17.30).
Remark 6: We are using the same letter Q to denote the Noether charge of a hypersurface (Property 17.7)
and the Noether charge of a (n − 2)-surface (Property 17.8). The distinction between the two quantities
occurs only through the subscript: QΣ,ξ versus QS ,ξ . This common notation is justified in so far as
both quantities are strongly related by Eq. (17.30). In particular, if the hypersurface Σ has a single
(external) boundary S , i.e. Sint = ∅, Eq. (17.30) reduces to QS ,ξ = QΣ,ξ .
Remark 7: The name charge in the expression Noether charge comes of course from the analogy with
electromagnetism: considering n = 4 for simplicity, the analog of the Noether current 3-form J is
the Hodge dual of the 1-form j, metric dual of the electric 4-current j: J = ⋆j, while the analog of
the Noether potential 2-form Q is the Hodge dual of the electromagnetic field tensor F divided by
µ0 : Q = µ−10 ⋆F . The relation dQ = J [Eq. (17.21)] is then nothing but Maxwell equation (1.50):
d ⋆F = µ0 ⋆j. Then QΣ,ξ is the total electric charge within the volume Σ (cf. e.g. Eq. (18.2) in Ref. [228])
and QS ,ξ is µ−10 times the flux of the electromagnetic field through the surface S . Accordingly,
Eq. (17.27) expresses the conservation of electric charge and Eq. (17.30) with Sint = ∅ is nothing but
Gauss’s law (cf. e.g. Eq. (18.40) in Ref. [228]).
     From Example 5, we note that the (n − 2)-form ⋆(dξ) giving Q(ξ) for general relativity
[cf. Eq. (17.23)] is exactly the differential form that constitutes the integrand in formula (5.36)
defining the Komar mass or in formula (5.63) defining the Komar angular momentum (ξ := η
in that case). In view of the definition (17.28) of the Noether charge, we conclude that
           a
            Since ξ is a Killing vector, the Noether charge is uniquely defined in the present case, provided S is
        chosen outside the support of µ if one allows for a Lagrangian change of the type (17.7), cf. Property 17.8.
      Proof. Let S ′ be another complete cross-section of H lying entirely in the future of S . Then
      S ∪ S ′ constitutes the boundary of a portion of H , which we shall denote by ∆H . Stokes’
      theorem (A.94) and the identity dQ(χ) = J (χ) [Eq. (17.21)] give
                                 Z             Z            Z
                                      Q(χ) +         Q(χ) =       J (χ),                      (17.34)
                                       S↗               S ′↗             ∆H
      where the arrows indicate that S and S ′ are outward-oriented with respect to ∆H , the
      orientation of the latter being set by a future-directed null vector field k transverse to H .
      From the definition (17.16), J (χ) := θ(g, Lχ g) − χ · L = −χ · L, since Lχ g = 0 (χ
      Killing) along with θ being linear in Lχ g imply θ(g, Lχ g) = 0. Now, when restricted to
      ∆H (i.e. applied to (n − 1)-tuples of vectors tangent to ∆H , or equivalently, considered as
      the pullback ι∗ (χ · L) of χ · L by the inclusion map ι : ∆H → M , cf. Sec. A.2.8), χ · L is
      a vanishing (n − 1)-form, since χ · L := L(χ, . . .), χ is tangent to ∆H and L is a n-form.
      Accordingly, the integral of J (χ)R = −χ · L in the right-hand side of Eq. (17.34) is zero. Given
      that S ↗ Q(χ) = −QS ,χ and S ′ ↗ Q(χ) = QS ′ ,χ , it follows that QS ′ ,χ = QS ,χ , which
           R
                                         17.3 Wald entropy                                            667
shows that SW does not depend on the choice of S . Regarding the ambiguity (17.29), the
term Y (g, Lχ g) does not contribute since Y (g, Lχ g) is linear in Lχ g (Propery 17.3), so that
Lχ g = 0 (χ Killing) implies Y (g, Lχ g) = 0. The term χ · µ(g) does not contribute either
because, taking advantage of the independence of QS ,χ from S , we are free to choose S as
the bifurcation surface, where χ = 0 (Property 3.12).
    If the diffeomorphism-invariant theory is general relativity, the Wald entropy reduces to
the Bekenstein-Hawking entropy defined in Sec. 16.6.1:
  For general relativity, the Wald entropy of a stationary black hole coincides with its
  Bekenstein-Hawking entropy, as given by formula (16.73):
                                             A
                           SW = SBH = kB          (general relativity),                  (17.35)
                                             4ℏ
  A being the black hole area.
Proof. For general relativity, the Noether potential (n − 2)-form is given by Eq. (17.23): Q(χ) =
−(16π)−1 ⋆ (dχ). We have then
                             Z                Z                       Z
                                            1                 µν
            16πQS ,χ = −         ⋆(dχ) = −        (dχ)µν dS = −          ∇µ χν dS µν ,
                               S            2 S                        S
where the second equality follows from Lemma 5.10 and the third one from expression (A.91b)
of the exterior derivative and the antisymmetric character of dS µν . Now, from Eq. (5.87),
                                     √
∇µ χν dS µν = −2κ dS, where dS := q dn−2 x is the area element of S . Since κ is constant,
there comes QS ,χ = κA/(8π). In view of the definition (17.33) of SW , we get (17.35).
Property 17.12: geometric expression of the Wald entropy (Iyer & Wald 1994 [292])
where
            • δL/δRαβγδ stands for the type-(4, 0) tensor field that is the Euler-Lagrange deriva-
              tive of L with respect to Rαβγδ , if the latter were considered as a field independent
              of gαβ in expression (17.3) of L:
                         δL       ∂L           ∂L                     ∂L
                              :=        − ∇µ           + ∇(µ ∇ν)                − ···              (17.37)
                       δRαβγδ    ∂Rαβγδ      ∂∇µ Rαβγδ           ∂∇(µ ∇ν) Rαβγδ
            • dSαβ is the metric dual of the area element normal bivector dS αβ to Sˆ, as defined
              by Eq. (5.42);
               ˆ                                                               ˆ
            • Sϵ⊥ is the binormal 2-form to Sˆ, i.e. for each point p ∈ Sˆ, Sϵ⊥ |p is the volume
              2-form of the metric induced by g in the vector space Tp⊥ Sˆ normal to Sˆ equipped
                                                      ˆ
              with the future-outward orientationa ; Sϵ⊥ can be expressed as the exterior product
              s ∧ n of any pair of vector fields (n, s) along Sˆ forming an orthonormal basis of
              Tp⊥ Sˆ at each p ∈ Sˆ, with n future-directed unit timelike and s outward-directed
              unit spacelike:
                                          Sˆ ⊥                        Sˆ ⊥
                                           ϵ =s∧n          ⇐⇒          ϵαβ   = s α nβ − nα s β .   (17.38)
        Note that Eqs. (5.42) and (17.38) yield the following relation between the area element
                                                        ˆ
        normal bivector dS and the binormal 2-form Sϵ⊥ :
                                                                  ˆ
                                                      dSαβ = Sϵ⊥
                                                               αβ dS,                              (17.39)
                           √
        where dS =             q dn−2 x is the area element of Sˆ.
           a
               See item (iv) on p. 136 for details.
      Partial proof. Q(χ) has a linear dependence on χ (cf. Property 17.6). It may contain high order
      covariant derivatives of χ but since the latter is a Killing vector, one may use iteratively Kostant
      formula (3.83) to express all derivatives of order higher than one in terms of χ and ∇χ, so
      that Q(χ) is expressible as
      where W = W (g) and X = X(g) are tensor fields of type (0, n−1) and (2, n−2) respectively,
      which are antisymmetric with respect to their last n − 2 arguments. For instance, in the case of
      general relativity, we read on (17.24) that W = 0 and X µνα1 ···αn−2 = −(16π)−1 ϵµνα1 ···αn−2 . Let
      us evaluate the Wald entropy SW by selecting the bifurcation surface Sˆ for S in formula (17.33).
      Given that χ = 0 on Sˆ (Property 3.12), the term in W does not contribute to SW . Furthermore,
      the following identity holds on Sˆ:
                                                             Sˆ       ˆ
                                                      ∇α χβ = κ Sϵ⊥
                                                                  αβ .                               (17.40)
                                           17.3 Wald entropy                                             669
Indeed, since χ = 0 on Sˆ, we have, for any vector v tangent to Sˆ, v µ ∇µ χα = 0. This
shows that, at each point p ∈ Sˆ, the 2-form ∇α χβ = ∇[α χβ] leaves in the vector space Tp⊥ Sˆ
                             ˆ
normal to Sˆ, as the 2-form Sϵ⊥            ⊥ ˆ
                               αβ . Since Tp S is 2-dimensional, these 2-forms are necessarily
                      Sˆ    ˆ
proportional: ∇α χβ = λ Sϵ⊥αβ for some scalar field λ on S . By combining Eqs. (5.86) and (17.39),
        Sˆ ⊥
we get ϵαβ = χα kβ − kα χβ , where k is the future-directed null vector of Tp⊥ Sˆ such that
k · χ = −1. Then, Eq. (3.29) yields κχα = χµ ∇µ χα = λ(χµ χµ kα − χµ kµ χα ) = λ(0 + χα ), from
                                                                               Sˆ    ˆ
which λ = κ, thereby establishing (17.40). We have then Q(χ)α1 ···αn−2 = κ Sϵ⊥
                                                                             ρσ X α1 ···αn−2 .
                                                                                 ρσ
                                ˆ       ˆ
where A is the area of Sˆ and (Sϵ⊥ )µν Sϵ⊥µν = −2 follows from Eq. (17.38) along with nµ n = −1,
                                                                                          µ
    Of course, being defined by a formula that reduces to the Bekenstein-Hawking entropy for
general relativity (Property 17.11) is not sufficient by itself to assert that the Wald entropy is the
correct concept of black hole entropy for all theories of gravity. This holds because the Wald
entropy naturally enters in the generalization of the first law to all diffeomorphism-invariant
theories that admit a Hamiltonian formulation, which we will discuss in Sec. 17.5, after having
introduced the covariant phase space formalism.
670                   Black hole thermodynamics beyond general relativity
                           dq i                                dpi
                                = {q i , H}       and              = {pi , H},    1 ≤ i ≤ N,                  (17.43)
                           dt                                  dt
      where {., .} stands for the Poisson bracket:
                                                  N                     
                                                 X    ∂f ∂g      ∂g ∂f
                                       {f, g} :=         i ∂p
                                                                − i        .                                  (17.44)
                                                 i=1
                                                      ∂q      i  ∂q ∂p i
                                                     df
                                                        = {f, H}.                                             (17.45)
                                                     dt
                                                         N
                                                         X
                                                  Ω :=         dpi ∧ dq i ,                                   (17.46)
                                                         i=1
      where d stands for the exterior derivative5 on F, so that dpi (resp. dq i ) is the differential
      1-form of pi (resp. q i ) considered as a scalar field on F. Given that q i = y i and pi = y i+N , the
         5
           On phase space manifolds, we are not using boldface notations for operators and tensor fields, keeping them
      for the spacetime manifold only. Hence the exterior derivative on F is denoted by d, and not d, as defined in
      Sec. A.4.3, the 2-form (17.46) is denoted Ω and not Ω, etc.
                               17.4 Covariant phase space formalism                                       671
where a and b are respectively the row and column indices and IN stands for the identity
matrix of size N . Any coordinate system on F in which the 2-form Ω is expressible as (17.46),
or equivalenty, in which the components of Ω are the matrix (17.47), is called a system of
canonical coordinates. The matrix (Ωab ) is clearly invertible, its inverse being its opposite:
                                                       
                                                0    IN
                                  (Ω−1 )ab =           .                               (17.48)
                                               −IN 0
This implies that the 2-form Ω is non-degenerate and allows one to define an antisymmetric
type-(2, 0) tensor field on F, called the Poisson bivector, by
As a type-(2, 0) tensor, Ω−1 acts bilinearly on 1-forms (cf. Sec. A.2.5) and one may define the
Poisson bracket of two scalar fields f and g on F by letting Ω−1 act on the pair of 1-forms
(df, dg):
                                     {f, g} := Ω−1 (df, dg) .                            (17.50)
Given the components (17.48) of Ω−1 in canonical coordinates, it is easy to check that this
definition reduces to (17.44) for these type of coordinates.
    Because dd = 0 (nilpotence of the exterior derivative, cf. Eq. (A.92)), it follows immediately
from Eq. (17.46) that the 2-form Ω is closed: dΩ = 0. This is also obvious from the coordinate
expression (A.90c) of the exterior derivative, given that the components Ωab are constant in
canonical coordinates [Eq. (17.47)]. Generically, a closed non-degenerate 2-form Ω on a smooth
manifold F is called a symplectic form and the pair (F, Ω) is called a symplectic manifold
(see e.g. Chap. 22 of Lee’s textbook [343]). Not only Ω is closed, but it is exact. Indeed, it follows
from expression (17.46) that
                                             Ω = dΘ,                                            (17.51)
where Θ is the 1-form defined by
                                                 N
                                                 X
                                          Θ :=         pi dq i .                               (17.52)
                                                 i=1
Because of (17.51), the 1-form Θ is called the symplectic potential. It is also called the
canonical 1-form on F, because its expression (17.52) is independent of the choice of the
canonical coordinates (q i , pi ).
   To any scalar field f on F, one associates a unique vector field Xf on F via the Poisson
bivector Ω−1 :
                                     Xf := Ω−1 (., df ) .                            (17.53)
672                  Black hole thermodynamics beyond general relativity
      It follows that the components of Xf with respect to any coordinate system (y a ) are
                                                            ab ∂f
                                               Xfa = Ω−1             .                           (17.55)
                                                                ∂y b
Xf = {., f } . (17.57)
      The above identity means that for any scalar field g, Xf (g) = {g, f }, with Xf (g) = Xfa ∂a g.
         Formula (17.53) can be inverted to
df = Ω(., Xf ) . (17.58)
      Proof. Ω(., Xf ) is a 1-form whose components are Ωab Xfb . Thanks to Eq. (17.55), we get
      Ωab Xfb = Ωab (Ω−1 )bc ∂c f = δ ca ∂c f = ∂a f , which gives (17.58).
      Remark 1: The vector field Xf and the 1-form df can be considered as being dual to each other
      through the symplectic form Ω in the very same way as vector fields and 1-forms are dual through
      the metric tensor g on pseudo-Riemannian manifolds: compare formula (17.58) with (A.44) (rewritten
      as u = g(., u)) and formula (17.55) with (A.47). These dualities occur because both Ω and g are
      non-degenerate bilinear forms.
          The Poisson bracket (17.50) can be expressed in terms of the symplectic form Ω acting of
      the pair of vector fields (Xf , Xg ):
      Proof. Starting from Eq. (17.50) and using Eq. (17.58), we have {f, g} = (Ω−1 )ab ∂a f ∂b g =
      (Ω−1 )ab Ωac Xfc Ωbd Xgd = δ ad Ωac Xfc Xgd = Ωdc Xfc Xgd = Ω(Xg , Xf ) = −Ω(Xf , Xg ).
      Remark 2: Notice the minus sign in Eq. (17.59), as compared to Eq. (17.50); of course, since Ω is
      antisymmetric, Eq. (17.59) can be recast as {f, g} = Ω(Xg , Xf ).
         By combining Eqs. (17.58) and (17.59) there comes ⟨df, Xg ⟩ = Ω(Xg , Xf ) = {f, g}. Hence,
      we get various equivalent expressions for the Poisson bracket:
                  2.0
                  1.5
                  1.0
                              b                 a
                  0.5
                  0.0
           pθ
                  0.5
                  1.0
                  1.5
                  2.0
                          3           2           1          0           1           2           3
                                                             θ
Figure 17.1: Phase space F of a simple gravity pendulum, depicted in terms of the canonical coordinates (θ, pθ )
(cf. Example 7). The vertical lines θ = −π and θ = π must be identified, since F = S1 × R. The grey arrows show
the Hamiltonian vector field XH and the grey lines are some of its integral curves. Two of them are highlighted:
the red one corresponds to oscillating solutions (θ having a limited range and pθ taking positive and negative
values), while the blue one corresponds to solutions rotating indefinitely in the anticlockwise direction (pθ > 0).
The points a and b each represent the initial data of a given solution in these families; these points can also be
viewed as defining the solution itself (covariant phase space approach).
    The Hamiltonian vector field is the vector field on F associated to the Hamiltonian
differential dH by symplectic duality, i.e. the vector field XH . It obeys Eq. (17.58):
dH = Ω(., XH ) . (17.61)
Remark 3: In the mathematical literature (e.g. [343]), the vector field Xf associated to any scalar
function f on F is called the Hamiltonian vector field of f . This reflects the fact that any scalar function
f can the considered as the Hamiltonian of some theory, even if the latter is unphysical.
Example 7 (phase space of a pendulum): A simple gravity pendulum is a system with N = 1, the
unique degree of freedom being the angle θ ∈ (−π, π] with respect to the vertical direction. For a unit
mass, the Lagrangian is L(θ, θ̇) = (ℓθ̇)2 /2 + gℓ cos θ, where ℓ is the length of the (massless) rod and
g the magnitude of the vertical gravity field. The momentum conjugate to θ is pθ = ∂L/∂ θ̇ = ℓ2 θ̇.
Since the latter can take any real value, the phase space F is the 2-dimensional infinite cylinder S1 × R,
spanned by the canonical coordinates (θ, pθ ). The Hamiltonian is deduced from L by the Legendre
transform H = pθ θ̇ − L, yielding H(θ, pθ ) = p2θ /(2ℓ2 ) − gℓ cos θ. The Hamiltonian vector field is
obtained via Eq. (17.56): XH = ℓ−2 pθ ∂θ − gℓ sin θ ∂pθ . It is depicted in Fig. 17.1.
674                 Black hole thermodynamics beyond general relativity
          Equation (17.57) with f = H reads {., H} = XH . Hence, for any scalar field f on F,
      {f, H} = XH (f ) and the evolution law (17.45) can be rewritten as
                                                 df
                                                    = XH (f ) .                                    (17.62)
                                                 dt
      In particular, Hamilton’s equations (17.43) become
                         dq i                             dpi
                              = XH (q i )     and             = XH (pi ),   1 ≤ i ≤ N.             (17.63)
                         dt                               dt
      It follows that any solution (q i (t), pi (t)) to the equations of motion is an integral curve of the
      Hamiltonian vector field in the phase space (cf. Fig. 7). H is constant along such a curve, since,
      from Eq. (17.61), XH (H) = ⟨dH, XH ⟩ = Ω(XH , XH ) = 0.
           In view of Eq. (17.62), one may consider the vector field XH as defining the “time evolution”
      without having to refer explicitely to the time coordinate t. Of course, we recover from
      Eq. (17.62) that dH/dt = 0 since XH (H) = 0. For tensor fields on F of valence higher than a
      scalar field, one defines the “time evolution” as the Lie derivative LXH along the Hamiltonian
      vector field XH . This is quite natural since the vanishing of LXH T means that the tensor field
      T is transported identically to itself along the flow defined by XH (cf. Sec. A.4.2). In particular,
      this encompasses Eq. (17.62), since LXH f = XH (f ) for any scalar field f . An important feature
      of the evolution thus defined is that the symplectic form is conserved:
                                                  LXH Ω = 0 .                                      (17.64)
      Proof. The proof relies on the fact that Ω is closed, i.e. dΩ = 0. Indeed, using successively the
      Cartan identity (A.95), Eq. (17.61) and the nilpotence property (A.92), we get
                     LXH Ω = XH · |{z}
                                   dΩ +d(XH · Ω) = −d(Ω(., XH )) = −ddH = 0.
                                         | {z }       | {z }
                                       0       Ω(XH ,.)              dH
dH = Ω(., XH ) . (17.65)
        By construction, XH preserves the symplectic form: LXH Ω = 0 [Eq. (17.64)]. The evolution
        of the underlying physical system is entirely determined by the Hamiltonian vector field:
        any solution to the equations of motion is an integral curve of XH . A smooth function
                                      17.4 Covariant phase space formalism                                     675
Example 8 (solutions for the gravity pendulum): For the gravity pendulum considered in Example 7,
the integral curves of XH are depicted in Fig. 17.1. Two of them are represented by the red and the blue
curves. It is clear from the figure that XH vanishes at two points: (θ, pθ ) = (0, 0) and (θ, pθ ) = (π, 0),
which are thus two equilibrium points (respectively stable and unstable).
Remark 4: The above formulation of Hamiltonian systems does not refer to any system of canonical
coordinates (q i , pi ) on the phase space F and, more generally, to any kind of coordinates. It does not
even assume that F is the cotangent bundle T ∗ C of some “configuration space” C, while the cotangent
bundle structure is the standard arena for the Hamiltonian mechanics with a finite number of degrees of
freedom [1].
  Let (F, Ω, H) be a Hamiltonian system whose equations of motion constitute a well posed
  initial value problem. Each solution to the equations of motion is a curve S : R → F,
  t 7→ S(t), which admits the Hamiltonian vector field XH as tangent vector. The set Fsol of
  all solutions can be given the structure of a smooth manifold diffeomorphic to F, and for
  each t0 ∈ R, an explicit diffeomorphism is the “initial data” map
                                             Φt0 : Fsol −→ F
                                                                                                 (17.66)
                                                       S     7−→ S(t0 ).
   6
       At an equilibrium point, the integral curve is reduced to the point itself.
676                 Black hole thermodynamics beyond general relativity
      Proof. The equivalence between Eqs. (17.67) and (17.68) follows readily from the definition
      of the pullback map given in Sec. A.2.8. The independence of Ωsol (δ1 S, δ2 S) from the choice
      of t0 follows from the constancy of Ω along the Hamiltonian flow, i.e. Eq. (17.64). Indeed
      Ωsol (δ1 S, δ2 S), as defined by Eq. (17.68), is independent of t0 iff df /dt = 0, where f (t) :=
                                                                                  −−−−−−→
      Ω(Y1 (t), Y2 (t)) and we have introduced the separation vectors Y1 (t) := S(t)S1 (t) and Y2 (t) :=
      −−−−−−→
      S(t)S2 (t). Now, from Eq. (17.62) and the Leibniz rule,
        df
           = LXH f = LXH [Ω(Y1 , Y2 )] = LXH Ω (Y1 , Y2 ) + Ω(LXH Y1 , Y2 ) + Ω(Y1 , LXH Y2 ) = 0,
        dt                               | {z }               | {z }                 | {z }
                                                0                     0                     0
      where LXH Ω = 0 is the invariance property (17.64) and LXH Y1 = 0 and LXH Y2 = 0 follows
      from the very definition of a Lie derivative (cf. Sec. A.4.2), the separation vectors Y1 and Y2
      being naturally Lie-dragged by XH . Finally, the independence of Hsol from the choice of t0
      is immediate from Eq. (17.69), which shows that Hsol (S) is the energy of the solution S (cf.
      Property 17.13).
          The triplet (Fsol , Ωsol , Hsol ) clearly constitutes a Hamiltonian system, as defined in Prop-
      erty 17.13, but without reference to any “time evolution”, given that each point S ∈ Fsol is a
      fully evolved solution by itself. The fully covariant picture is thus achieved. The Hamiltonian
      vector field XHsol is a vector field on Fsol , which generates 1-dimensional families of solutions.
      Two solutions S1 and S2 that lie on the same integral curve of XHsol are then solutions to the
      original equations of motion that only differ by a time translation, i.e. S2 (t) = S1 (t + t0 ) for
      some t0 ∈ R. In particular S1 and S2 have the same energy.
(17.10). Since no confusion may arise, for simplicity, we shall drop the index “sol” and denote
this space simply by7 F. A difference with what precedes is that we are dealing with a field
theory, which, by essence, has an infinite number of degrees of freedom. In other words, the
space of solutions F is an infinite-dimensional manifold. More precisely, it can be given the
structure of an infinite-dimensional Banach manifold [345, 255], i.e. a topological space that
locally resembles8 a Banach space of infinite dimension, in the same way as a n-dimensional
manifold locally resembles the vector space Rn . Since any point of F is a metric field g on M ,
a variation δg can be viewed as an infinitesimal displacement vector between two neighboring
points of the manifold F, similar to the displacement vector dx on M introduced in Secs. 1.2.1
and A.2.3. As in the finite dimensional case (cf. Sec. A.2.3), (non-infinitesimal) vectors on F
are defined9 as tangent vectors along curves of F, the latter being the images of smooth maps
P : I ⊂ R → F, λ 7→ P (λ) =: g(λ). Here g(λ) stands for a 1-parameter family of metric
fields on M , this family constituting a parametrization of a given curve L of F. Any vector X
at a point g0 ∈ F can then be written as in Eq. (A.14):
                                                            δg
                                                     X=        ,                                              (17.70)
                                                            δλ
where δg is the infinitesimal displacement vector connecting g0 =: g(λ) and g(λ + δλ) along
the curve L to which X is tangent to; λ is then the parameter of L associated to X and we
denote by δλ its small increment, instead of dλ as in Sec. A.2.3.
   We are going to see that, from the action (17.1), we can endow F with a (pre)symplectic
form Ω. Let Σ and Σ′ be two non-intersectic hypersurfaces of M , both diffeomorphic to Σ̂ and
such that Σ′ lies in the background future of Σ. Let Σr and Σ′r be compact subparts of Σ and
   7
      Be aware that in some part of the literature [24, 345, 507], F denotes instead the space of all fields g that are
required to only obey some asymptotic flatness conditions, but not necessarily the field equations; the subspace of
solutions to the field equations — our F — is then denoted by F̄ [345, 507] or Γ [24].
    8
      Cf. Lang’s book [336] for a precise definition of a Banach manifold.
    9
      Notice that the definition (A.8) of a tangent vector does not require the manifold to be of finite dimension.
678                      Black hole thermodynamics beyond general relativity
      Σ′ respectively, extending in the asymptotic flat end and corresponding to balls of Euclidean
      radius r in Σ̂ext . Let Wr be the hypercylinder connecting Σr to Σ′r (cf. Fig. ??) and D be the
      part of M bounded by Σr , Σ′r and Wr . Let us then consider the action (17.1) restricted to D:
                                                       Z
                                             SD [g] :=   L(g).                                 (17.71)
                                                                     D
      We are using the functional notation SD [g], but, when restricted to solutions g of the field
      equations, SD is merely a scalar field on the manifold F, i.e. a function F → R. Accordingly,
      the variation δSD of SD due to a small displacement δg on F can be viewed as the action of
      the differential 1-form dSD on the infinitesimal vector  δg: δSD = ⟨dSD , δg⟩ [cf. Eq. (A.21)].
      Now, from the definition (17.71), we get δSD = D δL(g), with δL(g) given by Eq. (17.5),
                                                          R
      in which E Ris set to zero for elements of F are solutions of the field equations. Hence
      ⟨dSD , δg⟩ = D dθ(g, δg). By linearity10 , we get for any vector X = δg/δλ at a point g ∈ F,
                                                    Z        δg 
                                       ⟨dSD , X⟩ =      dθ g,      .
                                                      D        δλ
      Thanks to Stokes’ theorem (A.94), we rewrite the right-hand side as a surface integral, taking
      into account that ∂D = Σ′r ∪ Σr ∪ Wr :
                                 Z         δg  Z         δg  Z          δg 
                     ⟨dSD , X⟩ =         θ g,    +      θ g,     +       θ g,      ,          (17.72)
                                   Σ′r ↗      δλ    Σ↗r
                                                             δλ      Wr
                                                                       ↗       δλ
      where the arrows indicate the outward orientation of the various hypersurfaces with respect to
      D. Since Eq. (17.72) is valid for any vector X on F, we may rewrite it as an identity between
      1-forms:
                                        dSD = ΘΣ′r − ΘΣr + ΥΣr ,Σ′r .                         (17.73)
      Here ΘΣr stands the 1-form on F defined by the following action on a vector X in the tangent
      space Tg F:                                  Z      δg 
                                     ⟨ΘΣr , X⟩ :=      θ g,     ,                           (17.74)
                                                    Σr      δλ
      where δg/δλ is the relative variation of g defining the vector X by Eq. (17.70) and the orientation
      of Σr is that provided by any background-future directed vector transverse to Σr . Note that
      ΘΣr is well defined, i.e. the integral in Eq. (17.74) is finite, because Σr is compact and θ is
      smooth; moreover, ΘΣr is linear for θ(·, ·) is linear in its second argument. The same definition
      holds for ΘΣ′r . Similarly, ΥΣr ,Σ′r stands for the 1-form on F defined by
                                                            Z      δg 
                                          ⟨ΥΣr ,Σ′r , X⟩ :=      θ g,    ,                         (17.75)
                                                             Wr↗      δλ
      with Wr defined as above. Note that the minus sign in front of ΘΣr in Eq. (17.73) occurs because
      of the change of orientation of Σr between Eqs. (17.72) and (17.73).
          In an attempt to endow F with a symplectic form, thereby turning it into a phase space,
      let us consider the 2-form Ω that is the limit Σr → Σ (i.e. the limit r → +∞) of the exterior
      derivative of ΘΣr . This defines a closed 2-form, which turns out to be independent of the choice
      of Σ:
        10
             Recall that θ(g, δg) is linear with respect to δg, cf. Property 17.2.
                                  17.4 Covariant phase space formalism                                             679
  Let F be the (Banach) manifold of solutions g to the field equations that are asymptotically
  flat in the asymptotically Euclidean end of the n-dimensional manifold M ∼ R × Σ̂. Let Σ
  be a hypersurface of M diffeomorphic to Σ̂ and Σr a compact subpart of Σ corresponding
  to a ball of Euclidean radius r in Σ̂ext . Given the 1-form ΘΣr defined on F by Eq. (17.74),
  the asymptotically flatness conditions of the gravity theory must insure that the limit
  is well defined, i.e. yields a finite 2-form on F. This 2-form is closed: dΩ = 0. One then
  says that Ω is a presymplectic forma on F. Moreover, Ω does not depend on the choice of
  the hypersurface Σ. The pair (F, Ω) is called the prephase space of the considered gravity
  theory (and asymptotic flatness conditions).
     a
       Recall that a symplectic form is a 2-form that is closed and non-degenerate; as we shall discuss below, Ω
  is actually degenerate.
Proof. For each value of r, dΘΣr is a 2-form on F, so if the limit (17.76) exists, it clearly defines
a 2-form Ω on F. Let us evaluate its exterior derivative, dΩ, by means of formula (A.88)
with p = 2: for any 3-tuple (X1 , X2 , X3 ) of vector fields on F, we have dΩ(X1 , X2 , X3 ) =
X1 (Ω(X2 , X3 )) − X2 (Ω(X1 , X3 )) + X3 (Ω(X1 , X2 )) − Ω([X1 , X2 ], X3 ) + Ω([X1 , X3 ], X2 ) −
Ω([X2 , X3 ], X1 ). Now
                                                      
         X1 (Ω(X2 , X3 )) = X1 lim dΘΣr (X2 , X3 ) = lim X1 (dΘΣr (X2 , X3 )) ,
                                       r→+∞                           r→+∞
with similar formulas for X2 (Ω(X1 , X3 )) and X3 (Ω(X1 , X2 )). There comes then dΩ(X1 , X2 , X3 ) =
limr→∞ ddΘΣr (X1 , X2 , X3 ). But ddΘΣr = 0 (nilpotence of the exterior derivative, cf. Eq. (A.92)).
We conclude that dΩ = 0. As for the independence of Ω from Σ, it arises from the follow-
ing identity obtained by taking the exterior derive of Eq. (17.73) and using ddSD = 0 in the
left-hand side:
                                0 = dΘΣ′r − dΘΣr + dΥΣr ,Σ′r .
Taking the limit r → +∞ yields ΩΣ′ − ΩΣ + lim dΥΣr ,Σ′r = 0. The asymptotic flatness
                                                          r→+∞
conditions ensure that lim dΥΣr ,Σ′r = 0 (Lemma 17.18 below), leaving us with ΩΣ′ = ΩΣ .
                          r→+∞
Remark 5: Instead of the limit (17.76), it could have seemed simpler to define the presymplectic form
as Ω := dΘΣ , where ΘΣ is defined as in Eq. (17.74) but with the integral extended to the whole of Σ.
However, as noticed in Ref. [24] (p. 429), standard asymptotic flatness conditions do not guarantee that
this integral is convergent; hence the 1-form ΘΣ would be ill-defined.
  The action of the presymplectic form Ω on a pair of commuting vector fields (X1 , X2 )
680                   Black hole thermodynamics beyond general relativity
        of F is expressible as follows. According to Eq. (17.70), each vector field describes the
        variation of g along some curves of F. Since they commute, the vector fields (X1 , X2 ) can
        be associated to a 2-parameter family g(λ1 , λ2 ) of metric tensors, spanning a 2-surface
        S ⊂ F, so that X1 (resp. X2 ) corresponds to variations δg along curves λ2 = const (resp.
        λ1 = const) in the 2-surface S, i.e. to partial derivatives:
                         δg                     ∂g                         δg                     ∂g
                 X1 =                      =:           and       X2 =                       =:       .      (17.77)
                         δλ1   λ2 =const        ∂λ1                        δλ2   λ1 =const        ∂λ2
        We have then                                       Z      ∂g ∂g 
                                           Ω(X1 , X2 ) =        ω g,    ,    ,                               (17.78)
                                                              Σ      ∂λ1 ∂λ2
        where ω is the (n − 1)-form on M defined from the presymplectic potential (n − 1)-form
        θ of the considered gravity theory (cf. Property 17.2) by
                                                                                
                             ∂g ∂g      ∂               ∂g        ∂     ∂g 
                         ω g,    ,    :=                θ g,       −      θ g,       .                       (17.79)
                              ∂λ1 ∂λ2    ∂λ1                 ∂λ2     ∂λ2       ∂λ1
        ω is called the presymplectic current (n−1)-form [507]a . It has a local dependence on the
        metric tensor g and on the derivatives ∂g/∂λ1 and ∂g/∂λ2 associated to the 2-parameter
        family g(λ1 , λ2 ), and is linear with respect to these derivatives.
           a
             It is called the symplectic current (n − 1)-form in articles by Wald and Iyer [502, 292] preceding [507],
        but, as discussed below, presymplectic is a better qualifier than symplectic.
      Proof. Let us compute dΘΣr (X1 , X2 ) via the exterior derivative formula (A.89), noticing that (i)
      this formula does not involve the manifold dimensionality (contrary to (A.90b), which applies
      to finite dimensions only) and (ii) the last term in the right-hand side of (A.89) vanishes since
      [X1 , X2 ] = 0. We get
      ∂λ1 ∂λ2
              = ∂λ∂2 ∂λ 1
                          . Hence the notation ω = ω(g, ∂g/∂λ1 , ∂g/∂λ2 ).
                                       17.4 Covariant phase space formalism                                              681
Example 9 (presymplectic current form of general relativity): For general relativity, one deduces
from expression (17.9) for θ and the definition (17.79) that the presymplectic current (n − 1)-form is
ω(g, δ1 g, δ2 g)α1 ···αn−1 = (32π)−1 P λµντ ρσ (δ2 gµν ∇τ δ1 gρσ − δ1 gµν ∇τ δ2 gρσ ) ϵλα1 ···αn−1 , (17.80)
with
P λµντ ρσ := 2g λρ g σµ g ντ − g λτ g µρ g σν − g λµ g ντ g ρσ − g µν g λρ g στ + g µν g λτ g ρσ . (17.81)
We shall not detail the computation, which is straightforward but rather lengthy. Let us simply mention
that one shall start by expressing the covariant derivatives in formula (17.9) for θ in terms of g’s
                                                                          √
Christoffel symbols [Eq. (A.64) with (k, ℓ) = (0, 2)] and replace ϵ by −g e as in Example 2, plug the
result in Eq. (17.79) and make use of the identities (16.20) and (16.21). As a check, one can compare with
Eqs. (21)-(23) of Ref. [283] or Eqs. (41)-(43) of Ref. [507].
The 1-form ΥΣr ,Σ′r defined by Eq. (17.75) obeys limr→+∞ dΥΣr ,Σ′r = 0.
Proof. By exactly the same computation as that leading to Eq. (17.78), we have
                                                Z      ∂g ∂g 
                         dΥΣr ,Σ′r (X1 , X2 ) =      ω g,    ,     .
                                                 Wr↗      ∂λ1 ∂λ2
As argued in Ref. [292], if the asymptotic flatness conditions ensure that the integral (17.78) on
the unbounded hypersurface Σ is convergent, then the integral of the same integrand ω over
Wr must tend to zero for r → +∞.
  The presymplectic form Ω is unique for a given gravity theory with appropriate asymptotic
  flatness conditions. Not only it is does not depend on the choice of the hypersurface Σ
  (Property 17.16), but it is unaffected by the ambiguities θ → θ + dY + δµ in the definition
  of θ [cf. Eq. (17.6)]. Moreover, the presymplectic current (n − 1)-form ω itself is unaffected
  by the ambiguity θ → θ + δµ.
Proof. Let us first show the last point, by evaluating ω from Eq. (17.79) for θ ′ (g, δg) = θ(g, δg)+
δµ(g). For δg = (∂g/∂λ2 ) δλ2 , we get
                                           ∂g       ∂g      ∂
                                       θ ′ g,     = θ g,     +     µ(g),
                                              ∂λ2        ∂λ2   ∂λ2
so that
                                                                               ∂2
                                                                      
                         ∂          ∂g 
                                   ′                  ∂             ∂g 
                                  θ g,             =            θ g,       +         µ(g).
                        ∂λ1            ∂λ2           ∂λ1             ∂λ2     ∂λ1 ∂λ2
682                 Black hole thermodynamics beyond general relativity
      Thanks to the commutativity of partial derivatives, the terms involving µ cancel each other
      when substituting θ ′ for θ in Eq. (17.79). Hence the (n − 1)-form ω computed from θ ′ is the
      same as that computed from θ. It follows then from Eq. (17.78) that the presymplectic form Ω
      is independent of µ as well. On the other side, the change θ ′ (g, δg) = θ(g, δg) + d (Y (g, δg)),
      as permitted by Eq. (17.6), leads to the following change in the presymplectic form:
                                            Z                                          
               ′                                    ∂       ∂g           ∂       ∂g 
              Ω (X1 , X2 ) = Ω(X1 , X2 ) +              Y g,           −        Y g,          ,
                                             S∞ ∂λ1           ∂λ2         ∂λ2         ∂λ1
      where use has been made of Eqs. (17.78)-(17.79), as well as the Stokes’ theorem (A.94), S∞
      standing for the “outer boundary” of Σ, i.e. the limit r → +∞ of a (n − 2)-sphere Sr of
      radius r in the asymptotically Euclidean end of Σ. Now, given that the (n − 2) form Y must
      be constructed from g, δg and their derivatives in a covariant manner, the fall-off conditions
      from standard asymptotic flatness are sufficient to make the above integral over S∞ vanish
      [292]. Hence Ω′ = Ω.
        Provided that the 2-parameter family g(λ1 , λ2 ) involved in the definition (17.79) regards
        only solutions to the field equations (i.e. g(λ1 , λ2 ) ∈ F), the presymplectic current ω is a
        closed (n − 1)-form:
                                                   dω = 0.                                     (17.82)
      Proof. The 2-parameter family of metric tensors g(λ1 , λ2 ) gives birth to a 2-parameter family
      of Lagrangian n-forms L(λ1 , λ2 ) := L(g(λ1 , λ2 )). The variation δ1 L of L corresponding to a
      variation of λ1 is given by Eq. (17.5) with E µν = 0 since g(λ1 , λ2 ) fulfills the field equations.
      Hence δ1 L = d (θ(g, δ1 g)). Given that δ1 L = ∂λ∂L
                                                         1
                                                           δλ1 , δ1 g = ∂λ
                                                                         ∂g
                                                                           1
                                                                             δλ1 and θ(., .) is linear with
      respect to its second argument, we get ∂L/∂λ1 = d (θ(g, ∂g/∂λ1 )). Taking the derivative of
      this expression with respect to λ2 yields
                            ∂ 2L
                                                                                
                                       ∂        ∂g              ∂       ∂g 
                                  =       d θ g,           =d           θ g,         .
                          ∂λ2 ∂λ1    ∂λ2           ∂λ1           ∂λ2          ∂λ1
      Swapping λ1 and λ2 and subtracting the above formula yields, in view of ω’s definition (17.79),
        ∂2L        2L
      ∂λ1 ∂λ2
              − ∂λ∂2 ∂λ 1
                          = dω. But the left-hand side of this equation is trivially zero, hence the result
      (17.82).
          It turns out that in general, the presymplectic form Ω is degenerate: there exists nonzero
      vectors X on F, such that Ω(X, .) = 0. Hence Ω fails to be a symplectic form on F. For
      instance, one may conceive a nonzero variation δg of g such that δg = 0 on the hypersurface
      Σ involved in Eq. (17.78). Then, by linearity of ω(g, δ1 g, δ2 g) with respect to δ1 g and δ2 g (cf.
      Property 17.17), one has ω(g, δg, .) = 0 on Σ and Eq. (17.78) shows that δg defines a degeneracy
      direction of Ω. More generally, the degeneracy of Ω is related to the gauge freedom of the
      gravity theory, i.e. to the invariance by diffeomorphism. For general relativity, it has been
      shown [24] that the degeneracy directions of Ω are exactly those generated by diffeomorphisms
                                    17.4 Covariant phase space formalism                                   683
of M [cf. Eqs. (17.11)-(17.12)] that reduce to the identity at infinity (i.e. such that ξ in Eq. (17.12)
tends to zero for r → +∞). The gauge freedom is intimately related to the initial value problem
of the theory being not well posed (it becomes well posed only in a fixed gauge, see e.g. Chap. VI
of Ref. [108]). Recall that in the covariant phase space construction of Property 17.14, it was
required that the initial value problem is well posed to endow the space of solutions with a
(non-degenerate) symplectic form.
    It is however possible to introduce a quotient space F̄ = F/R on which Ω gives birth to a
true symplectic form. The idea is to “factor out” the degeneracy directions of Ω. This is possible
because these directions are integrable into submanifolds of F. According to Frobenius theorem
(Theorem B.3.1 in Ref. [499]), the integrability is equivalent to the commutator [X1 , X2 ] being a
degeneracy vector field for any two degeneracy vector fields X1 and X2 . To show this property,
let us use the identity (A.79) and the Leibniz rule to write Ω([X1 , X2 ], .) = Ω(LX1 X2 , .) =
LX1 [Ω(X2 , .)] − LX1 Ω (X2 , .) = −LX1 Ω (X2 , .) since Ω(X2 , .) = 0. Now, thanks to Cartan
identity (A.95), LX1 Ω = dΩ(X1 , ., .) + d[Ω(X1 , .)] = 0, given that dΩ = 0 (Property 17.16)
and Ω(X1 , .) = 0. Hence Ω([X1 , X2 ], .) = 0 and the degeneracy directions of Ω are integrable
into submanifods of F. Assuming that these submanifolds provide a regular foliation of F (cf.
the discussion in Ref. [345]), one defines an equivalence relation R on F by setting g1 Rg2 iff
the points g1 and g2 of F belong to the same degeneracy submanifold. The set of equivalence
classes F̄ := F/R can be given the structure of Banach manifold, such that the canonical
projection π : F → F̄ is a smooth map. One may then define a 2-form Ω̄ on F̄ by setting,
for any pair (X̄1 , X̄2 ) of tangent vectors at a point ḡ ∈ F̄, Ω̄(X̄1 , X̄2 ) := Ω(X1 , X2 ), where
(X1 , X2 ) ∈ (Tg F)2 , such that π(g) = ḡ, π∗ X1 = X̄1 and π∗ X2 = X̄2 (pushforward by π, cf.
Sec. A.2.8). Because Ω is preserved along its degeneracy directions (cf. LX1 Ω = 0 in the above
proof), the definition of Ω̄(X̄1 , X̄2 ) is independent of the choice of g in the fiber π −1 (ḡ). It
is also independent of the choice of X1 and X2 in Tg F because two vectors X1 and X1′ such
that π∗ X1 = π∗ X1′ differ only by a vector along a degeneracy direction of Ω. Hence Ω̄(X̄1 , X̄2 )
is unambiguously defined. It can be shown that Ω̄ is a closed non-degenerate 2-form, i.e. a
symplectic form (cf. Refs. [255, 345] for details). The genuine covariant phase space of the
considered gravity theory is then (F̄, Ω̄). However, in what follows, we will deal only with the
prephase space (F, Ω).
  11
       Let us recall that thanks to Poincaré lemma (cf. Sec. A.4.3), ω closed =⇒ ω locally exact.
684                   Black hole thermodynamics beyond general relativity
        and let ξ be a vector field on M . Any variation δg among the solutions to the field
        equationsa generates a variation δQ(ξ) of the Noether potential (n − 2)-form of ξ (cf.
        Property 17.6 and Remark 4 on p. 662), such that
        where ω is the presymplectic current (n − 1)-form (Property 17.17) and θ is the presym-
        plectic potential (n − 1)-form (Property 17.2).
           a
             This means that both g and g + δg are assumed to be solutions to the field equations, which implies
        that δg is a solution to the linearized field equation.
      Proof. The variation of the Noether current (n − 1)-form J (ξ) induced by δg is obtained from
      Eq. (17.16): δJ (ξ) = δθ(g, Lξ g) − ξ · δL. Note that we have set δξ = 0, for the vector field
      ξ is independent of g. Here δL is given by Eq. (17.5) with E = 0 since we are dealing with
      solutions to the field equations only: δL = dθ(g, δg). Hence
      where use has been made of Cartan identity (A.95). Now, there is no loss of generality in
      considering that there exists a 2-parameter family g(λ1 , λ2 ) of metric fields such that δg =
      (∂g/∂λ1 ) δλ1 and ∂g/∂λ2 = Lξ g [cf. Eq. (17.12), rewritten as δ2 g = δλ2 Lξ g]. In view of ω’s
      definition (17.79), we have then δθ(g, Lξ g) − Lξ θ(g, δg) = ω(g, δg, Lξ g), so that the above
      equation becomes
                                 δJ (ξ) = ω(g, δg, Lξ g) + d [ξ · θ(g, δg)] .
      Finally, restoring the explicit dependence of J on g (cf. Remark 3 on p. 661), we have δJ (ξ) :=
      J (g + δg, ξ) − J (g, ξ). Since both g and g + δg are solutions to the field equations, we may
      invoke Eq. (17.21) to write J (g + δg, ξ) = dQ(g + δg, ξ) and J (g, ξ) = dQ(g, ξ). Hence there
      comes δJ (ξ) = d[Q(g + δg, ξ) − Q(g, ξ)] = dδQ(ξ), from which Eq. (17.83) follows.
      Remark 6: The denomination fundamental identity of the covariant phase space formalism for Eq. (17.83)
      is used in some part of the literature [138, 246, 250], as here, while in another part [283, 284], it is given
      to a generalization of it, which holds even if g and g + δg do not satisfy the field equations.
If we substitute the fundamental identity (17.83) for ω(g, δg, Lξ g) and use Stokes’ theorem
(A.94), we arrive at                   Z             Z
                           Ω(δg, Xξ ) = δ         Q(ξ) −          ξ · θ(g, δg),                (17.84)
                                                 S∞              S∞
Then we can rewrite Eq. (17.84) as δHξ = Ω(δg, Xξ ). Given that δHξ = ⟨dHξ , δg⟩ [cf.
Eq. (A.21)] and δg is a generic displacement on F, this is actually equivalent to
We recognize Eq. (17.65), which defines the Hamiltonian vector field XH in terms of the
Hamiltonian H. The difference is that Eq. (17.65) involves a true symplectic form, while Ω in
Eq. (17.86) is only presymplectic. In particular, we could not invert Ω to define Xξ in terms of
Hξ , but this is not a problem here since Xξ is given a priori and Eq. (17.86) defines Hξ from it
(assuming that the right-hand side of Eq. (17.84) is a pure variation). One says that Hξ is the
Hamiltonian conjugate to the vector field ξ [507]. Hence, if there exists a function Hξ on
F fulfilling (17.85), the dynamics generated on the prephase space (F, Ω) by the vector field
Xξ , i.e. the action of the diffeomorphism group generated by ξ on the solutions g to the field
equations, is a Hamiltonian dynamics.
    A natural question is whether or not a given vector field ξ admits a conjugate Hamiltonian.
A sufficient condition is that there exists a (n − 1)-form B(g) on M such that
                                  Z                  Z
                                    ξ · θ(g, δg) = δ     ξ · B(g).                         (17.87)
                                  S∞                         S∞
In particular, this holds if the asymptotic behavior of the presymplectic potential (n − 1)-form
is θ(g, δg) ∼ δB(g) for some (n − 1)-form B(g). This is the case for general relativity:
            r→+∞
      where e is the Levi-Civita tensor of f and H̄(δg) is the vector defined from δg in the same way as
      H(δg) in Eq. (16.13), but substituting f for g: H̄ α := f αρ f µν (Dν δgρµ − Dρ δgµν ), where D stands for
      the covariant derivative associated to f . Since δf = 0 and Dδ = δD, for f is independent of g, we
      may write H̄(δg) = δK(g), where K(g) is the vector field defined by
      The (n − 1)-form B(g) defined above fulfills Eq. (17.87), thereby ensuring that a conjugate Hamiltonian
      Hξ exists for any asymptotic symmetry generator ξ.
      Remark 1: As the above example shows, the (n − 1)-form B(g) does not need to be covariantly
      defined: Eq. (17.90) involves the flat metric f , which is arbitrarily chosen. In particular, B(g) is highly
      non-unique; what matters is only its asymptotic behavior.
          If condition (17.87) is satisfied, then a function Hξ fulfilling (17.85) is
                                                  Z
                                       Hξ [g] :=      (Q(ξ) − ξ · B(g)) .                                 (17.91)
                                                       S∞
        for any pair of variations (δ1 g, δ2 g) among the solutions to the field equations. Furthermore,
        if the prephase space F is simply connected, this condition is sufficient.
      Proof. Let us suppose that Hξ exists. Then, by taking the variation δ1 of Eq. (17.85) written for
      δ = δ2 , we get                                  Z
                                 δ1 δ2 Hξ = δ1 δ2 QS∞ ,ξ −         ξ · δ1 (θ(g, δ2 g)) ,
                                                                S∞
      where QS∞ ,ξ is the Noether charge of S∞ with respect to ξ [cf. Eq. (17.28)]. By subtracting
      the term 1 ↔ 2 and using (δ1 δ2 − δ2 δ1 )Hξ = 0, as well as (δ1 δ2 − δ2 δ1 )QS∞ ,ξ = 0, we obtain
      (17.92), given the definition (17.79) of ω. Hence (17.92) appears as a necessary condition for Hξ
      to exist. To prove that it is sufficient when F is simply connected, let us select a point g0 ∈ F
      and define for any point g ∈ F and any curve L from g0 to g,
                                                            Z
                                               H(g, L ) :=      α,                                (17.93)
                                                                  L
                      17.5 First law for diffeomorphism-invariant theories                                687
To evaluate the 2-form dα on S, let us compute dα(X1 , X2 ) for a pair of vectors (X1 , X2 )
associated to a 2-parameter family ḡ(λ1 , λ2 ) spanning S [cf. Eq. (17.77)]. We may choose
(λ1 , λ2 ) ∈ [0, 1]2 , with ḡ(λ1 , 0) spanning L , ḡ(λ1 , 1) spanning L ′ , ḡ(0, 0) = ḡ(0, 1) = g0
and ḡ(1, 0) = ḡ(1, 1) = g. By a computation similar to that of dΘΣr (X1 , X2 ) in the proof of
Property 17.17, we arrive at
                   Z ( 2                 2
                                                     "                                          #)
                           ∂ Q(ξ) ∂ Q(ξ)                ∂           ∂ḡ       ∂          ∂ḡ 
 dα(X1 , X2 ) =                      −          −ξ ·           θ ḡ,         −        θ ḡ,           .
                     S∞ |  ∂λ1 ∂λ2 ∂λ2 ∂λ1             ∂λ            ∂λ2       ∂λ           ∂λ1
                                     {z       }        | 1                   {z 2                 }
                                      0                                          
                                                                           ∂ ḡ   ∂ ḡ
                                                                     ω ḡ, ∂λ   , ∂λ
                                                                               1     2
Hence if Eq. (17.92) holds, dα(X1 , X2 ) = 0. We conclude that dα = 0 on S and thus that
H(g, L ) − H(g, L ′ ) = 0. It follows that H(g, L ) is a function of g only, which we shall
denote by H(g). From the definitions (17.93)-(17.94), as well as the definition of the integral of
a 1-form along a curve, we get
                                     Z                                Z           Z
δH = H(g+δg)−H(g) = ⟨α, δg⟩ =            (δQ(ξ) − ξ · θ(g, δg)) = δ       Q(ξ)−       ξ·θ(g, δg).
                                           S∞                                   S∞       S∞
      Example 11 (canonical energy for general relativity is ADM energy): From expression (17.23) for
      Q(ξ) in general relativity, we get
               Z                    Z                  Z                    Z
                                 1                  1              µν    1
                   Q(ξ) = −              ⋆(dξ) = −        (dξ)µν dS = −        ∂µ ξν dS µν ,
                S∞             16π    S∞           32π  S∞              16π  S∞
      where we have used Lemma 5.10 to express the integral in terms of the area element normal bivector
      dS to S∞ and have used formula (A.90b) and the antisymmetry of dS to get the last equality. Now,
      since ξ generates an asymptotic time translation, we have ξ = ∂t , where t is the first coordinate of an
      asymptotically Minkowskian system (xα ) = (t, x1 , . . . , xn−1p ), i.e. a f -Minkowskian coordinate system
      (cf. Sec. 5.3.1) for a flat metric f such that g ∼ f for r := (x1 )2 + · · · (xn−1 )2 → +∞. S∞ can then
      be considered as the limit r → +∞ of a (n − 2)-sphere St,r defined by (t, r) = const. From Eq. (5.42),
                                                               √
      we have dS µν := (sµ nν − nµ sν ) dS, with dS := q dn−2 y (the area element of St,r ) and, at the limit
      r → +∞, nµ = ξ µ = δ µt and sµ = (0, si ), with si := xi /r for i ∈ {1, . . . , n − 1}. Furthermore,
      ξν = gνσ ξ σ = gtν . Hence, there comes
             Z                      Z                                         Z
                                  1                       µ ν            1
                   Q(ξ) = −                       µ ν
                                         ∂µ gtν (s δ t − δ t s ) dS = −            (∂i gtt − ∂t gti ) si dS. (17.96)
               S∞               16π S∞                                  16π S∞
      On the other hand, from the (n−1)-form B obtained in Eq. (17.90), we have ξ·B = (16π)−1 ξ·(K ·e) =
      (16π)−1 e(K, ξ, . . .). For r → +∞, e ∼ ϵ and we may write ξ · B ∼ (16π)−1 ϵ(K, ξ, . . .) =
      (16π)−1 ⋆ (K ∧ ξ), where K ∧ ξ stands for the 2-form of components (K ∧ ξ)αβ = Kα ξβ − ξα Kβ
      and we have used Eq. (5.38) with p = 2 to let appear the Hodge dual of K ∧ ξ. Using again Lemma 5.10
      and the antisymmetry property (Kµ ξν − ξµ Kν ) dS µν = 2Kµ ξν dS µν , we arrive at
                    Z                    Z                       Z
                                      1                       1
                        ξ·B =                Kµ ξν dS =µν
                                                                     Kµ gtν (sµ δ νt − δ µt sν ) dS
                     S∞             16π S∞                   16π S∞
                                         Z                                       Z
                                      1                                      1
                                 =           (Ki gtt −Kt gti )si dS = −               Ki si dS.
                                    16π S∞ |{z}             |{z}           16π S∞
                                                      −1         0
      Now, from Eq. (17.89), we have, on S∞ , Ki = f µν (Dν giµ −Di gµν ) = −(∂t git −∂i gtt )+δ kl (∂l gik −∂i gkl ).
      Hence
                Z                     Z                                  Z
                                  1                                   1
                     ξ·B =−               (∂i gtt − ∂t gti ) si dS −        δ kl (∂l gik − ∂i gkl ) si dS.
                 S∞              16π   S∞                            16π  S∞
      We notice that the first integral on the right-hand side is exactly that appearing in Eq. (17.96), so that
      they cancel each other in formula (17.95) for the canonical energy, leaving
                                                 Z
                                              1
                                       E=             δ kl (∂l gik − ∂i gkl ) si dS.                     (17.97)
                                            16π S∞
      This is exactly the formula giving the ADM energy (cf. Sec. 5.3.5): compare Eq. (17.97) with e.g. Eq. (8.11)
      in Ref. [227].
                       17.5 First law for diffeomorphism-invariant theories                                     689
Remark 2: The integral of Q(ξ) over S∞ , which appears in formula (17.95) for E, is the Noether charge
QS∞ ,ξ of S∞ with respect to ξ (cf. Property 17.8). For a stationary spacetime in general relativity,
we have noticed that QS∞ ,ξ is (n − 3)/(n − 2) times the Komar mass at infinity (Property 17.9). The
latter is identical to the ADM energy (Property 5.18). Hence, for a stationary spacetime ruled by general
relativity, the relative contributions to E of the terms Q(ξ) and −ξ · B in Eq. (17.95) are respectively
(n − 3)/(n − 2) and 1/(n − 2), i.e. 1/2 and 1/2 for n = 4.
    Let us now consider a vector field η on M that is the generator of an asymptotic sym-
metry corresponding to a spatial rotation, i.e. η ∼r→+∞ x∂y − y∂x in some asymptotically
Minkowskian coordinate system (t, x1 =: x, x2 =: y, x3 , . . . , xn−1 ). Without any loss of
generality, we may choose the (n − 2)-surfaces St,r converging to S∞ for r → +∞ such
that
R η is tangent to each St,r and thus to S∞ . Then, in Eq. (17.84) with ξ := η, we have
  S∞
     η · θ(g, δg) = 0. Indeed, η · θ := θ(η, ., . . . , .) is a (n − 2)-form that vanishes on S∞ : for
any (n − 2)-tuple of vectors (v1 , . . . , vn−2 ) tangent to S∞ , (η, v1 , . . . , vn−2 ) is (n − 1)-tuple of
vectors all tangent to the (n − 2)-dimensional manifold S∞ ; these vectors are thus necessarily
linearly dependent, so that θ(η, v1 , . . . , vn−2 ) = 0 since θ is fully antisymmetric.           It follows
that a Hamiltonian conjugate to η always exists and is given by Hη [g] = S∞ Q(η). The
                                                                                            R
negative of its value for a given solution g to the field equation defines the canonical angular
momentum of that solution relative to η:
  J is actually the opposite of the Noether charge QS∞ ,η of S∞ with respect to η, as defined
  in Property 17.8.
        where ξ is the stationary Killing vector, the η(i) ’s are L axisymmetric Killing vectors
        (1 ≤ L ≤ [(n − 1)/2]) and the Ω(i) ’s are L real constants. By virtue of Property 3.16, the
        surface gravity κ of H is constant, so that the Hawking temperature of H is well defined:
        TH = ℏκ/(2πkB ) [Eq. (16.62)]. Let us consider an asymptotically flat solution δg to the
        linearized field equations around g, i.e. a variation δg in the space F of asymptotically flat
        solutions to the field equations (δg is not required to be stationary). Let us define the Wald
        entropy of the black hole in the perturbed spacetime (M , g + δg) by the same formula as
        in the stationary case, namely Eq. (17.33), but with the integral taken on the bifurcation
        surface Sˆ of the unperturbed configurationa :
                                                          Z
                                                    2πkB
                                           SW :=              Q(χ),                             (17.100)
                                                     ℏκ Sˆ
        where χ is the same vector field on M as in the unperturbed configuration (in other
        words, δχ = 0). Finally, let us assume that the asymptotic flatness conditions ensure that
        a Hamiltonian conjugate to ξ exists. Then δg generates a variation δE of the canonical
        energy relative to ξ [Eq. (17.95)] obeying
                                                                L
                                                                X
                                            δE = TH δSW +                Ω(i) δJ(i) ,                       (17.101)
                                                                   i=1
        where J(i) is the canonical angular momenta relative to η(i) [Eq. (17.98)].
           a
              As discussed in the proof of Property 16.1, there is no loss of generality in considering that the event
        horizons of the stationary spacetime (M , g) and the perturbed one (M , g + δg) are described by the same
        hypersurface H of M . A priori H is not a Killing horizon in (M , g + δg), since the latter is not assumed
        stationary. The submanifold Sˆ of H is thus not a bifurcation surface in (M , g + δg). However, this does
        not prevent ones to define the integral of the (n − 2)-form Q(χ) in (M , g + δg) (i.e. Q(χ) = Q(g + δg, χ),
        if the dependence of Q on g is made explicit) on the (n − 2)-surface Sˆ, as in Eq. (17.100). Similarly, in
        Eq. (17.100), κ is the surface gravity of H in (M , g), given that there is no well-defined concept of surface
        gravity if H is not a Killing horizon.
      Proof. Let us consider the fundamental identity (17.83) with ξ = χ, which holds since we
      assume that χ is a fixed vector field on M (δχ = 0). We may set ω(g, δg, Lχ g) = 0 in its
      left-hand side since ω(g, δg, Lχ g) is linear with respect to Lχ g (Property 17.17) and Lχ g = 0
      for χ is a Killing vector of g. There remains then
                                            d [δQ(χ) − χ · θ(g, δg)] = 0.
      The left-hand side is a (n − 1)-form. Let us integrate it on a hypersurface Σ extending from
                      17.5 First law for diffeomorphism-invariant theories                                 691
the bifurcation surface Sˆ of the bifurcate Killing horizon to the asymptotically flat end. Using
Stokes’ theorem (A.94) with ∂Σ = Sˆ ∪ S∞ , we get
                            Z            Z
                          δ     Q(χ) =        [δQ(χ) − χ · θ(g, δg)] .                    (17.102)
                               Sˆ            S∞
where we have used Eq. (17.100) to let appear the Wald entropy SW and Eq. (17.85) to let appear
the variations of the Hamiltonians Hξ and Hη(i) conjugate to ξ and η(i) respectively. Given
that δκ = 0 (for κ in Eq. (17.100) is set to the surface gravity in the unperturbed spacetime, cf.
footnote a), ℏκ/(2πkB ) = TH [Eq. (16.62)], Hξ [g] = E (Property 17.23) and Hη(i) [g] = −J(i)
(Property 17.24), we get Eq. (17.101).
    At first glance, Property 17.25 generalizes the first law of black hole thermodynamics
obtained for general relativity in Chap. 16 to any diffeomorphism-invariant theory of gravity:
compare Eq. (17.101) with Eq. (16.74), taking into account that, for general relativity, E = M (cf.
Example 11 on p. 688). However, one should notice that δSW in Eq. (17.101) is the variation of the
entropy defined by Eq. (17.100), while in Eq. (16.74), δSBH was the variation of entropy between
two nearby stationary configurations. Given that for a stationary configuration, the Wald
entropy SW =: SW   eq
                      is defined by Eq. (17.33), where κ is the surface gravity corresponding to
that configuration and not of some reference configuration as in Eq. (17.100), one should prove
that δSW = δSW  eq
                   to claim that Eq. (17.101) truly generalizes the first law (16.74). Specifically, let
us use the same framework as in Chap. 16, namely the perturbed configuration is a stationary
black hole, the horizon of which is a Killing horizon with respect to the Killing vector
                                                  L
                                                  X
                              χ + δχeq = ξ +            (Ω(i) + δΩ(i) )η(i) ,                  (17.103)
                                                  i=1
and the corresponding surface gravity is κ+δκ. Note that we are using the same gauge freedom
as in Chap. 16, namely the horizon is the same hypersurface of M and the stationary and
axisymmetric Killing vectors are held fixed: δξ = 0 and δη(i) = 0 [Eq. (16.4)]. We are using
the subscript “eq” in δχeq to avoid any confusion with δχ = 0 assumed in Property 17.25. We
have then, from Eq. (17.33) with S = Sˆ,
                                  Z                                Z
                 eq      2πkB                                2πkB
               δSW =                  Q(g + δg, χ + δχeq ) −           Q(g, χ).
                       ℏ(κ + δκ) Sˆ                           ℏκ Sˆ
Using the linearity of Q(g, χ) with respect to χ (cf. Property 17.6) and keeping the linear order
in the perturbation, we get
                     Z                                                        Z
         eq   2πkB                                                    2πkB δκ
      δSW =             [Q(g + δg, χ) + Q(g, δχeq ) − Q(g, χ)] −                  Q(g, χ).
                ℏκ Sˆ                                                   ℏκ2    Sˆ
692                  Black hole thermodynamics beyond general relativity
      Now, the difference between the first and third terms in the first integral corresponds exactly
      to δSW with SW defined by Eq. (17.100). Hence there comes
                                                Z                   Z         
                             eq           2πkB                   δκ
                          δSW = δSW +                Q(δχeq ) −          Q(χ) ,
                                           ℏκ     Sˆ              κ Sˆ
      where we have suppressed      the explicit dependence of Q on g. From Eqs. (17.99) and (17.103),
      one has δχeq = Li=1 δΩ(i) η(i) (in agreement with Eq. (16.8)). Given that Q(χ) is linear in χ,
                        P
      this yields Q(δχeq ) = Li=1 δΩ(i) Q(η(i) ). On the other hand, we may express δκ in terms of
                               P
      the δΩ(i) ’s as well, thanks to Eq. (16.11), which is purely kinematic and thus independent of
      the gravity theory:
                                                  L       Z
                                             1 X (i)
                                    δκ = −          δΩ        ∇µ η(i)ν dS µν ,                (17.104)
                                            2A i=1         Sˆ
      where A is the horizon area. Note that we have set to zero the term ISk in Eq. (16.11) since
      χ = 0 on Sˆ and ∇ν hµν := ∇ν δgµν is assumed regular on Sˆ. Hence we obtain
                                 L
                                          "Z                  Z                     Z       #
             eq           2πk B
                                X                          1
           δSW  = δSW +             δΩ(i)      Q(η(i) ) +          ∇µ η(i)ν dS µν ×     Q(χ) .
                           ℏκ i=1           Sˆ            2κA   Sˆ                   Sˆ
                                           | {z }             |       {z        } | {z }
                                                          (i)                    (i)          κA/(8π)
                                                     −J                      16πJ
                                                          Sˆ                     Sˆ
      The values indicated below the underbraces are those obtained for general relativity, resulting
      respectively from Eqs. (17.32), (5.65) and (17.35). The terms between the square brackets then
      cancel each other, leaving us with δSW  eq
                                                 = δSW . We thus conclude:
        For vacuum general relativity and for a variation between two stationary black holes, the
        first law (17.101) reduces to the “comparison” first law obtained in Property 16.2: Eq. (17.101)
        is identical to Eq. (16.22) with K = 1. Morever, still within general relativity, Property 17.25
        extends Property 16.2 to non-stationary variations δg.
      Remark 3: We have already noticed that the Hawking temperature TH is independent of the gravity
      theory (cf. Remark 2 on p. 646). Thus, contrary to the entropy, there was no need to amend it to get the
      first law in a generic diffeomorphism-invariant gravity theory.
      Remark 4: In the original work by Iyer & Wald [292], the black hole entropy in the perturbed con-
      figuration is defined by formula (17.36) instead of formula (17.100). Formula (17.36) makes sense for a
      perturbed black hole as well, provided that Sˆ is the bifurcation surface in the unperturbed configuration.
      Let us denote by SW ′ the entropy resulting from that definition, keeping S for (17.100). For a stationary
                                                                                  W
      black hole, both definitions are equivalent: SW = SW      ′ (Property 17.12), but, for a nonstationary one,
this is not obvious. However, it has been proven by Iyer & Wald that δSW = δSW ′ (cf. Eq. (94) in
      Ref. [292]), so that the first law (17.101) holds if δSW is substituted by δSW (cf. Eq. (93) in Ref. [292]).
                                                                                     ′
      Given the definition of SW  ′ , it is obvious that, when considering two nearby stationary configurations,
                                   17.6 What about the second law?                                           693
δS ′ eq
     W = δSW , so that using Iyer & Wald’s definition, Property 17.26 would have been immediate. We
            ′
nethertheless stick to the definition (17.100) since it gives the quantity that naturally appears when
deriving the first law (cf. the proof of Property 17.25), while the derivation for SW
                                                                                    ′ is more complicated,
cf. the proof in Ref. [292]. Furthermore, the definition (17.100) is connected with the recently proposed
entropy definitions for dynamical black holes [284, 506, 498], to be discussed below.
where κ is the surface gravity of the reference stationary black hole. For S = Sˆ, this formula
reduces to the Wald entropy expression (17.100), given that χ = 0 on Sˆ. Moreover BH =
BH (g, χ, Lχ g) and is linear with respect to Lχ g, so that BH = 0 on a stationary horizon and
SHWZ reduces then to SW as defined by Eq. (17.33). In the proof of Property 17.25, if one use a
hypersurface Σ with inner boundary S , instead of Sˆ, one obtains readily the first law (17.101)
with δSHWZ substituted for δSW . Given that δE and the δJ(i) ’s are the same (for they depend
only on values on S∞ ), it follows that δSHWZ = δSW for first order vacuum perturbations of a
stationary black hole. For non-vacuum perturbations, represented by an energy-momentum
tensor δT , Hollands, Wald and Zhang [284] have shown that δSHWZ (S2 ) − δSHWZ (S1 ) ≥ 0
694                  Black hole thermodynamics beyond general relativity
      if δT obeys the null energy condition (2.95) and S2 lies in the future of S1 . This constitutes a
      perturbative version of the second law.
          For general relativity, the HWZ entropy does not reduce to the Bekenstein-Hawking entropy
      SBH [Eq. (16.73)], except at the bifurcation surface Sˆ of the unperturbed spacetime, since its
      writes                                  Z
                                           kB
                          SHWZ = SBH +            vθ(ℓ) dS     (general relativity),           (17.106)
                                           4ℏ S
      where v is an affine parameter along the null geodesic generators of H such that v = 0 at
      Sˆ, ℓ := dx/dv is the associated tangent vector (ℓ is thus a null normal to H ), and θ(ℓ) is the
      expansion of H along ℓ. (cf. Property 2.9). Note that the product vθ(ℓ) is independent of the
      choice of the affine parameter v. Indeed, for an affine parameter v ′ := αv, one has ℓ′ = α−1 ℓ
      and θ(ℓ′ ) = α−1 θ(ℓ) (Property 2.10 with α ↔ α−1 ), so that v ′ θ(ℓ′ ) = vθ(ℓ) . We refer to [506, 498]
      for extended discussions about the HWZ entropy, especially in connection with the extension
      of the generalized second law (GSL, cf. Property 16.15) beyond general relativity.
          Other proposals for the entropy of dynamical black holes in diffeomorphism-invariant
      theories of gravity have been put forward in the literature, notably by Dong [171] and Wall
      [511]. As for HWZ, they regard perturbations of a stationary black hole (see Ref. [284, 498] for
      a comparison with HWZ entropy). Recently, a non-perturbative approach has been introduced
      by Davies & Reall [158], leading to a second law for effective field theories.
Chapter 18
Contents
         18.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 695
         18.2 Trapped surfaces and singularity theorems . . . . . . . . . . . . . . . 695
         18.3 Trapping horizons . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 707
18.1       Introduction
This chapter is in a draft stage.
      Figure 18.1: Intersection S (red curve) of the past light cones of two points p and q in Minkowski spacetime.
      Some light rays emerging from S in the two orthogonal null directions are depicted; both sets of light rays are
      converging. Yet S is not trapped because it is not compact. [Figure generated by the notebook D.6.7]
      coordinates (t̃, r, θ, φ). Let S be any surface (t̃, r) = const. S is diffeomorphic to S2 and thus compact.
      Moreover, from Eq. (6.33), the metric induced by g on S is q = r2 (dθ2 + sin2 θ dφ2 ), which is clearly
      positive definite; hence S is spacelike. One also reads on the expression of q that the area of S is
      simply A = 4πr2 (i.e. r is the areal radius, cf. Sec. 6.2.2). If S is located in the black hole interior, i.e. if
      0 < r < 2m, Property 6.9 implies that A is decreasing along any future-directed null geodesic. It follows
      that S is trapped. On the contrary, if located in the black hole exterior (r > 2m), S is untrapped. This
      can be checked on Fig. 6.3, where it appears clearly that, in the region r > 2m, r is increasing along the
      outgoing radial null geodesics, which are normal to S .
      Example 3 (trapped surfaces in the Vaidya collapse): Spherically symmetric trapped surfaces in the
      Vaidya radiation shell collapse have been investigated in Sec. 15.3.5: in terms of the IEF coordinates
      (t, r, θ, φ) [cf. Eq. (15.11)], they are the spheres (t, r) = const such that r < 2M (t + r) (Property 15.10).
      The region containing them is depicted as red dot-filled in the Carter-Penrose diagrams of Figs. 15.7,
      15.11 and 15.12.
          The existence of a trapped surface characterizes strong gravity. More precisely it indicates
      that gravity is strong enough to focus all light rays emitted from the surface. This characteriza-
      tion is quasi-local, in the sense that it involves a surface. The surface can be “small” but it cannot
      be reduced to a point, to which the qualifier local would apply. Note that the hypothesis of
      compact manifold is crucial in the definition of a trapped surface. Without it, trapped surfaces
      would exist in Minkowski spacetime, as the following example shows.
                           18.2 Trapped surfaces and singularity theorems                                       697
                                         dr             m 2
                                                   =± 1−      .                                       (18.3)
                                         dt   L±         r
This last equation is deduced from Eq. (18.1) by setting ds2 := g(dx, dx) = 0 with dx = dt ∂t + dr ∂r
along L± . One reads on the metric (18.1) that the area of S is A = 4πr2 , as in the Schwarzschild
case (Example 1). Now, since ∂t is future-directed timelike, it appears from Eq. (18.3) that r increases
(resp. decreases) towards to the future along L+ (resp. L− ), in both regions r < m and r > m. It
follows immediately that S is not trapped. Actually, there exists only marginally trapped surfaces in
this spacetime: the cross-sections of the event horizon H .
      surface occurs, then some singularity will appear in the future of it. Before stating the theorem,
      let us recall a few concepts on which it relies.
           First of all, a Cauchy surface Σ of a spacetime (M , g) is a spacelike hypersurface such
      that every inextendible timelike curve of M intersects Σ exactly once (cf. Sec. 5.2.3). If (M , g)
      admits a Cauchy surface, it is said to be a globally hyperbolic spacetime. On such a spacetime,
      general relativity can be formulated as a well posed Cauchy problem [111], i.e. there exists a
      unique solution g of the Einstein equation in M from initial data prescribed on Σ, provided
      that the initial data fulfill four components of the Einstein equation known as the constraint
      equations (see e.g. Ref. [227]).
           The second concept involved in Penrose’s theorem is that of an inextendible incomplete
      geodesic. As defined in Appendix B (Sec. B.3.2), a geodesic L of a spacetime (M , g) is
      incomplete if some affine parameter λ does not span the whole of R along L . Given that
      any two affine parameters are related by an affine transformation, if this holds for a given
      affine parameter, this holds for all. More precisely, if L is a causal (i.e. timelike or null)
      geodesic and λ is increasing to the future, one says that L is future-incomplete if, and only
      if, λ < λmax along L , for some λmax ∈ R. Similarly, one says that L is past-incomplete
      if, and only if, λ > λmin along L , for some λmin ∈ R. Furthermore, a geodesic L is said
      inextendible if, and only if, there does not exist any geodesic L ′ of (M , g) distinct from L
      and such that L ⊂ L ′ . An inextendible incomplete causal geodesic marks the existence of a
      singularity of some kind. For instance a free-falling observer having an inextendible future-
      incomplete timelike geodesic as worldline has his proper time1 abruptly stopping at some finite
      value! Actually, the existence of an inextendible incomplete geodesic is the nowadays adopted
      definition of a spacetime singularity. We shall discuss the connection with the concept
      of curvature singularity introduced in Sec. 6.3.4 later on. For the moment, we have enough
      material to state the famous theorem:
               • the null convergence condition (2.94) holds on M : for any null vector ℓ, R(ℓ, ℓ) ≥ 0,
                 where R is g’s Ricci tensor — for general relativity, such a condition is equivalent to
                 the null energy condition (2.95): T (ℓ, ℓ) ≥ 0;
        Then at least one inextendible null geodesic emerging orthogonally from S is future-
        incomplete.
      Proof. We shall prove the theorem by contradiction. Hence let us assume that all inextendible
      null geodesics emerging orthogonally from S are future-complete. Let F be the boundary
         1
             Recall that the proper time is an affine parameter along a timelike geodesic (Property B.7).
                             18.2 Trapped surfaces and singularity theorems                                              699
Figure 18.2: Spacetime diagram representing various objects involved in the proof of Penrose’s singularity
theorem (Property 18.1): the trapped surface S (the two red dots), the two null directions ℓ and k normal to S ,
the causal future of S , J + (S ) (pale green region), the boundary F of J + (S ) (dark green curve), the compact
space V containing F (light green curve) and the image Φ(F ) of F (orange segment) into the Cauchy surface Σ
(blue line) by the topological embedding Φ constructed from the field lines of a global timelike vector ξ (maroon
lines). The two dark green dots are crossover points, where null geodesics leave F to the interior of J + (S ).
The compact (n − 2)-dimensional surface S appears disconnected and reduced to two points because this 2D
figure is the cut by a vertical plane Π of the 3D view of Fig. 3.1, where S is drawn as an horizontal circle, whose
intersection with Π is two points.
of the causal future of S (cf. Sec. 4.4.1 and Fig. 18.2): F := ∂J + (S ). Since (M , g) is
globally hyperbolic (it admits Σ as a Cauchy surface), it is necessarily causally simple (cf.
Proposition 6.6.1 of HE [266]). By the very definition of causally simple (cf. Sec. 4.4.1) and the
compacity of S , it follows that F coincides with the future horismos of S : F = E + (S ).
This implies that points of F are connected to S by null geodesics. More precisely, we note
that F is an achronal boundary, namely the boundary of the causal future or past of a given
set. As mentioned in Sec. 4.4.3 (Remark 6), the properties of achronal boundaries are the same
as those established for a black hole event horizon H = ∂J − (I + ) ∩ M (Properties 4.5 to
4.11), provided one changes future to past when appropriate. It follows that F is an achronal
(n − 1)-dimensional topological manifold (Properties 4.5 and 4.7) that is ruled by null geodesics,
called the generators of F and obeying the following properties: when followed in the past
direction, a generator never leaves F until it encounters S and there is a unique generator
through each point of F , except at special points, named crossovers, where the generators
leave F in the future direction, i.e. are future-extended to null geodesics of M that do not
belong2 to F (Property 4.8). Note that two crossovers are shown in Fig. 18.2.
    We actually need an additional property of F , which has not been established in Sec. 4.4.3:
on S (which is part of F , cf. Sec. 4.4.1), the generators of F are orthogonal to S (cf. e.g.
Theorem 9.3.11 in Wald’s textbook [499]). They therefore start from S along the two null
   2
    They actually belong to the interior of J + (S ), i.e. to the chronological future I + (S ) [Eq. (4.33)] (see e.g.
Proposition 4.5.14 in HE [266]).
700                      The quasi-local approach: trapping horizons
      directions ℓ and k normal to S (cf. Fig. 18.2). Let us denote by L a generic null generator
      along ℓ and by λ the affine parameter of L such that, on S , λ = 0 and dx/dλ|L = ℓ. We
      shall use the latter relation to extend the definition of ℓ along L away from S . Similarly, let
      us denote by L¯ a generic null generator along k and by λ̄ the affine parameter of L¯ such that
      λ̄ = 0 on S and dx/dλ̄ L¯ = k. By hypothesis, L and L¯ are future-complete. Since S is a
      trapped surface, θ0 := θ(ℓ) S < 0 and θ̄0 := θ(k) S < 0. Then, the null Raychaudhuri equation
      (2.88) applied to the null hypersurface generated by the null geodesics of the L family and the
      null convergence condition R(ℓ, ℓ) ≥ 0 implies that there exists λ∗ ∈ (0, (n − 2)/|θ0 |] such
      that θ(ℓ) → −∞ for λ → λ∗ along L . The reasoning is exactly the same at that used in the
      proof of Property 16.8 and we shall not repeat it here. The point λ = λ∗ on L is a caustic point,
      where all nearby geodesics starting from S along ℓ converge. Given the properties of F , we
      conclude that for λ > λ∗ , L ceases to a be null generator of F , i.e. Lλ>λ∗ ̸∈ F . Similarly,
      along L¯, θ(k) → −∞ for λ̄ → λ̄∗ with λ̄∗ ∈ (0, (n − 2)/|θ̄0 |], so that L¯λ̄>λ̄∗ ̸∈ F . Let
                                                                                      
                                       n−2                                     n−2
                      λmax := sup                    and      λ̄max := sup                 .
                               p∈S    |θ(ℓ) (p)|                       p∈S   |θ̄(k) (p)|
      Since S is compact, λmax and λ̄max are finite. Let us then consider the map f : S ×[0, λmax ] →
      M , (p, λ) 7→ q, where q is the point of affine parameter λ on the null geodesic of the L -family
      through p. Since all the L geodesics are assumed to be future-complete, f is well defined.
      Moreover, f is clearly continuous. Then, given that S ×[0, λmax ] is a compact set, the image set
      f (S × [0, λmax ]) is necessarily compact. Similarly, the image set f¯(S × [0, λ̄max ]) is compact,
      where f¯ : S × [0, λ̄max ] → M , (p, λ) 7→ q̄ — the point of affine parameter λ̄ on the null
      geodesic of the L¯-family through p. The set
      is then a compact subset of M , as the union of two such sets. By construction, for each p ∈ S ,
      the point q = f (p, λmax ) (resp. q̄ = f (p, λmax )) lies beyond the caustic point on the geodesic
      L (resp. L¯) through p. It follows that F ⊂ V (cf. Fig. 18.2). Given that F is closed, being a
      topological boundary (F := ∂J + (S )), and V is compact, we conclude that F is compact.
           Because (M , g) is time-orientable, there exists a nonvanishing timelike vector field ξ on
      M . The field lines of ξ form a congruence of timelike curves C , each of them intersecting Σ in
      a single point, for Σ is a Cauchy surface. We may then define a map Φ : F → Σ, p 7→ Cp ∩ Σ,
      where Cp is the unique timelike curve of the C congruence through p (cf. Fig. 18.2). The
      map Φ is clearly continuous. Moreover, it is injective given that F is achronal (if a point
      q ∈ Σ was the image of two distinct points p and p′ of F , this would mean that p and p′ are
      connected by a timelike curve, namely a segment of Cp ). Now, there cannot be any injective
      continuous map from a compact manifold (without boundary) (here F ) into a non-compact
      manifold of the same dimension (here Σ). For instance, there is no injective continuous map
      from the sphere S2 into the plane R2 . To show this impossibility in the present case, and
      hence to reach the sought contradiction, let us invoke a standard topology theorem (see e.g.
      Lemma 4.50 in Lee’s textbook [342]), according to which an injective continuous map from a
      compact space to a Hausdorff space is a topological embedding, i.e. a homeomorphism onto its
      image. This theorem applies here since Σ is Hausdorff, being a manifold (cf. Sec. A.2.1). Hence
                          18.2 Trapped surfaces and singularity theorems                                      701
      Penrose’s singularity theorem are thus satisfied. Let us check its predictions on the two families L and
      L¯ of null geodesics emanating orthogonally from a trapped sphere S that is defined in terms of the
      conformal coordinates (η, χ, θ, φ) by (η, χ) = (η0 , χ0 ) = const, with π − η0 < χ0 < χs − (π − η0 ).
      This constraint implies that both L and L¯ stay in the interior of the collapsing star until they reach the
      curvature singularity η = π (cf. the conformal diagram in Fig. 14.2 left). Let us check that this happens
      for a finite value of their affine parameters. The equation of these geodesics is easily deduced from the
      conformal form (14.45) of the metric tensor: we get η − η0 = ±(χ − χ0 ) with ± = + for L and ± = −
      for L¯. Moreover, η is an affine parameter along                  ¯
                                                         both L and L with respect to the conformal metric
      h := −dη + dχ + sin χ dθ + sin θ dφ . According to Eq. (4.13) any affine parameter λ of L or
                  2      2       2       2     2      2
      L¯ with respect to the physical metric g is related to the h-affine parameter η by dλ/dη = aΩ2 , where
      a is a positive constant and the conformal factor Ω is read on Eq. (14.45): Ω2 = (a0 /2)2 (1 + cos η)2 .
      The integration leads to
                                           λ = α (6η + 8 sin η + sin 2η) + β,
      where α and β are two constants such that α > 0. We have then limη→π λ = 6απ + β, which is
      obviously finite. We conclude that both L and L¯ are future-incomplete.
      Example 8 (Vaidya collapse): Let us consider the collapse of a radiation shell as studied in Sec. 15.3,
      assuming that the shell is thin (large radiation density): α := 2m/v0 > 1/8 [cf. Eqs. (15.18) and (15.23)].
      As it can be inferred from the Carter-Penrose diagram of Fig. 15.7, a hypersurface t = const in the black
      hole exterior is a non-compact Cauchy surface (for instance the hypersurface t = −3m in the α = 2/3
      model considered in Fig. 15.1). Besides, we have already noticed in Sec. 15.2.1 that the null energy
      condition is fulfilled, for M (v) is an increasing function (cf. Fig. 15.2). Finally, as detailed in Example 3
      (p. 696), trapped surfaces forms in the late stages of the collapse. Penrose’s singularity theorem thus
      applies and there should be an incomplete inextendible null geodesic emerging from each trapped sphere
      S described in Example 3. It is easy to show that this is indeed the case for all ingoing null geodesics3
      L¯ = L(v,θ,φ)
                in   , i.e. geodesics emerging from S along the vector k normal to the null hypersurfaces
      v = const (cf. Property 15.1). As it is clear on Fig. 15.7, all the geodesics L(v,θ,φ) in     terminate at the
      curvature singularity r = 0, v > 0. Since −r is an affine parameter along them (Property 15.4), we
      conclude that they are future-incomplete.
      Example 9 (Bardeen regular black hole as a counter-example): There exist solutions of the Einstein
      equation (with some rather exotic matter sources) corresponding to a black hole without any curvature
      singularity. They are called regular black holes (see e.g. the recent book [31]). A well known example
      (historically the first one) is the Bardeen black hole [35] (see also Sec. V of Ref. [69]). The underlying
      spacetime (M , g) is static and spherically symmetric; in some part of it, the metric is expressed in terms
      of the Schwarzschild-Droste-like coordinates (t, r, θ, φ) by
                                                        2M (r) −1 2
                                                            
                                  2M (r)      2
                                                                     dr + r2 dθ2 + sin2 θ dφ2 ,             (18.4)
                                                                                                   
                   g =− 1−                  dt + 1 −
                                      r                   r
      where
                                                               r3
                                               M (r) := m               ,                               (18.5)
                                                          (r2 + ℓ2 )3/2
                                                                √
      m and ℓ being two positive constants obeying ℓ < 4m/(3 3). Would M (r) be a constant function, one
      would get the Schwarzschild metric (6.14). Actually the Schwarzschild metric is recovered asymptotically,
         3
          All outgoing null geodesics from S are also incomplete, but this is harder to show for there is no simple
      expression of an affine parameter along them.
                           18.2 Trapped surfaces and singularity theorems                                        703
since M (r) → m for r → +∞. For small values of r, one has instead
                                                          m 3
                                              M (r) ∼        r .                                        (18.6)
                                                    r→0   ℓ3
This implies that g is regular at r = 0, contrary to the Schwarzschild metric. More precisely, setting
Λ := 6m/ℓ3 , we get
                                     Λ 2 −1 2
                                     
                       Λ 2     2
                                           dr + r2 dθ2 + sin2 θ dφ2 .                                   (18.7)
                                                                   
               g ∼ − 1− r    dt + 1 − r
                r→0    3             3
We recognize in the right-hand side the metric of de Sitter spacetime of cosmological constant Λ
expressed in the so-called static coordinates (see e.g. Sec. 4.3 of Ref. [243]). On the other side, the metric
components (18.4) are singular at values of r for which 2M (r) = r. This equation admits two solutions:
r = r± with 0 < r− < r+ . By introducing coordinates (t̃, r, θ, φ) of Eddington-Finkelstein type, one
can show that g is regular at r = r± . The hypersurfaces r = r± turn out to be two Killing horizons; the
outermost one (r = r+ ) is the black hole event horizon, while the innermost one is a Cauchy horizon in
the maximally extended spacetime (see Fig. 1 of Ref. [15]). The Bardeen metric g is a solution of the
Einstein equation (1.40) sourced by a magnetic monopole in some nonlinear electrodynamics [29]. By
nonlinear electrodynamics, it is meant a theory involving an electromagnetic 2-form F and for which
the Lagrangian density L is a nonlinear function of the electromagnetic invariant F := Fµν F µν /4, the
standard Maxwell theory corresponding to the linear case: L = −F/µ0 . Specifically, for the Bardeen
black hole, the nonlinear electrodynamics is defined by [29]
                                                          √        !5/2
                                         3m               2ℓ2 F
                                    L=−                   √                                             (18.8)
                                        µ0 |ℓ|ℓ2       1 + 2ℓ2 F
      of spacetime. It cannot be checked by any local physical experiment. As the Bardeen black hole
      shows (Example 9), one cannot get rid of this hypothesis without replacing it by other ones. It
      turns out that it is possible find more local hypotheses: a stronger convergence condition and
      the non-existence of closed timelike curve. More precisely, we have:
              where R is g’s Ricci tensor — for general relativity with a cosmological constant Λ,
              such a condition is equivalent to the strong energy condition:
                                          
                             1          Λ
               T (v, v) −          T−        v · v ≥ 0 for any timelike or null vector v, (18.11)
                          n−2           4π
           3. the following generic condition holds: every causal geodesic contains at least one
              point at which the (nonzero) tangent vectors v fulfill
                                                        β]
                                               v [α R              µ ν
                                                         µν[γ vδ] v v    ̸= 0,                 (18.12)
      The proof is quite long and we refer to HE [266], p. 267, where it is given for n = 4, the
      generalization to any dimension being straightforward (see also Sec. 5 of Ref. [451] and Sec. 5.1.4
      of Ref. [454]). Here, we shall limit ourselves to a few comments:
         • The non-existence of closed timelike curves is a much weaker condition than the existence
                            18.2 Trapped surfaces and singularity theorems                                              705
       of a Cauchy surface required in Penrose’s theorem (Property 18.1), the latter condition
       implying the former one.
• On the other side, the causal convergence condition (18.10) is stronger than the null
  convergence condition (2.94) involved in Penrose’s theorem, for it has to hold for timelike
  vectors, in addition to null ones.
• For general relativity, the causal convergence condition (18.10) is equivalent to the strong
  energy condition (18.11) thanks to the Einstein equation (1.42). For a perfect fluid of
  proper energy density ρ and pressure p, the energy-momentum is T = (ρ + p)u ⊗ u + pg
  and the strong energy condition (18.11) is equivalent to
                                                            Λ
                                          (n − 3)ρ + (n − 1)p −≥ 0.                      (18.13)
                                                            4π
      Note that, even for Λ = 0, the strong energy condition does not imply the weak energy
      condition introduced in Sec. 2.4.2 [Eq. (2.96)], contrary to what the names may suggest.
      The only logical connection between the two conditions is that both imply the null energy
      condition (2.95).
• For timelike geodesics, the generic condition (18.12) is equivalent to5
                                                    Rαµνβ v µ v ν ̸= 0.                                      (18.14)
      In view of the geodesic deviation equation (B.49), this implies that the relative acceleration
      in any timelike geodesic congruence cannot vanish identically along the geodesics. In
      physical terms, this means that tidal forces are necessarily felt sometime along the
      worldline of any inertial observer. Apart from flat (Minkowski) spacetime, this condition
      is violated only in peculiar cases, involving a high degree of symmetry, hence the qualifier
      generic. An example is the Einstein static universe considered in Sec. 4.2.3 [Eq. (4.18)]: it
      can be shown [451] that the curves (χ, θ, φ) = const are timelike geodesics that have
      Rαµνβ v µ v ν = 0 all along them.
• Hypothesis 4 of the Hawking-Penrose theorem is more general than the requirement of
  a trapped surface in Penrose’s theorem (Property 18.1), since the trapped surface can be
  replaced by a light cone that reconverges in the future (case (b)) or a spacelike compact
  hypersurface (case (c)). The latter possibility arises in spatially closed cosmological
  models, such as the de Sitter spacetime (cf. Example 3 on p. 103), which admits spacelike
  hypersurfaces of S3 topology. This does not occur in Minkowski spacetime: embedded
  hypersurfaces with S3 topology are of course possible (the unit sphere of R4 !), but they
  cannot be everywhere spacelike.
• The Hawking-Penrose theorem does not specify whether the incomplete geodesic is
  timelike or null and whether it is future-incomplete or past-incomplete; it is therefore
  less precise than Penrose’s singularity theorem, which tells that the incomplete geodesic
  is null and future-incomplete. It can be argued however that in the case (4a) (trapped
  surface), the geodesic is future-incomplete (cf. Ref. [451], p. 792).
5
    To see it, it suffices to contract (18.12) by vα v δ and use the symmetries of the Riemann tensor (cf. Sec. A.5).
706                        The quasi-local approach: trapping horizons
      Remark 2: Hypothesis (4c) (existence of a spacelike compact hypersurface) is that stated in the original
      article by Hawking & Penrose [267]. It can be generalized to the existence of an achronal compact set
      without edge (cf. Theorem 2 on p. 266 of HE [266] and Theorem 5.6 of Ref. [451]).
      Example 10 (strong energy condition): Let us check that the strong energy condition (18.11) holds for
      the classical spacetimes involved in Examples 6 (Schwarzschild), 7 (Oppenheimer-Snyder) and 8 (Vaidya).
      For the Schwarzschild black hole, this is trivial since T = 0 and Λ = 0. For the Oppenheimer-Snyder
      collapse, still Λ = 0 and we note that the matter is a perfect fluid with ρ ≥ 0 and p = 0, so that the fluid
      strong energy condition (18.13) is fulfilled. For the Vaidya collapse, the energy-momentum tensor T is
      given by Eq. (15.5), which yields T (v, v) = M ′ (v)(k · v)2 /(4πr2 ) and T = 0. Given that M ′ (v) ≥ 0
      and Λ = 0, it is clear that (18.11) is satisfied.
      Remark 3: The scope of the Hawking-Penrose theorem goes beyond gravitational collapse and black
      hole physics. It encompasses cosmology and can be used to predict the Big-Bang singularity, via
      the hypothesis (4b) with the point p being our current position in the Universe and considering the
      past-directed null goedesics (our past light-cone) (see Sec. 10.1 of HE [266] for details).
      Remark 4: Both Penrose and Hawking-Penrose theorems have been generalized to n-dimensional
      spacetimes containing a “trapped” submanifold of any codimension (the codimension is 2 in Penrose’s
      theorem 18.1 and 2, n and 1 for respectively cases (4a), (4b) and (4c) of Hawking-Penrose theorem 18.2)
      by Gregory Galloway and José Senovilla in 2010 [211]. In this work, the concept of trapped surface, a
      priori defined for codimension 2 only (cf. Sec. 18.2.1), is generalized to any codimension by demanding
      that the mean curvature vector of the submanifold is everywhere future-directed timelike; moreover,
      the null and causal convergence conditions are generalized to inequalities involving the Riemann tensor.
      Historical note : The singularity theorem established by Roger Penrose in 1965 [405] (Property 18.1)
      was a major breakthrough in general relativity and mathematical physics. It introduced a brand new
      concept, that of trapped surfaces, which revealed to be extremely fruitful in relativistic gravity. It also
      brought in the characterization of spacetime singularities by geodesic incompleteness. This novel view
      point on singularities was subsequently advocated by Robert Geroch in 1968 [216] and Stephen Hawking
      and George Ellis in their 1973 textbook [266]; it is nowadays the standard one. From an astrophysical
      perspective, Penrose’s theorem showed that once a gravitational collapse has reached a certain stage,
      characterized by the appearance of trapped surfaces, then the formation of a singularity is inevitable,
      whatever the geometry of the collapse and the stiffness of the matter’s equation of state may be (provided
      the very mild null energy condition is obeyed). Previously, it could have been argued that the central
      singularity obtained in exact solutions, like that of the Oppenheimer-Snyder collapse (Chap. 14), is an
      artifact of spherical symmetry or the assumption of zero pressure, so that e.g. rotation or high pressures
      could prevent the singularity from appearing. For this achievement, Roger Penrose was awarded the
      Nobel Prize in Physics in 2020. More details about the history of Penrose’s theorem and its impact can be
                                        18.3 Trapping horizons                                            707
found in the review articles [452, 453, 454, 335]. As for the theorem proven in 1970 by Stephen Hawking
and Roger Penrose [267] (Property 18.2), it is still considered today as the best singularity theorem.
Appendices
Appendix A
Contents
        A.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 711
        A.2 Differentiable manifolds . . . . . . . . . . . . . . . . . . . . . . . . . . 711
        A.3 Pseudo-Riemannian manifolds . . . . . . . . . . . . . . . . . . . . . . 721
        A.4 The three basic derivatives . . . . . . . . . . . . . . . . . . . . . . . . . 726
        A.5 Curvature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 734
A.1      Introduction
We recall in this appendix basic definitions and results of differential geometry that are used in
the main text. The reader who has some knowledge of general relativity should be familiar
with most of them. It should be clear that this appendix is not intended to be a monograph
on differential geometry. In particular, contrary to the other parts of these notes, we state
many results without proofs, referring the reader to classical textbooks on the topic [331, 343,
344, 390, 59, 110, 189], as well as to the differential geometry sections of the general relativity
textbooks [108, 464, 499].
   1. M is a separated space (also called Hausdorff space): any two distinct points of M
      admit disjoint open neighborhoods.
712                                   Basic differential geometry
Figure A.1: Locally a manifold resembles Rn (n = 2 on the figure), but not necessarily at the global level.
         2. M has a countable base 1 : there exists a countable family (Uk )k∈N of open sets of M
            such that any open set of M can be written as the union (possibly infinite) of some
            members of the above family.
      Property 1 excludes manifolds with “forks” and is very reasonable from a physical point of
      view: it allows to distinguish between two points even after a small perturbation. Property 2
      excludes “too large” manifolds; in particular it permits setting up the theory of integration on
      manifolds. It also allows for a smooth manifold of dimension n to be embedded smoothly into
      the Euclidean space R2n (Whitney theorem). Property 3 expresses the essence of a manifold:
      it means that, locally, one can label the points of M in a continuous way by n real numbers
      (xα )α∈{0,...,n−1} , which are called coordinates (cf. Fig. A.1). More precisely, given an open
      subset U ⊂ M , a coordinate system or chart on U is a homeomorphism2
                                      Φ : U ⊂ M −→ V ⊂ Rn
                                                                                                              (A.1)
                                                 p       7−→ (x0 , . . . , xn−1 ),
      Remark 2: Strictly speaking the definition given above is that of a topological manifold. We are
      saying manifold for short.
         1
          In the language of topology, one says that M is a second-countable space.
         2
          Let us recall that a homeomorphism between two topological spaces (here U and V ) is a bijective map Φ
      such that both Φ and Φ−1 are continuous.
                                     A.2 Differentiable manifolds                                          713
    Usually, one needs more than one coordinate system to cover M . An atlas on M is a set
of pairs (Ui , Φi )i∈I , where I is a set (not necessarily finite), Ui an open set of M and Φi a chart
on Ui , such that the union of all Ui covers M :
                                            [
                                                  Ui = M .                                        (A.2)
                                            i∈I
    The above definition of a manifold lies at the topological level (cf. Remark 2), meaning that
one has the notion of continuity, but not of differentiability. To get the latter, one should rely
on the smooth structure of Rn , via the atlases: a smooth manifold, is a manifold M equipped
with an atlas (Ui , Φi )i∈I such that for any non-empty intersection Ui ∩ Uj , the map
                           j : Φj (Ui ∩ Uj ) ⊂ R −→ Φi (Ui ∩ Uj ) ⊂ R
                     Φi ◦ Φ−1                                                                     (A.3)
                                                n                    n
is smooth (i.e. C ∞ ). Note that the above map is from an open set of Rn to an open set of Rn , so
that the invoked C ∞ differentiability is nothing but that of Rn . Such a map is called a change
of coordinates or, in the mathematical literature, a transition map. The atlas (Ui , Φi )i∈I is
called a smooth atlas. In the following, we consider only smooth manifolds.
Remark 3: Strictly speaking a smooth manifold is a pair (M , A) where A is a (maximal) smooth atlas
on M . Indeed, a given (topological) manifold M can have non-equivalent differentiable structures, as
shown by Milnor (1956) [366] for S7 , the unit sphere of dimension 7: there exist smooth manifolds, the
so-called exotic spheres, that are homeomorphic to S7 but not diffeomorphic to S7 . On the other side,
for n ≤ 6, there is a unique smooth structure for the sphere Sn . Moreover, any manifold of dimension
n ≤ 3 admits a unique smooth structure. Amazingly, in the case of Rn , there exists a unique smooth
structure (the standard one) for any n ̸= 4, but for n = 4 (the standard spacetime case!) there exist
uncountably many non-equivalent smooth structures, the so-called exotic R4 [474].
Remark 4: When discussing the no-hair theorem in Chap. 5, one refers to the concept of analytic
manifold, which is a special case of smooth manifold. Indeed, an analytic manifold is defined as a
manifold equipped with an atlas for which all the changes of coordinates Φi ◦ Φ−1   j are real analytic
functions. Let us recall that an analytic function is a C ∞ function f for which the Taylor series about
any point x in its domain converges to f in some neighborhood of x.
                                     Hn := (x1 , . . . , xn ) ∈ Rn , xn ≥ 0 .            (A.4)
                                            
      This definition is independent from the choice of Φ (cf. Theorem 1.37 of Ref. [343]). The set of
      all boundary points of M is naturally called the boundary of M and is denoted by ∂M .
      Remark 5: The boundary ∂M should not be confused with the topological boundary of M , i.e. the
      boundary of M as a topological space, which is the closure of M minus the interior of M ; since both
      sets coincide with M , the topological boundary of M is obviously ∅.
          A smooth manifold with boundary is a manifold with boundary endowed with a smooth
      atlas, with the understanding that a transition map
                                    j : Φj (Ui ∩ Uj ) ⊂ H −→ Φi (Ui ∩ Uj ) ⊂ H
                              Φi ◦ Φ−1                   n                    n
      is said to be smooth iff it can be extended around each point of its domain (including the points
      of ∂Hn ) into a smooth map from an open subset of Rn to Rn .
                                            P : I −→ M
                                                                                                                   (A.6)
                                                   λ 7−→ p = P (λ) ∈ L .
      Remark 6: In the literature, a curve is often defined as a map P : I → M and not as the image of P .
      According to that definition, different parametrizations give birth to different curves.
         A scalar field on M is a function f : M → R. Unless specified, we shall always consider
      smooth scalar fields, i.e. smooth maps as defined in Sec. A.2.1. At a point p = P (λ) ∈ L , the
         3
          By open subset of Hn , it is meant a set A ⊂ Hn that is open with respect to the topology of Hn ; A is then not
      necessarily open when considered as a subset of Rn (for instance A = Hn ).
                                        A.2 Differentiable manifolds                                     715
tangent vector to L associated with the parametrization P is the operator v which maps
every scalar field f defined around p to the real number
                                   df             1
                       v(f ) :=             = lim   [f (P (λ + ε)) − f (P (λ))] .                (A.8)
                                   dλ   L
                                              ε→0 ε
Given a coordinate system (xα ) around some point p ∈ M , there are n curves Lα through
p associated with (xα ) and called the coordinate lines: for each α ∈ {0, . . . , n − 1}, Lα is
defined as the curve through p parameterized by λ = xα and having constant coordinates xβ
for all β ̸= α. The tangent vector to Lα parameterized by xα is denoted ∂α . From the definition
(A.8), its action on a scalar field f is
                                              df             df
                                  ∂α (f ) =              =                     .
                                              dxα   Lα       dxα   xβ =const
                                                                     β̸=α
v = v α ∂α . (A.12)
Since every tangent vector to a curve at p is expressible as (A.12), we conclude that the set of
all tangent vectors to a curve at p is a vector space of dimension n and that (∂α ) constitutes a
basis of it. This vector space is called the tangent space to M at p and is denoted Tp M . The
elements of Tp M are simply called vectors at p. The basis (∂α ) is called the natural basis
associated with the coordinates (xα ) and the coefficients v α in (A.12) are called the components
of the vector v with respect to the coordinates (xα ). The tangent vector space is represented
at two different points in Fig. A.2.
     Contrary to what happens for an affine space, one cannot define a vector connecting two
distinct points p and q on a generic manifold, except if p and q are infinitesimally close to each
716                                    Basic differential geometry
      Figure A.2: The vectors at two points p and q on the manifold M belong to two different vector spaces: the
      tangent spaces Tp M and Tq M .
      other. Indeed, in the latter case, one defines the infinitesimal displacement vector from p to
      q as the vector dx ∈ Tp M whose action on a scalar field f is
      Since p and q are infinitesimally close, there is a unique (piece of) curve L going from p to q
      and one has
dx = v dλ , (A.14)
      where λ is a parameter along L , v the associated tangent vector at p and dλ the parameter
      increment from p to q: p = P (λ) and q = P (λ + dλ). The relation (A.14) follows immediately
      from the definitions (A.8) and (A.13) for respectively v and dx. Given a coordinate system,
      let (xα ) be the coordinates of p and (xα + dxα ) those of q. Then from Eqs. (A.13) and (A.9),
      dx(f ) = df = ∂f /∂xα dxα = dxα ∂α (f ). The scalar field f being arbitrary, we conclude that
dx = dxα ∂α . (A.15)
      In other words, the components of the infinitesimal displacement vector dx with respect to
      the coordinates (xα ) are nothing but the infinitesimal coordinate increments dxα .
                                       A.2 Differentiable manifolds                                             717
                                       ω : Tp M −→ R
                                                                                                      (A.16)
                                                v     7−→ ⟨ω, v⟩
that is linear: ⟨ω, λv + u⟩ = λ⟨ω, v⟩ + ⟨ω, u⟩ for all u, v ∈ Tp M and λ ∈ R. The set of all
linear forms at p constitutes a n-dimensional vector space, which is called the dual space
of Tp M and denoted by Tp∗ M . Given the natural basis (∂α ) of Tp M associated with some
coordinates (xα ), there is a unique basis of Tp∗ M , denoted by (dxα ), such that
⟨dxα , ∂β ⟩ = δ αβ , (A.17)
Remark 7: Do not confuse the linear form dxα with the infinitesimal increment dxα of the coordinate
xα .
   The dual basis can be used to expand any linear form ω, thereby defining its components
ωα with respect to the coordinates (xα ):
ω = ωα dxα . (A.19)
In terms of components, the action of a linear form on a vector takes then a very simple form:
⟨ω, v⟩ = ωα v α . (A.20)
      Since df = ∂f /∂xα dxα , Eq. (A.18) implies that the components of the differential with respect
      to the dual basis are nothing but the partial derivatives of f with respect to the coordinates
      (xα ) :
                                                     ∂f
                                             df =       α
                                                          dxα .                                 (A.22)
                                                    ∂x
      Remark 8: In non-relativistic physics, the concept of gradient of a scalar field is commonly used instead
      of the differential, the former being a vector field and the latter a 1-form. This is so because one associates
      implicitly a vector →−ω to any 1-form ω via the Euclidean scalar product of R3 : ∀→ −
                                                                                          v ∈ R3 , ⟨ω, →v⟩=→
                                                                                                        −      −
                                                                                                               ω ·→
                                                                                                                  −v.
                                                              →
                                                              −
      Accordingly, formula (A.21) is rewritten as df = ∇f · dx. But we should keep in mind that, at the
      fundamental level, the key quantity is the differential 1-form ∇f = df , since Eq. (A.21) does not require
                                                                                            →
                                                                                            −
      any metric on the manifold M to be meaningful. On the contrary, the gradient ∇f is a derived quantity,
      obtained from the differential ∇f by metric duality.
      Remark 9: For a fixed value of α, the coordinate xα can be considered as a scalar field on M . If we
      apply (A.22) to f = xα , we then get dxα = dxα . Hence the dual basis to the natural basis (∂α ) is
      formed by the differentials of the coordinates. This justifies the notation dxα used for its elements.
         By combining Eqs. (A.22), (A.12) and (A.17), it is easy to see that the 1-form df acting on a
      vector v is nothing but v acting on the scalar field f :
          Natural bases are of course not the only bases in the vector space Tp M . One may use a
      basis (eα ) that is not related to any coordinate system on M , for instance an orthonormal
      basis with respect to some metric. There exists then a unique basis (eα ) of the dual space Tp∗ M
      such that5
                                               ⟨eα , eβ ⟩ = δ αβ .                                (A.24)
      (eα ) is called the dual basis to (eα ). The relation (A.17) is a special case of (A.24), for which
      eα = ∂α and eα = dxα .
      A.2.5      Tensors
      Tensors are generalizations of both vectors and linear forms. Let (k, ℓ) ∈ N2 with (k, ℓ) ̸= (0, 0).
      At a point p ∈ M , a tensor of type (k, ℓ), also called tensor k times contravariant and ℓ
      times covariant, is a map
         T : Tp∗ M × · · · × Tp∗ M × Tp M × · · · × Tp M −→                                   R
                                                                                                                    (A.25)
             |       {z           } |       {z          }
                        k times                          ℓ times
      that is linear with respect to each of its arguments. The integer k +ℓ is called the tensor valence,
      or sometimes the tensor rank or order. Let us recall the canonical duality Tp∗∗ M = Tp M ,
         5
           Notice that, according to the standard usage, the symbol for the vector eα and that for the linear form eα
      differ only by the position of the index α.
                                      A.2 Differentiable manifolds                                         719
which means that every vector v can be considered as a linear form on the space Tp∗ M , via
Tp∗ M → R, ω 7→ ⟨ω, v⟩. Accordingly a vector is a tensor of type (1, 0). A linear form is a
tensor of type (0, 1). A tensor of type (0, 2) is called a bilinear form. It maps pairs of vectors
to real numbers, in a linear way for each vector.
    Given a basis (eα ) of Tp M and the corresponding dual basis (eα ) in Tp∗ M , any tensor T
of type (k, ℓ) can be expanded as
where the tensor product eα1 ⊗ . . . ⊗ eαk ⊗ eβ1 ⊗ . . . ⊗ eβℓ is the tensor of type (k, ℓ) for
which the image of (ω1 , . . . , ωk , v1 , . . . , vℓ ) as in (A.25) is the real number
                                    k
                                    Y               ℓ
                                                    Y
                                      ⟨ωi , eαi ⟩ ×   ⟨eβj , vj ⟩.
                                    i=1                j=1
Notice that all the products in the above formula are simply products in R. The nk+ℓ real
coefficients T α1 ...αk β1 ...βℓ in (A.26) are called the components of the tensor T with respect to
the basis (eα ). These components are unique and fully characterize the tensor T .
Remark 10: The notations v α and ωα already introduced for the components of a vector v [Eq. (A.12)]
or a linear form ω [Eq. (A.19)] are of course the particular cases (k, ℓ) = (1, 0) or (k, ℓ) = (0, 1) of
(A.26), with, in addition, eα = ∂α and eα = dxα .
With respect to a coordinate system (xα ), it is not difficult, via (A.12), to see that the components
of the commutator are
                                                  ∂v α       ∂uα
                                  [u, v]α = uµ µ − v µ µ .                                       (A.28)
                                                  ∂x          ∂x
720                                Basic differential geometry
      where dy is the infinitesimal displacement vector of TΦ(p) N connecting Φ(p) to Φ(q) (cf.
      Fig. ??). If (xα ) is a coordinate chart of M around p and (y β ) a chart of N around Φ(p), such
      that Φ is expressed as y β = Y β (xα ), it follows from Eqs. (A.15) and (A.30) that the matrix
      of the linear map dΦ|p with respect to the bases (∂α ) of Tp M and (∂β ) of TΦ(p) N is the
      Jacobian matrix (∂Y β /∂xα ). Using the characterization of vectors by their action on scalar
      fields [Eq. (A.8)], it is then easy to see that
of L separated from p by the infinitesimal parameter increase dλ. One has then v = dx/dλ,
where dx is the infinitesimal vector connecting p to q. Thanks to the linearity of dΦ|p and
the defining relation (A.30), one has Φ∗ v = dy/dλ, i.e. Φ∗ v is the finite vector obtained by
rescaling the infinitesimal vector connecting Φ(p) to Φ(q) by dλ. Note that Φ∗ v is a vector
tangent to the curve Φ(L ) ⊂ N : it is precisely the tangent vector to Φ(L ) at Φ(p) associated
with the parametrization of Φ(L ) by λ.
    Let now T be a fully covariant tensor field on N , i.e. a tensor field of type (0, ℓ) on N for
some ℓ ≥ 0. The pullback of T on M by Φ is the tensor field of type (0, ℓ) on M denoted by
Φ∗ T and defined by
For a scalar field f : N → R, the above definition with ℓ = 0 reduces to Φ∗ f |p := f |Φ(p) , i.e.
to Φ∗ f (p) := f (Φ(p)), so that the pullback is nothing but the map composition: Φ∗ f = f ◦ Φ.
Remark 12: Via the pushforward operation, a smooth map Φ : M → N naturally carries tangent
vectors from M to N (essentially because it carries curves in M to curves in N , as discussed above),
but it carries covariant tensors, among which linear forms, in the reverse way, i.e. from N to M
(pullback operation). Another difference is that the pushforward is a pointwise operation and does not
extend a priori to vector fields (cf. Fig. ??), while the pullback is well defined for any covariant tensor
field. A case where the pushforward extends to vector fields though is when Φ is a diffeomorphism.
                            g|p : Tp M × Tp M −→ R
                                                                                                          (A.34)
                                        (u, v)         7−→ g|p (u, v) =: g(u, v)
          In a given basis (eα ) of Tp M , the components of g is the matrix (gαβ ) defined by formula
      (A.26) with (k, ℓ) = (0, 2):
                                               g = gαβ eα ⊗ eβ .                                 (A.36)
      For any pair (u, v) of vectors we have then g(u, v) = gαβ uα v β . In particular, considering the
      natural basis associated with some coordinate system (xα ), the scalar square of an infinitesimal
      displacement vector dx = dxα ∂α [cf. Eqs. (A.13) and (A.15)] is
      This formula is called the line element expression on the pseudo-Riemannian manifold (M , g).
      It is often used to define the metric tensor in general relativity texts. Note that contrary to
      what the notation may suggest, ds2 can be negative.
           For the dual basis associated with the coordinates (xα ), one has eα = dxα (cf. Sec. A.2.4),
      so that Eq. (A.36) can be rewritten as
      One can recast this relation in a form which reminds the line element (A.37) by introducing
      the symmetric product notation (cf. e.g. Refs. [344] or [464]):
                        1
          dxα dxβ :=      dxα ⊗ dxβ + dxβ ⊗ dxα               and       (dxα )2 := dxα ⊗ dxα .     (A.39)
                                                
                        2
      Formula (A.38) then becomes
                                              g = gαβ dxα dxβ .                                    (A.40)
      Applying this relation to the pair of infinitesimal vectors (dx, dx), one gets the line element
      (A.37), by virtue of the identity ⟨dxα , dx⟩ = dxα [Eq. (A.18)].
which g(v, v) > 0 are called spacelike, and those for which g(v, v) = 0 are called null. The
subset of Tp M formed by all null vectors is termed the null cone of g at p.
    A coordinate xα of a coordinate system (x0 , . . . , xn−1 ) is said to be a timelike coordinate
(resp. spacelike coordinate or null coordinate) iff the hypersurfaces defined by xα = const
are spacelike6 (resp. timelike or null).
Remark 1: Being timelike, spacelike or null is a property of the coordinate xα per se (i.e. considering
xα as a scalar field U ⊂ M → R); on the contrary the causal type of the coordinate vector ∂α depends
on the coordinate system (x0 , . . . , xn−1 ) to which xα belongs. More precisely, whatever the causal type
of xα , the vector ∂α can be made spacelike, timelike or null by a proper choice of the complementary
coordinates (xβ )β̸=α .
   A basis (eα ) of Tp M is called a g-orthonormal basis (or simply orthonormal basis if
there is no ambiguity on the metric) iff7
The metric g induces an isomorphism between Tp M (vectors) and Tp∗ M (linear forms) which,
in index notation, corresponds to the lowering or raising of the index by contraction with gαβ
or g αβ . In the present book, an index-free symbol will always denote a tensor with a fixed
covariance type (such as a vector, a 1-form, a bilinear form, etc.). We will therefore use a
different symbol to denote its image under the metric isomorphism. In particular, we denote
by an underbar the isomorphism Tp M → Tp∗ M and by an arrow the reverse isomorphism
Tp∗ M → Tp M :
1. For any vector u in Tp M , u stands for the unique linear form such that
         However, we will omit the underbar on the components of u, since the position of the
         index allows us to distinguish between vectors and linear forms, following the standard
         usage: if the components of u in a given basis (eα ) are denoted by uα , the components
         of u in the dual basis (eα ) are then denoted by uα and are given by
                                                         uα = gαµ uµ .                              (A.45)
   6
       Cf. Sec. 2.2.2 for the definition of spacelike, timelike and null hypersurfaces.
   7
       No summation on α is implied.
724                                         Basic differential geometry
                                                  ∀v ∈ Tp M ,       g(→
                                                                      −
                                                                      ω , v) = ⟨ω, v⟩.               (A.46)
ω α = g αµ ωµ . (A.47)
         3. We extend the arrow notation to bilinear forms on Tp M (type-(0, 2) tensor): for any
                                           →
                                           −
            bilinear form T , we denote by T the tensor of type (1, 1) such that
                                                                            →
                                                                            −              →
                                                                                           −
                                ∀(u, v) ∈ Tp M × Tp M ,          T (u, v) = T (u, v) = u · T (v),    (A.48)
                         ↠
               and by T the tensor of type (2, 0) defined by
                                                                                       ↠
                                        ∀(u, v) ∈ Tp M × Tp M ,          T (u, v) =T (u, v).         (A.49)
                                                                              →
                                                                              −
               Note that in the second equality of (A.48), we have considered T as an endomorphism
               Tp M → Tp M , which is always possible for a tensor of type (1, 1). If Tαβ are the
                                                                        →
                                                                        −      ↠
               components of T in some basis (eα ), the components of T and T are respectively
                                                      →
                                                      −
                                                     ( T )α β = T αβ = g αµ Tµβ                     (A.50a)
                                                       ↠
                                                     (T )αβ = T αβ = g αµ g βν Tµν .                (A.50b)
      Remark 2: In mathematical literature, the isomorphism induced by g between Tp M and Tp∗ M is called
      the musical isomorphism, because a flat and a sharp symbols are used instead of, respectively, the
      underbar and the arrow introduced above:
                                                 u♭ = u       and       ω♯ = →
                                                                             −
                                                                             ω.
has ϵ(e0 , . . . , en−1 ) ̸= 0 and one says that the basis is right-handed (resp. left-handed) iff
ϵ(e0 , . . . , en−1 ) > 0 (resp. < 0). The components of ϵ with respect to (eα ) are
                                                                                                    (A.52)
                                                 p
                                   ϵα1 ... αn = ± |g| [α1 , . . . , αn ] ,
where ± must be + (resp. −) for a right-handed (resp. left-handed) basis, g stands for the
determinant of the matrix of g’s components with respect to the basis (eα ):
g := det(gαβ ) (A.53)
and the symbol [α1 , . . . , αn ] takes the value 0 if any two indices (α1 , . . . , αn ) are equal and
takes the value 1 (resp. −1) if (α1 , . . . , αn ) is an even (resp. odd) permutation of (0, . . . , n − 1).
p ∈ Σ ⇐⇒ f (p) = 0. (A.54)
Then, a vector field v on M is tangent to Σ iff the value of f stays to 0 for a small displacement
dλ along v; thanks to Eqs. (A.13), (A.14) and (A.12), this is equivalent to v(f ) = v µ ∂f /∂xµ = 0,
                                                                                   →
                                                                                   −
or to g(n, v) = 0, where we have let appear the gradient vector n := ∇f ; in terms of
components with respect to a coordinate system (xα ):
                                                             ∂f
                                        nα = ∇α f = g αµ         .                                  (A.55)
                                                             ∂xµ
The vector field n is called a normal to Σ. All normal vectors to Σ are collinear to each other.
    It follows from the definitions given in Sec. A.3.2 that for any coordinate xα of a given chart
(x , . . . , xn−1 ),
  0
                                    →
                                    −
          xα timelike coordinate ⇐⇒ ∇xα timelike vector ⇐⇒ g αα < 0,                               (A.56a)
                                  →
                                  −
          xα null coordinate ⇐⇒ ∇xα null vector ⇐⇒ g αα = 0,                                       (A.56b)
                                     →
                                     −
          xα spacelike coordinate ⇐⇒ ∇xα spacelike vector ⇐⇒ g αα > 0,                             (A.56c)
where g αα is the component (α, α) of the inverse metric (no summation on α). It appears here be-
                                  →
                                  −     →
                                        −
cause, according to Eq. (A.55), g(∇xα , ∇xα ) = gµν g µρ ∂xα /∂xρ g νσ ∂xα /∂xσ = δ ρν δ αρ g νσ δ ασ =
g αα .
726                                  Basic differential geometry
2. For any scalar field f and any pair (u, v) of vector fields:
∇f u v = f ∇u v. (A.58)
3. For any scalar field f and any pair (u, v) of vector fields, the following Leibniz rule holds:
∇u (f v) = ⟨df, u⟩ v + f ∇u v, (A.59)
   Given an affine connection, the variation of a vector field v between two neighboring
points, p and q say, is defined as
                                     dv := ∇dx v,                                  (A.60)
where dx is the infinitesimal displacement vector connecting p and q [cf Eq. (A.13)]. One says
that v is parallelly transported from p to q with respect to the connection ∇ iff dv = 0.
    Given a frame field (eα ) on M , the connection coefficients of an affine connection ∇ with
respect to (eα ) are the n3 scalar fields Γγ αβ defined by the expansion, at each point p ∈ M , of
the vector ∇eβ eα p onto the basis (eα |p ):
∇eβ eα =: Γµ αβ eµ . (A.61)
An affine connection is entirely defined by its connection coefficients in a given frame field. In
other words, there are as many affine connections on a manifold of dimension n as there are
possibilities of choosing n3 scalar fields Γγ αβ .
   Given v ∈ X(M ), the covariant derivative of v with respect to the affine connection
∇ is the tensor field ∇v of type (1, 1) defined by the following action at each point p ∈ M :
                                     ∇v|p : Tp∗ M × Tp M −→                                         R
                                                                                                                                        (A.62)
                                                       (ω, u)               7−→ ⟨ω, ∇ũ v|p ⟩,
where ũ is any vector field which performs some extension of u around p: ũ|p = u. As already
noted (cf. Remark 1), ∇ũ v|p is independent of the choice of ũ, so that the mapping (A.62) is
well-defined. By comparing with (A.25), we verify that ∇v|p is a tensor of type (1, 1).
     The covariant derivative is extended to all tensor fields by (i) demanding that for a scalar
field it coincides with the differential: ∇f := df and (ii) using the Leibniz rule. As a result,
the covariant derivative of a tensor field T of type (k, ℓ) is a tensor field ∇T of type (k, ℓ + 1).
Its components with respect a given frame field (eα ) are denoted
                                          ∇γ T α1 ...αk β1 ...βℓ := (∇T )α1 ...αk β1 ...βℓ γ                                            (A.63)
and are given by
                                                                                     k                              i
                                                                                                                    ↓
                                                                                                             α1 ... σ...αk
                         α1 ...αk                        α1 ...αk
                                                                                     X
                                                                                               αi
               ∇γ T                 β1 ...βℓ   = eγ (T              β1 ...βℓ )   +         Γ        σγ   T                   β1 ...βℓ
                                                                                     i=1
                                                       ℓ
                                                       X
                                                   −         Γσ   βi γ   T α1 ...αk β1 ... σ ...βℓ ,                                    (A.64)
                                                                                           ↑
                                                       i=1                                 i
where eγ (T   α1 ...αk
                        β1 ...βℓ ) stands for the action of the vector eγ on the scalar field T
                                                                                                α1 ...αk
                                                                                                         β1 ...βℓ
resulting from the very definition of a vector (cf. Sec. A.2.3). In particular, if (eα ) is a natural
frame associated with some coordinate system (xα ), then eα = ∂α and eγ (T α1 ...αk β1 ...βℓ ) =
∂T α1 ...αk β1 ...βℓ /∂xγ [cf. Eq. (A.9)].
Remark 2: Notice the position of the index γ in Eq. (A.63): it is the last one on the right-hand side.
According to (A.26), ∇T is then expressed as
                    ∇T = ∇γ T α1 ...αkβ1 ...βℓ eα1 ⊗ . . . ⊗ eαk ⊗ eβ1 ⊗ . . . ⊗ eβℓ ⊗ eγ .                                             (A.65)
728                                     Basic differential geometry
      Because eγ is the last 1-form of the tensorial product on the right-hand side, the notation T α1 ...αkβ1 ...βℓ ;γ
      instead of ∇γ T α1 ...αkβ1 ...βℓ would have been more appropriate. The index convention (A.65) agrees
      with that of MTW [371] [cf. their Eq. (10.17)].
          The covariant derivative of the tensor field T along a vector v is defined by
      The components of ∇v T are then v µ ∇µ T α1 ...αk β1 ...βℓ . Note that ∇v T is a tensor field of the
      same type as T . In the particular case of a scalar field f , one has ∇v f = ⟨∇f, v⟩ = v(f ).
          The divergence with respect to the affine connection ∇ of a tensor field T of type (k, ℓ)
      with k ≥ 1 is the tensor field denoted ∇ · T of type (k − 1, ℓ) and whose components with
      respect to any frame field are given by
                                             α1 ...αk−1                       α1 ...αk−1 µ
                                    (∇ · T )              β1 ...βℓ   = ∇µ T                  β1 ...βℓ .        (A.67)
Remark 3: For the divergence, the contraction is performed on the last upper index of T .
      Levi-Civita connection
      On a pseudo-Riemannian manifold (M , g) there is a unique affine connection ∇ such that
1. ∇ is torsion-free, i.e. for any scalar field f , ∇∇f is a field of symmetric bilinear forms:
∇α ∇β f = ∇β ∇α f. (A.68)
∇g = 0 . (A.69)
   For the Levi-Civita connection, the expression for the divergence of a vector takes a rather
simple form in a natural basis. Indeed, combining Eqs. (A.67) and (A.64), we get for v ∈ X(M ),
∇ · v = ∇µ v µ = ∂v µ /∂xµ + Γµ σµ v σ . Now, from (A.71), we have
                             1    ∂gµν 1 ∂             1    ∂ p
                      Γµ αµ = g µν α =       ln |g| = p        |g|,                         (A.72)
                             2    ∂x   2 ∂xα           |g| ∂xα
where g := det(gαβ ) [Eq. (A.53)]. The last but one equality follows from the general law of
variation of the determinant of any invertible matrix A:
where δ denotes any variation that fulfills the Leibniz rule (i.e. a derivation), tr stands for the
trace and × for the matrix product. We conclude that
                                       1 ∂ p          
                                ∇·v = p        |g| v µ
                                                         .                                  (A.74)
                                       |g| ∂xµ
where we have used the fact that Γα σµ is symmetric in (σ, µ), whereas Aσµ is antisymmetric.
Hence the simple formula for the divergence of an antisymmetric tensor field of type (2, 0):
                                         1 ∂ p          
                               ∇µ Aαµ = p        |g| Aαµ
                                                           .                                (A.75)
                                         |g| ∂xµ
                                                                                 up’
                                                    up
                                                                                           vq
                                                                  dl vq
                                    dl ℒu vp
                                                                                 q'= (p')
                                       q= (p)                                        
                                                                dvp
                                                                 *
                                             up                                up’
                                                              dl vp                             vp
                                                p                         p'
      Figure A.3: Geometrical construction of the Lie derivative of a vector field v along a vector field u: given an
      infinitesimal parameter dλ, each extremity of the arrow dλ v|p is dragged by some small parameter ε along u, to
      form the vector denoted by Φε∗ dλ v|p (cf. Sec. A.2.8). The latter is then compared with the actual value of dλ v
      at the point q, the difference (divided by εdλ) defining the Lie derivative Lu v at p.
      such that Φε (p) = q, where q is the point connected to p by the infinitesimal displacement
      vector →
             −
             pq = ε u|p (cf. Sec. A.2.3 and Fig. A.3). The natural reference vector to compare v|q
      with is then the pushforward of v|p by Φε , i.e. the vector Φε∗ v|p defined in Sec. A.2.8. Since
      v|q = v|Φε (p) and Φε∗ v|p belong to the same vector space Tq M , one may subtract the latter
      from the former and define the Lie derivative of v along u at p by
                                                     1                   
                                       Lu v|p := lim    v|Φε (p) − Φε∗ v|p .                                   (A.76)
                                                 ε→0 ε
      Remark 4: The term between parentheses in the right-hand side of Eq. (A.76) is the difference between
      two vectors of Tq M and thus a vector of Tq M . One gets a vector of Tp M in the left-hand side only
      because q = Φε (p) tends to p in the limit ε → 0. Some authors (e.g. [343, 499]) prefer to define the
      Lie derivative from the difference of vectors in Tp M , by carrying v|q to Tp M via the pushforward by
        ε = Φ−ε and comparing it with v|p ; this results in the formula
      Φ−1
                                                             1  −1                  
                                      Lu v|p := lim             (Φε )∗ v|Φε (p) − v|p .                         (A.77)
                                                         ε→0 ε
      Thanks to the limit ε → 0, formulas (A.76) and (A.77) are equivalent. Formula (A.76) is preferred here in
      so far as it corresponds to the geometrical construction displayed in Fig. A.3.
          Let (xα ) be a coordinate system adapted to the field u in the sense that the first vector of
      the natural basis (∂α ) associated with (xα ) is ∂0 = u. The coordinate expression of the flow
      map Φε is then (xα ) → (xα + εδ α0 ). The corresponding Jacobian matrix is the identity matrix
      so that the coordinate components of Φε∗ v|p are identical to those of v|p (cf. Sec. A.2.7). It
                                             A.4 The three basic derivatives                                   731
For a scalar field f , one has Φ∗ε f = f ◦ Φε (cf. Sec. A.2.8), so that the above definition with
ℓ = 0 yields Lu f = u(f ) = ⟨df, u⟩ = uµ ∂µ f [cf. Eq. (A.8)].
   Finally, the Lie derivative is extended to any tensor field by means of the Leibniz rule. As a
result, the Lie derivative Lu T of a tensor field T of type (k, ℓ) is a tensor field of the same
type, the components of which with respect to a given coordinate system (xα ) are
                                                                                      k       i
                    α1 ...αk                   ∂ α1 ...αk
                                               µ
                                                                              ↓
                                                                     X α1 ... σ...α k          ∂uαi
             Lu T              β1 ...βℓ   = u     T       β1 ...βℓ −     T            β1 ...βℓ
                                              ∂xµ                    i=1
                                                                                               ∂xσ
                                                  ℓ
                                                  X                                  ∂uσ
                                              +          T α1 ...αk β1 ... σ ...βℓ        .           (A.83)
                                                   i=1
                                                                           ↑         ∂xβi
                                                                           i
732                                             Basic differential geometry
      A special case of this formula is worth considering, namely T = g (the metric tensor). Since
      ∇µ gαβ = 0, and gσβ ∇α uσ = ∇α uβ (both thanks to Eq. (A.69)), one gets the so-called Killing
      expression of the Lie derivative of the metric tensor:
                                                       Lu gαβ = ∇α uβ + ∇β uα .                                                            (A.87)
                                                                                                                                           (A.88)
                                       A.4 The three basic derivatives                                            733
where in the first line vi (· · · ) stands for the action of the vector field vi on the scalar field
ω(v1 , . . . , vi−1 , vi+1 , . . . , vp+1 ) [cf. Eq. (A.8)]. For p = 0 (scalar field), Eq. (A.88) with ω → f
reduces to ⟨df, v1 ⟩ := v1 (f ), i.e. one recovers Eq. (A.23). Hence, for a scalar field, the exterior
derivative is nothing but the differential. For p = 1, Eq. (A.88) becomes
In terms of components with respect to a coordinate system (xα ), the definition (A.88) yields
                                          ∂ω
                      0-form: (dω)α =                                                                 (A.90a)
                                         ∂xα
                                          ∂ωβ ∂ωα
                      1-form: (dω)αβ =          − β                                                   (A.90b)
                                          ∂xα     ∂x
                                           ∂ωβγ      ∂ωαγ ∂ωαβ
                      2-form: (dω)αβγ =        α
                                                  −       +                                           (A.90c)
                                            ∂x       ∂xβ     ∂xγ
                                                p+1
                                                X           ∂
                      p-form: (dω)α1 ···αp+1 =      (−1)i+1 αi ωα1 ···αi−1 αi+1 ···αp+1 .             (A.90d)
                                                i=1
                                                           ∂x
Remark 5: The exterior derivative appeals only to the manifold structure; it does not depend upon the
metric tensor g, nor upon any other extra structure on M . Nevertheless, one may replace all partial
derivatives in formulas (A.90) by covariant derivatives taken with respect to the Levi-Civita connection
∇ of g, thanks to the symmetry of the Christoffel symbols on their last two indices:
ddω = 0 . (A.92)
A p-form ω is said to be closed iff dω = 0, and exact iff there exists a (p − 1)-form σ such
that ω = dσ. From property (A.92), any exact p-form is closed. The Poincaré lemma states
that the converse is true, at least locally, on a contractible open set.
    Given a vector field v on an oriented pseudo-Riemannian manifold (M , g) of dimension
n, the tensor field formed by setting the first argument of the Levi-Civita tensor ϵ to v, while
leaving free the other slots, i.e. ϵ(v, ., . . . , .), is a (n − 1)-form denoted9 v · ϵ. Its exterior
derivative is thus a n-form, which is expressible in terms of the divergence of v [cf. Eq. (A.74)]
as
                                       d(v · ϵ) = (∇ · v) ϵ .                                  (A.93)
     9
    More generally, the dot notation stands for the contraction on adjacent indices; here: (v · ϵ)α1 ···αn−1 :=
v ϵµα1 ···αn−1 .
 µ
734                                    Basic differential geometry
      where ∂D is D’s boundary oriented according to the outward convention: if ϵD is the d-form
      defining the orientation of D (cf. Sec. A.3.4), the (d − 1)-form defining the orientation of ∂D
      is chosen to be v · ϵD , where v is an outward-pointing10 vector field along ∂D. Note that
      each side of (A.94) is (of course!) well-defined, as the integral of a p-form over an oriented
      p-dimensional manifold.
      Remark 6: If D is a submanifold of an oriented pseudo-Riemannian manifold (M , g) of dimension
      n, the orientation of D is naturally set by choosing ϵD = ϵ if d = n, where ϵ is the Levi-Civita tensor
      of g (cf. Sec. A.3.4). If d < n, a free family of n − d vector fields (u1 , . . . , un−d ) nowhere tangent to
      D has to be chosen first and ϵD is then defined by ϵD = ϵ(u1 , . . . , un−d , ., . . . , .). Accordingly, the
      orientation of ∂D is set by ϵ∂D = v · ϵD = ϵ(u1 , . . . , un−d , v, ., . . . , .).
         Another important formula involving the exterior derivative is the Cartan identity, which
      expresses the Lie derivative of a p-form ω along a vector field u as
Lu ω = u · dω + d(u · ω) . (A.95)
      As above, a dot denotes the contraction on adjacent indices: u · dω is the p-form dω(u, ., . . . , .)
      and u · ω is the (p − 1)-form ω(u, ., . . . , .). Notice that if ω is a 1-form, Eq. (A.95) is readily
      obtained by combining Eqs. (A.84), (A.90a) and (A.90b). An immediate consequence of the
      Cartan identity and the nilpotence property (A.92) is that the Lie derivative and the exterior
      derivative commute, i.e. for any vector field u and any p-form ω:
Lu dω = d Lu ω. (A.96)
      A.5       Curvature
      A.5.1      General definition
      The Riemann curvature tensor of an affine connection ∇ (cf. Sec. A.4.1) is defined by
             Riem : X∗ (M ) × X(M )3 −→                                   C ∞ (M , R)
                                                                                                            (A.97)
                                                                                                  
                           (ω, w, u, v)        7−→        ω, ∇u ∇v w − ∇v ∇u w − ∇[u,v] w ,
      where X∗ (M ) stands for the space of 1-forms on M , X(M ) for the space of vector fields on
      M and C ∞ (M , R) for the space of smooth scalar fields on M . The above formula does define
      a tensor field on M , i.e. the value of Riem(ω, w, u, v) at a given point p ∈ M depends only
        10
          By outward-pointing, it is meant that at each point p ∈ ∂D, v|p is not tangent to ∂D and there exists a
      curve in D starting from p whose tangent vector at p is − v|p .
                                                A.5 Curvature                                                   735
upon the values of the fields ω, w, u and v at p and not upon their behaviors away from p,
as the covariant derivatives in Eq. (A.97) might suggest. We denote the components of this
tensor in a given basis (eα ) by Rγ δαβ , instead of Riemγ δαβ . The definition (A.97) leads to the
following formula, named the Ricci identity:
Remark 1: In view of this identity, one may say that the Riemann tensor measures the lack of commu-
tativity of two successive covariant derivatives of a vector field. On the opposite, for a scalar field and a
torsion-free connection, two successive covariant derivatives always commute [cf. Eq. (A.68)].
   In a coordinate basis, the components of the Riemann tensor are given in terms of the
connection coefficients by
                                   ∂Γα βν   ∂Γα βµ
                       Rα βµν =           −        + Γα σµ Γσ βν − Γα σν Γσ βµ .                     (A.99)
                                    ∂xµ      ∂xν
    From the definition (A.97), the Riemann tensor is clearly antisymmetric with respect to its
last two arguments:
                            Riem(., ., u, v) = −Riem(., ., v, u).                     (A.100)
The covariant derivatives of the Riemann tensor obey the Bianchi identity
                              Riem(ω, w, ., .) = −Riem(w, →
                                                          −
                                                          ω , ., .).                                (A.103)
Combined with (A.100) and (A.101), this implies the symmetry property
                             Riem(ω, w, u, v) = Riem(u, v, →
                                                           −
                                                           ω , w).                                  (A.104)
                            R : X(M ) × X(M ) −→                    C ∞ (M , R)
                                                                                                 (A.105)
                                        (u, v)          7−→ Riem(eµ , u, eµ , v),
      where (eα ) is a vector frame on M and (eα ) its dual counterpart. This definition is independent
      of the choice of (eα ). In terms of components:
      Remark 2: Following standard usage, we denote the components of the Riemann and Ricci tensors by
      the same letter R, the number of indices allowing us to distinguish between the two tensors. On the
      other hand, we are using different symbols, Riem and R, when employing the index-free notation.
      which is obtained by taking the trace of the Ricci identity (A.98) on the indices (α, γ) and
      relabelling β → α.
          For the Levi-Civita connection associated with the metric g, property (A.104) implies that
      the Ricci tensor is symmetric:
                                            R(u, v) = R(v, u).                                   (A.108)
      In addition, one defines the Ricci scalar (also called scalar curvature) as the trace of the
      Ricci tensor with respect to the metric g:
R := g µν Rµν . (A.109)
                                                     →
                                                     −
                                                   ∇·G =0,                                       (A.110)
            →
            −
      where G in the type-(1, 1) tensor associated by metric duality [cf. (A.48)] to the Einstein
      tensor:
                                                     1
                                          G := R − R g .                                   (A.111)
                                                     2
      Equation (A.110) is called the contracted Bianchi identity.
                                            A.5 Curvature                                              737
                                                          R γ
 Rγ δαβ = Rγ α gδβ − Rγ β gδα + δ γ α Rδβ − δ γ β Rδα +     δ β gδα − δ γ α gδβ     (n = 3). (A.113)
                                                                                
                                                          2
   For n ≥ 4, the Riemann tensor can be split into (i) a “trace-trace” part, represented by the
Ricci scalar R [Eq. (A.109)], (ii) a “trace” part, represented by the Ricci tensor R [Eq. (A.106)],
and (iii) a “traceless” part, which is constituted by the Weyl conformal curvature tensor, C:
                                    1
            Rγ δαβ = C γδαβ +           Rγ α gδβ − Rγ β gδα + δ γ α Rδβ − δ γ β Rδα
                                                                                    
                                 n−2
                                 1
                                          R δ γ β gδα − δ γ α gδβ .                          (A.114)
                                                                 
                         +
                           (n − 1)(n − 2)
The above relation may be taken as the definition of C. It implies that C is traceless: C µαµβ = 0.
The other possible traces are zero thanks to the symmetries of the Riemann tensor.
Remark 3: The decomposition (A.114) is meaningful for n = 3 as well; by comparing with (A.113), we
see that it results in a vanishing Weyl tensor: C = 0 for n = 3.
738   Basic differential geometry
Appendix B
Geodesics
Contents
         B.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 739
         B.2 Definition and first properties . . . . . . . . . . . . . . . . . . . . . . . 739
         B.3 Existence and uniqueness of geodesics . . . . . . . . . . . . . . . . . . 744
         B.4 Geodesics and extremal lengths . . . . . . . . . . . . . . . . . . . . . . 750
         B.5 Geodesics and symmetries . . . . . . . . . . . . . . . . . . . . . . . . . 757
         B.6 Geodesics and curvature . . . . . . . . . . . . . . . . . . . . . . . . . . . 759
B.1      Introduction
Geodesics play a key role in general relativity, since they represent the worldlines of test particles
and photons (cf. Sec. 1.3). Moreover, in black hole theory, null geodesics play a prominent role,
as the generators of event horizons (cf. Sec. 4.4.3). We review here the definition and main
properties of geodesics on a generic pseudo-Riemannian manifold, i.e. a manifold equipped
with a metric of generic signature, as introduced in Sec. A.3. In particular, the results apply to
pure Riemannian manifolds (positive definite metric), as well as to Lorentzian manifolds, i.e.
spacetimes. Contrary to Appendix A, proofs of most statements will be provided, since they
are quite illustrative.
      space: this is nothing but a straight line, for which tangent vectors keep obviously a fixed
      direction. In a pseudo-Riemannian manifold, such as the spacetimes of general relativity, one
      uses this last property to define geodesics.
          Let us first recall the basic definitions given in Sec. A.2.3: a curve 1 is the image L = P (I)
      of a map (called a parametrization of the curve) P : I → M , λ 7→ P (λ), where I is an
      interval of R and the variable λ is called a parameter of the curve. Moreover, we exclude the
      case where L is reduced to a single point of M , i.e. where P is a constant function. We are
      now in position to define a geodesic as a “straight” curve:
                                                                dx
                                                          v=       ,                                            (B.2)
                                                                dλ
        where dx is the infinitesimal displacement along L corresponding to the change dλ in
        the parameter λ (cf. Eq. (A.14)).
If both parametrizations are affine, then ∇v v = 0 and ∇v′ v ′ = 0, so that the above identity
reduces to d2 λ′ /dλ2 = 0, which implies the affine law (B.3).
Remark 1: Because of (B.1), a geodesic is also called an autoparallel curve. It is also sometimes called a
zero-acceleration curve, the vector ∇v v being considered as the “acceleration” of (the parametrization
P of) the curve L ; this is of course by extension of the concept of 4-acceleration a := ∇u u of a
timelike worldline with 4-velocity u, the latter being nothing but the tangent vector associated with the
parametrization of the worldline by its proper time (cf. Sec. 1.3.3).
    An important property of geodesics is
  Property B.2: constancy of the scalar square of affine tangent vector fields
where we have used the fact that ∇ is the Levi-Civita connection of g [Eq. (A.69)] and v obeys
the geodesic equation (B.1).
The constancy of g(v, v) has an interesting corollary: the tangent vector v cannot change its
causal type along L . Hence:
  Property B.3: type of geodesics on pseudo-Riemannian manifolds
This is in sharp contrast with generic curves, which, for instance, can be timelike on some
portions and null or spacelike on other ones.
    In thep timelike case, or the spacelike one, the tangent vector field v can be rescaled by the
constant |g(v, v)| to get a unit tangent vector field, i.e. a tangent vector field u which obeys
g(u, u) = −1 (timelike geodesic) or g(u, u) = 1 (spacelike geodesic). Moreover, in doing
so, the affine character of the parametrization is preserved. Indeed,    the rescaling amounts to
choosing the constant a in the affine law (B.3) such that a = |g(v, v)|. Thus, we have
                                                                p
742                                            Geodesics
      We shall see in Sec. B.2.2 that for a timelike geodesic, the affine parameter with unit tangent
      vector is nothing but the proper time, while for a spacelike geodesic, it is the arc length.
        A curve L is a geodesic iff the tangent vector field v associated with any parametrization
        of L obeys
                                               ∇v v = κ v,                                    (B.7)
        where κ is a scalar field along L .
      Conversely, if v obeys (B.7) with κ = κ(λ), then Eq. (B.5) implies that ∇v′ v ′ = 0, i.e. that L
      is a geodesic, provided that the change of parametrization λ′ = λ′ (λ) fulfills
                                                     dλ′ d2 λ′
                                              κ(λ)      −      = 0.
                                                     dλ   dλ2
      This differential equation has the following solution:
                                        Z λ"        Z λ̃         ! #
                                 λ′ = a       exp          ˜ λ̃
                                                         κ(λ̃)d ˜ dλ̃ + b,
                                         λ1              λ0
  A vector field v obeying (B.7) is called a pregeodesic vector field. The scalar field κ is then
  called the non-affinity coefficient of v. If κ = 0, v is naturally called a geodesic vector
  field.
Note that Property B.5 is equivalent to stating that the field lines of a pregeodesic vector field
are geodesics.
    An easy consequence of Eq. (B.7) is the following evolution law for the scalar square of the
tangent vector:
  Along a geodesic L , the scalar square g(v, v) of the tangent vector v associated with any
  parametrization of L evolves according to
Proof. The tangent vector associated with the proper time τ along a timelike geodesic is nothing
but the 4-velocity u (cf. Sec. 1.3.3), which is of constant scalar square: g(u, u) = −1, so that
Eq. (B.8) reduces to 0 = −2κ, hence κ = 0, which implies that we are dealing with an affine
parametrization. Similarly, the tangent vector associated with the arc length s along a spacelike
geodesic has a scalar square everywhere equal to 1, leading to the same conclusion.
744                                           Geodesics
                              d2 X α           µ
                                         α dX dX
                                                  ν
                                     + Γ  µν        =0,           0 ≤ α ≤ n − 1,                 (B.10)
                               dλ2           dλ dλ
       where the Γαµν ’s are the Christoffel symbols of the metric g with respect to the coordinates
       (xα ), as given by Eq. (A.71).
      Proof. Notice first that the components with respect to the chart (xα ) of the tangent vector
      field v associated with the parameter λ are [cf. Eq. (A.11)]
                                                        dX α
                                                 vα =        .                                     (B.11)
                                                         dλ
      On the other side, the components of ∇v v are
                                ∂v α                                         dv α
                 v µ ∇µ v α = v µ    + Γ α
                                          µν v µ ν
                                                v  = v(v α
                                                           ) + Γ α
                                                                  µν v µ ν
                                                                        v  =      + Γαµν v µ v ν ,
                                ∂xµ                                          dλ
      where we have used successively Eqs. (A.64), (A.12) and (A.8). The above relation, along with
      (B.11), shows that the left-hand side of Eq. (B.10) is nothing but the component α of ∇v v. The
      conclusion follows from the very definition of a geodesic given in Sec. B.2.1.
          Note that if a solution of the geodesic equation (B.10) is found, the parameter λ is necessarily
      an affine parameter. For a generic parameter, the differential equation becomes (B.10) with the
      right-hand side replaced by κdX α /dλ, which is the coordinate expression of the right-hand
      side κv in Eq. (B.7). Hence, we have
        Property B.9: pregeodesic equation
        A curve L in the domain of a chart (xα ) is a geodesic iff some (actually all) coordinate
        expression xα = X α (λ) of L fulfills the following system of n second-order differential
        equations, usually called the pregeodesic equation,
                         d2 X α           µ
                                    α dX dX
                                             ν
                                                      dX α
                                + Γ  µν        = κ(λ)      ,           0 ≤ α ≤ n − 1.            (B.12)
                          dλ2           dλ dλ          dλ
                          B.3 Existence and uniqueness of geodesics                              745
                                                   dx
                                             V =        .                             (B.13)
                                                   dλ p
  Moreover, this geodesic is unique, in the sense that any geodesic L ′ sharing the same
  property coincides with L in some open neighborhood of p.
Proof. Let (xα ) be a coordinate chart of M around p. Let (V α ) be the components of V in the
basis of Tp M induced by the coordinate frame (∂α ) associated with (xα ):
V = V α ∂α |p .
                                                        dX α
                           X (0) = x (p) and
                             α          α
                                                             (0) = V α .                (B.14)
                                                         dλ
The system (B.10) + (B.14) constitutes a well-posed Cauchy problem and standard results
about ordinary differential equations, e.g. the Picard-Lindelöf (or Cauchy–Lipschitz) theorem,
guarantee the existence and uniqueness of the solution.
   A few definitions follow naturally:
  A geodesic L is said to be inextendible or maximal iff there does not exist any geodesic
  L ′ ̸= L such that L ⊂ L ′ .
  A geodesic L is complete iff the interval spanned by any of its affine parameters is the
  whole real line: I = R. A geodesic that is not complete is called incomplete.
   It is easy to show:
746                                                    Geodesics
      Proof. Let L be a complete geodesic. Let us consider any geodesic L ′ such that L ⊂ L ′ .
      Let λ and λ′ be affine parameters of respectively L and L ′ . Since L ⊂ L ′ , λ′ is also an
      affine parameter of L and we must have, along L , λ′ = aλ + b with a ̸= 0 [Eq. (B.3)]. Since
      the range of λ is (−∞, +∞), for L is complete, this implies that the range of λ′ on L is
      (−∞, +∞) as well, which make impossible to have points in L ′ \ L . Hence L ′ = L , i.e. L
      is inextendible.
           On physical grounds, one may consider that any timelike geodesic in a given spacetime
      must be complete. Otherwise, this would mean that there exists a worldline L of a freely
      falling observer that “ends” at some finite proper time. This would be the signature of either
      (i) the possibility to extend the spacetime into a larger one or (ii) the ending of worldline L
      at some (curvature) singularity. A spacetime in which this does not occur is called timelike
      geodesically complete. More generally:
      Remark 1: A well-known theorem of differential geometry, namely the Hopf-Rinow theorem, states
      that a connected Riemannian manifold is geodesically complete iff it is complete as a metric space for
      the distance function d(p, q) defined as the infimum of the length2 of all curves from p to q (see e.g.
      Ref. [344]). However, there is no such theorem for a Lorentzian manifold, for the metric does not induce
      any distance function which would make the manifold a metric space.
             The following proposition strengthens Property B.10:
                   expp : Ep ⊂ Tp M −→ M
                                        
                                         p                          if V = 0            (B.16)
                              V     7−→
                                         PV (1) ∈ LV                if V ̸= 0
In other words, expp maps a nonzero vector V in the tangent space to M at p to the point
of M of affine parameter λ = 1 along the unique geodesic through p such that (i) λ = 0
corresponds to p and (ii) the tangent vector dx/dλ to the geodesic at p is V .
    Note that if (M , g) is geodesically complete, Ep = Tp M for every point p ∈ M .
    An immediate property of the exponential map is
Property B.13
  If V ∈ Ep \ {0}, for any t ∈ [0, 1], expp (tV ) lies on the same geodesic LV as expp (V ), at
  the parameter λ = t of the parametrization PV :
Proof. For t = 0, the property follows from the definition of expp , since PV (0) = p. If
t ̸= 0, the nonzero vector tV is collinear to V and the uniqueness property of geodesics
(Property B.12) implies that LtV = LV . By virtue of the transformation law (B.4), tV is the
tangent vector to LV corresponding to the affine parameter λ′ = t−1 λ, where λ is the affine
parameter whose tangent vector field obeys v|p = V . From the definition of expp , we have
then expp (tV ) = PtV (λ′ = 1) = PV (λ = t × 1), hence (B.17).
    The exponential map realizes a local identification of the manifold with its tangent space at
a given point:
      Proof. It is clear from its definition that expp is a smooth map, at least on some neighborhood
      U ′ of 0 in Tp M . We may then consider the differential of expp at 0, d expp 0 . By virtue of
      the inverse function theorem for manifolds (see e.g. Theorem 4.5 in Ref. [343]), it suffices to
      show that d expp 0 is invertible to complete the proof. By definition of the differential of a
      map (cf. Sec. A.2.7) and since expp : Ep ⊂ Tp M → M and expp (0) = p, d expp 0 carries an
      infinitesimal displacement vector of3 T0 (Tp M ), ε say, connecting 0 to a nearby element of
      Tp M , ε′ say, to the infinitesimal vector E ∈ Tp M connecting p = expp (0) to q = expp (ε′ ):
                                       d expp      0
                                                       : T0 (Tp M ) −→ Tp M
                                                            ε           7−→ E = →
                                                                                −
                                                                                pq.
      Now, since Tp M is a vector space, we have the canonical identification T0 (Tp M ) ≃ Tp M , from
      which ε′ = ε. Without any loss of generality, we may write ε = εV , where ε is infinitesimal
      small and V ∈ Tp M . Then q = expp (εV ) = PV (ε), where the second identity results from
      (B.17). We have thus
                                                               −−−−−−−−→
                                 d expp 0 (εV ) = E = →  −
                                                         pq = PV (0)PV (ε).
                                                                   −−−−−−−−→
      According to the definition of PV , the infinitesimal vector PV (0)PV (ε) along the geodesic LV
      is εV , hence
                                            d expp 0 (εV ) = εV .
      Since the differential d expp 0 is a linear map, we get d expp 0 (V ) = V . The vector V being
      arbitrary, we conclude that d expp 0 is nothing but the identity map of the vector space Tp M :
                                                       d expp   0
                                                                    = idTp M .
      In particular, d expp   0
                                  is invertible.
         3
          Here the vector space Tp M is considered as a n-dimensional smooth manifold, and T0 (Tp M ) stands for its
      tangent space at 0 (the zero vector of Tp M ).
                              B.3 Existence and uniqueness of geodesics                                   749
                    Φ: U       −→ Rn
                                                                                              (B.18)
                                       ⟨E 0 , exp−1                     , exp−1
                                                                    n−1
                                                                                   
                          q    7−→               p (q)⟩, . . . , ⟨E          p (q)⟩
In other words, normal coordinates (xα ) on U are such that the tangent vector xµ (q)Eµ ∈ Tp M
has precisely q as image by the exponential map:
Remark 2: Some authors, e.g. [390], add the condition that the basis (Eα ) is orthonormal (with respect
to the metric g) in the definition of normal coordinates. We follow here the more general definition
of [318, 63, 266, 464]. The name Riemann normal coordinates is also commonly encountered in the
literature, either for normal coordinates as defined here (e.g. [371, 499]) or for those with the basis
orthonormality requirement (e.g. [418]).
    A characteristic feature of normal coordinates is that, in terms of them, geodesics through
p look like straight lines through 0 in Rn :
  In a normal coordinate system (xα ), the equation of the unique geodesic LV through
  p admitting V ∈ Tp M as tangent vector at p is (as long as LV remains in the normal
  neighborhood U )
                                  xα = X α (λ) = λV α ,                          (B.20)
  where the V α ’s are the components of V with respect to the basis (Eα ) defining the normal
  coordinates: V = V µ Eµ .
             X α (λ) = ⟨E α , exp−1                  α            α          α
                                 p ◦ expp (λV )⟩ = ⟨E , λV ⟩ = λ⟨E , V ⟩ = λV .
750                                            Geodesics
        Let (xα ) be a normal coordinate system around p ∈ M associated with a basis (Eα ) of
        Tp M . Then
• the coordinate frame (∂α ) associated with (xα ) coincides with (Eα ) at p:
∂α |p = Eα ; (B.21)
            • the values at p of the components (gαβ ) of the metric tensor g with respect to (xα )
              are
                                           gαβ (p) = g|p (Eα , Eβ );                        (B.22)
      Proof. Let U be the normal neighborhood covered by (xα ) and V ∈ exp−1   p (U ) ⊂ Tp M . The
      tangent vector field to the geodesic LV corresponding to the parametrization PV is v = Ẋ µ ∂µ
      with Ẋ µ obtained by deriving (B.20) with respect to λ: Ẋ µ = V µ . Hence v = V µ ∂µ . Now,
      from the very definition of PV , v|p = V = V µ Eµ . We have therefore
V µ ∂µ |p = V µ Eµ .
      This identity being fulfilled for any V µ , Eq. (B.21) follows. Equation. (B.22) is an immediate
      consequence of Eq. (B.21), since gαβ = g(∂α , ∂β ). Finally, with the functions X α (λ) given by
      (B.20), the geodesic equation (B.10) reduces to Γαµν V µ V ν = 0. In particular, at p, we get
Γαµν (p)V µ V ν = 0.
      This identity must hold for any V α . It expresses therefore that, for each value of α, the quadratic
      form V 7→ Γαµν (p)V µ V ν is identically zero on Tp M . Since the Christoffel symbols Γαµν are
      symmetric in µν, it is equivalent to say that, for each value of α, the symmetric bilinear form
      (U , V ) 7→ Γαµν (p)U µ V ν is identically zero, which amounts to Γαµν (p) = 0, i.e. Eq. (B.23).
one cannot
        p use the integral of the norm of infinitesimal displacements along the curve, i.e.
ds := g(dx,p  dx), since g(dx, dx) can be negative. Rather, it is quite natural to employ
instead ds := |g(dx, dx)|. Using dx = v dλ [Eq. (A.14)], we end up with the following
definition:
 The length of a curve L connecting two points p and q in a pseudo-Riemannian manifold
 (M , g) is the real number
                                                  Z   λq
                                                                                            (B.24)
                                                           p
                                L(p,q) (L ) :=                 |g(v, v)| dλ,
                                                  λp
  where λ is some parameter along L , λp (resp. λq ) being its value at p (resp. q), v = dx/dλ
  is the associated tangent vector field, and we assume λq ≥ λp .
Thanks to the transformation law (B.4), it is easy to check that the value of L(p,q) (L ) is
independent from the choice of the parametrization of L , i.e. for a fixed pair of points (p, q), it
is a function of L only.
    When L is included in the domain of a coordinate chart (xα ), so that its equation is
xα = X α (λ), we may rewrite (B.24) as [cf. Eq. (B.11)]
                                      Z λq r
                          L(p,q) (L ) :=              gµν (X ρ (λ))Ẋ µ Ẋ ν dλ,              (B.25)
                                           λp
where Ẋ α := dX α /dλ and gµν (X ρ (λ)) stands for the components of the metric tensor g with
respect to the coordinates (xα ) at the point of coordinates X ρ (λ).
    From the very definition of L(p,q) (L ), it is obvious that
                                           L(p,q) (L ) ≥ 0.                                   (B.26)
Moreover, if it exists, any null curve from p to q achieves the absolute minimum of the length,
without having to be a geodesic:
                                   L null =⇒ L(p,q) (L ) = 0.                                 (B.27)
      Before proceeding, a few caveats must be made. First of all, the Euler-Lagrange equation locate
      only stationary points of the action (here the length L(p,q) (L )), i.e. points where the action
      does not vary to first order in small changes of the curve. Such points are not necessarily
      extrema: they can be saddle points, as we shall see. Secondly, because of the square root in
      (B.28), the Lagrangian is not differentiable at points where gµν Ẋ µ Ẋ ν = 0. This corresponds to
      points where the curve L is null. We shall therefore exclude such curves in our analysis (we
      shall return to null curves in Sec. B.4.3). But then gµν Ẋ µ Ẋ ν has to be either always positive
      along L (i.e. L is spacelike) or always negative (i.e. L is timelike); indeed, by continuity it
      cannot change sign without going through zero. We shall then apply the variational principle
      separately to two subsets of curves connecting p and q: the timelike ones and the spacelike
      ones. The calculations will be conducted in parallel by introducing the sign parameter ϵ = −1
      for timelike curves and ϵ = +1 for spacelike ones. One can then get rid of the absolute value
      in the Lagrangian, which becomes
                                                 q
                                          α    α
                                     L(X , Ẋ ) = ϵgµν (X ρ )Ẋ µ Ẋ ν .                          (B.29)
      Asking that the length (B.25) is stationary with respect to small changes in the curve connecting
      p and q is equivalent to the Euler-Lagrange equation:
                                                    
                                            d    ∂L        ∂L
                                                        −        = 0.                            (B.30)
                                           dλ ∂ Ẋ α      ∂X α
      We have
                                   ∂                 ∂g
                                                          µν
                                     α
                                             ρ   µ ν
                                       gµν (X )Ẋ Ẋ =     α
                                                             Ẋ µ Ẋ ν ,                          (B.31)
                                  ∂X                   ∂x
      with the understanding that ∂gµν /∂xα shall be taken at the point X ρ (λ). Hence, given the
      Lagrangian (B.29),
                                             ∂L         ϵ ∂gµν µ ν
                                                   =            Ẋ Ẋ .                     (B.32)
                                           ∂X α        2L ∂xα
      Besides,
                           ∂                        
                                 gµν (X ρ )Ẋ µ Ẋ ν = gαν Ẋ ν + gµα Ẋ µ = 2gαµ Ẋ µ .    (B.33)
                         ∂ Ẋ α
      Hence
                                                 ∂L        ϵ
                                                       = gαµ Ẋ µ ,
                                               ∂ Ẋ α      L
      from which,
                                 
                       d     ∂L             ϵ dL               ϵ ∂gαµ ν µ ϵ
                                     =− 2           gαµ Ẋ µ +      ν
                                                                      Ẋ Ẋ + gαµ Ẍ µ .    (B.34)
                      dλ ∂ Ẋ   α          L dλ                L ∂x            L
      In view of (B.32) and (B.34), the Euler-Lagrange equation (B.30) becomes, after multiplication
      by L/ϵ,
                         1 dL             ∂gαµ µ ν                1 ∂gµν µ ν
                       −       gαµ Ẋ µ +    ν
                                               Ẋ Ẋ + gαµ Ẍ µ −       Ẋ Ẋ = 0.
                         L dλ             ∂x                      2 ∂xα
                                        B.4 Geodesics and extremal lengths                                     753
Now, playing with the names of repeated indices and using the symmetry of gαβ , we can rewrite
the second term as
                                                                      
     ∂gαµ µ ν       1 ∂gαν ν µ ∂gµα µ ν                 1 ∂gαν ∂gµα
         ν
           Ẋ Ẋ =        µ
                            Ẋ Ẋ +       ν
                                            Ẋ Ẋ    =          µ
                                                                  +    ν
                                                                           Ẋ µ Ẋ ν .   (B.35)
      ∂x            2 ∂x               ∂x               2 ∂x        ∂x
Accordingly, we get
                                                             
                                    1       ∂gαν ∂gµα ∂gµν
                                µ
                           gαµ Ẍ +             +     −           Ẋ µ Ẋ ν = κgαµ Ẋ µ ,            (B.36)
                                    2       ∂xµ   ∂xν   ∂xα
Property B.17: timelike and spacelike geodesics as stationary points of the length
  Among all timelike (resp. spacelike) curves connecting two points p and q, a curve has a
  stationary length iff it is a timelike (resp. spacelike) geodesic.
    For a timelike geodesic, and for points p and q not too far (in the same normal neighborhood),
the stationary length corresponds actually to a maximum:
We shall not provide a full proof here but refer instead to the proof of Proposition 5.34 in
O’Neill’s textbook [390]. We shall only illustrate the property on a specific example in flat
spacetime (Example 1 below).
    If one interprets timelike curves as worldlines and length as proper time (cf. Sec. 1.3.3),
Property B.18 can be viewed as a generalization of the standard “twin paradox” of special
relativity: when they meet again, the twin who followed the geodesic (i.e. some inertial
trajectory) is older than his brother or her sister, who made a round trip.
Example 1 (Timelike geodesic in Minkowski spacetime): Let us suppose that (M , g) is the 4-
dimensional Minkowski spacetime. All geodesics are then (segments of) straight lines. If p and q
are connected by a timelike geodesic L , we may consider a Minkowskian coordinate system (xα ) =
(t, x, y, z) such that xα (p) = (0, 0, 0, 0) and xα (q) = (T, 0, 0, 0), for some T > 0. t is then the proper
754                                                 Geodesics
                                                               t/T
                                                               1    q
0.8
0.6
0.4
0.2
                                                                    p                  x/T
                                            -0.4   -0.2                 0.2      0.4
      Figure B.1: Timelike curves Lh connecting the point p of coordinates (0, 0, 0, 0) to the point q of coordi-
      nates (T, 0, 0, 0) in Minkowski spacetime. From the left to right, the depicted curves correspond to h spanning
      [−3/4, 3/4], with the step δh = 1/4.
      time along L and L(p,q) (L ) = T . Let us consider the one-parameter family of curves (Lh )h∈(−1,1)
      defined by xα = X α (λ) with λ ∈ [0, T ] and
                                                      h
                         X 0 (λ) = λ,     X 1 (λ) =     λ(T − λ),             X 2 (λ) = 0,   X 3 (λ) = 0.
                                                      T
      Note that X 0 (λ) = λ means that the curve parameter coincides with the time coordinate: λ = t. We
      have L0 = L and for h ̸= 0, L is an arc of parabola from p to q in the (t, x) plane (cf. Fig. B.1); the
      dimensionless parameter h is related to the curve’s maximal extension along x by xmax = hT /4. We
      have                                               
                         0             1                λ
                       Ẋ (λ) = 1, Ẋ (λ) = h 1 − 2         , Ẋ 2 (λ) = 0, Ẋ 3 (λ) = 0.
                                                        T
      Given that (gαβ ) = diag(−1, 1, 1, 1), it follows that gµν Ẋ µ Ẋ ν = −1+h2 (1−2λ/T )2 . Since λ ∈ [0, T ],
      this shows that Lh is a timelike curve as long as −1 ≤ h ≤ 1. Its length is
                              s
                                                 λ 2
                          Z T                                  Z hp                   Z arcsin h
                                                              T                       T
           L(p,q) (Lh ) =            2
                                1−h 1−2               dλ =                      2
                                                                           1 − u du =              cos2 θ dθ.
                           0                     T           2h −h                    2h −arcsin h
      Note that arcsin h/h is well-defined at h = 0, since limh→0 arcsin h/h = 1. The graph of L(p,q) (Lh )
      as a function h is plotted in Fig. B.2. We see clearly that h = 0, i.e. the geodesic L , corresponds to the
      maximal length.
          For a spacelike geodesic in a Lorentzian manifold, the stationary length corresponds neither
      to a maximum nor a minimum, but rather to a saddle point, as the example below illustrates.
                                    B.4 Geodesics and extremal lengths                                             755
                                               L(p, q) ( h )/T
                                                     1
0.95
0.9
0.85
0.8
                         -1           -0.5            0              0.5           1
                                                                                       h
Figure B.2: Length of the timelike curve Lh connecting the point p of coordinates (0, 0, 0, 0) to the point q of
coordinates (T, 0, 0, 0) in Minkowski spacetime, as a function of the parameter h measuring the deviation from
the timelike geodesic L = L0 .
                           t/L
                        0.15
                         0.1
                        0.05    p                                                                 q x/L
                        -0.05           0.2         0.4             0.6          0.8          1
                         -0.1
                        -0.15
      Figure B.3: Spacelike curves Lh connecting the point p of coordinates (0, 0, 0, 0) to the point q of coordinates
      (0, L, 0, 0) in Minkowski spacetime. From the bottom to the top, the depicted curves correspond to h spanning
      [−3/4, 3/4], with the step δh = 1/4.
      would not have fit in the analysis by noticing the division by L in Eq. (B.37), which excludes
      L = 0. However, it is possible to get all geodesics, including the null ones, from a variational
      principle; one has to start from a different action, namely
                                                          Z   λq
                                                      1
                                       S(p,q) (P ) :=              gµν (X ρ )Ẋ µ Ẋ ν dλ ,                    (B.38)
                                                      2   λp
Remark 1: The variational principle applied to the action (B.38) leads directly to the geodesic equation
(B.10), which implies that the involved parametrization is affine. On the contrary, the variation of the
length functional (B.25), leads only to the pregeodesic equation (B.12) (cf. the computation in Sec. B.4.2),
which permits a generic parametrization of the geodesic, in agreement with the fact that the length is
parametrization-independent, contrary to the action (B.38).
Remark 2: The factor 1/2 in Eq. (B.38) does not play any role in the variational principle, so we could
have dropped it. However, thanks to it, the momentum conjugate to X α takes a simple form:
                                                  ∂L
                                         Πα :=          = gαµ Ẋ µ .                                 (B.40)
                                                 ∂ Ẋ α
The Lagrangian (B.39) can be then written L2 = 1/2 Πµ Ẋ µ and the Hamiltonian deduced from it by
the standard Legendre transformation is H = Πµ Ẋ µ − L2 = 1/2 Πµ Ẋ µ , i.e.
                                                  1
                                    H(X α , Πα ) = g µν (X ρ )Πµ Πν .                                (B.41)
                                                  2
Such a Hamiltonian has been used by Carter [90] to study the geodesics in Kerr spacetime, discovering
the famous Carter constant.
  Property B.20: constancy of the scalar product of an affine tangent vector with a
  Killing vector
      where the first zero holds because ξ obeys the Killing equation (3.19) and the second one holds
      thanks to Eq. (B.1), which expresses that L is a geodesic and v the tangent vector associated
      with some affine parameter.
      Remark 1: If the tangent vector v is associated with a generic (not necessarily affine) parameter of L ,
      the second zero in Eq. (B.43) must be replaced by κv ν , where κ is the non-affinity coefficient of v [cf.
      Eq. (B.7)]. Accordingly g(ξ, v) is no longer constant along L but rather evolves according to
                                                d
                                                  g(ξ, v) = κ g(ξ, v).                                     (B.44)
                                               dλ
      Note that κ a priori varies along L , so that the integration of this first-order differential equation
      depends of the precise form of the function κ(λ).
        A Killing tensor (also called a Stäckel-Killing tensor [95]) of rank p ≥ 1 in the pseudo-
        Riemannian manifold (M , g) is a tensor field K of type (0, p) that is fully symmetric and
        whose covariant derivative obeys
      Example 3: A trivial example of Killing tensor is the metric tensor g itself. If (M , g) admits a Killing
      vector ξ, another example is K = ξ (the 1-form associated to ξ by metric duality), since for p = 1,
      Eq. (B.45) reduces to the Killing equation (3.19). An example for p = 2 is then K = ξ ⊗ ξ (by the Leibniz
      rule + the Killing equation). Similarly, K = ξ ⊗ ξ ⊗ ξ is a Killing tensor with p = 3, etc. A less trivial
      example is the Walker-Penrose Killing tensor of Kerr spacetime discussed in Sec. 11.2.4.
          If a spacetime is endowed with a Killing tensor that is not trivial, i.e. neither formed from
      g nor any Killing vector as in Example 3, one often says that this spacetime has a hidden
      symmetry (see e.g. the review article [201] for an extended discussion). This is because, as
      Killing vectors, Killing tensors give birth to conserved quantities along geodesics:
                                      B.6 Geodesics and curvature                                                       759
        d
          (K(v, . . . , v)) = ∇v (K(v, . . . , v)) = v µ ∇µ (Kν1 ...νp v ν1 · · · v νp )
       dλ
                            = ∇µ Kν1 ...νp v µ v ν1 · · · v νp
                              |           {z                 }
                                               0
                                  +Kν1 ...νp v ∇µ v ν1 · · · v νp + · · · + Kν1 ...νp v ν1 · · · v µ ∇µ v νp
                                               µ
                                             | {z }                                              | {z }
                                                   0                                                 0
                              = 0,
where the first zero results from (B.45), while the zeros in the line below arise from the geodesic
equation (B.1) obeyed by v.
Example 4: Since we have already noticed that the metric tensor g is a Killing tensor (Example 3),
Property B.2 (constancy of g(v, v)) appears as a special case of Property B.21. For the Walker-Penrose
Killing tensor K of Kerr spacetime, the conserved quantity K(v, v) is called the Carter constant (cf.
Sec. 11.2.4).
v := ∂λ and z := ∂s . (B.47)
By construction, v coincides with the tangent vector field dx/dλ associated to the affine
parameter λ along each geodesic Ls [Eq. (B.2)]. The vector field z is called the separation
vector at fixed λ of the family (Ls )s∈R because at a fixed value of λ, two infinitely close
geodesics Ls and Ls+ds are connected by the vector dx = ds z (cf. Fig. B.4). Note that the
separation vector depends upon the choice of the affine parameter λ.
760                                                 Geodesics
      Figure B.4: 1-parameter family of geodesics (Ls )s∈R affinely parametrized by λ. The 2-surface S formed by
      the geodesic family is spanned by the coordinates (λ, s). The infinitesimal vector connecting a point of Ls to the
      point of Ls+ds having the same value of λ is dx = ds z, where z is the separation vector at fixed λ.
          When the geodesics Ls are timelike or null, v is the evolution vector along Ls associated
      to the affine parameter λ. One may then consider the covariant derivative ∇v z as the relative
      velocity of neighboring geodesics and the second order derivative ∇v ∇v z as the relative
      acceleration of neighboring geodesics. Even if Ls is spacelike, we keep the terms relative
      velocity and relative acceleration.
        The relative acceleration of nearby members of a 1-parameter family (Ls )s∈R of geodesics
        in a pseudo-Riemannian manifold (M , g) is given by
v µ ∇µ (v ν ∇ν z α ) = Rαρµν v ρ v µ z ν . (B.49)
      Proof. Because they are vector fields associated to the coordinate system (λ, s) of S , it is
      obvious that the vector fields v and z commute: [v, z] = 0. This can also be seen from
      the identity [v, z] = Lv z [Eq. (A.79)] and the very definition of the Lie derivative Lv z (cf.
      Sec. A.4.2), which in the present case yields Lv z = 0. Hence we have v ν ∇ν z α = z ν ∇ν v α .
                                      B.6 Geodesics and curvature                                          761
            v µ ∇µ (v ν ∇ν z α ) =   v µ ∇µ (z ν ∇ν v α ) = v µ ∇µ z ν ∇ν v α + z ν v µ ∇µ ∇ν v α
                                 =   v µ ∇µ z ν ∇ν v α + z ν v µ (∇ν ∇µ v α + Rαρµν v ρ )
                                 =   z µ ∇µ v ν ∇ν v α + z µ v ν ∇µ ∇ν v α + Rαρµν v ρ v µ z ν
                                 =   z µ ∇µ (v ν ∇ν v α ) + Rαρµν v ρ v µ z ν = Rαρµν v ρ v µ z ν ,
                                             | {z }
                                                 0
where the Ricci identity (A.98) has been used to get the second line and again the commutation
property v µ ∇µ z ν = z µ ∇µ v ν to get the third line. Finally, v ν ∇ν v α = 0 in the last line follows
from the geodesic character of v [Eq. (B.1)], given that it is the tangent vector corresponding
to an affine parametrization of Ls .
    The geodesic deviation equation (B.48) is a second order linear differential equation for the
vector field z. Thanks to it, the curvature of the pseudo-Riemannian manifold (M , g) can be
interpreted as the obstruction for neighboring geodesics that are initially parallel (zero relative
velocity ∇v z at λ = 0) to stay parallel for λ ̸= 0. Actually, Eq. (B.48) can even be used to define
the Riemann curvature tensor Riem, instead of Eq. (A.97) (see e.g. Chap. 11 of MTW [371] for
details).
762   Geodesics
Appendix C
Kerr-Schild metrics
Contents
        C.1 Generic Kerr-Schild spacetimes . . . . . . . . . . . . . . . . . . . . . . 763
        C.2 Case of Kerr spacetime . . . . . . . . . . . . . . . . . . . . . . . . . . . 765
g = f + 2Hk ⊗ k , (C.1)
f µν kµ kν = 0, (C.3)
where f µν stands for the components of the inverse of the metric f (i.e. f αµ fµβ = δ αβ ).
   A motivation for studying Kerr-Schild metrics is that the inverse metric has a simple
expression:
                                  g αβ = f αβ − 2Hk α k β ,                                  (C.4)
where
                                           k α := f αµ kµ .                                    (C.5)
764                                                Kerr-Schild metrics
= δ αβ ,
      Accordingly, we may write the components of k simply as k α without having to specify whether
      the index has been raised with g or f :
k α = f αµ kµ = g αµ kµ . (C.6)
      the form
                                       f = −dt2 + dx2 + dy 2 + dz 2 .                               (C.8)
        Let g be a Kerr-Schild metric. If g obeys the vacuum Einstein equation (1.44), i.e. if the
        Ricci tensor of g vanishes identically, then the scalar field H appearing in Eq. (C.1) can be
        chosen so that k is a geodesic vector fielda :
k µ ∇µ k α = 0 , (C.9)
Equivalently, kα = (−1, −1, 0, a sin2 θ). Let us then introduce the symmetric bilinear form
f := g − 2Hk ⊗ k, (C.12)
Riem(f ) = 0. (C.15)
  The Kerr metric g is a Kerr-Schild metric, i.e. it can be written in the form (C.1) with the
  flat metric f given by Eq. (C.14), the scalar field H given by Eq. (C.13) and the null vector k
  given by Eq. (C.10), k being the tangent vector field to the ingoing principal null geodesics.
766                                       Kerr-Schild metrics
      In Sec. 10.4, we have already noticed that k is a geodesic vector: ∇k k = 0 [Eq. (10.45)], in
      agreement with (C.9).
      Remark 1: The Kerr metric can also be brought to the Kerr-Schild form by using the tangent vector
      field to the outgoing principal null geodesics. Hence the Kerr-Schild decomposition (C.1) is not unique
      for the Kerr metric.
                                        M := M+ ∪ M− ,                                               (C.16a)
                                        M+ := R × [0, +∞) × S2 \ R                                   (C.16b)
                                        M− := R × (−∞, 0] × S2 \ R.                                  (C.16c)
      In other words, M+ is the part r ≥ 0 of M , while M− is the part r ≤ 0. Note that M+ and
      M− are submanifolds with boundaries (cf. Sec. A.2.2), which overlap at r = 0. In terms of
      the domains introduced in Sec. 10.2.1, M+ contains MI , MII and a part of MIII , while M− is
      entirely included in MIII . The Kerr-Schild coordinates (t̃, x, y, z) on M+ are defined from
      the Kerr coordinates (t̃, r, θ, φ̃) by the following formulas:
                                        t̃    =   t̃                                                 (C.17a)
                                        x     =   (r cos φ̃ − a sin φ̃) sin θ                        (C.17b)
                                        y     =   (r sin φ̃ + a cos φ̃) sin θ                        (C.17c)
                                        z     =   r cos θ.                                           (C.17d)
      Remark 2: As we shall see in Sec. C.2.3, the Kerr-Schild coordinates are singular at r = 0, so strictly
      speaking, we should have omitted r = 0 from the definition of M+ and M− .
      Remark 3: For a = 0, Eqs. (C.17b)-(C.17d) reduce to the standard relations between Cartesian and
      spherical coordinates in Euclidean space.
                                          x2 + y 2 z 2
                                                   + 2 =1.                                        (C.20)
                                          r 2 + a2  r
                                                mr3
                                           H= 4         .                                         (C.25)
                                             r + a2 z 2
     p 6: For a ̸= 0, the relations (C.26) hold at first order in the limit r ≫ a, or equivalently in the
Remark
limit x2 + y 2 + z 2 ≫ a.
   The explicit form of the components (gαβ ) of the Kerr metric in Kerr-Schild coordinates
can be read off by expanding the following expression of g:
                g = −dt̃2 + dx2 + dy 2 + dz 2
                        2mr3
                                
                                       rx + ay     ry − ax     z
                                                                   2 ,                           (C.27)
                    + 4          dt̃ + 2       dx + 2      dy + dz
                     r + a2 z 2        r + a2       r + a2     r
      Figure C.1: Surface t̃ = const, φ̃ = 0 or π and r ≥ 0 of the a = 0.9 m Kerr spacetime depicted in terms of the
      Kerr-Schild coordinates (x, y, z). The drawing is limited to r ≤ 3m. The vertical thick green line is the axis of
      rotation. On the right of it φ̃ = 0, while on the left of it φ̃ = π. The red lines are curves r = const, while the
      green ones are curves θ = const, which can be thought of as the traces of the ingoing principal null geodesics.
      The thick black curve marks the black hole event horizon and the thick blue curve the Cauchy horizon. The thick
      red segment along the y-axis marks the intersection of the surface with the disk r = 0. [Figure produced with the
      notebook D.5.4]
      (r = m − m2 − a2 , cf. Sec. 10.8.3). This property, which is shared by the Kerr coordinates, is in
      sharp contrast with the metric components in Boyer-Lindquist coordinates, which are singular at both
      horizons (cf. Sec. 10.2.6).
          Finally, the axisymmetry Killing vector η = ∂φ̃ of Kerr spacetime [Eq. (10.39)] has the
      following expression in terms of Kerr-Schild coordinates:
η = −y ∂x + x ∂y . (C.28)
      This is easily established from the chain rule and the partial derivative with respect to φ̃ of
      expressions (C.17). We notice on Eq. (C.28) that η is also a Killing vector for the flat metric f ,
      namely the Killing vector generating spatial rotations about the z-axis.
          The identity (C.20) shows that, in the Euclidean space spanned by the (x, y, z) coordinates,
      the surfaces of constant r ̸= 0 are confocal1 ellipsoids of revolution. This is depicted in Fig. C.1,
      which represents a slice t̃ = const and φ̃ = 0 or π in terms of the (x, y, z) coordinates. Note that
         1
           In any plane containing the axis of symmetry x2 + y 2 = 0, the trace of the ellipsoids are ellipses that share
      the same foci, located at the abscissas ±a along the z = 0 axis.
                                           C.2 Case of Kerr spacetime                                                   769
Figure C.2: Surfaces (t̃, θ) = const of the a = 0.9 m Kerr spacetime depicted in terms of the Kerr-Schild
coordinates (x, y, z). The disk-like surface in the plane z = 0 is for θ = π/2, while the cone-like surface is for
θ = π/6. The pale brown lines are curves (r, θ) = const, while the green ones are curves (θ, φ̃) = const. The
latter can be thought of as the traces of the ingoing principal null geodesics. The central pink disk is the (double)
disk r = 0, the boundary of which is the curvature singularity. [Figure produced with the notebook D.5.4]
this slice is not a plane but a warped surface, with a kink along the red segment −a < y < a at
(x, z) = (0, 0), which is the intersection of the slice with the double disk r = 0 (to be√discussed
below). Peculiar r = const surfaces are the black hole event    √ horizon (r = r+ = m + m − a ,
                                                                                                2    2
cf. Sec. 10.5.2) and the Cauchy horizon (r = r− = m − m2 − a2 , cf. Sec. 10.8.3). They are
depicted in respectively black and blue in Fig. C.1.
    Since the ingoing principal null geodesics L(v,θ,in
                                                        φ̃) (cf. Sec. 10.4) are curves (θ, φ̃) = const,
their traces in the 3-space of Fig. C.1 are the green lines that terminates at the disk r = 0 (the
red segment along the y-axis). Another view of the ingoing principal null geodesics is provided
by Fig. C.2, which shows two surfaces (t̃, θ) = const in terms of the Kerr-Schild coordinates
(x, y, z), namely the surfaces θ = π/6 and θ = π/2. We notice that, although they are straight
lines in terms of the Kerr-Schild coordinates, the ingoing principal null geodesics are winding
around the rotation axis in the direction of the black hole rotation, which is indicated by η [cf.
Eq. (C.28)].
                                             t̃   =   t̃                                                    (C.29a)
                                             x    =   −a sin θ sin φ̃                                       (C.29b)
                                             y    =   a sin θ cos φ̃                                        (C.29c)
                                             z    =   0                                                     (C.29d)
For a ̸= 0 and a fixed value of t̃, the subset r = 0 of the Kerr spacetime M is the set S0,t
discussed in Sec. 10.2.2 [cf. Eq. (10.14)], where t is related to t̃ via Eq. (10.35a), which for r = 0,
is basically t̃ = t + const. S0,t is topologically a 2-sphere minus its equator. It is therefore
770                                              Kerr-Schild metrics
0.5
                                    y/m
                                            0
-0.5
                                           -1
                                                 -1    -0.5     0        0.5       1
                                                              x/m
      Figure C.3: Disk (actually double disk) r = 0 of the a = 0.9 m Kerr spacetime depicted in terms of the
      Kerr-Schild coordinates (x, y). The pale brown circles are the curves θ = const, while the red segments are the
      curves φ̃ = const. The disk boundary at x + y 2 = a is the curvature singularity of Kerr spacetime. [Figure
                                                p
                                                   2
      disconnected, with two connected components: the two open hemispheres, S0,t         +
                                                                                             and S0,t
                                                                                                   −
                                                                                                      say. As
      shown in Sec. 10.2.2, S0,t+
                                  and S0,t
                                        −
                                           are actually two disks that, with respect to the metric induced
      by g, (i) have radius a and (ii) are flat. The disk S0,t
                                                             +
                                                                is spanned by the coordinates (θ, φ̃) with
      θ ∈ [0, π/2), while S0,t is spanned by the coordinates (θ, φ̃) with θ ∈ (π/2, π]. Let p ∈ S0,t
                              −                                                                        +
                                                                                                         be a
      point of Kerr coordinates (t̃, 0, θ, φ̃). The point q of coordinates (t̃, 0, π − θ, φ̃) belongs to S0,t
                                                                                                           −
                                                                                                              ;
      it is therefore distinct from p, since S0,t and S0,t are two disjoint sets. Now, the transformation
                                                +        −
      (C.29) maps both p and q to the same value (t̃, −a sin θ sin φ̃, a sin θ cos φ, 0) of the Kerr-Schild
      coordinates (t̃, x, y, z). We conclude that, for r = 0, the Kerr-Schild coordinate system fails to
      establish a one-to-one correspondence between the manifold points and some open subset of
      R4 . Actually, as it is clear from (C.29), the two disks S0,t +
                                                                      and S0,t
                                                                           −
                                                                               are mapped by Kerr-Schild
      coordinates to a single disk, namely the disk of radius a centered at (x, y, z) = (0, 0, 0) in
      the plane z = 0. This disk is shown in Fig. C.2 (pink central disk) and in Fig. C.3; its section
      p= 0 or π is depicted as the red segment in Fig. C.1. The disk boundary is the circle z = 0,
      φ̃
          x2 + y 2 = a; it is not part of M , being the curvature singularity of Kerr spacetime (cf.
      Sec. 10.2.6).
Figure C.4: Three views of the immersion of the full t̃ = const and φ̃ = 0 or π surface of the a = 0.9 m
Kerr spacetime in the Euclidean space R3 , using the Kerr-Schild coordinates (x, y, z) for the r ≥ 0 part (drawn
in grey) and the Kerr-Schild coordinates (x′ , y ′ , z ′ ) for the r ≤ 0 part (drawn in pink). The red lines are curves
(r, φ̃) = const, while the green straight lines are ingoing principal null geodesics, which obey (θ, φ̃) = const.
The thick (resp. blue) black curve marks the black hole event horizon (resp. Cauchy horizon). [Figure produced
with the notebook D.5.4; see this notebook for an interactive 3D view]
The two Kerr-Schild coordinate domains M+ and M− are connected through the r = 0
hypersurface. On a t̃ = const slice, this means being connected through the double disk S0,t .
Such a connection is depicted in Fig. C.4, which represents the (non-isometric) immersion in R3
of the surface t̃ = const and φ̃ ∈ {0, π}, with r ranging from −∞ to +∞ (actually from −3m
to 3m on the figure). The immersion is not an embedding2 because the sheet r ≥ 0 (in grey)
intersects the sheet r ≤ 0 (in pink) along the rotation axis, while the only intersection of M+
and M− is along the double disk S0,t (reduced to the segment −a < y < a at (x, z) = (0, 0)
in Fig. C.4). In other words, the intersection of the grey and pink sheets along the z-axis in
Fig. C.4 is spurious (does not correspond to an intersection in the physical spacetime), while
the intersection along the y-axis is physical. From the central and right views in Fig. C.4, one
sees clearly that the ingoing principal null geodesics go smoothly from the r > 0 region to the
r < 0 region through S0,t .
      and George Ellis in 1973 [266]. Generic Kerr-Schild metrics have been introduced and studied by Roy
      Kerr and Alfred Schild in 1965 [314].
Appendix D
SageMath computations
Contents
        D.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 773
        D.2 Minkowski spacetime . . . . . . . . . . . . . . . . . . . . . . . . . . . . 774
        D.3 Anti-de Sitter spacetime . . . . . . . . . . . . . . . . . . . . . . . . . . . 774
        D.4 Schwarzschild spacetime . . . . . . . . . . . . . . . . . . . . . . . . . . 774
        D.5 Kerr spacetime . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 778
        D.6 Evolution and thermodynamics . . . . . . . . . . . . . . . . . . . . . . 781
D.1     Introduction
SageMath (https://www.sagemath.org/) is a modern free, open-source mathematics soft-
ware system, which is based on the Python programming language. It makes use of over 90
open-source packages, among which are Maxima, Pynac and SymPy (symbolic calculations),
GAP (group theory), PARI/GP (number theory), Singular (polynomial computations), and
matplotlib (high quality 2D figures). SageMath provides a uniform Python interface to all
these packages. However, SageMath is more than a mere interface: it contains a large and
increasing part of original code (more than 750,000 lines of Python and Cython, involving 5344
classes). SageMath was created in 2005 by William Stein [462] and since then its development
has been sustained by more than a hundred researchers (mostly mathematicians). Very good
introductory textbooks about SageMath are [532, 306, 34].
    The SageManifolds project (https://sagemanifolds.obspm.fr/) provides SageMath
with tools for differential geometry and tensor calculus, which are used here to perform
computations relative to black hole spacetimes.
    There are basically two ways to use SageMath:
   • Install it on your computer, by downloading the sources or a binary version from https:
     //www.sagemath.org/ (the SageManifolds extensions towards differential geometry are
     fully integrated in version 7.5 and higher)
774                                  SageMath computations
          The SageMath notebooks (Jupyter format) accompanying these lecture notes are available
      at the nbviewer.jupyter.org links provided below. Clicking on the link leads to a read-only view
      of the notebook. Then, by clicking on the Execute on Binder button (the three interlaced circles
      in the top right menu), one gets access to a freely modifiable and executable version. All the
      notebooks are also collected on the page
https://relativite.obspm.fr/blackholes/sage.html
         1. https://nbviewer.jupyter.org/github/egourgoulhon/BHLectures/blob/master/sage/Kerr_in_advanced_
            Kerr_coord.ipynb
         2. https://nbviewer.jupyter.org/github/egourgoulhon/BHLectures/blob/master/sage/Kerr_in_Kerr_
            coord.ipynb
      https://nbviewer.jupyter.org/github/egourgoulhon/BHLectures/blob/master/sage/Kerr_spher_null_
      geod.ipynb
      D.5.16 Critical curve of a Kerr black hole onto the EHT image of M87*
      This notebook generates Fig. 12.30.
      https://nbviewer.jupyter.org/github/egourgoulhon/BHLectures/blob/master/sage/Kerr_image_M87.
      ipynb
      D.6.5 Solving the ODE for outgoing radial null geodesics in Vaidya
            spacetime
      This notebook accompanies Chap. 15.
      https://nbviewer.jupyter.org/github/egourgoulhon/BHLectures/blob/master/sage/Vaidya_solve_ode_
      out.ipynb
Gyoto computations
Contents
        E.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 783
        E.2 Image computations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 783
E.1     Introduction
Gyoto (https://gyoto.obspm.fr) is a free open-source C++ code for computing orbits and
ray-traced images in general relativity [493]. It has a Python interface and has the capability to
integrate geodesics not only in analytical spacetimes (such as Kerr) but also in numerical ones,
i.e. in spacetimes arising from numerical relativity.
By default, the computation is performed in parallel on 8 threads; you can adapt to your CPU
by changing the field NThreads in the file input.xml. The output image is in FITS format and
can be converted to PNG or JPEG by most image processing programs, such as GIMP.
On the Web
   • Movies from computations of the Center for Computational Relativity and Gravitation,
     Rochester Institute of Technology:
     https://ccrg.rit.edu/movies
   • Gyoto gallery (images and movies of accretion flows around black holes and alternative
     compact objects; see also Appendix E):
     https://gyoto.obspm.fr/gallery/
 [1] R. Abraham and J.E. Marsden: Foundations of Mechanics, second edition, Addison-Wesley,
     Redwood City (1978).
 [2] M.A. Abramowicz and P.C. Fragile: Foundations of Black Hole Accretion Disk Theory,
     Living Rev. Relativ. 16, 1 (2013).
 [3] M. Abramowitz and I. Stegun: Handbook of Mathematical Functions with Formulas, Graphs,
     and Mathematical Tables, NBS, 10th printing (1972); updated in the NIST Digital Library
     of Mathematical Functions, available at https://dlmf.nist.gov/.
 [4] R. Abuter et al. (GRAVITY Collaboration): Detection of the gravitational redshift in the
     orbit of the star S2 near the Galactic centre massive black hole, Astron. Astrophys. 615,
     L15 (2018); arXiv: 1807.09409.
 [5] R. Abuter et al. (GRAVITY Collaboration): Detection of the Schwarzschild precession in
     the orbit of the star S2 near the Galactic centre massive black hole, Astron. Astrophys. 636,
     L5 (2020); arXiv: 2004.07187.
 [6] K. Akiyama et al. (the Event Horizon Telescope Collaboration): First M87 Event Horizon
     Telescope Results. I. The Shadow of the Supermassive Black Hole, Astrophys. J. Lett. 875,
     L1 (2019); arXiv: 1906.11238.
 [7] K. Akiyama et al. (the Event Horizon Telescope Collaboration): First M87 Event Horizon
     Telescope Results. V. Physical Origin of the Asymmetric Ring, Astrophys. J. Lett. 875, L5
     (2019); arXiv: 1906.11242.
 [8] K. Akiyama et al. (the Event Horizon Telescope Collaboration): First M87 Event Horizon
     Telescope Results. VI. The Shadow and Mass of the Central Black Hole, Astrophys. J. Lett.
     875, L6 (2019); arXiv: 1906.11243.
 [9] K. Akiyama et al. (the Event Horizon Telescope Collaboration): The persistent shadow of
     the supermassive black hole of M 87. I. Observations, calibration, imaging, and analysis,
     Astron. Astrophys. 681, A79 (2024).
[10] M. Alcubierre: Introduction to 3+1 Numerical Relativity, Oxford Univ. Press, Oxford (2008)
     DOI: 10.1093/acprof:oso/9780199205677.001.0001
[11] S. Alexakis, A. D. Ionescu, and S. Klainerman: Rigidity of stationary black holes with small
     angular momentum on the horizon, Duke Math. J. 163, 2603 (2014); arXiv: 1304.0487.
[12] W.L. Ames and K. Thorne: The Optical Appearance of a Star that is Collapsing Through its
     Gravitational Radius, Astrophys. J. 151, 659 (1968).
788                                      BIBLIOGRAPHY
      [13] L. Andersson, T. Bäckdahl, and P. Blue: Geometry of Black Hole Spacetimes, in Asymptotic
           Analysis in General Relativity, edited by T. Daudé, D. Häfner and J.-P. Nicolas, Cambridge
           Univ. Press, Cambridge (2018), p. 9; DOI: 10.1017/9781108186612.002; arXiv: 1610.03540.
      [14] N. Andersson: Gravitational-Wave Astronomy: Exploring the Dark Side of the Universe,
           Oxford Univ. Press, Oxford (2020); DOI: 10.1093/oso/9780198568032.001.0001.
      [15] S. Ansoldi: Spherical black holes with regular center: a review of existing models including
           a recent realization with Gaussian sources, in Proceedings of the second edition of the
           international workshop Dynamics and Thermodynamics of Blackholes and Naked Sin-
           gularities, edited by L. Fatibene, M. Francaviglia, R. Giambò and G. Magli, published as
           arXiv:1205.2839 (2008); arXiv: 0802.0330.
      [16] T. Anson, E. Babichev, C. Charmousis and M. Hassaine: Disforming the Kerr metric,
           J. High Energ. Phys. 01(2021), 18 (2021); arXiv: 2006.06461.
      [17] S. Aretakis: Nonlinear instability of scalar fields on extremal black holes, Phys. Rev. D 87,
           084052 (2013); arXiv: 1304.4616.
      [18] S. Aretakis: Horizon instability of extremal black holes, Adv. Theor. Math. Phys. 19, 507
           (2015); arXiv: 1206.6598.
      [19] S. Aretakis: Dynamics of Extremal Black Holes, Springer, Cham (2018); DOI: 10.1007/978-
           3-319-95183-6.
      [20] R. Arnowitt, S. Deser and C.W. Misner: The dynamics of general relativity, in Gravitation:
           an introduction to current research, edited by L. Witten, Wiley, New York (1962), p. 227;
           reprinted in Gen. Relat. Grav. 40, 1997 (2008); arXiv: gr-qc/0405109.
      [21] A. Ashtekar, J. Baez, A. Corichi, and K. Krasnov: Quantum Geometry and Black Hole
           Entropy, Phys. Rev. Lett. 80, 904 (1998); arXiv: gr-qc/9710007.
      [22] A. Ashtekar, C. Beetle, and J. Lewandowski: Mechanics of rotating isolated horizons, Phys.
           Rev. D 64, 044016 (2001); arXiv: gr-qc/0103026.
      [23] A. Ashtekar, C. Beetle, and J. Lewandowski: Geometry of generic isolated horizons, Class.
           Quantum Grav. 19, 1195 (2002); arXiv: gr-qc/0111067.
      [24] A. Ashtekar, L. Bombelli, and O. Reula: The Covariant Phase Space of Asymptotically
           Flat Gravitational Fields, in Mechanics, Analysis and Geometry: 200 Years After Lagrange,
           edited by M. Francaviglia, North Holland, Amsterdam (1991), p. 417; DOI: 10.1016/B978-
           0-444-88958-4.50021-5.
      [25] A. Ashtekar, S. Fairhurst, and B. Krishnan: Isolated horizons: Hamiltonian evolution and
           the first law, Phys. Rev. D 62, 104025 (2000); arXiv: gr-qc/0005083.
      [26] A. Ashtekar, N. Khera, M. Kolanowski, and J. Lewandowski: Non-expanding horizons: mul-
           tipoles and the symmetry group, J. High Energ. Phys. 2022, 28 (2022); arXiv: 2111.07873.
      [27] A. Ashtekar and B. Krishnan: Isolated and dynamical horizons and their applications,
           Living Rev. Relativ. 7, 10 (2004).
      [28] A. Ashtekar and A. Magnon-Ashtekar: On conserved quantities in general relativity,
           J. Math. Phys. 20, 793 (1979).
      [29] E. Ayón-Beato and A. García: The Bardeen model as a nonlinear magnetic monopole, Phys.
           Lett. B 493, 149 (2000); arXiv: gr-qc/0009077.
                                         BIBLIOGRAPHY                                                789
[30] C. Bambi: Black Holes: A Laboratory for Testing Strong Gravity, Springer, Singapore
     (2017); DOI: 10.1007/978-981-10-4524-0.
[31] C. Bambi (Ed.) : Regular Black Holes: Towards a New Paradigm of Gravitational Collapse,
     Springer, Singapore (2023); DOI: 10.1007/978-981-99-1596-5.
[32] R. Banerjee, B.R. Majhi, S.K. Modak, and S. Samanta: Killing symmetries and Smarr
     formula for black holes in arbitrary dimensions, Phys. Rev. D 82, 124002 (2010); arXiv:
     1007.5204.
[33] J.F. Barbero G. and D. Pranzetti: Black Hole Entropy in Loop Quantum Gravity, in Handbook
     of Quantum Gravity, edited by C. Bambi, L. Modesto, and I. Shapiro, I., Springer, Singapore
     (2023); DOI: 10.1007/978-981-19-3079-9_104-1; arXiv: 2212.13469.
[34] G.V. Bard: Sage for Undergraduates, Americ. Math. Soc. (2015); preprint freely available
     from http://www.gregory-bard.com/Sage.html.
[35] J.M. Bardeen: Non-singular general relativistic gravitational collapse, in Abstracts of the
     5th international conference on gravitation and the theory of relativity, edited by V.A. Fock
     et al., Tbilisi University Press (1968).
[36] J.M. Bardeen: Kerr Metric Black Holes, Nature 226, 64 (1970).
[37] J.M. Bardeen: A Variational Principle for Rotating Stars in General Relativity, Astrophys. J.
     162, 71 (1970).
[38] J.M. Bardeen: Timelike and Null Geodesics in the Kerr Metric, in Black Holes – Les astres
     occlus, edited by C. DeWitt and B. DeWitt, Gordon and Breach, New York (1973), p. 215.
[39] J.M. Bardeen: Rapidly Rotating Stars, Disks, and Black Holes, in Black Holes – Les astres
     occlus, edited by C. DeWitt and B. DeWitt, Gordon and Breach, New York (1973), p. 241.
[40] J.M. Bardeen, B. Carter, and S.W. Hawking: The Four Laws of Black Hole Mechanics,
     Commun. Math. Phys., 31, 161 (1973).
[41] J. Bardeen and G.T. Horowitz: Extreme Kerr throat geometry: A vacuum analog of AdS 2 ×
     S 2 , Phys. Rev. D 60, 104030 (1999); arXiv: hep-th/9905099.
[42] J.M. Bardeen, W.H. Press, and S.A. Teukolsky: Rotating Black Holes: Locally Nonrotating
     Frames, Energy Extraction, and Scalar Synchrotron Radiation, Astrophys. J. 178, 347 (1972).
[43] H. Barzegar, P.T. Chruściel, and M. Hörzinger: Energy in higher-dimensional spacetimes,
     Phys. Rev. D 96, 124002 (2017); arXiv: 1708.03122.
[44] T.W. Baumgarte and S.L. Shapiro: Numerical Relativity: Solving Einstein’s Equations on the
     Computer, Cambridge Univ. Press, Cambridge (2010); DOI: 10.1017/CBO9781139193344
[45] C.A.B. Bayona and N.R.F. Braga: Anti-de Sitter boundary in Poincaré coordinates, Gen.
     Relativ. Gravit. 39, 1367 (2007); arXiv: hep-th/0512182.
[46] D.L. Beckedorff: Terminal Configurations of Stellar Evolution, undergraduate senior
     thesis, Princeton University (1962); URL: http://arks.princeton.edu/ark:/88435/
     dsp019593tx34r.
[47] K. Beckwith and C. Done: Extreme gravitational lensing near rotating black holes, Month.
     Not. Roy. Astron. Soc. 359, 1217 (2005); arXiv: astro-ph/0411339.
[48] J.K. Beem, P.E. Ehrlich and K.L. Easley: Global Lorentzian Geometry, 2nd edition, Marcel
     Dekker, New York (1996); DOI: 10.1201/9780203753125.
790                                      BIBLIOGRAPHY
      [49] R. Beig: Arnowitt-Deser-Misner energy and g00 , Phys. Lett. A 69, 153 (1978).
      [50] J.D. Bekenstein: Black holes and the second law, Lett. Nuovo Cimento 4, 737 (1972).
      [51] J.D. Bekenstein: Extraction of Energy and Charge from a Black Hole, Phys. Rev. D 7, 949
           (1973).
      [52] J.D. Bekenstein: Black Holes and Entropy, Phys. Rev. D 7, 2333 (1973).
      [53] J.D. Bekenstein: Generalized second law of thermodynamics in black-hole physics, Phys.
           Rev. D 9, 3292 (1974).
      [54] J. Ben Achour, H. Liu, H. Motohashi, S. Mukohyama, and K. Noui: On rotating black holes
           in DHOST theories, J. Cosmo. Astropart. Phys. 11(2020), 001 (2020); arXiv: 2006.07245.
      [55] R. Bender et al.: HST STIS Spectroscopy of the Triple Nucleus of M31: Two Nested Disks
           in Keplerian Rotation around a Supermassive Black Hole, Astrophys. J. 631, 280 (2005);
           arXiv: astro-ph/0509839.
      [56] I. Bengtsson, E. Jakobsson, and J.M.M. Senovilla: Trapped surfaces in Oppenheimer-Snyder
           black holes Phys. Rev. D 88, 064012 (2013)
           DOI: 10.1103/PhysRevD.88.064012                                          arXiv: 1306.6486
      [57] I. Bengtsson and P. Sandin: Anti-de Sitter space, squashed and stretched, Class. Quantum
           Grav. 23, 971 (2006); arXiv: gr-qc/0509076.
      [58] I. Bengtsson and J.M.M. Senovilla: Note on trapped surfaces in the Vaidya solution, Phys.
           Rev. D 79, 024027 (2009); arXiv: 0809.2213.
      [59] M. Berger: A Panoramic View of Riemannian Geometry, Springer, Berlin (2003).
      [60] J. Bičák and Z. Stuchlík: On the Latitudinal and Radial Motion in the Field of a Rotating
           Black Hole, Bul. Astron. Inst. Czechoslovakia 27, 129 (1976).
      [61] D. Bini, A. Geralico, and R.T. Jantzen: Gyroscope precession along general timelike geodesics
           in a Kerr black hole spacetime, Phys. Rev. D 95, 124022 (2017); arXiv: 1703.09525.
      [62] G.D. Birkhoff: Relativity and Modern Physics, Harvard University Press, Cambridge (1923).
      [63] R.L. Bishop and S.I. Goldberg: Tensor Analysis on Manifolds, Macmillan, New York (1968);
           reprinted by Dover, New York (1980).
      [64] G.S. Bisnovatyi-Kogan and O.Yu. Tsupko: Strong gravitational lensing by Schwarzschild
           black holes, Astrofizika 51, 125 (2008); English translation in Astrophysics 51, 99 (2008);
           arXiv: 0803.2468.
      [65] L. Blanchet: Post-Newtonian theory for gravitational waves, Living Rev. Relativ. 27, 4
           (2024).
      [66] H. Bondi: Spherically Symmetrical Models in General Relativity, Month. Not. Roy. Astron.
           Soc. 107, 410 (1947); reprinted in Gen. Relativ. Gravit. 31, 1783 (1999).
      [67] I. Booth, L. Brits, J.A Gonzalez, and C. Van Den Broeck: Marginally trapped tubes and
           dynamical horizons, Class. Quantum Grav. 23, 413 (2005); arXiv: 0506119.
      [68] I. Booth and S. Fairhurst: Isolated, slowly evolving, and dynamical trapping horizons:
           Geometry and mechanics from surface deformations, Phys. Rev. D 75, 084019 (2007); arXiv:
           gr-qc/0610032.
                                         BIBLIOGRAPHY                                                791
[69] A. Borde: Open and closed universes, initial singularities, and inflation, Phys. Rev. D 50,
     3692 (1994)
     DOI: 10.1103/PhysRevD.50.3692                                          arXiv: gr-qc/9403049
[70] A. Borde: Regular black holes and topology change, Phys. Rev. D 55, 7615 (1997); arXiv:
     gr-qc/9612057
[71] R.H. Boyer: Geodesic Killing orbits and bifurcate Killing horizons, Proc. Roy. Soc. Lond. A
     311, 245 (1969).
[72] R.H. Boyer and R.W. Lindquist: Maximal Analytic Extension of the Kerr Metric, J. Math.
     Phys. 8, 265 (1967).
[73] R.H. Boyer and T.G. Price: An interpretation of the Kerr metric in general relativity, Math.
     Proc. Cambridge Phil. Soc. 61, 531 (1965).
[74] D. Brill: History of a black hole horizon, Grav. Cosmo. 20, 165 (2014).
[75] R. Brout, S. Massar, R. Parentani, and P. Spindel: A Primer for Black Hole Quantum Physics,
     Phys. Rep. 260, 329 (1995); arXiv: 0710.4345.
[76] G.L. Bunting: Proof of the uniqueness conjecture for black holes, Ph.D. thesis, University of
     New England, Armidale, Australia (1983).
[77] G.L. Bunting and A.K.M. Masood-ul-Alam: Nonexistence of multiple black holes in asymp-
     totically Euclidean static vacuum space-time, Gen. Relat. Grav. 19, 147 (1987).
[78] L.M. Burko, G. Khanna, and A. Zenginoğlu: Cauchy-horizon singularity inside perturbed
     Kerr black holes, Phys. Rev. D 93, 041501(R) (2016); arXiv: 1601.05120.
[79] P.F. Byrd and M.D. Friedman: Handbook of Elliptic Integrals for Engineers and Scientists,
     2nd ed., Springer (Berlin) (1971); DOI: 10.1007/978-3-642-65138-0
[80] A. Čadež and U. Kostić: Optics in the Schwarzschild spacetime, Phys. Rev. D 72, 104024
     (2005); arXiv: gr-qc/0405037.
[81] L.-M. Cao: Deformation of codimension-2 surfaces and horizon thermodynamics, J. High
     Energ. Phys. 2011, 112 (2011); arXiv: 1009.4540.
[82] L. Capuano, L. Santoni, and E. Barausse: Black hole hairs in scalar-tensor gravity and the
     lack thereof, Phys. Rev. D 108, 064058 (2023); arXiv: 2304.12750.
[83] V. Cardoso: The hole picture, Nature Review Phys. 1, 701 (2019); arXiv: 1910.04173.
[84] V. Cardoso, F. Duque, and A. Foschi: The light ring and the appearance of matter accreted
     by black holes, Phys. Rev. D 103, 104044 (2021); arXiv: 2102.07784.
[85] S. Carlip: Black hole thermodynamics, Int. J. Mod. Phys. D 23, 1430023 (2014); arXiv:
     1410.1486.
[86] B. Carr, K. Kohri, Y. Sendouda, and J. Yokoyama: Constraints on primordial black holes,
     Rep. Prog. Phys. 84, 116902 (2021); arXiv: 2002.12778.
[87] S.M. Carroll: Spacetime and Geometry: An Introduction to General Relativity, Addison Wes-
     ley (Pearson Education), San Fransisco (2004); https://www.preposterousuniverse.
     com/spacetimeandgeometry/.
[88] B. Carter: Complete Analytic Extension of the Symmetry Axis of Kerr’s Solution of Einstein’s
     Equations, Phys. Rev. 141, 1242 (1966).
792                                       BIBLIOGRAPHY
       [89] B. Carter: Stationary Axisymmetric Systems in General Relativity, Ph. D. Thesis, Cambridge
            University (1967); available at https://luth.obspm.fr/~luthier/carter/Thesis/.
       [90] B. Carter: Global Structure of the Kerr Family of Gravitational Fields, Phys. Rev. 174, 1559
            (1968).
       [91] B. Carter: Hamilton-Jacobi and Schrodinger Separable Solutions of Einstein’s Equations,
            Commun. Math. Phys. 10, 280 (1968).
       [92] B. Carter: Killing horizons and orthogonally transitive groups in space-time, J. Math. Phys.
            10, 70 (1969).
       [93] B. Carter: Axisymmetric Black Hole Has Only Two Degrees of Freedom, Phys. Rev. Lett. 26,
            331 (1971).
       [94] B. Carter: Rigidity of a Black Hole, Nature Physical Science 238, 71 (1972).
       [95] B. Carter: Black hole equilibrium states Part I. Analytic and Geometric Properties of the Kerr
            Solutions, in Black Holes – Les astres occlus, edited by C. DeWitt and B. DeWitt, Gordon
            and Breach, New York (1973), p. 57; reprinted (with corrections) in Gen. Relativ. Gravit.
            41, 2873 (2009).
       [96] B. Carter: Black hole equilibrium states Part II. General theory of stationary black hole
            states, in Black Holes – Les astres occlus, edited by C. DeWitt and B. DeWitt, Gordon and
            Breach, New York (1973), p. 125; reprinted (with corrections) in Gen. Relativ. Gravit. 42,
            653 (2010).
       [97] B. Carter: Perfect fluid and magnetic field conservation laws in the theory of black hole
            accretion rings, in Active Galactic Nuclei, edited by C. Hazard and S. Mitton, Cambridge
            Univ. Press, Cambridge (1979), p. 273; available at https://luth.obspm.fr/~luthier/
            carter/trav/Carter79.pdf.
       [98] B. Carter: Generalised mass variation formula for a stationary axisymmetric star or black
            hole with surrounding accretion disc, in Journées Relativistes 1979, edited by I. Moret-Bailly
            and C. Latrémolière, Faculté des Sciences, Angers (1979), p. 166; arXiv: astro-ph/0206113.
       [99] B. Carter: The general theory of the mechanical, electromagnetic and thermodynamic
            properties of black holes, in General Relativity: An Einstein Centenary Survey, edited by
            S.W. Hawking and W. Israel, Cambridge Univ. Press, Cambridge (1979), p. 294.
      [100] B. Carter: Bunting identity and Mazur identity for non-linear elliptic systems including the
            black hole equilibrium problem, Commun. Math. Phys. 99, 563 (1985).
      [101] B. Carter: Mathematical foundations of the theory of relativistic stellar and black hole
            configurations, in Gravitation in Astrophysics, edited by B. Carter and J.B. Hartle, Plenum
            Press, New York (1987), p. 63.
      [102] B. Carter: Extended tensorial curvature analysis for embeddings and foliations, Contemp.
            Math. 203, 207 (1997).
      [103] B. Carter: Has the black hole equilibrium problem been solved?, in The Eighth Marcel
            Grossmann Meeting, edited by T. Piran, World Scientific, Singapore (1999), p. 136; arXiv:
            gr-qc/9712038.
                                          BIBLIOGRAPHY                                                793
[104] B. Carter: Energy dominance and the Hawking Ellis vacuum conservation theorem, in The
      Future of Theoretical Physics and Cosmology — Celebrating Stephen Hawking’s Contribu-
      tions to Physics, edited by G.W. Gibbons, E.P.S. Shellard, and S.J. Rankin, Cambridge Univ.
      Press, Cambridge (2003), p. 177; arXiv: gr-qc/0205010.
[105] B. Carter: Half Century of Black-Hole Theory: From Physicists’ Purgatory to Mathemati-
      cians’ Paradise, in A Century of Relativity Physics: ERE 2005; XXVIII Spanish Relativity
      Meeting, edited by L. Mornas and J. Diaz Alonso, AIP Conference Proceedings 841, 29
      (2006); arXiv: gr-qc/0604064.
[106] M. Casals, S.E. Gralla, and P. Zimmerman: Horizon instability of extremal Kerr black holes:
      Nonaxisymmetric modes and enhanced growth rate, Phys. Rev. D 94, 064003 (2016); arXiv:
      1606.08505.
[107] S. Chandrasekhar: The Mathematical Theory of Black Holes, Oxford Univ. Press, New
      York (1983); DOI: 10.1093/oso/9780198503705.001.0001.
[108] Y. Choquet-Bruhat: General Relativity and Einstein’s Equations, Oxford Univ. Press, New
      York (2009); DOI: 10.1093/acprof:oso/9780199230723.001.0001.
[109] Y. Choquet-Bruhat: Introduction to General Relativity, Black Holes, and Cosmology, Oxford
      Univ. Press, New York (2015); DOI: 10.1093/oso/9780199666454.001.0001.
[110] Y. Choquet-Bruhat, C. De Witt-Moretten, and M. Dillard-Bleick: Analysis, Manifolds and
      Physics, North-Holland, Amsterdam (1977).
[111] Y. Choquet-Bruhat and R. Geroch: Global aspects of the Cauchy problem in general
      relativity, Commun. Math. Phys. 14, 329 (1969).
[112] D. Christodoulou: Reversible and Irreversible Transformations in Black-Hole Physics, Phys.
      Rev. Lett. 25, 1596 (1970).
[113] D. Christodoulou: The Formation of Black Holes in General Relativity, European Mathe-
      matical Society, Zürich (2009); DOI: 10.4171/068; arXiv: 0805.3880.
[114] D. Christodoulou and R. Ruffini: Reversible Transformations of a Charged Black Hole, Phys.
      Rev. D 4, 3552 (1971).
[115] P.T. Chruściel: “No Hair” Theorems – Folklore, Conjectures, Results, in Differential Geometry
      and Mathematical Physics, Contemporary Mathematics 170; edited by J.K. Beem and K.L.
      Duggal, American Mathematical Society, Providence (1994), p. 23; arXiv: gr-qc/9402032.
[116] P.T. Chruściel: On Rigidity of Analytic Black Holes, Commun. Math. Phys. 189, 1 (1997);
      arXiv: gr-qc/9610011.
[117] P.T. Chruściel: Black holes, in Proceedings of the Tübingen conference on conformal structure
      of space-time, edited by J. Frauendiener, H. Friedrich, Lecture Notes in Physics 604, 61,
      Springer, Berlin (2002); arXiv: gr-qc/0201053.
[118] P.T. Chruściel: Black holes – An Introduction, in 100 Years of Relativity, edited by A.
      Ashtekar, World Scientific, Singapore (2005), p. 93; DOI: 10.1142/9789812700988_0004.
[119] P.T. Chruściel: Geometry of Black Holes, Oxford Univ. Press, Oxford (2020); https://
      global.oup.com/academic/product/geometry-of-black-holes-9780198855415.
[120] P.T. Chruściel, E. Delay, G. Galloway, and R. Howard: Regularity of horizons and the area
      theorem, Annales Henri Poincaré 2, 109 (2001); arXiv: gr-qc/0001003.
794                                      BIBLIOGRAPHY
      [121] P.T. Chruściel and G.J. Galloway: Uniqueness of static black holes without analyticity,
            Class. Quantum Grav. 27, 152001 (2010); arXiv: 1004.0513.
      [122] P.T. Chruściel and J. Lopes Costa: On uniqueness of stationary vacuum black holes,
            Astérisque 321, 195 (2008); arXiv: 0806.0016.
      [123] P.T. Chruściel, J. Lopes Costa, and M. Heusler: Stationary Black Holes: Uniqueness and
            Beyond, Living Rev. Relativ. 15, 7 (2012).
      [124] P.T. Chruściel, M. Maliborski, and N. Yunes: Structure of the singular ring in Kerr-like
            metrics, Phys. Rev. D 101, 104048 (2020); arXiv: 1912.06020.
      [125] P.T. Chruściel and L. Nguyen: A Uniqueness Theorem for Degenerate Kerr–Newman Black
            Holes, Ann. Henri Poincaré 11, 585 (2010); arXiv: 1002.1737.
      [126] P.T. Chruściel, C.R. Ölz, and S.J Szybka: Space-time diagrammatics, Phys. Rev. D 86,
            124041 (2012); arXiv: 1211.1718.
      [127] P.T. Chruściel, H.S. Reall and P. Tod: On non-existence of static vacuum black holes with
            degenerate components of the event horizon, Class. Quantum Grav. 23, 549 (2006); arXiv:
            gr-qc/0512041.
      [128] P.T. Chruściel and P. Tod: The Classification of Static Electro–Vacuum Space–Times Con-
            taining an Asymptotically Flat Spacelike Hypersurface with Compact Interior, Commun.
            Math. Phys. 271, 577 (2007); arXiv: gr-qc/0512043.
      [129] P.T. Chruściel and R.M. Wald: Maximal hypersurfaces in stationary asymptotically flat
            spacetimes, Commun. Math. Phys. 163, 561 (1994); arXiv: gr-qc/9304009.
      [130] P.T. Chruściel and R.M. Wald: On the topology of stationary black holes, Class. Quantum
            Grav. 11, L147 (1994); arXiv: gr-qc/9410004.
      [131] T. Chu, H.P. Pfeiffer, and M.I. Cohen: Horizon dynamics of distorted rotating black holes,
            Phys. Rev. D 83, 104018 (2011); arXiv: 1011.2601.
      [132] M.I. Cohen, J.D. Kaplan, and M.A. Scheel: Toroidal horizons in binary black hole inspirals,
            Phys. Rev. D 85, 024031 (2012); arXiv: 1110.1668.
      [133] M.I. Cohen, H.P. Pfeiffer, and M.A. Scheel: Revisiting event horizon finders, Class. Quantum
            Grav. 26, 035005 (2009); arXiv: 0809.2628.
      [134] P. Collas and D. Klein: Embeddings and time evolution of the Schwarzschild wormhole,
            Amer. J. Phys. 80, 203 (2012); arXiv: 1107.4871.
      [135] S. Collin-Zahn: Des quasars aux trous noirs, EDP Sciences, Les Ulis (2009).
      [136] G. Compère: An introduction to the mechanics of black holes, Lecture notes, Second
            Modave Summer School in Mathematical Physics (2006); arXiv: gr-qc/0611129.
      [137] G. Compère: The Kerr/CFT correspondence and its extensions, Living Rev. Relativ. 20, 1
            (2017).
      [138] G. Compère: Advanced Lectures on General Relativity, Springer, Cham (2019); DOI:
            10.1007/978-3-030-04260-8; arXiv: 1801.07064.
      [139] G. Compère and A. Druart: Near-horizon geodesics of high spin black holes, Phys. Rev. D
            101, 084042 (2020); erratum in Phys. Rev. D 102, 029901 (2020); arXiv: 2001.03478.
                                          BIBLIOGRAPHY                                              795
[140] G. Compère, K. Fransen, T. Hertog, and J. Long: Gravitational waves from plunges into
      Gargantua, Class. Quantum Grav. 35, 104002 (2018); erratum in Class. Quantum Grav.
      36, 179501 (2019); arXiv: 1712.07130.
[141] G. Contopoulos: Orbits through the ergosphere of a Kerr black hole, Gen. Relat. Grav. 16,
      43 (1984).
[142] R.M. Corless, G.H. Gonnet, D.E.G. Hare, D.J. Jeffrey, and D.E. Knuth: On the Lambert W
      function, Adv. Comput. Math. 5, 329 (1996).
[143] A. Coudray and J.-P. Nicolas: Geometry of Vaidya spacetimes, 1Gen. Relat. Grav. 53, 73
      (2021); arXiv: 2101.06544.
[144] P.V.P. Cunha and C.A.R. Herdeiro: Shadows and strong gravitational lensing: a brief review,
      Gen. Relat. Grav. 50, 42 (2018); arXiv: 1801.00860.
[145] P.V.P. Cunha, C.A.R. Herdeiro and E. Radu: Fundamental photon orbits: Black hole shadows
      and spacetime instabilities, Phys. Rev. D 96, 024039 (2017); arXiv: 1705.05461.
[146] C.T. Cunningham and J.M. Bardeen: The Optical Appearance of a Star Orbiting an Extreme
      Kerr Black Hole Astrophys. J. 183, 237 (1973).
[147] E. Curiel: A Potted Early History of “black hole”, appendix to K. Landsman’s article [334]
      (2021).
[148] H.D. Curtis: Descriptions of 762 Nebulae and Clusters Photographed with the Crossley
      Reflector, Publications of Lick Observatory 13, 9 (1918).
[149] M. Dafermos and I. Rodnianski: Lecture on black holes and linear waves, in Clay
      Math. Proc. 17: Evolution Equations, edited by D. Ellwood, I. Rodnianski, G.
      Staffilani and J. Wunsch, AMS / Clay Mathematical Institute, Cambridge (2013),
      p. 97; available at https://people.maths.ox.ac.uk/cmi/library/proceedings/
      cmip017c.pdf; arXiv: 0811.0354.
[150] T. Damour: Black-hole eddy currents, Phys. Rev. D 18, 3598 (1978).
[151] T. Damour: Quelques propriétés mécaniques, électromagnétiques, thermodynamiques et
      quantiques des trous noirs, Thèse de doctorat d’État, Université Paris 6 (1979); available
      at https://pagesperso.ihes.fr/~damour/Articles/.
[152] T. Damour: Surface effects in black hole physics, in Proceedings of the Second Marcel
      Grossmann Meeting on General Relativity, edited by R. Ruffini, North Holland, Amsterdam
      (1982), p. 587; available at https://pagesperso.ihes.fr/~damour/Articles/.
[153] T. Damour: The Entropy of Black Holes: A Primer, in Poincaré Seminar 2003: Bose-Einstein
      Condensation – Entropy, edited by J. Dalibard, B. Duplantier and V. Rivasseau, Birkhaäuser,
      Basel (2004), p. 227; URL: https://link.springer.com/book/9783764371067; arXiv:
      hep-th/0401160.
[154] G. Darmois: Les équations de la gravitation einsteinienne, Mémorial des sciences mathé-
      matiques, no. 25 (1927); available at https://www.numdam.org/item/?id=MSM_1927_
      _25__1_0.
[155] C. G. Darwin: The gravity field of a particle, Proc. Roy. Soc. Lond. A 249, 180 (1959).
[156] C. G. Darwin: The gravity field of a particle. II, Proc. Roy. Soc. Lond. A 263, 39 (1961).
796                                      BIBLIOGRAPHY
      [157] B. Datt: On a Class of Solutions of the Gravitation Equations of Relativity, Zeitschrift für
            Physik 108, 314 (1938); reprinted in Gen. Relativ. Gravit. 31, 1619 (1999).
      [158] I. Davies and H.S. Reall: Nonperturbative Second Law of Black Hole Mechanics in Effective
            Field Theory, Phys. Rev. Lett. 132, 171402 (2024); arXiv: 2312.07659.
      [159] F. de Felice: Equatorial Geodesic Motion in the Gravitational Field of a Rotating Source,
            Nuovo Cimento B 57, 351 (1968).
      [160] C. De Jans: Sur le mouvement d’une particule matérielle dans un champ de gravitation à
            symétrie sphérique, Mem. Acad. Roy. Belgique Cl. Sci. 7, 1 (1923).
      [161] N. Deruelle: Black Holes in General Relativity, lectures at Institut de Physique Théorique,
            CEA, Saclay (France) (2009); slides and notes available at http://ipht.cea.fr/
            Docspht/search/article.php?id=t09/346.
      [162] N. Deruelle and J.-P. Uzan: Théories de la Relativité, Belin, Paris (2014).
      [163] N. Deruelle and J.-P. Uzan: Relativity in Modern Physics, Oxford Univ. Press, Oxford
            (2018); DOI: 10.1093/oso/9780198786399.001.0001.
      [164] C. DeWitt and B. DeWitt (eds.): Black Holes – Les astres occlus, Gordon and Breach, New
            York (1973).
      [165] J. Dexter and E. Agol: A Fast New Public Code for Computing Photon Orbits in a Kerr
            Spacetime, Astrophys. J. 696, 1616 (2009); arXiv: 0903.0620.
      [166] V.I. Dokuchaev: Is there life inside black holes?, Class. Quantum Grav. 28, 235015 (2011);
            arXiv: 1103.6140.
      [167] V.I. Dokuchaev: To see the invisible: Image of the event horizon within the black hole
            shadow, Int. J. Mod. Phys. D 28, 1941005 (2019); arXiv: 1812.06787.
      [168] V.I. Dokuchaev and N.O. Nazarova: Event Horizon Image within Black Hole Shadow, J. Exp.
            Theor. Phys. 128, 578 (2019); arXiv: 1804.08030.
      [169] V.I. Dokuchaev and N.O. Nazarova: Silhouettes of invisible black holes, Physics-Uspekhi
            63, 583 (2020); arXiv: 1911.07695.
      [170] V.I. Dokuchaev and N.O. Nazarova: Visible shapes of black holes M87* and SgrA*, Universe
            6, 154 (2020); arXiv: 2007.14121.
      [171] X. Dong: Holographic entanglement entropy for general higher derivative gravity, J. High
            Energ. Phys. 01(2014), 44 (2014); arXiv: 1310.5713.
      [172] A.G. Doroshkevich, Ya.B. Zeldovich and I.D. Novikov: Gravitational Collapse of Non-
            symmetric and Rotating Masses (in Russian), Zhurnal Éksperimental’noı̆ i Teoreticheskoı̆
            Fiziki 49, 170 (1965); English translation in Soviet Physics JETP 22, 122 (1966).
      [173] S. Drasco and S.A. Hughes: Rotating black hole orbit functionals in the frequency domain,
            Phys. Rev. D 69, 044015 (2004); arXiv: astro-ph/0308479.
      [174] S. Drasco and S.A. Hughes: Gravitational wave snapshots of generic extreme mass ratio
            inspirals, Phys. Rev. D 73, 024027 (2006); arXiv: gr-qc/0509101.
      [175] J. Droste: The Field of a Single Centre in Einstein’s Theory of Gravitation, and the Motion
            of a Particle in that Field, Koninklijke Nederlandsche Akademie van Wetenschappen,
            Proceedings 19, 197 (1917); reprinted in Gen. Relativ. Gravit. 34, 1545 (2002).
                                          BIBLIOGRAPHY                                                797
[176] D.M. Eardley and L. Smarr: Time functions in numerical relativity: Marginally bound dust
      collapse, Phys. Rev. D 19, 2239 (1979).
[177] A.S. Eddington: A comparison of Whitehead’s and Einstein’s Formulæ, Nature 113, 192
      (1924).
[178] C.A. Egan and C.H. Lineweaver: A Larger Estimate of the Entropy of the Universe, Astro-
      phys. J. 710, 1825 (2010); arXiv: 0909.3983.
[179] A. Einstein and N. Rosen: The Particle Problem in the General Theory of Relativity, Phys.
      Rev. 48, 73 (1935).
[180] J. Eisenstaedt: Histoire et Singularités de la Solution de Schwarzschild (1915-1923), Archive
      for History of Exact Sciences 27, 157 (1982).
[181] J. Eisenstaedt: Trajectoires et Impasses de la Solution de Schwarzschild, Archive for History
      of Exact Sciences 37, 275 (1987).
[182] J. Eisenstaedt: Lemaître and the Schwarzschild Solution, in The Attraction of Gravitation:
      New Studies in the History of General Relativity, Proc. Third International Conference
      on the History and Philosophy of General Relativity. Einstein Studies, Vol. 5, edited by
      J. Earman, M. Janssen, and J.D. Norton, Birkhäuser, Boston (1993), p. 353.
[183] G.F.R. Ellis: Relativistic cosmology, in Proceedings of the International School of Physics
      “Enrico Fermi”, Course 47: General relativity and cosmology, edited by B. K. Sachs, Academic
      Press, New York and London (1971), p. 104; reprinted in Gen. Relat. Grav. 41, 581 (2009).
[184] H. Elvang and P. Figueras: Black saturn, J. High Energ. Phys. 05(2007), 050 (2007); arXiv:
      hep-th/0701035.
[185] R. Emparan, S. Ohashi and T. Shiromizu: No-dipole-hair theorem for higher-dimensional
      static black holes, Phys. Rev. D 82, 084032 (2010); arXiv: 1007.3847.
[186] R. Emparan and H.S. Reall: A Rotating Black Ring Solution in Five Dimensions, Phys. Rev.
      Lett. 88, 101101 (2002); arXiv: hep-th/0110260.
[187] R. Emparan and H.S. Reall: Black Holes in Higher Dimensions, Living Rev. Relativ. 11, 6
      (2008).
[188] J. Engle, K. Noui, and A. Perez: Black Hole Entropy and SU(2) Chern-Simons Theory, Phys.
      Rev. Lett. 105, 031302 (2010); arXiv: 0905.3168.
[189] H. Eschrig: Topology and Geometry for Physics, Springer, Berlin (2011); DOI: 10.1007/978-
      3-642-14700-5.
[190] A. Fabbri and J. Navarro-Salas: Modeling Black Hole Evaporation, Imperial College Press,
      London, and World Scientific, Singapore (2005); DOI: 10.1142/p378.
[191] H. Falcke: Imaging black holes: past, present and future, J. Phys. Conf. Ser. 942, 012001
      (2017); arXiv: 1801.03298.
[192] H. Falcke, F. Melia, and E. Agol: Viewing the Shadow of the Black Hole at the Galactic
      Center, Astrophys. J. 528, L13 (2000); arXiv: astro-ph/9912263.
[193] J.R. Farah, D.W. Pesce, M.D. Johnson, and L. Blackburn: On the Approximation of the Black
      Hole Shadow with a Simple Polar Curve, Astrophys. J. 900, 77 (2020); arXiv: 2007.06732.
[194] F. Fayos and R. Torres: A class of interiors for Vaidya’s radiating metric: singularity-free
      gravitational collapse, Class. Quantum Grav. 25, 175009 (2008).
798                                      BIBLIOGRAPHY
      [195] D. Finkelstein: Past-Future Asymmetry of the Gravitational Field of a Point Particle, Phys.
            Rev. 110, 965 (1958).
      [196] A. Fischer and J. Marsden: The initial value problem and the dynamical formulation of
            general relativity, in General Relativity: an Einstein Centenary Survey, edited by S.W.
            Hawking and W. Israel, Cambridge Univ. Press, Cambridge (1979), p. 138.
      [197] L. Flamm: Beiträge zur Einsteinschen Gravitationstheorie, Physikalische Zeitschrift 17,
            448 (1916); English translation in Gen. Relativ. Gravit. 47, 72 (2015).
      [198] J. Frauendiener: Conformal Infinity, Living Rev. Relativ. 7, 1 (2004).
      [199] J.L. Friedman, K. Schleich, and D.M. Witt: Topological censorship, Phys. Rev. Lett. 71, 1486
            (1993); erratum in Phys. Rev. Lett. 75, 1872 (1995); arXiv: gr-qc/9305017.
      [200] H. Friedrich, I. Rácz, and R.M. Wald: On the Rigidity Theorem for Spacetimes with a
            Stationary Event Horizon or a Compact Cauchy Horizon, Commun. Math. Phys. 204, 691
            (1999); arXiv: gr-qc/9811021.
      [201] V.P. Frolov, P. Krtouš, and D. Kubizňák: Black holes, hidden symmetries, and complete
            integrability, Living Rev. Relativ. 20, 6 (2017).
      [202] V.P. Frolov and I.D. Novikov: Black Hole Physics: Basic Concepts and New Developments,
            Kluwer Academic Publisher, Dordrecht (1998); reprinted by Springer, Dordrecht (1998);
            DOI: 10.1007/978-94-011-5139-9.
      [203] V.P. Frolov and K.S. Thorne: Renormalized stress-energy tensor near the horizon of a slowly
            evolving, rotating black hole Phys. Rev. D 39, 2125 (1989).
      [204] V.P. Frolov and A. Zelnikov: Introduction to Black Hole Physics, Oxford Univ. Press, New
            York (2011); DOI: 10.1093/acprof:oso/9780199692293.001.0001.
      [205] C. Fronsdal: Completion and Embedding of the Schwarzschild Solution, Phys. Rev. 116,
            778 (1959).
      [206] R. Fujita & W. Hikida: Analytical solutions of bound timelike geodesic orbits in Kerr
            spacetime, Class. Quantum Grav. 26, 135002 (2009); arXiv: 0906.1420.
      [207] D. Gajic: Azimuthal instabilities on extremal Kerr, submitted (2023); arXiv: 2302.06636.
      [208] G.J. Galloway: Least area tori, black holes and topological censorship, in Differential
            Geometry and Mathematical Physics, Contemporary Mathematics 170; edited by J.K. Beem
            and K.L. Duggal, American Mathematical Society, Providence (1994), p. 113.
      [209] G.J. Galloway: Null Geometry and the Einstein Equations, in The Einstein Equations and
            the Large Scale Behavior of Gravitational Fields, edited by P.T. Chruściel and H. Friedrich,
            Birkhäuser, Basel (2004), p. 379; DOI: 10.1007/978-3-0348-7953-8_11; available at https:
            //www.math.miami.edu/~galloway/papers/cargproc.pdf.
      [210] G.J. Galloway and R. Schoen: A Generalization of Hawking’s Black Hole Topology Theorem
            to Higher Dimensions, Commun. Math. Phys. 266, 571 (2006); arXiv: gr-qc/0509107.
      [211] G.J. Galloway and J.M.M. Senovilla: Singularity theorems based on trapped submanifolds
            of arbitrary co-dimension, Class. Quantum Grav. 27, 152002 (2010); arXiv: 1005.1249.
      [212] S. Gao and R.M. Wald: “Physical process version” of the first law and the generalized
            second law for charged and rotating black holes, Phys. Rev. D 64, 084020 (2001); arXiv:
            gr-qc/0106071.
                                         BIBLIOGRAPHY                                                799
[213] D. Garfinkle and E.N. Glass: Bertotti-Robinson and Melvin spacetimes, Class. Quantum
      Grav. 28, 215012 (2011); arXiv: 1109.1535.
[214] K. Gebhardt, J. Adams, D. Richstone, T.R. Lauer, S.M. Faber, K. Gültekin, J. Murphy, and
      S. Tremaine: The Black Hole Mass in M87 from Gemini/NIFS Adaptive Optics Observations,
      Astrophys. J. 729, 119 (2011); arXiv: 1101.1954.
[215] R. Genzel, F. Eisenhauer and S. Gillessen: The Galactic Center massive black hole and
      nuclear star cluster, Rev. Mod. Phys. 82, 3121 (2010); arXiv: 1006.0064.
[216] R. Geroch: What is a singularity in general relativity? Ann. Phys. 48, 526 (1968).
[217] R. Geroch and J.B. Hartle: Distorted black holes, J. Math. Phys. 23, 680 (1982).
[218] G.W. Gibbons, D. Ida and T. Shiromizu: Uniqueness of (dilatonic) charged black holes and
      black p-branes in higher dimensions, Phys. Rev. D 66, 044010 (2002); arXiv: hep-th/0206136.
[219] G.W. Gibbons, D. Ida and T. Shiromizu: Uniqueness and Non-Uniqueness of Static Vacuum
      Black Holes in Higher Dimensions, Prog. Theor. Phys. Suppl. 148, 284 (2002); arXiv:
      gr-qc/0203004.
[220] G.W. Gibbons and M. Vyska: The application of Weierstrass elliptic functions to
      Schwarzschild null geodesics, Class. Quantum Grav. 29, 065016 (2012); arXiv: 1110.6508.
[221] G. Gibbons and C.M. Will: On the Multiple Deaths of Whitehead’s Theory of Gravity,
      Studies in History and Philosophy of Science Part B: Studies in History and Philosophy
      of Modern Physics 39, 41 (2008); arXiv: gr-qc/0611006.
[222] V.L. Ginzburg: The Magnetic Fields of Collapsing Masses and the Nature of Superstars (in
      Russian), Doklady Akademii Nauk SSSR 156, 43 (1964); English translation in Soviet
      Physics Doklady 9, 329 (1964).
[223] K. Glampedakis and D. Kennefick: Zoom and whirl: Eccentric equatorial orbits around
      spinning black holes and their evolution under gravitational radiation reaction, Phys. Rev.
      D 66, 044002 (2002); arXiv: gr-qc/0203086.
[224] K. Glampedakis and G. Pappas: Can supermassive black hole shadows test the Kerr metric?,
      Phys. Rev. D 104, L081503 (2021); arXiv: 2102.13573.
[225] L. Gou, J.E. McClintock, R.A. Remillard, J.F. Steiner, M.J. Reid, J.A. Orosz, R. Narayan, M.
      Hanke, and J. García: Confirmation via the Continuum-Fitting Method that the Spin of the
      Black Hole in Cygnus X-1 is Extreme, Astrophys. J. 790, 29 (2014); arXiv: 1308.4760.
[226] E. Gourgoulhon : An introduction to relativistic hydrodynamics, in Stellar Fluid Dynamics
      and Numerical Simulations: From the Sun to Neutron Stars, edited by M. Rieutord and
      B. Dubrulle, EAS Publications Series 21, 43-79 (2006); arXiv: gr-qc/0603009.
[227] E. Gourgoulhon: 3+1 Formalism in General Relativity; Bases of Numerical Relativity,
      Springer, Berlin (2012); URL: https://relativite.obspm.fr/3p1.
[228] E. Gourgoulhon: Special Relativity in General Frames: From Particles to Astrophysics,
      Springer, Berlin (2013); URL: https://relativite.obspm.fr/sperel.
[229] E. Gourgoulhon: Relativité générale, Master course lecture notes; URL: https://luth.
      obspm.fr/~luthier/gourgoulhon/fr/master/relat.html.
[230] E. Gourgoulhon and J.L. Jaramillo: A 3+1 perspective on null hypersurfaces and isolated
      horizons, Phys. Rep. 423, 159 (2006); arXiv: gr-qc/0503113.
800                                       BIBLIOGRAPHY
      [231] E. Gourgoulhon and J.L. Jaramillo: New theoretical approaches to black holes, in Proceed-
            ings of the Conference Jean-Pierre Lasota, X-ray Binaries, Accretion Disks and Compact
            Stars held in Trzebieszowice, Poland (7-13 October 2007), edited by M. Abramowicz and
            O. Straub, New Astronomy Reviews 51, 791 (2008); arXiv: 0803.2944.
      [232] I.S. Gradshteyn, I.M. Ryzhik, Y.V. Geronimus, M.Y. Tseytlin, and A. Jeffrey: Table of
            Integrals, Series, and Products (8 ed.), Academic Press (2015).
      [233] S.E. Gralla: Can the EHT M87 results be used to test general relativity?, Phys. Rev. D 103,
            024023 (2021); arXiv: 2010.08557.
      [234] S.E. Gralla, D.E. Holz, and R.M. Wald: Black hole shadows, photon rings, and lensing rings,
            Phys. Rev. D 100, 024018 (2019); arXiv: 1906.00873.
      [235] S.E. Gralla and A. Lupsasca: Lensing by Kerr black holes, Phys. Rev. D 101, 044031 (2020);
            arXiv: 1910.12873.
      [236] S.E. Gralla and A. Lupsasca: Null geodesics of the Kerr exterior, Phys. Rev. D 101, 044032
            (2020); arXiv: 1910.12881.
      [237] S.E. Gralla and A. Lupsasca: Observable shape of black hole photon rings, Phys. Rev. D
            102, 124003 (2020); arXiv: 2007.10336.
      [238] S.E. Gralla, A. Lupsasca, and A. Strominger: Observational signature of high spin at the
            Event Horizon Telescope, Month. Not. Roy. Astron. Soc. 475, 3829 (2018); arXiv: 1710.11112.
      [239] S.E. Gralla, A.P. Porfyriadis, and N. Warburton: Particle on the innermost stable circular
            orbit of a rapidly spinning black hole, Phys. Rev. D 92, 064029 (2015); arXiv: 1506.08496.
      [240] S.E. Gralla and P. Zimmerman: Critical exponents of extremal Kerr perturbations, Class.
            Quantum Grav. 35, 095002 (2018); arXiv: 1711.00855.
      [241] P. Grandclément: Fully consistent rotating black holes in the cubic Galileon theory, Class.
            Quantum Grav. 41, 025012 (2024); arXiv: 2308.11245.
      [242] A.A. Grib and Y.V. Pavlov: Black holes and particles with zero or negative energy, Theor.
            Math. Phys. 190, 268 (2017); arXiv: 1601.02592.
      [243] J.B. Griffiths and J. Podolský: Exact Space-Times in Einstein’s General Relativity, Cambridge
            Univ. Press, Cambridge (2009); DOI: 10.1017/CBO9780511635397.
      [244] R. Grossman, J. Levin, and G. Perez-Giz: Harmonic structure of generic Kerr orbits, Phys.
            Rev. D 85, 023012 (2012); arXiv: 1105.5811.
      [245] D. Grumiller, R. McNees and J. Salzer: Black Holes and Thermodynamics: The First Half
            Century, in Quantum Aspects of Black Holes, edited by X. Calmet, Springer, Cham (2015),
            p. 27; DOI: 10.1007/978-3-319-10852-0_2; arXiv: 1402.5127.
      [246] D. Grumiller and M.M. Sheikh-Jabbari: Black Hole Physics: From Collapse to Evaporation,
            Springer, Cham (2022); DOI: 10.1007/978-3-031-10343-8.
      [247] M. Guica, T. Hartman, W. Song, and A. Strominger: The Kerr/CFT correspondence, Phys.
            Rev. D 80, 124008 (2009); arXiv: 0809.4266.
      [248] A. Gullstrand: Allgemeine Lösung des statischen Einkörperproblems in der Einsteinschen
            Gravitationstheorie, Arkiv. Mat. Astron. Fys. 16, 1 (1922).
      [249] Y. Hagihara: Theory of the Relativistic Trajectories in a Gravitational Field of Schwarzschild,
            Japanese Journal of Astronomy and Geophysics 8, 67 (1931).
                                          BIBLIOGRAPHY                                               801
[250] K. Hajian and M.M. Sheikh-Jabbari: Solution phase space and conserved charges: A general
      formulation for charges associated with exact symmetries, Phys. Rev. D 93, 044074 (2016);
      arXiv: 1512.05584.
[251] P. Hájíček: Exact models of charged black holes. I. Geometry of totally geodesic null
      hypersurface, Commun. Math. Phys. 34, 37 (1973).
[252] P. Hájíček: Exact models of charged black holes. II. Axisymmetric stationary horizons,
      Commun. Math. Phys. 34, 53 (1973).
[253] P. Hájíček: Can outside fields destroy black holes ?, J. Math. Phys. 15, 1554 (1974).
[254] J. Haláček and T. Ledvinka: The analytic conformal compactification of the Schwarzschild
      spacetime, Class. Quantum Grav. 31, 015007 (2014); arXiv: 1401.1337.
[255] D. Harlow and J. Wu: Covariant phase space with boundaries, J. High Energ. Phys.
      10(2020), 146 (2020); arXiv: 1906.08616.
[256] B.K. Harrison, K.S. Thorne, M. Wakano and J.A. Wheeler: Gravitation Theory and Gravi-
      tational Collapse, University of Chicago Press, Chicago (1965); URL: https://archive.org/
      details/GravitationalTheoryandCollapseHarrison65-17293.
[257] J.B. Hartle: Gravity: An Introduction to Einstein’s General Relativity, Addison Wes-
      ley (Pearson Education), San Fransisco (2003); URL: https://archive.org/details/
      JBHartleGravityAnIntroductionToEinsteinsGeneralRelativityAddisonWesley20031.
[258] J.B. Hartle and S.W. Hawking: Solutions of the Einstein-Maxwell equations with many
      black holes, Commun. Math. Phys. 26, 87 (1972); available at https://projecteuclid.org/
      journalArticle/Download?urlId=cmp%2F1103858037.
[259] S.W. Hawking: Gravitational radiation from colliding black holes, Phys. Rev. Lett. 26, 1344
      (1971).
[260] S.W. Hawking: Black holes in general relativity, Commun. Math. Phys. 25, 152 (1972).
[261] S.W. Hawking: The event horizon, in Black Holes – Les astres occlus, edited by C. DeWitt
      and B. DeWitt, Gordon and Breach, New York (1973), p. 1.
[262] S.W. Hawking: Black hole explosions?, Nature 248, 30 (1974).
[263] S.W. Hawking: Particle creation by black holes, Commun. Math. Phys. 43, 199 (1975);
      erratum in Commun. Math. Phys. 46, 206 (1976).
[264] S.W. Hawking: A Brief History of Time: From the Big Bang to Black Holes, Bantam Books,
      Toronto and New York (1988).
[265] S.W. Hawking: The Nature of Space and Time, part of a series of six lectures with R. Penrose
      sponsored by Princeton University Press (1994); arXiv: hep-th/9409195.
[266] S.W. Hawking and G.F.R. Ellis: The large scale structure of space-time, Cambridge Univ.
      Press, Cambridge (1973); DOI: 10.1017/CBO9780511524646.
[267] S.W. Hawking and R. Penrose: The singularities of gravitational collapse and cosmology,
      Proc. Roy. Soc. Lond. A 314, 529 (1970).
[268] S.W. Hawking and R. Penrose: The Nature of Space and Time, Princeton Univ. Press,
      Princeton (2015).
[269] S.A. Hayward: General laws of black-hole dynamics, Phys. Rev. D 49, 6467 (1994); arXiv:
      gr-qc/9303006.
802                                       BIBLIOGRAPHY
      [270] C. Heinicke and F.W. Hehl: Schwarzschild and Kerr solutions of Einstein’s field equation:
            An Introduction, Int. J. Mod. Phys. D 24, 1530006 (2015); arXiv: 1503.02172.
      [271] F. Hélein: Multisymplectic formalism and the covariant phase space, in Variational Prob-
            lems in Differential Geometry, edited by R. Bielawski, K. Houston and M. Speight, Cam-
            bridge Univ. Press, Cambridge (2011), p. 94; DOI: 10.1017/CBO9780511863219.007; arXiv:
            1106.2086.
      [272] C.A.R. Herdeiro and J.P.S. Lemos: O buraco negro cinquenta anos depois: A génese do
            nome, Gazeta de Fisica 41, 2 (2018); English translation in The black hole fifty years after:
            Genesis of the name, arXiv: 1811.06587.
      [273] W.C. Hernandez and C.W. Misner: Observer Time as a Coordinate in Relativistic Spherical
            Hydrodynamics, Astrophys. J. 143, 452 (1966).
      [274] M. Heusler: On the uniqueness of the Reissner-Nordström solution with electric and magnetic
            charge, Class. Quantum Grav. 11, L49 (1994).
      [275] M. Heusler: Black Hole Uniqueness Theorems, Cambridge Univ. Press, Cambridge (1996);
            DOI: 10.1017/CBO9780511661396.
      [276] M. Heusler: On the uniqueness of the Papapetrou-Majumdar metric, Class. Quantum Grav.
            14, L129 (1997); arXiv: gr-qc/9607001.
      [277] D. Hilbert: Die Grundlagen der Physik (Zweite Mitteilung), Nachrichten von der
            Gesellschaft der Wissenscahften zu Göttingen, Mathematisch-physikalishe Klasse, Jahr
            1917, 53 (1917).
      [278] D. Hilbert: Die Grundlagen der Physik II, lectures given at Göttingen University in winter
            1916-1917, lecture notes written by R. Bär; reprinted in [445], p. 162-307.
      [279] W.A. Hiscock, L.G. Williams, and D.M. Eardley: Creation of particles by shell-focusing
            singularities, Phys. Rev. D 26, 751 (1982).
      [280] S. Hollands and A. Ishibashi: Black hole uniqueness theorems in higher dimensional
            spacetimes, Class. Quantum Grav. 29, 163001 (2012); arXiv: 1206.1164.
      [281] S. Hollands, A. Ishibashi, and H.S. Reall: A Stationary Black Hole Must be Axisymmetric
            in Effective Field Theory, Commun. Math. Phys. 401, 2757 (2023); arXiv: 2212.06554.
      [282] S. Hollands, A. Ishibashi, and R.M. Wald: A Higher Dimensional Stationary Rotating Black
            Hole Must be Axisymmetric, Commun. Math. Phys. 271, 699 (2007); arXiv: gr-qc/0605106.
      [283] S. Hollands and R.M. Wald: Stability of Black Holes and Black Branes, Commun. Math.
            Phys. 321, 629 (2013); arXiv: 1201.0463.
      [284] S. Hollands, R.M. Wald and V.G. Zhang: Entropy of dynamical black holes, Phys. Rev. D
            110, 024070 (2024); arXiv: 2402.00818.
      [285] G.T. Horowitz: Quantum States of Black Holes, in Black Holes and Relativistic Stars, edited
            by R.M. Wald, Univ. Chicago Press, Chicago (1998), p. 241; arXiv: gr-qc/9704072.
      [286] S. Husa and J. Winicour: Asymmetric merger of black holes, Phys. Rev. D 60, 084019
            (1999); arXiv: gr-qc/9905039.
      [287] S. Hwang: A Rigidity Theorem for Ricci Flat Metrics, Geometriae Dedicata 71, 5 (1998).
      [288] A.D. Ionescu and S. Klainerman: Rigidity results in general relativity: A review, Surveys
            in Differential Geometry 20, 123 (2015); arXiv: 1501.01587.
                                         BIBLIOGRAPHY                                               803
[289] W. Israel: Event Horizons in Static Vacuum Space-Times, Phys. Rev. 164, 1776 (1967).
[290] W. Israel: Event horizons in static electrovac space-times, Commun. Math. Phys. 8, 245
      (1968).
[291] W. Israel: Third Law of Black-Hole Dynamics: A Formulation and Proof, Phys. Rev. Lett.
      57, 397 (1986).
[292] V. Iyer and R.M. Wald: Some properties of the Noether charge and a proposal for dynamical
      black hole entropy, Phys. Rev. D 50, 846 (1994); arXiv: gr-qc/9403028.
[293] T. Jacobson: Introductory Lectures on Black Hole Thermodynamics, lectures given at the
      University of Utrecht (1996); URL: https://www.physics.umd.edu/grt/taj/776b/
      lectures.pdf.
[294] T. Jacobson, G. Kang, and R.C. Myers: On black hole entropy, Phys. Rev. D 49, 6587 (1994);
      arXiv: gr-qc/9312023.
[295] T. Jacobson and Y.-A. Soong: Slices of the Kerr ergosurface, Class. Quantum Grav. 26,
      055014 (2009); arXiv: 0809.2369.
[296] J. Jaffe: Collapsing objects and the backward emission of light, Ann. Phys. 55, 374 (1969).
[297] J.L. Jaramillo: An Introduction to Local Black Hole Horizons in the 3+1 Approach of Gen-
      eral Relativity, in Black Holes: New Horizon, edited by S.A. Hayward, World Scientific,
      Singapore (2013); DOI: 10.1142/S0218271811020366; arXiv: 1108.2408.
[298] J.L. Jaramillo and E. Gourgoulhon: Mass and Angular Momentum in General Relativity, in
      Mass and Motion in General Relativity, edited by L. Blanchet, A. Spallicci and B. Whiting,
      Fundamental Theories of Physics 162, 87, Springer, Dordrecht (2011); arXiv: 1001.5429.
[299] M. Jaroszyński and A. Kurpiewski: Optics near Kerr black holes: spectra of advection
      dominated accretion flows, Astron. Astrophys. 326, 419 (1997); arXiv: astro-ph/9705044.
[300] J.T. Jebsen: Über die allgemeinen kugelsymmetrischen Lösungen der Einsteinschen Gravita-
      tionsgleichungen im Vakuum, Arkiv för Matematik, Astronomi och Fysik 15, Nr. 18, p. 1
      (1921); English translation in Gen. Relativ. Grav. 37, 2253 (2005).
[301] T. Johannsen: Sgr A* and general relativity, Class. Quantum Grav. 33, 113001 (2016);
      arXiv: 1512.03818.
[302] T. Johannsen and D. Psaltis: Testing the No-Hair Theorem with Observations in the Electro-
      magnetic Spectrum: II. Black-Hole Images, Astrophys. J. 718, 446 (2010); arXiv: 1005.1931.
[303] N.V. Johansen and F. Ravndal: On the discovery of Birkhoff’s theorem, Gen. Relativ. Grav.
      38, 537 (2006); arXiv: physics/0508163.
[304] M.D. Johnson, A. Lupsasca, A. Strominger, G.N. Wong, S. Hadar, D. Kapec, R. Narayan,
      A. Chael, C.F. Gammie, P. Galison, D.C. Palumbo, S.S. Doeleman, L. Blackburn, M. Wielgus,
      D.W. Pesce, J.R. Farah, and J.M. Moran: Universal interferometric signatures of a black
      hole’s photon ring, Science Advances 6, eaaz1310 (2020); arXiv: 1907.04329.
[305] M.D. Johnson et al: The Black Hole Explorer: motivation and vision, Proc. of SPIE 13092,
      130922D (2024); arXiv: 2406.12917.
[306] D. Joyner and W. Stein: Sage Tutorial, CreateSpace (2014).
804                                       BIBLIOGRAPHY
      [307] P. Kanti and E. Winstanley: Hawking Radiation from Higher-Dimensional Black Holes, in
            Quantum Aspects of Black Holes, edited by X. Calmet, Fundamental Theories of Physics
            178, 229, Springer, Cham (2015); arXiv: 1402.3952.
      [308] D. Kapec and A. Lupsasca: Particle motion near high-spin black holes, Class. Quantum
            Grav. 37, 015006 (2020)
            DOI: 10.1088/1361-6382/ab519e                                       arXiv: 1905.11406
      [309] J. Katz: A note on Komar’s anomalous factor, Class. Quantum Grav. 2, 423 (1985)
            DOI: 10.1088/0264-9381/2/3/018
      [310] B.S. Kay and R.M. Wald: Theorems on the uniqueness and thermal properties of stationary,
            nonsingular, quasifree states on spacetimes with a bifurcate killing horizon, Phys. Rep. 207,
            49 (1991)
            DOI: 10.1016/0370-1573(91)90015-E
      [311] C. Kehle and R. Unger: Gravitational collapse to extremal black holes and the third law of
            black hole thermodynamics, J. Eur. Math. Soc., in press (2025); arXiv: 2211.15742.
      [312] R.P. Kerr: Gravitational Field of a Spinning Mass as an Example of Algebraically Special
            Metrics, Phys. Rev. Lett. 11, 237 (1963)
            DOI: 10.1103/PhysRevLett.11.237
      [313] R.P. Kerr: Discovering the Kerr and Kerr-Schild metrics, in The Kerr spacetime, edited by
            D.L. Wiltshire, M. Visser, and S.M. Scott, Cambridge Univ. Press, Cambridge (2009), p. 38
            arXiv: 0706.1109
      [314] R.P. Kerr and A. Schild: A New Class of Vacuum Solutions of the Einstein Field Equations,
            in Atti del Convegno sulla Relativita Generale: Problemi dell’Energia e Onde Gravitazionali,
            G. Barbèra Editore, Firenze (1965), p. 1; reprinted in Gen. Relativ. Gravit. 41, 2485 (2009)
            DOI: 10.1007/s10714-009-0857-z
      [315] I. Khavkine: Covariant phase space, constraints, gauge and the Peierls formula, Int. J. Mod.
            Phys. A 29, 1430009 (2014)
            DOI: 10.1142/S0217751X14300099                                            arXiv: 1402.1282
      [316] A. King: Supermassive Black Holes, Cambridge Univ. Press, Cambridge (2023)
            DOI: 10.1017/9781108768849
      [317] O. Klein: Einige Probleme der allgemeinen Relativitätstheorie, in Werner Heisenberg und
            die Physik unserer Zeit, edited by F. Bopp, Vieweg+Teubner Verlag, Wiesbaden (1961),
            p. 58
            DOI: 10.1007/978-3-663-05439-9_5
      [318] S. Kobayashi and K. Nomizu: Foundations of Differential Geometry, vol. 1, Interscience,
            New York (1963).
      [319] A. Komar: Covariant Conservation Laws in General Relativity, Phys. Rev. 113, 934 (1959)
            DOI: 10.1103/PhysRev.113.934
      [320] A. Komar: Asymptotic Covariant Conservation Laws for Gravitational Radiation, Phys.
            Rev. 127, 1411 (1962)
            DOI: 10.1103/PhysRev.127.1411
                                          BIBLIOGRAPHY                                               805
[321] E.-A. Kontou and K. Sanders: Energy conditions in general relativity and quantum field
      theory, Class. Quantum Grav. 37, 193001 (2020)
      DOI: 10.1088/1361-6382/ab8fcf                                             arXiv: 2003.01815
[322] F. Kottler: Über die physikalischen Grundlagen der Einsteinschen Gravitationstheorie,
      Annalen der Physik 56, 401 (1918)
      DOI: 10.1002/andp.19183611402
[323] A. Krasiński: Editor’s Note: The Expanding Universe, Gen. Relativ. Gravit. 29, 637 (1997)
      DOI: 10.1023/A:1018803604510
[324] A. Krasiński, E. Verdaguer and R.P. Kerr: Editorial note to: R. P. Kerr and A. Schild, A new
      class of vacuum solutions of the Einstein field equations, Gen. Relativ. Gravit. 41, 2469
      (2009)
      DOI: 10.1007/s10714-009-0856-0
[325] B. Krishnan: Quasi-local Black Hole Horizons, in Springer Handbook of Spacetime, edited
      by A. Ashtekar and V. Petkov, Springer, Berlin, Heidelberg (2014), p. 527
      DOI: 10.1007/978-3-642-41992-8_25                                          arXiv: 1303.4635
[326] M.D. Kruskal: Maximal Extension of Schwarzschild Metric, Phys. Rev. 119, 1743 (1960)
      DOI: 10.1103/PhysRev.119.1743
[327] H.K. Kunduri and J. Lucietti: Classification of Near-Horizon Geometries of Extremal Black
      Holes, Living Rev. Relativ. 16, 8 (2013)
      DOI: 10.12942/lrr-2013-8
[328] H.K. Kunduri and J. Lucietti: No static bubbling spacetimes in higher dimensional Einstein-
      Maxwell theory, Class. Quantum Grav. 35, 054003 (2018)
      DOI: 10.1088/1361-6382/aaa744                                             arXiv: 1712.02668
[329] H.K. Kunduri, J. Lucietti and H.S. Reall: Near-horizon symmetries of extremal black holes,
      Class. Quantum Grav. 24, 4169 (2007)
      DOI: 10.1088/0264-9381/24/16/012                                           arXiv: 0705.4214
[330] Y. Kuroda: Naked Singularities in the Vaidya Spacetime, Prog. Theor. Phys. 72, 63 (1984)
      DOI: 10.1143/PTP.72.63
[331] J. Lafontaine: An Introduction to Differential Manifolds, Springer, Cham (2015)
      DOI: 10.1007/978-3-319-20735-3
[332] K. Lake: Revisiting the Darmois and Lichnerowicz junction conditions, Gen. Relativ. Gravit.
      49, 134 (2017)
      DOI: 10.1007/s10714-017-2300-1                                            arXiv: 1705.01090
[333] K. Lake and R.C. Roeder: Note on the optical appearance of a star collapsing through its
      gravitational radius, Astrophys. J. 232, 277 (1979)
      DOI: 10.1086/157286
[334] K. Landsman: Singularities, Black Holes, and Cosmic Censorship: A Tribute to Roger Penrose,
      Foundations of Physics 51, 42 (2021)
      DOI: 10.1007/s10701-021-00432-1                                           arXic: 2101.02687
[335] K. Landsman: Penrose’s 1965 singularity theorem: from geodesic incompleteness to cosmic
      censorship, Gen. Relat. Grav. 54, 115 (2022)
      DOI: 10.1007/s10714-022-02973-w                                           arXiv: 2205.01680
806                                      BIBLIOGRAPHY
      [336] S. Lang: Differential and Riemannian Manifolds, Springer-Verlag, New York (1995).
      [337] D. Langlois: Relativité générale, Vuibert, Paris (2013).
      [338] D. Langlois: Dark energy and modified gravity in degenerate higher-order scalar–tensor
            (DHOST) theories: A review, Int. J. Mod. Phys. D 28, 1942006 (2019)
            DOI: 10.1142/S0218271819420069                                      arXiv: 1811.06271
      [339] J.-P. Lasota, E. Gourgoulhon, M. Abramowicz, A. Tchekhovskoy, and R. Narayan: Ex-
            tracting black-hole rotational energy: The generalized Penrose process, Phys. Rev. D 89,
            024041 (2014)
            DOI: 10.1103/PhysRevD.89.024041                                         arXiv: 1310.7499
      [340] M. von Laue: Die Relativitätstheorie. Zweiter Band: Die Allgemeine Relativitätstheorie
            Und Einsteins Lehre Von Der Schwerkraft, Friedr. Vieweg & Sohn, Braunschweig (1921);
            available at
            https://archive.org/details/dierelativitts02laueuoft/page/n5/mode/2up
      [341] M. Le Bellac: Quantum Physics, Cambridge Univ. Press, Cambridge (2006)
            DOI: 10.1017/CBO9780511616471
      [342] J. M. Lee: Introduction to Topological Manifolds, 2nd edition, Springer, New-York (2011)
            DOI: 10.1007/978-1-4419-7940-7
      [343] J. M. Lee: Introduction to Smooth Manifolds, 2nd edition, Springer, New-York (2013)
            DOI: 10.1007/978-1-4419-9982-5
      [344] J. M. Lee: Introduction to Riemannian Manifolds, 2nd edition, Springer, Cham (2018)
            DOI: 10.1007/978-3-319-91755-9
      [345] J. Lee and R.M. Wald: Local symmetries and constraints, J. Math. Phys. 31, 725 (1990)
            DOI: 10.1063/1.528801
      [346] G. Lemaître: L’univers en expansion, Publication du Laboratoire d’Astronomie et de
            Géodésie de l’Université de Louvain 9, 171 (1932); reprinted in Annales de la Société
            Scientifique de Bruxelles A 53, 51 (1933); English translation in Gen. Relativ. Gravit. 29,
            641 (1997)
            DOI: 10.1023/A:1018855621348
      [347] R. W. Lindquist, R. A. Schwartz, and C. W. Misner: Vaidya’s Radiating Schwarzschild
            Metric, Phys. Rev. 137, B1364 (1965)
            DOI: 10.1103/PhysRev.137.B1364
      [348] J.-P. Luminet: Image of a spherical black hole with thin accretion disk, Astron. Astrophys.
            75, 228 (1979).
      [349] J.-P. Luminet: An Illustrated History of Black Hole Imaging: Personal Recollections (1972-
            2002), preprint (2019); arXiv: 1902.11196.
      [350] A. Lupsasca, A. Cárdenas-Avendaño, D.C.M. Palumbo, M.D. Johnson, S.E. Gralla, D.P. Mar-
            rone, P. Galison, P. Tiede, and L.Keeble: The Black Hole Explorer: photon ring science,
            detection, and shape measurement, Proc. of SPIE 13092, 130926Q (2024); arXiv: 2406.09498
      [351] D. Lynden-Bell: Galactic Nuclei as Collapsed Old Quasars, Nature 223, 690 (1969)
            DOI: 10.1038/223690a0
                                          BIBLIOGRAPHY                                               807
[352] J.H. MacGibbon, B.J. Carr, and D.N. Page: Do evaporating black holes form photospheres?,
      Phys. Rev. D 78, 064043 (2008)
      DOI: 10.1103/PhysRevD.78.064043                                           arXiv: 0709.2380
[353] S.D. Majumdar: A Class of Exact Solutions of Einstein’s Field Equations, Phys. Rev. 72, 390
      (1947)
      DOI: 10.1103/PhysRev.72.390
[354] J. Maldacena and L. Susskind: Cool horizons for entangled black holes, Fortschritte der
      Physik 61, 781 (2013)
      DOI: 10.1002/prop.201300020                                               arXiv: 1306.0533
[355] J.-A. Marck: Journey to a black hole, movie (1991)
      https://www.youtube.com/watch?v=5Oqop50ltrM
      see also https://images.cnrs.fr/video/46
[356] J.-A. Marck: Short-cut method of solution of geodesic equations for Schwarzchild black hole,
      Class. Quantum Grav. 13, 393 (1996)
      DOI: 10.1088/0264-9381/13/3/007                                       arXiv: gr-qc/9505010
[357] K. Martel and E. Poisson: Regular coordinate systems for Schwarzschild and other spherical
      spacetimes, Amer. J. Phys. 69, 476 (2001)
      DOI: 10.1119/1.1336836                                                arXiv: gr-qc/0001069
[358] A.K.M. Masood-ul-Alam: Uniqueness proof of static charged black holes revisited, Class.
      Quantum Grav. 9, L53 (1992)
      DOI: 10.1088/0264-9381/9/5/001
[359] R.A. Matzner, H.E. Seidel, S.L. Shapiro, L. Smarr, W.-M. Suen, S.A. Teukolsky, and J.
      Winicour: Geometry of a Black Hole Collision, Science 270, 941 (1995)
      DOI: 10.1126/science.270.5238.941
[360] N.E. Mavromatos and V.A. Mitsou: Magnetic monopoles revisited: Models and searches at
      colliders and in the Cosmos, Int. J. Mod. Phys. A 35, 2030012 (2020)
      DOI: 10.1142/S0217751X20300124                                           arXiv: 2005.05100
[361] P.O. Mazur: Proof of uniqueness of the Kerr-Newman black hole solution, J. Phys. A: Math.
      Gen. 15, 3173 (1982)
      DOI: 10.1088/0305-4470/15/10/021
[362] P.O. Mazur: Black hole uniqueness from a hidden symmetry of Einstein’s gravity, Gen.
      Relat. Grav. 16, 211 (1984)
      DOI: 10.1007/BF00762536
[363] P.O. Mazur: A global identity for nonlinear σ-models, Phys. Lett. A 100, 341 (1984)
      DOI: 10.1016/0375-9601(84)91084-3
[364] P.O. Mazur: Black Hole Uniqueness Theorems, unpublished report (2001)
      arXiv: hep-th/0101012
[365] F. Mertens, A.P. Lobanov, R.C. Walker, and P.E. Hardee: Kinematics of the jet in M87 on
      scales of 100-1000 Schwarzschild radii, Astron. Astrophys. 595, A54 (2016)
      DOI: 10.1051/0004-6361/201628829                                         arXiv: 1608.05063
[366] J. W. Milnor: On manifolds homeomorphic to the 7-sphere, Ann. Math. 64, 399 (1956).
808                                      BIBLIOGRAPHY
      [367] H. Mineur: La mécanique des masses variables. Le problème des deux corps, Annales
            scientifiques de l’École Normale Supérieure, Ser. 3, 50, 1 (1933)
            DOI: 10.24033/asens.825
      [368] Y. Mino: Perturbative approach to an orbital evolution around a supermassive black hole,
            Phys. Rev. D 67, 084027 (2003)
            DOI: 10.1103/PhysRevD.67.084027                                    arXiv: gr-qc/0302075
      [369] C.W. Misner: Interpretation of Gravitational-Wave Observations, Phys. Rev. Lett. 28, 994
            (1972)
            DOI: 10.1103/PhysRevLett.28.994
      [370] C.W. Misner and D.H. Sharp: Relativistic Equations for Adiabatic, Spherically Symmetric
            Gravitational Collapse, Phys. Rev. 136, B571 (1964)
            DOI: 10.1103/PhysRev.136.B571
      [371] C.W. Misner, K.S. Thorne, and J.A. Wheeler: Gravitation, Freeman, New York (1973);
            reprinted by Princeton University Press, Princeton (2017)
            URL: https://press.princeton.edu/books/ebook/9781400889099/gravitation
      [372] V. Moncrief and J. Isenberg: Symmetries of higher dimensional black holes, Class. Quantum
            Grav. 25, 195015 (2008)
            DOI: 10.1088/0264-9381/25/19/195015                                      arXiv: 0805.1451
      [373] H. Müller zum Hagen, D.C. Robinson, and H.J. Seifert: Black holes in static vacuum
            space-times, Gen. Relat. Grav. 4, 53 (1973)
            DOI: 10.1007/BF00769760
      [374] G. Muñoz: Orbits of massless particles in the Schwarzschild metric: Exact solutions, Amer.
            J. Phys 82, 564 (2014)
            DOI: 10.1119/1.4866274
      [375] R.C. Myers: Higher-dimensional black holes in compactified space-times, Phys. Rev. D 35,
            455 (1987)
            DOI: 10.1103/PhysRevD.35.455
      [376] R.C. Myers and M.J. Perry: Black holes in higher dimensional space-times, Ann. Phys. 172,
            304 (1986)
            DOI: 10.1016/0003-4916(86)90186-7
      [377] R. Narayan, M.D. Johnson, Michael D., and C.F. Gammie: The Shadow of a Spherically
            Accreting Black Hole, Astrophys. J. Lett. 885, L33 (2019)
            DOI: 10.3847/2041-8213/ab518c                                   arXiv: 1910.02957
      [378] V.V. Narlikar and P.C. Vaidya: A Spherically Symmetrical Non-Static Electromagnetic Field,
            Nature 159, 642 (1947)
            DOI: 10.1038/159642a0
      [379] J. Nash: The Imbedding Problem for Riemannian Manifolds, Ann. Math. 63, 20 (1956)
            DOI: 10.2307/1969989
      [380] E.T. Newman, E. Couch, K. Chinnapared, A. Exton, A. Prakash, and R. Torrence: Metric
            of a Rotating, Charged Mass, J. Math. Phys. 6, 918 (1965)
            DOI: 10.1063/1.1704351
                                         BIBLIOGRAPHY                                               809
[381] R.P.A.C. Newman: Topology and stability of marginal 2-surfaces, Class. Quantum Grav. 4,
      277 (1987)
      DOI: 10.1088/0264-9381/4/2/011
[382] J.-P. Nicolas: The Conformal Approach to Asymptotic Analysis, in From Riemann to Differ-
      ential Geometry and Relativity, edited by L. Ji, A. Papadopoulos and S. Yamada, Springer,
      Cham (2017), p. 571
      DOI: 10.1007/978-3-319-60039-0_19                                       arXiv: 1508.02592
[383] B.C. Nolan: Sectors of spherical homothetic collapse, Class. Quantum Grav. 18, 1651 (2001)
      DOI: 10.1088/0264-9381/18/9/304                                     arXiv: gr-qc/0010032
[384] B.C. Nolan: Odd-parity perturbations of self-similar Vaidya spacetime, Class. Quantum
      Grav. 24, 177 (2007)
      DOI: 10.1088/0264-9381/24/1/010                                   arXiv: gr-qc/0608040
[385] G. Nordström: On the Energy of the Gravitational Field in Einstein’s Theory, Verhandl.
      Koninkl. Ned. Akad. Wetenschap., Afdel. Natuurk. 20, 1238 (1918); available at
      https://archive.org/details/proceedingsofsec202koni/page/1238
[386] I.D. Novikov: On the evolution of a semiclosed world (in Russian), Astronomicheskii
      Zhurnal 40, 772 (1963); English translation in Soviet Astronomy 7, 587 (1964)
      URL: https://ui.adsabs.harvard.edu/abs/1964SvA.....7..587N/abstract
[387] I.D. Novikov and V.P. Frolov: Physics of Black Holes, Kluwer, Dordrecht (1989)
      DOI: 10.1007/978-94-017-2651-1
[388] I.D. Novikov and K.S. Thorne: Astrophysics of Black Holes, in Black Holes – Les astres
      occlus, edited by C. DeWitt and B. DeWitt, Gordon and Breach, New York (1973), p. 343.
[389] M. Nozawa, T. Shiromizu, K. Izumi and S. Yamada: Divergence equations and uniqueness
      theorem of static black holes, Class. Quantum Grav. 35, 175009 (2018)
      DOI: 10.1088/1361-6382/aad206                                         arXiv: 1805.11385
[390] B. O’Neill: Semi-Riemannian Geometry, with Applications to Relativity, Academic Press,
      New York (1983).
[391] B. O’Neill: The geometry of Kerr black holes, A.K. Peters, Wellesley (MA) (1995); reprinted
      by Dover, Mineola (NY) (2014).
[392] J.R. Oppenheimer and H. Snyder: On Continued Gravitational Contraction, Phys. Rev. 56,
      455 (1939)
      DOI: 10.1103/PhysRev.56.455
[393] J.A. Orosz, J.E. McClintock, J.P. Aufdenberg, R.A. Remillard, M.J. Reid, R. Narayan, and L.
      Gou: The Mass of the Black Hole in Cygnus X-1, Astrophys. J. 742, 84 (2011)
      DOI: 10.1088/0004-637X/742/2/84                                           arXiv: 1106.3689
[394] D.N. Page: Particle emission rates from a black hole: Massless particles from an uncharged,
      nonrotating hole, Phys. Rev. D 13, 198 (1976)
      DOI: 10.1103/PhysRevD.13.198
[395] D.N. Page: Hawking radiation and black hole thermodynamics, New J. Phys. 7, 203 (2005)
      DOI: 10.1088/1367-2630/7/1/203                                arXiv: hep-th/0409024
810                                      BIBLIOGRAPHY
      [396] D.N. Page and K.S. Thorne: Disk-Accretion onto a Black Hole. Time-Averaged Structure of
            Accretion Disk, Astrophys. J. 191, 499 (1974)
            DOI: 10.1086/152990
      [397] P. Painlevé: La mécanique classique et la théorie de la relativité, Comptes Rendus Acad.
            Sci. (Paris) 173, 677 (1921)
            URL:                       http://www.bibnum.education.fr/physique/relativite/
            la-mecanique-classique-et-la-theorie-de-la-relativite
      [398] E. Papantonopoulos (ed.): Physics of Black Holes, Springer, Berlin (2009).
      [399] A. Papapetrou: A Static Solution of the Gravitational Field for an Arbitrary Charge-
            Distribution, Proc. R. Irish Acad. A, 51, 191 (1947)
            URL: https://www.jstor.org/stable/20488481
      [400] A. Papapetrou : Formation of a Singularity and Causality, in A Random Walk in Relativity
            and Cosmology, edited by N. Dadhich, J. Krishna Rao, J.V. Narlikar and C.V. Vishveshwara,
            Wiley Eastern, New Dehli (1985), p. 184.
      [401] R. Parentani and P. Spindel: Hawking radiation, Scholarpedia 6, 6958 (2011)
            DOI: 10.4249/scholarpedia.6958
      [402] A.W. Peet: TASI Lectures on Black Holes in String Theory, in Strings, Branes and Gravity,
            edited by J.A. Harvey, S. Kachru, and E. Silverstein, World Scientific, Singapore (2001),
            p. 353
            DOI: 10.1142/9789812799630_0003                                      arXiv: hep-th/0008241
      [403] R. Penrose: Asymptotic Properties of Fields and Space-Times, Phys. Rev. Lett. 10, 66 (1963)
            DOI: 10.1103/PhysRevLett.10.66
      [404] R. Penrose: Conformal treatment of infinity, in Relativity, groups and topology, edited by
            B. DeWitt and C. DeWitt, Gordon and Breach, New York (1964), p. 565; reprinted in Gen.
            Relativ. Gravit. 43, 901 (2011)
            DOI: 10.1007/s10714-010-1110-5
      [405] R. Penrose: Gravitational collapse and space-time singularities, Phys. Rev. Lett. 14, 57
            (1965)
            DOI: 10.1103/PhysRevLett.14.57
      [406] R. Penrose: Zero Rest-Mass Fields Including Gravitation: Asymptotic Behaviour, Proc. Roy.
            Soc. Lond. A 284, 159 (1965)
            http://www.jstor.org/stable/2415306
      [407] R. Penrose: Structure of Space-Time, in Battelle Rencontres: 1967 Lectures in Mathematics
            and Physics, edited by C.M. DeWitt and J.A. Wheeler, Benjamin, New York (1968), p. 121.
      [408] R. Penrose: Gravitational Collapse: The Role of General Relativity, Rivista del Nuovo
            Cimento, Numero Speziale 1, 252 (1969); reprinted in Gen. Relativ. Gravit. 34, 1141 (2002)
            DOI: 10.1023/A:1016578408204
      [409] R. Penrose and G.R. Floyd: Extraction of Rotational Energy from a Black Hole, Nature
            Phys. Sci. 229, 177 (1971)
            DOI: 10.1038/physci229177a0
      [410] A. Perez: Black holes in loop quantum gravity, Rep. Prog. Phys. 80, 126901 (2017)
            DOI: 10.1088/1361-6633/aa7e14                                            arXiv: 1703.09149
                                         BIBLIOGRAPHY                                              811
[411] G. Perez-Giz: From Measure Zero to Measure Hero: Periodic Kerr Orbits and Gravitational
      Wave Physics, PhD thesis, Columbia University, New York (2011)
      DOI: 10.7916/D8P84JVB
[412] V. Perlick: Gravitational Lensing from a Spacetime Perspective, Living Rev. Relativ. 7, 9
      (2004)
      DOI: 10.12942/lrr-2004-9
[413] P. Peter and J.-P. Uzan: Primordial Cosmology, Oxford Univ. Press, Oxford (2009).
[414] J. Plebański and A. Krasiński: An Introduction to General Relativity and Cosmology,
      Cambridge Univ. Press, Cambridge (2006)
      DOI: 10.1017/CBO9780511617676
[415] M.A. Podurets: Asymptotic behavior of the optical luminosity of a star in gravitational
      collapse in terms of general relativity theory (in Russian), Astronomicheskii Zhurnal 41,
      1090 (1964); English translation in Soviet Astronomy 8, 868 (1965)
      URL: https://ui.adsabs.harvard.edu/abs/1965SvA.....8..868P/abstract
[416] E. Poisson: A Relativist’s Toolkit, The Mathematics of Black-Hole Mechanics, Cambridge
      Univ. Press, Cambridge (2004)
      DOI: 10.1017/CBO9780511606601
[417] E. Poisson and W. Israel: Internal structure of black holes, Phys. Rev. D 41, 1796 (1990)
      DOI: 10.1103/PhysRevD.41.1796
[418] E. Poisson and C.M. Will: Gravity: Newtonian, Post-Newtonian, Relativistic, Cambridge
      Univ. Press, Cambridge (2014)
      DOI: 10.1017/CBO9781139507486
[419] A. Pound and B. Wardell: Black Hole Perturbation Theory and Gravitational Self-Force, in
      Handbook of Gravitational Wave Astronomy, edited by C. Bambi, S. Katsanevas, and K.D.
      Kokkotas, Springer, Singapore (2021)
      DOI: 10.1007/978-981-15-4702-7_38-1                                 arXiv: 2101.04592
[420] W.H. Press and S.A. Teukolsky: Floating Orbits, Superradiant Scattering and the Black-hole
      Bomb, Nature 238, 211 (1972)
      DOI: 10.1038/238211a0
[421] I. Rácz: A simple proof of the recent generalizations of Hawking’s black hole topology
      theorem, Class. Quantum Grav. 25, 162001 (2008)
      DOI: 10.1088/0264-9381/25/16/162001                                   arXiv: 0806.4373
[422] I. Rácz and R.M. Wald: Global extensions of spacetimes describing asymptotic final states
      of black holes, Class. Quantum Grav. 13, 539 (1996)
      DOI: 10.1088/0264-9381/13/3/017                                    arXiv: gr-qc/9507055
[423] A. Rajantie: Introduction to magnetic monopoles, Contemporary Physics 53, 195 (2012)
      DOI: 10.1080/00107514.2012.685693                                   arXiv: 1204.3077
[424] P. Ramond and A. Le Tiec: Multipolar particles in helically symmetric spacetimes, Class.
      Quantum Grav. 38, 135022 (2021)
      DOI: 10.1088/1361-6382/abebef                                        arXiv: 2005.00602
812                                      BIBLIOGRAPHY
      [425] H.S. Reall: Higher-Dimensional Black Holes, in General Relativity, Cosmology and As-
            trophysics; Perspectives 100 years after Einstein’s stay in Prague, edited by J. Bičák and
            T. Ledvinka, Springer, Cham (2014), p. 245
            DOI: 10.1007/978-3-319-06349-2_12
      [426] H.S. Reall: Black Holes, lecture notes for the course on Black Holes in Part III of the
            Cambridge Mathematical Tripos (2020); available at
            http://www.damtp.cam.ac.uk/user/hsr1000/teaching.html
      [427] H.S. Reall: Third law of black hole mechanics for supersymmetric black holes and a quasilo-
            cal mass-charge inequality, Phys. Rev. D 110, 124059 (2024); arXiv: 2410.11956.
      [428] M.J. Reid, J.E. McClintock, R. Narayan, L. Gou, R.A. Remillard, and J.A. Orosz: The
            trigonometric parallax of Cygnus X-1, Astrophys. J. 742, 83 (2011)
            DOI: 10.1088/0004-637X/742/2/83                                            arXiv: 1106.3688
      [429] H. Reissner: Über die Eigengravitation des elektrischen Feldes nach der Einsteinschen
            Theorie, Annalen der Physik 50, 106 (1916)
            DOI: 10.1002/andp.19163550905
      [430] C.S. Reynolds: Observational Constraints on Black Hole Spin, Ann. Rev. Astron. Astrophys.
            59, 117 (2021)
            DOI: 10.1146/annurev-astro-112420-035022                                  arXiv: 2011.08948
      [431] L. Rezzolla and O. Zanotti: Relativistic Hydrodynamics, Oxford Univ. Press, Oxford (2013)
            DOI: 10.1093/acprof:oso/9780198528906.001.0001
      [432] A. Riazuelo: Trous noirs et trous de vers, Pour la Science 372, 36 (2008).
      [433] A. Riazuelo: Seeing relativity – I. Ray tracing in a Schwarzschild metric to explore the
            maximal analytic extension of the metric and making a proper rendering of the stars, Int. J.
            Mod. Phys. D 28, 1950042 (2019)
            DOI: 10.1142/S0218271819500421                                            arXiv: 1511.06025
      [434] A. Riazuelo: Seeing relativity – III. Journeying within the Kerr metric toward the negative
            gravity region, Int. J. Mod. Phys. D 29, 2050109 (2020)
            DOI: 10.1142/S0218271820501096                                            arXiv: 2008.04384
      [435] W. Rindler: Visual Horizons in World-Models, Month. Not. Roy. Astron. Soc. 116, 662
            (1956); reprinted in Gen. Relativ. Gravit. 34, 133 (2002)
            DOI: 10.1023/A:1015347106729
      [436] D.C. Robinson: Classification of black holes with electromagnetic fields, Phys. Rev. D 10,
            458 (1974)
            DOI: 10.1103/PhysRevD.10.458
      [437] D.C. Robinson: Uniqueness of the Kerr Black Hole, Phys. Rev. Lett. 34, 905 (1975)
            DOI: 10.1103/PhysRevLett.34.905
      [438] D.C. Robinson: A simple proof of the generalization of Israel’s theorem, Gen. Relat. Grav.
            8, 695 (1977)
            DOI: 10.1007/BF00756322
      [439] D.C. Robinson: Four decades of black hole uniqueness theorems, in The Kerr spacetime,
            edited by D.L. Wiltshire, M. Visser, and S.M. Scott, Cambridge Univ. Press, Cambridge
                                            BIBLIOGRAPHY                                                 813
      [454] J.M.M. Senovilla and D. Garfinkle: The 1965 Penrose singularity theorem, Class. Quantum
            Grav. 32, 124008 (2015)
            DOI: 10.1088/0264-9381/32/12/124008                                        arXiv: 1410.5226
      [455] S.L. Shapiro: Thermal radiation from stellar collapse to a black hole, Phys. Rev. D 40, 1858
            (1989)
            DOI: 10.1103/PhysRevD.40.1858
      [456] M. Shibata: Numerical Relativity, World Scientific, Singapore (2016); DOI: 10.1142/9692.
      [457] M. Siino: Topological Appearance of Event Horizon: What Is the Topology of the Event
            Horizon That We Can See?, Prog. Theor. Phys. 99, 1 (1998)
            DOI: 10.1143/PTP.99.1                                                  arXiv: gr-qc/9803099
      [458] M. Siino: Topology of event horizons Phys. Rev. D 58, 104016 (1998)
            DOI: 10.1103/PhysRevD.58.104016                                        arXiv: gr-qc/9701003
      [459] L. Smarr: Mass Formula for Kerr Black Holes, Phys. Rev. Lett. 30, 71 (1973); errata in Phys.
            Rev. Lett. 30, 521 (1973)
            DOI: 10.1103/PhysRevLett.30.71 and 10.1103/PhysRevLett.30.521 (errata)
      [460] A.A. Starobinsky: Amplification of waves during reflection from a rotating “black hole” (in
            Russian), Zhur. Eksp. Teor. Fiz. 64, 48 (1973); English translation in Soviet Physics JETP
            37, 28 (1973)
            URL: http://jetp.ras.ru/cgi-bin/e/index/e/37/1/p28?a=list
      [461] L.C. Stein and N. Warburton: Location of the last stable orbit in Kerr spacetime, Phys. Rev.
            D 101, 064007 (2020)
            DOI: 10.1103/PhysRevD.101.064007                                          arXiv: 1912.07609
      [462] W. Stein and D. Joyner: SAGE: System for Algebra and Geometry Experimentation, Com-
            mun. Comput. Algebra 39, 61 (2005).
      [463] B. Steinmüller, A.R. King, and J.-P. Lasota: Radiating bodies and naked singularities, Phys.
            Lett. A 51, 191 (1975)
            DOI: 10.1016/0375-9601(75)90525-3
      [464] N. Straumann: General Relavity, 2nd edition, Springer, Dordrecht (2013)
            DOI: 10.1007/978-94-007-5410-2
      [465] A. Strominger and C. Vafa: Microscopic origin of the Bekenstein-Hawking entropy, Phys.
            Lett. B 379, 99 (1996)
            DOI: 10.1016/0370-2693(96)00345-0                                    arXiv: hep-th/9601029
      [466] Z. Stuchlík: The Radial Motion of Photons in Kerr Metric, Bul. Astron. Inst. Czechoslovakia
            32, 40 (1981)
            https://ui.adsabs.harvard.edu/abs/1981BAICz..32...40S
      [467] D. Sudarsky and R.M. Wald: Extrema of mass, stationarity, and staticity, and solutions to
            the Einstein–Yang–Mills equations, Phys. Rev. D 46, 1453 (1992)
            DOI: 10.1103/PhysRevD.46.1453
      [468] D. Sudarsky and R.M. Wald: Mass formulas for stationary Einstein-Yang-Mills black holes
            and a simple proof of two staticity theorems, Phys. Rev. D 47, R5209(R) (1993)
            DOI: 10.1103/PhysRevD.47.R5209                                         arXiv: gr-qc/9305023
                                          BIBLIOGRAPHY                                               815
[469] J.L. Synge: The Gravitational Field of a Particle, Proc. Roy. Irish Acad. A 53, 83 (1950)
      http://www.jstor.org/stable/20488511
[470] J.L. Synge: The Escape of Photons from Gravitationally Intense Stars, Month. Not. Roy.
      Astron. Soc. 131, 463 (1966)
      DOI: 10.1093/mnras/131.3.463
[471] L.B. Szabados: Quasi-Local Energy-Momentum and Angular Momentum in General Rela-
      tivity, Living Rev. Relativ. 12, 4 (2009)
      DOI: 10.12942/lrr-2009-4
[472] G. Szekeres: On the Singularities of a Riemannian Manifold, Publicationes Mathematicae
      Debrecen 7, 285 (1960); reprinted in Gen. Relativ. Gravit. 34, 2001 (2002)
      DOI: 10.1023/A:1020744914721
[473] F.R. Tangherlini: Schwarzschild field in n dimensions and the dimensionality of space
      problem, Il Nuovo Cimento 27, 636 (1963)
      DOI: 10.1007/BF02784569
[474] C. H. Taubes: Gauge theory on asymptotically periodic 4-manifolds, J. Differential Geom.
      25, 363 (1987).
[475] E. Teo: Spherical Photon Orbits Around a Kerr Black Hole, Gen. Relativ. Gravit. 35, 1909
      (2003)
      DOI: 10.1023/A:1026286607562
[476] S.A. Teukolsky: The Kerr metric, Class. Quantum Grav. 32 124006 (2015)
      DOI: 10.1088/0264-9381/32/12/124006                                        arXiv: 1410.2130
[477] S.A. Teukolsky and W.H. Press: Perturbations of a rotating black hole. III. Interaction of
      the hole with gravitational and electromagnetic radiation, Astrophys. J. 193, 443 (1974)
      DOI: 10.1086/153180
[478] K.S. Thorne: Disk-Accretion onto a Black Hole. II. Evolution of the Hole, Astrophys. J. 191,
      507 (1974)
      DOI: 10.1086/152991
[479] K.S. Thorne: Black Holes and Time Warps, Norton, New York (1994).
[480] K.S. Thorne and D. MacDonald: Electrodynamics in curved spacetime: 3 + 1 formulation,
      Month. Not. Roy. Astron. Soc. 198, 339 (1982)
      DOI: 0.1093/mnras/198.2.339
[481] K.S. Thorne, W.H. Zurek, and R.H. Price: The Thermal Atmosphere of a Black Hole, in Black
      holes: The Membrane Paradigm, edited by K.S. Thorne, R.H. Price and D.A. MacDonald,
      Yale University Press, New Haven (1986), p. 280.
[482] R.C. Tolman: Effect of Inhomogeneity on Cosmological Models, Proc. Nat. Acad. Sci. USA
      20, 169 (1934); reprinted in Gen. Relativ. Gravit. 29, 935 (1997)
      DOI: 10.1023/A:1018891418565
[483] P.K. Townsend: Black Holes, Lecture notes for a course taught in Part III of the Cambridge
      University Mathematical Tripos (1997)
      arXiv: gr-qc/9707012
[484] W.G. Unruh: Notes on black-hole evaporation, Phys. Rev. D 14, 870 (1976)
      DOI: 10.1103/PhysRevD.14.870
816                                      BIBLIOGRAPHY
      [485] W.G. Unruh: Universal coordinates for Schwarzschild black holes (2014)
            arXiv: 1401.3393
      [486] P.C. Vaidya: The External Field of a Radiating Star in General Relativity, Current Science
            12, 183 (1943); reprinted in Gen. Relativ. Gravit. 31, 119 (1999)
            DOI: 10.1023/A:1018871522880
      [487] P.C. Vaidya: The Gravitational Field of a Radiating Star, Proc. Indian Acad. Sci. A33, 264
            (1951); reprinted in Gen. Relativ. Gravit. 31, 121 (1999)
            DOI: 10.1023/A:1018875606950
      [488] P.C. Vaidya: ’Newtonian’ Time in General Relativity, Nature 171, 260 (1953); reprinted in
            Gen. Relativ. Gravit. 31, 137 (1999)
            DOI: 10.1023/A:1018827723788
      [489] K. Van Aelst: Note on equatorial geodesics in circular spacetimes, Class. Quantum Grav.
            37, 207001 (2020)
            DOI: 10.1088/1361-6382/aba80c
      [490] K. Van Aelst, E. Gourgoulhon, P. Grandclément, and C. Charmousis: Hairy rotating black
            holes in cubic Galileon theory, Class. Quantum Grav. 37, 035007 (2020)
            DOI: 10.1088/1361-6382/ab6391                                            arXiv: 1910.08451
      [491] S.U. Viergutz: Image generation in Kerr geometry. I. Analytical investigations on the
            stationary emitter-observer problem
            Astron. Astrophys. 272, 355 (1993)
            https://ui.adsabs.harvard.edu/abs/1993A&A...272..355V
      [492] F.H. Vincent, E. Gourgoulhon, and J. Novak: 3+1 geodesic equation and images in numerical
            spacetimes, Class. Quantum Grav. 29, 245005 (2012)
            DOI: 10.1088/0264-9381/29/24/245005                                       arXiv: 1208.3927
      [493] F.H. Vincent, T. Paumard, E. Gourgoulhon, and G. Perrin: GYOTO: a new general rela-
            tivistic ray-tracing code, Class. Quantum Grav. 28, 225011 (2011)
            DOI: 10.1088/0264-9381/28/22/225011                                       arXiv: 1109.4769
      [494] F. H. Vincent, M. Wielgus, M. A. Abramowicz, E. Gourgoulhon, J.-P. Lasota, T. Paumard,
            and G. Perrin: Geometric modeling of M87* as a Kerr black hole or a non-Kerr compact
            object, Astron. Astrophys. 646, A37 (2021)
            DOI: 10.1051/0004-6361/202037787                                         arXiv: 2002.09226
      [495] C. V. Vishveshwara: Generalization of the “Schwarzschild Surface” to Arbitrary Static and
            Stationary Metrics, J. Math. Phys. 9, 1319 (1968)
            DOI: 10.1063/1.1664717
      [496] M. Visser: Hawking Radiation without Black Hole Entropy, Phys. Rev. Lett. 80, 3436 (1998)
            DOI: 10.1103/PhysRevLett.80.3436                                     arXiv: gr-qc/9712016
      [497] M. Visser: The Kerr spacetime — a brief introduction, in The Kerr spacetime, edited by
            D.L. Wiltshire, M. Visser, and S.M. Scott, Cambridge Univ. Press, Cambridge (2009), p. 3
            arXiv: 0706.0622
      [498] M. Visser and Z. Yan: Properties of dynamical black hole entropy, J. High Energ. Phys.
            10(2024), 029 (2024)
            DOI: 10.1007/JHEP10(2024)029                                             arXiv: 2403.07140
                                          BIBLIOGRAPHY                                               817
[499] R.M. Wald: General relativity, University of Chicago Press, Chicago (1984).
[500] R.M. Wald: On identically closed forms locally constructed from a field, J. Math. Phys. 31,
      2378 (1990).
[501] R.M. Wald: Black Holes and Thermodynamics, in Black Hole Physics, edited by V. De Sab-
      bata and Z. Zhang, Kluwer Academic Publisher, Dordrecht (1992), p. 55; DOI: 10.1007/978-
      94-011-2420-1_2.
[502] R.M. Wald: Black hole entropy is the Noether charge, Phys. Rev. D 48, R3427 (1993); arXiv:
      gr-qc/9307038.
[503] R.M. Wald: Quantum Field Theory in Curved Spacetime and Black Hole Thermodynamics,
      University of Chicago Press, Chicago (1994)
      URL: https://press.uchicago.edu/ucp/books/book/chicago/Q/bo3684008.html
[504] R.M. Wald: “Nernst theorem” and black hole thermodynamics, Phys. Rev. D 56, 6467 (1997);
      arXiv: gr-qc/9704008.
[505] R.M. Wald: The thermodynamics of black holes, Living Rev. Relativ. 4, 6 (2001).
[506] R.M. Wald: The Entropy of Black Holes, in Black Holes Inside and Out 2024: visions for the
      future of black hole physics, edited by L. Buoninfante, R. Carballo-Rubio, V. Cardoso, F. Di
      Filippo and A. Eichhorn (2024), p. 29; arXiv: 2410.14414.
[507] R.M. Wald and A. Zoupas: General definition of “conserved quantities” in general relativity
      and other theories of gravity, Phys. Rev. D 61, 084027 (2000); arXiv: gr-qc/9911095.
[508] M. Walker and R. Penrose: On Quadratic First Integrals of the Geodesic Equations for Type
      {22} Spacetimes, Commun. Math. Phys. 18, 265 (1970).
[509] R.C. Walker, P.E. Hardee, F.B. Davies, C. Ly, and W. Junor: The Structure and Dynamics
      of the Subparsec Jet in M87 Based on 50 VLBA Observations over 17 Years at 43 GHz,
      Astrophys. J. 855 128 (2018); arXiv: 1802.06166.
[510] A.C. Wall: Ten proofs of the generalized second law, J. High Energ. Phys. 06(2009), 021
      (2009); arXiv: 0901.3865.
[511] A.C. Wall: A Second Law for higher curvature gravity, Int. J. Mod. Phys. D 24, 1544014
      (2015); arXiv: 1504.08040.
[512] D. Wallace: The case for black hole thermodynamics part I: Phenomenological thermo-
      dynamics, Studies in History and Philosophy of Modern Physics 64, 52 (2018); arXiv:
      1710.02724.
[513] D. Wallace: The case for black hole thermodynamics part II: Statistical mechanics, Studies
      in History and Philosophy of Modern Physics 66, 103 (2019); arXiv: 1710.02725.
[514] J.L. Walsh, A.J. Barth, L.C. Ho, and M. Sarzi: The M87 Black Hole Mass from Gas-dynamical
      Models of Space Telescope Imaging Spectrograph Observations, Astrophys. J. 770, 86 (2013);
      arXiv: 1304.7273.
[515] N. Warburton, L. Barack, and N. Sago: Isofrequency pairing of geodesic orbits in Kerr
      geometry, Phys. Rev. D 87, 084012 (2013); arXiv: 1301.3918.
[516] B. Waugh and K. Lake: Double-null coordinates for the Vaidya metric, Phys. Rev. D 34,
      2978 (1986).
[517] H. Weyl: Zur Gravitationstheorie, Annalen der Physik 54, 117 (1917).
818                                       BIBLIOGRAPHY
      [518] H. Weyl: Über die statischen kugelsymmetrischen Lösungen von Einsteins “kosmologischen”
            Gravitationsgleichungen, Physikalische Zeitschrift 20, 31 (1919).
      [519] J.A. Wheeler: Feynman and Jacob Bekenstein, Web of Stories (1996); URL: https://www.
            webofstories.com/play/john.wheeler/84.
      [520] J.A. Wheeler: Geons, Black Holes, and Quantum Foam: a Life in Physics, Norton, New
            York (1998).
      [521] M. Wielgus et al.: Monitoring the Morphology of M87* in 2009-2017 with the Event Horizon
            Telescope, Astrophys. J. 901, 67 (2020); arXiv: 2009.11842.
      [522] D.C. Wilkins: Bound Geodesics in the Kerr Metric, Phys. Rev. D 5, 814 (1972).
      [523] J.W. York: Kinematics and dynamics of general relativity, in Sources of gravitational
            radiation, edited by L.L. Smarr, Cambridge University Press, Cambridge (1979), p. 83.
      [524] H. Yoshino, K. Takahashi, and K. Nakao: How does a collapsing star look?, Phys. Rev. D
            100, 084062 (2019); arXiv: 1908.04223.
      [525] A. Zaffaroni: AdS black holes, holography and localization, Living Rev. Relativ. 23, 2 (2020);
            arXiv: 1902.07176.
      [526] Ya.B. Zeldovich: Semiclosed Worlds in the General Theory of Relativity (in Russian), Zhur.
            Eksp. Teor. Fiz. 43, 1037 (1962); English translation in Soviet Physics JETP 16, 732 (1963).
      [527] Ya.B. Zeldovich: Generation of Waves by a Rotating Body (in Russian), Pisma Red. Zhur.
            Eksp. Teor. Fiz. 14, 270 (1971); English translation in Soviet Physics JETP Letters 14, 180
            (1971).
      [528] Ya.B. Zeldovich: Amplification of Cylindrical Electromagnetic Waves Reflected from a
            Rotating Body (in Russian), Zhur. Eksp. Teor. Fiz. 62, 2076 (1972); English translation in
            Soviet Physics JETP 35, 1085 (1972)
            URL: http://jetp.ras.ru/cgi-bin/e/index/e/35/6/p1085?a=list
      [529] Ya.B. Zeldovich and I.D. Novikov: Relativistic Astrophysics. I. (in Russian), Uspekhi
            Fizicheskikh Nauk 84, 377 (1964); English translation in Soviet Physics Uspekhi 7, 763
            (1965).
      [530] Ya.B. Zeldovich and I.D. Novikov: Relativistic Astrophysics, vol. 1: Stars and Relativity,
            University of Chicago Press, Chicago (1971).
      [531] Ya.B. Zeldovich and I.D. Novikov: Relativistic Astrophysics, vol. 2: The Structure and
            Evolution of the Universe, University of Chicago Press, Chicago (1983).
      [532] P. Zimmermann et al.: Computational Mathematics with SageMath, SIAM (2018); freely
            available from https://www.sagemath.org/sagebook/.
      [533] R.L. Znajek: Black hole electrodynamics and the Carter tetrad, Month. Not. Roy. Astron.
            Soc. 179, 457 (1977).
Index