Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
I N M AT H E M AT I C S 201
Geometric
Relativity
Dan A. Lee
Geometric
Relativity
GRADUATE STUDIES
I N M AT H E M AT I C S 201
Geometric
Relativity
Dan A. Lee
EDITORIAL COMMITTEE
Daniel S. Freed (Chair)
Bjorn Poonen
Gigliola Staffilani
Jeff A. Viaclovsky
2010 Mathematics Subject Classification. Primary 53-01, 53C20, 53C21, 53C24, 53C27,
53C44, 53C50, 53C80, 83C05, 83C57.
Copying and reprinting. Individual readers of this publication, and nonprofit libraries acting
for them, are permitted to make fair use of the material, such as to copy select pages for use
in teaching or research. Permission is granted to quote brief passages from this publication in
reviews, provided the customary acknowledgment of the source is given.
Republication, systematic copying, or multiple reproduction of any material in this publication
is permitted only under license from the American Mathematical Society. Requests for permission
to reuse portions of AMS publication content are handled by the Copyright Clearance Center. For
more information, please visit www.ams.org/publications/pubpermissions.
Send requests for translation rights and licensed reprints to reprint-permission@ams.org.
2019
c by the author. All rights reserved.
Printed in the United States of America.
∞ The paper used in this book is acid-free and falls within the guidelines
established to ensure permanence and durability.
Visit the AMS home page at https://www.ams.org/
10 9 8 7 6 5 4 3 2 1 24 23 22 21 20 19
For my parents, Rupert and Gloria Lee
Contents
Preface ix
vii
viii Contents
ix
x Preface
we will of course still need to use Sobolev spaces for our proofs.) The reason
for this is to prevent having to discuss what the optimal regularity is for the
hypotheses of each theorem. The reader will have to refer to the research
literature if interested in more precise statements.
When we refer to concepts or ideas that are especially common or well
known, instead of citing a textbook, we will sometimes cite Wikipedia. The
reasoning is that in today’s world, although Wikipedia is rarely the best
source, it is often the fastest source. Here, the reader can get a quick intro-
duction (or refresher) on the concept and then seek a more traditional math-
ematical text as desired. These citations will be marked with the name of
the relevant article. For example, the citation [Wik, Riemannian geometry]
means that the reader should visit
http://en.wikipedia.org/wiki/Riemannian geometry.
There are many exercises sprinkled throughout the text. Some of them
are routine computations of facts and formulas that are used heavily through-
out the text. Others serve as simple “reality checks” to make sure the reader
understands statements of definitions or theorems on a basic level. Finally,
there are some exercises (and “check this” statements) that ask the reader
to fill in the details of some proof—these are meant to mimic the sort of
routine computations that tend to come up in research.
The motivation for writing this book came from the fact that, to the
author’s knowledge, there is no graduate-level text that gives a full account
of the positive mass theorem and related theorems. This presents an un-
necessarily high barrier to entry into the field, despite the fact that the core
material in this book is now quite well understood by the research com-
munity. A fair amount of the material in Part 1 was presented as a series
of lectures during the Fall of 2015 as part of the General Relativity and
Geometric Analysis seminar at Columbia University.
I would like to thank Hubert Bray, who is the person most responsible for
shepherding me into this field of research. He taught me much of what I know
about the subject matter of this book and strongly shaped my intuition and
perspective. He also encouraged me to write this book and came up with the
title. I thank Richard Schoen, my doctoral advisor, for teaching me about
geometric analysis and supporting my research in geometric relativity. I have
also learned a great deal about this subject from him through many private
conversations, unpublished lecture notes, and talks I have attended over the
years. Similarly, I thank my other collaborators in the field, who have taught
me so much throughout my career: André Neves, Jeffrey Jauregui, Christina
Sormani, Michael Eichmair, Philippe LeFloch, and especially Lan-Hsuan
Huang, who kindly discussed certain technical issues related to this book.
xii Preface
Riemannian geometry
Chapter 1
Scalar curvature
3
4 1. Scalar curvature
gij := vi , vj ,
The notation g ij indicates the matrix inverse of the matrix gij so that
we have g ik gkj = gjk g ki = δji . Note that this g ij also represents the inner
product on T ∗ M described earlier, so that if ωi and ηi are 1-forms, then
ω, η = g ij ωi ηj . Also, the inverse of , denoted by : T ∗ M −→ T M , maps
ωi to the vector field (ω )j = g ij ωi (now indexed by j). This is an example
of “raising indices.”
Recall that, given a smooth function f , we can define the 1-form df via
the equation df (X) = Xf , where the right-hand side is the action of an
arbitrary vector field X on f . Given a metric, we define the gradient of f
to be ∇f := (df ) . Note that
∇X f := ∇f, X = df (X) = Xf = LX f,
where the last term is the Lie derivative, so that we have many notations
for the same (simple) thing. Note that although ∇f depends on the choice
of metric, ∇X f does not. If there are chosen coordinates x1 , . . . , xn , we will
sometimes use the notation ∂f to mean the (locally defined) vector field
whose components are
∂f
∂i f := = fi .
∂xi
Indeed, we will sometimes write the vector fields ∂x∂ 1 , . . . , ∂x∂n as ∂1 , . . . , ∂n
for the purpose of readability.
More generally, one can use the same idea to define the Lie derivative of
other tensor fields besides vector fields. (By tensor field, we mean a section
of a bundle that is a tensor product of some number of copies of T M and
some number of copies of T ∗ M .) This is because a local diffeomorphism also
induces an isomorphism between fibers of the tensor bundle. One can show
that with this definition, the Lie derivative obeys appropriate Leibniz rules
with respect to products of tensors. For example, given vector fields X, Y, Z
and a 1-form ω, we have facts such as LX (Y ⊗Z) = (LX Y ) ⊗Z +Y ⊗(LX Z)
and ∇X (ω(Y )) = (LX ω)(Y ) + ω(LX Y ).
We say that X is a Killing field if the local diffeomorphisms Φt that it
generates are isometries, meaning that for all tangent vectors v, w ∈ Tp M
we have dΦt |p (v), dΦt |p (w) = v, w.
Exercise 1.4. Show that X is a Killing field for the metric g if and only if
LX g = 0,
where g is thought of as a (0, 2) tensor.
Much like the Lie derivative, one can extend the Levi-Civita connection
to more general tensor fields via appropriate Leibniz rules. As is standard,
given a tensor such as Tjk in a local frame, the notation ∇i Tjk , or even
more briefly Tjk;i , is usually taken to mean the “ijk-component” of ∇T in
that frame. (Or more formally, the indices can just be used as abstract
placeholders [Wik, Abstract index notation].)
Exercise 1.6. Show that the metric g is always constant with respect to
its own Levi-Civita connection, that is, ∇g = 0. Use this to show that with
respect to a local frame, we have
(LX g)ij = ∇i Xj + ∇j Xi = Xj;i + Xi;j
for any vector field X.
1.1.5.
Curvature. The Riemann curvature tensor Riem is an element of
C ∞ ( 4 T ∗ M ) defined so that for all vector fields X, Y, Z, W ∈ C ∞ (T M ),
we have
Riem(X, Y, Z, W ) := −∇X ∇Y Z + ∇Y ∇X Z + ∇[X,Y ] Z, W .
Given a local frame v1 , . . . , vn , we can represent Riem in that frame by
Rijk := Riem(vi , vj , vk , v ).
The tensor Riem is antisymmetric in the first pair of inputs and the last
pair of inputs, and symmetric when interchanging
those pairs, or in other
∞
2 ∗ 2 ∗
words, Riem ∈ C ( T M ) ( T M ) . There is one more symmetry,
known as the first Bianchi identity:
Rijk + Rjki + Rkij = 0.
The second Bianchi identity concerns the derivatives of the curvature tensor:
Rijk;m + Rijm;k + Rijmk; = 0.
Recall that the semicolons denote covariant differentiation.
Given an orthonormal frame e1 , . . . , en , the Riemann curvature tensor
can be computed in terms of the connection 1-forms ωji . Specifically,
(1.3) Riem(ei , ej , ek , e ) = −Ω(ei , ej )ek , e ,
where Ω is the End(T M )-valued 2-form defined by
Ω = dω + ω ∧ ω,
or more precisely, Ω = Ωij ei ⊗ θj , where Ωij is the 2-form given by
Ωij = dωji + ωki ∧ ωjk ,
and θ1 , . . . , θn is the orthonormal dual coframe of e1 , . . . , en .
Remark 1.9. Other texts define the Riemann curvature tensor with three
lower indices and one raised index, instead of four lower and zero raised
as we have done here. This is an insignificant difference since the metric
provides a natural way to raise and lower indices, as described earlier.
More significantly, many texts use the opposite sign convention for the
definition of the Riemann curvature tensor. This is significant because the
12 1. Scalar curvature
Recall that the conjugate radius is the supremum of all r with the prop-
erty that any two conjugate points along a unit speed geodesic are at least
r units apart. Or equivalently, it is the supremum of all r with the property
that the exponential map at every p ∈ M has nonsingular derivative at every
point of the ball Br (0) ⊂ Tp M .
1.1. Notation and review of Riemannian geometry 15
Proof. We follow the proof in [TW14]. Although the proof is fairly ele-
mentary and accessible, it uses background material that will not be used
much in the rest of this book. Specifically, we assume familiarity with
the index form for geodesics. Recall that the energy functional of a curve
γ : [0, a] −→ M is
a
E(γ) = |γ (t)|2 dt.
0
Let γ : [0, a] −→ M be a unit speed geodesic, and let X be a vector field
defined along γ such that X(0) = X(a) = 0, and consider a smooth family
of curves γs with γ0 = γ whose deformation vector field is X. Recall that
the index form along γ may be defined to be the second variation of the
energy functional in the X direction, that is,
d2
(1.4) I(X, X) := 2 E(γs ),
ds s=0
∂
where X = ∂s γ . Recall that this can be computed to be
s=0 s
a
2
(1.5) I(X, X) = |X (t)| − Riem(γ (t), X(t), γ (t), X(t)) dt.
0
Now assume the hypotheses of the theorem, and let a be the conjugate
radius. Let p ∈ M , let u be a unit vector in Tp M , and let γ : [0, a] −→ M be
the unique geodesic with γ(0) = p and γ (0) = u. Recall the fact that γ has
no conjugate points before reaching a means that γ locally minimizes the
energy of paths from γ(0) to γ(a). This means that for any vector field X
along γ vanishing at the endpoints, we have I(X, X) ≥ 0. In particular, if
we choose V1 , . . . , Vn−1 so that γ , V
1 , . . . , Vn−1 forms a parallel orthonormal
basis along γ and set Xi = sin πt a Vi , then we have I(Xi , Xi ) ≥ 0 for each
i from 1 to n − 1. Summing this inequality over i and using (1.5) yields
a
π2 2 πt
(n − 1) − Ric(γ (t), γ (t)) sin dt ≥ 0.
2a 0 a
(So far this is the exact same argument used to prove the Bonnet-Myers
Theorem.) The next step is to integrate this inequality over all possible
starting pairs (p, u) determining γ. That is, we integrate over the unit
sphere bundle SM lying inside the tangent bundle (which has a natural
metric coming from g). This gives us
a
π2 πt
(n − 1) ωn−1 |M | − Ric(γ (t), γ (t)) dμSM sin2 dt ≥ 0,
2a 0 SM a
where γ itself should now be thought of asdepending on the point (p, u) ∈
SM . The key point is that the integral SM Ric(γ (t), γ (t)) dμSM is ac-
tually independent of t since the geodesic flow is a diffeomorphism of SM
16 1. Scalar curvature
This was later generalized by J. Bland and M. Kalka to show that non-
compact manifolds of dimension at least 3 admit complete metrics of con-
stant negative scalar curvature [BK89].
In fact, it turns out that a much more striking theorem is true.
Theorem 1.21 (Lohkamp [Loh94]). Every manifold of dimension at least
3 admits a complete metric with negative Ricci curvature.
The n = 3 case is a special case of Theorem 1.29. Schoen and Yau proved
the result in dimensions less than 8 [SY79d], and this is the case that we will
discuss in greater detail in Chapter 2. Soon afterward, Gromov and Lawson
discovered a proof that works whenever M is spin [GL80a]. A result of
Nathan Smale [Sma93] implies the n = 8 case. In recent years, proofs in
higher dimensions have appeared in a preprint by Schoen and Yau [SY17]
and a series of preprints by Lohkamp [Loh06, Loh15c, Loh15a, Loh15b].
1.2. A survey of scalar curvature results 21
4
The choice of exponent n−2 is immaterial to the definition, but it turns
out to be a convenient choice for purposes of analysis.
Exercise 1.34. Let (M n , g) be a Riemannian manifold, and let u be a
4
smooth positive function on M . If g̃ = u n−2 g, show that
− n−2
n+2 4(n − 1)
(1.6) Rg̃ = u − Δg u + Rg u .
n−2
Hint: Use Exercise 1.12.
Minimal hypersurfaces
A(X, Y ) := (∇X Ỹ )⊥ ,
the normal component of ∇X Ỹ (that is, its orthogonal projection to the
normal space Np Σ), where Ỹ is any extension of Y to a vector field on M .
Recall that A is symmetric, that is, for all X, Y ∈ Tp Σ,
A(X, Y ) = A(Y, X).
Note that in the literature, many authors use A or II instead of A.
Equivalently, we can define the shape operator (or Weingarten map) S ∈
C ∞ (N ∗ Σ ⊗ T ∗ Σ ⊗ T Σ) so that for any X ∈ Tp Σ and ν ∈ Np Σ,
23
24 2. Minimal hypersurfaces
The mean curvature vector H is the trace of A over the tangential di-
rections, that is, at p ∈ Σ,
m
H := A(ei , ei ),
i=1
where e1 , . . . , em is any orthonormal basis of the tangent space Tp Σ. In the
hypersurface case, we can define the mean curvature scalar
H := H, −ν = trh A = tr S.
Exercise 2.3. Given a hypersurface Σ in (M, g) with normal ν, prove that
for any smooth function f ,
ΔΣ f = Δg f − ∇ν ∇ν f + H, ∇f .
2.1. Basic definitions and the Gauss-Codazzi equations 25
The intrinsic and extrinsic curvatures of Σ and the ambient curvature are
all related to each other according to the Gauss-Codazzi equations [Wik,
Gauss-Codazzi equations], which we will separate into what we call the
Gauss equation and the Peterson-Codazzi-Mainardi equation.
Theorem 2.5 (Gauss equation). Let Σ be a submanifold of (M, g). For any
p ∈ Σ and any tangent vectors X, Y, Z, W ∈ Tp Σ, we have
RiemM (X, Y, Z, W ) = RiemΣ (X, Y, Z, W ) + A(X, W ), A(Y, Z)
− A(X, Z), A(Y, W ).
Theorem 2.6 (Peterson-Codazzi-Mainardi equation). Let Σ be a subman-
ifold of (M, g). For any p ∈ Σ, any tangent vectors X, Y, Z ∈ Tp Σ, and any
normal vector ν ∈ N Σ,
RiemM (X, Y, Z, ν) = (∇Y A)(X, Z) − (∇X A)(Y, Z), ν.
Writing the volume as an integral over the fixed space Σ makes it conceptu-
ally more straightforward to compute the derivative. From Proposition 1.3,
we know that
∂ 1
dμt = (trh ḣ) dμh ,
∂t t=0 2
∂
where ḣ = ht . So in order to find the first variation of the volume
∂t t=0
measure, it suffices to compute trh ḣ.
Let e1 , . . . , em be a local orthonormal frame for (Σ, h). Let (ht )ij be
the expression for ht with respect to this fixed choice of frame. Define
ei (t) := dΦt (ei ) to be the push forward of ei , which lives on Σt . By definition
1 One might suspect that this begs the question of the manifold structure on Diff (M ), but
0
one can define this smoothness without too much fuss. You may want to try it yourself.
28 2. Minimal hypersurfaces
of ht and Xt , we have
∂ ∂
(ht )ij = ei , ej ht
∂t ∂t
∂
= Φ∗t ei (t), ej (t)g
∂t
= Φ∗t (Xt ei (t), ej (t)g )
= Φ∗t (∇Xt ei (t), ej (t)g + ei (t), ∇Xt ej (t)g )
= Φ∗t ∇ei (t) Xt , ej (t)g + ei (t), ∇ej (t) Xt g ,
where we used properties of the Levi-Civita connection ∇ in the last two
equalities. (Take note of how the torsion-free property was used, and why
that step is valid.) In particular, we have
∂
(2.4) (ht )ij = ∇ei X, ej + ei , ∇ej X.
∂t t=0
Since the trace of the above expression is just 2 divΣ X, we obtain the fol-
lowing.
Lemma 2.8 (First variation of the volume measure). The linearization of
the volume measure at Σ in the direction of the vector field X is
∂
(2.5) D(dμ)|Σ (X) := dμht = (divΣ X) dμΣ .
∂t t=0
d ⊥
Dμ|Σ (X) := μ(Σt ) = − H, X dμΣ + X̂, η dμ∂Σ ,
dt t=0 Σ ∂Σ
Note that this formula does not assume that X is a normal variation,
nor that X vanishes at the boundary ∂Σ. (Do you see why the boundary
contribution is intuitive?) From this formula, one easily sees that if H
is identically zero, then the volume of Σ is stationary with respect to all
possible deformations of Σ that preserve the boundary. For this reason
any submanifold Σ with vanishing H is called a minimal submanifold of
(M, g). Despite the nomenclature, a minimal submanifold need not be a
local minimum of the volume functional (in the space of all submanifolds
with the same boundary) but only a critical point.
In order to assess whether we do have a local minimum, we must compute
d2
the second derivative, that is, the second variation of volume, dt2 μ(Σt ).
t=0
First, note that Proposition 2.10 tells us that for any t,
∂
dμt = Φ∗ (divΣt Xt ) dμt .
∂t
From this we can see that
∂ 2 ∂ ∗ 2
(2.8) dμt = Φ (divΣt Xt ) + (divΣ X) dμΣ .
∂t2 t=0 ∂t t=0
30 2. Minimal hypersurfaces
So we focus on
∂ ∗
Φ (divΣt Xt ) = Φ∗t (Xt (divΣt Xt )).
∂t t
Recalling that ei (t) := dΦt (ei ) and using Exercise 2.4 to expand the right
side, we have
m
Xt (divΣt Xt ) = Xt ei (t), ej (t)∇ei (t) Xt , ej (t)
i,j=1
m
(2.9) = Xt ei (t), ej (t) ∇ei (t) Xt , ej (t)
i,j=1
m
(2.10) + ei (t), ej (t)∇Xt ∇ei (t) Xt , ej (t)
i,j=1
m
(2.11) + ei (t), ej (t)∇ei (t) Xt , ∇Xt ej (t).
i,j=1
We will handle each of the three terms above separately. We start with
the first term (2.9). Recall the linear algebra fact that for a matrix-valued
function B(t),
d
B(t)−1 = −B(t)−1 B (t)B(t)−1 .
dt
Using this together with equation (2.4), we obtain
∂
Xt ei (t), ej (t) t=0 = Φ∗ (ei (t), ej (t))
∂t t=0 t
∂
= hij
∂t t=0 t
m
ik ∂
=− h (ht )k hj
∂t t=0
k,=1
m
=− hik (∇ek X, e + ek , ∇e X)hj
k,=1
= −∇ei X, ej − ei , ∇ej X.
Therefore the total contribution of the first term (2.9) at t = 0 is
m
m
2
(2.12) − |(∇ei X) | − ei , ∇ej X∇ei X, ej .
i=1 i,j=1
Finally, by combining the terms (2.12), (2.13), and (2.14), and inserting
them into (2.8), we obtain the following.
This formula is quite general but perhaps not so easy to use. The role of
Z might seem a bit mysterious at first. In computing the second derivative
along the path Φt , the computation will depend on more than just the
“tangent vector” X to this curve at the point Σ. It may be useful to think
about the finite-dimensional analog, where the situation is clearer.
We will now specialize to the case of a two-sided hypersurface Σn−1 ⊂
M n with a distinguished unit normal ν and rewrite the formula in terms of
the decomposition
X = X̂ + ϕν
into its tangential and normal components. We can also decompose
Z = Ẑ + ζν.
have
n−1
|(∇ei X)⊥ |2 − Riem(X, ei , X, ei )
i=1
n−1
= |(∇ei (ϕν))⊥ |2 − ϕ2 Riem(ν, ei , ν, ei )
i=1
n−1
= |∇ei ϕ|2 − ϕ2 Ric(ν, ν)
i=1
= |∇ϕ|2 − ϕ2 Ric(ν, ν),
where we used the fact that ∇ei ν is tangential. Next, we have
n−1
n−1
ei , ∇ej X∇ei X, ej = ei , ∇ej (ϕν)∇ei (ϕν), ej
i,j=1 i,j=1
n−1
= ϕ2 ei , ∇ej ν∇ei ν, ej
i,j=1
= ϕ2 |A|2 .
Finally, by equation (2.6), we have
divΣ Z = Hζ + divΣ Ẑ,
(divΣ X)2 = H 2 ϕ2 .
Observe that the stability operator is the same thing as the first variation
of mean curvature (Exercise 2.13) under normal variation. We can also
remove the troublesome Ricci term in the expression using the traced Gauss
equation (Corollary 2.7), which tells us that
1
Ric(ν, ν) = (RM − RΣ − |A|2 + H 2 ).
2
Inserting this into the definition of LΣ (see also Exercise 2.13), we obtain
the useful formula
1
(2.15) DH|Σ (ϕν) = LΣ ϕ = −ΔΣ ϕ + (RΣ − RM − |A|2 − H 2 )ϕ.
2
When Σ is minimal, H vanishes, and we see that the stability inequality for
a minimal hypersurface can be restated as
1
(2.16) |∇ϕ| + (RΣ − RM − |A| )ϕ dμΣ ≥ 0.
2 2 2
Σ 2
We can now start to see the connection between minimal surfaces and
scalar curvature. The following observation was first made by R. Schoen
and S.-T. Yau.
Proposition 2.18 (Schoen-Yau [SY79b]). If (M, g) is a 3-manifold with
positive scalar curvature, then every stable, two-sided closed minimal surface
Σ in M must be a sphere or a projective plane. If M is orientable, then Σ
must be a sphere.
We close this section with the general second variation formula, in which
we do not assume that X is normal or vanishes at the boundary, nor that
Σ is minimal.
d2
μ(Σt ) = |∇ϕ|2 − (Ric(ν, ν) + |A|2 − H 2 )ϕ2
dt2 t=0 Σ
+H(ζ − 2∇X̂ ϕ + A(X̂, X̂)) dμΣ
+ 2HϕX̂ − 2ϕS(X̂) + (divΣ X̂)X̂ − ∇ ˆ X̂ + Ẑ, η dμ∂Σ .
X̂
∂Σ
Proof. The proof is rather involved, though the individual steps are elemen-
tary. The reader may wish to skip this proof. Throughout the computation,
we assume that the ei are parallel at our point of interest. We begin with
Proposition 2.11 and we split those terms into four parts:
m
|(∇ei X)⊥ |2 − Riem(X, ei , X, ei )
i=1
m
− ei , ∇ej X∇ei X, ej + divΣ Z + (divΣ X)2
i,j=1
= CZ + Cperp + 2Ccross + Ctan .
Here the terms on the right side are defined as follows. CZ = divΣ Z is just
the contribution from Z, which easily leads to the desired ζ and Ẑ terms
via (2.6). Other than that term, the rest of the expression is quadratic in
X. Since X = X̂ + ϕν, we can decompose the various X terms into normal-
normal terms Cperp , tangent-tangent terms Ctan , and cross-terms Ccross . In
36 2. Minimal hypersurfaces
n−1
= A(ei , X̂), (∇ei ϕ)ν − ϕRiem(X̂, ei , ν, ei )
(2.17) i=1
n−1
− ei , ∇ej X̂ + ϕS(ei ), ej + Hϕ divΣ X̂
i,j=1
n−1
= −A(∇ϕ, X̂) − ˆ e X̂, ei )
ϕRiem(X̂, ei , ν, ei ) + ϕA(∇ i
i=1
+ Hϕ divΣ X̂,
Claim.
(2.18) Ccross = divΣ (HϕX̂ − ϕS(X̂)) − H∇X̂ ϕ.
n−1
= ˆˆ
A(ei , X̂)2 − Riem(X̂, ei , X̂, ei ) − ei , ∇ + (divΣ X̂)2 .
∇e i X̂ X̂
i=1
Claim.
ˆ X̂] + HA(X̂, X̂).
Ctan = divΣ [(divΣ X̂)X̂ − ∇ X̂
Using the Gauss equation (Theorem 2.5), we can see that the sum of those
two terms gives us
ˆ X̂)
divΣ ((divΣ X̂)X̂ − ∇ X̂
n−1
= ˆ
−RiemΣ (X̂, ei , X̂, ei ) + ∇ X̂, e i + (divΣ X̂)2
[X̂,ei ]
i=1
n−1
= −Riem(X̂, ei , X̂, ei ) + A(X̂, ei )2 − A(X̂, X̂)A(ei , ei )
i=1
ˆ ˆ X̂, ei + (divΣ X̂)2
−∇ ∇i X̂
One difficulty in these theorems, compared to, for example, the much
older theorem of C. B. Morrey, is that one must contend with the conformal
geometry of closed Riemann surfaces rather than disks. Indeed, the case of
minimal spheres, treated in [SU81], is particularly interesting.
Since the minimal surfaces constructed by Theorem 2.20 are certainly
stable, if we combine this theorem with Proposition 2.18, we immediately
obtain Schoen and Yau’s main theorem of [SY79b], which gave the first
topological restriction on positive scalar curvature that was not proved using
spinors. (We will discuss the spinor technique in Chapter 5.)
40 2. Minimal hypersurfaces
One can see that the basic idea behind Corollary 2.21 is actually quite
simple. It is very similar to the reasoning used to prove Synge’s Theorem
[Wik, Synge’s theorem]. The main difficulty lies in Theorem 2.20, rig-
orously establishing the existence of minimal surfaces that one intuitively
hopes should exist. The topological restrictions imposed by Corollary 2.21
are strong enough so that, when combined with current understanding of
the classification of 3-manifolds, they are sufficient to prove Theorem 1.29.
We omit the proof, which is a purely topological argument. (One can deal
with the nonorientable case by passing to the double cover.)
The “integral sum” here means that Σ may be a disjoint union of smooth
minimal hypersurfaces, each of which may have “integer multiplicity.” One
way to rephrase the above theorem is the following. For each nonzero α ∈
H n−1 (M, Z), we can find an (n − 1)-dimensional compact oriented manifold
Σ, not necessarily connected, and a map f : Σ −→ M whose induced map on
homology gives f∗ ([Σ]) = α, such that this pair (Σ, f ) has minimum volume
among all possible pairs satisfying the above, and f is a smooth embedding
on each component of Σ such that the maps from the different components
are either disjoint or exactly the same.
A proof of Theorem 2.22, together with all of the necessary background,
would require an entire book of its own, but we will attempt to tell the
highly abbreviated story of this theorem. A much better telling of this story
can be found in the accessible survey paper of C. De Lellis [DL16]. For a
2.3. Minimizing hypersurfaces and positive scalar curvature 41
full proof of Theorem 2.22, see Leon Simon’s book [Sim83]. To learn about
geometric measure theory, see the book [LY02], or [Mor16] for a lighter
introduction.
The approach to Theorem 2.22 is to use the so-called direct method.
Start with a minimizing sequence of smooth cycles in α, and try to extract
a subsequential limit. This is only possible if one chooses a sufficiently weak
topology and looks for a limit in the completion with respect to that topol-
ogy. We can view our smooth cycles as currents in the sense of G. de Rham
[Wik, Current (mathematics)], that is, as objects that are dual to smooth
differential (n − 1)-forms on M via integration. The topology one chooses is
the weak topology dual to the smooth topology of (n − 1)-forms. Or equiva-
lently in this context, we can use the flat topology [Wik, Flat convergence].
Once we are working in the completion of the space of smooth cycles in
this topology, it is trivial to extract a subsequential limit. This comple-
tion consists of (n − 1)-dimensional integral currents in M . These abstract
objects generalize oriented hypersurfaces but still have a good deal of struc-
ture. This formalism of integral currents was pioneered by Herbert Federer
and Wendell Fleming [FF60]. (A more set-theoretic approach led to early
results of E. R. Reifenberg [Rei60].)
After verifying that this limit indeed minimizes (a suitable generaliza-
tion of) volume, the remaining task is to prove that the limit object that
one obtains is actually a smooth hypersurface. A good analogy (for those
familiar with the topic) is using the direct method to solve the Dirichlet
problem for Laplace’s equation: in that case, we are looking for a function
that minimizes energy, subject to the Dirichlet boundary constraint. If we
choose an energy-minimizing sequence of functions, we cannot expect it to
converge in, say, the C 2 topology, but the sequence will be bounded in the
Sobolev space W 1,2 , and hence we can extract a subsequence converging
weakly in W 1,2 . This procedure allows us to find an energy-minimizer in
W 1,2 , simply by virtue of the formalism. However, in the end we must prove
that this energy-minimizer is actually C 2 and hence a classical solution of
Laplace’s equation.
Exercise 2.23. Given two points p, q in a complete Riemannian manifold
(M, g), prove the well-known fact that there exists a minimizing geodesic
between them using the direct method as follows. Consider an energy-
minimizing sequence of smooth paths from p to q, extract a subsequential
limit in W 1,2 , and then prove that this limit is indeed energy-minimizing
and smooth.
described so far (and hence, the existence theory) works just as well in
higher codimension. However, the regularity theory is far more complicated
in higher codimension, where the analog of Theorem 2.22 is F. Almgren’s
“big regularity theorem” [Alm00], so called because of its difficulty and its
nearly thousand page length. Recently, this work has been streamlined and
simplified by De Lellis and E. Spadaro [DLS14]. See [DL16] for a survey
of this work.
Returning to the codimension one case, W. Fleming first established
Theorem 2.22 for n = 3 [Fle62]. A major breakthrough came from Ennio
De Giorgi [DG61], who first understood how regularity of the tangent cone
at any point could be used to prove local regularity near that point. (The
paper was about sets of finite perimeter, but the insights carry over to the
setting of integral currents.) The tangent cone of an object at a point is
the result of blowing up the object at that point. A tangent cone being
regular just means that it is a hyperplane (which is, of course, the tangent
space of any smooth hypersurface). Consequently, if one can show that the
hyperplane is the only possible minimizing codimension one tangent cone,
then De Giorgi’s argument should imply regularity. Almgren was able to
carry out this argument for n = 4 [Alm66], and later J. Simons was able
to show that all minimizing codimension one tangent cones are hyperplanes
for n < 8 [Sim68], leading to Theorem 2.22.
It turns out that when n ≥ 8, it is indeed possible that a codimension one
minimizing integral current is not smooth, as demonstrated by E. Bombieri,
E. De Giorgi, and E. Giusti [BDGG69], who proved that the Simons cone
[Sim68] is a nontrivial minimizing cone. However, it was shown by H. Fed-
erer that the “singular set” of a codimension one minimizing integral current
has Hausdorff dimension less than or equal to n − 8 [Fed70]. Or in other
words, the minimizing integral current is a smooth hypersurface away from
a singular set of codimension at least 7 inside of it. L. Simon was able
to prove more about the structure of the singular set [Sim95]. Although
these are fairly strong results, the singularities still cause problems for many
geometric arguments.
When n = 8, Federer showed that the singular points are isolated
[Fed70], and then Nathan Smale was able to perturb these isolated sin-
gularities away.
Theorem 2.22 is quite powerful and elegant, but one downside (especially
in contrast to Theorem 2.20, for example) is that it does give you direct con-
trol over the topology of the minimizing hypersurface. However, sometimes
one can still get around this. For example, we can use Theorem 2.22 in place
of Theorem 2.20 to see why the 3-torus T 3 cannot carry a metric of positive
curvature: by topological reasoning, one can find a class α in H 2 (T 3 , Z) that
cannot be represented by a sum of 2-spheres. (We will explain this argu-
ment in more detail below.) Then the area-minimizer in α, whose existence
is guaranteed by Theorem 2.22, must have at least one component that is
not a sphere. But this contradicts Proposition 2.18.
We now turn our attention to proving Theorem 1.30, which implies that
a torus does not admit a metric of positive scalar curvature. We first prove
a higher-dimensional generalization of Proposition 2.18.
Proposition 2.25 (Schoen-Yau). Let (M n , g) be a Riemannian manifold
with positive scalar curvature. Then every stable, closed two-sided minimal
hypersurface of M carries a metric of positive scalar curvature.
The last part of the proof of Proposition 2.25 can be conveniently re-
stated as follows.
Corollary 2.27. Let (Σ, h) be a Riemannian manifold, and let Lh denote
its conformal Laplacian. If the principal eigenvalue of Lh is positive (i.e., Lh
is a strictly positive operator), then h is conformal to a metric with positive
scalar curvature.
2.25. We present the full proof for n ≤ 8, and we will briefly discuss some
of the ideas used in the n > 8 case in the following subsection.
f∗ ([Σ] ∩ (f ∗ ω2 ∪ · · · ∪ f ∗ ωn )) = [M ] ∩ (ω1 ∪ · · · ∪ ωn ),
which is nonzero by assumption. Therefore f ∗ ω2 ∪ · · · ∪ f ∗ ωn is nonzero
in H n−1 (Σ, Z), which allows us to use our induction hypothesis to reach a
contradiction.
(The reader may wish to review the proof of Theorem A.10 for the relevance
of these Rayleigh quotients.) Observe that since Rg = 0, the constant
functions are principal eigenfunctions for Lg . By Exercise 1.10, we also
know that Ricg is divergence-free. Using Exercise 1.18 and the fact Rg = 0,
we now compute
d d 1
At (1, 1) = Rg dμgt
dt t=0 dt t=0 |M |gt M t
1
= (DR|g )(−Ricg ) dμg
|M |g M
1
= (Δg (trg (Ricg )) − divg (divg (Ricg )) + |Ricg |2 ) dμg
|M |g M
1
= |Ricg |2 dμg
|M |g M
> 0.
So there exists > 0 such that for small enough t, we have At (u, u) ≥ t for
all nonzero constant functions u.
Now consider nonzero smooth functions u that are orthogonal to the
constants in L2 (M, g). For such u, we have A0 (u, u) ≥ λ2 (Lg ) > λ1 (Lg ) = 0,
where λ2 is the second eigenvalue of Lg . For small enough t, the quantities
gt , dμgt , and Rgt can only change by a small amount, and thus it is clear
that At (u, u) ≥ 12 λ2 (Lg ) > 0 for small enough t, for all nonzero smooth
functions u orthogonal to the constants in L2 (M, g). Thus for small t > 0,
we see that At (u, u) ≥ t for all nonzero smooth functions u on M . In other
words, λ1 (Lgt ) ≥ t > 0, completing the proof.
48 2. Minimal hypersurfaces
Remark 2.31. Note that the proof above uses a small deformation in the
direction of −Ricg . From a modern perspective, one can simply use Ricci
flow to execute this argument in a clean way: Ricci flow evolves a family of
metric gt according to
∂
g = −2Ric.
∂t
By Exercises 1.18 and 1.10, Ricci flow evolves R according to
∂
(2.19) R = Δg R + 2|Ric|2 ,
∂t
where we have suppressed the dependence on t in the notation. From this
point of view, if we start Ricci flow with initial metric g0 with zero scalar
curvature, then the parabolic strong maximum principle implies that gt must
have nonnegative scalar curvature. Moreover, it can only remain scalar-flat
if the term 2|Ric|2 is identically zero. For the reader interested in learning
more about Ricci flow, there are many great resources, but one particularly
good starting point is the book [CLN06].
2.3.3. Higher dimensions. We now discuss some of the basic ideas used
in Schoen and Yau’s approach to Theorem 2.28 for general dimension in
[SY17]. As we have seen, the problem in dimensions n > 8 is that Theo-
rem 2.22 does not hold. More specifically, minimal hypersurfaces can have
singular sets of codimension 7. The Schoen-Yau approach can be described
as taking these problematic singularities head-on. Another approach to
proving Theorem 2.28 in all dimensions is to attempt to perturb away the
singularities as in Theorem 2.24. Lohkamp follows this approach in higher di-
mensions using his new concept of skin structures [Loh06,Loh15c,Loh15a,
Loh15b].
In the n ≤ 8 proof of Theorem 2.28 described in the previous subsection,
starting with (M, g), we built a nested slicing Σ2 ⊂ · · · ⊂ Σn−1 ⊂ Σn = M
with the property that each Σi is a minimizing hypersurface in Σi+1 with
4
respect to the metric g̃i+1 := (ϕi+1 · · · ϕn−1 ) n−2 gi+1 , where gi+1 is the metric
on Σi+1 induced by the original metric g, and each ϕj > 0 is the principal
eigenfunction of the conformal Laplacian on Σj with respect to the metric
induced by g̃j+1 . As mentioned, for n > 8, the minimizer hypersurfaces may
have codimension 7 singular sets. If one naively attempts to push through
this argument even with the singularities, the main issue is that although
Σn−1 can have a singular set of codimension at least 7, which is quite small,
the next slice Σn−2 might intersect that singular set and consequently it
could potentially have a singular set as large as codimension 6 inside it. In
50 2. Minimal hypersurfaces
where (tj , . . . , tn−1 ) are the coordinates on the torus T n−j . Using this defini-
tion, it is straightforward to see that for any i-dimensional hypersurface Σ in
Σi+1 , Vρi+1 (Σ) is precisely the volume of Σ × T n−i−1 computed with respect
to the metric ĝi+1 . In particular, this means that Σi × T n−i−1 is minimizing
in (Σi+1 × T n−i−1 , ĝi+1 ) with respect to variations that are independent of
the T n−i−1 factor. We let g̃i denote the metric on Σi × T n−i−1 induced by
(Σi+1 × T n−i−1 , ĝi+1 ), so that
n−1
g̃i := gi + u2p dt2p ,
p=i+1
where (ti+1 , . . . , tn−1 ) are the coordinates on the torus T n−i−1 . Since there
is a lot of notation to keep track of, the main thing to keep in mind is that
(Σi × T n−i−1 , g̃i ) → (Σi+1 × T n−i−1 , ĝi+1 )
is a minimal embedding with respect to variations independent of the T n−i−1
factor. If we let Ãi denote the second fundamental form of this embedding,
then applying the stability inequality (2.16) to this embedding of warped
products above, we see that the stability of Σi with respect to Vρi+1 translates
to the statement that
1
(2.20) |∇ϕ| + (R̃i − R̂i+1 − |Ãi | )ϕ ρi+1 dμi ≥ 0
2 2 2
Σi 2
for all ϕ ∈ C ∞ (Σi ). Here R̃i and R̂i+1 denote the scalar curvatures of g̃i
and ĝi+1 , respectively. The corresponding stability operator on Σi is then
1
(2.21) Li = −Δ̃i + (R̃i − R̂i+1 − |Ãi |2 ),
2
where Δ̃i denotes the Laplacian with respect to g̃i (for functions indepen-
dent of the T n−i−1 factor), so that ui is the principal eigenfunction of this
operator Li . The stability tells us that Li ui ≥ 0.
52 2. Minimal hypersurfaces
Our next task is to use the stability inequality above involving R̃i and
R̂i+1 to show that Σi is Yamabe positive. We first observe that the con-
struction gives us the following.
Claim.
R̂i+1 ≥ Rg ≥ 0.
Claim.
−1/2 1/2
R̃i ≤ Ri − 4ρi+1 Δi ρi+1 .
where the gradients are all with respect to gi . Iterating this for j from i + 1
to n − 1, we obtain
n−1
R̃i = Ri − 2 u−1
j Δi uj − 2 ∇ log up , ∇ log uq .
j=i+1 i<p<q<n
2.3. Minimizing hypersurfaces and positive scalar curvature 53
Next observe that, in general, u−1 Δu = Δ log u + |∇ log u|2 . Using this and
rewriting the sum over i < p < q < n, we obtain
n−1
R̃i = Ri − 2 Δi log uj + |∇ log uj |2
j=i+1
2
n−1
n−1
(2.22)
−
∇ log uj + |∇ log uj |2
j=i+1 j=i+1
Although the alternative proof above is more complicated than the orig-
inal proof, it has the benefit that it is easier to see which positive terms have
been thrown away in the course of the argument; therefore, it suggests how
one can alter the construction to take advantage of those positive terms.
Specifically, when we wrote down (2.23), we threw away the |Ãi |2 term, and
in inequality (2.22), we threw away some |∇ log uj |2 terms. In light of these
facts, Schoen and Yau defined the operator
1
n−1
3
Li := Li + |Ãi | +
2
|∇ log up |2 + |Ãp |2 ,
8 8n
p=i+1
54 2. Minimal hypersurfaces
Proof. Assume the hypotheses of the theorem, and let ν be a global unit
normal on Σ. For each smooth function u on Σ, we consider the image hy-
persurface Σ[u] of Σ under the map Fu (x) = expx (u(x)ν). All hypersurfaces
that are close to Σ = Σ[0] in the smooth sense can be parameterized by
functions u that are close to zero.
For α ∈ (0, 1), consider the map
defined by
1
Ψ(u, s) = Fu∗ HΣ[u] − s, u dμΣ ,
|Σ| Σ
where Fu∗ HΣ[u] is the mean curvature scalar of the image surface Σ[u], pulled
back to the original surface Σ. Here, Ψ(u, s) is technically only defined for
sufficiently small u ∈ C 2,α (Σ), and one can check that the image lies in
56 2. Minimal hypersurfaces
C 0,α (Σ) × R. By Exercise 2.13 and Proposition 2.36, we can compute the
linearization of Ψ to be
1
DΨ|(0,0) (u, s) = −ΔΣ u − s, u dμΣ .
|Σ| Σ
It is well known that the kernel of the ΔΣ is the constants and that the image
is the orthogonal complement of the constants (Theorem A.8), and from this
it follows that DΨ|(0,0) is an isomorphism. (Note that this explains why we
augment the mean curvature operator with the extra parameter s.) Hence
we can invoke the inverse function theorem (Theorem A.43) to see that
there exist > 0 and a smooth map (v, H) : (−, ) −→ C 2,α (Σ) × R such
that Ψ(v(t), H(t)) = (0, t) for all t ∈ (−, ). The equation Ψ(v(t), H(t)) =
(0, t) means each surface Σt := Σ[v(t)] has constant mean curvature H(t)
(hence our choice of variable name), and differentiating this equation at
∂
t = 0 shows that ∂t t=0
v(t) = 1 (since it must be in the kernel of ΔΣ and
have average equal to 1). This means that the surfaces Σt form a smooth
foliation of constant mean curvature hypersurfaces for small values of t. In
particular, by taking smaller if necessary, a tubular neighborhood of Σ can
be identified with Σ × (−, ), via the map Ψ : Σ × (−, ) −→ R given by
Ψ(x, t) = Fv(t) (x). Under this diffeomorphism, we can rewrite the metric as
g = ht + ϕ2t dt2 ,
1
H (t) = −ΔΣt ϕt + (RΣt − RM − |AΣt |2 − H(t)2 )ϕt
2
1
≤ −ΔΣt ϕt + (2KΣt − κ)ϕt .
2
2.4. More scalar curvature rigidity theorems 57
Using our estimates on ϕt and |Σt |, it follows that H (t) 14 ≤ − 12 κH(t0 )4t,
or more simply,
Thus
t1
H(t1 ) = H(t0 ) + H (s) ds ≤ H(t0 ) − 8κH(t0 )2 = (1 − 8κ2 )H(t0 ),
t0
which is a contradiction for small enough, since H(t1 ) > 0 > H(t0 ).
Case 3: κ < 0. Again suppose that H(t1 ) > 0 for some value of t1 > 0,
but now choose t0 to be the time achieving the maximum of H(t) over
[0, t1 ]. Then because of the reversed sign on κ, we obtain the exact same
58 2. Minimal hypersurfaces
Proof. Our presentation here uses an elegant argument taken from Gal-
loway in [Gal18] rather than following the approach in [Cai02]. Assume
the hypotheses of the theorem, including the assumption that Σ is not
Yamabe positive. We begin the proof in the exact same way as in the proof
of Theorem 2.38. Specifically, we construct a map Ψ : Σ × (−, ) −→ R
such that under this diffeomorphism, we can rewrite the metric as
g = ht + ϕ2t dt2 ,
where ϕt > 0 and each slice Σt = Σ × {t} has constant mean curvature H(t).
As before, we have the equation
1
H (t) = LΣt ϕt = −ΔΣt ϕt + (Rht − RM − |AΣt |2 − H(t)2 )ϕt ,
2
where ht is the induced metric on Σt , and LΣt is the stability operator for
Σt , as defined in equation (2.15).
As in the proof of Theorem 2.38 we would like to prove that H (t) ≤ 0
for all t ∈ (0, ). Suppose there exists some small τ such that H (τ ) > 0.
Then LΣτ ϕτ > 0. Then there is a number c > 0 such that
LΣτ ϕτ ≥ cϕτ .
0 < c ≤ λ1 (LΣτ ) .
60 2. Minimal hypersurfaces
For the proof, see Theorem A.11. Recall from Exercise 2.26 that
1
λ1 (LΣτ ) ≤ λ1 (Lhτ ) ,
2
where Lhτ denotes the conformal Laplacian. So we have 0 < λ1 (Lhτ ), which
implies that Σ is Yamabe positive (Corollary 2.27). As this contradicts our
original assumption, it proves that H (t) ≤ 0 for all t ∈ (0, ), and the rest
of the proof proceeds exactly as in Theorem 2.38.
Since the minimal spheres here are not stable, the nature of the proof is
quite different from the other ones described in this section, and it requires
the use of so-called min-max methods.
It is natural to wonder whether there is a hyperbolic analog of Corol-
lary 2.32.
The spin case was proved by work of Maung Min-Oo [MO89] together
with that of Lars Andersson and Mattias Dahl [AD98]. Theorem 2.44 is
now seen as a special case of the positive mass theorem for asymptotically
hyperbolic manifolds, which was proved for spin manifolds by Xiaodong
Wang [Wan01] and improved by P. Chruściel and M. Herzlich [CH03].
The n < 8 case of Theorem 2.44 was proved by L. Andersson, M. Cai,
and G. Galloway [ACG08] as a stepping stone toward a restricted version
of the positive mass theorem for asymptotically hyperbolic manifolds. A
version of the Penrose inequality (Conjecture 4.12) has also been conjectured
for asymptotically hyperbolic spaces (see [LN15, Amb15] for some partial
results).
Theorem 2.44 led Min-Oo to conjecture that a similar statement might
be true for spherical geometry. That is, given a Riemannian manifold
(M n , g) with Rg ≥ n(n − 1) such that there is a compact set K ⊂ M
with (M K, g) isometric to a standard spherical region S n Br (p) for
some r > π/2 and p ∈ S n , does this imply that (M, g) is isometric to the
standard round unit sphere? The point of having r > π/2 is that we are
only allowing the standard S n to be changed within one hemisphere. It is
easy to see that the conjecture fails if we allow r < π/2. (Imagine a coun-
terexample as an exercise.) When n = 2, the answer was known to be yes by
classical work of Toponogov [Top59]. If we upgrade the hypothesis to the
much stronger assumption Ricg ≥ (n−1)g in place of Rg ≥ n(n−1), Fengbo
Hang and Xiaodong Wang showed that the answer is yes [HW09]. (In this
paper they also give a nice proof of the Toponogov result.) It was somewhat
surprising when counterexamples to Min-Oo’s conjecture were discovered for
n ≥ 3.
62 2. Minimal hypersurfaces
The Riemannian
positive mass theorem
3.1. Background
3.1.1. The Schwarzschild metric. In the study of partial differential
equations, it is often instructive to understand solutions with many symme-
tries. For example, looking for spherically symmetric solutions of Laplace’s
equation leads to the discovery of the fundamental solution. Here we will
look at an analog of the “fundamental solution” for the constant scalar cur-
vature equation.
Let g be a spherically symmetric metric. In other words, suppose that g
is a warped product of a line with the standard (n − 1)-sphere. Explicitly, if
we use the notation dΩ2 to denote the standard unit sphere metric on S n−1 ,
we consider metrics of the form
g = ds2 + r(s)2 dΩ2
for some positive function r(s). Note that dr
ds = 0 corresponds to the places
where the symmetric spheres are minimal (in fact, totally geodesic), and any
region where r(s) is constant corresponds to g being cylindrical (that is, the
Riemannian product of an interval with a sphere). Wherever dr ds is nonzero,
we can use r as a coordinate and rewrite the metric as
dr2
g= + r2 dΩ2
V (r)
for some positive function V (r). Let us focus our attention on such metrics
since the general case can always be broken up into pieces like this, plus
cylindrical pieces.
63
64 3. The Riemannian positive mass theorem
In light of the above, the prescribed scalar curvature equation for spher-
ically symmetric spaces is a nonhomogeneous linear ordinary differential
equation in V .
Exercise 3.2. Let κ be a constant. Use the previous exercise to show that
the only spherically symmetric metrics with constant scalar curvature κ are
(up to diffeomorphism) of the form
dr2
g= + r2 dΩ2
V (r)
with
2m κ
V (r) = 1 − − r2 ,
r n−2 n(n − 1)
where m is some constant parameter.
Exercise 3.3. Show that there exists a radial coordinate ρ such that gm
takes the form
4
gm = [u(ρ)] n−2 (dρ2 + ρ2 dΩ2 ),
where
m
u(ρ) = 1 + .
2ρn−2
1
Verify that this formula is correct, and that the region where ρ > ( m
2)
n−2
1
corresponds to where r > (2m) n−2 . (This is a bit tedious, but it is essentially
just single variable calculus.)
Note that since the new expression for gm makes sense for all ρ > 0,
we now have a metric on all of (0, ∞) × S n−1 . Show that the map ρ →
2
(m n−2 ρ−1 defines an isometry on ((0, ∞) × S n−1 , g ). Use this to help
2) m
explain why ((0, ∞) × S n−1 , gm ) is complete.
Observe that if we switch to rectangular coordinates x1 , . . . , xn on Rn
with ρ = |x|, we see that gm can be written as a metric on Rn {0} via
4
(gm )ij = [u(x)] n−2 δij ,
66 3. The Riemannian positive mass theorem
where
m
u(x) = 1 + 2|x|n−2
.
These coordinates are often called isotropic coordinates for the Schwarzschild
metric in the physics literature.
From now on, for any m > 0, we will refer to the above Riemannian
manifold ((0, ∞)×S n−1 , gm ) as the Schwarzschild space of mass m. From the
exercise above, we can see that this space is indeed a complete, noncompact
manifold with two ends that is scalar-flat everywhere. Moreover, we can see
1
that the sphere at ρ = ( m2)
n−2 is totally geodesic (and therefore minimal).
As one would expect, the Schwarzschild space of mass m has two ends,
each of which is asymptotically Schwarzschild with mass m, according to
this definition. (Check this.)
Proof. First observe that V must be a radial function. (Can you prove
this?) Therefore outside the support of ρ, V is a radial harmonic function,
and every radial harmonic function that decays at infinity is of the form
V (x) = − |x|
m
for some constant m. (This is an easy fact to check. See
Section A.1.4.)
Finally,
1
ρ(x) dx = ΔV dx
R3 R3 4π
1 ∂V
= lim dμSr
r→∞ S 4π ∂r
r
1 m
= lim dμSr (x)
r→∞ S 4π |x|2
r
= m.
Here is one way to derive it (which we note is not the original motivation).
If the theory were linear (that is, if Rg were a linear
operator of g), then
1
it would be true that the total mass should be 16π M Rg dμg . In the case
when (M, g) is very close to the Euclidean background metric (R3 , ḡ), the
scalar curvature is approximately linear. That is, the scalar curvature can
be approximated by its linearization at the Euclidean background metric
ḡij = δij . That is,
Rg ≈ DR|δ (g − ḡ).
In this case it is reasonable to define mass to be
1
(3.5) m:= DR|ḡ (g − ḡ) dμ
16π R3
1
= (−Δ(tr g) + div(div g)) dμ
16π R3
1
= lim (div g − d(tr g))(ν) dμSr
r→∞ 16π S
r
3
1 xj
= lim (gij,i − gii,j ) dμ ,
r→∞ 16π S
r
|x| Sr
i,j=1
The barred quantities are all quantities computed using the Euclidean back-
ground metric determined by the asymptotically flat coordinate chart Φk
used in Definition 3.5. The Sρ refers to the coordinate sphere of radius ρ,
dμSρ is its volume measure induced by the Euclidean metric, ν is its Eu-
clidean outward normal vector, and ωn−1 is the volume of the unit (n − 1)-
sphere.1 More explicitly, we can write the more commonly used coordinate
expression
n
1 xj
(3.7) mADM (Mk , g) = lim (gij,i − gii,j ) dμ .
ρ→∞ 2(n − 1)ωn−1 S
ρ
|x| Sρ
i,j=1
Let us also take a moment to think about what this mass is, without
regard to physical reasoning. The integral of the scalar curvature Rg can-
not be written as a flux integral at infinity, because the operator Rg is not
in divergence form. However, the linearization of Rg near the Euclidean
metric is a divergence (or in other words, Rg is a divergence plus higher-
order terms). The mass is defined to be the flux integral at infinity that
corresponds to this divergence. From this perspective, although the geo-
metric content of the mass is far from clear, one can see that it is intimately
connected to the partial differential operator Rg , when expressed using the
Euclidean background.
Exercise 3.10. Prove that the ADM mass of any end of an asymptotically
flat manifold exists and is finite. (Hint: Use Exercise 1.18, the divergence
theorem, and integrability of the scalar curvature.) Moreover, your argu-
ment should show that, given the other hypotheses of asymptotic flatness,
the integrability of scalar curvature is actually equivalent to the ADM mass
being well-defined and finite. Furthermore, observe that this proof shows
that the Sρ in formula (3.6) can be replaced by any family of surfaces Σρ
that “exhausts” the end Mk . (Here, we interpret ν as the outward normal
of Σρ .)
that appear in the definition of mass. Our convention was chosen in order to be consistent with
the fairly simple appearance of the mass parameter m in our definition of the Schwarzschild metric
in (3.1).
3.1. Background 73
This sort of formula for mass in terms of curvature goes back to A. Ash-
tekar and R. O. Hansen [AH78]. We present a fairly simple proof of Theo-
rem 3.14, essentially due to Pengzi Miao and Luen-Fai Tam [MT16]. It can
also be proved (at least, for a sequence of spheres) using a density argument
(see Lemma 3.48), as explained by Lan-Hsuan Huang [Hua12].
Remark 3.15. The asymptotic decay of gij − δij implies that ν and dμΣi
can be replaced by ν and dμΣi in the theorem above.
Remark 3.16. If (M n , g) has asymptotic decay rate q = n − 2, then The-
orem 3.14 actually implies a coordinate independent formula for the mass:
−1
mADM (Mk , g) = lim ρG(ν, ν) dμSρ (p)
ρ→∞ (n − 1)(n − 2)ωn−1 S (p)∩M
ρ k
We now state the positive mass theorem—a theorem that was conjec-
tured as soon as the concept of ADM mass was formulated back in 1961.
Since there is a more general version for initial data sets, the modifier Rie-
mannian is useful in order to avoid confusion, but in the context of Part 1
of this book, there is no risk of ambiguity.
Theorem 3.18 (Riemannian positive mass theorem). Let (M, g) be a com-
plete asymptotically flat manifold with nonnegative scalar curvature. Then
the ADM mass of each end of M is nonnegative.
Schoen and Yau first proved the three-dimensional case in 1979 [SY79c,
SY81a], and they soon saw how to generalize their result to dimensions less
than 8 in [SY79a]. (See also [Sch89].) However, the higher-dimensional
cases were stymied by the problem of singularities of minimal hypersurfaces
that we described in Chapter 2. As mentioned earlier, it is now understood
76 3. The Riemannian positive mass theorem
that the positive mass theorem is a consequence of Theorem 1.30 and this
implication is what we will explain later in the chapter. As mentioned
earlier, the higher-dimensional cases of Theorem 1.30 have been treated in
recent preprints of Schoen and Yau [SY17] and Lohkamp [Loh06, Loh15c,
Loh15a, Loh15b]. Meanwhile, in 1981 E. Witten discovered a proof that
works for all spin manifolds [Wit81]. We will discuss the proof in detail
in Chapter 5. Somewhat surprisingly, the spinor proof of the positive mass
theorem is actually simpler than the spinor proof of Theorem 1.30.
The case of zero mass is usually described as part of the positive mass
conjecture, but we prefer to think of it as a separate statement which follows
from the positive mass theorem (Theorem 3.18).
The fact that this is a direct consequence of Theorem 3.18 was essentially
proved by Schoen and Yau in [SY79c]. Witten’s spinor method also yields
the rigidity result, but only in the spin case [Wit81].
Given the above rigidity result, one might wonder about the correspond-
ing stability question. That is, if we have a sequence of complete asymptot-
ically flat manifolds with nonnegative scalar curvature whose masses con-
verge to zero, can we say that these Riemannian manifolds are approaching
Euclidean in some weak sense (given some scale-fixing assumptions)? This
question is rather subtle and essentially wide open, but various related re-
sults can be found in [Cor05, Lee09, LS14, LS12, LS15, HL15, HLS17,
SSA17, BF02, FK02, Fin09]. See Theorem 4.67.
Technically, the form of the metric g given in the statement of the propo-
sition implicitly assumes that none of the symmetric spheres around the
origin are minimal. The proof can be easily adapted when minimal surfaces
are present, as we will see in Proposition 4.20.
1 n−2
mADM (g) = lim r (1 − V (r)).
r→∞ 2
If the ADM mass is actually zero, then we see that the nondecreasing func-
tion rn−2 (1 − V (r)) must be identically zero, which means that V (r) is
identically 1, and thus g is Euclidean.
3.2.2. Conformally flat case. The next case we consider is the globally
conformally Euclidean case.
78 3. The Riemannian positive mass theorem
2
= −Δu dμ
(n − 2)ωn−1 Rn
2 n − 2 n−2n+2
= u Rg dμ.
(n − 2)ωn−1 Rn 4(n − 1)
It is now clear that Rg ≥ 0 implies mADM (g) ≥ 0. To see the rigidity,
observe that if we furthermore have mADM (g) = 0, then Δu vanishes, and
hence u is a harmonic function on all of Rn . Since it approaches 1 at infinity,
it must then be identically equal to 1 by the maximum principle.
Notice that one nice feature of the globally conformally Euclidean setting
is that the scalar curvature operator becomes a partial differential operator
on the function u, rather than on a tensor g. We can see that if we linearize
this operator at u = 1, it tells us that for u ≈ 1, we have Rg ≈ − 4(n−1)
n−2 Δu.
So we see that if we set V = 2(1 − u), then we obtain a “Newtonian limit”
as u gets closer to 1. That is, when n = 3, the equation Rg = 16πρ with
u(∞) = 1 from Section 3.1.3 indeed (informally) reduces to ΔV = 4πρ with
V (∞) = 0 as u becomes closer to the constant function 1.
3.2.3. Graphical case. The next case we consider is the case of graphical
hypersurfaces of Euclidean space, due to Mau-Kwong George Lam. The
rigidity was proved by Lan-Hsuan Huang and Damin Wu.
Theorem 3.23 (Lam [Lam11], Huang-Wu [HW13]). Let f : Rn −→ R
be a smooth function such that limx→∞ f (x) is either a constant or ∞. Let
M be the graph of f in Rn+1 , and let g be the metric on M induced by the
Euclidean metric on Rn+1 . Assume that fi fj = O2 (|x|−q ) for some q > n−2
2 ,
3.2. Special cases of the positive mass theorem 79
The key to this proof is the fact that the scalar curvature can be written
as a divergence. This was first observed by Robert Reilly [Rei73].
Lemma 3.24. Let f : Rn −→ R be a smooth function, let M be the graph
of f in Rn+1 , and let g be the metric on M induced by the Euclidean metric
on Rn+1 . Using the coordinates x → (x, f (x)) as coordinates on M , we have
n
fii fj − fij fi
Rg = ∂j ,
1 + |∂f |2
i,j=1
n
fi fj fi fj
= ∂j − ,
w i w w ij w
i,j=1
n
fi fj
|A|2 = tr(S 2 ) =
w j w i
i,j=1
n
fi fj fi fj
= ∂i −
w j w w ji w
i,j=1
n
fj fi fi fj
= ∂j − ,
w i w w ij w
i,j=1
80 3. The Riemannian positive mass theorem
n
fi fj fj fi
Rg = H − |A| =
2 2
∂j −
w i w w i w
i,j=1
n
(fii w − fi wi )fj − (fji w − fj wi )fi
= ∂j
w3
i,j=1
n
fii fj − fij fi
= ∂j .
w2
i,j=1
We claim that the expression on the right is just the ADM mass, up to a
positive constant. The integrand in the ADM mass expression is
xj
(gij,i − gii,j ) = [(fi fj )i − (fi fi )j ]ν j = (fii fj − fij fi )ν j ,
|x|
which is the same as the integrand in (3.8) except for the factor of 1 + |∂f |2 ,
which is 1 + O(|x|−q ) by hypothesis.
The rigidity part of the argument is more complicated, so we merely
outline the idea. Suppose that the mass is zero. We will show that f must
be constant. If it is not, then by Sard’s Theorem, there is a smooth level
set Σ := f −1 (c), where c < limx→∞ f (x). If one applies the divergence
theorem argument above but also uses Σ as an inner boundary, one obtains
the formula
|∂f |2
(3.9) 2(n − 1)ωn−1 mADM (M, g) = H dμΣ +
2 Σ
Rg dμ,
Σ 1 + |∂f | f (x)>c
3.2. Special cases of the positive mass theorem 81
Proof. The proof begins by choosing cylindrical coordinates (ρ, ϕ, z), where
x = ρ cos ϕ and y = ρ sin ϕ. We consider metrics of the following form:
for some functions U , α, A, and B, which are all independent of the angle
variable ϕ. Moreover, assume α vanishes on the z-axis. Clearly, this metric
∂
is invariant under rotations around the z-axis. In particular, ∂ϕ is a Killing
field. It is a nonobvious, nontrivial fact that any axisymmetric metric on
R3 can be written in the above form, after suitable choice of coordinates.
For a rigorous proof of this fact (which involves construction of isothermal
coordinates), see the work of Chruściel [Chr08, Theorem 2.7], who extended
Brill’s result to simply connected axisymmetric manifolds with more than
one end. A strong enough asymptotic flatness assumption will guarantee
that the functions U , α, A, and B and their derivatives have decay rates
strong enough for the argument below to go through.
2 Technically, this theorem requires appropriate decay of five derivatives of the metric rather
Once we have the above form for the metric, the proof follows a fairly
straightforward divergence theorem argument. However, we will only pro-
vide an outline, because the computations are rather involved. The inter-
ested reader may wish to supply the details. (There is an active literature
on axisymmetric metrics, and if one wishes to explore that field, then repro-
ducing these computations would be a worthwhile exercise.)
First, we try to compute the ADM mass in terms of the functions U , α,
A, and B. To do this, first take the above form of the metric and write it
in x, y, z coordinates so that we can use our formula (3.7) for ADM mass.
Recall that we can use any exhaustion to compute the mass, so it makes
sense to use a cylinder. For each r, let Cr denote the cylindrical region
where −r < z < r and ρ < r. Let Wr denote the lateral boundary of this
region where −r < z < r and ρ = r. After many computations, one can
show that
1 1 1
m = lim ∂(U − α) · ν dμCr + α dμWr .
r→∞ 4π ∂Cr 2 2 Wr
Since α vanishes on the z-axis, we can use the divergence theorem to obtain
1 1 1 2π r r ∂α
m = lim Δ(U − α) dμ + dρ dz dϕ
r→∞ 4π Cr 2 2 0 −r 0 ∂ρ
1 1 1 ∂α
= lim Δ(U − α) + dμ.
r→∞ 4π C 2 2ρ ∂ρ
r
Trudinger had already showed that the Yamabe problem could be solved
on conformal classes that do not contain positive scalar curvature metrics.
Let (M, g) be a compact Riemannian manifold with positive scalar curva-
ture. Then for any p ∈ M , one can show that there exists a positive Green
84 3. The Riemannian positive mass theorem
As stated above, we omit the proof and only mention that the proof
involves a careful study of the developing map.
Exercise 3.28. For readers who are knowledgeable about Kähler geometry:
let n > 1, and let (Cn , J, g) be a complete Kähler manifold that is also
asymptotically flat (of real dimension 2n), where J is the standard complex
3.2. Special cases of the positive mass theorem 85
Thus the cone angle (or perhaps 1 minus the cone angle) plays a role
analogous to that of the mass in higher dimensions.
86 3. The Riemannian positive mass theorem
Proof. Let Mρ be the compact region whose boundary ∂Mρ is the union of
the spheres {r = ρ} in each end. Invoking the Gauss-Bonnet Theorem for
Mρ , we obtain
Kg dμg = 2πχ(Mρ ) − κ ds.
Mρ ∂Mρ
By the implicit assumption of connectedness, 2πχ(Mρ ) ≤ 2π. Meanwhile,
the asymptotically conical assumption implies that for ∂Mρ , κ = ρ1 +
O(ρ−q−1 ), and the length of ∂Mρ is 2πρ k αk + O(ρ1−q ), where we sum
over the cone angles αk of all ends. Therefore
Kg dμg = 2π 1 − αk + O(ρ−q ).
Mρ k
Taking the limit as ρ → ∞, we obtain k αk ≤ 1 as desired. (Actually,
note that for this conclusion, we do not require a sign on Kg but only the
integral of Kg .)
In the case where one of the ends
has cone angle 1, we immediately see
that it must be the only end, and M Kg dμg = 0. But this is only possible if
Kg = 0 identically, which means that M is flat. Therefore (M, g) is flat and
has one planar end. Then (M, g) must be Euclidean by the same argument
used in Exercise 2.33.
the rest of this section we will gloss over this point, but in Section 3.3.3, we
will provide an alternative proof of the positive mass theorem that obviates
the need for this assumption.
−2
= lim ∇u · ν dμ(Sρ ,g)
ρ→∞ (n − 2)ωn−1 S
ρ
−2
= Δg u dμg
(n − 2)ωn−1 M
−2 n−2
= Rg u dμg
(n − 2)ωn−1 M 4(n − 1)
< 0,
where we used the divergence theorem in the third line and the defining
equation for u in the next one. This completes the proof for the case of one
end.
Note that if we directly apply this argument to the case of multiple
ends, it only shows that the sum of the masses of the ends goes down.
Instead, choose > 0 small enough so that the region {x | u(x) > 1 − } is
entirely contained in the asymptotically flat ends. By Sard’s Theorem [Wik,
Sard’s theorem], we can choose so that the level set {x | u(x) = 1 − } is
smooth. Let Σk denote the component of that level set in the end Mk . Then
we can apply the same argument as above to the region {x ∈ Mk | u(x) >
3.3. Reduction to Theorem 1.30 89
In light of the previous lemma, the general case of the positive mass
theorem (Theorem 3.18) follows from the scalar-flat case. Our next step is
to reduce to the harmonically flat case.
Definition 3.33. Let n ≥ 3. We say that (M n , g) is harmonically flat
outside a bounded set if there exists a bounded set K such that M K is a
finite union of ends M1 , . . . , M such that for each Mk , there exist an rk > 0
and a diffeomorphism
Φk : Mk −→ Rn B̄rk (0),
such that in this coordinate chart, we have
4
gij (x) = uk (x) n−2 δij ,
where uk is harmonic with respect to the Euclidean metric, and uk (x) → 1
as x → ∞.
2,p
for some C independent of λ > λ0 , for all w ∈ W−q .
Therefore we can solve Lgλ vλ = −Rgλ for large λ. Since q is smaller
than the asymptotic decay rate of g, we can check that Rgλ → 0 in Lp−q−2 .
2,p
Then the injectivity estimate (3.11) implies that vλ → 0 in W−q . Elliptic
regularity (Theorem A.4) ensures that vλ is smooth, and since p > n/2,
weighted Sobolev embedding (Theorem A.25) implies that vλ → 0 in C−q 0 .
The following lemma shows that if we choose p > n and q > n−2
2 , then
the ADM mass of the metric g̃ constructed in the previous lemma can be
chosen to be arbitrarily close to that of g.
Proof. Assume the hypotheses of the lemma. Consider fixed ρ0 > 1 and
let ρ > ρ0 . We use the notation Sρ − Sρ0 to mean Sρ ∪ Sρ0 , oriented so that
Sρ has outward normal while Sρ0 has inward normal. Then
(div g − d(tr g))(ν) dμSρ = [−Δ(tr g) + div(div g)] dμ
Sρ −Sρ0 ρ0 <|x|<ρ
= (Rg − Q(g)) dμ,
ρ0 <|x|<ρ
2,p
Since p > n, the Wloc convergence of gi to g implies (via Sobolev embed-
1 . In particular,
ding [Wik, Sobolev inequality]) that gi converges to g in Cloc
the flux integral at Sρ0 vanishes in the limit. Since p > n and q > n−2 2 , the
2,p
W−q convergence of gi to g also implies that Q(gi ) converges to Q(g) in L1 .
(Check this.) And finally, we have the explicit hypothesis that Rgi converges
to Rg in L1 . Therefore the integral over the region |x| > ρ0 also vanishes in
the limit.
92 3. The Riemannian positive mass theorem
Exercise 3.36. In the previous lemma, show that we can replace the hy-
pothesis that Rgi converges to Rg in L1 by the hypothesis that Rgi is uni-
formly bounded in L1−n−δ for some δ > 0. Hint: Deal with the problematic
term using a weighted Hölder inequality and choosing ρ0 large.
The previous exercise shows that the ADM mass is continuous in the
2,p
W−q topology, but only after being restricted to a family whose scalar cur-
vatures are uniformly bounded in L1−n−δ for some δ > 0.
Exercise 3.37. Prove that the ADM mass is NOT continuous as a function
of asymptotically flat metrics on a fixed exterior coordinate chart, in the
2,p
W−q topology, where p > n and q > n−2
2 . Hint: Construct a counterexample
sequence by considering metrics that are globally conformal to Euclidean
space.
Remark 3.38. If gi is a sequence of complete asymptotically flat metrics
converging to a limit asymptotically flat metric g uniformly on compact sub-
sets of a 3-manifold M , such that each (M, gi ) has nonnegative scalar curva-
ture and contains no minimal surfaces, then mADM (g) ≤ lim inf i→∞ mADM (gi ).
This is a very different sort of theorem than Lemma 3.35. While Lemma 3.35
is just a fact about asymptotics, the result mentioned here is a theorem
about the nature of nonnegative scalar curvature. In fact, the positive mass
theorem is a simple corollary of this result. See [Jau18, JL16, JL19].
Combining the previous three lemmas (Lemmas 3.31, 3.34, and 3.35),
we see that if there is a counterexample to the positive mass theorem (The-
orem 3.18), then there must exist a scalar-flat counterexample that is also
harmonically flat outside a compact set. Explicitly, given a complete asymp-
totically flat metric on a manifold M with nonnegative scalar curvature and
negative mass, Lemma 3.31 allows us to produce a scalar-flat example with
negative mass, then Lemma 3.34 allows us to find a small perturbation of
it that is scalar-flat and harmonically flat outside a compact set, and finally
Lemma 3.35 tells us that a small enough perturbation will still have negative
mass.
The original proof of Schoen and Yau constructed a complete stable
minimal hypersurface inside this potential counterexample and then used a
noncompact version of the argument in Proposition 2.25 to contradict the
positive mass theorem in one lower dimension (or the Gauss-Bonnet Theo-
rem in the base case of dimension 3). However, later work of J. Lohkamp
provides a much simpler argument, reducing the positive mass theorem in
the harmonically flat case to the case where the metric is actually Euclidean
outside a compact set, which is the case where Corollary 2.32 applies.
Lemma 3.39 (Lohkamp [Loh99]). Suppose there exists a counterexample
to the positive mass theorem (Theorem 3.18) on M that is harmonically flat
3.3. Reduction to Theorem 1.30 93
outside a compact set. So there is at least one end that has negative mass.
Let M − pt be the manifold obtained by taking the one-point compactifica-
tion of every other end. Then there exists a metric of nonnegative scalar
curvature on M − pt that is exactly Euclidean on its one noncompact end,
but not scalar-flat.
Proof. Let (M, g) be complete and harmonically flat outside a compact set
such that g has nonnegative scalar curvature and negative mass in at least
one of its ends. First we would like to reduce to the case of one end. We first
close up all of the ends except for one end (call it M1 ) that is assumed to
have negative mass. We do this by choosing a g-harmonic conformal factor
w such that w tends to 1 at the infinity of M1 , and 0 at the infinities of all
other ends. (Such a function can be constructed using Theorem A.40, for
example.) By the maximum principle, 0 < w < 1 everywhere, so we can
4
use it as a conformal factor to define a new function g̃ = w n−2 g. By equa-
tion (1.8), g̃ still has nonnegative scalar curvature and is still harmonically
flat in the end M1 . Using the same argument that was used in the proof of
Lemma 3.31, we can use the maximum principle and Exercise 3.13 to see
that mADM (g̃) < mADM (g) in the end M1 . Meanwhile, all other ends have
been metrically compactified, and we can see that the missing points can
be smoothly filled in because of the harmonic flatness assumption. (Specif-
ically, after a Kelvin transform, we can see that the neighborhood around
the missing point is conformally Euclidean via a bounded harmonic confor-
mal factor, and a bounded harmonic function on a punctured disk has a
removable discontinuity.)
This leaves us with a counterexample to the positive mass theorem on
M − pt, which only has one end, and that end is harmonically flat. Without
loss of generality, let us assume that g is such a counterexample. In the
4
exterior coordinate chart, for large |x|, we have gij = u n−2 δij for some
harmonic function u, which (by Corollary A.19) we can expand as
m 2−n
u(x) = 1 + |x| + O(|x|1−n ),
2
where m = mADM (g) < 0, by Exercise 3.13. Our next step is to interpolate
between g and the Euclidean metric over an annulus in such a way that
nonnegative scalar curvature is preserved. Specifically, we will replace u with
a new function ũ that agrees with u for |x| less than some large constant
ρ1 , while ũ is exactly constant for |x| larger than some bigger constant ρ2 .
4
By equation (1.8), nonnegativity of the scalar curvature of g̃ij := ũ n−2 δij is
equivalent to superharmonicity of ũ. Since m < 0, the asymptotic expansion
94 3. The Riemannian positive mass theorem
Combining Lemmas 3.31, 3.34, 3.35, and 3.39, we see that a counterex-
ample to the positive mass theorem (Theorem 3.18) can be used to con-
struct a metric as in the conclusion of Lemma 3.39. Since this contradicts
Corollary 2.32, the proof of Theorem 3.18 is complete. Actually, we do not
really need to invoke Corollary 2.32 directly, since the metric obtained from
Lemma 3.39 can be used to construct a metric of positive scalar curvature
on T n #M as in the proof of Theorem 1.23, which contradicts Theorem 1.30
directly. This completes the proof of the positive mass theorem (Theo-
rem 3.18).
Remark 3.40. Technically, we have only proved Theorem 3.18 with the ad-
ditional hypothesis of Hölder decay of the scalar curvature as in Lemma 3.31.
In Section 3.3.3, we remove this hypothesis.
3.3. Reduction to Theorem 1.30 95
Proof of Theorem 3.19. The proof combines ideas from the proofs of
Theorem 1.23 and Corollary 2.32, but with some added complications. Sup-
pose that (M, g) is a complete asymptotically flat manifold with nonneg-
ative scalar curvature, and assume that the ADM mass of one of its ends
is zero. By Lemma 3.31, g must be scalar-flat, because otherwise we could
find a conformal metric that violates the positive mass theorem. (Techni-
cally, Lemma 3.31 requires a Hölder decay assumption, but we will see in
Section 3.3.3 how to avoid this.)
Our next step is to show that g is Ricci-flat. Although there is a nice
proof of this that uses Ricci flow (see Remark 3.42 below), we will present
Schoen and Yau’s original argument. For simplicity, let us assume for now
that M has only one end. Here we use a Ricci deformation combined with a
conformal change. Let η ≥ 0 be any compactly supported cut-off function.
For small t > 0, let gt := g + tηRicg . Note that since gt equals g outside a
compact set, mADM (gt ) = 0. Next we make a conformal change back to zero
4
scalar curvature. That is, we look for ut such that g̃t = utn−2 gt is scalar-flat.
So we must solve
4(n − 1)
Lgt ut = − Δgt ut + Rgt ut = 0
n−2
Following the same reasoning as in the proof of Lemma 3.31, we can see
that g̃t is a scalar-flat complete asymptotically flat metric, and
2 ∂ut
mADM (g̃t ) = mADM (gt ) + lim − dμSρ
ρ→∞ (n − 2)ωn−1 S ∂r
ρ
2
= 0 + lim −Δgt ut dμSρ
ρ→∞ (n − 2)ωn−1 S
ρ
−1
= Rg ut dμg .
2(n − 1)ωn−1 M t
By the positive mass theorem,
we know that mADM (g̃t ) ≥ 0 is minimized at
d
t = 0. Setting ġ = dt gt t=0 = ηRicg ,
d
0 = 2(n − 1)ωn−1 mADM (g̃t )
dt
t=0
d d
=− Rgt u0 + Rg0 ut dμg
M dt dt t=0
t=0
= η|Ricg |2 dμg ,
M
where we used u0 = 1, Rg0 = 0, the divergence theorem, and the fact that
ġ = ηRicg is compactly supported. Since the choice of η was arbitrary, this
proves that g is Ricci-flat.
Finally, to see why Ricci-flatness implies that (M, g) is Euclidean, we
can use a noncompact analog of the argument used to prove Corollary 2.32.
Instead of invoking the Hodge Theorem for compact manifolds to find har-
monic 1-forms, we instead construct harmonic 1-forms by finding a harmonic
coordinate system y 1 , . . . , y n asymptotic to the original one x1 , . . . , xn . That
is, we want Δg y i = 0, so if we set v i = y i − xi , this is equivalent to solving
Δg v i = −Δg xi . By asymptotic flatness, Δg xi ∈ C−q−1 1 , where q is the as-
ymptotic decay rate (which we take to be less than n − 2). By surjectivity of
the Laplacian (Theorem A.40), there exists a solution v i ∈ C1−q 2 . Thus dy i
is a harmonic 1-form, and since these 1-forms are asymptotic to dxi , they
must form a basis near infinity.
As in the proof of Corollary 2.32 we now invoke the Weitzenböck formula
to see that these 1-forms must be parallel, which implies that g is flat. Then
Exercise 2.33 implies that (M, g) is Euclidean space. Or as an alternative,
one may argue using the Bishop-Gromov comparison theorem as in Exercise
2.34.
3.3. Reduction to Theorem 1.30 97
Now we discuss how to reduce to the case of one end. Suppose there
are multiple ends, one of which, say M1 , has zero mass. In this case, we
can conformally close the other ends using a g-harmonic conformal factor u
approaching 1 at the infinity of M1 and approaching 0 at all of the other
infinities. This has the effect of “conformally closing” the other ends to ob-
tain a new one-ended, asymptotically flat manifold with nonnegative scalar
curvature, as in the proof of Lemma 3.39. Meanwhile, applying the rea-
soning used in the proof of Lemma 3.31 for the case of multiple ends, the
ADM mass of M1 must go down after this conformal change, resulting in a
negative mass end. This is essentially a contradiction to the positive mass
theorem, and thus there can only be one end.
However, there is a small technical problem with this argument: unlike
in Lemma 3.39, using a conformal factor to implement a one-point com-
pactification of an asymptotically flat end results in a metric that is not
necessarily smooth at the point corresponding to infinity. However, the
2,p
metric is still Wloc for some p > n/2, and consequently continuous. (Check
this.) Therefore one should prove that the positive mass theorem still holds
when the metric has isolated singularities of this sort. We will prove this in
Theorem 3.43 below.
Remark 3.42. In order to prove Ricci-flatness of g above, one can use a
Ricci flow argument similar to what was described in Remark 2.31. Let gt be
a Ricci flow with initial condition g. It turns out that Ricci flow preserves
nonnegative scalar curvature, asymptotic flatness, and ADM mass. (See
Section 3.4.) Since gt has zero ADM mass, the first part of the proof above
implies that gt is scalar-flat for all t. Then equation (2.19) implies that g is
Ricci-flat. In many ways, this is a much cleaner and simpler proof, especially
in the case of multiple ends.
Theorem 3.43 (Positive mass theorem with low regularity [GT12]). Let
p > n/2, and let g be a continuous metric on a smooth one-ended manifold
M n such that outside of some compact set K, g is a smooth asymptotically
2,p
flat metric, while g is Wloc over K. If Rg ≥ 0 as a function in Lploc , then
mADM (g) ≥ 0.
This theorem is much more than what is necessary to deal with iso-
lated singular points, but we provide its statement and proof because it is
conceptually not much harder than dealing with an isolated singular point.
Proof. The proof uses the conformal method of [Mia02, Bra01], but it is
quite a bit simpler. The basic idea is to approximate g by a smooth metric
with nonnegative scalar curvature whose mass can be chosen arbitrarily close
to that of g. By applying the usual positive mass theorem to the smooth
approximation, it then follows that mADM (g) must also be nonnegative.
98 3. The Riemannian positive mass theorem
We would like to make a conformal change that removes all of the neg-
ative scalar curvature from g . To do this, we would like to solve
4(n − 1)
− Δg u − (Rg )− u = 0
n−2
with u approaching 1 at each infinity, where (Rg )− denotes the negative
part of Rg (which is defined to be a nonnegative function). Setting v =
u − 1, this is the same thing as solving the equation
4(n − 1)
− Δg v − (Rg )− v = (Rg )− .
n−2
Just as in our proofs of Lemma 3.34 and Theorem 3.19, we will do this by
showing that the operator
4(n − 1) 2,p
(3.12) − Δg − (Rg )− : W−q −→ Lp−q−2
n−2
is an isomorphism, where 0 < q < n − 2. We can do this by showing that
the operator (3.12) is a small deformation of − 4(n−1)
n−2 Δg , which we already
know is an isomorphism by Theorem A.40. In fact, the injectivity estimate
for Δg can be chosen to be independent of , as can the constants in the
Sobolev inequality. While (Rg )− does not smoothly converge to 0 as → 0,
since p > n/2, we really only need it to converge in Lp−2 in order to see that
the operator (3.12) is close to − 4(n−1)
n−2 Δg in the strong operator topology.
(Check this.)
Next, since Rg ≥ 0, we have |(Rg )− | ≤ |Rg −Rg |, and then the previous
exercise implies that the compactly supported function (Rg )− converges to 0
in Lp−2 . Hence, the desired smooth solutions v exist. Moreover, the injectiv-
2,p
ity estimate we get from this argument tells us that v → 0 in W−q , and since
p > n/2, v → 0 in C−q by weighted Sobolev embedding (Theorem A.25).
0
4
In particular, u = 1 + v > 0 for small enough . Define g̃ = un−2 g . Since
u is g-harmonic outside K, it follows from Corollary A.38 and Exercise 3.13
2,p
that g̃ is asymptotically flat, and the W−q convergence of u − 1 implies
3.3. Reduction to Theorem 1.30 99
2,p
that g̃ − g → 0 in W−q . Meanwhile, by equation (1.8), the scalar curvature
of g̃ is
n+2
4(n − 1)
Rg̃ = u n−2
− Δg u + Rg u
n−2
2n
= un−2 ((Rg )− + Rg )
2n
= un−2 (Rg )+ .
Thus Rg̃ is nonnegative everywhere, and we can apply the positive mass
theorem to see that mADM (g̃ ) ≥ 0.
Finally, we must argue that mADM (g) = lim→0 mADM (g̃ ). Since we
are not assuming p > n, we cannot call upon Lemma 3.35. Instead, using
Exercise 3.12, we compute
mADM (g̃ ) − mADM (g) = mADM (g̃ ) − mADM (g )
−2 ∂u
= lim dμSρ
ρ→∞ (n − 2)ωn−1 S ∂r
ρ
−2
= lim ∇u · ν dμ(Sρ ,g )
ρ→∞ (n − 2)ωn−1 S
ρ
−2
= Δg u dμg
(n − 2)ωn−1 M
2 n−2
= (Rg )− u dμg .
(n − 2)ωn−1 K 4(n − 1)
The result now follows from the fact that (Rg )− → 0 in Lp while u → 1
uniformly.
Remark 3.45. Although Theorem 3.43 does not come with the rigidity
statement that if the mass is zero, then (M, g) is Euclidean, the rigidity
should follow from the same Ricci flow argument that was used in [MS12].
2,p
For the case of isolated singularities that still have regularity Wloc for some
p > n/2, we can prove rigidity using the same Schoen-Yau argument used in
the smooth case: we can prove Ricci-flatness away from the singular points
and use that to construct a parallel frame.
this book, we explained (1) first and then proved in this chapter how (1)
implies (2). In this section, we showed how (2) implies (3), and obviously
(1) is a special case of (3). Originally, in dimensions less than 8, Schoen and
Yau started by proving (2) directly, essentially executing a “noncompact”
version of their proof of (1).
Specifically, Schoen and Yau’s original proof in dimension 3 involved
showing that if a complete asymptotically flat manifold of nonnegative scalar
curvature has negative mass, then they can construct a complete stable
two-sided minimal surface inside it and derive a contradiction to the Gauss-
Bonnet Theorem. (We will describe a generalization of this argument in
detail in Chapter 8.) In light of this perspective, one reasonable question is
whether there is an asymptotically flat version of Cai and Galloway’s κ = 0
case of Theorem 2.38. Indeed, we have the following.
Theorem 3.46 (Chodosh-Eichmair [CCE16]). Let (M, g) be a complete
asymptotically flat 3-manifold with nonnegative scalar curvature. If M con-
tains a smooth, noncompact area-minimizing boundary, then M is Euclidean
space.
3.3.3. More density theorems. In our proof of the positive mass theo-
rem, we used a two-step process to move from a general counterexample to
a scalar-flat counterexample to a counterexample that is harmonically flat
outside a compact set, but the following lemma shows that we can jump
3.3. Reduction to Theorem 1.30 101
Proof. The proof starts off the same way as in Lemma 3.34, with the same
2,p
definition of gλ , which converges to g in W−q . The difference is that instead
of trying to make a conformal factor that deforms gλ to a scalar-flat metric
(which would result in a large deformation since g is not already scalar-flat),
we make a conformal change that removes all of the negative curvature, as
in the proof of Theorem 3.43.
That is, we attempt to solve
4(n − 1)
− Δgλ uλ − (Rgλ )− uλ = 0
n−2
with uλ (∞) = 1. As we have done before, set vλ = uλ − 1 so that this is
equivalent to
4(n − 1)
− Δg v − (Rg )− v = (Rg )−
n−2
with vλ (∞) = 0. Next, we assume q > 0 without loss of generality and then
show that
4(n − 1) 2,p
(3.13) − Δgλ − (Rgλ )− : W−q −→ Lp−q−2
n−2
is an isomorphism. By the construction and asymptotic flatness of g, it is
not hard to see that (Rgλ )− → 0 in L∞ −2 . (Check this.) This is good enough
to see that the operator (3.13) is close enough to − 4(n−1)
n−2 Δg (which is an
isomorphism by Theorem A.40) to be an isomorphism for large enough λ
(with an injectivity estimate independent of λ).
Hence, the desired solution uλ exists, and it is smooth by elliptic regular-
2,p
ity. As in the proof of Lemma 3.34, uλ → 1 in W−q ⊂ C−q
0 , and thus u > 0
λ
4
for large λ. Setting g̃λ = uλn−2 gλ , we see that g̃λ is harmonically flat outside
2,p
a compact set by construction and g̃λ → g in W−q . Meanwhile, just as we
102 3. The Riemannian positive mass theorem
2n
computed in the proof of Theorem 3.43, we have Rg̃λ = uλn−2 (Rgλ )+ ≥ 0.
The only thing left to check is that Rg̃λ → Rg in L1 .
Exercise 3.49. Complete the last step of the proof by checking that
2n
uλn−2 (Rgλ )+ → Rg in L1 .
For the proof of the positive mass theorem (Theorem 3.18), we can use
Lemma 3.48 in place of Lemmas 3.31 and 3.34, thereby avoiding the Hölder
decay assumption mentioned in Remark 3.40. We can also remove this
assumption from our proof of positive mass theorem rigidity (Theorem 3.19)
as follows.
Let (M, g) be a complete asymptotically flat manifold with nonnegative
scalar curvature and zero ADM mass in some end Mk . Recall that in the
proof of Theorem 3.19, we invoked Lemma 3.31 to show that g must be
scalar-flat. Here we present an alternative proof of this fact.
Suppose R > 0 is positive somewhere, and let η be a compactly sup-
ported function with 0 ≤ η ≤ 1 and η > 0 somewhere that R > 0. Instead
of solving Lg u = − 4(n−1)
n−2 Δu + Ru = 0, we can instead solve
4(n − 1)
− Δu + ηRu = 0.
n−2
Following the proof of Lemma 3.31, one can see that a smooth positive
solution u exists. Moreover,
4(n − 1)
Rg̃ = u− n−2 Lg u = u− n−2 (− Δu + Ru) = u− n−2 (1 − η)Ru ≥ 0.
n+2 n+2 n+2
n−2
Since u is g-harmonic outside a compact set, Corollary A.38 and Exer-
cise 3.13 implies that g̃ is asymptotically flat. Finally, the same argument
used in Lemma 3.31 shows that mADM (Mk , g̃) < mADM (Mk , g) = 0, contra-
dicting the positive mass theorem. Hence g must be scalar-flat.
Going a step further than Lemma 3.48, not only can we find a nearby
metric with nonnegative scalar curvature that is harmonically flat outside
a compact set, but we can find one that is exactly Schwarzschild outside a
compact set.
Theorem 3.50 (Bray [Bra97]). Let (M n , g) be a complete asymptotically
flat manifold with nonnegative scalar curvature which is harmonically flat
outside some compact set. For any δ > 0 there exists another asymptotically
flat metric g̃ on M such that
• g̃ has nonnegative scalar curvature,
• g̃ is exactly Schwarzschild outside some large radius,
• 0 < mADM (g̃) − mADM (g) < δ,
3.3. Reduction to Theorem 1.30 103
of Theorem 3.51 for asymptotically flat initial data sets [CS06, Theorem 4].
That result allows one to glue initial data satisfying harmonic asymptotics
as in Definition 8.2 to Kerr initial data.
∂
g = −2Ric.
∂t
By the Ricci flow estimates of Wan-Xiong Shi [Shi89], we know that any
complete smooth manifold with bounded curvature can be used as initial
data for Ricci flow. In particular, if (M 3 , g) is a complete asymptotically
flat manifold with nonnegative scalar curvature, we can consider a Ricci flow
of complete metrics gt on M with initial condition g0 = g. Recall that the
scalar curvature R evolves according to equation (2.19):
∂
R = Δg R + 2|Ric|2 .
∂t
Combining this with the parabolic maximum principle (see [Eva10, Chapter
7], for example), it follows that nonnegativity of R is preserved by Ricci flow.
It is also true that asymptotic flatness and ADM mass are both preserved
under Ricci flow, as first observed by Xianzhe Dai and Li Ma [DM07] and
by T. Oliynyk and E. Woolgar [OW07]. (Technically, one must assume a
stronger notion of asymptotic flatness than in Definition 7.17 that involves
decay of higher derivatives of the metric [Li18].) The decay of the metric gt
essentially comes from Shi’s estimates, but integrability of R requires a little
more: we must use the scalar curvature evolution equation above, together
with decay of the |Ric|2 term coming from Shi’s estimates. See [DM07,
Theorem 11, MS12, Lemma 10] for details. To see why the ADM mass is
3.4. A few words on Ricci flow 105
all time. In other words, Li’s result described above applies. Li’s proof that
there are only finitely many surgery times again utilizes the μ-functional,
together with an understanding of the surgeries.
Chapter 4
The Riemannian
Penrose inequality
Lemma 4.1. Let B̄ be a closed ball in Rn−1 , and consider the cylinder
B̄ × R ⊂ Rn equipped with a Riemannian metric g. Let u1 , u2 : B̄ −→ R,
and let Σ[ui ] denote the graph of ui in B̄ × R. Assume that
u1 ≥ u2 ,
HΣ[u1 ] ≥ 0,
HΣ[u2 ] ≤ 0,
where the mean curvature is computed using the upward normal (so that the
mean curvature of a spherical “cap” in Euclidean space is positive). If Σ[u1 ]
and Σ[u2 ] meet anywhere in their interiors or are tangent at any point of
their boundaries, then u1 = u2 identically on all of B̄.
Rough idea of the proof. The basic idea is that if one writes out the
mean curvature operator in local coordinates, one obtains a quasi-linear
107
108 4. The Riemannian Penrose inequality
From this, one can show that the difference u1 − u2 satisfies a linear ellip-
tic inequality to which one can apply the usual strong maximum principle
(Theorem A.2). For the Euclidean case, see [Sch83, Lemma 1]. For fuller
details, see [AGH98, Section 3.1].
Figure 4.1. Theorem 4.6 states that the boundary of the minimizing
hull of Ω is smooth and minimal away from Ω.
The proof is unfortunately outside the scope of this text. See [HI01,
Regularity Theorem 1.3, Tam84] for details.
Theorem 4.7 (Existence and uniqueness of apparent horizons). Let n < 8,
and let (M n , g) be a complete asymptotically flat manifold (possibly with
boundary).
4.1. Riemannian apparent horizons 111
Sketch of the proof. We start with the first statement. We begin by con-
structing a single enclosing minimal hypersurface homologous to a large
coordinate sphere. By asymptotic flatness, the mean curvature of the coor-
dinate sphere of radius ρ is approximately n−1
ρ , and thus the end is foliated
by hypersurfaces with positive mean curvature. Consider the region Ωρ
enclosed by one of these large coordinate spheres Sρ . Next we minimize
volume in the homology class of Sρ in Ωρ . By Corollary 4.3, we obtain a
smooth minimal hypersurface Σ enclosing ∂M . (It must be enclosing and
have multiplicity 1 because it is homologous to Sρ .)
Consider the family F of all enclosed regions in M whose boundaries
are minimal hypersurfaces in M homologous to Sρ . Above, we argued that
F is nonempty. Since the end of M beyond Sρ is foliated by positive mean
curvature, Corollary 4.2 implies that every element of F lies in Ω. For any
Ω1 , Ω2 ∈ F , we claim that there exists Ω ∈ F containing both Ω1 and Ω2 .
If ∂(Ω1 ∪ Ω2 ) is smooth, then it must be minimal, and thus Ω1 ∪ Ω2 ∈
F . Therefore we consider the case where ∂(Ω1 ∪ Ω2 ) contains a singular
set, which must occur where ∂Ω1 meets ∂Ω2 . The intuition here is that
∂(Ω1 ∪Ω2 ) should have nonpositive mean curvature in a “weak” set (visualize
two intersecting planes to see why this should be true), and indeed a result
of M. Kriele and S. Hayward [KH97] implies that it can be smoothed out in
such a way that it has nonpositive mean curvature and encloses ∂(Ω1 ∪ Ω2 ).
Now we apply Corollary 4.3 to produce a new element of F that encloses
Ω1 ∪ Ω2 .
%
The claim above implies that the region Ω∈F Ω can be exhausted by a
single increasing sequence Ωi with each Ωi ⊂ Ω and |∂Ωi | ≤ |Sρ |. Finally,
this sequence of stable minimal hypersurfaces ∂Ωi with bounded volume
must converge by estimates of R. Schoen and Leon Simon [SS81]. The limit
112 4. The Riemannian Penrose inequality
Proof. Choose a distinguished end and then cut off all of the other ends at
large coordinate spheres. Now apply Theorem 4.7 to the result.
The following corollary of Theorem 2.41 gives us some control over the
topology of apparent horizons.
Corollary 4.10. If Σ is an apparent horizon in an asymptotically flat man-
ifold (M, g) with nonnegative scalar curvature, then Σ is orientable and
Yamabe positive. In particular, when M is three-dimensional, Σ is topo-
logically a union of spheres.
When n = 3, we also have some good control over the topology of the
exterior, even without any scalar curvature assumption.
Theorem 4.11. Let (M 3 , g) be an asymptotically flat manifold whose bound-
ary is either empty or minimal. Assume that M contains no immersed
minimal surfaces. Then M is diffeomorphic to R3 minus a finite number
(possibly zero) of open balls.
Observe that Theorem 4.16 is just the case when (Mout , gout ) = (Min , gin )
with minimal boundary. Pengzi Miao [Mia02] generalized Bray’s argument
to obtain the statement of Theorem 4.17, except that rigidity was only
116 4. The Riemannian Penrose inequality
obtained in dimension 3. Yuguang Shi and Luen-Fai Tam gave a proof for
spin manifolds using Witten’s spinor technique in [ST02], which we will
describe in detail in Section 5.4.1. Later, D. McFeron and G. Székelyhidi
gave another proof using Ricci flow and established that the rigidity holds
in general [MS12].
Observe that the reason why Theorem 4.17 does not follow directly
from the ordinary positive mass theorem (Theorem 3.18) is that the met-
ric g need not be smooth across Σ. In general, it is only Lipschitz. One
perspective on Theorem 4.17 is that it is a positive mass theorem for sin-
gular metrics, though the singular behavior here is quite mild. For more
work on the positive mass theorem for other kinds of singular metrics,
see [MM17, Lee13, GT12]. In particular, in the spin case, Theorem 4.17
can be generalized to show that the positive mass theorem holds as long as
the scalar curvature can be defined in a distributional sense [LL15]. These
sorts of theorems might be useful for trying to understand the concept of a
“weak” notion of nonnegative scalar curvature.
Exercise 4.18. Find an example showing that Theorem 4.17 fails without
the assumption Hout ≤ Hin .
a mollification of ht over the t-variable only, such that ht = ht for |t| > .
Let Σt refer to Σ × {t}, thought of as a hypersurface of (M, g ). Its induced
∂
metric is ht , and its second fundamental form is AΣt = ∂t ht . The key point
here is that since Hout ≤ Hin , one can show that ∂t HΣt is uniformly bounded
∂
from below in . The details of the mollification and this computation can
be found in [Mia02]. Since RΣt and |AΣt |2 are clearly uniformly bounded in
, equation (4.1) shows that Rg is uniformly bounded from below in . The
upshot is the following. After an appropriate mollification, we now have a
smooth metric g which agrees with g away from the -neighborhood of Σ,
and there is a constant C > 0, independent of , such that Rg > −C inside
that neighborhood.
We now make a conformal change that removes all of the negative scalar
curvature, just as in the proof of Theorem 3.43. We attempt to solve
− 4(n−1)
n−2 Δg v − (Rg )− v = (Rg )−
by showing that the operator
− 4(n−1) 2,p p
n−2 Δg − (Rg )− : W−q −→ L−q−2
is an isomorphism, where p > n/2 and 0 < q < n − 2. Once again, by
Theorem A.40, Δg is an isomorphism with an injectivity estimate for Δg
independent of . As before we need (Rg )− to converge to 0 in Lp−2 to show
that our operator can be chosen close enough to − 4(n−1)
n−2 Δg to make it an
isomorphism. But since |(Rg )− | < C and is supported in an -neighborhood
of Σ, (Rg )− Lp−2 = O(1/p ) → 0. Hence, we can solve for v and define
4
g̃ = (1 + v ) n−2 g .
The rest of the argument is the same as in the proof of Theorem 3.43,
2,p
except that it no longer needs to be true that g̃ → g in W−q or that
Rg̃ → Rg in L . But the convergence still holds away from Σ, and that is
1
all that is needed to invoke Lemma 3.35, since ADM mass is an asymptotic
quantity.
We now consider the case mADM (g) = 0. Suppose that Hout < Hin
somewhere in Σ. In this case, one can go further and show that (Rg )+ is
quite large near there, and consequently so is the nonnegative scalar curva-
ture Rg̃ . As seen in the proof of Lemma 3.31, this positive scalar curvature
implies that mADM (g̃ ) > 0. The hard part is to then show there is a posi-
tive lower bound independent of , and consequently mADM (g) > 0, giving
a contradiction. See [Mia02] for details.
Unfortunately, this argument does not yield the full rigidity result that
if mADM (g) = 0, then (M, g) is Euclidean space. As mentioned earlier,
McFeron and Székelyhidi [MS12] came up with an alternative method of
proof using Ricci flow in place of the conformal change. The main advantage
118 4. The Riemannian Penrose inequality
of their method is that it does yield a rigidity result. We give only a very brief
outline of the proof. Recall that Ricci flow preserves asymptotic flatness,
nonnegative scalar curvature, and ADM mass. Work of Miles Simon shows
that there exists a Ricci-DeTurck flow g(t) with initial condition g(0) =
g, which is Lipschitz but not smooth [Sim02]. Let g denote the same
smoothing of g constructed above by Miao, and consider the Ricci flows g (t)
with initial condition g . The idea is to compare g (t) to g(t), which will
simultaneously exist for some small time T . Miao’s construction shows that
(Rg )− is negligible (in an integral sense) and then, since Ricci flow has the
effect of increasing scalar curvature, (Rg (t) )− is also controlled as → 0.
McFeron and Székelyhidi showed that this control implies that Rg(t) ≥ 0
and also that mADM (g(t)) ≤ lim inf mADM (g (t)). Using the positive mass
theorem on the metric g(t) and the fact that Ricci flow (and Ricci-DeTurck
flow) preserves ADM mass, we have, for any small t < T ,
0 ≤ mADM (g(t)) ≤ lim inf mADM (g (t)) = lim inf mADM (g ) = mADM (g).
→0 →0
Proof. Let > 0. First, we double the (M, g) through its boundary to
obtain (M , ḡ). We apply the argument from Theorem 4.17 to obtain a
smoothing (M , ḡ ) such that ḡ has nonnegative scalar curvature, is -close
to g in C 0 , and has mass within of mADM (g). Next we apply our density
result (Lemma 3.48) to obtain a new harmonically flat metric ĝ on M that
has nonnegative scalar curvature, is -close to ḡ in C 0 , and has mass within
of mADM (g). By Corollary 4.9, (M , ĝ ) has an apparent horizon Σ with
respect to one of the ends. Let M̂ be a manifold without boundary obtained
by removing the part of M enclosed by Σ , so that ∂ M̂ = Σ .
4.1. Riemannian apparent horizons 119
We claim that if the Penrose inequality fails for (M, g), then it will also
fail for (M̂ , ĝ ) for small enough . This is because the mass does not change
much, while the volume of the outermost minimal hypersurface Σ in (M , ĝ )
cannot be significantly smaller than the volume of ∂M in (M, g). The reason
for this is that ∂M is not just outward-minimizing, but minimizing in (M , ḡ).
Thus |Σ |ḡ ≥ |∂M |g , and the C 0 closeness of ĝ to ĝ means that |Σ |ḡ cannot
be much smaller than |∂M |g . The result follows.
4.1.3. Special cases of the Penrose inequality. As we did for the pos-
itive mass theorem, we can consider some simple cases of the Penrose in-
equality. Of course, the easiest is the spherically symmetric case.
Proposition 4.20. Let (M, g) be a complete asymptotically flat manifold
diffeomorphic to [0, ∞)×S n−1 which is spherically symmetric in the sense of
Section 3.1.1. If g has nonnegative scalar curvature and ∂M is an apparent
horizon, then
n−2
1 |∂M | n−1
mADM (M, g) ≥ .
2 ωn−1
Moreover, if equality holds, then M is isometric to half of the Schwarzschild
space of mass mADM (M, g).
1 n−2
≤ lim r (1 − V (r)) = mADM (g).
r→∞2
If we have equality, then 12 rn−2 (1 − V (r)) is identically equal to m :=
mADM (g) for all r, which means that V (r) = 1 − r2m n−2 . Therefore g is
1
the Schwarzschild metric of mass m (3.1), and r0 = (2m) n−2 .
Note that by Lemma 4.19, one can assume without loss of generality
that the asymptotic decay rate is at least 1, for the purpose of proving the
inequality, but not for the purpose of characterizing the equality case.
Our goal in this section is to summarize the main features of Huisken
and Ilmanen’s argument. We restrict our attention to dimension 3, since
the techniques described here do not generalize well to higher dimensions.
Definition 4.23. Given a closed surface2 Σ2 in a Riemannian manifold
(M 3 , g), we define its Hawking mass to be
&
|Σ| 1
mHaw (Σ) = 1− 2
H dμΣ .
16π 16π Σ
primarily interested in the Hawking mass of spheres in this section, it is convenient to define the
Hawking mass this way.
122 4. The Riemannian Penrose inequality
≥4 K dμΣ
V
=4 det(Dν) dμΣ
V
≥ 4|S 2 | = 16π.
We omit the equality case, except to note that we can only have H 2 = 4K
where the surface is umbilic (that is, the principal curvatures are equal).
4.2. Inverse mean curvature flow 123
The famed Willmore conjecture [Wil65] states that any immersed torus
has
H 2 dμΣ ≥ 8π 2 ,
Σ
with equality only for so-called Willmore tori. This conjecture was proved
by Fernando Marques and André Neves using minimal surface techniques
[MN14].
Translated for our purposes, the Willmore inequality states that the
Hawking mass of an embedded surface in Euclidean R3 is nonpositive and
zero only for round spheres. One way to interpret this is that since Euclidean
space “contains no mass” in any sense, the Hawking mass only gives a “good”
quasi-local mass for round spheres. In general, we can easily see from the
definition that if we put lots of little wiggles in Σ, its Hawking mass can
be made to be an arbitrarily large negative number. The upshot is that
Hawking mass is most useful when Σ is somewhat nice.
Exercise 4.25. Let (M 3 , g) be an asymptotically flat manifold. Let Σρ be
the coordinate sphere of radius ρ in an asymptotically flat coordinate chart.
Prove that limρ→∞ mHaw (Σρ ) = mADM (g).
take it piece by piece. First, recall from Proposition 2.10 (and the discussion
following it) that
∂
dμt = Hϕ dμt = dμt ,
∂t
where dμt denotes the area measure on Σt . Indeed, one way of thinking
about inverse mean curvature flow is that it is precisely the flow that makes
the area measure of Σt grow exponentially. In particular,
d
|Σt | = |Σt |.
dt
Also recall from (2.15) that
∂ 1
H = −Δ(H −1 ) + (2K − R − |A|2 − H 2 )H −1 ,
∂t 2
where K is the Gauss curvature of Σt , R is the scalar curvature of the
ambient metric g, and we suppress much of the dependence on t in our
notation. In the following, λ1 and λ2 will denote the principal curvatures of
Σt . We compute
d
H 2 dμt
dt Σt
1
= (2H[−Δ(H −1 ) + (2K − R − |A|2 − H 2 )H −1 ] + H 2 ) dμt
2
Σt
= (2∇H, ∇(H −1 ) + 2K − R − |A|2 ) dμt
Σt
= (−2H −2 |∇H|2 + 2K − R − |A|2 ) dμt
Σt
1 1
= 4πχ(Σt ) + −2H −2 |∇H|2 − R − (λ1 − λ2 )2 − H 2 dμt
Σt 2 2
1
≤ 8π − H 2 dμt ,
2 Σt
where we used integration by parts in the second line, the Gauss-Bonnet
Theorem in the fourth line, and connectedness of Σt and nonnegativity of
R in the last line. Thus
3/2 d d
(16π) mHaw (Σt ) = |Σt | 16π − 2
H dμt
dt dt Σt
1 d
= |Σt | 16π − H dμt − |Σt |
2
H 2 dμt
2 Σt dt Σt
≥ 0,
using our previous calculation.
4.2. Inverse mean curvature flow 125
Note that the use of the Gauss-Bonnet formula above restricts this ar-
gument to dimension 3. The monotonicity provides an idea for how to prove
the Penrose inequality in three dimensions.
Let (M 3 , g) be a complete one-ended asymptotically flat manifold with
boundary, with nonnegative scalar curvature. Assume that ∂M is minimal
and outward-minimizing. Suppose there is an inverse mean curvature flow
Σt with initial surface Σ0 = ∂M , and suppose that this flow exists for all
time and Σt flows out toward the infinity of the end. Then by definition of
Hawking mass and Geroch monotonicity, we have
&
|Σ0 |
= mHaw (Σ0 ) ≤ lim mHaw (Σt ).
16π t→∞
Exercise 4.28. Verify that u = (n − 1) log |x| solves inverse mean curvature
flow (4.2) on Rn {0} with the Euclidean metric.
Also check that u = 12 (n − 1) log |x| is a subsolution of inverse mean
curvature flow on any asymptotically flat end (Rn B̄1 , g) where |x| is suf-
ficiently large. Being a subsolution means that
∇u
divM ≥ |∇u|,
|∇u|
or, in other words, the level sets always move outward at least as fast as
they would under inverse mean curvature flow.
K
∇u
= |∇v| + (v − u) divM + u|∇u| dμ
K |∇u|
' (
∇u
= |∇v| + ∇(u − v), + u|∇u| dμ
K |∇u|
' (
∇u
= |∇v| + |∇u| − ∇v, + u|∇u| dμ
K |∇u|
≥ JuK (u).
Remark 4.31. Lemma 4.30 should NOT be interpreted to mean that a
smooth evolution and a weak evolution of a given initial surface must agree
until the smooth solution becomes singular. (Indeed, this is false.) The weak
formulation is global in character, while the smooth formulation is not.
128 4. The Riemannian Penrose inequality
Sketch of the proof. Since the proof is quite involved and uses a fair bit
of analysis, we will only give a sketch, emphasizing the geometric content as
much as possible. As described above, we first attempt to solve a regularized
equation. Recall from Exercise 4.28 that v = 12 (n − 1) log |x| is a subsolution
for large enough |x| in the asymptotic region. For large L, let VL be the
region bounded between ∂Ω0 and the coordinate sphere v −1 (L). For each
> 0, we consider the following Dirichlet problem on VL :
)
∇u,L
(4.3) div = |∇u,L |2 + 2 ,
|∇u,L |2 + 2
(4.4) u,L = 0 at ∂Ω0 ,
(4.5) u,L = L − 2 at v = L.
The nonlinear elliptic equation (4.3) has a nice geometric interpretation.
Exercise 4.34. Given u,L as above, define the function U,L : VL ×R −→ R
by U,L (x, z) := u,L (x) − z. Assuming that u,L is C 2 , verify that U,L is
itself a solution of IMCF on the product space VL × R with metric g + dz 2 .
Clearly, the level sets of U,L are just translations of the graph of the
function −1 u,L : VL −→ R. Together with the exercise above, we see
4.2. Inverse mean curvature flow 129
that u,L solves equation (4.3) if and only if the graph of −1 u,L moves by
downward translation under smooth IMCF.
Given fixed L, one can prove that the above Dirichlet problem can be
solved for sufficiently small . This is where the bulk of the work is, and it
is accomplished using some standard PDE techniques. The existence of a
subsolution v plays an essential role. (Keep in mind that the existence of
such a v is really a statement about the asymptotic behavior (M, g).) For
details, see Huisken and Ilmanen’s paper [HI01, Section 3].
Hence, we can construct a sequence of solutions ui of the regularized
Dirichlet problems with Li → ∞ and i → 0. These solutions come with
gradient estimates that allow us to apply Arzela-Ascoli to find a locally
Lipschitz limit function u such that ui → u uniformly on compact subsets
of M Ω0 . Clearly, we have the desired boundary conditions for u, that is,
u = 0 at ∂Ω0 , u ≥ 0 on M Ω0 , and u(x) → ∞ as x → ∞. (We can fill in
u arbitrarily to obtain u < 0 in Ω0 .)
Next we recall that the functions Ui (x, z) := ui (x) − i z are smooth solu-
tions of IMCF and converge to U (x, z) := u(x) uniformly on compact sets.
(See Figure 4.4.) The critical observation is that being a weak solution of
IMCF is preserved in the limit (assuming uniformly locally bounded gradi-
ents). Considering the minimization principle used to define the weak flow,
this is not too surprising. Finally, given that U is a weak solution of IMCF
on (M Ω0 ) × R, one can see that u is a weak solution on M Ω0 .
130 4. The Riemannian Penrose inequality
Exercise 4.35. Prove the last assertion in the proof sketch above. That is,
given that U (x, z) = u(x), if U is a weak solution of IMCF on (M Ω0 ) × R,
then u is a weak solution of IMCF on M Ω0 .
Definition 4.36. Given a function u, we define the following notation,
which we will use throughout this section:
Ωt := {u < t},
t := Int{u ≤ t},
Ω+
Σt := ∂Ωt ,
Σ+ +
t := ∂Ωt .
Lemma 4.37. Let u be a weak solution of IMCF on (U, g). Then the sublevel
sets Ωt minimize Ju on U in the following sense. For any set F of locally
finite perimeter with Ωt F contained in some compact set K ⊂ U , we have
JuK (Ωt ) ≤ JuK (F ).
The minimization property in Lemma 4.37 allows one to derive the fol-
lowing regularity theorem [HI01, Regularity Theorem 1.3]. We omit both
proofs.
Theorem 4.38 (C 1,α regularity of weak IMCF). Let n < 8, and let (M n , g),
Ω0 , and u be as described in Theorem 4.32. Then Σt and Σ+ t are C
1,α for
C 1,α sense.
Proof. For item (1), let Ωt ⊂ F , where F has finite perimeter. We use the
minimization property from Lemma 4.37 to see that
∗
|∂Ωt ∩ K| − |∇u| dμ ≤ |∂ F ∩ K| − |∇u| dμ
Ωt ∩K F ∩K
We can now describe a rough intuitive picture of how the weak IMCF
works: as long as Ωt remains a minimizing hull, it flows by “ordinary”
inverse mean curvature flow. (Here we say “ordinary” instead of “classical,”
since it is a flow of C 1,α surfaces, continuous in the C 1,α topology.) At any
moment when Ωt is about to cease being a minimizing hull, it “jumps” to its
strictly minimizing hull Ω+ t and then continues to flow by ordinary inverse
mean curvature flow. However, this is not rigorous because there can be
countably many jump times. Moreover, the weak flow essentially tells us
how to start (or restart) the flow from any strictly minimizing hull (which
typically only has H ≥ 0 but not H > 0), which the classical flow does not
cover.
132 4. The Riemannian Penrose inequality
4.2.3. Geroch monotonicity of the weak flow. Observe that the in-
tuitive picture described above suggests that Geroch monotonicity ought
to hold for the weak flow. We know that Geroch monotonicity holds for
the classical flow, and the weak flow should behave much like the classical
flow, except for some jumps. But the effect of jumps on Hawking mass has
“good sign.” This is because the “jump” is a jump from Ωt to its strictly
minimizing hull Ω+ t . The
perimeter |Σt | is continuous at a jump time by
Corollary 4.39, while Σt H can only jump down to Σ+ H 2 since Σ+
2
t is
t
minimal where it disagrees with Σt . Therefore the Hawking mass should
only be able to jump up at a jump time.
Unfortunately, since that argument is not rigorous, a more involved ar-
gument is needed. First we establish the desired volume growth.
Lemma 4.40. Let (M n , g), Ω0 , and u be as described in Theorem 4.32.
Then for t > 0, e−t |∂Ωt | is constant.
Exercise 4.41 (Exponential Growth Lemma 1.6 of [HI01]). Prove Lemma
4.40. Hint: Use the minimization property from Lemma 4.37 to see that
JuK (Ωt ) is constant in t. Then use the co-area formula to find an integral
equation for |∂Ωt |.
0 ≤ t ≤ Li − 7,
d
2
φH dμt = φ(−2H −2 |∇H|2 − 2|A|2 − 2Ric(ν, ν) + H 2 )
dt Σit Σit
+∇φ, −2H −1 ∇H + Hν dμt .
The reason why we take t ≤ Li − 7 is to make sure that φ vanishes near the
boundary of Σit .
Proof. Theorem 4.38 says that if t is a jump time, then Σt can be approx-
imated by Σt for some nonjump time t . So without loss of generality, let
us assume that t is not a jump time, so that Σt = {u = t}.
We first show that {0 ≤ u ≤ t} = Ωt Ω0 is connected. Suppose it is
not. Then it must have a component K that is disjoint from the connected
set ∂Ω0 . Since ∂K ⊂ Σt = ∂Ωt , the interior of K, where 0 < u < t, must be
nonempty. Since K is compact, it follows that K attains a local minimum
in the interior of K. Using the Ju -minimizing property of u, one can show
this is only possible if u is constant on K, which is a contradiction. (Prove
this as an exercise.)
Similarly, we can show that {u ≥ t} = M Ωt is connected. Suppose
it is not. Since u → ∞ at infinity, we know that exactly one component
of M Ωt can be unbounded. So any other component K is compact, and
since ∂K ⊂ Σ+ +
t = ∂Ωt , we can see that the interior of K, where u > t,
4.2. Inverse mean curvature flow 135
Combining these two facts implies that monotonicity holds at these jump
times, that is,
mHaw (∂Ωt1 ) ≤ mHaw (∂Ω+
t1 ).
Putting this discussion together with Theorem 4.44 and Lemma 4.46,
we immediately obtain the following.
Corollary 4.47 (Geroch monotonicity for modified weak IMCF). Let
(M 3 , g) be a complete asymptotically flat manifold whose boundary consists
of an outward-minimizing connected surface Σ0 and possibly other com-
ponents which are minimal. Assume that b1 (M ) = 0. Arbitrarily fill in
136 4. The Riemannian Penrose inequality
the boundary ∂M (in a way that does not introduce any b1 ) and consider
the “modified” weak IMCF Ωt as described above, with initial condition
Σ0 = ∂Ω0 . Then for 0 ≤ r < s,
mHaw (Σs ) ≥ mHaw (Σr )
s
1 −2
+ |Σt | 3/2
2H |∇H| + R + (λ1 − λ2 ) dμt dt.
2 2
(16π)3/2 r Σt
In particular, if M has nonnegative scalar curvature, then
mHaw (Σs ) ≥ mHaw (Σr ).
4.2.4. The long-time limit of the flow. Now that we have established
monotonicity for the modified weak IMCF, the only thing left to do is verify
that
lim mHaw (Σt ) ≤ mADM (M, g).
t→∞
Recall from Exercise 4.25 that the Hawking masses of the coordinate spheres
converge to the ADM mass. We will show that Σt becomes “rounder” as
t → ∞ in a sense that is strong enough to estimate the Hawking mass.
First we establish a uniqueness result for the space Rn {0} [HI01,
Proposition 7.2].
Lemma 4.48. Let u be a solution of weak inverse mean curvature flow
on Rn {0} with the Euclidean metric, such that the level sets of u are
compact. Then we must have u = c + (n − 1) log |x| for some constant c.
This corresponds to Σt being the sphere of radius e(t−c)/(n−1) .
The importance of this lemma is that our IMCF on (M, g) can be “blown-
down” to a solution on Rn {0}, as follows. Suppose that (M n , g) and
u are as described in Corollary 4.47. Then u solves weak IMCF in the
asymptotically flat end, which we may take to be Rn B̄r . Given λ > 0,
we consider uλ (x) := u(x/λ), which can easily be seen to solve weak IMCF
in Rn B̄λr with the scaled metric g λ (x) = λ2 g(x/λ). Using the gradient
bounds and the compactness theory from the proof of Theorem 4.32, one can
show that for any sequence of λ’s approaching zero, there is a subsequence
λi and constants cλi such that uλi − cλi converges locally uniformly to some
v solving IMCF on Rn {0} with the Euclidean metric.
all later t, assuming that the sublevel sets are enclosed regions. (We omit
the proof. See [HI01, Uniqueness Theorem 2.2].) Let S1 be the coordinate
sphere inscribing Σt , and let S2 be the coordinate sphere circumscribing
Σt . By evolving all three by the weak IMCF, the comparison principle tells
us that for τ > 0, Σt+τ is sandwiched between eτ /(n−1) S1 and eτ /(n−1) S2 .
Therefore θ(Σt ) is nonincreasing in t.
Moreover, we claim that if Σt is not a coordinate sphere, then θ(Σt ) must
strictly decrease. This is due to a strong maximum principle for smooth
IMCF, which says that if one initial surface is enclosed by a distinct sur-
face, then IMCF must immediately force them to become disjoint. Thus,
if Σt is smooth, then for τ > 0, Σt+τ is sandwiched between eτ /(n−1) S1
and eτ /(n−1) S2 but not touching them, and thus θ(Σt+τ ) < θ(Σt ). If Σt is
not smooth, we can still obtain the same conclusion by comparing to what
happens to a smooth surface lying between Σt and S2 .
On the space Rn {0}, it makes sense to blow up (rather than blow
down, as described above) the solution u to obtain a new solution ũ. Then
we can see that for any time τ ,
From Lemma 4.48 and the discussion immediately following it, we see
that the desired result will follow as long as we can establish that the blow-
down limit has compact level sets. We will prove this by bounding the
eccentricity of Σt for large t. The gradient bounds that were used in the
proof of Theorem 4.32 tell us that
|∇u| = O(|x|−1 ).
The comparison principle described in the proof of Lemma 4.48 also works
when a weak solution of IMCF is contained in a subsolution of IMCF (that
is, a family of surfaces moving “at least as fast” as IMCF). Recall from
Exercise 4.28 that 12 (n − 1) log |x| is a subsolution for large enough |x|.
138 4. The Riemannian Penrose inequality
We now observe that Lemma 4.49 implies that the rescaled level set
1
of area 4π must converge to the unit sphere in R3 in C 1 . The C 0
r(t) Σt
x
convergence tells us that r(t) → |x|
x
, while the C 1 convergence also tells us
that ν → |x|X
. Hence X − r(t)ν = o(r(t)). Since the asymptotic decay rate
is at least 1, we have Gij = O(|x|−2 ). Putting all of this together, we have
1
mHaw (Σt ) ≤ − G(X, ν) dμt + o(r(t)),
8π Σt
and now the result follows from Theorem 3.14 (and the remark following
it).
where the second inequality essentially follows from the fact that Σt becomes
“more round” in some sense as t → ∞, which we proved using a blow-down
argument.
not really necessary to remove all of the orientable closed minimal surfaces;
we really only need to inductively remove enough of them (finitely many)
to kill the generators of H1 (M, R).
Next we consider the case of equality. In this case the monotonicity
inequality (Theorem 4.44) becomes an equality, and thus, for a.e. t,
−2
(4.7) 2H |∇H|2 + R + (λ1 − λ2 )2 dμt = 0.
Σt
By the semicontinuity from Theorem 4.38, this actually holds for all t. In
particular, we see that ∇H must vanish a.e. in Σt for all t. By the regularity
from Theorem 4.38, this implies that H is constant on Σt . Together with
the regularity from Theorem 4.38, one can then use the elliptic theory for
the constant mean curvature equation to show that Σt is actually smooth.
(We omit the proof.) The same reasoning can be used to prove that Σ+ t is
also smooth and has constant mean curvature. Recall that Σ+ t must have
H = 0 wherever it disagrees with Σt . Therefore in this case, we must have
Σ+ +
t = Σt , or else Σt would be minimal, violating the outermost property of
the apparent horizon ∂M . In particular, this means that the filled-in region
W is empty, and thus ∂M is connected and equal to Σ0 , which we know
must be a sphere by Corollary 4.10. The absence of “jumps” also allows us
to use the IMCF comparison principle to show that if one chooses an initial
time t0 > 0, then Σt must agree with the smooth evolution of Σt0 under
classical IMCF for t slightly larger than t0 (which exists by parabolicity
since Σt0 has positive mean curvature). In other words, Σt satisfies classical
IMCF for t > 0.
This allows us to view our inverse mean curvature flow Σt as defining a
diffeomorphism from [0, ∞) × S 2 to M . Since the speed of the flow is H −1 ,
the metric g pulled back to [0, ∞) × S 2 can be expressed as
g = H −2 dt2 + ht ,
where ht is just the induced metric on Σt , pulled back to S 2 . (Here and
below, we engage in some harmless abuse of notation, identifying quantities
∂
with their pullbacks.) By (2.7), we know that ∂t ht = H2 A. By (4.7), we
know that Σt is totally umbilic (that is, λ1 = λ2 everywhere) for all t, that
is, A = H2 ht . Thus ∂t
∂
ht = ht , and consequently ht = et h0 . Meanwhile, from
Exercise 2.13, we know that
∂
H = −ΔΣt H −1 − (|A|2 + Ric(ν, ν))H −1 .
∂t
Since we already established that H and A are constant on each Σt , the same
must be true for Ric(ν, ν). Since we also know that R = 0 from (4.7), the
traced Gauss equation implies that Σt also has constant Gauss curvature K.
142 4. The Riemannian Penrose inequality
Thus
g = H −2 dt2 + et h0 ,
where (∂M, h0 ) is a minimal round sphere in (M, g). Since R = 0 every-
where, we can now use Exercise 3.2 to obtain the desired result that (M, g)
is half of a Schwarzschild space.
Finally, we note that the inverse mean curvature flow argument can be
used to prove the positive mass theorem in three dimensions. Suppose (M, g)
contains a closed minimal surface. Then we can create a new space M + as
in the proof of Theorem 4.22 that contains no closed minimal surfaces in its
interior, but does have a nontrivial minimal boundary. Then Theorem 4.53
immediately implies that mADM (g) > 0. Now suppose that (M, g) does not
contain any closed minimal surfaces. Then for any point p ∈ M , we can
construct a solution u of weak IMCF on M {p} such that limx→p u(x) =
−∞, limx→∞ u(x) = ∞, and mHaw (Σt ) ≥ 0 for all t. The basic idea behind
the construction is the following (as explained in [HI01, Section 8]). For
each > 0, there exists a solution u to weak IMCF with initial condition
∂B (p) by Theorem 4.32. Again using gradient estimates and compactness of
solutions, one can extract a subsequence and find a sequence of real numbers
ci → ∞ such that ui − ci that converges to a weak solution u on M {p}.
Eccentricity estimates can be used to show that u has nonempty, compact
level sets for all t. Finally, since mHaw (∂Bi (p)) → 0, Geroch monotonicity
and Proposition 4.52 prove that mADM (g) ≥ limt→0+ mHaw (Σt ) ≥ 0.
We say that (M, gt , Ωt ) is a Bray flow (or conformal flow) on the interval
[0, T ) if gt is a family of metrics on M such that gt (x) is Lipschitz in t and
C 1 in x, and smooth in x away from Ωt , and the following conditions hold
for all t ∈ (0, T ):
4
• gt = utn−2 g0 , where
t
ut = 1 + vs ds,
0
Note that our formulation of the second condition has changed, but it is
essentially equivalent to the previous version.
Proof. Let us assume that the conditions hold at t = 0 and prove that they
continue to hold at any time t > 0. Since vs is g0 -harmonic away from Ωt for
all s < t, it follows that ut is g0 -harmonic away from Ωt . We also see that
ut is asymptotic to a constant at infinity. By Theorem A.38 and Exercise
4
3.13, we see that gt = utn−2 g0 is asymptotically flat. Moreover, since g0 has
nonnegative scalar curvature outside Ωt , we can use (1.6) to see that gt also
has nonnegative scalar curvature outside Ωt .
It is also the case that Σt being minimal in (M, gt ) should also be pre-
served under the flow. Indeed, we are only interested in the case when Σt is
minimal in (M, gt ), and this fact will be rolled into our existence theorem.
Although we have chosen to formulate the Bray flow in terms of the pair
(Ωt , gt ) on M , we should really think of it as a flow of asymptotically flat
manifolds with boundary (M Ωt , gt ). This flow preserves the nonnegative
scalar curvature condition and the minimal boundary condition, and the
idea is that it should “improve” the space in the sense that these manifolds
should flow toward Schwarzschild space. For this to be the case, we expect
Schwarzschild space to be a sort of fixed point for the flow. More accurately,
Schwarzschild space should be thought of as a “soliton solution” of the Bray
flow, as illustrated in the following example.
4
Exercise 4.58. Fix m > 0. Let M = Rn and (gt )ij = Utn−2 δij , where
⎧
1
⎪
⎪ e−t + m |x|
2t
2−n et for |x| ≥ m n−2 · e n−2 ,
⎪
⎨ 2 n−2 2
2 m
1 m
1 2t
Ut =
⎪ 2 m
for n−2
≤ |x| < n−2
· e n−2 ,
⎪
⎪
2|x| 2
1
2
⎩
U0 for |x| < m2
n−2
;
here U0 is any smooth positive fill-in function whose details are unimportant.
1 2t
Let Ωt = {|x| < m 2
n−2 e n−2 }. Verify that (M, g , Ω ) is a Bray flow for
t t
t ≥ 0, and that for each t ≥ 0, (M Ωt , gt ) is isometric to half of the
Schwarzschild space of mass m.
Sketch of the proof. The proof is quite technical, so we will only provide
an outline. (For the full details, see [Bra01, Section 4].) We will start with
a version of Bray flow in which vt and Ωt are discrete in time, and then we
take a limit as the discrete time intervals shrink to zero.
Let > 0. We will iteratively define vt , ut , gt , and Ωt . We know what
all of these should be at t = 0 (namely, their counterparts without the
superscripts). Assume that we have defined these objects for t ∈ [0, k],
where k is a nonnegative integer. Then we define them for t ∈ (k, (k + 1)]
as follows. The function vt is defined to be 0 on Ωk , and outside Ωk , vt is
the unique solution to the Dirichlet problem
⎧
⎨ Δg0 vt (x) = 0 on M Ωk ,
vt (x) = 0 at ∂Ωk ,
⎩
limx→∞ vt (x) = −(1 − )k .
Then we define
t
ut =1+ vs ds,
0
4
gt = (ut ) n−2 g0 ,
and we define Ωt to equal Ωk for all t ∈ (k, (k + 1)), while Ω(k+1) is the
strictly minimizing hull of Ωk with respect to the metric g(k+1)
.
The crucial nontrivial fact is that the hypersurfaces Σt := ∂Ωt are not
only smooth, but also their local C k,α bounds can be shown to be indepen-
dent of . This is proved in [Bra01, Appendix E], using regularity theory of
sets of finite perimeter developed by E. De Giorgi [DG61] (see [MM84]).
In particular, this regularity guarantees that vt (x) has Lipschitz bounds in
x independent of , and consequently ut (x) has Lipschitz bounds in both x
and t independent of . By Arzela-Ascoli, we can find a sequence i such
that ut i (x) converges uniformly on compact sets to some Lipschitz function
ut (x).
4
We next define gt = utn−2 g0 and Ωt to be the strictly minimizing hull of
Ω0 with respect to gt . The C k,α bounds on Σt allow us to extract smooth
(γ)
limit hypersurfaces Σt of Σt i , where γ is an index for these limit surfaces.
(γ) (γ)
Unfortunately, it need not be the case that Ωt = Ωt , where Ωt is the
(γ)
region enclosed by Σt . However, one can show (with some work) that if
t1 ≤ t2 ≤ t3 , then
(γ)
(4.8) Ωt1 ⊂ Ωt2 ⊂ Ωt3 .
We claim that for almost every t, vti (x) converges to vt (x) as i → ∞. Since
Ωt has smooth boundary,
2 there can only be countably many times when Ωt
does not equal s>t Ωs . As long as t is not one of those “jump times” for
(γ)
the flow, it follows from (4.8) that we do have Ωt = Ωt . This implies that
Σt i converges to Σt , which implies the claim.
t
Given the claim, it follows easily that ut = 1 + 0 vs ds.
Next we explain why, for t1 < t2 , Σt1 does not touch Σt2 . Since the
metric gt can only get smaller, one can show that since the hypersurface
Σt1 is minimal with respect to t1 , it must have nonpositive mean curvature
with respect to t2 . Therefore the strong comparison principle (Corollary 4.2)
shows that Σt2 cannot touch Σt1 .
Using this fact, we see that for each x0 ∈ M , there is at most one time t
such that x0 ∈ Σt . Since vt is smooth everywhere except at Σt and has
Lipschitz bounds, one can conclude that ut is actually C 1 in x, and it is
clear that ut must be smooth outside Σt .
(M, gt , Ωt ) with initial condition (M, g, ∅). (Recall that with our conven-
tions, we have ∂∅ = ∂M = Σ0 .) Then the itemized list above tells us that
n−2
n−2
1 A∞ n−1 1 A n−1
mADM (g) ≥ lim mADM (gt ) = m∞ = ≥ .
t→∞ 2 ωn−1 2 ωn−1
Lemma 4.60. Assume (M, gt , Σt ) is a Bray flow as constructed in Theo-
rem 4.59. Then the volume of Σt in (M, gt ) is constant in t. Moreover,
Σ+t = Σt for all t ≥ 0.
Sketch of the proof. We will first give the basic idea of why it is true by
considering the case when Σt varies smoothly in t at t = t0 . In this case we
have
d d d
|Σt |gt = |Σt |gt0 + |Σt0 |gt .
dt t=t0 dt t=t0 dt t=t0
The first term vanishes because Σt0 is minimal in gt0 , while the second term
vanishes because gt is unchanging at Σt0 at t = t0 .
In order to prove the result in the general case, we must delve into
the inner workings of the proof of Theorem 4.59. Specifically, we show
(γ)
that every limit surface Σt with respect to gt has volume equal to A :=
|Σ0 |g0 . (Actually, this fact is needed in the proof of Theorem 4.59 in order
to establish (4.8).) It suffices to show that A (t) := |Σt |gt approaches A as
→ 0. This, in turn, can be proved by showing that
A(k+1) − Ak = o().
Going back to the definitions and using the outward-minimizing property,
at Σ
this difference can be estimated in terms of the size of vk (k+1) . Finally,
the size of this quantity can be bounded using the uniform estimates on Σt
for most values of k. For details, see [Bra01, Section 5].
To see why Σ+
t = Σt , note that by lower semicontinuity of perimeter, we
have
t |gt ≤ lim |Σs |gt = lim |Σs |gs = |Σt |gt .
|Σ+ + +
s→t+ s→t+
Since Σt is strictly outward-minimizing and Σ+
t encloses it, it follows that
they must be equal.
4
considering the metric g̃t = (ωt ) n−2 ḡt on M t . The result is a new one-ended
manifold (M3t = M t ∪ {pt}, g̃t ) with nonnegative scalar curvature. (Recall
that we did something similar to this step in our proof of Lemma 3.39.) See
Figure 4.5 for an illustration of this procedure. This construction involving
doubling the manifold and then conformally closing up the newly created
asymptotically flat end was inspired by [BMuA87]. (See Theorem 6.25.)
Lemma 4.61. Assume (M, gt , Ωt ) is a Bray flow as constructed in Theo-
3t , g̃t ). Then
rem 4.59. Let m̃(t) be the mass of (M
t
m(t) = m(0) − 2 m̃(s) ds.
0
Proof. By Lemma 4.57, we know that (M t , ḡt ) has nonnegative scalar cur-
vature, and thus, by equation (1.6), (M 3t , g̃t ) also has nonnegative scalar
curvature. By the positive mass theorem (Theorem 3.18), we should have
m̃(t) ≥ 0. Then the corollary follows from Lemma 4.61.
There is a complication here, which is that (M t , ḡt ) is not smooth at the
hypersurface Σt where the gluing occurred, and therefore neither is (M 3t , g̃t ).
But although the positive mass theorem (Theorem 3.18) does not directly
apply, we can use Theorem 4.17 instead. (We can also use Theorem 3.43 to
take care of the lack of smoothness at the point added at infinity.)
Combining this with the previous equation yields the desired result.
∂
Note: This proof is a bit easier to follow if one assumes ∂t ut = vt , in
which case one can simply differentiate equation (4.10) and compare it with
equation (4.9) to see that m (t) = −2m̃(t). However, at jump times, the
function m(t) only has left and right side derivatives.
∞
Proof. By Lemma 4.61, 0 m̃(s) ds must be finite. So in order to prove the
lemma, it suffices to prove that m̃(t) has a one-sided bound on its different
quotients.
By the definition of vt , et vt is the unique g0 -harmonic that is −1 at
infinity and 0 at Σt . Since Σt moves outward, the maximum principle shows
that for each fixed x, the function et vt (x) is nondecreasing in t. Using the
same notation as in the proof of Lemma 4.61, this tells us that et b(t) is
nondecreasing. By equations (4.9) and (4.10), we have
et b(t) = et B(t) + m(0) − e2t m̃(t)
1 1
= e2t m(t) − m̃(t) + m(0).
2 2
Therefore e2t (m(t) − 2m̃(t)) is nondecreasing. Next we use the fact that
if f is a function such that e2t f is nondecreasing, then f + 2 f is also
nondecreasing. Therefore
t
m(t) − 2m̃(t) + 2 (m(s) − 2m̃(s)) ds is nondecreasing.
0
Using Lemma 4.61 to eliminate the integral of m̃(s), we see that
t
3m(t) − 2m̃(t) + 2 m(s) ds is nondecreasing.
0
We will use this to show that the difference quotients of m̃ are bounded
above. Taking a difference quotient of the above expression with h > 0
shows that
t+h
1
0≤ 3m(t + h) − 3m(t) − 2m̃(t + h) + 2m̃(t) + 2 m(s) ds
h t
2
≤ − (m̃(t + h) − m̃(t)) + 2m(t),
h
where we used the fact that m(t) is nonincreasing. Thus
1
(m̃(t + h) − m̃(t)) ≤ m(t) ≤ m(0).
h
∞
By Corollary 4.62 and nonnegativity of m(t), we have 0 m̃ ≤ 12 m(0). Since
m̃(t) is integrable and has an upper bound on its difference quotients, the
result follows. (Once again, the argument is simpler if one assumes that
m(t) and m̃(t) are differentiable.)
Recall from Exercise 4.58 that the Schwarzschild space is not a fixed
point for the Bray flow but rather a “soliton” solution in the sense that
its evolution is related to the original via diffeomorphism (in the region
outside the horizon). Therefore, we only expect our Bray flow to converge
to Schwarzschild space after pulling back by a suitable diffeomorphism. Or
152 4. The Riemannian Penrose inequality
Our goal will be to show that under the normalized Bray flow, (M Ω∗t ,
gt∗ ) converges to half of Schwarzschild space, which is a fixed point of the
normalized Bray flow. Recall from Lemma 4.19 that for the purpose of
proving the Penrose inequality, we may assume without loss of generality
that (M, g0 ) is harmonically flat outside a compact set, and we will do so
starting now. By Exercise 4.55, one can see that the Bray flow preserves
the property of being harmonically flat outside a compact set. Being har-
monically flat outside a compact set means that the metric is conformal to
Schwarzschild outside a compact set. Therefore in order to prove conver-
gence to Schwarzschild, we need only prove convergence of a single function.
One important step is to obtain at least a small amount of control
over Ω∗t .
Lemma 4.65. Assume (M, gt , Ωt ) is a Bray flow as constructed in Theo-
rem 4.59, and assume that g0 is harmonically flat outside a compact set. In
the harmonically flat coordinates, there exists some large r1 such that the
normalized region Ω∗t is always enclosed by the coordinate sphere |x| = r1 .
Idea of the proof. The basic idea is that if Σ∗t extends too far out, that
will cause it to have large volume, but we already know that it has fixed
volume A. The reason why it will have to have large volume is the following.
A well-known property of a smooth minimal hypersurface Σ in Euclidean
Rn is that if p ∈ Σ, then
|Σ ∩ Br (p)| ≥ ωn−1 rn−1 .
4.3. Bray’s conformal flow 153
We know that (M, g0 ) is harmonically flat for |x| > r1 , where r1 is given
by Lemma 4.65. Thus
4
(g0 )ij (x) = U n−2 δij
for some harmonic function U , for |x| > r1 . We can extend U to a globally
defined positive function on M , and then define the metric ḡ via
4
g0 = U n−2 ḡ.
Ut := et (ut U ) ◦ Φt ,
Vt := et (vt U ) ◦ Φt ,
−4t
ḡt := e n−2 Φ∗t ḡ,
g̃t∗ := Φ∗ g̃t outside Ω∗t .
Note that since Ut and Vt are harmonic (with respect to the Euclidean
metric), we can expand them in spherical harmonics as in Corollary A.19.
n−2
Exercise 4.66. With the definitions above, let t0 = 4 log(r1 /r0 ), and
verify the following facts:
• For |x| > r1 , (ḡt )ij (x) = δij . That is, for t > t0 , ḡt is also a flat
background metric outside a compact set. It is also equal to δij for
|x| > r0 when t > t0 .
4
• gt∗ = Utn−2 ḡt .
4
• g̃t∗ = Wtn−2 ḡt , where Wt := 12 (Ut − Vt ) outside Ω∗t .
t
• Ut = U0 + 2 0 (Us − Ws + XUs ) ds.
154 4. The Riemannian Penrose inequality
•
m(t) 2−n
Ut = 1 + |x| + O(|x|1−n),
2
m(t) − 2m̃(t) 2−n
Vt = −1 + |x| + O(|x|1−n),
2
m̃(t) 2−n
Wt = 1 + |x| + O(|x|1−n).
2
• In the region |x| > r1 , Ut , Vt , and Wt are all (Euclidean) harmonic
functions. When t > t0 , they are harmonic in the region (M Br0 )
∗
Ωt .
This result was first proved by Bray in [Bra01] in the case where M is
spin. In that case, it follows from Witten’s spinor proof of the positive mass
theorem, and we provide the argument in Section 5.4.2. For the general
case, see [Lee09].
Lemma 4.68. Assume (M, gt , Ωt ) is a Bray flow as constructed in Theo-
rem 4.59, and assume that g0 has nonnegative scalar curvature and is har-
monically flat outside a compact set. Let m∞ = limt→∞ m(t). For any
4.3. Bray’s conformal flow 155
α > 1, the following limits hold uniformly over all |x| ≥ αr1 , where r1 is the
constant given in Lemma 4.65:
m∞ 2−n
lim Ut (x) = 1 + |x| ,
t→∞ 2
m∞ 2−n
lim Vt (x) = −1 + |x| ,
t→∞ 2
lim Wt (x) = 1.
t→∞
By Theorem 4.67 and Lemma 4.63, we know that given any α > 1,
Wt converges to 1 uniformly over |x| > αr1 , as t → ∞, establishing the
last equation of the lemma to be proved. Or in other words, Ŵt converges
to 0 uniformly over |x| ≥ αr1 , as t → ∞. Since Ŵt is harmonic of order
O(|x|1−n) and converging to 0, it follows that for any > 0, we can choose
t large enough so that for all |x| > 2αr1 ,
|Ŵt (x)| < C|x|1−n
for some constant C independent of . Analyzing equation (4.11), one may
conclude that for large enough t,
|Ût (x)| < 3C|x|1−n .
The first equation of the lemma follows from this, and the second equation
follows immediately from the other two.
Lemma 4.69. Assume all of the hypotheses of Lemma 4.68 and further
suppose that m∞ > 0. Let r∞ be the Schwarzschild radius (in conformal
1
coordinates) corresponding to m∞ . That is, r∞ = (m∞ /2) n−2 . Assume
r0 < r∞ . Then there is a subsequence of Σ∗t that converges to the coordinate
sphere Sr∞ in Hausdorff distance.
Sketch of the proof. Using the general compactness theory for sets of fi-
nite perimeter, we can extract a subsequence Ω∗ti that converges to some
Ω∞ (in the sense that their characteristic functions converge in L1 ). We
can use the outward-minimizing property to help us show that Σ∗ti actually
converges to Σ∞ := ∂Ω∞ in the Hausdorff sense in the region where |x| > r0 .
The main problem is to show that Σ∞ = Sr∞ .
∗
Since Vt is harmonic on (M Br0 )Ωt for large t and uniformly bounded
in t, we can choose a subsequence such that Vti converges uniformly on
compact subsets of (M Br0 ) Ω∞ . Since the limit must be harmonic, it
follows from Lemma 4.68 that the limit is V∞ (x) = −1 + m2∞ |x|2−n .
Suppose that part of Σ∞ lies within Sr∞ . Choose a point x0 lying outside
Σ∞ but inside Sr∞ . Then V∞ (x0 ) > 0, and so we can find ti large enough so
that Vti (x0 ) > 0 while x0 lies outside Σti . But this contradicts the definition
of Vti .
Now suppose that part of Σ∞ lies outside Sr∞ . Then for some x0 ∈ Σ∞
and some r > 0, the ball B2r (x0 ) lies completely outside Sr∞ . We outline
a proof for how to obtain a contradiction from this. We know that V∞ is
bounded above by some negative number in Br (x0 ). On the other hand, we
know that Vti is zero at Σ∗ (ti ), which cuts through Br (x0 ). The only way
that this can happen is if the gradient of Vti is blowing up. More precisely,
one can show that it blows up badly enough that the energy of Vti blows up
as i → ∞. (For details of this argument, see [BL09, Section 3].) However,
we can bound the energy of Vti independently of i. To see this, note that Vti
is the harmonic function that is equal to 0 at Σti and −1 at infinity. Since
Σ∗ti is contained in Sr1 (by Lemma 4.65), the energy-minimizing property of
harmonic functions shows that the energy of Vti is less than the energy of
the harmonic function that is equal to 0 at Sr1 and −1 at infinity.
Lemma 4.70. Assume all of the hypotheses of Lemma 4.68. Then m∞ > 0.
Sketch of the proof. Just as in the proof of the previous lemma, we ex-
tract a subsequence of Σti that converges to some Σ∞ in Hausdorff distance
in the region where |x| > r0 . Using the outward-minimizing property of Σti
and the fact that the volume A = |Σ∗ti |gt∗ is constant, we can argue that the
i
1
part of Σ∞ where |x| > r0 cannot be empty, so long as r0 < (A/ωn−1 ) n−1 .
See [BL09, Section 3] for details.
Once we know that there is a point in Σ∞ where |x| > r0 , we can use
the same energy estimate argument used in the previous lemma in order to
contradict the possibility that m∞ ≤ 0.
of Theorem 4.54 plus the assumption that the metric is harmonically flat
outside a compact set. Using Theorem 4.59, we construct a long-time Bray
flow with initial condition (M, g, ∅). By Lemma 4.70, the long-time limit
of the mass m∞ , which exists by Corollary 4.62, must be positive. Let r∞
be its corresponding Schwarzschild radius (in conformal coordinates). By
Lemma 4.60, A = |Σt |gt . Choose r0 < r∞ and use this r0 to define the
normalized Bray flow (M, gt∗ , Ω∗t ).
Let > 0. By Lemma 4.69, for large enough t, Σ∗t lies within the sphere
Sr∞ + . Since Σ∗t is outward-minimizing with respect to gt∗ , we can see that
for large enough t,
A = |Σ∗ti |gt∗
i
≤ |Sr∞ + |gti
2(n−1)
= Ut n−2
Sr∞ +
2(n−1)
m∞ 2−n n−2
≤ 1+ |x| + ,
Sr∞ + 2
n−1
which converges to ωn−1 (2m∞ ) n−2 as → 0. Finally, we know that m∞ ≤
mADM (g) by monotonicity of mass (Corollary 4.62). Thus
n−2
1 A n−1
(4.12) mADM (g) ≥ .
2 ωn−1
Next we consider the case of equality. Suppose that we have initial data
(M, Ω, g), not necessarily harmonically flat outside a compact set, such that
n−2
1 A n−1
mADM (g) = .
2 ωn−1
If we evolve the (M, Ω, g) by Bray flow, Lemma 4.61 states that
T
m(T ) = m(0) − 2 m̃(s) ds
0
for any T > 0. Since the volume must remain constant (Lemma 4.60) and
the Penrose inequality (4.12) continues to hold for each (M, Ωt , gt ), it follows
that T
m̃(s) ds ≤ 0.
0
Since m̃(s) ≥ 0 by Theorem 4.17, it follows that m̃(s) = 02for almost
every s in [0, T ]. From Lemma 4.60, we know that Ω0 = Ω+ 0 = s>0 Ωs . In
particular, this implies that ωs converges to ω0 as s → 0+ (where ωs is the
conformal factor used in the construction of g̃s in the proof of Lemma 4.61).
Since m̃(s) is picked up by the asymptotics of ωs , it follows that m̃(0) = 0.
158 4. The Riemannian Penrose inequality
By rigidity of the positive mass theorem, this means that g̃0 is Euclidean
space. Reversing the construction of g̃0 , this means that the doubled metric
(M 0 , ḡ0 ) is globally conformal to Euclidean space Rn {0}, where the con-
formal factor is harmonic. This is only possible if (M 0 , ḡ0 ) is a Schwarzschild
space, or equivalently if (M Ω0 , g) is half of a Schwarzschild space. Note
that once again, because of the nonsmoothness of g̃0 , we have to invoke the
rigidity of Theorem 4.17. (See also Remark 3.45 for dealing with the singular
point in (M 30 , g̃0 ).)
Chapter 5
Spin geometry
5.1. Background
In our presentation of spinors below, we will attempt to emphasize facts
that are most directly relevant to the computations that we will need to do
later on, while deemphasizing formalism and the conceptual side. Because
of this, our introduction to spinors will be quite brief and fairly shallow.
There are many excellent resources for a more thorough introduction to the
subject. Specifically, see the books [LM89] and [Har90].
159
160 5. Spin geometry
given by φ(p, v) = [s(p), v] for any (p, v) ∈ U × V , where the brackets rep-
resent the quotient map F × V −→ V (M ). We will refer to this map φ as
a local trivialization of the bundle V (M ). (However, note that the usual
definition of a trivialization of a vector bundle uses U × Rm , which is related
to ours simply by composing with an isomorphism V ∼ = Rm .) A transition
map t between two local trivializations of F gives rise to the transition map
ρ ◦ t for the corresponding trivializations of V (M ).
One can also use this formalism to generalize the way that the Levi-
Civita connection gives rise to connections on tensor bundles. Recall that
the concept of a connection on a vector bundle is equivalent to parallel
transport, and the concept of parallel transport generalizes nicely to princi-
pal G-bundles. For example, given the Levi-Civita connection on an oriented
Riemannian manifold M , we know what it means to parallel transport an
SO(n)-frame along a path in M . Although there is a general definition of
parallel transport (or connection) on a principal G-bundle, we will not need
it. Instead we will concretely explain how to use parallel transport in FSO
to directly define parallel transport in an associated bundle V (M ).
Let γ be a path in M starting at p. Choose a local SO(n)-frame field
s = (e1 , . . . , en ) over U , giving rise to the local trivialization π −1 (U ) −→
U × SO(n), under which s corresponds to the identity section of U × SO(n).
Define g(t) ∈ SO(n) so that (γ(t), g(t)) corresponds to the parallel trans-
port of s(p) in FSO along γ. Now select any v in V . We now define
[s(γ(t)), ρ(g(t))v] to be the parallel transport of [s(p), v] in V (M ) along γ.
In the local trivialization, we would simply write this as (γ(t), ρ(g(t))v) is
the parallel transport of v at p. One can show that this gives a well-defined
connection on V (M ). In words, after a choice of local SO(n)-frame, parallel
transport of any SO(n)-frame at a point along a curve can be thought of as
a parallel transport of an element of SO(n) along that curve, and then the
action of SO(n) on V tells us how to parallel transport an element of V (M )
along that curve.
This exercise together with the preceding proposition shows that Spin(n)
is a connected Lie group that double covers SO(n). In particular, they must
have the same dimension, and since π1 (SO(n)) = Z2 , it follows that Spin(n)
is simply connected.
We are interested in the derivative of ξ. Technically, this derivative is
an isomorphism between Lie algebras, but we are primarily interested in
calculating this map explicitly.
Exercise 5.7. Consider the map
Dξ : T1 Spin(n) −→ TId SO(n),
where T1 Spin(n) is regarded as a subspace of Cl(n) and TId SO(n) is re-
garded as a subspace of End(Rn ). Then for any i, j from 1 to n with i = j,
we have
Dξ(ei ej ) = 2Aji ,
where e1 , . . . , en is an orthonormal basis for Rn , and Aji is defined as in
Exercise 5.3. Hint: Consider an appropriate path in Spin(n) and explicitly
compute its image under ξ.
which arise from complex modules over Cl(n), are more prevalent, but they are not necessary for
our purposes.
5.1. Background 165
Exercise 5.8. Show that the above equation can be seen as a direct conse-
quence of (5.2) and the fact that vectors act as skew-symmetric operators
on S, by writing an arbitrary spinor as a linear combination of constant
spinors.
One can similarly show that the connection respects the module struc-
ture of S(M ) over Cl(M ) in the sense that for any v ∈ C ∞ (T M ) and
ψ ∈ C ∞ (S(M )),
∇(v · ψ) = (∇v) · ψ + v · (∇ψ).
n
Dψ = ei · ∇i ψ,
i=1
Exercise 5.9. Check that the Dirac operator D is well-defined in the sense
that the definition above is independent of choice of local orthonormal frame.
Also, show that D is formally self-adjoint.
1
D2 ψ = ∇∗ ∇ψ + Rψ,
4
2 This is not quite the same as the famous “Dirac equation” commonly used in quantum
where ReSi ,ej denotes the curvature of the spinor bundle S(M ). Let us com-
pute the operator RS . Since it is a zero-order operator, it suffices to com-
pute how it acts on a spinor ψ that is constant with respect to the frame,
for which (5.2) applies:
RS ψ = ∇(∇ψ)
⎛ ⎞
1 n
= ∇⎝ ωji ei ej · ψ ⎠
4
i,j=1
1
n
= ∇(ωji )ei ej · ψ + ωji ∧ ei ej · ∇ψ
4
i,j=1
1 n
1
n
= dωji ei ej · ψ + ωji ∧ ωk ei ej ek e · ψ.
4 16
i,j=1 i,j,k,=1
Therefore
where the second equality used reindexing and the third equality used anti-
symmetry. Therefore
1
n
S
R = (dω + ω ∧ ω)ij ei ej
4
i,j=1
1 n
= Riem(·, ·, ei , ej )ei ej ,
4
i,j=1
by (1.3).
Substituting this into our calculation of D2 ψ, we obtain
1
n
D2 ψ = ∇∗ ∇ψ − Rijk ei ej ek e ψ.
8
i,j,k,=1
The reason for this is that the expression ei ej ek is invariant under cyclic
permutations, while the first Bianchi identity tells us that the sum of the
cyclic permutations of Rijk in i, j, k vanishes. Explicitly, by reindexing, we
have
1
Rijk ei ej ek = Rijk ei ej ek + Rjki ej ek ei + Rkij ek ei ej
3
i,j,k distinct i,j,k distinct
1
= (Rijk + Rjki + Rkij )ei ej ek
3
i,j,k distinct
= 0.
Therefore, in the sum i,j,k, Rijk ei ej ek e , we are only concerned with
when i, j, k are not distinct. Note that i = j contributes nothing because of
antisymmetry of Rijk , so we need only worry about when i = k and when
j = k. (Note that the intersection i = j = k does not contribute anything.)
Therefore, using only symmetries of the curvature tensor and properties of
5.3. Witten’s proof of the positive mass theorem 169
n
n
−Rijk ei ej ek e = (−Rijj ei ej ej e − Riji ei ej ei e )
i,j,k,=1 i,j,=1
n
= (−Rjij ei e − Riji ej e )
i,j,=1
n
= −2Ri ei e
i,=1
= 2R,
Theorem 5.12 (Positive mass theorem for spin manifolds). Let (M, g) be a
complete asymptotically flat spin manifold with nonnegative scalar curvature.
Then the ADM mass of each end is nonnegative. Moreover, if the mass of
any end is zero, then (M, g) is Euclidean space.
where Li = (δij + ei ej ) · ∇j .
Proof. We will prove the second expression involving Li first, and that is
the version of the formula we will use later on. It is easy to see that it is
equal to the manifestly frame-independent expression above it. Adopting
Einstein summation notation, we compute
−|Dψ|2 = −ei · ∇i ψ, ej · ∇j ψ
= ∇i −ei · ψ, ej · ∇j ψ + ei · ψ, ∇i (ej · ∇j ψ)
= ∇i ψ, ei ej · ∇j ψ − ψ, ei · ∇i (ej · ∇j )ψ
= ∇i ψ, ei ej · ∇j ψ − ψ, D2 ψ.
Meanwhile,
|∇ψ|2 = ∇i ψ, ∇i ψ + ψ, ∇∗ ∇ψ
= ∇i ψ, δij ∇j ψ + ψ, ∇∗ ∇ψ.
Adding these two computations and combining them with the Schrödinger-
Lichnerowicz formula and the divergence theorem yields the desired result.
The idea behind Witten’s proof of the positive mass theorem is to find a
spinor that solves the Dirac equation while being asymptotically constant at
infinity, in which case the boundary term in Corollary 5.13 is proportional
to the mass, completing the theorem. First, let us see why that boundary
term gives us the mass.
Proposition 5.14. Let (M, g) be an asymptotically flat spin manifold, and
let e1 , . . . , en be an orthonormal frame in some end Mk . Let ψ0 be a constant
spinor with respect to this frame. Then
n
1
lim ψ0 , Li ψ0 ν i dμSρ = (n − 1)ωn−1 |ψ0 |2 mADM (Mk , g),
ρ→∞ S
ρ
2
i=1
where Sρ is a coordinate sphere in Mk .
that asymptotic flatness means that hij := gij − δij decays at a faster rate
than q = n−2 2 . For convenience we choose to use the letter q for the con-
stant n−2
2 rather than the actual assumed asymptotic decay rate. Therefore
hij = o2 (|x| ). (We say that a function f is o2 (|x|−q ) if for any > 0, we
−q
have |f | + |x| · |Df | + |x|2 | · |D 2 f | < |x|−q for sufficiently large |x|.) Direct
computation can be used to show that
1
ei = ∂i − hij ∂j + o1 (|x|−q ),
2
and thus
ωji (ek ) = ∇ek ej , ei
6 7
1
n
= ∇∂k ∂j + hj ∂ , ∂i + o(|x|−2q−1 )
2
(5.3) =1
1
= Γijk − hij,k + o(|x|−2q−1 )
2
1
= (gik,j − gjk,i ) + o(|x|−2q−1 ).
2
Note that this expression is antisymmetric in i and j, as it should be. Next,
compute
n
ψ0 , Li ψ0 ν i = ψ0 , ei ej ∇j ψ0 ν i
i=1 i=j
6 7
1 k
(5.4) = ψ0 , ei ej ω (ej )ek e · ψ0 νi
4
i=j k=
1
= ωk (ej )ψ0 , ei ej ek e · ψ0 ν i .
4
i=j
k=
Note that by (5.3), the sum of the expression ωji (ek ) over cyclic permuta-
tions of i, j, k vanishes modulo o(|x|−2q−1 ), that is, ωji (ek )+ωkj (ei )+ωik (ej ) =
o(|x|−2q−1 ). For i, j, k distinct, ei ej ek is invariant under cyclic permutation,
and therefore by the same argument used in the proof of Theorem 5.10,
ωk (ej )ej ek e = o(|x|−2q−1 ).
j,k, distinct
n
1 i
ψ0 , Li ψ0 ν i = ωj (ej )ψ0 , ei ej ei ej · ψ0
4
i=1 i=j
+ωij (ej )ψ0 , ei ej ej ei · ψ0 ν i + o(|x|−2q−1 )
1
= (−(gij,i − gjj,i ) + (gjj,i − gij,j ))|ψ0 |2 ν i + o(|x|−2q−1 )
8
i=j
1
= (gjj,i − gij,j )ν i + o(|x|−2q−1 ).
4
i,j
Since the integral of the o(|x|−2q−1 ) vanishes in the limit, the result follows.
n
1
lim ψ, Li ψν i dμSρ = (n − 1)ωn−1 |ψ0 |2 mADM (Mk , g),
ρ→∞ S
ρ i=1
2
The first term will give us what we want by Proposition 5.14. We want to
show that the other terms do not contribute. It is straightforward to see that
the integrals of the last two terms will not contribute in the limit because
of decay of ξ and ∇ψ0 . We will show that the integral of the ψ0 , Li ξ term
is the same as the integral of the ξ, Li ψ0 term, and therefore it must also
vanish in the limit. This essentially follows from integration by parts. Define
α to be the (n − 2)-form defined by
α= ψ0 , ei ej ξei ej dvolM ,
i=j
5.3. Witten’s proof of the positive mass theorem 173
Using the previous corollary, the positive mass theorem will follow from
being able to solve the following Dirac equation with prescribed asymptotics.
Proposition 5.16. Let (M n , g) be a complete asymptotically flat spin man-
ifold with nonnegative scalar curvature, and let q = n−2
2 . The operator
1,2
D : W−q (S(M )) −→ L2−q−1 (S(M ))
is an isomorphism. Note that L2−q−1 = L2 with this choice of q.
We now only have to prove surjectivity. That is, given any η ∈ L2 (S(M )),
1,2
we need to find a spinor ξ ∈ W−q (S(M )) solving Dξ = η. We first consider
the case where η is compactly supported. Our estimates above show that the
1,2
pairing ω, ϕH := Dω, DϕL2 is equivalent to the W−q Hilbert product of ω
and ϕ. Observe that the map ϕ → η, ϕL2 is a well-defined bounded linear
1,2
functional on W−q (S(M )). (Check this.) Applying the Riesz representation
theorem [Wik, Riesz representation theorem] to this functional and using
1,2
the equivalence between the H product and the W−q product, it follows
1,2
that there must exist some ω ∈ W−q (S(M )) with the property that
Dω, DϕL2 = η, ϕL2
1,2
for every ϕ ∈ W−q (S(M )). We claim that ξ = Dω is the desired solution.
We know that ξ ∈ L2 (S(M )). To prove better regularity, let ξj be a sequence
1,2 1,2
of W−q spinors converging to ξ in L2 . For any test function ϕ ∈ W−q , we
obtain
lim Dξj , ϕL2 = lim ξj , DϕL2 = ξ, DϕL2 = η, ϕL2 ,
j→∞ j→∞
where mADM (M, g) is the mass of the selected end (since the contributions
from the other ends will be zero). Noting that
Dψ = Dψ0 + Dξ = η − η = 0,
it follows that
2 1
(5.6) mADM (Mk , g) = |∇ψ| + Rψ
2 2
dμM ,
(n − 1)ωn−1 M 4
which is manifestly nonnegative if R ≥ 0.
In the spin case, we get a simple proof of rigidity of the positive mass
theorem. We now suppose that mADM (M, g) = 0. Then the above equation
implies that ψ is parallel everywhere, that is, ∇ψ = 0. Note that any choice
of constant spinor in the end Mk leads to the construction of a parallel spinor
that is asymptotic to it. In particular, for i = 1, . . . , n, we can construct a
spinor ψi asymptotic to ei · ψ0 in Mk such that ∇ψi = 0. Define Vi to be
the vector field with the property that
Vi , w = w · ψ, ψi
for any w ∈ Tp M at any point p ∈ M , where ψ is the original parallel spinor
we constructed that is asymptotic to ψ0 .
Exercise 5.17. Show that ∇Vi = 0 everywhere and Vi is asymptotic to ei
at infinity.
Proof. Assume there is only one end (since the proof is really no different in
the general case). We follow the proof of Theorem 5.12 given in the previous
section. Note that g being Lipschitz is enough regularity that we can still
1,2
construct a spinor ψ ∈ W−q asymptotic to a constant spinor ψ0 that solves
the Dirac equation Dψ = 0, where q = n−2 2 . (Recall that ψ0 is chosen to have
|ψ0 | = 1 at the infinity of the end we care about, and is zero at the other
infinities.) The main difference here is that the Schrödinger-Lichnerowicz
formula (Theorem 5.10),
1
D2 ψ = ∇∗ ∇ψ + Rψ,
4
is no longer valid at the singular set Σ. (For one thing, R is not defined
there.) However, away from Σ, everything is smooth, so that this formula is
still valid, and so is its integrated version with boundary (Corollary 5.13).
In the following we use a hat to denote quantities computed using gin
and no hat to denote quantities computed using gout :
(5.7)
1
0≤ |∇ψ| − |Dψ| + Rψ
2 2 2
dμM
M 4
1
= |∇ψ| − |Dψ| + Rψ
2 2 2
dμM
Mout 4
1
+ |∇ψ|2 − |Dψ|2 + Rψ 2 dμM
Min 4
1
= (n − 1)ωn−1 mADM (g) − ψ, (∇ν + ν · D)ψ dμ∂Mout
2 ∂Mout
+ ˆ ν + ν · D̂)ψ dμ∂M
ψ, (∇ in
∂Min
1 ˆ ν − ∇ν )ψ + ν · (D̂ − D)ψ dμΣ ,
= (n − 1)ωn−1 mADM (g) + ψ, (∇
2 Σ
where we used Corollary 5.13 on Mout and Min separately, and then Corol-
lary 5.15 to identify the boundary term at infinity with the mass, and in all
of the integrals, ν is the unit normal of Σ pointing toward Mout , which is
the same for both gout and gin .
In order to compute the integrand of the Σ integral above locally, we
choose a local frame near a point of Σ which is adapted to Σ. That is,
we choose e1 , . . . , en−1 to be a local frame for Σ, choose en = ν, and then
extend the whole frame e1 , . . . , en away from Σ by demanding ∇ν ei = 0. Let
ω be the connection 1-form ωji (ek ) = ∇k ej , ei with respect to this frame,
5.4. Related results 177
computed using gout , and we define ω̂ similarly, except using gin . By our
choice of frame, along Σ we have, for i, j, k = 1, . . . , n − 1,
ωji (en ) = ω̂ji (en ) = 0,
ωji (ek ) = ω̂ji (ek ),
ωni (ek ) = −ωin (ek ) = A(ek , ei ),
ω̂ni (ek ) = −ω̂in (ek ) = Â(ek , ei ),
where the second line is just the induced connection 1-form on Σ, which
is the same for both gout and gin . Choose a basis of spinors σA which is
constant with respect to this frame, and write ψ = A ψ A σA in that basis.
Since it is easy to see that ∇ˆ ν ψ = ∇ν ψ along Σ, we need only compute the
D̂ − D term in (5.7). Using formula (5.2) for ∇σA and the equations for ω
above, we compute
ν · (D̂ − D)ψ = ψ A ν · (D̂ − D)σA
A
n
= ψA ˆ j − ∇j )σA
νej · (∇
A j=1
1 A
n−1 n
= ψ νej [ω̂k (ej ) − ωk (ej )]ek e · σA
4
A j=1 k,=1
n−1
1 A n
n−1
= ψ [ω̂ (ej ) − ωn (ej )]νej νe
4
A j=1 =1
n−1
+ [ω̂n (ej ) − ωn (ej )]νej ek ν · σA
k k
k=1
1
n−1
= ψA [2ω̂kn (ej ) − 2ωkn (ej )]ej ek · σA
4
A j,k=1
1 A
n−1
= ψ [−Â(ej , ek ) + A(ej , ek )]ej ek · σA
2
A j,k=1
1 A
n−1
= ψ [Â(ej , ej ) − A(ej , ej )] · σA
2
A j
1
= (Hin − Hout )ψ.
2
Feeding this into (5.7), we obtain
1 1
0 ≤ (n − 1)ωn−1 mADM (g) + (Hin − Hout )|ψ|2 dμΣ ,
2 Σ 2
178 5. Spin geometry
so the nonnegativity of mass now follows from our assumption that Hout ≤
Hin . If we have mADM (M, g) = 0, then just as in the rigidity proof in the
smooth spin case, we obtain parallel spinors that can be used to construct
parallel vector fields, and since these spinors are nonvanishing, the above
inequality tells us that Hout = Hin .
5.4.2. A spinor proof of Theorem 4.67. We can now prove the stability-
type result for the positive mass theorem that we used in the Bray flow proof
of the Penrose inequality in Section 4.3 [Bra01, Corollary 8].
Theorem 5.19 (Bray). Given n ≥ 3, α > 1, and > 0, there exists δ > 0
with the following property.
Let (M n , g) be a complete asymptotically flat spin manifold of nonnega-
tive scalar curvature on M , with coordinates in some end satisfying
4
gij (x) = W (x) n−2 δij
for |x| > r, for some positive harmonic function W on Rn B̄r (0) approach-
ing 1 at infinity.
n−2
If mADM (g) < δrn−2 , then for all |x| ≥ αr, |W (x) − 1| < |x|
r
.
By standard elliptic
theory, for each fixed W ∈ H, we know that the
energy functional Rn B2α |∇ψ|2 dμg is minimized by the unique Neumann
solution ψ satisfying ∇∗ ∇ψ = 0 and ∇ν ψ = 0 at ∂B2α , in addition to
limx→∞ ψ(x) = σ. If we were to write out these equations in local coor-
dinates, we would see that all of the coefficients of this elliptic system can
be written explicitly in terms of W and its derivatives. Consequently, the
minimizer depends continuously on W ∈ H. This implies the claim above.
Claim. The restriction map from R : A −→ H has relatively compact image
in H.
Harnack inequality, this can be used to show that W can be bounded purely
in terms of m and α in the region Rn B̄α . (See [Lee09] for a proof.) Since
the definition of A includes the requirement that |m| ≤ 1, the second claim
follows.
Let
σ 0 be a constant spinor of unit length and define σi := ei · σ0 . Let
n
F := i=0 Fσi .
Claim. If W ∈ H and F (W ) = 0, then W must be the constant function 1.
Quasi-local mass
181
182 6. Quasi-local mass
admissible and observe that the space of admissible extensions does not ac-
tually depend on all of (Ω, g), but only on the induced metric and mean
curvature on its boundary. We can summarize the above discussion with
the following definition.
Remark 6.2. The case n = 3 is the case of greatest interest, but we will
present results in general dimension when possible. When n = 3, one typi-
cally assumes that Σ is a sphere.
Lemma 6.4. Let n < 8, and let (M n , g) be an extension of Bartnik data with
η ≡ 0 such that (M, g) satisfies condition (5) of Definition 6.1. Then any
other extension which is a sufficiently small smooth perturbation of (M, g)
also satisfies condition (5).
Sketch of the proof. Assume the contrary. That is, suppose that (M, g)
is an extension satisfying condition (5), but there is a sequence of extension
metrics gi converging to g, each of which violates condition (5). So in each
(M, gi ), we have an apparent horizon Σi enclosing ∂M . There exists a
radius r, independent of i, such that all coordinate spheres of radius larger
than r will be mean convex in (M, gi ). By the strong comparison principle
(Corollary 4.2), the coordinate sphere Sr must enclose Σi . The outward-
minimizing property of apparent horizons (Theorem 4.7) then implies that
|Σi |gi ≤ |Sr |gi . Therefore the volume of Σi is bounded independently of i.
Since the Σi are stable minimal hypersurfaces with a uniform volume
bound, we can apply Schoen-Simon estimates [SS81] as in the proof of The-
orem 4.7 in order to extract a limit minimal hypersurface in (M, g) enclos-
ing ∂M . Then by Theorem 4.7, there is an apparent horizon in (M, g) enclos-
ing ∂M , contradicting the assumption that (M, g) satisfies condition (5).
The positive mass theorem (or more precisely, Theorem 4.17) easily gives
us the following nonnegativity property of the Bartnik mass.
Proposition 6.5. Let n ≥ 3, and let (Ωn , g) be a compact Riemannian
manifold with nonempty boundary and nonnegative scalar curvature. Sup-
pose that Σ = ∂Ω is connected and has HΣ ≥ 0 (with respect to the normal
pointing out of Ω). Then mB (Σ, g|Σ , HΣ ) ≥ 0.
More generally, the same result holds if Σ is just one component of ∂Ω,
as long as the other components are minimal.
Along with proposing this quasi-local mass concept, Bartnik also con-
jectured (for n = 3) that if η > 0, then there always exists an admissible
extension of (Σ, γ, η) realizing the Bartnik mass [Bar89]. A recent discovery
of Michael Anderson and Jeffrey Jauregui showed that the conjecture fails
at the level of generality originally envisaged by Bartnik [Jau13]. Because
of this, we will state a fairly narrow version of Bartnik’s conjecture.
Conjecture 6.6 (Bartnik minimal mass extension problem). If (Σ2 , γ, η)
is Bartnik data with η > 0 such that γ has positive Gauss curvature, then
there exists a Bartnik minimizer.
Positive Gauss curvature is probably not the ideal assumption for this
conjecture, but it is a weak enough assumption that the conjecture is highly
nontrivial, while it is narrow enough to guarantee existence of admissible
184 6. Quasi-local mass
extensions (see the next section) and also to rule out the Anderson-Jauregui
counterexamples. Those counterexamples come from taking a flat closed
topological 3-ball (B, g) that can be immersed in Euclidean 3-space in such
a way that its interior is embedded, but there is a double point at the
boundary. Since (B, g) is very close to an embedded flat closed ball, they
are able to show (with some work) that the Bartnik mass (of the boundary
data) must be zero. If a Bartnik minimizer did exist, it would be a mass
zero admissible extension of (B, g) and therefore Euclidean. But they obtain
a contradiction by showing that this is impossible for such (B, g).
The power of Bartnik’s conjecture comes from the fact that a Bartnik
minimizer should be very special. For physical reasons, Bartnik also conjec-
tured that any Bartnik minimizer should be vacuum static. Mathematically,
this conjecture was also supported by a calculus of variations computation
which we will see a bit later.
Definition 6.7. A metric g defined in a region U is called vacuum static if
g is scalar-flat and there exists a nontrivial function f on U such that
Δf = 0,
Hess f = f Ric.
The function f is called a static potential. The pair (g, f ) can be referred to
as vacuum static initial data on U .
Proof. Exercise 6.9 shows that if DR|∗g (f ) = 0 in the weak sense, then f
also satisfies the elliptic equation Δf = − n−1
1
Rf weakly. Therefore we can
invoke elliptic regularity of weak solutions (Theorem A.5) in order to see
that f must be smooth. We will first prove that f has a smooth zero set by
showing that 0 is a regular value of f . Suppose, to the contrary, that there
exists a point x where we have both f (x) = 0 and df (x) = 0.
Let β be any geodesic starting at x, and note that the Hessian equation
of Exercise 6.9 implies that the composition F = f ◦ β satisfies
1
F (t) = Ric(β (t), β (t)) − R · F (t).
n−1
Then f (x) = 0 and df (x) = 0 tells us that F (0) = F (0) = 0. Consequently,
F (t) is identically zero. Since this argument works for any β, we see that f
vanishes in a neighborhood of x. This means that the set of all points where
f and df both vanish is an open set. Since it is obviously closed as well, this
means that f and df vanish identically everywhere, which contradicts the
nontriviality of f . Thus 0 is a regular value of f .
Observe that along the zero set Z, Hess f = f Ric − n−1 1
Rg = 0. In
particular, this implies that along Z, ∇f is a parallel normal vector, and
thus Z is totally geodesic.
Next we show that Rg is constant. To see this, we will take the divergence
of the equation DR|∗g (f ) = 0. Recall the Weitzenböck formula for a 1-form ω,
ΔH ω = ∇∗ ∇ω + Ric(ω , ·),
186 6. Quasi-local mass
Proof. First check that asymptotic flatness implies that DR|g indeed maps
between these two spaces. We claim that the image of DR|g has finite
codimension. To see this, we restrict DR|g to deformations of the metric of
2,p
the form vg for v ∈ W−q (M ), i.e., conformal deformations. We can compute
DR|g (vg) using Exercise 1.18 to see that it is a second-order elliptic operator
on v of the form in Assumption A.29, and consequently the image of this
operator has finite codimension in Lp−q−2 (M ) by Corollary A.42. Since the
image of the full map DR|g contains the image of this restriction, it too has
finite codimension.
Suppose DR|g is not surjective. Since the image is closed, this implies
that there exists a nonzero element f in the dual space of Lp−q−2 (M ) such
that DR|∗g (f ) = 0 in the weak sense on all of M . By Lemma 6.10, f is
smooth and Rg is constant. Since g is asymptotically flat, this constant
is zero, and thus Δf = 0. It is not hard to see that the dual space of
∗
Lp−q−2 (M ) is just Lpq+2−n (M ), where p∗ = p−1
p
. Since q < n − 2, we can
apply weighted elliptic regularity (Corollary A.34) to see that f decays at
6.2. Bartnik minimizers 187
infinity, and then the maximum principle implies that f vanishes identically,
which is a contradiction.
Proof. Assume the hypotheses of the lemma, and suppose that g is not
scalar-flat. Just as in the proof of positive mass rigidity (Theorem 3.19),
the idea is that we can choose a conformal factor u that reduces the scalar
curvature while maintaining nonnegativity. Since this has the effect of re-
ducing the mass, this will violate the minimizing property of g, giving us our
desired contradiction. The only difference now is that we have to account
for the presence of a boundary.
Following the proof of Theorem 3.19, we let ζ be a positive compactly
supported function such that ζRg is positive somewhere and then seek a
positive conformal factor u such that
4(n − 1)
− Δg u + ζRg u = 0,
n−2
188 6. Quasi-local mass
For a more explicit statement of Theorem 6.14, see [Cor00]. The nov-
elty of this theorem is that the deformation of the metric is compactly sup-
ported. Without that requirement, a global version of this sort of theorem is
6.2. Bartnik minimizers 189
a standard result for elliptic operators and follows from the inverse function
theorem (Theorem A.43). The reason why it is possible to localize in this
way is that the scalar curvature operator R, as an operator on the metric g,
is heavily overdetermined. Corvino realized that this could be exploited to
obtain localized estimates on the operator DR|∗g . We omit the proof of The-
orem 6.14, but we note that in addition to being used to prove Theorem 6.12,
it was also used to prove Theorem 3.51 [Cor00].
We can now loosely explain Corvino’s proof that a Bartnik minimizing
extension (M, g) of Bartnik data (Σ, γ, η) must be vacuum static.
argument, using the fact that a Bartnik minimizer minimizes ADM mass
2,α
over all admissible extensions in C−q (γ).1
The admissibility conditions include two inequalities Rg ≥ 0 and Hg ≤ η,
but Lemma 6.13 tells us that a Bartnik minimizer actually satisfies the
equalities Rg = 0 and Hg = η. Therefore, if we define the constraint space
: ;
2,α
C = g ∈ C−q (γ) g is scalar-flat, and Hg = η at ∂M ,
we can see that a Bartnik minimizer minimizes ADM mass over a small
neighborhood in C. (Lemma 6.4 guarantees that all metrics in a small
enough neighborhood are admissible.) As long as this constraint space C
is smooth, we should be able to invoke the method of Lagrange multipliers
(Theorem A.47). Therefore, a crucial needed result is the following.
Theorem 6.15 (Anderson-Jauregui [Jau13]). Let (M n , g) be a complete
asymptotically flat manifold with induced metric γ on ∂M . Let n−2 2 < q <
n − 2 be less than the asymptotic decay rate of g (as in Definition 3.5), and
let α ∈ (0, 1). If we think of the scalar curvature operator as a map
2,α 0,α
R : C−q (γ) −→ C−q−2 (M ),
and the mean curvature operator as a map
2,α
H : C−q (γ) −→ C 1,α (∂M ),
then at any smooth, asymptotically flat metric g, the map
2,α
(DR|g , DH|g ) : C−q,0 (T ∗ M T ∗ M ) −→ C−q−2
0,α
(M ) × C 1,α (∂M )
2,α
is surjective. Here, the 0-subscript in C−q,0 denotes vanishing at ∂M .
Surjectivity of the map DR|g in the case without boundary (for Sobolev
spaces) was proved in Corollary 6.11. Note that this surjectivity is pre-
cisely what is needed to invoke Lagrange multipliers (Theorem A.47). The
boundary introduces a highly nontrivial technical issue which was handled
by Anderson and Jauregui by taking an approach involving static spacetimes
(as defined in Section 7.1.3). Instead, we will provide a heuristic sketch that
gives a sense for why the theorem is reasonable.
The following exercise is another illustration of the relationship between
scalar curvature and mean curvature.
Exercise 6.16. Let g be a smooth, asymptotically flat metric on a manifold
M with boundary. Then for any smooth function v on M , and any compactly
supported ġ ∈ C0∞ (T ∗ M T ∗ M ),
∗
v · DR|g (ġ) dμg = DR|g (v), ġg dμg + v · DH|g (ġ) dμ∂M .
M M ∂M
1 Technically speaking, in the argument that follows, we are assuming that we have a smooth
2,α
Bartnik minimizer that minimizes over the class of C−q (γ) competitors.
6.2. Bartnik minimizers 191
dμSρ ,g
= lim [div ġ − d(tr ġ)](ν) − [divg ġ − d(trg ġ)](νg ) dμSρ ,ḡ
ρ→∞ S dμSρ ,ḡ
ρ
∗ 1
− DR|g (1) + Rg g, ġ dμg − DH|g (ġ) dμ∂M,g
2
M ∂M
1
=− DR|∗g (1) + Rg g, ġ dμg − DH|g (ġ) dμ∂M,g .
M 2 ∂M
The first equality follows from Exercise 6.16 together with a rewriting of the
boundary term at infinity. The second equality follows from the fact that
the Sρ integral can be shown to vanish in the limit.
(6.2) 2(n − 1)ωn−1 DH|g (ġ) = v · DR|g (ġ) dμg + u · DH|g (ġ) dμ∂M .
M ∂M
Once again, we will make the simplifying assumption that u and v can be
represented by functions. Then Exercise 6.16 tells us that
∗
2(n − 1)ωn−1 DH|g (ġ) = DR|g (v), ġg dμg + (u + v) · DH|g (ġ) dμ∂M .
M ∂M
Combining this with our computation of DH|g (ġ) in (6.1), we have
∗
(6.3) 0 = DR|g (v + 1), ġg dμg + (u + v + 1) · DH|g (ġ) dμ∂M .
M ∂M
Since this must vanish for all ġ, it follows that DR|∗g (v + 1) = 0 in the
interior of M . By Lemma 6.10, v is smooth, and since it is in the dual
0,α
space of C−q−2 (M ), it lies in L1q+2−n . So by weighted elliptic regularity
(Corollary A.34), its limit at infinity is 0. Taking f := v + 1, we obtain the
desired static potential whose limit is 1 at infinity (in an integral sense at
least). Note that equation (6.3) also implies that u = −f at ∂M .
All that remains is to show that f is positive. Suppose that f < 0
somewhere in M . Since f is harmonic, the maximum principle implies that
f < 0 on ∂M . Let q be a nontrivial, nonpositive function on ∂M supported
where f < 0. By Theorem 6.15, there exists a first-order variation ġ such
that DR|g (ġ) = 0 and DH|g (ġ) = q. For this choice of ġ, equation (6.2)
tells us
2(n − 1)ωn−1 DH|g (ġ) = uq dμ∂M = −f q dμ∂M < 0.
∂M ∂M
See also [GL94] for the case of nonnegative Gauss curvature. In higher
dimensions, the analogous problem is heavily overdetermined (even local so-
lutions need not exist), and one generally does not expect to find such em-
beddings except under exceptional circumstances (see [LW99]). However,
we will keep our discussion of Theorem 6.18 in general dimension because
nothing about the proof is specific to three dimensions.
Actually, it is not the mere existence of the extension in Theorem 6.18
that is important to us, but rather the specific construction used. So al-
though we will omit the proof, we will outline the construction: by hypothe-
sis, (Σ, g) is isometric to some strictly convex hypersurface S0 in Rn . Recall
that if we flow outward at unit speed in the normal direction from a strictly
convex hypersurface in Rn , we obtain a smooth flow for all time, and strict
convexity is preserved. Let Sρ be the result of flowing S0 for time ρ. (In
particular, there is a natural diffeomorphism from S0 to Sρ .) Or equiva-
lently, Sρ is the hypersurface of distance ρ away from S0 , on the outside.
Let Ω be the region enclosed by S0 in Rn so that ∂Ω = S0 . The fact that
S0 is convex implies that the family Sρ smoothly foliates all of Rn Ω and
hence the Euclidean metric on Rn Ω can be expressed as dρ2 + hρ via the
diffeomorphism Rn Ω ≈ [0, ∞) × Σ, where hρ is the induced metric on Sρ ,
pulled back to Σ. We consider metrics of the form
g = u2 dρ2 + hρ ,
6.3. Brown-York mass 195
It is not hard to see that Sρ is close to a standard large round sphere for
large ρ. If one makes this statement precise, it then follows fairly easily from
the lemma that g = u2 dρ2 + hρ is asymptotically flat, and mADM (g) = m.
Since Hu = u−1 H > 0, we know that [0, ∞) × Σ is foliated by hypersurfaces
which are strictly mean convex with respect to g. In particular, Corollary 4.2
implies that S0 is not enclosed by any horizons in ([0, ∞)×Σ, g). Combining
all of this information, we see that ([0, ∞) × Σ, g) is an admissible extension
196 6. Quasi-local mass
1
= lim H(1 − u−1 ) dμgρ
ρ→∞ (n − 1)ωn−1 S
ρ
1
= lim (H − Hu ) dμgρ ,
ρ→∞ (n − 1)ωn−1 S
ρ
where the third equality follows from the fact that the sphere Sρ is close to
a standard large round sphere for large ρ. We choose to write the mass in
this way because the integral above is monotone in ρ.
Lemma 6.21 (Shi-Tam monotonicity [ST02]). Given the setup above, the
quantity
(H − Hu ) dμgρ
Sρ
1
(H − Hu ) dμg0 ≥ mADM (g).
(n − 1)ωn−1 S0
Proof. Observe that the traced Gauss equation (Corollary 2.7) in Euclidean
space tells us that
2
RSρ = H − |A|2 .
6.3. Brown-York mass 197
Definition 6.22. Let (Σ2 , γ, η) be Bartnik data with positive Gauss curva-
ture. By Theorem 6.19, (Σ, γ) is isometric to some strictly convex surface
Φ(Σ) in Euclidean R3 . We define the Brown-York mass of (Σ, γ, η) to be
1
mBY (Σ, γ, η) = (H − η) dμγ ,
8π Σ
Note that the uniqueness part of Theorem 6.19 guarantees that the
Brown-York mass is well-defined. Our discussion in this section leads up
to the following theorem, which is essentially due to Shi and Tam [ST02].
198 6. Quasi-local mass
Po-Ning Chen, Mu-Tao Wang, and S.-T. Yau, among others. See the survey
paper [Wan15] for an introduction to the topic.
f = 1 + A|x|2−n + O2 (|x|2−n−γ )
for some constant A and some γ > 0. We claim that A = −m, where
m is the ADM mass of g. To see this, we use our formula for mass from
200 6. Quasi-local mass
Theorem 3.14 together with the vacuum static equations (Definition 6.7).
−1
m = lim |x| · Ric(ν, ν) dμSr
r→∞ (n − 1)(n − 2)ωn−1 S
r
−1 ∇ν ∇ν f
= lim |x| · dμSr
r→∞ (n − 1)(n − 2)ωn−1 S f
r
−1
= lim A|x|1−n dμSr
r→∞ ωn−1 S
r
= −A.
Just as we did for our proof of Lemma 4.61, consider the two-ended
manifold (M , ḡ) obtained by taking (M, g) and gluing it to an isometric
copy (M , g ) of itself along ∂M . That is, (M , ḡ) is the “double” of (M, g),
obtained by reflecting it through its boundary. Since ∂M is totally geodesic
in (M, g), it follows that we can choose coordinates so that ḡ is actually C 2
across the hypersurface where the gluing occurred. Let ω be the ḡ-harmonic
function that approaches 1 at the infinity of the original M and approaches
0 at the infinity of the new end M . We use ω to conformally close the 0-end
4
by considering the metric g̃ = ω n−2 ḡ on M . The result is a new one-ended
asymptotically flat manifold (M 3 = M ∪ {pt}, g̃). Note that ḡ is scalar-flat
and ω is ḡ-harmonic.
Moreover, by symmetry, we can compute ω directly in terms of f . If
Φ : M −→ M is the name of the aforementioned isometry, then we must
have ω = 12 (1 + f ) on M and ω = 12 (1 − f ◦ Φ) on M , since these formulas
give us a ḡ-harmonic function that satisfies the boundary conditions at the
2 |x|
two infinities. In particular, ω = 1 − m 2−n + O (|x|1−n ) in the M end, so
2
by Exercise 3.13, it follows that mADM (g̃) = mADM (g) − m = 0.
By rigidity of the positive mass theorem (Theorem 3.19), it follows that
3
(M , g̃) must be Euclidean space. (See Remark 3.45 regarding the singular
3, g̃).) Reversing this construction, (M , ḡ) is scalar-flat and glob-
point in (M
ally conformal to Rn {0} via a harmonic function. Since we know what
all of those harmonic functions are, it follows that (M , ḡ) is Schwarzschild
of mass m, and (M, g) is one half of the Schwarzschild space of mass m.
1
Thus there exists a diffeomorphism M ≈ [(2m) n−2 , ∞) × S n−1 such that
g = (1 − r2m −1 dr 2 + r 2 dΩ2 . Finally, by Exercise 6.8, we must have
n−2 )
)
f = 1 − r2m n−2 since it is the unique g-harmonic function approaching 1
1
at infinity and vanishing at r = (2m) n−2 .
Proposition 6.27. Let n < 8. Given Bartnik data (Σn−1 , γ, 0), we have
n−2
1 |Σ| n−1
mB (Σ, γ, 0) ≥ .
2 ωn−1
Moreover, if (Σ, γ) is a round sphere, then we have equality, and the Schwarzs-
n−2
child space of mass 12 ω|Σ|
n−1
n−1
is the unique Bartnik minimizing extension.
where the second inequality is the stability inequality, so that the only rel-
evant case that we are excluding is the borderline case λ1 (−Δγ + K) = 0.
As discussed earlier, we know that if (Σ2 , γ) is not round, then there
can be no Bartnik minimizing extension, so instead the theorem is proved
by constructing a minimizing sequence of extensions. It is easy to see that
the previous theorem follows immediately from the following construction,
which is itself interesting.
Proposition 6.29 (Mantoulidis-Schoen [MS15]). Given any sphere (S 2 , γ)
) the surface {r} × Σ in the space
with λ1 (−Δγ + K) > 0, let Σr denote
|Σ|
M := [0, ∞) × Σ. Then for any m > 16π , there exists a metric g on M
such that
(1) Rg ≥ 0,
(2) Σ0 is minimal in M and its induced metric is isometric to γ,
(3) there exists r0 such that over (r0 , ∞)×Σ, g is identically equal to the
Schwarzschlild metric with mass m (with r as a radial coordinate
for Schwarzschild),
(4) for r > 0, Σr has positive mean curvature.
Introduction to general
relativity
207
208 7. Introduction to general relativity
For massive objects, this curve is always timelike, and for massless objects
(such as photons), the curve is always null (meaning that the object is
traveling at the speed of light). Because of this, we use the word causal to
describe vectors or curves that can be timelike or null. Essentially, something
that happens at a certain point in M can affect another point in M only if
they can be joined by a causal curve. We would like to think of one of those
points as being in the future of the other. In order to do this, we must choose
which side of the null cone at each p ∈ M is the future and which is the past.
If such a choice can be made smoothly consistently over all of M, that choice
is called a time-orientation of (M, g). Note that a choice of global timelike
vector field on M determines a time orientation. Technically, choosing a
time-orientation is equivalent to reducing the structure group from O(1, n)
down to O+ (1, n).
Given a time orientation, we can separate the nontrivial timelike and null
vectors into those that are future pointing and those that are past pointing.
Given a point p ∈ M, we define J + (p) (respectively, J − (p)) to be the causal
future (respectively, causal past) of p, meaning the set of all points that can
be reached from p by future-pointing (respectively, past-pointing) causal
curves. The collection of sets J + (p) and J − (p) can be referred to as the
causal structure of a spacetime (M, g). We also define I + (p) (respectively,
I − (p)) to be the chronological future (respectively, chronological past) of
212 7. Introduction to general relativity
p, meaning the set of all points that can be reached from p by future-
pointing (respectively,
% past-pointing) timelike
% curves. Given a set S, we
define J ± (S) = p∈S J ± (p) and I ± (S) = p∈S I ± (p). As a convention, we
consider p ∈ J ± (p) but p ∈
/ I ± (p).
It turns out that the causal structure of a spacetime is equivalent to
the conformal structure together with the time orientation. To see this,
note that the information contained in the causal structure is equivalent to
knowledge of what the time-oriented null cones are at each point. Next, a
simple argument (as in [HE73, p. 61], for example) shows that at any point
p ∈ M the null cone in Tp M determines the Lorentzian metric on Tp M up
to a constant (and vice versa, of course).
Given a causal curve γ : [a, b] −→ M, we can define its length to be
b
L(γ) = −g(γ (t), γ (t)) dt.
a
For a timelike curve, the length measures proper time, which is the amount
of time that the object experiences, or, in other words, how much time has
passed according to a clock traveling with the object. A timelike curve can
be parameterized by proper time. A test particle moving under the influence
of gravity and no other forces will travel along a geodesic (timelike if it is
massive and null if it is massless). Note that the Levi-Civita connection
and geodesics in a Lorentzian manifold are defined in the exact same way
as in Riemannian manifolds. (Raising and lowering of indices is also defined
the same way.) However, in contrast to Riemannian geometry, a timelike
geodesic locally maximizes length if and only if it is free of conjugate points.
In the rest of this subsection we study the existence of causal geodesics.
Lemma 7.3. Given a spacetime (M, g), if γ is a causal curve from p to
q in M that is not null everywhere, then γ can be deformed to a timelike
curve from p to q.
We omit the proof, but note that it ultimately rests on the fact that it
holds in small balls covering the curve.
Proposition 7.4. Given a spacetime (M, g), let p, q ∈ M such that q ∈
J + (p) I + (p). Then there exists a future null geodesic from p to q.
Idea of the proof. Observe that Lemma 7.3 implies that if q ∈ J + (p)
I + (p), then they are joined by a null curve. Suppose that null curve joining
p to q is not geodesic. In this case we can fairly explicitly deform the curve
to a causal curve from p to q that is timelike somewhere. Think about the
situation in Minkowski space in order to understand the intuition behind
this claim. (If a null curve is not geodesic, then it must “spiral” as it moves
7.1. Spacetime geometry 213
upward in time.) See [O’N83, Proposition 46, HE73, Proposition 4.5.10] for
details. But if there is a causal curve from p to q that is timelike somewhere,
then Lemma 7.3 says that q ∈ I + (p), which is a contradiction.
Definition 7.5. Given a spacetime (M, g), we say that a smooth hypersur-
face M is a Cauchy hypersurface if every inextendible causal curve must pass
through M exactly once. A spacetime that contains a Cauchy hypersurface
is called globally hyperbolic.
We omit the proof, but as one might expect, it bears some similarity to
the proof that there exists a length-minimizing geodesic connecting any two
points in a Riemannian manifold. See [Gal14] or [HE73, Section 6.7] for
details. The global hyperbolicity is used because it allows us to execute the
needed compactness argument, though this is not at all obvious from the
way we defined global hyperbolicity. (There is another common definition
of global hyperbolicity that is known to be equivalent to ours [Ger70].)
214 7. Introduction to general relativity
See [Gal14, HE73, Section 6.6] for details. Observe that one simple
consequence of the above proposition is that globally hyperbolic spacetimes
have the desirable property that there are no closed causal curves (which
would violate the physical concept of “causality”).
observer v. Given this tensor T , the spacetime metric g must satisfy the
Einstein (field) equations,
1
Ric − Rg = (n − 1)ωn−1 T,
2
where the Riemann, Ricci, and scalar curvatures of a Lorentzian metric g
are defined exactly the same as for a Riemannian metric.3 Or we could
simply write G = (n − 1)ωn−1 T , where G is the Einstein tensor of g.
An important generalization of the Einstein equations is the Einstein
field equations with cosmological constant Λ,
1
Ric − Rg + Λg = (n − 1)ωn−1 T,
2
where Λ is a constant. In this book, we will focus on the case Λ = 0.
7.2.1. Motivation for the Einstein equations. We will first give some
heuristic physical reasoning to motivate the Einstein equations. (Our discus-
sion here loosely follows that of [Ger13].) We have decided that in general
relativity, we want test particles to follow geodesics. We know that cur-
vature shows up when we look at variations of geodesics. That is, if γs (t)
∂γ
is a family of geodesics through γ0 = γ and X = ∂s is its first-order
s=0
variation, then the Jacobi equation [Wik, Jacobi field] states that
(7.2) X , Y = −Riem(X, γ , Y, γ ),
where Y is any other vector field along γ, and the primes denote (covariant)
differentiation in the t-variable.
Let us compare this to what happens in Newtonian gravity (in the n = 3
case). Recall (from Section 3.1.3 or otherwise) that in Newtonian gravity,
there is a gravitational potential function V : R3 −→ R satisfying ΔV =
4πρ, where ρ represents the mass density of all sources. The acceleration of
any test particle is −∇V . Therefore, if we imagine a family of Newtonian
trajectories γs (t) through γ0 = γ with first-order variation X = ∂γ ∂s s=0 ,
then it is straightforward to derive that
X · Y = −∇Y ∇X V,
where Y is any vector field along γ and the left side is just dot product. In
Newtonian gravity, we know that the trace of the right side of the above
equation (over the X and Y slots) is −4πρ. If we take the trace of equa-
tion (7.2) (noting that the full trace over X and Y will give the same result as
3 When n = 3, the constant (n − 1)ω
n−1 is just 8π. For n > 3, there does not seem to
be a universally accepted convention for what this constant should be, but ours is chosen to be
consistent with our earlier definition of ADM mass (Definition 3.9). Unfortunately, Exercise 7.9
suggests a different convention based on physics, but we have decided not to use this convention
because it would require an extra factor of (n − 2) in many of our formulas.
216 7. Introduction to general relativity
tracing over the spatial directions orthogonal to γ ), we see that the quantity
Ric(γ , γ ) should correspond to 4πρ.
Of course, Ric(γ , γ ) depends on the observer γ while 4πρ does not.
This dependence on observer is where the stress-energy tensor T comes
in. As described above, the energy density of the sources, as seen by the
observer γ , is T (γ , γ ), suggesting that ρ should be identified with T (γ , γ ).
However, one could also reasonably identify ρ with − tr T in the Newtonian
limit. Compromising between these two candidates for a timelike observer
suggests that we take
Ric = 4π(rT + (1 − r)(tr T )g)
for some constant r. From a physical perspective, we would like T to be a
conserved quantity, that is, div T = 0.
Exercise 7.9. Show that, in order for div T = 0 to be consistent with the
contracted second Bianchi identity (Exercise 1.10), we should choose r = 2,
and hence
1
Ric − Rg = 8πT.
2
7.2.2. Lagrangian formulation and matter models. The simplest case
of the Einstein equations is the vacuum case when T is identically zero. This
means that there are no sources at all for the gravitational field. Minkowski
space is our basic model of empty space (with Λ = 0), but one of the most
important features of general relativity is that there is an incredibly rich
theory even in the vacuum case.
The vacuum Einstein field equations can also be derived from an action
principle. The Einstein-Hilbert action is the functional
S[g] = R dμg .
M
(Note that for a Lorentzian
√ metric, the expression
√ for dμg with respect to a
local frame involves − det g instead of det g, since the det g will be neg-
ative.) It is easy to see from a formal computation that the Euler-Lagrange
equations for the Einstein-Hilbert action are precisely the vacuum Einstein
field equations. That is, the variation of S at g in the direction of every
compactly supported variation of g vanishes if and only if G = 0. (Check
that this follows immediately from Exercise 1.18 and Proposition 1.3.) The
Einstein field equations with cosmological constant Λ arise from using the
action S[g] = M (R − 2Λ) dμg .
Finally, observe that the vacuum Einstein equations G = 0 imply that
R = 0, and thus the vacuum Einstein equations can be rewritten as
Ric = 0.
7.2. The Einstein field equations 217
strictly positive. Note that this explains why we used the words vacuum
static in Definition 6.7.
By the exercise above, we just need to find N (r) such that Hessg N =
N Ricg , where g = V (r)−1 dr2 + r2 dΩ2 . In particular, Δg N = 0. Writing this
out as√a second-order ODE for N (r), it is fairly straightforward to see that
N = V is a solution. (It is also the unique one with the property that g
approaches Minkowski space as r → ∞.) Thus we have determined what N
and V must be.
Exercise 7.11. Use Proposition 1.13 to check that with this choice of N
and V , we have Hessg N = N Ricg .
and v are null coordinates in the sense that ∂u and ∂v are null (when put
together with coordinates on S n−1 to create a coordinate system).
Exercise 7.12. For n = 3, solve for r∗ explicitly. Let U = −e−u/4m and V =
ev/4m . Show that we
can rchoose
r/2mthe integration constant in the definition
of r∗ so that U V = 1 − 2m e , and show that the Schwarzschild metric
becomes
32m3 −r/2m
gm = − e dU dV + r2 dΩ2 ,
r
where we now regard r as the function of U and V implicitly defined above.
Observe that the original region r > r0 corresponds to U < 0 and V > 0 in
the new coordinates. The coordinates U and V are called Kruskal-Szekeres
coordinates.
Note that in Kruskal-Szekeres coordinates, the metric is not singular
when U = 0 or V = 0. However, one can show that there really is a singu-
larity at r = 0 (because the curvature blows up there), which corresponds to
where U V = 1. This allows us to naturally extend the Schwarzschild metric
to the product {(U, V ) ∈ R2 | U V < 1} × S 2 . See Figure 7.2. We will refer to
this vacuum spacetime as the Schwarzschild spacetime of mass m, though it
is sometimes called the Kruskal extension of Schwarzschild or the Kruskal-
Szekeres spacetime in the literature. This spacetime can be thought of as a
“maximal extension” of the metric (7.4) that was defined in the region r > r0
(region I in Figure 7.2), in the sense that it is a simply connected vacuum
extension such that every geodesic can either be extended to a complete
geodesic, or else it hits the singularity at U V = 1.
The construction generalizes to higher dimensions, with the main dif-
ference being that one no longer has ∗ However, if
for r .
a simple formula
−(n−2)u (n−2)v
one defines U = − exp 2(2m) 1/(n−2) and V = exp 2(2m) 1/(n−2) , then the
extension works out in essentially the same way. See [Chr15, Remark 1.2.6]
for details.
Observe that the region where U > 0 and V < 0 (region III in Figure 7.2)
is just an isometric copy of the “original” region where U < 0 and V > 0,
and thus the Schwarzschild spacetime should be thought of as having two
“asymptotically flat” ends since these two regions resemble Minkowski space
for large r. In particular, any negative constant U/V slice (including the
sphere at U = V = 0) of the Schwarzschild spacetime (which extends a
constant t slice of the original r > r0 region) is precisely the two-ended
Riemannian Schwarzschild space described in Chapter 3. The metric in the
regions where U > 0 and V > 0 (region II), and where U < 0 and V < 0
(region IV), which are isometric to each other, can be identified with the
metric (7.4) in the region 0 < r < r0 , where the t coordinate becomes
spacelike and the r coordinate becomes timelike.
220 7. Introduction to general relativity
U V
r= UV=1
r ∞
0 t=
t=
− r0
∞ r=
t=0
II
t=0 t=0
III I
IV
r=
t=0
r
0
∞
t=
t=
−
∞
0
r
r=
UV=1
matter field, but many physical models will naturally satisfy the dominant
energy condition. This discussion motivates the following definition.
Definition 7.16. An initial data set (M n , g, k) is a Riemannian manifold
(M, g) equipped with a symmetric (0, 2)-tensor k. We define
1
μ := Rg + (trg k)2 − |k|2g ,
2
J := (divg k) − ∇(trg k).
The quantity μ is called the energy density while J is called the current
density. Here we have defined J to be a vector quantity rather than a 1-
form as some other texts do. (This is why we use the raising operator .
Also note that for n = 3, our definition of energy density μ differs from the
Newtonian mass density ρ described earlier by a factor of 8π.) Together,
these equations are known as the (Einstein) constraint equations, and we will
refer to (μ, J) as the constraints of (g, k). They are called the constraints
because if one is given a stress-energy tensor T , this determines the pair
(μ, J), which constrains (but does not determine) the initial data (g, k)
according to the equations in Definition 7.16. In particular, in a vacuum
spacetime, T = 0, and consequently μ and J both vanish. More generally,
any initial data set with vanishing μ and J is said to satisfy the vacuum
(Einstein) constraint equations, that is, the same conditions appearing in the
hypotheses of Theorem 7.13. We say that (M, g, k) satisfies the dominant
energy condition (or DEC) whenever μ ≥ |J|g everywhere. We say that the
strict DEC holds if μ > |J|g everywhere.
We can emphasize the fact that J is a divergence by writing
J = divg π,
where π is the symmetric (2, 0) tensor defined by
π ij := k ij − (trg k)g ij ,
where the indices on k have been raised using g. Note that π contains the
same information as k since we can invert the relationship via
1
kij = πij − (trg π)gij .
n−1
We may sometimes (abusively) refer to the triple (M, g, π) as an initial data
set when the meaning is clear.
We say that (M, g, k) sits inside a spacetime (M, g) if M embeds into
M in such a way that g induces g, and k is the second fundamental form of
the embedding.
In light of Theorems 7.13 and 7.14, we see that an initial data set
(M, g, k) solving the vacuum constraints is the appropriate Cauchy data
224 7. Introduction to general relativity
for solving the vacuum Einstein equations. Building on the work of Theo-
rems 7.13 and 7.14, similar theorems can be established for Einstein equa-
tions with various matter fields. For a thorough discussion of these various
Cauchy problems, see the book [CB09].
The study of the Einstein equations using methods of hyperbolic PDE
is an extensive field of current research, but it is not the focus of this book.
However, we will mention a couple of the large, motivating problems in
the field. In four spacetime dimensions, it is conjectured that the Kerr
solutions are the only vacuum stationary spacetimes. (The analogous state-
ment for the Myers-Perry solutions in higher dimensions is known to be
false.) In fact, this conjecture is very close to being a known fact: S. Hawk-
ing [HE73] showed that any analytic vacuum stationary spacetime must
be axisymmetric, and then work of Brandon Carter [Car73] and David C.
Robinson [Rob75] shows that an axisymmetric vacuum stationary space-
time must either be static, or else it lies in the Kerr family. But if it is static,
then one can show that it must correspond, via Exercise 7.10, to a vacuum
static asymptotically flat manifold with minimal boundary. Theorem 6.25
then implies that it must be Schwarzschild, which is a special case of Kerr.
A mathematically rigorous version of this overall argument, drawing
on work of various contributors, appears in a paper by P. Chruściel and
J. Costa [CC08] (see also references cited therein). The analyticity assump-
tion is nearly removed by work of S. Alexakis, A. Ionescu, and S. Kleiner-
man [AIK10], which is strong enough to prove the uniqueness result for
small perturbations of Kerr. The general topic of black hole uniqueness the-
orems has grown in a number of directions. See [Rob09] for a survey of
developments.
The uniqueness of Kerr leads to a far more ambitious “final state con-
jecture” that all vacuum solutions of the Einstein equations should settle
down to a Kerr solution in the long-time limit. A more tractable piece of
this conjecture is simply the conjecture that the Kerr family is stable in the
sense that initial data that is a small perturbation away from Kerr initial
data will asymptotically settle down to a (possibly different) Kerr solution
in the long-time limit. This is a highly active area of research that was set
into motion by D. Christodoulou and S. Klainerman’s pioneering proof of
the stability of Minkowski space [CK93]. (H. Lindblad and I. Rodnianski
later gave an alternative proof under stronger hypotheses [LR10].) Lydia
Bieri extended the Christodoulou-Klainerman result by relaxing assump-
tions on both the regularity and decay [Bie09]. These results and questions
have natural analogs for the Einstein-Maxwell equations. Specifically, it is
also hoped that the Kerr-Newman family is stable and more generally that
any electrovacuum solution of the Einstein equations must settle down to a
7.3. The Einstein constraint equations 225
7.3.2. Asymptotically flat initial data sets. In this book we are mainly
interested in initial data sets. There is extensive literature on constructing
initial data sets solving the Einstein constraint equations (chiefly the con-
formal method ), but that is not our focus. We are primarily concerned with
general properties of asymptotically flat initial data sets that satisfy the
dominant energy condition.
Φk : Mk −→ Rn B̄1 (0),
where B̄1 (0) is the standard closed unit ball, such that if we think of each Φk
as a coordinate chart with coordinates x1 , . . . , xn , then in that coordinate
chart (which we will often call the asymptotically flat coordinate chart or
sometimes the exterior coordinate chart), we have
n−2
for some q > 2 . Moreover, we also require that μ and J are integrable
over M .
The case when k is identically zero is called the time-symmetric (or Rie-
mannian) case, and in this case the definition above reduces to the statement
that (M, g) is an asymptotically flat manifold. We reiterate that in the lit-
erature, the precise definition of asymptotic flatness can vary from paper to
paper.
be
n
1
E = lim (gij,i − gii,j )ν j dμSρ ,
ρ→∞ 2(n − 1)ωn−1 S
ρ i,j=1
n
1
Pi = lim (kij − (trg k)gij )ν j dμSρ
ρ→∞ (n − 1)ωn−1 S
ρ j=1
for i = 1, . . . , n, where the right sides are calculated in the asymptotically
flat coordinates of the chosen end, and barred quantities are calculated using
the Euclidean metric in the end. If (E, √P ) is future causal, then the ADM
mass of that end is defined to be m = E 2 − P 2 . Observe that we can also
write
n
1
Pi = lim πij ν j dμSρ for i = 1, . . . , n.
ρ→∞ (n − 1)ωn−1 S
ρ
j=1
Lan-Hsuan Huang [Hua09] showed that the center of mass can also be
written as
1
C = lim G(Z , ν) dμSρ ,
ρ→∞ 2(n − 1)(n − 2)ωn−1 E S
ρ
n
where G is the Einstein tensor of g and Z = i=1 (|x|2 δ i − 2x xi )∂i gen-
singularity
I+ black hole
I+ I+
II
d.o.c. d.o.c.
i0 i0 i0
III I
I− I− white hole I−
IV
singularity
Figure 7.3. Penrose diagrams for the Minkowski spacetime (left) and
the Schwarzschild spacetime (right).
For this reason and others, we will be interested in studying the ge-
ometry of null hypersurfaces. In general, it is important to consider null
hypersurfaces of low regularity, but for our discussion here we will focus on
the smooth case. Let H be a null hypersurface, and let be a nonvanishing
future-pointing null normal vector field. This is equivalent to asking for to
be a nonvanishing future-pointing null tangent vector field. Since there is no
natural normalization for null vectors, is only determined up to multipli-
cation by a positive function. That is, only its direction is naturally defined.
Consequently, the integral curves of are defined independently of choice
of , and in fact, after reparameterization, these integral curves become null
geodesics. To see why, let p ∈ H and X ∈ Tp H, and then extend X in the
direction of so that X remains tangent to H and [, X] = 0. Then
One can show that multiplying by a positive function has the effect
of multiplying A, S, and θ by the same positive function. In particular,
although the sizes of these quantities have no invariant geometric meaning,
their signs do.
Next we would like to see how A, S, and θ evolve along a null generator γ.
Given a vector field X tangent to H, one can see that ∇γ X must be tangent
to H. (Check this.) So we can define the covariant derivative of any X̄ ∈
C ∞ (T H/) along γ by X̄ = ∇γ X for any X ∈ C ∞ (T H) that projects to
X̄. This is well-defined in the sense that it is independent of the choice of
X. With this definition, we can easily extend covariant differentiation along
γ to A and S in the standard way.
Theorem 7.23 (Riccati equation for null generators). Let γ be a null gen-
erator for a null hypersurface H, and let = γ be our choice of null normal
along γ. Then along γ, the null shape operator satisfies
S (X̄), Ȳ = −S 2 (X̄)Ȳ − Riem(X, , Y, ),
at any p along γ, where X, Y ∈ Tp H. The null expansion satisfies
1
θ = − θ2 − |S̊|2 − Ric(, ),
n−1
where S̊ is the trace-free part of S. The equation for θ is usually referred to
as the Raychaudhuri equation in the literature, and |S̊| is called the shear
scalar.
The formulas in Theorem 7.23 may look familiar from Riemannian ge-
ometry. Although we can use the same reasoning above to prove the Rie-
mannian analog, it is interesting to note that it can also be seen as a special
case.
Corollary 7.24 (Riccati equation for parallel hypersurfaces). Let Σ be a
hypersurface of a Riemannian manifold (M, g) with unit normal ν. For
each point x ∈ Σ, consider the geodesic expx tν emanating from Σ with
normal vector ν. Let Σt be the image obtained from Σ by flowing along
those geodesics for time t. (This Σt is often called a parallel hypersurface.)
Then along each geodesic, as long as Σt is smooth near the geodesic, the
shape operator St of Σt evolves according to
S (X), Y = −S 2 (X), Y − Riem(X, ν, Y, ν)
along γ, where X, Y ∈ Tp M . The mean curvature satisfies
1
H = − H 2 − |S̊|2 − Ric(ν, ν),
n−1
where S̊ is the trace-free part of S.
To get a sense for what the sign of θ+ means, recall from Proposition 2.10
that θ± represents how the area form on Σ is changing when we vary Σ in
the ± direction (which is why it is called a “null expansion”). Meanwhile,
+ and − represent the two most “extreme” directions that a light ray can
travel away from Σ. We can think of + as shooting light outward from Σ and
− as shooting light inward. A trapped surface is one for which the following
is true. If you flow Σ in the direction of any smooth family of light rays
emanating from Σ, this always has the effect of decreasing area. Intuitively,
this is to be expected if you shoot light rays inward (corresponding to θ− ),
but it is not so expected when you shoot light rays outward (corresponding
to θ+ ). For example, it is not hard to see that any large coordinate sphere
in an asymptotically flat initial data set has θ+ > 0 and θ− < 0. The
physical intuition is that only a “strong gravitational field” can cause light
to be “trapped” in the sense that shooting light in any direction is area
decreasing.
The famous Penrose incompleteness theorem [Pen65] states that in a
spacetime satisfying the NEC, trapped surfaces force the formation of sin-
gularities in the spacetime (assuming there is a noncompact Cauchy hyper-
surface). We will instead state and prove a version appearing in [Gal14]
better suited to our interests.
7.4. Black holes and Penrose incompleteness 235
Claim. Each q ∈ ∂ out J + (Σ) lies on a null geodesic leaving Σ with tangent
vector + = n + ν, where n is the future unit timelike normal to M and ν is
the outward normal of Σ in M . Moreover, this geodesic arrives at q before
it passes any conjugate points.
The proof of Claim 1 only uses the global hyperbolicity and does not use
any assumptions about compactness, trapping, or geodesic completeness. It
should perhaps be thought of as a background lemma from causality theory.
One can show that global hyperbolicity implies that J + (Ω) is closed, that
is, ∂J + (Ω) ⊂ J + (Ω). (Again, this is not so obvious from the way we defined
global hyperbolicity.) Since being timelike is an open condition, one can see
that I + (Ω) is an open subset of J + (Ω), and thus ∂J + (Ω) ⊂ J + (Ω) I + (Ω).
By Proposition 7.4, it then follows that for any q ∈ ∂ out J + (Σ) there exists
a null geodesic γ starting at some p ∈ Ω and ending at q.
We now argue that the starting point p lies in Σ: if it started in Ω,
then we could move the starting point slightly to construct a causal curve
that is timelike near its new starting point in Ω. (Imagine the picture in
Minkowski space to see why this is clear.) By Lemma 7.3, q would lie in
I + (Ω), a contradiction.
236 7. Introduction to general relativity
This part of the proof uses the Raychaudhuri equation, which lies at
the heart of the Penrose incompleteness theorem. By the assumption of
future null geodesic completeness, each null geodesic starting in Σ with
null vector + can be extended for infinite parameter time. Let H+ be the
union of all of these null geodesics. This H+ must be a smooth (possibly
immersed) null hypersurface away from the conjugate points. Since Σ is
compact and trapped, there exists a constant c > 0 such that θ+ < −c < 0.
By the Raychaudhuri equation, each generator must blow up before reaching
parameter time T = (n − 1)/c. (See Exercise 7.26.) In other words, it must
reach a point of nonsmoothness of H+ , which translates to saying that every
generator must reach a conjugate point of the generator before parameter
time T . By Claim 1, ∂ out J + (Σ) lies in the part of H+ with parameter
times lying in the closed interval [0, T ]. Since this latter space is clearly
compact and ∂ out J + (Σ) is a closed subset of it, we conclude that ∂ out J + (Σ)
is compact. Since Ω is assumed to be compact, it follows that ∂J + (Ω) is
compact. See Figure 7.4 for an “illustration” of this (impossible) situation.
To complete the argument, we construct a map from ∂J + (Ω) to M
as follows. By time-orientability, there exists a global timelike vector field
on M generating timelike integral curves. For each point in ∂J + (Ω), we
7.4. Black holes and Penrose incompleteness 237
since the outgoing null normal + of Σ must be the null normal of H. We can
slightly push Σ outward toward the d.o.c. near p to obtain another surface
Σ such that a small part of Σ leaks into the d.o.c. with θ+ < 0, while the
rest of Σ is identically equal to Σ. As above, we can argue that there must
exist a null geodesic from Σ out to I + which is free of conjugate points.
By definition of the d.o.c., this geodesic must originate from a point in the
intersection of Σ with the d.o.c., where θ+ < 0. (And the geodesic must be
outgoing, since an ingoing geodesic will surely enter the black hole region.)
But this contradicts the Raychaudhuri equation via Exercise 7.26. This fact
that the null expansion of a black hole horizon is nonnegative is usually
called the Hawking area theorem or the second law of black hole mechanics.
It is called the area theorem because if Σ is a spacelike surface in H, then
+
θΣ ≥ 0 represents the rate of change of the area element of Σ as it flows in
the direction + . In particular, if Σ1 and Σ2 are smooth closed surfaces on
a smooth part of H with Σ2 in the causal future of Σ1 , then |Σ1 | ≤ |Σ2 |.
A corollary of the Hawking area theorem is that under the same assump-
tions as above we have the following. If the spacetime is stationary, then a
spacelike surface Σ in H has θ+ = 0, or, in other words, it is a marginally
outer trapped surface (or MOTS). Recall that stationarity means that there
is a Killing vector field and hence a family of spacetime isometries. These
isometries must preserve the event horizon and therefore they send Σ to
another cross-section Σ of H in its future, and |Σ| = |Σ |. Because of the
Hawking area theorem, this is only possible if θ+ = 0 along Σ.
Be aware that, like many terms arising from physics, the phrase “appar-
ent horizon” does not always have a consistent technical definition in the
literature. Based on our earlier discussion, if (M, g, k) lies in an asymptot-
ically flat, globally hyperbolic spacetime satisfying the NEC, an apparent
7.5. Marginally outer trapped surfaces 241
horizon will always lie on the inside of the black hole event horizon (in-
cluding the horizon itself), and if the spacetime is stationary, the apparent
horizon will lie on the black hole event horizon. It is the closest we can come
to locating where the event horizon must intersect our initial data set. In
particular, the concept has applications to numerical relativity.
Proof. As seen in Section 2.2, it is clear what the contribution from the
tangential component X̂ must be, so it suffices to consider the case of a
normal variation X = ϕν. Recall from (2.15) that we already know that the
variation of H is
1
DH|Σ (ϕν) = −ΔΣ ϕ + (RΣ − RM − |A|2 − H 2 )ϕ.
2
242 7. Introduction to general relativity
So we can compute
∂ ∂
(trΣ k) = (trg k − k(ν, ν))
∂t ∂t
∂ ∂ ∂
= (trg k) − k (ν, ν) − 2k ν, ν
∂t ∂t ∂t
= ϕ∇ν (trg k) − ϕ(∇ν k)(ν, ν) + 2k(∇ϕ, ν)
= [∇ν (trg k) − (∇ν k)(ν, ν)]ϕ + 2W, ∇ϕ,
where W = WΣ is as defined in the statement of the proposition. This is
already enough to compute the variation of θ+ , but we would like to put it
in a nicer form. Specifically, we would like to see how the quantities μ and
J show up in the formula. Let W̃ be the vector field defined along Σ that is
dual to the k(ν, ·) so that W is just the tangential part of W̃ . We choose an
orthonormal frame e1 , . . . , en−1 for Σ and compute (using Einstein notation)
(∇ν k)(ν, ν) = (divg k − divΣ k)(ν)
= (divg k)(ν) − (∇ei k)(ν, ei )
= (divg k)(ν) − [∇ei k(ν)](ei ) + k(∇ei ν, ei )
= (divg k)(ν) − divΣ W̃ + k(Aij ej , ei )
= (divg k)(ν) − divΣ W + H, W̃ ⊥ + Aij kij
= (divg k)(ν) − divΣ W − Hk(ν, ν) + A, kΣ ,
where we used (2.6) to simplify divΣ W̃ . Putting the three previous compu-
tations together, we have
1
Dθ+ |Σ (ϕν) = −ΔΣ ϕ + 2W, ∇ϕ + [RΣ − RM − |A|2 − H 2
2
+ 2∇ν (trg k) − 2(divg k)(ν) + 2 divΣ W
+ 2Hk(ν, ν) − 2A, kΣ ]ϕ.
The rest of the computation is tedious but straightforward.
Exercise 7.34. Complete the proof of Proposition 7.32. Also, what should
the formula for the linearization of θ− be?
7.5. Marginally outer trapped surfaces 243
where the right side denotes the principal (Dirichlet) eigenvalue of the self-
adjoint operator −ΔΣ + QΣ , where QΣ was defined in Proposition 7.32. In
particular, if Σ is a stable MOTS, then for any smooth function u compactly
supported in the interior of Σ, we have
(7.5) |∇u|2 + QΣ u2 ≥ 0.
Σ
Proof. We drop the Σ subscripts in the following. For any function ϕ that
is positive in the interior of Σ,
Integrating, we obtain
2 −1 0
(7.6) u ϕ LΣ ϕ ≤ |∇u|2 + Qu2 .
Σ Σ
From here, we can argue as we did in Theorem 4.7, though there are
some added technical issues to address. See [AEM11] for details.
Recall from Proposition 2.18 that every stable, two-sided closed minimal
surface in an orientable 3-manifold with positive scalar curvature must be a
sphere. The exact same reasoning leads to the following.
Exercise 7.42. Let (M 3 , g, k) be an orientable initial data set satisfying
the strict dominant energy condition, meaning that μ > |J|g everywhere.
Prove that every stable, two-sided closed MOTS is a sphere.
Since e−Ct θ+ (t) is positive for all t ∈ (0, ) but zero at t = 0, it follows that
there exists a time t = τ such that the left side of the above inequality is
positive. Hence L0Στ ϕτ > 0, and thus ϕ−1 τ LΣτ ϕτ ≥ c > 0 for some constant
0
c. Using this choice of Στ and ϕτ in (7.6) and invoking (7.7), it follows that
1
0 < c ≤ λ1 (−ΔΣτ + QΣτ ) ≤ λ1 (Lhτ ).
2
Thus Σ is Yamabe positive, which is a contradiction.
and if equality holds, then the part of (M, g, k) outside Σ sits inside the
Schwarzschild spacetime of mass m.
g = ds2 + r2 dΩ2
for some smooth positive function r(s), where dΩ2 is the standard metric on
the sphere, while k can be written as
1
k = kνν ds2 + κr2 dΩ2 ,
n−1
where kνν and κ are smooth functions of s. Note that kνν = k(ν, ν), where
ν is the outward normal of a symmetric sphere at s, while κ is the trace of
k over the symmetric sphere at s.
If (g, k) satisfies the dominant energy condition and ∂M is an outermost
MOTS/MITS, then
n−2
1 |∂M | n−1
m≥ ,
2 ωn−1
where m is the ADM mass of (M, g). Moreover, if equality holds, then
(M, g, k) lies inside the Schwarzschild spacetime of mass m.
g = V −1 dr2 + r2 dΩ2
252 7. Introduction to general relativity
√
on [r0 , ∞) × S n−1 , where V = dr ds and r0 satisfies |∂M | = ωn−1 r0
n−1
. We
consider the function
1 n−2 1
m(r) := r 1+ (κ − H )r
2 2 2
2 (n − 1)2
1 1
= rn−2 (1 − V ) + κ2 r n ,
2 2(n − 1)2
where the second expression can be easily compared to the one used in the
proof of Proposition 3.20. In the n = 3 case, this can be written more
geometrically as
&
|Sr | 1 + −
m(r) = 1+ θ θ dμSr ,
16π 16π Sr
where the right side is the appropriate generalization of the Hawking mass
of the sphere Sr to the initial data setting. In the spherically symmetric
setting, it is called the Misner-Sharp energy.
Claim.
1 κ
m (r) = rn−1 μ − J, ν .
n−1 H
In the proof of Proposition 3.20, we already saw that
d 1 n−2 n − 1 n−1
(7.8) (n − 1)2 r (1 − V ) = r R.
dr 2 2
Meanwhile, looking at the κ term in m(r), we compute
2 d
1 dκ n 2
(7.9) (n − 1) 2 n
κ r =r n−1
rκ + κ .
dr 2(n − 1)2 dr 2
Exercise 7.47. Show that in the spherically symmetric case, the constraint
equations (Definition 7.16) reduce to
1 n − 2 2
μ = R + 2κkνν + κ ,
2 n−1
1 dκ 1
J, ν = H − r + κνν − κ .
n − 1 dr n−1
Rearranging the result of the exercise above for inclusion in equations
(7.8) and (7.9),
n − 1 n−1 n−2 2
r R=r n−1
(n − 1)μ − (n − 1)κkνν − κ ,
2 2
dκ κ
rn κ = rn−1 −(n − 1) J, ν + (n − 1)κkνν − κ2 .
dr H
Feeding these into (7.8) and (7.9), we obtain the Claim. Another way to
derive the formula for m (r) is to start with the general variation formula for
7.6. The Penrose inequality 253
where E is the ADM energy. The last equality follows from Exercise 3.21,
since the κ2 term in m(r) decays too fast to contribute to the limit. Finally,
one can easily see that spherical symmetry implies that the ADM momentum
is zero (as you would expect), so that the ADM energy E is the same as the
ADM mass m.
Next we consider the case of equality, which is unfortunately a bit messy.
We will show that if equality holds, then M sits inside the Schwarzschild
spacetime of mass m as a graph over the t = 0 slice. To do this, let us first
make some calculations on graphical hypersurfaces. Consider the spacetime
Schwarzschild metric of mass m,
gm = −Vm dt2 + Vm−1 dr2 + r2 dΩ2 ,
where Vm (r) = 1 − 2mr2−n . Next consider the graph of a radial function
f (r) over the t = 0 slice, that is, we look at the hypersurface defined by
t = f (r). A quick calculation shows that if we write the induced metric gf
on the graph as
g f = Vf−1 dr2 + r2 dΩ2 ,
then
Vf−1 = Vm−1 − Vm (f )2 .
We can also consider the second fundamental form k f of the graph in gm and
f
look at its normal-normal component kνν and its trace over the sphere κf
(both computed with respect to g f ). Some routine but somewhat involved
computations (try it) show that
n − 1
(7.10) κf = Vf − Vm ,
r
d
(7.11) f
kνν = Vf − Vm .
dr
By spherical symmetry, these components determine all of k f . Therefore,
in order to prove that some given spherically symmetric (M, g, k) sits inside
254 7. Introduction to general relativity
We now consider the positive mass theorem for initial data sets.
After Schoen and Yau proved the positive mass theorem in the Rie-
mannian setting, they next used the Jang equation to prove a positive en-
ergy theorem (E ≥ 0) for general three-dimensional initial data sets satisy-
ing the dominant energy condition [SY81b]. Meanwhile, Witten’s spinor
proof, whose time-symmetric version was presented in Chapter 5, was orig-
inally written for general initial data sets. Many years later M. Eichmair
generalized Schoen and Yau’s E ≥ 0 theorem to dimensions less than 8
[Eic13]. Schoen suggested that Schoen and Yau’s proof of Theorem 3.18
could be generalized to the initial data setting using MOTS in place of mini-
mal hypersurfaces, and this was finally carried out in [EHLS16].1 Recently,
Lohkamp has treated the higher-dimensional cases [Loh16] using the theory
he developed for the Riemannian case [Loh15c, Loh15a, Loh15b].
1 That paper assumed stronger decay of (μ, J) than in Definition 7.17, but that was not
necessary.
255
256 8. The spacetime positive mass theorem
Now let us begin our proof of the spacetime positive mass theorem (The-
orem 8.1) in earnest. We argue by induction on dimension and by contra-
diction. Let 3 ≤ n < 8, assume that we have established the desired result
up to dimension n − 1 (unless n = 3), and assume that (M n , g, π) is a
counterexample. Our goal is to show that there exists a counterexample
to the Riemannian positive mass theorem one dimension lower (which is a
special case of our induction hypothesis), unless n = 3, in which case we use
Gauss-Bonnet to obtain a contradiction.
By Theorem 8.3 and Lemma 8.4, we can assume, without loss of gener-
ality, that our counterexample (M n , g, π) has harmonic asymptotics, strict
DEC, and E < |P |. We intend to construct a complete MOTS Σ in M with
a certain desirable strong stability property to be described later. Here is
where we see the value of the harmonic asymptotics assumption.
2
noting that the u n−2 factor does not contribute to the limit. With our
n−2 |P | and bi = 0 for
assumption on the direction P points, we have bn = n−1
i < n.
Next, we claim that
−n
(8.1) +
θ{xn =Λ} = (n − 1)(|P | − E)Λ|x| + O(|x|−n ).
Since g is conformal to Euclidean space outside a compact set, we can use
Exercise 2.14 to compute
2(n − 1) n−2
−2
−1
H{xn =Λ} = u ∂n u
n−2
2(n − 1) n−2
−2
−1 2 − n −n n −n
= u E|x| x + O(|x| )
n−2 2
= −(n − 1)E|x|−n Λ + O(|x|−n ),
where we use the asymptotic expansion of u. To compute tr{xn =Λ} (k), first
note that
2 1
kij = u n−2 (LY δ)ij − (divδ Y )δij .
n−1
Using the fact that Yi = O1 (|x|1−n ) for i < n, we have
n−1
tr{xn =Λ} (k) = g ij kij
i,j=1
n−1
−2 1
= u n−2 δ ij [Yi,j + Yj,i − (divδ Y )δij ]
n−1
i,j=1
n−1
−2 −1
= u n−2 δ ij
Yn,n δij + O(|x|−n )
n−1
i,j=1
= −Yn,n + O(|x|−n )
= (n − 1)|P ||x|−n Λ + O(|x|−n ),
where we used the asymptotic expansion of Y in the last step. Hence we
+
have (8.1), which shows that for large enough Λ one has θ{xn =Λ} > 0. The
+
proof that θ{xn =−Λ} < 0 is similar.
Lemma 8.6 gives us barriers for MOTS. More specifically, this would be
enough to invoke Theorem 7.39 if these barriers were actually compact. In
order to create a region with a compact boundary, for each large ρ, let Cρ
be the region of M whose boundary is the cylinder ∂Cρ := {x | (x1 )2 + · · · +
(xn−1 )2 = ρ2 } in the asymptotically flat end. Then for each Λ > 0, define
Cρ,2Λ to be the part of Cρ lying between the two planes xn = ±2Λ. For
each h ∈ [−Λ, Λ], we can consider the sphere Γn−2
ρ,h which is where ∂Cρ,2Λ
260 8. The spacetime positive mass theorem
Lemma 8.7. Assume the hypotheses of Lemma 8.6, and let Λ be large
enough so that the lemma holds. Then for any large enough ρ and any
h ∈ [−Λ, Λ], there exists a λ-minimizing stable MOTS Σρ,h whose boundary
is Γρ,h and lies inside Cρ,2Λ . The constant λ is independent of ρ and h.
There is a technical issue here because ∂Cρ,2Λ has corners, but intu-
itively, this should not be a problem since smoothing the corners creates
large positive mean curvature that only serves to help with the desired bar-
rier inequalities. (This smoothing of the corners is why we use 2Λ instead
of Λ.) Also, we assumed above that there was only one end, but if there are
other ends, we can just cut off our region Cρ,2Λ by large coordinate spheres
in those other ends, and as seen earlier those large coordinate spheres can
just be considered as part of ∂2h Cρ,2Λ since they have θ+ < 0 (with respect
to the appropriate normal).
Since what we really want is a complete MOTS rather than a MOTS
with boundary, we will take a limit as ρ → ∞.
Lemma 8.8. Assume the hypotheses of Lemma 8.6. For any choice of
ρj → ∞ and hj ∈ [−Λ, Λ], there exists a subsequence of Σρj ,hj that smoothly
converges on compact subsets of M to a complete properly embedded MOTS
Σ∞ . Moreover, there exists α ∈ (0, 1) and a constant c ∈ [−2Λ, 2Λ] such
that outside a large compact subset of M , Σ∞ can be written as the Eu-
clidean graph {xn = f (x )} of some function f (x ) = c + O3+α (|x |3−n ) in
the (x1 , . . . , xn−1 , xn ) = (x , xn ) coordinate system. In particular, expressed
in x coordinates, the metric h on Σ∞ satisfies hij − δij ∈ C2−n 2,α
(Σ∞ ).
Sketch of the proof. The proof uses some standard geometric measure
theory arguments, as well as some facts about the prescribed mean curvature
equation for graphs. Because of this, the full argument lies outside the scope
of the book, but we outline the basic ideas.
8.1. Proof for n < 8 261
The limit space Σ∞ is not just a MOTS, but it also enjoys a stability
property.
Lemma 8.9. Assume the hypotheses of Lemma 8.6. Let Σ∞ be the complete
1,2
MOTS described above. For any smooth v ∈ W 3−n (Σ∞ ), we have
2
|∇v|2 + QΣ∞ v 2 dμΣ∞ ≥ 0.
Σ∞
In other words, Σ∞ inherits the symmetrized stability property.
Together with the exercise above, the strict dominant energy condition
then implies that
(8.2) KΣ∞ dμΣ∞ > 0,
Σ∞
8.1. Proof for n < 8 263
where KΣ∞ denotes the Gauss curvature. However, one can see from the
Gauss-Bonnet Theorem that this is inconsistent with the fact that the sur-
face Σ∞ is asymptotically planar. More precisely, recalling from Lemma 8.8
that the induced metric h satisfies hij (x ) = δij + O1 (|x |−1 ), we can use
the same argument that was used to prove Theorem 3.30 to obtain the de-
sired contradiction. Therefore the spacetime positive mass theorem holds in
dimension 3.
8.1.3. Proof of the 3 < n < 8 case. When 3 < n < 8, we will construct a
complete (n − 1)-dimensional MOTS which can be conformally deformed to
provide a counterexample to the Riemannian positive mass theorem in n − 1
dimensions (Theorem 3.18), giving a contradiction. However, it is worth not-
ing that in this chapter we are also simultaneously reproving Theorem 3.18
as a special case of Theorem 8.1 via an induction argument.
As in the n = 3 case, we assume without loss of generality that Σ∞ is
connected. Let h be the induced metric on Σ∞ . We seek a smooth positive
4
function w such that w approaches 1 at infinity and w n−3 h is scalar-flat. By
equation (1.6), this means that we need to solve
4(n − 2)
Lh w := − ΔΣ∞ w + RΣ∞ w = 0,
n−3
Lh v = −RΣ∞ ,
2,p
Lh : W−q (Σ∞ ) −→ Lp−q−2 (Σ∞ )
is an isomorphism for some p > n, n−3 2 < q < n − 3. We will use the
2,p
stability from Lemma 8.9 to prove injectivity. Suppose that ϕ ∈ W−q solves
Lh ϕ = 0. By elliptic regularity, ϕ is smooth, and by weighted Sobolev
264 8. The spacetime positive mass theorem
embedding, ϕ ∈ C−q
1 . We argue as in the proof of Proposition 2.25 that
0≤ |∇ϕ|2 + QΣ∞ ϕ2 dμΣ∞
Σ
∞
1
≤ |∇ϕ| + RΣ∞ ϕ
2 2
dμΣ∞
Σ∞ 2
1 4(n − 2)
≤ |∇ϕ| + RΣ∞ ϕ
2 2
dμΣ∞
2 Σ∞ n−3
1
= ϕLh ϕ dμΣ∞
2 Σ∞
= 0,
where the second inequality follows from the dominant energy condition,
the third follows from the fact that 2 < 4(n−2)
n−3 for n > 3, and the final
equality follows from integration by parts and the decay of ϕ. Therefore the
third inequality must be an equality, which implies that ϕ is constant and
2,p
therefore identically zero. So Lh : W−q (Σ∞ ) −→ Lp−q−2 (Σ∞ ) is injective,
and it has index zero by Corollary A.42; therefore it is an isomorphism.
Hence we have the desired v and w = v + 1, which is smooth by elliptic
regularity (Theorem A.4). By Lemma 8.8, the metric h is asymptotically flat
with ADM energy zero. By Corollary A.38 and Exercise 3.13, the conformal
4
scalar-flat metric h̃ := w n−3 h on Σ∞ is also asymptotically flat.2
In order to obtain a contradiction to the Riemannian positive mass theo-
rem in dimension n−1, all we need now is to show that h̃ has negative ADM
energy. We use Exercise 3.12 to compute the change in mass as follows:
2
E h̃ = E(h) − lim w∇ν w dμ|x |=r
(n − 3)ωn−2 r→∞ |x |=r
2
=0− lim w∇η w dμ∂(Σ∞ ∩Cr )
(n − 3)ωn−2 r→∞ ∂(Σ∞ ∩Cr )
2
=− (|∇w|2 + wΔΣ∞ w) dμΣ∞
(n − 3)ωn−2 Σ∞
−2 n−3
= |∇w| +
2
RΣ w 2
dμΣ∞ ,
(n − 3)ωn−2 Σ∞ 4(n − 2) ∞
where the first line involves an integral over a coordinate sphere in Rn−1
and ν is the Euclidean outward normal, while η is the outward normal of
2 Technically, we must check that w is positive. Although the maximum principle stated in
Theorem A.2 does not apply to Lh since we do not have a sign on RΣ∞ , there is a version of
the maximum principle that applies as long as there exists a positive subsolution, and we can
construct such a subsolution by taking a limit of the principal eigenfunctions of the conformal
Laplacians on Σρj ,hj .
8.1. Proof for n < 8 265
8.1.4. The functional F . Recall that Lemma 8.9 ultimately came from
variations of the Σj fixing the boundary. The stability we desire corresponds
to moving the boundary in the vertical direction. Thus Lemma 8.11 can be
thought of as a statement of “vertical stability.”
First we will briefly describe how this was done by Schoen and Yau
in the time-symmetric case. Define Cρ , Cρ,2Λ , and Γρ,h as we did earlier
in Section 8.1.1. Using geometric measure theory (specifically, a version of
Theorem 2.22 with prescribed boundary), given large ρ and any h ∈ [−Λ, Λ],
one can construct a minimal hypersurface Σρ,h with boundary Γρ,h with the
property that it minimizes volume compared to every other hypersurface
with the same boundary. (For the general spacetime positive mass theorem,
we instead used Theorem 7.39 to construct the desired stable MOTS.) How-
ever, in order to achieve the “vertical stability” for each fixed ρ, Schoen and
Yau also minimized volume over all h ∈ [−Λ, Λ], and the barriers guaran-
tee that the minimizing h lies in the interior (−Λ, Λ). One can then show
that Σρ,hρ has the desired vertical stability property if hρ is the minimiz-
ing height. Since there is no direct analog of volume minimization in the
MOTS setting, this is an essential difficulty in generalizing the Schoen-Yau
approach.
In order to motivate the discussion, let us take a closer look at what
happens in the time-symmetric case. Suppose for a moment that the family
{Σρ,h }|h|≤|Λ| is actually a smooth foliation of minimal hypersurfaces with a
266 8. The spacetime positive mass theorem
Proof. Recall from Lemma 8.6 that θ+ > 0 on the plane xn = Λ and all
planes above it. Then by the strong maximum principle for θ+ (Proposi-
tion 7.38), we know that Σρ,Λ lies below the plane {xn = Λ} in Cρ and
that they cannot meet tangentially at their common boundary Γρ,Λ . Hence
the Euclidean outward unit normal of ∂Σρ,Λ in Σρ,Λ satisfies η̄ n > 0. (See
Figure 8.1.) The inequality F (Σρ,Λ ) > 0 then follows from equation (8.6).
The proof that F (Σρ,−Λ ) < 0 is analogous.
By examining the proof of Theorem 7.39, one can show that the Σρ,h
are ordered in h in the sense that for h1 < h2 , Σρ,h1 lies below Σρ,h2 . More
8.1. Proof for n < 8 267
xn = Λ
Σρ,Λ
Σρ,h
0
Γρ,h Γρ,h
0 0
Σρ,h
0
Σρ,-Λ
xn = −Λ
∂Cρ ∂Cρ
Figure 8.1. For any h ∈ [−Λ, Λ], we can find at least one stable MOTS
Σρ,h with prescribed boundary sphere Γρ,h . The cylinder Cρ and the
planes xn = ±Λ serve as barriers. Uniqueness fails at jump heights,
illustrated in the picture above by the height h0 . When n > 3, we must
carefully choose a (nonjump) height hρ in order to obtain the desired
“vertical stability” property for Σρ,hρ .
Lemma 8.14. Assume the hypotheses of Lemma 8.7, and let Λ, ρ, and Σρ,h
be as described in that lemma. The function h → F (Σρ,h ) is continuous at
every h0 ∈ [−Λ, Λ] that is not a jump height. If h0 ∈ [−Λ, Λ] is a jump
height, then
lim F (Σρ,h ) ≥ F (Σρ,h0 ) ≥ lim F (Σρ,h ),
h→h−
0 h→h+
0
where both limits exist, and at least one of the inequalities above is strict.
In other words, there must be a downward jump discontinuity at every jump
height.
Proof. Let h0 ∈ [−Λ, Λ]. Since the lower and upper envelopes are smooth
limits, limh→h− F (Σρ,h ) = F (Σρ,h0 ) and limh→h+ F (Σρ,h ) = F (Σρ,h0 ). By
0 0
definition, if h0 is not a jump height, then both of these limits must equal
F (Σρ,h0 ), so that continuity holds at all nonjump heights.
Let h0 be a jump height. Since the family {Σρ,h }|h|≤Λ is ordered, it
is clear that Σρ,h0 lies below Σρ,h0 , which lies below Σρ,h0 (where “below”
does not necessarily mean strictly). And since they all share the common
boundary Γρ,h0 , we have an ordering of their corresponding quantities ∂n , η
along Γρ,h0 . From the definition of F , the desired inequalities follow. More-
over, since h0 is jump height, Σρ,h0 = Σρ,h0 , so the strong maximum prin-
ciple (Proposition 7.38) implies that at least one of the two inequalities is
strict.
where
and S and H are the shape operator and mean curvature of Σ as a hyper-
surface in (M, g) (and Z = φν + Ẑ = ∂n was defined above).
Proof. The main thing to observe is that the boundary integral in this
formula is similar to the one obtained in the second variation formula with
boundary (Theorem 2.19). This is because F is similar to the second term
appearing in the first variation formula with boundary (Proposition 2.10).
In fact, one could potentially use Theorem 2.19 as a starting point, but we
choose to argue from scratch instead.
Let e1 , . . . , en−2 be a local orthonormal frame for T (∂Σ). We can differ-
entiate Z, η, and the induced measure on ∂Σ to obtain
6 7
n−2
DF |Σ (X) = ∇X Z, η + Z, ∇η X, νν − ∇ei X, ηei
∂Σ i=1
n−2
− Z, ei ∇ei X, η + Z, η div∂Σ X dμ∂Σ .
i=1
The derivative of η above was computed using the fact that e1 , . . . , en−2 , η, ν
is an orthonormal basis and differentiating the orthogonality relations. The
second term in the integrand of (8.9) is
< =
Z, ν∇η X, ν = φ ∇η (ϕν + X̂), ν
(8.10) = φ ∇η ϕ + ∇η X̂, ν
= φ∇ϕ, η − φS(X̂), η.
270 8. The spacetime positive mass theorem
n−2
= Z, η divΣ X − ∇η X, η − Z, ei ∇ei X, η
i=1
n−2
The result follows from combining the first term in the integrand of (8.9)
with the computations (8.10) and (8.11).
8.1.5. Height picking and stability. The following lemma finds a height
hρ which corresponds to the volume-minimizing height in the time-symmetric
case, as motivated by equation (8.4).
Lemma 8.16. Assume the hypotheses of Lemma 8.7, and let Λ, ρ, and
Σρ,h be as described in that lemma. Define Σρ,h to be the component of
Σρ,h containing its boundary Γρ,h . There exists hρ ∈ (−Λ, Λ) and a smooth
vector field X along Σρ,hρ that is equal to Z = ∂n at ∂Σρ,hρ = Γρ,hρ , such
that ϕ = X, ν > 0,
(8.12) Dθ+ |Σρ,h (X) = 0,
ρ
and
(8.13) DF |Σρ,h (X) ≥ 0.
ρ
Sketch of the proof. The actual proof of this lemma is somewhat com-
plicated, but in light of Lemmas 8.13 and 8.14, it is fairly intuitive and
expected. Viewed as a function of h, F (Σρ,h ) must go from negative (at
h = −Λ) to positive (at h = Λ), and its only discontinuities are jump dis-
continuities that jump down. Therefore there certainly ought to exist a value
hρ at which F (Σρ,h ) is increasing in h.
More precisely, we claim that hρ = inf{h ∈ [−Λ, Λ] | F (Σρ,h ) > 0} will
give us the desired height. If this family Σρ,h were differentiable in h at
h = hρ , we could choose X to be the first-order deformation of the family
8.1. Proof for n < 8 271
where
(8.15) G̃(X) = G(X) + φϕW.
Here W := WΣρ and Q := QΣρ are defined as in Proposition 7.32, and we
use the abbreviations dμ := dμΣρ and dA := dμ∂Σρ .
where we used equation (8.7) and inequality (8.13) in the third and fourth
lines, respectively.
|∇φ∞ |2 + Q∞ φ2∞ dμ = lim |∇φ∞ |2 + Q∞ φ2∞ dμ
Σ∞ r→∞ Σ ∩C
∞
r
= lim |∇φ∞ |2 + Q∞ φ2∞ dμ + G̃∞ (Z), η∞ dA
r→∞ Σ∞ ∩Cr ∂(Σ∞ ∩Cr )
= lim lim |∇φj |2 + Qj φ2j dμ + G̃j (Z), ηj dA
r→∞ j→∞ Σj ∩Cr ∂(Σj ∩Cr )
= lim |∇φj | +
2
Qj φ2j dμ + G̃j (Z), ηj dA
j→∞ Σj ∂Σj
≥ 0,
0 ≤ lim inf |∇wi |2 + Q∞ wi2 dμ
i→∞
Σ∞
= lim inf (|∇w|2 + Q∞ v 2 ) + 2∇v, ∇(wi − w) + 2Q∞ w(wi − w)
i→∞ Σ∞
+(|∇(wi − w)|2 + Q∞ (wi − w)2 ) dμ
= |∇w|2 + Q∞ w2 dμ,
Σ∞
where the cross terms vanish because of (8.19). Finally, we note that
1,2
φ∞ − 1 ∈ W 3−n to obtain the desired result.
2
8.3. Proof for spin manifolds 275
Let M be a spin manifold, and let (M, g, k) be an initial data set. Then
we can make the following constructions. Consider the bundle R × T M , and
let e0 be the constant section (1, 0), meaning that it is always 1 in the R
component and zero in the vector field component. We can equip this bundle
with a Lorentzian product g by declaring g(eμ , eν ) = ημν , where e1 , . . . , en
is any orthonormal basis of T M and Greek indices run from 0 to n as usual.
(If we think of (M n , g, k) as sitting inside some spacetime (Mn+1 , g), then
this is just the pullback of the bundle T M, but we choose to take a purely
initial data point of view and avoid directly dealing with M.)
The Clifford algebra construction described in Chapter 5 works perfectly
well for products that are not positive definite, and the construction can be
used as a bundle construction to obtain a Clifford bundle
∞ 5
4r
Cl(R × T M ) = (R × T M ) I,
r=0
˜ i = ∇i + 1 kij ej e0 .
∇
2
˜ comes from the am-
(From a spacetime perspective, the connection ∇
bient Levi-Civita connection on T M, while ∇ comes from the intrinsically
defined Levi-Civita connection on T M .) We now define the hypersurface
Dirac operator D̃ on S̃(M ) by
˜i
D̃ = ei · ∇
1
= D + kij ei ej e0
2
1
= D − (tr k)e0 ,
2
where D is the usual Dirac operator on S̃(M ) as defined in Chapter 5, and we
used symmetry considerations in the last line. (The trace of k is computed
with respect to g.) Next we obtain a version of the Schrödinger-Lichnerowicz
formula (Theorem 5.10) for initial data sets.
Theorem 8.21 (Witten). Let (M, g, k) be a spin initial data set. For any
ψ ∈ C ∞ (S̃(M )),
˜ i ψ + 1 (μ + Je0 ) · ψ,
˜ ∗i ∇
D̃2 ψ = ∇
2
˜ ∗ is the formal adjoint of ∇
where ∇ ˜ on S̃(M ).
Proof. We will take advantage of the work we already did to prove Theo-
rem 5.10 in Chapter 5. As usual, we choose an orthonormal basis e1 , . . . , en
that is parallel at the point where we are computing. For any ψ ∈ C ∞ (S̃(M )),
we have
1 1 1
D̃2 ψ = D2 ψ − ei · ∇i [(tr k)e0 · ψ] − (tr k)e0 ei · ∇i ψ − (tr k)2 ψ
2 2 4
∗ 1
1 1
= ∇ ∇ψ + Rψ − ∇i (tr k)ei e0 · ψ − (tr k)2 ψ
4 2 4
1 1
= ∇∗ ∇ψ + (R − (tr k)2 ) − ∇(tr k)e0 · ψ,
2 2
where we used Theorem 5.10 in the second line. On the other hand, since
the formal adjoint of ∇ on S̃(M ) is
˜ ∗ = −∇i + 1 kij ej e0 ,
∇ i
2
278 8. The spacetime positive mass theorem
The real work needed to prove this corollary was already taken care of
in Corollary 5.15. All that is left to do here is check that the k terms in the
discrepancy between L̃i and Li have enough decay that they do not matter,
and this is indeed the case. (Check this.)
Next we prove that the hypersurface Dirac operator is an isomorphism,
as we did for the ordinary Dirac operator in Proposition 5.16.
280 8. The spacetime positive mass theorem
Now we can see that the spacetime positive mass theorem follows from
the previous proposition, Corollary 8.26, and Corollary 8.22, exactly the
way we concluded Theorem 5.12 in Chapter 5. Specifically, if ψ is chosen to
8.3. Proof for spin manifolds 281
1,2
solve D̃ψ = 0 with ψ − ψ0 ∈ W−q , where ψ0 is a constant spinor chosen as
in Corollary 8.26, then
2 ˜ 1< =
E − |P | = |∇ψ| + ψ, (μ + Je0 ) · ψ
2
dμM ≥ 0,
(n − 1)ωn−1 M 2
where the inequality follows from the dominant energy condition.
Corollary 8.28 (Spacetime positive energy rigidity for spin manifolds).
Let (M, g, k) be a complete asymptotically flat spin initial data set sitting
inside some spacetime (M, g) satisfying the dominant energy condition, and
suppose that E = 0 in some end. Then the ambient spacetime metric is flat
along M .
Gi0 = Rici0 = 0,
Ric00 = Ri0i0 ,
R = −2Ri0i0 ,
1
G00 = Ric00 + R = 0.
2
We now invoke the dominant energy condition in the ambient spacetime,
which can be seen to imply that |Gij | ≤ G00 . Since we have seen that the
latter is zero, it follows that Gij is zero. Consequently G is identically zero,
and hence Ric is identically zero also. In particular,
Ricij = −Ri0j0
Sketch of the proof. The basic idea is to find a solution ψ of the hyper-
surface Dirac equation D̃ψ = 0 such that ψ is asymptotic to the constant
spinor ψ0 at infinity (just as in the proof described earlier), with the addi-
tional boundary condition
νe0 · ψ = ψ
at ∂M , where ν is the outward unit normal of ∂M (pointing into M ). The
key point is that this boundary condition guarantees that the ∂M term
8.3. Proof for spin manifolds 283
arising from Corollary 8.22 vanishes: suppose for now that we can find the
desired ψ. Then by Corollary 8.22 and Corollary 8.26,
1
− 1)ωn−1 |ψ0 | (E − |P |) =
2 ˜ 2 + 1 ψ, (μ + Je0 ) · ψ dμM
|∇ψ|
2 (n 2
M
n
+ ψ, L̃i ψν i dμ∂M
∂M i=1
n
≥ ψ, L̃i ψν i dμ∂M ,
∂M i=1
where the inequality follows from the dominant energy condition. We will
show that the boundary condition νe0 · ψ = ψ implies that this boundary
integral vanishes. Recall from equation (8.20) that
1
L̃i ψ = Li ψ + (kij − (trg k)δij )ej e0 · ψ.
2
which proves the Claim. Note that we used symmetry of the shape opera-
tor S. Next we take care of the k terms:
< 1 =
ψ, (kij − (trg k)δij )ej e0 · ψ ν i
2
1< =
= ψ, [k(ν, ej )ej − (trg k)ν]e0 · ψ
2
1< =
= ψ, [k(ν, ej )ej − (tr∂M k)ν]e0 · ψ
2
1 1
= ψ, k(ν, ej )ej e0 · ψ − (tr∂M k)ψ, νe0 · ψ
2 2
1 1
= − ψ, k(ν, ej )ej ν · ψ − (tr∂M k)ψ, ψ
2 2
1
= − (tr∂M k)|ψ| , 2
2
where the last line follows because ψ, ej ν · ψ = 0 when j = n. Altogether,
we have shown that
n
1
ψ, L̃i ψν i = − θ+ |ψ|2 ,
2
i=1
which vanishes under the assumption that ∂M is a MOTS.
The only thing left to do is establish the existence of the desired ψ.
To do this one first shows that the boundary condition νe0 · ψ = ψ is an
“elliptic boundary condition” in the sense that it satisfies the Lopatinski-
Shapiro conditions as in [Wlo87]. Once that is done, the main thing to
show is that the operator D̃ is injective on the space of spinors satisfying
the boundary condition and vanishing sufficiently fast at infinity. Recall that
this was the key fact underlying the proof of Proposition 8.27. Suppose that
ϕ solves D̃ϕ = 0 with boundary condition νe0 · ϕ = ϕ at ∂M , such that ϕ
1,2
vanishes at infinity in the sense that it lies in W−q . Then by Corollary 8.22,
Corollary 8.26, and the calculation above,
0= ˜ 2 + 1 ϕ, (μ + Je0 ) · ϕ dμM .
|∇ϕ|
M 2
Then by the DEC, it follows that ∇ϕ ˜ = 0 everywhere, and then by the
proof of Proposition 8.27, it follows that ϕ is zero. Hence we have the
desired injectivity, which can be used to complete the proof.
Our main goal for this chapter is to prove a density theorem for DEC (The-
orem 8.3), but we will also prove a density theorem for vacuum constraints
as well. First, let us settle some notation.
Assumption 9.1. Throughout this entire chapter, M will be a manifold of
dimension n ≥ 3 equipped with a background metric ḡ that is identically
Euclidean on each noncompact end of M . The variables p and q are real
numbers satisfying p > n and n−2 2 < q < n. If a statement assumes that
(g, π) is asymptotically flat initial data, then we will assume that this q is
smaller than the assumed decay rate of (g, π) in Definition 7.17. We also
4
define s := n−2 .
285
286 9. Density theorems for the constraint equations
2,p 1,p
for all (g, π) ∈ W−q (ḡ) × W−q−1 (T M T M ), where we use 2μ instead of μ
merely to avoid factors of 2 in our formulas. (Recall the relationship between
π and k from Definition 7.16.)
For notational simplicity, we will often abbreviate notation by writing
things like
2,p 1,p
Φ : W−q (ḡ) × W−q−1 −→ Lp−q−2 .
2,p 1,p
Exercise 9.3. Check that for (g, π) ∈ W−q (ḡ) × W−q−1 , the constraints
p
Φ(g, π) indeed lie in L−2−q .
2,p 1,p
DΦ|(g,π) : W−q × W−q−1 −→ Lp−q−2 ,
2,p 1,p
for all (h, w) ∈ W−q ×W−q−1 , where the more complicated contractions have
been written out in index notation with summation convention.
9.1. The constraint operator 287
2,p
Let W−q (1) denote the space of positive functions u such that u − 1 ∈
2,p 2,p 2,p
W−q (M ). Note that W−q (1) is a subset of an affine copy of W−q (M ) and,
2,p 1,p
in particular, inherits its topology. Given a fixed (g, π) ∈ W−q (ḡ) × W−q−1 ,
we define a map
2,p 2,p
T : W−q (1) × W−q (T M ) −→ Lp−q−2 (M ) × Lp−q−2 (T M )
2,p 2,p
as follows. For any (u, Y ) ∈ W−q (1) ×W−q , we define new initial data (g̃, π̃)
by
g̃ := us g,
π̃ := us/2 (π + Lg Y ).
We define
˜
T (u, Y ) := Φ(g̃, π̃) = (2μ̃, J),
where μ̃ and J˜ are the corresponding energy-momentum densities of the
initial data (g̃, π̃).
2,p 1,p
Exercise 9.7. Let (g, π) ∈ W−q (ḡ) × W−q−1 , and define T as above. Show
2,p 2,p
that for any (u, Y ) ∈ W−q (1) × W−q ,
−s 1
T (u, Y ) = u u−1 Lg u + (trg π + trg Lg Y )2
n−1
− (|π|g + 2Lg Y, π + |Lg Y |g ) ,
2 2
−s/2 n−1
u (divg Lg Y + divg π)i + (π + Lg Y )ij u−1 u,j
n−2
1 ij −1
− trg (π + Lg Y )g u u,j ,
n−2
where Lg denotes the conformal Laplacian.
Also show that the linearization of T at (1, 0),
2,p
DT |(1,0) : W−q −→ Lp−q−2 ,
4(n − 1) 2
DT |(1,0) (v, Z) = − Δg v + (trg π)(divg Z) − 4π, ∇Z − 2sμv,
n−2 n−1
n − 1 ij 1
i
(divg Lg Z) + π v,j − (trg π)∇ v − sJ v
i i
n−2 n−2
2,p
for all (v, Z) ∈ W−q , where (μ, J) are the constraints of (g, π).
9.1. The constraint operator 289
Note the close relationship between Definition 9.6 and the definition of
harmonic asymptotics (Definition 8.2). The following lemma shows that the
asymptotics for u and Y that appear in Definition 8.2 actually follow from
strong decay of the constraints.
Lemma 9.8. Let (M, g, π) be an asymptotically flat initial data set with
0,α
constraints (μ, J). Assume that (μ, J) ∈ C−n−1−δ for some 0 < α < 1 and
δ > 0.
2,p 2,p
Suppose there exists (u, Y ) ∈ W−q (1) × W−q (T M ) such that
g = us ḡ,
π = us/2 LY
outside some compact set, where L := Lḡ . Then there exist constants a, bi ∈
R and α ∈ (0, 1) such that
u(x) = 1 + a|x|2−n + O2+α (|x|1−n ),
Y i (x) = bi |x|2−n + O2+α (|x|1−n ).
In other words, (g, π) has harmonic asymptotics in the sense of Defini-
tion 8.2.
Finally, we have the following initial data version of Corollary 6.11, due
to Corvino and Schoen [CS06, Proposition 3.1]. This theorem will allow us
to find solutions of the constraint equations nearby any known solution.
2,p 1,p
DΦ|(g,π) : W−q × W−q−1 −→ Lp−q−2
is surjective.
Proof. First, we claim that DΦ|(g,π) has closed range. To see this, observe
that the range contains the range of DT |(1,0) , as defined above. By Exer-
cise 9.7, we see that the top-order part of DT |(1,0) is just the Laplacian.
Therefore a systems version of Corollary A.42 (as in [LM83]) tells us that
the range of DT |(1,0) has finite codimension, and this implies the claim.
Since DΦ|(g,π) has closed range, surjectivity of DΦ|(g,π) is equivalent to
injectivity of its formal L2 adjoint map. Suppose (ξ, V ) is an element in the
∗
kernel of DΦ|∗(g,π) . That is, it lies in the dual space Lp2+q−n and solves the
equation DΦ|∗(g,π) (ξ, V ) = (0, 0) in a weak sense. By taking the trace of
the first component of the formula for the adjoint in Proposition 9.4, we can
solve for Δξ in terms of everything else. Feeding that back into the untraced
equation, we see that
Hess ξ = “Ricg ∗ ξ + π ∗ π ∗ ξ + π ∗ LV g + ∇π ∗ V ”,
where the right side is supposed to represent the various kinds of terms
that will appear, with ∗ indicating some sort of generic contraction, and we
ignore the constant scalar factors. Similarly,
LV g = “π ∗ ξ”,
and therefore
Taking the trace of (9.1) and using asymptotic flatness and our initial
∗ ∗
decay assumption ξ, V ∈ Lp2+q−n , we have Δg ξ ∈ Lpq−n . So by weighted
2,p ∗
elliptic regularity (Theorem A.32), we have ξ ∈ W2+q−n . In particular,
∗
∇ξ ∈ Lp1+q−n , so we can now use the same argument on the trace of (9.2)
2,p ∗
to conclude that V ∈ W2+q−n also. Then by weighted Sobolev embedding
np∗
∗
(Theorem A.25), we have ξ, V ∈ L2+q−n
n−2p
. In summary, we have used elliptic
estimates to improve from a Lebesgue space with exponent p∗ to one with
np∗
exponent n−2p ∗ . We can repeat this process for further improvement. In
For the last step, we argue that infinite order vanishing implies that
(ξ, V ) vanishes identically. This follows from an elliptic argument as was
carried out in [CS06], but since we have the Hessian equations at our dis-
posal, we will use an ODE argument instead as explained in [HMM18].
Specifically, for any point p ∈ M , choose an arclength-parameterized curve
γ(t) starting at γ(1) = p and running off to infinity along a geodesic ray, and
note that t and |x| are uniformly bounded by each other at γ(t). Consider
the function
F (t) = t2 (|∇ξ|2 + |∇V |2 ) + ξ 2 + |V |2 ,
where each function is evaluated at γ(t). Then
|F (t)| ≤ 2t(|∇ξ|2 + |∇V |2 ) + 2t2 (|∇ξ| · | Hess ξ| + |∇V | · | Hess V |)
+ 2|ξ| · |∇ξ| + 2|V | · |∇V |.
We use (9.1) and (9.2) and asymptotic flatness to see that | Hess ξ| and
| Hess V | are both bounded by tC2 (|ξ| + |V |) + Ct |∇ξ|. Therefore
C
|F (t)| ≤ 2t(|∇ξ|2 + |∇V |2 ) + 2t2 (|∇ξ| + |∇V |) (|ξ| + |V |)
t2
C
+ 2t2 (|∇ξ| + |∇V |)|∇ξ| + 2|ξ| · |∇ξ| + 2|V | · |∇V |
t
1 2 1
≤ 2t(|∇ξ| + |∇V | ) + 2C t|∇ξ| + t|∇V | + |ξ| + · |V |
2 2 2 2 2
t t
1 1
+ Ct(3|∇ξ|2 + |∇V |2 ) + |ξ|2 + t|∇ξ|2 + |V |2 + t|∇V |2
t t
5(1 + C)
≤ F (t).
t
Using an integrating factor, this ODE inequality tells us that
h(t) ≥ h(1)t−5(1+C) .
But this contradicts the infinite order vanishing of (ξ, V ) and its gradient,
unless h(1) = 0. Hence (ξ, V ) vanishes at every point p ∈ M , completing
the proof of injectivity of DΦ|∗(g,π) .
vacuum constraints, and let p > n and n−2 2 < q < n − 2 such that q is less
than the decay rate of (g, π) in Definition 7.17.
Then for any > 0, there exists vacuum initial data (g̃, π̃) on M with
harmonic asymptotics (in the sense of Definition 8.2) such that (g̃, π̃) is
2,p 1,p
-close to (g, π) in W−q × W−q−1 .
Proof. We start out following the basic steps set forth in the proof of
Lemma 3.34, except that we use the deformation of initial data described
in the previous section. Let χ be a smooth nonnegative cutoff function on
Rn that is equal to 1 on B1 and vanishes outside B2 . For λ ≥ 1, define
χλ (x) = χ(x/λ). For λ large enough, we can think of χλ as being defined
on M by extending it to be 1 on the compact region of M . Recall that ḡ is
equal to the Euclidean metric on the asymptotically flat end. Define
gλ := χλ g + (1 − χλ )ḡ,
πλ := χλ π,
so that gλ = ḡ and πλ = 0 when |x| > 2λ. The asymptotic flatness of (g, π)
2,p 1,p
implies that (gλ , πλ ) → (g, π) in the W−q × W−q−1 sense as λ → ∞. For
convenience, we define (g∞ , π∞ ) := (g, π).
2,p 2,p
Next, for any pair (u, Y ) ∈ W−q (1) × W−q (T M ), we define
g̃ := us gλ ,
π̃ := us/2 (πλ + Lgλ Y ),
4
where s := n−2 , just as in Definition 9.6. If we can find a pair (u, Y )
such that (g̃, π̃) satisfies the vacuum constraints, then it will follow from the
construction and Lemma 9.8 that (g̃, π̃) has harmonic asymptotics. As in
Definition 9.6, we define
Tλ (u, Y ) := Φ(g̃, π̃) = Φ(us gλ , us/2 (πλ + Lgλ Y )),
where we use the λ subscript to denote the dependence of the operator Tλ
on λ. Then the vacuum constraint equations for (g̃, π̃) simply translate to
the system
Tλ (u, Y ) = 0.
Our basic strategy is clear: for large λ, we look for a solution (u, Y ), which
then gives us our desired vacuum initial data (g̃, π̃) with harmonic asymp-
totics. Moreover, as λ → ∞, the resulting initial data should converge to
the original data (g, π).
For the time being, let us assume that DT∞ |(1,0) is invertible so that we
can use the inverse function theorem (Theorem A.43) to solve Tλ (u, Y ) = 0.
294 9. Density theorems for the constraint equations
1 Of course, the k in Yk here is indexing the sequence and does not mean the ith component.
9.3. The density theorem for DEC (Theorem 8.3) 295
can now apply the exact same inverse function theorem argument described
above. The upshot is that we obtain a sequence of vacuum initial data
s/2
(gk , πk ) = usk gλk + hk , uk (πλk + Lgλk Yk ) + wk ,
2,p 1,p
converging to (g, π) in the W−q × W−q−1 topology. The key point is that
although we do not know much about (hk , wk ), we do know that they are
compactly supported, and thus (gk , πk ) still has harmonic asymptotics.
In the theorem above, we solved for vacuum constraints, but upon exam-
ining the proof, it is clear that we can do something similar while preserving
any fixed constraints. Indeed, we can even specify a perturbation of those
fixed constraints.
Proposition 9.11. Let (M, g, π) be a complete asymptotically flat initial
data set with constraints (μ, J). There exist δ > 0 and a constant C such
that the following holds.
For any (ξ, Z) ∈ Lp−q−2 (M ) × Lp−q−2 (T M ) with (ξ, Z)Lp−q−2 < δ, there
exists (g̃, π̃) ∈ W 2,p (ḡ) × W 1,p such that its constraints (μ̃, J)
−q −q−1
˜ satisfy
˜ = (μ + ξ, J + Z),
(μ̃, J)
and
g̃ − gW 2,p < C(ξ, Z)Lp−q−2 , π̃ − πW 1,p < C(ξ, Z)Lp−q−2 .
−q −q−1
2,p 2,p
Moreover, there exists (u, Y ) ∈ W−q (1) × W−q (T M ) such that
g̃ = us ḡ,
π̃ = us/2 LY
˜ decays fast enough to apply
outside some compact set. In particular, if (μ̃, J)
Lemma 9.8, then (g̃, π̃) has harmonic asymptotics.
Exercise 9.12. Go through the proof of Theorem 9.10 and work out what
modifications need to be made in order to prove Proposition 9.11.
2,p 1,p
each end, (g̃, π̃) is -close to (g, π) in W−q × W−q−1 , and their constraints
˜ are -close to (μ, J) in L .
(μ̃, J) 1
Furthermore, we can choose (g̃, π̃) such that the strict dominant energy
˜ g̃ . Simultaneously, (μ̃, J)
condition holds. That is, μ̃ > |J| ˜ may be chosen to
decay as fast as we like.
Alternatively, we can choose (g̃, π̃) to be vacuum outside a compact set.
˜ = 0 outside a compact set.
That is, μ̃ = |J|
Note that by Lemma 8.4, the ADM energy-momentum may also be taken
to be -close to that of (g, π). Also observe that, compared to the statement
of Theorem 8.3, we added the conclusion that (μ̃, J) ˜ may be chosen to decay
as fast as we like. The meaning of this statement will be made explicit in
the proof.
According to Proposition 9.11, we can essentially prescribe the con-
straints we want, so naively it looks like Theorem 9.13 should follow easily.
However, the complication is that the dominant energy condition μ ≥ |J|g
depends on g, and when we prescribe constraints according to Proposi-
tion 9.11, we do not have control over the perturbed metric g̃. This problem
was solved in [EHLS16], and the underlying concept can be explained using
the following definition of Corvino and Lan-Hsuan Huang [CH16].
Definition 9.14. Given an initial data set (M, g, π), the modified constraint
operator Φ(g,π) is an operator on other initial data (γ, τ ) defined by
1
Φ(g,π) (γ, τ ) = Φ(γ, τ ) + 0, (γ · J) ,
2
where Φ is the usual constraint operator, J is the current density of the
original initial data (g, π), and the sharp operator is with respect to g, so
that (γ · J) is the vector field with components g ij γjk J k .
The main usefulness of this definition comes from the following obser-
vation, which tells us that knowledge of the modified constraints gives us
control over |J|g .
Lemma 9.15. Let (g, π) and (g̃, π̃) be initial data, and assume that
Φ(g,π) (g̃, π̃) − Φ(g,π) (g, π) = (2ψ, 0)
for some function ψ, where Φ(g,π) is the modified constraint operator. Then
as long as |g̃ − g|g ≤ 3, we have
˜ 2 ≤ |J|2 ,
|J| g̃ g
where J and J˜ are the momentum densities of (g, π) and (g̃, π̃), respectively.
9.3. The density theorem for DEC (Theorem 8.3) 297
Proof. Adopting the notation in the statement of the lemma and setting
h = g̃ − g, we can see that the main assumption reduces to the statement
1
J˜i + (h · J)i = J i .
2
We compute
˜ 2 = g̃ij J˜i J˜j
|J| g̃
1 1
= (gij + hij ) J − (h · J)
i i
J − (h · J)
j j
2 2
1
= (gij + hij ) J J − g hk J J + (h · J) (h · J)
i j i k j i j
4
3 1
= |J|2g − |h · J|2g + hij (h · J)i (h · J)j
4 4
≤ |J|g ,
2
where the last inequality holds as long as |h|g ≤ 3. Note the crucial can-
cellation that occurs in the fourth equality above; this is the underlying
motivation for the definition of the modified constraint operator.
The proof is the same as the proof of Theorem 9.9 with the only difference
being that one must check that the additional term 12 (h · J) does not affect
the argument.
We can now explain how to perturb to the strict dominant energy con-
dition.
the image of DT (1,0) . Let K1 be a complementing subspace for the kernel
of DT (1,0) in W−q
2,p 1,p
× W−q−1 , and now define
T̂ : [(1, 0) + K1 ] × K2 −→ Lp−q−2
by the formula
T̂ (u, Y, h, w) = Φ(g,π) (us g + h, us/2 (π + Lg Y ) + w),
so that D T̂ is an isomorphism. By the inverse function theorem, T̂
(1,0,0,0)
has a local inverse that maps a small r-ball around T̂ (1, 0, 0, 0) = Φ(g,π) (g, π)
in Lp−q−2 into a Cr-ball around (1, 0, 0, 0) in W−q 2,p
. For small enough t,
Φ(g,π) (g, π) + (t(f + |J|g ), 0) lies in that r-ball, and therefore we obtain our
desired solution (u, Y, h, w) such that
T̂ (u, Y, h, w) = Φ(g,π) (g, π) + (t(f + |J|g ), 0).
Setting (g̃, π̃) = (us g + h, us/2 (π + Lg Y ) + w), we obtain our desired (g̃, π̃)
solving equation (9.4). As usual, for small enough t, we can show that u > 0,
and elliptic regularity arguments show that (u, Y ) is smooth.
Finally, we check that for small enough t, (μ̃, J) ˜ is close to (μ, J) in L1 .
First, μ̃ − μ = t(f + |J|g ), which clearly approaches 0 in L1 as t → ∞.
Second, J˜ − J = 12 ((g̃ − g) · J) , which can also be taken to be small in L1
2,p
since g̃ − g is small in W−q ⊂ C 0.
However, this can be fixed by slightly perturbing the prescribed constraint fk Φ(g, π) to something
that does have Hölder decay but still preserves Lp−q−2 and L1 convergence to Φ(g, π).
Appendix A
A.1. Basics
Assumption A.1. Let (M, g) be a smooth Riemannian manifold (which
we assume is connected). Let V ∈ C ∞ (T M ), and let q ∈ C ∞ (M ). In this
section we will consider second-order elliptic linear operators of the form
Lu := −Δg u + V, ∇u + qu
for any u ∈ C 2 (M ).
Although many of the results in this section hold true for more general
operators (in particular, those with lower regularity of V , q, and even g),
this level of generality will be sufficient for most of our purposes. While this
keeps most of our hypotheses as simple as possible, the cost is that our results
are almost never stated with optimal regularity assumptions. Those optimal
regularity assumptions are actually quite important, not just for studying
low regularity phenomena, but also for studying nonlinear problems. The
301
302 A. Some facts about second-order linear elliptic operators
reason why we get away with such a simplistic approach here is that we are
focusing on the case of smooth metrics, and this book generally stays away
from the intricacies of nonperturbative nonlinear problems.
Note that we have chosen to define this L with −Δg instead of Δg . This
is convenient for us because −Δg has a nonnegative spectrum, and because
the conformal Laplacian and the linearizations of mean curvature H and the
null expansion θ+ all satisfy Assumption A.1 with this sign (up to a positive
constant).
The following standard fact, sometimes called the Hopf maximum prin-
ciple, can be found in [GT01, Lemma 3.4, Theorem 3.5], for example.
etc.) carry over to the manifold setting. Similarly, one can define Hölder
spaces on Riemannian manifolds, and similar remarks apply.
In fact, one does not really need to start with W 2,p regularity of u.
Even if u is only a distribution (as described in [Wik, Distribution
(mathematics)]) and Lu is only defined in the weak sense, elliptic regu-
larity still holds. We say that Lu = f in the weak sense if u, L∗ ϕ = f, ϕ
for all test functions ϕ, where L∗ denotes the adjoint operator and the an-
gle brackets represent the pairing between distributions and test functions.
See [Fol95], for example, for an introduction to distributions and also to see
how to prove the following regularity theorem.
Theorem A.5 (Interior elliptic regularity for weak solutions). Let (M, g)
be a smooth Riemannian manifold, and consider L as in Assumption A.1.
Given u in the dual space of Cck (Int M ) for some k ≥ 2, if Lu is smooth,
then u is smooth on Int M .
1 Many authors use the notation W0k+2,p (M ) to denote the completion of Cc∞ (Int M ) in the
W k+2,p norm.
304 A. Some facts about second-order linear elliptic operators
Proof. For this proof we will write Δ for Δg for notational simplicity. We
claim that the operator Δ : W0k+2,p −→ W k,p carries an injectivity estimate.
That is, there exists C such that for all u ∈ W0k+2,p ,
(A.1) uW k+2,p ≤ CΔuW k,p .
We also claim a similar estimate for Δ : C0k+2,α −→ C k,α . Of course, this is
a stronger statement than injectivity.
We will start by proving the simpler inequality
uLp ≤ CΔuLp
for all u ∈ W02,p . Suppose there does not exist such a constant. Then
we can find a sequence ui with ui Lp = 1 and Δui → 0 in Lp . By the
A.1. Basics 305
over all v ∈ W01,2 . Together with the Poincaré inequality, the Lax-Milgram
Theorem [Wik, Weak formulation] can be used to show that a minimizer
u ∈ W01,2 exists, and then it is straightforward to see that u must solve
Δu = f weakly.2 (See [Eva10, Section 6.2], for example, for details of this
argument.) By elliptic regularity, u ∈ C0∞ .
Now consider Δ : W0k+2,p −→ W k,p and attempt to prove surjectivity.
Let f ∈ W k,p . Then there exists a sequence fi in C ∞ such that fi → f
in W k,p . As observed above, we can find ui ∈ C0∞ solving Δui = fi . By
the injectivity estimate described above, the fact that fi is Cauchy in W k,p
implies that ui is Cauchy in W k+2,p , and then its limit u ∈ W0k+2,p must
solve Δu = f . The surjectivity proof for Δ : C0k+2,α −→ C k,α proceeds in
the same way.
2 This is not quite as “weak” as our previously stated definition of a weak solution, so elliptic
regularity just follows from energy methods rather than the more sophisticated result, Theo-
rem A.5.
306 A. Some facts about second-order linear elliptic operators
Proof. The proof is similar to the previous one, but we have to make some
small adjustments. First observe that the condition M v dμ = 0 on v is a
closed condition in all of the spaces W k,p and C k,α . This fact will be used
implicitly in the following.
Obviously, the constants lie in the kernel of Δ. Instead of the injectivity
estimate (A.1), we prove that there exists a constant C such that for all
u ∈ W k+2,p ,
(A.2) if u dμ = 0, then uW k+2,p ≤ CΔuW k,p .
M
We can also prove a similar statement for C k+2,α . Note that (A.2) is a
stronger statement than the claim that the kernel is spanned by constants.
The proof of (A.2) is the same as the proof of (A.1), except that now when
we reach the point where Δu = 0, we can only conclude from the maximum
principle that u is constant, not that it is zero. But this will still imply that
u = 0 as long as the ui ’s were chosen so that M ui dμ = 0.
Now consider the range of Δ. By the divergence theorem, it is clear that
the range of Δ is orthogonal to the constants. We just need to solve Δu = f
for any f orthogonal to the constants. As in the proof of Theorem A.7, we
∞
start with f ∈ C with M f dμ = 0, and attempt to solve Δu = f . The
proof is similar to the surjectivity proof above,
except that we minimize the
functional A(v) over all v ∈ W 1,2 such that M v dμ = 0. Using the Poincaré
inequality (for functions with average equal to 0), we can show that Lax-
Milgram applies once again, and therefore we can find the desired minimizer
u ∈ W 1,2 . However, this minimizer only has the property that Δu − f is a
constant (not necessarily zero). But since the average value of f is zero by
hypothesis, that constant must be zero, and hence Δu = f . The rest of the
proof is identical.
where u± are the positive and negative parts of u, and the inequality follows
from the minimizing property of λ1 . Since the inequality is actually an
equality, u± are also eigenfunctions of L with eigenvalue λ1 . For large enough
C (specifically, C > sup |q− |), the maximum principle can be applied to
the operator L + C, and we can also choose C large enough so that λ1 +
C ≥ 0. Thus (L + C)u± = (λ1 + C)u± ≥ 0. Since u+ ≥ 0, the strong
+
maximum principle implies that either u+ > 0 in Int M and ∂u ∂ν < 0 at
∂M , or else u+ = 0. The exact same statement is true for u− , and obviously
u+ and u− cannot be simultaneously positive, so one of them must vanish
identically, showing that u indeed has a sign in the interior (and the desired
normal derivative at the boundary). Since two positive functions can never
be orthogonal, it follows that the eigenspace is one-dimensional.
= (Lv)v
M
Lv Lv
≤ sup v 2 = sup ,
M Int M v Int M v
Therefore ϕ1 −v achieves equality in the upper bound for λ1 , and we showed
that this is only possible if ϕ1 − v itself is a principal eigenfunction. Thus v
is a principal eigenfunction with Lv = λ1 v. But Lv ≥ γv by definition of γ,
giving us a contradiction unless γ = λ1 . Tracing the logic, we have shown
that γ ≤ λ1 with equality only if v is a principal eigenfunction.
It is also true that this real eigenvalue in the conclusion of the theorem
is equal to the spectral radius of T , but we will not need this fact. The proof
of Theorem A.10 given in [Eva10, Section 6.5] efficiently rolls together the
proof of Krein-Rutman with the reduction to Krein-Rutman, but instead we
will separate the two arguments.
General proof of Theorem A.10. First, observe that for any constant
c, the operators L and L + c have the exact same eigenfunctions, and the
eigenvalues are simply translated by c, so we may as well assume without
A.1. Basics 311
L : C02,α −→ C 0,α
G : C 0,α −→ C 0,α ,
converges absolutely in X for all |z| < ρ. So for any z ∈ (0, ρ),
∞
F (z) = u + z n+1 T n (T u)
n=0
∞
≥u+ z n+1 T n (γu)
n=0
= u + zγF (z),
A.1. Basics 313
where the inequality follows from the fact that T n (T u − γu) ≥ 0 for n ≥ 0.
So we have (1 − zγ)F (z) ≥ u. Since F (z) > 0 and u ≥ 0 is nontrivial, this
is only possible if 1 − zγ > 0. That is, γ < z −1 . Since this is true for all
z < ρ, it follows that γ ≤ ρ−1 = v, completing our proof of the Claim.
The result follows fairly easily from the Claim. We choose a sequence
of γi ∈ A such that γi → λ. By definition of A and the Claim, for each λi ,
there is a wi ∈ K such that wi ≥ γi and T wi ≥ γwi . Finally, since K is
compact, we can find a subsequence of wi converging to some ϕ ∈ K. Then
T ϕ ≥ λϕ, and the lower bound on wi guarantees that ϕ is nontrivial.
To prove that ϕ is the desired eigenfunction, suppose that T ϕ − λϕ is not
identically zero. Then our assumption on T implies the strict inequality
T (T ϕ − λϕ) > 0, so that for some small > 0, T (T ϕ − λϕ) ≥ T ϕ, which
implies if we set u = T ϕ ∈ C {0}, then T u ≥ (λ + )u. But that means
that λ + ∈ A, contradicting the definition of λ. Hence T ϕ = λϕ, and
ϕ ∈ Int C since it is in the image of T .
It only remains to show that this λ has a one-dimensional eigenspace.
Suppose that u ∈ X also has T u = λu. Let s > 0 be the largest number such
that both 0 ≤ ϕ + su and 0 ≤ ϕ − su, which clearly exists. If one of these
inequalities is an exact equality, then u = ±s−1 ϕ and we are done. Suppose
that neither one is, so that ϕ + su, ϕ − su ∈ C {0}. Then the positivity
property of T implies that both T (ϕ + su) and T (ϕ − su) are positive, which
means that ϕ + su and ϕ − su are strictly positive, since both functions are
λ-eigenfunctions. But this means there is enough wiggle room to contradict
the maximality of s.
Without too much more work in the self-adjoint case, we can construct
a complete orthonormal set of eigenfunctions.
Theorem A.13 (Spectral theorem for self-adjoint elliptic operators). Let
(M, g) be a Riemannian compact manifold, possibly with boundary, and con-
sider L as in Assumption A.1. We further assume that V = 0. Then
there exists a sequence of smooth (Dirichlet) eigenfunctions ϕ1 , ϕ2 , ϕ3 , . . . ∈
C ∞ (Int M )∩C 0 (M ) vanishing at ∂M with corresponding discrete (Dirichlet)
eigenvalues λ1 < λ2 ≤ λ3 ≤ · · · diverging to ∞ such that for each i,
Lϕi = λi ϕi ,
and the {ϕi } form a complete orthonormal set in L2 (M ).
314 A. Some facts about second-order linear elliptic operators
A(ϕ2 , u) = λ2 ϕ2 , u
k
uk := u, ϕi ϕi
i=1
where s is the greatest integer function, and H≤s (Rn ) := {0} if s < 0.
Corollary A.16. Let Hk (S n−1 ) denote the space of the restrictions of ele-
ments of Hk (Rn ) to the unit sphere. Then
∞
>
2
L (S n−1
)= Hk (S n−1 ).
k=0
where the function Zk (·, ξ) ∈ Hk (Rn ) is sometimes called the zonal harmonic
with pole ξ. By axisymmetry considerations, we can see that Zk (x, ξ) =
|x|k Pk (x̂ · ξ) for some polynomial Pk , where x̂ := x/|x|. This generalizes the
well-known formula for n = 2,
∞
1
K(x, ξ) = 1+ |x|k cos kθ ,
2π
k=0
For a full explanation of this whole story, see [ABR01, Chapter 5]. If
we only want the expansion to hold in a small neighborhood of the origin
(which is sometimes all that is needed), the proof is quite a bit simpler.
Note that since a partial derivative of a harmonic function is also harmonic,
one can see that the expansion can be differentiated, or, in other words, the
series converges in C m on compact subsets for any m.
We can use the expansion in Theorem A.18 to help us understand the
asymptotics of a harmonic function defined on an exterior region of Rn .
Given such a harmonic function u on Rn B̄ρ , we can use a Kelvin transform
to construct a new harmonic function u∗ defined on B1/ρ {0} via
u∗ (x) = |x|2−n u(x/|x|2 ).
If limx→∞ u(x) = 0, then limx→0 |x|n−2 u∗ (x) = 0, and it follows that u∗ has
a removable singularity at the origin. Hence,
we can apply Theorem A.18
to u∗ , giving us the expansion u = |x|2−n ∞ h
k=0 k (x/|x| 2 ) for some h ∈
k
Hk (Rn ). So we have the following corollary.
Corollary A.19. Let ρ0 > 0, and let u be a harmonic function on Rn
Bρ0 (0) such that limx→∞ u(x) = 0. Then there exist spherical harmonics
ϕk ∈ Hk (S n−1 ) such that
∞
u= |x|2−n−k ϕk (x/|x|),
k=0
where the convergence is absolute and uniform over |x| > ρ for any ρ > ρ0 .
We also obtain convergence of any number of derivatives.
One can actually go further and cook up a Laurent expansion for any
harmonic function defined on an annulus [ABR01, Chapter 10]. In the next
section, we will prove a useful generalization of Corollary A.19.
this condition is not used for any of the analytic results on asymptotically flat manifolds in this
Appendix.
A.2. Weighted spaces on asymptotically flat manifolds 319
would naturally look for a solution u that decays two orders slower than f .
Even for the case of Euclidean space, the theory is nontrivial.
With this motivation in mind, we briefly present some of the theory of
weighted spaces on asymptotically flat manifolds and the corresponding el-
liptic theory. The use of these weighted spaces to study elliptic problems
on Rn was first developed by L. Nirenberg and H. Walker [NW73] and
M. Cantor [Can74]. Much of what appears in this section can be found in
R. Bartnik’s paper [Bar86, Sections 1 and 2], but the current understanding
of this theory was built up over many years by contributions from N. Mey-
ers, Y. Choquet-Bruhat, A. Chaljub-Simon, R. McOwen, R. Lockhart, and
D. Christodoulou, among others. See [Mey63, CSCB79, McO79, McO80,
Loc81, Can81, CBC81, LM83].
Definition A.20. Let (M n , g) be a complete asymptotically flat manifold.
Slightly abusively, we will choose r to be a smooth positive function on M
such that r = |x| in each asymptotically flat coordinate chart. Given any
p ≥ 1, s ∈ R, we define the weighted Lebesgue space Lps (M ) to be the space
of all functions u ∈ Lploc (M ) with finite weighted norm
1/p
uLps (M ) = |u|p r−sp−n dμ .
M
We can also analogously define weighted Sobolev spaces for tensors and
spinors.
of g or r. (In your answer, you may keep the choice of asymptotically flat
coordinate charts fixed.) What about Ws3,p ?
Just as there are weighted Sobolev spaces, there are also weighted Hölder
spaces.
where
|v(x) − v(y)|
[v]C α (U ) := sup
x,y∈U d(x, y)α
denotes the Hölder coefficient of the tensor-valued function v over the set U .
The quantity |v(x) − v(y)| can be defined using parallel translation along
a minimizing geodesic connecting x to y. (We may take r/2 to be smaller
than the injectivity radius.)
Exercise A.24. Show that this is a simple consequence of the usual Hölder
inequality.
Proof. This was first observed in [CBC81], but the proof is fairly straight-
forward. First, the embedding itself follows from weighted Sobolev embed-
ding and the weighted Hölder inequality. Let ui be a bounded sequence in
Wsk,p (M ). Then we can extract a subsequence (which we will also write as
ui ) that converges weakly to some u ∈ Wsk,p (M ). We claim that a subse-
quence of this ui converges to u in Lqt . Each sufficiently large radius ρ > 0
divides M into an exterior region Eρ = Rn B̄ρ and an interior compact set
Kρ = M Eρ (we assume one end for simplicity).
Let > 0. Since the Lqt (Eρ ) norm is bounded by ρs−t times the Lqs (Eρ )
norm, which in turn is bounded by the Wsk,p (Eρ ) norm (by weighted Sobolev
embedding), it is clear that we can choose ρ large enough so that
ui − uLqt (Eρ ) < /2. Meanwhile, ui is certainly bounded in Wsk,p (Kρ ),
and since Kρ is a compact manifold with boundary, the standard Rellich-
Kondrachov compactness theorem implies that a subsequence of ui converges
in Lqt (Kρ ), and the only thing it could converge to is the restriction of u to
Kρ . So there exists some particular i1 such that ui1 − uLqt (Kρ ) < /2, and
thus ui1 − uLqt (M ) < . Since this can be done for any > 0, the result
follows.
Moreover, if u ∈ Cs0 (M )
and the weak object Lu can be identified with a
0,α
function in Cs−2 (M ), then u ∈ Cs2,α (M ).
Outline of the proof of Theorem A.33. The proof assumes some knowl-
edge of some of the precursors of Theorem A.6. Specifically, there exists an
interior elliptic Hölder estimate for a fixed annulus. That is, for annuli
A ⊂⊂ A lying in an asymptotically flat end, we have
(A.3) uC 2,α (A ) ≤ C(LuC 0,α (A) + uC 0 (A) ),
where C depends on A, A , α, and the coefficients of L. If we scale it up by
a factor of ρ, we obtain the scale-invariant estimate
(A.4) u∗C 2,α (ρA ) ≤ C(ρ2 Lu∗C 0,α (ρA) + uC 0 (ρA) ),
where we define
2
|D 2 u(x) − D 2 u(y)|
u∗C 2,α (ρA ) ρk sup |D k u| + ρ2+α · sup ,
x,y∈ρA |x − y|α
k=0
and Lu∗C 0,α (ρA) is defined similarly. The point is that these scale-invariant
norms, together with our hypotheses on g, V , and q, allow us to choose the
constant C in (A.4) to be independent of ρ. (The interior estimate and the
claimed dependence of C on the coefficients of L both follow from [GT01,
Theorem 6.2], for example.) Then of course we have
ρ−s u∗C 2,α (ρA ) ≤ C(ρ2−s Lu∗C 0,α (ρA) + ρ−s uC 0 (ρA) ).
Next, observe that on the annulus ρA, the constant ρ and the function
r(x) = |x| are uniformly bounded by each other. Therefore we can patch
these estimates together for an appropriate sequence of ρ’s (as well as patches
covering the compact part of M ) in a standard way to obtain our desired
weighted global estimate on M .
As for the regularity statement, standard interior elliptic regularity is
enough to guarantee that u is locally C 2,α regular, so really we only need to
show that uCs2,α (M ) < ∞, and the patching argument described above is
sufficient to do that.
(Prove this as an exercise.) One can combine this with the estimate (A.3)
and a standard (but nonobvious) PDE argument that shows that we have
the estimate
uC 2,α (A ) ≤ C(LuC 0,α (A) + uLp (A) ).
Scaling this one up by ρ leads to
−s− n
ρ−s u∗C 2,α (ρA ) ≤ C(ρ2−s Lu∗C 0,α (ρA) + ρ p uC 0 (ρA) ),
We now present the appropriate analog of Theorem A.8 for the whole
Euclidean space Rn .
Proof of the Hölder case. We will present only the Hölder case since it
seems to be underrepresented in the literature, and it is also simpler. The
computation of the kernel is fairly elementary: let u ∈ Cs2,α (Rn ) be har-
monic. If s < 0, then the maximum principle implies that u is identically
zero. If s > 0, gradient estimates for harmonic functions (as in [GT01, The-
orem 2.10]) imply that taking any s + 1 partial derivatives of u results in
a new harmonic function lying in u ∈ Cs−s−1
0 (Rn ), which again vanishes
because of the maximum principle. Therefore u must be a polynomial of
degree at most s, which is what we wanted to show. Note that the kernel
is finite dimensional.
Most of the work is in computing the range. First consider the case
2 − n < s < 0. In this case, we want to show that Δ : Cs2,α −→ Cs0,α
is surjective. Given f ∈ Cs0,α , we can construct a solution to the Poisson
equation Δu = f using the Newtonian potential in the usual way. That is,
we define
1
u(x) := |x − y|2−n f (y) dy.
(2 − n)ωn−1 Rn
Note that the decay of f guarantees that the integral is finite. For now,
assume that f ∈ Cc∞ . Then it is well known that Δu = f (e.g., [Eva10,
Section 2.2.1]).
Our main task is to check that this solution u lies in Cs2,α . The first step
is to show that u ∈ Cs0 (Rn ). We have
1
|u(x)| ≤ f Cs−2
0 |x − y|2−n r(y)s−2 dy,
(n − 2)ωn−1 R n
where we recall that r(x) is a smooth positive function equal to |x| outside
a compact set. For simplicity, let us choose the function r ≥ 1 so that
r(x) = 1 for |x| ≤ 1 and r(x) = |x| for |x| ≥ 2. (Different choices for r lead
to equivalent norms.)
A.2. Weighted spaces on asymptotically flat manifolds 327
Claim.
|x − y|2−n r(y)s−2 dy ≤ Cr(x)s
Rn
for some constant C. (Throughout this proof, we will use C as a generic
constant that can change from line to line.)
Ex = Rn B2|x| (0).
Suppose that |x| ≥ 4, so that |x| = r(x). Then we have
2−n
1
|x − y| 2−n
r(y) s−2
dy ≤ |x| |y|s−2 dy
Bx |y|< |x|
1 2
2
s
= C|x| ,
where we needed the fact that s > 2 − n in order to integrate |y|s−2 , and
s−2
2−n 1
|x − y| 2−n
r(y) s−2
dy ≤ |x − y| 2 x dy
Ax Ax
s−2
1
≤ x |x − y|2−n dy
2 |x−y|<3|x|
= C|x|s ,
2−n
1
|x − y| 2−n
r(y) s−2
dy ≤ |y| |y|s−2 dy
Ex |y|>2|x| 2
= C|x|s ,
where we needed the fact that s < 0 in order to integrate |y|−n+s . For the
case |x| < 4, we just have to estimate the integrals above by a constant in
order to complete our proof of the Claim. The first two are trivial, and we
leave the third as an exercise. The Claim implies that u ∈ Cs0 , and then we
can invoke the weighted elliptic Hölder estimate (Theorem A.33) to conclude
that
uCs2,α ≤ Cf C 0,α .
s−2
0,α
For the general case of f ∈ Cs−2 ,
we can approximate it by a sequence of
0,α
smooth compactly supported functions fi that converge to f in Cs−2 and
then solve Δui = fi as above. Then the estimate above implies that their
ui is Cauchy in Cs2,α , and then its limit u solves Δu = f . This completes
the proof in the case 2 − n < s < 0.
328 A. Some facts about second-order linear elliptic operators
Next we consider the case 1 − n < s < 2− n (as a simple but representa-
tive case). Let u ∈ Cs2,α , and observe that Rn Δu dx = 0 by the divergence
theorem, since s < 2 − n guarantees that the boundary term at infinity van-
ishes. Therefore the Δ image of Cs2,α in Cs0,α is annihilated by the constants,
which is the same thing as H≤2−n−s (Rn ) = H0 (Rn ). We must now show
2,α
that if f ∈ Cs−2 with Rn f dx = 0, then we can find u ∈ Cs2,α such that
Δu = f . We use the same strategy as before, defining u in the same way. As
we saw above, it will suffice to show that for any f ∈ Cc∞ with Rn f dx = 0,
we have
uCs0 ≤ Cf Cs−2
0 .
However, this time we see that the estimate of the integral over Bx fails
since we no longer have s > 2 − n. Instead, we make the observation that
1
u(y) = |x − y|2−n − r(x)2−n f (y) dy,
(2 − n)ωn−1 Rn
since the second term integrates to zero by hypothesis on f . Similar to
before, we will prove the estimate
|x − y| 2−n
− r(x) 2−n
r(y) s−2
dy ≤ Cr(x)s .
Rn
As before we only treat the case |x| ≥ 4 and leave the |x| < 4 case as an
exercise:
|y|
|x − y| 2−n
− r(x) 2−n
r(y) s−2
dy ≤ C|x − y|2−n |y|s−2 dy
|x|
Bx Bx
1−n
1
≤C |x| |y|s−1 dy
2 |y|< |x|
1
2
= C|x|s ,
where we used the fact that s > 1 − n. The first inequality just follows
|y|
from the binomial theorem and the fact that |x| < 12 in Bx . Estimating the
integral over Ax causes no new problems, and for Ex , we can use the old
estimate together with an estimate of the new term:
r(x) 2−n
r(y) s−2
dy = |x| 2−n
|y|s−2 dy = C|x|s ,
Ex |y|>2|x|
where we used the fact that s < 2−n. Therefore we have all of the estimates
needed to complete the case 1 − n < s < 2 − n.
Now we can summarize how the argument works in general when s <
2 − n. Using Green’s second identity (i.e., integration by parts), we can
see that the Δ image of Cs2,α in Cs−2
0,α
is annihilated by every element of
A.2. Weighted spaces on asymptotically flat manifolds 329
H≤2−n−s (Rn ), because the boundary terms vanish quickly enough. Con-
versely, for any f ∈ Cc∞ annihilated by H≤2−n−s (Rn ), we define
1
u(x) := |x − y|2−n f (y) dy,
(2 − n)ωn−1 Rn
|y| k
|x − y| 2−n
= |x| 2−n
Pk (x̂ · ŷ)
|x|
k=0
|y|
since |x| ≤ 12 . So
3−n−s
|y|
Γs (x, y)r(y) s−2
dy ≤ C|x| 2−n
|y|s−2 dy
|x|
Bx |y|< 12 |x|
= C|x|s ,
where the integral can be computed because
3 − n − s + (s − 2) > (2 − n − s) + (s − 2) = −n.
Estimating the integral over Ax is straightforward. For Ex , the |x − y|2−n
is estimated just as before while for y ∈ Ex , the other term in the integrand
can be estimated
2−n−s
|y|
k
2−n
Γs (x, y) − |x − y| = |x| 2−n
Pk (x̂ · ŷ)
|x|
k=0
2−n−s
|y|
≤ C|x|2−n ,
|x|
|y|
since |x| ≥ 2. Therefore
Γs (x, y) − |x − y|2−n r(y)s−2 dy
Ex
2−n−s
|y|
≤ C|x| 2−n
|y|s−2 dy
|y|>2|x| |x|
= C|x|s ,
where the integral can be computed because
2 − n − s + (s − 2) < (2 − n − s) + (s − 2) = −n.
This gives us all of the estimates needed for the s < 2 − n case.
All that remains is the s > 0 case. For this case, instead of expanding the
fundamental solution |x − y|2−n in spherical harmonics for x in the exterior
and y in the interior, we should swap their roles. That is, we write
∞
|x| k
|x − y|2−n = |y|2−n Pk (x̂ · ŷ),
|y|
k=0
and define
s
|x| k
Γs (x, y) := |x − y|2−n − |r(y)|2−n Pk (x̂ · ŷ).
r(y)
k=0
Let f ∈ Cc∞ and this time we define
u(x) := Γs (x, y)f (y) dy.
Rn
A.2. Weighted spaces on asymptotically flat manifolds 331
Observe the new term that has been subtracted from |x − y|2−n is actually
a harmonic polynomial in x, and consequently we still have Δu = f . As
before, we want to show that uCs2,α ≤ Cf C 0,α . Keep in mind that since
s−2
Δ now has a kernel, this solution u will not be a unique solution. It is a
specially chosen solution.
For estimating the integrals, the roles of Bx and Ex are in some sense
swapped. In order to estimate Bx , we use the fact that for y ∈ Bx , we can
estimate
|x| s
2−n
Γs (x, y) − |x − y| ≤ C|r(y)| 2−n
.
r(y)
Again, the estimate for Ax is straightforward, and the estimate for y ∈ Ex
uses
s+1
2−n |x|
|Γs (x, y)| ≤ C|y|
|y|
and the fact that (2 − n) − s + 1 + (s − 2) < −n. The reader can fill in
the details.
The proof of the Sobolev version of the theorem is fairly similar. The
solution u is defined in the same way, and the expansion of |x − y|2−n is
used in the same way, with the difference being that instead of proving C 0
estimates on u, one requires Lp estimates on u. These are a bit trickier, and
this is what was supplied by Nirenberg and Walker in [NW73, Lemma 2.1].
The next two results go back to work of Meyers [Mey63], who used some
of the same main ideas that went into the proof of Theorem A.35 above.
Corollary A.37. Let α ∈ (0, 1) and ρ > 0, and let τ < s be real numbers
not in the exceptional set Λ. Assume that u ∈ Cs2,α (Rn Bρ (0)) and Δu ∈
Cτ0,α
−2 (R Bρ (0)). Then there exists hk ∈ Hk (R {0}) for k = τ +
n n
where we simply ignore the sum as vacuous if τ + 1 > s and also ignore
terms coming from k ∈/ Λ.
The following limited version of the previous corollary for operators that
are asymptotic to Δ will be useful to us.
Corollary A.38. Let ρ > 0, and let τ < s be real numbers not in the
exceptional set Λ. Let (M = Rn Bρ (0), g) be an asymptotically flat manifold
with boundary, and consider L and α as in Assumption A.30.
Assume that u ∈ Cs2,α (M ) and Lu ∈ Cτ0,α
−2 (M ).
Proof. We will show that we can improve the initial rate s. Let f = Lu ∈
Cτ0,α
−2 so that we can rewrite the equation Lu = f as
Δu = (Δg − Δ)u + V i ∂i u + qu − f,
where Δ is the Euclidean Laplacian. Without loss of generality, let us assume
that the δ from Assumption A.30 is less than the asymptotic decay rate of
0,α
g. Then we can see that (Δg −Δ)u+V i ∂i u+qu ∈ Cs−2−δ , and consequently
we have for any s < 0,
0,α
(A.5) u ∈ Cs2,α (M ) =⇒ Δu ∈ Cmax(s−2−δ,τ −2)
(M ).
If we combine this implication with Corollary A.37, we see that we can easily
bootstrap our way to the first two assertions of Corollary A.38. However,
since Corollary A.37 will never give us better than u ∈ Ck2,α (M ), we see that
the bootstrapping argument must terminate here. But in this case, we can
A.2. Weighted spaces on asymptotically flat manifolds 333
0,α
still see that Δu ∈ Cmax(k−2−δ,τ −2) (M ). By applying Corollary A.38 one last
time, we can see that specifically, the γ in the conclusion of the corollary
can be chosen so that k − γ is the maximum of τ , k − δ (keeping in mind
that δ is less than the asymptotic decay rate), and the next largest element
of Λ (which is k − 1, except when k = 0). In particular, if τ < k − 1 and
δ > 1, we see that the third assertion of Corollary A.38 follows.
For the Sobolev case, Cantor [Can81, Corollary 6.5], Lockhart [Loc81,
Theorem 6.2], and McOwen [McO80], all saw how the Fredholm property
of the Laplacian could be transferred to other operators that are asymptotic
to the Euclidean Laplacian, which is the basic idea behind Theorem A.40,
and then the dimensions of the kernel and cokernel can be computed using
the self-adjointness. See also [Bar86, Proposition 2.2]. For the Hölder case,
as mentioned, the 2 − n < s < 0 case is treated in [CSCB79], but the full
result seems hard to find.
We will use the following lemma to prove Theorem A.40. It is an im-
provement of the estimates of Theorems A.32 and A.33 when the decay rate
s is nonexceptional, but note that it does not come with a corresponding
“regularity of decay” result as in Theorems A.32 and A.33.
Lemma A.41. Let (M, g) be a complete asymptotically flat manifold, and
let s be any real number not in the exceptional set Λ. For large ρ > 1, let
Kρ be the compact subset of M enclosed by the sphere |x| = ρ.
For p > 1, there exists a constant C and a ρ > 1 such that for any
u ∈ Ws2,p (M ),
uWs2,p (M ) ≤ C(Δg uLps−2 (M ) + uL1 (Kρ ) ).
334 A. Some facts about second-order linear elliptic operators
For α ∈ (0, 1), there exists a constant C and a ρ > 1 such that for any
u ∈ Cs2,α (M ),
uCs2,α (M ) ≤ C(Δg uC 0,α (M ) + uL1 (Kρ ) ).
s−2
Proof. Again, we will only present the proof for Hölder spaces, which is es-
sentially the same as the proof for Sobolev spaces in [Bar86, Theorem 1.10].
As usual, we use a generic constant C that can change from line to line.
First let us prove the result for the Euclidean Laplacian Δ on Rn . For
s < 0, Δ is injective, and our proof of Theorem A.35 showed directly that
for all u ∈ Cs2,α (Rn ),
uCs2,α (Rn ) ≤ CΔuC 0,α (Rn ) .
s−2
So we consider the case s > 0. In this case we do not have the above bound
(since injectivity fails), but our proof of Theorem A.35 showed that for
every u ∈ Cs2,α (Rn ), there exists v, h ∈ Cs2,α (Rn ) such that h is a harmonic
polynomial and
vCs2,α (Rn ) ≤ CΔuC 0,α (Rn ) .
s−2
Given any ρ > 0, we want to prove that there exists C such that the bound
uCs2,α (Rn ) ≤ C(ΔuC 0,α (Rn ) + uL1 (Bρ ) )
s−2
for all h ∈ H≤s (Rn ). In particular, this implies the hi → 0 in Cs2,α (Rn ).
Combining this with (A.6) contradicts the assumption that ui Cs2,α (Rn ) = 1.
Hence, we have our desired estimate
uCs2,α (Rn ) ≤ C(ΔuC 0,α (Rn ) + uL1 (Bρ ) ).
s−2
Next we show how the estimate for Δ on Rn implies the desired estimate
for Δg on M . By the elliptic estimate (Theorem A.33), we have
uCs2,α (M ) ≤ C(Δg uC 0,α (M ) + uCs0 (M ) )
s−2
A.2. Weighted spaces on asymptotically flat manifolds 335
The asymptotic flatness of g implies that for sufficiently large ρ, the second
term on the right can be absorbed into the left side to obtain
u1 Cs2,α (M ) ≤ CΔg u1 C 0,α (M ) .
s−2
Proof of the Hölder case of Theorem A.40. First we will prove that
0,α
Δg : Cs2,α (M ) −→ Cs−2 (M )
By the same reasoning used to prove Corollary A.9, we have the following
immediate consequence of Theorem A.40.
Corollary A.42. Let (M n , g) be a complete asymptotically flat manifold,
and let s be any real number not in the exceptional set Λ.
Consider L as in Assumption A.29. Then for any p > 1,
L : Ws2,p (M ) −→ Lps−2 (M )
is a Fredholm operator whose index is the same as that of the Euclidean
Laplacian Δ : Ws2,p (Rn ) −→ Lps−2 (Rn ).
Consider L and α as in Assumption A.30. Then
L : Cs2,α (M ) −→ Cs0,α (M )
is also a Fredholm operator with the same index as above.
Note that unlike the implicit function theorem in finite dimensions, this
theorem does not easily follow from the inverse function theorem, since
ker DF |x0 need not have a complementing subspace. If it does (for example,
if ker DF |x0 is finite dimensional), the result follows from applying the in-
verse function theorem to the augmented map (F, Π) : X −→ Y ×ker DF |x0 ,
where Π is the projection onto ker DF |x0 .
Proof. Without loss of generality, let us take x0 = 0 and F (0) = 0 for sim-
plicity. The hypotheses of the theorem say that F is a C 1 map from Br0 (0)
to Y for some r0 > 0, but we will present the proof with the stronger hy-
pothesis that DF |x is Lipschitz in x on Br0 (0), that is, F is C 1,1 . We do this
partly so that we can more easily describe how large the neighborhoods are,
and partly so that the proof will be analogous to our proof of Theorem A.43,
which was also written with the C 1,1 hypothesis. The modifications needed
for the more general C 1 hypothesis are left as an exercise.
The basic idea of this proof is essentially the same as that of the inverse
function theorem, except that we set it up as an iteration scheme and instead
of applying the inverse map (DF |0 )−1 , we will need to choose an appropriate
preimage under DF |0 at each step.
340 A. Some facts about second-order linear elliptic operators
[Ago13] Ian Agol, The virtual Haken conjecture, Doc. Math. 18 (2013), 1045–1087. With an
appendix by Agol, Daniel Groves, and Jason Manning. MR3104553
[AIK10] S. Alexakis, A. D. Ionescu, and S. Klainerman, Uniqueness of smooth stationary black
holes in vacuum: small perturbations of the Kerr spaces, Comm. Math. Phys. 299
(2010), no. 1, 89–127, DOI 10.1007/s00220-010-1072-1. MR2672799
[All72] William K. Allard, On the first variation of a varifold, Ann. of Math. (2) 95 (1972),
417–491, DOI 10.2307/1970868. MR0307015
[Alm66] F. J. Almgren Jr., Some interior regularity theorems for minimal surfaces and
an extension of Bernstein’s theorem, Ann. of Math. (2) 84 (1966), 277–292, DOI
10.2307/1970520. MR0200816
[Alm00] Frederick J. Almgren Jr., Almgren’s big regularity paper: Q-valued functions minimiz-
ing Dirichlet’s integral and the regularity of area-minimizing rectifiable currents up
to codimension 2, World Scientific Monograph Series in Mathematics, vol. 1, World
Scientific Publishing Co., Inc., River Edge, NJ, 2000. With a preface by Jean E.
Taylor and Vladimir Scheffer. MR1777737
[Amb15] Lucas C. Ambrozio, On perturbations of the Schwarzschild anti-de Sitter spaces of
positive mass, Comm. Math. Phys. 337 (2015), no. 2, 767–783, DOI 10.1007/s00220-
015-2360-6. MR3339162
[ACG08] Lars Andersson, Mingliang Cai, and Gregory J. Galloway, Rigidity and positivity of
mass for asymptotically hyperbolic manifolds, Ann. Henri Poincaré 9 (2008), no. 1,
1–33, DOI 10.1007/s00023-007-0348-2. MR2389888
[AD98] Lars Andersson and Mattias Dahl, Scalar curvature rigidity for asymptotically lo-
cally hyperbolic manifolds, Ann. Global Anal. Geom. 16 (1998), no. 1, 1–27, DOI
10.1023/A:1006547905892. MR1616570
[AEM11] Lars Andersson, Michael Eichmair, and Jan Metzger, Jang’s equation and its appli-
cations to marginally trapped surfaces, Complex analysis and dynamical systems IV.
Part 2, Contemp. Math., vol. 554, Amer. Math. Soc., Providence, RI, 2011, pp. 13–45,
DOI 10.1090/conm/554/10958. MR2884392
[AGH98] Lars Andersson, Gregory J. Galloway, and Ralph Howard, A strong maximum princi-
ple for weak solutions of quasi-linear elliptic equations with applications to Lorentzian
and Riemannian geometry, Comm. Pure Appl. Math. 51 (1998), no. 6, 581–624, DOI
10.1002/(SICI)1097-0312(199806)51:6581::AID-CPA23.3.CO;2-E. MR1611140
343
344 Bibliography
[AMS08] Lars Andersson, Marc Mars, and Walter Simon, Stability of marginally outer trapped
surfaces and existence of marginally outer trapped tubes, Adv. Theor. Math. Phys.
12 (2008), no. 4, 853–888. MR2420905
[AM09] Lars Andersson and Jan Metzger, The area of horizons and the trapped region,
Comm. Math. Phys. 290 (2009), no. 3, 941–972, DOI 10.1007/s00220-008-0723-y.
MR2525646
[ADM60] R. Arnowitt, S. Deser, and C. W. Misner, Energy and the criteria for radiation in
general relativity, Phys. Rev. (2) 118 (1960), 1100–1104. MR0127945
[ADM61] R. Arnowitt, S. Deser, and C. W. Misner, Coordinate invariance and energy expres-
sions in general relativity, Phys. Rev. (2) 122 (1961), 997–1006. MR0127946
[ADM62] R. Arnowitt, S. Deser, and C. W. Misner, The dynamics of general relativity, Grav-
itation: An introduction to current research, Wiley, New York, 1962, pp. 227–265.
MR0143629
[AH78] Abhay Ashtekar and R. O. Hansen, A unified treatment of null and spatial infin-
ity in general relativity. I. Universal structure, asymptotic symmetries, and con-
served quantities at spatial infinity, J. Math. Phys. 19 (1978), no. 7, 1542–1566, DOI
10.1063/1.523863. MR0503432
[AG05] Abhay Ashtekar and Gregory J. Galloway, Some uniqueness results for dynamical
horizons, Adv. Theor. Math. Phys. 9 (2005), no. 1, 1–30. MR2193368
[Aub70] Thierry Aubin, Métriques riemanniennes et courbure (French), J. Differential Geom-
etry 4 (1970), 383–424. MR0279731
[Aub76] Thierry Aubin, Équations différentielles non linéaires et problème de Yamabe con-
cernant la courbure scalaire, J. Math. Pures Appl. (9) 55 (1976), no. 3, 269–296.
MR0431287
[ABR01] Sheldon Axler, Paul Bourdon, and Wade Ramey, Harmonic function theory, 2nd
ed., Graduate Texts in Mathematics, vol. 137, Springer-Verlag, New York, 2001.
MR1805196
[Bar86] Robert Bartnik, The mass of an asymptotically flat manifold, Comm. Pure Appl.
Math. 39 (1986), no. 5, 661–693, DOI 10.1002/cpa.3160390505. MR849427
[Bar89] Robert Bartnik, New definition of quasilocal mass, Phys. Rev. Lett. 62 (1989), no. 20,
2346–2348, DOI 10.1103/PhysRevLett.62.2346. MR996396
[Bar93] Robert Bartnik, Quasi-spherical metrics and prescribed scalar curvature, J. Differ-
ential Geom. 37 (1993), no. 1, 31–71. MR1198599
[Bar05] Robert Bartnik, Phase space for the Einstein equations, Comm. Anal. Geom. 13
(2005), no. 5, 845–885. MR2216143
[BC96] Robert Beig and Piotr T. Chruściel, Killing vectors in asymptotically flat space-times.
I. Asymptotically translational Killing vectors and the rigid positive energy theorem,
J. Math. Phys. 37 (1996), no. 4, 1939–1961, DOI 10.1063/1.531497. MR1380882
[Bes08] Arthur L. Besse, Einstein manifolds, Classics in Mathematics, Springer-Verlag,
Berlin, 2008. Reprint of the 1987 edition. MR2371700
[Bie09] Lydia Bieri, Part I: Solutions of the Einstein vacuum equations, Extensions of the
stability theorem of the Minkowski space in general relativity, AMS/IP Stud. Adv.
Math., vol. 45, Amer. Math. Soc., Providence, RI, 2009, pp. 1–295. MR2537047
[Bir23] G. D. Birkhoff, Relativity and modern physics. With the cooperation of R. E. Langer,
Harvard University Press, Cambridge, MA, 1923.
[BK89] John Bland and Morris Kalka, Negative scalar curvature metrics on noncom-
pact manifolds, Trans. Amer. Math. Soc. 316 (1989), no. 2, 433–446, DOI
10.2307/2001356. MR987159
[BDGG69] E. Bombieri, E. De Giorgi, and E. Giusti, Minimal cones and the Bernstein problem,
Invent. Math. 7 (1969), 243–268, DOI 10.1007/BF01404309. MR0250205
Bibliography 345
[Chr09] Demetrios Christodoulou, The formation of black holes in general relativity, EMS
Monographs in Mathematics, European Mathematical Society (EMS), Zürich, 2009.
MR2488976
[CK93] Demetrios Christodoulou and Sergiu Klainerman, The global nonlinear stability of
the Minkowski space, Princeton Mathematical Series, vol. 41, Princeton University
Press, Princeton, NJ, 1993. MR1316662
[CM06] Piotr T. Chruściel and Daniel Maerten, Killing vectors in asymptotically flat space-
times. II. Asymptotically translational Killing vectors and the rigid positive energy
theorem in higher dimensions, J. Math. Phys. 47 (2006), no. 2, 022502, 10, DOI
10.1063/1.2167809. MR2208148
[CO81] D. Christodoulou and N. O’Murchadha, The boost problem in general relativity,
Comm. Math. Phys. 80 (1981), no. 2, 271–300. MR623161
[Chr86] Piotr Chruściel, Boundary conditions at spatial infinity from a Hamiltonian point of
view, Topological properties and global structure of space-time (Erice, 1985), NATO
Adv. Sci. Inst. Ser. B Phys., vol. 138, Plenum, New York, 1986, pp. 49–59. MR1102938
[Chr08] Piotr T. Chruściel, Mass and angular-momentum inequalities for axi-symmetric ini-
tial data sets. I. Positivity of mass, Ann. Physics 323 (2008), no. 10, 2566–2590, DOI
10.1016/j.aop.2007.12.010. MR2454698
[Chr15] Piotr T. Chruściel, The geometry of black holes, 2015. http://homepage.univie.ac.
at/piotr.chrusciel/teaching/Black~Holes/BlackHolesViennaJanuary2015.pdf.
[CC08] Piotr T. Chruściel and João Lopes Costa, On uniqueness of stationary vacuum black
holes (English, with English and French summaries), Astérisque 321 (2008), 195–265.
MR2521649
[CDGH01] P. T. Chruściel, E. Delay, G. J. Galloway, and R. Howard, Regularity of hori-
zons and the area theorem, Ann. Henri Poincaré 2 (2001), no. 1, 109–178, DOI
10.1007/PL00001029. MR1823836
[CGP10] Piotr T. Chruściel, Gregory J. Galloway, and Daniel Pollack, Mathematical general
relativity: a sampler, Bull. Amer. Math. Soc. (N.S.) 47 (2010), no. 4, 567–638, DOI
10.1090/S0273-0979-2010-01304-5. MR2721040
[CH03] Piotr T. Chruściel and Marc Herzlich, The mass of asymptotically hyperbolic
Riemannian manifolds, Pacific J. Math. 212 (2003), no. 2, 231–264, DOI
10.2140/pjm.2003.212.231. MR2038048
[CM11] Tobias Holck Colding and William P. Minicozzi II, A course in minimal surfaces,
Graduate Studies in Mathematics, vol. 121, American Mathematical Society, Provi-
dence, RI, 2011. MR2780140
[Cor00] Justin Corvino, Scalar curvature deformation and a gluing construction for the Ein-
stein constraint equations, Comm. Math. Phys. 214 (2000), no. 1, 137–189, DOI
10.1007/PL00005533. MR1794269
[Cor05] Justin Corvino, A note on asymptotically flat metrics on R3 which are scalar-flat
and admit minimal spheres, Proc. Amer. Math. Soc. 133 (2005), no. 12, 3669–3678,
DOI 10.1090/S0002-9939-05-07926-8. MR2163606
[Cor17] Justin Corvino, A note on the Bartnik mass, Nonlinear analysis in geometry and
applied mathematics, Harv. Univ. Cent. Math. Sci. Appl. Ser. Math., vol. 1, Int.
Press, Somerville, MA, 2017, pp. 49–75. MR3729084
[CH16] Justin Corvino and Lan-Hsuan Huang, Localized deformation for initial data sets
with the dominant energy condition, arXiv:1606.03078 (2016).
[CS06] Justin Corvino and Richard M. Schoen, On the asymptotics for the vacuum Einstein
constraint equations, J. Differential Geom. 73 (2006), no. 2, 185–217. MR2225517
[Cou37] Richard Courant, Plateau’s problem and Dirichlet’s principle, Ann. of Math. (2) 38
(1937), no. 3, 679–724, DOI 10.2307/1968610. MR1503362
348 Bibliography
[DL17] Mihalis Dafermos and Jonathan Luk, The interior of dynamical vacuum black holes
I: The c0 -stability of the Kerr Cauchy horizon, arXiv:1710.01722 (2017).
[DL16] Mattias Dahl and Eric Larsson, Outermost apparent horizons diffeomorphic to unit
normal bundles, arXiv:1606.0841 (2016).
[DM07] Xianzhe Dai and Li Ma, Mass under the Ricci flow, Comm. Math. Phys. 274 (2007),
no. 1, 65–80. MR2318848
[DG61] Ennio De Giorgi, Frontiere orientate di misura minima (Italian), Seminario di
Matematica della Scuola Normale Superiore di Pisa, 1960-61, Editrice Tecnico Scien-
tifica, Pisa, 1961. MR0179651
[DL16] Camillo De Lellis, The size of the singular set of area-minimizing currents, Surveys
in differential geometry 2016. Advances in geometry and mathematical physics, Surv.
Differ. Geom., vol. 21, Int. Press, Somerville, MA, 2016, pp. 1–83. MR3525093
[DLS14] Camillo De Lellis and Emanuele Spadaro, Regularity of area minimizing currents
I: gradient Lp estimates, Geom. Funct. Anal. 24 (2014), no. 6, 1831–1884, DOI
10.1007/s00039-014-0306-3. MR3283929
[DS83] V. I. Denisov and V. O. Solovev, Energy defined in general relativity on the basis of
the traditional Hamiltonian approach has no physical meaning (Russian, with English
summary), Teoret. Mat. Fiz. 56 (1983), no. 2, 301–314. MR718105
[Dou31] Jesse Douglas, Solution of the problem of Plateau, Trans. Amer. Math. Soc. 33 (1931),
no. 1, 263–321, DOI 10.2307/1989472. MR1501590
[Eic09] Michael Eichmair, The Plateau problem for marginally outer trapped surfaces, J.
Differential Geom. 83 (2009), no. 3, 551–583. MR2581357
[Eic13] Michael Eichmair, The Jang equation reduction of the spacetime positive energy the-
orem in dimensions less than eight, Comm. Math. Phys. 319 (2013), no. 3, 575–593,
DOI 10.1007/s00220-013-1700-7. MR3040369
[EGP13] Michael Eichmair, Gregory J. Galloway, and Daniel Pollack, Topological censorship
from the initial data point of view, J. Differential Geom. 95 (2013), no. 3, 389–405.
MR3128989
[EHLS16] Michael Eichmair, Lan-Hsuan Huang, Dan A. Lee, and Richard Schoen, The space-
time positive mass theorem in dimensions less than eight, J. Eur. Math. Soc. (JEMS)
18 (2016), no. 1, 83–121, DOI 10.4171/JEMS/584. MR3438380
[EM13] Michael Eichmair and Jan Metzger, Large isoperimetric surfaces in initial data sets,
J. Differential Geom. 94 (2013), no. 1, 159–186. MR3031863
[EM16] Michael Eichmair and Jan Metzger, Jenkins-Serrin-type results for the Jang equation,
J. Differential Geom. 102 (2016), no. 2, 207–242. MR3454546
[EMW12] Michael Eichmair, Pengzi Miao, and Xiaodong Wang, Extension of a theorem of Shi
and Tam, Calc. Var. Partial Differential Equations 43 (2012), no. 1-2, 45–56, DOI
10.1007/s00526-011-0402-2. MR2860402
[Eis93] Jean Eisenstaedt, Lemaı̂tre and the Schwarzschild solution, The attraction of gravita-
tion: new studies in the history of general relativity (Johnstown, PA, 1991), Einstein
Stud., vol. 5, Birkhäuser Boston, Boston, MA, 1993, pp. 353–389. MR1735388
[ER02] Roberto Emparan and Harvey S. Reall, A rotating black ring solution in five
dimensions, Phys. Rev. Lett. 88 (2002), no. 10, 101101, 4, DOI 10.1103/Phys-
RevLett.88.101101. MR1901280
[ER06] Roberto Emparan and Harvey S. Reall, Black rings, Classical Quantum Gravity 23
(2006), no. 20, R169–R197, DOI 10.1088/0264-9381/23/20/R01. MR2270099
[Eva10] Lawrence C. Evans, Partial differential equations, 2nd ed., Graduate Studies in Math-
ematics, vol. 19, American Mathematical Society, Providence, RI, 2010. MR2597943
[ES91] L. C. Evans and J. Spruck, Motion of level sets by mean curvature. I, J. Differential
Geom. 33 (1991), no. 3, 635–681. MR1100206
Bibliography 349
[Fed70] Herbert Federer, The singular sets of area minimizing rectifiable currents with codi-
mension one and of area minimizing flat chains modulo two with arbitrary codimen-
sion, Bull. Amer. Math. Soc. 76 (1970), 767–771, DOI 10.1090/S0002-9904-1970-
12542-3. MR0260981
[FF60] Herbert Federer and Wendell H. Fleming, Normal and integral currents, Ann. of
Math. (2) 72 (1960), 458–520, DOI 10.2307/1970227. MR0123260
[Fin09] Felix Finster, A level set analysis of the Witten spinor with applications
to curvature estimates, Math. Res. Lett. 16 (2009), no. 1, 41–55, DOI
10.4310/MRL.2009.v16.n1.a5. MR2480559
[FK02] Felix Finster and Ines Kath, Curvature estimates in asymptotically flat manifolds
of positive scalar curvature, Comm. Anal. Geom. 10 (2002), no. 5, 1017–1031, DOI
10.4310/CAG.2002.v10.n5.a6. MR1957661
[FM75] Arthur E. Fischer and Jerrold E. Marsden, Deformations of the scalar curvature,
Duke Math. J. 42 (1975), no. 3, 519–547. MR0380907
[FCS80] Doris Fischer-Colbrie and Richard Schoen, The structure of complete stable minimal
surfaces in 3-manifolds of nonnegative scalar curvature, Comm. Pure Appl. Math.
33 (1980), no. 2, 199–211, DOI 10.1002/cpa.3160330206. MR562550
[Fle62] Wendell H. Fleming, On the oriented Plateau problem, Rend. Circ. Mat. Palermo (2)
11 (1962), 69–90, DOI 10.1007/BF02849427. MR0157263
[Fol95] Gerald B. Folland, Introduction to partial differential equations, 2nd ed., Princeton
University Press, Princeton, NJ, 1995. MR1357411
[FB52] Y. Fourès-Bruhat, Théorème d’existence pour certains systèmes d’équations aux
dérivées partielles non linéaires (French), Acta Math. 88 (1952), 141–225, DOI
10.1007/BF02392131. MR0053338
[FS14] Alexandre Freire and Fernando Schwartz, Mass-capacity inequalities for conformally
flat manifolds with boundary, Comm. Partial Differential Equations 39 (2014), no. 1,
98–119, DOI 10.1080/03605302.2013.851211. MR3169780
[Gal00] Gregory J. Galloway, Maximum principles for null hypersurfaces and null splitting
theorems, Ann. Henri Poincaré 1 (2000), no. 3, 543–567, DOI 10.1007/s000230050006.
MR1777311
[Gal11] Gregory J. Galloway, Stability and rigidity of extremal surfaces in Riemannian ge-
ometry and general relativity, Surveys in geometric analysis and relativity, Adv. Lect.
Math. (ALM), vol. 20, Int. Press, Somerville, MA, 2011, pp. 221–239. MR2906927
[Gal14] Gregory J. Galloway, Notes on Lorentzian causality, 2014. http://www.math.miami.
edu/~galloway/vienna-course-notes.pdf.
[Gal18] Gregory J. Galloway, Rigidity of outermost MOTS: the initial data version, Gen.
Relativity Gravitation 50 (2018), no. 3, Art. 32, 7, DOI 10.1007/s10714-018-2353-9.
MR3768955
[GS06] Gregory J. Galloway and Richard Schoen, A generalization of Hawking’s black hole
topology theorem to higher dimensions, Comm. Math. Phys. 266 (2006), no. 2, 571–
576, DOI 10.1007/s00220-006-0019-z. MR2238889
[Gan75] Dennis Gannon, Singularities in nonsimply connected space-times, J. Mathematical
Phys. 16 (1975), no. 12, 2364–2367, DOI 10.1063/1.522498. MR0389141
[GY86] L. Zhiyong Gao and S.-T. Yau, The existence of negatively Ricci curved metrics on
three-manifolds, Invent. Math. 85 (1986), no. 3, 637–652, DOI 10.1007/BF01390331.
MR848687
[Ger70] Robert Geroch, Domain of dependence, J. Mathematical Phys. 11 (1970), 437–449,
DOI 10.1063/1.1665157. MR0270697
[Ger73] Robert Geroch, Energy extraction, Sixth Texas symposium on relativistic astro-
physics, 1973, pp. 108.
350 Bibliography
[Ger75] Robert Geroch, General relativity, Differential geometry (Proc. Sympos. Pure Math.,
Vol. XXVII, Part 2, Stanford Univ., Stanford, Calif., 1973), Amer. Math. Soc., Prov-
idence, R.I., 1975, pp. 401–414. MR0378703
[Ger13] Robert Geroch, General relativity: 1972 lecture notes, Minkowski Institute Press,
Montreal, 2013.
[GHHP83] G. W. Gibbons, S. W. Hawking, Gary T. Horowitz, and Malcolm J. Perry, Posi-
tive mass theorems for black holes, Comm. Math. Phys. 88 (1983), no. 3, 295–308.
MR701918
[GT01] David Gilbarg and Neil S. Trudinger, Elliptic partial differential equations of second
order, Classics in Mathematics, Springer-Verlag, Berlin, 2001. Reprint of the 1998
edition. MR1814364
[GT12] James D. E. Grant and Nathalie Tassotti, A positive mass theorem for low-regularity
metrics, arXiv:1205.1302 (2012).
[GM08] Jeremy Gray and Mario Micallef, About the cover: the work of Jesse Douglas on
minimal surfaces, Bull. Amer. Math. Soc. (N.S.) 45 (2008), no. 2, 293–302, DOI
10.1090/S0273-0979-08-01192-0. MR2383307
[Gre63] L. W. Green, Auf Wiedersehensflächen (German), Ann. of Math. (2) 78 (1963),
289–299, DOI 10.2307/1970344. MR0155271
[GL80a] Mikhael Gromov and H. Blaine Lawson Jr., Spin and scalar curvature in the pres-
ence of a fundamental group. I, Ann. of Math. (2) 111 (1980), no. 2, 209–230, DOI
10.2307/1971198. MR569070
[GL80b] Mikhael Gromov and H. Blaine Lawson Jr., The classification of simply connected
manifolds of positive scalar curvature, Ann. of Math. (2) 111 (1980), no. 3, 423–434,
DOI 10.2307/1971103. MR577131
[GL94] Pengfei Guan and Yan Yan Li, The Weyl problem with nonnegative Gauss curvature,
J. Differential Geom. 39 (1994), no. 2, 331–342. MR1267893
[HW09] Fengbo Hang and Xiaodong Wang, Rigidity theorems for compact manifolds with
boundary and positive Ricci curvature, J. Geom. Anal. 19 (2009), no. 3, 628–642.
MR2496569
[Har90] F. Reese Harvey, Spinors and calibrations, Perspectives in Mathematics, vol. 9, Aca-
demic Press, Inc., Boston, MA, 1990. MR1045637
[Hat02] Allen Hatcher, Algebraic topology, Cambridge University Press, Cambridge, 2002.
MR1867354
[Haw68] S. W. Hawking, Gravitational radiation in an expanding universe, J. Math. Phys. 9
(1968), 598–604.
[Haw72] S. W. Hawking, Black holes in general relativity, Comm. Math. Phys. 25 (1972),
152–166. MR0293962
[HE73] S. W. Hawking and G. F. R. Ellis, The large scale structure of space-time, Cambridge
Monographs on Mathematical Physics, No. 1, Cambridge University Press, London-
New York, 1973. MR0424186
[Hay96] Sean A. Hayward, Gravitational energy in spherical symmetry, Phys. Rev. D (3) 53
(1996), no. 4, 1938–1949, DOI 10.1103/PhysRevD.53.1938. MR1380012
[Heb96] Emmanuel Hebey, Sobolev spaces on Riemannian manifolds, Lecture Notes in Math-
ematics, vol. 1635, Springer-Verlag, Berlin, 1996. MR1481970
[HL16] Hans-Joachim Hein and Claude LeBrun, Mass in Kähler geometry, Comm. Math.
Phys. 347 (2016), no. 1, 183–221, DOI 10.1007/s00220-016-2661-4. MR3543182
[Hem76] John Hempel, 3-Manifolds, Princeton University Press, Princeton, N. J.; University
of Tokyo Press, Tokyo, 1976. Ann. of Math. Studies, No. 86. MR0415619
[Her70] Joseph Hersch, Quatre propriétés isopérimétriques de membranes sphériques ho-
mogènes (French), C. R. Acad. Sci. Paris Sér. A-B 270 (1970), A1645–A1648.
MR0292357
Bibliography 351
[Her97] Marc Herzlich, A Penrose-like inequality for the mass of Riemannian asymp-
totically flat manifolds, Comm. Math. Phys. 188 (1997), no. 1, 121–133, DOI
10.1007/s002200050159. MR1471334
[Her98] Marc Herzlich, The positive mass theorem for black holes revisited, J. Geom. Phys.
26 (1998), no. 1-2, 97–111, DOI 10.1016/S0393-0440(97)00040-5. MR1626060
[Hit74] Nigel Hitchin, Harmonic spinors, Advances in Math. 14 (1974), 1–55, DOI
10.1016/0001-8708(74)90021-8. MR0358873
[Hua09] Lan-Hsuan Huang, On the center of mass of isolated systems with general asymp-
totics, Classical Quantum Gravity 26 (2009), no. 1, 015012, 25, DOI 10.1088/0264-
9381/26/1/015012. MR2470255
[Hua12] Lan-Hsuan Huang, On the center of mass in general relativity, Fifth International
Congress of Chinese Mathematicians. Part 1, 2, AMS/IP Stud. Adv. Math., 51, pt.
1, vol. 2, Amer. Math. Soc., Providence, RI, 2012, pp. 575–591. MR2908093
[HL15] Lan-Hsuan Huang and Dan A. Lee, Stability of the positive mass theorem for graphical
hypersurfaces of Euclidean space, Comm. Math. Phys. 337 (2015), no. 1, 151–169,
DOI 10.1007/s00220-014-2265-9. MR3324159
[HL17] Lan-Hsuan Huang and Dan A. Lee, Rigidity of the spacetime positive mass theorem,
arXiv:1706.03732 (2017).
[HLS17] Lan-Hsuan Huang, Dan A. Lee, and Christina Sormani, Intrinsic flat stability of
the positive mass theorem for graphical hypersurfaces of Euclidean space, J. Reine
Angew. Math. 727 (2017), 269–299, DOI 10.1515/crelle-2015-0051. MR3652253
[HMM18] Lan-Hsuan Huang, Daniel Martin, and Pengzi Miao, Static potentials and area min-
imizing hypersurfaces, Proc. Amer. Math. Soc. 146 (2018), no. 6, 2647–2661, DOI
10.1090/proc/13936. MR3778165
[HW13] Lan-Hsuan Huang and Damin Wu, Hypersurfaces with nonnegative scalar curvature,
J. Differential Geom. 95 (2013), no. 2, 249–278. MR3128984
[HW15] Lan-Hsuan Huang and Damin Wu, The equality case of the Penrose inequality for
asymptotically flat graphs, Trans. Amer. Math. Soc. 367 (2015), no. 1, 31–47, DOI
10.1090/S0002-9947-2014-06090-X. MR3271252
[HI01] Gerhard Huisken and Tom Ilmanen, The inverse mean curvature flow and the
Riemannian Penrose inequality, J. Differential Geom. 59 (2001), no. 3, 353–437.
MR1916951
[HY96] Gerhard Huisken and Shing-Tung Yau, Definition of center of mass for isolated phys-
ical systems and unique foliations by stable spheres with constant mean curvature, In-
vent. Math. 124 (1996), no. 1-3, 281–311, DOI 10.1007/s002220050054. MR1369419
[Isr67] Werner Israel, Event horizons in static vacuum space-times, Phys. Rev. 164 (1967),
1776–1779.
[Jau13] Jeffrey L. Jauregui, Fill-ins of nonnegative scalar curvature, static metrics,
and quasi-local mass, Pacific J. Math. 261 (2013), no. 2, 417–444, DOI
10.2140/pjm.2013.261.417. MR3037574
[Jau18] Jeffrey L. Jauregui, On the lower semicontinuity of the ADM mass, Comm. Anal.
Geom. 26 (2018), no. 1, 85–111, DOI 10.4310/CAG.2018.v26.n1.a3. MR3761654
[Jau19] Jeffrey L. Jauregui, Smoothing the Bartnik boundary conditions and other re-
sults on Bartnik’s quasi-local mass, J. Geom. Phys. 136 (2019), 228–243, DOI
10.1016/j.geomphys.2018.11.005. MR3885243
[JL16] Jeffrey L. Jauregui and Dan A. Lee, Lower semicontinuity of mass under c0 conver-
gence and Huisken’s isoperimetric mass, arXiv:1602.00732 (2016).
[JL19] Jeffrey L. Jauregui and Dan A. Lee, Lower semicontinuity of ADM mass under
intrinsic flat convergence, arXiv:1903.00916 (2019).
[Jos11] Jürgen Jost, Riemannian geometry and geometric analysis, 6th ed., Universitext,
Springer, Heidelberg, 2011. MR2829653
352 Bibliography
[Jos13] Jürgen Jost, Partial differential equations, 3rd ed., Graduate Texts in Mathematics,
vol. 214, Springer, New York, 2013. MR3012036
[KW75a] Jerry L. Kazdan and F. W. Warner, Existence and conformal deformation of metrics
with prescribed Gaussian and scalar curvatures, Ann. of Math. (2) 101 (1975), 317–
331, DOI 10.2307/1970993. MR0375153
[KW75b] Jerry L. Kazdan and F. W. Warner, Prescribing curvatures, Differential geometry
(Proc. Sympos. Pure Math., Vol. XXVII, Stanford Univ., Stanford, Calif., 1973),
Amer. Math. Soc., Providence, R.I., 1975, pp. 309–319. MR0394505
[KMWY18] Marcus Khuri, Yukio Matsumoto, Gilbert Weinstein, and Sumio Yamada, Plumbing
constructions and the domain of outer communication for 5-dimensional stationary
black holes, arXiv:1807.03452 (2018).
[Kle68] Felix Klein, Vorlesungen über höhere Geometrie (German), Dritte Auflage. Bear-
beitet und herausgegeben von W. Blaschke. Die Grundlehren der mathematischen
Wissenschaften, Band 22, Springer-Verlag, Berlin, 1968. MR0226476
[KR50] M. G. Kreı̆n and M. A. Rutman, Linear operators leaving invariant a cone in a
Banach space, Amer. Math. Soc. Translation 1950 (1950), no. 26, 128. MR0038008
[KH97] Marcus Kriele and Sean A. Hayward, Outer trapped surfaces and their apparent hori-
zon, J. Math. Phys. 38 (1997), no. 3, 1593–1604, DOI 10.1063/1.532010. MR1435684
[KL14] Hari K. Kunduri and James Lucietti, Supersymmetric black holes with lens-space
topology, Phys. Rev. Lett. 113 (2014), 211101, 5.
[Lam11] Mau-Kwong George Lam, The graph cases of the Riemannian positive mass and
Penrose inequalities in all dimensions, ProQuest LLC, Ann Arbor, MI, 2011. Thesis
(Ph.D.)–Duke University. MR2873434
[LM89] H. Blaine Lawson Jr. and Marie-Louise Michelsohn, Spin geometry, Princeton Math-
ematical Series, vol. 38, Princeton University Press, Princeton, NJ, 1989. MR1031992
[Lee76] C. W. Lee, A restriction on the topology of Cauchy surfaces in general relativity,
Comm. Math. Phys. 51 (1976), no. 2, 157–162. MR0426805
[Lee09] Dan A. Lee, On the near-equality case of the positive mass theorem, Duke Math. J.
148 (2009), no. 1, 63–80, DOI 10.1215/00127094-2009-021. MR2515100
[Lee13] Dan A. Lee, A positive mass theorem for Lipschitz metrics with small singular sets,
Proc. Amer. Math. Soc. 141 (2013), no. 11, 3997–4004, DOI 10.1090/S0002-9939-
2013-11871-X. MR3091790
[LL15] Dan A. Lee and Philippe G. LeFloch, The positive mass theorem for manifolds
with distributional curvature, Comm. Math. Phys. 339 (2015), no. 1, 99–120, DOI
10.1007/s00220-015-2414-9. MR3366052
[LN15] Dan A. Lee and André Neves, The Penrose inequality for asymptotically locally hyper-
bolic spaces with nonpositive mass, Comm. Math. Phys. 339 (2015), no. 2, 327–352,
DOI 10.1007/s00220-015-2421-x. MR3370607
[LS12] Dan A. Lee and Christina Sormani, Near-equality of the Penrose inequality for ro-
tationally symmetric Riemannian manifolds, Ann. Henri Poincaré 13 (2012), no. 7,
1537–1556, DOI 10.1007/s00023-012-0172-1. MR2982632
[LS14] Dan A. Lee and Christina Sormani, Stability of the positive mass theorem for ro-
tationally symmetric Riemannian manifolds, J. Reine Angew. Math. 686 (2014),
187–220, DOI 10.1515/crelle-2012-0094. MR3176604
[Lee97] John M. Lee, Riemannian manifolds: An introduction to curvature, Graduate Texts
in Mathematics, vol. 176, Springer-Verlag, New York, 1997. MR1468735
[LP87] John M. Lee and Thomas H. Parker, The Yamabe problem, Bull. Amer. Math. Soc.
(N.S.) 17 (1987), no. 1, 37–91, DOI 10.1090/S0273-0979-1987-15514-5. MR888880
[LS15] Philippe G. LeFloch and Christina Sormani, The nonlinear stability of rotationally
symmetric spaces with low regularity, J. Funct. Anal. 268 (2015), no. 7, 2005–2065,
DOI 10.1016/j.jfa.2014.12.012. MR3315585
Bibliography 353
[LW99] Yanyan Li and Gilbert Weinstein, A priori bounds for co-dimension one isometric
embeddings, Amer. J. Math. 121 (1999), no. 5, 945–965. MR1713298
[Li18] Yu Li, Ricci flow on asymptotically Euclidean manifolds, Geom. Topol. 22 (2018),
no. 3, 1837–1891, DOI 10.2140/gt.2018.22.1837. MR3780446
[Lic63] André Lichnerowicz, Spineurs harmoniques (French), C. R. Acad. Sci. Paris 257
(1963), 7–9. MR0156292
[Lin14] Chen-Yun Lin, Parabolic constructions of asymptotically flat 3-metrics of prescribed
scalar curvature, Calc. Var. Partial Differential Equations 49 (2014), no. 3-4, 1309–
1335, DOI 10.1007/s00526-013-0623-7. MR3168634
[LY02] Fanghua Lin and Xiaoping Yang, Geometric measure theory—an introduction, Ad-
vanced Mathematics (Beijing/Boston), vol. 1, Science Press Beijing, Beijing; Inter-
national Press, Boston, MA, 2002. MR2030862
[LR10] Hans Lindblad and Igor Rodnianski, The global stability of Minkowski space-time in
harmonic gauge, Ann. of Math. (2) 171 (2010), no. 3, 1401–1477, DOI 10.4007/an-
nals.2010.171.1401. MR2680391
[LY06] Chiu-Chu Melissa Liu and Shing-Tung Yau, Positivity of quasi-local mass. II, J.
Amer. Math. Soc. 19 (2006), no. 1, 181–204, DOI 10.1090/S0894-0347-05-00497-2.
MR2169046
[Loc81] Robert B. Lockhart, Fredholm properties of a class of elliptic operators on noncom-
pact manifolds, Duke Math. J. 48 (1981), no. 1, 289–312. MR610188
[LM83] Robert B. Lockhart and Robert C. McOwen, On elliptic systems in Rn , Acta Math.
150 (1983), no. 1-2, 125–135, DOI 10.1007/BF02392969. MR697610
[Loh94] Joachim Lohkamp, Metrics of negative Ricci curvature, Ann. of Math. (2) 140 (1994),
no. 3, 655–683, DOI 10.2307/2118620. MR1307899
[Loh99] Joachim Lohkamp, Scalar curvature and hammocks, Math. Ann. 313 (1999), no. 3,
385–407, DOI 10.1007/s002080050266. MR1678604
[Loh06] Joachim Lohkamp, The higher dimensional positive mass theorem I, arXiv:0608795
(2006).
[Loh15a] Joachim Lohkamp, Hyperbolic geometry and potential theory on minimal hypersur-
faces, arXiv:1512.08251 (2015).
[Loh15b] Joachim Lohkamp, Skin structures in scalar curvature geometry, arXiv:1512.08251
(2015).
[Loh15c] Joachim Lohkamp, Skin structures on minimal hypersurfaces, arXiv:1512.08249
(2015).
[Loh16] Joachim Lohkamp, The higher dimensional positive mass theorem II,
arXiv:1612.07505 (2016).
[LM17] Siyuan Lu and Pengzi Miao, Minimal hypersurfaces and boundary behavior of com-
pact manifolds with nonnegative scalar curvature, arXiv:1703.08164 (2017).
[MḾ94] Edward Malec and Niall Ó Murchadha, Trapped surfaces and the Penrose inequality
in spherically symmetric geometries, Phys. Rev. D (3) 49 (1994), no. 12, 6931–6934,
DOI 10.1103/PhysRevD.49.6931. MR1278625
[MM17] Christos Mantoulidis and Pengzi Miao, Total mean curvature, scalar curvature, and
a variational analog of Brown-York mass, Comm. Math. Phys. 352 (2017), no. 2,
703–718, DOI 10.1007/s00220-016-2767-8. MR3627410
[MS15] Christos Mantoulidis and Richard Schoen, On the Bartnik mass of apparent hori-
zons, Classical Quantum Gravity 32 (2015), no. 20, 205002, 16, DOI 10.1088/0264-
9381/32/20/205002. MR3406373
[MN12] Fernando C. Marques and André Neves, Rigidity of min-max minimal spheres
in three-manifolds, Duke Math. J. 161 (2012), no. 14, 2725–2752, DOI
10.1215/00127094-1813410. MR2993139
354 Bibliography
[MN14] Fernando C. Marques and André Neves, Min-max theory and the Willmore conjec-
ture, Ann. of Math. (2) 179 (2014), no. 2, 683–782, DOI 10.4007/annals.2014.179.2.6.
MR3152944
[Mar09] Marc Mars, Present status of the Penrose inequality, Classical Quantum Gravity 26
(2009), no. 19, 193001, 59, DOI 10.1088/0264-9381/26/19/193001. MR2545137
[MM84] Umberto Massari and Mario Miranda, Minimal surfaces of codimension one, North-
Holland Mathematics Studies, vol. 91, North-Holland Publishing Co., Amsterdam,
1984. Notas de Matemática [Mathematical Notes], 95. MR795963
[MS12] Donovan McFeron and Gábor Székelyhidi, On the positive mass theorem for
manifolds with corners, Comm. Math. Phys. 313 (2012), no. 2, 425–443, DOI
10.1007/s00220-012-1498-8. MR2942956
[McO79] Robert C. McOwen, The behavior of the Laplacian on weighted Sobolev spaces,
Comm. Pure Appl. Math. 32 (1979), no. 6, 783–795, DOI 10.1002/cpa.3160320604.
MR539158
[McO80] Robert C. McOwen, On elliptic operators in Rn , Comm. Partial Differential Equa-
tions 5 (1980), no. 9, 913–933, DOI 10.1080/03605308008820158. MR584101
[MSY82] William Meeks III, Leon Simon, and Shing Tung Yau, Embedded minimal surfaces,
exotic spheres, and manifolds with positive Ricci curvature, Ann. of Math. (2) 116
(1982), no. 3, 621–659, DOI 10.2307/2007026. MR678484
[MY80] William H. Meeks III and Shing Tung Yau, Topology of three-dimensional manifolds
and the embedding problems in minimal surface theory, Ann. of Math. (2) 112 (1980),
no. 3, 441–484, DOI 10.2307/1971088. MR595203
[Mey63] Norman Meyers, An expansion about infinity for solutions of linear elliptic equations.,
J. Math. Mech. 12 (1963), 247–264. MR0149072
[Mia02] Pengzi Miao, Positive mass theorem on manifolds admitting corners along a hy-
persurface, Adv. Theor. Math. Phys. 6 (2002), no. 6, 1163–1182 (2003), DOI
10.4310/ATMP.2002.v6.n6.a4. MR1982695
[Mia04] Pengzi Miao, Variational effect of boundary mean curvature on ADM mass in gen-
eral relativity, Mathematical physics research on the leading edge, Nova Sci. Publ.,
Hauppauge, NY, 2004, pp. 145–171. MR2068577
[Mia09] Pengzi Miao, On a localized Riemannian Penrose inequality, Comm. Math. Phys.
292 (2009), no. 1, 271–284, DOI 10.1007/s00220-009-0834-0. MR2540078
[MT15] Pengzi Miao and Luen-Fai Tam, Static potentials on asymptotically flat manifolds,
Ann. Henri Poincaré 16 (2015), no. 10, 2239–2264, DOI 10.1007/s00023-014-0373-x.
MR3385979
[MT16] Pengzi Miao and Luen-Fai Tam, Evaluation of the ADM mass and center of mass
via the Ricci tensor, Proc. Amer. Math. Soc. 144 (2016), no. 2, 753–761, DOI
10.1090/proc12726. MR3430851
[MM15] Mario Micallef and Vlad Moraru, Splitting of 3-manifolds and rigidity of area-
minimising surfaces, Proc. Amer. Math. Soc. 143 (2015), no. 7, 2865–2872, DOI
10.1090/S0002-9939-2015-12137-5. MR3336611
[MO89] Maung Min-Oo, Scalar curvature rigidity of asymptotically hyperbolic spin manifolds,
Math. Ann. 285 (1989), no. 4, 527–539, DOI 10.1007/BF01452046. MR1027758
[MTW73] Charles W. Misner, Kip S. Thorne, and John Archibald Wheeler, Gravitation, W. H.
Freeman and Co., San Francisco, Calif., 1973. MR0418833
[Mor16] Frank Morgan, Geometric measure theory: A beginner’s guide; Illustrated by James
F. Bredt, 5th ed., Elsevier/Academic Press, Amsterdam, 2016. MR3497381
[MT07] John Morgan and Gang Tian, Ricci flow and the Poincaré conjecture, Clay Math-
ematics Monographs, vol. 3, American Mathematical Society, Providence, RI; Clay
Mathematics Institute, Cambridge, MA, 2007. MR2334563
Bibliography 355
[MT14] John Morgan and Gang Tian, The geometrization conjecture, Clay Mathematics
Monographs, vol. 5, American Mathematical Society, Providence, RI; Clay Math-
ematics Institute, Cambridge, MA, 2014. MR3186136
[Mor48] Charles B. Morrey Jr., The problem of Plateau on a Riemannian manifold, Ann. of
Math. (2) 49 (1948), 807–851, DOI 10.2307/1969401. MR0027137
[MzHRS73] H. Müller zum Hagen, David C. Robinson, and H. J. Seifert, Black holes in static
vacuum space-times, General Relativity and Gravitation 4 (1973), 53–78, DOI
10.1007/bf00769760. MR0398432
[MP86] R. C. Myers and M. J. Perry, Black holes in higher-dimensional space-times, Ann.
Physics 172 (1986), no. 2, 304–347, DOI 10.1016/0003-4916(86)90186-7. MR868295
[Nir53] Louis Nirenberg, The Weyl and Minkowski problems in differential geometry in the
large, Comm. Pure Appl. Math. 6 (1953), 337–394, DOI 10.1002/cpa.3160060303.
MR0058265
[NW73] Louis Nirenberg and Homer F. Walker, The null spaces of elliptic partial differen-
tial operators in Rn , J. Math. Anal. Appl. 42 (1973), 271–301, DOI 10.1016/0022-
247X(73)90138-8. Collection of articles dedicated to Salomon Bochner. MR0320821
[Nun13] Ivaldo Nunes, Rigidity of area-minimizing hyperbolic surfaces in three-manifolds,
J. Geom. Anal. 23 (2013), no. 3, 1290–1302, DOI 10.1007/s12220-011-9287-8.
MR3078354
[OW07] Todd A. Oliynyk and Eric Woolgar, Rotationally symmetric Ricci flow on asymptot-
ically flat manifolds, Comm. Anal. Geom. 15 (2007), no. 3, 535–568. MR2379804
[O’N83] Barrett O’Neill, Semi-Riemannian geometry: With applications to relativity, Pure
and Applied Mathematics, vol. 103, Academic Press, Inc. [Harcourt Brace Jovanovich,
Publishers], New York, 1983. MR719023
[PT82] Thomas Parker and Clifford Henry Taubes, On Witten’s proof of the positive energy
theorem, Comm. Math. Phys. 84 (1982), no. 2, 223–238. MR661134
[Pen65] Roger Penrose, Gravitational collapse and space-time singularities, Phys. Rev. Lett.
14 (1965), 57–59, DOI 10.1103/PhysRevLett.14.57. MR0172678
[Pen73] Roger Penrose, Naked singularities, Annals N. Y. Acad. Sci. 224 (1973), 125–134.
[Pen02] R. Penrose, Gravitational collapse: the role of general relativity, Gen. Relativity Grav-
itation 34 (2002), no. 7, 1141–1165, DOI 10.1023/A:1016578408204. Reprinted from
Rivista del Nuovo Cimento 1969, Numero Speziale I, 252–276. MR1915236
[Per02] Grisha Perelman, The entropy formula for the Ricci flow and its geometric applica-
tions, arXiv:math (2002).
[Per03a] Grisha Perelman, Finite extinction time for the solutions to the Ricci flow on certain
three-manifolds, arXiv:math (2003).
[Per03b] Grisha Perelman, Ricci flow with surgery on three-manifolds, arXiv:math (2003).
[Pet16] Peter Petersen, Riemannian geometry, 3rd ed., Graduate Texts in Mathematics,
vol. 171, Springer, Cham, 2016. MR3469435
[Pog52] A. V. Pogorelov, Regularity of a convex surface with given Gaussian curvature
(Russian), Mat. Sbornik N.S. 31(73) (1952), 88–103. MR0052807
[QT07] Jie Qing and Gang Tian, On the uniqueness of the foliation of spheres of constant
mean curvature in asymptotically flat 3-manifolds, J. Amer. Math. Soc. 20 (2007),
no. 4, 1091–1110, DOI 10.1090/S0894-0347-07-00560-7. MR2328717
[Rad30] Tibor Radó, On Plateau’s problem, Ann. of Math. (2) 31 (1930), no. 3, 457–469, DOI
10.2307/1968237. MR1502955
[RT74] Tullio Regge and Claudio Teitelboim, Role of surface integrals in the Hamiltonian for-
mulation of general relativity, Ann. Physics 88 (1974), 286–318, DOI 10.1016/0003-
4916(74)90404-7. MR0359663
356 Bibliography
[SY79d] R. Schoen and S. T. Yau, On the structure of manifolds with positive scalar cur-
vature, Manuscripta Math. 28 (1979), no. 1-3, 159–183, DOI 10.1007/BF01647970.
MR535700
[SY81a] Richard Schoen and Shing Tung Yau, The energy and the linear momentum of space-
times in general relativity, Comm. Math. Phys. 79 (1981), no. 1, 47–51. MR609227
[SY81b] Richard Schoen and Shing Tung Yau, Proof of the positive mass theorem. II, Comm.
Math. Phys. 79 (1981), no. 2, 231–260. MR612249
[SY83] Richard Schoen and S. T. Yau, The existence of a black hole due to condensation of
matter, Comm. Math. Phys. 90 (1983), no. 4, 575–579. MR719436
[SY88] R. Schoen and S.-T. Yau, Conformally flat manifolds, Kleinian groups and scalar cur-
vature, Invent. Math. 92 (1988), no. 1, 47–71, DOI 10.1007/BF01393992. MR931204
[SY94] R. Schoen and S.-T. Yau, Lectures on differential geometry, Conference Proceedings
and Lecture Notes in Geometry and Topology, I, International Press, Cambridge,
MA, 1994. Lecture notes prepared by Wei Yue Ding, Kung Ching Chang [Gong Qing
Zhang], Jia Qing Zhong and Yi Chao Xu; Translated from the Chinese by Ding and S.
Y. Cheng; With a preface translated from the Chinese by Kaising Tso. MR1333601
[SY17] Richard Schoen and Shing-Tung Yau, Positive scalar curvature and minimal hyper-
surface singularities, arXiv:1704.05490 (2017).
[Sch08] Fernando Schwartz, Existence of outermost apparent horizons with product
of spheres topology, Comm. Anal. Geom. 16 (2008), no. 4, 799–817, DOI
10.4310/CAG.2008.v16.n4.a3. MR2471370
[SZ97] Ying Shen and Shunhui Zhu, Rigidity of stable minimal hypersurfaces, Math. Ann.
309 (1997), no. 1, 107–116, DOI 10.1007/s002080050105. MR1467649
[Shi89] Wan-Xiong Shi, Ricci deformation of the metric on complete noncompact Riemann-
ian manifolds, J. Differential Geom. 30 (1989), no. 2, 303–394. MR1010165
[ST02] Yuguang Shi and Luen-Fai Tam, Positive mass theorem and the boundary behaviors
of compact manifolds with nonnegative scalar curvature, J. Differential Geom. 62
(2002), no. 1, 79–125. MR1987378
[Sim68] James Simons, Minimal varieties in riemannian manifolds, Ann. of Math. (2) 88
(1968), 62–105, DOI 10.2307/1970556. MR0233295
[Sim83] Leon Simon, Lectures on geometric measure theory, Proceedings of the Centre for
Mathematical Analysis, Australian National University, vol. 3, Australian National
University, Centre for Mathematical Analysis, Canberra, 1983. MR756417
[Sim95] Leon Simon, Rectifiability of the singular sets of multiplicity 1 minimal surfaces and
energy minimizing maps, Surveys in differential geometry, Vol. II (Cambridge, MA,
1993), Int. Press, Cambridge, MA, 1995, pp. 246–305. MR1375258
[Sim97] Leon Simon, Schauder estimates by scaling, Calc. Var. Partial Differential Equations
5 (1997), no. 5, 391–407, DOI 10.1007/s005260050072. MR1459795
[Sim02] Miles Simon, Deformation of C 0 Riemannian metrics in the direction of
their Ricci curvature, Comm. Anal. Geom. 10 (2002), no. 5, 1033–1074, DOI
10.4310/CAG.2002.v10.n5.a7. MR1957662
[Sma93] Nathan Smale, Generic regularity of homologically area minimizing hypersurfaces
in eight-dimensional manifolds, Comm. Anal. Geom. 1 (1993), no. 2, 217–228, DOI
10.4310/CAG.1993.v1.n2.a2. MR1243523
[Smi82] Francis R. Smith, On the existence of embedded minimal 2-spheres in the 3-sphere,
endowed with an arbitrary Riemannian metric, 1982. Thesis (Ph.D.)–University of
Melbourne.
[SSA17] Christina Sormani and Iva Stavrov Allen, Geometrostatic manifolds of small ADM
mass, arXiv:1707.03008 (2017).
[Spi79] Michael Spivak, A comprehensive introduction to differential geometry. Vol. I, 2nd
ed., Publish or Perish, Inc., Wilmington, Del., 1979. MR532830
358 Bibliography
[Sto92] Stephan Stolz, Simply connected manifolds of positive scalar curvature, Ann. of Math.
(2) 136 (1992), no. 3, 511–540, DOI 10.2307/2946598. MR1189863
[Tak94] Peter Takáč, A short elementary proof of the Kreı̆n-Rutman theorem, Houston J.
Math. 20 (1994), no. 1, 93–98. MR1272563
[Tam84] I. Tamanini, On the sphericity of liquid droplets (English, with French summary),
Astérisque 118 (1984), 235–241. Variational methods for equilibrium problems of
fluids (Trento, 1983). MR761754
[Top59] V. A. Toponogov, Evaluation of the length of a closed geodesic on a convex surface
(Russian), Dokl. Akad. Nauk SSSR 124 (1959), 282–284. MR0102055
[Tra58] A. Trautman, Radiation and boundary conditions in the theory of gravitation, Bull.
Acad. Polon. Sci. Sér. Sci. Math. Astr. Phys. 6 (1958), 407–412. MR0097266
[Tru68] Neil S. Trudinger, Remarks concerning the conformal deformation of Riemannian
structures on compact manifolds, Ann. Scuola Norm. Sup. Pisa (3) 22 (1968), 265–
274. MR0240748
[TW14] Wilderich Tuschmann and Michael Wiemeler, Differentiable stability and sphere
theorems for manifolds and einstein manifolds with positive scalar curvature,
arXiv:1408.3006 (2014).
[Won12] Willie Wai-Yeung Wong, A positive mass theorem for two spatial dimensions,
arXiv:1202.6279 (2012).
[Wal84] Robert M. Wald, General relativity, University of Chicago Press, Chicago, IL, 1984.
MR757180
[Wan03] Li He Wang, A geometric approach to the Calderón-Zygmund estimates, Acta
Math. Sin. (Engl. Ser.) 19 (2003), no. 2, 381–396, DOI 10.1007/s10114-003-0264-4.
MR1987802
[Wan15] Mu-Tao Wang, Four lectures on quasi-local mass, arXiv:1510.02931 (2015).
[WY09] Mu-Tao Wang and Shing-Tung Yau, Isometric embeddings into the Minkowski space
and new quasi-local mass, Comm. Math. Phys. 288 (2009), no. 3, 919–942, DOI
10.1007/s00220-009-0745-0. MR2504860
[Wan01] Xiaodong Wang, The mass of asymptotically hyperbolic manifolds, J. Differential
Geom. 57 (2001), no. 2, 273–299. MR1879228
[Whi87] Brian White, The space of m-dimensional surfaces that are stationary for a para-
metric elliptic functional, Indiana Univ. Math. J. 36 (1987), no. 3, 567–602, DOI
10.1512/iumj.1987.36.36031. MR905611
[Wik] Wikipedia contributors, Wikipedia, the free encyclopedia. [Online].
[Wil65] T. J. Willmore, Note on embedded surfaces (English, with Romanian and Russian
summaries), An. Şti. Univ. “Al. I. Cuza” Iaşi Secţ. I a Mat. (N.S.) 11B (1965),
493–496. MR0202066
[Wit81] Edward Witten, A new proof of the positive energy theorem, Comm. Math. Phys. 80
(1981), no. 3, 381–402. MR626707
[Wlo87] J. Wloka, Partial differential equations, Cambridge University Press, Cambridge,
1987. Translated from the German by C. B. Thomas and M. J. Thomas. MR895589
[Yam60] Hidehiko Yamabe, On a deformation of Riemannian structures on compact mani-
folds, Osaka Math. J. 12 (1960), 21–37. MR0125546
[Zip09] Nina Zipser, Part II: Solutions of the Einstein-Maxwell equations, Extensions of the
stability theorem of the Minkowski space in general relativity, AMS/IP Stud. Adv.
Math., vol. 45, Amer. Math. Soc., Providence, RI, 2009, pp. 297–491. MR2537048
Index
359
360 Index
time-symmetric, 225
timelike, 210
timelike hypersurface, 213
trapped surface, 234
Trautman-Bondi mass, 250
two-sided, 24
uniformization theorem, 21
Yamabe positive, 18
Yamabe problem, 21
GSM/201
www.ams.org