Lecture Notes on General Relativity (GR)

lazierthanthou
(https://github.com/lazierthanthou/Lecture_Notes_GR)
March 24, 2016

Contents
1 Topology
1.1 Topological Spaces
1.2 Continuous maps
1.3 Composition of continuous maps
1.4 Inheriting a topology
2 Manifolds
2.1 Topological manifolds
2.2 Terminology
2.3 Chart transition maps
2.4 Manifold philosophy
3 Multilinear Algebra
3.1 Vector Spaces
3.2 Linear Maps
3.3 Vector Space of Homomorphisms
3.4 Dual Vector Spaces
3.5 Tensors
3.6 Vectors and Covectors as Tensors
3.7 Bases
3.8 Basis for the Dual Space
3.9 Components of Tensors
4 Differential Manifolds
4.1 Compatible charts
4.2 Diffeomorphisms
5 Tangent Spaces
5.1 Velocities
5.2 Tangent vector space
5.3 Components of a vector w.r.t. a chart
5.4 Chart-induced basis
5.5 Change of vector components under a change of chart
5.6 Cotangent spaces
5.7 Change of components of a covector under a change of chart
6 Fields
6.1 Bundles
6.2 Tangent bundle of smooth manifold
6.3 Vector fields
6.4 The $C^\infty(M)$-module $\Gamma(TM)$
6.5 Tensor fields
7 Connections
7.1 Directional derivatives of tensor fields
7.2 New structure on $(M, \mathcal{O}, \mathcal{A})$ required to fix $\nabla$
7.3 Change of $\Gamma$s under change of chart
7.4 Normal Coordinates
8 Parallel Transport & Curvature
8.1 Parallelity of vector fields
8.2 Autoparallely transported curves
8.3 Autoparallel equation
8.4 Torsion
8.5 Curvature
9 Newtonian spacetime is curved!
9.1 Laplace's questions
9.2 The full wisdom of Newton I
9.3 The foundations of the geometric formulation of Newton's axiom
10 Metric Manifolds
10.1 Metrics
10.2 Signature
10.3 Length of a curve
10.4 Geodesics
11 Symmetry
11.1 Push-forward map
11.2 Pull-back map
11.3 Flow of a complete vector field
11.4 Lie subalgebras of the Lie algebra $(\Gamma(TM), [\cdot, \cdot])$ of vector fields
11.5 Symmetry
11.6 Lie derivative
12 Integration
12.1 Review of integration on $\mathbb{R}^d$
12.2 Integration on one chart
12.3 Volume forms
12.4 Integration on one chart domain $U$
12.5 Integration on the entire manifold
13 Lecture 13: Relativistic spacetime
13.1 Time orientation
13.2 Observers
13.3 Role of the Lorentz transformations
14 Lecture 14: Matter
14.1 Point matter
14.2 Field matter
14.3 Energy-momentum tensor of matter fields
15 Einstein gravity
15.1 Hilbert
15.2 Variation of $S_{Hilbert}$
15.3 Solution of the $\nabla_a T^{ab} = 0$ issue
15.4 Variants of the field equations
16 L18: Canonical Formulation of GR-I
16.1 Dynamical and Hamiltonian formulation of General Relativity
17 Lecture 22: Black Holes
17.1 Radial null geodesics
17.2 Eddington-Finkelstein

Abstract
These are lecture notes on General Relativity.
They are based on the Central Lecture Course by Dr. Frederic P. Schuller (A thorough introduction to the
theory of general relativity), which introduces the mathematical and physical foundations of the theory in
24 self-contained lectures. The course was given at the WE Heraeus International Winter School on Gravity
and Light, 2015, in Linz, Austria, as part of the world-wide celebrations of the 100th anniversary of
Einstein's theory of general relativity and the International Year of Light 2015.
These lectures develop the theory from first principles and aim at an audience ranging from ambitious
undergraduate students to beginning PhD students in mathematics and physics. Satellite Lectures (see
other videos on this channel) by Bernard F Schutz (Gravitational Waves), Domenico Giulini (Canonical
Formulation of Gravity), Marcus C Werner (Gravitational Lensing) and Valeria Pettorino (Cosmic Microwave
Background) expand on the topics of this central lecture course and take students to the research frontier.
Spacetime is the key physical object that we shall be concerned with.
Spacetime is a 4-dimensional topological manifold with a smooth atlas carrying a torsion-free connection compatible with a Lorentzian metric and a time orientation satisfying the Einstein equations.

1 Topology
Motivation: At the coarsest level, spacetime is a set. But a set is not enough to talk about continuity
of maps, which is required for classical physics notions such as the trajectory of a particle. We do not want
jumps such as a particle disappearing at some point on its trajectory and reappearing somewhere else. So we
require continuity of maps. There could be many structures that allow us to talk about continuity, e.g.,
a distance measure. But we need to be very minimal and very economical in order not to introduce undue
assumptions. So we are interested in the weakest structure that can be established on a set which allows a
good definition of continuity of maps. Mathematicians know that the weakest such structure is a topology.
This is the reason for studying topological spaces.

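1.1 Topological Spaces

Definition 1. Let $M$ be a set and $\mathcal{P}(M)$ be the power set of $M$, i.e., the set of all subsets of $M$.
A set $\mathcal{O} \subseteq \mathcal{P}(M)$ is called a topology if it satisfies the following:
(i) $\emptyset \in \mathcal{O}$ and $M \in \mathcal{O}$
(ii) $U \in \mathcal{O}, V \in \mathcal{O} \implies U \cap V \in \mathcal{O}$
(iii) $U_\alpha \in \mathcal{O}$ for all $\alpha \in A$ ($A$ is an index set) $\implies \bigcup_{\alpha \in A} U_\alpha \in \mathcal{O}$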

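Terminology:
1. the tuple $(M, \mathcal{O})$ is a topological space.
2. $U \subseteq M$ is an open set if $U \in \mathcal{O}$.
3. $U \subseteq M$ is a closed set if $M \setminus U \in \mathcal{O}$.
Definition 2. $(M, \mathcal{O})$, where $\mathcal{O} = \{\emptyset, M\}$, is called the chaotic topology.
Definition 3. $(M, \mathcal{O})$, where $\mathcal{O} = \mathcal{P}(M)$, is called the discrete topology.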
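For a finite set, the axioms of Definition 1 can be checked mechanically. The following is a minimal sketch, not part of the lectures; the helper names `is_topology` and `power_set` are hypothetical. It relies on the fact that for a finite collection, closure under pairwise unions already gives closure under arbitrary unions.

```python
from itertools import combinations


def is_topology(M, O):
    """Check the axioms of Definition 1 for a finite set M and a collection O."""
    M = frozenset(M)
    O = {frozenset(U) for U in O}
    # (i) the empty set and M itself must be open
    if frozenset() not in O or M not in O:
        return False
    # every open set must be a subset of M
    if any(not U <= M for U in O):
        return False
    # (ii) pairwise intersections and (iii) pairwise unions must stay in O
    for U, V in combinations(O, 2):
        if (U & V) not in O or (U | V) not in O:
            return False
    return True


def power_set(M):
    """All subsets of the finite set M."""
    items = list(M)
    return [set(c) for r in range(len(items) + 1) for c in combinations(items, r)]


M = {1, 2, 3}
chaotic = [set(), M]                    # Definition 2
discrete = power_set(M)                 # Definition 3
not_a_topology = [set(), {1}, {2}, M]   # the union {1, 2} is missing

print(is_topology(M, chaotic))          # True
print(is_topology(M, discrete))         # True
print(is_topology(M, not_a_topology))   # False
```

The third collection fails only axiom (iii), which shows that the union condition is a genuine restriction.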
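Definition 4. A soft ball at the point $p$ in $\mathbb{R}^d$ is the set
$$B_r(p) := \left\{ (q_1, q_2, \dots, q_d) \;\middle|\; \sum_{i=1}^{d} (q_i - p_i)^2 < r^2 \right\}, \quad \text{where } r \in \mathbb{R}^+ \tag{1.1}$$

Definition 5. $(\mathbb{R}^d, \mathcal{O}_{std})$ is the standard topology, provided that $U \in \mathcal{O}_{std}$ iff
$\forall p \in U, \exists r \in \mathbb{R}^+ : B_r(p) \subseteq U$
Proof (that $\mathcal{O}_{std}$ is a topology). $\emptyset \in \mathcal{O}_{std}$ since $\forall p \in \emptyset, \exists r \in \mathbb{R}^+ : B_r(p) \subseteq \emptyset$ (i.e. satisfied vacuously).
$\mathbb{R}^d \in \mathcal{O}_{std}$ since $\forall p \in \mathbb{R}^d, \exists r = 1 \in \mathbb{R}^+ : B_r(p) \subseteq \mathbb{R}^d$.
Suppose $U, V \in \mathcal{O}_{std}$. Let $p \in U \cap V \implies \exists r_1, r_2 \in \mathbb{R}^+$ s.t. $B_{r_1}(p) \subseteq U$, $B_{r_2}(p) \subseteq V$.
Let $r = \min\{r_1, r_2\} \implies B_r(p) \subseteq U$ and $B_r(p) \subseteq V \implies B_r(p) \subseteq U \cap V \implies U \cap V \in \mathcal{O}_{std}$.
Suppose $U_\alpha \in \mathcal{O}_{std}$, $\forall \alpha \in A$. Let $p \in \bigcup_{\alpha \in A} U_\alpha \implies \exists \alpha \in A : p \in U_\alpha$
$\implies \exists r \in \mathbb{R}^+ : B_r(p) \subseteq U_\alpha \subseteq \bigcup_{\alpha \in A} U_\alpha \implies \bigcup_{\alpha \in A} U_\alpha \in \mathcal{O}_{std}$.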

1.2 Continuous maps

A map $f : M \to N$ connects each element of a set $M$ (domain set) to an element of a set $N$ (target
set).
Terminology:
1. If $f$ maps $m \in M$ to $n \in N$, then we may say $f(m) = n$, or $m$ maps to $n$, or $m \mapsto f(m)$, or $m \mapsto n$.
2. If $V \subseteq N$, $\mathrm{preim}_f(V) := \{m \in M \mid f(m) \in V\}$.
3. If $\forall n \in N, \exists m \in M : n = f(m)$, then $f$ is surjective. Or, $f : M \twoheadrightarrow N$.
4. If $\forall m_1, m_2 \in M$, $m_1 \neq m_2 \implies f(m_1) \neq f(m_2)$, then $f$ is injective. Or, $f : M \hookrightarrow N$.
Definition 6. Let $(M, \mathcal{O}_M)$ and $(N, \mathcal{O}_N)$ be topological spaces. A map $f : M \to N$ is called continuous
w.r.t. $\mathcal{O}_M$ and $\mathcal{O}_N$ if $V \in \mathcal{O}_N \implies \mathrm{preim}_f(V) \in \mathcal{O}_M$.
Mnemonic: A map is continuous iff the preimages of all open sets are open sets.
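The mnemonic can be tried out on finite spaces, where all preimages can be listed. This is an illustrative sketch only; the function names `preimage` and `is_continuous` and the tiny example spaces are assumptions made for the illustration. It shows that continuity depends on the chosen topologies, not just on the map.

```python
def preimage(f, domain, V):
    """preim_f(V) = {m in domain | f(m) in V}; the map f is given as a dict."""
    return frozenset(m for m in domain if f[m] in V)


def is_continuous(f, M, O_M, O_N):
    """True iff the preimage of every open set of the target is open in the domain."""
    open_in_M = {frozenset(U) for U in O_M}
    return all(preimage(f, M, V) in open_in_M for V in O_N)


M = {1, 2}
N = {"a", "b"}
chaotic_M = [set(), M]
discrete_M = [set(), {1}, {2}, M]
discrete_N = [set(), {"a"}, {"b"}, N]
f = {1: "a", 2: "b"}

# Same map, different topologies on the domain:
print(is_continuous(f, M, discrete_M, discrete_N))  # True
print(is_continuous(f, M, chaotic_M, discrete_N))   # False: preim_f({"a"}) = {1} is not open
```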

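1.3 Composition of continuous maps

Definition 7. If $f : M \to N$ and $g : N \to P$, then
$g \circ f : M \to P$ such that $m \mapsto (g \circ f)(m) := g(f(m))$
Theorem 1.1. If $f : M \to N$ is continuous w.r.t. $\mathcal{O}_M$ and $\mathcal{O}_N$ and $g : N \to P$ is continuous w.r.t. $\mathcal{O}_N$
and $\mathcal{O}_P$, then $g \circ f : M \to P$ is continuous w.r.t. $\mathcal{O}_M$ and $\mathcal{O}_P$.
Proof. Let $W \in \mathcal{O}_P$.
$\mathrm{preim}_{g \circ f}(W) = \{m \in M \mid g(f(m)) \in W\}$, since $(g \circ f)(m) = g(f(m))$
$= \{m \in M \mid f(m) \in \mathrm{preim}_g(W)\}$, where $\mathrm{preim}_g(W) \in \mathcal{O}_N$ since $g$ is continuous
$= \mathrm{preim}_f(\mathrm{preim}_g(W)) \in \mathcal{O}_M$, since $f$ is continuous
$\implies g \circ f$ is continuous.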

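1.4 Inheriting a topology

Given a topological space $(M, \mathcal{O}_M)$, one way of inheriting a topology from it is the subspace topology.
Theorem 1.2. If $(M, \mathcal{O}_M)$ is a topological space and $S \subseteq M$, then the set $\mathcal{O}|_S \subseteq \mathcal{P}(S)$ such that $\mathcal{O}|_S :=
\{S \cap U \mid U \in \mathcal{O}_M\}$ is a topology. $\mathcal{O}|_S$ is called the subspace topology inherited from $\mathcal{O}_M$.
Proof.
1. $\emptyset, S \in \mathcal{O}|_S$ since $\emptyset = S \cap \emptyset$ and $S = S \cap M$.
2. $S_1, S_2 \in \mathcal{O}|_S \implies \exists U_1, U_2 \in \mathcal{O}_M : S_1 = S \cap U_1, S_2 = S \cap U_2 \implies U_1 \cap U_2 \in \mathcal{O}_M$
$\implies S \cap (U_1 \cap U_2) \in \mathcal{O}|_S \implies (S \cap U_1) \cap (S \cap U_2) \in \mathcal{O}|_S \implies S_1 \cap S_2 \in \mathcal{O}|_S$.
3. Let $\alpha \in A$, where $A$ is an index set. Then $S_\alpha \in \mathcal{O}|_S \implies \exists U_\alpha \in \mathcal{O}_M : S_\alpha = S \cap U_\alpha$.
Further, let $U = \bigcup_{\alpha \in A} U_\alpha$. Therefore, $U \in \mathcal{O}_M$.
Now, $\bigcup_{\alpha \in A} S_\alpha = \bigcup_{\alpha \in A} (S \cap U_\alpha) = S \cap \bigcup_{\alpha \in A} U_\alpha = S \cap U \implies \bigcup_{\alpha \in A} S_\alpha \in \mathcal{O}|_S$.

Theorem 1.3. If $(M, \mathcal{O}_M)$ and $(N, \mathcal{O}_N)$ are topological spaces, and $f : M \to N$ is continuous w.r.t. $\mathcal{O}_M$ and
$\mathcal{O}_N$, then the restriction of $f$ to $S \subseteq M$, $f|_S : S \to N$ s.t. $f|_S(s) = f(s)$ for $s \in S$, is continuous w.r.t. $\mathcal{O}|_S$ and $\mathcal{O}_N$.
Proof. Let $V \in \mathcal{O}_N$. Then, $\mathrm{preim}_f(V) \in \mathcal{O}_M$.
Now $\mathrm{preim}_{f|_S}(V) = S \cap \mathrm{preim}_f(V) \implies \mathrm{preim}_{f|_S}(V) \in \mathcal{O}|_S \implies f|_S$ is continuous.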
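The construction in Theorem 1.2 is easy to carry out explicitly for a finite example. The sketch below is an illustration under assumed data: the three-point space and the helper name `subspace_topology` are hypothetical. It intersects every open set with $S$.

```python
def subspace_topology(S, O_M):
    """O|_S = {S ∩ U | U in O_M} for a finite collection O_M."""
    S = frozenset(S)
    return {S & frozenset(U) for U in O_M}


M = {1, 2, 3}
O_M = [set(), {1}, {1, 2}, M]   # a topology on M (nested open sets)
S = {2, 3}

O_S = subspace_topology(S, O_M)
print(sorted(sorted(U) for U in O_S))   # [[], [2], [2, 3]]
```

Note that $\{2\}$ is open in $S$ although it is not open in $M$: openness is always relative to the topology in use.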

2 Manifolds
Motivation: There exist so many topological spaces that mathematicians cannot even classify them. For
spacetime physics, we may focus on topological spaces (M, O) that can be charted, analogously to how the
surface of the earth is charted in an atlas.

2.1 Topological manifolds

Definition 8. A topological space $(M, \mathcal{O})$ is called a $d$-dimensional topological manifold if
$\forall p \in M : \exists U \in \mathcal{O} : p \in U$, $\exists x : U \to x(U) \subseteq \mathbb{R}^d$ satisfying the following:
(i) $x$ is invertible: $x^{-1} : x(U) \to U$
(ii) $x$ is continuous w.r.t. $(M, \mathcal{O})$ and $(\mathbb{R}^d, \mathcal{O}_{std})$
(iii) $x^{-1}$ is continuous

2.2 Terminology

1. The tuple $(U, x)$ is a chart of $(M, \mathcal{O})$.
2. An atlas of $(M, \mathcal{O})$ is a set $\mathcal{A} = \{(U_\alpha, x_\alpha) \mid \alpha \in A, \text{ an index set}\}$ such that $\bigcup_{\alpha \in A} U_\alpha = M$.
3. The map $x : U \to x(U) \subseteq \mathbb{R}^d$ is called the chart map.
4. The chart map $x$ maps a point $p \in U$ to a $d$-tuple of real numbers $x(p) = (x^1(p), x^2(p), \dots, x^d(p))$. This
is equivalent to $d$-many maps $x^i : U \to \mathbb{R}$, which are called the coordinate maps.
5. If $p \in U$, then $x^i(p)$ is the $i$th coordinate of $p$ w.r.t. the chart $(U, x)$.

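2.3 Chart transition maps

Imagine 2 charts $(U, x)$ and $(V, y)$ with overlapping regions, i.e., $U \cap V \neq \emptyset$.

[Diagram: the chart maps $x : U \cap V \to x(U \cap V) \subseteq \mathbb{R}^d$ and $y : U \cap V \to y(U \cap V) \subseteq \mathbb{R}^d$, the inverse $x^{-1}$, and the composite $y \circ x^{-1} : x(U \cap V) \to y(U \cap V)$.]

The map $y \circ x^{-1}$ is called the chart transition map, which maps an open set of $\mathbb{R}^d$ to another open set of
$\mathbb{R}^d$. This map is continuous because it is a composition of two continuous maps. Informally, these chart transition
maps contain instructions on how to glue together the charts of an atlas.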
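A concrete chart transition map can be computed for the circle $S^1 \subset \mathbb{R}^2$. This example is not from the lectures: it assumes the two stereographic charts, projecting from the north pole $(0, 1)$ and from the south pole $(0, -1)$. A minimal sketch:

```python
def x(p):
    """Chart on U = S^1 minus the north pole (0, 1): stereographic projection."""
    return p[0] / (1.0 - p[1])


def x_inv(t):
    """Inverse chart map x^{-1}: R -> U."""
    return (2.0 * t / (t * t + 1.0), (t * t - 1.0) / (t * t + 1.0))


def y(p):
    """Chart on V = S^1 minus the south pole (0, -1)."""
    return p[0] / (1.0 + p[1])


def transition(t):
    """Chart transition map y ∘ x^{-1}, defined on x(U ∩ V) = R minus {0}."""
    return y(x_inv(t))


for t in (0.5, 2.0, -3.0):
    # on the overlap the transition map agrees with t -> 1/t
    print(t, transition(t), 1.0 / t)
```

Working it out by hand gives $(y \circ x^{-1})(t) = 1/t$ on $\mathbb{R} \setminus \{0\}$, which is continuous there, as the general argument predicts.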

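2.4 Manifold philosophy

Often it is desirable (or indeed the only way) to define properties (e.g., continuity) of a real-world object (e.g.,
the curve $\gamma : \mathbb{R} \to M$) by judging suitable conditions not on the real-world object itself, but on a chart
representation of that real-world object.

For example, in the picture below, we can use the map $x \circ \gamma$ to infer the continuity of the curve $\gamma$ in $U \subseteq M$.

[Diagram: the curve $\gamma : \mathbb{R} \to U \subseteq M$, the chart map $x : U \to x(U) \subseteq \mathbb{R}^d$, and the chart representation $x \circ \gamma : \mathbb{R} \to x(U) \subseteq \mathbb{R}^d$.]

However, we need to ensure that the defined property does not change if we change our chosen chart. For
example, in the picture below, continuity of $x \circ \gamma$ should imply continuity of $y \circ \gamma$. This is true, since
$y \circ \gamma = y \circ (x^{-1} \circ x) \circ \gamma = (y \circ x^{-1}) \circ (x \circ \gamma)$ is continuous because it is a composition of two
continuous functions, thanks to the continuity of the chart transition map $y \circ x^{-1}$.

[Diagram: the curve $\gamma : \mathbb{R} \to U \subseteq M$ with two chart representations $x \circ \gamma : \mathbb{R} \to x(U) \subseteq \mathbb{R}^d$ and $y \circ \gamma : \mathbb{R} \to y(U) \subseteq \mathbb{R}^d$, related by $x^{-1}$ and the chart transition map $y \circ x^{-1}$.]

What about differentiability? Does differentiability of $x \circ \gamma$ guarantee differentiability of $y \circ \gamma$? No, since the
composition of a differentiable map and a continuous map might only be continuous. The solution is to restrict
the atlas by removing those charts whose chart transition maps are not differentiable. Thus, we have got rid of
our problem. However, we must remember that with the present structure, we cannot define differentiability at
manifold level since we do not know how to subtract or divide in $U \subseteq M$. Therefore, differentiability of
$\gamma : \mathbb{R} \to M$ makes no sense yet.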

3 Multilinear Algebra
Motivation: The essential object of study of linear algebra is the vector space. However, a word of warning
here. We will not equip space(time) with vector space structure. This is evident since, unlike in a vector
space, expressions such as $5 \cdot \text{Paris}$ and $\text{Paris} + \text{Vienna}$ do not make any sense. If multilinear algebra does
not further our aim of studying spacetime, then why do we study it? The tangent space $T_p M$ (defined
in Lecture 5) at a point $p$ of a smooth manifold $M$ (defined in Lecture 4) carries a vector space structure
in a natural way even though the underlying position space(time) does not have a vector space structure.
Once we have a notion of tangent space, we have a derived notion of a tensor. Tensors are very important
in differential geometry.
It is beneficial to study vector spaces (and all that comes with them) abstractly for two reasons: (i) for the
construction of $T_p M$, one needs an intermediate vector space $C^\infty(M)$, and (ii) tensor techniques are most
easily understood in an abstract setting.

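3.1 Vector Spaces

Definition 9. An $\mathbb{R}$-vector space is a triple $(V, +, \cdot)$, where
i) $V$ is a set,
ii) $+ : V \times V \to V$ (addition), and
iii) $\cdot : \mathbb{R} \times V \to V$ (S-multiplication)
satisfying the following:
a) $\forall u, v \in V : u + v = v + u$ (commutativity of $+$)
b) $\forall u, v, w \in V : (u + v) + w = u + (v + w)$ (associativity of $+$)
c) $\exists 0 \in V : \forall v \in V : 0 + v = v$ (neutral element in $+$)
d) $\forall v \in V : \exists (-v) \in V : v + (-v) = 0$ (inverse of element in $+$)
e) $\forall \lambda, \mu \in \mathbb{R}, \forall v \in V : \lambda \cdot (\mu \cdot v) = (\lambda \mu) \cdot v$ (associativity in $\cdot$)
f) $\forall \lambda, \mu \in \mathbb{R}, \forall v \in V : (\lambda + \mu) \cdot v = \lambda \cdot v + \mu \cdot v$ (distributivity of $\cdot$)
g) $\forall \lambda \in \mathbb{R}, \forall u, v \in V : \lambda \cdot u + \lambda \cdot v = \lambda \cdot (u + v)$ (distributivity of $\cdot$)
h) $\exists 1 \in \mathbb{R} : \forall v \in V : 1 \cdot v = v$ (unit element in $\cdot$)

Terminology: If $(V, +, \cdot)$ is a vector space, an element of $V$ is often referred to, informally, as a vector. But
we should remember that it makes no sense to call an element of $V$ a vector unless the vector space itself is
specified.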
Example: Consider the set of polynomials of degree at most $N$,
$$P := \left\{ p : (-1, +1) \to \mathbb{R} \;\middle|\; p(x) = \sum_{n=0}^{N} p_n x^n, \text{ where } p_n \in \mathbb{R} \right\}$$
with $\oplus : P \times P \to P$ with $(p, q) \mapsto p \oplus q : (p \oplus q)(x) = p(x) + q(x)$ and
$\odot : \mathbb{R} \times P \to P$ with $(\lambda, p) \mapsto \lambda \odot p : (\lambda \odot p)(x) = \lambda \cdot p(x)$. $(P, \oplus, \odot)$ is a vector space.
Caution: We are considering real vector spaces, that is, S-multiplication with the elements of $\mathbb{R}$. We shall often
use the same symbols $+$ and $\cdot$ for different vector spaces, but the context should make things clear. When $\mathbb{R}$, $\mathbb{R}^2$,
etc. are used as vector spaces, the obvious (natural) operations shall be understood to be used.

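3.2 Linear Maps

These are the structure-respecting maps between vector spaces.
Definition 10. If $(V, +_v, \cdot_v)$ and $(W, +_w, \cdot_w)$ are vector spaces, then $\phi : V \to W$ is called a linear map if
i) $\forall v, \tilde{v} \in V : \phi(v +_v \tilde{v}) = \phi(v) +_w \phi(\tilde{v})$, and
ii) $\forall \lambda \in \mathbb{R}, v \in V : \phi(\lambda \cdot_v v) = \lambda \cdot_w \phi(v)$.

Notation: $\phi : V \to W$ is a linear map $\iff \phi : V \xrightarrow{\sim} W$

Example: Consider the vector space $(P, \oplus, \odot)$ from the above example.
Then, $\delta : P \to P$ with $p \mapsto \delta(p) := p'$ is a linear map, because
$\forall p, q \in P : \delta(p \oplus q) = (p \oplus q)' = p' \oplus q' = \delta(p) \oplus \delta(q)$ and
$\forall \lambda \in \mathbb{R}, p \in P : \delta(\lambda \odot p) = (\lambda \odot p)' = \lambda \odot p'$.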
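The linearity of the derivative map on polynomials can be checked on coefficient lists. In this sketch a polynomial $p(x) = \sum_n p_n x^n$ is stored as the list `[p_0, ..., p_N]`; the function names `add`, `smul` and `derivative` are hypothetical stand-ins for the addition, S-multiplication and derivative map of the example.

```python
def add(p, q):
    """Pointwise addition of polynomials, done coefficient by coefficient."""
    return [a + b for a, b in zip(p, q)]


def smul(lam, p):
    """S-multiplication of a polynomial by the real number lam."""
    return [lam * a for a in p]


def derivative(p):
    """p -> p': sum p_n x^n goes to sum n p_n x^(n-1), padded to the same length."""
    return [n * p[n] for n in range(1, len(p))] + [0]


p = [1, 2, 0, 4]     # 1 + 2x + 4x^3
q = [0, -1, 3, 5]    # -x + 3x^2 + 5x^3
lam = 7

# condition i): the derivative of a sum is the sum of the derivatives
print(derivative(add(p, q)) == add(derivative(p), derivative(q)))   # True
# condition ii): scalars can be pulled out
print(derivative(smul(lam, p)) == smul(lam, derivative(p)))         # True
```

Two sample polynomials do not prove linearity, of course; the check only illustrates what conditions i) and ii) of Definition 10 assert.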

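Theorem 3.1. If $\phi : U \xrightarrow{\sim} V$ and $\psi : V \xrightarrow{\sim} W$, then $\psi \circ \phi : U \xrightarrow{\sim} W$.

Proof. $\forall u, \tilde{u} \in U$: $(\psi \circ \phi)(u +_u \tilde{u}) = \psi(\phi(u +_u \tilde{u})) = \psi(\phi(u) +_v \phi(\tilde{u})) = \psi(\phi(u)) +_w \psi(\phi(\tilde{u})) = (\psi \circ \phi)(u) +_w (\psi \circ \phi)(\tilde{u})$.
$\forall \lambda \in \mathbb{R}, u \in U$: $(\psi \circ \phi)(\lambda \cdot_u u) = \psi(\phi(\lambda \cdot_u u)) = \psi(\lambda \cdot_v \phi(u)) = \lambda \cdot_w \psi(\phi(u)) = \lambda \cdot_w (\psi \circ \phi)(u)$.
Example: Consider the vector space $(P, \oplus, \odot)$ and the differential $\delta : P \xrightarrow{\sim} P$ with $p \mapsto \delta(p) := p'$ from the
previous example. Then $p \mapsto p''$, the second differential, is also linear since it is a composition of two linear maps, i.e.,
$\delta \circ \delta : P \xrightarrow{\sim} P$.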

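3.3 Vector Space of Homomorphisms

Definition 11. If $(V, +, \cdot)$ and $(W, +, \cdot)$ are vector spaces, then $\mathrm{Hom}(V, W) := \{\phi : V \xrightarrow{\sim} W\}$.
Theorem 3.2. $(\mathrm{Hom}(V, W), +, \cdot)$ is a vector space with
$+ : \mathrm{Hom}(V, W) \times \mathrm{Hom}(V, W) \to \mathrm{Hom}(V, W)$ with $(\phi, \psi) \mapsto \phi + \psi : (\phi + \psi)(v) = \phi(v) + \psi(v)$ and
$\cdot : \mathbb{R} \times \mathrm{Hom}(V, W) \to \mathrm{Hom}(V, W)$ with $(\lambda, \phi) \mapsto \lambda \cdot \phi : (\lambda \cdot \phi)(v) = \lambda \cdot \phi(v)$.
Example: $(\mathrm{Hom}(P, P), +, \cdot)$ is a vector space. With the derivative map $\delta : p \mapsto p'$, we have $\delta \in \mathrm{Hom}(P, P)$, $\delta \circ \delta \in \mathrm{Hom}(P, P)$, $\delta \circ \delta \circ \delta \in \mathrm{Hom}(P, P)$, etc.
Therefore, maps such as $5 \cdot \delta + \delta \circ \delta \in \mathrm{Hom}(P, P)$. Thus, mixed order derivatives are in $\mathrm{Hom}(P, P)$, and hence
linear.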

3.4 Dual Vector Spaces

Definition 12. If $(V, +, \cdot)$ is a vector space, and $V^* := \{\phi : V \xrightarrow{\sim} \mathbb{R}\} = \mathrm{Hom}(V, \mathbb{R})$, then
$(V^*, +, \cdot)$ is called the dual vector space to $V$.
Terminology: $\omega \in V^*$ is called, informally, a covector.

Example: Consider $I : P \xrightarrow{\sim} \mathbb{R}$, i.e., $I \in P^*$. We define $I(p) := \int_0^1 p(x)\, dx$, which can easily be checked to be
linear with $I(p + q) = I(p) + I(q)$ and $I(\lambda \cdot p) = \lambda \cdot I(p)$. Thus $I$ is a covector, which is the integration operator
$\int_0^1 (\cdot)\, dx$ which eats a function.

Remarks: We shall also see later that the gradient is a covector. In fact, lots of things in a physicist's life which
are covectors have been called vectors so as not to bother you with details. But covectors are neither esoteric nor
unnatural.
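The integration covector from the example above can be evaluated exactly on coefficient lists, since $\int_0^1 x^n\, dx = 1/(n+1)$. The sketch below assumes integer coefficients and the hypothetical list representation `[p_0, ..., p_N]`; it is an illustration, not part of the lectures.

```python
from fractions import Fraction


def I(p):
    """I(p) = integral of p over [0, 1] for p(x) = sum p_n x^n (integer coefficients)."""
    return sum(Fraction(c, n + 1) for n, c in enumerate(p))


def add(p, q):
    """Pointwise addition of polynomials."""
    return [a + b for a, b in zip(p, q)]


def smul(lam, p):
    """S-multiplication by the integer lam."""
    return [lam * a for a in p]


p = [1, 2, 0, 4]     # 1 + 2x + 4x^3
q = [0, -1, 3, 5]    # -x + 3x^2 + 5x^3

print(I(p), I(q))                       # 3 7/4
print(I(add(p, q)) == I(p) + I(q))      # True
print(I(smul(7, p)) == 7 * I(p))        # True
```

The covector takes a whole polynomial and returns a single number, which is the sense in which it "eats a function".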

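3.5 Tensors

We can think of tensors as multilinear maps.
Definition 13. Let $(V, +, \cdot)$ be a vector space. An $(r, s)$-tensor $T$ over $V$ is a multilinear map
$$T : \underbrace{V^* \times \cdots \times V^*}_{r \text{ times}} \times \underbrace{V \times \cdots \times V}_{s \text{ times}} \xrightarrow{\sim} \mathbb{R}$$

Example: If $T$ is a $(1,1)$-tensor, then
$T(\omega_1 + \omega_2, v) = T(\omega_1, v) + T(\omega_2, v)$,
$T(\omega, v_1 + v_2) = T(\omega, v_1) + T(\omega, v_2)$,
$T(\lambda \cdot \omega, v) = \lambda \cdot T(\omega, v)$, and
$T(\omega, \lambda \cdot v) = \lambda \cdot T(\omega, v)$.
Thus, $T(\omega_1 + \omega_2, v_1 + v_2) = T(\omega_1, v_1) + T(\omega_1, v_2) + T(\omega_2, v_1) + T(\omega_2, v_2)$.

Remarks: Sometimes it is said that a $(1,1)$-tensor is something that eats a vector and outputs a vector. Here
is why. For $T : V^* \times V \xrightarrow{\sim} \mathbb{R}$, define $\phi_T : V \xrightarrow{\sim} (V^*)^*$ with $v \mapsto T((\cdot), v)$. But, clearly $T((\cdot), v) : V^* \xrightarrow{\sim} \mathbb{R}$,
which eats a covector and spits out a number. In other words, $T((\cdot), v) \in (V^*)^*$. Although we are yet to define
dimension, let us just trust, for the time being, that for finite-dimensional vector spaces, $(V^*)^* = V$. So,
$\phi_T : V \xrightarrow{\sim} V$.

Example: Let $g : P \times P \xrightarrow{\sim} \mathbb{R}$ with $(p, q) \mapsto \int_{-1}^{1} p(x) \cdot q(x)\, dx$. Then, $g$ is a $(0,2)$-tensor over $P$.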

3.6 Vectors and Covectors as Tensors

Theorem 3.3. If $(V, +, \cdot)$ is a vector space, $\omega \in V^*$ is a $(0,1)$-tensor.

Proof. $\omega \in V^*$ and, by definition, $V^* := \{\phi : V \xrightarrow{\sim} \mathbb{R}\}$, which is a collection of $(0,1)$-tensors.

Theorem 3.4. If $(V, +, \cdot)$ is a vector space, $v \in V$ is a $(1,0)$-tensor.

Proof. We have already stated, without proof and without defining dimensions, that $V = (V^*)^*$ for finite-dimensional
vector spaces. Therefore, $v \in V \implies v \in (V^*)^* \implies v : V^* \xrightarrow{\sim} \mathbb{R} \implies v$ is a
$(1,0)$-tensor.

3.7 Bases

Definition 14. Let $(V, +, \cdot)$ be a vector space. A subset $B \subseteq V$ is called a basis if
$$\forall v \in V, \exists! e_1, e_2, \dots, e_n \in B, \exists! v^1, v^2, \dots, v^n \in \mathbb{R} : v = \sum_{i=1}^{n} v^i e_i.$$

Definition 15. A vector space $(V, +, \cdot)$ with a basis $B$ is said to be $d$-dimensional if $B$ has $d$ elements. In
other words, $\dim V := d$.
Remarks: The above definition is well-defined only if every basis of a vector space has the same number of
elements.

Remarks: Let $(V, +, \cdot)$ be a vector space. Having chosen a basis $e_1, e_2, \dots, e_n$, we may uniquely associate
$v \mapsto (v^1, v^2, \dots, v^n)$, these numbers being the components of $v$ w.r.t. the chosen basis, where $v = \sum_{i=1}^{n} v^i e_i$.

3.8 Basis for the Dual Space

Let $(V, +, \cdot)$ be a vector space. Having chosen a basis $e_1, e_2, \dots, e_n$ for $V$, we can choose a basis $\epsilon^1, \epsilon^2, \dots, \epsilon^n$ for
$V^*$ entirely independent of the basis of $V$. However, it is more economical to require that
$$\epsilon^a(e_b) = \delta^a_b = \begin{cases} 1 & \text{if } a = b \\ 0 & \text{if } a \neq b \end{cases}$$
This uniquely determines $\epsilon^1, \epsilon^2, \dots, \epsilon^n$ from the choice of $e_1, e_2, \dots, e_n$.
Remarks: The reason for using indices as superscripts or subscripts is to be able to use the Einstein summation
convention, which will be helpful in dropping cumbersome $\sum$ symbols in several equations.
Definition 16. For a basis $e_1, e_2, \dots, e_n$ of a vector space $(V, +, \cdot)$, $\epsilon^1, \epsilon^2, \dots, \epsilon^n$ is called the dual basis of the
dual space, if $\epsilon^a(e_b) = \delta^a_b$.
Example: Consider polynomials $P$ of degree at most 3. Choose $e_0, e_1, e_2, e_3 \in P$ such that $e_0(x) = 1$, $e_1(x) = x$, $e_2(x) = x^2$,
and $e_3(x) = x^3$. Then, it can be easily verified that the dual basis is $\epsilon^a = \frac{1}{a!} \left. \partial^a \right|_{x=0}$.
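The claim about the dual basis of the monomials can be verified directly: differentiate $a$ times, evaluate at $x = 0$, divide by $a!$. The sketch below uses a hypothetical coefficient-list representation `[p_0, p_1, p_2, p_3]` and function names chosen for this illustration.

```python
from fractions import Fraction
from math import factorial


def derivative(p):
    """p -> p' on coefficient lists, padded to keep the length fixed."""
    return [n * p[n] for n in range(1, len(p))] + [0]


def epsilon(a, p):
    """epsilon^a(p) = (1/a!) * (a-th derivative of p) evaluated at x = 0."""
    for _ in range(a):
        p = derivative(p)
    return Fraction(p[0], factorial(a))   # the value at x = 0 is the constant term


# e_0(x) = 1, e_1(x) = x, e_2(x) = x^2, e_3(x) = x^3
basis = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]

table = [[epsilon(a, e_b) for e_b in basis] for a in range(4)]
for row in table:
    print([int(entry) for entry in row])
```

The printed table is the $4 \times 4$ identity matrix, i.e. the Kronecker delta required by Definition 16. Equivalently, $\epsilon^a$ simply reads off the coefficient $p_a$.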

3.9 Components of Tensors

Definition 17. Let $T$ be an $(r, s)$-tensor over a $d$-dimensional (finite) vector space $(V, +, \cdot)$. Then, with respect
to some basis $\{e_1, \dots, e_d\}$ and the dual basis $\{\epsilon^1, \dots, \epsilon^d\}$, define $d^{\,r+s}$ real numbers
$$T^{i_1 \dots i_r}{}_{j_1 \dots j_s} := T(\epsilon^{i_1}, \dots, \epsilon^{i_r}, e_{j_1}, \dots, e_{j_s})$$
such that the indices $i_1, \dots, i_r, j_1, \dots, j_s$ take all possible values in the set $\{1, \dots, d\}$. These numbers $T^{i_1 \dots i_r}{}_{j_1 \dots j_s}$
are called the components of the tensor $T$ w.r.t. the chosen basis.
This is useful because, knowing the components (and the basis w.r.t. which these components have been chosen),
one can reconstruct the entire tensor.
Example: If $T$ is a $(1,1)$-tensor, then $T^i{}_j := T(\epsilon^i, e_j)$. Then
$$T(\omega, v) = T\left( \sum_{i=1}^{d} \omega_i \epsilon^i, \sum_{j=1}^{d} v^j e_j \right) = \sum_{i=1}^{d} \sum_{j=1}^{d} \omega_i v^j\, T(\epsilon^i, e_j) = \sum_{i=1}^{d} \sum_{j=1}^{d} \omega_i v^j\, T^i{}_j =: \omega_i v^j\, T^i{}_j$$
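The reconstruction of a $(1,1)$-tensor from its components can be illustrated on $V = \mathbb{R}^2$ with the standard basis. The tensor below, $T(\omega, v) := \omega(Av)$ for a fixed matrix $A$, is an assumption chosen for the illustration and does not come from the lectures.

```python
d = 2
A = [[1, 2], [3, 4]]   # hypothetical matrix defining the example tensor


def T(omega, v):
    """A (1,1)-tensor on R^2: apply the covector omega to the vector A v."""
    w = [sum(A[i][j] * v[j] for j in range(d)) for i in range(d)]   # w = A v
    return sum(omega[i] * w[i] for i in range(d))                   # omega(w)


e = [[1, 0], [0, 1]]     # basis vectors e_j
eps = [[1, 0], [0, 1]]   # dual basis covectors eps^i, so that eps^i(e_j) = delta^i_j

# components T^i_j := T(eps^i, e_j): d**(1+1) = 4 numbers
comp = [[T(eps[i], e[j]) for j in range(d)] for i in range(d)]

omega, v = [5, -1], [2, 7]
reconstructed = sum(omega[i] * v[j] * comp[i][j] for i in range(d) for j in range(d))

print(comp)                        # [[1, 2], [3, 4]]
print(T(omega, v), reconstructed)  # 46 46
```

The four components suffice to evaluate $T$ on any pair $(\omega, v)$, which is the content of the double sum in the example above.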

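4 Differential Manifolds
Motivation: So far we have dealt with topological manifolds which allow us to talk about continuity. But
to talk about smoothness of curves on manifolds, or velocities along these curves, we need something like
differentiability. Does the structure of a topological manifold allow us to talk about differentiability? The
answer is a resounding no.
So this lecture is about figuring out what structure we need to add on a topological manifold $M$ to start
talking about differentiability of curves ($\mathbb{R} \to M$) on a manifold, or differentiability of functions ($M \to \mathbb{R}$)
on a manifold, or differentiability of maps ($M \to N$) from one manifold $M$ to another manifold $N$.

[Diagram: a curve $\gamma : \mathbb{R} \to U \subseteq M$ and its chart representation $x \circ \gamma : \mathbb{R} \to x(U) \subseteq \mathbb{R}^d$.]

Idea: try to lift the undergraduate notion of differentiability of a curve on $\mathbb{R}^d$ to a notion of differentiability
of a curve on $M$.
Problem: Can this be well-defined under a change of chart?

[Diagram: two charts $(U, x)$ and $(V, y)$ with $U \cap V \neq \emptyset$, a curve $\gamma : \mathbb{R} \to U \cap V$, the representations $x \circ \gamma : \mathbb{R} \to x(U \cap V) \subseteq \mathbb{R}^d$ and $y \circ \gamma : \mathbb{R} \to y(U \cap V) \subseteq \mathbb{R}^d$, and the chart transition map $y \circ x^{-1}$.]

Suppose $x \circ \gamma$ is undergraduate differentiable (as a map $\mathbb{R} \to \mathbb{R}^d$). Then
$$y \circ \gamma = y \circ (x^{-1} \circ x) \circ \gamma = \underbrace{(y \circ x^{-1})}_{\mathbb{R}^d \to \mathbb{R}^d,\ \text{continuous}} \circ \underbrace{(x \circ \gamma)}_{\mathbb{R} \to \mathbb{R}^d,\ \text{undergrad differentiable}}$$
so $y \circ \gamma$ is maybe only continuous, but not undergraduate differentiable.

At first sight, the strategy does not work out.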

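4.1 Compatible charts

In section 1, we used any imaginable charts on the top. mfd. $(M, \mathcal{O})$.
To emphasize this, we may say that we took $U$ and $V$ from the maximal atlas $\mathcal{A}$ of $(M, \mathcal{O})$.
Definition 18. Two charts $(U, x)$ and $(V, y)$ of a top. mfd. are called $\ell$-compatible if either
(a) $U \cap V = \emptyset$, or
(b) $U \cap V \neq \emptyset$ and the chart transition maps
$y \circ x^{-1} : x(U \cap V) \subseteq \mathbb{R}^d \to y(U \cap V) \subseteq \mathbb{R}^d$
$x \circ y^{-1} : y(U \cap V) \subseteq \mathbb{R}^d \to x(U \cap V) \subseteq \mathbb{R}^d$
have the undergraduate $\ell$ property.
EY : 20151109 e.g. since the transition maps are maps $\mathbb{R}^d \to \mathbb{R}^d$, we can use an undergraduate $\ell$ property such as continuity or differentiability.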
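Both transition maps matter in Definition 18. A standard example, which is not taken from the lectures and is used here only as an assumed illustration, is $M = \mathbb{R}$ with the two global charts $x(p) = p$ and $y(p) = p^3$. Then $y \circ x^{-1}(t) = t^3$ is smooth, but $x \circ y^{-1}(s) = s^{1/3}$ is not differentiable at $0$. The sketch estimates the derivative at $0$ by difference quotients.

```python
import math


def y_after_x_inv(t):
    """Chart transition map y ∘ x^{-1} for x(p) = p, y(p) = p^3."""
    return t ** 3


def x_after_y_inv(s):
    """Chart transition map x ∘ y^{-1}: the real cube root."""
    return math.copysign(abs(s) ** (1.0 / 3.0), s)


def diff_quotient(g, a, h):
    """Forward difference quotient (g(a + h) - g(a)) / h."""
    return (g(a + h) - g(a)) / h


for h in (1e-2, 1e-4, 1e-6):
    print(h, diff_quotient(y_after_x_inv, 0.0, h), diff_quotient(x_after_y_inv, 0.0, h))
```

The first quotient tends to $0$, while the second grows like $h^{-2/3}$ and has no limit. Under these assumptions the two charts are $C^0$-compatible but not $C^1$-compatible, so they cannot both belong to the same differentiable atlas.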
Philosophy:
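Definition 19. An atlas $\mathcal{A}_\ell$ is an $\ell$-compatible atlas if any two charts in $\mathcal{A}_\ell$ are $\ell$-compatible.
Definition 20. An $\ell$-manifold is a triple $(M, \mathcal{O}, \mathcal{A}_\ell)$, where $(M, \mathcal{O})$ is a top. mfd. and $\mathcal{A}_\ell \subseteq \mathcal{A}_{maximal}$.

$\ell$ : undergraduate $\ell$ property
$C^0$ : $C^0(\mathbb{R}^d \to \mathbb{R}^d)$ = continuous maps w.r.t. $\mathcal{O}$
$C^1$ : $C^1(\mathbb{R}^d \to \mathbb{R}^d)$ = differentiable (once), and the derivative is continuous
$C^k$ : $k$-times continuously differentiable
$D^k$ : $k$-times differentiable
$\vdots$
$C^\infty$ : $C^\infty(\mathbb{R}^d \to \mathbb{R}^d)$
$C^\omega$ : $\exists$ multi-dim. Taylor exp.
$\mathbb{C}^\infty$ : satisfy Cauchy-Riemann equations, pair-wise
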

EY : 20151109 Schuller says: $C^k$ is easy to work with because you can judge $k$-times continuous differentiability from
the existence of all partial derivatives and their continuity. There are examples of maps whose partial derivatives
exist but which are not $D^k$, i.e., $k$-times differentiable.
Theorem 4.1 (Whitney). Any $C^{k \geq 1}$-atlas $\mathcal{A}_{C^{k \geq 1}}$ of a topological manifold contains a $C^\infty$-atlas.
Thus we may w.l.o.g. always consider $C^\infty$-manifolds, i.e., smooth manifolds, unless we wish to define Taylor
expandability/complex differentiability . . .
EY : 20151109 Hassler Whitney (footnote 1)
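Definition 21. A smooth manifold is a triple $(M, \mathcal{O}, \mathcal{A})$, where $(M, \mathcal{O})$ is a top. mfd. and $\mathcal{A}$ is a $C^\infty$-atlas.

[Diagram: a curve $\gamma : \mathbb{R} \to M$, the chart map $x$ into $\mathbb{R}^d$, and the chart representation $x \circ \gamma : \mathbb{R} \to \mathbb{R}^d$.]

EY : 20151109 Schuller was explaining that the trajectory $\gamma$ is real in $M$; the map used to obtain
coordinates is $x \circ \gamma$.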

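4.2 Diffeomorphisms

Consider a map $\phi : M \to N$.
If $M$, $N$ are naked sets, the structure preserving maps are the bijections (invertible maps).
e.g. $\{1, 2, 3\} \to \{a, b\}$ admits no bijection, since the two sets have different numbers of elements.
Definition 22. $M \cong_{set} N$ ((set-theoretically) isomorphic) if $\exists$ bijection $\phi : M \to N$.
Examples. $\mathbb{N} \cong_{set} \mathbb{Z}$
$\mathbb{N} \cong_{set} \mathbb{Q}$ (EY : 20151109 Schuller says from diagonal counting)
$\mathbb{N} \not\cong_{set} \mathbb{R}$
Footnote 1: http://mathoverflow.net/questions/8789/can-every-manifold-be-given-an-analytic-structure

Now $(M, \mathcal{O}_M) \cong_{top} (N, \mathcal{O}_N)$ ((topologically) isomorphic = homeomorphic) if $\exists$ bijection $\phi : M \to N$
such that $\phi$, $\phi^{-1}$ are continuous.
$(V, +_v, \cdot_v) \cong_{vec} (W, +_w, \cdot_w)$ (EY : 20151109 vector space isomorphism) if
$\exists$ bijection $\phi : V \to W$ that is linear.
Finally:
Definition 23. Two $C^\infty$-manifolds
$(M, \mathcal{O}_M, \mathcal{A}_M)$ and $(N, \mathcal{O}_N, \mathcal{A}_N)$ are said to be diffeomorphic if $\exists$ bijection $\phi : M \to N$ s.t.
$\phi : M \to N$
$\phi^{-1} : N \to M$
are both $C^\infty$-maps.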
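[Diagram: the map $\phi : M \supseteq U \to V \subseteq N$ with charts $x$, $\tilde{x}$ on $M$ (into $\mathbb{R}^d$) and $y$, $\tilde{y}$ on $N$ (into $\mathbb{R}^e$); $\phi$ is $C^\infty$ if the chart representation $y \circ \phi \circ x^{-1} : \mathbb{R}^d \to \mathbb{R}^e$ is undergraduate $C^\infty$, and likewise for $\tilde{y} \circ \phi \circ \tilde{x}^{-1}$.]
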

Theorem 4.2. # = number of $C^\infty$-manifolds one can make out of a given $C^0$-manifold (if any), up to
diffeomorphisms.

$\dim M$ : # : reason
1 : 1 : Moise-Radó theorems
2 : 1 : Moise-Radó theorems
3 : 1 : Moise-Radó theorems
4 : uncountably infinitely many :
5 : finite : surgery theory
6 : finite : surgery theory
$\vdots$ : finite : surgery theory

EY : 20151109 cf. http://math.stackexchange.com/questions/833766/closed-4-manifolds-with-uncountably-many and The wild world of 4-manifolds


5 Tangent Spaces

Lead question: What is the velocity of a curve $\gamma : \mathbb{R} \to M$ at the point $p$ of the curve in $M$?

5.1 Velocities

Definition 24. Let $(M, \mathcal{O}, \mathcal{A})$ be a smooth manifold. Let there be a curve $\gamma : \mathbb{R} \to M$, which is at least $C^1$.
Suppose $\gamma(\lambda_0) = p$. The velocity of $\gamma$ at the point $p$ of the curve is the linear map
$$v_{\gamma,p} : C^\infty(M) \xrightarrow{\sim} \mathbb{R} \text{ with } f \mapsto v_{\gamma,p}(f) := (f \circ \gamma)'(\lambda_0) \tag{5.1}$$
where $C^\infty(M) := \{f : M \to \mathbb{R} \mid f \text{ is a smooth function}\}$ equipped with
$(f \oplus g)(p) := f(p) + g(p)$ and $(\lambda \odot g)(p) := \lambda \cdot g(p)$ is a vector space.

Figure 5.1: The composition $f \circ \gamma : \mathbb{R} \to M \to \mathbb{R}$. Intuition: If the first $\mathbb{R}$ is thought of as time,
and $f$ as temperature, then $f \circ \gamma$ relates time and temperature and
$(f \circ \gamma)'$ is the rate of change of temperature as you run around the
curve.

In the past, a vector acted on a function as
$$\underbrace{v^i}_{\text{vector in past}} (\partial_i f) = \underbrace{(v^i \partial_i)}_{\text{vector as map}} f$$

In an imprecise way, we could say that we want vectors to survive as the directional derivatives they induce. This
is a very slight shift of perspective which is extremely powerful and leads to the idea of tangent space in differential
geometry.
Terminology: If $X$ is a vector seen as a map, then $X$ acting on a function $f$, i.e. $Xf$, is called the directional
derivative of $f$ in the $X$ direction.

5.2 Tangent vector space

Definition 25. For each point $p \in M$, the tangent space to $M$ at the point $p$ is the set
$$T_p M := \{v_{\gamma,p} \mid \text{for all smooth curves } \gamma \text{ through } p\} \tag{5.2}$$

Figure 5.2: A pictorial representation of the tangent space $T_x M$ of a single
point, $x$, on a manifold. A vector in this $T_x M$ can represent a possible
velocity at $x$. After moving in that direction to a nearby point, one's velocity
would then be given by a vector in the tangent space of that nearby point,
a different tangent space, not shown. By Alexwright at English Wikipedia
- Transferred from en.wikipedia to Commons by Ylebru., Public Domain,
https://commons.wikimedia.org/w/index.php?curid=3941393

Figure 5.3: The tangent space $T_x M$ and a tangent vector $v \in T_x M$,
along a curve travelling through $x \in M$. By derivative work: McSush
(talk) Tangentialvektor.png: TN. The original uploader was TN at German
Wikipedia - Tangentialvektor.png, Public Domain,
https://commons.wikimedia.org/w/index.php?curid=4821938
Caution: Although Fig. 5.2 and 5.3 refer to an ambient space in which $M$ is embedded, the tangent space
has been defined intrinsically. There is a velocity corresponding to each curve along a different path in $M$ passing
through $p$. Velocities along two different curves could be the same, while curves along the same path but with
different parameter speeds would yield different velocities.
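Theorem 5.1. $(T_pM, \oplus, \odot)$ is a vector space with
$$\oplus : T_pM \times T_pM \to \mathrm{Hom}(C^\infty(M), \mathbb{R}), \qquad (v_{\gamma,p} \oplus v_{\delta,p})(f) := v_{\gamma,p}(f) +_{\mathbb{R}} v_{\delta,p}(f), \quad f \in C^\infty(M)$$
$$\odot : \mathbb{R} \times T_pM \to \mathrm{Hom}(C^\infty(M), \mathbb{R}), \qquad (\alpha \odot v_{\gamma,p})(f) := \alpha \cdot_{\mathbb{R}} v_{\gamma,p}(f)$$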
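Proof. The various conditions that must be satisfied by a vector space are trivially satisfied. It remains to be shown that
i) For the product, $\exists$ a curve $\tau$ : $\alpha \odot v_{\gamma,p} = v_{\tau,p}$
ii) For the sum, $\exists$ a curve $\sigma$ : $v_{\gamma,p} \oplus v_{\delta,p} = v_{\sigma,p}$
Product: Let $\tau : \mathbb{R} \to M$ with $\lambda \mapsto \tau(\lambda) := \gamma(\alpha\lambda + \lambda_0) = (\gamma \circ \mu_\alpha)(\lambda)$ where $\mu_\alpha : \mathbb{R} \to \mathbb{R}$ with $r \mapsto \alpha r + \lambda_0$.
Then $\tau(0) = \gamma(\lambda_0) = p$, and
$$v_{\tau,p}(f) = (f \circ \tau)'(0) = (f \circ \gamma \circ \mu_\alpha)'(0) = \mu_\alpha'(0) \cdot (f \circ \gamma)'(\mu_\alpha(0)) = \alpha \cdot (f \circ \gamma)'(\lambda_0) = \alpha \cdot v_{\gamma,p}(f)$$
Sum: Choose a chart $(U, x)$ and $p \in U$. (If the proof depends on the choice of a chart, alarm bells should ring. But we shall see that the result is finally independent of the chart.)
Let $p = \gamma(\lambda_0) = \delta(\lambda_1)$.
Now define $\sigma_x : \mathbb{R} \to M$ with
$$\lambda \mapsto \sigma_x(\lambda) := x^{-1}\big(\underbrace{(x \circ \gamma)(\lambda_0 + \lambda) + (x \circ \delta)(\lambda_1 + \lambda) - (x \circ \gamma)(\lambda_0)}_{\mathbb{R} \to \mathbb{R}^d}\big).$$
Then $\sigma_x(0) = x^{-1}\big((x \circ \gamma)(\lambda_0) + (x \circ \delta)(\lambda_1) - (x \circ \gamma)(\lambda_0)\big) = \delta(\lambda_1) = p$.
Now, for all $f \in C^\infty(M)$,
$$\begin{aligned}
v_{\sigma_x,p}(f) &:= (f \circ \sigma_x)'(0) \\
&= \big(\underbrace{(f \circ x^{-1})}_{\mathbb{R}^d \to \mathbb{R}} \circ \underbrace{(x \circ \sigma_x)}_{\mathbb{R} \to \mathbb{R}^d}\big)'(0) \\
&= \underbrace{(x^i \circ \sigma_x)'(0)}_{(x^i \circ \gamma)'(\lambda_0) + (x^i \circ \delta)'(\lambda_1)} \cdot \big(\partial_i (f \circ x^{-1})\big)\big(x(\underbrace{\sigma_x(0)}_{p})\big) \\
&= (x^i \circ \gamma)'(\lambda_0)\,\big(\partial_i (f \circ x^{-1})\big)(x(p)) + (x^i \circ \delta)'(\lambda_1)\,\big(\partial_i (f \circ x^{-1})\big)(x(p)) \\
&= (f \circ \gamma)'(\lambda_0) + (f \circ \delta)'(\lambda_1) \\
&= v_{\gamma,p}(f) + v_{\delta,p}(f)
\end{aligned}$$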

picture: (cf. https://youtu.be/pepU_7NJSGM?t=39m5s)


If we push $\gamma$ and $\delta$ to one chart, add them there, and then bring the sum back to $M$, we get a curve which differs from the curve we would get if we used another chart. But it turns out that, irrespective of the chart selected, we get the same tangent/velocity. Conclusion: Adding trajectories is chart dependent; hence, bad. Adding velocities is good because, whatever the chart, they yield the same derivative at the point of intersection. Of course, you cannot add two curves directly as $(\gamma \oplus \delta)(\lambda) := \gamma(\lambda) +_M \delta(\lambda)$ because there is no addition $+_M$ in $M$. Defining $+$ through charts gives chart-dependent results, which are therefore not real.
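The chart dependence of the sum curve, and the chart independence of its velocity, can be checked numerically. The following is only a sketch under assumptions that are not part of the lecture: $M = \mathbb{R}^2$, the identity chart `x`, a made-up nonlinear chart `y`, two made-up curves `gamma` and `delta` through $p = (1, 2)$, a made-up test function `f`, and a central finite difference in place of the exact derivative.

```python
import math

# Two charts on M = R^2 (both hypothetical choices for this illustration).
def x(p):            # identity chart
    return p

def x_inv(c):
    return c

def y(p):            # a nonlinear chart: (a, b) -> (a, b + a^2)
    a, b = p
    return (a, b + a * a)

def y_inv(c):
    u, v = c
    return (u, v - u * u)

# Two curves through p = (1, 2), both with parameter value 0 at p.
def gamma(lam):
    return (1.0 + lam, 2.0 + lam * lam)

def delta(lam):
    return (1.0 + 2.0 * lam, 2.0 - lam)

def f(p):            # a smooth test function on M
    a, b = p
    return a * b + math.sin(b)

def sum_curve(chart, chart_inv):
    """sigma(lam) = chart^{-1}(chart(gamma(lam)) + chart(delta(lam)) - chart(p))."""
    base = chart(gamma(0.0))
    def sigma(lam):
        g, d = chart(gamma(lam)), chart(delta(lam))
        return chart_inv(tuple(gi + di - bi for gi, di, bi in zip(g, d, base)))
    return sigma

def velocity(curve, func, h=1e-5):
    """Central-difference estimate of (func o curve)'(0)."""
    return (func(curve(h)) - func(curve(-h))) / (2.0 * h)

sigma_x = sum_curve(x, x_inv)
sigma_y = sum_curve(y, y_inv)

print(sigma_x(0.5), sigma_y(0.5))                  # the two sum curves differ
print(velocity(sigma_x, f), velocity(sigma_y, f))  # their velocities at p agree
print(velocity(gamma, f) + velocity(delta, f))     # and equal the sum of velocities
```

Away from the parameter value 0 the two sum curves are different points of $M$, while the three printed derivative values coincide up to finite-difference error.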

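5.3 Components of a vector w.r.t. a chart

Let $(U, x) \in \mathcal{A}_{smooth}$, $\gamma : \mathbb{R} \to U$ and $\gamma(0) = p$. Then, for all $f \in C^\infty(M)$, $f : M \to \mathbb{R}$,
$$\begin{aligned}
v_{\gamma,p}(f) &:= (f \circ \gamma)'(0) \\
&= \big(\underbrace{(f \circ x^{-1})}_{\mathbb{R}^d \to \mathbb{R}} \circ \underbrace{(x \circ \gamma)}_{\mathbb{R} \to \mathbb{R}^d}\big)'(0) \\
&= \underbrace{(x^i \circ \gamma)'(0)}_{=:\ \dot\gamma_x^i(0)} \cdot \underbrace{\big(\partial_i (f \circ x^{-1})\big)(x(p))}_{=:\ \left(\frac{\partial f}{\partial x^i}\right)_p} \\
&= \dot\gamma_x^i(0) \left(\frac{\partial}{\partial x^i}\right)_p f
\end{aligned}$$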

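Definition 26. For velocity $v_{\gamma,p}$, as a map under use of a chart $(U, x)$,
$$v_{\gamma,p} = \dot\gamma_x^i(0) \left(\frac{\partial}{\partial x^i}\right)_p \tag{5.3}$$
where
$$\dot\gamma_x^i = (x^i \circ \gamma)' \tag{5.4}$$
are the components of the velocity $v_{\gamma,p}$ and
$$\left(\frac{\partial}{\partial x^i}\right)_p = \partial_i\big(\,\cdot\, \circ x^{-1}\big)(x(p)) \tag{5.5}$$
which eat a function, form a basis of $T_pM$ w.r.t. which the components of the velocity need to be understood.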
Note: The components of a vector are always w.r.t. a chart. In M , there is just the vector, no components.
Picture: https://youtu.be/pepU_7NJSGM?t=1h16s
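Theorem 5.2. For a chart $(U, x)$,
$$\left(\frac{\partial x^i}{\partial x^j}\right)_p = \delta^i_j \tag{5.6}$$
Proof.
$$\left(\frac{\partial x^i}{\partial x^j}\right)_p = \partial_j (x^i \circ x^{-1})(x(p)) = \delta^i_j$$
since $x^i \circ x^{-1} : \mathbb{R}^d \to \mathbb{R}$ s.t. $(\alpha^1, \dots, \alpha^d) \mapsto \alpha^i$.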

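5.4 Chart-induced basis

Definition 27. If $(U, x) \in \mathcal{A}_{smooth}$, then $\left(\frac{\partial}{\partial x^1}\right)_p, \dots, \left(\frac{\partial}{\partial x^d}\right)_p \in T_pU \subseteq T_pM$ constitute a chart-induced basis of $T_pU$.

Proof. We have already shown that any vector in $T_pU$ can be expressed in terms of the $\left(\frac{\partial}{\partial x^i}\right)_p$. It remains to be shown that they are linearly independent. That is, we require $\lambda^i \left(\frac{\partial}{\partial x^i}\right)_p = 0 \implies \lambda^i = 0$ for all $i = 1, \dots, d$. Applying both sides to $x^j$ (possible since $x^j : U \to \mathbb{R}$ is differentiable), we get, for all $j = 1, \dots, d$,
$$\begin{aligned}
0 &= \lambda^i \left(\frac{\partial}{\partial x^i}\right)_p (x^j) \\
&= \lambda^i\, \partial_i (x^j \circ x^{-1})(x(p)) && \text{by Eq. 5.5} \\
&= \lambda^i \delta^j_i && \text{by Theorem 5.2} \\
&= \lambda^j
\end{aligned}$$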

Corollary 1. $\dim T_pM = d = \dim M$.

This follows from the fact that $d$ vectors are needed to express any vector in $T_pM$, and these $d$ vectors arise from the $d$ coordinates of the chart, which shows that $M$ has $d$ dimensions.

Terminology: $X \in T_pM \implies \exists\, \gamma : \mathbb{R} \to M : X = v_{\gamma,p}$ and $\exists\, \underbrace{X^1, \dots, X^d}_{\in \mathbb{R}} : X = X^i \left(\frac{\partial}{\partial x^i}\right)_p$. The $X^i$ are called the components of the vector $X$ w.r.t. the chart-induced basis.

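5.5 Change of vector components under a change of chart

A vector does not change under a change of chart. It is the vector components that transform under a change of chart.
Let $(U, x)$ and $(V, y)$ be overlapping charts and $p \in U \cap V$. Let $X \in T_pM$. Then $X$ can be expanded in terms of the chart-induced bases of the two charts as follows:
$$X^i_{(y)} \left(\frac{\partial}{\partial y^i}\right)_p \underbrace{=}_{(V,y)} X \underbrace{=}_{(U,x)} X^i_{(x)} \left(\frac{\partial}{\partial x^i}\right)_p \tag{5.7}$$
Now,
$$\begin{aligned}
\left(\frac{\partial}{\partial x^i}\right)_p f &= \partial_i (f \circ x^{-1})(x(p)) \\
&= \partial_i \big(\underbrace{(f \circ y^{-1})}_{\mathbb{R}^d \to \mathbb{R}} \circ \underbrace{(y \circ x^{-1})}_{\mathbb{R}^d \to \mathbb{R}^d}\big)(x(p)) \\
&= \big(\partial_i (y \circ x^{-1})^j\big)(x(p)) \cdot \big(\partial_j (f \circ y^{-1})\big)(y(p)) \\
&= \big(\partial_i (y^j \circ x^{-1})\big)(x(p)) \cdot \big(\partial_j (f \circ y^{-1})\big)(y(p)) \\
&= \left(\frac{\partial y^j}{\partial x^i}\right)_p \left(\frac{\partial}{\partial y^j}\right)_p f
\end{aligned}$$
$$\implies \left(\frac{\partial}{\partial x^i}\right)_p = \left(\frac{\partial y^j}{\partial x^i}\right)_p \left(\frac{\partial}{\partial y^j}\right)_p \tag{5.8}$$
Using Eq. 5.7 and Eq. 5.8,
$$X^i_{(x)} \left(\frac{\partial y^j}{\partial x^i}\right)_p \left(\frac{\partial}{\partial y^j}\right)_p = X^j_{(y)} \left(\frac{\partial}{\partial y^j}\right)_p$$
$$\implies X^j_{(y)} = \left(\frac{\partial y^j}{\partial x^i}\right)_p X^i_{(x)} \tag{5.9}$$

5.6 Cotangent spaces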

Since $T_pM$ is a vector space, it is straightforward to define the cotangent space as follows.

Definition 28. For the tangent space $T_pM$ at $p \in M$, the cotangent space is defined as
$$(T_pM)^* := \{\varphi : T_pM \xrightarrow{\sim} \mathbb{R}\} \tag{5.10}$$

Definition 29. If $f \in C^\infty(M)$, then the gradient of $f$ at the point $p \in M$ is defined as
$$(df)_p : T_pM \xrightarrow{\sim} \mathbb{R}, \qquad X \mapsto (df)_p(X) := Xf \tag{5.11}$$
i.e. $(df)_p \in T_p^*M$.
$(df)_p$ is a $(0, 1)$-tensor over the underlying vector space $T_pM$. We define the components of the gradient the same way as we define the components of a tensor (see Section 3.9).

Definition 30. The components of the gradient w.r.t. the chart-induced basis of $(U, x)$ are defined as
$$\big((df)_p\big)_j := (df)_p\left(\left(\frac{\partial}{\partial x^j}\right)_p\right) = \left(\frac{\partial f}{\partial x^j}\right)_p = \partial_j (f \circ x^{-1})(x(p)) \tag{5.12}$$

Theorem 5.3. A chart $(U, x) \implies x^i : U \to \mathbb{R}$ are smooth functions. Then $(dx^1)_p, (dx^2)_p, \dots, (dx^d)_p$ form a basis of $T_p^*M$.
Proof. In fact, the $(dx^i)_p$ form a dual basis since
$$(dx^a)_p\left(\left(\frac{\partial}{\partial x^b}\right)_p\right) = \left(\frac{\partial x^a}{\partial x^b}\right)_p = \delta^a_b \quad \text{(using Theorem 5.2)} \tag{5.13}$$

5.7 Change of components of a covector under a change of chart

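A covector does not change under a change of chart. It is the covector components that transform under a change of chart.
Let $(U, x)$ and $(V, y)$ be overlapping charts and $p \in U \cap V$. Let $\omega \in T_p^*M$. Then $\omega$ can be expanded in terms of the chart-induced bases of the two charts as follows:
$$\omega_{(y)j} (dy^j)_p \underbrace{=}_{(V,y)} \omega \underbrace{=}_{(U,x)} \omega_{(x)i} (dx^i)_p \tag{5.14}$$
Now,
$$\begin{aligned}
\omega_{(y)j} (dy^j)_p &= \omega_{(x)i} (dx^i)_p && \text{by Eq. 5.14} \\
\implies \omega_{(y)j} (dy^j)_p\left(\left(\frac{\partial}{\partial y^k}\right)_p\right) &= \omega_{(x)i} (dx^i)_p\left(\left(\frac{\partial}{\partial y^k}\right)_p\right) \\
\implies \omega_{(y)j} (dy^j)_p\left(\left(\frac{\partial}{\partial y^k}\right)_p\right) &= \omega_{(x)i} (dx^i)_p\left(\left(\frac{\partial x^q}{\partial y^k}\right)_p \left(\frac{\partial}{\partial x^q}\right)_p\right) && \text{by Eq. 5.8} \\
\implies \omega_{(y)j} \left(\frac{\partial y^j}{\partial y^k}\right)_p &= \omega_{(x)i} \left(\frac{\partial x^q}{\partial y^k}\right)_p \left(\frac{\partial x^i}{\partial x^q}\right)_p && \text{by Eq. 5.11} \\
\implies \omega_{(y)j}\, \delta^j_k &= \omega_{(x)i} \left(\frac{\partial x^q}{\partial y^k}\right)_p \delta^i_q && \text{by Theorem 5.2} \\
\implies \omega_{(y)k} &= \omega_{(x)i} \left(\frac{\partial x^i}{\partial y^k}\right)_p
\end{aligned}$$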

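Or, with a change of indices,
$$\omega_{(y)i} = \left(\frac{\partial x^j}{\partial y^i}\right)_p \omega_{(x)j} \tag{5.15}$$
Inserting this into the expansion of $\omega$ in the two charts,
$$\omega_{(x)j} (dx^j)_p = \omega_{(y)i} (dy^i)_p = \left(\frac{\partial x^j}{\partial y^i}\right)_p \omega_{(x)j} (dy^i)_p,$$
and since this holds for arbitrary components $\omega_{(x)j}$,
$$(dx^j)_p = \left(\frac{\partial x^j}{\partial y^i}\right)_p (dy^i)_p \tag{5.16}$$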
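The transformation rules (5.9) and (5.15) can be tried out on a concrete pair of charts. The sketch below assumes, purely for illustration, that $x$ is the Cartesian chart and $y = (r, \varphi)$ the polar chart on a region of the plane away from the origin; the point and the component values are made up. It checks that the number $\omega(X)$ comes out the same in both charts, as it must for a chart-independent covector acting on a chart-independent vector.

```python
import math

def jacobian_y_wrt_x(a, b):
    """Matrix J[j][i] = dy^j/dx^i for y = (r, phi), x = (a, b) Cartesian."""
    r2 = a * a + b * b
    r = math.sqrt(r2)
    return [[a / r, b / r],
            [-b / r2, a / r2]]

def jacobian_x_wrt_y(r, phi):
    """Matrix K[j][i] = dx^j/dy^i for x = (r cos phi, r sin phi)."""
    return [[math.cos(phi), -r * math.sin(phi)],
            [math.sin(phi), r * math.cos(phi)]]

def vector_to_y(X_x, J):
    # Eq. (5.9): X_(y)^j = (dy^j/dx^i) X_(x)^i
    return [sum(J[j][i] * X_x[i] for i in range(2)) for j in range(2)]

def covector_to_y(w_x, K):
    # Eq. (5.15): w_(y)i = (dx^j/dy^i) w_(x)j
    return [sum(K[j][i] * w_x[j] for j in range(2)) for i in range(2)]

a, b = 1.0, 2.0                       # the point p in Cartesian coordinates
r, phi = math.hypot(a, b), math.atan2(b, a)

X_x = [3.0, -1.0]                     # components of a vector X in the x chart
w_x = [0.5, 4.0]                      # components of a covector w in the x chart

X_y = vector_to_y(X_x, jacobian_y_wrt_x(a, b))
w_y = covector_to_y(w_x, jacobian_x_wrt_y(r, phi))

pairing_x = sum(wi * Xi for wi, Xi in zip(w_x, X_x))
pairing_y = sum(wi * Xi for wi, Xi in zip(w_y, X_y))
print(X_y, w_y)
print(pairing_x, pairing_y)           # the number w(X) is chart independent
```

Vector components pick up $\partial y / \partial x$ while covector components pick up the inverse matrix $\partial x / \partial y$, which is why the two Jacobians cancel in the pairing.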

6 Fields

So far, we have focussed on a single tangent space, a vector or covector in it, and a basis once a chart is chosen. As physicists, we are interested in things such as vector fields, where there is a vector at every point of a manifold. The proper way to deal with this technically is the theory of bundles.

6.1 Bundles

Definition 31. A bundle is a triple $E \xrightarrow{\pi} M$, where
$E$ is a smooth manifold, called the total space,
$M$ is a smooth manifold, called the base space, and
$\pi$ is a smooth map (surjective), called the projection map.

Definition 32. Let $E \xrightarrow{\pi} M$ be a bundle and $p \in M$. Then, fibre over $p$ := $\mathrm{preim}_\pi(\{p\})$.

Definition 33. A section $\sigma$ of a bundle $E \xrightarrow{\pi} M$ is a map $\sigma : M \to E$ such that $\pi \circ \sigma = \mathrm{id}_M$.

Example: $E$ is a cylinder, $M$ a circle and $\pi$ maps vertical lines on the cylinder to the point of intersection of this line with the circle.
Example: If the fibre of $p \in M$ is a tangent space, the section would pick one vector from the tangent space.
Aside: In quantum mechanics, $\psi : M \to \mathbb{C}$ is called a wavefunction, but it is actually a section which selects one value from $\mathbb{C}$ for each $p \in M$.

6.2 Tangent bundle of smooth manifold

For this entire subsection, let $(M, \mathcal{O}, \mathcal{A})$ be a smooth manifold and let $d := \dim M$.
Define the set
$$TM := \dot{\bigcup_{p \in M}} T_pM \tag{6.1}$$
Now define a surjective map $\pi$ as follows:
$$\pi : TM \to M, \qquad X \mapsto \pi(X) := p \in M \text{ such that } X \in T_pM \tag{6.2}$$
Situation: $\underbrace{TM}_{\text{set}} \xrightarrow[\text{surjective map}]{\pi} \underbrace{M}_{\text{smooth manifold}}$

For a bundle, $TM$ should be a smooth manifold and $\pi$ a smooth map. Let us construct a topology on $TM$ that is the coarsest topology such that $\pi$ is just continuous (the initial topology with respect to $\pi$). Define
$$\mathcal{O}_{TM} := \{\mathrm{preim}_\pi(U) \mid U \in \mathcal{O}\} \tag{6.3}$$
It can be shown that $(TM, \mathcal{O}_{TM})$ is a topological space. But we need a smooth atlas.


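Construction of a $C^\infty$-atlas on $TM$ from the $C^\infty$-atlas $\mathcal{A}$ on $M$
Define $\mathcal{A}_{TM} := \{(TU, \xi_x) \mid (U, x) \in \mathcal{A}\}$ where
$$\xi_x : TU \to \mathbb{R}^{2d}, \qquad X \mapsto \big(\underbrace{(x^1 \circ \pi)(X), \dots, (x^d \circ \pi)(X)}_{(U,x)\text{ coords of } \pi(X) \text{ ($d$-many)}},\ \underbrace{(dx^1)_{\pi(X)}(X), \dots, (dx^d)_{\pi(X)}(X)}_{\text{components of } X \text{ w.r.t. } (U,x) \text{ ($d$-many)}}\big) \tag{6.4}$$
In the above, $(x^1 \circ \pi)(X) = x^1(\pi(X)) = x^1(p)$ is the $x^1$-coordinate, and
$$X \in T_{\pi(X)}M \implies X = X^i_{(x)} \left(\frac{\partial}{\partial x^i}\right)_{\pi(X)} \implies (dx^j)_{\pi(X)}(X) = (dx^j)_{\pi(X)}\left(X^i_{(x)} \left(\frac{\partial}{\partial x^i}\right)_{\pi(X)}\right) = X^i_{(x)} \delta^j_i = X^j_{(x)}.$$
Thus $\xi_x$ maps $X$ to the coordinates of its base point $\pi(X)$ under the chart $(U, x)$ and the components of the vector $X$ w.r.t. the basis induced by this chart.
We can write $\xi_x^{-1}$ as follows:
$$\xi_x^{-1} : \underbrace{\xi_x(TU)}_{\subseteq \mathbb{R}^{2d}} \to TU, \qquad (\alpha^1, \dots, \alpha^d, \beta^1, \dots, \beta^d) \mapsto \beta^i \left(\frac{\partial}{\partial x^i}\right)_{\underbrace{x^{-1}(\alpha^1, \dots, \alpha^d)}_{\pi(X)}} \tag{6.5}$$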

Now we check whether the atlas $\mathcal{A}_{TM}$ is smooth. That is, are the transitions between its charts smooth?
Theorem 6.1. $\mathcal{A}_{TM}$ is a smooth atlas.
Proof. Let $(TU, \xi_x) \in \mathcal{A}_{TM}$, $(TV, \xi_y) \in \mathcal{A}_{TM}$ and $U \cap V \neq \emptyset$. Abbreviating $\alpha := (\alpha^1, \dots, \alpha^d)$, calculate the chart transition
$$\begin{aligned}
(\xi_y \circ \xi_x^{-1})(\alpha^1, \dots, \alpha^d, \beta^1, \dots, \beta^d) &= \xi_y\left(\beta^m \left(\frac{\partial}{\partial x^m}\right)_{x^{-1}(\alpha)}\right) && \text{by Eq. 6.5} \\
&= \left(\dots, (y^i \circ \pi)\left(\beta^m \left(\frac{\partial}{\partial x^m}\right)_{x^{-1}(\alpha)}\right), \dots, \dots, (dy^i)_{x^{-1}(\alpha)}\left(\beta^m \left(\frac{\partial}{\partial x^m}\right)_{x^{-1}(\alpha)}\right), \dots\right) && \text{by Eq. 6.4} \\
&= \left(\dots, y^i\big(\underbrace{x^{-1}(\alpha)}_{\text{the base point}}\big), \dots, \dots, \beta^m (dy^i)_{x^{-1}(\alpha)}\left(\left(\frac{\partial}{\partial x^m}\right)_{x^{-1}(\alpha)}\right), \dots\right) \\
&= \left(\dots, (y^i \circ x^{-1})(\alpha), \dots, \dots, \beta^m \left(\frac{\partial y^i}{\partial x^m}\right)_{x^{-1}(\alpha)}, \dots\right) \\
&= \left(\dots, (y^i \circ x^{-1})(\alpha), \dots, \dots, \beta^m\, \partial_m (y^i \circ x^{-1})\big(x(x^{-1}(\alpha))\big), \dots\right) \\
&= \big(\dots, \underbrace{(y^i \circ x^{-1})(\alpha)}_{\text{smooth, since } \mathcal{A} \text{ is a smooth atlas}}, \dots, \dots, \underbrace{\beta^m\, \partial_m (y^i \circ x^{-1})(\alpha)}_{\text{smooth, since the chart transition map is } C^\infty}, \dots\big)
\end{aligned}$$
$\implies (\xi_y \circ \xi_x^{-1})$ is smooth $\implies \mathcal{A}_{TM}$ is smooth.

Further, the surjective map $\pi$ is a smooth map because, in the chart representation, $\pi$ takes the $2d$ components of $X \in TM$ to the $d$ coordinates of the base point in $M$, which can be seen to happen smoothly by seeing how the components are mapped. Therefore, we have the following definition.
Definition 34. Using the smooth manifold $(M, \mathcal{O}, \mathcal{A})$ as the base space and the smooth manifold $(TM, \mathcal{O}_{TM}, \mathcal{A}_{TM})$ as the total space, the tangent bundle is the triple
$$TM \xrightarrow{\pi} M \tag{6.6}$$

6.3 Vector fields

Why did we put so much effort into making a smooth atlas on $TM$ and defining a tangent bundle? The answer is in the following definition of a smooth vector field, not just any vector field.

Definition 35. For a tangent bundle $TM \xrightarrow{\pi} M$, a smooth vector field $\chi$ is a smooth map $\chi : M \to TM$ such that $\pi \circ \chi = \mathrm{id}_M$.

Remarks: $\chi$ is a section, which couldn't have been a smooth map unless we had both $M$ and $TM$ as smooth manifolds.

6.4 The $C^\infty(M)$-module $\Gamma(TM)$

We already know that $C^\infty(M)$, the collection of all smooth functions, is a vector space with S-multiplication with $\mathbb{R}$. But we may also consider the structure $(C^\infty(M), +, \cdot)$ with point-wise addition between elements of $C^\infty(M)$ and point-wise multiplication between elements of $C^\infty(M)$. This structure satisfies all the requirements of a field (commutativity, associativity, neutral element, inverse element under both operations, and distributivity) except that not every non-zero element has an inverse under multiplication. This is so because a function that is not zero everywhere may still be zero at some points, and then point-wise multiplication with no function would result in the value 1 everywhere. Such a structure is called a ring.
A module over a ring is a generalization of the notion of a vector space over a field, wherein the corresponding scalars are the elements of an arbitrary given ring.
Let us consider the module made from the set of all smooth vector fields over the ring $C^\infty(M)$. Define
$$\Gamma(TM) = \{\chi : M \to TM \mid \chi \text{ is a smooth section}\} \tag{6.7}$$

Definition 36. $(\Gamma(TM), \oplus, \odot)$ is a $C^\infty(M)$-module over the ring of $C^\infty(M)$ functions with $\chi, \tilde\chi \in \Gamma(TM)$ and $g \in C^\infty(M)$, such that
$$(\chi \oplus \tilde\chi)(f) := (\chi f) +_{C^\infty(M)} (\tilde\chi f)$$
$$(g \odot \chi)(f) := g \cdot_{C^\infty(M)} \chi(f)$$

Facts: Besides other differences, there are the following 2 important facts:
(1) Proving that every vector space has a basis depends upon the choice of set theory; in particular, on the Axiom of Choice in ZFC theory.
(2) No such result exists for modules.

This is a shame, because otherwise we could have chosen (for any manifold) vector fields $\chi_{(1)}, \dots, \chi_{(d)} \in \Gamma(TM)$ and would be able to write every vector field $\chi$ in terms of component functions $f^i$ as $\chi = f^i \chi_{(i)}$.
Simple counterexample: Take a sphere. Can we find a smooth vector field over the entire sphere that vanishes nowhere? Can you comb the sphere? No. Every smooth vector field on a sphere must vanish somewhere (the hairy ball theorem; by the Poincaré-Hopf theorem the indices of its zeros add up to 2) $\implies$ a basis cannot be chosen. We cannot choose a global basis. Therefore, if required, we only expand a vector field in terms of a basis on a domain where it is possible.
Remarks: Although we cannot have a global basis for $\Gamma(TM)$, it is possible to have one locally. Thus, for the chart $(U, x)$ we can take the chart-induced basis of the vector fields in the chart domain $U$ as the map
$$\frac{\partial}{\partial x^i} : U \xrightarrow{\text{smooth}} TU, \qquad p \mapsto \left(\frac{\partial}{\partial x^i}\right)_p \tag{6.8}$$

6.5 Tensor fields

So far we have constructed the sections over the tangent bundle. That is, $\Gamma(TM)$ = set of smooth vector fields as a $C^\infty(M)$-module.
Exactly along the same lines we can construct the cotangent bundle $T^*M$ and $\Gamma(T^*M)$ = set of smooth covector fields as a $C^\infty(M)$-module, by mapping a covector to the coordinates of its base point and the components of the covector.
$\Gamma(TM)$ and $\Gamma(T^*M)$ are the basic building blocks for every tensor field.
Definition 37. An $(r, s)$-tensor field $T$ is a multilinear map
$$T : \underbrace{\Gamma(T^*M) \times \dots \times \Gamma(T^*M)}_{r} \times \underbrace{\Gamma(TM) \times \dots \times \Gamma(TM)}_{s} \to C^\infty(M) \tag{6.9}$$
Remarks: the multilinearity is $C^\infty(M)$-multilinearity, in terms of addition in the modules and S-multiplication with functions in $C^\infty(M)$.
Example: Let $f \in C^\infty(M)$. Then, define a $(0, 1)$-tensor field $df$ as
$$df : \Gamma(TM) \to C^\infty(M), \qquad \chi \mapsto df(\chi) := \chi f \quad \text{such that} \quad (\chi f)(\underbrace{p}_{\in M}) := \underbrace{\chi(p)}_{\in T_pM} f$$
It can be checked that $df$ is $C^\infty$-linear.

7 Connections

Motivation: So far, all we have dealt with (e.g., sets, topological manifolds, smooth manifolds, fields, bundles, etc.) are structures that we have to provide by hand before we can start doing physics as we know it. Why? Because we don't have equations which determine what we have done so far. These are assumptions you need to submit before you can do physics.
In this lecture we introduce yet another structure, called connections, which are determined by Einstein's equations. Everything from now on will be objects that are the subject of Einstein's equations, depending on the matter in the Universe. Connections are also called covariant derivatives. Even though these are different, for our purposes we shall not distinguish the two and use the more general connections.

So far, we saw that a vector field $X$ can be used to provide a directional derivative of a function $f \in C^\infty(M)$ in the direction $X$:
$$\nabla_X f := Xf$$
Isn't this a notational overkill? We already know
$$\nabla_X f = Xf = (df)X$$
Actually, they are not quite the same because
$$X : C^\infty(M) \to C^\infty(M)$$
$$df : \Gamma(TM) \to C^\infty(M)$$
$$\nabla_X : C^\infty(M) \to C^\infty(M)$$
where $\nabla_X$ can be generalized to eat an arbitrary $(p, q)$-tensor field and yield a $(p, q)$-tensor field, whereas $X$ can only eat functions.
$$\begin{aligned}
\nabla_X : C^\infty(M) &\to C^\infty(M) \\
&\ \ \vdots \\
\nabla_X : (p, q)\text{-tensor field} &\to (p, q)\text{-tensor field}
\end{aligned}$$
We need $\nabla_X$ to provide the new structure that allows us to talk about directional derivatives of tensor fields and vector fields. Of course, only in the case where $\nabla_X$ acts on a function $f$, which is a $(0, 0)$-tensor, is it exactly the same as $Xf$.

7.1 Directional derivatives of tensor fields

We formulate a wish list of properties which $\nabla_X$ acting on a tensor field should have. We put this in the form of a definition. There may be many structures that satisfy this wish list. Any remaining freedom in choosing such a $\nabla$ will need to be provided as additional structure beyond the structure we already have. And we assume all this takes place on a smooth manifold.
Definition 38. A connection $\nabla$ on a smooth manifold $(M, \mathcal{O}, \mathcal{A})$ is a map that takes a pair consisting of a vector (field) $X$ and a $(p, q)$-tensor field $T$ and sends them to a $(p, q)$-tensor (field) $\nabla_X T$ satisfying
i) $\nabla_X f = Xf \quad \forall f \in C^\infty(M)$
ii) $\nabla_X (T + S) = \nabla_X T + \nabla_X S$ where $T, S$ are $(p, q)$-tensors
iii) Leibniz rule: where $T$ is a $(p, q)$-tensor,
$$\begin{aligned}
\nabla_X \big(T(\omega_1, \dots, \omega_p, Y_1, \dots, Y_q)\big) = {} & (\nabla_X T)(\omega_1, \dots, \omega_p, Y_1, \dots, Y_q) \\
& + T(\nabla_X \omega_1, \dots, \omega_p, Y_1, \dots, Y_q) + \dots + T(\omega_1, \dots, \nabla_X \omega_p, Y_1, \dots, Y_q) \\
& + T(\omega_1, \dots, \omega_p, \nabla_X Y_1, \dots, Y_q) + \dots + T(\omega_1, \dots, \omega_p, Y_1, \dots, \nabla_X Y_q)
\end{aligned}$$
Note that for a $(p, q)$-tensor $T$ and an $(r, s)$-tensor $S$, since
$$(T \otimes S)(\omega_{(1)}, \dots, \omega_{(p+r)}, Y_{(1)}, \dots, Y_{(q+s)}) = T(\omega_{(1)}, \dots, \omega_{(p)}, Y_{(1)}, \dots, Y_{(q)}) \cdot S(\omega_{(p+1)}, \dots, \omega_{(p+r)}, Y_{(q+1)}, \dots, Y_{(q+s)}),$$
the Leibniz rule implies $\nabla_X (T \otimes S) = (\nabla_X T) \otimes S + T \otimes (\nabla_X S)$.
iv) $C^\infty$-linearity: $\forall f \in C^\infty(M)$, $\nabla_{fX+Z} T = f \nabla_X T + \nabla_Z T$
$C^\infty$-linearity means that no matter how the function $f$ scales the vectors at different points of the manifold, the effect of the scaling at any point is independent of the scaling in the neighbourhood and depends only on how the scaling happens at that point.
A manifold with a connection is a quadruple $(M, \mathcal{O}, \mathcal{A}, \nabla)$, where $M$ is a set, $\mathcal{O}$ is a topology, $\mathcal{A}$ is a smooth atlas and $\nabla$ is a connection.
Remark: If $\nabla_X(\cdot)$ can be seen as an extension of $X$, then $\nabla_{(\cdot)}(\cdot)$ can be seen as an extension of $d$.

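7.2 New structure on $(M, \mathcal{O}, \mathcal{A})$ required to fix $\nabla$

How much freedom do we have in choosing such a structure?

Consider vector fields $X, Y$ and a chart $(U, x) \in \mathcal{A}$. Then
$$\begin{aligned}
\nabla_X Y &= \nabla_{X^i \frac{\partial}{\partial x^i}} \left(Y^m \frac{\partial}{\partial x^m}\right) && \text{by expanding in the chart-induced basis} \\
&= X^i\, \nabla_{\frac{\partial}{\partial x^i}} \left(Y^m \frac{\partial}{\partial x^m}\right) && \text{by } C^\infty\text{-linearity} \\
&= X^i \underbrace{\left(\nabla_{\frac{\partial}{\partial x^i}} Y^m\right)}_{= \frac{\partial Y^m}{\partial x^i}} \frac{\partial}{\partial x^m} + X^i Y^m \underbrace{\nabla_{\frac{\partial}{\partial x^i}} \frac{\partial}{\partial x^m}}_{\text{a vector field, by defn.}} && \text{using the Leibniz rule} \\
&= X^i \frac{\partial Y^m}{\partial x^i} \frac{\partial}{\partial x^m} + X^i Y^m\, \Gamma^q{}_{mi} \frac{\partial}{\partial x^q}
\end{aligned}$$
Thus, by change of indices,
$$(\nabla_X Y)^i = X^m \frac{\partial Y^i}{\partial x^m} + X^m Y^n\, \Gamma^i{}_{nm} \tag{7.1}$$
So we need $(\dim M)^3$-many functions to define the directional derivative of a vector field.

Definition 39. Given $(M, \mathcal{O}, \mathcal{A}, \nabla)$ and $(U, x) \in \mathcal{A}$, the connection coefficient functions ($\Gamma$s) on $M$ of $\nabla$ w.r.t. $(U, x)$ are the $(\dim M)^3$-many functions given by
$$\Gamma^i{}_{jk} : U \to \mathbb{R}, \qquad p \mapsto \Gamma^i{}_{jk}(p) := \left(dx^i\left(\nabla_{\frac{\partial}{\partial x^k}} \frac{\partial}{\partial x^j}\right)\right)(p) \tag{7.2}$$
Note: $\frac{\partial}{\partial x^j}$ is a vector field; $\nabla_{\frac{\partial}{\partial x^k}} \frac{\partial}{\partial x^j}$ is a vector field, and $dx^i$ is a covector field which results in a function after acting on a vector field.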

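On a chart domain $U$, the choice of the $(\dim M)^3$-many functions $\Gamma^i{}_{jk}$ suffices to fix the action of $\nabla$ on a vector field. What about the directional derivative of a covector field, or a tensor field? Will we have to provide more and more coefficients? Fortunately, the same $(\dim M)^3$-many functions fix the action of $\nabla$ on any tensor field.

We know that, for a covector, $\nabla_{\frac{\partial}{\partial x^m}} dx^i = \Sigma^i{}_{jm}\, dx^j$ for some coefficient functions $\Sigma^i{}_{jm}$, since the $dx^i$ form a dual basis. Are these $\Sigma$s independent of the $\Gamma$s? Consider the following.
$$\begin{aligned}
& \nabla_{\frac{\partial}{\partial x^m}} \left(dx^i\left(\frac{\partial}{\partial x^j}\right)\right) = \nabla_{\frac{\partial}{\partial x^m}} \delta^i_j = \frac{\partial}{\partial x^m}(\delta^i_j) = 0 \\
\implies{} & \left(\nabla_{\frac{\partial}{\partial x^m}} dx^i\right)\left(\frac{\partial}{\partial x^j}\right) + dx^i\Big(\underbrace{\nabla_{\frac{\partial}{\partial x^m}} \frac{\partial}{\partial x^j}}_{\Gamma^q{}_{jm} \frac{\partial}{\partial x^q}}\Big) = 0 \\
\implies{} & \left(\nabla_{\frac{\partial}{\partial x^m}} dx^i\right)\left(\frac{\partial}{\partial x^j}\right) = -dx^i\left(\Gamma^q{}_{jm} \frac{\partial}{\partial x^q}\right) = -\Gamma^q{}_{jm}\, dx^i\left(\frac{\partial}{\partial x^q}\right) = -\Gamma^q{}_{jm}\, \delta^i_q = -\Gamma^i{}_{jm} \\
\implies{} & \left(\nabla_{\frac{\partial}{\partial x^m}} dx^i\right)_j = -\Gamma^i{}_{jm} \\
\implies{} & \nabla_{\frac{\partial}{\partial x^m}} dx^i = -\Gamma^i{}_{jm}\, dx^j
\end{aligned}$$
In summary,
$$(\nabla_X Y)^i = X(Y^i) + \Gamma^i{}_{jm}\, Y^j X^m \tag{7.3}$$
$$(\nabla_X \omega)_i = X(\omega_i) - \Gamma^j{}_{im}\, \omega_j X^m \tag{7.4}$$
Note that in the expression for $(\nabla_X Y)^i$ immediately above, in the second term on the right hand side, $\Gamma^i{}_{jm}$ has the last entry at the bottom, $m$, going in the direction of $X$, so that it matches up with $X^m$. This is a good mnemonic to memorize the index positions of $\Gamma$.
Similarly, as an example, by further application of the Leibniz rule, for a $(1, 2)$-tensor field $T$,
$$(\nabla_X T)^i{}_{jk} = X(T^i{}_{jk}) + \Gamma^i{}_{sm}\, T^s{}_{jk} X^m - \Gamma^s{}_{jm}\, T^i{}_{sk} X^m - \Gamma^s{}_{km}\, T^i{}_{js} X^m$$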

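7.3 Change of $\Gamma$s under change of chart

Let $(U, x), (V, y) \in \mathcal{A}$ and $U \cap V \neq \emptyset$.
$$\begin{aligned}
\Gamma_{(y)}{}^i{}_{jk} &:= dy^i\left(\nabla_{\frac{\partial}{\partial y^k}} \frac{\partial}{\partial y^j}\right) \\
&= \frac{\partial y^i}{\partial x^q}\, dx^q\left(\nabla_{\frac{\partial x^p}{\partial y^k} \frac{\partial}{\partial x^p}} \left(\frac{\partial x^s}{\partial y^j} \frac{\partial}{\partial x^s}\right)\right) \\
&= \frac{\partial y^i}{\partial x^q}\, dx^q\left(\frac{\partial x^p}{\partial y^k} \left[\left(\frac{\partial}{\partial x^p} \frac{\partial x^s}{\partial y^j}\right) \frac{\partial}{\partial x^s} + \frac{\partial x^s}{\partial y^j}\, \nabla_{\frac{\partial}{\partial x^p}} \frac{\partial}{\partial x^s}\right]\right) && \nabla \text{ is } C^\infty\text{-linear in the lower slot} \\
&= \frac{\partial y^i}{\partial x^q} \underbrace{\frac{\partial x^p}{\partial y^k} \frac{\partial}{\partial x^p}}_{\frac{\partial}{\partial y^k}} \frac{\partial x^s}{\partial y^j}\, \delta^q_s + \frac{\partial y^i}{\partial x^q} \frac{\partial x^p}{\partial y^k} \frac{\partial x^s}{\partial y^j}\, \Gamma_{(x)}{}^q{}_{sp}
\end{aligned}$$
$$\Gamma_{(y)}{}^i{}_{jk} = \frac{\partial y^i}{\partial x^q} \frac{\partial^2 x^q}{\partial y^k \partial y^j} + \frac{\partial y^i}{\partial x^q} \frac{\partial x^s}{\partial y^j} \frac{\partial x^p}{\partial y^k}\, \Gamma_{(x)}{}^q{}_{sp} \tag{7.5}$$
Eq. (7.5) is the change of the connection coefficient functions under the change of chart $(U \cap V, x) \to (U \cap V, y)$. $\Gamma$ is not a tensor due to the first term on the right hand side of Eq. (7.5). However, for a linear transformation between the coordinates in two charts, the term $\frac{\partial^2 x^q}{\partial y^k \partial y^j}$ always vanishes and then, if the $\Gamma$s are zero in one chart, they will be zero in the other chart too. However, there is no reason to restrict ourselves to coordinates which are linear transformations of one another.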
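As an illustration of the non-tensorial term in the change of chart, suppose (this is an assumption made only for this sketch) that the connection coefficients vanish in a Cartesian chart $x$ on the plane, and let $y = (r, \varphi)$ be polar coordinates with $x^1 = r \cos\varphi$, $x^2 = r \sin\varphi$. Then only the second-derivative term $\frac{\partial y^i}{\partial x^q} \frac{\partial^2 x^q}{\partial y^j \partial y^k}$ survives, and it produces non-zero coefficients in the polar chart. The derivatives below are entered analytically.

```python
import math

def gamma_polar(r, phi):
    """Connection coefficients G[i][j][k] = Gamma_(y)^i_{jk} in polar coordinates
    y = (r, phi), obtained from Eq. (7.5) with Gamma_(x) = 0 in Cartesian x."""
    c, s = math.cos(phi), math.sin(phi)
    # dy^i/dx^q, evaluated at the point with polar coordinates (r, phi)
    dy_dx = [[c, s],
             [-s / r, c / r]]
    # second derivatives d2x[q][j][k] = d^2 x^q / dy^j dy^k of x = (r cos phi, r sin phi)
    d2x = [[[0.0, -s], [-s, -r * c]],
           [[0.0, c], [c, -r * s]]]
    return [[[sum(dy_dx[i][q] * d2x[q][j][k] for q in range(2))
              for k in range(2)] for j in range(2)] for i in range(2)]

G = gamma_polar(2.0, 0.7)
print(G[0][1][1])   # Gamma^r_{phi phi}, analytically -r
print(G[1][0][1])   # Gamma^phi_{r phi}, analytically 1/r
```

So coefficients that are all zero in one chart are non-zero in a chart related to it by a nonlinear transformation, which is exactly the failure of $\Gamma$ to be a tensor.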

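7.4 Normal Coordinates

Can we find a coordinate system that makes the $\Gamma$s vanish?

Theorem 7.1. Let $p \in M$ of $(M, \mathcal{O}, \mathcal{A}, \nabla)$. Having chosen a point $p$, one can construct a chart $(U, x)$ with $p \in U$ such that the symmetric part of the $\Gamma$s vanishes at the point $p$ (not necessarily in any neighbourhood). That is,
$$\forall p \in M,\ \exists\, (U, x) \in \mathcal{A} : p \in U \text{ and } \Gamma_{(x)}{}^i{}_{(jk)}(p) = 0.$$
Such a $(U, x)$ is called a normal coordinate chart of $\nabla$ at $p \in M$.
Proof. Let $(V, y) \in \mathcal{A}$ and $p \in V$. Then consider a new chart $(U, x)$ to which one transits using the map $(x \circ y^{-1})$ whose $i$th component is given by
$$(x \circ y^{-1})^i(\alpha^1, \dots, \alpha^d) := \alpha^i + \tfrac{1}{2}\, \Gamma_{(y)}{}^i{}_{(jk)}\, \alpha^j \alpha^k \qquad \text{where the } \Gamma\text{s are taken at the point } p$$
$$\implies \frac{\partial x^i}{\partial y^j} = \partial_j (x^i \circ y^{-1}) = \delta^i_j + \Gamma_{(y)}{}^i{}_{(jm)}\, \alpha^m$$
$$\implies \frac{\partial^2 x^i}{\partial y^k \partial y^j} = \Gamma_{(y)}{}^i{}_{(jk)}$$

TODO: Not understood by me.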

8 Parallel Transport & Curvature

8.1 Parallelity of vector fields

Definition 40. Let $(M, \mathcal{O}, \mathcal{A}, \nabla)$ be a smooth manifold with connection $\nabla$.
(1) A vector field $X$ on $M$ is said to be parallely transported along a smooth curve $\gamma : \mathbb{R} \to M$ if
$$\nabla_{v_\gamma} X = 0 \tag{8.1}$$
To make explicit how this equation applies along the curve, we may state
$$\left(\nabla_{v_{\gamma,\gamma(\lambda)}} X\right)_{\gamma(\lambda)} = 0$$
(2) A slightly weaker condition is parallel if, for $\mu : \mathbb{R} \to \mathbb{R}$,
$$\left(\nabla_{v_{\gamma,\gamma(\lambda)}} X\right)_{\gamma(\lambda)} = \mu(\lambda)\, X_{\gamma(\lambda)} \tag{8.2}$$
Remarks: Even though "parallely transported" sounds like an action, it is a property.

8.2 Autoparallely transported curves

Definition 41. A curve $\gamma : \mathbb{R} \to M$ is called autoparallely transported if
$$\nabla_{v_\gamma} v_\gamma = 0 \tag{8.3}$$
Remarks: Sometimes, this curve is called an autoparallel curve. But we wish to call a curve autoparallel if $\nabla_{v_\gamma} v_\gamma = \mu\, v_\gamma$.

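8.3 Autoparallel equation

Express $\nabla_{v_\gamma} v_\gamma = 0$ in terms of a chart representation. Remember that $\dot\gamma^m_{(x)} := (x^m \circ \gamma)'$; below, the $(x)$ index is understood and hence suppressed.
$$\begin{aligned}
0 &= \nabla_{v_\gamma} v_\gamma \\
&= \nabla_{\dot\gamma^m_{(x)} \frac{\partial}{\partial x^m}} \left(\dot\gamma^n_{(x)} \frac{\partial}{\partial x^n}\right) \\
&= \dot\gamma^m \frac{\partial \dot\gamma^n}{\partial x^m} \frac{\partial}{\partial x^n} + \dot\gamma^m \dot\gamma^n\, \nabla_{\frac{\partial}{\partial x^m}} \frac{\partial}{\partial x^n} \\
&= \dot\gamma^m \frac{\partial \dot\gamma^q}{\partial x^m} \frac{\partial}{\partial x^q} + \dot\gamma^m \dot\gamma^n\, \Gamma^q{}_{nm} \frac{\partial}{\partial x^q} && \text{change of index in 1st term} \\
&= \left(\ddot\gamma^q + \dot\gamma^m \dot\gamma^n\, \Gamma^q{}_{nm}\right) \frac{\partial}{\partial x^q}
\end{aligned}$$
In the last step the first term is the second derivative, since $\dot\gamma^m \frac{\partial \dot\gamma^q}{\partial x^m}$ is the derivative of $\dot\gamma^q$ along the curve, i.e. $\frac{d}{d\lambda} \dot\gamma^q = \ddot\gamma^q$, by the chain rule.

In summary:
$$\ddot\gamma^q_{(x)}(\lambda) + \Gamma_{(x)}{}^q{}_{mn}(\gamma(\lambda))\, \dot\gamma^m_{(x)}(\lambda)\, \dot\gamma^n_{(x)}(\lambda) = 0 \tag{8.4}$$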

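Eq. (8.4) is the chart expression of the condition that $\gamma$ be autoparallely transported.
Example: (a) In the Euclidean plane with a chart $(U = \mathbb{R}^2, x = \mathrm{id}_{\mathbb{R}^2})$, $\Gamma_{(x)}{}^i{}_{jk} = 0$
$$\implies \ddot\gamma^m_{(x)} = 0 \implies \gamma^m_{(x)}(\lambda) = a^m \lambda + b^m, \quad \text{where } a, b \in \mathbb{R}^2.$$
(b) Consider the round sphere $(S^2, \mathcal{O}, \mathcal{A}, \nabla_{round})$, i.e., the sphere $(S^2, \mathcal{O}, \mathcal{A})$ with the connection $\nabla_{round}$. Consider the chart $x(p) = (\theta, \varphi)$ where $\theta \in (0, \pi)$ and $\varphi \in (0, 2\pi)$. In this chart $\nabla_{round}$ is given by
$$\Gamma_{(x)}{}^1{}_{22}\big(x^{-1}(\theta, \varphi)\big) := -\sin\theta \cos\theta$$
$$\Gamma_{(x)}{}^2{}_{12}\big(x^{-1}(\theta, \varphi)\big) = \Gamma_{(x)}{}^2{}_{21}\big(x^{-1}(\theta, \varphi)\big) := \cot\theta$$
All other $\Gamma$s vanish. Then, using the sloppy notation (familiar to us from classical mechanics), i.e., $x^1(p) = \theta(p)$ and $x^2(p) = \varphi(p)$, the autoparallel equation is
$$\left.\begin{aligned} \ddot\theta + \Gamma^1{}_{22}\, \dot\varphi \dot\varphi &= 0 \\ \ddot\varphi + 2\, \Gamma^2{}_{12}\, \dot\theta \dot\varphi &= 0 \end{aligned}\right\} \implies \begin{aligned} \ddot\theta - \sin\theta \cos\theta\, \dot\varphi^2 &= 0 \\ \ddot\varphi + 2 \cot\theta\, \dot\theta \dot\varphi &= 0 \end{aligned}$$
It can be seen that the above equations are satisfied at the equator, where $\theta(\lambda) = \pi/2$ and $\varphi(\lambda) = \omega\lambda + \varphi_0$ (running around the equator at constant speed $\omega$). Thus, this curve is autoparallel. However, $\varphi(\lambda) = \omega\lambda^2 + \varphi_0$ wouldn't be autoparallel.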
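The claim about the equator can be checked by plugging both parametrizations into the left-hand sides of the two autoparallel equations of the round sphere, $\ddot\theta - \sin\theta \cos\theta\, \dot\varphi^2$ and $\ddot\varphi + 2 \cot\theta\, \dot\theta \dot\varphi$. In this small sketch the function name and the numerical values of $\omega$ and $\lambda$ are arbitrary choices, and the derivatives of the two curves are entered by hand.

```python
import math

def autoparallel_residuals(theta, dtheta, ddtheta, dphi, ddphi):
    """Left-hand sides of the two autoparallel equations on the round sphere."""
    res_theta = ddtheta - math.sin(theta) * math.cos(theta) * dphi ** 2
    res_phi = ddphi + 2.0 * (math.cos(theta) / math.sin(theta)) * dtheta * dphi
    return res_theta, res_phi

omega, lam = 0.3, 1.7   # hypothetical speed and parameter value

# Equator at constant speed: theta = pi/2, phi = omega * lam + phi_0
uniform = autoparallel_residuals(math.pi / 2, 0.0, 0.0, omega, 0.0)

# Equator with phi = omega * lam**2 + phi_0: dphi = 2 omega lam, ddphi = 2 omega
accelerated = autoparallel_residuals(math.pi / 2, 0.0, 0.0, 2 * omega * lam, 2 * omega)

print(uniform)       # both residuals vanish up to rounding
print(accelerated)   # the phi residual equals 2 * omega, so the equation fails
```

Both curves trace out the same path, the equator; only the uniformly parametrized one satisfies the equation, which shows that the autoparallel property depends on the parametrization and not just on the set of points.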

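8.4 Torsion

Can we use $\nabla$ to define tensors on $(M, \mathcal{O}, \mathcal{A}, \nabla)$?

Definition 42. The torsion of a connection $\nabla$ is the $(1, 2)$-tensor field
$$T(\omega, X, Y) := \omega(\nabla_X Y - \nabla_Y X - [X, Y]) \tag{8.5}$$
where $[X, Y]$, called the commutator of $X$ and $Y$, is the vector field defined by $[X, Y]f := X(Yf) - Y(Xf)$.
Proof. We shall check that $T$ is $C^\infty$-linear in each entry.
$$\begin{aligned}
T(f\omega, X, Y) &= f\omega(\nabla_X Y - \nabla_Y X - [X, Y]) = f\, T(\omega, X, Y) \\
T(\omega + \psi, X, Y) &= (\omega + \psi)(\nabla_X Y - \nabla_Y X - [X, Y]) = T(\omega, X, Y) + T(\psi, X, Y) \\
T(\omega, fX, Y) &= \omega(\nabla_{fX} Y - \nabla_Y (fX) - [fX, Y]) \\
&= \omega(f \nabla_X Y - (\nabla_Y f) X - f (\nabla_Y X) - [fX, Y]) \\
&= \omega(f \nabla_X Y - (Yf) X - f (\nabla_Y X) - [fX, Y])
\end{aligned}$$
But $[fX, Y]g = fX(Yg) - Y(f\, Xg) = fX(Yg) - (Yf)(Xg) - fY(Xg) \implies [fX, Y] = f[X, Y] - (Yf)X$, so
$$\begin{aligned}
T(\omega, fX, Y) &= \omega(f \nabla_X Y - (Yf) X - f (\nabla_Y X) - f[X, Y] + (Yf) X) \\
&= \omega(f \nabla_X Y - f (\nabla_Y X) - f[X, Y]) \\
&= f\, \omega(\nabla_X Y - \nabla_Y X - [X, Y]) = f\, T(\omega, X, Y)
\end{aligned}$$
Further, $T(\omega, X, Y) = -T(\omega, Y, X)$, which means scaling in the last factor need not be checked separately.
Additivity in the last two factors can also be checked.

Definition 43. A $(M, \mathcal{O}, \mathcal{A}, \nabla)$ is called torsion-free if the torsion of its connection is zero. That is, $T = 0$.

In a chart, with $\Gamma^i{}_{jk} = dx^i\big(\nabla_{\frac{\partial}{\partial x^k}} \frac{\partial}{\partial x^j}\big)$ as in Definition 39 and since the chart-induced basis fields commute,
$$T^i{}_{ab} := T\left(dx^i, \frac{\partial}{\partial x^a}, \frac{\partial}{\partial x^b}\right) = dx^i\left(\nabla_{\frac{\partial}{\partial x^a}} \frac{\partial}{\partial x^b} - \nabla_{\frac{\partial}{\partial x^b}} \frac{\partial}{\partial x^a}\right) = \Gamma^i{}_{ba} - \Gamma^i{}_{ab} = 2\, \Gamma^i{}_{[ba]}$$
From now on, in these lectures, we only use torsion-free connections.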

8.5 Curvature

Definition 44. The Riemann curvature of a connection $\nabla$ is the $(1, 3)$-tensor field
$$\mathrm{Riem}(\omega, Z, X, Y) := \omega(\nabla_X \nabla_Y Z - \nabla_Y \nabla_X Z - \nabla_{[X,Y]} Z) \tag{8.6}$$
Proof. It can be shown that Riem is $C^\infty$-linear in each slot. This has been left as an exercise to the reader.
Algebraic relevance of Riem: We ask whether there is a difference in applying the two directional derivatives in different order, i.e.
$$\nabla_X \nabla_Y Z - \nabla_Y \nabla_X Z = \mathrm{Riem}(\cdot, Z, X, Y) + \nabla_{[X,Y]} Z$$
In one chart $(U, x)$, denoting $\nabla_{\frac{\partial}{\partial x^a}}$ by $\nabla_a$,
$$(\nabla_a \nabla_b Z)^m - (\nabla_b \nabla_a Z)^m = \mathrm{Riem}^m{}_{nab}\, Z^n + \underbrace{\left(\nabla_{\left[\frac{\partial}{\partial x^a}, \frac{\partial}{\partial x^b}\right]} Z\right)^m}_{=0, \text{ since they commute}}$$
As the last term vanishes, we can see how the Riem tensor components contain all the information about how $\nabla_a$ and $\nabla_b$ fail to commute if they act on a vector field. If they act on a tensor field, there are several terms on the RHS like the one term above; if they act on a function, of course they commute. Being a tensor, Riem vanishes in all coordinate systems if it vanishes in one coordinate system, as it does in flat spaces.
Geometric significance of Riem:
Figure 8.1: If we parallel transport a vector $u$ at $p$ to $q$ along two different paths $vw$ and $wv$, the resulting vectors at $q$ are different in general. If, however, we parallel transport a vector in a Euclidean space, where the parallel transport is defined in our usual sense, the resulting vector does not depend on the path along which it has been parallel transported. We expect that this non-integrability of parallel transport characterizes the intrinsic notion of curvature, which does not depend on the special coordinates chosen. From an answer of Sepideh Bakhoda [1] on http://math.stackexchange.com/q/465672
For small $v$ and $w$, if $T = 0$, $(\delta u)^m = \mathrm{Riem}^m{}_{nab}\, v^a w^b u^n + O(v^2 w, v w^2)$.

9 Newtonian spacetime is curved!

Axiom 1 (Newton I:). A body on which no force acts moves uniformly along a straight line.
Axiom 2 (Newton II:). Deviation of a body's motion from such uniform straight motion is effected by a force, reduced by a factor of the body's reciprocal mass.
Remarks:
(1) The 1st axiom, in order to be relevant, must be read as a measurement prescription for the geometry of space. If somehow we know that no force acts on a particle, we know that the path it takes is a straight line; thus, we learn about the geometry of space. After all, unlike in maths, there is no obvious way to tell what a straight line is. Remember, if we don't know what a straight line is, we don't know what a deviation from a straight line is.
(2) Since gravity universally acts on every particle, in a universe with at least two particles, gravity must not be considered a force if Newton I is supposed to remain applicable.

9.1 Laplace's questions

Question: Can gravity be encoded in a curvature of space, such that its effects show if particles under the influence of (no other) force are postulated to move along straight lines in this curved space?
Answer: No!
Proof. Gravity, from the force point of view:
$$m\, \ddot x^\alpha(t) = \underbrace{m\, f^\alpha(x(t))}_{\text{force: } F^\alpha}$$
where $-\partial_\alpha f^\alpha = 4\pi G \rho$ (Poisson); $\rho$ = mass density of matter.

The same $m$ appearing on both sides of the equation is an experimental fact, also known as the weak equivalence principle. Hence
$$\ddot x^\alpha(t) - f^\alpha(x(t)) = 0$$
Laplace asks: Is this ($\ddot x(t)$) of the form $\ddot x^\alpha(t) + \Gamma^\alpha{}_{\beta\gamma}(x(t))\, \dot x^\beta(t)\, \dot x^\gamma(t) = 0$? That is, does it take the form of an autoparallel equation?
No. Because the $\Gamma$ can only depend on the point $x$ where you are, but the velocities $\dot x^\beta(t)$ and $\dot x^\gamma(t)$ can take any value, and therefore the $\Gamma$s cannot take care of the $f$ in the preceding equation. Had there been such $\Gamma$s, we would be able to find a notion of straight line that could have absorbed the effect that we usually attribute to a force.
Conclusion: One cannot find $\Gamma$s such that Newton's equation takes the form of an autoparallel equation.

9.2 The full wisdom of Newton I

Laplace asked: Can we find a curvature of space such that particles move along straight lines?
Use the information from Newton's first law that particles (under the influence of no force) move not just in a straight line, but also uniformly. A curve, after all, is not just a set of points, but also how the parameter is associated with the points.

Introduce the appropriate setting to talk about the difference easily. How? We use spacetime instead of just space. By using the extra coordinate, viz. time, we do not need to keep track of the curve parameter since we can just refer to time to ascertain uniformity of the motion.
Insight: Uniform & straight motion in space is simply straight motion in spacetime. We do not need to say uniform. This can be seen by drawing the path of the particle in a $t$-$x$ graph, wherein a straight line results only when the motion is uniform. So let's try in spacetime:

Let $x : \mathbb{R} \to \mathbb{R}^3$ be a particle's trajectory in space $\longleftrightarrow$ worldline (history) of the particle
$$X : \mathbb{R} \to \mathbb{R}^4, \qquad t \mapsto (t, x^1(t), x^2(t), x^3(t)) := (X^0(t), X^1(t), X^2(t), X^3(t))$$
That's all it takes. Let us assume that $x : \mathbb{R} \to \mathbb{R}^3$ satisfies Newton's law concerning the gravitational force, i.e. we can omit $m$ on both sides of the equation: $\ddot x^\alpha = f^\alpha(x(t))$.
Trivial rewritings:
$$\dot X^0 = 1 \implies \left.\begin{aligned} \ddot X^0 &= 0 \\ \ddot X^\alpha - f^\alpha(X(t))\, \dot X^0 \dot X^0 &= 0 \quad (\alpha = 1, 2, 3) \end{aligned}\right\} \implies \underbrace{\ddot X^a + \Gamma^a{}_{bc}\, \dot X^b \dot X^c = 0}_{\text{autoparallel eqn. in spacetime}} \quad (a = 0, 1, 2, 3)$$
Yes, choosing $\Gamma^0{}_{ab} = 0$, $\Gamma^\alpha{}_{\beta\gamma} = 0 = \Gamma^\alpha{}_{0\beta} = \Gamma^\alpha{}_{\beta 0}$. Only $\Gamma^\alpha{}_{00} = -f^\alpha \neq 0$.
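That the spacetime autoparallel equation with $\Gamma^\alpha{}_{00} = -f^\alpha$ (all other coefficients zero) reproduces Newtonian motion can be tried numerically. The sketch below is an illustration under assumptions not made in the lecture: a uniform field $f = (0, 0, -9.81)$, made-up initial data, an affine parameter $\lambda$ with $dt/d\lambda = a = 2$, and a hand-written classical Runge-Kutta step. It integrates $\ddot X^0 = 0$, $\ddot X^\alpha = f^\alpha \dot X^0 \dot X^0$ in $\lambda$ and compares the result with $x(t) = v t + \tfrac{1}{2} f t^2$.

```python
def f(x):
    """Hypothetical uniform gravitational field f^alpha (independent of x)."""
    return (0.0, 0.0, -9.81)

def rhs(state):
    """state = (X^0..X^3, V^0..V^3); autoparallel equation with Gamma^alpha_00 = -f^alpha."""
    X, V = state[:4], state[4:]
    fx = f(X[1:])
    acc = (0.0,) + tuple(fa * V[0] * V[0] for fa in fx)
    return V + acc

def rk4_step(state, h):
    def shift(s, k, c):
        return tuple(si + c * ki for si, ki in zip(s, k))
    k1 = rhs(state)
    k2 = rhs(shift(state, k1, h / 2))
    k3 = rhs(shift(state, k2, h / 2))
    k4 = rhs(shift(state, k3, h))
    return tuple(s + h / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

a = 2.0                              # dt/dlambda, so that X^0(lambda) = a * lambda
v = (1.0, 0.0, 3.0)                  # initial velocity dx/dt
state = (0.0, 0.0, 0.0, 0.0, a) + tuple(a * vi for vi in v)

steps = 1000
for _ in range(steps):
    state = rk4_step(state, 1.0 / steps)   # integrate lambda from 0 to 1

t = state[0]
newton = tuple(vi * t + 0.5 * fi * t * t for vi, fi in zip(v, f(None)))
print(t, state[1:4], newton)
```

The spatial part of the autoparallel, read off at the absolute time $t = X^0$, agrees with the Newtonian trajectory even though the curve was parametrized by $\lambda$ rather than by $t$.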

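Question: Is this a coordinate-choice artifact?

No, since $R^\alpha{}_{0\beta 0} = -\frac{\partial}{\partial x^\beta} f^\alpha$ (the only non-vanishing components) (tidal force tensor, the Hessian of the force component).
Ricci tensor $\implies R_{00} = R^m{}_{0m0} = -\partial_\alpha f^\alpha = 4\pi G \rho$
Poisson: $-\partial_\alpha f^\alpha = 4\pi G \rho$
Writing $T_{00} = \tfrac{1}{2} \rho$ $\implies R_{00} = 8\pi G\, T_{00}$
Einstein in 1912: $R_{ab} = 8\pi G\, T_{ab}$ (crossed out)
Conclusion: Laplace's idea works in spacetime.
Remark:
$$\Gamma^\alpha{}_{00} = -f^\alpha, \qquad R^\alpha{}_{\beta\gamma\delta} = 0 \quad (\alpha, \beta, \gamma, \delta = 1, 2, 3), \qquad R_{00} = 4\pi G \rho$$
Q: What about the transformation behavior of the LHS of
$$\underbrace{\ddot X^a + \Gamma^a{}_{bc}\, \dot X^b \dot X^c}_{(\nabla_{v_X} v_X)^a\ =:\ a^a \text{, the acceleration vector}} = 0\ ?$$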
9.3

The foundations of the geometric formulation of Newtons axiom

Definition 45. A Newtonian spacetime is a quintuple (M, O, A, , t) where (M, O, A) is a 4-dimensional


smooth manifold, and
33

t : M R smooth function
(dt)p 6= 0

(i) There is an absolute space

p M

(ii) Absolute time flows uniformly


dt
|{z}

=0

everywhere

(0,2)-tensor field

(iii) add to axioms of Newtonian spacetime = 0 torsion free


Definition 46. Absolute space at time
dt6=0

S := {p M |t(p) = } M =

Definition 47. A vector X Tp M is called


(a) future-directed, if dt(X) > 0
(b) spatial, if dt(X) = 0
(c) past-directed, if dt(X) < 0
Picture Newton I: The worldline of a particle under the influence of no force (gravity isnt one, anyway) is a
future-directed autoparallel i.e.
v X vX = 0
dt(vX ) > 0
Newton II:
F
v X vX = m
m a = F
where F is a spatial vector field: dt(F ) = 0.
Convention: restrict attention to atlases Astratif ied whose charts (U, x) have the property
x0 : U R
x1 : U R
..
.

..
.

x0 = t|U

absolute time flows uniformly

0=

xa

dx =

0ba

dt

a = 0, 1, 2, 3

x3
Lets evaluate in a chart (U, x) of a stratified atlas Asheet : Newton II:
F
v X vX = m
in a chart.
((
a (
((
(X 0 )00 + (
0(
)0 (X b )0stratified atlas = 0
cd (X
0

(X )00 + X X + 00 X 0 X 0 + 20 X X 0 =

= (X 0 )00 () = 0 = X 0 () = a + b
X 0 () = (x0 X)()

stratified

convention parametrize worldline by absolute time


d
d
=a
d
dt

34

F
m

= 1, 2, 3

constants a, b with
(t X)()


+ a2 X X + a2 X 0 X 0 + 2 X X 0 = F
a2 X

00
0
m

1
F

0
0

0
+ X X + 00 X X + 2 0 X X =
= X
2
{z
} a m
|
a

35

10

Metric Manifolds

We establish a structure on a smooth manifold that allows one to assign vectors in each tangent space a length (and an angle between vectors in the same tangent space). From this structure, one can then define a notion of length of a curve. Then we can look at shortest curves (which will be called geodesics).
Requiring then that the shortest curves coincide with the straight curves (w.r.t. $\nabla$) will result in $\nabla$ being determined by the metric structure $g$. $\nabla$, in turn, determines the curvature given by Riem. Thus
$$g \;\xrightarrow{\ \text{straight = shortest/longest/stationary curves},\ T = 0\ }\; \nabla \;\longrightarrow\; \text{Riem}$$

10.1

Metrics

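Definition 48. A metric $g$ on a smooth manifold $(M, \mathcal{O}, \mathcal{A})$ is a (0, 2)-tensor field satisfying
(i) symmetry: $g(X, Y) = g(Y, X) \quad \forall X, Y$ vector fields
(ii) non-degeneracy: the musical map "flat"
$$\flat : \Gamma(TM) \to \Gamma(T^*M), \qquad X \mapsto \flat(X)$$
where $\flat(X)(Y) := g(X, Y)$, so that $\flat(X) \in \Gamma(T^*M)$ (thought bubble: $\flat(X) = g(X, \cdot)$),
is a $C^\infty$-isomorphism; in other words, it is invertible.
Remark: one writes $(\flat(X))_a$ or $X_a$, with $(\flat(X))_a := g_{am} X^m$.
Thought bubble: $\flat^{-1} = \sharp$, with $\flat^{-1}(\omega)^a := g^{am} \omega_m$, i.e. $\flat^{-1}(\omega)^a := (g^{-1})^{am} \omega_m$. (All of this is not needed.)
Definition 49. The "(2, 0)-tensor field $g^{-1}$" with respect to a metric $g$ is the symmetric map
$$g^{-1} : \Gamma(T^*M) \times \Gamma(T^*M) \to C^\infty(M), \qquad (\omega, \sigma) \mapsto \omega(\flat^{-1}(\sigma)) \qquad (\flat^{-1}(\sigma) \in \Gamma(TM))$$
In a chart: $g_{ab} = g_{ba}$ and $(g^{-1})^{am} g_{mb} = \delta^a_b$.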

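Example: Consider $(S^2, \mathcal{O}, \mathcal{A})$ and the chart $(U, x)$ with $\varphi \in (0, 2\pi)$, $\theta \in (0, \pi)$. Define the metric
$$g_{ij}(x^{-1}(\theta, \varphi)) = \begin{bmatrix} R^2 & 0 \\ 0 & R^2 \sin^2\theta \end{bmatrix}_{ij}, \qquad R \in \mathbb{R}^+$$
the metric of the round sphere of radius $R$.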

10.2

Signature

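Linear algebra: a (1, 1)-tensor has eigenvalues, $A^a{}_m v^m = \lambda v^a$, and can be brought to the form $\operatorname{diag}(\lambda_1, \ldots, \lambda_n)$.
For a (0, 2)-tensor, "$g_{am} v^m = \lambda v^a$?" makes no sense, since the index positions on the two sides do not match. Instead, a (0, 2)-tensor has a signature $(p, q)$ (well-defined): it can be brought to the form
$$\operatorname{diag}(\underbrace{1, \ldots, 1}_{p}, \underbrace{-1, \ldots, -1}_{q}, 0, \ldots, 0)$$
If $p + q = \dim V = d$, there are $d + 1$ possible signatures; e.g. for $d = 3$: $(+ + +)$, $(+ + -)$, $(+ - -)$, $(- - -)$.
Definition 50. A metric is called Riemannian if its signature is $(+ + \cdots +)$, and Lorentzian if it is $(+ - \cdots -)$.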

10.3

Length of a curve

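Let $\gamma$ be a smooth curve. Then we know its velocity $v_{\gamma, \gamma(\lambda)}$ at each $\gamma(\lambda) \in M$.

Definition 51. On a Riemannian metric manifold $(M, \mathcal{O}, \mathcal{A}, g)$, the speed of a curve at $\gamma(\lambda)$ is the number
$$s(\lambda) = \left(\sqrt{g(v_\gamma, v_\gamma)}\right)_{\gamma(\lambda)} \qquad (10.1)$$
(I feel the need for speed, then I feel the need for a metric.)
Aside: $[v^a] = \frac{1}{T}$, $\ [g_{ab}] = L^2$, $\ \left[\sqrt{g_{ab} v^a v^b}\right] = \sqrt{\frac{L^2}{T^2}} = \frac{L}{T}$
Definition 52. Let $\gamma : (0, 1) \to M$ be a smooth curve. Then the length of $\gamma$, $L[\gamma] \in \mathbb{R}$, is the number
$$L[\gamma] := \int_0^1 d\lambda\, s(\lambda) = \int_0^1 d\lambda\, \sqrt{(g(v_\gamma, v_\gamma))_{\gamma(\lambda)}} \qquad (10.2)$$
F. Schuller: velocity is more fundamental than speed, speed is more fundamental than length.
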

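Example: Reconsider the round sphere of radius $R$. Consider its equator:
$$\theta(\lambda) := (x^1 \circ \gamma)(\lambda) = \frac{\pi}{2}, \qquad \varphi(\lambda) := (x^2 \circ \gamma)(\lambda) = 2\pi\lambda^3$$
$$\Rightarrow \theta'(\lambda) = 0, \qquad \varphi'(\lambda) = 6\pi\lambda^2$$
On the same chart, $g_{ij} = \begin{bmatrix} R^2 & 0 \\ 0 & R^2 \sin^2\theta \end{bmatrix}$. Do everything in this chart:
$$L[\gamma] = \int_0^1 d\lambda\, \sqrt{g_{ij}(x^{-1}(\theta(\lambda), \varphi(\lambda)))\, (x^i \circ \gamma)'(\lambda)\, (x^j \circ \gamma)'(\lambda)}$$
$$= \int_0^1 d\lambda\, \sqrt{R^2 \cdot 0 + R^2 \sin^2(\theta(\lambda))\, 36\pi^2\lambda^4}$$
$$= 6\pi R \int_0^1 d\lambda\, \lambda^2 = 6\pi R \left[\tfrac{1}{3}\lambda^3\right]_0^1 = 2\pi R$$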
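
The equator computation can also be checked numerically. The following is a minimal sketch, not part of the lecture: the helper name `curve_length`, the radius value and the step sizes are illustrative assumptions. It approximates the length functional with the midpoint rule and central-difference velocities, using the chart order $x^1 = \theta$, $x^2 = \varphi$ from the example.

```python
import math

R = 2.0  # radius of the round sphere (arbitrary illustrative value)

def round_metric(x):
    """Components g_ij of the round sphere in the chart (theta, phi)."""
    theta, _phi = x
    return [[R**2, 0.0], [0.0, R**2 * math.sin(theta)**2]]

def curve_length(metric, curve, n=20000, h=1e-6):
    """Approximate L[gamma] = int_0^1 sqrt(g_ij gamma'^i gamma'^j) d(lambda).

    Midpoint rule in lambda; chart velocities by central differences.
    """
    total = 0.0
    for k in range(n):
        lam = (k + 0.5) / n
        x = curve(lam)
        xp, xm = curve(lam + h), curve(lam - h)
        v = [(a - b) / (2 * h) for a, b in zip(xp, xm)]
        g = metric(x)
        speed_sq = sum(g[i][j] * v[i] * v[j]
                       for i in range(len(v)) for j in range(len(v)))
        total += math.sqrt(speed_sq) / n
    return total

def equator(lam):
    """The parametrization used in the example: theta = pi/2, phi = 2 pi lambda^3."""
    return (math.pi / 2, 2 * math.pi * lam**3)

def equator_linear(lam):
    """Same image curve, parametrized linearly in lambda."""
    return (math.pi / 2, 2 * math.pi * lam)

length = curve_length(round_metric, equator)
length_linear = curve_length(round_metric, equator_linear)
print(length, length_linear, 2 * math.pi * R)
```

Both parametrizations should agree with $2\pi R$ up to discretization error, which illustrates the reparametrization invariance of the length.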

Theorem 10.1. Let $\gamma : (0, 1) \to M$ be a smooth curve and $\sigma : (0, 1) \to (0, 1)$ be smooth, bijective and increasing (a "reparametrization"). Then
$$L[\gamma] = L[\gamma \circ \sigma]$$
Proof. In the Tutorials.

10.4

Geodesics

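Definition 53. A curve $\gamma : (0, 1) \to M$ is called a geodesic on a Riemannian manifold $(M, \mathcal{O}, \mathcal{A}, g)$ if it is a stationary curve with respect to the length functional $L$.
Thought bubble: In classical mechanics, deform the curve a little, $\epsilon$ times this deformation; to first order, the length agrees with $L[\gamma]$.
Theorem 10.2. $\gamma$ is a geodesic iff it satisfies the Euler-Lagrange equations for the Lagrangian
$$\mathcal{L} : TM \to \mathbb{R}, \qquad X \mapsto \sqrt{g(X, X)}$$
In a chart, the Euler-Lagrange equations take the form
$$\left(\frac{\partial \mathcal{L}}{\partial \dot{x}^m}\right)^{\cdot} - \frac{\partial \mathcal{L}}{\partial x^m} = 0$$
(F. Schuller: this is a chart dependent formulation.) Here:
$$\mathcal{L}(\gamma^i, \dot\gamma^i) = \sqrt{g_{ij}(\gamma(\lambda))\, \dot\gamma^i(\lambda)\, \dot\gamma^j(\lambda)}$$
Euler-Lagrange equations:
$$\frac{\partial \mathcal{L}}{\partial \dot\gamma^m} = \frac{1}{\sqrt{\cdots}}\, g_{mj}(\gamma(\lambda))\, \dot\gamma^j(\lambda)$$
$$\left(\frac{\partial \mathcal{L}}{\partial \dot\gamma^m}\right)^{\cdot} = \left(\frac{1}{\sqrt{\cdots}}\right)^{\cdot} g_{mj}(\gamma(\lambda))\, \dot\gamma^j(\lambda) + \frac{1}{\sqrt{\cdots}} \left( g_{mj}(\gamma(\lambda))\, \ddot\gamma^j(\lambda) + \dot\gamma^s (\partial_s g_{mj})\, \dot\gamma^j(\lambda) \right)$$
Thought bubble: reparametrize such that $g(\dot\gamma, \dot\gamma) = 1$ (it's a condition on my reparametrization). By this clever choice of reparametrization, $\left(\frac{1}{\sqrt{\cdots}}\right)^{\cdot} = 0$.
$$\frac{\partial \mathcal{L}}{\partial \gamma^m} = \frac{1}{2\sqrt{\cdots}}\, \partial_m g_{ij}(\gamma(\lambda))\, \dot\gamma^i(\lambda)\, \dot\gamma^j(\lambda)$$
Putting this together as Euler-Lagrange equations:
$$g_{mj} \ddot\gamma^j + \partial_s g_{mj}\, \dot\gamma^s \dot\gamma^j - \frac{1}{2} \partial_m g_{ij}\, \dot\gamma^i \dot\gamma^j = 0 \qquad \text{(multiply on both sides by } (g^{-1})^{qm})$$
$$\ddot\gamma^q + (g^{-1})^{qm} \left(\partial_i g_{mj} - \frac{1}{2} \partial_m g_{ij}\right) \dot\gamma^i \dot\gamma^j = 0$$
$$\ddot\gamma^q + \frac{1}{2} (g^{-1})^{qm} \left(\partial_i g_{mj} + \partial_j g_{mi} - \partial_m g_{ij}\right) \dot\gamma^i \dot\gamma^j = 0$$
This is the geodesic equation for $\gamma$ in a chart, with
$$\frac{1}{2} (g^{-1})^{qm} \left(\partial_i g_{mj} + \partial_j g_{mi} - \partial_m g_{ij}\right) =: \overset{\text{L.C.}}{\Gamma}{}^q{}_{ij}(\gamma(\lambda))$$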
2
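Thought bubble: the Euler-Lagrange equations in bundle coordinates $\xi_x$ on $TM$,
$$\left(\frac{\partial \mathcal{L}}{\partial \xi_x^{a + \dim M}}\right)^{\cdot}_{\sigma(x)} - \left(\frac{\partial \mathcal{L}}{\partial \xi_x^{a}}\right)_{\sigma(x)} = 0$$

Definition 54. The Christoffel symbols $\overset{\text{L.C.}}{\Gamma}$ are the connection coefficient functions of the so-called Levi-Civita connection $\overset{\text{L.C.}}{\nabla}$.

We usually make this choice of $\nabla$ if $g$ is given:
$$(M, \mathcal{O}, \mathcal{A}, g) \;\longrightarrow\; (M, \mathcal{O}, \mathcal{A}, g, \overset{\text{L.C.}}{\nabla})$$
Abstract way: $\nabla g = 0$ and $T = 0$ (torsion) $\;\Rightarrow\; \nabla = \overset{\text{L.C.}}{\nabla}$.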
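
To make the chart formula for the Levi-Civita connection coefficients concrete, here is a small numerical sketch. It is an illustration only: the function name `christoffel`, the finite-difference step and the restriction to 2-dimensional charts (for the explicit matrix inverse) are assumptions of this sketch. It evaluates $\Gamma^q{}_{ij} = \frac{1}{2}(g^{-1})^{qm}(\partial_i g_{mj} + \partial_j g_{mi} - \partial_m g_{ij})$ for the round sphere in the chart $(\theta, \varphi)$.

```python
import math

R = 1.0  # sphere radius (illustrative)

def round_metric(x):
    """g_ij of the round sphere in the chart (theta, phi)."""
    theta, _phi = x
    return [[R**2, 0.0], [0.0, R**2 * math.sin(theta)**2]]

def christoffel(metric, x, h=1e-5):
    """Levi-Civita coefficients Gamma[q][i][j] at chart point x (2d only).

    Partial derivatives of g are taken by central differences.
    """
    d = len(x)

    def dg(s):
        # dg(s)[m][j] approximates the partial derivative d_s g_mj
        xp, xm = list(x), list(x)
        xp[s] += h
        xm[s] -= h
        gp, gm = metric(xp), metric(xm)
        return [[(gp[i][j] - gm[i][j]) / (2 * h) for j in range(d)]
                for i in range(d)]

    dgs = [dg(s) for s in range(d)]
    g = metric(x)
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    ginv = [[g[1][1] / det, -g[0][1] / det],
            [-g[1][0] / det, g[0][0] / det]]
    return [[[0.5 * sum(ginv[q][m] * (dgs[i][m][j] + dgs[j][m][i] - dgs[m][i][j])
                        for m in range(d))
              for j in range(d)]
             for i in range(d)]
            for q in range(d)]

theta0 = 1.0
gamma = christoffel(round_metric, (theta0, 0.3))
print(gamma[0][1][1], -math.sin(theta0) * math.cos(theta0))  # Gamma^theta_{phi phi}
print(gamma[1][0][1], 1.0 / math.tan(theta0))                # Gamma^phi_{theta phi}
```

The two printed pairs compare the numerical values with the known closed forms $\Gamma^\theta{}_{\varphi\varphi} = -\sin\theta\cos\theta$ and $\Gamma^\varphi{}_{\theta\varphi} = \cot\theta$.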
Definition 55. (Thought bubble: with a metric, all of these refer to $\overset{\text{L.C.}}{\nabla}$.)
(a) The Riemann-Christoffel curvature is defined by
$$R_{abcd} := g_{am} R^m{}_{bcd} \qquad (10.3)$$
(b) Ricci:
$$R_{ab} = R^m{}_{amb} \qquad (10.4)$$
(c) (Ricci) scalar curvature:
$$R = g^{ab} R_{ab} \qquad (10.5)$$


Definition 56. The Einstein curvature of $(M, \mathcal{O}, \mathcal{A}, g)$ is defined as
$$G_{ab} := R_{ab} - \frac{1}{2} g_{ab} R$$
Convention: $g^{ab} := (g^{-1})^{ab}$
F. Schuller: these indices are not being pulled up, because what would you pull them up with?
(Student) Question: Does the Einstein curvature yield new information?
Answer:
$$g^{ab} G_{ab} = R_{ab}\, g^{ab} - \frac{1}{2} g_{ab}\, g^{ab} R = R - \frac{1}{2} \delta^a_a R = R - \frac{1}{2} \dim M \cdot R = \left(1 - \frac{d}{2}\right) R \qquad (10.6)$$

11

Symmetry

This lecture is about symmetry, but along the way we will pick up a number of elementary techniques in differential geometry that we will need in Einstein's theory. We shall motivate these techniques by appealing to the feeling that the round sphere $(S^2, \mathcal{O}, \mathcal{A}, g^{\text{round}})$ has rotational symmetry, while the potato $(S^2, \mathcal{O}, \mathcal{A}, g^{\text{potato}})$ does not.
So far we have considered symmetry by fixing an inner product first, and then classifying the linear maps $A$ acting on vectors $X$ and $Y$ for which the inner product of $AX$ and $AY$ equals the inner product of $X$ and $Y$.
Here we talk about an altogether different idea. Firstly, since the distinction between the two is entirely contained in $g$, we are talking about the rotational symmetry of $g^{\text{round}}$. Secondly, while an inner product lives on one tangent space, there are many different tangent spaces with different inner products. $g$ describes the distribution of these inner products over the sphere, and it is that distribution which is, in some sense, rotationally invariant or not.
Therefore, the question is: How to describe the symmetries of a metric? This is important because nobody has solved Einstein's equations without additional assumptions, such as a symmetry of the solution.

11.1

Push-forward map

Definition 57. Let $M$ and $N$ be smooth manifolds with tangent bundles $TM$ and $TN$ respectively. Let $\phi : M \to N$ be a smooth map. Then, the push-forward map of $\phi$ is the map
$$\phi_* : TM \to TN, \qquad X \mapsto \phi_*(X)$$
$$\text{where } \phi_*(X) f := X(f \circ \phi) \qquad (\forall f \in C^\infty(N)) \qquad (11.1)$$

[Diagram: $TM \xrightarrow{\phi_*} TN$ lying over $M \xrightarrow{\phi} N \xrightarrow{f} \mathbb{R}$.]

Figure 11.1: Push-forward map: $\phi_*$ takes a vector $X \in T_pM$ in the tangent space at the point $p \in M$ to the vector $\phi_*(X) \in T_qN$ in the tangent space at the point $\phi(p) = q \in N$, such that the action of $\phi_*(X)$ on any smooth function $f \in C^\infty(N)$ results in the same value as the action of $X$ on the function $(f \circ \phi)$.

Note: If we take an entire fibre at the point $p \in M$, applying $\phi_*$ to it remains within the fibre at the point $\phi(p) \in N$. That is,
$$\phi_*(T_pM) \subseteq T_{\phi(p)}N$$
Mnemonic: vectors are pushed forward across tangent bundles in a manner dictated by the underlying map.
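Components of push-forward w.r.t. charts $(U, x) \in \mathcal{A}_M$ and $(V, y) \in \mathcal{A}_N$: Let $p \in U$ and $\phi(p) \in V$. Since $\left(\frac{\partial}{\partial x^i}\right)_p$ is a vector, we have $\phi_*\left(\left(\frac{\partial}{\partial x^i}\right)_p\right)$ as a vector in $N$. Then we can select a component of this vector by using $dy^a$ as follows:
$$dy^a\left(\phi_*\left(\left(\frac{\partial}{\partial x^i}\right)_p\right)\right) = \phi_*\left(\left(\frac{\partial}{\partial x^i}\right)_p\right) y^a = \left(\frac{\partial}{\partial x^i}\right)_p (y^a \circ \phi) = \left(\frac{\partial}{\partial x^i}\right)_p (y \circ \phi)^a =: \left(\frac{\partial \hat\phi^a}{\partial x^i}\right)_p =: \phi_*{}^a{}_i \qquad (11.2)$$
where $\hat\phi := y \circ \phi \circ x^{-1} : x(U) \subseteq \mathbb{R}^{\dim M} \to y(V) \subseteq \mathbb{R}^{\dim N}$ is the chart representative of $\phi$, with $U \subseteq M$ and $V \subseteq N$.

Figure 11.2: Components of push-forward map w.r.t. charts $(U, x) \in \mathcal{A}_M$ and $(V, y) \in \mathcal{A}_N$.
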

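Theorem 11.1. If $\gamma : \mathbb{R} \to M$ is a curve in $M$, then $\phi \circ \gamma : \mathbb{R} \to N$ is a curve in $N$. Then, $\phi_*$ pushes the tangent to the curve $\gamma$ (velocity) to the tangent to the curve $(\phi \circ \gamma)$, i.e.,
$$\phi_*(v_{\gamma, p}) = v_{(\phi \circ \gamma), \phi(p)} \qquad (11.3)$$
Proof. Let $p = \gamma(\lambda_0)$. Then $\forall f \in C^\infty(N)$,
$$\phi_*(v_{\gamma, p}) f = v_{\gamma, p}(f \circ \phi) \qquad \text{(by Eq. 11.1)}$$
$$= ((f \circ \phi) \circ \gamma)'(\lambda_0) \qquad \text{(by Eq. 5.1)}$$
$$= (f \circ (\phi \circ \gamma))'(\lambda_0) \qquad \text{(by associativity of composition)}$$
$$= v_{(\phi \circ \gamma), (\phi \circ \gamma)(\lambda_0)} f \qquad \text{(by Eq. 5.1)}$$
$$= v_{(\phi \circ \gamma), \phi(p)} f \qquad (\gamma(\lambda_0) = p)$$
$$\Rightarrow \phi_*(v_{\gamma, p}) = v_{(\phi \circ \gamma), \phi(p)}$$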

11.2

Pull-back map

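Definition 58. Let $M$ and $N$ be smooth manifolds with cotangent bundles $T^*M$ and $T^*N$ respectively. Let $\phi : M \to N$ be a smooth map. Then, the pull-back map of $\phi$ is the map
$$\phi^* : T^*N \to T^*M, \qquad \omega \mapsto \phi^*(\omega)$$
$$\text{where } \phi^*(\omega)(X) := \omega(\phi_*(X)) \qquad (\forall X \in T_pM) \qquad (11.4)$$
Components of pull-back w.r.t. charts $(U, x) \in \mathcal{A}_M$ and $(V, y) \in \mathcal{A}_N$: Let $p \in U$ and $\phi(p) \in V$. Since $(dy^a)_{\phi(p)}$ is a covector, we have $\phi^*((dy^a)_{\phi(p)})$ as a covector in $M$. Then we can select a component of this covector by using $\left(\frac{\partial}{\partial x^i}\right)_p$ as follows:
$$\phi^{*a}{}_i := \phi^*\left((dy^a)_{\phi(p)}\right)\left(\left(\frac{\partial}{\partial x^i}\right)_p\right)$$
$$= (dy^a)_{\phi(p)}\left(\phi_*\left(\left(\frac{\partial}{\partial x^i}\right)_p\right)\right) \qquad \text{(by Eq. 11.4)}$$
$$= \left(\frac{\partial (y^a \circ \phi)}{\partial x^i}\right)_p = \phi_*{}^a{}_i \qquad \text{(by Eq. 11.2)}$$
Thus, the components of the push-forward and pull-back maps are exactly the same:
$$(\phi_*(X))^a = \phi_*{}^a{}_i X^i, \qquad (\phi^*(\omega))_i = \phi^{*a}{}_i\, \omega_a$$
Remember, $a = 1, \ldots, \dim N$ and $i = 1, \ldots, \dim M$.
Claim: $\phi^*(df) = d(f \circ \phi)$.
Mnemonic: covectors are pulled back across cotangent bundles in a manner dictated by the underlying map.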
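Important application:
Definition 59. Let $M$ and $N$ be smooth manifolds. Let $\phi : M \hookrightarrow N$ be an injective map. If we know a metric $g$ on $N$, then the induced metric $g_M$ on $M$ is defined using the push-forward map as follows:
$$g_M(X, Y) := g(\phi_*(X), \phi_*(Y)) \qquad (\forall X, Y \in T_pM) \qquad (11.5)$$
In terms of components, with $\hat\phi := y \circ \phi \circ x^{-1}$ the chart representative of $\phi$,
$$\left((g_M)_{ij}\right)(p) = (g_{ab})(\phi(p)) \left(\frac{\partial \hat\phi^a}{\partial x^i}\right)(p) \left(\frac{\partial \hat\phi^b}{\partial x^j}\right)(p) \qquad (11.6)$$
Example: Let $N = (\mathbb{R}^3, \mathcal{O}_{\text{std}}, \mathcal{A})$ and $M = (S^2, \mathcal{O}, \mathcal{A})$; then we can have several injective maps $\phi : S^2 \hookrightarrow \mathbb{R}^3$. For example, $S^2$ could live in $\mathbb{R}^3$ either as a potato or as a round sphere. However, suppose $\mathbb{R}^3$ is equipped with the Euclidean metric $g_E$ ... (TODO complete this example)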
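
The component formula for the induced metric can be tried out numerically. The sketch below is an illustration under stated assumptions, not the author's completion of the example: it assumes the standard round embedding of $S^2$ into $\mathbb{R}^3$ in the chart $(\theta, \varphi)$, the Euclidean metric $\delta_{ab}$ on $\mathbb{R}^3$, and hypothetical helper names `embed`, `pushforward_components` and `induced_metric`.

```python
import math

R = 1.5  # radius of the embedded sphere (illustrative)

def embed(x):
    """Assumed round embedding S^2 -> R^3, written in the chart (theta, phi)."""
    theta, phi = x
    return (R * math.sin(theta) * math.cos(phi),
            R * math.sin(theta) * math.sin(phi),
            R * math.cos(theta))

def pushforward_components(f, x, h=1e-6):
    """J[a][i] approximates d f^a / d x^i (central differences)."""
    m = len(x)
    cols = []
    for i in range(m):
        xp, xm = list(x), list(x)
        xp[i] += h
        xm[i] -= h
        fp, fm = f(xp), f(xm)
        cols.append([(a - b) / (2 * h) for a, b in zip(fp, fm)])
    n = len(cols[0])
    return [[cols[i][a] for i in range(m)] for a in range(n)]

def induced_metric(f, x):
    """(g_M)_ij = delta_ab J^a_i J^b_j for the Euclidean metric on the target."""
    J = pushforward_components(f, x)
    m = len(x)
    return [[sum(J[a][i] * J[a][j] for a in range(len(J))) for j in range(m)]
            for i in range(m)]

theta0, phi0 = 0.7, 1.2
g_induced = induced_metric(embed, (theta0, phi0))
print(g_induced)
print([[R**2, 0.0], [0.0, R**2 * math.sin(theta0)**2]])
```

Under these assumptions the induced metric should reproduce the round-sphere components $\operatorname{diag}(R^2, R^2\sin^2\theta)$; a "potato" embedding would only require swapping out `embed`.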

11.3

Flow of a complete vector field

Definition 60. Let $X$ be a vector field on a smooth manifold $(M, \mathcal{O}, \mathcal{A})$. A curve $\gamma : I \subseteq \mathbb{R} \to M$ is called an integral curve of $X$ if
$$v_{\gamma, \gamma(\lambda)} = X_{\gamma(\lambda)}$$
Definition 61. A vector field $X$ is complete if all integral curves have $I = \mathbb{R}$ (i.e. the domain is all of $\mathbb{R}$).
Theorem 11.2. A compactly supported smooth vector field is complete.
Definition 62. The flow of a complete vector field $X$ on a manifold $M$ is a 1-parameter family
$$h^X : \mathbb{R} \times M \to M, \qquad (\lambda, p) \mapsto \gamma_p(\lambda)$$
where $\gamma_p : \mathbb{R} \to M$ is the integral curve of $X$ with $\gamma_p(0) = p$.
Then for fixed $\lambda \in \mathbb{R}$, $h^X_\lambda : M \to M$ is smooth.
Picture: If $S$ is a set of points in $M$, then $h^X_\lambda(S)$ can be seen as the new position of these points under the flow $h^X$ after the passage of $\lambda$ units of parameter. In general, $h^X_\lambda(S) \neq S$ (if $X \neq 0$).

11.4

Lie subalgebras of the Lie algebra $(\Gamma(TM), [\cdot, \cdot])$ of vector fields

We know that $\Gamma(TM) = \{$set of all vector fields$\}$, which can be seen as a $C^\infty(M)$-module, or as an $\mathbb{R}$-vector space.
$(\Gamma(TM), [\cdot, \cdot])$ with $[X, Y]$ defined by its action on a function $f$ by $[X, Y]f := X(Yf) - Y(Xf)$ is a Lie algebra, since $X, Y \in \Gamma(TM) \Rightarrow [X, Y] \in \Gamma(TM)$ and the following properties are satisfied:
(i) Anticommutativity: $[X, Y] = -[Y, X]$
(ii) Linearity: $[\lambda X + Z, Y] = \lambda [X, Y] + [Z, Y]$ where $\lambda \in \mathbb{R}$
(iii) Jacobi identity: $[X, [Y, Z]] + [Z, [X, Y]] + [Y, [Z, X]] = 0$
Let $X_1, \ldots, X_s$ be $s$ (many) vector fields on $M$, such that
$$\forall i, j \in \{1, \ldots, s\}: \qquad [X_i, X_j] = \underbrace{C^k{}_{ij} X_k}_{\text{linear combination of } X_k\text{'s}} \qquad \text{where } C^k{}_{ij} \in \mathbb{R}$$
The $C^k{}_{ij}$ are called structure constants.

Let $\operatorname{span}_{\mathbb{R}}\{X_1, \ldots, X_s\} := \{$all linear combinations of $X_k\}$. Then $(\operatorname{span}_{\mathbb{R}}\{X_1, \ldots, X_s\}, [\cdot, \cdot])$ is a Lie subalgebra of $(\Gamma(TM), [\cdot, \cdot])$.
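Example: On $S^2$, assume that the vector fields $X_1, X_2, X_3$ satisfy $[X_1, X_2] = X_3$, $[X_2, X_3] = X_1$ and $[X_3, X_1] = X_2$. Then $(\operatorname{span}_{\mathbb{R}}\{X_1, X_2, X_3\}, [\cdot, \cdot])$ ($\cong \mathfrak{so}(3)$) is a Lie subalgebra. An instance of vector fields satisfying these conditions is (with $X_i, \theta, \varphi$ all taken at a point $p$, and $x^1 = \theta$, $x^2 = \varphi$)
$$X_1 = \sin\varphi\, \frac{\partial}{\partial\theta} + \cot\theta \cos\varphi\, \frac{\partial}{\partial\varphi}$$
$$X_2 = \cos\varphi\, \frac{\partial}{\partial\theta} - \cot\theta \sin\varphi\, \frac{\partial}{\partial\varphi}$$
$$X_3 = \frac{\partial}{\partial\varphi}$$
Note that the above is defined on a merely smooth manifold without any additional structure like a metric.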
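
The bracket relations of such an example can be checked numerically with the chart formula $[X, Y]^i = X^m \partial_m Y^i - Y^m \partial_m X^i$. The following sketch is illustrative: the sign convention for $X_1, X_2, X_3$ written in the code is one choice for which $[X_1, X_2] = X_3$ (and cyclic) holds in the chart $(\theta, \varphi)$, and the helper name `bracket` and the step size are assumptions.

```python
import math

def X1(p):
    """X1 = sin(phi) d/dtheta + cot(theta) cos(phi) d/dphi (components in the chart)."""
    th, ph = p
    return (math.sin(ph), math.cos(ph) / math.tan(th))

def X2(p):
    """X2 = cos(phi) d/dtheta - cot(theta) sin(phi) d/dphi."""
    th, ph = p
    return (math.cos(ph), -math.sin(ph) / math.tan(th))

def X3(p):
    """X3 = d/dphi."""
    return (0.0, 1.0)

def bracket(X, Y, p, h=1e-5):
    """Components [X, Y]^i = X^m d_m Y^i - Y^m d_m X^i at the chart point p."""
    d = len(p)

    def partial(V, m):
        # derivative of all components of V along chart direction m
        pp, pm = list(p), list(p)
        pp[m] += h
        pm[m] -= h
        vp, vm = V(pp), V(pm)
        return [(a - b) / (2 * h) for a, b in zip(vp, vm)]

    dX = [partial(X, m) for m in range(d)]
    dY = [partial(Y, m) for m in range(d)]
    Xp, Yp = X(p), Y(p)
    return tuple(sum(Xp[m] * dY[m][i] - Yp[m] * dX[m][i] for m in range(d))
                 for i in range(d))

p0 = (1.0, 0.4)
b12 = bracket(X1, X2, p0)
b23 = bracket(X2, X3, p0)
b31 = bracket(X3, X1, p0)
print(b12, X3(p0))
print(b23, X1(p0))
print(b31, X2(p0))
```

Each printed pair should agree up to finite-difference error. No metric enters anywhere, in line with the remark that the Lie algebra structure lives on the bare smooth manifold.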

11.5

Symmetry

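Definition 63. A finite-dimensional Lie subalgebra $(L, [\cdot, \cdot])$ is said to be a symmetry of a metric tensor field $g$ if
$$g\left((h^X_\lambda)_*(A), (h^X_\lambda)_*(B)\right) = g(A, B) \qquad \forall X \text{ (complete vector field)} \in L, \quad \forall \lambda \in \mathbb{R}, \quad \forall A, B \in T_pM$$
In another formulation (using pullback), $(h^X_\lambda)^* g = g$. The pullback of $g$ along $\phi : M \to M$ is itself defined as follows:
$$(\phi^* g)(A, B) := g(\phi_*(A), \phi_*(B))$$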

11.6

Lie derivative

It can be shown that the following expression is precisely the Lie derivative of $g$ w.r.t. a vector field $X$:
$$\mathcal{L}_X g := \lim_{\lambda \to 0} \frac{(h^X_\lambda)^* g - g}{\lambda} \qquad (11.7)$$
Clearly, $L$ is a symmetry of $g$ iff $\mathcal{L}_X g = 0$ for all $X \in L$.

Definition 64. The Lie derivative $\mathcal{L}$ on a smooth manifold $(M, \mathcal{O}, \mathcal{A})$ sends a pair of a vector field $X$ and a $(p, q)$-tensor field $T$ to a $(p, q)$-tensor field such that
(i) $\mathcal{L}_X f = Xf \qquad \forall f \in C^\infty(M)$
(ii) $\mathcal{L}_X Y = [X, Y]$ where $X, Y$ are vector fields.
This condition sucks in information about the vector field $X$. It is not $C^\infty$-linear in the lower index. If it were, the derivative would be independent of the values of $X$ at points near the point where the derivative is evaluated. This is an important difference between the covariant derivative and the Lie derivative.
(iii) $\mathcal{L}_X (T + S) = \mathcal{L}_X T + \mathcal{L}_X S$ where $T, S$ are $(p, q)$-tensors
(iv) Leibniz rule:
$$\mathcal{L}_X \left(T(\omega_1, \ldots, \omega_p, Y_1, \ldots, Y_q)\right) = (\mathcal{L}_X T)(\omega_1, \ldots, \omega_p, Y_1, \ldots, Y_q)$$
$$+\, T(\mathcal{L}_X \omega_1, \ldots, \omega_p, Y_1, \ldots, Y_q) + \cdots + T(\omega_1, \ldots, \mathcal{L}_X \omega_p, Y_1, \ldots, Y_q)$$
$$+\, T(\omega_1, \ldots, \omega_p, \mathcal{L}_X Y_1, \ldots, Y_q) + \cdots + T(\omega_1, \ldots, \omega_p, Y_1, \ldots, \mathcal{L}_X Y_q)$$
where $T$ is a $(p, q)$-tensor. Note that for a $(p, q)$-tensor $T$ and an $(r, s)$-tensor $S$, since
$$(T \otimes S)(\omega_{(1)}, \ldots, \omega_{(p+r)}, Y_{(1)}, \ldots, Y_{(q+s)}) = T(\omega_{(1)}, \ldots, \omega_{(p)}, Y_{(1)}, \ldots, Y_{(q)}) \cdot S(\omega_{(p+1)}, \ldots, \omega_{(p+r)}, Y_{(q+1)}, \ldots, Y_{(q+s)}),$$
the Leibniz rule implies $\mathcal{L}_X (T \otimes S) = (\mathcal{L}_X T) \otimes S + T \otimes (\mathcal{L}_X S)$.
(v) $\mathcal{L}_{X+Y} T = \mathcal{L}_X T + \mathcal{L}_Y T$
Observe that, in a chart $(U, x)$,
$$(\mathcal{L}_X Y)^i = X^m \frac{\partial}{\partial x^m}(Y^i) - \underbrace{\frac{\partial}{\partial x^s}(X^i)}_{\text{requires knowing } X \text{ around the point}} Y^s$$
However, for the covariant derivative,
$$(\nabla_X Y)^i = X^m \frac{\partial}{\partial x^m}(Y^i) + \Gamma^i{}_{sm} X^m Y^s$$
In general, for a (1, 1)-tensor $T$,
$$(\mathcal{L}_X T)^i{}_j = X^m \frac{\partial}{\partial x^m}(T^i{}_j) - \underbrace{\frac{\partial X^i}{\partial x^s} T^s{}_j}_{\text{'}-\text{' for the upper index}} + \underbrace{\frac{\partial X^s}{\partial x^j} T^i{}_s}_{\text{'}+\text{' for the lower index}}$$
Application: As above, it is easy to calculate the components of the Lie derivative of the metric $g$, $\mathcal{L}_X g$. Thus, by checking whether this derivative equals 0 or not, it can be determined whether a metric features a symmetry.
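
As an illustration of this application, the chart formula for a (0, 2)-tensor, $(\mathcal{L}_X g)_{ij} = X^m \partial_m g_{ij} + g_{mj}\, \partial_i X^m + g_{im}\, \partial_j X^m$ (a "+" term for each lower index), can be evaluated numerically. This is a sketch under assumptions: the unit round sphere in the chart $(\theta, \varphi)$, hypothetical helper names, and two sample fields (the rotation generator $\partial_\varphi$ and a second rotation generator) compared against $\partial_\theta$, which is not a symmetry.

```python
import math

def round_metric(x):
    """Unit round sphere in the chart (theta, phi)."""
    theta, _phi = x
    return [[1.0, 0.0], [0.0, math.sin(theta)**2]]

def d_phi(p):
    return (0.0, 1.0)

def d_theta(p):
    return (1.0, 0.0)

def rotation(p):
    """sin(phi) d/dtheta + cot(theta) cos(phi) d/dphi (a rotation generator)."""
    th, ph = p
    return (math.sin(ph), math.cos(ph) / math.tan(th))

def lie_derivative_metric(metric, X, p, h=1e-5):
    """(L_X g)_ij = X^m d_m g_ij + g_mj d_i X^m + g_im d_j X^m at chart point p."""
    d = len(p)

    def shifted(m, s):
        q = list(p)
        q[m] += s
        return q

    # dg[m][i][j] approximates d_m g_ij; dX[i][m] approximates d_i X^m
    dg = [[[(metric(shifted(m, h))[i][j] - metric(shifted(m, -h))[i][j]) / (2 * h)
            for j in range(d)] for i in range(d)] for m in range(d)]
    dX = [[(X(shifted(i, h))[m] - X(shifted(i, -h))[m]) / (2 * h)
           for m in range(d)] for i in range(d)]
    g, Xp = metric(p), X(p)
    return [[sum(Xp[m] * dg[m][i][j] + g[m][j] * dX[i][m] + g[i][m] * dX[j][m]
                 for m in range(d))
             for j in range(d)] for i in range(d)]

p0 = (0.9, 0.5)
L_phi = lie_derivative_metric(round_metric, d_phi, p0)
L_rot = lie_derivative_metric(round_metric, rotation, p0)
L_theta = lie_derivative_metric(round_metric, d_theta, p0)
print(L_phi)    # numerically zero: symmetry
print(L_rot)    # numerically zero: symmetry
print(L_theta)  # phi-phi component is 2 sin(theta) cos(theta): no symmetry
```

The first two results vanish up to discretization error, while the last has the non-zero component $2\sin\theta\cos\theta$, so $\partial_\theta$ does not generate a symmetry of the round metric.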


12

Integration

This lecture will be the completion of our "lift" of analysis on charts to the manifold level. We want to be able to integrate a function $f$ over a manifold $M$. This $\int_M f$ will be an important tool for writing down the action for Einstein's equations.
However, to define the integral we need a mild new structure on the smooth manifold $(M, \mathcal{O}, \mathcal{A})$. It requires
(i) a choice of a certain tensor field, the so-called volume form, and
(ii) a restriction on the atlas $\mathcal{A}$, which is called orientation.

12.1

Review of integration on Rd

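We review this because this is what, after all, happens on charts; and we want to use this knowledge to have a well-defined integration on manifolds.
a) If $F : \mathbb{R} \to \mathbb{R}$, we assume a notion of integration is known. We define an integral over an interval $(a, b)$ as follows:
$$\int_{(a,b)} F := \int_a^b dx\, F(x)$$
(which is understood in terms of, say, Riemann's integral).

b) If $F : \mathbb{R}^k \to \mathbb{R}$, then
(i) on a box-shaped domain, $\text{Box} = (a, b) \times (c, d) \times \cdots \times (u, v) \subseteq \mathbb{R}^k$, the integral on the box is a series of integrals which have to be evaluated one after the other as follows:
$$\int_{\text{Box}} F := \int_a^b dx^1 \int_c^d dx^2 \cdots \int_u^v dx^k\, F(x^1, x^2, \ldots, x^k)$$
(ii) for other domains, $G \subseteq \mathbb{R}^k$, we first introduce an indicator function $\mu_G : \mathbb{R}^k \to \mathbb{R}$ such that
$$\mu_G(x) = \begin{cases} 1, & x \in G \\ 0, & x \notin G \end{cases}$$
and then define
$$\int_G F := \int_{-\infty}^{\infty} dx^1 \int_{-\infty}^{\infty} dx^2 \cdots \int_{-\infty}^{\infty} dx^k\, \mu_G(x)\, F(x^1, x^2, \ldots, x^k)$$
While this may not be a practical definition, it tells us what we mean by an integral of a function from $\mathbb{R}^k$ to $\mathbb{R}$ over an arbitrary domain $G$.
Note: All of the above comes with the disclaimer "if the integral exists", since there could be many issues that do not allow the existence of the integral as defined above.
Change of variables, which may also be called integration by substitution.
Theorem 12.1. If $F : \mathbb{R}^k \supseteq G \to \mathbb{R}$ and $\phi : \operatorname{preim}_\phi(G) (\subseteq \mathbb{R}^k) \to G$, then
$$\int_G F(x) = \int_{\operatorname{preim}_\phi(G)} \underbrace{|\det(\partial_a \phi^b)(y)|}_{\text{Jacobian of } \phi}\, (F \circ \phi)(y)$$
[Diagram: $\mathbb{R}^k \supseteq \operatorname{preim}_\phi(G) \xrightarrow{\phi} G \subseteq \mathbb{R}^k \xrightarrow{F} \mathbb{R}$.]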
R
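Example: Consider the domain $G \subseteq \mathbb{R}^2$, which includes the entire $\mathbb{R}^2$ except the $x$-axis. Let
$$\phi : \mathbb{R}^+ \times \{(0, \pi) \cup (\pi, 2\pi)\} \to G, \qquad (r, \varphi) \mapsto (r \cos\varphi, r \sin\varphi)$$
Thus, $G$ is in Cartesian coordinates and $\operatorname{preim}_\phi(G)$ is in polar coordinates. Let us calculate the Jacobian.
$$(\partial_a x^b)(r, \varphi) = \begin{bmatrix} \cos\varphi & \sin\varphi \\ -r \sin\varphi & r \cos\varphi \end{bmatrix} \;\Rightarrow\; \det(\partial_a x^b)(r, \varphi) = r$$
$$\int_G \underbrace{dx^1\, dx^2}_{\text{volume element}} F(x^1, x^2) = \int_0^\infty \int_{(0,\pi) \cup (\pi, 2\pi)} \underbrace{dr\, d\varphi\, r}_{\text{volume element}}\, F(r \cos\varphi, r \sin\varphi)$$

12.2

Integration on one chart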

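Let $(M, \mathcal{O}, \mathcal{A})$ be a smooth manifold, $f : M \to \mathbb{R}$, and choose charts $(U, x), (U, y) \in \mathcal{A}$. The chart representatives of $f$ are
$$f_{(x)} := f \circ x^{-1} : x(U) \subseteq \mathbb{R}^k \to \mathbb{R}, \qquad f_{(y)} := f \circ y^{-1} : y(U) \subseteq \mathbb{R}^k \to \mathbb{R}$$
and they are related by the chart transition map $y \circ x^{-1}$.
Consider
$$\int_U f := \int_{x(U)} d^k\alpha\, f_{(x)}(\alpha)$$

12.3

Volume forms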

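Definition 65. On a smooth manifold $(M, \mathcal{O}, \mathcal{A})$, a $(0, \dim M)$-tensor field $\Omega$ is called a volume form if
(a) $\Omega$ vanishes nowhere (i.e. $\Omega \neq 0 \ \forall p \in M$)
(b) $\Omega$ is totally antisymmetric:
$$\Omega(\ldots, \underbrace{X}_{i\text{th}}, \ldots, \underbrace{Y}_{j\text{th}}, \ldots) = -\Omega(\ldots, \underbrace{Y}_{i\text{th}}, \ldots, \underbrace{X}_{j\text{th}}, \ldots)$$
In a chart: $\Omega_{i_1 \ldots i_d} = \Omega_{[i_1 \ldots i_d]}$
Example: Let $(M, \mathcal{O}, \mathcal{A}, g)$ be a metric manifold; construct a volume form $\Omega$ from $g$. In any chart $(U, x)$:
$$\Omega_{i_1 \ldots i_d} := \sqrt{\det(g_{ij}(x))}\, \epsilon_{i_1 \ldots i_d}$$
where the Levi-Civita symbol $\epsilon_{i_1 \ldots i_d}$ is defined by $\epsilon_{123 \ldots d} = +1$ and $\epsilon_{i_1 \ldots i_d} = \epsilon_{[i_1 \ldots i_d]}$.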
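Proof. (well-defined) Check: What happens under a change of charts?
$$\Omega(y)_{i_1 \ldots i_d} = \sqrt{\det(g(y)_{ij})}\, \epsilon_{i_1 \ldots i_d} = \sqrt{\det\left(g_{mn}(x) \frac{\partial x^m}{\partial y^i} \frac{\partial x^n}{\partial y^j}\right)}\, \frac{\partial y^{m_1}}{\partial x^{i_1}} \cdots \frac{\partial y^{m_d}}{\partial x^{i_d}}\, \epsilon_{[m_1 \ldots m_d]}$$
$$= \sqrt{|\det g_{ij}(x)|}\, \left|\det\left(\frac{\partial x}{\partial y}\right)\right| \det\left(\frac{\partial y}{\partial x}\right) \epsilon_{i_1 \ldots i_d} = \sqrt{\det g_{ij}(x)}\, \epsilon_{i_1 \ldots i_d}\, \operatorname{sgn}\left(\det\left(\frac{\partial x}{\partial y}\right)\right)$$

EY : 20150323
Consider the following:
$$\Omega(y)(Y_{(1)}, \ldots, Y_{(d)}) = \Omega(y)_{i_1 \ldots i_d}\, Y_{(1)}^{i_1} \cdots Y_{(d)}^{i_d} = \sqrt{\det(g_{ij}(y))}\, \epsilon_{i_1 \ldots i_d}\, Y_{(1)}^{i_1} \cdots Y_{(d)}^{i_d}$$
$$= \sqrt{\det\left(g_{mn}(x) \frac{\partial x^m}{\partial y^i} \frac{\partial x^n}{\partial y^j}\right)}\, \epsilon_{i_1 \ldots i_d}\, \frac{\partial y^{i_1}}{\partial x^{m_1}} \cdots \frac{\partial y^{i_d}}{\partial x^{m_d}}\, X^{m_1} \cdots X^{m_d}$$
$$= \sqrt{\det\left(g_{mn}(x) \frac{\partial x^m}{\partial y^i} \frac{\partial x^n}{\partial y^j}\right)}\, \det\left(\frac{\partial y}{\partial x}\right) \epsilon_{m_1 \ldots m_d}\, X^{m_1} \cdots X^{m_d}$$
$$= \sqrt{\det(g_{mn}(x))}\, \left|\det\left(\frac{\partial x}{\partial y}\right)\right| \det\left(\frac{\partial y}{\partial x}\right) \epsilon_{m_1 \ldots m_d}\, X^{m_1} \cdots X^{m_d}$$
$$= \sqrt{\det(g_{mn}(x))}\, \epsilon_{m_1 \ldots m_d}\, \operatorname{sgn}\left(\det\left(\frac{\partial x}{\partial y}\right)\right) X^{m_1} \cdots X^{m_d} = \operatorname{sgn}\left(\det\left(\frac{\partial x}{\partial y}\right)\right) \Omega_{m_1 \ldots m_d}(x)\, X^{m_1} \cdots X^{m_d}$$
If $\det\left(\frac{\partial y}{\partial x}\right) > 0$,
$$\Omega(y)(Y_{(1)}, \ldots, Y_{(d)}) = \Omega(x)(X_{(1)}, \ldots, X_{(d)})$$
This works also if the Levi-Civita symbol $\epsilon_{i_1 \ldots i_d}$ doesn't change at all under a change of charts (around 42:43 in https://youtu.be/2XpnbvPy-Zg).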

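Alright, let's require: restrict the smooth atlas $\mathcal{A}$ to a subatlas $\mathcal{A}^\uparrow \subseteq \mathcal{A}$ ($\mathcal{A}^\uparrow$ still an atlas) s.t. any two charts $(U, x), (V, y) \in \mathcal{A}^\uparrow$ have chart transition maps $y \circ x^{-1}$, $x \circ y^{-1}$ s.t.
$$\det\left(\frac{\partial y}{\partial x}\right) > 0$$
Such an $\mathcal{A}^\uparrow$ is called an oriented atlas:
$$(M, \mathcal{O}, \mathcal{A}, g) \;\Rightarrow\; (M, \mathcal{O}, \mathcal{A}^\uparrow, g)$$
Note: associated bundles.
Note also: $\det\left(\frac{\partial y^b}{\partial x^a}\right) = \det(\partial_a (y^b \circ x^{-1}))$. Here $\frac{\partial y^b}{\partial x^a}$ is an endomorphism on a vector space $V$, $\phi : V \to V$, so $\det \phi$ is independent of the choice of basis. In contrast, $g$ is a (0, 2)-tensor field, not an endomorphism, so $\sqrt{|\det(g_{ij}(y))|}$ is not independent of the choice of basis.

Definition 66. Let $\Omega$ be a volume form on $(M, \mathcal{O}, \mathcal{A}^\uparrow)$ and consider a chart $(U, x)$.

Definition 67. $\omega_{(x)} := \Omega_{i_1 \ldots i_d}\, \epsilon^{i_1 \ldots i_d}$, where in the same way $\epsilon^{12 \ldots d} = +1$ and $\epsilon^{i_1 \ldots i_d} = \epsilon^{[i_1 \ldots i_d]}$.
One can show
$$\omega_{(y)} = \det\left(\frac{\partial x}{\partial y}\right) \omega_{(x)} \qquad \text{(scalar density)}$$

12.4

Integration on one chart domain U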

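Definition 68.
$$\int_U f := \int_{y(U)} d^d\beta\; \omega_{(y)}(y^{-1}(\beta))\, f_{(y)}(\beta) \qquad (12.1)$$
Proof. Check that it is well-defined, i.e. how it changes under a change of charts:
$$\int_U f \overset{(U,y)}{=} \int_{y(U)} d^d\beta\; \omega_{(y)}(y^{-1}(\beta))\, f_{(y)}(\beta)$$
$$= \int_{x(U)} d^d\alpha \left|\det\left(\frac{\partial y}{\partial x}\right)\right| f_{(x)}(\alpha)\; \omega_{(x)}(x^{-1}(\alpha)) \det\left(\frac{\partial x}{\partial y}\right)$$
$$= \int_{x(U)} d^d\alpha\; \omega_{(x)}(x^{-1}(\alpha))\, f_{(x)}(\alpha) \overset{(U,x)}{=} \int_U f$$
On an oriented metric manifold $(M, \mathcal{O}, \mathcal{A}^\uparrow, g)$:
$$\int_U f := \int_{x(U)} d^d\alpha\; \sqrt{\det(g_{ij}(x))(x^{-1}(\alpha))}\; f_{(x)}(\alpha)$$

12.5

Integration on the entire manifold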
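
As a hedged illustration of integration with the metric volume element, consider the round sphere: the single chart $(\theta, \varphi) \in (0, \pi) \times (0, 2\pi)$ misses only a set of measure zero, so integrating $\sqrt{\det g}\, f = R^2 \sin\theta\, f$ over the chart image already gives the integral over the whole sphere. The helper name `integrate_on_chart`, the grid size and the test functions below are assumptions of this sketch.

```python
import math

R = 1.0  # sphere radius (illustrative)

def sqrt_det_g(theta, phi):
    """sqrt(det g_ij) for the round sphere: R^2 sin(theta) on (0, pi) x (0, 2 pi)."""
    return R**2 * math.sin(theta)

def integrate_on_chart(f, density, bounds, n=400):
    """Midpoint-rule approximation of int d^2(alpha) density(alpha) f(alpha)."""
    (a0, b0), (a1, b1) = bounds
    h0, h1 = (b0 - a0) / n, (b1 - a1) / n
    total = 0.0
    for i in range(n):
        u = a0 + (i + 0.5) * h0
        for j in range(n):
            v = a1 + (j + 0.5) * h1
            total += density(u, v) * f(u, v)
    return total * h0 * h1

chart_image = ((0.0, math.pi), (0.0, 2 * math.pi))

# f = 1 gives the area of the sphere
area = integrate_on_chart(lambda th, ph: 1.0, sqrt_det_g, chart_image)
# f = cos^2(theta) has the closed-form integral 4 pi R^2 / 3
second = integrate_on_chart(lambda th, ph: math.cos(th)**2, sqrt_det_g, chart_image)
print(area, 4 * math.pi * R**2)
print(second, 4 * math.pi * R**2 / 3)
```

Both values should match the closed forms $4\pi R^2$ and $\frac{4}{3}\pi R^2$ up to the discretization error of the midpoint rule.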

13

Lecture 13: Relativistic spacetime

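Recall, from Lecture 9, the definition of Newtonian spacetime
$$(M, \mathcal{O}, \mathcal{A}, \nabla, t): \qquad \nabla \text{ torsion free}, \quad t \in C^\infty(M), \quad dt \neq 0, \quad \nabla dt = 0 \ \text{(uniform time)}$$
and the definition of relativistic spacetime (before Lecture )
$$(M, \mathcal{O}, \mathcal{A}^\uparrow, \nabla, g, T): \qquad \nabla \text{ torsion-free}, \quad g \text{ Lorentzian metric } (+ - - -), \quad T \text{ time-orientation}$$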

13.1

Time orientation

Definition 69. Let $(M, \mathcal{O}, \mathcal{A}^\uparrow, g)$ be a Lorentzian manifold. Then a time-orientation is given by a vector field $T$ that
(i) does not vanish anywhere
(ii) $g(T, T) > 0$
Newtonian vs. relativistic
Newtonian: $X$ was called future-directed if $dt(X) > 0$. $\forall p \in M$, take a half plane (half space) of $T_pM$; also a stratified atlas, so as to make planes of constant $t$ straight.
Relativistic: half cone; $\forall p, q \in M$, a half-cone $\subseteq T_pM$.

Question
I see how the cone structure arises from the new metric. I don't understand, however, how $T$, the time orientation, comes in.

Answer
$(M, \mathcal{O}, \mathcal{A}, g)$ with $g$ of signature $(+ - - -)$: requiring $g(X, X) > 0$ selects the cones; $T$ chooses which cone.

This definition of spacetime has been made to enable the following physical postulates:
(P1) The worldline $\gamma$ of a massive particle satisfies
(i) $g_{\gamma(\lambda)}(v_{\gamma, \gamma(\lambda)}, v_{\gamma, \gamma(\lambda)}) > 0$
(ii) $g_{\gamma(\lambda)}(T, v_{\gamma, \gamma(\lambda)}) > 0$

(P2) Worldlines of massless particles satisfy
(i) $g_{\gamma(\lambda)}(v_{\gamma, \gamma(\lambda)}, v_{\gamma, \gamma(\lambda)}) = 0$
(ii) $g_{\gamma(\lambda)}(T, v_{\gamma, \gamma(\lambda)}) > 0$
(Picture: spacetime.)
Answer (to a question): $T$ is a smooth vector field; $T$ determines future vs. past. General relativity: we have such a time orientation; smoothness makes it less arbitrary than it seems. (F. Schuller)
Claim: 9/10 of a metric are determined by the cone.
Spacetime is determined by the distribution of cones, with only a one-tenth error.
13.2

Observers

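$(M, \mathcal{O}, \mathcal{A}^\uparrow, \nabla, g, T)$
Definition 70. An observer is a worldline $\gamma$ with
$$g(v_\gamma, v_\gamma) > 0, \qquad g(T, v_\gamma) > 0$$
together with a choice of basis
$$v_{\gamma, \gamma(\lambda)} \equiv e_0(\lambda),\ e_1(\lambda),\ e_2(\lambda),\ e_3(\lambda)$$
of each $T_{\gamma(\lambda)}M$ where the observer worldline passes, if
$$g(e_a(\lambda), e_b(\lambda)) = \eta_{ab} = \begin{bmatrix} 1 & & & \\ & -1 & & \\ & & -1 & \\ & & & -1 \end{bmatrix}_{ab}$$
Precise: observer = smooth curve in the frame bundle $LM$ over $M$.

13.2.1

Two physical postulates

(P3) A clock carried by a specific observer $(\gamma, e)$ will measure a time
$$\tau := \int_{\lambda_0}^{\lambda_1} d\lambda\, \sqrt{g_{\gamma(\lambda)}(v_{\gamma, \gamma(\lambda)}, v_{\gamma, \gamma(\lambda)})}$$
between the two events $\gamma(\lambda_0)$ ("start the clock") and $\gamma(\lambda_1)$ ("stop the clock").

Compare with Newtonian spacetime: $t(p) = 7$.
Thought bubble: proper time / eigentime $\tau$.
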

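Application/Example.
$$M = \mathbb{R}^4, \qquad \mathcal{O} = \mathcal{O}_{\text{st}}, \qquad \mathcal{A} \ni (\mathbb{R}^4, \mathrm{id}_{\mathbb{R}^4}), \qquad g : g_{(x)ij} = \eta_{ij}, \qquad T_{(x)}^i = (1, 0, 0, 0)^i$$
$$\Rightarrow \Gamma_{(x)}{}^i{}_{jk} = 0 \text{ everywhere} \;\Rightarrow\; (M, \mathcal{O}, \mathcal{A}^\uparrow, g, T, \nabla) \;\Rightarrow\; \text{Riem} = 0 \;\Rightarrow\; \text{spacetime is flat}$$
This situation is called special relativity.

Consider two observers:
$$\gamma : (0, 1) \to M, \qquad \gamma_{(x)}^i = (\lambda, 0, 0, 0)^i$$
$$\delta : (0, 1) \to M, \quad \alpha \in (0, 1): \qquad \delta_{(x)}^i = \begin{cases} (\lambda, \alpha\lambda, 0, 0)^i, & \lambda \le \frac{1}{2} \\ (\lambda, (1 - \lambda)\alpha, 0, 0)^i, & \lambda > \frac{1}{2} \end{cases}$$
Let's calculate:
$$\tau_\gamma := \int_0^1 d\lambda\, \sqrt{g_{(x)ij}\, \dot\gamma_{(x)}^i\, \dot\gamma_{(x)}^j} = \int_0^1 d\lambda\, 1 = 1$$
$$\tau_\delta := \int_0^{1/2} d\lambda\, \sqrt{1 - \alpha^2} + \int_{1/2}^1 d\lambda\, \sqrt{1^2 - (-\alpha)^2} = \int_0^1 d\lambda\, \sqrt{1 - \alpha^2} = \sqrt{1 - \alpha^2}$$
Note: piecewise integration.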
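
The two clock readings can be reproduced numerically from postulate (P3). This is a minimal sketch: it assumes the flat metric $\eta = \operatorname{diag}(1, -1, -1, -1)$ in the identity chart, a sample value $\alpha = 0.6$, and a hypothetical helper `proper_time` that uses the midpoint rule with central-difference velocities (the grid is chosen so that no sample straddles the kink at $\lambda = 1/2$).

```python
import math

ALPHA = 0.6  # sample value of alpha in (0, 1), chosen for illustration

def gamma(lam):
    """Observer at rest: (lambda, 0, 0, 0)."""
    return (lam, 0.0, 0.0, 0.0)

def delta(lam):
    """Observer moving out and back with coordinate speed ALPHA."""
    if lam <= 0.5:
        return (lam, ALPHA * lam, 0.0, 0.0)
    return (lam, ALPHA * (1.0 - lam), 0.0, 0.0)

def proper_time(worldline, n=20000, h=1e-7):
    """tau = int_0^1 sqrt(eta_ij x'^i x'^j) d(lambda), signature (+, -, -, -)."""
    eta = (1.0, -1.0, -1.0, -1.0)
    total = 0.0
    for k in range(n):
        lam = (k + 0.5) / n
        xp, xm = worldline(lam + h), worldline(lam - h)
        v = [(a - b) / (2 * h) for a, b in zip(xp, xm)]
        total += math.sqrt(sum(e * c * c for e, c in zip(eta, v))) / n
    return total

tau_gamma = proper_time(gamma)
tau_delta = proper_time(delta)
print(tau_gamma, 1.0)
print(tau_delta, math.sqrt(1.0 - ALPHA**2))
```

With $\alpha = 0.6$ the travelling clock reads $\sqrt{1 - 0.36} = 0.8$ while the resting clock reads 1, in line with the piecewise calculation.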


Taking the clock postulate (P3) seriously, one had better come up with a realistic clock design that supports the postulate. Idea: 2 little mirrors.
(P4) Postulate
Let $(\gamma, e)$ be an observer, and let $\delta$ be a massive particle worldline that is parametrized s.t. $g(v_\delta, v_\delta) = 1$ (for parametrization/normalization convenience).
Suppose the observer and the particle meet somewhere (in spacetime):
$$\delta(\tau_2) = p = \gamma(\tau_1)$$
This observer measures the 3-velocity (spatial velocity) of this particle as
$$v_\delta := \epsilon^\alpha(v_{\delta, \delta(\tau_2)})\, e_\alpha, \qquad \alpha = 1, 2, 3 \qquad (13.1)$$
where $\epsilon^0, \epsilon^1, \epsilon^2, \epsilon^3$ is the unique dual basis of $e_0, e_1, e_2, e_3$.

EY : 20150407
There might be a major correction to Eq. (13.1) from Tutorial 14: Relativistic spacetime, matter, and Gravitation; see the second exercise, Exercise 2, third question:
$$v := \frac{\epsilon^\alpha(v_\delta)}{\epsilon^0(v_\delta)}\, e_\alpha \qquad (13.2)$$

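Consequence: An observer $(\gamma, e)$ will always extract quantities measurable in his laboratory from objective spacetime quantities in this way.
Ex: $F$, the Faraday (0, 2)-tensor of electromagnetism:
$$F(e_a, e_b) = F_{ab} = \begin{bmatrix} 0 & E_1 & E_2 & E_3 \\ -E_1 & 0 & B_3 & -B_2 \\ -E_2 & -B_3 & 0 & B_1 \\ -E_3 & B_2 & -B_1 & 0 \end{bmatrix}$$
in the observer frame $e_a, e_b$, with
$$E_\alpha := F(e_0, e_\alpha), \qquad B^\gamma := \frac{1}{2} F(e_\alpha, e_\beta)\, \epsilon^{\alpha\beta\gamma}$$
where $\epsilon^{123} = +1$ is totally antisymmetric. (The factor $\frac{1}{2}$ compensates for the double counting in the antisymmetric sum, so that e.g. $B^3 = F_{12}$ matches the matrix.)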

13.3

Role of the Lorentz transformations

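Lorentz transformations emerge as follows:

Let $(\gamma, e)$ and $(\tilde\gamma, \tilde e)$ be observers with $\gamma(\tau_1) = \tilde\gamma(\tau_2)$ (for simplicity, $\gamma(0) = \tilde\gamma(0)$).
Now $e_0, \ldots, e_3$ at $\tau = 0$ and $\tilde e_0, \ldots, \tilde e_3$ at $\tau = 0$ are both bases for the same $T_{\gamma(0)}M$.
Thus:
$$\tilde e_a = \Lambda^b{}_a\, e_b, \qquad \Lambda \in GL(4)$$
Now:
$$\eta_{ab} = g(\tilde e_a, \tilde e_b) = g(\Lambda^m{}_a e_m, \Lambda^n{}_b e_n) = \Lambda^m{}_a \Lambda^n{}_b \underbrace{g(e_m, e_n)}_{\eta_{mn}}$$
i.e. $\Lambda \in O(1, 3)$.
Result: Lorentz transformations relate the frames of any two observers at the same point.
"$\tilde x^\mu = \Lambda^\mu{}_\nu x^\nu$" is utter nonsense.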
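
The defining condition $\eta_{ab} = \Lambda^m{}_a \Lambda^n{}_b\, \eta_{mn}$ is easy to test for a concrete frame change. The sketch below is illustrative only: it assumes a boost along the first spatial frame vector with velocity parameter `v` (in units where the speed of light is 1), and contrasts it with a plain rescaling of the frame, which lies in $GL(4)$ but not in $O(1, 3)$.

```python
import math

ETA = [[1, 0, 0, 0],
       [0, -1, 0, 0],
       [0, 0, -1, 0],
       [0, 0, 0, -1]]

def boost_x(v):
    """Matrix L[m][a] = Lambda^m_a of a boost along e_1 with |v| < 1."""
    g = 1.0 / math.sqrt(1.0 - v * v)
    return [[g, g * v, 0, 0],
            [g * v, g, 0, 0],
            [0, 0, 1, 0],
            [0, 0, 0, 1]]

def transformed_eta(L):
    """Components g(e~_a, e~_b) = Lambda^m_a Lambda^n_b eta_mn."""
    return [[sum(L[m][a] * L[n][b] * ETA[m][n]
                 for m in range(4) for n in range(4))
             for b in range(4)] for a in range(4)]

def is_lorentz(L, tol=1e-12):
    """True if the new frame is again orthonormal, i.e. Lambda is in O(1,3)."""
    E = transformed_eta(L)
    return all(abs(E[a][b] - ETA[a][b]) < tol
               for a in range(4) for b in range(4))

scaling = [[2.0 if m == a else 0.0 for a in range(4)] for m in range(4)]

print(is_lorentz(boost_x(0.6)))  # boost keeps the frame orthonormal
print(is_lorentz(scaling))       # rescaling the frame does not
```

Note that the matrix acts on the frame vectors at a single point, not on chart coordinates, which is the point of the closing remark above.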

Tutorial
I didn't see a tutorial video for this lecture, but I saw that Tutorial sheet number 14 had the relevant topics. Go there.


14

Lecture 14: Matter

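There are two types of matter:
point matter (e.g. a massive point particle), which is of more phenomenological importance;
field matter (e.g. the electromagnetic field), which is more fundamental from the GR point of view.
Both are classical matter types.

14.1

Point matter

Our postulates (P1) and (P2) already constrain the possible particle worldlines.
But what is their precise law of motion, possibly in the presence of forces?
(a) without external forces
$$S_{\text{massive}}[\gamma] := m \int d\lambda\, \sqrt{g_{\gamma(\lambda)}(v_{\gamma, \gamma(\lambda)}, v_{\gamma, \gamma(\lambda)})}$$
with $g_{\gamma(\lambda)}(T_{\gamma(\lambda)}, v_{\gamma, \gamma(\lambda)}) > 0$; the dynamical law is the Euler-Lagrange equation.
Similarly,
$$S_{\text{massless}}[\gamma, \mu] = \int d\lambda\, \mu\, g(v_{\gamma, \gamma(\lambda)}, v_{\gamma, \gamma(\lambda)})$$
$$\delta_\mu: \quad g(v_{\gamma, \gamma(\lambda)}, v_{\gamma, \gamma(\lambda)}) = 0 \qquad\qquad \delta_\gamma: \quad \text{e.o.m.}$$
The reason for describing equations of motion by actions is that composite systems have an action that is the sum of the actions of the parts of that system, possibly including "interaction terms".
Example:
$$S[\gamma] + S[\delta] + S_{\text{int}}[\gamma, \delta]$$
(b) presence of external forces, or rather presence of fields to which a particle couples
Example:
$$S[\gamma; A] = \int d\lambda \left( m \sqrt{g_{\gamma(\lambda)}(v_{\gamma, \gamma(\lambda)}, v_{\gamma, \gamma(\lambda)})} + q A(v_{\gamma, \gamma(\lambda)}) \right)$$
where $A$ is a covector field on $M$. $A$ is fixed (e.g. the electromagnetic potential).
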

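Consider the Euler-Lagrange equations with $L_{\text{int}} = q A_{(x)m}\, \dot\gamma_{(x)}^m$:
$$m (\nabla_{v_\gamma} v_\gamma)_a + \underbrace{\left(\frac{\partial L_{\text{int}}}{\partial \dot\gamma_{(x)}^a}\right)^{\cdot} - \frac{\partial L_{\text{int}}}{\partial \gamma_{(x)}^a}}_{(*)} = 0$$
The individual terms are
$$\frac{\partial L_{\text{int}}}{\partial \dot\gamma^a} = q A_{(x)a}, \qquad \left(\frac{\partial L_{\text{int}}}{\partial \dot\gamma^a}\right)^{\cdot} = q \frac{\partial}{\partial x^m}(A_{(x)a})\, \dot\gamma_{(x)}^m, \qquad \frac{\partial L_{\text{int}}}{\partial \gamma^a} = q \frac{\partial}{\partial x^a}(A_{(x)m})\, \dot\gamma^m$$
so that
$$(*) = q \left(\frac{\partial A_a}{\partial x^m} - \frac{\partial A_m}{\partial x^a}\right) \dot\gamma_{(x)}^m = -q\, F_{(x)am}\, \dot\gamma_{(x)}^m, \qquad F_{am} := \frac{\partial A_m}{\partial x^a} - \frac{\partial A_a}{\partial x^m} \quad (F \leftarrow \text{Faraday})$$
$$\Rightarrow\; m (\nabla_{v_\gamma} v_\gamma)^a = \underbrace{q F^a{}_m\, \dot\gamma^m}_{\text{Lorentz force on a charged particle in an electromagnetic field}}$$
for the action
$$S[\gamma] = \int \left( m \sqrt{g(v_\gamma, v_\gamma)} + q A(v_\gamma) \right) d\lambda$$

14.2

Field matter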

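Definition 71. Classical (non-quantum) field matter is any tensor field on spacetime whose equations of motion derive from an action.
Example:
$$S_{\text{Maxwell}}[A] = \frac{1}{4} \int_M d^4x\, \sqrt{-g}\, F_{ab} F_{cd}\, g^{ac} g^{bd}$$
where $A$ is a (0, 1)-tensor field (thought cloud: for simplicity, one chart covers all of $M$), the $-g$ under the root is for $g$ of signature $(+ - - -)$, and
$$F_{ab} := 2 \partial_{[a} A_{b]} = 2 (\nabla_{[a} A)_{b]}$$
Euler-Lagrange equations for fields:
$$0 = \frac{\partial \mathcal{L}}{\partial A_m} - \frac{\partial}{\partial x^s} \left(\frac{\partial \mathcal{L}}{\partial\, \partial_s A_m}\right) + \frac{\partial}{\partial x^s} \frac{\partial}{\partial x^t} \frac{\partial^2 \mathcal{L}}{\partial\, \partial_t \partial_s A_m} - \cdots$$
Example . . .
$$\left(\nabla_{\frac{\partial}{\partial x^m}} F\right)^{ma} = j^a \qquad \text{inhomogeneous Maxwell (thought bubble: } j = q v_\gamma)$$
$$\partial_{[a} F_{bc]} = 0 \qquad \text{homogeneous Maxwell}$$
Other example well-liked by textbooks:
$$S_{\text{Klein-Gordon}}[\phi] := \int d^4x\, \sqrt{-g} \left[ g^{ab} (\partial_a \phi)(\partial_b \phi) - m^2 \phi^2 \right]$$
where $\phi$ is a (0, 0)-tensor field.
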

14.3

Energy-momentum tensor of matter fields

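At some point, we want to write down an action for the metric tensor field itself.
But then, this action $S_{\text{grav}}[g]$ will be added to any $S_{\text{matter}}[A, \phi, \ldots]$ in order to describe the total system.
$$S_{\text{total}}[g, A] = S_{\text{grav}}[g] + S_{\text{Maxwell}}[A, g]$$
$$\delta A: \quad \Rightarrow \text{Maxwell's equations}$$
$$\delta g_{ab}: \quad \frac{1}{16\pi G} G^{ab} + \left(-\frac{1}{2} T^{ab}\right) = 0 \qquad (G \text{ Newton's constant})$$
$$\Rightarrow G^{ab} = 8\pi G_N T^{ab}$$
Definition 72. If $S_{\text{matter}}[\Phi, g]$ is a matter action, the so-called energy-momentum tensor is
$$T^{ab} := \frac{-2}{\sqrt{-g}} \left( \frac{\partial \mathcal{L}_{\text{matter}}}{\partial g_{ab}} - \partial_s \frac{\partial \mathcal{L}_{\text{matter}}}{\partial\, \partial_s g_{ab}} + \cdots \right)$$
The $-$ of $\frac{-2}{\sqrt{-g}}$ is the "Schrödinger minus" (EY : 20150408 F. Schuller's joke? but wise).
Choose all sign conventions s.t.
$$T(\epsilon^0, \epsilon^0) > 0$$
Example: For $S_{\text{Maxwell}}$:
$$T_{ab} = F_{am} F_{bn}\, g^{mn} - \frac{1}{4} F_{mn} F^{mn} g_{ab}, \qquad T_{ab} \equiv T_{\text{Maxwell}\, ab}$$
$$T(e_0, e_0) = E^2 + B^2, \qquad T(e_0, e_\alpha) = (E \times B)_\alpha$$
Fact: One often does not specify the fundamental action for some matter, but one is rather satisfied to assume certain properties / forms of $T_{ab}$.
Example, cosmology (homogeneous & isotropic): a perfect fluid of pressure $p$ and density $\rho$ is modelled by
$$T^{ab} = (\rho + p) u^a u^b - p\, g^{ab}$$
Radiative fluid: what is a fluid of photons? Since
$$T^{ab}_{\text{Maxwell}}\, g_{ab} = 0$$
observe:
$$T^{ab}_{\text{p.f.}}\, g_{ab} \overset{!}{=} 0 = (\rho + p) u^a u^b g_{ab} - p \underbrace{g^{ab} g_{ab}}_{4}$$
$$\Leftrightarrow \rho + p - 4p = 0 \;\Leftrightarrow\; \rho = 3p \;\Leftrightarrow\; p = \frac{1}{3}\rho$$
Reconvene at 3 pm? (EY : 20150409 I sent a Facebook (FB) message to the International Winter School on Gravity and Light: there was no missing video; it continues on Lecture 15 immediately.)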

Tutorial 14: Relativistic Spacetime, Matter and Gravitation


Exercise 2: Lorentz force law.
Question electromagnetic potential.


15

Einstein gravity

Recall that in Newtonian spacetime, we were able to reformulate the Poisson law $\Delta\phi = 4\pi G_N \rho$ in terms of the Newtonian spacetime curvature as
$$R_{00} = 4\pi G_N \rho$$
with $R_{00}$ taken with respect to $\nabla_{\text{Newton}}$, and $G_N$ = Newtonian gravitational constant.
This prompted Einstein to postulate that the relativistic field equations for the Lorentzian metric $g$ of (relativistic) spacetime should be
$$R_{ab} = 8\pi G_N T_{ab}$$
However, this equation suffers from a problem. We know from matter theory that on the RHS, $(\nabla_a T)^{ab} = 0$, since this has been formulated from an action. But on the LHS, $(\nabla_a R)^{ab} \neq 0$ generically. Einstein tried to argue this problem away. Nevertheless, the equations cannot be upheld.

15.1

Hilbert

Hilbert was a specialist for variational principles. To find the appropriate LHS of the gravitational field equations, Hilbert suggested to start from an action
$$S_{\text{Hilbert}}[g] = \int_M \sqrt{-g}\, R_{ab}\, g^{ab}$$
which, in a sense, is the simplest action one can formulate.

Aim: varying this w.r.t. the metric $g_{ab}$ will result in some tensor $G^{ab}$.

15.2

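Variation of $S_{\text{Hilbert}}$

$$0 \overset{!}{=} \delta_{g_{ab}} S_{\text{Hilbert}}[g] = \int_M \Big[ \underbrace{\delta\sqrt{-g}\; g^{ab} R_{ab}}_{1} + \underbrace{\sqrt{-g}\; \delta g^{ab}\, R_{ab}}_{2} + \underbrace{\sqrt{-g}\; g^{ab}\, \delta R_{ab}}_{3} \Big]$$
ad 1: $\delta\sqrt{-g} = \frac{-(\det g)\, g^{mn} \delta g_{mn}}{2\sqrt{-g}} = \frac{1}{2} \sqrt{-g}\, g^{mn} \delta g_{mn}$.
The above comes from $\delta \det(g) = \det(g)\, g^{mn} \delta g_{mn}$, e.g. from $\det(g) = \exp \operatorname{tr} \ln g$.

ad 2: $g^{ab} g_{bc} = \delta^a_c \;\Rightarrow\; (\delta g^{ab}) g_{bc} + g^{ab} (\delta g_{bc}) = 0 \;\Rightarrow\; \delta g^{ab} = -g^{am} g^{bn} \delta g_{mn}$

ad 3: in normal coordinates at a point,
$$\delta R_{ab} = \delta \partial_b \Gamma^m{}_{am} - \delta \partial_m \Gamma^m{}_{ab} + \delta(\Gamma\Gamma - \Gamma\Gamma) = \partial_b\, \delta\Gamma^m{}_{am} - \partial_m\, \delta\Gamma^m{}_{ab} = \nabla_b (\delta\Gamma)^m{}_{am} - \nabla_m (\delta\Gamma)^m{}_{ab}$$
"If you formulate the variation properly, you'll see the variation $\delta$ commutes with $\partial_b$."
$\Gamma_{(x)}{}^i{}_{jk} - \tilde\Gamma_{(x)}{}^i{}_{jk}$ are the components of a (1, 2)-tensor.
Let us use the notation $(\nabla_b A)^i{}_j =: A^i{}_{j;b}$. Then, using $\nabla g = 0$,
$$\sqrt{-g}\, g^{ab}\, \delta R_{ab} = \sqrt{-g}\, (g^{ab}\, \delta\Gamma^m{}_{am})_{;b} - \sqrt{-g}\, (g^{ab}\, \delta\Gamma^m{}_{ab})_{;m} = \sqrt{-g}\, A^b{}_{;b} - \sqrt{-g}\, B^m{}_{;m}$$

Question: Why is the difference of $\Gamma$ coefficients a tensor?

Answer:
$$\Gamma_{(y)}{}^i{}_{jk} = \frac{\partial y^i}{\partial x^m} \frac{\partial x^n}{\partial y^j} \frac{\partial x^q}{\partial y^k}\, \Gamma_{(x)}{}^m{}_{nq} + \frac{\partial y^i}{\partial x^m} \frac{\partial^2 x^m}{\partial y^j \partial y^k}$$
and the inhomogeneous second term cancels in a difference of two connections.

Collecting terms, one obtains
$$0 \overset{!}{=} \delta S_{\text{Hilbert}} = \int_M \Big[ \frac{1}{2} \sqrt{-g}\, g^{mn} \delta g_{mn}\, g^{ab} R_{ab} - \sqrt{-g}\, g^{am} g^{bn} \delta g_{mn}\, R_{ab} + \underbrace{(\sqrt{-g}\, A^a)_{,a}}_{\text{surface term}} - \underbrace{(\sqrt{-g}\, B^b)_{,b}}_{\text{surface term}} \Big]$$
$$= \int_M \sqrt{-g} \underbrace{\delta g_{mn}}_{\text{arbitrary variation}} \Big[ \frac{1}{2} g^{mn} R - R^{mn} \Big] \;\Rightarrow\; G^{mn} = R^{mn} - \frac{1}{2} g^{mn} R$$
Hence Hilbert, from this "mathematical" argument, concluded that one may take
$$R_{ab} - \frac{1}{2} g_{ab} R = 8\pi G_N T_{ab} \qquad \text{(Einstein equations)}$$
$$S_{\text{EH}}[g] = \int_M \sqrt{-g}\, R$$
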
15.3

Solution of the $\nabla_a T^{ab} = 0$ issue

One can show ($\to$ Tutorials) that the Einstein curvature
$$G_{ab} = R_{ab} - \frac{1}{2} g_{ab} R$$
satisfies the so-called contracted differential Bianchi identity $(\nabla_a G)^{ab} = 0$.

15.4

Variants of the field equations

(a) a simple rewriting:
$$R_{ab} - \frac{1}{2} g_{ab} R = 8\pi G_N T_{ab} = T_{ab} \qquad \left(G_N = \frac{1}{8\pi}\right)$$
Contracting $R_{ab} - \frac{1}{2} g_{ab} R = T_{ab}$ on both sides with $g^{ab}$:
$$R - 2R = T := T_{ab}\, g^{ab} \;\Rightarrow\; R = -T$$
$$\Rightarrow R_{ab} + \frac{1}{2} g_{ab} T = T_{ab} \;\Leftrightarrow\; R_{ab} = \left(T_{ab} - \frac{1}{2} T g_{ab}\right) =: \hat T_{ab} \;\Rightarrow\; R_{ab} = \hat T_{ab}$$
(b) $S_{\text{EH}}[g] := \int_M \sqrt{-g}\, (R + 2\Lambda)$ ($\Lambda$ is called the cosmological constant)

History:
1915: $\Lambda < 0$ (Einstein) in order to get a non-expanding universe
>1915: $\Lambda = 0$ (Hubble)
today: $\Lambda > 0$ to account for an accelerated expansion

$\Lambda \neq 0$ can be interpreted as a contribution $\frac{1}{2}\Lambda g$ to the energy-momentum of matter in spacetime. This energy, which does not interact with anything but contributes to the curvature, is called "dark energy".
Question: surface terms scalar?
Answer: for a careful treatment of the surface terms which we discarded, see, e.g., E. Poisson, "A Relativist's Toolkit", C.U.P. ("excellent book").
Question: What is a constant on a manifold?
Answer: $\int \sqrt{-g}\, \Lambda = \Lambda \int \sqrt{-g}\, 1$
[back to dark energy]
[Weinberg used QCD to calculate $\Lambda$, using the idea that $\Lambda$ could arise as the vacuum energy of the standard model fields. It turns out that
$$\Lambda_{\text{calculated}} = 10^{120} \times \Lambda_{\text{obs}}$$
which is called the "worst prediction of physics".]
Tutorials: check that
the Schwarzschild metric (1916),
the FRW metric,
the pp-wave metric,
the Reissner-Nordström metric
are solutions to Einstein's equations.
(Compare: in high school one checks that $x(t) = \cos(\omega t)$ solves $m\ddot{x} + m\omega^2 x = 0$.)
ET: [elementary tutorials]
study the motion of particles & observers in Schwarzschild S.T.
Satellite lectures:
Marcus C. Werner: Gravitational lensing
odd number of pictures, Morse theory (EY : 20150408 Morse Theory !!!)
Domenico Giulini: Canonical Formulations of GR
Hamiltonian form
Key to Quantum Gravity


16
16.1

L18: Canonical Formulation of GR-I


Dynamical and Hamiltonian formulation of General Relativity

Purpose:
1) formulate and solve initial-value problems
2) integrate Einstein's Equations by numerical codes
3) characterise degrees of freedom
4) characterise isolated systems, associated symmetry groups and conserved quantities like Energy/Mass, Momenta (linear and angular), Poincare charges
5) starting point for the "canonical quantisation" program
How do we achieve this goal? We will rewrite Einstein's Equations in the form of a constrained Hamiltonian system.
$$\underbrace{R_{\mu\nu} - \frac{1}{2} g_{\mu\nu} R}_{G_{\mu\nu}} + \underbrace{\Lambda}_{\text{cosmological constant}} g_{\mu\nu} = \underbrace{\kappa}_{\frac{8\pi G}{c^4}} T_{\mu\nu}$$
$\kappa = \frac{8\pi G}{c^4}$ is an important quantity, as it turns the energy density $T_{\mu\nu}$ into curvature.
Physical dimensions:
for curvature, $[G_{\mu\nu}] = \frac{1}{\text{m}^2}$;
for energy density, $[T_{\mu\nu}] = \frac{\text{Joule}}{\text{m}^3}$;
hence
$$[\kappa] = \frac{1/\text{m}^2}{\text{J}/\text{m}^3} = \frac{\text{m}}{\text{J}}$$
Convention (for this lecture):


Greek indices run from 0 to 3 and Latin indices from 1 to 3
signature is $(-, +, +, +)$, as it makes space positive definite in the 3 + 1 decomposition
$T_{00}$ is the positive energy density.


17 Lecture 22: Black Holes

Only depends on Lectures 1-15, so does lecture on Wednesday


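The Schwarzschild solution is also a vacuum solution (from tutorial EY : oh no, must do tutorial)
Study the Schwarzschild metric as a vacuum solution of the Einstein equation, with $m = G_N M$, where $M$ is the mass:

$$g = \left(1 - \frac{2m}{r}\right) dt \otimes dt - \frac{1}{1 - \frac{2m}{r}}\, dr \otimes dr - r^2 \left(d\theta \otimes d\theta + \sin^2\theta \, d\varphi \otimes d\varphi\right)$$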

in the so-called Schwarzschild coordinates

$t \in (-\infty, \infty)$, $r \in (0, \infty)$, $\theta \in (0, \pi)$, $\varphi \in (0, 2\pi)$
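A quick numerical look at the $dt \otimes dt$ and $dr \otimes dr$ components of the Schwarzschild metric, in the $(+,-,-,-)$ form $g_{tt} = 1 - 2m/r$, $g_{rr} = -1/(1 - 2m/r)$, already hints at where trouble arises. This is only a sketch: the function name and the choice $m = 1$ are illustrative assumptions.

```python
def schwarzschild_components(r, m=1.0):
    """Return (g_tt, g_rr) in Schwarzschild coordinates, signature (+,-,-,-).

    Only defined for r != 2m, where the factor 1 - 2m/r is non-zero.
    """
    f = 1.0 - 2.0 * m / r
    return f, -1.0 / f

# Sample radii far away, close to r = 2m on both sides, and well inside.
for r in (100.0, 3.0, 2.1, 2.001, 1.999, 1.0):
    g_tt, g_rr = schwarzschild_components(r)
    print(f"r = {r:8.3f}   g_tt = {g_tt:+.4e}   g_rr = {g_rr:+.4e}")
```

Approaching $r = 2m$, $g_{tt}$ tends to zero while $g_{rr}$ grows without bound, and both components change sign across $r = 2m$. Their product stays equal to $-1$ throughout.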

Staring at this metric for a while, two questions naturally pose themselves:
(i) What exactly happens at $r = 2m$?

$t \in (-\infty, \infty)$, $r \in (0, 2m) \cup (2m, \infty)$, $\theta \in (0, \pi)$, $\varphi \in (0, 2\pi)$

(ii) Is there anything (in the real world) beyond $t \to -\infty$ or $t \to +\infty$?
idea: Map of Linz, blown up
Insight into these two issues is afforded by stopping to stare.
Look at the geodesics of $g$, instead.

17.1 Radial null geodesics

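null: $g(v_\gamma, v_\gamma) = 0$
Consider a null geodesic $\gamma$ in Schd:

$$S[\gamma] = \int d\lambda \left[ \left(1 - \frac{2m}{r}\right) \dot{t}^2 - \left(1 - \frac{2m}{r}\right)^{-1} \dot{r}^2 - r^2 \left(\dot{\theta}^2 + \sin^2\theta \, \dot{\varphi}^2\right) \right]$$

with $[\ldots] = 0$ (null condition),
and one has, in particular, the $t$-eqn. of motion:

$$\left( \left(1 - \frac{2m}{r}\right) \dot{t} \right)^{\cdot} = 0 \implies \left(1 - \frac{2m}{r}\right) \dot{t} = k = \text{const.}$$

Consider radial null geodesics:

$$\theta = \text{const.}, \qquad \varphi = \text{const.}$$

From the null condition and the $t$-eqn. of motion:

$$\implies \dot{r}^2 = k^2 \iff \dot{r} = \pm k \implies r(\lambda) = \pm k \cdot \lambda$$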
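Hence, we may consider

$$\tilde{t}(r) := t(\pm k \lambda)$$

Case A ($+$):

$$\frac{d\tilde{t}}{dr} = \frac{\dot{t}}{\dot{r}} = \frac{k}{\left(1 - \frac{2m}{r}\right) k} = \frac{r}{r - 2m} \implies \tilde{t}_+(r) = r + 2m \ln|r - 2m|$$

(outgoing null geodesics)
Case B ($-$):

$$\tilde{t}_-(r) = -r - 2m \ln|r - 2m|$$

(ingoing null geodesics)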
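The two branches can be checked numerically against the slope $d\tilde{t}/dr = \pm r/(r - 2m)$ found in Case A and Case B. The sketch below uses a central difference; the function names, the step size and the choice $m = 1$ are illustrative assumptions, and the integration constants are set to zero.

```python
import math

def t_out(r, m=1.0):
    """Outgoing branch: t_+(r) = r + 2m ln|r - 2m| (integration constant 0)."""
    return r + 2.0 * m * math.log(abs(r - 2.0 * m))

def t_in(r, m=1.0):
    """Ingoing branch: t_-(r) = -r - 2m ln|r - 2m| (integration constant 0)."""
    return -t_out(r, m)

def slope(func, r, h=1e-6):
    """Central-difference estimate of d(func)/dr at r."""
    return (func(r + h) - func(r - h)) / (2.0 * h)

def expected_slope(r, m=1.0):
    """dt/dr = r / (r - 2m) for the outgoing branch; the ingoing one has the opposite sign."""
    return r / (r - 2.0 * m)

# Sample radii on both sides of r = 2m (here 2m = 2).
for r in (0.5, 1.5, 2.5, 10.0):
    print(r, slope(t_out, r), expected_slope(r), slope(t_in, r), -expected_slope(r))
```

Both slopes diverge as $r \to 2m$, which is why neither family of curves crosses $r = 2m$ at finite $t$ in these coordinates.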
Picture

17.2 Eddington-Finkelstein

Brilliantly simple idea:

change (on the domain of the Schwarzschild coordinates) to different coordinates, s.t.
in those new coordinates,
ingoing null geodesics appear as straight lines, of slope $-1$.
This is achieved by

$$\bar{t}(t, r, \theta, \varphi) := t + 2m \ln|r - 2m|$$

Recall: ingoing null geodesic has

$$\tilde{t}(r) = -(r + 2m \ln|r - 2m|) \qquad \text{(Schd coords)}$$
$$\iff \bar{t} - 2m \ln|r - 2m| = -r - 2m \ln|r - 2m| + \text{const.}$$
$$\iff \bar{t} = -r + \text{const.}$$

(Picture)
outgoing null geodesics:

$$\bar{t} = r + 4m \ln|r - 2m| + \text{const.}$$
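The straightening of the ingoing null geodesics can be verified numerically: take points on an ingoing curve in Schwarzschild coordinates, map them with $\bar{t} = t + 2m \ln|r - 2m|$, and check that $\bar{t} + r$ stays constant. This is a sketch with illustrative names; `M_PARAM = 1.0` is an arbitrary value for the parameter $m$ and the integration constants are set to zero.

```python
import math

M_PARAM = 1.0  # hypothetical numerical value for the parameter m of the notes

def log_term(r, m=M_PARAM):
    """The recurring term 2m ln|r - 2m|."""
    return 2.0 * m * math.log(abs(r - 2.0 * m))

def t_bar(t, r, m=M_PARAM):
    """Eddington-Finkelstein time coordinate: t_bar = t + 2m ln|r - 2m|."""
    return t + log_term(r, m)

def t_ingoing(r, m=M_PARAM):
    """Ingoing radial null geodesic in Schwarzschild coordinates."""
    return -(r + log_term(r, m))

def t_outgoing(r, m=M_PARAM):
    """Outgoing radial null geodesic in Schwarzschild coordinates."""
    return r + log_term(r, m)

radii = [0.5, 1.0, 1.9, 2.1, 3.0, 10.0]

# Ingoing: t_bar + r should be the same constant (here 0) at every radius.
ingoing_sums = [t_bar(t_ingoing(r), r) + r for r in radii]

# Outgoing: t_bar should equal r + 4m ln|r - 2m|, so these gaps should vanish.
outgoing_gaps = [t_bar(t_outgoing(r), r) - (r + 2.0 * log_term(r)) for r in radii]

print(ingoing_sums)
print(outgoing_gaps)
```

The sample radii lie on both sides of $r = 2m$. The ingoing lines $\bar{t} = -r + \text{const.}$ show no special behaviour there, while the outgoing curves still carry the logarithm.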
Consider the new chart (V, g) while (U, x) was the Schd chart.


$$\underbrace{U}_{\text{Schd}} \cup \{\text{horizon}\} = V$$

chart image of the horizon


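Now calculate the Schd metric $g$ w.r.t. Eddington-Finkelstein coords.

$$\bar{t}(t, r, \theta, \varphi) = t + 2m \ln|r - 2m|$$
$$\bar{r}(t, r, \theta, \varphi) = r$$
$$\bar{\theta}(t, r, \theta, \varphi) = \theta$$
$$\bar{\varphi}(t, r, \theta, \varphi) = \varphi$$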
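For orientation, the calculation can also be sketched by hand. The derivation below assumes the $(+,-,-,-)$ form of the Schwarzschild metric, $g = (1 - 2m/r)\, dt \otimes dt - (1 - 2m/r)^{-1} dr \otimes dr - r^2 (d\theta \otimes d\theta + \sin^2\theta\, d\varphi \otimes d\varphi)$, and writes $r$, $\theta$, $\varphi$ for the unchanged coordinates $\bar{r}$, $\bar{\theta}$, $\bar{\varphi}$. It is a worked sketch, not a transcription of the lecture.

```latex
% Step 1: express dt through the new coordinate differentials.
% Step 2: expand the dt (x) dt term.
% Step 3: combine the dr (x) dr contributions.
% Step 4: collect everything.
\begin{align*}
dt &= d\bar{t} - \frac{2m}{r - 2m}\, dr, \\
\left(1 - \frac{2m}{r}\right) dt \otimes dt
  &= \left(1 - \frac{2m}{r}\right) d\bar{t} \otimes d\bar{t}
   - \frac{2m}{r} \left( d\bar{t} \otimes dr + dr \otimes d\bar{t} \right)
   + \frac{4m^2}{r (r - 2m)}\, dr \otimes dr, \\
\frac{4m^2}{r (r - 2m)} - \frac{r}{r - 2m}
  &= -\left(1 + \frac{2m}{r}\right), \\
g &= \left(1 - \frac{2m}{r}\right) d\bar{t} \otimes d\bar{t}
   - \frac{2m}{r} \left( d\bar{t} \otimes dr + dr \otimes d\bar{t} \right)
   - \left(1 + \frac{2m}{r}\right) dr \otimes dr
   - r^2 \left( d\theta \otimes d\theta + \sin^2\theta \, d\varphi \otimes d\varphi \right).
\end{align*}
```

All components are finite at $r = 2m$, and the determinant of the $(\bar{t}, r)$ block is $-(1 - 2m/r)(1 + 2m/r) - 4m^2/r^2 = -1$, so the metric is non-degenerate there. The Sage output quoted in these notes agrees with this up to an overall sign, since that session evidently works in the $(-,+,+,+)$ convention.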
EY : 20150422 I would suggest that after seeing this, one would calculate the metric with your favorite CAS. I
like the Sage Manifolds package for Sage Math.
Schwarzschild_BH.sage on github
Schwarzschild_BH.sage on Patreon
Schwarzschild_BH.sage on Google Drive
sage: load("Schwarzschild_BH.sage")
4-dimensional manifold M
expr = expr.simplify_radical()
Levi-Civita connection nabla_g associated with the Lorentzian metric g on the 4-dimensional manifold M
Launched png viewer for Graphics object consisting of 4 graphics primitives

Then calculate the Schwarzschild metric $g$ in Eddington-Finkelstein coordinates. Keep in mind to use
the set of coordinates with $\bar{t}$, not $\tilde{t}$:
sage: gI.display()
gI = (2*m - r)/r dt*dt - r/(2*m - r) dr*dr + r^2 dth*dth + r^2*sin(th)^2 dph*dph
sage: gI.display(X_EF_I_null.frame())
gI = (2*m - r)/r dtbar*dtbar + 2*m/r dtbar*dr + 2*m/r dr*dtbar + (2*m + r)/r dr*dr + r^2 dth*dth + r^2*sin(th)^2 dph*dph


References
[1] Sepideh Bakhoda (http://math.stackexchange.com/users/36591/sepideh-bakhoda), Are there simple examples
of Riemannian manifolds with zero curvature and nonzero torsion, Mathematics Stack Exchange. URL:
http://math.stackexchange.com/q/465672 (version: 2013-08-12).
