Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
M
MACMILLAN
© J. P. Elliott and P. G . Dawber 1979
Publi shed by
Scientific a nd Medi cal Di vision
MACMILLAN PUBLISHERS LTD
London and Basingstoke
Companies and repre sent atives throughout the world
Elliott, J P
Symmetry in physics.
Vol. 2: Further applications
I. Symmetry (Physics)
I. Title II. Dawber, P G
530 QC793.3.S9
Prefa ce xvii
Introduction 1
1.1 The place of symmetry in physics 1
1.2 Examples of the consequences of symmetry 3
1.2.1 One particle in one dimension (classical) 3
1.2.2 One particle in two dimensions (classical) 3
1.2.3 Two particles connected by springs (classical) 4
1.2.4 One particle in three dimensions using quantum
mechanics-spherical symmetry and degeneracies 5
1.2.5 One particle in one dimension using quantum
mechanics -parity and selection rules 6
1.2.6 The search for symmetry---elementary particle
p~~a 7
1.3 Summary 8
2 Groups and Group Properties 9
2.1 Definition of a group 9
2.2 Examples of groups 11
2.3 Isomorphism 16
2.4 Subgroups 17
2.5 The direct product group 17
v
vi Contents
Preface xvii
In studying the structure of molecules one should strictly consider the motion
both of the nuclei of the constituent atoms and of their electrons. However,
because of the large mass of the nuclei, relative to that of an electron, one may
approximately separate the motions of these two kinds of particle . This
'Born - Oppenheimer' approximation is discussed briefly in section 20.2. For
our present purpose we shall be considering the motion of the electrons
relative to the nuclei which are assumed to be fixed. Our interest will centre on
those molecules for which the arrangement of fixed nuclei is invariant under
some point group of symmetry operations. This is precisely the same
symmetry group which we used in discussing molecular vibrations in chapter
6. The physical difference between that problem and the present one is that in
studying molecular vibrations the electrons were ignored, except in so far as
they were responsible for the potential in which the nuclei moved. From the
experimental viewpoint the molecular vibrations involve energies of the order
of (10)-2 eV, whereas the excitation of the electronic motion, which we are
about to discuss, requires energies of the order of 1 eV.
Even with the assumption of fixed nuclei the calculation of the electronic
wave function is a formidable task -see Eyring et al. (t944), Murrell et al.
281
282 Electron States in Molecules 13 .1
= L D~!m(Gal4>nlm·(r-r"l (13.1)
m'
(13.3)
a
where the sum runs over the group elements G, and any of the possible values
m and r, may be used.
If a representation T(') occurs more than once , or if we wish to consider the
mixing of different atomic orbital labels nl, then we must form a trial wave
function
1/1(2) (r) = I C ~~ ) i I/I~~) i (r) (13.4)
i.n,l
and determine the coefficients C~~ ) i by the variational method, In this sum, the
index i distinguishes molecular orbitals when T(2) occurs more than once in the
reduction ofT. To perform the variational calculation, or even to estimate the
relative energies of the single molecular orbitals (13,3) for different IX, one must
assume some form for the single electron Hamiltonian H(r). Since the
functions I/I~~) i (r) are not orthogonal the matrix formulation of this variational
problem follows the method of section 5,7. The wave functions and energies
for the molecular orbitals with symmetry IX are then given by the roots of the
matrix equation (5.23); in our present notation,
(H(') - ES('))c(') =0 (13.5)
where C (2) is the column vector of coefficients c ~~ )i in equation (13.4). The
284 Electron States in Molecules 13 ,2
ft/J~i)i·
H(a)
Since t/J'.:)i (r) is a known linear combination of atomic orbitals these matrix
elements are all given in terms of matrix elements between atomic orbitals. It is
usually assumed that the matrix elements of H(r) between atomic orbitals
vanish unless the two sites t and t' are identical or nearest neighbours in the
molecule. Further assumptions often made are that the diagonal elements are
just the single-particle energies in the free atom since the atomic orbitals are
well located about a single nucleus and H(r) must approximate the sum of the
atomic potentials. Care must be taken in these energy calculations since the
atomic orbitals on different sites are not orthogonal. In the simplest
approximation, one might assume them to be orthogonal, i.e. S(') is the unit
matrix.
13.2 Examples
H 3 molecule
The H 3 molecule is illustrated in figure 13.1 with the three atoms being
labelled by a, band c. The symmetry group is D 3 h , but since we shall only be
using atomic s-states in our calculation and these have reflection symmetry in
the plane we may ignore the horizontal reflection and use just the group D 3 •
a b
H H
Figure 13.1
13.2 Electron States in Molecules 285
The characters are given in table 13.1 together with the character for the three
Is-orbitals calculated as outlined above.
Table /3 ./
DJ E 2C J 3C z
AI 1 1 1
Az 1 1 -1
E 2 -1 0
X(ls) 3 0 = Al +E
A. = f4>~s(a)H(r)4>ls(b)dr
The main contribution to this integral is the energy due to the interaction
between the electron in the state 4>15 (a) and the nucleus at b and vice versa.
Since this is due to an attractive force we would expect A. to be negative and
hence the totally symmetric state ",(A,) to be lowest in energy. Hence, if we put
the three electrons into the states (13.6) in a way consistent with the Pauli
exclusion principle, the ground state of the molecule will have two electrons
with opposite spins in the lowest orbital ",(Ad and one electron in one of the
degenerate states ",(E). The total wave function for this state is the Slater
determinant constructed from these three functions and since each term in the
expansion of the determinant is a product with symmetry A I X A I X E = E it
is an E-orbital doublet, the degeneracy arising from the two possible states for
the third electron .
286 Electron States in Molecules 13.2
H 20 molecule
The symmetry group for the water molecule shown in figure 13.2 is C 2 v ' The
hydrogen atoms have one electron each and the oxygen atom has eight
electrons in a ground sta te configuration Is 2 2s 2 2 p 4 . The atomic orbitals that
y
.---+x
H H
a b
Figure 13.2
we will use are the Is-orbitals on the hydrogen nuclei which we denote <PI s(a)
and cPI s(b), and the 2p-orbitals on the oxygen nucleus , denoted cP2px, cP2p y,
cP2pz ' The atomic Is- and 2s-orbitals in oxygen are more tightly bound so we
neglect them in this simple calculation. The binding of the 2p-orbital in oxygen
and the 1s-orbital in hydrogen are approximately the same so that we must
take account of them both. The character table for the group C 2v is shown in
table 13.2, together with the characters for the spaces generated by the atomic
orbitals. ((Tv denotes a reflection in the plane of the molecule and (T~ a reflection
in the perpendicular plane containing the y-axis.)
Table 13.2
C Zv E Cz (Jv (J v
1 1 1
1 -1 -1
-1 1 -1
-1 -1 1
0 2 0 =A I -tBI
-1 1 1 = Al -tBI -tB2
Since all the irreducible representations are one-dimensional it is easy to
write down the symmetrised orbitals as
cP\~') oc {cPI s(a) + <PIs(b)}
cP\~" oc {cPIs(a) - cPI s(b)}
-I,(A,) _ -I, -1,(8,) _ -I, -1,(8 ,) - -I,
'+'2p - '+'2py ' '+'2p - '+'2px ' '+'2p - '+'2pz
13 .3 Electron States in Molecules 287
Since there are two A 1 -type orbitals we must allow them to mix. If we denote
the free-atom energies of 4>1$ and 4>lP by 6 1 s and 6 1 p then the effect of the
mixing will be to produce a pair of A I -type energies as shown in figure 13.3. In
the same way, the two BI-type orbitals will mix as shown. One may argue
qualitatively that, since the bond angle is greater than 90°, the overlap between
A,. (B .J and A,. (B d is greater than that between A,. (A d and ",(Ad . Hence one expects
'l'l s 'l'2p 'l' l s 2p
that the two B I -type levels will be split more widely than the two A 1 -type
levels, as shown in the figure. In the simplest approximation, the Brlevel has
the free-atom energy 6 1 p ' This ordering ofsingle-particle levels, as shown in the
figure, is consistent with experiment.
8,
Figure 13.3
To construct the wave function for the water molecule we must now place
electrons into these mo lecular orbitals, bu ilding up a Slater determinant. There
are six electrons to be placed, two of them coming from the Is-orbits on the
hydrogen atoms and four from the 2p-orbits in oxygen (recall that the oxygen
atom has a configuration of I S l 2s 1 2 p4 ). To describe the molecular ground
state we therefore place two electrons (spin up and down) in each of the lowest
three levels B A I and B 1 of figure 13.3.
"
Optical absorption can take place when the molecule undergoes a transition
from the ground state to an excited state. The simplest excited states can be
considered to be single-particle excitations from one occupied molecular
orbital to a higher unoccupied one. The energies involved are of the same order
of magnitude as in the atomic case so that absorption can be in the visible,
infrared or ultraviolet regions of the electromagnetic spectrum. The selection
rules governing the transitions can be obtained in the same way as for the levels
of an atom in a crystal field, see subsection 9.9.2. In the case of H 1 0, because
there are two electrons in each orbital, the ground state has A 1 symmetry (like
288 Electron States in Molecules
Bibliography
Problem
13.1 The methane molecule consists of four hydrogen atoms arranged at the vertices
of a regular tetrahedron and a carbon atom at the centre (symmetry group Td ) .
Show that the four Is atomic orbitals on the hydrogen atoms lead to a singlet and
a lriplet molecular orbital. Note that the 2s and 2p atomic orbitals on the carbon
atom also lead to a singlet and a triplet of the same symmetries and hence that
these will mix to give bonding states.
14
Symmetry in
Crystalline Solids
with ni integers. The a, are called primitive translation vectors and must not be
coplanar. The parallelepiped defined by ai' a2 and a 3 is called the primitive
cell. The simplest crystals have just a single atom at each lattice point but more
generally an array ofatoms with specific relative orientation is associated with
each lattice point. Figure 14.1 shows a two-dimensional crystal with two atoms
per unit cell.
Figure 14.1
As defined by equation (14.1) the space lattice, and consequently the solid, is
infinite in extent. In reality a crystal is finite but contains a very large number of
atoms ~ 102 0 per em:'. Here we will consider only the infinite crystal from
which one expects to calculate what are known as the bulk properties of the
material, such as conductivity and specific heat. These will be relevant in
samples with a small surface to volume ratio, i.e. the number of atoms
influenced by the surfaces of the crystal must be small compared with the total
number.
The infinite crystal will generally have symmetries which involve rotation
but for the moment we consider only the pure translational symmetry
=
described by the group of translation operators P(n) P(n, n 2 n3) which
displace an arbitrary point r to r + n,
(14.2)
where the primitive translation vectors a, are fixed and the ni take integer
values. We denote the group by Y (a I ' a 2 , a 3 ) and observe that it has an
infinite number of elements.
The multiplication rule for translations is easily derived from the defining
14.2 Symmetry in Crystalline Solids 291
This shows immediately that the group is Abelian and, in particular, the
translations along the directions of the three primitive vectors a, commute with
each other so that the group can be considered as a direct product of three
subgroups of translations along these directions. We now consider just one of
these subgroups :1(a 1 , 0,0). This is an Abelian group and therefore its
irreducible representations are one-dimensional. From equation (14.3) it
follows that the matrix elements T(n 1 00) corresponding to the group
elements P(n 1 00) satisfy
(14.4)
Although we use the expression 'matrix element' all these matrices are one-
dimensional so that there is only one matrix element. We use the notation
T(n) == T(n 1 n2 n 3) rather than the cumbersome T(P(n)) for a representation
of the translation group. Given that the matrix element for the identity
P(O 0 0) is unity, i.e. T(O 0 0) = I and defining the constant o: by a = T( I 00) it
follows immediately from equation (14.4) that T(n 1 00) = rx n , . For T to be a
unitary representation rx must be a complex number of modulus unity and it is
conventional to write a = exp( - 2rrik d, where k 1 is real and serves as a label
for the representation
(14.5)
The exponent here has the form of a scalar product but, since the primitive
translation vectors a, are neither orthogonal nor normalised, it is necessary to
292 Symmetry in Crystalline Solids 14.2
before we can write it as a scalar product. The reciprocal basis vectors have the
property
(14.8)
which may also be taken as a definition oftheb j • We now define a vector k with
components k, in the basis b,
(14.9)
so that
(14.10)
where n denotes the group element and k labels the irreducible representation.
In one dimension we noted that the representations k I + P with p any integer
were all the same as the representation k 1 and in three dimensions this will be
true for representations obtained by adding integers to any of the k j • This
result is more conveniently expressed if we introduce a 'reciprocal lattice' of
points given by the vectors
(14.12)
where the m j are integers. We now see that the representations k and k +Km
are equivalent,
• ozL • .t... • • •
• •oLe
°z
Space lattice • 0, • 0, 0,
• • • • • • • • •
• • •
• •
• • •
.
~ ~ • • •~•
,•
Reciprocal
lattice and
first Brillouin • ',I b, b, • •
zone (shaded)
• •
• Ib,l:•21T 110,1• • • • • •
Ib,l: 41TI/310,1 Ib,1:2 /2 1TI 10,1
Figure 14.2
One reason for choosing the Brillouin zone rather than any other unit cell is
that it demonstrates most easily the point group symmetry of the reciprocal
lattice. This is in fact the same as the symmetry of the real space lattice even
though these lattices may not be the same. The proof is as follows . The
294 Symmetry in Crystalline Solids 14.4
(14.21)
Substitution into equation (14.20) now shows that the function uk(r) defined
by equation (14.21) is a periodic function with the periodicity of the space
lattice
(14.22)
In other words the function uk(r) is an invariant with respect to the translation
group !I.
To find uk(r) we must write down the Schrodinger equation for an electron
moving in a periodic potential. The Hamiltonian is
p2
H = 2M+ V(r) (14.23)
with V(r) = V(r-n) . Thus, using the form (14.21) for the eigenfunctions
¢(k)(r), the Schrodinger equation is
[ ~~2 V2 + V(r) J
uk(r) exp (ik .r ) = e(k)uk(r) exp iik . r) (14.24)
Since uk(r) is periodic, this differential equation need be solved only within a
single unit cell using the periodic boundary condition on uk(r) at the edges of
the cell. Group theory has thus enabled us to replace a problem for the
complete crystal by one for a single cell. For each value of k , equation (14.25)
will have a set of solutions en(k ). We can see immediately from equation (14.25)
that as k is slowly varied, the solutions to this differential equation will change
continuously showing that the en(k) must be continuous functions of k . For
each n the energies en(k) are said to form a band since they vary over a limited
range of values as we now illustrate.
We will now discuss the general features of the el k ) by taking a very simple
model for the crystal. This model regards the electrons as being nearly free,
treating the periodic potential V(r) as a weak perturbation to the free-electron
296 Symmetry in Crystalline Solids 14.4.1
states. Clearly this approximation can only be appropriate for the outermost
atomic electrons (valence electrons) which, particularly in a metal, move fairly
freely through the crystal. We shall see in the next section that this model is
capable of explaining how elements from different sections of the periodic
table form metals or insulators.
If we put V(r) = 0 it is soon verified that equation (14.25) has solutions of
the form
u"k(r) = V;1 exp(iK".r)
Ti 2
e ~ ( k ) = 2M(K,,+k)2 (14.26)
e,,(k) = e~ (k) + v c-
1
f
cell
exp[ - i(k + K II ) · r] V(r)exp[i(k + K,,).r]dr
+ Vc
-2" I
L.
J exp( - iKm·r) V(r)exp(iK".r)dr 1
0 0
2
+ (14.28)
m e,,- em
14.4.1 Symmetry in Crystall ine Sol ids 297
\
I
I \
,, I
I
I
I
\
\
1
I
\
\
,
1
I
I
I
I
\
\
I
,
I \
\
I
I
I
,
I I
I
I
, I
I
I
\ I \ ' I I , I
"v
I I
\'I
\ te, (k)
I :I
(
1\
, ~\
, I
I
I
II
I
, I I I
\ \ I I
,
I
(a)
\
\ I
I , I
I
I
I ,
I
I
\ \ I ' I I
,, \ ~
I ,
'(
1 \ I
I
I I
I
, \
I \ I \
,'
I I
i 2 I
, ;'
\ I
\
\
\
I
I \
, 1 I
Ie (k)=U I
1 / 0 2M I
\ 11 \1 \ I I I
, I I
\.l.
~
I,
I ,, / *,
1 1\
I \ I
1,\
,, I:
le.,( k)
,
I \ I
/ I / I
,,
I
,
/ I
-, ,
I
, 'A "- ,
>(
/
,
1 / /
/ I
, I
I
\ I I
\ I I
\ I I
\ I
, I
,I
(b) ,
\
\\ I /
'.,"
", />,
"
\
,
' I'"
I
:/
I
I /
/
'
I
\
\ I
/ J..,
I \ /';' , /
I
" ....._ /
/ :',
I , _/
/ '\
<: »
// I
:
,
' - ""
"
,
,
. . -r _/ I ,_...
<;
I
~_ /
"' .... _ ....
, ,,.... - ........... ~ -",.- ... ....... ~ - ,,'
I
,
I
Figure 14.3
where the unperturbed energies e~ (k ) are given by equation (14.26). Since the
integrals are periodic it is sufficient to restr ict the integration to a unit cell, the
same region over which the wave functions were normalised. The first-order
term is j ust a con stant Vo, equ al to the average value of V(r). The matrix
298 Symmetry in Crystalline Solids 14.4.1
The perturbation series (14.28) can thus be written in the simple form
(
£Z(k ) + Vo - £(k ) 0 VI )((:0)=0 (14.32)
VI £I(k)+ Vo-£(k) (I
where £(k) is the new energy and Vt is the first Fourier coefficient in the one-
dimensional Fourier expansion corresponding to (14.29),
in energies s8(k) - s?(k ), the solutions approach the unperturbed values. At the
zone boundary itself s8(k ) = f;?(k ) and the roots are
(14.35)
This shows that a 'band gap' appears at the zone boundary with a width 21 V.I.
A similar calculation for the point of contact between bands labelled nand m
would yield a splitting of 21 V(n _ m) Iand the form of these results is illustrated in
figure 14.3(b).
This separation of the continuous range of allowed energies in the
unperturbed system into bands of allowed energies with gaps containing no
electron states is the central result of the theory of electron states in crystals.
We will sho w below how it can give rise to the difference between metals and
insulators. Before doing this however we look at the form of the wave
functions at the zone boundary. If we take VI to be negative, corresponding to
an attractive ionic potential at the lattice sites, the solutions to equation (14.32)
corresponding to the energies (14.35) a re (' ±= =F (' ± and hen ce from equations
o I
(14.31) and (14.27)
I
'P ~ ,,/a(x) = -2 -exp t -inx/a){l-exp(2nix/a)l
( a) l
and (14.37)
The wave function 'P ~ ,,/a ( x ) therefore has a large electron density near
x = ± a/2 and con sequently midway between lattice po ints, wherea s 'P ::: " /O(x )
has a large den sity at the lattice points and would be expected to have the lower
energy which is consistent with equation (14.35).
In the last section we found the general behaviour of the s(k ) curve for a
valence electron in a crystal. We now discuss the many-electron ground state
which is obtained by putting electrons into these single-electron states, taking
care of the Pauli exclusion principle. To do this it is convenient to consider a
crystal of finite size so that we can count the number of electrons and the
number of states. We shall therefore consider a very large but finite crystal, and
the boundary conditions we impose at the surfaces will restrict the values of k
which are acceptable for the Bloch states (14.21). The actual choice of
boundary conditions will not affect bulk properties and it is usual to use cyclic
conditions. We take a crystal with edges parallel to the primitive lattice
translations a, having N, cells along the edge parallel to a.. The total number of
300 Symmetry in Crystalline Solids 14.4.2
cells is then N = N I N 2 N 3' The cyclic boundary conditions require the wave
functions on opposite edges of the crystal to have the same value so that, for
instance
"'(000) = ",(N I 00) = "'(0 N 2 0) = "'(00 N 3 ) (14.38)
Using Bloch's theorem (14.21) this means that
exp(2n ik l N d = exp(2nik 2N 2 ) = exp(2nik 3N 3 ) = 1
and hence the possible values k jp of k, are given by kip = pi N j , with p any
integer. The allowed values of k thus form a lattice of points within the
reciprocal lattice having N, points along the unit vector hi and con sequently N
points in each unit cell. We must however restrict k to lie in the Brillou in zone
which has the same volume as a unit cell, and the above result can be expressed
by saying that there are just N allowed values of k uniformly distributed
throughout the Brillouin zone for a crystal containing N cells. Since the N, are
very large numbers there will be no real distinction between continuous and
discrete values of k when we discuss bulk properties except in so far as it
enables us to enumerate the states.
We now return to the one-dimensional crystal offigure 14.3(b) but assume it
to be finite with N cells so that there are just N Bloch states corresponding to
the N values of k in each of the bands n. We can put two electrons with opposite
spin in each of these states so that each band will hold 2N electrons. If the
atoms of our solid have just one valence electron each, like sod ium, then the
crystal with N atoms will have N valence electrons which, in the ground state,
will occupy the lowest N single-particle states and so only half fill the first
band. If, on the other hand, the atoms have two valence electrons the ground
state will have all the single-particle states in the first band occupied. On
applying an electric field a current can only flow if some electrons can be
accelerated and hence move to states with infinitesimally higher energy. For
the one-valence electron atoms, such as sodium, this is possible since there are
unoccupied electron states immediately above the highest occupied state. We
then have a conductor. For atoms with two valence electrons, like magnesium,
the band is full and the next available state is at the bottom of the second band
so that the minimum energy which can be given to an electron is equal to the
band gap 2 VI' We would therefore expect magnesium to be an insulator.
Unfortunately this disagrees with experiment but the reason is that we have
used a one-dimensional model. In a three-dimensional calculation we have
band gaps at all points on the three-dimensional Brillouin zone boundary but
they occur at different energies so that unless the gaps are large there may be no
energy region which is always in a band gap. Thus atoms with an even number
of electrons which, from the one-dimensional arguments, we would expect to
be insulators, are not always so in reality. We can illustrate this effect in two
dimensions by drawing the nearly -free electron energy bands from equation
(14.26)for a rectangular latt ice choosing k to lie first along the x-direction then
along the y-direction. The results are plotted in figure 14.4for the case a 1 = ~a2
14.4.2 Symmetry in Crystalline Solids 301
and demonstrate that for this system there will be no gap in energy between the
first and second bands since the gaps on the x- and y-boundaries of the
Brillouin zone occur at energies Ji2rr;2 j2Mai and Ji 2rr;2 j2Mai , see problem
14.3,and as shown in the figure this ditTerenceis generally large compared with
the gaps . The figure also shows how maxima and minima can now occur at
points away from the zone boundary or centre where the free-electron curves
intersect. Such features are important in considering excitations of electrons
from one band to another but we do not discuss this aspect (see Elliott and
Gibson, 1974, in the bibliography).
I
I
I
\ I
\ I i
\ I I
\
,, ,
I
/
I
\
, I
I I
I
\
, I,
, I
I "
1",,/
,
"
-,
_ _ _ _---"e::-_-::-;L--_-=::~:-~_ ____:;;:~------="------ k along x -a xis
Tr/ o ,
I ; ....
__/ \ I I "-
__ \ I
I '
\ I I
\ I I
I
,I
\ I I
\ I t{k)
I
I "-
/
/
/
,I
I
,
I "-
"-
II " ,
// I
I ,
--
/ I
,
I
I "
Figure 14.4 Nearly-free electron energy band s for a rectangular lattice with "I = 1"2'
The dotted curves are free-electron parabolas
'--t-----'--j-- k, \........f-+-+--1--. k,
(0 ) (b)
Figure 14.5 Surfaces of constant energy for a rectangular lattice. The dotted curves
are free-electron circles: (a) first band ; (b) second band
oc I exp(ik.n)cPs(r-n) (14.39)
n
14.4.3 Symmetry in Crystalline Solids 303
n =2 ------<:
Figure 14.6
where
¢(~ l (r) =
nm
L" exp(ik .n)¢nlm(r-n)
304 Symmetry in Crystalline Solids 14.4.3
is the Bloch state for orbital m. The coefficients c~ are generally found by a
variational calculation as in section 5.7. In subsection 14.9.2 we show how they
may be found from symmetry considerations alone if the crystal has additional
point-group symmetry and the vector k points along a symmetry direction.
We can see that the method is analogous to the LCAO method for molecular
orbitals discussed in chapter 13, with the irreducible representation label k for
the translation group replacing CI. for the point group. The calculation is less
general than that used in chapter 13 in that we have restricted the sum (14.40)
to atomic orbitals with a single value of n and I. The method can therefore be
extended by taking in more atomic orbitals belonging to atomic states of
nearby energy, e.g. states with different values of I but the same principal
quantum number n. In this case the variational calculation yields a matrix
equation exactly equivalent to equation (13.5).
For the simple Bloch states (14.39) the energy is obtained by evaluating the
diagonal matrix element of the one-electron Hamiltonian H(r) in this state,
Hkk
ss(k ) = - (14.41)
I kk
where
H kk =f cP~k~(r)H(r)cP~k)(r)dr
I kk = JcP~k)*(r)cP~k)(r)dr
Using the specific form (14.39) of the Bloch functions these expressions can be
simpl ified
where r' = r - n and we have used the fact that H (r) = H (r'). Introducing
p = n' - n as a variable instead of n' gives
H kk = I eXP(-ik.P)fcP;(r-p)H(r)cPs(r)dr
f1,p
where
14.4.3 Symmetry in Crystalline Solids 305
I kk =L exp(-ik .p)I p
f
lI.p
L exp(-ik.p)H p
s, (k) = -'-p -- ---- (14.44)
Lexp(-ik.p)Ip
p
where Vat (r) is the central-field potential of the free atom. The first two terms
then form the Hamiltonian for a free atom at the origin and the s-state wave
function is an eigenstate of this Hamiltonian with energy t:s ,
(14.47)
Using this equation we can now write the matrix element H pas
Lexp( - ik .p) Vp
t:s(k ) = s, + .!:..p -- - - -- (14.49)
Lexp( -ik .p)Ip
p
with
Vp f
= 4>; (r-p)[ V(r)- Vat(r)] 4>s(r)dr (14.50)
The first term on the right-hand side in equation (14.49) is just the free-atom
energy level and the second term gives the k dependence which spreads the
306 Symmetry in Crystalline Solids 14.5
level into a band. We can see qualitatively how this spreading depends on the
unperturbed state in the following way. If ¢. (r) is a deep core state such as a I s-
state, its wave function is very localised so that I p and Vpeffectively vanish for p
not equal to zero because there will be negligible overlap of the wave functions.
This means that the second term on the right of equation (14.49) reduces to
Vol 10 which is independent of k and the bandwidth is zero . For a less deep
state a crude approximation might be to neglect the overlap in I p but keep the
term in Vp for p a nearest neighbour. For a simple cubic crystal, with one atom
at each lattice point and for which the three lattice vectors a, are of equal length
a and mutually perpendicular, there are six nearest neighbours p . In this case
the energy (14.49) is given by
E.(k) = G. + Vo + 2 VI (cos ak; + cos ak, + cos ak:) (14.51)
where VI is the value of Vp for a nearest-neighbour site (all these will have the
same value in a cubic crystal). For a valence electron, several more terms should
be kept in both Vp and I P' The range 0 f values 0 f G. (k) as k varies is called the
bandwidth and the example given in equation (14.51) is seen to have a
bandwidth of 121 VII. To calculate the value of the bandwidth we must, of
course, take a model for the periodic potential V(r). A simple approximation is
to take V(r) to be the sum of the atomic potentials Va,(r), i.e.
V(r) = I Va,(r - n) (14.52)
II
With this choice , the shift Vo is small and negati ve since the perturbation in
equation (14.50) is very small in the central cell where the wave functions ¢.(r)
are large, and negative everywhere. For the same reason VI is also negative .
In a realistic calculation (see Harrison, 1970 and Zirnan, 1972 in the
bibliography) of the band structure it is necessary to choose V(r) more
carefully, since with the above choice the long range nature of the atomic
potential would mean that the series (14.49) would converge only very slowly.
It is therefore convenient to take V(r) to be a sum of localised potentials which
are similar to those of a free atom or ion in a small region of atomic size and
zero elsewhere . The energies Gnl must then be calculated for these potentials.
present problem means they must be classified by k, and that they can be
obtained by projection. Although in this simple problem with identical masses
the concept of mass weighting is unnecessary, we none the less use it so that
equations from chapter 6 may be taken over directly. The general displacement
is given by a vector q in a vector space with dimension equal to the number of
atoms. Thus we denote by e(n) a displacement of the nth atom by an amount
M -land q = LqnMle(n), where qn is the magnitude of the displacement of the
n
nth atom and M is the mass .
A displacement U(k) with the symmetry k is then obtained by projection from
e(O), as in equation (14.39):
U(k) = L exp(ikna) T(n)e(O)
n
= L exp(ikna)e(n) (14.53)
n
where qn is the magnitude of the displacement of the nth atom, then equation
(6.35) yields
L M - 1 Bnmexp( - ikna)exp(ikma)
n.m (14.55)
L exp( - ikna)exp(ikma) bnm
n.m
(Since the displacement U(k) is complex we must use the complex form (3.7) for
the scalar products in equation (6.35).) Because of translational invariance the
constant Bnm can depend only on the difference n - m so that, introducing a
variable p = n - m and writing B p for Bnm , we have
wf = M- 1 L [L Bpexp( -ikpa)]IL 1
p n
v = L 1).(qn -qn- d 2
n
By changing the dummy variable n in some of the terms this can be written as
v = 11. L (2q; -
n
qnqn - 1 - qnqn + d
so that it has the form of equation (14.54) with Bnn = 21., B np = -). for nearest
neighbours, and Bnp = 0 for all other pairs . Equation (14.56) now gives
I
rof = M [21. - Aexp( - ika) - Aex p (i k a ) ]
2A
= M(l-cos ka)
A plot of rof against k is shown in figure 14.7. The Brillouin zone boundaries
are at ± rr./a and the curve shows a great similarity to that of the lowest band of
states in the free-electron model. The degeneracy between states with opposite
values of k arises from the fact that this particular crystal has a point group
with inversion symmetry (see section 14.9).
-----:!:, - - - - - - - ------"'-...-"'----- - - - - - - , = - - _ k
o
Figure 14.7
The striking difference between these results and those ofchapter 6 is that, in
the latter, the spectrum of normal mode frequencies was discrete whereas here
it is continuous. This arises from our use of an infinite chain. On restriction to a
14.5.2 Symmetry in Crystalline Solids 309
chain offinite length with N atoms, the continuous curve of figure 14.7 would
break into N dist inct roots corresponding to N equally spaced values of k. In
real crystals the number N is so large that the distinction between discrete and
continuous spectra is of no consequence.
To find the normal frequencies and the coefficients c':/I we follow the method of
chapter 6 by diagonalising the 3s x 3s matrix of the potential energy for the
displacement (14.58) in the basis (14.57). As an obv ious generalisation of
equation (14.54) we first write V in terms of the simple coordinates,
V =t L Bli".r'jmql~(n)ql)m) (14.59)
t ,t' ,iJ .n,m
Here we have written p = n - m and 0 denotes any fixed cell. It then follows as
a generalisation of equation (6.35) that the are eigenvalues of the matrix wi
(14.60).
To show the effect of having more than one atom in the unit cell we consider
a one-dimensional chain with two atoms per unit cell and Hooke's law nearest-
310 Symmetry in Crystalline Solids 14.5.2
.. a
•
M, Mz A A
. ... 0. 00000 000000.000000 00000.00000000000. 00000 Oll'"
I 2 I 2 I 2 I 2
.. cel l
n -I
••
cell
n ••
cell
n+ 1
... cell
n+2
Figure 14.8
= FI [2q i(n) + 2 q~ ( n ) -
n
q2(n)qdn) - qdn)q 2(n)
To get the last equation we have manipulated the dummy variable n. Hence the
numbers Bt o • t • p in the one-dimension al equi valent of eq ua tio n (14.59) are
B IO . 10 = B 20. 20 = 2),
2). 2
-M - wk
1
=0
),
-------.-, [I + exp( -
(M I M 2 )1
.
ika)]
o a
-tt
Figure 14.9
and we now show that this is in fact an eigenfunction of V for all k. The
operator S• .S. commutes with S" unless p = n or m, since spin operators on
different sites commute. Thus using equation (14.63)
Sn,Sml k ) = S2/k ) +exp(ik. n)[ (S",Sm)' S,,_] 10>
+ exp(ik. m) [(S",Sm)' Sm-] 10)
and the commutators follow from the familiar commutation relations (7.28)
and (7.30) for the spin operators giving
[(Sn ,Sm),Sn-] = Sm-S"z-S,, _Smz
so long as m # n. (The case of n = m may be ignored since J "" = 0.) Putting
together these two equations with (14.61) gives
=E o lk ) + S L J q [ l - ex p (ik · q ) ] lk )
q
+ SL J q [ l - exp( - ik . q )] Ik )
q
,
25 I:J, (1 - cos k.q), where q runs over all sites relative to a fixed site. For each k
we have an excitation of thi s kind, which is called a 'magnon'. The analogy with
the lattice vibration problem of section 14.5 is apparent, with the turning of the
spin at a site corresponding to the displacement at that site. Notice that as
k -> 0 the excitation energy also tends to zero like k 2 , so that excitations may be
produced at infinitesimally low excitation energy. This contrasts with the finite
value which would be found for the expectation value of V in the wave function
Sp -10 ) in which only the spin at site p is turned. The state Ik ) corresponding
to k = 0, although having zero excitation, is not identical with the ground state
since it is given by I: Sp - 10 ) . However the operator I: Sp _ is the total spin of
p p
can then be constructed. This is not quite an exact eigenstate but the error is of
the order 1/ N , where N is the number of atoms in the crystal and is large. To
the same order of accuracy the magnon creation and annihilation operators
obey Bose - Einstein commutation relations (see section 19.1 and subsection
16.3.1) and we can talk about a state in which there are nk magnons with wave
vector k. Each magnon carries an excitation energy, given in equation (14.65).
overlap of atomic wave functions is very small indeed and to a very good
approximation the ground state has individual electrons occupying the atomic
orbitals on individual atoms. The many -electron wave function is just a
product of these orbitals. As in the spin-wave case we might try to construct
excited states by raising an electron on one atom to a higher unoccupied orbital
in that atom, but again we must use projection to construct a state with definite
I..
Let ¢J denote the many-electron ground state of an atom and ¢J' the excited
state formed from ¢J by exciting one electron into a higher orbital. Thus ¢Jpand
¢J~ refer to such atomic wave functions at site p . For the entire crystal we have
the corresponding ground state
'f'0 = n¢J
p p
(14.66)
'f'(k)=
ex
Lp exp(ik .p)'f'p (14.67)
Note that this differs from the tight-binding model where the states we
constructed were one-electron states. An exciton of wave vector k is thus an
excitation which propagates through the crystal (but of course does not carry
charge) . The energy will depend on the k vector, but only to a small extent, and
will be approximately equal to the-atomic excitation energy.
Thus the group t§ is not a direct product of :Y with the point group. As
another special case of (14.71) we deduce the inverse
{R j,n}-l = {R - 1, -Rj-1n}
j (14.74)
by asking what choice of Rj and m leads to the identity {E, O} on the right-
hand side of (14.71).
316 Symmetry in Crystalline Solids 14.9.1
Although R, refers to rotations about a lattice point at the or igin, the group
t§ includes rotations about other points because the product P(n)RjP( - n)
represents a rotation R, about the lattice point n (see problem 14.6) rather than
the origin. Furthermore, for particular lattices, t§ also includes rotations about
points other than lattice points. For example, in a two-dimensional square
lattice in the xy-plane it is verified that the operation {R z(n I2) , -ad is a
rotation R, (nI2) about the point (t,t, 0) at the centre of a cell. Reflections in
planes bisecting the cells may be generated in the same way.
In some crystals there may be additional symmetry operations which are not
of the form (14.70). These are the so-called screw-axis and glide-plane
transformations which are combinations of translations with rotations in
which neither of the separate transformations is a symmetry element. We shall
not investigate the larger groups obtained by including such operations but
refer the reader to Nussbaum (1966) and Birman (1974}-see the bibliography.
This neglect will not invalidate our results but means that in some crystals
there will, for example, be further degeneracies unexplained by our dis-
cussions. The word 'symorphic' is used to describe the group t§ defined above,
with 'non-syrnorphic' referring to the more general space groups with glide-
planes or screw-axes.
The inclusion of time-reversal will also enlarge the space groups in much the
way that it enlarged the point groups in section 9.8. In particular for magnetic
crystals there will be what are known as magnetic space groups in which time-
reversal occurs only in conjunction with a rotation or translation and is not
itself a symmetry operation. For instance, in many antiferromagnets, trans-
lation by a lattice vector must be accompanied by time-reversal to reverse the
directions of the spins . We shall not discuss these magnetic space groups any
further but a full account may be found in Bradley and Cracknell (1972) in the
bibliography to chapter 9.
the effect of group transformations on basis vectors e" which have been chosen
to transform irreducibly with respect to the subgroup of translations !!7 and
hence carry a label k, We therefore derive here a few useful results. First, under
a simple translation, we have from equation (14.11), by definition of e",
T(E, n)e" = T(n)e" = T I")(n)e" = exp( - ik . n)e" (14.78)
= exp( - iRjk . m ) e ~
This last equation shows that the vector e~ transforms irreducibly under the
subgroup of translations and belongs to the representation T lR;"I. (We use the
general method given in section 20.3.)
These general results will now be used in constructing and classifying
(labelling) the irreducible representations of the space group r§. Let L denote
the vector space of an irreducible representation. Since the translations form a
subgroup !!7 of r§ we are free to choose basis vectors which belong to
irreducible representations T I") of !!7. Let e" be such a basis vector. We can
then generate a set of vectors by operating on e"
with all the group elements
T(Rj, n). Before continuing the discussion we must make clear that there are
two different kinds of vectors involved. There are the vectors k in ordinary
three-dimensional space and there are the vectors e k in the vector space L 0 f
the representation. We shall refer to both kinds simply as 'vectors' but it will be
clear from the context and from the notation which kind of vector is meant. It
is convenient to discuss first the case when k is a vector with no special
symmetry, by which we mean that none of the rotations R, leaves k invariant.
In this case, the set of vectors k, = Rjk are all different , since if k, = k j then
R j- 1 Rjk = k and k would have special symmetry. The set of vectors k, is called
the 'sta r' oik , Hence, from equation (14.80), the set of vectors e", = T(RJet in L
all belong to different irreducible representations T(k ,) of !!7 and are therefore
linearly independent. Furthermore, we now show that they form an invariant
space and thus provide a representation of r§ . In other words, the set of vectors
e,,; generated from e" by the action of group rotations T(R j ) only, form a basis
318 Symmetry in Crystalline Solids 14.9.1
= T (R jR j,m)e k
using equation (14.71). If we call the group element RjR j = R, we can use
eq uatio n (14.79) and write
T (R j ,m )e k , = exp( - i R, k . m)T (R,)e k
= exp( - ik. . m)e k
. (14.81)
where k, = R,k = Rjkj. Th is eq ua tio n gives the general matrix element for a
general group element { Rj , m } o f <§. We note that the dimension of the
representation I" is equal to g, the number of elements in the point group. As in
subsectio n 7.4.2 we can now argue that the set e k, forms a basis for the space L
o f the irreducible representation because, if it did not, th en the space L would
have been reduced. Thus the structure of the irreducible representations of <§
is described by the basis vectors ek with the matrix elements (14.81). The
representation is labelled by a n arbitrar y vector k and denoted by flkl. There is
clearly an equ ivalence between r 1k)and r 1k.) for any k, in the sta r of k and, as
with the translation group, there is equivalence if k is replaced by k + K. , where
K. is an y reciprocal lattice vector.
If k has special symmetry (e.g. lies on an n-fold rotation axis or in a mirror
plan e) some o f the group elements R, will leave k invaria nt. These clements
must form a subgro up o f the po int group a nd it is called the 'litt le group' of k .
If g, denotes th e order o f the little group of k ; and g the order of the point
gro up, then one sees (problem 14.7) that the sta r of k now contains only gig,
vectors, which we call k p ' In general, there is more than one rotation R with the
property Rk = k p , for fixed k and k p , but for each k p we choose just one of
these and denote it by H p . It is easy to show that any rotation may be written
R, = HpG a , where G , belongs to the little group.
The irreducible repr esentati on s of <§ will now have a more complicated
structure since the vectors e k are no longer necessaril y linearly independent.
The subset of the e k for which k, = k, correspond ing to elements in the little
group, will form a subspace L ' of L which is invariant under the little group.
Consequently this subspace provides a representation of the little group. Since
the little group must itself be one of the point groups its irreducible
representations are well known and are denoted by T (2 ). It can be shown that if
L is to give an irreducible representation of the space group, then the
representation of the little group given by the subspace L' mu st be irreducible
with respect to the little group. Thus if S2 denotes the dimension of T(2) then
there will be a set of S2 linearly independent basis vectors eka~, where
JJ = 1,2, ... , Sa' which provide the representation T (2) of the little group and
all behave like T lk)under translations. By taking elements H p which lie outside
the little group we can now generate a corresponding set ofS2new bas is vectors
14 .9.1 Symmetry in Crystalline Solids 319
e liX~ for each ofthe gig, vectors k p in the star. Thus the representation of r§ has
dimension SiXg/g, and is denoted by rid).
To construct the represent ation matrix elements, corresponding to equation
(14.81) for the case of special symmetry, we must first write the general rotation
in the form R j = HpG a , where G, is an element of the little group and H, is one
of a set of rotations, one for each point in the star, with the property Hpk = k p'
Starting from the e" lJ we define the general basis vector by
e'iX ) = T(H )e ('"
kplJ p klJ
then
T(R.)' n)e(iX'
kplJ
= T(n)T(R ).)T(H P )e(ikXu)
which gives the matrix elements. In der iving this result we have defined H, and
G, as the factor s in the anal ysis of the rotation RjHp = HqG b since any
rotation can be analy sed in th is way. The vector k q is given by k q = Hqk which
is the same as Rjk p because k is unchanged by G b • The numbers T ( ~) (G b ) are
the standa rd irreducible repre sentation matrices for the little gro~p.
Some example s ofstars' for two-d imensional latt ices a re illustrated in figure
14.10and for a three-dimensional example we take a simple cubic lattice where
the Brillouin zone is also a cube of side Ztt]«. The point group here is 0h and
(0 ) (b) ( c) ( d)
Figure 14.10 Stars of k for a two-dimensional square lattice with symmetry group C 4v :
(a) general k ; (b) k alo ng x-axis: (c) k along sq uare diagon al; (d) k at zone boundary
alo ng x-ax is
for a general k the star o f k will have 48 vecto rs k, The special values of k with
symmetry are shown and labelled using the standard solid-state phys ics
not ation in figure 14.11. For a general point ~ along the kx-axis the little group
is C 4 v with 8 elements and the star has 6 points. For the particular point X on
the face of the zone , with k x = Tr ia, there is additional symmetry since it is
320 Symmetry in Crystalline Solids 14 .9.2
-t----*.
Figure 14.11
equivalent to the k-vector with k x = - ttl a. The little group of X therefore has
a reflection plane perpendicular to the x-axis and is D 4 h with 16 elements. The
star of X has thus only 3 points. At the centre r of the cube the little group
becomes the full cubic group 0h ' In a similar way if we proceed from r to M
along the line r M, then at a general point L the little group is C 2 v with four
elements and the star of L has 12 po ints. At the special point M, however, all
four corners of the cube in the kxkz-plane are equivalent and the resulting
symmetry group is D 4h with 16 elements so that the star of M has only three
points M, M 2 and M 3 in figure 14.11.
Table 14.1 Character tables for the group C 4 y at ~, O; at F and D4 h at X, for the
common elements (i.e. for the elements which appear in C4 y ) . The irreducible
representations are here labelled by ~j , F, and Xj, respectively, rather than by the
standard notation of appendix A.l in order to distinguish the representations of the
different groups. The groups O, and D4 h are product groups with an S2 factor so that
their representations carry a ± label. We have only listed the odd parity representations
since we are primarily interested in the p-states. The only even parity states to occur
belong to the trivial identity representations [i and Xi .
~1 1 1 1 1 1
~2 1 1 1 -1 -1
~3 1 1 -1 1 -1
~4 1 1 -1 -1 1
~5 2 -2 0 0 0
°h
[i 1 1 1 -1 -1
[- I 1 -1 -1 1
2
[3 2 2 0 -2 0
[,;- 3 -1 1 1 1
[; 3 -1 -1 1 -1
D4 h
Xi 1 1 1 -1 -1
Xi 1 1 1 1 1
X3 1 1 -1 -1 1
X,;- 1 1 -1 1 -1
X;; 2 -2 0 0 0
Xvector 3 -1 1 1 1
transform as the vector representation, see section 5.4, with character given in
the last row of the table. This reduces to a sum of .1 1 and .15 under the little
group. Of the three p-functions, Px belongs to .1 1 and the pair Py and pz belong
to .15 ' (Remember that x is the four-fold axis.) As in equation (14.39) we can
construct three Bloch states
r/J(k) =
q
L exp(ik.n)r/Jq(r-n)
/I
(14.82)
where q = x , y or z and, for the case being considered with k along the x-axis,
322 Symmetry in Crystalline Solids 14 .9.2
they transform in the same way as the p-functions under the little group. If
these states are now used in a variational calculation, as outlined in section
14.4.3, we may therefore conclude that the states </J ~k) and </J~k) will be
degenerate, labelled by the representation f( ti ,k I, while </J If) is labelled by f(ti,k) .
Because they belong to inequivalent representations these states will not mix.
If we add to the variational calculation th e Bloch state </J(k) s
formed from an
atomic s-state, it will o f course have .11 symmetry and therefore mixes with the
state </J(k)
5
. The matrix elements of H in the sa me nearest-neighbour approxi-
mation as equation (14.51) are therefore
El k )
rtl....---
';:;------_-:-- 1-- --+ x -axis
r x
Figure 14.12 Tight-binding energy bands for s- and p-like atomic states with k along
the x-axis
14.9.3 Symmetry in Crystalline Solids 323
tends to zero, the energies for ~ 5 and ~ 1 coalesce and the s and p wave
functions are no longer mixed . Thi s additional degeneracy is however to be
expected since, at the point F, the little group becomes the full octahedral
group O; and the three Bloch functions of equation (14.82) transform
according to the irred ucible representation r 4- of 0h' The representations ~ 1
and ~ 5 of C 4 v , which label curves which come together at I", may be obtained
from the representation r 4- by the process of reduction of r .; on restriction
from O; to the subgroup C 4 v , see sectio n 4.18. This connection of ~1 and ~5
with r;; is called a compatibility relation. In this way we can also label the
irreducible representation at the po int X by irreducible representations of the
little group at X which is D4h and from the character table we deduce the labels
sho wn in figure 14.12.
Bibliography
and for a specialist review on symmetry and electron states in metals see
Birman, J. L. (1974). Theory of space groups and infra-red and Raman lattice
processes in insulating crystals, Handb. Phys., XXV (2b); Light and matter
(1b)
Problems
14.1 Show that the vo lume of the Brillouin zone is (2n)3 /v" where Vc is the volu me of
the primitive cell.
14.2 Show that an y vector K m which satisfie s equation (14.16) is a reciprocal latt ice
vector.
14.3 Show that the free-electron parabolas in figure 14.4 are given by
eOn,n, ()k = 2M
~ { ( 2n" 1
at +k x
)2 + (2n"2 )2}
~ + k).
Much of this book has been concerned with symmetries in space. So far as we
can tell, space is homogeneous in the sense that an experiment carried out in
one laboratory will give the same result as in another and the result will not
depend on the direction in which the experiment is set up. This implies that the
laws of nature are invariant under translations and rotations and leads to the
conservation of linear momentum and angular momentum and all the
associated phenomena discussed in earlier chapters. (Within a crystal the
positioning of the atoms produces a potential field which lacks these
continuous symmetries and is invariant only with respect to finite rotations
and translations, as discussed in chapter 14.) Time is also believed to be
homogeneous in the sense that an experiment carried out today will give the
same result as one carried out tomorrow. This invariance with respect to time
translations leads to conservation of energy, a feature which is built into the
Schrodinger equation which we have used hitherto as our starting point. The
introduction of the theory of relativity by Einstein rested on the idea that space
and time possess a deeper homogeneity; that the laws of nature are invariant
under a set of transformations (Lorentz) which operate in the four-
dimensional 'space' of space and time . The transformations include rotations
and translations in both space and time as special cases. The concept of the
(rest) mass of a particle has a natural place in this theory and one finds for
325
326 Space and Time 15.1
example that the Schrodinger equation does not satisfy the new invariance. It
is necessary to derive a new formulation for both classical and quantum
mechanics to incorporate the new invariance. However, the corrections
implied by this modification are important only at velocities near that oflight,
about 3(10)8 m/sec. Thus it turns out that for most classical and quantum
mechanical applications, including the structure of atoms, molecules and
nuclei, the relativistic corrections are small. In this chapter we shall study the
groups associated with the Lorentz transformations and see how their
representations are related to physical quantities. The improper rotations
generated by the inclusion of the space inversion may also be incorporated in
the four-dimensional framework along with the time inversion operation.
The concept 0 f four-dimensional space and time is first introduced in section
15.2 where we describe the Lorentz group. Translations in four dimensions are
included in section 15.4 leading to the Poincare group. The improper
operations of space and time inversion are included in smaller sections, 15.3,
15.5 and 15.6. Although some indication of the physical significance of the
transformations is given in section 15.2 we delay a discussion of the
significance of the irreducible representations until the last two sections. The
chapter begins in section 15.1 with a discussion of the Euclidean group which
combines rotations with translations. This group is concerned entirely with the
three space dimensions and in a sense does not belong to the present chapter.
Also the physical interest in the Euclidean group is quite small but we include it
here for two reasons. Firstly it is very similar to the Poincare group but
somewhat simpler. Secondly, a group isomorphic with the two-dimensional
Euclidean group occurs in our study of the Poincare group and we shallbe able
to quote from the results of section 15.1. A reader who is anxious to move into
four dimensions could defer section 15.1 until he meets this quotation in
section 15.4.
We have studied rotations and inversion in some detail in earlier chapters and
in chapter 14 we described the symmetry of crystals with respect to finite
translations and finite rotations-the 'space groups'. We now study the group
of continuous translations and rotations following the procedure used in
section 14.9 for the space groups. We stress that the present section is
concerned entirely with the three space dimensions.
15.1 .1 Translations
following equation (3.37) so that the function l/J(x) = exp(ikx) transforms like
the irreducible representation T(k). Following section 7.3 the infinitesimal
operator then takes the form P x = - a/ox.
The exten sion to translations in three dimensions is trivial since translations
all commute. Thus the three-dimensional translation group is a direct product
of three one-dimensional translation groups in x, y and z. The general group
element is thus denoted by P(p) with the definition
P(p)r=r+p (15.2)
where r is a general position vector. Writing p = (¢, I}, () there are clearly three
parameters and three infinitesimal operators P x ' P y and P z which in a space of
functions l/J(r) are given by - a/ox, - %y and - %z. The irreducible
representations are obvious generalisations of equation (15.1), being labelled
by three numbers kx ' k; and k, which form a vector k,
Tlkl(p)=exp(- ik.p) (15.3)
If the Hamiltonian of a quantum system is translationally invariant then the
infinitesimal operators P,; P y and P z represent conserved quantities, essen-
tially the three components of the total linear momentum of the system . As
they stand, these operators are skew-Hermitian, being the infinitesimal
operators of a unitary transformation, but they can be made Hermitian by the
inclusion of a factor i. The dimension is also inappropriate for a momentum
but this can be put right by including a factor h. One actually defines the
physical momentum operators Pq by Pq = iliP q • The reason for associating
these operators with momentum is that in classical mechanics also the
translational inva riance of a system leads to the conservation of the total linear
momentum, as explained in section 16.1.
Since the irreducible representations of the translation group are one-
dimensional this invariance leads to no degeneracy, but simply to the labelling
of the eigenfunctions by the representation labels k. From the expression given
above for T(k)(p) one sees that the infinitesimal operator P, has the eigenvalue
- ik q in a state with given k . Thus the momentum is given by hk .The presence
of any potential V(r), depending on position r, will violate translational
symmetry unless V is trivially a constant. Thus for a single particle, translation
invar iance is relevant if the particle is moving freely. For a system with more
than one particle, any internal potential like V(r 1 - r2) is clearly invariant
under translations and thus the total momentum of the system is conserved.
328 Space and Time 15.1 .2
using equation (15.2). Notice that rotations and translations do not commute,
which is obvious geometrically, and in fact taking the product (15.4) in reverse
order we have
R(a) P(p)r = R(a) (r + p) = R(a)r+ R(a)p
so that we may conclude that
P(p)R(a) = R(a)P(R-1(a)p) (15.5)
Because of this result it follows that one need only consider group elements of
the form P(p) R(a), since any product in the reverse order may be written in this
way using equation (15.5).
Following our general definition (3.37) of the transformation of functions
we define the operator Ttp, a) in function space by
T(p, a) "'(r) = "'{ [P(p)R(a)] - I r } = "'{ R - I (a)p - 1 (p)r}
= ", {R -1(a)(r-p)} (15.6)
which is similar to the notation used in section 14.9.
Since the rotations and translations do not commute, the group iff3 is not a
direct product group so that the labelling of the irreducible representations of
iff3 is a non-trivial problem. However, the existence of the translations as an
Abelian subgro up enables us to solve this problem following the procedure
used in section 14.9 for the space groups.
We use the notation T(p, a) to denote the representation operator cor-
responding to the group element P(p)R(a) and, by definition, the T(p, a) must
satisfy the same multiplication laws as the group elements, i.e. from equation
(15.5).
an axis lying in the xy-plane which carries k o into k, viz. Rxyk o = k. From these
definitions we see that
R;/ Rk o = R;/k = ko
so that the rotation R;y1 R leaves k o unmoved and must therefore be a rotation
about the z-axis, i.e. R;/ R = R. which gives the desired result,
(15.8)
The irreducible representations E of the Euclidean group tff 3 may now be
generated by a sequence of steps as follows . (See also section 20.3.)
Step (I). Choose basis vectors Ik > which belong to irreducible repre-
sentations T(k) of the subgroup of translations so that
T(p, 0) Ik > = exp( - ik . p)lk > (15.9)
(It will be more convenient in this chapter to use the notation Ik > for a basi s
vector rather than ek which would be more consistent with section 14.9.)
Step (2) It is now shown that the vector T(O,a)lk > transforms like the
representation T(R(a)k)of the subgroup of translations. Using equations (15.7)
and (15.9) we have
T( p, O)(T(O, a)lk » = Ttp, a)lk >
= T(O, a)T(R - I (a) p, 0) Ik >
= T(O, a)exp[ - ik .R - I(a ) p ]Ik >
= exp[ - iR(a)k . p] (T(O, a)lk » (15.10)
which is the desired result. This shows that if the representation E contains a
basis vector Ik >then it must also contain vectors Ik' >for all k' = R(a)k, i.e. for
all k' with Ik'i = Ikl.
Step (3) We now choose a reference direction, usually the z-axis, denoting a
vector in this direction by k o , a basis vector which transforms like T(k o ) under
translations by Iko > and a rotation about k o by R(a o). As a result of step (2)
above, the vector T(O,ao) Ik o > will also transform like Ik o > under translations
because R(ao)k o = ko . Thus the set of basis vectors ofE with the same label k o
will form an invariant space under rotations about the z-axis and may
therefore be labelled also by the irreducible representations of the group 9t 2 of
rotations about a fixed axis. Thus we may introduce the more detailed notation
Ikorn >in place oflk o >where rn labels the :Yi 2 representation (see section 7.3)
and
(15.11)
Step (4) We are now in a position to construct the representation E. It will be
labelled by Ikl = Ikol and rn. Starting with a single basis vector Ikorn >we
generate the set
Ikrn> = T(O,ii(k))lkorn > (15.12)
330 Space and Time 15.1 .3
where k is any vector oflength Ikol and R(ii(k)) is that unique rotation about an
axis a(k) in the xy-plane which carries k o into k, i.e. R(ii(k) )k o = k , We use the
tilde on ato signify that this vector lies in the plane perpendicular to k o , i.e. the
xy-plane. There is therefore an infinite set of basis vectors Ikm ) in E which all
have the same m and the same length of k but with different directions of k. To
prove that this set is invariant under the group Iff3 we consider the effect of a
general operation T'(p, a) on an arbitrary basis vector Ikm ),
T(p, a) Ikm ) = T(p, O)T(O, a)T(O, o(k) )Ikom )
= T(p, O)T(O, a(k') )T(O, c)lkom )
= T(p, O)T(O, a(k'))exp( -imc)lkom )
= exp( - imc)T(p, O)lk 'm )
= exp( - imc)exp ( -ik' . p) Ik 'm > (15.13)
In deducing this result we have used equation (15.8) for the rotation
R(a)R(d(k)) writing
R(a) R(a(k)) = R(o(k')) Rz(c) (15.14)
which defines the rotations R(a(k')) and Rz(c). The vector k' is defined by
k' = R(d(k'))ko or equivalently by k' = R(a)k using equation (15.14). Thus
given aand k one can deduce k'and cso that equation (15.13)actually provides
the representation matrix elements of E.
By an argument like that used for the group :Ji 3 in subsection 7.4.2 it follow s
that, if it is to be irreducible, the representation E can have only one basis
vector with the labels Ikom ) . Thus the irreducible representations of g 3 are
given by equation (15.13). To be definite, we choose a particular axis, the z-axis,
in generating the basis vectors but it can be argued quite simply that
representations generated by taking k o in different directions are equivalent so
long as k o ha s the same length. Thus the direction of k o is irrelevant in labelling
the inequivalent, irreducible representation of rff 3 and we use the notation
Ellkl.mJ for E, where Ikl denotes the length of all vectors k appearing in the
representation.
As an example of these representations we see that the representation Ellkl.OJ
may be generated, using equation (15.12) from the function expUlklz) of
position ofa single particle. The basis vectors are given by IkO) = exp(ik.r),
I.
where k runs over all vectors with fixed length Ik To generate the
representations Elikl.m) one need s to take at least two particles and the function
exp[ilkl(Zt +zz)] {(xt-xz)+i(Yt-Yz)}m would suffice for Ikom ).
The procedure used here in the study of the irreducible representations of
rff 3 is another example of the general method of 'little groups' used in
subsection 14.9.1. Here the little group is the group 9t z which leaves the vector
k o unchanged and the 'sta r' contains an infinite number of points. This method
will be used later in studying the Poincare group and in fact it is largely for that
15 .1.4 Space and Time 331
This small section may be omitted since it will not be used directly, but we
feel it necessary to comment on the fact that the irreducible representations of
C 3 have a continuum of basis vectors. Throughout the general discussion of
chapter 4 and in alI previous applications to physical problems the irreducible
representations have had finite dimension and were therefore expressible as
finite matrices. Here, for the first time, we have found irreducible repre-
sentations E(lkl.m) which have a continuous infinity of linearly independent
basis vectors Ikm >corresponding to different directions of k , This situation
will occur again for the Poincare group in section 15.4.
Let us restrict to vectors IljJ >belonging to a particular representation space .
Then, in the case of a finite representation, we write
N
IljJ >= L ljJ;ji > (15.16)
i= 1
where Ii>denotes a set of linearly independent basis vectors and the ljJ j are
numbers. We then define the scalar product of two vectors Ic/J >and IljJ >as
N
where the integration runs over all directions of k and IjJ (k) is a function of
direction of k. We then define a scalar product as
We may now verify that the representation E(lkl,m) given by equation (15.13) is
in fact unitary. From equations (15.13) and (15.19)
f
= IjJ(R -1k')exp(- imc - ik' ·p)lk'm >dn k·
using the fact that, since k ' = R(a)k with fixed a, the surface integral dn k may
be replaced by dn k" i.e. the Jacobian has the value unity. With a change of
notation for the integration var iable this may now be written as
ITIjJ > = f
IjJ(R - I k)exp( - imc- ik . p)lkm > dn k
so that comparing with equation (15.19) and using the definition (15.20) of
scalar product we have
The analogue to equation (15.18) may be deduced only if we make use of the
Dirac b-function of the angles of k' and k. Notice that the definition of scalar
product (15.20) is not quite the same as the integral over all momentum space,
since here we keep the magnitude of k fixed.
Having discussed the way in which rotations and translations come together in
the Euclidean group we now temporarily discard translations and study the
15.2.1 Space and Time 335
extension of the rotation group :Jt 3 into four dimensions when time is
included. We define the four-dimensional space by associating with each
'event', or happening, a 'point' with four coordinates (x , y, z, cr) which denote
the place (x , y, z) and time l of the event , where c is the velocity of light. Thus all
fo ur coordinates have the dimen sion of length. We shall denote the point by a
vector e = (x, y, z, et ) where e is the four- vector from the 'origin' (0, 0, 0, 0) to
°
the point (x, y, z, et ). Remember that an event at the o rigin in four-space
implie s an event at the position x = y = Z = at time t = 0.
( -~ o
~)
0
-1 0
g=
o o -1
o o 0
Lej = L Ljje j
j
so that
the contravariant components. In this way the factors gjj may be absorbed by
switching from one set to the other. Thus for example a scalar product takes
the form
with the last two expressions involving both covariant and contravariant
components but avoiding the g jj' In this book we prefer not to introduce the
extra sophistication of having both sets of components and therefore we shall
always use the contravariant components e j denoted by a lower index and
transforming according to the equations above.]
so that
Since the Lij are real it follows that L;' ~ 1and hence the values of L" can be in
either of two separated regions L" ~ 1 or L" ~ - 1. Thus we have the four
types of transformation L distinguished by
these groups from those which include translations. We shall not use this
nomenclature. In thi s section we confine our attention to the group 2 .
and
1 0 o
o 1 o (15.24)
Q z(b) = 0 0
cosh b
( o 0 -sinh b
The first of these, written in partitioned form, is simply a rotation , since the
condition (15.23) implies that RtR = 1 and the time coordinate is unchanged.
The second leaves all points in the xy-plane unchanged and is called a pure
Lorentz tran sformation (o r 'boost")in the z-d irection. It cont ains a single real
parameter b and each value of b in the range - 00 < b < 00 corresponds to a
different transformation Q, (b). The transformed vector e is given by
(We shall see later that p has the physical significance of a velocity p = vi c
measured relative to the velocity c of light.) We may clearl y define a similar
boost transformation in an arbitrary direction defined by a unit 3-vector u.The
notation Q(b) will be used where b = bu and b defines the magnitude of the
boost, as in equation (15.24). The transformed vector e' = Q(b)e may be
deduced as a generalisation from the result for Q z(b) and is most easily
expressed by giving the space and time components e' and ct' of e',
e' = e + { (cosh b -1)(e .u) - ct sinh b} u
ct' = ct cosh b- (e .u)sinhb (15.26)
15.2.2 Space and Time 339
The 4 x 4 matrix of Q (b) may be read off from these equations. For example
Qxx(b) = 1 + (coshb -1)u;, QyAb) = (coshb -1)uxu y,
Q,x(b) = -uxsinhb, Qu(b) = coshb
There is similarity between equation (15.26) and the corresponding expression
given for rotations at the beginning of section 7.4.
Since Q(b) and R(a) are both Lorentz transformations it follows that their
product is also a Lorentz transformation for any a and b. Further than this we
now show that any Lorentz transformation L may be written as such a
product. In principle this result is very similar to the factorisation (15.8) of an
arbitrary rotation. We first define eo = (0,0,0, 1). Then, for each L we may
e
construct a vector = Leo . From equation (15.26) one sees that there is a
unique boost Q(b) with the property Q(b)eo = e. Thus we have
eo = Q-l(b)e = Q-l(b)Leo
showing that the product Q - 1 (b) L leaves eo invariant and must therefore be a
pure rotation. We may therefore write Q - I (b) L = R(a) so that
L = Q(b)R(a) (15.27)
We shall find it convenient to parameterise L in this way by the six parameters
needed to define the two three-vectors a and b. We note that whereas the
magnitudes of the rotation parameters are limited in the usual way lal ~ 11:, the
boost parameters are unlimited. This is clear from equation (15.24) which
shows that Q z satisfies the defining equations of a Lorentz transformation for
all b.
---':2~~-+---;~~--+-f---~~-;---cl
Figure 15.1
contain the null vectors. Notice that a null vector, with e.e = 0, does not have
all its components zero; this property is restricted to the single zero vector
e = 0. Although figure 15.1 is restricted to a single space dimension, the
classification is quite general since it is based on the scalar product and the sign
of t. In the general case the space-like and time-like regions are separated by
what is called the 'light cone', meaning the set of points satisfying
x 2 + y2 + Z2 = c 2 t 2 and so forming a three-dimensional 'surface' in four-space.
Since a Lorentz transformation preserves the length of a vector it carries a
point described by a vector einto another point e' with the same A, i.e. on the
same hyperbola. (One imagines a family of hyperbolae corresponding to
different A.) Thus the space-like, time-like, null and zero regions are invariant
under Lorentz transformations. Furthermore all six regions remain invariant
under the proper Lorentz transformations defined earlier. These do not permit
transformation between future and past regions (see problem 15.4). (NB The
two-dimensional diagram 15.1 suggests that proper Lorentz transformations
cannot connect regions with positive and negative z but this can be achieved in
four dimensions with the help of a rotation about one of the other space axes.)
One often refers to pictures like figure 15.1 as 'Minkowski diagrams'. An
individual at rest with respect to the space axes x, y, z would move parallel to
the time axis in the Minkowski diagram while a speed of v relative to the space
axes results in a path making an angle tan - 1 (v/c) with the time axis. Starting
from the origin , with speeds less than that of light, one can reach points in the
future time-like region only.
is not absolute, as one assumed in pre-relativity days. Were it not for this effect
the mesons produced by collisions in the upper atmosphere would have
decayed long before reaching the surface of the earth. They have speeds close
to e so that the factor (1 - fJ 2) - t is large. Notice however that the 'interval'
between the two events, birth and death of the meson, is unaffected by the
velocity .
~x -~.<
Q
y
-b<)
=bOb: = q ;~y.z(QqXq+bqYq) (15.30)
-by -b z
where bq are the three components of the boost vector b, the matrices X, are as
defined in equation (7.24) and the matrices Y q are given by
YX=( -1
~ ~ ~ -~)'YY=(~ ~ ~ -!),
0 0 0 0 -1 0 0
These three matrices describe infin itesimal boosts in the three space directions.
The commutation relations of the infinitesimal operators follow from the
matrices (15.31)
[X.~,x.j = x. [x; x.] = X.~ [x., x .j = x,
[Y,, Y y ] = - X, [Y y , Y,'] = - X.~ [Y. , Y,] = -x y
together with [A q , B",] = 0 for all q and q', In other words if we regard the A"
and B, as the infinitesimal operators the y will describe a product group
because all A q commute with all Bq • Furthermore the commutation relations
(15.34) of the A q and B, separately are precisely those of the group 9i 3 , see
equation (7.25). Thus the A q , B, describe the infinitesimal operators of the
product group fJt 3 x 9i 3 • This does not mean that !e and 9i 3 x 9i 3 are
isomorphic because in !e the X, and Y, appear with real coefficients, while in
9i 3 x 9i 3 it is the A q and B, which have real coefficients. However it does
enable us to deduce and label the irreducible representation of!e from those of
9i 3 x fJt 3 because the arguments are based on the algebra of the infinitesimal
operators as in section 7.4.
We now move away from the basic four-space of the Lorentz group to study
representations in more general vector spaces. We shall however use the
same notation for the infinitesimal operators, so that whereas in the preceding
section A q and B, have denoted 4 x 4 matrices, in the following they will
denote the corresponding operators in some vector space which provides the
basis of the representation . We know from the general arguments ofsection 7.1
that the commutators of the infinitesimal operators are the same in all
representations and may therefore be taken from equation (15.32).
Since the structure of the irr ed ucible representations O (jJ of fJt 3 was deduced
in subsection 7.4.2 entirely from the commutation relations and since the
irreducible representations of a product group are labelled by a pair of labels,
one for an irreducible representation of each group, it follows that the finite
15 .2.5 Space and Time 345
_H) (~! H)
o
x'- y' ~(-~ -2
o = -3
o o 0 000 3
and X .y =0
which agrees with the formulae (15.35) when j = j' = 1. One may even go
further and deduce the matrices of the operators X, and Y q knowing the
matrices (see equations (7.40) and (7.39)) of the infinitesimal operators of ~ 3 '
Thus for example the matrices (15.31) may be deduced from the OH) matrices
by a suitable change of basis (see problem 15.5). In particular it is worth noting
that X, = A z + B, so that from equation (15.36) the basis vector of LU' j')
labelled by m, m' will be an eigenvector of X, with eigenvalue - i(m + m') and
will transform according to the representation TIm + m'l of the group ~ 2 of
physical rotations about the z-axis.
The reduction of products of two irreducible representations of!f' is given
from the corresponding rule for the rotation group. The result is
V M )= t D<Jl (15.38)
[;- /1 ~ J ~ (j+n
which may be derived simply in two steps . First we note from equation (15.37)
that
which is to be expected since the 4 x 4 Lorentz matrices them selves give the
representation LIB) and the four-space breaks up into the three -space
components, which transform like 0(1), and the time component which is
rotation invariant.
We should also notice that whereas for the group Yt3 the complex conjugate
of a representation D U I is equivalent to DUI with a change of basis
e: = ( - 1)i- m e _m in the notation of subsect ion 7.4.2 (see also section 7.7), the
complex conj ugate of V i ,n is equivalent to Ui' ,i). a distinct representation,
Thus U M ) is equivalent to its complex conjugate only when j = j'. The
justification for this statement comes directly from equation (15.33) but we
must be careful to distinguish the operator Aq in the complex conjugate
representation which we denote by (Aq )* from the complex conjugate A: of the
operator Aq • Now, in the complex conjugate representation, X, and Y, are
replaced by X: and Y: so that
The addition of the space inversion I to the rotation group fJt 3 was a trivial
process. The inversion commutes with rotations, leading to the product group
0 3 = fJt 3 X S 2 (see section 7.4) and the irreducible representations of 0 3 are
labelled by the pair of labels j, n, where n = ± is the parity.
The addition of the space inversion I to the Lorentz group ff is not so simple
because I does not commute with the Lorentz transformations. However from
the separation (15.27) of an arbitrary L into a rotation R(a) and a boost Q(b)
we need only concern ourselves with the products of 1 with boosts. For the
particular boost Qz(b) one sees from the matrix (15.24) and the matrix
I =
(-~ -~ ~ ~)
o
o
0
0
-I
0
0
I
(15.39)
that IQz (b) = Q, ( - b)l, and it is clear that this will be true for any direction in
place of z. Thus lQ(b) = Q( - b)1 or in other words IQ(b)1 = Q( - b) showing
that 1 simply changes the direction of the boost, a result which might have been
expected. In terms of infinitesimals this means that
IXq = Xql, lY q = -Yql (15.40)
so that
Aqi = IB q and Bql = IAq (l5.4la)
which implies that
A2 I = IB 2 and B21 = IA 2 (15.4lb)
The structure and labelling of the irreducible representations of the group ff s
may now be deduced quite simply from these multiplication laws which are of
course valid not only in the 4 x 4 matrices but in all representations. As in the
preceding section, we still use the same notation A q , etc., to denote both the
infinitesimal 4 x 4 matrices and the corresponding operators in a general
representation. However, in line with the notation of section 4.1, we use the
notation T(I) for the transformations induced by the inversion operator I in a
general representation space.
Let us denote the basis vectors of an irreducible representation Lu.n of ff
r.
bye-:t.'m' with m = j,U -I), .. . , -j, m'= U' -I), .. ., -j'. Then using (15.41)
'*
Now so long asj' t . the vectors e:,{~, and!~~, must be linearly independent
and so, from the above equations, the set of 2(2j + 1)(2j'.+ 1) vectors e:,{~, and
r: with fixed j and j' is invariant under all the operators A q , B, and I of the
group !f's ' Hence they form an irreducible representation which we shall
denote by L~j.j'), although the order ofj and j' is irrelevant in this case . Clearly,
on restriction to the subgroup !f' we have the reduction
(15.44)
From equation (15.43) we see that the basis vectors e andfdo not have definite
parity. This may be achieved only by taking the combinations
(15.45)
which have even or odd parity depending on the sign ±. These combinations,
although not eigenfunctions of A2 or B 2, are nevertheless eigenfunctions of the
operator X 2 - y2 defined in equation (15.35).
A separate discussion is required for the special case of j = i ' since the
vectors e!.!m' and!!.!m are not then necessarily linearly independent. In fact we
shall show that the representation generated from e!.!m' is not irreducible if they
are linearly independent. This is done by using a reductio ad absurdum
argument. Suppose that !!.!m' and e!.!m' were independent, then the irreducible
representation which contains e!.!m' must also contain the basis vectors!!.!m' and
so have dimension 2(2j + 1)2. But since
T(I)(e:,{m ' +f:,{m ') = n: + e:,{'m
and as in equation (15.42), both e andftransform identically under A q and Bq ,
it follows that the subspace of (2j + 1)2 basis vectors (e:,{m ' +f:,{m ') is inva riant
under all the group operations. Thus the representation will have reduced,
contrary to the hypothesis that it is irreducible. The only escape from this
contradiction is thatf:,{m' = ce:,{m" for some constant c and since 12 = E then
c 2 = 1 and c = ± 1. Thus, for each i. there are two possible (2j + 1)2_
dimensional irreducible representations of !f's which we denote by L(j~ ) and
15.4 Space and Time 349
L(j- ) which are both identical with UM) for proper Lorentz transformation
and have the respective properties
for Ui +) (15.46)
and
Notice that the ± sign on U i± ) does not mean that all the basis vectors have
even or odd parit y. The equations (15.46)show tha t T(I) is not diagonal in the
mm'-basis. For example , the four-dimensional representation of the matrices L
is the Llt- ) representation. The basis y,ex' e e:,e,
in which it was set up actually
has T(I ) in diagonal form with three odd-parity vectors and one ofeven parity.
From equation (15.46) the matrix of T(l) in the mm'-basis would be
J)
- I o o
o -1
T(l) ~ ~ - 1 o
( o o
The pre vious two sections have been concerned with the (homogeneous)
Lorentz transformations, which leave the origin unmoved. We now consider
the inclusion of space - time translations with the Lorentz group which
generates the Poincare group f!J or, as it is sometimes called, the in-
homogeneous Lorentz group. We begin with a short section on the trans-
lations themselves before moving on to the Poincare group. The inclusion of
the improper elements of space and time inversion is left for later sections, as is
the discussion of the physical significance of the different representations of
the Poincare group.
the label k, or E. In fact the familiar expression (5.3) for a stationary state
simply reflects the fact that under time translations it transforms according to
the representation labelled by k, = E[hc, We note that the Schrodinger
equation (5.2) is obtained by equating the explicit form in otC/ for the energy
operator, deduced from P, to so me assumed form such as T + V for the energy
operator H. Notice that since P is a four-vector operator it follows that the
energy and momentum operators come together into a four-vector operator
(P." P,., PZ' H ie) with eigenvalues n(kx ' k)", k:, k,). In classical mechanics also ,
one sees that the lack of any explicit time dependence in the Hamiltonian leads
to the conservation of energy (see subsection 16.1.2).
In principle, k, may take either sign which would imply both positive and
negative energies. There are difficulties in assigning physical significance to
sta tes with negative energies; for example, why should the positive energy
states not decay into them'? The postulate that all negative energy states are
occupied has been made but was never sat isfactory. The modern interpretation
is to postulate that all free particles have positive energy since this is consistent
with observation. Nevertheless the representations corresponding to negative
k, have a role to play when one con structs a quantum field. We discuss this role
briefly in chapter 16.
with the Lorentz transformation acting first, followed by the translation. The
group f!I! is very similar to the Euclidean group of section 15.1 except that there
are now four dimensions and the rotations have been replaced by Lorentz
transformations. There are now ten parameters, six from L and four from P{t) .
We shall use the notation T(t, L) to denote the transformation induced in
representation space by the Poincare transformation P(t)L and, as usual , the
operations T(t, L) will satisfy the same multiplication rules as the correspond-
ing group operations.
352 Space and Time 15 .4.2
using equations (15.53), (15.49) and (15.22). Thus if Ik >is a basis vector then
>,
T(L)lk which must also belong to the representation, transforms according
to the representation of the translation group labelled by k' = Lk, and the
assertion above is proved.
The irreducible representations of f!} may therefore be first classified
according to the region of four -space in which the four-vector k lies and as we
saw in subsection 15.2.2 there are six regions . The structure of the repre-
sentation is quite different for each region so that the remaining two steps must
be discussed separately for each region.
Region (6), k= 0
°
This case may be quickly disposed of since with k = all basis vectors must
be invariant under translations and the group is effectivelyrestricted from f!} to
Y . The irreducible representations are then simply the Uj.j') described in
subsection 15.2.5.
Ik o ) . In the notation of subsection 7.4.2 there will then be a set of(2s + 1) basis
vectors with the property
T(R(a))lkosm s) = L. Db)mJm, (a) Ikosm~ ) (15.56)
m,
as well as the property (15.49).
Step (4). Starting from these (2s + 1) basis vectors we now argue that the
representation is generated by constructing all vectors of the kind
Ik sms ) = T(Q[h(k)] )Ikosms> (15.57)
where k isany four-vector in the same region as ko and having the same 'length'
and h(k) describes that unique pure Lorentz transformation which carries k o
into k, i.e.
(15.58)
One sees from equation (15.26) that the three-vector h(k) has magnitude
cosh - 1 (k,/k) and direction - k, where k denotes the three-vector part of k and
k, its fourth component.
To show that the set (15.57) with fixed t;
and s forms the basis of a
representation we consider the effect of an arbitrary group operation. In doing
this we shall need to use the result (15.27) that an arbitrary Lorentz
transformation may be written as a product of a pure Lorentz transformation
and a rotation. We have
T(e,L)lksm s > = T(e)T(L)T (Q[h(k)] )Ikosms> using equation (15.57)
= T(f)T(L')lkosms > where L' = LQ[h(k)]
= T(f)T(Q')T(R')lkosms) using equation (15.27)
= T(6)T(Q ')L:D~!m
m S J
(R ')lko sm~ ) using equation (15.56)
J
but this result was to be expected from equation (15.55). The parameters of the
rotation R' may be deduced for any given k and L from the defining equations
L' =Q'R ' (15.60)
and
L' = LQ[h(k)] (15.61)
but we shall not go into any details.
354 Space and Time 15.4.2
The equation (15.59) shows that the set (15.57) of basis vectors is invariant
under the general Poincare transformation, and so forms the basis for a
representation which we might denote by plto.S). It may be shown, as for the
Euclidean group in subsection 15.1.3, that representations generated from any
future time-l ike vecto r k with the same ' length' as ko are equivalent to plto..').
The inequivalent representations are therefore denoted simply by pCk.s) where
the index k denotes the 'length' of all the vecto rs k which occur in the basis
vectors Jksms ) ' The proof that the representations p tk . S) are irreducible
follows the argument referred to in subsection 15.1.3 for the group cff J . Because
of the infinite var iety of k with fixed length, these representations are infinite-
dimensional.
and
[X, - Y)" X}' + Y,] = °
The operator X, generates the group ~ 2 of rotations about the z-axis and we
now show that the commutation relations (15.63) are precisely those of the
Euclidean group Iff2 which describes rotations and translations in a plane.
From equation (15.15) and since translations commute with each other, we
deduce the following commutation relations for the infinitesimal operators
P " P, and X, of Iff2'
(15.64)
These are identical with equations (15.63) if we make the following associ-
ations: X, --+ Xl' X, - y y --+ P x' X, + Yx -+ p}" Thus the subgroup that we are
seeking, which leaves ko invariant, is isomorphic with Iff2 so far as infinitesimals
are concerned, and hence has the same irreducible representations. Physically,
the two groups are quite different although they do have one operator X, in
common.
15.4.2 Space and Time 355
L = a., o, L (15.66)
where L belongs to the Iff2 subgroup which leaves t;
= (0,0, 1, 1) invariant. To
show this result, which is analogous to equation (15.27), we define k = Lk o and
define R,yQz by k = R,yQzko. Then we see that
(R,yQz)-1 Lk o = (R,yQz)-1 k = ko
showing that the product (R,yQz)-1 L leaves ko invariant and therefore that L
in equation (15.66) belongs to the C 2 subgroup.
Following step (4), we now construct the representation of [Jj! corresponding
to null k by generating the set of basis vectors
A Ikm> = T(RxyQz) Ikom > (15.67)
with any future null k. To show that this set provides a representation we
consider the effect of a general Poincare operation
T(f, L)lkm>= T(f)T(L)T(RxyQz)lkom > from equation (15.67)
= T(f)T(L')jkom > defining L' = LR,yQz
= T(f)T(R~yQ ~)T(L')lkom > using equation (15.66)
356 Space and Time 15.4.3
One quickly deduces that k' is given simply by k' = L k but the value of the q( 2
rotation angle a must be deduced in detail from its defining equations. L' is
known from k and L and the factorisation (15.66) of L' then gives L', an
element of ,ff 2 ' But elements of S 2 are products of a 'translation' with an q( 2
rotation. The 'translation' is irrelevant here since Ikom > is 'translation'
invariant, see equation (15.65).
The equation (15.68) shows that the set (15.67) of basis vectors forms a
representation, which we denote by pIO .m). It may be shown, as indicated
before, that this representation is irreducible and that the representation
generated from an arbitrary future null vector k o is equivalent to the pIO ,m)
deduced above for the simple choice k o = (0,0, 1, 1). Remember that the index
m, which arose from the group q( 2 may take the usual values
m = O,±!-,± 1, . . " etc.
Thus for p(l. s) we may choose the basis vector IkoSms ) corresponding to
ko = (0,0,0, k). Then Pqlkosms ) = 0 for q = x, y and z, while
P,lkosms) = - iklkosms ) ' Hence from equation (15.72)
W,lkosms ) = 0
and Wqlkosms ) = -ikXqlko sms )
so that
W. Wlkosms ) = k 2X 2lkosms ) = - k 2s(s + 1) Ikosms ) (15.74)
2
We have used the fact that X is just the Casimir operator for the .eJt 3 subgroup
whose irreducible representations were labelled by s. The value s(s + 1) for the
91 3 Casimir operator was deduced in section 7.4 and the negative sign comes
from the factor i introduced in section 7.4 when the Hermitian operators J,
were defined as J, = iXq. The result (15.74) is valid for all vectors belonging to
p(k ,s),
For the irreducible representations p(O.m) both Casimir operators P.P and
W.W have the value zero. The latter result follows by choosing the particular
basis vector with k o = (0,0, 1, I) whereupon
W,lkom ) = -i(X,-YY)lkom ) =0,
W y Ikom ) = -i(X y + Y,) Ikom ) = 0,
Wzlkom ) = -iXzlkom ) = -mlkom)
W,lkom ) = -iXzlkom) = -mlkom ) (15.75)
using equation (15.65) and hence W.WI~o m ) = O.
Since both Casimir operators are zero they cannot serve to distinguish
representations plO ,m) with different m. However, in a sense, it is possible to use
the ratio of W to P to distinguish between them. For the particular vector
jkom ) the equations (15.75) imply that, since Plkom ) = - iklkom ) ,
Wlkom ) = - imPlkom )
In other words we may write
(W + imP) Ikom ) = 0 (15.76)
However since all basis vectors in the representation are generated by Lorentz
transformations (15.67) from Ikom) and since both Wand Pare four-vectors,
transforming in the same way under Lorentz transformations, it follows that
the relation (15.76) is true not only for the special vector Ikom), but for all
Ikm) in the representation p(O ,m) (see problem 15.9). Thus the relation (15.76)
may be used , in addition to P.P = 0, to characterise the representation p(O ,m) .
It says that the four-vector operators Wand Pare in the same 'direction' with a
proportionality constant - im which depends on the representation. Notice
that the relation (15.76) came from a consideration of the future null vector
ko = (0,0, 1, 1) and it is therefore only applicable to the representation
15.4.4 Space and Time 359
t.
referring to future null For the representation relating to past null ii we must
choose (0,0, I, - 1) in place of (0,0,1 ,1). This slightly modifies equation
(15.76) and leads to the relati on
(W -imP) lii m > = 0 (15.77)
for all past null Ii.
The proportional ity that we have established between Wand P is not
unexpected since, ifW and P were ord inary four-vectors rather th an operators,
the relation s W.W= 1'.1' = 0 together with W.P = 0 impl y that
(W - .,;1') .( W- .,;1') = 0 for all x, so that by taking z = Wr/P, we conclude
that (W - '1:P) .( W - .,; P ) = 0 a nd hence that W = .,; P a nd ind eed W= '1:1' for
thi s value of '1: .
Taken together, the sum and integral run over all basis vectors Iii sms > of p lk. s )
a nd the v; (ii) are the numbers which determine I'" >. To co ver all ii in four
dimension~ with fixed 'length' k , one needs a triple integral a nd there is
naturally some freedom in choosing the three integration va ria bles. It is
t.
convenient to choose the three components of th e space part k of From the
relation k 1 = - Ik 11 + k,1 a nd th e cond ition of future time-lik e ii one sees that
ii is determined by k and th at k mu st range over all three-space. The factor k,- 1
has been included in equation (15.78) for convenience because, as we show
below, the volume element i ; 1 dk is Lorentz invariant. One might have
omitted the factor k,- 1 , essentially absorbing it into the coefficients e m (ii).
The scala r product is now defined, following equat ion (15.20) as '
with the same volume element as above. The proof that the representation
p tk.s ) given by equation (15.59) is unitary then follow s as in subsection 15.1.6
but we shall not write it out. In that subsection we made use of the fact that if
k ' = Rk then the su rface element df2 k could be replaced by df2 k ,. The
analogous problem here is to tran sform from an integral dk to an integral dk ',
where k' is the space part of ii' = Lii . This requires calculation of the Jacobian
?W, k;. k~ ) / c(k , k)" k: ) but since we may write L = QR, from equation (15.27),
and the Jacobian for a rotation is unity, we need only consider a boost in an y
direct ion . If we choose the z-d irection then x and yare unchanged so that the
360 Space and Time 15 .5
Jacobian is simpl y ak ~ /akz ' From the explicit transformation (15.25) for such a
boost
(15.80)
Thus
but since we ha ve
giving
a k~ /a kz = (I-fJk z/k ,)/(I-fJ 2)1
= k,- Ik; using equation (15.80).
Thus the volume element dk ' may be replaced by k,- Ik;dk or , in other words,
k,- Idk ' may be replaced by k,- Idk. Essentially the same scala r product may be
used for the representations p to. m., except that the sum over m s is no longer
required and m, is replaced by m.
As we have seen in subsection 15.4.1 the vector k label s a sta te with definite
momentum and energy. Thus if th e numbers 1/1 m. (k) ar e normalised so that
<1/1 LI/I > = 1 we may interpret 11/1 m. tk W as the probability densi~y a s a function
A
of k and ms for given k. One so metimes refers to the 1/1 m (k) as the wave
function s in momentum space. We warn that in non-relati vistic treatments it is
usual to define momentum space wave function s normalised with the k,- I
factor missing from (15.79 ). Thi s is a matter of con venti on.
We note that, so far as the ba sis vecto rs Iksms >themselves a re concerned, th e
above defin ition of scala r product impl ies that
(15.81 )
where (j(k - k ') den ote s the three-dimensional delta-function. The function
1/1 m, (k) which describes one of the basis vecto rs itself, say 11/1 > = Ikrn s >is given
simply, see equation (15.78), by
I/Im.(k) = bm,rh.b(k -k)k, (15.82)
The addition of the space inver sion I to the Poincare group makes only a slight
change to the repre sentations. We now study this extension of the group,
following quite closel y the methods used in section 15.3. We have already
di scussed in sectio n 15.3 the fact that I co mmutes with rot ations but not with
Lorentz boosts and the mult iplication rule
IQ(b) = Q( -b)I (15.83 )
was deduced . The mult iplication rule with translations follows directly from
15.5 Space and Time 361
The actual representation matrix for proper elements T (t, L) is given as before
by equation (15.59), while for the inversion
T(I) lksms ± ) = T(I)T(Q[b(k)]) 1k osms ± )
= T(Q[ -b(k)])T(I)jkosms± > using equation (15.83)
= ± T(Q[ -b(k)])lkosms± > using equation (15.85)
= ± Ilksms ± > (15.87)
In the last step we have used the fact that if b denotes the boost parameter
required to carry t; into k then - b will carry ko into Ik, see equation (15.26).
The important point about equation (15.87) is that T(I), acting on a general
basis vector Iks ms ± ), leads to another basis vector of the set (15.86). Notice
that T(I) is not a diagonal operator. Not surprisingly, T(I) changes the space
direction of k and , as we shall see later, in section 15.7, the ± refers to an
internal (or intrinsic) parity of the system .
The arguments used above are not applicable to the representations p(O.m),
since the special vector in this case, t; = (0,0, 1, 1), is not invariant under
inversions. In fact we shall find that the inversion operator T(I) transforms a
basis vector of the representation p (O.m) into a basis vector of P(o. -mI. It will
then follow that the representation space of p lO.m) cannot remain invariant
under the extended group :?Js but that, because T(I)T(I) = 1, the sum of the
vector spaces ofplO,m)and P(O, -m ) is invariant under e', and therefore provides
a representation of :?Js which must be irreducible. We denote it by p (o.lmll. To
justify these statements we shall need to use the relations
(15.88)
362 Space and Time 15 .6
which follow directly from equation (15.84) and which lead to the results
w.r = IW q , W,I = -IW" [W . W, I] = [P . P, I] = 0 (15.89)
Now if Ikm > belongs to the representation pI O.m ) of the group f!/ so that
equation (15.76) holds then we see that the vector defined by T(I)lkm > has the
following properties
W . WT(I )lkm > = 0
P .PT(l) lkm > =0
(W q- imPq)T(l ) lkm > = T(l )(Wq + imPq)lkm> = 0
(W,- imP, )T(l) lkm > = -T(l)(WI +imP,)lkm > =0
so that (W - imP)T(l)lkm > = ° (15.90)
Furthermore,
P(f)T(I) lkm > = T(l) P(lf) Ikm >
= exp(ik . If)T(I )lkm >
= exp(ilk . t )T(I) lkm >
so that the vecto r T(l) lkm > transforms under translations according to the
representation TOk) of the translation group. Since by definition, k is future
null , so is Ik and thus, comparing equation (15.90) with equation (15.76), we
conclude that T(l)lkm > belongs to a representation pl O. - m) of f!/ . Thus, unless
m = 0, the vectors Ikm > and T(I)lkm > are linearly independent. We may thus
write T(l) Ikom > = IIko- m > so that the set of Ikm> with Ik - m >, generated
from equation (15.67) for all future null k, provide the basis for the
representation p (O.lmp of the extended group f!/s ' It follows generally that
T(l ) lkm > = Il k -m ) (15 .91)
and hence T(I) Ik - m ) = IIkm ). Together with equation (15.68), used within
each of the representations p I O.m ) and p lO. - m) for group elements T(f, L), the
°
representation p (o.lml ) of f!/s is thus determined.
The case m = is an exception since T(l)lkm ) and Ik - m ) both transform
according to the same representation pIO.O) of f!/ . Then, as with the
representations plk. S), one obtains two representations pI O.O) :!: of f!/s depending
on whether T(l)lkoO) = ± llk oO ). We shall not go into any details.
Having discussed the space inversion I let us now consider the time inversion
operator I, defined by l,e = e' = (x, y, z, - ct). As in the previous section the
addition of this operation to the Poincare group generates an extended group
which we denote by .'lJ,. We see directly from its definition that the products of
15.7 Space and Time 363
showing that the new vector T(l,)lk ) transforms according to T(I,k) under
translations. In other words, the time component of k is reversed. But we have
seen in section 15.4.1 that the physical significance of this component k, of k is
the energy, E = lick, . Thus the time inversion operation as given by equation
(15.93) interchanges positive and negative energy states. There is however no
evidence that this is a symmetry operation or indeed any evidence for the
existence of negative energy states. For this reason we shall not pursue the
representations based on (15.93).
We shall nevertheless return to the group ;J}I, in subsection 15.7.4 in
discussing time reversal, which is known experimentally to be an almost
universally valid symmetry operation. This operation, which corresponds
physically to a reversal of the direction of motion, is obtained by seeking a
representation of the group element I, which avoids the difficulties which
follow from equation (15.93). In deriving (15.93) we have assumed that T(l,) is
a linear operator in the representation space of the vectors Ik) . If one takes the
operators T(l.) to be 'a ntilinea r' then the appearance 0 f negative energy states
is avoided. Because new concepts are involved we defer the discussion of time
reversal until subsection 15.7.4.
If we wish to include both the space inversion I and the time inversion I,
defined by I,e = (x, y, z, - ct) then to form a group we must also include the
product II, which changes the sign of all four components, II,e = - e. Let us
denote the group formed in this way by .9's,. We note that the product II,
commutes with all Lorentz transformations but not with translations.
15.7.1 Mass
The irreducible representations of the Poincare' group are characterised
firstly by the Casimir operator P.P whose value, from section 15.4.3, is given
by - k 2 in representations p 1k • s ) and by zero in p<O .m l . In the representations
corresponding to space-like t, which we did not describe in detail, P.P would
be positive. For reasons which become clear as we proceed we associate the
operator P.P with the square of the mass . Precisely, the operator for the
'sq uare of the mass' is defined, including appropriate dimensional factors, to be
- Ji2 P.P/c • Thus if M
2 2
denotes the value of the 'square of the mass' then in
representations p(k .s) we have M 2 = Ji 2k 2/C 2, while in p(O .ml we have M 2 = 0
and for space-like t, M 2 < O. With this interpretation, the mass would be
imaginary for space-like ii, would be zero for pIO .m ) and take the value
M = hk]: in plk.s). (Notice that it is M 2 which has significance, rather than M;
a situation rather similar to angular momentum where J .J is the Casimir
operator with value J(J + 1) but we use the label J.)
Let us now give some justification for using the word 'mass'. Since we have
earlier given physical significance, momentum and energy, to the four
components of P, the very definition P. P = P;- L p~ implies a relation
q
between the eigenvalues of these operators. In plk.s) the relation becomes
i.e. (15.94)
15.7.1 Space and Time 365
making use of the definitions of p and E in subsections 15.1.1 and 15.4.1 and
with M defined above. (This is the familiar relation between energy and
momentum in classical relativity theory, see section 16.1, and the word 'rest-
mass' is sometimes used for M .) If the momentum is small, p ~ Me, then
(15.95)
which gives the energy as the sum of the rest-energy Me 2 and the kinetic energy
p2/2M = tMt: 2.
So far as the label k is concerned, the different vectors Iksm s > in the
representation plk. S) correspond to different states of motion of a particle (or
system) with mass M = ilk/c. Not only may the momentum p = hk be in any
direction, but it may have any magnitude with corresponding energy given by
equation (15.94). Thus, even if we restrict to s = 0, the single representation
plk .O) encompasses all possible states of motion of a classical particle of mass
M. We shall see presently that the label s relates to the internal angular
momentum or spin of the particle.
Before discussing zero mass we briefly consider the representation cor-
responding to k = 0 and referred to in subsection 15.4.2 under region (6). This
would not only have zero mass but also zero energy and momentum in all
states, i.e. in all frames of reference. No such object has been observed and we
discuss the possibility no further, noting however that the point k = 0 must be
excluded from the representation p lO.m l .
Let us now turn our attention to the representation p I O.m) corresponding to
zero mass . This seems a rather unl ikely description for a particle until we notice
that it may nevertheless have finite energy and momentum satisfying, from
equation (15.94), the relation
E = pc (15.96)
Thus, although having zero mass , the object acquires some physical properties
and in fact such particles are well known in physics. In the quantum treatment
of the electromagnetic field, the interaction takes place through the exchange
of photons which need to have zero mass . Also, in fJ-decay one finds neutral
particles emitted which have zero mass and are called neutrinos.
Whereas for a particle with finite mass, there is always a state with p = 0, in
which the particle is at rest, with E = Me 2 , this is not the case for a zero-mass
particle. One cannot set p = 0 without moving to the limit point k = 0 which
defines a different irreducible representation in which all states of the particle
have zero energy and momentum as described above. Classically this implies
that zero-mass particles have velocity t: = e because as we have seen in
subsection 15.2.3, the velocity is then v = e in all frames of reference and no
state of rest is possible. (Another classical argument supporting the statement
that v = e is that with zero mass a particle can acquire finite energy and
momentum only in the limit of v = c, see equation (16.10). )
366 Space and Time 15.7.2
15.7.2 Spin
Having seen how the first Casimir operator P.P is associated with mass and
the components of P are associated with momentum and energy we now turn
our attention to the second Casimir operator W.W defined in subsection
15.4.3. The interpretation of W.W takes rather different forms for finite and
zero mass and we describe them separately.
Finite mass
Consider first the case of finite mass, described by the representations p<k. s).
From equation (15.74) we recall that the number s, which may take values 0, t
1,1, .. . labels the eigenvalues -k 2 s(s+ 1)on".w for given k. In terms of the
mass, introduced in equation (15.94) the eigenvalues of W .Ware
-M 2 c2 s(s + I)/li 2 (15.97)
using the va lues for b(k) given after equat ion (15.58)
{Wz- (k,+ k)- lkz W,}!ksms >= - msklksms> (15.100 )
°
Experimentally, one find s that the electron, proton and neutron all have
x = 1 while n-me son s have s = and p-meson s s = I. Co mplex systems like
a toms and nuclei ca n ha ve mu ch greater values for s but al ways an integer or
half-integer.
Zero mass
-(X . P)Jkm >= - W, Ikm >= imP, lkm >= mk , Ikm >= mlkllkm >
(15.101)
now no (25 + I)-fold degeneracy of different spin orientations. In fact, even the
state with opposite spin - m would belong to a different irreducible
representation of the group f!J! namely P'? : - ml rather than p<o .m). A particle
with positive m is said to have positive ' helicity' or to be right-handed in the
usual sense ofa screw . We discuss the significance of heIicity in the next section
in connection with parity. However, there is a simple physical argument which
shows how this right- or left-handedness may exist in zero mas s particles but
not in finite mass particles. The helicity of a particle will be rever sed in the
frame of an observer travelling faster than the particle, compared with that of
an observer moving more slowly . Thus, for a finite mass particle which travels
at speed v < c anyone-handedness would violate Lorentz invariance. How-
ever, for zero mass , the particle must travel at speed v = c so that no observers
may travel faster than the particle and the contradiction cannot arise .
Of the known part icles with zero ma ss, the photon is found to have both
m = ± 1 while the neutrino and the antineutrino have m = - i and m = i,
respectively. (See also the discussion of antiparticles in subsection 16.3.5.)
15.7.3 Parity
Finite mass
It was shown in section 15.5 that, when the group:!J is extended to fl/ s there
are two irreducible representations p(k,s) ± of f!I's for each representation pCk , s )
of f!J'. The inversion operator was shown in equation (15.87) to have two effects ;
it changed the direction of k and it gave a factor ± 1 in p(k,s) ± , respectively. The
first of these effect s gives the expected reversal in the direction of motion while
the second applies even to a particle at rest. Thus one refers to the ± factor as
the 'intrinsic parity' of the particle, even or odd, which , like the spin, is
independent of the state of motion and is a characteristic of the particle. It may
be imagined to refer to some 'internal' structure of the particle.
In fact it is possible to determine the absolute intrinsic parity of a particle
only in rare cases like the nO. Usually, two or more changes occur in a reaction
so that only the relative intrinsic parity may be deduced . Furthermore, because
of what are believed to be absolute conservation laws. such as charge
conservation, certain classes of transition are forbidden , preventing the
determination of even the relative parities between certain sets of particles. The
absolute intrinsic parity of the n O may be found from a reaction like
p + p -+ p + P + nO, and a detailed analysis of the results of such experiments
shows that the nO has odd parity. Since they have a charge it is not possible to
devise an experiment to measure the absolute intrinsic parity of the n ± but
15.7.4 Space and Time 369
because they are part of the isospin triplet (T = 1) with the nO (see sect ion 10.2),
it is natural to assign an odd parity also to the 7l - . The relative parity of
neutron and proton is then found to be positive from the analysis of reactions
like p + p -+ n + p + 7l + and it is convention to choose them both to have
positive parity. It is crucial to the argument above that the reactions proceed
via the strong nuclear forces which are known to conserve parity.
Zero mass
For a particle with zero mass we recall from sectio n 15.5 that the irreducible
representations of the extended group # , are made up of two irreducible
representation s of !J, plO, lmll = pIO. m) -+- plO. - m) and the space inversion
carries a vector o f one into a vector of the other. This is to be expected since the
inversion commutes with the spin and thus a change in direction of k must
imply a change in helicity. One cannot assign an intrinsic parity here as we did
for finite mass because the irr educible representations PIO. lml)of the group :Y's
do not carry such a label ±. Physically, the intrinsic parity cannot be studied in
the sta te of rest since no such state exists for zero ma ss.
The fact that the photon is found experimentally in both m = ± 1 states
suggests that it is described by the representation plO.11 -+ p lO . - I ) of the group
.:J's or in other words that it has :1/, as its symmetry group. On the other hand
for the neutrino which is found always in the m = -1 state, the appropriate
group is q> excluding the inversion. Thus whereas parity is conserved in
electromagnetic interactions, involving the exchange of photons, one finds
experimentally that parity is not conserved in processes like fJ-decay which
involve neutrinos. It is true that antineutrinos, with m =1, exist also but they
must be regarded as different particles since they do not occur in the same
reactions as neutrinos, see subsection 16.3.7.
15.7.4 Time-reversal
where we have used the second of equations (15.104) and the elementary result
I,k = - Ik, where I is the space inversion. Thus, under translations, the vector
Ylk >transforms according to TOk) and the d ifficulty with negative energies is
avoided. The operator Y also has the desired effect of revers ing the direction of
k . Although time-reversal is sim ilar to space inversion we cannot assign a 'time-
reversal parity' with any phy sical significance. This stems directly from the
I
antilinear nature of Y. For suppose that r/J > was an eigenfunction of I with
;.1
eigenvalue X, so that II r/J > = r/J >. The antilinear nature of Y now implies
that for any phase '1
Texp (iIOI r/J > = exp( - ilJ)II r/J > = exp( - il/)}·I r/J >
= [exp( - 2il/p.] exp(ilJ)Ir/J )
wh ich follows from the 4 x 4 matrices. We shall assume as usual that the
representation operators satisfy the same law. In words, thi s says that the time-
reversal operator Y commutes with rotations and changes the direction of the
boosts . From the very nature of the physical operation of time-reversal, the
square 1 2 must reproduce the original ph ysical system. Thus
y 21r/J > = exp(itx)Ir/J ) , where tx is any phase. Combining this with the anti-
unitary condition (15.104) leads to the further restriction (see problem 15.10)
that 1 2 \r/J ) = ± Ir/J >. For a strict, single-valued representation we should
take only the plus sign since the group element satisfies 1,2 = E. However, if we
allow double-valued representations (like the half-integer representation of
9!' 3) then the minus sign is permitted .
Finite mass
In studying the behaviour of the basis vectors Iksms >of the representation
plk.s) of f!J when the new element I is added we follow the technique of section
15.5 and look first at the special vectors Ikosms> with t,
= (0,0,0, I). From
equation (15.105) the vector Ilkosms > has the same label k = ko . Because Y
commutes with rotations one might have expected that the vector I lkosms >
would have the same value of ms but this does not take account of the
a ntilinear nature of I . Becau se the rotation operator X= has an imaginary
372 Space and Time 15.7.4
eigenvalue, X,lk!Jsms>= - imslkosm. >, see for example section 7.3.5, we have
X,Ylkosm. > = YX,lkosm. > = 1'( -im.)lkosm. >
= im.Ylkosm. >
which shows that, under rotations about the z-axis, the vector Y lk osm. >
transforms like - m. although it is easily seen that the value of s is unchanged .
In fact, for a general rot ation
T (R(a ) )Y lkosm. > = YT(R(a) )Ik osm. >
= I . r DIs)
msm,
( a )l ko s m~ >
m,
(15.107)
1'!kosm. > transform preci~ely like th e set (-1 )m, lkos - m' >'AThere are now
two possibilities: either 1'lkosm.> is linearly independent of Ikos - m.> or we
may write Ylk osm.> = y( -l)m,!kos - m.> with so me con stant y. Because we
are looking for an irreducible representation of gJ, the first possibility can be
dismissed since it can be shown, following the technique of section 15.3, that
the representation of gJ, generated from Ikosm.> would not then be
irreducible. Returning to the second pos sibilit y it is usual to choose the phase
factor y = ( - 1)' so that, even for half-integers, the factor ( - 1)' + m, is real.
Having seen that the (2s + 1) vecto rs Ikosm. > with fixed ko and s form an
inva ria nt space under rotations and time-reversal, we generate the remaining
bas is vectors of the representation of gJ, by usin g the boost equation (15.57). It
then follow s that for the general basis vector of the representation,
Ylk sm.> = YT(Q[b(k)] )/kosm.>
= T(Q[ -b(k)])( -l)s+m,lkos - m.>
= (- 1 )s+m,IIks - m.> (15.108)
showing how l' changes the sign of both the momentum and the spin. We see
from thi s equation that
1'2lksm. > = (_l)s+m,( _l)s-m' lksm. > = ( - 1)2Slksm. > (15.109)
showing that 1'2 = 1 in a representation with integer spin and 1'2 = - 1 for
half-integer spin. This conclusion is independent of the choice of the phase
factor y. From equation (15.108) we may immediately deduce the effect of T on
an arbitrary vector 11/1 > as defined in equation (15.78); the new vector
15 .7.5 Space and Time 373
11/1 ' > = 111/1> is given by the coefficients 1/1'm, (k) = (_1),,-m'I/I ~ m..(-Ik). The
time-reversal operator may be written as a product of a unitary operator and
an operator K which takes the complex conjugate of the c-nurnbers, i.e. the
expansion coefficients in some basis , but the explicit form of I expressed in this
way must clearly depend on the basis . Ifwe note from equation (20.40) that the
spin rotation R SY ( - z ) transforms m" into - ms with a phase change ( - 1)" + m,
A A
and if we define T, (I )asan operator which changes k into Ik, i.e. which changes
t.
the sign of the space part of then in the basis Iksm,,> the transformation
(15.108) shows the equivalence I = K R ~( - n )Tdl). However, in a coordinate
representation it is given by I = KR ~,( -n)T(I,) where T(I,) replaces t by -1.
=
When s = 1. R~, ( - rr) 2is)..
Zero mass
The multiplication law (15.106) implies that IX = XI and IY = - YI,
while equation (15.92) gives IP = PI and IP, = - P, I . Thus YW = - WY
and YW, = W, I . Now consider a basis vector belonging to the representation
p'O.m) of &>, which isAappropric;,te for zero mass . Then starting from equation
(15.76) we have Y(W + imP) Ikm > = 0 so that, making use of the multipli-
cation rules given above and the antilinearity of I ,
(-W- imP)llkm > =0
a nd
(W,+imP,)llkm > =0
or in other words
(W+imP)Ylkm > =0 (15.110)
Thus we conclude that the vector Ilkm > also belongs to a representation
p,O ,ml of &> and, using equation (15.105), that it transforms like Ilkm > under the
group flJJ. Again we can argue that if Ylkm ) and IIkm > are linearly
independent then the representation of f!J, generated from them will reduce.
Thus for an irreducible representation of f!J, we must have
(15.111)
2
where we have chosen the phase factor to be + 1. Notice that y = + 1 in this
case, irrespective of the choice of phase. The interpretation of equation (15.111)
is again that both momentum. and spin are reversed but, since m is the spin in
the direction of motion, its value is unchanged . From the point of view of the
representations we conclude that the vector space of p'O .ml remains invariant
when Y is added to the group f!J and therefore provides also the basis for an
irreducible representation of f!J, .
(2) Reactions
In a collision process, the time-reversal operation will interchange the role of
the incoming and outgoing particles. An assumption of time-reversal in-
variance therefore relates the transition probabilities of inverse processes, a
property known as the 'principle of detailed balance'. In an interaction process
which may be treated by perturbation theory it is po ssible to deduce relations
between matrix elements within the same process. To illustrate this point we
must classify operators Q as even or odd under time-reversal, according as
'Q = ± Q', respectively. For example we have seen above that X and Pare
even while Y is odd. Remembering the factor i which enters in passing from
these infinitesimal operators to the Hermitian operators representing angular
momentum and momentum , we see that j, sand p are odd, as one would
expect , while for example rand s .1 are even . (Remember however that one
cannot cla ssify wave functions as even or odd under time-reversal.) For the
matrix elements of even or odd operators, respectively ,
The group e;
Since the extension from the group & to !?I, did not lead to any change in the
vector space of an irreducible representation, it follows that the extension from
&, to !?Is, which includes both space inversion and time-reversal, will be just
like the extension from !?I to !?Is described in section 15.5. In other words we
may simply include with the results of section 15.5 the transformations
deduced in this section for the operator I . The transformations for the new
product element IT(I) follow directly from those for the two factors.
Our approach throughout this chapter has been to study the irreducible
representations of the Poincare group. This tells us the transformation
properties of the functions which belong to each representation. A belief in
Poincare' invariance then led us to identify the representation labels with
376 Space and Time 15.8.1
invariant properties of the particles, like mass and spin . By comparing with the
properties of observed particles we associated each particle with an irreducible
representation in the same way that in non-relativistic theory we associated
each eigenstate of the Schrodinger equation with a representation of the
symmetry group of the Hamiltonian. However we have never constructed the
wave function explicitly . The philosophy here follows that used in discussing
the group f7l 3 , namely that one can deduce the properties of the basis vectors
Ijrn> of the representations Olj) without specifying their detailed form . One
thereby achieved the greater generality that the results were applicable to any
spherically symmetric system . For example the wave functions Urn> have the
same rotational properties whether they refer to one particle or any number of
particles. In the case ofa single particle, however, one may construct explicitly
the angular dependence of the wave functions by introducing the explicit
differential form for the rotation operators. The Casimir operator is then a
differential operator and its eigenvalue equation provides a wave equation
whose solutions are the basis vectors Ijrn> of the irreducible representations.
In this section we first elaborate the above remarks for fJi 3 and then use the
same idea in the Euclidean and Poincare' groups to deduce the famous
equations of Klein - Gordon, Dirac, Weyl and Maxwell. Although for free
particles these equations carry no more information than we have already in
the irreducible representations they are useful when interactions are in-
troduced, see subsection 15.8.2, and when a theory of quantised fields is
constructed, see section 16.3.
Recall from section 7.5 that the fJi 3 Casimir operator is J 2 and it has
eigenvalue jU + 1) in the representation Ow, where j = 0,1, 1,~, . . .. The
explicit differential form of the infinitesimal operators J, = - i(r 1\ V)q was
given in subsection 7.4.5. Thus the basis vectors Ijrn>
must satisfy the 'wave
equation'
(15.114)
Since rotations leave the length of r unchanged we may regard r as a constant in
this study of the basis vectors of Ow. In this case, from equation (7.57), we see
that the equation reduces to
(15.115)
Thus the behaviour of the basis vectors Urn> could be found by solving the
wave equation (15.115). In fact this is the Schrodinger equation for the free
motion of a particle on a sphere of radius r and its solutions,see section 7.5, are
the spherical harmonics for integer j, namely Y ~)((),¢).
To construct wave functions with half-integer j we must extend equation
(15.115) by introducing a framework with several components as described in
section 8.4. If, in particular, we take two components, transforming like Om,
15.8.2 Space and Time 377
where we have inserted the value i for 5 2. The terms which do not involve 5 are
understood to contain the 2 x 2 unit matrix and the wave function Um> has
two components
for I = j-1
but since the spin matrices satisfy s; = t, S,Sy+ sys, = 0, etc., it follows that
(V .S)2 =tv2 so that by comparison with equation (15.117) m 2 =t
and
m = ± 1. Thus, in the two-component framework , we may construct basis
vectors belonging to Elikl.!) and E( lkl. - I). From the explicit forms (8.15) for the
spin matrices one deduces that the appropriate basis vectors (un-normalised)
are
(-kx+ik y ) e''k .r
and Im = - -1 )
2
=
Ikl+k z
(15.119)
with the usual convention for the two components, i.e. s, diagonal.
which is sometimes called the Pauli equation. For a uniform magnetic field we
may take A = 1 B /\ r, where B = V /\ A denotes the constant strength and
direction of the field. Then, using the properties of the spin matrices again and
remembering that V does not commute with A this equation (15.120) reduces
to
[V 2 + Ikl2 -(ejlie) {B .(1 + 2s)} Ilv >
= 0 (15.121)
where we have omitted terms of second order in B. Comparing this with the
15.8.3 Space and Time 379
(15.122)
°
To construct a wave equation which is appropriate for s # we must again
introduce several components. In group-theoretical language this is equivalent
to the construction of product representations. For example in discussing the
representations ElIkl,±1) of g 3 in subsection 15.8.2 we were effectively
constructing the product representation ElIkl.O)x E(0.1) == E(lkl,O)x 0(1) so that
the infinitesimal operators were sums of the differential operators relating to
E(lkl,O) and the matrix (spin) operators relating to 0(1). We found that the
+
product reduced to the sum ElIkl.1) ElIkl, -t). The same technique will now be
followed to describe the representations P(k.!) of the Poincare group and to
deduce the Dirac equation.
A natural choice for the product representation in the s = 1 case will be
°
ptk . O)X U 1,0) where the second factor is the tr anslation-invariant repre-
t.
factorisation in which the matrix is independent of This contrasts with the
basis Ik!m s> used for the representation p(k.1) in subsection 15.4.2, where the
corresponding equation (15.59) may appear to factorise but the rotation R'
depends on k through equations (15.60) and (15.61). In fact we may transform
from the basis Ih > to the basis I~!ms > and this verifies that the product
representation p(k. 01 X Ut. 0) is equivalent to p<k.!l. The transformation is
using also the inverse of equation (15.125). But because U !·O) is a group
representation the matrix here is simply u tO)(Q'-1 LQ) and from equation
(15.60) this isjust U 1'0)(R') = D(!)(R'). Thus the states constructed in equation
(15.125) have the correct transformation property (15.59).
The basis I~n > thus provides a simple two-component description
for a particle with spin s = 1- The wave equation would be simply the
Klein-Gordon equation
(15.126)
where II/J > is now a two-component wave function . The role of this equation is
to provide the plane-wave factor (15.123) appropriate to the factor p(k . 0) in the
product representation. Note that the wave equation (15.126) contains no spin
operators which says that no constraint is being put on the 'direction' ofli/J > in
the two-component space.
As far as the second Casimir operator W.Wis concerned it is easily shown
(see problem 15.12) that in the product representation plk .O)X U t.O) one has the
equivalence W.W = i P.Pwhich is to be expected for the representation plk. Jl
from equation (15.74). However, in the presence of an electromagnetic field the
substitution of ~ + eA/c for' in W.W gives rise to a new term in the
Klein - Gordon equation which couples the electron spin to the field.
The two-component description given above for a particle with spin s = ! is
unsatisfactory for two reasons. Firstly the transformation (15.124) is not
unitary (recall from subsection 15.2.5 that the representation U 1.O) of the
Lorentz group was not unitary). The unitary representation could only be
recovered by returning, through the transformation (15.125) to the original
basis Ik!m s> for the representation p(k.!) and thus losing the factorisation. The
second reason is that the two-component space does not remain invariant
when the space inversion is introduced (recall from section 15.3 that the space
382 Space and Time 15 .8.4
inversion carries vectors of ut,O) into vectors of UO,l) and these two
representations of !fl may be added to form a single four-dimensional
+
irreducible representation ut,O) UO,\) of the Lorentz group with space
inversions !fl s ) and is therefore unsuitable for a description of particles with
definite parity. We now show that a four-component description based on the
+
product representation plk,O)x (Ut,O) UO,J) overcomes both these difficulties
and the restriction in the number of spin states from four to the value two ,
which is appropriate for the representation ptk,l), is achieved by solving what is
known as the Dirac wave equation.
Following a procedure very similar to that above we introduce basis vectors
+
Ika >for the product representation p(k,O)X (Ul,O) UO'\), where a = 1,2,3,4
runs over the four components of the second factor. The Poincare transfor-
mations are then given by
4
T(€, L) Ika >= exp(ik' , €) L M /ia(L)lk' P> (15.127)
/1=1
_ (U l,O) 0 )
M- o UO,,1 ) (15.128)
in partitioned form with Ul, OJ, etc ., denoting 2 x 2 matrices. Now the four-
component space is clearly too big for describing the representation p(k,l); in
fact p(k,O)X (U l,O) + UO,I)) = 2p(k,lJ. The previous two-component description,
based on the U l.O) piece, was one way of selecting a single p(k,t) representation
but there are other ways which involve the mixing of Ul,OJ and UO,I). To discuss
them we mu st first define some operators which couple these two two-
dimensional subspaces. From the rule (15.37) for reducing product repre-
sentatio ns in the group !f such operators must transform according to U l.l)
and thus form a four-vector which we denote by y. It is soon verified (see
problem 15.13) that such a set is given by the four 4 x 4 matrices
Y, = (0 I)
1 0' (0
Yq = 2 Sq Soq) with q = x, y or z (15.129)
in partitioned form where 1 denotes the 2 x 2 unit matrix and Sq are the 2 x 2
spin matrices. We may now construct the Poincare invariant operator y.Pand
diagonalise it. To find its eigenvalues we note that
while, from the definitions (15.129) and the properties of the spin matrices, the
y-matrices satisfy the anticommutation relations
(15.130)
15.8.4 Space and Time 383
which is called the Dirac equation. This acquires a more familiar form if we
write k = eM 11i in terms of the mass M. (see subsection 15.7.1) and introduce
the differential form P = (- V, ( L'c) alat). Then
For our continuing discussion of the Dirac equation we prefer to use the form
[15.131).
To appreciate the form of the solutions of the Dirac equation and to show
that for given k there are j ust two solutions rather than four we first note that,
for a particl e at rest, PIr/J = 0 and P,Ir/J ) = - ikl r/J ), so that equation (15 .131)
becom es
(15.133)
and from the explicit form (15.129) for YI it is easily seen that the solutions are
any linear combinations of the two solutions
Ik
where o ) denotes the plane wave (15.123) with k = O. For general k the
matrix equation corresponding to (15.133) is obtained simply from equation
(15.131) and (15.129) by using plr/J ) = - ik'r/J ). The solutions are then
384 Space and Time 15.8.4
kx-iky )
Ik 12 _ 12 ) = l f k(k + k ) } - t
2 I t
k+kt-kz
_ k + ik
Ik ) (15.135)
( x y
k+k t +k z
this particular pair having been chosen so that they reduce to (15.134) in the
non-relativistic limit k --+ 0. The normalisation factor will be explained later.
Recalling the structure U t O)-+- uo.
tJ of the four components it is clear that in
this limit the states Ik o ± t t)correspond to the two spin states ms = ± of a t
particle at rest. It may also be shown (see problem 15.14)that the general wave
function (15.135) may be obtained from the solutions (15.134) for a particle at
rest by the operation of a boost Q[ b(k)] , as in equation (15.57) for the vectors
of plk.t) . Finally we stress that, although we are working in a four-component
space, there are only two independent states for a particle with fixed k given by
equation (15.135) and that any state of the particle is expressible as a linear
combination of these two . This is consistent with the conclusions reached from
studying the Poincare representations where there were two states ms = ± t
for each k.
The basis which one uses in the four-component space is clearly a matter of
choice and our choice was determined by equation (15.128), i.e. the first two
rows transform like Ut.O). In this basis we saw that, even in the non-relativistic
limit, there was mixing of components, see equation (15.134). When discussing
phenomena near this limit it would clearly simplify the algebra if one chose a
new basis in which the states (15.134)are given by the column vectors (1,0,0,0)
and (0, 1, 0, 0). In the new basis one refers to the first two components as 'large'
and the last two as 'small' because of their behaviour in this non-relativistic
limit, when Ikl/k is small. We shall not use this new basis .
The solutions (15.135) could also have been obtained directly by construct-
ing a state of definite parity from the state (15.125) and the corresponding state
generated from Lt°·!). If we denote by Ikn )' the two states transforming like
the product plk .01 X Uo.!) then we find, as in (15.125), that the two solutions
(15.135) may be written as
(15.135a)
which is very like equation (15.78). However, the scalar product is now defined
as
which differs from the corresponding equation (15.79) by the presence of the
matrix 'I,. The reason for including this matrix is to make the scalar product
invariant under the transformations (15.127). As we have remarked before, the
matrix M in that transformation is not unit ary so that, had we omitted the
matrix 'I" the scalar product would not have been invariant. To show the
invariance of the definition (15.137) one must show the matrix equality
Mty,M = 'I, which from equations (15.128) and (15.129) reduces to
(VO,1l) t = (u!.o))- 1. But this relat ion follows directly from the matrices ofVO,!l
and V!,O) for pure rotations and for pure boosts and thus for all Lorentz
transformations. The set of four functions
(15.138)
is sometimes called the adjoint of the set ",.(k) and it is clear that the expression
(15.137) is simpler with this notation. An alternative, but precisely equivalent,
form for the scalar product is obtained from equation (15.137) by using the
Dirac equation (15.131),
with correspond ing expressions for the second solution 11. 1 - 1). It is soon
verified that these two solutions are orthogonal with our definition of scalar
product and their normalisation is given by
< k'1ms l k1m~) = kI 8 (k' - k ) 8 msms' (15.140)
So far as space inversion and parity is concerned the same results as in
section 15.5 may be arrived at in the four-component description by writing
the inversion operator T(I) as the product of three factors
T(I) = T intr (I)Tk(l)Ts(l)
where the last two factors refer to the two factors in the product representation
p1k. 0) x (U !·O) +- LlO,!)) and the first factor is the intrinsic parity. Now, as we have
seen before, Tk(I) will simply change the direction of the space part of k while
from section 15.3 the matrix form ofTs(I) is simply YI' Thus for a particle with
intrinsic parity 7t = ± 1 we have
0) =
S
x
1
2YyYz , etc. (15.142)
and
.
(Y s)x=-I(Ax-B x)=- 0
(s., -sx0) =- 2Yt Yx '
1
etc,
where we ha ve changed variables for the integration and made use of the
inva ria nce of the vo lume element, see subsect ion 15.4.4. Thus the function !/J(k)
>
describing the state I!/J is carried into !/J(L - 1 k). Ifwe now write L = 1 + A, as
in subs~tion 15.2.4, for infinitesimal A, and develop a Taylor expansion for
!/J(L - 1 k) as we did for rotations in subsection 7.3.5 then we deduce the explicit
differential forms.
The first of these is ju st th e familiar form fo r the rotation operator (7.21) when
expressed in the momentum representat ion , while the second gives the
corresponding expression for the infinitesimal boosts.
15.8.5 Particles with zero mass and spin 1m/ =-t - the Weyl
equation
The object here is to construct a factorised form for the representation pIO,m)
of the Poincare group, see subsections 15.4.2 and 15.4.3, when Iml = t. We
naturally con sider the product of the represent ation pto. O) with u t O ) or UO. 1),
where we stress that pto.o, do es not den ote the identity representation but
p lk.O ' with k = 0; in other words k = 0 but k t= O. To be definite, we consider
the product ptO.OIX Li o. ~ ) which we shall find to be appropriate for the case
m = - t corresponding to the neutrino. Since p' O.m l has only one basis vector
for each k while p'o. 0 , x U o. ~ ) has two , it is clear that these two representations
ca nno t be eq uiva lent but we sha ll a im to co nstruct the former fro m vectors of
>.
the latter. If we use th e nota tion I}(m as in subsection 15.4.2 for vectors of
p tO.m), and IklJ. >0 with IJ. = +
- 12 for basis vectors of p,o,o'x Lio.ll then we show
that
and X, + Y, = i(X., - iX y ) which are both lowering operators for m and give no
contribution for m = - 1.This leaves the single fJt 2 operator X, which leads to
the simple exponential transformation (15.68) for Ik -1).
We now show how it is possible to arrive at this same result (15.143)
by imposing a wave equation constraint on the two-component space. We
recall from section 15.4 that p(O .m) is characterised by equation (15.76),
CW+imP)IIjJ )=0 in addition to W.W=P .P=O. It is clear that the
product representation P(o.O)x Uo.!) satisfies the latter pair of equations but
the constraint (W + imP) 11jJ ) = 0 leads to a 2 x 2 matrix equation which must
be satisfied . To find the matrix form for W we can start from the definition
(15.72)ofW and use the second two rows and columns of equation (15.142) for
the matrix forms of X and Y. (Remember that the differential operator part of
W vanishes identically , see subsection 15.8.3). In this way the cond ition (15.76)
becomes
(15.145)
and
(15.145a)
This is really a set of four equations since (15.76) is a four-vector equation.
However, if the time-component equation (15.145) is satisfied then it follows
that for m = -1 the other three equations (15.145a) are also satisfied, as may
be seen by multiplying equation (15.145) by s. Thus the vital condition to be
sat isfied in the case m = - 1 is equation (15.145) and this is known as the Weyl
equation. Inserting the differential operator forms for P it takes the more
familiar form
This is consistent with the properties of the representation pO. -!) in which, for
given ii, there is only one basis vector. Here we have only one acceptable vector
Iii -1) but it is expressed in a two-component framework . In fact, equation
(15.145) may be written as (k . s) !IjJ ) = mlkllljJ ) which says that 11jJ) is the
eigenstate of the component of spin in the k-direction, with eigenvalue m. The
expression (15.147) is seen to be identical with (15.143) if one inserts the explicit
15 .8.6 Space and Time 389
For this case we follow the previous section very closely with m = 1 in place
of m = t, using the product representation P(o. 0) x Lu , 0). This gives us a
three-component space and we must impose the constraint (W + imP>IIjJ) = 0
with m = l. In constructing the 3 x 3 matrices for Wwe must now use the
matrices for X anti Y in the L(\ ' 0) representation. However since in Lll, 0) we
have B = Oin the notation ofsubsection 15.2.4 it follows from equation (15.33)
390 Space and Time
(15.149)
and
P .I"' > = 0 (15.150)
If we work in configuration space, introducing the differential forms for P,
these equations become
i a
-;;-1'" >
curl'''' > = c ot
Bibliography
For more details on special relativity and references to original papers see
A rigorous discussion of the Lorentz and Poincare groups runs into deep
water but for further reading we suggest Boerner (1963), see bibliography to
chapter 4, and the article by T. O. Newton in
Space and Time 391
and the references which they contain. The original papers of E. P. Wigner
are conveniently collected as reprints in
Halpern, F. R. (1968). Spe cial Relativity and Quantum Mechan ics (Prentice
Hall, Englewood Cliffs, N. J.)
Problems
15.7 Use equation (15.53) for infinitesima l a, band t , together with equation (15.48)
and the matrix (15.30) of !I. = L - 1, to deduce the co mmuta tion relations (15.69).
15.8 Use the commutation relations (15.32) and (15.69) to ju st ify equation (15.72).
15.9 The components P, of the four- vector P satisfy the relati on T(L)PjT -1(L )
= L (L - I )jj P j with a similar relation for W. Hence sho w that equation (15.76)
j "' '' A "' .......
implies that (W + imP)lkm ) = 0, where Ikm) = T(L) lkom).
15.10 Given that y211/1 ) = exp(iex)I"') , Show that exp(iex) = ± 1. (Consider the
operation of y2 on the state 11/1 ) + Ie/», where Ie/» = Y jl/l ) , and use the
antiunitary nature of Y.)
.
15.11 Show that the 2 x 2 matrix form for the operator 2(s.l) IS. (1 L) 0
1+ -1 0
and hence
verify that the eigenvectors Um ) of equation (15.116) are as given.
15.12 Use equation (15.33) to show that, in the L(!D) representation, X = iY = - isand
hence from equation (15.72) that W; = !p2 , W 2 =!p 2 +~P; so that
(W.W) = ~P .p).
15.13 Show that the matrix Y, defined in equation (15.129) commutes with the X, and
hence that it is the time component of a representation L(! i). Use equation
(15.69) to generate from Y, the other three components Yq.
15.14 Use the 4 x 4 matrices of Y in the basis of U!,Of-+- UD,!) to show that the finite
boost Q(b)=exp(b.Y) takes the form cosh!b+2b -l(b ,Y) sinh}b. Hence
deduce the solutions (15.135) from equation (15.134), taking b(k) as given ju st
after equation (15.58).
15,15 Show that y,yqy,- I = - Yq and hence that the i~version operator T(I) defined just
before equation (15.141) commutes with (y.P).
16
Particles, Fields
and Antiparticles
The concepts of momentum, energy, mass and spin were introduced in the
previous chapter entirely from the transformation properties of the Poincare
group. We worked in the algebraic framework of state vectors, which is
appropriate in quantum mechanics, and we associated different elementary
particles with different irreducible representations of the group. In introducing
these concepts we did not make use of any fundamental equation, like the
Schrodinger equation, which dominated the discussion of symmetry in the first
twelve chapters. In fact in section 15.8 we were able to deduce the forms of the
wave equations necessary to reproduce the transformation properties pre-
viously described. These comments only relate to free particles, however, and
when it comes to interactions it is necessary to have some dynamical
foundation. The foundation which seems to have universal validity is known as
Lagrangian theory or equivalently as Hamiltonian theory and rests on
Hamilton's principle. This principle in its simplest form is applicable to the
motion of particles in non-relativistic classical mechanics. It extends naturally
to relativistic classical mechanics of particles. In the description of elec-
tromagnetic phenomena it extends to incorporate the concept of interaction
with a field and the behaviour of the field itself. Finally it is generalised into a
393
394 Particles, Fields and Antiparticles 16 .1
S =
I
I,
Let (16.1)
being a minimum for any two times t 1 and t 2 ' This is a variational principle. To
ensure a minimum (or at least an extremum), the change in S due to a small
change in the functional form of any of the qj(t) must be zero and we write this
condition as ~S = O. The quantity S is called the 'action'. Using the variational
calculus it may be shown that the condition ~S = 0 leads to the Lagrange
equations
:t G~) -:~ = 0
( 16.2)
Given the form of L, these are a set of differential equations for the coordinates
qj(t) as funct ions of time . There are as many equations as coordinates and,
given the initial conditions, a unique solution for the q j(t) may be found. Hence
the motion of the system is determined directly from the Lagrangian. One
system differs from a nother by the form of L as a function of q., qj and t .
Using the Lagrange equation (16.2) the differential dL may then be written
oL
= ~ pjdqj + ~ p;dq j + Jt d t
does not contain differentials dqj of the velocities. Hence the quantity
t Pjqj - L does not contain the velocities explic itly and we call it the
I
Hamiltonian
(16.5)
These are the Hamilton equations. Whereas the Lagrange equations will
involve the second time-derivatives of qj (since 0L joqj will generally contain
q;), the Hamilton equ ations involve only first time-derivatives of Pi and qj.
However there are now twice as many equations. Naturally, the two techniques
lead to precisely the same solution.
From the point of view of symmetry one sees that if L (or H) is independent
of a certain coordinate qj so that oL jo qj = 0, then the corresponding
generalised momentum Pi is constant, since from either equations (16.2) or
(16.6) we have Pi = O. Notice also that, from equation (16.4),
(16.7)
E=H = L P;l/ j -
i
L = M v 2(l- v 2/e 2)- !+ M e 2(1- v 2/ c 2)!
(16.1 0)
as in the quantum case, see equation (15.94). For small ole, equation (16.10)
reduces to the familiar expression
(16.11)
containing the rest-energy M e and the kinetic energy.
2
describe the motion of a particle with charge e in this field we must modify the
Lagrangian (16.8) to include A but to remain invariant. The most obvious
invariant to include is the scalar product
A.M = (- A.v+e4»dt
The conventional equations describing the motion of a charged particle in an
electromagnetic field are obtained by including just this term, with a suitable
constant - etc so that
L= -M e(e 2-v2)!+(e l e)(A .v)-e4> (16.12)
The generalised momenta are now given by
aL
P.• = ---at = M v..(1 - v [c )-, + (ele) A,
2 2 1
(16.13)
E
a
= - - A le - grad 4> B = curl A (16.17)
at '
The vectors E and B are called the electric and magnetic field intensities. Since
it is equated to (mass) x (acceleration), the right-hand side of equation (16.16) is
called the Lorentz force.
The differential operator 0= (0 lox , 010 y, 0lo z,- oloct ) is a four -vector, see
subsection 15.4, so that the sixteen products O JA j transform like the product
representation U H)x L(t.!l of the Lorentz group. Being first derivatives of the
components of A, it is clear that E and B are contained in these sixteen
products and using the reduction (15.37) of the product representation one can
identify E ± iB with the representations U 1.0) and Uo. 11, respectively, see
problem 16.1.
An electromagnetic field has meaning only by the way in which it influences
the motion ofa charged particle, through equation (16.16). Thus any two fields
A which lead to the same E and B are physically the same field. From equation
398 Particles, Fields and Antiparticles 16 .2
(16.17) one sees that this leaves A undetermined to the extent that iff(r, t) is
any scalar function then the vector potential A' = (A', ¢'), where
then ¢ is said to be a scalar field, with respect to rotations. If, on the other hand,
the field has 25 + I components ¢m(r , t) with m = 5, 5 - I, ... . - 5 such that
the m-component of the rotated field is given by
then the field is said to be a tensor field of degree 5. This is the same
transformation that we deduced in equation (8.20) for the components of a
wave function with spin 5 . The most familiar example is the vector field with
5 = I so that at each point in space the three components define a direction for
the field. It is of course possible to construct fields which are mixtures of
16 .2.2 Particles, Fields and Antiparticles 399
(16.19)
(16.20)
where !f' , which is a function of ¢ and its space and time derivatives V ¢ and
<P ( = c ¢ Ii" t) is called the Lagrangian density. Application of the variational
principle 8S = 0 now leads to a generalisation of the Lagrange equation
Then, from the invariance of :£ under time translations we are able to show
that H = Itf d V is constant. In very similar fashion, if the Lagrangian density
is translation invariant in space , then the vector P with components
I
r, = Y'qd V is constant where
etc.
The proof of these results is outlined in problem 16.2. It is natural to call & q the
components of the momentum density of the field because they are the
constant quantities arising from translation invariance. The conservation of
angular momentum follows in a similar way from the rotational invariance of
!fl .
div B =0 and
i es
curl E = _ . - - (16.24)
(' at
We discussed in subsection 16.1.3 the behaviour of a particle in a field which
was regarded as fixed, in the sense that it was not influenced by the particle.
Above we have described the behaviour of the field in the absence of charges. In
general we mu st allow for the effect of the charged particles on the field and
this is done formally by taking a Lagrangian density which is a sum of the field
term (16.22) and a term like (16.12) integrated over all charged particles
present. One then deduces field equations which are generalised from equation
(16.23) by the inclusion of terms arising from the presence of A in (16.12) and
which represent the charge and current density due to the charged particles.
Thus the motion of the particles, through equation (16.16), and the motion of
the field are coupled.
description for the field. We therefore seek a quantum treatment of the field.
The technique is to replace the field by a field operator and to consider matrix
elements of this operator as is the usual practice in quantum mechanics. The
form which the operator takes is a combination of what are called creation and
annihilation operators offield particles. Thus in addition to the particles which
a re interacting via the field we introduce the new field particles. For example,
the photon is the field particle of the electromagnetic field and the n-meson the
principal field particle of the strong nuclear forces. The introduction of
operators which create and annihilate particles is not simply a mathematical
device . Such processes are commonplace both with the zero mass photons in
the emission of light and for finite mass particles such as the n-meson in the
high-energy collision of protons p + p -+ p + P + nO . The creation of mass does
not violate the Poincare invariance of the system, see problem 16.4. In a
complete theory one should also describe the interacting particles themselves
by a field so that, in the example above, one would have a mechanism which
allowed the creation of more protons as well as n-mesons and aga in such
processes are observed.
With the help of equation (16.26) it can be sh own that this co m muta tio n
relation is valid not only in the vacuum but generally. With the normalisation
(15.81), it takes the form
[ a(k), at(k'lJ = k,8(k -k ') (16.29a)
Physical operators may now be expressed in terms o f these crea tio n and
annihilation operators. For example, an operator which d oes not change the
number of particles must co ntain a n equal number of a and a" factors. In
particular, the number operator is given (see problem 16.5) by
As in subsection 15.4.4 the range of integration extends over all k with fixed
length (mass) and positive k, so that k, = (k 2 +jkI2)!.
From the symmetry point of view the Poincare transformations will not
change the number of particles so that, if we assume the vacuum to be unique,
it must be invariant. Thus one would expect that the creation operators at (I,)
transform like the single-particle states, according to equation (15.59), and this
may soon be justified using equation (16.26). For the zero-spin case being
considered here,
(16.31)
In the case of ferrnions, for which the wave functions must be antisymmetric
rather than symmetric, the results of this subsection are modified simply by the
appearance of the anticommutator [A, B] + = AB + BA in place of the
commutator [A, B] = AB - BA. The need for this modification may be seen
in the step from equation (16.26) to (16.27).
We now seek a linear combination of the operators a' (k) which represents
e
the creation ofa particle at a point in space - time rather than in a state Ik).
This new operator is denoted by ¢' (e) and is called a 'field operator'. From
this description the transformed operator must be the operator at the
transformed point Le so that we shall require
(16.32)
Using the property (16.31) one quickly verifies that the necessary linear
combination of the a '(k) is given by
= ¢'(Le+ E)
where we have changed integration variables Lk ..... k and used the invariance
(see subsection 15.4.4) of the volume element k r-I st:
Bearing in mind the
discussions in subsection 16.2.1 we shall say that the operators ¢ t (e) describe a
scalar field. (The reason why we have Le appearing in equation (16.32) and
16 .3.3 Particles, Fields and Antiparticles 405
(16.34)
e
H(p , r) ljJ(r, I) = ih ailjJ(r, I) (16.35)
< >
The expectation value ljJ(r, 1)1 Q IIjJ(r, I) of an operator Q(P , r) in the
Schrodinger picture may then be reproduced in the Heisenberg picture by
406 Particles, Fields and Antiparticles 16 .3 .3
taking the expectation value of the transformed operator Q ' = SQS - 1 in the
initial state since
< l/J(r , O)I Q ' Il/J(r , O) = < l/J(r, t)I S- l (SQ S- l )SIl/J(r , t)
= < l/J (r, t )IQ Il/J(r, t )
using the fact that S is unitary for Hermitian H. Thus, in the Heisenberg
picture if the time-dependent operators Q ' are known, we may deduce any
required expectation value, given the init ial state l/J(r, 0). Instead of having to
find the wave function l/J(r , t), the problem in the Heisenberg picture is to find
the Q'(P , r , t). But the 'equation of motion' for Q ' follows immediately from
its definition since if we assume Q to be time independent then
dQ '/dt = iH Q '/ h - iQ'H/ n, so that
from the field operator $ and its first derivatives 0$. Since $ is a scalar field,
the second term satisfies the Poincare invariance condition on !i' and by
taking the scalar product of derivatives we have ensured that the first term is
invariant also . The reason for having both terms in fi' with the factor k 2 will be
explained below . Following the procedure in subsection 16.2.2 for construct-
ing a Hamiltonian from the Lagrangian we see that
£=!(cD 2c - 2 + V$ .V$+k 2( 2)nc (16.39)
and the momentum density is given by its components
fljJq = cD Vq $nc- 1
From the definition (16.33) of the field operator ¢ t some algebra leads to the
following expressions for Hand P in terms of the creation and annihilation
16 .3.3 Particles. Fields and Antiparticles 407
J t
H = H - ( 0 1H 10) = lie k,a (k)a(klk,-'dk,
(16.40)
p = p- <OIP IO) = li Jkat(k)a(k)k,-Idk
Comparing the expression (16.40) with the number operator (16.30) one sees
that H may be interpreted as a sum over the possible states k of the number of
particles in that state multiplied by the energy lick, of the particle in that state.
A similar interpretation is valid for the momentum operators P. Thus, starting
from the field description we have arrived at a particle description.
Since $ contains both at and a operators one would have expected H to
contain terms like atat . In fact there is cancellation between such terms arising
from the two pieces of the Lagrangian. This explains our choice of the factor k 2
in !f, since only the absence of terms like a t a t enabled us to interpret H as the
Hamiltonian for a system of non-interacting particles and this was our object.
Naturally a theory of non-interacting particles such as we have given above is
rather empty but by adding other terms to the Lagrangian, as we did in the
classical problems in subsection 16.1.3, one may introduce interactions
between fields and thus between particles.
The conservation of P and H may be shown to follow from the invariance of
!f with respect to space and time translations as was the case for classical
fields, see subsection 16.2.2.
The field described above is appropriate for the neutral nO mesons. Because
some of the arguments may appear to be circular we briefly summarise the
philosophy and content of the discussion of quantum fields so far. In section
15.4 we classified all possible particles according to irreducible representations
of the Poincare group. Using the creation and annihilation operators we next
constructed a scalar field operator ¢J t(i) with the transformation property
(16.32), using the operators for zero-spin particles. From these we constructed
a Lagrangian and hence a Hamiltonian whose eigenstates corresponded to
408 Particles, Fields and Antiparticles 16 .3.4
definite numbers of particles. In doing this we did not make use of the
variational principle descr ibed in subsection 16.1.1 or the field equations
which, as shown in subsection 16.2.2, result from thi s principle. However, if we
were to apply the field equation (16.21 ) to the Lagrangian den sity operator
(16.38) we obtain the di fferenti al equation
(0 2 + k 2 )<I>(e) = 0
This is the familiar Klein - Gordon equation which we met in subsection 15.8.3
in connection with the wave function s for particles. Here it enters in a different
role as a condition on the field operators. From the expression (16.33) for ¢ \e)
one sees that <I>(e) does indeed satisfy this field equation and hence satisfies the
variational principle. (Note the similarity here with our discussion of wave
equations for particles in section 15.8). When interactions are included the field
equations playa more significant role.
= if k r
- I dk "sin] k .(e ' - e) }(2n:) - 3 (16.42)
which clearly vanishes because the sine function is odd and the integral extends
over all three-space of k ,
Thus we conclude that the field <1> built from zero-spin particles with
symmetric many-particle states, and therefore with commutation relations
(16.27) -( 16.29) between c rea tio n and annihilation operators, satisfies the
causality condition. Had we chosen to construct <1> from zero-spin particles
with antisymmetric many-particle states, i.e. with anticommutation relations
in place of (16.27)-( 16.29) then a repetition of the argument above leads to a
cosine in equation (16.43) in place of the sine function . The integral will not
then vanish and causality is not satisfied. Thus, if we accept the causality
condition, we conclude that zero-spin particles cannot obey Fermi statistics,
i.e. have antisymmetric wave functions. The arguments above can be
generalised to the case of any integer spin with precisely the same conclusion.
For half-integer spins a change of sign occurs in the algebra and one concludes
that Bose statistics, i.e. symmetric states, would violate causality. One refers to
these results as the 'spin-statistics theorem'.
16.3.5 Antiparticles
We first remark th at the Hermitian field <1> = 4> + 4> t discussed above is the
only combination (apart from phase) of 4> and 4> t which satisfies causality. A
more general, non-Hermitian field may be constructed only by introducing a
new field operator Xt defined just like 4> t in equat ion (16.33) except that in place
of at(k) we have bt(k) which creates a particle with the same Poincare
behaviour as before, i.e. same mass and spin, but which differs in some other,
internal, property as yet unspecified, e.g. charge or hypercharge. It then follows
as before that the non-Hermitian field <1> defined by
<1> = 4> + X t, <1>t = 4> t + X (16.44)
t
satisfies causality, assuming that the new particle operators b and b satisfy the
sa me commutation relations (16.27H 16.29) as the at and a and that all the new
operators commute with all the old . (For half-integer spin particles the
anticommutation relations apply instead.) The new particles are called the
'a ntipa rticles' of the old ones..
410 Particles , Fields and Antiparticles 16.3.5
A simple Lagrangian like (16.38) may again be con structed, with terms like
<1>t<1> replacing <1>2 to en sure that Y is Hermitian. The expressions for Hand P
then come out to be the sa me as before except that there are additional terms
with the b-operators in place of the a-o pera to rs. Th ere is a lso a new conserved
qu antity
and thus measures the difference between the number of particles and the
number of antiparticles. In a single-particle state Iii > the value of Q is
<IiIQ IIi >= 1, while for a single antiparticle <Q >= - I. The operator Q
clearly distinguishes particle a nd antiparticle and in the case that the field
particles and antiparticles are the IT + and IT - meson s, respectively , thi s
operator will serve to represent the electric charge of the field. On the other
hand if the field is used for the neutral K 0 and K 0 meson s, which differ in
hav ing hypercharges of ± I respecti vely, then Q will represent the hyper-
charge. For the pure K ± field, which has both charge and hypercharge, Q will
be able to represent both cha rges simultaneously. (When severa l different
particles are pre sent, the operator expre ssion s for Q becom e more elaborate
than that given above, with additional terms ari sing from the other particles.)
In this situation the expressions for the electric charge and for the hypercharge
will naturally become different from each other. There are indeed charges
other than these two like the baryon number and the lepton number (see
sectio n 11.3).
Notice that the field <1> has simple properties with respect to Q, with <1>t
increasing the value of Q by one unit and <1> decreasing it by one unit. As a
result we may retain charge conservation even when interaction terms are
included in the Lagrangian. This can be achieved by adding to Y the product
of <1> with a corresponding term of another field which has the opposite effect
on Q . The conservation of charge relates to an intrinsic property of the particle
but we may nevertheless associate charge conservation with a transformation
of the field (as momentum conservation is associated with translations) by
introducing the transformation <1> -+ U<1>U -I, where U = exp(if:Q). Then
since <1> decreases the value of Q by unity, we have [ Q , <1>] = - <1> and hence
the transformation is given by <1>-+ exp( - je)<1>, a con stant phase change. This
is called a gauge transformation (see also subsection 16.1.3). Thus Q is the
infinitesimal operator for U, as P is for translations. In fact it is a general
property that the invariance of the Lagrangian with respect to transformations
of thi s kind leads to a conserved charge. Furthermore there is an associated
16.3.6 Particles. Fields and Antiparticles 4 11
using the anti-unitary property of Y . Thus the effect of PCT (or T(I)CY in our
notation) on the field is to take the Hermitian adjoint and to replace i by - i.
The phase ± in equation (16.45) is not significant because of the arbitrariness
in choosing the phase for Y, see subsection 15.7.4. The transformation i-+ - i
which reflects both space and time is sometimes called 'strong reflection' and
because of the Hermitian property of the simple Lagrangian density !E ,
constructed for the field <1>, as in equation (16.38), it follows that the PCT
transformation induces the change !E(i)-+!E( -i). When the Hamiltonian
density .Yf (i) is constructed from !E(i), as in equation (16.39), the same change
occurs, yt (i) -+ .Yt ( -i) and finally the Hamiltonian operator H = J.Yt' (i)dV
formed by integrating over all three-space satisfies Hu) -+ H( -l). But since H
must be time independent this means that H is unchanged by the PCT
operation, which completes the argument that PCT is a symmetry operation.
The power of this result lies in the fact that the arguments outlined above are
equally applicable when the Lagrangian contains complicated interaction
terms and is dependent only on the Lagrangian being Hermitian and invariant
under the proper Poincare transformations. (There is also the implicit
assumption of point particles, without structure, and our use of differential
equations which implies a continuity in space, without any granularity.)
A more formal proof of the PCT theorem may be constructed by
arguing that invariance of the field theory with respect to proper Poincare
transformations implies invariance with respect to proper complex Poincare
transformations and that this complex group contains the strong reflection
e-: -e.
To illustrate the PCT symmetry we give two examples where there is no
symmetry with respect to P or C separately but where the product PC is
conserved . In subsection 15.7.3 it was argued that the neutrino did not possess
definite parity. In fact the parity operation produced a right-handed spinning
particle which is not found in nature. However the operation CP will take the
left-handed neutrino into the right-handed antineutrino which is found to
exist. One finds here that, although P is not conserved, the product CP is
conserved as also is T.
The KO meson with finite mass, zero spin and definite strangeness -1 has
as its antiparticle the K ° meson with strangeness + 1. These particles are
produced via the strong nuclear interaction which conserves strangeness.
However they decay via the weak interaction which does not conserve
strangeness, which does not conserve C or P separately but which to a high
degree conserves the product CPoThe KO is observed to decay in two ways
either to two n-mesons with a lifetime of about (10)- 10 sec or to three n-
mesons with a lifetime of about (10)-7 sec. These results may be understood
as follows . Since the KO particle has definite parity we may write
16 .3.7 Particles, Fields and Antiparticles 413
CP I K~ > = K~ >
1
(16.48)
+v ':Xm~
(k)b\km,)exp(ik .el) (16.49)
where the normalisation factor is conventional and the coefficients U and vare
determined to ensure the transformation property (16.47) and causality.
Regarding the u and vas 4 x 2 matrices, it is argued below that they are given by
(U !.Ol(Q)) (-U I.Ul(Q) ) ( 0 I)
= U O. !I(Q) , v(k ) =
A A
where the boost Q in the 2 x 2 matrices L carries the vector (0 0 0 1) into ~, see
equation (15.125). (In fact , the two columns of u(k) are the column vectors in
the two single-pa rt icle sta tes Ik ± ) given in equat ion (15.135a ).)
16 .3.7 Particles, Fields and Antiparticles 415
where k' = Lk and R' = Q ' - ILQ from equation (15.60) with Q' the boost for
t: Taking the Hermitian adjoint of this equation gives
T(E, L)a(km,)T -I(E, L) = exp( -ik' .E)~D~~;(R,-1 )a(k'm:)
m,
using the unitarity of the D-matrix. Since, for rotations, D(1) = U 1. 0) = UO,1),
we may write , as a matrix product,
D(\)(R , -l) = U J,O)(Q -I)U1,O)(L - I)U J, O)(Q ') (16.51)
= Lt0, 1J(Q - 1 IUD,1)(L - I)UO,1l(Q ') (16.51al
From equation (16.49) we now have
-1)D(1)(R' -I)( 0 1)
o -1 0
calculating the anticommutator of <b T(e) and <b(e') following subsection 16.3.4.
An interesting consequence of the structure of the Dirac field is that the
intrinsic parity of particle and ant iparticle must be opposite. This differs from
the zero-spin case where the two parities were the same. Let us return briefly to
that ca se before discussing the Dirac field. Let T(I) denote the unitary
transformation induced by the space inversion 1. Then if n denotes the intrinsic
parity of the particle we have , from equation (15.87),
T(l )a\k)T - 1 (I) = nat (lk)
with
then from the definition (16.44) of <1> it follows quickly, using the properties of a
and b above, that equation (16.54) will be satisfied only if n = tt ' and then
t
Now, recalling from subsection 15.8.4 that in the four-component space the
inversion operator is represented by the matrix y" we shall require a field with
definite parity to satisfy
(16.58)
where again <b(e) denotes a column vector. But , using the results (16.56) and
(16.57) in (16.49), we have
To relate this expression to cD(li) we need to change the notation for the
integration variable, writing Ik -+ t.
We must also use the property
u(lk) = }', u(k) which may be seen directly from the explicit form (15.135a) for
u(k), and the consequent property v(lk) = - }', v(k). Because of this sign
change it follows that the expression (16.59) reduces to the form (16.58) only
when n ' = - n: and then
(16.60)
Experimentally, the st udy of the electron - positron system confirms this result
that for s = !. the antiparticle has a parity opposite to that of the particle.
The Dirac equation (15.131) referred to the one-particle states and contained
operators Pwhich described infinitesimal translations in the state vectors. We
now see that the field operators cD(i) satisfy a Dirac equation
(Y .D+ik)cD(i) = 0 (16.61)
where 0 = (- V, % ct ) is a differential operator acting on the posinon
variables i . This result follows directly from the definition (16.49) by operating
with 0 on the exponentials and using equation (16.60). The Dirac equation
(16.61) is again the field equat ion which arises from the Lagrangian. In this case
the appropriate Lagrangian density which gives rise to the particle picture, as
in subsection 16.3.3, and also leads to this field equation is
2) = cD(i)( Y. 0 + ik)cD(i)
f
shall write
T(E, l)<1>,(e )T- 1 (E, L ) = (2n) - ! f k,- ldklr.( kH exp( -ik'. E- ik.e)
x L (~'ll~ l [ ( R~y. Q~) - 1 LRxyQJ*a (k' - ) + exp (ik' . E + ik .e)
x q\i O )[ (R ~)'Q ~ ) - ILRxyQJbt(k' +):
(16.64)
Writing w(ii) = U O.1)(R, y.Q :) ( y) as the seco nd column of U O.I) and using the
unitarity of the particle transformation (15.144) we see that
w(k)L(~i~ I[(R :, )Q~) -1 lR"QJ*
hold s for the second term in equ at ion (16.64) using the equ ivalence of U l. ol and
L1o. H*, see subsection 15.2.5. Putt ing the se two result s into equation (16.64)
and making the change Ii' -+ Ii in the integration variable leads to the des ired
result (16.62).
As in previous examples one sees that the field (J)( €) satisfies the appropriate
field equation which, in thi s case , is the Weyl equation (15.146)
1
( s.V - -2c - at
0) (J)(e) =
A
°
The Lagrangian density for the free neutrino field is
Note that (J)t transforms as.cordin~ to U 1,O) and the Weyl operator is a
component of a four-vector W + imP. It is rather surprising that one does not
need to include the other components to ensure the invariance of !f' but this is
due to the special relat ions between these components as pointed out after
eq uation (15.145).
Although the neutrino has no electric charge we can define an operator Q as
in subsection 16.3.5 which measures the difference between the numbers of
neutrinos and anti neutrinos. It is called the 'lepton charge' and if a similar
16.3.7 Particles, Fields and Antiparticles 419
charge is assigned to electrons and muons then one finds experimentally that
this lepton charge is conserved, like the electric charge. This is illustrated in the
fJ-decay of a neutron, n --+ p + e + v into a proton, electron and antineutrino V,
since one assigns a zero lepton charge to the heavier particles nand p.
Confirmation of these ideas is found in many experiments; for example the
process v + n --+ p + e does not take place whereas v + p --+ n + e, with e
denoting the anti-electron (or positron), is observed. We might briefly remark
that the heavier particles like nand p belong to a class of particles called
'baryons' with a baryon charge of + I and that this baryon charge provides a
further conserved charge.
We have seen in subsection 15.7.3 that space inversion changes the helicity,
i.e. changes the sign of m. Thus the neutrino field cannot have definite parity
like the Dirac field. The space inversion would carry the neutrino (m = -t)
into a neutrino with m = ! but such particles do not occur in nature and are
therefore not contained in the field. However if we introduce the charge
conjugation operator C as in subsection 16.3.6 then the existence of
antineutrinos with m = ! suggests that the field may transform simply under
the combined operation of charge conjugation and space inversion which we
denote by CP. Defining this unitary operator by the relations
t
CPat(k - )(CP) ~ I = '7(k)b (Ik +), (CP)2 = I
where '7(k) is a phase factor, we are able to show , by using the properties of
w(k), that CP transforms the components of(J)(€) into those of (J)t(I€). In fact
<l>,(f) = (27<) - j f
k,-t dkw,(k){ a(k - )exp( - ik . f )
the vector ( ~) in the m-basis is given by (e, - iey)/2i in the Cartesian basis so
that equation (16.67) tells us that w(k) is a vector oflength k., of the kind (~)
16 .3.7 Particles , Fields and Antiparticles 421
but rotated by R, y- Since R, ycarries the z-axis into the d irection k, the vector
w(k) may therefore be written
w(k ) = k,[e dk)- ie2(k)] /2 1 (16.68)
where e I and e 2are a pair of o rthogonal unit vectors in the plane perpendicular
to k . The pos ition of e 1 in th is plane is unimportant since an y rotation in this
plane changes onl y the phase of w(k ). However, by choosing e2 to be the axi s
of the rotation Rx }" the se unit vectors acquire the simple properties
e2 (1 k ) = e2(k)and e 1 (lk ) = -e dk),since Ik is arrived at by a n extra rotation
o f n: in R,}" Hence from (16.68), w(Ik) = - w*(k).
From equation (15.91) the behaviour of the particle operators under space
inversion is given by
(16.69)
where the choice of negative sign is only a convention, since the relative phase
o f the two helicity states is free. (The po sitive sign was used in equation
(15.91).) However, o nce thi s cho ice is made, the po siti ve sign between a(k -)
and at(k + ) in (16.65) acq uires significa nce. Using (16.69) we see that the field
satis fies the equat ion
T(I)<I>(e)T - 1(1) = cIlt(le )
We may therefore form linear combinations of«Dand «D t which have even and
odd parity, respe cti vely:
t(e)] t(
B(e)= [«D(e) + «D / 2l and E(e ) = - i[ cIl(e) - cIl e )] / 2!
The numerical factors a re conventional and the factor i makes E Hermitian as
well as B. By using the symbols E and B we are anticipating their association
with th e famil iar magnet ic and electric fields of classical theory, see subsectio n
16.2.3. To construct the q uantised field corresponding to the classical vector
potential A it is convenient to introduce the particle operators
t A t l' t l' 1
a1(k) = [a (K+) -a (K -)] /2 1
tA t l' t l' I
a 2 (k ) = -i[a (K+)+a (K-)] /2 I ( 16.70)
which are linear combinations of the two helicity states ± and describe states
of plane polarisation. In terms of these new operators and using the form
(16.68) for w, the expressions (16.65) lead to
f
B(e) = (2n:) -l2 -li dk {[ e2(k) adk) - edk)a 2(k)]exp( - ik .e)
t
- [e2(k )a 1 (k ) - e 1 (k)a ; (k )]exp (ik. e) } (16.71 )
422 Particles, Fields and Antiparticles 16.3.7
From the form of the expressions (16.71) we see that both E and B may be
expressed in terms of a single vector field
by writing
E(€) =
I c
- - ---;-A(€), B(i) = curIA(€) (16.73)
e ot
and using the trivial properties e. ». e 1 = e z, e. r. e z = -e 1 for a right-
handed set of orthonormal vectors.
It is quickly verified that, as in the other examples, the field satisfies the
appropriate wave equat ions deduced in subsection 15.8.6. In fact it is «I» \2),
transforming like L(t,O) , which satisfies the equations given there while «1»(2)
needs a change of sign in (15.149). Thus
i a
curl «I»(e) = - - --;-«I»(l),
t
curl «I» (e)
ic
= - --;-«1»
t
(e)
c at e M
div«l»(e) = 0,
and these equations lead to the more famili~r form of Maxwell's equations
when they are expressed in terms of E and B,
l e i 0
cur l 8 = - - E curl E = - - - 8
c ct ' c at
div B = div E = ° (16.74)
One therefore identifies this field, constructed from the zero-mass particles
with Iml = I (the photons), with the famil iar electromagnetic field of E and B
which it approaches in the classical limit. As one would expect, the way in
which E and B are obtained from the single field A is the same as in the classical
theory, see subsection 16.1.3. Recall, however, that in the classical problem we
used a four-vector potential A = (A , (/J) to describe the Lorentz invariance of
the theory, whereas in equation (16.72) we have only a three-vector. However
in subsection 16.1.3 we showed that any two potentials A which led to the same
E and B were physically equivalent. There was therefore freedom in the
choice of A and one possibility, called the ' rad ia tio n (or Coulomb) gauge' is to
set ¢ = 0. The simple choice of A(e) in equation (16.72) corresponds to this
gauge.
We have constructed the field A(~) by a somewhat circuitous path, starting
from the product representations P(O, 0) X L(1 , oi and p10, 0) X Lt O, I) . This had
the advantage of similarity with the neutrino field. However it would have been
possible to arrive more directly at E(e) by using the product representation
plO, 0) x L H,!I which also contains p(O, ± I).
Particles, Fields and Antiparticles 423
Finally we note that, since the fields E and B were constructed to have even
and odd parity, respectively, it follows from equation (16.72) th at A must have
odd parity,
T(I)A(e)T-1(l) = -A(Ie)
For this reason the photon is said to have odd intrinsic parity. By taking the
opposite relative sign of alii - ) and at(ii + ) in <1.l,\e) in equation (16.65) we
could have constructed a field with positive parity but this would not have been
consistent with experiment for the electromagnetic field.
Bibliography
Streater, R. F. and Wightman, A. S. (1964). PCT, Spin and Statistics and all that
(Benjamin, New York)
For details of the experimental data on which the new concepts like
charge conjugation are based , one should refer to the following particle
physics references: Perkins (1972) of chapter 11 and Dyson (1966) of
chapter 15.
Problems
16.1 Show that the fields E ± i B defined in equation (16.17) transform according to
the representations U 10) and U °\), respectively, of the Lorentz group. (Use the
matrices (15.30) of X and Yand for L11 0l to sho w that (X - iY)(E + iB) = O.
16.2 If the Lagrangian density is invariant under time translation (dd~ = 0 ) show
J
that H = Jf'dV is constant. (Use the field equation (16.21) and Gauss's
theorem, assuming the fields vanish at infinity.)
16.3 In the same way, show that the field momentum P defined in subsection 16.2.2 is
constant when 5f is invariant under space translations.
424 Particles. Fields and Antiparticles
16.4 Two particles with rest mass M and equal and opposite momenta ±pcollideand
come to rest with the production, also at rest, of a third particle. Show that its rest
mass is given by 2 { ( M 2 + :: Y- M } .
16.5 · Show that the number operato r defined by equation (16.30) has the properties
NIO) = 0. Nlk ) = Ik) a nd generally Nlk.k2' " k. ) = nlk 1k 2 • • • k.).
17
The Symmetric Group ~
17.1 Cycles
12 3 4 5 678910)=(1358)(27)( 6910)
( 3754810216 9 3581 72 106 9
= (1358) (6109)(2 7)(4),
17.2 The Symmetric Group Y' 427
n
12 3 k-Ik)
(123 . . . k) == ( 234 k I = PI2P23P34 '" Pk- I k (17.1)
transpositions and vice versa. In the second equ ivalent definition we make use
of the function
(17.2)
17.3 Classes
For the rotation groups it was shown in section 2.7 that if two rotations Rand
R' are in the same class then the group must contain a rotation which carries
the axis of R into the axis of R'. The analogous result for permutations is that if
two permutations P and P' are in the same class of Y' n then there must be a
permutation Q which, when carried out on both rows of P, produces P' . To
prove this statement we first define the permutations
and
(17.5)
where the last equality defines the r j given the pj and q.. Hence
(17.6)
but the permutation P' is nothing more than the permutation P of(17.4) with
both rows permuted by Q of (17.5), i.e. the numbers 1 2 . . . n are replaced by
ql q2°o ·qn and the numbers PIP2 °o .Pn are replaced by r l r 2 °o.r n •
17.3 The Symmetric Group Y' 429
n
In terms of cycles this result implies that all elements of g" assoc iated with
the same partition [111 2 • • • ] are in the same class and conversely that all
elements in the same class have the same structure in terms of cycle length and
are thus associated with the same partition. For example, in the group g 5 the
two permutations
and
are in the same class. To find a Q such that P' = Q PQ - lone needs simply to
permute the cycle labels of (17.7) into those of (17.8), giving
Q=(12345)
35412
It is quickly verified that
Q PQ _ 1 = ( 12 34 5) (1 2 3 4 5) (3 54 1 2) = (3 5 4 12 )
35412 31254 12345 43521
== ( I 2 3 4 5) = P'
21453
S=~LP
n! p
and
A = ~ L n(P)P (17.12)
n! p
although for most purposes the numerical factor l in ! is irrelevant. These two
operators are called the symmetriser and the antisymmetriser, respectively.
According to the general theory of section 4.19 the functions f s = Sf and
fA = Af will be totally symmetric and totally antisymmetric respectively, no
17 .5 The Symmetric Group Y' 431
n
matter what the functionf. It is always possible however, forcertainf, that the
functions f s and J~ may vanish identically.
Consider for example II = 2 with f = x I ' Then
Sf= t(E+PI2)X I =t(x. +X2) =fs
Af= !(E- P 12) X1 = !(x 1 -x 2 ) =fA
SJ=
' I[
(; E+PI2+PI 3+P2 3+ (123)
231 + (123)J
312 XI
= i (x 1 +X2 +x 3 +x I +x 2 +x 3)
= 1(x 1 +X2 +x 3) =J~
Af= k[ E-P I2 -P 13 -P 23 + C
(~~:) + ~~) JX I
= i(x i -X 2 -X3 - XI +x 2 +X3) =0
which leads to the character tables and also gives some insight into the
properties of the basis vectors of the irreducible representations. We shall see
that not only is the number of irreducible representations equal to the number
of partitions, but also that each irreducible representation may be labelled by a
partition in a natural way.
Consider a number n of particles which may be put into a number n of
single-particle states cPi so that cP;(k) signifies that particle k is in state i. To each
partition [n l n2"'] of n we associate the product function
$([n 1n2 "']) = cPd1)cPd2) ... cPl(n.)cP2(n l + l)cP2(n l +2) .. .
. . . cP2(n l +n2)cP3(nl +n2 + 1) . . . (17.13)
in which the first n 1 particles are in the state cP l' the next n 2 particles are in state
cP2 and so on . For each partition we now generate the set of n!/(n l ! n 2! . .. )
independent product functions P$( [n 1 n2 • • . ] ), where P is any permutation
of the particles. (Although there are n! permutations P, this number must be
divided by n l ! n 2 ! .. • to allow for those permutations which leave (17.13)
unchanged.) This set of states defines, for each partition, a vector space
L([n 1 n2 •• • J) which is invariant under permutations and therefore provides a
representation of the group Y n • In general these representations are not
irreducible but their characters are relatively easy to calculate. The characters
of the irreducible representations may then be deduced by a process of
subtraction, starting from the simple partition [n]. In this case the function
(17.13) is invariant under all permutations so that the corresponding
representation is the identity representation.
To calculate the character for a permutation Q in the representation
generated by the vector space L([n l n 2 • •• J) we must sum the diagonal matrix
elements of Q in this space. However, the result of operating with Q on a
general basis vector P$( [n 1 n2 ••• ] ) is simply another basis vector because
QP is just another permutation. Thus every column of the matrix of Q will
contain one entry of + 1 and zero in every other position. The only
contribution to the character will occur when this entry + 1 occurs on the
diagonal of the matrix. In other words there is a contribution of + 1 for every
basis vector which is unchanged by Q and hence the character for the
permutation Q is given by the number of basis vectors of the kind
P$( [n 1 n2 •• • ] ) which are unchanged by Q. It is of course only necessary to
choose one permutation Q from each class. For the identity element E the
character is simply the dimension n!/n l! n 2! • . . of the space L( [n l n 2 ... ]).
If Q is a single transposition P 12 the character is given by the number of basis
vectors for which the particles 1 and 2 are in the same state, i.e. either they are
both in state cPI or both in state cP2, etc. This number is deduced by counting
the number of ways of arranging the remaining (n - 2) particles, namely
(n-2)! (n-2)!
+, (17.14)
(n l - 2)'. n 2 ·I n 3 I• • • • n l · (n2 - 2)'.n 3 I• • • •
+ .. .
17 .5 The Symmetric Group [I' 433
n
Table 17.1
l4] I I I I 1
X[3I] 3 1 -1 0 -1
[/'3 (Ill) (21) (3) X[22] 2 0 2 -I 0
X[211] 3 -I -1 0 1
1/1[3] 1 I 1 X[III1] 1 -I 1 1 -1
1/1[21] 3 I 0
1/1[111] 6 0 0
x[3] I 1 I
X[21] 2 0 -1
X[III] 1 -I 1
characters deduced earlier and that the remainder is also irreducible. The
number of times that each of the earlier characters appears and the verification
that the remainder is irreducible may be obtained with the techniques of
section 4.11, see also problem 17.3. The remainder is denoted by X[n, n, . . . ] so
that we obtain an irreducible character corresponding to each partition. The
results of this analysis are shown in table 17.1 for n ~ 4. It is interesting to note
that ljI[ III ... ] is the character of the regular representation, see section 4.13,
which contains every irreducible representation a number of times equal to its
dimension. In terms of the basis functions (17.13), the progression down the
table corresponds to the introduction of a greater variety of functions ¢q and
hence there is greater scope for antisymmetry. Finally in the last row there are
as many functions ¢q as particles and every symmetry type may be constructed,
including the totally antisymmetric (alternating) representation X[III . . . ]
already described in section 17.4.
Although we did not give a general proof, it was shown in the preceding section
that the irreducible representations of Y n were labelled by a partition
[n 1 n2 • • • ] of the number n. For many purposes it is convenient to use a more
visual notation for a partition as follows. To each partition we construct a
diagram of n small squares of which n, are in the first row, n2 are in the second
row, and so on, with the left-hand ends of all rows being in the same vertical
line. Such a diagram is named after the mathematician A. Young. As an
illustration, the five partitions of n = 4 are given by the Young diagrams
below:
generally not irreducible with respect to [fl" _ l ' We may therefore ask which
irreducible representations of [fl" _ 1 will occur in the reduction of an
irreducible representation of [fl" and the answer comes immediately from the
character table.
Let T(2.1 denote the irreducible representation of [fl" corresponding to a
partition oc" of n. The reduction with respe ct to [fl n- l is then wr itten
(17.16)
)';,, -1
where T(" - I) is an irreducible representation of .51'" _ 1 and the sum runs over all
partitions (1." _ 1 of n - I .
The character ofT( ") for those elements which lie in [fl" _ 1 is obtained from
the character table of Y " by selecting only those classes with at least one cycle
of length unity. Then with the help of the full character table for [fl" _ 1 the
formula (4.28) enables us to deduce the coefficients m(oc" _ d.
For an example, consider n = 4. The only classes of [fl 4 which contain
elements of.Cf' J will be (1111), (211) and (31) and they contain, respectively, the
elements of the classes (Ill), (21) and (3) of Y J . We give, in table 17.2 the
characters for TI,.I and T(' ,)taken from table 17.1. The results of the reduction
are also given on the right , where it is seen that the m(ocJ) are either zero or one.
Table 17.2
I I 1
2 o - I
I - 1 1
~- - -_ ._ ~ -- --- ~ -- -- - _ . _ - - - _. _-- - -- ------
I I 1
3 1 o
2 o -1
3 -1 o
I -1 1
-~--- - - - - - -
(1) It is trivial that all basis vectors of the irreducible representation 1'2.) of
Y. may be assigned, as a first label, the same symbol (X••
(2) In the reduction of T(~'), on restricting to the subgroup Y . _ l ' the basis
vectors may be chosen to belong to one or another of the irreducible
representations T(~' -1) of Y . _ 1 which occur in (17.16). As a second label the
basis functions may therefore carry the appropriate symbol :x._ 1. Since no
representation T (~" - I) occurs more than once, the number of basis vectors
carrying the symbol (X. _ 1 will be equal to the dimension of the representation
T(a.- al.
(3) Proceeding from Y . _ 1 to Y . _ 2' each irreducible representation T(~' I )
will reduce to a sum of irreducible representations T (a. -, )of Y . _ 2 . The set of
basis vectors with the same pair oflabels (X. (X. _ 1 may now be chosen to belong
to one or another of the T(a. - , J. Thus the basis vectors acquire a further label
(X. _ 2 to include with (X. and (X. _ 1 . Again the number of basis vectors carrying
the sequence of labels (X. (X. _ 1 (X. _ 2 will be equal to the dimension of the
irreducible representation rv- -zl
(4) Clearly this process may be continued until eventually the basis vectors
carry the sequence of labels (X. (X. _ 1 (I. _ 2 •• • (X2 . Again, the number of basis
vectors with a given sequence will be equal to the dimension of the last
irreducible representation in the chain. But this is now T(2,)and the group Y 2
has only one-dimensional irreducible representations. Hence the labelling by
the sequence (X. (X. _ 1 a. _ 2 • •. a2 is complete in the sense that no two basis
vectors of T(~") will have the same sequence and that, to every seq uence, there
corresponds a basis vector ofT(~·). It must ofcourse be remembered that in any
such sequence the Young diagram a. _ 1 must be obtainable from a. by the
removal of one square, that a. _ 2 mu st be obtainable from a._ 1 by the removal
of one square, and so on .
The sequence (X. a. _ 1 (X. _ 2 ••• (X2 may seem to be a very complicated label
since each a, represents a Young diagram of r squares. It is possible however to
denote the set by a single Young diagram a. in which the numbers 1 to n have
been entered into the squares. The resulting picture is called a Young tableau.
17.8 The Symmetric Group Y
n
437
The procedure for entering the numbers is to put the number n into that square
which, when removed from (X., gives the Young diagram (X. _I ' The number
n - 1is put into that square of (X. _ I which , when removed, produces the Young
diagram (X. _ 2, and so on . By the manner of construction it is clear that the
numbers in the squares of a Young tableau must decrease to the left and
upwards.
Because of the completeness of this labelling system it follows immediately
that the dimension of the irreducible representation T(~·) is equal to the
number of different tableaux obtainable from the Young diagram (Xe - This
provides a simple way of deducing, see Littlewood (1950) in the bibliography,
the dimension of any irreducible representation ofY., although the dimension
also appears in the character table as the character of the identity element.
Table 17.3 gives the Young tableaux for each irreducible representation of
Y 2' Y 3' Y 4 . For example, the representation [31] of Y 4 gives rise to three
[II] [31]
r
[22]
[21] [211] 4
[ill]
~ ITEJ
~ 2
~ ~ 3
[Ill] [1111 ]
W
tableaux corresponding to three basis vectors. This agrees with the dimension
given in the character table 17.1. The equivalence between the clumsy notation
(X4 (X3 (X2 used earlier and the Young tableau is illustrated by the example
Yamanouchi symbol in the example given above would be (1 21 1). That this
notation is sufficient to specify the tableau is soon clear if one starts at r 1 and
moves left, remembering that the numbers in the tableau must increase to the
right and downwards. Incidentally, it must always happen that r 1 = I, r 2 = I
or 2, etc. One advantage of the Yamanouchi notation is that by removing rn
one obtains the Yamanouchi symbol specifying the behaviour of the basis
vector of T(~·) under the group Y n _ 1 and the successive removal of rn - l '
rn _ 2' . . ., r3 specifies the behaviour of the basis vector under the chain of
subgroups Y n - 2 , Y n - 3 , · · · , Y 2 •
and from (17.19) the matrix of P 2 3 is soon found by carrying out the
permutation, for example
P 2 3 O( [21] a) = PII 21> -1112> -1 2 11> }/ J 6
= -to([21]a)+AO([21]b)
leading to the matrix
( -t -.Ii )
P23 = -.Ii t
It was shown in section 17.1 that any permutation could be expressed as a
product of adjacent transpositions so that , having found matrices for P 12 and
P 2 3 , all other matrices in the representation T[21] will follow from the known
rules for multiplying group elements. Thus, for example, P 13 = P 12 P 23 P 12 so
that the matrix of P 13 is given by the product of matrices
-I
0) (-t-.Ii -.Ii )
t
(I° 0) = (- -.Ii-t
-1
A similar exercise for n = 4 is suggested in problem 17.5. In section 17.13 we
give a simple prescription for writing down the matrices of adjacent
transpositions in any irreducible representation but the derivation of the
prescription is not simple.
We note that, as a result of the isomorphism between Y' 3 and the point
group D 3 , see section 2.3, the matrices given above are the same as those given
in table 4.1 with a re-ordering of basis vectors.
Given any two representations T(<X') and T(<X') we can form the (direct) product
representation T(<X,)x T(<X') whose dimension is the product of the dimensions
ofT(<Xd and T(<X,J. This concept was described generally in section 4.17 and may
of course be used in the group Y' e- The reduction of the product representation
into a sum of irreducible representations was studied by using the character
table, see the formula (4.45). Thus, for example, table 17.1 of section 17.5 tells
+
us that T[2I]X T(21] = T(3] T(21] T[111]. +
There are two important particular examples of such products which are
worth describing here . Consider first the product T(<XJ x T[111 .. . ] of any
irreducible representation with the totally antisymmetric representation. Since
440 The Symmetric Group Y 17 .10
n
T[ III ... J is one-dimen sional the product has the same dimension as T I"l.
Furthermore if Xl" )denotes the charact er of T'" for a permutation in a class C(; P
then the character of the product representation is simply 7rpX~)' where 7rp
denotes the parity of elements in the class C(/ p ' Here we have used the result
(17.10) that "» is the character of T[ III . . . J. Now if T'" is irreducible it follows
immediately from the criterio n (4.29) that the product representation is also
irreducible using 7r; = I. Hence we may write
T l"lx T[ III ... ] = T(~l with X (~P l = 7r X (" )
p p
(17.20)
where ti labels another irreducible representation of Y n' For example if
(X =
[n] then a = [III . . . ] so that, in the language of Young diagrams, the
two representations Tt")and T(1) are related by an interchange of the role of
rows and columns. Inspection of table 17.1 shows the same result for the
example, (X = [31 ] with & = [211]. In fact it may be sho wn generally that the
Young diagram for (X and & are obtained from each other by interchanging the
roles of rows and columns or , in other words, by a reflection in the downward
diagonal at 45° from the top left-hand corner of the diagram. We say that the
representation T (1)is the 'adjoint' ofT'a)and, naturall y, some representations
like T [21] and T [22] are 'self-adjoint'. In a self-adjoint representation the
character must va nish for a class with odd parity, as is verified fro m table 17.1.
Consider next the reduction
T(alx T IP) = I myTIYl (17.21)
r
of the product of two irreducible representations and let us determine two
rather special coefficients m y, namely m[n ] and m[1 J I . .. ]- From the general
formula (4.45) we have
m[n] = (n!) -I I cp x ~") x~) (17.22)
p
having inserted the known characters 1 and 7rp for T[n] and T[III ...] and used
equation (17.20). Finally, using the orthogonality formula (4.25b) for charac-
ters and the known property that the characters of Y n are real, we have from
equations (17.22)
m[n]=8",lb m[III . . . ]=8;,p (17.23)
The meaning of these results is that the totally symmetric representation can be
produced onl y if the two irreducible factors are the same, or at least equivalent
in the sense defined in section 4.6. In the same way, total ant isymmetry may be
obtained only in the product of a representation with its adjoint. We ha ve
made use of the latter result in constructing antisymmetric wave functions in
atomic and nuclear problems, see subsection 8.6.4 and chapter 12. For
example, in the atomic structure problems of chapter 8, an orbital state with
symmetry (X must be combined with a spin state of symmetry &. Furthermore,
17 ,11 The Symmetric Group Y' 441
n
since an electron with spin 5 = t has only two spin states, the Young diagrams
(i may have , at most, two rows , Thus the Young diagrams (X for the orbital part
must have, at most, two columns. The precise relat ion between the total spin S
and the Young diagram a is discussed in section 18.10. The result is that S
= tOil - il 2 ), where ill and il 2 are the lengths of the two row s of a,
To actually produce such totally symmetric or a ntisymmetric functions
from the products it is of course necessary to sum over products of different
basis vectors of the two factors with appropriate Clebsch - Gordan coefficients
following the general arguments of section 4.17. These coefficients are easily
deduced, see problem 17.6,
In this section and the next we study a new kind of product of two
representations called the outer prod uct. Starting from a representation T(2) of
Y'n and a representation T(2') of Y'n' we generate a representation T of Y' n+ n' .
From the physical point of view this process is useful in building up wave
functions for (n + n') particles, given those of the smaller systems of nand n'
particles, but we shall not give examples. From the mathematical point of view
there is an interesting connection between this outer product for the symmetric
groups and the usual inner product for the unitary groups. This connection is
described in chapter 18.
Consider products of the type
};(I, 2, . , ., n) gj(n+ 1,n+2" . .,n+n') (17.24)
where the}; transform according to some irreducible representation T(2) of the
group Y'n on particles 1 to n and the gj transform according to some
representation T(2') of Y' n' on particles n + 1 to n + n'. The set of products
(17.24), where j runs through the s" basis vectors of the representation T(2) andj
runs through the 52' basis vectors of T(2'), will not form an invariant space
under Y'n+ n' but if one enlarges the space to allow all permutations between
the particles inland those in 9 then this larger space will be invariant. Allowing
for the number (n + n')! jn!n'! of ways of selecting n particles from (n + n'), the
dimension of the larger space is given by
(17.25)
Hence this large space will provide a representation T of Y' n + n' but not, in
general, an irreducible one . The representation T of Y' n + n ' is called the 'outer
product' of the representations T"I of Y'nand T" ') of Y' n' and we use the notation
T = T(2) ® T(2'). We are therefore faced with an interesting problem: given the
irreducible representation labels (X and (x' which are partitions of nand n' ,
respectively, can we deduce the possible irreducible representations of Y' n + n'
which appear in the reduction ofT? Let us first consider a few simple examples.
442 The Symmetric Group yo
n
17 .11
If n = n' = I, then the dimension of T is two, with the two basis vectors
j( I )g(2) andj(2)g( I). It is clear that by taking the symmetric and antisymmetric
combinations, thi s two-dimensional representation reduces with respect to the
group yo2 ' In terms of Young diagrams we would write
For the product rn ® D with n = 2, n' = I there are three basis vectors
and, if we write the symmetric two-particle functionf(l , 2) of equat ion (17.24)
as ¢(1)¢(2), then they may be taken as ¢(I)¢(2)g(3), ¢(I)¢(3)g(2) and
¢(2)¢(3)g(l). This set is identical with the set generated from the function
<1>( [21] ) of equation (I7.13)and we have seen in table 17.1 of section 17.5 and
in section 17.9 how this three-dimensional space breaks up into a single
symmetric vector and a pair of mixed symmetry [21]. Thus
rn @ D = OIJ + BJ (17.26)
ffiIl
P¢(I)¢(2)g(3) as P runs through Y03 the only new component which can be
produced will be the remaining basis function of the representation
[21]. Thus we have equation (17.26) which says that the outer product contains
all irreducible representations of yo3 which may be formed by adding one
square to the original diagram m
This argument may be extended to the more general reduction
0 =t
T(2) ®
.. T(7') (17.27)
where the sum runs over those tableaux a' which are obtainable by adding one
square to the tableau a. It is even possible to deduce a rule for obtaining the
coefficients m(f3; a, a') for the most general case
T = T 12 ) ® T 12 ' ) = tm(f3 ; a, a')TI/l) (17.28)
(i
are added before the c's and so on, (ii) that a Young diagram is obtained at
every stage, (iii) that no two identical letters appear in the same column, (iv)that
if the added letters are finally read off from the completed diagram beginning
with the first row and reading from right to left, then at no stage must the
number of b's exceed the number of a's or the number of c's exceed the number
of b's, etc.
(3) The final Young diagrams 13 which result from this process when all
letters have been added will appear in the sum (17.28) with non-zero
coefficients. If, by the given procedure, a particular diagram 13 may be built in a
number p of different ways then for that 13, the coefficient m(f3;a, rx') = p in
equation (17.28).
Consider two examples.
(i) IIIII@~ L-.L..--'--'------'-I_o,-I0--,-10-JI + ~ [ I I0 I0 I
so that
(ii)
so that
T(2I] @ T[21] = T[42] + T[411] + T[33] + 2T[321]
+ T[3I.II] + T[222]+ T[22J I] (17.29)
In example (ii) we see that the partition [321] arises twice in the process,
meaning that the representation T[321] appears with m = 2 in the reduction
(17.28). It is worth making a dimension check on such reductions using
equation (17.25). The dimensions in the two cases are I x I x 7!/4!3! = 35 in
example (i) and 2 x 2 x 6!/3!3! = 80 in example (ii). These numbers are soon
verified to be equal to the sums of the dimensions of the representations TtP)
obtained in the reductions, including the factors m(f3; rx, «'),
group [/. x [/. ' is a subgroup of .<f'. +.' (e.g. in section 17.11, [/. could refer to
particles numbered I, 2, . .., nand [/. ' refer to particles n + I, n + 2, .. .,
n + n'.) Consequently an irr ed ucible representation T ' fl )of[/ n + n' will generally
reduce as a representation of the subgroup .<f'. x Y n' ,
(17.30)
where the two Young diagram labels a. and ::x.' are necessary to specify an
irreducible representation of the product group [/. x [/ n' . The coefficients
m(p;::1., a.') in this reduction may be deduced in the usual way with the help of
character tables and using the result (4.66) for the characters of a product
group.
The reader may have noticed that the same notation m(p ; a., a.') has been used
for the coefficients in two apparently unrelated equations, (17.28) and (17.30).
This is deliberate because we now proceed to show that the coefficients are the
same, as the notation implies . From equation (17.30) we have
and using the orthogonality of the irreducible characters of [/. x [/n the
coefficients are given by
I
-,
n!n!
LG.G '
XIJlJ(GG ')X(, x,'l(GG') = m({3;a.,a.') (17.31)
For the particular case n' = 1, the reduction (17.30) is none other than the
reduction Y n + 1 --+ Y n described in section 17.7. It was seen there that the
coefficients m were all unity which provides another way of deducing equation
(17.27).
As an example of the reduction (17.30) consider the group Y 4 and its
subgroup Y 2 X Y 2 for which the character table is reproduced in table 17.4:
Table 17.4
The classes for the product group are just the possible products of the classes
(11) and (2) for each Y 2 group. Likewise the irreducible representations, all
one-dimensional, are the possible products of the representations [2] and
[11] of each Y 2 group. For the irreducible representations of Y 4 the
appropriate characters are read ofT from the character table for Y 4 ' given in
table 17.1 of section 17.5, by identifying the classes (11)(2) and (2)(11) as
belonging to the class (211) of Y 4 ' while the product (2)(2) belongs to the class
(22).The reductions given to the right of the table are deducible by inspection,
although of course one may use the orthogonality formula (4.28) if in doubt.
From this example we may deduce the corresponding reduction (17.28) for
the outer product
[2] <9 [2] = [4]+[31]+[22] (17.34)
by quoting the appropriate coefficients m(p; [2][2]) given in table 17.4.
One soon verifies the dimension check on equation (17.34), namely
1 x I x 4!/2!2! = 1 + 3 + 2.
~- N
~- M
~
'-
0
;:::;'
....
f::.
-o
iJ
- N
,,"',-
?
~
c:
0
<;;
c
.'"
II)
- M
.
II)
0..
~
II)
Il)
:0
'u
.
::l
"0 - "'1"
Il)
~
II)
-5
c: -I"'
I
-o
0..'" - N
'-
0
se
~
' 1:
<;; 001<>
E ?
II)
s: - M
f-
.,...
-
-,;
~
~
-C>
~
- "'1"
lJ
- V)
448 The Symmetric Group Y 17.13
n
[42] are shown in table 17.5. The derivation of the numerical values is
described later. Notice how the first row is labelled by the tableau CiTiT3T4l
~
which is of type (a) . The tableau y for this row is 111 213141 which does not
occur again , implying that the only non-zero element in the first row must be
on the diagonal. Notice how the second row is of type (cd and has for its
partner of type (c 2 ) the fifth row, these two and none others having the tableau
ffilill in common. There are therefore non-zero matrix elements coupling
the second row and fifth column and the fifth row and second column but,
apart from the diagonal positions, all other matrix elements in the second and
fifth rows are zero . The rules for deducing the few non-zero matrix elements
are as follows:
These rules may be used, as explained above, to deduce not only the matrix
of P 5 6 ' as given in table 17.5, but also the matrices for all other adjacent
transpositions. Thus, for example, the matrix of P 34 in the representation T[ 42]
is as follows:
-1 J&1
J~ 3
-1
17 .13 The Symmetric Group Y 449
n
Pn- l.n Pn- 2,n- I Pn- l. n = Pn- 2.n- I Pn- l.n Pn- 2.n - I (17.36)
which is soon verified. Suppose that the pair of tableaux of types (c I) and (c 2 )
occur in rows rand s of the representation T and let us construct the matrix
element in row r and column s, of the two equivalent expressions in equation
(17.36). We note that P, _ 2,n _ 1 must have a zero matrix element in this position
since the Young diagrams obtained by removing the number n from these two
tableaux have different shape and Pn- 2,n- I belongs to the subgroup Y n- I '
The diagonal matrix elements of Pn_ 2, n_ I are now assumed to follow the rules
set out above so that
1
Tss(Pn- 2.n-l)= -/ 2
where II is the axial distance from n - 2 to n - 1 in the tableau of row rand 12 is
the axial distance from n - 2 to n - 1 in the tableau of row s. Using (17.35) for
T(Pn-l.n) the identity (17.36) gives
a/~1 ~ ( I - a 2 ) - ~ ( I - a 2 ) / 2 I a = I~I ~ (1 - a 2) / 2 1
the rules to be valid for P n _ 2.n _ I we have shown that they are then valid for
P, - \,n and hence the inductive proof is complete. The rules (1) and (2) may be
regarded as special cases with axial distances of =+= 1, respectively.
in a chosen class C(j is a multiple of the unit matrix and that this multiple ). is
related to the character in that class by the formula ).. . = ~x.~) [s; where ~ is
the number of elements in the class and Sa is the dimension of the
representation. In the symmetric group f:l'n the transpositions P i j form a class
(211 .. . ), in the notation of section 17.3, so that as a particular case the sum
multiple l, of the unit operator and z, may be deduced from the character table.
This operator provides a convenient measure of the symmetry of the
representation. For example in the totally symmetric representation it has the
value 1n(n - 1) and is the negative of this in the totally antisymmetric
representation. In a self-adjoint representation it vanishes for the reason given
in section 17.10. We may regard this class operator as measuring the difference
between the number of symmetric and antisymmetric pairs remembering of
course that individual operators T(Pij) are not generally diagonal. There is a
simple formula for the numbers J, which enables them to be deduced quickly
from the Young diagram for T(a). Ifnpdenotes the length of the pth row and mq
denotes the length of the qth column then it may be shown, see problem 17.7,
that
Bibliography
The relation between the group !I' n and the continuous groups U Nand!Yi N
is described and useful characters and formulae are given by
Problems
17.1 Write the permutation (~~ii~~ las a product of cycles and hence show that it
belongs to the class [321] and has odd parity.
17.2 Construct the character table for the group // 4 using the methods of section 4.15
and check your result with table 17.1 of section 17.5.
17.3 Construct the table 17.1 of values oftjJ[n,n, ... ] and X[n,n, . . . ] for the case
n = 4.
17.4 Find the dimension of the representation [32] of Y 5 by listing the possible
Young tableaux.
17.5 Use the methods of section 17.9 to construct representation matrices for the
permutations P I Z ' P 23 and P 3 4 in the representation T[3I ] Or.'/'4 and compare
your results with those given by the rules of section 17.13.
17.6 Show that if ; ; and gi denote sets of functions transforming according to an
orthogonal matrix representation T(~)of y
Y n then the sum F s = r;;.
gi is totally
symmetric. If hi denotes a set transforming according to the adjoint repre-
sentation T[ II ... J X T''" then show that Fa = r;; hi is totally antisymmetric.
17.7 Deduce the value(17.37)fortheclass operator 0 ~ r T(Pij)inarepresentation
i <t
T[ n, n , . . J by using the following steps: (a) Let I be the basis vector whose
Young tableau y has numbers 1 to n l in the first row , n 1 + 1 to n 1 +n z in the
second row , etc, and construct the function 9 = AI, where A is the product of the
antisymmetrisers for the columns of y. (b) Write 0 = Or + 0, + Om , where Or
contains pairs ij in the same row of y, Or contains pairs in the same column and
Om the remaining pairs. (c) Show that O,y = - 1L !!!q(m q - I )y. (d) Show that
(Or + Om)AI = A(Or + Om)}. (e) Show that OJ"=- ! r mp(m p - I)}. (f) Show
that AOmI = O. P
18
The Unitary Group UN
The unit ary groups SU 2 and SU 3 have been studied in so me detail in chapters
10 and 11 and their irreducible representations were found by constructing
diagrams in one and two dimensions, respectively, as in figure s 11.1 and 11.5 of
section 11.6. This method clearl y becomes difficult for larger N , as we found in
chapter 12 in the brief encounter with SU 4 and SU 6 . In the present chapter we
introduce a simple method for describing the irreducible representations of
U N for general N which makes use of some properties of the symmetric group
Y o obtained in the preceding chapter. In fact these two very different groups
are related in an intimate way which we shall demonstrate in this chapter. The
relation is seen most clearly in the physical situation where it is of most interest,
namely in the classification of the product wave functions for n particles, where
each single particle may be in one of N different states. Thus one may consider
both the permutations of the particles and the unitary transformations in the
N-dimensional vector space of states of each single particle. Although we shall
be deducing properties of the group U N quite generally we shall use the
language of particles and sta tes.
From the physical point of view we ha ve seen in chapters lO, 11 and 12 how
the groups SU 2 , SU 3 , SU 4 and SU 6 have significance as approximate
symmetry groups of the nuclear and elementary particle Hamiltonians. The
discussion of U N in this chapter for general N helps to unify those earlier
452
18 .1 The Unitary Group U 453
I N
treatments. We shall also see how the groups V 2/ + 1 and V 2 j + 1 may be used to
classify the complicated many-electron wave functions which may be built
from n electrons in valence orbits with angular momentum lor j . In this latter
problem the unitary groups are being used more as a mathematical device than
as a symmetry group of the Hamiltonian.
We first describe the irreducible representations of V N and then make use of
a chain V N --> V N _ I --> •• . V 1 of subgroups to label the basis vectors of these
representations. In section 18.7 the general results are related to the simple
cases of S V 2' S V 3 and S V 4 discussed in earlier chapters. The infinitesimal
operators for V N are described in section 18.8. Some applications of V N in the
study of many-particle wave functions are discussed in section 18.10.
Sections 18.11 and 18.12 are concerned with the characters of the irreducible
representations of V N and the last section discusses the connection between
SV 2 and Yf3 ·
different factors may be the same. We shall refer to the second labell , 2, .. . , n
as the 'particle' label and the suffix as the 'state' label. (This language is taken
from the physical problem of constructing a product wave function for n
electrons, each of which may be in one of N degenerate single-particle states.
For example N = 2 would relate to the pos sible spin states of n electrons each
with spin s = t, while N = 21 + 1 would relate to the possible orbital states ofn
electrons in a valence orbit with angular momentum 1and projections m, = I,
1-1, . . . ,-1.)
The transformation induced in the product space (18.2) by the unitary
transformation (18.1) on all particles is denoted by T(U) and given by
T(U)<I> = ¢;(I)¢j(2)¢~(3) . .. ¢~(n)
while
and the right-hand sides of these equations are identical. Hence T(U) will
commute with the projection operators pl:z) for g o, where (IX) == [n\ n 2 •• •] is
an irreducible representation label for go ' Thus if we reduce L into subspaces
Ll:z) corresponding to each projection
L=L Ll:z)
12)
then it follows that each subspace Ll:z) is invariant under T(U) because
T(U) pl:z)<I> = pl:z) T(U)<I> = pl:z)<I>'
showing that T(U) operating on any member of LI:zI produces a vector also
lying within i». Furthermore, each space Ltd is clearly invariant under the
group go so that, since the permutations commute with the unitary
18.1 The Unitary Group UN 455
transformations, the space L'(I.) is invariant under the product group 9'. xU N.
Thus we may introduce a notation <1>~.j for the basis vectors of L'(I.) where, for a
unitary transformation,
T(U)<1>~~ = Ix'
U~!x<1>~~, (18.4)
Here the index t is the usual row label for the irreducible representation T'(I.) of
9'" described in chapter 17, while x is a new index introduced to provide a basis
for all vectors of L'(I.) with the same t. Thus U'(I.) is a representation of UN and it
is this which interests us in the present chapter. Now the representation T'(I.)(P)
of 9'. is necessarily irreducible by its method of construction through the
projection operator P'(I.) . It is a remarkable fact that the representation U'(I.) of
UN generated in this way is also irreducible but we shall not give the proof. The
set of representations U'(I.) for all partitions of all integers n is not actually a
complete set but with a small extension described in section 18,9 they become
complete, The set of basis vectors <1>~~ of L'(I.) for fixed Ct. is conveniently pictured
in the rectangular array shown below
<1> '(I.) <1>'(1.) <1>'(1.)
11 12 13 <1>\~
<1>'(1.) <1>\7)
21 22 <1>~~,
<1>(~)
3\ (18 ,6)
and by equating the dimension on both sides of this equation we get the
relation
N· = I sod, (18.8)
456 The Unitary Group UN 18.2
between the dimensions Sa. and da of'T'" and u(a), respectively. It is important to
realise that the partition IX serves a double purpose of labelling irreducible
representations of both the finite group Y n and the continuous group UN . For
fixed N we may choose any positive integer n, corresponding to a different
number of factors in the products (18.2). Thus, as expected for a continuous
group, we have an infinite number of irreducible representations. There is,
however, one limitation on the possible partitions [n 1n2 .. .]. Remember
from chapter 17 that a partition into p parts implies total antisymmetry
between sets of p particles out of n. However if p > N this is impossible because
there are only N different single-particle states. Hence the irreducible
representations U'" of U N are labelled by the partitions [n 1 n2 • • •] of any
integer n into not more than N parts. In other words they may be labelled by
the Young diagrams with not more than N rows.
The subspace L(3] is simply the set of totally symmetric combinations of these
and is easily constructed,
L(3]: 1111), (1 112 ) + 1121) + 1211) )/ J 3,
1222), (i221) + 1212) + 1122»)/)3
The vectors given here for y[2] and vl II] are precisely those given in example ( I)
of section 18.2. The two pairs of vectors given for the y[I] representation are
different but transform in the same way under the group U 2 on states <PI and
<P2' In fact they transform like the defining 2 x 2 matrices of U 2 corresponding
> >.
to n = I and the vectors II and 12 The remaining vector 133 is clearly >
invariant under U 2 '
To prove the reduction formula (18.9), or 'branching law' as it is sometimes
called, we make use of the result given in section 17.11 for the outer product of
irreducible representations of g n' First we reduce the space L of all products
(18.2) into subspaces L = i.
,= 0
L, corresponding to behaviour under the
(18.10)
diagrams an _ , will describe the behaviour both under UN _ I and under g n_, '
To find the behaviour of the products (18.10) under Y'n' we need to form the
outer product of 'J.n r of g n_ r with the totally symmetric representation [r] of
- r
.'/, which relates to the last r particles in (18.10). But this can be done as a
specia l case of the rule (17.28) with the result that for given 'J.n _ r the possible
Young diagrams an relating to ''/n are obtained by adding r squares to the
diagram an_ r in such a way that no two added squares lie in the same column.
Reversing this analysis we may now deduce the result (18.9) that , given anand r,
the possible representations of UN - I correspond to those Young diagrams
an-r which may be obtained by the removal ofr squares, no two of which lie in
the same column.
preceding section. The result is that the basis vectors ofUl".) are labelled by the
sequence
of Young diagrams where an- ' N defines the representation of the subgroup
U N- I to which that vector belongs, an- 'N - ' N _ I refers likewise to the subgroup
UN _ 2 and so on . The justification that this labelling system is unique, i.e. that
no two basis vectors of U "·) have the same set of labels , follows so closely the
arguments in section 17.8 that we do not repeat them. As an illustration, the six
basis vectors of the representation U[2] of U 3 which were discussed in the
previous section would be assigned the sequences of partitions shown in the
second column of table 18.1.
Table 18.1
III ) [2] , [2], [2] [ill]
(j12 )+121 »)/ J2 [2] , [2], [I] [JTIJ
122 ) [2], [2], [0] 0IJ
(113 )+131 »)/ J2 [2], [I], [I] OJIJ
(123 )+132 »)/ J2 [2] , [I], [0] 0IJ
133 ) [2], [0] , [0] [ill]
HPBfJffiTIHPJBFJBfJBPJHPJ (18.12)
It must be stressed that this device of writing state labels into the Young
diagrams for U N is quite distinct from the device of writing in particle labels
which was used in section 17.8 for Y n'
Given any two representations U(~·) and ut~'·) of U N we may form the direct
product as defined generally in section 4.17. This provides a larger repre-
sentation of dimension equal to the product of the dimensions of the two
original representations. We are interested in the reduction of this product
representation into its irreducible constituents and once again the result is
simple to state.
Consider the space L of products of the kind (18.2) for n + n' particles. As
explained in section 18.1 this space reduces
T = ! T(Plx UIi )
P
into irreducible representation s of the product group Y n + n ' X UN labell ed by
partitions f3 of n + n', Restriction to the subgroup Y n x Y n' X U N leads to the
further reduction
Finally, comparing this with the preceding equation gives the desired result
4.21 we know that the irreducible representations uta) of V N must take the
form of a direct product of irreducible representations of V I and SV N.
The representations of V I are, of course, one-dimensional with the form
exp( - iml/J), where m is any integer. Since the representation UIZ) was
constructed from n factors in section 18.1 it follows that the direct product
form for UIZ) is
U IZ) = exp( - inl/J) x U(:i) (18.17)
where n is the number of squares in the Young tableau a. Hence, the irreducible
representation uta) of S V N has the same dimension as UI» . In other words, U IZ)
does not reduce on restriction from V N to the subgroup SV N. We may
therefore use the same partition notation (a) = (a) = [n l n 2 • • . ] to label the
irreducible representations ofSV N. However, whereas for V Nit is obvious that
two representations with different n are inequivalent that is not the case for
SV N as we now demonstrate.
Consider the representation [11 . . . 1] of V N, where n = N . It must be one-
dimensional since there is only one totally antisymmetric combination which
can be formed from the products (1 8.2), namely
'¥ = Ln(P)PcPdl)cP2(2)
p
... cPN(N) (18.18)
where the sum runs over all permutations of the particle labels and n( P)
denotes the parity of the permutation P. Using equation (18.3) the effect of a
unitary transformation on '¥ is given by
Because of the pre sence of the antisymmetrising operator, the only com-
binations of suffices ij ... p to give a non-zero result are those in which no
suffix is repeated so that ij . . . p is a rearrangement of 12 . .. N. Using the
notation ( :~ . . .t ) for the permutation which carries the particle labels ij . .. p
into 12 . . . N the equat ion (18.19) may be written
T( U )'¥ = '\
L. Vii V j 2 · • •
'\
VpNL.n(P)P (i j . . .P ) . . · · · cPp(p)
cPi(l)cPh)
ij .. . P P 12.. . N
.. )
We stress that the permutation C~ · ~ operates only on the particle labels, as
does P. However, since the order of factors is not significant we may rearrange
them so that cP d I) comes first, cP 2 (2) second and so on . Thus we have
T(U) '¥
'\ PVii V
= ij '-:. j2 • • •
'\
V p N~ n(P)P 12
(i j P )
N cPdl)cP2(2) cP N(N)
= I.
ij ... P
Vii V j 2 .. . V PNn(~~' . .N) L n(P ')P'cPdl)cP2(2)
IJ . . . p P'
cPN(N)
464 The Unitary Group UN 18 .7
where we have defined P' as the product of permutations P' = P( ~~ : : :~) and
used the result of section 17.4 that the parity of a product is the product of the
parities. We have also used the fact that as P runs over the group 9' N so does P',
see section 2.9. Finally we note that the sum over P' is just 'P, while the sum
over ij . . . p gives the determinant of the matrix U so that
T(U) 'P = (det U) 'P (18.20)
The significance of the result (18.20) is that 'P is invariant for transfor-
mations of SU N and so we have an equivalence between the representation
labelled by the partition [11 . . . 1] of n = N and the identity representation
given trivially by n = O. In terms of Young diagrams this implies equivalence
between the single complete column of N rows and the trivial diagram with no
squares. Further equivalences now follow using the results of section 18.5 for
multiplying representations. If u(a) is any irreducible representation then from
the rules quoted for equation (18.13)
u(a) X U[II .. . I] = U(/ll
[IT]x[TI =1111I1+BTIJ+EtP
=111111+ [IT] + 0
In the more familiar notation D<i) which was used for the representation of
SU 2 in chapters 7 and 10, this is an example of equation (7.44)
Dd)x 0(1) = D(~)+ D(~)+ D(!)
using the identification of urn] with D(!n) which we discuss in the next section.
We now make contact between the preceding discussion of SUN and the
particular descriptions of SU 2, SU 3 and SU 4 given in chapters 10-12. In the
18 .7 The Unitary Group UN 465
Thus the octet (11) corresponds to the Young diagram B=J and the decuplet
sz ~~e
2
0
I
0 0
0
0
-I
) C
o
o ,Tz =
1
2
0
0
0 0
-I 0
0 I j)
0)
0 0 0 -I 0 0 0
c
0
1 0 -I o 0
v., = 2 ~ 0 -I 0
0 o 1
rj in the notation used there. The set of numbers (r. t z . .. rN ) is calIed the
' weight' of the basis vector. For example the eight basis vectors given in
equation (18.12) have the folIowing weight s, in the sa me order,
(210 )(1 20 )(201 )(\ 11)(021 )(1 11)(102) (0 12 ) (18.21 )
Notice that the weight s do not pr ovide a unique labelIing system whereas the
sequence of Young diagram s described in sect ion 18.4 does. It is con ven ient to
order the po ssible weight s and we say that (r', r ' 2 . . . r ' N ) is greate r than (r i -
r 2 • • • r N) if r', > r i - For the case r, = r \ the ordering is done by the values of
r ' 2 and r 2 and so on . Thus the greatest weight in the set ( 18.2 1) is (210) . It can be
seen from the definiti on of r I that, in an arbitrar y irreducible representation
U '~ ' where (J. == [n, n2 . . . ], the greatest weight is the sa me as the partition
r. = n\ , r 2 = n2' . . . , etc . Thu s the greatest weight serves a s a label for the
representation U'·) , a result which is a generalisation of the use of the greatest
m( = j) to label the irreducible representations D,j, of .~ J .
The matrices of the ra ising and lowering operators may be obtained by a
generalisation of the method of subsection 7.4.2 making use a lso of the chain
of subgroups. We shall not give any details.
Making use of this larger set of irreducible representations we can now show
the equivalence
(18.23)
nl
n2 I
n3
I
. ,'
N rows
f3
n N_ 1
I
nN I
Figure 18.1
N = 2 U[I]*=: (det ur ' U[l], U(2]*= (det U) - 2 U(2], U[II]* = (det U)-I U[O
U(2I]*=:(det Uj -ZU[I]
N=3 U[I]*=(det U)-IU[lI] , U(2]*=(det U) -2U(22], U[II]*=(det U)-IU[I
U(2I]*= (det U) -2 U[2I]
(I.Jl)* == ~ * == /IA.'+
IL:J" f == (JlI.)
so that in particular (10)* = (0 1), and (11) is self-conjugate.
Because of the way in which we have introduced the group UN, through
products of the kind <I> in equ ation (18.2), it should come as no surprise that the
group U N has a role to play when constructing atomic wave functions for
severa l valence electrons. Consider n valence electrons in single-pa rticle states
</J'm with fixed 1and the usual range of possible values of m = I, (1- 1), .. . , -I.
From the arguments of sectio n 18.1 it is clear that the product wave functions
with permutat ion symmetry described by a partition (J. = [n J n2 .• • ] of n will
also transform according to the irreducible representat ion UbI of the group
U 21 + I of unitary tran sformations among the (21 + 1) single-particle states </J'm '
Although the Pauli prin ciple demands that the wave function be totally
antisymmetric with respect to the permutat ion of particles, the pa rtition (J.
refers only to the orbital coord inates. Thus (J. itself need not be the partition
[11 . . . 1] but it mu st be capa ble of yield ing thi s part ition when combined
with the spin part of the wave function . Th is problem was d iscussed in
subsect ion 8.6.4 and in section 17.10, where it was shown that the spin part
must transform according to the adjoint representation Ii of the group Y n of
part icle permutation s in the s pin coord inates. (Remember that Ii is related to (J.
by the interchange of row s and columns in the Young diagram.) However, the
spin part of the wave function is itself a product of the kind (18.2) with N = 2
so that the spin part of the wave function transforms according to the
representation U (:i ) of U 2' Thus the Young diagram Ii may ha ve two rows at
most and hence the orbital part ition (J. has two columns at most and, of course,
21 + 1 row s at mo st.
We have seen in sect ion s 18.6 and 18.7 the relation between representations
of U 2 ' SU 2 and fJ1i 3 so that Ii immediately determines the spin S of the system
of n particles. Ifwe write Ii = [ill ' il 2] then S = -! (ill - il 2) . Thus the label (J. of
the representation U'" of U 21 + I is already specified when nand S are given and
in a sense this new label (J. is superfluo us. A further problem is to deduce the
possible values of total orbital angular momentum L which may be associated
with given nand S. However, since L is the representation label for the group
fJ1i 3 we are asking how the representation Ulzl of U 21 + I reduces on restriction
to its subgroup fJ1i 3 '
U( ' )= ImLD(L ) (18.24)
L
(The coefficient mL should not be confused with the state label m.)
470 The Unitary Group UN 18 .10
Table 18.2
[111J [3J J
2 0
[21J [21J t 2, I
n valence nucleons are considered it is clear that the group U 2j+ I takes the
place of U 21+ I ' The reduction from U 2j + I to 91 3 gives the values of J as in
equation (18.24). The association between J and isospin Tis analogous to the
association between Land S in the atomic case; for a given representation U II )
of U 2j+ I the isospin part of the wave function must have adjoint symmetry a
t
giving T = (ill - 11 2 ),
The introduction of the groups U 2/+ I and U 2j+ I into these problems of
atomic and nuclear structure is in a sense disappointing because they carry no
new physical information. The representation labels IX are , as we have seen,
equivalent to the spin, supermultiplet or isospin labels, depending on the
system . However, they may be used as a first step in defining a new group
which, depending on the nature of the interaction between particles, may
carry useful physical information. To illustrate this idea, consider three
electrons in the atomicf-shell, i.e. I = 3. In the reduction (18.24) corresponding
to IX = [21] the non-zero values of m L are given by m I = m 6 = m 7 = m s = I,
m 2 = m3 = m4 = m s = 2. Unlike the case shown in table 18.2 there are now L-
values for which mi. > 1,so that the quantum numbers S, L , M sand M L are no
longer sufficient to define the possible states. Ofcourse physically this problem
is resolved in perturbation theory by diagonalising the Coulomb interaction
numerically. A 2 x 2 matrix occurs for L = 2, 3, 4 and 5. From the group-
theoretical point of view there is a possibility of finding a labelling system for
these pairs of states if one can find-see Racah (1965) in the bibliography-a
new group '§ which is a subgroup of U 2/ + I and which contains 91 3 as a
subgroup. One may then carry out the restriction from U 7 to 91 3 in two steps
U 7 -> '§ -> 91 3 , In this way the basis vectors (wave functions) would carry also
a representation label for the group '§. In some , but not necessarily all, cases
the pairs of states referred to above will carry different labels and therefore be
distinguished by their properties under this intermediate group '§. The
introduction of such a group '§ may be useful on two levels. It may simply be
used to provide a convenient, system atic labelling scheme within which to set
up the matrix of the interaction. It may also be significant physically , if the
interaction has approximate invariance with respect to '§. In this case the use
of a basis labelled by irreducible representations of '§ will approximately
diagonalise the energy matrix and the labels may be associated with physical
states.
To find the intermediate group '§ we must Impose restrictions on the unitary
transformations without excluding any of the ~ 3 transformations. A system-
atic way of doing this is first to write down the commutation relations of the
infinitesimal operators of U 2/ + I ' One then searches for a subset of the
operators of U 2/+ I which contains the operators of 91 3 and which is closed
under commutation. By this we mean that the commutator of any pair in the
subset gives a linear combination of operators in the subset. From the general
472 The Unitary Group UN 18 .10 .1
definitions of section 7.2 and from equation (7.7) in particular, such a subset
defines a group which is necessarily a subgroup of U Z/+ 1 and which contains
Yl 3 as a subgroup. By comparing the structure constants obtained for <§ in this
way with those of all known groups one may identify <§. This technique shows
that, for arbitrary I, the group Yl z1+ 1 may be chosen for <§ but that, for the
small 1of interest in physical problems, very few other possibilities exist. The
more direct way to introducefii ZI+ 1 is to restrict the unitary transformations U
by the condition that the two-particle function
(18.25)
In other words, Uk; = (U- 1 b but since U is unitary th is implies that U is a real
orthogonal matrix, which defines the group Yl z1+ r- To show that the physical
Yl 3 is contained in this group Yl z/ + 1 we must find a basis ¢; for the (21 + 1)
states with angular momentum 1such that the function <1> defined in equation
(18.25) is invariant under rotations. One sees immediately that the usual m-
basis I/Im would not suffice. However if we define a new basis
(18.26)
m
(r 1 r 2 • • • r,) and we give in table 18.3 a few examples of the reductions for the
case 1 = 3. It is seen that the new ~ 7 label has succeeded in distinguishing the
two L = 3 states for n = 3 and S = ! but has failed to do so for the pairs of
states with L = 2, 4 and 5.
Table 18.3
n U7 Jt 7 Jt 3 (L)
[I] (I (0) 3
2 [2] (000) 0
(2(0) 2,4,6
[II] (110) I, 3, 5
3 [21] (I (0) 3
(210) 1, 2, 2, 3, 4, 4, 5, 5, 6, 7, 8
[111 ] (111) 0,2,3,4,6
Table 18.4
n Vb SPb 9l 3 (J )
[I] (100) ~
2 [2] (200) 1,3,5
[11] (000) 0
(110) 2,4
3 [21] (100) t
(210) !, i, }, (W, t 1[-, ¥
[Ill] (100) 5
I
(111) J. 2
2, 2
one also sees that, for odd n and each U 6 symmetry, there is a state with v = 1
which necessarily has the same spin J = t as a single particle. For even n there
is always a state with seniority v = 0 and hence J = O.
From the physical point of view, the group 9l l /+ 1 has significance in
describing the levels of the configuration In if the perturbing interaction
between pairs of valence electrons exhibits degeneracies appropriate to the
group. In other words , for the case of 1= 3 illustrated in table 18.3 we would
need to find the L = 2,4 and 6 states for n = 2 at about the same energy and the
states with L = 1,3 and 5 lying together at some other energy. In practice with
the Coulomb forces this does not happen since the states with high L are
depressed, see subsection 8.6.5. Thus, in atoms, the group 9l z/ + 1 is useful only
as a means of setting up a systematic framework in which to diagonalise the
perturbation.
In nuclei, however, the strong spin - orbit force leads to jj-coupling and the
attractive force between valence nucleons then depresses the state with low J .
This is the reverse of the atomic situation since the force between electrons is
repulsive. In fact, the short-range attractive nuclear force not only depresses
the J = 0 state but also leaves the J = 2 and 4 states, shown in table 18.4 with
labels [11] (10),close together in energy . Hence in nuclei, the group SPlj + 1 has
real physical significance. As a result, the ground states of nuclei would be
expected to be described by irreducible representations of SPlj + i - Because in
the two-particle system the invariant (000) lies lowest in energy one might
expect the ground state for general n to contain as many such pairs as possible,
consistent with the Pauli principle. This may be verified in a more detailed
model which shows that for odd n the ground state has seniority v = 1, the
representation (100), while, for even numbers of neutrons and protons
separately, the ground state has v = 0 with representation (000). These
conclusions agree with, and hence provide justification for, the simple single-
particle nuclear shell-model which assumes that all pairs of neutrons and
protons couple to J = 0 leaving the spin of the nucleus equal to that of the last
odd nucleon. (See also Section 19.2.)
18.11 The Unitary Group U 475
N
18.11 Characters
matrix of U') in the basis described in section 18.4will also be diagonal and the
matrix element corresponding to a basis vector with weight (r 1 r 2 • • • r.) will be
(-i ,f
simply
exp rjo j )
J = 1
x' 1 =
2
L
'1'2 , 'N
exp (- if. r OJ)
j = l
j
(18.28)
476 The Un itary Group UN 18 .12
where the sum runs over all the weights belonging to the representation IX, as
described in section 18.4. With the help of some rather elaborate algebra this
sum may be given in closed form, as the quotient of two determinants,
f."! + N - I
'I
£,I1 ,+N - 2
'I " e~~ I f,'~ - I e'i - 2
X(·) =
e1! + !\i - 1 e1,+ N-2 e1"'- l e'~ - 1 eN2 - -)
(18.29)
e "! +N - I
e l~ "'- fN - I
#N
' f..-
in)=exp(-in0tl+exp[-i(n-I)Ot-i02]+ . . . +exp(-infl 2 )
=exp[ -tin(O) +02)]sin{t(n+l)(0)-02)} /sint(O)-02)
a result which also follows from equation (18.29). On restricting to the group
SU 2' putting 0 1 + O2 = 0, this reduces to the familiar expression (7.42) with
n = 2j.
(18.30)
of generality we may take U(b) to be diagonal (this defines the basis) with
matrix elements Uj j(b) = ej in the notation of the previous section. Since we are
only interested in the limit a -+ 0 we need consider only small a for which the
arbitrary U(a) differs from the identit y by a skew -Hermitian matrix
I-iall a l 2 - ia 21 . . . )
U(a) = -~12 + ia 21 1- ia 22
(
Y (c) =
(1 - ia l l )el O. . . )
W(c) = 0 (1 - ia 22 )e2
(
then the Jacobian (oc/oa) is easily calculated. Since oO;/oaj k = bijbj k the first
478 The Unitary Group U
N
18 .13
GC12 I cia 12
I GC21I Ga 12
Thus, finally, the weight factor is given by
N
= In
;<j
( f,j- CjW = Itll
2
We now consider in more detail the special case of SU 2 which was first
introduced in section 10.1 to describe isospin. In that section we argued briefly
that SU 2 had three infinitesimal operators with the same commutation
relations as those of :!It 3 and hence we were able to take over all the
representation theory developed in chapter 7 for :!It 3 ' Here we describe more
carefully the relation between the parameters of the SU 2 matrices and the
rotation parameters. The fact that this relation is two-to-one rather than one-
to-one, a homomorphism rather than an isomorphism, throws light on the
double-valued representations of :!It3 ' The use of SU 2 also suggests a new
(Cayley - Klein) set of parameters to describe rotations and these are most
convenient in deducing the parameters of a product of two rotations.
describe the general group element of U 2 ' It is convenient to write the general
element as
(18.31)
°
with the one remaining condition IXIX* + PP* = 1.
The set of matrices U with ¢ = and hence with determinant = + 1 from a
subgroup of U 2 which is called the ' unimod ular' or 'specia l unitary' group and
is denoted by SU 2 ' In fact we may write U 2 = U 1 X SU 2 as the product of SU 2
with the Abelian, one-parameter group U 1 which multiplies by the phase
factor exp(i¢) and is seen to be isomorphic with :fl 2 . It will therefore be
sufficient to study the group SU 2 which has only three parameters. Henceforth
we shall consider only SU 2 with elements
(18.32)
and the condition IXIX* + PP* = 1. We now introduce three convenient real
parameters in place of the two complex numbers IX, Pwith one real condition.
The notation that we use may seem rather strange but it is chosen to simplify
the discussion in subsection 18.13.2 of the connection between SU 2 and :fl 3 •
Without loss of generality we may write
°
with k = (k" k y, k z ) any unit vector and ~ a ~ 2n, see problem 18.8. The
three real parameters are a and the two angles required to fix the unit vector k .
As a further simplification we define ax = ak; ay = ak; and az = ak, and then
the matrix U is determined by the three real parameters aq which lie in the
range L: a: ~ 4n 2
, i.e. the vector a = (ax, ay, az ) lies inside a sphere of
q
radius 2n.
480 The Unitary Group UN 18.13.2
so that the three infinite simal operators are simply - iSq in the notation of
equation (8.15) where Sq are the familiar spin matrices for s = t.
We therefore conclude th at the commutation relations of the infinitesimal
operators of SU 2 are the same as those for the group 9f 3 since the spin matrices
are infinitesimal operators of 9f 3 in the representation D(!). Because the
irreducible representations of 9f 3 were deduced in section 7.4 from these
commutation relations, it follows that the irreducible representations of SU 2
will be the same as those of 9f 3' However we shall see in subsectio n 18.13.3 that
whereas the half-integer representations of 9f 3 were double-valued they will be
single-valued representations of S U 2 ' Indeed, the representation D (!) is
identica l with the group elements themselves, the 2 x 2 matrices (18.32).
The fact that two groups like 9f 3 and SU 2 have the same commutation
relat ions for their infinitesimal operators, the same set ofstruc ture constants in
the language of section 7.2, doe s not mean that the two groups are necessarily
isomorphic (one-to-one co rrespo ndence between group elements). To study
this point it is necessar y to go beyond the infinitesimal operations to the group
elements for finite va lues of the par ameters. Consider first the group 9f 3 • For
rotations about th e z-axis we have seen in section 7.3 and equation (7.6) that the
finite rot ation is related to th e infinite simal operator by a n exponential
Rz(a) = exp(aX z )' In thi s relation there is nothing specia l about the z-axis so
that for a rotation RI< (a) about an arbitrary axis k we may also write
X.. =
1.(01 01)'
-I' X, =
1(01
I
The matrix ( 18.37) illustrates clearly that th e hal f-int eger represent ations a re
not co nti nuo us represent at ion s of fA 3 . As one rot at es thro ugh an angle a = 2n
the physical system returns to its sta rting po int but th e mat rix ( 18.37) is
cha nged by a facto r - I. In th e cove ring gro up as a increases to 4n the matrix
( 18.37) does return to its o rig inal value and co nti nuity is resto red. Thus th e
half-integer repr esent at ion s a re co ntin uo us as representa tio ns of SU 2.
(sin tc) m = (sin t a cos t b)k + (sin tb cos t a)1+ (sin t a sin t b)k /\ I
T he fo ur real num bers cos ta, k, sin t a, k; sin t a and k, sin t a a re sometimes
ca lled the Cay ley - Klein par am eters of th e rot a tion R,,(a). T he formulae
( 18.38) ar e the eq uivalent for the group :Jl3 of th e relation c = a + b fo r
co mbining ro tations in 3f 2 .
Th e for m (18.36) may be used to ded uce the a ngle and axis of th e pr oduct of
two rotat ion s. Su ppo se Rm(c ) = R 1 ( a )R, ( b ), then from equa tion (18.36) we
ha ve
D l!)(m, c) = (cos ta +2k . Xsinta)(costb+2 I.Xsintb)
= (cos t a co s tb - k .l sin ta sin tb)
+ 2 {(sin ta cos t b)k + (sin t b cos ta) 1+ ( sin ta sin t b)k /\ I }. X
Eq ua ting thi s to (cos tc + 2m .X sin tc) we deduce the angle c and ax is m from
the two equations
Let us introd uce th e coord ina tes ~, '1 of an a rbitra ry vecto r e in the two-
d imen sional space of th e group $U 2 so that fro m eq ua tio n (3.38) th e
tra nsforma tion of functio ns is given by th e eq ua tion
Ii = /3 * ~ + rx'l (18.39)
The Unitary Group UN 483
from equations (3.39) and (18.32). Consider the transformation of the (2j + 1)
functions defined by
(18.40)
Since (18.39) is a linear transformation it is clear that this set of functions wiIl
form an invariant space and thus provide a representation of SU 2 with
dimension (2j + 1). For the particular unitary transformation corresponding to
a rotation of angle a about the z-axis, the parameters a and f3 are given by
a = exp( - ! ia)and f3 = O. In this case the tran sformation (18.39) is very simple
and the character of the representation provided by the set offunct ions (18.40)
is given by
Comparison with section 7.4.3 shows thi s to be just the character of the
irreducible representation O(jl of 9i 3 ' Hence we conclude that the set (18.40)
provides a basis for the representation O (j) of S U 2 ' In particular, the three
functions e,
~ 17 and r]2 provide a basis for the vector representation 0(11 .
Bibliography
Hecht, K. T. (1973). Symm etry in Nucl ei: Annual reviews of nuclear science
(Annual Reviews Inc., Palo Alto , California)
Much of the early work on the use of the unitary groups and their
subgroups in spectroscopy was done by Racah and some of this is described
In
Problems
18.1 Construct product functions belonging to the representations [3], [21] and
[III] of U 3 , as suggested in example (4) of section 18.2.
18.2 Use the method of section 18.4 to label the basis vectors of the following
representations: (a) [3] of U J • (b) [21] of U 4 .
18.3 Reduce the direct product [21] x [2] of U J . How does this result simplify if,
484 The Unitary Group UN
instead of U 3, one is using (a) the group SU 3 (use the notation (i. ,~)), '(b) the
group SU 2?
18.4 Deduce the weights of the basis vectors in part (a) of problem 2 and hence show
that the reduction (18.24) for this case is U[3] = 0(3) -+ 0(1).
18.5 Find the L-values for the totally antisymmetric states of two and three d-
particles. (There is one antisymmetric state for each combination of different m-
values of the particles. Hence list the number of independent states for each value
of M = L m and so deduce the possible values of L.)
18.6 Reduce the products [1] x [1], [1] x [11] and [1] x [2] for the group U 5 '
Hence, knowing the reduction of product representations (7.44) of ,Jf3 and the
results of the previous problem, deduce by subtraction the L -values of the [2],
[3] and [21] representations of two and three d-particles.
18,7 Follow the method of problems 5 and 6 to list the possible combinations of J and
T for three nucleons in a j = ~ shel l.
18.8 Show that there is a one-to-one correspondence between the parameters k and a
of equation (18.33), with 0 < a < 2rr and k a unit vector, and the two complex
parameters !x, # with the condition 'l'l* + P#* = I.
19
Two Familiar 'Accidental'
Degeneracies-the Oscillator
and Coulomb Potentials
For convenience, let us measure energies in un its of hco and distances in units
of (h/ M w)i which we call the oscillator 'length parameter' and denote by b. In
these units, the Schrodinger equation becomes-a - V 2 + r 2)1jJ = EIjJ and the
Hamiltonian is
(19.1)
N 2 3
N, 100 20 0 1 10 3 002 2 1 100 1
N). 010 020 1 0 1 030 1 0 202 1 1
N, 001 0020 1 1 003 0 1 021 2 1
2,0 3,1
expected. They must therefore be the sum of several (21 + 1)-fold degeneracies
and the appropriate values of 1are shown in the last row of table 19.1. To see
how these I-values are deduced we notice that the three operators aJ transform
like the vector representation 0(1) of the group 91 3, Thus, since 1/10 is
sphericaIly symmetric, the excited states (19.6) with fixed N will transform like
the symmetrised product 0(1) x O(l) X • • • X O(l) with N factors, as defined in
appendix 3.1. Only the symmetrised product may occur because the a:
commute with each other. There is no scope for any antisymmetry with only a
single particle. The formulae of appendix 3,1 for the character of symmetrised
488 'Accidental' Degeneracies 19 .1
products together with the standard formula (7.42) for the gt3 characters give
the reduction of this product and hence the resulting values of I, as shown in
the last row of table 19.1. In fact one may deduce generally that I = N,
N - 2, . . ., 1 or 0 depending on whether N is odd or even, see problem 19.2.
Having established in this explicit way that the three-dimensional harmonic
oscillator has a greater degeneracy than expected on the basis of spherical
symmetry we now return to the problem of finding the larger symmetry group.
From the commutation relations (19.4) one may verify directly that , for any q
and q', [a; a q., H] = O. This result may al so be seen from the fact that, whereas
a q reduces the eigenvalue by one unit, a; increases it by the same amount so
that the product aqa q. cannot change the eigenvalue but merel y transforms
within the vector space of each set of degenerate eigenfunctions. Thus we have
found a set of nine operators a;a q. which all commute with the Hamiltonian H
and from these may be constructed a set of unitary symmetry operators
U = exp ( I . cqq.a;aq, )
a,«
provided that the coefficients C qq' satisfy the skew-Hermitian condition
c:q' = - cq' q' This condition leaves the freedom of nine real numbers in the
coefficients Cqq' and so we ha ve a nine-parameter set of unitary transformations
which commute with the Hamiltonian H. To confirm that this is in fact the
group U 3 we consider its effect on the set of three operators a;, finding
where C is the matrix Cqq" To obtain this result we use the operator identity
e A Be - A = B + [ A, B] + i! [A, [ A, B] ] + tr [ A, [ A, [A, B] ] ] +
together with the relation
t t
[aqaq.,a s] = oq'sa q
which follows from equation (19.3). The significance of the result (19.7) is that,
for skew-Hermitian C, it is the most general unitary transformation in the
three-dimensional space of the operators a; . Thus the operators U are in one-
to-one correspondence with the matrices exp C of the group U 3' In this sense,
the matrix of the operator a;a q. is given by (a;a q. ),s = Oq·sOq/.
The infinitesimal operators are the nine independent skew-Hermitian
combinations of the operators a;a q ' , corresponding to the nine skew-
Hermitian 3 x 3 matrices described in section 11.4. For some purposes it is
convenient to take linear combinations which are irreducible tensor operators
with respect to rotations, see subsection 7.4.6. Since both a; and a q transform
like a vector, i.e. the representation 0 (1 ), the set of nine products a;a q'
transforms like the product representation
O(1)x 0(1) = 0(0) -i- 0(1) -i- 0(2)
using the familiar rule (7.44) for its reduction. The invariant 0(0) is essentially
19.1 'Accidental' Degeneracies 489
(19.8)
Since H has this larger symmetry group U 3 one expects each eigenfunction
to be labelled by an irr ed ucible representation of U 3 ' We first note that
U 3 = U 1 X SU 3 following the argument given in section 10.4. In the present
problem the infinitesimal operator for U 1 is simpl y the invariant
~ a:a q =H- ~ and the eight remaining traceless operators L, and Q~2)
q
describe SU 3 ' Thus the U 1 representation is labelled by N and it remains to
identify the SU 3 representation corresponding to each energy level. Recalling
the study of SU 3 operators in section 11.6 it is seen from the matrices (11.2)
that we may associate the operators !(a:a x+a:.a y -2a:a z ) with Yand
-!(a;a, - a}:a y) with T, so that we may relate the numbers N " Nya nd N z used
above to the numbers Yand M T used in chapter II by the equations
(19.9)
The diagrams in figure 11.6 then show immediately that the three eigenfun-
ctions with N = I belong to the representation (10), those with N = 2 belong
to (20) and those with N = 3 belong to (30). As an illustration, in figure 19.1 we
reproduce from figure 11.6 the diagram for D( 3 0) showing the values of
(N,N ).N z) corresponding to each lattice point. Because the products (19.6) are
symmetric in the N factors a: and thus belong to the U 3 representation [N],
see section 18.7, it follow s generally that the set of eigenfunctions with energy
( N + ~) belongs to the irreducible representat ion ( NO).
The question of which values of angular momentum I occur for given N is,
group theoretically, the question of restricting from the group SU 3 to its
subgroup rJt 3' We stress that this is a different process from the restriction
from SU 3 to SU 2 described in section 11.5. The infinitesimal operators of the
490 'Accidental' Degeneracies 19.1
x\
\'020' (I~I)
120,,1
/X- M
r = '12 (N, -Ny)
\/
(012) (102)
X
(003)
Figure 19.1
physical rotations flt 3 are the operators L q , see equation (.19.7), while the
operators for the SU 2 subgroup in the xy-plane would be
(19.10)
N -4, . ..,1 orO for the representation (NO), which is the familiar result for the
I-values in an oscillator energy level.
are the same as those for the single-particle operators since operators for
different particles commute. Hence they describe the group U 3 of simul-
taneous U 3 transformations on all particles, which contains the group of
simultaneous rotations, described by the total angular momentum L, as a
subgroup. As with the single-particle operators, the nine U 3 operators may be
taken as H = 1: H(i), where H(i) is the oscillator Hamiltonian for particle i,
i
L, = 1: Lq(i) the total angular momentum and the five components of the
I
the perturbing potential 1: V(r j j ) commutes with these nine operators then
i <i
the eigenfunctions of the system are labelled by irreducible representations of
U 3, in other words by N which determines the unperturbed oscillator energy
and a label (I.Jl) of SU 3 ' We shall not go into detail regarding the possible (I.Jl)
which appear with each N, except to remark that the procedure is similar to the
problem of deducing the possible values of L in an atomic configuration which
we described in subsection 8.6.4. In such a many-particle system, one no longer
has only the simple representations with Jl = 0 which arose for a single particle.
492 'Accidental' Degenerac ies 19 .3
r. j i, i
(19.12)
Since the eigenvalues of C 2 are known, see equation (11.21), and the
eigenvalues of (L. L) are simply L(L + 1), it follows that the eigenvalues of the
quadrupole term (Q(2) .Q(2)) may be deduced. The two-body part of this is
sometimes called the 'quadrupole force' and has been used as a model for
understanding the development of deformations and rotational motion in
nuclei-see Harvey (1968) in the bibliography.
The interesting general aspect of this use of SV 3 is the way in which the
symmetry is extended from the single-particle to the many-body system .
Ifwe measure length in units of 1i 2 / Me 2 (the Bohr radius) and energy in units
of e4 / 21i 2 (= Rh, where R is the Rydberg constant), then the Hamiltonian
takes the form
H= -(V 2+2 /r) (19.14)
and the Schrodinger equation is given as usual by H ljJ = EljJ , with E the energy
and ljJ (r) the wave function . It is well known that the energy levels are labelled
by an integer n = I, 2, 3, .. ., with energy - n - 2 and th at for given n the
angular momentum takes the values 1=0, I, 2, . . ., (n - I). Thus there is a
greater degeneracy than one would expect from the obvious spherical
symmetry.
Because it is spherically symmetric, the Hamiltonian will again commute
with the three infinitesimal operators L, of ;Ji 3 ' Although it is by no means
obvious, some detailed but straightforward algebra shows that H also
commutes with the three components of the vector operator
A = t:(V f\ (r f\ V))-((r f\ V) f\ V): + r /r (19.15)
so that
(19.17a)
494 .Accidental' Degeneracies 19 .4.1
The use of - H rather than H in the defin ition of A' will be convenient in
considering bound states, for which the eigenvalues of H are negative. In thi s
case A' is Hermitian but it become s skew-Hermitian for positi ve energies. Thus
for negati ve energie s the operators L and A' define unitar y transformations
ex p { -i(a . L ) -i (c. A') } where a and c are real vector parameters. The six
infinitesimal operators are - iL a nd - iA' . The minus sign here ensures that
the rotation parameters a are the same as those defined in sub section 7.4.1. We
sha ll now identify the commutation relations (19.16) and (19.17a) as those for
the group 9i 4 •
be written
(19.18)
Having found the symmetry group it now follows from the general theory
that the energy levels may be classified by irreducible representations. We
therefore face the task of finding the po ssible irreducible representations of the
group 9t 4 but, fortunately, thi s may be done very quickly by making use of a
(local) isomorphism between the groups ~ 4 and ~ 3 X Jf 3' To understand th is
we define new infinitesimal operators
F = t(X +Z ) and G = t (X -Z)
which is equ ivalent to the use of (a + c ) and (a - c ) as parameters in equation
(19.18). Then, from equation (19.19),
(19.20)
with all components of F commuting with all components of G . Thus the
operators F and G separately sa tisfy the sa me commutat ion relat ions as X
and therefore de scribe ~ 3 groups. Since the y commute with each other it
follows that together they describe the product group iJt3 X 9f 3 and hence we
have established an isomorphism for the infinitesimals of 9f 4 and 9f 3 X 9i 3 . We
may therefore use the known irreducible representations O UI of 9i 3 withj = 0,
t .. .,
t , I, to conclude th at the irreducible representations of 9f'4 are labelled
by a pair j Ij 2 of such indice s (see sectio n 4.21 for representations of a product
group). The irreducible represent ations of 9i 4 are therefore denoted by O U,j , ),
where j I and i, refer to the two &3 groups. The o perato rs (F. F) and (G. G)
provide two Casimir operator s with va lues - jI Ut +l) and -j2U2+1 ),
respectively (see section 7.5), and the dimension of O U,j , ) is (2j I + I) (2j2 + 1).
In studying the osc illat or in section 19.1 we saw that only some of the
representations of SU 3 co uld be realised with only a single particle and a
similar situation occurs with the Coulomb problem. The restriction in this case
496 'A ccidental' Degeneracies
comes about because (A' . L) = (L . A') = 0, which follows directly from the
definitions of L and A', using some detailed algebra. Thus if we construct the
symmetry operators -1 i( L ± A' ) corresponding to F and G it is clear that
both Casimir operators reduce to the sa me expression - !{(L . L) + (A' . A') }.
Hence the only possible representations are those with i, = l z namely' O W)
t
withj = 0, I, ~ ; . ... Som e more algebra shows that the Ca sim ir operators
simplify further, giving
(F. F ) = (G . G ) = - H(L. L ) + (A ' . A' )} = !(l + H - 1)
In a representation O Uil thi s Casimir operator has the value - jU + I) so that
the corresponding value of the energy is given by
E= <H>= -(2j+ 1) - 2 (19.21)
with a degeneracy of (2j + 1)2. In the standard theory of the hydrogen atom it
is of course conventional to use an integer n = (2j + 1) to label the energies so
thatj = 0, t 1, t ..., corresponds to n = 1,2,3,4, . .., and the energy formula
(19.21) reduces to the more famil iar expre ssion E = - n - 2 in th e energy units
introd uced at the beginning of sectio n 19.4.
To deduce which angular momenta I occur at each energy involves the
reduction
OW) = L c/O(/) (19.22)
I
of the irreducible represent at ion O W ) of fJi4 follow ing the restriction to the
subgroup yt 3 of physical rot ati on s. Notice that it is the opera to r X = F + G
which describes the se rotat ion s. The problem of deducing the non-zero
coefficients in the series (19.22) is now exactl y the same as the vector coupling
of two angular momenta I = j + j which was described in subsection 7.4.4.
Thus from equation (7.44) we conclude that I = 0, 1, 2, . .., 2j( = n - I), which
is the familiar result for the hydrogen at om .
For the positive energy states we simply remark that the Lorentz group has
no finite-dimensional uriitary irreducible representations, see subsectio n
15.2.5, and this is con sistent with the well-known continuum of states which
occurs for positive energy.
The technique described in section 19.2 for extending the use of the group
U 3 to a system of particles may also be applied to a system of particles moving
in a Coulomb potential with an 91 4 symmetric interaction. However, this does
not seem to have much relevan ce to any recognised phy sical problem.
Bibliography
Problems
19.1 Show that the co nditio ns aqt/lo= 0, with a q defined by equation (19.1), lead to the
solution t/lo = A exp( - 1r2 ).
19.2 Justify the result quoted in section 19.1. that on restriction to the subgroup 9i 3 '
the totally symmetric representation [N] of SU 3 reduces into a sum of
representations Dill , where I= N , N - 2, N - 4, . . .. 1 or O. (Follow the
technique of problem 18.5 by taking symmetric product s of N factors, each of
which has rn = L 0 or - 1. Then show th at there is only one product with
Lorn = N or N - 1, that there are two products with Lorn = N - 2, a nd N - 3.
three with N - 4 and N - 5, etc .)
20
A Miscellany
Hitherto we have been concerned exclusively with symmetry groups f§, which
by definition leave the Hamiltonian invariant. We now investigate some
groups which do not have this property but which nevertheless are of some
interest. They are often called non-invariance or dynamical groups. Their
significance is that they bring together into a single representation eigenfun-
ctions corresponding to different energies, so that the spectrum and transition
probabilities may be expressed in terms of the group operators. Hence these
498
20 .1 A Miscellany 499
d ynamical quantities may be deduced from the group theory. The application
of these ideas ha s so far been limited to simple systems.
As an illustration, we consider the one-d imensional harmonic oscillator. In
the notation of section 19.1, th e Hamilton ian may be wr itten
(20.1)
a nd the energies are well known to be given by (n + i) in the usual units of hco,
where n = 0, 1,2, . . . . We now construct t he three Hermitian operators
I 2 t2
KI = 4(a +a ), (20.2)
It may now be seen th at these are just the commutation relations of a Lorentz
group z" J in three dimensions. Such a group ma y be constructed by removing
o ne of the space d imensions (r) from the usual Lorentz group discussed in
section 15.2. Its infinitesimal operators are then X, ; Y, and Y)" and their
co m mutatio n relations a re given in equations (15.32),
These agree with equat ion s (20.3 ) if we ass oc iate K l ' K 2 , K J with iY" iYy and
iX: . Again, the factors o f i a re necessary here because the infinitesima l
operators for the unitary transformat ions generated by the Hermitian
o pera to rs (20.2) mu st includ e a fact or i, see sectio n 7.2. Hence the non-
invariance group defined by th e operators (20.2) has the same commutation
relations as .!£ J and is therefore (locally) isomorphic with it. We shall not
pursue the question of the global transformations, i.e. the question of the range
of the group parameters. The infin itesimal properties are sufficient to
determine the representations if one is not concerned with questions of many-
valued ness.
We now study the unitary irreducible representations of Y J following the
procedure used in subsection 7.4.2 for 9t J . First choose a basis in which the
operator K J , and hence the energy, is diagonal. We write K J cPn = !-(n + !-)cPn,
where for convenience the eigen values are expressed in the familiar form of the
oscillator energies. There is no loss of generality here since, in this general
discussion of the representations of 2' J , the number n is, as yet, quite arbitrary
and is not restricted to be an integer. The remaining group operators K 1 and
K 2 then couple eigenfunctions with different energy. In fact, if we define
K ± = K. ±iK 2 then
(20.5)
500 A Miscellany 20 .1
showing that K + are raising and lowering operators in the energy. In detail, if
HcPn = (n +!)cP: then
H(K ±cPn) = 2K 3K±cPn = 2K±(K 3 ±1)cPn = (n+t±2)(K±cPn) (20.6)
showing that the operators K + raise and lower the energy by two units of hs».
The Casimir operator for z' 3~ corresponding to J2 for the group Yf3 , is no-w
given by
takes the form IA n l 2 < O. Thus we conclude that there are no finite dimensional
unitary irreducible representations of !f:' 3 except for the trivial identity
representation. (The same conclusion was reached in subsection 15.2.4. for the
group !e.) We therefore explore the possibility of infinite dimensional
representations. This amounts to releasing the constraints that the possible
values of n in the representation are bounded above and below.
Let us first release only the upper bound so that we assume the existence of a
least n = no which implies that
(20.9)
This value is then valid for all basis vectors cPn in the representation. Starting
with cPn o we now use the raising operator K + to generate the set of cPn, where
n = no + 2t with t an integer. (Remember from equation (20.5) that K +
increases the value of K 3 by I and hence the value of n by 2.) To ensure that the
representation is unitary, the infinitesimal operators K l' K 2 and K 3 must have
Hermitian matrices. This implies that n, and therefore no, must be real. It also
implies that K : = K _ and so from equation (20.8)
IA l- 2 = (cPn,K_K + cPn) =
n (cPn ,(C +K5 +K 3)cPn)
= 116{O +2no)(3-2no)+(1 +2n)(5+2n)}
= t(n + no + l)(n - no + 2) (20.11)
20 .1 A Miscellany 501
The Jahn- Teller effect is concerned with symmetries which occur following an
approximate separation of the Hamiltonian. Because the separation is only
approximate there are terms which break the symmetry. The word 'spon-
taneous' is sometimes used to distinguish this effect from the symmetry
breaking caused by the addition of an external perturbation which we
discussed in section 5.8. Although the idea is a general one, it is in the structure
of molecules that the Jahn- Teller effect was first described and in which it is
most clearly seen. More recently the same idea has been used in the theory of
elementary particles. To help visualisation we shall present the theory in the
language of molecules but its generality will be apparent. Brief mention of the
application to elementary particles will be made at the end of the section.
In the early parts of this book we limited our study of the theory of groups to
those properties which were vital in understanding the physical applications.
There are many fascinating properties, even offinite groups, which we have not
mentioned. Here we introduce the idea of the normal subgroup and describe
its use in finding representations. In fact this provides a general background
for the introduction oflittle groups which we have used in an ad hoc fashion in
obtaining represent ations of the space groups in chapter 14 and the Euclidean
and Poincare groups in chapter 15.
For the direct product of two groups defined in section 2.5 the characters
and representations are obtained trivially from those of the two separate
groups, as described in section 4.21. Remember that in the direct product
'§ = Yf x X the elements of Yf must commute with those of X . We now
define what is meant by saying that a group '§ is a 'semi-direct product', written
'§ = Yf /\ X . The elements of '§ must be the set of products G j a = HjK a,
where Hi and K, denote, respectively, elements of two subgroups Yf and X
and , although the H, need not commute with the K a , the relati on
(20.18)
where :?7 3 and :?7 4 denote the translation groups in three-space and space-
time, respectively.
Let us now see how a knowledge of thi s semi-direct product structure
rs = Yf /\ X of a group can help us to find its irreducible representations. For
simplicity we limit our discussion to the case when the normal subgroup .Yf is
Abelian (the generalisat ion is straightforward) so that its irreducible rep re-
sentations are one-dimensional denoted by T(;·). Since .Yf is a subgroup of rs
the basis vectors of an irreducible representation T of rs may be chosen to
belong to definite irreducible representations of .Yf so that, denoting these
II.
vectors by >we have, for any H in Yf ,
I >= T(/.)(H) IA. >
T(H) I. (20.19)
We now choose a particular II. >and use the elements K, of X to generate
from it the set of vectors denoted by
(20.20)
Each of these new vectors must itself belong to one of the representations of J't'
because from equations (20.18), (20.19) and (20.20)
I
T(H) I.a >= T(H)T(K a) I)· >= T(Ka)T(H ') II. >= T(/.)(H') II.a >
(20.21)
where H' = K,' I HK a. However, notice that the representat ion of ,Y{' defined
by this equation is not necessarily T(/.) since it associates with the group
element H the matrix element T(/.)(H') for a different element H'. Let us
therefore denote this representation of.Yf by T(;,.). Consider first the simplest
possibility, that the set of representations TV.), as G, runs through the elements
of X , are all inequivalent. In this case the vectors II.a > must be linearly
independent and provide a basis for a k-dimensional irreducible repre-
sentation of rs. The explicit form for the representation is quickly found by
operating on I).a >with the general group element G = HK,
I I
T(HK) I.a >= T(H)T(K)T(K a) I. > = T(H)T(K b) I. > I
= T(Kb)T(H ') I)· > = T(/.)(H') I).b > (20.22)
set so that
T().)(K; I HK a ) = ST().)(H)S-I and T(A)(K; I HK b) = RT().)(H)R- 1
where again H' = K; I HK b. Thus the multiplication law is satisfied. The set of
vectors II.a > which transform equivalently to a given A. must therefore form an
invariant space under the little group .% and so we may choose them to
transform according to definite irreducible representations "Pa ) of X-. Thus the
basis vectors will carry both labels ;. and rx and also a row label i if Tla) has
dimension greater than one. To generate the irreducible representations of ~
we start from one such set of vectors II.rxi > with fixed A. and,rx. It is a general
property of any group X of order k that given a subgroup X of order k any
element 0[,)( may be written in the form K = MeK, where Me is one ofa set of
klk elements of.)( (this is Lagrange's theorem, see problem 2.4). Because of
this result the representation T of ~ may be generated by operating on the
vectors IA.rxi> with the set of k/k elements M" T(Me)IArxi > = IArxi, c). The
representation is labelled T().· 0) and has dimension saklk with its matrix
elements being found as in the derivation of equation (20.22), i.e.
where the new symbols K', H', M d, K' and d are defined by the equations
K' = KM e, H' = K,-I HK', K' = MdK '. Thus the representation is given in
t~rms of the matrix elements of the representations of the subgroups .Jft' and
X.
The question arises whether each choice of ;. leads to a different irreducible
representation of ~ and the answer is generally 'no', because if some of the TV..
are not equivalent to T(i·)then the same representation of ~ would have been
obtained by starting from any of these TV.·) instead of TW. The set of
representations T(A.) are said to be in the same 'orbit'. In constructing all
irreducible representations of ~ one need only choose one I. from each orbit.
(The concept of orbit is closely related to that of the 'star of k' in subsection
14.9.1, since k corresponds to A.. Representations T(k,) of the translation group,
where k , is in the star of k, will be in the same orbit.)
As a simple example of this procedure let us find the representations of the
510 A Miscellany 20.4
A systematic listing of the finite point groups was given in chapter 9 but for the
continuous Lie groups introduced in chapter 7, we have so far considered only
those examples which have occurred in particular physical problems. The
object of this section is to find a systematic listing of Lie groups so that in any
new symmetry problem one is in a position to recognise the group responsible.
We begin by defining a few new terms. Ifa group has no invariant subgroup
it is calIed 'simple'. If it has no Abelian invariant subgroup it is calIed 'semi-
simple', so that the simple groups form a subset of the semi-simple groups. The
word 'non-semi-simple' is sometimes used for groups which have an Abelian
invariant subgroup. As we have seen in the preceding section the non-semi-
simple groups may be described as semi-direct products so we restrict our
attention to semi-simple groups. Furthermore it may be shown that if a Lie
group is semi-simple but not simple then it may be written as a direct product
of simple groups. Thus we shalI, later on, restrict our attention further to the
classification of the simple Lie groups only .
The discussion starts from the set of r infinitesimal operators X, together
with their commutations relations, see section 7.2,
(20.24)
20 .4 A Miscellany 511
(A set of operators with this law of 'multiplica tio n' is often called a Lie
'a lgebra") If the matrix
is formed from the structure constants e~p then the (Cartan) criterion for the
group to be semi-simple is that det 9 #- O. (It is not difficult to deduce this
criterion if we notice that the existence of an Abelian invariant subgroup
implies that a subset Xi' of the Xi satisfy the relations
[Hi' [E.., Ell] ] = [[ Hi' E..] , Ell] + [ E.., [Hi' Ell] ] = (ai + Pi)[ E.., Ell]
so that the commutator [E.., Ep] is itself an eigenoperator of the Hi with root
vector ex + p. Thus if a + fJ i= 0 we may write
[E.., Ell] = N..6 E" +6 (20.27)
with some normalisation constant which depends on the definition of the E...
When a + fJ = 0 the commutator [E.., Ell] commutes with the Hi and must
512 A Miscellany 20 .4
(20.28)
defining the constants !Xi which again depend on the choice of Hi and the
normalisation of E~ . In fact we can conveniently make ai = ai • (To j ustify this
step one first chooses the Hi such that gij = Dij by using a Schmidt process
because g ik = 1:
, ai~ has the form of a scalar product. Next, one defines
Cpqs = ~ gPtC~s and shows it to be antisymmetric in its three indices. Hence we
have
and also
t The symbol s y and~ , like ex and p, represent vectors and should have been printed in bold in
thi s secti on .
20.4 A Miscellany 513
where the scalar product is defined as a..P = Lrx;/3 i . This general result is
deduced from a recurrence relation as follows . 'tr m is an integer ~ n then,
from the Jacobi identity,
r.e.
and so
and if, for brevity, we define Am = N p+meJ .eJN p +m<J+eJ . -eJ then we have
a .p+ma .a = -Am +A m- I (20.30)
Now since, as explained above, P- a. is not a root we have N p_ eJ = 0 so that
A _ I = 0 and from recurrence relation (20.30) we may deduce
- Am = (m + I)a .p +-!m(m + l)« .« (20.31)
If n denotes the termination of the string, as defined above, then N p +neJ,eJ = 0
so that An = 0 and hence from equation (20.31) we deduce the relation (20.29).
From the definition of the scalar product it follows (Schwartz inequality)
that (a..P)2 ~ (a. .a.)(p.P)and hence from equation (20.29), and the correspond-
ing equation p = -2(a .p) j(p .P) for the p-string a+p,a+2p, .. .,a.+pP,
where p is again an integer,
-(a.p) j(a..a.)t(p .P)t = +tl-np) t ~ I (20.32)
This may be said to define the angle between a. and p. From the same two
equations one has the ratio of the lengths
<p'P)/(a..a.) = n/p (20.33)
The values of n (and p) are strictly limited by equation (20.32) which implies
that np = 0, I , 2 or 3. (The value 4 is not allowed since this would mean that
a = - Pcontradicting the definition of a and Pas both positive roots.) Hence
we are able to enumerate the possible sets of simple positive roots and, from
these, all roots are constructed by forming all possible strings. In some cases it
514 A Miscellany 20.4
is possible to build p-strings on roots p + rnex although multiples like rn(p + ex)
with rn > 1 are not allowed.
The enumeration is done very concisely by forming what are known as
' Dynk ln diagrams'. Each simple positive root is represented by a point. For
each pair of such roots the quantity np (which determines the angle between
them) is signified by joining the corresponding points with that number np of
straight lines, dra wn close together. The relat ive lengths of two roots joined in
this way is sometimes denoted by placing the relative values of ex.ex over each
point. In fact these are very limited because if np = 1 then n = p = 1 and from
equation (20 .33) the lengths are equal. If I1p = 2 one is v ' 2 times the other ,
while if np = 3 one is v '3 times the other. The task of drawing all possible
Dynkin diagrams is greatly simplified by noting th e following rule s which
follow from the equations we have derived above:
p- = 1=1p
. I: ajti + ~
j with ~ .cX j =0 for all i, then the coefficients are given by
~ .~ = 1- L af < I - tp
j= 1
~orX
20.4 A Miscellany 515
The arguments in rule (3) may be extended to show that if any number of single
>-<
lines are inserted into these pieces, for example
or
then the same contradiction arises . One therefore concludes that no Dynkin
-<
diagram may contain more than one double-line connection
~ or junction
»:
}II
(Xl (X~~/
~q
_.-~--
There is little more to be said about types (a) and (d). In type (a) all the simple
roots have the same length and the group may be identified with the special
unitary group SU, + i- To see this equivalence one must use the diagram to
construct the commutation relations of the infinitesimal operators and then
compare with the relations appropriate to the group S U/+ I ' see section 18.8.
The rank I may take any integer value. In type (d) one may not add any more
roots since this would imply more than three lines at one of the points. Thus in
type (d) there is only one group, with rank I = 2, which is conventionally called
G2 ' The simple roots have lengths in the ratio J
3: 1.
In type (b) we must ask where the double connection may lie in the chain, i.e.
what are the values for p and q, having chosen p ~ q without loss ofgenerality. As
before, we write
516 A Miscellany 20 .4
p q + 1
Y = I ajlX j + I bjPj + ~
i i
°
j
with a similar result for P/t. However IXj' Y= except for IXp' Y = - so that t
we can deduce Q p = -P/(P+ 1) and similarly bq = - J (2)q/ (q + 1). Using
these numbers in the expression
= 1 +tap+bq + 1/ J 2
reduces the condition ~ . ~ > 0 to the result pq < 2. Thus either q = 0 with any
P or P = q = 1. Diagrams of type (b) may therefore be of the general form
a I a2 a/ _ 2 Y
. . .. .. ,
P~~====o
In the first case there are two different Dynkin diagrams because although the
a i and )I all have the same length (since they are joined by single lines) the ratio
of (length)? of Pand y must be 2. Thus either the root Pis a factor J 2 smaller
than all others or a factor J 2 greater. These two possibilities may be identified
with the odd rotation groups 9i 2/ + 1 and the symplectic groups SP2/'
respectively (see section 18.10 for some applications of these groups). The
particular diagram with p = q = 1 corresponds to a group known as F 4 '
The limitations on the values of p, q and r in diagrams of type (c)a re deduced
by the same procedure, expressing 0 as a sum over the a;, Pi and Yj with eits
projection orthogonal to the space of the aj, P; and y;. In this case again
Q p = - p /(p + I) with corresponding expressions forthe other coefficients and
The groups Jf) and S U 2 are represented, almost trivially , by a single point. The
group 91 4 does not appear in the classification above because it has the same
structure constants as the direct product Jf) x (]f) and is not therefore a
'simple' group. This structure is in fact suggested by trying to put I = 2 in the
Jf 2/ diagram which would lead to two unconnected dots -the diagram
expected for (]f ) x (]f).
To find all roots and hence the commutation relations, given the Dynkin
diagram one must construct the strings described earlier in this section. We
illustrate the procedure with the group S U) described in detail in sections 11.6
and 18.7. The Dynkin diagram is
which implies that I = 2 with two equal roots satisfying Ii.P= - t. Hence from
equation (20.29) the string contains only (X + P and of course there are
also the corresponding three negative roots, making six in all. Since I = 2 in
this case, we can simply draw all the roots in two dimensions and the scalar
product Ii.p = -t implies an angle of 120' giving the picture shown in figure
20.1. Notice that, because of the angle of 120°, the root (X + P has the same
length as (X and p. This picture of course agrees with figure 11.3 for the six
infinitesimal operators T +, U + , V + in the notation of chapter 11. (The relative
scale of the two axes is different in the two figures by the factor! ) 3 discussed
in section 11.6.) Finally, to deduce the commutation relations (20.26)-(20.28)
we must choose a basis and hence define HI and H 2 • We shall take axis 2 along
(X with axis 1 perpendicular to it, since this is consistent with (X and P being
518 A Miscellany 20.4
a+{J
-a a
-{J
Figure 20.\
simple positive roots. With the vectors taken of unit length this gives
[H 2 , E,,] = E", [H 2 , EfI] = -1EfI , [H 2,E<l+fI] = 1E<l+fI
[HI>E,,] = 0, [H1,EfI] = 1(3)!E fI , [H1 ,E<l+fI] = 1(3)!E<l+fI
together with
[E", E_,,] = H 2 ,
Figure 20.2
x' -axis (and hence automatically the y' -axis) and this is done by giving the angle
y between the planes O zz' and O z'x', as shown in the figure. This arbitrary
rotation is con veniently expressed as the product of three simple rotations
about the original axes. We first note that if y = 0 the rotation is achieved by
the product R.( ex )R}"(pj. Here the fJ-rotation carries the x-axis to a latitude of
fJ-south while the ex-ro ta tion gives it a longitude of ex, in ter rest rial notation.
(The order of the se rotat ions is important since otherwise the fJ-rotat ion
would have to be about the axis obtained from O y by the rotation Rz(ex). For
the general case y #- O we mu st first carry out a v-rotatio n about the z-axis
giving the general rotation as
(20.36)
(We could equally well have used afinal rotation of y about the z'-axis but we
sha ll soon see the advantage of having all rotations in (20.36) relating to the
same, original set of axes.)
From the factorisation (20.36) the representation operator (20.35) becomes
T( R) = exp( - icxJ z ) exp( - if3J}.) exp( - iyJ z )
and we recall the definition of the rotation matrix, using the notation Jjm for >
the basis vectors
D~!m(ex, 13, y) = ( j m' IT(R) Urn >
Hence , using the fact that J z is diagonal in this basis , we have the immediate
simplification
D~!m (ex. 13, y) = exp[ - i(ex m' + ym)] G m' Iexp( - if3J},) Ij m > (20.37)
20.5 A Miscellany 521
Jy = 1(0
2 i '-Oi)
so that J; = 1and, as in subsection 18.13.3, the series for exp ( - i{3J y) simplifies
2
exp( - ifJJy) = (1 -lfJ + . ..) - iJ y(fJ - z'4fJ3 + .. .) = cos tfJ - 2iJ y sin tfJ
The complete matrix is therefore given as
.
D~!m(tx ,fJ, y)=exp[-i(txm '+ym)] ( . 2j ,
J-m
)-!( )-! . 2j
J-m
where m j denotes the spin projection ± t of particle i and the sum is restricted
by the constraints L mj = m, L mi = m. Because of the symmetry of the
i i
operator and the wave functions we may include only one term on the right
and multiply by the number of terms. Thus
(20.39) is a product of 2j matrix elements of exp( - if3J y ) taken from the DOl
matrix (20.38). For each particle whose spin projection is the same on left and
right there is a factor cos tfl, while for those which differ there is a factor
±sin 1/3 corresponding to the two cases <-I 1+)
and re- <+I 1-),
spectively. The only relevant parameter in the sum over m; is the number x of
particles in the first j + m which have spin + t. Thus, including the binomial
coefficients e:m)and CJ,;;,"'-x) to account for the number of ways of arranging
the spins in this way gives finally
J
. + m; )
x ( - l)j ~ m ' - .' (cos 1P) 2 ., - m - m' (sin tf3f j - 2 ., ~ m + m ' (20.40)
a result which simplifies a little when the explicit forms are inserted for the
binomial coefficients.
Bibliography
Sturge, M. D. (1967). Jahn- Teller effects in solids, in Solid State Physics, vol. 20
(Academic Press, New York) and for an experimental verification of the
effect , see
Gilmore, R. (1974). Lie Groups, Lie Algebras and some of their Applications
(Wiley-Interscience, New York)
Problems
20.1 Show that D 3 is the semi-direct product group C 3 /\ C 2 . Hence deduce the
character table for D3 by following the method of section 20.3.
20.2 Draw the root diagram for the group G2 introduced in section 20.4.
Appendix 3*: Topics
in Representation Theory
The direct product TI~) x TIP) of two representations of a group t§ was defined
in section 4.17 and a simple formula (4.43) was deduced for its character. This
would extend naturally to a product T = T(a) X TIP) X •. • of n terms and in the
present section we are concerned with the special case when all the factors are
equivalent o: = f3 = . . .. To discuss this representation it is helpful to
construct a set of basis vectors from products of functions, as in section 4.17.
Let <Pi with i = 1, 2, . . ., s" denote a set of functions transforming like the
irreducible representation T(aJ so that the set of (s,,)n products <pj(l) <pj(2) . . .
<pdn) provide a basis for T
524
A.3 .1 Appendix 525
The index p in the factors 4>i(P) serves to label the factors and is relevant in the
definition of scalar product, namely
(4)d1)4> i ' (2) . , . 4>k' (n), 4>i (1)4> j(2) , , , 4>dn) )
= (4);-(1), 4>i (1))( 4>/ (2), 4> j(2)) , . , (4)dn), 4>k (n))
= ()j"i()j'j '" (l,.'k
showing that the set of products is orthonormal given that the 4>i are
orthonormal. (In physical applications, p will label the different particles of a
system while i labels the possible states of any of the particles .) It is convenient
to introduce a more concise notation, writing
lij , , , k) == 4>i(l)4>j(2) , . . 4>k(n)
where the index p has been dropped but is inferred by the ordering of the labels
in Iij . . . k ). The scalar product is then written as <i'] ... k'iij ... k) . (This
notation is familiar in quantum mechanics .)
Let us now consider permutations P of the n indices p, defined for example
by
(A.3.1)
The set of all such permutations form the symmetric group Y n described in
chapter 17 and it is clear from equation (A.3.1) that the vector space L of the
products Iij . .. k) will be invariant under the group Y n, In fact, the
permutations commute with the operations of '§ so that L provides a basis for
a representation W of the product group '§ x Y n. Ifwe denote the irreducible
representations of the groups '§, Y nand '§ x Y n by T(Y), VIA) and T(Y) x VIA),
respectively, then the reduction of the representation W may be written as
(A.3.2)
Thus it is possible to choose a basis for L in which the basis vectors are labelled
by y and )., describing their transformation properties with respect to both '§
and Y n. If G a, P and GaP denote arbitrary elements of the groups '§, Y n and
'§ x Y n, then the character equation corresponding to the reduction (A.3.2) is
T I.· = '\'
L... mYI.· T (y)
Y
526 Appendix A.3.1
xJG a ) = L my;.x(Y)(G a)
Y
(A.3.4)
where the sum runs over the n! permutations P of 5I'n . The procedure for
finding X;. will be first to calculate X(GaP) and then to use equation (A.3.4)with
the known X()·) from the character table for the symmetric group. Once x·I . is
found, the integers my). follow from the usual orthogonality of characters of CfJ
(A.3 .5)
L T~)(Ga)Tf}(Ga)'" 71,(G a)
i.], .. ., k
= {L T:~)(GaW = V~)(GaW
(A.3.6)
i
L
i,j• . .., k
nY(Ga)T~~/(Ga) '" Tl~)(Ga)
= L T~)(G;) . . . Tl':!(G a )
i. .. ., k
(A.3.7)
which involves just the irreducible characters again, since G; is simply another
element of CfJ. For the more complicated permutation (g~) of three objects, in
A.3.2 Appendix 527
With these results we are in a position to deduce all the X.(GA a ) for the
examples n = 2 and n = 3. Firstly for n = 2 there are only two group elements
E and P 12 and there are two irreducible representations corresponding to
symmetric or ant isymmetric basis vectors. From the characters given in table
4.5 and equations (A. 3.4) to (A.3 .7) we have
For n = 3 we may take the characters from table 4.2 because of the
isomorphism between !/3 and D3 (see section 2.3). There are three classes,
given by E; P 12 ' P 23 , P 3 1 ; n~n, Og)and the three irreducible representations
are usually described as (I) totally symmetric, (2) totally antisymmetric, and (3)
mixed symmetry (two-dimensional). In this case, we find as before
Notice that there has been no need to specify the group "§ although, finally, to
obtain numbers from these expressions, one must look up the values of the
irreducible characters X(a) of «.
It is possible to obtain a general formula for the characters X(GaP) by using
the result , discussed in section 17.1, that any permutation may be written as a
product of 'cycles' of various lengths. If P contains n t cycles of length 1, n2
cycles oflength 2, and so on , then by a continuation of the method used for the
simple permutations above, one finds that
were deduced from equation (4.45) using the tables of characters. There is an
alternative way of deducing these coefficients if the corresponding coefficients
are known for a subgroup Jf of ~ . Let us denote the irreducible repre-
sentations of Jf by T(liJ, T(P), etc., and suppose that we know the coefficients
m;P in the reduction
T!J) = 'n
L... ~~· T (Ii)
Ii
Combining these three equations we have
(A.3.l3)
and also
(A.3.14)
(A.3.15)
The only unknowns in this set of equations (A.3.l5) are the required
coefficients my and, in many cases, the equations are sufficient to enable us to
deduce them.
To illustrate the method, we use the same example as in section 4.17, the
product T(3) x T(3) for the group D3 , using now the subgroup C3 as described
in section 4.l8. Thus equation (A.3.l3) becomes
T(3)X T (3) = (,12) + ,(3))X (,(2) + ,(3))
= 2,(1) +,(2) +,( 3)
Comparing with the known reductions T (3) = , (2) +,(3), T(1) = T(2) = ,(1)
from section 4.18 we are able to deduce that
A.3 .3 Appendix 529
The methods used in section 4.15 for finding the irreducible characters were
satisfactory for groups where the number of classes was small. In this section
we derive additional relations between the characters which are sufficient to
determine them for any finite group. We first introduce the sum of all group
elements in a class rej which we denote by the same symbol rei and which is
called a 'class operator':
re =j L
o, in -e,
Ga (A.3.l6)
We next show that the product of two such sums must itself contain all the
elements of a given class the same number of times, so that we can write
T.I
(.j
L T(')(G a ) (A.3.21)
G. in 't,
Equation (A.3.19) is now written in the form G, C(jj = rejG, which, in the matrix
530 Appendix A.3 .3
(A.3.22)
where Cj is the number of elements in the class ~i ' Writing (A.3.l7) in matrix
form, we have
(A.3.23)
The first two sections in this appendix contain some useful properties of the
spherical harmonics, which follow directly from our study of the group 1Ji 3 in
chapter 7. The last section discusses group integration generally and then
deduces the necessary weight function for the group integrals in 1Ji 3 •
The Wigner - Eckart theorem may be used to evaluate a useful integral. From
equation (7.53), regarding yOtl(O,
m,
¢) as an operator,
using the fact that IJi3 is simply reducible so that t is not required and also that
the Clebsch-Gordan coefficients C are real. This result may be used to expand
the product 1"'.)
m
1"1,),
m
regarded as a single
.
function, in the form of equation
l l
531
532 Appendix A.4 .2
(7.49). We have
y~: ,) (0, cP) y~:2 ) (0, cP ) = LC(l1/2 1,m lm 2m) (/IIY(/,)11/2 >y~) (O,cP) (A.4.2)
1m
making use of equation (7.50) and (A.4.1). To find the value of the reduced
matrix element we first invert equation (A.4.2) using the orthogonality of the
Clebsch-Gordan coefficients, see section 4.17,
(A.4.3)
Then, since the reduced matrix element (IIIY(/,)11/ 2 >is independent of 0 and
cP, we may insert any convenient values for 0 and cP in equation (A.4.3). By
taking 0 = 0, the spherical harmonics simplify enormously, becoming inde-
pendent of cP,
yl~(O,cP) =
21+
om.O( ~ 1)1Pk(cosO) = (21~
+ 1)1
om.O (A.4.4)
giving
= ((2/ 1 + 1)(2/2 +
4n(21 + 1)
1))tC(I I I OOO)C(I I I m m m)
I 2 , I 2 , I 2 (A.4.6)
f o
lO) = "C(I
L. 10m I -m I O)y(l,)(O
1 I , m. I
cP I )y(/
- m,,) (0 2 cP 2 ) (A.4.7)
ml
A.4.3 Appendix 533
making use of the fact that, for invariance under the subgroup ~ 2 , necessarily
m, + m2 = 0. Notice that althoughf~O) is an invariant under rotations of the
system it will depend on the relative positions of the two particles. In fact we
shall now show that f6°) is a Legendre polynomial of the angle ()'2 between the
radius vectors to the two particles. To show this we note that since fJO) is
°
invariant it is unaffected by a rotation of the system which brings particle 1 on
to the z-axis. In this position, (), = and ()2 = ()'2 so that using equation
(A.4.4) the equation (A.4.7) reduces to
f (0) (2/, +
o =C(l,/10,OOO) ~
1) PI,(cOS(}12)
where we have also used the fact that when m = 0, the spherical harmonic
reduces to a Legendre polynomial. The Clebsch-Gordan coefficients in
equation (A.4.7) have a simple form (see problem 7.14)
C(l,/, O,m, -m, 0) = (-I) /, -m' /(2/, + I)!
giving finally for equation (A.4.7)
This result is known as the spherical harmonic addition theorem and is used
for example in analysing the interaction potential between two electrons into
factors which depend on the coordinates of the separate electrons, see
appendix 5.1.
(A.4.9)
wherefis some function depending on the group element G a , is that the sum is
534 Appendix A.4 .3
f f(a)p(a)da = f f(c)p(a)da
(AA.lO)
a in <;I a in '§
for any fixed b. Here we use the notation a for the set of parameters
ai' a 2 , • • •, a" while da is an abbreviation for the volume element da
= da l da2 .. . dar . Now trivially
a in
f '§
f(a)p(a)da = f
c in '§
f(c)p(c)dc
(AA.12)
but if we use equation (AA.II) for fixed b then c is a function of the parameters
a and b, so that introducing the Jacobian
eC I GCI
- -
GC O'!I Ga2 (AA.13)
Ga
OCr OCr
oa I oar
we may change the integration variable from c to a. Notice that if b is fixed and
c runs over the group then a also runs over the group so that equation (AA.12)
becomes
f
a in '§
f(a)p(a)da = f
Gin ~
f(c)p(c) ~: da
Comparing this with the required equation (AA.10) shows that the weight
function p must satisfy the condition
GC
p(c) oa = p(a) (AA.14)
for all a. Assuming that such a p can be found, we may choose any convenient
value of a for evaluating p(c) and by taking a = 0 we have
v= f p(a)da
a in <§
It is then clear that the integral
~ f
a in '§
f(a)p(a)da (A.4.17)
will take the value un ity iff is the constant functionf(a) = 1. Both definitions
(A.4.16) and (A.4.17) may therefore be regarded as averaging procedures over
the group. It may now be seen that the proofs of chapter.4 are equally valid if
the finite average (A.4.l6) is replaced by the continuous average (A.4.17). Of
course, this assumes that the integral (A.4.17), or group volume, is finite. Such
groups, which include the rotation and unitary groups, are called 'compact'.
Although mo st of the groups of physical interest fall into this category, there
are some which do not, such as the Lorentz and translation groups discussed in
chapter 15.
For the group fit 2 the functional relation between the parameters a. band c
of equation (A.4.11) is simply c = a + b so that trivially ocloa = 1.The volume
is then gnda = 2n, and this justifies the orthogonality relation (7.14) as a
specia l case of equation (4.25).
In the group fit 3 the relation between c, a and b is not so simple and we make
use of the vector representation of these rotations. Let us choose the z-axis in
the direction of the axis of rotation of R(b) which gives to R(b) the simple
matrix
R(b) =
COS b
~nb
-sinb
cosb 0
0)
(
o 1
For R(a) we must then take an arbitrary direction but since the Jacobean is
evaluated as a -> 0 we may treat the rotation angle as small and for its matrix
536 Appendix AA.3
-n
use the unit operator plus the infinitesimal operator from equation (7.24)
(A.4.18)
Having found the matrix of R(c) it remains to deduce the parameters c
from the matrix. The angle c is found by equating the trace
of the matrix R( c) to the known trace (I + 2 cos c) for a rotation through angle
c, giving cos c = cos b - G z sin b which, remembering that G z is small, implies
c = b + Gz • To find the direction of c we use the fact that the vector c will be
unchanged by the rotation R(c),
R(c)c = c (A.4.19)
We must take care in using equation (A.4.19) since the form (A.4.18) for the
matrix of R(c) is good only to first order in a. However from equation (A.4.19)
and the unitarity of the rotations we have also R t(c)c = C so that
{R(c) - R \c)}c = 0, which gives explicitly the three equations
DC
-!-b(l +cosb) /sinb -!-b o
Da
--!-b -!-b(l +cosb) /sinb o =-!-b 2(1+cosb) /sin 2b
o o 1
A.4 ,3 Appendix 537
V = f 2(1 - cos a) d
a
2 a, d a yd a:
a in '§
Transforming to polar coordinates for the vector a, and integrating over angle,
gives
1r"
~l~~)(I-COSa)da (A.4.22)
Using this average to replace the finite sum in equation (4.25a) then gives the
orthogonality relation for characters of 9€ 3,
.~Jar"
t: = 0
XUtlXU' )(1 - cosa)da = {) .
a a }il : (A.4.23)
a result which may be verified from the explicit formula (7.42) for XU). It
should however be noticed that the orthogonality is not valid for the aha lf_
integer representations which are double-valued. Orthogonality for these
double-valued representations may be recovered by extending the range of the
rotation angle to 2n. Over this range the representation is single-valued.
Further discussion of this enlargement of the group 9€ 3 is given in section 7.6
and in section 18.13.3.
If the Euler angles iX , f3 and y are used as parameters for the rotation instead
of aI' a 2 and a3 then it may be shown, see section 20,S,that the weight function
is p( iX, f3, y) = sin f3 so that the average becomes
8n
12 1 r" 1
2
"
.= o Jp= o
2
=/(iX,f3,y)sin f3diXdf3d y
" (A.4.24)
1 1"1
2
-8 12 2" " u )* u) .
Dm!m (ex. ,fJ, y)Dm~m (ex. ,f3, y)smfJdex.dfJdy
n 0 0 0 " , ,
Some detailed calculations of the ordering of atomic energy levels for the
configura tions p2 a nd pJ are given in the first section, to reinforce the more
qu alitative discussion in chapter 8. The second section returns to the theory of
a ngular momentum and the gro up Yi J and introduces the more intricate
problems encountered when more than two angular momenta are coupled
together. Some of these ideas are used in section A.5.3 to calculate atomic
transition strengths. Some more details of the effects of symm etr y in a crystal
field, which was introduced in section 9.9 of the first volume , are given in the
last two sections.
the 'diagonal sum'. A more sophisticated method which requires the know-
ledge of tabulated coefficients is described in section A.S.2. We make use of the
fact that since the Coulomb repul sion e Z [ r i z is invariant under rotation of all
particles the energies are independent of M L and diagonal in L . Thus we may
select any convenient value of M L' The matrix elements of the Coulomb
repulsion are most easily calculated in the product basis m l mz rather than the
coupled basis L M L' Furthermore we know that the trace of a matrix is
independent of the basis . Let us denote the required term energies by ELand
the diagonal matrix elements in the product basis by Eim, mz). Then
considering separately the matrix for each value of M we are able to deduce a
set of equations relating the E L to the E(mlmz).
Taking first the maximum M = 2 for which there is only one basis state we
have directly ED = E(11). For M = 1 there are two basis states and we have
E p + ED = E(1 0) + E(O 1). For M = °
there are three basis states giving
E s + E p + ED = E(1 - 1) + E(O 0) + E( - 1 1). It remains only to calculate the
diagonal matrix elements
E(m l mz)
We do this for arbitrary [since no more work is required. The interaction (l /r IZ)
°
is separated into parts depending on each electron by first expanding it in
Legendre polynomials of cos 1 2 , where el z is the angle between r , and rz , and
then using the spherical harmonic addition theorem (A4.8). Finally, the
angular integrals are carried out using equation (AA.6). Thus expanding
ex.
e 2/r l Z = L Vd rl ,rZ)Pk(cos 0 12 )
k=O
and writing for the radial integral
F(k) = f fdddrldr2U';/(rtlu,;,(rl)Vk~rl,r1)
we have
1/
E(mlm 1) = L F(k)C2(kll,OOO)C(kll,Omlml)C(kll,Om1m1) (AS.l)
k (even) = 0
The upper limit on k has come about because of the vector coupling of k, [and l.
The coefficients C vanish otherwise. The restriction to even k results from
consideration of parity in the single-particle integral (A4.6). The vector
coupling coefficients C all take the value unity when k = so that in the case of
[ = 1 we have the simple formula
°
E(m l m1) = F(Ol + F(2 lC 2(211 ,OOO)C(211 , Om l m l )C(211, Om1m 1)
A.5 .1 Appendix 541
Using the known values of the coupling coefficients (see problem 7.8) we can
now deduce E(m\ m2)and hence ELin terms ofthe two radial integrals FlO) and
F(2),
F SL = (lJIsdI23)II e2 /rjjllJlsd123))
, < )
but since 'P is antisymmetric, the contribution from each pair of electrons is the
same and the sum is equal to three times the contribution of any particular
pair, for example electrons numbered 1 and 2. Thus using also equation (A.5.3)
(A.SA)
542 Appendix A.5 .1
where E<J> denotes the term energies for the two-particle system . In deriving
equation (A.5.4) we have integrated over the coordinates of particle number 3
which is trivial and gives a factor unity from normalisation since the operator
is now independent of particle 3. We have also introduced the coupling
coefficients implied in the expression (A.5.3) and carried out the sum over the
Ms a nd M L values of <1>. These sums again give a factor unity from the
normalisation of the coupling coefficients and the fact that E<I> is independent
of these M-values.
Knowing the two-electron term energies from the previous section one
needs only to deduce the 'fractional parentage coefficients' a<l> to obtain the
three-electron term energies F SL' Some elaborate methods have been de-
veloped for this purpose and extensive tables have been published. Here we
shall deduce the coefficients by an elementary method. From the normali-
sation of 'II the coefficients must be normalised I: Ia<l> IZ = 1 and for the
purpose of using (A.5.4) the phase of a<l> is not rele~ant. The coefficients are
conveniently presented in table A.4-with 'II labelling the rows and <I> the
Table A.4
y- ~ 3p IS
4S i
I
o
--,------------;c--- ----:::- -
2D I ex o
2p I ~ _
columns. The zero entries in the table follow directly from the triangular
conditions on vector coupling, i.e. a total S = t cannot be produced by
coupling the single particle spin of! of electron number 3 to the 'parent' spin
S = 0 and a tot al L = 2 cannot be produced by coupling I = 1 to the 1 S parent.
Use of the normalisation condition completes the first row and gives the
energy of the ground state term as
F(4S) = 3E p = 3F(O) - (3/5)F(Z) (A.5.5a)
using the results (A.5.2).To find the second row we use the result (8.44) noting
that equation (A.5.4)is valid for any invariant two-body interaction and hence
for,Ar in (8.44). Since P~j = - I in the 3 P term and + 1 in the 1 D this gives
3( -!Xz + fJz) = 3 -~ - !( ~) = 0
which combined with the normalisation !XZ + fJz = 1 gives !XZ = fJz = 1- Using
these values for (XZ and fJz we deduce the energy of the zD term,
FeD) = ~Ep +~ED = 3F (O) - z6sF(Z) (A.5.5b)
The same argument applied to the third row gives
3( _ yZ +OZ + £z) = 3 _~_!(~)
A.5 .2 Appendix 543
which together with the normalisation gives y2 = 1, (52 + £2 = 1. To find (52 and
£2 we use the two-body operator L [l(i) + IU)]2 in precise analogy with the
i<j
corresponding spin operator used in deriving equation (8.43). The correspond-
ing expression is
We have seen how the Clebsch - Gordan (or vector-coupling) coefficients play
a vital role in the construction of wave functions and the calculation of matrix
elements. These coefficients are calculated entirely from the group properties of
:ft 3 and are independent of any details of the physical system other than its
symmetry. When more than two angular momenta are coupled together one
naturally encounters products of Clebsch - Gordan coefficients and certain
sums of products occur sufficiently frequently that it is worth while
introducing a symbol to denote them and tabulating numerical values for
them-see Brink a nd Satchler (1968) and Rotenberg et a/. (1959) in the
bibliography to chapter 7, for further details of the recoupling coefficients. It is
clear that unlimited complexity may be achieved by coupling more and more
angular momenta together but, in practice, it is only the first one or two steps
along this path which are of much interest. The processes which we develop in
this section are applicable to most groups but have only been developed
extensively for the group gp 3 . For the Abelian group f:1l 2 these processes are
trivial.
544 Appendix A.5 .2
Each of these steps is unique and defines a set of basis vectors which we
denote by U12j34,jm ) . An alternative set of basis vectors may be defined by
taking the products in a different order, for example O(j,)x O(j,) and O(j2)X O(j·)
followed by OUIJ)x DU2.) would define a set IjI3j24 ,jm ) . The transformation
between these two sets is usually written as
(A.5.8)
which defines the 9j-symbol f ::}- Since functions with ditTerentj or mare
orthogonal it follows that the sum (A.5.8)is restricted to fixedj and m and it is
easily shown, using the raising operator J +, that the coefficients are
independent of m. From their method of construction it is clear that any of the
basis vectors on either side of equation (A.S.8) may be written as a sum of
vectors !m lm 2m3m4 ) in the simple product basis with coefficients which are
products of three Clebsch - Gordan coefficients. Hence using the orthogo-
nality of these coefficients, equation (A.5.8) leads to an expression for the 9j-
A. 5.2 Appendix 545
R q(k,, Iis a un it operator so far as the second component is concerned and vice
versa . Using the Wigner - Eckart theorem and the definition (A.5.8) of the 9j-
symbol one then finds
There are several special cases of these results which are in fact the ones most
frequently encountered, When one of the entries in a 9j-symbol vanishes two
other pairs of entries must be the same because of triangular conditions,
leaving only six independent j- values . In this case one writes
e}
L eA
Q b (_l)b +c+e+ f ab e
c d e = (A.5.l2)
{
f f 0 [(2,+ 1)(2! + 1)]1
jIJ
(A.5.l3)
The result (A.5.10) may be specialised in two way. Fir st, by taking S(k,) = I,
. q,
the unit operator, so that k 2 = 0, we 0 b tam
<ilhjmIR~,')Ij'lj~ j' m' >= ( - I )j, +j, +j' - k, eU' k .t.m' ql m)oj,/"
in which the dependence on the total angular momentaj and j' is given entirely
by known functions .
As a second specialisation, set K = 0 so that k I = k 2 and the operator is n°)
now an invariant, giving
<j
' j'
I 2
j'mIT(Ollj"
° j"
1 2
jm' >= 0u..! 0mm,( _pr i', +j, +j _ k,( (2j l + l)(2j2 +
(2k I+ l)
0)1
x {j~, j'~
h h
A.5.3 Appendix 547
This formula gives thej-dependence of the matrix element of the invariant T bo1
through a 6j-symbol.
The explicit form of the invariant operator is from equation (A.5.9)
putting in the known va lues for the Clebsch - Gordan coefficients. It is fairly
common practice, however, to write
and to refer to (R1k ,).S(k ,)) as the scalar product of the two irreducible sets of
operators R(k,) and S(k ,J. In particular for the angular momentum operators
which have 1<' = 1, q ,
which agrees with the usual notion of a scalar product of two vectors. Care
must be taken to avoid errors of (-1) k,(2k 1 + 1)! due to these two ways of
expressing the inva riant.
Some of the results obtained earlier in this chapter may be obtained directly
from the formulae derived above rather than by the use of equivalent
operators. For example, equations (8.28) and (8.39) are particular cases of
equation (A.5.15) while equation (A.5.14) will be used in the next section to
deduce the ratios of transition strengths.
We have already discussed selection rules for the emiss ion and absorption of
electric dipole radiation in sect ions 5.4 and 8.1. In an LS-coupling situation for
a many-electron atom they are given by
S(J ,1') = (2J + 1)(2L + 1)(21 ' + 1){ ~, l' 1}2 <L SI IV I1
' I IL' S) 2
L S
(A.5.17)
Thus without any detailed knowledge of the wave functions we deduce that
for fixed Land L ', the dependence on J and J' is given by
x{~, ~ ~r<LSIIVI1)IIL'S)2
A.5 .4 Appendix 549
Thus for example the ratio of strengths for the two transitions 151 -> 2PJ and
15r~ 2p!, shown in figure 8.1(c), both with M = -!- and M ' = is 2 -+1,
C 2(ll .1 .1 _ I _.1) x 4 x ) I ,1 I3 _i1 }2
2 2'2 2 tI 0 1
C 2(.1I.1 .1 _ I _ .1) x 2 x
I !- 1}2 - I
2 2'2 2 {
I
I 0 !-
where the values of the coefficients are taken from Rotenberg et al. (1959) in the
bibliography to chapter 7. Because of the presence of the Clebsch - Gordan
coefficients this differs from the ratio of 2 obtained from equation (A.5.l7) for
the total strengths from 1512 to 2PJ2 and 2Pb 2
respectively.
The results of subsection 9.9.2 do not depend on the detailed form of the
crystal field potential but only on its symmetry which is determined by the
cry stal structure. However, this told us only the manner in which the multiplets
would split but did not tell us anything about the ordering of the split levels or
their energy spacings. If we now want to calculate such details of the states in
the crystal field we will need to evaluate matrix elements of the crystal field
potential in the states which we have found by symmetry arguments. In this
section we study the crystal field potential and in the following section
methods a re given for evaluating the matrix elements.
The potential due to the surrounding atoms or ions will have a form which is
invariant under all the point group symmetry operators and will therefore
transform according to the identity representation . For the region in which we
are interested it will also be a solution to Laplace's equation and can therefore
be expanded in spherical harmonics as
Since rotations can only mix spherical harmonics of the same k the
contribution to Vc from each k must al so transform according to the identity
representation of the point group. This places very strong restrictions on the
coefficients At) which we illustrate by taking the symmetry group O.
From the formula (7.42) for the character of D(k) and using table 9.5 we soon
find that for k = 1, 2, 3 and 5 the representation D(k) contains no invariant
under the group 0 and for k = 4 there is just one.
By defining the z-axis to be an axis of greatest symmetry we may also restrict
the possible values of q which occur in the expression (A.5.18). For example in
the group 0, the four-fold rotational symmetry about the z-axis implies that
the values of q must have the form q = 4n, with n an integer. Thus the only
surviving coefficients with k < 6 are AbO) , AiJ4), A~4) and A(~~ and, since there is
only one fourth-order invariant, the coefficients Ab4) , A~4) and A(~~ must be
550 Appendix A.5.5
related. To make the potential invariant under a four-fold rotation about the
x- or y-axes requires that A(~~ = A~4) = (t54)fAb4), see problem A.8. Hence the
potential for a cubic field has the form
Vc = AbO) + A~4)r4 { ylo4) + (4)t( y~4) + yi~~)}+ terms with k ~6 (A.5.19)
In this particular case it is equally easy to write down an expression for V in
powers of x, y and z which has the required symmetry. The terms must be even
in x, y and z and be symmetric with respect to permutation of x, y and z. The
serie s must therefore take the form
The calculation of the crystal field splittings in terms of the parameters A~k) is
in general a complicated numerical problem involving the radial wave
functions of the electrons and details of the many-electron wave functions.
However, it is possible to deduce relations between some of the splittings by
the use of symmetry alone. This is done by using the Wigner - Eckart theorem
or, what is really the same thing, by the use of equivalent operators, see
equations (4.62), (7.53) and subsection 7.4.7.
A.5.5 Appendix 551
(A.5.20)
We deliberately use the notation V~4) rather than ¥1q4 ) to emphasise that the
actual potential is summed over all electrons. Thus, whereas ¥1q4 ) must refer to a
specific electron, the expression 0q4 ) stands for the sum of such terms . It will of
course have the same rotation properties and that is all that concerns us at the
moment. Expressions for a typical wave function
in the M-basis are easily deduced and for practical purposes the coefficients c<u
have been tabulated. In this example we shall deduce them by setting up the
7 x 7 matrix of the operator (A.5.20) in the M-basis for L = 3 and then
+ +
diagonalising. From the reduction D(3) = A 2 T 1 T 2 we know that the
resulting eigenvalues must contain two three-fold degeneracies.
From the m-dependence of the operator (A.5.20) it is clear that the term
004 ) contributes only on the diagonal of the matrix, while the other two terms
change M by four units . Thus, we see that the 7 x 7 matrix breaks up into three
2 x 2 matrices with M = ± 2, M = + 1 and - 3 and M = - 1 and + 3,
respectively, while M = 0 is uncoupled and is therefore an eigenvector. We
thus need to calculate only a few matrix elements of the kind
<L = 3, M' IV~4) IL = 3, M > which are related , according to the Wigner -
Eckart theorem by
<L = 3,M'IV~4)1 L = 3,M >= C(343,MqM ')(L = 31IV(4) II L = 3 >
552 Appendix A.5 .5
Table A.5
(A.5.2I)
necessary only to write J2 for L 2 and J, for Lz . The relative strengths of the
splitting in different J-multiplets may be deduced with the help of equation
(A.5.l4).
f
P:.!.'m = (2j + I) D:.!·'m(iX. p,1') T(iX . p. l') sin{Jd :xd{Jd}'/ 8rr 2
p UI = n.{
J r- )
J 2 - j' U' + I) l!n
J r-
U U+ I) - j' U' + I)
J
l
where n denotes the infin ite pr oduct over j' = 0. 1. I. . . ., excl ud ing j' = j .
A.4 Calculate the coupling coefficients required in the deri vation of equation (A.S.2 ).
(U se the method of questi on 7.8.)
A.S In question 7.12. the ra tio s between matrix elements o f a tensor operator of
degree 2. in sta tes with I = I. were calculated from the Wigner- Eck art theorem.
Deduce the same rat ios by using the equivalent operator meth od o f sectio n 7.4.7
(Sta rt with the equivalent operator Lll ' = L + L + and co nst r uc t the other
components fro m equati on (7.52) with equat ions (7.28 ) a nd (7.30). Then use
equation (7.40) to find th e matri x elements .)
A.6 Use equation (A.S. 14 ) to sho w th at the matrix elements o f the d ipole operator
L o + gS o in LS-coupling a re given by the formul a
f
J 'J I }
x C (J IJ '. M OM ) [L(L +I )(2L +l)(2J+I)] 1
LL S
(Write So = J o - L o and use the result of questi on 7.13 for the reduced matrix
element of an an gular momentum operator.)
A.7 Use equation (A.S.15) to deduce the result (A.S.2). Treat P, (COS8 12 ) as a scalar
product [y(k I (I) . ylk ' (2 )] because of the addition theorem (A.4.8) and take the
reduced matrix elements of y tk l from equation (A.4.S). You will need the
following coefficients:
A.9 In subsection A.5.5. we derived an expression (A.5.21 )for the operator equivalent
for V(k = 4, AI)' Here, we suggest a short-cut method to achieve the same result.
The operator we require must be a fourth-order polynomial in the operators L "
Lyand L; However, we know that L~4) = L~ and that, since L~4) has q = 0, it
must have the form L~4) = C(L 4 + «L 2L; + fJL~ + yL 2 + (iL;). The coefficients a,
p, y and (i may be found by demanding that L~41 has zero matrix elements in all
states with L < 2. This is a necessary condition for a tensor operator of degree
k = 4. The choice of states L = I, M = 0; L = I, M = I; L = 1, M = 1; L = 1,
M = t is sufficient to give the four coefficients. To find the correct relative
magnitude of L~41 and L~41 we must make use of the symmetry &roup 0 and
choose the constant C so that in the complete expression L~41 + (-fs)2(L~4) + L~~)
the coefficients of L~ and L~ are the same.
A.IO Deduce the relative splitting for given J in column three of figure 9.8 by using an
equivalent operator like (A.5.21) for the cry stal field.
A.II With the method suggested in the previous question one cannot deduce the
relative splitting in different J due to the crystal field. Use equation (A.5.14),
together with the table of Rotenberg et al. (1959) in the bibliography to chapter 7,
to find this relative splitting.
Appendix 6: Solutions
to Problems in Volume 2
Chapter /4
14.4 See figure S.3 for the Brillouin zone. The little groups are C4,- at the
points rand M, C 2v at X , and C lh at ~ , Land Z . As k --+0, ~I is
compatible with r l' r 3 and r 5' and ~ 2 is compatible with r 2, r 4 and
r s . As k--+n ja, ~I is compatible with XI and X 4, while ~2 is
compatible with X 2 and X 3 ' (The irreducible representations of the
little group C4v at r are denoted by r i , where i = I to 5 corresponds to
the order in which the representations are listed in the character tables
in appendix 1. The same convention is used at the other points.)
14.5 The eigenstates are <!>I Al=!/Jl+!/J2+!/J3+!/J4' <!>IH,) = !/Jl +!/J4
-!/J3-!/J2' <!>IE)=!/JI-!/J4' <!>I E)=!/J2-!/J 3' the last two
being degenerate. The energies all contain the constant
h 2 n 2 j2Ma 2 +Vooand in addition.F, , +2 VOl for A I'vII - 2 VOl for B 2
and -VII for the E-doublet, where
Vpq = l'c-
I f
V(x ,y) exp{(px +qy)in ja}dxdy
555
556 Appendix 6: Solutions to Problems
""10
x
r "fa
Figure S.3
Chapter 17
17.1 (124)(36)(5)
17.4 Dimension is five: (22111), (21211), (21121), (12211), (12121).
17.5
Chapter 18
18.1 For L (3]; 111 1 ) ,1222 ) ,1333 ) and the symmetrised products formed
from 1112 ),1113 ) , 1221 ) , 12 2 3 ), 1331 ) , 13 3 2) and 11 23 ) . For
L[111] ; the antisymmetrised product formed from 1123 ) . The ortho-
gonal complement then belongs to L[21]. (Or use the Y 3 projection
operators p[21].)
I
ITIillJ
~~
18.2 (a) 1111121 1112121 1212121 1111131
18.3
(a) (11 ) x (20) = (31)+(12 )+(20)+(01 ). (b) D l!lx D(1) = D I!l +D I!1
18.4 (300)(2 10)(120)(030)( 201)(1 1 1)(02 1)(102)(012)(003). Th us
M = 3, 2, I, 0, I, 0, - I, - I, - 2, - 3 an d so L = 3 o r L = I.
18.5 L = 3 a nd L = 1 in bo t h cases.
18.6 For [ 2] , L = 4, 2 o r O. Fo r [ 3], L = 6, 4, 3, 2 or O. For [2 1] , L = 5; 4, 3,
2 (twice) a nd 1.
3
18.7 T = 2WIt · h J = 2'
9 5 3 T
2' 1 · = 21 W1t ith J = 21311 9 7 (t .
' 2 ' 2' ) 5( t .
1 wlce,1 wlce,) 1,
3 1
2·
Chapte r 20
20.2 See figure SA. 3a + 2{3
3a+ {3
Figure SA
A ppendices 4 and 5
A.tO See A.11.
A.II The sh ifts in the nine levels in the third column of figure 9.8 a re given by
0, - 66, 132, - 28, - 4, 36, ( - 49 ± 5 J 72 1), 196 in a rbitra ry units, wit h
the figure roughly to sca le.
Index to Volumes 1 and 2*