Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
INTERNATIONAL SERIES IN
PURE AND APPLIED MATHEMATICS
11ttrod11ctio11 to
TOPOLOGY AND
MODERN ANALYSIS
GEORGE F. SIMMONS
Associate Professor of Mathematics
Colorado College
San Fra.ncisco
Toronto
London
EDlTIO~\T
Exclusive rights by Kogakusha Co., LtdT for lnanufaclure and exporc from Japan.
This book cannot be re-exported front the country to which ic is consigned by
Kogaku.sha Co.~ Ltd. or by McGraw~Hill no~Jk Company, Inc. or any of its :subsldiaries.
I
Copyright !963 by the ~fcGrav{-lfill Book Cornpany, Inc~ .All Rights Reserved.
This bookt or pares lhereof, n1ay nol be reproduced in any for1n l\Tithout pern1isa
sion of the publishers.
l .. HlR ARY OF -GO.~CRESS CATALOG C,\RD ~U\IBER
62-15149
I OWE MORE
THAN I CAN POSSIBLY EXPRESS
Preface
For some time now, topology has been firmly established as one of
the basic disciplines of pure mathematics. Its ideas and methods have
transformed large parts of geometry and analysis almost beyond recogni~
tionr It has also greatly ~timulated the growth of .abstract algebra.. As
things stand today, much of modern pure mathematics must remain a
elosed book the person who does not acquire a working knovlledge of at
least the element~ of topology~
There are many domains in the broad field of topologyr of which the
following are only a few: the homology and cohomology theory of complexes~ and of more general spa.ces as well; dimension theory; the theory
of differentiable and Riemannian manifolds and of Lie groups; the theory
of continuous curves; the theory of Banach and Hilbert spaces and their
operators 1 and of Banach algebras; and abstract harmonic analysis on
locally compact groups. Each of these subjects starts from roughly the
same body of fundamental knowledge and develops its own methods of
dealing with its o-..vn characteristic problems. The purpose of Part 1 of
this book is to make available to the student this .:ihard core" of fundamental topology; specificn.lly, to make it available in a form which is
general enough to meet the needs of modern mathematics, and yet is
unburdened by excess baggage best left in the research journals.
A topological spar.e can be thought of as a .set from which has been
swept away all structure irrelevant to the continuity of functions defined
on it. ~art 1 therefore begins with an informal (but quite extensive)
treatment of sets and functions+ Some writers deal with the theory of
metric spaces as if it were merely a fragment of the general theory of
t.opological spaces. This practice is no doubt logically correctt but it
8eems to me to violate the natural relation between these topics, in which
metric spaces motivate the more general theory. Metric spaces arP
therefore discussed rather fully in Chapter 2, and topological spacPs are
intl"oduccd in Chapter 3. The remaining four chapters in Part 1 are
concerned with various kinds of topological spaces of special importance
in applications and 'vith the continuous functions carried by them .
It goes without saying that one aspect of this type of mathematics
i8 its logical precision~ Too many writers 1 however, are content 'vith
this, and make lit t.le effort to help the reader maintain his orientation in
to
Yii
viii
Preface
the midst of maze.s of detail. One of the main features of this book is the
attention given to motivating the ideas under discussion. On every
possible occasion I have tried to make clear the intuitive meaning of -.,vhat
is taking place, and diagrams are provided~ whenever itt seems feasiblei
to help the reader develop skill in using his imagination to visualize
abstract ideas. Also~ each chapter begins with a brief introduction
which describes its main theme in general terms~ Courses in topology
are being taught more and more widely on the undergraduate level in
our colleges and universities, and I hope that these features, which tend
to sof tcn the austere framework of de futltions, theorer ns t and proofs~
will make this hook readable and easy to use as a text.
Historically speaking, topology has follo,vcd t'vo principa~ lines of
development. In homology theory~ dimension theory, and the study of
manifolds~ the basic motivation appPars to have co1ne from geometry.
In these fields, topological spaces are looked upon as generalized geometric
configurations, and the emphasis is placed on the structure of the spaces
themse]ve~..
ln the other direction, the n1ai n stimulus has been analysis.
Continuous functions are the chief objects of in t.erest here, and topological
space.s are regarded primarily as carriers of snr~h functions and as domain8
over which they can be integrated. These idea~ lead naturally into the
theory of Banach and Ililbert spaces and Banach algebras, the modern
theory of integration~ and abstract harmonic analysis on lo cal1y compact
groups.
In Part 1 of this book, I have attempted an even balance bet\veen
these two points of view4 This part is suitable for a basic semester course
and most of the topics treated are indispensable for further study in
almost any direction. If the instructor wishes to devote a second
semester to some of the extensions and applications of the theory, many
possibilities are open. If he prefers .applications in modern analysist he
can continue 'With Part 2 of this book 1 supplemented, ]JerhapsJ with a
brief treatment of measure and integration. aimed at the general form of
the I{iesz re presen ta ti on theorem4 Or if his tastes incline him to ward
the geometric aspects of topology, he can switch over to one of the many
excellAnt books which deal with these matters.
The instructor who intends to continue with Part 2 must face a
question which only he can answer. Do his students know enough about
algebra? This question is forced to the surface by the fact that Chapters
9 to 11 are as much about algebra as they are about topology- and analysis.. If his students know little or nothing about modern algcbrat t.hen a
careful and detailed treatment of Chapter 8 should make it possible to
proceed without difficulty. And if they knov{ a good deal~ then a quick
survey of Chapter 8 should suffice. It is my o~,.n opinion that education
in abstract. mftthematics ought to begin on the junior level \Yith a course
in modecrn alg;ebra and that topology f::l1011]d he oJTr.rcd only to students
1
Preface
ix
who have acquired some familiarity~ through such a course, with abstract
methods.
Part 3 is intended for individual study by exceptionally well ... qualified
student~ with a reasonable knowledge of complex analysis. Its principal
purpose is to unify Parts 1 and 2 into a single body of thought, along the
lines mapped out in the last section of Chapter 114
Taken as a whole, the present work stands at the threshold of the
more advanced books by Rickart {34L Loomis [271., and Naimark [~2];
and much of its subject matter can be found (in one form or another and
with innumerable applications to analysis) in the encyclopedic treatises
of Dunford and Schwartz [81 and Hille and Phillips 1201~ l This book is
intended to be elementary, in the sense of being accessible to well-trained
undergraduates, while those just mentioned are not.. lts prerequisites
are almost negligible.. Several facts about determinants are used l\rithout
proof in Chapter 11, and Chapter 12 leans heavily on Liou ville 's thcorctn
and the Laurent expansion from complex analysis4 "W~ith these exceptions, the book is essentially self-contained.
It seems to me that a "\i\rorth while distinction can be drawn bet ween
two types of pure mathematits~ The first-which unfortunately is
somewhat out of style at present-centers attention on particular function8 and theorems which arc rich in meaning and historyi like the gamma
function and the prin1e number theorem, or on juicy individual fa{~t~, like
Euler's wonderful f ormu]a
1
+ 7~ + 79 +
..
1f ~ /6~
1'he Recond is concerned primarily \vith form and strnct.urP.. The present
book belongs to this ca.mp; for its dominant theme can be expressed in
just two -.,vords, continuity and linearity, u.nd iii~ purpose is to illuminate
t.he meanings of these words and their relations to each other. Mathematics of this kind hardly ever yields great and memorable results like
the prime number theorem and Ruler's formula~ On the contrary~ its
theorems arc generaUy small parts of a. much larger whole and derive
their main significance from the place they occupy in that whole. In
my opinion, if a body of mat.hem a tics like th is is to justify it.self~ it must
possess aesthetic qualities akin to those of a good piece of architecture+
It. should have a solid foundationi it.s v.r,.alls and beams f-lhould be firm1y
.and truly placed, each part shouJd hear a meaningful relation to every
other part, and its to,vers and pinnacles should exalt the mind. Tt is my
hope that this book can contribute to a wider appreciation of these mathen1a tica.l values.
bracket~
guard.
It is a basic principle in the study of mathematics, and one too
seldom emphasized, that a proof is not really understood until the
stage is reached at which one can grasp it as a whole and see it as a single
idea. In achieving this end, much more is necessary than merely follo'\\ring the individual steps in the reasoning. 1'\his is onJy the beginning.
A proof should be che,ved, swallowed, and digested, and this proces8 of
assimilation should not be abandoned until it yields a full comprehension
of the overall pattern of thought .
CoJttents
vu
Preface
Xt
PART
Chapter One
ONE~
TOPOLOGY
Chapter Two
9.
10.
11.
12.
13.
14.
15.
49
METlllC SPACES
Chapter Three
Chapter Four
91
TOPOLOGICAL SPACES
CO:AfPACTN ESS
70
106
110
xiv
23~
24~
25~
Contents
Tychonoff's theorem and locally compacl epaees
C.Ompa.ctne.ss for metric spaces 120
A.soolfts theorem 124
Chapter Five
118
SEPARATION
129
Chapter Six
135
142
co--1NECTEDNE"SS
.
:,,..
Chapter Senen
153
APPROXJM AT/(JN
PART TWO:
Chapter Eioht
Chapter Nine
171
AWEBRAIC SYSTEJIS
OPERATORS
196
211
BANACH SPACES
Chapter Te-n
HII~BERT
243
SPACES
244
Contents
KY
Ma.trices 280
Dctermina.nta and the spectrum of an operator
The .s pectra.l theorem 290
A survc y of the sit ua.ti on 295
PART THREE:
287
ALGEBRAS OF OPERATORS
65.
66+
67.
68.
69.
301
318
323
APPENDICES
ONE
TWO
THREE
Bibliography
355
Index of Symbols
359
Subject Index
363
PART ONE
CHAPTER ONE
Topology
or a point in the complex plane ( = a complex number).. A set is a collection or aggregate of such elements, considered_ together or as a whole4
Some examples are furnished by the set of all even positive integers, the
set of all rational points on the real linei and the set of all points in the
complex plane who sc distance from the origin is 1 ( == the unit eirc le in the
plane). We reserve the word class to refer to a set of sets~ We might
speak1 for instance, of the class of all circles in a plan c (thinking of each
circle as a set of points). It will be useful in the work we do if we carry
this hierarchy one step further and use the term family for a set of classes4
One more remark: the words element, set, class, and family are not
intended to be rigidly fixed in their usage; we use them fluidly, to express
varying attitudes to'vard the mathematical objects and systems \Ve
study4. It is entirely reasonable, for instance, to thir:ik of a circle not as
a set of points, but as a single entity in itself, in which ease we might
justifiably speak of the set of all circles in a plane.
rhere are two standard .notations available for designating a particular set. Whenever it is feasible to do so, we can list its elements
bet,veen braces. Thus {1, 2, 3 \ signifies the set consisting of the first
three positive integers, {11 i, - lJ -i} is the set of the four fourth roots
of unity, and { + 1, + 3, + 5~ . ~ . } is the set of all odd integers. This
manner of specifying a set, by listing its elements, is unworkable in many
circumstances. We are then obliged to fall back on the 8econd method,
which is to use a property or attribute that characterizes the elements of
the set in question. If P denotes a certain property of elements, then
fx: P J stands for the set of all clements x for which the property P is
meaningful and true. For cxamplct the expression
1
~ x: x
is real and
irrational~,
which '\\'Te read the set of all x S:u.ch that xis real and irrational, denotes the
set of ul1 real numbers vrhirh cannot be written as the quotient of two
integers. The set under discussion contains all those elements (and no
others) which possess the stated property4 The three sets of numbers
described at the beginning of this paragraph can be written either way:
i 1, 2
and
3}
= {n : n
write
and
= {x : a < x
= {x : a < x
=
{x:a ~ x
{x : a < x
~
~
b} ,
b} ,
< bL
< b} .
Topology
tains A). A C B allows for the possibility that A and B might be equaL
If A is a subset of Band is not equal to B, we-say that A is a proper subset
of B (or is properly contained in B). This relation is denoted by A C B+
We can also express A C B by saying that B is a proper superset of A
(or properly contains A). The relation C is usually called set inclusion.
We sometimes reverse the symbols introduced in the previous paragraph. Thus A c B and A C Bare occasionally written in the equivalent forms B =:) A and B :J A .
It will of ten be convenient to have a symbol for logical implication~
and ~ is the symbol we useL .If p and q are statements, then p => q
means that p implies q, or that if pis true~ then q is also true~ Similarly,
~ is our symbol for two-way implication or logical equivalenceL
It
means that the statement on each side implies the statement on the other,
and is usually read if and only if, or is equitralent t(J.
The main properties of set inclusion are obvious. They are the
following:
(1) AC A for every A;
(2) A c B and B c A => A = B;
(3) A c B and B c C ~A c C.
It is quite important to observe that (1) and (2) can be combined into the
single statement that A :::;; B :::> A ~ B and B ~ A.. This remark
contains a useful principle of proofJ namely, that the only way to show
that two sets are equal~ apart from merely inspecting them~ is t-o show
that each is a subset of the other.
Problems
1..
2.
3.
(a)
(b)
(c)
(d)
operations of
combination~
Topology
Fig~
1~
Set inclusion.
14,ig. 2.
AVB=BUA
A U
and
(B U
C) = (A U B) V C.
B,
A f\ B = ix:x e: A and x
e B}.
A 0. (B fl O) = (A fl B) ()
and
C~
n.
'JI
= P,
n.
and An U =At
and since
A c B
;;>
A fl B = A,
10
Topology
(B U C)
Av (B
C)
A
=
=
(A (\ B) U (A (\ C)
(Av B) n (Au C)~
Fig. 4+
A U (B
C) - (A U B) f\ (A V C).
the symbols involved. For instance, the first of the two distributive
laws says that an element is in A and is in B or C precisely when it is in
A and B or is in A and C.. We can convince ourselves intuitively of the
validity of these laws by drawing pictures~ The second distributive
law is illustrated in Fig~ 4~ where A V (B 0 C) is formed on the left by
shading and (A U B) n (A V C) on the right by cross-shading.. A
momentJs consideration of these diagrams ought to convince the reader
that one obtains the same set in each
ease.
The last of our major operations
on sets is the foruiation of complements. The comple.ment of a set AJ
denoted by A' t is the set of all elements which are not in A. Since
the only elements we consider are
Fig. 5. The complement of Ar
those which make up U, it goes with..
out saying~but it ought to be said
~that A' consists of all those elements in U which are not in A.
Symbolically,
A 1 = {x:x J A}.
Figure 5 (in which A 1 is shaded) illustrates this operation. The operation
of forming complements has the following obvious properties:
(A')'
AU A'
11
$=::>
B 1 C A'
A'" B
and
(A fl B) 1
~ A~
U B'.
(1)
The first equation of (1) says that an element is not in either of two .sets
precisely when it is outside oi both:t and the second says that it is not in
both precisely when it i~ outside of one or th.e other.
The operations of forming unions and intersections are primarily
binary opPrations; that is, each is a process which applies to a pair of sets
und yields a third. V{e have emphasized this by our use of parentheses
to indicate the order in ,,... .hich the operations arc to be performed, as in
(A1 U A1) V A3i where the parentheses direct us first to unite Ai and
A 2', and then to unite the result of this v.ith A~- Associativity makes it
possible to dispense with parcn theses in an expression like this and to
write A1 V A2 U Aa, where we understand that these sets are to be
united in any order and that the order in which the operations are
performed is irrelevantL Similar remarks apply to Ai n As(\ A3r
Furthermore:1 if ~Al, A 2J
~
An l is any finite class of setsJ then we
can form
L
and
in much the same way without any ambiguity of meaning whatever.
In order to shorten the notation, we let l = ~ 1, 2, .... , n} be the set
of subscripts which index the sets under consideration. I is. called the
index set. We then compress the symbols for the union and intersection
just mentioned to U ~r Ai and (')i.I A-t. As long as it Li:; quite clear what
the index set is_, we can write this union and intersection even more
briefly~ in the form ViAt: and ni.._4;. For the sake of both brevity and
clarity, these sets are often written Vi,.. 1 A.,: and rl!. 1 Ai~
These extensions of our ideas and notations don't reach nearly far
enough. It is of ten necessary to form unions and intersections of large
(really large!) classes of setsT Let t A,\ be an entirely arbitrary class
of sets indexed by a set I of subscripts. Then
and
"141
= {x : x
V ~r Ai
l2
Topology
happen that this class is emptyJ then the above definitions give (rememU. The
bering that all sets are subsets of U) V.A., = and rlla:
second of these facts amounts to the following statement; if we require
of an element that it belong to each set in a given class_, and if there are no
sets present in the class J then every element sa ti sfi es this requirement.
If we had not made the agreement that the only elements under consideration are those in U, we \vould not have been able to assign a meaning to
the intersection of an empty cla.ss of sets. A moment's consideration
makes it clear that Eqs. {I) are valid for arbitrary unions and intersections:
;:::=
and
(2)
It is instructive to verify these equations for the case in which the class
f Ai} is empty~
We conclude our treatment of the general theory of sets with a
brief discussion of certain special classes of sets which a re of con siderable importance in topology, logic, and measure theory. We usually
For the
remainder of this section we specifically assume that the universal set
U is non-empty~ A Boolean algebra of sets is a non-empty class A of
subsets of U which has the following properties:
(1) A and Be A=::::} A U B e A;
(2) A and B s A ~ A n B ! A-;
(3) A s A = } A' s AL
Since A is assumed to be non-empty, it must contain at least one set A.
Property (3) shows that A' is in A along with A_, and since A(\ A' = 0
and A U A' == U, (1) and (2) guarantee that A contains the empty set
and the universal set. Since the class consisting only of the empty set
and the universal set is clearly a Boolean algebra of sets:r these two
distinct sets are the only ones which every Boolean algebra of sets must
13
con ta.in. It is equally clear that the class of all subsets of U is also a
Boolean algebra of sets.. There are many other less trivial kinds, and
their applications are manifold in fields of study as diverse as statistics
and electronics~
Let A be a Boolean algebra of sets.. It is obvious that if
{A 1, At, . . . , An} is a non-empty finite subclass of A, then
and
are both sets in A; and since A contains the empty set and the universal
set, it is easy to see that A is a class of sets whlch is closed under the
formation of finite unions, finite intersections~ and complements. We
now go in the other direction, and Jet A be a class of sets which is closed
under the formation of finite unions, finite intersections, and complements. By these assumptions, A automatically contains the empty set
and the universal set, so it is non-empty and is easily seen to be a Boolean
algebra of sets. We conclude from these remarks that Boolean algebras
of sets can be described alternatively as classes of sets which are closed
under the formation of finite unions, finite intersections, and complements. It should be emphasized once again that when discussing
Boolean algebras of sets we always assume that the universal set is non ..
eDlpty.
1.
21
If {A..: l and {B;} are t\ro classes of sets such that {A~} ~ {B1 I ,
show that U1A1 ~ \J;B1 and r\jBj ~ fliAi.
The difference between two sets A and B, denoted by A - B ~is the set
of all clements in A and not in B; thus A ~ B = A fl B'~ Show
the following :
A - B
3..
=A
---. (A fl
(A - B) A - {B (A U B) A - (B V
B) = (A U B) - B;
C = A - (B U C);
C) == (A ~ B) U (A n C);
C ~ (A - C) U (B - C);
C) = (A - B) fl (A - C).
14
Topology
A d (B l1 C) = (A a. B) d C;
A 6. 0 : : : : A ; . A 8 ~4. = 0;
AaB
A r\ (B Li C)
.4.
5.
6.
7.
BLlA;
(A r\ B) ~ (.ti 0. C) .
=
1\. ri11g of sets is a non-empty class A of sets such that if A and B a.re
in A'J then A ~ B and A /\ B are also in A. ShO\\r that A must also
contain the- empty set, A U B, and A - B. Show that if a nonempty class of sets contains the union and difierence of any pair of its
set.;:.;~ then it is a ring of sets. Shov{ that a Boolean algebra of sets is
a ring of sets.
Sho-w that the class of all finite subsets (including the empty set) of
an infinite set i~ a ring of sets hut. is not a Boolean algebra of sets+
Show that the class of a11 finite unions of closed-open interval8 on the
real line is a ring of sets but is not a Boolean algebra of setsT
Assuming that the universal set (I is non-empty, shoo,v that Boolean
algebras of sets can be described as rings of sets which contain U.
3. FUNCTlONS
== x2
of the real "\""ariablc x. \\'hat do "\Ve have in mind when we call this a
function a.nd say that. y is a function of x? In a nutshell_, we are drawing
attention to the fact. that each re.al number x ha.s linked to it a specific
real number y., "rhich can be calculated according to the rule (or law of
correspondence) given hy the formula. le have here a process "\vhich,
applied to any re.al number x, docH .something to it (squares it) to produce
another real number y (the square of x). Similarly,
y
x:i -..- 3x
and
are two other simple functions of the rea] variable. x, and each is given
by a rule in the form of an algebraic expression which specifics the exact
manner in 'vhich the value of y depends on the value of :r.
15
The rules for the functions we have just mentioned are expressed by
formulas. In general, this is possible only for functions of a very simple
kind or for those which are sufficiently import.ant to deserve special
symbols of their own.. Consider, for instance~ the function of the real
variable x defined as fol]O\"\fs: for each real number xJ vrritc x as an infinite
decimal (using the scheme of decimal expansion in which infinite chains
of 9~s are avoided~in whichJ for example, 74. is represented by 425000 ~ ...
rather than by .24999 .... ) ; then let y be the fifty-ninth digit after
the decimal point. There is of course no standard formula for this:P
but nevertheless it is a perfectly respectable function whose rule is given
by n verbal description. On the other hand, the function y = sin x of
the real Va.J4iab1e x is so important that its rule, though fully as complicated as the one just dcfinedJ is assigned the special symbol sin. When
discussing functions in gcneralt we \Vant to allow for all sorts of rules and
to talk about them all at once, so we usually employ noncommittal
notations like y = j(x), y = g(x), and so on.
Each of the functions mentioned above is defined for all real numbers
x. The ex.ample y = I/x shows that this restriction is much too severe,
for this function is defined only for non ... zero values of x. Similarly 1
y = log xis defined only for positive values of x~ and y = sin~t x only for
values of x which lie in the interval [ - I~ I] Wha tevcr our conception
of a function may bet it should certainly he broad enough to include
examples like these, which are defined only for some values of the real
L
variable x.
16
Topology
Fig. 6.
This
notation is supposed to be suggestive of the idea that the rule f takes the
element x and does something to it to produce the e1ernent y
/(x) .
The rule f is often called a mapping, or trans!ormation, or operator J to
amplify thls concept of it. We then think off as mapping x's to y's, or
transforming x's into y~s, or operating on x's to produce y'8~ The set X
is called the dmnain of the function, and the set of all f(x)'s for all x~s
in X iB called its range. A function whose range consists of just one
element is called a constant function.
We often denote by f: X-+ Y the function -..vith rule /, domain X,
and range contained in Y ~ 'fhis notation is uscf ul bee a use the esseu tia]
parts of the function are displayed in a manner vlhich emphasizes that it
is a composite object, the central thing being the rule or mapping f~
Figure 6 gives a convenient way of picturing this function. On the
left, X and Y are different sets, and on the right, they are equal~in which
case we usually refer to f as a mapping of X into itself. If it is clear
from the context what the sets X and Y are~ or if there is no real need to
specify them explicitly:- it is common practice to identify the function
f: X--+ Y v{ith the rule/, and to speak off alone as if it were the function
under consideration (without mentioning the sets X and Y).
It sometimes happens that two petfectly definite sets X and Y are
under discussion and that a mapping of X into Y arises which has no
natural symbol attached to it. If there is no necessity to invent a
;:::=
17
symbol for this mapping, and if it is quite clear what the map ping is, it is
often convenient to designate it by x ---t y. Accordingly, the function
y = x 2 mentioned at the beginning of this section can be written as
x ____,. x 2 or x - 4 y (where y is understood to be the square of x).
A function f is r.alled an extension of a function g (and g is called a
restriction of j) if the domain oi f contain8 the domain of g and f(x) = g(x)
for each x in the domain of g.
Most of mathematical analysis~ both classical and modern, deals
with functions whose values arc re~l numbers or complex numbers .
,-1
Fig~
7.
This is also true of those parts of topology- which are concerned with
the foundations of analysis4 If the range of a function consists of real
numbcrs 1 we call it_ a real function; similarly, a complex function is one
whose range consists of complex numbersa Obviously, every real function
is also complex. We lay very heavy emphasis on real and complex functions throughout our work.
As a matter of usage~ we generally prefer to reserve the term function
for real 9r complex functions and to speak of mappings when dealing
with fun;tions whose values are not necessarily numbers .
Consider a mapping /:X ~ Y. When we call f a mapping of X . .
into Y, we mean to suggest by this that the elements /(x)~as x varies
over all the clements of X-need not fill up Y; but if it definitely does
happen that the range off equals Y, or if we specifically want to assume
this, then we ca] 1 f a mapping of X onto Y. If two different elements
in X always have different images under f 1 then we call f a one-to-one
mapping of X into Y.. If f: X __., Y is both onto and one... to-onc 1 then
we can define its inverse mapping j- 1 : Y ~ X as follows: for each y in Y,
we find that unique element x in X such that /(x) = y (x exists and is
unique since f is onto and one-to-one) ; we then define x to be j~ 1(y).
The equation :c = 1- 1 (y) is the result of solving y = f(x) for x in just the
same way as x ==== log y is the result of solving y = e.z for .x. Figure 7
illustrates the concept of the inverse of a mapping4
18
Topology
If f is a one-to-one mapping of X onto Y~ it will sometimes be convenient to subordinate the conception of f as a mapping sending zts
over toy's and to emphasize its role as a link between x's and y}s.. Each
x has linked to it (or has corresponding to it) precisely one y = f(z);
and~ turning the situation aroundJ each y has linked to it (or has corresponding to it) exactly one x = f~ 1 (y). When we focus our attention
on th is asp~c t. of a mapping 'vhieh is onc-to--0ne onto, we usually call it
a nne-to-one correspondence~ Thus f is a one-to-one correspondence
between X and Y, ~i.nd 1~ 1 is a one-to-one correspondence between Y
and X.
Now consider an arbittary mapping f: X----? Y~ r'fhc mapping /,
which sends each clement of X over to an clement of Yi indu(~es the
following two important set mappings. If A is a subset of ..: thPn its
image f(A) is the subset of Y defined by
1
.f(A)
{f(x} :x AL
and our fir.st set mapping is that whi(' h sends each A over to its co rre
sponding f (A). Similarly~ if B i~ a subset of Y ~ then its inl.~erse inla(Jc
1~ 1 (B) is the subset of X defined by
J- 1 (B)
{ x :f(x) E.
BL
and the second set mapping pulls each B back to its corresponding
1~ 1 (B}+
It is often essential for us to know ho\V these set mappings
behave with respect to set inclusion and operations on sets. We develop
most of their significant features in the following t-..vo paragraphs.
The main properties of the first set mapping are:
/(0)
0;
f(X) C Y;
A 1 s; A 1 ::::::} f (A 1) C f (A~) ;
/(V,A1.) = UJ(Ai);
(I)
f(niAi) C rlif(At)
The reader should convince himself of the truth of these statements.
~For instancej to prove ( l) "\\,..e -y;rould have to prove first that f (U ,A 1) is a
subset of UJ(A1.), and second that UJ(Ai) is a subset of .f(U lAi).
A proof of the first of these sei inclusions might run as follows: an elcme n t
in /(UiA-.J is the image of some element in UiA1J therefore it i8 the image
of an element in some A..:t therefore it is in some /(A1), and so finally it is in
VJ(A-t). The irregularities and gaps which the reader will notice in the
above statements are essential features of this set mapping4 For example, the image of an intersection need not equal the intersect ion of the
images, because t \\i..o disjoint sets can easily have images whi cl1 are not
disjoint..
Furthermore~
1~ 1 (Y) =
O;
Its properties
X;
1-1(81) C 1~ (B2);
J- 1 (U,Bi) = Ujf-~ (B,);
J- 1 (rlJJi) = nif- (Bi);
J- 1 (B') = /~ (B)'.
B1 C
19
Bi;;;;}
(2)
(3)
(4)
Again, the reader should verify each of these statements for himself+
f
at
Fip;. 8.
g{y)
sin y,
then these two functions can be put. together to form a single function
defined by z = (gf) (x) = g(f(x)) ~ g(x 2
1) === sin (x 2 + I).. One of
the most important tools of calculus (the chain rule) explains how to differentiate functions of thls kind~ This manner of multiplying functions
together is of basic importance for us as well, and we formulate it in
general as follows.. Suppose that f: X ---7 Y and g: Y ~ Z are any two
mappings.. We define the prod1.tt-t of these mappings, denoted by
gf: X ~ Z~ by (g/){x) = g(f(x)). In words: an element x in X is taken
by f to the elementf(x) in Y~ and then g mapsf(x) to g(f(x)) in Z. :F~igure
8 is a picture of this process. We observe that the two mappings involved
here are not entirely arbitrary, for the set Y whieh contains the range of
the first equals the domain of the second. More generallyt the product
of two mappings is meaningf u) whenever the range of the first is contained in the domain of the second. We have regarded f as the first
mapping and g as the setondj and in forming their product gf, their
symbols have gotten turned around~ This is a rather unp1easant
phenomP.non, for which we blame the occasional pe-rversity of mathematical symbols~ Perhaps it will help the reader to keep this straight
20
Topotogy
in his mind if he will remember to read the product gf from right to left:
first apply f, then g.
Problems
1~
g ~ ixg
3.
let X and Y be
= (J- 1J)g
non~mpty
J- 1 (jg) == f~
ix ~
J- 1}.
Shfnv
fh
4.
S.
6.
iv~
it~elf.
Show
that f is one-to-one onto ~there exists a mapping g of X into itself
such that jg = gf = ix+ If there exists a mapping g with this
property, then there is only one sur.h mapping. Why~?
Let X be a non-empty set, and let f and g be onc~to-one mappings
of X onto itselL Show that Jg is also a one-to-one mapping of X
onto itself and that (fg)- 1 = g- 11- 1
Let X and Y be non-empty sets and fa mapping of X into Y. If
A n.nd B are, respectively, subsets of X and Y~ show the follo,ving:
(a) Jf- 1 (B) CB, and ff~ 1 (B) ~ Bis true for all B ~ f is onto;
(b) A C J- 1f(A), and A == f- 1f(A) is true for all A <=> f is one-to-one;
(c) f(A1 r\ A1) == f(A1) fl f(A2) is true for all A 1 and A2 ~ f is
one-to-one ;
(d) f(A) 1 f; f(A 1 ) is true for all A ~ f is onto;
(e) if f is onto-so t.hat f(A)' ~ f(A') is true for all A-then
f(A )' = f(A ') is true for all A <:;;;> f is also one-to-one.
21
A. PRODUCTS OF SETS
We shall often have occasion to \\:oeld together the sets of a given
class into a single new set called their product (or their Cartesian product).
The ancestor of this concept is the coordinate plane of analytic geometry,
that is, a plane equipped with the usual rectangular coordinate sys tern.
We give a brief description of this fundamental idea with a view to
paving the way for our discussion of products of sets in general..
First, a few preliminary comments about the real line. "\\,.c have
already used this term several timeH \\~ithout any explanation, and of
course what we mean by it is an ordinary geometric straight line (see
Fig. 9) whose points have been identified 'vith---or coordinatizcd by~the
-3
-2
-1
-1/3
Fig. 9.
V2
1/2
'IT
)r
set R of all real numbers. We use the letter R to denote the real line
as well as the set of all real numbers, and we of ten speak of real numbers
as if they were points on the real line, and of points on the real line as if
they were real numbers. Let no one be deceived into thinking that the
real line is a simple thing, for its structure is exceedingly intricate. Our
present view of it~ however, is as naive and uncomplicated as the picture
of it given in ~,ig . 9. Generally speaking, 'vc assume that the reader is
familiar with the simpler properties of the real line--those relating to
inequalities (see Problem 1-2) and the basic algebraic operations of
addition, subtraction, multiplication, and division. One of the most
significant facts about the real number system is perhaps less well
known. This is the so-called least upper bound property, \V hi ch asserts
that every non-empty set of real numbers which has an upper bound has
a least upper bound~ It is an easy consequence of this that every nonempty set of real numbers which has a lo\ver bound has a greatest lower
bound. Al1 these matters can be developed rigorously on the basis of a
small uumber of axioms, and detailed treatments can of ten be found in
books on elementary abstract algebra.
22
Topology
(Y1 - Y~) 1
This notion of di.stance endows the set R 1 with a certain "spatiar' character~ which we shall recognize by calling the resulting space the Eucl.idean
plane instead of the coordinate plane.
23
X1 = X2
= X'R.
= X,
24
Topology
the manner in which this set arises, through the introduction of a rectanguJru- coordinate system into ordinary three-dimensional .space. We
can draw pictures here just as in the case of the coordinate plane_, and we
can use geometric language as much as we please, but it must be understood that the mathematics of this set is the mathematics of ordered
triples of real numbers and that the pictures arc merely an aid to the
intuition~ Once we fully grasp this point of view, there is no difficulty
whatever in advancing at once to the study of the set Rft of all ordered
n-tuples (x1~ xf, . . . 1 x,q) of real numbers for any positive integer n.
It is quite true that when n is greater than 3 it is no longer possible to
draw the same kinds of intuitively rich pictures_, but at worst this is
mere]y an inconvenience. We can (and do) continue to use suggestive
geometric language1 so all is not lost.. The set Cn. is defined similarly:
it is t.he set of all ordered n-tuples (z1, z2, .... , Zn) of complex numbers+
Ea ch of the sets RR and en plays a prominent part i Il our later lYOr k.
We emphasized above that for the present the coordinate plane is to
be considered as merely a set, and not a space. Similar remarks apply
to R~ and c~.. In due course {in Sec. 15) we shall impart form and
con tent to each of these sets by su.i table definitions.. We shall convert
them into the Euclidean and unitary n-spaces whlch underlie and motivate
so many developments in modern pure mathematics, and we shall
explore some aspects of their algebraic and topological structure to the
very last pages of this book. But as of now-and this is the point we
insist on-neither one of these sets has any structure at all.
As the reader doubtless suspectsJ it is not enough that we consider
only products oi finit.e classes of sets.. 1~he needs of topology compel
us to extend these ideas to arbitrary classes of sets.
We defined the product X1 X X! X . . X Xn to be the set of
all ordered n-tup}es (X1 1 Xi, . , Xn) SUCh that X..: is in X~ for each
subscript i. To see how to extend this definition,, we reformulate it as
follows.. We have an index set /, consisting of the integers from 1 to n,
and corresponding to each index (or subscript) i we have a non-empty
set Xi. The n-tuple (x1 1 x2, ...... ~ xn) is simply a function (call it x)
defined on the index set I~ with the restriction that its value x(i) = Xi is
an element of the set X, for each i in /. Our point of view here is that
the function xis completely determined by, and is essentially equivalent
to, the array (x1, x2, ..... J Xn) of its values.
The way is now open for the definition of products in their full
generality. Let {X;.} be a non-empty class of non-empty sets, indexed by
the elements i of an index set I. The sets Xi need not be different
from one another; indeed, it may happen that they are all identical
replicas of a single sett distinguished only by different indices. The
product of the sets Xi._, written P 1J X,11 is defined to be the set of all
functions x defined on I such that z(i) is an element of the set Xl for
25
each index i. We call X 1 the ith coordinate set. When there can be no
misunderstanding about the index sct 1 the symbol P1J
is often abbrevia tcd to P i.X ,:~ The defini tiot1 we have just given requires that each
coordinate set be non-empty before the product can be formed. It vtill
be useful if we extend this definition slightly by agreeing that if any of
the X/s ar.e empty 1 then P1X" is also empty.
This approach to the idea of the product of a class of sets~ by means of
functions defined on the index setj is useful mainly in giving the definition .
In practice~ it is much more convenient to use the subscript notation
x, instead of the function notation x(i). We then interpret the product
p x~ as made up of elements x~ each of which is specified by the exhibited
array {x..:} of itB" values in the respective coordinate sets Xi. We call
Xi the ith co01dinate of the element x = {x,,}.
The mapping p1 of the product PX" onto its ith coordinate set X1
which is defined by Pi(x) = xi-that is~ the mapping whose value
at an arbitrary element of the product is the ith coordinate of that element
~is called the projection onto the ith coordinate set. The projection
p, selects the ith coordinate of each element in its domain. There is
clearly one projection for each element of the index set I, and the set of
all projections plays an important role in the general theory of topological
spaces.
x.
Problems
1.
2.
3..
26
Topology
27
28
Topology
29
if x X;: V X~:P
if x E Xo.
\Ve attempt to illustrate these ideas in Fig. 12. Jierc we present. t\Yo
replicas of the situation: on the 1efti X and Y arc represented by the
vertical lines, and f and g by the lines slanting down to the right and
30
Topology
left; and on the right, we schematically trace the ancestry oi three e}e . .
ments in X~ of which x1 has no first ancestor_, x2 has a first and second
ancestor~ and xa has a first_, second_, and third ancestor.
/(X)
g(Y)
FigT 12.
Let f; X
r.J
2..
3t
4,
1-..t
5..
6.
31
7..
,.-..,_i
,.-..,_i
6. COUNTABLE SETS
The subject oi this section and the next-infinite cardinal numberslies at the very foundation of modern mathematics. It is a vital instrument in the day-to-day work of many mathematicians, and we shall
make extensive use of it ourselves.. This theory, which was created by
the German mathematician Cantor, also has great aesthetic appeal, for
it begins with ideas of extreme simplicity and develops through natural
stages into an elaborate and beautiful structure of thought. In the
course of our discussion we shall answer questions which no one before
Cantor's time thought to askJ and we shall ask a question whlch no one
can answer to this day .
Without further ado, we can say that cardinal numbers are those used
in counting~ such as the positive integers (or natural nlUllbers) 1, 2,
a, . . iamiliar to us alt But there is mueh more to the story than this.
The art of counting is undoubtedly one of the oldest oi human
activities. Men probably learned to count in a crude way at about the
same time as they began to develop articulate speech~ The earlies~ men
who lived in communities and dome!3ticated animals must have found
it necessary to record the number of goats in the village herd by means of
a pile of stones or some similar device. If the herd was counted in each
night by removing one stone from the pile for each goat accounted forJ
then stones left over would have indicated strays, and herdsmen would
have gone out to search for them. Names for numbers and symbols for
I
32
Topology
them, like our 1, 2, 3, .... , would have been superfluous~ The simple
and yet profound idea of a one-to-one correspondence between the
stones an<l the goats would have fully met the needs of the situation.
In a manner of speaking~ we ourselves use the infinite set
1V = { I, 2, 3i . . ~ }
of all positive integers as a Hpile of stones." We carry this set around
with us as part of our intellectual equipmentr Whenever we want to count
a set, say, a stack of dollar bills~ we start through the set l'l and t..ally off
one bill against each positive integer a.s we come to iL The htst.. rtumber
we reach, corresponding to the 1ast bill, is what we rnll the 11 umber of
bills in the stack. If this last number happens to be 10, t.hen i~ 10" is
our symbol for the number of bills in the stack, as it also is for the number
of our fingers, and for the number of our toes~ and for the number of
elements in any set which can be put into one-to-one correspondence
with the finite set I 1~ 2, . . . , 10} . Our procedure is slightly more
sophisticated than that of the primitive savage. We have the symbols
1~ 2, 3~ . . .- for the numbers which arise in counting; we can record
them for future use, and communicate them to other ~oph~, and manipulate them by the operations of arithmetic. But the underlying idea,
that of the one-to-one correspondence, remains the same for us as it
probably was for him.
The positive integers are adequate for the purpose of counting any
non-empty finite set, and since outside of ma thPmat.ics all sets appear to
be of this kind, they suffice for all non-mathematical counting. But in
the world of mathematics we are obliged to consider many infinite sets,
such as the set of all positive integers itself, the set of all integers~ the
set of all rational numbers~ the set of all real nu mhers, the set of all
points in a plane, and so on. It is of ten import.ant to be able to count
such sets~ and it was Cantor,s idea to do this, and to develop a theory of
infinite cardinal numbers, by means of one-to-one correspondences~
In comparing the sizes of two sets1 the basic concept is that of
numerical eq ui valen re as defined in the previous scctionr We recall
that two non-empty sets X and Y are said to be numerically equivalent if
there exists a one-to-one mapping of one onto the other, or-and this
amounts to the same thing-if there can be found a one-to-one correspondence betlveen them. To say that two non-empty finite sets are
numerically equivalent is of course to say that they have the same number
of element8 in the ordinary sense. If we count one of them, we simply
establish a one-to-one correspondence between its clements and a set of
positive integers of the form I 1~ 2, .. ~ , n L and we then say that n is
the num.1Jer of elements possessed by bothi or the cardinal numbet of both.
The positive integers are the finite cardinal numbers. We encounter
33
'
1, 2t
3~ 4~
0, 2, -
2~
5, 6J
4, -4, 6J
7, . .
~ 6, . .
Here our device is to start 'With 0 and follow ea.ch even positive integer
as we come to it by its negative. Similarly~ N is numerically equivalent
to the set of all integers:
1, 2,
o~
3J 4,
5, 6,
-1, 2, - 2_, 3,
7'
..
-a, . .
2~
1', 2 2,
3, 4, 51
3~.i\
4\
5 21
.
.....
It struck him as very strange that this should be true, considering how
34
Topology
sparsely strewn the squares are among all the positive integersr But
the time appears not to have been ripe for the exploration of this phenomenonj or perhaps he had other things on his mind; in any case, he did not
follow up his idea.
These examples should make it clear that all that is really necessary
in 1;howing that an infinite set X is numerically equivalent to N is that we
be able to list the elements of X,. with a first 1 a second, a third~ and so on1
in such a way that it is completely exhausted by this counting off of its
elements. It is for this reason that any infinite set which is numerically
equivalent to N is said to be countably infinite~ We say that a set is
countable if it is non-empty and finite (in which case it can obviously be
counted) or ii it is countably infinite.
One of Can tor's earliest discoveries in his study of infinite sets was
that the set of all positive rational numbers (which is very large: it
contains Nanda great many other numbers besides) is actually countable.
We cannot list the positive rational n.um be rs in order of size~ as we can
the positive in tegers:P be gin.ni ng 'With the sm.al lest, then the next smallest,
and so onj for there is no smallest, and between any two there are infinitely many others. We must find some other way of counting them,
and following Cantor, we arrange them not in order of size,, but according
to the size of the sum of the numerator and denominator.. We begin
with all positive rationals whose numerator and denominator add up to 2:
there is only one, X ;:::= 1. Next we list (with increasing numerators) al1
those for which this sum is 3: Vij 31 = 2. N cxtj all those for which this
sum is 4:~, 7~ = 1, Ji = 3. Next~ all those for which this sum is
5: >i, %, %, ~ = 4. Next, all those for which this sum is 6: 7-Sj % = 72,
% ;:::= 1, % ;; 2, 91 = 5. And so on~ If we now list all these together
from the beginning, omitting those already listed when we come to them,
we get a sequence
1, J,1~ 2,
....
whlch contains each positive rational number once and only once.
Figure 13 gives a schematic representation of this manner of listing the
positive rationals~ In this figure the first row contains all positive
rationals with numerator 1, the second all with numerator 2:P etc.; and
the first column contains all Vlith denominator 1 ~ the second all -with
denominator 2J and so on~ Our listing amounts to traversing this array
of numbers as the arrows indicate, where of course all those numbers
already encountered arc left out~
It's high time that we christened the infinite cardinal number we've
been discussing, and for this purpose \Ve use the first letter of the Hebrew
alphabet (Nj pronounced "aleph~'} v.dth 0 as a subscript~ We say
that No is the number of elements in any countably infinite set. Our
35
1, 2' 3
..
NO
-11 -21
1
3
-41 ---51
.... .........
----
-52
.... ---
.-... .......
... _..-.._.
----
------
----
...
......--. ....
....._
2
l
-2
2
-23
-2
3
2
-33
-43
-34
-4
-4
5
3
-45
4
1
-51
-5
1
I
2
r
.-
.-......-
........... -
I
I
'
I
I
I
Fig. 13.
'
I
I
'
I
I
I
__
I
f
I
rationals~
( 1) there exists a one-to-one mapping of X into Y, and (2) there does not
exist a one-to-one mapping of X onto Y. UBing thjs concept, it is easy to
relate our cardinal numbers to one another by means of
1
<
1.
2.
Prove that the set of all rational numbers (positive, negative, and
zero) is countable. (Hint .~ see our method of showing that the set of
all integers is countable.)
Use the idea behind Fig. 13 to prove that if [ Xd is a countable class
of countable sets, then Vt.Xi is also countable.. We usually express
this by saying that any countable union of countable sets is countable.
36
3.
.44'
Topology
Prove that the .set of all rational points in the coordinate plane R 2
(i.er, all points whose coordinates arc both rational) is countable~
Prove that if X1 and X1 are countable, then X1 X X2 is also
countable~
5.
6.
7.
8.
7. UNCOUNTABLE SETS
All the infinite sets lVe considered in the previous section were
countable, so it might appear at this stage that every infinite set is countable. If this were true~ if the end result of the analysis of infinite sets
were that they are all numerically equivalent to one another, then
CantorJs theory- would be relatively trivial. But this is not the case, for
Cantor discovered that the infinite set R of all real numbers is not countable---or, a8 we phrase it, ll is uncountable or uncountably infinite. Since we
customarily identify the elements of R with the points of the real line
(see Sec. 4) 1 this amounts to the asgcrtion that the set of all points on the
real line represents a. '~higher type of infinityn than that of only the
integral points or on]y the rational points~
Cantor,s proof of this is very ingenious, but it is actually quite
sim plP. In outline the procedure is a.s follows: V{IJ assume that all the
real numbers (in decimal form) can be listed~ and in fact have been
listed ; then Vt...e pro<l ure a real number Vt...hich cannot be in this list-th us
contradicting our initial assumption that a complete listing is possible .
In representing real numben~ by decimals, V{e use the scheme of decimal
expansion in which infinite chains of 9's are avoided; for instanre, we
write 72 as ..5000 ..... and not as .4999 . . . . In this way we guarantee
that each re.al number has one and only one decitnal representation.
Suppose no\v t.hat we can list all the real numbersj and that they have
been listed in a column like the one below (,vherc we use particular
numbers for the purpose of ii lustration).
lst number
2nd number
3rd number
+ .712983
~4 + .913572
13
0
+
.84326.5
..
'
.,
...
. ~
37
38
Topology
real line R as shown in the figure; and we link P and P' by projecting
from its center. If formulas are preferred over geometric reasoning of
this kind, we observe that y = a
(b - a)x is a numerical equivalence
be.tween real numbers x E (O, 1) and ye: (a_,b), and that z = tan r(x - ~)
is another numerical equivalence between {0,1) and all of R. It now
follows that (a,b) and R are numerically equivalent to one another.
We are now in a position to show that any subset X of the real line
R which contains an open interval I is numerica1ly equivalent to R, no
matter how complicated the structure
oI X may be.. The proof of this
fact is very simple, and it uses only
the Schroeder-Bernstein theorem and
-1
0
1
our above result that I is numeriFig. 14. A one-to-one corr es po n den ee
between an open interval and the reel cally equivalent to R.. The argu ...
ment can be given in two sentences.
lina.
Since X is nttmerically equivalent to
itself~ it is obviously numerical1y cquiva1ent to a subset of R; and R is
numerically equivalent to a subset of X~ namely~ to 1. It is now a direct
consequence of the Schroeder-Bernstein theorem that X and R are
numerically equivalent to one anothcra We point out that all numerical
equivalences up to this point have been established by actually exhlbiting
one-to-one correspondences between the sets concerned. In the present
situationJ howevert it is not feasible to do this~ _for very little has been
assumed about the specific nature of the .set X. V\'ithout the help of the
Schroeder-Bern8tein theorem it would be very difficult to prove theorems
of this type,.
\re give another interesting application of the Schroeder-Bernstein
theorem. Consider the coordinate plane Rt and the subset X of R 2
defined by X ::::= { (x,y) :0 ::5" x < 1 and 0 < y < 1 J. We show that Xis
nu1nerically cqui valcnt to the closed~open interval
I =
y ;:;;: 0}
which forms its base (see Fig. 15).. Since I is numerically equivalent to a
subset of X~ nameJyt to I itself_, our conclusion will follow at once from
the Schroeder-Bernstein theorem if we can establish a one-to-one mapping
of X into J. This we now do. Let (xJy) be an arbitrary point of X.
Each of the coordinates x and y has a Wlique decimal exw.tnsion which
does not end in an infinite chain of 9's. We form another decimal z from
these by alternating their digits; fo! examplet if x = .327 . . ,. and
y = .614 . ~ . ~ then z = .. 362174 . . . . We now identify z {which
cannot end in an infinite chain of 9ts) with a point of I. This gives the
require<l one-to-one mapping of X into I and yields the somewhat
39
startling result that there are no more points inside a square than there
are on one of its sides.
In Sec. 6 we introduced the i:;ymhol Mo for the number of elements
in any countably infinite set. At the
beginning of this section V{C proved
Y axis
that the set fl of all real nun1hers
(or of all points on the real line) is
uncountably infinite. We now in~
troduce the symbol c (called the
ca.rdinal number of the continuum)
for the 11 umber of elements in R.
c is the cardinal number of R and of
any set \vhich ]s numerically equivalent to ll~ In the above three
x axis
paragraphs we have demonstrated
that c is the cardinal number of any
Fig. 15
open interval, of any subset of ll
'vhich contains an open interval, and
of the subset X of the roordina te plaric "h ich is illustrated in Fig~ 15.
Our list of cardinal numbers has n O\V gro-\v n to
1, 2, 3, . ~ ~ ~ Ka~ c,.
At this point \Ve encounter one of the most famous un~olved problemf; of
mathematics. Is there a cardinal number greater than Mn and leHS than
c? No one knows the answer to this question. Cantor himself thonght
that there is no such number 1 or in other \vords_, that c i8 the next infinite
cardinal number greater than N0 , and his guc~s has come to be kno\rn as
Cantor~s continuum hypothesis. 'fhe continuum hypothesis can also be
expressed by the assertion that every unrountable set of real numbers
has c as its cardinal number. 1
There is another question which arises naturally at this stage~ and
this one \Ve are fortunately able to an;swer. Are there any infinite
cardinal numbers greater than c? Yesi there are; for example, the
cardinal number of the class of all subsets of R. This ans,ver d.Ppends on
the following fact~ if Xis any non-empty set, then the cardinal number of
X is less than the cardinal number of the elass of all subsets of X~
We prove this statement as foHows. In accordance with the definition given in the last paragraph of the previous sectionJ we must show
1
For further information about the continuum hypothesis, see Wilder [42, p. 125]
and GOdel [I 2].
Topology
21(
=== C~
The simplest proof of this faet depends on the ideas dcvelope,d in the
foIJo,ving paragt'aph.
Consider t.he clo.se<l-opcn unit interval [0,1) .and a real number x in
this set. Our concp1n j~ 1vith the meaning of the decimal, binary1 and
ternary expansions of x. For the sake of clarity, let us take x to he }'4:.
llow do \Ve arrive at the derimal expansion of 7;1? First~ we split [O, 1)
in to the I 0 closed-open in terva.1s
\\re
41
NextJ we split the interval [3-{ 0 ,7} 0 ) into the 10 closed-open intervals
. ~ . , ~ 2 ;t1 oo~ r1 o) J
and '\\Te use the 10 digits to number these in order. Our nun1ber ~
belongs to (2j} 00 , 2 ~ 0 o), "'-hich is labeled VrTith the digit 5~ so 5 is the
second number after the decimal point in the decimal expansion of ~:
~~
= r25
~ ~
7.i =
~25000
..
2lh =
....
c.
42
TopoJogy
<
2~
"~hich
Problems
1.
Sho"r gcometrieaHy that the r.;et of all pointf-1 in the coordinate plane
Jlt is numcri(a1Jy equivalent to the suhset. X of R 2 illustrated in
:Fig. I;) and defined by X = { (xj y) : 0 < x < I and 0 .$ y < 1 }, and
1hat. therefore R~ has cardinal number c. [Hint: rest an open
he1niF-phrrical surf arc ( = a hemispherica.1 surface minus its bounda.ry)
1angentja1Jy on tl1c renter of
project from various points on the
Jine th rough its een ter and perpendirula.r to R-1. t and use the Sc~hroeder
Rernstcin theorem.J
Hholv that the subset X of R 3 defined by
x7
2..
X
has l~ ardinal
{ ( x 1,
TIU.pl ber
x 2, x s) : 0
:5
.t..:
<
1 for
:=
1, 2, 3 l
c.
For thr. riP-arler who -..vi.shes to learn something about the o.rit.hmetjc of infinite
('ard inHJ numbers~ we recommend Helrnos [16, sec~ 241, Kamke [24~ ch.ap. 2Jt Sierpinski 137, chaps~ 7-10], ot Fraenkel l9J ehap. 2]_
1
3.
.43
form
~lith
integral coefficients and an #- 0. Such an equation has precisely n complex roots (some of ~Thieh, of course 1 may be real). An
algebraic number is a complex number which i8 n. root of such an
equation. rrhe set of all algebraic numbers {~OntainH the set of all
rational numbers (e.g., % is the root of 3x - 2 ::: 0) and many
other n um be rs besides (the square root of 2 is a root of x 2 - 2 = O~
and 1 + i is a root of x 2 - 2x
2 = 0). Complex numbers which
are not algebraic arc called tra.nscendentaL The numbers e and rare
the best kno,vn transcendental numbers, though the fa.et that they
are transcendental is quite difficult to prove (t)ee Niven [33,, chap. 9]).
Prove that real transcendental numbers cxi~t (hint: see Problem 6-.5).
Prove also that the set of all real transcendent.al numbers is uncountably infinite.
Prove that every infinite set is numerirally equivalent to a proper
subset of itself (h.int: see Problem 6-6).
Prove that the set of all real functions defined on the closed unit
interval has cardinal number 2.:. [H inl: there a.re at least as many
such functions as there are chara-e.t-eristic functions (i.e~, functions
whose values are 0 or 1) defined on the closed unit intervaL]
4..
5.
Topology
ordered set.
and let A
<
<
y have
Example 4. Let P be the set of all real functions defined on a noncm pty set X, and let f < g mean that /(x) < g(x) for every x.
'Two elements x and y in a partially ordered set are called comparable
ii one of them is less than or equal to the other, that is, if either x < y or
y < x. The word '~partially'' in the phrase '~partially ordered sctJ) is
intended to emphasize that t.he1e may be pairs of elements in the set
which are not comparable. In Example lt for instance1 the integers
4 and 6 are not comparable, because neither divides the other; and in
Example 3, if the universal set Uhas more than one element, it is always
possible to find two subsets of []neither of which is a subset of the other.
Some partial order relations possess a fourth property in addition to
the three required by the definition:
(4) any two elements are comparable.
A partial order relation with property (4) is called a total (or linear)
arder relation~ and a partially ordered set whose relation satisfies condition
(4) is called a totally ordered .set1 or a linearly ordered set~ or1 most frequently, a chain. Example 2 is a chain, as is the subset {2, 4, 8, . . . ,
2n~ . . . } of Example 1.
Let P be a partially ordered set. An element x in P is said to be
maximal if y > x ~ y = x 1 that is, if no element other than x itself is
greater than or equal to x~ A maximal element in P is thus an elementt of
P which is not less than or equal to any other element of P. Examples
11 2, and 4 have no maximal elements. Example 3 has a single maximal
element: the set U itself~
Let A be a non--empty subset of a partially ordered set P. An
element x in P is called a lower bound of A if x < a for earh a e A; and a
lower bound of A is called a greatest lower bound of A if it is greater than or
45
Topology
t
1
0
Fig. 16.
v g,
it is usually po stula tcd in the logic with \vhich 1\,.e operate. We therefore
assume Zorn's lemma as an axiom of logic. Any reader wh-c;> is interested in these matters is urged to explore them further in the literature. 1
A lattice is a partially ordered set Lin which each pair of elements has
a greatest lower bound and a least upper bound~ If x and y are two
elements in I"', we denote their greatest lower bound and least upper
bound by x A y and xv y. These notations are analogous to (and are
_ intended to suggest) the notations for the intersection and union of t\\rO
sets~
We pursue this analogy even further, and call x A y and xv y the
meet and ioin of x and Y~ It is tempting to assume that all properties of
intersections and unions in the algebra of sets carry over to lattices, but
th is is not a valid assumption~ So me pro per ties do carry over (see
Problem 5)t but othersJ for instance the distributive laws, are fa]se in
some lat tires~
It is easy to sec that all four of our examples are lattices+ In
Example lt m An is the greatest common divisor of m and n, and 1n v n h:\
their least common multiple; and in Example 3, A AB = An B and
Av B ;:::;; A U B. In Example 2~ if x and y arc any two real num bers 1
then x A y is min {x~y} and xv y is inax {x~y J. In Example 4J f A !J is
for ex&mpleJ WiJder [42J pp. 129-1321~ Ho.lmos [l 6 1 secs~ 15---16]~ Birkhoff
[4t p. 42 J, Sierpinski [3 7t chap. 6 J, or Fraenk el and Bar-Hillel I l O, p. 44 J.
1 See"
Sels
and Functions
J,7
1.
2.
3.
4.
5..
7..
A Z)
(x
y)
I\ ZJ
(y V Z) = ( X V y) V Z, { X A y) V X = X, ( X V y) I\ X = X ~
Let A be a c]ass of subsets of some non-empty universal set U. We
say that A has the finite inter section property ii every finite subclass of
A has non-empty intersection+ Use Zorn's lemma to prove that
if A has the finite intersection property, then it is contained in some
maximal class B with this property (to say that Bis a maximal class
'vith this property is to say that any class \Yhich proper]y contains B
fai1s to have this property). (Hint.~ consider the family of all
classes which contain A and have the finite intersection property:P
order this family by c1ass inclusion~ and show that any chain in the
family has an upper bound in the family.)
Prove that if X and Y are any two non-empty sets~ then there exists
a one-to-one mapping of one into the other. (Hint: choose an
X V
6.
(y
48
8.
9.
10.
Topology
CHAPTER TWO
);fctrie Spaces
50
Topology
f Xn J = {X 1,
X 2, . . , X ~ 1
n 2:: no ==>
lxn -- xf <
lim
Xn
'~sufficiently
and expressed by Raying that Xn approaches x or Xn corwerges to x. Second,. a real function f defined on a non-empty subset X of the real line is
said to be c<mtinuous at XD in X if for each it= > 0 there exists o > 0 such
that
x in X and jx - xol
<
.5 ~ l/(x) - f(xo)t
< t,
Metric Spaces
51
= {~
if x
if x
Y~
y.
The reader can easily see for himself that this definition yields a metric
on X.
52
Topotogy
Consider the real line R and the real function lxl defined
on R~ Three elementary propert.ies of this absolute value function are
important for our purposes;
Example 2~
lxl
(i)
(ii)
(iii)
~ 0~ and
jxl
~ 0
x = O;
1-xj = lxl;
Ix+ Yi < !xi + IYI
This is called the 'l:tsual metric on ll, and the real line, as a metric space, is
always understood to have this as its metric. The fact that d actually
is a metric f ollO\.\'"B from the th rce properties stated a hover 1'his is a
piece of reasoning v..~hich occur~ frequently in our work, so 1\,.e give the
details. By (i)_, d(x,y) = fX - YI is a non-negative real number which
equals 0 ~ x - y ~ 0 {::::> x ~ Y+ l3y (ii),
d(x,y)
And by
d(x,y)
YI
==
~x -
=:::
j(x - z)
~ (y - x) l =
xi ==
d(y,x),,
< Ix - zl
+ ~i
tY
(iii)~
= Ix - Yi
+ (z
- y)~
yj
= d(x,z)
d(z,y).
denoted by R(z) and J(2)4 Two complex numbers are said to be equal
if their real and imaginary parts are equal:
a + ib :.:::: c
and
Metric Spaces
53
+ bd) + i(bc
- ad)
+ d~
ac + bd
. be - ad
= ?-+ d2 + i cz + d2;'
non-.zero. If z ;;;::: a + ib is a. complex
c2
where c~ + dt is required to be
number 1 then its negative -z and its conjugate i are defined by
-z
and i
- ! ::::::::
(~a)+
i(-b)
R(2)
-z+z
--2
and
I (z) :;
2i .
R = {z:I(z) = Oi = {z:i
z}.
and
= z.
lzl
lz'
ordinary
+ b2)H.
!z'
1~he
lzl 2 =
and
to see that
zi.
lz1 - z2 I
]zl ~ OJ and
!-zl
[z1
+ z2!
lzl
= 0
=>
z :;; 0;
!zl;
< lz1[ + ii2l~
=
Since -z
::::=
(--
5.4
which
Topo,ogy
\VP
prove by means of
{!z1f
tz~l) 2
If we use the fact that 'R{z) f S fzf for any z, property (iii) follows directly
fron1
lz1 zd 2 ~ (z 1 z2) (zl z~) =::; (21 z2) (z:1 + Z';)
+
= Z1Z1 + z22; + :l1z; + Z1Z2
= lz1 I + lz2:l + (z1'Z; + z1'Z;}
= !z1l ~ + lz2f + 2R(z1"Z:)
::; lz1) + lz2l + 2lz1z~!
= lz1 I~ + }z2\ ~ + 2lz1\ 122\
= lz1l + lz~.d + 2lz1l \z:d
== (!z1( + fZ2\)
2
of this type.
Metric Spaces
55
Example 4.. Let f be a real function defined on the closed unit interval
[O,I]. We say that f is a boun<led function if there is a real number K
such that 1/(x}I < K for every x E {0 1 1]. This concept is familiar to
the reader from elementary analysis, as is that of the continuity off as
defined in the introduction to this chapter. The underlying set of points
in this example is the set of all bounded continuous real functions defined
on the closed unit intervat Actually, the boundedness of such a function
is a consequence of its other properties, but at this stage we assume it
explicitly. If f and g are two such functions~ we add and subtract
them, and form nega ti ves:P po in twise:
(f + g)(x)
(f - g)(x)
~
~
(~f)(x)
f(z)
+ g(x);
f(x) - g(x);
-f(x)~
=0
O(x)
llfll = / 0
lf(x) I dx,
II! - vii
= /
The integral involved in this definition is the Riemann integral of elementary calculus~ Properties (i) and (ii) of the norm are easy to prove,
and {iii) follows from
J lf(x) + g(x)I
= J lf(x)I dx + /
Ill+ 1111 =
dx
1
0
<
J
0
(lf(x) I
+ lu(x)l) d:t
l11(x)I dx
= llfil + Hull.
Example 5..
llfrl
= sup {IJ(x)I: x
e [O, 1] L
and
d(f,g)
- g(x)I.
56
Topology
Properties (i) and (ii) of the norm are obvious, and in Problem .5 we ask
the reader to prove (iii) in a slightly more general form. This example
is typical of a large class of metric spaces which will play a major role
in all our work throughout the rest of this book. V,.1"e denote this space
by e[0 1 l]+
adjoined~
symbols.
+CJ)
and
00
-co<
+oo;
-oo<x<+oo.
1,he syn1bols - ~ and + oo add nothing to our understanding of the
real numbcrsA 'They are used mainly as a notational convenience, as "\Ye
see below.
I~t A be a non-empty set of real numbers which has an upper
bound. In SecA 8 we defined what is meant by the least upper bound
(or supremum} of A; sup A is the smallest upper bound of 11 t that is,
it is the smallest rPa1 Il umber y such that a ~ y for every a in Ar With
the stated assumptions about A, sup A always exists and is a real number.
If A is a non-empt.y sPt of real numbers which has no upper bound, and
therefore no least upper bound in R, we express this by writing
sup A
+ oo;
::::i.
oo ~
Metric Spaces
inf A=
inf A
+ oo.
00
'
{a,b]
= {x :a < x
b} .
This extends our previous notion in that a closed interval may now
consist of a single point (if a = b). If b is a real number and a is au
extended real number such that a < b1 then the open-closed interoal
from a to bis
(a, b] = {x : a
<x<
b}
< z < b I.
This adds to the previously defined open intervals those of the form
( - oc:i ,b) where b is real, (a,+ oo) where a is real, and (- a:i 1
oo).
Throughout the rest of this book, the term interval will always signify
one of the four types defined in this para graph. The extended real
numbers a and b are called the end-points of these intervals. We have
used the symbols - oo and + oo v. ith considerable freedom~ and it therefore seems desirable to emphasize that an interval in our present sense
is always a non-empty subset of the real number system~ it never actually
58
Topofogy
that is, it is the greatest lower bound of the distances from x to the
points of A. The diameter of the set A is defined by
d(A)
The diameter of A is thus the least upper bound of the distances bct\veen
pairs of its points. .A is said to have finite diameter or infinite diameter
according as d(A) is a real number or ro .. We observe that the
empty set has infinite diameter, since d(0) = - ". A bounded set is
one whose diameter is finite. A mapping of a non-empty set into a
metric space is called a bounded mapping if its range is a bounded set.
Several of the Himpler facts about these concepts are brought out in the
follo"\\iing problems.
Problems
1.
2.
3.
x~y
{:::;} d(x,y)
= 0,
way.
A.
Mefric Spaces
59
5.
l1JH
It
1
6.
sup lf(x)l.
non~ncgative real number such
11-/!l=l~/ll-
that
Prove in detail that
7.
8.
= {x ~d(x,xo) < r 1~
60
Topology
eo the open spheres on the real line are precisely the bounded open
intervals.. The open sphere Sr(zo} in the complex plane (see Fig. 17)
is the inside of the circle with center zo and radius r. Figure 18 illustrates an open sphere in the space e[O,IJ: Sf'(fo) consists of all functions f
in efO,l] whose graphs lie within the shaded band of vertical width 2r
centered on the graph of frrA subset G of the metric space Xis called an open set if1 given any
point x in G, there exists a positive real number r such that Sr(x) ~ G,
T
0
Fig+ 17+
plane.
FigT
18~
An open sphere in
[O~IJ.
that is, if each point of G is the center of some open sphere contained in
G~
Loosely speaking, a set is open if each of its points is '~inside~' the
set, in the sense made precise by the defini t.ion. On the real line, a set
consisting of a single point is not open, for each bounded open interval
centered on the point contains points not in the set. Similarly, the
subset (01 1) of the real line is not open,. because the point 0 in [O, 1) has
the property ihat each bounded open interval centered on it (no matter
how small it may be) contains points not in {Ott), e.g., negative points~
If we omit the offending point O~ the resulting bounded open interval
(0, I) is an open set (this is very easy to prove and is a special case of
Theorem B below). :Further, it is quite clear that any open intervalbounded or not-is an open 8et, and also tha.t the open intervals are
the only intervals which are open sets.
Theorem A. In any metric space X, the empty Bet 9 and the full space X
are open sets.
PROOF'.
To show that 0 is open, we must show that each point in 0 is
the center of an open sphere contained in 0; but since there are no points
in 0, this requirement is automatically satisfied. Xis clearly open.1 since
every open sphere centered on each of its points is contained in X.
Metric Spaces
61
We have seen that i0,1) is not open as a subset of the real line.
However~ if we consider [0,1) as a. metric space X in its own rightt as a
subspace of the real line, then [O,l) is open as a subset of X, since from
this point of view it is the full space. This apparent paradox disappears
when we realize that points out.side of a given metric space have no
relevance to any iliscussion taking place within the context of that space~
A set is open or not open only with respect to a specific metric space containing it, never on its own.
Our next theorem justifies the adjective in the expression "open
sphere.J.,
The~rem
t.Lnion of open
spheres~
A subaet G of X is open
tj
u is a
62
Topology
The intersection of these open sets is the set {O} consisting of the single
point 0 1 and this set is not open.
In an arbitrary metric space, the structure of the open sets can be
very complicated indeed. Theorem C contains the best information
available in the general case~ each open set is a union of open spheres.
In the case of the real line 1 ho\\'~evcr, a description can be given of the
open sets which is fairly explicit and rea8onably satisfying to the intuition~
Metric Spaces
63
then Is = I'JJ. We next observe that if x and y are any two distinct
points of G, then I and I JJ are either disjoint or identical; for ii they
have a common point z, then l:.t -= I. and Iv = I~, so I~ = I '1J Consider
the class I of all distinct sets of the form I~ for points x in G~ This is a
disjoint class of open intervals, and G is obviously its union+ It remains
to be proved that I is countable. Let Gr be the set of rational points in
G. G,, is clearly non-empty. We define a mapping f of Gr onto l as
follows: for each r in G~, let f(r) be that unique interval in I which con . .
tains r. G,. is countable by Problem 6-7, and the fact that I is countable
follows from Problem 6-8.
'Z;
e A and
S~(x)
CA for some
rl.
64
Topology
The proofs of these facts are quite easy~ and we ask the reader to fill in
the details as an exercise (see Problem 8) ~
Problems
1.
2.
3.
4.
5.
6.
7.
8..
9..
10..
T.~t
Metric Spaces
65
The empty set has no limit points, so it contains them all and is
therefore closed. Since the full space X contains all points, it automatically contains its ovrn limit points and thus is closed.
PROOF.
followi~g
Theorem B.
66
TopoJogy
happen, for since F} is open each of its points is the center of an open
sphere disjoint from F 1 and no such point can be a limit point of F.
1
Sr[xol contains its center, and when r = 0 it contains only its center.
The closed spheres on the real line are precisely the closed intervals. In
this connection, we observe that though open spheres on the real line are
open interval.st there are open intervals which are not open spheres, e.g.,
(~ooT+oo)~
>
d(x,xo) - ri
= d(x,xo)
- [d(x,xo) - rl
= r,
so tba t y is in S,.[ x 0] 1
The main general facts about closed sets are those given
~n
our next
theorem.
Theorem D. Let X be a m..etric space. Then (1) any interse.ction of closed
sets in X is closed,~ and (2) any finite union of closed sets in X is closed.
PROOF. By virtue of Eqs~ 2-(2) and Theorem B above, this theorem is
an immediate consequence of Theorem 10-D. We prove (1) as follows.
If f Pi) is an arbitrary class of closed subsets of X and F = ('.iF'i, then
by Theorem BJ Fis closed if F1 is open; but F' = U;.F.:' is open by Theorem
10-D, since by Theorem B each F/ is open.. The second statement is
proved similarly.
In Theorem E of the previous section, we gave an explicit characterization of the open sets on the real line.. We now consider the structure of its closed sets. Among the simplest closed sets on the real line
are the closed intervals (which are the closed spheres) and finite unions
Metric Spaces
67
of closed intervals. Finite sets are included among these, since a set
consisting of a single point is a c1osed interval with equal end-points.
'"7bat is the character of the most general closed set on the real line?
Since closed sets are the complements of open sets, Theorem lO~E gives
a complete answer to this question: the most general proper closed subset
of the real line is obtained by removing a countable disjoint class of open
..,.l
intervals. This process sounds innocent enough, but in fact it leads to
some rather curious and complicated o
______l/3
_____ . . . _~-----.1
examplesr One of these examples
is of particular importanceL It was
stuclied by Cantor aTid is usually --- ------ ........
called the Cant.or set.
To construct the Cantor set1 we
proceed as follows (see FigL 20)~ ...... ___ .,._..,. ______ .,._ .._.. _.,. .....
.....___________
F2
Ncxt 1 delete from F 2 the open intervals (%~%) and (%,%), which are
the middle thirds of its two pieces1 and denote the remaining closed set
by Fi. It is easy to see that
o,
l~ ~~
%1
7~,
%, %1
~~J
68
Topotogy
73
+ 79 + ;i7 +
= J
Metric Spec.es
the boundary of
the boundary of
(3) A is closed ~-it
We ask the reader to give
( 1)
(2)
69
A equals A (\ A
A is a closed set;
contains its boundary.
the proofs in Problem 11 ~
1
Problems
1.
2.
3.
A.
5..
6..
7.
;:::=
8.
9.
70
10.
Topology
13.
~ G is disjoint from
Prove that
A.
{x~ }
= {X 1, X2J
..
,.
Xn~ ~
..
x,
Metric Spaces
71
is impossible. The point x is cnl1ed the limit of the sequence {xn L and
we sometimes write Xn ~ x in the form
lim
Xn
X.
and
..
mean P.xactly the same thing, namely, that I Xn \ is a convergent sequence
with limit x.
Every convergent sequence {xn} has the following property: for each
E > 0, there exists a positive integer n[) such that m 1 n > no :::::::> d(xm,Xn) <
f.
For if x~ ~ x:t then there exists a positive integer no such ths.t n ;::
no;;;;;;;;;), d(x1t 1 x) < f./2, and from this l\~e see that
The statements
m,n
~ no~ d(xm~X11)
f ..
72
Topology
and
By the completeness of the real linej there exist real numbers a and b such
that a,,, ---i- a and bn---+ b. If we now put z = a + ibt then we see that
z~ ~ z by means of
Metric Spaces
73
Theorem B.
74
Topology
Metric Spaces
75
He will find~ ho\vever, that a need for it crops up from time to time, and
when this ne.ed ariscst Ba.ire's theorem is an indispensable tooL 1
Problems
1..
2.
3.
76
Topology
Theorem B..
This result shows that continuous mappings of one metric space into
another are precisely those which send convergent sequ~nces into convergent sequences, or, in other words, which preserve convergence. Our
next theorem characterizes continuous mappings in terms of open sets+
Theorem C. Let X and }T be metric spaces and fa mapping of X into. Y .
Then f is continuaus <:::;> 1-~(G) is open in X whenever G is open in Y.
PROOF.
Metric Spaces
77
78
Topology
useful.
Theorem D. Let X be a metric space, let Y be a complzete -metric space, and
let A be a dense subspace of X. If f is a uni/ormly continuous mapping of
A into Y, thenf can be exlnde.d unit[Uely to a unifOTmly continuous mapping
g of X into Y.
PnooF.. Let di and d2 be the metrics on X and Y. If A = X~ the
conclusion is obvious. We therefore assume that A X. We begin
by showing how to define the mapping o~ If xis a point in A, we define
g(x) to be f(x). Now let x be a point in X - A4 Since A is dense, x
is the limit of a convergent sequence {Un l in A. Since ~an} is a Cauchy
sequence and/ is uniformly continuous, f j(~) J is a Cauchy sequence in Y
(see Problem 8)~ Since } is complete1 there exists a point in Y-we call
this point g(x)-such that /(an)~ g(x). We must make sure that g(x)
depends only on x 1 and not on the sequence {~}. Let {bn ~ be another
sequence in A such that bn ---i- x. Then d 1 (.un,b_.) ~ O~ and by the uniiorm
continuity of f, dt(f(~) 1 f(bn)) .....-? 0. It readily follows from this that
7
f(b~)--+ g(x).
a:-+
and from this and the previous statement we see that d1 (g(x),g(x
< E.
All that remains is to show that g is unique, and this is easily proved
by means of Problem 3 below.
1
))
Metri~
Spaces
79
2.
3.
.4.
5.
6.
7t
80
8.
9.
Topology
Metric Spaces
81
;:::=
;:::=
(2) llx
Y[! < "xll
!lYll;
(3) Haxl1 = la~ llxl].
In. general terms~ a normed linear space is simply a linear space in which
th ere is av aila blc a satisfaet ory notion of the distance from an arbitrary
element to the origin. From (3) and the fact that -x ~ ( - I)x~ we
obtain fl- xii = Hx[I As we saw in SecL gJ a normed linear space is a
metric space with respect to the induced metric defined by
r
d(x 1 y)
=z
llx -
y[~a
82
Topology
Lemma. If f and g are continuo-s real functitmS defined &n a -metric space
X, then f
g and af are also continuous, where a is any real number~
d(x1xo)
<
S~
+ g(xo)11
We leave it
to the reader
+ g(x)]
- [f(xo)
- g(xo)l I ::; IJ(x} - J(xo) r
lg(x) - g(xo)I < E/2
E/2 =
to show similarly
E.
that af is continuous.
Metric Spaces
83
d(x,xo)
<
E.
84
Topofogy
fl!H
=:;=
sup lf(x)f
1.
2..
3.
4.
5.
Metric Spaces
85
Let n be a fixed positive integer~ and consider the set. Rn of all ordered
n-tuples x = (x 1~ x2, . . ~ , xn) of real n um be rs. 2 \1/ e promised in Sec . 4
to make thiH set into a space, a.nd "\Ve are no\v in a position to do so.
The construction outlined in {a) to (c) clearly depends on the initial choice of the
fD.;.ed point Xih If another fixed point l'\.1 1~ chosen, then another iSometry F of X into
e{ XtR) is determined. It would see1n 1 therefon~, that there is 1itt.lc justifieation for
calling the particu1ar X* defined in this problem th completion of X. In practice.
however, we usually pursue the rea~onable course of regarding isometric spaces ss
essentia.lly identico.l. lf'rom this point of viewJ the X* defined here is a. complete
metric apace which con ta.ins X as a dense su bspac~; and since by ( h) it is the only
complete metric space with thie property J it is natural to call it the completion or X.
: From this point on 11 we omit the adjective aordered." It is to he understood
that an n-tuple is alway8 ordered.
1
86
Toporogy
+Y =
(x1
+ Yi, x~ + Yt
Xn
+ y.,.)
and
-x = (-xl, -xi, . . .
-xn).
y
l
'
y axis
I I
I I
' ':::::jI I
_____
"
'
Fig. 21.
Metric Spaces
87
Vx2 + y2 + z2.
If x = (x1, X2J
.P Xn) is an arbitrary element of R'fL, then it is natural
to define J!xlj~the distance from the point x to the origint or the length of
the vector x~by
a
llx~I
= V1x1! 2 +
n
lxil 2 )
lx~J 2
+ ~ + lxnl
~i.
i l
ft
11/H == ( 't
-1
J/(i) I2) ~I
This is called the Euclidean norm on Rn t and the real linear space R"'
normed in this ~~ay is called n-dimensional Euc.lidean space~ The
88
Topology
Euclidean.plane is the real linear space R~ with its Euclidean norm; that is,
it is the coordinate plane equipped with the above algebraic operations
and the above norm. :For refl.8ons \vhirh will appear a littlP. later, we
o bsPrve that our formula die.fining l~x Hcan be applied cqual1y well to
n-tnp1Ps of complex numbers.
'Ve have not yet proved~ of course, that the above expression for
fixlJ po~~esse~ the three properties required by the definition of a norm .
The first and third of these conditions are clearly satisfied+ The second,
namely, that
is another matter. v.:r c prove this by the following two lemmas, of whlcb
the fir.st is essentially a tool used in the proof of the second.
Lemmo (Cauchy's lnequa,ity). Let x = (x:1, X2,
Xn) and y
Y2t ~
~ y,.) be tu,,o n-tuples of real nr complex numbers.
Then
r
(Y~
---~~Y;l 1. S
l!x!: l!Y:i
I: lxiy~!~ < 1 + 1
i=I
[!xll llYll -
I
'
.,
+ ljyJI+
Metric Spaces
PROOF4
89
relations~
ft
l]x+ Yll
I
<I
=
~xj
]x1
+ Yir([xil + 1YiD
i=l
n
i-1
i-1
1=1
:5
=
4
or summar1z1ng,
If
11x +
the
y" : : :; 0,
lxi
+ Yil !Yir
llx + Yil .
follo~~s
from
Theorem A.
;:==
ft
90
Topoiogy
The space C"', with these algebraic operations and this norm, is
called n-dimensional unitary space. Needless to say, it can equally well
be viewed as the set of all complex functions f defined on the set i 1, 2,
. . . , n}, with addition and scalar multiplication (by complex scalars)
defined pointwise and the norm defined by Hfrl = (:'.2:?_ 1 ff(i)~ 2 )~ 2
The four spaces defined and discussed in this and the previous section-e(X1R) and e(X,C), and RA and Cn_form the foundation for all
our future work. In Chaps. 3 to 7 we generalize the first two by loosening
the restrictions on the underlying space X. In Chaps . 9 to 11 we study
all four from a wider point of lriew, with special emphasis on ()n.. And
in the Jast three chapters we pull these lines of deve1opment together
in such a way that ea.ch aspect of our work sheds light on all the others.
Problems
1.
2..
3..
.
( fxn +
~
Ynr 1 )~ <
n-1
4.
ft=]
il-1
CHAPTER THREE
7:opologieal Spnees
In the previous chapter we defined the concept of a continuous
mapping of one metric space ir1to another, and this definition was formulated in term8 of the metrics on the spaces involved. It often happens~ hol\Tcver_, that it is convPnicnt~-even essential~to be able to
speak of continuous mappings in situations ''There no useful metrics are
defined, readily definablcT or capable of being defined~ In order to deal
effectively \vi th circumstances of this kind, it. is necessary for us to 1iberate
our concept of continuity from its dependence on metric spaces.
Theorem 13-C shows that the continuity of a mapping of one metric
space into another can be expressed solely in terms of open sets, 'vithout
.any direct reference t-0 metrics. frhis suggests the possibility of dis,...
carding mctries altogP.ther and of rep]acing them as the source of our
theory by open sets. With this in mind~ our attention is drawn to
Theorem 10-D, which gives the main internal properties of the class of
open r-;ets in a metric space. These t"\vO theorems provide the leading
hint on which we base out generalization of metric spaces to topological
spaces-a topological space being simply a non-empty set in which there
is given a class of subsets, called open sets, "\\Tith the properties expressed
in l'heorem I 0-D .
. .:;+ Our undcr1ying purpose in this and the next four chapters is to study
'I
92
Topology
Topo,ogical Spoces
93
we say something to the contrary) that its topology is the usual topology
described here.
Ex.amp1e 2. Let X be any non-empty set,. and let the topology be the
class of all subsets of X. This is called the di~crete topolr)(JY on X~ and
any topological space v..Those topology is the discrete topology is cal1ed a
discrete space.
Example 3. Let X be any non-empty set, and let the topology consist
only of the empty set 0 and the full space X. This topology is at the
oppo~ite extreme from that described in Example 2, but they coincide
when Xis a set with only one element .
Example 4. Let X be any infinite set, and let the topotogy consist of
the empty set 0 together with all subsets of X whose complements are
finiter
~
'
/ . Example 5.
Let. X be the three-element set {a, b,. c}, and let the topology
consist of the foUo,ving sub8ets of X: 0, {a}, {a,b i, {a,c l ~ X. Spaces of
this type serve mainly to illustrate certain aspects of the theory ,vhich
will emerge in later chap tcrs~ :
9-'
Topology
Problems
1.. Let Ti and T, be two topologies on a non-empty set X, and show
that Ti fl. T2 is also a topology on X.
2.. Let X be a non-empty set, and consider the class of subsets of X
-consisting of the empty set 0 and all sets whose complements are
3..
certain properties.)
Topological Spaces
4,
5..
6..
7~
8.
9.
10.
11.
12..
13..
17~
95
ELEMENTARY CONCEPTS
By considering the empty class of closed setsi we see at once that the
96
Topo~ogy
e.mpty set and the full space-its union and intersection-are always
closed sets in every topological spaceT
If A is a subset of a topological space, then its closure (denoted by A)
is the intersection of all closed supersets of A.. It is easy to see that the
closure of A is a closed superset of A Y\'Thich is contained in every closed
superset of A, and that A is closed ~ A = A.. A subset A of a topological space Xis said to be dense (or everywhere dense) if A = X, and Xis
called a separable space if it has a countable dense subset~ For reasons
which will become clear at the end of this section, we summarize the main
facts about the operation of fonning closures in the follomng theorem.
Its proof is a direct application of the above .statements~
Theorem B.
=-
Topological Spaces
'11
2-(2)~
the definition of a
As the following theorem shows, we could even have taken the term
"closu.ren as our undefined concept.
98
Topology
:r::
1. Let f: X
2.
---i-
Topological Spaces
3.
4.
5.
6..
7.
8.
9.
99
100
Topology
1A
iB
ft:r 81 countable space-or a spa.ce w hie h Ba tisfiel9 the first axiom of cou ntahility---a topologies.I sp&ce which ha.s & countab]e open base at each of its points (seu
See4 17).
Topologicol Spaces
101
as
< x.
i (x l ~x2) :
< bi for i =
1~2 }
and
R X
(a2~b2)
open. strips in R~. If we use closed intervals here, we get what we call
closed strips. It is plain that
Since every open strip in R 2 is clearly an open set_, the class of all open
strips is a class of open sets l\~hose finite intersections form an open base~
namely, the open base composed of the open strips~ the open rectangles,
the empty set, and the full space R 2
Now let X be a topological spacer An open 8ubbase is a class of open
subsets of X whose finite intersections form an open base. This open
base is called the open base generated by the open subbase~ We refer to
the sets in an open subbase as subbasic open setsr It is easy to see that
any class of open sets which contains an open subbase is also an open
subhase.. Since the bounded open intervals on the real line constitute
an open base for this BpaceJ it is clear that aU open intervals Of the type
102
Topology
(a,+~)
Topologi~al
Spaces
103
of each bask open set is open ~ the inverse image of eac.h BUbbasic open set is
open; and (2) f is open~ the image of each basic open set is open~
PROOF..
These statements arc immediate consequences of the definitions and~ respectively~ Eqs. 3-(2) and 3-(3) and Eq . 3-(1) .
Problems
1..
2.
3.
4..
5,
6.
7.
8.
104
9"
Topology
\\,.. e next show that this partially ordered set is a complete lattice.
In Problem 16-1 v..,.e asked the reader to prove that the intersect.ion of any
t\vo topologies T1 and T:2 on X is a topology on X. Since this topology is
evidently l\Teakcr than both T1 and r~ and stronger than any topology
\vhich is weaker than both~ it is the greatest lower bound of T1 and T~ It
is cq u ally easy to see that the in terser tion of any non-empty family of
topologies on X is a topology on X; and since it is "?t,.eaker than all ,;l;ht-~sc
and stronger than any topology which is weaker than all these, it is the
greatest lol\Ter bound of this family~ What nbout least upper bounds?
The si tua ti on here is a hit differen tt for the union of two topologies on
X need not be a topology. llowever, if \Ve have any non-empty family
of topologies T.:, then the discrete topology is a topology strongrr than
each Ti. \Ve can therefore appeal to our above remarks to conclude that
the intersection of nll topologies "?t'hich are 8tronger than each T..: is a
topology; and since it is stronger than each T:t and vireaker than any
topology \vhich is stronger than each Tf, it is t.he least upper bound of
our given family+
We summarize the results of this discussion in the following theorP.n1.
Theorem
on X
i.~
Topological Spaces
105
Jl urtherrn.ore, this lattice has a least nu~ m.ber (the weakest topology on X) and
a greatest member (the discret.e topology on X) .
The reader vlill observe that if {T..:} is a non-empty family of topologies
on our set X, then the least upper bound of this family is precisely the
topology generated by the class Vil, in the sense of Theorem 18-D; that
is, the class ViTi is an open subbase for the least upper bound of the
family {T, l . l n the present contex.tJ therefore, Theorem 18-D can be
thought of as providing a mechanism for the direct construction of least
upper bounds in our lattice of topologies.
Let X be a non-empty set~ let {X 1} be a non-empty class of topological spaces~ and for each i let fi be a mapping of X into X... It is clear
that if X is given its discrete topology,, then all the f/s are continuous~
If 've look a little further, we may find other and weaker topologies on
X which also have this property. There is, in fact, a unique weakest
1.o p ology of this kind. The weak topology gene rated by the f / s is defined to
be the intersection of all topologies on X 'vith respect to each of vlhich all
the f/s are continuous mappings. This js clearly a topology on X v;hich
makes nll the f/s continuous, and it is weaker than any topology which
has this property. It V--'ill appear in later chapters that many a topology
which is used in practice is defined to be the weak topology generated by
some set of mappings of particular interest in a given situation.
Problems
1.
2.
106
Topology
(c)
(d)
(e)
(f)
\Ve generalize the notations established in Sec. 14 by defining e(X,R) and e(X)C) to be the sets
of all hounded continuous functions defined on X ~Thieh aret rcspectivclyJ
real and complex~
It is desirable to extend our discussion of the algebraic structure of
these set.s beyond that given in Sec. 14 by introducing the following
concepts. An algebra is a linear space \vhosc vectors can be multiplied
in such a vlay that
(1) x(yz) ~ (xy)z;
(2) x(y
z) = xy
xz and (x
y)z = xz + yz;
(3) a(xy) = (~x)y = x(a.y} for every scalar a.
\\:'" .e speak of a real algebra or a cnmple:c algebra according as the scalars
arc the real n uni bers or the complex n um hers. A commutative algebra is an
algebra v..-hosc multiplication satisfies the f ollov-.ing condition:
(4) xy = yx.
In the case of a commutative algebra~ the second part of {2) is clearly
redundant.. 1\n algebra with dentity is an algebra which possesses the
follow"ing property;
( 5) there exists a. non-zero clement in the algebra, denoted by I and
r.alled the identity element (or the identity)t such that
1 x == x 1 ~ x for every x.
Vt7 c speak of the identity because the identity in an algebra (if it. has one)
is unique; for if 11 is also a.n clement such that 1' x = x ~ l' = x for
Pvery x, then 1 ~ = 1' 1 = 1.. A subalgebra of an algebra is a linear
subspace "\\-Thi ch contains the product of each pair of its elements+ A
subalgcbra of an algebra is evidently an algebra in its own right.
In the case of a function space which is also an algebra, it is to be
underst-0od that multiplication is defined pointv,ise, that ist that the
product fg of t\vo functions in the spare is defined by (jg) (x) = f(x)g(x).
This point~vise multiplication off unctions should be clearly distinguished
from the mu1tip1icntion (or composition) of mappings discussed at the
end of Sec. 3. If such an algebra has an identity element 1, then Problem
Topological Spaces
107
lemma.
continuous.
V.le prove thatfg is continuous by sho,ving that it is continuous at an
arbitrary point xo in X (sec Problem l 8-9)+ Let E > 0 be given, and
find E1 > 0 such that ie1{lf(xD)l + lg(xo)1) + E1 '2 < ic;. Since f is continuous, and thus continuous at XD~ there exists a neighborhood G1 of
x 0 such that x e G1 ~ l/(x) - f(xo)! < t:i+ SimilarlyJ there exists a
neighborhood G2 of xo such that x e: G2 ;:9 fg(x) - g(x~)I < E1. The
continuity of fg at xo now follows from the fact that G = G 1 ('.. G2 ia a
neighborhood of xo such that
x
+ [/
f.
> al ==
{x :f(x)
> a'
which is open since it is the union of two open sets, and that
<
b}
;::=
f x :f (x)
b} ,
108
Topology
f ~(zo)I < E/3. The continuity off at xo now follo"s from the fact that
x
e. G ~ !f(x) - f(xo) l
= I[f(x) - f (x)] + [/n.(x) 5 !J(x) - f (x) I + If~(x) a1
21 1
fn~(xo)]
f~ 0 (xc.) I
+ l/n~(xo)
+ IJn.a(xo)
<
- f(XG)] I
- f(xo);
.:/3 + oe/3 + t/3
e (X ~R) is a lattice in which the greatest lower bound and lea.st upper bound
of a pair of functions f and g are given by (f" g)(x) :;:::: min lf(x) ,g(x)} and
max {/(x}Jg(x) i.
PROOF. In view of the above lemmasJ everything stated here is clear,
except perhaps the fact that ~]fgH ~ l!fll llgl[; and this follows from
(f v g) (x)
Hfgll
tiff I HY fl.
This theorem is a. direct consequence of the background provided above. We do remark, however~ that the fact thatjis rontinuous
\Vhen f is follows from 1J(x) ~ ](xo) I = if(x) ~ f (xo) I+
PROOF.
Topological Spaces
109
We trust that the reader has noticed our insistence on assuming that
a topological space always has at lea.st one point~ Our reason for this is
that the empty set has no functions defined on it. If "Te were to allow a
topologir.al space X to be empty, then we would ha.ve t.o cope with the
fact that its corresponding e(XJ/l) and e(X 1 C) are also empty, and so
cannot b.c linear spaces, for a linear sp~e must contain at least one vector
(the zero vector).. Since constant functions are aJ\vays continuousJ "Te
avoid this difficulty by ta.king pains to assume that to polo gic al spaces
are non-empty .
Problems
1.
2..
3..
Let A be an algebra of real or complex functions defined on a nonempty set XJ and assume that for each point x in X there is a. function
f in A such that f(x) ~ 0. Sho"T that if A contains an identity
clement 11 than 1 (x) = 1 for all x.
I~t.f be a continuous real or complex function defined on a topological
space X 1 and assume that f is not identically zer:o, Le.~ that the set
Y = {x :f(x) 0 l is non-empty. Prove in detail that the function
1/f defined by (1//)(x) = 1/f(x) is continuous at ear.h point of the
subspace Y.
Let X be a topological space and A a subalgcbra of e(X,Jl) or e(XJC) .
Show that its closure A is also a subalgcbra. If 11 is a subalgcbra
of e(X,C) which contains the conjugate of each of its functionsJ
show that A also contains the conjugate of each of its functions.
CHAPTER FOUR
eompae!HCSS
110
Compactness
21~
111
COMPACT SPACES
theorems.
Theorem A.
B.
PROOF+
112
Topology
We go one more step in this direction and prove a similar (and much
deeper) theorem relating to .subbasic open covers~ The proof is rather
difficult, and 've introduce the following concepts in an effort to make
it as .simple as possible.. They also make some of its applications considerably easier to handle. Let X be a topological space. A class of
closed subsets of X is called a closed base if the class of all complements
of its sets is an open base~ and a closed subbase if the class of a.11 eomplements is an open subbaseL Since the class of all finite intersections of
sets in an open subbasc is an open ba.seJ it follows that the class of all
finite unions of sets in a closed subbase is a closed base. 1"'his is called
the closed base generated by the closed subbase.
Theorem F.
Compactness
113
...
The great power of this theorem can be surmised from the complexity
of its proof~ It is really a tooC and we illustrate the manner in which
114
Topology
it can be used by applying it to give a simple proof of the classical Heine ...
Borel theorem stated in the introdur.tion to this chapter.
Theorem G (the Heine-Borel Theorem).
A closed and bounded subspace of the real line is a closed subspace of some closed interval [a~b]. and by Theorem A it .suffices to show
that {aJb] is compact. If a = bJ this is clear, so we may assume that
a < b. By Sec. 18, we know that the class of all intervals of the form
[a~d) and (c,b], """here c and d are any real numbers such that a < c < b
a.nd a < d < bt is an open sub base for [a,b]; therefore the class of all
fa~c]'s a.nd all [d~brs is a closed sub base. Let S = { [a,ci], {dhb] l be a
class of these subbasic closed sets with the finite intersection property.
It suffices by Theorem F to show that the intersection of all sets in Sis
non-emptyr We may assume that Sis non-empty. If S contains only
intervals of the type [a~c-.:], or only intervals of the type [di 1 b}, then the
intersection clearly contains a or b. We may thus assume that S contains intervals of both types. We now define d by d == sup {di L and
we complete the proof by shov~dng that d < Ct: for every i. Suppose that
c~. < d for some irJ. Then by the definition of d there exists e. d;, such
that Ct:, < di,+ Si nee [a ~ci.ri] f\ (d;Q Jb] = 0, this contradicts the finite
intersection property for S and concludes the proof.
The reader should understand that there are elementary proofs of the
Heine-Borel theorem which do not use Theorem F or anything like it.
Theorem F wil1 render us its major sen.ice in connection with the proof
of the vital Tye honoff theorem of Sec. 23 ~
Problems
1.
2.
3.
4.
n,.x,
Compactness
S.
6.
7.
8~
9t
115
Prove the converse of the Heine-Borel theorem~ every compact subspace of the real line is closed and boundedr
Generalize the preceding problem by proving that a compact subspace of an arbitrary metric space is closed and bounded. (It should
be carefully noted, as Secs. 24 and 25 will show, that a closed and
bounded subspace of an arbitrary metric space is not necessarily
compact.)
Show that a continuous real or complex function defined on a compact space is bounded+ More generally, sho-\v that a continuous
mapping of a compact space into any metric space is bounded4
Sho"; that a continuous real function f defined on a compact space X
attains its infimum and its supremum in the f olloVrTing sense: ii
a = inf ~ f(x) :x EX I and b = sup {/(x): x e X} J then there exist
points x1 and x2 in X such that f (x1) == a and f (x2) = b.
If X is a compact space) and if itti l is a monotone sequence of continuous real functions defined on X which converges pointwise to a
continuous real function f defined on X, sho'v that fn converges
uniformly to/. (The assumption that ifn l is a rnonotone sequence
means that either !1 :::; j,. ::; fa ~ or f 1 > !2 > fa > ~)
116
Topo~ogy
18... D by the class of a.II open strips~ This situation provides the motivation for the more general ideas we now develop .
Let X 1 and X2 be topo1ogical spaces~ and form the product
X = X 1 X X 2 of the two sets Xi and X 2~ Consider the class S of all
subsets of X of the form G1 X X2 and X1 X G~, "'~here G1 and G! are
open subsets of X 1 and X 2~ respectively+ The topology on X generated
by this class in t.he sense of Theorem 18-D is called the product topology
on X~ The product topology therefore has S as an open subhase; in
fact, it is defined by the requirement that S be an open subbase~ The
open base generated by S, that is, the class of all finite interscr.tions of
its sets, is clearly the class of all sets o. the form G1 X G2~ and th.e open
sets in X are all unions of these
sets~ There are two projections
p1 and p~ of X onto its coordinate
spaces X1 and X2 1 and by definition they carry a typical element
(x1)x:i!) of X to xi and X2~ respectively.. We note that Sis prec.isely
the class of au inverse images
in X of all open subsets of X 1
and X2 under these projections:
G1 X X2 = p1- 1(G1) and X1 X G2 =
X1
p-i- 1 (0 2 ). The product topology is
Fig. 23. The product topology on
thus a topology on the product vr.ith
X 1 X X :.~
respect to which both projections
are continuous mappings~ and it jg
evidently the weakest such topology. In terms of the ideas discussed in
Sec4 19, the product topology- can be regarded as the weak topology
generated by the projections~ Figure 23 may assist the reader in vis-
Compactness
117
xi
xt
a topological space muat first of all be a. non-empty set, it is worth remarking here that the axiom of choice guarantees that the product of a non-empty clAea
of non-empty sets is non.empty.
1 Since
118
Topology
it. Our basic open set then consists of all functions in X whose graphs
cross each of these three vertical segments within the given open set on
that scgn1ent. In the figure, f belongs to our basic open set~ but g does
not. The product topology on any product space can be visualized in a
simila T way~ Al 1 one has to do is imagine the coordinate spaces as fibers,
each attached to a specific element of the index set. The resulting
mental image of the product space will then look something like a bundle
of fibers, or perhaps a hed of reeds growing in a pond~
Problems
1.
2.
3.
4.
5.
AU proj ections~ being open mappings, send open sets to open sets4
Use the Euclidean plane to sho\v that a projection need not send
closed sets to c1osed sets.
Show that the relative topology on a suhspace of a product space is
the v.r~eak topology generated by the restrictions of the projections
to that subspace.
1.JCt f be a mapping of a topological space X into a product space
P,X", and show that f is continuous ~ pJ is continuous for each
projection P'-
Con8ider the product space defined and discussed in the last para...
graph of the text, and show that this space is not second countable.
(Hint.+ recall Theorem 18-B, and observe that the index set is uncountably infinite~)
Let X, Y, and Z be topological spaccs 1 and consider a mapping
z = f(x y) of the product set X X Y into the set Z. We say that
f is continuous in x if for each fixed Yo the mapping of X into Z
given by z = f(x,yo) is continuous. The statement that f is continuous in y is defi11ed similarly. f is said to be }ointly con.tinuous in x
and y if it is continuous as a mapping of the product space X X Y
into the space Z.
(a) If all three spaces are metric spaces, show that j is jointly
continuous ~ x~---+ x and Yn---+ y implies f(xn~Yn) ~ f(xiy).
(b) Show that if f is jointly continuous, then it is con tin nous in each
variahle separately. Show that the converse of this statement
is false by considering the real function defined on the Euclidean
plane by f(x,y) = xy/(x 2
y 2} and f(O,O) = 0.
1
The main theorem of this section, to the effect that any product of
compact spaces is compact} is perhaps the most important single theorem
Compactness
119
of general topology.
(x1,
X2,
Xn): a~
120
Topology
1.
2..
3.
4~
Compactness
121
Theorem A.
PROOF~
property.
Let X be a compact metric space and A an infinite subset of X+
aSBume that A has no limit point, and from this we deduce a con-
PROOF+
' re
A solid case can be made for the pro position that com pa.et spaces a.re natural
g enera.li.z& tio n ~ of spa.c cs with only a finite num her of po in ts. For a discussion of
thia, a.nd of the significance of compactness in ana]yais, see Hewitt [19].
1
122
Topology
the proof.
Our next task is to prove that compactness is implied by sequential
compactness. We carry this out in several stages, the first of which can
be motivated by the fallowing considerations~ Let {Gt:} be an orw.n cover
of a metric space X. Then each point x in X belongs to at least one
G~,. and .since the G,/s are open~ each point x is the center of some open
sphere which is contained in at ]east one G;.+ lf we now move to another
point of X 1 we may be forced to decrease the radius of our open sphere
in order to squeeze it into a G1. Under special circumstances it may not
be necessary to take radii below a certain level as we move from point to
point over the en.tire space. The following concept is useful for handling
this sort of situation.. A real number a > 0 is called a Lebesgue number
for our given open cover {c.} if each subset of X whose diameter is less
than a is contained in at least one Gi~
1
In a sequentially compact
Comp ac.tness
123
Problem 10-3 that B~~ C Sr-(x) C G1t~ This contradicts the fact that
Bn.,, is a big set, and completes the proof.
The next stage requires the following concepts. Let X be a metric
space+ If e > 0 is givcnt a subset A of Xis called an f:-net if A is finite
and X = U m!A SE(a)~ that is~ if A is finite and its points are scattered
through X in such a way that each point of X is distant by less than
t from at least one point of A.
The metric space Xis said to be totally
bounded if it has an t-net for each E > 0. It is clear that if X is totally
bounded~ then it is also bounded; for ii A is an E-net, then the diameter of
A is finite (since A is a non-empty finite set) and d(X) < d(A)
2E.
'f utal boundedness is actually a much stronger property than boundedness1 as we shall see below.
Theorem D.. Every sequentially compact metric space 'is totally bounded.
PROOF.
Let X he a sequentially compact metric space~ and let E > 0
be given. Choose a point a1 in X and form the open sphere Sf(ai)~ If
this open sphere con ta.ins every point of X, then the single-element set
{ a1 ~ is an f-net~ If there are points outside of S~(ai)J let a 2 be such a
point and form the set S" (a1) V S" (a2).. If this union contains every
point of X, then the t\'vo--element set ia1Ja2} is an E-net+ If we continue in
this ~,.ay~ some union of the form SE(a1) U Se(a 2 ) U V S~(af)) will
necessarily contain every point of X; for if this process could be continued
indefinitely, then the sequence {a1, a!, . . . ~ an, ~ ~ l would be a
sequence -with no convergent subsequence~ contrary to the assumed
sequential compactness of X. We see by this that some finite set of the
form {a1, a2,
~ af)} is an E-net, so X is totally bounded.
+
12~
Topofogy
Prob terns
1.
2.
3.
4.
Compactness
125
The reason for this notation will soon be clear. Since X is totally
bounded, there exists ~finite class of open spheres 1 each with radius Yi~
\Vhosc union equals X; and from this we see that 81 has a suhf.5~quence
S2 = {X21t X2-i 1 Xi3~ . ~ . } a.11 of whose points lie in some one open sphere
of radius }1. Another appliration of the total boundedness of X sho\vs
similarly that S: ha8 a subsequence 83 = {Xa1, x~l2t XJa, ~ 1 all of
\Vhose points lie in some one open sphere of radius ?3. "\\'c continue
forming successive subsPquences in this manner~ and we let
+
126
Topology
X1
X2
Xs
= {o, 1 ~ 0,.
= {o~ 0, I'
..
..
..
..
0, . . . },
07 . })
..
0,.
..
has no convergent subsequence,. for the distance from any point of the
sequence to any other is 2Ji. This shows that X is not compact, hence
not totally bounded. The following fact, which we cannot prove here
(see Sec. 47), may add to the reader's intuition about the relation between
boundedness and total boundedness: a Banach space is finite-dimensional
~ evecy bounded subspace is totally bounded.
We now turn to the problem of .characterizing compact subspaces
of e(X,R) or e(X,C). By Theorem B 1 we know at once that a closed
subspace of e(XJR)_ or e(X,C) is compact <=> it is totally bounded.
Unfortunately, however, this information is of little value in most applications to analysis. What is needed is a criterion expressed in terms of the
individual functions in the subspacer Furthermore, for most of the
applications it suffices to consider only the case in which X is a compact
metric space. We describe the relei... .ant concept as f ollo-..vs. Let X be a
compact metric space with metric d, and let A be a non-empty set of
continuous real or complex functions defined on X. If f is .a function in
A, then by Theorem 24-F this function is uniformly continuous; that is,
for each E > 0, there exists a > 0 such that d(x,x 1 ) < 8 ==} [f(x) /(x')I < 11.. In general, a depends not only on E but also on the functionf.
A is said to be equicontinuous if for each Ea acan be found which serves at
once for all functioDB fin A, that is, if for each E > 0 there exists a > 0
such that for every f in A d(x,x') < a;;;;} If(x) - f (x') I < E.
Theorem C (Ascoli's Theorem). If Xis a compact -metric space, then a
cloBed subapace of e(XJR) or e(X,C) is compact ~ it is bounded and
equicontinuous~
=:::;
Compactness
+ Ifrc(x')
+ IJ~(x)
- f(x') I
127
~ fk(x~) f
e.
s.
= {f u, f u.,
fl'~~
+}
'
S2
83
+
{fa1,fs1,f:J31
.p
.p
..
..
'
l~
lt
..
in whlch each sequence is a subsequence of the one directly above it, and
for each i the sequence Si = {f~1r f1:1, fiJ,
l has the property that
{/";(xi) J is a convergent sequence of numbers. If 've define f ~ f'Jt f ~t
by f i = f u~ f-i. = f22, !J = f u~ . . . , then the sequence S
{f 1, f2, f3~ ~ .. J
is the ''diagonal'~ subsequence of 8 1 It is clear from this construction
that for each point xi in our dense subset of X 1 the sequence 1f~(xi) l is a
convergent sequence of numbers~ It remains only to sho'v that S, as a
+
;:::=
128
Topology
form the open sphere S-b(x~-) l\Tith radius a centered on each of the x~'s.
Since the x/8 are dense, these open spheres form an open cover of X~ and
sjnce X is r.ompact, X == U~~ 2 S~(xi) for some io. It. is easy to see that
there exists a positive integer no such that ni~ n > no~ ffm(Xi) - J'l(xi) I <
~/3 for all the points x2, x(I,
xi~- Our proof is completed by the
remark that if xis an arbit-rary point of X, then an i can be found in the
set {2, 3~ . . ~ , io} such that d(x,x. J < a, and that therefore
4
>
m,n
E.
1..
2.
3..
4..
for 0
x < 1/n, f"'(x)
locally compact.
l for 1/n
< x =5
The following terminology is often used with AscoliJs theorem. Let F be a.ny
non~mpty set of real or complex functions defined on a.n arbitrary non-empty set X.
The statement that a function fin Fis bounded mea.ns,. of course~ that there exists a
real number K such that ~/(x)I < K for every x in X. The functions in F a.rn often
said to be uniformly bounded (or F is called a unifnrmly bounded set offu.nction.r:;) if
th ere exists a single K which works in this w a.y for a 11 f 1s in ~"t ~ i.e T, if th ere is a K
such that If( x) r < K for every x in x and every I in F. If we ""Ter e tu USO this
expression 1 Aseoli's theorem wouJd take the following form: if Xis a compact metric
e:pace" th en a closed subspace of e( X,. R) or ~( X, C) is com pa ct (::::::} it is u niior m]y
bounded and ~quicontinuou@. The uniform boundedness here ]s merely boundedness
.as a sub.set of the metric spa.ee e(X.,R) or e(X;C)t
1
CHAPTER FlV'E
Separation
130
Topology
Theorem A.
~each
Let
It is customary here to drop the distinction bet.ween a point x in a space and the
.set ~ x } w hi c: h contains only that point~ Th is eonvent ion of ten makes it possible to
avoid cumbersome mo<les of expression, and we sho.11 use it freely.
1
The Ti-space nomencla.ture 1 for i ~ 0, lt
J 5, was introduced by A1exandroff and Hopf in thPir famous trealise r21. The T refers to the German word
TTen n ung saxio m~ which mcs..ns ' fjepa.ra. tion a.xio m.'' The term T rspace is the only
one of these which is at.ill in genera.I use.
1
Separation
131
'
fi
Theorem D.
Problems
1.
16~5
is not a
Topor ogy
132
2.
8..
XX Y.
I.et X be any non-empty setJ and prove that in the lattice of all
topologies on X each chain has at most one compact Hausdorff
topology as a member. (It is interesting to speculate about v.-~ he th er
a compact Hausdorff topology can be defined on an arbitrary non-+
empty set.)
Let X be an ar bi tra.ry topological space and {x" } a sequence of
points in X. This sequence is said to be (!onvergent if there exists a
point x in X such that for each neighborhood G of x a positive integer
n-o can be found ''dth the property that Xn is in G for all n > n~.
The point x is called a limit of the sequence~ and we say that Xn
converges to x (and symbolize this by Xn ~ x)
(a) Show that in Example 16-3 any sequence converges to every
point of the space. This is the reason why the above point x
is called a limit instead of the limit.
(b) Ii Xis a Hausdorff space,. show .that every convergent sequence
in X has a unique limit.
(c) Sho,v that if f :X ~ Y is a continuous mapping of one topological space into another, thr.n Xn---+ x in X ~ f(xn) -~ f(x) in Y.
Prove that the converse of this is true if each point. in X has u
countable open base. 1
+
The f ac t.s brought out in t.h.iB problem a.re the main reasons w by the concept
of a couvergcnt sequenee is not very important in the general theory of topologica.1
sps.ecs~
Separation
133
13.4
Topology
Topological spaces
Hausdorff spaces
Normal spaces
Metric
Compact
spaces
Hausdorff
spaces
F ].(:!;. ')5
.....
1.
2.
3.
4.
Separation
135
A~ U~
CIH~
c B'.
U}'.! C
[! }i}
so in the same
c B'.
{/H C llt.i
If we continue this process, for each dyadic rational number of the form
t = m/2n (where n = 1; 2, 3, . . . and m :;:::: l~ 2,
2n - 1) we have
an open set of the form llt,. such that
+
t1
< t2 ===>-A
--
B .
otherwise. It is clear that the values off lie in [O, I], and that J(A) = 0
and f(B) ~ L. All that remains to be proved i~ that f is continuous+
All intervals of the form {O,a) and (a,lL where 0 < a < 1, constitute an
open sub base for [OJ l 14 It therefore suffices to show that 1~ 1 ([O,a)) and
) 1 ((a~l]) are open. It is easy to see thatf(x) < a t j xis in some U, for
t < a; and from this it follows thatj- 1([0,a)) = {x :/(x) <a} = V,<a Ut,
which is an open set. Similarly, f(x) > a-:;::} x is outside of u; for some
t > a; and therefore 1- 1((a;11]) = tx:f(x) > al ~ v~>a Ul~ which is an
open set4
It is clear from this theorem that every normal space is completely
regular: all that is necessary is to take the closed subspace A to be a single
point and to observe that the function f is exactly what is required in the
definition of complete regularity.
The following slightly more flexible form of Urysohn's lemma will
be useful in applications.
136
Topology
Theorem B.. Let X be a normal space, and let A and B be disjmnt close.d
subspac.es of X~ If [a~b] is any closed interval on tke real line, then there
exists a continuous real function f defi.ned on XJ all of whose values lie in
[a~b1i such that f(A) = a and f(B) = b.
PROOF.
If a = b, we have only to define f by f(x) == a for every x> so we
may assume that a < b. If g is a function v.rith the properties stated in
U-rysohn's lemma, then f = (b - a)g
a has the properties required by
our theorem.
Ao== {x:fo(x) ~
-Yal
=== (M)(;3)~
+ + +
Separation
137
+.
by Sn = go + U1
~ + OiL-1 1 and we regard the sn's as
the partial sums of an infinite seri~s of functions in e(X,R)~ We know
that e(XJR) is completeJ so by lgn(x)i ::;; {%) (%)n and the fact that
x===- 0 (7~) (%) = 1, we see thats~ converges uniformly on X to a bounded
continuous real function f' such that lf'(x)I < 1. We conclude our
proof by noting that since Iftt(x)I < (%)'\ s" converges uniformly on
F to fo = !1' and that therefore/' equals/ on F and is a continuous extension of f to the full space X Vlbich has the defiired property.
now define
Bn
11
Problems
1.
2..
3.
138
Topology
As motivation for our main theorem, we note that since the metric
space R 110 is second countable, every subspace of it is also second countable4
It turns out that second countability, in addition to normality, suffices
to guarantee that a topological space is homeomorphic to a subspace
of R~. In effect~ we imbcd such a space homeomorphically in R~.
If Xis a second countable normal space, then there exists a hom.eomorphismf of X onto a subspace
of R~ 1 and X is therefore metrizable
PROOF. We may assume that X has infinitely many points~ for other-wise it would be finite and discrete 1 and clearly homcomorphlc to
any subspace of Rec with the same number of points. Since X is
second countable, it has a countably infinite open base {G1, G2, GJ, . . . }
each of whose sets is different from the empty set and the full space.
If G1 and x a Gi are given~ then by normality there exists a G;: such that
x e Gi C G; c Gj. '"fhe set of all ordered pairs (G1,G;) such that G1 C Gj
is countably infinite, and we can arrange t.hem in a sequence P1j
P 2 ~ . . . . , P n .. . By Urysohnts lcmrna~ for each ordered pair
P ~ = ( Gi~ G1) there exists a continuous real function f n: X ~ [O, 1] such
that f ,.(G;.) = 0 and fn(G/) ==== 1. :For each x in X we define /(x) to be
the sequence given by f(x) = {f 1(x) 1 f2(x)/2,
~ fn(x)/n, ~ .. }~ If
we recall that the infinite series ~== 1 l/n'2 convergesJ it is ca.sy to see that
f is a one-to-one mapping of X into Ri ~ It remains to be proved that
f and J- 1 arc continuous.
To prove that f is continuous, it suffices to show that given x~ in
X and E > 0 1 there exists a neighborhood H of Xo such that y E H ==>
l]f(y) - f(xo) ll < E. Since an infinite series of functions converges
uniformly if its terms are bounded by the terms of a convergent
infinite series of constants, it is easy to see that there exists a positive
integer no such that for every y in X we have
Theorem A (the Urysohn lmbedding Theorem).
+
nJ(y) - f(xo)H 2
=:;
I=-=1 ~[f~{y) -
/n(xo)l/nl'"
< :z::..1 I[/n(Y)
f ~(xo)J/nl !
+ E2/2.
139
Separation
-==::
<
~~
1..
2.
3..
We know that every metric space is normalJ and also that a normal
space) if second countable~ is metrizable.. Give an example of a
normal space which is not metrizable (hint.: see Problem 22-4)~ Thls
shows that metrizable spaces cannot be characterized among topologi ..
cal spaces by the property of normality~
Among normal spaces~ second countability implies metrizability.
Give an example of a metric space which is not second countable.
This shows that metrizable spaces cannot be characterized among
normal spaces by the property of second countability.
Show that a compact Hausdorff space is metrizable <=> it is second
countable~ 1
The results of this se.r:tion have chara.ct.erizcd metrizable spaces among second
oounlaLle ~pa.ces (by normality) and among com pa.ct Hausdorff spaces (by second
countahi1ity ), but not among topological spaces in genera..l. This more difficult
problem 'vas solved by Smirnov [38]. For an exposition of his solution~ see Kelley
[25, pp. 126~ 130J.
140
Topology
sections;
(1) if X is a completely regular space, then the 'veak topology
generated by e(X)R) equals the given topology;
(2) the relative topology on a subspace oi a product space equals
the weak topology generated by the restrictions of thr. projections
to that subspace .
These facts (they are Problems 27-4 and 22-2) are the basic principleH on
v.Thlch the following analysis rests.
We begin 'vith an arbitrary topological space X and the set e(X,R)
of all bounded continuous real functions defined on X. Let the fun('tions in e(X 1R) be indexed by a set of ind ices i, so that e (X ,ll) == {/i l
For each index it lr.t [; be the smallest closed interval '"'hich contain8 the
range of the function f,, ~ Each Ii: is a com pact Hausdorff spacei: so their
product P = Pili is also a compact Hausdorff space, and every subspace
of P is completely regular. We next define a mapping f of X into this
product space by means of f(x) = {fi(~)}, that ist in such a v,ray that
f(x) is that point in the product. space P whose ith coordinate is the real
number fi(x)r By ProblPm 22-3 and the fact that p,f = fi for each projection Pi~ it is clear t.ha t f is a continuous mapping of X in to P.
We now assume that e(X,R) separates the points of X~ This is a
weaker assumption than complete regularity and is exactly the requirement that f be a one-to-one mapping. At this stage, 1ve use f to replace
f (X) as a set by X; that is, we imbed X in P as a set. X is thus a sub~
set of P l\:rhich has two topologies: its own, and the relative topology
\Vhich it has as a subspace of Pr We observe two features of this situation. First~ since f is continuous~ the given topology on X is stronger
than its relative topologyr Sccondj e(X,R) is precisely the set of all restrictions to X of the projections pi of Ponto it~ coordinate spaces Ii. It is noYl
clear that if X is completely regular, so that by state1nent (1) its given
topology equals the weak topology generated by e(X,R), then by stutement (2) its given topology equals its relative topology, and X ean be
regarded as a subspace of P .
In accordance with these ideas, we now assume that X is con1pletely
Separation
l 41
regular~
1.
24>
3.
If X is completely regular, show that every bounded continuous complex function defined on X has a unique extension to a bounded
continuous complex function defined on t3(X).
Every closed subspace of a product of closed intenTals is a compact
Hausdorff space. Show, conversely, that every compact Hausdorff
space is homeomorphic to a closed subspaee of such a product.
Prove the following generalization of the rietze extension theorem.
If X is a normal space, F a closed subspace of X, and fa continuous
mapping of F into a completely regular space Y, then f can be
extended continuously to a mapping f' of X into a compact Hausdorff
space Z whlch contains Y as a subspace.
1
For Stoneta own version of these ideas. 1 aec his paper (39J.
CHAPTER SIX
eo11Jtcetcd11cs1
Connectedness
1"3
31 .. CONNECTED SPACES
A connected space is a topological space X whlch cannot be represcn tcd as the union of two disjoint non-empty open sets. If X ~ A VB,
where A and B are disjoint and openJ then A and B are also closed, so
that X is the union of two disjoint closed sets 1 and conversely. We .see
by this that Xis connected ~it cannot be represented as the union of two
disjoint non-empty closed sets. It is also clear that the connectedness
of X amounts to the condition that 0 and X are its only subsets which
are both open and closed~ A connected subspace of X is a subspace Y
which is connected as a topological space in its own right. By the definition of the relative topology- on Y 1 this is equivalent to the condition that
Y is not contained in the union of two open subsets of X whose intersections 'With Y a.re disjoint and non-empty.
Our space Xis said to be disconnected if it is not connected~ that is~
if it can be represented in the form x = A U B, where A and B are
disjointi non-empty, and open; and if Xis disconncctcd 1 a representation
of it in this for1n (there may be many) is called a disconnection of X .
We begin by proving a theorem which supports a considerable part
of the theory of connectedness.
Theorem A. A 8Ub8pace of the real linR R is connected <==>it is an interval.
In particulari. ll is connected.
PROOF..
Let X be a subspace of R. We first prove that if Xis connected 1
then it is an interval. We do this by assuming that Xis not an interval
and by using this assumption to show that X is not connected. To say
that Xis not an interval is to say that there exist real numbers x, y, z such
that x < y < zJ x and z arc in X, and y is not in X~ It is easy to see from
this that X ~ [X r\ (- oo ,y)] U [X
(y,
~ )J is a disconnection of
X 1 so X is disconnected.
We complete the proof by shol\Ting that if Xis an intenral, then it is
necessarily connected. Our strategy here is to assume that X is disconnectr.d and to deduce a contradiction from this assumption~ Let
X = AV B be a disconnection of X. Since A and Bare non-emptyt we
can choose a point x in A and a point z in B. A and B are disjoint, so
x r:E 2 1 and by altering our notation if necessary, we may assume that
x < z. Since X is an interval, [x,z] C X, and each point in [x,z] is in
either A or B.. We now define y by y == sup ([x,z]
A). It is clear
that x ~ y :::; z, so y is in X. Since A is closed in X, the definition of
11 shows that y is in A. From this we conclude that y < z.. Again by the
definition of y 1 y
E is in B for every e: > 0 such that y
~ :$; z, and
since Bis closed in X, y is in B. We have proved that y is in both A and
B, which contradicts our assumption that these sets are disjoint.
144
Topology
x.
As a direct con.sequence of the two theorems just proved~ v.,..e have the
f ollov,.ring gen~ra1iza tion of the W cierstrass in term~d ia te value theorem.
Theorem C.. 77u range of a continuous real function defined on a connected
space is an interval.
It is a trivial observation that any tv"ro discrete spaces \vith the same
number of points arc e~s~ntially id en tic al; for any one-to-one mapping of
one onto the other (there is at least one) is a homeomorphism, and \ve may
think of them as differing only in the symbols used to designate th(-~ir
points. It is in thls sense that t.here is only one discrete space \vith any
given number of points. The discrete two-pO'int space, '\\,..ruch is obviously
disconnected. is a usPful tool in the theory of connectedness~ V{e denote
its points by the symbols 0 and I, and we think of them as rca] numbers_
A topological S'pace X is disconnected ~ there exislB a continuou8 mapping of X onto the discrete two~point space {0,1 l
PROOF. If Xis disconner.ted and X = A VB is a disconnPetiont then v.~e
define a continuous mapping .f of X onto {O, 1 l by the requirement that
f(x) = 0 if x is in A and f(x) -== 1 ii xis in B. This is a valid definition
by the fact that A and Bare disjoint and their union is X. Since A and
B are non-Pmpty and open, f is clearly onto and continuous.
On the othPr hand, if there exists such a mapping, then X is disconnected; for if X were connected~ Theorem B ",.ould impJy that {O~ 1} is
connected, and this wou]d be a contradiction.
Theorem D.
Thjs result is a useful tool for the proof of our next theorem.
Theorem E.
conn.ected4
I-Rt ~ X1 l be a non-empty class of connected spaces~ and form
their product X == P~i- We assume that X is disconnected,. and we
deduce a contradiction from this assumption.. By Theorem D, thPre
exists a continuous mapping f of X onto the discrete two-point space
PROOF~
Connectedness
1'5
anJ
b").
2.
146
Topology
34
particular,
A is connectoo.
Assume that Bis disconnected, that is, that there exist two open
subsets G and H of X whose union contains B and whose intersections
with Bare di~oint and non-empty. Since A is connected and contained
PROOF.
Connectedness
147
Theorem C. If X is an arbitrary topological space, then we have the following: (1) each point in Xis contained in exactly one component of X; (2)
each r.onnected 811hspace of X is contained in a component of X; (3) a
connected subspace of X which is both open and close.d is a component of X;
and (4) each component of Xis closed.
PROOF.
To prove (1), let x be a point in X. Consider the class {Cd of
all connected subspaces of X which contain x.. This class is non-empty,
since x itself is connected~ By Theorem A, C = UiCi is a connected
subspace of X which contains x. C is clearly ma...ximal, and therefore a
component of X, because any connected subspace of X which contains
C is one of the C./s and is thus contained irr C. :Finally, C is the only
component of X which contains x. For if
is another, it is clearly
among the C/s and is therefore contained in C, and since C* is maximal a.s
a connected subspace of X~ we must have C* = C.
Part (2) is a direct consequence oi the construction in the above
pa.ragraphj for by this construction, a connected subspace of X is contained in the component which contains any one of its points.
We prove (3) as follows~ Let A be a connected subspace of X which
is both open and closed+ By (2))1 A is contained in some component C.
If A is a proper subset of C 1 then it is easy to see that
C = (Cf\ A) U (C
A')
is a disconnection of C. This contradicts the fact that C, being a component, is connected, and \Ve conclude that A = C.
, Part (4) follo,vs immcdiat~]y from Theorem B; for if a component
C is not closed, then its closure C is a connected subspace of X \Vhich
properly contains C, and this contradicts the maximality of C as a connected subspace of X.
[X
n (-
a:i
,y)]
[X n (y,+
~ )l
14
Topology
Problems
1.
u:_
2.
3.
4.
5..
An is connected4
6..
7~
BL
+ t(y -
x) = (1 - t)x
+ ty
is also in S for every real number t such that 0 < t < la Intuitively~ a
convex set is a non-empty set which contains the segment joining any
pair of its points.. Prove that every convex subspace of B is connected. Prove also that every sphere (open or closed) in B is
con vex, and is therefore connected.
Show that an open subspace of the complex plane is connected ~
every two points in it can be joined by a polygonal line:
Consider the union of two open discs in the complex plane which are
externally tangent to each other. State whether this subspace of the
plane is connected or disconnected, and justify your answer. Do
the same when one disc is open and the other closed~ and when both
are closed.
Consider the following subspace of the Euclidean plane: {(x, y) ~ x ~ 0
and y = sin (1/x) J. Is this connected or disconnected? Why?
Answer the same questions for the subspace {(x,y) :x r!E 0 and
y ~ sin (I/x) l V {(x,y) ;x = 0 and -1 < y < 1}.
Connectedness
149
We have seen that a connect.Pd space is one for which no disconnection is possible. We now consider spaces which have a great many
disconnections, and \Yhich therefore lie,. in a manner of speaking, at the
opposite end of the connectivity spectrum.
A totally disconnected space is a topological space X in which every
pair of distinct points can be separated by a disconnection of X. This
means that for every pair of points x and y in X such that x r!E y, there
exists a disconnection X ;;::: A V B vrith x e A and y e B. Such a space
is evidcn tly a Hausdorff space~ and if it has more than one point,. it is
disconnected. Oddly enough, a one-point space is both connected and
to tally disc onnccted
The discrete spaces are the simplest totally disconnected spaces.
A more interesting example is the space ...discussed at the end of the
previous section~ that is, the set of all rational numbers considered as a
subspace of the real line. The set of all irrational numbers is also a
totally disconnected subspace of the real line~ and this is proved in much
the same way, from the fact that. there exists a rational number between
any two irrationals.. The Cantor set is yet another totally disconnected
subspace of the real line, this time one which is compact.
Our first theorem should not rome as a surprise to anyone.
+
Theorem A..
Tf~
(Y l\ A) U (Y
B)
is a disconnection of Y.
Tot.al disconnectedness is closely related to another interesting
property.
Theorem B.
1.50
TopoJogy
Theorem C. Let X be a compact Hau!3dorff s-pace. Then Xis totally di!3connected {::::} it has an open base 'l.l)hose sets are also closed+
PROOF..
In view of Theorem BJ it suffices to assume th.at X is totaUy
disconnected and to prove that the class of all subsets of~ v.,..hlch are both
open and closPd forms an open base. Let x be a point and G an open set
which contains it+ We must produce a set B which is both open and
closed such that x e: B C G. We may assume that G is not the full
space~ for if G :;:::: X, then we can satisfy our requirement by taking
B = X. G' is thus a closed subspace of XJ and since Xis compa:ct)' G' is
also compact. By the assumption that X is tot.any disconnected~ for
each pointy in G1 there exists a set H,, which is both open and e1o.sed and
contains y but not x~ G' is compact, so there exists some finite class of
H 1/s, which we denote by {Hh H2~ ...
Hn}, with the property that its
union contains G' but not x. We define H by H = u: 1 Hf~ and we
obscnre that since this is a finite union and all the H/s are closPd as "~en
as open, Il is both ~pen and closed~ it contains G', and it does not contain
x. If vle now define B to be H', then B clearly has the properties required
of it.
+
Problems
1.
2.
Prove that the product of any non..empty class of totally disconnected spaces is totally disconnected.
Prove that a totally disconnected compact Hausdorff space is
homeomorphic to a closed subspace of a product of discrete two-point
spaces~
The reader should be made a.ware of the fact that several different definitions
of tota.1 disconnectedne.ss a.re commonly found in the liters.lure. The definition
given above seems to the present writer to have the logic of language behind it; and
Theorem C show.s that th.is definition is equivalent (in the case of R compact Hausdorff
spa.ce) to the moat important of the.se alternative de.finitionsr
l
Connectedness
151
A
I
I .x
\
I
I
I
I
,_
,,,,
I
I
I
1/r
.-..1-
Fig4 264
2/-;r
- - - .......... ._,.,__ __
;z::!
(x~
sin (1/x)),
152
Topology
easy to see that each point x in A has a neighborhood which does not
contain any connected neighborhood of x.
We know by Theorem 32-C that the components of an arbitrary
topological space X arc always closed sets, and from this we see at once
that the components of any closed subspace of X are also closed in X.
The reader may feel, 'vith some justification~ that the components of a
w cll-beha ved space ought to be open sets. This is true for locally
connected spaces .
Let. X be a locally connecte.d space. if Y is an open subBpace
of X, then each component of Y is open in X+ In particular, each component
of Xis open.
PROOF.
I_jet C be a component of Y. We wish to show that C is open
in X. Let x be a point in C+ Since X is locally connected and Y is
open in X, Y contains a connected neighborhood G of x~ It suffices to
sho"T that G C C. This will follow at once from the fact that C is a
component of Y if we can show that G is connected as a subspace of Y.
But this is clr.ar by Problem 16-6, according to which the topology of
Gas a subspace of Y is the same as its topology as a subspace of X; for
G is connected "\\'i th rcspe ct to the lat tcr topology
Theorem A.
1.
2.
3.
44
5.
6.
7~
Prove that a topological space X is locally connected if the components of every open subspace of X are open in X ~
A connected subspace of a locally connected space X is locally
connected ii Xis the real line. "Wliv?
. Is this true if X is an arbitrary locally connected space?
Show that a compact locally connected space has a finite number of
components .
Show that the image of a local~y connected space under a mapping
which is both continuous and open is locally connected+
Prove that the product of any non-empty finite class of locally
connected spaces is locally connected .
Show that the product oi an arbitrary non-empty c]ass of locally
connected spaces can fail to be locally connectedL (Hint consider a
product of discrete two-point spaces+)
Prove that the product of any non-empty class of connected }or.ally
connected spaces is locally connected.
CHAPTER SEVEN
Approrimation
Our work in the present chapter centers around the famous theorem
of Weierstrass on the approxhnation by polynomials of continuous real
functions defined on closed intervals. This theorem, important as it is
in classical analysis, has been overshadowed in recent years by a generalized form of it discovered by 1' tone~ The latter rela tcs to con tjn uous
functions defined on compact Hausdorff spaces, and has become an
indispensable tool in topology and modern analysis.
W.e prove the 1\7 eierstrass theorem and then the t'vo forms of the
Stone-Weierstrass theorem which deal separately with real and complex
functions+ Finallyt after an excursion into the theory of locally compact
I!ausdorff spaces, we extend the Stone-Weierstrass theorems to this
context .
35~
Let us consider a closed interval [a,b] on the real line and a polynomial
p(x} ~ ao
+ a1x +
+ anxn,
153
154
Topology
that 0
t (n)
I: mi:O
xi(l -
x)K-~
[x
+ (1
-- x)]R
(1)
Approximation
155
1~ ( ; ) xk{l
x)"~(k -
nx)
= 0,
(2)
(n) zk-
iO
(1
~ x)'f;~~ 1 (k
- nx)z = n;
(n)
i z:::Q
nx(l - x),
(n)
l:ccO
x-1(1 - x)n-'"
(4)
Identities (1) and (4) will be our main tools in showing that Bn(x) is
uniformly close to f(x) for all sufficiently large n.
Now for the proof of the fact just stated. By using (I), we see that
f(x) - B8{x) =
10
~ (~) x (1
1
- x)-k [ f(x) -
(!)}
so that
lf(x) - B .. (x)I
10
- x) 8 - t f(x) -
f (:)
(5)
Since f is uniformly continuous on [01 IJ, we can find a lJ > 0 such that
Ix - k/nj < a~ ~f(x) ~ f(k/n)I < f./2. We now split the sum on the
156
Topology
right of (5) into tvt'"o parts, denoted by ~ and X':1 where Z is the sum of
those terms for which jx - k/nf < o (we think of x as fixed but arbitrary) and where %' is the sum of the remaining terms. It is easy to see
that :2; < t:/2. We complete the proof by showing that if n is taken
sufficiently large, then l;' can be made less than f/2 independently of x.
Since f is bounded, there exists a positive real number K such that
jf(x}[ < K for al1 x in (0,1]+ l~rom this it follows that
- x)"-k,
\vhere the sum on the right-denote it by ~"-is taken over all k such
that Ix - k/n r 2: 8. It now suffices to sho''{ that if n is taken sufficiently
large, then lo" can be made less than t:/4K independently of x. Identity
(4) sho1~ls that
11
x(l - x)
-<
~2
"""''' <
so
'
x(l ~ x )__
~tn
<
.
- 4o~n
x 2~ ,
xn. t
are dense in efO, 1], u~hcrc by a real linear comhina.tion of these functjons
we mean the result of choosing any finite set of them, multiplying each by
a real number~ and adding. Instead of working with all positive pov;rers
of x, 1et us pennit gaps to occur, and consider the infinite set of functions
1'
X 'H
Approximation
157
the n.1::'B being positive integers for which ni < n2 < < n.1:: <
The result we have in mind is called Munh'.s theorem, and asserts that aH
rea1 linear combinations of these functions are dense in e[O,l] -=> the
series ~: 1 1/n.t divergesr For a proof, we refer the interested reader to
Lorentz [29, pp. 46--48J or Achieser [I~ PP~ 43--46].
Problems
<t
2.
3.
The moments
of f are the numbers
f (x)x" dx, where n = 0, I , 2, . . . . Prove
that two continuou8 real functions defined on f0, 1] are identical if
they have the same sequence of moments.
lTse the Weierstrass theorem to prove that the polynomials arc
dense in e(X,ll) for any closed and bounded subspace X of the real
line.
Jo
stripped away~
Our starting point is the fact that the polynomials are dense in
e[a,b] for any closed interval [a,b]. We ~rish to replace [a,b] by an
arbitrary compact Hausdorff space X and to make a similar statement
about e(X~R). The most obvious difficulty in this program is that it is
meaningle&13 to speak of polynomials on Xr This obstacle will disappear
when we take a closer look at what polynomials are~
Consider the two functions I and x defined on [a,b].. The set P of
all polynomials on {a,b] is identical with the set of all functions which
can be built from these two by applyjng the following three operations:
multiplication~ multiplication by real numbers, and addition. P is an
algebra of real functions on [a~bJ, for it is cJosed with respect to these
three operations. Even more~ it is a subalgebra of ~[a,bJ. We say that
P is the subalgebra of efatbl generated by {l ,x} 1 for it is a subalgebra
containing {l ,x} which ii:~ contained in every subalgebra with this property. We knov-.,.. by Problem 20-3 that the closure of a subalgebra of
e(X~R)-.---,-for any topological space X-is also a subalgcbra of e(X,R).
We may therefore speak of the closure P of P as th.e closed subalgebra of
e[a,b J generated by {1,x} . As above, this means that P is a closed subalgebra containing {I 1x} which is contained in every closed subalgebra
158
Topoiogy
with this property4 These ideas make it possible for us to state the
Weierstrass theorem in the following equivalent forms~
(1) the elosed subaJgebra of e[a,b] generated by ~ 1,x I equals
era,b];
(2) any closed subalgebra of e[a,b] which contains {1,x} equals
e[aJbJ.
..
These are potent statements, aaying 7 as they do~ that the very small set of
functions {1,x} suffices to generate the much more extensive set e[a,b].
As our theorems below will show, these statements depend only on the
fact that a closed subalgebra of e[a~b] which contains the set : I~x l
separates points in the sense of Sec. 27 (for it contains the function x)
and contains all constant functions (for it contains the non-zero constant
function 1).
U v g)(x) =
and
(f A g)(x)
max {/{x)~g(x)}
min {/(x) 1 g(x) 1-
Our first lemma states conditions which guarantee that a closed sublattice of e(X,R) equals e(X 7 R)~
lemma. Let X be a compact Hau8durff space tvith more than one pO'int,
and let L be a closul sublattice of e(X 1R) with the following P'fOperty if
x and y are distinct points of X and a and b any two real numbers, then there
exists a junction fin L Bt1.Ch that f(x) = a and f(y) = b. Then L equals
e(X,R). 1
PROOF. Let f be an arbitrary function in e(X,R).. We must show that
j is in L. Let E > 0 be given. Since Lis closed1 it suffices to construct a
function g in L such that f(z) - E < g(z) < f(z) + E for an points z in
4..
If X has only one point J then a. single eonst&n t function constitutes a closed
iu blat tice of e( X ~R) with the stated property which does not equal e{ X, R). It is
therefore nece.ssa.ry to assume th.a. t X has more than one point+ Further, the reader
wi1l notice that the proof given below makes no use of the assumption th.at X is
Hausdorff. HoweverJ if there existe a closed sublattice of e(X,R) with the stated
property then X iB necesap.rily Hq.usdorff r eo tl;lere .is nothing to be gained by omitting
t
J'
this
886\lm ptioJl~
Approximation
159
a function.
I,,et x be a point in X '\\Thich is fixed throughout this paragraph)' and
let y be a point different from x~ By our assumption about L, there
exists a function !'JI in L such that fv(x) = f(x) and fu(Y) = f{y)~ Now
consider the open set G11 = {z :f11 (z) < f(z) + ~d. It is clear that both
x and y belong to G"~ so the class of Gi/s for all poiuts y different from
x is an open cover of X. Since X is compact? this open cover has a
finite subcover, which we denote by {Gh G12)' ~ . , Gft}. If the corre..sponding functions in l.J are denoted by /1, f2i "
In, then
I
9~ =
f 1 A /i
'
"
I\
fn
is evidently a function in l..i such that g.z(x) ~ f(x) and g.z(z) < f(z) + e
for all points z in X.
We next consider the open set H ~ = {z: g.:r:(i) > /(z) - E 1- Since
x belongs to H~t the class of H 2's for all points x in Xis an open cover of
X~
The c.ompactness of X implies that this open cover has a finite subcover, which we denote by {111,. H 2 ~ . ~ ~ H m}.. We denote the corresponding functions in l..1 hy gi~ !J2J . ~ . , gm, and we define g by
g = {}1 v 02 v . v Ym~ It is clear that g i~ a function in L with the
property that f (z) - --= < g(z) < f (z) + E for all points z in X 7 so our
proof is completer
In our next lemma \ve make use of the concept of the ab~olute value
of a function. If f is a real or complex function defined on a topological
space X, then the function !fl-called the absolute value of /-is defined by
I/[ (x) == lf(x)I. If f is rontinuousJ then ]fl is also continuous. We
observe that the lattice operations in e(X~R) are expressible in terms of
addition, scalar multiplication, and the formation of absolute values:
fv g
and
f "0
=f +
=
+ If - ol
2
+ g - If - gl .
2
These identities show that any linear subspace of e(XiR) which contains
the absolute value of each of its functions is a sublattice of e(XjR).
Let X be an arbitrary topological ~~pace. 'Phen every closed subalgebra of fB(X,R) is also a closed sublattice of e(X ~Il).
PROOF.
Let A be a closed subalgebraof e{X,R). By the above remarks,
it suffices to show that if f is in ..4, then ]/' is al:oo in A. I_Jet ~ > 0 be
given. Since lti is a rontinuous function of the real variable t~ by the
Weierstrass approximation theorem there exists a polynomial p 1 with
the property that ~ ltl - p'(t)I < ~;2 for every t on the closed interval
lemma.
160
Topology
[- H!H ,Ir/JI]~
rt
Let X be a compact
Hausdorff space., and let A be a closed subalgebra of e(X:1ll) which separate8
pointa and contains a non-zero constant function. 1lhen A equals e(X 7 R) .
enoo~~. If X ha8 only one point, then e(X,R) contain~ only constant
functions; and since A contains a non-zero constant function and is an
algebra, it contains a.ll constant fun-etions and equals e(X,R). '\\',.. e may
thus assume that X has more than one point. By the above lemmas, it
suffices to sho\V' that if x and y arc distinct points of X, and if a and bare
any two real numbers, then there exists a function f in . ..4. such that
f(x) = a and f(y) = b. Since ..4 separates points, there exists a function
g in A such that g(x) ~ g(y). If we no'\,. define f by
f(z) == a g(z) - g(y)
g(x) - g(y)
+ b g(z)
~ g(x)'
g(y) ~ g(x)
161
remind the reader that its conjugate! is defined by f(x) ~ f(x). It wiU
also be convenient for us to define the real part and the imaginary part off:
R(f) =
f +J
2
and
l(f)
f-. J.
2i
(1)
1&2
2.
3.
Toporogy
Let X be the closed unit disr, in the complex plane, and show that any
function f in ~(X,C) can be uniformly approximated on X by polynomials in z and z with complex coefficients.
Let X and Y be compact Hausdorff spaces, and f a function in
~(X X Y,C)~ Show that f can be uniformly approximated by
functions of the form "X7rac1 f q.;,, where the j/s are in e(X,C,Y) and the
g/s are in e(Y,C).
Approximation
163
from the topologiea] poi11t of view, S minus its north pole can be regarded
as essentially identical with the complex plane C~ The north pole of 8
can be considered to be the point at infinity, and passing from C to
CCC) amouuts to using the point Ct; to plug up the hole in C at the north
pole. When S is identified in this manner with the extended romplex
plane)' it is usually called the llieniann sphe-reL In summary)' the locally
compact Hausdorff space C has been made into the compact Hausdorff
space S by adding the single point ~ .
We now generalize the construction outlined above to the ra~c of' an
arbitrary locally compact Hausdorff space X. Let o:J be an ohje('t not
in X 1 and form the set Xd"J = XU l i } .. We define a topology on Xri'.i by
specifying the following as open sets: (i) the open subsets of X, regarded
as subsets of X "; (ii) the complements in X ~ of the compar.t subspaces
of x; and (iii) the full space
If lf\-Te keep in mind the fact that a
rompact subspace of a Hausdorff space is closed, then it is ea8y to sho'v
that this class of sets actually iH a topolog~y on X ~, and also that the given
topology on X equals .its relative topology as a subspace of x~. The
fol1o\ving are the main propert.iP.s of the topological space X ~
(1) X~ is co1npact. To prove this~ let t Gd be an open rover of X110.
\V c must produce a. finite subcover. If X !10 occurs among the (i./s~ then
{(_T.i} clearly has a finite sub-cover, namely~ {X t;t]). We may therefore
assume that each G1 is a set of type (i) or type (ii). At least one Gi, say
Gi@J must contain the point ~ )' and this set is necessarily of type (ii).
Its complement G1.a' is thus a compact subspace of X v.,Thieh is contained in
the union of some class of open subsets of ..:Y of the form (Ji{\ X, so it is
contained in the union of some finite subclass of these sets, say {G1 f\
X, G2 (\ X, ..
G1J. n X}
It is no'v easy to see that the class
{Gi~, Gh G2~ . ~
Gfl J is a finite subcover of the original open cover of
xi"JJ)' so lit is compact.
(2) X~ is Hausdorffr X is Hausdorff, so any pair of distinct points
in X ~ both of v-.,..hich lie in X can be separated by open su hsets of X, and
, th us can be separated by open subsets of X ~ of type {i). It therPf ore
suffices to sho\v that any point x-in X and the point oo can be separated by
open subsets of Xr:o-r X is locally compact, so x has a neighborhood G
"lvhosP closure G in X is compact. It is 110-...v clear that (; and (;1 are
disjoint open subsets of X r10 such that x .e G and O'J e G', so X ~ is Hausdorff.
The compact Hausdorff space X~ associated with the locally rompact
Ilausdorff space X in the manner described above is called the one-point
compactification of X 1 and the point ~ is called the point at infinity.
'\\Te kno'v that compact spaces are locally compact, so thPsc ideas apply
Vt'"it.hout change when X is a compact Hausdorff spaceL It is easy to see
that the locally compact Hausdorff space Xis compact <i=-:> oo is an isolated
x.
16'
Topology
point of X... It may seem useless to consider the one-point compactification of a compact Hausdorff space:1 but we shall see in the next section
that it enables us to weaken the hypotheses of the Stone. Weierstrass
theorems.
The one..point compactification is useful mainly in simplifying the
proofs of theorems about locally compact Hausdorff spaces~ As an
example, any space X of th.is kind is easily seen to be completely regular;
for X is a subspace of Xll0 1 which is compact HallBdorfi and therefore
completely regular, and every subspace of a completely regular space is
completely regular. Accordingly~ if x is a point of X, and G a neighborhood of x which does not equal the full space, then there exists a continuous real function f defined on X, all of whose values lie in the closed
unit interval [0 1 IJ, such that f(x) = 1 and f(G') = 0. This fact can
easily be generalizedJ again by using the one-point compactification, to
the case in which the point x is replaced by an arbitrary compact subspace of X.
Theorem A. Let X be a locally compact Hausdorff spaceJ kt C _be a compact subspace of X, and let G be a neighborhood of C which does not equal
the full space~ Then there exists a continuous real junction f defined on X,
all of whose values lie in the closed unit interval [O,l]J such that f(C) == 1
and f (G') = 0.
PROOF.
Let
be the one-point compactmcation of X. Then c and
xd',j
Problems
1.
2..
3.
4.
Approximation
165
c.
lemma..
e{X,C).
- g(x) l
==
t:
I(f
so
+ g)(x)!
;;::: l/(x)
+ g vanishes at infinityr
t:1
166
Topology
.
e 0{X,R) is also closed with reepect to scalar multiplication and multi...
plication; and since eo(X,R) is non..-empty (it contains the function which
is identically zero), it is clearly a subalgebra of e(X,R). Similarlyt
eo(X~C) is a subalgebra of e(X,C).
Lemma.
in e(Xccitll) which vanish at the point -xi. Similarly~ eo(XJC) equals the
set of all restrictions to X of the functions in e(Xtl),C) which vanish at the
point
co ..
App roximotion
167
vanishes at co
We denote the set of all these extensions hy Ao. Our
hypotheses imply that Ao is a closed suhalgcbra of e(XG!;l,R) w~hich separates points and has the property that all its functions vanish at ;.(). Let
A 1 be the set of f unct.ions obtained by adding all constant fun ct.ions to
each function in Ao~ It is easy to see that .A.1 is a closed subalgebra of
e(Xc:o~R) which separates points and contains a non-zero constant func~
tion, so by Theorem 36-A, Ai equals e(XCJ), !~)~ It follows from this that
Ao consists of all functions in e(X~tR) which vanish at ct), and another
application of our second lemma shows that A equals eo(X~R),.
4
1.
2~
3.
PART TWO
Operators
CHAPTER EIGHT
.Algcbraie S11stcJ1ts
We have seen in the preceding chapters that one of the basic aims of
topology a11d modern analysis is the study of the bounded real and complex functions which are defined and continuous on a topological space X
These functions can of course be studied individuallyJ but this doesn't
carry us very far. It is desirable to consider the sets e(X,Jl) and e(X,C)
of all such functions as mathematical sys terns with a high level of internal
organization, and this program compels us to give serious attention to
their structural features. It is at this point that algebra enters the
picture; for modern algebra is essentially the result of crystallizing into
abstract form~ and studying for th~ir own sake, a f cw simple patterns of
structure which underlie many diverse po.rts of mathematics.
In Secs. 14 and 20 we defined what is meant by a linear space and
an algebra, but we did not develop the theory of these systems to any
appreciable degree. We used them only descriptively, as a convenient
means of calling attention to the fact that the points in the spaces R~ and
c~ can be added and multiplied by numbersJ and those in e(XJR) and
e(X,C) can be multiplied together as well. Our work in t}).e rest of this
book requires a deeper understanding of these systems and several
.others_. and the purpose of this chapter is to provide a concise but reasonably complete exposition of this r;iecessary background material.
The algebraic systems we discuss below~groupsJ ring8, linear spacesJ
and algebras-have been the subject of many books and innumerable
research articles. In the few page.a we devote to each, we clearly can do
little more than explain what each system isJ mention several outstanding
examples, and develop the theory to the limited extent required by our
+
171
172
Operators
39. GROUPS
We begin by considering two familiar algebraic systems~ each of
which is a group, with a view to pointing out those features common to
both which are set forth abstractly in the general concept of a group.
We firstobserve that the set R of all real numbers) together with the
operation of ordinary addition, has the following properties: the sum
of any two numbers in R is a number in R (R is closed under addition);
if x, y, z are any three numbers in R, then x
(y
z) = (x
y)
z
{addition iB associative); there is present in Ra special number 1 namely 0,
with the property that x
0 = 0
x = x for every :t in R (R contains
an additive identity element); and to each number x in R there corresponds another number in R, its negative -x, with the property that
:t
(-x) = (-x)
x ~ 0 (R contains additive inverses)~
It is equally clear that the set P of all positive real numbers, together
with the operation of ordinary multiplication, has the following corresponding properties: the product of any two numbers in P is a number
in P (Pis closed under multiplication); if x, y 1 z are any three numbers in
P, then x(yz) =- (xy)z (multiplication is associative); there is present in
P a special number, namely 1, with the property that xl = lx = x for
every x in P (P contains a multiplicative identity element); and to each
number x in P there corresponds another number in P, its reciprocal
1/x = x~j with the property that xx- 1 = r- 1x = 1 (P contains multip] ica ti ve in verses).
Each of these systems plainiy possesses many properties other than
those we have mentioneL We ignore all such properties and concentrate
our attention solely on the ones we have listed. Let us now consciously
disregard the concrete nature of the elements composing the ahove sets
and the familiar character of the algebraic operations involved4 What
remains in each case is a non...empty set which is clo~ed under an operation
possessing certain f onnal properties~ and apart from notation and
tenninology, these properties are identical in the two systems~ The
concept of a group is a distillation of the common structural form of
these and many other similar systems.
The definition is as follows. A group is a non--empty set G together
with an operation (called multiplication) which associates with each
ordered pair xt y of clements in G a third element in G (called their
product and written xy) in such a manner that
+ +
Algebraic Systems
173
.:t'
ex- 1
= x- 1
174
Operators
thiR group is only one among many of a similar kind, some of v..~hich arc
listed below. In aU t.hesc the elements arc numbers, the operation is
ordinary multiplication, the identity is 1, and the inverse of a number is
its reciprocaL
(a) The single-element set consisting only of the number 1.
(b) The two-element set {1, - 1 }.
(c) The set of all positive rational numbers.
{d) 1'he set of all positive real numbers. Observe that the negative
real numbers do not form a group, for thls set is not closed under
multiplies tion.
(e) The set of all non-zero real numbers~ The set R of all real
numbers contains a numberJ namely 0 1 which has no reciprocal, .so it is
not a group 'With respect to the operation considered here4
(f) The set of all non-zero complex numbers.
(o) The set {l, i, - l, -i} of the four fourth roots of unity.
(h) The set {z ~z7; ~ 1} of the n nth roots of unity for a fixed but
arbitrary positive integer n. Groups (a), (b), and (g) are the special
cases which correspond to choosing n equal to 1, 2, and 4. We see by this
that there exists a finite group of order n for each positive integer n~
(i) The unit circle {z: lzl ~ 1} in the complex plane~ Thia is called
the circle group.
A,gebraic Systems
175
The groups in the next two examples a.re closely related to our work
in the previous chapters. They differ from the groups described above
in that their elements are not numbers~
Excrnple 3. (a) The set Rn of all n-tuples of real nun1bers 1 the operation being coordinatcvlise addition. The identity here is
o=
(07
-x
o~
~
. . . , O) ~
(x1, x2 1
= ( ~x1:ii -X2 7
Xn) is
~ ~xn) .
v..~ith
respect t-0
{a) The set e (X tR) of aH bounded continuous real f unctions defined on a topological space X. The operation here is pointv..ise
addition, the identity is the f l1T1ction 'vhieh is identically zero~ and the
inverse of a function f is the funr.tion - f defined by (-f) (x) = -f(x).
(b) The set e(X~C) of all hounded continuous complex functions
defined on a topological space xj the operation again being pointwise
addition.
Example 4.
(a) The class of all sub:;ets of a set. l]i the operation being
the formation of svmmetric
di ff r.ren ces. 'The re a.dcr \vill rec all that t.h e
....
symmetric d ifferen ce of t V{O sets A and B is dP fined by
A
L).
(A - B) V (B - A);
and in Problem 2-3 it was sho\vn that this operation is associ.ativc 1 that
the idcn tity is the empty set 0~ and th.at the in verse of a set is the set
itselL It is interesting to note that if [] is non-emptyi theu this class
of sets doe!f not constitute a group \vith respect to the formation of either
unions or intersections.
(b) Ai:-y ring of subsrts of a set lI (see Problem 2-4), the operation
again being the formation of symmetric differences.
..
'
176
Operators
4
5 = 2. Figure 28 is a complete addition table for I 1: to find the
sum of any two numbers in the set, Jook for the first number down the
left-hand side, look for the second across the topj and observe t.heir sum
in the corresponding placP within the table4
0 1 2 3 4
0
1
2
3
4
5
6
5 6
1 2 3 4 5
2 3 4 5 6
3 4 5 6 0
4 5 6 0 1
5 6 0 1 2
6 0 1 2 3
6 0 1 2 3 4
0
1
2
3
4
5
Fig. 28.
for I 1.
6
0
1
2
3
4
5
1234)
= ( 3142
'
lvhere be]ow each of the integers 11 2, 3, 4 is placed its image under the
mapping P~ If
(1234)
4132
pq
1234)
( 2341 .
177
ing rigid motions: the identity motion I, which leaves fixed each point
of the square; the counterclockVtrise rotations R1 R', and R" about the
center through angles of 90, 180~ and 270 degrees; the reflections 11 and V
about the horizontal and vertical axes; and the reflections D and D' about
the indicated diagonals. Eaeh of these rigid mot.ions is related to a
certain aspect of the symmetry of the square, and they are therefore
called symmetries. We multiply t'vo symmetries by performing them
in succession~ beginning v..--ith the one on
the right. Acrordingly, RV is the result
v\_f.}
oi first reflecting the square about the
1
vertical axis, then rotating it counter- ~
I
/;
1
clockwise through 90 degrees. If \\""C D ',.....,2~""""""'"""'
~~~
/
. ::.: . .:.. .......... -
........ . . . .. ......
.
b
.....................
.... .
trace t h e e ff ect o f t h ese motions Y
}?;_~'f:\lll!!)l!l!!i!ii2!41Ji
following the manner in which the num.:i.~+0:~+:.:1.::_:,::/~::m
)~~ ~ ~~ ~ ~ ~ ~ ~ ~ ~ ~::: :~ ~:: ~~~~: ~ ~ ~~~c~:
bered vertices are shifted about, we see - - ~ ::0:2:T0,. <7:717
~~~= ~:-:::::: :~~:~ ~:::;;: ~~~~~~~~::..--:::::-
~~~'.~);,:~--~~~1
b.
Not only are the equations in {4) solvable in G, but their solutions are
unique. This is a direct consequence of the f o11owing cancellation law
r
178
Operators
;:::=
ae
===
ea
;:::=
a.
As the reader will observe, we have not only shown that e is an identity
element~ but we have also shown that each element has an inverse in the
required sense.
A subgraup of a group G is a non-..empty subset H of Q which is itself
a group with respect to the operation in G~ It is easy to sec that the
identity e' in II equals the identity e in G; for e' e' = e.' = e' e, and by the
cancellation law in G we have e' = e. Also~ if xis an element in H and x'
is its inverse in H~ so that xx' = x'x ::::;; e, then x' equals the inverse x- 1
of x in G; for xx' = e and xx- 1 ~ e yield xx' = xx- 1, and another application of the cancellation law in G gives x' = x- 1 By these remarks, we
sec that a non-empty subset H of G is a subgroup of G {::::)it is closed
under multipliration 1 it contains the identity e of Gj and it contains the
inverse x- 1 of each of its elements x. Since xx- 1 = e, it is equally clear
that a non-empty subset H of G is a subgroup of G {::::> it is closed under
mul tip lica tion and the formation of inverses
Many of the groups described above are subgroups of other groupsL
For instance, in Example 1 it is easy to see that (a) is a subgroup of (c),
(c) of (b)~ (b) of (d) 1 (d) of (e) 1 and (e) of (f). The subgroups of Example 7 arc of particular importance, and are called transformation groups.
If the underlying set is finite, as in Example 8, a transformation group
is often called a permutation group. In the case of Example 9, for
instance, each symmetry can be regarded as a permutation of the numbers which label the vertices of the square; e.g.J the reflection H about the
L
Algebraic Systems
179
G~~i).
s_.
1234)
qp ::;: ( 3421 .
Since pq - qp~ . S-t is non-_A_beHatL We sa\v in Example 9 t.hat R l' = D~
If 'ye nov.r compute VR~ \Ve gPt l7R = D'~ so Rl" :#- v'" ll and the group
of sy1nmetries of a square is a1so non-Abelia.1L
'Vhen the theory of groups is studied for its o\Yn sake, thr. Pmphasis
is usually placed on non-A helian groups. 'rhe prrscnt section ho,vever,
is intended ma.inly to provide a proper foundation for our \vork in ~he
rest of this chapter, and AbP1ian groups are the ones of grPatPst in1port.anee for us. In the A.belian ease, t.he multip]icative notation usrd above
is often replaced by additive notation, in ,vhich the product xy is v.rritten
x + !J and ralled the .~um of x and y. CorrPspondingly, thP idrnt.it.y is
dPnotcd by 0 instead of e and i~ caH~d the zero f.! enient (or simp1y zero);
and the inverse of x is denoted hy -x instr.ad of x- 1 and is r.al1rd 1he
negative of x. AJ so, the operation of subtraction ii:i definPd by
1
1.
Let G be a group, and show that (xy)- 1 = y- 1x- 1 for any t\vo e]ements x and y in G. Show also that (x- 1) - 1 = x for any e]ement
x in G.
180
2.
3..
4...
5.
6+
Operate rs
7.
9.
10.
11.
xa for every x
E G} .
Im=
8.
{0~1~
+.
G~
tm~IL
and define the uproductH of any two numbers in it to be the remainder left \Vhen their ordinary product is divided by m. Construct a multiplication table similar to ~~ig. 28 for the non-zero
e1ements of I b Does this set ,vith this operation form a group?
Compute a similar table for the non-zero elements of / 7. Does
this system form a group?
Introduce a symbol for e.ach element of the symmetric group 83
of degree 3, and eonstruct a multiplication table for this group.
Show that n ! is the order of SB~
Show that the group of all one-to--0nc mappings of a non-empty
~et X onto itself is non-Abelian if X has more than tv..,.o elements.
Construr.t a multiplication table :for the group of symmetries of a
square.
I Jet G and G' be groupsr A mapping f of G into G' is ea11ed a homomorphism if f(xy) = f(x)f(y) for all elements x and yin G~ Assume
thatf is a homomorphism of G into G', and prove the following facts:
(a) f (e) = e', where e and e' are the identity elements in G and G';
(b) /(x~l) = f(x)-1;
(c) /((;) is a subgroup of G';
(d) 1- 1 ( { e' l) is a subgroup of G.
If a homomorphism is one-to-one, it is called an isomorphism. If
there exists an isomorphism of G onto G' j then G is said to be isomorphic lo G'. To say that one group is isomorphic to another is to say
that they have the same number of elements and the same group
181
40. RlNGS
We have seen that the set I of alJ integers is an additive Abelian
group with respect to the opcratio n of 01d ii tary addition. It is just as
important to observe that Ii~ also cJosed under ordinary multip1icat.ion
and that multiplication is linked to addition in a way which enriches t.he
structure of the system as a whole.. The theory of rings is the theory
of such systems.
A ring is an additive Abelian group R \vhich is closed under a second
operation called multiplication-the product of two element.a x and y
in R is written x~in such a manner that
(1) multiplication is associative, that is1 if x, y~ z are any three
elements in R, then x(yz) ~ (xy)z; and
(2) multiplication is distributive, that isJ if x, y, z are any three elements in RJ then x(y
z) = xy
xz and (x + y)z = xz + yz.
In other words, a ring ia an additive Abelian group whose elements ean
be multiplied as well as added 1 and in which multiplication behaves
reasonably with respect to itself and addition. We note particularly
that multiplication is not assumed to be commutative,
Many of the additive Abelian groups listed in the previous section
a.re also rings with respect to natural multiplications.
Each of the following rings consists of numbers, and addimultiplication are understood to have their ordinary meanings.
The single-element set containing only the number 0.
The set I of all integers.
The set of all even integers.
The set of all rational numbers.
The set R of all real numbers.
Example 1.
tion and
(a)
(b)
{c)
(d)
{e)
182
Operators
{/)
(g)
are both integers.
Example 3. Any ring of subsets of a set ll, \Vith addition and multiplication defined by A + B = A 6. B and AB ==== A n B (see Problems
2-3 and 2-4). The fact that a ring of set~ is a ring in our present sense
is the reason :for the name ring of sets.
Example .4. Let m be a positive integer~ and Im the set of all non-nega ..
tive integers less than m: Im = {O~ l, .
~ m - 1 l ~ If a and b are
tv10 numbers in I"'~ \Ve define their l~sum" a
and Hproductn ab to be
the remainders obtained 'vhcn their ordinary sum and product are
divided by m. If 1n is 6,, for instance,, then I = { O~ 1, 2 1 3, 4, 5}, and
v.,.c have 3
4 = 1 and 2 3 = 0.. Im "~th these operations is called
the ring of integers mod m~
T
+ {)
I
\Ve now consider a general ring ll~ l\.1any fami1iar facts from ele
mentary algebra are valid in R. Nevertheless, c~ch must be proved on
its o\vn merits from the axioms or previous theorems~ for one never knows
\\,.hen something \\'hich appears to be uohviousi~ will turn out to be false.
\VP have a]ready defined subtraction in any additive Abelian group
by x - y = x + {-y)i and it is easy to shov.,. that such statements as
~ (x - y) = y - x and x ;:::=- y {::::;} x - y ;::;; 0 arc true~
Problem 39-1
assures us that - (-x) = x. And so orL Properties of this kind relate
to the additive structure of R and are comparatively trivial. It is only
'vhen we consider multiplication, and its relation to addition 1 that \Ve
begin to encounter some interesting situ_ations.
We i1lustrate this by proving that xO = 0 for any element x in R~
First~ we
have
xO
+ xO =
x(O
+ 0)
= xO.
Our next step is to add ~xO (the negative of xO) to both sides of this on
the right, ''Thich gives
(xO
+ xO} + (-xO)
\Ve
xO
+ (-xO);
Since the sum of any element and its neg&:tive is 0, this collapses to
xO
'?.thich yields
+0 =
xO
0,,
04
Algebraic Systems
183
Similarly, Ox ~ 0 :for any x~ We see in this "Tay that the product of two
elementrs in a ring is zero whenever either factor is zeror We have given
the detai1 s of the proof of this seemingly obvious fact because, surprisingly
enough~ its converse is false~ It can perfectly well happen (and it often
does happen) that the product
two non-zero elements in a ring is .zero .
The simplest examples of this phenomenon are found in the rings of
integers mod m \\"here m is greater than 1 and is not a prime number.
We have already sccn:t for instance, that in /6 the product of the two
non-zero elements 2 and 3 is 0. An element z in a ring such that either
zx = 0 for some non-zero x or yz = 0 for some non-zero y is called a
divisor of zero. In any ring with non-zero elements, the clement 0 itself
is a divisor of zero.
By using distributivity and the fact that the product of two elements in the ring R is zero when either factor is zero, it is easy to verify
the following familiar rules oi calcu] ation: x (- y) = ( - x )y = - xy,
(-x)(-y) = xv:t x(y - z) = xy ~ xzj (x - y)z = xz ~ yz~ As a simple consequence of the last t\\.'O of these rules, vte have the following
rancellation la.v.r: if a is not a divi8or of zero, then either of the relations
ax =:; ay or xa = ya implies tha.t x ;;:::: y.
R is called a co1nmutafive ring if xy = yx for all clements x and y.
Every ring in the above Jist of examples is commutative. We shall
encounter some non-commutative rings of very great importance
in Secs4 44 and 45.
If the ring R contains a non-zero element 1 with the property that
xl = Ix = x for every x, then 1 is called an identity element (or an identity) 1 and R is calJed a ring with identity~ I:f a ring has an identity, then
it has only one. In Example 11 only (a) and (c) have no identity4
In
both rings described in Example 2J the identity is the function which is
identically 1. A ring of subsets o:f a set U has an identity ~ there exists
a non-empty set in the ring which contains every set in the ring; in particular 7 if ll is non-empty and the ring is a Boolean algebra of subsets
of U, then the set lJ itself is the identity. rrhe ring Im has an identity
<=>m > 1.
Let R be a ring \~:i th identity+ If x is an element in ll 1 then it may
happen that there is present in R an element y such that xy = yx ~ I~
In this case there is only one such clcment 1 and it i~ ,vritten x- 1 and
caned the inverse of X+ If an element x in R has an inverse~ then x is
said to be regular. Elements ~Thich are not reguJar are <~aUed singular.
Regular clements are often called invertible elementsp or 1ion-8'ingular
elements. The Plement 0 is alv... ays singu]ar in a ring "~ith identity~ and
the clement 1 is always regular~ In Example lb~ 1 and ~ 1 are the only
regu1ar elemPnts; in ld to 1/1 aU non-zero elements are regular; and in
lg, the regular elements are 1~ i, -1, and -i.
or
184
Operators
Problems
l~
2.
3.
4.
5..
6.
Jn the ring of even integers, why is 2 neither regular nor singu] ar?
Consider the ring of all subBets of a non-empty set lJ., "'-it.h the
operations dPfined in Example 3. What are the regular clements in
this ring? "\Vhat are the singular clements? What are the divisors
of zero? llnder what conditions is this ring a field?
In each of the following rings of functions defined on the closed unit
interval [O, t]J descrjbe the regular clements~ the singular elements~
and the divisors of zero:
(a) an real functions;
{b) all continuous real functions;
{c) all bounded continuou!:l real function~~
What changes are Ilecessary in these descriptions if [0~11 is replaced
by (Orl)?
I...et R be a ring with iderttity, and shov{ that any divisor of zero in
R is singuln.r~
Let R be a ring ,vith identity} and show that Risa division ring<=> the
non-zero elements of R form a group \vith respect to multiplir.at.ion~
Show that the ring Im is a field {:::::> 1n is a prime number. (If int: in
sho\\ring t.hat Im is a field if m is primc:t shO\V first that in this ca.se
Im has no non-zero divisors of zero 1 so that the non-zero elements
of I"' are closed under multiplication and the cancellation law
ax ~ ay ==> x = y holds for these elements; now apply Problem 39-2
and Problem 5 above.)
J..ict R be a rjng.
i E I==> xi and ix
Algebroic Systems
185
m ===
{. ~ ~ -
I(X) = {f:f(x)
0 for every x c X}
is an ideal in this ring~ It is easy to see that I(X) equals the full ring
'vhcn X is the empty set and equals the zero ideal when X = [O, l].
As a final example, we consider the ring of all subsets of an infinite set UJ
and we o bservc that the class of all finite su bscts of U is a proper ideal in
this ring.
Some rings have a multiLude of non-.trivial ideals, while others have
none at alL In gPneral, the structure of a ring is very closely ronnected
with the ideals in it+ The following theorem illuRtrates this point.
Theorem A. If R is a commutatitte ring with identity~ then R is a field ~
it has no non-triv1"al ideals.
PROOF. We first assume that ll is a field, and we show that it has no
non-trivial ideals~ It suffices to sho-.,v that if I is a non-zero ideal in R~
then I = R. Since I is non-zerot it must contain some element a ~ 0.
R is a field, so a ha::; an inverse a- 1, and I (being an ideal) contains
1 == a~ 1 a. Since I contains 1, it also contains x = xi for every x in R 1
and therefore I = R.
We now assume that R has no non-trivial ideals, and we prove that
Risa field by shomng that if xis a non-zero element in R, then x has an
inverse. The set I == {yx :y e. R} of all multiples of x by elements of
R is easily seen to be an ideal.. Since I contains x = Ixi it is a non-zero
ideal, and it consequently equals R. We conclude from this that I
contains 11 and therefore that t.here is an element yin R such that yx = l+
This shov{S that x has an inverse, so R is a field.
186
Operators
construct other rings which are associated with the first in a natural
way. We explain how this is done~
Let I be an ideal in a ring R. We use I to define an equivalence
relation in R as follows: two elements x and y in R arc said to be con ...
gruent 1nodulo I~ written x = y (mod I), ii x - y is in I. Since only one
ideal is under consideration~ we abbreviate this symbolism to x
y.
It is easy t.o verify that we actually do have an equivalence relation here,
that iB, that the follo\\dng three conditions are satisfied:
(1) x = x for every x;
(2) x = y ===9 y = x;
(3} x ~ y and y = z =:- x
z.
Furthermore_, congruences can be added and multiplied, as if they were
ordinary equations:
(4) x1 = Zi and Y1 = Yi~ xi+ Yi
x~ +Yi and X1Y1 = X2Y2
The hypothesis of (4) is that x1 - x2 and Yi - Y2 are e]cmcnts of I, and
since I is an ideal, the conclusions follow at once from
(xi
and
+ Y1)
X1Y1 -
- (x2
+ y,.)
X~Y! =
X1Y1 -
XfY':l.
(y1 - y'J.)
X1Y2
x'A)Y2~
{y;y
xl
fy:y {y :y -
x e:
Jl
Algebraic Systems
187
Our next step is to construct a ne\v ring_, which \ve denote by RII and
caH the quotient ring oi R with respect to 14 The elements of the ring
R/ I arc the distinct cosets of the form [.r] (or x
I)+ All that remains
is to define the manner in \Vhich these cosets arc to be added and multi..
plied and to verify the fact that. \\fe do indeed obtain a ring. The
definitions arc as follo\vs:
[x]
+ {y] =
and
[x
y]
[xy]a
In other \Vords, we add and multiply t\vo cosets [x] and [y] by first adding
and multiplying the representatives x and y, and then by forming the
cosets \vhich contain x + y and xy+ It is necessary to make certain that
these arc legitimate definitions, that is, that the resulting cosets {x + y]
and [xy] do not depend on the particular representatives x and y chosen
for the cosrts [x1 and fy]~ ~ro this end, we take t\ {0 other representatives
of the same t~~o c:osct~, Lr.t t\\~o other elements X1 an<l y i of R such that
xi - x and Y1 - y. \Ve \Vant to satisfy ourselves that
11
rx + y .J
+ yJ
{xy], or equivalently~ that x1 + Yl
1
rx
and [x1y1] ==
- x
y and X1Y1 = XY~
Since this is prerisely the content of property (4) above, \Ve do have
valid dcfini t.ions for our ring opera t.ions in R /I. \Ve omit the detailed
vcrifir.at.ion of t.hP fact that Ii I I \vi th these operations actually is a ring,
rernarking only that the zero element of this ring is {01 = 0 + I = I and
that the negative of a typical element ~x] = x + I is [ -x] = (-x)
I.
It is easy to see that ll/ I is eommutative ii fl isi and that if R has an
identity 1 and I is a proper ideal~ then ll/ I has an identity 1 + I.
Wr summarize t.he results of this discussion in the follo\ving theorem.
Theorem B.
(x +I)
and
(x
(y +I) = (x
+ l)(y + J)
+ y)
+I
xy +I,
then these cosets constitute a ring dennted by R/ I and called the quotient
188
Operators
+ y)
f(x)
f(y)
f(xy) = f(x)f(y).
=
A homomorphism of one ring into another is thus a mapping of... the first
ring into the second ,vhich preserves the ring operations. It is easy to
see that f preserves zero in the sense that f(O) = 0 ;1 for
f (0)
+ f (0)
= f (0
+ O) = f (0),
+ f( ~x)
= f(x
+ (-x))
= /(0)
Similarly,
O~
{x:x e R an<lf(x)
=OJ.
It is easy to see that K is an ideal in R, and also that K is the zero ideal
in R ~ f is an isomorphism. W c leave these verifications to the reader.
In a sense, the size of the kernel K is a measure of the extent to which
f fails to be an isomorphism.
What is the real significance of homomorphism8~ homomorphic
images, and kernc1s? A full ans\ver t.o this questjon v;~ould carry us into
the utmost reaches of the general theory of rings~ """"here \\c havr. no
intention of treading. 'Ve ~will, hotrever, attempt a brief and ncce8sariJy vague partial ansv{er. Suppose that Risa ring ,~those f e.atures are
unfamiliar, v..Those structure is unknown. The question confronting us is,
'Wbat is the nature of fl? Andj as is often the r.asc in mathematics, if
we can state adequately \vhat this question means~ vire -aill have taken
a long step toward ans,vering it. Suppose now that Il' is a homomorphic
image of R and that R 1 is a wcll~kno'"~n ring which is intuitive]y familiar
We use the s ym ho I 0 here to designs. t.c two different elem en ts: the z oro in /~
and the zero in R". This convention is customary, and it aavea mor.c trouble than it
1
ca.uses~
Algebraic Systems
189
+ K~
g(x
and
+ y) = (x + y) + K
g(xy) = xy
+K
+ K) + (y + K)
== g(x) + g(y)
= (x + K)(y + K)
;;::: (x
g(x)g(y).
190
Operators
images.
If I is an ideal in a ring R, then its properties relative to all of R are
reflected in corresponding properties of the quotient ring R /I, and many
aspects of the study of R depend on the presence in it of ideals whose
corresponding quotient rings are simple and familiar.
In order to illustrate this fundamental principle, we introduce the
following concept. An ideal I in a ring R is said to be a m.aximal ideal if
it is a proper ideal which is not properly (',ontained in any other proper
ideal. Our next theorem is an immediate consequence of this concept and
Theorem A.
C.
1.
2..
3~
Algebraic Systems
191
+
+
. . -.~ In some approaches to th~ theory of linear spaees, the system of sc:U.a.rs is
allo""~d to be a.n arbitrary fieldT This degree or generality is unneeessn.ry for our
purposes; and since unuecesaa..ry generality is und~ira.Lle, we Emit ourselves accord~
ing]y. Even further, the system of scalars can be taken to be a.n arbitrary ring. In
this case!, one spea.ks of a maduh in~ten.d of & lilleat space. Modules are of great
importance in the st ru ctu re theory of rings.
192
Operators
The set R of all real num hers~ "'. i th ordinary addition and
multiplication taken as the linear operations, is a real linear space ..
Ex.amp( e 1
Example 2. 'TJ1c ~et R"' of all n-t.uples of real numbers is a real Jinen.r
space under the f ollo,\ring eoordina.tewisc linear operations: if
and
then
and
= (xl + Yi, X2 + Y2 1 ~
ax = (ax1, ax2, ~ .
ax&).
+Y
Xn
+ Yn)
and
Example
+ g(x)
6.
rfhe set e(X,C') of all bounded continuous complex functions defined on a topological space X is a complex linear space with
respect to the pointwise linear operations defined in Example 3.
Example
space~
operations defined in P.
Exompre 9. A linear space may consist solely of the vector O~ with
scalar multiplication defined by a o = o for all ar We refer to this as
the zero space 1 and we aJways denote it by {0 J.
Argebraic Systems
193
These examples are typical of the spaces which will concern us and
give ample scope for the illustration of all the important phenomena.
There are a number of other linear spaces of great interest, and we mention some of these from time to time in later chapters+ For the present,
however~ the above list will suffice.
We saw in Sec+ 14 that in any linear space "?{e have a~ 0 = o~
0 . .x = o~ and ( -1).x = - XT It is also easy to sho"r that ax = 0 ~a = 0
or x = 0; for if a ~ 0, then multiplying both sides of ax
0 by a- 1
yields a- 1 (ax) = a- 1 o~ {a- 1o:)x =:::: o, lx === o~ and finallyt x = 0.
We now turn to the general theory oi an arbitrary linear space L~
A non-empty subset M of Lis called a Bubspace (or a linear subspace)
of L if Mis a linear space in its o'~n right with respect to the linear operations defined in L.. This is clearly equivalent to the condition that ;.l{
contain all sums, neg.a tivl1st and scalar mnltiples of its elem.en ts; and
since -x = (- l)xt this in turn is equivalent to the condition that M be
closed under addition and scalar multiplication. If the subspace Mis a
proper subset of L, then it is called a proper subspace of L+ The zero
space [0} and the full space L itself are always subspaces of L+ Among
our examples, P n is a subspace of P for each positive integer n, and
P is a subspace of e[O~l]~ Also, the following are easily seen to be sub ...
spaces of R1 :
;:::=
and
= i (.x 1,
0, 0)}, M "i
M ~ ~ ~ (0, X2, xa)}, M 6
Mi
== i (0,
=
{ (x1~
x2~ 0)} 1
Ma
0, Xa) L, M 6 ==
{ (0, 0, xi) l ~
{(x1, X2~ O)}.
The subspaces Mi, Mi, Ml are usually called the coordinate axes in solid
analytic geometry, and M ,1 M 6J M 6 are called the coordinate planes~
The most general non-zero proper subspace of R' is a line or a plane
through the origin.
If M is a sube;pace of L, then-just as in the case of an ideal in a
ring~wc can use M to define an equivalence relation in L as follows~
x ~ y (mod . ~)
. means that x - y is in AI. The discussion leading up to
Theorem 41-B can be repeated \vithout essential change (but with considerahle simplification) to yield the concept of the quotient spac.e L/ M of
L v.,..ith respect to M. We give a formal statement of the basic facts in
the following theoremL
Theorem A. Lel M be a subspace of a linear .Bpace L~ and let the coset of
an element x in I,J be defined by x
M = {x
m:m e, M}. Then the
distinct cosets form a partition of L; and if addition and scalar multiplication are defined by
(x
and
+ M) + (y + M)
a(x + M)
(x + y)
= ax + M,
+M
194
Operators
then these coset8 cunstituu a linear space denoted by L/ M and called the
quotient space of L with respect to M. The origin in L/ M is the coset
0 + M
M, and the negative of x + Af is (-x)
+ M.
;:;;:r;
The proof of this theorem is routine, and we leave the details to the
reader.
It is worth remarking that the concept of a quotient space has a
simple geometric intPrprcta.tion~ To bring this out most clearly, we let
L be the linear space R'}. and A-f the suhspacc indicated in Fig. 31. If we
(~+M)+(y+M)
=ii: (
Fig_ 3 I.
x+y) +M
Addition in a quotient
spae~.
think of the vectors in L as the heads of arro,vs ~Those tails are at the
origin, then the non-zero proper subspace .i1J is a ~tra.ight. Jine through the
origint a typical coset x + kl is a line parallel to M ~ and L/ "ftf consists of
all lines parallel to j,f We add tVo coscts x +Mandy + M by adding
x and y and hy forming the line (x
y)
M through the head of x
y
and paraHe1 to !f. Scalar multiplication is carried out similarly
The subspaces of our linear space L can he characterized conveniently
as follo\\'S. If l Xt, x~, . ~ . , Xn} is a finite non-empty set of vectors in
L~ then the vector
x : : : :; a1X i + a2X'1 +
a~x"
r
A1gebraic Systems
19.S
M
N of all sums of the form x + y, where x e: M and y E N4 Since
M and N are subspaces, it is easy to see that M + N is the subspace
spanned by all vectors in !JJ and N together, i.e., that ill + N = {MU NJ.
If it happens that M
N ~ L~ then ~.ve say that L is the sum of the
subspaecs M and N. This means that each vector in L is expressible as
the sum of a vector in Al and a vector in N 'The case in v-""hich even
more is truc--namely~ that each vector ~ in L is expressible un-iquely in
the form z ::=: x + y, with x M and y f N-will be of particular importance for us. In this rase V{C say that Lis the direct sum of the subspaces
Af and N _, and we syn1b0Jizc this statement by v{riting L = ltf N ~
The concept of a direct sum can easily he broadened to allow for three
or more subspaceEL If M 1, M -i, ~ ~ . , M ~ (n > 2) are subspaces of L,
then the statement that Lis the direct :ium of the Jf /S:-written
L=M1Ea1f!EB
~means
~.
Mn
196
Operators
R3
:;:
M1
e Mi m Ma=
M1 EB M,
= M2 EB M.5 = Ma
fl} MB~
2.,
3.
4.
M1
and M1 fl
== M2
n M3
== {O}
lf
Al gebroic Systems
197
this clc.arly means that if Eq. (I) holds for certain scalar coefficients
cq, 02, . . , ani then a11 these scalars are necessarily 0. In other
words, S is linearly independent if the trivial linear combination of its
vectors (vrith all scalar coefficients equal to 0) is the only one ,vhich equals
0 ~ and it is linearly de.pendent if some non-trivial linear com bina t.ion of its
ve(~tors equals 0. In either cu.se, as 'VP know, the vectors in the subspace
lSJ spanned by S are precisely the linear combinations
(2)
= f31X1 + f12x~ +
(3)
(a~
- /3~)x2
+ (Un
t'~) Xn ~ o~
XI
+0
X2
+0~
Xn
Theorem A.
198
Operators
inclusion~
Theorem B.
h.a."J a basis.
.~pace
Algebraic Systems
199
>
1 (since x 1 '#- O) and Eq. (I) can be rewritten in such a way that Z:i is
exhibited as a linear combination of xh ~ . . , Xt-1 (with coefficients
-a1/a1, ~ . ~ , - CWi-J/a;)~
Theorem D..
B 'J
f I = I f h f 'l,
{ 1
for some positive integer m. We must now shov.T that m and n are equal,
nnd this ~Tc do as follo\VSr Since the e/s span J,J, f 1 i~a1inear combination
of the e/s, and the ~et sl ;:::= { f b e !) e2' ~ ~ en l is 1incarly dependent.
'Ve know by Theorem(~ that one of the e/s, say ei:~~ is a linear combination
of the vectors in S1 'Which precede it. If '"'Tc delete e1 from 81,, then the
remaining set Ssi. = If 1 ~ e 1~ . . . , e-i~-1, e,o+ 1~ . ~ . ~ e"} still spans 4
Just as before~ f?. is a linear combination of the vectors in '"-9'-i~ so the set
Si = {f 1, f 2~ ei, ~ . . , e.-(1-1, ei~+ 1~ . ~ , en} is linearly dependent.
Another application of Theorem C shows that some vector in 8 3 is a
linear combination of the preceding ones; and since the // s ate linearly
independent~ this vector must be one of the e/s. If 've delete this vector,
then the remaining set again spans L. If we continue in this way, it is
clear that 've cannot run out of e/s before the f/s are exhausted; for if we
do, then the r~maining f/s are linear combinations of those already used,
which contradicts the linear independence of the //s~ This shows that
n is not less than m, or equivalently, that m < n. If we reverse the
ro1es of the e/s and f/s, then precisely the same reasoning yields n 5 m~
from which we conclude that m = n~
r
number)~
200
Operators
-Bi
+ a2fit +
a.1fi1
+ aR/i..
Fu rt.her r every f; occurs in at least one such expression, for if a certain o net
say fj(j, does not, then sine e B 1 is a basis, L. is a linear combination of cer
tain e/s, and therefore of certain f/s ~ / 1.-which contradicts the fact
that the // s are 1ine.arly independent. This process associates with
each ei a finite non-empty set F i1r: of //st and in such a way that B2 =
Uo:l1B 1 F~i
Let n1 and n2 be the cardinal numbers of B1 and Bt, and let
n be the cardinal number of the indicated union. It follows from the
above set equality that nt = n, and Problem 8-10 shows that n :::; ni,
so n2 s n1. If we reverse the roles of the e/s and fjJs, then in the same
manner we obtain n1 < n2, from which we conc]ude that n1 ~ n2.
These theorems enable us to define the dimension of an arbitrary
linear space L~ If L = {O} then it is said to be 0--dimensional, or to
have dimensian 0; and if L . {0 L then its di1nension is the number ol
elements in any basis. A linear space is called finite-dim-ensonal if its
dimension is 0 or a positive integer, and i'nfi.nt"te-dimensionaI.. otherwise+
We can now justify the usual practice of calling R" and Cn n-dimensional
spaces by exhibiting the following n vectors as a bas.is for both spaces:
j
6t
= (1,.
es
~ (O~
o, o, . . , 0)
1, O, . . . , 0),
(O, O, 1, . .
0),
=
t
ei
e. = (0, 0, 0,
1).
Algebraic Systems
201
linear space is isomorphic t.o another js to say~ in effect, that they are
abstractly identical with respect to their structure as linear spaces.
NO\\r Jet L be a non-zero finite-ditnensiona] linear space of dimension
n, and let. B = {e1 e2 , . . , e1l l be a basis for L \\'ho8e elemPnts are
written in a. definite order as indi,~ated by the subscripts. Each vector
x in L is uniquely expressibh.l in the form
This theorem can easily be extended to the case of an .arl)itrary nonzero Jinear space~ We begin by deserihing the concrete JinPar spaces
which wiH replace Rn and Cn in our ge11eralized for1n of Theorem F.
Let X be an arbitrary non~cmpty sct 1 and de-note by J...(X) the set of a.11
scalar-valued functions defined on X which vnnish oulsidr finite sets~
Addition and scalar multiplication for such functions a.re understood to
be defined point.wise, and T-'(X) is obvioufdy a non~zrro linear Rpacc which
is real or complex according 38 the functions con~idered a.re real or complex~
Our purpose is to shovl that these spaces are universal model~ for
non-~ero linear spaces, in the sense that an arbitrary non-zero Jineat
space L is isomorphic to some L(X). We start by choosing a basis
B = i ed for Lr and "\ve let B be the set Xr We next establish an isomorphism of f.; onto l.i(B) by making correspond to each vector x in L a
scalar-valued function. f~ defined on B. If x = 0, then f'J; is defined by
f :t!(ei) = 0 for every ei in B If x r=: O, it is uniquely expressible in the
4
form
202
Operators
Problems
l.
2.
3.
is also a b.asig ~
Let ill he a subspace of a linear space I-', and show that there Pxists a
subspace N such that L = ~M" ffi N. Give an example for the case
in which L = R'l. to show that N need not be uniquely determined by
M4
4.
5.
If A! and N are subspaces of a lin car space l.1 ~ and if I.J :::; ill ffi NJ
sho'" that the mapping y ~ y + ~f which sends each y in N to
y +Min L/1'1 is an isomorphism of 1V onto L/lf.
Denote the dimension of a linear space L by d(l-1)~ If L is finitedimensiunal, and if M and N are subspaces of I..1, prove the fallowing:
(a) d(ftf) < d(L)t and d(M) = d(L) ~ !f = L;
{b) d(M) + d(N) = d(Al
N)
d(M f\ N);
(c) if L = M + N~ then L = J1 ffi N => d{L) = d(M) + d(N);
(d) d(L/ M) = d(L) - d(M)~
If L and L' are linear spaces, show that L is isomorphic to L' ~ they
have the same scalars and the same dimension~
6..
Algebraic Systems
203
+ y)
+ T(y)
= 7,(x)
or eq uivalcntly, if
'l~(ax
+ {3y)
and
:;::: aT(x}
T(ax)
aT(x),
+ {3T(y)~
A linear transformation of one linear space into another is thus a homomorphi~1n of lhe first spare into the secondt for it is a mapping which
preservE's the linear operations. 7' also preserves the origin and negatives, for '.f(O) = T(O O) ~ 0 7'(0) ~ 0 and
+
T(-x)
T({-J)x)
(-l)T(x)
-T{x)T
The importance of linear spaces lies mainly in the linear f..ransforma-tions thry carry, for vast tracts of algebra and analysis, ,vhen placed in
their proper contiext~ reduce to the study oi linear tran~f or ma tions of one
linear space into another. The theory of matrices, for .instance, is one
small corner of this subject~ as are the theory of certain types of differentia1 and integral equations and the theory of int~gration in its most elegant modern form.
ln the f ollo,ving examples \Ve leave l t to the reader to shOl that eaeh
mapping dP.srribed actually is a linear transformation.
Example 1. ,V. e co nsidrr the linear space R~ t and each Ji near t.ran.sf orma ti on mentioned is a. mapping of ll 2 into itself.
(a) T1((x1~x2)) = (ax1iax2), 'vhere a is a real number. The efir.ct.
of T 1 is to multiply each vector in R 2 hy the scalar a.
(b) 1\i( (x1,x2))
(xz,x1). T2 reflects ll 2 about thP. diagonal Jlnip,
X1
X2.
(c)
(d)
1,a( (x1,x2))
T t ( (x1,x2))
is
ch~ar!y
Example 3.
I (f) = /
1
1
f(x) dx
2a.
Operators
+ U) (x)
.:=
T(x)
+ U(x)~
(I)
aT(x)~
a~
(2)
Theorem A. Let L and [;, be two linear S'paces with the sam.e system of
.~cal.ars.
7 hen the set of aU linear transformatians of L into I/ is itself a
linear space u;ilh respect to the linear operations defined by b'q8L (I) and (2).
1
next
chapter~
\Ve assume, then~ that V{e have a single linear space L, and we consider the Ji near space of all linear transformn tions of I~ in to itself. W P
usually speak of these as linear transformations on I..J. The most important feature of this situation is that if T and U are any t.wo linear transformations on Lt then we r.an dPfine their prnduct TCI by means of
(TU) (x)
T( Cl(x)).
(3)
(TU)V.
(4)
and
+
+
T( U
V)
(T
U) V
= TU + TV
= TV+ UV,
(5)
(6)
(aT) U
T(aU).
(7)
Algebraic Systems
easy~
205
+ U)V](x) = (T + U)(V(x))
T(V(x)) + U(l'(x))
= (TV)(x} + (UV)(x}
~ (TV + UV)(x).
;;=
and
= xD(p) = x dp
dx
= x dp + p,
dx
7'!
(8)
IT = T
al(x)
===
ax
It is not difficu;I t to sbo w that when '1 is non-singular, then the mapping
T- 1 is also a linear transformation on L~
A particularly important type of linear transformation on L arises
as follows. Let TJ be the direct sum of the subspaces Al and N~ so that
L = M El) N. This means, of course, that each vector z in L can be
written uniquely in the form z = x
y ,vitb x in M and yin N. Since
x is uniquely determined by z, Vt'"e can define a mapping E of L into itself
1
206
Operators
by E(z)
z~ E is easily seen to be a linear t.ransforrnation on L, and it
is called the projection on ft! along N. I~""igure 33 indicates the geometric
reason for this terminology. The most significant property of Eis that
it is ide1npotent, in t.he sense that E 2 = I!); for since x = x + 0 is the
representation of x as the sum of a
R2
vector in JI and a vector in N, 1\... c
have
-;:::=
Et{z)
(EE) (z)
n~(n~(z))
~ n~(x)
= x
E(z).
(I~'
- I!}) (z)
=
O(z)
A! gab ra i e Systems
linear transformation on 4
the equation
'2<J'1
(I - E) 1 = (I - E)(I - E) =I - E - E
+E
shows that Eis a projection =>I - Eis a projection; and ~,.e know that
ii E is the projection on M along NJ then I - Eis the projection on N
.along M~ and conversely. We make one further comment on these
matters: if M is a given subspace of L, then by Problem 43-3 there cer...
tainly exists a projection on Jl.f and along some N; but since there may
be many different subspaces N such that L = l\f E9 Ni there may also
be many different projections on JI (and along various N's).
Problems
1.
2.
3.
A.
5.
6.
7~
8.
9.
10.
Show that the mappings defined by Eqs. (1) to (3) are linear
transformations+
Sho\v that the linear transformations T2 and T.a defined in Exam.ple 1
do not commute; that is, shovl that T2T3 ~ TaT2.
If D and J..f are the linear transformations on the space P defined
in the wxt, show that Dllf = MD +I and (MD) 2 = 1lf 2D 2 + 1\lD.
Let T be a linear transformation on a linear space ~and show that T
is non-singular{::::} there exists a. linear transformation T' on L such
that TT' = T'T == I.
Let T be a linear transformation on a linear space L 1 and prove
that T is non-singular # T (B) is a basis for l.1 whenever B is~
Prove that a linear transformation on a finite-dimensional linear
space is non~singular (::::}it is one~to--0ne ::::>it is onto4
Show that the set of all non-singular linear transformations on a
linear space L i" a group with respect to multiplication. If L is
finite-dimensional with dimension n > o~ this group is called the
full linear group of degree n.
If L and L 1 arc non-zero linear spaces (both real or both complex),
prove that there exists a non-zero linear transformation of L into L'.
Let I"' be a linear space, and let x and y be vectors in L such that
x r== 0.. Prove that there exists a linear transformation T on L such
that T(x) = y. If y is not a scalar multiple of x, prove that there
exists a linear transformation 'Jl' on L such that T 1 (x) === 0 and
T'(y) ~ 0.
Let Land L' be linear spaces with the same scalars, and let T be a
linear transformation of L into L 1 The null space of T, namely1
{x: T(x) = 0 L and its range, { T(x) :x e L}, are clearly subspaces
of L and L'. The nullity of T, denoted by n(T), is the dimension
208
Operators
of its null space, and its rank r(T) is the dimension of its range. If
Lis finite-dimensional~ prove that n(1 + r('.f) == d(L)~
If E is a projection on a linear spare L, sho\v that its r:t.nge equals
the set of all vectors \vhich arc fixed under E; ie~, shO\\' that
(E (z) :z e L 1 = {z : .K (z) = z J .
1
11.
45. ALGEBRAS
A linear space A is tailed an algebra (see Sec. 20) if its vectors can
be multiplied in such a way that A is also a ring in which scalar multipli ...
cation is related to multiplication by the follo,ving property:
a(xy)
(ax)y
x(ay)~
Linear
Algebras
spaces
Fig.
!34~
which the major algebraic systems defined in this chapter are related to
one anotherr
Since an algebra is a linear space 1 all the ideas devcl oped in Secs. 4 2
and 43 arc immediately app1icable. Some algebras arc real and some
are complex 1 .and every algebra has a well-defined dimension. Furthermore1 since an algebra is also a ring, it may be commutative or non-commutative, and may or may not have an identity; and if it does have
an identity, then we can speak of its rcgu1ar and singular e1ements. A
division algebra is an algebra 'vith identity which~ as a ring, is a division
ring~ A subalgebra of an algebra A is a non-empty subset Ao of .A. 'vhich
is an algebra in its Ol-Vn right with respect to the operations in A. This
condition evidently means that A 0 is closed under addition, scalar multiplication, and multipli<.:ation .
Algebraic Systems
209
Example 1
(a) The real linear space R of all re.al numbers (see Example 42-1) is a commutative real algebra \vith identity if multiplication is
defined in the ordinary way. 1;he reader vml observe that 8C alar
multiplication is indistinguishable from ring multiplication in this system.
(b) rfhe complex linear spuce (} of all complex numbers defined in
Example 42-4 is a commutative complex algebra with identity if multiplication is defined as usual. Again we see that scalar roul ti plication
and ring multiplication are the same.
to
Example 2. (a) The real linear space e(XJR) (see Example 42-3) is a
com mutative real algebra with identity if multiplication is defined
poin tu. ise.
(b) The complex linear space e(X~C) defined in Example 42-6 is a
commutative complex algebra with identity with respect to pointwise
multiplication.
chapters+
Problems
1.
210
2.
3.
4~
Operators
CHAPTER NINE
Ha11aclt Spaces
We have already seen~ in Sec. 14, that a Banach space is a linear
space which is also, in a special way, a complete metric space. ..fhis
combination of algebraic and metric structures opens up the possibility
of studying Ji near transformations of one Banach space into another
v-.,..hich have the additional property of being continuous.
1\1ost of our work in this chapter centers around three fundamental
theorems relating to continuous linear transformations. The HahnBanach them-em guarantees that a Banach space is richly supplied with
continuous linear functionalst and ma.kes posHiblc an adequate theory of
conjugate spaces. The open mapping theorem enables us to give a satis
factory description of the projections on a Banach space, and has the
important closed graph theorem as one of its consequences. We use the
uniforni boundedness thertrem in our discussion of the conjugate of an
operator on a Banach space, and this in turn provides the setting for our
treatment in the next chapter of the adjoint of an operator on a Hilbert
space.
Virtually all this theory had its origins in analysis~ Our present
interest, however~ lies in the study of form and structure, not in exploring the many applications of these ideas to specific problemsL This
chapter is therefore strongly oriented t.oward the algebraic and topological aspects of the matters a.t hand~
21T
212
46~
Operators
~lxll
(1)
(2)
- 11YID
y)[j ==
nx -
Yll,
which together 'with (2) yields (1). We now prove (2) by observing that
Hx!I = I (x ~ y) u!J < ~Ix - Yll + \lYl1- The main conclusion we draw
from (1) is that the norm is a continuous function~
z ~ Hx.. I~ ~ 11xU.
that I l x~ll - HxH t < Hxn
x" ~
and
x~ ~ x and Yn
a,. ~ a and
-4
y ~ x"
Xn --t
+ Ya ~ x + y
Banach Spaces
213
l[(xn
+ Y~)
- (z
+ y)I[
~ fl(x~ - x)
+ (Yn
- y)[]
5 flx11 - x[j
+ ~ly. - y!f
and
l!a~x~
axl[ ~
IJan(Xn
~ x)
(a~ ~ a)xH
Our first theorem exhibits one of the most useful ways of forming
new normed linear spaces out of old ones.
Theorem A. Let M be a ~losed linear subspace of a nor-med linear space N.
If the norm of a coset x
M in the quotient space N / }.{ is defined b-y
+
Hx + Ml1
=:::;
inf {llx
+ml! :m 5 ML
(3)
+ }.{:;::::;
IJYm
Ynli == [I (ym
+ '"
_.. Ym+1)
(Ym+l - Jlm+2)
(Y~-1 - Yn) I~ ::; HYm ~ Ym+l ~!
llYm+l - Ym+2!f
llYn-1 - Ynl] < l/2m + 1/2m+l
~ ~ + 1/2n-l
< 1/2tn- 1
21.4
Operators
(x1, x2,
~ . , X-n) of real and complex numbers can be made into normed linear
spaces in an infinite variety of ways, as we shall sec below. If the
norm is defined by
Example
:;;!!;
(4)
spaces~
(5)
Banach Spaces
215
{ X 1,. X :;i.,
!xn IP <
..
Xn' . }
(TJ ,
'IO
llxllP ~ (
n.=l
lxftl P) 11 P
(6)
The reader Vr"'ill observe that the real and complex spaces l2 arc precisely
tl1e infi.nite4limcnsional Euclidean and unitary spaces R~ and C' defined
in Problem 15-4r The proof of the fact that l,, actually is a Banach
space requires arguments similar w those used in Problems 15-3 and 15-4+
The Banach spaces discussed in these examples are all special cases of
the important LP spaces studied in the thco:ry of measure and integration.
A detailed treatment of these spaces is out.side the scope of this book, but
we can describe them loosely as fo11ows. An LP space essentially consists
of all measurable functions f defined on a measure space X with measure
m which arc such that. lf(x)I P is integrable, v..~ith
H!HP ==
(j lf(x)]
dm (x)) lip
(7)
taken as the norm. In order to include the spaces l; and lp VtTithin the
theory of LP spaces1 we have only to consider the sets [ 1, 2j ~ .. , n l
and I I~ 2., .... , n 1 i as measure spaces in 'vhich each point has
measure I, and to regard n ~tuples and sequences of scalars. as functions
dcfin ed on these sets+ Since integration is a generalized type of summation, formulas (5) and (6) are special cases of formula (7) . 1
Example 5. Just as in Example 3,. wc start "'Tith the linear space of all
n-tuples x = (x 1,, x 2 ~ J Xn) of scalar8, but this time we define the
Several remarks and example.s relating to LP spaces n.re scatt-Prr.d about in this
and the next chapter. T"his fragmentary material is not essential for an understanding of the.se chapters, and may Le disregarded by any reader without. the necessary
backgrouud. Brief ~ketches of the relevant idc&S can be found in Taylor [41 ~ chap. 7J
and T..001nis [27 chap. 3J. For more ext.ended treatments 1 !'Jee l!a.lmos [ 18], Za.anen
[45j, or Kolmogorov and Fomin [26i- vol. 21.
1
216
Operators
norm by
(8)
This Banach space i~ rommonly denoted by l~, and the symbol llxllco is
oreasiona1ly used for the norm given by (8). The reason for this practice
lies in the interesting fact that
!lxll~
:;;:
Hm
Hxll,,
max
11 )
IP
asp~
(9)
oo
i=I
We briefly inspect the case n = 2 to sec why this is true. Let x = (xh X2)
be an ordered pair of real numbers with x1 and x2 > 0. It is clear
that l!x ii~ = max ~ x1, X2} :::; (x1P + x2P) 11 P = llxlLp+ If x1 = X!b t.hen
lim llxl',, = lim (2x2P) 11P = lim 2 1 ' 11 x2 = X-:? = ]!xl[~. And if x1 < x2t then
Ii m 11 x lIp
I im (x 1,,
+x
2P) 1f P =
x2
llx:!~
6..
(10)
and we denote the resulting Banach space by l~., The set c of all con~
vergen t seq uen cc s is easily seen to be a closed linear subspace of l ~ and
is thereforP itself a Banach space. Another Banach space in this family
is the subset co of c which consists of all convergent sequences ,dt.h
Hmit O+
Examp1e 7~ The Banach space of primary interest to us js the space
e(X) of all bounded r.ontinuous scalar~valucd functions defined on a
topologiral space X, "1.th the norm gi vPn by
11111 = sup
(11)
lf(x)!L 1
r'fhis norm is sometimes called the uniform norm-, because the statement
that fn converges to f \Vith respect to this norm means that J~ converges
tof uniiorm.ly on X. The fact that this space is complete amounts to the
fact that if f is the uniform limit of a sequence of bounded cont.inuous
functions~ then f itself is bounded and continuous. If, as above 1 \ve
consider n-tuples and sequences as functions defined on f I ~ 2~
n.}
and ~ l, 2,
n, . . . }, then the spaces l! and l~ are the special CUSPS
of e(X) which correspond to choosing X to be the sets just mentioned.,
each \vi th the discrete topology .
r
...
The real apac~ e{ X) and t.he complex space e( X) are, of course, the spaces
previously denoted by e(XJR) a.nd e(X~C).
1
Banach Spaces
217
Fig. 35.
=:::;
(12)
where 1 :=; p < oo, and if we a11ow p to increase from 1 to ~ ~ then the
corresponding S's swell continuously from the first square mentioned to
the second. We note that S is truly "'spheric.al'' ~ p = 2. These
considerations also show quite clearly why we always assume that
p > 1; for if we \Vere to define llxll,. by formula (12) with p < I~ then
S = t x: Bx11 p 5 1 J would not be con vex (see the star-shaped inner portion
of Fig. 35). For p < 11 tberef ore, formula ( 12) does not yield a norm4
218
Operators
Problems
2t
3..
4.
(b)
'
conclusion.)
Prove II alder's inequality: 2:~ 1 [x1Yil < [lx[IPllY~I q (If x = 0 or
u = 0, the inequality is obvious~ so assume that both arc 0.
Put a; = (lxi[/llxll ,)P and bi = (IY1!/llvllq)q, and use part (a)
to obtain lxt:Yil/llxH,,llYI[ q < ai/p + bi/q. Add these inequali-+
ties for i = 11 2, . . . , n 1 and conclude that
n
( ~ [x&il)/llxHP!IYllq <
1/p
1/q
1.)
i=l
(c)
Banach Spaces
219
+ yl[J'P =
ixi + Yt:~p
i-1
<
i-l
Jx1I ~x, +
~ ~
i 1
K
Y1IP- 1
i=l
< ([lxlJP
+ g is also in LP and
Let N and N' be normed linear spaces with the same scalars, and
let T be a linear transformation of N into N 1 l- \Vhen we say that T is
continuous, \Ve mean that it is continuous as a mapping of the metric
space N in to the metric space N'. By Theorem 13-B~ this amounts to the
condition that x~ ---t x in N ::::::} T(x~) ---i- 7"'(x) in N'. Our main purpose
in this section is to convert the rcquiremr.nt of continuity into several
more useful equivalent forms and t-0 show that the set of all continuous
linear transformations of N into N' can itself be made into a normed linear
space in a natura] way .
Theorem A. Let N and N' be normed linear spaces and T a linear trans-formation of N into N'. Then the following condition3 on Tare all equivalent to one another:
(1) Tis continuous;
(2) T 1~s continuous at the origin, in the sense that Xn ~ 0 ~ T(xn.) ___. 0;
In the futurcJ whenever we mention two normed linear spaces with a view to
~-onsirlc~Ting linear transformations of one into the other . we shall always e.asume-v.-T 1thou t n ec cssarily say jng so ex plici tly-tha t they have the aam e scalars.
1
220
Operators
(3)
that
If
T (x) 11
< Kllxl[
Yw. == Xn/n[lxn![,
then it is e-asy to see that y'*' ~ 0 but T(yn) -H- 0, so Tis not continuous
at the origin.
(3) -=> ( 4). Since a non-empty subset of a normed linear space is
bounded ~ it is contained in a closed sphere centered on the origin, it is
evident that (3) ~ (4); for if Hxll < 1, then [IT(x)~I < K~ To show that
(4) ~ (3), we assume that T{S) is contained in a closed sphere of radius
K centered on the origin. If x == O~ then '1. (x) = O, and clearly 11 T (x) li
::; K~lxll; and if x ~ 0, then x/llxll e S, and .therefore l T(x/~lxll) II < K,
so again ~re have 11 T{x) [I < K!lxll
1
SU p
{uT (x) 11
: Hx 11
:5
1 J.
(1)
When N #- iO], this formula can clearly be written in the eqwvalent form
== 1}.
(2)
It is apparent from the proof of Theorem A that the set of all bounds for
T equals the set of all radii of closed spheres centered on the origin which
contain T(S). This yields yet another expression for the norm of T,
Banach Spaces
221
namely,
[1T[l
=inf {K:K
(3)
<
~I T[j
HxU
(4)
We now denote the set of all continuous {or bounded) linear transformations of N into N' by ffi(N,N'), where the letter Hffi" is intended to
suggest the adjective ''boundcd~n It is a routine matter to verify that
this set is a linear space with respect to the poin twise linear operations
defined by Eqs. 44-(1) and 44-(2) and to show that formula (1} actually
does define a norm on this linear space. we summarize and extend these
remarks in
Theorem B. If N and N' are nor~d linear spaces, then fM set ffi(}l,N 1 )
of all .continuous linear transformations of N into N' is itself a normed
linear space wi,th TC 8pect to the poinlwisc linear Operati&n3 and the narm
defined by ( 1). Further, if N' is a Banach space~ then CB ( N, N') U; also a
Banach space .
PnooF. We leave to the reader the simple task of showing that ffi(N,N')
is a normed linear space~ and we prove that this space is complete when
Nt is.
Let {Tn} be a Cauchy sequence in m(N,N')~ If xis an arbitrary
vectorinN,thcn j[Tm(x) - Tn{x)ll = ll(Tm - Tn)(x)1! < llTm - Tnll 11xH
shows that ~ Tn(x) J is a Cauchy sequence in N 1 ; and since N' is complete,
there exists a vector in_ N'~we denote it by T(x)---such that T11(x) ~
7'(x). This defines a mapping T of N into N' ~ and by the joint continuity of addition and scalar multiplicationJ T is easily seen to be a linear
transformationL To conclude the proof, we have only to show that T is
continuous and that T" --4 T "tith respect to the norm on ru(N,N').. By
the inequality 46-(1)~ the norms of the terms of a Cauchy sequence in a
normed linear space form a bounded set of numbers, so
11 T(x)
II
fllim
T n(x) l
;::=
1im
!ITn(x) I~
~ sup
([ITnll llxl[)
===
(sup
!IT~II) ~lxtl
ltTm(x) ~
Tn(X)[I
=:::;
< If Tm -
T~~I
< E.
We now hold m fixed and allow n to approach oo, and we see that
[ITtft(x) ~ Tn(x)" ~ [IT.(.z) - T(x) 11, from which we conclude that
Operators
222
HT.(x)
- T(x) II ::5 '- for all m 2::. no and all x such that llx!I :5 1. This
shows that HT. - Tll S E for all m > no, and the proof is complete.
Let N be a normed line.ar spaee. We call a continuous linear
transf orma tio n of N in to itself an operalor on N, and 'vc dcno tc th c
normed linear space of all operators on N by ffi (N) instead of ffi (1V :tN) .
Theorem B sho~rs that <B(N) is a Banach space when N is. Ii,urthcrmore,
if operators are multiplied in accordance with formula 44-(3), then
fB(N) is an algebra in which multiplication is related to the norm by
(5)
<
< 1}
< 1}
=
=
: llxli < 1 i
ll T(I sup [[IT' (x) ll: llx~I < 1}
== 11 TH 11 T' IJ.
sup { l T(7''(x)) [I
We know from the previous section that addition and scalar multiplication in <B (N) are jointly continuous, as t.hey are in any normed linear
space~ Property (5) permits us to conclude that multiplication is also
jointly continuous:
T~
---i-
II T l)T~ -
TT'll
IJ Tn(T: - T')
(Tn - T) T'll :$
li T n 1l
r: - T' 11
I]
HTfi -
Tll [l T'l1
l[J[I = I;
(6)
Problems
1..
Banach Spaces
2..
223
3.
4.
5.
6~
z;
74'
r-
224
Ope rotors
8.
Bana(:h Spaces
pointwise~
ll!H
225
f is defined by
S 1}
Fa(!J =
Jf(x)g(x) dm(x).
IFQ(f)I
I/ f(x)g(x) dm(x)I
<
J lf(x)g(x)~ dm(x)
HF.:11 = 11Yll ~
It can also be shown that every function al on L 11 f.lJ'i.ses in this way, so the
mapping g-----+ Fa (which is clearly linear) is an isometric isomorphism of
L.q onto L'P *. This statement is usually expressed by writing
L,,*
L"-J
(1)
226
Operators
and that
(l~) =
l!
(3)
(l!)* =
l~ ~
(4)
We sketch proofs of (2), (3), and (4) in the problems.. When we consider
sequences of seal ars, then for 1 < p < oo we have the f oUowing special
case of (1):
ljj.
If p
l(J_~
(5)
l1* =
l~.
(6)
49-51 ]) ~
Most of the theory of conjugate spaces rests on the Hahn-Banach
theorem, which asserts that auy functional defined on a linear subspace of
a normed linear space can be extended linearly and continuously to the
whole space without increasing its norm. The proof is rather complicated, so we begii1 with a lemma \Vhich serves to isolate its most difficult
parts.
Lemma. !Jet }rf be a linear subspace of a normed linear space N, and let
f be a functional defined on M. If XQ is a vector not in M, and if
Mo= M
+ [x-0]
Banach Spaces
271
~ Hx
axo![
-f(x) ~ !Ix+ ax{dl
or
\Ve now observe that for any two vectors x 1 and x!I in .llf we have
f (x2)
so
f (xi)
llx"l
f(x:.)
+ ilx! + XD~I ~
+ xol!,
(9)
+ xoB ;x J_~J
+ Xo I~ : x C. Jf J ,
then (0) shows that a ~ b. If "Te nov{ choose ro to he any real number
such that a < r 0 ~ bJ then the required inequali(y (8) is satisfied and this
part of the proof is complete.
lVc next use the result of the above paragraph to prove the lemma
for the case in which N is complex. Here f is a complex-valued func~
tional defined on ~! for \Vhich flfll = lr \\re begin by remarking that a
complex linear space can be regarded as a real linear space by simply
restricting the scalars to he real numbers~ If g and h are the real and
imaginary parts of fi so t.hat f(x) = g(x)
ih(x) for every x in L~f 1 then
both g and h are en.sily seen to be real-valued functionals on the real
spae.e 11!; and since 11f11 ;:; 1, we have ~I g 11 < 1 ~ 'fhe eq ua ti on
f (ix)
together \Vith /(ix) = g(ix)
===
if (x)
+ ih(ix) and
ig (x) - h (x) i
228
Operators
x in Mo by fo(x)
= go(x) - ig(l(ix).
;::=
+ ig o(x)
;;::=
= r =
ri 8fo(x)
fo(ri'x);
= 1
11/oll
llfl].
Banach Spaces
229
Among other things, this result shows that N* separates the vectors
in N, for if x and y are any two distinct vectors, so that x - y # 0 1 then
there exists a functional fin N such ~that f (x - y) ~ 0, or equivalently,
f(x) ~ f(y).
Theorem C.
+M
~ 0.
f (XG
Ii we now defin~ f\J by fo(x)
desired properties~
+ jf)
~ Q.
= f(T(x))~
1.
2.
s:
3.
18 4).
In this problem we ask the reader to convince himself of the vu.lidity
of Eqs. {2) to (4)~ Let L be the linear space of aU n-tuples
X
(x l~
X2,
..
Xn)
230
Operators
(Y11
Y21
Yn) --+- f,
llfl:
inf
{K: K
~ 0 and lf(x) I
< Kl]x]f
All that remains is to see 'vhat norm for the y~s makes the mapping
y ~fan isometric isomorphism.
(a) If 1 < p < o: 1 th.en (l;) * =
The norm in this caHe is
defined by Hxll = (Zi-1 lriJP) llPi and it fo1lov{S from
1:.
:$ ;~
<
that I! f 11
oth erVtrise.
(b)
(c)
If xi] = max
Uxil},
[f(x) I ~ I~ xiyi~
<
i~l
4.
~ jxil
il
Yi!.
(I IY1il)
i~I
Banach Spaces
231
defined on a real linear space L such that p(ax) = crp(x) for a: > 0
and p(x
y) < p(x) + p(y) 1 and if f is a real linear function defined
on a linear subspace JI such that f{x) :::; p(x) for all x in ~f 7' then
f can be extended to a real linear function f rJ d cfincd on l.J such that
fo(x) ,:::; p(x) for all x in L .
Fzj(f) = f(x).
In other wordB, we invert the usua1 pract.ir..e by regarding the symbol
f(x) as specifying a function off for eaeh fixed xJ and v...-e emphasize this
point of view by writing f(x) in the form F~(f)+ A simple manipulation
of the definition shows that F ~ is linear:
F~(af
+ {jg)
ltxl!
llF~II
[]xJI.
232
Operators
+
+
= f(x
y)
= f(x)
f(y)
= Fr(f) + Fu(f'J
= ( F :t + F !J) (f),
l p **
--
l q*
--
l p
z:
It fo1lows from Problem 48-3 that the spaces for 1 < p < co arc also
reflexive. Since N** is complete, N is necessarily complete if it is
reflexive+ If N is complete~ however, it is not necessarily reflexive, as we
see from c{t* ~ 11 and co** = l1* = l~. If X is a compact Hausdorff
space, it can be sho"'"'n that e(X) is reflexive RX is a finite set+
'rherc is an interesting criterion for reflexivity, which depends on the
co nee pt of the wealc topology on a 11 or med linear space N ~ This is
de.fined to be the weak topology on N generated by the functions in N*
in the sense of Sec. 19; that is 1 it is the weakest topology on N with
respect to whirh all the functions in N* remain continuous~ The criterion
referred to is t.he f ollowing: if B is a Banach space, and ifs = { x: nxH < 11
is its closed unit sphere, then B is reflexive-RS is compact in the weak
topology. J.'his fact is something one should kno-.,v about Banach spaces,
but "\\Tc shall have no need for it ourselves, so we state it "\\ithout proof . 1
Far more important for our purposes is the weak topology on N*,
lNhich is defined to be the weak topology on N* generated by all the
induced functionals F~ in N**. This .situation is rather complicated, so
\\Te shall try to make clear just what is going OIL
1:irst of all, N* (like N) is a normed line~r space 1 and it therefore
has a topology derived from its character as a metric space~ Thia is
called the strong topology. N** is the set of all scalar-valued linear
functions defined on N* \i\Thich arc continuous ,vith respect to its strong
topology.. The weak topology on N* (like the weak topology on N) is the
weakest topology on N* ~rith respect to which all the functions in N* a.re
continuous 1 and clearly this is weaker than its strong topology. So farj as
See Hille ~nd Phillips [20, p. 38] or Dunford and Sehwartz
l8:.
p. 425].
Banach Spaces
233
fo,
E)
;:::=
f :f N* and IF~<n
- {f :f N* and l/(x) -
F;:(fo)~
<
f n(x)I <
~:l
f}
Theorem A,. If N is a normed limar spac-et then the closed unit sphere S*
in N* is a compact Hausdorff space in the weak* topology.
1 When we use the adjective "closedH in referring to S*, we int.end only lo emphasize the ineq u.a.lity 1 /II ~ 1, as con tm.sted with 1! /H <1.
234
Operators
also a compact space. For each x, the values f(x) of all fs in s lie in
C~~ This enables us to imbed S* in C by regarding each f in S* as
identical with the array of all its values at the vectors x in N~ It is clear
from the definitions of the topologies concerned that the weak* topology
on S* equals it::i topology as a subspace of C; and since C is compact.i it
suffices to sho,v that s is closed as a subspace of C. We shov{ that if g
is in S*, then g is in S* ~ lf we consider g to be a function defined on the
index set Nt then since g is in C we have lu(x) I < Hxfl for every x in N.
It therefore suffices to show that g is linear as a functjon defined on N.
Let ~ > 0 be given~ and Jet x and y be any t'\\~o vectors in N. Every
basic neighborhood of g intersects J.~, so there exists an f in 8* 8uch that
fg(x) ~ f(x)I < f/3, lg(y) - /(y)i < f/3, and Jo(x + y) - f(x + y)I <
t=/3. Since f is linearJ j(x + y) - f(x) - j(y) = 0, and we therefore have
jg(x
+
+ +
+
+
+ y)
~g(y) - f(y)J
+ t:/3
f;.
The fact that this inequality is true for every E > 0 nov-r implies that
g(x + y) = g(x) + y(y)r We can show in the same way that
g(ax)
ag(x)
\Ve arc no\\" in a position to keep the promise made in the last
para.graph of Sec. 46~ for the f ono,ving result is an obvious conscq uence
of our preceding \\.Tork.
B.
This theorem sho,vs 1 in effect 1 that the most general Banach space is
rssent.ially a closed linear subspa~e of e(X), where X is a eompaet
llausdorff r.;pacc. The purpose of representation theorems in abstract
mathema.tif's is to revca] the structures of complex systems in terms of
simpler onP:.-:i} and from this point of view, Theorem B is satisfying to a
dc-grec. It must be pointed outJ however~ that ,,.e know next to nothing
about the r.losed linear subspaces of e(X), though we know a good deal
about e( . Y)
. it.self
Theorem R is thcrefo1e some'"That less revealing than
appears at first glance~ We shall see in Chaps. 13 and 14 that the
L
Banach Spaces
23.S
1.
2.
3.
4.
5.
v:r=
s:
236
Operators
is in T(S 1), there exists a vector x1 in B such that Hx1ll < 1 and
llY - Y1 II < E/2)' where Yi = T(x1). We next observe that
s;
T(Sh), so there exists a vector x2 in B such that llx2!I < ~ and
if (y ~ Y1) - Y2fl < f/4, where Y'l = T(x2)~ Continuing in this way, we
obtain a sequence {xn} in B such that IJxmlf < 1;2~~ 1 and l!u -- (Yt + Y2
+ Yn) 'I < E/2~ 1 where y,,.. = T(xn). If we put
Since
s:12
Jlxnil <
s3.
T(x)
+ Yi + + Y")
y,
+ s:.
sake of emphasis.
A one-to-one continuous linear transformation of Qne Banach
space onto another is a homeomorphism. In particular, if a one-to-one
linear transformation T of a Banach space onto it.JJelf is continUOUB, then its
inverse ~ 1 is automatically continuous .
Theorem B.
As our first application of Theorem B, we give a geometric characterization of the projections on a Banach space+ The reader will rece.ll
from Sec. 44 that a projection E on a linear space L is simply an idem-
Banach Spaces
237
E(z) = x)~
These facts show that the study of projections on Lis equivalent to the
study of pairs of linear subspaces which are disjoint and span L.
In the theory of Banach spaces1 however, more is required of a
projection than mere linearity and idempotence. A pra}edion on a
Banach space Bis an idempotent operator on B; that is, it is a projection
on B in the algebraic sense which is also continuous. Our present task
is to assess the effect of the additional requirement of continuity on the
geometric descriptions given in (1) and (2) aboveL The analogue of (1)
.
lS easy.
Theorem C1 If P i8 a projection on a Banach space B 1 and if Mand N
are its range and null space, then Mand N are closed linear subspaces of B
such that B
M EB N .
PROOF. - P is an algebraic projection 1 so (1) gives everything except
the fact that M and N a.re closed. The null space of any continuous
linear transformation is closed J so N is obviously closed; and the fact
that M is also closed is a consequence of
:;:::=
~ P)(x)
= O},
I zII' :::;
Jlx[[
+ HYU,
238
Operators
then B' is a Banach space; and since l! P(z)jl ~ llxll S Hxfl + l!-u[I = flzll'1
P is clearly rontinuous as a mapping of B' into B. It therefore suffices
to prove that B' and B have the same topology. If T denotes the
identity mapping of B' onto B, then
~I T(z)[I =
[liH
l]x~I
+ llyff
llzl!'
then the resulting topology is easily seen to be the same as the product
topology 1 and convergence "Tith respect to this metric is eq ui valen t to
coordinate,visc convergence+ Now let 1' he a linear transformation of
B into B'~ '\Ve recall that the graph of T is that subset of B X B 1
which consists of all ordered pairs of the form (x/T(x))
Problem 26-6
shows that ii T is continuous1 then its graph is closed a.s a sub:.-:;et of
B X B ~ In the present context, the converse is also true.
+
PROOF.
HT(x) 11
<
~lxl]
+ ll T(x) [I
= llxH h
Banach Spaces
239
- x) H=
=
xH + l T(xn) - T(x) [I
llx1) - xii + llTCxn) - Yll ~ 0.
llx"
1.
3..
We shall see in this section that each opera.tor Ton a normed linear
space N induces a corresponding operator~ denoted by T* and called the
conjugate of TJ on the conjugate space N*. Our first task is to define T* ~
and our second is to investigate the properties of the mapping T ~ T*.
We base our discussion on the follov-.. ing theorem.
Theorem A (the Uniform Boundedness Theorem). Let B be a Banach
apace and N a normed linear space. If {'I\ l is a non-empty set of con1
See Taylor
[41~
pp.
181-1851~
240
Operators
s Band If Ti(x)Jl
n for all i}
Since B is complete 1 BaireJs theorem sho-v-.Ts that one of the Fn 1s, say
F nfJ~ has non-empty interior~ and thus contains a closed sphere So with
Center Xo and radius ro > 0.. rfhis Says, in e ffectJ that each veetor in
every set Tt(So) has norm less than or equal to no; and for the sake of
brevityt "\\Te express this fact by writing II Ti(~S'o) I < no. It is clear that
S 0 - x o is the closed sphere with radius r o centered on the origin 1 so
(So - xo)/ro is the closed unit sphere B+ Since Xo is in So, it is evident
that ll"Ti(So - xfr) H ~ 2no. This yields IJ Ta:(B) H < 2no/ro, so fl T,JI <
2n 0/ r 0 for every i, and the proof is complete.
N4
B-ana ch Spaces
itself, as follows.
241
!T'(f)J(x) = f(T(x)).
We leave it. to the reader to verify that T 1 (f) actually is linear as a func ..
tion defined on N, and also that 1 is linear as a mapping of L into itself.
The following natural question now presents itself. 1-7 nder what
circumstances does T, map N* into lv*? Thi:s question has a simple and
elegant answer: T' (N*) C N * <=> '~f i8 continuous. If "~e keep Theorem
B in mindJ t.he proof of this statement is very easy; for if ~g is the closed
unit sphere in N 1 then '1' is continuous<=> T(S) is bounded 8 f(T(S)) is
bounded for each fin 1V* ~ CT' (f) ](S) is bounded for each f iu .i:V* ~ T'(f)
is in N* for each f in N* .
We now assume that the linear transformation T i~ continuous and is
therefore an operator on N. The preceding dcve1opmcnts allow us to
consider the restriction of T' to a mapping of N* into itself.. We denote
t.his restriction by T*~ and we call it the conjugate of T. The action of
T* is given by
1
'
[T*(f)J(x) = f(T(x)),
(2)
T*ll
:5 If Tit
Since HTU = sup {fIT(x) H~ fIx [I < 1}, we see at once from Theorem 48-B
that equality holds here 1 that is, that
If
T*ll
l! Tll
(3)
+ ~T,.)* =
+ PT2*~
= T2*T1,
I* = I.
(T1T2)
and
aT1*
(4)
(5)
(6)
242
Operators
(TJT~,) *(f) =
(TzT1 *) (f) for each fin N*, and this means that
[(Tl T2)*(/)J (x) = [(1\~*T 1*) (f)J (x)
shO\YS
that
f(1,1(1't2(x)}) = [Tl*(j)]('.Fi(x))
= [ 1' 2 * ( 7\ ( f) ) ] (x) = [('.t 2 * T 1 *) (f) ] (x) .
follo"Vl-~ing
summary of
J.v
1...,he general significance of the ideas developed here ran be understood 011ly in the light of the theory of operators on IliJbert spaccsr
Some preliminary comments on these matters are given in the introduction to the next. r hapter
+
Problems
1.
2~
3.
CHAPTER TEN
Jlil/Jert Spaces
One of the principal app1ications of the theory of Banach algebras
developed in Chaps+ 12 to 14 is to the study of operators on Hilbert
spar.es. Our purpose in this chapter is to present enough of the elementary theory of Hilbert spaces and their operator~ to provide an
adequate foundation for the deeper theory discussed in these Jatcr
chapters.
We shall see from the formal definition given in the next section that
a IIilbert space is a special type of Banach space, one \\Thich possesses
additional structure enabling us to tell when 1Nf0 vectors arc orthogona1
(or perpt~ndicular) T The first part of the chapter is concerned solely "'Tith
the geometric jmplica.t.ions of this additional structure.
As "\\"'e _have said before, the objects of greatest interest in -r..onnection
with any linear spaee are the linear transformations on that space+ In
our treatment of llanach spacest we took advantage of the metric structure of such a space by focusing our attention on its operators~ A
Banach space, however, is still a hit t<>o genera] to yield a really rich
theory of operators+ One fact that did emPrgP., 'vhich is of great significance for our present work~ is that \vith each operator T on a Banach
space B there is associated an operator T* (its r,onjugatc) on the r.onjugate space B. We shall sec bclo\v that onP of the central properties of
a Hilbert space H is that there is a natural correspondence between II
and its tonjugate space H*. If T is an operator on H, this correspondence makes it possible to regard the conjugate T* as acting on JI i tsc1f
(instead of H*), vrhere it can be compared "ith T. These ideas lead to
the concept of the adjoint of an operator on a Hilbert space, and they
2A3
244
Operators
make it easy to understand the importance of operators (such as selfadj oint and normal operators) which are related in simple ways to their
adjoin ts .
=::
X1Y 1
X2Y2
;t
+ XsY3,. 1
Uxll ~-
where fJ is the angle between x and y, and also with the fact that x and y
are orthogonal precisely when (x,y) = o~
Most of these ideas can readily be adapted to the three-dimensional
unitary space C 1 ~ 14~or any two vectors x = (x1, Xi, xa) s.nd y = (y1, Y2J ya)
in thls space 1 we define their inner product by
(x,y) ~
X1Y1
+ X2Y1 + X&:V3
(1)
(xix) ;;
!lxU2
Hi rbert Sp aces
sen~ that (x,y)
245
(y,x)~
(xJ ay
+ ~z)
= a(x,y)
+ S(x,z)
x = (z1,
X2i ,
Xn) and Y
=:::
defined by
246
Operators
E~omple
The space
2.
= {x1
:t
x 2' ~
l~, ~~ith
' Xn '
yn J ~
defined by
om)
I.
(x,y) =
XnY~
n 1
1,hc fact that this series convergcs-----and thu~ defines a -complex number-for ~ach r and y in l'l. is an easy eonHequencc of <:anehy'H inequality.
Example 3. --rhc space ?. a~sociatcd . . vith a rncn.t{ure :::.pace . Y
. "=-ith
measure rn, "~ith the inner product of t\vo functions f and g defined by
(J ,g)
J1(x) 9 ( x) d n <x) .
l
This llilbert space is of course not part of the offieial content of this book,
but \VC mention it any,vay in c.ase the reader has some knov... ledge of these
matte-rs~
A-c:;chwarz inequality.
Theorem A.. If x and y are any tu,o t.'ectors in a
Hlberl space, then I(xJy)I < llxH l~Y~!
PHOOF.
\V'hen y = 0, the result is clear, for both
ti id PH vanish.
\\,..hen y ~ 0, the iueq uality is equiv~
a hint to I(x,y/ liYIDI < llxl~.
may therefore confinP our attention to proving that if [lyl] = l, then
'"Te have ~ (x,y)I < :i!rli for all x. This is a direct
conHequencc of the fact that
' re
Fig
afi. Sch ~"!:Lr z' H
inc4 uality.
+{
Xn ----;
Yn
---+
===?
(xn,Yn) - t (x,y).
I(xn,Yn)
- (x,y)J
1(Xn~Yn)
- (Xn 1Y)
+ Ctn~Y)
(:t1i,Y)I
11
llYll
Hirbert Spaces
247
This is readily proved by writing out the expression on the left in terms
of inner products:
(x,x)
+ (x,y) +
+ Y1 x + y) + (x
(x
(y,x)
(y,y)
~ y, x -
(x,x) - (x,y)
= 2(x,x)
2(y,y)
y)
(y7x)
+ (y,y)
= 2Hxll 2
+ 2llYll
llxm -
x~ll 2
= 2l]xmlP' + 2l[xnll 2
l~x.
+ Xmil 1
4d!;
+ x'
2
<
~rhich
llxlP
2
HxH 2
2
ilx'r[ 2
2
--..
x - x'
2
+ Hx2 H -d,
1
2..t 8
Operators
+ iy!f 2 -
il[x - iylf 1.
(2)
This is easily verified by co nvcrting the expression on the right into inner
products.
If B is a complex Banach space whose norm obeys the parallelogram lawt and if an inner product is defintrl on B by (2) 1 then B is a
Hilbert space.
PROOF~ All that is necessary is to make sure that the inner product
defined by (2) has the three properties required by the definition of a
Hilbert space~ 'fhis is easy in the case of properties (2) and (3). Property (l) is best trr.atcd by splitting it into two parts:
Theorem
C.
(x
+y
z) = (x,z)
+ (yjz),
and (ax,y) = a(x,y)~ The first requires the parallelogram law, and the
second follovrrs from the first. We ask the reader (in Problem 6) to ~rork
out the detai1s.
1.,his result has no implications at all for our future \vork. However,
it does provide a satisfying geometric insight into the place Hilbert
spaces occupy among all complex Banach spaces: they are precisely those
in which the parallelogram la'v is true4
Problems
1..
2.
3.
4.
5.
6..
Show that the series \vhich defines the inner product in Example 2
is convergent.
The Hilbert cube is the su bsc t of l 2 consisting of all sequences
such that jxli.I ::; 1/n for all n. Show that this set is compact as a
subspace of 12.
l1~or the special Hilbert space li, use Cauchy's inequality to prove
Sch '\va rz 's inequality.
Sho\V that the parallelogram law is not true in z~ (n > 1)4
In a Hilbert space, show that if Uxll = 11YH ::::; 1, and if= > 0 is given,
then there exists & > 0 such that Jl(x + y)/2[1 > I - a~ IJx - Yll
< E~ A Banach space with this property is said to be uniformly
conz:ex4 See Taylor [41, p. 231].
Give n detailed proof of Theorem C.
Hilbert Spaces
249
11x
== l~xlf 2
+ flYll
{OJ J.
Sr1Sl.~{OJ;
S1CS2~S11- 282..L;
250
Operators
!lzo - ayll
so
and
If we put
o:
llx -
(yo+
(1)
-2Pl (zo,y)I i
+ f3 i(zo,Y)! 2llYll
1
> 0.
- 2/3a
+ p~ab > 0
f3a(/jb - 2)
so
P~
However, if a
>
(2)
> O,
theorem~
Hilbert Spaces
2.Sl
Problems
1.
2.
3.
A.
252
Operators
zero vectors in H~ and if the x/s are normalized by replacing each of them
by ej = Xi/llx,:ll, then the resulting SE~t. fed is an orthonormal set.
Example 1. The HUbset {ei, _et, . . . , en} of l;, ":rhere Ci is the n-tuple
'"Tith 1 in the ith place and O's clse\v here, is evidently an orthonormal
set. in this space.
Example 2. Similarly, if en is the sequence \vith 1 in the nth place and
OJ s clse\vhere ~ then {e1, e2,. . . . , en, . . J is an orthonormal set in l2~
L
At the end of this section, \\re give some additional ex.rimples taken
from the field of ana}vsiH+
....
Every aspect of the theory of orthonormal sets dependr.; in one way
or another on our first theoremr
Theorem A. T"'et {e1, ez, ~ .. T e~} be a
space II. If x is any vector in H~ then
I (x,e,)l 2
< If xiii;
(1)
1-1
'l
/urtherJ
x -
(x,ei)e.: 1-
(2)
Ci
i= l
for ear.h j.
PRoott~.
~ (x~ei)ei
< r1 x -
11
l=l
- (r -
(x~e,)ei~ x ~ ~ (x,ei)ei)
l=l
~ (x,x) -
j=l
I
I
1"=1
I.:-1
1l
- llxlf 2
I (x~e.:)l 2T
i=J
( x - ~ {x 1 ei)e1}ej) ~ (x,eJ) il
(x,eD (ei,ei) -
(x_,eJ) - (x,eJ) = OJ
i=l
reader should note that the inequality (I) can be given the
following loose but illuminating geometric interpretation: the sum of the
squares of the components of a. vector in various perpendicular directions
Hilbert Spaces
253
docs not exceed the square of the length of the vector itself. Thls is
usually ca lied Bessel's i.rn.equality., though., as we shall see be1o w, it is only
a special case of a more general inequality with the same namer In a
similar vein, rclat.ion (2) says that if we subtract from a vector its components in several perpendicu1ar directions, then the result has no component left in any of these directions.
Our next task is to prove that both parts of Theorem .A. gencraliie
to the case of an arbitrary orthonormal set. The main problem here
is to shov.r that the sums in (1) and (2) can be defined in a reasonable way
'vhen no restriction is placed on the number of e/s under consideration~
The key l-0 this problem Hes in t.he following theorem.
Theorem B. If {ei} is an orthonormal set in a llilbert space II~ and if x
is any vector in HJ then the set S = {Ci; (x~.e-i:) - 0} is either empty or
countable.
PROOF~ },or each positive integer n, consider the set
S = f e1 e2,
11
~ . ~
:11 eA:11
.. }
By the theory of absolutely con vcrgent series, if x:=1 j (x~en) i2 con verges,
then every series obtained from this by rearranging its terms also con-
254
Operators
vergest and all such series have the same sum. We therefore define
l:[ (x,ei)I t to be ~:- 1 I(x,eliL)l1, and it follows from the above remark that
:E !(x,e;:) I2 is a non-negative extended real number which depends only
on 8 1 and not on the arrangement of its vectors. We conclude the proof
by observing that in this case~ (3) reduces to the assertion that
OD
I (xten)l 2 ~
tlxl! 2 ;
(4)
n.-1
and since it follows from (1) that no partial sum of the seriPs on the left
of (4) can exceed l[xll 2, it is clear that (4) itself is true.
The second part of Theorem A is generalized in essentially the
same way.
If fed is an orthonormal set in a /filbert space H, and
is an arbitrary vectOT in H ~ then
Theorem Dt
if x
(5)
for eaih j.
PROOF.
along~
We again write
we define 2; (i:,e,)e.: to be 2;:m:o 1(x~et:)ei; and in this case~ (5) reduooe to (2),
which has already been proved.
By Theorem B, we may assume for the remainder of the proof that
Sis countably infinite4 Let the vectors in S be listed in a definite order:
s : : ; t ei, e2, ~ ~ eHT } we put s~ = ~~-=.J (xfei)ei, and "'~e note
that form > n we have
1
!Is. -
s~H
;;;
"
i=~+I
(x~et:)e,:tl ~
2
(x~ei) I2.
i:aq1+l
Hilbert Spaces
255
observe that (5) follows from (2) and the continuity of the inner product:
(x - ~ (x,Bi)e1, e1)
A.ll that remains is to show that this definition of ~(x~e")et: is valid, in the
sense that it does not depend on the arrangement of the vectors in S~
Let the vectors in S be rearranged in any manner:
i/1,/2,
,f~,
"
.}.
!Is'
Sinee
~
i
is arbitrary 1 this
sho~,..s
3~~
thats' == s.
Let H be a non-zero Hilbert space, so that the class of all its orthonormal sets is non-empty. 1-~his class is clearly a partially ordered set
with respect to set inclusion. An orthonormal set {ei i in H is said to be
complete if it is maximal in this partially ordered sett that isJ if it is
impossible to adjoin a vector e to {ed in such a way that {eiJe J is an
orthonormal set which properly contains ~ ei} .
Theorem E.
aet.
The statement follov_.,-s at once from Zorn 1 s lemma, since the
union of any chain of orthonormal sets is clearly an upper bound for the
chain in the partially ordered set of all orthonormal sets.
PnooF.
Orthonormal sets are truly interesting only when they are complete.
The reasons for this arc presented in our next theorem.
Theorem F. Let H be a Ililhert space, and let {ed be an ortho-normal set in
H. Then the following conditions a re all equi()alent to one another:
(1) le l is complete;
(2) x ..l {e..: l :=9 x = 0 ;
(3) if xis an arbitrary vector in H~ th-en x ~ ~(x,ei)ei;
(4) if x is an arbitrary vector in H, then ~lxll 2 = ~I (x,edl 2 .
PROOF.
We prove that each of the conditions (1), (2), and (3) implies
the one following it and that. (4) implies (1).
256
Opera tors
::=::}
tlx112 =
l;(x,ei) (x~ei)
=:::;
)1
~j (xte .
Jo
27
lf(x)j 2 dx <
co ,
JIJll =
and
(f,g) =
J.
2.-
dx~
f(x)g(x)
== 0, + I, + 2,
'
In order to under.s t..an d thia and the next exam plet the reader should ha.ve some
knowledge of the modern theory of measure and integra tionr We wjsh to emphasize
once a.gain that these examp1es a.re in no way e.ssentia.l to the structure of the book,
and may be skipped by a.ny reader without the necessary background.. We
advise such a reader t.o ignore these examples s.nd to proceed at once to the discussion
of the Gram-Schmidt process.
HUbert Spaces
257
{6)
c,. = (f,e,.} = y-f (x)e-""'
dx
211" 0
a.re its classical Fourier coefficients, and Bessel 1 s inequality takes the form
I""'
t.1:=-o11;1
n ""' -
lcnl 2
= /
2':
lf(xW dx.
~ I~
v 21f
.., Cl;:C
k -~
ik~ J
(8)
llf
1i
Ill~
o.
-This situation is often expressed by saying that f is the limit in the mean
of the ft/s.. We add one final remark to our de.script.ion of this portion of
the theory of liou ricr series~ Ii f is an arbitrary f unet.io n in l..12 ,vi th
Fourier coefficients cfl defined by (6)~ then BPssP1's inequality tells us that
the series E; -~ ~cnl 2 converges. 1.,he celebrated lliesz-Fischer theorem
assert.H thP converse : if ctl (n ;::;:; 0 1 + 1J + 2, ~ . . ) are given complex
numbers for which
Icn. I2 converges, then t.he re ex ists a function
i;;
-CCI
258
Operators
fin L 2 whose Fourier r.oefficients are the c~'s. If we grant the completeness of L2 as a metric space, this is very easy to prove. All that is
necessary is to use the en's to define a sequence of /,/sin accordance with
(8). The functions ein'&/ V21r form an orthonormal set~ so for m > n we
have
11/m - J~l! 2 =
t
[cA:! ~
ikl=n+l
2
(9)
By the convergence of 1:=--~ 1cn[ 21 the sum on the right of (9) can be made
as small as we please for a 11 suffi cicntly large n and all m. > nr Thls
tells us that the /1/s form a Cauchy sequence in L~; and since L'}. is complete, there exists a function fin L2 such that f n--+ f.. 1,_his function f is
given by (7)t and the cl1's are clearly its Fourier coefficients. It is
apparent from these remarks that the essence of the Riesz-Fischer
theorem lies in the completeness of 2 as a metric space.
We shall have use for one further item in the general theory of
orthonormal sets 1 namely, the Gram-Schmidt orthogonalization proces8.
Suppose that {x1 1 x~, ~ ~ . t Xn, . . . . J is a linearly independent set in a
Hilbert space H. The problem is to exhibit a constructive procedure for
converting this set into a CO rrespo ndi ng orthonormal set {Ct~ e'2 ~
en 1 J -..vith the propert~~t that for each n the linear Bubspaee of II
spanned by fe1T e2", . . , en} is the same as that spanned by {x1~ x2 1
. . . , xfl l ~ Our first step is to normalize .x 1-which is necessarily non ..
zero-by putting
r
et
r1 X1 -
(x1~e1 )el
(X2,e1)e111
ea
~ Hx 3
(.xa,e1)e1 - (xhe2)e2
(xs,e1)e1 - (X.:1,e2)e1 If
Hilbert Spaces
259
.) ,
'
ai ~
(xte,)
for each i. (Hint.: expand llx ~ ~~ 1 aieil: 11 add and subtract ~7=-= 1
I Cx,e-i)l 2J and obtain an expn:1ssion of the form
1 r (x,eJ -- ail 2 in
the rcsultr)
Show that the orthonormal sets described in Examples 1 and 2 are
complete.
Show that every orthonormal set in a Hilbert space is contained in
some complete orthonormal set,. and use this fact to give an alternative proof of Theorem 5:i-B.
Prove that a llilbert space// is separable$===) every orthonormal set in
H is countable+
Show that an orthonormal set in a Hilbert space L" linearly independent~ and use this to prove that a Hilb~rt space is finite-dimensional <=> every complete orthonormal set it':l a basis+
Prove that any two complete orthonormal sets in a Hilbert space H
have the same cardinal number. This cardinal number is callr.d the
1::
2~
3..
4.
5.
6.
260
Operotors
+ xz)
(x1
x2, y)
= (xitY) + (x21Y)
;:::= f JJ(x1)
f 11(X2)
f 11 (ax) = (ax,y)
= cr.(x,y)
::== af11(X)
=
and
Hilbert Spaces
261
!f11(x)j
~ (x,y)!
that is,
IJ/
11 IJ
llYll.
>
1}
1~ C1~!1)
- C1~11' y)
llYI/.
Theorem A.
in H .
f(x)
= (x,y)
(1)
for every x in H4
PROOF
It is easy to see that if such a y exists, th en it is necessarily
unique. For ii we also have f (x) = (x~y') for alt x, then (x,y') = (x,y)
and (x, y 1 - y) :;: : ;" 0 for all Xj and since 0 is the only vector orthogonal
to every vector, this implies that y' - y
0 or y' ::::; y.
We now turn to the problem of showing that y does exist. If f = 0 1
then it clearly suffices to choose y = 0. We may therefore assume that
f F- 0.. The null space M off is thus a proper closed linear subspace of H,
and by Theorem 53-B, there exi8ts a non-zero vector Yo \vhich is orthogonal to M.. We sho'v that if a is a suitably chosen scalar~ then the vector
y ;: a:yo meets our requirements. We first observe that no matter what
a may be:1 (1) is true for every x in M; for f(x) = 0 for such an x, and
since x is orthogonal to yr,, 've also have {x,y} .;: : : 0. "fhis allows us to
focus our attention on choosing a in such a way that (1) is true for
x = Yo- The condition this imposes on a is that
4
=::::;
J(y~) = (yo,<XYo)
atlYolt 24
We therefore choose a to be f(y 0)/HYo" 2, and it follows that (1) is true for
every x in M and for x = Yo- It is easily seen that each x in H can be
written in the form x = m + /jy 0 with m in M ~ all that is necessary is
t-0 choose /3 in such a way that j(x - {Jyo) = f(x) ~ f1f(y 0) = 0, and this
is accomp1ished by putting (3
f(x)/f(y 0 ). Our conclusion that ( l) is
;::;!;
262
Operoto rs
/Jy{t)
;:::=
f(ni)
+ fJf(yo)
(m-,y)
+ fJ(yo,y)
= (m
+ /3yoJ y)
;:::=
(x,y).
(x~y ),
(2)
(3)
2.
3.
Hirbe rt Spaces
263
= f(Tx). 1
(1)
We write z = T*y~ and we call this ne'v mapping 1?* of H into itself the
adjoint of T,. The same symbol is used for the adjoint of T as for its
conjugate because these two mappings are actually the same if II and H*
are identified by means of the natural ~..orrespondence. It is easy to
keep track of whether T* signifies the conjugate or the adjoint of T by
noticing whether it operates on functionals or on vect-0rs. 1.~hc action
of the adjoint can be linked more closely to the structure of H by observing that for every vector x we have (T*f21}x = f"sJ(Tx) - (Tx~y) and
(T~f,,)x ;:::= f.(x) = (x,z) = (x, T*y), so that
( Tx,y)
(x, T*y)
(2)
26'
Operators
(x, T*(y
+ z))
~ (Tx, y
+ z)
~ (Tx~y)
+ (Tx,z)
= (x,T*y)
80
T(y
The relation
+ z)
+ (x,T*z)
- (x 1 Ty
+ T*z)i
T*y
Tz4
T*(ay) == aT*y
The re&eoning here depends on the fact that if y, and y 1 are veetora such tha. t
(ztfll) ..... (:t,1/2) for all :rji then {2; 'II - Y:1) :m 0 for all x:, so YL - :Yt = 0 or 1/i ~ f/J.
Hi Ibert Sp aces
for all Y1
265
so
in to itself.
This
Theorem A.
pro periies:
(1)
(2)
{T1
T2)* = T1*
(a.T) * ~ aT*;
(3)
(4)
(5)
(6)
(TlT~)* ~
+ T2*;
1'i*1 1*;
1
T*t. == T;
ll T* II = [l T [I ;
"T*7'11
l Tll !_
PROOF+
(x~(T1TiY'*y) =
(T1T2x,y)
(Tzx, T1 *y)
(x 1 T2*'1.\*y).
and the
!I Txll 2
(Tx1Tx)
:=;;
=
llT*Tll is an immediate
(T*Tx,x)
consequence of
satisfying+
Problems
1.
2t
266
3.
4..
Operators
( T*)-1 ;; ( ']_,-1) .
Show that 11 TT*ll
HTH 2
+ PA2)*
aA1*
+ PA1*
=a Ai+ PA2
l!A -
A *!I
shows that A - A
theorem.
= Oi
so A
= A*.
The reader will notice that we have said nothing here about the
product of two self-adjoint operators. Vecy little is known about such
267
Hilbert Spaces
. products, and the following simple result represents almost the extent of
our information.
Theorem B.. If Ai and A 2 are self-adjoint operators on H, then their
product A 1A2 is self-adjoint=> A lA ~ = A2A i
PROOF4
=}
(Tx,y) = 0
0 for all x.
We
+ JJY)~ ax + /3y)
lal
(Tx~x) -
l~l 2(7'YiY)
~ aP(Tx,y)
+ a{:J(Ty~x).
(1)
We first observe that by our hypothesis~ the left side of (1)-and therefore
t.he tight side as we11-eq uals 0 for all a and {J. If we put a ;;; 1 and
(j ~ 1, then (1) becomes
(Tx,y)
+ (Ty,x)
(2)
O;
:;==
(3)
0.
Dividing (3) by i and adding the result t-0 (2) yields 2{ Tx, y)
(Tx,y) == 0 and the prooi is complete+
O~
so
268
Operators
Theorem D.
PBOOF~
(Txtx)
= (x, Tx) =
(x~ Tx)
(Tx,x)
sho\~ls
that (Tx~x) is real for all x~ On the other hand, if (Tx,x) is real
for all x 1 then ( Tx,x) = (Tx"x) = (x, T*x) = ( T* x,x) or
([T -
By Theorem
C~
T*]x" x)
=0
OJ so T
T*,.
self~adjmnt
operators rm H is a
partially ordered set whose linear structure and order structure are related by
the following properties:
(l) if A 1 < A2t then Ai + A. :=;; A~ + A far eyery A;
(2) if Ai < A2 and a 2:: 0, then a A1 :5'. qA2.
PROOF.
The relation in question is obvious.Ir reflexive and transitive
(see Sec. 8) To shov-r that it is .also antisymmetric, \ve assume that
A 1 < Ai and A2 < A 1 ~ This implies at once that ([Al - A ~Jx, x) :.== 0
for all x, so by Theorem C, A1 - Az ~ 0 and Ai= A1~ The proofs of
properties (l} and (2) are easy. For instance, if A 1 < A2, so that
(A1x,x) S ( A!X,X) for all x 1 then (A 1xJx) + (Axtx) < (A iX,x) + (Ax,x)
or ([A1 + A]x, x) ~ ((Aft+ A]x 1 x) for all x, so A1 +A < A2 +A+
The proof of (2) is similar.
+
Theorem F..
(I+ A)x
==
(~x~x) == -l~xll 2
>
O~x
= O~
HiJbert Spaces
269
proof by observing that M = H; for otherwise there \vould exist a nonzero vector Xo orthogonal to M, and this would contradict the fact that
(xoJ [l + A]xo) = 0 ~ llxoll 2 == - (Axo,x-o) S 0 ~ Xo = 0.
If the reader wonders why we fail to show that the partially ordered
set of all self-adjoint operators is a lattice, the reason is simple: it isn't
true. As a matter of fact, this system is about. as far from being a lattice
as a partially ordered set can be 1 for it can he shown that t \\~o opera tors
in the set have a greatest lower bound {=} they are comparable. This
whole situation is intimately related to questions of commutativity for
algebras of operators and is too complicated for us to explore here+ For
further de ta il.s, see Kadi.Bon I22].
Probtems
1..
snd Ao I
= (A 1 o A-:i:) o A3
2.. If T is any opera tor on Ht it is clear that l( Tx,x) r < HTx [I II x II <
II TU l]x]1 2 j so if H ~ f0} ~ we have sup ~I (Tx~x)I /llxl] 2 :x r! 0} < [I Tl].
Prove that if T is self-adjoint~ then equality holds here. (Hint
write a = sup f I (Tx~x)[ /!lx[l 2 :x r! 0} = sup rl (Tx,x)!: HxH = 1 l, and
show that l Txll ~a whenever llxll = 1 by putting b = llTxl]H~if
Tx ~ o--and considering
4ll Tx[l 2
~ (T(bx
+ b~ Tx), bx + b-irx)
1
+ ltbx
- b- 1 TxW~J
= 4a[I Txll4)
270
Operators
N*Nll
~ UNN* - N1N1!ll
HNk*N1: ~ N*N[] = !INN* -
+ HNkNk* - Nk*NlU
NkNk*ll + llNk*NJt; -
NNH ~
o,
so the assumption implies that each commutes with the adjoint. of tha
other. To shov{ that Ni+ N2 is normal under the stated conditions, we
have only to compare the results of the follo\ving computations:
(N1
+ N2){N1 + N2)*
and
(N1
+ N-i.)*(N1 + N2) =
= N1*N1 + N1*N2
+ NJV2*
+ N'l.*N1 + N"J*N2~
= N1Ni*N2N1*
=
N,,*N1N1*N.,,
Ni*N1*N1N2 = (N1N.,,)"'NJN2.
=
Hj Ibert Spaces
271
Theorem
c.
PROOF.
llT*xll
!ITxrl ~ ltTxrl 2
:::::=
IJT~xu
==
!ITxH 2 Q
(T*x,T*x)
(T*Tx,x) <=>([TT* -
T*T]x,x) = OL
for every x 1 and this impiies that ll..L~,l 2 ll = llN*NI~have [IN*NH = [! 1'l !I2, so the proof is com plcte.
= llNfl'l.
By Theorem 56-A, we
The self~adjoint operators Ai and A~ are ca11ed the real part and the
imagi-nary part of 1'.
W c emphasized earlier that the complicated structure of ill (JI) is
due in large part to the fact that operator multiplication is in gcneraJ
non-commutative. Since our future '"Tork ~~in be focused mainly on
normal operatorsi it is of interest to scc--as the following theorem show.s---that the existence of non-normal operators can be traced directly to the
non-commutativity of self-adjoint opera tors.
Theorem E. If T is an operator on II, then T is norm.al
imaginary parts cmnmute.
PROOF.
'l.'
.;;>
TT*
= (Ai
+ iA2)(A1 -
iA2)
= A1 2 + A2~ + i(A2A1
- A1A2)
and
A2A1)~
272
Ope rotors
Theorem f.
(Tx~Ty)
(x~y)
HTxlj =
0.
T {X1,
Xi,
J =
o, xi, X2J
inverse~
i~
Theorem G.
:;:=
Hilbert Spaces
273
Problems
1.
2.
3~
4..
59.. PROJECTIONS
Aecording to the definition given in Sec. 50t a projection on a Banach
space B is an idempotent operator on B~ that iBt an operator P with the
property that P 2 = P. It "~as proved in that sect.ion that each projection P determines a pair of closed linear subspaces Ai and N~the range
and null space of P~sucb that B == Al $ N, and 3.lso~ conversely, that
each such pair of closed linear subspaces .ll1 and N determines a projection
P with range J.lJ. and null space N. In this way~ there is established a
one-to-one correspondence between projections on B and paLn; of closed
1inear subspaces of B ,,..-hich span the whole space and have only t.he zero
vector in common.
The context of our present \\"Ork~ however, is the llilbert space H,
and not a general Banach space~ a.nd the structure \Yhich H enjoys in
addition to being a Banach spacr rnableB us to single out for special
attention those projections whose range and nun space are orthogonal.
Our first theorem gives a convenient characterization of these projections~
Theorem A,, If P is a proiectinn on H with range M and null space N,
tlten M .l N ~ P i8 self-adjoint, and in this case, N = M 1- ~
PROOF.
Each vector z in H can be written uniquely in the iorm
z ~ x
y with x arid y in M and N. If M .l N, so that x .l. y, then
p = P will follow by Theorem 57-C from (P*z,z) = (Pz,z); and this
is a consequence of
4
(P*z,z)
(z,Pz)
(z~x) -
(x
+y
x)
(x,x)
+ (y,x)
(x,x)
27.4
Operators
(Px,y)
x E. M ~ Px
= x {=} ![Pxl1
llx11:
The first equivalence here was proved in Problem 44-11; and since for
every x in H we have
1lxl! 1 =
UPx
+ (I -
P)x!i 2
= \!Pxli:a + H(I -
P)xlli,
(1)
If z
(2)
Hirbert Spaces
275
PROO!t,L
PROOF.
~/
Our next theorem shows how projections can be used to express the
statement that tl\;rO closed linear subspaces of Hare orthogonal.
Theorem F.
276
Operators
Theorem G.
If Pi,
P2t
... ,
M == M 1
+ Jf +
!
+ MB
Since P is clearly self-adjoint, it is a projection ~ it is idempotent. If the P/s are pair,vise orthogonal~ then a simple compu t.ation
shows at once that Pis idempotent. To prove the converse, we assume
that P is idempotent. Let x be a vector in the range of p,.j so that
x ~ P"x. 1'"hen
PROOF.
TC
[]xB~
]~PtTi[~
<I
;'=-l
HP;xllr
(PJx,x)
(Px1x) ~ []Pxli 2
::;
Hxll 2
j=l
We conclude that equality must hold all along the line here, so
n
2
llP1xllt
=
llPall
j=l
and
Thus the range of Pi is contained in the null spa.cc of P1J that is~M, ~ Mi1-,
for every j ~ i. This means that Mi ...L M, whenever i # j, and our
conclusion that the P/s are pairwise orthogonal now follows from the
preceding theorem. We prove the final statement in two steps. First,
we observe that since HPxU = Uxl] for every x in M,, each Mi is contained
in the range of P ~ and therefore M is also contained in the range of P ~
Second, if xis a vector in the range of P, then
z = Px -== Pix
is evidently in M
+ P2z + + P,,.x
There are many other ways in which the algebraic structure of the
set of all projections on H can be related to the geometry of its closed
linear subspaces, and several of these are given in the problems below.
Hilbert Spaces
277
1.
2.
another:
(a)
(b)
< Q;
HPx!I < HQxl1
for
every x;
(c) .tlf c N;
(d) PQ = P;
(e) QP = P.
(Hint the equivalence of (a) end (b) is easy to prove, as is that of
(c), (d), and (e); prove that (d) implies (a) by using
(Px,x) ~
3.
4.
IJPxll 2
= ~1r~Qxlr 2
< 11QxH 2 =
(Qx}x);
CHAPTER ELEVEN
Tx == AX.
(1)
A non-zero vector x such that Eq. (1) is true for some scalar A is called
an eigeniector of T, and a scalar X such that ( 1) holds for some non~zero x
is called a11 eigenvalue of T. 1 Each eigenvalue has one or more eigenvectors a.swcia ted 'vi th it~ and to ea eh eigenvector there corrcspo nds
precisely one eigenvalue. If JI has no non-zero vectors at alli then T
certainly has no eigenvectors~ In this case the ~\thole theory collap~es
into triviality, so we assume throughout the present chapter that
H ~ ~O}.
Let X be an eigenvalue of T, and consider the set M of all its corre..
sponding cig-cnvect.ors together with the- vector 0 (note that 0 is not an
eigenvector). Mis thus the set of all vectors x which satisfy the equation
(T - J,,l)x
= 0,
and it is clearly a non-zero closed linear subspace of II. ~le call M the
eigenspace of T corresponding to A+ It is evident that ~Y is invariant
under T and that the restriction of 'T to ~Y is a very simple operator~
namely~ scalar multiplication by k.
In order to place the ideas of this chapter in their proper framework,
1
The equivalent terms ckaraden'slie vector a.nd charac.t.eristie iuluei and proper
vect-Or and pr-oper value, a.re UBed by many write~.
278
279
l\~e
\\rhere
+ X2 +
X1
~ ~
. + X~,
(2)
Tx
+ Tx2 + + Txm
A1X1 + A2X~ + + AmXm
It now
Tx 1
(3)
i we have M 1
(4)
~
yields
and it follows at once from this that
Ix = x =
=
for every
xin H, so
I
+ x~ + +
P1X + P'JX +
+Pm::c
(P1 + P2 + + Pm)x
X1
Xm
+ P2 + +Pm.
P1
(5)
+ X2X2 + ~ + Amxm
== A1P 1X + A2P 2X + ~ ' + ).,_mpm,X
Tx = A.1x1
(X1P1
+ A2P2 + '
~ ~
+ XmP .,,.)x
for every x, so
(6)
280
Operators
1
[
X1_, X2~ . .
= {
o, X1, X2,
i.
T*
X1P1
+ 'X;P:i + + h:Pm,
TT*
= C\1P1 + XJ'i
and, similarly,
This entire circle of ideaB will be completed in the neatest possible way
if we can show that evecy normal operator on H satisfies conditions (a),
{b), and (c), and therefore haB a spectral resolution. Our aim in the
present chapter is to prove this as..~rt.ion~ which is known as the spectral
theorem, and the machinery treated in the follo,ving sections is directed
exclusively toward this end+ We emphasize once again that His understood to be finite-dimensional with dimension n > 0.
60 . MATRICES
Our first goal is to prove that every operator on H has an eigenvalue_,
and in pursuing this we make use of certain elementary portions of the
281
Te;
~Jet..
-1
(1)
Ciij
[T] =
(2)
4'
(3)
The reader should make sure that he has a perfectly clear understanding
of the rule according to VtThich the matrix of Tis eonstn1ctcd: lA:rrite 1'fei
as a linear combination of el~ e~, . . . , ei-i, and use the resulting coefficients to form the jth column of [1']~
We offer several comments on the above pa.ragTaph. First:t t.hP
term mairix has not been defined at all, but only ~"'t.he matrix of an
operator relative to an ordered basis/' .l'1. matrix~defined simply as a
square array of scalars-is sometimes regarded as an object 'vorthy of
interest in its own right. For the most part, however, we shall consider
a matrix to be associated with a definite operator relative to a particular
ordered basis, and \.Ve shall regard matrices as little more than romputa~
tional devices which are occasionally useful in handling operator~+ Next,
the matrices we work with are an square matrices. R.cctangular matrices
occur in connection with linear transformations of one linear space int.a
another and are of no interest to us here. Finally_, 've took B to be an
282
Ope rotors
ordered basis rather than merely a basis, because the appearance of the
array (2) clearly depends on the arrangement of the e/s as well as on the
e./ s themscl ves. In most theoretir,al considerations, however t the order
Qf the rows and columns of a matrix is as irrelevant as the order of the
vectors in a basis. For this reason, we usually omit the adjective and
speak of ''the matrix of an opera tor relative to a basis.''
By using the fixed basis B ~ {e;}J we have assigned a matrix:
[T] = [rui] to each operator T on H, and the mapping T-4 {T] from
opera to rs to matrices is described by Tei = ,;711:11 1 a.ijCi+ The importance
of matrices is based primarily on two facts: T --4 [T] is a one-to-one
mapping of the set of all operators on H onto the set of all mat.rices; and
algebraic operations can be defined on the set of all ma trices in such a
manner that the mapping T ---+ [ T] preserves the algebraic s lruc t ure
of IB(H).
1;he first of these st.a temen ts is easy to prove. If vle know that
[aiil is the matrix of. T, then this information fully determines 1 x for
every x; for if x = ~j i t31-eh then
1
Tx
I: {3;Te5
I: f3i cI Uijei)
I <I: O:ijfjj)et: .
j=l
Pl
j=]
~=I
.; ... 1
.::i:i:-t
This shov{s that T---:,, [TJ is onc-to~one. We see that this mapping is
onto by means of the follo1,dng reasoning~ if [ai1] i8 any matrix~ then
Tei = !~_ 1 auei defines T for the vectors in B 1 and when T is extended
by linearity to all of H, it i8 clear that the resulting operator has [aii] as
its matrix.
To establish the second statement, it suffices to discover ho"Pr~ to add
and multiply two matrices and how to multiply a matrix by a scalar, in
such a way that the follov.Ting matrix equations arc true for all operators
T1 and T2 on H: [T1 + T~d = [T 1] + [T2], faT1] ~ a[T1]J and
fT1T2J
Let
[~;]
[T1UT2J+
+ T2)e;
--rhe computation
+ T2ei
= I: ai;e i + I: f31J-ei
= T1e1
1t
-1
i=-=l
..: :o::i:l
(ai;
+ /3i;)ei.
283
[lli11 + [tji11
[T1 + T2J
then we obtain
= [ai:j + PiiL
= [T 1J + [T2].
(4)
a[.:liJ]
(aT1]
then
= [aa.ij],
= a[T1]~
(5)
(T1T2)e, = T1(Tf].e1)
T1
(I: ~k,~k)
k==l
I {3k-;T
= I
(I: airre1)
~ I (I aa~k;) ei
tC.k
t-= I
n
t'.ti
k=I
i=-=1
i=l
k=l
{ai;] [tft:;] = [
o:ud31t:i ~
(6)
k= l
then we get
{T1T2] =
(T1HT~J.
The operations defined by (4), (.5), and (6) are the standard algebraic
operations for matricesa In v_rords, i,.vc add t\vo matrices by adding
corresponding entries~ and we multiply a matrix by a scalar by multiplying each of its entries by that scalar. The verbal description of (6) is
more complicated, and is often called the ro1D-by-column rule: t.o find the
entry in the ith row and jth column of the product [aij][.8..:i], take the ith
row (ail, 0-~2, . . ~ , "in) of the first factor and the ith co1umn (ft1J, /j2;,
. . , fJni) of the second, multiply corresponding entries, and add:
n
~ 12iJ:{jkj
~J3Ii
O!.f"J./32;
+ r:l1nf3ni
k e. l
It is worth noting that the image of the zero operator under the mapping
T--+ [TJ is the zero matrixJ all of '"~hose entries are 0. Further, it is
equally clear that the image of the identity operator is the identity matrix i
which has 1is down the main diagonal (v..Thc re i = j) and O's el sew here.
If we introduce the standard Kronecker delia 1 which is defined by
aij
0
{ 1
if i ~ :i
if i = j J
[a1JJ~
284
Operators
We no\V reverse our point of view for a moment (but only a. moment)
and consider the set An of all n X n.matrices as an n.lgehraic system in its
ovrn right, with addition, sca.lar multiplication, and mu1tiplication defined
by (4) 1 (5), and (6). It can be verified directly from these definitions
that An is a complex algebra with identity (the ide..ntit~{ matrix), called
the total matrix algebra of degree n. If we ignore the ideas leading to
(4) (fl), and (6)~ then the structure of An is defined, and can be studied,
'Yithout any reference to its origin as a representing system for the operator~ on H~
This a.pproar.h \Vou1d make very little sense, ho1vever, because
the primary reason for considering matrices in the first place is that they
provide a {~ompulational t-001 'vhich is usP-ful in treating certain aspects
of the theory of t..hesc opera tors~
Let us return to our original position and observe two facts: that
IB(H) is an algebra; and that the structure of An is defined in just such a
\Vay as to guara ntce that the one-to-one mapping 7r ---+ [ 7'1 of ~ (H) on to
An prescrveH .addition, scalar multiplica.tioni and multiplication. It IlOl\r
fol1o\vs at once that An is an algebra, and that T ~ [T] is an isomorphism
(see Problem 4f>-4) of C(H) onto A 11
\'le give the following formal summary of our work so iar.
t
for every matrix {~.'.il; and by the theory of rings, this identity is unique.
A matrix [001] is said to be non-singular if there exists a matrix [t3i1] such
that
1O.ij JLB~; J =
LB.;] [Orij]
= [8iJ] ;
and, again by the theory of rings~ if such a matrix exists~ then it is unique,
it is denoted by [aiJ]- 1 , and it is called the inverse of [a#].
These ideas are connected with operators by the following considerations. Suppose that [ai 1] is the matrix of an operator T relative to B.
285
We therefore have
Lei B be a basis fur H, and 1' an operator tDhose m.-atrix
relative ta R is [aij]. Then Tis non-singular (::::} [~i] is non-singu.la.r, and in
this case [at;}- 1 = [T- 1]~
Theorem B.
Te; : : :;.
I. a.i,-ei
i-1
and Tf1 = Z7==1 /31ff1T J...ct ['Yi;] be the matrix of A relative t.o B, so that
Ae; = 2;~_ 1 'Yii~- By Theorem ll, [~ii] is non-singular. -~le now compute
Th in two different ways:
n
Th
f31c1fi;
I. ~qAl\t
I <I. 'Yilt\)
~ ( I: 1'ikf1~;) ei;
==
l::cw:l
.1::-1
a
n.
fl1t:i
i:-1
n
i=l
it>;ICLl
i=l
and
Th
===
TAei
(I. 'Y;et)
~ :!: Y1c~Te1i;
~ I,
{I: ai:te1)
I (I aik'Yk;) ei.
=T
k=l
:cot}
n.
;r:i
Yki
1:=1
n
i::::!:El
,:-1
k:::ml
286
Operators
or
(7)
['T]n1
=:;
Theorem Ct
1~
2.
3.
4~
5.
8 - cos
sin 8]
()
c?s
[ sin fJ
6.
287
and
[eift
0
O ].
e-t8
e[
288
Operators
..
ar~
a1n
:i!:'2
a2n
a~2
. .
. ann
~
'
dct({1'ii]-l[ni.'1[1'i.il)
= det (['Yii 1- 1) det ([ lliiD <let ([ 'Yi1])
= det([ "Yii}- 1) det 1'ii]) det([ O:i;])
= dct(["'f~11- 1 !'YiJD dct{{ 1J)
=== dct({ 61;}) det([ aii1)
= <let{[aiJ]) ~
289
. .
~
..
..
afl i
..
..
..
'r
,.
.,
.,
Onn -
..
Our search for eigenvalues of Tis reduced in this \Vay to a ~earth for roots
oi Eq. ( l). Propr:rty (4 1) tells us that this is a po]ynomial equation,
\vit.h cornplex coefficients, of degree n in the c.omp]ex variable X. \'1e nov{
appeal to the f!1ndame11tal theorem of a]gebra, \vhich guarantee~ that an
equation of thi8 kind al\vays has exactly n complex routs+ Some of these
roots may of course be repeated, in \vhich -ease there are fe,ver than n
distinct roots. In summary, lre have
If 'I' is an arbitrary operator on H, then the eigenvalues of T
constitute a non-em.pty fin1~te subsel of the com.plex planea F'uriherntore, the
num.ber of P'n~nts in thi.~ set does not excee.d th.P dim.ension n of the space !Ir
Theorem A.
1~he
1.
:;;-
a(T);
290
Operators
if X e: u( T), and if p is any polynomial, then p(~) an (p( T));
if T~ = 0 for some positive integer k, then cr(T) = {O}.
Let the dimension n of H be 2, let B = fe 1~e2} be a basis :for H 1 and
assume that the determinant of a 2 X 2 matrix [ai.1] is given by
(d)
(e)
2.
a11a22 -
(a)
(b)
auau.
i:.
llTx - Xxll
!IT*x - Xx[I
for every vector x, and the statements of the theorem follow e..t once from
this.
The way is now clear for
Theorem B.
291
PROOF.
Ai(Xi, x1)
and since Ai
~ Xh
(J\ai, Xj)
= (Txi Xi)
1
=:::;
= (Xi,~X;) =
~i, T *xt)
Xi(Xi,Xj);
(xi~x1 ) =
0.
Our next step is to prove that the M/s span H when Tis norma1, and
for this we need the following preliminary fact.
Theorem C.
If Tis normal,
p = P1
+ P2 +
+Pm,.
Since each M" reduces T, we see by Theorem 159-E that TP,; = P..:T for
each Pi. It follows from this that TP ~PT, so M also reduces T, and
consequently 21! 1- is invariant under 7ra If i.lf 1- ~ 0}, then~ since all the
eigenvectors of T arc contained in M, the restriction of T to Ml. is an
opera tor on a non-trivial finite-dimension.al H i1 bcrt space which has no
eigenvectors, and hence no eigenvaluesa Theorem 61-A shows that this
is impossible~ We therefore conclude that M 1- = {0}) so M = H and
T ;: h1P1
+ AIJP2 +
4
'
+ AmP~~
(1)
T2
)..,/P1~
oja.1
Tn
==;
.I:
1-1
')...i'* Pl.
(2)
292
Operators
If ,vc make the customary a-greemcnt that T 0 = I, then the fact that
I = :z 1: 1 P"' shows that (2) is also valid for the case n = O~ Next, let
p(z) be any polynomial, \vit.h eomplex cocffieicnts 1 in the complex va1ia.ble
z. Ry taking linear combinations_, (2) ca.n c'idently be extended to
m.
p(T)
(3)
p(Xi)P1.
t .1::11" 1
\\re 'would like to find a polynomial p such that the right side of (3)
col.lapses to a specified one of the P/s, say Pi ''rhat is nccd~d is a
polynomial p1 '\vith the property that Pi(X~) = 0 if i ~ J and PJ(X1) = 1.
\1.,le delinc Pi as f ollo\VS:
pz
( )
)
(z -:- X1)
(z - Ai-t) (z - AJ+t) (z - Am)
.
(Aj ~ h.1) ' ('J...j - AJ-1) (AJ - A1+1) ~ ' (A - ~m)
+
=:.:
(4)
Pi= P1(T).
in T.
We no'v as8nmc that
"\VP
and that this is also a Hpectral resolution of T, in the sense that the a/s
are distinct complex numbersi- the Q/s arr. non-zero pairwise orthogonal
projections, and I =
";-e wish to show that (5) is actually
1 Qj.
identical ":-ith (1), except for notation and order of terms. We begin by
proving, in t\vO steps,. that the a/s are precisely the eigenva1ues of T .
.First, since Qi. ~ O, there exists a non-2'ero vector x in the range oi Q*; and
since Q~-x = x and Qix = 0 for j ~ i, we see from (5) that 71:c = atX, so
each ai. is an eigenvalue of '1'. Next, if A is an eigenvalue of T, so that
Tz = Xx for some non-zero x, then
2::
Tx = Xx = A.Ix = A
!r Qa I
===
~=l
il
and
so
Tx =
i (X i=-1
!r a1Qat
il
lti)Qix
0.
XQa
293
Since the Q.-x' s a.re pairwise orthogonal, the non-zero vectors among
them~there
(8)
294
Operators
\\here the -X/s are distinct complex numberst the P/s are non-zPro pairwise
orthogona1 projPr.tions, and I = :!:f' 1 Pi. rrhc uniqucnes8 of the spectral
resolution no\v guarantees that the A./s are the distinct eigenvalues of T
and that thP P/s are the projections on the corresponding eigenspaces.
The spectral thPorPm tPlls us that statements It II, and Ill are equivalent
to one ano t.her. 1~he a hove remarks earry us a bit f urthcr i for they
constitute a proof of the fart that these statements a.re also equivalenl to
IV. There exists an orthonormal basis for H relative to \vhich the
matrix of 1T is diagonal.
It i8 interesting to realize that the implication III ===:>IV, which '\Ve
proved by sho\\"i ng that Ill ==? I a n<l 1 ~ I\!~ can be made to depend
more direet]y on matrix computations. 'This proof is outlined in the
]ast three problems below.
Problems
l.
24
3.
4.
5.
64
{ 0}
C M 1 C ltf ~ C
C !Jf" == II 1
(Hint.. if n
7.
295
actually diagonal.)
'
"'
IA?t
(1)
:i= 1
In our version, we attemptred to give equal emphasis to both the geometric and the algebraic sides of the matter. Most v-.""riters~ however,
con fine their statement of the theorem to that given above~ and for a
very good reason: it is ( 1) tba t generalizes to the infinite-dime nsiona I case~
There are two ways of carrying out this generalization, and we give a
brief description of each.
First, there is the analytic approach. For the sake of simplicity t "\Ve
consider a self-adjoint operator A, and we write (1) in the form
(2)
Our reason for making this assumption is that the eigenvalucB of A are
real numbers and a.re therefore ordered in a natural way~ We further
assume that the notation in (2) is chosen so that A1 < x, < . < ~m,
and we use the PlB to define new projections:
EA. = O;
E>.1 = P1;
E).I :=: P1 + P2;
ii
..
..
E~=P1+P,+
..
..
..
..
. +Pm.
296
Operators
+ A2P2 + ~ + "AmP~
X1(b').l - E).(}) + X2(E>..* - EAJ + ... ~ + A-m(Ex.,.
A = X1P1
- E'}.,1J'o_J
?Pl-
Ai(E}.j -
EAi-1).
t: =I
A ::::::
=I
N =
).J',
1=1
For a. general discuBBion of t.he spectral thoor~m from this .an.a.lytie point of
view, see LoTch [28). A full trep.tment can be found in Riesz and Sz.-Na.gy [3..3,
chap. 7].
1
2'17
p(1"l) -= ~ p("A~)Pi
(5)
i=l
p(1V)
~ p,
(G)
PART THREE
Algebras o. Operators
CHAPTER TWELVE
(jeJteral
Preliminaries ou !!auack Algebras
Our work in Part 1 of this book was primarily concerned with
topological spaces and the continuous functions (~arrie<l by them.
The ideas of Part 2, on the other hand, \Vere essentially algebraic in
naturcr The function spaces \Ve encountered earlier led us to begin a
study of Banach spaces for their ov.rn sake~ and as \ve procccde<l 1 we
found our attention focusing more and more closely on the properties
of their operators. Except for a few elementary notions about metric
spacesJ v,re used very little genuine topology in Part 2. As a mat.ter of
fact, our treatment of spectral theory in Chap. 11 'vas completely independent of topology~ for in the fi ni te-dimen sional case, all Ii near trans-
302
Algebras of Operators
[lx~(YR
- y)
+ (Xn -
+ Uxn -
x[I HY ti)~
(a)
303
<I
i=-=1
where
~tui)
CI
,, I: ====
"
/jj(Jj)
Jflffl
'Ykulq
(1)
k-1
g"j
(J-;-fl~ aJ3,.
(2)
s~e
304
Algebras of Operators
over all subscripts i and j such that g~-g1 = (}1t In effect, thereforeJ we
formally multiply out the sums on the left of (1), and we then gather
together all the resulting terms which contain the same element of G ~
With these ideas as an intuitive guide, we revert to our first point of
view, in which the clements of L1{G) are functionsJ and we see that our
definition of multiplication can be expressed in the following way. If
two functions f and g in JJ 1(G) are given~ then their product, which is
denoted by j * g and ca]led thPir convolution7' is that function whose
value at g1: is
(f
* g) (g,J
~Oi!Jj=tHr f(gi)g(gi)
n.
(3)
ffrh:YJ-l)g(gi).
j=l
IJ/ * o~I ~
==
t (f * o)(gk)I
t I t f(gkg.i~)g(g;)I
t
k =-= l
71
1-1
j:c::l
t t
tL
ft
<
k-li-1
n
n
I.
t
If (gkgj- 1)I Iu(gj)
1-1
= I lg(g1)I t lf(OlUii-1
= I. Jg(g;)[ l!/H .
:i
=
k~l
)[
i:l
-n.
= 11/U
i =-l
IY<ui)I
= 11111 lluH-
f...
Let G
~ -2, -17'
305
are defined pointwise, the norm by ~I/II = I::~ ~f(n)', and the convolu.tion off and g---see Eq. (3)~by
~
(f * g) (n) ~
I.
m= -
f(n - m)g(m).
llJ
IjMB Il
SU p
{ lIM4 ( x) lI : Hx 1!
<
I ~
:=:.
~I a 11)
Loomis [27j is t.he standard reference in thls subject. For a general exposition
of the main id,eas~ see Mackey [30]4 A brief treatment or the classical analyeie which
underlies the modern theory can be found in Goldberg fl4JT
306
Algebras of Operators
Theorem A. Every element x for which llx - I 11 < 1 is regular, and the
inverse of such. an element iB given by the formula X- 1 = 1 +
l ~ x) n ..
PHOOF.
If we put r = llx ~ l[I, so that r < 1. then
i;=- (
shows that the partial sums of the series 2;:=- 1 (1 ~ x)~ form a Cauchy
sequence in A. Since A is complete~ these partial sums converge to an
element of A~ which we denote by ~~::::ic 1 (l - x)n4 If we define y by
y = 1 + ~::.. 1 (1 - x)tt, then the joint continuity of multiplication in A
implies that
OD
JI -
xy =
(1 - x)y
(1 - x)
.m;i.
+k
(1 - x)n
n=2
so xy = 1.
Similarly, yx
:k (1 -
x)ft
y -
1,
n l
I.
Theorem B.
that
Let
Xo
Hx - xoH <
be
an
<
X- 1Xo =
(xo- 1x)- 1 = 1
n=l
(1 - xo- 1x)~.
< ~2
307
l!x- 1
xo- 1 H
l (x- 1xo
~ l)xo- 1 11
II ~
n~l
==-
J!xo- 11
1
(1 -
XQ-
n~l
x[I"
[j 1 ~ xo- 1xl1
llxo- 1 11
Hxn-1 l
-
1~
<
I; H1 ~ xo- xlt"
1
ncmO
1ll -
Ul
xo- 1xU
~ xo~ 1 xl1
308
Algebros of Operators
llrn- 1Z
1!I
<1
for some n, so that r n -lz~ and the refore z = r n (r n -lz) J would be regular.
Since {r "- 1 J is un bounded 1 we may assume that Hr ft.- 1 II ~ co If Zn i~
now defined by z1J = rn - 1/l]rn - 1 then our conclusion follows from the
observations that Hzfll! = 1 and
a,
zz~ =
ZTtt-l
llru H=
l+(z - rn}rn-l
-1
l!r.-111
- Ur"-1 [! + {z ~
rn)Zm ~ o~
J'}-..:
In the previous
T - XI is singular} J
Xl is singular}+
309
observe that a(x) i...~ a subset of thP eloscd disc fz: lzl < llxl~} t for if X is a
complex number such that IXi > ~lxll ~ then ~Ix/All < 1t I! 1 - (1 - x/X) II
< 1, 1 --- x/X is regular, and therefore x - Xl is regular.
Our first task is to establish the fact that a(x) is al,vays nou-empty,
a.nd for this we need a few preliminary notions~ The resolvent set oi x,
denoted by p(x)t is the complement of a(x); it is clearly an open subset
of the complex plane which contains {z: lz! > lix~l l. The resolvent of x
is the function with values in A defined on p(x) by
Theorem 65-C tells us that x (X) i8 a continuous function of >.. ; and the
fact that x(X) = x~ 1 (x/X - I)- 1 for X ~ 0 jmplies that x(k) ~ 0 as
A ---i- ~. If "J.. and are both in p(x)~ then
x(X) ;; x(X}[x - ,ul ]x()
= x(X)[x - 1\1 (>-. - ) l]x()
= [1
(J\ - )x(X)]x()
x("J...)
+ (A -
)x(X)x(.u) ~
x (.u) = (A - )x (;\)x ()
= x()
so
f (A)
f (If~ = f (x(A}x())
A-
/(x() 2 }J
A - .
p(x)~
Further 1
--+
310
Algebras of Operators
sup
is called the spectral radius of x. It is clear that 0 < r(x) < l~x\I
The
eon ce pt of th c spec t. ra1 radius \Vill be uscf ul in r,e rtain parts of our later
a
"Tork.
algebras.
Since 0 is a divisor of zero, it is a topological divisor of ze10 in every
Banach algebra. In the Banach algebra G, 0 is plainly the only topologieal di visor of zero. Conversely, \Ve have
311
IJxy II >
KJlx It I~ y H
PROOF~
Theorem E.
Ii/II
= sup
=
sup
1lf(z)1: lzl
f l/(z)I: lzl
< lI
~ 1}
This allows us to identify A with the Banach algebra of all the restrictions
of its functions to the boundary of D, which is a Banach subalgcbra of
A I = e( r z : j zl = l J).. If we now consider the element f in A defined by
f (z) = zJ then it is easy to see th& t u A(/) equals D and that uA, (f) equals
the boundary of D.
312
Algebras of Operators
UXI : A e: uA ( x) }.
Now let A' be the Banach sub.algebra of A generated by x, that is, the
closure of the set of all polynomials in x~ Theorem 67-E shows that r(x)
has the same value if it is computed with respect to A':
r(x)
===
This suggests qrnte strongly that r(x) depends only on the sequence of
powers of x. The formula for r(x) is given in Theorem A below, and our
purpose in this section is to prove it~ It is convenient to begin with the
following preliminary result.
Lemma..
a(xn)
= q(,x)ta.
x" - XI
X~l)
(x - A1 l)(x -
~lJ
>..2 1
..
Xn its
(x - X.l).
The statement of the lemma follows easily from the fact that x" - Xl
is singular <=> x - '>..~I is singular for at least one i~
Theorem A. r(x) = lim l[xnll lln.
PROOF.
Our Jemma shows that r(xft) ;:::;: r(x)'\, and since r(.x") <
l!x~ll, we have r(x)~ < jlx~I[ or r(x) < l~xnU 1 !n for every nr To conclude
th~ proof, it suffices to show that if a is any real number such that
r(x) < a, then .llxH 1 1~ ::;; a for all but a finite number of n,s, and this we
now do.
It follows from Theorem 65-A and our work in Sec. 67 that if IJ..I >
llxl~, then
x(X)
(z - >.1)~ 1
-~~1
-).-1
:;:
x - I
>..- 1 ( A
)-1
x)-1
(
1- A
[1 + ~] .
n-1
(1)
313
f (x(>.))
= -
>.- 1 [ f (1)
+ .. f (~:))
1
+I
l!IG
-A- 1 [/(I)
f(xn)x-n]
(2)
n=l
for all IX! > llxl!~ We saw in the proof of Theorem 67-A that /(x('A)) is
&n analytic function in the region IXI > r(x); and since (2) is its Laurent
expansion for !Xl > Hxll, \ve know from complex analysis that this
expansion is valid for IXI > r(x). If we now let a be any real number such
that r(x) < a < a~ then it follo,vs from the prcc~ding remark that the
series
1 f(x~/an) converges, so its terms form a bounded sequence.
Since this is true for every fin A*, an application of Theorem 51-B shows
that the element8 xft/an form a bounded sequence in A. Thus
i;::_
fl znll 11,. < K 1 fna for some positive constant K and every n. Since
Kr.Ina < a for every sufficiently large n we have llxB ll
< a for an but a
or
1Jn
314
Algebras of Operators
;:::=
li = z
= A ( l - r) ::::: ~ x
xr : x
Al
315
M +Ar
;:::=
{m
+ xr~m e. Mand x e: A}
m +xr
;::=
A and
flMLI
every x}.
(1)
(2)
We now prove that all four of these sets arc the same by showing that the
two sets on the right of (l) and (2) are equal to one another. By symmetry~ it evidently suffices t-0 prove the
Lemma. If 1 PROOF.
s = (1 -
xr)- 1 ~
+ rsx)
(1
+ rsx)(l
- rx)
We leave
1,
(1 - xr)- 1
+ + (xr) + ~
XT
and
(1 - rx)- 1 ~ 1
rxrxrx
A~gebras
316
of Operators
' re
l~z:ery
Theorem B..
Taken
Theorem
together~
C.
Theorem D.
+ i 11 :i e I J.
is clearly an algebra Vr"'ith identity I + I,
I! 1 + I I! ~ jnf { I~ I + if I :i e I} < I! 1 ;1 =
l~x
Further, A/ I
The
11 (.r
muHipli<~ativc
I} (y
+ 1} I:
<
<
+ JI~
inf 111 x
and
1.
follo~~s:
= rIxy + I [I = inf { 11 xy
i 1.: : i e I J
inf {IJ ( x
i r) (y
i ~) 11 :i 1 1 i 2 e: I J
inf l I": z
i 1 !I I: y
i 2 I! : i 1, i 1 e: I J
[inf { 11 x + i i 1 e I } l ii nf ~ 11 y i 2 I~ :i 2 e.. I H
+
+
ti: :
+
+
:;:;; llx
+ /j[
l]Y +Ill.
317
All that. remains is to show that lll +II~ = I; and since we already have
~I l + Ill < 1, this is an immediate consequence of the fact that
Ill +Ill ~ 1~(1 + I) 2 H < Ul + lllz implies 1 < Ill+ 1[1.
As a final result, we state
Theorem E..
CHAPTER THIRTEEN
?:lte Structure of
MAPPING
Theorem A..
x(M) ~
311
x + M,
319
(2)
(3)
(4)
(5)
(6)
(ax)(M) = ax(M);
(xy)(M) = x(M)y(M);
x(M) = 0 ~ x e M;
l(M) =I;
1x(M)! =
:::;
ilxU.
S(t,
Mo,~) =
< E}.
mapping~
320
Algebras of Operators
properties:
A.
the image
S*
= { f~f
EA* and
llf1!
:$ 1}
321
Our strategy is to id en tify fill 7' both as a set and as a topologi ca.I space,
with a closed subspace of s.
A multiplicative functional on A is a functional! in the ordinary sense
~that is, a.n element of the conjugate space A *-which is non-zero and
satisfies the additional condition f(xy) = f(x)f(y).. Theorem A shows
that to each M in ~ there corresponds a roultiplicalive functional f M
defined by fM(X) =-= x(M). It is important for us to know that M ~ f Mis
a one-to-one mapping of mi onto the set of all multiplicative funet.ionals.
It will facilitate our \vork if we begin by proving the
Lemma. If fl and f 2are multiplicative functionals on A with the sanie null
apace M 7' then f1 = /2~
PnOoF. We first show that f1 = af2 for some scalar a. JJet Xo he
fi(X) = f1(rn)
+ f3f1(Xo)
f3f1(Xo) = rf1(Xo)ff~(XQ))f2(x),
so
a=
0 or a= l.
1~
322
Algebras of Operators
s.
s.
X = ('\ .ttKA
l f: f e. S
== r\~,V'A {f :f e. S*
~ flz~urA { f :f e
~ n~.11tA {f :f
Es
and f (xy)
:;=
f (x )/(y) }
it is easy to see that X is closed in s (note that each of the sets last
written has this property).. We next remark that F1 is continuous on X
and equals 1 on $land 0 at the .zero functional.. It follows from this that
~ is closed in X and is therefore closed in S* ~
323
= lim llxnjl um
IJxnll 11n
llxll
The
llxlL
so (1) implies (2)~ The fact that (2) implies (1) is immediate from
l~:x 2 ll = r(x 2 ) = r(x) 2 = if xii~~ In view of the equation t(x) = Jixll (see
Theorem 70-B), the equivalence of (2) and (3) is obvious.
Our next problem is to devise a VtTay of making Slll'e that the representing algebra A comes as close as it can to exhausting ~(mi) J and we
accomplish this by introducing the follomng property~ A is said to be
self-adjoint if for each x in A there exists an element y in A such that
ii (2Jl) ~ x(.tlf) for every M.
Theorem
B.
By part (1) of Theorem 70-B, we know that A is a suhalgcbra of e(mr) which separates the points of mi and contains th~
identity function~ Problem 20-3 and our hypothesis now tell us that the
closure of A is a closed subalgebra of e(mt) which separates points~ contains the identity function, and contains the conjugate of each of its
functions. Theorem 3&-B {the complex Stone-Weierstrass theorem)
shows that this closure equals e{ mt), so A itself is dense in e( m) .
PnOOF.
theorems~
Theorem C.
324
Atgebrcs of Operators
shows that Xn *---+ x*. A much stronger relation between the involution
and the norm is given by the condition
llx*xll
llxlP\
Hxll 2 ==
l~x*x]l
<
It is
llx*~I ~lxll
shows that
Ux*I~
also true.
325
Theorem A.
::;=
~l-
This result suggests more strongly than ever that there are close
connections between B-algebras and algebras of operators on Hilbert
spaces (see Theorem 5S-D).. We describe the true state of affairs in this
matter at the end of the next section.
73~
form~
in mt.
Our first step is to prove that if x is seJf . .adjoint, then i(M) is real
for every M. We assume the contrary, namely, that there exists an Jr!
such that (M) = a
i/3 with fj ~ 0. Since x is self-adjoint,
y ~ (x - al)/P
Algebras of Operators
326
M~
{m:mEMI
+ ii
we see that
...,.......,.. .........
+ iKl) (M)
and,....,,..........__
(JJ
lly -
iKll! ~
= i(I
+ K)~
ll:v -
iKllJ ~IY
j!(y + iKI)*(y
llY -
+K
s;:
On
...-....
x*(M)
...,.,...,..,, .........
= (y
a..-.
x(M).
c-
CHAPTER FOURTEEN
Some Speeia/
Commutative Ranae/J Algebras
Our discussion in Sec. 63 foreshadowed a generalized form of the
spectral theorem~ and the principal purpose of the present chapter is to
formulate and prove this rcsultr We begin with some additional material
relating to Banach algebras of continuous functions. In particular, we
keep the promise made in Sec. 30 by showing that the Stone.-Cech
compactification of a completely regulu space is essentially unique.
it is the
328
f (y)
Algebras of Operators
~
= fM~(f) =
f(x),
so J = f and e(mT) === e(X)~ We now recall that any compact Hausdorff
topology on a non-empty set is uniquely determined as the weak topology
generated by the set of all its continuous complex functions (see Problem
27-3). It follows from thie that X and ml are equal a~ topological spaces.
We fflmmarize the results of thls discussion in
Theorem A.. Let X be a compact Hausdorff space and mi the space of
niaximal ideals in the commutative B*-algebra e(X). Then to each point
x in X tkere corresponds a maximal ideal M:.: defined by
= 0},
The main idea of this theorem is that the maximal ideals in e(X)
correspond in a natural way to the points of X. Our next step is to
extend this idea and to obtain a similar characterization of the proper
closed ideals in e(X).
We again consider a compact Hausdorff space X, and we begin our
discussion with the observation that to each non-empty closed subset F
329
I(F)
= l f :f e e(X)
and f(F) = O} ~
We define F by
F=
{x~f(x)
It is easy to see that Fis a proper closed subset of X; and since I is contained in some maximal ideal, it follows that F is non-empty. Our
task is to prove that I(F) = I, and since it is obvious that I c I(F),
the real problem is to prove that l(F) C I~ If f is any function which
vanishes on F, we must show that flies in I~ We may evidently assume
that/~ 0, so that {x:f(x) = O} is a proper subset of X~
In the first part of our proof~ we assume that f vanishes on some open
set G which contains F. Since f ~ 0, G' is non-empty and is thus a
compact subspace of X. For each point x in G', there exists a function
gin I such that g(x) 'F 0. The technique used in the proof of Theorem A
can now be applied again~ to yield a finite number of functions g1, g,,
~ ~ ~ J Un in I with the property that at least one is non-zero at every
point of G'. We next define a function go by go = ~i-1 g1g ~ i;:_.1 1o,I z,
and we observe that g0 is in I and that go(x) > 0 for every x in G'. By
the Tietze extension theorem, the function whose valuee on G' are given
by 1/ go(x) can be extended to a function h in e(X). It is easily seen that
goh is in I, that it equals 1 on G', and that f = fgoh, so f is in I.
We now turn to the general case~ For each E > 0 1 the sets Kand L
defined by K = {x: 1/{x) I < ~/2 J and L = {x ~ lf(x) i ~ t} are disjoint
closed subsets of X. K is clearly non-empty, and since j ~ Ot Lis also
non-empty for every sufficiently small ie. We assume that E bas been
chosen at least this small, so that K and L constitute a disjoint pair of
closed subspaces of X~ By Urysohn's lemma, there exists a function
gin e(X) such that g(K) = 0, g(L) = 1, and 0 < g(x) < 1 for every x.
We now define a function k in e (X} by h == f g, and we note that
I I - hll = llf(l -
g) !I ::;
E.
It is evident that h vanishes on the set G = {x: lf(x)I < t/3}; and since
G is an open set which contains F, it follows from the preceding para-
330
Algebras of Operators
graph that his in I. This shows that for every sufficiently small positive
number e there exists a function h in I such that Ill - hll < E, and since
I is closed~ we conclude that f is in /.
We give the following "formal statement of our result.
Theorem B. Let X be a .compact Hausdorff space. Then to eac,h no'Tl. . .
empty closed set F in X there corresponds a proper closed ideal I (F) in
e(X), defined by l(F) = {f :/ e e(X) andf(F) == 0 J; and further, F--+ I(F)
i8 a one...to-one mapping of the class of all non-empty closed subsets of X onto
We have seen in Theorem A that the points and the topology o:f a
compact Hausdorff space X r.an be recovered from the maximal ideals in
e(X)~ Since the maximal ideals in ~(X) are objects of a purely algebraic nature~ it follo\YS that the compact Hausdorff space X is fully
determined, both as a set and as a topological space, by the algebraic
structure of e(X)
These observations lead us directly to
I
...
xt
331
332
Algebras of Operators
f Tt:}
specialize Theorem A to
Theorem B.. I.Jet N be a normal operator on H, and A the commutalive
C*-algebra generated by N. If mt is the space of maximal ideals in A, then
the Gelfand mapping T---+ Tis an isometric *--isomorphism of A onto e(~)~
element of A .
333
1
-
PROOF~
= R(M1} =
functions fl and N*
i?(M1)
J:i(M2)
N*(M2),
so each of the
takes equal values at Mi and M2.
Since A is the closure of the set of all polynomials in N and N*, every
....
............
function in e (~) is a uniform Jilnit of polynomials in !l and N*, and
therefore every function in <3($,) takes equal values at M 1 and M2. We
conclude the proof by observing that since e(mt) .separates the points
of ml, it follows that M 1 = M 2..
334
Algebras of Operators
P.
where
is the function in e(~) defined by ';(>..,.)
from this that
&..;.
It is evident
where the P/s are non-.zero pairwise orthogonal projections in A such that
?:t,1 Pi = I. This is precisely the spectral resolution of N treated in
Chap. 11, so Theorem R actually contains the finite-dimensional spectral
theorem. We therefore have solid grounds for regarding Theorem E
as the generalized form of the spectral theorem discussed in the last paragraph of Sec.. 63, and all our promises are fulfilled.
Appendices
APPENDIX ONE
Let
r------
11
I
I
I
--...--....-. ...... J
-1
Fig. 39
[ ~ l J 1J vnth the property that f (xc) = Xo~ The proof is easy+ We con~
aider the continuous f llllction F defined on {- 1, I] by P' (x) :;::: f(x) - x,
and we observe that F(-1) > 0 and that F(l) :5 0. It now follows
from the Weierstrass intermediate value theorem (see Theorem 31-C
and the introduction to Chap. 6) that there exists a point Xo in [ ~ 1 tlJ
such that F(xo) = 0 or f(xn) = XoIt is convenient to describe t.his phenomenon by means of the follow~
ing terminology" A topological space X is called a fixed point space if
337
338
Appendices
every continuous mapping f of X into itself has a fixed point, in the sense
that /(xo) = Xo for some xo in X. The remarks in th~ above para.graph
show that {-1,1] is a fixed ~~nt space.. Furthermore, the closed disc
{ (x,y) :zt
y 1 < 1 l in the Euclidean plane R" is also a fixed point space
(for a lucid elementary proof of this~ see Courant and Robbins [6, PP~ 251255]) ~ Both of these facts are special cases of
tx ~ llx I <
1}
T'here are several proofs of this classic result~ but since they &11
depend on the methods of algebraic topology-, we refer the reader to Bers
(3, p. 86}. Brouwer' s theorem itself U3 a Bpecial case of
1
and, in general~
d(xm 1Xn)
Xn
==
rnxo = Tx~-1
If m
< n,
then
= d ( Tw'x o, Tnx o)
== d( Ttnx D~ T T~xo)
S r"" d(xo, Tw~o) ~ r- d(xo,x--.)
::; T"' (d(Xo,X1)
d(x1,x~)
~
d(z,. 1,X,.-w1)]
< r d{xoJX1)ll + r
+ r~ 1 ]
< r d(xo,X1).
1- r
+
+
Since r < 1, it is evident from this that {xn} is a Cauchy sequence, and
by the completeness of X, there exists a point x in X such that x.--+ x.
339
X.+t =
z.
Picard's Theorem.. If f(x,y) and afjay are continuous in a cl-OBed Tectangle R == f(x,y):a1 < x <at and b1 Sy~ b,J, and if (xo,Yo) i8 an
interior point of R, th.en tJu differential equation
dy
dx
= f(x,y)
(1)
l/(x,y)[
and
:Y f(x,y)
(2)
<M
(3)
l/(:i:,y1) - /{:i:,y2) I =
for some 8 such that 0
<8<
If(x,y1)
I.
(x,y~)
9(y, - 111))
are
(4)
(5)
g(x) = Yo
or
J.,: f(t,g(t)) dt
+ J~ f(t,g(t)) dt.
(6)
ConverselyJ iI y = g(x) satisfies (6), then it is clear that g(xo) = Yo, and
on ditTerentiating (6) we obtain ( 1). It therefore suffices to show that
the integral equation (6) has a unique solution .
The only uBe we nta.ke of the hypothesis that af/ay exists and is continuous in
R is to derive the so-ealled Lipuh.itz condition ( 5)~
i
:WO
.Appendices
::f""
The fact that T maps X into itself is evident from (2), for
lh(x) -
y~J =
J,: f(t,g(t)) dt
< Ka.
/"': [f{t,g1(t))
f(t,g~(t))J dt
APPENDIX TWO
Jlalt1t-Mozurkiewicz ?:lteore111
A continuous curve is usually thought of as "the path of a continuously moving point~H and this rather vague notion is often felt to carry
v.ith it the even vaguer at tribu tc of 'lthinness/' or "one-dimensionality."
For the case of plane curves, Jordan (in 1887) gave precise expression
to this intuitive geometric concept by means of the following definition~
if f is a continuous mapping of the closed unit interval I = [O,lJ into the
I
I
l
J
-- --+-_..
......
l
Il
+--+~ ~.!.- -
--tI
!
l~
.J__
- +--T--+
I
I
J
11 {l) I
12 (I}
Fig. 40
342
Appendices
then the respective curves are the images of I under continuous mappings
f 1~ f 2l and f ;j of I into S.. The process of constructing these curves can
be continued in the same way, and it yields a sequence of continuous
mappings f. of I into S. By the manner in which ea.ch curve is constructed from its predecessort it is clear that the sequence {/'A} converges
pointwise to a mapping f of I into S; a.nd since this convergence is evi
dently uniform, f is continuous (see Problem 14-4) andf(l) is a continuous
curve in the sense of Jordan~ Furthermore, each point of S 1ies in f (l),
so f{/) is a space-filling curve.
Peano's discovery of space-filling curves was a shock to many mathematicians of the time, for it violated all their preconceived ideas of what
a continuous curve ought to be. To a few of the others~ however~ it
presented an opportunity. It suggested the very interesting problem
of determining what a continuous curve actually is, or in other words, of
finding intrinsic topological properties of a subset X of R 2 which are
equivalent to the existence of a continuous mapping of I onto X~
Before describing the solution of this problemJ we plar.e it in a wider
context by extending Jordan's definition. A topological space X is
called a continuous curve if X is a Hausdorff space and there exists a
continuous mapping of I onto X~ 1 We know that I is compact and
connected, so by Theorems 21-B and 31-B, any continuous curve is also
compact and connec.ted. Jn the lemmas below, we give two additional
properties which every continuous curve must have.
It is convenient to begin by introducing the following concept. A
mapping f of one topological space into a.nother is said to be closed if it
carries closed sets into closed sets, that is 1 if f(F) is closed whenever Fis
closed.. We shall use the fact that a continuous mapping of a compact
space into a Hausdorff Bpace is automatically closed (see the proof of
Theorem 26-E).
Lemma.
3.43
is contained inf-1 (G). By the compactness of J- 1 ({ x}) and the fact that
{Gil is closed under the formation of finite unions, there exists a G, such
that
1-1 ({x} )J
G/
1-1 (G)
I,
(1)
and since the complement of an inverse image equals the inverse image of
the complementJ we can -write (1) in the form
1- 1( lx} ')
G/
~ 1~ 1 (G')~
(2)
or
so
complete~
The above remarks and lemmas estab1ish the easy half of the following famous characterization of continuous curves.
The Hahn-Mazurkiewicz Theorem.. A topological Bpaee Xis a continu()U8
curve ~ X iB a compact Hausdorff space which is second countable, connecteil,.
APPENDIX THREE
Nooleax Algebras,
Hoo!eax /(ixgs, 11Hd Stone$ 7:1teore111
We saw in Sec. 2 that a Boolean alge1Jra of sets can be defined as a
cle..ss of subsets of a non-empty set which is closed under the formation of
finite unions 1 finite intersect ions~ and com pl em en ts+ Our purpose in
this appendix is threefold: to define abstract Boolean algebras by means
of lattices; to show that the theory of these systems can be regarded as
part of the general theory of rings; and to prove the fa mo us theorem of
Stone, whlch asserts that every Boolean algebra is isomorphic to a Boolean
algebra of sets~
The render will recall that a la.ttice is a partially ordered set in which
each pair of elements x and Y. has a greatest lower bound x A y and a least
upper bound xv yJ and that these elements are uniquely detcnnined by
x and y. It is easy to show (see Problem 8-5) that the operations A and
XAy=yAX
x /\ (y /\ z)
(x r-.. y) /\ z
(x A y) V X = X
and
and
and
and
xvx
::::=
.x;
xv y :::;- yv x;
x v (y v z) :;: (x v y) v
(x Vy) AX = X.
z;
(1)
(2)
(3)
(4)
We shall see in the next paragraph that these properties are actually
characteristic of lattices. Bcfore proceeding f urthcr, however, we remark
that
x < y <=> x I\ y = x.
Boolean
Stone~ s Theorem
345
shall prove that. a partial order relation ::::; can be de fined in L in such a
way that L becomes a lattice in \vhich x" y and xv y are the greatest
lower bound and least upper bound of x and -y. Our first step is to notice
that x A y = x a.nd xv y = y are equivalent; for if x A y ;::::; x, then
xv y = (x " y) v y = (y A x) v y = yt' and similarly xv y = y implies
x /\ y = x. \\re now define x < y to mean that either x" y ;: : :. x or
xv y :;;;::; y. Since x Ax = x, '"Te have x < x for every x. If x < y and
y < x, so that x A y = x and y,... x ~ y, then x = x A y = y f\ x = y.
If x < y and y < z, so that x A y = x and y A z = !J 7 then
x
z ~ (x
y)
(y
z)
x:P
so x < z. This completes the proof that < is a partial order relation+
\"\le no'\v sho\Y that x A y is the greatest lower bound of x and y. Since
(x A !J) v x = x and (x A y) v y = (y A x) v y = y :t '\\"'e see that x A y < x
and x A y ~ Y~ If z < x and z < y 1 so that z Ax ==- z and z A y = z, then
z /\ (x A y} = (z A x) A y = z A y = z, so z < x I\ y It is easy to pro vc,
by similar arguments, thu t x v y is the 1east upper bound of x and y.
This chara<~terization of lattices brings the theory of these systems
somewhat closer to ordinary abstrart a1gebra, in which operations
(instead of relations) are usually pl aced in the foreground.
A lattice is said to be distributii\e if it has the follov.ring properties:
+
x " (y v z} =
and
X V
(y /\
Z)
(x
y) v (.t
= (X
y)
z)
A ( X V Z)
It is useful to know that (5) and (6) are equivalent to one another..
if (5) holds, then
(xvy)t-.(xvz)
(5)
(6)
For
[(xvy)Ax}v[(xvy) AZ]
=xv [(xvy) AZ]
~ xv [(x A z) v (y A z)]
= [X V (X A Z)) V (y AZ)
= xv (y f\ z) J
and a similar computation shows that (6) implies (5)~ We shall say that a
lattice is complemented if it contains distinct elements 0 and 1 such that
(7)
for every x (these elements are clearly unique when they exist), and U
each element x has a complement x' v.ith the property that
X A X
=0
and
xv x' : : : :;: 1.
(8)
3"'6
Append ices
so x < x'.
so x ;;:: x'.
(9)
O'
0 and 0 v 1
and
l'
and
(X V
11 so we have
0.
(10)
The identities
(x
I\
y)'
X'
y'
(11}
are also true in every Boolean algebra. We shall prove the first part of
(11). Our principal tool will be the fact that
(12)
y1
x) v (y 1
x I)
::;::;;
0 v (y' /\
1
)
y'
x;'
and therefore y' < x'. We now turn to the proof of (x A y) 1 = x' v y' .
Our first step is to observe that if x ~ < z and y' :::; z, so that z' 5 x and
2/ < y, then z' ~ x A y or (x A y)' ::;; z. This sho~,..s that (x" y) 1 is less
than or equal to any upper bound of x' and y'~ so (x A y)' < x' v y'. We
conclude the proof by showing that x' v y 1 < (x A y)'. Thls follows at
once from the rcla tions x' ::;; (x /\ y )1 and y 1 < (x A y) ', ",..hlch, since they
are equivalent to x A y < x and x 1-. y < Y~ are evidently true. The
second part of (11) can be proved in essentially the same way.
One of the basic facts about Boolean algebras is that these systems
can be identified with a certain class of rings. Thls enables us to study
Boolean algebras by means of powerful techniques which are already
available in the general theory of ringsr
A Boolean ring is a ring with identity in which every element is
idempotent (i.e. 1 x 1 = x for every x)~ It is a surprising fact that multiplication in a Boolean ring is automatically commutative and that
x + x = 0 (or equivalently, x = -x) for every x. The proof of these
statements rests on the relation
x
+y =
(x
+ y) =
1
(x
+ y)(x + y)
zt
= x
+ xy + yx + y
+ xy + yx + yj
347
+y
(X A
y1 )
V (X
y)
and
= x" y.
xy
(13)
=
that
+0
(x
+ x~
(x
1) v 0
=xAl =x.,
and that
x
+x =
(x A x')
v (x' " x) == 0 v 0 ~ O~
+ y)
= [ (X
~ [(X
= [(X
= (X
y ')
V Y)
/\
V (X
/\
A X] V [ ( X V
X) V ( y A
JJ)
r=
y)
(X,
.r) j
y')+
(X1
y)
1f) /\ y 1]
{ { X t A y ~)
V (
(.T
y
It is con-
y')
y1) j
+ (y + z}
x, and that 1 is an identity~ In view of the fact that multiplication is obviously commutative~ a11 that remains is to verify that
x(y + z) = xy
xzJ and this is a consequence of th_e following computations:
that x 2
;;::
X (y
+ Z)
X A
(y
+ Z)
348
Appendices
and
xy
+ xz =
(x A y)
r(x J\ y)
+ (x A z)
z) 1 l v [Cr A y) t A ( x A z) 1
[Cr A y) A ( x, v z') ] v [(x' v y') A ( x A z) J
~ ( X I\ y A X ') V ( X A y A Z ') V ( X t A X /\ Z) V ( y;
= (x A y A z') v (x A y' ;., z)
A ( x J\
X !\
Z)
and
y = xy
xv y = x
+ y + xy
(14)
convert it into a Boolean algebra . cl. If \Ve keep in mind the fact that
multiplication in R is commutative and that for every x \\ e have x 2 = x
and :r. == ~ :r, then (1) and (2) are evident. Property (3) follO"\VS from
the a.ssociativity of multiplication and the computations
7
:r v (y v
z)
x v (y + z + y z)
= x + y + z + y.z + xy
(xv y) v z = (x
~ x
and
(x" y) v x
true~
+ xz + xyz
+ y + xy) v z
+ y + xy + z + xz + yz + xyz+
for
xy v x = xy
+ x + xyx
+ xy + xy =
and
(xv y) Ax
+ y + xy)
(x
Ax = x 2
(y v z)
=p
x(y
+ z + yz)
yx
+ xyx
+ xy + xy
x.
+ xz + xyz
xy + xz + xyxz
= xy
=
==
xz
(xAy)v(XAZ)r
xy v
It is easy to see that the elements 0 and 1 have the property that 0 <
x < I for every x and that x' = 1 + x acts as a complement for x, so A is
1
a Boolean algebra4
It is worth noting that the two processes we have described are
inverses of one another. Suppose we start \vith a Boolean algebra A and
use (13) to make it into a Boolean ring R:
+y
= (x
y') v (x'
y)
and
xy
==
x " y.
It is apparent that x
xv y
and
xl..y=xy
"A y = xy =
===
349
A:
+ y + xy.
y; and since
~ (1 A x
1
)
v (0
x)
= x'vO == x',
we also have
xv y = x + y + xy = 1 +
(1
x) (1
= 1 x'y'
= (x' " y')'
~xv y.
+ y)
This shows that the operations in A coincide with those in A. Conversely, if we start '\i\dth a Doo1ean ring ll, make it into a Boolean algebra
.11, and then convert A back into a Boolean ring ll, then the operations in
R coincide with those in R. "\Ve leave the verification of this to the
reader .
The ideas developed above show that Boolean algebras are essen
tialJy identical with Boolean rings. ~rhe practical effect of this is a
considerable sa.ving of labor1 for it allows us to transpose our study of
Boolean algebras into the more familiar cont.ext of the theory of rings 1
where many standard tools-ideaJst homomorphisms~ etc.--Jie ready at
hand. We illustrate this principle by proving the basic representation
theorem for Boolean algebras in t\vo steps: first, \Ve prove the corresponding theorem for Boolean rings; and second t we t.ra.ns1ate this result
back into the language of Boolean algebras.
Before entering into the details, we give a brief descri pt.ion of the type
of represent.at.ion we are aiming a.t+ If X is a compact Hausdorff space~
then each of the sets 0 and X is both open and closed (or more briefly,
open-closed)~ and the class A of an such sets is a Doo1ean algebra of subsets
of X. If X is disconnected 1 then A contains a.t least three sets; and if X is
totally disconnected, then A may contain a great many sets) for, by
Theorem 33-C, it is an open base for the topology of X.. :i-~urthermorP~
\\~e kno'v that A becomes a Boolean ring of sets if addition and multiplication are defined by
A+ B
(An B 1)
(A;
B)
and
AB= An B.
350
Appendices
any Boolean ring which is a field necessarily equals {0 1 1} ~ 1,o see this, it
suffices to observe that if x is a n on-z r.ro e1em en t in such a ring~ th en
xx- 1
x 2x- 1 = .r(xx- 1 ) = x l = x.
+ 1) = x
2
+I
=x+I
+
+
I= {m+r(l+x):ms.MandreRl
+ r(l + x)
+ r(l + x)]x
= mx + r(x + x~)
~ mx + r(x + x)
= x2 =
[m
== mx
conirary to the fact that xis not in Af. This contradicts the maximality
property of M, and the proof is complete.
351
it:f(x)
0}
and if :f(x) = 1} is open-closed. This follows at once from the fact that
each is the inverse image of an open-closed set in H:., under the projection
of H* onto H~.
We now pass to the subspace II of H* whirh consists of .all homorn orphisms of R on to I 0, 1 J . It is cl ear that H is a totally disconnected
Hausdorff space. To prove that it is also compact, it suffices to show that
it is closed in H*} and thls v-.,.e do as f ollows4 A homomorphism of ll onto
{Otl J is of course a mapping fin H such that f(x
y) = f(x)
f(y)
and f(xy) = f(x)f(y) for all x and y a.nd such that /(1) = lr It is evident
from this that H is the intersection of the following three subsets of H*;
r\~i~R{f:f(X
f\:z;,JJtR
+ y) = f(x) + f(y)J,
l f :f(xy)
{f:j(l) =
and
f (x )j(y)} ~
(15)
(16)
114
{17)
We know from our remark in the preceding paragraph that (17) is closed;
and if we can show that the other two sets are a.I so closed, th en it will
follow at once that H is closed. We inspect (15). If x and y are any
given elements of R~ then it is easy to see that
{/ :f(x
+ y)
f(x)
+ f(y) l
and
{f:f(x) =
{/ =f(x) :=::
{f:f(x) ===
{j:f(x) =
0, f(y)
0, J(y)
1, f(y)
1, f(y)
== 0, and f(x
= 1, and j(x
= O, and f(x
= IJ and f(x
+ y)
+ y)
+ y)
0L
1L
= 1L
+ y) = O}.
Each of these sets, being itself the intersection of three closed setst is
closed, so {/ :f(x
y) == f(x) + /(y)} is closed, and consequently (15)
is also closed~ A similar argument shows that (16) is closed, so 11 is
closed and therefore compact.
352
Appendices
T(x)
tf~f
+ y) =
1 }.
lf:f(x + y) = 1}
iJ:f(x)
f(y) = 1}
{f :f(x) = l} + {/ :f(y) == I I
T.(x) + T(y)
l f :f(xy) = 1}
{/ :f(x)f(y) = 1}
tf :f(x) = 1} n i I :f(y) = 1}
T(x) T(y)4
=
=
T(xy) ;:;;
and
e Hand /{x)
;;
= {/ :/(1) =
T(l)
1}
H,
+ x)
T(l
== T(I)
+ T(x) = H + T(z)
= T(x)'
for every x in R.
Finally~ we show that T maps R onto R.
We begin by observing
that the topology of H is defined by means of basic open sets of the form
B = ~ f :f (xt) ==
Ei 7
i = I, ~ . ~ ~ n ~ ~
if we define Yi to be x1 or 1
B =
=
n:=-1 ff :f(x1)
r\L1 tf :f(yi}
=
-;::=
i1J
11
= n:._1 T(yi)
s = Vi-1 Bi
en;. . \ B/)
Stone~s
Theorem
353
= (rl~l 7'(zt)')'
+
+ +
R~
Y.le now conclude our theory by translating Stone's theorem into the
language of Boolean algebras.
Let A and A be Boolean algebras+ A mapping h of .ti into A* is
called an isornMph isrn (or a Boolean algebra isomorphism) if it is one-to-one
and has the following three properties: h(x A y) = h(x) A h(y) 1
h(x v y)
h(x) v h(y),
l!ibliograplt11
1. Achieser, X. Ir: "Vorlesungen tiber Apptoximationstheorie,n Akadcmie, Berlin~ l 953.
2. Alexandroff ~ P. ~ and 11. Ilopf ~ "Topologie ,' ~ Springer, Berlin, 193b.
3. Bers, I~~: l'Topology,'' 1ecture notes,. Xe,v "'Y~ork University Institute
of lvf athcmatical Sr.iP.nces~ Kc'\v York, 19fi7.
4. Bir kho.ff ~ G. : '-~Lattice Theory,'' American Ma the ma ti cal Society
Colloquium Pub1ica tions, vol. 25, N e\v r~ ork 1 1948.
5. Rirkhofi, G. 11., and 0. D. Ke Hogg: Invariant Points io Function
Spare, Trans. A Jner. kl ath. J...';or.~ 23 (1922), pp. 96-1 lfi~
6+ Conran t~ R.. ~ and H. Rob bins: 'i \\That Is l\.la thematics?" Oxford,
I~ondon and Ne'v 1. . ork, l941.
7. Dixmicr~ J.: "LP.s a.1gebres d~opcrateurs dans Fespace hilbertien
(A1gebrcs de von X eumann),'' GauthiP.r-\'.iHars, Paris, 19t57.
Sr Dunford, X., and J T ~ Sch"\vartz: ~~I "'inear Operators, Part I;
General Theory,'' I ntcrscience,. Ke\v \' ork, l 958+
9~ Fracnkel 1 A. A.: ''Abstra.ct Set Theory,~~ Korth-Holland 1 Amsterdamt
1953.
10. Fraenkel 1 A. A.~ and Y. Bar-IIiUel: i~Foundations of Set Theory,''
North-Holland, Amsterdam, 1958r
11~ Ga.1 1 I. S.: On Sequences of Operations in (~01nplete Vector Spaces,
A mer. Math. Monthly~ 60 ( 1953) 1 pp. 527-538+
12. GOdel, K.: What Is Cantor~s Continulllll Problcm~r Amer. J1 athr
Monthly 1 54 (194 7) ~ pp. 515-525.
13. Goffm~1n, C.: Preliminaries to Functional Analysis, in ustudies in
Mathematics," vol. 1 i l\.Iathematica] Association of America, 1962.
14. Goldberg~ R+ R.: 'lFourier Transforms/' Cambridge~ New York, 1961.
15. Hahn, H. : The Crisis in Intuition, in l'The World of :rvrathematics,' ~
Simon and Schuster1 New r~ork 1 1956.
16+ Halmos~ P.R.: uNaivc Set Theory/ 1 \Tan Nostrand, Princeton 1 N.J.,
T
1960.
: ".Finite-dimensional Vector Spaccs,'J ,ran N ostrandt Prince . .
ton~ K .J., 1958.
18.
: ul\leasure Theory/' Van Nostrand, Princeton, N .J., 1950 .
17.
. 355
356
Si bliogf a phy
727-733~
357
1959~
JndeK of S!fmbo!s
A= B, A ;:e B
ACB
ACB
AUB
AnB
A'
A-B
AAB
a= b (mod m)
la,bl, (a.~bl~ etct
No
A
Ai< A2
An
A
CB(N~N')
m(N)
/3(X}
of
e[a,b1
e(X,R)
360
Index of Symbols
Space of bounded continuous complex functions
on X~ 84, 106
Spaces of continuous functions on X which vanish
at infinity~ 165
Space of bounded continuous scalar-valued functions on X, 216
e(X,C)
e(X)
d(x,y)
d(x,A)
d(A)
D(A)
,.ii
det ({ai;])
Empty set, 5
J:x~
J-1: y __,., x
j(A)
J- 1 (B)
J~g
F~
fM
G
af:X--+ Z
Int (A)
.
i.r
==>, ~
r\..:Ai etc~
1
~cat
+co
inf A
Im.
I(F)
lim
lfJ.P
l,,
Xn
Index of Symbols
Lp
zn
i;l(I
lot:i, c, Co
L/~l
grr
m]n A, max A
M~
J.11
+N
~f
.J.\l
m < n, m :::; n
N*
N**
Jtl/ }.{
361
R.,;J
r(x)
p(x)
Ill!
'
s
S*
SJ_
[SJ
Sr(Xet)
S,{xo]
sup A
c-( T)
O'(X) 1 CTA(X)
362
Index of Symbols
T*
T
ll Tl!
{T}, [T]B
u
u,.A1.t etc.
x--+ y
X11 _..,, .X
x""
X1 X X'J,
[x]
!lxl1
Ux + MH
x(M)
~
x().)
A y,
(X1Y)
x e: A 1 x
x _l y
x .-....- y
x<y
x ~
Vy
tA
y (mod I)
x - y (mod M)
Universal set, 5
Union of a class of sets1 11
Mapping notation, 17
Convergent sequenceJ 50, 70-71, 132
One-point compactification of X, 163
Product of two sets1 23
Equivalence set associated with x, 27
Norm of x 1 54, 81, 212
N arm of coset x -t- ]{, 213
Function on marximal ideals, 318-319
Function on maximal idcalst 319
Rcsolvent of x, 309
Meet and join of x and y, 46
Inner product of x and y, 245
x is (is not) an element of A 1 5
x is orthogonal to y, 24 9
xis equivalent to yJ 27
Order relation for real numbers and partially
ordered sets~ 7J 43
Congruence modulo an ideal in a ring 1 186
Congruence modulo a subspace in s linear spscet
193
Adjoint of x, 324
811/Jjeet J11tler
Absolute value, on complex plane, 53
of a !unetionj 159
on real lin et 52
.A.chieser, N. I., 157
Adjoint~ of element in Banach ~a.JgebraJ
324
of operator, 263
Adjoint operation (involution), on ffi(H),
265
on Ba.na.ch 111 -aJgebnt,. 324
AtexandrotI, P ., 130n.
Alge bra, I 06) 208
B*~,
324
Banach, 302
Banach *-, 324
210
commuta.tivei 106
corn pl ex, l 06
C*-, 303
disc, 303
division,. 208
~roup, 303-305
hon1orr1orphism, 210
ideal jn, 209~ 313
\\ith i<lentity, 106
ison1orphism, 210
quotient aJJ?;ebra of, 209
radical, 314
r.egu1ar representation, 210
sr:1nj-.":Simplc~ 316
sub algebra of J l06i 208
tot a] matrix r 28'.l
von Neun1ann, 303
TV*-, 30:1.
Antisymmetry, 43
Arzela.)s theoremr 128
B *-algl'.!bra J 324
representation, 32.&-.326
Baire~s theoremt 7 4, 75n .
Banach n.lgcbra., 302
364
Sub;ect Index
C 111 -al~cbra~
303
oom1nutative 1 33Z...334
Canonical form problem for matrices~ 286
Cant.or~ G. i .31-4:3.i 49
C.an tor continuum hypothesis, 39
Cantor intersection theorem~ 73
C.a.n tor .set, 67
Cardina.1 nun1bf'r(s )~ 31
comparability t.neorem, 48
of con tinuurn, .39
finite, 32
Carte~i.un product (product u sets),
23--25
Cau-chy seqnoence, 71
Cauchy"s inequa.J ity, 88, 219
Cs.yleyts theorem, 181
Chain, 44
Charaetcristic cq u a tion, 288-289
Char acteristi e v a] u e1 2 7 8n~
Char act ieristie vector, 27 8n.
Choice, axiom of, 46
Class, 4
disjoint, 9
Closed baseT 112
generated by closed subbase,, 112
Cl oscd graph th core m ~ 238
Closed mapping~ 342
Closed rectangle . 101, 119
Closed set~ 65, 9.5
Closed eph eru Ou
Closed strips, 101
Closed ~u b base~ 112
Closed unit ~phere, 217~ 232
Closure,, 68, 96
Compact subspace, 111
Com pa.ct to po] ogic a1 space~ 1 l 0 ~ 111
j
Subject Index
Dim enaion, of a linear spaceJ 200
orthogonal t of a Hilbert spaee, 259-260
Disc algebra, 303
Disconnected epa.ce, 143
DiBconn ec tion of a. space~ 143
Discrete space~ 93
D isc.rete topology~ 93
Discrete two-point spac c) l 44
Disjoint linear suhsp.aces) 195
Disjoint sets 1 9
I) is tanc e, from point to set J 58
bet ween tvito point.BJ 51
Diatrib u tive 1a ws ~ for In tticcs;11 345
for sets;11 10
D ivi.slon alge hr a 1 208
Diisor of :zero, 183
topologies.I, 307
DixrnierJ J.) 303n.
Dunford~ N., ix, 226~ 232n.
Eigcnspac e, 278
Eigenvalue, 27 8
Eigenvector~ 278
Element(s)~ 3
com pa.re.bl c ~ 7 ~ 44
maJ[imal, 44
in ring, regu]ar~ 183J 314
singular1 183~ 314
Empty set, 5
rnetJ 123
Equicontinuous functions, 126
Equivalence rela.tionJ 27
Equivalence set~ 27
Euclidea.n plane~ 22~ 87-88
Eu cl jd ean space, in fini tc-dim en sion alt 90
n-dimensional, 24, 87 ~ 214
Everywhere dense seti 70, 96
Extended complex plane, 162
Extended real number system~ 56
Family_, 4
Field, 184
Fini tc intersection property) 47 ~ 112
Fir.st countable space, 1OOn.
Fixed point1 338
Fixed point space, 337~338
Fixed point theorem~ Brouwer is, 338
Schauder'ij, 338
FominJ S. V., 128, 215nT
F-0urier coefficients~ 256 257
1
365
CAI) 1.
310
s.)
Gali!eo~
240
33
Gelf&nd ma.pping, 319
Gclfa.n d rcpre.sen ta tion theorem, 322
Gclj~nd-N eumark th eoremst 318~ 325326
G Od-nl ~ K~ ~ 39n.
Goffman, C.;11 128
Goldberg~ R. R., 305-n.
Gram-Schmidt proceSB~ 258, 295
Graph of ma. pping~ 23
Group~ 172-173
Abelie.n (commutative)) 173, 179
additive) 179
366
Subiect Index
173n~
circ1eii 174
finiteii 173
full lineart 207
homomorphism) 180
identity in 1 173J 179
infinitet 173
inverses,, 173, 179
isomorph ie J 180
isomorphism,. 180
order:t 173
perm u ta ti on~ 178
regular represe nta. ti on, 181
subgroup or, 178
:symmetric~ 176
of symmetries of square;11 176-177
transformation~ 178, 181
(iro up alge bra 1 303--305
Hahn-Barra.ch
theorem~
2llJ 228
gen cral iz ed form~ 230-231
Hahn~ Mazurkie w ic z theorem J 343
llalmos, P. R~, 42n~, 46n~,. 172, 215n.
Ho. usd o :r II space~ 130
H cine-Borel theorem, 11 O~ 114
con verse t 115
gen er ali.zed 7 119
Her.in ite run ctiona~ 259
HewittJ E. ~ l21n~
Hilbert~ D~J 49
Hilbert cube,. 248
Hilbert space( s) ~ 245
s.mong com ptex Banach spacesjl 248
inner prod ue t ~ 245
orthogonal eomplementeJ 249
Join~ 46
O~
D., 338
Kolmogorov~ AT N.~ 12BJ 215n.
Kronecker delta 1 283
Kuratowski closure axiomsjl 98
45
Lebesgue, H.;p 49
Lebesgue covering 1emma., 122
Lebesgue numbert 122
Legendre polynomials, 259
Limit, in the mean, 257
of sequence~ 50, 70--7lf 132
Limit point~ 65~ 96
contrasted with 1im it 1 72
Lindel~r~ s theorem, 100
Linear space1 81 1 191
baeis for 1 197
dim cnsion f 200
isomorph ism, 200
linear comb in at ion in, 194
linear dependence in 1 196-1 Q7
linear in depend en ee in, 196--197
linear operations in, 191
linear subspace(s)J 81 1 193
diejoint, 195
sum ofJ 195
direct~ 195
normed, 54~ 81, 212
quotient space, 193--194
repre sen ta.tion, 201-202
Lineat tre.nsforma.tion(e), 203
bound for~ 220
bounded, 220
continuous~ 219, 220
idempote ntt 206
identity, 205
inverse, 205
negative 1 204
non-singular, 205
norm, 22Q--.221
null spa.ee, 207
nullity t 207-208
product, 204
range~ 207
rank~ 208
erot 204
36i
~1cCoy ~
N. H.~ 172
~IackP.y ~ (;. W ., 265nq 305n~
h1ac Lane~ S., 29
Mappjng(s)~ 16
bounded~ 58
closedJ 342
~omposition (multiplication) or, 19
continuoua.t 76, 93
jointly, 118
at point, 7 5-761 104
in single variable, 118
uniformly~ 77
contrasted with function~ 17
equality for, 20
Gelfand~ 319
graph~
23
identity, 20
into 1 17
inverse~
17
isometric t 79
one-to-one, 17
onto 7 17
open, 93
productJ 19
of sets, 1g.....19
(See also Function)
M.atriecs, ennonical form problem, 286
. __operations for 7 282-283
sirn il a.r, 286
Matrix~ conj uge. te transpose, 294
determinant, 287
diagonal, 287
identity, 283
as in dependent entity~ 28l 1 284
inverse~ 284
non-singular~ 284
of operator, 281
sca.1.a.r J 2 86
368
Subiecl Index
Ma tr i~ i triangular 1 295
zero:t 283
:tit1 ax i mal e1em en t 1 44
!\faxim..a.l ideal space~ 319
Maximum. 45
Maximum modulus theoremt 311
MeetJ 46
M etrie, 51
Metric spaee, 50, 51
comp]ete, 71
comp]etion~ 84--85
con tr action in 1 338
seq uen t.ial] y com pa.ct~ 121
SU hspace of, 56
tota.lly bounded, 123
M etri zabl c spa~c J H3
~[injmumt
45
Numbers~
algebraic J 43
cardinal~ 31
com parab ili ty th eorcm, 48
finite, 32
eomplex~ 23, 52-54
real 1 21
transcendental, 43
One-t0-0nc eorrespondenee, 18
One-point eompactific.ation, 163
Open base,. 99
generated by op~n subbase, 101
for point (at point)~ 96
Open-dosed set, 349
Open cover 1 II I
basic, 112
suhLasict ] J 2
su bcovcr of~ 111
Open ma ppingJ 03
Open mapping theorem~ 211 1 23&
Open rectangles, 10 l~ 119
Open set, 60t 91~ 92
basieJ 09
subbasie, 101
Open sphere, 59
Open strips 1 101
Open suhbMe~ 101
Operator( s) t 222
adjoint, 263
characteristic eq nation, 288--289
conjugate 1 241
detennina.n t, 288
eigr:nspac-n J 27 8
eigenvalueJ 278
eigenvector~ 278
imaginary pa.rt, 271
matrix, 281
norm al, 269
projection(s)J on Banach space, 237
on Hilbert space i 27 4
orthogonal, 276
ree.1 part, 271
reduced by subspace, 275
ring, 303
self-adjoint, 266
ord ering 1 268
posi tiire~ 268
ap-ectra1 resolution 1 280, 291
uniqueness~ 291-293
~pectr al theorem, 280t 290, 295--297
Subject Index
Operato.r(s) ~ epeetrum, 289~ 296
sq ua.r e roott 294
subspace invariant under, 275
unifa1ry, 272
Order r.e latio n ~ partial, 7, 43
on real 1ine1 7
tota.1 (or linear) J 7, 44
()rigin 1 54, 80, l 9 l
Orthogonal eomp1e.ment1 249
Orthogonal dimension, 259-2 60
Orthogon a] vectors J 24 9
Ort hon onn a1 basis, 293
Ort hon onn al set, 25 l
comp1ete1 255
369
Radical, 314
Re.al line~ 21
absolute va.Jue on 1 52
cxten ded, 56
Jeast up per bound property, 21 _, 45
usual ru et.r ic on, .52
llec tangle 1 closed, 10 l ~ 11 ~
open, 101J 119
Reflexivity, of normed linear space,, 232
of re]ation, 27 ~ 43
R e1 a tion (binary), 26
antisymmetric~ 43
circu] s..r, 31
equivalen ee,, 27
partial order, 7J 43
reflexive 1 2 7 1 43
symmetric, 27
transitive, 21, 43
triangular~ 31
Re1 a tive topology, 93
Representation,, of algebrat 210
of B *-algebra1 325-326
or B a.ns..ch el ge bra, 305
of Banach space, 234
of Boolean aJgebra, 353
or Bool can ring t 3 51
or commutative C*-olgebra 1 332--334
of commutative semi-siro p1 e Banach
alge br & 1 322
or grou P~ 181
of Hll b ert space~ 260
of linear space, 201~202
of ringi 188--100
Reso1v en t of an el em ent. ~ 309
Resolvent eq ua ti on, 309
Re.solvent set, 309
Rickart~ C+ E., b.: 1 326n.
Riemann, B., 49
Riemann sphere, 163
Riesz~ F ., 49~ 226, 296n.
370
Subiect Index
Russe1l 1 R., 7
Russcl1 '.s paradox~ 6
246
a bnorma.l, 6
Boolean a. lge bra j 12,, 344
boundary J 68, 97
boundary point, 68~ 97
bounded~ 58
Cantor, 67
(~art -r.:sia n pro<l u ct'" 21
cJosPd ~ 65, 95
fini tu intersection, 12
finite union 1 12
of first category1 7 5n.
inclusion~ 6
index~ 11
infinite, 5
interior of, 63, D7
interior pointt 63i 97
intersection,, 9, 11
1in early ordered, 44
neighborhood or' 96
normal~ 6
nowberP densei 7 4, 99
n UII1 eric al equiv al en cc, 28, 32
open 1 60'" 91, 92
ba.sic 1 99
SU bha:sic' 10 I
open-('Josed 1 349
orthonormal, 2 51
eo1n pl ete 7 2 55
partially ordered, 43-4 4
partition 26
perfect, 99
product, 23--25
proper au bset, 6
pro per su perset 7 6
ring~ 14
of aeoon d category~ 75-n.
subsctJ 5
i-
;au per-set,, 5
Subiect Index
Set (s) ~ universal 1 5, 7-8
Set mappings, 18-19
Sierpinski, W . 1 42n+, 46n.
Similarity for ma trice.s~ 286
Smirnov~ Y. M., 139n.
Space, contrasted with set, 5n.
E:uclidean~
161
extended~
371
homcomorphici 94
locally connected, 151
mctrizable;1: 93
normal 133
open base 1 99
open :!iubb!:u3c1 101
j
Pea.not 342n..
prodn~t, 117
166--167
rerul 160
Strips~ 101
Strong topology, 232
re]ative, 93
strong 1 on normed linear space, 232
strongest~
l 04
usual, 92
weak~ generated
Supremum,. 45
weakest~ l 04
Symmetryt 27, 51
372
Subiect Index
W *-algebra~ 303
allrnan, H t l.50
7
"'