Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Michael Potter
Later in this entry the words 'set' and 'class' will take on precise meanings, so to begin with we shall use the word 'collection' quite broadly to mean anything whose identity is solely a matter of what its members are. Which collections exist? Two extreme positions are initially appealing. The first is to say that all do. Unfortunately this is inconsistent because of Russell's paradox (see PARADOXES OF SET AND PROPERTY). The second is to say that none do hut to talk as if they do when such talk can be shown to he eliminable and therefore harmless. This is consistent but far too weak to be of much use. We need a n intermediate theory. Various theories of collections have been proposed since the 1900s. What they all share is the axiom of extensionality, which asserts that if x and y are collections then
The fact that they share this is just a matter of definition: objects which do not satisfy extensionality are not collections. Beyond extensionality, however, theories differ. The most popular among mathematicians is Zermelo-Fraenkel set theory or ZF (see $ 3). A common alternative is von Neumann-Bernays-Gdel class theory or NBG (see $ 41, which allows for the same sets as ZF but also has proper classes, i.e. collections (such as the class of all sets or the class of all ordinals) whose members are sets but which are not themselves sets. Two general principles have been used to motivate the axioms of ZF and its relatives. The f i s t is the iterative conception, according to which sets occur cumulatively in layers, each layer containing all the members and subsets of all previous layers. The second is the doctrine of limitation of size, according to which the 'paradoxical sets' (i.e. the proper classes of NBG) fail to be sets because they are in some sense too big. Neither principle is altogether satisfactory as a justification for the whole of ZF: for example, the replacement schema (see $3)is motivated only by limitation of size; and foundation (see $3)is motivated only by the iterative conception. Among the other systems of set theory to have been proposed the one that has received widespread attention is Quine's N F (see $ 7), which seeks to avoid paradox by means of a syntactic restriction but which has not been provided with an intuitive justification on the basis of any conception of set. It is known that if N F is consistent then ZF is consistent, but the converse result has still not been proved.
1 Virtual collections
Sometimes when we speak of collections it is just a manner of speaking and no more: if we say that Sarah belongs to the collection of all linguists, we might just as well say straight away that she is a linguist; nothing seems to be lost in the translation. This way of regarding collections is quite old: it is how Peano conceived of them in the 1880s. If we already have before us some formal theory which does not have collectiontalk in it, then we can add it straightforwardly. If @(z)is a formula of the theory in question, we introduce a new term {z : a($)}(called a collection term) into the language. Collection terms are suhject to the following two rules: (1) If @(z)is a formula then x E
(2) If @ ( x ) and Q ( z )are both formulae of the theory then { x an abbreviation for V x ( @ ( z ) t ,Q ( z ) ) .
:@ ( x ) }
={ x :Q ( z ) } is
We permit the use of collection terms only in contexts from which these two rules allow them to he eliminated. Collections used harmlessly in this way have been called virtual by Quine and we shall follow this terminology here. Let us introduce the convention that lower case Greek letters stand schematically for collection terms. So the second re-writing rule allows us to assert, for instance, the following extensionality schema:
By this schematic device we generate a considerable amount of the naive theory of collections. We can, for instance, introduce the usual Boolean operations as follows:
And so on. The theory thus generated is elegant enough in itself, but it is difficult to be satisfied with it for long. (We have to remember, for instance, that in this theory } are merely notational devices and do not, despite the appearexpressions such as {x,y ance to the contrary that their 'explicit' definitions might suggest, have any meaning on their own.) Collection terms stand in sentences in some of the places where the grammar leads us to expect proper names; schematic letters stand where we might expect variables. There is a drive to objectify a t work here, to suppose that collection terms can stand in all the places where names can occur. It is of course this step which, if carelessly done, leads straight to contradiction. How to do it without contradiction will be the subject of the rest of this entry.
2 Real collections
In the virtual theory just discussed the relation of membership is defined contextually. In the real theory with collections regarded as objects in their own right membership is taken by almost all authors as primitive. (Lewis (1991) is a rare exception.) Formally, this means that the language in which the theory is couched contains, in addition to the usual logical constants, a binary predicate symbol 'E'. I t will be convenient to introduce ) . . . and (3% E a) .. . as a few notational conventions straight away. We write (Vs E a abbreviations for Vx(x E a + . . . ) and 3x(x E a A . . . ) respectively. Upper case Greek letters such as @ and Q will stand for formulae. The notation @ ( x ) stands for a formula in which the variable zoccurs free: it does not indicate that there are no other variables occurring free in the formula. If @ is a formula, we write @(a) for the relatiuization of @ to a,i.e. the formula obtained from @ by replacing each quantifier Vx or 3x by (Vx E a) or (3x E a) as the case may be. We write z C y as a n abbreviation for Vz(z E z + z E y ) . We introduce ordered pairs hy means of the device
In the virtual theory whether there are any objects which are not collections is scarcely an issue: since there are not really any collections, if there is nothing else then there is nothing a t all; the theory therefore collapses into vacuity. In any non-trivial real theory things are different: there are plenty of collections to talk ahout whether or not there are any non-collections. Since this issue arises uniformly for all the real theories
we shall be discussing, it is best to address it now. The objects of a theory which are not collections are called atoms, individuals or (even in non-German texts) Urelemente. If we are to allow for their presence, we need a primitive unary predicate atom(x) to represent that z is an atom. We then introduce as an axiom the following: (2.1) Axiom of extensionality (applied form). VxVy(atom(z) Vatom(y) VVZ(ZE z tt Z E v) + x = y ) . This asserts no more than that objects which are not atoms are collections, since (as we have already observed) i t is part of what we mean by a collection that it is completely determined by its members. We shall also, if the atoms are to play any role in the theory a t all, need the following: Collection axiom. 3:/Vz(x E y
tt
atom(x)).
In words: the atoms form a collection. If, on the other hand it is pure collections that interest us, then the beginnings of the theory are a little more concise. We do not need the predicate atom(%)or the collection axiom, and the axiom of extensionality takes the following simpler form: Axiom of extensionality (pure form). VxVy(Vr(z E x
tt
z E y)
x = y)
Throughout what follows we shall state theories in their pure form and adopt the convention that if T is a pure theory (i.e. one which includes the pure form of the axiom of extensionality) then TU is the corresponding applied theory (i.e. the one obtained from T by deleting the pure form of extensionality and substituting the applied form together with the collection axiom). In the virtual theory we had collection terms defined contextually, provided that they occurred in certain positions (on the right of the 'E' symbol or on either side of the '=' symbol). Now that we are regarding collections as real we cannot be so indiscriminate. Let us say that a formula @(z) is collectiuizing in x for a theory T if
Suppose now that T includes extensionality among its axioms. If @(z) is collectivizing in z for T then by extensionality
We thereby retrieve the virtual theory of l but limited to collectivizing formulae. What remains is to specie which formulae are collectivizing; it is this that is dealt with differently in the various competing theories.
3 Zermelo-Fraenkelset theory
We start by listing the axioms of a fragment of Zermelo-Fraenkel set theory called Z-: Axiom of extensionality. As before. Axiom schema of separation. If @(x) is a formula in which y does not occur free, then this is a n axiom:
E z A y E z)
z E 2).
+y
U {z} E x).
Axiom schema of replacement. If @(x,y) is a formula in which z does not occur free then this is a n axiom:
Va((Vz E a)3!y@(z,y)
To get Z F C we add:
E x).
Axiom schema of selection and union. If @(x,y) is a formula in which a and b do not occur free, then this is an axiom:
V@aVz(@(z,y)
+
x E a)
The history of these axioms for set theory is as follows. In 1908 Zermelo gave the first published axiomatization of set theory. His axioms were those of Z C U - (i.e. the applied form of Z- plus the axiom of choice) except that the axiom of infinity was replace by Zermelo's form as given above. Zermelo's choice of axioms seems to have been driven largely by his desire to ground a proof of the well-ordering theorem (see AXIOM OF CHOICE) which he published in the same year. Although Zermelo's system is strong enough for many mathematical purposes, it is weaker than Z C U - in one important respect: it is impossible to develop a satisfactory theory of ordinal numbers (see SET THEORY) in Zermelo's system. During the 1920s this problem was solved in two ways: ingenious methods were found, principally by Kuratowski, of achieving without ordinals what is most naturally done with them; and Zermelo's system was strengthened, principally by Fraenkel and Skolem, so as to permit the theory of ordinals to be embedded in it.
Axiom of extensionality. As before. Axiom of classes. 3xVy((3z y E z ) + y E x ) Empty set axiom. 3z(Vy y $ z A 3 z x E z). Pairing axiom. (Vx E V ) ( V yE V ) ( 3 z E V ) V u ( uE z
Axiom schema of class existence. If is any formula with free variables u l , v z , . . . .21, then this is a n axiom:
u = x V u = 9).
ti
3u(u E u A u E x ) ) . u 2 x).
Axiom of infinity. ( 3 x E V ) ( 0 E x A (Vu E x)(Vu E x ) ( u U {?I}E z ) ) Axiom of replacement. (Vx E V ) V r ( f u n c ( r )+ ( 3 y E V ) V u ( u E y Axiom of foundation. As before.
Most of these axioms are unexciting transcriptions of the axioms of Z F . However, there is one striking difference: the axiom scheme of replacement has turned into a single axiom. Even more surprisingly, the axiom schema of class existence can also be replaced by a finite list of axioms, such as the following:
tt
( 3 E x)(u,u) E
VaVb3cVx(z E c tt ( x E a A s E b))
y) , E a)). Va3b(Vx E V ) ( x E b tt ( 3 y E V ) ( ( z
So NBG is finitely axiomatizable. This is quite striking since by contrast Z F cannot be finitely axiomatized (if it is consistent). And yet NBG and Z F are of equal strength, in the sense that if @ is any formula in the language of set theory then
Z F i-
Moreover, NBG is consistent if and only if Z F is consistent. Once we have taken the step of regarding classes as objects, it is puzzling why we should constrain the class existence schema by insisting that the formula defining a class can mention only sets, not proper classes. So if we are serious about the reification of proper classes, we should replace the class existence schema with the following:
Morse-Kelley schema. If @ ( z )is any formula in which y does not occur free then this is a n axiom:
The system thus obtained is known as Morse-Kelley class theory and denoted MK. It is (if consistent) a genuine extension of NBG, even with respect to sets. In other words, ~ NBG 1 Y dV). However, the diMiculty there are formulae such that MK k ~ ( but now is to know why we should stop: if classes are sufficiently real to occur legitimately in the definitions of sets, there seems no good reason why they should not he members of still more collections. As long as we refrain from calling the new collections thus countenanced 'sets' or 'classes', there is no fresh danger of inconsistency. Theories arranged in this way are not popular, though, despite their practical convenience.
5 Other axiomatizations
So far we have said little about the motivation for the axioms. Two principles have guided traditional accounts of this question: one is the iterative conception; the other is the doctrine of limitation of size. However, it is not easy to justify all the axioms of Z F on the basis of either principle on its own. Other axiomatizations have been given which are closer in spirit to these principles. Here we shall describe two. The first is Scott's way of axiomatizing the iterative conception of set, according to which sets are divided up into levels. The set of all levels belonging to a given level is called its history. The formal details are as follows. First define
acc(a) = { z : ( 3 E a)(zE bV z
C b)]
if this set exists. (In words, the accumulation of a set is the set of all the elements and subsets of its members.) Next define hist,ory(a)to be an abbreviation for
(Vb E a ) b = acc(b n a ) .
Then define level(b) to be an abbreviation for
Now adopt the convention that the variables V,V ' ,V" are restricted to range over levels, so that for instance W . . . is an abbreviation for W ( l e v e l ( V ) t .. . ). Then the axioms of Z' are as follows:
Axiom of extensionality. As before. Axiom schema of separation. As before Axiom schema of reflection. If @(z) is any formula then this is a n axiom:
Then Z' is equivalent to Z and ZF' is equivalent to ZF. The second theory we shall describe is due to Ackermann. It takes V as a primitive constant. The axioms of the system A are as follows:
Axiom of extensionality. As before. Axiom schema of separation. As before Axiom of transitivity. VxVy(x E y A y E V Axiom of subsets. VxVy(x C y A y E V
+s
i
x E V).
V).
7~1,712,. ..
Aekermam's schema. If is a formula in which the free variables are and in which V does not occur, then this is a n axiom:
, un
The motivation for A is not that of the iterative conception; rather is it based on the idea that anything formed solely from sets is itself a set. The system A* is something of a hybrid: it grafts onto A the following axiom, which is derived from the iterative conception and cannot be justified by the motivating idea of A:
t then A k dV);
a.
6 Anti-foundation axioms
We have already mentioned that not all the axioms of Z F can be motivated by any one account of the notion of sethood. In particular, the axiom of foundation cannot be sustained solely on the basis of the doctrine of limitation of size. But without it where do sets get their individuation? To tell whether two sets are equal we compare their members; if necessary, we compare their members; and so on. In the presence of the axiom of foundation, this process cannot go on indefinitely; in its absence, it can. Now if the pattern of membership exhibited by the two sets is the same, then it seems reasonable to say that they are equal: they have done all that can be expected of them, as sets, to be regarded as identical. We are therefore led to the following principle:
Axiom of extensionality (strong form). If two sets have isomorphic -graphs, then they are equal.
(The -graph of a set is the membership relation restricted to the set, its members, their members, etc.) As we have already noted, this assertion is provable in ZF, but in ZF- (ZF without foundation) we need to add it as an axiom. It ensures, for instance, l such that Cl = {Cl} (since any two such sets would have that there is a t most one set i to have isomorphic -graphs). However, it does not guarantee that there is such a set Cl. Aczel and others have studied the consequences of adding to ZF- not only the strong extensionality axiom just mentioned (which is essentially limitative) but also a permissive axiom generating non-well-founded sets (which can be thought of as limit points in much the same way that irrational numbers are limits of rational numbers). The most natural of the axioms that have been studied is the following:
7 Quine's systems
Another, more radical strand in attempts to resolve the set-theoretic paradoxes is represented by the various forms of the theory of types (see THEORY OF TYPES). We cannot, in any of the systems we have been describing so far, collect together all the objects to form another object, but we can certainly quantify over them; in the theory of types we cannot. Objects are stratified into types, which are not cumulative but otherwise resemble the levels of Scott's theory; variables are labelled with superscripts to indicate the type they range over. The formula z m E yn is taken to be well-formed only if n = m + 1. Now type theory is not, according to the strict definition adopted here, a form of set theory, because it does not have the axiom of extensionality. What it has instead is a schema of axioms, one for each type:
This may seem a pedantic cavil, particularly if the device of typical ambiguity is introduced, whereby formulae are written with superscripts omitted and taken to stand schematically for any way of adding superscripts which produces a grammatical result. Thus the extensionality schema above is usually represented
Nevertheless, we maintain, this is not the axiom of extensionality. Typical ambiguity is not true generality. To see the difference consider the formula 3yVz z E y : taken as typically ambiguous this stands schematically for such formulae as 3yn+lVzn z n E gn+l and is therefore true; taken as a formula in its own right it is false. The restrictions on what can be said in the theory of types have struck many writers as too severe. However, the means by which the theory avoids paradox are as secure as those of iterative set theories. Quine has proposed a system which uses type theory's method of paradox avoidance but abandons its grammatical restrictions. Specifically, he regards formulae as genuine, not typically ambiguous, but calls a formula stratified if it is possible to decorate it with superscripts so as to make it well-formed according to the theory of types. The system, which is known as N F because it was proposed in an article called Wew foundations for mathematical logic', has the following axioms:
Axiom of extensionality. As before. Axiom schema of stratified comprehension. If @ ( z ) is any stratified formula in which y does not occur free then this is a n axiom:
N F is very different from ZF. For one thing N F is finitely axiomatizable. Moreover, since the formula z = z is stratified, it follows a t once that there is a set of all sets: if
we denote it V then V E V. The axiom of choice is provably false in NF. When we come to arithmetic, it is possible in a natural way to define zero and the successor operation in NF, and then to define the set of natural numbers to be the intersection of all sets containing zero and closed under successors. This works well up to a point, but gives us mathematical induction only for stratified formulae. Unfortunately some useful formulae are not stratified. For example, we cannot prove in NF, and are therefore forced to add as an axiom, the following:
Axiom of extensionality. As before. Axiom schema of class existence. If @(x) is a formula in which y does not occurfree then this is a n axiom:
Axiom schema of set existence. If a($,vl, u z , ... , v,) is a stratified formula in which no variables other than those listed occurfree, then this is a n axiom:
In contrast to NF, ML is not finitely axiomatizable if it is consistent. If is any formula in which V does not occur then N F t if and only if ML t Thus ML is consistent if and only N F is consistent. However it is still not known whether ML and N F are consistent if ZF is consistent. If we define the natural numbers in super& cially the same way as in NF, we get mathematical induction even for non-stratifiable formulae, obviating the need for an axiom of counting. However, this time we cannot prove that the class of natural numbers is a set (if ML is consistent). So now we have to add this as an axiom. Thus neither N F nor ML suffices without supplementation as a basis for mathematics.
Fraenkel, A.A., Bar-Hillel, Y., and Levy, A. (1973) Foundations of set theory, 2nd edition, Amsterdam: North-Holland. (Detailed discussions of most of the systems of set theory mentioned here and a wealth of historical information and references.) G d e l , K. (1940) The consistency of the continuum hypothesis, Princeton, N.J.: Princeton University Press. (The definitive formulation of NBG.) Hallet, M. (1984) Cantorian set theory and limitation of size, Oxford: Clarendon Press. (A detailed study of the early history of set theory.) van Heijenoort, J. (ed.) (1967) From Frege to Godel: A source book in mathematical logic, 1879-1931, Cambridge, Mass.: Harvard University Press. (It is possible to trace the historical development of ZF and NBG through the papers reprinted here.) Kelley, J.L. (1955) General topology, Princeton, N.J.: Van Nostrand. (The appendix contains the first published version of MK.) * Kuratowski, K. (1921) 'Sur la notion d'ordre dans la theorie des ensembles', Fundam. Math. 2: 161-71. (Mentioned in $3.) * Lewis, D. (1991) Parts of classes, Oxford: Basil Blackwell. Morse, A. (1965)A theory of sets, New York: Academic Press. (A treatment of the theory of classes based on a version of MK.) von Neumann, J. (1925) 'Eine axiomatisierung der Mengenlehre', J. reine angew. Math. 154: 21940. English translation in van Heijenoort (1967). (The first formulation of an axiomatization similar to what is now called NBG.) Potter, M.D. (1990) Sets: An introduction, Oxford: Oxford University Press. (An account of set theory based on a Scott-type axiomatization and intended for the mathematically minded.) * Quine, W.V.O. (1937) 'New foundation for mathematical logic', Amer. Math. Monthly 44: 70-80. (Principally of historical interest. For up-to-date information about N F consult Forster.) -(1969) Set theory and its logic, revised edition, Cambridge, Mass.: Harvard University Press. (Rather idiosyncratic but extremely clear and well written.) * -(1951) Mathematical logic, revised edition, Cambridge, Mass.: Harvard University Press. (Very technical, but includes a wealth of information.) Scott, D. (1974) 'Axiomatizing set theory' in Axiomatic set theory, Proc. Symp. Pure Math. 13, Part 11, pp.207-14, Providence, R.I.: Am. Math. Soc. (The primary source for Scott's axiomatization of ZF' described in $5.) Skolem, T. (1922) 'Einige Bemerkungen zur axiomatischen Begrundung der Mengenlehre', Wiss. Vortrage gehalten zuf dem 5 Kongress der Skandinau. Mathematiken in Helsingfors, pp.217-32. English translation in van Heijenoort (1967). (The first precise formulation of the axiom schema of replacement and of Zermelo's notion of 'definite property'.) * Zermelo, E. (1908) Untersuchungen iiber die Grundlagen der Mengenlehre 1', Math. Ann., 65: 261-81. English translation in van Heijenoort (1967). (The first published axiomatization of set theory, referred to in $3.)