The Narrow Road to the Interior
A Mathematical Journey
Leland McInnes
Current as of July 2, 2007 http://jedidiah.stu.gen.nz/wp/
Contents

1 First Steps
  1.1 On Abstraction
  1.2 The Slow Road
  1.3 A Fraction of Algebra
2 A Fork in the Road
  2.1 The Paradoxes of the Continuum, Part I
  2.2 Shifting Patterns
  2.3 Paradoxes of the Continuum, Part II
  2.4 Permutations and Applications
  2.5 A Transfinite Landscape
  2.6 Grouping Symmetries
1 First Steps
The Narrow Road draws its title from Oku no Hosomichi (The Narrow Road to the Interior), the famous travel diary of Matsuo Basho as he journeyed into northern Japan. My aim is to follow a similar wandering journey, but travelling instead into the abstract highlands of pure mathematics, pausing to admire the beauty and sights along the way, much as Basho did. That means we have a long way to travel: from the basics of abstract or pure mathematics, through topology, manifolds, group theory and abstract algebra, category theory, and more. There may well be some detours along the way too. It is going to take a long time to get to where we are going, but along the way we'll see plenty of things that make the trip worthwhile. Indeed, as is so often the case, the journey means more than the destination.
1.1 On Abstraction
Let's begin with a short practical experiment. Pick up a pen, or whatever similar sized object is handy, hold it a short distance above the ground, and drop it. The result, that the pen falls to the ground, is not a surprising one. The point of the experiment was not to note the result, however, but rather to note our lack of surprise at it. We expect the pen to fall to the ground; our expectation is based not on knowledge of the future, however, but on abstraction from past experience. Chambers Dictionary defines "abstract", the verb, to mean "to generalize about something from particular instances", and it is precisely via this action that we come to expect the pen to fall to the ground. By synthesis of many previous instances of objects falling when we drop them, we have generalized the rule that things will always fall when we drop them.¹

We make this abstraction so instinctively, and take it so completely for granted, that it is worth dwelling on it for a moment so we can see how remarkable it actually is. The circumstances surrounding each and every instance of you observing an object falling to the ground are unique. Were it not for our brains' natural tendency to try to link together our experiences into some kind of narrative, we would be left contemplating each dropped object as an entirely distinct instance, and be in no position to have any expectation as to what will happen each time: it would be an entirely new case. In our minds we have, rather than a vast array of disjoint and distinct instances, a single principle that knits together the common elements. This allows us to generalise to any new circumstances that share the same common elements as those of past experiences. This is something we do unconsciously and automatically: our brains hunt for patterns in the world and try to generalise those patterns in the expectation that they will continue.
Indeed, almost all our expectations are a result of such inductive knowledge and abstraction; abstraction is fundamental to our experience.

Of course, not all abstractions are correct. Our minds are constantly on the hunt for possible patterns, and don't always pick out valid ones. The classic example is the Christmas goose, who, every day of the year, has found that the arrival of the farmer results in the goose getting fed, until Christmas day, when the abstracted rule that "farmer implies food" meets a painful end. Even if we find abstractions that are ostensibly correct, that doesn't mean they're ideal. A perfectly valid abstraction of dropping objects, for instance, would be the rule that a dropped object always accelerates away from your hand. Certainly that is true, but it leaves out what might be considered an important common property of things being dropped: the direction in which the dropped object accelerates. Alternatively we could note that different objects, say a feather compared to a stone, behave very differently when dropped, and arrive at a vast array of rules, one for each different kind of object. This leads to the dilemma of finding the most effective or efficient abstraction: the abstraction that most consistently produces the results you want with the least effort.

¹ Note that in practice some expectations have been, to some extent, pre-wired into our brains, and this particular discussion is simply an illustrative example of the general concept of abstraction.

That is where science comes in: it is a systematic effort to refine our abstracted rules and principles, and continually check them for consistency. We may, if we like, think of the different sciences as arising from the "results we want" clause of our definition of effective abstraction. A biologist tends to work with different abstractions (generally on very different scales) than a physicist because the sorts of results they are interested in determining are rather different.

This is all very interesting, but you are probably starting to wonder what any of it has to do with mathematics. The answer is that mathematics relies on precisely this sort of abstraction that is so integral to our experience of the world. Mathematics simply attempts to take the abstraction as far as it possibly can.

Part of what makes mathematics difficult is that it tends to pile abstraction upon abstraction. That is to say, after developing a particular abstraction it is common for mathematics to then study that abstraction and, upon finding common properties when dealing with that abstraction, generalise that commonality into a new abstraction; mathematics develops abstractions not only from a synthesis of experiences of the external world, but also via synthesis of properties of existing abstractions. This layering means that unless you've gotten a good grasp of the preceding level of abstraction, the current one can be extremely hard to follow.
Once you've wandered off the path, so to speak, it can be difficult to find your way back. The other problem that people tend to face when learning mathematics is that as you pile up abstractions and climb higher, and hence further from everyday experience, intuition becomes less and less helpful, and an increasing degree of pedantry is required. To give an example of what I mean by this, let's take a detailed look at a mathematical abstraction that almost everyone takes for granted: numbers.

Natural numbers (also known as counting numbers) are one of those remarkable abstractions, like objects falling to the ground, that we take for granted. Natural numbers are, however, a very abstract concept: they simply don't exist in the external world outside your own head. If you think otherwise, I challenge you to show me where the number 3 exists. You can point to some collection of 3 things, but that is only ever a particular
instance that possesses the property common to all the particular instances from which the number 3 is generalised. Because we take natural numbers for granted we tend to make assumptions about how they work without thinking through the details. For instance we all know that 1 + 1 = 2, but does it? Consider a raindrop running down a windowpane. Another raindrop can run down to meet it and the separate raindrops will merge into a single raindrop. One raindrop, plus another raindrop, results in one raindrop: 1 + 1 = 1. The correct response to this challenge to common sense is to say "but that's not what I mean by 1 + 1 = 2" and go on to explain why this particular example doesn't qualify. This, however, raises the question of what exactly we do mean when we say that 1 + 1 = 2. To properly specify what we mean, and rule out examples like putting 1 rabbit plus another rabbit in a box and (eventually) ending up with more than 2 rabbits, is rather harder than you might think, and requires a lot of pedantry. I'll quote the always lucid Bertrand Russell² to explain exactly what we mean when we say that 1 + 1 = 2:
Omitting some niceties, the proposition "1 + 1 = 2" can be interpreted as follows. We shall say that φ is a unit property if it has the two following properties:

  1. there is an object a having the property φ;
  2. whatever property f may be, and whatever object x may be, if a has property f and x does not, then x does not have the property φ.

We shall say that χ is a dual property if there is an object c such that there is an object d such that:

  1. there is a property F belonging to c but not to d;
  2. c has the property χ and d has the property χ;
  3. whatever properties f and g may be, and whatever object x may be, if c has the property f and d has the property g and x has neither, then x does not have property χ.

We can now enunciate "1 + 1 = 2" as follows: If φ and ψ are unit properties, and there is an object which has property φ but not the property ψ, then "φ or ψ" is a dual property. It is a tribute to the giant intellects of school children that they grasp this great truth so readily.

We define "1" as being the property of being a unit property and "2" as being the property of being a dual property. The point of this rigmarole is to show that "1 + 1 = 2" can be enunciated without mention of either "1" or "2". The point may become clearer if we take an illustration. Suppose Mr A has one son and one daughter. It is required to prove that he has two children. We intend to state the premise and the conclusion in a way not involving the words "one" or "two". We translate the above general statement by putting:

  φx .=. x is a son of Mr A,
  ψx .=. x is a daughter of Mr A.

Then there is an object having the property φ, namely Mr A junior; whatever x may be, if it has some property that Mr A junior does not have, it is not Mr A junior, and therefore not a son of Mr A senior. This is what we mean by saying that "being a son of Mr A" is a unit property. Similarly "being a daughter of Mr A" is a unit property. Now consider the property "being a son or daughter of Mr A", which we will call χ. There are objects, the son and daughter, of which (1) the son has the property of being male, which the daughter has not; (2) the son has the property χ and the daughter has the property χ; (3) if x is an object which lacks some property possessed by the son and also some property possessed by the daughter, then x is not a son or a daughter of Mr A. It follows that χ is a dual property. In short, a man who has one son and one daughter has two children.

² From Essays in Analysis, Section V, Chapter 15: Is Mathematics Purely Linguistic, pages 301 and 302.[5]
As you can see, once we try to be specific about exactly what we mean, even simple facts that we assume to be self-evident become mired in technical detail. For the most part people have a sufficiently solid intuitive grasp of concepts like number and addition that they can see that 1 + 1 = 2 without having to worry about the technical pedantry. The more abstractions we pile atop one another, however, the less intuition people have about the concepts involved, and it becomes increasingly important to spell things out explicitly. In fact much of modern mathematics has reached the point where it is sufficiently far divorced from everyday experience that common intuition is counterproductive, and leads to false conclusions. If you think that sounds silly then consider that modern physics has also passed this threshold: few people can claim quantum mechanics to be intuitive. Our experience of the world is actually remarkably narrow, and thus so is our intuition.

At about this point I imagine some readers are wondering, if it is so easy to get lost and mired in technicality, why bother with all of these abstractions? I like to think of this process of increasing abstraction as akin to a road deep into the mountains. At times the road can look awfully steep. At other
times, to avoid a sheer climb, the road is forced to take a tortuous, winding path. As you progress deeper into the mountains, however, you will find places where the road opens out to present you with a glorious vista looking out over where you have come from. Each new view takes in an ever broader sweep of the landscape, allowing you to see further and more clearly, while also seeing all the other different roads that lead to this same peak. It is these unexpected moments of beauty, clarity and insight, upon rounding a corner, that, to me, make the study of mathematics worthwhile. I hope that, in this journey into the interior of mathematics, I can impart to you some of those moments of beauty and wonder.
1.2 The Slow Road
natsukusa ya
tsuwamonodomo ga
yume no ato

The summer grasses:
The high bravery of men-at-arms,
The vestiges of dream.³
Matsuo Basho, on visiting Hiraizumi, once home to the great Fujiwara clan whose splendid castles had been reduced to overgrown grass mounds.
A good haiku not only arrests our attention, it also demands reflection and contemplation of deeper themes. In Basho's Oku no Hosomichi, The Narrow Road to the Interior, the haiku often serve as a point of pause amidst the travelogue, asking the reader to slow down and take in all that is being said. The slow road to understanding is often the easiest way to get there. At the same time the travelogue itself provides context for the haiku. Without that context, both from the travelogue and from our own experiences of the world upon which the haiku asks us to reflect, the poem becomes shallow: you can appreciate the sounds and the structure, but the deeper meaning, the real essence of the haiku, is lost.

Mathematics bears surprising similarities. A well crafted theorem or proof demands reflection and contemplation of its deep and wide ranging implications. As with the haiku, however, this depth is something that can only be provided by context. A traditional approach to advanced mathematics, and indeed the approach you will find in most textbooks, is the axiomatic approach: you lay down the rules you wish to play by, assuming the bare minimum of required knowledge, and rapidly build a path straight up the mountainside. This is certainly an efficient way to get to great heights, but the view from the top is often not rewarding unless you have spent time wandering through the landscape you now look out upon. Simply put, you lack the context to truly appreciate the elegant and deep insights that the theorems have to offer; like the haiku, the mathematics becomes shallow. My task, then, is to provide you with the necessary experiences in the mathematical landscape; to provide context for the insights that are to follow.
³ Translation by Earl Miner[4]
In the previous entry, On Abstraction, I discussed the process of abstraction, and how mathematics builds up layer upon layer of abstraction. The road we must take, the slow road, is the path that winds its way through these layers. Each layer is, in a sense, a small plateau amidst the mountains, to be explored before the next rise begins. The place to start, therefore, is with the area of mathematics that most people already have a fairly strong intuitive sense for: numbers.

Many people tend to assume that mathematics is all about numbers, something that simply isn't the case. Numbers are just one of the more extreme abstractions from the external world, amongst many different abstractions that make up modern mathematics. Even in antiquity mathematics was divided into arithmetic, which abstracted quantity, and geometry, which abstracted shape and form. Numbers are, however, something that almost everyone has (or thinks they have) a solid intuitive grasp of, and studying the nature of that abstraction, and how it is made, will provide some context for other similar abstractions, as well as providing a solid base from which to build further layers of abstraction.

The concept of number is both a greedy abstraction and a remarkable one. It is greedy in that it tries to abstract away as much detail as possible. Given a collection of objects (for now we'll take "collection" as intuitive; most people's everyday experience is sufficient for elementary numbers and doesn't run afoul of the pathological cases that require strict definitions to avoid) we forget absolutely everything about the collection, and about the objects themselves, except for a single particular property. The abstraction is remarkable because, by being so very greedy, it is applicable to everything: there is simply nothing in our experience of the world that doesn't fall under the umbrella of this particular generalisation. Everything can be quantified in some sense, albeit trivially (as "one") in many cases.
This is the power of mathematics: by seeking greedy abstractions, by generalising as much as possible, it finds properties or concepts that have near universal applicability. Mathematics allows you to speak about everything at once. The catch, of course, is that by abstracting too far you leave yourself unable to say anything useful (you can say nothing about everything). The trick is to forget as much as possible about particular instances, while still leaving some property that can be worked with in a constructive manner toward some purpose or other. Whether you've forgotten too much depends on your particular purpose, that is, on what structure or property you are interested in. Mathematics is the art of effective forgetting.

The effectiveness of numbers comes from arithmetic. The basic operation of addition allows us to describe the results of bringing together two collections and regarding them as one. By regarding this notion of combination in
the very abstract terms of addition of numbers we gain two things: first, by removing the messy particularity of the world via abstraction we make the process simple to deal with; second, by using the greedy abstraction of numbers we produce universally applicable results; i.e. 2 + 3 = 5 is a statement about any collection of 2 things and any collection of 3 things. Performing an addition is generalising across incredibly broad classes of real world situations. We are saying an enormous amount incredibly simply.

The beauty and complexity starts to unfold when, due to the greediness of the abstraction, we find that numbers can reflect back on themselves. For example, additions can form collections, and, as noted, collections have quantitative properties. Thus we can talk about a particular number of additions, for example we might have 5 additions of 3, and arrive at multiplication (that is, 3 + 3 + 3 + 3 + 3 = 5 × 3). We can talk about a particular number of multiplications and arrive at exponentiation, and so on. By folding the abstraction back on itself we can build layers of structure, structure that may be far more complicated than we might first imagine.

In introducing multiplication we raise the question of its inverse, division. That is, if we can find the quantity that results from some number of additions, we can ask to go the other way and decompose a quantity into some number of additions. In doing this, however, we introduce prime numbers (those which cannot be decomposed into two or more additions of a single integer greater than 1) and fractions. Whole new expanses of complexity and structure open up before you: there is, apparently, a whole world to explore.

In the next section we'll look into what kind of abstractions we can make from the world of numbers, and dip our toes into the beginnings of algebra. In the meantime, however, I'll leave you with a question about numbers to ponder: Can every even integer greater than 2 be written as the sum of 2 primes?
This is commonly known as Goldbach's conjecture, and it remains an open problem to this day; no one knows the answer. It is worth taking a moment to think about the problem yourself, and wonder why it may, or may not, be true, what it really means, and how little we really know about the strange world of numbers.
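For readers who like to experiment, the question above can be explored by brute force. The following is a minimal sketch in Python; the function names `is_prime` and `goldbach_pair` are made up for this illustration, and the trial-division primality test is chosen for simplicity rather than speed. Checking small cases like this is only evidence, never a proof, since the conjecture is a statement about every even integer.

```python
def is_prime(n: int) -> bool:
    """Return True if n is prime, using trial division up to sqrt(n)."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def goldbach_pair(n: int):
    """Return one pair of primes (p, q) with p + q == n and p <= q, or None."""
    for p in range(2, n // 2 + 1):
        if is_prime(p) and is_prime(n - p):
            return (p, n - p)
    return None


# Print one decomposition for each even number from 4 to 20.
for n in range(4, 21, 2):
    p, q = goldbach_pair(n)
    print(f"{n} = {p} + {q}")
```

Extending the range is an easy way to get a feel for the problem: the search keeps succeeding, yet nothing in the code explains why it should.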
1.3 A Fraction of Algebra
As a mathematician there is a story I hear a lot. It tends to come up whenever I tell someone what I do for the first time, and they admit that they don't really like, or aren't very good at, mathematics. In almost every case, if I bother to ask (and these days I usually do), I find that the person, once upon a time, was good at and liked mathematics, but somewhere along the way they had a bad teacher, or struck a subject they couldn't grasp at first, and fell a bit behind. From that point on their experience of mathematics is a tale of woe: because mathematics piles layer upon layer, if you fall behind then you find yourself in a never ending game of catch-up, chasing a horizon that you never seem to reach; that can be very dispiriting and depressing.

In the previous entries we have dealt with subjects (abstraction in general, and the abstraction of numbers) that most people have a natural intuitive grasp of, even if the details, once exposed, prove to be more complex than most people give them credit for. It is time to start looking at subjects that often prove to be early stumbling blocks for some people: fractions and algebra.

There is a reason that these subjects give people pause when they first encounter them, and that is, quite simply, that they are difficult. They are difficult in that they represent another order of abstraction. Both fractions and elementary algebra must be built from, or abstracted from, the basic concept of numbers. Because of the sheer prevalence of numbers and counting in our lives from practically the moment we are born, people quickly develop a feel for this first, albeit dramatic, abstraction. It is when people encounter the next step, the next layer of abstraction, in the form of fractions and/or algebra, that they have to actively stretch their minds to embrace a significant abstraction for the first time.
Most of us, having won this battle long ago, struggle to see the problem in hindsight: we might recall that we had trouble with the subject when we were younger, but would have a hard time saying why. We have developed the same sort of intuitive feel for fractions and algebra as we have for numbers and have forgotten that this is hard won knowledge.

I want to begin with fractions because, ultimately, they are by far the easier of the two, being only a semi-abstraction, and will provide an example of the process as background for stepping up to elementary algebra. As was noted in the last entry, the complexity of mathematics begins to open up once we pass from considering numbers as referring to collections of objects and begin to think of them as objects in their own right. Once we have grasped that abstraction we can count numbers themselves, and operations on numbers, giving us the higher order construction of multiplication. Division operates in a similar way, providing an inverse to multiplication in
the same way that subtraction provides an inverse to addition. That is, while addition asks "if I add a collection of size 3 to a collection of size 2, what size is the resulting collection?", subtraction asks the inverse question "if I got a resulting collection of size 5 by adding some collection to a collection of size 3, how much must I have added?"; parallelling that, we have multiplication asking "if I add together 5 collections of size 3, what size is the resulting collection?", and division reversing the question: "if I have a resulting collection of size 15, how many collections of size 3 must I have added together?".

Everything seems fine so far, but there is some subtlety here that complicates the issue. If we are still thinking in terms of collections then dividing a collection of 2 objects into 4 parts doesn't make sense. If we are viewing numbers and operations on them as entities in their own right then we can at least form the construction 2/4, and ask if it might have a practical use. It turns out that it does, since it allows a change of units.

What do I mean by this? We can say a given collection has the property of having 2 objects in it, but to do so is to make a decision about what constitutes a discrete object. Deciding what counts as an object, however, is not always clear: there are often several possible ways to do it, depending on what you wish to consider a "whole object" (that is, the base unit which you use to count objects in the collection). A simple example: in the World Cup soccer finals, do you count the number of teams, or the number of individual players? Both make sense depending on the kind of result you want to obtain, so considering a team, or each individual player, as a discrete object is a choice. The problem is even more common when dealing with measurement: a distance is measured as a certain number of basic lengths, but what you use as your basic length (the unit of measure) is quite arbitrary.
We tend to measure highway distances in miles or kilometres and people's heights in feet or metres, but we could just as easily switch to different units and measure highway distances in feet or metres and still be talking about the same distance. Most importantly we can change our minds about, or re-interpret, what constitutes a distinct object after the fact.

Using this re-interpretation of what constitutes a discrete object we can make sense of 2/4. If we re-interpret a distinct object such that what we had previously considered a single object is now considered two objects then we will have 4 objects in the collection, and we need 4 of these new objects to arrive at a collection that would be regarded as having 2 old objects. That is, 2/4 is expressible in terms of the re-interpreted objects, and in fact defines the relationship between old objects and new. But here's the rub: we arrived at the new objects by considering each old object as 2 new objects, and so 1/2 expresses the same relationship between old and new objects: 1 old object re-interpreted as 2 results in the
same new object as 2 old objects re-interpreted as 4. Indeed, we can go on like this, with 3/6, 4/8, 5/10, and so on, all expressing the same relationship of new object to old: all different ways to arrive at the same size of new object.

And so we have a catch: what are on inspection quite different expressions will, in practice, behave the same. Re-interpreting 1 object as 2, or 2 objects as 4, results in the same new objects, so counting, and hence addition, subtraction, and multiplication of these new objects will give the same result, whichever re-interpretation we use. Perhaps it doesn't seem like much of a revelation that 2/4 is the same as 1/2, but that is simply because we have learned, through practice, to automatically associate them. The reality is that 2/4 and 1/2 are quite distinct, and it is only because they behave identically with regard to arithmetic that we regard them as the same. In identifying them as the same we are abstracting over such expressions, forgetting the particularities of what size of initial collection we were dividing, caring only about the common behaviour with regard to arithmetic. Making sense of fractions involves abstracting over numbers. They are another level of abstraction, and this, I suspect, is why people find them difficult when they first encounter them.

There is an important idea in this particular abstraction that is worth paying attention to, because it leads the way to algebra. We have an infinite number of different objects: 1/2, 2/4, 3/6, 4/8, . . . but because they all behave identically with respect to a given set of rules (in this case basic arithmetic) we pick a single symbol to denote the entire class of possible objects. Algebra can be thought of as extending that idea to its logical conclusion. The insight we need to make the step to algebra is that there is a subset of the rules of arithmetic for which all numbers behave identically.
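As a brief aside, the identification of 1/2, 2/4, 3/6 and so on can be made concrete. The sketch below is illustrative only: `same_relationship` and `representative` are hypothetical helper names, and denominators are assumed to be positive. It treats each fraction as a plain pair of numbers, shows that the pairs are distinct as written, and then shows that they collapse to a single representative. Python's standard `fractions.Fraction` type performs the same identification automatically.

```python
from fractions import Fraction
from math import gcd


def same_relationship(a: int, b: int, c: int, d: int) -> bool:
    """a/b and c/d express the same relationship exactly when a*d == c*b."""
    return a * d == c * b


def representative(a: int, b: int) -> tuple:
    """Pick one symbol for the whole class by dividing out the common factor."""
    g = gcd(a, b)
    return (a // g, b // g)


pairs = [(1, 2), (2, 4), (3, 6), (4, 8), (5, 10)]

# As written expressions the five pairs are all distinct...
print(len(set(pairs)))
# ...but every one of them reduces to the same representative.
print({representative(a, b) for a, b in pairs})

# Fraction makes the same identification, and arithmetic does not depend
# on which expression we started from.
print(Fraction(2, 4) == Fraction(1, 2))
print(Fraction(2, 4) + Fraction(1, 3) == Fraction(1, 2) + Fraction(1, 3))
```

The cross-multiplication test in `same_relationship` is one way of saying "behaves identically with regard to arithmetic" without ever dividing anything.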
For example reversing the order of addition makes no difference to the result, no matter what numbers you are adding: 1 + 2 = 2 + 1, and 371 + 27 = 27 + 371. If you can identify which rules have the property that the specific numbers don't matter, then you can pick a single symbol to denote the entire class of numbers for any manipulations within that set of rules. This is algebra.

This is important because it is a layer of abstraction over and above the abstraction of numbers. With numbers we considered many different collections and abstracted away everything about them except a certain property: the number of objects they contain. This proved to be useful because with regard to a certain set of rules, the rules of arithmetic, that was the only aspect of the collection that made a difference. Now we are regarding numbers as objects in their own right and, having identified a set of rules under which the particular number is unimportant, we are abstracting away what particular number we are dealing with. With numbers we could perform calculations and have the result be true regardless of the particular
nature of the collections beyond the number of objects. Now, with algebra, we can perform calculations and have the result be true regardless of the particular numbers involved. This is an exceptionally powerful abstraction: it essentially does for numbers what numbers do for collections. This is why the rules of algebra, that subset of arithmetic rules under which all numbers behave identically, are so important. In particular we can say that, no matter what numbers x, y and z are, the following are always true:

1. x + y = y + x and x × y = y × x. These are referred to as commutative properties.
2. x + (y + z) = (x + y) + z and x × (y × z) = (x × y) × z. These are referred to as associative properties.
3. x × (y + z) = x × y + x × z. This is referred to as a distributive property.
4. x + 0 = x and x × 1 = x. This property of 0 and 1 is referred to as being an identity element for addition and multiplication (respectively).
5. There is a number, denoted −x, such that x + (−x) = 0. This refers to the existence of inverses for addition.

We also have one odd one out: the existence of inverses for multiplication. The catch here is that it does matter what number x is; inverses exist for almost every number, but if x = 0 there is no multiplicative inverse of x. Thus we have:

6. If x is any number other than zero then there is a number, denoted 1/x, such that x × (1/x) = 1.
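Properties 1 through 6 can be spot-checked mechanically. What follows is a minimal sketch, not a proof: it samples random exact fractions using Python's standard `fractions.Fraction` (so no rounding error creeps in) and tests each property on every sample. The names `random_fraction` and `check_properties`, the sample size, and the range of values are all arbitrary choices made for this illustration.

```python
import random
from fractions import Fraction


def random_fraction(rng: random.Random) -> Fraction:
    """Return a random exact fraction with a small numerator and denominator."""
    return Fraction(rng.randint(-20, 20), rng.randint(1, 20))


def check_properties(x: Fraction, y: Fraction, z: Fraction) -> bool:
    """Return True if properties 1 through 6 hold for this particular x, y, z."""
    checks = [
        # 1. commutative properties
        x + y == y + x and x * y == y * x,
        # 2. associative properties
        x + (y + z) == (x + y) + z and x * (y * z) == (x * y) * z,
        # 3. distributive property
        x * (y + z) == x * y + x * z,
        # 4. identity elements for addition and multiplication
        x + 0 == x and x * 1 == x,
        # 5. inverses for addition
        x + (-x) == 0,
        # 6. inverses for multiplication, claimed only when x is not zero
        x == 0 or x * (1 / x) == 1,
    ]
    return all(checks)


rng = random.Random(0)  # fixed seed so that a run is repeatable
samples = [tuple(random_fraction(rng) for _ in range(3)) for _ in range(1000)]
print(all(check_properties(x, y, z) for x, y, z in samples))

# The special case: asking for the multiplicative inverse of zero fails.
try:
    1 / Fraction(0)
except ZeroDivisionError:
    print("0 has no multiplicative inverse")
```

Note that the code has to single out zero explicitly in property 6, just as the text does; every other property is checked without caring which numbers were drawn.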
If you have any curiosity you will be wondering why this special case occurred, breaking the pattern. Remember that we are talking about abstract properties common to all numbers, so the fact that this is a special case says something quite deep about multiplication, fractions, and the number zero. Indeed, because we are two layers of abstraction up, referring to all numbers, which in turn each refer to all collections with a given property, the fact that this is a special case has significance with regard to almost everything in the physical world. It is worth spending some time thinking about what it truly means.

We have some further properties with regard to how numbers can be ordered. I haven't touched on this topic yet (we've only referred to numbers as a property of collections, and not as an ordering) but it is sufficiently
intuitive (that is, most people have a firm enough grasp on numbers) that I won't get into details here; just be forewarned that numbers as order and numbers as size are actually distinct concepts that, at some point, we will have to carefully tease apart.

7. Either x < y, y < x, or x = y.
8. If x < y and y < z then x < z.
9. If x < y then x + z < y + z.
10. If x < y and 0 < z then x × z < y × z.

Note that, again, 0 and multiplication have a significant interaction and provide another special case. Note that I gave names to properties 1 through 5 because these properties will keep cropping up again and again later; some will prove to be important, others less so. Which ones are important and which are not may be somewhat of a surprise, but I'll leave that surprise till later.

At this point it is worth taking stock of how far we've come. Not only have we built up two layers of abstraction, each of which can be used to great practical effect (just witness how much of modern technology and engineering is built upon arithmetic and elementary algebra!), in doing so we've begun to uncover an even deeper principle, the principle that will form the foundation for much of the modern mathematics that is to follow. What do I mean? There is a common thread to how these successive abstractions have been built: we discerned a set of rules for which an entire class of objects (potentially even completely abstract objects) behave identically, and this allowed us to abstract over the entire class. The broader the class, the broader the results we can draw; the higher the abstraction (in terms of successive layers), the deeper the results we can draw. The approach now will be to seek out rules, and classes that they allow us to abstract over; the broader and more layered, the better. In so doing we will part ways with numbers entirely.
Fractions, ordering, and the difficulties of 0 will lead us towards a kind of generalised geometry, while consideration of properties 1 through 6 will lead us to a language of symmetry. We have come to the first truly significant incline on our road. Behind us lies a vast plain of numbers, fractions, and algebra. There is much more to explore there (we haven't even touched on popular topics such as trigonometry), but in following the path we have, we have stumbled across a road that leads deep into the mountains. We have identified a common property to the abstractions we are making, and will now seek to generalise it. The
importance of this cannot be overstated! We are abstracting over the process of abstraction itself! This is the path to high places from which, when we finally arrive, we can look out, over all the plains we now leave behind, with fresh eyes, and deeper understanding.
A Fork in the Road

Alice came to a fork in the road. "Which road do I take?" she asked. "Where do you want to go?" responded the Cheshire cat. "I don't know," Alice answered. "Then," said the cat, "it doesn't matter."
Lewis Carroll, Alice's Adventures in Wonderland [2]
In the later years of his life, after his journey to the interior, Basho lived in a small abandoned thatched hut near Lake Biwa that he described as being "at the crossroads of unreality"¹. Now, still early in our journey, we have come to our own crossroads of unreality. We are caught between dichotomies of unreal, abstract, objects. One road leads to consideration of finite collections, and properties of composition (the algebraic properties 1 through 5 from the previous section); the other road leads to the continuum and questions of ordering and inter-relationship (properties 7 through 10 from the previous section). The first road will lead to a new fundamental abstraction from finite collections, different from, and yet as important as, the abstraction that we call numbers; this way lies group theory and the language of symmetry that has come to underlie so much of modern mathematics and physics. The second road will lead to deep questions about the nature of reality, and, brushing past calculus along the way, lead to a new and minimalist interpretation of a continuous space through the concept of topology.

Which road do we take? As the cat said to Alice, "It doesn't matter." We are at the crossroads of unreality, and the usual rules need not apply. Which road do we take? Both.
¹ From the translation of Genjūan no fu by Donald Keene, in Anthology of Japanese Literature [3]
2.1
The Paradoxes of the Continuum, Part I
Infinity is a slippery concept. Most people tend to find their metaphorical gaze just slides off it, leaving it as something that can only ever be glimpsed, blurry and unfocused, out of the corner of their eye. The problem is that, for the most part, infinity is defined negatively; that is, rather than saying what infinity is, we say what it is not. This, in turn, is due to the nature of the abstraction that leads to the concept of infinity in the first place.

The ideas of succession and repetition are fairly fundamental, and are apparent in nature in myriad ways. For example, the cycle of day and night repeats, leading to a succession of different days. Every such series of successive events is, in our experience, bounded: it only extends so far, up to the present moment. Of course such a series of events can extend back to our earliest memories. Via the collective memory of a society, passed down through written or oral records, it can even extend back to well before we were born. Thus, looking back into the past, we come to be aware of series of successive events of vastly varying, though always bounded, length. We can then, at least by suitable juxtaposition of a negation, form the concept of a sequence of succession that does not have a bound. And thus arises the concept of infinity.

Is the concept coherent? Does succession without bound make any sense? With this conception of infinity it is hard to say, for we have only really said it is a thing without a bound. We have said what property infinity does not have, but we have said little about what properties it does have. Indeed, despite the basic concept of infinity extending back at least as far as ancient Greece, whether infinity is a coherent concept has been a point of bitter debate, with no significant progress made until as recently as the end of the 19th century.
Even now, despite having a fairly well grounded definition and theory for transfinite numbers, there is room for contention and differing conceptions of infinity, and in particular of the continuum. Such modern debate divides over subtle issues which we will come to in due course. First, however, it will be educational to look at some of the more straightforward reasons that people have difficulty contemplating infinity: the apparent paradoxes and contradictions that arise.

Some of the earliest apparent paradoxes that involve the infinite are from ancient Greece. Among the more well known are the paradoxes proposed by Zeno of Elea. Interestingly, Zeno's paradoxes (of which three are particularly well known) were not originally intended to discredit the concept of infinity; on the contrary, they assume the coherency of infinity as a concept to make their point. Zeno was a student of Parmenides, who held that the universe was actually a static unchanging unity. Zeno's paradoxes were intended to demonstrate
that motion, and change, are actually just illusions. The paradoxes have, however, come to be associated with the paradoxical nature of the infinite.

The first of Zeno's paradoxes, the Dichotomy, essentially runs as follows: Before a moving body can reach a given point it must traverse half the distance to that point, and before it can reach that halfway point it must traverse half of that distance (or one quarter of the distance to the end point), and so on. Such division of distance can occur indefinitely, however, so to get from a starting point to anywhere else the body must traverse an infinite number of smaller distances, and surely an infinite number of tasks cannot be completed in a finite period of time?

The second paradox, the most well known of the three, is about a race between Achilles and a tortoise, in which the tortoise is granted a head start. Zeno points out that, by the time Achilles reaches the point where the tortoise started, the tortoise will have moved ahead a small distance. By the time Achilles catches up to that point, the tortoise will again have moved ahead. This process, with the tortoise moving ahead smaller and smaller distances, can obviously occur an infinite number of times. Again we are faced with the difficulty of completing an infinite number of tasks. Thus Achilles will never overtake the tortoise!

The third paradox, the Arrow, raises more subtle questions regarding the continuum, so I will delay discussion of it until later.

Taken together the paradoxes were supposed to show that motion is paradoxical and impossible. Few people are actually convinced, however: everyday experience contradicts the results that the paradoxes claim. The common reaction is more along the lines of "Okay, sure. What's the trick?"
The trick is actually relatively subtle, and while rough and ready explanations can be given by talking about convergent series, it is worth actually parsing out the fine details here (as we've seen in the past, the devil is often in the details), as it will go a long way toward informing our ideas about infinity and continuity.

Let us tackle the Dichotomy first. To ease the arithmetic, let us assume that the moving body in question is traversing an interval of unit length (which we can always do, since we are at liberty to choose what distance we consider to be our base unit), and that it is travelling at a constant speed. We can show that, contrary to Zeno's claim, the object can traverse this distance in some unit length of time (again, a matter of simply choosing an appropriate base unit) despite having to traverse an infinite number of shorter distances along the way. To see this, consider that, since the body is travelling at a constant speed, it would have to cover a distance of 1/2 in a time of 1/2, and before that it would cover a distance of 1/4 in a time of only 1/4, and so on. The key to resolving this is that the infinite sum $1/2 + 1/4 + 1/8 + 1/16 + \cdots$ is equal to 1, and thus the infinite tasks can,
indeed, be completed in finite time.

This tends to be the point where most explanations stop, possibly with a little hand-waving and vague geometric argument about progressively cutting up a unit length. It is at this point, however, that our discussion really begins. You can make intuitive arguments as to why the sum turns out to be 1, but, given that we weren't even that clear about what 1 + 1 = 2 means, a little more caution may be in order, particularly given that infinity is something completely outside our practical experience, so our intuitions about it are hardly trustworthy.

Since we can't trust our intuitions about infinite sums yet, it seems sensible that we should look at finite sums instead. Certainly we can calculate the sum 1/2 + 1/4 = 3/4, and 1/2 + 1/4 + 1/8 = 7/8, and so on. Each of these sums will, in turn, give a slightly better approximation of the infinite sum we wish to calculate; the more terms we add, the better the approximation. The obvious thing to do, then, is to consider this sequence of ever more accurate approximations and see if we can say anything sensible about it. To save myself some writing I will use $S_n$ to denote the sum $1/2 + 1/4 + 1/8 + \cdots + 1/2^n$ (thus $S_2 = 1/2 + 1/4$ and $S_4 = 1/2 + 1/4 + 1/8 + 1/16$, and so on), and talk about the sequence of partial sums $S_1, S_2, S_3, \ldots$

It may not seem that we've made much improvement, having shifted from summing up an infinite number of terms to considering an infinite sequence of sums, but surprisingly infinite sequences are easier to deal with than infinite sums, and we at least only have finite sums to deal with now. The trick from here is to deal with the $n$th term of the sequence for values of $n$ that are finite, but arbitrarily large. That means we get to work with finite sums (since for any finite $n$, $S_n$ is a finite sum) which we can understand, but at the same time have no bound on how large $n$ can be, which brings us into contact with the infinite.
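For readers who like to see numbers, here is a minimal sketch of the first few partial sums, computed with exact fractions so that no rounding muddies the picture. The function name `partial_sum` is my own choice for illustration; nothing here goes beyond the finite sums described above.

```python
from fractions import Fraction

def partial_sum(n):
    """Return S_n = 1/2 + 1/4 + ... + 1/2**n as an exact fraction."""
    return sum(Fraction(1, 2**k) for k in range(1, n + 1))

# List the first few terms of the sequence S_1, S_2, S_3, ...
for n in range(1, 7):
    print(f"S_{n} = {partial_sum(n)}")
```

Each term is a perfectly ordinary finite sum; only the sequence as a whole has no end.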
In a sense we are building a bridge from the finite to the infinite: any given case is finite, but which term the case deals with is without bound.

Before we can get to the arbitrarily large, however, we must first deal with the arbitrarily small. In some ways it was the arbitrarily small that led to this problem: the paradox is founded on the presumption that the process of dividing in half can go on indefinitely, resulting in arbitrarily small distances to be traversed. It is precisely this property of infinite divisibility that is a necessary feature of the idea of a continuum: something without breaks or jumps. The opposite of the continuous is the discrete; a discrete set of objects can only be divided into the finest granularity provided by the discrete parts, since any further division would involve a reinterpretation of what constitutes an object. In presuming indefinite divisibility we have moved away from discrete collections of objects, and into the realm of continuous things. In the world of the continuous we
may talk about the arbitrarily small (a result of arbitrarily many divisions; note the relationship between the infinite and the continuous).

What we are really after is a concept of convergence; the idea that as we move further along the sequence we get closer and closer, and eventually converge to, some particular value. That is, we want to be able to say that, by looking far enough along the sequence we can end up an arbitrarily small distance away from some particular value that the sequence is converging to. This, in turn, leads us to the next concept: distance. We need to be careful here because while the original problem was about a moving object covering a certain distance in the real world, we have abstracted away these details so as to have a problem solely about sequences of numbers. That means we are no longer dealing with practical physical distance, but an abstract concept of distance between numbers.

So what does it mean for one number to be "close" to another? We need a concrete definition rather than vague intuition if we are to proceed. Since numbers are purely abstract objects we could, in theory, have "close" mean whatever we choose. There is a catch, however: when talking about numbers we generally assume that they are ordered in a particular way. For example, when arriving at rules for algebra we included rules for ordering numbers. This implicit ordering defines closeness in the sense that we would like to think that x < y < z means that y is closer to z than x is. Looking back at the rules regarding ordering we find that this means that the closer z − y is to 0, the closer y is to z. That's really just saying that the smaller the difference between y and z, the smaller the distance between them, and so the definition of distance we need is the difference between y and z! The final catch is that we would like to be able to consider the distance from z to y to be the same as the distance from y to z, but z − y = −(y − z).
The solution is simply to say that the direction of measurement, and hence the sign of the result, is irrelevant and take the absolute value to get:

The distance between y and z is |y − z|.

As a momentary aside, it is worth noting that we have defined a distance between numbers to be another number, but that the number that defines the distance is, in some sense, not the same type of number. The number defining the distance is a higher level of abstraction, since it is a number describing a property of abstract objects, while the numbers that we are measuring distance between are describing concrete reality. For the most part these differences don't matter (numbers are numbers and all behave the same), but as we move deeper into the philosophy of mathematics teasing apart these subtleties will be important. Now, back to the problem at hand...
It is time to put the power of algebra (the ability to work with a number without having to specify exactly which number it is) to use. Let $\varepsilon$ be some non-zero positive number, without specifying exactly what number (I'm using $\varepsilon$ because it is the traditional choice among mathematicians to denote a number that we would like to presume is very small, that is, very close to zero). Then I can choose $N$ to be a number large enough that $2^N$ is bigger than $1/\varepsilon$, and hence $1/2^N$ is less than $\varepsilon$. Exactly how big $N$ will have to be will depend on how small $\varepsilon$ is, but since there is no bound on how big $N$ can be, we can always find a big enough $N$ no matter how small $\varepsilon$ turns out to be. Now, if we note that, for any $n$,

$$S_n = \frac{2^n - 1}{2^n}$$

(which you can verify for yourself fairly easily) then, if we assume that $n$ is bigger than $N$, we find that the distance between 1 and $S_n$ is:

$$|1 - S_n| = \left|1 - \frac{2^n - 1}{2^n}\right| = \frac{1}{2^n} < \frac{1}{2^N} < \varepsilon.$$
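The argument above can be tried out on a particular $\varepsilon$. The following is only a sketch under my own naming (`partial_sum`, `find_N`) and with an arbitrarily chosen $\varepsilon = 1/1000$; it checks the closed form for $S_n$ and then confirms that the terms beyond the $N$th sit within $\varepsilon$ of 1. A check of a few terms is, of course, an illustration and not a proof.

```python
from fractions import Fraction

def partial_sum(n):
    """Return S_n = 1/2 + 1/4 + ... + 1/2**n as an exact fraction."""
    return sum(Fraction(1, 2**k) for k in range(1, n + 1))

def find_N(eps):
    """Return the smallest N for which 1/2**N is less than eps."""
    N = 0
    while Fraction(1, 2**N) >= eps:
        N += 1
    return N

eps = Fraction(1, 1000)   # an arbitrary illustrative choice of epsilon
N = find_N(eps)
print(f"For eps = {eps} we can take N = {N}")

for n in range(N + 1, N + 6):
    s = partial_sum(n)
    assert s == Fraction(2**n - 1, 2**n)   # the closed form for S_n
    assert abs(1 - s) < eps                # S_n is within eps of 1
    print(f"|1 - S_{n}| = {abs(1 - s)}")
```

Choosing a smaller `eps` simply yields a larger `N`; the loop never fails to find one.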
That may not look that profound because it is buried in a certain amount of algebra, but we are actually saying a lot. The main point here is that $\varepsilon$ was any non-zero positive number: it can be as small as we like; arbitrarily small even. Therefore, what we've just said is that we can always find a number (which we denoted $N$) large enough that every term after the $N$th term is arbitrarily close to 1. That is, by going far enough down the sequence of partial sums (and there are infinitely many terms, so we can go as far as we like), we can reach a point where all the subsequent terms are as close to 1 as we like. This is what we mean when we say that a sequence converges.

We have shown that the further along the sequence you go, the closer and closer you get to 1. It follows then, due to the way the sequence was constructed by progressively adding more terms to the sum, that the more terms of the sum we add together, the closer the sum gets to 1. There is no limit on how close to 1 we can get, since there is no upper limit on the number of terms we can add. In this sense the infinite sum (which has no bound on the number of terms) is equal to 1 (since we are infinitesimally close to 1 by this point).

The key points here were the ideas of distance between numbers, and of convergence, which lets us show in concrete terms that we can end up an arbitrarily small distance away from our intended target, just by looking far enough (and we can look arbitrarily far) along a sequence. These ideas of defining abstract distance, and of convergence as defined in terms of that
distance will continue to be increasingly important as we progress down this road.

Zeno's second paradox, about Achilles and the tortoise, can be tackled in a similar manner. Once we abstract away the details of the problem and arrive at the question of whether we can sum together all the times for each ever smaller distance that Achilles must run to catch the tortoise, we find that the same basic tools, involving sequences of partial sums, and convergence, will yield the same kind of result: Achilles will overtake the tortoise in a finite period of time. I leave the proof, and the determination of how long it will take Achilles, as an exercise to the reader.

So we have resolved two of Zeno's paradoxes; in so doing, however, we have developed a much richer theory. I would like to pause and ask you to contemplate what we've actually done here. It is easy to get mired in the details, but the bigger picture is truly remarkable. Through the concept of convergence we have built a bridge between the finite and the infinite, between the discrete and the continuous. Convergence provides a tool that allows us to extend our concrete reasoning about the finite and the discrete, step by inexorable step, into the realm of the infinite and the continuous. It is a tool that allows us to push out the boundaries of what we can reason about from the restricted and mundane confines of everyday experience to the very limits of possibility and beyond: we can reason about a lack of bounds!

When we next deal with this stretch of road we will look at more potential paradoxes, including Zeno's third paradox, in an effort to better understand the continuum, and the infinite. Next, however, we will start down a different road, and consider other basic abstractions of a finite collection.
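If you would like a numerical nudge before attempting the Achilles exercise, here is a small sketch. All the figures in it are hypothetical choices of mine (a head start of 100 units, Achilles running 10 units per unit time, the tortoise 1), as is the function name; it simply accumulates the time Achilles spends on each successive stage of the chase.

```python
from fractions import Fraction

def catch_up_times(head_start, v_achilles, v_tortoise, stages):
    """Total elapsed time after each stage of Zeno's description of the race."""
    gap = Fraction(head_start)
    elapsed = Fraction(0)
    totals = []
    for _ in range(stages):
        t = gap / v_achilles    # time for Achilles to reach the tortoise's last position
        elapsed += t
        gap = v_tortoise * t    # meanwhile the tortoise has crept this far ahead
        totals.append(elapsed)
    return totals

# Hypothetical race: head start 100, speeds 10 and 1.
totals = catch_up_times(100, 10, 1, 6)
candidate = Fraction(100, 10 - 1)   # head start divided by the difference in speeds
for k, t in enumerate(totals, start=1):
    print(f"after stage {k}: elapsed = {t}, short of {candidate} by {candidate - t}")
```

The elapsed times form a sequence of partial sums, and the shortfall from the candidate value shrinks by the same factor at every stage. Showing that it shrinks below any $\varepsilon$ is the exercise.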
2.2
Shifting Patterns
How hot the sun glows,
Pretending not to notice
An autumn wind blows!²
Matsuo Basho
akaaka to hi wa tsurenaku mo aki no kaze
What is a haiku? Or, more specifically, what makes a particular composition a haiku, as opposed to one of the many other poetic forms? The defining feature most people will be familiar with is the 5-7-5 syllable structure. Within that basic structure, of course, the possibilities are almost endless, and this is what makes haiku so tantalizing to write: you can shift the words and syllables around to craft your message, and as long as you retain the classic 5-7-5 syllable structure you can still call your work a haiku³.

This is not an isolated trait. We constantly define, and categorise, and classify, according to patterns. We determine a basic pattern, an underlying structure, and then classify anything consistent with that structure accordingly. This is our natural talent for abstraction at work again, seeking underlying patterns and structure, and mentally grouping together everything that possesses that structure. It is the means by which we partition and cope with the chaotic diversity of the world. And yet, despite our natural talent for this, it wasn't until the last couple of centuries that we had any treatment for this sort of abstraction comparable to our use of numbers to formalise quantity. Since, unlike with numbers, very few people have had the requisite abstractions drilled into them from a young age, we will have to go a little more slowly,
² Translation by Dorothy Britton [1].
³ In practice Japanese haiku have rather more subtle demands, and are both more, and less, flexible than this; this example is more for illustrative purposes.
and try and tease out the details.

The first point to address is the fact that we have been very vague. It is certainly true that we find patterns, and classify things according to whether they preserve the pattern or not, but the very concept of a pattern is itself only very loosely sketched: we are hiding a lot of detail in words like "pattern" and "structure". The best way to come to grips with this is to start with very simple examples for which we can agree on what we mean by pattern, and see if we can't build up an abstraction from there. Let's consider an arrangement of coloured marbles (red, green, and blue) that looks like figure 2.1, and agree (hopefully) that by "the pattern" here, we mean the specific
triangular layout with the colours arranged just so (two blue marbles in the top corners, a small triangle of green marbles, and a red marble at the bottom corner), with the implicit abstraction that marbles of the same colour are equivalent.

Figure 2.1: Pattern of coloured marbles

We are interested in other arrangements of marbles that also have that pattern. That might sound like an impossible task since the only way to lay out marbles such that they are in that pattern is to lay out the marbles exactly as shown... there are no other ways, right? Not exactly, no. You see, each green marble is different, so we could swap a couple of the green marbles; the marbles would then be laid out differently (we have put specific marbles in different places) but the pattern of colours has remained the same. It helps if we label the marbles like figure 2.2, and then we can see that this rearrangement of marbles still preserves the pattern of colours, shown in figure 2.3. So what happened here? It may help to think in terms of the actions we need to take to go from the initial arrangement to the new rearranged version. We swapped the blue marbles, and rotated the green marbles around in a circle (see figure 2.4). The trick now is to notice
Figure 2.2: Labelled pattern of marbles
Figure 2.3: A different arrangement of marbles that preserves the pattern
Figure 2.4: Showing how the rearrangement was made
that, as long as we are thinking in terms of forming a rearrangement by interchanging marbles, any rearrangement that preserves the pattern works even more generally. That is, if we started with a different initial arrangement of the marbles like in figure 2.5, then making the same interchanges of marbles
as before (swapping the blue marbles, and cycling the three green marbles) will preserve this pattern as well.

Figure 2.5: The same marbles in a rectangular arrangement

Of course if we were to add more marbles, or take some away (and thus alter our numbering scheme), things would once again get more complicated. Still, by thinking in terms of interchanging items we have managed to generalise across a wide variety of particular patterns. We should be taking that as a hint that this particular line of thinking is worth investigating further.

What we are seeing is that if we think in terms of rearrangements that preserve the internal relationships that make up a particular pattern, then those rearrangements will continue to preserve those same internal relationships for any other pattern that has them. In our case with the marbles the internal relationships were defined by which marbles we could tell apart from one another, that is, which marbles were the same colour. If we had swapped a red and green marble we would have broken the pattern; and that would have happened had we done so with the triangular arrangement, or the rectangular one. As long as we work in terms of rearrangements that refer to swapping marbles we can generalise over all the different particular spatial patterns at once.

Don't worry if that isn't sinking in yet, there's another example coming up. In the meantime, however, I want you to notice that the rotation of the green marbles can also be achieved by simply swapping marbles 2 and 5, and then swapping marbles 2 and 4; try it out yourself. This sort of decomposition of rearrangements will prove important.

Our next example, to try and get a feel for things, is a square. We're
interested in all the different things we can do to the square that will have it end up looking the same as when we started (symmetries of the square, if you want to think of it that way). If you're still feeling a little lost with all of this it might help to cut out a square of paper to manipulate yourself as you follow along. First, just as we did with the marbles, we're going to label the square so we can keep track of what we're doing; in this case we're going to number the corners (it will probably be helpful to do this on your square of paper if you have one) as in figure 2.6. What we want to do is find all the
different manipulations of the square that result in a square in exactly the position we started with, and we'll keep track of the different manipulations by how they move the labels in the corners.

Figure 2.6: A square with labelled corners

With a bit of experimentation you'll quickly find that we have three rotations as shown in figure 2.7, and
Figure 2.7: The three different rotations of a square

we can flip the square across four different axes as in figure 2.8, and that's all
Figure 2.8: The four different flips of a square

we can do; for example, if we tried just swapping corners 1 and 2 we would end up with something that isn't a square anymore (see figure 2.9). How is

Figure 2.9: A distorted square, not a square anymore
this similar to our example with marbles? In the same way that we found new arrangements of marbles by swapping marbles around, we are finding new arrangements for the corners of the square. With the marbles we were concerned about the internal relationships formed by the different colours (and our ability to distinguish marbles of different colour, but not marbles of the same colour). With the square the internal relationships are formed by adjacency relations of the corners; that is, we require, for instance, that the corner 1 is always between corners 4 and 2 and opposite to 3; similarly the corner 2 is always between corners 1 and 3, and opposite to 4. Thus swapping just corners 1 and 2, for example, results in the corner 1 being between 2 and 3, and hence breaking the internal relationship. What determines a pattern is
how internal sub-objects relate to one another. What determines a different arrangement that preserves a pattern is whether that arrangement preserves those inter-relationships.

There is more that we can exploit with this example however. As with the marbles example, we can decompose complex rearrangements in terms of simpler ones. Let's consider just two rearrangements of the square: a rotation by 90 degrees, and a flip through the vertical axis, which we'll refer to by the letters r and f (as shown in figure 2.10). Through combination of just these
Figure 2.10: Two basic operations for rearranging a square
two rearrangements we can produce all seven possible pattern preserving rearrangements; for example if we first flip through the vertical axis, and then rotate by 90 degrees (which we will shorthand to fr for "a flip followed by a rotation") then the resulting arrangement is the same as flipping about the diagonal through corners 2 and 4. Our seven rearrangements turn out to decompose as follows:

1. Rotation by 90 degrees: r
2. Rotation by 180 degrees: rr
3. Rotation by 270 degrees: rrr
4. Flip about vertical axis: f
5. Flip about horizontal axis: frr
6. Flip about leading diagonal axis: frrr
7. Flip about trailing diagonal axis: fr
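If you don't have a paper square to hand, the decomposition above can be checked mechanically. The sketch below rests on assumptions of mine, since the figures are not reproduced here: corners are labelled 1 to 4 clockwise starting from the top-left, an arrangement lists the labels found at (top-left, top-right, bottom-right, bottom-left), and r turns the square clockwise. The names `rotate`, `flip` and `apply_word` are likewise only illustrative.

```python
# Assumed labelling: corners 1-4 clockwise from the top-left corner.
# An arrangement is the tuple of labels at
# (top-left, top-right, bottom-right, bottom-left).
START = (1, 2, 3, 4)

def rotate(a):
    """r: rotate by 90 degrees (assumed clockwise); each label moves one place on."""
    return (a[3], a[0], a[1], a[2])

def flip(a):
    """f: flip through the vertical axis; left and right columns trade places."""
    return (a[1], a[0], a[3], a[2])

MOVES = {"r": rotate, "f": flip}

def apply_word(word, arrangement=START):
    """Apply the letters of word left to right, so 'fr' is a flip, then a rotation."""
    for letter in word:
        arrangement = MOVES[letter](arrangement)
    return arrangement

DECOMPOSITION = {
    "rotation by 90 degrees": "r",
    "rotation by 180 degrees": "rr",
    "rotation by 270 degrees": "rrr",
    "flip about vertical axis": "f",
    "flip about horizontal axis": "frr",
    "flip about leading diagonal": "frrr",
    "flip about trailing diagonal": "fr",
}

for name, word in DECOMPOSITION.items():
    print(f"{name:30} {word:5} {apply_word(word)}")
```

Under this labelling fr leaves corners 2 and 4 where they were and exchanges 1 and 3, which agrees with the description of a flip about the diagonal through corners 2 and 4.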
More importantly, any combination of flips and rotations will still result in a rearrangement that preserves the square, since each individual flip and rotation along the way will preserve the square. That means, for instance, that the sequence of flips and rotations frrfrfrrr should correspond to one of these seven possibilities (or simply do nothing at all), but which one? Equally, what happens when we rotate by 270 degrees, then flip about the leading diagonal and rotate by a further 90 degrees? This turns out to be surprisingly easy (no playing with paper squares is required).

A first point to notice is that two consecutive flips (ff) is the same as doing nothing: we end up with our original arrangement. The same happens with four consecutive rotations (rrrr). Letting the symbol e stand for the null rearrangement of doing nothing, we can write these rules as

ff = e
rrrr = e

The last observation we need is that a rotation followed by a flip (rf) results in the same rearrangement as a flip followed by three rotations (frrr); that is

rf = frrr

We can put these rules together to completely understand any possible combination of flips and rotations.

At this point you should be noticing that things are looking a lot less like geometry and a lot more like algebra. This is a different sort of algebra altogether however. Previously, we developed algebra by letting a letter stand in for any possible number; something we could do because we had determined which arithmetic rules were true regardless of which particular numbers were used. Here we have letters standing not for numbers, but for rearrangements. The result is that the arithmetic rules look very different. When we were abstracting numbers we had the commutative law that x × y = y × x; here we find that isn't true at all: instead of rf = fr we have rf = frrr. We do have, however, exactly what algebra offered us for numbers: a set of rules for what operations we can perform.
In this case we know that we can use the fact that rf = frrr to steadily move all the r's to the right of any f's. That means we can rearrange any sequence of flips and rotations so that all the f's are together on the left, and all the r's are together on the right. Then all we have to do is use the other two rules to cancel down the f's and r's. We can have either 0 or 1 consecutive f's followed by 0, 1, 2, or 3 consecutive r's. A quick scan of our decomposition of seven rearrangements will show these cover all such possibilities (except the null case of no f's and no r's).
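The procedure just described is mechanical enough to hand to a computer. What follows is a minimal sketch, with the function name `normal_form` being my own: it reads a word one letter at a time, keeping track of how many f's and r's the tidied-up word has so far. Pushing an f leftwards past k rotations with rf = frrr turns those k rotations into 3k, and the rules for ff and rrrr let us count f's modulo 2 and r's modulo 4.

```python
def normal_form(word):
    """Reduce a word in the letters 'f' and 'r' to at most one f followed by at most three r's.

    Uses only the rules ff = (nothing), rrrr = (nothing) and rf = frrr.
    The empty string stands for the null rearrangement.
    """
    flips, rotations = 0, 0
    for letter in word:
        if letter == "r":
            rotations = (rotations + 1) % 4      # rrrr cancels
        elif letter == "f":
            # Moving this f left past k rotations turns them into 3k rotations.
            rotations = (3 * rotations) % 4
            flips = (flips + 1) % 2              # ff cancels
        else:
            raise ValueError(f"unexpected letter: {letter!r}")
    return "f" * flips + "r" * rotations

print(normal_form("frrfrfrrr"))            # the long sequence asked about above
print(normal_form("rrr" + "frrr" + "r"))   # 270 degrees, leading-diagonal flip, 90 degrees
```

Working the same two words through by hand, one rule at a time, is a good way to convince yourself that the bookkeeping here is doing nothing more than the rules allow.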
This is perhaps best illustrated with an example, so let's consider our complex sequence of flips and rotations given by frrfrfrrr. We have

frrf(rf)rrr = frrf(frrr)rrr
            = frr(ff)(rrrr)rr
            = frr e e rr
            = f(rrrr)
            = f e
            = f

So the end result is identical to a simple flip about the vertical axis. Similarly, our other question, what happens if we rotate by 270 degrees, then flip about the leading diagonal and rotate by a further 90 degrees, can be resolved easily by expressing those complex rearrangements in their decomposed form and simplifying according to the rules:

(rrr)(frrr)(r) = rrrf(rrrr)
               = rrrf
               = rr(rf)
               = rr(frrr)
               = r(rf)rrr
               = r(frrr)rrr
               = (rf)(rrrr)rr
               = (frrr) e rr
               = f(rrrr)r
               = f e r
               = fr

which is a flip about the trailing diagonal.

What we have here is an algebra for the symmetries of a square. In this algebra letters symbolise not numbers, but rearrangements of the corners of a square, and as a result the rules of this algebra are quite different. Were we to perform a similar analysis for the rearrangements of marbles in our earlier example, we would find 11 rearrangements (plus the null rearrangement that does nothing), with three base rearrangements, and a different set of rules again. I leave the determination of these rules as an exercise for the interested reader.

Indeed each distinct pattern (that is, each distinct set of internal relationships between some set of sub-objects) will have its own set of rules, and its own associated algebra. Our world is filled with patterns, and each and every such pattern has its own algebra describing how objects within the
pattern can be rearranged while preserving that pattern. A whole new world begins to open up before us: what are all the possible algebras⁴, and what patterns do they describe? Are there different sets of rules that produce the same algebras, and if so, how can we tell? Those questions, and a fuller exploration of this rich world which we have only just glimpsed here, will have to wait however. Next we will return to the continuum, and continue to try and unravel the many paradoxes that surround it.
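As a parting check on the marble example from earlier in this section, the count of 11 rearrangements (plus the null one) can be confirmed by brute force. The sketch below assumes a labelling that is only partly given in the text: marbles 2, 4 and 5 are the green ones, as the swaps described earlier imply, while taking 1 and 3 as blue and 6 as red is a guess of mine at what the missing figure shows. The helper names are also my own.

```python
from itertools import permutations

# Assumed colouring: green marbles are 2, 4, 5 (from the text);
# blue = 1, 3 and red = 6 are hypothetical labels for illustration.
COLOURS = {1: "blue", 2: "green", 3: "blue", 4: "green", 5: "green", 6: "red"}
LABELS = sorted(COLOURS)

def preserves_pattern(perm):
    """perm sends each marble to the marble whose place it takes; colours must match."""
    return all(COLOURS[m] == COLOURS[perm[m]] for m in LABELS)

def swap(a, b):
    """The rearrangement that exchanges marbles a and b and leaves the rest alone."""
    perm = {m: m for m in LABELS}
    perm[a], perm[b] = b, a
    return perm

def then(first, second):
    """Perform `first`, and then `second`."""
    return {m: second[first[m]] for m in LABELS}

preserving = [p for p in (dict(zip(LABELS, image)) for image in permutations(LABELS))
              if preserves_pattern(p)]
print(len(preserving) - 1, "rearrangements besides the null one")

# Swapping 2 and 5, then 2 and 4, cycles the three green marbles.
print(then(swap(2, 5), swap(2, 4)))
```

The count depends only on how many marbles share each colour, so it would come out the same for the triangular and the rectangular layout.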
[4] Note that I am using "algebra" here in an informal sense; there is a strict mathematical sense which is quite different.
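The symbol pushing used for the square can be mechanised. The following is only a sketch, assuming the three rules relied on in the simplifications above (two flips cancel, four rotations cancel, and rf can be rewritten as frrr). The function name normal_form and the letter e for the null rearrangement are labels chosen for this illustration, not notation from the text.

```python
def normal_form(word: str) -> str:
    """Reduce a word in f and r to the shape f^a r^b.

    Assumed rules: ff = e, rrrr = e, rf = frrr (e is the null rearrangement).
    """
    flips = 0      # number of leading f's, kept modulo 2 because ff = e
    rotations = 0  # number of trailing r's, kept modulo 4 because rrrr = e
    for letter in word:
        if letter == "r":
            rotations = (rotations + 1) % 4
        elif letter == "f":
            # Moving f leftwards past one r turns that r into rrr (rf = frrr),
            # so moving it past b rotations leaves 3b rotations behind it.
            flips = (flips + 1) % 2
            rotations = (3 * rotations) % 4
        else:
            raise ValueError(f"unexpected letter: {letter!r}")
    return "f" * flips + "r" * rotations or "e"


first_question = normal_form("frrfrfrrr")   # the long sequence of flips and rotations
second_question = normal_form("rrrfrrrr")   # (rrr)(frrr)(r)
```

Every word collapses to one of eight shapes (e, r, rr, rrr, f, fr, frr, frrr), which is one way to see that the square has exactly eight symmetries under these rules.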
2.3
Paradoxes of the Continuum, Part II
Mathematical arguments can be very persuasive. They lead inexorably toward their conclusion; barring any mistakes in the argument, to argue is to argue with the foundations of logic itself. That is why it is particularly disconcerting when a mathematical argument leads you down an unexpected path and leaves you face to face with a bewildering conclusion. Naturally you run back and retrace the path, looking, often in vain, for the wrong turn where things went off track. People often don't deal well with challenges to their world-view. When a winding mountain path leads around a corner to present a view of a new and strange landscape, you realise that the world may be much larger, and much stranger, than you had ever imagined. When faced with such a realisation, some people flee in horror and pretend that such a place doesn't exist; the true challenge is to accept it, and try to understand the vast new world. It is time for us to round a corner and glimpse new and strange landscapes; I invite you to follow me down, in the coming entries, and explore the strange hidden valley. We begin with a mere glimpse of what is to come along this road. Still, even this glimpse has been enough to frighten some. Indeed the (potentially apocryphal) tale of the first man to tread this road, a member of the Pythagorean Brotherhood, makes this very clear. The story goes that the insight came to him while travelling by ship on the Aegean. Excited, he explained his cunning proof to the fellow members of the Brotherhood aboard the boat. They were so horrified by the implications that they immediately pitched him overboard, and he drowned. For the secretive Pythagorean Brotherhood, who believed that reality was simply numbers, mathematics was worth killing over. So what was this truth that the Brotherhood was willing to kill to keep secret? The fact that the square root of 2 is not expressible as a fraction. The proof of this is surprisingly simple, and runs roughly as follows.
Let's presume that $\sqrt{2}$ can be expressed as a fraction, and so we have numbers n and m such that $\sqrt{2} = n/m$. As you may recall from A Fraction of Algebra, a particular fraction is really just a chosen representative of an infinite number of ways of expressing the same idea; we can choose whichever representative we wish. For the purposes of the proof we will assume that we have chosen n/m to be as simple as possible (i.e. there is no common factor that divides both n and m); you may want to verify for yourself that such a thing is always possible (it's not too hard). Now, using the allowable manipulations of algebra we have:
$$\sqrt{2} = \frac{n}{m} \implies 2 = \frac{n^2}{m^2} \implies 2m^2 = n^2$$

Now $2m^2$ is an even number no matter what number m is, so $n^2$ must be an even number as well. However, an odd number squared is always odd (this is worth verifying yourself if you're uncertain; again, it isn't hard). That means the only way $n^2$ can be even is if n itself is even. That means there must be some number x such that $2x = n$. But then

$$2m^2 = n^2 \implies 2m^2 = (2x)^2 \implies 2m^2 = 4x^2 \implies m^2 = 2x^2$$

and so $m^2$, and hence m, must also be even. If both n and m are even then they have a common factor: 2; yet we specifically chose n and m so that wouldn't be the case. Clearly, then, no such n and m exist, and we simply can't express $\sqrt{2}$ as a fraction![5]

[5] This is, of course, a précis version of the proof. The devil is always in the details, and many details here have been glossed over as obvious, or left for the reader to verify. If you are interested in the nitty-gritty however, I recommend you try the Metamath proof of the irrationality of $\sqrt{2}$ (http://au.metamath.org/mpegif/sqr2irr.html). Each and every step in the proof is referenced and linked to an earlier theorem previously proved. By following the links you can drill all the way down to fundamental axioms of logic and set theory. If you don't care to follow the details yourself, you might note that, in this (extremely) explicit form, the proof can be machine verified.

This result (if not necessarily the proof) is well known these days; sufficiently so that many people take it for granted. It is therefore worth probing a little deeper to see what it actually means, and perhaps gain a better understanding of why it so incensed the Pythagorean Brotherhood. The first point to note is that $\sqrt{2}$ does crop up in geometry: if you draw a square with sides of unit length (and we can always choose our units such that this is so) then, by Pythagoras' Theorem, the diagonal of the square has length $\sqrt{2}$. That, by itself, is not necessarily troubling; but consider that we've just seen that $\sqrt{2}$ is not expressible as a fraction. Recall that a fraction can be considered a re-interpretation of the basic unit, and you see that what we're really saying is that there simply doesn't exist a unit of length such that the diagonal of the square can be measured with respect to it. If you were measuring a length in feet and found that it was between 2 and 3 feet then you could simply change your units and work in inches; the distance is hopefully an integer number of inches. If inches aren't accurate enough we can just use a smaller unit again (eighths of an inch for example). What we are saying when we say that $\sqrt{2}$ cannot be expressed as a fraction is that, no matter how small a unit we choose, we can still never accurately measure the diagonal of the square. Because we can simply keep dividing indefinitely to get smaller and smaller units, that means we need infinitely small units. And note the difference here: unlike in Part I, arbitrarily small is not good enough; we need to go past arbitrarily small to actually infinitely small. For the Pythagoreans infinity was unreachable, something that could never be completed or achieved, and thus an infinitely small unit could never be realised. Therefore, in their world-view, the diagonal of a square couldn't exist since its length was an unreachable, unattainable, distance[6]. That, as you can imagine, caused quite a bit of cognitive dissonance! Hence their desire to pretend such a thing never happened. As you can see, it turns out (even though it may not have looked that way at first) that we are really butting our heads up against infinity again, just from a different direction this time. Things get worse however: if we had a line of length 2 then there surely exists a point somewhere along that line that is a distance of $\sqrt{2}$ away from the origin. We have just seen, however, that such a distance is not one we can deal with in terms of fractions. If we were to put points at every possible fractional distance between 0 and 2 we would have a hole at $\sqrt{2}$, and continuous lines don't have holes in them. A new problem starts to raise its head. If we wish to have a continuum we have to fill in all the holes.
The question is how we can do that: where exactly are the holes? And, for that matter, how many holes are there? The first of these questions turns out to be rather easier than the second (which we will address next time we venture down this fork of the road). The trick to finding holes is to note that, since fractions allow us the arbitrarily (if not infinitely) small, we can get arbitrarily close to any point in the continuum, holes included. That is, while we can't actually express a hole in terms of fractions, we can sidle up as close beside it as we like using only fractions. And that means we must reach again for the useful tools of distance and convergence to determine that we are getting closer and closer to, and hence converging to, a hole.
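Both halves of this picture can be spot-checked with exact arithmetic. The sketch below is an illustration rather than a proof: it only examines denominators up to an arbitrary bound. The names closest_fraction_below, BOUND, exact_hits and gaps are invented for the example.

```python
from fractions import Fraction
from math import isqrt


def closest_fraction_below(m: int) -> Fraction:
    """Largest fraction with denominator m whose square does not exceed 2."""
    n = isqrt(2 * m * m)  # largest whole number n with n*n <= 2*m*m
    return Fraction(n, m)


# No denominator up to the bound yields a fraction whose square is exactly 2 ...
BOUND = 1000
exact_hits = [m for m in range(1, BOUND + 1)
              if closest_fraction_below(m) ** 2 == 2]

# ... yet the shortfall 2 - (n/m)^2 shrinks as the unit 1/m gets finer.
gaps = [2 - closest_fraction_below(10 ** k) ** 2 for k in range(1, 6)]
```

The list exact_hits stays empty however large the bound is made, while the entries of gaps keep shrinking without ever reaching zero: the fractions sidle up to the hole but never land on it.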
[6] If you think you can get out of this by just starting with the diagonal as your unit of measure you will simply find that now the sides of the square are unmeasurable distances. The sides and diagonal of the square are incommensurable: we can't measure both with the same units, no matter how fine a unit of measure we choose.
For our current purposes the definition of distance between numbers defined in Part I will be sufficient. What we want to do is figure out a way to ensure that a sequence of fractions converges, that is, that it gets closer and closer to something, without necessarily knowing what the something is. The trick to this is to require that the distance between different terms in the sequence gets smaller and smaller. In this way we can slowly but surely squeeze tighter and tighter about a limit point, without necessarily knowing what it is that we are netting. More formally, if we have an infinite sequence $S_1, S_2, S_3, \ldots$ then we require that for any $\varepsilon > 0$ there exists an integer $N \geq 1$ such that, for all $m, n \geq N$, $|S_n - S_m| < \varepsilon$ (recalling that $|x - y|$ gives the distance between numbers x and y). Such a sequence is called a Cauchy sequence. Now, since any Cauchy sequence converges to something, we can identify (consider equivalent) the sequence and the point at its limit. Furthermore, since we know that using fractions we can get arbitrarily close to any point on the continuum, there must be some sequence of fractions that converges to that point, and so if we consider all the possible infinite Cauchy sequences of fractions, we can cover all the points on the continuum; we are assured that no holes or gaps can slip in this time. We've caught all the holes without even having to find them! It is worth looking at an example: can we find a sequence of fractions that converges to $\sqrt{2}$? Consider the decimal expansion of $\sqrt{2}$ which starts out 1.41421... and continues on without any discernible pattern; clearly the sequence 1, 1.4, 1.41, 1.414, 1.4142, 1.41421, . . . (where the $n$th term agrees with $\sqrt{2}$ for the first $n - 1$ decimal places) converges to $\sqrt{2}$. More importantly each term can be rewritten as a fraction since each term has only finitely many non-zero decimal places; for example 1.4 = 14/10 and 1.4142 = 14142/10000 etc. Finally it is not hard to see that this sequence is a Cauchy sequence.
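The decimal truncations can be built as exact fractions and tested against the Cauchy condition. This is a sketch under obvious limitations: a program can only inspect finitely many terms, so it illustrates the condition rather than establishing it. The names sqrt2_truncation, terms and within_epsilon are made up for the example.

```python
from fractions import Fraction
from math import isqrt


def sqrt2_truncation(n: int) -> Fraction:
    """n-th term S_n: sqrt(2) cut off after n - 1 decimal places, as a fraction."""
    scale = 10 ** (n - 1)
    # isqrt(2 * scale**2) is the whole-number part of sqrt(2) * scale
    return Fraction(isqrt(2 * scale * scale), scale)


terms = [sqrt2_truncation(n) for n in range(1, 13)]  # S_1 up to S_12


def within_epsilon(big_n: int, epsilon: Fraction) -> bool:
    """Check |S_n - S_m| < epsilon for all computed terms with m, n >= big_n."""
    tail = terms[big_n - 1:]
    return all(abs(s - t) < epsilon for s in tail for t in tail)
```

For this particular sequence, once two terms agree in their first N - 1 decimal places they can differ by less than one unit in that place, so choosing N large enough meets any tolerance; for instance within_epsilon(4, Fraction(1, 1000)) holds for the terms computed here.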
We can do the same trick for any other decimal expansion, arriving at a Cauchy sequence that converges to the point in question. Of course there are many other Cauchy sequences of fractions that will converge to these values: we are dealing with something similar to our dilemma with fractions when we found that there were an infinite number of different pairs of natural numbers that all described the same fraction. In that case we simply selected a particular representative pair that was convenient (and could change between different pairs that represented the same fraction if it was later convenient to think of the fraction that way). We can do the same here: noting that a point is described by an infinite number of Cauchy sequences, we can simply select a convenient representative sequence to describe the point. For our purposes
the sequence constructed via the decimal expansion will do nicely; in some sense you can think of the Cauchy sequence as an infinite decimal expansion. Now that we at least have some idea of what these sequences might look like, it is time to take a step back and consider what is actually going on here. Back in The Slow Road we constructed natural numbers as a property of collections of objects. Then, in A Fraction of Algebra, we created fractions to allow us to re-interpret an object within a collection. This was another layer of abstraction: fractions were not really numbers in the same way that natural numbers were; fractions were a way of re-interpreting collections, and we could describe those re-interpretations by pairs of natural numbers. Perhaps rather providentially it turned out that the rules of algebra, the rules of arithmetic that were true no matter what natural numbers we chose, also happened to be true no matter what fractions we chose. It is this stroke of good fortune, combined with the fact that certain fractions can take the role of the natural numbers, that allows us to treat what are really quite different things in principle (fractions and natural numbers) as the same thing in practice: for practical purposes we usually simply consider natural numbers and fractions as numbers and don't notice that, at heart, they are fundamentally different concepts. Now we are about to add a new layer of abstraction, built atop fractions, to allow us to describe points in a continuum. While all that was required to describe the re-interpretation of objects that constituted a fraction was a pair of numbers, points in a continuum can only[7] be described by an infinite Cauchy sequence of fractions. Thus, in the same way that natural numbers and fractions are actually very different objects, so fractions and points in a continuum are quite different.
Again, however, we find that when we define arithmetic on sequences (which occurs in the obvious natural way) they all behave appropriately under our algebraic rules. When we consider that it is easy enough to find sequences that behave as fractions (any constant sequence for instance) it is clear that, again, for practical purposes, we can call these things numbers and assume we're talking about the same thing regardless of whether we are actually dealing with natural numbers, fractions, or points in a continuum. It should be pointed out that sometimes these distinctions are actually important. A simple example is computer programming, which does bother to distinguish floating point numbers (ultimately fractions) from integers. You can usually convert or cast from one to the other via a function (and at times that function can be implicit), but the distinction is important. Later we will start getting into mathematics where the distinction becomes important.

[7] Technically other methods of describing such points exist. Indeed a very common formal approach is Dedekind cuts. Ultimately, however, Dedekind cuts represent more detail than we need right now, and will serve more as a distraction than anything. The interested reader is, however, encouraged to investigate them, and puzzle out why I chose to go with Cauchy sequences here.

So, now that we have this construction, several layers of abstraction deep, that allows us to describe the continuum, does it resolve the problem the Pythagorean Brotherhood struggled with? Certainly within the continuum there is a point corresponding to $\sqrt{2}$, but even with our construction it is the limit of an infinite sequence; we still require a completed infinity. Of course accepting the idea of a completed infinite would get us out of this conundrum; what we require is a coherent theory of the completed infinite. Were we to have that, then we needn't fear the idea as the Pythagorean Brotherhood did. The next time we venture along this particular road we will discuss just such a theory, and explore the remarkable transfinite landscape that it leads to. We would be remiss to conclude here, however, without noting that there is some dissent on this topic. While the theory of the continuum based on completed infinites we will cover is remarkably widely accepted and used, there are still those who do not wish to have to deal with the completed infinite. So what is the alternative? The idea is to construct a continuum using infinitesimals: a number $\varepsilon$ such that we have $\varepsilon^2 = 0$, yet $\varepsilon \neq 0$. Using such a value we can create a continuum without holes as desired. If adding a seemingly arbitrary new element to the number system seems like cheating, remember that both fractions, and the infinite decimals via Cauchy sequences, are just as much artificial additions to the number sequence; they just happen to be ones we're familiar with and now take for granted. The real dilemma is that, assuming the required properties of infinitesimals, we can deduce contradictions. As we noted at the start of this post, when a mathematical argument leads you somewhere you don't wish to go you are left having to challenge the very foundations of logic itself.
Surprisingly, that turns out to be the resolution: smooth infinitesimal analysis rejects the law of the excluded middle. The logic used for this alternative conception of the continuum rejects the idea that, given a proposition, either the proposition is true, or its negation is true. That means that saying that $x = y$ is not true does not mean that $x \neq y$. This sounds like nonsense at first, because we generally take the law of excluded middle for granted, and it is ingrained in our thinking. We have to remember, however, that this is a theory dealing in potential, but not completed, infinites, and it is that key word "potential" that helps clarify things. Consider two numbers x and y that have infinite decimal expansions; are they equal, or are they unequal? We can check the first decimal place, and they might agree; that does not mean they are equal, since they might disagree further down; nor does it mean they are unequal, since they might indeed agree. We can check the first
billion decimal places, and they might still agree; that does not mean they are equal, since it might be the billion and first decimal place at which they disagree; and yet we still can't conclude they are unequal: they've agreed so far and could continue to do so. We can even check the first $10^{20}$ decimal places, and still we can't conclude either way whether x and y are equal or unequal. Because we can never complete the infinity and check all the decimal places, unless we have more information (such as that both numbers are integers[8]), it is not possible to conclude either way; we have an in-between state where the numbers are neither equal, nor unequal, and it is this in-between possibility that causes the law of excluded middle to fall apart. To say that $x = y$ is not true simply means we have not yet concluded that $x = y$, but that does not require that we must have concluded $x \neq y$, since we may still be torn in between, unable to reach a conclusion. As strange as this sounds at first, it actually provides a surprisingly natural and intuitive model of the continuum, and a remarkably different one from the classical one we will be developing. Enough sidetracks, however; it is time to return to the path. We have rounded the bend, and can make out the rough expanse of the landscape below, but the land itself remains unexplored, and potentially quite alien. The next time we return to this road we'll try and understand the implications of a continuum of completed infinites, including a variety of initially unsettling results. In the meantime, however, we will return to the study of patterns and symmetry, and try and build a robust theory from our simple examples.
[8] Why does knowing that both numbers are integers help? Because integers have fixed decimal expansions: the first decimal place is necessarily the same as all the rest, namely 0. As long as they agree at the first decimal place, we are done. An exercise for the interested reader: what other knowledge about the numbers might allow us to conclude equality or inequality?
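The digit-by-digit comparison of two infinite decimal expansions described in this section can be imitated with lazy digit streams. This is a rough sketch and not part of the original argument: digits_of_fraction and compare_prefix are invented names, and the streams come from ordinary fractions purely so that the example can be run. A finite check can only ever report a mismatch or remain undecided; it can never report equality. (A mismatch is reported as "digits differ" rather than "unequal", since expansions such as 0.4999... and 0.5000... name the same point.)

```python
from itertools import islice
from typing import Iterator


def digits_of_fraction(numerator: int, denominator: int) -> Iterator[int]:
    """Yield the decimal places of numerator/denominator, one at a time, forever."""
    remainder = numerator % denominator
    while True:
        remainder *= 10
        yield remainder // denominator
        remainder %= denominator


def compare_prefix(x: Iterator[int], y: Iterator[int], places: int) -> str:
    """Compare only the first `places` decimal places of two expansions."""
    for a, b in islice(zip(x, y), places):
        if a != b:
            return "digits differ"
    return "undecided"


def third() -> Iterator[int]:
    """Decimal places of 1/3."""
    return digits_of_fraction(1, 3)


def nearly_third() -> Iterator[int]:
    """Decimal places of 1/3 + 10^-12, which first departs from 1/3 at place 12."""
    return digits_of_fraction(10 ** 12 + 3, 3 * 10 ** 12)
```

Checking eleven places of third() against nearly_third() leaves the matter undecided and the twelfth place settles it, whereas comparing third() with itself stays undecided no matter how many places are examined.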
2.4
Permutations and Applications
Numbers are remarkably tricky. We tend not to notice because we live in a world that is immersed in a sea of numbers. We see and deal with numbers all the time, to the point where most basic manipulations seem simple and obvious. It was not always this way of course. In times past anything much beyond counting on fingers was the domain of the educated few. If I ask you what half of 60 is, you'll tell me 30 straight away; if I ask you to stop and think about how you know that to be true you'll have to think a little more, and start to realise that there is a significant amount of learning there; learning that you now take for granted. Almost everyone uses numbers regularly every day in our current society, be it through money, weights and measures, times of day, or in the course of their work. Through this constant exposure and use we've come to instinctively manipulate numbers without having to even think about it anymore (in much the same way that you no longer have to sound out words letter by letter to read). That means that when we meet a new abstraction, like the symmetries discussed in Shifting Patterns, it seems comparatively complex and unnatural. In reality the algebra of symmetries is in many ways just as natural as the algebra of numbers; we just lack experience. Thus, the only way forward is to look at more examples, and see how they might apply to the world around us. In terms of examples I would like to take a step into the more abstract: rather than dealing with a physical example and determining an abstraction from it, we'll start with a slightly abstract example and explore from there. The example I have in mind is that of permutations. By a permutation I simply mean a rearrangement of unspecified objects, mapping one position to another. We can view a permutation as a kind of wiring diagram, such as the one depicted in figure 2.11,
meaning that we shift whatever is in position 1 to position 3, whatever is in position 2 to position 1, and whatever is in position 3 to position 2. Hopefully you can see how such rearrangements are essentially what we were doing in Shifting Patterns, but here we aren't starting out with a specific pattern in mind, but considering all such rearrangements in general. As before we can combine two rearrangements together to get another. In this case we simply connect one wiring diagram to the next and follow the paths from the top right the way to the bottom. We can then simplify that down to a direct wiring diagram as before. An example of this process is shown in figure 2.12. The first thing to note is that the number of objects we are permuting, or wiring together, matters. If we take, as our first and simplest case, permutations of two items, then we find there are only two permutations: the
Figure 2.11: A permutation of three objects
Figure 2.12: Combining two permutations to obtain a new permutation
null permutation where we do nothing (the first item is connected to the first item, and the second item is connected to the second item), and a simple swap where we reverse the items (connect the first item to the second, and the second item to the first). Using the algebraic terms we established previously we end up with a two element algebra: let s be the permutation where we swap, write e for the null permutation, and then we have the rule $ss = e$ (swapping the two items, then swapping them again, is the same as doing nothing), which completely describes our algebra[9].
[9] Through this section I will continue to use "algebra" in a generic sense to describe the symbolic calculus that we can associate to a pattern. This is a reminder that, in mathematics, "algebra" also has a more precise definition that does not apply to the objects under consideration here. I do not mean algebra in that more specific sense when using the word here.

On the other hand, as soon as we consider permutations of three objects we find things get more complicated. There are a total of 6 (that's $3 \times 2 \times 1$) permutations of three objects. If we select two basic permutations appropriately we can generate them all as various combinations of the two. There are, in fact, several different pairs we could select (though the resulting algebra will turn out to be the same, no matter how we do it; the names might change, but the underlying rules will be the same), and I've opted for the two depicted in figure 2.13, where a swaps the first two elements and leaves the third alone, and b swaps the second two elements, and leaves the first alone.

Figure 2.13: Two permutations, labelled a and b

Now as with the permutations of two elements, if we swap a pair, and then swap them again, we end up back where we started, so (writing e for the null permutation) we can see that we have the following two rules: $aa = e$ and $bb = e$. Now, however, we have the possibility of combining together a and b. We already saw that ab results in the first item going to the third place, the second item to the first position, and the third item to the second position (as shown in figure 2.12); but if we swap things around to find ba we get the rather different situation shown in figure 2.14, which reverses the situation, with the third item moving to the first place, while the first and second items get bumped to second and third place respectively. So at the very least we know that we don't have commutativity; that is, $ab \neq ba$. Instead we find the rule that defines this algebra is the peculiar case depicted in figure 2.15: $aba = bab$.
Figure 2.14: The result of combining b and then a
Figure 2.15: Showing how aba is the same as bab
We can see that this gives us all the permutations by counting up the combinations of a and b that haven't been ruled out as being reducible to something simpler. We have
1. The null permutation: e
2. a
3. b
4. ab
5. ba
6. aba
and anything with four or more a's and b's will be reducible. Why is that? Since $aa = e$ and $bb = e$ (with e the null permutation), any sequence will have to alternate a's and b's, otherwise we can just cancel down consecutive pairs. On the other hand, if we have a sequence of more than three alternating a's and b's then we'll have a sequence aba or bab that we can convert using the fact that $aba = bab$, and end up with a pair of consecutive a's or b's that we can then cancel down. For example, if we tried to have a sequence of four a's and b's like abab, then we can say

$$abab = (aba)b = (bab)b = ba(bb) = ba\,e = ba$$

With a little thought you can see that this sort of procedure can reduce any sequence of four or more a's and b's down to one of three or less. So for permutations of three objects we get an algebra that is described by three rules:

$$aa = e, \qquad bb = e, \qquad aba = bab$$

If we were to consider permutations of four items we would have 24 permutations (that's $4 \times 3 \times 2 \times 1$) to deal with, and things would be more complicated yet again. Permutations of five items provide a total of 120 ($5 \times 4 \times 3 \times 2 \times 1$) permutations, and an even more complicated algebra with yet more subtle and interesting properties. There are two things that you should take notice of here. The first is that even simple changes to a pattern, as simple as changing the number of items involved, can give rise to very different dynamics. The character of the algebra that arises from permutations of two objects is very different from that of the three object permutation algebra, and four objects is different again. To reiterate the point: different patterns can have surprisingly different and remarkable dynamics. The second thing that you should be noticing is that while we can work with permutations as wiring diagrams and connect them up to see what combinations will result, ultimately everything about the dynamics of the permutations is contained within the algebra we get from it; and the algebra can be described and manipulated using very simple rules.
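The rules for permutations of three objects can be checked by wiring the diagrams together in code. What follows is a sketch under one assumed encoding: a permutation is stored as a tuple whose entry at index i is the position (counted from 0) to which the item in position i is sent. The names E, A, B, then and from_word are invented for the illustration.

```python
from itertools import permutations
from math import factorial

E = (0, 1, 2)  # null permutation: every item stays where it is
A = (1, 0, 2)  # a: swap the first two items, leave the third alone
B = (0, 2, 1)  # b: swap the second two items, leave the first alone


def then(p, q):
    """Wire p into q: the item starting at position i ends up at q[p[i]]."""
    return tuple(q[p[i]] for i in range(len(p)))


def from_word(word):
    """Combine a string such as 'abab' letter by letter, reading left to right."""
    result = E
    for letter in word:
        result = then(result, {"a": A, "b": B}[letter])
    return result


# The six words that cannot be reduced further give six different permutations.
reduced_words = ["", "a", "b", "ab", "ba", "aba"]
generated = {from_word(w) for w in reduced_words}
```

With this encoding from_word("ab") comes out as (2, 0, 1): first item to third place, second to first, third to second, matching the description of ab in the text. The same functions confirm that aba and bab agree, that abab collapses to ba, and that the six reduced words account for all 3 × 2 × 1 permutations.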
While the pattern provides the algebra, the algebra in turn tells us everything we need to know about the dynamics of the pattern. The advantage of
the algebra is that we can reduce the whole problem of patterns to the simple task of manipulating algebraic expressions according to particular rules. By abstracting up to the algebra, we've made the problem much easier to think about and manipulate. Hopefully by this stage you're developing a feel for how this abstraction process works. With numbers we start with a collection and abstract away all the details save a single property: the quantity. Here we have something a little more complex; we start with a pattern and abstract away as much of the detail as possible, while still retaining some information about the nature of the pattern. That information can be efficiently encoded into a sort of algebra, in the same way that we encode information about quantity into symbols (numbers). The exact nature and rules of the algebra we generate is the information about the pattern that we have kept. Now, numbers allow us to reason about quantity in general via arithmetic, which we can reduce to a game of manipulating symbols. Our abstraction of pattern allows us to reason about patterns via their associated algebra, which we can also reduce to a game of manipulating symbols. We have turned thinking about patterns into a kind of arithmetic; and doing so allows us to be systematic in studying and analysing such patterns. This, of course, raises the question of why we should be interested in studying and analysing patterns at all. The same question can be asked as to why we should be interested in studying and analysing quantity. The difference is that our culture is steeped in analysis of and use of quantity; we take its usefulness for granted. So let's step back, and ask why using numbers is useful. As was pointed out in The Slow Road, numbers and quantity are useful because they are everywhere: we can apply quantitative analysis to almost everything (and often do, sometimes even where it isn't appropriate).
It is worth pointing out that patterns and symmetry are every bit as prevalent in the world. All around us things can be described in terms of their patterns. Pick any collection of objects you care to set your eyes upon, and they will form some manner of pattern; perhaps they will only have a trivial symmetry, or perhaps they will have more complex symmetries. The point is that, just like numbers, symmetries are all around us. The study of pattern and symmetry in the manner we've been describing is very new however, and this means it hasn't entered the mainstream consciousness, nor the language, in the same way that numbers have. We don't describe the world around us in the language of mathematical symmetry, at least not in the same way that we describe the world around us in terms of numbers. Slowly that will change, but it will take a very long time indeed (centuries probably). That means that, in the meantime, the areas to which the language of mathematical symmetry will be applied, and the people who will apply it, will be restricted
to those already using advanced mathematical methods. Right now that tends to mean fields such as physics and chemistry. To give examples of applying our abstraction of pattern to physics and/or chemistry runs the risk of delving into the technical details of those subjects, as well as requiring math that is currently beyond the scope of our discussion. For that reason, you'll have to forgive me if I gloss over things quite liberally in what follows. Everything has a pattern, and symmetries associated with that pattern, even if it is just the trivial symmetry. In the case of chemistry the obvious thing to start looking at is molecules. Unsurprisingly, the structure of a molecule has a pattern that depends, to a large extent, on its constituent elements. More interestingly, molecules often have interesting symmetries. We can, using a naive view, picture a molecule as a pattern of coloured balls, not dissimilar to our patterns of coloured marbles discussed in Shifting Patterns. Of course the patterns are now in 3 dimensions rather than 2, and connections between the balls/marbles are important, but the fundamental idea is there. Consider, for instance, the picture of the ammonium molecule ($\mathrm{NH_4^+}$) shown in figure 2.16. We can, with little trouble, consider the various
Figure 2.16: An ammonium molecule; nitrogen is coloured blue, while hydrogen atoms are depicted as white.
ways in which we can rearrange the 4 indistinguishable hydrogen atoms and yet keep the underlying structure that makes it an ammonium molecule. That is, using our abstraction of pattern, we can describe an algebra that captures the features that make the pattern of four hydrogen atoms and one nitrogen atom a molecule of ammonium. Furthermore, we can do the same
for any other molecule we care to consider. The exact algebra that results will differ from molecule to molecule, with the individual idiosyncrasies of the different algebras describing the individual idiosyncrasies of the different molecules. To understand the particular nature of the associated algebra is to understand a great deal about the particular nature of the molecule. It is, of course, possible to do this sort of analysis just by staring at the patterns and never resorting to the sort of abstraction we've been discussing. This approach runs the risk of being both haphazard and superficial. By contrast, working in terms of pattern abstraction algebras affords us the ability to be both comprehensive and systematic in our analysis. Rather than trying to divine properties out of thin air via visual inspection, we take the resulting algebra and, by merely pushing symbols around on a piece of paper, pick apart every last nuance of its behaviour. Indeed, this sort of analysis (which extends to a level of detail in characterising the algebra that we won't touch on for some time) is now fundamental to understanding much in chemistry, from spectroscopy to crystallography. Similar approaches to patterns and symmetry of particles lead to a variety of important results in quantum physics. Our world is filled with patterns that are worth analysing with a systematic approach: understanding the peculiarities of the algebras associated to those patterns can tell us a great deal about our world. New applications of this theory to new fields are still regularly occurring. There is a quiet revolution underway that is changing how we see and describe the world, and the abstraction of pattern is at its heart.
2.5
A Transfinite Landscape
Problems that involve infinity have a tendency to read a little like Zen koans. Take, for example, this problem: Suppose we have three bins (labelled bin A, bin B and bin C) and an infinite number of tennis balls. We start by numbering the tennis balls 1, 2, 3, ... and so on, and put them all in bin C. Then we take the two lowest numbered balls in bin C (that's ball 1 and ball 2 to start) and put them in bin A, and then move the lowest numbered ball in bin A from bin A to bin B (that would be ball 1 in the first round). We repeat this process, moving two balls from bin C to bin A, and one ball from bin A to bin B, an infinite number of times. The question is, how many balls are in bin A and how many balls are in bin B when we're done? Think carefully! The difficulty is in being sure of your answer¹⁰. We require a way to think consistently and coherently about such matters.

So what is the answer? Bin A has no tennis balls in it, while bin B has an infinite number! Does that sound wrong? It certainly seems confusing: we are consistently putting two balls into bin A and only taking one out at each step, so how can we end up with no balls in bin A? The key is to think in terms of a finished state, when the infinite process is somehow complete. Every ball is eventually moved to bin B (ball number k is moved in round k), thus after an infinite number of steps all the balls must have been moved to bin B. The counterintuitive aspect is that we don't expect moving one ball at a time to ever catch up with moving two balls at a time, yet oddly this happens.

Another tale that highlights this point is that of the hotel with an infinite number of rooms¹¹. The story usually begins with the hotel finding itself full one evening. A lone traveller then arrives, very weary, and asks the hotel manager if there is any chance at all that he can get a room. The hotel manager ponders this for a moment, and then has an idea. He asks each guest to move to the room numbered one higher than their current room.
Since every number has a number one greater, and there are an infinite number of rooms, everyone is housed; and yet room number 1 is now empty, and the traveller has somewhere to stay.

It doesn't end there though. After the lone traveller, an infinite tour bus arrives, carrying an infinite number of passengers all looking for rooms. After having solved the first problem, however, the hotel manager isn't fazed. He asks each guest to move into the room with number twice that of their current room. Again, each number has a number twice as large, and there are an infinite number of rooms, so again everyone is housed; this time, however, everyone is housed in even numbered rooms, which leaves infinitely many odd numbered rooms in which the bus-load of tourists can be put up for the night.

¹⁰ If you are sure of your answer then either you already know a decent amount of transfinite theory, or you're most likely wrong. Don't be disappointed about being wrong though; the very reason we need clear logical guidelines for reasoning about the infinite is precisely because our intuitions are woefully inadequate and misleading.

¹¹ The story is usually attributed to David Hilbert, but this is my own spin on it; errors and lack of clarity that may have crept in via the retelling are, of course, mine.

This brings us a little closer to the sticky point where our intuitions start to go astray. For any finite number n we expect there to be (roughly; it depends on whether n is odd or even) half as many even numbers less than n as there are natural numbers less than n; when we have infinitely many numbers, however, there seem to be exactly as many even numbers as natural numbers. It's this sort of unexpected equality with a set we initially intuitively think should be half as big that allows the tennis ball problem to fool us. In a sense, looking back from a completed infinity, 1 and 2 look pretty much the same.

What it really comes down to, however, is the very simple question of what we mean by "how many". As we've often seen before, the devil is usually in the details, and even simple things that we think we know and understand bear some thinking about if we want to be sure we actually know what we mean. What happens when we count things? Because counting is a fairly innate skill for most adults it is helpful to consider what children, for whom counting is still somewhat new, do. Usually they count on their fingers (or other similar things), making a correlation between objects counted and fingers held up. At the more advanced adult level we do much the same thing, but we correlate with abstract objects (numbers, which by that time we've solidly beaten into instinctual memory).
The point I'm trying to get at here is that counting is a matter of correlation; more importantly it is a very particular kind of correlation: in mathematics it is known as a one-to-one correspondence. This means that each object corresponds with exactly one other object, and vice versa; in practice each object corresponds to exactly one number in our count, and each number in the count corresponds to exactly one object. If you can accept that, at the heart of it, it is that one-to-one correspondence that matters in counting, that it is the correspondence that ultimately determines what we mean by quantity, then we can pull out the mathematician's handy tool of abstraction and forget the other unimportant trivialities we might associate with counting, and use the idea of one-to-one correspondence to count infinite quantities. It might not be counting exactly as you would normally do it, but it will have the same core properties that matter about counting and quantity, and in the end as long as we can agree as to what important parts need to be preserved we can happily abstract all the rest away.
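The tennis ball puzzle from the start of this section can be read in the same spirit. The Python sketch below is only an illustration: the function name `run_tennis_ball_steps` and the dictionary `moved_at` are assumptions introduced here, and a program can of course only run finitely many rounds. What it suggests is that although bin A keeps growing at every finite stage, ball k always reaches bin B in round k, so balls and rounds are matched up one-to-one and no ball is left behind once the process is complete.

```python
def run_tennis_ball_steps(steps):
    """Simulate the first `steps` rounds of the three-bin process.

    Returns (bin_a, bin_b, moved_at), where moved_at[k] is the round in
    which ball k was moved into bin B.
    """
    bin_a, bin_b, moved_at = [], [], {}
    next_ball = 1  # lowest numbered ball still sitting in bin C
    for step in range(1, steps + 1):
        # Move the two lowest numbered balls from bin C to bin A.
        bin_a.extend([next_ball, next_ball + 1])
        next_ball += 2
        # Move the lowest numbered ball in bin A over to bin B.
        lowest = min(bin_a)
        bin_a.remove(lowest)
        bin_b.append(lowest)
        moved_at[lowest] = step
    return bin_a, bin_b, moved_at


bin_a, bin_b, moved_at = run_tennis_ball_steps(5)
print(bin_a)     # balls still waiting in bin A
print(bin_b)     # balls that have reached bin B
print(moved_at)  # ball number -> round in which it moved
```

After five rounds bin A holds balls 6 to 10 and bin B holds balls 1 to 5. Running more rounds makes bin A larger, never smaller, which is exactly why the finished state cannot be guessed from any finite stage; it is the rule "ball k moves in round k" that settles the matter.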
So how do we count infinite sets with one-to-one correspondences? Rather than actually counting infinitely many things, we provide an explicit process by which the count could (in theory) be done. Thus, we simply try to set up a one-to-one correspondence just as before, the difference being that it will be given as a rule we can apply element by element as needed, rather than having every single element to element correspondence laid out ahead of time. If such a correspondence exists then the sets have the same infinite quantity. And that is exactly what we are doing, for example, with the infinite hotel story. First we are comparing the sizes of the sets {1, 2, 3, ...} and {2, 3, 4, ...} by noting that we have a correspondence

$$
\begin{array}{ccc}
1 & \leftrightarrow & 2 \\
2 & \leftrightarrow & 3 \\
3 & \leftrightarrow & 4 \\
4 & \leftrightarrow & 5 \\
\vdots & & \vdots \\
n & \leftrightarrow & n + 1
\end{array}
$$

and that since both sets are infinite we will have exactly one element in the second set for every element in the first set, and vice versa; a one-to-one correspondence. The sets have the same quantity; thus we can shuffle everyone down one room and still house them all. When the tour bus shows up we end up comparing the sizes of the sets {1, 2, 3, ...} and {2, 4, 6, ...} by making the correspondence

$$
\begin{array}{ccc}
1 & \leftrightarrow & 2 \\
2 & \leftrightarrow & 4 \\
3 & \leftrightarrow & 6 \\
4 & \leftrightarrow & 8 \\
\vdots & & \vdots \\
n & \leftrightarrow & 2n
\end{array}
$$
where again the infinite sets ensure that each element corresponds to exactly one element in either direction; another one-to-one correspondence, demonstrating the sets have the same quantity: we can move everyone to even numbered rooms and still house them all.

At this point most people are happy enough to accept that there are the same number of even numbers as there are natural numbers. Their argument runs roughly "well, there is an infinite number of both, and infinity is as big a number as there can be, so of course they're the same". There is, possibly, a little squirming under the fact that what is clearly only a part apparently has the same size as the whole, but that tends to get swept under the rug of "infinity is as big as you can go, so we don't have a choice". The real problems start, however, when we come to the realisation that using this same idea of quantity (determined in terms of one-to-one correspondences) we can find sets with sizes larger than the set of natural numbers: there may be infinitely many natural numbers, but that is not as large as you can go!
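Before moving on to those larger sets, the two hotel correspondences above can be written down as explicit rules, which is all a one-to-one correspondence between infinite sets ever is. The following Python sketch is illustrative only: the function names `shift_by_one` and `double` are assumptions made here, and the checks run over a finite window of ten guests, so they hint at the pattern rather than prove anything about the full hotel.

```python
def shift_by_one(room):
    """Lone traveller: the guest in room n moves to room n + 1."""
    return room + 1


def double(room):
    """Tour bus: the guest in room n moves to room 2n."""
    return 2 * room


# A finite window onto the infinite guest list; the rules themselves
# make sense for any room number.
guests = list(range(1, 11))

after_shift = [shift_by_one(n) for n in guests]
after_double = [double(n) for n in guests]

# Nobody ends up sharing: different guests always get different rooms.
assert len(set(after_shift)) == len(guests)
assert len(set(after_double)) == len(guests)

# Each rule can be run backwards, which is what makes it one-to-one.
assert all(shift_by_one(n) - 1 == n for n in guests)
assert all(double(n) // 2 == n for n in guests)

print(after_shift)   # room 1 never appears, so it is free for the traveller
print(after_double)  # only even rooms appear, so every odd room is free
```

The point of giving the correspondence as a rule is that we never need to list every guest; it is enough that the rule tells any particular guest where to go, and that it can be undone.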
The classic example of something infinite and larger in size than the natural numbers is the continuum (at least as classically conceived; the constructivist/intuitionist continuum is a little more tricky on this front) as discussed in Paradoxes of the Continuum, Part II. In that section we determined that points on the continuum were able to be identified with Cauchy sequences, which were akin to (though a little more technical than) infinite decimal expansions. We'll stick with infinite decimal expansions here as most people have a better intuitive grasp of decimals than they do of Cauchy sequences or Dedekind cuts. To make things simple we'll consider the continuum ranging between 0 and 1; that is, all the possible infinite decimals between 0 and 1, such as 0.123123123... We do have to be a little bit careful here since, as you should recall from Paradoxes of the Continuum, Part II, in the same way that there are many fractions that represent the same ratio, there were many Cauchy sequences that represent the same point in the continuum, and in particular there are different decimal expansions that represent the same point, such as 0.49999999... and 0.50000000...¹². To be careful we have to make sure we always pick and deal with just one representative in all such cases; to do this we can simply only consider representations that have infinitely many non-zero places. Showing that this is sufficient, and still covers all real numbers between 0 and 1, isn't that hard, but amounts to some technical hoop jumping that is necessary for formal proofs, but not terribly elucidating for discussions such as this. Suffice to say that it all works out.

The catch now is that we need to show not just that we are incapable of finding a one-to-one correspondence between these points on the continuum and the natural numbers, but that no such correspondence can exist.
We do this by the somewhat backwards approach of assuming there is such a correspondence, and then showing that a logical contradiction would result. From that we can conclude that any such correspondence would be contradictory, and thus can't actually exist (at least not in any system that doesn't have contradictions). So, to begin, let's presume we have a correspondence like so¹³:
$$
\begin{array}{ccc}
1 & \leftrightarrow & 0.a_1 a_2 a_3 a_4 \ldots \\
2 & \leftrightarrow & 0.b_1 b_2 b_3 b_4 \ldots \\
3 & \leftrightarrow & 0.c_1 c_2 c_3 c_4 \ldots \\
4 & \leftrightarrow & 0.d_1 d_2 d_3 d_4 \ldots \\
\vdots & & \vdots
\end{array}
$$

where the $a_1, a_2, \ldots$ etc. are just digits in the decimal expansion. The trick is to show that despite our best efforts to set up a one-to-one correspondence, the list of points in the continuum given by this correspondence (and since we haven't specified what the correspondence actually is, any such one-to-one correspondence) is actually incomplete: we've missed some. We do this by constructing a decimal as follows: for the first decimal place, choose a digit different from $a_1$; for the second decimal place choose a digit different from $b_2$; for the third choose a digit different from $c_3$; and so on (always choosing a non-zero digit, so that the result is one of the representations we agreed to use). Now clearly this decimal is between 0 and 1, and hence ought to be in our list somewhere, but by its very manner of construction it will differ from the nth decimal number on the list at the nth decimal place; that is, we are guaranteed that it is different from every decimal we've listed! Thus despite our assumption that we had a one-to-one correspondence, it isn't, since we've found a point in the continuum for which there is no corresponding natural number. Given such a contradiction, the only conclusion we can draw is that we cannot create a one-to-one correspondence between natural numbers and points in the continuum; no matter how we try, we'll always end up with extra points in the continuum for which there is no corresponding natural number; that is, there are more points in the continuum than there are natural numbers.

¹² People have a tendency to object to this, and other similar claims, such as that 0.99999... is equal to 1. The easiest way to see this simple but slightly unintuitive fact is to note that we should be able to take 0.99999..., move the decimal point right one place, subtract 9, and arrive back at the same value (this is akin to shifting the hotel guests down one room to make a spare room at the front, but in reverse). That is, if $x = 0.99999\ldots$ we can say that $10x - 9 = x$. A little simple algebraic manipulation quickly yields $x = 1$. This sort of sleight of hand with infinite expansions will also play a role if and when we come to p-adic numbers, and discover that, in that case, negative numbers are really just very big positive numbers!

¹³ The wary should be asking how we can even have a first element among points in the continuum (keeping in mind, of course, that there are cunning ways of ordering fractions such that there is a first element). This question quickly wades into very deep waters indeed; we can appeal to the Well Ordering Principle, which essentially just asserts that this can be done, or its equivalents such as the Axiom of Choice, or Zorn's Lemma; all of these are somewhat contentious and tricky. If you're interested it is well worth doing a little reading about them. We may come to discuss these issues ourselves later when we start to cross back and forth between mathematics and logic further down the road with Topos theory.

What all of this means is that we have to come to terms with the fact that some infinities are bigger than others. In fact, it gets even worse: some infinities are entirely incompatible with others. This particular catch hides in a slightly over-zealous abstraction of numbers. For the most part we do not differentiate between numbers describing order (2nd position, as opposed to 5th position, etc.) and numbers describing quantity (which is generally the notion of number with which we've been dealing). There is a perfectly good reason for this: when it comes to using and manipulating numbers using standard arithmetic, the numbers describing order (so called ordinal numbers) behave completely identically to those describing quantity (which we call cardinal numbers). As so often happens with abstraction (indeed, it essentially is the core idea of abstraction) if there are no practical differences (at least as far as the practical purposes we care about are concerned) between objects, we simply forget that there are any differences at all. And, indeed, for finite numbers this is a perfectly reasonable thing to do. The catch is that, once we start dealing with infinities, ordinals and cardinals start behaving rather differently; it is no longer safe to consider them the same, or even, for that matter, comparable to one another. I won't go into the rather technical theory of transfinite ordinals here, and instead just give you a precis of where the difficulties lie.

To start, let's introduce some standard mathematical notation, and let $\aleph_0$ denote the first infinite cardinal (that is, the quantity of natural numbers), and let $\omega$ denote the first infinite ordinal (that is, the first position reached after we've exhausted all finite positions). Now, as we've already seen, if we have infinite rooms, we can house an extra guest even if they're all full; that is, $\aleph_0 + 1 = \aleph_0$. On the other hand, if we tack on an extra position (call it $a$) after $\omega$ and all the finite ones, i.e. we have 1st, 2nd, 3rd, ..., $\omega$, $a$th, then the $a$th position turns out to be appreciably different to all the positions before it. In other words $\omega + 1 \neq \omega$. Now, whereas with finite numbers adding one produces the same result for both ordinals and cardinals, for infinite numbers it makes a huge difference (in one case we simply end up with what we started with, and in the other we end up with something entirely new). It shouldn't be too hard to see that, from that simple difference, whether you are dealing with a cardinal or ordinal transfinite number is going to matter for arithmetic operations; we can no longer ignore the difference; cardinals and ordinals have to be considered as quite separate and distinct kinds of objects!

At this point you might be trying to reconcile the fact that $\aleph_0 + 1 = \aleph_0$ with the previously observed fact that there are bigger cardinal infinities. How can we get to a bigger infinity if adding to $\aleph_0$ ends up going nowhere?
To answer that I'm going to need to discuss power sets. Given a set A (and for now we'll keep things informal; the finer technicalities of what actually is and is not a set will come further along our road), the power set of A is the set of all possible subsets of A. An example will help clarify. If we have a set $A = \{a, b, c\}$, then the power set of A is $P(A) = \{\{\}, \{a\}, \{b\}, \{c\}, \{a, b\}, \{a, c\}, \{b, c\}, \{a, b, c\}\}$. Thus each element of the set P(A) is itself a set, and in particular, a subset of A; note that we consider both the empty set and A itself to be subsets of A. With a little combinatorics you can see that if a set has n elements (where n is finite), then its power set will have $2^n$ elements (to make a subset we have to decide if each element is either in, or out, of the subset, thus we have 2 choices, multiplied together n times, or $2^n$). The trick is that, using an argument that closely parallels the previous argument showing that there is not a one-to-one correspondence between the natural numbers and points in a continuum, we can show that even if a set has infinite cardinality its power set will have a larger cardinality. Thus, borrowing notation from the finite case, $\aleph_0 < 2^{\aleph_0}$. Using this trick, which applies to any infinite set, we can develop an entire hierarchy of different orders of infinity:

$$
\aleph_0 < 2^{\aleph_0} < 2^{2^{\aleph_0}} < 2^{2^{2^{\aleph_0}}} < \cdots
$$

A similar, but different, hierarchy of infinite ordinals also exists (above and beyond the obvious option of simply adding one to an existing infinite ordinal to get a larger one), spiralling ever higher, this time using the concept of tetration¹⁴ rather than exponentiation. Contrary to initial expectation, infinities exist in infinite variety. How many infinities are there? We cannot say, on pain of paradox, since such a statement would only reflect back on itself in a vicious circle of contradiction.

While we began with just a hazy view of the infinite, the mists have cleared to reveal a strange and remarkable valley; a transfinite landscape with a veritable zoo of infinities of different kinds and sizes; it is, indeed, a whole new landscape of numbers and possibilities to explore; a hidden valley of the infinite. Running through the middle of the valley is a large river, and if we wade in we will find very deep waters. It is all a matter of asking the right questions.

The question we can start with seems innocently simple. Write $\aleph_1$ for the smallest cardinal bigger than $\aleph_0$. Is the cardinality of points on the continuum bigger than, smaller than, or the same as $\aleph_1$? The answer is deceptively complex. It can be established, with a little work, that the number of points in the continuum is exactly $2^{\aleph_0}$, which is bigger than $\aleph_0$ and hence not smaller than $\aleph_1$; that leaves us with either bigger than, or the same size as, $\aleph_1$. From there things get complicated quickly, and mired in a certain degree of technicality, but essentially the result is that the answer doesn't matter. What I mean by that is we may assume that the number of points in the continuum is $\aleph_1$ and no problems or contradictions will arise, yet at the same time we can equally well assume that the number of points in the continuum is strictly greater than $\aleph_1$ and still no problems or contradictions arise. Put another way, whether there is any cardinal number strictly between $\aleph_0$ and $2^{\aleph_0}$ falls into this category. There is, in a sense, no truth here, merely preference.
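Stepping back for a moment to the two arguments that built this hierarchy: both the diagonal construction on decimals and its power set counterpart are mechanical enough to try out on small finite examples. The Python sketch below is an illustration only; the function names (`power_set`, `diagonal_decimal`, `missed_subset`) and the sample digit strings are assumptions introduced here, and a finite check naturally proves nothing about infinite sets. It is meant to show how closely the two arguments parallel each other: in each case we build something that disagrees with the nth listed item about n itself.

```python
from itertools import combinations, product


def power_set(elements):
    """Return every subset of `elements`, each as a frozenset."""
    items = list(elements)
    return [frozenset(combo)
            for size in range(len(items) + 1)
            for combo in combinations(items, size)]


def diagonal_decimal(listed):
    """Build digits that differ from the n-th listed expansion at place n.

    `listed` holds digit strings (the part after "0."). Only the digits
    4 and 5 are ever chosen, so the result is never all zeros or nines.
    """
    digits = []
    for n, expansion in enumerate(listed):
        digits.append("5" if expansion[n] != "5" else "4")
    return "".join(digits)


def missed_subset(elements, assignment):
    """Given a map element -> subset, return a subset the map never hits."""
    return frozenset(x for x in elements if x not in assignment[x])


# The power set of a three element set has 2 ** 3 members.
A = ["a", "b", "c"]
subsets = power_set(A)
print(len(subsets))

# A (very short) attempted list of decimals, and a decimal it misses.
listed = ["5231", "4999", "5050", "3141"]
print("0." + diagonal_decimal(listed) + "...")

# Try every possible way of pairing elements of A with subsets of A:
# the "diagonal" subset is always left out.
for choice in product(subsets, repeat=len(A)):
    assignment = dict(zip(A, choice))
    assert missed_subset(A, assignment) not in assignment.values()
```

For the finite set the conclusion is unsurprising, since 3 is obviously less than 8; what matters is that `missed_subset` never looks at how many elements there are, which is why the same recipe keeps working when the set is infinite.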
¹⁴ Tetration is kind of like exponentiation on steroids; or, more accurately, it's the next layer of abstraction up: whereas the number in exponentiation counts the multiplications to be performed, the number in tetration counts the exponentiations to be performed. Thus the 4th tetration of 3, written ${}^{4}3$, is equal to $3^{3^{3^{3}}}$, which is $3^{3^{27}} = 3^{7625597484987}$, or really rather remarkably large.
This freedom of choice highlights deep facts about mathematics. When our journey began we considered numbers, and fractions, and algebra. Relatively speaking these are fairly simple abstractions, and, more importantly, they are abstractions that we tend to use each and every day (particularly in the case of numbers and fractions). Through a mix of immediate concrete associations due to the relatively low level of abstraction, and the sense of reality imbued by constant use and exposure, we tend to think of numbers, fractions, algebra, and even mathematics itself, as something real, fixed, and concrete. That is, we think of mathematics as describing some platonic reality, that the objects it describes, while abstract, have some real existence. It is natural, then, to think that a number between $\aleph_0$ and $2^{\aleph_0}$ either exists, or doesn't exist, in some real and concrete way; yet that is not how things have worked out. Imagine being told that whether the number five existed or not was quite optional, and that arithmetic would work just fine either way! We have, in essence, been told that existence is merely a preference, not a reality; truth is up for grabs, an option rather than a cold hard absolute.

Going all the way back to the first section, On Abstraction, things start to get a little clearer however. As long as we view mathematics as a matter of making effective and powerful abstractions from the real world, rather than describing some platonic universe, having a choice of abstraction doesn't seem so bad. We can choose how to interpret the continuum to suit our needs; indeed, we can even reject transfinite arithmetic and opt for the intuitionist conception of the continuum if we wish; we choose the abstraction that best suits our purposes for the moment.
You could view it as little different than choosing to work at the genetic level as a molecular biologist instead of considering subatomic particles as a physicist would: the level and manner of abstraction matters only with regard to the level and manner of detail you wish to obtain in the way of results.

The more layers of abstraction we apply, the greater the chances of running into quandaries and choices; by abstracting away more and more detail, and by piling abstractions upon abstractions, we push further and further into the realm of pure possibility. This has the potential to lead us to strange and confusing trails, but it also gives us the power to see beyond our own limited horizons. In broadening our minds to embrace worlds of possibility we conceive of realities that transcend our conceptions, and probe our own reality in ways far beyond the limits evolution has shackled our perceptions with.

It has been a steep climb, but we have left the plains of ordinary finite numbers far behind us. We've crested the peak, and found a world that is strange and new. It gives us a chance to stretch our minds and our conceptions, and to begin to change how we look at the world. Equally importantly, it gives us a foundation for the further climb to come, providing a glimpse of the dance between logic and mathematics that will follow. We passed by the crossroads of unreality some time ago, yet there is still a very long way to go.
2.6
Grouping Symmetries
Coming soon...
List of Figures
2.1 Pattern of coloured marbles
2.2 Labelled pattern of marbles
2.3 A different arrangement of marbles that preserves the pattern
2.4 Showing how the rearrangement was made
2.5 The same marbles in a rectangular arrangement
2.6 A square with labelled corners
2.7 The three different rotations of a square
2.8 The four different flips of a square
2.9 A distorted square (not a square anymore)
2.10 Two basic operations for rearranging a square
2.11 A permutation of three objects
2.12 Combining two permutations to obtain a new permutation
2.13 Two permutations, labelled a and b
2.14 The result of combining b and then a
2.15 Showing how aba is the same as bab
2.16 An ammonium molecule; nitrogen is coloured blue, while hydrogen atoms are depicted as white.
Bibliography
[1] Dorothy Britton. Haiku Journey: Basho's Narrow Road to a Far Province. Kodansha International, 1974.
[2] Lewis Carroll. Alice's Adventures in Wonderland. Macmillan, 1865.
[3] Donald Keene. Anthology of Japanese Literature: From the Earliest Era to the Mid-Nineteenth Century. Grove Press, 1994.
[4] Earl Miner. Japanese Poetic Diaries. University of California Press, 2004.
[5] Bertrand Russell. Essays in Analysis. Allen & Unwin, 1973.