Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
ABSTRACT
The aim of this poster is to inform ACS members about the OnLine Encyclopedia of Integer Sequences (or OEIS) [1], which is a freely accessible database containing information about 120,000 number sequences. This poster will describe just a few of the many entries that are of interest to chemists.
1. The OEIS
Entries in the On-Line Encyclopedia of Integer Sequences (or OEIS) give the rst 100 (or sometimes 10,000) terms of the sequences, their denitions, formulas, computer programs to generate them, references to the literature and to the Internet, graphs and other illustrations. Each sequence has a unique identication number: e.g. A000055 gives the number of trees on n nodes. If you come across a sequence in your work (1, 3, 17, 40, 102, . . . , say) and wish to identify it, the OEIS is the place to look. The OEIS is maintained by N.J.A.S. and has been in existence in various forms for over 40 years. The web site is used by thousands of people each day. New sequences are added at a rate of over 12,000 each year. Only a selection of some sequences of interest to chemists will be mentioned here. Our convention is that a(n) usually denotes the nth term of the sequence under discussion, and we will display the rst ten or so terms a(1), a(2), a(3), . . . , although of course the OEIS gives many more terms.
A000055: the number of trees on n nodes (see Fig. 1): 1, 1, 1, 2, 3, 6, 11, 23, 47, 106, 235, 551, 1301, 3159, . . . . A000088: the number of graphs on n nodes (see Fig. 2): 1, 2, 4, 11, 34, 156, 1044, 12346, 274668, 12005168, . . . . (The OEIS contains literally hundreds of sequences related to those two.)
2n 1 2n+1 n
1, 2, 5, 14, 42, 132, 429, 1430, 4862, 16796, 58786, . . . . This is the single most frequently looked-up sequence in the OEIS, with over a hundred dierent interpretations. For example, C (n) is the number of ways to insert n pairs of parentheses in a string of n + 1 letters. E.g. for n = 3 there are ve ways: ((AB)(CD)), (((AB)C)D), ((A(BC))D), (A((BC)D)), (A(B(CD))).
9 20 37 9 15 38
alkanes 1 1 1 2 3 5 9 18 35 75 159 355 802 1858 4347 . . . In fact Cayleys calculations were incorrect above n=11, and the above values are taken from [3], where P olya theory was used to determine exact formulas for these sequences. 7
Figure 3: Illustration of initial terms of A000022, A000200 and A000602, from Henry Bottomley
In [2](a), Cayley attempted to determine the number of alkyl radicals with structure Cn H2n+1 , again ignoring stereoisomers. This is sequence A000598: 1, 1, 2, 4, 8, 17, 39, 89, 211, 507, 1238, 3057, 7639, 19241, . . . . (again Cayleys result were slightly incorrect).
In the 1930s, Henry Henze and Charles Blair, chemists at the Univ. of Texas, wrote a long series of papers in J. Amer. Chem. Soc. [4], [5], [6] in which they found recurrence relations for the numbers of isomers in various families of chemical compounds. For example, they gave recurrences relations for Cayleys sequences A000602 and A000598 mentioned above, as well as for the analogous sequences when stereoisomers are taken into account. Thus we have A000628 (alkanes Cn H2n+2 , taking stereoisomers into account): 1, 1, 1, 2, 3, 5, 11, 24, 55, 136, 345, 900, 2412, 6563, . . . , and A000625 (alkyl radicals Cn H2n+1 , taking stereoisomers into account): 1, 1, 2, 5, 11, 28, 74, 199, 551, 1553, 4436, 12832, . . . . Henze and Blairs recurrences are usually extremely complicated, and it has recently been discovered that there are occasional errors in their tables. Of course their calculations were carried out by hand. P olyas approach, described in the next section, proved to beonce the necessary mathematical machinery had been developeda much simpler and more powerful method.
10
A major breakthrough occurred around 1936 when the mathematician George P olya published two papers [7] in which he developed a general theory of enumeration, applicable to a wide range of problems in chemistry and combinatorics. His work had been anticipated in part by Redeld [9], but nevertheless this method is known today as P olyas Enumeration Theory. The idea is to use the representation theory of permutation groups and their cycle indices to obtain a far-reaching generalization of Burnsides Lemma. This lemma states that if a set S of objects is permuted by a group G of permutations, then the number of equivalence classes (orbits) of objects in S is equal to the average number of objects that are xed under each permutation in G. For an excellent introduction to P olya theory, see the article by de Bruijn [10]. In his two papers, P olya gives a large number of examples from chemistry. In particular, he gives generating functions for several of the sequences mentioned above. To illustrate, here is the beginning of Table I from [7](a), with the names translated from German (and of course with the sequence numbers added). It is interesting to note that he thanks his nephew, the chemical engineer J. P olya, for collaborating in producing the tables.
11
Table 1: Number of structural isomers for homology series and alkyl-derivatives, from P olya, Zeit. f. Kristall., 93 (1936), 415443
1 2 1 1 1 1 1 2 1 2
3 1 2
4 2 4
5 3 8 31 21
6 5 17
5 12 4 9
Cn H2n1 XYZ 1 4 13 42 131 402 Tri-substitued parans A000640 Cn H2n1 X2 Y 1 3 Cn H2n1 X3 1 2 . . 9 27 5 14 . . 81 240 Tri-substitued parans A022014 39 109 Tri-substitued parans A000641 . .
12
In cluster science it is well-known that clusters containing certain special numbers of atoms occur more frequently than others. These special numbers are often called magic numbers, and a search on the Internet for magic number and chemistry will produce over 100,000 references. Many of these sequences of magic numbers are not well-dened mathematically. Probably the most famous example comes from physics: this is the sequence A018226 2, 8, 20, 28, 50, 82, 126, atoms with one of these numbers of protons or neutrons in their nuclei are considered to be stable. In the 1980s Boon Teo and N.J.A.S. carried out a systematic study of magic numbers from a geometrical point of view [11], [12], [13]. In [11](a) we analyze clusters that have the shape of one of the Platonic or Archimedean solids (extending earlier work by Buckmister Fuller, Coxeter and others).
13
Figure 4: Cluster in shape of truncated octahedron with each edge subdivided into two equal segments. This cluster contains G2 = 201 atoms, of which S2 = 122 lie on the surface (or skin)from Teo and Sloane, Inorg. Chem., 24 (1985), 45454558
14
We provide explicit formulas for the analogous magic numbers when the edges of the polygons or polyhedra are subdivided into n equal parts. For the truncated octahedron the formulas for Sn , the number of surface points, and Gn , the total number of points (the magic numbers), are: Sn = 30n2 + 2 (n 1), Gn = 16n3 + 15n2 + 6n + 1 . Reference [11](a) contains a large number of such cluster sequences. The following table shows a small sample.
15
Table 2: Number of Surface Points Sn and Total Number of Points Gn for Various Archimedean and Other Figures (from Teo and Sloane, Inorg. Chem., 24 (1985), 45454558)
Polyhedron Truncated
5 352
6 506
7 688
8 898
10 Sequence 1402 A005905 4576 A005906 1002 A005901 3871 A005902 3002 A005903
Sn 1 16
58 128 68 180 42 92
55 147
Sn 1 32 122 272
octahedron Gn 1 38 201 586 1289 2406 4033 6266 9201 12934 17561 A005910 . . . . . . . . . . . .
16
In [11](b), [12], [13] Teo and N.J.A.S. give a similar analysis for the sizes of spherical clusters that can be found in the hexagonal lattice in the plane, the simple cubic, face-centered cubic (fcc), and body-centered cubic (bcc) lattices in three dimensions, as well as the hexagonal close-packing (hcp) and diamond structures. Generating functions for these cluster series are simply the appropriate theta series of the lattice with respect to the particular point used as the center of the cluster. Again some of these results were already known. For example, consider clusters centered at a lattice point in the fcc lattice. Let Gn (the magic numbers) denote the total number of atoms in the cluster of radius 2n, and let Sn denote the number of atoms on the surface of the cluster. Then we
obtain the cluster series (A004015, A119869): Sn : 1, 12, 6, 24, 12, 24, 8, 48, 6, 36, 24, 24, 24, 72, 0, 48, . . . , Gn : 1, 13, 19, 43, 55, 79, 87, 135, 141, 177, 201, 225, 249, . . . . On the other hand, if the clusters are centered at an octahedral-shaped hole in the fcc lattice, the analogous cluster series are (A005887, A119874): Sn : 6, 8, 24, 0, 30, 24, 24, 0, 48, 24, 48, 0, 30, 32, 72, 0, 48, . . . ,
17
Gn : 6, 14, 38, 38, 68, 92, 116, 116, 164, 188, 236, 236, 266, . . . . The three references contain a large number of cluster series of this type.
18
8. Coordination Sequences
The cluster series described in the previous section give the numbers of atoms in balls of successive radii 0, 2, 4, 6, 8, . . . (say) around the central point.
Alternatively, one may classify atoms according to the number of bonds in the shortest path to the central atom. The zeroth shell consists of a single atom, the number in the next shell is the conventional coordination number, and in general the nth shell consists of those atoms that are bonded to atoms in shell n 1 (and which have not already been counted). The sequence that gives the numbers of atoms in the successive shells is the coordination sequence of the structure, a term introduced in 1971 by Brunner and Laves [14]. Coordination sequences were originally used to investigate the topology of frameworks and to help specify the positions of atoms. They are now routinely used to characterize crystallographic structures. In [15], Ralf Grosse-Kunstleve, G. O. Brunner and N.J.A.S. computed the coordination sequences for all the zeolites in the Meier-Olson Atlas of Zeolite Structure Types. There are almost 400 sequences, all of which are now in the OEIS, often with recurrences and explicit formulas.
19
The following is a more typical, three-dimensional example. This coordination sequence arises in the zeolite structures AFG, CAN, LIO and LOS: (sequence A008013): 1, 4, 10, 20, 34, 54, 78, 104, 134, 168, 210, 256, 302, 352, . . . . The nth term, a(n), is given by ve dierent expressions: a(5m) = 52m2 + 2 , a(5m + 1) = 52m2 + 22m + 4 , a(5m + 2) = 52m2 + 42m + 10 , a(5m + 3) = 52m2 + 62m + 20 , a(5m + 4) = 52m2 + 82m + 34 . This kind of formula is quite typical for coordination sequences of zeolites, although many of the formulas are much more complicated than this one. (Technically, formulas like this, where a(n) depends on the residue of n modulo some number, are called PORC functions, which stands for Polynomial On Residue Classes!)
20
Figure 5: Example: Coordination sequence of a two-dimensional net, with successive shells indicated by dierent colors. From R. W. Grosse-Kunstleve et al., Poster, Sixteenth European Crystallographic Meeting, Lund, Sweden, August 611, 1995
21
Maximally dense sphere packings can be constructed by stacking layers, where each layer consists of an innite sheet of balls in the hexagonal lattice arrangement. Once one layer is in positioncall it a type A layerthen there are three choicescall them A, B, Cfor each of the remaining layers. Such packings are generally called Barlow packings. If the layers alternate . . .A, B, C, A, B, C, A,. . . with period three, we obtain the fcc lattice; whereas if they alternate . . .A, B, A, B, A, B, A,. . . with period two, we obtain the hcp structure. Many authors have studied enumeration problems arising from such packings. For example, T. J. McLarnan [16] used P olya theory to nd the number of Barlow packings in which the layers alternate with period exactly n (A011768): 0, 1, 1, 1, 2, 3, 6, 7, 16, 21, 43, 63, 129, 203, 404, 685, 1343, . . . . The entries a(2) = a(3) = 1 correspond to the hcp and fcc, respectively. In the same paper McLarnan enumerates many related structures. For example, the number of ZnS polytypes with period n (A011957) is: 0, 1, 1, 1, 1, 2, 3, 6, 10, 18, 31, 59, 105, 198, 365, 688, 1285, . . . . The entry a(3) = 1 corresponds to the cubic spharelite structure. 22
For other recent work on enumerating stacking sequences see Thompson and Downs [17], Lord et al. [18], Estevez-Rams et al. [19]. Stacking sequences are also studied extensively in metallurgy.
23
25
Figure 6: Polyhexes with 1, 2 or 3 hexagons, corresponding to the benzene, naphthalene, anthracene, phenalene and phenanthrene structures, respectively.
26
Figure 7:
a(1) = a(2) = 1, a(3) = 2, a(4) = 5, a(5) = 12 of A000105 (from Eric Weissteins World of Mathematics: Polyomino )
27
29
12. Conclusion
The goal of the OEIS is to include information about all interesting number sequences. At the present time over 1,000 of the 120,000 entries have their origin in chemistry, and chemists will nd much material of interest. To nd the OEIS, simply Google sequences. If you come across a number sequence that is not in the OEIS, especially from the chemical literature, please send it in to the database. There is a special web page for contributing a new sequence or a comment. The reasons for doing this are that the next person who comes across the sequence will be grateful for the reference, and your name will be preserved in the OEIS as the person who contributed the sequence. (You need not be the author of the sequence to send it in.)
30
Another use for the OEIS is to help people do well on quizzes. Can you nd the next term in the following? You know where to nd the answers! (1): (2): (3): (4): (5): (6): 61,21,82,43,3,64,24,? (A087409) (A005228) (A001690) (A063108)
1,3,7,12,18,26,35,45,56,69,83,98,114,131,150,?
(A006751)
1,2,2,3,3,4,4,4,5,5,5,6,6,6,6,7,7,7,7,8,8,8,8,9,? (A001462)
31
References
[1] N. J. A. Sloane, The On-Line Encyclopedia of Integer Sequences, published electronically at www.research.att.com/njas/sequences/, 19962006. [2] A. Cayley, (a) Phil. Mag., 67 (1874), 444447; (b) Chem. Ber., 8 (1875), 10561059; (c) Rep. Brit. Assoc. Adv. Sci., 45 (1875), 257305. [3] E. M. Rains and N. J. A. Sloane, J. Integer Sequences, 2 (1999), #99.1.1. [4] C. M. Blair and H. R. Henze, J. Amer. Chem. Soc., 54 (1932), 10981105, 15381545. [5] D. D. Coman, C. M. Blair and H. R. Henze, J. Amer. Chem. Soc., 55 (1933), 252253. [6] H. R. Henze and C. M. Blair, J. Amer. Chem. Soc., 53 (1931), 30423046, 30773085; 55 (1933), 680686; 56 (1934), 157. [7] G. P olya, (a) Zeit. f. Kristall., 93 (1936), 415443; (b) Acta Math., 68 (1937), 145254 [English translation in [8]]. [8] G. P olya and R. C. Read, Combinatorial Enumeration of Groups, Graphs and Chemical Compounds, Springer, 1987. [9] J. H. Redeld, Amer. J. Math., 49 (1927), 433455. [10] N. G. de Bruijn, in Applied Combinatorial Mathematics, ed. E. F. Beckenbach, Wiley, 1964. [11] B. K. Teo and N. J. A. Sloane, (a) Inorg. Chem., 24 (1985), 45454558; (b) 25 (1986), 23152322. [12] N. J. A. Sloane and B. K. Teo, J. Chem. Phys., 83 (1985), 65206534.
32
[13] N. J. A. Sloane, J. Math. Phys., 28 (1987), 16531657. [14] G. O. Brunner and F. Laves, Wiss. Zeitschr. Techn. Univ. Dresden, 20 (1971), 387390. [15] R. W. Grosse-Kunstleve, G. O. Brunner and N. J. A. Sloane, Acta Cryst., A52 (1996), 879889. [16] T. J. McLarnan, Zeitschr. f. Kristall., 155 (1981), 269291. [17] R. M. Thompson and R. T. Downs, Acta Cryst., B57 (2001), 761771; B58 (2002), 153. [18] E. A. Lord et al., Phil. Mag., A82 (2002), 255268. [19] E. Estevez-Rams et al., Acta Cryst., A61 (2005), 201208. [20] Joseph Myers, Polyomino, Polyhex and Polyiamond Tiling, published electronically at www.srcf.ucam.org/jsm28/tiling/, 2005 (the entries in [1] are more up-to-date). [21] Tom as Oliveira e Silva, Animal Enumerations on the {4,4} Euclidean Tiling, published electronically at http://www.ieeta.pt/ tos/animals/a44.html. [22] J. Brunvoll, S. J. Cyvin and B. N. Cyvin, J. Math. Chem., 21 (1997), 193196. [23] P. W. Fowler and D. E. Manolopoulos, An Atlas of Fullerenes, Cambridge Univ. Press, 1995. [24] G. Brinkmann and A. W. M. Dress, J. Algorithms, 23 (1997), 345358. [25] B. D. Hughes, Random Walks and Random Environments, Oxford, 2 vols., 19951996.
33