Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
net/publication/48214673
CITATIONS READS
344 355
4 authors, including:
Pamela S Soltis
University of Florida
585 PUBLICATIONS 39,527 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Douglas E Soltis on 30 August 2016.
Introduction
Both chromosome number and genome size vary tremendously across the
flowering plants (Fedorov 1969; see the Plant DNA C-Values Database at
http://www.rbgkew.org.uk/cval/homepage.html). This variation has stim-
ulated a great deal of speculation about the original genome size and base
chromosome number of the angiosperms, as well as about the patterns of
genome and chromosomal evolution. In earlier reports, some authors pro-
posed that the original base chromosome number for angiosperms was low,
between x = 6 and 9 (e.g., Ehrendorfer et al. 1968; Stebbins 1971; Walker 1972;
Raven 1975; Grant 1981), and that the original genome size of angiosperms
was small (Leitch et al. 1998; D. Soltis et al. 2003b), which would suggest a
close correspondence between the evolution of genome size and chromosome
number. However, analysis of the vast range of chromosome numbers and
genome sizes encountered in angiosperms shows that genome size can vary
independently of chromosome number.
Recent studies have revealed the complexities involved in reconstructing
ancestral base chromosome numbers and genome sizes for angiosperms. A
large component of this complexity is that polyploidy is rampant through-
out angiosperm evolutionary history. The question of what proportion of
angiosperms is of polyploid origin is almost certainly moot, given that sev-
eral recent studies have demonstrated that one or more rounds of ancient
genome duplication characterize most, if not all, angiosperms formerly
288 CHAPTER 13
considered to be diploid. Complete sequencing of the viewed in Leitch et al. 1998). C-values in angiosperms
genome of Arabidopsis thaliana, which has a very small span a huge range. The smallest reported values are for
genome (157 Mb; Bennett et al. 2003), revealed many du- Cardamine amara (Brassicaceae; 1C = 0.05 pg, Bennett
plicate genes and suggested two or three rounds of and Smith 1991) and Fragaria (Rosaceae; 1C = 0.10 pg,
genome duplication (Vision et al. 2000; Bowers et al. Antonius and Ahokas 1996). Arabidopsis thaliana, a well-
2003). Similarly, diploid members of Brassica may be an- known model organism with a very small genome, has
cient tetraploids or hexaploids (Lagercrantz 1998). A ge- 1C = 0.16 pg (Bennett al. 2003); the largest value is for
nomics investigation of rice indicates that it too is an an- Fritillaria assyriaca (Liliaceae; 1C = 127.4 pg, Bennett and
cient polyploid; a polyploidy event apparently occurred Smith 1976).
after the divergence of Poales (see Chapter 4), but be- Despite this enormous range in DNA amount, the
fore the divergence of the major cereals from each other basic complement of genes required for normal growth
(Paterson et al. 2004). Perhaps a more appropriate ques- and development appears to be essentially the same,
tion is: How many rounds of genome duplication have leading to what is termed the “C-value paradox”
occurred in various lineages of angiosperms? (Thomas 1971). That is, the large variation in genome
In this chapter, we discuss the evolution of both size detected in both plants and animals is not correlat-
genome size and base chromosome number in a gen- ed with organismal complexity. There is now general
eral sense across the angiosperms, using the phyloge- agreement that genomes are not simply linear collec-
netic framework now available (e.g., Hoot et al. 1999; tions of genes; differences in amount of DNA can large-
P. Soltis et al. 1999b; Savolainen et al. 2000a, 2000b; D. ly be attributed to changes in the proportion of noncod-
Soltis et al. 2000), as updated by more recent analyses ing, repetitive DNA.
of basal angiosperms (Zanis et al. 2002) and eudicots Several mechanisms have been proposed for this
(D. Soltis et al. 2003a). large variation in genome size in the angiosperms (re-
viewed in Kellogg and Bennetzen 2004). Repeated cy-
cles of polyploidy may increase genome size (e.g.,
Genome Size Leitch and Bennett 1997; D. Soltis and Soltis 1999; Otto
The amount of DNA in an unreplicated gametic nuclear and Whitton 2000; P. Soltis and Soltis 2000; Wendel
genome is referred to as the 1C-value (or simply C- 2000). In addition, transposable elements also appear
value). The C-value is often loosely referred to as to contribute to increases in genome size throughout
genome size, but strictly speaking, genome size is the eukaryotes (e.g., Flavell 1988; Bennetzen 2000, 2002;
amount of DNA in an unreplicated, gametic chromo- Sankoff 2001; Kidwell 2002). Gregory (2001) has
some set. Genome size equals the 2C nuclear DNA suggested that the phrase “C-value paradox” be re-
amount divided by ploidal level, and this equals the C- placed by “C-value enigma” to indicate that the current
value in diploids (Bennett et al. 1998). This formula challenge is to understand the mechanisms and forces
gives an accurate estimate for individuals with con- that determine the amounts of repetitive DNA in a
stituent genomes of equal size (e.g., diploids and au- genome.
topolyploids) but provides only an estimate of the The hypothesis of repeated cycles of polyploidy has
mean for individuals with constituent genomes of dif- received particular attention in flowering plants and is
ferent sizes (e.g., some diploid hybrids and allopoly- supported by isozyme evidence. Several basal lineag-
ploids). Note that for polyploids, genome sizes estimat- es (e.g., Magnoliaceae, Lauraceae, Calycanthaceae), as
ed in this way are always smaller than C-values. For well as some clades of eudicots with uniformly high
example, in the diploid Triticum monococcum, 2C = 12.45 chromosome numbers (e.g., Hippocastanaceae [now
pg, so 1C = 12.45 ÷ 2 = 6.23 pg, which also equals the part of Sapindaceae], Trochodendraceae, Salix + Popu-
genome size. In the tetraploid T. dicoccum, in contrast, lus of Salicaceae), have many duplicated loci, in agree-
2C = 24.05 pg, so 1C = 24.05 ÷ 2, which equals 12.03 pg, ment with ancient polyploidy (D. Soltis and Soltis
but the genome size is 24.05 ÷ 4, or 6.01 pg. 1990), as suggested by Stebbins (1971). As noted, even
C-values have been estimated for more than 3,500 Arabidopsis with its very small genome and low chro-
species of angiosperms (Bennett et al. 1997; Bennett and mosome number appears to be an ancient polyploid.
Leitch 2001, 2003; Hanson et al. 2001a, 2001b; Leitch and Multiple genome duplication events in angiosperm di-
Hanson 2002), representing more than 1% of the ap- versification are also supported by recent analyses of
proximately 250,000 to 300,000 species of flowering genomic evidence (Bowers et al. 2003). By comparing
plants and approximately 48% of all angiosperm fam- gene number and gene order for a gymnosperm
ilies (sensu APG 1998; APG II 2003). The Plant DNA C- (Pinaceae), a monocot, Oryza (Poaceae), and several eu-
Values Database (http://www.rbgkew.org.uk/cval/ dicot lineages (Solanaceae, asterid; Fabaceae, eurosid I;
homepage.html) represents the largest collection of nu- Malvaceae, eurosid II; and Brassicaceae, eurosid II),
clear DNA amounts for any group of organism (re- Bowers et al. (2003) estimated the number and timing
EVOLUTION OF GENOME SIZE AND BASE CHROMOSOME NUMBER 289
300 Mya
170–235 Mya
β 112–156 Mya
112 Mya
83–86 Mya
α 14.5–20.4 Mya
Arabidopsis
Solanaceae
Malvaceae
Medicago
Pinaceae
Glycine
Brassica
Oryza
FIGURE 13.1 Multiple genome duplication events in the wide duplication events inferred and the approximate tim-
evolutionary history of Arabidopsis (modified from Bowers ing of each (see text). Ages are given in million years before
et al. 2003). The symbols α and β indicate two genome- present (Mya).
of genome-wide duplication events (Figure 13.1; see genome sizes have been reported, relatively few species
also Kellogg 2003). Within eurosid II, a genome-wide in each group have large genomes. Furthermore, those
duplication event seems to have occurred after the split species with large genomes tend to be restricted to the
between Malvaceae and Arabidopsis (Brassicaceae). This more derived families within each of these groups
genome duplication event is estimated at 83 to 86 Mya, (Leitch et al. 1998). The most parsimonious explanation
an event that may characterize all Brassicaceae and per- for these observations is that the ancestral angiosperms
haps their close relatives. An earlier genome duplica- had small genomes and that the possession of large
tion event is suggested after the split between rice genomes is derived.
(monocots) and eudicots. This earlier duplication event Within extant seed plants, a small genome is unique
was estimated at 112 to 156 Mya by Bowers et al. (2003); to the angiosperms (Leitch et al. 1998, 2001). Extant
the younger end of this time range seems reasonable gymnosperms are all characterized by larger C-values
based on the probable time of origin of the monocots than angiosperms. The modal C-value for 152 gym-
estimated from both fossil evidence (Gandolfo et al. nosperms is 15.8 pg compared with a modal value of
1998, 2002) and molecular divergence (e.g., K. Bremer 0.6 for angiosperms (Leitch et al. 2001; this modal value
2000; see Chapters 3 and 4). for angiosperms is slightly lower than that reported by
Leitch et al. (1998) calculated mean C-values for Leitch et al. 1998, 0.7 pg). The modal C-value for 50 lep-
many angiosperm species and considered the evolution tosporangiate ferns is 7.95 pg (I.J. Leitch, pers. comm.),
of genome size in light of angiosperm phylogeny. Up- which is also higher than that for angiosperms.
dated calculations (in D. Soltis et al. 2003b) showed (Fig- Since the analysis by Leitch et al. (1998), new devel-
ure 13.2) that despite the enormous range in values, opments have indicated that a reevaluation of the di-
most angiosperms have small C-values, between 0.1 and versification of 1C-values in angiosperms is timely. Re-
3.5 pg. In fact, the modal C-value for angiosperms is ac- cent topologies (e.g., Qiu et al. 1999; Chase et al. 2000a;
tually quite low, 0.7 pg (~675 Mbp). Large genomes are D. Soltis et al. 2000a; Zanis et al. 2002) provide greater
present
SinauerinAssociates,
a few large Inc.clades: monocots, Ranunculales, resolution and internal support for relationships
Santalales,
SOLTIS Caryophyllales, asterids, and rosids. How- throughout the angiosperms than earlier single-gene
Phylogeny
ever, even inandthese
Evolution of the Angiosperms
clades, most members have small analyses (e.g., Chase et al. 1993; see Chapter 2). Further-
SOL1301.eps
or intermediate-sized
100% of size genomes; only two groups con- more, C-values for some crucial taxa (e.g., Amborella,
tain1.27.5
members with very large genomes (≥35 pg, 50 times Austrobaileya, Illicium, Ceratophyllum) have been esti-
the modal value), Santalales and monocots. Consider- mated (e.g., Leitch and Hanson 2002; Hanson et al.
ing just the six groups in which large to very large 2001a, 2001b). Amborella and two species of Nymphaea
290 CHAPTER 13
1000 Magnolia, the reported C-values are 0.90 pg, 5.98 pg, and
7.1 pg. Magnolia kobus has a C-value of 0.90 pg and is a
diploid member of the genus with 2n = 38, which is the
lowest number for Magnolia. Liriodendron, the sister of
750 Magnolia, also with 2n = 38, has a C-value of 0.80, which
is comparable to M. kobus. The highest C-values for Mag-
Number of species
FIGURE 13.3 Parsimony reconstruction of genome size (2001a, 2001b), Leitch and Hanson (2002); see D. Soltis et al.
diversification in the angiosperms obtained using the “all (2003b). Ranges of values for Asparagales, commelinids,
most parsimonious states” option of MacClade. The gener- Dioscoreales, Liliales, and Pandanales are from Leitch et al.
al topology is from D. Soltis et al. (2000), with modifica- (1998); more detail for monocots is provided in Figure 13.5.
tions following Chase et al. (2000), Fishbein et al. (2001), Can = Canellales; Laur = Laurales; Pro = Proteales; Sant =
Zanis et al., (2002) and D. Soltis et al. (2003a; see text). 1C- Santalales. The rosid clade is summarized with Vitaceae as
values are from Bennett and Leitch (2001), Hanson et al. sister to the eurosids (= core rosids; see Chapter 8).
Cycads
Outgroup
Ginkgo
Ephedra
Gnetum
Welwitschia
Pinus
Larix
Amborella
Nymphaeaceae
Illicium
Kadsura
Austrobaileya
Dioscoreales
Commelinids
Liliales
Asparagales
Monocots
Pandanales
Hydrocharitaceae
Potamogetonaceae
Aponogetonaceae
Araceae
Tofieldiaceae
Acorus
Basal angiosperms
Ceratophyllum
Chloranthus
Sarcandra
Can
Canella
Drimys
Asarum
Piperales
Aristolochia
Saururus
Anemopsis
Magnoliids
Peperomia
Piper
Persea
Laur
Cinnamomum
Hernandia
Calycanthus
Myristica
Magnoliales
Horsfieldia
Annona
Asimina
Magnolia
Liriodendron
Glaucidium
Early-diverging eudicots
Ranunculales
Dicranostigma
Papaver
Argemone
Eschscholzia
Fumaria
Menispermaceae
Berberidaceae
Ranunculaceae
Nelumbo
Platanaceae Pro
Proteaceae
Buxaceae
Gunnera
Aizoaceae
Caryophyllales
Cactaceae
Portulacaceae
Amaranthaceae
Caryophyllaceae
Polygonaceae
Plumbaginaceae
Droseraceae
Eudicots
Loranthus
Sant
Lysiana
Dendropthoe
Core eudicots
Viscum
Cornaceae
Ebenaceae
Primulaceae
Ericaceae
Balsaminaceae
Asterids
Bryidae
Tetraphidaceae
Mosses
Andreaeaceae
Polytrichaceae
Sphagnaceae
Lycophytes
Isoetes
Selaginella
Lycopodium
Huperzia
Psilotum
Ophioglossum
Angiopteris
Marattia
Equisetum1
Equisetum2
Todea
Hymenophyllum
Llavea
Ceratopteris
Monilophytes
Microlepia
Leptosporangiate ferns
Dennstaedtia
Pteridium
Davallia
Polypodium
Asplenium
Dicksonia
Cyathea
Plagiogyria
Regnellidium
Marsilea
Azolla
Salvinia
Euphyllophytina
Lygodium
Amborella
Angiosperms
Nymphaea
Kadsura
Illicium
Austrobaileya
Acorus
Cycads
Cycas
Stangeria
Zamia
Ginkgo
Gnetales
Gnetum
Welwitschia
Ephedra
Larix
Pseudotsuga
Pinus
Gymnosperms
Picea
Abies
1C value (pg) (unordered)
Cedrus
Very small (≤1.40)
Araucaria
Conifers
Small (1.41–3.50)
Podocarpus
Intermediate (3.51–13.99)
Sciadopitys
Large (14–34.99)
Taxus
Very large (≥35)
Cephalotaxus
Equivocal
Cunninghamia
Sequoiadendron
FIGURE 13.4 MacClade reconstruction of genome size Metasequoia
diversification in land plants obtained using “all most Cupressus
parsimonious states” resolving option. (Modified from Juniperus
Leitch et al., 2004).
Sinauer Associates, Inc.
SOLTIS
Phylogeny and Evolution of the Angiosperms
SOL1304.eps
100% of size
1.27.5
EVOLUTION OF GENOME SIZE AND BASE CHROMOSOME NUMBER 293
but with liverworts and hornworts excluded (Figure There is also evidence for several independent
13.4), suggest that the original genome size of all land increases in genome size across land plants. One such
plants was equivocal. In contrast to the very small increase may have occurred in Huperzia of the lyco-
genomes of bryophytes, diverse C-values are evident phytes; the largest increase may have occurred in the
in different members of the lycophyte clade. Within the clade that contains Ophioglossaceae (Ophioglossum) +
lycophyte clade, Isoetes has an intermediate C-value, Psilotaceae (Psilotum; Figure 13.4). However, both of
and Selaginella has a very small C-value; the C-values these examples are dependent on the sampling of
of Lycopodium and Huperzia are small and large, respec- bryophytes and the trace option used in the reconstruc-
tively, based on the estimates now available. The an- tions. If all three bryophyte lineages are included, with
cestor of the lycophyte clade is reconstructed as equiv- DELTRAN optimization both Huperzia and Ophioglos-
ocal with the “all most parsimonious states” and saceae + Psilotaceae represent increases in genome size
DELTRAN options; with ACCTRAN the ancestral state is an (not shown). In other reconstructions (those using only
intermediate genome with an increase in genome size mosses and with “all most parsimonious states” and
occurring in Lycopodium and decreases in other taxa DELTRAN options) the ancestral states for these nodes are
(Selaginella and other species of Lycopodium) (Figure equivocal, and hence it is unclear whether they repre-
13.4). sent true increases of genome size. With all trace op-
Following the lycophyte clade, the ancestral state of tions, however, reconstructions indicate that gym-
all remaining vascular plants (the Euphyllophytina of nosperms had an ancestral genome size that was
Kenrick and Crane 1997) again depends on the “trace” intermediate with three separate increases in genome
option used in MacClade (Maddison and Maddison size occurring in various conifer lineages. Multiple in-
1992). With the “all most parsimonious states” and DEL- creases in genome size are also evident in the leptospo-
TRAN options, the ancestral condition of this clade—as rangiate ferns (i.e., Todea, Plagiogyria; Figure 13.4). In
well as the ancestral state of the angiosperms—is am- summary, genome size evolution across land plants ap-
biguous. If the accelerated transformation option (ACC- pears to have been highly dynamic, with both increas-
TRAN) is used, then the ancestor of the Euphyllophytina es and decreases (Leitch et al. 2004).
had an intermediate-sized genome, and the origin of the Returning to angiosperms, reconstructions indicate
angiosperms involved a decrease in genome size (Fig- that a very small genome size was ancestral throughout
ure 13.4). However, in analyses with all three bryophyte the basal angiosperms, the early-diverging eudicots, and
lineages included and with DELTRAN optimization, the also some of the major clades of core eudicots (e.g.,
ancestral state of Euphyllophytina is a very small Caryophyllales, Saxifragales, asterids; Figure 13.3).
genome, and angiosperms simply retained the ple- Among basal angiosperms, the ancestors of the mono-
siomorphic condition (reconstruction not shown). cot and magnoliid clades also appear to have had very
Regardless of the “trace” option used, reconstruc- small genomes. However, the Austrobaileyales clade
tions indicate that several independent decreases and is characterized by C-values in the small and interme-
increases in genome size have taken place across land diate ranges, rather than the very small range (Figure
plants. The evolution of Marsileaceae (represented by 13.3; Leitch and Hanson 2002). Austrobaileya (Austrobai-
Regnellidium and Pilularia) + Salviniaceae (represented leyaceae) has a C-value of 9.52 pg. The 1C-value of Illi-
by Salvinia) within the ferns clearly involved a major cium anisatum (Schisandraceae sensu APG II 2003) has
decrease in genome size (Figure 13.4). There are other been reported to be 3.40; genome size estimates report-
examples of apparent genome size decrease in the lep- ed for three species of Kadsura (Schisandraceae) range
tosporangiate ferns. In all reconstructions, ferns have from 7.4 to 8.9, which are in the small, and intermediate
an ancestral genome size that is intermediate; reduc- ranges (Figure 13.3), respectively. Therefore, not all
tions to small genomes apparently occurred independ- members of early-branching angiosperm lineages have
ently in Ceratopteris and Aspleniuim (Figure 13.4). very small genomes. Austrobaileyales may represent an
Additional examples of genome downsizing are evolutionary lineage that long ago experienced an in-
found in the gymnosperms. Considering groups with- crease in genome size (Figure 13.3; D. Soltis et al. 2003b).
in the gymnosperm clade, the mean C-values are: cy- Reconstructions also suggest that the original
cads, 14.71 pg; Ginkgo, 9.95 pg; Gnetales, 7.23 pg; genome size for monocots was very small (Figure 13.5).
Pinaceae, 22.02 pg; and other conifers, 11.89 pg. The Ceratophyllum, a possible sister group to all monocots
lowest C-values reported for gymnosperms are for Gne- (Zanis et al. 2002), has a very small genome (C-value of
tum (1C = 3.38), which appears to represent a decrease 1.0 pg), as does Acorus, the sister to all other monocots
in genome size within the clade of extant gymnosperms (mean C-value of 0.52; range, 0.40 to 0.65). Following
(Figure 13.4; D. Soltis et al. 2003b; Leitch et al. 2004). Acorus, Alismatales are sister to all remaining monocots
Larix may also represent an example of genome down- (see Chapter 4). However, reconstructions at the base
sizing just within the conifers (Figure 13.4). of the monocots are hampered by a lack of C-values for
294 CHAPTER 13
Ceratophyllum
Acoraceae
Alismataceae
Potamogetonaceae
Limnocharitaceae
Alismatales
Juncaginaceae
Hydrocharitaceae
Butomaceae
Aponogetonaceae
Araceae
Tofieldiacae
Cyclanthaceae
Pandanaceae
Taccaceae
Dioscoreaceae
Typhaceae
Bromeliaceae
Sparganiaceae
Commelinids
Poaceae
Juncaceae
Cyperaceae
Commelinaceae
Pontederiaceae
Zingiberales
Arecaceae
C-values (pg) Convallariaceae
Very small (≤1.4) Agavaceae
Small (>1.4– ≤3.5) Hyacinthaceae
Asparagales
Intermediate (>3.5– <14.0) Asphodelaceae
Large (≥14.0– <35) Asparagaceae
Very large (≥35) Alliaceae
Polymorphic Anthericaceae
Equivocal Amaryllidaceae
Iridaceae
Orchidaceae
FIGURE 13.5 Parsimony reconstruction of genome size Smilacaceae
diversification in the monocots obtained using the “all Liliaceae
most parsimonious states” option of MacClade. Families Melanthiaceae Liliales
indicated in bold have some representatives with “very Uvulariaceae
large” genomes (see text). The general topology is from Luzuriagaceae
Chase et al. (2000) (see text). 1C-values for families are Calochortaceae
from Leitch et al. (1998) and D. Soltis et al. (2003b). Alstromeriaceae
critical taxa; additional estimates are needed for early- large genome originated independently in the two
branching monocots. Nonetheless, small or very small clades. Orchidaceae and Iridaceae (successive sisters to
C-values are characteristic of other monocot families other Asparagales) contain an array of 1C-values, from
such as Typhaceae, Pandanaceae, Dioscoreaceae, and very small to large. Clarifying ancestral genome size
Bromeliaceae (Figure 13.5). and subsequent diversification in Asparagales and Lil-
A very large genome
Sinauer may Inc.
Associates, have evolved at least three iales will require additional genome size estimates as
SOLTIS
times independently in the monocots (Figure 13.5): (1) well as better resolution of relationships among the con-
Phylogeny and Evolution of the Angiosperms
in some Commelinaceae
SOL1305.eps
of the commelinid clade; (2) stituent members of these two clades.
in some members
100% ofof Liliaceae, Melanthiaceae, and Al-
size Some estimates of angiosperm phylogeny place Chlo-
stroemeriaceae1.27.5
of Liliales; and (3) in some Alliaceae and ranthaceae as sister to magnoliids and eudicots, but sup-
Asparagaceae of Asparagales. However, the number of port for this placement is low (see Chapter 3). Genome
origins of very large genomes in the latter two orders sizes for Chloranthaceae are in the small or intermedi-
is uncertain. Although some members of Asparagales ate range. Estimates for two species of Chloranthus are
and Liliales have very large genomes, most have large 1C = 2.90 and 3.59, respectively; Sarcandra has a C-value
or smaller genomes. It remains unclear, however, if the of 4.30 pg. Hence, Chloranthaceae may represent anoth-
common ancestor of these two large clades (Aspara- er ancient lineage that, like Austrobaileyales, experienced
gales and Liliales) had a very large genome, or if a very an early increase in genome size (Figure 13.3).
EVOLUTION OF GENOME SIZE AND BASE CHROMOSOME NUMBER 295
Character-state reconstructions indicate that a very to 8.90 pg. Polyploidy contributes to the two highest
small genome size is ancestral for the entire magnoli- values in Papaver (P. orientale, 1C = 8.90 pg, 2n = 42; P.
id clade, as well as for the Laurales + Magnoliales and setigerum, 1C = 6.70 pg, 2n = 44). The mean 1C-value for
Piperales + Canellales subclades (Figure 13.3). Within known diploids in Papaver is 2.2 pg. Within Papaver-
Canellales, Drimys (Winteraceae) has 1C = 1.13 pg. aceae, many early-branching members (based on the
Within Piperales, Saururaceae have very small topology of Hoot et al. 1997) have very small genome
genomes, with 1C estimates for Anemopsis, Houttuynia, sizes. Fumaria (1C = 0.55 pg), Glaucium (1C = 0.60 pg),
and Saururus of 0.8 pg, 1.3 pg, and 0.5 pg, respectively. Eschscholzia (1C = 1.10 pg), and Argemone (1C = 0.60 pg)
Piperaceae also have small or very small genomes. The all have very small genomes. Genome size estimates
mean value of 10 estimates for the genus Piper is 1C = for diploids and recent phylogenetic trees indicate that
1.25. In Laurales, genome size estimates for Lauraceae the ancestral genome size for Papaveraceae was very
and Calycanthaceae are very small (e.g., Persea ameri- small (1C <1.4 pg).
cana, 1C = 1.2 pg and Calycanthus, 1C = 0.98). Magno- The ancestral genome size for core eudicots is recon-
liales, including Myristicaceae and Magnoliaceae, also structed as equivocal. The single estimate for Gun-
have very small genomes. nerales, the sister to all other core eudicots, is 1C = 7.44
The reconstruction of a very small genome for the pg (using a species of Gunnera). Despite this relatively
common ancestor of extant Magnoliaceae is made more large C-value for Gunnera, a very small genome size
noteworthy because these plants are presumed ancient may be ancestral for most core eudicots. However, re-
polyploids (the diploid parents of which are now ex- constructions are hampered by the uncertainty sur-
tinct; Stebbins 1971), a conclusion supported by rounding the relationships of core eudicot lineages (see
isozyme data (D. Soltis and Soltis 1990). Similarly, Chapters 2 and 5). A very small genome size is recon-
Myristicaceae, Lauraceae, and Winteraceae are exam- structed as ancestral for Saxifragales, Caryophyllales,
ples of other basal lineages with high base chromosome and asterids. Although only three C-values are available
numbers (thought to be ancient polyploids) that also for Saxifragales, all are less than 1.4 pg (Figure 13.3).
have very small genomes. Thus, the original genome Within asterids, Cornales, followed by Ericales, rep-
sizes of the now-extinct “paleodiploid” Magnoliaceae resent the successive sisters to the euasterids (Albach
and other basal lineages such as Myristicaceae, Caly- et al. 2001a; B. Bremer et al. 2002). Diploid Cornales and
canthaceae, Lauraceae, and Winteraceae, were likely Ericales have 1C-values less than 1.4 pg; hence, the an-
even smaller than those estimated for modern taxa. cestral condition for the asterid clade is a very small
Likewise, the eudicot Aesculus hippocastaneum has a genome. Within the asterids, a large genome is found
very small genome (1C = 0.1 pg), but it, as well as the only in Adoxa (Adoxaceae; 1C = 14.30 pg), which oc-
entire genus Aesculus (Hippocastanaceae, now part of cupies a derived position within the euasterid clade
Sapindaceae), are ancient polyploids with 2n = 40 (D. (Figure 13.3) and is a polyploid with 2n = 36. Adoxa-
Soltis and Soltis 1990). It is also apparent, therefore, that ceae are part of euasterid II, and other diploids of this
presumed ancient polyploids do not necessarily have clade have either small or very small genomes.
large genomes. The ancestral genome size of the rosids is unclear.
The ancestral genome for eudicots appears to have Vitaceae appear to represent the sister to all other
been very small (Figure 13.3) based on reconstructions rosids, and Vitis has a very small genome (1C = 0.4 to
for key clades, such as Ranunculales and Proteales, as 0.6 pg for 21 diploids). However, relationships within
well as data for Buxaceae. New data for Platanus (Pla- the rosids remain poorly understood (see Chapter 8),
tanaceae; 1C = 1.30 pg) and Nelumbo (Nelumbonaceae; precluding the reconstruction of an ancestral genome
1C = 0.24 pg), combined with an earlier report for Gre- size for this large clade at this time.
villea (Proteaceae; 1C = 0.83 pg), further suggest that the Although Santalales are often considered to have a
ancestral genome for Proteales was very small (D. Soltis large genome (see Leitch et al. 1998), very large genomes
et al. 2003b). are confined to Viscum (Santalaceae) with C-values of
Ranunculales occupy an important position as sis- 76.0 and 79.3 pg. However, Viscum is clearly derived
ter to all other eudicots and may also have had a very within Santalales (Nickrent and Malécot 2001; see Chap-
small ancestral genome (D. Soltis et al. 2003b). Berberi- ter 6). C-values for other Santalaceae (sensu APG II 2003)
daceae have a very small average genome size (1C = are much lower. Loranthus has a C-value of 15.20 pg, and
1.35 pg for 11 diploids), as do Menispermaceae with 1C Lysiana has a mean C-value of 12.50 pg (based on values
= 0.70 pg. Papaveraceae also appear to have a very for five diploid species that range from 11.03 to 15.28).
small ancestral genome, but this is not evident from the Dendrophthoe (Santalaceae) has a mean C-value of only
mean value for the entire family. The average genome 4.3 pg (range, 2.7 to 6.2 pg), calculated from C-values for
size for Papaveraceae is 1C = 3.05 pg, but this estimate five diploids. The ancestral state for the family is there-
is heavily influenced by numerous estimates for species fore reconstructed as equivocal. Unfortunately, C-values
of the large genus Papaver, which range from 1C = 2.33 are not available for early-branching lineages of Santal-
296 CHAPTER 13
ales; additional C-values would be useful for inferring ever, these approaches are coarse-grained; careful eval-
patterns of genome size evolution. uation of the genetic obesity hypothesis within individ-
ual clades (e.g., within families and genera) is required.
Genetic obesity hypothesis The evolution of genome size at lower taxonomic lev-
Bennetzen and Kellogg (1997) proposed that genome els has rarely been examined in a phylogenetic context.
size evolution in plants would be largely unidirectional, In Pooideae (Poaceae), the ancestral grass genome size
with an overall pattern of increase due to the combined was inferred to be 2C = 3.5 pg (Kellogg 1998). Kellogg
influence of polyploidy and the accumulation of retroele- also provided evidence for a steady increase in genome
ments. These authors suggested that plants have a “one size in Pooideae, ultimately leading to the much larger
way ticket to genetic obesity.” However, the hypothesis genome sizes (2C = 10.7 pg or more) in the Triticeae.
of unidirectional increase in genome size has rarely been However, genome size decreased in some genera, in-
critically evaluated (Bennetzen and Kellogg 1997; Cox et cluding Oryza, Chloris, and Sorghum. In contrast to the
al. 1998; Wendel et al. 2002). Broad surveys of an- model of unidirectional change in genome size, in a clade
giosperms appear to be in general agreement with the of Gossypium (cotton) and allies (the cotton tribe, Gossyp-
genetic obesity hypothesis, with very large genomes con- ieae, Malvaceae), the frequency of genome size decreas-
fined to taxa that occupy derived positions within larg- es exceeded the examples of increases (Figure 13.6; Wen-
er clades (Leitch et al. 1998; D. Soltis et al. 2003b). How- del et al. 2002). In Gossypieae, the shortest tree recovered
0.01 substitutions/site
7, 99 2.0 Gossypium raimondii (2n = 26)
2.8, 3.3
2.8, 3.2
1.7, 1.8
1.2 Kokia drynarioides (2n = 24)
15, 94
3.7, 3.8 5.9 Hampea appendiculata (2n = 26)
8, 94
9, 100 3.2 Thespesia lampas (2n = 26)
4.6, 4.3
3.7, 3.3
FIGURE 13.6 Reconstruction of the evolution of genome numbers. Ancestral genome sizes were estimated using
size in the cotton tribe (Gossypieae). The shortest tree sum-of-squared-changes parsimony analysis (Maddison
recovered from parsimony analysis, inferred from CesA1 1991) and a generalized least squared method (Martins and
sequence data, is identical in topology to the maximum Hansen 1997). A linear (=Wagner) parsimony (Swofford
likelihood phylogeny shown here. The number of character- and Maddison 1987) reconstruction was also conducted,
state changes and jackknife support (%; in italics) from but these estimates are not shown. The former estimates
maximum parsimony analysis are shown above each inter- are shown in boxes on the internal branches. Inferred
nal branch. The tree is rooted with the outgroup Malva genome size increases are shown by shaded branches,
sylvestris (ingroup–outgroup branch length = 0.12). decreases are indicated by unfilled branches, and ambigui-
Genome sizes (in pg) are shown at branch tips before ties or stasis are denoted by hatched branches. (Modified
species names, which are followed by somatic chromosome from Wendel et al. 2002.)
EVOLUTION OF GENOME SIZE AND BASE CHROMOSOME NUMBER 297
Core Eudicots
Vitis
using the “all most parsimonious states” option. This
Saxifragales
reconstruction uses the actual numbers reported in the lit-
Caryophyllales
erature for many of the genera and families indicated.
Dilleniaceae
However, there is evidence for ancient polyploidy in many
Santalales
basal lineages. For those taxa thought to be ancient poly-
asterids
ploids, a hypothetical original base number has been sub-
Gunnera
stituted (see text). Most gymnosperms (outgroups) have n
Myrothamnus
= 11 or 12. This figure represents a simplified version of a
Tetracentron
larger reconstruction involving 172 taxa. The large rosid,
Trochodendron
asterid, Caryophyllales, Santalales, and Saxifragales clades
Buxus
have been reduced to a single terminal. The ancestral state Sabia
shown for each of these clades is that reconstructed using
Early-diverging eudicots
Nelumbo
the larger matrix. Placospermum
Platanus
Akebia
Decaisnea
Glaucidium
Ranunculus
Hydrastis
Nandina
Menispermum
Tinospora
Euptelea
Dicentra
Aristolochia
Lactoris
Asarum
Saruma
Houttuynia
Saururus
Peperomia
Piper
Calycanthus
Sassafras
Asimina
Myristica
Magnolia
Basal angiosperms
Eupomatia
Chloranthus
Hedyosmum
Ceratophyllum
Acorus
Araceae
Tofieldia
Alismataceae
Alocasia
Asparagus
Cyperus
Base chromosome Oryza
number (x =) Zea
(unordered) Oncidium
6 Lilium
7 Austrobaileya
8 Illicium
10 Schisandra
11 Nymphaea
12 Amborella
≥13 Gingko
Polymorphic Ephedra
Equivocal Pseudotsuga
n=8
Outgroup
n=7
Crassula
Crassula
clade
n=8
n=?
Tillaea
n=9
Cotyledon/Tylecodon
n=9
Kalanchoë
n = 17,
clade
18
n=8 Kalanchoë
n=8
n=9
Adromischus
n = 15, 18
Aeonium
Aeonium
clade
n=8
n=8
n=8 Sedum spp.
n=8
n = 6, 7 Leucosedum
Leucosedum
n=? clade
n=8 n = 10
Acre Acre clade
Sempervivum
n = 16, 17, 18, 19
Sempervivum/Jovibarba
clade
n=8 n=8
n = 28 Sedum sect. Rupestre
n = 12, 16, 24
Telephium Telephium clade
FIGURE 13.9 Reconstruction of the evolution of base chro- Crassulaceae. Numbers above branches are the base chro-
mosome number in Crassulaceae. Simplified summary mosome numbers reconstructed using MacClade.
topology of relationships among the major clades in (Redrawn from Mort et al. 2001.)
When the results of phylogenetic analyses are com- Jovibarba clade; n = 12, 16, 24 typify the Telephium
pared with other sources of data (e.g., morphology, clade (Figure 13.9; Mort et al. 2001).
chemistry, chromosome number), it becomes apparent 3. In Saxifragaceae, molecular analyses identified
that chromosome number is often an effective indica- clades of genera that differ from the traditional
tor of relationship. We provide here a sample of clades taxonomy for the family. Chrysosplenium and
in which variation in chromosome number corresponds Peltoboykinia differ significantly in morphology
closely to phylogeny. (placed in different tribes or subtribes), but form a
1. Rosaceae have traditionally been divided into well-supported clade. They are also unusual in
subfamilies by fruit type (e.g., achene, pome, having x = 11 (D. Soltis et al. 2001a, 2001b); most
drupe). However, molecular phylogenies indicate genera of Saxifragaceae have x = 7. Another group
that some fruit types are homoplasious in the fam- of six genera (now called the Darmera group),
ily. In contrast, base chromosome number is in although not previously considered closely relat-
close agreement with the clades recovered (see ed, form a well-supported clade; these genera are
Chapter 8; Morgan et al. 1994). distinctive in the family in having 2n = 36 (x = 18)
or higher (D. Soltis et al. 2001b).
2. In Crassulaceae, a haploid number of n = 8 is com-
mon, and phylogenetic analyses indicate that this 4. In subfamily Pooideae (Poaceae), the core pooids
number is ancestral (Figure 13.9). A base number of (Poeae, Aveneae, Triticeae, and Bromeae) all have
n = 9 unites members of the Kalenchoë clade; n = 6, 7 x = 7, whereas other Pooideae exhibit a diverse
unites the Leucosedum clade; n = 10 unites the Acre array of numbers ranging from x = 5 to 13
clade. Higher numbers, representing episodes of (Kellogg 1998). This situation is particularly excit-
both polyploidy and aneuploidy, unite other clades: ing because the large cytogenetic, genomic, and
n = 16, 17, 18, 19 characterize the Sempervivum– genetic database for Pooideae, coupled with phy-
Sinauer Associates, Inc.
SOLTIS
Phylogeny and Evolution of the Angiosperms
SOL1309.eps
100% of size
1.27.5
302 CHAPTER 13
logenetic information, will ultimately make it pos- Frequent genome contractions—as well as expan-
sible to infer the mechanisms and directionality of sions—have occurred in the course of angiosperm (and
chromosomal change throughout the group. land plant) diversification. However, many more case-
5. In the nitrogen-fixing clade (Rosales, Fagales, by-case investigations are needed to evaluate the dy-
Fabales, and Cucurbitales of eurosid I), base chro- namics of genome size evolution. We still know surpris-
mosome number agrees in large part with relation- ingly little about the mechanisms and causes of changes
ships found using DNA sequence data. Molecular in genome size, particularly genomic downsizing.
phylogenetic studies indicated that Cannabaceae Thus, future studies should focus more on the mechan-
are derived from within Celtidaceae (Sytsma et al. ics of alterations in genome size.
2002). APG II (2003) therefore recognized an The reconstructions of ancestral base chromosome
expanded Cannabaceae that also includes mem- numbers that we conducted for this book are prelimi-
bers of the former Celtidaceae. An expanded nary. Analyses that more rigorously reconstruct diver-
Cannabaceae is also supported by other lines of sification of chromosome numbers within families and
evidence, including chromosome number. genera would be useful. However, because chromoso-
Celtidaceae and Cannabaceae are the only mem- mal evolution is so complex, additional large-scale re-
bers of the nitrogen-fixing clade to possess n = 10 constructions of chromosomal evolution may be of lit-
(Sytsma et al. 2002). tle value.
Isozyme data provide support for ancient poly-
Other examples of the phylogenetic utility of base ploidy in some basal lineages, but no genetic data are
chromosome numbers include Onagraceae (Levin et al. available for many basal lineages. Genetic data are
2003) and Iridaceae (Goldblatt et al. 2002). needed to test the hypothesis that many basal lineag-
es having only high chromosome numbers are ancient
polyploids. For example, is Amborella with 2n = 26 an
Future Studies ancient polyploid, as we suspect, and if so, how many
Considerable progress has been made in the past few rounds of genome duplication has it experienced? With
years in achieving a better understanding of genome the rapidly increasing application of genomic tools to
diversification in the angiosperms. Whereas new C-val- angiosperms, it should soon be possible to address
ues and a phylogenetic approach permitted an im- these questions. In addition, genomic approaches will
proved understanding of genome size evolution in the eventually permit the evaluation of evolutionary trends
angiosperms, to make further progress, additional C- in genome size, as well as genome organization and
values are needed. For example, data are needed in structure. Such studies have already been conducted in
many clades, including basal Santalales and basal some groups, such as the grasses and Brassicaceae. Ge-
monocots. Genome size diversification is much more nomic data for other angiosperms and seed plants will
dynamic than the unidirectional trend toward “genom- ultimately provide additional critical information about
ic obesity” proposed by Bennetzen and Kellogg (1997). genome evolution in flowering plants.