Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Brssow
doi:10.1111/j.1365-2958.2004.04113.x
MicroReview
The impact of prophages on bacterial chromosomes
Introduction
Microbial genomics revealed that, in some bacteria, substantial amounts of the bacterial DNA is in fact prophage
DNA (Canchaya et al., 2003; Casjens, 2003). It also
became increasingly clear that prophage DNA has played
an important role in the evolution of bacterial pathogenicity (Boyd and Brssow, 2002). These data have changed
our understanding of phagebacterium interaction from a
simple parasitehost relationship into a two-way coevolution of viral and bacterial genomes. Here, we provide
a short overview on the impact of prophage integration on
bacterial genome structure and diversification.
Methodological problems of prophage identification
Prophage identification is not an exact science, but a
labour-intensive, empirical approach that needs a lot of
insight (Casjens, 2003). To keep pace with the increasing
number of sequenced bacterial genomes, a simple script
was written in our laboratory (Fournous, 2003) that transforms the GenBank file of a bacterial genome into a FASTA
file of all its open reading frames (ORFs). A BLASTX search
is then conducted with each individual ORF, and the output of the significant database matches is searched by a
semantic program for its annotations using positive (e.g.
phage, integrase, tail, capsid, terminase, portal) and negative (e.g. macrophage, transposase, transposon, insertion) keywords. The hits are plotted for each ORF position
of the genome, and a further script transforms this hit list
into prophage genomes by performing a neighbour analysis for further phage-like genes around each individual
hit. An example of the automatic data output is shown for
Salmonella enterica serovar Typhi strain CT-18. It shows
all phage hits (ticks on the outer circle) and the prophage
identification by neighbour analysis (green bars) (Fig. 1A).
Differences between the automatic and manual annotations by Casjens (2003) shown in the second circle were
minimal and concern only remotely phage-like elements
(Sti5, Sti6; Fig. 1A, centre).
The maps of the candidate prophage elements are
shown in Fig. 1A. Sti1 and Sti4 are clearly lambda-like
prophages in their genetic organization, but not in their
sequence; the structural genes resemble a Photorhabdus
ori
Sti10
Sti9
R1
1:
4:
3:
8:
9:
Sti8
I 2
3 4T
I
Sti1
5
10:
2:
4:
5:
6:
7:
6
7
8
Sti2
1
T
Sti3
Sti4
Sti7
Sti6
Sti5
ter
ori
C
n
ter
prophage. Sti3 is a Mu-like whereas Sti8 and Sti9 are P2like prophages. The well-known gene map of these three
phage types allowed the tentative identification of morons
(extra non-phage genes inserted into the genomes of
temperate phages) (Juhala et al., 2000). Notably, the
morons encode important candidate virulence factors for
P
this typhoid fever-causing Salmonella serovar (see
Fig. 1A legend for annotations). Sti10 resembles a P4-like
prophage and also contains a potential moron (Fig. 1A).
Sti7 represents a tail gene cluster from a P2-like phage
flanked by a pin invertase. The presence of recombinases
(integrase, transposase, invertase) points to the mobile
2004 Blackwell Publishing Ltd, Molecular Microbiology, 53, 918
Prophagechromosome interaction 11
character of these DNA elements (Fig. 1A). In contrast,
Sti2 and 5 show isolated phage genes that flank pertussislike toxin genes (Sti2) and Salmonella serovar Typhimurium virulence genes (sspH and msgA) (Sti5). Sti5 and 6
were identified by a manual search (Casjens, 2003), but
not by the automatic program (Fig. 1A). On the basis of
theoretical models, prophages are predicted to decay by
deletion of phage genes resulting in prophage remnants.
In contrast, genes conferring a selective advantage such
as a moron should be retained (Lawrence et al., 2003).
Sti5 fits this prediction, but its prophage derivation must
await the discovery of related, but less decayed prophage
elements.
Distribution and orientation of prophages
One hundred and ninety prophages identified by the program in 115 sequenced prokaryotic genomes were projected on a hypothetical replichore (one half of the
chromosome) (Fig. 1B). The position of the prophages is
given as a percentage of the distance from ori (origin) to
ter (terminus of replication) regardless of which half (left
or right) of the chromosome the prophages were identified
on. The representation does not take account of the variable length of the different bacterial replichores. Overall,
the prophages were relatively evenly distributed, but the
density of the prophages was greater near ter than near
ori. Approximately half the sequenced prokaryotic
genomes do not contain either prophages or well-defined
remnants (Fig. 1C). This group contains all sequenced
Archaea and most intracellular eubacterial pathogens. It
has been argued that the latter have undergone recent
genome contractions in which all non-essential genes for
the intracellular niche have been lost. Prophages have,
however, been observed in insect endosymbionts (phage
APSE-1 and Wolbachia prophages), suggesting that intracellular bacteria can harbour phages and prophages. A
quarter of all genomes contain one or two prophages.
Approximately one-tenth of the genomes contributed the
majority of the prophage sequences (Fig. 1C). Bacteria
containing six or more prophages in their genome are
mainly pathogens, with the notable exception of Lactococcus lactis, an organism used in cheese fermentation and
under extreme pressure from bacteriophages. The comparisons of genomes from the same species suggest a
stochastic process: prophages might be present or absent
(e.g. Streptococcus agalactiae) (Fig. 2A) or different
strains within a species may contain one, two or three
prophages (e.g. Staphylococcus aureus) (Fig. 2B).
Prophages showed a preferred orientation for integration: the structural genes pointed mostly in the direction
of the majority of the surrounding bacterial genes. When
prophages changed position from the right to the left replichore, they also changed their orientation (Smoot et al.,
2004 Blackwell Publishing Ltd, Molecular Microbiology, 53, 918
2002a; Nakagawa et al., 2003; see also the circled prophage in Fig. 2B). This strong bias in phage orientation is
still unexplained, although avoidance of RNA and DNA
polymerase collision and interference with the normal
functioning of terminus functions (dif site) have been proposed (Campbell, 2002).
Prophages contribute to the genetic individuality of a
bacterial strain
The role of mobile DNA in the diversification of bacterial
genomes becomes apparent when the sequenced
genomes from two different strains of the same species
are aligned in a dot plot. For example, the alignment of
two S. agalactiae serotypes showed about a dozen small
gaps (Fig. 2A). The gaps corresponded nearly exclusively
to mobile DNA; bioinformatic analysis revealed integrative
plasmids, transposons and two prophages. Ten gaps
showed an atypical nucleotide composition suggesting
lateral gene transfer, and eight of the genomic islands
were associated with integrases/transposases (Glaser
et al., 2002). The variable genome parts defined by comparative genomics (Glaser et al., 2002), dot plot alignment
and microarray hybridization (Tettelin et al., 2002) showed
excellent concordance. The relative contribution of prophages to the strain-specific DNA varied sometimes for
strains from the same species. In some Staphylococcus
aureus strain comparisons, prophages were the major
contributors to variability (Fig. 2B) (Kuroda et al., 2001),
whereas in other comparisons, they competed with
genomic islands and transposons for this role (Baba et al.,
2002). An extreme case is presented by Streptococcus
pyogenes in which all major gaps in the alignment of
different M serotypes could be traced to prophage integration events (Smoot et al., 2002a). As all S. pyogenes
and many S. aureus prophages encode proven or suspected virulence factors, the prophage-imposed diversity
between the strains might be of clinical relevance (Beres
et al., 2002).
The role of prophages in the individuality of the
strains is not restricted to Gram-positive bacteria, but
can also be seen in Gram-negative bacteria. Two Salmonella Typhi strains could be aligned over the entire
genome when allowing for a large chromosomal inversion across two rRNA gene clusters. From 113 ORFs
specific for one or other strain, 76 were prophage genes
(Deng et al., 2002). In a Salmonella serovar Typhi and
Typhimurium comparison, two chromosomal inversions
and 12 larger alignment gaps were identified. Nine of
the gaps can be traced to prophages or prophage remnants. The gaps represent either prophage insertion/
deletion events or prophage replacements at a given
chromosomal position. Escherichia coli can serve as a
dramatic example in which half the 1 Mb DNA that dis-
A
2000000
2603V/R
SA2
Tn916
SA1
MSSA476
2500000
SCC
23
ori
Lactobacillus johnsonii
Prophagechromosome interaction 13
tal non-toxigenic O1 El Tor isolate lacked the CTX
prophage encoding the cholera toxin CT, identifying
this prophage as a major determinant of genetic variability. Microarray analysis showed that lateral gene
transfer has played a fundamental role in the diversification of S. aureus: up to 22% of the genome comprised variable genetic material. Prophages comprised
17% of the total amount of 250 kb mobile and variable
DNA (Fitzgerald et al., 2001). As prophages from bacterial pathogens frequently encode virulence factors,
one could suspect that the analysis of pathogenic bacteria overestimates the contribution of prophages to
the genetic individuality of a given strain. However,
similar data were obtained with the gut commensal
Lactobacillus johnsonii. Hybridization of eight molecularly distinct strains of L. johnsonii revealed that the
prophage DNA contributed approximately half the
strain-specific DNA of the reference strain (Fig. 2C)
(Ventura et al., 2003a). When lactobacilli belonging to
seven different Lactobacillus species were included in
that analysis, genes related to the prophages of the
reference strain were not detected. This observation
demonstrates that mobile DNA is not necessarily
widely distributed across species borders.
3'
4'
5'
6'
2'
M3 315
1'
1'
B
6
5
com
1
2
3
4
ssa
M3 SSI-1
ssa
2'
com
B
A
? Xt1
XfP1
9.a.5.c
XfP2
2500000
Xt10
2500000
observed
C
D
Xt8
Xt2
reconstructed
Prophagechromosome interaction 15
tional protein. For example, lambdoid phage 21 inserts
within the isocitrate dehydrogenase gene and introduces
an alternative 165 bp 3 end for that gene (Campbell et al.,
1992). In rare cases, the phage recombination site attP is
located within the phage integrase gene, and prophage
integration leads to an altered int gene, which apparently
stabilizes the lysogen (Magrini et al., 1999), or the bacterial attB site complements the int gene (Bruttin et al.,
1997).
et al., 2002). In other bacteria, rDNA and duplicated bacterial genes were used for genome rearrangements (com
genes in the case of Fig. 3A).
Integration sites
Many prophages from both Gram-negative and Grampositive bacteria integrate into tRNA genes in a preferred
orientation (Campbell, 1992; 2002). Mostly, but not always
(Ventura et al., 2003a), the attP site of the phage reconstitutes the tRNA upon integration. Interestingly, an
increasing number of prophages are described that carry
tRNA genes (Fig. 5D). The phage-encoded tRNA differed
from the chromosomal tRNA and were transcribed in the
lysogen (Ventura et al., 2004). Such constellations might
alleviate the consequences of prophage integration
events into tRNA genes that do not reconstitute the original tRNA gene. However, integration into tRNA genes is
far from being universal. Integration into intergenic regions
and into ORFs has also been described (Canchaya et al.,
2002). The prophage attachment site frequently faithfully
complements the coding sequence of the protein. However, cases of inactivation of the protein-encoding function
have also been described, a well-known case being the
negative lysogenic conversion of the lipase gene by an S.
aureus phage (Lee and Iandolo, 1986). In addition, integration into an ORF can also lead to an altered but func-
ori
2356
124
irp6A
2162
2159 pi3
adhA
DT
539
SPIF
625
pi2
mf2spd1
speC
1734
hmuO
pi1
mf2
922
1520
D
mf4
894
1061
dtxR
1296
Prophagechromosome interaction 17
The mechanism would no doubt be characterized by fixation of particular phage genes, the function of which has
been co-opted by the host. Prophage genes without selective value to the host are therefore likely to be deleted.
Owing to the lack of surrounding prophage genes, completely fixed prophage genes are thus difficult to detect.
In the case of bacterial pathogens, a conspicuous observation is virulence genes flanked by isolated phage-like
genes. Examples are Shiga toxin genes in Shigella dysenteriae (McDonough and Butterton, 1999), sopE2 in Salmonella Typhimurium, sspH and pertussis-like toxin genes
in Salmonella Typhi (Fig. 1A). As related genes are found
as lysogenic conversion genes in well-established prophages (Stx prophages in E. coli O157, SopE prophages in
S. Typhimurium), we probably have here examples of fixed
prophage genes. As these genes are likely to extend the
ecological range of their bacterial hosts, prophages might
have a greater impact on bacterial genomes than is indicated by the presence of clear-cut prophage DNA
sequences in the sequenced bacterial genomes.
Acknowledgements
We thank Sherwood Casjens, Chris Blake, Anne Constable
and Anne Bruttin for critical reading of the manuscript, and
the Swiss National Science foundation for financial support
to Carlos Canchaya (research grant 5002-057832).
References
Baba, T., Takeuchi, F., Kuroda, M., Yuzawa, H., Aoki, K.,
Oguchi, A., et al. (2002) Genome and virulence determinants of high virulence community-acquired MRSA. Lancet
359: 18191827.
Barondess, J.J., and Beckwith, J. (1990) A bacterial virulence
determinant encoded by lysogenic coliphage lambda.
Nature 346: 871874.
Beres, S.B., Sylva, G.L., Barbian, K.D., Lei, B., Hoff, J.S.,
Mammarella, N.D., et al. (2002) Genome sequence of a
serotype M3 strain of group A Streptococcus: phageencoded toxins, the high-virulence phenotype, and clone
emergence. Proc Natl Acad Sci USA 99: 1007810083.
Boyd, E.F., and Brssow, H. (2002) Common themes among
bacteriophage-encoded virulence factors and diversity
among the bacteriophages involved. Trends Microbiol 10:
521529.
Broudy, T.B., and Fischetti, V.A. (2003) In vivo lysogenic
conversion of Tox() Streptococcus pyogenes to Tox(+)
with lysogenic streptococci or free phage. Infect Immun 71:
37823786.
Broudy, T.B., Pancholi, V., and Fischetti, V.A. (2001)
Induction of lysogenic bacteriophage and phageassociated toxin from group A streptococci during coculture with human pharyngeal cells. Infect Immun 69:
14401443.
Broudy, T.B., Pancholi, V., and Fischetti, V.A. (2002) The in
vitro interaction of Streptococcus pyogenes with human
2004 Blackwell Publishing Ltd, Molecular Microbiology, 53, 918