Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
A problem approach
Fundamentals and Practice - I
Fifth edition
Pranav Kumar
Praveen Verma
Usha Mina
Biotechnology
A problem approach
Fifth edition
Pranav Kumar
Former faculty,
Department of Biotechnology,
Jamia Millia Islamia,
New Delhi, India
Praveen Verma
Scientist VI and Group Leader,
National Institute of Plant Genome Research (NIPGR),
New Delhi, India
Usha Mina
Senior Scientist,
CESCRA,
Indian Agricultural Research Institute (IARI),
New Delhi, India
Pathfinder Publication
New Delhi, India
Pranav Kumar
Former faculty,
Department of Biotechnology,
Jamia Millia Islamia,
New Delhi, India
Praveen Verma
Scientist VI and Group Leader,
National Institute of Plant Genome Research (NIPGR),
New Delhi, India
Usha Mina
Senior Scientist,
CESCRA,
Indian Agricultural Research Institute (IARI),
New Delhi, India
Biotechnology: A problem approach, Fifth edition
Pathfinder Publication
A unit of Pathfinder Academy Private Limited, New Delhi, India.
pathfinderpublication.in
Preface
The present century has been considered as one that belongs to biotechnology because
it has an unlimited potential to produce an extensive range of valuable products. This
branch of science has been viewed as something vital for life with numerous scientific
applications in several fields of human endeavours. The branch of science is significant for
mankind that many of the big discoveries of the second half of the last century and early this
century would not have been possible in the absence of our accomplishments in this disci-
pline. Biotechnology – A problem approach, covers the basic concepts, methodologies and
applications of biotechnology. This book provides a balanced introduction to all major areas of
the subject. The chapters such as Biomolecules and catalysis, Bioenergetics and metabolism,
Cell structure and functions, Immunology, Genetics, Bioinformatics and Bioprocess engineer-
ing were selected in a sharply focused manner without overwhelming or excessive details.
Sincere efforts have been made to support textual clarifications and explanations with the help
of flow charts, figures and tables to make learning easy and convincing. The chapters have been
supplemented with self-tests and questions so as to check one’s own level of knowledge.
This book has been designed to serve as a comprehensive biotechnology textbook as well as
a wide-ranging reference book.
Acknowledgements
Our students were the original inspiration for the first edition of this book, and we remain
continually grateful to all of them, because we learn from them how to think about the life
sciences and how to communicate knowledge in most meaningful way. We thank, Abhai Kumar,
Rizwan Ansari, Sarika Srivastava, Shashi Prakash Singh, Lekha Nath and Mr. Ajay Kumar,
reviewers of this book, whose comments and suggestions were invaluable in improving the
text. Any book of this kind requires meticulous and painstaking efforts by all its contribu-
tors. Several diligent and hardworking minds have come together to bring out this book in
this complete form. This book is a team effort, and producing it would be impossible without
the outstanding people of Pathfinder Publication. It was a pleasure to work with many other
dedicated and creative people especially Pradeep Verma of Pathfinder Publication during the
production of this book.
Pranav Kumar
Praveen Verma
Usha Mina
iii
Contents
Chapter 1
Biomolecules and Catalysis
1.1 Amino acids and Proteins 1
1.2.1 Collagen 23
1.2.2 Elastin 24
1.2.3 Keratins 25
1.2.4 Myoglobin 25
1.2.5 Hemoglobin 27
1.3.2 Amyloid 36
1.5.1 Nucleotides 48
1.6.1 B-DNA 53
1.6.2 Z-DNA 55
v
1.6.4 G-quadruplex 56
1.7 RNA 63
1.8 Carbohydrates 66
1.8.1 Monosaccharide 66
1.8.2 Epimers 68
1.8.6 Polysaccharides 73
1.8.7 Glycoproteins 76
1.9 Lipids 76
1.9.3 Phospholipids 80
1.9.4 Glycolipids 82
1.9.5 Steroid 83
1.9.6 Eicosanoid 83
1.10 Vitamins 86
1.12 Enzymes 93
vi
Chapter 2
Bioenergetics and Metabolism
2.1 Bioenergetics 123
vii
2.8 Photorespiration 185
Chapter 3
Cell Structure and Functions
3.1 What is a Cell? 243
viii
3.7.1 Transport of proteins through cisternae 292
ix
3.23 Apoptosis 382
Chapter 4
Prokaryotes and Viruses
4.1 General features of Prokaryotes 397
4.2 Phylogenetic overview 398
4.3 Structure of bacterial cell 398
4.4 Bacterial genome : Bacterial chromosome and plasmid 409
4.5 Bacterial nutrition 413
4.5.1 Culture media 415
4.5.2 Bacterial growth 416
4.6 Horizontal gene transfer and genetic recombination 419
4.6.1 Transformation 420
4.6.2 Transduction 422
4.6.3 Conjugation 426
4.7 Bacterial taxonomy 431
4.8 General features of important bacterial groups 432
Chapter 5
Immunology
5.1 Innate immunity 469
5.2 Adaptive immunity 471
5.3 Cells of the immune system 473
5.3.1 Lymphoid progenitor 474
5.3.2 Myeloid progenitor 476
5.4 Organs involved in the adaptive immune response 477
5.4.1 Primary lymphoid organs 477
x
5.4.2 Secondary lymphoid organs/tissues 478
5.5 Antigens 479
5.6 Major-histocompatibility complex 483
5.6.1 MHC molecules and antigen presentation 485
5.6.2 Antigen processing and presentation 486
5.6.3 Laboratory mice 488
5.7 Immunoglobulins : Structure and function 489
5.7.1 Basic structure of antibody molecule 489
5.7.2 Different classes of immunoglobulin 491
5.7.3 Action of antibody 494
5.7.4 Antigenic determinants on immunoglobulins 494
5.8 B-cell maturation and activation 496
5.9 Kinetics of the antibody response 502
5.10 Monoclonal antibodies and Hybridoma technology 503
5.10.1 Engineered monoclonal antibodies 504
5.11 Organization and expression of Ig genes 506
5.12 Generation of antibody diversity 512
5.13 T-cells and CMI 515
5.13.1 Superantigens 525
5.14 Cytokines 526
5.15 The complement system 529
5.16 Hypersensitivity 533
5.17 Autoimmunity 535
Chapter 6
Genetics
6.1 Mendel’s principles 545
xi
6.3.3 Duplicate recessive epistasis 562
xii
6.10.8 Human nuclear genome 631
xiii
6.18 Regulation of gene transcription 711
Chapter 7
Recombinant DNA technology
7.1 DNA cloning 797
xiv
7.2.4 Ligases 805
xv
7.18.5 Somatic hybridization and cybridization 874
Chapter 8
Bioprocess engineering
8.1 Concept of material and energy balance 887
xvi
Chapter 9
Bioinformatics
9.1 Introduction 954
Index 981
xvii
Chapter 01
Biomolecules and Catalysis
A biomolecule is a carbon-based organic compound that is produced by a living organism. More than 25 naturally
occurring chemical elements are found in biomolecules, but these biomolecules consist primarily of carbon, hydrogen,
nitrogen, oxygen, phosphorus and sulfur. In terms of the percentage of the total number of atoms, four elements
such as hydrogen, oxygen, nitrogen and carbon together make up over 99% of the mass of most cells.
Biomolecules include both small as well as large molecules. The small biomolecules are low molecular weight (less
than 1000) compound which include sugars, fatty acids, amino acids, nucleotides, vitamins, hormones,
neurotransmitters, primary and secondary metabolites. Sugars, fatty acids, amino acids and nucleotides constitute
the four major families of small biomolecules in cells. Large biomolecules which have high molecular weight are
called macromolecules and mostly are polymers of small biomolecules. These macromolecules are proteins,
carbohydrates and nucleic acids.
Small biomolecules Macromolecules
Sugars Polysaccharides
Amino acids Polypeptides (proteins)
Nucleotides Nucleic acids
Fatty acids
Nucleic acids and proteins are informational macromolecules. Proteins are polymers of amino acids and constitute
the largest fraction (besides water) of cells. The nucleic acids, DNA and RNA, are polymers of nucleotides. They
store, transmit, and translate genetic information. The polysaccharides, polymers of simple sugars, have two
major functions. They serve as energy-yielding fuel stores and as extracellular structural elements.
a-carboxyl group
—
COO
+
a-amino group H3N Ca H
R
Side chain
1
Biomolecules and Catalysis
This structure is common to all except one of the α-amino acids (proline is the exception). The R group or side chain
attached to the α-carbon is different in each amino acid. In the simplest case, the R group is a hydrogen atom and
amino acid is glycine.
— —
COO COO
+ +
H3N Ca H H3N Ca H
H=R b CH2
Glycine g CH
2
d CH2 R
e CH2
+
NH 3
Lysine
In α-amino acids both the amino group and the carboxyl group are attached to the same carbon atom. However,
many naturally occurring amino acids not found in protein, have structures that differ from the α-amino acids. In
these compounds the amino group is attached to a carbon atom other than the α-carbon atom and they are called
β, γ, δ, or ε amino acids depending upon the location of the C-atom to which amino group is attached.
— —
COOH COO COO
+ +
H3N Ca H H3N Ca H H2N Ca H
R R R
Figure 1.3 The acid-base behaviour of an amino acid in solution. At low pH, the positively charged species
predominates. As the pH increases, the electrically neutral zwitterion becomes predominant. At higher pH, the
negatively charged species predominates.
2
This page intentionally left blank.
Biomolecules and Catalysis
Glycine Alanine
Optical activity is measured by polarimeter. Optical activity is the ability of an optically active compound to rotate
the plane of linearly polarized light. Optical rotation is a quantitative measure of the rotation of light caused by the
compound. The magnitude of optical rotation indicates the extent to which plane of linearly polarized light is rotated
and sign represents the direction of rotation. Optical rotation of an optically active compound depends on the
concentration of the compound, temperature, wavelength of light used, solvent used to dissolve the sample and
light pathlength. The optical rotation of a solution at a given temperature and wavelength is given by
Å = [α]Tλ × C × l
Specific rotation is the reference value of optical rotation for a given concentration of compound at a given
temperature and fixed wavelength. At a given temperature and for a given wavelength of light, the specific rotation
is defined as the observed value of optical rotation when plane polarized light is passed through a sample with a
path length of 1 decimeter and a sample concentration of 1g per milliliter.
Sample tube
containing a
chiral compound
Figure 1.5 When plane polarized light is passed through a solution that contains an optically active compound,
there is net rotation of the plane polarized light. The light is rotated either clockwise (dextrorotatory) or
counterclockwise (levorotatory) by an angle that depends on the molecular structure and concentration of
the compound, the pathlength and the wavelength of the light.
3
This page intentionally left blank.
Biomolecules and Catalysis
peptides are cyclic in nature. Two cyclic decapeptides (peptides containing 10 amino acid residues) produced by the
bacterium Bacillus brevis are common examples. Both of these peptides, gramicidin S and tyrocidine A, are
antibiotics, and both contain D-amino acids as well as L-amino acids. In addition, both contain the amino acid
ornithine, which does not occur in proteins. Small peptides play many roles in organisms. Some, such as oxytocin
and vasopressin, are important hormones. Others, like glutathione, regulate oxidation–reduction reactions. Still
others, such as enkephalins, are naturally occurring painkillers. Aspartame is a commercially synthesized dipeptide,
L-aspartylphenylalanyl methylester, and is used as an artificial sweetener.
When many amino acid residues are joined, the product is called a polypeptide. Amino acids which have been
incorporated into a peptide or polypeptide are termed amino acid residues. By convention, in a polypeptide the left
end represented by the first amino acid while the right end represented by the last amino acid. The first amino acid
is also called as N-terminal amino acid residue. The last amino acid is called the C-terminal amino acid residue.
H O H O H O
N-terminal C-terminal
H2N C C N C C N C C OH
R H R H R
Figure 1.11 A series of amino acids joined by peptide bonds form a polypeptide chain, and each amino acid
unit in a polypeptide is called a residue. A polypeptide chain has polarity because its ends are different, with
an α-amino group at one end and an α-carboxyl group at the other.
The peptide bonds in proteins are formed between the α-amino and the α-carboxyl groups. But peptides do occur
naturally where the peptide linkage involves a carboxyl or amino group which is attached to a carbon atom other
than the α-carbon. For example a dipeptide formed between the γ -carboxyl group of glutamic acid and the amino
group of alanine is called γ-glutamylalanine.
H O H O
H2N C C OH H N C C OH
R1 H R2
H2O
H O H O
H2N C C N C C OH
R1 H R2
Figure 1.12 The formation of a peptide bond (also called an amide bond) between the α-carboxyl group of
one amino acid to the α-amino group of another amino acid is accompanied by the loss of a water molecule.
13
Biomolecules and Catalysis
HN NH
+ +
HC (CH2)2 CH2 CH2 NH3 H3N CH2 CH2 (CH2)2 CH
O C C O
Lysine Lysine
HN NH
O C C O
Allysine Allysine
HN NH
O C CHO C O
Aldol cross-link
Figure 1.21 Intramolecular cross-links form between allysine after oxidative deamination of ε-amino groups
of lysine residues. Such aldehydes of two side chains then link covalently in a spontaneous nonenzymatic
aldol condensation.
The intermolecular cross-linking of tropocollagens involves the formation of a unique hydroxypyridinium structure
from one lysine and two hydroxylysine residues.
1.2.2 Elastin
Elastin is a highly hydrophobic connective tissue protein that is responsible for extensibility and elasticity. It is the
second major protein in the extracellular matrix, which is the main component of elastic fibers found in ligaments,
large arteries, and lungs. After synthesis, a 72 kDa molecule of soluble tropoelastin is secreted into the matrix. This
protein is rich in nonpolar amino acid and unusually rich in proline and glycine. Unlike collagen it is not glycosylated
and contain some hydroxyproline but no hydroxylysine. The tropoelastin is composed largely of two types of short
sequences that alternate along the polypeptide chain: hydrophobic segments and alanine and lysine rich segments.
After secretion, the tropoelastin molecules become highly cross-linked to one another, generating an extensive
network of elastin fibers. After secretion from the cell, certain lysyl residues of tropoelastin are oxidatively deami-
nated to aldehydes by lysyl oxidase. The condensation of three of these lysine-derived aldehydes with an unmodified
lysine results in formation of a tetrafunctional cross-link called desmosines. Once cross-linked in its mature, extra-
cellular form, elastin is highly insoluble and extremely stable.
24
This page intentionally left blank.
Biomolecules and Catalysis
H H O
N C C
(CH2)2
H N N H
CH2
H C CH2 H2C CH2 CH2 C H
O C C O
+
N
CH2
(CH2)3
N C C
H H O
1.2.3 Keratins
Keratins are fibrous proteins present in eukaryotes. They form a large family, with about 30 members being
distinguished. Keratins have been classified as either α-keratins or β-keratins.
Proteins α-keratin β-keratin
Characteristics Tough, insoluble Soft, flexible
Conformation Helical Extended chain
Basic unit Protofibril Antiparallel β-pleated sheet
α-keratins are intermediate filament proteins present only in many metazoans, including vertebrates. In vertebrates,
α-keratins constitute almost the entire dry weight of hair, wool, feathers, nails, claws, scales, horns, hooves, and
much of the outer layer of skin. The α-keratin polypeptide chain which forms polymerized α-keratin structure, is a
right-handed α-helix and rich in hydrophobic amino acid residues Ala, Val, Leu, Ile, Met and Phe. Every α-keratin
polypeptide chain dimerizes to form heterodimer. The heterodimer is made up of type I (acidic) and the type II
(neutral/basic) α-keratin polypeptide chains. The two chains in heterodimer have a parallel arrangement. Two
heterodimers join in an antiparallel manner to form the fundamental tetrameric subunit (a protofilament). Two
protofilaments constitute a protofibril. Four protofibrils constitute a microfibril, which associates with other microfibrils
to form a macrofibril.
1.2.4 Myoglobin
Myoglobin (Mb), a globular protein, contains a single polypeptide chain of 153 amino acid residues (molecular
weight 17,800), and a single heme group. The inside of myoglobin consists almost exclusively of nonpolar residues,
whereas the outside contains both polar and nonpolar residues. About 75% of the polypeptide chain is α-helical.
There are eight helical segments. These eight helical segments are commonly labeled A–H, starting from the
NH2-terminal end. The interhelical regions are designated as AB, BC, CD,..., GH, respectively. The iron atom of the
heme is directly bonded to a nitrogen atom of a histidine side chain of globin.
Heme
Globin of Mb binds a single heme group by forming a co-ordinate bond. The heterocyclic ring system of heme is a
porphyrin derivative. The porphyrin in heme is known as protoporphyrin IX. It is made up of 4-pyrrole ring and
4-pyrroles are linked by methine (=CH_) bridges to form a tetrapyrrole ring. The Fe atom is present either in Fe2+
or Fe3+ oxidation state in the center of the protoporphyrin IX ring.
25
This page intentionally left blank.
Biomolecules and Catalysis
Biological interaction
Biological interactions (or bonds) in living systems fall under two categories – covalent and non-covalent interactions.
Covalent interactions
A covalent bond is formed when two atoms share one or more pairs of electrons. Covalent bonds are strong
bonds and very stable in nature. These bonds may be either polar or nonpolar. A nonpolar covalent bond such
as that in the hydrogen molecule, H2, the electron pair is shared equally between the two hydrogen nuclei. Both
hydrogen atoms have the same electronegativity (the electronegativity difference between the atoms is zero).
Covalent bonds, such as the one in HF, in which the electron pairs are shared unequally due to difference in
electronegativity are called polar covalent bonds. In HF, the shared electron pair between the two atoms gets
displaced more towards fluorine since the electronegativity of fluorine is far greater than that of hydrogen. The
atom towards which the electron pair shift gets slight negative charge while the other atom acquires a slight
positive charge. The magnitude of electronegativity difference reflects the degree of polarity. Greater the
difference in the electronegativities of the atoms forming the bond, greater will be the charge separation and
hence greater will be the polarity of the molecule. The polarity of a molecule can be expressed by its dipole
moment, which measures the separation of charge within the molecule.
Non-covalent interactions
Non-covalent interactions include ionic bonds, hydrogen bonds, van der Waals forces and hydrophobic interactions.
These interactions are weak interactions. The energy required to break non-covalent interactions is only
1-5 kcal/mol which is much less than the bond energies of covalent bonds.
Ionic bonds
An ionic bond is a chemical bond formed by the electrostatic attraction between positive and negative ions. A
positively charged ion is called a cation and a negatively charged ion is called an anion. In ionic (electrovalent)
bonding, the atoms are bound by electrostatic attraction of opposite ions, whereas, in covalent bonding, atoms
are bound by sharing electrons to attain stable electron configurations. Ionic bond forms when the electronegativity
difference between two elements is large, as between a metal and a nonmetal.
An important aspect of ionic compound in aqueous solution is the hydration of ions. Because water molecules
are polar, they are attracted to charged ions. Shells of water molecules, referred to as solvation spheres, cluster
around both positive and negative ions. As ions become hydrated, the attractive force between them is reduced,
and the charged species dissolves in the water.
Hydrogen bonds
Hydrogen bonding is a weak electrostatic attractive force that exists between a hydrogen atom covalently
bonded to a very electronegative atom, X and a lone pair of electrons on another small, electronegative atom,
Y. A typical hydrogen bond may be depicted as X–H•••Y–, where the three dots denote the bond. X–H represents
the hydrogen bond donor. The atoms X and H are covalently bonded to one another and the X–H bond is
polarized, the H•••Y bond strength increasing with the increase in electronegativity of X. Usually, hydrogen
bonding is seen in case where X and Y are the atoms F, O or N. Y is electronegative atom with a lone pair of
electrons. Although considerably weaker than ionic and covalent bonds, hydrogen bonds are stronger than
most non-covalent bonds. It can be intermolecular and intramolecular hydrogen bond. Hydrogen bonds are
both longer and weaker than covalent bonds between the same atoms. For example, the bond energy of O–H
covalent bond is 110 kcal mol–1 whereas, the energy of hydrogen bonds in water is only about 5 kcal mol–1.
46
This page intentionally left blank.
Biomolecules and Catalysis
1.5.1 Nucleotides
Nucleic acids are polymer. The monomeric units of nucleic acids are called nucleotides. Nucleic acids therefore are
also called polynucleotides. Nucleotides are phosphate esters of nucleosides and made up of three components:
1. A base that has a nitrogen atom (nitrogenous base)
2. A five carbon sugar
3. An ion of phosphoric acid
Nitrogenous bases
Nitrogenous bases are heterocyclic, planar and relatively water insoluble aromatic molecules. There are two general
types of nitrogenous bases - pyrimidines and purines.
H H
7
C6 5 N C4 5
1N C 3N CH
8
2 CH 2
HC C HC CH
4 N9 6
N H N
3 1
Purine Pyrimidine
Purines
Two different nitrogenous bases with a purine ring (composed of carbon and nitrogen) are found in DNA. The two
common purine bases found in DNA and RNA are adenine (6-aminopurine) and guanine (6-oxy-2-aminopurine).
Adenine has an amino group (–NH2) on the C6 position of the ring (carbon at position 6 of the ring). Guanine has an
amino group at the C2 position and a carbonyl group at the C6 position.
Pyrimidines
The two major pyrimidine bases found in DNA are thymine (5-methyl-2,4-dioxypyrimidine) and cytosine (2-oxy-4-
aminopyrimidine) and in RNA they are uracil (2,4-dioxypyrimidine) and cytosine. Thymine contains a methyl group
at the C5 position with carbonyl groups at the C4 and C2 positions. Cytosine contains a hydrogen atom at the C5
position and an amino group at C4. Uracil is similar to thymine but lacks the methyl group at the C5 position. Uracil
is not usually found in DNA. It is a component of RNA.
NH2 O NH2 O O
C C C C C
N N
N C HN C N CH HN CH HN C CH3
CH CH
HC C C C C CH C CH C CH
N H2N N O N O N O N
N H N H H H H
Sugars
Naturally occurring nucleic acids have two types of pentose sugars: ribose and deoxyribose sugar.
Ribose sugar is found in RNA. It is a five carbon monosaccharide with a hydroxyl group (–OH) on each carbon.
Deoxyribose sugar is found in DNA. It is a five carbon monosaccharide, lacking one oxygen atom at 2’ position. The
hydroxyl group (–OH) at 2’ position of ribose sugar is replaced by a hydrogen (–H).
48
This page intentionally left blank.
Biomolecules and Catalysis
1.6.2 Z-DNA
Z-DNA is a left-handed double helical structure with two anti-parallel strands that are held together by Watson-
Crick base pairing. The transition from B- to Z-DNA conformation occurs most readily in DNA segments containing
alternating purines and pyrimidines, especially alternations of C and G on one strand (and also in DNA segments
containing alternations of T G on one strand and C A on the other). The existence of Z-DNA was first suggested by
optical studies demonstrating that a polymer of alternating C and G in one strand in a 4 M NaCl solution. The
physical reason for this finding remained a mystery until an X-ray crystallographic study of a self-complementary
DNA hexamer d(CG)3 revealed a left-handed double helix with two anti-parallel chains that were held together by
Watson–Crick base pairing.
Z-DNA is thinner (18 Å) than B-DNA (20 Å) and there is only one deep, narrow groove equivalent to the minor
groove in B-DNA. No major groove exists. In contrast to B-DNA where a repeating unit is a 1 base pair, in Z-DNA
the repeating unit is a 2 base pairs. This dinucleotide repeat causes the backbone to follow a zigzag path, giving rise
to the name Z-DNA. The glycosidic bond conformations alternate between anti and syn (anti for pyrimidines; syn
for purines). Similarly, the sugar puckers alternate between C3’-endo and C2’-endo (C2’-endo for pyrimidines and
C3’-endo for purines).
Z-DNA can form in regions of alternating purine-pyrimidine sequence; GCGCGC... sequences form Z-DNA most
easily. TGTGTGTG… sequences also form Z-DNA but they require a greater stabilization energy. Formation of
Z-DNA conformation is generally unfavourable. Certain conditions promote Z-DNA conformation from B-DNA
conformation; such as negative DNA supercoiling, high salt concentration or 5-methylated deoxycytosine.
55
Biomolecules and Catalysis
Triple helix can be intermolecular or intramolecular. In the intermolecular Pu.Pu.Py triple helix, the poly-purine
third strand is organized antiparallel with respect to the purine strand of the original Watson-Crick duplex. In the
intermolecular Py.Pu.Py triplex, the polypyrimidine third strand is organized parallel with respect to the purine
strand and the phosphate backbone is positioned.
5' 3’
Polypyrimidine strand
Polypurine
third strand Polypurine strand
Figure 1.49
Intermolecular Pu.Pu.Py triple
5'
helix. The polypurine third strand
(black color) is organized antiparallel
with respect to the purine strand
5' of the original double strand DNA.
3’
An intramolecular triplex (also referred to as H-DNA) could form within a single homopurine-homopyrimidine
duplex DNA region in the supercoiled DNA. As in intermolecular triplexes, when the third strand is the pyrimidine
strand, it forms Hoogsteen pairs in a parallel fashion with the central purine strand. When the third strand is the
purine strand, it forms reverse Hoogsteen pairs in an antiparallel fashion with the central purine strand.
1.6.4 G-quadruplex
Nucleic acid sequences which are rich in guanine are capable of forming four-stranded structures called
G-quadruplexes (also called G-quartat). These consist of a square arrangement of guanines (a tetrad), stabilized
by Hoogsteen hydrogen bonding. The formation and stability of the G-quadruplexes is a monovalent cation-dependent.
A monovalent cation is present in the center of the tetrads. G-quadruplexes can be formed of DNA or RNA. They
can be formed from one, two or four separate strands of DNA or RNA. Depending on the direction of the strands or
parts of a strand that form the tetrads, structures may be described as parallel or antiparallel. All parallel quadruplexes
have all guanine glycosidic angles in an anti conformation. Anti-parallel quadruplexes have both syn and anti
conformations.
H
Anti
N N N H N N Anti
N H
N O N
O N
H
+ N H
H M H
H N
O H
N
O
N H N N
N N
N N H N
Anti H Anti
Figure 1.50 Four-stranded structures can arise from square arrangement of guanines.
56
Biomolecules and Catalysis
3’ 3’
3’ 3’ 5’ 3’
G G
G G
G G G G
G G G G
G G G G
G G G G
G G G G
G G G G
T T G G
T T
T G G
T T
T T T
T T
G G T T
G G T T
G G G G
G G G G
G G G G
G G G G
G G G G
G G G G
T T G G
T T G G
T T
T T
5’ 5’
5’ 5’ 5’ 3’
Parallel Antiparallel
57
This page intentionally left blank.
Biomolecules and Catalysis
TMV type A
Infection of
tobacco leaf
TMV type B
Figure 1.59 In vivo reconstitution of a hybrid TMV virus. There are two strains of virus (TMV type A and
type B) which were separated into protein and RNA. The protein of one strain (type B) was allowed to recombine
with the RNA of the other (type A). The in vivo progeny of this hybrid had the protein originally associated
with its RNA. This proves that the genetic material of TMV is RNA, not protein.
Problem
What is the approximate molecular weight of duplex DNA required to code for glyceraldehyde phosphate
dehydrogenase (MW 40,000)?
Solution
The average molecular weight of an amino acid residue in a protein is 110. Thus, a protein whose molecular weight
is 40,000 contains 40,000/110 = ~364 amino acids and requires a minimum DNA duplex of 3 × 364 = ~1090,
nucleotide pairs. Since each nucleotide pair has an average molecular weight of about 650, the molecular weight of
this gene would be about 1090 × 650 = 708,500. On the average, the molecular weight of coding DNA is about 18
times that of the corresponding protein.
Problem
65
Biomolecules and Catalysis
Solution
1. The genetic code is a triplet code. That is, it takes a sequence of three nucleotides on the coding strand of DNA
to specify one amino acid. The DNA of T4 contains:
1.3 × 108
= 2 × 105 nucloeotide pairs = 2 × 105 nucleotides in the codin
ng strand.
650
2 × 105
= ~ 6.7 × 104 codons.
3
6.7 × 104
6.7 × 104 codons can yield: = 134.
500
1.8 Carbohydrates
Carbohydrates are polyhydroxy aldehydes or polyhydroxy ketones, or compounds that can be hydrolyzed to them.
In the majority of carbohydrates, H and O are present in the same ratio as in water, hence also called as hydrates
of carbon. Carbohydrates are the most abundant biomolecules on Earth. Carbohydrates are classified into following
classes depending upon whether these undergo hydrolysis and if so on the number of products form:
Monosaccharides are simple carbohydrates that consist of a single polyhydroxy aldehyde or ketone unit.
Oligosaccharides are polymers made up of two to ten monosaccharide units joined together by glycosidic linkages.
Oligosaccharides can be classified as di-, tri-, tetra- depending upon the number of monosaccharides present.
Amongst these the most abundant are the disaccharides, with two monosaccharide units.
Polysaccharides are polymers with hundreds or thousands of monosaccharide units. Polysaccharides are not sweet
in taste hence they are also called non-sugars.
1.8.1 Monosaccharide
Monosaccharides consist of a single polyhydroxy aldehyde or ketone unit. Monosaccharides are the simple sugars
and they have a general formula CnH2nOn. Monosaccharides are colorless, crystalline solids that are freely soluble
in water but insoluble in nonpolar solvents. The most abundant monosaccharide in nature is the D-glucose.
Monosaccharides can be further sub classified on the basis of:
66
This page intentionally left blank.
Biomolecules and Catalysis
1.8.7 Glycoproteins
Various types of compound consisting of carbohydrates covalently linked with non-carbohydrates constituent are
classified under the general name called glycoconjugates. The major types of glycoconjugates are the glycoproteins,
peptidoglycans, glycolipids and lipopolysaccharides. Carbohydrates covalently linked with proteins are called
glycoproteins. The carbohydrate may be in the form of a monosaccharide, disaccharide, oligosaccharide,
polysaccharide, or their derivatives. The term glycoprotein also includes proteoglycans, which in the past were
considered as a separate class of glycoconjugates. Proteoglycans are a subclass of glycoproteins in which the
carbohydrates are glycosaminoglycans. In glycoproteins, carbohydrates are attached either to the amide nitrogen
atom in the side chain of asparagine (termed as N-linkage) or to the oxygen atom in the side chain of serine or
threonine (termed as O-linkage).
CH2OH CH2OH
C O O C O
O O CH2 CH Ser O NH C CH2 CH Asn
1 1
OH NH OH NH
O H O H
Monosaccharide Monosaccharide
Core protein Core protein
Figure 1.67 Carbohydrates are covalently attached to many different proteins to form glycoproteins.
Carbohydrates are attached either to the amide nitrogen atom in the side chain of asparagine (termed an
N-linkage) or to the oxygen atom in the side chain of serine or threonine (termed an O-linkage).
Disaccharides like sucrose, trehalose not capable of reducing ferric or cupric ion are called non-reducing sugar.
In sucrose and trehalose, anomeric carbons of both monosaccharides participate in glycosidic bond formation. So,
they do not contain free anomeric carbon atoms. Sucrose and trehalose are therefore non-reducing sugar, and
have no reducing end. So it cannot be oxidized by cupric or ferric ions. In describing disaccharides or polysaccharides,
the end of a chain that has a free anomeric carbon (i.e. is not involved in a glycosidic bond) is called the reducing
end of the chain.
1.9 Lipids
Biological lipids are a chemically diverse group of organic compounds which are insoluble or only poorly soluble in
water. They are readily soluble in nonpolar solvents such as ether, chloroform, or benzene. The hydrophobic nature
of lipids is due to the predominance of hydrocarbon chains (—CH2—CH2—CH2—) in their structures. Unlike the
proteins, nucleic acids, and polysaccharides, lipids are not polymers.
76
Biomolecules and Catalysis
Leukotrienes are hydroxy fatty acid derivatives of arachidonic acid and do not contain a ring structure. Leukotrienes
are distinguished by containing a conjugated triene double-bond arrangement. They are involved in chemotaxis,
inflammation, and allergic reactions.
H
O
COOH
H
CH3
85
Biomolecules and Catalysis
1.10 Vitamins
Vitamins are organic compounds required by the body in trace amounts to perform specific cellular functions. They
can be classified according to their solubility and their functions in metabolism. The requirement for any given
vitamin depends on the organisms. Not all vitamins are required by all organisms. Vitamins are not synthesized by
humans, and therefore must be supplied by the diet. Vitamins may be water soluble or fat soluble. Nine vitamins
(thiamines, riboflavin, niacin, biotin, pantothenic acid, folic acid, cobalamin, pyridoxine, and ascorbic acid) are
classified as water soluble, whereas four vitamins (vitamins A, D, E and K) are termed fat-soluble. Except for
vitamin C, the water soluble vitamins are all precursors of coenzymes.
Thiamine pyrophosphate (TPP) is the biologically active form of the vitamin, formed by the transfer of a pyrophosphate
group from ATP to thiamine. Thiamine is composed of a substituted thiazole ring joined to a substituted pyrimidine
by a methylene bridge.
Thiazolium Aminopyrimidine
Reactive H NH2
H NH2 carbon +
S N N
+
S N N AMP
ATP
N CH3
CH3
N CH3
CH3 O
TPP synthetase
O —
O P O
H
Thiamine O
—
O P O
—
O
TPP serves as a coenzyme in the oxidative decarboxylation of α-keto acid, and in the formation or degradation of
α-ketols (hydroxy ketones) by transketolase.
Pyruvate
decarboxylase
Pyruvate (a-keto acid) Acetaldehyde + CO2
Transketolase
Xylulose-5-Phosphate + Ribose-5-Phosphate Glyceraldehyde–3–Phosphate + Sedoheptulose–7-Phosphate
Beri-Beri is a severe thiamine-deficiency syndrome found in areas where polished rice is the major component of the
diet.
86
Biomolecules and Catalysis
H
O O
Isoalloxazine
H3C N H3C N N
NH NH 2H
+
—
N N O N N O 2e N
H3C H3C
CH2 CH2 H
H C OH ADP PPi H C OH FADH2 (Reduced)
ATP ATP
Ribitol H C OH FMN H C OH
H C OH H C OH
Figure 1.79 Structure and biosynthesis of flavin mononucleotide (FMN) and flavin adenine dinucleotide (FAD).
Niacin
Niacin, or nicotinic acid, is a substituted pyridine derivative. The biologically active coenzyme forms are nicotinamide
adenine dinucleotide (NAD+) and its phosphorylated derivative, nicotinamide adenine dinucleotide phosphate
(NADP+). Nicotinamide, is a derivative of nicotinic acid that contains an amide instead of a carboxyl group. NAD+
and NADP+ serve as coenzymes in oxidation-reduction reactions in which the coenzyme undergoes reduction of the
pyridine ring by accepting a hydride ion (H–). The reduced forms of NAD+ and NADP+ are NADH and NADPH,
respectively.
O O
H H H
NH2
C C
+
N NH2 H NH2
O O
—
5' 5' + 2e :
N H2C O P O P O CH2 N N
N
O
O O
—
O
—
H H NADH (Reduced)
1'
H H H
H
OH OH
OH OH
Adenosine
Deficiency of niacin causes pellagra, a disease involving the skin and central nervous system. The symptoms of
pellagra progress through the three Ds: Dermatitis, Diarrhoea, Dementia, and, if untreated, death.
Biotin
Biotin is a coenzyme in carboxylation reactions, in which it serves as a mobile carboxyl group carrier. Biotin is
covalently bound to the enzyme by an amide linkage between the carboxyl group of its valerate side chain and
the ε-amino group of an enzyme Lys residue to form a biocytin (alternatively, biotinyllysine) residues.
87
This page intentionally left blank.
Chapter 02
Bioenergetics and Metabolism
2.1 Bioenergetics
Bioenergetics is the quantitative study of the energy transductions that occur in living cells and of the nature and
functions of the chemical processes underlying these transductions.
Thermodynamic principles
The First law of thermodynamics states that the energy is neither created nor destroyed, although it can be
transformed from one form to another i.e. the total energy of a system, including surroundings, remains constant.
If Δq is positive, heat has been transferred to the system, giving an increase in internal energy. When Δq is
negative, heat has been transferred to the surroundings, giving a decrease in internal energy. When Δw is positive,
work has been done by the system, giving a decrease in internal energy. When Δw is negative, work has been done
by the surroundings, giving an increase in internal energy.
The Second law of thermodynamics states that the total entropy of a system must increase if a process is to occur
spontaneously. Mathematically, it can be expressed as:
Dq
DS ³ where, ΔS is the change in entropy of the system
T
Entropy is unavailable form of energy and it is very difficult to determine it, so a new thermodynamic term called
free energy is defined.
Free energy
Free energy or Gibb’s free energy indicates the portion of the total energy of a system that is available for useful
work (also known as chemical potential). The change in free energy is denoted as ΔG.
Under constant temperature and pressure, the relationship between free energy change (ΔG) of a reacting system
and the change in entropy (ΔS) is expressed by following the equation:
ΔG = ΔH – TΔS
Where, ΔH is the change in enthalpy and T is absolute temperature. ΔH is the measure of change in heat content of
reactants and products. The change in the free energy, ΔG, can be used to predict the direction of a reaction at
constant temperature and pressure.
123
Bioenergetics and Metabolism
If 'G is negative, the reaction proceeds spontaneously with the loss of free energy ( exergonic),
'G is positive, the reaction proceeds only when free energy can be gained (endergonic),
'G is 0, the system is at equilibrium; both forward and reverse reactions occur at equal rates,
'G of the reaction A o B depends on the concentration of reactant and product. At constant temperature and
pressure, the following relation can be derived:
[B]
'G = 'G0 + RT ln
[A]
Where, 'G0 is the standard free energy change;
R is the gas constant;
T is the absolute temperature;
[A] and [B] are the actual concentrations of reactant and product.
In a reaction A o B, a point of equilibrium is reached at which no further net chemical change takes place–that is,
when A is being converted to B, B is also being converted to A, as fast as A into B. In this state, the ratio of [B] to
[A] is constant, regardless of the actual concentrations of the two compounds:
[B]eq
K eq =
[A]eq
where Keq is the equilibrium constant, and [A]eq and [B]eq are the concentrations of A and B at equilibrium. The
concentration of reactants and products at equilibrium define the equilibrium constant, Keq. The equilibrium constant
Keq depends on the nature of reactants and products, the temperature and the pressure. Under standard physical
conditions (25°C and 1 atm pressure, for biological systems), the Keq is always the same for a given reaction,
whether or not a catalyst is present.
If the reaction A B is allowed to go to equilibrium at constant temperature and pressure, then at equilibrium
the overall free energy change ('G) is zero. Therefore,
[B]eq
'G0 = –RT ln
[A]eq
0
So, 'G = –RT ln Keq
This equation allows some simple predictions:
Keq 'G 0 Reaction
> 1.0 Negative proceeds forward
1.0 Zero is at equilibrium
< 1.0 Positive proceeds in reverse
As we know, the ionic composition of an acid or base varies with pH. So, the standard free energy calculated
according to the biochemistry convention is valid only at pH=7. Hence, under biochemistry convention, 'G0 is
symbolized by 'G0’ and likewise, the biochemical equilibrium constant is represented by K’ eq.
124
This page intentionally left blank.
Bioenergetics and Metabolism
2.3 Respiration
Living cells require an input of free energy. Energy is required for the maintenance of highly organized structures,
synthesis of cellular components, movement, generation of electrical currents and for many other processes. Cells
acquire free energy from the oxidation of organic compounds that are rich in potential energy.
Respiration is an oxidative process, in which free energy released from organic compounds is used in the formation
of ATP. The compounds that are oxidized during the process of respiration are known as respiratory substrates,
which may be carbohydrates, fats, proteins or organic acids. Carbohydrates are most commonly used as respiratory
substrates.
During oxidation within a cell, all the energy contained in respiratory substrates is not released free in a single step.
Free energy is released in multiple steps in a controlled manner and used to synthesise ATP, which is broken down
whenever (and wherever) energy is needed. Hence, ATP acts as the energy currency of the cell.
During cellular respiration, respiratory substrates such as glucose may undergo complete or incomplete oxidation.
The complete oxidation of substrates occurs in the presence of oxygen, which releases CO2, water and a large
amount of energy present in the substrate. A complete oxidation of respiratory substrates in the presence of
oxygen is termed as aerobic respiration.
Although carbohydrates, fats and proteins can all be oxidized as fuel, but here processes have been described by
taking glucose as a respiratory substrate. Oxidation of glucose is an exergonic process. An exergonic reaction
proceeds with a net release of free energy. When one mole of glucose (180 g) is completely oxidized into CO2 and
water, approximately 2870 kJ or 686 kcal energy is liberated. Part of this energy is used for synthesis of ATP. For
each molecule of glucose degraded to carbon dioxide and water by respiration, the cell makes up to about 30 or 32
ATP molecules, each with 7.3 kcal/mol of free energy.
The incomplete oxidation of respiratory substrates occurs under anaerobic conditions i.e. in the absence of oxygen.
As the substrate is never totally oxidized, the energy generated through this type of respiration is lesser than that
during aerobic respiration.
129
Bioenergetics and Metabolism
2.3.2 Glycolysis
Glycolysis (from the Greek glykys, meaning sweet, and lysis, meaning splitting) also known as Embden-Meyerhof
pathway, is an oxidative process in which one mole of glucose is partially oxidized into the two moles of pyruvate
in a series of enzyme-catalyzed reactions. Glycolysis occurs in the cytosol of all cells. It is a unique pathway that
occurs in both aerobic as well as anaerobic conditions and does not involve molecular oxygen.
6 CH2OH
5 O
Glucose (G) 4
OH
1
3 2
HO OH
2+ ATP
Hexokinase, Mg OH
1
DG° (kJ/mol) = –16.7 ADP CH2OP
O
Glucose-6-phosphate (G6P) OH
HO OH
Preparatory phase (Energy investment phase)
Phosphoglucoisomerase OH
2
DG° (kJ/mol) = +1.7
POH2C O CH2OH
Fructose-6-phosphate (F6P) HO
OH
2+ ATP
Phosphofructokinase, Mg OH
3
Step 1 : (Phosphorylation) Glucose is phosphorylated by ATP to form a glucose 6-phosphate. The negative
charge of the phosphate prevents the passage of the glucose 6-phosphate through the plasma membrane, trapping
glucose inside the cell. This irreversible reaction is catalyzed by hexokinase. Hexokinase is present in all cells of all
organisms. Hexokinase requires divalent metal ions such as Mg2+ or Mn2+ for activity. Hepatocytes and β-cells of
the pancreas also contain a form of hexokinase called glucokinase (hexokinase D). Hexokinase and glucokinase
are isozymes. Glucokinase is present in liver and beta-cells of the pancreas and has a high Km and Vmax as
compared to hexokinase.
Step 2 : (Isomerization) A readily reversible rearrangement of the chemical structure (isomerization) moves the
carbonyl oxygen from carbon 1 to carbon 2, forming a ketose from an aldose sugar. Thus, the isomerization of
glucose 6-phosphate to fructose 6-phosphate is a conversion of an aldose into a ketose.
130
This page intentionally left blank.
Bioenergetics and Metabolism
d
a b a b
F1 ATPase
Matrix
g
b e
Inner
mitochondrial F0
a
membrane c c c c cc
+
Intermembrane H
space
Figure 2.16 The enzyme complex consists of an F0 component and F1 ATPase. Proton passing through the
disc of ‘C’ units cause it and the attached γ-subunit to rotate. The γ-subunit fits inside the F1 ATPase of a three
α and three β-subunits, which are fixed to the membrane and do not rotate.
148
Bioenergetics and Metabolism
ATP synthase synthesizes ATP by harnessing the proton motive force. ATP synthase can also function in reverse to
hydrolyze ATP and pump H+ across the inner mitochondrial membrane. It thus acts as a reversible coupling devise,
interconverting electrochemical proton gradient and chemical bond energies, or vice versa.
F1 ATPase was first extracted from the mitochondrial inner membrane and purified by Efraim Racker and his
colleagues. F1 cannot synthesize ATP from ADP and Pi; because it can catalyze the hydrolysis of ATP. Thus the
enzyme was originally called F1ATPase. The complete F0F1 complex, like isolated F1, can hydrolyze ATP to ADP and
Pi, but its biological function is to catalyze the condensation of ADP and Pi to form ATP. The F0F1 complex is,
therefore, more appropriately called ATP synthase.
F0 F1 ATPase
F0
Show electron
transport, but no
ATP synthesis
Figure 2.17 F1 particles are required for ATP synthesis, but not for electron transport. Submitochondrial
vesicles from which F1 is removed by mechanical agitation cannot catalyze ATP synthesis. Because F1 separated
from membranes is capable of catalyzing ATP hydrolysis, it has been called the F1 ATPase.
ATP synthesis
The binding change mechanism is a widely accepted model of ATP synthesis. Paul Boyer developed the binding
change, or flip-flop mechanism, which postulated that ATP synthesis is coupled with a conformational change in the
ATP synthase generated by rotation of the gamma subunit. Proton translocation through F0 powers the rotation of
the γ-subunit of F1 ATPase, leading to changes in the conformation of the nucleotide-binding sites in the F1
β-subunits (as described below). By means of this binding change mechanism, the F0F1 complex harnesses the
proton-motive force to power ATP synthesis.
b1 b2 ATP ATP
P+
(counter-clockwise
ADP+Pi
Pi
as viewed from
Released
the top)
Stage 1 Stage 2
Figure 2.18 The binding-change mechanism of ATP synthesis from ADP and Pi by the F0F1 complex. The
molecule contains three binding sites, which interconvert between three conformational states as the molecule
rotates. The diagram shows one stage of the active cycle. The three αβ-dimers have three different states.
In 1, the open state O is empty; the loose state L contains ADP + Pi ; and the tight state T contains ATP. In
logical intermediate stage (bracketed), rotation of the γ within the (αβ)3 hexamer converts the L state to a T
state, the T state to an O, and the O state to an L. The L state can accept a new charge of substrate, the T state
can form ATP. At stage 2, the ATP has fallen out of the O state, new ADP + Pi have bound to the L state, and
ATP has been synthesized in the T state.
149
This page intentionally left blank.
Bioenergetics and Metabolism
A major function of GSH in the erythrocyte is to eliminate H2O2 and organic hydroperoxides. H2O2, a toxic product
of various oxidative processes, reacts with double bonds in the fatty acid residues of the erythrocyte cell membrane
to form organic hydroperoxides. These, in turn, result in premature cell lysis. Peroxides are eliminated through the
action of glutathione peroxidase, yielding glutathione disulfide (GSSG). So, G6PD deficiency results in hemolytic
anemia caused by the inability to detoxify oxidizing agents.
Figure 2.30 Role of the pentose phosphate pathway in the reduction of oxidized glutathione.
— —
COO COO O O
—
H C OH C O CH3 C C O
6 CH2OH CH2OP
5 O ATP ADP O NADP NADPH HO C H H2O H C H Pyruvate
4 1
OH OH H C OH H C OH
3 2
HO OH HO OH
OH
OH OH H C OH H C OH
POH2C CH CHO
CH2O P CH2O P
Glucose Glucose-6-phosphate
6-Phosphogluconate Glyceraldehyde-3-
2-Keto-3-deoxy-
phosphate
6-phosphogluconate
+
NAD
2 ADP
NADH
2 ATP
Pyruvate
2.7 Photosynthesis
Photosynthesis is a physiochemical process by which photosynthetic organisms convert light energy into chemical
energy in the form of reducing power (as NADPH) and ATP, and use these chemicals to drive carbon dioxide
fixation.
160
Bioenergetics and Metabolism
Sun
Light reaction
ATP + NADPH
Figure 2.32 Photosynthesis is a two stage process. The first process is a light dependent one (light reactions)
that requires the direct energy of light to make energy carrier molecules that are used in the second process.
The Calvin cycle (light independent process) occurs when the products of the light reaction are used in the
formation of carbohydrate.
On the basis of generation of oxygen during photosynthesis, the photosynthetic organisms may be oxygenic or
anoxygenic. Oxygenic photosynthetic organisms include both eukaryotes as well as prokaryotes whereas anoxygenic
photosynthetic organisms include only prokaryotes.
In oxygenic photosynthetic organisms, photosynthetic oxygen generation occurs via the light-dependent oxidation
of water to molecular oxygen. This can be written as the following simplified chemical reaction:
Chlorophylls
Chlorophyll, a light-absorbing green pigment, contains a polycyclic, planar tetrapyrrole ring structure. Chlorophyll
is a lipid soluble pigment. It has the following important features:
1. The central metal ion in chlorophyll is Mg2+.
2. Chlorophyll has a cyclopentanone ring (ring V) fused to pyrrole ring III.
3. The propionyl group on a ring IV of chlorophyll is esterified to a long-chain tetraisoprenoid alcohol. In chlorophyll
a and b it is phytol.
161
This page intentionally left blank.
Bioenergetics and Metabolism
Glycogen storage diseases are caused by a genetic deficiency of one or another of the enzymes of glycogen
metabolism. Many diseases have been characterized that result from an inherited deficiency of the enzyme.
These defects are listed in the table.
O O O
CH2 OH CH2 O C R1 CH2 O C R1 CH2 O C R1
O O
1 2 3
CH OH CH OH CH O C R2 CH O C R2
Fatty Fatty
acyl-CoA acyl-CoA Pi
CH2 OP CH2 OP CH2 OP CH2 OH
Fatty 4
acyl-CoA
Enzymes
1 Glycerol-3-phosphate acyltransferase O
2 Acylglycerophosphate acyltransferase CH2 O C R1
3 Phosphatidic acid phosphohydrolase O
4 Diacylglycerol acyltransferase CH O C R2
O
CH2 O C R3
Triacylglycerol
201
This page intentionally left blank.
Chapter 03
Cell theory
In 1839, Schleiden, a German botanist, and Schwann, a British zoologist, led to the development of the cell theory
or cell doctrine. According to this theory all living things are made up of cells and cell is the basic structural and
functional unit of life. In 1855, Rudolf Virchow proposed an important extension of cell theory that all living cells
arise from pre-existing cells (omnis cellula e cellula). The cell theory holds true for all cellular organisms. Non-
cellular organisms such as virus do not obey cell theory. Over the time, the theory has continued to evolve. The
modern cell theory includes the following components:
• All cellular organisms are made up of one or more cells.
• The cell is the structural and functional unit of life.
• All cells arise from pre-existing cells by division.
• Energy flow occurs within cells.
• Cells contain hereditary information (DNA) which is passed from cell to cell.
• All cells have basically the same chemical composition.
243
Cell Structure and Functions
the capability to derive energy from certain compounds in their environment and to use that energy to synthesize
more and more of their own precursor molecules, thereby becoming less dependent on outside sources of these
molecules-less extremely heterotrophic. A very significant evolutionary event was the development of photosynthetic
ability to fix CO2 into more complex organic compounds. The original electron (hydrogen) donor for these
photosynthetic organisms was probably H2S, yielding elemental sulfur as the byproduct, but at some point, cells
developed the enzymatic capacity to use H2O as the electron donor in photosynthetic reactions, producing O2. The
cyanobacteria are the modern descendants of these early photosynthetic O2 producers.
One important landmark along this evolutionary road occurred when there was a transition from small cells with
relatively simple internal structures - the so-called prokaryotic cells, which include various types of bacteria - to a
flourishing of larger and radically more complex eukaryotic cells such as are found in higher animals and plants.
The fossil record shows that earliest eukaryotic cells evolved about 1.5 billion years ago. Details of the evolutionary
path from prokaryotes to eukaryotes cannot be deduced from the fossil record alone, but morphological and
biochemical comparison of modern organisms has suggested a reasonable sequence of events consistent with the
fossil evidence.
Three major changes must have occurred as prokaryotes gave rise to eukaryotes. First, as cells acquired more
DNA, mechanisms evolved to fold it compactly into discrete complexes with specific proteins and to divide it equally
between daughter cells at cell division. These DNA-protein complexes called chromosomes become especially
compact at the time of cell division. Second, as cells became larger and intracellular membrane organelles developed.
Eukaryotic cells have a nucleus which contains most of the cell’s DNA, enclosed by a double layer of membrane.
The DNA is, thereby, kept in a compartment separate from the rest of the contents of the cell, the cytoplasm, where
most of the cell’s metabolic reactions occur.
Finally, primitive eukaryotic cells, which were incapable of photosynthesis or of aerobic metabolism, pooled their
assets with those of aerobic bacteria or photosynthetic bacteria to form symbiotic associations that became
permanent. Some aerobic bacteria evolved into the mitochondria of modern eukaryotes, and some photosynthetic
cyanobacteria became the chloroplasts of modern plant cells.
Peripheral protein
Phospholipid
bilayer
Integral
protein Peripheral
protein
Figure 3.1 Fluid mosaic model for membrane structure. The fatty acyl chains in the lipid bilayer form a
fluid, hydrophobic region. Integral proteins float in this lipid bilayer. Both proteins and lipids are free to move
laterally in the plane of the bilayer, but movement of either from one face of the bilayer to the other is restricted.
244
This page intentionally left blank.
Cell Structure and Functions
Thermodynamics of transport
The amount of energy needed for the transport of a solute against a concentration gradient can be calculated from
the initial concentration gradient. When there is transport of one mole of a solute (uncharged) from a region in
which its concentration is C1 to a place where its concentration is C2 and the standard free energy change (ΔG0) is
zero, then free energy change (ΔG) is given by
C2
ΔG = RT ln ... (1)
C1
According to this equation, if C2 is less than C1, ΔG is negative, and the process is thermodynamically favourable.
As more and more substance is transferred, C1 decreases and C2 increases, until C2 = C1. At this point ΔG = 0, and
the system is in equilibrium.
However, if the solute is an ion of charge Z, then the free energy change for transport across a cell membrane
involves two contributors: the normal concentration term, as given in equation (1), plus a second term describing
the energy change involved in moving a mole of ions across the potential difference. If we consider a process in
which ions are transported from outside to inside of a cell, then ΔG is given by:
Cin
ΔG = RT ln + Z .F . Δψ
Cout
Here F is the Faraday constant (96.5 kJ mole–1 V–1) and Δψ is the trans-membrane electrical potential (in volts).
Eukaryotic cells typically have electrical potentials across their plasma membranes of about 0.05 to 0.1 V (with the
inside negative relative to the outside).
261
This page intentionally left blank.
Cell Structure and Functions
Problem
When a neurotoxin is placed in the solution bathing an isolated neuron, it affects the action potential of the neuron
as shown in the figure below. What is the probable mechanism of action of this drug on this neuron?
80
Membrane potential (mV)
+Drug
40
0
Neuron
depolarized
–40 –Drug
–80 Time
Solution
One possibility is that the drug maintains the voltage-gated Na+ channels in an open position, although the drug is
not capable of opening voltage-gated channels by itself. Another possibility is that it prevents opening of the
voltage-gated K+ channels that are responsible for the quick return to the resting potential.
3.4.1 Endocytosis
The term endocytosis was coined by Christian de Duve in the year 1963. Endocytosis is a process whereby
eukaryotic cells internalize material from their surrounding environment. Internalization is achieved by the formation
of membrane-bound vesicles at the cell surface that arise by progressive invagination of the plasma membrane,
followed by pinching off and release of free vesicles into the cytoplasm.
Classically, endocytosis has been divided into phagocytosis (cellular eating) and pinocytosis (cellular drinking).
Phagocytosis or cell eating (first reported by Metchnikoff) describes the internalization of large particles following
particle binding to specific plasma membrane receptors and by the formation of large endocytic vesicles (generally
>250 nm in diameter) called phagosomes. The phagosomes fuse with lysosomes to form phagolysosomes. In
protozoa, phagocytosis is a form of feeding: large food particles taken up into phagosomes end up in lysosomes. In
multicellular eukaryotes, few specialized cells – so called professional phagocytes perform phagocytosis for non-
nutritive purposes. In mammals, two classes of white blood cells act as professional phagocytes—macrophage and
neutrophils. Phagocytosis is an active, actin mediated and highly regulated process involving specific cell-surface
receptors and signalling cascades mediated by Rho-family GTPases.
Pinocytosis or cell drinking (also termed as fluid-phase endocytosis) involves the ingestion of fluid by the formation
of small endocytic vesicles (termed pinocytic vesicles) of about 100 nm in diameter. Virtually all eukaryotic cells
perform pinocytosis. Uptake of soluble material dissolved in extracellular fluid during pinocytosis occurs both
selectively as well as non-selectively. Selective and efficient uptake occurs when solutes are captured by specific
high-affinity receptors (receptor mediated endocytosis). In receptor-mediated endocytosis, a specific receptor on
the cell surface binds tightly to the extracellular macromolecule (the ligand) that it recognizes. The plasma membrane
271
This page intentionally left blank.
Cell Structure and Functions
Secretory
vesicle Lyso-
somes
Trans-Golgi
network
Plasma
membrane
Rough ER
Nucleus
Figure 3.36 Diagrammatic representation of secretory pathway. Newly synthesized proteins are inserted
into the lumen of the ER. Those proteins that are transported out of the ER may then pass through various
sub-compartments of the Golgi until they reach the trans-Golgi network, the exit side of the Golgi. In the
trans-Golgi network, proteins are segregated and sorted. Proteins destined for the plasma membrane or
those that are secreted in a constitutive manner are carried out to the cell surface in transport vesicles.
Some proteins enter late endosomes and are selectively transferred to lysosomes.
285
This page intentionally left blank.
Cell Structure and Functions
GPI-linked proteins are lipid linked (or anchored) membrane proteins. These proteins are exactly like type I
transmembrane proteins, with a cleaved N-terminal signal sequence and internal stop transfer sequence. These
proteins are synthesized and initially inserted into the ER membrane. After insertion in the ER membrane, these
proteins are transferred to a glycosylphosphatidylinositol (GPI) anchor. Enzyme transamidase present in the ER
membrane cuts the protein free from its membrane bound C-terminus and simultaneously attaches the new C-
terminus to an amino group on a GPI. GPI helps to direct these proteins to cell membranes.
— Preformed —
COO COO
GPI anchor
Cytosol
ER lumen NH3
+
NH
+
NH 3 C O
+
NH
3
Figure 3.41 GPIs are added to polypeptides anchored in the membrane by a carboxy-terminal membrane
spanning region. The membrane-spanning region is cleaved, and the new carboxy terminus is joined to the
NH2 group after translocation is completed leaving the protein attached to the membrane by the GPI anchor.
289
This page intentionally left blank.
Cell Structure and Functions
3.9 Lysosome
Lysosomes are single membrane-bound organelles present in animal cells. They are heterogeneous structure and
greatly vary in size and shape. Lysosomes have acidic internal pH (about 5) and are filled with hydrolytic enzymes.
They contain about 40 different types of hydrolytic enzymes (including proteases, nucleases, glycosidases, lipases,
phospholipases, phosphatases and sulfatases) which are responsible for the controlled intracellular digestion of
macromolecules. All are acid hydrolases because they require an acidic environment for optimal activity and the
lysosome provides this by maintaining a pH of about 5.0. A vacuolar H+ ATPase in the lysosomal membrane uses
the energy of ATP hydrolysis to pump H+ into the lysosome, thereby maintaining the internal acidic pH.
0.2–0.5 µm
Lysosome Cytosol
pH ~5 pH ~7.2
+
H
ATP ADP
Figure 3.47 The interior of lysosomes has a pH of about 5.0. To create the low pH, V- type H+ ATPase located in
the lysosomal membrane pump protons into the lysosome using energy supplied from ATP. All the lysosomal
enzymes work most efficiently at acidic pH and collectively are termed acid hydrolases.
There are two types of lysosomes: primary lysosomes (do not contain materials for intracellular digestion) and
secondary lysosomes (contain materials that are undergoing digestion or that already have been digested).
Lysosomes are responsible for the digestion of both extracellular as well as intracellular materials. Lysosomal
digestion of materials can be classified into autophagy and heterophagy. The process by which substances are
taken into the cell from external environment and broken down by lysosome is called heterophagy. In contrast,
the degradation of cytoplasmic components within lysosomes is called autophagy.
In heterophagy, there are two different pathways that brings extracellular materials to lysosomes for degradation.
Phagocytic cells, such as macrophages and neutrophils in vertebrates, engulf large particles by the process of
phagocytosis. During phagocytosis, a single-membrane phagosome is generated, and this compartment fuses
directly with a lysosome to form a phagolysosome.
Virtually all eucaryotic cells continually internalize fluid substances in small pinocytic (endocytic) vesicles by the
process of pinocytosis. Most of endocytosed substances eventually end up in lysosomes, where they degraded. In
this process, the endocytosed substances first move from the endocytic vesicles to the endosomes. At the end of
this pathway, the late endosomes convert to endolysosomes and lysosomes as a result of both their fusion with
preexisting lysosomes and progressive acidification.
Autophagy is an intracellular degradation process of cytoplasmic constituents within lysosomes. During autophagy,
sequestration begins with the formation of a phagophore. Phagophores form de novo in the cytoplasm from a
cup-shaped membrane that expands into a double-membrane bound autophagosome surrounding a portion of the
cytoplasm. The autophagosome may fuse with an endosome. The product of the endosome-autophagosome fusion is
called an amphisome. The completed autophagosome or amphisome fuses with a lysosome, which supplies acid hydrolases.
The enzymes in the resulting compartment, an autolysosome, break down the inner membrane from the
autophagosome and degrade the materials. The resulting macromolecules are released and recycled in the cytosol.
296
Cell Structure and Functions
Cytosol
Phagocytosis
Phagosome Plasma
membrane
Early
Late endosome
endosome
Phagophore Autophagosome
Fusion
(b)
Engulfing
cytoplasmic
components
Autolysosome Degradation of
cytoplasmic components
Lysosome
Some lysosomes participate in exocytosis. This enables cells to eliminate undigested contents. For most cells, this
seems to be a minor pathway, used only when the cells are stressed. Some cell types, however, contain specialized
lysosomes that have acquired the necessary machinery for fusion with the plasma membrane. Melanocytes (melanin-
producing cells) in the skin, for example, produce and store melanin pigments in their lysosomes. These pigment
containing melanosomes release their pigment into the extracellular space of the epidermis by exocytosis.
297
Cell Structure and Functions
3.10 Vacuoles
Most plants and fungal cells contain one or several very large, fluid-filled vesicles called vacuoles. They are
surrounded by single membrane called tonoplast and related to the lysosomes of animal cells, containing a variety
of hydrolytic enzymes, but their functions are remarkably diverse. Like a lysosome, the lumen of a vacuole has an
acidic pH, which is maintained by similar transport proteins in the vacuolar membrane. The plant vacuole contains
water and dissolved inorganic ions, organic acids, sugars, enzymes and a variety of secondary metabolites. Solute
accumulation causes osmotic water uptake by the vacuole, which is required for plant cell enlargement. This water
uptake generates the turgor pressure.
The vacuole is different from contractile vacuole. A contractile vacuole is an organelle involved in osmoregulation.
It pumps excess water out of the cell. It is found predominantly in protists (such as Paramecium, Amoeba) and in
unicellular algae (Chlamydomonas). It was previously known as pulsatile or pulsating vacuole.
3.11 Mitochondria
Mitochondria (term coined by C. Benda) are energy-converting organelles, which are present in virtually all eukaryotic
cells. They are the sites of aerobic respiration. They produce cellular energy in the form of ATP, hence they are
called ‘power houses’ of the cell. Mitochondria are membrane-bound mobile as well as plastic organelle. Each
mitochondrion is a double membrane-bound structure with outer and inner membranes. The space between the
outer and inner membranes is called intermembrane space. The outer membrane is fairly smooth. But the inner
membrane is highly convoluted; forming folds called cristae. The inner membrane is also very impermeable to
many solutes due to very high content of a phospholipid called cardiolipin. The cristae greatly increase the inner
membrane’s surface area. The two faces of this membrane are referred to as the matrix side (N-side) and the
cytosolic side (P-side). Inner membrane contains enzyme complex called ATP synthase (or F0-F1 ATPase or oxysome)
that makes ATP. The outer membrane protects the organelle, and contains specialized transport proteins such as
porin which allows free passage for various molecules into the intermembrane space of the mitochondria. Mitochondrial
porins, or voltage-dependent anion-selective channels (VDAC) allow the passage of small molecules across the
mitochondrial outer membrane.
Inner membrane
Outer membrane
Matrix
Intermembrane space
Figure 3.49 A mitochondrion has double-membraned organization and contains: the outer mitochondrial
membrane, the intermembrane space (the space between the outer and inner membranes), the inner
mitochondrial membrane, and the matrix (space within the inner membrane).
The matrix (large internal space) contains multiple copies of the dsDNA (as genetic material), mitochondrial ribosomes
(ranging from 55S-75S), tRNAs and various proteins. Mitochondrial dsDNA is mostly circular. The size of mitochondrial
DNA also varies greatly among different species.
298
This page intentionally left blank.
Cell Structure and Functions
3.12 Plastids
Plastids are double membrane bound semi-autonomous organelles present in all living plant cells and photosynthetic
protists. All plastids contain multiple copies of the dsDNA as genetic materials and 70S ribosomes for proteins
synthesis. Plastids differentiate from proplastids. Proplastids are inherited with cytoplasm of plant egg cells. As
immature plant cells differentiate, the proplastids develop according to the needs of the specialized cell: they can
become chloroplast, leucoplasts or chromoplasts. A collective term used for different kinds of organelles, all derived
from proplastids, is plastid. Chloroplast is the most important member of plastid family. It occurs in all photosynthetic
eukaryotes and acts as site of photosynthesis. It has a double membrane which encloses a fluid-filled region called
the stroma. Embedded in the stroma is a complex network of stacked sacs. Each stack is called a granum and each
of the flattened sacs which makes up the granum is called a thylakoid. The thylakoid membrane, that encloses a
fluid-filled thylakoid interior space, contains photosynthetic pigments. There are many grana in each chloroplast
(usually 10 to 100 grana) which are interconnected by unstacked stromal lamellae. The lipids of the thylakoid
membrane have a distinctive composition. About 80% lipids are uncharged mono- and digalactosyl diacylglycerol
and only about 10% are phospholipids.
}
}
Stroma
Thylakoid lumen
Stroma
Outer Inner
lamella
membrane membrane
Figure 3.52 The two envelope membranes enclose the stroma. The stacks of the thylakoid termed grana
are connected by tubes, forming a continuous thylakoid lumen.
In the dark grown plants, proplastids develop into etioplasts, which have a yellow chlorophyll precursor pigment
protochlorophyll instead of chlorophyll. When exposed to light, the etioplasts rapidly change into chloroplasts by
converting this precursor to chlorophyll.
Chromoplasts are plastids responsible for pigment synthesis and storage. They are rich in carotenoids and mainly
responsible for the yellow, orange, or red colors of many fruits and flowers, as well as of autumn leaves. Leucoplasts
are colorless (non-pigmented) plastids and act as storage organelles. Based on the kind of substance they store,
they are further classified into amyloplasts (for starch storage), elaioplasts (for fat storage) and proteinoplasts or
aleuroplasts (for storing and modifying proteins).
3.13 Peroxisome
Peroxisome (discovered by Christian de Duve in 1965) is a single membrane bound small organelle (approximately
0.5–1 μm in diameter) present in all eukaryotes. The term, peroxisome, was proposed by de Duve because it
produced and consumed hydrogen peroxide. Peroxisomes lack DNA and ribosomes. Thus, all peroxisomal proteins
(peroxisomal matrix and membrane proteins) are encoded by nuclear genes, synthesized on ribosomes present in
the cytosol and then incorporated into pre-existing peroxisomes.
The ability of peroxisomes to divide themselves suggests that the peroxisome may have had an endosymbiotic
origin similar to mitochondria. However, the localization of peroxisomal proteins to the endoplasmic reticulum and
the similarity of some peroxisomal proteins to those localized in the ER suggest an alternative hypothesis: that the
peroxisome was developed from the ER (de novo origin). Aspects of both views may be true. Most peroxisomal
membrane proteins are made in the cytosol by membrane free ribosomes and insert into the membrane of preexisting
ones. However, few others are first synthesized by membrane bound ribosomes of ER and then integrated into the
302
Cell Structure and Functions
ER membrane from where they may bud in specialized peroxisomal precursor vesicles. New peroxisome precursor
vesicles may then fuse with one another and begin importing additional peroxisomal proteins synthesized by membrane
free cytosolic ribosomes to grow into mature peroxisomes, which can enter into a cycle of growth and fission.
Like mitochondria, peroxisomes contain several oxidative enzymes, such as catalase, oxidases. Peroxisomal oxidases
transfer hydrogen atoms to molecular oxygen and form hydrogen peroxide. The enzyme catalase (a member of
the peroxidase family) present in the peroxisome uses the hydrogen peroxide to oxidize a variety of other substrates
such as phenols, formic acid, formaldehyde and alcohol by the peroxidation reaction.
Catalase
H2O2 + RH2 R + 2H2O
When excess hydrogen peroxide accumulates in the cell, catalase converts it to H2O through the reaction:
Catalase
H2O2 + H2O2 2H2O + O2
A major oxidative reaction carried out in peroxisomes is the β-oxidation. β-oxidation in mammalian cells occur
both in mitochondria and peroxisomes; in plant cells, however, this is exclusively found in peroxisomes. Peroxisomes
also have two important roles in plants – photorespiration and glyoxylate cycle. In photorespiration, 2-phosphoglycolate
produced by oxygenase activity of rubisco is metabolized into serine, CO2 and NH3. This pathway involves three
subcellular compartments, the chloroplasts, peroxisomes and mitochondria.
Glyoxysome is a specialized form of peroxisome present in some plant cells, mainly the cells of germinating
seeds. Glyoxysomes contain the enzymes of the glyoxylate cycle – which help to convert stored lipid into
carbohydrates that can be translocated throughout the young plant to provide energy for growth. In the glyoxylate
cycle, two molecules of acetyl-CoA produced by fatty acid breakdown are used to make succinic acid, which then
leaves the glyoxysome and is converted into glucose in the cytosol. The glyoxylate cycle does not occur in animal
cells, and animals are therefore unable to convert the fatty acids in fats into carbohydrates.
Targeting of peroxisomal proteins from cytosol to peroxisome synthesized by membrane free ribosomes
Transport of proteins from cytosol to peroxisomes occur post-translationally. Peroxisomal proteins synthesized on
cytosolic ribosomes are generally fold into their mature conformation in the cytosol before import into the organelle.
Proteins that are involved in peroxisome biogenesis, including peroxisome generation, division as well as matrix and
membrane protein import are called peroxins. At least 23 distinct peroxins participate in the import process,
which is driven by ATP hydrolysis. Proteins that are imported into the peroxisome have peroxisomal targeting
sequences–PTS1 and PTS2. The PTS1 is a tri- or tetrapeptide at the C-terminus. The consensus sequence of PTS1 is
(S/A/C)–(K/R/H)–(L/M). It was first characterized in catalase as a Ser-Lys-Leu sequence (SKL in one-letter code) at
the very C-terminus. PTS1 containing proteins are recognized by the cytoplasmic receptor Pex5 and are imported
into peroxisomes in their fully folded form. The PTS2 signal is a sequence of nine amino acids and can be located
near the N-terminus or internally and recognized by the soluble receptor Pex7. PTS2 exhibits the consensus
sequence (R/K)–(L/V/I)–X5–(H/Q)–(L/A). The importance of the import process in peroxisomes is dramatically
demonstrated by the inherited human disease, Zellweger syndrome. It is a rare, congenital disorder, characterized
by the reduction or absence of peroxisomes due to defect in importing proteins into peroxisomes.
3.14 Nucleus
The nucleus is the controlling center of eukaryotic cell. It contains most of the genetic materials of cell. Most
eukaryotic cells have one nucleus (uninucleate) each, but some have many nuclei (multinucleate) and certain cells,
such as mature red blood cells, do not have it. Paramecia (unicellular ciliate protozoa) have two nuclei – a macronucleus
and a micronucleus. Genes in the macronucleus control the everyday functions of the cell, such as feeding, waste
removal, and maintenance of water balance. Micronucleus controls the sexual reproduction.
Nuclei differ in size depending on the cell type. Most nuclei are spherical, but multilobed nuclei are also common,
such as those found in polymorphonuclear leukocytes or mammalian epididymal cells. A nucleus in G0 phase has
four components: Nuclear envelope, nucleolus, nuclear matrix and chromatins.
303
This page intentionally left blank.
Cell Structure and Functions
Intermediate filament proteins are classified into six major types based on their sequences and tissue distribution:
1. Occluding junctions
Occluding junctions seal cells together in an epithelium in a way that prevents even small molecules from leaking
from one side of the sheet to the other (i.e. forms permeability barrier across epithelial cell sheets). These junctions
are of two types– tight junction and septate junction.
Tight junctions (or zonula occludens) are cell-cell occluding junctions mediated by two major transmembrane
proteins-claudins and occludin. Claudins and occludins associate with intracellular peripheral membrane proteins
called ZO proteins. Tight junctions make the closest contact between adjacent cells and prevent the free passage
of molecules (including ions) across an epithelial sheet in the spaces between cells. They also maintain the polarity
of epithelial cells by preventing the diffusion of molecules between the apical and the basolateral regions of the
plasma membrane. Septate junctions are the main occluding junctions in invertebrates.
Lumen
Figure 3.67
Tight junctions allow cell
Tight sheets to serve as barriers to
junction solute diffusion. Schematic
drawing showing how a small
extracellular molecule present
on one side of an epithelial cell
sheet cannot traverse the tight
Cell 1 Cell 2 Cell 3 Cell 4 junctions that seal adjacent
cells together.
322
Cell Structure and Functions
2. Anchoring junctions
Anchoring junctions mechanically attach cells (and their cytoskeletons) to their neighbours or to the extracellular
matrix and perform the key task of holding cells together into tissues. It includes two main types of junctions–
adherens junction and desmosome.
Adherens junctions
Adherens junctions connect bundles of actin filaments from cell to cell or from cell to the extracellular matrix.
Adhesion belt (or zonula adherens): It is a cell to cell junction, mediated by actin filaments and proteins belonging
to the cadherin family. Adhesion belts are usually located near the apical surface, just below the tight junctions.
Focal contact (or adhesion plaque): It is a cell-matrix junction which is mediated by transmembrane adhesion
proteins of the integrin family and by actin filament.
Desmosomes
Desmosomes are buttonlike points of intercellular contacts which bond neighbouring cells together. It has a dense
cytoplasmic plaque which is composed of a mixture of intracellular attachment proteins, including plakoglobin and
desmoplakins. The cytoplasmic plaque is responsible for connecting the cytoskeleton to the transmembrane linker
proteins of the cadherin family of cell-cell adhesion molecules. Desmosomes contain two specialized cadherin
proteins, desmoglein and desmocollin. Through extracellular domains, cadherins are responsible for holding the
adjacent membranes together. Each plaque is associated with a thick network of keratin intermediate filaments (in
most epithelial cells) and desmin intermediate filaments (in heart muscle cells), which are attached to the surface of
the plaque.
Hemidesmosomes, or half-desmosomes, resemble desmosomes, but instead of joining adjacent epithelial cell
membranes, they connect the basal surface of epithelial cells to the underlying basal lamina- a specialized mat of
extracellular matrix at the interface between the epithelium and connective tissue. The transmembrane linker
proteins in hemidesmosomes belong to the integrin family of extracellular matrix receptors, rather than to the
cadherin family of cell-cell adhesion proteins used in desmosomes.
Plasma Plasma
membrane membrane
Cytoplasmic plaque Cytoplasmic plaque
Cytoplasmic plaque
Plasma
membrane
A. Desmosomes B. Hemidesmosomes
Figure 3.68 A. Schematic drawing of a desmosome. On the cytoplasmic surface of each interacting plasma
membrane is a cytoplasmic plaque. Each plaque is associated with a network of keratin intermediate filaments.
Transmembrane adhesion proteins, which belong to the cadherin family of cell-cell adhesion molecules,
bind to the plaques and interact through their extracellular domains to hold the adjacent membranes together.
B. Schematic drawing of a hemidesmosome, joining adjacent epithelial cell membranes to the underlying
basal lamina.
323
This page intentionally left blank.
Cell Structure and Functions
N C C
S S
S S
a
Fibrin
binding
domains
b g
N N
Heparin
binding
domains
RGD
Three stranded
Cell
coiled coil
binding
a-helical region
domains
Fibrin
binding
domains
C C
LG1 Collagen
binding
Integrin
LG2 domains
binding domain
LG3 Globular
region
Heparin and fibrin
a-dystroglycan LG4 binding
(Proteoglycan) domains
binding domain LG5
C N N
Figure 3.73 (A) Structure of laminin – a heterotrimeric glycoprotein. (B) A fibronectin molecule consists of
two nearly identical polypeptide chains joined by two disulfide bonds near their carboxyl ends. Each polypeptide
chain is folded into a series of globular domains linked by short, flexible segments. The globular domains
have binding sites for extracellular matrix components or for specific receptors on the cell surface. The cell-
binding domain contains the tripeptide sequence RGD (arginine-glycine-aspartate), which is recognized by
fibronectin receptors.
328
Cell Structure and Functions
noncovalent bonding between adjacent chains (18 to 24) within a cellulose microfibril gives this structure a high
tensile strength. Cellulose is also insoluble, chemically stable and relatively resistant to chemical and enzymatic
attack. Cellulose microfibril is synthesized by a plasma membrane bound enzyme complex – cellulose synthase.
Cellulose synthases in plants are encoded by a gene family named cellulose synthase A (CESA). In expanding
cells, the newly synthesized cellulose microfibrils are deposited parallel to cortical microtubules underlying the
plasma membrane.
Matrix polysaccharides
Cellulose microfibrils are embedded in a matrix consisting of proteins and polysaccharides. The major polysaccharides
of the matrix are synthesized by membrane-bound enzymes in the Golgi apparatus and are delivered to the cell
wall via exocytosis of tiny vesicles. Two major types of matrix polysaccharides are hemicelluloses and pectins.
Hemicelluloses (cross linking glycan) are a heterogeneous group of highly branched polysaccharides (such as
xyloglucan, arabinoxylan) that are hydrogen-bonded to the surface of cellulose microfibrils. Pectins are heterogeneous
group of polysaccharides, characteristically containing acidic sugars such as galacturonic acid. The pectins are gel-
forming components of the matrix. Pectin has roles in forming connections between plant cells, adjusting pH and ion
balance, recognizing foreign molecules to alert the cell to the presence of microbes and establishing cell wall
porosity.
Lignin
Lignin is a phenolic polymer. It is a highly branched polymer of three simple phenolic alcohols- coniferyl alcohol,
coumaryl alcohol, and sinapyl alcohol - known as monolignols. Precursors of lignin are synthesized from phenylalanine
and are secreted to the wall. It is insoluble in water and most organic solvents. As lignin forms in the wall, it
displaces water from the matrix and forms a hydrophobic network that bonds tightly to cellulose and prevents wall
enlargement. Lignin adds significant mechanical strength to cell walls and reduces the susceptibility of walls to
attack by pathogens.
Structural proteins
The cell wall also contains several classes of structural proteins. These proteins usually are classified according to
their predominant amino acid composition— for example, hydroxyproline-rich glycoprotein, glycine-rich protein,
arabinogalactan protein and proline-rich protein. With the exception of glycine-rich proteins, all are glycosylated
and contain hydroxyproline. Extensin, a major structural protein in the cell walls of higher plants, is a hydroxyproline-
rich glycoprotein. Cell walls also contain functional proteins such as expansin. It causes the pH-dependent extension
and stress relaxation of cell walls. The molecular basis for expansin action is still uncertain, but most evidence
indicates that expansins act by disrupting non-covalent interactions between wall polysaccharides.
Middle lamella
A thin layer of material, the middle lamella, is present at the junction, where the walls of neighboring cells come into
contact. It acts as cementing material. The composition of the middle lamella differs from the rest of the wall. It is
high in pectin (as calcium pectate) and may be complexed with hydroxyproline-rich glycoproteins.
329
Cell Structure and Functions
In animals, extracellular signaling by signal molecules can be classified into four categories—endocrine, paracrine,
autocrine and juxtacrine signaling.
In endocrine signaling, the signaling molecules act on target cells distantly located from their site of synthesis. It is
a long-range signaling in which signal molecule is transported by the blood stream.
In paracrine signaling, the signaling molecules released by a cell affect target cells only in close proximity. An
example of this is the action of neurotransmitters in carrying signals between nerve cells at a synapse.
In autocrine signaling, the signaling molecules produce an effect on same cell that produces it. One important
example of such is the response of cells of the vertebrate immune system to foreign antigens. Certain types of T-
lymphocytes respond to antigenic stimulation by synthesizing a growth factor that drives their own proliferation,
thereby increasing the number of responsive T-lymphocytes and amplifying the immune response.
In juxtacrine signaling, signal molecules do not diffuse from the cell producing it and cell bearing signal molecules
interact with receptor proteins of adjacent responding cells. Unlike other modes of cell signaling, juxtacrine signaling
requires physical contact between the cells involved. Notch signalling and classical cadherin signalling are examples
of juxtacrine signaling.
Endocrine signaling
Bloodstream
Signal
molecule Target cell
Paracrine signaling
Target cell
Autocrine signaling
Figure 3.74 Long-range signaling between cells is called endocrine when the signal molecule is transported
by the bloodstream (typical for hormones), paracrine when the signal diffuses between neighboring cells
across the extracellular matrix (typical for neurotransmitters and many so-called tissue hormones or local
mediators), and autocrine when the signal re-acts on the transmitter cell.
330
Cell Structure and Functions
Examples of signal molecules (hormones) that interact with cytosolic or nuclear receptor
Steroid hormones (Progesterone, Estradiol, Testosterone, Cortisol, Corticosterone, Aldosterone),
Steroid like hormone (α-ecdysone) and
Non-steroid hormones (Thyroid hormone and Retinoic acid).
Binding of extracellular signaling molecules to the cell surface receptor leads to increase (or decrease) in concentration
of low molecular weight intracellular signaling molecules termed secondary messengers. These low-molecular-
weight signaling molecules include cAMP, cGMP, diacylglycerol (DAG); inositol 1,4,5-trisphosphate (IP3 ),
phosphoinositides and calcium.
3.20.2 Receptors
The cellular response to a particular extracellular signal molecule depends on its binding to a specific receptor
located on the surface of a target cell or in its nucleus or cytosol. Receptors are chemically protein or glycoprotein
molecules which bind to signaling molecules (termed ligand). Binding of a ligand to its receptor causes a conformational
change in the receptor that initiates a sequence of reactions leading to a specific cellular response. Based on
location, receptors are classified into two broad categories - intracellular receptors and cell-surface receptors.
Intracellular receptors
Intracellular receptor proteins are located in the cytosol or the nucleus. These include receptors for steroid hormones,
thyroid hormones, retinoids and vitamin D as well as different “orphan” receptors. The intracellular receptors are
all structurally related and belong to the nuclear receptor superfamily. Within the cell, intracellular receptor – ligand
complex controls the activities of responsive genes. A large number of nuclear receptors have been identified
through sequence similarity to known receptors, but have no identified natural ligand, and are referred to as
nuclear orphan receptor. Nuclear receptors (NRs) are a family of highly conserved transcription factors that regulate
transcription in response to small lipophilic signal molecules. They are ubiquitous and unique to the animal kingdom.
Members of the nuclear-receptor superfamily can both positively and negatively regulate transcription.
Most members of the nuclear receptor superfamily consist of a N-terminal activation domain (also known as the
A/B region), a central DNA binding domain (DBD) and a C-terminal ligand binding domain (LBD). N-terminal
domain contains an AF-1 (activation function-1) sequence which functions as a ligand-independent transcriptional
activator. AF-1 is recognized by coactivators and/or other transcription factors. The DBD is comprised of two zinc-
331
This page intentionally left blank.
Cell Structure and Functions
Cells that do not divide enter into G0 state. Cells are able to enter reversible (quiescent) or irreversible (senescent
and terminally differentiated) G0 states. Most cells in our body are in G0 state. Quiescent state represent a
reversible resting state. Cells in this state remain metabolically active but no longer proliferate unless called on to
do so depending on the requirement of the organism. This state can remain for days, weeks, or even years before
resuming proliferation. Senescent cells are dysfunctional cells that have ceased proliferation and are perma-
nently withdrawn from the cell cycle. Senescence is by the irreversible loss of proliferative potential. Terminally
differentiated cells (e.g. mammalian skeletal muscle cells and nerve cells) are those cells that, in the course of
acquiring specialized functions, have irreversibly lost its ability to proliferate.
M phase
(mitosis)
G2 phase
(Gap 2)
Eukaryotic
G0
Cell cycle
G1 phase
(Gap 1)
S phase
(DNA synthesis)
Figure 3.94 The four successive phases of a standard eukaryotic cell cycle. During interphase the cell grows
continuously; during M phase it divides. DNA replication is confined to the part of interphase known as
S phase. G1 phase is the gap between M phase and S phase; G2 is the gap between S phase and M phase.
Cells in G1, if they have not yet committed themselves to DNA replication, can pause in their progress around
the cycle and enter a specialized resting state, often called G0.
Approximately 95% of the cell cycle is spent in interphase. The duration of the three stages (G1, S and G2) varies
from species to species, and also from cell to cell within a species. Although the length of all phases of the cycle is
variable to some extent, by far, the greatest variation occurs in the duration of G1. Its length is adjusted in response
to growth conditions. In most cells, the whole of M phase takes only about an hour, which is only a small fraction of the
total cycle time. For a typical rapidly proliferating human cell in culture with a total cycle time of approximately
24 hours, the G1 phase might last about 11 hours, S phase about 8 hours, G2 about 4 hours and M about 1 hour.
Figure 3.95 Change in number of chromosomes and amount of DNA during interphase.
355
This page intentionally left blank.
Cell Structure and Functions
G1 phase S phase
G2 phase S phase
G1 phase G2 phase
Figure 3.102 Cell fusion experiments. Cells in S phase were fused either to cells in G1 or to cells in G2.
When G1 cells were fused with S phase cells, the G1 nucleus immediately began to replicate DNA. In contrast,
when G2 cells were fused with S phase cells, only the S phase nucleus continued DNA replication. It therefore
appeared that the G2 nucleus had to pass through M and enter G1 before another round of DNA replication
could be initiated. This elegant series of experiments was the first indication in mammalian cells that the
sequential and unidirectional phases of the cell cycle are controlled by a series of chemical signals that can
diffuse freely between the nucleus and cytoplasm.
362
This page intentionally left blank.
Cell Structure and Functions
Types of meiosis
Meiosis occurs during sexual life cycle of all eukaryotes. Fertilization and meiosis alternate in sexual life cycles,
maintaining a constant number of chromosomes in each species from one generation to the next. Although the
alternation of meiosis and fertilization is common to all organisms that reproduce sexually, the timing of these
two events in the life cycle varies, depending on the species. Meiosis is of three types – gametic, zygotic and
sporogenic – depending on the stage, where it occurs during sexual cycle.
Diploid
(2n)
Gametic
meiosis
In the gametic meiosis, meiosis
occurs at the time of gametes
formation. Gametes are the only
Gametes (n)
(Male and Female) haploid cells. Gametes fertilize
to form the zygote. The diploid
zygote undergoes repeated
Fertilization
mitotic division to give organism.
This type of meiosis occurs in
(2n) most animals including human.
Zygote
Haploid
(n)
Sporophyte
(2n)
Plants and some species of algae
Sporic
meiosis exhibit sporogenic meiosis. In this
Zygote (2n)
case multicellular diploid body
(called the sporophyte) produces
Spores (n) haploid spores by meiotic division.
Fertilization
Unlike a gamete, a haploid spore
doesn't fuse with another cell but
Gametes (n) divides mitotically, generating a
(Male and Female) multicellular haploid body called
the gametophyte. Cells of the
(n)
gametophyte give rise to gametes
Gametophyte by mitosis. Fusion of two haploid
gametes at fertilization results
in a diploid zygote, which develops
into the next sporophyte generation.
380
Cell Structure and Functions
Stem cells
Stem cells are unspecialized (undifferentiated) cells that have the ability to differentiate into other cells and self-
regenerate. These cells divide to produce one daughter cell that remains a stem cell and one that divides and
differentiates. Because the division of stem cells produces new stem cells as well as differentiated daughter
cells, stem cells are self renewing populations of cells that can serve as a source for the production of differentiated
cells throughout life. Typically, stem cells generate an intermediate cell type or types before they achieve their
fully differentiated state.
The intermediate cell is called a precursor or progenitor cell. The ability to differentiate is the potential to
develop into other cell types. Depending on the ability to differentiate into other cell types, stem cells can be
classified as totipotent, pluripotent and multipotent stem cells. Totipotent stem cells are cells that can give rise
to a fully functional organism as well as to every cell type of the body. Pluripotent stem cells can differentiate
into nearly all cell types. Multipotent stem cells can differentiate into a limited number of closely related families
of cells.
RBCs
WBCs
Platelets
There are two broad types of stem cells: embryonic stem cells, which are isolated from the inner cell mass of
blastocysts, and adult stem cells, which are found in various tissues. Embryonic stem cells can become all cell
types of the body because they are pluripotent. An adult stem cell (also termed as somatic stem cell) is an
undifferentiated cell found among differentiated cells in a tissue or organ, can renew itself and differentiate to
yield the major specialized cell types of the tissue or organ. The primary roles of adult stem cells in a living
organism are to maintain and repair the tissue in which they are found. Unlike embryonic stem cells, which are
defined by their origin (the inner cell mass of the blastocyst), the origin of adult stem cells in mature tissues is
unknown. Most adult stem cells are multipotent. The bone marrow contains two kinds of stem cells. One population,
called hematopoietic stem cells, forms all the types of blood cells in the body. A second population called bone
marrow stromal cells generates bone, cartilage, fat and fibrous connective tissue. The adult brain also contains
stem cells that are able to generate the brain’s three major cell types—astrocytes and oligodendrocytes, which
are non-neuronal cells and neurons or nerve cells.
381
Cell Structure and Functions
3.23 Apoptosis
Apoptosis (from the Greek words apo = from and ptosis = falling,) is an energy dependent biochemical mechanism
of programmed cell death. It is a genetically programmed process occurs normally during embryogenesis,
metamorphosis and aging. For example, the differentiation of human fingers in a developing embryo requires the
cells between the fingers to initiate apoptosis so that the fingers can separate. Apoptosis also occurs as a defense
mechanism such as in immune reactions or when cells are damaged by disease or noxious agents. Although
apoptosis is the most common form of programmed cell death (PCD), there are several non-apoptotic programmed
cell death such as autophagy and necroptosis have also been reported.
The demise of cells by apoptosis is marked by a well-defined sequence of morphological changes. Apoptotic cells
become more compact, blebbing occur at the membranes, chromatin becomes condensed and DNA is fragmented.
During the early stage of apoptosis, cell shrinkage and pyknosis (i.e. chromatin condensation) occur. With cell
shrinkage, the cells becomes smaller in size with dense cytoplasm. Pyknosis is the most characteristic feature of
apoptosis. Later, extensive plasma membrane blebbing occurs and separation of cell fragments occurs in the form
of small membrane-bound apoptotic bodies by a process called budding. Apoptotic bodies consist of cytoplasm
with tightly packed organelles with or without a nuclear fragment. These bodies are subsequently phagocytosed by
macrophages or surrounding cells. Chemical changes in the surface of apoptotic cells or bodies allow the surrounding
cells or macrophages to recognize and engulf them. An especially important change occurs in the plasma membrane
of apoptotic cells. The negatively charged phospholipid phosphatidylserine is normally exclusively located in the
inner leaflet of the lipid bilayer of the plasma membrane, but it flips to the outer leaflet in apoptotic cells, where
it can serve as a marker of these cells. There is essentially no inflammatory reaction associated with the process
of apoptosis nor with the removal of apoptotic cells because:
1. apoptotic cells do not release their cellular constituents into the surrounding interstitial tissue;
2. they are quickly phagocytosed by surrounding cells and,
3. the engulfing cells do not produce anti-inflammatory cytokines.
Apoptosis Necrosis
Cell shrinkage and convolution Cell swelling
Pyknosis and karyorrhexis Karyolysis, pyknosis and karyorrhexis
Intact cell membrane Disrupted cell membrane
Cytoplasm retained in apoptotic bodies Cytoplasm released
No inflammation Inflammation usually present
Pyknosis (or karyopyknosis) is the irreversible condensation of chromatin in the nucleus of a cell undergoing
necrosis or apoptosis. Karyorrhexis is the nuclear fragmentation and karyolysis is the complete dissolution of the
chromatins.
Mechanisms of apoptosis
The mechanisms of apoptosis are highly complex and regulated, involving an energy-dependent cascade of molecular
events. There are multiple apoptotic pathways. These pathways are both caspase-dependent as well as caspase-
independent. The classical, caspase-dependent apoptosis is initiated either by extrinsic or intrinsic factors. There
are two main caspase-dependent apoptotic pathways:
382
Prokaryotes and Viruses
Problem
Several Hfr strains are mated with an auxotrophic F— strain by the interrupted mating technique. The order of
transfer for the loci are given in the table.
Strain 1 Strain 2 Strain 3 Strain 4
thr str his thy
lip thy thy his
trp his str trp
his trp ilv lip
thy lip thr thr
What is the order of the loci on the chromosome?
Solution
thr-lip-trp-his-thy-str-ilv
Problem
An F+ strain of E. coli gave rise to Hfr progeny by random integration of the F-factor into the circular chromosome
at many points such that the segregants transfer the genetic markers in different order. When six of the Hfr
segregants were checked for the order of the marker transfer to a recipient by interrupted mating experiments,
following results were obtained. What is the order of the markers?
Hfr segregant Order of marker transfer
1 PAQB
2 CZEF
3 EFBQ
4 FEZC
5 ZCWD
6 APDW
Solution
APDWCZEFBQAP
431
This page intentionally left blank.
Prokaryotes and Viruses
Hepatitis virus
Hepatitis is a liver inflammation commonly caused by an infectious agent. Hepatitis sometimes results in destruction
of functional liver anatomy and cells, a condition known as cirrhosis. Some forms of hepatitis may lead to liver
cancer. Although many viruses and a few bacteria can cause hepatitis, a restricted group of viruses is often
associated with liver disease termed hepatitis viruses. Hepatitis viruses are diverse, and none of these viruses are
genetically related, but all infect cells in the liver, causing hepatitis.
The genome of hepatitis B virus (hepadnavirus) is among the smallest known of any viruses, 3-4 kb. Like retroviruses,
hepatitis B virus uses reverse transcriptase during replication cycle. However, unlike retroviruses the DNA genome
of hepatitis B virus is replicated through an RNA intermediate, the opposite of what occurs in retroviruses. Hepatitis
D virus, classified as a hepatitis delta virus, is considered to be a subviral satellite because it can propagate only in
the presence of the hepatitis B virus. Transmission of hepatitis D virus can occur either via simultaneous infection
with hepatitis B virus (coinfection) or via infection of an individual previously infected with hepatitis B virus
(superinfection). The hepatitis D virus genome consists of a single stranded, negative sense, circular RNA.
RNA
Capsid
Figure 4.51 Tobacco mosaic virus has a rod-like appearance. Its capsid is made up of ~2130 capsomeres.
One molecule of genomic ssRNA, 6400 nucleotides long, present in the centre of the capsid. The capsomere
self-assembles into the rod like helical structure (16.3 capsomeres per helical turn) around the RNA.
463
Prokaryotes and Viruses
Figure 4.52 The normal prions (PrPc) have a large percentage of α-helix, but the abnormal forms (PrPsc)
have more β-pleated sheets.
Prions are novel transmissible agents causing a group of neuro-degenerative diseases that can be perpetuated by
inoculating animal with tissue extracts from infected one. Collectively, prion diseases are described as spongiform
encephalopathies. No prion diseases of plants are known. In 1997, American scientist Stanley B. Prusiner won
the Nobel Prize for this pioneering work with these diseases and with the prion proteins. Kuru was the first naturally
occurring spongiform encephalopathy of humans shown to be caused by prions. It was first described by Gajdusek
and Zigas in 1957. Kuru is characterized by cerebellar ataxia and a shivering-like tremor that produces complete
motor incoordination.
464
This page intentionally left blank.
Chapter 05
Immunology
Immunology is the science that is concerned with immune response to foreign challenges. Immunity (derived
from Latin term immunis, meaning exempt), is the ability of an organism to resist infections by pathogens or state
of protection against foreign organisms or substances. The array of cells, tissues and organs which carry out this
activity constitute the immune system. Immunity is typically divided into two categories—innate and adaptive immunity.
Physical barriers
Physical barriers are the first line of defense against microorganisms. It includes skin and mucous membrane. Most
organisms and foreign substances cannot penetrate intact skin but can enter the body if the skin is damaged.
Secondly, the acidic pH of sweat and sebaceous secretions and the presence of various fatty acids and hydrolytic
enzymes like lysozyme inhibit the growth of most microorganisms. Similarly, respiratory and gastrointestinal tracts
are lined by mucous membranes. Mucus membranes entrap foreign microorganisms. The respiratory tract is also
covered by cilia, which are hair like projections of the epithelial-cell membranes. The synchronous movement of
the cilia propels mucus-entrapped microorganisms out of these tracts. Similarly, the conjunctiva is a specialized,
mucus-secreting epithelial membrane that lines the interior surface of each eyelid. It is kept moist by the continuous
flushing action of tears (lacrimal fluid) from the lacrimal glands. Tears contain lysozyme, lactoferrin, IgA and thus
provide chemical as well as physical protection.
Microorganisms do occasionally breach the epithelial barricades. It is then up to the innate and adaptive immune
systems to recognize and destroy them, without harming the host. In the case of innate immune response, several
antimicrobial chemicals and phagocytic cells provide protection against pathogens.
Chemical mediator
A variety of chemicals mediates protection against microbes during the period before adaptive immunity develops.
The molecules of the innate immune system include complement proteins, cytokines, pattern recognition molecules,
acute-phase proteins, cationic peptides, enzymes like lysozyme and many others.
Complement proteins
The complement proteins are soluble proteins/glycoproteins that are mainly synthesized by liver and circulate in
the blood and extracellular fluid. They were originally identified by their ability to amplify and complement the
469
Immunology
action of antibodies; hence, the name complement. It also bridges innate and adaptive immunity and removes
immune complexes. The complement system is composed of over 30 serum proteins. Activation of complement
proteins in response to certain microorganisms results in a controlled enzymatic cascade, which targets the membrane
of pathogenic organisms and leads to their destruction.
Cytokines
The term cytokine is a generic term for any low molecular weight soluble protein or glycoprotein released by one
cell population which acts as an intercellular mediator. It includes monokines, lymphokines, interleukins, interferons
and others. Cytokines are required for immunoregulation of both innate as well as adaptive immune responses.
Interferons are cytokines made by cells in response to virus infection, which essentially induce a generalized
antiviral state in surrounding cells.
Chemokines are small, positively charged secreted proteins that have a central role in guiding the migrations of
various types of white blood cells. They bind to the surface of endothelial cells, and to negatively charged proteoglycans
of the extracellular matrix in organs. By binding to G-protein-linked receptors on the surface of specific blood cells,
chemokines attract these cells from the bloodstream into an organ, guide them to specific locations within the
organ, and then help stop migration.
Acute phase proteins are a heterogeneous group of plasma proteins mainly produced in the liver as the result of
a microbial stimulus. They include C-reactive protein (CRP), serum amyloid protein A (SAA) and mannose binding
protein (MBP). Cytokines (IL-1, IL-6, IL-8, etc.) released by macrophages upon activation by bacteria stimulate the
liver to rapidly produce acute-phase proteins. These proteins maximize activation of the complement system and
opsonization of invading microbes.
Cellular defenses
Many specialized cell types like neutrophils, macrophages, monocytes, natural killer cells participate in innate host
defense mechanisms. Once a pathogen evades the physical and chemical barriers, these specialized cells play an
important role in protection. Phagocytosis is a fundamental protective mechanism carried out by these cell types,
470
This page intentionally left blank.
Immunology
5.5 Antigens
Adaptive immune responses arise as a result of exposure to foreign compounds. The compound that evokes the
response is referred to as antigen, a term initially coined due to the ability of these compounds to cause antibody
responses to be generated. An antigen is any agent capable of binding specifically to T-cell receptor (TCR) or an
antibody molecule (membrane bound or soluble). The ability of a compound to bind with an antibody or a TCR is
referred to as antigenicity. There is a functional distinction between the term antigen and immunogen. An
immunogen is any agent capable of inducing an immune response and is therefore immunogenic. The distinction
between the terms is necessary because there are many compounds that are incapable of inducing an immune
response, yet they are capable of binding with components of the immune system that have been induced specifically
against them. Thus all immunogens are antigens, but not all antigens are immunogens.
1. Foreignness
The most important feature of an immunogen is that an effective immunogen must be foreign with respect to
the host. The adaptive immune system recognizes and eliminates only foreign (nonself) antigens. Self antigens
are not recognized and thus individuals are tolerant to their own self molecules, even though these same
molecules have the capacity to act as immunogens in other individuals of the same species.
2. Size
The second requirement for being immunogenic is that the compound must have a certain minimal molecular
weight. There is a relationship between the size of immunogen and its immunogenicity. In general, small
compounds with a molecular weight <1000 Da (e.g. penicillin, aspirin) are not immunogenic; those of molecular
weight between 1000 and 6000 Da (e.g. insulin, adrenocorticotropic hormone) may or may not be immunogenic;
and those of molecular weight >6000 Da (e.g. albumin, tetanus toxin) are generally immunogenic. The most
active immunogens tend to have a molecular mass of 100,000 Da or more. In short relatively small substances
have decreased immunogenicity, whereas large substances have increased immunogenicity.
3. Chemical complexity
The third characteristic necessary for a compound to be immunogenic is a certain degree of chemical complexity.
For example, homopolymers of amino acids or sugars are seldom good immunogens regardless of their size.
Similarly, a homopolymer of poly-γ-D-glutamic acid (the capsular material of Bacillus anthracis) with a molecular
weight of 50,000 Da is not immunogenic. The absence of immunogenicity is because these compounds, although
of high molecular weight, are not sufficiently chemically complex.
Virtually all proteins are immunogenic. Furthermore, the greater the degree of complexity of the protein, the
more vigorous will be the immune response to that protein. Carbohydrates are immunogenic only if they have
a complex polysaccharide structure or part of complex molecules such as glycoproteins. Nucleic acids and
lipids are poor immunogens by themselves, but they become immunogenic when they are conjugated to
protein carriers.
479
This page intentionally left blank.
Immunology
Class I Inhibitory
MHC receptor
–
No killing
+
Ligand
Activating
Normal cell receptor NK cell
Killing
+
Ligand
Activating
Altered self cell receptor NK cell
Figure 5.46 An activating receptor on NK-cells interacts with its ligand on normal and altered self cells,
inducing an activation signal that results in killing. However, interaction of inhibitory NK-cell receptors with
class I MHC molecules delivers an inhibition signal that counteracts the activation signal. Expression of class I
molecules on normal cells thus prevents their destruction by NK-cells. Because class I expression is often
decreased on altered self cells (virus infected cells and tumor cells), the killing signal predominates, leading
to their destruction.
5.13.1 Superantigens
Superantigens are viral or bacterial proteins that bind simultaneously to the variable domain of β of a T-cell
receptor (TCR) and to the α-chain of a class II MHC molecule (i.e. outside the peptide-binding groove). Because of
their unique binding ability, superantigens can activate large numbers of T-cells irrespective of their antigenic
specificity. Superantigens can be exogenous and endogenous. Exogenous superantigens are soluble proteins secreted
by bacteria whereas endogenous superantigens are cell-membrane proteins encoded by certain viruses that infect
mammalian cells.
b Ag
a
MHC TCR
a b
Superantigen
APC TH cell
Figure 5.47 Superantigen-mediated cross-linkage of T-cell receptor (TCR) and class II MHC molecules.
Superantigen binds to class II MHC molecule and a part of the Vβ chain of the T-cell receptor that is outside
the normal antigen-binding site and this binding is sufficient to trigger T-cell activation. A superantigen binds
to all TCRs bearing a particular V sequence regardless of their antigen specificity.
525
Immunology
5.14 Cytokines
Cytokines are low-molecular-mass (generally less than 30 kDa) soluble proteins/glycoproteins, non-immunoglobulin
in nature, secreted by a variety of cell types and act nonenzymatically through specific receptors to regulate host
cell function. They do not include the peptide and steroid hormones of the endocrine system. Cytokines play major
roles in the development of cellular and humoral immune responses, induction of the inflammatory response,
regulation of hematopoiesis, control of cellular proliferation and differentiation.
Cytokines can affect the same cell responsible for their production (an autocrine function) or nearby cells (a
paracrine function), or they can be distributed by the circulatory system to distant target cells (an endocrine
function). They are highly potent hormone-like substances, active even at femto molar concentration. However,
they differ from endocrine hormones as being not produced by glands but by widely distributed cells. Cytokines
produce biological actions only when they bind to specific, high-affinity receptors on the surface of target cells. The
biological activities of cytokines exhibit pleiotropy (a given cytokines that has different biological effect on different
target cells), redundancy (two or more cytokines that mediates similar functions), synergy (combined effect of two
cytokines on cellular activity is greater than the additive effect of the individual cytokines) and antagonism (effect
of one cytokines inhibit the effect of another cytokines).
REDUNDANCY
IL-2
Activated TH cells IL-4 B cell Proliferation
IL-5
SYNERGY
IL-4
Activated TH cells + B cell Induces class switch to IgE
IL-5
ANTAGONISM
Activated TH cells IL-4 B cell Blocks class switch of IgE induced by IL-4
IFN-g
Cytokines differ from hormones and growth factors. All three are secretory proteins that elicit their biological
effects at very low concentrations by binding to receptors on target cells. Growth factors tend to be produced
constitutively, whereas cytokines and hormones are secreted in response to discrete stimuli. Unlike hormones,
which generally act long range in an endocrine fashion, most cytokines act over a short distance in an autocrine or
paracrine fashion. In addition, most hormones are produced by specialized glands and tend to have a unique action
on one or a few types of target cell. In contrast, cytokines are often produced by, and bind to, a variety of cells.
There are over 100 different cytokines. The generic name of cytokines includes all proteins with a small molecular
weight, released by cells of the immune system, especially by monocytes and T-lymphocytes. But they are also
secreted by many cells in addition to those of the immune system, such as endothelial cells and fibroblasts. They used
to have different names depending either on their origin, such as lymphokines (produced by lymphocytes), monokines
(substances produced by monocytes or macrophages) or on their activity: chemokines, interleukins, interferons.
Cytokines may be grouped into following categories : hematopoietins, interleukins, interferons, chemokines and
members of the tumor necrosis factor (TNF) family.
526
This page intentionally left blank.
Immunology
Antibodies do not play a role in type IV hypersensitivity reactions. On activation, the TH1 cells release cytokines
that cause accumulation and activation of macrophages, which, in turn, cause local damage. The tuberculin skin
test is an example of a type IV hypersensitivity. This test is done by putting a small amount of tuberculin purified
protein derivative (PPD) under the top layer of skin. If any person has ever been exposed to the Mycobacterium
tuberculosis, skin will react to the antigens by developing a firm red bump at the site within 2 days. It is a standard
method of determining whether a person is infected with Mycobacterium tuberculosis.
Overview of Hypersensitivity
IgE-mediated
Type I hypersensitivity
Ab-mediated
IgG/IgM-mediated
Immediate
Type II hypersensitivity
Symptoms are manifested
within minutes or hours Ag-Ab mediated IgG-mediated
after exposure
Type III hypersensitivity
Hypersensitivity
5.17 Autoimmunity
The body is normally able to distinguish its own self-antigens from foreign nonself antigens and does not mount an
immunologic attack against itself. This phenomenon is called immune tolerance. Autoimmunity is a condition in
which structural or functional damage is produced by the action of immunologically competent cells or Ab against
self antigen. Autoimmunity literally means protection against self, but actually it implies injury to self, and therefore
sometimes the term is also under criticism.
Autoimmune disease results from the activation of self-reactive T and B-cells that, following stimulation by
genetic or environmental triggers, cause actual tissue damage. Four factors influence the development of autoimmune
disease. These factors are genetic, viral, hormonal and psycho-neuro-immunological (the influence of stress and
neurochemicals). All four of these factors can affect gene expression, which directly or indirectly interferes with
important immunoregulatory actions. Based on the site of involvement and nature of lesions autoimmune diseases
may be classified as hemocytolytic, localized (or organ specific), systemic (or non-specific) and transitory diseases.
Important examples of autoimmune diseases in human and their respective autoantigen are given below in the table.
535
Immunology
5.18 Transplantation
The immune system has evolved as a way of discriminating between self and non-self. This discriminating power of
the immune system between self and non-self is undesirable in the case of tissue transplant from one individual to
another for therapeutic purposes. Indeed, result of transplants culminates in the phenomenon of graft rejection.
Before the discussion about the immunological mechanisms associated with graft rejection, it is important to
understand the various gradations in relationship from donor to recipient.
Isograft : Graft between genetically identical individuals (syngeneic). In humans, an isograft (or syngraft) can
be performed between monozygotic twins.
Allograft : Transplants between genetically different individuals within a species.
Xenograft : A graft between individuals from different species.
Autograft : A graft or transplant from one body part to another on the same individual.
Transplanting tissue that is not immunologically privileged generates the possibility that the recipient’s cells will
recognize the donor’s tissue as foreign. This triggers the recipient’s immune mechanisms, which may destroy the
donor tissue. Such a response is called a graft rejection reaction. Some transplanted tissues do not stimulate an
immune response. For example, a transplanted cornea is rarely rejected because lymphocytes do not circulate into
the anterior chamber of the eye. This site is considered an immunologically privileged site. Another example of a
privileged tissue is the heart valve.
A tissue rejection reaction can occur by two different mechanisms. First, foreign class II MHC molecules on transplanted
tissue, or the graft is recognized by host T-helper cells, which aid cytotoxic T-cells in graft destruction. Cytotoxic
T-cells then recognize the graft through the foreign class I MHC molecules. This response is much like the activation
of CTLs by virally infected host cells. A second mechanism involves the T-helper cells reacting to the graft and
releasing cytokines. The cytokines stimulate macrophages to enter, accumulate within the graft, and destroy it. The
MHC molecules play a dominant role in the tissue rejection reaction because of their unique association with the
recognition system of T-cells.
SCID is a genetic disorder which is characterized by a very low number of circulating lymphocytes. Both arms
(B-cells and T-cells) of the adaptive immune system become non-functional. As such patients make neither specific
T-cell dependent antibody responses nor cell-mediated immune responses, and thus cannot develop immunological
memory. Several different defects can lead to the SCID phenotype. In X-linked SCID, which is the commonest
form of SCID, T-cells fail to develop because of defect in the genes code for several cytokine receptors, including
those for the interleukins IL-2, IL-4, IL-7, IL-9 and IL-15. The autosomally inherited SCID occurs due to adenosine
deaminase deficiency. Adenosine deaminase catalyzes conversion of adenosine to inosine, and its deficiency results
in accumulation of adenosine, which interferes with purine metabolism which result in an accumulation of nucleotide
metabolites that are particularly toxic to developing T-cells.
536
Immunology
Chediak-Higashi syndrome
DiGeorge syndrome
DiGeorge syndrome, or congenital thymic aplasia, is not hereditary but occurs sporadically and is result of a
deletion in chromosome 22. The syndrome is caused by defective migration of fetal neural crest cells into the third
and fourth pharyngeal pouch. DiGeorge syndrome in its most severe form is the complete absence of a thymus.
This developmental defect causes immunodeficiency along with hypoparathyroidism, and congenital heart disease.
The immune defect includes a profound depression of T-cell numbers and absence of T-cell responses. Although
B-cells are present in normal numbers, affected individuals do not produce antibody in response to immunization
with specific antigens.
537
This page intentionally left blank.
Immunology
5.21 Vaccines
An individual may be exposed to an antigen to induce formation of antibodies, a type of immunity known as artificial
active immunity. The material used to induce artificial active immunity, the antigen or a mixture of antigens, is
known as a vaccine (or an immunogen), and the process of generating such an immune response is immunization.
Immunization is commonly known as vaccination. The term vaccine has been derived from the Latin word vaccinus
meaning from cows. It can be defined as a nontoxic or non virulent preparation of antigenic material that can be
used to induce long term humoral as well as a cell mediated immune response against pathogens.
In principle anything from whole organisms to small subcellular fragment can be used as antigen in vaccines. Most
current vaccines in use for humans consist of whole organisms, described as whole organism vaccine, which
include live but attenuated organisms or killed organisms and purified antigen vaccines (toxoid, capsular
polysaccharide and recombinant microbial antigens). However, recent advances in molecular biology had provided
alternative methods for producing vaccines such as DNA vaccines and recombinant vector vaccines.
These vaccines are prepared by attenuating pathogenic organisms by growing them in unfavorable conditions
which result in gene mutations due to which organism looses pathogenicity but retains their capacity for transient
growth. Attenuated vaccines have advantages as well as disadvantages. Due to their capacity for transient growth,
these vaccines show prolonged immunogenicity and eliminate the need for repeated boosters. But a major
disadvantage of these vaccines is the associated risk of reverting back to virulent form. However these days
genetic engineering is also used to cause site directed mutation causing irreversible removal of virulence genes
from attenuated organisms making them safe for use.
The Sabin polio vaccine is an example of attenuated vaccine, consisting of three attenuated strains of poliovirus.
Sabin vaccine in the intestines induces production of secretory IgA, which serves as an important defense against
naturally acquired poliovirus.
539
Immunology
DNA vaccines
DNA vaccines, also known as genetic vaccines, use the genetic material of the pathogen itself to immunize the
individual. In this class of vaccine, fragments of the pathogen’s genome encoding antigenic proteins are injected
directly into the host cells where they can integrate into the chromosomal DNA or exist as episomes. Expression of
genes within the host generate foreign proteins to which the host immune system responds. Hence in DNA vaccine,
an immune response is made against the protein encoded by the vaccine DNA. The DNA itself is not immunogenic.
DNA vaccines induce both humoral and cell mediated immunity.
Antigenic protein
coded by plasmid
Humoral
Gene for immunity
antigenic
protein DNA
vaccine Antigenic
peptides
Figure 5.59 DNA vaccines and humoral and cell mediated immunity.
540
This page intentionally left blank.
Chapter 06
Genetics
All living organisms reproduce. Reproduction results in the formation of offspring of the same kind. However, the
resulting offspring need not and, most often, does not totally resemble the parent. Several characteristics may
differ between individuals belonging to the same species. These differences are termed variations. The mechanism
of transmission of characters, resemblances as well as differences, from the parental generation to the offspring,
is called heredity. The scientific study of heredity, variations and the environmental factors responsible for these, is
known as genetics (from the Greek word genno = give birth). The word genetics was first suggested to describe the
study of inheritance and the science of variation by prominent British scientist William Bateson.
Genetics can be divided into three areas: classical genetics, molecular genetics and evolutionary genetics. In
classical genetics, we are concerned with Mendel’s principles, sex determination, sex linkage and cytogenetics.
Molecular genetics is the study of the genetic material: its structure, replication and expression, as well as the
information revolution emanating from the discoveries of recombinant DNA techniques. Evolutionary genetics is the
study of the mechanisms of evolutionary change or changes in gene frequencies in populations (population genetics).
Classical genetics
6.1 Mendel’s principles
Gregor Johann Mendel (1822–1884), known as the Father of Genetics, was an Austrian monk. In 1856, he published
the results of hybridization experiments titled Experiments on Plant Hybrids in a journal “The proceeding of the
Brunn society of natural history” and postulated the principles of inheritance which are popularly known as Mendel’s
laws. But his work was largely ignored by scientists at that time. In 1900, the work was independently rediscovered
by three biologists - Hugo de Vries of Holland, Carl Correns of Germany and Erich Tschermak of Austria. Mendel did
a statistical study (he had a mathematical background). He discovered that individual traits are inherited as discrete
factors which retain their physical identity in a hybrid. Later, these factors came to be known as genes. The term
was coined by Danish botanist Wilhelm Johannsen in 1909. A gene is defined as a unit of heredity that may
influence the outcome of an organism’s traits.
Mendel’s experiment
Mendel chose the garden pea, Pisum sativum, for his experiments since it had the following advantages.
1. Well-defined discrete characters
2. Bisexual flowers
3. Predominant self fertilization
4. Easy hybridization
5. Easy to cultivate and relatively short life cycle
545
Genetics
Flower color is positively correlated with seed coat colors. Seeds with white seed coats were produced by plants
that had white flowers and those with gray seed coats came from plants that had violet flower.
Allele
Each gene may exist in alternative forms known as alleles, which code for different versions of a particular inherited
character. We may also define alleles as genes occupying corresponding positions on homologous chromosomes
and controlling the same characteristic (e.g. height of plant) but producing different effects (tall or short). The term
homologous refers to chromosomes that carry the same set of genes in the same sequence, although they may
not necessarily carry identical alleles of each gene.
2. Heterozygous genotypes possess one of each allele for a particular trait (Tt).
546
This page intentionally left blank.
Genetics
Phenocopy
A phenotype that is not genetically controlled but looks like a genetically controlled one is called phenocopy. It is an
environmentally induced phenotype that resembles the phenotype determined by the genotype. An example of a
phenocopy is Vitamin-D-resistant rickets. A dietary deficiency of vitamin D, for example, produces rickets that is
virtually indistinguishable from genetically caused rickets.
6.1.6 Probability
The chance that an event will occur in the future is called the event’s probability. For example, if you flip a coin, the
probability is 0.50, or 50%, that the head side will be showing when it lands. The probability depends on the number
of possible outcomes. In this case, there are two possible outcomes (head and tail), which are equally likely. This
allows us to predict that there is a 50% chance that a coin flip will produce a head. The general formula for the
probability is:
A probability calculation allows us to predict the likelihood that an event will occur in the future. The accuracy of this
prediction, however, depends to a great extent on the size of the sample.
In genetic problems, we are often interested in the probability that a particular type of offspring will be produced.
For example, when two heterozygous tall pea plants (Tt) are crossed, the phenotypic ratio of the offspring is
3 tall : 1 dwarf. This information can be used to calculate the probability for either type of offspring:
The probability of obtaining a tall plant is 75% and a dwarf plant 25%. When we add together the probabilities of all
the possible outcomes (tall and dwarf), we should get a sum of 100% (here, 75% + 25% = 100%).
There are two basic laws of probability that are used for genetic analysis. The first law, the multiplicative law
(product rule) of probability, states that the chance of two or more independent events occurring together is the
product of the probability of the events occurring separately. Independent events are events whose outcomes do
not influence one another. This is also known as the and rule. The product rule can be used to predict the
probability of independent events that occur in a particular order.
555
This page intentionally left blank.
Genetics
r y
R Y
Meiosis I
r Y r y
r Y r y
R y R Y
R y R Y
r R r R
r R r R
Y y y Y
Y y y Y
Meiosis II
Y r Y r y R y R y r y r Y R Y R
Figure 6.7 Random alignment of bivalents during prophase of meiosis I explains Mendel’s law of independent
assortment.
559
Genetics
and Punnett by considering the involvement of two different (non-allelic) genes; because of the F2 9 : 7 ratio is a
variation of the 9 : 3 : 3 : 1 ratio. Let us consider the formation of the purple pigment in which products of two
different genes are involved.
Genotype Genotype
(CC or Cc) (PP or Pp)
Enzyme A Enzyme B
Colorless precursor Colorless intermediate Purple pigment
(Anthocyanin)
In the above pathway, a colorless precursor molecule must be acted on by two different enzymes to produce the
purple pigment. Gene C encodes a functional enzyme A, which converts the colorless precursor into a colorless
intermediate and finally gene P encodes enzyme B, which gives purple color by converting colorless intermediate.
If any of these two genes will be in homozygous recessive condition (cc or pp) then the purple color will not appear.
Thus the genotype cc can hide or mask the phenotype expression of genotype PP or Pp.
CP Cp cP cp
The purple color appears only when dominant alleles of both genes are present. When one or both genes have only
recessive alleles, the color will be white.
Epistasis
The term epistasis (Greek for standing upon) describes a type of gene interaction when one gene masks or
modifies the expression of another gene at distinct locus. Any gene that masks the expression of another non-allelic
gene is epistatic to that gene. The gene suppressed is hypostatic. In the pathway discussed for the formation of
purple color, when either is homozygous recessive (cc or pp) that gene is epistatic to the other.
Epistasis is different from dominance. Epistasis is the interaction between different genes (non-alleles). Dominance
is the interaction between different alleles of the same gene i.e. intraallelic.
560
This page intentionally left blank.
Genetics
AABB(1), AABb(2), AaBB(2) These have at least one functional allele A and convert all the substrates to
AaBb(4), AAbb(1), Aabb(2) purple product.
(Purple 12)
aaBB(2), aaBb(1) Lack any functional enzyme A, but have a functional enzyme B, which converts
(Red 3) the substrate to a red product.
aabb(1) Have no functional enzymes and cannot synthesize any colored pigment.
(White 1)
Explanation : Let us take the following case, in which F2 phenotypic ratio is 9 Purple : 3 Red : 4 White.
Parent 1 Parent 2
AA bb aa BB
(Red) (White)
F1
AaBb
(Purple)
F2
9 purple : 3 red : 4 white
In this example, the biochemical pathway would again be a simple chain, but the product of enzyme A would be red
in color.
Enzyme A Enzyme B
White substance Red product Purple product
AABB(1), AaBB(2), AABb(2), AaBb(4) have at least one functional copy of both A and B and therefore can
(Purple 9) synthesize the purple pigment.
AAbb(2), Aabb(1) have only functional enzyme A and produce red pigment but do not
(Red 3) convert it to purple pigment.
aaBB(2), aaBb(1) have no functional enzyme A and so cannot synthesize the red product
that is the substrate for enzyme B and will remain white.
aabb(1) have no functional enzymes and cannot synthesize the purple pigment.
(White 4)
562
Genetics
Precursor Product
(Colorless) (Colored)
Enzyme B
(Product of gene B)
Product of gene A
Precursor Malvidin
(Colorless) × (Colored)
Product of gene B
563
This page intentionally left blank.
Genetics
6.6.4 Mosaicism
Mosaicism is a condition in which cells within the same individual have a different genetic makeup. Individuals
showing mosaicism are referred to as mosaics. Mosaicism can be caused by DNA mutations, epigenetic alterations
of DNA, chromosomal abnormalities (change in chromosome number and structure) and the spontaneous reversion
of inherited mutations. Mosaicism can be associated with changes in either nuclear or mitochondrial DNA. An
individual with two or more cell types, differing in chromosome number or structure is either a mosaic or a chimera.
If the two cell types originated from a single zygote, the individual is a mosaic, and when originated from two or
more zygotes that subsequently fused, the individual is a chimera.
Mosaicism can exist in both somatic cells (somatic mosaicism) and germ line cells (germline mosaicism). As their
names imply, somatic and germ line mosaicism refer to the presence of genetically distinct groups of cells within
somatic and germ line tissues, respectively. If the event leading to mosaicism occurs during development, it is
possible that both somatic and germ line cells will become mosaic. In this case, both somatic and germ line tissue
populations would be affected, and an individual could transmit the mosaic genotype to his or her offspring.
Conversely, if the triggering event occurs later in life, it could affect either a germ line or a somatic cell population.
If the mosaicism occurs only in a somatic cell population, the phenotypic effect will depend on the extent of the
mosaic cell population; however, there would be no risk of passing on the mosaic genotype to offspring. On the
other hand, if the mosaicism occurs only in a germ line cell population, the individual would be unaffected, but the
offspring could be affected.
How is somatic mosaicism generated? There are many possible reasons, including somatic mutations, epigenetic
changes in DNA, alterations in chromosome structure and/or number, and spontaneous reversal of inherited mutations.
In all of these cases, a given cell and those cells derived from it could exhibit altered function.
585
This page intentionally left blank.
Genetics
In X-linked inheritance, the pattern of inheritance for loci on the heteromorphic sex chromosome differs from the
pattern for loci on the homomorphic autosomal chromosomes, because sex chromosome alleles are inherited in
association with the sex of offspring. Alleles on a male’s X-chromosome go to his daughters, but not to his sons,
because the presence of his X-chromosome normally determines that his offspring is a daughter. Since the father
passes a trait to his daughters, who passes it to their sons. Hence, this pattern of inheritance is known as criss-cross
pattern of inheritance. In Drosophila, eye color has nothing to do with sex determination, so we see that genes on
the sex chromosomes are not necessarily related to sexual function. The same is true in humans, for whom
pedigree analysis has revealed many X-linked genes, of which few could be constructed as being connected to
sexual function.
Genotype Phenotype
Male Female
BB Bald Bald
Bb Bald Non-bald
bb Non-bald Non-bald
In contrast, a heterozygous female will not be bald. Women who are homozygous for the baldness allele will
develop the trait. Sex influence nature of pattern baldness appears to be related to the levels of the male sex
hormones.
587
This page intentionally left blank.
Genetics
6.9 Cytogenetics
A chromosome is an organized structure of DNA and protein that is found in the nucleus of a eukaryotic cell. The
study of the structure, function and abnormalities of chromosome is called cytogenetics, a discipline that combines
cytology with genetics.
Tijo and Levan (1956) of Sweden found that human cells have 23 pairs or 46 chromosomes. Of the 23 pairs, 22 are
perfectly matched in both males and females, and are called autosomes. The remaining pair, the sex chromosomes,
consists of two similar chromosomes in females and two dissimilar chromosomes in males. In human, females are
designated XX and males XY. The largest autosome is number 1, and the smallest is number 21.
Denver system
According to ‘Denver system’ of classification, the 22 pairs of human chromosomes are placed in seven groups as;
Group Position of centromere Idiogram number
I (A) Metacentric or submetacentric 1, 2, 3
II (B) Submetacentric 4, 5
III (C) Submetacentric 6, 7, 8, 9, 10, 11, 12 and X
IV (D) Acrocentric 13, 14 and 15
V (E) Metacentric or submetacentric 16, 17 and 18
VI (F) Metacentric 19 and 20
VII (G) Metacentric 21, 22 and Y
601
Genetics
Male Female
1 2 3 4 5 1 2 3 4 5
6 7 8 9 10 6 7 8 9 10
11 12 13 14 15 11 12 13 14 15
16 17 18 19 20 16 17 18 19 20
21 22 XY 21 22 XX
C-banding Denature with barium hydroxide and then stain with Dark bands contain constitutive
Giemsa. C stands for Constitutive heterochromatin. heterochromatin
602
This page intentionally left blank.
Genetics
Molecular genetics
6.10 Genome
Genome is the sum total of all genetic material of an organism which store biological information. The nature of the
genome may be either DNA or RNA. All eukaryotes and prokaryotes always have a DNA genome, but viruses may
either have a DNA genome or RNA genome. The eukaryotic genome consists of two distinct parts: Nuclear genome
and organelles (mitochondrial and chloroplast) genome. The nuclear genome consists of linear dsDNA. In a few
lower eukaryotes, double-stranded circular plasmid DNA (for example, 2-micron circle in yeast) is also present
within the nucleus.
The amount of DNA present in the genome of a species is called a C-value, which is characteristic of each species.
The value ranges from <106 bps as in smallest prokaryote, Mycoplasma to more than 1011 bps for eukaryotes such
as amphibians. The genomes of higher eukaryotes contain a large amount of DNA.
Flowering plants
Mammals
Reptiles
Birds
Amphibians
Fish
Echinoderms
Insects
Worms
6 7 8 9 10 11
10 10 10 10 10 10
Figure 6.48 The DNA content of the haploid genome of a range of phyla. The range of values within a phylum
is indicated by the shaded area.
The DNA content of the organism’s genome is related to the morphological complexity of lower eukaryotes, but
varies extensively among the higher eukaryotes. In lower eukaryotic organisms like yeast, amount of DNA increases
with increasing complexity of organisms. However, in higher eukaryotes there is no correlation between increased
genome size and complexity. This lack of correlation between genome size and genetic complexity refers to
C-value paradox. For example, a man is more complex than amphibians in terms of genetic development, but
some amphibian cells contain 30 times more DNA than human cells. Moreover, the genomes of different species of
amphibians can vary 100-fold in their DNA contents.
613
Genetics
S. cerevisiae (yeast) 12
A. thaliana (mustard plant) 120
D. melanogaster (fruit fly) 170
H. sapiens (human) 3,300
H. vulgare (barley) 5,300
dC 2
= − kC
dt
where k is the second-order rate constant. C is the concentration of single-stranded DNA at time t and the second
order rate equation for two complementary strands coming together is given by the rate of decrease in C.
Starting with a concentration, C0, of completely denatured DNA at t=0, the amount of single-stranded DNA remaining
at some time t is
C 1
=
C0 (1 + k.C0.t)
The time for half of the DNA to renature (when C/C0 = 0.5) is defined as t = t1/2. Then,
1
0.5 = and thus 1 + k.C0.t1/2 = 2, yielding
(1 + k.C0.t1 / 2 )
1
C0.t1 / 2 =
k
The product of C0 × t1/2 is called the Cot1/2. It is inversely proportional to the rate constant. Since the Cot1/2 is the
product of the concentration and time required to proceed halfway, a greater Cot1/2 implies a slower reaction. The
renaturation of DNA usually is followed in the form of a Cot curve. A graph of the fraction of single-stranded DNA
reannealed (1 – C/C0) as a function of Cot on a semilogarithmic plot is referred to as a Cot curve.
5 6
Genome size 1 3500 1.7×10 4.2×10 bp
100%
Fraction
reassociated
0 –6 –4 –2 2 Figure 6.49
10 10 10 1 10
Cot curve of dsDNA
–6
2×10 8×10
–3
3×10
–1
9 Cot1/2 from the indicated source.
614
This page intentionally left blank.
Genetics
Centromere
The centromere is a constricted region of a eukaryotic chromatin/chromosome where the kinetochore is assembled
and sister chromatids are held together. Although this constriction is termed as centromere, it is usually not located
exactly in the center of the chromosome and, in some cases, is located almost at the chromosome’s end. The
regions on either side of the centromere are referred to as the chromosome’s arms. Kinetochore associated with
the centromere is a complex of proteins where spindle fibers attach to the chromosome during mitosis/meiosis and
help in the proper segregation of sister chromatids or homologous chromosomes. The centromere has no defined
DNA sequence. It typically consists of large arrays of tandemly repeated DNA sequences. In humans, the centromeric
sequences are made up of 171 bp repeating unit and are called alphoid DNA. In the yeast, Saccharomyces cerevisiae,
the centromeric sequence (CEN) is about 110 bp long and it consists of three types of sequence element:
• CDE-I - 9 bp sequence;
• CDE-II - >90% A·T-rich sequence of 80–90 bp;
• CDE-III - 11 bp highly conserved sequence.
TC A C ATG AT TG ATTTC C G A A
A G TG TA C TA A C TA A A G G C TT
{
{
{
633
This page intentionally left blank.
Genetics
X-chromosome
Right arm of
chromosome 2
Chromocenter
Left arm of
chromosome 2
Figure 6.78 A light micrograph of polytene chromosomes present in Drosophila salivary glands. Each parental
chromosome is tightly paired with its homologue. All the chromosomes are linked together by the pericentromeric
region to create a single chromocenter. Under light microscope, distinct alternating dark bands and light
bands (known as interbands) are visible.
644
Genetics
The loop as shown in the figure 6.79 is an extruded segment of DNA that is being actively transcribed. The lateral
loops extend in pairs, one from each sister chromatid. The loops are surrounded by a matrix of ribonucleoproteins
that contain nascent RNA chains. Lampbrush chromosomes are thought to assist in fulfilling the high demand for
transcripts during oogenesis.
Maternal chromosome
Paternal chromosome
Chromomere
Enlarged section of
a chromosome
Chromatin loop
Chromatin
loop
Sister chromatids
Chromomere
Figure 6.79 Lampbrush chromosome structure. Most of the DNA in each chromosome remains highly
condensed in the chromomeres. Each of the two chromosomes shown consists of two closely apposed sister
chromatids. This four stranded structure is characteristic of diplotene stage of meiosis.
6.11.6 B-chromosomes
The B-chromosomes (also referred to as supernumerary or accessory chromosomes) are additional (extra)
chromosomes that are present in some individuals in some species. In eukaryotic cells normal chromosomes are
termed as A-chromosomes. Most B-chromosomes are mainly or entirely heterochromatic and genetically inert.
They are thought to be selfish genetic elements with no defined functions. The evolutionary origin of B-chromosomes
is not clear, but presumably they must have been derived from heterochromatic segments of normal A-chromosomes.
645
Genetics
Figure 6.80 A. In conservative model, after one round of replication two daughter dsDNA molecules form.
In which one daughter molecule contains both parental DNA strands and the other daughter molecule contains
two newly synthesized DNA strands. B. In semiconservative model, the two parental DNA strands separate
and each of those strands then serves as a template for the synthesis of a new DNA strand. The result is two
DNA double helices, both of which consist of one parental and one new strand. C. In dispersive model, the
parental double helix is broken into double-stranded DNA segments. The segments then reassemble into
complete DNA double helices, each with parental and all newly-synthesized dsDNA segments interspersed.
646
This page intentionally left blank.
Genetics
Topoisomerase
A DNA topoisomerase is a nuclease that breaks a phosphodiester bond in a DNA strand. This reaction is reversible,
and the phosphodiester bond reforms as the enzyme leaves. The first DNA topoisomerase was discovered by
James Wang in 1971 from E. coli. There are several types of topoisomerases present in eukaryotes and
prokaryotes. All topoisomerases can be classified into two classes– type I and type II, depending on whether
they cleave one or two strands of DNA, respectively. Type I topoisomerases cleave one DNA strand and pass
other strand through the break before resealing it, while type II topoisomerases cleave both DNA strands and
pass another double strand through the break followed by resealing of the double strand break. Enzymes with
an odd Roman numeral after their name (for example, topo I and topo V) fall into the type I class, whereas
those with an even Roman numeral after their name are type II. Type I topoisomerases do not require ATP for
activity; the reaction is driven by the energy stored in the supercoiled DNA. So far, the only exception is reverse
gyrase, which introduces positive supercoils with the aid of ATP hydrolysis. Type II topoisomerases also do not
require an external source of energy for the cleavage and religation during reaction, but they do utilize ATP
hydrolysis to drive conformational changes in the protein during the reaction cycle.
All topoisomerases contain a nucleophilic tyrosine, which they use to promote strand cleavage. The tyrosyl
oxygen attacks and breaks phosphodiester bond and at the same time forming a covalent phosphotyrosine
bond. Rejoining of the DNA strand occurs by a second transesterification reaction, which is basically the reverse
of the first.
Type I topoisomerases operate by forming a transient phosphotyrosine covalent bond with one end of the
broken DNA strand, either the 5' or the 3' end, followed by passage of the unbroken strand through the break,
and ultimately resealing of the break. These topoisomerases can be further divided into two subfamilies: Type
IA and Type IB topoisomerases. During DNA hydrolysis, type IA topoisomerases covalently bind 5’-phosphate,
whereas type IB enzymes form a covalent bond with 3’-phosphate. Type IA topoisomerases pass a single-stranded
DNA segment through a transient break in a second single DNA strand. On the contrary, type IB topoisomerases
nick one DNA strand, allowing one duplex end to rotate with respect to the other around the remaining
phosphodiester bond.
dsDNA dsDNA
Type IA Type IB
Tyr Tyr
5’ P 3’ 3’ P 5’
3’ HO 5’ 5’ HO 3’
Figure 6.85 Type IA topoisomerases effect topological changes in DNA through a ‘strand passage’
mechanism, in which one strand of dsDNA is cleaved and the second DNA strand is passed through the gap.
After passage of the second DNA strand, the broken strand is resealed. Type IB topoisomerases effect supercoil
relaxation by nicking a single strand of dsDNA and allowing one DNA strand end to rotate with respect to the
other around the intact phosphodiester bond on the opposing strand.
Type IA topoisomerases comprise three distinct classes– eubacterial Topo IA, eubacterial and eukaryotic Topo III
and eubacterial and archaeal reverse gyrase. These enzymes are primarily responsible for relaxing positively
or negatively supercoiled DNA, except for reverse gyrase, which can introduce positive supercoils into DNA.
Type IB topoisomerases appear to be represented by a single family member (Topo IB).
651
Genetics
Type II topoisomerases make a transient double-strand break in the DNA helix and form a covalent linkage
to both strands of the DNA helix at the same time. During catalysis, the enzyme introduces a double strand
break in one DNA, termed the G-segment, and pass a second DNA segment termed the T-segment through the
transient break.
G-segment T-segment
ATP ATP
2 ATP
2 ADP + 2 Pi
Figure 6.86 General mechanism of type II topoisomerases. Type II topoisomerases cleave both strands of
a dsDNA and pass another dsDNA through the transient break. The enzyme binds to and bends the G-segment.
The binding of two ATP molecules, allowing the opening of the DNA gate and passage of the T-segment. The
T-segment is dissociated from the bottom of the enzyme, leading to unidirectional strand-passage. Hydrolysis
of the ATP molecules dissociates the ATPase domains and resets the enzyme.
There are two subfamilies of type II topoisomerases– type IIA topoisomerases and type IIB topoisomerases.
Type IIA topoisomerases are found throughout all cellular organisms, as well as in some viruses and can be
divided into three classes– eukaryotic topoisomerase II (topo II), bacterial topoisomerase IV (topo IV) and
bacterial and archaeal DNA gyrase. Type IIB topoisomerases include topo VI from plants and homologues of
Spo11 present in Saccharomyces cerevisiae. Both type IIA and type IIB topoisomerases use a duplex strand
passage mechanism and have the same ATPase and cleavage domains but differ in overall tertiary structure.
Both type I and type II topoisomerases change the linking number of DNA. Type IA topoisomerases change the
linking number by one and type IB topoisomerase change the linking number by any integer, while type IIA and
type IIB topoisomerases change the linking number by two.
Problem
Some viruses, like SV40, are closed circular DNAs carrying nucleosomes. If the SV40 virus is treated with
topoisomerase, and the histone is then removed, it is found to still be supercoiled. However, if histones are
removed before topoisomerase treatment, the DNA is relaxed. Explain.
Solution
The DNA makes 1.75 left-hand super helical turns about each nucleosome. These turns are ‘constrained’ and
cannot be removed while the histones are still present. However, if the histones are first removed, the DNA, so
produced, will have unconstrained supercoils, which can be relaxed by topoisomerase.
652
This page intentionally left blank.
Genetics
5 5
T T UV-B
6 6
Another example is the repair of O6-methylguanine, which forms in the presence of alkylating agents and is a
common and highly mutagenic lesion. It tends to pair with thymine rather than cytosine during replication. Direct
repair of O6-methylguanine is carried out by O6-methylguanine DNA methyltransferase (an alkyl transferase),
which catalyzes the transfer of the methyl group of O6-methylguanine to a specific Cys residue in the same protein.
6
P CH3 P Alkyl P P
transferase
S G C S S G C S
P P P P
Base-excision repair
Base excision repair involves removal of a damaged nucleotide base, excision of a short piece of the polynucleotide
and resynthesis with a DNA polymerase. It is used to repair many minor damage like alkylation and deamination
resulting from exposure to mutagenic agents. Enzyme DNA glycosylase initiates the repair process. A DNA
671
This page intentionally left blank.
Genetics
Introns in tRNA genes are unrelated and there is no consensus sequence that could be recognized by the splicing
enzymes. Thus splicing of tRNA depends principally on recognition of a common secondary structure in tRNA. All
the introns include a sequence that is complementary to the anticodon of the tRNA. The exact sequence and size of
the intron is not important.
2—
5'-OH 5'-PO4
Intron
2—
2'-3' PO4 3'-OH
The average half life of bacterial mRNAs is only about 1.5 minutes. Degradation of mRNA occurs in the 3’–5’
direction and is mediated by several endo- and exonucleases. No enzyme capable of RNA degradation in 5’–3’
direction has yet been reported in bacteria. Endonucleases (RNase E and RNase III), make internal cut in RNA
molecules whereas exonucleases (RNase II and polynucleotide phosphorylase, PNPase), remove nucleotides
sequentially from the 3’ end of an mRNA. In E. coli, RNase E and PNPase along with RNA helicase are located within
a multiprotein complex called the degradosome.
Eukaryotic mRNA
The average half life of eukaryotic mRNA is much longer than bacterial counterpart, on average 10–20 minutes in
lower eukaryotes like yeast to several hours in mammals. For example, mRNA incoding β-globin have half lives of
more than 10 hours. In eukaryotes, there are several degradation pathways of mRNA which occur in both 3’–5’ and
5’–3’ directions. Cytosolic mRNAs are degraded by three different pathways– deadenylation dependent, deadenylation
independent and endonucleolytic pathway. Most eukaryotic mRNA degradation is deadenylation dependent. In this
degradation process, removal of poly(A) tail occurs first. Removal of tail is catalyzed by deadenylase enzyme. The
deadenylated mRNA then may either (1) be decapped and degraded by a 5’–3’ exonuclease or (2) be degraded by a
3’–5’ exonuclease.
In the major 5’–3’ degradation pathway, deadenylation at the 3’ end triggers decapping at the 5’ end. This shows
that each end of the mRNA influences events that occur at the other end. Decapping reaction occurs by cleavage of
1–2 bases from the 5’ end. Removal of the cap triggers the 5’–3’ degradation pathway in which the mRNA is
degraded rapidly from the 5’ end, by the 5’–3’ exonuclease.
In the second degradation pathway, deadenylated mRNAs are degraded by the 3’–5’ exonuclease activity. Exosome,
which is related to degradosome, degrades the mRNA in the 3’ to 5’ direction. The exosome is also found in the
nucleus, where it degrades unspliced precursors to mRNA. In deadenylation independent pathway, mRNAs are
decapped and degraded by the 5’–3’ exonuclease. Some mRNAs are degraded by an endonucleolytic pathway that
does not involve decapping or deadenylation.
710
Genetics
mRNA surveillance
mRNA surveillance is a conserved mRNA degradation mechanism utilized by organisms to ensure fidelity and
quality of mRNA molecules. There are a number of surveillance mechanisms present within cells. Two most important
surveillance mechanisms are the nonsense mediated mRNA decay and the nonstop mediated mRNA decay.
Nonsense mediated decay (NMD) is involved in detection and decay of mRNA transcripts which contain premature
termination codons. This process plays an important role in checking that mRNAs have been properly synthesized
and functions. A critical issue is how normal and aberrant mRNAs are distinguished and how that distinction leads to
differences in mRNA stability. NMD is a translation coupled mechanism that eliminates mRNAs containing premature
translation-termination codons. In eukaryotic cells, NMD requires both active mRNA translation and NMD-specific
trans-acting factors. In yeasts, three well-investigated trans-acting factors in NMD are the proteins encoded by the
UPF1, UPF2 and UPF3 genes. These genes are evolutionarily conserved, and their deletion prevents NMD. UPF1 is a
cytosolic protein that has a Cys-His-rich region at its N-terminus. It is a helicase that has RNA-dependent ATPase
and ATP-dependent 5’ to 3’ helicase activities. It interacts with translation release factors eRF1 and eRF3, providing
a direct link between the translation termination complex and the NMD machinery. In the cytoplasm, ribosomes
associate and translate the mRNA, but are stalled on encountering a premature termination codon. This results in
binding of factors such as UPF1, eRF1 and eRF3 to the ribosome. Subsequent steps that are still being elucidated
lead to mRNA decay.
A unique aspect of mammalian NMD is the involvement of the EJC (Exon Junction Complex), a complex of proteins
deposited at exon-exon junctions during mRNA splicing. In mammals, a premature termination codon is recognized
by its position relative to the last exon-exon junction. As a general rule, mammalian transcripts that contain a stop
codon more than ~50 nucleotides upstream of the last exon-exon junction will be subjected to NMD.
Nonstop mediated decay is involved in the detection and decay of mRNA transcripts which lack in-frame stop
codon. It is hypothesized that these transcripts are identified during translation when the ribosome arrives at the
3’-end of the mRNA and stalls. Presumably the ribosome stalling recruits additional cofactors and the exosome
complex. The exosome degrades the transcript.
711
This page intentionally left blank.
Genetics
Long dsRNA
Dicer
Guide strand
siRNA duplex
Passenger strand
RISC
loading complex
pre-RISC
Target cleavage
Figure 6.164 dsRNA precursors are processed by Dicer to generate siRNA duplexes containing guide and
passenger strands. RISC-loading complex loads the duplex into RISC. The passenger strand is later destroyed
and the guide strand directs RISC to the target RNA.
731
This page intentionally left blank.
Genetics
Noncoding RNA
There are large numbers of functional RNAs that are transcribed but did not encode proteins. These functional
RNAs are called noncoding RNAs (ncRNAs). Noncoding RNAs perform a variety of biological functions. They
regulate gene expression at the levels of transcription, RNA processing and translation. They protect genomes
from foreign nucleic acids. They can guide DNA synthesis or genome rearrangement. Most noncoding RNAs
operate as RNA-protein complexes, including ribosomes, snRNPs, snoRNPs, telomerase, miRNAs and lncRNAs.
Group I and II introns: Catalytic RNAs (ribozymes), catalyze RNA splicing.
RNase P RNAs: Ribozymes, catalyze removal of 5’ leader sequence from pre-tRNAs.
Hammerhead and hepatitis delta virus: Ribozymes, induce RNA cleavage to form 2’,3’-cyclic phosphate and
5’-OH termini; also catalyze the reverse reaction and RNA ligation.
gRNA (guide RNA): Base pairs with an RNA target, orienting bound proteins to carry out a site-specific cleavage,
ligation or modification reaction.
Xist (X-inactive-specific transcript RNA): Coats one X-chromosome in mammalian female, triggering hetero-
chromatization and transcriptional repression.
Telomerase RNA: Provides template for telomeric DNA synthesis and scaffolds protein assembly.
snoRNA (small nucleolar RNA): Essential for pre-rRNA processing or modification by serving as a guide RNA to
direct methylation or pseudouridylation of complementary sequence in rRNA.
siRNA (small interfering RNA): Product of dicer cleavage of dsRNA; when complexed with an AGO protein,
induces cleavage of a perfectly-complementary target RNA.
scaRNA (small Cajal body-associated RNA): Function similar to snoRNAs, but located in the Cajal body to guide
modification of snRNAs.
Riboswitch: RNA element within an mRNA that switch between two conformations upon exposure to a small-
molecule ligand or other stimulus and inhibits or promotes gene expression at the level of transcription, translation,
or RNA splicing.
piRNA (PIWI-associated RNA): RNA that directs the modification of chromatin to repress transcription; best
characterized in the male germline.
lncRNA (long noncoding RNA): Autonomously transcribed RNA that does not encode a protein; often capped and
polyadenylated; can be nuclear, cytoplasmic or both.
6.22 Epigenetics
Although all cells in an organism contain essentially the same DNA, cell types and functions differ because of
qualitative and quantitative differences in their gene expression. Epigenetics refers to both heritable and non-heri-
table changes in gene expression that are not caused by changes in DNA sequence. The epigenetic processes that
stably alter gene expression patterns are thought to include:
1. cytosine methylation,
2. posttranslational modification of histone proteins and remodelling of chromatin and
3. RNA-based mechanisms.
734
Genetics
Methylation of the 5’-position of cytosine residues is a reversible covalent modification of DNA, resulting in production
of 5-methyl-cytosine. In general, DNA methylation is associated with gene repression. As DNA methylation
patterns can be maintained following DNA replication and mitosis, this epigenetic modification is also associated
with inheritance of the repressed state.
Posttranslational modification of histone proteins on transcription is complex and constantly expanding. Three
general principles are thought to be involved:
1. It directly affects the structure of chromatin, regulating its higher order conformation and thus acting in cis to
regulate transcription;
2. It disrupts the binding of proteins that are associated with chromatin (trans effect);
3. It attracts certain effector proteins to the chromatin (trans effect).
RNA-based mechanisms of epigenetic regulation are less well understood than mechanisms based on DNA
methylation and histones. A number of non-coding RNAs (Small non-coding RNAs as well as Long non-coding RNAs)
play important roles in modifying the sequence, structure, or expression of mRNAs and thereby also changes the
protein expression from these genes.
735
Genetics
• The code is nonoverlapping. After translation commences, any single ribonucleotide at a specific location
within the mRNA is part of only one triplet.
• It is usual to describe the genetic code as a universal code, meaning that the same code is used throughout
all life forms. This is not strictly true. There is a few example of context dependent codons also. For example,
selenocysteine is coded by UGA and pyrrolysine by UAG. These codons, therefore, have a dual meaning
because they are mainly used as stop codons. Similarly, some differences in the genetic code have been found,
especially in the mitochondria, chloroplast, some protozoans and others as mentioned in table 6.32. In this
context the code is nearly universal. With only minor exceptions, a single coding dictionary is used by almost all
viruses, prokaryotes, archaea and eukaryotes.
Table 6.32 Some differences between the universal code and mitochondrial genetic codes.
Codon Universal code Unusual code Occurrence
UGA Stop Trp Mycoplasma, Spiroplasma, mitochondria of many species
CUG Leu Thr Mitochondria in yeasts
UAA, UAG Stop Gln Acetabularia, Tetrahymena, Paramecium, etc.
UGA Stop Cys Euplotes
Second position
U C A G
} phe
} } tyr } Cys
UUU UCU UAU UGU U
U UUC UCC UAC UGC C
ser
UUA
UUG } leu UCA
UCG
UAA
UAG } stop UGA
UGG
stop
trp
A
G
} } }
First position (5’-end)
} his
A
AUU
AUC
AUA
} ile
ACU
ACC
ACA } thr
AAU
AAC
AAA
} asn
} lys
AGU
AGC
AGA
} ser
} arg
U
C
A
AUG met ACG AAG AGG G
} } } asp
}
GUU GCU GAU GGU U
GUC GCC
ala GAC GGC C
G val gly
GUA
GUG
GCA
GCG
GAA
GAG
} glu GGA
GGG
A
G
Codon bias
Codon bias is the probability that a given codon will be used to codes for an amino acid over a different codon which
codes for the same amino acid. It refers to the fact that not all codons are used equally in the genes of a particular
organism. For example, of the four valine codons, human genes use GTG four times more frequently than GTA. The
biological reason for codon bias is not understood, but all organisms have a codon bias.
Problem
736
This page intentionally left blank.
Genetics
–1 frameshift
Ribosomes that frameshift produce a translational fusion of the two overlapping ORFs, whereas those that do not
frameshift continue normal in frame decoding and terminate at the end of the first ORF. Each of these products thus
shares a common N terminal region. Many viruses use programmed translational frameshifting to ensure synthesis
of the correct ratios of virus-encoded proteins required for proper viral particle assembly and maturation. The
phenomenon was first described in year 1985 as the way in which the Gag-Pol polyprotein of the retrovirus Rous
Sarcoma Virus (RSV) is expressed from the overlapping gag and pol ORFs. It has been demonstrated that when
ribosomes translate the unspliced genomic RNA of retroviruses, 95% of translation yields Gag proteins while only
about 5% of translation produces Gag-Pol proteins through –1 ribosomal frameshifting.
753
Genetics
Tetracycline : Tetracycline binds to the 30S ribosomal subunit and interferes with aminoacyl-tRNA binding.
Erythromycin : Binds to the 50S ribosomal subunit and inhibits peptide chain elongation.
Fusidic acid : Fusidic acid binds to EF-G and blocks translocation.
Cycloheximide : Cycloheximide blocks the peptidyl transferase of 80S ribosome but not that of 70S bacterial
(and mitochondrial and chloroplast) ribosomes.
Puromycin : Puromycin is a secondary metabolite of Streptomyces alboniger that blocks protein biosynthesis.
Puromycin is a structural analogue of the 3’ end of aminoacyl transfer RNA, but differs from
tRNA insofar as the aminoacyl residue is linked to the ribose via an amide bond rather than an
ester bond. Puromycin, like aminoacyl-tRNA, binds to the A site of the ribosome peptidyl-
transferase center. When the A site is occupied by puromycin, peptidyl-transferase links the
peptide residues of the peptidyl-tRNA in the ribosomal P site covalently to puromycin. Since
the amide bond cannot be cleaved by the ribosome, no further peptidyl transfer takes place,
and the peptidyl-puromycin complex falls off the ribosome.
H3C CH3
NH2 N
N N
N N
tRNA
—
O P O C N HO C N
O N O N
O
O OH HN OH
C O C O
H2N C H H2N C H
CH2 CH2
O
tRNA-phenylalanine
CH3
Puromycin
Diphtheria toxin : Diphtheria toxin, an exotoxin of Corynebacterium diphtheriae infected with a specific temperate
phage (Corynephage β), stops the protein synthesis in eukaryotes by inactivating the elongation
factor eEF2. Inactivation of elongation factor eEF2 occurs due to ADP-ribosylation, which is
catalyzed by A fragment of toxin.
Ricin : A toxic protein of the castor bean (Ricinus communis) that inactivates the 60S subunit of
eukaryotic ribosomes by depurinating a specific adenosine in 28S rRNA.
Primary translation products often undergo a variety of modification reactions, involving the addition of chemical
groups, which are attached covalently to the polypeptide. This can involve simple chemical modification like
hydroxylation and phosphorylation of the side chains of single amino acids or the addition of different types of
carbohydrate or lipid group.
754
This page intentionally left blank.
Genetics
6.25 Mutation
Genome is not a static entity. It is dynamic in nature. It is subject to different types of heritable genetic changes.
A heritable genetic change in the genetic material of an organism that gives rise to alternate forms of any gene is
called mutation. The process by which mutations is produced is called mutagenesis. An organism exhibiting a
novel phenotype as a result of the presence of a mutation is referred to as a mutant. In a broad sense, the term
mutations include all types of heritable genetic changes of an organism not explainable by recombination of preexisting
genetic variability. Mutation may include change in chromosome number, chromosomal aberrations and changes in
chemistry of genes. But here we have described ‘mutation’ in terms of change in chemistry of gene which is known
as gene mutation.
Role of mutation
• Ultimate source of all genetic variation and it provides the raw material for evolution.
• Mutation results into the formation of alleles. Without mutation, all genes would exist in only one form.
• Organisms would able to evolve and adapt to environmental change.
757
This page intentionally left blank.
Genetics
In principle, mutation of a gene might cause a phenotypic change in either of two ways:
• Loss of function (null) mutation : the product may have reduced or no function.
• Gain of function mutation : the product may have increased or new function.
Because mutation events introduce random genetic changes, most of the time they result in loss of function.
Generally, loss of function mutations are found to be recessive. In a wild type diploid cell, there are two wild type
alleles of a gene, both making normal gene product. In heterozygotes, the single wild type allele may be able to
provide enough normal gene product to produce a wild type phenotype. In such cases, loss of function mutations
are recessive. However, some loss of function mutations are dominant. In such cases, the single wild type allele in
the heterozygote cannot provide the enough amount of gene product needed for the cells to be wild type. Gain of
function mutations usually cause dominant phenotypes, because the presence of a normal allele does not prevent
the mutant allele from behaving abnormally.
Normal receptor
Lysis
T1
Mutant type
Mutant receptor
T1 cannot bind
Figure 6.193 When bacteriophage T1 infects wild-type E. coli, it binds to a receptor in the outer membrane,
protein TonB. After phage replication, the E. coli cell is lysed and new phages are released. A mutation in the
tonB gene results in an altered receptor to which T1 can no longer bind and so the cells survive.
769
Genetics
Their test is known as the fluctuation test because it measures the degree of fluctuation in the number of mutants
found in replicate cultures. They proved that mutations occur before selection. The fluctuation test is also useful in
determining mutation rates during nonselective growth.
Replica plating
After incubation
Figure 6.194 Replica plating. For the detection of mutants, cells are transferred on to successive plates
containing either a selective medium or a non-selective medium. Colonies form on the non-selective plate
in the same pattern as on the master plate. Only mutant cells can grow on the selective plate; the mutant
colonies that are formed derive from colonies on the master plate that are mutant.
In this way, the velvet picked up a colony ‘imprint’ from the whole plate. On touching the velvet to replica plates
containing selective medium (that is, containing T1 phages), cells clinging to the velvet are inoculated onto the
replica plates in the same relative positions as those of the colonies on the original master plate. As expected, rare
Tomr mutant colonies were found on the replica plates, but the multiple replica plates showed identical patterns of
resistant colonies. If the mutations had occurred after exposure to the selective agents, the patterns for each plate
would have been as random as the mutations themselves. The mutation events must have occurred before exposure
to the selective agent.
Replica plating has become an important technique of microbial genetics. It is useful in screening for mutants that
fail to grow under the selective regime. The position of an absent colony on the replica plate is used to retrieve the
mutant from the master. For example, replica plating can be used to screen auxotrophic mutants in precisely this
way. In general, replica plating is a way of retaining an original set of strains on a master plate while simultaneously
subjecting replicas to various kinds of tests on different media or under different environmental conditions.
770
Genetics
Problem
In the Ames test, auxotrophic strains of Salmonella that are unable to produce histidine are mixed with a rat liver
extract and a suspected mutagen. The cells are then plated on a medium without histidine. The plates are incubated
to allow any revertant bacteria (those able to produce histidine) to grow. The number of colonies is a measure of
the mutagenicity of the suspected mutagen. Why is the rat liver extract included?
Solution
Most mutagens cannot act unless they are converted to electrophile by liver enzymes called mixed-function oxidase,
which include the cytochromes P-450s. The rat liver extract in the Ames test contains enzymes for converting
suspected mutagens to compounds that would be physiologically relevant mutation-causing agents in a mammal.
771
This page intentionally left blank.
Genetics
5. Large population size: The population is sufficiently large so that the frequencies of alleles do not change from
generation to generation because of chance. In small populations, significant random fluctuations in allele
frequencies are possible due to sampling error. The random change in allele frequencies simply as a result of
chance from one generation to next in a finite population is called genetic drift. Drift ultimately leads to the
fixation of one allele at a locus and the loss of all other alleles. In diploid organisms, the rate at which genetic
variability is lost by random genetic drift is 1/2N, where N is the population size.
Let’s take one example of a population of diploid organisms having a gene with two alleles A and a, with
respective frequencies p and q. Let’s assume that neither allele has any effects on fitness; that is, A and a are
selectively neutral. Furthermore let’s assume that the population mates randomly and that in any given generation,
the genotypes are present in Hardy-Weinberg proportions. In a very large population – essentially infinite in
size – the frequencies of A and a will be constant and the frequency of the heterozygotes that carry these two
alleles will be 2pq. In a small population of finite size N, the allele frequencies will change randomly as a result
of genetic drift. Because of these changes, the frequency of heterozygotes will also change. To express the magnitude
of this change over one generation, let’s consider the current frequency of heterozygotes as H and the frequency
of heterozygotes in the next generation as H’. Then the mathematical relationship between H and H’ is
§ 1 ·
H' ¨1 ¸ H
© 2N¹
This equation tells us that in one generation, random genetic drift causes the heterozygosity to decline by a
factor of 1/2N. Over many generations, the heterozygosity will eventually be reduced to 0, at which point all
genetic variability in the population will be lost. At this point the population will possess only one allele of the
gene, and either p = 1 and q = 0, or p = 0 and q = 1. Thus, through random changes in allele frequencies, drift
steadily erodes the genetic variability of a population, ultimately leading to the fixation and loss of alleles.
If the frequency of a homozygous dominant genotype in a randomly mating population is 0.09, what is the frequency
of the dominant allele? What is the combined frequency of all the other alleles of this gene?
Solution
p2 = 0.09, and so p = (0.09)1/2 = 0.30. All other alleles have a combined frequency of 1 – 0.30 = 0.70.
A particular recessive disorder is present in one in ten thousand individuals. If the population is in Hardy-Weinberg
equilibrium, what are the frequencies of the two alleles?
Solution
If the population is in equilibrium, there should be p2 of AA + 2pq of Aa + q2 of aa individuals.
Since, 1/10,000 shows the recessive trait, this is q2.
1
Therefore, q 0.0001 0.01.
10,000
6.27.3 Inbreeding
Inbreeding is a mating between individuals that are closely related through common ancestry. The extent of
inbreeding occurring in a population is measured by inbreeding coefficient. The inbreeding coefficient (expressed
as F) is the probability that two alleles of a given gene in an individual are identical by descent. Such a genotype
would be homozygous and considered autozygous since the alleles were inherited from a common ancestor
786
Genetics
(homozygosity by descent). Hence, inbreeding coefficient is also defined as the probability of autozygosity. When
two alleles are not identical by descent, we call the genotype allozygous (allo- means other). Note that allozygous
can be either homozygous or heterozygous.
Figure 6.202 A diagram showing how both allozygous and autozygous individuals can be generated within
the same family. Two unrelated heterozygotes in first generation produced four offspring, all with different
genotypes (second generation). Inbreeding occurs among the siblings of second generation resulting in
two allozygous and one autozygous individual in third generation. The allozygous individuals include both
a heterozygote and a homozygote.
787
This page intentionally left blank.
Genetics
Pedigree 3
Path = DBACE
Number of individual per path (n) = 5
n
§1·
FI ¨ ¸ (1 FA )
©2¹
Wahlund effect
So far we have applied population genetics within a single, uniform population. In practice, a species may consist
of a number of separate populations, each more or less isolated from the others. For example, the members of a
species might inhabit a number of islands, with each island population being separated by the sea from the others.
Individuals might migrate between islands from time to time, but each island population would evolve to some
extent independently. A species with a number of more or less independent subpopulations is said to have population
subdivision. The effect of population subdivision on genotypic frequencies was first investigated by S. Wahlund in
1928. In the large, fused population there are fewer homozygotes than in the average for the set of subdivided
populations. The increased frequency of homozygotes in subdivided populations is called the Wahlund effect. A
subdivided population contains fewer heterozygotes than predicted despite the fact that all subdivisions are in
Hardy-Weinberg equilibrium.
791
This page intentionally left blank.
Chapter 07
Recombinant DNA technology
Recombinant DNA technology is the set of techniques that enable the DNA from different sources to be identified,
isolated and recombined so that new characteristics can be introduced into an organism. The invention of recombinant
DNA technology—the way in which genetic material from one organism is artificially introduced into the genome of
another organism and then replicated and expressed by that other organism—was largely the work of Paul Berg,
Herbert W. Boyer, and Stanley N. Cohen, although many other scientists made important contributions to the new
technology as well. Paul Berg developed the first recombinant DNA molecules that combined DNA from SV40 virus
and lambda phage. Later, Herbert Boyer and Stanley Cohen develop recombinant DNA technology, showing that
genetically engineered DNA molecules may be cloned in foreign cells.
One important aspect in recombinant DNA technology is DNA cloning. It is a set of techniques that are used to
assemble recombinant DNA molecules and to direct their replication within host organisms. The use of the word
cloning refers to the fact that the method involves the replication of a single DNA molecule starting from a single
living cell to generate a large population of cells containing identical DNA molecules.
797
This page intentionally left blank.
Recombinant DNA technology
The polymerase chain reaction (PCR) is a newer form of DNA cloning which is enzyme mediated and is conducted
entirely in vitro. PCR (developed in 1983 by Kary Mullis) is a revolutionary technique used for selective amplification
of specific target sequence of nucleic acid by using short primers. It is a rapid, inexpensive and simple method of
copying specific DNA sequence.
DNA polymerase I (Kornberg enzyme) has both the 3’-5’ and 5’-3’ exonuclease activities and 5’-3’ polymerase
activity.
Reverse transcriptase, also known as RNA-directed DNA polymerase, synthesizes DNA from RNA.
Reverse transcriptase was discovered by Howard Temin at the University of Wisconsin, and independently by David
Baltimore at about the same time. The two shared the 1975 Nobel Prize in Physiology or Medicine.
Taq DNA polymerase is a DNA polymerase derived from a thermostable bacterium, Thermus aquaticus. It operates
at 72°C and is reasonably stable above 90°C and used in PCR. It has a 5’ to 3’ polymerase activity and a 5’ to 3’
exonuclease activity, but it lacks a 3’ to 5’ exonuclease (proofreading) activity.
7.2.2 Nucleases
Nucleases are enzymes that degrade nucleic acids by breaking the phosphodiester bonds that link one nucleotide
to the next. Ribonucleases (RNases) attack RNA and deoxyribonucleases (DNases) attack DNA. Some nucleases
will only attack single stranded nucleic acids, others will only attack double-stranded nucleic acids and a few will
attack either kind. Nuclease are of two different kinds – exonucleases and endonucleases. Exonucleases remove
nucleotides one at a time from the end of a nucleic acid whereas endonucleases are able to break internal
phosphodiester bonds within a nucleic acid. Any particular exonuclease attacks either the 3’-end or the 5’-end but
not both.
S1 nuclease
The S1 nuclease is an endonuclease purified from Aspergillus oryzae. This enzyme degrades RNA or single stranded
DNA, but does not degrade dsDNA or RNA-DNA hybrids in native conformation. Thus, its activity is similar to mung
bean nuclease, however, the enzyme will also cleave a strand opposite a nick on the complementary strand.
RNase A
RNase A is an endonuclease, which digests ssRNA at the 3’ end of pyrimidine residues.
RNase H
It is an endonuclease which digests the RNA strand of an RNA-DNA heteroduplex. The enzyme does not digest ss or
dsDNA.
799
Recombinant DNA technology
Restriction endonuclease
A restriction endonuclease (or restriction enzyme) is a bacterial enzyme that cuts dsDNA into fragments after
recognizing specific nucleotide sequences known as recognition or restriction site. The term restriction comes from
the fact that these enzymes restrict the entry of foreign DNA in the bacteria. Restriction enzymes, therefore, are
believed to be evolved by bacteria to resist viral attack.
The existence of restriction enzymes was first postulated by W. Arber. He noticed that when the DNA of a bacteriophage
entered a host bacterium it was cut up into smaller pieces and, for this, he theorized the presence of restriction
enzyme. In 1970, Hamilton Smith and his co-workers first isolated a restriction enzyme from the bacterium
Haemophilus influenzae strain Rd. The enzyme, called HindII, recognizes a six base-pair dsDNA sequence. After
discovery of HindII restriction enzyme, EcoRI, was isolated and characterized from Escherichia coli strain RY13.
Nomenclature
The name of any restriction endonuclease consists of three parts:
1. An abbreviation of the genus and species of the organism to three letters, e.g. Eco for Escherichia coli
identified by the first letter of the genus and the first two letters of the species.
2. A letter, number or combination of the two to indicate the strain of the relevant species.
3. A Roman numeral to indicate the order in which different restriction modification systems were found in the
same organism or strain.
For example, the name of the EcoRI restriction enzyme was derived as:
E Escherichia (genus)
co coli (species)
R RY13 (strain)
I First identified (order of identification in the bacterium)
Restriction sites
Rather than cutting DNA indiscriminately, a restriction enzyme cuts only double-helical segments that contain a
particular nucleotide sequence of four to eight base pairs in length, known as a restriction or recognition site. These
are generally palindromic sequences. The position at which the restriction enzyme cuts is usually shown by the
symbol ‘/’. Restriction enzymes make either blunt or staggered cuts. Thus, restriction fragments may have:
• Blunt ends (the cleavage points occur exactly on the axis of symmetry).
• Overhanging ends (the cleavage points do not fall on the symmetry axis, so that the resulting restriction
fragments possess sticky ends or cohesive ends).
5’ G A T A T C 3’ EcoRV 5’ G A T A T C 3’
3’ C T A T A G 5’ 3’ C T A T A G 5’
5’ G A A T T C 3’ EcoRI 5’ G 5’ A A T T C 3’
3’ C T T A A G 5’ 3’ C T T A A 5’ G 5’
After the staggered cuts, the resulting restriction fragments possess so-called 5’ overhangs or 3’ overhangs. For
example, the recognition site for EcoRI enzyme is 5’-GAATTC-3’. Once the staggered cuts have been made, the
resulting fragments have 5’ overhangs or staggered ends. Similarly, restriction enzyme PstI create staggered cuts
in the recognition site (5’-CTGCAG-3’) that results in 3’ overhangs or staggered ends.
800
This page intentionally left blank.
Recombinant DNA technology
P element, a transposon, is used as a vector in Drosophila. The P element is 2.9 kb in length and contains three
genes flanked by short inverted repeat sequences at either end of the element. The genes code for transposase,
the enzyme that carries out the transposition process. The inverted repeats form the recognition sequences that
enable the enzyme to identify the two ends of the inserted transposon. The vector is a plasmid that carries two
P elements, one of which contains the insertion site for the DNA that will be cloned. Insertion of the new DNA into
this P element results in disruption of its transposase gene. But the second P element carried by the plasmid has an
intact version of the transposase gene that provides transposase enzyme to carry out transposition.
Vectors for insects based on viral DNA are not common. However, dsDNA of baculoviruses is used as cloning
vector for many insects. Baculoviruses have rod-shaped capsids and large, dsDNA genomes.
For mammal
The genome of many viruses are used as cloning vectors for mammals. The first vector used for mammalian cell
was based on SV40 virus genome. SV40 is a small virus that infects monkey (simian). Now genome of many
viruses such as adenoviruses and papillomaviruses which have a relatively high insert capacity are used as vectors
for cloning/expression of genes in mammalian cells. At present, retroviruses are the most commonly used vectors.
Chemical transformation method: Bacteria which are able to uptake DNA are called ‘competent’ and are made
so by chemical treatment. Competency is a physiologic state, which changes the structure and permeability of the
cell membrane so the naked DNA can enter the cell. The chemical transformation method utilizing CaCl2 and heat
shock to promote DNA entry into cells. The chemical method uses bacteria that are incubated with DNA on ice cold
salt solution containing CaCl2 followed by a brief heat shock at 42°C. Exactly how this treatment works is not
understood. Possibly CaCl2 causes the DNA to precipitate onto the surface of the cells, or perhaps the salt is
responsible for some kind of change in the cell wall that improves DNA binding.
Electroporation: Competency can also be achieved through the use of electrical pulses called electroporation. It
uses a short pulse of electric charge to facilitate DNA uptake. Electroporation induces formation of microscopic
pores within a biological membrane. These pores, called electropores, allow molecules, ions and water to pass from
one side of the membrane to the other.
A. Vector-mediated methods
The vector-mediated methods (or indirect gene transfer methods) exploit the natural ability of certain bacteria
(Agrobacterium species) and viruses to naturally transfer DNA to the genomes of infected plant cells.
Agrobacterium-mediated transformation
Members of the genus Agrobacterium are also known as natural genetic engineers of plants since these bacteria
have ability to transfer T-DNA of their plasmid (Ti and Ri) into plant genome upon infection of cells at the wound site.
In the natural environment, Agrobacterium introduces its T-DNA into compatible host plant cells and via highly
evolved molecular mechanisms stably integrates the new DNA into the plant genome.
819
Recombinant DNA technology
The foreign gene is cloned in the T-DNA region of Ti- or Ri-plasmid by replacing unwanted sequences. Agrobacterium
transfers T-DNA, which makes up a small (~5%–10%) region of the Ti- or Ri-plasmid. Transfer requires three
major elements:
1. The right and left border sequences that flank the T-DNA (imperfect, direct repeats of 25 base pairs and the
only essential cis-elements for T-DNA transfer),
2. vir genes located on the Ti and Ri-plasmid and
3. Some chromosomal genes (chromosomal virulence and other genes) located on the bacterial chromosomes.
These chromosomal genes generally are involved in bacterial exopolysaccharide synthesis, maturation and
secretion.
The first step in the process of gene transfer to plant cell involves the formation of the recombinant plasmid. For the
recombinant formation, T-DNA needs to be disarmed. T-DNA contains phytohormone synthesis genes, whose
expression causes infected plants to suffer from unregulated growth. Thus, wild-type Ti plasmids are not suitable as
general vectors. Hence, we must use vectors in which the T-DNA has been disarmed. To do this, the genes encoding
the proteins for the production of phytohormones are simply removed from the T-DNA fragment. New DNA can,
then, be inserted between the left and right border repeats.
Earlier, to introduce gene of interest into T-DNA for subsequent transfer to plants was very cumbersome process.
This was because Ti and Ri-plasmids are very large, low copy number plasmid, difficult to isolate and manipulate in
vitro, and do not replicate in Escherichia coli. In large DNA molecules, there is a problem of unique restriction site
also. However, this problem has been resolved by using two novel strategies:
Binary vector strategy: The T-DNA does not need to be physically associated with the vir genes in order to
become integrated into the plant genome. T-DNA regions of Ti-plasmids could be split onto two separate replicons.
As long as both of these replicons are located within the same Agrobacterium cell, proteins encoded by vir genes
could act upon T-DNA in trans to mediate its processing and export to the plant. Systems in which T-DNA and vir
genes are located on separate replicons were eventually termed T-DNA binary systems. Thus, in binary vector
strategy, two plasmids are used and both complement each other in the same bacterial cell. The T-DNA carried by
one plasmid is transferred to the plant chromosomal DNA by proteins coded by vir genes carried by other plasmid.
ori
RB LB
Marker Gene of
interest
ori
Figure 7.13 Schematic diagram of T-DNA binary vector systems. Genes of interest are maintained within
the T-DNA region of a binary vector. Vir proteins encoded by genes on a separate replicon (vir helper) mediate
T-DNA processing from the binary vector and T-DNA transfer from the bacterium to the host cell.
Co-integration vector strategy: Although disarmed wild-type Ti plasmids can be used as vector, they are not
easy to manipulate, because their large size makes them difficult to manipulate in vitro and there are no unique
restriction sites in the T-DNA. This problem can be overcomed by the construction of co-integrative vectors. In this
strategy, the gene of interest to be introduced into the Ti plasmid vector is first sub-cloned in a conventional E. coli
plasmid vector (such as pBR322) for easy manipulation, producing a so-called intermediate vector. The insertion of
gene of interest into a Ti plasmid results from the recombination of intermediate vector and a Ti plasmid. The
820
This page intentionally left blank.
Recombinant DNA technology
these pores. Plant cell electroporation generally utilizes the protoplast because thick plant cell walls restrict
macromolecule movement. Electrical pulses are applied to a suspension of protoplasts with DNA placed between
electrodes in an electroporation cuvette. Short high-voltage electrical pulses induce the formation of transient
micropores in cell membranes allowing DNA to enter the cell and then the nucleus.
Microinjection: Extensively used with the animal cell, microinjection of DNA into plant cells has achieved only a
limited success. This is largely because of difficulties in getting the protoplasts immobilized and injecting DNA into
the protoplast without damaging the tonoplast, which surrounds the plant cell vacuole.
Particle bombardment: Particle bombardment or microprojectile bombardment or biolistic transformation employs
foreign DNA coated high velocity gold or tungsten particles (0.2–0.4 μm) to deliver DNA into plant cells. Different
approaches are being used to accelerate the particles. Particle gun accelerated particles penetrates even deep into
the tissues. This method is being widely used because of its ability to deliver foreign DNA into regenerable cells,
tissues or organs irrespective of the monocots or dicots. Because of the physical nature of the process, there is no
biological limitation to the actual DNA delivery, thus it is genotype independent.
Chloroplast transformation
Genetic material in plants is distributed into nucleus and the chloroplast and mitochondria in the cytoplasm. Each of
these three compartments carries its own genome and expresses heritable traits. The chloroplast present in
photosynthetic eukaryotes. There are up to 300 chloroplasts in one plant cell. Chloroplast genomes are usually
circular dsDNA and usually vary in length from 120-190 kb. In most species, chloroplasts are usually maternally
inherited in most (~80%) angiosperm plant species. It is also not influenced by polyploidy, gene duplication and
recombination that are widespread features of the nuclear genomes of plants. Therefore, chloroplast DNA varies
little among angiosperms in terms of size, structure and gene content.
Chloroplasts transformation can involve delivery of DNA into chloroplasts. For chloroplast transformation, DNA
has to be delivered through the cell wall and through at least three membranes (the plasma membrane and two
chloroplast membranes). Efficient chloroplast transformation has been achieved both through particle bombardment
and polyethylene glycol (PEG)-mediated transformation. PEG-mediated transformation of plastids requires
enzymatically removing the cell wall to obtain protoplasts, then exposing the protoplasts to purified DNA in the
presence of PEG. The protoplasts first shrink in the presence of PEG, then lyse due to disintegration of the cell
membrane. Removing PEG before the membrane is irreversibly damaged reverses the process. Biolistic delivery
is the routine system for most laboratories. The flowering plants contain a variety of plastids (including chloroplast,
leucoplasts or chromoplasts), thus the term plastid transformation is more accurate than chloroplast transformation.
Plants with transformed plastid genomes are termed transplastomic .
The major difficulty in chloroplast transformation for production of transplastomic plants is in generating homoplasmic
plants in which all the chloroplasts are uniformly transformed. This is due to the presence of about 10-100 chloroplasts
in one cell, each of which has up to 100 copies of the chloroplast genome, that does not allow achieving homoplastomic
state. Apart from this, getting high level of protein expression, even though the gene copy number is high, is another
problem.
822
Recombinant DNA technology
The second method is to package the DNA inside an animal virus, since viruses have evolved mechanisms to
naturally infect cells and introduce their own nucleic acid. The transfer of foreign DNA into a cell by this route is
termed transduction.
Transfection
The direct transfer of DNA into animal cells can be accomplished by a number of techniques that either force the
cells to take in DNA by breaching the cell membrane or exploit the natural ability of cells to internalize certain
molecules in their environment. The term transfection was originally coined to describe the introduction of phage
DNA into bacterial cells. In the same way, forcing animal cells to take up DNA from the surrounding medium using
a variety of chemical and physical methods is also termed transfection, and can be a highly efficient way to
introduce DNA either transiently or stably into cultured cells or cells in vivo. When the term transformation is
applied to animal cells, it usually refers to a stable change of genotype brought about, either by incorporation of the
transfected DNA into the genome, or its long-term episomal maintenance. However, the same term is also used to
indicate oncogenic transformation, that is, the change in phenotype resulting from the activation of an oncogene.
The fate of DNA introduced into the cells depends on the vector system being used. In one fate, the DNA introduced
into the cells replicate and express without integration i.e. maintained in the nucleus in an extrachromosomal state
(episomally). This is known as transient transfection. In second fate, DNA may integrate into a random chromosomal
site of the host genome and replicate as a normal part of the genome. If the introduced DNA integrates into the host
genome and maintained permanently in the cell, this is called stable transfection. If the transfected exogenous DNA
is non-replicative, stable transfection must occur by integration of the DNA into the genome.
823
This page intentionally left blank.
Recombinant DNA technology
Promoter
Regulatory
gene Ribosome binding site
Start
codon
Coding sequence
Origin
Stop
codon
C-terminal tag
Selectable
marker Transcription
terminator
Figure 7.18 The basic architecture of an E. coli expression vector. It contains the features: origin of replication,
promoter, regulatory gene (repressor), selectable marker and transcription terminator. Ribosome binding
site (Shine-Dalgarno sequence), multiple cloning site and N- or C-terminal tags. N- or C-terminal tags offer
several potential advantages such as improved expression and solubility, improved detection and purification.
829
This page intentionally left blank.
Recombinant DNA technology
The genetic map gives the relative position of genetic markers according to the frequency of recombination,
expressed in term of centimorgans (cM). Genetic maps illustrate the order of genetic markers on a chromosome
and the relative distances between those markers.
The cytological map depicts the locations of genetic markers in a chromosome relative to visible landmarks. In
most cases, each chromosome has a characteristic banding pattern, which may be either naturally present, (e.g. in
polytene chromosomes of Drosophila) or more commonly generated by specific staining protocols (e.g. in case of
human chromosomes); the genetic markers are mapped cytologically relative to these band locations. Fluorescent in
situ hybridization (FISH) is widely used to map the cytological locations of genes and other DNA sequences within
large eukaryotic chromosomes.
The physical maps describe the absolute distance between two genetic markers in term of base pairs.
Classical markers include morphological markers, cytological markers and biochemical markers.
Morphological markers: Morphological (or visible) markers are usually visually characterized phenotypic traits
or characters such as flower color, seed shape, growth habits or pigmentation. However, morphological markers
are very limited, and many of these markers are not associated with important economic traits (e.g. yield and
quality). These markers are also influenced by environmental factors or the developmental stages.
Cytological markers: Cytological markers are the unique structural features of chromosomes such as bands,
secondary constrictions. These chromosome features are used not only for characterization of normal chromosomes
and detection of chromosomal mutation, but also widely used in mapping and linkage group identification. However,
direct use of cytological markers has been very limited in genetic mapping.
Biochemical markers are gene products that can be detected easily by electrophoresis and specific staining.
Enzyme variants such as isozymes and allozymes are commonly used as biochemical markers. Allozymes are
enzymes encoded by different alleles of a gene but have the same catalytic activity or function. Allozymes can be
separated by electrophoresis and other separating techniques on the basis of differences in molecular size, shape
and electrical charge. Isozymes are different from allozymes. Isozymes are enzymes that perform the same
catalytic function, but are encoded by different nonallelic genes located at different loci. Allozymes reflect the
products of different alleles of a gene rather than different nonallelic genes located at different loci. Biochemical
markers are also called as protein markers. The major disadvantages of biochemical markers are that they are
limited in number.
A DNA marker is defined as a particular segment of DNA that is representative of the differences at the genome
level. DNA marker is also called as molecular marker. Strictly speaking, protein markers and DNA markers are
both molecular markers, but the current uses of the term is limited to DNA markers. DNA markers should not be
considered as normal genes, as they usually do not have any biological effect, and instead can be thought of as
constant landmarks in the genome. They are identifiable DNA sequences, found at specific locations of the genome,
842
Recombinant DNA technology
and transmitted by the standard laws of inheritance from one generation to the next. An ideal DNA marker should
have the following criteria:
1. High level of polymorphism,
2. Even distribution across the whole genome,
3. Provide adequate resolution of genetic differences,
4. Co-dominance in expression (so that heterozygotes can be distinguished from homozygotes),
5. Have linkage to distinct phenotypes,
6. Genome-specific in nature.
P1 P2 F1 P1 P2 F1
AA aa Aa BB bb Bb
(a) (b)
Figure 7.27 Comparison between (a) codominant and (b) dominant markers. Codominant markers can
clearly discriminate between homozygotes and heterozygotes whereas dominant markers do not. Genotypes
at two marker loci (A and B) are indicated below the gel diagrams.
RFLPs
RFLP (Restriction Fragment Length Polymorphisms) is the most widely used hybridization-based molecular marker.
RFLP markers were first used in 1975 to identify DNA sequence polymorphisms for genetic mapping. RFLPs arise
because mutations can create or destroy the sites recognized by specific restriction enzymes, leading to variations
between individuals in the length of restriction fragments produced from identical regions of the genome. Although
two individuals of the same species have almost identical genomes, they will always differ at a few nucleotides due
to point mutation and insertion/deletion. Some of the differences in DNA sequences at the restriction sites can
result in the gain, loss or relocation of a restriction site.
A single base change within a restriction site is a readily detectable genetic marker because the mutated site is no
longer cleaved by the enzyme in question. Two chromosomes that differ by such a mutation are then distinguish-
able on the basis of a restriction fragment length polymorphism (RFLP), which arises because a particular cleavage
site is present in only one of the two DNA molecules. A mutation that gives rise to an RFLP, thus represents a genetic
marker. RFLPs have only two alleles: the site is present or absent. The maximum heterozygosity is 0.5. The RFLP
markers are codominantly inherited and highly reproducible.
843
This page intentionally left blank.
Recombinant DNA technology
Induce G0 phase
Nucleus
Enucleated
oocyte Fusion and
activation
Renucleated
oocyte
In vitro Implant
embryo
culture
Figure 7.37 Cloning sheep by nuclear transfer. The nucleus of an ovum is removed with a pipette. Cells
from the mammary epithelium of an adult are grown in culture, and the G0 state is induced by inhibiting cell
growth. A G0 cell and an enucleated ovum are fused, and the renucleated ovum is grown in culture or in
ligated oviducts until an early embryonic stage before it is implanted into a foster mother, where development
proceeds to term.
857
This page intentionally left blank.
Recombinant DNA technology
Bioprocess engineering is a specialization of chemical engineering that deals with the design and development of
equipment and processes for the manufacturing of products such as food, pharmaceuticals and polymers from
biological materials. It uses the capabilities of organisms in industrial, medical, environmental or agricultural processes
in order to produce useful biological materials.
Bioprocess engineers work at the frontiers of biological and engineering sciences to bring engineering to Life
through the conversion of biological materials into other forms needed by mankind. One of the main tasks of a
bioprocess engineer is to control and maintenance of a biological processes such as the production of beverages,
pharmaceuticals, antibiotics, enzymes, biochemicals, food processing and biological waste treatment. These processes
require a well-designed growth environment to obtain the maximum yield of the product and consequently, these
conditions need to be carefully controlled. Environmental design comprises the determination of the environment of
the process, while fermentation engineering provides the means for meeting those requirements.
A process causes changes in the system or surroundings. In bioprocess, the process can be batch, semi-batch,
fed-batch and continuous processes.
A batch process operates in a closed system. All materials are added to the system at the start of the process, the
system is then closed and products removed only when the process is complete.
887
Bioprocess engineering
A semi-batch process allows either input or output of mass, but not both.
A fed-batch process allows input of material to the system but not output.
A continuous process allows matter to flow in and out of the system. If rates of mass input and output are equal,
continuous processes can be operated indefinitely.
Note: During chemical or biochemical reactions, the following two quantities are conserved:
• Total mass, so that total mass of reactants = total mass of products.
• Number of atoms of each element, so that, for example, the number of C, H and O atoms in the reactants = the
number of C, H and O atoms, respectively, in the products.
Bioprocess engineers do a mass balance to account for what happens to each of the chemicals that is used in a
chemical process. For example, in a plant that is producing sugar, if the total quantity of sugar going into the plant
is not equalled by the total of the purified sugar and the sugar in the waste liquors, then there is something wrong.
Sugar is either being burned (chemically changed) or accumulating in the plant or else it is going unnoticed down
the drain somewhere. In this case, the mass balance is:
Mass balances can be based on total mass, mass of dry solids or mass of particular components, for example
protein. If a mass balance is written using the total mass in each process stream, then it is called total balance.
888
Bioprocess engineering
A separate mass balance can be written for a particular chemical component in the total mass. This is called
component balance. Thus, for a component mass balance the simplest expression is:
This is called the general steady-state mass balance equation. If there is no reaction in the system, then there is
neither generation nor consumption of mass. In such situation at steady state,
In unsteady state, when the mass of the system varies as a function of time we can apply the following flow diagram.
889
This page intentionally left blank.
Bioprocess engineering
unfamiliar with the relative magnitudes of the various forms of energy entering into a particular processing situation,
it is wise to put them all down. Then after some preliminary calculations, the important ones emerge and minor
ones can be lumped together or even ignored without introducing substantial errors.
Let us take the following system in which mass Min enters the system while mass Mout leaves. Both these masses
have energy (enthalpy, kinetic and potential) associated with them. During bioprocessing, high velocity motion and
large changes in height or electromagnetic field do not generally occur. Thus, we can ignore the values of kinetic
and potential energy. Enthalpy (H) is the total heat content of the system. It is defined as sum of the internal energy
plus the product of the pressure and volume.
Energy leaves the system as heat, Q and work, WS is done on the system by the surroundings. The total change or
accumulation of energy in the system,
ΔE = (MH)in – (MH)Out – Q + WS
WS
It is the mathematical expression of the law of conservation of energy. Energy flow represented by Q and WS can
be directed either into or out of the system; appropriate signs must be used to indicate the direction of flow. In the
above figure we have followed the convention that:
• Work is positive when energy flows from the surroundings to the system.
• Work will be considered negative when the system supplies work energy to the surroundings.
• Heat is positive when the temperature of the system is higher than the surroundings.
At steady state, there is no change in the energy of the system; ΔE = 0
The steady state energy balance equation is (MH)in – (MH)out – Q + WS = 0
Enthalpy calculation
The most common energy form is heat energy. In a constant pressure system, with negligible changes in potential
and kinetic energies, the energy balance can be cast in terms of enthalpy changes. A heat balance is calculated on
the basis of the enthalpies of the substances taking part in a process and the heats of the corresponding chemical
reactions. Enthalpy is a measure of the total energy. It is an extensive property. Since internal energy cannot be
measured in absolute terms; thus absolute enthalpy of compound cannot be calculated. Changes in enthalpy are
evaluated relative to reference states. Change in enthalpy can occur as a result of – temperature change, phase
change, mixing and reaction.
Temperature change
The amount of energy released or absorbed by a chemical substance during a change of enthalpy is termed as
sensible heat. The sensible heat of a process may be calculated as the product of the body’s mass with its specific
heat and its change in temperature:
ΔH = M CP ΔT
893
Bioprocess engineering
Phase change
When a substance changes from one phase of matter to another, we say that it has undergone a change of phase.
These changes of phase always occur with a change of heat. Heat either comes into the material during a change
of phase or heat comes out of the material during this change. However, the heat content of the material changes,
the temperature does not. The amount of energy released or absorbed during a phase change is called latent heat.
The latent heat for a different mass of the substance can be calculated using the equation:
Q = ML
where, Q is the amount of energy released or absorbed during the change of phase of the substance
M is the mass of the substance and
L is the specific latent heat per gram for a particular substance; substituted as Lf to represent the specific
latent heat of fusion, Lv as the specific latent heat of vaporization.
When compounds are mixed or dissolved, the bonds between molecules in the solvent and solute are broken and
reformed so a net absorption or release of energy takes place due to which internal energy and enthaply of mixture
change. The enthaply change during mixing of non-ideal two compounds A and B is given by
Lag phase
During the lag phase, there is no increase in cell number. It is a period of adaptation of cells to a new environment.
There is no change in number, but an increase in mass. Thus in this phase, cells are not dormant. The length of the
lag phase is determined in part by the characteristics of the cells and conditions in the media. Multiple lag phases
may sometimes be observed when the medium contains more than one carbon source. This phenomenon is known
as diauxic growth. It occurs due to a shift in metabolic pathways in the middle of a growth cycle. After one carbon
source is exhausted, the cells adapt their metabolic activities to utilize the second carbon source.
894
This page intentionally left blank.
Bioprocess engineering
The residual substrate concentration in the reactor is controlled by the dilution. Any alteration to this dilution rate
results in a change in the growth rate of the cells that will be dependent on substrate availability at the new dilution
rate. Thus, growth is controlled by the availability of a rate-limiting nutrient. This system, where the concentration
of the rate-limiting nutrient entering the system is fixed, is often described as a chemostat as opposed to operation
as a turbidostat, where nutrients in the medium are not limiting. In turbidostat, turbidity of the culture is monitored
and maintained at a constant value by regulating the dilution rate, i.e. cell concentration is held constant.
8.3 Fermentation
Fermentation (derived from the Latin verb fervere, to boil) is a general term for the anaerobic catabolism of organic
compounds such as sugar to obtain energy. In biochemistry, the term ‘fermentation’ has been used in a strict sense
to mean an energy-generation process in which organic compounds act as both electron donors and terminal
electron acceptors. However, industrial microbiologists have extended the term fermentation to describe any anaerobic
as well as aerobic process for the production of the product by the mass culture of a microorganisms. The bioreactors
used for fermentation process can be called fermentors. Although the term bioreactor is often considered to
synonymous with fermentor, not all bioreactors are fermentors. Bioreactors are the apparatus in which biochemical
reactions are performed, involving the organisms or biochemically active substances which are derived from such
organisms. Bioreactors which use living cells are usually called fermentors. Some bioprocess engineers use term
fermentor for vessels used to grow microbial cells and bioreactors for those in which plant and mammalian cells can
be cultured.
902
This page intentionally left blank.
Bioprocess engineering
8.4 Bioreactor
A bioreactor (in biochemical engineering, we also use terms like a biochemical reactor, biological reactor, fermenter
or microbial reactor, which are all synonymous) is a vessel in which biochemical reactions are performed, involving
the organisms or biochemically active substances which are derived from such organisms. Bioreactors are commonly
cylindrical, ranging in size from some liter to cubic meters and are often made of stainless steel. The process in the
bioreactor can either be aerobic or anaerobic. The term bioreactor is often used synonymously with fermenter;
however, in the strict sense, a fermenter is a system in which living cells are used.
Bioreactor design
Bioreactor design is a relatively complex engineering task. The goal of an effective bioreactor is to control, contain
and positively influence the biological reaction. Suitable bioreactor design criteria include:
• Microbiological and biochemical characteristics of the cell systems (microbial, mammalian, plant cell).
• Hydrodynamic characteristics of the bioreactor.
• Mass and heat characteristics of the bioreactor.
• Kinetics of cell growth and product formation.
• Genetic stability characteristics of the cell system.
• Sterilization and maintenance of sterility.
• Agitation (for mixing of cells and medium) and aeration (aerobic fermenters; for O 2 supply).
• Process monitoring and control (regulation of factors like temperature, pH, pressure, aeration, nutrient).
• Implication of bioreactor design on downstream product separation.
• Capital and operating costs of the bioreactor.
• Potential for bioreactor scale-up.
In addition to controlling these, the bioreactor must be designed to both promote formation of the optimal morphology
of the organism and eliminate or reduce contaminations by unwanted organisms or mutation of the organisms.
There are a wide variety of bioreaction systems, and any attempt to categorize them by their various attributes will
naturally result in some overlap of system characteristics.
Mixing is one of the most important operations in bioprocessing. Within a fermenter, there is a need to mix three
different phases:
• Liquid phase, which contains dissolved nutrients and metabolites.
• Gaseous phase, which is predominantly oxygen and CO 2.
• Solid phase, which is made up of the cells and any solid substance that may be present.
Purpose of mixing
• Air bubble dispersion;
• Mass transfer from air bubbles (i.e. oxygen supply) to the liquid and then to the cells;
• Supply of the nutrient components to cells (more precisely, cell agglomerates);
• Prevention of sedimentation;
• Securing of heat transfer;
• Solubility of the nutrient’s components which are less soluble.
904
This page intentionally left blank.
Bioprocess engineering
Chemicals controlling foams have been classified into antifoams, which are added in the medium to prevent foam
formation, and defoamers which are added to knock down foams once these are formed. Natural antifoams include
plant oils (e.g. from sunflower and rapeseed), deodorized fish oil and mineral oils. The synthetic antifoams are
mostly silicon oils, polyalcohols and alkylated glycols. An ideal antifoam should have the following properties:
8.6 Sterilization
If a culture medium or a part of the equipment used for fermentation becomes contaminated by living foreign
microorganisms, the target microorganisms must grow in competition with the contaminating microorganisms.
Thus, not only the medium but also all of the fermentation equipment being used must be sterilized prior to the start
of fermentation, so that they are perfectly free from any living microorganisms and spores. In case of aerobic
fermentations, the air supplied to the fermentor should also be free from contaminating microorganisms. Steriliza-
tion is a term referring to any process that eliminates or kills all forms of life. Sterilization can be achieved by
applying the proper combinations of heat, chemicals, irradiation and filtration.
Thermal sterilization
For all microorganisms, there is a maximum temperature for growth, beyond which viability decreases. At very
high temperature, virtually all macromolecules lose their structure and their ability to function. The destruction of
microorganisms by heat at a constant temperature follows a first-order rate equation. If the initial number of
cells = N0, the number of destroyed cells = N’ at time t and N the surviving cells, then death rate is
dN
k d (N 0 N') kdN
dt
where kd is specific death constant dependent on microbial type and temperature. When integrated between N 0 at
time t = 0 and N at time t = t, the following equation is obtained.
N
ln k dt
N0
The rate constant kd is a function of temperature. It increases sharply with temperature and can be experimentally
determined for an organism. According to Arrhenius equation, the relationship between temperature and k d can be
expressed as:
917
This page intentionally left blank.
Bioprocess engineering
(e.g. water and ethanol) are Newtonian fluids. In Non-Newtonian fluids, viscosity varies with shear or agitation
rates. Depending on how the shear stress varies with the shear rate, these are categorized into pseudoplastic,
dilatant and Bingham plastic fluids. The viscosity of pseudoplastic fluids decreases with increasing shear rate,
whereas dilatant fluids show an increase in viscosity with shear rate. Bingham plastic fluids do not flow until a
threshold stress called the yield stress is applied, after which the shear stress increases linearly with the shear rate.
Bingham fluid
Pseudoplastic fluid
Newtonian fluid
Shear stress, Pa
Dilatant fluid
–1
Shear rate, s
Figure 8.14 Relationship between shear rate and shear stress for Newtonian and non-Newtonian fluids.
For Newtonian fluids the viscosity is independent of the shear rate – i.e. it is constant – whereas for non-Newtonian
fluids it is a function of the shear rate. The viscosities of fluids vary over a wide range. Water and many fermentation
broths containing yeasts or bacteria, can be considered to be Newtonian fluids. However, fermentations involving
filamentous fungi or fermentations in which polymers are excreted, will often exhibit non-Newtonian behaviour.
Thus, bioreactor performance influenced by broth rheology is determined by:
• Biomass concentration
• Cell morphology, including size, shape and mass
• Biomass growth rate
• Flexibility and deformability of cells
• Osmotic pressure of the suspending fluid
• Concentration of polymeric substrate
• Concentration of polymeric product
930
Bioprocess engineering
To obtain maximum reaction rates, the particle size of the support material and enzyme loading need to be
optimized, and a support material with the correct surface characteristics must be selected.
Various methods for enzyme immobilization may be subdivided into two general classes: entrapment and bound
methods.
Entrapment method
The entrapment method is based on the localization of an enzyme within the micro space. Enzymes can be immobilized
by entrapment in a porous matrix or by encapsulation in a semipermeable membrane capsule or between membranes,
such as in a hollow-fiber unit. In entrapment, the enzymes are not directly attached to the support surface, but
simply trapped inside the polymer matrix. Thus, loss of enzyme activity upon immobilization is minimum.
Matrix entrapment
Matrix entrapment involves entrapping enzymes within the interstitial spaces of a cross-linked water-insoluble
polymer. Matrices used are usually polymeric materials such as calcium alginate, agar and polyacrylamide. All
these gels can be formed with a simple and similar procedures. In all the protocols, enzymes are well mixed with
monomers/polymers and cross-linking agents in a solution. The solution is then exposed to polymerization promoters
to start the process of gel formation. The solution is poured into a mold to achieve the desired shape. Among
different gels, polyacrylamide is the most widely used matrix for entrapping enzymes. It has the advantage of
being non-ionic.
Membrane entrapment
E E E
E E E E Membrane
E E E
E E E
E
Bound method
931
This page intentionally left blank.
Bioprocess engineering
8.11 Scale up
The term scale up is used for the step from small scale to production scale. Transferring to an industrial scale
processes successfully developed at the lab scale is not a simple procedure. Scale up is the successful start up and
operation of a commercial size unit whose design and operating procedures are in part based on experimentation
and demonstration at a smaller scale. While the scale-up of any bioprocess can involve a host of issues, the
challenges are compounded when the process involves batch fermentation. The phenomenon that needs to be
taken into account during scale up can be divided into physical processes (transport phenomenon) and metabolic
processes (microbial kinetics).
Due to the typical fragility of the engineered microorganisms, large-scale fermentation vessels must be designed
with the ability to:
• Remove the heat buildup that results from metabolic processes;
• Manage agitation and mixing with minimal shear damage;
• Effectively control the highly variable liquid flow rates and turndowns that are associated with batch fermentation;
• Execute safeguards and sterilization techniques to guard against potential contamination.
Upstream process
Fermentation process
Crude products
Downstream process
Finished products
Bioprocessing treats raw materials and generates useful products. A problem common to all biological processes,
whether based on fermentation or cell culture technology, is the need to recover the product. Fermentation broths
are complex, aqueous mixtures of cells, comprising the soluble extracellular, intracellular products and any
unconverted substrates. The fermentation broth has to be processed and passed through several stages of
separation and purification. Any treatment of the culture broth after fermentation to concentrate and purify the
product is known as downstream processing. Individual operations or steps in the process that alter the
properties of materials are called unit operations. It usually refers to processes that cause physical modifications
935
Bioprocess engineering
to materials, such as a change of phase or component concentration. Unit operations involve centrifugation,
chromatography, crystallization, membrane processes (such as dialysis, ultrafiltration, microfiltration, and reverse
osmosis), distillation, drying, evaporation, mixing, precipitation, solvent extraction etc.
The recovery of product from fermentation broths depends very much on the type of cell and how the bioreactor
is designed and operated. The selection of appropriate purification step depends on the nature of the end product,
its concentration, the side product present, the stability of the biological materials and necessary degree of
purification. Although each recovery scheme will be different, the sequence of steps in downstream processing
can be generalized depending on whether the biomass itself is the desired product (e.g. bakers’ yeast), whether
the product is contained within the cells (e.g. enzymes and recombinant proteins), or whether the product
accumulates outside the cells in the fermentation liquor (e.g. ethanol, antibiotics, and monoclonal antibodies).
General schemes for these three types of downstream processing operation are represented in figure and involve
the following major steps.
The strategy during downstream processing can be generalized into five steps:
1. Cell separation: It involves the separation of cells from fermentation broths. This step involves unit operations
such as filtration, centrifugation and flocculation.
2. Cell disruption and cell debris removal: In those cases where the intracellular products (i.e. the products are
located inside the cells) are required, the cells must first be disrupted. Some products may be present in the
solution within the cytoplasm, while others may be insoluble and exist as membrane-bound proteins or small
insoluble particles called inclusion bodies. In the latter case, these must be solubilized before further purification.
Unit operations such as high-pressure homogenization, ultrasonication are used to break open the cells and
release their contents for subsequent purification. The cell debris generated during cell disruption is separated
from the product by filtration or centrifugation. A variety of methods is available to disrupt cells. Cell disruption
methods can be classified as either mechanical and non-mechanical. Typical animal cells are fragile and can be
easily ruptured by using a low shearing force or a change in the osmotic pressure. In contrast, bacteria, fungi
such as yeasts, and plant cells have rigid cell walls, the disruption of which requires high shearing forces.
Mechanical methods include grinding with abrasives, high-speed agitation, high-pressure homogenization
and ultrasonication. Non-mechanical methods such as osmotic shock, enzymic digestion of cell walls, and
treatment with solvents and detergents can also be applied. A widely-used technique for cell disruption is high-
pressure homogenization. In this process, cell disintegration takes place upon the application of high hydrostatic
pressure followed by immediate pressure release as the cell suspension passes through a valve. Ultrasonication
(on the order of 20 kHz) causes high-frequency pressure fluctuations in the liquid, leading to the repeated
formation and collapse of bubbles. Cell disruption by ultrasonication is used extensively in the laboratory.
3. Product isolation or concentration: It involves removal of those components whose properties vary markedly
from that of the desired product. Solvent extraction, adsorption, ultrafiltration and precipitation are some of the
unit operations involved in products isolation. Liquid extraction, also known as solvent extraction and partitioning,
is a method to separate compounds based on their relative solubilities in two different immiscible liquids,
usually water and an organic solvent. It is an extraction of a substance from one liquid into another liquid phase.
4. Product purification: It is done to separate those contaminants that resemble the product very closely in
physical and chemical properties. Examples of some unit operations used in purification include chromatography
and membrane separation (ultrafiltration and reverse osmosis).
5. Product preparation: It describes the final processing steps which end with the packaging of the product in a
form that is stable, easily transportable and convenient. This step involves unit operations such as drying,
crystallization and freeze-drying.
936
This page intentionally left blank.
Bioprocess engineering
Treatment processes
Waste materials in wastewater can be classified into three major categories: industrial wastes, domestic wastes
and agricultural wastes. Each of these waste materials has its own characteristics, and thus treatment methods
vary. The waste treatment methods include physical, chemical and biological treatments.
1. Physical treatment
Physical treatment includes screening, flocculation, sedimentation and filtration, which are usually used for the
removal of insoluble materials.
2. Chemical treatment
Chemical treatment includes chemical oxidations and chemical precipitation.
3. Biological treatment
Biological treatment includes the aerobic and anaerobic treatment of wastewater by a mixed culture of
microorganisms.
945
Bioprocess engineering
The physical, chemical and biological treatments can be further categorized as primary and secondary (or biological)
treatments.
Primary treatment includes physical and chemical processes, such as coagulation and sedimentation, to remove
particles too large to pass through simple screening devices. Primary treatment typically decreases 25 to 40
percent of the BOD and removes about 60 percent of the suspended solids.
In secondary treatment (or biological treatment), the physical and chemical processes that make up primary
treatment are augmented with processes that involve the microbial oxidation of wastes. Such biological treatment
mimics nature by utilizing microorganisms to oxidize the organic compounds. The main purpose of secondary
treatment is to lower BOD.
Any treatment above secondary treatment is defined as tertiary treatment. This treatment is sometimes called
as the final or advanced treatment and consists of removing the organic load left after secondary treatment.
A typical wastewater treatment process includes the primary and secondary treatment, depending on the degree of
purification. Biological treatment may be achieved aerobically or anaerobically in a number of ways. The most
widely used aerobic processes are trickling filters, activated sludge processes and their modifications. The anaerobic
processes (digestion, filtration and sludge blankets) are used both in the treatment of specific wastewaters and in
sludge conditioning.
Trickling filters
The basic principle of aerobic trickle filters is that a microbial population is allowed to develop as a biofilm on an
inert support material within a biological reactor. Wastewater is continuously sprayed over the surface of the
support material which percolates through the filter bed, where it is biodegraded by the microbial population.
Aeration is achieved by exploiting the difference in temperature between the inside and the outside of the reactor,
resulting in a countercurrent of air. High microbial activity within the reactor causes a rise in temperature, and warm
air rises and allows fresh air to enter at the bottom of the reactor.
Activated sludge
Activated sludge processes include a well agitated and aerated continuous flow reactor and a settling tank. This is
a two step process, involving biological treatment and secondary settlement. Biological treatment is performed in
an aeration tank containing a diverse range of microorganisms. To maintain aerobic conditions, air or oxygen is
pumped into the tank and the mixture is kept thoroughly agitated.
Secondary settlement occurs when the treated effluent (mixed liquor) from the aeration tank passes into a secondary
settlement tank. In settlement tank, solids mostly bacterial masses are separated from the liquid by subsidence.
Sludge treatment
The purpose of trickling filter and activated sludge process is to lower BOD from the wastewater before the treated
liquid is released to a water body. What remains to be disposed off is a mixture of solids and water, called sludge.
Sludge treatment process can be divided into two categories– either aerobic or anaerobic. In aerobic treatment
sludge is brought into contact with the mixed microbial population of aerobic microorganisms and oxygen. During
the process, part of the biodegraded material is converted into carbon dioxide and a portion becomes new biomass.
A major problem associated with aerobic treatment is the disposal of excess biomass produced during the degradation
of waste. The traditional method of sludge processing utilizes anaerobic treatment. That is, it involves bacteria that
thrive in the absence of oxygen. Anaerobic digestion is slower than aerobic digestion, but has the advantage that
only a small percentage of the wastes is converted into new bacterial cells. Most of the organics are converted into
carbon dioxide and methane gas.
The anaerobic digestion process is very complex and can be divided into two phases. In the first phase, complex
organic compounds such as proteins, lipids and carbohydrates are converted into simpler organic materials. The
bacteria that perform this conversion are commonly referred to as acid formers. In the second phase, organic
materials are converted slowly into CO2, CH4 and other stable end products by methane forming bacteria. These
946
Bioprocess engineering
bacteria are very sensitive to temperature, pH, toxins and oxygen. The optimal temperature and pH range for
methanogenic bacteria are 35° to 40°C and pH 7 to 7.8. Methanogenic bacteria used for this purpose are
Methanobacterium, Methanobacillus and Methanococcus.
Oxidation ponds
Oxidation ponds are large, shallow ponds, typically 1-2 m deep. It acts as a shallow waste-treatment reactor where
raw or partially treated sewage is decomposed by microorganisms. The conditions are similar to eutrophic lake.
The ponds can be designed to maintain aerobic conditions. Oxidation ponds are also used to augment secondary
treatment, in which case they are often called polishing ponds.
8.15 Bioremediation
Bioremediation is a biological process whereby organic wastes are biologically degraded under controlled conditions.
This process involves the use of living organisms, primarily microorganisms, to degrade the environmental
contaminants. In this process, contaminant compounds are transformed by living organisms through reactions that
take place as a part of their metabolic processes. For bioremediation to be effective, microorganisms must
enzymatically attack the contaminants and convert them to harmless products. Hence, it is effective only where
environmental conditions permit microbial growth and activity. Thus, its application involves the manipulation of
environmental parameters to allow microbial growth and degradation to proceed at a faster rate. The control and
optimization of bioremediation processes is a complex phenomenon. Various factors influencing this process include:
the existence of a microbial population capable of degrading the pollutants; the availability of contaminants to the
microbial population; and the environment factors (type of soil, temperature, pH, the presence of oxygen or other
electron acceptors, and nutrients).
Bioremediation strategies
Bioremediation strategies can be in-situ or ex-situ. In-situ bioremediation involves treating the contaminated material
at the site while ex-situ bioremediation involves the removal of the contaminated material to be treated elsewhere.
In-situ bioremediation techniques are generally the most desirable options due to lower cost and less disturbance
since they provide the treatment at a site avoiding excavation and transport of contaminants. Ex-situ bioremediation
requires transport of the contaminated water or excavation of contaminated soil prior to remediation treatments.
In-situ and ex-situ bioremediation strategies involve different technologies such as bioventing, biosparging, bioreactor,
composting, landfarming, bioaugmentation and biostimulation.
Bioventing is an in-situ bioremediation technology that uses microorganisms to biodegrade organic constituents
adsorbed on soils in the unsaturated zone (extends from the top of the ground surface to the water table). Bioventing
enhances the activity of indigenous bacteria and stimulates the natural in-situ biodegradation of contaminated
materials in soil by inducing air or oxygen flow into the unsaturated zone and, if necessary, by adding nutrients.
Biosparging is also an in-situ bioremediation technology that uses indigenous microorganisms to biodegrade
organic constituents in the saturated zone. In biosparging, air (or oxygen) and nutrients (if needed) are injected
into the saturated zone to increase the biological activity of the indigenous microorganisms.
947
This page intentionally left blank.
Chapter 09
Bioinformatics
9.1 Introduction
Bioinformatics is a discipline at the intersection of biology, computer science, information technology and mathematics.
There are a number of definitions put forth for bioinformatics. Most accepted definition of bioinformatics is ‘a
subject of genetic data collection, analysis and dissemination to the research community’. Bioinformatics aims at
integrating and analyzing a wealth of biological data with the aim of identifying and assigning a function to each. It
is applied, for example, in the construction of genetic and physical maps of genomes, gene discovery, the inference
of the molecular function and three-dimensional structure of their products, the interpretation of the effect of gene
variations on the phenotype, the reconstruction of interaction and signal transduction pathways and the simulation
of biological systems.
Database classification
Biological databases can be classified into sequence and structure databases or primary and secondary databases.
Primary and secondary databases are classified on the basis of source of data.
Primary databases
Databases consisting of data derived experimentally such as nucleotide sequences and three-dimensional structures
are known as primary databases. Examples of these include GenBank, EMBL and DDBJ for nucleotide sequences
and the Protein Data Bank (PDB) for 3D-protein structures. GenBank, EMBL and DDBJ are three major public
sequence databases that store raw nucleic acid sequence data produced and submitted by researchers worldwide:
which are all freely available on the Internet.
Secondary databases
A secondary database derives from the analysis or treatment of the primary database. A secondary sequence
database contains information like the conserved sequence, signature sequence and active site residues of the
protein families arrived by multiple sequence alignment of a set of related proteins.
954
Bioinformatics
Primary databases
EMBL (the European Molecular Biology Laboratory)
GenBank
DDBJ (DNA Data Bank of Japan)
Swiss-Prot
PIR (Protein Information Resource)
Uniprot
Secondary databases
TrEMBL (Translation of EMBL nucleotide sequence database)
Pfam
Prosite
Structure databases
Primary databases
PDB (Protein Data Bank)
Secondary databases
SCOP (Structural Classification of Proteins)
CATH (Class, Architecture, Topology and Homology)
NDB (Nucleic Acid Database)
DSSP (Database of Secondary Structure Assignments)
HSSP (Homology-derived Secondary Structure of Proteins)
Swiss-Prot
Swiss-Prot (developed at the Swiss Institute of Bioinformatics) is a protein sequence database which strives to
provide a high level of annotations (such as the description of the function of a protein, its domains structure, post-
translational modifications, variants etc.), a minimal level of redundancy and high level of integration with other
databases. Two related databases closely associated with Swiss-Prot are the ENZYME DB and PROSITE (a set of
motifs pattern database). The ENZYME DB stores the following information about enzymes:
• EC Number
• Recommended name
• Alternative names, if any
• Catalytic activity
• Cofactors, if any
• Pointers to Swiss-Prot and other data banks
• Pointers to disease associated with enzyme deficiency if any known
955
This page intentionally left blank.
Bioinformatics
Genome databases
Genome sequences form entries in the standard nucleic acid sequence databases. Many species like Arabidopsis
thaliana, C. elegans, Rice etc., have special databases that bring together the genome sequence and its annotation
with other data related to the species.
NCBI
The National Center for Biotechnology Information (NCBI) was established on November 4, 1988 at the National
Institutes of Health (NIH) with an objective to develop new information technologies to help in the understanding of
fundamental molecular and genetic processes that control health and disease. The NCBI has been assigned with
creating automated systems for storing and analyzing knowledge about molecular biology, biochemistry, and genetics;
facilitating the use of such databases and software by the research and medical community; coordinating efforts to
gather biotechnology information both nationally and internationally; and performing research into advanced methods
of computer-based information processing for analyzing the structure and function of biologically important molecules.
At NCBI, databases are linked through a unique search and retrieval system, called Entrez. Entrez allows a user
to not only access and retrieve specific information from a single database but also access integrated information
from many NCBI databases. It is a gateway that allows text-based searches for a wide variety of data, including
annotated genetic sequence information, structural information, as well as citations and abstracts, full papers, and
taxonomic data.
Specialized databases
Specialized databases normally serve a specific research community or focus on a particular organism. Many
individuals or groups select, annotate, and recombine data focused on particular topics, and include links affording
streamlined access to information about subjects of interest. The protein kinase resource is a specialized com-
pilation that includes sequences, structures and functional information, laboratory procedures, list of interested
scientists, tools for analysis, a bulletin board and links. The HIV protease database store structures of HIV1
proteinases, HIV2 proteinases and SIV proteinases, and their complexes and provides tools for their analysis and
other links.
957
This page intentionally left blank.
Bioinformatics
>P1;CRAB_ANAPL
ALPHA CRYSTALLIN B CHAIN (ALPHA(B)-CRYSTALLIN).
MDITIHNPLIRRPLFSWLAPSRIFDQIFGEHLQESELLPASPSLSPFLMRSPIFRMPSWL
ETGLSEMRLEKDKFSVNLDVKHFSPEELKVKVLGDMVEIHGKHEERQDEHGFIAREFN
RKYRIPADVDPLTITSSLSLDGVLTVSAPRKQSDVPERSIPITREEKPAIAGAQRK*
960
Bioinformatics
Pairwise alignment is used between two query sequences at a time. It is used to identify regions of similarity that
may indicate functional, structural and/or evolutionary relationships between two biological sequences (protein or
nucleic acid). It involves matching of homologous positions in two sequences. Positions with no homologous pair
are matched with a space ‘–’ and a group of consecutive spaces is a gap.
C A – – G AT T C G A AT
C G C C G AT T– – – AT
{
gap
The three primary methods of producing pairwise alignments are dot-matrix methods, dynamic programming
and word methods. Although each method has its individual strengths and weaknesses, all three pairwise methods
have difficulty with highly repetitive sequences.
Dynamic programming
The idea behind dynamic programming is, to solve a given problem, we need to solve different parts of the problem
(subproblems), then combine the solutions of the subproblems to reach an overall solution. Often, many of these
subproblems are really the same. The dynamic programming approach seeks to solve each subproblem only once,
961
This page intentionally left blank.
Bioinformatics
Phylogenetic trees
In phylogenetic studies, the most convenient way of visually presenting evolutionary relationships among a group
of organisms is through illustrations called phylogenetic trees. Phylogenetic tree is a two-dimensional graph showing
evolutionary relation between organisms, or genes from various organisms. It is represented by branches and
nodes. Nodes can be internal or external. Each internal node represents the last common ancestor of the two
lineages. External nodes (also termed as terminal nodes, leaves or Operational Taxonomic Units) represent tip of
the tree i.e. extant taxonomic unit under consideration. Nodes correspond to species, organisms or sequences.
Similarly, branches can be internal or external. Internal branches or internodes connect two nodes, whereas
external branches connect a tip and a node.
The branching pattern in a tree is called tree topology. When all branches bifurcate on a phylogenetic tree, it is
referred to as dichotomy. In this case, each ancestor divides and gives rise to two descendants. Sometimes, a
branch point on a phylogenetic tree may have more than two descendants, resulting in a multifurcating node. The
phylogeny with multifurcating branches is called polytomy.
D C B A External node
Branch
Internal node
A phylogenetic tree may be rooted or unrooted. A rooted tree infers the existence of a common ancestor from
which all the other species originate and indicates the direction on the evolutionary process. A rooted tree in which
every node has two descendants is called a binary tree. An unrooted tree does not provide any information about
their common ancestor and shows only the evolutionary relationships between the organisms. Most phylogenetic
trees are rooted.
967
This page intentionally left blank.
Bioinformatics
This analysis is continued for every position in the sequence alignment. Finally, those trees that produce the
smallest number of changes overall for all sequence positions are identified. This method is best suited for sequences
that are quite similar and is limited to small numbers of sequences.
Maximum likelihood approach: This method uses probability calculations to find a tree that best accounts for the
variation in a set of sequences. All possible trees are considered. Hence, the method is only feasible for a small
number of sequences. For each tree, the number of sequence changes or mutations that may have occurred to the
given sequence variation is considered. Because the rate of appearance of new mutations is very small, the more
mutations needed to fit a tree to the data, the less likely the tree.
The maximum likelihood method presents an additional opportunity to evaluate trees with variations in mutation
rates in different lineages, and to use explicit evolutionary models such as the Jukes-Cantor and Kimura models.
The method can be used to explore relationships among more diverse sequences and conditions that are not well
handled by maximum parsimony methods.
Methods for prediction of protein structure from amino acid sequence include:
• Attempts to predict secondary structure without attempting to assemble these regions in three dimensions.
• Homology modeling prediction of the three-dimensional structure of a protein from the known structures of one
or more related proteins.
• Fold recognition, from a library of known structures, determine which of them shares a folding pattern with a
query protein of known sequence but unknown structure.
• Prediction of novel folds, either by a priori or knowledge based methods.
970
Bioinformatics
The Chou-Fasman method for secondary structure prediction is one of the oldest and simplest methods. The basic
idea is that each amino acid residue is assigned three numbers that describes its propensity to be part of α-helices,
β-sheets and turns respectively. A large number (above 100) corresponds to a propensity for that kind of structure.
These parameters may be determined from the occurrence of different amino acids in different types of secondary
structure in known protein structures.
The GOR method is based on the assumption that amino acids flanking the central amino acid residue influence the
secondary structure that the central residue is likely to adopt. This method uses principles of information theory to
derive predictions. This method takes into account not only the probability of each amino acid having a particular
secondary structure, but also the conditional probability of the amino acid assuming each structure given the
contributions of its neighbors (it does not assume that the neighbors have that same structure). The approach is
both more sensitive and more accurate than that of Chou and Fasman because amino acid structural propensities
are only strong for a small number of amino acids such as proline and glycine.
Homology-based methods
Homology-based method combines the ab initio secondary structure prediction of individual sequences and alignment
information from multiple homologous sequences (> 35% identity). The idea behind this approach is that close
protein homologs should adopt the same secondary and tertiary structure.
Table 9.2 Selected programs for performing protein secondary structure prediction
Program Method
BSC Linear discrimination
NNPRED Neural networks enhanced to detect sequence periodicity
Protein sequence analysis (PSA) Discrete space models (hidden Markov models) for patterns of alpha helices,
beta strands, tight turns, and loops in specific structural classes
PREDATOR Based on analysis of long and short-range amino acid interactions and alignments
of sequence pairs
Predict protein server Neural networks of multiple sequence alignment
PSSP Nearest neighbor enhanced by non-intersecting local and multiple sequence
alignments
SOPM, SOPMA Nearest-neighbor method
SSP Linear discriminant analysis based on amino acid composition of local and adjacent
regions
Homology modeling
Homology means having a common evolutionary origin, but does not necessarily mean similarity. It is a qualitative
description of the nature of the relationship between two or more things, and it cannot be partial. Either there is an
evolutionary relationship or there is not.
A major goal of structural biology is to predict the three-dimensional structure of proteins from the sequence of
amino acids. Technique such as X-ray diffraction or NMR are being used to develop the three-dimensional structure
of proteins. Many proteins are simply too large for NMR analysis and cannot be crystallized for X-ray diffraction.
However, alternative strategies are being applied to develop models of protein structure. One method that can be
applied to generate reasonable models of protein structures is homology modeling. It is based on the reasonable
assumption that two homologous proteins will share very similar structures. This procedure, also termed comparative
modeling or knowledge-based modeling, develops a three-dimensional model from a protein sequence based on
the structures of homologous proteins. It is based on two major observations:
1. The structure of a protein is uniquely determined by its amino acid sequence.
2. During evolution, the structure is more stable and changes much slower than the associated sequence, so that
similar sequences adopt practically identical structures, and distantly related sequences still fold into similar structures.
971
This page intentionally left blank.
Bioinformatics
• Structural genomics – It refers to the initial phase of genome analysis, which includes construction of genetic
and physical maps of a genome, identification of genes, annotation of gene features and comparison of genome
structures.
• Functional genomics – It focuses on the dynamic aspects such as gene transcription, translation and protein–
protein interactions, as opposed to the static aspects of the genomic information such as DNA sequence or
structures. Functional genomics not only simply attempts to answer the function of the identified genes but also
the organization and control of genetic pathways that come together to make up the physiology of an organism.
• Comparative genomics – It focuses on the analysis and comparison of genomes from different species. It
includes comparison of gene number, gene location, and gene content from these genomes. The comparison
helps to reveal the extent of conservation among genomes, which will provide insights into the mechanism of
genome evolution and gene transfer among genomes. The comparison helps to reveal the extent of conservation
among genomes, which will provide insights into the mechanism of genome evolution and gene transfer among
genomes. It helps to identify genes that are conserved among species, as well as genes that give each organism
its unique characteristics. In addition, the evolutionary perspective may prove extremely helpful in understanding
disease susceptibility. For example, chimpanzees do not suffer from some of the diseases that occur in humans,
such as malaria and AIDS, even though chimpanzees’ DNA sequence is 98.8 percent identical to ours. A
comparison of the sequence of genes involved in disease susceptibility may reveal the reasons for this species
barrier, thereby suggesting new pathways for prevention of human disease. Similarly, comparative analysis of
the fruit fly genome with the human genome discovered that about 60 percent of genes are conserved between
fly and human. Researchers have found that two-thirds of human genes known to be involved in cancer have
counterparts in the fruit fly. Even more surprisingly, when scientists inserted a human gene associated with
early-onset of Parkinson’s disease into fruit flies, they displayed symptoms similar to those seen in humans with
the disorder, raising the possibility that the tiny insects could serve as a new model for testing therapies aimed
at Parkinson’s.
9.9.2 Proteomics
Proteomics involves studying the structure, expression, localization, interactions and cellular roles of all of the
proteins present in a particular organism i.e. proteome (the word proteome refers to the complete set of proteins
encoded by the genome, including the added variation due to post-translational modifications. The proteome is
neither as uniform nor as static as the genome). It is a field that encompasses: protein expression and purification,
separation, visualization and identification, quantification, interactions, sequence analysis, structural analysis and
protein modification. This field also focuses on the development of new high-throughput techniques and the
computational machinery needed to analyze the data. The central aim of proteomics is the quantitative detection of
proteins in cells and tissues and comparison in different conditions (health, disease, differentiation, drug treatment,
etc). In general, proteomic approaches can be used
974
This page intentionally left blank.