Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
TRANSCRIPTOM
E 1
In 1961, Jacob and Monod proposed a model that the protein-coding gene is
transcribed into a special short-lived intermediate associated with the
ribosome, which was designated as mRNA.
A short, stable RNA, transfer RNA (tRNA), was identified as the predicted
“adaptor”.
3
In 1998, Fire and Mello found that double-stranded RNAs
(dsRNAs) could recognize specific mRNA sequence and then led
to the degradation of the target mRNAs, which was known as
RNA interference (RNAi).
6
TRANSCRIPTOMICS AIMS
I. To catalogue all species of transcripts.
7
TRANSCRIPTOMES GIVE US
INFORMATION OF GENE
EXPRESSION
Why use transcriptomes in biological research?
Pros Cons
• Easy, accessible • Snapshot in time (different times,
• Immediate access to the different expression patterns)
protein coding portion of the • Absence of gene expression does not
genome mean it is not present in the genome.
• Identify alternative splicing • Difficult to ensure that you have
• Identify SNPs in coding sampled a single cell type.
regions • Statistical analysis is highly 8
dependent on experimental design
TECHNOLOGIES
Hybridization-based approaches
– fluorescently labelled cDNA with custom-made microarrays
– commercial high-density oligo microarrays
Sequence-based approaches
– Sanger sequencing of cDNA or EST libraries
– serial analysis of gene expression (SAGE)
– cap analysis of gene expression (CAGE)
– massively parallel signature sequencing (MPSS)
9
CDNA LIBRARIES AND DATA
MINING
Collections of cDNAs were obtained from various normal and diseased
tissues, as well as from reference cell lines.
10
Global gene expression information provided by cDNA/EST
databases made electronic northern feasible.
11
CDNA SUBTRACTION
The data mining approach was limited by the existing sequence
information and the available gene annotations.
12
cDNA subtraction is a method for separating cDNA molecules that
distinguish related cDNA samples
cDNAs prepared from two different cell types to be compared are rendered
single-stranded, subsequently mixed, and incubated to allow annealing of
sequences common to both cell species.
These sequences will hybridize, while sequences unique to one of the cells
will stay single-stranded.
The two populations are hybridized with a driver to tester ratio of at least 10:1. Because of
the large excess of driver molecules, tester sequences are more likely to form driver-tester
hybrids than double-stranded tester.
Only the sequences in common between the tester and the driver hybridize, however, leaving
the remaining tester sequences either single-stranded or forming tester-tester pairs.
14
The driver-tester, double-stranded driver and any single-stranded
driver molecules are subsequently removed (the "subtractive" step),
leaving only tester molecules enriched for sequences not found in the
driver.(target sequences)
15
PROCESS
16
DISADVANTAGE
Abundant mRNAs (cDNAs) are over-represented due to the lack of
normalization.
The normalization step equalizes the abundance of cDNA fragments within the
target population, and the subtraction step excludes sequences that are common
to the cell populations being contrasted 17
Differential display PCR is a method to separate and clone
individual mRNAs that are differentially expressed.
20
2. (a) Add biotin-labeled dT B
primer:
(b) Synthesize ds cDNA.
21
22
(b) Cleave with “anchoring enzyme”(restriction
enzyme)
Type II restriction
enzyme used (E.g. NlaIII.)
Average length of cDNA
–256bp with sticky ends
created.
23
5. DIVIDE INTO TWO POOLS AND ADD LINKER SEQUENCES
24
Linker
s
B B
B B
Pool A Pool B
25
6.CLEAVING WITH TAGGING ENZYME
Tagging enzyme, (usually BsmF1) cleave DNA, releasing the
linker-adapted SAGE tag from each cDNA.
Repair of ends to make blunt ended tags using DNA polymerase
(Klenow fragments) and dNTPs.
26
7. Combine pools and ligate.(formation of ditags)
This leaves a “sticky” end with the sequence GTAC (or CAGT on the
other strand) at each end of the ditag.
27
9. Ligate digitags. (Concatamerization of Ditags)
28
APPLICATIONS OF SAGE
To analyze differences between gene expression patterns of
cancer cells and their normal counter parts.
Accurate.
30
PROBLEMS IN
SAGE…
• Length of gene tag is extremely short (13 or 14bp), so
if the tag is derived from an unknown gene, it is
difficult to analyze with such a short sequence.
The small fragments from the very beginnings of mRNAs (5' ends
of capped transcripts) are extracted reverse-transcribed to
DNA PCR amplified sequenced.
34
TRANSCRIPTOME ANALYSIS BASED
ON MICROARRAYS
DNA microarray or chip method is based on nucleic acid hybridization.
36
BASIC FLOWCHART OF
TRANSCRIPTOME STUDY USING
MICROARRAY/DNA CHIP Use Reverse
Collect mRNA Transcriptase (RT) Label cDNA with
molecules enzyme to
flourescent dyes
from a cell produce cDNA
molecules from
the mRNA
Prepare Hybridization
microarray/DNA of labeled
chip (cDNA from
Place labeled cDNA with
reference genes or cDNA on cDNA
oligonucleotide microarray (complimentary
mixture) slide ) on microarray
The RNA samples are labelled with different fluorophores, tumor RNA with
a red fluorophore, normal RNA with a green one.
Both samples are hybridized together on the microarray. If the spot then
appears in red, this means higher expression of that gene in tumor tissue
compared to normal tissue.
44
COMPLICA
TIONS
• When comparing more than one transcriptome,
differences in mRNA amount and hybridization intensities
must be due to difference in transcripts rather than due
to experimental errors.
45
NORMALIZATION PROCEDURES
TO COUNTER
EXPERIMENTAL FACTORS
• Enables results from different array experiments to
be accurately compared
• Normalization procedure:
– Negative controls, so that background can be
determined in each experiment
– Positive controls, always give identical signals
46
• In vertebrates, actin gene is used as positive
control
– Its expression level is fairly constant in
any particular tissue
– Even in developmental, or diseased state
47
APPLICATIONS
Studying the transcriptome can lead to various applications
50
Class prediction means to find a set of features that are
predictive for a certain, predefined class.
It is usually based on class discovery, followed by the idea to
establish a classifier.
52
The prototype of NGS is massive parallel signature sequencing (MPSS), which
applies four rounds of restriction enzyme digestion and ligation reactions to
determine the nucleotide sequence of cDNA ends generating a 17–20 bp
sequence as the fingerprint of a corresponding RNA.
However, due to the nature of digestion and ligation reactions, a large fraction
of the sequence signatures obtained is not long enough to be unique
fingerprints of RNA molecules.
54
APPLICATION
Disease diagnosis.
56
57
SUMMARY
Microarray-based analysis of gene expression permits assessment of
normal mRNA levels and disease-related alterations on a genome
wide scale.
58
In molecular pathology, gene expression patterns (often called gene
signatures, molecular portraits, or gene profiles) were identified
that correlate with disease state or clinical outcome. This enables
the nomination of candidate genes that may play a regulatory role in
the processes under study.
60