Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Single-cell transcriptomics examines the gene expression level of individual cells in a given population by
simultaneously measuring the messenger RNA (mRNA) concentration of hundreds to thousands of genes.[1] The
unraveling of heterogenous cell populations, reconstruction of cellular developmental trajectories and modelling of
transcriptional dynamics all previously masked in bulk transcriptome measurements is made possible through
analysis of this transcriptomic data.[2]
Contents
Background
Experimental steps
Isolating single cells
Quantitative PCR (qPCR)
Single-cell RNA-seq
Considerations
Data analysis
Clustering
Dimensionality reduction
Differential expression
Pseudotemporal ordering
Network inference
See also
References
External links
Background
Gene expression analysis has become routine through the development of high-throughput RNA sequencing (RNA-
seq) and microarrays. RNA analysis that was previously limited to tracing individual transcripts by Northern blots or
quantitative PCR is now used frequently to characterize the expression profiles of populations of thousands of
cells.The data produced from the bulk based assays has led to the identification of genes that are differentially
expressed in distinct cell populations and biomarker discovery.[3]
These genomic studies are limited as they provide measurements for whole tissues and as a result show an average
expression profile for all the constituent cells. In multicellular organisms different cell types within the same
population can have distinct roles and form subpopulations with different transcriptional profiles. Correlations in the
gene expression of the subpopulations can often be missed due to the lack of subpopulation identification.[4]
Moreover, bulk assays fail to identify if a change in the expression profile is due to a change in regulation or
composition, in which one cell type arises to dominate the population. Lastly, when examining cellular progression
through differentiation, average expression profiles are only able to order cells by time rather than their stage of
development and are consequently unable to show trends in gene expression levels specific to certain stages.[5]
Recent advances in biotechnology allow the measurement of gene expression in hundreds to thousands of individual
cells simultaneously. Whilst these breakthroughs in transcriptomics technologies have enabled the generation of
single-cell transcriptomic data there are new computational and analytical challenges presented by the data produced.
Techniques used for analysing RNA-seq data from bulk cell populations can be used for single-cell data but many new
computational approaches have been designed for this data type to facilitate a complete and detailed study of single-
cell expression profiles.[6]
Experimental steps
There is currently no standardized technique to generate single-cell data, all methods must include cell isolation from
the population, lysate formation, amplification through reverse transcription and quantification of expression levels.
Common techniques for measuring expression are quantitative PCR or RNA-seq.
Micropipetting
Cytoplasmic aspiration (http://www.single-cell-analysis.com/tag/cytoplas
mic-aspiration/)
Laser capture microdissection.
High throughput methods are able to quickly isolate hundreds to tens of
thousands of cells.[7] Common techniques include: Fluorescence Assisted Cell
Sorting workflow (FACS)
Fluorescence activated cell sorting (FACS)
Microfluidic devices
Single-cell RNA-seq
The Single-cell RNA-seq technique converts a population of RNAs to a library of cDNA fragments. These fragments
are sequenced by high-throughput next generation sequencing techniques and the reads are mapped back to the
reference genome, providing a count of the number of reads associated with each gene.[10]
Normalisation of RNA-seq data accounts for cell to cell variation in the efficiencies of the cDNA library formation and
sequencing. One method relies on the use of extrinsic RNA spike-ins (RNA sequences of known sequence and
quantity) that are added in equal quantities to each cell lysate and used to normalise read count by the number of
reads mapped to spike-in mRNA.[11]
When using RNA spike-ins for normalisation the assumption is made that the amplification and sequencing
efficiencies for the endogenous and spike-in RNA are the same. Evidence suggests that this is not the case given
fundamental differences in size and features, such as the lack of a polyadenylated tail in spike-ins and therefore
shorter length.[14] Additionally, normalisation using UMIs assumes the cDNA library is sequenced to saturation,
which is not always the case.[15]
Data analysis
Insights based on single-cell data analysis assumes that the input is a matrix of normalised gene expression counts,
generated by the approaches outline above, and can provide opportunities that are not obtainable by bulk.
1. Identification and characterization of cell types and their spatial organisation in time
2. Inference of gene regulatory networks and their strength across individual cells
3. Classification of the stochastic component of transcription
The techniques outlined have been designed to help visualise and explore patterns in the data in order to facilitate the
revelation of these three features.
Clustering
Clustering allows for the formation of subgroups in the cell population. Cells can be clustered by their transcriptomic
profile in order to analyse the sub-population structure and identify rare cell types or cell subtypes. Alternatively,
genes can be clustered by their expression states in order to identify covarying genes. A combination of both clustering
approaches, known as biclustering, has been used to simultaneously cluster by genes and cells to find genes that
behave similarly within cell clusters.[17]
Clustering methods applied can be K-means clustering, forming disjoint groups or Hierarchical clustering, forming
nested partitions.
Biclustering
Biclustering provides several advantages by improving the resolution of clustering. Genes that are only informative to
a subset of cells and are
hence only expressed there
can be identified through
biclustering. Moreover,
similarly behaving genes that
differentiate one cell cluster
from another can be
identified using this
method.[18]
Dimensionality
reduction
Dimensionality reduction K-Means-Gaussian-data
Differential expression
Detecting differences in gene expression level between two populations is used both single-cell and bulk
transcriptomic data. Specialised methods have been designed for single-cell data that considers single cell features
such as technical dropouts and shape of the distribution e.g. Bimodal vs. unimodal.[21]
Gene ontology enrichment
Gene ontology terms describe gene functions and the relationships between those functions into three classes:
1. Molecular function
2. Cellular component
3. Biological process
Gene Ontology (GO) term enrichment is a technique used to identify which GO terms are over-represented or under-
represented in a given set of genes. In single-cell analysis input list of genes of interest can be selected based on
differentially expressed genes or groups of genes generated from biclustering. The number of genes annotated to a GO
term in the input list is normalised against the number of genes annotated to a GO term in the background set of all
genes in genome to determine statistical significance.[22]
Pseudotemporal ordering
Pseudo-temporal ordering (or trajectory inference) is a technique that aims
to infer gene expression dynamics from snapshot single-cell data. The
method tries to order the cells in such a way that similar cells are closely
positioned to each other. This trajectory of cells can be linear, but can also
bifurcate or follow more complex graph structures. The trajectory therefore
enables the inference of gene expression dynamics and the ordering of cells
by their progression through differentiation or response to external stimuli.
The method relies on the assumptions that the cells follow the same path
through the process of interest and that their transcriptional state correlates
to their progression. The algorithm can be applied to both mixed populations
and temporal samples.
Network inference
Gene regulatory network inference is a technique that aims to construct a network, shown as a graph, in which the
nodes represent the genes and edges indicate co-regulatory interactions. The method relies on the assumption that a
strong statistical relationship between the expression of genes is an indication of a potential functional
relationship.[26] The most commonly used method to measure the strength of a statistical relationship is correlation.
However, correlation fails to identify non-linear relationships and mutual information is used as an alternative. Gene
clusters linked in a network signify genes that undergo coordinated changes in expression.[27]
See also
Single-cell analysis
Single-cell sequencing
Transcriptome
Transcriptomics
References
1. Kanter, Itamar; Kalisky, Tomer (1 January 2015). "Single Cell Transcriptomics: Methods and Applications" (http://j
ournal.frontiersin.org/article/10.3389/fonc.2015.00053/full). Frontiers in Oncology. 5.
doi:10.3389/fonc.2015.00053 (https://doi.org/10.3389%2Ffonc.2015.00053). ISSN 2234-943X (https://www.world
cat.org/issn/2234-943X).
2. Liu, Serena; Trapnell, Cole (17 February 2016). "Single-cell transcriptome sequencing: recent advances and
remaining challenges" (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4758375). F1000Research. 5.
doi:10.12688/f1000research.7223.1 (https://doi.org/10.12688%2Ff1000research.7223.1). ISSN 2046-1402 (https:
//www.worldcat.org/issn/2046-1402). PMC 4758375 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4758375).
3. Szabo, David T. "Chapter 62 - Transcriptomic biomarkers in safety and risk assessment of chemicals".
Biomarkers in Toxicology (http://www.sciencedirect.com/science/article/pii/B9780124046306000622.). Academic
Press. pp. 1033–1038. ISBN 9780124046306.
4. Kanter, Itamar; Kalisky, Tomer (10 March 2015). "Single Cell Transcriptomics: Methods and Applications" (https://
www.ncbi.nlm.nih.gov/pmc/articles/PMC4354386). Frontiers in Oncology. 5. doi:10.3389/fonc.2015.00053 (https:/
/doi.org/10.3389%2Ffonc.2015.00053). ISSN 2234-943X (https://www.worldcat.org/issn/2234-943X).
PMC 4354386 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4354386).
5. Trapnell, Cole (1 October 2015). "Defining cell types and states with single-cell genomics" (http://genome.cshlp.or
g/content/25/10/1491.short). Genome Research. 25 (10): 1491–1498. doi:10.1101/gr.190595.115 (https://doi.org/
10.1101%2Fgr.190595.115). ISSN 1088-9051 (https://www.worldcat.org/issn/1088-9051). PMC 4579334 (https://
www.ncbi.nlm.nih.gov/pmc/articles/PMC4579334).
6. Stegle, Oliver; Teichmann, Sarah A.; Marioni, John C. (1 March 2015). "Computational and analytical challenges
in single-cell transcriptomics" (http://www.nature.com/nrg/journal/v16/n3/full/nrg3833.html). Nature Reviews
Genetics. 16 (3): 133–145. doi:10.1038/nrg3833 (https://doi.org/10.1038%2Fnrg3833). ISSN 1471-0056 (https://
www.worldcat.org/issn/1471-0056).
7. Poulin, Jean-Francois; Tasic, Bosiljka; Hjerling-Leffler, Jens; Trimarchi, Jeffrey M.; Awatramani, Rajeshwar (1
September 2016). "Disentangling neural cell diversity using single-cell transcriptomics" (http://www.nature.com/ne
uro/journal/v19/n9/fig_tab/nn.4366_F2.html). Nature Neuroscience. 19 (9): 1131–1141. doi:10.1038/nn.4366 (http
s://doi.org/10.1038%2Fnn.4366). ISSN 1097-6256 (https://www.worldcat.org/issn/1097-6256).
8. Radonić, Aleksandar; Thulke, Stefanie; Mackay, Ian M.; Landt, Olfert; Siegert, Wolfgang; Nitsche, Andreas (23
January 2004). "Guideline to reference gene selection for quantitative real-time PCR". Biochemical and
Biophysical Research Communications. 313 (4): 856–862. ISSN 0006-291X (https://www.worldcat.org/issn/0006-
291X). PMID 14706621 (https://www.ncbi.nlm.nih.gov/pubmed/14706621).
9. Wildsmith, S. E.; Archer, G. E.; Winkley, A. J.; Lane, P. W.; Bugelski, P. J. (1 January 2001). "Maximization of
signal derived from cDNA microarrays". BioTechniques. 30 (1): 202–206, 208. ISSN 0736-6205 (https://www.worl
dcat.org/issn/0736-6205). PMID 11196312 (https://www.ncbi.nlm.nih.gov/pubmed/11196312).
10. Wang, Zhong; Gerstein, Mark; Snyder, Michael (23 March 2017). "RNA-Seq: a revolutionary tool for
transcriptomics" (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2949280). Nature Reviews. Genetics. 10 (1): 57–
63. doi:10.1038/nrg2484 (https://doi.org/10.1038%2Fnrg2484). ISSN 1471-0056 (https://www.worldcat.org/issn/1
471-0056). PMC 2949280 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2949280).
11. Jiang, Lichun; Schlesinger, Felix; Davis, Carrie A.; Zhang, Yu; Li, Renhua; Salit, Marc; Gingeras, Thomas R.;
Oliver, Brian (23 March 2017). "Synthetic spike-in standards for RNA-seq experiments" (https://www.ncbi.nlm.nih.
gov/pmc/articles/PMC3166838). Genome Research. 21 (9): 1543–1551. doi:10.1101/gr.121095.111 (https://doi.or
g/10.1101%2Fgr.121095.111). ISSN 1088-9051 (https://www.worldcat.org/issn/1088-9051). PMC 3166838 (https:/
/www.ncbi.nlm.nih.gov/pmc/articles/PMC3166838).
12. Islam, Saiful; Zeisel, Amit; Joost, Simon; La Manno, Gioele; Zajac, Pawel; Kasper, Maria; Lönnerberg, Peter;
Linnarsson, Sten (1 February 2014). "Quantitative single-cell RNA-seq with unique molecular identifiers" (http://w
ww.nature.com/nmeth/journal/v11/n2/full/nmeth.2772.html). Nature Methods. 11 (2): 163–166.
doi:10.1038/nmeth.2772 (https://doi.org/10.1038%2Fnmeth.2772). ISSN 1548-7091 (https://www.worldcat.org/iss
n/1548-7091).
13. Kharchenko, Peter V.; Silberstein, Lev; Scadden, David T. (1 July 2014). "Bayesian approach to single-cell
differential expression analysis" (http://www.nature.com/nmeth/journal/v11/n7/abs/nmeth.2967.html). Nature
Methods. 11 (7): 740–742. doi:10.1038/nmeth.2967 (https://doi.org/10.1038%2Fnmeth.2967). ISSN 1548-7091 (h
ttps://www.worldcat.org/issn/1548-7091). PMC 4112276 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4112276
).
14. Svensson, Valentine; Natarajan, Kedar Nath; Ly, Lam-Ha; Miragaia, Ricardo J.; Labalette, Charlotte; Macaulay,
Iain C.; Cvejic, Ana; Teichmann, Sarah A. (6 March 2017). "Power analysis of single-cell RNA-sequencing
experiments" (http://www.nature.com/nmeth/journal/vaop/ncurrent/full/nmeth.4220.html?WT.feed_name=subjects
_rna-sequencing). Nature Methods. advance online publication. doi:10.1038/nmeth.4220 (https://doi.org/10.1038
%2Fnmeth.4220). ISSN 1548-7105 (https://www.worldcat.org/issn/1548-7105).
15. Islam, Saiful; Zeisel, Amit; Joost, Simon; La Manno, Gioele; Zajac, Pawel; Kasper, Maria; Lönnerberg, Peter;
Linnarsson, Sten (1 February 2014). "Quantitative single-cell RNA-seq with unique molecular identifiers" (http://w
ww.nature.com/nmeth/journal/v11/n2/abs/nmeth.2772.html). Nature Methods. 11 (2): 163–166.
doi:10.1038/nmeth.2772 (https://doi.org/10.1038%2Fnmeth.2772). ISSN 1548-7091 (https://www.worldcat.org/iss
n/1548-7091).
16. Stegle, Oliver; Teichmann, Sarah A.; Marioni, John C. (1 March 2015). "Computational and analytical challenges
in single-cell transcriptomics" (http://www.nature.com/nrg/journal/v16/n3/full/nrg3833.html). Nature Reviews
Genetics. 16 (3): 133–145. doi:10.1038/nrg3833 (https://doi.org/10.1038%2Fnrg3833). ISSN 1471-0056 (https://
www.worldcat.org/issn/1471-0056).
17. Buettner, Florian; Natarajan, Kedar N.; Casale, F. Paolo; Proserpio, Valentina; Scialdone, Antonio; Theis, Fabian
J.; Teichmann, Sarah A.; Marioni, John C.; Stegle, Oliver (1 February 2015). "Computational analysis of cell-to-
cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells" (http://www.nature.
com/nbt/journal/v33/n2/full/nbt.3102.html). Nature Biotechnology. 33 (2): 155–160. doi:10.1038/nbt.3102 (https://
doi.org/10.1038%2Fnbt.3102). ISSN 1087-0156 (https://www.worldcat.org/issn/1087-0156).
18. Ntranos, Vasilis; Kamath, Govinda M.; Zhang, Jesse M.; Pachter, Lior; Tse, David N. (26 May 2016). "Fast and
accurate single-cell RNA-seq analysis by clustering of transcript-compatibility counts" (https://www.ncbi.nlm.nih.g
ov/pmc/articles/PMC4881296). Genome Biology. 17. doi:10.1186/s13059-016-0970-8 (https://doi.org/10.1186%2
Fs13059-016-0970-8). ISSN 1474-7596 (https://www.worldcat.org/issn/1474-7596). PMC 4881296 (https://www.n
cbi.nlm.nih.gov/pmc/articles/PMC4881296).
19. Pierson, Emma; Yau, Christopher (1 January 2015). "ZIFA: Dimensionality reduction for zero-inflated single-cell
gene expression analysis" (https://genomebiology.biomedcentral.com/articles/10.1186/s13059-015-0805-z).
Genome Biology. 16: 241. doi:10.1186/s13059-015-0805-z (https://doi.org/10.1186%2Fs13059-015-0805-z).
ISSN 1474-760X (https://www.worldcat.org/issn/1474-760X).
20. Treutlein, Barbara; Brownfield, Doug G.; Wu, Angela R.; Neff, Norma F.; Mantalas, Gary L.; Espinoza, F. Hernan;
Desai, Tushar J.; Krasnow, Mark A.; Quake, Stephen R. (15 May 2014). "Reconstructing lineage hierarchies of
the distal lung epithelium using single-cell RNA-seq" (http://www.nature.com/nature/journal/v509/n7500/abs/natur
e13173.html). Nature. 509 (7500): 371–375. doi:10.1038/nature13173 (https://doi.org/10.1038%2Fnature13173).
PMC 4145853 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4145853).
21. Korthauer, Keegan D.; Chu, Li-Fang; Newton, Michael A.; Li, Yuan; Thomson, James; Stewart, Ron; Kendziorski,
Christina (1 January 2016). "A statistical approach for identifying differential distributions in single-cell RNA-seq
experiments" (https://genomebiology.biomedcentral.com/articles/10.1186/s13059-016-1077-y). Genome Biology.
17: 222. doi:10.1186/s13059-016-1077-y (https://doi.org/10.1186%2Fs13059-016-1077-y). ISSN 1474-760X (http
s://www.worldcat.org/issn/1474-760X).
22. Haghverdi, Laleh; Büttner, Maren; Wolf, F. Alexander; Buettner, Florian; Theis, Fabian J. (1 October 2016).
"Diffusion pseudotime robustly reconstructs lineage branching" (http://www.nature.com/nmeth/journal/v13/n10/full
/nmeth.3971.html). Nature Methods. 13 (10): 845–848. doi:10.1038/nmeth.3971 (https://doi.org/10.1038%2Fnmet
h.3971). ISSN 1548-7091 (https://www.worldcat.org/issn/1548-7091).
23. Saelens, Wouter; Cannoodt, Robrecht; Todorov, Helena; Saeys, Yvan (2018-03-05). "A comparison of single-cell
trajectory inference methods: towards more accurate and robust tools" (https://www.biorxiv.org/content/early/2018
/03/05/276907). bioRxiv: 276907. doi:10.1101/276907 (https://doi.org/10.1101%2F276907). Retrieved
2018-03-12.
24. Trapnell, Cole; Cacchiarelli, Davide; Grimsby, Jonna; Pokharel, Prapti; Li, Shuqiang; Morse, Michael; Lennon,
Niall J.; Livak, Kenneth J.; Mikkelsen, Tarjei S.; Rinn, John L. (23 March 2017). "Pseudo-temporal ordering of
individual cells reveals dynamics and regulators of cell fate decisions" (https://www.ncbi.nlm.nih.gov/pmc/articles/
PMC4122333). Nature Biotechnology. 32 (4): 381–386. doi:10.1038/nbt.2859 (https://doi.org/10.1038%2Fnbt.285
9). ISSN 1087-0156 (https://www.worldcat.org/issn/1087-0156). PMC 4122333 (https://www.ncbi.nlm.nih.gov/pmc
/articles/PMC4122333).
25. Haghverdi, Laleh; Büttner, Maren; Wolf, F. Alexander; Buettner, Florian; Theis, Fabian J. (1 October 2016).
"Diffusion pseudotime robustly reconstructs lineage branching" (http://www.nature.com/nmeth/journal/v13/n10/full
/nmeth.3971.html). Nature Methods. 13 (10): 845–848. doi:10.1038/nmeth.3971 (https://doi.org/10.1038%2Fnmet
h.3971). ISSN 1548-7091 (https://www.worldcat.org/issn/1548-7091).
26. Wei, J.; Hu, X.; Zou, X.; Tian, T. (1 December 2016). "Inference of genetic regulatory network for stem cell using
single cells expression data" (http://ieeexplore.ieee.org/document/7822521/?reload=true). 2016 IEEE
International Conference on Bioinformatics and Biomedicine (BIBM): 217–222. doi:10.1109/BIBM.2016.7822521 (
https://doi.org/10.1109%2FBIBM.2016.7822521).
27. Moignard, Victoria; Macaulay, Iain C.; Swiers, Gemma; Buettner, Florian; Schütte, Judith; Calero-Nieto, Fernando
J.; Kinston, Sarah; Joshi, Anagha; Hannah, Rebecca; Theis, Fabian J.; Jacobsen, Sten Eirik; de Bruijn, Marella
F.; Göttgens, Berthold (1 April 2013). "Characterization of transcriptional networks in blood stem and progenitor
cells using high-throughput single-cell gene expression analysis" (http://www.nature.com/ncb/journal/v15/n4/abs/
ncb2709.html). Nature Cell Biology. 15 (4): 363–372. doi:10.1038/ncb2709 (https://doi.org/10.1038%2Fncb2709).
ISSN 1465-7392 (https://www.worldcat.org/issn/1465-7392). PMC 3796878 (https://www.ncbi.nlm.nih.gov/pmc/art
icles/PMC3796878).
External links
Dissecting Tumor Heterogeneity with Single-Cell Transcriptomics (https://www.youtube.com/watch?v=2-N3SPPHt
6o)
Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using
this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia
Foundation, Inc., a non-profit organization.