Markus Ringnér
Principal component analysis is often incorporated into genome-wide expression studies, but what is it and how can it be used to
explore high-dimensional data?
[Figure panels: XBP1 versus GATA3 expression, ER+ and ER– samples marked; projection onto PC1.]
aspects of the samples. This suggests that PCA can serve as a useful first step before clustering or classification of samples. However, deciding how many and which components to use in the subsequent analysis is a major challenge that can be addressed in several ways (ref. 1).
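To make the question of how many components to keep concrete, here is a minimal sketch in Python with NumPy. It assumes a samples-by-genes matrix, computes the principal components through a singular value decomposition of the centered data, and keeps the leading components whose cumulative proportion of variance reaches a chosen threshold. The function names `pca` and `n_components_for`, the 0.9 threshold, and the synthetic data are illustrative assumptions, not taken from the article, and a variance threshold is only one common heuristic among the several ways mentioned above.

```python
import numpy as np


def pca(X):
    """PCA of a samples-by-variables matrix via SVD of the column-centered data.

    Returns the sample projections (scores), the component directions
    (one per row), and the proportion of variance explained by each component.
    """
    Xc = X - X.mean(axis=0)                      # center each variable (gene)
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    scores = U * s                               # projections of samples onto components
    var_ratio = s**2 / np.sum(s**2)              # proportion of variance per component
    return scores, Vt, var_ratio


def n_components_for(var_ratio, threshold=0.9):
    """Smallest number of leading components whose cumulative variance
    proportion reaches `threshold` (hypothetical helper, assumed heuristic)."""
    cumulative = np.cumsum(var_ratio)
    k = int(np.searchsorted(cumulative, threshold)) + 1
    return min(k, len(var_ratio))                # guard against rounding at threshold ~ 1


# Synthetic stand-in for expression data: 40 samples, 6 variables,
# with the first 20 samples shifted in two of the variables.
rng = np.random.default_rng(0)
X = rng.normal(size=(40, 6))
X[:20, :2] += 4.0

scores, components, var_ratio = pca(X)
k = n_components_for(var_ratio, threshold=0.9)
reduced = scores[:, :k]                          # input for clustering or classification
print("proportion of variance (%):", np.round(100 * var_ratio, 1))
print("components kept:", k)
```

The reduced matrix `reduced` would then be passed to whatever clustering or classification method follows. A different selection rule, such as keeping components that correlate with a phenotype, would replace `n_components_for` without changing the rest of the sketch.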
© 2008 Nature Publishing Group http://www.nature.com/naturebiotechnology
[Figure panel; axis label: Proportion of variance (%).]
For example, one can use components that correlate with a phenotype of interest (ref. 9) or use