Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Error-correcting Other encoding schemes, such as Golay codes, may also prove
useful. Briefly, Hamming codes, like all error-correcting codes, use
barcoded primers for the principle of redundancy and add redundant parity bits to
© 2008 Nature Publishing Group http://www.nature.com/naturemethods
1Department of Computer Science, UCB 430 and 2Department of Molecular, Cellular and Developmental Biology, UCB 347, University of Colorado, Boulder, Colorado
80309, USA. 3Department of Pediatrics, University of Colorado Denver and Health Sciences Center, Children’s Hospital, 13123 E. 16th Avenue Box B395, Aurora,
Colorado 80045, USA. 4Department of Chemistry and Biochemistry, UCB 215, University of Colorado, Boulder, Colorado 80309, USA. Correspondence should be
addressed to R.K. (rob.knight@colorado.edu).
RECEIVED 15 OCTOBER 2007; ACCEPTED 14 JANUARY 2008; PUBLISHED ONLINE 10 FEBRUARY 2008; DOI:10.1038/NMETH.1184
end of the sequence rather than both ends as in references 6 and 8. AUTHOR CONTRIBUTIONS
This feature is especially important for 16S rRNA sequencing, M.H. and R.K. designed and implemented the analyses, and wrote the manuscript.
J.J.W., J.K.H. and N.J.G. generated the 454 dataset.
because variation in the length of variable regions in different
species may preclude the second tag from being read. Our strategy, Published online at http://www.nature.com/naturemethods/
including alternative encoding schemes, should be useful for many Reprints and permissions information is available online at
applications: although there is a tradeoff between the number of http://npg.nature.com/reprintsandpermissions
samples per run and the number of sequences per sample, the
1. Margulies, M. et al. Nature 437, 376–380 (2005).
average of B1,500 sequences/sample in this study exceeded the 2. Sogin, M.L. et al. Proc. Natl. Acad. Sci. USA 103, 12115–12120 (2006).
number of sequences collected in all but the largest Sanger 3. Huber, J.A. et al. Science 318, 97–100 (2007).
sequencing studies13. The combination of error-correcting bar- 4. Roesch, L.F.W. et al. ISME J. 1, 283–290 (2007).
© 2008 Nature Publishing Group http://www.nature.com/naturemethods
codes and massively parallel sequencing will rapidly revo- 5. Pace, N.R. Science 276, 734–740 (1997).
6. Binladen, J. et al. PLoS ONE 2, e197 (2007).
lutionize our understanding of microbial habitats located 7. Hoffmann, C. et al. Nucleic Acids Res. 35, e91 (2007).
throughout our biosphere as well as those associated with our 8. Parameswaran, P. et al. Nucleic Acids Res. 35, e130 (2007).
human bodies. 9. Huse, S.M., Huber, J.A., Morrison, H.G., Sogin, M.L. & Welch, D.M. Genome Biol. 8,
R143 (2007).
Note: Supplementary information is available on the Nature Methods website. 10. Morelos-Zaragoza, R.H. The Art of Error-Correcting Coding (John Wiley & Sons,
Hoboken, New Jersey, 2006).
ACKNOWLEDGMENTS 11. Liu, Z., Lozupone, C., Hamady, M., Bushman, F.D. & Knight, R. Nucleic Acids Res.
We thank N. Pace, L. Gold and F. Accurso for support and encouragement, 35, e120 (2007).
J.I. Gordon and R. Bushman for helpful discussions, and R. Ley, C. Lozupone 12. Dojka, M.A., Hugenholtz, P., Haack, S.K. & Pace, N.R. Appl. Environ. Microbiol. 64,
and D. McDonald for feedback on the manuscript. This work was supported in 3869–3877 (1998).
part by the US National Institutes of Health–University of Colorado at Boulder 13. Lozupone, C.A. & Knight, R. Proc. Natl. Acad. Sci. USA 104, 11436–11440
Molecular Biophysics Training Program (T32GM065103), and grants (2007).
from the Cystic Fibrosis Foundation and National Institutes of Health 14. Lozupone, C., Hamady, M. & Knight, R. BMC Bioinformatics 7, 371 (2006).
(U01 HL081335-01, P01DK078669). 15. Lozupone, C. & Knight, R. Appl. Environ. Microbiol. 71, 8228–8235 (2005).