Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Genomic Annotation
Introduction: Sequences and Consequences
Cheryl A. Kerfeld*
Almost a decade ago, Bio2010: Transforming Life Sciences Education for Future Research Biologists [1] called for a reformulation
of undergraduate life sciences majors instruction, citing that the
typical curriculum no longer reflected modern research in biology. It advocated the importance of giving students the experience
of real research to better understand biological principles and
because experiencing the power and beauty of creative inquiry
is the best way to engage students in learning about science.
Likewise, in the last decade, new sequencing technologies
and the development of computational tools to explore genomic
sequence data have transformed life sciences research.
Genomes are being sequenced at an increasingly fast pace
[2,3]. The results of this technological advance are increasingly
informing research. This is reflected in the number of citations
in the PubMed database for articles involving sequence analysis; there were less than 1,200 in 1992 and over 200,000 in
2011 (Fig. 1a). The ability to readily interpret genomic data
required concomitant advances in computational methods for
making sense of raw sequence data. The number of articles
published in computational biology has increased over the last
20 years with the greatest growth seen over the last decade
(Fig. 1b). As shown in Fig. 1c, there were at least 45,295
articles published related to bioinformatics in the last five years
(dark gray portion of pie graph). That is nearly double what
was published in the previous five years (22,721 articles as
shown in the light gray portion of the pie graph). Collectively,
these data and the tools to interpret them have enabled
12
researchers in fields across the continuum of biological organizationfrom molecule to ecosystemto situate their experimental results in a genomic context. In a manner analogous to
the way the inventions of the telescope and the microscope
opened up unseen worlds for scientific exploration, genomics
and bioinformatics has created a new way to view life.
Genomics and bioinformatics have likewise created new
opportunities for the teaching of and learning in the life sciences.
Empowering students to work with real data and state-of-the art
computational tools can catalyze the reforms in undergraduate
life sciences instruction called for by Bio2010 [1] and Vision and
Change [4]. Methodologically, it requires only a computer and an
internet connection to introduce bioinformatics into a course.
Conceptually, bioinformatics starts with the chemical formula for
an organic molecule that can be used to trace connections from
predicted molecular behavior to organismal fitness. Accordingly,
genomics and bioinformatics can be used to illustrate and interconnect concepts across the life sciences curriculum.
At the same time, it can be used to give students an experience of research in the context of their courses. Working
with bioinformatics tools and genomics data can be scaled to
provide research experience to large numbers of students,
because it can be provided to students in parallel. Each student uses the same set of tools but has their own unique
sequence data set. It makes tractable the aim of giving all students a chance to build their understanding of the life sciences
with bioinformatics tools and real data.
To realize the potential for the use of genomics and bioinformatics across the life sciences curriculum, the Genomics
and Bioinformatics Education Program of the Department of
Energy Joint Genome Institute (JGI) created IMG-ACT (Integrated Microbial Genomes Annotation Collaboration Toolkit).
IMG-ACT is a fusion of a flexible rich text editor and web portal
through which students explore microbial genome datasets and
record their findings in an online notebook. IMG-ACT consolidates various databases and tools used in microbial genome
analysis [5]. It is structured as a series of modules which are
essentially guides to different types of bioinformatic analysis.
FIG 1
Kerfeld
13
Biochemistry and
Molecular Biology Education
FIG 2
IMG-ACT serves as a hub to network courses, students, and research experience. [Color figure can be viewed in the online
issue, which is available at wileyonlinelibrary.com.]
14
Acknowledgements
The author would like to thank James Bristow and Edwin Rubin
whose visionary leadership at the JGI enabled the creation of
IMG-ACT. Likewise, Daniel Drell (DOE headquarters) and Jonathan Eisen (UC Davis) provided both inspiration and conceptual
support. IMG-ACT was created by the JGI Informatics Team;
David Hays and Rene Perrier, in particular, played key roles. The
author also thanks the Integrated Microbial Genomes Team,
especially Amy Chen, Konstantinos Mavrommatis and Ernest
Szeto for making possible the seamless integration of IMG EDU
with IMG-ACT. The IMG-ACT system was in large part designed
by faculty advisers who first met at the JGI in June 2007: Zhaohui
Xu (Bowling Green State University), Sharyn Freyermuth (University of Missouri-Columbia), Kelynne Reed (Austin College), Jayna
L. Ditty (The University of St. Thomas), Christopher Kvaal (St.
Cloud State University), Cheryl Bailey (University of Nebraska),
Sabine Heinhorst (University of Southern Mississippi), Kathleen
Scott (University of South Florida), Robert Britton (Michigan State
University), Erin Sanders (University of California, Los Angeles),
Rick Johns (Northern Illinois University), A. Malcolm Campbell
(Davidson College), Brad Goodner (Hiram College), and Stuart
Gordon (Hiram College). The creativity, commitment, and generosity with their time is the key reason for the success of IMG-ACT.
Seth Axen (JGI) and Jordan Moberg-Parker contributed figures
and statistics used in this article. Finally, the author acknowledges Desiree Stanley, Edwin Kim, and especially Seth Axen,
References
[1] The National Research Council. (2003) BIO 2010: Transforming Undergraduate Education for Future Research Biologists, The National Academies Press,
Washington, DC.
[2] Medini, D., Serruto, D., Parkhill, J., Relman, D., Donati, C., Moxon, R., Falkow,
S., and Rappuoli, R. (2008) Microbiology in the post-genomic era. Nat. Rev.
Microbiol. 6, 419430.
Kerfeld
[3] Schuster, S. C. (2008) Next-generation sequencing transforms todays biology. Nat. Method 5, 1618.
[4] AAAS. (2011) Vision and Change in Undergraduate Biology Education: A Call
to Action, AAAS Press, Washington, DC.
[5] Ditty, J. L., Kvaal, C. A., Goodner, B., Freyermuth, S. K., Bailey, C., Britton, R.
A., Gordon, S. G., Heinhorst, S., Reed, K., Xu, Z., Sanders-Lorenz, E. R.,
Axen, S. D., Kim, E., Johns, M., Scott, K. M., and Kerfeld, C. A. (2010) Incorporating genomics and bioinformatics across the life sciences curriculum:
Development and implementation of the integrated microbial genomes
annotation collaboration toolkit. PLoS Biol. 8, e1000448.
[6] Kerfeld, C. A. and Scott, K. M. (2011) Using BLAST to teach E-value-tionary
concepts. PLoS Biol. 9, e100101.
15