Genomic Science Program. Click to return to home page.
Department of Energy Office of Science. Click to visit main DOE SC site.

Genomes to Life Contractor-Grantee Workshop I
Arlington, Virginia, February 9-12, 2003

Workshop Breakout Session

Data Standards, Data Sharing, Software Openness

Co-chairs -

  • Ed Uberbacher, Oak Ridge National Laboratory
  • Mike Colvin, Lawrence Livermore National Laboratory
  • Arie Shoshani, Lawrence Berkeley National Laboratory

The GTL enterprise will generate staggering amounts of data and with inherent complexity that dwarfs that of the genome era. GTL Life sciences research will continue to be take place in a distributed, heterogeneous, experimental, and computational environment. To solve the next generation of complex life science problems will require a systems approach that integrates data across various scales and provides interoperability among computational methods and data resources. It will be essential for GTL biologists to guide their research process by leveraging knowledge from multiple research groups and specialized facilities, taking advantage of the results, data and models generated by others. The persistent heterogeneity is likely to include data types, metadata, file formats, computing systems, programming languages, and systems biology models and concepts. To allow the GTL biologist to function in such an environment will require innovative improvements in our approach to organizing data and the integration and interoperation of data resources and the tools that utilize them. The session will focus on lessons learned in the genome era, and on the necessary design elements and requirements for developing data resources and software tools that provide integration of multimodal data and information across time and scale. The discussion will include new methods of data specification and syntactic interoperation such as BioPerl, BioXML, data warehousing and methods for query of multiple heterogeneous data sources, Data Grid approaches to accessing data across distributed environments, and Web-based approaches. We will specifically address barriers to facilitating interoperability and integration, the current state of software technologies and methodologies for interacting with data.

Now Featuring

2012 Genomic Science Awardee Meeting X Abstracts


report cover

Applications of New DOE National User Facilities in Biology report


Biosystems Design: Report from the July 2011 Workshop


News

Research

Genomic Science-Related BER Research Highlights

  • Switchgrass Sequencing Provides Insight into Genome Structure and Organization [Apr 12, 2012]
    Perennial switchgrass (Panicum virgatum L.) is capable of producing high biomass yields wi [more...]
  • Understanding How Bacteria Use Sunlight [Apr 05, 2012]
    Cyanobacteria are prime candidates for the biological production of biofuels, especially hydrogen [more...]
  • Using Systems Biology to Understand Complex Microbial Communities [Mar 27, 2012]
    The ability to effectively model and predict integrated functional properties across complex grou [more...]
  • Microbes Stress Out During Conversion of Pretreated Biomass to Biofuels [Mar 02, 2012]
    Chemical pretreatment of plant biomass prior to enzymatic breakdown significantly improves the re [more...]
  • Using High-Performance Computing to Study the Hydration of Cellobiose [Feb 16, 2012]
    Cellobiose, the two glucose basic repeat unit of cellulose, is formed during enzymatic or acidic [more...]
  • More BER Research Highlights »