Authors: Harris A. Lewin, Stephen Richards, Erich D. Jarvis, Katherine Beal, Giulio Formenti, Kerstin Howe, Marcela Uliano-Silva, Olivier Fedrigo, Sadye Paez, Arang Rhie, and 200+ additional contributors
Journal: Proceedings of the National Academy of Sciences
Year: 2022
Citations: 258
DOI: 10.1073/pnas.2115635118
Abstract
The Earth BioGenome Project (EBP) is a moonshot for biology that aims to sequence, catalog, and characterize the genomes of all of Earth’s eukaryotic biodiversity over a period of ten years. The project will sequence approximately 1.5 million species, representing all known families of eukaryotic life on Earth. This ambitious initiative will provide unprecedented insights into the evolution, ecology, and conservation of life on our planet. The EBP will generate reference-quality genome assemblies using state-of-the-art sequencing technologies and computational methods. The project is organized into a global network of affiliated projects and institutions, with standardized protocols and data sharing frameworks. This paper outlines the scientific rationale, technical approaches, organizational structure, and expected outcomes of the EBP, marking the official launch of this transformative global initiative.
Summary
The Earth BioGenome Project represents one of the most ambitious scientific undertakings in modern biology, aiming to sequence, catalog, and characterize the genomes of all known eukaryotic species on Earth over a ten-year period. This moonshot initiative will sequence approximately 1.5 million species, representing all known families of eukaryotic life and providing unprecedented insights into the evolution, ecology, and conservation of biodiversity on our planet. The project’s scope and scale are comparable to other transformative scientific initiatives like the Human Genome Project, but with far broader implications for understanding life’s diversity and complexity.
The technical approach involves generating reference-quality genome assemblies using state-of-the-art sequencing technologies and computational methods, ensuring that the resulting genomic data meets the highest standards for accuracy and completeness. The project is organized as a global network of affiliated projects and institutions, with standardized protocols and comprehensive data sharing frameworks that enable coordinated efforts across multiple countries and research organizations. This collaborative structure ensures that the massive undertaking can be distributed effectively while maintaining consistency and quality across all participating groups.
The scientific rationale for the EBP extends beyond basic research to address critical challenges in conservation biology, agriculture, medicine, and biotechnology. By providing comprehensive genomic information for Earth’s biodiversity, the project will enable better understanding of evolutionary relationships, identification of genes and pathways important for adaptation and survival, and development of new approaches for species conservation and ecosystem management. The project will also support discovery of novel biomolecules and biological processes with potential applications in medicine, agriculture, and biotechnology, making it a foundational resource for both basic and applied research for decades to come.
Main Takeaways
• Unprecedented Biodiversity Genomics: The project aims to sequence approximately 1.5 million species representing all known families of eukaryotic life, creating the most comprehensive genomic catalog of Earth’s biodiversity ever assembled.
• Global Collaborative Framework: The initiative operates as a worldwide network of institutions using standardized protocols and data sharing frameworks, enabling coordinated efforts while maintaining quality and consistency across all participants.
• Reference-Quality Standards: The project employs state-of-the-art sequencing technologies and computational methods to generate high-quality genome assemblies that will serve as foundational resources for decades of future research.
• Conservation Applications: The genomic data will revolutionize conservation efforts by enabling better understanding of species vulnerability, genetic health, and adaptation mechanisms crucial for biodiversity preservation.
• Biotechnology and Medical Discovery: The comprehensive genomic catalog will support discovery of novel biomolecules and biological processes with potential applications in medicine, agriculture, and biotechnology development.
• Evolutionary Understanding: The project will provide unprecedented insights into the evolution, ecology, and relationships among all eukaryotic life forms, fundamentally advancing our understanding of life’s diversity and complexity on Earth.