Abstract

Model organisms are essential experimental platforms for discovering gene functions, defining protein and genetic networks, uncovering functional consequences of human genome variation, and for modeling human disease. For decades, researchers who use model organisms have relied on Model Organism Databases (MODs) and the Gene Ontology Consortium (GOC) for expertly curated annotations, and for access to integrated genomic and biological information obtained from the scientific literature and public data archives. Through the development and enforcement of data and semantic standards, these genome resources provide rapid access to the collected knowledge of model organisms in human readable and computation-ready formats that would otherwise require countless hours for individual researchers to assemble on their own. Since their inception, the MODs for the predominant biomedical model organisms [Mus sp. (laboratory mouse), Saccharomyces cerevisiae, Drosophila melanogaster, Caenorhabditis elegans, Danio rerio, and Rattus norvegicus] along with the GOC have operated as a network of independent, highly collaborative genome resources. In 2016, these six MODs and the GOC joined forces as the Alliance of Genome Resources (the Alliance). By implementing shared programmatic access methods and data-specific web pages with a unified “look and feel,” the Alliance is tackling barriers that have limited the ability of researchers to easily compare common data types and annotations across model organisms. To adapt to the rapidly changing landscape for evaluating and funding core data resources, the Alliance is building a modern, extensible, and operationally efficient “knowledge commons” for model organisms using shared, modular infrastructure.

Highlights

  • Model organisms are essential experimental platforms for discovering gene functions, defining protein and genetic networks, uncovering functional consequences of human genome variation, and for modeling human disease

  • Acknowledgments of the Model Organism Databases (MODs) and the Gene Ontology Consortium (GOC) in the peer-reviewed scientific literature demonstrate that these resources are widely used to support science funded across all National Institutes of Health (NIH) Institutes and have global impact

  • The gene function (GO) has spurred the development of a number of other ontologies for related biological domains, including Cell Ontology (Diehl et al 2016) and Uberon, the multispecies anatomy ontology (Mungall et al 2012)

Read more

Summary

The Changing Landscape for Sustaining Core Data Resources

Several of the current major MODs existed prior to the genome era (Table 1), a large investment was made in genome knowledgebases by the NIH and, in particular, the NHGRI, starting around the time of the Human Genome Project in the early 1990s. These investments were made in recognition of the importance of model organisms for understanding the biology of the human genome and for advancing the application of genomics to medical practice. The MOD/GOC coalition submitted a formal proposal to fund the initial steps needed

Year founded
The Alliance of Genome Resources
Future Directions for the Alliance and Core Data Resource Sustainability
Literature Cited
The Alliance of Genome Resources Consortium
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call