Abstract

Genome sequencing instruments can sequence thousands of samples in parallel and produce several terabytes of raw sequencing data in a single day, which makes transporting and storing the resulting big data (petabytes) a challenge. At the same time, genome data analysis requires multiple tools executed in pipelines on large computing clusters. Storing and analyzing genome data therefore remain challenging for much of the biomedical research community. In this paper, we present a hybrid cloud solution for a genomics Next Generation Sequencing (NGS) service that can provide unlimited storage and computing capacity to both internal and external customers. For internal users, the private cloud scales its computing and storage resources by integrating resources from the public cloud. For external customers, the public genomics cloud platform provides storage and distribution of big genome data, as well as an online store of genome data analysis software whose tools can be seamlessly composed into analysis pipelines. We also introduce the ecosystem of the genomics cloud platform, which provides a one-stop service for sequencing, storing, managing, analyzing, and interpreting genome data.
