Abstract

A wide array of clouds have been built and adopted by business and research communities. Similarly, many research communities use workflow environments and tools such as Taverna and Galaxy to model the execution of tasks to support and expedite the reenactment of complex processes, and ultimately support the repeatability of science. As a result, a number of systems that integrate clouds with these workflow systems have emerged including CloudMap, CloudMan, Galaxy cloud, BioBlend, etc. Though these systems have proven to be successful in service and data integration, they have not dealt with the data security issue inherent in cloud-based systems with the outsourced models of infrastructure provisioning. For many domains, e.g., health, this poses serious challenges regarding the adoption of cloud infrastructures. Yet such domains also have much to gain from clouds especially given the explosion of genomics data and opportunities for personalized medicine in the big data era. This paper addresses this problem by presenting a trusted storage cloud for scientific workflows, called TruXy. The paper describes the TruXy architecture, the corresponding protocols and illustrates the adoption of TruXy to support collaborative bioinformatics research in endocrine genomics. A range of experiments have been performed to measure the performance of TruXy for processing of exome data sets on individuals with a rare genetic disorders: disorders of sex development (DSD). Our results show that the performance of TruXy is comparable to that of using a standalone workflow tool and that it can handle the big data security challenges required.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call