Grid Computing Solutions for Distributed Repositories of Protein Folding and Unfolding Simulations

Martin Swain, Vitaliy Ostropytskyy, Rui M M Brito, Werner Dubitzky, Olivier Riche, Frederic Stahl, Cândida G Silva

doi:10.1007/978-3-540-69389-5_10

Abstract

The P-found protein folding and unfolding simulation repository is designed to allow scientists to perform analyses across large, distributed simulation data sets. There are two storage components in P-found: a primary repository of simulation data and a data warehouse. Here we demonstrate how grid technologies can support multiple, distributed P-found installations. In particular, we look at two aspects: first, how grid data management technologies can be used to access the distributed data warehouses; and second, how the grid can be used to transfer analysis programs to the primary repositories. The latter is an important and challenging aspect of P-found because the data volumes involved are too large to be centralised. The grid technologies we are developing with the P-found system will allow new large data sets of protein folding simulations to be accessed and analysed in novel ways, with significant potential for enabling new scientific discoveries.

Keywords: Data Warehouse, Client Application, Grid Technology, Resource Broker, Globus Toolkit
