Abstract

UltraScan Solution Modeler (US-SOMO) processes atomic and lower-resolution bead model representations of biological and other macromolecules to compute various hydrodynamic parameters, such as the sedimentation and diffusion coefficients, relaxation times and intrinsic viscosity, and small angle scattering curves, that contribute to our understanding of molecular structure in solution. Knowledge of biological macromolecules' structure aids researchers in understanding their function as a path to disease prevention and therapeutics for conditions such as cancer, thrombosis, Alzheimer's disease and others. US-SOMO provides a convergence of experimental, computational, and modeling techniques, in which detailed molecular structure and properties are determined from data obtained in a range of experimental techniques that, by themselves, give incomplete information. Our goal in this work is to develop the infrastructure and user interfaces that will enable a wide range of scientists to carry out complicated experimental data analysis techniques on XSEDE. Our user community predominantly consists of biophysics and structural biology researchers. A recent search on PubMed reports 9,205 papers in the decade referencing the techniques we support. We believe our software will provide these researchers a convenient and unique framework to refine structures, thus advancing their research.The computed hydrodynamic parameters and scattering curves are screened against experimental data, effectively pruning potential structures into equivalence classes. Experimental methods may include analytical ultracentrifugation, dynamic light scattering, small angle X-ray and neutron scattering, NMR, fluorescence spectroscopy, and others. One source of macromolecular models is X-ray crystallography. However, the conformation in solution may not match that observed in the crystal form. Using computational techniques, an initial fixed model can be expanded into a search space utilizing high temperature molecular dynamic approaches or stochastic methods such as Brownian dynamics. The number of structures produced can vary greatly, ranging from hundreds to tens of thousands or more. This introduces a number of cyberinfrastructure challenges. Computing hydrodynamic parameters and small angle scattering curves can be computationally intensive for each structure, and therefore cluster compute resources are essential for timely results. Input and output data sizes can vary greatly from less than 1 MB to 2 GB or more. Although the parallelization is trivial, along with data size variability there is a large range of compute sizes, ranging from one to potentially thousands of cores with compute time of minutes to hours.In addition to the distributed computing infrastructure challenges, an important concern was how to allow a user to conveniently submit, monitor and retrieve results from within the C++/Qt GUI application while maintaining a method for authentication, approval and registered publication usage throttling. Middleware supporting these design goals has been integrated into the application with assistance from the Open Gateway Computing Environments (OGCE) collaboration team. The approach was tested on various XSEDE clusters and local compute resources. This paper reviews current US-SOMO functionality and implementation with a focus on the newly deployed cluster integration.

Highlights

  • 1.1 PurposeUnderstanding the functions of individual and functional collections of biological macromolecules is fundamental to the prevention and treatment of diseases

  • A discrete molecular dynamics simulation can be run by loading a structure and selecting to run DMD on the main UltraScan Solution Modeler (US-SOMO) window

  • US-SOMO is a comprehensive package for the calculation of hydrodynamic parameters and small angle scattering curves for biological macromolecules

Read more

Summary

Purpose

Understanding the functions of individual and functional collections of biological macromolecules is fundamental to the prevention and treatment of diseases. A first step to understanding the function of a biomacromolecule is to know its structure Towards this end, various experimental methods provide structural information of varying accuracy and precision. The alternative method of nuclear magnetic resonance (NMR) can instead provide structures in solution, but the data are usually collected at quite high macromolecular concentrations and often in non native conditions, such at low pH. The overarching goal of our software is to provide an extensible general framework for generating collections of candidate structures from an initial structure or structures, modeling candidate structures under various experimental methods and conditions, and subsequently globally fitting and screening candidate structure's models against sets of experimental data. Excepting the "extensible general framework", which is in planning, it is possible to perform the steps of our goal in the current software release for a defined set of experimental methods and parameters

UltraScan and SOMO
Atomic Structures and Bead Models
Discrete Molecular Dynamics
Experimental Background
Analytical Ultracentrifugation
Small Angle Scattering
Hydrodynamic Computations
Small Angle Scattering Computations
Fitting Methods
Batch Module
Cluster Module
Gateway Middleware Services
Software Components: Apache Rave
Gateway Architecture
User Identity
File transfer
Token Validation: The token service facilitates the gateway requirements by
Job Management: The core gateway job requirements include
FUTURE
CONCLUSIONS
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call