Abstract

Biological network alignment aims to discover important similarities and differences and thus find a mapping between topological and/or functional components of different biological molecular networks. Then, the mapped components can be considered to correspond to both their places in the network topology and their biological attributes. Development and evolution of biological network alignment methods has been accelerated by the rapidly increasing availability of such biological networks, yielding a repertoire of tens of methods based upon graph theory. However, most biological processes, especially the metabolic reactions, are more sophisticated than simple pairwise interactions and contain three or more participating components. Such multi-lateral relations are not captured by graphs, and computational methods to overcome this limitation are currently lacking. This paper introduces hypergraphs and association hypergraphs to describe metabolic networks and their potential alignments, respectively. Within this framework, metabolic networks are aligned by identifying the maximal Z-eigenvalue of a symmetric tensor. A shifted higher-order power method was utilized to identify a solution. A rotational strategy has been introduced to accelerate the tensor-vector product by 250-fold on average and reduce the storage cost by up to 1,000-fold. The algorithm was implemented on a spark-based distributed computation cluster to significantly increase the convergence rate further by 50- to 80-fold. The parameters have been explored to understand their impact on alignment accuracy and speed. In particular, the influence of initial value selection on the stationary point has been simulated to ensure an accurate approximation of the global optimum. This framework was demonstrated by alignments among the genome-wide metabolic networks of Escherichia coli MG-1655 and Halophilic archaeon DL31. To our knowledge, this is the first genome-wide metabolic network alignment at both the metabolite level and the enzyme level. These results demonstrate that it can supply quite a few valuable insights into metabolic networks. First, this method can access the driving force of organic reactions through the chemical evolution of metabolic network. Second, this method can incorporate the chemical information of enzymes and structural changes of compounds to offer new way defining reaction class and module, such as those in KEGG. Third, as a vertex-focused treatment, this method can supply novel structural and functional annotation for ill-defined molecules. The related source code is available on request.

Highlights

  • In recent years, whole-genome sequencing has been gradually completed for thousands of organisms, enabling a deeper and broader understanding of the functions represented by gene sequences[1]

  • A metabolic network in Fig. 1A and its original storage format cannot be directly targeted by a simple graph

  • This study describes the background and a methodological framework for using hypergraphs to represent metabolic networks and tensors to align the networks

Read more

Summary

Introduction

Whole-genome sequencing has been gradually completed for thousands of organisms, enabling a deeper and broader understanding of the functions represented by gene sequences[1]. Improvements in determining the interactions among biological molecules, this sequencing effort has produced a huge number of biological networks at different scales for various species. Singh et al used the idea of PageRank as the definition of similarities between vertices from different networks They used a spectral graph method to rapidly identify the highest-ranking match from all possible matches in terms of the total score of all the aligned vertices[17]. Ay et al proposed the SubMAP method, in which pathways are represented as compound-enzyme bipartite graphs and the alignment is converted into a conventional optimization problem[18] Ay and his team again proposed a method that included a compression and decompression process of the pathways followed by the SubMAP. Such multi-lateral relations cannot be captured by a graph

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call