Abstract
BackgroundIdentifying group-specific characteristics in metabolic networks can provide better insight into evolutionary developments. Here, we present an approach to classify the three domains of life using topological information about the underlying metabolic networks. These networks have been shown to share domain-independent structural similarities, which pose a special challenge for our endeavour. We quantify specific structural information by using topological network descriptors to classify this set of metabolic networks. Such measures quantify the structural complexity of the underlying networks. In this study, we use such measures to capture domain-specific structural features of the metabolic networks to classify the data set. So far, it has been a challenging undertaking to examine what kind of structural complexity such measures do detect. In this paper, we apply two groups of topological network descriptors to metabolic networks and evaluate their classification performance. Moreover, we combine the two groups to perform a feature selection to estimate the structural features with the highest classification ability in order to optimize the classification performance.ResultsBy combining the two groups, we can identify seven topological network descriptors that show a group-specific characteristic by ANOVA. A multivariate analysis using feature selection and supervised machine learning leads to a reasonable classification performance with a weighted F-score of 83.7% and an accuracy of 83.9%. We further demonstrate that our approach outperforms alternative methods. Also, our results reveal that entropy-based descriptors show the highest classification ability for this set of networks.ConclusionsOur results show that these particular topological network descriptors are able to capture domain-specific structural characteristics for classifying metabolic networks between the three domains of life.
Highlights
Identifying group-specific characteristics in metabolic networks can provide better insight into evolutionary developments
To obtain a better biological interpretation of the results, we plan to analyse the topology of every single metabolic network in detail
There does not exist a study that investigates which biological characteristics are captured by topological network descriptors, but we plan to perform one in future work
Summary
Identifying group-specific characteristics in metabolic networks can provide better insight into evolutionary developments. We present an approach to classify the three domains of life using topological information about the underlying metabolic networks These networks have been shown to share domainindependent structural similarities, which pose a special challenge for our endeavour. We use such measures to capture domain-specific structural features of the metabolic networks to classify the data set. Jeong et al demonstrated that despite their evolutionary distances, the domains of life share significant similarities in the topology of their metabolic networks [2].
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have