Abstract

Fundamental questions such as what are the genes that are really necessary for the survival of cells have motivated many studies to investigate the essentiality of genes in different species. Initial efforts have attempted to address this problem through exhaustive knockout experiments in simple bacteria. Recently, results obtained in these studies have also been applied to the emerging field of synthetic biology with possible implications in many other fields such as health and energy. Motivated by the evolution of DNA sequencing technology and high-throughput biological data generation, many recent efforts have also been made for building and understanding biological networks. In particular, metabolic networks represent the set of known biochemical reactions within a cell. Essential genes are expected to play a key role in these networks, as they must be involved in vital metabolic pathways. Even though some studies investigated the correlation between essential genes and biological network information, different types of networks and other biological information were usually combined and the effect of each of them in the obtained results was not stressed. This paper describes an attempt to predict essential genes using solely topological features from metabolic networks. The networks were built from a common repository, the KEGG database, ensuring data uniformity. Experimentally, considering different prediction scenarios and reference organisms, the use of topological features from metabolic networks achieved mean AUC of about 70% in the prediction of gene essentiality. This reveals that more factors affect essentiality and should indeed be considered in order to obtain more accurate predictions.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call