Abstract

Temporal modeling and analysis and more specifically, temporal ordering are very important problems within the fields of bioinformatics and computational biology, as the temporal analysis of the events characterizing a certain biological process could provide significant insights into its development and progression. Particularly, in the case of cancer, understanding the dynamics and the evolution of this disease could lead to better methods for prediction and treatment. In this paper we tackle, from a computational perspective, the temporal ordering problem, which refers to constructing a sorted collection of multi-dimensional biological data, collection that reflects an accurate temporal evolution of biological systems. We introduce a novel approach, based on reinforcement learning, more precisely, on Q-learning, for the biological temporal ordering problem. The experimental evaluation is performed using several DNA microarray data sets, two of which contain cancer gene expression data. The obtained solutions are correlated either to the given correct ordering (in the cases where this is provided for validation), or to the overall survival time of the patients (in the case of the cancer data sets), thus confirming a good performance of the proposed model and indicating the potential of our proposal.

Highlights

  • The progresses from the last decades in the field of biology have resulted in an exponential increase in the amount of biological information

  • For the computational experiments developed in order to test the performance of our method, we firstly used an artificially generated data set and we continued our experiments on several real data sets, which were chosen for the following reasons: N They are publicly available

  • N They resulted from different types of biological experiments

Read more

Summary

Introduction

The progresses from the last decades in the field of biology have resulted in an exponential increase in the amount of biological information. Depending on the type and purpose of biological experiments, the gathered data may vary from nucleotide or protein sequences, structures or functions, to molecular interactions and metabolic pathways. Analysis of this data reveals important insights into different biological processes and eventually leads to a better understanding of living organisms. The need to extract dynamic information from static data appears and a possible way of achieving this goal would be to infer temporal orderings to this data. Reinforcement learning deals with the problem of how an autonomous agent that perceives and acts in its environment can learn to choose optimal actions to achieve its goals [25]. Agents are acting in behalf of users, are flexible [27], meaning that they are reactive (able to respond to changes that occur in their environment), pro-active (able to exhibit goal directed behavior) and have a social ability (are capable of interacting with other agents)

Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call