Abstract

Systems biology requires the development of algorithms that use omics data to infer interaction networks among biomolecules working within an organism. One major type of evolutionary algorithm, genetic programming (GP), is useful for its high heuristic ability as a search method for obtaining suitable solutions expressed as tree structures. However, because GP determines the values of parameters such as coefficients by random values, it is difficult to apply in the inference of state equations that describe oscillatory biochemical reaction systems with high nonlinearity. Accordingly, in this study, we propose a new GP procedure called “k-step GP” intended for inferring the state equations of oscillatory biochemical reaction systems. The k-step GP procedure consists of two algorithms: 1) Parameter optimization using the modified Powell method—after genetic operations such as crossover and mutation, the values of parameters such as coefficients are optimized by applying the modified Powell method with secondary convergence. 2) GP using divided learning data—to improve the inference efficiency, imposes perturbations through the addition of learning data at various intervals and adaptations to these changes result in state equations with higher fitness. We are confident that k-step GP is an algorithm that is particularly well suited to inferring state equations for oscillatory biochemical reaction systems and contributes to solving inverse problems in systems biology.

Highlights

  • In recent years, the development of experimental technologies has enabled researchers to obtain various types of omics data

  • Because genetic programming (GP) determines the values of parameters such as coefficients by random values, it is difficult to apply in the inference of state equations that describe oscillatory biochemical reaction systems with high nonlinearity

  • The comparison was conducted based on the inference of state equations expressing two- and three-variable oscillatory biochemical reaction systems

Read more

Summary

Introduction

The development of experimental technologies has enabled researchers to obtain various types of omics data. State equations (i.e., both structures and parameter values) of biochemical reaction systems must be inferred from empirical results, a process which can be referred to as an inverse problem. To overcome this inverse problem encountered in systems biology, researchers predict and scrutinize unknown biochemical reaction systems from experimentally observed time-series data under various conditions. They formulate the state equations that are expressed by using simultaneous ordinary differential equations based on the general mass action (GMA) law. Nonterminal symbols need not contain a minus symbol because the coefficient a is contained in all terminal symbols and the optimum value of each coefficient a (including a negative value) is calculated using the modified Powell method [29]

Modified Powell Method
Fitness
Genetic Programming Using Divided Learning Data
Experiments and Results
Two-Variable Oscillatory Biochemical Reaction System
Three-Variable Oscillatory Biochemical Reaction System
Conclusions
Future Works
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call