Parameter estimation in models of biological oscillators: an automated regularised estimation approach

Jake Alan Pitt,Julio R. Banga

doi:10.1186/s12859-019-2630-y

Jake Alan Pitt, Julio R. Banga

Open Access

https://doi.org/10.1186/s12859-019-2630-y

Copy DOI

Journal: BMC bioinformatics	Publication Date: Feb 15, 2019
Citations: 19	License type: open-access

Affiliation: RWTH Aachen University, Joint Research Centre

Abstract

BackgroundDynamic modelling is a core element in the systems biology approach to understanding complex biosystems. Here, we consider the problem of parameter estimation in models of biological oscillators described by deterministic nonlinear differential equations. These problems can be extremely challenging due to several common pitfalls: (i) a lack of prior knowledge about parameters (i.e. massive search spaces), (ii) convergence to local optima (due to multimodality of the cost function), (iii) overfitting (fitting the noise instead of the signal) and (iv) a lack of identifiability. As a consequence, the use of standard estimation methods (such as gradient-based local ones) will often result in wrong solutions. Overfitting can be particularly problematic, since it produces very good calibrations, giving the impression of an excellent result. However, overfitted models exhibit poor predictive power.Here, we present a novel automated approach to overcome these pitfalls. Its workflow makes use of two sequential optimisation steps incorporating three key algorithms: (1) sampling strategies to systematically tighten the parameter bounds reducing the search space, (2) efficient global optimisation to avoid convergence to local solutions, (3) an advanced regularisation technique to fight overfitting. In addition, this workflow incorporates tests for structural and practical identifiability.ResultsWe successfully evaluate this novel approach considering four difficult case studies regarding the calibration of well-known biological oscillators (Goodwin, FitzHugh–Nagumo, Repressilator and a metabolic oscillator). In contrast, we show how local gradient-based approaches, even if used in multi-start fashion, are unable to avoid the above-mentioned pitfalls.ConclusionsOur approach results in more efficient estimations (thanks to the bounding strategy) which are able to escape convergence to local optima (thanks to the global optimisation approach). Further, the use of regularisation allows us to avoid overfitting, resulting in more generalisable calibrated models (i.e. models with greater predictive power).

Highlights

Dynamic modelling is a core element in the systems biology approach to understanding complex biosystems
In this figure we show the convergence paths followed by a multi-start of a local optimisation method (NL2SOL [109]), illustrating how most of the runs converge to local solutions or saddle points close to the initial point
To illustrate the results obtained during the different steps, we will focus on the FHN problem

Summary

Introduction

Dynamic modelling is a core element in the systems biology approach to understanding complex biosystems. Its workflow makes use of two sequential optimisation steps incorporating three key algorithms: (1) sampling strategies to systematically tighten the parameter bounds reducing the search space, (2) efficient global optimisation to avoid convergence to local solutions, (3) an advanced regularisation technique to fight overfitting. This workflow incorporates tests for structural and practical identifiability. The study of the behaviour of populations of coupled oscillators has greatly benefited from mathematical analysis and computer simulations [40,41,42,43,44,45,46,47].

Methods

Results

Conclusion