ABCpy: A High-Performance Computing Perspective to Approximate Bayesian Computation

Ritabrata Dutta, Jukka-Pekka Onnela, Antonietta Mira, Marcel Schoengens, Pierre Künzli, Lorenzo Pacchiardi, Avinash Ummadisingu, Nicole Widmer

doi:10.18637/jss.v100.i07

Abstract

ABCpy is a highly modular scientific library for Approximate Bayesian Computation (ABC) written in Python. The main contribution of this paper is to document a software engineering effort that enables domain scientists to easily apply ABC to their research without being ABC experts; using ABCpy they can easily run large parallel simulations without much knowledge about parallelization. Further, ABCpy enables ABC experts to easily develop new inference schemes and evaluate them in a standardized environment and to extend the library with new algorithms. These benefits come mainly from the modularity of ABCpy. We give an overview of the design of ABCpy and provide a performance evaluation concentrating on parallelization. This points us towards the inherent imbalance in some of the ABC algorithms. We develop a dynamic scheduling MPI implementation to mitigate this issue and evaluate the various ABC algorithms according to their adaptability towards high-performance computing.

Highlights

Today, computers are used to simulate different aspects of nature
In Algorithm 1, we provide a description of the Population Monte Carlo ABC (PMCABC) algorithm, which we will use in the following to illustrate the idea of approximate Bayesian computation (ABC) algorithms and their parallelization
We conclude that APMCABC and SABC perform significantly better than PMCABC because they do not suffer from imbalance, and that they are better suited to parallelization with the map-reduce paradigm
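
The imbalance mentioned in the last highlight can be illustrated with a toy simulation. The sketch below is not ABCpy's MPI implementation; it assumes that each task repeats simulator calls until one is accepted, so task costs follow a geometric distribution, and it compares a fixed up-front split of tasks with handing each task to whichever worker becomes free first. The function names, the acceptance probability of 0.05, and the worker count are hypothetical choices made for illustration.

```python
import heapq
import random

def trials_until_accept(p_accept, rng):
    """Number of simulator calls one task needs before a draw is accepted."""
    trials = 1
    while rng.random() >= p_accept:
        trials += 1
    return trials

def static_makespan(costs, n_workers):
    """Split tasks into contiguous chunks up front, one chunk per worker."""
    chunk = -(-len(costs) // n_workers)  # ceiling division
    return max(sum(costs[i:i + chunk]) for i in range(0, len(costs), chunk))

def dynamic_makespan(costs, n_workers):
    """Hand each task to whichever worker becomes free first."""
    loads = [0] * n_workers
    heapq.heapify(loads)
    for cost in costs:
        # The least-loaded worker picks up the next task.
        heapq.heappush(loads, heapq.heappop(loads) + cost)
    return max(loads)

rng = random.Random(1)
costs = [trials_until_accept(0.05, rng) for _ in range(400)]  # assumed acceptance rate
n_workers = 8                                                 # assumed worker count

static = static_makespan(costs, n_workers)
dynamic = dynamic_makespan(costs, n_workers)
ideal = sum(costs) / n_workers  # lower bound: perfectly balanced work

print(f"ideal {ideal:.1f}, static {static}, dynamic {dynamic}")
```

Under dynamic allocation the finishing time can exceed the ideal balanced time by at most the cost of the single most expensive task, whereas a fixed split has no such guarantee when task costs vary widely.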

Summary

Introduction

Computers are used to simulate different aspects of nature. Natural scientists traditionally hypothesize models underlying natural phenomena. Our goal is to overcome the need for users to have knowledge of parallel programming, as is required for using ABC-sysbio, and to make a software package available for scientists across domains. These objectives were partly addressed by parallelization of SMCABC using MPI/OpenMPI (Stram, Marjoram, and Chen 2015), and by making SMCABC available for the astronomical community (Jennings and Madigan 2017). In many real-world problems, the analytic form of the posterior distribution is unknown because the likelihood is not analytically available. This is typical for simulator-based models, for which the likelihood function is often intractable or difficult to compute (as, for instance, in the Lorenz model above or other integrations of stochastic differential equation models). Inference schemes are then adapted following two alternative approaches: (i) measuring the discrepancy between the simulated and observed datasets, and (ii) approximating the likelihood function.
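
Approach (i) can be made concrete with the simplest discrepancy-based scheme, rejection ABC. The following is a minimal sketch using only the Python standard library, not ABCpy's API: the Gaussian simulator, the uniform prior on the interval from -5 to 5, the use of the sample mean as summary statistic, and the threshold `epsilon` are all assumptions chosen for illustration.

```python
import random
import statistics

def simulate(theta, n, rng):
    """Hypothetical simulator: n draws from Normal(theta, 1)."""
    return [rng.gauss(theta, 1.0) for _ in range(n)]

def discrepancy(simulated, observed):
    """Distance between summary statistics (here: the sample means)."""
    return abs(statistics.fmean(simulated) - statistics.fmean(observed))

def rejection_abc(observed, n_samples, epsilon, rng):
    """Keep prior draws whose simulated data lie within epsilon of the observed data."""
    accepted = []
    while len(accepted) < n_samples:
        theta = rng.uniform(-5.0, 5.0)  # assumed prior: Uniform(-5, 5)
        simulated = simulate(theta, len(observed), rng)
        if discrepancy(simulated, observed) < epsilon:
            accepted.append(theta)
    return accepted

rng = random.Random(0)
observed = simulate(2.0, 100, rng)  # synthetic "observed" data, true theta = 2
posterior_sample = rejection_abc(observed, n_samples=200, epsilon=0.2, rng=rng)
posterior_mean = statistics.fmean(posterior_sample)
print(f"approximate posterior mean: {posterior_mean:.2f}")
```

No likelihood is evaluated anywhere: the only model-specific ingredient is the ability to simulate data for a given parameter value. Each proposal is independent of the others, which is what makes schemes of this kind natural candidates for parallel execution.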

Measuring discrepancy

Approximate likelihood

Implemented algorithms

Modular API

API design decisions

Parallelism

Performance evaluation

Dynamic allocation for MPI

Parallelism and ABC algorithms

Innovations of ABCpy compared with similar packages

Learning summary statistics

Probabilistic dependency between random variables

Joint perturbation kernels

Nested parallelization

Convergence diagnostic tools

Discussion

Findings

Details on parameter inference in the Lorenz95 model


Journal: Journal of Statistical Software
Publication Date: Jan 1, 2021
Citations: 7
License type: cc-by


