Particle MCMC algorithms and architectures for accelerating inference in state-space models

Grigorios Mingas, Leonardo Bottolo, Christos-Savvas Bouganis

doi:10.1016/j.ijar.2016.10.011

Abstract

Particle Markov Chain Monte Carlo (pMCMC) is a stochastic algorithm designed to generate samples from a probability distribution when the density of the distribution does not admit a closed-form expression. pMCMC is most commonly used to sample from the Bayesian posterior distribution in State-Space Models (SSMs), a class of probabilistic models used in numerous scientific applications. Nevertheless, this task becomes prohibitive for complex SSMs with massive data, because pMCMC has a high computational cost and performs poorly when the posterior is multi-modal. This paper addresses both issues by: 1) proposing a novel pMCMC algorithm (denoted ppMCMC), which uses multiple Markov chains (instead of the single chain used by pMCMC) to improve sampling efficiency for multi-modal posteriors; and 2) introducing custom, parallel hardware architectures tailored for pMCMC and ppMCMC. The architectures are implemented on Field Programmable Gate Arrays (FPGAs), a type of hardware accelerator with massive parallelization capabilities. The new algorithm and the two FPGA architectures are evaluated using a large-scale case study from genetics. Results indicate that ppMCMC achieves 1.96x higher sampling efficiency than pMCMC when using sequential CPU implementations. The FPGA architecture of pMCMC is 12.1x and 10.1x faster than state-of-the-art parallel CPU and GPU implementations of pMCMC, respectively, and up to 53x more energy efficient; the FPGA architecture of ppMCMC increases these speedups to 34.9x and 41.8x, respectively, and is 173x more power efficient, bringing previously intractable SSM-based data analyses within reach.
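
As a rough illustration of the idea behind pMCMC, the sketch below runs a particle marginal Metropolis-Hastings sampler on a toy scalar SSM. It is not the paper's implementation: the model in `simulate`, the uniform prior on (-1, 1), the random-walk proposal, and all function names and parameter values are assumptions chosen for brevity. A bootstrap particle filter with P particles replaces the intractable likelihood with an unbiased estimate inside the acceptance ratio.

```python
import math
import random


def simulate(theta, T, rng):
    """Toy SSM (assumed): x_t = theta * x_{t-1} + v_t, y_t = x_t + w_t,
    with independent standard normal noise v_t and w_t."""
    x, ys = 0.0, []
    for _ in range(T):
        x = theta * x + rng.gauss(0.0, 1.0)
        ys.append(x + rng.gauss(0.0, 1.0))
    return ys


def log_likelihood_estimate(theta, ys, P, rng):
    """Bootstrap particle filter: log of an unbiased estimate of p(y_1..T | theta)."""
    particles = [0.0] * P
    log_lik = 0.0
    for y in ys:
        # Propagate every particle through the state transition.
        particles = [theta * x + rng.gauss(0.0, 1.0) for x in particles]
        # Weight by the observation density N(y; x, 1), in log space.
        log_w = [-0.5 * (y - x) ** 2 - 0.5 * math.log(2.0 * math.pi)
                 for x in particles]
        m = max(log_w)
        w = [math.exp(lw - m) for lw in log_w]
        # The average weight estimates p(y_t | y_1..t-1, theta).
        log_lik += m + math.log(sum(w) / P)
        # Multinomial resampling proportional to the weights.
        particles = rng.choices(particles, weights=w, k=P)
    return log_lik


def pmmh(ys, n_iters, P, step, rng):
    """Random-walk Metropolis-Hastings on theta using the estimated likelihood."""
    theta = 0.0
    ll = log_likelihood_estimate(theta, ys, P, rng)
    samples = []
    for _ in range(n_iters):
        prop = theta + rng.gauss(0.0, step)
        if -1.0 < prop < 1.0:  # assumed uniform prior on (-1, 1)
            ll_prop = log_likelihood_estimate(prop, ys, P, rng)
            if rng.random() < math.exp(min(0.0, ll_prop - ll)):
                theta, ll = prop, ll_prop
        samples.append(theta)
    return samples
```

Each iteration costs on the order of T x P operations because the particle filter sweeps the whole data sequence, which is why long sequences make this kind of sampler expensive.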

Highlights

Markov Chain Monte Carlo (MCMC) algorithms are one of the fundamental tools used to sample from complex probability distributions.
For constant P, adding chains improves ESS/sec by up to 3.96x vs. Particle MCMC (pMCMC) (2.8x in MATLAB). These results confirm that combining Population-based Particle MCMC (ppMCMC) with a specialized Field Programmable Gate Array (FPGA) architecture offers large performance gains over existing algorithms and accelerators when the posterior is multi-modal.
This work introduced ppMCMC, an MCMC algorithm that combines pMCMC with population-based MCMC to improve mixing when sampling from multi-modal posteriors of State-Space Models (SSMs).
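
To make the population-based idea concrete, here is a minimal sketch of generic population-based MCMC (parallel tempering) on a toy bimodal density. It assumes an exactly computable target, a fixed temperature ladder, and adjacent-pair swaps; it does not reproduce how ppMCMC combines tempering with particle-filter likelihood estimates, and the names `log_target` and `population_mcmc` are illustrative only.

```python
import math
import random


def log_target(x):
    """Toy bimodal log-density (assumed): equal mixture of N(-4, 0.5^2)
    and N(4, 0.5^2), up to an additive constant."""
    a = -0.5 * ((x + 4.0) / 0.5) ** 2
    b = -0.5 * ((x - 4.0) / 0.5) ** 2
    m = max(a, b)
    return m + math.log(math.exp(a - m) + math.exp(b - m))


def population_mcmc(n_iters, temps, step, rng):
    """Run len(temps) chains; chain k targets target(x) ** temps[k].
    temps[0] should be 1.0 so that chain 0 samples the actual target."""
    xs = [-4.0] * len(temps)
    samples = []
    for _ in range(n_iters):
        # Update: one random-walk Metropolis step per chain.
        for k, beta in enumerate(temps):
            prop = xs[k] + rng.gauss(0.0, step)
            log_a = beta * (log_target(prop) - log_target(xs[k]))
            if rng.random() < math.exp(min(0.0, log_a)):
                xs[k] = prop
        # Exchange: propose swapping the states of a random adjacent pair.
        if len(temps) > 1:
            k = rng.randrange(len(temps) - 1)
            log_a = (temps[k] - temps[k + 1]) * (
                log_target(xs[k + 1]) - log_target(xs[k]))
            if rng.random() < math.exp(min(0.0, log_a)):
                xs[k], xs[k + 1] = xs[k + 1], xs[k]
        samples.append(xs[0])
    return samples
```

With a ladder such as `[1.0, 0.3, 0.1, 0.03]`, the flatter chains cross between modes and pass those states down to chain 0 through swaps, whereas a single chain with `temps=[1.0]` tends to stay in the mode where it started.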

Summary

Introduction

Markov Chain Monte Carlo (MCMC) algorithms are one of the fundamental tools used to sample from complex probability distributions. The work presented here was initially motivated by such a complex problem: SSMs in genetics, where T, which corresponds to DNA bases, can reach millions (see [13] and Section 6). This situation forces practitioners to collect fewer MCMC samples (which leads to increased variance) or to use a simpler model and/or fewer data. The new algorithm and the two FPGA samplers are applied to a large-scale inference problem in genetics: an SSM of DNA methylation with unknown parameters (Section 6). This model can lead to uni-modal or multi-modal posteriors.
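
The link between sample count and variance can be quantified with the effective sample size (ESS): correlated MCMC draws carry less information than the same number of independent draws. The sketch below is one common estimator, offered as an illustration rather than the estimator used in the paper; the function name, the `max_lag` default, and the rule of truncating at the first non-positive autocorrelation are assumptions.

```python
import random


def effective_sample_size(samples, max_lag=200):
    """ESS = n / (1 + 2 * sum of autocorrelations), with the sum truncated
    at the first non-positive sample autocorrelation (a simple heuristic)."""
    n = len(samples)
    mean = sum(samples) / n
    centered = [s - mean for s in samples]
    var = sum(c * c for c in centered) / n
    if var == 0.0:
        raise ValueError("ESS is undefined for a constant chain")
    tau = 1.0  # integrated autocorrelation time
    for lag in range(1, min(max_lag, n - 1) + 1):
        acf = sum(centered[i] * centered[i + lag]
                  for i in range(n - lag)) / (n * var)
        if acf <= 0.0:
            break
        tau += 2.0 * acf
    return n / tau
```

Dividing the ESS by the wall-clock sampling time gives an efficiency figure in effective samples per second, which accounts for both how well the chain mixes and how fast each iteration runs.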

Bayesian inference

State-space models with unknown parameters

Field Programmable Gate Arrays

Related work

Summary

Update and exchange operations

Parallelism in the algorithms

Performance models

Case study

Investigation and results

Resource utilization

Power efficiency

Findings

Conclusions and future work

Journal: International Journal of Approximate Reasoning	Publication Date: Nov 14, 2016
Citations: 21	License type: cc-by
