Paradoxes of probabilistic programming: and how to condition on events of measure zero with infinitesimal probabilities

Jules Jacobs

doi:10.1145/3434339

Abstract

Abstract Probabilistic programming languages allow programmers to write down conditional probability distributions that represent statistical and machine learning models as programs that use observe statements. These programs are run by accumulating likelihood at each observe statement, and using the likelihood to steer random choices and weigh results with inference algorithms such as importance sampling or MCMC. We argue that naive likelihood accumulation does not give desirable semantics and leads to paradoxes when an observe statement is used to condition on a measure-zero event, particularly when the observe statement is executed conditionally on random data. We show that the paradoxes disappear if we explicitly model measure-zero events as a limit of positive measure events, and that we can execute these type of probabilistic programs by accumulating infinitesimal probabilities rather than probability densities. Our extension improves probabilistic programming languages as an executable notation for probability distributions by making it more well-behaved and more expressive, by allowing the programmer to be explicit about which limit is intended when conditioning on an event of measure zero.

Highlights

Probabilistic programming languages such as Stan [Carpenter et al 2017], Church [Goodman et al 2008], and Anglican [Wood et al 2014] allow programmers to express probabilistic models in statistics and machine learning in a structured way, and run these models with generic inference algorithms such as importance sampling, Metropolis-Hastings, SMC, HMC
The pragmatist says that probabilistic programs are a convenient way to write down a likelihood function, and the purist says that probabilistic programs are a notation for structured probabilistic models
We identify a problem with existing probabilistic programming languages, in which likelihood accumulation with probability densities can result in three different types of paradoxes when conditioning on a measure-zero event

Summary

INTRODUCTION

Probabilistic programming languages such as Stan [Carpenter et al 2017], Church [Goodman et al 2008], and Anglican [Wood et al 2014] allow programmers to express probabilistic models in statistics and machine learning in a structured way, and run these models with generic inference algorithms such as importance sampling, Metropolis-Hastings, SMC, HMC. We identify a problem with existing probabilistic programming languages, in which likelihood accumulation with probability densities can result in three different types of paradoxes when conditioning on a measure-zero event. We propose a change to probabilistic programming languages to avoid the paradoxes of the continuous measure-zero case, by changing the observe construct to condition on measurezero events E as an explicit limit ε → 0 of Eε (Sections 4 and 5), and – a method for computing the limit by accumulating infinitesimal probabilities instead of probability densities, which we use to implement the adjusted observe construct, – a theorem that shows that infinitesimal probabilities correctly compute the limit of Eε , ensuring that programs that use observe on measure-zero events are paradox free, – a translation from the existing observe construct to our new observe construct, which gives the same output if the original program was non-paradoxical, – language support for parameter transformations, which we use to show that the meaning of programs in our language is stable under parameter transformations, – an implementation of our language as an embedded DSL in Julia [Jacobs 2020] (Section 6)

ON THE EVENT THAT OBSERVE CONDITIONS ON

THREE TYPES OF PARADOXES

Paradox of Type 1

Paradox of Type 2

Paradox of Type 3

AVOIDING EVENTS OF MEASURE ZERO WITH INTERVALS

Conditioning on Measure Zero Events as a Limit of Positive Measure Events

USING INFINITESIMAL NUMBERS TO HANDLE MEASURE-ZERO OBSERVATIONS

Intervals of Infinitesimal Width Make Paradoxes Disappear

Importance Sampling with Infinitesimal Probabilities

The Correspondence Between Observe on Points and Observe on Intervals

Parameter Transformations as a Language Feature

IMPLEMENTATION IN JULIA

Findings

CONCLUSION & FUTURE WORK

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the ACM on Programming Languages	Publication Date: Jan 4, 2021
Citations: 6	License type: cc-by

R Discovery Prime

R Discovery Prime

Paradoxes of probabilistic programming: and how to condition on events of measure zero with infinitesimal probabilities

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proceedings of the ACM on Programming Languages

Lead the way for us

Similar Papers

Logic + probabilistic programming + causal laws.
Vaishak Belle
Royal Society open science | VOL. 10
Vaishak BelleVaishak Belle
01 Sep 2023
Royal Society open science | VOL. 10

The principles and practice of probabilistic programming
Noah D Goodman
-
Noah D GoodmanNoah D Goodman
23 Jan 2013
23 Jan 2013

Comparing Machine Learning Models and Statistical Models for Predicting Heart Failure Events: A Systematic Review and Meta-Analysis.
Zhoujian Sun ... Hanrui Shi
Frontiers in Cardiovascular Medicine | VOL. 9
Zhoujian Sun, et. al.Zhoujian Sun ... Hanrui Shi
06 Apr 2022
Frontiers in Cardiovascular Medicine | VOL. 9

Probabilistic (logic) programming concepts
Luc De Raedt ... Angelika Kimmig
Machine Learning | VOL. 100
Luc De Raedt, et. al.Luc De Raedt ... Angelika Kimmig
08 May 2015
Machine Learning | VOL. 100

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Paradoxes of probabilistic programming: and how to condition on events of measure zero with infinitesimal probabilities

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proceedings of the ACM on Programming Languages