Deep reinforcement learning for the olfactory search POMDP: a quantitative benchmark.

Aurore Loisy,Robin A Heinonen

doi:10.1140/epje/s10189-023-00277-8

Deep reinforcement learning for the olfactory search POMDP: a quantitative benchmark.

Aurore Loisy, Robin A Heinonen

Open Access

https://doi.org/10.1140/epje/s10189-023-00277-8

Copy DOI

Journal: The European physical journal. E, Soft matter	Publication Date: Mar 1, 2023
Citations: 9

Affiliation: Centrale Marseille, Institut de Recherche sur les Phénomènes Hors Équilibre, French National Centre for Scientific Research

#Deep Reinforcement Learning #Partially Observable Markov Decision Process + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

The olfactory search POMDP (partially observable Markov decision process) is a sequential decision-making problem designed to mimic the task faced by insects searching for a source of odor in turbulence, and its solutions have applications to sniffer robots. As exact solutions are out of reach, the challenge consists in finding the best possible approximate solutions while keeping the computational cost reasonable. We provide a quantitative benchmarking of a solver based on deep reinforcement learning against traditional POMDP approximate solvers. We show that deep reinforcement learning is a competitive alternative to standard methods, in particular to generate lightweight policies suitable for robots.

Full Text