Paper ID: 2302.00706

Deep reinforcement learning for the olfactory search POMDP: a quantitative benchmark

Aurore Loisy, Robin A. Heinonen

The olfactory search POMDP (partially observable Markov decision process) is a sequential decision-making problem designed to mimic the task faced by insects searching for a source of odor in turbulence, and its solutions have applications to sniffer robots. As exact solutions are out of reach, the challenge consists in finding the best possible approximate solutions while keeping the computational cost reasonable. We provide a quantitative benchmarking of a solver based on deep reinforcement learning against traditional POMDP approximate solvers. We show that deep reinforcement learning is a competitive alternative to standard methods, in particular to generate lightweight policies suitable for robots.

Submitted: Feb 1, 2023

Topics

Deep Reinforcement Learning
Sequential Decision Making Problem
Observable Markov Decision Process
Benchmark Score
POMDP Solver

Links

arXiv PDF