Clock-state olfactory search in turbulent flows using Q-learning: The geometry of plume recovery

📅 2026-05-15
📈 Citations: 0
Influential: 0
📄 PDF

career value

217K/year
🤖 AI Summary
This work addresses the challenge of efficiently localizing odor sources in turbulent environments by transforming sparse olfactory observations into robust navigation strategies. The authors propose a tabular Q-learning–based reinforcement learning approach that employs an extremely minimal state representation—namely, the time elapsed since the last odor detection—as a clock-like cue. This formulation enables the agent to learn an interpretable policy that effectively integrates hallmark insect-inspired behaviors, including surging, casting, and upwind returns. Evaluated on odor plumes generated via direct numerical simulation of turbulence, the method not only reproduces biologically plausible search patterns but also significantly enhances adaptability to odor intermittency and overall robustness through increased policy flexibility.
📝 Abstract
Finding an odor source in a turbulent flow requires effectively leveraging the history of olfactory observations into a robust navigation strategy. In this work, we use tabular Q-learning to train an olfactory search agent with a minimal memory of past observations: only a running clock since the last whiff. This agent learns an interpretable strategy to recover the plume which combines well-known behaviors observed in insects: surging, casting, and a return downwind. While achieving good performance on data from direct numerical simulations of turbulence, the agent is limited by an inability to adapt its strategy to the local intermittency level; we show that providing more flexibility improves robustness.
Problem

Research questions and friction points this paper is trying to address.

olfactory search
turbulent flows
plume recovery
intermittency
navigation strategy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Q-learning
olfactory search
turbulent flows
plume tracking
clock-state memory
🔎 Similar Papers
M
Marco Rando
Université Côte d'Azur, Inria, CNRS, Laboratoire J.A. Dieudonné, 28 avenue Valrose, 06108, Nice, France
Robin A. Heinonen
Robin A. Heinonen
postdoc at University of Genova
turbulent transportplasma physicsfluid dynamicsmachine learningreinforcement learning
Y
Yujia Qi
Machine Learning Genoa Center & Department of Civil, Chemical and Environmental Engineering, University of Genova, Via Montallegro 1, 16145 Genoa, Italy, now at University of California, Dept Mechanical Engineering, Engineering II, Santa Barbara, CA 93106-5070, USA
A
Agnese Seminara
Machine Learning Genoa Center & Department of Civil, Chemical and Environmental Engineering, University of Genova, Via Montallegro 1, 16145 Genoa, Italy