Sector search strategies for odor trail tracking

Reddy, Gautam; Shraiman, Boris I.; Vergassola, Massimo

doi:10.1101/2021.03.03.433838

Cited by 4 publications

(5 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The agent perceives the exact magnitude of odor at its current location, which is represented in the magnitude of the green color channel in each image observation. This is in contrast to Reddy et al [71], which uses a Poissonbased odor detection model. See Figure 4a for a plot of an example trail, including the trajectory of a successful agent.…”

Section: Trail Trackingmentioning

confidence: 83%

“…To construct naturalistic trail geometries, we use the procedure described in Reddy et al [71]. Trail characteristics are modulated with the following parameters:…”

Section: Trail Trackingmentioning

confidence: 99%

“…Online: α = 0.05, β = 0.Naive: α = 0.16, β = 3.Window: α = 0.26, β = 4.83, k = 10 • Sampling: α = 0.1, k = 3 7. Trail trackingTrails are generated using generalized worm-like chain ensembles, using the procedure described in Reddy et al[71]. The parameters used to sample each trail are…”

mentioning

confidence: 99%

See 2 more Smart Citations

Adaptive algorithms for shaping behavior

Tong,

Iyer,

Murthy

et al. 2023

Preprint

Self Cite

View full text Add to dashboard Cite

Dogs and laboratory mice are commonly trained to perform complex tasks by guiding them through a curriculum of simpler tasks (‘shaping’). What are the principles behind effective shaping strategies? Here, we propose a machine learning framework for shaping animal behavior, where an autonomous teacher agent decides its student’s task based on the student’s transcript of successes and failures on previously assigned tasks. Using autonomous teachers that plan a curriculum in a common sequence learning task, we show that near-optimal shaping algorithms adaptively alternate between simpler and harder tasks to carefully balance reinforcement and extinction. Based on this intuition, we derive an adaptive shaping heuristic with minimal parameters, which we show is near-optimal on the sequence learning task and robustly trains deep reinforcement learning agents on navigation tasks that involve sparse, delayed rewards. Extensions to continuous curricula are explored. Our work provides a starting point towards a general computational framework for shaping animal behavior.

show abstract

Section: Trail Trackingmentioning

confidence: 83%

“…To construct naturalistic trail geometries, we use the procedure described in Reddy et al [71]. Trail characteristics are modulated with the following parameters:…”

Section: Trail Trackingmentioning

confidence: 99%

See 1 more Smart Citation

Adaptive algorithms for shaping behavior

Tong,

Iyer,

Murthy

et al. 2023

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…They trained this model using DRL to solve four tasks and then analyzed the virtual rodent's emergent behavior and neural activity, finding similarities at an abstract level between their agent and observations from rodent studies. Reddy et al [2021] studied the trail tracking strategies of terrestrial animals with one (e.g. one antenna) or two (e.g.…”

Section: Related Workmentioning

confidence: 99%

“…We build on the approach of these recent papers that study artificial agents solving neural inspired tasks, and our work is also distinct in several key ways. First, we simulate a more computationally challenging task than those tackled in Reddy et al [2021] and Rapp and Nawrot [2020], because our odor environment is configurable, dynamic, and stochastic. Second, we have made several simplifications and abstractions that make analysis more tractable, so that we may focus on the general principles behind plume tracking.…”

Section: Related Workmentioning

confidence: 99%

Emergent behavior and neural dynamics in artificial agents tracking turbulent plumes

Singh¹,

Breugel²,

N.³

et al. 2021

Preprint

View full text Add to dashboard Cite

Tracking a turbulent plume to locate its source is a complex control problem because it requires multi-sensory integration and must be robust to intermittent odors, changing wind direction, and variable plume statistics. This task is routinely performed by flying insects, often over long distances, in pursuit of food or mates. Several aspects of this remarkable behavior have been studied in detail in many experimental studies. Here, we take a complementary in silico approach, using artificial agents trained with reinforcement learning to develop an integrated understanding of the behaviors and neural computations that support plume tracking. Specifically, we use deep reinforcement learning (DRL) to train recurrent neural network (RNN) agents to locate the source of simulated turbulent plumes. Interestingly, the agents' emergent behaviors resemble those of flying insects, and the RNNs learn to represent task-relevant variables, such as head direction and time since last odor encounter. Our analyses suggest an intriguing experimentally testable hypothesis for tracking plumes in changing wind direction-that agents follow local plume shape rather than the current wind direction. While reflexive short-memory behaviors are sufficient for tracking plumes in constant wind, longer timescales of memory are essential for tracking plumes that switch direction. At the level of neural dynamics, the RNNs' population activity is low-dimensional and organized into distinct dynamical structures, with some correspondence to behavioral modules. Our in silico approach provides key intuitions for turbulent plume tracking strategies and motivates future targeted experimental and theoretical developments.

show abstract

Emergent behavior and neural dynamics in artificial agents tracking turbulent plumes

Singh¹,

Breugel²,

Rao³

et al. 2022

Preprint

View full text Add to dashboard Cite

Tracking a turbulent plume to locate its source under variable wind and plume statistics is a complex task; flying insects routinely accomplish such tracking, often over long distances, in pursuit of food or mates. Several aspects of this remarkable behavior and its underlying neural circuitry have been studied experimentally. Here, we take a complementary in silico approach to develop an integrated understanding of behavior and neural computations. Specifically, we train artificial recurrent neural network (RNN) agents using deep reinforcement learning (DRL) to locate the source of simulated turbulent plumes. Interestingly, the agents' emergent behaviors resemble those of flying insects, and the RNNs learn to compute task-relevant variables with distinct dynamic structures in population activity. Our analyses put forward a testable behavioral hypothesis for tracking plumes in changing wind direction, and we provide key intuitions for memory requirements and neural dynamics in turbulent plume tracking.

show abstract

Sector search strategies for odor trail tracking

Cited by 4 publications

References 25 publications

Adaptive algorithms for shaping behavior

Adaptive algorithms for shaping behavior

Emergent behavior and neural dynamics in artificial agents tracking turbulent plumes

Emergent behavior and neural dynamics in artificial agents tracking turbulent plumes

Contact Info

Product

Resources

About