We adopt a reinforcement learning algorithm to train a self-propelling agent to migrate long distances in a thermal turbulent environment. We choose a Rayleigh-Bénard turbulent convection cell with an aspect ratio Γ (defined as the ratio of cell length to cell height) of 2 as the training environment. Our results show that, compared with a naive agent that moves straight from the origin to the destination, the smart agent learns to exploit the carrier flow currents to save propelling energy. We then apply the optimal policy obtained in the Γ = 2 cell to smart agents migrating in convection cells with Γ up to 32. In a larger-Γ cell, the dominant flow modes of horizontally stacked rolls are less stable, and the energy contained in higher-order flow modes increases. We find that the optimized policy extends successfully to convection cells with larger Γ. Moreover, the ratio of propelling energy consumed by the smart agent to that consumed by the naive agent decreases as Γ increases, indicating that the smart agent saves more propelling energy in a larger-Γ cell. To test the robustness of the learning framework, we also evaluate the optimized policy when agents are released from randomly chosen origins, and we suggest possible solutions to improve the success rate. This work has implications for long-distance migration problems, such as unmanned aerial vehicles patrolling in a turbulent convective environment, where planning energy-efficient trajectories can increase their endurance.
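The training setup can be caricatured with a minimal tabular Q-learning sketch. Everything below is an illustrative assumption rather than the paper's actual scheme: the grid resolution, the reward of -1 per unit of propelling energy per step, the terminal bonus, and the toy two-roll velocity field standing in for the turbulent convection cell are all hypothetical choices made only to show the shape of such a training loop.

```python
import math
import random

random.seed(0)

# Hypothetical coarse grid over a cell of aspect ratio 2 (length 2, height 1).
NX, NY = 20, 10
ACTIONS = [(1, 0), (-1, 0), (0, 1), (0, -1)]  # propel right / left / up / down

def roll_drift(ix, iy):
    """Integer drift from a toy two-counter-rotating-roll flow field
    (an illustrative stand-in for the turbulent carrier flow)."""
    x = (ix + 0.5) / NX * 2.0   # x in [0, 2]
    y = (iy + 0.5) / NY         # y in [0, 1]
    u = math.sin(math.pi * x) * math.cos(math.pi * y)
    v = -math.cos(math.pi * x) * math.sin(math.pi * y)
    return round(u), round(v)

def step(state, action):
    """One step: propulsion plus advection by the carrier flow."""
    ix, iy = state
    du, dv = roll_drift(ix, iy)
    ix = min(max(ix + action[0] + du, 0), NX - 1)
    iy = min(max(iy + action[1] + dv, 0), NY - 1)
    done = ix == NX - 1                 # destination: the right wall
    reward = 100.0 if done else -1.0    # -1 per unit of propelling energy
    return (ix, iy), reward, done

# Tabular Q-learning: the agent learns where drifting with the rolls
# shortens the trip and so lowers total propelling cost.
Q = {(ix, iy): [0.0] * len(ACTIONS) for ix in range(NX) for iy in range(NY)}
alpha, gamma, eps = 0.1, 0.95, 0.1
for _ in range(2000):
    state = (0, NY // 2)                # fixed origin on the left wall
    for _ in range(200):
        a = (random.randrange(len(ACTIONS)) if random.random() < eps
             else max(range(len(ACTIONS)), key=lambda i: Q[state][i]))
        nxt, r, done = step(state, ACTIONS[a])
        Q[state][a] += alpha * (r + gamma * max(Q[nxt]) - Q[state][a])
        state = nxt
        if done:
            break
```

A naive agent in this caricature would always choose the "propel right" action; the learned policy instead detours through regions where the rolls carry it toward the destination, which is the mechanism behind the energy savings reported above.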