Minne Li scite author profile

A fundamental question in any peer-to-peer ridesharing system is how to, both effectively and efficiently, dispatch user's ride requests to the right driver in real time. Traditional rule-based solutions usually work on a simplified problem setting, which requires a sophisticated hand-crafted weight design for either centralized authority control or decentralized multi-agent scheduling systems. Although recent approaches have used reinforcement learning to provide centralized combinatorial optimization algorithms with informative weight values, their single-agent setting can hardly model the complex interactions between drivers and orders. In this paper, we address the order dispatching problem using multi-agent reinforcement learning (MARL), which follows the distributed nature of the peer-to-peer ridesharing problem and possesses the ability to capture the stochastic demand-supply dynamics in largescale ridesharing scenarios. Being more reliable than centralized approaches, our proposed MARL solutions could also support fully distributed execution through recent advances in the Internet of Vehicles (IoV) and the Vehicle-to-Network (V2N). Furthermore, we adopt the mean field approximation to simplify the local interactions by taking an average action among neighborhoods. The mean field approximation is capable of globally capturing dynamic demand-supply variations by propagating many local interactions between agents and the environment. Our extensive experiments have shown the significant improvements of MARL order dispatching algorithms over several strong baselines on the gross merchandise volume (GMV), and order response rate measures. Besides, the simulated experiments with real data have also justified that our solution can alleviate the supply-demand gap during the rush hours, thus possessing the capability of reducing traffic congestion.

show abstract

Mean Field Multi-Agent Reinforcement Learning

Yang¹,

Luo²,

Li³

et al. 2018

Preprint

View full text Add to dashboard Cite

Bi-Level Actor-Critic for Multi-Agent Coordination

Zhang

Chen

Huang

et al. 2020

AAAI

View full text Add to dashboard Cite

Coordination is one of the essential problems in multi-agent systems. Typically multi-agent reinforcement learning (MARL) methods treat agents equally and the goal is to solve the Markov game to an arbitrary Nash equilibrium (NE) when multiple equilibra exist, thus lacking a solution for NE selection. In this paper, we treat agents unequally and consider Stackelberg equilibrium as a potentially better convergence point than Nash equilibrium in terms of Pareto superiority, especially in cooperative environments. Under Markov games, we formally define the bi-level reinforcement learning problem in finding Stackelberg equilibrium. We propose a novel bi-level actor-critic learning method that allows agents to have different knowledge base (thus intelligent), while their actions still can be executed simultaneously and distributedly. The convergence proof is given, while the resulting learning algorithm is tested against the state of the arts. We found that the proposed bi-level actor-critic algorithm successfully converged to the Stackelberg equilibria in matrix games and find a asymmetric solution in a highway merge environment.

show abstract

Evaluation of Volitional Swimming Behavior of Schizothorax prenanti Using an Open-Channel Flume with Spatially Heterogeneous Turbulent Flow

Chen

et al. 2022

Animals

View full text Add to dashboard Cite

Effective fishway design requires knowledge of fish swimming behavior in streams and channels. Appropriate tests with near-natural flow conditions are required to assess the interaction between fish behavior and turbulent flows. In this study, the volitional swimming behavior of S. prenanti was tested and quantified in an open-channel flume with three (low, moderate, and high) flow regimes. The results showed that, when confronted with alternative flow regimes, S. prenanti preferred to select regions with low flow velocities (0.25–0.50 m/s) and turbulent kinetic energy (<0.05 m2/s2) for swimming, while avoiding high-turbulence areas. Moreover, S. prenanti primarily employed steady swimming behavior to search for flow velocities lower than the average current to conserve energy in low- and moderate-flow regimes. It is hypothesized that in regions with higher flow velocities, fish may change their swimming strategy from energy conservation to time conservation. Additionally, the average and maximum burst speeds of S. prenanti were 2.63 ± 0.37 and 3.49 m/s, respectively, which were 2.21- and 2.28-fold higher than the average (1.19 m/s) and maximum (1.53 m/s) burst speeds estimated from the enclosed swim chamber for fish of similar length. This study contributes a novel research approach that provides more reliable information about fish volitional swimming behavior in natural habitats, as well as recommendations for hydraulic criteria for fishways and the identification of barriers to fish migrations.

show abstract

Bi-level Actor-Critic for Multi-agent Coordination

Zhang

Chen

Huang

et al. 2019

Preprint

View full text Add to dashboard Cite

Coordination is one of the essential problems in multiagent systems. Typically multi-agent reinforcement learning (MARL) methods treat agents equally and the goal is to solve the Markov game to an arbitrary Nash equilibrium (NE) when multiple equilibra exist, thus lacking a solution for NE selection. In this paper, we treat agents unequally and consider Stackelberg equilibrium as a potentially better convergence point than Nash equilibrium in terms of Pareto superiority, especially in cooperative environments. Under Markov games, we formally define the bi-level reinforcement learning problem in finding Stackelberg equilibrium. We propose a novel bi-level actor-critic learning method that allows agents to have different knowledge base (thus intelligent), while their actions still can be executed simultaneously and distributedly. The convergence proof is given, while the resulting learning algorithm is tested against the state of the arts. We found that the proposed bi-level actor-critic algorithm successfully converged to the Stackelberg equilibria in matrix games and find a asymmetric solution in a highway merge environment.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Minne Li

Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning

Mean Field Multi-Agent Reinforcement Learning

Bi-Level Actor-Critic for Multi-Agent Coordination

Evaluation of Volitional Swimming Behavior of Schizothorax prenanti Using an Open-Channel Flume with Spatially Heterogeneous Turbulent Flow

Bi-level Actor-Critic for Multi-agent Coordination

Contact Info

Product

Resources

About