Optimal Routing for Autonomous Taxis using Distributed Reinforcement Learning

Rahili, Salar; Rivière, Benjamin; Olivier, S. C. J.; Chung, Soon‐Jo

doi:10.1109/icdmw.2018.00087

Cited by 11 publications

(6 citation statements)

References 23 publications

“…. This is trivially satisfied by the original Q_t update equation (16). Now, let's rewrite (22) for a specific state-action pair (i, π[i]).…”

Section: Distributed Sarsa RL For Non-stationary Environment (mentioning)

confidence: 97%

“…IV-B, we focus on ride sharing and courier taxi service routing. As compared to our preliminary work presented in an eight-page-long workshop article [16], this paper includes many revisions in all the sections, including two additional mathematically-rigorous proofs of convergence, more complete proofs of the theorems, and an appendix detailing some math used in the main proofs.…”

Section: Introduction (mentioning)

confidence: 99%

Distributed Adaptive Reinforcement Learning: A Method for Optimal Routing

Rahili, Rivière, Chung

2020

Preprint

Self Cite

“…Classical scheduling algorithms, such as greedy methods, are widely used in large companies, such as finding the nearest driver to serve customers [22], or using a first-in, first-out queue strategy [23]; although they are easy to dispatch, it only obtained nice profits in the short term; the spatiotemporal sequence does not match the supply-demand relationship in the long-term operation, which will lead to some suboptimal results [15]. Later, this dispatching process improved by using the central system through the taxi GPS trajectory and brute force method for the best path recommendation [24,25], considering whether the driver took the initiative to find hot spots to provide the scheduling strategy [11] and focusing on minimizing total customer waiting time by simultaneously scheduling multiple taxis and allowing taxis to exchange their booking tasks [9], taking into account the overall benefits of a more global and far-sighted approach [26].…”

Section: Related Work (mentioning)

confidence: 99%

An Intelligent Offloading System Based on Multiagent Reinforcement Learning

Chu, Shi

2021

Security and Communication Networks

Intelligent vehicles have provided a variety of services; there is still a great challenge to execute some computing-intensive applications. Edge computing can provide plenty of computing resources for intelligent vehicles, because it offloads complex services from the base station (BS) to the edge computing nodes. Before the selection of the computing node for services, it is necessary to clarify the resource requirement of vehicles, the user mobility, and the situation of the mobile core network; they will affect the users’ quality of experience (QoE). To maximize the QoE, we use multiagent reinforcement learning to build an intelligent offloading system; we divide this goal into two suboptimization problems; they include global node scheduling and independent exploration of agents. We apply the improved Kuhn–Munkres (KM) algorithm to node scheduling and make full use of existing edge computing nodes; meanwhile, we guide intelligent vehicles to the potential areas of idle computing nodes; it can encourage their autonomous exploration. Finally, we make some performance evaluations to illustrate the effectiveness of our constructed system on the simulated dataset.

“…Dynamic or demand-driven ridesharing services are paid more attention in recent studies. The dynamic ridesharing problem can be formulated as the dynamic vehicle routing problem (DVRP) [35] [36], in which the constraint that the order of the passing node in the current taxi schedule must keep intact when the next passenger joins in the taxi trip is required [11][37] [38]. Wang et al [33] proposed a ridesharing strategy that allowed a vehicle to change its route at most once while it was serving a passenger to respond to another ad hoc request.…”

Section: Literature Review (mentioning)

confidence: 99%

Exploring the Ridesharing Efficiency of Taxi Services

Zeng, Sun, et al.

2020

IEEE Access

The application of ridesharing strategy to autonomous taxi system holds great promise for improving the efficiency in the future on-demand ride-hailing services. Prior to the implementation of dispatching strategies to the autonomous taxi system, it is necessary to gain insight into the performance of the dispatching strategies. This study aims to solve the dynamic ridesharing problem and conduct a comprehensive quantity analysis of the ridesharing efficiency for various demand levels and carrying capacities in a metropolitan area. We quantify the success rate of serviced requests, the trip travel time, the discount rate of taxi fare, and total energy consumption for different carrying capacities and demand levels in the road network of Shenzhen city. The simulation results show that the ride-matching success rate within 3 minutes enables to increase by more than 13% in the ridesharing mode, and over 80% of the passengers can be served within 6 minutes if the carrying capacity is set to four. The trip travel time and energy consumption also show a significant downward trend as the capacity of the taxi increases in the ridesharing mode.
