The popularity of smartphones and 4G/5G network have enabled various novel transportation modes based on shared mobility, such as app-based ride-hailing services (e.g., Uber and Lyft) and shared micromobility services (e.g., Veo and Gotcha). However, little is known about to what degree their operations impact urban traffic, which is important for transportation planning and policy making. These companies seldom share their ride data due to business and user privacy reasons, and they are young and still exploring and growing their market to new locations. Recently, transportation engineering researchers began to collect data in large cities trying to understand the transportation impacts of shared mobility services, but (1) there does not exist a general analytics framework applicable to any city, and (2) the studies were based on historical data and cannot project the future easily to catch up with the rapid development of shared mobility services.
In this paper, we introduce a general framework for multi-modal urban traffic analytics. The goal is to build a digital twin of the transportation in a city, i.e., an accurate agent-based transportation simulation model, based on a medium-sized dataset of the interested transportation modes collected by the research group combined with other open data sources such as US Census Bureau. With this digital twin, transportation engineering researchers can flexibly analyze the impact of shared mobility services under different scenarios, such as “if the number of Uber drivers doubles” or “if the number of deployed e-scooters doubles”. The digital twin may also enable new opportunities, such as being an environment for learning policies with reinforcement learning. Our framework consists of three steps: (1) fitting the spatiotemporal distribution of the shared mobility rides on the underlying road network, (2) generating the travel day-plans of the entire urban population based on the learned spatiotemporal ride distribution and user-specified parameters, and (3) configuring an agent-based simulation software such as MATSim to execute the generated realistic day-plans, which provides detailed transportation data for analytics. At the core of our approach is a new spatiotemporal network kernel density estimation (KDE) method that fixes the flaw of prior methods where the contributions of different data samples are not equal. We also propose a crowdsourcing method to collect app-based ride data that is easy to carry out in any city. Using Birmingham, AL as an example, we demonstrate how our framework can be applied to help transportation engineering researchers analyze the impacts of Uber ride-hailing and Veo e-scooter services.