The Expressway (controlled-access highways) of China is the longest in the world and plays an important role in people's daily life. Accurate short-term traffic prediction is essential for travel schedule and active traffic management. There are two coexisting charging systems for expressway in China, Electronic Toll Collection (ETC) and Manual Toll Collection (MTC), which have different passing capacity and variation pattern. In this work, we demonstrate that the exit traffic flow prediction at Shanghai Xinqiao toll station using entry traffic flows from multiple close-related stations with Long Short-Term Memory (LSTM) model. Based on the origin-destination (OD) traffic data of a month, we present a new method to predict the exit station's traffic flow in the future 5 minutes. After deleting abnormal data, we select 12 of the 109 entry toll stations for the experiment. The traffic flow of these 12 entry stations account for 86% of the total exit traffic flow. This method uses the spatial-temporal matrix to deal with different three scenes that are ETC and MTC charging systems individually, the mix of ETC and MTC. We use the LSTM model with various lengths of flow sequence and amounts of hidden layer neurons for three different scenes. Lastly, we validate our model and carefully select the hyperparameters for better prediction accuracy by three evaluation metrics. The experimental results demonstrate that predicting the ETC is the best in the three scenes.
As an essential component of Intelligent Transportation Systems (ITS), short-term traffic flow prediction is a key step to anticipate traffic congestion. Due to the availability of massive traffic data, data-driven methods with a variety of features have been applied widely to improve the traffic flow prediction. China has the longest total length of expressways in the world and there is significant information recorded when vehicles enter and exit the expressway. In this paper, we collect data at an expressway exit station in Shanghai, split the data according to its originating entry stations and predict the corresponding exit station traffic flow using the multi split traffic flows. First, the original records are collected, preprocessed, split, aggregated and normalized. Second, the Long Short-Term Memory (LSTM) model is applied to learn from the features of the overall flow and split traffic flows to predict the overall exit flow. The baselines are models which only overall flow information is considered. Compared with the baselines, in other models, the split flows according entry stations are also considered for prediction. Finally, the LSTM model is made comparison with the Convolutional LSTM(ConvLSTM), the K-Nearest Neighbor (KNN) model and the Support Vector Regression (SVR) model. When the information of overall flow and 6 split traffic flows is used and step is set to 11 (with 5 minute aggregation), the model prediction performs best. Compared with the best result of LSTM baseline model, the improvement of prediction accuracy is up to 5.48 percent by Mean Absolute Error (MAE).
INDEX TERMSTraffic flow prediction, LSTM model, feature selection, split flows.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.