Novel Three-Stage Framework for Prioritizing and Selecting Feature  Variables for Short-Term Metro  Passenger Flow Prediction

Zhao, Yangyang; Ren, Lu; Ma, Zhenliang; Jiang, Xinguo

doi:10.1177/0361198120926504

Cited by 8 publications

(4 citation statements)

References 33 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Zhang et al combined with Residual Network (ResNet), Graphic Convolutional Network (GCN), and Long and Short-Term Memory (LSTM) put forward a deep learning architecture [14]. Zhao et al proposed a new three-stage framework based on a hierarchical clustering algorithm (AHC) and tree-based models to select the appropriate feature variables [15]. They proposed a hybrid spatial and temporal deep learning neural network (HSTDL-NET) [16].…”

Section: Literature Reviewmentioning

confidence: 99%

Passenger Flow Prediction Using Smart Card Data from Connected Bus System Based on Interpretable XGBoost

Zou

Lin

et al. 2022

Wireless Communications and Mobile Computing

View full text Add to dashboard Cite

Bus passenger flow prediction is a critical component of advanced transportation information system for public traffic management, control, and dispatch. With the development of artificial intelligence, many previous studies attempted to apply machine learning models to extract comprehensive correlations from transit networks to improve passenger flow prediction accuracy, given that the variety and volume of traffic data have been easily obtained. The passenger flow on a station is highly affected by various factors such as the previous time step, peak hours or nonpeak hours, and extracting the key features from the data is essential for a passenger flow prediction model. Although the neural networks, k -nearest neighbor, and some deep learning models have been adopted to mine the temporal correlations of the passenger flow data, the lack of interpretability of the influenced variables is still a big problem. Classical tree-based models can mine the correlations between variables and rank the importance of each variable. In this study, we presented a method to extract passenger flow of different routes on the station and implemented a XGBoost model to find the contributions of variables to the prediction of passenger flow. Comparing to benchmark models, the proposed model can reach state-of-the-art prediction accuracy and computational efficiency on the real-world dataset. Moreover, the XGBoost model can interpret the predicted results. It can be seen that period is the most important variable for the passenger flow prediction, and so the management of buses during peak hours should be improved.

show abstract

Section: Literature Reviewmentioning

confidence: 99%

Passenger Flow Prediction Using Smart Card Data from Connected Bus System Based on Interpretable XGBoost

Zou

Lin

et al. 2022

Wireless Communications and Mobile Computing

View full text Add to dashboard Cite

show abstract

“…Table 1 summarizes the relevant literature for passenger flow prediction in metro systems. Most models are proposed under normal scenarios, such as empirical mode decomposition (EMD) with backpropagation neural network (BPNN) ( 13 ), autoregressive integrated moving average model ARIMA ( 14 ), linear regression (LR) ( 15 ), Bayesian model ( 16 ), ARIMA and generalized autoregressive conditional heteroskedasticity ( 9 ), random forest (RF) ( 17 ), gradient boosting decision trees (GBDT) ( 18 ), state-space models ( 19 ), a gravity model with deep learning (DL) ( 20 ), a model based on BiLSTM-CNN (bidirectional long short-term memory neural networks and convolutional neural networks) ( 21 ), a Seq2Seq (sequence-to-sequence) model with attention mechanism ( 22 ), and DL-based models ( 23 – 26 ).…”

Section: Literature Reviewmentioning

confidence: 99%

“…The naïve Bayes model (i.e., the transition mechanism) is developed to infer the prediction scenario and selects the sub-predictor from GBDT or DL accordingly. The GBDT is used for prediction under normal conditions given its capabilities in handling dense numerical features ( 18 , 36 , 37 ). A DL-based model ( 23 ) is adjusted for prediction under planned events.…”

Section: Literature Reviewmentioning

confidence: 99%

Naïve Bayes-Based Transition Model for Short-Term Metro Passenger Flow Prediction under Planned Events

Zhao

2022

Transportation Research Record: Journal of the Transportation R

Self Cite

View full text Add to dashboard Cite

Short-term passenger flow prediction under planned events is important to reduce passenger delay and ensure operational safety in metro systems. However, most studies make predictions under normal conditions. The study proposes a naïve Bayes transition model for short-term passenger flow prediction under planned events. The target prediction scenario identification is modeled as a binary classification problem using naïve Bayes. The sub-models are developed using gradient boosting decision tree (GBDT) and deep learning (DL) models for normal and planned event scenarios with predictor variables tailored to different passenger demand patterns. The sub-predictor from GBDT or DL is selected based on the inferred prediction scenario. The case study uses automatic fare collection (AFC) data of Shanghai and Hong Kong metro systems. The results show that the proposed model outperforms other representative individual and fusion models. The results also highlight the effectiveness of the predictive transition mechanism between the normal and planned events and also the event information representation.

show abstract

“…Many approaches have been proposed for prediction under typical situations, including ARIMA (16,17), fuzzy logic (18), Kalman filtering (19), support vector machines (5), back-propagation neural networks (20), tree-based models-for example, gradient-boosting decision trees (GBDTs) (21)(22)(23) and random forest (RF) (23,24)-and deep learning models (2)(3)(4). They have different capabilities in capturing complex and nonlinear relationships between inputs and outputs.…”

mentioning

confidence: 99%

Short-Term Metro Ridership Prediction During Unplanned Events

Zhao

Jiang

et al. 2021

Transportation Research Record: Journal of the Transportation R

Self Cite

View full text Add to dashboard Cite

Unplanned events present significant challenges for operations and management in metro systems. Short-term ridership prediction can help agencies to better design contingency strategies under unplanned events. Though many short-term prediction methods have been proposed in the literature, most studies focused on typical situations or planned events. The study develops methods for the short-term metro ridership prediction under unplanned events. It explores event impact representation mechanisms and deals with the imbalanced data training problem in building the prediction model under unplanned events. Typical machine learning and deep learning methods are developed for exploration. A large-scale automatic fare collection (AFC) dataset and event record data for a heavily used metro system are used for empirical studies. The analysis found that the same type of unplanned event shares a similar and consistent demand change pattern (with respect to the demand under typical situations) at the station level. The synthetic minority oversampling technique (SMOTE) can enrich the ridership observations under unplanned events and generate a balanced dataset for model training. Given the occurrence of unplanned events, the results show that a combination of demand change ratio and the SMOTE oversampling technique enables the prediction models to learn the impact of unplanned events and improve the prediction accuracy under unplanned events. However, the oversampling methods (i.e., SMOTE and replication) slightly deteriorate the prediction accuracy for ridership under normal conditions. The findings provide insights into mechanisms for disruption impact representation and oversampling imbalanced data in model training, and guide the development of models for short-term prediction under unplanned events.

show abstract

Novel Three-Stage Framework for Prioritizing and Selecting Feature Variables for Short-Term Metro Passenger Flow Prediction

Cited by 8 publications

References 33 publications

Passenger Flow Prediction Using Smart Card Data from Connected Bus System Based on Interpretable XGBoost

Passenger Flow Prediction Using Smart Card Data from Connected Bus System Based on Interpretable XGBoost

Naïve Bayes-Based Transition Model for Short-Term Metro Passenger Flow Prediction under Planned Events

Short-Term Metro Ridership Prediction During Unplanned Events

Contact Info

Product

Resources

About