“…Recent studies developed deep learning models, such as bidirectional transformers(BERT) [10](which are widely using in review tasks [13]), recurrent neural network(RNN), [27] recursive neural networks (RvNN) [39], Long-Short Term Memory (LSTM), generative adversarial network (GAN) [16,56], transformer [7,20] and Convolutional Neural Networks (CNN) [11,17], to learn sequential features from information propagation patterns over time. [1] These models also have widespread applications in other fields, such as data security [21,22], vision learning [18,52], material analysis [15,51] , compiling [33] and hardware designing [34], E-commerce [37], image segmentation [43], traffic controlling [38], communication [28,32] and Aerial Search [30] . These methods, however, only learn the correlations from local neighbors in the structure of information propagation while ignore the global structures of rumor dispersion.…”