“…Modeling spatial interactions can guide the model to avoid pedestrian collisions for more realistic trajectory prediction. Current temporal interaction modeling approaches for capturing historical motion factors of pedestrians typically employ LSTM [14,15,16,17,18,19] or GRU [20,21], Transformers [22,23,24,25], and graph convolution neural networks [26,27,28]. Existing spatial interaction modeling methods capture social interactions through pooling mechanisms [14,15,29], attention mechanisms [18,23,24,30,31], and graph convolutions and their variants [21,26,27,28,32].…”