The uncertainty of a driver's state, the variability of the traffic environment, and the complexity of road conditions have made driving behavior a critical factor affecting traffic safety. Accurate prediction of driving behavior is therefore crucial for ensuring safe driving. In this research, an efficient framework, the distilled routing transformer (DRTR), is proposed for driving behavior prediction using multi-modal data, i.e., front-view video frames and vehicle signals. First, a cross-modal attention distiller is introduced, which distills the cross-modal attention knowledge of a fusion-encoder transformer to guide the training of the DRTR and learn deep interactions between different modalities.
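As a minimal sketch of this kind of attention distillation (assuming a PyTorch implementation; the function and tensor names below are hypothetical, not the paper's), the objective can be written as a KL divergence between the teacher's and student's cross-modal attention maps:

```python
import torch
import torch.nn.functional as F

def attention_distillation_loss(student_attn: torch.Tensor,
                                teacher_attn: torch.Tensor) -> torch.Tensor:
    """KL divergence between teacher and student cross-modal attention maps.

    Both tensors have shape (batch, heads, query_len, key_len), and each row
    along the last dimension is a softmax-normalized attention distribution.
    """
    # Flatten so every attention row is one categorical distribution.
    s = student_attn.flatten(end_dim=-2)
    t = teacher_attn.flatten(end_dim=-2)
    # KL(teacher || student), averaged over all attention rows.
    return F.kl_div(s.clamp_min(1e-8).log(), t, reduction="batchmean")
```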
Second, since multi-modal learning usually requires information ranging from the macro view to the micro view, a self-attention (SA)-routing module is custom-designed for the SA layers in DRTR to dynamically schedule global and local attention for each input instance.
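One plausible reading of such per-instance scheduling is a learned soft gate over a global and a local attention branch. The sketch below illustrates that idea only; the gating design is an assumption, not the paper's exact formulation:

```python
import torch
import torch.nn as nn

class SARouting(nn.Module):
    """Per-instance soft routing between a global and a local attention branch.

    `global_attn` and `local_attn` are any modules mapping
    (batch, seq_len, dim) -> (batch, seq_len, dim); the router weights their
    outputs from a pooled summary of the input sequence.
    """
    def __init__(self, dim: int, global_attn: nn.Module, local_attn: nn.Module):
        super().__init__()
        self.global_attn = global_attn
        self.local_attn = local_attn
        self.router = nn.Linear(dim, 2)  # one logit per attention branch

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Route from a sequence-level summary so the choice is per instance.
        weights = self.router(x.mean(dim=1)).softmax(dim=-1)  # (batch, 2)
        g = self.global_attn(x)
        l = self.local_attn(x)
        return weights[:, 0, None, None] * g + weights[:, 1, None, None] * l
```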
Finally, a Mogrifier long short-term memory (Mogrifier LSTM) network is employed in DRTR to predict driving behaviors.
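The Mogrifier LSTM lets the input and the previous hidden state gate each other for a few alternating rounds before the standard LSTM update. A compact sketch of those rounds (a generic PyTorch rendering of the published Mogrifier cell, not the paper's own code):

```python
import torch
import torch.nn as nn

class MogrifierLSTMCell(nn.Module):
    """LSTM cell whose input and previous hidden state modulate each other
    for a few alternating 'mogrifier' rounds before the LSTM update."""
    def __init__(self, input_dim: int, hidden_dim: int, rounds: int = 5):
        super().__init__()
        self.rounds = rounds
        self.q = nn.ModuleList(nn.Linear(hidden_dim, input_dim)
                               for _ in range((rounds + 1) // 2))
        self.r = nn.ModuleList(nn.Linear(input_dim, hidden_dim)
                               for _ in range(rounds // 2))
        self.cell = nn.LSTMCell(input_dim, hidden_dim)

    def mogrify(self, x, h):
        for i in range(self.rounds):
            if i % 2 == 0:   # odd rounds (1-based): h rescales x
                x = 2 * torch.sigmoid(self.q[i // 2](h)) * x
            else:            # even rounds: x rescales h
                h = 2 * torch.sigmoid(self.r[i // 2](x)) * h
        return x, h

    def forward(self, x, state):
        h, c = state
        x, h = self.mogrify(x, h)
        return self.cell(x, (h, c))
```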
We applied our approach to real-world data collected by an instrumented vehicle during drives in both urban and freeway environments. The experimental results demonstrate that the DRTR can predict imminent driving behavior effectively while achieving faster inference than other state-of-the-art (SOTA) baselines.