2023 IEEE International Conference on Robotics and Automation (ICRA) 2023
DOI: 10.1109/icra48891.2023.10160609
|View full text |Cite
|
Sign up to set email alerts
|

Wayformer: Motion Forecasting via Simple & Efficient Attention Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 86 publications
(4 citation statements)
references
References 21 publications
0
4
0
Order By: Relevance
“…Trajectory forecasts must include precisely 16 position samples, each corresponding to the next 8 seconds and sampled at a rate of 2 Hz. Wayformer's attention-based scene encoder/decoder is modest [16]. Nigamaa Nayakanti and all study scene encoder early, late, and hierarchical input fusion [16].…”
Section: Leaderboard Best Solutionsmentioning
confidence: 99%
See 1 more Smart Citation
“…Trajectory forecasts must include precisely 16 position samples, each corresponding to the next 8 seconds and sampled at a rate of 2 Hz. Wayformer's attention-based scene encoder/decoder is modest [16]. Nigamaa Nayakanti and all study scene encoder early, late, and hierarchical input fusion [16].…”
Section: Leaderboard Best Solutionsmentioning
confidence: 99%
“…Wayformer's attention-based scene encoder/decoder is modest [16]. Nigamaa Nayakanti and all study scene encoder early, late, and hierarchical input fusion [16]. Factorized or latent query attention balances efficiency and quality for each fusion type.…”
Section: Leaderboard Best Solutionsmentioning
confidence: 99%
“…Given past observations of objects in a dynamic scene, motion forecasting aims to predict future trajectories of the objects. Current state-of-the-art methods [12,22,25,27,42,56,60] learn the complex and nuanced interactions from data through deep neural networks. Some other methods study joint 3D object detection and motion forecasting [1,4,5,20,23,28,48,61], where detection can be an intermediate task.…”
Section: Related Workmentioning
confidence: 99%
“…In existing works, researchers have primarily focused on the construction of encoders and decoders. For encoders, the emphasis is on how to better aggregate HD map features [5,6]. As for decoders, the focus is on whether to use regression-based methods [2,7] or target-based methods [8,9], and how to construct trajectory optimization modules [10].…”
Section: Introductionmentioning
confidence: 99%