Yung-Hsu Yang scite author profile

A reliable and accurate 3D tracking framework is essential for predicting future locations of surrounding objects and planning the observer's actions in numerous applications such as autonomous driving. We propose a framework that can effectively associate moving objects over time and estimate their full 3D bounding box information from a sequence of 2D images captured on a moving platform. The object association leverages quasi-dense similarity learning to identify objects in various poses and viewpoints with appearance cues only. After initial 2D association, we further utilize 3D bounding box depth-ordering heuristics for robust instance association and motion-based 3D trajectory prediction for re-identification of occluded vehicles. Finally, an LSTM-based object velocity learning module aggregates the long-term trajectory information for more accurate motion extrapolation. Experiments on our proposed simulation data and real-world benchmarks, including the KITTI, nuScenes, and Waymo datasets, show that our tracking framework offers robust object association and tracking in urban driving scenarios. On the Waymo Open benchmark, we establish the first camera-only baseline in the 3D tracking and 3D detection challenges. On the nuScenes 3D tracking benchmark, our quasi-dense 3D tracking pipeline achieves nearly five times the tracking accuracy of the best vision-only submission among all published methods. Our code, data and trained models are available at https://github.com/SysCV/qd-3dt.
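The appearance-only association step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation (that is in the linked repository); it swaps in plain cosine similarity with greedy one-to-one matching, and the names `cosine`, `associate`, and the `min_sim` threshold are assumptions made up for this illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def associate(tracks, detections, min_sim=0.5):
    """Greedily match detections to tracks by appearance similarity.

    tracks, detections: dicts mapping an id to an embedding vector.
    min_sim: hypothetical threshold below which no match is accepted.
    Returns {detection_id: track_id}; unmatched detections are omitted.
    """
    # Score every (track, detection) pair, best pairs first.
    pairs = sorted(
        ((cosine(t, d), tid, did)
         for tid, t in tracks.items()
         for did, d in detections.items()),
        key=lambda p: p[0],
        reverse=True,
    )
    matches, used_tracks = {}, set()
    for sim, tid, did in pairs:
        if sim < min_sim:
            break  # all remaining pairs are even less similar
        if did in matches or tid in used_tracks:
            continue  # enforce one-to-one assignment
        matches[did] = tid
        used_tracks.add(tid)
    return matches


# Toy embeddings: two existing tracks and three new detections.
tracks = {"car_1": [1.0, 0.0, 0.1], "car_2": [0.0, 1.0, 0.2]}
detections = {
    "a": [0.1, 0.9, 0.2],
    "b": [0.9, 0.1, 0.0],
    "c": [-1.0, -1.0, 0.0],
}
matches = associate(tracks, detections)
print(matches)
```

In this toy example, detection "c" resembles neither track, so it stays unmatched and would start a new track in a full tracker. A real system would typically add the 3D cues described in the abstract (depth ordering, motion prediction) on top of such an appearance score.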

Dense Prediction with Attentive Feature Aggregation

Yang, Huang, Bulò, et al. 2021. Preprint.

CC-3DT: Panoramic 3D Object Tracking via Cross-Camera Fusion

Fischer, Yang, Kumar, et al. 2022. Preprint.

Dense Prediction with Attentive Feature Aggregation

Yang, Huang, Sun, et al. 2023.

Yung-Hsu Yang

Monocular Quasi-Dense 3D Object Tracking
