Recent advances in Vehicle-to-Everything (V2X) communication technology have enabled autonomous vehicles to share sensory information and thereby achieve better perception performance. With the rapid growth of autonomous vehicles and intelligent infrastructure, V2X perception systems will soon be deployed at scale, which raises a safety-critical question: how can we evaluate and improve their performance under challenging traffic scenarios before real-world deployment? Collecting diverse, large-scale real-world test scenes seems to be the most straightforward solution, but it is expensive and time-consuming, and such collections can cover only a limited range of scenarios. To this end, we propose V2XP-ASG, the first open adversarial scene generator that can produce realistic, challenging scenes for modern LiDAR-based multi-agent perception systems. V2XP-ASG learns to construct an adversarial collaboration graph and to simultaneously perturb multiple agents' poses in an adversarial yet plausible manner. Experiments demonstrate that V2XP-ASG can effectively identify challenging scenes for a wide range of V2X perception systems. Moreover, by training on a limited number of the generated challenging scenes, the accuracy of V2X perception systems can be further improved by 12.3% on challenging scenes and 4% on normal scenes.
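To make the pose-perturbation idea above concrete, the following is a minimal illustrative sketch, not the paper's actual algorithm: a black-box search that nudges agents' poses within plausible bounds and keeps whichever configuration most degrades a perception score. The `perception_score` callable, the perturbation bounds, and all function names here are assumptions introduced purely for illustration.

```python
import numpy as np

def perturb_poses(poses, max_shift=1.0, max_yaw=np.deg2rad(10), rng=None):
    """Apply a bounded random perturbation to an array of (x, y, yaw) poses.

    Bounds are hypothetical stand-ins for the plausibility constraints
    described in the abstract.
    """
    rng = rng or np.random.default_rng()
    noise = np.column_stack([
        rng.uniform(-max_shift, max_shift, len(poses)),  # x shift (m)
        rng.uniform(-max_shift, max_shift, len(poses)),  # y shift (m)
        rng.uniform(-max_yaw, max_yaw, len(poses)),      # yaw shift (rad)
    ])
    return poses + noise

def adversarial_search(poses, perception_score, n_iters=50, seed=0):
    """Return the pose configuration with the lowest perception score found.

    `perception_score` is an assumed callable that evaluates the V2X
    perception system on the scene; a lower score means a harder scene.
    """
    rng = np.random.default_rng(seed)
    worst_poses, worst_score = poses, perception_score(poses)
    for _ in range(n_iters):
        candidate = perturb_poses(poses, rng=rng)
        score = perception_score(candidate)
        if score < worst_score:
            worst_poses, worst_score = candidate, score
    return worst_poses, worst_score
```

In this sketch the search is purely random for brevity; the same loop structure could host a more sophisticated black-box optimizer, and a second, analogous search over which agents join the collaboration graph would mirror the graph-construction step mentioned in the abstract.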