Recently, Graph Neural Network (GNN)-based vulnerability detection systems have achieved remarkable success. However, the lack of explainability poses a critical challenge to deploying black-box models in security-related domains. For this reason, several approaches have been proposed to explain the decision logic of the detection model by providing a set of crucial statements that positively contribute to its predictions. Unfortunately, due to weakly robust detection models and suboptimal explanation strategies, they risk revealing spurious correlations and suffer from redundancy issues. In this paper, we propose Coca, a general framework aiming to 1) enhance the robustness of existing GNN-based vulnerability detection models to avoid spurious explanations; and 2) provide both concise and effective explanations to reason about the detected vulnerabilities. Coca consists of two core parts, referred to as Trainer and Explainer. The former trains a detection model that is robust to random perturbations based on combinatorial contrastive learning, while the latter builds an explainer that derives, via dual-view causal inference, the crucial code statements most decisive to the detected vulnerability as explanations. We
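As a rough, hedged illustration of the contrastive-training idea behind Trainer (not Coca's actual implementation), the sketch below shows a standard InfoNCE-style loss that pulls together embeddings of two perturbed views of the same code sample while pushing apart embeddings of different samples in a batch; the names `encoder` and `perturb` are hypothetical placeholders for a GNN encoder and a semantics-preserving perturbation.

```python
# Minimal sketch of contrastive training over perturbed code-graph views
# (assumption-laden illustration, not the paper's actual method).
import torch
import torch.nn.functional as F

def info_nce_loss(z1: torch.Tensor, z2: torch.Tensor, temperature: float = 0.5) -> torch.Tensor:
    """z1, z2: (batch, dim) embeddings of two views of the same code samples."""
    z1 = F.normalize(z1, dim=1)
    z2 = F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / temperature                    # (batch, batch) similarity matrix
    labels = torch.arange(z1.size(0), device=z1.device)   # matching views sit on the diagonal
    return F.cross_entropy(logits, labels)

# Hypothetical usage, where `encoder` and `perturb` are assumed components:
#   z1 = encoder(perturb(batch_graphs))
#   z2 = encoder(perturb(batch_graphs))
#   loss = info_nce_loss(z1, z2)
```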