2022
DOI: 10.18293/seke2022-039
|View full text |Cite
|
Sign up to set email alerts
|

Two-Stage AST Encoding for Software Defect Prediction

Abstract: Software defect prediction (SDP) can find potential containing defect modules, which assists software developers in allocating limited test resources more efficiently. Because traditional software features fail to capture the semantics of source code, various studies have turned to extracting deep learning features. Existing related approaches often parse the program source code into Abstract Syntax Trees (ASTs) for further processing. However, most of these approaches ignore AST nodes' hierarchical and positi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 13 publications
0
3
0
Order By: Relevance
“…Similar to previous SDD research [10,12,34], we use four evaluation metrics to examine experimental results: F1-score, Recall, Precision, and MCC. Considering the context of the experimental dataset, we opt for the weighted versions of the F1-score, Recall, and Precision instead of the default binary [35].…”
Section: Evaluation Metricsmentioning
confidence: 99%
See 1 more Smart Citation
“…Similar to previous SDD research [10,12,34], we use four evaluation metrics to examine experimental results: F1-score, Recall, Precision, and MCC. Considering the context of the experimental dataset, we opt for the weighted versions of the F1-score, Recall, and Precision instead of the default binary [35].…”
Section: Evaluation Metricsmentioning
confidence: 99%
“…(1) TSE: This method designs a two-stage SDD using selfattention mechanism and tree-based LSTMs [34]. (2) PHAN: This method employs a positional hierarchical attention network to extract semantic features from programs [36].…”
Section: Approachmentioning
confidence: 99%
“…Firstly, whether AST is flattened as a text sequence or maintains its original structure, they encode the AST as a whole, ignoring the information at a moderate granularity level. Secondly, although some studies in SDP have taken a hierarchical structure into account, such as decomposing code into tokens and lines levels [1], and splitting AST into nodes and subtrees levels [6], there is a DOI reference number: 10.18293/SEKE23-119 lack of encoding at the path granularity in SDP. Furthermore, existing AST presentation by mining paths [7] does not consider the positional information between paths, while the positional difference may indicate the existence of defects.…”
Section: Introductionmentioning
confidence: 99%