2020
DOI: 10.4314/jcsia.v26i1.8
|View full text |Cite
|
Sign up to set email alerts
|

Impact of feature selection on classification via clustering techniques in software defect prediction

Abstract: Software testing using software defect prediction aims to detect as many defects as possible in software before the software release. This plays an important role in ensuring quality and reliability. Software defect prediction can be modeled as a classification problem that classifies software modules into two classes: defective and non-defective; and classification algorithms are used for this process. This study investigated the impact of feature selection methods on classification via clustering techniques … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 0 publications
0
5
0
Order By: Relevance
“…Exploratory analysis and dimensionality reduction are two typical unsupervised ML applications. Unsupervised ML approaches may be utilized to acquire first insights into data in situations when a human examination is difficult [ 131 ]. The findings may be used to test various theories.…”
Section: Artificial Intelligencementioning
confidence: 99%
“…Exploratory analysis and dimensionality reduction are two typical unsupervised ML applications. Unsupervised ML approaches may be utilized to acquire first insights into data in situations when a human examination is difficult [ 131 ]. The findings may be used to test various theories.…”
Section: Artificial Intelligencementioning
confidence: 99%
“…In addition, Marjuni, Adji [46] developed an unsupervised ML technique named signed Laplacianbased spectral classifier for SDP. The unsupervised ML technique is applied when working on unlabelled datasets, unlike supervised ML [14]. However, the performance of an ML technique depends largely on the quality of the datasets used for training such an ML technique [47][48][49].…”
Section: Related Workmentioning
confidence: 99%
“…In particular, SDP deploys ML methods on software features that are defined by software metrics to contain defects in software modules or components [7][8][9]. Several studies have proposed and implemented both supervised and unsupervised forms of ML methods for SDP [10][11][12][13][14][15]. Nevertheless, the predictive performance of SDP models is flatly dependent on the quality and inherent nature of the software datasets used for developing such SDP models.…”
Section: Introductionmentioning
confidence: 99%
“…Specifically, SDP is the application of ML techniques on software defect datasets which are characterized by software metrics (as features) to ascertain defects in software modules or components [8][9][10]. From studies, both supervised and unsupervised types of ML techniques have been proposed and implemented for SDP [11][12][13][14][15][16]. However, the prediction performance of SDP models categorically depends on the nature (quality) and characteristics of software datasets used in developing SDP models.…”
Section: Introductionmentioning
confidence: 99%