2022
DOI: 10.15388/22-infor473

Approach for Multi-Label Text Data Class Verification and Adjustment Based on Self-Organizing Map and Latent Semantic Analysis

Abstract: In this paper, a new approach has been proposed for multi-label text data class verification and adjustment. The approach helps to make semi-automated revisions of class assignments to improve the quality of the data. The data quality significantly influences the accuracy of the created models, for example, in classification tasks. It can also be useful for other data analysis tasks. The proposed approach is based on the combination of the usage of the text similarity measure and two methods: latent semantic a…

Cited by 6 publications (9 citation statements)
References 24 publications
“…In the first step, the analyzed multilabel text data must be adjusted according to the approach proposed by Kurasova and Stefanovič [12]. In this way, the quality of the data could be improved.…”
Section: The Combined Approach For Multilabel Text Data Classification
Citation type: mentioning
confidence: 99%
“…It is hard to decide which two classes represent text the best; it is subjective and usually depends on the opinion of the expert who assigned the class manually. For this reason, in our experimental investigation, we used the original multilabel text data and compared the results with the text data, where the class was adjusted using the approach proposed by Stefanovič and Kurasova [12]. The proposed approach allows automatic multilabel text data class adjustment using latent semantic analysis [23] and self-organizing map [24].…”
Section: The Combined Approach For Multilabel Text Data Classification
Citation type: mentioning
confidence: 99%