2014 International Conference of Advanced Informatics: Concept, Theory and Application (ICAICTA) 2014
DOI: 10.1109/icaicta.2014.7005910
|View full text |Cite
|
Sign up to set email alerts
|

Automatic multilabel categorization using learning to rank framework for complaint text on Bandung government

Abstract: Learning to rank is a technique in machine learning for ranking problem. This paper aims to investigate this technique to classify the responsible agencies of each complaint text of LAPOR, which is our government complaint management system. Since this categorization problem is multilabel one and the latest work using learning to rank for multilabel classification gave promising result, we work on experiment to compare the typical classification solution with our proposed approaches on this multilabel categori… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
6
0
1

Year Published

2015
2015
2023
2023

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 23 publications
0
6
0
1
Order By: Relevance
“…LTR was proposed in the context of ad hoc information retrieval in which the goal is to create a ranking model that ranks documents with respect to queries . The LTR approach has been used for constructing a ranking model to rank classes with respect to a given document and select the most probable classes for the document as its labels (Yang and Gopal 2012; Ju, Moschitti, and Johansson 2013; Fauzan and Khodra 2014; Azarbonyad and Marx 2019). Yang and Gopal (2012) mapped MLTC to the ad hoc retrieval problem and used LTR for learning a ranking model.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…LTR was proposed in the context of ad hoc information retrieval in which the goal is to create a ranking model that ranks documents with respect to queries . The LTR approach has been used for constructing a ranking model to rank classes with respect to a given document and select the most probable classes for the document as its labels (Yang and Gopal 2012; Ju, Moschitti, and Johansson 2013; Fauzan and Khodra 2014; Azarbonyad and Marx 2019). Yang and Gopal (2012) mapped MLTC to the ad hoc retrieval problem and used LTR for learning a ranking model.…”
Section: Related Workmentioning
confidence: 99%
“…While this method has been shown to have a good performance in MLTC, it does not take the relations between classes into account in the MLTC task. Fauzan and Khodra (2014) used the same framework for classifying documents; however, they focused on text classification, and instead of using meta-level features, they used typical features such as TF-IDF weights of words for learning an LTR model. Their method also outperforms traditional classification-based approaches.…”
Section: Related Workmentioning
confidence: 99%
“…To our knowledge, there are only 2 previous researches [6], [7] which focus on handling imbalanced dataset for multi-label text categorization. This paper uses LAPOR dataset which was used before in previous research [8], but [8] did not handle the issue. On the other hand, we consider the dataset is too small to generalize all possible occurrences because [8] only has 2230 data.…”
Section: Introductionmentioning
confidence: 99%
“…This paper uses LAPOR dataset which was used before in previous research [8], but [8] did not handle the issue. On the other hand, we consider the dataset is too small to generalize all possible occurrences because [8] only has 2230 data. Thus, we acquired 2921 more instances from the institution who collects those data.…”
Section: Introductionmentioning
confidence: 99%
“…Administrator yang terbatas dan angka laporan pengaduan yang cukup tinggi menjadi penyebab utama kurangnya kualitas layanan terutama karakteristik daya tanggap (responsivitas) [2]. Penanganan laporan bergantung pada administrator sistem yang membaca secara manual setiap laporan yang masuk [3]. Hal ini dapat menyebabkan kesalahan dalam menangani keluhan [4], dan jika aliran datanya sangat besar dapat membutuhkan waktu minimal tiga hari, hal ini sensitif terhadap inkonsistensi [3].…”
unclassified