2014
DOI: 10.1016/j.eswa.2014.02.017
|View full text |Cite
|
Sign up to set email alerts
|

Efficient classification of multi-labeled text streams by clashing

Abstract: We present a method for the classification of multi-labelled text documents explicitly designed for data stream applications that require to process a virtually infinite sequence of data using constant memory and constant processing time.Our method is composed of an online procedure used to efficiently map text into a low-dimensional feature space and a partition of this space into a set of regions for which the system extracts and keeps statistics used to predict multi-label text annotations. Documents are fe… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 24 publications
(9 citation statements)
references
References 68 publications
0
9
0
Order By: Relevance
“…(16), (17) and (18) In this paper, we randomly select 2000 entries as experimental dataset from a dataset released by Hylanda Information Technology Co. Ltd. in Tianjin of China which is comprised of almost all news report containing keywords of public security events. In the dataset, the public security events are divided into three types: violent terrorist attacks, campus attacks, and explosions.…”
Section: Simulation Results and Analysismentioning
confidence: 99%
See 2 more Smart Citations
“…(16), (17) and (18) In this paper, we randomly select 2000 entries as experimental dataset from a dataset released by Hylanda Information Technology Co. Ltd. in Tianjin of China which is comprised of almost all news report containing keywords of public security events. In the dataset, the public security events are divided into three types: violent terrorist attacks, campus attacks, and explosions.…”
Section: Simulation Results and Analysismentioning
confidence: 99%
“…VSM, firstly put forward by Dr. Salton in 1968, uses vector to represent texts, which has always been the most classic calculation for text representation [17,18]. Its idea originates from the fact that all the texts and queries contain some independent property that is expressed by some characteristic items to reveal their contents and that can be regarded as one dimension of vector space; thus the texts and queries can be expressed as the collection of these attributes ignoring complicated relation among paragraphs, sentences, and words.…”
Section: Text Representation Modelmentioning
confidence: 99%
See 1 more Smart Citation
“…In this case, the multi-label text data is obtained, so analysis becomes more complex. There are various researches where the multi-label text data has been analysed using different techniques, but the class adjustment and verification are not considered (Nanculef et al, 2014;Park and Lee, 2008). The main problem of multi-label text data class verification and adjustment is deciding which class of the data item should be changed and which class should be assigned instead.…”
Section: Introductionmentioning
confidence: 99%
“…In machine learning applications, the traditional single label classification (SLC) problem has been explored substantially. However, more recently, the multi-label classification (MLC) problem has attracted increasing research interest because of its wide range of applications, such as text classification [ 1 , 2 ], social network analysis [ 3 ], gene function classification [ 4 ], and image/video annotation [ 5 ]. With SLC, one instance only belongs to one category, whereas with MLC, it can be allocated to multiple categories simultaneously.…”
Section: Introductionmentioning
confidence: 99%