2018
DOI: 10.1007/s10844-018-0541-4
|View full text |Cite
|
Sign up to set email alerts
|

Clustering of semantically enriched short texts

Abstract: The paper is devoted to the issue of clustering small sets of very short texts. Such texts are often incomplete and highly inconclusive, so establishing a notion of proximity between them is a challenging task. In order to cope with polysemy we adapt the SenseSearcher algorithm (SnS), by Kozlowski and Rybinski in Computational Intelligence 33(3): 335-367, 2017b. In addition, we test the possibilities of improving the quality of clustering ultra-short texts by means of enriching them semantically. We present tw… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
17
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(17 citation statements)
references
References 38 publications
0
17
0
Order By: Relevance
“…Recently, some artificial neural network based methods were developed for short text representation learning. Kozlowski and Rybinski [24] have used neural network based distributional semantic model for enriching the semantic meaning of short text for clustering. Similarly, Xu et al [4] have proposed STC 2 , which adopts deep learning techniques for short text representation learning.…”
Section: A Representation-based Methodsmentioning
confidence: 99%
“…Recently, some artificial neural network based methods were developed for short text representation learning. Kozlowski and Rybinski [24] have used neural network based distributional semantic model for enriching the semantic meaning of short text for clustering. Similarly, Xu et al [4] have proposed STC 2 , which adopts deep learning techniques for short text representation learning.…”
Section: A Representation-based Methodsmentioning
confidence: 99%
“…Impressive progress has been made on the problem of text classification, but few studies have tackled sentence classification (Kozlowski and Rybinski 2019;Zelikovitz and Hirsh 2000;Cheng et al 2014;Yin et al 2017;Khoo et al 2006;Kim 2014;Jurafsky and Martin 2019;Aggarwal 2018). Unlike the traditional text classification problem, sentence classification pose two main challenges.…”
Section: Sentence Classification and Short Text Classificationmentioning
confidence: 99%
“…Several techniques have been proposed to tackle the challenges posed by sentence classification, including dimension reduction (Zelikovitz and Hirsh 2000;Sriram et al 2010;Khoo et al 2006;Bollegala et al 2018), topic modeling (Cheng et al 2014;Chen et al 2011;Yang et al 2015), clustering (Kozlowski and Rybinski 2019;Yin et al 2017;Bollegala et al 2018;Dai et al 2013;Kozlowski and Rybinski 2017;Yang et al 2019), and word embedding (Kozlowski and Rybinski 2019;Kim 2014;Lee and Dernoncourt 2016;Hill et al 2016). Kim (Kim 2014) proposed a single layer of CNN applied for sentence classification.…”
Section: Sentence Classification and Short Text Classificationmentioning
confidence: 99%
See 1 more Smart Citation
“…Many researchers have examined the task of mining clinical text for applications in healthcare (Demner-Fushman et al, 2009;Sevenster et al, 2015;C. Friedman, Shagina, Lussier, & Hripcsak, 2004;Byrd et al, 2014;Torii et al, 2015;Khalifa & Meystre, 2015;Kozlowski & Rybinski, 2019;Shen et al, 2018) and have approached it as a basic text classification problem. Two major challenges in clinical 2.1.…”
Section: Introductionmentioning
confidence: 99%