2020
DOI: 10.1177/0165551520911590
|View full text |Cite
|
Sign up to set email alerts
|

An ensemble clustering approach for topic discovery using implicit text segmentation

Abstract: Text segmentation (TS) is the process of dividing multi-topic text collections into cohesive segments using topic boundaries. Similarly, text clustering has been renowned as a major concern when it comes to multi-topic text collections, as they are distinguished by sub-topic structure and their contents are not associated with each other. Existing clustering approaches follow the TS method which relies on word frequencies and may not be suitable to cluster multi-topic text collections. In this work, we propose… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 34 publications
0
1
0
Order By: Relevance
“…Domain independence and language dependence are the major features of the aggregation-based approach to multi-document text summarization. This paper presents a multi-document text summarization system, which aggregates sentences using a similarity-based transmission algorithm to select multiple subtopics (topics) from a relevant document input set and selects representative sentences from appropriate combinations to form the summary [7]. There are large groups of texts in various fields in digital information systems, and it is always necessary to obtain technology that helps retrieve information as quickly and accurately as possible-for example, working in search engines to retrieve information.…”
Section: Introductionmentioning
confidence: 99%
“…Domain independence and language dependence are the major features of the aggregation-based approach to multi-document text summarization. This paper presents a multi-document text summarization system, which aggregates sentences using a similarity-based transmission algorithm to select multiple subtopics (topics) from a relevant document input set and selects representative sentences from appropriate combinations to form the summary [7]. There are large groups of texts in various fields in digital information systems, and it is always necessary to obtain technology that helps retrieve information as quickly and accurately as possible-for example, working in search engines to retrieve information.…”
Section: Introductionmentioning
confidence: 99%