2017
DOI: 10.1016/j.eswa.2017.04.054
|View full text |Cite
|
Sign up to set email alerts
|

A topic modeling based approach to novel document automatic summarization

Abstract: Most of existing text automatic summarization algorithms are targeted for multi-documents of relatively short length, thus difficult to be applied immediately to novel documents of structure freedom and long length. In this paper, aiming at novel documents, we propose a topic modeling based approach to extractive automatic summarization, so as to achieve a good balance among compression ratio, summarization quality and machine readability. First, based on topic modeling, we extract the candidate sentences asso… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
18
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 69 publications
(19 citation statements)
references
References 24 publications
1
18
0
Order By: Relevance
“…Most of the current summarization models can be categorized into two main categories, extractive-based and abstractive-based. The extractive-based ones are the most common, in which important words/sentences are extracted from text documents, and then recombined to form a summary [11]. In the following subsections, we introduce some recent related extractive and abstractive works with more attention to the works that are higher relevant to the study of this paper, including LSA based approaches, embedding-based approaches, and deep learning-based ones.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Most of the current summarization models can be categorized into two main categories, extractive-based and abstractive-based. The extractive-based ones are the most common, in which important words/sentences are extracted from text documents, and then recombined to form a summary [11]. In the following subsections, we introduce some recent related extractive and abstractive works with more attention to the works that are higher relevant to the study of this paper, including LSA based approaches, embedding-based approaches, and deep learning-based ones.…”
Section: Related Workmentioning
confidence: 99%
“…, where ( , ) refers to the similarity between word and sentence , calculated using Equation 7, and ( , ) refers to the similarity score of word with respect to the entire document calculated using Equation (11).…”
Section: B) Embedding-based Weighting Scheme (Embawef)mentioning
confidence: 99%
“…The researchers have been striving to utilize any advancement in NLP to create a more efficient summary. Extractive summarization techniques are the most common ones, whose basic idea is to extract important sentences from text documents and then recombine them to form a summary [14]. Recently, several summarization models have been proposed.…”
Section: Related Workmentioning
confidence: 99%
“…In this method, highly connected node represents central sentences and may indicate the main topic of the document (Ferreira et al, 2014b;Oliveira et al, 2016). Wu et al (2017) proposed topic-modelling-based approach for long length single document Novel summarisation. They used external resources like SemCor and synonym thesaurus to solve the issue of semantic confusion to smoothen the summary readability.…”
Section: Related Workmentioning
confidence: 99%