EcForest: Extractive document summarization through enhanced sentence embedding and cascade forest

Yang, Kang; He, Hongye; Al-Sabahi, Kamal; Zhang, Zuping

doi:10.1002/cpe.5206

Cited by 10 publications

(5 citation statements)

References 29 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Specifically, the strategy utilizing tf-isf vectors has a word bag that contains all the stemmed words found in the dataset (word stemming using Porter's stemmer 5 ). For the strategy ESE, we pre-trained all different sentence embeddings on Daily Mail dataset (Hermann et al, 2015) by following the guideline of Yang et al (2019) (the dimension of concatenated embedding is 800).…”

Section: Methodsmentioning

confidence: 99%

“…Many prior works have adopted A for the MDS task, such as Yang et al (2018) and Yang et al (2019). Each element in the affinity matrix A is a pairwise affinity of two different sentences.…”

Section: Affinity Matrixmentioning

confidence: 99%

“…More details can be found in Wan et al (2007) and Wang et al (2017). ESE: the enhanced feature embedding model (Yang et al, 2019). The embedding of each sentence is the concatenation of all components: paragraph vector, positional embedding and three feature embeddings (namely word-part-of-speech, bigram and trigram).…”

Section: Affinity Matrixmentioning

confidence: 99%

See 2 more Smart Citations

A Spectral Method for Unsupervised Multi-Document Summarization

Wang¹,

Chang²,

Sui³

2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

View full text Add to dashboard Cite

Multi-document summarization (MDS) aims at producing a good-quality summary for several related documents. In this paper, we propose a spectral-based hypothesis, which states that the goodness of summary candidate is closely linked to its so-called spectral impact.Here spectral impact considers the perturbation to the dominant eigenvalue of affinity matrix when dropping the summary candidate from the document cluster. The hypothesis is validated by three theoretical perspectives: semantic scaling, propagation dynamics and matrix perturbation. According to the hypothesis, we formulate the MDS task as the combinatorial optimization of spectral impact and propose an accelerated greedy solution based on a surrogate of spectral impact. The evaluation results on various datasets demonstrate:(1) The performance of the summary candidate is positively correlated with its spectral impact, which accords with our hypothesis; (2) Our spectral-based method has a competitive result as compared to state-of-the-art MDS systems.

show abstract

Section: Methodsmentioning

confidence: 99%

“…Many prior works have adopted A for the MDS task, such as Yang et al (2018) and Yang et al (2019). Each element in the affinity matrix A is a pairwise affinity of two different sentences.…”

Section: Affinity Matrixmentioning

confidence: 99%

See 1 more Smart Citation

A Spectral Method for Unsupervised Multi-Document Summarization

Wang¹,

Chang²,

Sui³

2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

View full text Add to dashboard Cite

show abstract

“…The results show that the designed approach overcomes the problem of text overload by generating effective summaries. According to K. Yang [9], EcForest is an abstraction summary model with Enhanced Sentence Embedding and Cascade Forest. Sentence representation is very important for many summarization methods.…”

Section: Related Workmentioning

confidence: 99%

“…Bags of words can barely capture semantics, and typical embedding models fail to capture more complex semantic features such as ambiguity and phrase meaning. To this end, we propose an Extended Sentence Embedding (ESE) model [9].…”

mentioning

confidence: 99%

Deep Learning based Text Abstraction

Chougule,

Dudhabale,

Havaldar

2023

IJRASET

View full text Add to dashboard Cite

Text abstraction based on deep learning has proven to be a promising method for the task of extracting large amounts of text while preserving the most important information. This article provides an overview of text abstraction based on deep learning, highlighting various techniques and applications in this field. This article reviews the existing literature on text abstraction based on deep learning, focusing on various methods such as sentence compression, text summarization, and paraphrase, and compares their advantages and disadvantages. The article also describes various deep learning techniques used in the field, including neural networks, recurrent neural networks, and convolution of neural networks. In addition, this article presents studies on the effectiveness of deep learning-based text in a variety of applications, including journalism, finance, health, and education. The article discusses the challenges faced by the field, such as resolving ambiguity and ensuring consistency and readability in produced texts. Finally, this article discusses future directions and potential areas for further research in deep learning-based text abstraction. This questionnaire is useful for researchers and practitioners interested in text abstraction and applications based on deep learning. The article also explores the ethical implications of deep learning-based reading, particularly with regard to issues such as prejudice and privacy. The benefits of this technology must be weighed against the risks, and it is important to ensure that deep learning-based text is created and used responsibly

show abstract