2019
DOI: 10.1002/cpe.5206
|View full text |Cite
|
Sign up to set email alerts
|

EcForest: Extractive document summarization through enhanced sentence embedding and cascade forest

Abstract: We present EcForest, an extractive summarization model through Enhanced Sentence Embedding and Cascade Forest. Sentence representation is of great significance for many summarization methods. Bag-of-words mostly fails to grasp the semantics, and typical embedding models cannot capture more complex semantic features, such as polysemy and the meaning of a phrase, which is usually ignored by simply averaging the word embeddings included in a sentence. To this end, we propose Enhanced Sentence Embedding (ESE) mode… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 29 publications
0
5
0
Order By: Relevance
“…Specifically, the strategy utilizing tf-isf vectors has a word bag that contains all the stemmed words found in the dataset (word stemming using Porter's stemmer 5 ). For the strategy ESE, we pre-trained all different sentence embeddings on Daily Mail dataset (Hermann et al, 2015) by following the guideline of Yang et al (2019) (the dimension of concatenated embedding is 800).…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Specifically, the strategy utilizing tf-isf vectors has a word bag that contains all the stemmed words found in the dataset (word stemming using Porter's stemmer 5 ). For the strategy ESE, we pre-trained all different sentence embeddings on Daily Mail dataset (Hermann et al, 2015) by following the guideline of Yang et al (2019) (the dimension of concatenated embedding is 800).…”
Section: Methodsmentioning
confidence: 99%
“…Many prior works have adopted A for the MDS task, such as Yang et al (2018) and Yang et al (2019). Each element in the affinity matrix A is a pairwise affinity of two different sentences.…”
Section: Affinity Matrixmentioning
confidence: 99%
See 1 more Smart Citation
“…The results show that the designed approach overcomes the problem of text overload by generating effective summaries. According to K. Yang [9], EcForest is an abstraction summary model with Enhanced Sentence Embedding and Cascade Forest. Sentence representation is very important for many summarization methods.…”
Section: Related Workmentioning
confidence: 99%
“…Bags of words can barely capture semantics, and typical embedding models fail to capture more complex semantic features such as ambiguity and phrase meaning. To this end, we propose an Extended Sentence Embedding (ESE) model [9].…”
mentioning
confidence: 99%