Information ordering is a nontrivial task in multi-document summarization (MDS), which typically relies on the traditional vector space model (VSM), a model known for its semantic deficiency. In this article, we propose a novel event-enriched VSM that alleviates the problem by building event semantics into sentence representations. Using event information to mediate between sentence and term, especially in the news domain, has intuitive appeal as well as a technical advantage in common sentence-level operations such as sentence similarity computation. Inspired by the block-style writing of human authors, we base the sentence ordering algorithm on sentence clustering. To accommodate the complexity introduced by event information, we adopt a soft-to-hard clustering strategy: expectation-maximization clustering on the event level, followed by K-means on the sentence level. For cluster-based sentence ordering, the event-enriched VSM enables us to design an ordering algorithm that enhances event coherence, computed between sentence and sentence-context pairs. Drawing on the findings of earlier research, we also incorporate topic continuity measures and time information into the scheme. We evaluate the model and its variants both automatically and manually; the experimental results show a clear advantage of the event-based model over baseline and non-event-based models in information ordering for multi-document news summarization. We believe that the event-enriched VSM has even greater potential in summarization and beyond, which awaits further research.
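To illustrate why event information can help sentence similarity computation, the following is a minimal sketch and not the model proposed in the article. It assumes a hypothetical representation in which a sentence's bag-of-terms vector is simply extended with event features; the names `enriched_vector`, `event_weight`, and the event label `military_advance` are illustrative assumptions, as are the toy sentences.

```python
import math
from collections import Counter


def enriched_vector(terms, events, event_weight=1.0):
    """Bag-of-terms vector extended with event features.

    Hypothetical scheme for illustration only: term and event
    dimensions are kept apart by tagging each key with its kind.
    """
    vec = Counter({("term", t): float(c) for t, c in Counter(terms).items()})
    for e, c in Counter(events).items():
        vec[("event", e)] += event_weight * c
    return vec


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(k, 0.0) for k, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Two sentences that share no surface terms but describe the same event.
s1_terms = ["troops", "entered", "city"]
s2_terms = ["soldiers", "moved", "into", "capital"]
shared_event = ["military_advance"]  # assumed output of an event extractor

plain = cosine(enriched_vector(s1_terms, []), enriched_vector(s2_terms, []))
enriched = cosine(
    enriched_vector(s1_terms, shared_event),
    enriched_vector(s2_terms, shared_event),
)
```

In this toy case the term-only similarity is zero because the sentences share no words, while the event-extended vectors yield a positive similarity. How events are actually extracted and weighted in the proposed model is not specified in this abstract.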