Proceedings of the 21st ACM International Conference on Information and Knowledge Management 2012
DOI: 10.1145/2396761.2396857
|View full text |Cite
|
Sign up to set email alerts
|

Temporal corpus summarization using submodular word coverage

Abstract: In many areas of life, we now have almost complete electronic archives reaching back for well over two decades. This includes, for example, the body of research papers in computer science, all news articles written in the US, and most people's personal email. However, we have only rather limited methods for analyzing and understanding these collections. While keyword-based retrieval systems allow efficient access to individual documents in archives, we still lack methods for understanding a corpus as a whole. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
43
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 64 publications
(44 citation statements)
references
References 28 publications
0
43
0
Order By: Relevance
“…Closely related to our work are the unsupervised key phrase extraction approaches that have been explored to reinforce summarization [32,29,11,22,24]. Namely, Litval and Last [11] and Riedhammer et al [22] propose the use of key phrases to summarize news articles [11] and meetings [22].…”
Section: Two-stage Methodsmentioning
confidence: 91%
“…Closely related to our work are the unsupervised key phrase extraction approaches that have been explored to reinforce summarization [32,29,11,22,24]. Namely, Litval and Last [11] and Riedhammer et al [22] propose the use of key phrases to summarize news articles [11] and meetings [22].…”
Section: Two-stage Methodsmentioning
confidence: 91%
“…Constrained submodular maximization has found several new applications in recent years. Some of these include data summarization [LB11,SSSJ12,DKR13], influence maximization in social networks [KKT03, CWY09, CWW10, GBL11, SS13], generalized assignment [CCPV07], mechanism design [BIK07], and network monitoring [LKG + 07].…”
Section: Introductionmentioning
confidence: 99%
“…Over the recent years, submodular optimization has been identified as a powerful tool for numerous data mining and machine learning applications including viral marketing [17], network monitoring [22], news article recommendation [10], nonparametric learning [14,29], document and corpus summarization [23,7,33], network inference [30], and Determinantal Point Processes [13]. A problem of key importance in all these applications is to maximize a monotone submodular function subject to a cardinality constraint (i.e., a bound on the number k of elements that can be selected).…”
Section: Background and Related Workmentioning
confidence: 99%
“…pelling approach that has gained a lot of interest in recent years is data summarization: selecting representative subsets of manageable size out of large data sets. Applications range from exemplar-based clustering [8], to document [23,7] and corpus summarization [33], to recommender systems [10,9], just to name a few. A systematic way for data summarization, used in all the aforementioned applications, is to turn the problem into selecting a subset of data elements optimizing a utility function that quantifies "representativeness" of the selected set.…”
Section: Introductionmentioning
confidence: 99%