Proceedings of the Nineteenth ACM Conference on Hypertext and Hypermedia 2008
DOI: 10.1145/1379092.1379117
|View full text |Cite
|
Sign up to set email alerts
|

Generating links by mining quotations

Abstract: Scanning books, magazines, and newspapers has become a widespread activity because people believe that much of the worlds information still resides off-line. In general after works are scanned they are indexed for search and processed to add links. This paper describes a new approach to automatically add links by mining popularly quoted passages. Our technique connects elements that are semantically rich, so strong relations are made. Moreover, link targets point within a work, facilitating navigation. This pa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
37
0

Year Published

2008
2008
2015
2015

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 25 publications
(38 citation statements)
references
References 23 publications
1
37
0
Order By: Relevance
“…on the chronological line. The only approach known to us that can be paralleled to ours is the one described in Kolak and Schilit (2008) for quotation mining within the Google Books corpus with algorithm searching for verbatim quotations only.…”
Section: Quotations and Their Definitionmentioning
confidence: 99%
See 2 more Smart Citations
“…on the chronological line. The only approach known to us that can be paralleled to ours is the one described in Kolak and Schilit (2008) for quotation mining within the Google Books corpus with algorithm searching for verbatim quotations only.…”
Section: Quotations and Their Definitionmentioning
confidence: 99%
“…1 That is why we call these links edges and not arcs, and possibly, the graph could be called a semi-directed graph. Kolak and Schilit (2008) observe that the standard plagiarism detection algorithms are useless for unmarked quotation mining and suggest straightforward and efficient algorithm for repeated passage extraction. The algorithm is suitable for modern English texts, since quotations are more or less verbatim and the word order is stable.…”
Section: The Grid and The Networkmentioning
confidence: 99%
See 1 more Smart Citation
“…From human being perspective, such research assumes a particular interest when the involved data are natural language documents and the relationships are defined between entities described in text, e.g. [23,24,34,47].…”
Section: Introductionmentioning
confidence: 99%
“…for the automatic generation of hyperlinks between related entities across documents [15] or digital media indexing and integration [24]. In bioinformatics, studies on relation mining are carried out on three main different data types: natural language texts, molecular structures expressed in text format (e.g.…”
Section: Introductionmentioning
confidence: 99%