2012 International Conference on Information Retrieval &Amp; Knowledge Management 2012
DOI: 10.1109/infrkm.2012.6204977
|View full text |Cite
|
Sign up to set email alerts
|

Automatic identification of cross-document structural relationships

Abstract: Analysis on inter-document relationship is one of the important studies in multi document analysis. In this paper, we will focus on some special properties that multi document articles hold, specifically news articles. Information across news articles reporting on the same story are often related. Cross-document Structure Theory (CST) gives the relationship between pairs of sentences from different documents. For example, two sentences might have relationships such as identical, overlapping or contradicting. O… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2014
2014
2018
2018

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(6 citation statements)
references
References 10 publications
0
6
0
Order By: Relevance
“…The best result was achieved with CBR based on the cosine similarity measure. It expressed improved results than in K u m a r et al [12]: Identity 0.966, Subsumption 0.803, Description 0.786, and 0.722 for Overlap.…”
Section: Related Workmentioning
confidence: 66%
See 1 more Smart Citation
“…The best result was achieved with CBR based on the cosine similarity measure. It expressed improved results than in K u m a r et al [12]: Identity 0.966, Subsumption 0.803, Description 0.786, and 0.722 for Overlap.…”
Section: Related Workmentioning
confidence: 66%
“…The features of A l e i x o and P a r d o [1] were expanded with: 1) cosine similarity of word vectors, 2) intersection of common words measured with the Jaccard Index, 3) an indicator of longer sentence (1 if S1 was longer, 0 if equal, -1 if S1 was shorter), 4) and uni-directional word coverage ratio (S1 → S2 and S2 → S1). K u m a r et al [12] followed Z a h r i and F u k u m o t o [37], but restricted the set of relations further down to four: Identity, Subsumption, Overlap and Elaboration.…”
Section: Related Workmentioning
confidence: 99%
“…In our work we proposed a method for the recognition of the full set of 17 CST relations, in contrast to the limited of subsets used in literature, e.g. in (Kumar et al, 2012a). Our method outperforms also the state of the art algorithm when compared on a corpus of the similar origin and content.…”
Section: Discussionmentioning
confidence: 99%
“…The features of (Zahri and Fukumoto, 2011) were extended with the Jaccard based similarity of noun phrases and verb phrases. CBR based on the cosine similarity measure expressed improved results than in (Kumar et al, 2012a): Identity 0.966, Subsumption 0.803, Description 0.786, and 0.722 for Overlap.…”
Section: Related Workmentioning
confidence: 93%
See 1 more Smart Citation