Proceedings of the Seventeenth Australasian Document Computing Symposium 2012
DOI: 10.1145/2407085.2407103
|View full text |Cite
|
Sign up to set email alerts
|

Pairwise similarity of TopSig document signatures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2012
2012
2016
2016

Publication Types

Select...
2
2
1

Relationship

1
4

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 8 publications
0
10
0
Order By: Relevance
“…Early approaches used to generate document signatures are based on the bitwise OR composition of binary signatures associated with terms in documents [9,7]. Further refinements of the signature generation process have been proposed.…”
Section: Patent Classification With Signaturesmentioning
confidence: 99%
See 1 more Smart Citation
“…Early approaches used to generate document signatures are based on the bitwise OR composition of binary signatures associated with terms in documents [9,7]. Further refinements of the signature generation process have been proposed.…”
Section: Patent Classification With Signaturesmentioning
confidence: 99%
“…Further refinements of the signature generation process have been proposed. A recent approach, called TopSig [10,7], uses random indexing for compressing the standard term-document matrix, followed by aggressive numerical precision reduction to maintain only the sign bits of the projected term-document matrix. This approach has been shown to be superior to standard signature approaches: we thus rely on the TopSig method to generate patent signatures.…”
Section: Patent Classification With Signaturesmentioning
confidence: 99%
“…In order to employ RMI for the construction of a VSM at reduced dimension, two model parameters should be decided: (a) the dimension of the VSM, which is shown by m, and (b) the number of non-zero elements in index vectors, which is determined by s in Equation 5. In contrast to the classic one-dimension-per-context-element methods of VSM construction, 5 the value of m in RPs and thus in RMI is chosen independently of the number of context elements n in the model.…”
Section: Random Manhattan Indexingmentioning
confidence: 99%
“…In contrast to the classic one-dimension-per-context-element methods of VSM construction, 5 the value of m in RPs and thus in RMI is chosen independently of the number of context elements n in the model. In RMI, as shown in [18], m is established by the probability and the maximum expected amount of distortions in pairwise distances and the number of vectors p in the model: a larger m yields to lower bounds on the distortion with a higher probability, i.e.…”
Section: Random Manhattan Indexingmentioning
confidence: 99%
See 1 more Smart Citation