1995
DOI: 10.1109/tit.1995.476344
|View full text |Cite
|
Sign up to set email alerts
|

Managing Gigabytes: Compressing and Indexing Documents and Images

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
1,199
0
18

Year Published

2001
2001
2010
2010

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 700 publications
(1,219 citation statements)
references
References 1 publication
2
1,199
0
18
Order By: Relevance
“…In our approach, we use an inverted file [33] to index images. The inverted index consists of two components: one includes indexed visual words and visual phrases, and the other includes vectors containing the information about the spatial weighting of the visual words and the occurrence of the visual phrases.…”
Section: Image Indexingmentioning
confidence: 99%
“…In our approach, we use an inverted file [33] to index images. The inverted index consists of two components: one includes indexed visual words and visual phrases, and the other includes vectors containing the information about the spatial weighting of the visual words and the occurrence of the visual phrases.…”
Section: Image Indexingmentioning
confidence: 99%
“…Gap compression [22] is effective when the gaps between sorted docIDs are small. To reduce the gap size, we propose to periodically remap docIDs from 160-bit hashes to dense numbers from 1 to the number of documents.…”
Section: Gap Compressionmentioning
confidence: 99%
“…Output-sensitive data structures are at the heart of text searching [13], geometric searching [5], database searching [28], and information retrieval in general [3,31]. They are the result of preprocessing n items (these can be textual data, geometric data, database records, multimedia, or any other kind of data) into O(n polylog(n)) space in such a way, as to allow quickly answering on-line queries in O(t(n) + ℓ) time, where t(n) = o(n) is the cost of querying the data structure (typically t(n) = polylog(n)).…”
Section: Introductionmentioning
confidence: 99%
“…While ranking itself has been the subject of intense theoretical investigation in the context of search engines [17,18,24], we could not find any explicit study pertaining to ranking in the context of data structures. The only published data structure of this kind is the inverted lists [31] in which the documents are sorted according to their rank order. McCreight's paper on priority search trees [19] refers to enumeration in increasing order along the yaxis but it does not indeed discuss how to report the items in sorted order along the y-axis.…”
Section: Introductionmentioning
confidence: 99%