2012
DOI: 10.1093/comjnl/bxr133
|View full text |Cite
|
Sign up to set email alerts
|

A Parallel Framework for In-Memory Construction of Term-Partitioned Inverted Indexes

Abstract: With the advances in cloud computing and huge RAMs provided by 64-bit architectures, it is possible to tackle large problems using memory-based solutions. Construction of term-based, partitioned, parallel inverted indexes is a communication intensive task and suitable for memory-based modeling. In this paper, we provide an efficient parallel framework for in-memory construction of term-based partitioned, inverted indexes. We show that, by utilizing an efficient bucketing scheme, we can eliminate the need for t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 23 publications
0
2
0
Order By: Relevance
“…Assuming the documents are already uniformly distributed on the nodes, constructing a document-based-partitioned index is a relatively trivial task since each node can simply build a local index on its collection. Constructing a term-based-partitioned index via message passing, on the other hand, requires coordination and communication among the nodes Kucukyilmaz et al 2012]. -Load imbalance.…”
Section: Term-based Versus Document-based Inverted Index Partitioningmentioning
confidence: 99%
“…Assuming the documents are already uniformly distributed on the nodes, constructing a document-based-partitioned index is a relatively trivial task since each node can simply build a local index on its collection. Constructing a term-based-partitioned index via message passing, on the other hand, requires coordination and communication among the nodes Kucukyilmaz et al 2012]. -Load imbalance.…”
Section: Term-based Versus Document-based Inverted Index Partitioningmentioning
confidence: 99%
“…Therefore, advances in high-performance computing provide the infrastructure to handle large workloads [Kucukyilmaz et al 2012]. For this reason, in this paper, we propose a heterogeneous parallel novel architecture for inverted index generation from the in-memory inversion algorithm.…”
Section: Introductionmentioning
confidence: 99%