2016
DOI: 10.48550/arxiv.1608.05634
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Thrill: High-Performance Algorithmic Distributed Batch Data Processing with C++

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2018
2018
2018
2018

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(6 citation statements)
references
References 0 publications
0
6
0
Order By: Relevance
“…We propose two distributed graph clustering algorithms, DSLM-Mod and DSLM-Map, that optimize modularity and map equation, respectively. Our algorithms are the first graph clustering algorithms based on Thrill [4], a distributed big data processing framework written in C++ that implements an extended MapReduce model. Our algorithms are easy to extend for optimizing different density-based quality measures.…”
Section: Contributionmentioning
confidence: 99%
See 1 more Smart Citation
“…We propose two distributed graph clustering algorithms, DSLM-Mod and DSLM-Map, that optimize modularity and map equation, respectively. Our algorithms are the first graph clustering algorithms based on Thrill [4], a distributed big data processing framework written in C++ that implements an extended MapReduce model. Our algorithms are easy to extend for optimizing different density-based quality measures.…”
Section: Contributionmentioning
confidence: 99%
“…Thrill [4] is a distributed C++ big data processing framework. It can distribute the program execution over multiple machines and threads within a machine.…”
Section: Thrillmentioning
confidence: 99%
“…We explored using the Thrill [26] library to track the most energetic particles for the results of VPIC plasma physics simulation [32]. Thrill is a research project that aims to provide a bridge between big data analytics and HPC platforms.…”
Section: Solution Approachmentioning
confidence: 99%
“…We implemented five suffix array construction algorithms using the distributed big data batch computation framework Thrill [2]. Thrill works with distributed immutable arrays (DIAs) storing tuples.…”
Section: A Short Introduction Into Thrillmentioning
confidence: 99%
“…= [ (0, 3),(4,3),(8,2),(1,7),(5,7), (2, 1), (6, 0),(3,6),(7,5) ] // 3.= [ (0, 3, 3), (4, 3, 2), (8, 2, 0), (1, 7, 7), (5, 7, 0), (2, 1, 0), (6, 0, 0), (3, 6, 5), (7, 5, 0) ] // 3.= [ (6, 0, 0), (2, 1, 0), (8, 2, 0), (4, 3, 2), (0, 3, 3), (7, 5, 0),(3,6,5),(5,7, 0),(1,7,7) ] // 1.= [ (6, 0), (2, 1),(8,2), (4, 3), (0, 4),(7,5),(3,6),(5,7),(1,8) ] // 1.5 1 item with rank 0 // 1.6…”
mentioning
confidence: 99%