2015
DOI: 10.14778/2824032.2824052

Asynchronous and fault-tolerant recursive datalog evaluation in shared-nothing engines

Abstract: We present a new approach for data analytics with iterations. Users express their analysis in Datalog with bag-monotonic aggregate operators, which enables the expression of computations from a broad variety of application domains. Queries are translated into query plans that can execute in shared-nothing engines, are incremental, and support a variety of iterative models (synchronous, asynchronous, different processing priorities) and failure-handling techniques. The plans require only small extensions to an …

Cited by 49 publications (42 citation statements)
References 15 publications

“…5 of [39] and the program in Fig. 2 of [52]. Although the iterated fixpoint computation produces the correct results for these programs, this is not true for other programs.…”
Section: Example (mentioning)
confidence: 99%
“…However, it seems that both [39] and [52] ignore this problem, and allow unrestricted use of count or sum in recursion, e.g., the program in Fig. 5 of [39] and the program in Fig.…”
Section: Example (mentioning)
confidence: 99%
“…This has the effect of giving smaller labels a "head start" and reduces the amount of churn among nodes with large identifiers. It accomplishes the same goal as asynchronous systems, like Myria [40] and SociaLite [38], that propagate more successful labels.…”
Section: Explanations For Connected Components (mentioning)
confidence: 99%
“…Figure 14 also depicts the performance of the state-of-the-art systems for graph analytics, Myria [40] and SociaLite [38], on the label propagation algorithm. Note that we were unable to get SociaLite to report more than 1.7M labels for the Twitter graph, thus, we do not report its elapsed time for this dataset.…”
Section: Explanations For Connected Components (mentioning)
confidence: 99%
“…A growing body of research on scalable data analytics has brought a renaissance of interest in Datalog because of its ability to specify, declaratively, advanced data-intensive applications that execute efficiently over different systems and architectures, including massively parallel ones (Seo et al 2013; Shkapsky et al 2013; Yang and Zaniolo 2014; Aref et al 2015; Wang et al 2015; Yang et al 2015; Shkapsky et al 2016; Yang et al 2017). A common thread in this new generation of Datalog systems is the use of aggregates in recursion, since aggregates enable the concise expression and efficient support of much more powerful algorithms than those expressible by programs that are stratified w.r.t.…”
Section: Introduction (mentioning)
confidence: 99%