Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques 2018
DOI: 10.1145/3243176.3243190
|View full text |Cite
|
Sign up to set email alerts
|

Data motifs

Abstract: The complexity and diversity of big data and AI workloads make understanding them difficult and challenging. This paper proposes a new approach to modelling and characterizing big data and AI workloads. We consider each big data and AI workload as a pipeline of one or more classes of units of computation performed on different initial or intermediate data inputs. Each class of unit of computation captures the common requirements while being reasonably divorced from individual implementations, and hence we call… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 28 publications
(3 citation statements)
references
References 25 publications
0
3
0
Order By: Relevance
“…For handling the big data use cases, graph traversal , machine learning , text analytics , and statistics algorithms are commonly used 72,73 . In this work, only those applications have been tested to explore compiler search space for 3Vs (Volume, Velocity, Variety) which are part of standard big data benchmarks, representing graph mining , machine learning , and text search categories 72,73 . These applications have been selected through well‐known C/C++ based benchmark suites including Rodinia, 27 Graphbig, 28 Phoneix, 29 CortexSuite, 31 genann, 32 and grep‐bench 30 .…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…For handling the big data use cases, graph traversal , machine learning , text analytics , and statistics algorithms are commonly used 72,73 . In this work, only those applications have been tested to explore compiler search space for 3Vs (Volume, Velocity, Variety) which are part of standard big data benchmarks, representing graph mining , machine learning , and text search categories 72,73 . These applications have been selected through well‐known C/C++ based benchmark suites including Rodinia, 27 Graphbig, 28 Phoneix, 29 CortexSuite, 31 genann, 32 and grep‐bench 30 .…”
Section: Methodsmentioning
confidence: 99%
“…Big data has been emerging in domains like social networks, search engines, ecommerce, multimedia processing, and bioinformatics. For handling the big data use cases, graph traversal , machine learning , text analytics , and statistics algorithms are commonly used 72,73 . In this work, only those applications have been tested to explore compiler search space for 3Vs (Volume, Velocity, Variety) which are part of standard big data benchmarks, representing graph mining , machine learning , and text search categories 72,73 .…”
Section: Methodsmentioning
confidence: 99%
“…In the Semantic Web (SW), ontologies have traditionally been used to act as lenses over data eventually leading to an entire field of study: Ontology-Based Data Access (Calvanese et al, 2015). More recently, lenses have been used to create multiple views over chemistry data (Batchelor et al, 2014), help manage large data sets (Lenzerini, 2018), or understand big data and AI workloads (Gao et al, 2018).…”
Section: Related Workmentioning
confidence: 99%