2019
DOI: 10.48550/arxiv.1909.02976
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle

Abstract: Machine learning (ML) applications become increasingly common in many domains. ML systems to execute these workloads include numerical computing frameworks and libraries, ML algorithm libraries, and specialized systems for deep neural networks and distributed ML. These systems focus primarily on efficient model training and scoring. However, the data science process is exploratory, and deals with underspecified objectives and a wide variety of heterogeneous data sources. Therefore, additional tools are employe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 32 publications
0
3
0
Order By: Relevance
“…These tools will often collect provenance information in a degree. Unfortunately, they require programs to be written in a domain-specific-language suitable for their specialized ML systems such as SystemDS [9] and HELIX [45], and they only provide provenance for matrix manipulation operations or high-level conceptual ML operators. They do not have the ability to provide flexible data provenance for general-purpose scripts.…”
Section: Related Workmentioning
confidence: 99%
“…These tools will often collect provenance information in a degree. Unfortunately, they require programs to be written in a domain-specific-language suitable for their specialized ML systems such as SystemDS [9] and HELIX [45], and they only provide provenance for matrix manipulation operations or high-level conceptual ML operators. They do not have the ability to provide flexible data provenance for general-purpose scripts.…”
Section: Related Workmentioning
confidence: 99%
“…In recent years, researchers in the database community have been working on raising the level of abstractions of machine learning (ML) and integrating such functionality into today's data management systems [95,96], e.g., SystemML [25], SystemDS [8], Snorkel [71], ZeroER [91], TFX [5,9], Query 2.0 [92], Krypton [66], Cerebro [67], ModelDB [86], MLFlow [94], Deep-Dive [14], HoloClean [72], EaseML [1], ActiveClean [48], and NorthStar [47]. End-to-end AutoML systems [93,97,33] have been an emerging type of systems that has significantly raised the level of abstractions of building ML applications.…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, researchers in the database community have been working on raising the level of abstractions of machine learning (ML) and integrating such functionality into today's data management systems, e.g., SystemML [18], SystemDS [5],…”
Section: Introductionmentioning
confidence: 99%