2021
DOI: 10.1109/tbdata.2019.2907588
|View full text |Cite
|
Sign up to set email alerts
|

Incorporating Data Context to Cost-Effectively Automate End-to-End Data Wrangling

Abstract: The process of preparing potentially large and complex data sets for further analysis or manual examination is often called data wrangling. In classical warehousing environments, the steps in such a process are carried out using Extract-Transform-Load platforms, with significant manual involvement in specifying, configuring or tuning many of them. In typical big data applications, we need to ensure that all wrangling steps, including web extraction, selection, integration and cleaning, benefit from automation … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(2 citation statements)
references
References 44 publications
0
2
0
Order By: Relevance
“…This research area is commonly called AutoML. It turned to be a hot research area in recent years (Bilalli et al, 2019;Giovanelli et al, 2021;Koehler et al, 2021;Quemy, 2019). (Kedziora et al, 2020) provides an excellent state of the art of this research area.…”
Section: Machine Learningmentioning
confidence: 99%
“…This research area is commonly called AutoML. It turned to be a hot research area in recent years (Bilalli et al, 2019;Giovanelli et al, 2021;Koehler et al, 2021;Quemy, 2019). (Kedziora et al, 2020) provides an excellent state of the art of this research area.…”
Section: Machine Learningmentioning
confidence: 99%
“…Ajax [53] brings a SQL-like language to data transformations. Early work on automating end-to-end data wrangling seems promising, but there is likely much more to do [98].…”
Section: Current Research Lines and Future Challengesmentioning
confidence: 99%