2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE) 2021
DOI: 10.1109/icse43902.2021.00033
|View full text |Cite
|
Sign up to set email alerts
|

An Empirical Study of Refactorings and Technical Debt in Machine Learning Systems

Abstract: Machine Learning (ML), including Deep Learning (DL), systems, i.e., those with ML capabilities, are pervasive in today's data-driven society. Such systems are complex; they are comprised of ML models and many subsystems that support learning processes. As with other complex systems, ML systems are prone to classic technical debt issues, especially when such systems are long-lived, but they also exhibit debt specific to these systems. Unfortunately, there is a gap of knowledge in how ML systems actually evolve … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 32 publications
(4 citation statements)
references
References 57 publications
0
4
0
Order By: Relevance
“…Despite Python's and ML's meteoric rise [58]- [61], support for automated code evolution is still in its infancy. To advance the science and tooling for automating code changes in Python, we built PYEVOLVE, which infers transformation rules and then applies them.…”
Section: Discussionmentioning
confidence: 99%
“…Despite Python's and ML's meteoric rise [58]- [61], support for automated code evolution is still in its infancy. To advance the science and tooling for automating code changes in Python, we built PYEVOLVE, which infers transformation rules and then applies them.…”
Section: Discussionmentioning
confidence: 99%
“…Many studies have empirically investigated different aspects in developing, deploying and maintaining DL systems, e.g., software engineering for DL systems [2,14,35], challenges in developing DL systems [1,62] and deploying DL systems [8], pain-points in using cloud services of computer vision [10], accuracy variance in training DL systems [43], performance variance in deploying DL models to different mobile devices and web browsers [15,33], discussion topics about DL frameworks [17], API evolution in DL frameworks [66], technical debt in DL frameworks [31,32] and DL systems [38,45,50], and dependency supply chain of DL libraries [16,49].…”
Section: Empirical Studies About DLmentioning
confidence: 99%
“…While technical debt can have solid technical reasons, this debt-like fiscal debt-needs to be serviced by, e.g., refactoring code, and removing unnecessary code (dead code) and unnecessary dependencies. The technical debt of machine learning (ML) systems has recently come under scrutiny [42,55,61]. Sculley et al [55] identify eight main sources of technical debt in ML systems including common code issues such as dead code that is never used and duplicate code.…”
Section: Introductionmentioning
confidence: 99%