“…Another benefit of datadriven software platforms is illustrated by a common strategy [31] is to limit retention of data and trained ML models to, say, 35 days, which improves data privacy but requires infrastructure for feature lineage tracking and automation for ML model retraining. ML platforms [20,30,39] often support and automate workflows that train ML models on data to perform prediction, estimation, ranking, selection, and other ML tasks. Such applications necessitate regular data collection and retraining of ML models [45], which provides strong motivation for platforms in practice.…”