2021
DOI: 10.48550/arxiv.2109.13398
Preprint

Unrolling SGD: Understanding Factors Influencing Machine Unlearning

Abstract: Machine unlearning is the process through which a deployed machine learning model forgets about one of its training data points. While naively retraining the model from scratch is an option, it is almost always associated with a large computational effort for deep learning models. Thus, several approaches to approximately unlearn have been proposed along with corresponding metrics that formalize what it means for a model to forget about a data point. In this work, we first taxonomize approaches and metrics of …

Cited by 5 publications (7 citation statements) | References 25 publications

“…Scalability to large deletion sets: Many prior unlearning methods assume tiny deletion sets [11,30,53,61,68]. However, we argue practical applications typically require the deletion of larger subsets of data, necessitating that unlearning procedures should scale well to this setting.…”
Section: Problem Formulation
confidence: 99%
“…Second, the prohibitive cost of collecting a representative sample of deep networks makes it infeasible to measure the equivalence of model distributions. Most prior work addresses this by measuring similarity of weights [39,61,68] or outputs [27][28][29][51] between a single model sampled from φ_u and φ_r. However, measuring similarity between two model instances is not representative of the similarity between the distributions they are drawn from.…”
Section: Against Model Indistinguishability
confidence: 99%
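The single-instance comparisons this statement critiques are easy to make concrete. Below is a minimal sketch, assuming two PyTorch models with identical architectures (one unlearned, one retrained); the function names are illustrative, not from the cited papers.

```python
# Sketch of comparing ONE unlearned model to ONE retrained model,
# by weight distance and by output divergence. Hypothetical helpers.
import torch
import torch.nn.functional as F

def weight_l2_distance(model_u, model_r):
    """L2 distance between flattened parameters of two models with
    identical architectures (one sample from phi_u, one from phi_r)."""
    diffs = [
        (p_u - p_r).flatten()
        for p_u, p_r in zip(model_u.parameters(), model_r.parameters())
    ]
    return torch.cat(diffs).norm(p=2).item()

def output_kl_divergence(model_u, model_r, inputs):
    """Mean KL divergence between the two models' predictive
    distributions on a batch of evaluation inputs."""
    with torch.no_grad():
        log_p_u = F.log_softmax(model_u(inputs), dim=-1)
        p_r = F.softmax(model_r(inputs), dim=-1)
    return F.kl_div(log_p_u, p_r, reduction="batchmean").item()
```

As the quote argues, one such pair of numbers says little about the underlying model distributions; estimating distributional similarity would require many independent retraining and unlearning runs.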
“…Since then, further extensions to scenarios where efficient analytic solutions could be found were given [24,9], and extensions to unlearn deep neural networks (DNNs) were proposed [2,13,20,12,11,10]. With the growing field of unlearning, two categories of machine unlearning algorithms have emerged [23]: exact and approximate unlearning, differing in how unlearning is done and in how the concept of "unlearning" is understood. Exact unlearning for DNNs is based on retraining.…”
Section: Machine Unlearning
confidence: 99%
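The exact-unlearning baseline mentioned here is simply retraining from scratch on the retained data. A minimal sketch follows; `train_model` is a hypothetical stand-in for any training routine, not an API from the cited works.

```python
# Sketch of exact unlearning via retraining: the returned model is
# trained only on the data that remains after deletion.
from torch.utils.data import Subset

def exact_unlearn(train_dataset, forget_indices, train_model):
    """Retrain a fresh model on the training set minus the deletion set.

    train_dataset:  the original training set
    forget_indices: indices of the points to be unlearned
    train_model:    any routine that trains a new model on a dataset
    """
    forget = set(forget_indices)
    retained = [i for i in range(len(train_dataset)) if i not in forget]
    retain_set = Subset(train_dataset, retained)
    # Exact by construction: the new model never saw the forgotten points.
    return train_model(retain_set)
```

This is exactly the costly baseline that approximate methods try to avoid, which is why its computational burden motivates the rest of this literature.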
“…Existing work has proposed using techniques such as membership inference [21] to verify the effectiveness of approximate unlearning [12,1], showing that approximately unlearned models cannot be easily distinguished from models that were never trained on the data points to be unlearned. Alternatively, others compare the parameters of approximately unlearned models to the parameters of exactly unlearned models [10,11,25,23].…”
Section: Machine Unlearning
confidence: 99%
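One simple instantiation of the membership-inference verification described above, chosen here for illustration and not taken from the cited papers, is a loss-based check: after unlearning, per-example losses on the forgotten points should be statistically indistinguishable from losses on held-out points the model never saw.

```python
# Sketch of a loss-based membership-inference check on an unlearned
# model. Loaders yield (inputs, labels) batches; names are hypothetical.
import torch
import torch.nn.functional as F

@torch.no_grad()
def per_example_losses(model, loader):
    """Cross-entropy loss of every example in the loader."""
    losses = []
    for x, y in loader:
        losses.append(F.cross_entropy(model(x), y, reduction="none"))
    return torch.cat(losses)

def membership_gap(model, forget_loader, holdout_loader):
    """Difference in mean loss between forgotten and held-out points.
    A gap near zero suggests an attacker cannot tell the two apart,
    i.e. the unlearning plausibly succeeded by this metric."""
    forget_loss = per_example_losses(model, forget_loader).mean()
    holdout_loss = per_example_losses(model, holdout_loader).mean()
    return (forget_loss - holdout_loss).item()
```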