2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.00660

Searching for Robustness: Loss Learning for Noisy Classification Tasks

Abstract: We present a "learning to learn" approach for discovering white-box classification loss functions that are robust to label noise in the training data. We parameterise a flexible family of loss functions using Taylor polynomials, and apply evolutionary strategies to search for noise-robust losses in this space. To learn re-usable loss functions that can apply to new tasks, our fitness function scores their performance in aggregate across a range of training datasets and architectures. The resulting white-box lo…
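
The abstract outlines the core recipe: a loss parameterised by Taylor-polynomial coefficients, searched with an evolutionary strategy whose fitness aggregates performance across several training tasks. The sketch below illustrates that recipe under assumptions of our own rather than the paper's exact formulation: the polynomial is taken in p_y, the predicted probability of the true class; the tasks are synthetic linear-softmax problems with symmetric label noise; and the features, model, and hyperparameters are toy stand-ins (requires numpy and the pycma package).

```python
# Minimal sketch (assumptions, not the paper's code): candidate losses are
# degree-4 Taylor polynomials in p_y, the predicted probability of the true
# class; CMA-ES searches the coefficient vector; fitness is clean validation
# accuracy after training a linear-softmax model on noisy labels, averaged
# over several synthetic tasks.
import numpy as np
import cma  # pip install cma

rng = np.random.default_rng(0)

def make_task(n=500, d=20, c=3, noise=0.3):
    """Synthetic classification task with symmetric label noise."""
    W_true = rng.normal(size=(d, c))
    X = rng.normal(size=(n, d))
    y = (X @ W_true + rng.normal(scale=0.5, size=(n, c))).argmax(1)
    y_noisy = y.copy()
    flip = rng.random(n) < noise
    y_noisy[flip] = rng.integers(0, c, flip.sum())
    return X, y_noisy, y  # train on noisy labels, score on clean ones

def poly_loss_grad(p_y, theta):
    """Derivative w.r.t. p_y of the candidate loss sum_k theta[k] * p_y**k."""
    ks = np.arange(1, len(theta))
    return (theta[1:] * ks) @ np.stack([p_y ** (k - 1) for k in ks])

def fitness(theta, tasks, steps=200, lr=0.1):
    """Negative mean clean accuracy (CMA-ES minimises)."""
    accs = []
    for X, y_noisy, y_clean in tasks:
        n, d = X.shape
        c = y_noisy.max() + 1
        W = np.zeros((d, c))
        for _ in range(steps):
            logits = X @ W
            logits -= logits.max(1, keepdims=True)
            P = np.exp(logits)
            P /= P.sum(1, keepdims=True)
            p_y = P[np.arange(n), y_noisy]
            onehot = np.eye(c)[y_noisy]
            # chain rule: dL/dlogits = dL/dp_y * dp_y/dlogits
            dL_dp = poly_loss_grad(p_y, theta)        # shape (n,)
            dp_dlogits = p_y[:, None] * (onehot - P)  # shape (n, c)
            W -= lr * X.T @ (dL_dp[:, None] * dp_dlogits) / n
        accs.append(((X @ W).argmax(1) == y_clean).mean())
    return -float(np.mean(accs))

tasks = [make_task() for _ in range(3)]  # fitness aggregates across tasks
es = cma.CMAEvolutionStrategy(np.zeros(5), 0.5, {"maxiter": 15})
while not es.stop():
    cands = es.ask()
    es.tell(cands, [fitness(np.asarray(t), tasks) for t in cands])
print("best coefficients:", es.result.xbest)
```

In the paper itself the fitness is computed over multiple dataset and architecture pairs with real networks; the synthetic tasks above only stand in for that aggregation.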

Cited by 9 publications (7 citation statements) | References 21 publications

Citation statements:
“…For example, in [24,28], differentiable surrogates of non-differentiable performance metrics are learned to reduce the misalignment between the performance metric and the loss function. Alternatively, in [4,9,27,46], loss functions are learned to improve sample efficiency and asymptotic performance in supervised and reinforcement learning, while in [3,20,35], they are learned to improve model robustness to domain shift and domain generalization.…”
Section: Gradient-based Approaches (mentioning)
confidence: 99%
“…MetaReg (Balaji et al, 2018) meta-learns regularization parameters to improve domain generalisation. ARL (Gao et al, 2021) meta-learns a loss function to improve the robustness of learning from noisy labels.…”
Section: Related Work (mentioning)
confidence: 99%
“…This tells us that the expected convergence rate on novel tasks depends on the learning divergence on training tasks, plus complexity terms such as the F-norm of the meta-learned optimiser weights M. Note that restricting the diameter r of the parameter space is usually required to obtain generalisation guarantees (Bartlett et al., 2017; Long & Sedghi, 2020; Gouk et al., 2021), so this is not an unusual or counterproductive requirement.…”
Section: Generalisation of the Learned Optimiser (mentioning)
confidence: 99%
“…A promising alternative paradigm is to use evolution-based methods to learn M, favoring their inherent ability to avoid local optima by maintaining a population of solutions, their ease of parallelization across multiple processors, and their ability to directly optimize non-differentiable functions. Examples of such work include [16] and [17], which both represent M as parameterized Taylor polynomials optimized with covariance matrix adaptation evolutionary strategies (CMA-ES). These approaches successfully derive interpretable loss functions but, as before, they assume the parametric form in advance via the degree of the polynomial.…”
Section: Evolution-based Approaches (mentioning)
confidence: 99%
“…In particular, many loss function learning approaches use a parametric loss function representation such as a neural network [15] or Taylor polynomial [16], [17], which is limited as it imposes unnecessary assumptions and constraints on the structure of the learned loss function. However, the current non-parametric alternative is a two-stage discovery and optimization process, which infers both the loss function structure and parameters simultaneously using genetic programming and covariance matrix adaptation [18], but quickly becomes intractable for large-scale optimization problems.…”
Section: Introduction (mentioning)
confidence: 99%
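
The last quoted passage contrasts parametric loss representations (a neural network [15] or a Taylor polynomial [16], [17]) with a non-parametric two-stage search. As a rough illustration of what a neural-network parameterisation looks like, here is a hypothetical sketch; the input features, architecture, and output activation are assumptions made for exposition, not the design of [15].

```python
# Illustrative sketch of a parametric (neural-network) loss representation;
# the chosen input features and architecture are assumptions for exposition.
import torch
import torch.nn as nn

class LearnedLoss(nn.Module):
    """Maps per-sample prediction statistics to a scalar loss value.
    The fixed inputs and architecture are the 'structural constraints' the
    quoted passage refers to; the MLP weights are the meta-parameters."""
    def __init__(self, hidden=16):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(2, hidden), nn.ReLU(),
                                 nn.Linear(hidden, 1), nn.Softplus())

    def forward(self, logits, targets):
        probs = logits.softmax(dim=-1)
        p_y = probs.gather(1, targets[:, None]).squeeze(1)          # prob. of true class
        entropy = -(probs * probs.clamp_min(1e-8).log()).sum(-1)    # prediction entropy
        return self.net(torch.stack([p_y, entropy], dim=-1)).mean()

loss_fn = LearnedLoss()
logits, y = torch.randn(8, 3), torch.randint(0, 3, (8,))
print(loss_fn(logits, y))  # used in place of cross-entropy when training a base model
```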