Aviv Navon scite author profile

Aviv Navon

4Publications

10Citation Statements Received

105Citation Statements Given

How they've been cited

How they cite others

103

Affiliations

Publications

Order By: Most citations

Auxiliary Learning by Implicit Differentiation

Navon¹,

Achituve²,

Maron³

et al. 2020

Preprint

View full text Add to dashboard Cite

Training with multiple auxiliary tasks is a common practice used in deep learning for improving the performance on the main task of interest. Two main challenges arise in this multi-task learning setting: (i) Designing useful auxiliary tasks; and (ii) Combining auxiliary tasks into a single coherent loss. We propose a novel framework, AuxiLearn, that targets both challenges, based on implicit differentiation. First, when useful auxiliaries are known, we propose learning a network that combines all losses into a single coherent objective function. This network can learn non-linear interactions between auxiliary tasks. Second, when no useful auxiliary task is known, we describe how to learn a network that generates a meaningful, novel auxiliary task. We evaluate AuxiLearn in a series of tasks and domains, including image segmentation and learning with attributes. We find that AuxiLearn consistently improves accuracy compared with competing methods.

show abstract

Multi-Task Learning as a Bargaining Game

Navon¹,

Shamsian²,

Achituve³

et al. 2022

Preprint

View full text Add to dashboard Cite

Auxiliary Learning as an Asymmetric Bargaining Game

Shamsian¹,

Navon²,

Glazer³

et al. 2023

Preprint

View full text Add to dashboard Cite

Equivariant Architectures for Learning in Deep Weight Spaces

Navon¹,

Shamsian²,

Achituve³

et al. 2023

Preprint

View full text Add to dashboard Cite

Designing machine learning architectures for processing neural networks in their raw weight matrix form is a newly introduced research direction. Unfortunately, the unique symmetry structure of deep weight spaces makes this design very challenging. If successful, such architectures would be capable of performing a wide range of intriguing tasks, from adapting a pre-trained network to a new domain to editing objects represented as functions (INRs or NeRFs). As a first step towards this goal, we present here a novel network architecture for learning in deep weight spaces. It takes as input a concatenation of weights and biases of a pre-trained MLP and processes it using a composition of layers that are equivariant to the natural permutation symmetry of the MLP's weights: Changing the order of neurons in intermediate layers of the MLP does not affect the function it represents. We provide a full characterization of all affine equivariant and invariant layers for these symmetries and show how these layers can be implemented using three basic operations: pooling, broadcasting, and fully connected layers applied to the input in an appropriate manner. We demonstrate the effectiveness of our architecture and its advantages over natural baselines in a variety of learning tasks.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Aviv Navon

Auxiliary Learning by Implicit Differentiation

Multi-Task Learning as a Bargaining Game

Auxiliary Learning as an Asymmetric Bargaining Game

Equivariant Architectures for Learning in Deep Weight Spaces

Contact Info

Product

Resources

About