2024
DOI: 10.1109/tnnls.2023.3242345

The Symplectic Adjoint Method: Memory-Efficient Backpropagation of Neural-Network-Based Differential Equations

Abstract: The combination of neural networks and numerical integration can provide highly accurate models of continuous-time dynamical systems and probabilistic distributions. However, if a neural network is used n times during numerical integration, the whole computation graph can be considered a network n times deeper than the original. The backpropagation algorithm consumes memory in proportion to the number of uses times the network size, causing practical difficulties. This is true even if a checkpointing sche…
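The memory argument in the abstract can be illustrated with a toy sketch (not the paper's code, and not the symplectic adjoint method itself): applying the same function n times during integration forces naive backpropagation to store all n intermediate states, whereas an adjoint-style backward pass can recompute states and keep memory constant at the cost of extra forward computation. The function `f` and all names below are hypothetical stand-ins for one network evaluation per integrator step.

```python
import math

def f(x, w):
    # one "layer": a single network evaluation inside the integrator
    return math.tanh(w * x)

def df_dx(x, w):
    # derivative of f with respect to its state input
    return w * (1.0 - math.tanh(w * x) ** 2)

def grad_naive(x0, w, n):
    # naive backprop: keep every intermediate state (memory grows with n)
    xs = [x0]
    for _ in range(n):
        xs.append(f(xs[-1], w))
    g = 1.0
    for x in reversed(xs[:-1]):      # chain rule, newest layer first
        g *= df_dx(x, w)
    return g, len(xs)                # gradient of x_n w.r.t. x_0, states stored

def grad_recompute(x0, w, n):
    # adjoint-style trade-off: store only one state, recompute the rest
    g = 1.0
    for k in reversed(range(n)):
        x = x0
        for _ in range(k):           # recompute x_k from x_0
            x = f(x, w)
        g *= df_dx(x, w)
    return g, 1                      # O(1) stored states, O(n^2) time here

g1, mem1 = grad_naive(0.5, 1.3, 20)
g2, mem2 = grad_recompute(0.5, 1.3, 20)
assert abs(g1 - g2) < 1e-12          # same gradient, very different memory
```

Both routes give the same gradient; the difference is that the naive route stored 21 states while the recomputation route stored one, which is the memory-versus-time trade-off the paper's method is designed to improve on.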

Cited by 6 publications (9 citation statements) · References 57 publications
“…Augmented Neural ODEs [14] extended Neural ODEs by using additional dimensions to learn more complex functions. Matsubara et al. [21] improved Neural ODEs via the symplectic adjoint method and demonstrated that it consumes much less memory than the naive backpropagation algorithm and checkpointing schemes, while performing faster than the adjoint method.…”
Section: Reconsidering DNNs From the Perspective of Dynamic Systems
Mentioning confidence: 99%
“…Neural networks are a typical supervised learning method that can effectively overcome the limitations of linear models by learning the non-linear mapping between inputs and outputs [14][15][16][17]. Neural networks are composed of functions associated with a directed graph.…”
Section: Neural Network
Mentioning confidence: 99%
“…Backpropagation [15,16] is a widely used training strategy. It uses the chain rule of derivatives to compute the gradient of the loss with respect to the weights of each layer.…”
Section: Neural Network
Mentioning confidence: 99%
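The chain-rule mechanics mentioned in the statement above can be sketched on a tiny two-layer network (a generic illustration, not the cited works' code; all array shapes and names are assumptions). The backward pass propagates the loss gradient layer by layer, and a finite-difference probe verifies one of the resulting weight gradients.

```python
import numpy as np

rng = np.random.default_rng(0)

# tiny two-layer network: y = W2 @ tanh(W1 @ x), loss = 0.5 * ||y||^2
W1 = rng.normal(size=(4, 3))
W2 = rng.normal(size=(2, 4))
x = rng.normal(size=3)

def loss_and_grads(W1, W2, x):
    # forward pass, caching intermediates needed by the backward pass
    h = np.tanh(W1 @ x)
    y = W2 @ h
    loss = 0.5 * float(y @ y)
    # backward pass: chain rule applied layer by layer
    dy = y                        # dL/dy
    dW2 = np.outer(dy, h)         # dL/dW2
    dh = W2.T @ dy                # dL/dh
    dz = dh * (1.0 - h ** 2)      # through the tanh non-linearity
    dW1 = np.outer(dz, x)         # dL/dW1
    return loss, dW1, dW2

loss, dW1, dW2 = loss_and_grads(W1, W2, x)

# finite-difference check of one entry of dW1
eps = 1e-6
W1p = W1.copy()
W1p[0, 0] += eps
numeric = (loss_and_grads(W1p, W2, x)[0] - loss) / eps
assert abs(numeric - dW1[0, 0]) < 1e-4
```

The agreement between the analytic gradient and the numerical difference quotient is exactly what the chain rule guarantees, and it is the same mechanism that, repeated over n integrator steps, produces the deep computation graph discussed in the abstract.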