2020
DOI: 10.1049/trit.2020.0020

Compressing deep‐quaternion neural networks with targeted regularisation

Abstract: In recent years, hyper‐complex deep networks (such as complex‐valued and quaternion‐valued neural networks – QVNNs) have received renewed interest in the literature. They find applications in multiple fields, ranging from image reconstruction to 3D audio processing. Similar to their real‐valued counterparts, quaternion neural networks require custom regularisation strategies to avoid overfitting. In addition, for many real‐world applications and embedded implementations, there is a need to design suffic…



Cited by 23 publications (13 citation statements) | References 27 publications
“…The decoder has a mirrored structure, and thus quaternion fully connected layers are piled up, with an additional refiner layer at the end of the stack. We do not consider including quaternion batch normalization [38, 45], since it can introduce randomness which may affect the correct learning of the distribution statistics [24, 66]. For every experiment, the prior distribution is a centered isotropic ℍ-proper quaternion Gaussian distribution, as described in Section 4.…”
Section: Experimental Results (mentioning, confidence: 99%)
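As an aid to reading this excerpt, the following is a minimal NumPy sketch, not taken from the cited works, of what a centered isotropic ℍ-proper quaternion Gaussian prior amounts to in practice: the four real components are drawn i.i.d. with zero mean and equal variance, so the cross-component covariances vanish. The function name is illustrative.

```python
import numpy as np

def sample_isotropic_quaternion_gaussian(n, dim, sigma=1.0, rng=None):
    """Draw n quaternion vectors of length dim whose four real components
    (r, i, j, k), stacked on the last axis, are i.i.d. zero-mean Gaussians
    with equal variance (centered, isotropic, H-proper)."""
    rng = np.random.default_rng() if rng is None else rng
    return rng.normal(0.0, sigma, size=(n, dim, 4))

# Empirical check: the 4x4 covariance of the components is close to
# sigma^2 * I, i.e. equal per-component variances and (approximately)
# vanishing cross-component covariances.
q = sample_isotropic_quaternion_gaussian(100_000, 1, sigma=0.5).reshape(-1, 4)
print(np.round(np.cov(q, rowvar=False), 3))
```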
“…These properties have been widely exploited in shallow learning models, such as linear and nonlinear adaptive filters [27, 28, 29, 30, 31, 32, 33, 34]. Another fundamental property of quaternion-valued learning is the Hamilton product, which has recently favored the proliferation of convolutional neural networks in the quaternion domain [35, 36, 37, 38]. Due to their capabilities, quaternion-valued learning methods have been applied in several fields, including spoken language understanding [39], color image processing [40, 41], 3D audio [42, 43], speech recognition [44], image generation [45], quantum mechanics [46], risk diversification [47], and gait data analysis [48].…”
Section: Introduction (mentioning, confidence: 99%)
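Since this excerpt hinges on the Hamilton product, here is a short, self-contained Python sketch of that product for quaternions stored as (r, i, j, k) arrays; the function name is illustrative and not from the cited papers. The second call shows that the product is non-commutative.

```python
import numpy as np

def hamilton_product(p, q):
    """Hamilton product p ⊗ q of two quaternions given as (r, i, j, k) arrays."""
    pr, pi, pj, pk = p
    qr, qi, qj, qk = q
    return np.array([
        pr*qr - pi*qi - pj*qj - pk*qk,   # real part
        pr*qi + pi*qr + pj*qk - pk*qj,   # i part
        pr*qj - pi*qk + pj*qr + pk*qi,   # j part
        pr*qk + pi*qj - pj*qi + pk*qr,   # k part
    ])

p = np.array([1.0, 2.0, 3.0, 4.0])
q = np.array([0.5, -1.0, 0.0, 2.0])
print(hamilton_product(p, q))
print(hamilton_product(q, p))   # differs from the line above: non-commutative
```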
“…However, the covariance matrix is not able to recover the complete second-order statistics in the quaternion domain [4], and the decomposition requires heavy matrix calculations and computational time [18]. Another remarkable approach is introduced in [34], where the input is standardized by computing the average of the variances of the four components. Nevertheless, describing the second-order statistics of a signal in the quaternion domain requires meticulous computations, and the approach in [34] is an approximation of the complete variance.…”
Section: Quaternion Batch Normalization (mentioning, confidence: 99%)
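To make the contrast concrete, below is a hedged NumPy sketch of the variance-averaging standardisation that this excerpt attributes to [34]: each quaternion feature is centred per component and scaled by the average of its four component variances, an approximation that does not whiten the cross-component covariances. Function and argument names are illustrative, and learnable affine parameters are omitted.

```python
import numpy as np

def quaternion_batch_norm_approx(x, eps=1e-5):
    """Approximate quaternion batch normalisation.

    x: array of shape (batch, features, 4) with the (r, i, j, k) components
    on the last axis. Each feature is centred per component and rescaled by
    the average of its four component variances; no whitening of the
    cross-component covariances is performed, so this only approximates the
    full quaternion second-order statistics.
    """
    mean = x.mean(axis=0, keepdims=True)                               # per-feature, per-component mean
    var = x.var(axis=0, keepdims=True).mean(axis=-1, keepdims=True)    # average of the 4 variances
    return (x - mean) / np.sqrt(var + eps)

x = np.random.randn(256, 8, 4) * 2.0 + 1.0
y = quaternion_batch_norm_approx(x)
print(y.mean(), y.var())   # roughly 0 and 1 overall
```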
“…These advantages are due to the properties of quaternion algebra, including the Hamilton product that is used in quaternion convolutions. This has recently paved the way for the development of novel deep quaternion neural networks [11, 13, 14], often tailored to specific applications, including theme identification in telephone conversations [15], 3D sound event localization and detection [16, 17], heterogeneous image processing [18], and speech recognition [19]. Other properties of quaternion algebra that may be exploited in learning processes are related to the second-order statistics.…”
Section: Introduction (mentioning, confidence: 99%)
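As a companion to this excerpt, here is a minimal PyTorch sketch of how the Hamilton product can structure a quaternion convolution: four real kernels are assembled into one real convolution weight whose block pattern follows the Hamilton product, so a single quaternion kernel covers what would otherwise be sixteen independent real kernels. The class name, initialisation, and channel layout are illustrative assumptions, not the construction used in the cited works.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class QuaternionConv2d(nn.Module):
    """Quaternion 2-D convolution built from four real kernels (r, i, j, k).

    Channels are assumed grouped by component: first all real channels,
    then all i, j, and k channels. The real weight is assembled with the
    Hamilton-product block structure before a standard conv2d call.
    """

    def __init__(self, in_channels, out_channels, kernel_size, **conv_kwargs):
        super().__init__()
        assert in_channels % 4 == 0 and out_channels % 4 == 0
        shape = (out_channels // 4, in_channels // 4, kernel_size, kernel_size)
        self.r = nn.Parameter(torch.randn(shape) * 0.05)   # toy initialisation
        self.i = nn.Parameter(torch.randn(shape) * 0.05)
        self.j = nn.Parameter(torch.randn(shape) * 0.05)
        self.k = nn.Parameter(torch.randn(shape) * 0.05)
        self.conv_kwargs = conv_kwargs

    def forward(self, x):
        r, i, j, k = self.r, self.i, self.j, self.k
        # Block rows follow the Hamilton product: output (r, i, j, k) blocks
        # as combinations of the input (r, i, j, k) channel groups.
        weight = torch.cat([
            torch.cat([r, -i, -j, -k], dim=1),
            torch.cat([i,  r, -k,  j], dim=1),
            torch.cat([j,  k,  r, -i], dim=1),
            torch.cat([k, -j,  i,  r], dim=1),
        ], dim=0)
        return F.conv2d(x, weight, **self.conv_kwargs)

layer = QuaternionConv2d(4, 8, kernel_size=3, padding=1)
print(layer(torch.randn(2, 4, 16, 16)).shape)   # torch.Size([2, 8, 16, 16])
```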