2021
DOI: 10.1109/tpami.2019.2928806
Distributed Variational Representation Learning

Abstract: The problem of distributed representation learning is one in which multiple sources of information X1, ..., XK are processed separately so as to learn as much information as possible about some ground truth Y. We investigate this problem from information-theoretic grounds, through a generalization of Tishby's centralized Information Bottleneck (IB) method to the distributed setting. Specifically, K encoders, K ≥ 2, compress their observations X1, ..., XK separately in a manner such that, collectively, t…
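To make the abstract's setup concrete, below is a minimal sketch (not the authors' released code) of how a distributed variational IB objective might be written in PyTorch: each of K encoders maps its own view Xk to a stochastic representation Zk, and a single decoder predicts Y from all representations. The Gaussian encoder parameterization, the standard-normal prior, the module names, and the trade-off weight beta are all illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GaussianEncoder(nn.Module):
    """Variational encoder q(z|x) = N(mu(x), diag(exp(log_var(x)))). Illustrative only."""
    def __init__(self, x_dim, z_dim):
        super().__init__()
        self.net = nn.Linear(x_dim, 2 * z_dim)

    def forward(self, x):
        mu, log_var = self.net(x).chunk(2, dim=-1)
        # Reparameterization trick: sample z while keeping gradients w.r.t. mu, log_var
        z = mu + torch.exp(0.5 * log_var) * torch.randn_like(mu)
        # Closed-form KL(q(z|x) || N(0, I)), summed over latent dimensions
        kl = 0.5 * (mu.pow(2) + log_var.exp() - log_var - 1).sum(dim=-1)
        return z, kl

def distributed_vib_loss(encoders, decoder, xs, y, beta=1e-3):
    """Sketch of a distributed variational IB loss for K separately observed views xs[k]."""
    zs, kls = zip(*[enc(x) for enc, x in zip(encoders, xs)])
    logits = decoder(torch.cat(zs, dim=-1))           # q(y | z_1, ..., z_K)
    relevance = F.cross_entropy(logits, y)            # variational bound related to -I(Y; Z_1..Z_K)
    complexity = torch.stack(kls, dim=0).sum(dim=0)   # sum of per-encoder rate (KL) terms
    return relevance + beta * complexity.mean()
```

The scalar beta trades prediction quality against the per-encoder complexity terms, mirroring in spirit the relevance/complexity trade-off of the distributed IB formulation described in the abstract.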

Cited by 53 publications (49 citation statements) | References 51 publications
“…We note that although it was assumed that the number of classes was known beforehand (as was the case for almost all competing algorithms in its category), that number could be found (or estimated to within a certain accuracy) through inspection of the resulting bifurcations on the associated information plane, as was observed for the standard information bottleneck method. Finally, we mention that among the interesting research directions in this line of work, one important question pertains to the distributed learning setting, i.e., the counterpart, for the unsupervised setting, of the recent work [31, 32, 33], which contained distributed IB algorithms for both discrete and vector Gaussian data models.…”
Section: Discussion
confidence: 99%
“…The equality in (6) holds due to I(X; Z) = H(X) − H(X|Z) and the fact that H(X) can be skipped because it does not depend on the mapping p(z|y). The comparison of (5) and (6) with (2) reveals that the IB approach is a special formulation of the remote sensing problem using the logarithmic loss function d(x, z) = −log p(x|z) as a distortion measure, whose expectation is H(X|Z) = E_{X,Z}[−log p(x|z)] [13], [20]. In this case, distortion minimization means maximization of the relevant mutual information I(X; Z) for given H(X).…”
Section: B Information Bottleneck Methods
confidence: 99%
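For readers following the quoted argument, a short restatement of the two identities it relies on, written in the same notation; this is standard information-theoretic bookkeeping, not a quote from the cited paper:

```latex
\begin{aligned}
\mathbb{E}_{X,Z}\!\left[-\log p(x \mid z)\right] &= H(X \mid Z), \\
I(X;Z) &= H(X) - H(X \mid Z).
\end{aligned}
```

Since H(X) is fixed by the source and does not depend on the mapping, minimizing the expected log-loss distortion E[d(x, z)] with d(x, z) = −log p(x|z) is the same as minimizing H(X|Z), i.e., maximizing the relevant mutual information I(X; Z).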
“…Although initiated in different areas, a tight connection between the CEO problem and the IB framework exists. For the logarithmic loss function as a distortion measure, the CEO problem can be formulated as a distributed IB problem [20]. Meanwhile, a rich set of IB applications can be found in communications [21]–[26].…”
Section: Information Bottleneck
confidence: 99%
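As a schematic view of the connection the quote points to (following the commonly used distributed IB formulation, not a verbatim statement from reference [20]): with log-loss distortion at the CEO, the problem becomes one of maximizing the relevance of the K separately encoded descriptions subject to per-encoder rate constraints,

```latex
\max_{\{p(z_k \mid x_k)\}_{k=1}^{K}} \; I(Y; Z_1,\dots,Z_K)
\quad \text{s.t.} \quad I(X_k; Z_k) \le R_k, \qquad k = 1,\dots,K,
```

where Y is the hidden source observed noisily as X1, ..., XK, and Zk is the description produced by encoder k.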
“…We use the Information Bottleneck (IB) principle presented in [6] to build the theory behind centralized and decentralized classification models. The analysis of the supervised and unsupervised information bottleneck problems was performed in [23] and generalized to the distributed setup in [24]. In this work, we extend the IBN to demonstrate the importance of compression in the form of vector quantization for the classification problem.…”
Section: Related Work
confidence: 99%
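As a complement to the quoted remark on compression via vector quantization, here is a minimal sketch (illustrative, not the cited paper's model) of a nearest-codeword quantizer with a straight-through gradient estimator, the usual device when a quantized representation feeds a downstream classifier:

```python
import torch
import torch.nn as nn

class VectorQuantizer(nn.Module):
    """Nearest-codeword quantization with a straight-through estimator. Illustrative only."""
    def __init__(self, num_codes, code_dim):
        super().__init__()
        self.codebook = nn.Embedding(num_codes, code_dim)

    def forward(self, z):
        # Pairwise Euclidean distances between inputs (batch, code_dim) and codewords
        dists = torch.cdist(z, self.codebook.weight)   # (batch, num_codes)
        idx = dists.argmin(dim=-1)                     # index of nearest codeword
        z_q = self.codebook(idx)                       # quantized representation
        # Straight-through: forward pass uses z_q, backward pass copies gradients to z
        return z + (z_q - z).detach(), idx
```

The codebook size controls the rate of the discrete representation, which is the compression knob the quoted statement is concerned with.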