2017
DOI: 10.48550/arxiv.1711.04366
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Parameter Estimation in Finite Mixture Models by Regularized Optimal Transport: A Unified Framework for Hard and Soft Clustering

Abstract: In this short paper, we formulate parameter estimation for finite mixture models in the context of discrete optimal transportation with convex regularization. The proposed framework unifies hard and soft clustering methods for general mixture models. It also generalizes the celebrated k-means and expectation-maximization algorithms in relation to associated Bregman divergences when applied to exponential family mixture models.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(11 citation statements)
references
References 47 publications
0
11
0
Order By: Relevance
“…Many methods that combine OT and clustering [34,35,36,37] focus on using the Wasserstein distance to identify barycenters that serves as the centroid of clusters. When finding barycenters for the source and target separately, this could be treated as the case of LOT where C z = 0 while C x , C y are defined using a squared L2 distance.…”
Section: Relationship To Ot-based Clustering Methodsmentioning
confidence: 99%
“…Many methods that combine OT and clustering [34,35,36,37] focus on using the Wasserstein distance to identify barycenters that serves as the centroid of clusters. When finding barycenters for the source and target separately, this could be treated as the case of LOT where C z = 0 while C x , C y are defined using a squared L2 distance.…”
Section: Relationship To Ot-based Clustering Methodsmentioning
confidence: 99%
“…the bibliography in [42] and Section 2 below, these papers concern the discrete Schödinger bridge problem [48,34]. Hardly any attention, however, has been given to the case when only samples of continuous marginals are available (one exception is [28] which deals with using regularized optimal transport for hard and soft clustering). One might think that the latter case may be readily treated by discretizing the spatial variables through grids.…”
Section: Prior Workmentioning
confidence: 99%
“…while the Fokker-Planck equation (27b) has been replaced by the continuity equation (31b). Both Problems 6 and 7 can be thought of as regularizations of the Benamou-Brenier problem (28) and as dynamic counterparts of (9). Also notice that, precisely as in Problem (28), the optimal current velocity (29) in Problem 7 is of the gradient type.…”
Section: Stochastic Control and Fluid-dynamic Formulationsmentioning
confidence: 99%
“…the Euclidean distance), Wasserstein distance defines a distance between two measures as the minimal transportation cost between them. This notion of distance leads to a host of important applications, including text classification [28], clustering [23,24,14], unsupervised learning [21], semi-supervised learning [44], statistics [36,37,46,19], and others [5,39,45]. Given a set of measures in the same space, the 2-Wasserstein barycenter is defined as the measure minimizing the sum of squared 2-Wasserstein distances to all measures in the set.…”
Section: Introductionmentioning
confidence: 99%
“…The Wasserstein barycenter better captures the underlying geometric structure than the barycenter defined by the Euclidean or other distances. As a result, the Wasserstein barycenter has applications in clustering [23,24,14], image retrieval [13] and others [30,41].…”
Section: Introductionmentioning
confidence: 99%