2007
DOI: 10.1109/tpami.2007.1085

Segmentation of Multivariate Mixed Data via Lossy Data Coding and Compression

Abstract: In this paper, based on ideas from lossy data coding and compression, we present a simple but effective technique for segmenting multivariate mixed data that are drawn from a mixture of Gaussian distributions, which are allowed to be almost degenerate. The goal is to find the optimal segmentation that minimizes the overall coding length of the segmented data, subject to a given distortion. By analyzing the coding length/rate of mixed data, we formally establish some strong connections of data segmentation to m…

Cited by 406 publications (304 citation statements)
References: 30 publications
“…Note that the first term of the equation stands for the coding length required to code W, and the second term of the equation stands for the additional coding length of the mean vector. The equation also gives a good upper bound for degenerated Gaussian data or subspace-like data [15]. Moreover, the coding length has proven to be effective for clustering [16] and classification [15].…”
Section: Lossy Coding Length Of Multivariate Gaussian Data
Citation type: mentioning
confidence: 98%
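
The two-term structure described in the statement above can be illustrated with a minimal sketch. The quoted excerpt does not reproduce the equation itself, so the form used here is an assumption: the commonly stated lossy coding length for n-dimensional data W with m samples, mean μ, and distortion ε, namely (m+n)/2 · log2 det(I + n/(ε²m) · W̄W̄ᵀ) + n/2 · log2(1 + μᵀμ/ε²), where W̄ is the mean-subtracted data. The function name `lossy_coding_length` and its arguments are hypothetical and chosen only for illustration.

```python
import numpy as np


def lossy_coding_length(W, eps):
    """Approximate number of bits needed to code the columns of W up to distortion eps.

    W   : (n, m) array, one n-dimensional sample per column.
    eps : allowed distortion (assumed > 0).

    Assumed formula (not taken from the quoted excerpt):
      (m + n)/2 * log2 det(I + n/(eps^2 m) * W_bar W_bar^T)   <- codes the centered data
      + n/2 * log2(1 + mu^T mu / eps^2)                      <- codes the mean vector
    """
    n, m = W.shape
    mu = W.mean(axis=1, keepdims=True)   # sample mean, shape (n, 1)
    W_bar = W - mu                       # mean-subtracted data

    # I + n/(eps^2 m) * W_bar W_bar^T is positive definite, so the log-determinant
    # is well defined even when W_bar is rank-deficient (degenerate data).
    gram = np.eye(n) + (n / (eps ** 2 * m)) * (W_bar @ W_bar.T)
    _, logdet = np.linalg.slogdet(gram)  # natural-log determinant

    data_bits = 0.5 * (m + n) * logdet / np.log(2.0)
    mean_bits = 0.5 * n * np.log2(1.0 + float(np.sum(mu ** 2)) / eps ** 2)
    return data_bits + mean_bits


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    full_rank = rng.normal(size=(3, 50))                               # generic 3-D Gaussian samples
    degenerate = np.outer([1.0, 2.0, -1.0], rng.normal(size=50))       # samples on a 1-D subspace
    print(lossy_coding_length(full_rank, eps=0.5))
    print(lossy_coding_length(degenerate, eps=0.5))
```

Because the identity matrix is added before taking the determinant, the sketch stays finite for the rank-deficient input, which is consistent with the statement's remark about degenerate or subspace-like data.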
“…The equation also gives a good upper bound for degenerated Gaussian data or subspace-like data [15]. Moreover, the coding length has proven to be effective for clustering [16] and classification [15].…”
Section: Lossy Coding Length Of Multivariate Gaussian Data
Citation type: mentioning
confidence: 98%
“…The segmented images are expected to consist of regions within which the image content is homogeneous, while the contrast between neighboring regions is high. Typical methods falling into this category include region growing, watershed, some MRF-based methods [3], mean-shift [9] and the recently presented lossy data compression-based approach [10]. Segmentation methods based on the boundary or edge information are designed to exploit the discontinuity of the image features, such as difference in texture or pixel intensity, on the two sides of the boundary.…”
Section: Boundary (Edge) Information
Citation type: mentioning
confidence: 99%