2010
DOI: 10.1007/978-3-642-14980-1_32
|View full text |Cite
|
Sign up to set email alerts
|

Non-parametric Mixture Models for Clustering

Abstract: Abstract. Mixture models have been widely used for data clustering. However, commonly used mixture models are generally of a parametric form (e.g., mixture of Gaussian distributions or GMM), which significantly limits their capacity in fitting diverse multidimensional data distributions encountered in practice. We propose a non-parametric mixture model (NMM) for data clustering in order to detect clusters generated from arbitrary unknown distributions, using non-parametric kernel density estimates. The propose… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 20 publications
(17 citation statements)
references
References 15 publications
0
17
0
Order By: Relevance
“…In both cases, each observation is assumed to be drawn randomly from one of several populations. However, in clustering the probability density functions of the populations are often clearly differentiated (Mallapragada, Jin, and Jain 2010), whereas in our problem these functions do not even have to be different. In addition, in clustering, the likelihood that an observation belongs to a specific population is unknown, and identifying the population to which a particular observation belongs is of interest.…”
Section: Introductionmentioning
confidence: 85%
“…In both cases, each observation is assumed to be drawn randomly from one of several populations. However, in clustering the probability density functions of the populations are often clearly differentiated (Mallapragada, Jin, and Jain 2010), whereas in our problem these functions do not even have to be different. In addition, in clustering, the likelihood that an observation belongs to a specific population is unknown, and identifying the population to which a particular observation belongs is of interest.…”
Section: Introductionmentioning
confidence: 85%
“…2 Table I summarizes the statistics of these datasets. We use the Gaussian kernel with kernel width set to be 5th percentile of the pairwise distances [Mallapragada et al 2010;Abrahamsen and Hansen 2011]. The column σ is the kernel width for each dataset.…”
Section: Methodsmentioning
confidence: 99%
“…Unlike the parametric methods nonparametric density based clustering methods do not choose particular density function but normally use kernel density estimate with kernels one one's choice. The Gaussian kernel has so far proved to be the commonly used [12,13,14,15]. General researchers use the multidimensional kernel density estimator with adaptive bandwidth suggested by [16] and represented as equation (21):…”
Section: Nonparametric Density Based Clusteringmentioning
confidence: 99%
“…Several authors have represented nonparametric mixture models in different ways with the general assumption that each cluster is generated by its own unknown density function [15]. Nonparametric mixture models can therefore also mean that no assumptions are made about the form of the density f j of model (1.1), even though the weights πj maybe scalar parameters as alluded by [17].…”
Section: Nonparametric Density Based Clusteringmentioning
confidence: 99%