2013
DOI: 10.1186/1471-2105-14-213
|View full text |Cite
|
Sign up to set email alerts
|

PFClust: a novel parameter free clustering algorithm

Abstract: BackgroundWe present the algorithm PFClust (Parameter Free Clustering), which is able automatically to cluster data and identify a suitable number of clusters to group them into without requiring any parameters to be specified by the user. The algorithm partitions a dataset into a number of clusters that share some common attributes, such as their minimum expectation value and variance of intra-cluster similarity. A set of n objects can be clustered into any number of clusters from one to n, and there are many… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 16 publications
(20 citation statements)
references
References 27 publications
0
20
0
Order By: Relevance
“…Figure  2 shows an example family from ChEMBL, one of the Androgen Receptor families (ChEMBL1871), with a number of different clusters of compounds. Splitting such a family into smaller groups based on ligand structure will allow us to identify the different sets of ligands; therefore PFClust [24] (brief description in Additional file 1) was applied to all the filtered ChEMBL families. We selected the PFClust algorithm because it is a parameter free clustering algorithm and does not require any kind of parameter tuning.…”
Section: Methodsmentioning
confidence: 99%
“…Figure  2 shows an example family from ChEMBL, one of the Androgen Receptor families (ChEMBL1871), with a number of different clusters of compounds. Splitting such a family into smaller groups based on ligand structure will allow us to identify the different sets of ligands; therefore PFClust [24] (brief description in Additional file 1) was applied to all the filtered ChEMBL families. We selected the PFClust algorithm because it is a parameter free clustering algorithm and does not require any kind of parameter tuning.…”
Section: Methodsmentioning
confidence: 99%
“…This leads to a set of refined families, each consisting of a group of molecules, which share similar chemical structure and bioactivity. The refined families of the ChEMBL dataset will allow us to identify the different sets of ligands [56,58,61].…”
Section: Filtered and Refined Families Of The Chembl Datasetmentioning
confidence: 99%
“…Using these position vectors for each compound, we calculated the Euclidean distances between the resulting points and a similarity matrix was created. Finally, we clustered the vectors using PFClust [61].…”
Section: Identifying the Off-targets Of The Novel Multipotent Compoundsmentioning
confidence: 99%
See 1 more Smart Citation
“…Moreover, the abovementioned evaluation indices are distance-based measures; therefore, they can only evaluate the qualities of spherical clusters and cannot be used for arbitrary-shaped clusters. In [15], Mavridis et al proposed the algorithm PFClust (Parameter Free Clustering). e term "parameter free" means that the algorithm can automatically determine the number of clusters without requiring any user-defined parameters.…”
Section: Introductionmentioning
confidence: 99%