2020
DOI: 10.1093/bioinformatics/btaa742
|View full text |Cite
|
Sign up to set email alerts
|

CLoNe: automated clustering based on local density neighborhoods for application to biomolecular structural ensembles

Abstract: Motivation Proteins are intrinsically dynamic entities. Flexibility sampling methods, such as molecular dynamics or those arising from integrative modeling strategies are now commonplace and enable the study of molecular conformational landscapes in many contexts. Resulting structural ensembles increase in size as technological and algorithmic advancements take place, making their analysis increasingly demanding. In this regard, cluster analysis remains a go-to approach for their classificati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
21
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 10 publications
(21 citation statements)
references
References 60 publications
0
21
0
Order By: Relevance
“…While the algorithm is fairly robust to cutoff choice, a list-position based cutoff may present issues with clusters of varying densities. 35 In order to include information from all data points, while minimizing user input, for all segment-based clustering trials, the kernel density estimator cutoff was set to the average distance to the ln( N )-th nearest neighbor, where N is the number of trajectory segments considered. This choice of cutoff was motivated by the idea that the number of nearest neighbors k ( N ) must adapt to the underlying data distribution as the number of samples N → ∞.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…While the algorithm is fairly robust to cutoff choice, a list-position based cutoff may present issues with clusters of varying densities. 35 In order to include information from all data points, while minimizing user input, for all segment-based clustering trials, the kernel density estimator cutoff was set to the average distance to the ln( N )-th nearest neighbor, where N is the number of trajectory segments considered. This choice of cutoff was motivated by the idea that the number of nearest neighbors k ( N ) must adapt to the underlying data distribution as the number of samples N → ∞.…”
Section: Methodsmentioning
confidence: 99%
“…In the original implementation, this value is set to a fixed (second) percentile of the sorted list of pairwise distances. While the algorithm is fairly robust to cutoff choice, a list-position based cutoff may present issues with clusters of varying densities . In order to include information from all data points, while minimizing user input, for all segment-based clustering trials, the kernel density estimator cutoff was set to the average distance to the ln­( N )-th nearest neighbor, where N is the number of trajectory segments considered.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations