2022
DOI: 10.1186/s12859-022-04769-w
|View full text |Cite
|
Sign up to set email alerts
|

Euclidean distance-optimized data transformation for cluster analysis in biomedical data (EDOtrans)

Abstract: Background Data transformations are commonly used in bioinformatics data processing in the context of data projection and clustering. The most used Euclidean metric is not scale invariant and therefore occasionally inappropriate for complex, e.g., multimodal distributed variables and may negatively affect the results of cluster analysis. Specifically, the squaring function in the definition of the Euclidean distance as the square root of the sum of squared differences between data points has th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
7
2
1

Relationship

2
8

Authors

Journals

citations
Cited by 20 publications
(6 citation statements)
references
References 29 publications
0
6
0
Order By: Relevance
“…3 and 4). The Euclidean distance has been utilized to analyze the dissimilarity in cluster analyses of biomedical data 14 . We calculated the Z-score-normalized Euclidean distance.…”
Section: Resultsmentioning
confidence: 99%
“…3 and 4). The Euclidean distance has been utilized to analyze the dissimilarity in cluster analyses of biomedical data 14 . We calculated the Z-score-normalized Euclidean distance.…”
Section: Resultsmentioning
confidence: 99%
“…To check the similarity and dissimilarity of transcriptomic effects of drugs, we utilized multidimensional scaling analysis [35] with edgeR [36] and limma [37] packages in R programming language. Distances of the drugs’ transcriptomic data on dimensional plot were calculated based on the Euclidean distance algorithm [38,39].…”
Section: Methodsmentioning
confidence: 99%
“…The Euclidean distance is the prevalent measure of the distance between two points. If the coordinates represent measured values, random fluctuations of varying magnitude will likely occur [33]. In such cases, the rational approach is to weight the coordinates so that the coordinates that change more have less weight than those that change less.…”
Section: Improved Extend Kalman Filtermentioning
confidence: 99%