2006
DOI: 10.1109/iembs.2006.4398006
|View full text |Cite
|
Sign up to set email alerts
|

A Cluster-Based Approach for Semantic Similarity in the Biomedical Domain

Abstract: Abstract-We propose a new cluster-based semantic similarity/distance measure for the biomedical domain within the framework of UMLS. The proposed measure is based mainly on the cross-modified path length feature between the concept nodes, and two new features: (1) the common specificity of two concept nodes, and (2) the local granularity of the clusters. We also applied, for comparison purpose, five existing general English ontology-based similarity measures into the biomedical domain within UMLS. The proposed… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
46
0

Year Published

2011
2011
2015
2015

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(46 citation statements)
references
References 7 publications
0
46
0
Order By: Relevance
“…Let | path ( x i , x j )| ≥ 0 be the length of this path. In order to quantify a semantic dissimilarity value between x i and x j , an intuitive method has been originally proposed in (Al-Mubaid and Nguyen, 2006). It relies on a cluster-based strategy that combines the minimum path length between the semantic terms and the taxonomical depth of the considered branches.…”
Section: Methodsmentioning
confidence: 99%
“…Let | path ( x i , x j )| ≥ 0 be the length of this path. In order to quantify a semantic dissimilarity value between x i and x j , an intuitive method has been originally proposed in (Al-Mubaid and Nguyen, 2006). It relies on a cluster-based strategy that combines the minimum path length between the semantic terms and the taxonomical depth of the considered branches.…”
Section: Methodsmentioning
confidence: 99%
“…To this end, we define α ( x i , x j ) as α(xi,xj)=α(xj,xi)={0ifxi=xjSϴ(xi,xj)otherwisetrue} where s Θ is an extension of the semantic dissimilarity measure proposed by Al-Mubaid and Nguyen in [37]. The original version of this measure and its proposed extension are presented hereinafter.…”
Section: Methodsmentioning
confidence: 99%
“…Consequently, most of the proposed measures act by counting the number of taxonomic links from each term to their Least Common Subsumer (LCS) ( i.e ., the most concrete taxonomical ancestor that subsumes these two terms) and also the number of links of the LCS to the root of the ontology. Among them, Al-Mubaid and Nguyen have proposed in [37] a measure based on a cluster-based strategy that combines both the minimum path length and the taxonomical depth of the considered branches. The definition of this measure was also extended to deal with terms belonging simultaneously to multiple ontologies [38], enabling to evaluate the term similarity from complementary sources of knowledge.…”
Section: Linguistic Proximity and Semantic Distancesmentioning
confidence: 99%
“…Some of the most common methods rely on edge counting, shortest path, and ontological depth [6,14,15], while others add the least common subsumer (LCS) to capture the granularity of a concept in the ontology [16,17]. More recent advances have incorporated into similarity computation the distance to the LCS, assigning weights to the different path types (ontological depth, distance from concepts to LCS) [18], as well as all of the super concepts between two terms as a way to account for multiple inheritances [19].…”
Section: Introductionmentioning
confidence: 99%