2011
DOI: 10.1093/bib/bbr066
|View full text |Cite
|
Sign up to set email alerts
|

Semantic similarity analysis of protein data: assessment with biological features and issues

Abstract: The integration of proteomics data with biological knowledge is a recent trend in bioinformatics. A lot of biological information is available and is spread on different sources and encoded in different ontologies (e.g. Gene Ontology). Annotating existing protein data with biological information may enable the use (and the development) of algorithms that use biological ontologies as framework to mine annotated data. Recently many methodologies and algorithms that use ontologies to extract knowledge from data, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
160
0
4

Year Published

2013
2013
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 192 publications
(165 citation statements)
references
References 37 publications
1
160
0
4
Order By: Relevance
“…Several surveys related to the usage of semantic measures in the biomedical domain underline the diversity of their applications, e.g. for diagnosis, disease classification, drug design and gene analysis (Pedersen et al, 2007;Pesquita et al, 2009a;Guzzi et al, 2012). As an illustration, here we focus on applications related to studies on the Gene Ontology (GO) (Ashburner et al, 2000).…”
Section: Biomedical Informatics and Bioinformaticsmentioning
confidence: 99%
See 1 more Smart Citation
“…Several surveys related to the usage of semantic measures in the biomedical domain underline the diversity of their applications, e.g. for diagnosis, disease classification, drug design and gene analysis (Pedersen et al, 2007;Pesquita et al, 2009a;Guzzi et al, 2012). As an illustration, here we focus on applications related to studies on the Gene Ontology (GO) (Ashburner et al, 2000).…”
Section: Biomedical Informatics and Bioinformaticsmentioning
confidence: 99%
“…In other cases, semantic measures will be evaluated by analysing the correlation between the semantic similarity of gene conceptual annotations (groups of concepts defined into an ontology) and the score of similarity of gene DNA sequences. Please refer to the work of Guzzi et al (2012) and Pesquita et al (2009a,b) for details related to evaluations in Bioinformatics.…”
Section: Other Evaluations Not Based On Human Ratingsmentioning
confidence: 99%
“…Guzzi et al [ 9 ] have identifi ed several issues affecting SS measures, which they categorize into external issues, which are usually related to annotation corpora, and internal issues, inherent to the design of the measures. They do however recognize that both kinds of issues can be entangled, for instance when measures make erroneous assumptions about the corpora.…”
Section: Issues and Challenges In Ssmentioning
confidence: 99%
“…These include sequence similarity, family similarity, protein-protein interactions, functional modules and complexes, and expression profi le similarity. [ 9 ] GOA-based IC measures were regarded as the best performing measures for most settings, the new wave of more complex structural-based measures, such as SSDD [ 13 ], SORA [ 23 ] and TCSS [ 1 ] are now on the lead, though closely followed by SimGIC. SSDD is based on the concept of semantic "totipotency" whereby terms are assigned values according to their distance to the root and the number of descendants for each of the levels in that path, and then similarity corresponds to the smallest sum of "totipotencies" along a path between two terms.…”
Section: Evaluating and Comparing Ss Measuresmentioning
confidence: 99%
See 1 more Smart Citation