2005
DOI: 10.1002/cfg.451
|View full text |Cite
|
Sign up to set email alerts
|

Towards a semantic lexicon for biological language processing

Abstract: This paper explores the use of the resources in the National Library of Medicine's Unified Medical Language System (UMLS) for the construction of a lexicon useful for processing texts in the field of molecular biology. A lexicon is constructed from overlapping terms in the UMLS SPECIALIST lexicon and the UMLS Metathesaurus to obtain both morphosyntactic and semantic information for terms, and the coverage of a domain corpus is assessed. Over 77% of tokens in the domain corpus are found in the constructed lexic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2005
2005
2015
2015

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 4 publications
0
5
0
Order By: Relevance
“…The low overlap between UMLS and PubMed text has led to a few efforts for enriching controlled vocabularies. Mostly, it has been done by either filtering UMLS terms [ 21 , 27 , 29 , 34 ] or reclassifying UMLS concepts [ 35 , 36 ] for NLP problems. Bodenreider et al [ 37 ], however, suggested an idea of using adjectival modifiers and demodified terms to extend the UMLS Metathesaurus.…”
Section: Introductionmentioning
confidence: 99%
“…The low overlap between UMLS and PubMed text has led to a few efforts for enriching controlled vocabularies. Mostly, it has been done by either filtering UMLS terms [ 21 , 27 , 29 , 34 ] or reclassifying UMLS concepts [ 35 , 36 ] for NLP problems. Bodenreider et al [ 37 ], however, suggested an idea of using adjectival modifiers and demodified terms to extend the UMLS Metathesaurus.…”
Section: Introductionmentioning
confidence: 99%
“…It has been previously noted that terms in structured controlled vocabularies, such as the Gene Ontology (GO) (The Gene Ontology Consortium, 2000 ), often have a highly regular, even compositional, linguistic structure and that this structure can be exploited for the purposes of accessing those terms computationally and reasoning over them (Mungall, 2004 ; Ogren et al , 2004 ; Verspoor, 2005 ). This regularity is particularly important now that there are efforts to perform intra- or inter-ontology enrichment by linking terms (Bada and Hunter, 2008 ), because the tools that are used to support these efforts analyze the formal structure of the terms and take advantage of patterns of expression.…”
Section: Introductionmentioning
confidence: 99%
“…Two of the major approaches are preparing sharable semantic lexica and preparing sharable semantic grammar rules:

To create a semantic lexicon especially for processing discharge summaries, Johnson [28] proposed associating the Specialist lexemes via Metathesaurus concepts to appropriate UMLS semantic types. Similarly, Verspoor [29] created a semantic lexicon for processing biological literature.

As early as late 60s, Pratt & Pacak [30] already proposed intricate syntacto-semantic grammars that incorporated the semantic classes ( Etiology , Function , General , Morphology , and Topography ) of the Systematized Nomenclature of Pathology (SNOP) [31], the precursor of the SNOMED [32]. Decades later, Do Amaral Marcio & Satomura [33] picked up the idea again and incorporated SNOMED semantic classes into a syntacto-semantic grammar.
…”
Section: Related Workmentioning
confidence: 99%