2005
DOI: 10.1016/j.ijmedinf.2004.03.010
|View full text |Cite
|
Sign up to set email alerts
|

UMLF: a unified medical lexicon for French

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0
2

Year Published

2007
2007
2017
2017

Publication Types

Select...
5
2
2

Relationship

2
7

Authors

Journals

citations
Cited by 35 publications
(26 citation statements)
references
References 4 publications
0
24
0
2
Order By: Relevance
“…This method applied an exact-match strategy based on a French UMLS dictionary and a lexicon derived from small samples of previously manually annotated documents. The pre-annotation process consisted of the following steps: sentence segmentation and tokenization, lemmatization with the French lemmatizer Flemm (Namer 2004), generation of spelling and derivational variants (using the Unified Medical Lexicon for French, UMLF (Zweigenbaum et al 2005)), application of regular expressions to detect measurements (e.g., 3 cm) and durations (e.g., 2 weeks), and matching with the two lexicons. This matching was first applied to the original token and then to the lemma and variants when no match was found.…”
Section: Pre-annotation Methodsmentioning
confidence: 99%
“…This method applied an exact-match strategy based on a French UMLS dictionary and a lexicon derived from small samples of previously manually annotated documents. The pre-annotation process consisted of the following steps: sentence segmentation and tokenization, lemmatization with the French lemmatizer Flemm (Namer 2004), generation of spelling and derivational variants (using the Unified Medical Lexicon for French, UMLF (Zweigenbaum et al 2005)), application of regular expressions to detect measurements (e.g., 3 cm) and durations (e.g., 2 weeks), and matching with the two lexicons. This matching was first applied to the original token and then to the lemma and variants when no match was found.…”
Section: Pre-annotation Methodsmentioning
confidence: 99%
“…Zweigenbaum et al . [22] worked on a project to pool lexical resources scattered among several sources in a unified medical lexicon for French (UMLF). Namer [23] described a method which enables neoclassical compound nouns and adjectives of a biomedical specialized corpus to be automatically related by synonymy, hyponymy and approximation links.…”
Section: Introductionmentioning
confidence: 99%
“…), the OCR quality of the OCRed documents decreases substantially for the older documents. In a test set of 100 randomly selected documents from the corpus, we found that 16.4% of the words 3 did not appear in the Unified Medical Lexicon for French (Zweigenbaum et al, 2005), a word list with specific technical terms. Of these 16.4%, 3.8% pertained to words that were domain-specific terms that has been correctly identified in the OCR process but which did not feature in the UMLF, and 10.8% were words which contained at least one OCR error.…”
Section: Ocr Quality In Corpusmentioning
confidence: 97%