Researchers who use phenotypic data from diverse sources need to match phenotypes to standard clinical vocabularies. This mapping can be difficult: existing tools are often incomplete, hard to access, and cumbersome to use, especially for non-experts. We created WikiMedMap, a simple tool that uses Wikipedia to map phenotype strings to standard clinical vocabularies. We assessed WikiMedMap by mapping phenotype strings from UK Biobank questionnaires and from Mendelian diseases in the Online Mendelian Inheritance in Man (OMIM) database to eight vocabularies, including International Classification of Diseases, Ninth Revision (ICD-9), ICD-10, ICD-O, Medical Subject Headings (MeSH), OMIM, Disease Database, and MedlinePlus. WikiMedMap outperformed conventional mapping tools in finding potential matches for phenotype strings. We envision WikiMedMap as a technique that complements established tools by mapping strings to clinical vocabularies that usually do not coexist in one source.
Word Count: 2650 words