2020
DOI: 10.1200/cci.19.00124
|View full text |Cite
|
Sign up to set email alerts
|

Enhancing the Quality of Hierarchic Relations in the National Cancer Institute Thesaurus to Enable Faceted Query of Cancer Registry Data

Abstract: PURPOSE To audit and improve the completeness of the hierarchic (or is-a) relations of the National Cancer Institute (NCI) Thesaurus to support its role as a faceted system for querying cancer registry data. METHODS We performed quality auditing of the 19.01d version of the NCI Thesaurus. Our hybrid auditing method consisted of three main steps: computing nonlattice subgraphs, constructing lexical features for concepts in each subgraph, and performing subsumption reasoning with each subgraph to automatically s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
5
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 8 publications
0
5
0
Order By: Relevance
“…In our previous work [ 28 ], we developed a lexical-based approach to identify missing IS-A relations in the NCI Thesaurus. The lexical feature used in that work was the enriched bag-of-words (i.e., in this paper).…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…In our previous work [ 28 ], we developed a lexical-based approach to identify missing IS-A relations in the NCI Thesaurus. The lexical feature used in that work was the enriched bag-of-words (i.e., in this paper).…”
Section: Discussionmentioning
confidence: 99%
“…EVS experts confirmed 73 out of 253 suggested missing IS-A relations. We compared our hybrid approach in this paper with the lexical-based approach in previous work [ 28 ] in two aspects.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…It includes tools that support cancer research at molecular, 3-6a cellular, 7 tissue, 8-10 organ, [11][12][13][14][15] individual, [16][17][18][19] and population [20][21][22][23] levels. The tools described also support a range of cancer informatics and data science functions, including data integration, 13,[24][25][26][27][28][29] data curation, 30 deep learning, 9 information retrieval, 20,31,32 natural language processing, 22 and statistical analysis. 5,6 A catalog of all available ITCR tools can be found on the ITCR website 33…”
mentioning
confidence: 99%