Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019) 2019
DOI: 10.18653/v1/d19-6220
|View full text |Cite
|
Sign up to set email alerts
|

Ontological attention ensembles for capturing semantic concepts in ICD code prediction from clinical text

Abstract: We present a semantically interpretable system for automated ICD coding of clinical text documents. Our contribution is an ontological attention mechanism which matches the structure of the ICD ontology, in which shared attention vectors are learned at each level of the hierarchy, and combined into label-dependent ensembles. Analysis of the attention heads shows that shared concepts are learned by the lowest common denominator node. This allows child nodes to focus on the differentiating concepts, leading to e… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 21 publications
(20 citation statements)
references
References 10 publications
0
19
0
Order By: Relevance
“…To alleviate the burden of the status quo of manual coding, several Machine learning (ML) automated coding models have been developed (Larkey and Croft, 1996;Aronson et al, 2007;Farkas and Szarvas, 2008;Perotte et al, 2014;Ayyar et al, 2016;Baumel et al, 2018;Mullenbach et al, 2018;Falis et al, 2019). However, despite continued interest, translation of ML systems into real-world deployments has been limited.…”
Section: Introductionmentioning
confidence: 99%
“…To alleviate the burden of the status quo of manual coding, several Machine learning (ML) automated coding models have been developed (Larkey and Croft, 1996;Aronson et al, 2007;Farkas and Szarvas, 2008;Perotte et al, 2014;Ayyar et al, 2016;Baumel et al, 2018;Mullenbach et al, 2018;Falis et al, 2019). However, despite continued interest, translation of ML systems into real-world deployments has been limited.…”
Section: Introductionmentioning
confidence: 99%
“…Out of the 256 publications, we excluded 65 publications, as the described Natural Language Processing algorithms in those publications were not evaluated. The full text of the remaining 191 publications was assessed and 114 a As reported by authors [11,30,33,35,37,41,44,46,49,55,58,64,68,69,72,73,75,76,83,87,90,95,98,100,104] Included reference to dataset 21 (27%) [11,30,35,37,41,44,46,49,55,58,64,72,75,76,83,87,90,95,98,100,104] Training of algorithm Trained 47 (61%) [11, 12, 29, 31, 32, 34, 37, 39, 41, 42, 44, 45, 48-53, 55-59, 62, 63, 65, 66, 68, 69, 73, 74, 76, 78-84, 87, 88, 90, 95, 96, 98, 99, 104] Not listed 3 (3.9%) …”
Section: Resultsmentioning
confidence: 99%
“…Use of development set 16 (21%) [12,29,31,34,37,49,55,60,63,69,74,80,87,90,94,95] Not listed 4 (5.2%) [30,82,83,101] publications did not meet our criteria, of which 3 publications in which the algorithm was not evaluated, resulting in 77 included articles describing 77 studies. Reference checking did not provide any additional publications.…”
Section: Development Of Algorithmmentioning
confidence: 99%
“…ICD coding vs. outcome prediction. There is a variety of work in the related field of automated ICD coding (Xie et al, 2018;Falis et al, 2019). Zhang et al ( 2020) recently presented a model able to identify up to 2,292 ICD codes from text.…”
Section: Related Workmentioning
confidence: 99%