Neural Translation and Automated Recognition of ICD-10 Medical Entities From Natural Language: Model Development and Performance Assessment

Falissard, Louis; Morgand, Claire; Ghosn, Walid; Imbaud, Claire; Bounebache, Karim; Rey, Grégoire

doi:10.2196/26353

Cited by 12 publications

(5 citation statements)

References 7 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Section: Discussionsupporting

confidence: 91%

“…These results are aligned with those of previous studies using AUTOCOD [ 12 , 13 ] and, in general, with the literature on deep neural networks applied to the automatic classification of DCs [ 14 , 27 , 28 ]. Falissard et al [ 14 ] developed a deep neural network for automated coding of the underlying cause of death with a test accuracy of 0.978 (95% CI 0.977-0.979) and an F -measure value of 0.952 (95% CI 0.946-0.957) [ 27 ]. The proposed approach by Della Mea et al [ 28 ] for automated coding of causes of death had an accuracy of 0.990 (95% CI 0.990-0.991) and a macroaveraged accuracy and F 1 -score of 0.974 and 0.968, respectively.…”

Section: Discussionsupporting

confidence: 91%

See 1 more Smart Citation

Real-Time Classification of Causes of Death Using AI: Sensitivity Analysis

Ferreira¹,

Simões²,

Carvalho³

et al. 2023

JMIR AI

View full text Add to dashboard Cite

Background In 2021, the European Union reported >270,000 excess deaths, including >16,000 in Portugal. The Portuguese Directorate-General of Health developed a deep neural network, AUTOCOD, which determines the primary causes of death by analyzing the free text of physicians’ death certificates (DCs). Although AUTOCOD’s performance has been established, it remains unclear whether its performance remains consistent over time, particularly during periods of excess mortality. Objective This study aims to assess the sensitivity and other performance metrics of AUTOCOD in classifying underlying causes of death compared with manual coding to identify specific causes of death during periods of excess mortality. Methods We included all DCs between 2016 and 2019. AUTOCOD’s performance was evaluated by calculating various performance metrics, such as sensitivity, specificity, positive predictive value (PPV), and F1-score, using a confusion matrix. This compared International Statistical Classification of Diseases and Health-Related Problems, 10th Revision (ICD-10), classifications of DCs by AUTOCOD with those by human coders at the Directorate-General of Health (gold standard). Subsequently, we compared periods without excess mortality with periods of excess, severe, and extreme excess mortality. We defined excess mortality as 2 consecutive days with a Z score above the 95% baseline limit, severe excess mortality as 2 consecutive days with a Z score >4 SDs, and extreme excess mortality as 2 consecutive days with a Z score >6 SDs. Finally, we repeated the analyses for the 3 most common ICD-10 chapters focusing on block-level classification. Results We analyzed a large data set comprising 330,098 DCs classified by both human coders and AUTOCOD. AUTOCOD demonstrated high sensitivity (≥0.75) for 10 ICD-10 chapters examined, with values surpassing 0.90 for the more prevalent chapters (chapter II—“Neoplasms,” chapter IX—“Diseases of the circulatory system,” and chapter X—“Diseases of the respiratory system”), accounting for 67.69% (223,459/330,098) of all human-coded causes of death. No substantial differences were observed in these high-sensitivity values when comparing periods without excess mortality with periods of excess, severe, and extreme excess mortality. The same holds for specificity, which exceeded 0.96 for all chapters examined, and for PPV, which surpassed 0.75 in 9 chapters, including the more prevalent ones. When considering block classification within the 3 most common ICD-10 chapters, AUTOCOD maintained a high performance, demonstrating high sensitivity (≥0.75) for 13 ICD-10 blocks, high PPV for 9 blocks, and specificity of >0.98 in all blocks, with no significant differences between periods without excess mortality and those with excess mortality. Conclusions Our findings indicate that, during periods of excess and extreme excess mortality, AUTOCOD’s performance remains unaffected by potential text quality degradation because of pressure on health services. Consequently, AUTOCOD can be dependably used for real-time cause-specific mortality surveillance even in extreme excess mortality situations.

show abstract

Section: Discussionsupporting

confidence: 91%

Section: Discussionsupporting

confidence: 91%

Real-Time Classification of Causes of Death Using AI: Sensitivity Analysis

Ferreira¹,

Simões²,

Carvalho³

et al. 2023

JMIR AI

View full text Add to dashboard Cite

show abstract

“…Nevertheless, the trends observed within this set of keywords are also reflected in the analysis provided in the following sections. [23], construction of cohorts of similar patients [24], processing of electronic medical records [25], understanding of patient's answers in a French medical chatbot [26]; • German: evaluation of Transformers on clinical notes [27]; • Greek: improving the performance of localized healthcare virtual assistants [28]; • Hindi: classification of COVID-19 texts [29], chatbot for information sexual and reproductive health for young people [30]; • Italian: analysis of social media for quality of life in Parkinson's patients [31], sentiment analysis of opinion on COVID-19 vaccines [32,33], estimation of the incidence of infectious disease cases [34]; • Japanese: understanding psychiatric illness [35], detection of adverse events from narrative clinical documents [36]; • Korean: BERT model for processing med-ical documents [37], sentiment analysis of tweets about COVID-19 vaccines [38];…”

Section: Analysis Of Abstract From Publicationsmentioning

confidence: 99%

“…and institutions (like MIMIC-III), as well as data from social media, hospitals, bibliographical datasets, clinical trials, etc. The research in other languages is possible mainly thanks to the availability of data from social media [7,9,19,20,22,38,43,47] and documents from local hospitals [10,13,14,17,18,23,25,27,36,37,40,42]. Besides, this set of works in languages other than English relies on the dedicated language models, which cover a great variety of languages by now.…”

Section: Languages Addressedmentioning

confidence: 99%

Year 2022 in Medical Natural Language Processing: Availability of Language Models as a Step in the Democratization of NLP in the Biomedical Area

Grouin,

Grabar

2023

Yearb Med Inform

View full text Add to dashboard Cite

Objectives: To analyse the content of publications within the medical Natural Language Processing (NLP) domain in 2022. Methods: Automatic and manual preselection of publications to be reviewed, and selection of the best NLP papers of the year. Analysis of the important issues. Results: Three best papers have been selected. We also propose an analysis of the content of the NLP publications in 2022, stressing on some of the topics. Conclusion: The main trend in 2022 is certainly related to the availability of large language models, especially those based on Transformers, and to their use by non-NLP researchers. This leads to the democratization of the NLP methods. We also observe the renewal of interest to languages other than English, the continuation of research on information extraction and prediction, the massive use of data from social media, and the consideration of needs and interests of patients.

show abstract

“…Yet, the need for coding electronic health records also with ICF has been recognized as useful [4]. Thus, the use of tools to support this task is welcomed, although not yet researched as much as for other biomedical classifications like ICD, e.g., in [5,6,7,8].…”

Section: Introductionmentioning

confidence: 99%

Automated ICF Coding of Rehabilitation Notes for Low-Resource Languages via Continual Training of Language Models

Roitero,

Martinuzzi,

Armellin

et al. 2023

Caring Is Sharing – Exploiting the Value in Data for Health and Innovation

View full text Add to dashboard Cite

The coding of medical documents and in particular of rehabilitation notes using the International Classification of Functioning, Disability and Health (ICF) is a difficult task showing low agreement among experts. Such difficulty is mainly caused by the specific terminology that needs to be used for the task. In this paper, we address the task developing a model based on a large language model, BERT. By leveraging continual training of such a model using ICF textual descriptions, we are able to effectively encode rehabilitation notes expressed in Italian, an under-resourced language.

show abstract

Neural Translation and Automated Recognition of ICD-10 Medical Entities From Natural Language: Model Development and Performance Assessment

Cited by 12 publications

References 7 publications

Real-Time Classification of Causes of Death Using AI: Sensitivity Analysis

Real-Time Classification of Causes of Death Using AI: Sensitivity Analysis

Year 2022 in Medical Natural Language Processing: Availability of Language Models as a Step in the Democratization of NLP in the Biomedical Area

Automated ICF Coding of Rehabilitation Notes for Low-Resource Languages via Continual Training of Language Models

Contact Info

Product

Resources

About