Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification

Banerjee, Imon; Yuan, Ling; Chen, Matthew C.; Hasan, Sadid A.; Langlotz, Curtis P.; Moradzadeh, N; Chapman, Brian E.; Amrhein, Timothy J.; Mong, David A.; Farri, Oladimeji; Lungren, Matthew P.

doi:10.1016/j.artmed.2018.11.004

Cited by 195 publications

(114 citation statements)

References 44 publications

Supporting

Mentioning

109

Contrasting

Unclassified

Order By: Relevance

“…DL has shown remarkable results in extracting low-and high-level abstractions from raw text data with semantic and syntactic capabilities. This ability is often accompanied by excellent performance across translational science applications (25,32) and as highlighted below.…”

Section: Word Embeddingmentioning

confidence: 99%

Use of Natural Language Processing to Extract Clinical Cancer Phenotypes from Electronic Medical Records

et al. 2019

View full text Add to dashboard Cite

Current models for correlating electronic medical records with-omics data largely ignore clinical text, which is an important source of phenotype information for patients with cancer. This data convergence has the potential to reveal new insights about cancer initiation, progression, metastasis, and response to treatment. Insights from this real-world data will catalyze clinical care, research, and regulatory activities. Natural language processing (NLP) methods are needed to extract these rich cancer phenotypes from clinical text. Here, we review the advances of NLP and information extraction methods relevant to oncology based on publications from PubMed as well as NLP and machine learning conference proceedings in the last 3 years. Given the interdisciplinary nature of the fields of oncology and information extraction, this analysis serves as a critical trail marker on the path to higher fidelity oncology phenotypes from real-world data.

show abstract

Section: Word Embeddingmentioning

confidence: 99%

Use of Natural Language Processing to Extract Clinical Cancer Phenotypes from Electronic Medical Records

et al. 2019

View full text Add to dashboard Cite

show abstract

“…DL is quickly emerging in the literature as a viable alternative method to traditional ML for the classification of clinical narratives, even in situations where limited labeled data is available [37]. The technique can help in the recognition of a limited number of categories from biomedical text [39,40]; identify psychiatric conditions of patients based on short clinical histories [41]; and accurately classify whether or not radiology reports indicate pulmonary embolism [42,43] whilst outperforming baseline methods (e.g. RFs or DTs).…”

Section: Background and Significancementioning

confidence: 99%

Deep learning facilitates rapid classification of human and veterinary clinical narratives

Pineda

Walk

Venkataraman

et al. 2018

Preprint

View full text Add to dashboard Cite

Objective: Currently, dedicated tagging staff spend considerable effort assigning clinical codes to patient summaries for public health purposes, and machine-learning automated tagging is bottlenecked by availability of electronic medical records. Veterinary medical records, a largely untapped data source that could benefit both human and non-human patients, could fill the gap. Materials and Methods:In this retrospective study, we trained long short-term memory (LSTM) recurrent neural networks (RNNs) on 52,722 human and 89,591 veterinary records. We established relevant baselines by training Decision Trees (DT) and Random Forests (RF) on the same data. We finally investigated the effect of merging data across clinical settings and probed model portability. Results:We show that the LSTM-RNNs accurately classify veterinary/human text narratives into top-level categories with an average weighted macro F 1 score of 0.735/0.675 respectively. The evaluation metric for the LSTM was 7 and 8% higher than that of the DT and RF models respectively. We generally did not find evidence of model portability albeit moderate performance increases in select categories. Discussion:We see a strong positive correlation between number of training samples and classification performance, which is promising for future efforts. The use of LSTM-RNN models represents a scalable structure that could prove useful in cohort selection, which could in turn better address emerging public health concerns. Conclusion:Digitization of human and veterinary health information will continue to be a reality. Our approach is a step forward for these two domains to learn from, and inform, one another.

show abstract

“…With respect to automated text classification, in this work, we compared the approaches from the two main paradigms: (1) symbolic text classification, in which texts are represented with sparse vectors of TF-IDF weights, used as input features for traditional machine learning algorithms, such as Logistic Regression (LR) or Support Vector Machine (SVM); and (2) a more recent semantic text classification paradigm, in which dense semantic representations of words-word embeddings-are introduced as input to a neural architecture. Different deep learning architectures have been tried in a number of medical text classification tasks [25][26][27], including automated classification of radiology reports [6,28,29]. While recurrent [29,30] and attention-based neural networks [27,31] may present a viable solution, convolutional neural networks (CNN) seem to generally offer an edge in classification performance as well as faster training times [6,29].…”

Section: Introductionmentioning

confidence: 99%

“…Different deep learning architectures have been tried in a number of medical text classification tasks [25][26][27], including automated classification of radiology reports [6,28,29]. While recurrent [29,30] and attention-based neural networks [27,31] may present a viable solution, convolutional neural networks (CNN) seem to generally offer an edge in classification performance as well as faster training times [6,29]. Furthermore, due to their efficiency and being less data-hungry than, e.g., recurrent networks, CNNs have profiled themselves as a go-to text classification architecture in general-purpose natural language processing [32][33][34].…”

Section: Introductionmentioning

confidence: 99%

Automatic Annotation of Narrative Radiology Reports

et al. 2020

View full text Add to dashboard Cite

Narrative texts in electronic health records can be efficiently utilized for building decision support systems in the clinic, only if they are correctly interpreted automatically in accordance with a specified standard. This paper tackles the problem of developing an automated method of labeling free-form radiology reports, as a precursor for building query-capable report databases in hospitals. The analyzed dataset consists of 1295 radiology reports concerning the condition of a knee, retrospectively gathered at the Clinical Hospital Centre Rijeka, Croatia. Reports were manually labeled with one or more labels from a set of 10 most commonly occurring clinical conditions. After primary preprocessing of the texts, two sets of text classification methods were compared: (1) traditional classification models—Naive Bayes (NB), Logistic Regression (LR), Support Vector Machine (SVM), and Random Forests (RF)—coupled with Bag-of-Words (BoW) features (i.e., symbolic text representation) and (2) Convolutional Neural Network (CNN) coupled with dense word vectors (i.e., word embeddings as a semantic text representation) as input features. We resorted to nested 10-fold cross-validation to evaluate the performance of competing methods using accuracy, precision, recall, and F 1 score. The CNN with semantic word representations as input yielded the overall best performance, having a micro-averaged F 1 score of 86 . 7 % . The CNN classifier yielded particularly encouraging results for the most represented conditions: degenerative disease ( 95 . 9 % ), arthrosis ( 93 . 3 % ), and injury ( 89 . 2 % ). As a data-hungry deep learning model, the CNN, however, performed notably worse than the competing models on underrepresented classes with fewer training instances such as multicausal disease or metabolic disease. LR, RF, and SVM performed comparably well, with the obtained micro-averaged F 1 scores of 84 . 6 % , 82 . 2 % , and 82 . 1 % , respectively.

show abstract

Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification

Cited by 195 publications

References 44 publications

Use of Natural Language Processing to Extract Clinical Cancer Phenotypes from Electronic Medical Records

Use of Natural Language Processing to Extract Clinical Cancer Phenotypes from Electronic Medical Records

Deep learning facilitates rapid classification of human and veterinary clinical narratives

Automatic Annotation of Narrative Radiology Reports

Contact Info

Product

Resources

About