On the Automatic Generation of Medical Imaging Reports

Jing, Baoyu; Xie, Pengtao; Xing, Eric P.

doi:10.18653/v1/p18-1240

Cited by 364 publications

(375 citation statements)

References 28 publications

Supporting

Mentioning

375

Contrasting

Order By: Relevance

“…Chest Radiographic Observations: The task is formulated as a multi-label classification with 14 common radiographic observations following [5] including: enlarged cardiom, cardiomegaly, lung opacity, lung lesion, edema, consolidation, pneumonia, atelectasis, pneumothorax, pleural effusion, pleural other, fracture, support devices, and no finding. Compared with previous studies using pretrained encoders based on ImageNet [6,14], pretraining with images from the same domain yields better results. We add one full-connected layer as classifier and compute the binary cross entropy (BCE) loss.…”

Section: Image Encodermentioning

confidence: 64%

“…Radiology Report Generation: The evaluation metrics we use are BLEU [9], METEOR [2], and ROUGE [8] scores, all of which are widely used in image captioning and machine translation tasks. We compared the proposed model with several state-of-the-art baselines: (1) a visual attention based image captioning model (Vis-Att) [13]; (2) radiology report generation models, including a hierarchical decoder with co-attention (Co-Att) [6], multimodal generative model with visual attention (MM-Att) [14], and knowledge-drive retrieval based report generation (KERP) [7]; and (3) the proposed multi-view encoder with hierarchical decoder (MvH) model, the base model with visual attentions and early fusion (MvH+AttE), MvH with late fusion fashion (MvH+AttL), and the combination of late fusion with medical concepts (MvH+AttL+MC). MvH+AttL+MC* is an oracle run based on ground-truth medical concepts and considered as the upper bound of the improvement caused by applying medical concepts.…”

Section: Methodsmentioning

confidence: 99%

“…Different from previous studies using ImageNet which is collected for general-purposed object recognition, we pretrain with large scale chest x-ray images from the same domain, namely CheXpert [5], to better capture domain specific image features for decoding. Second, most of previous studies using chest x-ray images for disease classification and report generation consider the frontal and lateral images from the same patient as two independent cases [6,12]. We argue that lateral images contain complementary information to frontal images in the process of interpreting medical images.…”

Section: Introductionmentioning

confidence: 90%

See 2 more Smart Citations

Automatic Radiology Report Generation Based on Multi-view Image Fusion and Medical Concept Enrichment

Yuan

Liao

Luo

et al. 2019

Lecture Notes in Computer Science

145

106

View full text Add to dashboard Cite

Generating radiology reports is time-consuming and requires extensive expertise in practice. Therefore, reliable automatic radiology report generation is highly desired to alleviate the workload. Although deep learning techniques have been successfully applied to image classification and image captioning tasks, radiology report generation remains challenging in regards to understanding and linking complicated medical visual contents with accurate natural language descriptions. In addition, the data scales of open-access datasets that contain paired medical images and reports remain very limited. To cope with these practical challenges, we propose a generative encoder-decoder model and focus on chest x-ray images and reports with the following improvements. First, we pretrain the encoder with a large number of chest x-ray images to accurately recognize 14 common radiographic observations, while taking advantage of the multi-view images by enforcing the cross-view consistency. Second, we synthesize multi-view visual features based on a sentence-level attention mechanism in a late fusion fashion. In addition, in order to enrich the decoder with descriptive semantics and enforce the correctness of the deterministic medical-related contents such as mentions of organs or diagnoses, we extract medical concepts based on the radiology reports in the training data and fine-tune the encoder to extract the most frequent medical concepts from the x-ray images. Such concepts are fused with each decoding step by a word-level attention model. The experimental results conducted on the Indiana University Chest X-Ray dataset demonstrate that the proposed model achieves the state-of-the-art performance compared with other baseline approaches.

show abstract

Section: Image Encodermentioning

confidence: 64%

Section: Methodsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 90%

See 1 more Smart Citation

Automatic Radiology Report Generation Based on Multi-view Image Fusion and Medical Concept Enrichment

Yuan

Liao

Luo

et al. 2019

Lecture Notes in Computer Science

145

106

View full text Add to dashboard Cite

show abstract

“…The reports used in both cases are far more structured than their raw counterparts and so this approach cannot be directly translated to hospital data. Training on raw hospital reports, Jing et al [9] demonstrated how they can be generated by first training a multi-label CNN on the images and the Medical Text Indexer (MTI) tags identified in the original raw reports of the Openi chest x-ray dataset. However, reports can be very long and heterogeneous, and the authors do not evaluate the model's ability to determine whether visually and clinically-relevant medical concepts have been identified.…”

Section: Radiology Report Generationmentioning

confidence: 99%

“…Recently, we have seen supervised learning approaches that aim to take advantage of past radiological exams containing reports in order to either autogenerate the reports [17,9,23], or to assist in classification tasks [16,21,20,24,22]. The noise present in medical reports in addition to the presence of non-visually significant information, such as the negation of pathologies, make it difficult to learn from them directly as done in natural image captioning frameworks.…”

Section: Introductionmentioning

confidence: 99%

Automated Enriched Medical Concept Generation for Chest X-ray Images

Gasimova

2019

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Decision support tools that rely on supervised learning require large amounts of expert annotations. Using past radiological reports obtained from hospital archiving systems has many advantages as training data above manual single-class labels: they are expert annotations available in large quantities, covering a population-representative variety of pathologies, and they provide additional context to pathology diagnoses, such as anatomical location and severity. Learning to auto-generate such reports from images present many challenges such as the difficulty in representing and generating long, unstructured textual information, accounting for spelling errors and repetition/redundancy, and the inconsistency across different annotators. We therefore propose to first learn visually-informative medical concepts from raw reports, and, using the concept predictions as image annotations, learn to autogenerate structured reports directly from images. We validate our approach on the OpenI [2] chest x-ray dataset, which consists of frontal and lateral views of chest x-ray images, their corresponding raw textual reports and manual medical subject heading (MeSH R ) annotations made by radiologists.

show abstract

Deep Learning in Medicine—Promise, Progress, and Challenges

2019

View full text Add to dashboard Cite

Recent years have seen a surge of interest in machine learning and artificial intelligence techniques in health care. 1 Deep learning 2 represents the latest iteration in a progression of artificial intelligence technologies that have allowed machines to mimic human intelligence in increasingly sophisticated and independent ways. 3 Early medical artificial intelligence systems relied heavily on experts to train computers by encoding clinical knowledge as logic rules for specific clinical scenarios. More advanced machine learning systems train themselves to learn these rules by identifying and weighing relevant features from the data, such as pixels from medical images, or raw information from electronic health records (EHRs).For machine learning approaches to work well in practice, feature engineering directed by experts is often required (eg, quantization of laboratory tests into discrete value ranges or the extraction of descriptors that characterize the texture of medical images). The newer deep learning techniques, which require less supervision, use an end-to-end learning mechanism to map raw inputs (such as laboratory test values or image pixels) to outputs without human-directed manipulation of the data. The resulting maps are composed of multiple layers of interconnected nonlinear processing units-the "neurons" of deep learning. Deep learning techniques may still require some expert involvement to design optimal model architectures and identify the best set of parameters.At present, the most successful applications of deep learning in medicine have been for analyzing medical images. Deep learning algorithms are capable of performing automatic and accurate detection of diabetic retinopathy 4 and skin cancer 5 from retinal fundus and skin images, respectively. The potential of deep learning to disentangle complex, subtle discriminative patterns in images suggests that these techniques may be useful in other areas of medicine. Substantial challenges must be addressed, however, before deep learning can be applied more broadly.

show abstract

On the Automatic Generation of Medical Imaging Reports

Cited by 364 publications

References 28 publications

Automatic Radiology Report Generation Based on Multi-view Image Fusion and Medical Concept Enrichment

Automatic Radiology Report Generation Based on Multi-view Image Fusion and Medical Concept Enrichment

Automated Enriched Medical Concept Generation for Chest X-ray Images

Deep Learning in Medicine—Promise, Progress, and Challenges

Contact Info

Product

Resources

About