Combining Optical Character Recognition With Paper ECG Digitization

Ganesh, Shambavi; Bhatti, Pamela; Alkhalaf, Mhmtjamil; Gupta, Shishir Kumar; Shah, Amit; Tridandapani, Srini

doi:10.1109/jtehm.2021.3083482

Cited by 10 publications

(9 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In the ECG context, various digitization methods have been proposed; including the grayscale thresholding and contour-based digitization method by Ravichandran et al ( 2013 ), color segmentation and median filtering for noise removal by Garg et al ( 2012 ), and the combination of optical character recognition (OCR) with image processing techniques for digitization and artifact removal (Baydoun et al 2019 , Ganesh et al 2021 ). However, classical image processing methods, sensitive to input quality and environmental artifacts, often struggle with low-quality, distorted paper ECG records.…”

Section: Related Workmentioning

confidence: 99%

“…The user of the toolbox has the option to choose whether there should be an overlap between ECG segments and printed text artifacts. Although overlapped characters pose a problem in digitizing paper ECG records (Ganesh et al 2021 ), they are added to represent realistic paper ECGs, which occasionally print text with partial overlap with the ECG traces. Further, to add other printed information such as date, patient record numbers, etc the toolkit uses the corresponding fields from the WFDB header files that accompany all PhysioNet data files, or through a customizable text-based template file.…”

Section: Synthetic Ecg Image Generation Pipelinementioning

confidence: 99%

See 1 more Smart Citation

ECG-Image-Kit: a synthetic image generation toolbox to facilitate deep learning-based electrocardiogram digitization

Shivashankara,

Deepanshi,

Mehri Shervedani

et al. 2024

Physiol. Meas.

View full text Add to dashboard Cite

Objective: Cardiovascular diseases are a major cause of mortality globally, and electrocardiograms (ECGs) are crucial for diagnosing them. Traditionally, ECGs are stored in printed formats. However, these printouts, even when scanned, are incompatible with advanced ECG diagnosis software that require time-series data. Digitizing ECG images is vital for training machine learning models in ECG diagnosis, leveraging the extensive global archives collected over decades. Deep learning models for image processing are promising in this regard, although the lack of clinical ECG archives with reference time-series data is challenging. Data augmentation techniques using realistic generative data models provide a solution.Approach: We introduce ECG-Image-Kit, an open-source toolbox for generating synthetic multi-lead ECG images with realistic artifacts from time-series data, aimed at automating the conversion of scanned ECG images to ECG data points. The tool synthesizes ECG images from real time-series data, applying distortions like text artifacts, wrinkles, and creases on a standard ECG paper background.Main results: As a case study, we used ECG-Image-Kit to create a dataset of 21,801 ECG images from the PhysioNet QT database. We developed and trained a combination of a traditional computer vision and deep neural network model on this dataset to convert synthetic images into time-series data for evaluation. We assessed digitization quality by calculating the signal-to-noise ratio (SNR) and compared clinical parameters like QRS width, RR, and QT intervals recovered from this pipeline, with the ground truth extracted from ECG time-series. The results show that this deep learning pipeline accurately digitizes paper ECGs, maintaining clinical parameters, and highlights a generative approach to digitization.Significance: The toolbox has broad applications, including model development for ECG image digitization and classification. The toolbox currently supports data augmentation for the 2024 PhysioNet Challenge, focusing on digitizing and classifying paper ECG images.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Synthetic Ecg Image Generation Pipelinementioning

confidence: 99%

ECG-Image-Kit: a synthetic image generation toolbox to facilitate deep learning-based electrocardiogram digitization

Shivashankara,

Deepanshi,

Mehri Shervedani

et al. 2024

Physiol. Meas.

View full text Add to dashboard Cite

show abstract

“…The image character recognition model is the second step in the two steps of detection and recognition. The document character recognition model is also simpler than the character recognition task in the natural scene [5] . Since there is a large difference between the foreground and background in the foreground distribution map, a relatively simple global threshold is selected here M calculate the average value of all pixels in the foreground distribution according to the formula P and standard deviation λ is the available feature extraction algorithm is:…”

Section: End To End Document Image Text Feature Extraction Algorithmmentioning

confidence: 99%

Research on end-to-end document image character recognition based on differentiable binarization

Liu¹,

Shao²,

Wang³

et al. 2022

Fifth International Conference on Mechatronics and Computer Technology Engineering (MCTE 2022)

View full text Add to dashboard Cite

In view of the poor effect of the current image and character recognition, a research method of end-to-end document image and character recognition based on the differentiable binarization is proposed. Combining the differentiable binarization algorithm to collect and recognize the document image features, an end-to-end document image and character feature extraction algorithm is constructed. Based on the calculation results, the document image is segmented, the regional image features are extracted, and the end-to-end document image and character recognition process is simplified, Finally, the experiment proves that the accuracy of the end-to-end document image and character recognition method based on the differentiable binarization can reach more than 75%, and the overall recognition effect is improved by more than 15% compared with the traditional BFGS method, which meets the research requirements.

show abstract

“…Due to the shooting light, shooting background, shooting Angle and the lack of cleanliness of the invoice itself, the invoice image will have strong noise and even the information on the invoice cannot be recognized [11]. Direct use of the original invoice image to identify, will cause a great error in the identification results, resulting in poor identification results.…”

Section: Image Preprocessingmentioning

confidence: 99%

Invoice Recognition System based on Neural Network

2020

View full text Add to dashboard Cite

The reimbursement process of invoices is very complicated, which requires manual input of key information in invoices, which wastes a lot of manpower and time. Therefore, it is particularly important to design an algorithm for intelligent identification of invoice information. This paper mainly carries on the invoice recognition system based on neural network. This paper firstly preprocessed the image and improved the Hough transform to detect the tilt Angle of the invoice image by taking the long horizontal line in the invoice layout as the target. After that, the stamp of the invoice image is removed to reduce the interference of text detection and recognition. Secondly, this paper improves invoice recognition based on YOLOv3 detection algorithm. In this paper, the invoice recognition system is constructed and compared with the other two systems. Through the experimental comparison results, it can be known that the system improves the efficiency of the staff in processing paper invoices and reduces the workload of their later registration and verification of invoices.

show abstract

Combining Optical Character Recognition With Paper ECG Digitization

Cited by 10 publications

References 24 publications

ECG-Image-Kit: a synthetic image generation toolbox to facilitate deep learning-based electrocardiogram digitization

ECG-Image-Kit: a synthetic image generation toolbox to facilitate deep learning-based electrocardiogram digitization

Research on end-to-end document image character recognition based on differentiable binarization

Invoice Recognition System based on Neural Network

Contact Info

Product

Resources

About