Scanned documents are often skewed, which can seriously degrade the performance of Optical Character Recognition (OCR). A skewed document therefore needs to be corrected before its image content is analyzed. In this article, we propose a novel adaptive deskewing algorithm for document images, which mainly combines Skeleton Line Detection (SKLD), Piecewise Projection Profile (PPP), Morphological Clustering (MC), and an image classification method. The image type is first determined from the image’s layout features, and the image is then deskewed with the correction method suited to its type. Our method achieves 97.6% and 80.1% accuracy on the Document Image Skew Estimation Contest (DISEC’2013) and PubLayNet datasets, respectively. Extensive experiments further show the superiority of the proposed algorithm.
The aim of this paper is to screen out abnormal data caused by human and natural factors. We propose an algorithm for cleaning and monitoring vehicle data. First, the valid data are selected by an improved DBSCAN method, and a threshold range is obtained through data analysis. The abnormal data within the valid data are then screened out using this threshold range. Finally, the abnormal data are classified and counted according to the factors stipulated by the enterprise. The results show that the proposed algorithm processes abnormal data more simply and quickly than other similar algorithms.
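The first two steps above can be illustrated with a minimal sketch. It uses plain DBSCAN on one-dimensional readings rather than the improved variant the abstract refers to, and it assumes the threshold range is simply the minimum and maximum of the points DBSCAN keeps. The function names, the `eps` and `min_pts` values, and the sample speed readings are all hypothetical.

```python
def dbscan_1d(values, eps, min_pts):
    """Plain DBSCAN on 1-D values: returns a cluster id per value, -1 for noise."""
    n = len(values)
    labels = [None] * n  # None means "not visited yet"

    def neighbours(i):
        # Indices of all values within eps of values[i] (includes i itself).
        return [j for j in range(n) if abs(values[i] - values[j]) <= eps]

    cluster = -1
    for i in range(n):
        if labels[i] is not None:
            continue
        seeds = neighbours(i)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisional noise; may become a border point later
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reached from a core point: border point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reachable = neighbours(j)
            if len(reachable) >= min_pts:  # j is a core point, keep expanding
                queue.extend(reachable)
    return labels


def threshold_range(values, labels):
    """Assumed rule: the range spanned by the values DBSCAN did not mark as noise."""
    valid = [v for v, label in zip(values, labels) if label != -1]
    return min(valid), max(valid)


def flag_abnormal(readings, low, high):
    """Return the readings that fall outside the closed range [low, high]."""
    return [r for r in readings if not low <= r <= high]


# Hypothetical vehicle speed readings (km/h) with two obvious outliers.
history = [60, 61, 62, 59, 60, 61, 63, 200, -5]
labels = dbscan_1d(history, eps=2, min_pts=3)
low, high = threshold_range(history, labels)
abnormal = flag_abnormal([58, 60, 64], low, high)
```

Learning the range from a reference batch and applying it to later readings is one possible reading of the abstract; the classification of abnormal data by enterprise-defined factors is not sketched here because the abstract does not describe those factors.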
As an important identification document for citizens, the ID card plays a significant role in daily life, and its information is used in almost every aspect of it. However, this information is traditionally entered by hand, which is time-consuming, labor-intensive, expensive, and error-prone. In this paper, we propose a novel algorithm to locate and recognize ID card information, with new strategies for rectifying the image, detecting the card boundary, and locating the information. To handle image rotation, the image is rectified by searching for the rotation angle that yields the maximum corner point projection peak. The boundary of the ID card is detected by finding the best lines in the predicted boundary area, based on the deviation between the predicted and detected boundaries, and the information is located by combining prior information with the positional relations among the key information fields. Experimental results show that the proposed algorithm achieves state-of-the-art performance in recognizing ID card information.