2020
DOI: 10.1049/iet-ipr.2020.0532
|View full text |Cite
|
Sign up to set email alerts
|

HU‐PageScan: a fully convolutional neural network for document page crop

Abstract: November The offer of online, automated, and impersonal services demand users to upload scanned copies of their documents to the organisations. As a consequence of this decentralisation, the documents present more challenges to the already complex process of image processing and information extraction. To address this problem, the authors presented an optimised fully convolutional neural network model for document segmentation that works on mobile devices to detect the region of the document in the captured im… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 26 publications
0
8
0
Order By: Relevance
“…The task is to clas-sify each pixel to either document or a background. The most natural metrics to evaluate such a task is to compute a pixel-wise Jaccard score [15].…”
Section: Document Location Via Semantic Segmentationmentioning
confidence: 99%
See 2 more Smart Citations
“…The task is to clas-sify each pixel to either document or a background. The most natural metrics to evaluate such a task is to compute a pixel-wise Jaccard score [15].…”
Section: Document Location Via Semantic Segmentationmentioning
confidence: 99%
“…To evaluate the semantic segmentation method, we used the published methods HU-PageScan [15] and HoughEncoder [16], both based on a U-net modification. HU-PageScan has an openly available source code and a pre-trained model, which was used for this evaluation.…”
Section: Document Location Via Semantic Segmentationmentioning
confidence: 99%
See 1 more Smart Citation
“…The systems are listed in descending order by the average accuracy for all four SmartDoc subsets. The results clearly show that based on the overall accuracy, our system is the second best, surpassed only by [19] proposed by das Neves et al This method is based on a Unet like neural network. On the Fig.…”
Section: Results Of Experiments On Smartdocmentioning
confidence: 74%
“…There are millions of parameters in such networks, so researchers are concentrating on the reduction of the parameters. Ricardo et al in [19] modified the U-net network architecture, reducing the number of parameters by more than 70 %. In [20], the number of parameters was reduced by a factor of 100 via the Fast Hough Transform [21] and its inverse.…”
Section: Introductionmentioning
confidence: 99%