2021
DOI: 10.18287/2412-6179-co-811
|View full text |Cite
|
Sign up to set email alerts
|

Algorithm for choosing the best frame in a video stream in the task of identity document recognition

Abstract: During the process of document recognition in a video stream using a mobile device camera, the image quality of the document varies greatly from frame to frame. Sometimes recognition system is required not only to recognize all the specified attributes of the document, but also to select final document image of the best quality. This is necessary, for example, for archiving or providing various services; in some countries it can be required by law. In this case, recognition system needs to assess the quality o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 24 publications
0
4
0
Order By: Relevance
“…Since its publication, MIDV-500 dataset and its extension MIDV-2019 were used to evaluate the methods of identity document images classification [12 -14]; identity document location [11,15], including the methods based on semantic segmentation [16]; detecting of faces on images of identity documents [17]; and methods related to text fields recognition, including single text line recognition [18], per-frame recognition results combination [19,20] and making a stopping decision in a video stream [21,22]. The dataset was also used to evaluate the methods of choosing a single best frame in the identity document video capture [23] and assessing the quality of the frame for its processing by an identity analysis system [24], detection and masking of sensitive and private information [25] and general ID verification [26].…”
Section: Introductionmentioning
confidence: 99%
“…Since its publication, MIDV-500 dataset and its extension MIDV-2019 were used to evaluate the methods of identity document images classification [12 -14]; identity document location [11,15], including the methods based on semantic segmentation [16]; detecting of faces on images of identity documents [17]; and methods related to text fields recognition, including single text line recognition [18], per-frame recognition results combination [19,20] and making a stopping decision in a video stream [21,22]. The dataset was also used to evaluate the methods of choosing a single best frame in the identity document video capture [23] and assessing the quality of the frame for its processing by an identity analysis system [24], detection and masking of sensitive and private information [25] and general ID verification [26].…”
Section: Introductionmentioning
confidence: 99%
“…-Document detection and localization in the image [35][36][37]; -Document type identification [35,37]; -Document layout analysis; -Detection of faces in document images [38] and the choice of the best photo of the document owner [39]; -Integration of the recognition results [40]; -Video frame quality assessment [41] and the choice of the best frame [42].…”
Section: Discussionmentioning
confidence: 99%
“…Using mock documents from the MIDV-2020 collection as targets for shooting DLC-2021 video makes it easy to use field values and document geometry markup from MIDV-2020 templates. The prepared open dataset can be used for other ID-recognition tasks: Document detection and localization in the image [ 35 , 36 , 37 ]; Document type identification [ 35 , 37 ]; Document layout analysis; Detection of faces in document images [ 38 ] and the choice of the best photo of the document owner [ 39 ]; Integration of the recognition results [ 40 ]; Video frame quality assessment [ 41 ] and the choice of the best frame [ 42 ]. …”
Section: Discussionmentioning
confidence: 99%
“…Applications of such assessment include user interaction, selection of the best frame in a video stream, rejection of analyzing an image of obviously low quality, etc. The respective field of science is rapidly developing [5,6,7], and special methods for assessing distortions of a given type (noise, blurring, low illumination, etc.) are of particular interest due to the possibility of seeking feedback from the user or applying interference correction methods [8].…”
Section: Introductionmentioning
confidence: 99%