2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA) 2016
DOI: 10.1109/dicta.2016.7797031
|View full text |Cite
|
Sign up to set email alerts
|

Deep Neural Networks for Page Stream Segmentation and Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
9
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 13 publications
0
9
0
1
Order By: Relevance
“…Gallo et al proposed in their study a hybrid technique using CNN, followed by a Deep Neural Network (DNN) to extract the visual features of the text and classified the documents. The page stream segmentation was done using the document classification in the proposed method [4].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Gallo et al proposed in their study a hybrid technique using CNN, followed by a Deep Neural Network (DNN) to extract the visual features of the text and classified the documents. The page stream segmentation was done using the document classification in the proposed method [4].…”
Section: Related Workmentioning
confidence: 99%
“…We have traced the evolution (Table 1) of the DSS technologies starting from the stochastic Markov chain model in 2009 [3] through the deep image-based page feature extraction and classification in 2016 [4], rule-based approach in 2017 [5] to a more sophisticated state-of-the-art multi-modal deep learning approach combining text and image features of the document page until 2021 [6]. Although, there is a clear dearth of the overall study observed in the domain of DSS, the recent breakthrough in the domain of DSS by Wiedemann and Heyer [7] shows promising results with Tobacco800 public data set and a proprietary data set.…”
Section: Introductionmentioning
confidence: 99%
“…In this case, the system is iterated on the successive pairs of pages, takes as input the current pair of pages, looks at whether the two pages contain indices of similarity or continuity, then, decides whether they represent a rupture or a continuity. The second one is based on page classification [Gallo et al, 2016], according to the assumption that for two successive pages belong to the same document, they must belong to the same class. After having all the classes of stream pages, consecutive pages that belong to the same class are grouped as a single document.…”
Section: Continuitymentioning
confidence: 99%
“…More recently, one can find in the state of the art deep learning techniques with convolutional neuronal models for the classification of documents, as [Gallo et al, 2016, Harley et al, 2015, Wiedemann and Heyer, 2017. While the first two use only textual information, the last two use textual and visual information.…”
Section: Related Workmentioning
confidence: 99%
“…Mais recentemente, uma série de trabalhos [Jain and Wigington 2019, Gallo et al 2016, Wiedemann and Heyer 2019, Audebert et al 2019] propuseram a criação de modelos de segmentação que levam em consideração as informações que estão presentes tanto nas imagens quanto nos textos, em um processo de integração de informações. Os trabalhos supracitados consideraram documentos provenientes de arquivos de grandes empresas, como a base de dados Tobacco800 [Wiedemann and Heyer 2019].…”
unclassified