2020
DOI: 10.1016/j.patcog.2020.107535
|View full text |Cite
|
Sign up to set email alerts
|

Self-supervised deep reconstruction of mixed strip-shredded text documents

Abstract: The reconstruction of shredded documents consists of coherently arranging fragments of paper (shreds) to recover the original document(s). A great challenge in computational reconstruction is to properly evaluate the compatibility between the shreds. While traditional pixel-based approaches are not robust to real shredding, more sophisticated solutions compromise significantly time performance. The solution presented in this work extends our previous deep learning method for single-page reconstruction to a mor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 38 publications
0
6
0
Order By: Relevance
“…• A deep classification-based approach [12], [13] (Section II). The problem of evaluating compatibility is formulated as a classification problem using a deep model: the obtained results demonstrate the effectiveness of this approach in reconstructing 100 documents with an accuracy superior to 90%; • A deep metric-learning approach [14] (Section III).…”
Section: Permutation Search Compatibility Evaluationmentioning
confidence: 99%
“…• A deep classification-based approach [12], [13] (Section II). The problem of evaluating compatibility is formulated as a classification problem using a deep model: the obtained results demonstrate the effectiveness of this approach in reconstructing 100 documents with an accuracy superior to 90%; • A deep metric-learning approach [14] (Section III).…”
Section: Permutation Search Compatibility Evaluationmentioning
confidence: 99%
“…In contrast, as a new paradigm between unsupervised and supervised learning, SSL can generate labels based on the property of unlabeled data itself to train the neural network in a supervised manner similar to natural learning experiences. With excellent performance on representation learning and dealing with the issue of unlabelled data, SSL [20][21][22] has been successfully implemented in a wide range of fields, including image recognition 23 , audio representation 24 , computer vision 25 , document reconstruction 26 , atmosphere 27 , astronomy 28 , medical 29 , person re-identification 30 , remote sensing 31 , robotics 32 , omnidirectional imaging 33 , manufacturing 34 , nano-photonics 35 , and civil engineering 36 , etc. However, this method has not been formally attempted in material science.…”
Section: High-efficient Low-cost Characterization Of Materials Properties Using Domain-knowledge-guided Selfsupervised Learningmentioning
confidence: 99%
“…In document information retrieval, the authors of [4] propose an interesting and promising approach using clustered SIFT descriptors ( [25]) as pretext-task labels for writer retrieval. The authors of [33] generate simulated-shredded documents to train a model for reconstructing mixed strip-shredded text documents. An other interesting approach is proposed in [9], where the authors apply various random transformations to the unlabelled source images to create a variety of images that compose a surrogate class, then used for supervised training.…”
Section: Related Workmentioning
confidence: 99%