Logical segmentation for article extraction in digitized old newspapers

Palfray, Thomas; Hebert, David G.; Nicolas, Stéphane; Tranouez, Pierrick; Paquet, Thierry

doi:10.1145/2361354.2361383

Cited by 20 publications

(2 citation statements)

References 6 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, there are many deep learning models that address aspects of document analysis and recognition tasks that we can leverage in our research. There exist deep learning models for detecting tables in documents [19]- [21], mathematical formula detection and recognition [22]- [24], document structure detection systems [25]- [28], and more. The technical details of our PDF remediation method are presented in Chapter 4 and the evaluation of our methods in Chapter 5.1.…”

Section: Deep Learning For Pdf Remediationmentioning

confidence: 99%

Accessible PDFs: Applying Artificial Intelligence for Automated Remediation of STEM PDFs

Schmitt-Koopmann

Huang

Darvishy

2022

Proceedings of the 24th International ACM SIGACCESS Conference on Computers and Accessibility

View full text Add to dashboard Cite

People with visual impairments use assistive technology, e.g., screen readers, to navigate and read PDFs. However, such screen readers need extra information about the logical structure of the PDF, such as the reading order, header levels, and mathematical formulas, described in readable form to navigate the document in a meaningful way. This logical structure can be added to a PDF with tags. Creating tags for a PDF is time-consuming, and requires awareness and expert knowledge. Hence, most PDFs are left untagged, and as a result, they are poorly readable or unreadable for people who rely on screen readers. STEM documents are particularly problematic with their complex document structure and complicated mathematical formulae. These inaccessible PDFs present a major barrier for people with visual impairments wishing to pursue studies or careers in STEM felds, who cannot easily read studies and publications from their feld. The goal of this Ph.D. is to apply artifcial intelligence for document analysis to reasonably automate the remediation process of PDFs and present a solution for large mathematical formulae accessibility in PDFs. With these new methods, the Ph.D. research aims to lower barriers to creating accessible scientifc PDFs, by reducing the time, efort, and expertise necessary to do so, ultimately facilitating greater access to scientifc documents for people with visual impairments. CCS CONCEPTS• Human-centered computing → Accessibility; Accessibility systems and tools; Accessibility; Accessibility technologies; • Applied computing → Document management and text processing; Document capture; Document analysis.

show abstract

Section: Deep Learning For Pdf Remediationmentioning

confidence: 99%

Accessible PDFs: Applying Artificial Intelligence for Automated Remediation of STEM PDFs

Schmitt-Koopmann

Huang

Darvishy

2022

Proceedings of the 24th International ACM SIGACCESS Conference on Computers and Accessibility

View full text Add to dashboard Cite

show abstract

“…Palfray et al [4] focus on the challenge of digitizing antique newspapers. Their approach not only performs segmentation but also extracts the reading order.…”

Section: Related Workmentioning

confidence: 99%

Fully Convolutional Neural Networks for Newspaper Article Segmentation

Meier

Stadelmann

Stampfli

et al. 2017

2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)

View full text Add to dashboard Cite

Segmenting newspaper pages into articles that semantically belong together is a necessary prerequisite for article-based information retrieval on print media collections like e.g. archives and libraries. It is challenging due to vastly differing layouts of papers, various content types and different languages, but commercially very relevant for e.g. media monitoring. We present a semantic segmentation approach based on the visual appearance of each page. We apply a fully convolutional neural network (FCN) that we train in an end-to-end fashion to transform the input image into a segmentation mask in one pass. We show experimentally that the FCN performs very well: it outperforms a deep learning-based commercial solution by a large margin in terms of segmentation quality while in addition being computationally two orders of magnitude more efficient.

show abstract

SPEdu: A Toolbox for Processing Digitized Historical Documents

Rocha

Rodríguez

2020

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Logical segmentation for article extraction in digitized old newspapers

Cited by 20 publications

References 6 publications

Accessible PDFs: Applying Artificial Intelligence for Automated Remediation of STEM PDFs

Accessible PDFs: Applying Artificial Intelligence for Automated Remediation of STEM PDFs

Fully Convolutional Neural Networks for Newspaper Article Segmentation

SPEdu: A Toolbox for Processing Digitized Historical Documents

Contact Info

Product

Resources

About