Automated Processing of Digitized Historical Newspapers beyond the Article Level: Sections and Regular Features

Allen, Robert B.; Hall, Catherine

doi:10.1007/978-3-642-13654-2_11

Cited by 2 publications

(1 citation statement)

References 6 publications

“…We consider a sampling of content categories. These are generally well recognized categories but they are not always clearly differentiated (Allen & Hall, 2010).…”

Section: Results (mentioning)

confidence: 99%

How historians use historical newspapers

Allen, Sieczkiewicz, 2010

Proc. Am. Soc. Info. Sci. Tech.

Self Cite

Newspapers have long been rich resources for historians. In the past several years many historical newspapers have been digitized, offering the promise of improved access and powerful searching. In this research, we focus on historians' needs for searching collections of newspapers and managing the information they find. This is a deeper and more targeted investigation than much previous work that was based on surveys rather than personal interviews. We interviewed eight academic historians who largely embraced digitized newspapers but suggest the current systems still have many limitations. We also discuss the implications for the design of interfaces and services that would serve as a historians' workbench.

Looking Back to 1850 in 2025

Costa, Mateus, Pinto et al., 2025

Advances in Linguistics and Communication Studies

This chapter analyses current technologies and the challenges involved in extracting and classifying articles and news headlines from historical journals, as well as converting images to text format. The work to develop a tool focused on digitising historical journals was carried out by a multidisciplinary team of experts in media studies, artificial intelligence, image processing, and cultural heritage preservation. The data used derives from two historic Portuguese journals, Diário de Notícias and Jornal de Notícias, which were created in the mid-19th century. This project is based on a mixture of heuristics, computer vision, pattern recognition, and other artificial intelligence and machine learning techniques. The main challenges included the variability in the design of historical journals, preserving the quality of images over time, and continuously improving image processing and OCR techniques to adapt to different styles and periods of newspapers.
