2020
DOI: 10.6084/m9.figshare.11560059
|View full text |Cite
|
Sign up to set email alerts
|

The Atlas of Digitised Newspapers and Metadata: Reports from Oceanic Exchanges

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 0 publications
0
4
0
Order By: Relevance
“…Recent evaluation studies reported in International Conference on Document Analysis and Recognition 2017 and 2019 (Clausner et al, 2017(Clausner et al, , 2019 show that especially state-of-the-art systems (commercial and open source) used many times by libraries in data production do not perform very well in page segmentation and region classification of complex layout document pages. Among the 10-digitized historical newspaper collections gathered in Beals and Bell (2020), some do have article extraction, but many do not have it. Article extraction in our study collection has been produced automatically based on a machine learning model and the quality of the result has not been assessed on a large scale, only with a small evaluation collection (cf.…”
Section: Topic Creation For the Studymentioning
confidence: 99%
See 1 more Smart Citation
“…Recent evaluation studies reported in International Conference on Document Analysis and Recognition 2017 and 2019 (Clausner et al, 2017(Clausner et al, , 2019 show that especially state-of-the-art systems (commercial and open source) used many times by libraries in data production do not perform very well in page segmentation and region classification of complex layout document pages. Among the 10-digitized historical newspaper collections gathered in Beals and Bell (2020), some do have article extraction, but many do not have it. Article extraction in our study collection has been produced automatically based on a machine learning model and the quality of the result has not been assessed on a large scale, only with a small evaluation collection (cf.…”
Section: Topic Creation For the Studymentioning
confidence: 99%
“…Since that several national libraries and other stakeholders, such as publishers, have produced and are currently producing more and more digitized historical content online out of their newspaper collections. In a recent publication describing in detail 10 different digitized historical newspaper collections, Beals and Bell (2020) state thatover the past thirty years, national libraries, universities and commercial publishers around the world have made available hundreds of millions of pages of historical newspapers through mass digitisation and currently release over one million new pages per month worldwide. These have become vital resources not only for academics but for journalists, politicians, schools, and the general public.…”
Section: Introductionmentioning
confidence: 99%
“…These projects are working toward collaborative and integrative approaches to get closer to the shared vision of “finding meaning” in digitized historical newspaper data. The Atlas of Digitized Newspapers (Beals & Bell, 2020 ), an open access guide prepared by leading computational periodicals researchers from six European countries, already made an important step toward facilitating more historically informed understandings of digitized newspapers for researchers across disciplines.…”
Section: Introductionmentioning
confidence: 99%
“…Beals and Bell (2020), The Atlas of Digitised Newspapers and Metadata reveals the variety of metadata available across ten different newspaper databases.…”
mentioning
confidence: 99%