Design and Implementation of a Historical German Firm-level Financial Database

Gram, Dennis; Karapanagiotis, Pantelis; Liebald, Marius; Walz, Uwe

doi:10.1145/3531533

Cited by 2 publications

(2 citation statements)

References 39 publications

“…This use case refers to the matching of entities from repeated annual cross-sections of firm data extracted via OCR from historical German yearbooks. Gram et al (2022) describe the data extraction process, detail the underlying relational data model, and provide summary statistics for a subset of the fields extracted for the period 1920-1932. The database is implemented according to the FAIR principles and will be made fully available to the public in the upcoming years.…”

Section: A Use Case: OCR-extracted Historical Firm-level Data (mentioning)

confidence: 99%

“…Finally, we present an application of our matching framework in a domain with dirty firm-level financial data that we extracted from historical archives by using Optical Character Recognition (OCR) software (Kamlah et al, 2022). The data represent German firms operating in the period from 1910 to 1919 with non-harmonized and non-standardized attributes extracted from the "Handbuch der deutschen Aktiengesellschaften" (see also Gram et al, 2022). In a 5-fold cross-validation with 30% train and 70% test random sample splits, our framework achieves an average 99.36 F-score in the test sub-sample.…”

Section: Introduction (mentioning)

confidence: 99%
