2022
DOI: 10.1007/978-3-030-99739-7_44
|View full text |Cite
|
Sign up to set email alerts
|

Introducing the HIPE 2022 Shared Task: Named Entity Recognition and Linking in Multilingual Historical Documents

Abstract: We present the HIPE-2022 shared task on named entity processing in multilingual historical documents. Following the success of the first CLEF-HIPE-2020 evaluation lab, this edition confronts systems with the challenges of dealing with more languages, learning domain-specific entities, and adapting to diverse annotation tag sets. HIPE-2022 is part of the ongoing efforts of the natural language processing and digital humanities communities to adapt and develop appropriate technologies to efficiently retrieve and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 11 publications
0
4
0
Order By: Relevance
“…The StanfordNER-Toolkit is frequently used to train or reuse genre-specific classifiers to annotate historical documents such as letters (cf. Ehrmann et al 2021) and can be considered as a well-established tool for NER tasks. The software uses Conditional Random Field (CRF) algorithms (cf.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The StanfordNER-Toolkit is frequently used to train or reuse genre-specific classifiers to annotate historical documents such as letters (cf. Ehrmann et al 2021) and can be considered as a well-established tool for NER tasks. The software uses Conditional Random Field (CRF) algorithms (cf.…”
Section: Methodsmentioning
confidence: 99%
“…An automated and transferable method for extracting specific entities is already under discussion (cf. Hildenbrandt/Kamzelak 2019; Ehrmann et al 2021). A practical application has so far remained elusive but is currently developed in the course of the project Dehmel digital (cf.…”
Section: Modelling Narrative Space In Novels and Letter Correspondencesmentioning
confidence: 99%
“…[5]. The development of various neural-based approaches helped increase the performance of NER systems on historical materials with F-scores going from 60-70% on average for rule based and traditional ML systems to, for the best neural systems, 80% [6].…”
Section: Analysis Of Research and Publicationsmentioning
confidence: 99%
“…The digitization of newspapers has greatly improved accessibility and clearly changed the nature of historical research, by enabling easier data access and analysis at scale through multilingual semantic data enrichment [7,42,10,6]. Through better document analysis results and semantic enrichment e.g., named entity recognition (NER), relation extraction (RE), event extraction (EE), the quality of the newspaper data offered by the libraries to its users is substantially improved [12,14,13,15]. Preserving the historical memory of entities and events from historical documents and making them accessible to a larger audience, not only limited to humanities scholars and experts, could lead to better organization of our historical knowledge [44,2,38].…”
Section: Introductionmentioning
confidence: 99%