2012
DOI: 10.1007/978-3-031-02146-6

Natural Language Processing for Historical Texts

Cited by 59 publications (16 citation statements)
References: 88 publications
“…Information retrieval of historical newspapers has several challenges, among which are, e.g. OCR quality, spelling variation of historical language, lack of proper tools for natural language processing of older language and lack of structure in the optically read documents (Lopresti, 2009; Gotscharek et al, 2011; Piotrowski, 2012; Järvelin et al, 2016; Karlgren et al, 2019; Pfanzelter et al, 2021; Torget et al, 2022). In their present state, historical newspapers are a hard task for information retrieval engines, and users of the collections, such as researchers of digital humanities, are many times not very satisfied with the search possibilities and may have low trust in the search results (Jarlbrink and Snickars, 2017; Pfanzelter et al, 2021).…”
Section: Related Research (mentioning)
confidence: 99%
“…Applying NLP tools, such as POS taggers, syntactic parsers, and named entity recognisers to historical texts is difficult, because most existing NLP tools are developed for modern languages [118,140] and historical language use often differs significantly from its modern counterpart. The two often have different linguistic aspects, such as lexicon, morphology, syntax, and semantics which make a naive use of these tools problematic [144,159].…”
Section: NLP Challenges (mentioning)
confidence: 99%
“…Projects, scientific meetings 1 and studies like OPATCH dealing with historical texts (Piotrowski, 2012) are numerous and one recurring theme is the struggle for clean OCR-ed data.…”
Section: Previous Work (mentioning)
confidence: 99%
“…Grasping this unique opportunity however calls for advanced methods for the automatic semantic analysis of digital historical sources. The application of NLP methods and tools to historical texts is indeed attracting growing interest and raises interesting and highly challenging research issues (Piotrowsky 2012).…”
Section: Introduction (mentioning)
confidence: 99%