2007
DOI: 10.1093/llc/fqm044

The Identification of Spelling Variants in English and German Historical Texts: Manual or Automatic?

Cited by 17 publications (6 citation statements)
References 7 publications
“…This is the case of research using NER for materials with disparate qualities of digitization and OCR, non-European or classical languages, or collections featuring spelling variations (Alex et al, 2012; Batjargal et al, 2014; Neudecker et al, 2014; Nagai et al, 2015; Erdmann et al, 2016; Kettunen et al, 2016). Previous research within Digital Humanities has also tackled the related problem of text geoparsing, leveraging NER methods for recognizing place references in text, often together with other heuristics for the complete resolution of place references into gazetteer entries and/or geographical coordinates (Rayson et al, 2006; Baron and Rayson, 2008; Pilz et al, 2008; Grover et al, 2010; Freire et al, 2011; Gregory and Hardie, 2011; Brown et al, 2012; Alex et al, 2015; Gregory et al, 2015; Santos et al, 2015a,b; Wing, 2015; Clifford et al, 2016).…”
Section: NER in Historical Documents
Citation type: mentioning
confidence: 99%
“…Koolen et al considered the spelling and pronunciation differences between ancient and modern Dutch [20], while Gotscharek et al [21] and Hauser et al [22] considered the spelling differences and variations between modern and archaic German. Pilz et al considered spelling variations of English and German historical texts [23]. In general, the main challenge for historical European languages like Dutch, English and German is the spelling variants.…”
Section: Adopting Cross-language and Cross-chronological Information
Citation type: mentioning
confidence: 99%
“…This approach outperformed different combinations of stemming, edit distance, and word form generation, showing that handling both inflection and historical variation is important for highly inflectional languages. Pilz, Ernst‐Gerlach, Kempken, Rayson, and Archer () found that automatic approaches to historical variant generation can reproduce manually generated gold standard rules quite well and may also capture variation that is not discovered manually. They argued for generic letter‐replacement heuristics for Germanic languages and showed that an edit distance variant where the edit costs were learned from German historical corpus outperformed the standard edit distance of English historical data.…”
Section: Related Research
Citation type: mentioning
confidence: 99%
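
The last statement above contrasts the standard edit distance with a variant whose edit costs are learned from a historical corpus. As a rough illustration of that idea only, the Python sketch below computes an edit distance in which selected letter substitutions are cheaper than the default cost of 1. The function name `weighted_edit_distance`, the `sub_costs` table, and the cost value 0.1 are assumptions made up for this example; they are not taken from the cited paper, where the costs were learned from data rather than set by hand.

```python
from typing import Dict, Optional, Tuple


def weighted_edit_distance(
    source: str,
    target: str,
    sub_costs: Optional[Dict[Tuple[str, str], float]] = None,
    indel_cost: float = 1.0,
) -> float:
    """Edit distance with per-pair substitution costs.

    sub_costs maps (source_char, target_char) to a cost; pairs that are
    not listed cost 1.0, which gives the standard Levenshtein distance
    when the table is empty.
    """
    sub_costs = sub_costs or {}
    m, n = len(source), len(target)

    # dist[i][j] = cheapest way to turn source[:i] into target[:j]
    dist = [[0.0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        dist[i][0] = i * indel_cost  # delete all of source[:i]
    for j in range(1, n + 1):
        dist[0][j] = j * indel_cost  # insert all of target[:j]

    for i in range(1, m + 1):
        for j in range(1, n + 1):
            a, b = source[i - 1], target[j - 1]
            substitution = 0.0 if a == b else sub_costs.get((a, b), 1.0)
            dist[i][j] = min(
                dist[i - 1][j] + indel_cost,        # deletion
                dist[i][j - 1] + indel_cost,        # insertion
                dist[i - 1][j - 1] + substitution,  # match or substitution
            )
    return dist[m][n]


# Hypothetical cost table: treat historical "v" for modern "u" as cheap.
hypothetical_costs = {("v", "u"): 0.1}

plain = weighted_edit_distance("vnto", "unto")
weighted = weighted_edit_distance("vnto", "unto", hypothetical_costs)
```

With an empty cost table the function behaves like the standard edit distance, so "vnto" and "unto" are one full substitution apart; with the hypothetical table the same pair scores much closer, which is the effect a learned cost model is meant to achieve for frequent historical letter replacements.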