Proceedings of the Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanit 2017
DOI: 10.18653/v1/w17-2202

Machine Translation and Automated Analysis of the Sumerian Language

Abstract: This paper presents a newly funded international project for machine translation and automated analysis of ancient cuneiform languages where NLP specialists and Assyriologists collaborate to create an information retrieval system for Sumerian. This research is conceived in response to the need to translate large numbers of administrative texts that are only available in transcription, in order to make them accessible to a wider audience. The methodology includes creation of a specialized NLP pipeline an…

Cited by 17 publications (14 citation statements)
References 9 publications
“…For example, a sign detector might fuse detections from 2D images from different angles of a clay tablet in order to deal with difficult cases. Similarly, if language models for cuneiform script become available in the future and problems like data sparsity are overcome [50], their integration will also help with ambiguous detections.…”
Section: Discussion (mentioning)
confidence: 99%
“…This is the only related work we are aware of in which cuneiform texts encoded in Unicode cuneiform have been processed previous to our experiments. Pagé-Perron et al (2017) present a project dedicated to creating a pipeline for Sumerian texts. The pipeline is planned to take in transliterated Sumerian and to produce POS tag annotations and lemmatization as well as machine translation into English.…”
Section: Cuneiform Script and Computational Methods (mentioning)
confidence: 99%
“…Homburg and Chiarcos (2016) report preliminary results on automatic word segmentation for Akkadian cuneiform using rule-based, dictionary based, and data-driven statistical techniques. Pagé-Perron et al (2017) furnish an analysis of Sumerian text including morphology, parts-of-speech (POS) tagging, syntactic parsing, and machine translation using a parallel corpus. Although Sumerian and Akkadian are both geographically and chronologically close to PE, these corpora are very large (e.g.…”
Section: LDA Summary (mentioning)
confidence: 99%