Proceedings of the 15th Conference of the European Chapter of The Association for Computational Linguistics: Volume 2 2017
DOI: 10.18653/v1/e17-2080
|View full text |Cite
|
Sign up to set email alerts
|

Mapping the Perfect via Translation Mining

Abstract: Semantic analyses of the PERFECT often defeat their own purpose: by restricting their attention to 'real' perfects (like the English one), they implicitly assume the PERFECT has predefined meanings and usages. We turn the tables and focus on form, using data extracted from multilingual parallel corpora to automatically generate semantic maps (Haspelmath, 1997) of the sequence 'HAVE/BE + past participle' in five European languages (German, English, Spanish, French, Dutch). This technique, which we dub Translati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(16 citation statements)
references
References 7 publications
0
16
0
Order By: Relevance
“…[4] The software applications PerfectExtractor and TimeAlign have been developed in collaboration with the Utrecht Digital Humanities Lab, and are freely available through the project's website, see https://time-in-translation.hum.uu.nl/. The technicalities of the Translation Mining methodology are outlined in Van der Klis et al (2017), and illustrated there with a multilingual dataset of PERFECTs extracted from the Europarl parallel corpus (Koehn 2005). Here, we focus on the use we make of the Translation Mining methodology for linguistic analysis.…”
Section: Data Collection and Annotationmentioning
confidence: 99%
See 1 more Smart Citation
“…[4] The software applications PerfectExtractor and TimeAlign have been developed in collaboration with the Utrecht Digital Humanities Lab, and are freely available through the project's website, see https://time-in-translation.hum.uu.nl/. The technicalities of the Translation Mining methodology are outlined in Van der Klis et al (2017), and illustrated there with a multilingual dataset of PERFECTs extracted from the Europarl parallel corpus (Koehn 2005). Here, we focus on the use we make of the Translation Mining methodology for linguistic analysis.…”
Section: Data Collection and Annotationmentioning
confidence: 99%
“…As one could expect with 29 attested 7-tuple types, clusters are not always easy to [7] We leave aside technicalities on MDS here. See Van der Klis et al (2017) and Van der Klis & Tellings (published online 13 March 2021) for a more thorough introduction to MDS and the way we implement it here.…”
Section: A Cartographic Inventory Of the Datamentioning
confidence: 99%
“…The methodology relies on parallel corpus data, and we use multidimensional scaling as a statistical and visualization technique to reveal the patterns. This resembles the approaches in Wälchli and Cysouw (2012), Wälchli (2018Wälchli ( /2019, and has been dubbed Translation Mining by van der Klis et al (2017). The methodology will be introduced in Section 3, but see van der Klis and Tellings (2022) for a more exhaustive overview.…”
Section: Before (Na)mentioning
confidence: 99%
“…MDS is a statistical technique that reduces a complex dataset with variation in many dimensions to a lower-dimensional representation that can be displayed visually as a scatterplot, known as a semantic map. This methodology has been used both in large-scale cross-linguistic examinations, such as Croft and Poole (2008) and Wälchli and Cysouw (2012), as well as in studies comparing just a few languages, such as van der Klis et al (2017). van der Klis and Tellings (2022) provide the technical background of MDS, an explanation of how to interpret MDS maps, and an overview of the application of MDS in linguistic theory.…”
Section: Mds Semantic Mapsmentioning
confidence: 99%
“…In the GMB and the AMR Bank tense is simply ignored. Annotating aspect is complex-for instance, the use of the perfect differs enormously even between closely related languages such as English, Dutch, and Italian (van der Klis et al, 2017). These complications lead to a simple annotation model in the PMB where tense is reduced to a manageable set of three tenses: past, present and future.…”
Section: Divide and Conquermentioning
confidence: 99%