2012
DOI: 10.1075/ijcl.17.3.05cer

The case of InterCorp, a multilingual parallel corpus

Abstract: This paper introduces InterCorp, a parallel corpus including texts in Czech and 27 other languages, available for online searches via a web interface. After discussing some issues and merits of a multilingual resource, we argue that it has an important role especially for languages with fewer native speakers, supporting both comparative research and studies of the language from the perspective of other languages. We proceed with an overview of the corpus: the strategy and criteria for including new texts, the r…

Cited by 58 publications (26 citation statements)
References 18 publications (10 reference statements)

“…De-participial secondary prepositions and conjunctions, such as according to, assuming that, concerning, including, regarding, were noted but not included in the data set since they can no longer be considered adverbials or modifiers. The data were drawn from the synchronic multilingual parallel corpus InterCorp, which is being compiled at Charles University in Prague (Čermák and Rosen 2012). The corpus comprises 31 languages, with Czech as the pivot language, i.e.…”
Section: Materials and Scope of the Study (mentioning)
confidence: 99%
“…This means that tools specific to parallel corpora, such as AntPConc, ParaConc or WordSmith Tools, among other tools commonly used for aligning and managing parallel corpora (Torrellas Castillo, 2009:11; Zanettin, 2012:180; Vigier and Sánchez, 2017:265), cannot be used for the present corpus. As for other tools that do accept files in tmx format, such as Corpus Query Processor (CQP), InterText or NoSketchEngine (Čermák and Rosen, 2012; Vondřička, 2014; Molés-Cases, 2016:161), they were ruled out because they require specialized IT work to create the corpus.…”
Section: Corpus Compilation and Management (unclassified)
“…Complementary literary texts, however, can easily be obtained from InterCorp [3]. The most recent, ninth version contains a section with original Serbian texts.…”
Section: Source of Data (mentioning)
confidence: 99%
“…Since other relevant criteria for our study are size, morphosyntactic annotation, type and variety of texts, the number of relevant sources drops drastically. srWaC [9] seems currently to be the most suitable source for studies on rarely occurring phenomena 3. 3 The only alternative is offered by the Corpus of Contemporary Serbian Language (version SrpKor2013) developed by Miloš Utvić and Duško Vitas, but it is five times smaller.…”
Section: Source of Data (mentioning)
confidence: 99%