Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguisti 2014
DOI: 10.3115/v1/e14-2014
|View full text |Cite
|
Sign up to set email alerts
|

Finding Terms in Corpora for Many Languages with the Sketch Engine

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
46
0
8

Year Published

2014
2014
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 60 publications
(54 citation statements)
references
References 2 publications
0
46
0
8
Order By: Relevance
“…A corpus can be collected from the web, using the Corpus Factory (Kilgarriff et al 2010) or TenTen (Jakubíček et al 2013) method.…”
mentioning
confidence: 99%
“…A corpus can be collected from the web, using the Corpus Factory (Kilgarriff et al 2010) or TenTen (Jakubíček et al 2013) method.…”
mentioning
confidence: 99%
“…In this regard, it is a common feature across different languages that specifications do not need to be marked with a connective. For example, in both French (example 5) and German (example 6), specifications without a connective can be left implicit without loss of coherence (all French examples are taken from the French Web corpus frTenTen17; Jakubíček, Kilgarriff, Kovář, Rychlý & Suchomel, 2013; all German examples are taken from the German corpus German WebTenTen13, Jakubíček et al, 2013; by using the search engine SketchEngine; Kilgarriff, Baisa, Bušta, Jakubíček, Kovář, Michelfeit, et al, 2014) . It is, however, also possible in both languages to explicitly mark specifications.…”
Section: Research Backgroundmentioning
confidence: 99%
“…Sketch Engine is named after one of its key features-word sketches. It employs a contrastive two-step approach to terminology extraction-first the grammatical validity of a phrase (unithood) is assessed using the term grammar, next the normalized frequencies of TCs from the focus corpus are contrasted (termhood) with those in the reference corpus [31] by using the "Simple Math" (with an add-N parameter of one) statistics [32]. Sketch Engine implements ATE as two separate processes, dependent on the user needs-keywords extraction and multiwords extraction.…”
Section: Automatic Extraction Taskmentioning
confidence: 99%