Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, Volume 2: Short Pa 2014
DOI: 10.3115/v1/e14-4036
|View full text |Cite
|
Sign up to set email alerts
|

Active Learning for Post-Editing Based Incrementally Retrained MT

Abstract: Machine translation, in particular statistical machine translation (SMT), is making big inroads into the localisation and translation industry. In typical workflows (S)MT output is checked and (where required) manually post-edited by human translators. Recently, a significant amount of research has concentrated on capturing human post-editing outputs as early as possible to incrementally update/modify SMT models to avoid repeat mistakes. Typically in these approaches, MT and post-edits happen sequentially and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2014
2014
2014
2014

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 7 publications
0
2
0
Order By: Relevance
“…On the other hand, some of the large scale curation efforts, such as ChEMBL, provide funding for expert curators to manually annotate bioassay data, but this is too labor intensive to execute in detail, and is currently limited to identifying the target ( Gaulton et al, 2012 ). Approaches such as Active Learning (AL) have also been applied to classification of domain specific text documents ( Cohn, Ghahraman & Jordan, 1996 ; Dara et al, 2014 ; Tomanek & Hahn, 2007 ). Our objective in this work is to provide the necessary capabilities to annotate bioassay protocols, in a significant level of detail, such that the semantic content is a relatively complete description of the assay.…”
Section: Introductionmentioning
confidence: 99%
“…On the other hand, some of the large scale curation efforts, such as ChEMBL, provide funding for expert curators to manually annotate bioassay data, but this is too labor intensive to execute in detail, and is currently limited to identifying the target ( Gaulton et al, 2012 ). Approaches such as Active Learning (AL) have also been applied to classification of domain specific text documents ( Cohn, Ghahraman & Jordan, 1996 ; Dara et al, 2014 ; Tomanek & Hahn, 2007 ). Our objective in this work is to provide the necessary capabilities to annotate bioassay protocols, in a significant level of detail, such that the semantic content is a relatively complete description of the assay.…”
Section: Introductionmentioning
confidence: 99%
“…The application of AL techniques to MT involves asking a human oracle to supervise a fraction of the incoming data (Bloodgood and Callison-Burch, 2010). Once the human has revised these samples, they are used to improve the MT system, via incremental (González-Rubio et al, 2012) or batch learning (Dara et al, 2014). Therefore, a key element of AL is the so-called sampling strategy, which determines the sentences that should be corrected by the human.…”
Section: Active Learning For the Interactive-predictive Translation Omentioning
confidence: 99%