Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL '02), 2002
DOI: 10.3115/1073083.1073105

Active learning for statistical natural language parsing

Abstract: It is necessary to have a (large) annotated corpus to build a statistical parser. Acquisition of such a corpus is costly and time-consuming. This paper presents a method to reduce this demand using active learning, which selects which samples to annotate instead of blindly annotating the whole training corpus. Sample selection for annotation is based upon "representativeness" and "usefulness". A model-based distance is proposed to measure the difference between two sentences and their most likely parse trees. Based o…
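The selection idea in the abstract can be sketched, purely for illustration, as a pool-based active learning step that ranks unlabeled sentences by the parser's uncertainty. This is a minimal sketch under stated assumptions: the entropy scoring, the `parse_probs` mapping, and all names below are hypothetical stand-ins, not the paper's actual model-based distance or sample-selection criteria.

```python
import math

def entropy(probs):
    """Shannon entropy of a parser's distribution over candidate parses."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def select_for_annotation(pool, parse_probs, budget):
    """Pick the `budget` most uncertain sentences from the unlabeled pool.

    `parse_probs` maps each sentence to the probabilities the current
    parser assigns to its candidate parse trees (hypothetical interface).
    """
    ranked = sorted(pool, key=lambda s: entropy(parse_probs[s]), reverse=True)
    return ranked[:budget]

# Toy pool: three "sentences" with parser confidence distributions.
pool = ["s1", "s2", "s3"]
parse_probs = {
    "s1": [0.9, 0.1],          # parser is confident
    "s2": [0.5, 0.5],          # maximally uncertain between two parses
    "s3": [0.85, 0.1, 0.05],   # fairly confident among three parses
}
print(select_for_annotation(pool, parse_probs, budget=1))  # ['s2']
```

Only the selected sentences would then be sent to human annotators and added to the training set, after which the parser is retrained and the loop repeats.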

Cited by 105 publications (82 citation statements)
References 14 publications
“…Such has happened in the case of part of speech tagging, where the query by committee methods are generalized to apply to hidden Markov models (Dagan and Engelson 1995). In parsing, uncertainty sampling (Hwa 2004) and other heuristic approaches have been applied (Tang et al 2002). A recent trend in the pool-based active learning literature has been to take various approaches, usually uncertainty sampling or query by committee and try to improve performance through additional heuristics.…”
Section: Heuristic Generalizations and Variations
confidence: 99%
“…A trend of the last ten years (Abe and Mamitsuka 1998; Banko and Brill 2001; Chen et al 2006; Dagan and Engelson 1995; Hwa 2004; Lewis and Gale 1994; McCallum and Nigam 1998; Melville and Mooney 2004; Roy and McCallum 2001; Tang et al 2002) has been to employ heuristic methods of active learning with no explicitly defined objective function. Uncertainty sampling (Lewis and Gale 1994), query by committee (Seung et al 1992), and variants have proven particularly attractive because of their portability across a wide spectrum of machine learning algorithms.…”
Section: Background and Related Work
confidence: 99%
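The query-by-committee heuristic mentioned in the passage above can be illustrated with a toy disagreement measure: vote entropy over the parses proposed by several committee members. This is a sketch only; the committee, the tree labels, and the data layout are all hypothetical.

```python
import math
from collections import Counter

def vote_entropy(votes):
    """Committee disagreement: entropy of the distribution of votes."""
    n = len(votes)
    counts = Counter(votes)
    return -sum((c / n) * math.log(c / n) for c in counts.values())

# Each committee member proposes a parse tree for each sentence;
# the sentence with the highest disagreement is queried for annotation.
committee_votes = {
    "s1": ["treeA", "treeA", "treeA"],   # full agreement
    "s2": ["treeA", "treeB", "treeC"],   # maximal disagreement
}
most_informative = max(committee_votes, key=lambda s: vote_entropy(committee_votes[s]))
print(most_informative)  # s2
```

Uncertainty sampling uses a single model's confidence instead, but both reduce to scoring unlabeled examples and querying the highest-scoring ones.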
“…One actively researched approach to this problem is to develop weakly supervised algorithms that require less training data, such as active learning (Hermjakob and Mooney 1997; Tang et al 2002; Baldridge and Osborne 2003; Hwa 2004) and co-training (Sarkar 2001; Steedman et al 2003). In this article, we explore an alternative: using parallel text as a means for transferring syntactic knowledge from a resource-rich language to a language with fewer resources.…”
Section: Introduction
confidence: 99%
“…AL has been successfully applied to many tasks in natural language processing, including parsing (Tang et al, 2002) and named entity recognition (Miller et al, 2004), to mention just a few. See (Olsson, 2009) for a comprehensive overview of the application of AL to natural language processing.…”
Section: Introduction and Related Research
confidence: 99%