2006
DOI: 10.1007/11729976_10

Genetic Programming, Validation Sets, and Parsimony Pressure

Abstract: Fitness functions based on test cases are very common in Genetic Programming (GP). This process can be assimilated to a learning task, with the inference of models from a limited number of samples. This paper is an investigation of two methods to improve generalization in GP-based learning: 1) the selection of the best-of-run individuals using a three data sets methodology, and 2) the application of parsimony pressure in order to reduce the complexity of the solutions. Results using GP in a binary classification…

Cited by 60 publications (56 citation statements)
References 17 publications
“…To determine loss for a build script for example, the value may be determined by counting the number of actions that execute successfully and dividing by the total number of steps. A further consideration in semantic evaluation is parsimony, which is the general expectation that the shortest adequate solution is to be preferred (Gagne et al, 2006). To incorporate parsimony in the evaluation we can add a measure(s) of the solution's cost(s), such as the size of the label y and/or execution resources consumed, to L.…”
Section: Loss Function Variations
confidence: 99%
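For illustration, here is a minimal sketch of the loss construction described in the statement above: a base loss given by the fraction of failed steps, plus a parsimony term proportional to the solution's size added to L. The function name, the weight alpha, and the toy numbers are assumptions for illustration, not code from the cited papers.

```python
# Hypothetical sketch: loss = (fraction of failed steps) + alpha * solution size.
# All names and values are illustrative assumptions, not the cited papers' code.

def loss(step_results: list[bool], solution_size: int, alpha: float = 0.01) -> float:
    """Base loss is the fraction of failed steps; parsimony adds a cost for larger solutions."""
    success_rate = sum(step_results) / len(step_results)
    base_loss = 1.0 - success_rate             # lower is better
    return base_loss + alpha * solution_size   # alpha weights the size penalty

# Example: 7 of 10 build-script actions succeed, candidate solution has 42 nodes.
print(loss([True] * 7 + [False] * 3, solution_size=42))  # approximately 0.3 + 0.42 = 0.72
```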
“…While generalisation has traditionally been underexplored in the GP literature, there have been a number of recent papers examining this important issue [3,6,9,11,19,22,25]. Among the techniques proposed to counteract overfitting, popular examples include the use of parsimony constraints, and the use of a validation set.…”
Section: Causes Of Overfitting
confidence: 99%
“…Gagné and co-authors [9] also investigate the use of three datasets (training data, validation data, and test data). They evolve solutions to binary classification problems using Genetic Programming.…”
Section: Validation Sets and Parsimony
confidence: 99%
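As a rough sketch of the three-data-set protocol this statement refers to: candidates are evolved against the training set, the best-of-run individual is chosen by its validation-set score, and only the test set measures generalization. The toy classifiers, data, and function names below are hypothetical and do not reproduce Gagné et al.'s implementation.

```python
# Hypothetical sketch of best-of-run selection using a separate validation set.

def accuracy(model, dataset):
    """Fraction of (x, y) pairs the model classifies correctly."""
    return sum(model(x) == y for x, y in dataset) / len(dataset)

def best_of_run(candidates, validation_set):
    """Select the final individual on the validation set rather than the training set."""
    return max(candidates, key=lambda m: accuracy(m, validation_set))

# Toy usage with two hypothetical evolved classifiers on a binary task.
train = [(0, 0), (1, 1), (2, 0), (3, 1)]   # used during evolution (not shown here)
valid = [(4, 0), (5, 1)]                    # used only to pick the best-of-run individual
test  = [(6, 0), (7, 1)]                    # used only to report generalization
candidates = [lambda x: x % 2, lambda x: 1]  # e.g. champions from separate GP runs
chosen = best_of_run(candidates, valid)
print("test accuracy:", accuracy(chosen, test))
```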