Symbolic Regression via Neural-Guided Genetic Programming Population Seeding

Mundhenk, T. Nathan; Landajuela, Mikel; Glatt, Ruben; Santiago, Claudio; Faissol, Daniel M.; Petersen, Brenden K.

doi:10.48550/arxiv.2111.00053

Cited by 7 publications

(11 citation statements)

References 20 publications


“…EDHiE can then evaluate the fit of the obtained expressions against the measurements. We conjecture that the performance of EDHiE on standard benchmarks [13,14] would compare favorably to that of a state-of-the-art symbolic regression method [15]. The results of our empirical evaluation of HVAE and EDHiE confirm our conjectures.…”

Section: Introduction (supporting)

confidence: 72%

“…Recently, many symbolic regression approaches based on neural networks have been proposed [14, 25-29]. In particular, Deep Symbolic Optimization (DSO) combines autoencoders with genetic programming to approach symbolic regression, among other combinatorial optimization tasks [15]. The neural networks are used to sample the individuals in the initial population of the evolutionary algorithm and are retrained at each iteration to focus on expressions leading to better fit.…”

Section: Related Work (mentioning)

confidence: 99%


Efficient Generator of Mathematical Expressions for Symbolic Regression

Mežnar, Džeroski, Todorovski

2023

Preprint


We propose an approach to symbolic regression based on a novel variational autoencoder for generating hierarchical structures, HVAE. It combines simple atomic units with shared weights to recursively encode and decode the individual nodes in the hierarchy. Encoding is performed bottom-up and decoding top-down. We empirically show that HVAE can be trained efficiently with small corpora of mathematical expressions and can accurately encode expressions into a smooth low-dimensional latent space. The latter can be efficiently explored with various optimization methods to address the task of symbolic regression. Indeed, random search through the latent space of HVAE performs better than random search through expressions generated by manually crafted probabilistic grammars for mathematical expressions. Finally, the EDHiE system for symbolic regression, which applies an evolutionary algorithm to the latent space of HVAE, reconstructs equations from a standard symbolic regression benchmark better than a state-of-the-art system based on a similar combination of deep learning and evolutionary algorithms.



“…Figure 12: Illustration of our model on a few benchmark datasets from the literature. We show the prediction of our model on six 2-dimensional datasets presented in [36] and used as a comparison point in a few recent works [37]. The input points are marked as black crosses.…”

Section: E Additional Out-of-domain Results (mentioning)

confidence: 99%

End-to-end symbolic regression with transformers

Kamienny, d’Ascoli, Lample, et al.

2022

Preprint


Symbolic regression, the task of predicting the mathematical expression of a function from the observation of its values, is a difficult task which usually involves a two-step procedure: predicting the "skeleton" of the expression up to the choice of numerical constants, then fitting the constants by optimizing a non-convex loss function. The dominant approach is genetic programming, which evolves candidates by iterating this subroutine a large number of times. Neural networks have recently been tasked to predict the correct skeleton in a single try, but remain much less powerful. In this paper, we challenge this two-step procedure, and task a Transformer to directly predict the full mathematical expression, constants included. One can subsequently refine the predicted constants by feeding them to the non-convex optimizer as an informed initialization. We present ablations to show that this end-to-end approach yields better results, sometimes even without the refinement step. We evaluate our model on problems from the SRBench benchmark and show that our model approaches the performance of state-of-the-art genetic programming with several orders of magnitude faster inference.


“…Symbolic regression using neural networks. As discussed in Section 1, the high approximation capacity of NNs can facilitate SR in both the symbol search [3, 12-15] and the expression representation [6-10]. In addition, there are studies to indirectly help SR with NNs.…”

Section: Related Work (mentioning)

confidence: 99%

“…The second group employs NNs to search the symbol connections, and non-linear optimizations like BFGS [11] can be employed to estimate symbol coefficients. For the searching procedure, [3, 12-14] leverage a Recursive Neural Network (RNN) as a policy network to iteratively generate optimal actions that can select and connect symbols. [15] employs large-scale pre-training to directly map from data to the symbolic equations.…”

Section: Introduction (mentioning)

confidence: 99%

CoNSoLe: Convex Neural Symbolic Learning

Li, Yang, Tong

2022

Preprint


Learning the underlying equation from data is a fundamental problem in many disciplines. Recent advances rely on Neural Networks (NNs) but do not provide theoretical guarantees in obtaining the exact equations owing to the non-convexity of NNs. In this paper, we propose Convex Neural Symbolic Learning (CONSOLE) to seek convexity under mild conditions. The main idea is to decompose the recovering process into two steps and convexify each step. In the first step of searching for right symbols, we convexify the deep Q-learning. The key is to maintain double convexity for both the negative Q-function and the negative reward function in each iteration, leading to provable convexity of the negative optimal Q function to learn the true symbol connections. Conditioned on the exact searching result, we construct a Locally-Convex equation Learner (LOCAL) neural network to convexify the estimation of symbol coefficients. With such a design, we quantify a large region with strict convexity in the loss surface of LOCAL for commonly used physical functions. Finally, we demonstrate the superior performance of the CONSOLE framework over the state-of-the-art on a diverse set of datasets. More recent SR studies leverage Neural Networks (NNs) with high representational power. For the NN-based SR, we mainly categorize them into two groups based on the roles of the NNs.
