2021
DOI: 10.1002/aic.17457
|View full text |Cite
|
Sign up to set email alerts
|

Deterministic symbolic regression with derivative information: General methodology and application to equations of state

Abstract: Symbolic regression methods simultaneously determine the model functional form and the regression parameter values by generating expression trees. Symbolic regression can capture the complexity of real‐world phenomena but the use of deterministic optimization for symbolic regression has been limited due to the complexity of the search space of existing formulations. We present a novel deterministic mixed‐integer nonlinear programming formulation for symbolic regression that incorporates derivative constraints … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 38 publications
0
4
0
Order By: Relevance
“…All of these approaches build on SR based on GAs, the original and most popular technique [1,13]. In contrast, Engle and Sahinidis developed a deterministic SR algorithm (using mixed-integer nonlinear programming) that constrains the equation space to functions that obey derivative constraints from theory, improving the quality of expressions for thermodynamic equations of state [25]. Another approach is the Bayesian machine scientist (BMS) [26].…”
Section: Incorporating Background Knowledge Into Srmentioning
confidence: 99%
“…All of these approaches build on SR based on GAs, the original and most popular technique [1,13]. In contrast, Engle and Sahinidis developed a deterministic SR algorithm (using mixed-integer nonlinear programming) that constrains the equation space to functions that obey derivative constraints from theory, improving the quality of expressions for thermodynamic equations of state [25]. Another approach is the Bayesian machine scientist (BMS) [26].…”
Section: Incorporating Background Knowledge Into Srmentioning
confidence: 99%
“…The latter, although extremely fast, produces non-interpretable equations [ 84 ]. In addition, there also exist Mixed-Integer Non-Linear Programming (MINLP) formulations [ 85 , 86 ], models that identify the SR problem as a linguistic [ 87 ], others that incorporate probabilistic features such as probabilistic framework [ 88 ] or probabilistic grammars [ 89 ], Bayesian approaches [ 90 ] and more [ 91 , 92 ].…”
Section: Symbolic Regressionmentioning
confidence: 99%
“…By leveraging fundamental results from real algebraic geometry, we obtain formal proofs of the correctness of our laws as a byproduct of the optimization problems. This is notable, because existing automated approaches to scientific discovery 22 – 25 , as reviewed in Section 1 of our supplementary material , often rely upon deep learning techniques that do not provide formal proofs and are prone to hallucinating incorrect scientific laws that cannot be automatically proven or disproven, analogously to output from state-of-the-art Large Language Models such as GPT-4 26 . As such, any new laws derived from these systems cannot easily be explained or justified.…”
Section: Introductionmentioning
confidence: 99%