2020
DOI: 10.1002/wics.1534
|View full text |Cite
|
Sign up to set email alerts
|

Differential equations in data analysis

Abstract: Differential equations have proven to be a powerful mathematical tool in science and engineering, leading to better understanding, prediction, and control of dynamic processes. In this paper, we review the role played by differential equations in data analysis. More specifically, we consider the intersection between differential equations and data analysis in the light of modern statistical learning methodologies. This article is categorized under: Data: Types and Structure > Time Series, Stochastic Processes,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 13 publications
(10 citation statements)
references
References 227 publications
(243 reference statements)
0
10
0
Order By: Relevance
“…It is clear that ɛ ( k ), u 1 ( k ) and u 2 ( k ) in Equation (20), and ζ ( k ), v 1 ( k ) and v 2 ( k ) in Equation (21), explicitly containing the measurement error e ( k ), makes the resultant estimates biased and inconsistent and also indicates the main source of errors causing bias. The pseudo linear regression Equation (20) and Equation (21) are, in fact, error-in-variables models and thus can be solved by using advanced methods, such as the two-stage least squares (Dattner, 2020) and prediction error minimization-based non-linear least squares (Wei and Xie, 2021).…”
Section: Simulationsmentioning
confidence: 99%
“…It is clear that ɛ ( k ), u 1 ( k ) and u 2 ( k ) in Equation (20), and ζ ( k ), v 1 ( k ) and v 2 ( k ) in Equation (21), explicitly containing the measurement error e ( k ), makes the resultant estimates biased and inconsistent and also indicates the main source of errors causing bias. The pseudo linear regression Equation (20) and Equation (21) are, in fact, error-in-variables models and thus can be solved by using advanced methods, such as the two-stage least squares (Dattner, 2020) and prediction error minimization-based non-linear least squares (Wei and Xie, 2021).…”
Section: Simulationsmentioning
confidence: 99%
“…20 On the other hand, accounting for unnecessary age groups (over stratifying the population) inflates the number of parameters that need to be inferred and can lead to both identification problems and a heavy computational burden. [21][22][23] In addition, complex models are often too difficult to study using analytic techniques and are instead examined using numerical methods. Having a data-driven criterion for clustering incidence data into age groups is instrumental in constructing accurate, yet parsimonious age group models.…”
Section: Introductionmentioning
confidence: 99%
“…Despite its compute advantages, traditional two-stage approaches suffer from limited accuracy for real systems and, at best, are used to provide an initial guess for parameter values . This is because, especially when data is noisy or contains outliers, data-driven models used to interpolate data tend to yield low-quality derivative estimates, which reduces the quality of parameter estimates obtained when solving the NLP .…”
Section: Introductionmentioning
confidence: 99%
“…Despite its compute advantages, traditional two-stage approaches suffer from limited accuracy for real systems and, at best, are used to provide an initial guess for parameter values. 40 This is because, especially when data is noisy or contains outliers, data-driven models used to interpolate data tend to yield low-quality derivative estimates, which reduces the quality of parameter estimates obtained when solving the NLP. 41 Furthermore, it is often the desire to experimentally explore a system using multiple experimental runs with varying conditions, yet none of the derivative estimation techniques currently proposed have a straightforward way to account for multiple batches of data with a single data-driven model.…”
Section: Introductionmentioning
confidence: 99%