Henry Díaz scite author profile

This paper presents a combined identification/Q-function fitting methodology that involves identifying a Takagi-Sugeno model, computing (sub)optimal controllers from Linear Matrix Inequalities (LMIs), and then fitting the Q-function to data via monotonic optimisation. The LMI-based initialisation provides a conservative solution, but it is a sensible starting point that avoids the convergence and local-minima issues of raw data-based fitted Q-iteration or Bellman residual minimisation. An inverted-pendulum experimental case study illustrates the approach.


Approximate dynamic programming methodology for data-based optimal control (Metodología de programación dinámica aproximada para control óptimo basada en datos)

Díaz

Armesto

Sala

2019

Rev. iberoam. autom. inform. ind.


<p>This article presents a methodology for learning data-based optimal controllers in the context of approximate dynamic programming. Earlier dynamic programming solutions use linear programming in discrete state spaces, but they cannot be applied directly to continuous spaces. The aim of the methodology is to compute data-based optimal controllers for continuous state spaces, obtained from a lower-bound estimate of the accumulated cost using function approximators with linear parametrisation. The problem is solved non-iteratively with linear programming, but satisfactory results (avoiding unbounded or ill-conditioned solutions) require suitable regularisation conditions on the regressors and a cost for leaving the region with valid data.</p>


Multi-object tracking with deep learning ensemble for unmanned aerial system applications

Xie

Ide

Izadi

et al. 2021



Henry Díaz

Hierarchical Reinforcement Learning for Air-to-Air Combat

Hierarchical Reinforcement Learning for Air Combat at DARPA's AlphaDogfight Trials

Fitted Q-Function Control Methodology Based on Takagi–Sugeno Systems

