Using Human Ratings for Feedback Control: A Supervised Learning Approach With Application to Rehabilitation Robotics

Menner, Marcel; Neuner, Lukas; Lünenburger, Lars; Zeilinger, Melanie N.

doi:10.1109/tro.2020.2964147

Cited by 11 publications

(7 citation statements)

References 40 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Further related-but conceptually different-approaches are to learn policies rather than an objective function [38], [39], which is often referred to as imitation learning, or to use labeled data in order to learn an objective function, e.g., using supervised learning [40]- [42]. Notably, [40] uses semisupervised learning with a similar motivation, where drivers are classified into aggressive and normal driving styles based on a few labeled data points.…”

Section: Imitation Learning and Supervised Learningmentioning

confidence: 99%

“…Therefore, we are more constrained in the solution but need less data and have properties that are invariant through the learning process, which we can use to enforce safe behaviors while learning. Compared to supervised learning methods [40]- [42], we do not require labeled data in order to learn a control objective. On the other hand, inverse learning methods that use unlabeled data, such as IOC, IRL, and our method, require the assumption that the data represent desirable behavior.…”

Section: Imitation Learning and Supervised Learningmentioning

confidence: 99%

See 1 more Smart Citation

Inverse Learning for Data-Driven Calibration of Model-Based Statistical Path Planning

Menner

Berntorp

Zeilinger

et al. 2021

IEEE Trans. Intell. Veh.

Self Cite

View full text Add to dashboard Cite

This paper presents a method for inverse learning of a control objective defined in terms of requirements and their joint probability distribution from data. The probability distribution characterizes tolerated deviations from the deterministic requirements and is learned using maximum likelihood estimation from data. Further, this paper introduces both parametrized requirements for motion planning in autonomous driving applications and methods for the estimation of their parameters from driving data. Both the parametrized requirements and their joint probability distributions are estimated using a posterior distribution such that the control objective is personalized from a prior as driver data are accumulated. Finally, three variants of the learning method are presented that vary in computational complexity and data storage requirements. Key advantages of the proposed inverse learning method are a relatively low computational complexity, a need for a limited amount of data, and that the data do not have to be segmented into specific maneuvers, which makes the method easily implementable. Learning results using data of five human drivers in a simulation environment suggest that the proposed model for human-conscious driving along with the proposed learning method enable a more natural and personalized driving style of autonomous vehicles for their human passengers.

show abstract

Section: Imitation Learning and Supervised Learningmentioning

confidence: 99%

Section: Imitation Learning and Supervised Learningmentioning

confidence: 99%

Inverse Learning for Data-Driven Calibration of Model-Based Statistical Path Planning

Menner

Berntorp

Zeilinger

et al. 2021

IEEE Trans. Intell. Veh.

Self Cite

View full text Add to dashboard Cite

show abstract

“…2. The estimates Pi,t , qi,t and ri,t of the unknown parameters P i , q i and r i of U i are updated by means of an ad-hoc learning procedure (7). Such a procedure relies on a recursive least square scheme which makes use only of the most updated data (y i,t , x i,t ), thus not requiring to store and use all the past points generated by the distributed algorithm.…”

Section: Distributed Algorithm Descriptionmentioning

confidence: 99%

“…The aim of the learning part of Algorithm 1 (cf. (7)) is to provide a recursive scheme to let each agent i estimate the unknown parameters of U i . Specifically, the considered scheme aims at solving, for each t, the least squares (LS) problem minimize…”

Section: Parameters Estimation Via Recursive Least Squaresmentioning

confidence: 99%

“…Important examples of this class of systems are the energy grid, whenever human preferences are considered into energy demands and consumption schedules [3,4], or transportation infrastructures, whenever human behavior and routing preferences are taken into account in the decision making process [5,6]. Other examples stem from personalized healthcare [7] and robotics [8].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Distributed Personalized Gradient Tracking with Convex Parametric Models

Notarnicola¹,

Simonetto²,

Farina³

et al. 2020

Preprint

View full text Add to dashboard Cite

We present a distributed optimization algorithm for solving online personalized optimization problems over a network of computing and communicating nodes, each of which linked to a specific user. The local objective functions are assumed to have a composite structure and to consist of a known time-varying (engineering) part and an unknown (userspecific) part. Regarding the unknown part, it is assumed to have a known parametric (e.g., quadratic) structure a priori, whose parameters are to be learned along with the evolution of the algorithm. The algorithm is composed of two intertwined components: (i) a dynamic gradient tracking scheme for finding local solution estimates and (ii) a recursive least squares scheme for estimating the unknown parameters via user's noisy feedback on the local solution estimates. The algorithm is shown to exhibit a bounded regret under suitable assumptions. Finally, a numerical example corroborates the theoretical analysis.

show abstract

Surface Electromyography-Controlled Vehicle Braking Assistance System Using Deep Learning

Tran

Wang

Yusuke

et al. 2021

Advances in Human Aspects of Transportation

View full text Add to dashboard Cite

Using Human Ratings for Feedback Control: A Supervised Learning Approach With Application to Rehabilitation Robotics

Cited by 11 publications

References 40 publications

Inverse Learning for Data-Driven Calibration of Model-Based Statistical Path Planning

Inverse Learning for Data-Driven Calibration of Model-Based Statistical Path Planning

Distributed Personalized Gradient Tracking with Convex Parametric Models

Surface Electromyography-Controlled Vehicle Braking Assistance System Using Deep Learning

Contact Info

Product

Resources

About