Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Motion Learning for Unknown, Nonlinear Dynamics.

Michael Meindl, Dustin Lehmann, Thomas Seel
Author Information
  1. Michael Meindl: Embedded Mechatronics Laboratory, Hochschule Karlsruhe, Karlsruhe, Germany.
  2. Dustin Lehmann: Control Systems Group, Technische Universität Berlin, Berlin, Germany.
  3. Thomas Seel: Department Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany.

Abstract

This work addresses the problem of reference tracking in autonomously learning robots with unknown, nonlinear dynamics. Existing solutions require model information or extensive parameter tuning, and have rarely been validated in real-world experiments. We propose a learning control scheme that learns to approximate the unknown dynamics by a Gaussian Process (GP), which is used to optimize and apply a feedforward control input on each trial. Unlike existing approaches, the proposed method requires neither knowledge of the system states and their dynamics nor knowledge of an effective feedback control structure. All algorithm parameters are chosen automatically, i.e., the learning method works plug-and-play. The proposed method is validated in extensive simulations and real-world experiments. In contrast to most existing work, we study learning dynamics for more than one motion task as well as the robustness of performance across a large range of learning parameters. The method's plug-and-play applicability is demonstrated by experiments with a balancing robot, in which the proposed method rapidly learns to track the desired output. Owing to its model-agnostic and plug-and-play properties, the proposed method is expected to have high potential for application to a large class of reference tracking problems in systems with unknown, nonlinear dynamics.
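
The trial-by-trial idea described in the abstract (fit a GP to the data measured so far, then use it to choose the next feedforward input) can be illustrated with a toy sketch. This is not the authors' algorithm. It assumes a hypothetical static, scalar plant, fixed hand-picked GP hyperparameters rather than automatically chosen ones, a plain grid search per sample, and an added variance penalty that keeps the search near inputs already tried. All function names are illustrative.

```python
import numpy as np


def rbf(a, b, length=1.0):
    """Squared-exponential kernel between two 1-D input arrays."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / length) ** 2)


def gp_predict(u_train, y_train, u_query, noise=1e-4):
    """Posterior mean and variance of a zero-mean GP with an RBF kernel.

    The hyperparameters (length scale 1, noise 1e-4) are fixed by hand here,
    which is an assumption of this sketch.
    """
    K = rbf(u_train, u_train) + noise * np.eye(len(u_train))
    Ks = rbf(u_query, u_train)
    mean = Ks @ np.linalg.solve(K, y_train)
    var = 1.0 - np.sum(Ks * np.linalg.solve(K, Ks.T).T, axis=1)
    return mean, np.maximum(var, 0.0)


def plant(u):
    """Hypothetical 'unknown' nonlinear system; the learner only sees outputs."""
    return u + 0.3 * np.sin(2.0 * u)


def learn(n_trials=5, n_samples=30):
    """Run a few learning trials and return the RMS tracking error per trial."""
    t = np.arange(n_samples)
    ref = np.sin(2.0 * np.pi * t / n_samples)   # desired output trajectory
    grid = np.linspace(-1.5, 1.5, 301)          # candidate input values
    u = ref.copy()                              # first guess: input = reference
    u_data = np.empty(0)
    y_data = np.empty(0)
    errors = []
    for _ in range(n_trials):
        # Apply the current feedforward input and measure the output.
        y = plant(u)
        errors.append(float(np.sqrt(np.mean((ref - y) ** 2))))
        # Add the new input/output samples to the training set.
        u_data = np.concatenate([u_data, u])
        y_data = np.concatenate([y_data, y])
        # Refit the GP and pick, for every sample, the candidate input whose
        # predicted output is closest to the reference. The variance term is
        # an assumption of this sketch that discourages extrapolation.
        mean, var = gp_predict(u_data, y_data, grid)
        cost = (ref[:, None] - mean[None, :]) ** 2 + var[None, :]
        u = grid[np.argmin(cost, axis=1)]
    return errors


if __name__ == "__main__":
    for trial, err in enumerate(learn()):
        print(f"trial {trial}: RMS tracking error = {err:.4f}")
```

In this toy setting the tracking error should drop markedly after the first trial, because the initial input already excites the plant over the range the reference requires. A dynamic system, as considered in the paper, would need a model over whole input and output trajectories rather than single samples.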
