Improving robot dual-system motor learning with intrinsically motivated meta-control and latent-space experience imagination
Improving robot dual-system motor learning with intrinsically motivated meta-control and latent-space experience imagination
Combining model-based and model-free learning systems has been shown to improve the sample efficiency of learning to perform complex robotic tasks. However, dual-system approaches fail to consider the reliability of the learned model when it is applied to make multiple-step predictions, resulting in a compounding of prediction errors and performance degradation. In this paper, we present a novel dual-system motor learning approach where a meta-controller arbitrates online between model-based and model-free decisions based on an estimate of the local reliability of the learned model. The reliability estimate is used in computing an intrinsic feedback signal, encouraging actions that lead to data that improves the model. Our approach also integrates arbitration with imagination where a learned latent-space model generates imagined experiences, based on its local reliability, to be used as additional training data. We evaluate our approach against baseline and state-of-the-art methods on learning vision-based robotic grasping in simulation and real world. The results show that our approach outperforms the compared methods and learns near-optimal grasping policies in dense- and sparse-reward environments.
Hafez, Muhammad Burhan
e8c991ab-d800-46f2-abeb-cb169a1ed47e
Weber, Cornelius
4e097e6c-840c-460a-8572-e8759f137e43
Kerzel, Matthias
a7ec71f0-3fa1-4acb-a46b-9198ba76ff14
Wermter, Stefan
80682cc6-4251-420a-af8a-f4d616fb0fcc
4 September 2020
Hafez, Muhammad Burhan
e8c991ab-d800-46f2-abeb-cb169a1ed47e
Weber, Cornelius
4e097e6c-840c-460a-8572-e8759f137e43
Kerzel, Matthias
a7ec71f0-3fa1-4acb-a46b-9198ba76ff14
Wermter, Stefan
80682cc6-4251-420a-af8a-f4d616fb0fcc
Hafez, Muhammad Burhan, Weber, Cornelius, Kerzel, Matthias and Wermter, Stefan
(2020)
Improving robot dual-system motor learning with intrinsically motivated meta-control and latent-space experience imagination.
Robotics and Autonomous Systems, 133.
(doi:10.1016/j.robot.2020.103630).
Abstract
Combining model-based and model-free learning systems has been shown to improve the sample efficiency of learning to perform complex robotic tasks. However, dual-system approaches fail to consider the reliability of the learned model when it is applied to make multiple-step predictions, resulting in a compounding of prediction errors and performance degradation. In this paper, we present a novel dual-system motor learning approach where a meta-controller arbitrates online between model-based and model-free decisions based on an estimate of the local reliability of the learned model. The reliability estimate is used in computing an intrinsic feedback signal, encouraging actions that lead to data that improves the model. Our approach also integrates arbitration with imagination where a learned latent-space model generates imagined experiences, based on its local reliability, to be used as additional training data. We evaluate our approach against baseline and state-of-the-art methods on learning vision-based robotic grasping in simulation and real world. The results show that our approach outperforms the compared methods and learns near-optimal grasping policies in dense- and sparse-reward environments.
Text
main
- Version of Record
More information
Accepted/In Press date: 18 August 2020
Published date: 4 September 2020
Identifiers
Local EPrints ID: 495805
URI: http://eprints.soton.ac.uk/id/eprint/495805
ISSN: 0921-8890
PURE UUID: 1f8effab-e52e-48db-94b3-5301f5faf9e2
Catalogue record
Date deposited: 22 Nov 2024 18:06
Last modified: 23 Nov 2024 03:11
Export record
Altmetrics
Contributors
Author:
Muhammad Burhan Hafez
Author:
Cornelius Weber
Author:
Matthias Kerzel
Author:
Stefan Wermter
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics