Impact makes a sound and sound makes an impact: sound guides representations and explorations
Impact makes a sound and sound makes an impact: sound guides representations and explorations
Sound is one of the most informative and abundant modalities in the real world while being robust to sense without contacts by small and cheap sensors that can be placed on mobile devices. Although deep learning is capable of extracting information from multiple sensory inputs, there has been little use of sound for the control and learning of robotic actions. For unsupervised reinforcement learning, an agent is expected to actively collect experiences and jointly learn representations and policies in a self-supervised way. We build realistic robotic manipulation scenarios with physics-based sound simulation and propose the Intrinsic Sound Curiosity Module (ISCM). The ISCM provides feedback to a reinforcement learner to learn robust representations and to reward a more efficient exploration behavior. We perform experiments with sound enabled during pre-training and disabled during adaptation, and show that representations learned by ISCM outperform the ones by vision-only baselines and pre-trained policies can accelerate the learning process when applied to downstream tasks.
2512-2518
Zhao, Xufeng
ae4f9f4a-4377-4e49-adb7-9efeb4999cc7
Weber, Cornelius
4e097e6c-840c-460a-8572-e8759f137e43
Hafez, Muhammad Burhan
e8c991ab-d800-46f2-abeb-cb169a1ed47e
Wermter, Stefan
80682cc6-4251-420a-af8a-f4d616fb0fcc
26 December 2022
Zhao, Xufeng
ae4f9f4a-4377-4e49-adb7-9efeb4999cc7
Weber, Cornelius
4e097e6c-840c-460a-8572-e8759f137e43
Hafez, Muhammad Burhan
e8c991ab-d800-46f2-abeb-cb169a1ed47e
Wermter, Stefan
80682cc6-4251-420a-af8a-f4d616fb0fcc
Zhao, Xufeng, Weber, Cornelius, Hafez, Muhammad Burhan and Wermter, Stefan
(2022)
Impact makes a sound and sound makes an impact: sound guides representations and explorations.
In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
IEEE.
.
(doi:10.1109/IROS47612.2022.9981510).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Sound is one of the most informative and abundant modalities in the real world while being robust to sense without contacts by small and cheap sensors that can be placed on mobile devices. Although deep learning is capable of extracting information from multiple sensory inputs, there has been little use of sound for the control and learning of robotic actions. For unsupervised reinforcement learning, an agent is expected to actively collect experiences and jointly learn representations and policies in a self-supervised way. We build realistic robotic manipulation scenarios with physics-based sound simulation and propose the Intrinsic Sound Curiosity Module (ISCM). The ISCM provides feedback to a reinforcement learner to learn robust representations and to reward a more efficient exploration behavior. We perform experiments with sound enabled during pre-training and disabled during adaptation, and show that representations learned by ISCM outperform the ones by vision-only baselines and pre-trained policies can accelerate the learning process when applied to downstream tasks.
Text
ISCM
- Accepted Manuscript
More information
Published date: 26 December 2022
Venue - Dates:
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2022, , Kyoto, Japan, 2022-10-23 - 2022-10-27
Identifiers
Local EPrints ID: 496191
URI: http://eprints.soton.ac.uk/id/eprint/496191
PURE UUID: 0b466946-689d-44e1-97ff-6872af7dfc0b
Catalogue record
Date deposited: 06 Dec 2024 17:36
Last modified: 10 Jan 2025 03:18
Export record
Altmetrics
Contributors
Author:
Xufeng Zhao
Author:
Cornelius Weber
Author:
Muhammad Burhan Hafez
Author:
Stefan Wermter
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics