The University of Southampton
University of Southampton Institutional Repository

Impact makes a sound and sound makes an impact: sound guides representations and explorations

Impact makes a sound and sound makes an impact: sound guides representations and explorations
Impact makes a sound and sound makes an impact: sound guides representations and explorations
Sound is one of the most informative and abundant modalities in the real world while being robust to sense without contacts by small and cheap sensors that can be placed on mobile devices. Although deep learning is capable of extracting information from multiple sensory inputs, there has been little use of sound for the control and learning of robotic actions. For unsupervised reinforcement learning, an agent is expected to actively collect experiences and jointly learn representations and policies in a self-supervised way. We build realistic robotic manipulation scenarios with physics-based sound simulation and propose the Intrinsic Sound Curiosity Module (ISCM). The ISCM provides feedback to a reinforcement learner to learn robust representations and to reward a more efficient exploration behavior. We perform experiments with sound enabled during pre-training and disabled during adaptation, and show that representations learned by ISCM outperform the ones by vision-only baselines and pre-trained policies can accelerate the learning process when applied to downstream tasks.
2512-2518
IEEE
Zhao, Xufeng
ae4f9f4a-4377-4e49-adb7-9efeb4999cc7
Weber, Cornelius
4e097e6c-840c-460a-8572-e8759f137e43
Hafez, Muhammad Burhan
e8c991ab-d800-46f2-abeb-cb169a1ed47e
Wermter, Stefan
80682cc6-4251-420a-af8a-f4d616fb0fcc
Zhao, Xufeng
ae4f9f4a-4377-4e49-adb7-9efeb4999cc7
Weber, Cornelius
4e097e6c-840c-460a-8572-e8759f137e43
Hafez, Muhammad Burhan
e8c991ab-d800-46f2-abeb-cb169a1ed47e
Wermter, Stefan
80682cc6-4251-420a-af8a-f4d616fb0fcc

Zhao, Xufeng, Weber, Cornelius, Hafez, Muhammad Burhan and Wermter, Stefan (2022) Impact makes a sound and sound makes an impact: sound guides representations and explorations. In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE. pp. 2512-2518 . (doi:10.1109/IROS47612.2022.9981510).

Record type: Conference or Workshop Item (Paper)

Abstract

Sound is one of the most informative and abundant modalities in the real world while being robust to sense without contacts by small and cheap sensors that can be placed on mobile devices. Although deep learning is capable of extracting information from multiple sensory inputs, there has been little use of sound for the control and learning of robotic actions. For unsupervised reinforcement learning, an agent is expected to actively collect experiences and jointly learn representations and policies in a self-supervised way. We build realistic robotic manipulation scenarios with physics-based sound simulation and propose the Intrinsic Sound Curiosity Module (ISCM). The ISCM provides feedback to a reinforcement learner to learn robust representations and to reward a more efficient exploration behavior. We perform experiments with sound enabled during pre-training and disabled during adaptation, and show that representations learned by ISCM outperform the ones by vision-only baselines and pre-trained policies can accelerate the learning process when applied to downstream tasks.

Text
ISCM - Accepted Manuscript
Download (3MB)

More information

Published date: 26 December 2022
Venue - Dates: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2022, , Kyoto, Japan, 2022-10-23 - 2022-10-27

Identifiers

Local EPrints ID: 496191
URI: http://eprints.soton.ac.uk/id/eprint/496191
PURE UUID: 0b466946-689d-44e1-97ff-6872af7dfc0b
ORCID for Muhammad Burhan Hafez: ORCID iD orcid.org/0000-0003-1670-8962

Catalogue record

Date deposited: 06 Dec 2024 17:36
Last modified: 10 Jan 2025 03:18

Export record

Altmetrics

Contributors

Author: Xufeng Zhao
Author: Cornelius Weber
Author: Muhammad Burhan Hafez ORCID iD
Author: Stefan Wermter

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×