The University of Southampton
University of Southampton Institutional Repository

Beyond the rainbow: high performance deep reinforcement learning on a desktop PC

Beyond the rainbow: high performance deep reinforcement learning on a desktop PC
Beyond the rainbow: high performance deep reinforcement learning on a desktop PC
Rainbow Deep Q-Network (DQN) demonstrated combining multiple independent enhancements could significantly boost a reinforcement learning (RL) agent’s performance. In this paper, we present 'Beyond The Rainbow' (BTR), a novel algorithm that integrates six improvements from across the RL literature to Rainbow DQN, establishing a new state-of-the-art for RL using a desktop PC, with a human-normalized interquartile mean (IQM) of 7.4 on Atari-60. Beyond Atari, we demonstrate BTR's capability to handle complex 3D games, successfully training agents to play Super Mario Galaxy, Mario Kart, and Mortal Kombat with minimal algorithmic changes. Designing BTR with computational efficiency in mind, agents can be trained using a high-end desktop PC on 200 million Atari frames within 12 hours. Additionally, we conduct detailed ablation studies of each component, analyzing the performance and impact using numerous measures. Code is available at https://github.com/VIPTankz/BTR.
Reinforcement Learning, Atari, Control, Vision, deep learning (DL)
Clark, Tyler
cfbf31c0-1d37-4736-9d55-67e71fd3c861
Towers, Mark
18e6acc7-29c4-4d0c-9058-32d180ad4f12
Evers, Christine
93090c84-e984-4cc3-9363-fbf3f3639c4b
Hare, Jonathon
65ba2cda-eaaf-4767-a325-cd845504e5a9
Clark, Tyler
cfbf31c0-1d37-4736-9d55-67e71fd3c861
Towers, Mark
18e6acc7-29c4-4d0c-9058-32d180ad4f12
Evers, Christine
93090c84-e984-4cc3-9363-fbf3f3639c4b
Hare, Jonathon
65ba2cda-eaaf-4767-a325-cd845504e5a9

Clark, Tyler, Towers, Mark, Evers, Christine and Hare, Jonathon (2025) Beyond the rainbow: high performance deep reinforcement learning on a desktop PC. International Conference on Machine Learning 2025, Vancouver, Canada, Vancouver, Canada. 11 - 19 Jul 2025. 28 pp . (In Press)

Record type: Conference or Workshop Item (Paper)

Abstract

Rainbow Deep Q-Network (DQN) demonstrated combining multiple independent enhancements could significantly boost a reinforcement learning (RL) agent’s performance. In this paper, we present 'Beyond The Rainbow' (BTR), a novel algorithm that integrates six improvements from across the RL literature to Rainbow DQN, establishing a new state-of-the-art for RL using a desktop PC, with a human-normalized interquartile mean (IQM) of 7.4 on Atari-60. Beyond Atari, we demonstrate BTR's capability to handle complex 3D games, successfully training agents to play Super Mario Galaxy, Mario Kart, and Mortal Kombat with minimal algorithmic changes. Designing BTR with computational efficiency in mind, agents can be trained using a high-end desktop PC on 200 million Atari frames within 12 hours. Additionally, we conduct detailed ablation studies of each component, analyzing the performance and impact using numerous measures. Code is available at https://github.com/VIPTankz/BTR.

Text
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC - Accepted Manuscript
Available under License Creative Commons Attribution.
Download (4MB)

More information

Accepted/In Press date: 1 May 2025
Venue - Dates: International Conference on Machine Learning 2025, Vancouver, Canada, Vancouver, Canada, 2025-07-11 - 2025-07-19
Keywords: Reinforcement Learning, Atari, Control, Vision, deep learning (DL)

Identifiers

Local EPrints ID: 502630
URI: http://eprints.soton.ac.uk/id/eprint/502630
PURE UUID: f6bfb37a-0b85-480e-9c1c-a26c166b4938
ORCID for Mark Towers: ORCID iD orcid.org/0000-0002-2609-2041
ORCID for Christine Evers: ORCID iD orcid.org/0000-0003-0757-5504
ORCID for Jonathon Hare: ORCID iD orcid.org/0000-0003-2921-4283

Catalogue record

Date deposited: 02 Jul 2025 16:39
Last modified: 22 Aug 2025 02:31

Export record

Contributors

Author: Tyler Clark
Author: Mark Towers ORCID iD
Author: Christine Evers ORCID iD
Author: Jonathon Hare ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×