Beyond the rainbow: high performance deep reinforcement learning on a desktop PC

Rainbow Deep Q-Network (DQN) demonstrated combining multiple independent enhancements could significantly boost a reinforcement learning (RL) agent’s performance. In this paper, we present 'Beyond The Rainbow' (BTR), a novel algorithm that integrates six improvements from across the RL literature to Rainbow DQN, establishing a new state-of-the-art for RL using a desktop PC, with a human-normalized interquartile mean (IQM) of 7.4 on Atari-60. Beyond Atari, we demonstrate BTR's capability to handle complex 3D games, successfully training agents to play Super Mario Galaxy, Mario Kart, and Mortal Kombat with minimal algorithmic changes. Designing BTR with computational efficiency in mind, agents can be trained using a high-end desktop PC on 200 million Atari frames within 12 hours. Additionally, we conduct detailed ablation studies of each component, analyzing the performance and impact using numerous measures. Code is available at https://github.com/VIPTankz/BTR.

Reinforcement Learning, Atari, Control, Vision, deep learning (DL)

Clark, Tyler

cfbf31c0-1d37-4736-9d55-67e71fd3c861

Towers, Mark

18e6acc7-29c4-4d0c-9058-32d180ad4f12

Evers, Christine

93090c84-e984-4cc3-9363-fbf3f3639c4b

Hare, Jonathon

65ba2cda-eaaf-4767-a325-cd845504e5a9

Clark, Tyler

cfbf31c0-1d37-4736-9d55-67e71fd3c861

Towers, Mark

18e6acc7-29c4-4d0c-9058-32d180ad4f12

Evers, Christine

93090c84-e984-4cc3-9363-fbf3f3639c4b

Hare, Jonathon

65ba2cda-eaaf-4767-a325-cd845504e5a9

Clark, Tyler, Towers, Mark, Evers, Christine and Hare, Jonathon (2025) Beyond the rainbow: high performance deep reinforcement learning on a desktop PC. International Conference on Machine Learning 2025, Vancouver, Canada, Vancouver, Canada. 11 - 19 Jul 2025. 28 pp . (In Press)

Record type: Conference or Workshop Item (Paper)

Abstract

Text

Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC - Accepted Manuscript

Available under License Creative Commons Attribution.

Download (4MB)

More information

Accepted/In Press date: 1 May 2025

Venue - Dates: International Conference on Machine Learning 2025, Vancouver, Canada, Vancouver, Canada, 2025-07-11 - 2025-07-19

Keywords: Reinforcement Learning, Atari, Control, Vision, deep learning (DL)

Learn more about the Vision, Learning and Control Learn more about the School of Electronics and Computer Science

Identifiers

Local EPrints ID: 502630

URI: http://eprints.soton.ac.uk/id/eprint/502630

PURE UUID: f6bfb37a-0b85-480e-9c1c-a26c166b4938

ORCID for Mark Towers:

orcid.org/0000-0002-2609-2041

ORCID for Christine Evers:

orcid.org/0000-0003-0757-5504

ORCID for Jonathon Hare:

orcid.org/0000-0003-2921-4283

Catalogue record

Date deposited: 02 Jul 2025 16:39

Last modified: 22 Aug 2025 02:31

Export record

Share this record

Share this on Facebook Share this on Twitter Share this on Weibo

Contributors

Author: Tyler Clark

Author: Mark Towers

Author: Christine Evers

Author: Jonathon Hare

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Library staff additional information