The University of Southampton
University of Southampton Institutional Repository

Open RL benchmark: comprehensive tracked experiments for reinforcement learning

Open RL benchmark: comprehensive tracked experiments for reinforcement learning
Open RL benchmark: comprehensive tracked experiments for reinforcement learning
In many Reinforcement Learning (RL) papers, learning curves are useful indicators to measure the effectiveness of RL algorithms. However, the complete raw data of the learning curves are rarely available. As a result, it is usually necessary to reproduce the experiments from scratch, which can be time-consuming and error-prone. We present Open RL Benchmark, a set of fully tracked RL experiments, including not only the usual data such as episodic return, but also all algorithm-specific and system metrics. Open RL Benchmark is community-driven: anyone can download, use, and contribute to the data. At the time of writing, more than 25,000 runs have been tracked, for a cumulative duration of more than 8 years. Open RL Benchmark covers a wide range of RL libraries and reference implementations. Special care is taken to ensure that each experiment is precisely reproducible by providing not only the full parameters, but also the versions of the dependencies used to generate it. In addition, Open RL Benchmark comes with a command-line interface (CLI) for easy fetching and generating figures to present the results. In this document, we include two case studies to demonstrate the usefulness of Open RL Benchmark in practice. To the best of our knowledge, Open RL Benchmark is the first RL benchmark of its kind, and the authors hope that it will improve and facilitate the work of researchers in the field.
cs.LG
arXiv
Huang, Shengyi
2e4b76dc-ee43-406e-b606-1476d0a76956
Gallouédec, Quentin
a98fd9d7-c253-4287-b214-0e36398b7e2d
Felten, Florian
ebec57c8-d46a-433f-9a04-7d34362a435b
Raffin, Antonin
0fdffa49-e8bd-4349-bf48-bc8112bfbb3c
Dossa, Rousslan Fernand Julien
c1609ef1-e4ff-4f58-816c-96bf77c29d4f
Zhao, Yanxiao
019f1f6e-f15b-4028-a424-4496db99baaa
Sullivan, Ryan
0445d118-b346-4cbf-8906-f796af1c8cd3
Makoviychuk, Viktor
67976095-b6f1-4a11-9a2f-ccd4ad0f94fa
Makoviichuk, Denys
6f5cc543-879a-4914-93b9-8bf71c4883d2
Danesh, Mohamad H.
b021f5cc-6363-4fb6-b20e-615b213523c5
Roumégous, Cyril
e7c6a504-a183-432c-9d3d-1b06e3f8c17f
Weng, Jiayi
d8e192d6-bc79-4c00-9775-0feb924edc1c
Chen, Chufan
fe8653f8-e65e-40d2-84e7-b71d4b1161d5
Rahman, Md Masudur
e18e7868-17cf-45f3-aa90-c5b96707debe
Araújo, João G.M.
771dd0b1-f6ec-4542-9484-f79cb60648cf
Quan, Guorui
475a0a7f-5547-4acf-a540-b9e97185396a
Tan, Daniel
5c244e97-c5d2-4747-ab51-21fcb41e2c8a
Klein, Timo
641d1c76-f74c-4229-a8a6-52fa06f83791
Charakorn, Rujikorn
40643d64-f78e-47cd-a059-3f9a3d3642c4
Towers, Mark
18e6acc7-29c4-4d0c-9058-32d180ad4f12
Berthelot, Yann
180e45c9-bd2e-4682-9a7b-bcb2f1a1342c
Mehta, Kinal
0dbc8375-0497-4b36-9244-1cccca814898
Chakraborty, Dipam
71c62c10-e0ac-4837-a2cf-8bb613eccbb3
Arjun, KG
1259cff7-3214-40bb-9f04-29af4d54f369
Charraut, Valentin
e3612c51-a480-4d5d-a24a-734e0110ab12
Ye, Chang
b26d5517-0198-4440-bb48-f6ea49ce553e
Liu, Zichen
c1c7b438-7c63-4602-9ff3-f974393b1214
Alegre, Lucas N.
e3dc2d97-d80a-494d-a940-3cff955b4beb
Nikulin, Alexander
3f2d8a06-3e01-4fc9-ba03-5b43950fd521
Hu, Xiao
532fd08c-7d5d-494f-8b91-67660f5f767a
Liu, Tianlin
4f877858-4bdf-4f02-80fb-4f96a52f78c8
Choi, Jongwook
94bed326-3ca4-4f02-a53f-0241c3f6386f
Yi, Brent
12f1508c-29cc-4753-99f2-659d037c28c7
Huang, Shengyi
2e4b76dc-ee43-406e-b606-1476d0a76956
Gallouédec, Quentin
a98fd9d7-c253-4287-b214-0e36398b7e2d
Felten, Florian
ebec57c8-d46a-433f-9a04-7d34362a435b
Raffin, Antonin
0fdffa49-e8bd-4349-bf48-bc8112bfbb3c
Dossa, Rousslan Fernand Julien
c1609ef1-e4ff-4f58-816c-96bf77c29d4f
Zhao, Yanxiao
019f1f6e-f15b-4028-a424-4496db99baaa
Sullivan, Ryan
0445d118-b346-4cbf-8906-f796af1c8cd3
Makoviychuk, Viktor
67976095-b6f1-4a11-9a2f-ccd4ad0f94fa
Makoviichuk, Denys
6f5cc543-879a-4914-93b9-8bf71c4883d2
Danesh, Mohamad H.
b021f5cc-6363-4fb6-b20e-615b213523c5
Roumégous, Cyril
e7c6a504-a183-432c-9d3d-1b06e3f8c17f
Weng, Jiayi
d8e192d6-bc79-4c00-9775-0feb924edc1c
Chen, Chufan
fe8653f8-e65e-40d2-84e7-b71d4b1161d5
Rahman, Md Masudur
e18e7868-17cf-45f3-aa90-c5b96707debe
Araújo, João G.M.
771dd0b1-f6ec-4542-9484-f79cb60648cf
Quan, Guorui
475a0a7f-5547-4acf-a540-b9e97185396a
Tan, Daniel
5c244e97-c5d2-4747-ab51-21fcb41e2c8a
Klein, Timo
641d1c76-f74c-4229-a8a6-52fa06f83791
Charakorn, Rujikorn
40643d64-f78e-47cd-a059-3f9a3d3642c4
Towers, Mark
18e6acc7-29c4-4d0c-9058-32d180ad4f12
Berthelot, Yann
180e45c9-bd2e-4682-9a7b-bcb2f1a1342c
Mehta, Kinal
0dbc8375-0497-4b36-9244-1cccca814898
Chakraborty, Dipam
71c62c10-e0ac-4837-a2cf-8bb613eccbb3
Arjun, KG
1259cff7-3214-40bb-9f04-29af4d54f369
Charraut, Valentin
e3612c51-a480-4d5d-a24a-734e0110ab12
Ye, Chang
b26d5517-0198-4440-bb48-f6ea49ce553e
Liu, Zichen
c1c7b438-7c63-4602-9ff3-f974393b1214
Alegre, Lucas N.
e3dc2d97-d80a-494d-a940-3cff955b4beb
Nikulin, Alexander
3f2d8a06-3e01-4fc9-ba03-5b43950fd521
Hu, Xiao
532fd08c-7d5d-494f-8b91-67660f5f767a
Liu, Tianlin
4f877858-4bdf-4f02-80fb-4f96a52f78c8
Choi, Jongwook
94bed326-3ca4-4f02-a53f-0241c3f6386f
Yi, Brent
12f1508c-29cc-4753-99f2-659d037c28c7

[Unknown type: UNSPECIFIED]

Record type: UNSPECIFIED

Abstract

In many Reinforcement Learning (RL) papers, learning curves are useful indicators to measure the effectiveness of RL algorithms. However, the complete raw data of the learning curves are rarely available. As a result, it is usually necessary to reproduce the experiments from scratch, which can be time-consuming and error-prone. We present Open RL Benchmark, a set of fully tracked RL experiments, including not only the usual data such as episodic return, but also all algorithm-specific and system metrics. Open RL Benchmark is community-driven: anyone can download, use, and contribute to the data. At the time of writing, more than 25,000 runs have been tracked, for a cumulative duration of more than 8 years. Open RL Benchmark covers a wide range of RL libraries and reference implementations. Special care is taken to ensure that each experiment is precisely reproducible by providing not only the full parameters, but also the versions of the dependencies used to generate it. In addition, Open RL Benchmark comes with a command-line interface (CLI) for easy fetching and generating figures to present the results. In this document, we include two case studies to demonstrate the usefulness of Open RL Benchmark in practice. To the best of our knowledge, Open RL Benchmark is the first RL benchmark of its kind, and the authors hope that it will improve and facilitate the work of researchers in the field.

Text
2402.03046v1 - Author's Original
Available under License Other.
Download (29MB)

More information

Published date: 5 February 2024
Additional Information: Under review
Keywords: cs.LG

Identifiers

Local EPrints ID: 488075
URI: http://eprints.soton.ac.uk/id/eprint/488075
PURE UUID: 99e41149-01ca-4d2d-8d7d-51484729bfe0
ORCID for Mark Towers: ORCID iD orcid.org/0000-0002-2609-2041

Catalogue record

Date deposited: 14 Mar 2024 18:33
Last modified: 29 May 2024 02:00

Export record

Altmetrics

Contributors

Author: Shengyi Huang
Author: Quentin Gallouédec
Author: Florian Felten
Author: Antonin Raffin
Author: Rousslan Fernand Julien Dossa
Author: Yanxiao Zhao
Author: Ryan Sullivan
Author: Viktor Makoviychuk
Author: Denys Makoviichuk
Author: Mohamad H. Danesh
Author: Cyril Roumégous
Author: Jiayi Weng
Author: Chufan Chen
Author: Md Masudur Rahman
Author: João G.M. Araújo
Author: Guorui Quan
Author: Daniel Tan
Author: Timo Klein
Author: Rujikorn Charakorn
Author: Mark Towers ORCID iD
Author: Yann Berthelot
Author: Kinal Mehta
Author: Dipam Chakraborty
Author: KG Arjun
Author: Valentin Charraut
Author: Chang Ye
Author: Zichen Liu
Author: Lucas N. Alegre
Author: Alexander Nikulin
Author: Xiao Hu
Author: Tianlin Liu
Author: Jongwook Choi
Author: Brent Yi

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×