The University of Southampton
University of Southampton Institutional Repository

Data for 'Safe Reward Learning from Human Preferences and Justifications'

Data for 'Safe Reward Learning from Human Preferences and Justifications'
Data for 'Safe Reward Learning from Human Preferences and Justifications'
This data supports the publication: AUTHORS: Ilias Kazantzidis, Timothy J. Norman, Christopher T. Freeman, Yali Du TITLE: Safe Reward Learning from Human Preferences and Justifications CONFERENCE: International Conference on Agents and Artificial Intelligence (ICAART 2026)
reinforcement learning, human-agent interaction, supervised learning, safe reinforcement learning, human-in-the-loop reinforcement learning, dataset
University of Southampton
Kazantzidis, Ilias
10862613-d212-44fb-980c-ff8c6d0c4a95
Norman, Tim
663e522f-807c-4569-9201-dc141c8eb50d
Du, Yali
0b0d4eef-0820-4753-b384-72db5058df32
Freeman, Chris
ccdd1272-cdc7-43fb-a1bb-b1ef0bdf5815
Kazantzidis, Ilias
10862613-d212-44fb-980c-ff8c6d0c4a95
Norman, Tim
663e522f-807c-4569-9201-dc141c8eb50d
Du, Yali
0b0d4eef-0820-4753-b384-72db5058df32
Freeman, Chris
ccdd1272-cdc7-43fb-a1bb-b1ef0bdf5815

Kazantzidis, Ilias (2025) Data for 'Safe Reward Learning from Human Preferences and Justifications'. University of Southampton doi:10.5258/SOTON/D3749 [Dataset]

Record type: Dataset

Abstract

This data supports the publication: AUTHORS: Ilias Kazantzidis, Timothy J. Norman, Christopher T. Freeman, Yali Du TITLE: Safe Reward Learning from Human Preferences and Justifications CONFERENCE: International Conference on Agents and Artificial Intelligence (ICAART 2026)

Other
data.7z.001 - Dataset
Available under License Creative Commons Attribution.
Download (3GB)
Other
data.7z.002 - Dataset
Available under License Creative Commons Attribution.
Download (1GB)
Archive
models.7z - Model
Download (812MB)
Text
README_DROPJ-DATA.txt - Text
Available under License Creative Commons Attribution.
Download (8kB)

More information

Published date: 2025
Keywords: reinforcement learning, human-agent interaction, supervised learning, safe reinforcement learning, human-in-the-loop reinforcement learning, dataset

Identifiers

Local EPrints ID: 509692
URI: http://eprints.soton.ac.uk/id/eprint/509692
PURE UUID: d912c220-1751-44ce-9aab-8cd3cff9a0d5
ORCID for Ilias Kazantzidis: ORCID iD orcid.org/0000-0002-1127-3843
ORCID for Tim Norman: ORCID iD orcid.org/0000-0002-6387-4034
ORCID for Chris Freeman: ORCID iD orcid.org/0000-0003-0305-9246

Catalogue record

Date deposited: 02 Mar 2026 17:51
Last modified: 06 Mar 2026 03:18

Export record

Altmetrics

Contributors

Creator: Ilias Kazantzidis ORCID iD
Research team head: Tim Norman ORCID iD
Research team head: Yali Du
Research team head: Chris Freeman ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×