University of Southampton Institutional Repository

UoS Data Rescue


Middleton, Stuart (2025) UoS Data Rescue. University of Southampton doi:10.5281/zenodo.15730545 [Dataset]

Record type: Dataset

Abstract

The UoS_Data_Rescue dataset comprises 1,113 historical logbooks with 594,000 annotated text cells, covering challenges such as handwritten entries, aging artifacts, and intricate layouts.

Cite: Singh, L.G. and Middleton, S.E., Tabular Context-aware Optical Character Recognition and Tabular Data Reconstruction for Historical Records, International Journal on Document Analysis and Recognition (IJDAR), 2025.

This record has no associated files available for download.

More information

Published date: 25 June 2025
Keywords: NLP, Machine Learning, Data Rescue, Document Layout Analysis

Identifiers

Local EPrints ID: 502651
URI: http://eprints.soton.ac.uk/id/eprint/502651
PURE UUID: 5dfbeedc-eaf5-443c-8862-c5da7fee8da8
ORCID for Stuart Middleton: orcid.org/0000-0001-8305-8176

Catalogue record

Date deposited: 03 Jul 2025 16:36
Last modified: 04 Jul 2025 01:39

Contributors

Creator: Stuart Middleton
Contributor: Gyanendro Loitongbam
