UoS Data Rescue
UoS Data Rescue
UoS_Data_Rescue Dataset is a dataset of 1,113 historical logbooks with 594,000 annotated text cells, tackling challenges like handwritten entries, aging artifacts, and intricate layouts.
Cite: Singh, L.G. Middleton, S.E. Tabular Context-aware Optical Character Recognition and Tabular Data Reconstruction for Historical Records, International Journal on Document Analysis and Recognition (IJDAR), 2025
NLP, Machine Learning, Data Rescue, Document Layout Analysis
University of Southampton
Middleton, Stuart
404b62ba-d77e-476b-9775-32645b04473f
Loitongbam, Gyanendro
c1d8ea4f-7a54-4c78-8830-3c3064e26ae6
Middleton, Stuart
404b62ba-d77e-476b-9775-32645b04473f
Loitongbam, Gyanendro
c1d8ea4f-7a54-4c78-8830-3c3064e26ae6
Abstract
UoS_Data_Rescue Dataset is a dataset of 1,113 historical logbooks with 594,000 annotated text cells, tackling challenges like handwritten entries, aging artifacts, and intricate layouts.
Cite: Singh, L.G. Middleton, S.E. Tabular Context-aware Optical Character Recognition and Tabular Data Reconstruction for Historical Records, International Journal on Document Analysis and Recognition (IJDAR), 2025
This record has no associated files available for download.
More information
Published date: 25 June 2025
Keywords:
NLP, Machine Learning, Data Rescue, Document Layout Analysis
Identifiers
Local EPrints ID: 502651
URI: http://eprints.soton.ac.uk/id/eprint/502651
PURE UUID: 5dfbeedc-eaf5-443c-8862-c5da7fee8da8
Catalogue record
Date deposited: 03 Jul 2025 16:36
Last modified: 04 Jul 2025 01:39
Export record
Altmetrics
Contributors
Contributor:
Gyanendro Loitongbam
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics