The University of Southampton
University of Southampton Institutional Repository

A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives

A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives
A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives
This paper presents a complete system that historians/archivists can use to digitize whole collections of documents relating to personal information. The system integrates tools and processes that facilitate scanning, image indexing, document (physical and logical) structure definition, document image analysis, recognition, proofreading/correction and semantic tagging. The system is described in the context of different types of typewritten documents relating to prisoners in World-War II concentration camps and is the result of a multinational collaboration under the MEMORIAL project funded (€1.5M) by the European Union (www.memorial-project.info). Results on a representative selection of documents show a significant improvement not only in terms of OCR accuracy but also in terms of overall time/cost involved in converting these documents for digital archives. This work is supported by the European Union grant IST-2001-33441.
Historical document analysis, MEMORIAL
90-101
Springer Verlag
Antonacopoulos, Apostolos
9369bee5-b30f-4d4c-a63d-fe54984578cc
Karatzas, Dimosthenis
4d7e3927-2252-4039-88a4-0daca766e943
Marinai, S
Dengel, A
Antonacopoulos, Apostolos
9369bee5-b30f-4d4c-a63d-fe54984578cc
Karatzas, Dimosthenis
4d7e3927-2252-4039-88a4-0daca766e943
Marinai, S
Dengel, A

Antonacopoulos, Apostolos and Karatzas, Dimosthenis (2004) A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives. In, Marinai, S and Dengel, A (eds.) Document Analysis Systems VI, Springer Lecture Notes in Computer Science, LNCS 3163. Springer Verlag, pp. 90-101.

Record type: Book Section

Abstract

This paper presents a complete system that historians/archivists can use to digitize whole collections of documents relating to personal information. The system integrates tools and processes that facilitate scanning, image indexing, document (physical and logical) structure definition, document image analysis, recognition, proofreading/correction and semantic tagging. The system is described in the context of different types of typewritten documents relating to prisoners in World-War II concentration camps and is the result of a multinational collaboration under the MEMORIAL project funded (€1.5M) by the European Union (www.memorial-project.info). Results on a representative selection of documents show a significant improvement not only in terms of OCR accuracy but also in terms of overall time/cost involved in converting these documents for digital archives. This work is supported by the European Union grant IST-2001-33441.

Text
DAS2004_Antonacopoulos.pdf - Other
Download (3MB)

More information

Published date: 2004
Keywords: Historical document analysis, MEMORIAL
Organisations: Electronics & Computer Science

Identifiers

Local EPrints ID: 263527
URI: http://eprints.soton.ac.uk/id/eprint/263527
PURE UUID: 8f6c2585-3178-40d2-8b33-f07612093996

Catalogue record

Date deposited: 19 Feb 2007
Last modified: 06 Aug 2019 18:59

Export record

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×