The University of Southampton
University of Southampton Institutional Repository

Interoperate with whom? formality, archaeology and the semantic web

Interoperate with whom? formality, archaeology and the semantic web
Interoperate with whom? formality, archaeology and the semantic web
‘Interoperability’ is often cited as a desirable end-goal for information systems, but the highly abstract nature of this apparent benefit sits uneasily with the task-oriented realities of data-management. The approach most frequently advocated is to increase the formality of the system, which facilitates system-integration yet also raises additional barriers to entry that reduce the potential pool of systems to interoperate with.

The Semantic Web initiative in particular has faced accusations that difficulties associated with its adoption can outweigh the perceived benefits of data-sharing. This issue will be discussed in reference to current doctoral research being undertaken in Humanities data integration. It will argue that technologies that either heavily front-load or defer dealing with semantic complexity are unlikely to be viable across the producer spectrum. Recourse to altruistic arguments suggests a tacit acceptance that application of such technologies may not be in the immediate interest of the curator. An approach which offers multiple ‘pay-off points’ is inherently more attractive to potential adopters. In particular, we focus on means by which data-driven microproviders - owners of the small but important datasets that tend to form the ‘long tail’ of academic data in the Humanities - can participate in semantics-driven datasharing.

We propose that (at least) five escalating levels of semantic formalization can be identified, each with differing requirements and benefits for the implementer: i. Literal Standardization, ii. Instance URI generation, iii. Canonical URI mapping, iv. RDF generation, and v. Database-schema-to-Ontology mapping.

We note that Linked Data - hitherto seen as the simplest semantic approach - is relatively advanced in this scheme. We argue that data providers should be encouraged to migrate towards full semantic formalization only as their requirements dictate, rather than all at once. Such an approach acts as both a short and long-term investment in semantic approaches, in turn encouraging increased community engagement.

We also propose that for such processes to be accessible to data-curators with low technical literacy, assistive software must be created to facilitate these steps. We have been developing a prototype package targeted specifically at archaeologists that enables them to produce valid, globally-integrated RDF from unnormalized excavation data with minimal technical knowledge. This takes the form of a Wizard that inspects legacy relational data, and provides predictive mapping and association which users can confirm or amend.

A secondary program uses the resulting output in conjunction with the original data in order to produce valid RDF/XML as desired. Using this software we have demonstrated that archaeological excavation data encoded in a variety of formats, languages and schemas can be successfully integrated by its curators.
Isaksen, Leif
ecb71d6b-bea8-423c-8685-33d4c2658467
Martinez, Kirk
5f711898-20fc-410e-a007-837d8c57cb18
Gibbins, Nicholas
98efd447-4aa7-411c-86d1-955a612eceac
Earl, Graeme
724c73ef-c3dd-4e4f-a7f5-0557e81f8326
Keay, Simon
52b4cdfd-fc5e-4fa0-bd3e-8dd896624f41
Isaksen, Leif
ecb71d6b-bea8-423c-8685-33d4c2658467
Martinez, Kirk
5f711898-20fc-410e-a007-837d8c57cb18
Gibbins, Nicholas
98efd447-4aa7-411c-86d1-955a612eceac
Earl, Graeme
724c73ef-c3dd-4e4f-a7f5-0557e81f8326
Keay, Simon
52b4cdfd-fc5e-4fa0-bd3e-8dd896624f41

Isaksen, Leif, Martinez, Kirk, Gibbins, Nicholas, Earl, Graeme and Keay, Simon (2010) Interoperate with whom? formality, archaeology and the semantic web. Web Science Conference 2010, , Raleigh, United States. 26 - 27 Apr 2010. 1 pp .

Record type: Conference or Workshop Item (Poster)

Abstract

‘Interoperability’ is often cited as a desirable end-goal for information systems, but the highly abstract nature of this apparent benefit sits uneasily with the task-oriented realities of data-management. The approach most frequently advocated is to increase the formality of the system, which facilitates system-integration yet also raises additional barriers to entry that reduce the potential pool of systems to interoperate with.

The Semantic Web initiative in particular has faced accusations that difficulties associated with its adoption can outweigh the perceived benefits of data-sharing. This issue will be discussed in reference to current doctoral research being undertaken in Humanities data integration. It will argue that technologies that either heavily front-load or defer dealing with semantic complexity are unlikely to be viable across the producer spectrum. Recourse to altruistic arguments suggests a tacit acceptance that application of such technologies may not be in the immediate interest of the curator. An approach which offers multiple ‘pay-off points’ is inherently more attractive to potential adopters. In particular, we focus on means by which data-driven microproviders - owners of the small but important datasets that tend to form the ‘long tail’ of academic data in the Humanities - can participate in semantics-driven datasharing.

We propose that (at least) five escalating levels of semantic formalization can be identified, each with differing requirements and benefits for the implementer: i. Literal Standardization, ii. Instance URI generation, iii. Canonical URI mapping, iv. RDF generation, and v. Database-schema-to-Ontology mapping.

We note that Linked Data - hitherto seen as the simplest semantic approach - is relatively advanced in this scheme. We argue that data providers should be encouraged to migrate towards full semantic formalization only as their requirements dictate, rather than all at once. Such an approach acts as both a short and long-term investment in semantic approaches, in turn encouraging increased community engagement.

We also propose that for such processes to be accessible to data-curators with low technical literacy, assistive software must be created to facilitate these steps. We have been developing a prototype package targeted specifically at archaeologists that enables them to produce valid, globally-integrated RDF from unnormalized excavation data with minimal technical knowledge. This takes the form of a Wizard that inspects legacy relational data, and provides predictive mapping and association which users can confirm or amend.

A secondary program uses the resulting output in conjunction with the original data in order to produce valid RDF/XML as desired. Using this software we have demonstrated that archaeological excavation data encoded in a variety of formats, languages and schemas can be successfully integrated by its curators.

Text
WebSciPoster.pdf - Other
Download (2MB)

More information

Published date: April 2010
Venue - Dates: Web Science Conference 2010, , Raleigh, United States, 2010-04-26 - 2010-04-27

Identifiers

Local EPrints ID: 150319
URI: http://eprints.soton.ac.uk/id/eprint/150319
PURE UUID: 3cd803c1-cf0d-477c-934e-fc12977b4c09
ORCID for Kirk Martinez: ORCID iD orcid.org/0000-0003-3859-5700
ORCID for Nicholas Gibbins: ORCID iD orcid.org/0000-0002-6140-9956
ORCID for Graeme Earl: ORCID iD orcid.org/0000-0001-9077-4605

Catalogue record

Date deposited: 05 May 2010 07:56
Last modified: 14 Mar 2024 02:42

Export record

Contributors

Author: Leif Isaksen
Author: Kirk Martinez ORCID iD
Author: Nicholas Gibbins ORCID iD
Author: Graeme Earl ORCID iD
Author: Simon Keay

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×