Adapting integrity enforcement techniques for data reconciliation


Embury, S.M., Brandt, S.M., Robinson, J.S., Sutherland, I., Bisby, F.A., Gray, W.A., Jones, A.C. and White, R.J. (2001) Adapting integrity enforcement techniques for data reconciliation. Information Systems, 26, (8), 657-689. (doi:10.1016/S0306-4379(01)00044-8).

Download

[img] PDF
Restricted to Registered users only

Download (883Kb) | Request a copy

Description/Abstract

Integration of data sources opens up possibilities for new and valuable applications of data that cannot be supported
by the individual sources alone. Unfortunately, many data integration projects are hindered by the inherent
heterogeneities in the sources to be integrated. In particular, differences in the way that real world data is encoded
within sources can cause a range of difficulties, not least of which is that the conflicting semantics may not be recognised
until the integration project is well under way. Once identified, semantic conflicts of this kind are typically dealt with by
configuring a data transformation engine, that can convert incoming data into the form required by the integrated
system. However, determination of a complete and consistent set of data transformations for any given integration task
is far from trivial. In this paper, we explore the potential application of techniques for integrity enforcement in
supporting this process. We describe the design of a data reconciliation tool (LITCHI) based on these techniques that
aims to assist taxonomists in the integration of biodiversity data sets. Our experiences have highlighted several
limitations of integrity enforcement when applied to this real world problem, and we describe how we have overcome
these in the design of our system.

Item Type: Article
ISSNs: 0306-4379 (print)
Related URLs:
Keywords: data integration; data reconciliation; integrity constraints; integrity enforcement; biodiversity information
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QH Natural history > QH301 Biology
Divisions: University Structure - Pre August 2011 > School of Biological Sciences
Item ID: 30261
Date Deposited: 11 May 2006
Last Modified: 28 Jun 2012 10:16
Contributors: Embury, S.M. (Author)
Brandt, S.M. (Author)
Robinson, J.S. (Author)
Sutherland, I. (Author)
Bisby, F.A. (Author)
Gray, W.A. (Author)
Jones, A.C. (Author)
White, R.J. (Author)
Date: December 2001
Status: Published
Contact Email Address: s.m.embury@cs.man.ac.uk
URI: http://eprints.soton.ac.uk/id/eprint/30261

Actions (login required)

View Item View Item