The University of Southampton
University of Southampton Institutional Repository

Tree-sheets and Structured Documents

Tree-sheets and Structured Documents
Tree-sheets and Structured Documents
For data to be used effectively, it needs to be structured and annotated to make it amenable to machine processing. The Web Consortium's XML markup language promises to allow this, and increasing quantities of data are now available in the format. XML's support for namespacing allows information from diverse data sources to be combined and communicated reliably, making XML the keystone of the emerging Semantic Web. Rather than searching the web by giving a number of text strings which pages must contain, as is common now, this semantically enriched web will allow searches on precisely specified concepts, identified by the page's meta-data. Inferencing engines can be applied to this web of formally expressed facts, so that searches return information which has been deduced from the knowledge on the web, without it having been stated explicitly. There is a strong need for tools that provide flexible ways to create and manipulate XML documents. Just as the usefulness of having scientific data available in computer-readable CSV (comma separated values) format is greatly enhanced by the existence of the spreadsheet, so this new XML-based data requires a new general purpose end-user processing tool. XML is a tree-structured format, and we propose in this thesis the tree-sheet: a general purpose end-user tool for the manipulation of tree structured data. In this thesis we present Dome, our implementation of a tree-sheet. Dome is operated through a familiar direct manipulation style interface, with a 'record' mode being used to create programs. Programming features such as looping and conditional execution are also created while editing samples of actual data, making them easier to comprehend. We demonstrate several uses for Dome, including web harvesting, web page creation and visual programming.
Tree-sheets, Web harvesting, Semantic web, Dome, XML, XSLT, MIME
Leonard, Thomas A
24981826-b8c3-4f0e-8dc6-17e0ae24f804
Leonard, Thomas A
24981826-b8c3-4f0e-8dc6-17e0ae24f804

Leonard, Thomas A (2004) Tree-sheets and Structured Documents. University of Southampton, ECS, Doctoral Thesis.

Record type: Thesis (Doctoral)

Abstract

For data to be used effectively, it needs to be structured and annotated to make it amenable to machine processing. The Web Consortium's XML markup language promises to allow this, and increasing quantities of data are now available in the format. XML's support for namespacing allows information from diverse data sources to be combined and communicated reliably, making XML the keystone of the emerging Semantic Web. Rather than searching the web by giving a number of text strings which pages must contain, as is common now, this semantically enriched web will allow searches on precisely specified concepts, identified by the page's meta-data. Inferencing engines can be applied to this web of formally expressed facts, so that searches return information which has been deduced from the knowledge on the web, without it having been stated explicitly. There is a strong need for tools that provide flexible ways to create and manipulate XML documents. Just as the usefulness of having scientific data available in computer-readable CSV (comma separated values) format is greatly enhanced by the existence of the spreadsheet, so this new XML-based data requires a new general purpose end-user processing tool. XML is a tree-structured format, and we propose in this thesis the tree-sheet: a general purpose end-user tool for the manipulation of tree structured data. In this thesis we present Dome, our implementation of a tree-sheet. Dome is operated through a familiar direct manipulation style interface, with a 'record' mode being used to create programs. Programming features such as looping and conditional execution are also created while editing samples of actual data, making them easier to comprehend. We demonstrate several uses for Dome, including web harvesting, web page creation and visual programming.

Other
thesis.ps - Other
Download (5MB)

More information

Published date: July 2004
Keywords: Tree-sheets, Web harvesting, Semantic web, Dome, XML, XSLT, MIME
Organisations: University of Southampton, Electronics & Computer Science, IT Innovation

Identifiers

Local EPrints ID: 259924
URI: http://eprints.soton.ac.uk/id/eprint/259924
PURE UUID: d896d2dd-2d46-40bc-8d29-c4c2b8f2c2cc

Catalogue record

Date deposited: 18 Oct 2004
Last modified: 29 Jan 2020 15:38

Export record

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×