Tree-sheets and Structured Documents
Tree-sheets and Structured Documents
For data to be used effectively, it needs to be structured and annotated to make it amenable to machine processing. The Web Consortium's XML markup language promises to allow this, and increasing quantities of data are now available in the format. XML's support for namespacing allows information from diverse data sources to be combined and communicated reliably, making XML the keystone of the emerging Semantic Web. Rather than searching the web by giving a number of text strings which pages must contain, as is common now, this semantically enriched web will allow searches on precisely specified concepts, identified by the page's meta-data. Inferencing engines can be applied to this web of formally expressed facts, so that searches return information which has been deduced from the knowledge on the web, without it having been stated explicitly. There is a strong need for tools that provide flexible ways to create and manipulate XML documents. Just as the usefulness of having scientific data available in computer-readable CSV (comma separated values) format is greatly enhanced by the existence of the spreadsheet, so this new XML-based data requires a new general purpose end-user processing tool. XML is a tree-structured format, and we propose in this thesis the tree-sheet: a general purpose end-user tool for the manipulation of tree structured data. In this thesis we present Dome, our implementation of a tree-sheet. Dome is operated through a familiar direct manipulation style interface, with a 'record' mode being used to create programs. Programming features such as looping and conditional execution are also created while editing samples of actual data, making them easier to comprehend. We demonstrate several uses for Dome, including web harvesting, web page creation and visual programming.
Tree-sheets, Web harvesting, Semantic web, Dome, XML, XSLT, MIME
Leonard, Thomas A
24981826-b8c3-4f0e-8dc6-17e0ae24f804
July 2004
Leonard, Thomas A
24981826-b8c3-4f0e-8dc6-17e0ae24f804
Leonard, Thomas A
(2004)
Tree-sheets and Structured Documents.
University of Southampton, ECS, Doctoral Thesis.
Record type:
Thesis
(Doctoral)
Abstract
For data to be used effectively, it needs to be structured and annotated to make it amenable to machine processing. The Web Consortium's XML markup language promises to allow this, and increasing quantities of data are now available in the format. XML's support for namespacing allows information from diverse data sources to be combined and communicated reliably, making XML the keystone of the emerging Semantic Web. Rather than searching the web by giving a number of text strings which pages must contain, as is common now, this semantically enriched web will allow searches on precisely specified concepts, identified by the page's meta-data. Inferencing engines can be applied to this web of formally expressed facts, so that searches return information which has been deduced from the knowledge on the web, without it having been stated explicitly. There is a strong need for tools that provide flexible ways to create and manipulate XML documents. Just as the usefulness of having scientific data available in computer-readable CSV (comma separated values) format is greatly enhanced by the existence of the spreadsheet, so this new XML-based data requires a new general purpose end-user processing tool. XML is a tree-structured format, and we propose in this thesis the tree-sheet: a general purpose end-user tool for the manipulation of tree structured data. In this thesis we present Dome, our implementation of a tree-sheet. Dome is operated through a familiar direct manipulation style interface, with a 'record' mode being used to create programs. Programming features such as looping and conditional execution are also created while editing samples of actual data, making them easier to comprehend. We demonstrate several uses for Dome, including web harvesting, web page creation and visual programming.
More information
Published date: July 2004
Keywords:
Tree-sheets, Web harvesting, Semantic web, Dome, XML, XSLT, MIME
Organisations:
University of Southampton, Electronics & Computer Science, IT Innovation
Identifiers
Local EPrints ID: 259924
URI: http://eprints.soton.ac.uk/id/eprint/259924
PURE UUID: d896d2dd-2d46-40bc-8d29-c4c2b8f2c2cc
Catalogue record
Date deposited: 18 Oct 2004
Last modified: 14 Mar 2024 06:29
Export record
Contributors
Author:
Thomas A Leonard
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics