N. D. Beitner, W. Hall, C. A. Goble
Department of Electronics and Computer Science
University of Southampton
Southampton SO17 1BJ, UK
e-mail: {ndb88r, wh}@ecs.soton.ac.uk
Department of Computer Science
University of Manchester
Oxford Road
Manchester M13 9PL, UK
e-mail:carole@cs.man.ac.uk
The paper describes the design of Microcosm++, an object oriented extensible service-based architecture for building consistent integrated hypermedia systems. It is based on the Microcosm hypermedia system which was developed at Southampton. The current implementation of Microcosm++ demonstrates the flexibility of object-based services for making hypermedia more viable in a working environment. The approach described reduces authoring effort significantly while at the same time increasing the integrity of the link structures and providing a unified model for media integration.
KEYWORDS: Open hypermedia systems, Linking mechanisms, Multimedia, Object oriented services.
This paper discusses the issues involved in putting the media into a hypermedia system. The thrust of the paper is that to-date media representation and underlying link strategies have been too closely tied together in the move from hypertext to hypermedia. We argue that it is necessary to separate the issues of media from link structure, and we present a model which solves some of the problems of genuine media integration in a hypermedia system. At the same time this model provides scope for a new linking architecture that provides support for the creation of links between data of different media types in a conceptually meaningful way. Microcosm++ is an object oriented extensible service-based architecture for building consistent integrated hypermedia systems.
The paper begins with a short summary of related work and discusses the shortcomings of current models in respect to media integration, including our own hypermedia system Microcosm. It describes work we have undertaken within Microcosm which has resulted in ideas for a new unified model for media integration. This model is presented and discussed in the final section of the paper.
In the current literature there is little discussion of what it involves to define a link from an object in a non-text document to objects in other documents, other than to say that there is an association between the objects identified. Few projects have considered the problems of authoring support for the creation of links between different media types, nor issues of support for users in effective navigation of an information space, particularly with respect to temporal media such as video and sound. In this section we discuss some of the projects that have attempted to address these issues.
Although Intermedia puts much emphasis on providing a consistent interface to links and anchors across media, there are a number of limitations. It provides a limited anchor model for dealing with links from temporal media, buttons in temporal media are not similar to those seen in still images and can only represent a temporal segment of the whole image shown. In effect at the appropriate moment in a sequence the whole video window becomes a single anchor. When the scope of the link expires the viewing window ceases to be active. The context of such links is very restrictive and fails to allow the user to distinguish between the available links during an active temporal segment.
While Intermedia has a high level of functionality including many useful navigation tools, the system as a whole suffers from inconsistencies at the media integration level due to the initial lack of consideration for future multimedia support.
The project involved experimentation in new areas of media interaction, and developed ideas that if applied in real hypermedia systems would do much to support and complement existing models. The Elastic Charles project (5) is a demonstration of the 'elastic' tools that are discussed here. The aim was to build a hypermedia journal - "The Elastic Charles" - as a hypermedia edition of the hypothetical "American River Magazine". The magazine was authored by fifteen students and contained stories focusing on the history, ecology, recreation, and construction relating to the Charles River. Most of the stories had a strong video component which was augmented by text, graphics and sound.
The work on the Elastic Charles was the first to really consider the place of video in hypermedia and its impact. Where a hypertext system is completely user driven, the addition of dynamic media causes many problems with links and user response. Brøndmo used the concept of micons (motion, moving or movie icons) to represent links to video inside source documents.
The traditional icon is a pictorial representation of an entity. The micon works in much the same way except that it is animated. Brøndmo used micons to represent links to video. A small section of the destination video is digitised and played in a loop; it is used to indicate to the user that there is a link to video and also to provide some information about the sequence before it is viewed. This mechanism is more informative than a text description alone.
The problems of links from temporal media are tackled by generating a list of links while the temporal medium is being viewed: that way if a link is missed the user can follow it at a later date. Links to video pop up as micons at the appropriate moment, and when the link becomes invalid, in a temporal sense, it is removed. This model is superior to that of Intermedia, as it distinguishes between the different link anchors. Links do not have to be merged into one anchor, and users are able to select an anchor of interest when multiple anchors appear simultaneously. The use of micons to represent links to video allows users to make an informed choice about which links to follow. It is evident that this extension of icon representation is beneficial to a user interface within a multimedia system. At the time of the Elastic Charles project, Brøndmo only made use of micons to represent links from video to video: it is clear that there is wider scope for application. Micons do not allow authors to make links from particular entities within a temporal sequence so they do not correspond to the metaphor of link following from a text object used in text based systems.
In summary micons preview the destination document from the source document, improving user navigational behaviour. Their down side is that they interfere significantly with the source document and are restricted in the types of link and the number of destinations they can represent.
This work is closely related to recent research in the area of specifying and delivering synchronised multimedia documents e.g. Buchanan (6) and Newcomb (21). It is a very important issue in the development of multimedia systems, and does tackle the issues of extensible hypermedia in terms of integration of multimedia with different link strategies.
In this paper we are more concerned with techniques for link creation and link following between different multimedia documents rather than within composite documents.
In the next section of the paper we describe an open hypermedia system that has been developed at the University of Southampton and discuss ways in which we have experimented with integrating multimedia information into this system.
Microcosm allows any number of customisable actions to be taken on any selected item of interest, resulting in a system that supports more than simply clicking on buttons to follow links. In Microcosm the user selects the item of interest and then chooses an action to take. A button in Microcosm is simply a binding of a specific selection and a particular action. Each of the actions/functions is provided by a filter and new filters can be added to extend the systems functionality (15).
In addition to traditional explicit link authoring, the specification of active regions of a document, a particular feature of Microcosm is the facility for implicit anchor specifications for text, known as generic links. A generic link is a generalisation of a text anchor specification. Instead of an anchor being described using some form of co-ordinate system, the source anchor is encoded using the textual representation of the object bounded by the anchor. Depending on the constraints associated with the generic link any occurrence of the same text selection in other text documents automatically becomes an anchor for that link.
Explicit anchor definitions can be used for button definitions or hidden anchors. The former presents itself to the user and is easily used, the latter are not automatically shown. This invisible anchor does not interfere with the presentation of a document, but users are not easily aware of their existence. Generic links are not bound to a specific document, and by their unconstrained nature have this disadvantage. They allow a small number of links to be available from a large number of locations, but users must actively search for them.
Generic links allow the rapid incorporation of new material, since any new document immediately provides access to all those generic links whose anchors it contains. However, the new document does not become the destination of any links until explicitly specified as such by the author. The use of dynamic destination anchor resolution functions, like full text retrieval (18) or document attribute keying, allows predefined links to return newly added documents with only minimal extra authoring, and reduces the need to define more explicit or implicit links.
As far as interface behaviour and link structure is concerned, Microcosm provides exceptional explicit link support in many media including, text selections, regions in still images, dynamic moving buttons in video (2), and temporal anchors in sound documents (11). It is our belief that in these terms Microcosm is one of the most consistent systems available.
Nevertheless, as with many other hypermedia systems, Microcosm was built upon a fundamentally text-oriented architecture. In its present form it presents a number of restrictive practices that impede the development of advanced link services across the range of available media.
While the document viewers allow multimedia interfaces, the navigation and authoring tools provided within the Microcosm framework have a predominantly textual look and feel. The underlying link representation and storage is also text based. For example, the link database encodes anchors in terms of text selection and offset, so all other representations must be converted to a textual representation which is encoded using these two terms. This leads to obvious restrictions when considering other media types.
While creative implementation can allow latitude within these storage requirements, the interface issues in the current model are not so easily circumvented.
During the process of improving the user interface, extending generic link functionality and the consideration of generic links in media other than text, it became evident that the architecture could no longer stand the constant 'creeping featurism' that has typified the development of hypermedia to date.
Consequently an object oriented support services approach has been developed to allow separation of hypermedia issues from the technical problems of providing specific support for media. The remainder of this paper discusses re-engineering of the Microcosm architecture to allow object oriented support services and an improved interface for media interaction.
In order to indicate the location of generic links in text documents, the link database must be aware of the media involved, be able to load the current document, and search through the document for valid occurrences of an anchor within it. One of the first tasks of object support services was to allow the link database, or linkbase, to compare the link specification index against a document and derive a set of "explicit" anchors from the generic list for the text viewer to show as buttons (Fig 2b). This removes the requirement for a user to actively select a block of text and query the link base for an anchor match. Once generic link specifications can be used to describe buttons their value as an authoring tool becomes much more significant.
Object services allow the linkbase to separate the techniques for matching generic link specifications from the basic linkbase implementation. Depending on the document being viewed an object service provider matches link specifications against the document and returns a set of buttons.
In order to provide such a service, an API for anchor specific functions is defined by the developer of the linkbase, and these are implemented in the object support services hierarchy (see Fig. 1). The functions are geared around providing the following services.
As well as using object services for providing extensible media support for tools to use, tools can also provide object services to cater for extension or replacement of shared components of the system. A link is the unification of a source anchor specification, which is link database specific, and some resolution which is APS component specific. The link creation tool needs to be able to cater for various anchor types, linkbase specific requirements and resolution specific requirements. As such the linker requires the object services hierarchy to provide the following support:
Fig 1. Shows how different aspects of Microcosm tools may be integrated in the OSH.
Fig 1. A typical object services hierarchy (OSH)
Document viewers (e.g. Fig. 2a & 2b) may use the objects services to render the document objects, thus allowing a single common application to handle rendition of a large number of specific data types of a basic media subset, such as general still image and video support. A viewer may also use them to encode selection objects for transfer through the Microcosm communication system.
Fig 2a. Arbitrary selection from a still image.
Fig 2b. Arbitrary selection from a text document.
The link generator (Fig. 3) uses the OSH to render anchors for consistent user interface behaviour, and also uses the OSH to generate the appropriate source anchor specifications for a given linkbase as well as to generate the resolution to be evaluated by an APS component when the link is eventually followed.
Figures 2a, 2b, 3 & 4 show the interface consistency that occurs as a result of having the OSH for rendition. Each of the services is dynamic and can be changed while the applications are running. When a new media rendition service is added, say for a third party application, the anchor objects can automatically be visualised in the navigation and authoring tools. An audio service provider is next to be developed, and without any extra work to the link generator, anchor visualisations for sound will be available.
Fig 3. Anchor collection in the link generator tool.
Finally the linkbase uses object services to provide anchor specification support for media. Interfaces for specific link database implementations are implemented at the OSH level and the linker uses those services to define the anchor specification that will eventually be stored in the link database. The linkbase service objects use the OSH to provide specific media support for the overall linkbase service object.
In a similar manner to the linker, the Microcosm++ available resolutions interface (Fig. 4) presents a list of destination documents as a result of a link following action or selection action process. Whereas the linker attempts to render the exact selection made in defining an anchor, the available resolutions interface provides document abstraction.
Fig 4. Available resolutions with document abstraction
Figure 4 shows abstraction for three documents. The first is plain text, has no layout and hence the destination selection is shown as the abstraction. The second document is a still image; the resulting abstract is derived at run time from the original document. The final abstract is for a piece of digital video, and a predefined Micon is used. Limited examples of document thumbnails can be seen in some of the more advanced multimedia database browsing systems (22, 8).
The scope for document abstraction is endless, and issues regarding text and audio are currently being considered. In cases where text documents have layout, a thumbnail of the destination page, or the document heading, may provide useful navigational clues. In sound documents various techniques can be used depending on the type of sound involved. A visual music score may be used to represent instrumental music, whereas a wave form could be used to represent other arbitrary noise. Then there is also the case of non visual representations for the documents being abstracted e.g. Gaver (10), Hardman (3).
The various features available, and the media that they are available for can easily be extended or modified by alteration to the OSH. Further development of abstraction techniques does not require changes to the available resolutions application. Likewise the inclusion of generic specification techniques does not affect the implementation of the link database, and matching algorithms are implemented at the OSH level.
Additionally, for some media not all services are currently supported, and the OSH utilises the most relevant technique available to service any object request. For a text type object such as RTF, a plain text representation is also included to allow services that only understant ASCII text to work with RTF objects.
Fig 5. Explicit authoring of a small hypermedia network.
Fig 6. Implicit authoring using conceptual relations.
Earlier in this document there was brief discussion about support for implicit authoring strategies across media. Links are physical rather than conceptual; linking from instantiations of an object in both text and image anchors to a destination requires two separate and independent links. There is currently no direct correlation between an implicit specification for an anchor in one medium and an anchor in another.
Systems like Hyperset (24) and Strathtutor (19), provide an interesting approach to link authoring which holds some useful strategies that may provide the solution to consistent authoring across media. Both these systems link at a document to document level, and use keywords or attributes to imply binding between documents. Working at the document level allows a degree of simplification that bypasses the media issues. Mechanisms like keyword association are used to cluster documents together. From any single document you can move to a set of similar documents that share common attributes.
Microcosm++ gives us an extensible platform to take these initail ideas and with the OSH extend them to provide conceptual content oriented links which are specified independently of any media. Whereas these systems are document based links, those in Microcosm++ are specified at any level of granularity that the appropriate media can support.
The OSH can be used to move from a media dependent representation space into the information domain where it then becomes possible to apply new types of link architecture.
Link abstraction has been moved to the higher level of concept to concept authoring. This opens up a completely new area of hypermedia, allowing integration of previously separate research topics e.g. semantic authoring (20), automatic document clustering techniques (25, 17, 16), and knowledge based systems, in a modular and unobtrusive manner.
Figures 5 & 6 show the structural difference, and reduction in authoring effort, between using explicit links and implicit links created at a conceptual level to describe relationships between documents. The structure in Figure 6 has at least the functionality of Figure 5 but is also future proofed and contains many more links than can be derived from the explicit definition. Conceptual links in Microcosm++ are still created manually, but are an expression in terms of the authors perspective and are independant of the physical content and data representation in the information space.
The scope for link abstraction made possible by the Object Services Hierarchy leads to a consistent and uniform management of inter-media relationships. This promotes links to a status independent of the media and the documents involved, allowing minimal authoring to describe maximum connectivity of data.
2 Beitner, N.D. "Media support in Hypermedia", Mini-Thesis, Department of Electronics and Computer Science, University of Southampton, 12-29, July 1993.
3 Brewster, S.A., Write, P.C. & Edwards, A.D.N. "An evaluation of earcons for use in auditory human-computer interfaces", In Human Factors in Computing Systems - INTERCHI'93 Conference Proceedings, 222-227, April 1993.
4 Brøndmo, H.P. "The elastic tools: a descriptive overview", Media Lab Working Paper, MIT Media Laboratory, Cambridge Massachusetts, 1989.
5 Brøndmo, H.P. & Davenport, G. "Creating and Viewing the Elastic Charles - a Hypermedia Journal", In McAleese, R & Green, C. eds. Hypertext, State of the Art. Intellect Ltd, 1990.
6 Buchanan, M.C. & Zellweger, P.T. "Specifying Temporal Behaviour in Hypermedia Documents", In: D. Lucarella, J. Nanard, M. Nanard, P. Paolini. eds. The Proceedings of the ACM Conference on Hypertext, ECHT '92 Milano, ACM. 262-271, November 1992.
7 Davis, H.C., Hall, W., Heath, I., Hill, G. & Wilkins, R. "Towards an Integrated Information Environment with Open Hypermedia Systems", In: D. Lucarella, J. Nanard, M. Nanard, P. Paolini. eds. The Proceedings of the ACM Conference on Hypertext, ECHT '92 Milano, ACM. 181-190, November 1992
8 Egido, C. & Patterson, J. "Pictures and category labels as navigational aids for catalogue browsing", In Proceedings of the ACM - CHI'88 (Human Factors in Computing Systems), 127-132, May 1988.
9 Fountain, A.M., Hall, W., Heath, I. & Davis, H.C. "MICROCOSM: An Open
Model for Hypermedia With Dynamic Linking", In: A. Rizk, N. Streitz and
J. Andre eds. Hypertext: Concepts, Systems and Applications. The
Proceedings of The European Conference on Hypertext, INRIA, France.
Cambridge University Press, 298-311, November 1990
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27