Hugh Davis, Wendy Hall and Ian Heath
(c) University of Southampton
Multimedia facilities are now becoming common-place on desk top PC systems. In order for such facilities to be of use to users it is important that software seamlessly integrates the different media types. Hypermedia is a technology that can enable users to browse through a corpus of multimedia information, and to make and follow links within this information. At the University of Southampton we have developed an open hypermedia system known as Microcosm, which provides a link service across a range of applications and media types.
This paper describes the facilities available when dealing with textual information within Microcosm, and then examines the key issues which limit the extent to which these facilities may be extended to all media types.
Contents
Typical closed hypermedia systems hold the information about the links as mark-up within the data itself in much the same way that a word processor usually holds its formatting information as hidden mark-up within the textual data. The problem with this approach is that one has to commit ones data to the hypermedia program: once link data has been embedded in the user data, it will no longer be possible to view or edit the data using the original application that created it. Consequently such hypermedia systems tend to deal with static data, and thus have limited application areas.
The latest generation of hypermedia systems hold the information about links separately. This approach has the advantages that:
It is possible to have links that have anchors in read only material, such as CD-ROM or videodisc.One such hypermedia system is Microcosm (Davis et al., 1992, Fountain et al., 1990) which has been developed at the University of Southampton, and is currently in use in a range of application areas, including delivery of resource based educational materials, technical documentation and geographic/urban information systems. This paper describes the features of this system, and explores the issues that limit the ability to apply all the functionality to different media types.
It is possible to have links into data, without putting mark-up into that data. This feature is important as it allows links to be made into and out of data that is created and displayed by a third party applications, removing the necessity for the hypermedia designer to duplicate their functionality.
It is possible to manipulate and process the complete set of links. This is difficult if the link information is distributed over all the data files.
It is possible to have multiple sets of links covering the same set of data. These different sets may be installed, joined and removed as required, making it possible to have different user views of a given data set.
Figure 1: The Microcosm Model
Microcosm allows a number of different actions to be taken on any selected item of interest, so consequently use of the system involves more than simply clicking on buttons to follow links. In Microcosm the user selects the item of interest (e.g. a piece of text) and then chooses an action to take. We may see this as selecting an object then sending it a message. A button in Microcosm is simply a binding of a specific selection and a particular action. A particular feature of Microcosm is the ability to generalise source anchors. In most hypertext systems the source anchor for any link is fixed at a particular point in the text. In Microcosm it is possible for the author to specify three levels of generality of link sources.
1) The generic link. The user will be able to follow the link after selecting the given anchor at any point in any document.Generic links are of considerable benefit to the author in that a new document may be created and immediately have access to all the generic links that have been defined for the system.
2) The local link. The user will be able to follow the link after selecting the given anchor at any point in the current document.
3) The specific link. The user will be able to follow the link only after selecting the anchor at a specific location in the current document. Specific links may be made into buttons.
The basic Microcosm processes are viewers and filters.
The task of the viewer is to allow the user to peruse the document, to make selections and to choose actions. Typical actions are follow link, make link and complete link (where links may be to processes as well as to documents). The actions themselves are not effected by the viewer. The viewer is responsible for binding the information into a message, which is sent on to the filter chain where it will look for one or more processes that can satisfy this request. Any Windows application might be used as a viewer, with the proviso that it is possible to select objects, and either communicate an object to the Dynamic Data Exchange (DDE) or copy it to the clipboard.
A major strength of Microcosm is its ability to integrate other applications. In fact Microcosm may be seen as an umbrella environment, allowing the user to make links from documents in one application package to documents in another application package.
Figure 2: Microcosm as an environment for integrating applications and tools.
Link Databases
Link Databases hold all the information referring to links. More than
one database may be installed at a time. When new links are created they are
inserted into the first link database in the chain. This makes it possible to
have a concept of public and private databases. The public database may
contain all the links made, say, by the original author, and private databases
may contain links made by individual users. This ensures that private
annotations and links do not become absorbed into the view of the system, as
seen by other users.
Compute Links
Sometimes no links have been defined for a particular subject. On these
occasions it is desirable to offer the user some further assistance. Microcosm
has a facility that allows a user to batch a set of text files and to index
these documents (Li et al., 1992). Once this indexing has been done a block of
text may be selected and the action, compute link, may be chosen. The system
will very rapidly return a number of other documents within the system that
have a similar vocabulary to the selected block, in the order of best match.
Clearly this filter is only able to identify links to text documents.
Consistency across interfaces to different media types is very important in providing the user with an integrated information environment. This will be particularly important when the information is created and used by groups of people over a network. We have a large Teaching and Learning Technology Project (TLTP) running at Southampton to create a central database of multimedia resource material to be delivered across the campus network for use by staff and students. Microcosm is the software platform that will provide the mechanism for the delivery of this material, integrated with various database and information retrieval technologies. Issues of media integration are therefore central to our development plan. Our approach to some of these issues is described below.
A specific issue concerns the problem of storing and transferring files, particularly sound and video. In the "ideal" environment all users would sit at terminals which were connected to a CD player and a videodisc player. However cost prohibits such a solution, and anyway in an open resource based system, the users would spend all their time swapping disks rather than moving effortlessly from one piece of data to another as preferred. Some projects have attempted to circumvent this problem by installing "juke-boxes" and broad band cable networks for delivering such media (e.g. Applebaum, 1989), but these solutions are costly, and anyway not commonly available. For these reasons we have moved towards the use of software digital video and sound. Such files tend to be very large by current norms. E.g. one minute of compressed 320 by 200 video might typically require 3 megabytes of storage and one minute of reasonable quality digitised sound might require 1 megabyte. This presents problems since such files quickly become too large to be held in the memory of current machines so they must be played from secondary storage. When the secondary storage is the local hard disk or CD-ROM, the delivery rate will vary depending upon the transfer rate available from the disk, so it is not possible to define a standard quality, and when the secondary storage is a fileserver, transfer rates will depend upon network loading.
Wherever large data files are stored there will be a time delay in following a link to such data. We have used the idea of Micons which are abstracts of video files, as introduced in (Brøndmo and Davenport, 1990). Whenever a link is followed to a destination which is video, the user is shown a small bitmap picture, or a few frames of video playing in a loop in such a way as to convey sufficient information about the intended video, so that the user is able to decide whether the cost of following the link through is appropriate. This also provides a minimum fallback situation in the case where the intended link is to a CD-ROM or videodisc which is unavailable. In the case of audio destination files the best we have been able to do is to offer textual abstracts to describe the intended file.
1. Keywords may be associated with all data items, and then all the normal Boolean logic operations may be applied to retrieve sets of documents that meet particular criteria. This method depends upon the skill and care with which keywords are attached to documents.A further problem involves extending generic links to work with source anchors in media other than text. A text string makes a sensible anchor for a generic link, as a given string is identical from whatever place it is selected, and by using a thesaurus it is possible to follow links from synonyms. However what would be the equivalent of a generic link from say a picture or a video? Ideally what one would wish is to be able to say, for example, whenever a user clicks on a picture of a particular person, follow a link to a biography of that person. However, such a scheme would require the software to have much greater intelligence than is currently achieved.2. All documents are filed in directories, and then given textual descriptions. In just the same way as one routinely navigates any hierarchical file system, it is possible to search for files in Microcosm. This method depends upon the author providing an appropriate description and upon the file structure being carefully pre-organised.
3. In the case of text documents only, it is possible to use the computed linker. This is a text retrieval system, described earlier, which enables the user to query the system based upon statistical similarity between given documents and the query: it is a highly successful method but is not extensible to other types of media.
Various solutions are being investigated within Microcosm. One solution is to associate documents together in "compound documents". Such documents consist of text automatically linked to other media, in such a way that whenever a text file is offered as a destination of a link or any other search, the other linked documents will also be offered as well. Thus, for example, one could discover a biography of a particular character by following a computed link, and then be offered video, pictures and audio recordings of this character at the same time. We are also starting some research in the area of generic links within pictures, by investigating the possibility of identifying images from within a limited domain within a picture. As an example, ordnance survey maps are created from a finite set of regular symbols, and it should be possible to apply image processing techniques in real time to identify the selected symbol and to follow a generic link to information about that symbol.
Another research area involves buttons in moving video. At present the author creates an active area over the first frame containing the object on which the button is to be placed, then plays the video, while manually dragging the active area along on top of the object. The details of the active area are then saved so that they can be replayed with the video. However, image tracking algorithms are now quite sophisticated (Dobie & Lewis, 1992) and it should be possible to provide batch tools for the author to create moving buttons automatically.
There has been much recent research in the area of specifying and delivering synchronised multimedia documents (Bulterman et al., 1991., Buchanan and Zellweger, 1992., HyTime, 1992., Hardman et al., 1993., Ramanathan & Venkat Rangan, 1993). However, the bottom line is that synchronisation between separate processes can only be achieved if appropriate services are provided at operating system level. Currently the Windows Media Control Interface (MCI) does not provide such facilities, unless a single hub program is given control of all media: this seems inconsistent with Microcosm's open approach.
The ability to provide synchronisation between separate processes in an open system is clearly essential. Examples that we have been involved with include
Playing alternative commentaries for a piece of video.It has been possible to achieve a rough and ready solution to these problems, but proper synchronisation facilities would be preferred. The examples above indicate the range of media types which might need to be synchronised: the combinations are endless, and a solution which involves a single application to deliver all the different media is necessarily limited.
Playing Music beside musical score (preferably with a "now point" in the music).
Displaying help text at various points in a user driven simulation.
Brøndmo, H.P. & Davenport, G. (1990). Creating and Viewing the Elastic Charles - a Hypermedia Journal. in McAleese, R & Green, C. eds. Hypertext, State of the Art. Intellect Ltd.
Buchanan, M.C. & Zellweger, P.T. (1992). Specifying Temporal Behavior in Hypermedia Documents. In: D. Lucarella, J. Nanard, M. Nanard, P. Paolini. eds. The Proceedings of the ACM Conference on Hypertext, ECHT '92 Milano, ACM. 262-271.
Bulterman, D.C.A., van Rossum, G. & van Liere, R. (1991). A structure for Transportable, Dynamic Multimedia Documents. in: The Proceedings of USENIX 1991.
Davis, H.C., Hall, W., Heath, I., Hill, G. & Wilkins, R. (1992) . Towards an Integrated Information Environment with Open Hypermedia Systems. In: D. Lucarella, J. Nanard, M. Nanard, P. Paolini. eds. The Proceedings of the ACM Conference on Hypertext, ECHT '92 Milano, ACM. 181-190.
Dobie, M.R & Lewis, P.H. (1992). Object Tracking in Multimedia Systems. in: The Proceedings of the 4th IEE Conference on Image Processing and its Applications, Maastricht. pp 41-44
Fountain, A.M., Hall, W., Heath, I. & Davis, H.C. (1990). MICROCOSM: An Open Model for Hypermedia With Dynamic Linking, in A. Rizk, N. Streitz and J. Andre eds. Hypertext: Concepts, Systems and Applications. The Proceedings of The European Conference on Hypertext, INRIA, France. Cambridge University Press.
Gevers, T. & Smeulders, A.W.M. (1991). Indexing of Images by Pictorial Information. in: IFIP WG 2.6, 2nd Working Conference on Visual Database Systems, Budapest Hungary.
Hardman, L., Bulterman, D.C.A. & van Rossum, G. (1993). The Amsterdam Hypermedia Model: Extending Hypertext to Support Real Multimedia. Hypermedia. (In Press).
HyperODA (1992). ISO/IEC JTC1/SC18/WG3 N1898. HyperODA - a Working Draft for Extending ODA Standards to Support Hypermedia Applications
H yTime (1992). ISO/IEC 10744. Hypermedia/Time-based Structuring Language.
Li, Z., Davis, H.C. & Hall, W. (1992). Hypermedia Links and Information Retrieval. The Proceedings of the 14th British Computer Society Research Colloquium on Information Retrieval, Lancaster University.
MHEG (1992). ISO/IEC JTC1/SC29/WG12/NO26. The MHEG Standard and its Relation with the Multimedia and Hypermedia Area.
O'Docherty, M.H. & Daskalakis, C.N. (1991). Multimedia Information Systems - The Management and Semantic Retrieval of all Electronic Data Types. The Computer Journal 34(3), 225-238.
Orlandic, R. (1992). Problems of Content-Based Retrieval in Image Databases. in: Proc. 3rd Symposium on "New Generation" Knowledge Engineering, IAKE '92, Washington D.C.
Ramanathan, S. & Venkat Rangan, P. (1993). Feedback Techniques for Intra-Media Continuity and Inter-Media Synchronisation in Distributed Multimedia Systems. Computer Journal 36(1) 19-31.
Hugh Davis, Wendy Hall and Ian Heath
Image and Media Laboratory
Department of Electronics and Computer Science
University of Southampton
Southampton SO9 5NH
Contact: Hugh Davis (hcd@uk.ac.soton.ecs) Tel. 0703
593669