University of Southampton Institutional Repository

Information extraction from multimedia web documents: an open-source platform and testbed

Dupplaw, David, Matthews, Michael, Johansson, Richard, Boato, Giulia, Costanzo, Andrea, Fontani, Marco, Minack, Enrico, Demidova, Elena, Blanco, Roi, Griffiths, Thomas, Lewis, Paul H., Hare, Jonathon and Moschitti, Alessandro (2014) Information extraction from multimedia web documents: an open-source platform and testbed. International Journal of Multimedia Information Retrieval, 3 (2), pp. 97-111. (doi:10.1007/s13735-014-0051-2).

Record type: Article


The LivingKnowledge project aimed to enhance the current state of the art in search, retrieval and knowledge management on the web by advancing the use of sentiment and opinion analysis within multimedia applications. To achieve this aim, a diverse set of novel and complementary analysis techniques has been integrated into a single but extensible software platform on which such applications can be built. The platform combines state-of-the-art techniques for extracting facts, opinions and sentiment from multimedia documents and, unlike earlier platforms, it exploits both visual and textual techniques to support multimedia information retrieval. Foreseeing the usefulness of this software in the wider community, the platform has been made generally available as an open-source project. This paper describes the platform design, gives an overview of the analysis algorithms integrated into the system and describes two applications that utilise the system for multimedia information retrieval.

More information

Accepted/In Press date: 20 February 2014
e-pub ahead of print date: 21 March 2014
Published date: June 2014
Keywords: multimedia retrieval, web analysis, text analysis, opinion analysis, image analysis, open-source software
Organisations: Web & Internet Science


Local EPrints ID: 363375
ISSN: 2192-6611
PURE UUID: 5d9cc697-b5d8-442c-8795-e2c333a895df

Catalogue record

Date deposited: 24 Mar 2014 09:04
Last modified: 17 Aug 2017 16:34


Author: David Dupplaw
Author: Michael Matthews
Author: Richard Johansson
Author: Giulia Boato
Author: Andrea Costanzo
Author: Marco Fontani
Author: Enrico Minack
Author: Elena Demidova
Author: Roi Blanco
Author: Thomas Griffiths
Author: Paul H. Lewis
Author: Jonathon Hare
Author: Alessandro Moschitti
