The University of Southampton
University of Southampton Institutional Repository

Information extraction from multimedia web documents: an open-source platform and testbed

Dupplaw, David, Matthews, Michael, Johansson, Richard, Boato, Giulia, Costanzo, Andrea, Fontani, Marco, Minack, Enrico, Demidova, Elena, Blanco, Roi, Griffiths, Thomas, Lewis, Paul H., Hare, Jonathon and Moschitti, Alessandro (2014) Information extraction from multimedia web documents: an open-source platform and testbed. International Journal of Multimedia Information Retrieval, 3 (2), 97-111. (doi:10.1007/s13735-014-0051-2).

Record type: Article

Abstract

The LivingKnowledge project aimed to enhance the current state of the art in search, retrieval and knowledge management on the web by advancing the use of sentiment and opinion analysis within multimedia applications. To achieve this aim, a diverse set of novel and complementary analysis techniques has been integrated into a single but extensible software platform on which such applications can be built. The platform combines state-of-the-art techniques for extracting facts, opinions and sentiment from multimedia documents and, unlike earlier platforms, exploits both visual and textual techniques to support multimedia information retrieval. Because the software is expected to be useful to the wider community, the platform has been made generally available as an open-source project. This paper describes the platform design, gives an overview of the analysis algorithms integrated into the system and describes two applications that use the system for multimedia information retrieval.

Text: 237/art%3A10.1007%2Fs13735-014-0051-2.pdf_auth66=1395824070_9417f334b9173792e299c86ef4043256&ext=.pdf - Accepted Manuscript (1MB)
Text: IJMIRpaperv2pljh.pdf - Author's Original (1MB)

More information

Accepted/In Press date: 20 February 2014
e-pub ahead of print date: 21 March 2014
Published date: June 2014
Keywords: multimedia retrieval, web analysis, text analysis, opinion analysis, image analysis, open-source software
Organisations: Web & Internet Science

Identifiers

Local EPrints ID: 363375
URI: http://eprints.soton.ac.uk/id/eprint/363375
ISSN: 2192-6611
PURE UUID: 5d9cc697-b5d8-442c-8795-e2c333a895df
ORCID for Jonathon Hare: orcid.org/0000-0003-2921-4283

Catalogue record

Date deposited: 24 Mar 2014 09:04
Last modified: 15 Mar 2024 03:25

Contributors

Author: David Dupplaw
Author: Michael Matthews
Author: Richard Johansson
Author: Giulia Boato
Author: Andrea Costanzo
Author: Marco Fontani
Author: Enrico Minack
Author: Elena Demidova
Author: Roi Blanco
Author: Thomas Griffiths
Author: Paul H. Lewis
Author: Jonathon Hare
Author: Alessandro Moschitti
