SemanticNews: Enriching publishing of news stories
SemanticNews: Enriching publishing of news stories
A central goal for the EPSRC funded Semantic Media Network project is to support interesting collaboration opportunities between researchers in order to foster relationships and encourage working together (EPSRC priority 'Working Together'). SemanticNews was one of the four projects funded in the first round of Semantic Media Network mini-projects, and was collaboration between the Universities of Southampton and Sheffield, together with the BBC.
The SemanticNews project aimed to promote people's comprehension and assimilation of news by augmenting broadcast news discussion and debate with information from the semantic web in the form of linked open data (LOD). The project has laid the foundations for a toolkit for (semi- ) automatic provision of semantic analysis and contextualization of the discussion of current events, encompassing state of the art semantic web technologies including text mining, consolidation against Linked Open Data, and advanced visualisation.
SemanticNews was bootstrapped using episodes of the BBC Question Time programme that already had transcripts and manually curated metadata, which included a list of the topical questions being debated. This information was used to create a workflow that a) extracts relevant entities using established named entity recognition techniques to identify the types of information to contextualise for a news article; b) provides associations with concepts from LOD resources; and, c) visualises the context using information derived from the LOD cloud.
This document forms the final report of the SemanticNews project, and describes in detail the processes and techniques explored for the enrichment of Question Time episodes. The final section of the report discusses how this work could be expanded in the future, and also makes a few recommendations for additional data that could be could be captured during the production process that would make the automatic generation of the contextualisation easier.
University of Southampton
Hare, Jonathon
65ba2cda-eaaf-4767-a325-cd845504e5a9
Newman, David
eb21ecdd-5ad0-49ec-9dd0-5bf98dc0c6f9
Peters, Wim
a0caaa52-614e-4cb6-a20a-2c8683c4b0f3
Greenwood, Mark
6dcd2b83-1ae7-4c3e-85bc-d86b6cd4f2e8
Eggink, Jana
c4e0cf6c-b1ff-4e0a-99a9-0b8098195a95
9 January 2014
Hare, Jonathon
65ba2cda-eaaf-4767-a325-cd845504e5a9
Newman, David
eb21ecdd-5ad0-49ec-9dd0-5bf98dc0c6f9
Peters, Wim
a0caaa52-614e-4cb6-a20a-2c8683c4b0f3
Greenwood, Mark
6dcd2b83-1ae7-4c3e-85bc-d86b6cd4f2e8
Eggink, Jana
c4e0cf6c-b1ff-4e0a-99a9-0b8098195a95
Hare, Jonathon, Newman, David, Peters, Wim, Greenwood, Mark and Eggink, Jana
(2014)
SemanticNews: Enriching publishing of news stories
University of Southampton
Record type:
Monograph
(Project Report)
Abstract
A central goal for the EPSRC funded Semantic Media Network project is to support interesting collaboration opportunities between researchers in order to foster relationships and encourage working together (EPSRC priority 'Working Together'). SemanticNews was one of the four projects funded in the first round of Semantic Media Network mini-projects, and was collaboration between the Universities of Southampton and Sheffield, together with the BBC.
The SemanticNews project aimed to promote people's comprehension and assimilation of news by augmenting broadcast news discussion and debate with information from the semantic web in the form of linked open data (LOD). The project has laid the foundations for a toolkit for (semi- ) automatic provision of semantic analysis and contextualization of the discussion of current events, encompassing state of the art semantic web technologies including text mining, consolidation against Linked Open Data, and advanced visualisation.
SemanticNews was bootstrapped using episodes of the BBC Question Time programme that already had transcripts and manually curated metadata, which included a list of the topical questions being debated. This information was used to create a workflow that a) extracts relevant entities using established named entity recognition techniques to identify the types of information to contextualise for a news article; b) provides associations with concepts from LOD resources; and, c) visualises the context using information derived from the LOD cloud.
This document forms the final report of the SemanticNews project, and describes in detail the processes and techniques explored for the enrichment of Question Time episodes. The final section of the report discusses how this work could be expanded in the future, and also makes a few recommendations for additional data that could be could be captured during the production process that would make the automatic generation of the contextualisation easier.
Text
SemanticNewsFinalReport.pdf
- Version of Record
More information
Published date: 9 January 2014
Organisations:
Web & Internet Science
Identifiers
Local EPrints ID: 366832
URI: http://eprints.soton.ac.uk/id/eprint/366832
PURE UUID: 6ca0b562-d6f5-46fe-831a-1adac7e0828d
Catalogue record
Date deposited: 11 Jul 2014 13:53
Last modified: 15 Mar 2024 03:26
Export record
Contributors
Author:
Jonathon Hare
Author:
David Newman
Author:
Wim Peters
Author:
Mark Greenwood
Author:
Jana Eggink
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics