Geoparsing and geosemantics for social media: spatio-temporal grounding of content propagating rumours to support trust and veracity analysis during breaking news
Geoparsing and geosemantics for social media: spatio-temporal grounding of content propagating rumours to support trust and veracity analysis during breaking news
In recent years there has been a growing trend to use publically available social media sources within the field of journalism. Breaking news has tight reporting deadlines, measured in minutes not days, but content must still be checked and rumours verified. As such journalists are looking at automated content analysis to pre-filter large volumes of social media content prior to manual verification. This paper describes a real-time social media analytics framework for journalists. We extend our previously published geoparsing approach to improve its scalability and efficiency. We develop and evaluate a novel approach to geosemantic feature extraction, classifying evidence in terms of situatedness, timeliness, confirmation and validity. Our approach works for new unseen news topics. We report results from 4 experiments using 5 Twitter datasets crawled during different English-language news events. One of our datasets is the standard TREC 2012 microblog corpus. Our classification results are promising, with F1 scores varying by class from 0.64 to 0.92 for unseen event types. We lastly report results from two case studies during real-world news stories, showcasing different ways our system can assist journalists filter and cross check content as they examine the trust and veracity of content and sources
1-27
Middleton, Stuart E.
404b62ba-d77e-476b-9775-32645b04473f
Krivcovs, Vadims
e51d9530-743c-4f35-acc7-6d021745b4fe
Middleton, Stuart E.
404b62ba-d77e-476b-9775-32645b04473f
Krivcovs, Vadims
e51d9530-743c-4f35-acc7-6d021745b4fe
Middleton, Stuart E. and Krivcovs, Vadims
(2016)
Geoparsing and geosemantics for social media: spatio-temporal grounding of content propagating rumours to support trust and veracity analysis during breaking news.
[in special issue: Trust and Veracity of Information in Social Media]
ACM Transactions on Information Systems, 34 (3), part Article 16, , [16].
(doi:10.1145/2842604).
Abstract
In recent years there has been a growing trend to use publically available social media sources within the field of journalism. Breaking news has tight reporting deadlines, measured in minutes not days, but content must still be checked and rumours verified. As such journalists are looking at automated content analysis to pre-filter large volumes of social media content prior to manual verification. This paper describes a real-time social media analytics framework for journalists. We extend our previously published geoparsing approach to improve its scalability and efficiency. We develop and evaluate a novel approach to geosemantic feature extraction, classifying evidence in terms of situatedness, timeliness, confirmation and validity. Our approach works for new unseen news topics. We report results from 4 experiments using 5 Twitter datasets crawled during different English-language news events. One of our datasets is the standard TREC 2012 microblog corpus. Our classification results are promising, with F1 scores varying by class from 0.64 to 0.92 for unseen event types. We lastly report results from two case studies during real-world news stories, showcasing different ways our system can assist journalists filter and cross check content as they examine the trust and veracity of content and sources
Text
390820.pdf
- Accepted Manuscript
More information
Accepted/In Press date: 1 February 2016
e-pub ahead of print date: 1 April 2016
Organisations:
IT Innovation
Identifiers
Local EPrints ID: 390820
URI: http://eprints.soton.ac.uk/id/eprint/390820
PURE UUID: 37837379-b716-4846-bfd5-88d39b191975
Catalogue record
Date deposited: 07 Apr 2016 12:00
Last modified: 15 Mar 2024 03:08
Export record
Altmetrics
Contributors
Author:
Vadims Krivcovs
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics