The University of Southampton
University of Southampton Institutional Repository

Predicting sense of community and participation by applying machine learning to open government data


Piscopo, Alessandro, Siebes, Ronald and Hardman, Lynda (2017) Predicting sense of community and participation by applying machine learning to open government data. Policy and Internet, 9 (1), 55-75. (doi:10.1002/poi3.145).

Record type: Article

Abstract

Community capacity is used to monitor socioeconomic development. It is composed of a number of dimensions that can be measured to understand issues that may arise in the implementation of a policy or of a project targeting a community. Measuring these dimensions is thus highly valuable for policymakers and local administrators, though expensive and time-consuming. To address this issue, we evaluated their estimation through a machine learning technique, Random Forests, applied to secondary open government data and determined the most important variables for prediction. We focused on two dimensions: sense of community and participation. The variables included in the data sets used to train the predictive models complied with two criteria: nationwide availability and a sufficiently fine-grained geographic breakdown, that is, neighborhood level. Our resultant models are more accurate than others based on traditional statistics found in the literature, showing the feasibility of the approach. The most determinant variables in our models were only partially in agreement with the most influential factors for sense of community and participation according to the social science literature consulted, providing a starting point for future investigation from a social science perspective. Moreover, due to the lack of geographic detail in the available outcome measures, further research is required to apply the predictive models at a neighborhood level.
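The general technique named in the abstract, fitting a Random Forest and ranking variables by importance, can be illustrated with a minimal pure-Python sketch. This is not the authors' code or data: the three predictor names, the synthetic outcome, the tree depth, and the number of trees are all assumptions made for illustration, and importance is measured here as the total reduction in squared error attributed to each variable.

```python
import random

def mean(values):
    return sum(values) / len(values)

def sq_dev(values):
    """Sum of squared deviations from the mean; 0.0 for an empty list."""
    if not values:
        return 0.0
    m = mean(values)
    return sum((v - m) ** 2 for v in values)

def build_tree(rows, ys, k, depth, rng, importance):
    """Grow a regression tree, crediting each split's gain to its feature."""
    if depth == 0 or len(rows) < 4:
        return mean(ys)
    parent = sq_dev(ys)
    best = None
    # Consider only k randomly chosen features at each split.
    for f in rng.sample(range(len(rows[0])), k):
        for t in sorted({r[f] for r in rows})[1:]:
            left = [y for r, y in zip(rows, ys) if r[f] < t]
            right = [y for r, y in zip(rows, ys) if r[f] >= t]
            gain = parent - sq_dev(left) - sq_dev(right)
            if best is None or gain > best[0]:
                best = (gain, f, t)
    if best is None or best[0] <= 0:
        return mean(ys)
    gain, f, t = best
    importance[f] += gain
    lo = [(r, y) for r, y in zip(rows, ys) if r[f] < t]
    hi = [(r, y) for r, y in zip(rows, ys) if r[f] >= t]
    return (f, t,
            build_tree([r for r, _ in lo], [y for _, y in lo],
                       k, depth - 1, rng, importance),
            build_tree([r for r, _ in hi], [y for _, y in hi],
                       k, depth - 1, rng, importance))

def fit_forest(rows, ys, n_trees=20, max_depth=3, k=2, seed=0):
    """Fit trees on bootstrap samples; return trees and normalized importances."""
    rng = random.Random(seed)
    importance = [0.0] * len(rows[0])
    trees = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]  # bootstrap sample
        trees.append(build_tree([rows[i] for i in idx], [ys[i] for i in idx],
                                k, max_depth, rng, importance))
    total = sum(importance) or 1.0
    return trees, [v / total for v in importance]

def predict_tree(node, row):
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if row[f] < t else right
    return node

def predict_forest(trees, row):
    """Average the predictions of all trees."""
    return mean([predict_tree(tree, row) for tree in trees])

# Synthetic stand-in for area-level indicators (hypothetical names).
FEATURES = ["unemployment_rate", "median_age", "unrelated_indicator"]
data_rng = random.Random(42)
rows = [[data_rng.random() for _ in FEATURES] for _ in range(120)]
# Assumed outcome: driven mostly by the first variable, slightly by the second.
ys = [5.0 * r[0] + 1.0 * r[1] + data_rng.gauss(0, 0.1) for r in rows]

forest, importances = fit_forest(rows, ys)
for name, score in sorted(zip(FEATURES, importances), key=lambda p: -p[1]):
    print(f"{name}: {score:.3f}")
```

In practice an established library implementation would be preferable to this hand-rolled version; the sketch is only meant to show how a ranking of "most important variables" falls out of the fitted trees.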

This record has no associated files available for download.

More information

Accepted/In Press date: 16 March 2017
e-pub ahead of print date: 16 March 2017
Published date: March 2017
Keywords: open government data, Machine Learning, communities

Identifiers

Local EPrints ID: 413741
URI: http://eprints.soton.ac.uk/id/eprint/413741
ISSN: 1944-2866
PURE UUID: 22397a9e-2e0c-4ba7-9a4d-26768423a06c
ORCID for Alessandro Piscopo: orcid.org/0000-0002-0362-4826

Catalogue record

Date deposited: 01 Sep 2017 16:32
Last modified: 15 Apr 2024 17:04

Contributors

Author: Alessandro Piscopo
Author: Ronald Siebes
Author: Lynda Hardman
