Streaming Bayesian inference for crowdsourced classification
Streaming Bayesian inference for crowdsourced classification
A key challenge in crowdsourcing is inferring the ground truth from noisy and unreliable data. To do so, existing approaches rely on collecting redundant information from the crowd, and aggregating it with some probabilistic method. However, oftentimes such methods are computationally inefficient, are restricted to some specific settings, or lack theoretical guarantees. In this paper, we revisit the problem of binary classification from crowdsourced data. Specifically we propose Streaming Bayesian Inference for Crowdsourcing (SBIC), a new algorithm that does not suffer from any of these limitations. First, SBIC has low complexity and can be used in a real-time online setting. Second, SBIC has the same accuracy as the best state-of-the-art algorithms in all settings. Third, SBIC has provable asymptotic guarantees both in the online and offline settings.
Manino, Edoardo
e5cec65c-c44b-45de-8255-7b1d8edfc04d
Tran-Thanh, Long
e0666669-d34b-460e-950d-e8b139fab16c
Jennings, Nicholas
ab3d94cc-247c-4545-9d1e-65873d6cdb30
Manino, Edoardo
e5cec65c-c44b-45de-8255-7b1d8edfc04d
Tran-Thanh, Long
e0666669-d34b-460e-950d-e8b139fab16c
Jennings, Nicholas
ab3d94cc-247c-4545-9d1e-65873d6cdb30
Manino, Edoardo, Tran-Thanh, Long and Jennings, Nicholas
(2019)
Streaming Bayesian inference for crowdsourced classification.
33rd Conference on Neural Information Processing Systems (NeurIPS 2019), , Vancouver, Canada.
08 - 14 Dec 2019.
11 pp
.
Record type:
Conference or Workshop Item
(Paper)
Abstract
A key challenge in crowdsourcing is inferring the ground truth from noisy and unreliable data. To do so, existing approaches rely on collecting redundant information from the crowd, and aggregating it with some probabilistic method. However, oftentimes such methods are computationally inefficient, are restricted to some specific settings, or lack theoretical guarantees. In this paper, we revisit the problem of binary classification from crowdsourced data. Specifically we propose Streaming Bayesian Inference for Crowdsourcing (SBIC), a new algorithm that does not suffer from any of these limitations. First, SBIC has low complexity and can be used in a real-time online setting. Second, SBIC has the same accuracy as the best state-of-the-art algorithms in all settings. Third, SBIC has provable asymptotic guarantees both in the online and offline settings.
Text
Paper v4
- Version of Record
More information
Accepted/In Press date: 2019
e-pub ahead of print date: December 2019
Venue - Dates:
33rd Conference on Neural Information Processing Systems (NeurIPS 2019), , Vancouver, Canada, 2019-12-08 - 2019-12-14
Identifiers
Local EPrints ID: 435912
URI: http://eprints.soton.ac.uk/id/eprint/435912
PURE UUID: 988e738c-4142-42ac-971b-8c9e27f8474f
Catalogue record
Date deposited: 22 Nov 2019 17:30
Last modified: 16 Mar 2024 05:13
Export record
Contributors
Author:
Edoardo Manino
Author:
Long Tran-Thanh
Author:
Nicholas Jennings
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics