The University of Southampton
University of Southampton Institutional Repository

Learning from data with structured missingness

Learning from data with structured missingness
Learning from data with structured missingness
Missing data are an unavoidable complication in many machine learning tasks. When data are ‘missing at random’ there exist a range of tools and techniques to deal with the issue. However, as machine learning studies become more ambitious, and seek to learn from ever-larger volumes of heterogeneous data, an increasingly encountered problem arises in which missing values exhibit an association or structure, either explicitly or implicitly. Such ‘structured missingness’ raises a range of challenges that have not yet been systematically addressed, and presents a fundamental hindrance to machine learning at scale. Here we outline the current literature and propose a set of grand challenges in learning from data with structured missingness.
2522-5839
13-23
Mitra, Robin
2cabf033-a871-4203-95b7-d8ce0160f69e
McGough, Sarah F.
5e2ee4f8-dfbf-49c6-95ab-2bc63a8d14de
Chakraborti, Tapabrata
26a5ab6f-fd15-4be2-bc8b-ed53f8913548
Holmes, Chris
a4d53eee-2215-4c7c-a680-ec13202a629f
Copping, Ryan
366dd047-c0f2-40ab-bb65-d716e25f88f6
Hagenbuch, Niels
c26af76b-92ed-41ff-b5fb-e4b6b33ed94d
Biedermann, Stefanie
20375ace-9daa-4820-9fd4-ad15432031d4
Noonan, Jack
daf2f66a-dacf-4ab1-9daf-c3af8e84b067
Lehmann, Brieuc
5d1fc034-8a7f-4bc1-b475-b0ab1d86f914
Shenvi, Aditi
13c24145-c55a-405a-8eb6-a975fa1f7864
Doan, Xuan Vinh
73cb290e-8fd8-49d9-8707-48db697af588
Leslie, David
84e28104-7d08-4e97-b742-9a6517a6518a
Bianconi, Ginestra
654d0a71-cae6-4e5a-98c3-b40b3d8a2a96
Sanchez-Garcia, Ruben
8246cea2-ae1c-44f2-94e9-bacc9371c3ed
Davies, Alisha
415499ca-3f43-4a3e-baa4-2d5b2edff90f
Mackintosh, Maxine
0854d0df-abcf-4df7-bc6b-c1f9a484776a
Andrinopoulou, Eleni-Rosalina
58e19830-2989-4da5-8a41-a256e822dda5
Basiri, Anahid
543d3d82-30a9-4c08-b94c-dccac6fa9b3e
Harbron, Chris
c9053c59-3f33-4842-aead-905f4a5b20ec
MacArthur, Ben D.
2c0476e7-5d3e-4064-81bb-104e8e88bb6b
Mitra, Robin
2cabf033-a871-4203-95b7-d8ce0160f69e
McGough, Sarah F.
5e2ee4f8-dfbf-49c6-95ab-2bc63a8d14de
Chakraborti, Tapabrata
26a5ab6f-fd15-4be2-bc8b-ed53f8913548
Holmes, Chris
a4d53eee-2215-4c7c-a680-ec13202a629f
Copping, Ryan
366dd047-c0f2-40ab-bb65-d716e25f88f6
Hagenbuch, Niels
c26af76b-92ed-41ff-b5fb-e4b6b33ed94d
Biedermann, Stefanie
20375ace-9daa-4820-9fd4-ad15432031d4
Noonan, Jack
daf2f66a-dacf-4ab1-9daf-c3af8e84b067
Lehmann, Brieuc
5d1fc034-8a7f-4bc1-b475-b0ab1d86f914
Shenvi, Aditi
13c24145-c55a-405a-8eb6-a975fa1f7864
Doan, Xuan Vinh
73cb290e-8fd8-49d9-8707-48db697af588
Leslie, David
84e28104-7d08-4e97-b742-9a6517a6518a
Bianconi, Ginestra
654d0a71-cae6-4e5a-98c3-b40b3d8a2a96
Sanchez-Garcia, Ruben
8246cea2-ae1c-44f2-94e9-bacc9371c3ed
Davies, Alisha
415499ca-3f43-4a3e-baa4-2d5b2edff90f
Mackintosh, Maxine
0854d0df-abcf-4df7-bc6b-c1f9a484776a
Andrinopoulou, Eleni-Rosalina
58e19830-2989-4da5-8a41-a256e822dda5
Basiri, Anahid
543d3d82-30a9-4c08-b94c-dccac6fa9b3e
Harbron, Chris
c9053c59-3f33-4842-aead-905f4a5b20ec
MacArthur, Ben D.
2c0476e7-5d3e-4064-81bb-104e8e88bb6b

Mitra, Robin, McGough, Sarah F., Chakraborti, Tapabrata, Holmes, Chris, Copping, Ryan, Hagenbuch, Niels, Biedermann, Stefanie, Noonan, Jack, Lehmann, Brieuc, Shenvi, Aditi, Doan, Xuan Vinh, Leslie, David, Bianconi, Ginestra, Sanchez-Garcia, Ruben, Davies, Alisha, Mackintosh, Maxine, Andrinopoulou, Eleni-Rosalina, Basiri, Anahid, Harbron, Chris and MacArthur, Ben D. (2023) Learning from data with structured missingness. Nature Machine Intelligence, 5 (1), 13-23. (doi:10.1038/s42256-022-00596-z).

Record type: Article

Abstract

Missing data are an unavoidable complication in many machine learning tasks. When data are ‘missing at random’ there exist a range of tools and techniques to deal with the issue. However, as machine learning studies become more ambitious, and seek to learn from ever-larger volumes of heterogeneous data, an increasingly encountered problem arises in which missing values exhibit an association or structure, either explicitly or implicitly. Such ‘structured missingness’ raises a range of challenges that have not yet been systematically addressed, and presents a fundamental hindrance to machine learning at scale. Here we outline the current literature and propose a set of grand challenges in learning from data with structured missingness.

Text
Structured_missingness_v3 - Accepted Manuscript
Download (491kB)

More information

Accepted/In Press date: 21 November 2021
Published date: 25 January 2023
Additional Information: Funding Information: This work was sponsored by the Turing-Roche Strategic Partnership. We thank C. Matus for her talents in figure illustrations and design and V. Hellon for her expert community management. Publisher Copyright: © 2023, Springer Nature Limited.

Identifiers

Local EPrints ID: 475147
URI: http://eprints.soton.ac.uk/id/eprint/475147
ISSN: 2522-5839
PURE UUID: ab41abc8-5fdf-4146-a352-9a5fccf03527
ORCID for Ruben Sanchez-Garcia: ORCID iD orcid.org/0000-0001-6479-3028
ORCID for Ben D. MacArthur: ORCID iD orcid.org/0000-0002-5396-9750

Catalogue record

Date deposited: 10 Mar 2023 17:44
Last modified: 17 Mar 2024 07:40

Export record

Altmetrics

Contributors

Author: Robin Mitra
Author: Sarah F. McGough
Author: Tapabrata Chakraborti
Author: Chris Holmes
Author: Ryan Copping
Author: Niels Hagenbuch
Author: Stefanie Biedermann
Author: Jack Noonan
Author: Brieuc Lehmann
Author: Aditi Shenvi
Author: Xuan Vinh Doan
Author: David Leslie
Author: Ginestra Bianconi
Author: Alisha Davies
Author: Maxine Mackintosh
Author: Eleni-Rosalina Andrinopoulou
Author: Anahid Basiri
Author: Chris Harbron

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×