Spatiotemporal representation learning for short and long medical image time series
Spatiotemporal representation learning for short and long medical image time series
Analyzing temporal developments is crucial for the accurate prognosis of many medical conditions. Temporal changes that occur over short time scales are key to assessing the health of physiological functions, such as the cardiac cycle. Moreover, tracking longer term developments that occur over months or years in evolving processes, such as age-related macular degeneration (AMD), is essential for accurate prognosis. Despite the importance of both short and long term analysis to clinical decision making, they remain understudied in medical deep learning. State of the art methods for spatiotemporal representation learning, developed for short natural videos, prioritize the detection of temporal constants rather than temporal developments. Moreover, they do not account for varying time intervals between acquisitions, which are essential for contextualizing observed changes. To address these issues, we propose two approaches. First, we combine clip-level contrastive learning with a novel temporal embedding to adapt to irregular time series. Second, we propose masking and predicting latent frame representations of the temporal sequence. Our two approaches outperform all prior methods on temporally-dependent tasks including cardiac output estimation and three prognostic AMD tasks. Overall, this enables the automated analysis of temporal patterns which are typically overlooked in applications of deep learning to medicine.
656–666
Shen, Chengzhi
652d0531-cc84-4e36-8263-394c1d4dd240
Menten, Martin J.
d6fda4c6-08d5-48b4-bff4-9ccae4a04cef
Bogunovic, Hrvoje
d94f0d97-5a27-44d7-912d-9e043c2d449b
Schmidt-Erfurth, Ursula
af993078-6680-4d2a-bc50-ebf6abc3857f
Scholl, Hendrik P.N.
2c38ca3c-90a8-455d-a22c-b52508a9890e
Sivaprasad, Sobha
6ae3de79-66b3-4fc5-a91d-7de5f5887038
Lotery, Andrew
5ecc2d2d-d0b4-468f-ad2c-df7156f8e514
Rueckert, Daniel
3c5b51eb-7e44-4a00-ba79-cafb7ab3d970
Hager, Paul
f26a6330-e181-447e-9e12-3328c3bdf461
Holland, Robbie
0a264fb8-e671-487d-8d29-cca2f5bced91
3 October 2024
Shen, Chengzhi
652d0531-cc84-4e36-8263-394c1d4dd240
Menten, Martin J.
d6fda4c6-08d5-48b4-bff4-9ccae4a04cef
Bogunovic, Hrvoje
d94f0d97-5a27-44d7-912d-9e043c2d449b
Schmidt-Erfurth, Ursula
af993078-6680-4d2a-bc50-ebf6abc3857f
Scholl, Hendrik P.N.
2c38ca3c-90a8-455d-a22c-b52508a9890e
Sivaprasad, Sobha
6ae3de79-66b3-4fc5-a91d-7de5f5887038
Lotery, Andrew
5ecc2d2d-d0b4-468f-ad2c-df7156f8e514
Rueckert, Daniel
3c5b51eb-7e44-4a00-ba79-cafb7ab3d970
Hager, Paul
f26a6330-e181-447e-9e12-3328c3bdf461
Holland, Robbie
0a264fb8-e671-487d-8d29-cca2f5bced91
Shen, Chengzhi, Menten, Martin J., Bogunovic, Hrvoje, Schmidt-Erfurth, Ursula, Scholl, Hendrik P.N., Sivaprasad, Sobha, Lotery, Andrew, Rueckert, Daniel, Hager, Paul and Holland, Robbie
(2024)
Spatiotemporal representation learning for short and long medical image time series.
Linguraru, Marius George, Dou, Qi, Feragen, Aasa, Giannarou, Stamatia, Glocker, Ben, Lekadir, Karim and Schnabel, Julia A.
(eds.)
In Medical Image Computing and Computer Assisted Intervention – MICCAI 2024: 27th International Conference, Marrakesh, Morocco, October 6–10, 2024, Proceedings, Part XI.
vol. 15011,
Springer Cham.
.
(doi:10.1007/978-3-031-72120-5_61).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Analyzing temporal developments is crucial for the accurate prognosis of many medical conditions. Temporal changes that occur over short time scales are key to assessing the health of physiological functions, such as the cardiac cycle. Moreover, tracking longer term developments that occur over months or years in evolving processes, such as age-related macular degeneration (AMD), is essential for accurate prognosis. Despite the importance of both short and long term analysis to clinical decision making, they remain understudied in medical deep learning. State of the art methods for spatiotemporal representation learning, developed for short natural videos, prioritize the detection of temporal constants rather than temporal developments. Moreover, they do not account for varying time intervals between acquisitions, which are essential for contextualizing observed changes. To address these issues, we propose two approaches. First, we combine clip-level contrastive learning with a novel temporal embedding to adapt to irregular time series. Second, we propose masking and predicting latent frame representations of the temporal sequence. Our two approaches outperform all prior methods on temporally-dependent tasks including cardiac output estimation and three prognostic AMD tasks. Overall, this enables the automated analysis of temporal patterns which are typically overlooked in applications of deep learning to medicine.
Text
Manuscript - Spatiotemporal Representation Learning for Short and Long Medical Image Time Series
Restricted to Repository staff only
Request a copy
More information
Published date: 3 October 2024
Venue - Dates:
27th Medical Image Computing and Computer Assisted Intervention 2024 MICCAI, Morocco, 2024-10-06 - 2024-10-10
Identifiers
Local EPrints ID: 497438
URI: http://eprints.soton.ac.uk/id/eprint/497438
ISSN: 0302-9743
PURE UUID: 2b135502-9262-4a9a-99d5-04b9009a7b06
Catalogue record
Date deposited: 22 Jan 2025 17:58
Last modified: 23 Jan 2025 02:39
Export record
Altmetrics
Contributors
Author:
Chengzhi Shen
Author:
Martin J. Menten
Author:
Hrvoje Bogunovic
Author:
Ursula Schmidt-Erfurth
Author:
Hendrik P.N. Scholl
Author:
Sobha Sivaprasad
Author:
Daniel Rueckert
Author:
Paul Hager
Author:
Robbie Holland
Editor:
Marius George Linguraru
Editor:
Qi Dou
Editor:
Aasa Feragen
Editor:
Stamatia Giannarou
Editor:
Ben Glocker
Editor:
Karim Lekadir
Editor:
Julia A. Schnabel
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics