The University of Southampton
University of Southampton Institutional Repository

4D-VQ-GAN: a world model for synthesizing medical scans at any time point for personalized disease progression modeling of idiopathic pulmonary fibrosis

4D-VQ-GAN: a world model for synthesizing medical scans at any time point for personalized disease progression modeling of idiopathic pulmonary fibrosis
4D-VQ-GAN: a world model for synthesizing medical scans at any time point for personalized disease progression modeling of idiopathic pulmonary fibrosis
Understanding the progression trajectories of diseases is crucial for early diagnosis and effective treatment planning. This is especially vital for life-threatening conditions such as Idiopathic Pulmonary Fibrosis (IPF), a chronic, progressive lung disease with a prognosis comparable to many cancers. Computed tomography (CT) imaging has been established as a reliable diagnostic tool for IPF. Accurately predicting future CT scans of early-stage IPF patients can aid in developing better treatment strategies, thereby improving survival outcomes. Recent world models have shown success in simulating the temporal dynamics of the physical world by learning from videos. In this paper, we propose the first world model for IPF, to generate realistic scans of early-stage IPF patients at any time point. We term our model 4D Vector Quantised Generative Adversarial Networks (4D-VQ-GAN). Our model is trained using a two-stage approach. In the first stage, a 3D-VQ-GAN is trained to reconstruct CT volumes. In the second stage, a Neural Ordinary Differential Equation (ODE) model is trained to capture the temporal dynamics of the quantised embeddings, which are generated by the encoder trained in the first stage. For clinical validation, we conduct survival analysis using imaging biomarkers derived from generated CT scans and achieve a C-index either better than or comparable to that of biomarkers derived from the real CT scans. The survival analysis results suggest the potential clinical utility inherent to generated longitudinal CT scans, showing that they can reliably predict survival outcomes.
2640-3498
ML Research Press
Zhao, An
695a1a7d-4b04-4536-a6ab-bac4eaaf7cc1
Xu, Moucheng
cf54e767-665d-4088-a9d2-7095196f0a5b
Shahin, Ahmed H.
7a365bbe-dda9-4f49-a71a-fc410d7680a5
Wuyts, Wim
8eee8452-2e7c-4472-adb1-ce69b988f2bc
Jones, Mark G.
a6fd492e-058e-4e84-a486-34c6035429c1
Jacob, Joseph
b93a90c4-de81-4001-8f6f-bfd9f54a3d29
Alexander, Daniel C.
2ec40902-1964-4584-92ea-8a55b8fa6317
Zhao, An
695a1a7d-4b04-4536-a6ab-bac4eaaf7cc1
Xu, Moucheng
cf54e767-665d-4088-a9d2-7095196f0a5b
Shahin, Ahmed H.
7a365bbe-dda9-4f49-a71a-fc410d7680a5
Wuyts, Wim
8eee8452-2e7c-4472-adb1-ce69b988f2bc
Jones, Mark G.
a6fd492e-058e-4e84-a486-34c6035429c1
Jacob, Joseph
b93a90c4-de81-4001-8f6f-bfd9f54a3d29
Alexander, Daniel C.
2ec40902-1964-4584-92ea-8a55b8fa6317

Zhao, An, Xu, Moucheng, Shahin, Ahmed H., Wuyts, Wim, Jones, Mark G., Jacob, Joseph and Alexander, Daniel C. (2025) 4D-VQ-GAN: a world model for synthesizing medical scans at any time point for personalized disease progression modeling of idiopathic pulmonary fibrosis. In Medical Imaging with Deep Learning 2025. ML Research Press. 31 pp . (In Press)

Record type: Conference or Workshop Item (Paper)

Abstract

Understanding the progression trajectories of diseases is crucial for early diagnosis and effective treatment planning. This is especially vital for life-threatening conditions such as Idiopathic Pulmonary Fibrosis (IPF), a chronic, progressive lung disease with a prognosis comparable to many cancers. Computed tomography (CT) imaging has been established as a reliable diagnostic tool for IPF. Accurately predicting future CT scans of early-stage IPF patients can aid in developing better treatment strategies, thereby improving survival outcomes. Recent world models have shown success in simulating the temporal dynamics of the physical world by learning from videos. In this paper, we propose the first world model for IPF, to generate realistic scans of early-stage IPF patients at any time point. We term our model 4D Vector Quantised Generative Adversarial Networks (4D-VQ-GAN). Our model is trained using a two-stage approach. In the first stage, a 3D-VQ-GAN is trained to reconstruct CT volumes. In the second stage, a Neural Ordinary Differential Equation (ODE) model is trained to capture the temporal dynamics of the quantised embeddings, which are generated by the encoder trained in the first stage. For clinical validation, we conduct survival analysis using imaging biomarkers derived from generated CT scans and achieve a C-index either better than or comparable to that of biomarkers derived from the real CT scans. The survival analysis results suggest the potential clinical utility inherent to generated longitudinal CT scans, showing that they can reliably predict survival outcomes.

Text
midl_2025_camera_ready-1 - Accepted Manuscript
Download (8MB)

More information

Accepted/In Press date: 8 May 2025
Additional Information: For the purpose to support future research, the author has applied a CC BY-NC 4.0 copyright licence to any author accepted manuscript version arising from this submission.
Venue - Dates: Medical Imaging with Deep Learning, , Salt Lake City, United States, 2025-07-09 - 2025-07-11

Identifiers

Local EPrints ID: 502450
URI: http://eprints.soton.ac.uk/id/eprint/502450
ISSN: 2640-3498
PURE UUID: 8fb2f9ff-d55a-4a47-8b85-f29a9722e618
ORCID for Mark G. Jones: ORCID iD orcid.org/0000-0001-6308-6014

Catalogue record

Date deposited: 26 Jun 2025 17:00
Last modified: 11 Jul 2025 04:01

Export record

Contributors

Author: An Zhao
Author: Moucheng Xu
Author: Ahmed H. Shahin
Author: Wim Wuyts
Author: Mark G. Jones ORCID iD
Author: Joseph Jacob
Author: Daniel C. Alexander

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×