PhySwin: an efficient and physically-informed foundation model for multispectral earth observation
PhySwin: an efficient and physically-informed foundation model for multispectral earth observation
Recent progress on Remote Sensing Foundation Models (RSFMs) aims toward universal representations for Earth observation imagery. However, current effort soften scale up in size significantly without addressing efficiency constraints critical for real-world applications (e.g., onboard processing, rapid disaster response) or treat multispectral (MS) data as generic imagery, overlooking valuable physical priors. We introduce PhySwin, a foundation model for MS data that integrates physical priors with computational efficiency. PhySwin combines three innovations:(i) physics-informed pretraining objectives leveraging radiometric constraints to enhance feature learning; (ii) an efficient Mix MAE formulation tailored to SwinV2for low-FLOP, scalable pretraining; and (iii) token-efficient spectral embedding to retain spectral detail without increasing token counts. Pretrained on over 1MSentinel-2 tiles, PhySwin achieves SOTA results (+1.32% mIoU segmentation,+0.80% F1 change detection) while reducing inference latency by up to 14.4× and computational complexity by up to 43.6× compared to ViT-based RSFMs.
Tang, Chong
9409c6d1-69d2-4598-8b43-bbb7f51f6fe2
Powell, Joseph
8ff08ead-cbd7-4e87-b3c0-c26cda4fd33b
Koch, Dirk
aa152f03-73a8-4874-b87b-55cc00bb9eb3
Mullins, Robert D.
8b498efe-a7e8-42cb-8d49-b8a1a33165b0
Weddell, Alex S.
3d8c4d63-19b1-4072-a779-84d487fd6f03
Chauhan, Jagmohan
831a12dc-6df9-40ea-8bb3-2c5da8882804
18 September 2025
Tang, Chong
9409c6d1-69d2-4598-8b43-bbb7f51f6fe2
Powell, Joseph
8ff08ead-cbd7-4e87-b3c0-c26cda4fd33b
Koch, Dirk
aa152f03-73a8-4874-b87b-55cc00bb9eb3
Mullins, Robert D.
8b498efe-a7e8-42cb-8d49-b8a1a33165b0
Weddell, Alex S.
3d8c4d63-19b1-4072-a779-84d487fd6f03
Chauhan, Jagmohan
831a12dc-6df9-40ea-8bb3-2c5da8882804
Tang, Chong, Powell, Joseph, Koch, Dirk, Mullins, Robert D., Weddell, Alex S. and Chauhan, Jagmohan
(2025)
PhySwin: an efficient and physically-informed foundation model for multispectral earth observation.
In NeurIPS 2025.
27 pp
.
Record type:
Conference or Workshop Item
(Paper)
Abstract
Recent progress on Remote Sensing Foundation Models (RSFMs) aims toward universal representations for Earth observation imagery. However, current effort soften scale up in size significantly without addressing efficiency constraints critical for real-world applications (e.g., onboard processing, rapid disaster response) or treat multispectral (MS) data as generic imagery, overlooking valuable physical priors. We introduce PhySwin, a foundation model for MS data that integrates physical priors with computational efficiency. PhySwin combines three innovations:(i) physics-informed pretraining objectives leveraging radiometric constraints to enhance feature learning; (ii) an efficient Mix MAE formulation tailored to SwinV2for low-FLOP, scalable pretraining; and (iii) token-efficient spectral embedding to retain spectral detail without increasing token counts. Pretrained on over 1MSentinel-2 tiles, PhySwin achieves SOTA results (+1.32% mIoU segmentation,+0.80% F1 change detection) while reducing inference latency by up to 14.4× and computational complexity by up to 43.6× compared to ViT-based RSFMs.
Text
238_PhySwin_An_Efficient_and_P
- Version of Record
More information
Published date: 18 September 2025
Venue - Dates:
NeurIPS 2025: 39th Annual Conference on Neural Information Processing Systems, , San Diego, United States, 2025-12-02 - 2025-12-07
Identifiers
Local EPrints ID: 511237
URI: http://eprints.soton.ac.uk/id/eprint/511237
PURE UUID: 48b2c6ee-e211-4960-9dad-2a06b4ac0cdc
Catalogue record
Date deposited: 08 May 2026 16:54
Last modified: 09 May 2026 01:44
Export record
Contributors
Author:
Chong Tang
Author:
Joseph Powell
Author:
Dirk Koch
Author:
Robert D. Mullins
Author:
Alex S. Weddell
Author:
Jagmohan Chauhan
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics