NavMapFusion: diffusion-based fusion of navigation maps for online Vectorized HD map construction
NavMapFusion: diffusion-based fusion of navigation maps for online Vectorized HD map construction
Accurate environmental representations are essential for autonomous driving, providing the foundation for safe and efficient navigation. Traditionally, high-definition (HD) maps are providing this representation of the static road infrastructure to the autonomous system a priori. However, because the real world is constantly changing, such maps must be constructed online from on-board sensor data. Navigation-grade standard-definition (SD) maps are widely available, but their resolution is insufficient for direct deployment. Instead, they can be used as coarse prior to guide the online map construction process. We propose NavMapFusion, a diffusion-based framework that performs iterative denoising conditioned on high-fidelity sensor data and on low-fidelity navigation maps. This paper strives to answer: (1) How can coarse, potentially outdated navigation maps guide online map construction? (2) What advantages do diffusion models offer for map fusion? We demonstrate that diffusion-based map construction provides a robust framework for map fusion. Our key insight is that discrepancies between the prior map and online perception naturally correspond to noise within the diffusion process; consistent regions reinforce the map construction, whereas outdated segments are suppressed. On the nuScenes benchmark, NavMapFusion conditioned on coarse road lines from OpenStreetMap data reaches a 21.4 % relative improvement on 100 m, and even stronger improvements on larger perception ranges, while maintaining real-time capabilities. By fusing low-fidelity priors with high-fidelity sensor data, the proposed method generates accurate and upto-date environment representations, guiding towards safer and more reliable autonomous driving. The code is available at https://github.com/tmonnin/navmapfusion.
7945-7954
Monninger, Thomas
4b9da19d-b0db-44fa-81df-85cfa01bb716
Zhang, Zihan
26d6216a-8a66-4f2a-a76d-fe2326d4a8fc
Staab, Steffen
bf48d51b-bd11-4d58-8e1c-4e6e03b30c49
Ding, Sihao
9418f5ac-b2f1-4ad9-9cd0-fa7190279ceb
5 May 2026
Monninger, Thomas
4b9da19d-b0db-44fa-81df-85cfa01bb716
Zhang, Zihan
26d6216a-8a66-4f2a-a76d-fe2326d4a8fc
Staab, Steffen
bf48d51b-bd11-4d58-8e1c-4e6e03b30c49
Ding, Sihao
9418f5ac-b2f1-4ad9-9cd0-fa7190279ceb
Monninger, Thomas, Zhang, Zihan, Staab, Steffen and Ding, Sihao
(2026)
NavMapFusion: diffusion-based fusion of navigation maps for online Vectorized HD map construction.
In 2026 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).
IEEE.
.
(doi:10.1109/WACV61042.2026.00767).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Accurate environmental representations are essential for autonomous driving, providing the foundation for safe and efficient navigation. Traditionally, high-definition (HD) maps are providing this representation of the static road infrastructure to the autonomous system a priori. However, because the real world is constantly changing, such maps must be constructed online from on-board sensor data. Navigation-grade standard-definition (SD) maps are widely available, but their resolution is insufficient for direct deployment. Instead, they can be used as coarse prior to guide the online map construction process. We propose NavMapFusion, a diffusion-based framework that performs iterative denoising conditioned on high-fidelity sensor data and on low-fidelity navigation maps. This paper strives to answer: (1) How can coarse, potentially outdated navigation maps guide online map construction? (2) What advantages do diffusion models offer for map fusion? We demonstrate that diffusion-based map construction provides a robust framework for map fusion. Our key insight is that discrepancies between the prior map and online perception naturally correspond to noise within the diffusion process; consistent regions reinforce the map construction, whereas outdated segments are suppressed. On the nuScenes benchmark, NavMapFusion conditioned on coarse road lines from OpenStreetMap data reaches a 21.4 % relative improvement on 100 m, and even stronger improvements on larger perception ranges, while maintaining real-time capabilities. By fusing low-fidelity priors with high-fidelity sensor data, the proposed method generates accurate and upto-date environment representations, guiding towards safer and more reliable autonomous driving. The code is available at https://github.com/tmonnin/navmapfusion.
Text
Monninger_NavMapFusion_Diffusion-based_Fusion_of_Navigation_Maps_for_Online_Vectorized_HD_WACV_2026_paper
- Accepted Manuscript
More information
Published date: 5 May 2026
Venue - Dates:
The IEEE/CVF Winter Conference on Applications of Computer Vision, , Tucson, Arizona, United States, 2026-03-06 - 2026-03-10
Identifiers
Local EPrints ID: 511655
URI: http://eprints.soton.ac.uk/id/eprint/511655
PURE UUID: b02cef84-31f2-4620-b2e6-e347a2a6c526
Catalogue record
Date deposited: 26 May 2026 17:04
Last modified: 27 May 2026 01:48
Export record
Altmetrics
Contributors
Author:
Thomas Monninger
Author:
Zihan Zhang
Author:
Steffen Staab
Author:
Sihao Ding
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics