Geometry-enhanced attentive multi-view stereo for challenging matching scenarios
Geometry-enhanced attentive multi-view stereo for challenging matching scenarios
Deep networks have made remarkable progress in Multi-View Stereo (MVS) task in recent years. However, the problem of finding accurate correspondences across different views under ill-posed matching situations remains unresolved and crucial. To address this issue, this paper proposes a Geometryenhanced Attentive Multi-View Stereo (GA-MVS) network, which can access multi-view consistent feature representation and achieve accurate depth estimation in challenging situations. Specifically, we propose a geometry-enhanced feature extractor to explore illumination-invariant geometric features and incorporate them with common texture features to improve matching accuracy when dealing with view-dependent photometric effects, such as shadow and specularity. Then, we design a novel attentive learning framework to explore per-pixel adaptive supervision, effectively improving the depth estimation performance of textureless regions. The experimental results on the DTU and Tanks & Temples benchmarks demonstrate that our method achieves state-of-the-art results compared to other advanced MVS models.
3D Reconstruction, Costs, Deep Learning, Depth Estimation, Estimation, Feature extraction, Geometric features, Loss measurement, Multi-View Stereo, Pipelines, Reliability, Three-dimensional displays, depth estimation, Multi-view stereo, deep learning, 3D reconstruction, geometric features
7401-7416
Liu, Yimei
7a0af0a6-ab47-4ba7-af50-63315b1ad96c
Cai, Qin
0ccb84a1-cc9b-4d1f-a8e8-16cec26ae7ba
Wang, Congcong
d65cd371-4ae5-4b16-ad38-d04ca26947d6
Yang, Jian
a95e75db-6340-4085-af35-54e655b46b6f
Fan, Hao
9313d4fe-cd51-4c5e-a5cf-548b9ba8d0c8
Dong, Junyu
cb626ba3-7c15-4441-b364-bc33349ad5b4
Chen, Sheng
9310a111-f79a-48b8-98c7-383ca93cbb80
12 August 2024
Liu, Yimei
7a0af0a6-ab47-4ba7-af50-63315b1ad96c
Cai, Qin
0ccb84a1-cc9b-4d1f-a8e8-16cec26ae7ba
Wang, Congcong
d65cd371-4ae5-4b16-ad38-d04ca26947d6
Yang, Jian
a95e75db-6340-4085-af35-54e655b46b6f
Fan, Hao
9313d4fe-cd51-4c5e-a5cf-548b9ba8d0c8
Dong, Junyu
cb626ba3-7c15-4441-b364-bc33349ad5b4
Chen, Sheng
9310a111-f79a-48b8-98c7-383ca93cbb80
Liu, Yimei, Cai, Qin, Wang, Congcong, Yang, Jian, Fan, Hao, Dong, Junyu and Chen, Sheng
(2024)
Geometry-enhanced attentive multi-view stereo for challenging matching scenarios.
IEEE Transactions on Circuits and Systems for Video Technology, 34 (8), .
(doi:10.1109/TCSVT.2024.3376692).
Abstract
Deep networks have made remarkable progress in Multi-View Stereo (MVS) task in recent years. However, the problem of finding accurate correspondences across different views under ill-posed matching situations remains unresolved and crucial. To address this issue, this paper proposes a Geometryenhanced Attentive Multi-View Stereo (GA-MVS) network, which can access multi-view consistent feature representation and achieve accurate depth estimation in challenging situations. Specifically, we propose a geometry-enhanced feature extractor to explore illumination-invariant geometric features and incorporate them with common texture features to improve matching accuracy when dealing with view-dependent photometric effects, such as shadow and specularity. Then, we design a novel attentive learning framework to explore per-pixel adaptive supervision, effectively improving the depth estimation performance of textureless regions. The experimental results on the DTU and Tanks & Temples benchmarks demonstrate that our method achieves state-of-the-art results compared to other advanced MVS models.
Text
oucPaper7MR
- Accepted Manuscript
Text
TCSVT2024-Aug
- Version of Record
Restricted to Repository staff only
Request a copy
More information
Accepted/In Press date: 6 March 2024
e-pub ahead of print date: 18 March 2024
Published date: 12 August 2024
Keywords:
3D Reconstruction, Costs, Deep Learning, Depth Estimation, Estimation, Feature extraction, Geometric features, Loss measurement, Multi-View Stereo, Pipelines, Reliability, Three-dimensional displays, depth estimation, Multi-view stereo, deep learning, 3D reconstruction, geometric features
Identifiers
Local EPrints ID: 487878
URI: http://eprints.soton.ac.uk/id/eprint/487878
ISSN: 1558-2205
PURE UUID: 5e74d409-5993-4993-9f0c-57a684de2917
Catalogue record
Date deposited: 08 Mar 2024 17:31
Last modified: 18 Dec 2024 17:57
Export record
Altmetrics
Contributors
Author:
Yimei Liu
Author:
Qin Cai
Author:
Congcong Wang
Author:
Jian Yang
Author:
Hao Fan
Author:
Junyu Dong
Author:
Sheng Chen
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics