The University of Southampton
University of Southampton Institutional Repository

Geometry-enhanced attentive multi-view stereo for challenging matching scenarios

Geometry-enhanced attentive multi-view stereo for challenging matching scenarios
Geometry-enhanced attentive multi-view stereo for challenging matching scenarios
Deep networks have made remarkable progress in Multi-View Stereo (MVS) task in recent years. However, the problem of finding accurate correspondences across different views under ill-posed matching situations remains unresolved and crucial. To address this issue, this paper proposes a Geometryenhanced Attentive Multi-View Stereo (GA-MVS) network, which can access multi-view consistent feature representation and achieve accurate depth estimation in challenging situations. Specifically, we propose a geometry-enhanced feature extractor to explore illumination-invariant geometric features and incorporate them with common texture features to improve matching accuracy when dealing with view-dependent photometric effects, such as shadow and specularity. Then, we design a novel attentive learning framework to explore per-pixel adaptive supervision, effectively improving the depth estimation performance of textureless regions. The experimental results on the DTU and Tanks & Temples benchmarks demonstrate that our method achieves state-of-the-art results compared to other advanced MVS models.
3D Reconstruction, Costs, Deep Learning, Depth Estimation, Estimation, Feature extraction, Geometric features, Loss measurement, Multi-View Stereo, Pipelines, Reliability, Three-dimensional displays, depth estimation, Multi-view stereo, deep learning, 3D reconstruction, geometric features
1558-2205
7401-7416
Liu, Yimei
7a0af0a6-ab47-4ba7-af50-63315b1ad96c
Cai, Qin
0ccb84a1-cc9b-4d1f-a8e8-16cec26ae7ba
Wang, Congcong
d65cd371-4ae5-4b16-ad38-d04ca26947d6
Yang, Jian
a95e75db-6340-4085-af35-54e655b46b6f
Fan, Hao
9313d4fe-cd51-4c5e-a5cf-548b9ba8d0c8
Dong, Junyu
cb626ba3-7c15-4441-b364-bc33349ad5b4
Chen, Sheng
9310a111-f79a-48b8-98c7-383ca93cbb80
Liu, Yimei
7a0af0a6-ab47-4ba7-af50-63315b1ad96c
Cai, Qin
0ccb84a1-cc9b-4d1f-a8e8-16cec26ae7ba
Wang, Congcong
d65cd371-4ae5-4b16-ad38-d04ca26947d6
Yang, Jian
a95e75db-6340-4085-af35-54e655b46b6f
Fan, Hao
9313d4fe-cd51-4c5e-a5cf-548b9ba8d0c8
Dong, Junyu
cb626ba3-7c15-4441-b364-bc33349ad5b4
Chen, Sheng
9310a111-f79a-48b8-98c7-383ca93cbb80

Liu, Yimei, Cai, Qin, Wang, Congcong, Yang, Jian, Fan, Hao, Dong, Junyu and Chen, Sheng (2024) Geometry-enhanced attentive multi-view stereo for challenging matching scenarios. IEEE Transactions on Circuits and Systems for Video Technology, 34 (8), 7401-7416. (doi:10.1109/TCSVT.2024.3376692).

Record type: Article

Abstract

Deep networks have made remarkable progress in Multi-View Stereo (MVS) task in recent years. However, the problem of finding accurate correspondences across different views under ill-posed matching situations remains unresolved and crucial. To address this issue, this paper proposes a Geometryenhanced Attentive Multi-View Stereo (GA-MVS) network, which can access multi-view consistent feature representation and achieve accurate depth estimation in challenging situations. Specifically, we propose a geometry-enhanced feature extractor to explore illumination-invariant geometric features and incorporate them with common texture features to improve matching accuracy when dealing with view-dependent photometric effects, such as shadow and specularity. Then, we design a novel attentive learning framework to explore per-pixel adaptive supervision, effectively improving the depth estimation performance of textureless regions. The experimental results on the DTU and Tanks & Temples benchmarks demonstrate that our method achieves state-of-the-art results compared to other advanced MVS models.

Text
oucPaper7MR - Accepted Manuscript
Download (17MB)
Text
TCSVT2024-Aug - Version of Record
Restricted to Repository staff only
Request a copy

More information

Accepted/In Press date: 6 March 2024
e-pub ahead of print date: 18 March 2024
Published date: 12 August 2024
Keywords: 3D Reconstruction, Costs, Deep Learning, Depth Estimation, Estimation, Feature extraction, Geometric features, Loss measurement, Multi-View Stereo, Pipelines, Reliability, Three-dimensional displays, depth estimation, Multi-view stereo, deep learning, 3D reconstruction, geometric features

Identifiers

Local EPrints ID: 487878
URI: http://eprints.soton.ac.uk/id/eprint/487878
ISSN: 1558-2205
PURE UUID: 5e74d409-5993-4993-9f0c-57a684de2917

Catalogue record

Date deposited: 08 Mar 2024 17:31
Last modified: 18 Dec 2024 17:57

Export record

Altmetrics

Contributors

Author: Yimei Liu
Author: Qin Cai
Author: Congcong Wang
Author: Jian Yang
Author: Hao Fan
Author: Junyu Dong
Author: Sheng Chen

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×