Title | PSP-MVSNet: Deep Patch-Based Similarity Perceptual for Multi-view Stereo Depth Inference |
Creator | |
Date Issued | 2022 |
Conference Name | 31st International Conference on Artificial Neural Networks, ICANN 2022 |
Source Publication | ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I (Lecture Notes in Computer Science)
![]() |
Editor | Elias Pimenidis, Plamen Angelov, Chrisina Jayne, Antonios Papaleonidas, Mehmet Aydin |
ISBN | 9783031159190 |
ISSN | 0302-9743 |
Volume | Lecture Notes in Computer Science (LNCS, volume 13529) |
Pages | 316-328 |
Conference Date | SEP 06-09, 2022 |
Conference Place | Bristol |
Country | England |
Publisher | Springer |
Abstract | This paper proposes PSP-MVSNet for depth inference problem in multi-view stereo (MVS). We first introduce a novel patch-based similarity perceptual (PSP) module for effectively constructing 3D cost volume. Unlike previous methods that leverage variance-based operators to fuse feature volumes of different views, our method leverages a cosine similarity measure to calculate matching scores for pairs of deep feature vectors and then treats these scores as weights for constructing the 3D cost volume. This is based on an important observation that many performance degradation factors, e.g., illumination changes or occlusions, will lead to pixel differences between multi-view images. We demonstrate that a patch-based cosine similarity can be used as explicit supervision for feature learning and can help speed up convergence. Furthermore, To adaptively set different depth ranges for different pixels, we extend an existing dynamic depth range searching method with a simple yet effective improvement. We can use this improved searching method to train our model in an end-to-end manner and further improve the performance of our method. Experimental results show that our method achieves state-of-the-art performance on the DTU dataset and comparative results on the intermediate set of Tanks and Temples dataset. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG. |
Keyword | Depth estimation Dynamic depth range Multi-view stereo Patch-based similarity |
DOI | 10.1007/978-3-031-15919-0_27 |
URL | View source |
Indexed By | CPCI-S |
Language | 英语English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Theory & Methods |
WOS ID | WOS:000866210600027 |
Scopus ID | 2-s2.0-85138793695 |
Citation statistics | |
Document Type | Conference paper |
Identifier | http://repository.uic.edu.cn/handle/39GCC9TT/11531 |
Collection | Faculty of Science and Technology |
Corresponding Author | Zhang, Hui |
Affiliation | 1.Department of Computer Science, Hong Kong Baptist University, Hong Kong 2.Guangdong Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College, Zhuhai, China |
First Author Affilication | Beijing Normal-Hong Kong Baptist University |
Corresponding Author Affilication | Beijing Normal-Hong Kong Baptist University |
Recommended Citation GB/T 7714 | Jie, Leiping,Zhang, Hui. PSP-MVSNet: Deep Patch-Based Similarity Perceptual for Multi-view Stereo Depth Inference[C]//Elias Pimenidis, Plamen Angelov, Chrisina Jayne, Antonios Papaleonidas, Mehmet Aydin: Springer, 2022: 316-328. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment