Paper
3 February 2023 Self-supervised monocular depth estimation with coordinate attention
YuHong Chen, HongFei Yu, LaiDe Guo, Yang Cao
Author Affiliations +
Proceedings Volume 12511, Third International Conference on Computer Vision and Data Mining (ICCVDM 2022); 125113D (2023) https://doi.org/10.1117/12.2660728
Event: Third International Conference on Computer Vision and Data Mining (ICCVDM 2022), 2022, Hulun Buir, China
Abstract
Predicting depth information from a single image has recently become an important research topic in computer vision. In particular, the self-supervised strategy for learning the depth is more attractive because it is not necessary to label any ground truth information. Under the framework of self-supervised learning we propose a CA-depth network to improve the accuracy of a single image depth estimation. We added the attention mechanism to the monocular depth estimation network to address the issues of observable artifacts and inaccurate prediction geometry in monocular depth estimation images. The spatial position information in the high-dimensional feature map is used to pay attention to the essential features, and to weaken the artifact phenomenon in the depth prediction map. We used Resnet as the encoder to extract the input image's feature map, the coordinate attention mechanism to realize the optimal allocation of convolution feature map weight, and the decoding network structure to predict the depth. Experimental results on public datasets show that the depth prediction accuracy of the CA-depth network is higher than the state-of-art methods.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
YuHong Chen, HongFei Yu, LaiDe Guo, and Yang Cao "Self-supervised monocular depth estimation with coordinate attention", Proc. SPIE 12511, Third International Conference on Computer Vision and Data Mining (ICCVDM 2022), 125113D (3 February 2023); https://doi.org/10.1117/12.2660728
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Computer vision technology

Computer programming

Convolution

Machine vision

Image analysis

Signal attenuation

RGB color model

RELATED CONTENT

Feature attention network (FA Net) a deep learning based...
Proceedings of SPIE (October 12 2022)
Motion-aware deep video coding network
Proceedings of SPIE (April 21 2020)
ADA interpretative system for image algebra
Proceedings of SPIE (June 01 1992)
Introduction to Image Algebra Ada
Proceedings of SPIE (July 01 1991)
Connection Machine Vision Applications
Proceedings of SPIE (June 06 1987)

Back to Top