Paper
26 May 2023 Cross-modal deformable DETR for RGB-D object detection
Siyuan Qin, Zongqing Lu, An Wang, Nan Su
Author Affiliations +
Proceedings Volume 12700, International Conference on Electronic Information Engineering and Data Processing (EIEDP 2023); 1270010 (2023) https://doi.org/10.1117/12.2682571
Event: International Conference on Electronic Information Engineering and Data Processing (EIEDP 2023), 2023, Nanchang, China
Abstract
RGB-D object detection is a challenging task due to the demand of effectively processing of visible modality and depth modality features. However, pre-existing RGB-D object detection models have several deficiencies, including demand for hand-crafted settings, and insufficient ability of fusing cross-modal features. In this paper, we propose a novel Cross-modal RGB-D object detection model, based on Deformable DETR, named as CM-DETR. Our proposed model can effectively fuse multi-modal information, and don’t need hand-crafted settings resulted from prior information. Extensive experiments show that our model has achieved extraordinary improvement, which exceeds the baseline by more than 4.6% mAP on SUN-RGBD and 6.9% mAP on NYUDv2.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Siyuan Qin, Zongqing Lu, An Wang, and Nan Su "Cross-modal deformable DETR for RGB-D object detection", Proc. SPIE 12700, International Conference on Electronic Information Engineering and Data Processing (EIEDP 2023), 1270010 (26 May 2023); https://doi.org/10.1117/12.2682571
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Object detection

Deformation

Feature fusion

Transformers

Back to Top