In-and-Out: a data augmentation technique for computer vision tasks
9 February 2022
Chenghao Li, Jing Zhang, Li Hu, Hao Zhao, Huilong Zhu, Maomao Shan
Abstract

This study addresses two problems: over-fitting during the training of deep convolutional neural network (CNN) models, and poor robustness when a trained model is applied in occluded environments. We propose a data augmentation method, In-and-Out. First, a dynamic local operation increases information variance while preserving the overall geometric structure of the training image; compared with global data augmentation methods, this effectively alleviates over-fitting and significantly improves the generalization ability of the model. Second, a dynamic information removal operation hides part of the image under a dynamic patch generated from multiple parameters. Compared with other information removal methods, this better simulates real-world occlusion and thus improves the robustness of the model across occlusion scenes. The method is simple to implement and can be integrated with most CNN-based computer vision tasks. Our extensive experiments show that it surpasses previous methods on the Canadian Institute for Advanced Research (CIFAR) dataset for image classification, the PASCAL Visual Object Classes (VOC) dataset for object detection, and the Cityscapes dataset for semantic segmentation. Our robustness experiments further show that the method is robust to occlusion in a variety of scenes.
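The two stages named in the abstract can be illustrated with a minimal sketch. The abstract does not specify which local operation is applied or which parameters generate the patch, so everything concrete here is an assumption made for illustration: the local operation is taken to be a horizontal flip of one randomly placed sub-region, the removal patch is a zero-filled rectangle with random size and position, and the function names (`local_augment`, `remove_patch`, `in_and_out_sketch`) are hypothetical. It is not the authors' implementation.

```python
import numpy as np


def local_augment(image, rng, patch_frac=0.3):
    """Assumed local operation: flip one random sub-region left to right.

    Only the chosen region changes, so the global layout of the image
    is preserved. patch_frac is an illustrative parameter.
    """
    h, w = image.shape[:2]
    ph, pw = max(1, int(h * patch_frac)), max(1, int(w * patch_frac))
    top = int(rng.integers(0, h - ph + 1))
    left = int(rng.integers(0, w - pw + 1))
    out = image.copy()
    region = image[top:top + ph, left:left + pw]
    out[top:top + ph, left:left + pw] = region[:, ::-1]
    return out


def remove_patch(image, rng, min_frac=0.1, max_frac=0.4, fill=0):
    """Assumed removal operation: hide a rectangle of random size and position.

    The patch height and width are drawn independently, so its aspect
    ratio varies from call to call.
    """
    h, w = image.shape[:2]
    ph = max(1, int(h * rng.uniform(min_frac, max_frac)))
    pw = max(1, int(w * rng.uniform(min_frac, max_frac)))
    top = int(rng.integers(0, h - ph + 1))
    left = int(rng.integers(0, w - pw + 1))
    out = image.copy()
    out[top:top + ph, left:left + pw] = fill
    return out


def in_and_out_sketch(image, rng):
    """Apply the assumed local operation, then the assumed removal."""
    return remove_patch(local_augment(image, rng), rng)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    img = rng.integers(0, 256, size=(32, 32, 3), dtype=np.uint8)
    aug = in_and_out_sketch(img, rng)
    print(aug.shape, aug.dtype)
```

Both functions return a copy and leave the input untouched, which makes them straightforward to call per sample inside a data-loading pipeline.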

© 2022 SPIE and IS&T 1017-9909/2022/$28.00
Chenghao Li, Jing Zhang, Li Hu, Hao Zhao, Huilong Zhu, and Maomao Shan "In-and-Out: a data augmentation technique for computer vision tasks," Journal of Electronic Imaging 31(1), 013023 (9 February 2022). https://doi.org/10.1117/1.JEI.31.1.013023
Received: 19 August 2021; Accepted: 13 January 2022; Published: 9 February 2022
KEYWORDS
Computer vision technology
Data modeling
Machine vision
Visual process modeling
Image classification
Image segmentation
Image enhancement