MBNet: multi-scale bilinear convolutional neural networks for fine-grained visual classification towards real-time tasks
31 July 2019
Tingqiang Deng, Rui Li, Chunguo Li, Rutian Liao, Yang Liu, Zhe Yang, Luxi Yang
Proceedings Volume 11198, Fourth International Workshop on Pattern Recognition; 1119806 (2019) https://doi.org/10.1117/12.2540365
Event: Fourth International Workshop on Pattern Recognition, 2019, Nanjing, China
Abstract
Fine-grained visual classification (FGVC) is difficult because low-level features are under-utilized. This paper proposes MBNet, a real-time method based on a multi-stream, multi-scale cross bilinear CNN, to help address this problem. First, the features of each layer of the multi-stream CNN are extracted by a base network such as VGGNet; the multi-stream cross bilinear vector and the bottom bilinear vector are then computed from the low-level and high-level features, respectively. The FGVC results are predicted after feature fusion, which addresses the problem that small, low-level details in the original image are easily overlooked. On the widely used Caltech-UCSD Birds, Stanford Cars, and Aircraft datasets, the proposed method significantly improves accuracy over existing methods, reaching state-of-the-art levels of 88.51%, 94.73%, and 92.41%, respectively. It also meets the requirements of real-time tasks.
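The pipeline outlined in the abstract (bilinear vectors computed across feature levels, then fused) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `bilinear_vector`, `downsample`, and `fuse`, the average-pooling step used to match scales, the signed square root and L2 normalization, and the reading of the "bottom bilinear vector" as a self-bilinear vector of the low-level features are all assumptions made for illustration.

```python
import numpy as np


def bilinear_vector(feat_a, feat_b):
    """Bilinear pooling of two feature maps shaped (C_a, H, W) and (C_b, H, W)."""
    c_a, h, w = feat_a.shape
    c_b = feat_b.shape[0]
    a = feat_a.reshape(c_a, h * w)
    b = feat_b.reshape(c_b, h * w)
    # Outer products of channel vectors, averaged over all spatial positions.
    pooled = (a @ b.T) / (h * w)
    vec = pooled.reshape(-1)
    # Signed square root and L2 normalization (assumed, common in bilinear CNNs).
    vec = np.sign(vec) * np.sqrt(np.abs(vec))
    norm = np.linalg.norm(vec)
    return vec / norm if norm > 0 else vec


def downsample(feat, factor):
    """Average-pool a (C, H, W) map by an integer factor (H, W divisible by it)."""
    c, h, w = feat.shape
    blocks = feat.reshape(c, h // factor, factor, w // factor, factor)
    return blocks.mean(axis=(2, 4))


def fuse(low, high):
    """Concatenate a cross bilinear vector and a low-level self-bilinear vector.

    low:  (C_l, k*H, k*W) low-level features (hypothetical shapes)
    high: (C_h, H, W) high-level features
    """
    factor = low.shape[1] // high.shape[1]
    low_ds = downsample(low, factor)  # bring both maps to the same scale
    cross = bilinear_vector(low_ds, high)     # low-level x high-level
    bottom = bilinear_vector(low_ds, low_ds)  # low-level x low-level (assumed)
    return np.concatenate([cross, bottom])


rng = np.random.default_rng(0)
low = rng.standard_normal((4, 8, 8))   # stand-in for an early-layer feature map
high = rng.standard_normal((6, 4, 4))  # stand-in for a late-layer feature map
fused = fuse(low, high)
print(fused.shape)
```

In a full model, the fused vector would presumably feed a classifier, and the feature maps would come from a base network such as VGGNet rather than from random arrays.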
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Tingqiang Deng, Rui Li, Chunguo Li, Rutian Liao, Yang Liu, Zhe Yang, and Luxi Yang "MBNet: multi-scale bilinear convolutional neural networks for fine-grained visual classification towards real-time tasks", Proc. SPIE 11198, Fourth International Workshop on Pattern Recognition, 1119806 (31 July 2019); https://doi.org/10.1117/12.2540365
KEYWORDS
Visualization
Data modeling
Convolutional neural networks
Computer vision technology
Feature extraction
Lithium
Detection and tracking algorithms
