Restricted by the limited accessible data resources and the high cost of frame-by-frame labeling, fully supervised object detection is difficult to meet the needs of satellite video applications. In this paper, we propose a weakly-supervised satellite video detector based on salient feature fusion and boundary noise exploitation that enables moving ship detection without relying on object instance labeling. To mitigate pseudo-motion disturbances such as background movement, waves, and illumination changes, we first construct salient fusion features and then use Gaussian background modeling to generate high-quality pseudo-labels. To fully exploit the boundary information of noisy masks in pseudo label, we improve the Mask R-CNN by designing a noise-tolerant branch that fuses low-resolution features to mitigate the interference of inaccurate mask boundaries and guiding the network to learn boundary-related region features through boundary preserving mapping to enable better alignment of the predicted mask with the actual object. The experimental evaluation results show that the proposed method significantly outperforms other weakly supervised moving object detection methods and achieves comparable performance to fully supervised method.
|